1. Introduction
A national animal health surveillance system (AHSS) has a number of important aims, among which timely detection of emerging diseases and monitoring of the health status of the animal population are most common. The system should give insight into the prevalence of endemic diseases and prevent major outbreaks of (exotic) diseases by allowing a timely reaction to anomalies [
1]. Surveillance is characterized by the attached intervention plan aiming to control disease, different from monitoring, where intervention is normally absent [
2]. In addition to the main objectives to maintain and improve the health status of the animal population, the risks of zoonotic disease transmission make AHSS also important for food safety and human health [
3]. In addition, the AHSS’s objective to prove disease-free status is essential for the international trade of animals and their products. A good working AHS, therefore, is not only important for animal and human health but also of economic importance [
4].
In April 2021, the new European Animal health law came into effect [
5] and replaced the Community Animal Health Policy, which included multiple legislations and guidelines. The Animal health law encompasses requirements for all EU countries to have an AHSS in place to prevent and control animal disease and detect emerging diseases [
6]. The legislation includes a list of diseases, including specifications on how EU countries have to act when one of these diseases is or becomes present in their country [
7]. Although prescribed criteria have to be met, countries can adapt the framework to best fit their current health status, farm industry and infrastructure [
5]. Therefore, countries have different surveillance systems for different diseases, and due to country differences (e.g., livestock species and trade), there is not a single best general AHSS for all countries [
8]. Systematic evaluations are needed to determine the functionality and quality of these AHSS.
AHSS usually has different passive and active components for data collection. In a passive approach, data comes from the field at the initiative of farmers, veterinarians and other stakeholders [
2]. Passive surveillance may result in the underrepresentation of specific groups or geographic regions. An active approach means that data collection is initiated by the AHSS. This offers the possibility to reduce the bias of underrepresentation mentioned above. Peyre et al. developed the EVA tool, a framework for the evaluation of AHSS [
4]. This framework includes a list of attributes that can be integrated into an AHSS evaluation. Representativeness and coverage are included in this attributes list and are defined as “the extent to which the features of the population of interest are reflected by the population included in the surveillance activity“ and “the proportion of the population that was included in the surveillance”. Features of representativeness may include herd size, production type, age, sex or geographical location and time of sampling. Evaluation of these attributes is an important step to investigate if groups or regions are underrepresented, increasing the risks of not detecting disease outbreaks in a timely manner.
The Netherlands is an important importing and exporting country with a dense animal and human population. International trade combined with these high densities results in an increased risk of outbreaks with major consequences, making a well-functioning AHSS important. In the Netherlands, the voluntary national AHSS system is commissioned by Royal GD (Gezondheidsdienst voor Dieren) [
9] by the livestock industry and the Ministry of Agriculture. The AHSS consists of surveillance components to actively (e.g., disease prevalence, trend analyses of census data) and passively (e.g., veterinary helpdesk, postmortem examination) collect data about the animal health status [
10]. These components aim to follow trends and detect introductions and outbreaks of unknown or exotic diseases. When one or a combination of component(s) results in suspicion of a notifiable disease, the veterinary authority (i.e., the Dutch Food Safety Authority) has to be contacted immediately to plan and start following-up investigations and interventions when applicable. Passive surveillance holds a potential risk of over or underrepresentation of specific groups or regions. Scientific evaluation of representativeness and geographical coverage of the data has not been performed previously. Because passive surveillance is an important component of disease control in many, if not all, countries, such evaluation is highly relevant to other countries.
The aim of this study was to evaluate the representativeness and coverage of passively collected data from the veterinary helpdesk and postmortem examination in the Dutch cattle and pig sector. Insight into the representativeness of passively collected data is important to find over and underrepresented groups and regions, which can cause bias in the data, potentially resulting in a wrong interpretation of the situation within a country. Poor representation of an AHSS can result in missing signals and, therefore, negatively influence the timeliness of disease detection with possible health risks for the animal population in the country itself but also for the populations in neighboring countries and trading partners.
2. Materials and Methods
2.1. Study Population
This study focused on the Dutch cattle and pig population. Although both species are important for Dutch animal production, structures of these sectors differ significantly. The cattle sector consists of more farms with smaller numbers of animals than the pig sector and is more equally distributed across the country. Service by veterinary practices in cattle is more locally organized compared to the pig sector.
The number of farm animals, their locations and their movements are recorded by the Dutch identification and registration system (I&R). This database was used to determine the number of cattle and pig farms each year. The total number of cattle was available from the national identification and registration system (I&R). Unlike cattle, pigs are not individually registered in I&R, and therefore, their total number was not available. The information was collected for each year from 2015 to 2019. Within this period, there were no notifiable disease outbreaks in the Netherlands in the cattle population or in the pig population.
2.2. Data Collection
Data consisted of information from two surveillance components of GD, the veterinary helpdesk and postmortem examination. The veterinary helpdesk can be contacted by veterinarians and farmers, either by phone or e-mail, with all sorts of questions related to animal health. The overall purpose of the system of the helpdesk is to identify early signs of emerging diseases. However, the reason for veterinarians and farmers to contact the helpdesk is often to obtain help with an animal health problem that they encounter. The employees of the helpdesk are trained veterinarians. The phone calls and e-mails are registered in a system from which the data used in this study were obtained. In 2019 the registration system changed. However, similar information was registered. Each week, the aggregated information is discussed and analyzed to timely detect anomalies and take follow-up actions when needed. The following information is registered and was available for this study; farm and veterinary practice ID and location, animal species (cattle, pig), animal type (e.g., milking cows, calves, sows), categorized reasons for contact (e.g., clinical signs, disease, management), anonymous contact (yes/no), suspicion of a notifiable disease (yes/no), new or emerging disease (yes/no), change in endemic situation (yes/no), date and how the contact took place (e.g., phone, e-mail).
Postmortem examination is another surveillance component at GD for early detection of emerging diseases as well as a diagnostic tool that provides information on cause of death to the farmer and veterinarian. This surveillance component is subsidized, and farmers thus pay a fairly low rate. When animals are registered for postmortem examination, GD will send a vehicle to collect the animals, after which necropsy will take place. The costs of transport and necropsy are independent of the distance of the farm to GD. The number of submitted animals and the results are stored in a database. For our study, we used date of submission, farm ID and location, animal species and the number of animals per submission.
Additional to the postmortem examination and helpdesk data, two smaller datasets were available: information on veterinary practices about the number of farms that they attend (each farm in the Netherlands has to be linked to a veterinary practice), the location of the practice and information of the total number of farms per year per postal code area.
2.3. Location Definition
The previously described data contained the location of the farms and veterinary practices. For privacy reasons, the data was anonymized, and locations were categorized by the first two numbers of the Dutch postal code system (PC2) (
Figure 1). This system is based on the human population density. Population-dense regions within one PC2, like cities, have a smaller total surface (km
2) compared to more rural areas, having more surface within one PC2 area. The use of this system divides the Netherlands into 90 areas.
2.4. Data Inclusion
A number of analyses were performed to obtain insight into the representativeness and coverage of the helpdesk and postmortem examination data. For each analysis, a different subset of the data was used. The helpdesk data contained several variables with missing values. The inclusion criteria for each analysis were based on the availability of critical variables for that analysis to be performed (i.e., Herd ID, Vet ID, registration of clinical signs). When those values were missing, the record was removed.
Table 1 shows the inclusion criteria per analysis.
2.5. Reported Clinical Signs
The dataset of the veterinary helpdesk included specifications on the clinical signs addressed in the phone calls or e-mails, although this was not registered for all records. As multiple clinical signs can be registered per contact, the function “grepl” of the R package “base” was used to subtract all different categories, after which the frequency per year was counted. In total, 22 and 53 problem categories were identified for cattle and pigs, respectively. These were further combined based on organ system leading to 14 categories for cattle and 15 for pigs (
Table A1). Bar and pie plots were made to compare the frequency of the different categories to see if there were any patterns in the type or frequency of reported clinical signs between years. The R-package “ggplot2” and “plotly” were used to make the figures. The R-version used for this study was 4.0.2.
2.6. Space-Time Analysis
The first analysis was a space-time cluster analysis performed in SaTScanTM (v9.6.1 64-bit) [
12]. This program can perform spatial cluster analysis to detect significant clustering of data over spatial areas within a specified timeframe [
13]. The program uses cylindric shapes in which the base covers the space component, and the height of the cylinder covers the time. The program determines the best-fitting cylinder, followed by the second best and so on.
SaTScanTM needs a case, population and geographic file to run the analysis. The postmortem examination data and helpdesk data for pigs and cattle were used to perform four space-time analyses, one for each dataset per species. Because location was essential for the analysis, records with missing values for PC2 were excluded. The case file contained the number of contacts or postmortem examinations, the population file, the total population per location, and the geographic file contained the longitude and latitude coordinates of the centroids of the PC2 areas.
Based on the three input files, the program determines the expected value, the likelihood and relative risk (
RR) for a cluster. Equation (1) shows how the expected value
E[c] was calculated with
C as the total number of cases in the population,
P as the total population and
p the population within i, where i can be a single region or cluster. The
RR can be calculated by Equation (2) where
c is the total number of cases in region i,
C is the total number of cases and
e is the expected value calculated by Equation (1).
SatScan uses a likelihood ratio test to determine if the null hypothesis (H0), the RR is equal in all areas, is true, or if the alternative hypothesis (HA) is true, there are areas with a different RR. Equation (3) shows how this was calculated by the program. L(z) is the likelihood for the cluster, L0 is the likelihood under H0, c is the number of cases in i, C is the total number of cases, E[c] is the expected number of cases, and E[T] is the expected number of cases in i over time. When the outcome is different from 1, H0 has to be rejected. In that case, there are clusters that have an increased or decreased RR. Although SatScan also gives non-significant clusters, only significant clusters (p < 0.01) are shown in the result section.
Table 2 shows the SatScan settings for the analysis that differed from the default settings of the program. We used space-time analysis to be able to look retrospectively in space and time simultaneously for high and low clusters with the default use of the Poisson distribution. This cluster setting was chosen because, for surveillance purposes, clusters with low RRs are interesting to show gaps in representativeness; however, clusters with high RRs show where there is overrepresentation, both of which can lead to wrong conclusions about disease status. One cluster could obtain a maximum of 15% of the population at risk to prevent single clusters covering a large part of the country. The chosen maximum of 15% made the clusters more precise and more informative in terms of representativeness. Because the 15% was an arbitrary choice, a sensitivity analysis was performed using a maximum of 20% and a maximum of 10%. Using a maximum of 10% resulted in a partitioning of larger clusters into several smaller clusters in the same areas. The opposite occurred using 20% as maximum, where large clusters became even larger. The largest difference in RR was found in one cluster for which the RR increased from 2.14 (max. cluster size 15% and 20%) to 3.64 (max. cluster size 10%). Although there were some changes, most differences were small and did not result in different conclusions. The minimum and maximum time periods of the clusters were set at 2 and 4 years, respectively. Clusters of only one year were prevented with this setting. Clusters existing for one year were not included, as the aim of the study was representativeness of an ongoing surveillance system. Clusters of one year could exist due to chance, as there are regions with only a few farms. Setting the maximum temporal cluster size beyond 4 years was analytically not possible.
2.7. GLMM Association Distance GD, Farm Density, DAP Size with Contact
Three associations were examined using GLMM. The first two associations were the distance of the farm to GD and the farm density compared to postmortem examination and helpdesk contact. Farm distance was expected not to be related to the number of submissions for postmortem examinations nor helpdesk contact because cost and availability were independent of the distance, but it was tested to confirm this hypothesis. The association was also examined for the helpdesk contact, although there was no association expected. The distance to GD was determined as the linear distance from the centroid of the PC2 area to the GD facility and divided by ten, resulting in a per 10 km unit. Farm density was also added to the model and was defined as the number of farms divided by the surface in km2 of the PC2 area. The farm density was divided into 5 categories, each including 20% of the data points. The first group had the lowest density and the last group the highest.
Four GLMM models were run to determine the effect of the distance to GD and the farm density on the outcome variables contact and postmortem examination in both pigs and cattle. In all models, the variable ‘year’ was forced into the models. Because of the skewness of the data, three models were run with a negative binomial distribution. For the model on ‘helpdesk contact’ for pigs, a Poisson distribution was used. The negative binomial model did not fit the number of contacts for pigs. In all models, an offset with the number of farms within the PC2 area, and a random intercept with the PC2, were used to account for differences in the numbers of farms and multiple measurements per PC2 area.
The third analysis focused on the frequency of contacts and the size of the veterinary practice. Size was defined as the total number of relations (farms) registered to the veterinary practice for the specified animal species. The hypothesis was that larger practices had more expertise and were, therefore, less likely to contact the helpdesk. In that case, information about animal health and possible outbreaks can be missed or detected later. The size of veterinary practices was categorized (
Table 3). The lowest categories represent veterinary practices assumed not to have pigs or cattle as core business. This was also the motivation to use category 2 as reference category in the models, instead of category 1, for both species. The highest category contained the top 5% biggest veterinary practices. The remaining veterinary practices were divided into three groups using the AIC for optimization. A subset of the helpdesk data was used with availability of veterinary practice ID as inclusion criteria.
Two models were run to determine the association between the size of the veterinary practice on contact. The dependent variable was the frequency of contact. The independent variables were veterinary practice size and year. The number of relations was added as an offset to the models. For both models, the negative binomial distribution was used due to skewness of the data. For all GLMM models using the negative binomial distribution, the “Mass” package of R was used.
4. Discussion
We studied the coverage and representativeness of passively collected data from the veterinary helpdesk and postmortem examinations. These are components in the Dutch cattle and pig health surveillance system aiming for the early detection of emerging diseases. The characteristics of the monitoring data were examined by descriptive statistics, spatial-temporal analyses and GLMM models. Coverage and representativeness can indicate the effectiveness of surveillance components and were defined as “the proportion of the population that was included in the surveillance” and “the extent to which the features of the population of interest are reflected by the population included in the surveillance activity”, respectively [
5].
The helpdesk registered 19,615 contacts for cattle over five years, on average covering 3.5% of the total cattle farms and 55.6% of the veterinary practices per year. For pigs, 6047 contacts were registered, covering on average 2.7% of the pig farms and 33.0% of the veterinary practices per year. Over the five years, 2.5% of the cattle veterinary practices with more than 15 relations and 2.2% of the pig veterinary practices with more than 10 relations had no contact with the helpdesk. There were 13,219 postmortem examinations for cattle and 12,300 postmortems for pigs, on average covering 4.5% and 10.2% of the farms per year, respectively. Overall, farmers further away from GD had more contacts and submitted more animals for postmortem examination. We saw the same result for farmers in the lowest farm-dense areas and cattle farmers located in lower farm-dense areas. We did not have an explanation for these results, and further research, such as interviews with farmers and veterinarians, is needed for clarification. Pig farmers located in the highest farm-dense areas had a lower rate of postmortem examination. The number of contacts of veterinary practices was associated with the number of clients. For cattle, veterinarians with more than 300 clients had relatively fewer contacts per client compared to the reference group with 16–80 clients. In contrast, veterinarians from practices with 31–100 clients with pigs had more contacts than the reference group with 11–30 clients.
The proportion of registered clinical signs for cattle was relatively stable, while for pigs, we saw an increase over the years. In both species, clinical signs related to known endemic health problems were most frequently reported. Clinical signs potentially related to emerging diseases (e.g., mortality, respiratory disease, fever) were reported as well. As the main aim of passive surveillance is the early detection of emerging or exotic diseases, it is important that non-specific clinical signs are reported. Late or no reporting of these signs can result in delayed detection of disease outbreaks. In 2019, we saw an increase in suspicions of notifiable disease and a decrease in possible zoonotic risk. Because these changes took place in both species in the same year and there were no national outbreaks known for these species, the change is most likely attributable to the new registration system that was introduced in 2019. “Miscellaneous” was the largest category for cattle. As the registration for cattle lacks a category for general advice (e.g., eradication programs, lab results), which was largest for pigs, this category was expected to contain similar topics.
We saw a decreasing trend in the number of contacts for cattle. The decreasing trend in absolute numbers of helpdesk contact for cattle may be explained by the decrease in the number of farms. In the Netherlands, there is a trend that the number of farms decreases while the remaining farms increase in herd size. In contrast with cattle, we saw an increase in the number of contacts for pigs, while in this sector, a decrease in farms is also observed. Nevertheless, the decrease for cattle and increase for pigs, cattle farmers and veterinarians still had a higher percentage of contacts than pig farmers and veterinarians. In contrast, for postmortem examinations, the proportion of farms submitting pigs was higher than for cattle. The discontinued farms seem to be randomly spread over the country. Thus, the spatial distribution of farms remained similar over years. Therefore, differences in spatial distribution nor decreasing herd numbers between years explained differences in coverage or representativeness.
In general, the clusters of the spatiotemporal analysis matched well with the farm density, but some specific clusters did not match. For cattle, such a cluster was located in the middle of the country and was underrepresented for contacts but overrepresented for postmortem examination. This can likely be explained by the distribution of different cattle farm types in the Netherlands. Veal farms are mostly concentrated in this specific area, whereas dairy farms are more equally spread across the country. The hypothesis is that veal farmers are more inclined to submit their calves for postmortem examination, while dairy farms are probably more selective or prefer to first contact the helpdesk before submitting cattle. In the midsouth of the country was a comparable situation for pigs. We do not have an explanation for this observed inconsistency. Although excluding records of the helpdesk with missing farm IDs or locations may have introduced bias, we expect this to be limited when the information was missing randomly and was not related to the variables of interest.
The number of contacts of veterinary practices was partly related to their number of clients. The veterinary practices for both species with the least clients had significantly fewer contacts per client compared to the reference, which was expected. It is hypothesized that these veterinary practices serve smallholder farms (e.g., private persons and petting farms). Therefore, this category was expected to be different from other categories. In cattle, the largest veterinary practices had significantly less contacts with the helpdesk compared to the reference. This group of veterinary practices has more specialized veterinarians resulting in less need for secondary advice and thus less contact with the helpdesk than the smaller veterinary practices. This does not necessarily decrease the likelihood of early detection of emerging diseases as long as they contact the helpdesk when unexplained changes in health status or suspicion of the notifiable disease occur. Otherwise, this will increase the risk of delayed detection of disease outbreaks. In pigs, the largest veterinary practices had no different contact rates, while the two groups of middle-sized practices had more contacts per client compared to the reference group. Overall, there were no indications that the size of the veterinary pig practices would influence the contact rate and thus impact the effectiveness of early detection of emerging diseases.
Although the cluster analysis gave insight into over- and underrepresented geographic clusters, these clusters are not necessarily a risk regarding the early detection of disease. The program SatScan uses the population mean to calculate the predicted value of the areas. As far as known, there are no thresholds for what a population mean should be for passive surveillance systems to detect disease timely. We hypothesized that the mean of the low clusters might be high enough already to detect an emerging disease outbreak if only the signals on emerging diseases are directly reported, and no other disease information is shared. Additionally, fewer contacts and postmortem examinations are expected when the animal population is healthy compared to a diseased population. Formulating a general threshold will therefore be very difficult. In addition to the absence of a threshold, defined clusters can be sensitive to small changes in areas with just a few farms. When one farmer called twice, this already caused a significant cluster. The opposite occurred when there were no contacts by these few farmers, which resulted in a low cluster. In-depth surveys and interviews are needed to further examine differences in the motivation of farmers and veterinarians.
In our study, only farm density was included as an explanatory variable in the model. The animal density was not available and, therefore, not taken into account. In general, farm and animal density will be strongly correlated. In addition, between farm spread of emerging diseases, the farm density is more important than the animal density [
14]. Farm density was categorized into five groups. Although this categorization did not always result in the best model based on the AIC compared to using farm density as a continuous variable, categorization was preferred for easier interpretation of the results. Furthermore, our conclusions would be similar. Veterinary practices were categorized by the number of relations (i.e., the number of clients). The smallest category was chosen by expert opinion, assuming this category only served smallholder farms. The other categories were chosen on the model’s AIC. In the Netherlands, there are increasingly large corporations and collaborations between veterinary practices. For this study, data on these corporations and collaborations was not available. Correcting for effect was, therefore, not possible.
Combining the spatial results, there were over- and underrepresented regions, but data (contact or animal submission) is coming from all regions. For pigs, contact with farmers in farm-dense areas could be improved. The same was true for the contacts of the largest veterinary practices for cattle. Although there is some over- and underrepresentation, it is not clear if this really affects the early detection capacity of passive surveillance. For early detection receiving the right signals is of more importance than the amount of contact, but it is difficult to define “right signals”. This will depend on the disease, the frequency of reporting within a certain timeframe and the difficulty of recognizing the disease. When the percentage of postmortem examinations of the total number of farms in the Netherlands (7.6% cattle, 35.1% pigs) was compared with other countries with a similar passive surveillance system, the Netherlands ranked in between Flanders (Belgium) (±8.5% cattle; ±55.1% pigs) and Scotland (±4.7% cattle, ±23.1% pigs) from 2015 to 2019, [
15,
16,
17,
18]. Both countries have comparable systems; although Scotland lacks a pick-up service for the carcasses, combined with the larger distances to the facilities, this can have influenced the percentages for postmortem examinations [
19]. Flanders has a much higher percentage of postmortem examinations in pigs. Most likely, this is related to the outbreak of African Swine Fever in wild boar in September 2018, which likely influenced the numbers in 2019 [
20].
In the Netherlands, history already showed the effectiveness of the surveillance as it detected the first signals of an outbreak of bluetongue virus (BTV) in August 2006 and of the Schmallenberg virus in 2011 [
21,
22]. Regarding BTV, this signal concerned contact with the helpdesk reporting 10 seriously diseased sheep, which were found positive for BTV after blood tests. At the beginning of the Schmallenberg outbreak, the helpdesk received an increased number of reports about production losses and diarrhea. As far as we know, only one comparable study has been conducted on the coverage and representativeness of passive surveillance in Scotland [
23]. Their study focused on the diagnostic submissions of pigs, cattle and sheep. Based on a histogram representing the proportion of submitting holdings and the distance to the closest surveillance center, they concluded that the distance to the closest Disease Surveillance Centre did influence the rate of submissions. In our study, we did not find a negative influence of distance on the submission rate. This difference may be caused by the existence of a collection service for carcasses in the Netherlands, while in Scotland, the farmer or vet has to arrange the transport themselves [
19,
24].