Next Article in Journal
Optimal Task Allocation Algorithm Based on Queueing Theory for Future Internet Application in Mobile Edge Computing Platform
Previous Article in Journal
A Multi-Resident Number Estimation Method for Smart Homes
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Integration of Scales and Cameras in Nondisruptive Electronic Beehive Monitoring: On the Within-Day Relationship of Hive Weight and Traffic in Honeybee (Apis mellifera) Colonies in Langstroth Hives in Tucson, Arizona, USA

1
Department of Computer Science, Utah State University, Logan, UT 84322, USA
2
Carl Hayden Bee Research Center, USDA-ARS, Tucson, AZ 85719, USA
*
Author to whom correspondence should be addressed.
Sensors 2022, 22(13), 4824; https://doi.org/10.3390/s22134824
Submission received: 17 May 2022 / Revised: 17 June 2022 / Accepted: 22 June 2022 / Published: 25 June 2022
(This article belongs to the Section Physical Sensors)

Abstract

:
The relationship between beehive weight and traffic is a fundamental open research problem for electronic beehive monitoring and digital apiculture, because weight and traffic affect many aspects of honeybee (Apis mellifera) colony dynamics. An investigation of this relationship was conducted with a nondisruptive two-sensor (scale and camera) system on the weight and video data collected on six Apis mellifera colonies in Langstroth hives at the USDA-ARS Carl Hayden Bee Research Center in Tucson, Arizona, USA, from 15 May to 15 August 2021. Three hives had positive and two hives had negative correlations between weight and traffic. In one hive, weight and traffic were uncorrelated. The strength of the correlation between weight and traffic was stronger for longer time intervals. The traffic spread and mean, when taken separately, did not affect the correlation between weight and traffic more significantly than the exact traffic counts from videos. Lateral traffic did not have a significant impact on weight.

1. Introduction

Hive weight is an important indicator of colony activity [1], and many amateur and commercial operations measure weight continuously to estimate colony food reserves and to gauge optimal honey harvesting times [2,3]. Weight changes are indicative of the forager loss and gain during the day and of pollination activity [4,5]. Traffic is another important factor affecting colony dynamics. Traffic at hive entrance may predict honey weight gain [6], and rapid traffic increases at hive entrance may be due to robbing and swarming events [7]. Continuous video traffic measurement is a nondisruptive, robust, and inexpensive method to estimate hive traffic levels [8]. Sensor-based methods of monitoring colonies have shown their effectiveness in estimating the effects of stressors (e.g., poor nutrition or agrochemical exposure) on colony foraging activity and thermoregulation that are difficult to detect using other means such as visual colony assessments by human beekeepers [9]. Electronic beehive monitoring researchers have used sensors to measure internal and external temperature, humidity, atmospheric pressure, wind direction and speed, rainfall, shortwave radiation, weight, and traffic. While many researchers have investigated the relationship between traffic and weather (e.g., [10,11,12]) or weight and weather (e.g., [9,13,14,15,16]), the literature on continuous beehive monitoring, with few notable exceptions (e.g., [6]), has a dearth of studies on the relationship of hive weight and traffic. This problem is fundamental, because hive weight and traffic affect many aspects of colony dynamics. Furthermore, hive weight is a function of several factors such as colony food collection and consumption, bee development and loss, moisture gain or loss due to nectar inflow, ambient humidity and bee respiration, water inflow and outflow, robbing and swarming, and external weather events. Some of these factors are associated with colony traffic, while others are not. For example, humid weather adds to hive weight, because moist wood (and many hives worldwide are made out of wood) is heavier than dry wood. Thus, the weight change due to humidity, especially at night when bees do not fly, is not associated with colony traffic. Thus, if we understand the relationship between hive weight and traffic, we can use both measurements more precisely to identify behavioral markers of Apis mellifera colonies and to improve data interpretation.
The study by Marceau et al. [6] is a rare attempt to shed light on the relationship between the traffic at hive entrance estimated with an electronic bee counter and hive weight on five Langstroth hives with Apis mellifera at an apiary of 22 Langstroth hives for 35 days in July and August 50 km west of Quebec city. The monitored period of each day was from 9:00 to 16:00. For the first four hives, traffic counts were recorded at 16:00 for the 9:00–16:00 period, and the weight difference for every 24 h period was logged at 9:00. The fifth hive was automatically monitored at 15 min intervals using data logging equipment from 9:00 to 16:00 for the mean daily traffic activity (bees/h), and the hive weight difference (kg) was recorded for the 24 h period at 9:00. The researchers proposed the quadratic model G A I N = B 0 + B 1 · A C T 2 , where GAIN estimates the hive honey gain (kg), ACT is the average bee activity between 9:00 and 16:00 (bees/h), and B 0 and B 1 are model coefficients. Marceau et al. reported that the honey gain varied from 28.7 to 58.4 kg and that the average bee activity for the 35 observation days varied from 19,403 bees/h for the least productive hive to 27,408 bees/h for the most productive hive. The four resulting models were very similar, with the best curve fitting obtained on the two most productive hives with R 2 = 0.88 and R 2 = 0.90 . The researchers concluded that the more active a colony was, the more honey it produced, and the minimum activity rate required to obtain a positive daily gain was 14,000 bees/h. When the daily average activity remained below 14,000 bees/h, the hive weight decreased. While the findings by Marceau et al. are significant, their investigation had several important limitations. First, hive weight can be only an approximate estimate of honey gain, because the latter is included in the former. Second, the directionality of bee motion was not taken into account. Specifically, traffic in the vicinity of the hive consists of incoming bees, outgoing bees, and laterally flying bees, which Marceau et al. did not take into account. Third, the researchers made no attempt to distinguish the weight associated with traffic and the weight not associated with it. Fourth, the researchers did not justify why traffic at hive entrance was estimated from 9:00 to 16:00. Research (e.g., [9]) shows that foragers start flying out as early as 5:00 and return to the hive as late 20:30 or even later. Fifth, the datasets described in the article do not appear to be publicly available for replication, standardization, and improvement.
Our investigation addresses the gap in the literature on the relationship of hive weight and colony traffic by investigating the within-day relationship between hive weight and traffic in the vicinity of a hive with a nondisruptive two-sensor (scale and camera) electronic beehive monitoring (EBM) system. We make the following contributions to the body of research on continuous hive monitoring. First, we formulate, prove, and experimentally validate a necessary condition for the within-day independence of weight and traffic on time periods from 1 h up to 6 h. Second, our experiments indicate that the correlation of weight and traffic becomes stronger on time periods longer than 1 h. Third, while the necessary condition for the independence was experimentally verified in our investigation, the executed χ 2 tests failed to verify the implied sufficiency condition for the within-day independence of weight and traffic for any tested time period from 1 h up to 6 h. Thus, the formulation of the within-day sufficiency conditions remains an open problem for electronic beehive monitoring and theoretical apiary science. Fourth, our experiments show that some hives had positive and some hives had negative correlations between weight and traffic. We offer several conjectures on possible causes that may warrant further investigation. Fifth, the computed correlation coefficients and the executed χ 2 tests showed that lateral traffic did not have a significant impact on weight change and may be omitted in within-day computational models that predict hive weight from traffic. Sixth, our experiments suggest that the traffic spreads and means, when taken separately, did not affect the correlation of weight and traffic more significantly than the exact traffic counts. Thus, exact traffic counts may suffice as traffic estimates. Finally, we made public our curated datasets of time-aligned weight and traffic measures from our field deployment at the USDA-ARS Carl Hayden Bee Research Center in Tucson, Arizona (AZ), USA, in May–August 2021. These datasets can be used as benchmarks for replication, standardization, and improvement.
Since EBM is a relatively recent branch of digital apiculture and does not yet have standard terminology, we conclude the introduction with several definitions. We use the terms bee and honeybee to refer to the Apis mellifera honeybee. We use the terms hive and beehive to refer to a standard Langstroth hive or a variant thereof with an Apis mellifera colony. We define the vicinity of a hive to be the cube-shaped space in front of the hive’s entrance with dimensions 3 m × 3 m × 3 m continuously monitored with a camera-computer unit. We use the term traffic to refer to all bee traffic in the vicinity of the hive. We use the term total traffic to refer to the bee traffic that includes incoming traffic (number of bees flying into the hive), outgoing traffic (number of bees flying out of the hive), and lateral traffic (number of bees flying parallel to the landing pad of the hive) over a given period of time. We use the term electronic beehive monitoring (EBM) to refer to the acquisition and analysis of digital data on the behavior of a managed bee colony through various sensors deployed in or around the hive. We use the adjectives nondisruptive and noninvasive with respect to EBM to describe the type of EBM that requires no structural modification of the hive and no deployment of active or passive sensors inside the hive or on individual bees. EBM solutions are nondisruptive insomuch as they do not disrupt any natural cycles of the monitored colonies and preserve the sacredness of the honeybee space. We note that, unlike other state-of-the-art EBM investigations (e.g., [5,16]) that rely on disruptive solutions (e.g., radio tags on bees or structural hive modifications), we used only nondisruptive methods in our study.
The remainder of our article is organized as follows. In Section 2, we detail the materials and methods of our investigation. In Section 3, we present our results. In Section 4, we discuss our results. In Section 5, we present our conclusions. Our supplementary materials include not only the datasets and additional tables and plots but also several short videos that illustrate important hardware and software aspects of our EBM system, which the readers may want to watch before proceeding to the remainder of the article. References to the figures and tables in the supplementary materials start with the prefix S (e.g., S54).

2. Materials and Methods

2.1. Data

The dataset was acquired during the deployment of 10 BeePi monitors (e.g., [17]) on Apis mellifera colonies in Langstroth hives at the USDA-ARS Carl Hayden Bee Research Center in Tucson, Arizona (AZ), USA (GPS coordinates: 32 · 13 18.274 N, 110 · 55 35.324 W) from 20 May to 15 August 2021. All colonies had Italian queens from two breeders: one in California and one in Hawaii. The queens were all painted (blue for breeder 1; green and yellow for breeder 2) to enable queen verification throughout the experiment. All queens were one year old. No hives swarmed during the monitored period. Each BeePi monitor was equipped with a Raspberry Pi 3 model B v1.2 computer coupled to a Raspberry Pi v2 8-megapixel camera. Timestamped 30-second mp4 25 frames per second videos were taken by each monitor every 15 min of the 3 m × 3 m × 3 m cube-shaped space in front of the hive on top of which the monitor was mounted. The videos were captured from 7:00 to 20:30 due to the poor visibility at the site apiary before 7:00 and after 20:30. The videos were saved on each monitor’s 5 TB USB storage device. All 10 hives were each placed on the 10 stainless steel electronic scales (Tekfa model B-2418 and Avery Weigh-Tronix model BSAO1824-200; max. capacity: 100 kg, precision: ±20 g; operating temperature: −30 °C to 70 °C) and linked to 16-bit dataloggers (Hobo UX120-006M External Channel data logger, Onset Computer Corporation, Bourne, MA). The hive weight was measured in kilograms (kg) and logged every 5 min, which was the default time period of the data logger. Four of the ten BeePi monitors were damaged during a severe storm in Tucson, AZ, in July 2021 and were fixed in early August 2021. However, due to this data acquisition gap, the data from these hives were not used in this investigation. We henceforth refer to the remaining six hives from which the weight and traffic data were collected by their IDs used in our logs: H17, H19, H41, H43, H47, and H53. Each of the 13,353 videos (see Table 1) collected from the six hives was processed by the BeePIV algorithm [8]. BeePIV converts video frames to particle motion frames, computes particle displacement vector fields, classifies individual displacement vectors as incoming, outgoing, and lateral, and uses vector counts (non-negative integers) to measure incoming, outgoing, and lateral bee traffic. Total traffic is estimated as the sum of the incoming, outgoing, and lateral measurements. The timestamps on weight and traffic measurements were used to time-align them into one CSV file for each hive. The final dataset consisted of six CSV files (one per hive) of time-aligned incoming (IN), outgoing (OUT), lateral (LAT), total (TOT) counts and weight measurements. Since weight measurements were logged every 5 min while traffic measurements were logged every 15 min, each weight measurement time-aligned with a traffic measurement was computed as the mean of the three weight measurements the middle of which had the same timestamp with the traffic measurements. Weight measurements were raw in that they included external impacts (e.g., someone puts a heavy object such as a brick or a super on top of the hive). Thus, when the weight rose or dropped abruptly by ≈20 kg and returned to the previous level within 30 min, which is physically impossible in a real beehive, the measurement was considered to reflect an external impact and was replaced with the mean of the neighbors before and after it. Table 1 summarizes the information on the CSV data files.

2.2. Hive Inspection and Treatment

All monitored hives had regular hive inspections carried out by the fifth author for the duration of the experiment. We counted frames of bees and mite drops and logged qualitative brood assessments. Frames of bees are counts of individual frames in a hive that are completely covered by bees on both sides. If only one side of a frame is completely covered with bees, then it is counted as 1/2 of a full frame. It should be noted that such measurements as 1/2, 1/4, 1/8 of a full frame are visual assessments by the beekeeper. Counts of frames of bees are an estimate of the overall health of a colony. Mite drops are counts of Varroa mites on a sticky board. A sticky board is a thin (≈2 cm thick) rectangular piece of corrugated plastic on which a thin film of Vaseline (or other adhesive substances such as plant oil) is placed with a paper towel. The sticky board was inserted into the screened bottom board underneath each monitored hive. As mites drop from bees in a colony, they stick to the board and can be visually counted by the beekeeper. Greater mite drops indicate higher levels of mite infestation, which may negatively impact the colony’s productivity. Brood assessments are the beekeeper’s qualitative assessments of the brood’s condition. We used the following qualitative labels in our brood assessments: straight, straight with punctured caps, spotty, PMS (parasitic mite syndrome), no brood, and chalk brood. Straight brood indicates a productive laying queen. Straight brood with punctured caps also indicates a productive queen with potential minor laying problems. Spotty brood shows that a queen lays eggs in isolated and typically disconnected cell regions, which may cause productivity problems or colony failure later on. PMS characterizes the brood with white larvae that appear chewed or sunken on the side of some cells. Chalk brood is caused by a fungus called Ascosphaera apis. Frames affected by chalk brood have white chunks of mummified brood that resemble small pieces of white chalk. This disease infects a hive through reproductive spores attached to pollen, robbing bees, or tools used in already infected hives. Hives infected with chalk brood often fail and present a danger to the other hives in the apiary due to bee drift. Apivar strips were applied to all monitored hives on 8 July 2021 to treat Varroa mites. We assessed the population strength of each colony by calculating the weight of the adult bee mass (bee mass) by subtracting from the total weight of the hive the combined weight of the woodenware, the electronics, and the frames without the bees [9]. The adult bee mass measurements were conducted twice on each hive at the beginning (June 2021) and at the end (August 2021) of the monitored period.

2.3. Random Variables and Correlations

We measured hive weight and traffic as two jointly observed random variables W (weight) and T (traffic). Our samples were
( W 1 , T 1 ) , , ( W n , T n ) ,
where W i and T i , 1 i n , are time-aligned weight and traffic measurements for a given hive (see Table 1 for specific values of n). We use the notation W t and T t to denote random variables whose values range over the values of W and T at time t. The correlation between two random variables is typically measured with Pearson’s, Spearman’s, and Kendall’s correlation coefficients [18,19,20], denoted as ρ P , ρ S , and ρ K , respectively. We tested the absence against the presence of correlation with the following hypotheses:
H ρ , 0 : ρ = 0 ; H ρ , 1 : ρ 0 .
where ρ = ρ P , ρ = ρ S , or ρ = ρ K . H ρ , 0 was rejected in favor of H ρ , 1 at p 0.05 . We use the notation ρ ^ P , ρ ^ S , and ρ ^ K to denote the computed estimates of ρ P , ρ S , and ρ K . Thus, we computed Pearson, a statistical measure of linear dependency between two variables, of n measurements ( W i , T i ) as
ρ ^ P = i = 1 n ( W i W ¯ ) ( T i T ¯ ) i = 1 n ( W i W ¯ ) 2 i = 1 n ( T i T ¯ ) 2 .
Spearman and Kendall measure a monotonic association between two random variables, and are more robust versions of Pearson, because they rely on ranks. The rank of an observation W i , R ( W i ) , is its position in a list of measurements for a random variable sorted in ascending order. For example, if there are four measurements W 1 = 6.3 , W 2 = 8 , W 3 = 2.5 , and W 4 = 7.4 , then the sorted list is ( W 3 = 2.5 , W 1 = 6.3 , W 4 = 7.4 , W 2 = 8 ) and the ranks are R ( W 1 ) = 2 , R ( W 2 ) = 4 , R ( W 3 ) = 1 , and R ( W 4 ) = 3 . We computed Spearman and Kendall of n time-aligned measurements ( W i , T i ) as
ρ ^ S = 1 n i = 1 n R ( W i ) R ( T i ) ( n + 1 ) 2 / 4 ( n 2 1 ) / 12 ; ρ ^ K = i = 1 n j > i n I { W i W j , T i T j } i = 1 n j > i n I { W i W j , T i T j } n ( n 1 ) / 2 ,
where the event { W i W j , T i T j } refers to the situation when the comparison signs between W i and W j and T i and T j are the same. Thus, if W i < W j , then T i < T j , and if W i > W j , then T i > T j . The event { W i W j , T i T j } refers to the situation when the comparison signs between W i and W j and T i and T j are different. Thus, if W i > W j , then T i < T j , and if W i < W j , then T i > T j .
We computed the correlation coefficient heat map of weight (W) and the five traffic types for each hive: IN, OUT, difference between IN and OUT (IN-OUT), sum of IN and OUT (IN+OUT), and total (TOT=IN+OUT+LAT). We used the autocorrelation function (ACF) to detect non-randomness in weight and traffic viewed as time series [21]. The ACF evaluates the similarity between a time series (i.e., the signal) and its copy with a shift, which is referred to as a lag. The ACF is a function of a lag. Let S 1 , S 2 , , S n be a time series of observations, then the ACF of the lag L is defined as the Pearson correlation ρ P between S i and S i + L as
ρ ^ P = i = 1 n L ( S i S 1 : ( n L ) ¯ ) ( S i + L S L : n ¯ ) i = 1 n ( S i S 1 : ( n L ) ¯ ) 2 i = 1 ( S i + L S L : n ¯ ) 2 ,
where S 1 : ( n L ) ¯ and S 1 : ( n L ) ¯ are the mean values of the series S 1 , S 2 , , S n L and S L , S 2 , , S n , respectively. Autocorrelation plots are used to visually assess the presence of trends and cycles in data. A trend is a pattern in a time series that does not repeat at least within the captured period. Cyclicity is a component that regularly repeats itself over time. If a time series has a trend, the ACF does not reach zero unless the lag is sufficiently long. If a time series contains a significant cycle, the autocorrelation plot typically shows spikes at multiples of lags equal to the period.

2.4. Weight and Traffic Changes

Since each BeePi monitor captured a 30-s video every 15 min from which the BeePIV algorithm extracted the integers IN, OUT, LAT, and TOT, we measured the change in traffic over a lag of 15 k minutes as
Δ k T t = i = 1 + ( t 1 ) k t k T i i = 1 + ( t 2 ) k ( t 1 ) k T i .
Thus, if k = 4 , then the lag is 1 h; if k = 8 , then the lag is 2 h, etc. The values of Δ k T t were logarithmically transformed to make the rate of change distribution closer to normal as they approximately follow a log-normal distribution (see Figures S1 and S2 in the Supplementary Materials). Since weight measurements were logged every 5 min, for every T i the time-aligned W i was computed as the mean of W i 1 , W i , and W i + 1 . The change in weight over a lag of 15 k minutes was computed as
Δ k W t = | W 1 + t k W 1 + ( t 1 ) k | ,
where the values of 1 + t k and 1 + ( t 1 ) k were chosen so that W 1 + t k and W 1 + ( t 1 ) k always belonged to the video monitoring period (7:00–20:30) of the same day. The variables Δ k W t were assumed to be independent and identically distributed for any k. Specifically, if k k , then Δ k W t and Δ k W t may not have identical distributions. However, if d and d are two different days, Δ k W t on day d is assumed to be independent of Δ k W t on day d , but the distribution of Δ k W t is assumed to be identical on d and d . The variables Δ k T t were analogously assumed to be independent and identically distributed. We denote the weight change over the lag 15 k as Δ k W and the traffic change over the same lag as Δ k T . The change in the variance ( σ 2 ) and the mean ( μ ) of T were computed as
Δ k σ 2 ( T ) t = | σ 2 T 1 + ( t 1 ) k , T 2 + ( t 1 ) k , . . . , T t k σ 2 T 1 + ( t 2 ) k , . . . , T ( t 1 ) k | ; Δ k μ ( T ) t = | μ T 1 + ( t 1 ) k , T 2 + ( t 1 ) k , . . . , T t k μ T 1 + ( t 2 ) k , . . . , T ( t 1 ) k | .
An example of calculating Δ k W , Δ k T , Δ k σ 2 ( T ) t , and Δ k μ ( T ) t is given in Appendix A.

2.5. Joint Probabilities

We defined the function
D ϵ ( Z t ) = I ϵ ( Δ k Z t ) = 0 , i f | Z 1 + t k Z 1 + ( t 1 ) k | < ϵ 1 , o t h e r w i s e ,
to be indicative of the event { Δ k Z t = | Z 1 + t k Z 1 + ( t 1 ) k | ϵ } . The function divides the values of Δ k Z t into two categories: 0 and 1. Thus, if ϵ W and ϵ T are two thresholds for the change in weight and traffic, respectively, then D ϵ W ( W t ) = 1 signifies the change in weight between times t 1 and t at or above ϵ W , while D ϵ T ( T t ) = 1 signifies the change in bee traffic between t 1 and t at or above ϵ T . If Z ^ i is the estimate of Z t at time i (i.e., the observed value), the probability of { D ϵ ( Z t ) = 1 } can be estimated as the average of the occurrences of this event in n trials as
P ( D ϵ ( Z t ) = 1 ) i = 1 n D ϵ ( Z ^ i ) n = P * ( D ϵ ( Z t ) = 1 ) .
A necessary condition of the independence between the random variables of X t , Y t , X t 1 , and Y t 1 can be formulated as follows and proved as a theorem (a proof is in Appendix A).
A Necessary Condition for Independence (NCI): If X t , X t 1 , Y t , Y t 1 are discrete independent random variables, then, for any ϵ X and ϵ Y ,
P ( D ϵ X ( X t ) = 1 , D ϵ Y ( Y t ) = 1 ) = P ( D ϵ X ( X t ) = 1 ) · P ( D ϵ Y ( Y t ) = 1 ) .
If ϵ X = ϵ Y = 0 and X t = W t , X t 1 = W t 1 , Y t = T t , Y t 1 = T t 1 , the following paradox occurs
1 1 P * ( D 0 ( W t ) = 1 | D 0 ( T t ) = 1 ) 2 P * ( D 0 ( W t ) = 1 ) .
Equation (11) is a paradox in the following sense. If ϵ W = ϵ T = 0 , both W t and T t change in most trials between times t 1 and t, which makes the left side of Equation (11) true. In other words,
1 1 P * ( D 0 ( W t ) = 1 | D 0 ( T t ) = 1 ) .
However, if no restrictions are placed on weight and traffic measurements through ϵ W and ϵ T , W t does not depend on T t , which makes the right side of Equation (11) true. In other words,
P * ( D 0 ( W t ) = 1 | D 0 ( T t ) = 1 ) 2 P * ( D 0 ( W t ) = 1 ) .
Thus, the thresholds for weight and traffic, ϵ W and ϵ T , must be above zero. Furthermore, the traffic threshold ϵ T must be further constrained as
ϵ T > 0 : P ( D ϵ T ( T t ) = 1 ) > 0 ,
because otherwise the joint probability of { D ϵ W ( W t ) = 1 } and { D ϵ T ( T t ) = 1 } is 0 whenever P ( D ϵ T ( T t ) = 1 ) is 0. To establish the feasible values of ϵ W and ϵ T , we used our datasets to compute the sets
Δ T = { Δ k T t : t = 1 , 2 , . . . , n / k } ; Δ W = { Δ k W t : t = 1 , 2 , . . . , n / k } ,
where n is the total number of records in the dataset (see Table 1) and k { 4 , 8 , 12 , 16 , 20 , 24 } . Thus, since all the lags are 15 k, we tested the lags from 15 × 4 min (1 h) up to 15 × 24 min (6 h). For each hive, we computed the suprema (i.e., the upper bounds) as
ϵ W * = max ϵ W Δ W ϵ W ; ϵ T * = max ϵ T Δ T ϵ T
for each lag (1 h, 2 h, 3 h, 4 h, 5 h, 6 h), and 15 traffic measurements: 5 types of traffic (i.e., IN, OUT, TOT, IN-OUT, IN+OUT), 5 variances (i.e., σ 2 (IN), σ 2 (OUT), σ 2 (TOT), σ 2 (IN-OUT), σ 2 (IN+OUT)), and 5 means (i.e., μ (IN), μ (OUT), μ (TOT), μ (IN-OUT), μ (IN+OUT)). We also tested the necessary condition for Independence (NCI) in Equation (10) on our dataset for each tuple of the three feasible values (i.e., k, 0 < ϵ W < ϵ W * , 0 < ϵ T < ϵ T * ) by computing for each hive the threshold θ such that
max θ { | P * ( D ϵ W ( W t ) = 1 , D ϵ T ( T t ) = 1 ) P * ( D ϵ W ( W t ) = 1 ) P * ( D ϵ T ( T t ) = 1 ) | θ } .
in order to discover the upper bound of the difference between the joint and marginal probabilities. The value of θ in Expression (16), to which we refer as m a x D , was computed for all 15 traffic measurements. We also computed the argmaxima of absolute differences between the joint probability (the left-hand side of Equation (10)) and the product of the marginal probabilities (the right-hand side of Equation (10)) for all hives.

2.6. X 2 Tests

The NCI implies a sufficiency criterion of the independence of X t , X t 1 , Y t , Y t 1 , which can be formulated as follows.
A Sufficiency Condition for Independence (SCI): Let X t , X t 1 , Y t , Y t 1 are discrete independent random variables. If, for any ϵ X and ϵ Y ,
P ( D ϵ X ( X t ) = 1 , D ϵ Y ( Y t ) = 1 ) = P ( D ϵ X ( X t ) = 1 ) · P ( D ϵ Y ( Y t ) = 1 ) ,
then X t , X t 1 , Y t , Y t 1 are independent.
Since we cannot prove this criterion as a theorem, because, as of now, it is unclear to us how to formulate general, entomological realistic assumptions on the distributions of the variables, we executed the χ 2 tests to estimate the independence of W t , W t 1 , T t , T t 1 on our dataset. For each sample ( W 1 , T 1 ) , , ( W n , T n ) , we introduced k grouping intervals Δ 1 , , Δ k for values Δ k W and m grouping intervals 1 , , m for Δ k T values. We used the following hypotheses in the χ 2 tests:
H χ 2 , 0 : p r c = p r · p c ; H χ 2 , 1 : p r c p r · p c ,
where p r c is the probability of an observation belonging to Δ r × c , p r is the probability of Δ k W Δ r , and p c is the probability of Δ k T c . We split the domain of Δ k W and Δ k T into sub-intervals with the same probability to ensure that the probability of Δ k W t falling into any sub-interval is equal for all intervals. Since the true distributions of W and T are unknown, we separated both domains into the intervals with the the same number of counts from our dataset, which is the standard approach in χ 2 tests [22]. The literature on the χ 2 tests has two recommendations, which we used for all lags in our χ 2 tests. The first recommendation, R E C 1 , is that each cell in the χ 2 cumulative table for ( W t , T t ) contain at least 5 observations on average, which is a standard requirement for the practicability of χ 2 tests [22]. The second recommendation, R E C 2 , is to split the domain of the investigated samples into a number of intervals m in accordance with the number of samples n. For example, if n is in [ 40 , 100 ) , then m [ 7 , 9 ] , if n is in [ 100 , 500 ) , then m [ 8 , 12 ] [23]. If R E C 2 is followed, the deviation of the histogram from the actual distribution density is minimal [24]. To estimate the impact of lags on the relationship between hive weight and traffic, we computed the H χ 2 , 0 rejection ratios for all hives and lags from 1 h up to 6 h in 1 h increments. We also computed the correlation coefficients and their p-values for all hives and the same lags between W t and T t to see if these two different statistical methods agree on the impact.

3. Results

Figure 1 shows the correlation heat map of Pearson, Spearman, Kendall for all monitored hives and five traffic types. Each cell of the heat map gives the computed coefficient value between IN, OUT, IN-OUT, IN+OUT, and TOT, on the one hand, and the weight of a given hive, on the other hand. Thus, in the bottom left cell (IN, H53), 0.527 is the Pearson coefficient between IN and W of hive H53. Table 2 gives the p-values for Pearson, Spearman, and Kendall for the monitored hives. Table 3 provides the suprema ϵ W * and ϵ T * for hive H17 computed by Equation (15). The suprema values for the other hives are given in the supplementary materials (see Tables S1–S5). For space considerations, we chose to present the results in terms of hive H17 as a representative of the group that included hives H17, H19, H41, H43, and H53, because these hives exhibited similar trends and patterns different from those of hive H47.
The autocorrelation plots for weight and total traffic (TOT) for hive H17 are given in Figure 2. The plots for the other hives are in the supplementary materials (Figures S3–S7).
The ACF plots indicate that both weight and traffic include trend components insomuch as the amplitudes gradually decrease as the lag increases. Figure 2a,c show that trends and cycles are present in the weight data. Figures S3–S7 in the supplementary materials indicate that the weight and traffic of the other hives also exhibit trends and cyclical patterns. Figure 2b reflects a cyclical pattern in TOT with a period of 54 points (i.e., the full number of records per day) with the ACF peaks corresponding to L 54 and L 108 .
The computation of Equation (16) on our dataset showed that if the difference between t and t 1 (i.e., the lag) was no longer than 6 h, then for any 0 < ϵ W < ϵ W * and 0 < ϵ T < ϵ T * , the difference between the joint probability P * ( D ϵ W ( W t ) = 1 , D ϵ T ( T t ) = 1 ) and the marginal probabilities P * ( D ϵ W ( W t ) = 1 ) · P * ( D ϵ T ( T t ) = 1 ) did not exceed 0.15. In other words,
| P * ( D ϵ W ( W t ) = 1 , D ϵ T ( T t ) = 1 ) P * ( D ϵ W ( W t ) = 1 ) · P * ( D ϵ T ( T t ) = 1 ) | 0.15 .
Figure 3 shows the graphs of the joint and marginal probabilities for lags of 1 h, 3 h, and 6 h and five traffic types for all values of 0 < ϵ W < ϵ W * and 0 < ϵ T < ϵ T * . Table 4, Tables S6 and S7 give the argmaxima of the absolute differences between the exact counts, variances, and means of incoming, outgoing, total, and lateral traffic, as well as the difference between incoming and outgoing traffic, the sum of incoming and outgoing traffic, and the corresponding value of ϵ W and ϵ T for hive H17.
Table 5, Tables S41 and S42 give the χ 2 statistics C for the exact traffic counts, their variances, means, and different lags, and their p-values for hive H17.
Table 6 gives the traffic types and lags for which H χ 2 , 0 was rejected at p 0.05 , which occurred in 142 out of 540 tested cases.
Table 7 presents the Pearson coefficients and the corresponding p-values for hive H17 for different lags. The analogous tables for the other hives are available in the supplementary materials (see Tables S58–S62).
Table 8, Table 9, Table 10, Table 11 and Table 12 show our counts of the frames of bees, mite drops, brood quality assessments, adult bee mass measurements, and the queen status inspections.

4. Discussion

Figure 1 shows that the correlation coefficients between weight and all traffic measurements, except for IN-OUT, are relatively close to each other. Hive H17 was the only hive with notable correlations between IN-OUT and weight for all three coefficients. Hive H53 showed strong correlations between weight and the four traffic measurements (IN, OUT, TOT, and IN+OUT). Hives H19 and H41 showed moderately positive correlations which were not as strong as those of hive H53. Hives H17 and H43 showed negative correlations, with H17 correlations being more negative than those of H43. H17 was the only hive for which the sign of the correlation coefficients between W t and IN-OUT were different from those of IN, OUT, TOT, and IN+OUT. All correlations of H47 were 0 , which suggests that measured traffic and weight for this hive were uncorrelated. Table 2 shows that H47 was the only hive for which H ρ , 0 was not rejected for any coefficient and any traffic type at p 0.05 . For all other hives, H ρ , 0 was rejected at p 0.05 . H47 was the only hive whose weight did not increase with time. The weight of H47 oscillated between 40 and 41.5 kg throughout the experiment (see Figure 4). This observation leads to a conjecture that weight and traffic may not be correlated in hives that are not gaining weight, which merits further investigation. There may exist a traffic threshold per given lag below which a hive does not gain weight. Marceau et al. [6] experimentally concluded that for a positive daily gain the traffic must be at least 14,000 bees/h, but did not elaborate on whether this was mean traffic or real traffic. Our assumption of the independence and identical distribution of the values of Δ k W t (and Δ k T t ) on different days is reasonable insomuch as it is hard to assume that the differences in weight (and traffic) taken on a specific time interval on one day are related to the differences in weight (and traffic) taken on the same time interval but on a different day. More field data are required to investigate the independence and identical distribution of Δ k W t and Δ k T t for different lags.
Table 3, Table 4, Tables S6 and S7 for hive H17, the analogous tables for the other hives in the supplementary materials, and the plots in Figure 3 give an experimental validation of the result in Inequality (18) in that the difference between the joint and marginal probabilities was relatively small. In other words, traffic W t and weight T t might have been necessarily (but not sufficiently!) independent in the monitored hives so long as the lag did not exceed 6 h. The joint probabilities and the marginal probability products in Table 4 and Tables S6–S22 diverge by no more than 15% across all 15 traffic measurements for all hives (See Inequality (18)). The maximum absolute difference occurred mostly on the median values for weight and traffic. The divergence as a function of ϵ W and ϵ T reached its maximum when ϵ W and ϵ T were such that P * ( D ϵ W ( W t ) = 1 ) P * ( D ϵ T ( T t ) = 1 ) 0.5 (see Tables S23–S40). As the tables with P * at the extrema in the supplementary materials show, this observation held for all hives. Inequality (18) suggests that the events { Δ k W t ϵ W } and { Δ k T t ϵ T } are independent for any ϵ W and ϵ T , for all 15 traffic measurements and all lags from 1 h up to 6 h. Consequently, the necessary condition of independence (NCI) of weight and traffic was experimentally verified on our dataset. Of course, this verification does not imply that weight and traffic were independent on the tested lags, because our χ 2 results failed to verify the implied sufficiency condition for independence (SCI).
The results in Table 5 and Tables S41–S57 in the supplementary materials, and Table 6 indicate that there was an association between W t and traffic measurements at p 0.05 for all hives. However, these results should be interpreted with caution because the χ 2 tests fail to distinguish H χ 2 , 0 from H χ 2 , 1 if the probabilities of the observations falling into the partitioning intervals are the same for both hypotheses. On our dataset, the longer the lag was, the smaller the number of records for that lag were recorded. More field deployments are needed to increase the number of records for longer lags. Several rejections of H χ 2 , 0 at smaller lags suggest that W and T may be correlated even on smaller lags. The hives whose weight and several traffic measures were related at lags of 1 h and 2 h (hives H17, H19, and H53) saw significant weight gains ( + 20 kg) for the entire monitored period of May–August 2021. This might indicate that the more frequently foragers leave the hive, the more weight the hive gains. It is unclear, however, whether the weight gain of a hive depends on the forager fly rate or on the efficiency of individual foragers (i.e., how much payload each forager brings back to the hive). Table 6 indicates that there is a relationship between traffic as measured by IN, TOT, IN+OUT, μ (OUT), μ (TOT), and μ (IN+OUT) and the weight for all hives.
The Pearson coefficients in Table 7 indicate that as the lag increased from 1 h to 6 h, the absolute value of Pearson increased, which means that on longer lags W t and T t became more correlated for hive H17 and the other hives (see the tables in the supplementary materials), except for H47. This observation, however, should not be interpreted as causality, because there may have been an underlying hidden variable (e.g., the declining health of the queen) that caused the two random variables to change in tandem. The p-values in Table 7 show that for hive H17, H ρ , 0 was rejected for all lags. Similar observations (i.e., stronger Pearson correlation between W t and T t on longer lags, and H ρ , 0 rejection for all lags) can be made on the analogous tables for all other hives, except for hive H47, in the supplementary materials (see Tables S58–S62). For H47, H ρ , 0 was not rejected for any lags. H41 and H47 were the only two hives that did not have significant gains in both weight (see Figure 4) and bee mass (see Table 11) at the end of the experiment. The weight of H41 increased only by 6 kg, and the weight of H47 essentially remained the same. However, H41 had positive within-day correlations between weight and traffic, while the same correlations in H47 were negative and close to zero. H47 had lower counts of frames of bees, with the counts falling from 6 on 27 May 2021 to 5 on 13 August 2021 (see Table 8). H41 was the only other hive where the frames of bees fell from six to four during the same period. For H47, the mite drop started at 1.7 (the lowest of all hives) at the beginning of the monitored period and rose to 3.7 in August, 2021 (see Table 9). Table 10 shows that H47 and H41 were infected with the PMS and had spotty brood patterns. Table 11 shows that H47 had the lowest bee mass in June, 2021, and the second lowest bee mass at the end of the experiment in August, 2021, which may explain why H47 had the lowest mite drop. In H41, a supersedure queen was present in the hive for about one week. This queen was removed on 8 June 2021 (see Table 12) and replaced by a new one year old Italian queen from the same breeder from the queen bank at the USDA ARS Center in Tucson, AZ. The new queen might have been more productive than the removed queen, which may explain why H41 had positive and H47 negative correlations between weight and traffic in Figure 1. The supersedure queen may have mated with Africanized feral drones in Tucson, AZ, which may have affected traffic patterns in H41. In H47, an Italian queen from breeder 1 was removed on 19 May 2021 (four days after the start of the experiment) due to lack of productivity and replaced with a one-year-old Italian queen from breeder 2 from the same queen bank. Thus, H47 was the only hive where a one-year-old Italian queen from one breeder was replaced by a one-year-old Italian queen from a different breeder, which may have been a contributing factor in the lack of correlation between hive weight and traffic in this hive. All other queens survived and no other hives were re-queened during the monitored period.
Table 13 and Table 14 summarize the H χ 2 , 0 rejection ratios for all lags. Table 13 takes into account the χ 2 results for all lags, while Table 14 gives the results for the lags satisfying both recommendations, R E C 1 and R E C 2 , for χ 2 tests at the end of Section 2.6. Table 13 shows that as the lag increased from 1 h to 6 h, the frequency of H χ 2 , 0 rejection also increased, indicating that for all hives W t and T t became more frequently associated as the lags became longer, which corroborates the result achieved with the Pearson coefficients in Table 7. The same tendency is observed in Table 14. Therefore, the observation that the strength of the correlation between W t and T t increased with the lag was confirmed by the correlation coefficients and χ 2 results. Additional field deployments will enable us to collect more data in order to investigate which lags maximize weight–traffic correlation and to find computational methods for determining the optimal duration of traffic observations for accurate weight prediction from traffic. Table 15 gives the H χ 2 , 0 rejection ratios for exact traffic measurements, traffic measurement variances, and traffic measurement means and indicates that the rejection ratios are basically the same for the three statistics, which suggests that traffic variance and traffic mean, when taken separately, do not affect the correlation between W t and T t more significantly than exact traffic counts. Thus, the latter may suffice in computational models that relate weight and traffic.
While the necessary condition for the independence (NCI) of weight and traffic was experimentally verified, the χ 2 tests failed to verify the implied sufficiency condition for independence (SCI) for any lag. A possible explanation is that D ϵ ( · ) in Equation (8) divides the values of Δ k W and Δ k T into two categories, while the χ 2 tests divide Δ k Z t into multiple (more than two) categories. The computed correlation coefficients and the executed χ 2 tests indicate that the relation between total traffic (TOT) and weight W t and the sum of the incoming and outgoing traffic (IN+OUT) and W t are basically the same, which suggests that lateral traffic did not have a significant impact on weight change and may be omitted in computational models that relate weight and traffic. Additional field deployments will allow us curate larger datasets to test the strength of the relation and, as opportunity arises, discover its statistical or mathematical nature.

5. Conclusions

Hive weight W t and traffic T t were more correlated on longer lags. The strength of the correlation increased with the lag when estimated with Pearson, Spearman, and Kendall. The χ 2 tests also showed that as the lag increased from 1 h to 6 h, the frequency of H χ 2 , 0 rejection increased, indicating that W t and T t became more related on longer lags for all hives. This conclusion may not be causal, because there may have been underlying hidden variables (e.g., the health of the queen) that caused W t and T t to change in tandem. Our autocorrelation analysis suggests that both weight and total traffic exhibit trends and cyclical patterns. A feasible reason why some hives had positive and some hives had negative correlations between W t and T t may lie in the genetic differences between the queen lines from two U.S. breeders of Italian queens, which may warrant further investigation in the future. A number of factors could contribute to the lack of correlation between hive weight and traffic in H47: the replacement of the original queen with a queen from a different breeder, insignificant gain in bee mass, spotty brood patterns, and PMS infection. The spread in traffic and average traffic, when taken separately, did not affect the correlation of W t and T t more significantly than the exact traffic counts from videos. The correlation coefficients and the χ 2 tests showed that the relation between total traffic (TOT) and weight W t and the sum of incoming and outgoing traffic (IN+OUT) and W t are basically the same. Thus, lateral traffic (LAT) did not have a significant impact on weight change and may be omitted in computational models that relate hive weight and traffic in the vicinity of the hive. We hope that our study and similar studies will eventually result in a methodology to separate the hive weight associated with traffic from the hive weight not associated with it, which, in turn, will lead to the construction of computational models that predict hive weight from hive traffic.

Supplementary Materials

The following supplementary materials are available at https://www.mdpi.com/article/10.3390/s22134824/s1: (1) the PDF with three video sets that illustrate how the BeePIV algorithm processes videos with different levels of bee traffic; Tables S1–S62 with additional hive-specific correlation coefficients and p-values; and Figures S1–S7 that illustrate additional hive-specific aspects of autocorrelation of weight and traffic; (2) six datasets of weight and traffic which we used for the reported research.

Author Contributions

Conceptualization, V.K., A.T., K.P. and W.M.; methodology, V.K., A.T., K.P. and W.M.; software, A.T., K.P. and V.K.; formal analysis, A.T.; investigation, V.K., A.T., K.P., W.M. and M.W.; resources, V.K. and W.M.; data curation, K.P., V.K., W.M. and M.W.; writing (original draft preparation), V.K. and A.T.; writing (review and editing), V.K., A.T., W.M. and K.P.; supervision, V.K.; project administration, V.K. and W.M.; funding acquisition, V.K. and W.M.; hardware design, K.P. and V.K.; hardware deployment, V.K., W.M. and M.W.; entomological analysis and hive inspection, W.M. and M.W. All authors have read and agreed to the published version of the manuscript.

Funding

The BeePi monitor video hardware and woodenware have been funded by three open science fundraisers by the first author [25,26,27]; the Apis mellifera packages and the weight measurement hardware have been funded by the USDA-ARS Carl Hayden Bee Research Center in Tucson, Arizona, USA.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Details regarding where data supporting reported results can be found are given in the supplementary materials.

Acknowledgments

The authors express their profound gratitude to Bharat Chatla, Sanket Korgaonkar, Tharun Tej Tammineni, Aliasgar Kutiyanawala, Tanwir Zaman, Ahmet Yildrim, Laasya Alavala, Chris Blay, Gregg Lind, Joel Devey, Laasya Laasya Alavala, Aditya Bhouraskar, and Chelsi Gupta for supporting their open science electronic beehive monitoring project.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
EBMElectronic beehive monitoring
USDA-ARSUnited States Department of Agriculture-Agricultural Research Service
TBTerabyte = 1024 gigabytes
USBUniversal serial bus
kg, gKilogram, gram
km, mKilometer, meter
hHour
° CDegree Celsius
MP4MPEG-4 Part 14: Motion Picture Experts Group software for real-time data streams
IDIdentification
PIVParticle image velocimetry
CSVComma-separated values
INIncoming bee traffic (bees flying toward the hive)
OUTOutgoing bee traffic (bees flying away from the hive)
LATLateral bee traffic (bees flying in parallel to the hive)
TOTTotal bee traffic in the vicinity of the hive
NCINecessary condition for independence
SCISufficiency condition for independence
RECRecommendation

Appendix A

Suppose that Table A1 is our dataset. If we want to measure the change in the exact counts with the lag of one hour, then the exact counts are as follows.
Δ 4 W 1 = | W 1 + 1 · 4 W 1 + ( 1 1 ) 4 | = | W 5 W 1 | = 5 2 = 3 ; Δ 4 I N 1 = | I N 1 + ( 1 1 ) 4 + I N 2 + ( 1 1 ) 4 + I N 3 + ( 1 1 ) 4 + I N 1 · 4 0 | = = | I N 1 + I N 2 + I N 3 + I N 4 | = 20 + 12 + 10 + 18 = 60 ; Δ 4 O U T 1 = | O U T 1 + O U T 2 + O U T 3 + O U T 4 0 | = 10 + 6 + 15 + 9 = 40 ; Δ 4 T O T 1 = | T O T 1 + T O T 2 + T O T 3 + T O T 4 0 | = 35 + 19 + 29 + 30 = 113 ; Δ 4 I N O U T 1 = | ( I N 1 O U T 1 ) + ( I N 2 O U T 2 ) + ( I N 3 O U T 3 ) + ( I N 4 O U T 4 ) 0 | = = ( 20 10 ) + ( 12 6 ) + ( 10 15 ) + ( 18 9 ) = 10 + 6 5 + 9 = 20 ; Δ 4 I N + O U T 1 = | ( I N 1 + O U T 1 ) + ( I N 2 + O U T 2 ) + ( I N 3 + O U T 3 ) + ( I N 4 + O U T 4 ) 0 | = ( 20 + 10 ) + ( 12 + 6 ) + ( 10 + 15 ) + ( 18 + 9 ) = 30 + 18 + 25 + 27 = 100 ; Δ 4 W 2 = | W 1 + 2 · 4 W 1 + ( 2 1 ) 4 | = | W 9 W 5 | = | 3 5 | = 2 ; Δ 4 I N 2 = | I N 1 + ( 2 1 ) 4 + I N 2 + ( 2 1 ) 4 + I N 3 + ( 2 1 ) 4 + I N 2 · 4 I N 1 + ( 1 1 ) 4 I N 2 + ( 1 1 ) 4 I N 3 + ( 1 1 ) 4 I N 1 · 4 | = | I N 5 + I N 6 + I N 7 + I N 8 I N 1 I N 2 I N 3 I N 4 | = = | 13 + 8 + 11 + 17 20 12 10 18 | = | 49 60 | = 11 ; Δ 4 O U T 2 = | O U T 5 + O U T 6 + O U T 7 + O U T 8 O U T 1 O U T 2 O U T 3 O U T 4 | = = | 8 + 7 + 5 + 10 10 6 15 9 | = | 30 40 | = 10 ; Δ 4 T O T 2 = | T O T 5 + T O T 6 + T O T 7 + T O T 8 T O T 1 T O T 2 T O T 3 T O T 4 | = = | 26 + 17 + 18 + 34 35 19 29 30 | = | 95 113 | = 18 ; Δ 4 I N O U T 2 = | ( I N 5 O U T 5 ) + ( I N 6 O U T 6 ) + ( I N 7 O U T 7 ) + ( I N 8 O U T 8 ) ( I N 1 O U T 1 ) ( I N 2 O U T 2 ) ( I N 3 O U T 3 ) ( I N 4 O U T 4 ) | = = | ( 13 8 ) + ( 8 7 ) + ( 11 5 ) + ( 17 10 ) ( 20 10 ) ( 12 6 ) ( 10 15 ) ( 18 9 ) | = | 5 + 1 + 6 + 7 10 6 + 5 9 | = | 1 | = 1 ; Δ 4 I N + O U T 2 = | ( I N 5 + O U T 5 ) + ( I N 6 + O U T 6 ) + ( I N 7 + O U T 7 ) + ( I N 8 + O U T 8 ) ( I N 1 + O U T 1 ) ( I N 2 + O U T 2 ) ( I N 3 + O U T 3 ) ( I N 4 + O U T 4 ) | = = | ( 13 + 8 ) + ( 8 + 7 ) + ( 11 + 5 ) + ( 17 + 10 ) ( 20 + 10 ) ( 12 + 6 ) ( 10 + 15 ) ( 18 + 9 ) | = | 21 + 15 + 16 + 27 30 18 25 27 | = = | 21 | = 21 .
Table A1. Example dataset with nine records of W—weight, IN—incoming traffic, OUT—outgoing traffic, LAT—lateral traffic, and TOT—total traffic; TOT = IN + OUT + LAT.
Table A1. Example dataset with nine records of W—weight, IN—incoming traffic, OUT—outgoing traffic, LAT—lateral traffic, and TOT—total traffic; TOT = IN + OUT + LAT.
TIME STEPWINOUTLATTOT
122010535
23126119
311015429
44189330
55138526
6287217
71115218
811710734
93314926
If we want to measure the change in the variation of measurements I N , O U T , T O T , I N O U T , and I N + O U T with the lag of one hour, then the variations are as follows.
Δ 4 σ 2 ( I N ) 1 = | σ 2 ( I N 1 , I N 2 , I N 3 , I N 4 ) 0 | = σ 2 ( 20 , 12 , 10 , 18 ) = 17 ; Δ 4 σ 2 ( O U T ) 1 = σ 2 ( O U T 1 , O U T 2 , O U T 3 , O U T 4 ) = σ 2 ( 10 , 6 , 15 , 9 ) = 10.5 ; Δ 4 σ 2 ( T O T ) 1 = σ 2 ( T O T 1 , T O T 2 , T O T 3 , T O T 4 ) = σ 2 ( 35 , 19 , 29 , 30 ) = 33.6875 ; Δ 4 σ 2 ( I N O U T ) 1 = σ 2 ( I N 1 O U T 1 , I N 2 O U T 2 , I N 3 O U T 3 , I N 4 O U T 4 ) = = σ 2 ( 20 10 , 12 6 , 10 15 , 18 9 ) = σ 2 ( 10 , 6 , 5 , 9 ) = 35.5 ; Δ 4 σ 2 ( I N + O U T ) 1 = σ 2 ( I N 1 + O U T 1 ) , I N 2 + O U T 2 , I N 3 + O U T 3 , I N 4 + O U T 4 ) = = σ 2 ( 30 , 18 , 25 , 27 ) = 19.5 ; Δ 4 σ 2 ( I N ) 2 = | σ 2 ( I N 5 , I N 6 , I N 7 , I N 8 ) σ 2 ( I N 1 , I N 2 , I N 3 , I N 4 ) | = = | σ 2 ( 13 , 8 , 11 , 17 ) σ 2 ( 20 , 12 , 10 , 18 ) | = | 10.6875 17 | = 6.3125 ; Δ 4 σ 2 ( O U T ) 2 = | σ 2 ( O U T 5 , O U T 6 , O U T 7 , O U T 8 ) σ 2 ( O U T 1 , O U T 2 , O U T 3 , O U T 4 ) | = = | σ 2 ( 8 , 7 , 5 , 10 ) σ 2 ( 10 , 6 , 15 , 9 ) | = 7.25 ; Δ 4 σ 2 ( T O T ) 2 = | σ 2 ( T O T 5 , T O T 6 , T O T 7 , T O T 8 ) σ 2 ( T O T 1 , T O T 2 , T O T 3 , T O T 4 ) | = = | σ 2 ( 26 , 17 , 18 , 34 ) σ 2 ( 35 , 19 , 29 , 30 ) | = 13.5 ; Δ 4 σ 2 ( I N O U T ) 2 = | σ 2 ( I N 5 O U T 5 , I N 6 O U T 6 , I N 7 O U T 7 , I N 8 O U T 8 ) σ 2 ( I N 1 O U T 1 , I N 2 O U T 2 , I N 3 O U T 3 , I N 4 O U T 4 ) | = = | σ 2 ( 5 , 1 , 6 , 7 ) σ 2 ( 10 , 6 , 5 , 9 ) | = 30.3125 ; Δ 4 σ 2 ( I N + O U T ) 2 = | σ 2 ( I N 5 + O U T 5 , I N 6 + O U T 6 , I N 7 + O U T 7 , I N 8 + O U T 8 ) σ 2 ( I N 1 + O U T 1 , I N 2 + O U T 2 , I N 3 + O U T 3 , I N 4 + O U T 4 ) | = = | σ 2 ( 21 , 15 , 16 , 27 ) σ 2 ( 30 , 18 , 25 , 27 ) | = 3.1875 .
The measure of the change in the mean of I N , O U T , T O T , I N O U T , and I N + O U T with the lag of one hour are as follows.
Δ 4 μ ( I N ) 1 = | μ ( I N 1 , I N 2 , I N 3 , I N 4 ) 0 | = μ ( 20 , 12 , 10 , 18 ) = 15 ; Δ 4 μ ( O U T ) 1 = μ ( O U T 1 , O U T 2 , O U T 3 , O U T 4 ) = μ ( 10 , 6 , 15 , 9 ) = 10 ; Δ 4 μ ( T O T ) 1 = μ ( T O T 1 , T O T 2 , T O T 3 , T O T 4 ) = μ ( 35 , 19 , 29 , 30 ) = 28.25 ; Δ 4 μ ( I N O U T ) 1 = μ ( I N 1 O U T 1 , I N 2 O U T 2 , I N 3 O U T 3 , I N 4 O U T 4 ) = = μ ( 20 10 , 12 6 , 10 15 , 18 9 ) = μ ( 10 , 6 , 5 , 9 ) = 5 ; Δ 4 μ ( I N + O U T ) 1 = μ ( I N 1 + O U T 1 ) , I N 2 + O U T 2 , I N 3 + O U T 3 , I N 4 + O U T 4 ) = = μ ( 30 , 18 , 25 , 27 ) = 25 ; Δ 4 μ ( I N ) 2 = | μ ( I N 5 , I N 6 , I N 7 , I N 8 ) μ ( I N 1 , I N 2 , I N 3 , I N 4 ) | = = | μ ( 13 , 8 , 11 , 17 ) μ ( 20 , 12 , 10 , 18 ) | = | 12.25 15 | = 2.75 ; Δ 4 μ ( O U T ) 2 = | μ ( O U T 5 , O U T 6 , O U T 7 , O U T 8 ) μ ( O U T 1 , O U T 2 , O U T 3 , O U T 4 ) | = = | μ ( 8 , 7 , 5 , 10 ) μ ( 10 , 6 , 15 , 9 ) | = | 7.5 10 | = 2.5 ; Δ 4 μ ( T O T ) 2 = | μ ( T O T 5 , T O T 6 , T O T 7 , T O T 8 ) μ ( T O T 1 , T O T 2 , T O T 3 , T O T 4 ) | = = | μ ( 26 , 17 , 18 , 34 ) μ ( 35 , 19 , 29 , 30 ) | = | 23.75 28.25 | = = 4.5 ; Δ 4 μ ( I N O U T ) 2 = | μ ( I N 5 O U T 5 , I N 6 O U T 6 , I N 7 O U T 7 , I N 8 O U T 8 ) μ ( I N 1 O U T 1 , I N 2 O U T 2 , I N 3 O U T 3 , I N 4 O U T 4 ) | = = | μ ( 5 , 1 , 6 , 7 ) μ ( 10 , 6 , 5 , 9 ) | = | 4.75 5 | = 0.25 ; Δ 4 μ ( I N + O U T ) 2 = | μ ( I N 5 + O U T 5 ) , I N 6 + O U T 6 , I N 7 + O U T 7 , I N 8 + O U T 8 ) μ ( I N 1 + O U T 1 ) , I N 2 + O U T 2 , I N 3 + O U T 3 , I N 4 + O U T 4 ) | = = | μ ( 21 , 15 , 16 , 27 ) μ ( 30 , 18 , 25 , 27 ) | = | 19.75 25 | = = 5.25 .
A Necessary Condition for Independence Theorem (NCIT): If X t , X t 1 , Y t , Y t 1 are discrete independent random variables, then, for any ϵ X and ϵ Y ,
P ( D ϵ X ( X t ) = 1 , D ϵ Y ( Y t ) = 1 ) = P ( D ϵ X ( X t ) = 1 ) · P ( D ϵ Y ( Y t ) = 1 ) .
Proof. 
Let us consider the probability P ( X t X t 1 = x , Y t Y t 1 = y ) , where x , y R are arbitrary numbers. Since X t , X t 1 , Y t , Y t 1 are discrete and independent, this probability can be written as
P ( X t X t 1 = x , Y t Y t 1 = y )
= i j P ( X t = x a i , X t 1 = a i , Y t = y b j , Y t 1 = b j ) = i j P ( X t = x a i ) P ( X t 1 = a i ) P ( Y t = y b j ) P ( Y t 1 = b j ) = i P ( X t = x a i ) P ( X t 1 = a i ) · j P ( Y t = y b j ) P ( Y t 1 = b j ) = i P ( X t = x a i ) P ( X t 1 = a i ) · P ( Y t Y t 1 = y ) = P ( X t X t 1 = x ) P ( Y t Y t 1 = y ) .
Thus, X t X t 1 and Y t Y t 1 are independent discrete variables, and, consequently, | X t X t 1 | and | Y t Y t 1 | are independent as well. Therefore, for any ϵ X and ϵ Y in R , we have
P ( | X t X t 1 | ϵ X , | Y t Y t 1 | ϵ Y ) = P ( | X t X t 1 | ϵ X , | Y t Y t 1 | ϵ Y ) ,
if and only if
P ( D ϵ X ( X t ) = 1 , D ϵ Y ( Y t ) = 1 ) = P ( D ϵ X ( X t ) = 1 ) · P ( D ϵ Y ( Y t ) = 1 ) .
Thus, ϵ X ( 0 , ϵ X * ) , ϵ Y ( 0 , ϵ Y * ) ,
P ( D ϵ X ( X t ) = 1 , D ϵ Y ( Y t ) = 1 ) = P ( D ϵ X ( X t ) = 1 ) · P ( D ϵ Y ( Y t ) = 1 ) .

References

  1. Meikle, W.; Holst, N. Application of continuous monitoring of honeybee colonies. Apidologie 2015, 46, 10–22. [Google Scholar] [CrossRef] [Green Version]
  2. Buchmann, S.L.; Thoenes, S.C. The electronic scale honey bee colony as a management and research tool. Bee Sci. 1990, 1, 40–47. [Google Scholar]
  3. Harbo, J.R. Worker-bee crowding affects brood production, honey production, and longevity of honey bees (Hymenoptera Apidae). J. Econ. Entomol. 1993, 86, 1672–1678. [Google Scholar] [CrossRef]
  4. Henry, M.; Beguin, M.; Requier, F.; Rollin, O.; Odoux, J.; Aupine, P.; Aptel, J.; Tchamitchianand, S.; Decourtye, A. A common pesticide decreases foraging success and survival in honey bees. Science 2012, 336, 348–350. [Google Scholar] [CrossRef] [PubMed]
  5. Klein, S.; Pasquaretta, C.; He, X.J.; Perry, C.; Sovik, E.; Devaud, J.M.; Barron, A.B. Honey bees increase their foraging performance and frequency of pollen trips through experience. Sci. Rep. 2018, 9, 6778. [Google Scholar] [CrossRef] [PubMed]
  6. Marceau, J.; Boily, R.; Perron, J.M. The relationship between hive productivity and honeybee flight activity. J. Apic. Res. 1991, 29, 28–34. [Google Scholar] [CrossRef]
  7. Meikle, W.G.; Holst, N.; Mercadier, G.; Derouane, F.; James, R.R. Using balances linked to dataloggers to monitor honey bee colonies. J. Apic. Res. 2006, 45, 39–41. [Google Scholar] [CrossRef]
  8. Kulyukin, V.; Mukherjee, S.; Minichiello, A.; Truscott, T. BeePIV: A method to measure apis mellifera traffic with particle image velocimetry in videos. Appl. Sci. 2021, 11, 2276. [Google Scholar] [CrossRef]
  9. Meikle, W.G.; Holst, N.; Colin, T.; Weiss, M.; Carroll, M.J.; McFederick, Q.S.; Barron, A.B. Using within-day hive weight changes to measure environmental effects on honey bee colonies. PLoS ONE 2018, 13, e0197589. [Google Scholar] [CrossRef] [PubMed]
  10. Burrill, R.M.; Dietz, A. The response of honey bees to variations in solar radiation and temperature. Apidologie 1981, 12, 319–328. [Google Scholar] [CrossRef]
  11. Devillers, J.; Dore, J.C.; Tisseur, M.; Cluzeau, S.; Maurin, G. Modelling the flight activity of Apis mellifera at the hive entrance. Comput. Electron. Agric. 2004, 42, 87–109. [Google Scholar] [CrossRef]
  12. Clarke, D.; Robert, D. Predictive modelling of honey bee foraging activity using local weather conditions. Apidologie 2018, 49, 386–396. [Google Scholar] [CrossRef] [Green Version]
  13. Meikle, W.G.; Weiss, M.; Stillwell, A. Monitoring colony phenology using within-day variability in continuous weight and temperature of honey bee hivest. Apidologie 2016, 47, 1–14. [Google Scholar] [CrossRef] [Green Version]
  14. Hennessy, G.; Harris, C.; Eaton, C.; Wright, P.; Jackson, E.; Goulson, D.; Ratnieks, F. Gone with the wind: Effects of wind on honey bee visit rate and foraging behaviour. Anim. Behav. 2020, 161, 23–31. [Google Scholar] [CrossRef]
  15. Jiang, J.; Wang, C.H.; Chen, C.H.; Liao, M.S.; Su, Y.L.; Chen, W.S.; Huang, C.P.; Yang, E.C.; Chuang, C.L. A WSN-based automatic monitoring system for the foraging behavior of honey bees and environmental factors of beehives. Comput. Electron. Agric. 2016, 123, 304–318. [Google Scholar] [CrossRef]
  16. Xujiang, H.; Tian, L.Q.; Wu, X.; Zeng, Z.J. RFID monitoring indicates honeybees work harder before a rainy day. Insect Sci. 2015, 23, 157–159. [Google Scholar]
  17. Mukherjee, S.; Kulyukin, V. Application of digital particle image velocimetry to insect motion: Measurement of incoming, outgoing, and lateral honeybee traffic. Appl. Sci. 2020, 10, 2042. [Google Scholar] [CrossRef] [Green Version]
  18. Pearson, K. Mathematical contributions to the theory of evolution. III. regression, heredity and panmixia. Philos. Trans. R. Soc. Lond. 1896, 187, 253–318. [Google Scholar]
  19. Spearman, C. The proof and measurement of association between two things. Am. J. Psychol. 1904, 15, 72–101. [Google Scholar] [CrossRef]
  20. Kendall, M.G. A new measure of rank correlation. Biometrika 1938, 30, 81–89. [Google Scholar] [CrossRef]
  21. Box, G.E.P.; Jenkins, G.M.; Reinsel, G.C. Time Series Analysis: Forecasting and Control, 3rd ed.; Prentice Hall: Englewood Cliff, NJ, USA, 1994. [Google Scholar]
  22. Moor, D.D.; McCabe, G.P. Introduction to the Practice of Statistics, 2nd ed.; W.H. Freeman and Company: New York, NY, USA, 1993. [Google Scholar]
  23. Burdun, G.D.; Markov, B.N. Basics of Metrology; Publishing House of Standards: Moscow, Russia, 1985. [Google Scholar]
  24. Chentsov, N.N. Statistical Decision Rules and Optimal Inferences; Nauka: Moscow, Russia, 1972. [Google Scholar]
  25. BeePi: A Multisensor Electronic Beehive Monitor. BeePi Open Science Crowdfunder 1. Available online: https://www.kickstarter.com/projects/970162847/beepi-a-multisensor-electronic-beehive-monitor (accessed on 16 May 2022).
  26. BeePi: Honeybees Meet AI: Stage 2. BeePi Open Science Crowdfunder 2. Available online: https://www.kickstarter.com/projects/beepihoneybeesmeetai/beepi-honeybees-meet-ai-stage-2 (accessed on 16 May 2022).
  27. BeePi: Honeybees Meet AI: Stage 3. BeePi Open Science Crowdfunder 3. Available online: https://www.kickstarter.com/projects/beepihoneybeesmeetai/beepi-honeybees-meet-ai-stage-3/ (accessed on 16 May 2022).
Figure 1. Correlation heat map of weight (W) and 5 types of traffic (T) on the x-axis for hives H17, H19, H41, H43, H47, and H53 on the y-axis; IN—incoming traffic; OUT—outgoing traffic; TOT—total traffic; IN-OUT—difference of IN and OUT; IN+OUT—sum of IN and OUT; TOT = IN + OUT + LAT (sum of IN, OUT, and LAT), where LAT—lateral traffic; IN, OUT, LAT, and TOT are non-negative integers.
Figure 1. Correlation heat map of weight (W) and 5 types of traffic (T) on the x-axis for hives H17, H19, H41, H43, H47, and H53 on the y-axis; IN—incoming traffic; OUT—outgoing traffic; TOT—total traffic; IN-OUT—difference of IN and OUT; IN+OUT—sum of IN and OUT; TOT = IN + OUT + LAT (sum of IN, OUT, and LAT), where LAT—lateral traffic; IN, OUT, LAT, and TOT are non-negative integers.
Sensors 22 04824 g001
Figure 2. Autocorrelation plots of weight (a), total traffic (TOT) (b), and change in weight over 1 h (c) for hive H17 given using a standard python method statsmodels.graphics.tsaplots; the lags for (a,b) change from 0 to 145, where value 145 approximately equals 2.5 periods as 54 is the full number of records per day; the lag for (c) runs from 0 to 50 (≈3.5 periods) with a period of 12 (i.e., the number of data points per day for k = 4 ); semi-transparent solid blue regions represent confidence intervals for ACF values.
Figure 2. Autocorrelation plots of weight (a), total traffic (TOT) (b), and change in weight over 1 h (c) for hive H17 given using a standard python method statsmodels.graphics.tsaplots; the lags for (a,b) change from 0 to 145, where value 145 approximately equals 2.5 periods as 54 is the full number of records per day; the lag for (c) runs from 0 to 50 (≈3.5 periods) with a period of 12 (i.e., the number of data points per day for k = 4 ); semi-transparent solid blue regions represent confidence intervals for ACF values.
Sensors 22 04824 g002
Figure 3. Joint probability values (green) and product of marginal probabilities (red) for incoming (IN), outgoing (OUT), total (TOT), difference of incoming and outgoing (IN-OUT), sum of incoming and outgoing (IN+OUT), and all values 0 < ϵ W < ϵ W * and 0 < ϵ T < ϵ T * ; IN, OUT, and TOT are non-negative integers.
Figure 3. Joint probability values (green) and product of marginal probabilities (red) for incoming (IN), outgoing (OUT), total (TOT), difference of incoming and outgoing (IN-OUT), sum of incoming and outgoing (IN+OUT), and all values 0 < ϵ W < ϵ W * and 0 < ϵ T < ϵ T * ; IN, OUT, and TOT are non-negative integers.
Sensors 22 04824 g003
Figure 4. Weight (kg) vs. time of hives H17, H19, H41, H43, H47, and H53; first (leftmost) and last (rightmost) x-labels depict first and last day of time-aligned weights and video records; additional marks on x-axis in between leftmost and rightmost labels denote periods when videos were not taken due to hardware failures (e.g., 13 June/4 July on x-axis of H53 plot means that in H53 videos were not taken from 13 June 2021 up to 4 July 2021); row (A): weight vs. time for hives with positive correlation coefficients; row (B): weight vs. time with negative correlation coefficients; in row (B) weight vs. time plot of H47 is in gray box; H47 is the only hive for which H ρ , 0 and H χ 2 , 0 were not rejected.
Figure 4. Weight (kg) vs. time of hives H17, H19, H41, H43, H47, and H53; first (leftmost) and last (rightmost) x-labels depict first and last day of time-aligned weights and video records; additional marks on x-axis in between leftmost and rightmost labels denote periods when videos were not taken due to hardware failures (e.g., 13 June/4 July on x-axis of H53 plot means that in H53 videos were not taken from 13 June 2021 up to 4 July 2021); row (A): weight vs. time for hives with positive correlation coefficients; row (B): weight vs. time with negative correlation coefficients; in row (B) weight vs. time plot of H47 is in gray box; H47 is the only hive for which H ρ , 0 and H χ 2 , 0 were not rejected.
Sensors 22 04824 g004
Table 1. A quantitative summary of the curated dataset; the number of records n for each hive specifies the number of time-aligned weight and traffic measurements.
Table 1. A quantitative summary of the curated dataset; the number of records n for each hive specifies the number of time-aligned weight and traffic measurements.
HiveNum Records (n)Num Recorded DaysMean Records per Day
H1718383651
H1930195654
H4118424046
H4316303054
H4725184655
H5325065645
Total13,35356≈50.83
Table 2. Pearson, Spearman, and Kendall p-values rounded to 3 decimal places for hives H17, H19, H41, H43, H47, and H53 and five traffic types: IN (incoming), OUT (outgoing), TOT (total), IN-OUT (difference of IN and OUT), and IN+OUT (sum of IN and OUT), where TOT = IN+OUT+LAT and LAT is lateral traffic; IN, OUT, LAT, and TOT are non-negative integers; p-values for H47 are bolded, because it was the only hive for which H ρ , 0 was not rejected at p 0.05 for any coefficient and any traffic type; HXX refers to H17, H19, H41, H43, and H53, because their p-values were identically 0.
Table 2. Pearson, Spearman, and Kendall p-values rounded to 3 decimal places for hives H17, H19, H41, H43, H47, and H53 and five traffic types: IN (incoming), OUT (outgoing), TOT (total), IN-OUT (difference of IN and OUT), and IN+OUT (sum of IN and OUT), where TOT = IN+OUT+LAT and LAT is lateral traffic; IN, OUT, LAT, and TOT are non-negative integers; p-values for H47 are bolded, because it was the only hive for which H ρ , 0 was not rejected at p 0.05 for any coefficient and any traffic type; HXX refers to H17, H19, H41, H43, and H53, because their p-values were identically 0.
Hive Pearson Spearman Kendall
INOUTTOTIN-OUTIN+OUTINOUTTOTIN-OUTIN+OUTINOUTTOTIN-OUTIN+OUT
H470.7240.7150.7210.8010.7190.6750.2840.6610.6290.6560.6610.2650.6510.6040.639
HXX000000000000000
Table 3. Suprema ϵ W * and ϵ T * computed according to Equation (15), for hive H17 and for 5 types of traffic, 5 variances, and 5 means; IN—incoming traffic; OUT—outgoing traffic; TOT—total traffic; LAT—lateral traffic; IN—OUT (difference between IN and OUT), IN + OUT (sum of IN and OUT); TOT = IN + OUT + LAT; W column gives ϵ W * for weight measurements and corresponding lags; IN, OUT, TOT, and LAT are non-negative integers; exact measurement columns give ϵ W * for exact measurements of IN, OUT, TOT, IN-OUT, IN+OUT and lags; variance columns give ϵ T * for σ 2 (X), where X ∈ {IN, OUT, TOT, IN-OUT, IN+OUT} and lags; mean columns give ϵ T * for μ (X), where where X ∈ {IN, OUT, TOT, IN-OUT, IN+OUT}.
Table 3. Suprema ϵ W * and ϵ T * computed according to Equation (15), for hive H17 and for 5 types of traffic, 5 variances, and 5 means; IN—incoming traffic; OUT—outgoing traffic; TOT—total traffic; LAT—lateral traffic; IN—OUT (difference between IN and OUT), IN + OUT (sum of IN and OUT); TOT = IN + OUT + LAT; W column gives ϵ W * for weight measurements and corresponding lags; IN, OUT, TOT, and LAT are non-negative integers; exact measurement columns give ϵ W * for exact measurements of IN, OUT, TOT, IN-OUT, IN+OUT and lags; variance columns give ϵ T * for σ 2 (X), where X ∈ {IN, OUT, TOT, IN-OUT, IN+OUT} and lags; mean columns give ϵ T * for μ (X), where where X ∈ {IN, OUT, TOT, IN-OUT, IN+OUT}.
Lag ExactMeasurement Variance Mean
(hours)WINOUTTOTIN-OUTIN+OUTINOUTTOTIN-OUTIN+OUTINOUTTOTIN-OUTIN+OUT
10.7547.8567.9688.7076.5858.60611.8511.94913.45910.89913.2616.5816.477.3215.1987.22
20.8078.4638.6599.3287.2239.24612.08312.05113.65210.413.456.586.3837.2495.1437.167
31.0568.7839.0139.687.4949.59711.87311.86513.44510.05113.2516.5286.2987.1955.017.113
41.3448.9889.1799.8657.6029.78111.81411.79213.379.77613.1856.4066.2157.0934.837.009
51.439.1299.32610.0087.7199.92611.76211.75313.3249.60813.146.3316.1337.0124.7236.93
61.4559.2549.44610.1297.79510.04811.67211.67313.2389.45513.0556.2686.0766.9514.6176.87
Table 4. The maxima and argmaxima of the absolute difference between the joint probability and the product of marginal probabilities of exact traffic counts and different lags for hive H17; each entry has format ( m a x D , ϵ W , ϵ T ); IN—incoming traffic, OUT—outgoing traffic; TOT—total traffic; IN-OUT—difference between IN and OUT; IN+OUT—sum of IN+OUT; IN, OUT, TOT, IN-OUT, and IN+OUT are integers.
Table 4. The maxima and argmaxima of the absolute difference between the joint probability and the product of marginal probabilities of exact traffic counts and different lags for hive H17; each entry has format ( m a x D , ϵ W , ϵ T ); IN—incoming traffic, OUT—outgoing traffic; TOT—total traffic; IN-OUT—difference between IN and OUT; IN+OUT—sum of IN+OUT; IN, OUT, TOT, IN-OUT, and IN+OUT are integers.
Lag maxD, ϵ W , ϵ T
(hours)INOUTTOTIN-OUTIN+OUT
1(0.072, 0.051, 5.416)(0.059, 0.059, 6.227)(0.068, 0.051, 6.339)(0.029, 0.077, 3.871)(0.066, 0.051, 6.382)
2(0.059, 0.181, 7.050)(0.051, 0.145, 7.098)(0.060, 0.145, 7.848)(0.038, 0.284, 4.727)(0.054, 0.145, 7.813)
3(0.051, 0.226, 6.701)(0.042, 0.214, 6.347)(0.046, 0.228, 7.496)(0.044, 0.054, 4.754)(0.044, 0.228, 7.419)
4(0.071, 0.602, 7.467)(0.065, 0.602, 7.505)(0.069, 0.602, 8.305)(0.045, 0.049, 5.182)(0.069, 0.602, 8.234)
5(0.055, 0.080, 8.027)(0.060, 0.080, 8.019)(0.058, 0.314, 8.971)(0.078, 0.574, 4.963)(0.060, 0.295, 8.895)
6(0.077, 0.181, 7.840)(0.070, 0.113, 7.946)(0.083, 0.181, 8.695)(0.053, 0.493, 6.435)(0.078, 0.181, 8.627)
Table 5. Chi-square statistics and p-values of exact traffic counts for different lags for hive H17; 3 p values smaller than 0.05 are bolded; IN—incoming traffic; OUT—outgoing traffic; TOT—total traffic.
Table 5. Chi-square statistics and p-values of exact traffic counts for different lags for hive H17; 3 p values smaller than 0.05 are bolded; IN—incoming traffic; OUT—outgoing traffic; TOT—total traffic.
Lag C and p-Value
(hours)INOUTTOTIN-OUTIN+OUT
1(88.523, 0.023)(77.522, 0.119)(90.778, 0.016)(51.015, 0.880)(94.585, 0.008)
2(35.668, 0.077)(27.817, 0.316)(36.710, 0.061)(30.741, 0.198)(33.929, 0.109)
3(16.438, 0.423)(14.751, 0.543)(16.985, 0.387)(26.175, 0.052)(15.488, 0.489)
4(24.382, 0.081)(22.313, 0.133)(24.379, 0.082)(15.655, 0.477)(20.985, 0.179)
5(7.203, 0.616)(12.325, 0.196)(7.551, 0.580)(10.224, 0.333)(7.551, 0.580)
6(10.964, 0.278)(7.732, 0.561)(9.093, 0.429)(13.680, 0.134)(8.386, 0.496)
Table 6. Lags for which H χ 2 , 0 was rejected for monitored hives; IN—incoming traffic; OUT—outgoing traffic; TOT—total traffic; σ 2 (X)—variance of X; μ (X)—mean of X; ⋄ means that H χ 2 , 0 was not rejected for any lag; bolded lags are the lags for which both recommendations at the end of Section 2.6 are satisfied.
Table 6. Lags for which H χ 2 , 0 was rejected for monitored hives; IN—incoming traffic; OUT—outgoing traffic; TOT—total traffic; σ 2 (X)—variance of X; μ (X)—mean of X; ⋄ means that H χ 2 , 0 was not rejected for any lag; bolded lags are the lags for which both recommendations at the end of Section 2.6 are satisfied.
Traffic MeasurementH17H19H41H43H47H53
IN15,65,654,54,5,6
OUT5,65,65,654,5,6
TOT15,65,65,654,5,6
IN-OUT5,665
IN+OUT15,65,6554,5,6
σ 2 (IN)1,2,3,63,563,4,5,62,4,5,6
σ 2 (OUT)1,2,3,622,4,5,6
σ 2 (TOT)1,3163,5,62,4,5,6
σ 2 (IN-OUT)3,4435,6
σ 2 (IN+OUT)1,2,363,64,5,6
μ (IN)665,64,5,6
μ (OUT)165,664,54,5,6
μ (TOT)1,264,61,65,64,5,6
μ (IN-OUT)3,64,5,64
μ (IN+OUT)1,26665,61,4,5,6
Table 7. Pearson coefficients and corresponding p-values of different types of traffic measurements and different lags for hive H17; each cell is a tuple ( c , p ) where c is Pearson and p is p-value; IN—incoming traffic; OUT—outgoing; TOT—total.
Table 7. Pearson coefficients and corresponding p-values of different types of traffic measurements and different lags for hive H17; each cell is a tuple ( c , p ) where c is Pearson and p is p-value; IN—incoming traffic; OUT—outgoing; TOT—total.
Lag (in hours)INOUTTOTIN-OUTIN+OUT
1(−0.288, 1.33 × 10 9 )(−0.362, 1.17 × 10 14 )(−0.324, 6.76 × 10 12 )(0.490, 3.92 × 10 27 )(−0.329, 3.06 × 10 12 )
2(−0.328, 1.98 × 10 6 )(−0.406, 2.15 × 10 9 )(−0.367, 8.61 × 10 8 )(0.539, 1.53 × 10 16 )(−0.372, 5.34 × 10 8 )
3(−0.367, 1.64 × 10 5 )(−0.445, 1.01 × 10 7 )(−0.406, 1.50 × 10 6 )(0.579, 4.22 × 10 13 )(−0.411, 1.09 × 10 6 )
4(−0.394, 6.03 × 10 5 )(−0.480, 5.82 × 10 7 )(−0.436, 7.10 × 10 6 )(0.627, 4.71 × 10 12 )(−0.442, 5.11 × 10 6 )
5(−0.399, 8.17 × 10 4 )(−0.483, 3.55 × 10 5 )(−0.440, 1.95 × 10 4 )(0.628, 1.24 × 10 8 )(−0.446, 1.54 × 10 4 )
6(−0.496, 4.17 × 10 5 )(−0.582, 7.16 × 10 7 )(−0.539, 6.07 × 10 6 )(0.716, 6.08 × 10 11 )(−0.545, 4.71 × 10 6 )
Table 8. Frames of bees in monitored hives; frames of bees is a visual count of frames completely covered with bees on both sides; all inspection dates in columns were in 2021; NA—not available.
Table 8. Frames of bees in monitored hives; frames of bees is a visual count of frames completely covered with bees on both sides; all inspection dates in columns were in 2021; NA—not available.
Hive27 May8 June30 July28 July13 August
H1789131818
H191211131418
H4166444
H437NA7815
H476NA435
H5310871015
Table 9. Mite drop measurements in monitored hives; mite drop is the mean number of mites per day on a sticky board under the hive; all periods in columns refer to 2021; the start and end of each period are in the month/day format; a new period started on 7 August 2021 after new Apivar strips were installed in monitored hives; the last row represents Apivar treatment periods; Apivar is a polymer strip used to treat Varroa mites.
Table 9. Mite drop measurements in monitored hives; mite drop is the mean number of mites per day on a sticky board under the hive; all periods in columns refer to 2021; the start and end of each period are in the month/day format; a new period started on 7 August 2021 after new Apivar strips were installed in monitored hives; the last row represents Apivar treatment periods; Apivar is a polymer strip used to treat Varroa mites.
Hive27 May–7 June7 June–6 July6 July–8 July8 July–16 August
H172.379.519.3
H1913264229.7
H413.72.57.54.7
H436.36118.3
H471.76.54.53.7
H531091710.3
Pre-treatmentMite treatmentPost-treatment
Table 10. Brood quality measurements in monitored hives; STR—straight; SPT—spotty; PMS—parasitic mite syndrome; NA—not available; colonies with PMS have white larvae that appear chewed or sunken on the side of the cell.
Table 10. Brood quality measurements in monitored hives; STR—straight; SPT—spotty; PMS—parasitic mite syndrome; NA—not available; colonies with PMS have white larvae that appear chewed or sunken on the side of the cell.
Hive27 May8 June30 July28 July13 August
H17STRSTRSTRSTRSTR
H19STRSTRSTRSTRSTR
H41SPTPMSSPTSTR/SPTSPT
H43NANASTRSTR/SPTSTR/SPT
H47NANASPT/PMSSPT/PMSSPT
H53STRSTRSPTSTRSPT
Table 11. Bee mass (kg) in monitored hives measured using the method in [9]; N/A—not applicable.
Table 11. Bee mass (kg) in monitored hives measured using the method in [9]; N/A—not applicable.
Hive8 June 202130 June 202113 August 2021
H172.32N/A4.42
H19N/A2.124.10
H410.86N/A1.32
H43N/A2.053.15
H47N/A0.791.45
H53N/A1.363.08
Table 12. Queen status inspections of monitored hives; all dates in columns were in 2021; blu—queen from breeder 1 colored blue; gr—queen from breeder 2 colored green; yel—queen from breeder 2 colored yellow; Q+—queen spotted; Q+?—queen not spotted but its presence is clear from eggs in cells; Q?—queen not spotted; SSQR—supersedure queen removed; QR—queen spotted and removed; N/A – not available.
Table 12. Queen status inspections of monitored hives; all dates in columns were in 2021; blu—queen from breeder 1 colored blue; gr—queen from breeder 2 colored green; yel—queen from breeder 2 colored yellow; Q+—queen spotted; Q+?—queen not spotted but its presence is clear from eggs in cells; Q?—queen not spotted; SSQR—supersedure queen removed; QR—queen spotted and removed; N/A – not available.
Hive19 May27 May8 June30 June28 July13 August
H17Q+bluQ+bluQ+bluQ+bluQ+?Q+blu
H19Q+yelQ+?Q+yelQ+yelQ+yelQ?
H41Q+yelQ+yelSSQRQ+yelQ+yelQ+yel
H43Q+bluQ?N/AQ+bluQ+bluQ?
H47QR+bluQ+yelN/AQ+yelQ+yelQ+yel
H53Q+grQ+grQ+grQ+grQ+grQ+gr
Table 13. Lags and H χ 2 , 0 rejection ratios for all monitored hives.
Table 13. Lags and H χ 2 , 0 rejection ratios for all monitored hives.
Lag (in Hours) H χ 2 , 0 -Rejected/Number of Tests
113/90 ≈ 14.4%
2  9/90 = 10.0%
311/90 ≈ 12.2%
420/90 ≈ 22.2%
540/90 ≈ 44.4%
649/90 ≈ 54.4%
Table 14. Lags and H χ 2 , 0 rejection ratios for which the domains of W t and T t were split into the numbers of intervals satisfying the recommendations R E C 1 and R E C 2 .
Table 14. Lags and H χ 2 , 0 rejection ratios for which the domains of W t and T t were split into the numbers of intervals satisfying the recommendations R E C 1 and R E C 2 .
Lag (in Hours) H χ 2 , 0 -Rejected/Number of Tests
113/90 ≈ 14.4%
2 4/45 ≈  8.9%
3  2/15 ≈ 13.3%
Table 15. H χ 2 , 0 rejection ratios for exact traffic measurements (i.e., counts), traffic measurement variances, and traffic measurement means.
Table 15. H χ 2 , 0 rejection ratios for exact traffic measurements (i.e., counts), traffic measurement variances, and traffic measurement means.
Traffic Measurements H χ 2 , 0 -Rejected/Number of Tests
Exact traffic measurements46/180 ≈ 25.6%
Traffic measurement variances ( σ 2 )50/180 ≈ 27.8%
Traffic measurement means ( μ )46/180 ≈ 25.6%
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Kulyukin, V.; Tkachenko, A.; Price, K.; Meikle, W.; Weiss, M. Integration of Scales and Cameras in Nondisruptive Electronic Beehive Monitoring: On the Within-Day Relationship of Hive Weight and Traffic in Honeybee (Apis mellifera) Colonies in Langstroth Hives in Tucson, Arizona, USA. Sensors 2022, 22, 4824. https://doi.org/10.3390/s22134824

AMA Style

Kulyukin V, Tkachenko A, Price K, Meikle W, Weiss M. Integration of Scales and Cameras in Nondisruptive Electronic Beehive Monitoring: On the Within-Day Relationship of Hive Weight and Traffic in Honeybee (Apis mellifera) Colonies in Langstroth Hives in Tucson, Arizona, USA. Sensors. 2022; 22(13):4824. https://doi.org/10.3390/s22134824

Chicago/Turabian Style

Kulyukin, Vladimir, Anastasiia Tkachenko, Kristoffer Price, William Meikle, and Milagra Weiss. 2022. "Integration of Scales and Cameras in Nondisruptive Electronic Beehive Monitoring: On the Within-Day Relationship of Hive Weight and Traffic in Honeybee (Apis mellifera) Colonies in Langstroth Hives in Tucson, Arizona, USA" Sensors 22, no. 13: 4824. https://doi.org/10.3390/s22134824

APA Style

Kulyukin, V., Tkachenko, A., Price, K., Meikle, W., & Weiss, M. (2022). Integration of Scales and Cameras in Nondisruptive Electronic Beehive Monitoring: On the Within-Day Relationship of Hive Weight and Traffic in Honeybee (Apis mellifera) Colonies in Langstroth Hives in Tucson, Arizona, USA. Sensors, 22(13), 4824. https://doi.org/10.3390/s22134824

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop