Method for Classifying Behavior of Livestock on Fenced Temperate Rangeland in Northern China

Gou, Xiaowei; Tsunekawa, Atsushi; Peng, Fei; Zhao, Xueyong; Li, Yulin; Lian, Jie

doi:10.3390/s19235334

Open AccessArticle

Method for Classifying Behavior of Livestock on Fenced Temperate Rangeland in Northern China

by

Xiaowei Gou

¹,

Atsushi Tsunekawa

²

,

Fei Peng

^3,4,*

,

Xueyong Zhao

⁵,

Yulin Li

⁵ and

Jie Lian

⁵

¹

The United Graduate School of Agricultural Sciences, Tottori University, 4-101 Koyama-Minami, Tottori 680-8553, Japan

²

Arid Land Research Center, Tottori University, 1390 Hamasaka, Tottori 680-0001, Japan

³

International Platform for Dryland Research and Education, Tottori University, 1390 Hamasaka, Tottori 680-0001, Japan

⁴

Key Laboratory of Desert and Desertification, Northwest Institute of Eco-Environment and Resources, Chinese Academy of Sciences, Lanzhou 73000, China

⁵

Naiman Desertification Research Station, Northwest Institute of Eco-Environment and Resources, Chinese Academy of Sciences, Tongliao 028300, China

^*

Author to whom correspondence should be addressed.

Sensors 2019, 19(23), 5334; https://doi.org/10.3390/s19235334

Submission received: 27 August 2019 / Revised: 5 November 2019 / Accepted: 29 November 2019 / Published: 3 December 2019

(This article belongs to the Section Remote Sensors)

Download

Browse Figures

Review Reports Versions Notes

Abstract

Different livestock behaviors have distinct effects on grassland degradation. However, because direct observation of livestock behavior is time- and labor-intensive, an automated methodology to classify livestock behavior according to animal position and posture is necessary. We applied the Random Forest algorithm to predict livestock behaviors in the Horqin Sand Land by using Global Positioning System (GPS) and tri-axis accelerometer data and then confirmed the results through field observations. The overall accuracy of GPS models was 85% to 90% when the time interval was greater than 300–800 s, which was approximated to the tri-axis model (96%) and GPS-tri models (96%). In the GPS model, the linear backward or forward distance were the most important determinants of behavior classification, and nongrazing was less than 30% when livestock travelled more than 30–50 m over a 5-min interval. For the tri-axis accelerometer model, the anteroposterior acceleration (–3 m/s²) of neck movement was the most accurate determinant of livestock behavior classification. Using instantaneous acceleration of livestock body movement more precisely classified livestock behaviors than did GPS location-based distance metrics. When a tri-axis model is unavailable, GPS models will yield sufficiently reliable classification accuracy when an appropriate time interval is defined.

Keywords:

livestock; behavior classification; GPS; accelerometer; Random Forest; Kappa coefficient; dryland

1. Introduction

Drylands cover more than 41% of the Earth’s land area, and desertification directly affects more than 250 million people [1]. Overgrazing is considered to be the primary cause of land degradation [2]. Previous studies examining overgrazing of rangeland generally used the number of livestock in a given area as the grazing intensity; this practice assumes that livestock foraging is spatially distributed evenly and that all livestock behaviors have the same influence on the rangeland [3]. However, the livestock always shows patchy and selective grazing even in homogenous rangeland to minimize their activity range and to maximize energy use efficiency [4]. In fact, vegetation typically shows a mosaic distribution, whether induced by abiotic factors, such as elevation and slope, or by selective grazing, which aggravates the overuse of some areas of the grassland [5].

The spatial distribution of different behavioral activities was critical for understanding the effects of grazing on ecosystem function, growth, reproduction and survival, how to make efficient use of resources [6], and mechanisms for coping with environmental conditions [6]. In the grazing areas, the vegetation was significantly reduced by the selective foraging of livestock. Moreover, concentrated grazing depletes the soil of nutrients [7], thus promoting further degradation of grassland [8], whereas light grazing can improve plant diversity by restraining inherent inter and intra-specific competition [9]. In comparison, nongrazing behaviors, including resting and walking, trample plants and compact the soil surface in overused areas, and the cumulative deposition of excreta alters various physical properties of soil, including soil bulk density, aggregate stability, aggregate size distribution, and surface microrelief. Recovering rangeland from degradation due to nongrazing behaviors is considered more difficult than remediating the effects of concentrated grazing [10].

Accurately classifying different behaviors of livestock is necessary to understand rangeland degradation and to devise effective interventions to restore the degraded land. One such method involves applying several statistical [11] and deep-learning [12] models to collected data from accelerometers for classifying livestock behaviors, which have been developed by using large data sets placed on animals in managed grassland [13,14]. These accelerometers measure the instantaneous and independent local movement of animals’ legs, heads, or bodies, thus ensuring high accuracy of behavior classification [15,16,17,18]. However, accelerometers cannot provide information regarding the location of the livestock, which is crucial for identifying the spatial distribution of animals and grassland management. Another method is to use Global Positioning System (GPS) data and machine-learning algorithms to classify livestock behaviors [19]. Using the location records, the GPS data-based method can project the spatial distribution of various behaviors, which is crucial for herd management and the prevention of rangeland degradation. However, GPS data-based methods require an optimal time interval, during which metrics such as linear distance (d), cumulative distance (d), and turning angle (t) are calculated to predict behaviors [12]. To build models for predicting livestock movement, the time intervals for metric calculation have previously been selected empirically [19,20]. The optimal time interval for GPS data-based methods varies with the ecosystem, livestock species, topography, and spatial distribution of available resources to evaluate [21].

The Horqin Sandy Land in northern China has been seriously degraded since the mid-1980s, and various restoration countermeasures (e.g., fencing) have been introduced to restore the degraded land [22]. In Horqin Sandy Land, the average area of the fenced rangeland per household is approximately 15–30 ha [23]. Fencing limits the space, and thus the forage, available to animals and consequently might aggravate mosaic grazing in areas; in addition, dense walking along the fence might lead to mosaic degradation. The objectives of our study were to develop a method for classifying livestock behavior by using location information and to define the optimal time interval for a GPS data-based model for fenced rangeland.

2. Materials and Methods

The study was conducted in a fenced household pasture, which is located in the southwestern part (42°55′N, 120°42′E; altitude, ~360 m) of Horqin Sandy Land, China. The climate is temperate, semi-arid, continental, and monsoonal. Average annual precipitation is 360 mm, with an annual mean temperature of 6.4 °C. The minimal and maximal monthly mean temperatures are −13.1 °C in January and 23.7 °C in July, respectively.

The pasture was grazed by Simmental cattle from 1 July through 1 October, 2018 (three months). During our study, the rangeland area was 20.1 ha, and herd size was 13 cattle. The stocking rate was calculated in terms of the common method [24], which the value was 0.51 Animal Unit Months per hectare. The total grazing time was approximately 3 months yearly due to the implement of ‘suspending grazing’ policy by the local government, which was for preventing grassland degradation. The availability of forage in our study area was about 53 g/m² in July and 243 g/m² in August for enclosure rangeland [25]. The vegetation was composed mainly of herbage belonging to arid grassland types (Pennisetum centrasiaticum, Cleistogenes squarrosa), with some dwarf shrubs (Artemisia oxycephala, Artemisia halodendron).

2.1. Equipment and Animals

All 13 cattle in the pastured herd were fitted with GPS devices (catalog no. GT-600, i-gotU, Mobile Action Technology, Taipei, Taiwan) and tri-axis accelerometers (catalog no. UA-004-64, Hobo model, Onset, Bourne, MA, USA). GPS devices were attached on the neck only, whereas tri-axis accelerometers were placed on the neck, one leg, and the tail of each animal. The GPS device recorded cattle location at 50 s intervals throughout two consecutive days, after which the GPS devices were removed, recharged, and re-attached to the cattle; this process continued throughout the 10-d study period. The three-dimensional accelerometers recorded the anterior–posterior, transverse, and superior-inferior acceleration of livestock movement. The batteries of the tri-axis devices were able to record acceleration at 50 s resolution throughout the 10-day study period without needing to be recharged.

2.2. Observation of Livestock Behaviors

Classification and criteria for animal behavior followed the method of Ganskopp and Bohnert [12]. In the experiment period, one observer observed one cattle at two days. According to our observation, a herd of cattle behaved similarly in a group. Thus, the observed behavior can represent the behavior of the cattle. In each day, the observer kept tracking one randomly selected cattle. The direct visual behavioral observation was recorded continuously by one observer following one cattle at approximately 20 meters away from the cattle in consecutive two days (23 and 24 September 2018). The observer held a timer which is synchronized with the time of the GPS. The field observation of behaviors started from 9:00 local time. The time interval of the GPS to record each location is 50 s. The GPS will flash when recording the location of the cattle. When the GPS flashes, the observer will read the timing from the timer and record the cattle behavior. If the cattle were foraging with head down when the GPS recording the location, it is considered as grazing behavior. If the cattle were standing still, chewing, or walking it is considered as nongrazing behavior. In total, 9 hours and 539 behaviors were recorded; approximately 80% of activities were grazing behaviors, and the remaining 20% was the nongrazing activity. Detailed information regarding the behavior classification is given in Table 1.

2.3. Movement Metrics Derived from GPS and Tri-Axis Accelerometer Data

Coordinates of GPS device were converted from latitude/longitude form to a Universal Transverse Mercator (UTM) format to facilitate metrics of distances and turning angle [20]. Metrics related to distances cattle moved and the turning angle were derived to classify the animal behaviors at the GPS-determined locations (Figure 1). In the first step, we calculated the basic two metrics over two recording positions (100 s), then we extended the time interval and recalculated the metrics from 100 to 800 s. The distance moved included the cumulative distance travelled and linear distances between focal locations. Distances that occurred temporally before a considered location are called backward distances, and those after a focal location are called forward distances. The linear distance

d_{(b 3, a 1)}

between b3 and a1 was calculated by Equation (1), and the

d_{(b 1, b 2)}

,

d_{(b 2, b 3)}

,

d_{(a 3, a 4)}

,

d_{(a 2, a 3)}

,

d_{(a 1, a 2)}

,

d_{1}

,

d_{2}

,

d_{3}

and

d_{4}

were used the same equation. The backward accumulative distance

d_{(b 3, a 1)}

and the forward accumulative distance

d_{(a 1, a 2)}

was the same as linear distance. For extending time intervals of GPS positions, the backward accumulative distance between a1 and b2 was the sum of

d_{(b 2, b 3)}

and

d_{(b 3, a 1)}

in Equation (2) and forward accumulative distance between a1 and a3 was the sum of

d_{(a 1, a 2)}

and

d_{(a 2, a 3)}

in Equation (3). For further processing of accumulative distance, the backward accumulative distance between a1 and b1 was the sum of

d_{(b 3, a 1)}

,

d_{(b 2, b 3)}

and

d_{(b 1, b 2)}

in Equation (4) and forward accumulative distance between a1 and a4 was the sum of

d_{(a 1, a 2)}

,

d_{(a 2, a 3)}

and

d_{(a 3, a 4)}

in Equation (5) (Figure 1). Calculation of distances metrics in other time intervals followed the same procedure. Metrics used and their meaning at time intervals of 100–800s were illustrated in Figure 2.

d 1 = \sqrt{{(b 3_{x} - a 1_{x})}^{2} + {(b 3_{y} - a 1_{y})}^{2}}

(1)

d_{(b 1, a 1)} = |d_{(b 2, b 3)}| + |d_{(b 3, a 1)}|

(2)

d_{(a 1, a 3)} = |d_{(b 2, b 3)}| + |d_{(b 3, a 1)}|

(3)

d_{(a 1, b 1)} = |d_{(b 3, a 1)}| + |d_{(b 2, b 3)}| + |d_{(b 1, b 2)}|

(4)

d_{(a 1, a 4)} = |d_{(a 1, a 2)}| + |d_{(a 2, a 3)}| + |d_{(a 3, a 4)}|

(5)

Metrics of tri-axis accelerometer were calculated at 50 s intervals across the dataset of cattle, including accelerations along three orthogonal axes (

\ddot{d_{x}}

,

\ddot{d_{y}}

, and

\ddot{d_{z}}

), which was defined as three dimensional Cartesian system in neck (

{\ddot{d}}_{x n e c k}

,

{\ddot{d}}_{y n e c k}

, and

{\ddot{d}}_{z n e c k}

), leg (

{\ddot{d}}_{x l e g}

,

{\ddot{d}}_{y l e g}

, and

{\ddot{d}}_{z l e g}

), and tail (

{\ddot{d}}_{x t a i l}

,

{\ddot{d}}_{y t a i l}

, and

{\ddot{d}}_{z t a i l}

).

\ddot{d_{x}}

is acceleration (m/s²) in the superiorinferior axis,

\ddot{d_{y}}

is acceleration (m/s²) in the anteroposterior axis and

\ddot{d_{z}}

is acceleration (m/s²) in transverse axis; Magnitude of acceleration in the neck (

M_{n e c k}

)was calculated by Equation (6) and

M_{l e g}

and

M_{t a i l}

were calucalated by the same equation; (

S D_{x}

) standard deviation of the

\ddot{d_{x}}

were standard deviation of

\ddot{d_{x}}

at neck, leg, and tail calculated by Equation (7). The calculation of

S D_{y}

and

S D_{z}

used the same equation,

\bar{\ddot{d_{x}}}

is average of

\ddot{d_{x}}

at the neck, leg, and tail in the x-direction at the same time;

M_{n e c k} = \sqrt{{({\ddot{d}}_{x n e c k})}^{2} + {({\ddot{d}}_{y n e c k})}^{2} + {({\ddot{d}}_{z n e c k})}^{2}}

(6)

S D_{x} = \sqrt{\frac{\sum {(\ddot{d_{x}} - \bar{\ddot{d_{x}}})}^{2}}{n}}

(7)

The raw acceleration is divided into static and dynamic acceleration. The static acceleration for a focal point is average of 7 accelerations at 2.5 min. before (3 accelerations) and 2.5 min. after (3 accelerations). The dynamic acceleration was the difference between the instantaneous acceleration and the running-mean derived static acceleration [26]. Overall dynamic body acceleration (ODBA) at the neck, leg, or tail was the sum of absolute value of dynamic acceleration at x, y, z at the neck, leg, and tail [27]. For example, the ODBA at neck was calculated by Equation (8) where

A_{X n e c k}

,

A_{Y n e c k}

,

A_{Z n e c k}

are the dynamic acceleration at

{\ddot{d}}_{x n e c k}

,

{\ddot{d}}_{y n e c k}

, and

{\ddot{d}}_{z n e c k}

at the neck.

A_{X n e c k}

,

A_{Y n e c k}

, and

A_{Z n e c k}

were calculated by Equation (9). The ODBA in neck (

O D B A_{h e a d}

) was the sum of the absolute values of the dynamic accelerations from all three axes by Equation (8) and the

O D B A_{n e c k}

and

O D B A_{t a i l}

used the same equation. The calculation of ODBA for leg and tail was the same as for neck.

O D B A_{n e c k} = |A_{X n e c k}| + |A_{Y n e c k}| + |A_{Z n e c k}|

(8)

A_{X n e c k} = {\ddot{d}}_{x n e c k} - {\bar{\ddot{d}}}_{x n e c k}

(9)

Using the various metrics derived at intervals of 100–800 s, we built three types of models: one using GPS data-based metrics only (GPS model); another from the tri-axis accelerometer data only (tri-axis model); and a model combining the tri-axis accelerometer and GPS data-based metrics (GPS-tri model).

2.4. Livestock Behavior Modelling

The Random Forest algorithm classification model was used to categorize livestock behavior, with movement metrics as dependent variables and observed behaviors as independent variables [20]. Random Forest is a machine-learning algorithm that especially suits data sets with many dependent variables. Random Forest provides well-supported predictions from large numbers of dependent variables and has the ability to identify the important variables of the model [28]. The modelling process of Random Forest can be summarized as consisting of many decision trees [29]:

Construct bootstrap data set (bag data set) from approximate 2/3 of the original data set; the remaining 1/3 of the data set is recognized as ‘out of bag’ (OOB).
Randomly select several predictor variables to calculate nodes in the bootstrap dataset.
At each decision tree node, test a random subset of predictor variables, to partition the bootstrap data into increasingly homogeneous subsets. The node-splitting variable selected from the variable subset is that which results in the greatest increase in data purity (Gini) before and after the tree node split.
The trees are fully grown, and each tree is used to predict OOB data, compute accuracy, and average error rates over all predictions.
The predictions are calculated by means of the majority vote of OOB predictions of the tree, and all predictions are averaged together to determine the class for the observation.

Three training parameters need to be defined in the Random Forest algorithm; these parameters then determine the model prediction power:

Our analysis is carried out with the caret package in R Studio (R Development Core Team 2011) by using the Random Forest, caret, and plotmo packages. When building Random Forest models within this package there are two main user-controlled parameters: the number of variables to try at each node (the ‘mtry’ argument), and the number of trees in the forest (the ‘ntree’ argument). We used the train() function from the caret package to get an optimal combination of ‘mtry’ and ‘ntree’. The train() function was run for 10 (‘mtry’ from 1 to 10) times. To determine the optimal number of trees for our data, the approach was to create many ‘caret’ models for our algorithm and pass in a different value of ‘ntree’ while holding ‘mtry’ constant at the default value above. We tested models with varying numbers of trees as a function of tree number of tress approaches a flat line between 500 and 2000 trees.

Mean decrease in Gini is used to determine the importance of variables in the classification model; this parameter is based on the Gini impurity index used for the calculation of splits during training [20]. When a tree is built, the decision regarding which variable to split at each node uses the Gini parameter. For each variable, the sum of the Gini decrease across every tree of the forest is accumulated every time that variable is chosen to split a node. The sum is divided by the number of trees in the forest to give the mean decrease in Gini.

2.5. Performance of the Random Forest Classifier

The performance of Random Forest classification models was evaluated by using two indices: overall accuracy and the κ coefficient [30]. Overall accuracy represents the proportion of the total number of correctly classified observations. The κ coefficient, which considers the agreement occurring by chance, is a statistical measure of inter-rater agreement for categorical items [30].

To evaluate the performance of the Random Forest model, we used 10-fold (i.e., performed 5 times) cross-validation to separate the data set into different, smaller data sets as training data sets and testing data sets. This process enabled us to more precisely control the number of samples compared with the inherent bootstrap sample in the Random Forest model [31].

3. Results

3.1. Performance of GPS, Tri-Axis, and GPS-Tri Axis Models

Overall classification accuracy increased as the time interval increased: 84.4%, 84.5%, 86.44%, and 87.6% at time intervals of 100, 150, 200, and 250 s. For all GPS models, accuracy began to plateau around 0.89–0.91, when the time interval was greater than 300–800 s. For both the GPS-tri and tri-axis models, overall classification accuracy was approximately 96% at all time intervals (Figure 2).

Compared with the relatively small change in overall classification accuracy with different time intervals, the κ coefficient for GPS models increased dramatically from 7% to 42% as the time interval increased from 100 to 250 s. The κ coefficient stabilized at 57% to 65% when the time interval exceeded 300 s (Figure 2). The GPS-tri and tri-axis models yielded approximately the same κ coefficient (91% to 92%, 92%) at all time intervals (Figure 3).

3.2. Cross-Validation

For GPS models with time intervals of 100 to 800 s, the accuracy for grazing behavior was 92% to 98%, whereas the accuracy for nongrazing behavior increased from 20% to 47% as the time interval increased from 100 to 250 s and from 58% to 66% with time intervals of 300–800 s (Table 2). The performances of tri-axis were showed accuracy for grazing behaviors (98%) and nongrazing (92%) (Table 3).

3.3. Relative Importance of Variables

The first four metrics in order of importance (as indicated by the mean decrease in Gini) for the GPS model with time intervals from 100 to 800 s are shown in Figure 3 and Figure S1. In most of the models, either linear or accumulated distance, rather than turning angle, was the important metric in the modelling. The time lag until the important distance metric occurred increased with the time interval from 100 to 800 s (Figure 4). Among all of the important metrics at different time intervals, d19 (the backward linear distance at a time interval of 300 s) and d43 (backward linear distance at a time interval of 350 s) were the most frequently used metrics in the classification of livestock behaviors. The variable d19 was the most important for the GPS models when the time interval was 300–600 s, and d43 was most important for time intervals from 350 to 700 s.

In the tri-axis model, the variable

{\ddot{d}}_{y n e c k}

(acceleration of anterior–posterior movement in the neck) had the highest mean decrease in Gini, and

M_{t a i l}

(square root mean of the sum of acceleration in the neck, leg, and tail) the second largest. The mean decrease in Gini gradually declined from

{\ddot{d}}_{y l e g}

(acceleration of anterior–posterior movement in the foot) to

{\ddot{d}}_{x l e g}

(acceleration of superior-inferior movement in the foot) but then dramatically decreased from

{\ddot{d}}_{x l e g}

to

{\ddot{d}}_{z n e c k}

(acceleration of transverse movement in the neck) (Figure 5).

3.4. Marginal Effect of the Variable on Livestock Behavior Classification

We used partial dependence plots to show the marginal effect of the metrics used in the behavior classification. For all GPS models, we generated partial dependence plots for the first four most important variables determined according to the mean decrease in Gini (Figure 2).

Although d19 and d43 had important roles in behavior modeling, the marginal probability of classifying a behavior as nongrazing decreased as the time interval increased. The probability of nongrazing showed a sharp decrease when d19 and d43 were greater than approximately 35–50 m. In the GPS model at the 300 s time interval, the marginal probability to classify a behavior as nongrazing was around 0.4 when d19, d18 (the backward linear distance at a time interval of 250 s), d17 (the backward linear distance at a time interval of 200 s), and d20 (the backward accumulative distance at a time interval of 200 s) were less than 35–50 m (Figure 6A), thus accounting for more than 80% of the total behavior in this range of distance (Figure 6B). The utility power of these four distances in classifying a behavior as nongrazing gradually decreased and then stabilized around 0.22 when they were greater than 50 m (Figure 6A).

In the tri-axis model, when

{\ddot{d}}_{y n e c k}

was less than −3 m/s², the behavior was never classified as nongrazing, whereas the probability of a behavior being classified as nongrazing was around 0.8 when

{\ddot{d}}_{y n e c k}

was greater than −3 m/s². For the variable

M_{t a i l}

, the probability of a behavior being classified as nongrazing was 0.5 when

M_{t a i l}

was 0 m/s² and dropped dramatically to 0.3 when

M_{t a i l}

was 7 m/s². The behavior being classified as nongrazing was 0.3 when

{\ddot{d}}_{y l e g}

was from −20 to 0 m/s², dropped to 0.22 when

{\ddot{d}}_{y l e g}

was 8 m/s², increased to 0.25 when

{\ddot{d}}_{y l e g}

was more than 11 m/s². By using

{\ddot{d}}_{x l e g}

, the highest marginal probability of determining a behavior as nongrazing was 0.31 and dropped to 0 when

{\ddot{d}}_{x l e g}

was 11 m/s² (Figure 7).

4. Discussion

4.1. Optimal Time Interval for GPS Models

GPS location data can be used to infer latent states of behavior from within individual movement trajectories [19]. The duration to complete a specific behavioral activity depends on the type of livestock and the condition of the pasture [6]. Distance and turning angle metrics extracted from GPS data over specific time intervals can be used to classify livestock behaviors, such as 1 min for beef cows on desert grassland [6], 3 min for Brown Swiss cows in a cow shed [11], and 5 min (i.e., 300 s) for dairy cows on upland grassland [19]. In our study, the optimal time interval for behavior classification was approximately 300 s because the κ coefficient at this time interval was higher than for shorter time intervals and was nearly stable afterward (Figure 3). In addition, the most frequently used metric (d19) was the backward linear distance at the 300 s time interval (Figure 4).

Although overall accuracy did not vary over time intervals from 100 to 800 s, it may be a poor measure for assessing model performance, given that overall accuracy can happen just due to coincidence, especially when the data are imbalanced [6]. In contrast, the κ coefficient, which estimates accuracy beyond expectation, can correctly assess the accuracy of imbalanced data [32]. For imbalanced data, the observed and predicted accuracies and their agreement in regard to minor behaviors determine the κ coefficient. In reality, foraging occurs more often than other behaviors. During the cross-validation, given that the accuracies for grazing behavior were relatively high and stable, the critical determinants of the κ coefficient were the accuracies for nongrazing behaviors. For the GPS models, the low accuracies of the nongrazing behaviors during cross-validation (Table 2) explain the low κ coefficients for the time intervals from 100 to 250 s (Figure 3). At time intervals of 300 s and greater, the κ coefficient stabilized around 0.5–0.6 because of the increase in the accuracies of nongrazing behavior (Table 2). In addition, the d19 (backward linear distance at 300 s) was the most frequent metric in other models when the time interval was greater than 300 s (Figure 4). Therefore, the optimal time interval for using the GPS location data to classify the livestock behavior in the study area was 300 s.

4.2. Model Performance

Predicting the accuracy of models by using GPS data depends on the livestock type and the pasture condition [21], but when using tri-axis accelerometer data it depends only on the instantaneous body posture of the animal [15]. With the same time step to log the GPS position and the body posture by tri-axis accelerometer, models using tri-axis accelerometer data-based metrics only or combined tri-axis and GPS data-based metrics showed higher overall accuracies and κ coefficients than the models that used only GPS data-based metrics (Figure 3).

The distance moved by a livestock over a given time interval is expected to be an indicator of its activity. Short distances are likely to indicate static behavior (standing, ruminating), and long distances typically are associated with foraging [33]. In the current study, distance variables were the first four most important variables in most of the GPS models (Figure 4), thus supporting the power of using distance to classify cattle behavior.

The GPS models demonstrated several critical distances for classifying grazing and nongrazing behaviors (Figure 4). But, the marginal probabilities of the important variables to distinguish between grazing and nongrazing behaviors were lower for the GPS models than for the tri-axis models (Figure S1 and Figure 7). Moreover, the distances tended to be within the range that ambiguously classified the two behaviors (Figure S1). Therefore, distinguishing between grazing and nongrazing was particularly challenging and relied on the use of multiple movement metrics, including backward and forward linear and accumulative distances (Figure 4). For example, for the 300 s time interval, d19 was the first most important metric to determine the two behaviors. The marginal probability for nongrazing was approximately 40%, meaning unclear differentiation between grazing and nongrazing when d19 was less than 35 m. However, the probability of nongrazing was around 20%, indicating that the two behaviors were clearly differentiated when d19 exceeded 35 m. Unclear classification at shorter distances than this critical distance (35 m) might reflect the condition of the specific habitat. For example, the presence of woody vegetation might have made it more difficult to distinguish between grazing and nongrazing, because the consumption of shrubs slows movement and can blur the graze signature in terms of the motion sensor counts. In addition, 89% of the d19 data were less than 35 m. Hence, the lower probability of the distance metrics to classify the two behaviors under the threshold value and the skewed distribution of these metrics could be responsible for the relatively low accuracy of the GPS models.

The tri-axis accelerometer model was based on the body posture that was simultaneously associated with a specific behavior and did not need to account for any time interval, which might lead to uncertainty regarding behavior classification [34]. Unlike the GPS model, the tri-axis model can measure the instantaneous and independent local movement of the legs, heads, or entire bodies of animals, thus ensuring high accuracy of behavior classification [15,16,17,18]. Our findings showed that the anteroposterior movement of the neck was critical for distinguishing livestock behaviors (Figure 5), in agreement with the results of another study, which used x-axis sensor counts [14].

Livestock behaviors were influenced by the available forage and stocking density. With increasing stocking density, the average intake of each livestock will reduce due to the given availability forage in the rangeland [35]. Livestock preferred to spend less time on grazing behaviors when consuming of energy was more than grain [35]. More available forage in August (243 g/m²) than that in July (53 g/m²) in Horqin Sandy Land might lead to the livestock spending more time on grazing with sufficient energy of forage in August. For the behavior’s classification, livestock may spend less time over a given distance for finishing grazing behavior. So, the optimal time-interval of the GPS method for classifying behaviors will decrease. Our GSP model was built over 100–800 s to cover various situations corresponding with the change of rangeland pasture, thus the method can be applied in other sites.

5. Conclusions

Our current study demonstrates that data from both GPS devices and tri-axis accelerometer can be applied to build reliable models for livestock behavior classification.

To achieve the high and stable performance of the GPS model, we selected the optimal time interval from 300 to 800s, which is sufficient for most livestock activities associated with behaviors to be displayed. Metrics of linear distance had the most important effects on behavior classification. In addition, the marginal effects of linear distance indicated a distance of 35–50 m was the threshold for differentiating behaviors. At longer distances, grazing was more likely than nongrazing behavior.

Because it is based on the instantaneous acceleration of livestock body movement, the tri-axis model achieves higher performance regarding livestock behavior classification than does the GPS model. The anteroposterior movement of the animal’s neck was the most important metric for the tri-axis model. The marginal effects showed that acceleration of −3 m/s² was the threshold for differentiation of behaviors; at greater values, nongrazing was more likely than grazing.

In summary, compared with GPS models, a tri-axis model can better support livestock behavior classification, which is advantageous for assessing the detailed activities associated with investigating livestock physiology. But the main disadvantage of a tri-axis model is its lack of location information. A GPS model is sufficient for livestock behaviors classification and provides information regarding an animal’s location; this feature is associated with the interaction between livestock activities and the rangeland ecosystem. These findings may improve our understanding of how the selection of the time interval influences the process of distinguishing livestock activities in a GPS model and provide insight into selecting an optimal time interval when using GPS data only to classify livestock behaviors.

Supplementary Materials

The following are available online at https://www.mdpi.com/1424-8220/19/23/5334/s1. Figure S1. Partial dependence plots of nongrazing according to the four most important variables for time intervals of 250–800 s in the GPS model.

Author Contributions

Methodology, X.G., A.T., and F.P.; software, X.G.; resources, X.Z.; data curation, A.T., F.P.; writing—original draft preparation, X.Z., Y.L., and J.L.; writing—review and editing, F.P.; supervision, A.T.; funding acquisition, A.T.

Funding

This research was funded by ‘International Platform for Dryland Research Education’ and ‘Tottori University and Marginal Region Agriculture Project of Tottori University’.

Acknowledgments

The authors would like to thank International Platform for Dryland Research and Education, Tottori University, for financial support; Li Yuqiang, Liu Xinping, Luo Yongqing, Wang Xuyang, He Zhaoquan, and Liu Hongqing from Northwest Institute of Eco-Environment and Resources, CAS, for providing them with the equipment and the assistance in the field.

Conflicts of Interest

The authors declare no conflict of interest.

References

Assessment, Millennium Ecosystem. Ecosystems and Human Well-Being; Island Press: Washington, DC, USA, 2005. [Google Scholar]
Massa, C.; Bichet, V.; Gauthier, É.; Perren, B.B.; Mathieu, O.; Petit, C.; Monna, F.; Giraudeau, J.; Losno, R.; Richard, H. A 2500 year record of natural and anthropogenic soil erosion in South Greenland. Quat. Sci. Rev. 2012, 32, 119–130. [Google Scholar] [CrossRef]
Okayasu, T.; Okuro, T.; Jamsran, U.; Takeuchi, K. Impact of the spatial and temporal arrangement of pastoral use on land degradation around animal concentration points. Land Degrad. Dev. 2010, 21, 248–259. [Google Scholar] [CrossRef]
Manthey, M.; Peper, J. Estimation of grazing intensity along grazing gradients–the bias of nonlinearity. J. Arid Environ. 2010, 74, 1351–1354. [Google Scholar] [CrossRef]
Bailey, D.W.; Gross, J.E.; Laca, E.A.; Rittenhouse, L.R.; Coughenour, M.B.; Swift, D.M.; Sims, P.L. Mechanisms that result in large herbivore grazing distribution patterns. Rangel. Ecol. Manag. J. Range Manag. Arch. 1996, 49, 386–400. [Google Scholar] [CrossRef]
Anderson, D.M.; Winters, C.; Estell, R.E.; Fredrickson, E.L.; Doniec, M.; Detweiler, C.; Rus, D.; James, D.; Nolen, B. Characterising the spatial and temporal activities of free-ranging cows from GPS data. Rangel. J. 2012, 34, 149–161. [Google Scholar] [CrossRef]
Li, C.; Hao, X.; Zhao, M.; Han, G.; Willms, W.D. Influence of historic sheep grazing on vegetation and soil properties of a Desert Steppe in Inner Mongolia. Agric. Ecosyst. Environ. 2008, 128, 109–116. [Google Scholar] [CrossRef]
Fernandez-Gimenez, M.; Allen-Diaz, B. Vegetation change along gradients from water sources in three grazed Mongolian ecosystems. Plant Ecol. 2001, 157, 101–118. [Google Scholar] [CrossRef]
Scimone, M.; Rook, A.; Garel, J.; Sahin, N. Effects of livestock breed and grazing intensity on grazing systems: 3. Effects on diversity of vegetation. Grass Forage Sci. 2007, 62, 172–184. [Google Scholar] [CrossRef]
Warren, S.; Thurow, T.; Blackburn, W.; Garza, N. The influence of livestock trampling under intensive rotation grazing on soil hydrologic characteristics. Rangel. Ecol. Manag. J. Range Manag. Arch. 1986, 39, 491–495. [Google Scholar] [CrossRef]
Lagarde, F.; Guillon, N.; Dubroca, L.; Bonnet, X.; Kaddour, K.B.; Slimani, T.; El Mouden, E. Slowness and acceleration: A new method to quantify the activity budget of chelonians. Anim. Behav. 2008, 75, 319–329. [Google Scholar] [CrossRef]
Cornou, C.; Lundbye-Christensen, S. Classifying sows’ activity types from acceleration patterns: An application of the multi-process Kalman filter. Appl. Anim. Behav. Sci. 2008, 111, 262–273. [Google Scholar] [CrossRef]
Martiskainen, P.; Järvinen, M.; Skön, J.-P.; Tiirikainen, J.; Kolehmainen, M.; Mononen, J. Cow behaviour pattern recognition using a three-dimensional accelerometer and support vector machines. Appl. Anim. Behav. Sci. 2009, 119, 32–38. [Google Scholar] [CrossRef]
González, L.; Bishop-Hurley, G.; Handcock, R.N.; Crossman, C. Behavioral classification of data from collars containing motion sensors in grazing cattle. Comput. Electron. Agric. 2015, 110, 91–102. [Google Scholar] [CrossRef]
Fahlman, A.; Wilson, R.; Svärd, C.; Rosen, D.A.; Trites, A.W. Activity and diving metabolism correlate in Steller sea lion Eumetopias jubatus. Aquat. Biol. 2008, 2, 75–84. [Google Scholar] [CrossRef]
Gleiss, A.C.; Dale, J.J.; Holland, K.N.; Wilson, R.P. Accelerating estimates of activity-specific metabolic rate in fishes: Testing the applicability of acceleration data-loggers. J. Exp. Mar. Biol. Ecol. 2010, 385, 85–91. [Google Scholar] [CrossRef]
Green, J.; Halsey, L.; Wilson, R.; Frappell, P. Estimating energy expenditure of animals using the accelerometry technique: Activity, inactivity and comparison with the heart-rate technique. J. Exp. Biol. 2009, 212, 471–482. [Google Scholar] [CrossRef]
Halsey, L.G.; Shepard, E.L.; Hulston, C.J.; Venables, M.C.; White, C.R.; Jeukendrup, A.E.; Wilson, R.P. Acceleration versus heart rate for estimating energy expenditure and speed during locomotion in animals: Tests with an easy model species, Homo sapiens. Zoology 2008, 111, 231–241. [Google Scholar] [CrossRef]
Homburger, H.; Schneider, M.K.; Hilfiker, S.; Lüscher, A. Inferring behavioral states of grazing livestock from high-frequency position data alone. PLoS ONE 2014, 9, e114522. [Google Scholar] [CrossRef]
Schlecht, E.; Hülsebusch, C.; Mahler, F.; Becker, K. The use of differentially corrected global positioning system to monitor activities of cattle at pasture. Appl. Anim. Behav. Sci. 2004, 85, 185–202. [Google Scholar] [CrossRef]
De Weerd, N.; van Langevelde, F.; van Oeveren, H.; Nolet, B.A.; Kölzsch, A.; Prins, H.H.; de Boer, W.F. Deriving animal behaviour from high-frequency GPS: Tracking cows in open and forested habitat. PLoS ONE 2015, 10, e0129030. [Google Scholar] [CrossRef]
Li, Y.; Zhao, X.; Wang, S.; Zhang, F.; Lian, J.; Huang, W.; Mao, W. Carbon accumulation in the bulk soil and different soil fractions during the rehabilitation of desertified grassland in Horqin Sandy Land (Northern China). Pol. J. Ecol. 2015, 63, 88–102. [Google Scholar] [CrossRef]
Zuo, X.; Zhao, X.; Zhao, H.; Zhang, T.; Guo, Y.; Li, Y.; Huang, Y. Spatial heterogeneity of soil properties and vegetation–soil relationships following vegetation restoration of mobile dunes in Horqin Sandy Land, Northern China. Plant Soil 2009, 318, 153–167. [Google Scholar] [CrossRef]
Scarnecchia, D.L. The animal-unit and animal-unit-equivalent concepts in range science. Rangel. Ecol. Manag. J. Range Manag. Arch. 1985, 38, 346–349. [Google Scholar] [CrossRef]
Zuo, X.; Knops, J.; Zhao, X.; Zhao, H.; Zhang, T.; Li, Y.; Guo, Y. Indirect drivers of plant diversity-productivity relationship in semiarid sandy grasslands. Biogeosciences 2012, 9, 1277–1289. [Google Scholar] [CrossRef]
Wilson, R.P.; White, C.R.; Quintana, F.; Halsey, L.G.; Liebsch, N.; Martin, G.R.; Butler, P.J. Moving towards acceleration for estimates of activity-specific metabolic rate in free-living animals: The case of the cormorant. J. Anim. Ecol. 2006, 75, 1081–1090. [Google Scholar] [CrossRef]
Shepard, E.L.; Wilson, R.P.; Quintana, F.; Laich, A.G.; Liebsch, N.; Albareda, D.A.; Halsey, L.G.; Gleiss, A.; Morgan, D.T.; Myers, A.E. Identification of animal movement patterns using tri-axial accelerometry. Endanger. Species Res. 2008, 10, 47–60. [Google Scholar] [CrossRef]
Evans, J.S.; Cushman, S.A. Gradient modeling of conifer species using random forests. Landsc. Ecol. 2009, 24, 673–683. [Google Scholar] [CrossRef]
Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
Mouton, A.M.; De Baets, B.; Goethals, P.L. Ecological relevance of performance criteria for species distribution models. Ecol. Model. 2010, 221, 1995–2002. [Google Scholar] [CrossRef]
Cutler, D.R.; Edwards, T.C., Jr.; Beard, K.H.; Cutler, A.; Hess, K.T.; Gibson, J.; Lawler, J.J. Random forests for classification in ecology. Ecology 2007, 88, 2783–2792. [Google Scholar] [CrossRef]
Shoukri, M.; Martin, S. Estimating the number of clusters for the analysis of correlated binary response variables from unbalanced data. Stat. Med. 1992, 11, 751–760. [Google Scholar] [CrossRef] [PubMed]
Augustine, D.; Derner, J. Assessing herbivore foraging behavior with GPS collars in a semiarid grassland. Sensors 2013, 13, 3711–3723. [Google Scholar] [CrossRef] [PubMed]
Scheibe, K.M.; Gromann, C. Application testing of a new three-dimensional acceleration measuring system with wireless data transfer (WAS) for behavior analysis. Behav. Res. Methods 2006, 38, 427–433. [Google Scholar] [CrossRef] [PubMed]
Hepworth, K.; Test, P.; Hart, R.; Waggoner, J.; Smith, M. Grazing systems, stocking rates, and cattle behavior in southeastern Wyoming. Rangel. Ecol. Manag. J. Range Manag. Arch. 1991, 44, 259–262. [Google Scholar] [CrossRef]

Figure 1. Schematic representation of movement metrics used as predictive metric in the classification. Movement metrics include backward accumulative distance (

d_{(b 2, a 1)}

,

d_{(a 1, b 1)}

), forward accumulative distance (

d_{(a 1, a 3)}

,

d_{(a 1, a 4)}),

backward linear distance (

d_{1}

,

d_{2}

), forward linear distance (

d_{3}

,

d_{4}

), and turning angle between Global Positioning System (GPS) positions (c).

Figure 1. Schematic representation of movement metrics used as predictive metric in the classification. Movement metrics include backward accumulative distance (

d_{(b 2, a 1)}

,

d_{(a 1, b 1)}

), forward accumulative distance (

d_{(a 1, a 3)}

,

d_{(a 1, a 4)}),

backward linear distance (

d_{1}

,

d_{2}

), forward linear distance (

d_{3}

,

d_{4}

), and turning angle between Global Positioning System (GPS) positions (c).

Figure 2. Metrics of distance extracted from GPS device were used to classify livestock behaviors from 100 to 800s time intervals in Random Forest model. a₁ is the focal point, a_2-17 and b_1-16 were forward and backward locations at time interval from 100 to 800s. d₁-d₆₂ were the forward and backward linear distance metrics of distance from 100 to 800s time interval. Accumulative distances were calculated by Equations (2)–(5). d₆₂-d₉₂ were the accumulative distances metrics used in the model. Forward accumulative distance: d63 = d2 + d3; d64 = d63 + d8; d65 = d64 + d12; d66 = d65 + d14; d67 = d66 + d16; d68 = d67 + d23; d69 = d68 + d34; d70 = d69 + d35; d71 = d70 + d36; d72 = d71 + d37; d73 = d72 + d38; d74 = d73 + d39; d75 = d74 + d39; d76 = d75 + d41; d77 = d76 + d42. Backward accumulative distance: d78 = d1 + d4; d79 = d78 + d7; d80 = d79 + d11; d81 = d80 + d13; d82 = d81 + d15; d83 = d82 + d23; d84 = d83 + d24; d85 = d84 + d25; d86 = d85 + d26; d87 = d86 + d27; d88 = d87 + d28; d89 = d88 + d29; d90 = d89 + d30; d91 = d90 + d31; d92 = d91 + d32. The meaning and time interval of a specific accumulative distance can be read from Figure 2. For example, d63 = d2 + d3, thus d63 is the forward accumulative distance at 100s.

Figure 3. (a) Overall accuracy and (b) κ coefficients of the GPS (gray bars) and GPS-tri (white bars) with time intervals of 100–800 s and of the tri-axis model (black bars).

Figure 4. Variable importance plot generated by using the Random Forest algorithm with GPS models. The plot shows the first four important metrics of each GPS model (1, 2, 3, 4) according to the mean decrease in Gini; as this parameter increases, the variable is more important and a more accurate predictor of behavior classification. See Figure 2 and equation (Equations (1)–(6)) for the meaning of metrics.

Figure 5. Variable importance plot generated by using the Random Forest algorithm with the tri-axis model. The plot shows the importance of each variable according to the mean decrease in Gini; as this parameter increases, the variable is more important and a more accurate predictor of behavior classification. See Equations (6)–(9).

Figure 6. Partial dependence plots of nongrazing (A) and the proportion of behaviors corresponding to threshold in the GPS model (B). Partial plots represent the marginal effect of a single metric (d19, d18, d17, d20) of 300 s time-interval included in the Random Forest model on the probability of nongrazing behavior, when the effects of all other metrics are averaged out. The criteria of threshold distance of each partial plot are recognized that the nongrazing behaviors remain same probability. See Figure 2 and Equations (6)–(9) for the meaning of metrics.

Figure 7. Partial dependence plots of nongrazing (A) and the proportion of behaviors corresponding to threshold in the tri-axis model (B). Partial plots represent the marginal effect of a single metric (

{\ddot{d}}_{y n e c k,} M_{t a i l,} {\ddot{d}}_{y l e g}, {\ddot{d}}_{x l e g}

) included in the Random Forest model on the probability of nongrazing behavior, when the effects of all other metrics are averaged out. The criteria of threshold distance of each partial plot are recognized that the nongrazing behaviors remain same probability. See Equations (6)–(9) for the meaning of metrics.

Figure 7. Partial dependence plots of nongrazing (A) and the proportion of behaviors corresponding to threshold in the tri-axis model (B). Partial plots represent the marginal effect of a single metric (

{\ddot{d}}_{y n e c k,} M_{t a i l,} {\ddot{d}}_{y l e g}, {\ddot{d}}_{x l e g}

) included in the Random Forest model on the probability of nongrazing behavior, when the effects of all other metrics are averaged out. The criteria of threshold distance of each partial plot are recognized that the nongrazing behaviors remain same probability. See Equations (6)–(9) for the meaning of metrics.

Table 1. Descriptions of the observed behaviors (modified from Ganskopp and Bohnert [12]).

Behavior category	Definition	Explanation
Grazing	Foraging, Foraging–walking	Foraging: foraging continuously (head lowered) Foraging–walking: foraging while walking (head raised and lowered)
Nongrazing	Standing, Lying down, Rumination	Standing: the animal stands on all four legs, with head erect and without swinging its head from side to side Lying down: the cattle lies on the ground in any position (except flat on its side) without ruminating Ruminating: the cattle lies in a stall masticating regurgitated feed, swallowing masticated feed, or regurgitating feed with head erect

Table 2. The confusion matrix for livestock behaviors classification as categorized by using GPS models with time intervals of 100–800 s.

Observed Behaviors	Predicted Behaviors
	Grazing	Nongrazing	Percent Accuracy	Grazing	Nongrazing	Percent Accuracy	Grazing	Nongrazing	Percent Accuracy
	100 s			150 s			200 s
Grazing	421	35	0.92	428	28	0.94	428	28	0.94
Nongrazing	66	17	0.20	63	20	0.24	51	32	0.39
	250 s			300 s			350 s
Grazing	427	29	0.94	430	26	0.94	433	23	0.95
Nongrazing	44	39	0.47	30	53	0.64	34	49	0.59
	400 s			450s			500 s
Grazing	447	9	0.98	440	16	0.96	446	10	0.98
Nongrazing	33	50	0.60	31	52	52	35	48	0.58
	550 s			600 s			650 s
Grazing	446	10	0.98	444	12	0.97	445	11	0.98
Nongrazing	35	48	0.59	33	50	0.6	32	51	0.61
	700 s			750 s			800 s
Grazing	442	14	0.97	440	15	0.96	435	21	0.95
Nongrazing	32	51	0.61	28	55	0.66	29	56	0.66

For each row, accuracy was calculated as the proportion of the observed class relative to the total number of behaviors.

Table 3. The confusion matrix for livestock behaviors classification as categorized by using the tri-axis model.

Observed Behaviors	Predicted Behaviors
Observed Behaviors	Grazing	Nongrazing	Accuracy
Grazing	447	9	0.98
Nongrazing	7	76	0.92

For each row, accuracy was calculated as the proportion of the observed class relative to the total number of behaviors.

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Gou, X.; Tsunekawa, A.; Peng, F.; Zhao, X.; Li, Y.; Lian, J. Method for Classifying Behavior of Livestock on Fenced Temperate Rangeland in Northern China. Sensors 2019, 19, 5334. https://doi.org/10.3390/s19235334

AMA Style

Gou X, Tsunekawa A, Peng F, Zhao X, Li Y, Lian J. Method for Classifying Behavior of Livestock on Fenced Temperate Rangeland in Northern China. Sensors. 2019; 19(23):5334. https://doi.org/10.3390/s19235334

Chicago/Turabian Style

Gou, Xiaowei, Atsushi Tsunekawa, Fei Peng, Xueyong Zhao, Yulin Li, and Jie Lian. 2019. "Method for Classifying Behavior of Livestock on Fenced Temperate Rangeland in Northern China" Sensors 19, no. 23: 5334. https://doi.org/10.3390/s19235334

APA Style

Gou, X., Tsunekawa, A., Peng, F., Zhao, X., Li, Y., & Lian, J. (2019). Method for Classifying Behavior of Livestock on Fenced Temperate Rangeland in Northern China. Sensors, 19(23), 5334. https://doi.org/10.3390/s19235334

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Method for Classifying Behavior of Livestock on Fenced Temperate Rangeland in Northern China

Abstract

1. Introduction

2. Materials and Methods

2.1. Equipment and Animals

2.2. Observation of Livestock Behaviors

2.3. Movement Metrics Derived from GPS and Tri-Axis Accelerometer Data

2.4. Livestock Behavior Modelling

2.5. Performance of the Random Forest Classifier

3. Results

3.1. Performance of GPS, Tri-Axis, and GPS-Tri Axis Models

3.2. Cross-Validation

3.3. Relative Importance of Variables

3.4. Marginal Effect of the Variable on Livestock Behavior Classification

4. Discussion

4.1. Optimal Time Interval for GPS Models

4.2. Model Performance

5. Conclusions

Supplementary Materials

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI