Discriminating pathological, reproductive or stress conditions in cows using machine learning on sensor-based activity data

[Display omitted] •57–86 % of diseases, oestrus and calving of cows are detected with random forest applied on activity data.•Some cow conditions can be detected from activity-one or two days before caretakers noticing them.•Highest activity level during the day and the cyclic components are the mos...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Computers and electronics in agriculture 2023-01, Vol.204, p.107556, Article 107556
Hauptverfasser: Lardy, Romain, Ruin, Quentin, Veissier, Isabelle
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:[Display omitted] •57–86 % of diseases, oestrus and calving of cows are detected with random forest applied on activity data.•Some cow conditions can be detected from activity-one or two days before caretakers noticing them.•Highest activity level during the day and the cyclic components are the most discriminant features.•The proportion of false alarms remains very high (from 46 to 89 %).•The model was tested on 5 independent datasets (total, 120,000 cow*days) to check its robustness. Precision livestock farming (PLF) technologies help to detect animals experiencing specific conditions such as illness, stress, or oestrus. PLF systems either detect one or more conditions at a time, or several conditions together but without distinguishing them. The aim of this work was to detect and classify a wide range of conditions of health, stress and reproduction based on cow daily activity using machine learning on time series data derived from sensors. We used five datasets (120,000 cow*days) from experimental or commercial farms, with 28 to 300 Holstein cows per dataset and observations lasting from 1 to 12 months. Caretakers checked the animals every day and recorded any specific cow condition related to disease (lameness, mastitis, acidosis, accident, and so on), oestrus, calving, as well as potential stress due to mixing or handling. We labelled a number of days before and after the specific condition during which the behaviour might be altered. Sensors (location systems or accelerometers) captured the per-hour duration of key cow activities - eating, resting, in alleys - that served to compute the activity level. The distribution of the activity level in each 24 h time series was described by 21 time-domain or frequency-domain features. We then ran the random forest algorithm to relate the features of the time series to animal condition (1000 trees; 10 random splits of 70 % train / 30 % test subsets; oversampling applied on train subsets to reduce the imbalance between the occurrences of cow conditions). The most important features in the random forest algorithm were, in descending order: maximum, 90% quantile, Fourier harmonics 2 and 1, 50 % and 75 % quantiles, Fourier harmonics 3 and 4. This approach correctly classified over 90 % of the control series (i.e. with no specific condition) and classified 24 % of the specific condition-labelled series as non-control series. The condition-labelled series can be confused with control series and to a lesser extent w
ISSN:0168-1699
1872-7107
DOI:10.1016/j.compag.2022.107556