Advances in Audio Classification and Artificial Intelligence for Respiratory Health and Welfare Monitoring in Swine
Simple Summary
Abstract
1. Introduction
2. Source of Pig Sounds in Farm Environments
2.1. Physiological Mechanisms
2.2. Behavioral and Emotional Contexts
3. Types of Pig Vocalizations
4. Fundamental Concepts of Audio Classification
5. Audio Classification Challenges Specific to Pig Farms
6. Sound Acquisition Technologies in Pig Farms
7. Audio Annotation and Labeling Platforms
8. Embedded Edge Computing Devices for On-Farm Audio Analysis
9. Audio Feature Engineering for Pig Sound Recognition
10. Machine Learning and Deep Learning Approaches for Pig Audio Classification
11. Disease-Focused Audio Classification Research
12. Behavior Recognition Through Acoustics
13. Real-Time, Edge & On-Farm Deployments
14. Evaluation Strategies
15. Generalization Challenges
16. Gaps and Limitations Identified in Current Research
17. Emerging Opportunities for Future AI Models
18. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| AI | Artificial Intelligence |
| STFT | Short-Time Fourier Transform |
| MFCCs | Mel-Frequency Cepstral Coefficients |
| LPC | Linear Predictive Coding |
| CNN | Convolutional Neural Network |
| RNN | Recurrent Neural Network |
| ROC | Receiver Operating Characteristic |
| SNR | Signal-to-Noise Ratio |
| MEMS | Micro-Electro-Mechanical Systems |
| ML | Machine Learning |
| MCU | Microcontroller Unit |
| AIoT | Artificial Intelligence of Things |
| DNN–HMM | Deep Neural Network–Hidden Markov Model |
| SVM | Support Vector Machine |
| CQT | Constant-Q Transform |
| RF | Random Forest |
| KNN | K-Nearest Neighbors |
| DWT | Discrete Wavelet Transform |
| RMS | Root Mean Square |
| ZCR | Zero-Crossing Rate |
| LBP | Local Binary Patterns |
| HOG | Histogram of Oriented Gradients |
| GMM-HMM | Gaussian Mixture Model–Hidden Markov Model |
| STE | Short-Time Energy |
| FC | Frequency Centroid |
| FF | Formant Frequency |
| PCA | Principal Component Analysis |
| BP Neural Network | Backpropagation Neural Network |
| GA | Genetic Algorithm |
| DCNN | Deep Convolutional Neural Network |
| EMD-TEO | Empirical Mode Decomposition–Teager Energy Operator |
| SVDD | Support Vector Data Description |
| PRRSV | Porcine Reproductive and Respiratory Syndrome Virus |
| AST | Audio Spectrogram Transformer |
| SRC | Sparse Representation Classifier |
| AUC | Area Under the Receiver Operating Characteristic Curve |
Appendix A
| Platform Name | Key Features & Use Cases | Ref. |
|---|---|---|
| Crowsetta | Python-based, works with any annotation format, supports flexible workflows for animal vocalizations and bioacoustics data. | [85] |
| Whombat | Browser-based, user-friendly, supports collaborative annotation, visualization, and ML-assisted workflows. | [91] |
| ecoSound-web | Open-source, online, supports manual and automatic annotation, peer review, and reference libraries. | [88] |
| DISCO | Open-source, deep learning ensemble for segmentation and labeling, includes visual tools for analysis. | [87] |
| NEAL | R/Shiny-based, interactive, designed for large datasets, modifiable for generic or farm animal audio. | [86] |
| Custom/Semi-automatic Tools | Tailored for specific species (e.g., monophonic cow sound annotation tool, goat vocalization annotation tool), often semi-automatic and visually transparent. | [89,90] |
| Study/Method | Behaviors Detected | Model/Features Used | Accuracy/Findings | Ref. |
|---|---|---|---|---|
| TinyML CNN | Agonistic, social | CNN, embedded system | >90% accuracy | [67] |
| DCNN Mixed-MMCT | Vocalization/non-vocalization | DCNN, multi-feature fusion | Up to 99.67% accuracy | [49] |
| Multi-feature fusion NN | Grunting, squealing, coughing | NN, time/frequency features | 93.2% average accuracy | [24] |
| Neural network on call valence | Emotional valence, context | Neural network, spectrograms | 91.5% (valence), 81.5% (context) | [21] |
| Factor | Importance/Role | References |
|---|---|---|
| Microphone quality | Accurate sound capture, noise reduction | [10,125,126] |
| Edge device efficiency | Real-time, low-power inference | [92,125,126,127] |
| Model compression | Fit models to device constraints | [125,127,128,129] |
| Noise robustness | Reliable detection in farm environments | [10,125,126,130] |
| Explainable outputs | Farmer trust, regulatory compliance | [92] |
| Power/connectivity | Continuous, autonomous operation | [10,125] |
| User adoption | Practicality, integration, low false alarms | [6,131] |
| Strategy | Description/Use Case | References |
|---|---|---|
| Manual annotation | Human-labeled ground truth | [24,49,77] |
| Cross-validation | K-fold (5/10) for robustness | [24,49,77] |
| External validation | Testing on new farms/populations | [21,49] |
| Standard metrics | Accuracy, precision, recall, F1, AUC | [24,40,49,78] |
| Field trials | Real-world deployment and comparison | [75,77] |
| Limitation | Description/Impact | References |
|---|---|---|
| Dataset diversity | Small, non-standardized, limited environments | [24,40,41,49] |
| Annotation challenges | Labor-intensive, error-prone, inconsistent | [40,41,49,134] |
| Generalization | Poor transfer to new farms/environments | [40,49,133] |
| Overlapping sounds | Rarely addressed in model design | [40] |
| Real-world noise | Underrepresented in curated datasets | [24,40,49] |
References
- Tzanidakis, C.; Simitzis, P.; Arvanitis, K.; Panagakis, P. An Overview of the Current Trends in Precision Pig Farming Technologies. Livest. Sci. 2021, 249, 104530. [Google Scholar] [CrossRef]
- Marić, K.; Gvozdanović, K.; Djurkin Kušec, I.; Kušec, G.; Margeta, V. Smart Pig Farms: Integration and Application of Digital Technologies in Pig Production. Agriculture 2025, 15, 937. [Google Scholar] [CrossRef]
- Racewicz, P.; Ludwiczak, A.; Skrzypczak, E.; Składanowska-Baryza, J.; Biesiada, H.; Nowak, T.; Nowaczewski, S.; Zaborowicz, M.; Stanisz, M.; Ślósarz, P. Welfare Health and Productivity in Commercial Pig Herds. Animals 2021, 11, 1176. [Google Scholar] [CrossRef]
- Chen, C.-S.; Chen, W.-C. Research and Development of Automatic Monitoring System for Livestock Farms. Appl. Sci. 2019, 9, 1132. [Google Scholar] [CrossRef]
- Matthews, S.G.; Miller, A.L.; PlÖtz, T.; Kyriazakis, I. Automated Tracking to Measure Behavioural Changes in Pigs for Health and Welfare Monitoring. Sci. Rep. 2017, 7, 17582. [Google Scholar] [CrossRef]
- Pandey, S.; Kalwa, U.; Kong, T.; Guo, B.; Gauger, P.C.; Peters, D.J.; Yoon, K.-J. Behavioral Monitoring Tool for Pig Farmers: Ear Tag Sensors, Machine Intelligence, and Technology Adoption Roadmap. Animals 2021, 11, 2665. [Google Scholar] [CrossRef]
- Dong, Y.; Bonde, A.; Codling, J.R.; Bannis, A.; Cao, J.; Macon, A.; Rohrer, G.; Miles, J.; Sharma, S.; Brown-Brandl, T.; et al. PigSense: Structural Vibration-Based Activity and Health Monitoring System for Pigs. ACM Trans. Sen. Netw. 2023, 20, 1–43. [Google Scholar] [CrossRef]
- Benjamin, M.; Yik, S. Precision Livestock Farming in Swine Welfare: A Review for Swine Practitioners. Animals 2019, 9, 133. [Google Scholar] [CrossRef] [PubMed]
- Guo, Q.; Sun, Y.; Orsini, C.; Bolhuis, J.E.; de Vlieg, J.; Bijma, P.; de With, P.H.N. Enhanced Camera-Based Individual Pig Detection and Tracking for Smart Pig Farms. Comput. Electron. Agric. 2023, 211, 108009. [Google Scholar] [CrossRef]
- Reza, M.N.; Ali, M.R.; Haque, M.A.; Jin, H.; Kyoung, H.; Choi, Y.K.; Kim, G.; Chung, S.-O. A Review of Sound-Based Pig Monitoring for Enhanced Precision Production. J. Anim. Sci. Technol. 2025, 67, 277–302. [Google Scholar] [CrossRef]
- Wang, S.; Jiang, H.; Qiao, Y.; Jiang, S.; Lin, H.; Sun, Q. The Research Progress of Vision-Based Artificial Intelligence in Smart Pig Farming. Sensors 2022, 22, 6541. [Google Scholar] [CrossRef]
- Herbst, C.T.; Nishimura, T.; Garcia, M.; Migimatsu, K.; Tokuda, I.T. Effect of Ventricular Folds on Vocalization Fundamental Frequency in Domestic Pigs (Sus scrofa domesticus). J. Voice 2021, 35, 805.e1–805.e15. [Google Scholar] [CrossRef] [PubMed]
- Briefer, E.F.; Vizier, E.; Gygax, L.; Hillmann, E. Expression of Emotional Valence in Pig Closed-Mouth Grunts: Involvement of Both Source- and Filter-Related Parameters. J. Acoust. Soc. Am. 2019, 145, 2895–2908. [Google Scholar] [CrossRef]
- Taylor, A.M.; Reby, D. The Contribution of Source–Filter Theory to Mammal Vocal Communication Research. J. Zool. 2010, 280, 221–236. [Google Scholar] [CrossRef]
- Linhart, P.; Ratcliffe, V.F.; Reby, D.; Špinka, M. Expression of Emotional Arousal in Two Different Piglet Call Types. PLoS ONE 2015, 10, e0135414. [Google Scholar] [CrossRef] [PubMed]
- Leliveld, L.M.C.; Düpjan, S.; Tuchscherer, A.; Puppe, B. Vocal Correlates of Emotional Reactivity within and across Contexts in Domestic Pigs (Sus scrofa). Physiol. Behav. 2017, 181, 117–126. [Google Scholar] [CrossRef] [PubMed]
- Park, J.; Choi, S.; Takatoh, J.; Zhao, S.; Harrahill, A.; Han, B.-X.; Wang, F. Brainstem Control of Vocalization and Its Coordination with Respiration. Science 2024, 383, eadi8081. [Google Scholar] [CrossRef]
- Villain, A.S.; Lanthony, M.; Guérin, C.; Tallet, C. Manipulable Object and Human Contact: Preference and Modulation of Emotional States in Weaned Pigs. Front. Vet. Sci. 2020, 7, 577433. [Google Scholar] [CrossRef]
- Villain, A.S.; Guérin, C.; Tallet, C. The Use of Pigs Vocalisation Structure to Assess the Quality of Human-Pig Relationship. Peer Community J. 2023, 3, e36. [Google Scholar] [CrossRef]
- Friel, M.; Kunc, H.P.; Griffin, K.; Asher, L.; Collins, L.M. Positive and Negative Contexts Predict Duration of Pig Vocalisations. Sci. Rep. 2019, 9, 2062. [Google Scholar] [CrossRef]
- Briefer, E.F.; Sypherd, C.C.-R.; Linhart, P.; Leliveld, L.M.C.; Padilla de la Torre, M.; Read, E.R.; Guérin, C.; Deiss, V.; Monestier, C.; Rasmussen, J.H.; et al. Author Correction: Classification of Pig Calls Produced from Birth to Slaughter According to Their Emotional Valence and Context of Production. Sci. Rep. 2023, 13, 18824. [Google Scholar] [CrossRef] [PubMed]
- Dalmau, A.; Sánchez-Matamoros, A.; Molina, J.M.; Xercavins, A.; Varvaró-Porter, A.; Muñoz, I.; Moles, X.; Baulida, B.; Fàbrega, E.; Velarde, A.; et al. Intramuscular vs. Intradermic Needle-Free Vaccination in Piglets: Relevance for Animal Welfare Based on an Aversion Learning Test and Vocalizations. Front. Vet. Sci. 2021, 8, 715260. [Google Scholar] [CrossRef]
- Villain, A.S.; Hazard, A.; Danglot, M.; Guérin, C.; Boissy, A.; Tallet, C. Piglets Vocally Express the Anticipation of Pseudo-Social Contexts in Their Grunts. Sci. Rep. 2020, 10, 18496. [Google Scholar] [CrossRef]
- Hou, Y.; Li, Q.; Wang, Z.; Liu, T.; He, Y.; Li, H.; Ren, Z.; Guo, X.; Yang, G.; Liu, Y.; et al. Study on a Pig Vocalization Classification Method Based on Multi-Feature Fusion. Sensors 2024, 24, 313. [Google Scholar] [CrossRef]
- Olczak, K.; Penar, W.; Nowicki, J.; Magiera, A.; Klocek, C. The Role of Sound in Livestock Farming—Selected Aspects. Animals 2023, 13, 2307. [Google Scholar] [CrossRef]
- Zhao, J.; Li, X.; Liu, W.; Gao, Y.; Lei, M.; Tan, H.; Yang, D. DNN-HMM Based Acoustic Model for Continuous Pig Cough Sound Recognition. Int. J. Agric. Biol. Eng. 2020, 13, 186–193. [Google Scholar] [CrossRef]
- Wang, X.; Yin, Y.; Dai, X.; Shen, W.; Kou, S.; Dai, B. Automatic Detection of Continuous Pig Cough in a Complex Piggery Environment. Biosyst. Eng. 2024, 238, 78–88. [Google Scholar] [CrossRef]
- Shen, W.; Ji, N.; Yin, Y.; Dai, B.; Tu, D.; Sun, B.; Hou, H.; Kou, S.; Zhao, Y. Fusion of Acoustic and Deep Features for Pig Cough Sound Recognition. Comput. Electron. Agric. 2022, 197, 106994. [Google Scholar] [CrossRef]
- Yin, Y.; Ji, N.; Wang, X.; Shen, W.; Dai, B.; Kou, S.; Liang, C. An Investigation of Fusion Strategies for Boosting Pig Cough Sound Recognition. Comput. Electron. Agric. 2023, 205, 107645. [Google Scholar] [CrossRef]
- Pan, W.; Li, H.; Zhou, X.; Jiao, J.; Zhu, C.; Zhang, Q. Research on Pig Sound Recognition Based on Deep Neural Network and Hidden Markov Models. Sensors 2024, 24, 1269. [Google Scholar] [CrossRef] [PubMed]
- Chung, Y.; Oh, S.; Lee, J.; Park, D.; Chang, H.-H.; Kim, S. Automatic Detection and Recognition of Pig Wasting Diseases Using Sound Data in Audio Surveillance Systems. Sensors 2013, 13, 12929–12942. [Google Scholar] [CrossRef]
- Wang, X.; Zhao, X.; He, Y.; Wang, K. Cough Sound Analysis to Assess Air Quality in Commercial Weaner Barns. Comput. Electron. Agric. 2019, 160, 8–13. [Google Scholar] [CrossRef]
- Ferrari, S.; Silva, M.; Guarino, M.; Aerts, J.M.; Berckmans, D. Cough Sound Analysis to Identify Respiratory Infection in Pigs. Comput. Electron. Agric. 2008, 64, 318–325. [Google Scholar] [CrossRef]
- Cohen, L.M.; Grøntvedt, C.A.; Klem, T.B.; Gulliksen, S.M.; Ranheim, B.; Nielsen, J.P.; Valheim, M.; Kielland, C. A Descriptive Study of Acute Outbreaks of Respiratory Disease in Norwegian Fattening Pig Herds. Acta Vet. Scand. 2020, 62, 35. [Google Scholar] [CrossRef] [PubMed]
- Haimi-Hakala, M.; Hälli, O.; Laurila, T.; Raunio-Saarnisto, M.; Nokireki, T.; Laine, T.; Nykäsenoja, S.; Pelkola, K.; Segales, J.; Sibila, M.; et al. Etiology of Acute Respiratory Disease in Fattening Pigs in Finland. Porc. Health Manag. 2017, 3, 19. [Google Scholar] [CrossRef] [PubMed]
- Mito, M.; Aoki, T.; Mizutani, K.; Zempo, K.; Wakatsuki, N.; Maeda, Y.; Takemae, N.; Tsunekuni, R.; Saito, T. Frequency Analysis of the Sneeze Caused by Swine Influenza Virus Strains: Automatic Sneeze around-the-Clock Detection Using a Support Vector Machine. Comput. Electron. Agric. 2020, 179, 105789. [Google Scholar] [CrossRef]
- Laurijs, K.A.; Briefer, E.F.; Reimert, I.; Webb, L.E. Vocalisations in Farm Animals: A Step towards Positive Welfare Assessment. Appl. Anim. Behav. Sci. 2021, 236, 105264. [Google Scholar] [CrossRef]
- Vandermeulen, J.; Bahr, C.; Tullo, E.; Fontana, I.; Ott, S.; Kashiha, M.; Guarino, M.; Moons, C.P.H.; Tuyttens, F.a.M.; Niewold, T.A.; et al. Discerning Pig Screams in Production Environments. PLoS ONE 2015, 10, e0123111. [Google Scholar] [CrossRef]
- Heseker, P.; Bergmann, T.; Scheumann, M.; Traulsen, I.; Kemper, N.; Probst, J. Detecting Tail Biters by Monitoring Pig Screams in Weaning Pigs. Sci. Rep. 2024, 14, 4523. [Google Scholar] [CrossRef]
- Wu, X.; Zhou, S.; Chen, M.; Zhao, Y.; Wang, Y.; Zhao, X.; Li, D.; Pu, H. Combined Spectral and Speech Features for Pig Speech Recognition. PLoS ONE 2022, 17, e0276778. [Google Scholar] [CrossRef]
- Liao, J.; Li, H.; Feng, A.; Wu, X.; Luo, Y.; Duan, X.; Ni, M.; Li, J. Domestic Pig Sound Classification Based on TransformerCNN. Appl. Intell. 2023, 53, 4907–4923. [Google Scholar] [CrossRef]
- Zapata Cardona, J.; Ceballos, M.C.; Tarazona Morales, A.M.; David Jaramillo, E.; de Jesús Rodríguez, B. Spectro-Temporal Acoustic Elements of Music Interact in an Integrated Way to Modulate Emotional Responses in Pigs. Sci. Rep. 2023, 13, 2994. [Google Scholar] [CrossRef]
- Alvarez-Hernandez, N.; Vallejo-Timarán, D.; de Jesús Rodriguez, B. Adapted Original Music as an Environmental Enrichment in an Intensive Pig Production System Reduced Aggression in Weaned Pigs during Regrouping. Animals 2023, 13, 3599. [Google Scholar] [CrossRef]
- Ricci, G.D.; de Sousa, R.T.; da Silva Miranda, K.O.; Tonon, E.; Passere, A.L.; de Matos Passere, A.L.; Titto, C.G. Evaluating the Influence of Noise Emitted by Environmental Enrichment Equipment in Routine Farrowing Management Practices. Rev. Ciências Agroveterinárias 2020, 19, 434–439. [Google Scholar] [CrossRef]
- Damasceno, F.A.; Oliveira, C.E.A.; Ferraz, G.A.; Nascimento, J.A.; Saraz, J.A.; Brandão, L.F. Welfare and Spatial Distribution of Noise Levels in Swine Nursery. Rev. Bras. Eng. Agrícla Ambient. 2019, 23, 196–202. [Google Scholar] [CrossRef]
- Wegner, B.; Spiekermeier, I.; Nienhoff, H.; Große-Kleimann, J.; Rohn, K.; Meyer, H.; Plate, H.; Gerhardy, H.; Kreienbrock, L.; grosse Beilage, E.; et al. Status Quo Analysis of Noise Levels in Pig Fattening Units in Germany. Livest. Sci. 2019, 230, 103847. [Google Scholar] [CrossRef]
- Hong, M.; Ahn, H.; Atif, O.; Lee, J.; Park, D.; Chung, Y. Field-Applicable Pig Anomaly Detection System Using Vocalization for Embedded Board Implementations. Appl. Sci. 2020, 10, 6991. [Google Scholar] [CrossRef]
- Kroll, L.; Herbrandt, S.; Kemper, N.; Fels, M. Determination of the Sound Level during Different Management Measures in Piglet Rearing Related to Animal Welfare and Human Health and Safety. Livest. Sci. 2024, 280, 105410. [Google Scholar] [CrossRef]
- Pann, V.; Kwon, K.; Kim, B.; Jang, D.-H.; Kim, J.-B. DCNN for Pig Vocalization and Non-Vocalization Classification: Evaluate Model Robustness with New Data. Animals 2024, 14, 2029. [Google Scholar] [CrossRef]
- Humann, M.J.; Donham, K.J.; Jones, M.L.; Achutan, C.; Smith, B.J. Occupational Noise Exposure Assessment in Intensive Swine Farrowing Systems: Dosimetry, Octave Band, and Specific Task Analysis. J. Agromedicine 2005, 10, 23–37. [Google Scholar] [CrossRef] [PubMed]
- Talling, J.C.; Lines, J.A.; Wathes, C.M.; Waran, N.K. The Acoustic Environment of the Domestic Pig. J. Agric. Eng. Res. 1998, 71, 1–12. [Google Scholar] [CrossRef]
- Xie, Y.; Wang, J.; Chen, C.; Yin, T.; Yang, S.; Li, Z.; Zhang, Y.; Ke, J.; Song, L.; Gan, L. Sound Identification of Abnormal Pig Vocalizations: Enhancing Livestock Welfare Monitoring on Smart Farms. Inf. Process. Manag. 2024, 61, 103770. [Google Scholar] [CrossRef]
- Gourisaria, M.K.; Agrawal, R.; Sahni, M.; Singh, P.K. Comparative Analysis of Audio Classification with MFCC and STFT Features Using Machine Learning Techniques. Discov. Internet Things 2024, 4, 1. [Google Scholar] [CrossRef]
- Zhang, J. Music Feature Extraction and Classification Algorithm Based on Deep Learning. Sci. Program. 2021, 2021, 1651560. [Google Scholar] [CrossRef]
- Franzoni, V. Cross-Domain Synergy: Leveraging Image Processing Techniques for Enhanced Sound Classification through Spectrogram Analysis Using CNNs. J. Auton. Intell. 2023, 6, 678. [Google Scholar] [CrossRef]
- Singh, J. pyAudioProcessing: Audio Processing, Feature Extraction, and Machine Learning Modeling. In Proceedings of the 21st Python in Science Conference (SciPy 2022), Austin, TX, USA, 11–17 July 2022. [Google Scholar] [CrossRef]
- Zaman, K.; Sah, M.; Direkoglu, C.; Unoki, M. A Survey of Audio Classification Using Deep Learning. IEEE Access 2023, 11, 106620–106649. [Google Scholar] [CrossRef]
- Sharma, G.; Umapathy, K.; Krishnan, S. Trends in Audio Signal Feature Extraction Methods. Appl. Acoust. 2020, 158, 107020. [Google Scholar] [CrossRef]
- Melo, D.D.F.P.; Fadigas, I.D.S.; Pereira, H.B.D.B. Graph-Based Feature Extraction: A New Proposal to Study the Classification of Music Signals Outside the Time-Frequency Domain. PLoS ONE 2020, 15, e0240915. [Google Scholar] [CrossRef]
- Jin, W.; Wang, X.; Zhan, Y. Environmental Sound Classification Algorithm Based on Region Joint Signal Analysis Feature and Boosting Ensemble Learning. Electronics 2022, 11, 3743. [Google Scholar] [CrossRef]
- Purwins, H.; Li, B.; Virtanen, T.; Schlüter, J.; Chang, S.-Y.; Sainath, T. Deep Learning for Audio Signal Processing. IEEE J. Sel. Top. Signal Process. 2019, 13, 206–219. [Google Scholar] [CrossRef]
- Levy, J.; Naitsat, A.; Zeevi, Y.Y. Classification of Audio Signals Using Spectrogram Surfaces and Extrinsic Distortion Measures. EURASIP J. Adv. Signal Process. 2022, 2022, 100. [Google Scholar] [CrossRef]
- Pan, W.; Jiao, J.; Zhou, X.; Xu, Z.; Gu, L.; Zhu, C. Underdetermined Blind Source Separation of Audio Signals for Group Reared Pigs Based on Sparse Component Analysis. Sensors 2024, 24, 5173. [Google Scholar] [CrossRef]
- Chung, S.Y.; Kim, S.C. Analysis of Livestock Vocal Data using Lightweight MobileNet. Smart Media J. 2024, 13, 16–23. [Google Scholar] [CrossRef]
- Chung, S.; Zhou, H.; Arsa, D.M.S.; Kim, S.; Kim, H. A Multi-Stage Ensemble Framework for Classifying Pig Vocalizations under Noisy Animal Farm Environments. Sci. Rep. 2025, 15, 34703. [Google Scholar] [CrossRef]
- Ji, N.; Yin, Y.; Shen, W.; Kou, S.; Dai, B.; Wang, G. Pig Sound Analysis: A Measure of Welfare. Smart Agric. 2022, 4, 19–35. [Google Scholar] [CrossRef]
- Wen, T.C.; Freire, C.F.; Benicio, L.M.; de Moura, G.B.; Amorim, M.D.; da Silva-Miranda, K.O. TinyML-Based Swine Vocalization Pattern Recognition for Enhancing Animal Welfare in Embedded Systems. Inventions 2025, 10, 52. [Google Scholar] [CrossRef]
- Kaur, J.; Hazrati Fard, S.M.; Amiri-Zarandi, M.; Dara, R. Protecting Farmers’ Data Privacy and Confidentiality: Recommendations and Considerations. Front. Sustain. Food Syst. 2022, 6, 903230. [Google Scholar] [CrossRef]
- Darras, K.F.A.; Deppe, F.; Fabian, Y.; Kartono, A.P.; Angulo, A.; Kolbrek, B.; Mulyani, Y.A.; Prawiradilaga, D.M. High Microphone Signal-to-Noise Ratio Enhances Acoustic Sampling of Wildlife. PeerJ 2020, 8, e9955. [Google Scholar] [CrossRef] [PubMed]
- van Erp-van der Kooij, E.; de Graaf, L.F.; de Kruijff, D.A.; Pellegrom, D.; de Rooij, R.; Welters, N.I.T.; van Poppel, J. Using Sound Location to Monitor Farrowing in Sows. Animals 2023, 13, 3538. [Google Scholar] [CrossRef]
- Heath, B.E.; Suzuki, R.; Le Penru, N.P.; Skinner, J.; Orme, C.D.L.; Ewers, R.M.; Sethi, S.S.; Picinali, L. Spatial Ecosystem Monitoring with a Multichannel Acoustic Autonomous Recording Unit (MAARU). Methods Ecol. Evol. 2024, 15, 1568–1579. [Google Scholar] [CrossRef]
- Mennill, D.J.; Battiston, M.; Wilson, D.R.; Foote, J.R.; Doucet, S.M. Field Test of an Affordable, Portable, Wireless Microphone Array for Spatial Monitoring of Animal Ecology and Behaviour. Methods Ecol. Evol. 2012, 3, 704–712. [Google Scholar] [CrossRef]
- Rhinehart, T.A.; Chronister, L.M.; Devlin, T.; Kitzes, J. Acoustic Localization of Terrestrial Wildlife: Current Practices and Future Opportunities. Ecol. Evol. 2020, 10, 6794–6818. [Google Scholar] [CrossRef]
- Waidmann, E.N.; Yang, V.H.Y.; Luo, E.; Doyle, W.C.; Jarvis, E.D. Mountable Miniature Microphones to Identify and Assign Mouse Ultrasonic Vocalizations. Cell Rep. Methods 2025, 5, 101081. [Google Scholar] [CrossRef]
- Gómez, Y.; Stygar, A.H.; Boumans, I.J.M.M.; Bokkers, E.A.M.; Pedersen, L.J.; Niemi, J.K.; Pastell, M.; Manteca, X.; Llonch, P. A Systematic Review on Validated Precision Livestock Farming Technologies for Pig Production and Its Potential to Assess Animal Welfare. Front. Vet. Sci. 2021, 8, 660565. [Google Scholar] [CrossRef] [PubMed]
- Gill, L.F.; D’Amelio, P.B.; Adreani, N.M.; Sagunsky, H.; Gahr, M.C.; ter Maat, A. A Minimum-Impact, Flexible Tool to Study Vocal Communication of Small Animals with Precise Individual-Level Resolution. Methods Ecol. Evol. 2016, 7, 1349–1358. [Google Scholar] [CrossRef]
- Garrido, L.F.C.; Rodrigues, G.S.T.; Costa, L.B.; Kurtz, D.J.; Daros, R.R. Validation of a Swine Cough Monitoring System Under Field Conditions. Agriengineering 2025, 7, 140. [Google Scholar] [CrossRef]
- Lagua, E.B.; Mun, H.-S.; Ampode, K.M.B.; Chem, V.; Kim, Y.-H.; Yang, C.-J. Artificial Intelligence for Automatic Monitoring of Respiratory Health Conditions in Smart Swine Farming. Animals 2023, 13, 1860. [Google Scholar] [CrossRef]
- Eddicks, M.; Feicht, F.; Beckjunker, J.; Genzow, M.; Alonso, C.; Reese, S.; Ritzmann, M.; Stadler, J. Monitoring of Respiratory Disease Patterns in a Multimicrobially Infected Pig Population Using Artificial Intelligence and Aggregate Samples. Viruses 2024, 16, 1575. [Google Scholar] [CrossRef]
- Zhang, S.; Tian, J.; Banerjee, A.; Li, J. An Efficient Porcine Acoustic Signal Denoising Technique Based on EEMD-ICA-WTD. Math. Probl. Eng. 2019, 2019, 2858740. [Google Scholar] [CrossRef]
- Cao, Y.; Yin, Z.; Duan, Y.; Cao, R.; Hu, G.; Liu, Z. Research on Improved Sound Recognition Model for Oestrus Detection in Sows. Comput. Electron. Agric. 2025, 231, 109975. [Google Scholar] [CrossRef]
- Barnhill, A.; Noeth, E.; Maier, A.; Bergler, C. ANIMAL-CLEAN—A Deep Denoising Toolkit for Animal-Independent Signal Enhancement. In Proceedings of the Interspeech 2024, Kos, Greece, 1–5 September 2024; pp. 632–636. [Google Scholar]
- Daneshyari, R.D. Deep Learning Approaches for Acoustic Animal Classification. Int. J. Comput. Artif. Intell. 2024, 5, 199–204. [Google Scholar] [CrossRef]
- Wei, S.; Zou, S.; Liao, F. A Comparison on Data Augmentation Methods Based on Deep Learning for Audio Classification. J. Phys. Conf. Ser. 2020, 1453, 012085. [Google Scholar] [CrossRef]
- Nicholson, D. Crowsetta: A Python Tool to Work with Any Format for Annotating Animal Vocalizations and Bioacoustics Data. J. Open Source Softw. 2023, 8, 5338. [Google Scholar] [CrossRef]
- Gibbons, A.; Donohue, I.; Gorman, C.; King, E.; Parnell, A. NEAL: An Open-Source Tool for Audio Annotation. PeerJ 2023, 11, e15913. [Google Scholar] [CrossRef]
- Colligan, T.; Irish, K.; Emlen, D.J.; Wheeler, T.J. DISCO: A Deep Learning Ensemble for Uncertainty-Aware Segmentation of Acoustic Signals. PLoS ONE 2023, 18, e0288172. [Google Scholar] [CrossRef]
- Darras, K.F.A.; Pérez, N.; Dilong, L.; Hanf-Dressler, T.; Markolf, M.; Wanger, T.C.; Cord, A.F. ecoSound-Web: An Open-Source, Online Platform for Ecoacoustics. F1000Research 2024, 9, 1224. [Google Scholar] [CrossRef] [PubMed]
- Ntalampiras, S.; Ludovico, L.A.; Presti, G.; Vena, M.V.; Fantini, D.; Ogel, T.; Celozzi, S.; Battini, M.; Mattiello, S. An Integrated System for the Acoustic Monitoring of Goat Farms. Ecol. Inform. 2023, 75, 102043. [Google Scholar] [CrossRef]
- Pandeya, Y.R.; Bhattarai, B.; Afzaal, U.; Kim, J.-B.; Lee, J. A Monophonic Cow Sound Annotation Tool Using a Semi-Automatic Method on Audio/Video Data. Livest. Sci. 2022, 256, 104811. [Google Scholar] [CrossRef]
- Martínez Balvanera, S.; Mac Aodha, O.; Weldy, M.J.; Pringle, H.; Browning, E.; Jones, K.E. Whombat: An Open-Source Audio Annotation Tool for Machine Learning Assisted Bioacoustics. Methods Ecol. Evol. 2025, 16, 19–28. [Google Scholar] [CrossRef]
- Manikandan, V.; Neethirajan, S. AI-Powered Vocalization Analysis in Poultry: Systematic Review of Health, Behavior, and Welfare Monitoring. Sensors 2025, 25, 4058. [Google Scholar] [CrossRef]
- Teterja, D.; Garcia-Rodriguez, J.; Azorin-Lopez, J.; Sebastian-Gonzalez, E.; Nedić, D.; Leković, D.; Knežević, P.; Drajić, D.; Vukobratović, D. A Video Mosaicing-Based Sensing Method for Chicken Behavior Recognition on Edge Computing Devices. Sensors 2024, 24, 3409. [Google Scholar] [CrossRef] [PubMed]
- Semenov, S.; Karlov, D.; Solecki, M.; Ruban, I.; Kovalenko, A.; Piskarov, O. Integrated Model for Intelligent Monitoring and Diagnostics of Animal Health Based on IoT Technology for the Digital Farm. Sustainability 2025, 17, 8507. [Google Scholar] [CrossRef]
- Taneja, M.; Byabazaire, J.; Jalodia, N.; Davy, A.; Olariu, C.; Malone, P. Machine Learning Based Fog Computing Assisted Data-Driven Approach for Early Lameness Detection in Dairy Cattle. Comput. Electron. Agric. 2020, 171, 105286. [Google Scholar] [CrossRef]
- Goyal, V.; Yadav, A.; Kumar, S.; Mukherjee, R. Lightweight LAE for Anomaly Detection With Sound-Based Architecture in Smart Poultry Farm. IEEE Internet Things J. 2024, 11, 8199–8209. [Google Scholar] [CrossRef]
- Kim, J.; Suh, Y.; Lee, J.; Chae, H.; Ahn, H.; Chung, Y.; Park, D. EmbeddedPigCount: Pig Counting with Video Object Detection and Tracking on an Embedded Board. Sensors 2022, 22, 2689. [Google Scholar] [CrossRef] [PubMed]
- Ngo, H.Q.T.; Nguyen, T.P.; Nguyen, H. Research on a Low-Cost, Open-Source, and Remote Monitoring Data Collector to Predict Livestock’s Habits Based on Location and Auditory Information: A Case Study from Vietnam. Agriculture 2020, 10, 180. [Google Scholar] [CrossRef]
- Alzuhair, A.; Alghaihab, A. The Design and Optimization of an Acoustic and Ambient Sensing AIoT Platform for Agricultural Applications. Sensors 2023, 23, 6262. [Google Scholar] [CrossRef] [PubMed]
- Song, H.; Zhao, B.; Hu, J.; Sun, H.; Zhou, Z. Research on Improved DenseNets Pig Cough Sound Recognition Model Based on SENets. Electronics 2022, 11, 3562. [Google Scholar] [CrossRef]
- Shi, Y.-M. An Improved Machine Learning Model for Pig Abnormal Voice Recognition. J. Comput. 2022, 33, 155–166. [Google Scholar] [CrossRef]
- Du, C.; Gong, M.; Du, W.; Xie, J.; Gao, Z. Research and Application of End-to-End Pig Speech Recognition Model. In Proceedings of the International Conference on Signal Processing and Communication Technology (SPCT 2022), Harbin, China, 23–25 December 2022; SPIE: Bellingham, WA, USA, 2022; Volume 12615, pp. 413–417. [Google Scholar]
- Wang, B.; Duan, W.; Liu, N.; Qi, J. A CNN-SVM Study Based on the Fusion of Spectrogram and Thermal Imaging Heterogeneous Features for Pig Cough Recognition in Field Situation. Int. J. Patt. Recogn. Artif. Intell. 2023, 37, 2354021. [Google Scholar] [CrossRef]
- Zhang, S.; Tian, J.; Banerjee, A.; Li, J. Automatic Recognition of Porcine Abnormalities Based on a Sound Detection and Recognition System. Trans. Asabe 2019, 62, 1755–1765. [Google Scholar] [CrossRef]
- Yin, Y.; Tu, D.; Shen, W.; Bao, J. Recognition of Sick Pig Cough Sounds Based on Convolutional Neural Network in Field Situations. Inf. Process. Agric. 2021, 8, 369–379. [Google Scholar] [CrossRef]
- Wang, Y.; Li, S.; Zhang, H.; Liu, T. A Lightweight CNN-Based Model for Early Warning in Sow Oestrus Sound Monitoring. Ecol. Inform. 2022, 72, 101863. [Google Scholar] [CrossRef]
- Peng, S.; Zha, W.; Chen, C.; Tang, X.; Wu, G.; Gu, L.; Jiao, J. Pig Audio Recognition Based on Deep Neural Network (DNN) and Hidden Markov Models (HMM). SSRN Electron. J. 2022; Preprint. Available online: https://ssrn.com/abstract=4312454 (accessed on 10 December 2025). [CrossRef]
- Wang, B.; Miao, X.; Jiang, M.; Yin, C.; Pu, C. Application of Deep Learning Algorithm Based on Neural Network. In Proceedings of the 2022 IEEE Conference on Telecommunications, Optics and Computer Science (TOCS), Dalian, China, 11–12 December 2022; pp. 780–783. [Google Scholar]
- Wang, B.; Qi, J.; An, X.; Wang, Y. Heterogeneous Fusion of Biometric and Deep Physiological Features for Accurate Porcine Cough Recognition. PLoS ONE 2024, 19, e0297655. [Google Scholar] [CrossRef]
- Shen, W.; Tu, D.; Yin, Y.; Bao, J. A New Fusion Feature Based on Convolutional Neural Network for Pig Cough Recognition in Field Situations. Inf. Process. Agric. 2021, 8, 573–580. [Google Scholar] [CrossRef]
- Ji, N.; Shen, W.; Yin, Y.; Bao, J.; Dai, B.; Hou, H.; Kou, S.; Zhao, Y. Investigation of Acoustic and Visual Features for Pig Cough Classification. Biosyst. Eng. 2022, 219, 281–293. [Google Scholar] [CrossRef]
- Tsalera, E.; Papadakis, A.; Samarakou, M. Comparison of Pre-Trained CNNs for Audio Classification Using Transfer Learning. J. Sens. Actuator Netw. 2021, 10, 72. [Google Scholar] [CrossRef]
- de Carvalho Soster, P.; Grzywalski, T.; Hou, Y.; Thomas, P.; Dedeurwaerder, A.; De Gussem, M.; Tuyttens, F.; Devos, P.; Botteldooren, D.; Antonissen, G. Automated Detection of Broiler Vocalizations a Machine Learning Approach for Broiler Chicken Vocalization Monitoring. Poult. Sci. 2025, 104, 104962. [Google Scholar] [CrossRef]
- Khochare, J.; Joshi, C.; Yenarkar, B.; Suratkar, S.; Kazi, F. A Deep Learning Framework for Audio Deepfake Detection. Arab. J. Sci. Eng. 2022, 47, 3447–3458. [Google Scholar] [CrossRef]
- Min, K.-J.; Lee, H.-J.; Hwang, H.; Lee, S.; Lee, K.; Moon, S.-H.; Lee, J.; Lee, J. A Study on Classification of Pig Sounds Based on Supervised Learning. J. Inst. Electr. Eng. 2021, 70, 805–822. [Google Scholar] [CrossRef]
- Nanni, L.; Maguolo, G.; Paci, M. Data Augmentation Approaches for Improving Animal Audio Classification. Ecol. Inform. 2020, 57, 101084. [Google Scholar] [CrossRef]
- Shankar, R.; Nalla, N.R.; Muthulakshmi, M. Real-Time Pig Vocal Analysis for Early Respiratory Infection Study. In Proceedings of the 2023 IEEE 20th India Council International Conference (INDICON), Hyderabad, India, 14–17 December 2023; pp. 630–635. [Google Scholar]
- Nanni, L.; Costa, Y.M.G.; Aguiar, R.L.; Mangolin, R.B.; Brahnam, S.; Silla, C.N. Ensemble of Convolutional Neural Networks to Improve Animal Audio Classification. J. Audio Speech Music Proc. 2020, 2020, 8. [Google Scholar] [CrossRef]
- Marx, G.; Horn, T.; Thielebein, J.; Knubel, B.; von Borell, E. Analysis of Pain-Related Vocalization in Young Pigs. J. Sound Vib. 2003, 266, 687–698. [Google Scholar] [CrossRef]
- Gutierrez, W.M.; Kim, S.; Kim, D.H.; Yeon, S.C.; Chang, H.H. Classification of Porcine Wasting Diseases Using Sound Analysis. Asian-Australas. J. Anim. Sci. 2010, 23, 1096–1104. [Google Scholar] [CrossRef]
- Guarino, M.; Jans, P.; Costa, A.; Aerts, J.-M.; Berckmans, D. Field Test of Algorithm for Automatic Cough Detection in Pig Houses. Comput. Electron. Agric. 2008, 62, 22–28. [Google Scholar] [CrossRef]
- Olson, M.J.; Creamer, M.; Horback, K.M. Identification of Specific Call Types Produced by Pre-Weaning Gilts in Response to Isolation. Appl. Anim. Behav. Sci. 2021, 234, 105203. [Google Scholar] [CrossRef]
- Da Silva Cordeiro, A.F.; De Alencar Nääs, I.; Oliveira, S.R.M.; Violaro, F.; De Almeida, A.C.M.; Neves, D.P. Understanding Vocalization Might Help to Assess Stressful Conditions in Piglets. Animals 2013, 3, 923–934. [Google Scholar] [CrossRef]
- Chae, H.; Lee, J.; Kim, J.; Lee, S.; Lee, J.; Chung, Y.; Park, D. Novel Method for Detecting Coughing Pigs with Audio-Visual Multimodality for Smart Agriculture Monitoring. Sensors 2024, 24, 7232. [Google Scholar] [CrossRef]
- Ranmal, D.; Ranasinghe, P.; Paranayapa, T.; Meedeniya, D.; Perera, C. ESC-NAS: Environment Sound Classification Using Hardware-Aware Neural Architecture Search for the Edge. Sensors 2024, 24, 3749. [Google Scholar] [CrossRef]
- Bibbó, G.; Singh, A.; Plumbley, M.D. Environmental Sound Classification on an Embedded Hardware Platform. In INTER-NOISE and NOISE-CON Congress and Conference Proceedings INTER-NOISE24, Nantes, France; Institute of Noise Control Engineering: Wakefield, MA, USA, 2024; Available online: https://ince.publisher.ingentaconnect.com/content/ince/incecp/2024/00000270/00000005/art00041 (accessed on 16 December 2025).
- Mou, A.; Milanova, M. Performance Analysis of Deep Learning Model-Compression Techniques for Audio Classification on Edge Devices. Sci 2024, 6, 21. [Google Scholar] [CrossRef]
- Mohaimenuzzaman, M.; Bergmeir, C.; West, I.; Meyer, B. Environmental Sound Classification on the Edge: A Pipeline for Deep Acoustic Networks on Extremely Resource-Constrained Devices. Pattern Recognit. 2023, 133, 109025. [Google Scholar] [CrossRef]
- Speckhard, D.T.; Misiunas, K.; Perel, S.; Zhu, T.; Carlile, S.; Slaney, M. Neural Architecture Search for Energy-Efficient Always-on Audio Machine Learning. Neural Comput. Appl. 2023, 35, 12133–12144. [Google Scholar] [CrossRef]
- Srinivasagan, R.; Sayed, M.S.E.; Al-Rasheed, M.I.; Alzahrani, A.S. Edge Intelligence for Poultry Welfare: Utilizing Tiny Machine Learning Neural Network Processors for Vocalization Analysis. PLoS ONE 2025, 20, e0316920. [Google Scholar] [CrossRef] [PubMed]
- Rajesh, A.; Ananthasubramanian, P.; Raghavan, N.; Kumar, A. Memristive Nanowire Network for Energy Efficient Audio Classification: Pre-Processing-Free Reservoir Computing with Reduced Latency. arXiv 2025, arXiv:2411.19611. [Google Scholar]
- Hassan, E.; Elbedwehy, S.; Shams, M.Y.; Abd El-Hafeez, T.; El-Rashidy, N. Optimizing Poultry Audio Signal Classification with Deep Learning and Burn Layer Fusion. J. Big Data 2024, 11, 135. [Google Scholar] [CrossRef]
- Upadhyaya, S.; Buyens, W.; Vranken, E.; Desmet, W.; Karsmakers, P. Assessment of Data Augmentation and Transfer Learning for Making PIG Cough Classifier Robust to Changing Farm Conditions. In Proceedings of the 2023 International Conference on Machine Learning and Applications (ICMLA), Jacksonville, FL, USA, 15–17 December 2023; pp. 952–957. [Google Scholar]
- Chu, H.-C.; Zhang, Y.-L.; Chiang, H.-C. A CNN Sound Classification Mechanism Using Data Augmentation. Sensors 2023, 23, 6972. [Google Scholar] [CrossRef]
- Jung, D.-H.; Kim, N.Y.; Moon, S.H.; Jhin, C.; Kim, H.-J.; Yang, J.-S.; Kim, H.S.; Lee, T.S.; Lee, J.Y.; Park, S.H. Deep Learning-Based Cattle Vocal Classification Model and Real-Time Livestock Monitoring System with Noise Filtering. Animals 2021, 11, 357. [Google Scholar] [CrossRef]
| Microphone Type | Description & Use Case | Models/Systems | Ref. |
|---|---|---|---|
| Unidirectional (Cardioid) | Captures sound mainly from one direction, reducing background noise. Used for targeted monitoring (e.g., above pens). | Audio-Technica M260C (Audio-Technica Corp., Tokyo, Japan); Morbo M66 (Morbo Microphones, Bologna, Italy) * | [10] |
| Omnidirectional (Electret Condenser) | Captures sound from all directions, suitable for general ambient monitoring. | Panasonic WM-61A (Panasonic Corp., Osaka, Japan); PUI Audio electret microphones (PUI Audio Inc., Dayton, OH, USA) | [10,69] |
| MEMS Microphones | Micro-Electro-Mechanical Systems; small, robust, and suitable for integration in sensor networks. | InvenSense ICS-40720 (InvenSense Inc., San Jose, CA, USA); SPU0410LR5H-QB (Knowles Electronics, Itasca, IL, USA) | [69] |
| Microphone Arrays | Multiple microphones arranged spatially for sound localization and source separation. | Sorama L642V sound camera (Sorama B.V., Eindhoven, Netherlands); custom research-built arrays (various academic institutions) | [10,70,71,72] |
| Autonomous Recording Units (ARUs) | Standalone, weatherproof devices for long-term, remote monitoring. | Song Meter SM2 (Wildlife Acoustics Inc., Maynard, MA, USA); MAARU ARU (research prototype, academic development) | [71,72,73] |
| Wearable/Mountable Microphones | Attached to animals for individual vocalization tracking (mainly in research). | Custom miniature microphones (research prototypes; various locations) | [74] |
| Digital Recorders/Camcorders | Used for synchronized audio-video monitoring in some studies. | Sony ICD-UX560F (Sony Corp., Tokyo, Japan); JVC GR-DVL520A (JVCKENWOOD Corp., Yokohama, Japan) | [10] |
| Product Name/Developer | Product Type | Specifications | Limitations | Ref. |
|---|---|---|---|---|
| PecSmart® (Pecuária Smart S/A, Florianópolis, Santa Catarina, Brazil) | Integrated hardware + analytics service |
|
| [77] |
| SoundTalks® (SoundTalks NV, Leuven, Flemish Region, Belgium) | Hardware platform with cloud-based service |
|
| [78,79] |
| Pig Cough Monitor (Fancom B.V., Panningen, Limburg, Netherlands) | Hardware-based monitoring system |
|
| [10] |
| Device/Platform | Description & Use Case | Models/Systems | Ref. |
|---|---|---|---|
| Embedded AI Boards | Run real-time sound processing and machine learning models locally. | NVIDIA Jetson TX2, Jetson Nano (NVIDIA Corporation, Santa Clara, CA, USA) | [47,97] |
| Microcontroller Platforms | Low-power, cost-effective boards for basic sound acquisition and preprocessing. | ESP32-WROOM (Shanghai, China), Arduino (Zurich, Canton of Zurich, Switzerland), Raspberry Pi (Cambridge, England, UK) | [67,96,98] |
| TinyML Devices | Specialized for running lightweight ML models on resource-constrained hardware. | Edge Impulse-enabled MCUs (San Jose, CA, USA) | [67] |
| Custom Sensor Boards | Integrated boards with microphones, accelerometers, and wireless modules for wearables or environmental monitoring. | Ear tag sensor boards, multiparameter sensor boards (Various academic/research institutions) | [6,98] |
| Commercial Audio Sensors | Standalone or networked microphones with onboard processing for continuous monitoring. | PILLAR CM-5010Pro (Suzhou, Jiangsu Province, China), SoundTalks® (Leuven, Flemish Region, Belgium) | [10,11,31] |
| AIoT Sensor Nodes | Combine acoustic sensing, wireless communication, and edge ML for distributed monitoring. | Custom AIoT platforms | [99] |
| Claim | Feature Type | Application Task | Model Type | Reasoning | Ref. |
|---|---|---|---|---|---|
| Fusion of acoustic and deep features improves cough recognition | Acoustic + Deep (CQT, STFT, CNN) | Pig cough detection | SVM (with feature fusion) | Combining time-frequency and deep features yields robust, high-accuracy classification | [28] |
| MFCC and Mel spectrograms enhance oestrus detection | MFCC, Mel spectrograms, DWT | Sow oestrus stage classification | MobileViT (lightweight CNN) | MFCC and Mel spectrograms capture key vocal cues, DWT denoises, enabling efficient, accurate detection | [81] |
| Hybrid deep features and feature selection yield high cough accuracy | MFCC, spectral, proprietary | Pig cough detection | CNN + RNN hybrid | Feature selection and hybrid deep learning models achieve high precision and recall in field conditions | [77] |
| Spectral + speech features outperform single features | Spectrogram, time-domain, speech | Pig sound state classification | Parallel CNN + RNN + SVM | Dual input leverages both spectral and temporal cues, boosting accuracy for multiple pig states | [40] |
| Multi-feature fusion (acoustic + visual) outperforms single features | RMS, MFCC, ZCR, LBP, HOG, CQT | Pig cough detection | SVM, RF, KNN | Fusing acoustic and visual features (from spectrograms) increases recognition rates over single domains | [111] |
| Mixed-MMCT (MFCC, Mel, Chroma, Tonnetz) boosts vocalization detection | MFCC, Mel-spectrogram, Chroma, Tonnetz | Vocalization vs. non-vocalization | Deep CNN | Integrating multiple feature types and data augmentation improves generalization and robustness | [49] |
| MFCC-CNN fusion outperforms MFCC alone for cough recognition | MFCC + CNN-derived features | Pig cough detection | CNN, SVM | Fusing MFCC with CNN features increases F1-score and accuracy, especially with optimal frame selection | [110] |
| DNN-HMM with MFCCs excels in continuous cough recognition | MFCC | Continuous cough detection | DNN-HMM | DNN-HMM models with MFCCs reduce word error rate and outperform GMM-HMM in continuous sound environments | [26] |
| Heterogeneous fusion (acoustic + thermal) achieves highest accuracy | Acoustic + Deep thermal (images) | Pig cough detection | SVM (with feature fusion) | Combining sound and thermal features provides robust, multi-modal representation for cough recognition | [109] |
| Rule-based features (formant, power, duration) discern pig screams | Formant, power, frequency, duration | Pig scream detection | Rule-based classifier | Physically meaningful features enable explicit, interpretable scream detection in noisy environments | [38] |
| Sensors Name | Sensor Number | Number of Pigs Used | Age of Pigs Used | Research Duration | Dataset Number | Objectives | Findings | Ref. |
|---|---|---|---|---|---|---|---|---|
| Microphone (Sennheiser ME66, Wedemark, Lower Saxony, Germany) | 1 | 411, from 5 previous studies | Birth to slaughter | Not specified (multi-study aggregation) | >38,000 calls (Dataset S1) | Classify pig calls by emotional valence/context | Neural network outperformed pDFA; robust across contexts and ages | [21] |
| PLM-Q5 noise reduction microphone, Raspberry Pi 4 (Cambridge, England, UK) | 1 per pen | ~36 (12 per pen, 3 farms) | Not specified | 24 h continuous per farm | 3 datasets × 4000 files | Classify vocalization vs. non-vocalization | Mixed-MMCT feature extraction improved robustness and accuracy | [49] |
| Acoustic test analyzer (BK 2270-S-C, 4189 mic, Darmstadt, Hesse, Germany) | 1 | 189 | Fattening stage | 1 month | Not specified | Classify grunting, squealing, coughing | Multi-feature fusion improved recognition of vocal types | [24] |
| Directional microphones (Various) | 1 | 24 | Piglets | Not specified | Not specified | Classify agonistic/social vocalizations | TinyML feasible for real-time embedded monitoring | [67] |
| Microphones (SmartMic, PecSmart, Florianópolis, Santa Catarina, Brazil) | 1 per pen | 256 (16 per pen, 16 pens) | Growing–finishing | 6 days | 1110 coughs, 8938 other sounds | Detect coughs in field conditions | High performance for cough detection in commercial barns | [77] |
| Microphone (external iTalk-02) | 1 | 10 | Adult Landrace | 10 h | Not specified | Classify eating, estrus, howling, humming, panting | DNN-HMM outperformed HMM, GMM-HMM, SVM, ResNet18 | [107] |
| Microphone (TCD-D8, SONY, Tokyo, Japan) | 2 | 70 | Young pigs | Not specified | 4537 calls | Classify pain-related vocalizations | Screams indicate pain; automatic classification feasible | [119] |
| Microphone (JVC GR-DVL520A, Yokohama, Kanagawa, Japan) | 1 | 36 | 25–30 kg | 30 min per pig | Not specified | Detect wasting diseases via cough | Acoustic differences in coughs by disease; early detection possible | [120] |
| Microphone (U.S. Blaster condenser) | 1 | 44 | 150 days | Not specified | Not specified | Detect coughs in field | Feasible for field cough detection | [121] |
| Microphone | 1 | 40 | Pre-weaning gilts | 5 min isolation | 14,000+ vocalizations | Identify call types, relate to arousal | Seven call types identified; call type linked to behavior | [122] |
| Microphone (Yoga®, Taipei City, Taiwan) | 1 | 40 | 22 weeks | Not specified | Not specified | Classify stress (pain, cold, hunger) | Vocalization effective for stress detection | [123] |
| Extraction Technique | Classification Technique | Accuracy | Precision | Recall | F1 | References |
|---|---|---|---|---|---|---|
| Acoustic features (MFCC, spectral, temporal) | Neural network, pDFA | 91.5% (NN, valence), 81.5% (NN, context) | Not specified | Not specified | Not specified | [21] |
| MFCC, Mel-spectrogram, Chroma, Tonnetz, Mixed-MMCT | Deep CNN (DCNN) | 99.5–99.7% (per farm), 95.67% (cross-farm) | 96.25% | 95.68% | 95.96% | [49] |
| STE, FC, FF, MFCC, PCA | BP neural network (GA optimized) | 93.2% | 92.9% | 92.8% | Not specified | [24] |
| Not specified | CNN (TinyML, Edge Impulse) | >90% | Not specified | Not specified | Not specified | [67] |
| 34 audio features (MFCC, others) | Hybrid CNN-RNN | 99.6% | 98.8% | 98.6% | 98.6% | [77] |
| MFCC (39-dim), Kalman filter, EMD-TEO | DNN-HMM | 83% | Not specified | Not specified | Not specified | [107] |
| Multiparametric call analysis | Discriminant analysis | 94.6% (call type classification) | Not specified | Not specified | Not specified | [119] |
| MFCC, labeling | SVDD, SRC | 94% (detection), 91% (classification) | Not specified | Not specified | Not specified | [120] |
| Filter bank, amplitude demodulation | Dynamic time warping | 85.5% (cough), 86.6% (other) | Not specified | Not specified | Not specified | [121] |
| Acoustic/spectral features | Manual + statistical classification | Not specified | Not specified | Not specified | Not specified | [122] |
| Acoustic features | J48 decision tree | 81.1% | Not specified | Not specified | Not specified | [123] |
| Device Used | Objectives | Methodology | Results | Ref. |
|---|---|---|---|---|
| 16 microphones (SmartMic, PecSmart, Florianópolis, Santa Catarina, Brazil) | Detect pig coughs in commercial farm (field validation) | Hybrid deep learning (CNN + RNN); 34 audio features; 10-fold cross-validation; feature selection | High accuracy: 99.6%, recall: 98.6%, F1: 98.6%. Efficient for on-farm cough monitoring. | [77] |
| Raspberry Pi 4 Model B (Cambridge, England, UK), PLM-Q5 microphone | Classify pig vocalization vs. non-vocalization | Deep CNN; Mixed-MMCT feature extraction (MFCC, Mel-spectrogram, Chroma, Tonnetz); data augmentation | Accuracy: 99.5–99.7% (intra-farm); 95.7% (cross-farm). Robust to new data. | [49] |
| Audio Spectrogram Transformer (unspecified HW) | Detect abnormal pig vocalizations for welfare monitoring | Audio segmentation; AST model with attention; feature selection; interpretability analysis | Accuracy: 93%; inference speed 19× faster than CNNs; improved efficiency and scalability. | [52] |
| UM-ASPP-MobileViT (edge-optimized) | Detect sow oestrus via vocalization | MobileViT-based model; DWT denoising; MFCC and Mel spectrogram features; annotated oestrus dataset | Precision: 96.5%, F1: 96.5%; only 1.44 GFLOPs; fast, accurate, and efficient for real-time. | [81] |
| Lenovo B610 recorder (Beijing, China) | Classify pig vocalizations (sows) | Dual input: spectrogram + time-domain features; parallel network (CNN + custom classifier) | Accuracy: 93.4%, AUC: 0.99; robust in noisy, multi-pig environments. | [40] |
| NanoPc-T4 (Shenzhen, Guangdong, China), iTalk-02 microphone | Pig sound recognition (behavioral states) | DNN-HMM model; Kalman filtering; MFCC features; empirical mode decomposition for endpoint detection | Accuracy: 83% (custom), 79% (AudioSet); outperforms SVM, ResNet18, GMM-HMM. | [30] |
| BK 2270-S-C analyzer, 4189 microphone (Darmstadt, Hesse, Germany) | Classify pig grunting, squealing, coughing | Multi-feature fusion (STE, MFCC, formant, etc.); BP neural network; principal component analysis | Avg. accuracy: 93.2%; precision: 87.9–98.1%; recall: 87.4–99.1%. | [24] |
| Digital camcorder, PC soundcard | Detect pig wasting diseases via cough | MFCC extraction; SVDD for anomaly detection; SRC for disease classification | Detection: 94%, classification: 91%; works with low-cost microphones. | [31] |
| TransformerCNN (unspecified HW) | Classify domestic pig sounds (behavior/emotion) | Parallel CNN + Transformer; multiple audio features; open-access dataset | Accuracy: 96.1%, AUC: 98.4%, recall: 90.5%; robust and generalizable. | [41] |
| Audio-visual system (multimodal, unspecified) | Detect coughing pigs and localize individuals | Audio cough detection, video pig detection, multimodal fusion; real farm data; spectrum preprocessing | Detection accuracy: 95%; robust in noisy, real-world farm conditions. | [124] |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Sharifuzzaman, M.; Mun, H.-S.; Lagua, E.B.; Hasan, M.K.; Kang, J.-G.; Kim, Y.-H.; Mehtab, A.; Park, H.-R.; Yang, C.-J. Advances in Audio Classification and Artificial Intelligence for Respiratory Health and Welfare Monitoring in Swine. Biology 2026, 15, 177. https://doi.org/10.3390/biology15020177
Sharifuzzaman M, Mun H-S, Lagua EB, Hasan MK, Kang J-G, Kim Y-H, Mehtab A, Park H-R, Yang C-J. Advances in Audio Classification and Artificial Intelligence for Respiratory Health and Welfare Monitoring in Swine. Biology. 2026; 15(2):177. https://doi.org/10.3390/biology15020177
Chicago/Turabian StyleSharifuzzaman, Md, Hong-Seok Mun, Eddiemar B. Lagua, Md Kamrul Hasan, Jin-Gu Kang, Young-Hwa Kim, Ahsan Mehtab, Hae-Rang Park, and Chul-Ju Yang. 2026. "Advances in Audio Classification and Artificial Intelligence for Respiratory Health and Welfare Monitoring in Swine" Biology 15, no. 2: 177. https://doi.org/10.3390/biology15020177
APA StyleSharifuzzaman, M., Mun, H.-S., Lagua, E. B., Hasan, M. K., Kang, J.-G., Kim, Y.-H., Mehtab, A., Park, H.-R., & Yang, C.-J. (2026). Advances in Audio Classification and Artificial Intelligence for Respiratory Health and Welfare Monitoring in Swine. Biology, 15(2), 177. https://doi.org/10.3390/biology15020177

