Comparative Analysis of Machine-Learning and Deep-Learning Approaches for Accurate Animal Disease Prediction and Health Risk Assessment †
Abstract
1. Introduction
1.1. Major Contribution
- In this paper, we have developed a framework that integrates traditional ML with DL-based features to enhance animal health risk detection and disease prediction.
- Deep-learning models, along with multiple ML models, are used to effectively predict animal risk.
- The proposed model demonstrates improved performance across key evaluation metrics.
- In this paper, we emphasize macro- and weighted-average performance metrics to assess whether the model is robust to class imbalance.
- RF-based model performed well as compared to standalone deep-learning and conventional models across the different performance metrics.
1.2. Objective
1.3. Limitation
- In this paper, we used a single dataset, which is limited for our study.
- In our experimental work, we used only the structure data.
- This study does not utilize for real-time validation.
2. Literature Review
3. Study Design
3.1. Phase 1: Animal Health Dataset
3.2. Phase 2: Data Preprocessing
- (1)
- Handling missing values: Various statistical methods are used to deal with missing records.
- (2)
- Normalization: To ensure effective learning, all numerical attributes are brought to a uniform range using feature scaling.
- (3)
- Feature Encoding: Non-numeric features are encoded into numerical formats compatible with ML algorithms.
3.3. Phase 3: Feature Extraction
- (1)
- Handcrafted Features: Important features related to animal health prediction are manually extracted using domain knowledge. Then, features are classified using a random forest model that ensures better accuracy by combining multiple decision trees.
- (2)
- Deep Feature Extraction: At the same time, an ANN automatically learns features from data, which helps capture hidden patterns that are difficult to identify through manual feature extraction.
3.4. Phase 4: Ensemble Decision Layer
3.5. Phase 5: Final Classification
3.6. Phase 6: Output Prediction
3.7. Model Training
- Random Forest Classifier: It is an ensemble learning technique that builds multiple decision trees during training and assigns a final class label by majority voting. It makes the model robust to overfitting and suitable for nonlinear and high-dimensional data.
- Support Vector Machine: It is a supervised learning method that classifies data by constructing a maximum-margin hyperplane separating the classes. This characteristic makes SVMs suitable for small and medium-scale datasets with distinct class separations.
- Logistic Regression: It is a supervised linear classifier that estimates class probabilities through the logistic function. Its interpretability and efficiency make it a widely used baseline method.
- Gaussian Naïve Bayes: It is a Bayesian classifier that models feature distribution as Gaussian and assumes feature independence. Its low computational complexity makes it effective for small datasets.
- Decision Tree Classifier: The decision tree classifier constructs a tree-like structure by repeatedly dividing the data into subsets using feature-based criteria. Its transparent decision-making process makes it interpretable, though it can overfit when used independently.
- K-Nearest Neighbours: A distance-based classifier that assigns a class label based on the majority class among the k-nearest data points in the feature space.
3.8. Hybrid Deep-Learning–Machine-Learning Model
- Random Forest with CNN features: A CNN was first trained to automatically learn high-level feature representations from the input data. The extracted feature vectors from the CNN intermediate layer were then used as input to a random forest classifier for final classification.
- Random Forest with ANN features: An ANN was employed as a feature extractor. The learned latent feature from the ANN was passed to a random forest classifier to improve classification performance.
4. Result Analysis


5. Discussion
5.1. Statistical Test for Model Comparison
5.2. FRIEDMAN TEST—Multiple Classifier Comparison
5.3. ONE-WAY ANOVA (F-Test)
5.4. Simplified Pairwise Comparisons
5.5. EFFECT SIZE: Cohen’s f2
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
| RF | Random Forest Classifier |
| LoGR | Logistic Regression |
| ML | Machine Learning |
| DT | Decision Tree |
| KNN | K-Nearest Neighbour |
| CNN | Convolutional Neural Network |
| SVM | Support Vector Machine |
| DL | Deep Learning |
References
- Das, S.; Roy, R.K.; Bezboruah, T. Machine learning in animal healthcare: A comprehensive review. Int. J. Recent Eng. Sci. 2024, 11, 89–93. [Google Scholar] [CrossRef]
- Shah, H.J.; Sharma, C.; Joshi, C.; Engineer, Y.; Educare, S.K. Cattle medical diagnosis and prediction using machine learning. Int. Res. J. Eng. Technol. 2022, 9, 3867–3870. [Google Scholar]
- Evangelista, I.R.S.; Catajay, L.T.; Bandala, A.A.; Concepcion, R.S., II; Sybingco, E.; Dadios, E.P. Exploring deep learning for the detection of poultry activities—Towards an autonomous health and welfare monitoring in poultry farms. In Proceedings of the 2023 17th International Conference on Ubiquitous Information Management and Communication (IMCOM), Seoul, Republic of Korea, 3—5 January 2023; IEEE: New York, NY, USA, 2023; pp. 1–5. [Google Scholar]
- Hwang, S.; Shin, H.K.; Park, J.M.; Kwon, B.; Kang, M.G. Classification of dog skin diseases using deep learning with images captured from a multispectral imaging device. Mol. Cell. Toxicol. 2022, 18, 299–309. [Google Scholar] [CrossRef]
- Khan, Z.; Raj, P.; Kumar, M. Comparative Analysis: Machine Learning vs. Artificial Neural Networks for Animal Disease Prediction. In Proceedings of the 2024 6th International Conference on Computational Intelligence and Networks (CINE), Bhubaneswar, India, 19-21 December 2024; IEEE: New York, NY, USA, 2024; pp. 1–5. [Google Scholar]
- Bhardwaj, P.; Kumar, S.J.; Kanna, G.P.; Mithila, A. Machine Learning Based Approaches for Livestock Symptoms and Diseases Prediction and Classification. In Proceedings of the 2024 International Conference on Communication, Computer Sciences and Engineering (IC3SE), Gautam Buddha Nagar, India, 9—11 May 2024; IEEE: New York, NY, USA, 2024; pp. 1–6. [Google Scholar]
- Nayak, S.; Jena, L.; Palai, P.; Mishra, S.; Swain, M.K. Application of Machine Learning in the Analysis and Prediction of Animal Disease. In Sustainable Farming through Machine Learning; CRC Press: Boca Raton, FL, USA, 2024; pp. 207–219. [Google Scholar]
- Swapna, P.; Geetha, M.; Nuthana, B.; Rohan, R. Using AI and Machine Learning for Early Detection and Management of Cattle Diseases to Improve Livestock Health and Productivity. Int. Res. J. Educ. Technol. 2024, 6, 1815–1822. [Google Scholar]
- Nadar, A.; Sane, A.; Muga, G.; Masih, E.; Rukhande, S. Animal healthcare and farm animal disease prediction using machine learning. In Proceedings of the 2023 5th Biennial International Conference on Nascent Technologies in Engineering (ICNTE), Navi Mumbai, India, 20-21 January 2023; IEEE: New York, NY, USA, 2023; pp. 1–6. [Google Scholar]
- Nur, Z.M.; Hassan, A.H.; Mohamed, A.M.; Ibrahim, U.M.; Ali, M.A.; Siyad, A.A. Leveraging Supervised Learning Algorithms for Automated and Accurate Cattle Disease Diagnosis in Livestock Farming in Somalia. In Proceedings of the International Conference on Artificial Intelligence and Smart Energy, Coimbatore, India, 30-31 January 2025; Springer Nature: Cham, Switzerland, 2025; pp. 152–164. [Google Scholar]
- Lasser, J.; Matzhold, C.; Egger-Danner, C.; Fuerst-Waltl, B.; Steininger, F.; Wittek, T.; Klimek, P. Integrating diverse data sources to predict disease risk in dairy cattle—A machine learning approach. J. Anim. Sci. 2021, 99, skab294. [Google Scholar] [CrossRef] [PubMed]
- Antônio, W.H.; Da Silva, M.; Miani, R.S.; Souza, J.R. A proposal of an animal detection system using machine learning. Appl. Artif. Intell. 2019, 33, 1093–1106. [Google Scholar] [CrossRef]
- Zimpel, T.; Riekert, M.; Klein, A.; Hoffmann, C. Machine learning for predicting animal welfare risks in pig farming. Landtechnik 2021, 76, 24–35. [Google Scholar]
- Ezanno, P.; Picault, S.; Beaunée, G.; Bailly, X.; Muñoz, F.; Duboz, R.; Monod, H.; Guégan, J.-F. Research perspectives on animal health in the era of artificial intelligence. Vet. Res. 2021, 52, 40. [Google Scholar] [CrossRef] [PubMed]
- Valletta, J.J.; Torney, C.; Kings, M.; Thornton, A.; Madden, J. Applications of machine learning in animal behavior studies. Anim. Behav. 2017, 124, 203–220. [Google Scholar] [CrossRef]















| Model | Accuracy |
|---|---|
| RF | 0.957746 |
| SVM | 0.816901 |
| Logistic Regression | 0.943662 |
| Gaussian Naive Bayes | 0.880282 |
| Decision Tree Classifier | 0.901408 |
| K-Nearest Neighbors | 0.873239 |
| Random Forest (CNN Features) | 0.873239 |
| Random Forest (ANN Features) | 0.908451 |
| Model | Macro Avg Precision | Macro Avg Recall | Macro Avg F1-Score |
|---|---|---|---|
| RF | 0.937634 | 0.943548 | 0.934306 |
| SVM | 0.803125 | 0.808594 | 0.788963 |
| Logistic Regression | 0.914785 | 0.926075 | 0.915719 |
| Gaussian Naive Bayes | 0.844974 | 0.843034 | 0.831135 |
| Decision Tree Classifier | 0.841282 | 0.837179 | 0.82677 |
| K-Nearest Neighbors | 0.818229 | 0.822396 | 0.804849 |
| RF(CNN Features) | 0.837179 | 0.82094 | 0.813242 |
| RF (ANN Features) | 0.878348 | 0.88151 | 0.872842 |
| Model | Weighted | Weighted | Weighted |
|---|---|---|---|
| Avg Precision | Avg Recall | Avg F1-Score | |
| RF | 0.955634 | 0.957746 | 0.951464 |
| SVM | 0.828052 | 0.816901 | 0.805405 |
| LoGR | 0.935094 | 0.943662 | 0.934563 |
| GNB | 0.896714 | 0.880282 | 0.874209 |
| DTC | 0.917019 | 0.901408 | 0.895337 |
| K-NN | 0.881455 | 0.873239 | 0.861508 |
| Random Forest (CNN Features) | 0.911385 | 0.873239 | 0.875075 |
| Random Forest (ANN Features) | 0.926559 | 0.908451 | 0.908082 |
| Model | Accuracy | Macro F1 | Weighted F1 | Rank |
|---|---|---|---|---|
| Random Forest | 0.9577 | 0.9343 | 0.9515 | 1 |
| Logistic Regression | 0.9437 | 0.9157 | 0.9346 | 2 |
| RF–ANN Features | 0.9085 | 0.8728 | 0.9081 | 3 |
| Decision Tree | 0.9014 | 0.8268 | 0.8953 | 4 |
| Gaussian Naive Bayes | 0.8803 | 0.8311 | 0.8742 | 5 |
| K-Nearest Neighbors | 0.8732 | 0.8048 | 0.8615 | 6 |
| RF–CNN Features | 0.8732 | 0.8132 | 0.8751 | 6 |
| Support Vector Machine | 0.8169 | 0.789 | 0.8054 | 8 |
| Metric | Mean ± Standard Deviation |
|---|---|
| Accuracy | 0.8946 ± 0.0413 |
| Macro F1 | 0.8485 ± 0.0485 |
| Weighted F1 | 0.8882 ± 0.0434 |
| Comparison | Accuracy Difference | Practical Significance | Statistical Significance |
|---|---|---|---|
| RF vs. SVM | 0.1408 | Large | Significant |
| RF vs. Logistic Regression | 0.0141 | Small | Marginal |
| RF vs. Gaussian NB | 0.0775 | Large | Marginal |
| RF vs. Decision Tree | 0.0563 | Large | Marginal |
| RF vs. KNN | 0.0845 | Large | Marginal |
| RF vs. RF–CNN | 0.0845 | Large | Marginal |
| RF vs. RF–ANN | 0.0493 | Medium | Marginal |
| Performance Tier | Models | Accuracy Range |
|---|---|---|
| Excellent (>94%) | Random Forest, Logistic Regression | 0.9437–0.9577 |
| Good (90–94%) | RF–ANN Features, Decision Tree | 0.9014–0.9085 |
| Fair (85–90%) | Gaussian Naive Bayes | 0.8803 |
| Moderate (<85%) | KNN, RF–CNN Features, SVM | 0.8169–0.8732 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Panigrahy, B.; Subudhi, A.; Harichandan, T.; Padhy, N.; Panigrahi, R. Comparative Analysis of Machine-Learning and Deep-Learning Approaches for Accurate Animal Disease Prediction and Health Risk Assessment. Eng. Proc. 2026, 124, 52. https://doi.org/10.3390/engproc2026124052
Panigrahy B, Subudhi A, Harichandan T, Padhy N, Panigrahi R. Comparative Analysis of Machine-Learning and Deep-Learning Approaches for Accurate Animal Disease Prediction and Health Risk Assessment. Engineering Proceedings. 2026; 124(1):52. https://doi.org/10.3390/engproc2026124052
Chicago/Turabian StylePanigrahy, Bhagyashree, Akhil Subudhi, Tanushree Harichandan, Neelamadhab Padhy, and Rasmita Panigrahi. 2026. "Comparative Analysis of Machine-Learning and Deep-Learning Approaches for Accurate Animal Disease Prediction and Health Risk Assessment" Engineering Proceedings 124, no. 1: 52. https://doi.org/10.3390/engproc2026124052
APA StylePanigrahy, B., Subudhi, A., Harichandan, T., Padhy, N., & Panigrahi, R. (2026). Comparative Analysis of Machine-Learning and Deep-Learning Approaches for Accurate Animal Disease Prediction and Health Risk Assessment. Engineering Proceedings, 124(1), 52. https://doi.org/10.3390/engproc2026124052

