Unlike zebus, taurine cattle have the natural ability to resist trypanosomosis, a parasitic disease endemic to the humid areas of West Africa. However, repeated crossbreeding between zebus and taurine cattle is jeopardizing the genetic heritage of the Taurines and their ability to resist trypanosomosis. To strengthen protection and conservation efforts, it is essential to accurately distinguish purebred taurines from crossbreds. In this study, five Machine Learning models were built using morphological data collected from 1968 cattle. These models were trained to determine whether a given individual is purebred taurine or not. The classifiers yielded promising results. The random forest model and RBF Kernel SVM performed the best with up to 86% and 85% accuracy respectively. Moreover, the study of the correlation coefficients and the feature importance scores allowed us to define the most discriminating morphological traits.
References
[1]
Rish, I., et al. (2001) An Empirical Study of the Naive Bayes Classifier. IJCAI 2001 Workshop on Empirical Methods in Artificial Intelligence, Seattle, 4-6 August 2001, 41-46.
[2]
Wanga, H.P., Ghani, N. and Kalegele, K. (2015) Designing a Machine Learning-Based Framework for Enhancing Performance of Livestock Mobile Application System. American Journal of Software Engineering and Applications, 4, 56. https://doi.org/10.11648/j.ajsea.20150403.13
[3]
Olasehinde, O. (2021) Infrared Thermography and Machine Learning in Livestock Production. International Journal of Advanced Research and Review, 6, 38-57.
[4]
Libbrecht, M.W. and Noble, W.S. (2015) Machine Learning Applications in Genetics and Genomics. Nature Reviews Genetics, 16, 321-332. https://doi.org/10.1038/nrg3920
[5]
Ouedraogo, D., Ouedraogo-Kone, S., Yougbare, B., et al. (2021) Population Structure, Inbreeding and Admixture in Local Cattle Populations Managed by Community-Based Breeding Programs in Burkina Faso. Journal of Animal Breeding and Genetics, 138, 379-388. https://doi.org/10.1111/jbg.12529
[6]
Yougbare, B., Soudre, A., Ouedraogo, D., et al. (2021) Genome-Wide Association Study of Trypanosome Prevalence and Morphometric Traits in Purebred and Crossbred Baoulé Cattle of Burkina Faso. PLOS ONE, 16, e0255089. https://doi.org/10.1371/journal.pone.0255089
[7]
Dodo, K., Pandey, V.S. and Illiassou, M.S. (2001) Utilisation de labarymetrie pour l’estimation du poids chez le zebu Azawak au Niger. Revue d’élevage et de médecine vétérinaire des pays tropicaux, 54, 63-68. https://doi.org/10.19182/remvt.9808
[8]
Rudenko, O., Megel, Y., Bezsonov, O., et al. (2020) Cattle Breed Identification and Live Weight Evaluation on the Basis of Machine Learning and Computer Vision. Proceedings of the Third International Workshop on Computer Modeling and Intelligent Systems (CMIS-2020), Zaporizhzhia, 27 April-1 May, 2020, 939-954. https://doi.org/10.32782/cmis/2608-70
[9]
Raduly, Z., Sulyok, C., et al. (2018) Dog Breed Identification Using Deep Learning. IEEE 16th International Symposium on Intelligent Systems and Informatics (SISY), Subotica, 13-15 September 2018, 271-276. https://doi.org/10.1109/SISY.2018.8524715
[10]
Kumar, R., Sharma, M., Dhawale, K., et al. (2019) Identification of Dog Breeds Using Deep Learning. 2019 IEEE 9th International Conference on Advanced Computing (IACC), Tiruchirappalli, 13-14 December 2019, 193-198. https://doi.org/10.1109/IACC48062.2019.8971604
[11]
Xu, Z.T., Diao, S.Q., Teng, J.Y., et al. (2021) Breed Identification of Meat Using Machine Learning and Breed Tag SNPs. Food Control, 125, Article ID: 107971. https://doi.org/10.1016/j.foodcont.2021.107971
[12]
Mahesh, B. (2020) Machine Learning Algorithms—A Review. International Journal of Science and Research (IJSR), 9, 381-386.
[13]
Grus, J. (2015) Data Science from Scratch: First Principles with Python. O’Reilly, Sebastopol.
[14]
Bradley, A.P. (1997) The Use of the Area under the ROC Curve in the Evaluation of Machine Learning Algorithms. Pattern Recognition, 30, 1145-1159. https://doi.org/10.1016/S0031-3203(96)00142-2
[15]
Rokach, L. and Maimon, O. (2009) Classification Trees. In: Data Mining and Knowledge Discovery Handbook, Springer, Berlin, 149-174. https://doi.org/10.1007/978-0-387-09823-4_9
[16]
Breiman, L. (2001) Random Forests. Machine Learning, 45, 5-32. https://doi.org/10.1023/A:1010933404324
[17]
Raschka, S. and Mirjalili, V. (2019) Python Machine Learning: Machine Learning and Deep Learning with Python, Scikit-Learn and TensorFlow 2. 3rd Edition, Packt Publishing, Birmingham.
[18]
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M. and Duchesnay, E. (2011) Scikit-Learn: Machine Learning in Python. Journal of Machine Learning Research, 12, 2825-2830.
[19]
Huang, K.X., Xiao, C., Glass, L.M., et al. (2021) Machine Learning Applications for Therapeutic Tasks with Genomics Data. Patterns, 2, Article ID: 100328.
[20]
Hawkins, D.M., Basak, S.C. and Mills, D. (2003) Assessing Model Fit by Cross-Va- lidation. The Journal for Chemical Information and Computer Scientists, 43, 579-586. https://doi.org/10.1021/ci025626i