Real-Time Cattle Pose Estimation Based on Improved RTMPose

Li, Xiaowu; Sun, Kun; Fan, Hongbo; He, Zihan

Real-Time Cattle Pose Estimation Based on Improved RTMPose

Xiaowu Li, Kun Sun, Hongbo Fan () and Zihan He
Additional contact information
Xiaowu Li: Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China
Kun Sun: Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China
Hongbo Fan: Faculty of Modern Agricultural Engineering, Kunming University of Science and Technology, Kunming 650300, China
Zihan He: Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China

Agriculture, 2023, vol. 13, issue 10, 1-18

Abstract: Accurate cattle pose estimation is essential for Precision Livestock Farming (PLF). Computer vision-based, non-contact cattle pose estimation technology can be applied for behaviour recognition and lameness detection. Existing methods still face challenges in achieving fast cattle pose estimation in complex scenarios. In this work, we introduce the FasterNest Block and Depth Block to enhance the performance of cattle pose estimation based on the RTMPose model. First, the accuracy of cattle pose estimation relies on the capture of high-level image features. The FasterNest Block, with its three-branch structure, effectively utilizes high-level feature map information, significantly improving accuracy without a significant decrease in inference speed. Second, large kernel convolutions can increase the computation cost of the model. Therefore, the Depth Block adopts a method based on depthwise separable convolutions to replace large kernel convolutions. This addresses the insensitivity to semantic information while reducing the model’s parameter. Additionally, the SimAM module enhances the model’s spatial learning capabilities without introducing extra parameters. We conducted tests on various datasets, including our collected complex scene dataset (cattle dataset) and the AP-10K public dataset. The results demonstrate that our model achieves the best average accuracy with the lowest model parameters and computational requirements, achieving 82.9% on the cattle test set and 72.0% on the AP-10K test set. Furthermore, in conjunction with the object detection model RTMDet-m, our model reaches a remarkable inference speed of 39FPS on an NVIDIA GTX 2080Ti GPU using the PyTorch framework, making it the fastest among all models. This work provides adequate technical support for fast and accurate cattle pose estimation in complex farm environments.

Keywords: cattle pose estimation; RTMPose; FasterNest Block; SimAM attention; Depth Block (search for similar items in EconPapers)
JEL-codes: Q1 Q10 Q11 Q12 Q13 Q14 Q15 Q16 Q17 Q18 (search for similar items in EconPapers)
Date: 2023
References: View complete reference list from CitEc
Citations: View citations in EconPapers (1)

Downloads: (external link)
https://www.mdpi.com/2077-0472/13/10/1938/pdf (application/pdf)
https://www.mdpi.com/2077-0472/13/10/1938/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jagris:v:13:y:2023:i:10:p:1938-:d:1253476

Access Statistics for this article

Agriculture is currently edited by Ms. Leda Xuan

More articles in Agriculture from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().