A curated dataset and lightweight deep learning framework for tea leaf disease classification
Sakibul Hasan Chowdhury,
Md Shohel Arman,
Masrafe Bin Hannan Siam,
Md Rayhan Khan,
Afia Hasan and
Parvez Ahmed Moju Fahim
PLOS ONE, 2026, vol. 21, issue 5, 1-36
Abstract:
Tea (Camellia sinensis) is the world’s second most consumed beverage, enjoyed daily by more than two billion people. In Bangladesh, it serves as a cornerstone agricultural export and a major sector of the domestic economy. However, commercial tea cultivation remains highly vulnerable to fungal and pest-related diseases such as Blight, Red Rust, and Helopeltis which severely reduce crop yield and compromise leaf quality. While early detection is critical to preventing widespread outbreaks, traditional manual inspection is slow, subjective, and highly error-prone. Deep learning provides a scalable alternative, yet single-branch networks often struggle to capture both minute disease lesions and broader structural degradation simultaneously. To address this, we propose a Hybrid Feature Fusion architecture that runs two highly efficient feature extractors in parallel: EfficientNetV2-Small to isolate fine-grained local textures, and MobileNetV3-Small to capture the global structural context of the leaf. The models were trained and evaluated on a real-world dataset of 2,000 annotated images, evenly distributed across the four target classes (Blight, Red Rust, Helopeltis, and Healthy). Before training, the images underwent a standardized preprocessing pipeline including resizing to 224 × 224 pixels and normalization, supplemented by a dynamic augmentation strategy featuring random rotations, horizontal flips, and brightness adjustments to improve model robustness. The proposed hybrid framework achieved an outstanding peak classification accuracy of 96.80% alongside a macro Area Under the Curve (AUC) of 0.9980. To rigorously validate its performance, the hybrid model was benchmarked against six diverse architectures: a Vision Transformer (ViT-B16 at 76.40%), a Custom CNN (89.60%), MobileNetV3 (94.40%), ResNet50 (95.60%), DenseNet121 (96.40%), and EfficientNetV2-B3 (97.60%). Although EfficientNetV2-B3 achieved a marginally higher raw accuracy, the proposed dual-branch framework delivered a superior precision-recall balance and faster convergence stability. These findings demonstrate that the proposed hybrid methodology is highly reliable and computationally balanced, making it an ideal candidate for integration into Internet of Things (IoT) edge devices for real-time disease monitoring in precision agriculture.
Date: 2026
References: Add references at CitEc
Citations:
Downloads: (external link)
https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0349210 (text/html)
https://journals.plos.org/plosone/article/file?id= ... 49210&type=printable (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:plo:pone00:0349210
DOI: 10.1371/journal.pone.0349210
Access Statistics for this article
More articles in PLOS ONE from Public Library of Science
Bibliographic data for series maintained by plosone ().