Classification of knowledge of fertility period among adolescent girls in East Africa from 2012 to 2022: Machine learning algorithm
Andualem Addisu Birlie,
Kassahun Dessie Gashu,
Mulugeta Desalegn Kasaye,
Ayana Alebachew Muluneh,
Abdulaziz Kebede Kassaw,
Hailemariam Kassahun Desalegn,
Tamir Wondim Desta and
Shimels Derso Kebede
PLOS Digital Health, 2026, vol. 5, issue 2, 1-22
Abstract:
Understanding the time of the menstrual cycle would help women to avoid getting pregnant without the need for surgical, hormonal, or mechanical contraception. Women who do not use contraception and do not know when they are fertile are at a higher risk (17%) of unplanned pregnancy and abortion. Classifying knowledge of fertility periods using machine learning algorithms would help to automate decision-making, produce more precise and accurate classification, and scale up to manage big and complex datasets. Therefore, this study aimed to classify knowledge of the fertility period among adolescent girls in East Africa from 2012 to 2022 using a machine-learning algorithm. A community-based cross-sectional study design was used from 12 East African countries’ DHS datasets spanning 2012–2022. The machine learning algorithms were applied to classify knowledge of the fertility period and identify its predictors using R software and Python, particularly Jupiter Notebook in Anaconda. Data cleaning, one-hot encoding, data splitting, data balancing, and ten-fold cross-validation were performed. Ten machine learning algorithms and SHAP were used to select and interpret the best model. From the 40,664 adolescent girls in East Africa, 13.22% (95% CI: 12.91, 13.54) of participants had knowledge of the fertility period. Logistic regression was found to be the best model for unbalanced training data with 74.38% of an AUC and 82.71% of an accuracy. While random forest outperformed on balanced training data, it achieved 91.12% of an AUC and 83.26% accuracy. The key determinant factors of the knowledge of the fertility period were education level, country, hearing about family planning, hearing about sexually transmitted infections, wealth index, knowledge of any method, and visiting health facilities. Governments, NGOs, policy makers, and researchers can utilize these findings to design targeted interventions for improving adolescents’ reproductive health based on the identified gaps and disparities.Author summary: The purpose of this study was to classify knowledge of the fertility period among adolescent girls in East Africa. Ten machine-learning models were trained, and the random forest model provided the most accurate prediction with an AUC score of 91.12% and an accuracy of 83.26%. Using the SHAP feature importance method, the top ten predictors were identified: education level, country, heard about family planning, heard about sexually transmitted infections, wealth index, knowledge of any method, and visited health facilities. Education level was a primary factor since girls with higher education levels were better prepared to understand reproductive health concepts. Similarly, increased exposure to hearing about family planning and general knowledge about STIs improved knowledge of fertility, again showing that a more extensive health communication strategy has contributed to girls’ foundational knowledge. It is important to note that the wealth index also has an effect on access to media, education, and health services. We saw that girls who demonstrated that they were aware of a contraceptive method or had ever visited a health facility were significantly more likely to understand their fertility period.
Date: 2026
References: Add references at CitEc
Citations:
Downloads: (external link)
https://journals.plos.org/digitalhealth/article?id=10.1371/journal.pdig.0001108 (text/html)
https://journals.plos.org/digitalhealth/article/fi ... 01108&type=printable (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:plo:pdig00:0001108
DOI: 10.1371/journal.pdig.0001108
Access Statistics for this article
More articles in PLOS Digital Health from Public Library of Science
Bibliographic data for series maintained by digitalhealth ().