Chikitsak: Medical Chatbot Using BioBert and GPT-2 Model

Khan, Md. Suhail; Bhattacharjya, Kamalika

Md. Suhail Khan and Kamalika Bhattacharjya
Additional contact information
Md. Suhail Khan: Maulana Abul Kalam Azad University of Technology, West Bengal, Department of Information Technology
Kamalika Bhattacharjya: Maulana Abul Kalam Azad University of Technology, West Bengal, Department of Information Technology

A chapter in AI in Smart and Secure Healthcare, 2026, pp 403-419 from Springer

Abstract: Real-time patient guidance is vital in healthcare. This paper presents Chikitsak, an AI-enabled medical chatbot combining BioBERT for semantic retrieval and GPT-2 for fluent, context-aware response generation. The system uses fine-tuned BioBERT embeddings with FAISS indexing and a tag-aware negative sampling strategy, achieving 92% retrieval accuracy. Trained on 25,000 annotated doctor–patient interactions (70% from North American/European sources), it achieves BLEU-4 = 0.42, ROUGE-L = 0.51, and BERTScore-F1 = 0.73. Dataset bias is acknowledged: potential impacts on minority populations are analyzed, and mitigation strategies such as multilingual expansion, bias audits, and rare-condition oversampling are proposed. Medical experts rated 85% of responses as clinically appropriate and 72% as sufficiently detailed, surpassing baseline GPT-2 by 15%. The hallucination rate is reduced to 5%, outperforming ClinicalBERT (10%) and baseline GPT-2 (12%). Low-confidence outputs revert to ranked Q&A references for safety. Ethical considerations, including patient data privacy, explainability, and regulatory compliance (GDPR/HIPAA), are addressed. Limitations include reliance on simulated evaluation and the absence of real-world usability testing, which is planned for future work. The open-source design will be shared for reproducibility, with future improvements targeting advanced LLM integration, knowledge graphs, multilingual capability, and clinician-in-the-loop learning.
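
Illustrative sketch (not the authors' implementation): the abstract describes embedding-based retrieval followed by generation, with low-confidence outputs reverting to ranked Q&A references. The Python below shows one way such a fallback could be wired. Several things here are assumptions: plain cosine similarity over toy vectors stands in for BioBERT embeddings and a FAISS index, the top retrieval score is used as the confidence signal (the abstract does not say how confidence is measured), and the names `retrieve`, `respond`, `generate`, and the 0.8 threshold are all hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query_vec, index, k=3):
    """Return the k best (score, text) pairs from index, a list of (vector, text)."""
    scored = [(cosine(query_vec, vec), text) for vec, text in index]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]


def respond(query_vec, index, generate, threshold=0.8, k=3):
    """Generate an answer when retrieval is confident; otherwise return ranked references.

    Assumption: the top similarity score is the confidence measure.
    `generate` is a hypothetical callable standing in for a language model.
    """
    ranked = retrieve(query_vec, index, k)
    context = [text for _, text in ranked]
    if not ranked or ranked[0][0] < threshold:
        # Low confidence: skip generation and surface the ranked Q&A entries.
        return {"mode": "references", "items": context}
    return {"mode": "generated", "text": generate(context)}


if __name__ == "__main__":
    toy_index = [([1.0, 0.0], "Q&A entry A"), ([0.0, 1.0], "Q&A entry B")]
    echo = lambda ctx: "answer grounded in: " + ctx[0]
    print(respond([1.0, 0.0], toy_index, echo))  # confident match
    print(respond([1.0, 1.0], toy_index, echo))  # ambiguous match
```

In this sketch a query that closely matches one stored entry is passed to the generator, while a query that sits between entries falls below the threshold and returns the ranked entries unchanged.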

Keywords: BioBERT; GPT-2; BLEU-4; ROUGE-L; BERTScore; Hallucination rate
Date: 2026

Persistent link: https://EconPapers.repec.org/RePEc:spr:spochp:978-3-032-15092-9_16

Ordering information: This item can be ordered from
http://www.springer.com/9783032150929

DOI: 10.1007/978-3-032-15092-9_16

Series: Springer Optimization and Its Applications, Springer
Bibliographic data for series maintained by Sonal Shukla and Springer Nature Abstracting and Indexing.