Naïve Bayes
Frank Acito
Additional contact information
Frank Acito: Indiana University
Chapter Chapter 9 in Predictive Analytics with KNIME, 2023, pp 193-207 from Springer
Abstract:
Abstract This chapter introduces the Naïve Bayes algorithm, a predictive model based on Bayesian analysis. The chapter starts with a thought problem involving a breathalyzer used by a police department. It demonstrates how Bayes’ Theorem can be used to estimate the probability that a driver is over the legal alcohol limit based on breathalyzer results. The concept of Naïve Bayes is illustrated with a toy data set of 15 observations, demonstrating how probabilities are calculated using counting. The assumption of conditional independence is explained, and the algorithm’s applicability to categorical and continuous predictors is discussed. The problem of zero probabilities and the need for Laplace smoothing to avoid issues with small sample sizes are explored. Naïve Bayes in KNIME is applied to a real-world example of predicting heart disease detection and identifying spam emails, achieving 85% and 99% accuracy, respectively, with test data. Despite its strong assumption of independence among predictors, Naïve Bayes proves to be a practical and efficient algorithm for classification tasks, especially when dealing with a large number of predictors. While its probability estimates may not always be precise, the classification results are often reliable.
Date: 2023
References: Add references at CitEc
Citations:
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:sprchp:978-3-031-45630-5_9
Ordering information: This item can be ordered from
http://www.springer.com/9783031456305
DOI: 10.1007/978-3-031-45630-5_9
Access Statistics for this chapter
More chapters in Springer Books from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().