An Empirical Analysis of Hidden Markov Models with Momentum
Andrew Miller,
Fabio Di Troia and
Mark Stamp ()
Additional contact information
Andrew Miller: San Jose State University
Fabio Di Troia: San Jose State University
Mark Stamp: San Jose State University
A chapter in Machine Learning, Deep Learning and AI for Cybersecurity, 2025, pp 169-206 from Springer
Abstract:
Abstract Momentum is a technique that is widely used to improve convergence rates during gradient descent. In this research, we experiment with adding momentum to the Baum-Welch expectation-maximization algorithm for training Hidden Markov Models (HMM). We compare discrete HMMs trained with and without momentum on English text and malware opcode data. The effectiveness of momentum is determined by measuring the changes in model score and classification accuracy due to momentum, as a function of the Baum-Welch iteration. Our extensive experiments indicate that applying momentum to Baum-Welch can accelerate convergence, in the sense of reducing the number of iterations required for initial convergence, particularly in cases where the model is otherwise slow to converge. However, momentum does not seem to improve the final model performance in cases where a sufficiently large number of iterations are used.
Date: 2025
References: Add references at CitEc
Citations:
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:sprchp:978-3-031-83157-7_7
Ordering information: This item can be ordered from
http://www.springer.com/9783031831577
DOI: 10.1007/978-3-031-83157-7_7
Access Statistics for this chapter
More chapters in Springer Books from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().