Outcome Weighted Learning in Dynamic Treatment Regimes
Shinto Eguchi () and
Osamu Komori ()
Additional contact information
Shinto Eguchi: Institute of Statistical Mathematic
Osamu Komori: Seikei University
Chapter Chapter 8 in Minimum Divergence Methods in Statistical Machine Learning, 2022, pp 197-216 from Springer
Abstract:
Abstract This chapter discusses applications of information geometry in a paradigm of reinforcement learningReinforcement learning with emphasis on dynamic treatment regimesDynamic treatment regimes which have progressed recently in the learning algorithm with outcome weighed learning. The probabilistic framework for a triple expressing state, action, and reward is formulated in multiple stages, in which a decision functionDecision function defined by a state and action is estimated to make an optimal policy for a given dataset. Decision consistency for a decision functionDecision function is introduced by the state-value function in the space of all the decision functionsDecision function. We introduce the $$\varPsi $$ Ψ -divergence on the decision functionDecision function space with a generator function $$\varPsi $$ Ψ , and investigate statistical properties for the $$\varPsi $$ Ψ -loss function $$\varPsi $$ Ψ -loss function conducted by $$\varPsi $$ Ψ -divergence. An outcome-weighed learning algorithm for the decision functionDecision function is considered in a boosting approach in association with the prediction in supervised learning.
Date: 2022
References: Add references at CitEc
Citations:
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:sprchp:978-4-431-56922-0_8
Ordering information: This item can be ordered from
http://www.springer.com/9784431569220
DOI: 10.1007/978-4-431-56922-0_8
Access Statistics for this chapter
More chapters in Springer Books from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().