EconPapers    
Economics at your fingertips  
 

Outcome Weighted Learning in Dynamic Treatment Regimes

Shinto Eguchi () and Osamu Komori ()
Additional contact information
Shinto Eguchi: Institute of Statistical Mathematic
Osamu Komori: Seikei University

Chapter Chapter 8 in Minimum Divergence Methods in Statistical Machine Learning, 2022, pp 197-216 from Springer

Abstract: Abstract This chapter discusses applications of information geometry in a paradigm of reinforcement learningReinforcement learning with emphasis on dynamic treatment regimesDynamic treatment regimes which have progressed recently in the learning algorithm with outcome weighed learning. The probabilistic framework for a triple expressing state, action, and reward is formulated in multiple stages, in which a decision functionDecision function defined by a state and action is estimated to make an optimal policy for a given dataset. Decision consistency for a decision functionDecision function is introduced by the state-value function in the space of all the decision functionsDecision function. We introduce the $$\varPsi $$ Ψ -divergence on the decision functionDecision function space with a generator function $$\varPsi $$ Ψ , and investigate statistical properties for the $$\varPsi $$ Ψ -loss function $$\varPsi $$ Ψ -loss function conducted by $$\varPsi $$ Ψ -divergence. An outcome-weighed learning algorithm for the decision functionDecision function is considered in a boosting approach in association with the prediction in supervised learning.

Date: 2022
References: Add references at CitEc
Citations:

There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:sprchp:978-4-431-56922-0_8

Ordering information: This item can be ordered from
http://www.springer.com/9784431569220

DOI: 10.1007/978-4-431-56922-0_8

Access Statistics for this chapter

More chapters in Springer Books from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2026-06-01
Handle: RePEc:spr:sprchp:978-4-431-56922-0_8