 

Least Squares Monte Carlo and Pathwise Optimization for Merchant Energy Production

Bo Yang, Selvaprabu Nadarajah and Nicola Secomandi
Additional contact information
Bo Yang: Tepper School of Business, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213
Selvaprabu Nadarajah: College of Business, University of Illinois at Chicago, Chicago, Illinois 60607
Nicola Secomandi: Tepper School of Business, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213; Jones Graduate School of Business, Houston, Texas 77005

Operations Research, 2024, vol. 72, issue 6, 2758-2775

Abstract: We study merchant energy production modeled as a compound switching and timing option. The resulting Markov decision process is intractable. Least squares Monte Carlo combined with information relaxation and duality is a state-of-the-art reinforcement learning methodology to obtain operating policies and optimality gaps for related models. Pathwise optimization is a competing technique developed for optimal stopping settings, in which it typically provides superior results compared with this approach, albeit with a larger computational effort. We apply these procedures to merchant energy production. Using pathwise optimization requires methodological extensions. We use principal component analysis and block coordinate descent in novel ways to respectively precondition and solve the ensuing ill-conditioned and large-scale linear program, which even a cutting-edge commercial solver is unable to handle directly. Both techniques yield near optimal operating policies on realistic ethanol production instances. However, at the cost of both considerably longer run times and greater memory usage, which limits the number of stages of the instances that it can handle, pathwise optimization leads to substantially tighter dual bounds compared with least squares Monte Carlo, even when specified in a simple fashion, complementing it in this case. Thus, it plays a critical role in obtaining small optimality gaps. Our numerical observations on the magnitudes of these bound improvements differ from what is currently known. This research has potential relevance for other commodity merchant operations contexts and motivates additional algorithmic work in the area of pathwise optimization.

Keywords: Decision Analysis; block coordinate descent; information relaxation and duality; least squares Monte Carlo; Markov decision processes; merchant energy operations; pathwise optimization; principal component analysis; real options; reinforcement learning
Date: 2024

Downloads: http://dx.doi.org/10.1287/opre.2018.0341 (application/pdf)


Persistent link: https://EconPapers.repec.org/RePEc:inm:oropre:v:72:y:2024:i:6:p:2758-2775

Bibliographic data for series maintained by Chris Asher.

 
Page updated 2025-03-19
Handle: RePEc:inm:oropre:v:72:y:2024:i:6:p:2758-2775