Online Multi-task Learning with Hard Constraints
Gabor Lugosi (),
Omiros Papaspiliopoulos () and
Gilles Stoltz
Additional contact information
Gabor Lugosi: ICREA - Institució Catalana de Recerca i Estudis Avançats = Catalan Institution for Research and Advanced Studies
Omiros Papaspiliopoulos: ICREA - Institució Catalana de Recerca i Estudis Avançats = Catalan Institution for Research and Advanced Studies
Working Papers from HAL
Abstract:
We discuss multi-task online learning when a decision maker has to deal simultaneously with M tasks. The tasks are related, which is modeled by imposing that the M-tuple of actions taken by the decision maker needs to satisfy certain constraints. We give natural examples of such restrictions and then discuss a general class of tractable constraints, for which we introduce computationally efficient ways of selecting actions, essentially by reducing to an on-line shortest path problem. We briefly discuss ``tracking'' and ``bandit'' versions of the problem and extend the model in various ways, including non-additive global losses and uncountably infinite sets of tasks.
Date: 2009-02-13
Note: View the original document on HAL open archive server: https://hal.science/hal-00362643v2
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://hal.science/hal-00362643v2/document (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:hal:wpaper:hal-00362643
Access Statistics for this paper
More papers in Working Papers from HAL
Bibliographic data for series maintained by CCSD ().