Subsampling under distributional constraints
Florian Combes,
Ricardo Fraiman and
Badih Ghattas
Additional contact information
Florian Combes: AMSE - Aix-Marseille Sciences Economiques - EHESS - École des hautes études en sciences sociales - AMU - Aix Marseille Université - ECM - École Centrale de Marseille - CNRS - Centre National de la Recherche Scientifique, AMU - Aix Marseille Université
Ricardo Fraiman: CMAT - Centro de Matemática [Montevideo] - UDELAR - Universidad de la República [Montevideo]
Badih Ghattas: AMSE - Aix-Marseille Sciences Economiques - EHESS - École des hautes études en sciences sociales - AMU - Aix Marseille Université - ECM - École Centrale de Marseille - CNRS - Centre National de la Recherche Scientifique, AMU - Aix Marseille Université
Post-Print from HAL
Abstract:
Abstract Some complex models are frequently employed to describe physical and mechanical phenomena. In this setting, we have an input in a general space, and an output where is a very complicated function, whose computational cost for every new input is very high, and may be also very expensive. We are given two sets of observations of , and of different sizes such that only is available. We tackle the problem of selecting a subset of smaller size on which to run the complex model , and such that the empirical distribution of is close to that of . We suggest three algorithms to solve this problem and show their efficiency using simulated datasets and the Airfoil self‐noise data set.
Date: 2024-02-09
Note: View the original document on HAL open archive server: https://hal.science/hal-04742977v1
References: Add references at CitEc
Citations:
Published in Statistical Analysis and Data Mining, 2024, 17 (1), ⟨10.1002/sam.11661⟩
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:hal:journl:hal-04742977
DOI: 10.1002/sam.11661
Access Statistics for this paper
More papers in Post-Print from HAL
Bibliographic data for series maintained by CCSD ().