
Model design in data science: engineering design to uncover design processes and anomalies

Antoine Bordas, Pascal Le Masson and Benoit Weil
Additional contact information
Antoine Bordas: CGS i3 - Centre de Gestion Scientifique i3 - Mines Paris - PSL (École nationale supérieure des mines de Paris) - PSL - Université Paris Sciences et Lettres - I3 - Institut interdisciplinaire de l’innovation - CNRS - Centre National de la Recherche Scientifique
Pascal Le Masson: CGS i3 - Centre de Gestion Scientifique i3 - Mines Paris - PSL (École nationale supérieure des mines de Paris) - PSL - Université Paris Sciences et Lettres - I3 - Institut interdisciplinaire de l’innovation - CNRS - Centre National de la Recherche Scientifique
Benoit Weil: CGS i3 - Centre de Gestion Scientifique i3 - Mines Paris - PSL (École nationale supérieure des mines de Paris) - PSL - Université Paris Sciences et Lettres - I3 - Institut interdisciplinaire de l’innovation - CNRS - Centre National de la Recherche Scientifique

Post-Print from HAL

Abstract: In the current data-rich environment, valorizing data has become a common task in data science; it requires designing a statistical model that transforms input data into a desirable output. The data science literature on the design of new models is abundant, while in parallel other streams of literature, such as the epistemology of science, have shown the relevance of anomalies in model design processes. Anomalies are to be understood as unexpected observations in data, a historical example being the famous anomalous precession of Mercury's perihelion. This paper therefore addresses the various design processes in data science and their relationships to anomalies. To do so, we conceptualize what designing a data science model means, and we derive three design processes based on the latest theories in engineering design. This allows us to formulate assumptions regarding the relationship between each design process and anomalies, which we test with several case studies. Notably, three processes for the design of models in data science are identified and, for each of them, the following information is provided: (1) the knowledge leveraged and generated and (2) the specific relations with anomalies. From a theoretical standpoint, this work is one of the first applications of design methods in data science. It paves the way for more research at the intersection of engineering design and data science, which could enrich both fields.

Date: 2024-10-27

Published in Research in Engineering Design, 2024, 36 (1), pp.1. ⟨10.1007/s00163-024-00442-w⟩



Persistent link: https://EconPapers.repec.org/RePEc:hal:journl:hal-04790948

DOI: 10.1007/s00163-024-00442-w


 
Page updated 2025-03-19
Handle: RePEc:hal:journl:hal-04790948