EconPapers    
Economics at your fingertips  
 

Real-manufacturing-oriented big data analysis and data value evaluation with domain knowledge

Weichang Kong (), Fei Qiao () and Qidi Wu ()
Additional contact information
Weichang Kong: Tongji University
Fei Qiao: Tongji University
Qidi Wu: Tongji University

Computational Statistics, 2020, vol. 35, issue 2, No 5, 515-538

Abstract: Abstract As one of the most popular topics currently, big data has played an important role in both academic research and practical applications. However, in the manufacturing industry, it is difficult to make full use of the research results for production optimization and/or management due to the low quality of real workshop data. Typical quality problems of real workshop data include the information match degree, missing recessive data, and false error identification. The conventional data analysis methods cannot handle most such issues because these methods fail to consider professional insights into and domain knowledge about the data. The main motivation of this paper is to explore methods for analyzing and evaluating big data with domain knowledge. For this purpose, real production data from a semiconductor manufacturing workshop are adopted as the data object. First, a series of data analysis techniques with domain knowledge are developed for diagnosing the imperfections. Then, corresponding data processing techniques with domain knowledge are proposed for solving those data quality problems according to specific flaws in the data. Furthermore, this paper proposes quantitative calculation methods of data value density to determine the extent to which data quality can be improved by the proposed data processing techniques. Case studies are conducted to demonstrate that data analysis and processing techniques with domain knowledge can effectively handle data quality problems of real workshop data in terms of the information match degree, missing recessive data, and false error identification. The work in this paper has the potential to be further extended and applied to other big data applications beyond the manufacturing industry.

Keywords: Manufacturing big data; Data quality; Data value density; Data’s professional insights (search for similar items in EconPapers)
Date: 2020
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)

Downloads: (external link)
http://link.springer.com/10.1007/s00180-019-00919-6 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:compst:v:35:y:2020:i:2:d:10.1007_s00180-019-00919-6

Ordering information: This journal article can be ordered from
http://www.springer.com/statistics/journal/180/PS2

DOI: 10.1007/s00180-019-00919-6

Access Statistics for this article

Computational Statistics is currently edited by Wataru Sakamoto, Ricardo Cao and Jürgen Symanzik

More articles in Computational Statistics from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-03-20
Handle: RePEc:spr:compst:v:35:y:2020:i:2:d:10.1007_s00180-019-00919-6