Recently, with the development of improved data acquisition systems such as sensors and fast communication systems, various types of data that could not be collected before can be obtained. The results of analysis using the collected data can provide ...
Recently, with the development of improved data acquisition systems such as sensors and fast communication systems, various types of data that could not be collected before can be obtained. The results of analysis using the collected data can provide real-time information on specific trends or changes in quality characteristics and process variables, leading to reduced process monitoring and controlling time and quality cost. However, errors may appear during the data collection process due to such problems as noise or process environment. Utilization of these error-contained data can waste resources and render analysis results useless.
This study aims to provide guidelines for appropriate pre-processing of manufacturing data. The data pre-processing approach is in the order of data integration, missing value processing, outlier processing, feature engineering, variable transformation, and data imbalance processing. The proposed data pre-processing approach is compared to six other data pre-processing methods, and shows its advantage with respect to the F1 score. The approach proposed in this study is expected to be helpful to quality practitioners who want to extract insight by applying statistical process control and machine learning methods with manufacturing quality data.