TY - GEN
T1 - Evaluating Tabular Data Generation Techniques on the DaFne Platform
T2 - 9th International Congress on Information and Communication Technology, ICICT 2024
AU - Baddam, Pramod
AU - Glass, Ayse
AU - Jäkel, René
AU - Jander, Jonathan
AU - Krause, Tom
AU - Kunert, Pamela
AU - Noennig, Jörg
AU - Okhrin, Iryna
AU - Sanchez, Mariela
AU - Steffens, Ulrike
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.
PY - 2024/8/2
Y1 - 2024/8/2
N2 - In the realm of artificial intelligence (AI) and machine learning (ML), the scarcity of robust and diverse datasets often poses a significant challenge, prompting the need for effective data generation methods. This paper presents an evaluation of tabular data generation techniques on the DaFne platform, centered around a predictive maintenance case study for bridges. The DaFne platform offers a variety of tabular data generation functionalities, including rule-based creation, data fusion (with weather data), and data reproduction. We investigate the utility of these functionalities across different machine learning models for the prediction of bridge conditions. Our analysis includes a descriptive statistical comparison of real and synthetic data. Additionally, we explore the utility of original, weather, and synthetic datasets. We do this through the lens of ML models like MLR, XGBoost, CNN, and GRU, performing a predictive maintenance algorithm on these datasets. Our results indicate that while the inclusion of weather data did not significantly enhance predictive performance, the synthetic dataset shows satisfactory quality. However, the synthetic data’s performance is lower than the original data in predictive maintenance tasks, with differences observed in models heavily reliant on sequential data. This research underscores the potential of the DaFne platform in generating high-quality synthetic data. It also highlights areas for future improvement and offers valuable insights for advancing data generation and analysis techniques in predictive maintenance and other AI applications.
AB - In the realm of artificial intelligence (AI) and machine learning (ML), the scarcity of robust and diverse datasets often poses a significant challenge, prompting the need for effective data generation methods. This paper presents an evaluation of tabular data generation techniques on the DaFne platform, centered around a predictive maintenance case study for bridges. The DaFne platform offers a variety of tabular data generation functionalities, including rule-based creation, data fusion (with weather data), and data reproduction. We investigate the utility of these functionalities across different machine learning models for the prediction of bridge conditions. Our analysis includes a descriptive statistical comparison of real and synthetic data. Additionally, we explore the utility of original, weather, and synthetic datasets. We do this through the lens of ML models like MLR, XGBoost, CNN, and GRU, performing a predictive maintenance algorithm on these datasets. Our results indicate that while the inclusion of weather data did not significantly enhance predictive performance, the synthetic dataset shows satisfactory quality. However, the synthetic data’s performance is lower than the original data in predictive maintenance tasks, with differences observed in models heavily reliant on sequential data. This research underscores the potential of the DaFne platform in generating high-quality synthetic data. It also highlights areas for future improvement and offers valuable insights for advancing data generation and analysis techniques in predictive maintenance and other AI applications.
KW - Bridge maintenance
KW - Evaluation
KW - Machine learning
KW - Prediction
KW - Synthetic data
KW - Tabular data generation
U2 - 10.1007/978-981-97-3289-0_49
DO - 10.1007/978-981-97-3289-0_49
M3 - Conference Paper
AN - SCOPUS:85201068794
SN - 9789819732883
T3 - Lecture Notes in Networks and Systems
SP - 611
EP - 628
BT - Proceedings of 9th International Congress on Information and Communication Technology - ICICT 2024
A2 - Yang, Xin-She
A2 - Sherratt, Simon
A2 - Dey, Nilanjan
A2 - Joshi, Amit
Y2 - 19 February 2024 through 22 February 2024
ER -