This study investigates the application of GPT-2, a transformer-based model, for fault diagnosis in reciprocating air compressors, highlighting its ability to capture complex patterns in acoustic signals. Two approaches are compared: the first involves extracting time, frequency, and complexity-based features and classifying them using traditional machine learning models, with a narrow neural network achieving the best performance. The second approach reformulates these features as sequential data for GPT-2, which, through meticulous hyperparameter optimization, delivered superior diagnostic accuracy. Additionally, SHapley Additive exPlanations analysis was employed to enhance model interpretability by identifying the most influential features, providing valuable insights into the fault diagnosis process. While GPT-2 demonstrated notable performance gains over conventional models, it required a more precise hyperparameter tuning. This study offers valuable insights into the application of large language models for classifying damaged mechanical systems.

Enhancing Air Compressor Fault Diagnosis: A Comparative Study of GPT-2 and Traditional Machine Learning Models

Rezazadeh N.;Perfetto D.;Caputo F.;De Luca A.
2025

Abstract

This study investigates the application of GPT-2, a transformer-based model, for fault diagnosis in reciprocating air compressors, highlighting its ability to capture complex patterns in acoustic signals. Two approaches are compared: the first involves extracting time, frequency, and complexity-based features and classifying them using traditional machine learning models, with a narrow neural network achieving the best performance. The second approach reformulates these features as sequential data for GPT-2, which, through meticulous hyperparameter optimization, delivered superior diagnostic accuracy. Additionally, SHapley Additive exPlanations analysis was employed to enhance model interpretability by identifying the most influential features, providing valuable insights into the fault diagnosis process. While GPT-2 demonstrated notable performance gains over conventional models, it required a more precise hyperparameter tuning. This study offers valuable insights into the application of large language models for classifying damaged mechanical systems.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11591/572727
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? ND
social impact