This paper investigates the underexplored area of clustering multiple survival curves, with a focus on the advantages of Functional Data Analysis for analyzing survival or hazard functions to exploit their inherent continuous nature. We propose customized functional methods, particularly leveraging Functional Principal Component Analysis, and compare them with existing methods using two real datasets: the German Breast Cancer Study (GBCS) and the Lung Cancer dataset. The results show that FDA-based methods offer faster execution times and improve clustering quality overall, highlighting the potential of FDA as a more natural and efficient approach for clustering survival curves, making it a promising direction for future survival data analysis.
Functional Clustering for Survival Curves
Mariarita De LuciaMembro del Collaboration Group
;Elvira Romano
Membro del Collaboration Group
;Fabrizio MaturoMembro del Collaboration Group
2025
Abstract
This paper investigates the underexplored area of clustering multiple survival curves, with a focus on the advantages of Functional Data Analysis for analyzing survival or hazard functions to exploit their inherent continuous nature. We propose customized functional methods, particularly leveraging Functional Principal Component Analysis, and compare them with existing methods using two real datasets: the German Breast Cancer Study (GBCS) and the Lung Cancer dataset. The results show that FDA-based methods offer faster execution times and improve clustering quality overall, highlighting the potential of FDA as a more natural and efficient approach for clustering survival curves, making it a promising direction for future survival data analysis.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


