Automatic sleep-stage classification is a key component of long-term sleep monitoring and digital health applications. Although deep learning models trained on centralized datasets have achieved strong performance, their deployment in real-world healthcare settings is constrained by privacy, data-governance, and regulatory requirements. Federated learning (FL) addresses these issues by enabling decentralized training in which raw data remain local and only model parameters are exchanged; however, its effectiveness under realistic physiological heterogeneity remains insufficiently understood. In this work, we investigate a subject-level federated deep learning framework for sleep-stage classification using polysomnography data from the ISRUC-Sleep dataset. We adopt a realistic one subject = one client setting spanning three clinically distinct subgroups and evaluate a lightweight one-dimensional convolutional neural network (1D-CNN) under four training regimes: a centralized baseline and three federated strategies (FedAvg, FedProx, and FedBN), all sharing identical architecture and preprocessing. The centralized model, trained on a cohort with regular sleep architecture, achieves stable performance (accuracy 69.65%, macro-F1 0.6537). In contrast, naive FedAvg fails to converge under subject-level non-IID data (accuracy 14.21%, macro-F1 0.0601), with minority stages such as N1 and REM largely lost. FedProx yields only marginal improvement, while FedBN-by preserving client-specific batch-normalization statistics-achieves the best federated performance (accuracy 26.04%, macro-F1 0.1732) and greater stability across clients. These findings indicate that the main limitation of FL for sleep staging lies in physiological heterogeneity rather than model capacity, highlighting the need for heterogeneity-aware strategies in privacy-preserving sleep analytics.

A Federated Deep Learning Framework for Sleep-Stage Monitoring Using the ISRUC-Sleep Dataset

Amato A.
2026

Abstract

Automatic sleep-stage classification is a key component of long-term sleep monitoring and digital health applications. Although deep learning models trained on centralized datasets have achieved strong performance, their deployment in real-world healthcare settings is constrained by privacy, data-governance, and regulatory requirements. Federated learning (FL) addresses these issues by enabling decentralized training in which raw data remain local and only model parameters are exchanged; however, its effectiveness under realistic physiological heterogeneity remains insufficiently understood. In this work, we investigate a subject-level federated deep learning framework for sleep-stage classification using polysomnography data from the ISRUC-Sleep dataset. We adopt a realistic one subject = one client setting spanning three clinically distinct subgroups and evaluate a lightweight one-dimensional convolutional neural network (1D-CNN) under four training regimes: a centralized baseline and three federated strategies (FedAvg, FedProx, and FedBN), all sharing identical architecture and preprocessing. The centralized model, trained on a cohort with regular sleep architecture, achieves stable performance (accuracy 69.65%, macro-F1 0.6537). In contrast, naive FedAvg fails to converge under subject-level non-IID data (accuracy 14.21%, macro-F1 0.0601), with minority stages such as N1 and REM largely lost. FedProx yields only marginal improvement, while FedBN-by preserving client-specific batch-normalization statistics-achieves the best federated performance (accuracy 26.04%, macro-F1 0.1732) and greater stability across clients. These findings indicate that the main limitation of FL for sleep staging lies in physiological heterogeneity rather than model capacity, highlighting the need for heterogeneity-aware strategies in privacy-preserving sleep analytics.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11591/584673
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact