Distributed vibration sensing based on optical vector network analysis (OVNA) is a promising technique for measuring dynamic perturbations in optical fibers, but its practical use is limited by the high computational cost of short-time Fourier transform (STFT) and cross-correlation stages. In this work, we present a GPU-accelerated signal processing pipeline, together with an optimization strategy based on dataflow reduction, mixed-precision arithmetic, and hardware-aware tuning. The proposed implementation reduces the processing time for 200 sweeps from 64.7 s on a single-core CPU to 0.199 s on a modern GPU, while preserving the final shift results, with zero mismatches over 199,199 measurement points. Benchmarking across three GPU generations further shows that STFT benefits more from large on-chip cache resources, whereas cross-correlation scales more closely with memory bandwidth. These results suggest that modern GPUs can significantly reduce the computational burden of OVNA, as well as other distributed sensing methods with a similar processing flow, enabling kHz-rate aggregate throughput from batched processing, supporting real-time-oriented operation on modern GPUs.
GPU-Accelerated Signal Processing for Distributed Vibration Sensing Based on OVNA Method
Alessandro Meoli;Raffaele Vallifuoco;Agnese Coscetta;Luigi Zeni;Aldo Minardo
2026
Abstract
Distributed vibration sensing based on optical vector network analysis (OVNA) is a promising technique for measuring dynamic perturbations in optical fibers, but its practical use is limited by the high computational cost of short-time Fourier transform (STFT) and cross-correlation stages. In this work, we present a GPU-accelerated signal processing pipeline, together with an optimization strategy based on dataflow reduction, mixed-precision arithmetic, and hardware-aware tuning. The proposed implementation reduces the processing time for 200 sweeps from 64.7 s on a single-core CPU to 0.199 s on a modern GPU, while preserving the final shift results, with zero mismatches over 199,199 measurement points. Benchmarking across three GPU generations further shows that STFT benefits more from large on-chip cache resources, whereas cross-correlation scales more closely with memory bandwidth. These results suggest that modern GPUs can significantly reduce the computational burden of OVNA, as well as other distributed sensing methods with a similar processing flow, enabling kHz-rate aggregate throughput from batched processing, supporting real-time-oriented operation on modern GPUs.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


