Long-Term Guided Wave Dataset Under Real-World Conditions Dataset

Summary
What happens when we take structural health monitoring (SHM) out of the lab and into the real world?
The Dataset on Guided Waves from Long-Term Structural Health Monitoring under Uncontrolled and Dynamic Conditions, created by Kang Yang and the SmartDATA Lab, offers one of the first comprehensive public resources for doing just that. This large-scale, real-world dataset captures guided ultrasonic waves over two years of continuous monitoring, encompassing millions of waveforms, changing environmental conditions, and true operational variability.
It was introduced in the open-access journal Scientific Data by Yang et al. (2025):
๐ Yang, K., et al. (2025). Dataset on guided waves from long-term structural health monitoring under uncontrolled and dynamic conditions.
Nature Scientific Data, https://doi.org/10.1038/s41597-025-05300-5
๐ฆ What’s in the Dataset?
This dataset includes:
- ๐ Over 7.6 million ultrasonic guided wave signals, collected automatically
- ๐ 4.5 years of measurements, recorded every 30 seconds
- ๐ก๏ธ Paired temperature and humidity data, enabling environmental correlation studies
- ๐งญ Multiple distinct propagation paths across an aluminum panel instrumented with 14 piezoelectric sensors
- ๐ Real-world uncertainties: signal drift, sensor degradation, seasonal temperature swings, and unmonitored activity
Data is stored in efficient HDF5 format, and is accompanied by:
- โ Python code for data loading
- โ Scripts for basic preprocessing and exploratory analysis
๐ Why This Dataset Matters
Most SHM and AI-guided wave studies use simulated or lab-controlled data. These are valuableโbut they donโt reflect the uncertainty, noise, and environmental variation found in real infrastructure.
This dataset helps researchers:
- ๐ง Train and evaluate AI algorithms under real, imperfect conditions
- ๐ Develop unsupervised learning, drift compensation, and anomaly detection techniques
- ๐ฆ๏ธ Understand the influence of temperature, humidity, and long-term aging on signal quality
- ๐งช Benchmark signal processing, sensor fusion, and domain adaptation methods in SHM
With this data, AI and SHM researchers can finally move beyond idealized scenarios and begin tackling the complexities of real-world monitoring.
๐ก Ideal For:
- Machine learning & signal processing research in SHM
- Benchmarking algorithms under environmental drift
- Simulating long-term sensor networks
- Educating students with real sensor data
- Developing temperature-robust damage detection algorithms
๐ Get Started
- ๐ Access the dataset:
โถ View on Figshare - ๐ป Load and explore the data:
โถ GitHub code repository
๐ Citation
If you use this dataset in your research, please cite:
Yang, K., et al. (2025). Dataset on guided waves from long-term structural health monitoring under uncontrolled and dynamic conditions. Scientific Data. https://doi.org/10.1038/s41597-025-05300-5
๐ In Summary
This is not just another SHM datasetโitโs a step toward real-world AI in structural monitoring. Whether youโre building smarter algorithms, modeling sensor behavior over time, or exploring environmental robustness, this dataset provides a rich, messy, and meaningful foundation for innovation.
Long-term. Real-world. Open-access. This is SHM data for the future.
Lamb Wave Long-Term Machine Learning Real Data Structural Health Monitoring Variations