YADING: Fast Clustering of Large-Scale Time Series Data
- Rui Ding ,
- Qiang Wang ,
- Qiang Wang ,
- Yingnong Dang ,
- Qiang Fu ,
- Haidong Zhang ,
- Dongmei Zhang ,
- Rui Ding
Published by VLDB - Very Large Data Bases
Fast and scalable analysis techniques are becoming increasingly important in the era of big data, because they are the enabling techniques to create real-time and interactive experiences in data analysis. Time series are widely available in diverse application areas. Due to the large number of time series instances (e.g., millions) and the high dimensionality of each time series instance (e.g., thousands), it is challenging to conduct clustering on largescale time series, and it is even more challenging to do so in realtime to support interactive exploration.