An Improved Self-Training Method for Positive Unlabeled Time Series Classification Using DTW Barycenter Averaging
Abstract
:1. Introduction
- We point out that traditional ST-based methods may be sensitive to the initial labeled time-series located near the boundary between the positive and negative classes. To overcome this issue, we propose a novel method ST-average to solve the PUTSC problem by using the average sequence of the PL set to decide which unlabeled time-series should be labeled and added into PL set.
- It is not a trivial task to calculate the average sequence of the time-series set and we demonstrate the necessity of using DBA through experiments.
- The ST-average method is orthogonal to some of the stopping criteria and similarity measures used in ST-based methods. We show how ST-average can naturally be implemented along with them and present an explicit implementation of ST-average.
- We conduct experiments using public well-know time-series datasets to evaluate the performance of our proposal. Experimental results demonstrate that our method performs better than related competitors.
2. Background and Related Work
2.1. Positive Unlabeled Time Series Classification
2.2. Self-Training Technique for the PUTSC
Algorithm 1 The pseudo-code of the self-training method for PUTSC. |
Require: |
: Original positive labeled data; U: Unlabeled data. |
Ensure: |
: The time-series set labeled as positive. |
: The time-series set labeled as negative. |
|
2.3. Related Work for the Self-Training
3. Proposed Method
3.1. Motivation
3.2. Dynamic Time Warping
- Boundary constraint: and .
- Continuity-monotonically constraint: .
3.3. Time-Series Averaging
Algorithm 2 The pseudo-code of the DBA method for averaging time-series set. |
Require: |
: the time-series set to average. |
: initial average sequence (length l) selected from T randomly. |
: number of iterations. |
Ensure: |
: the average sequence. |
|
3.4. The ST-Average Method
Algorithm 3 The pseudo-code of the ST-average method for PUTSC. |
Require: |
: Original positive labeled data; U: Unlabeled data. |
Ensure: |
: The time-series set labeled as positive. |
: The time-series set labeled as negative. |
|
3.5. Time Complexity Analysis
4. Experimental Evaluation
4.1. Experimental Setup
4.1.1. Algorithms
- ST-SCC is proposed in [8], and is one of the state-of-the-art algorithms for PUTSC problem, which uses DTW distance as the similarity measure and SCC as the stopping criterion.
- C-MDL is proposed in [9], which uses the constraint-based MDL principle for PUTSC problem. This method does not use any stopping criteria, but stops the self-training process when the number of time-series which does not satisfy the constraints exceeds the predefined threshold.
- SCC-center-dtw is our proposed method presented in Section 3.4, which utilizes the idea of ST-average and is an explicit implementation of ST-average.
- SCC-center-ed is similar to our SCC-center-dtw approach. The only difference is that SCC-center-ed uses the Euclidean Distance (ED) to calculate the average sequence while SCC-center-dtw uses DTW distance.
4.1.2. The Performance Metric
4.1.3. Datasets
4.1.4. Implementation Details
4.2. F1-Score
4.3. Running Time
5. Conclusions and Future Work
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Chen, L.J.; Ho, Y.H.; Hsieh, H.H.; Huang, S.T.; Lee, H.C.; Mahajan, S. ADF: An anomaly detection framework for large-scale PM2. 5 sensing systems. IEEE Internet Things J. 2017, 5, 559–570. [Google Scholar] [CrossRef]
- Norgaard, S.; Saeedi, R.; Gebremedhin, A.H. Multi-Sensor Time-Series Classification for Activity Tracking Under Variable Length. IEEE Sens. J. 2019, 20, 2701–2709. [Google Scholar] [CrossRef]
- Kiranyaz, S.; Ince, T.; Gabbouj, M. Real-time patient-specific ECG classification by 1-D convolutional neural networks. IEEE Trans. Biomed. Eng. 2015, 63, 664–675. [Google Scholar] [CrossRef] [PubMed]
- Chen, W.; Shi, K. A deep learning framework for time series classification using Relative Position Matrix and Convolutional Neural Network. Neurocomputing 2019, 359, 384–394. [Google Scholar] [CrossRef]
- Liang, S.; Zhang, Y.; Ma, J. Active Model Selection for Positive Unlabeled Time Series Classification. In Proceedings of the 2020 IEEE 36th International Conference on Data Engineering (ICDE), Dallas, TX, USA, 20–24 April 2020; pp. 361–372. [Google Scholar]
- Nguyen, M.N.; Li, X.L.; Ng, S.K. Positive unlabeled learning for time series classification. In Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence, Barcelona, Spain, 16–22 July 2011. [Google Scholar]
- Wei, L.; Keogh, E. Semi-supervised time series classification. In Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Philadelphia, PA, USA, 20–23 August 2006; pp. 748–753. [Google Scholar]
- Ratanamahatana, C.A.; Wanichsan, D. Stopping criterion selection for efficient semi-supervised time series classification. In Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing; Springer: Berlin/Heidelberg, Germany, 2008; pp. 1–14. [Google Scholar]
- Vinh, V.T.; Anh, D.T. Constraint-based MDL principle for semi-supervised classification of time series. In Proceedings of the 2015 Seventh International Conference on Knowledge and Systems Engineering (KSE), Ho Chi Minh, Vietnam, 8–10 October 2015; pp. 43–48. [Google Scholar]
- Begum, N.; Hu, B.; Rakthanmanon, T.; Keogh, E. Towards a minimum description length based stopping criterion for semi-supervised time series classification. In Proceedings of the 2013 IEEE 14th International Conference on Information Reuse & Integration (IRI), San Francisco, CA, USA, 14–16 August 2013; pp. 333–340. [Google Scholar]
- Vinh, V.T.; Anh, D.T. Two novel techniques to improve mdl-based semi-supervised classification of time series. In Transactions on Computational Collective Intelligence XXV; Springer: Berlin/Heidelberg, Germany, 2016; pp. 127–147. [Google Scholar]
- González, M.; Bergmeir, C.; Triguero, I.; Rodríguez, Y.; Benítez, J.M. On the stopping criteria for k-nearest neighbor in positive unlabeled time series classification problems. Inf. Sci. 2016, 328, 42–59. [Google Scholar] [CrossRef]
- Chen, Y.; Hu, B.; Keogh, E.; Batista, G.E. DTW-D: Time series semi-supervised learning from a single example. In Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Chicago, IL, USA, 11–13 August 2013; pp. 383–391. [Google Scholar]
- de Carvalho Pagliosa, L.; de Mello, R.F. Semi-supervised time series classification on positive and unlabeled problems using cross-recurrence quantification analysis. Pattern Recognit. 2018, 80, 53–63. [Google Scholar] [CrossRef]
- Petitjean, F.; Forestier, G.; Webb, G.I.; Nicholson, A.E.; Chen, Y.; Keogh, E. Faster and more accurate classification of time series by exploiting a novel dynamic time warping averaging algorithm. Knowl. Inf. Syst. 2016, 47, 1–26. [Google Scholar] [CrossRef] [Green Version]
- Dau, H.A.; Keogh, E.; Kamgar, K.; Yeh, C.C.M.; Zhu, Y.; Gharghabi, S.; Ratanamahatana, C.A.; Hu, B.; Begum, N.; Bagnall, A.; et al. The UCR Time Series Classification Archive. 2018. Available online: https://www.cs.ucr.edu/~eamonn/time_series_data_2018/ (accessed on 4 November 2021).
- Berndt, D.J.; Clifford, J. Using Dynamic Time Warping to Find Patterns in Time Series; KDD Workshop: Seattle, WA, USA, 1994; Volume 10, pp. 359–370. [Google Scholar]
- Petitjean, F.; Ketterlin, A.; Gançarski, P. A global averaging method for dynamic time warping, with applications to clustering. Pattern Recognit. 2011, 44, 678–693. [Google Scholar] [CrossRef]
- Zhu, X.J. Semi-Supervised Learning Literature Survey; Technical Report; University of Wisconsin-Madison Department of Computer Sciences: Madison, WI, USA, 2005. [Google Scholar]
- Zhu, X.; Goldberg, A.B. Introduction to semi-supervised learning. Synth. Lect. Artif. Intell. Mach. Learn. 2009, 3, 1–130. [Google Scholar] [CrossRef] [Green Version]
- Nguyen, M.N.; Li, X.L.; Ng, S.K. Ensemble based positive unlabeled learning for time series classification. In International Conference on Database Systems for Advanced Applications; Springer: Berlin/Heidelberg, Germany, 2012; pp. 243–257. [Google Scholar]
- Niennattrakul, V.; Srisai, D.; Ratanamahatana, C.A. Shape-based template matching for time series data. Knowl.-Based Syst. 2012, 26, 1–8. [Google Scholar] [CrossRef]
- Morel, M.; Achard, C.; Kulpa, R.; Dubuisson, S. Time-series averaging using constrained dynamic time warping with tolerance. Pattern Recognit. 2018, 74, 77–89. [Google Scholar] [CrossRef] [Green Version]
- Liu, Y.T.; Zhang, Y.A.; Zeng, M. Adaptive global time sequence averaging method using dynamic time warping. IEEE Trans. Signal Process. 2019, 67, 2129–2142. [Google Scholar] [CrossRef]
- Rakthanmanon, T.; Campana, B.; Mueen, A.; Batista, G.; Westover, B.; Zhu, Q.; Zakaria, J.; Keogh, E. Searching and mining trillions of time series subsequences under dynamic time warping. In Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Beijing, China, 12–16 August 2012; pp. 262–270. [Google Scholar]
- Keogh, E.J.; Pazzani, M.J. Derivative dynamic time warping. In Proceedings of the 2001 SIAM International Conference on Data Mining, Chicago, IL, USA, 5–7 April 2001; pp. 1–11. [Google Scholar]
- Candan, K.S.; Rossini, R.; Wang, X.; Sapino, M.L. sDTW: Computing DTW distances using locally relevant constraints based on salient feature alignments. Proc. VLDB Endow. 2012, 5, 1519–1530. [Google Scholar] [CrossRef]
- Niennattrakul, V.; Ratanamahatana, C.A. On clustering multimedia time series data using k-means and dynamic time warping. In Proceedings of the 2007 International Conference on Multimedia and Ubiquitous Engineering (MUE’07), Seoul, Korea, 26–28 April 2007; pp. 733–738. [Google Scholar]
- Leon-Alcaide, P.; Rodriguez-Benitez, L.; Castillo-Herrera, E.; Moreno-Garcia, J.; Jimenez-Linares, L. An evolutionary approach for efficient prototyping of large time series datasets. Inf. Sci. 2020, 511, 74–93. [Google Scholar] [CrossRef]
- Zhang, M.; Pi, D. A new time series representation model and corresponding similarity measure for fast and accurate similarity detection. IEEE Access 2017, 5, 24503–24519. [Google Scholar] [CrossRef]
- Tan, C.W.; Petitjean, F.; Webb, G.I. Elastic bands across the path: A new framework and method to lower bound DTW. In Proceedings of the 2019 SIAM International Conference on Data Mining, Calgary, AB, Canada, 2–4 May 2019; pp. 522–530. [Google Scholar]
- Silva, D.F.; Batista, G.E. Speeding up all-pairwise dynamic time warping matrix calculation. In Proceedings of the 2016 SIAM International Conference on Data Mining, Miami, FL, USA, 5–7 May 2016; pp. 837–845. [Google Scholar]
- Sakoe, H.; Chiba, S. Dynamic programming algorithm optimization for spoken word recognition. IEEE Trans. Acoust. Speech Signal Process. 1978, 26, 43–49. [Google Scholar] [CrossRef] [Green Version]
- Itakura, F. Minimum prediction residual principle applied to speech recognition. IEEE Trans. Acoust. Speech Signal Process. 1975, 23, 67–72. [Google Scholar] [CrossRef]
No. | Dataset | Size of Training Data | Time-Series length | Classes | Number of Positive Samples | Type |
---|---|---|---|---|---|---|
1 | CBF | 30 | 128 | 3 | 10 | Simulated |
2 | Meat | 60 | 448 | 3 | 20 | Spectro |
3 | Coffee | 28 | 286 | 2 | 14 | Spectro |
4 | FaceAll | 560 | 131 | 14 | 40 | Image |
5 | ECG5000 | 500 | 140 | 5 | 292 | ECG |
6 | Strawberry | 370 | 235 | 2 | 132 | Spectro |
7 | FiftyWords | 450 | 270 | 50 | 52 | Image |
8 | TwoleadECG | 23 | 82 | 2 | 12 | ECG |
9 | ItalyPowerDemand | 67 | 24 | 2 | 34 | Sensor |
10 | FreezerRegularTrain | 150 | 301 | 2 | 75 | Sensor |
Dataset | ST-SCC | SCC-Center-dtw | SCC-Center-ed | C-MDL |
---|---|---|---|---|
CBF | 0.669 ± 0.123 | 0.805 ± 0.101 | 0.771 ± 0.103 | 0.488 ± 0.175 |
Meat | 0.550 ± 0.096 | 0.561 ± 0.083 | 0.580 ± 0.066 | 0.460 ± 0 |
Coffee | 0.608 ± 0.200 | 0.814 ± 0.145 | 0.735 ± 0.210 | 0.611 ± 0 |
FaceAll | 0.508 ± 0.144 | 0.552 ± 0.201 | 0.353 ± 0.073 | 0.469 ± 0.192 |
ECG5000 | 0.551 ± 0.087 | 0.758 ± 0.234 | 0.891 ± 0.015 | 0.796 ± 0.150 |
Strawberry | 0.547 ± 0.088 | 0.615 ± 0.118 | 0.546 ± 0.129 | 0.520 ± 0 |
FiftyWords | 0.780 ± 0.140 | 0.759 ± 0.135 | 0.063 ± 0.020 | 0.409 ± 0.206 |
TwoleadECG | 0.563 ± 0.169 | 0.657 ± 0.174 | 0.464 ± 0.176 | 0.628 ± 0.076 |
ItalyPowerDemand | 0.431 ± 0.147 | 0.603 ± 0.183 | 0.469 ± 0.218 | 0.684 ± 0.039 |
FreezerRegularTrain | 0.285 ± 0.107 | 0.708 ± 0.046 | 0.702 ± 0.097 | 0.591 ± 0.131 |
Dataset | ST-SCC | SCC-Center-dtw | SCC-Center-ed | C-MDL |
---|---|---|---|---|
CBF | 3 | 1 | 2 | 4 |
Meat | 3 | 2 | 1 | 4 |
Coffee | 4 | 1 | 2 | 3 |
FaceAll | 2 | 1 | 4 | 3 |
ECG5000 | 4 | 3 | 1 | 2 |
Strawberry | 2 | 1 | 3 | 4 |
FiftyWords | 1 | 2 | 4 | 3 |
TwoleadECG | 3 | 1 | 4 | 2 |
ItalyPowerDemand | 4 | 2 | 3 | 1 |
FreezerRegularTrain | 4 | 1 | 2 | 3 |
Average ranking | 3.0 ± 1.0 | 1.5 ± 0.67 | 2.6 ± 1.11 | 2.9 ± 0.94 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Li, J.; Zhang, H.; Dong, Y.; Zuo, T.; Xu, D. An Improved Self-Training Method for Positive Unlabeled Time Series Classification Using DTW Barycenter Averaging. Sensors 2021, 21, 7414. https://doi.org/10.3390/s21217414
Li J, Zhang H, Dong Y, Zuo T, Xu D. An Improved Self-Training Method for Positive Unlabeled Time Series Classification Using DTW Barycenter Averaging. Sensors. 2021; 21(21):7414. https://doi.org/10.3390/s21217414
Chicago/Turabian StyleLi, Jing, Haowen Zhang, Yabo Dong, Tongbin Zuo, and Duanqing Xu. 2021. "An Improved Self-Training Method for Positive Unlabeled Time Series Classification Using DTW Barycenter Averaging" Sensors 21, no. 21: 7414. https://doi.org/10.3390/s21217414
APA StyleLi, J., Zhang, H., Dong, Y., Zuo, T., & Xu, D. (2021). An Improved Self-Training Method for Positive Unlabeled Time Series Classification Using DTW Barycenter Averaging. Sensors, 21(21), 7414. https://doi.org/10.3390/s21217414