Open tdingquan opened 5 years ago
This paper uses unsupervised learning method to discover common motif and leverage the motifs to more robustly assign states to the system measurement, which may be regarded as one moment's of the whole large number of noisy time series data over a period of time.
The whole process can be seen as an optimization problem.
- The individual states and state assignments are unknown.
- Durations of measurements assigned to one state is unknown.
- State: Represent a prototype of system behavior.
- Motif: Sequence of state assignments which correspond to complex behaviors that capture common sequence of state transitions, where all neighboring occureences of the same states are merged into one.
Input: A sequence of T measurements, where each measurement is a vector of data values observed at time t.
Output: Sequence of States corresponding to the measurements and Motif Patterns hidden in the state.
- State model Θ
- Assignment of states to measurements S
- Assignment of motifs to measurements M
- Optimize:
- Expectation Maximization Type Approach:
Initialize: Initialize the state model Θ E-Step: Assign state to measurements, discover motifs and update state assignments
E-Step A: Discover Candidate Motifs E-Step B: Using Motifs to Assign States: M-Step: Update state probability model Θ with the updated state assignment
- Viterby Alogorithm, to solve the state assignment problem
- Toeplitz Inverse Covariance-based Clustering (TICC) model, to define each state
- Hidden Markov model, to model each motif separately
- Suffix array, to find the maximal subsequence, which is defined as a sequence that cannot be extended to either the left or right without changing the set of occurrences in S.
During the process of E-Step A, the discorvery of candidates motif, we use the null model to discriminate the redundancies. I believe a bracket was missed from the corresponding formula. What should the formula look like?
Especially the B, what does this mean? Is this an idiom?
Authors:
Conference:
Keywords:
Tasks:
Code:
Datasets: [TO BE EDITTED]
Reviewer: