shenghua-liu / reading-group

ML, DM, big graph mining, time series mining, anomaly detection

MASA: Motif-Aware State Assignment in Noisy Time Series Data #4

Open tdingquan opened 5 years ago

tdingquan commented 5 years ago

Authors:

  • Saachi Jain - saachi@stanford.edu - Stanford University
  • David Hallac - hallac@stanford.edu - Stanford University
  • Rok Sosic - rok@stanford.edu - Stanford University
  • Jure Leskovec - jure@stanford.edu - Stanford University

Conference:

Keywords:

  • Noisy motif discovery
  • Temporal clustering
  • Multivariate time series

Tasks:

  • Cluster analysis
  • State definition
  • Motif discovery

Code:

Datasets: [TO BE EDITED]

  1. Subjects cycling on an exercise bike, Daily and Sports Activities Data Set
  2. Aircraft, commercial & classified
  3. Automobiles, commercial & classified

Reviewer:

  • Ding Quan
tdingquan commented 5 years ago

One Sentence Summary:

This paper uses an unsupervised learning method to discover common motifs and leverages those motifs to assign states to the system measurements more robustly, where each measurement can be regarded as one moment of a long, noisy multivariate time series observed over a period of time.

The whole process can be seen as an optimization problem.

Problems:

  1. The individual states and state assignments are unknown.
  2. The durations for which measurements remain in any one state are unknown.

Definitions:

  1. State: a prototype of system behavior.
  2. Motif: a sequence of state assignments corresponding to complex behaviors that capture a common sequence of state transitions, where neighboring occurrences of the same state are merged into one (see the sketch below).
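To make the "merge neighboring occurrences of the same state" part of the motif definition concrete, here is a minimal Python sketch (the function name and representation are mine, not the paper's):

```python
from itertools import groupby

def collapse_states(state_seq):
    """Merge consecutive repeats of the same state, e.g. [1,1,2,2,2,1] -> [1,2,1].

    A motif is then a short pattern over this collapsed sequence, so it captures
    the order of state transitions rather than their exact durations.
    """
    return [state for state, _ in groupby(state_seq)]

# Noisy assignments with different durations map to the same collapsed motif (1, 2, 1).
print(collapse_states([1, 1, 2, 2, 2, 1]))        # [1, 2, 1]
print(collapse_states([1, 2, 2, 2, 2, 2, 1, 1]))  # [1, 2, 1]
```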

Input & Output:

  • Input: A sequence of T measurements, where each measurement is a vector of data values observed at time t.

  • Output: the sequence of states corresponding to the measurements, and the motif patterns hidden in that state sequence.

Models:

  1. State model Θ
  2. Assignment of states to measurements S
  3. Assignment of motifs to measurements M
  4. Optimize: jointly over Θ, S, and M [objective-function screenshot not recovered; a hedged sketch of its shape follows this list]
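Since the screenshot of the objective did not survive, here is only a rough sketch of its shape, reconstructed from the three quantities listed above; the motif-reward term and its weight γ are my assumptions, not the paper's exact formula:

```latex
% Schematic only: a likelihood term for the measurements under the state model,
% plus an assumed reward (weight \gamma) for state assignments that agree with
% the discovered motifs. The paper's actual objective is in the missing screenshot.
\max_{\Theta,\, S,\, M} \;
  \ell\!\left(x_1, \dots, x_T \mid \Theta, S\right)
  \;+\; \gamma \, \mathrm{MotifReward}(S, M)
```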

Methods & Process:

  1. Expectation-Maximization (EM)-type approach (a toy skeleton of this loop is sketched after this list):

     Initialize: initialize the state model Θ.

     E-Step: assign states to measurements, discover motifs, and update the state assignments.

       E-Step A: discover candidate motifs [formula screenshot not recovered].
       E-Step B: use the motifs to assign states [formula screenshot not recovered].

     M-Step: update the state probability model Θ with the updated state assignments.

  2. Viterbi algorithm, to solve the state assignment problem
  3. Toeplitz Inverse Covariance-based Clustering (TICC) model, to define each state
  4. Hidden Markov model, to model each motif separately
  5. Suffix array, to find the maximal subsequences, where a maximal subsequence is one that cannot be extended to either the left or the right without changing its set of occurrences in S (illustrated in the second sketch below).
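To make the loop structure in item 1 concrete, here is a minimal, self-contained Python sketch. Everything in it is a stand-in chosen for illustration, not the authors' code: per-state means replace the TICC state model, bigram counting over the collapsed sequence replaces the paper's motif discovery, and a plain nearest-mean assignment replaces the motif-aware Viterbi pass.

```python
import numpy as np
from itertools import groupby
from collections import Counter

def fit_state_model(X, states, k):
    """M-step stand-in: per-state means (the paper fits a TICC model here)."""
    return np.array([X[states == s].mean(axis=0) if np.any(states == s) else X.mean(axis=0)
                     for s in range(k)])

def discover_motifs(states, top=3):
    """E-step A stand-in: most frequent state bigrams in the collapsed sequence."""
    collapsed = [s for s, _ in groupby(states)]
    return Counter(zip(collapsed, collapsed[1:])).most_common(top)

def assign_states(X, means):
    """E-step B stand-in: nearest-mean labels (no motif reward, no Viterbi pass)."""
    dists = ((X[:, None, :] - means[None, :, :]) ** 2).sum(axis=2)
    return dists.argmin(axis=1)

def masa_like_em(X, k, n_iters=10, seed=0):
    """Toy EM-type loop mirroring the Initialize / E-step / M-step structure above."""
    rng = np.random.default_rng(seed)
    states = rng.integers(0, k, size=len(X))   # random start
    theta = fit_state_model(X, states, k)      # Initialize: the state model Θ
    for _ in range(n_iters):
        motifs = discover_motifs(states)       # E-step A: discover candidate motifs
        states = assign_states(X, theta)       # E-step B: assign states (the paper also uses the motifs here)
        theta = fit_state_model(X, states, k)  # M-step: update Θ from the new assignments
    return states, motifs
```

Running it on a (T, d) NumPy array returns per-measurement labels plus the most common collapsed-state bigrams; in the real method each stand-in would be replaced by the corresponding component from items 2–5.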
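And to illustrate the "maximal subsequence" definition in item 5, here is a brute-force Python check of that property on a small collapsed state sequence. The paper finds these patterns efficiently with a suffix array; this quadratic scan exists only to make the definition concrete, and the helper names are mine.

```python
def occurrences(seq, pattern):
    """Start positions of every occurrence of `pattern` (a tuple) in `seq`."""
    m = len(pattern)
    return [i for i in range(len(seq) - m + 1) if tuple(seq[i:i + m]) == pattern]

def maximal_subsequences(seq, min_count=2):
    """Repeated patterns that cannot be extended left or right without changing
    their set of occurrences (brute force; the paper uses a suffix array)."""
    seq = list(seq)
    found = set()
    for m in range(1, len(seq)):
        for i in range(len(seq) - m + 1):
            pat = tuple(seq[i:i + m])
            occ = occurrences(seq, pat)
            if len(occ) < min_count:
                continue
            # Can the pattern be extended by one symbol to the right/left while
            # keeping exactly the same occurrences?
            right = {tuple(seq[j:j + m + 1]) for j in occ if j + m < len(seq)}
            left = {tuple(seq[j - 1:j + m]) for j in occ if j > 0}
            ext_right = len(right) == 1 and len(occurrences(seq, next(iter(right)))) == len(occ)
            ext_left = len(left) == 1 and len(occurrences(seq, next(iter(left)))) == len(occ)
            if not ext_right and not ext_left:
                found.add(pat)
    return sorted(found, key=len, reverse=True)

# (1, 2, 3) repeats three times and cannot be extended without losing occurrences.
print(maximal_subsequences([1, 2, 3, 4, 1, 2, 3, 5, 1, 2, 3]))   # [(1, 2, 3)]
```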
tdingquan commented 5 years ago

Problem 1:

During E-Step A, the discovery of candidate motifs, the null model is used to discriminate redundancies. I believe a bracket is missing from the corresponding formula. What should the formula look like?

[screenshot of the formula in question; not recovered]

In particular, what does the B stand for? Is that a standard notation?