numenta / NAB

The Numenta Anomaly Benchmark
GNU Affero General Public License v3.0

Synthetic datasets for different classes of anomalies #217

Closed by breznak 9 years ago

breznak commented 9 years ago

Define, create, and include synthetic datasets for different kinds of anomalies. This is important for regression testing, as simple data can stress specific properties of HTM at different difficulty levels. It will also help to define the concrete advantages and weak spots of HTM.
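As a rough illustration of what such datasets could look like, here is a minimal sketch that builds a noisy sine baseline and injects three kinds of anomaly into it. The anomaly classes (spike, level shift, flatline), the function names, and all parameter values are assumptions made for this example and are not taken from NAB or from this thread. The `timestamp,value` CSV layout is likewise assumed to match what a NAB-compatible data file expects.

```python
import csv
import io
import math
import random
from datetime import datetime, timedelta


def make_series(n=500, period=50, noise=0.1, seed=0):
    """Noisy sine wave used as the anomaly-free baseline (hypothetical)."""
    rng = random.Random(seed)
    return [math.sin(2 * math.pi * i / period) + rng.gauss(0, noise)
            for i in range(n)]


def inject_spike(values, at, magnitude=5.0):
    """Point anomaly: a single sample far outside the normal range."""
    out = list(values)
    out[at] += magnitude
    return out


def inject_level_shift(values, at, offset=3.0):
    """Level shift: every sample from index `at` onward is offset."""
    return [v + offset if i >= at else v for i, v in enumerate(values)]


def inject_flatline(values, at, length=40):
    """Flatline: the signal freezes at one value, like a stuck sensor."""
    out = list(values)
    for i in range(at, min(at + length, len(out))):
        out[i] = values[at]
    return out


def to_csv(values, start=datetime(2015, 1, 1), step=timedelta(minutes=5)):
    """Serialize as timestamp,value rows (assumed NAB-style layout)."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(["timestamp", "value"])
    for i, v in enumerate(values):
        stamp = (start + i * step).strftime("%Y-%m-%d %H:%M:%S")
        writer.writerow([stamp, f"{v:.4f}"])
    return buf.getvalue()


if __name__ == "__main__":
    base = make_series()
    # One file per anomaly class; difficulty can be varied via magnitude/offset.
    datasets = {
        "spike": inject_spike(base, at=250),
        "level_shift": inject_level_shift(base, at=300),
        "flatline": inject_flatline(base, at=100),
    }
    for name, series in datasets.items():
        print(name, len(to_csv(series).splitlines()) - 1, "rows")
```

Keeping each anomaly class in its own function makes it easy to scale difficulty (for example, a smaller `magnitude` or `offset`) and to record the injected index as the ground-truth label.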

breznak commented 9 years ago

@subutai This has been on my mind for a while; I'd love to hear your thoughts on it! :question:

subutai commented 9 years ago

NAB already includes a few artificial datasets, some of which fall into your classes above. I think it is fine to create some more elsewhere (i.e. another repo) that are NAB compatible, but I don't really want to add them into the formal benchmark. I want to focus NAB mostly on real world data and would ideally like to even get rid of the existing artificial datasets. There are lots of other anomaly benchmarks with artificial data.

breznak commented 5 years ago

> I want to focus NAB mostly on real world data and would ideally like to even get rid of the existing artificial datasets.

Revisiting this. Your decision sounds fair; I'll set up a NAB.synthetic.

subutai commented 5 years ago

👍