NeuroTechX / moabb

Mother of All BCI Benchmarks
https://neurotechx.github.io/moabb/
BSD 3-Clause "New" or "Revised" License
700 stars 180 forks source link

[No Code] Discover new datasets #1

Open alexandrebarachant opened 7 years ago

alexandrebarachant commented 7 years ago

We need people browsing the web to discover interesting datasets than could be added to the moabb.

You can comment on this issue.

But first, check your dataset is not already in the list

What kind of datasets

We are interested in any datasets of time neural timeseries (EEG, MEG, ECOG, and fNIRS) with a minimum of 5 subjects, where we can apply machine learning algorithms and available online. It does not need to be a BCI dataset, but it must contains different condition/task, labelled and tagged.

How do I search for a new dataset ?

Many of the datasets of the BNCI index have not been reported. you can start here.

Researcher are making more and more datasets available. some database exists and might contains interesting things :

Finally, google is your friend

How much time does it takes ?

Entering a new dataset should took you 2 minutes.

vinay-jayaram commented 6 years ago

found one here: https://depositonce.tu-berlin.de/handle/11303/6271

alexandrebarachant commented 6 years ago

Browsing Plos one to find New motor imagery datasets:

vinay-jayaram commented 6 years ago

re: dataset 1, All the trials are pre-epoched :(

alexandrebarachant commented 6 years ago

yeah. Actually i think the GigaDb dataset is already like this ...

vinay-jayaram commented 6 years ago

ah you're right, you just concatenated all the trials. In that case we can do the same here :) good good

vinay-jayaram commented 6 years ago

also regarding the second to last: Have you e-mailed Fabien?

alexandrebarachant commented 6 years ago

I'm definitely not super happy about the concatenation of individual trials. in the case of the GigaDB, the dataset was too large to ignore. in those case, we can contact the authors to ask them about the raw data, but concatenating is a good starting point to see whether the dataset is really interesting or not.

Also, let's contact fabien and camille about the second last dataset. I will do it today.

vinay-jayaram commented 6 years ago

regarding concatenation though: Couldn't we just add a buffer of zeros before and after each trial to smooth out border effects? After de-meaning the trials to eliminate the issue of offset

alexandrebarachant commented 6 years ago

Yep we could. I think the most problematic part is the non zero mean that create huge edge artifact. We could also return Mne epochs in that case, but that still not ideal from a filtering point of view.

In any case we might want to put a warning ?

vinay-jayaram commented 6 years ago

warning is good, will add

Seburath commented 5 years ago

the list Is not synchronized whit the documentation, why? can I help there?

sylvchev commented 3 years ago

We will use this issue and the associated wiki page to keep track of the dataset that we could add in MOABB. Please, comment this issue if you want to report about a new dataset.

sylvchev commented 2 years ago

There is a nice dataset here for SSVEP and ERP using EEG and ear-EEG while standing or moving, the data are available here

Div12345 commented 2 years ago

These are 2 other interesting ones someone pointed out on the Slack channel -

  1. Continuous sensorimotor rhythm based brain computer interface learning in a large population - Data
  2. A large electroencephalographic motor imagery dataset for electroencephalographic brain computer interfaces - Data
sylvchev commented 2 years ago

Many datasets are listed here : https://www.researchgate.net/post/Are-there-any-public-EEG-data-sets-that-one-can-try-their-hands-on

okbalefthanded commented 1 year ago

This dataset is interesting for its population age and size, it is based on SSVEP for 100 participants with ages greater than 50 years old: https://www.nature.com/articles/s41597-022-01372-9

sylvchev commented 1 year ago

This dataset could be integrated in MOABB, MI with information about subjects: https://zenodo.org/record/7554429

taziksh commented 10 months ago

Ideas for EEG datasets: https://www.fieldtriptoolbox.org/faq/open_data/

okbalefthanded commented 9 months ago
vmcru commented 9 months ago

This dataset "Inner Speech Dataset" was published in nature and seems like a good fit to add support. Paper: Thinking out loud, an open-access EEG-based BCI dataset for inner speech recognition Data: OpenNeuro link

HarlockOfficial commented 9 months ago

These are 2 other interesting ones someone pointed out on the Slack channel -

  1. Continuous sensorimotor rhythm based brain computer interface learning in a large population - Data
  2. A large electroencephalographic motor imagery dataset for electroencephalographic brain computer interfaces - Data

Hi @Div12345, I was interested in the second dataset, but unfortunately, I did not find it in the MOABB documentation. Are there any plans related to adding the second dataset in the near future, or is the dataset already part of the library under some specific section or with a specific name?

bruAristimunha commented 8 months ago

All the dataset inside this paper: https://arxiv.org/pdf/2402.08656.pdf

okbalefthanded commented 8 months ago
bruAristimunha commented 8 months ago

https://www.frontiersin.org/articles/10.3389/fnhum.2023.1134869/full

machinelatto commented 7 months ago

Is someone working on BEnchmark database Towards BCI Application (https://www.frontiersin.org/journals/neuroscience/articles/10.3389/fnins.2020.00627/full)?

It is an SSVEP dataset with 70 subjects performing a 40-target cued-spelling task. I saw that it was referred on the Datasets to include section, but found no Issue related to it.

bruAristimunha commented 7 months ago

Hi @machinelatto!

It seems like no one focused on this task, or if someone started, didn't commit or create the PR. Would you be interested?

You would basically need to create two functions, as shown in this tutorial: https://neurotechx.github.io/moabb/auto_tutorials/4_adding_a_dataset.html#sphx-glr-auto-tutorials-4-adding-a-dataset-py

One is to download and one is to load the dataset using mne.

machinelatto commented 6 months ago

Hi @bruAristimunha !

I'm probably going to use this dataset on my research, so I could try to create those functions in the next weeks. If it goes well I'l open a PR.

okbalefthanded commented 3 months ago

P300 dataset from the paper Single-Option P300-BCI Performance Is Affected by Visual Stimulation Conditions Dataset: https://openneuro.org/datasets/ds003190/versions/1.0.1

okbalefthanded commented 2 months ago

Dataset from the paper: A comparison of stimulus types in online classification of the P300 speller using language models Data link: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/PHHHB6

okbalefthanded commented 1 month ago

59 Subjects 40 Classes SSVEP dataset

bruAristimunha commented 1 month ago

Pan2024, paper here: https://iopscience.iop.org/article/10.1088/1741-2552/ad0a01 from @PLC-TJU dataset here: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/251NOW

bruAristimunha commented 1 month ago

Moabb code to pan2024 https://github.com/PLC-TJU/RSF/blob/main/loaddata%2Fpan2023.py

bruAristimunha commented 1 month ago

all the datasets from: https://www.sciencedirect.com/science/article/pii/S0925231224013481?casa_token=868a85XVgwsAAAAA:71p-MYfwwTEAOFczbkAq4MNnvUCURVxW77Q7rSD6HLZa_En-YJpDNncO1TqzABByx1WaTI9chvM#tblfn5b

bruAristimunha commented 1 month ago

https://kilthub.cmu.edu/articles/dataset/EEG-BCI_Dataset_for_Motor_Imagery_and_Overt_Spatial_Attention_EEG-BCI_Control/23677098 https://kilthub.cmu.edu/articles/dataset/EEG-BCI_Dataset_for_Continuous_Tracking_using_Deep_Learning-based_Decoding_for_Non-invasive_Brain-Computer_Interface_/25360300