Add Brain Invaders datasets #283

sylvchev · 2022-03-29T08:37:37Z

Adding new ERP datasets from GIPSA-lab.

The datasets were almost supported, as @plcrodrigues wrote repos based on MOABB. Currently, they are not up to date and require an old MNE version. This is a blocking issue in pyriemann-qiskit.

@jsosulski All datasets have been through a sanity check: you could see that @plcrodrigues have plotted evoked potentials and made basic classification tests.

I added all P300 datasets but py.ALPHA.EEG.2017-GIPSA (only alpha waves), py.PHMDML.EEG.2017-GIPSA (music listening), py.VR.EEG.2018-GIPSA (recording during VR session) that are not P300 datasets.

jsosulski · 2022-03-29T08:50:06Z

Great that there are already some plots. If this is merged, I would still just run my scripts to create plots similar to the ones for the other datasets and to make sure that in the process of adding these to MOABB there is no regression or something.

OT: Do we want to "officially" host these plots somwhere? Currently my webserver is still working, but at some point I will probably run into traffic limitations 😅

gcattan · 2022-03-29T08:53:55Z

Thank you for your help @sylvchev :) Do you think we could also integrate the other datasets in MOABB in the future?

toncho11 · 2022-03-29T08:58:10Z

Thank you for your work!

@jsosulski where are the sanity checks (code and plots)?

jsosulski · 2022-03-29T08:57:22Z

moabb/datasets/braininvaders.py


 from moabb.datasets import download as dl
 from moabb.datasets.base import BaseDataset


-BI2013a_URL = "https://zenodo.org/record/1494240/files/"
+BI2012a_URL = "https://zenodo.org/record/2649069/files/"
+BI2013a_URL = "https://zenodo.org/record/2669187/files/"


I am currently running into page not found issues on zenodo (I think thats why CI is currently failing as well) so I cant check, but: is there a difference in bi2013a between the old and the new URLs? If so we probably need to bump minor version.

The files are available on https://zenodo.org/record/1494240/ and https://zenodo.org/record/2649069/
The old link was v2, the new link is v7. The difference is the storage format: it was gdf and it is now csv + mat. The data are the same. I will make a version bump for to include these datasets.

jsosulski · 2022-03-29T09:02:10Z

moabb/datasets/braininvaders.py

+            chtypes = ["eeg"] * 17 + ["stim"]
+            X = loadmat(file_path)[condition].T
+            S = X[1:18, :]
+            stim = (X[18, :] + X[19, :])[None, :]


Probably not necessary for this first merge to make it work with current binary P300, however if there is stimulus information available, we could keep it to be able to classify the letter / stimulus x,y location in the Braininvaders case.

I agree, this is a good thing that the letter information are available in the data. We could use this for P300-speller classification.

sylvchev · 2022-03-29T09:06:55Z

OT: Do we want to "officially" host these plots somwhere? Currently my webserver is still working, but at some point I will probably run into traffic limitations 😅

There ebrains.eu but it seems complex to host data. OSF might be a better choice: https://help.osf.io/article/386-project-storage, there is up to 50 Gb of data for public project it seems

jsosulski · 2022-03-29T09:06:59Z

@toncho11 the code is here. Note that this script will download ALL available P300 datasets in MOABB and create the plots which takes a) very long and b) a lot of disk space.

Currently I hosted the plots for all P300 datasets in MOABB here: http://public.jan-sosulski.de/moabb_sanity/master.html
In the long term, I would like to make that hosting more interactive / link to relevant sections directly from dataset documentation.
But again, this hosting can (and probably will soon) go offline as my webserver does not support a high load of traffic.

sylvchev · 2022-03-29T09:07:43Z

Do you think we could also integrate the other datasets in MOABB in the future?

Yes, if we have the corresponding paradigms, that is why I could not add them right away.

jsosulski · 2022-04-02T08:18:42Z

I just tried out to load the dry EEG data from your branch @sylvchev and noticed that the data needs to be scaled by 1-e6, as it is stored in uV and mne expects V. Probably worth checking how the other paradigms store the data.

gcattan · 2022-04-06T10:46:09Z

Hi,
Just realized that [py.VR.EEG.2018-GIPSA](https://github.com/plcrodrigues/py.VR.EEG.2018-GIPSA) is also a P300 paradigm. But seems that there is already a lot of work in this PR ^^'

toncho11 · 2022-04-06T10:51:20Z

I just tested bi2012 and it works.

sylvchev · 2022-04-06T13:07:31Z

Hi, Just realized that [py.VR.EEG.2018-GIPSA](https://github.com/plcrodrigues/py.VR.EEG.2018-GIPSA) is also a P300 paradigm. But seems that there is already a lot of work in this PR ^^'

Yes, I have seen that, but in Pedro's repo the data are not formatted like the other. So I'm leaving it for another PR.

sylvchev · 2022-04-06T13:29:22Z

thanks for your remark @jsosulski, several datasets were not stored in the correct format. I checked all dataset and scaled down with uncorrected values.
The code is updated, LGTM.

add brain invaders datasets

2158adc

sylvchev self-assigned this Mar 29, 2022

sylvchev added the dataset Supporting new dataset label Mar 29, 2022

sylvchev mentioned this pull request Mar 29, 2022

Add bi2012 dependency pyRiemann/pyRiemann-qiskit#37

Closed

jsosulski reviewed Mar 29, 2022

View reviewed changes

add version added for datasets

65a579c

Sylvain Chevallier added 2 commits April 6, 2022 15:10

correct scaling factor for dry electrodes

477a214

apply scaling factor for uncorrected datasets

e25cb37

sylvchev merged commit f310573 into NeuroTechX:develop Apr 6, 2022

sylvchev deleted the bi_datasets branch January 3, 2023 09:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Brain Invaders datasets #283

Add Brain Invaders datasets #283

sylvchev commented Mar 29, 2022

jsosulski commented Mar 29, 2022

gcattan commented Mar 29, 2022

toncho11 commented Mar 29, 2022

jsosulski Mar 29, 2022

sylvchev Mar 29, 2022

jsosulski Mar 29, 2022

sylvchev Mar 29, 2022

sylvchev commented Mar 29, 2022

jsosulski commented Mar 29, 2022 •

edited

Loading

sylvchev commented Mar 29, 2022

jsosulski commented Apr 2, 2022 •

edited

Loading

gcattan commented Apr 6, 2022

toncho11 commented Apr 6, 2022

sylvchev commented Apr 6, 2022

sylvchev commented Apr 6, 2022

Add Brain Invaders datasets #283

Add Brain Invaders datasets #283

Conversation

sylvchev commented Mar 29, 2022

jsosulski commented Mar 29, 2022

gcattan commented Mar 29, 2022

toncho11 commented Mar 29, 2022

jsosulski Mar 29, 2022

Choose a reason for hiding this comment

sylvchev Mar 29, 2022

Choose a reason for hiding this comment

jsosulski Mar 29, 2022

Choose a reason for hiding this comment

sylvchev Mar 29, 2022

Choose a reason for hiding this comment

sylvchev commented Mar 29, 2022

jsosulski commented Mar 29, 2022 • edited Loading

sylvchev commented Mar 29, 2022

jsosulski commented Apr 2, 2022 • edited Loading

gcattan commented Apr 6, 2022

toncho11 commented Apr 6, 2022

sylvchev commented Apr 6, 2022

sylvchev commented Apr 6, 2022

jsosulski commented Mar 29, 2022 •

edited

Loading

jsosulski commented Apr 2, 2022 •

edited

Loading