Event-related potential datasets based on a three-stimulus paradigm
© Vařeka et al.; licensee BioMed Central Ltd. 2014
Received: 23 June 2014
Accepted: 16 October 2014
Published: 12 December 2014
The event-related potentials technique is widely used in cognitive neuroscience research. The P300 waveform has been explored in many research articles because of its wide applications, such as lie detection or brain-computer interfaces (BCI). However, very few datasets are publicly available, so most researchers use only their private datasets for analysis. This makes results difficult to compare, particularly in brain-computer interface research.
Here we present electroencephalography/event-related potentials (EEG/ERP) data. The data were obtained from 20 healthy subjects and were acquired using an odd-ball hardware stimulator. The visual stimulation was based on a three-stimulus paradigm and included target, non-target and distractor stimuli. The data and collected metadata are shared in the EEG/ERP Portal.
The paper also describes the validation process and its results for the presented data. The data were validated using two different methods. The first method evaluated the data by measuring the percentage of artifacts. The second method tested whether the expectation of the experimental results was fulfilled (i.e., whether the target trials contained the P300 component). The validation showed that most datasets were suitable for subsequent analysis.
The presented datasets together with their metadata provide researchers with an opportunity to study the P300 component from different perspectives. Furthermore, they can be used for BCI research.
Keywords: Event-related potentials, P300, Three-stimulus paradigm, Visual stimulation, LED
Purpose of the study
In recent decades, research into event-related potentials (ERP) using a classic odd-ball paradigm has become very popular. However, studies on the neural substrates of the P300 and other ERP components are still lacking. In , the authors propose to use a three-stimulus paradigm to explore the P300 component in more detail. The purpose of this study was to make three-stimulus paradigm EEG/ERP datasets freely available to the neuroinformatics community. To the authors’ knowledge, no three-stimulus paradigm datasets have been published.
The BrainVision Recorder 1.2 was used for recording and storing the EEG/ERP data in the BrainVision format. The Recorder was initialized using the following parameters:
the sampling rate was set to 1 kHz
the resolution was set to 0.1 μV
the recording low-pass filter was set with the cut-off frequency of 250 Hz
The impedance threshold was set to 10 kΩ, and the real impedances for each experiment are stored in the vhdr files.
The stimulator described above was used in the stimulation protocol. In our experiments, the stimulator settings were used as follows: each diode flashed once a second and each flash took 500 ms. The probabilities of the red, green and yellow diodes flashing were 83%, 13.5% and 3.5%, respectively. Between two occurrences of target stimulus (green diodes flashing), at least one non-target stimulus appeared. Otherwise, the order of stimuli was completely random.
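The ordering rule described above can be illustrated with a short Python sketch. It is not the firmware of the hardware stimulator, whose implementation is not described here. The function name, the labels, and the choice to redraw a stimulus whenever a target would follow another target without an intervening non-target are assumptions made for illustration only. Redrawing slightly lowers the realized proportion of targets compared with the nominal 13.5%.

```python
import random

# Nominal probabilities from the text (red, green and yellow diodes).
# The label strings are illustrative, not identifiers from the stimulator.
PROBABILITIES = {"non-target": 0.83, "target": 0.135, "distractor": 0.035}


def generate_sequence(n_stimuli, seed=None):
    """Draw a random stimulus order with at least one non-target
    between any two targets (assumed to be enforced by redrawing)."""
    rng = random.Random(seed)
    labels = list(PROBABILITIES)
    weights = list(PROBABILITIES.values())
    sequence = []
    targets_blocked = False  # True from a target until the next non-target
    while len(sequence) < n_stimuli:
        stimulus = rng.choices(labels, weights=weights)[0]
        if stimulus == "target":
            if targets_blocked:
                continue  # would violate the rule, so draw again
            targets_blocked = True
        elif stimulus == "non-target":
            targets_blocked = False
        sequence.append(stimulus)
    return sequence


if __name__ == "__main__":
    # One stimulus per second for five minutes gives 300 stimuli.
    print(generate_sequence(300, seed=0)[:10])
```

In this sketch a distractor does not lift the block on targets, which is the strict reading of "at least one non-target stimulus"; a looser reading would also be consistent with the text.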
The participants sat 1 m from the stimulator for 20 minutes. The experimental protocol was divided into three phases, each containing 30 target stimuli and lasting five minutes. There was a short break between the phases. The participants were asked to sit comfortably, not to move and to limit their eye blinking. They were instructed to pay attention to the stimulation.
All experiments were recorded in May-July 2012 between 9 am and 5 pm. A soundproof cabin illuminated with moderate white light was used for the experiments.
A group of 25 healthy volunteers participated in our experiments. However, the data from five of the volunteers were discarded because these participants blinked excessively during the experiment. The data from the remaining 20 subjects (9 males and 11 females, university students, aged 20-26, 19 of them right-handed, half of them with corrected myopia) were stored. Informed consent was signed by all participants.
The following experimental procedure was applied:
Each participant was acquainted with the course of the experiment and answered questions concerning his/her health. Each participant was given the standard EEG cap made by Electro-Cap International. The international 10-20 system of electrode placement was used. In fact, 19 electrodes were used as depicted in Figure 2. The participant was taken to a soundproof and electrically shielded cabin, and the reference electrode was placed at the root of his/her nose.
The participant was told to watch the stimulator, and to follow the rules described in Section “Stimulation protocol”.
The cabin was closed and both the data recording and stimulation started.
After the experiment had finished, the recorded data and collected metadata were uploaded to the EEG/ERP Portal.
The data were validated in two different ways:
1) The first test was used to check the data obtained for eye-blinking artifacts. The percentage of epochs damaged by eye blinks was estimated using visual inspection of the data for each subject separately.
2) The second test was used to validate the objective of the odd-ball paradigm experiments: for most participants, the target and non-target markers are expected to be associated with differently shaped ERP components, especially P2, N2, and P3. To validate this objective, dichotomous classification was used. If classification of a specific dataset yielded low error rates (defined later), the objective of the odd-ball paradigm was considered to be fulfilled. Distractor stimuli, which are thought to be associated with the NoGo-P300, are harder to detect in the EEG signal and were thus excluded from the validation process. Furthermore, to the authors’ best knowledge, there are no datasets publicly available that contain distractor stimuli data.
The classifier was trained on a randomly selected data subset. The training subset contained 730 ERP trials with equal numbers of targets and non-targets. The trained classifier was then applied to the data of individual subjects. Following this, the classifier was also tested on public data produced by another laboratory. The stimulation protocol described in  is similar to the protocol described in this paper; it differs only in the length of the inter-stimulus intervals.
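Drawing a class-balanced training subset of this kind can be sketched as follows. This is an illustrative Python example, not the Matlab code used for the validation. The function name, the 0/1 label encoding and the error handling are assumptions.

```python
import random


def balanced_subset(trials, labels, n_total=730, seed=None):
    """Randomly pick n_total trials, half targets (label 1) and
    half non-targets (label 0). Hypothetical helper for illustration."""
    rng = random.Random(seed)
    per_class = n_total // 2
    target_idx = [i for i, y in enumerate(labels) if y == 1]
    non_target_idx = [i for i, y in enumerate(labels) if y == 0]
    if min(len(target_idx), len(non_target_idx)) < per_class:
        raise ValueError("not enough trials in one of the classes")
    # Sample without replacement from each class, then mix the order.
    chosen = rng.sample(target_idx, per_class) + rng.sample(non_target_idx, per_class)
    rng.shuffle(chosen)
    return [trials[i] for i in chosen], [labels[i] for i in chosen]
```

With n_total=730 the subset holds 365 targets and 365 non-targets. Balancing keeps a classifier from reaching a low error rate simply by always predicting the far more frequent non-target class.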
Matlab scripts available in  using EEGLAB and BCILAB functions were used for the implementation. Both feature extraction and classification follow the Windowed Means Method proposed in . This method comprises feature extraction (low-pass filtering and spatial filtering) followed by machine learning based on shrinkage Linear Discriminant Analysis. The continuous signal was split into epochs using the stimuli markers, with the pre-stimulus interval for baseline correction set to 500 ms and the post-stimulus interval set to 1000 ms. As a result, the post-stimulus parts of the epochs did not overlap. The S2 marker (the green diode flashing) corresponded to the target stimuli occurrence and the S4 marker (the red diode flashing) to the non-target stimuli occurrence. After epoch extraction, the epoch signal was band-pass-filtered with the cut-off frequencies of 0.1 Hz and 8 Hz.
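The epoching step can be sketched for a single channel as follows. This is a simplified Python illustration, not the EEGLAB/BCILAB code used by the authors. It assumes that baseline correction means subtracting the mean of the 500 ms pre-stimulus interval, and that markers too close to the edges of the recording are skipped. The function and parameter names are hypothetical, and filtering is not included.

```python
def extract_epochs(signal, marker_positions, sampling_rate_hz=1000,
                   pre_ms=500, post_ms=1000):
    """Cut baseline-corrected post-stimulus epochs from one channel.

    signal           -- list of samples of a single EEG channel
    marker_positions -- sample indices at which stimuli occurred
    """
    pre = pre_ms * sampling_rate_hz // 1000    # baseline length in samples
    post = post_ms * sampling_rate_hz // 1000  # epoch length in samples
    epochs = []
    for pos in marker_positions:
        if pos - pre < 0 or pos + post > len(signal):
            continue  # not enough data around this marker
        # Assumed baseline correction: subtract the pre-stimulus mean.
        baseline = sum(signal[pos - pre:pos]) / pre
        epochs.append([value - baseline for value in signal[pos:pos + post]])
    return epochs
```

At the 1 kHz sampling rate each epoch contains 1000 post-stimulus samples. Because one stimulus was presented per second, consecutive post-stimulus segments do not overlap.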
The narrow band-pass filter was used to eliminate as much undesired noise as possible for the subsequent classification. Then, each epoch was down-sampled to 100 samples. In order to extract the features, the intervals following the occurrence of stimuli were chosen as listed below:
200 ms - 250 ms
250 ms - 300 ms
300 ms - 350 ms
350 ms - 400 ms
400 ms - 450 ms
450 ms - 500 ms
The intervals were chosen to correspond to the occurrence of ERP components that differ significantly for target and non-target stimuli. For each interval, the average value for each EEG channel was calculated. These averages formed the feature vectors.
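The construction of the feature vectors can be illustrated with a minimal Python sketch. It assumes that each down-sampled epoch covers exactly the 1000 ms post-stimulus interval, so that with 100 samples each sample spans 10 ms and each 50 ms interval spans 5 samples. The function name and the data layout (a list of channels, each a list of samples) are illustrative and do not come from the authors' Matlab scripts.

```python
# The six intervals listed above, in milliseconds after stimulus onset.
WINDOWS_MS = [(200, 250), (250, 300), (300, 350),
              (350, 400), (400, 450), (450, 500)]


def windowed_means(epoch, epoch_length_ms=1000):
    """Return one mean per channel and interval as a flat feature vector.

    epoch -- list of channels; each channel is a list of samples that is
             assumed to span epoch_length_ms after stimulus onset
    """
    features = []
    for channel in epoch:
        n_samples = len(channel)
        for start_ms, end_ms in WINDOWS_MS:
            # Convert milliseconds to sample indices with integer arithmetic.
            start = start_ms * n_samples // epoch_length_ms
            end = end_ms * n_samples // epoch_length_ms
            window = channel[start:end]
            features.append(sum(window) / len(window))
    return features
```

With 19 channels and 6 intervals this yields 114 features per trial, which is small enough for a shrinkage-regularized linear classifier to handle with a few hundred training trials.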
The resulting error rates indicate the extent to which the classifier was unable to separate target and non-target single trials.
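As a point of reference, an error rate of this kind can be computed as shown in the Python sketch below. It assumes the common definition, the fraction of single trials assigned to the wrong class; the exact definition used for the validation is not reproduced in this text, so this is an assumption rather than the authors' formula.

```python
def error_rate(true_labels, predicted_labels):
    """Fraction of trials whose predicted class differs from the true class
    (assumed definition, for illustration only)."""
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label sequences must have the same length")
    if not true_labels:
        raise ValueError("at least one trial is required")
    wrong = sum(1 for t, p in zip(true_labels, predicted_labels) if t != p)
    return wrong / len(true_labels)
```

Under this definition, and with balanced classes, a classifier that guesses at random would be expected to score about 50%.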
Note that the classification results may differ with each run because the training process is non-deterministic. For comparison, the error rates achieved for external data from three subjects were 30.5%, 36.3% and 28.5%, respectively.
Availability and requirements
To download and analyze the data described in this article, the following projects are available:
Project name: EEG/ERP Portal
Project home page: http://eegdatabase.kiv.zcu.cz
Operating system(s): Platform independent
Programming language: Java
Other requirements: tested in Internet Explorer 10, 11, Mozilla Firefox 29.0.1, Google Chrome
License: GNU GPL
Project name: P3-validator
Project home page: https://github.com/INCF/p3-validator
Operating system(s): Platform independent
Programming language: Matlab
Other requirements: Matlab 2010a or newer, preferably 64bit operating system
License: GNU GPL
Availability of supporting data
The data sets supporting the results of this article are available in the EEG/ERP Portal under the following URL: http://eegdatabase.kiv.zcu.cz/.
Supporting material for this paper can also be found in the GigaScience database, GigaDB.
To download the experimental data and metadata using the EEG/ERP Portal, the user must take the following steps:
The registration form must be filled out.
The user must log in using his/her e-mail address and password.
The Experiments section must be selected in the header of the page.
The “Event-related potential datasets based on a three-stimulus-paradigm” package contains the datasets related to this article.
The data and related metadata can be selected and confirmed after clicking on the Download button. By selecting “Choose all”, the user can download all the data and metadata related to the specific experiment.
P300: Cognitive positive event-related potential component.
This work was supported by the UWB grant SGS-2013-039 Methods and Applications of Bio- and Medical Informatics.
- Polich J: Updating P300: an integrative theory of P3a and P3b. Clin Neurophysiol. 2007, 118 (10): 2128-2148. 10.1016/j.clinph.2007.04.019.
- Moucek R, Jezek P: EEG/ERP portal. 2009, http://eegdatabase.kiv.zcu.cz/
- Vareka L, Bruha P, Moucek R: Supporting material for: “Event-related potential datasets based on three-stimulus paradigm”. 2014, http://dx.doi.org/10.5524/100111
- BrainProducts: Brain vision recorder. 2012, http://www.brainproducts.com/productdetails.php?id=21
- Dudacek K, Mautner P, Moucek R, Novotny J: Odd-ball protocol stimulator for neuroinformatics research. Applied Electronics (AE), 2011 International Conference on, Sept:1–4. 2011, Piscataway, New Jersey: IEEE
- Blankertz B, Lemm S, Treder MS, Haufe S, Müller KR: Single-trial analysis and classification of ERP components - A tutorial. NeuroImage. 2011, 56 (2): 814-825. 10.1016/j.neuroimage.2010.06.048.
- Hoffmann U, Vesin JM, Ebrahimi T, Diserens K: An efficient P300-based brain-computer interface for disabled subjects. J Neurosci Methods. 2008, 167: 115-125. 10.1016/j.jneumeth.2007.03.005.
- Vareka L, Bruha P, Moucek R: P3-validator. 2014, https://github.com/INCF/p3-validator
- Delorme A, Mullen T, Kothe C, Acar ZA, Bigdely-Shamlo N, Vankov A, Makeig S: EEGLAB, SIFT, NFT, BCILAB, and ERICA: new tools for advanced EEG processing. Intell Neurosci. 2011, 2011: 10:10-10:10.
- Luck S: An introduction to the event-related potential technique. 2005, Cambridge MA, USA: MIT Press
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.