Preparing Laboratory and Real-World EEG Data for Large-Scale Analysis: A Containerized Approach

Frontiers in Neuroinformatics 10: 7

Large-scale analysis of EEG and other physiological measures promises new insights into brain processes and more accurate and robust brain-computer interface models. However, the absence of standardized vocabularies for annotating events in a machine-understandable manner, the welter of collection-specific data organizations, the difficulty in moving data across processing platforms, and the unavailability of agreed-upon standards for preprocessing have prevented large-scale analyses of EEG. Here we describe a "containerized" approach and freely available tools we have developed to facilitate the process of annotating, packaging, and preprocessing EEG data collections to enable data sharing, archiving, large-scale machine learning/data mining, and (meta-)analysis. The EEG Study Schema (ESS) comprises three data "Levels," each with its own XML-document schema and file/folder convention, plus a standardized (PREP) pipeline to move raw (Data Level 1) data to a basic preprocessed state (Data Level 2) suitable for application of a large class of EEG analysis methods. Researchers can ship a study as a single unit and operate on its data using a standardized interface. ESS does not require a central database and provides all the metadata necessary to execute a wide variety of EEG processing pipelines. The primary focus of ESS is automated in-depth analysis and meta-analysis of EEG studies. However, ESS can also encapsulate meta-information for other modalities, such as eye tracking, that are increasingly used in both laboratory and real-world neuroimaging. ESS schema and tools are freely available at www.eegstudy.org, and a central catalog of over 850 GB of existing data in ESS format is available at studycatalog.org. These tools and resources are part of a larger effort to enable data sharing at sufficient scale for researchers to engage in truly large-scale EEG analysis and data mining (BigEEG.org).
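To make the "containerized" idea above more concrete, the following is a minimal Python sketch of a study shipped as a single unit: an XML manifest records the data level and the recordings, and a preprocessing step moves the study from Level 1 to Level 2. The manifest layout, the element and attribute names, the file paths, and the helpers `load_container` and `to_level2` are all hypothetical and chosen for illustration only; they are not the actual ESS schema or tools, and the `preprocess` callback merely stands in for a real pipeline such as PREP.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass
from typing import Callable, List

# Hypothetical manifest: element names, attributes, and paths are
# illustrative assumptions, NOT the real ESS XML schema.
MANIFEST = """<study level="1" title="Example study">
  <recording file="session1/rec01.dat" modality="EEG"/>
  <recording file="session1/eye01.dat" modality="EyeTracking"/>
</study>"""


@dataclass
class Recording:
    file: str
    modality: str


@dataclass
class StudyContainer:
    title: str
    level: int
    recordings: List[Recording]


def load_container(xml_text: str) -> StudyContainer:
    """Parse the (hypothetical) manifest into a self-describing container."""
    root = ET.fromstring(xml_text)
    recordings = [
        Recording(r.get("file"), r.get("modality"))
        for r in root.findall("recording")
    ]
    return StudyContainer(root.get("title"), int(root.get("level")), recordings)


def to_level2(container: StudyContainer,
              preprocess: Callable[[str], str]) -> StudyContainer:
    """Return a Level 2 container; only EEG recordings are preprocessed.

    `preprocess` maps a raw file path to the path of its preprocessed
    counterpart and is a placeholder for a real preprocessing pipeline.
    """
    if container.level != 1:
        raise ValueError("expected a Level 1 container")
    processed = [
        Recording(preprocess(r.file), r.modality) if r.modality == "EEG" else r
        for r in container.recordings
    ]
    return StudyContainer(container.title, 2, processed)


if __name__ == "__main__":
    raw = load_container(MANIFEST)
    clean = to_level2(raw, lambda path: "level2/" + path)
    print(clean.level, [r.file for r in clean.recordings])
```

Because the manifest travels with the data, a consumer in this sketch needs no central database to discover which recordings exist or which level they are at, which mirrors the design goal stated in the abstract.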

Accession: 058609400

PMID: 27014048

DOI: 10.3389/fninf.2016.00007
