Big data from small data: data-sharing in the 'long tail' of neuroscience

In this Commentary, Martone and colleagues discuss the potential benefits of sharing small datasets, also called “long-tail” data, in the Neuroscience community. They introduce the pros and cons associated with data sharing, describe the existing attitudes toward such initiative, introduce best prac...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Nature neuroscience 2014-11, Vol.17 (11), p.1442-1447
Hauptverfasser: Ferguson, Adam R, Nielson, Jessica L, Cragin, Melissa H, Bandrowski, Anita E, Martone, Maryann E
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:In this Commentary, Martone and colleagues discuss the potential benefits of sharing small datasets, also called “long-tail” data, in the Neuroscience community. They introduce the pros and cons associated with data sharing, describe the existing attitudes toward such initiative, introduce best practices and offer their views on why and how the field should establish a credit system for sharing “long-tail” data. The launch of the US BRAIN and European Human Brain Projects coincides with growing international efforts toward transparency and increased access to publicly funded research in the neurosciences. The need for data-sharing standards and neuroinformatics infrastructure is more pressing than ever. However, 'big science' efforts are not the only drivers of data-sharing needs, as neuroscientists across the full spectrum of research grapple with the overwhelming volume of data being generated daily and a scientific environment that is increasingly focused on collaboration. In this commentary, we consider the issue of sharing of the richly diverse and heterogeneous small data sets produced by individual neuroscientists, so-called long-tail data. We consider the utility of these data, the diversity of repositories and options available for sharing such data, and emerging best practices. We provide use cases in which aggregating and mining diverse long-tail data convert numerous small data sources into big data for improved knowledge about neuroscience-related disorders.
ISSN:1097-6256
1546-1726
DOI:10.1038/nn.3838