The NIDDK Central Repository at 8 years--ambition, revision, use and impact

The National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK) Central Repository makes data and biospecimens from NIDDK-funded research available to the broader scientific community. It thereby facilitates: the testing of new hypotheses without new data or biospecimen collection; pool...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Database : the journal of biological databases and curation 2011-01, Vol.2011, p.bar043-bar043
Hauptverfasser: Turner, Charles F, Pan, Huaqin, Silk, Gregg W, Ardini, Mary-Anne, Bakalov, Vesselina, Bryant, Stephanie, Cantor, Susanna, Chang, Kung-yen, DeLatte, Michael, Eggers, Paul, Ganapathi, Laxminarayana, Lakshmikanthan, Sujatha, Levy, Joshua, Li, Sheping, Pratt, Joseph, Pugh, Norma, Qin, Ying, Rasooly, Rebekah, Ray, Helen, Richardson, Jean E, Riley, Amanda Flynn, Rogers, Susan M, Scheper, Charlotte, Tan, Sylvia, White, Stacie, Cooley, Philip C
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK) Central Repository makes data and biospecimens from NIDDK-funded research available to the broader scientific community. It thereby facilitates: the testing of new hypotheses without new data or biospecimen collection; pooling data across several studies to increase statistical power; and informative genetic analyses using the Repository's well-curated phenotypic data. This article describes the initial database plan for the Repository and its revision using a simpler model. Among the lessons learned were the trade-offs between the complexity of a database design and the costs in time and money of implementation; the importance of integrating consent documents into the basic design; the crucial need for linkage files that associate biospecimen IDs with the masked subject IDs used in deposited data sets; and the importance of standardized procedures to test the integrity data sets prior to distribution. The Repository is currently tracking 111 ongoing NIDDK-funded studies many of which include genotype data, and it houses over 5 million biospecimens of more than 25 types including serum, plasma, stool, urine, DNA, red blood cells, buffy coat and tissue. Repository resources have supported a range of biochemical, clinical, statistical and genetic research (188 external requests for clinical data and 31 for biospecimens have been approved or are pending). Genetic research has included GWAS, validation studies, development of methods to improve statistical power of GWAS and testing of new statistical methods for genetic research. We anticipate that the future impact of the Repository's resources on biomedical research will be enhanced by (i) cross-listing of Repository biospecimens in additional searchable databases and biobank catalogs; (ii) ongoing deployment of new applications for querying the contents of the Repository; and (iii) increased harmonization of procedures, data collection strategies, questionnaires etc. across both research studies and within the vocabularies used by different repositories.
ISSN:1758-0463
1758-0463
DOI:10.1093/database/bar043