Cancer Publication Portal: an online tool for summarizing and searching human cancer-genomic publications [version 1; peer review: 2 approved with reservations]
Published in: | F1000Research 2019, Vol. 8, p. 2073 |
---|---|
Main authors: | , , , |
Format: | Article |
Language: | eng |
Online access: | Full text |
Summary: | A search of PubMed lists >582,000 citations with the keywords "cancer" and "gene". The large volume of cancer genomic publications necessitates the development of text-mining tools to help cancer researchers navigate and summarize articles efficiently. We developed a Cancer Publication Portal (CPP) to help researchers efficiently search and summarize cancer genomic publications, based on one or more genes of interest. CPP integrates data from several sources, including PubTator; the Medical Subject Headings (MeSH) database; the HUGO Gene Nomenclature Committee human gene name database; PubMed, a database of biomedical literature citations; and the National Cancer Institute (NCI) Thesaurus. Following each query, results are summarized and include the publication frequency for each cancer type, as well as publication frequencies for cancer terms, pharmacological agents, genomic mutations, and additional genes stratified by cancer type. Cancer terms were identified by comparing titles and abstracts from cancer-related (N=851,868) and non-cancer-related (N=2,607,020) articles. CPP allows a user to quickly obtain publication statistics, such as the frequency of articles mentioning EGFR across cancer types, and to explore associations, such as the association between pharmacological agent and cancer type. Result summaries are interactive, so additional filters can be easily added as the literature is explored. After a search is completed, a PubTator collection can be quickly created to view article titles and abstracts in PubTator. CPP currently includes information for ~1.1 million cancer-related publications associated with >23,000 human genes. Database URL: https://gdancik.github.io/bioinformatics/CPP/ |
---|---|
ISSN: | 2046-1402 |
DOI: | 10.12688/f1000research.21463.1 |
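
The summary describes reporting, for one or more query genes, how many publications mention them in each cancer type. The following is a minimal Python sketch of that kind of tally, not the CPP implementation. The record layout (PMID, gene symbol, cancer type), the sample values, the function name `cancer_type_frequencies`, and the choice to match articles mentioning any of the query genes are all assumptions made for illustration.

```python
from collections import Counter

# Hypothetical annotation records: (PMID, gene symbol, cancer type).
# The values are made up for illustration and are not CPP data.
RECORDS = [
    (1, "EGFR", "Lung Neoplasms"),
    (2, "EGFR", "Lung Neoplasms"),
    (2, "EGFR", "Lung Neoplasms"),   # repeated mention in the same article
    (3, "EGFR", "Glioblastoma"),
    (4, "TP53", "Breast Neoplasms"),
    (4, "EGFR", "Breast Neoplasms"),
]


def cancer_type_frequencies(records, genes):
    """Count distinct articles per cancer type that mention any query gene."""
    wanted = {g.upper() for g in genes}
    # Reduce to unique (article, cancer type) pairs so that repeated
    # mentions within one article are counted only once.
    pairs = {
        (pmid, cancer)
        for pmid, gene, cancer in records
        if gene.upper() in wanted
    }
    return Counter(cancer for _, cancer in pairs)


freqs = cancer_type_frequencies(RECORDS, ["EGFR"])
for cancer, n_articles in freqs.most_common():
    print(f"{cancer}: {n_articles}")
```

Counting unique (article, cancer type) pairs rather than raw mentions keeps one heavily annotated article from inflating a cancer type's frequency. Further filters, such as restricting to a pharmacological agent, could be applied to the records before counting.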