Annotated Compendium of 102 Breast Cancer Gene-Expression Datasets

Published: 2023-09-24

Author:

Nwosu Ifeanyichukwu O., Tabler Daniel D., Chipman Greg, Piccolo Stephen R.

Abstract

Transcriptomic data from breast-cancer patients are widely available in public repositories. However, before researchers can perform statistical inference or draw biological interpretations from such data, they must find relevant datasets, download the data, and perform quality checks. In many cases, it is also useful to normalize and standardize the data for consistency and to use updated genome annotations. Additionally, researchers need to parse and interpret metadata, that is, the clinical and demographic characteristics of patients. Each of these steps requires computational and/or biomedical expertise, which imposes a barrier to reuse for many researchers. We have identified and curated 102 publicly available breast-cancer datasets representing 17,151 patients. We created a reproducible computational pipeline to download the data, perform quality checks, renormalize the raw gene-expression measurements (when available), assign gene identifiers from multiple databases, and annotate the metadata against the National Cancer Institute Thesaurus, making it easier to infer semantic meaning and compare insights across datasets. We have made the curated data and pipeline freely available for other researchers to use. Having these resources in one place promises to accelerate breast-cancer research by enabling researchers to address diverse types of questions using data from a variety of patient populations and study contexts.
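
The metadata-annotation step described above can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the field name `er_status`, the helper functions, and the labels in `TERM_MAP` are all hypothetical placeholders (they are not real National Cancer Institute Thesaurus codes). The sketch only shows why mapping free-text values from different datasets onto one shared vocabulary makes them comparable.

```python
# Hypothetical mapping from free-text metadata values to standardized labels.
# The labels are illustrative placeholders, NOT real NCI Thesaurus codes.
TERM_MAP = {
    "er+": "ER_POSITIVE",
    "er positive": "ER_POSITIVE",
    "pos": "ER_POSITIVE",
    "er-": "ER_NEGATIVE",
    "er negative": "ER_NEGATIVE",
    "neg": "ER_NEGATIVE",
}


def harmonize(value):
    """Return the standardized label for a raw value, or None if unmapped."""
    if value is None:
        return None
    # Normalize case and whitespace before the lookup.
    key = " ".join(value.strip().lower().split())
    return TERM_MAP.get(key)


def harmonize_dataset(records, field="er_status"):
    """Return copies of the records with `field` replaced by its standardized label."""
    return [{**rec, field: harmonize(rec.get(field))} for rec in records]


# Two made-up datasets that encode the same characteristic differently.
dataset_a = [
    {"sample": "A1", "er_status": "ER+"},
    {"sample": "A2", "er_status": " er negative "},
]
dataset_b = [
    {"sample": "B1", "er_status": "Pos"},
    {"sample": "B2", "er_status": "unknown"},
]

combined = harmonize_dataset(dataset_a) + harmonize_dataset(dataset_b)
for rec in combined:
    print(rec["sample"], rec["er_status"])
```

Values that the mapping does not recognize come back as `None` rather than being guessed, so unmapped entries stay visible for manual review.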

Publisher

Cold Spring Harbor Laboratory


Cited by 1 article:

1. A Comprehensive Meta-Analysis of Breast Cancer Gene Expression (2024-09-02)