Abstract
Abstract
Objectives
A novel graph data model of non-small cell lung cancer clinical and genomic data has been constructed with two aims: (1) provide a suitable model for facilitating graph analytics within the Neo4j framework or through tools which can interact through existing Neo4j APIs; and (2) provide a base model extensible to other cancer types and additional datasets such as those derived from electronic health records and other real world sources.
Data description
Clinical and genomic data integrated with a novel property graph database schema from publicly available datasets and analyses based on The Cancer Genome Atlas lung cancer datasets augmented by with subgraphs patient-patient social network from similarity and correlation as well as individual based biological networks.
Publisher
Springer Science and Business Media LLC
Subject
General Biochemistry, Genetics and Molecular Biology,General Medicine
Reference22 articles.
1. Cancer Complexity Knowledge Portal. NIH National Cancer Institute-sponsored Cancer Systems Biology Consortium (CSBC) https://www.cancercomplexity.synapse.org/
2. Hochheiser H, Castine M, Harris D, Savova G, Jacobson RS. An information model for computable cancer phenotypes. BMC Med Inform Decis Making. 2016. https://doi.org/10.1186/s12911-016-0358-4.
3. Timón-Reina S, Rincón M, Martínez-Tomás R. An overview of graph databases and their applications in the biomedical domain. Database. 2021. https://doi.org/10.1093/database/baab026·.
4. TigerGraph: Graph Database | Graph Analytics Platform; https://www.tigergraph.com. Accessed 29 Dec 2021.
5. Neo4j Graph Platform – The Leader in Graph Databases Neo4j Graph Database Platform; https://neo4j.com. Accessed 29 Dec 2021.
Cited by
5 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献