Abstract
AbstractMotivationIt is necessary to develop exploratory tools to learn from the unprecedented volume of high-dimensional multi-omic data currently being produced. We have developed an R package, SJD, which identifies components of variation that are shared across multiple matrices. The approach focuses specifically on variation across the samples/cells within each dataset while incorporating biologist-defined hierarchical structure among input experiments that can spanin vivoandin vitrosystems, multi-omic data modalities, and species.ResultsSJD enables the definition of molecular variation that is conserved across systems, those that are shared within subsets of studies, and elements unique to individual matrices. We have included functions to simplify the construction and visualization of highly complexin silicoexperiments involving many diverse multi-omic matrices from multiple species. Here we apply SJD to decompose four RNA-seq experiments focused on neurogenesis in the neocortex. The public datasets used in this analysis and the conserved transcriptomic dynamics in mammalian neurogenesis that we define here can view viewed and explored together athttps://nemoanalytics.org/p?l=ChenEtAlSJD2022&g=DCX.Availability and ImplementationThe SJD package can be found athttps://chuansite.github.io/SJD.Contacthzchenhuan@gmail.com;ccolant1@jhmi.eduSupplementary informationSupplementary data are available at Bioinformatics online.
Publisher
Cold Spring Harbor Laboratory