TY - JOUR
T1 - SMGR
T2 - A joint statistical method for integrative analysis of single-cell multi-omics data
AU - Song, Qianqian
AU - Zhu, Xuewei
AU - Jin, Lingtao
AU - Chen, Minghan
AU - Zhang, Wei
AU - Su, Jing
N1 - Publisher Copyright:
© 2022 The Author(s). Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics.
PY - 2022/9/1
Y1 - 2022/9/1
N2 - Unravelling the regulatory programs from single-cell multi-omics data has long been one of the major challenges in genomics, especially in the current emerging single-cell field. Currently there is a huge gap between fast-growing single-cell multi-omics data and effective methods for the integrative analysis of these inherent sparse and heterogeneous data. In this study, we have developed a novel method, Single-cell Multi-omics Gene co-Regulatory algorithm (SMGR), to detect coherent functional regulatory signals and target genes from the joint single-cell RNA-sequencing (scRNA-seq) and single-cell assay for transposase-accessible chromatin using sequencing (scATAC-seq) data obtained from different samples. Given that scRNA-seq and scATAC-seq data can be captured by zero-inflated Negative Binomial distribution, we utilize a generalized linear regression model to identify the latent representation of consistently expressed genes and peaks, thus enables the identification of co-regulatory programs and the elucidation of regulating mechanisms. Results from both simulation and experimental data demonstrate that SMGR outperforms the existing methods with considerably improved accuracy. To illustrate the biological insights of SMGR, we apply SMGR to mixed-phenotype acute leukemia (MPAL) and identify the MPAL-specific regulatory program with significant peak-gene links, which greatly enhance our understanding of the regulatory mechanisms and potential targets of this complex tumor.
AB - Unravelling the regulatory programs from single-cell multi-omics data has long been one of the major challenges in genomics, especially in the current emerging single-cell field. Currently there is a huge gap between fast-growing single-cell multi-omics data and effective methods for the integrative analysis of these inherent sparse and heterogeneous data. In this study, we have developed a novel method, Single-cell Multi-omics Gene co-Regulatory algorithm (SMGR), to detect coherent functional regulatory signals and target genes from the joint single-cell RNA-sequencing (scRNA-seq) and single-cell assay for transposase-accessible chromatin using sequencing (scATAC-seq) data obtained from different samples. Given that scRNA-seq and scATAC-seq data can be captured by zero-inflated Negative Binomial distribution, we utilize a generalized linear regression model to identify the latent representation of consistently expressed genes and peaks, thus enables the identification of co-regulatory programs and the elucidation of regulating mechanisms. Results from both simulation and experimental data demonstrate that SMGR outperforms the existing methods with considerably improved accuracy. To illustrate the biological insights of SMGR, we apply SMGR to mixed-phenotype acute leukemia (MPAL) and identify the MPAL-specific regulatory program with significant peak-gene links, which greatly enhance our understanding of the regulatory mechanisms and potential targets of this complex tumor.
UR - http://www.scopus.com/inward/record.url?scp=85136089436&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85136089436&partnerID=8YFLogxK
U2 - 10.1093/nargab/lqac056
DO - 10.1093/nargab/lqac056
M3 - Article
C2 - 35910046
AN - SCOPUS:85136089436
SN - 2631-9268
VL - 4
JO - NAR Genomics and Bioinformatics
JF - NAR Genomics and Bioinformatics
IS - 3
M1 - lqac056
ER -