TY - JOUR
T1 - ADCoC
T2 - Adaptive Distribution Modeling Based Collaborative Clustering for Disentangling Disease Heterogeneity from Neuroimaging Data
AU - Liu, Hangfan
AU - Grothe, Michel J.
AU - Rashid, Tanweer
AU - Labrador-Espinosa, Miguel A.
AU - Toledo, Jon B.
AU - Habes, Mohamad
N1 - Publisher Copyright:
© 2017 IEEE.
PY - 2023/4/1
Y1 - 2023/4/1
N2 - Conventional clustering techniques for neuroimaging applications usually focus on capturing differences between given subjects, while neglecting arising differences between features and the potential bias caused by degraded data quality. In practice, collected neuroimaging data are often inevitably contaminated by noise, which may lead to errors in clustering and clinical interpretation. Additionally, most methods ignore the importance of feature grouping towards optimal clustering. In this paper, we exploit the underlying heterogeneous clusters of features to serve as weak supervision for improved clustering of subjects, which is achieved by simultaneously clustering subjects and features via nonnegative matrix tri-factorization. In order to suppress noise, we further introduce adaptive regularization based on coefficient distribution modeling. Particularly, unlike conventional sparsity regularization techniques that assume zero mean of the coefficients, we form the distributions using the data of interest so that they could better fit the non-negative coefficients. In this manner, the proposed approach is expected to be more effective and robust against noise. We compared the proposed method with standard techniques and recently published methods demonstrating superior clustering performance on synthetic data with known ground truth labels. Furthermore, when applying our proposed technique to magnetic resonance imaging (MRI) data from a cohort of patients with Parkinson's disease, we identified two stable and highly reproducible patient clusters characterized by frontal and posterior cortical/medial temporal atrophy patterns, respectively, which also showed corresponding differences in cognitive characteristics.
AB - Conventional clustering techniques for neuroimaging applications usually focus on capturing differences between given subjects, while neglecting arising differences between features and the potential bias caused by degraded data quality. In practice, collected neuroimaging data are often inevitably contaminated by noise, which may lead to errors in clustering and clinical interpretation. Additionally, most methods ignore the importance of feature grouping towards optimal clustering. In this paper, we exploit the underlying heterogeneous clusters of features to serve as weak supervision for improved clustering of subjects, which is achieved by simultaneously clustering subjects and features via nonnegative matrix tri-factorization. In order to suppress noise, we further introduce adaptive regularization based on coefficient distribution modeling. Particularly, unlike conventional sparsity regularization techniques that assume zero mean of the coefficients, we form the distributions using the data of interest so that they could better fit the non-negative coefficients. In this manner, the proposed approach is expected to be more effective and robust against noise. We compared the proposed method with standard techniques and recently published methods demonstrating superior clustering performance on synthetic data with known ground truth labels. Furthermore, when applying our proposed technique to magnetic resonance imaging (MRI) data from a cohort of patients with Parkinson's disease, we identified two stable and highly reproducible patient clusters characterized by frontal and posterior cortical/medial temporal atrophy patterns, respectively, which also showed corresponding differences in cognitive characteristics.
KW - Clustering
KW - MRI
KW - Parkinson's disease
KW - feature selection
KW - neuroimaging
UR - http://www.scopus.com/inward/record.url?scp=85122595642&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85122595642&partnerID=8YFLogxK
U2 - 10.1109/TETCI.2021.3136587
DO - 10.1109/TETCI.2021.3136587
M3 - Article
AN - SCOPUS:85122595642
SN - 2471-285X
VL - 7
SP - 308
EP - 318
JO - IEEE Transactions on Emerging Topics in Computational Intelligence
JF - IEEE Transactions on Emerging Topics in Computational Intelligence
IS - 2
ER -