A Systematic Assessment of Feature Extraction Methods for Robust Prediction of Neuropsychological Scores from Functional Connectivity Data

Testolin A.; De Filippo De Grazia M.; Zorzi M.
2020

Abstract

Multivariate prediction of human behavior from resting state data is gaining increasing popularity in the neuroimaging community, with far-reaching translational implications in neurology and psychiatry. However, the high dimensionality of neuroimaging data increases the risk of overfitting, calling for the use of dimensionality reduction methods to build robust predictive models. In this work, we assess the ability of four dimensionality reduction techniques to extract relevant features from resting state functional connectivity matrices of stroke patients, which are then used to build a predictive model of the associated language deficits based on cross-validated regularized regression. Features extracted by Principal Component Analysis (PCA) were found to be the best predictors, followed by Independent Component Analysis (ICA), Dictionary Learning (DL) and Non-Negative Matrix Factorization. However, ICA and DL led to more parsimonious models. Overall, our findings suggest that the choice of the dimensionality reduction technique should not only be based on prediction/regression accuracy, but also on considerations about model complexity and interpretability.
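As an illustration of the pipeline summarized in the abstract, the following is a minimal sketch using scikit-learn, not the authors' released code: it compares PCA, ICA, Dictionary Learning, and NMF as feature extractors on placeholder connectivity data before fitting cross-validated ridge regression. Array shapes, component counts, and the regularization grid are illustrative assumptions.

```python
# Minimal sketch of the pipeline described in the abstract (not the authors' code):
# reduce vectorized functional-connectivity features with one of four decomposition
# methods, then predict a behavioral score with cross-validated ridge regression.
import numpy as np
from sklearn.decomposition import PCA, FastICA, DictionaryLearning, NMF
from sklearn.linear_model import RidgeCV
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline

rng = np.random.default_rng(0)
n_patients, n_edges = 100, 500                   # placeholder sizes (illustrative only)
X = rng.standard_normal((n_patients, n_edges))   # stand-in for vectorized FC matrices
y = rng.standard_normal(n_patients)              # stand-in for language scores

reducers = {
    "PCA": PCA(n_components=20),
    "ICA": FastICA(n_components=20, max_iter=1000),
    "DL": DictionaryLearning(n_components=20, max_iter=500),
    "NMF": NMF(n_components=20, max_iter=500),
}

for name, reducer in reducers.items():
    X_in = X - X.min() if name == "NMF" else X   # NMF requires non-negative input
    model = make_pipeline(reducer, RidgeCV(alphas=np.logspace(-3, 3, 13)))
    scores = cross_val_score(model, X_in, y, cv=5, scoring="r2")
    print(f"{name}: mean cross-validated R^2 = {scores.mean():.3f}")
```

The paper's point about parsimony could be probed in such a setup by also inspecting how many regression coefficients remain non-negligible for each feature set, not just the cross-validated accuracy.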
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
ISBN: 978-3-030-59276-9; 978-3-030-59277-6
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11577/3355881
Citations
  • PubMed Central: not available
  • Scopus: 3
  • Web of Science: 2