This paper studies the problem of building a machine learning method for biologicaldata. Various feature selection methods and classifier design strategies have been generally used and compared. However, most published articles have applied a certain technique to a certain dataset, and recently several researchers compared these techniques based on several public datasets. We propose an ensemble of classifiers that combine a linear classifier, linear support vector machine, a non-linear classifier, radial basis-support vector machines and a Subspace Classifier. We validate our new method on several recent publicly available datasets both with predictive accuracy of testing samples and through cross validation. Compared with the best performance of other current methods, remarkably improved results can be obtained using our new strategy on a wide range of different datasets. On a wide range of recently published datasets, our method performs better, or is at least comparable to, the current best methods of our knowledge.

Ensemblator: an ensemble of classifiers for reliable classification of Biological Data

NANNI, LORIS;
2007

Abstract

This paper studies the problem of building a machine learning method for biologicaldata. Various feature selection methods and classifier design strategies have been generally used and compared. However, most published articles have applied a certain technique to a certain dataset, and recently several researchers compared these techniques based on several public datasets. We propose an ensemble of classifiers that combine a linear classifier, linear support vector machine, a non-linear classifier, radial basis-support vector machines and a Subspace Classifier. We validate our new method on several recent publicly available datasets both with predictive accuracy of testing samples and through cross validation. Compared with the best performance of other current methods, remarkably improved results can be obtained using our new strategy on a wide range of different datasets. On a wide range of recently published datasets, our method performs better, or is at least comparable to, the current best methods of our knowledge.
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11577/157689
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 50
  • ???jsp.display-item.citation.isi??? 41
social impact