Statistical Stemmers: A Reproducibility Study
Silvello, Gianmaria; Bucco, Riccardo; Fornari, Giacomo; Langeli, Andrea; Purpura, Alberto; Tezza, Alessandro; Agosti, Maristella
2018
Abstract
Statistical stemmers are important components of Information Retrieval (IR) systems, especially for text search over languages with few linguistic resources. Research on stemmers has produced notable results in recent years, particularly in 2011, when three language-independent stemmers were published in relevant venues. In this paper, we describe our efforts to reproduce these three stemmers. We also release the code as open source, together with an extended version of the Terrier system that integrates the developed stemmers.




