SEUPD@CLEF Team RISE at LongEval: Improving Search by Crafting Titles and Matching URLs

Ferro N.
2025

Abstract

This report outlines the development of the RISE group’s Information Retrieval (IR) system for the LongEval-WebRetrieval CLEF 2025 Lab. The objective was to design an efficient, scalable search engine capable of handling large-scale French collections, with a focus on consistent performance. The proposed system has a modular architecture comprising a parser, an analyzer, an indexer, and a searcher, complemented by query translation and expansion using the Gemini LLM and by a non-neural reranking component to enhance retrieval quality. Emphasis was placed on speeding up indexing and searching through multi-threading, and on improving relevance by crafting a title for each document and by applying a URL-based document boost driven by the alignment between the user query and the document’s URL. The evaluation followed a stepwise enhancement approach, beginning with a Lucene-based baseline.
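The URL-based boosting mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not specify how the alignment between query and URL is computed, so everything below is an assumption for illustration only: the tokenization, the `URL_NOISE` list, the `weight` parameter, and the function names `url_boost` and `rerank` are hypothetical and are not taken from the RISE system.

```python
import re
from urllib.parse import urlparse

# Hypothetical list of URL fragments that carry no topical meaning.
URL_NOISE = {"www", "com", "fr", "org", "net", "html", "htm", "php"}


def tokenize(text):
    """Lowercase and split on non-alphanumeric characters, dropping short tokens."""
    return {t for t in re.split(r"[^0-9a-zà-ÿ]+", text.lower()) if len(t) > 2}


def url_tokens(url):
    """Tokens from the host and path of a URL, without common noise fragments."""
    parsed = urlparse(url)
    return tokenize(f"{parsed.netloc} {parsed.path}") - URL_NOISE


def url_boost(query, url, weight=0.5):
    """Multiplicative boost in [1, 1 + weight], assumed to grow linearly
    with the fraction of query tokens that also appear in the URL."""
    q = tokenize(query)
    if not q:
        return 1.0
    overlap = len(q & url_tokens(url)) / len(q)
    return 1.0 + weight * overlap


def rerank(query, hits, weight=0.5):
    """Re-sort (doc_id, url, score) hits by score multiplied by the URL boost."""
    boosted = [(doc_id, url, score * url_boost(query, url, weight))
               for doc_id, url, score in hits]
    return sorted(boosted, key=lambda h: h[2], reverse=True)


hits = [
    ("d1", "https://www.example.fr/recettes/tarte-pommes", 1.0),
    ("d2", "https://www.example.fr/actualites/sport", 1.1),
]
for doc_id, url, score in rerank("recette tarte pommes", hits):
    print(doc_id, round(score, 3))
```

In this toy example the first document overtakes the second because two of the three query tokens occur in its URL. A multiplicative boost was chosen here so that documents with no URL match keep their original first-stage score unchanged.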
Year: 2025
Language: English
Published in: 26th Working Notes of the Conference and Labs of the Evaluation Forum, CLEF 2025
Volume: 4038
Pages: 3433-3453 (21 pages)
Publisher: CEUR-WS
Conference venue: Faculties of Education and Psychology, esp
Keywords: CLEF 2025; Document Parsing; Information Retrieval; LongEval-WebRetrieval; Query Expansion; Query Translation; Search Engine; Temporal Evolution; URL Manipulation
Authors: Furlan, D.; Gibellato, G.; Nazirialhashem, S. S.; Pase, E.; Pasqualetto, A.; Tiberio, F.; Ferro, N.
Type: info:eu-repo/semantics/conferenceObject
Category: 04 Contribution in conference proceedings::04.01 - Contribution in conference proceedings
Use this identifier to cite or link to this document: https://hdl.handle.net/11577/3571889