Gathering and Mining Information from Web Log Files.

In this paper, a general methodology for gathering and mining information from Web log files is proposed. A series of tools to retrieve, store, and analyze the data extracted from log files has been designed and implemented. The aim is to form general methods by abstracting from the analysis of logs that use a well-defined standard format, such as the Extended Log File Format proposed by W3C. The methodology has been tested on the Web log files of The European Library portal; the experimental analyses led to personal, technical, geographical and temporal findings about usage and traffic load. Considerations are presented on more accurate tracking of users and user profiles, and on better management of crawler accesses through authentication.

Agosti, Maristella; Di Nunzio, Giorgio Maria

2007

Year: 2007
Book title: Digital Libraries: Research and Development, First International DELOS Conference, Revised Papers, Lecture Notes in Computer Science
DOI: https://dx.doi.org/10.1007/978-3-540-77088-6_10
WOS code: WOS:000252882500010
Scopus code: 2-s2.0-38149122145
ISBN: 9783540770879
Appears in types: 04.01 - Conference proceedings contribution

Use this identifier to cite or link to this document: https://hdl.handle.net/11577/2448843
