Self-Organizing Maps capable of encoding structured information will be used for the clustering of XML documents. Documents formatted in XML are appropriately represented as graph data structures. It will be shown that the Self-Organizing Maps can be trained in an unsupervised fashion to group XML structured data into clusters, and that this task is scaled in linear time with in- creasing size of the corpus. It will also be shown that some simple prior knowl- edge of the data structures is beneficial to the efficient grouping of the XML documents.

Clustering XML Documents using Self-Organizing Maps for Structures

SPERDUTI, ALESSANDRO;
2006

Abstract

Self-Organizing Maps capable of encoding structured information will be used for the clustering of XML documents. Documents formatted in XML are appropriately represented as graph data structures. It will be shown that the Self-Organizing Maps can be trained in an unsupervised fashion to group XML structured data into clusters, and that this task is scaled in linear time with in- creasing size of the corpus. It will also be shown that some simple prior knowl- edge of the data structures is beneficial to the efficient grouping of the XML documents.
2006
Advances in XML Information Retrieval and Evaluation
9783540349624
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11577/1559885
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 21
  • ???jsp.display-item.citation.isi??? 8
social impact