The best of both worlds: combining MOCAR and MCDISP

TUZZI, ARJUNA
2000

Abstract

In the analysis of a corpus of responses to open-ended questions, one of the most important goals is to identify words that distinguish between groups of respondents. The MOCAR procedure within SpadT does this using hypergeometric probabilities (Lebart et al., 1998). However, although the words obtained may occur only within a particular group, the researcher has no indication of how they are distributed within that group. A word may be selected that is specific to one or two responses rather than representative of the group as a whole. We address this problem using the MCDISP procedure developed by Baayen (1996): the words identified by MOCAR are checked for significant under-dispersion, which would indicate that they are confined to a subset of the texts. We illustrate this with data from a corpus of open interviews with graduates of the University of Padua.
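
The two steps named in the abstract can be illustrated with a minimal Python sketch. It is not the SpadT or Baayen implementation: the function names, parameters, and the simple random-placement scheme are assumptions made for illustration only. The first function computes a hypergeometric upper-tail probability of the kind used to flag a word as characteristic of a group; the second estimates by Monte Carlo simulation how likely it is that a word with a given frequency would appear in so few texts of the group by chance.

```python
from math import comb
import random


def hypergeom_upper_tail(k, K, n, N):
    """P(X >= k): the word occurs K times in a corpus of N tokens,
    and the group contributes n of those tokens (hypothetical names)."""
    upper = min(K, n)
    favourable = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return favourable / comb(N, n)


def dispersion_p_value(text_sizes, observed_texts, freq, runs=2000, seed=0):
    """Monte Carlo estimate of P(D <= observed_texts), where D is the number
    of texts containing the word when its freq occurrences are placed at
    random among all token positions of the group (an assumed null model)."""
    rng = random.Random(seed)
    # One entry per token position, holding the index of the text it belongs to.
    owners = [t for t, size in enumerate(text_sizes) for _ in range(size)]
    hits = 0
    for _ in range(runs):
        placed = rng.sample(owners, freq)
        if len(set(placed)) <= observed_texts:
            hits += 1
    return hits / runs


if __name__ == "__main__":
    # A word seen 10 times, all within a single text out of 20 texts of 50 tokens each.
    p = dispersion_p_value([50] * 20, observed_texts=1, freq=10)
    print("under-dispersion p-value estimate:", p)
```

A small estimated probability from the second function would suggest under-dispersion, that is, a word concentrated in fewer texts than random placement would produce.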
JADT 2000 - Actes de 5es Journées Internationales d’Analyse Statistique des Données Textuelles
Use this identifier to cite or link to this document: https://hdl.handle.net/11577/1373094