For a continuous-scale diagnostic test, the area under the receiver operating characteristic curve (AUC) is a popular summary measure to assess the ability of the test to discriminate between healthy and diseased subjects. In some studies, verification of the true disease status is performed only for a subset of subjects, selected possibly on the basis of the test result and of other characteristics of the subjects. Estimators of the AUC based only on this subset of subjects are typically biased; this is known as verification bias. Some methods have been proposed to correct verification bias, but they require parametric models for the (conditional) probability of disease and/or the (conditional) probability of verification. A wrong specification of such parametric models can affect the behaviour of the estimators, which can be inconsistent. To avoid misspecification problems, in this paper we propose a fully nonparametric method for the estimation of the AUC of a continuous test under verification bias. The method is based on nearest-neighbor imputation and adopts generic smooth regression models for both the probability that a subject is diseased and the probability that it is verified. The new AUC estimator is consistent and asymptotically normal under the assumption that the true disease status, if missing, is missing at random (MAR). A simple extension which deals with stratified samples is also provided. Simulation experiments are used to investigate the finite sample behaviour of the proposed methods. An illustrative example is presented.

Nonparametric verification bias-corrected inference for the area under the ROC curve of a continuous-scale diagnostic test

Gianfranco Adimari
;
Monica Chiogna
2017

Abstract

For a continuous-scale diagnostic test, the area under the receiver operating characteristic curve (AUC) is a popular summary measure to assess the ability of the test to discriminate between healthy and diseased subjects. In some studies, verification of the true disease status is performed only for a subset of subjects, selected possibly on the basis of the test result and of other characteristics of the subjects. Estimators of the AUC based only on this subset of subjects are typically biased; this is known as verification bias. Some methods have been proposed to correct verification bias, but they require parametric models for the (conditional) probability of disease and/or the (conditional) probability of verification. A wrong specification of such parametric models can affect the behaviour of the estimators, which can be inconsistent. To avoid misspecification problems, in this paper we propose a fully nonparametric method for the estimation of the AUC of a continuous test under verification bias. The method is based on nearest-neighbor imputation and adopts generic smooth regression models for both the probability that a subject is diseased and the probability that it is verified. The new AUC estimator is consistent and asymptotically normal under the assumption that the true disease status, if missing, is missing at random (MAR). A simple extension which deals with stratified samples is also provided. Simulation experiments are used to investigate the finite sample behaviour of the proposed methods. An illustrative example is presented.
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11577/3261710
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 4
  • ???jsp.display-item.citation.isi??? 4
social impact