Objectives: The Big Data and Deep Learning in the surveillance of occupational cancers (BEST) project aimed to enhance occupational cancer surveillance in Italy by using large-scale registry linkages. It addressed two methodological challenges: controlling for multiple testing/selective inference when profiling occupational categories and cancers, and assessing robustness of inference in presence of unmeasured confounding. Methods: Data were obtained through record linkage between the Italian cause-of-death registry (2005–2018) and the National Social Insurance Agency (INPS) database (1974–2018). Male blue-collar workers were classified by their longest held occupational sector, censoring the last 5 years before death. The study used a proportional cancer mortality design. Logistic regression models were fitted to estimate cause-specific mortality odds ratios (CMORs) for selected cancers and occupational sectors, adjusting for age, education, last region of residence, and year of death using the service sector as reference. Multiplicity was addressed through Q-Q plots with guide rails, q-values and control of false discovery rate, hierarchical Bayesian models and posterior rankings. E-values were calculated to assess the potential influence of unmeasured confounding. Results: Elevated CMORs emerged for several cancers and industries, including lung cancer in construction and fishing, pleural cancer in shipyards, and sinonasal cancers in leather and woodworking trades. Findings remained consistent after multiple testing adjustments and sensitivity analyses using E-values. Conclusions: We propose an integrated methodological framework that combines multiplicity-aware profiling and E-values to address selective inference from multiple testing and sensitivity to unmeasured confounding. The framework improves the interpretation and prioritization of signals in occupational cancer surveillance, providing robust insights to guide prevention strategies.
A methodological framework for occupational cancer surveillance: Addressing multiplicity and unmeasured confounding in Italian registry data
Berti, Mirko;Stoppa, Giorgia
;Catelan, Dolores
2026
Abstract
Objectives: The Big Data and Deep Learning in the surveillance of occupational cancers (BEST) project aimed to enhance occupational cancer surveillance in Italy by using large-scale registry linkages. It addressed two methodological challenges: controlling for multiple testing/selective inference when profiling occupational categories and cancers, and assessing robustness of inference in presence of unmeasured confounding. Methods: Data were obtained through record linkage between the Italian cause-of-death registry (2005–2018) and the National Social Insurance Agency (INPS) database (1974–2018). Male blue-collar workers were classified by their longest held occupational sector, censoring the last 5 years before death. The study used a proportional cancer mortality design. Logistic regression models were fitted to estimate cause-specific mortality odds ratios (CMORs) for selected cancers and occupational sectors, adjusting for age, education, last region of residence, and year of death using the service sector as reference. Multiplicity was addressed through Q-Q plots with guide rails, q-values and control of false discovery rate, hierarchical Bayesian models and posterior rankings. E-values were calculated to assess the potential influence of unmeasured confounding. Results: Elevated CMORs emerged for several cancers and industries, including lung cancer in construction and fishing, pleural cancer in shipyards, and sinonasal cancers in leather and woodworking trades. Findings remained consistent after multiple testing adjustments and sensitivity analyses using E-values. Conclusions: We propose an integrated methodological framework that combines multiplicity-aware profiling and E-values to address selective inference from multiple testing and sensitivity to unmeasured confounding. The framework improves the interpretation and prioritization of signals in occupational cancer surveillance, providing robust insights to guide prevention strategies.Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.




