Concerning bivariate least squares linear regression, the classical results obtained for extreme structural models in earlier attempts (Isobe et al., 1990; Feigelson and Babu, 1992) are reviewed using a new formalism in terms of deviation (matrix) traces which, for homoscedastic data, reduce to usual quantities leaving aside an unessential (but dimensional) multiplicative factor. Within the framework of classical error models, the dependent variable relates to the independent variable according to a variant of the usual additive model. The classes of linear models considered are regression lines in the limit of uncorrelated errors in X and in Y. The following models are considered in detail: (Y) errors in X negligible (ideally null) with respect to errors in Y; (X) errors in Y negligible (ideally null) with respect to errors in X; (C) oblique regression; (O) orthogonal regression; (R) reduced major-axis regression; (B) bisector regression. For homoscedastic data, the results are taken from earlier attempts and rewritten using a more compact notation. For heteroscedastic data, the results are inferred from a procedure related to functional models (York, 1966; Caimmi, 2011). An example of astronomical application is considered, concerning the [O/H]-[Fe/H] empirical relations deduced from five samples related to different stars and/or different methods of oxygen abundance determination. For low-dispersion samples and assigned methods, different regression models yield results which are in agreement within the errors for both heteroscedastic and homoscedastic data, while the contrary holds for large-dispersion samples. In any case, samples related to different methods produce discrepant results, due to the presence of (still undetected) systematic errors, which implies no definitive statement can be made at present. Asymptotic expressions approximate regression line slope and intercept variance estimators, for normal residuals, to a better extent with respect to earlier attempts. Related fractional discrepancies are not exceeding a few percent for low-dispersion data, which grows up to about 10% for large-dispersion data. An extension of the formalism to generic structural models is left to a forthcoming paper.

Bivariate least squares linear regression: towards a unified analytic formalism. II. Extreme structural models

CAIMMI, ROBERTO
2012

Abstract

Concerning bivariate least squares linear regression, the classical results obtained for extreme structural models in earlier attempts (Isobe et al., 1990; Feigelson and Babu, 1992) are reviewed using a new formalism in terms of deviation (matrix) traces which, for homoscedastic data, reduce to usual quantities leaving aside an unessential (but dimensional) multiplicative factor. Within the framework of classical error models, the dependent variable relates to the independent variable according to a variant of the usual additive model. The classes of linear models considered are regression lines in the limit of uncorrelated errors in X and in Y. The following models are considered in detail: (Y) errors in X negligible (ideally null) with respect to errors in Y; (X) errors in Y negligible (ideally null) with respect to errors in X; (C) oblique regression; (O) orthogonal regression; (R) reduced major-axis regression; (B) bisector regression. For homoscedastic data, the results are taken from earlier attempts and rewritten using a more compact notation. For heteroscedastic data, the results are inferred from a procedure related to functional models (York, 1966; Caimmi, 2011). An example of astronomical application is considered, concerning the [O/H]-[Fe/H] empirical relations deduced from five samples related to different stars and/or different methods of oxygen abundance determination. For low-dispersion samples and assigned methods, different regression models yield results which are in agreement within the errors for both heteroscedastic and homoscedastic data, while the contrary holds for large-dispersion samples. In any case, samples related to different methods produce discrepant results, due to the presence of (still undetected) systematic errors, which implies no definitive statement can be made at present. Asymptotic expressions approximate regression line slope and intercept variance estimators, for normal residuals, to a better extent with respect to earlier attempts. Related fractional discrepancies are not exceeding a few percent for low-dispersion data, which grows up to about 10% for large-dispersion data. An extension of the formalism to generic structural models is left to a forthcoming paper.
2012
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11577/2578447
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact