The paper provides a new and more explicit formulation of the assumptions needed by the ordinary ecological regression to provide unbiased estimates and clarifies why violations of these assumptions will affect any method of ecological inference. Empir- ical evidence is provided by showing that estimates provided by three main ecological inference methods are heavily biased when compared to multilevel logistic regression applied to a unique set of individual data on voting behaviour. The main findings of our paper have two important implications that apply to all situations where the assumptions needed to apply ecological inference are violated in the data; (i) only ecological inference methods that allow one to model the effect of covariates have a chance to produce unbiased estimates, (ii) there are certain data generating mechanisms producing a kind of bias in ecological estimates which cannot be corrected by modelling the effect of covariates.

Ecological fallacy and covariates: new insights based on multilevel modelling of individual data

Venera Tomaselli;
2018-01-01

Abstract

The paper provides a new and more explicit formulation of the assumptions needed by the ordinary ecological regression to provide unbiased estimates and clarifies why violations of these assumptions will affect any method of ecological inference. Empir- ical evidence is provided by showing that estimates provided by three main ecological inference methods are heavily biased when compared to multilevel logistic regression applied to a unique set of individual data on voting behaviour. The main findings of our paper have two important implications that apply to all situations where the assumptions needed to apply ecological inference are violated in the data; (i) only ecological inference methods that allow one to model the effect of covariates have a chance to produce unbiased estimates, (ii) there are certain data generating mechanisms producing a kind of bias in ecological estimates which cannot be corrected by modelling the effect of covariates.
2018
Ecological inference
Voting behaviour
Logistic regression
Multilevel models.
File in questo prodotto:
File Dimensione Formato  
2018_TOMASELLI_International Statistical Review .pdf

solo utenti autorizzati

Descrizione: Articolo principale
Tipologia: Versione Editoriale (PDF)
Licenza: NON PUBBLICO - Accesso privato/ristretto
Dimensione 564.98 kB
Formato Adobe PDF
564.98 kB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.11769/314740
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 11
  • ???jsp.display-item.citation.isi??? 9
social impact