The cluster-weighted model (CWM) is a mixture model with random covariates that allows for flexible clustering/classification and distribution estimationof a random vector composed of a response variable and a set of covariates. Withinthis class of models, the generalized linear exponential CWMis here introduced especiallyfor modeling bivariate data of mixed-type. Its natural counterpart in the familyof latent class models is also defined. Maximum likelihood parameter estimates are derived using the expectation-maximization algorithm and some computational issuesare detailed. Through Monte Carlo experiments, the classification performance of the proposed model is compared with other mixture-based approaches, consistency of the estimators of the regression coefficients is evaluated, and several likelihood-basedinformation criteria are compared for selecting the number of mixture components. An application to real data is also finally considered.

Clustering Bivariate Mixed-Type Data via the Cluster-Weighted Model

PUNZO, ANTONIO
;
INGRASSIA, Salvatore
2016-01-01

Abstract

The cluster-weighted model (CWM) is a mixture model with random covariates that allows for flexible clustering/classification and distribution estimationof a random vector composed of a response variable and a set of covariates. Withinthis class of models, the generalized linear exponential CWMis here introduced especiallyfor modeling bivariate data of mixed-type. Its natural counterpart in the familyof latent class models is also defined. Maximum likelihood parameter estimates are derived using the expectation-maximization algorithm and some computational issuesare detailed. Through Monte Carlo experiments, the classification performance of the proposed model is compared with other mixture-based approaches, consistency of the estimators of the regression coefficients is evaluated, and several likelihood-basedinformation criteria are compared for selecting the number of mixture components. An application to real data is also finally considered.
2016
Mixture models with random covariates; Model-based clustering; Cluster-weighted models; Generalized linear models; Mixed-type data
File in questo prodotto:
File Dimensione Formato  
Punzo & Ingrassia (2016) - Computational Statistics.pdf

solo gestori archivio

Descrizione: Articolo principale
Tipologia: Versione Editoriale (PDF)
Dimensione 908.35 kB
Formato Adobe PDF
908.35 kB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.11769/18585
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 31
  • ???jsp.display-item.citation.isi??? 24
social impact