Model selection criteria constitute one of the most critical issues in data clustering via mixture modeling. In this framework, criteria based on penalized log-likelihood (e.g. AIC and BIC) are usually adopted and one single model is finally selected. In this paper, an approach taking into account uncertainty in model selection is proposed in the maximum likelihood framework yielding a conditional probability distribution on the number of components, given a sample. The assessment of the number of population components, which are referred to as underlying populations, is also proposed based on employing a non-parametric bootstrap sampling of the observed data sample. Finally, a novel clustering approach relying on the developed concepts is presented. The proposal is illustrated on the ground of a numerical study based on both simulated and real data.

Uncertain model selection criteria for mixture modeling

Ingrassia, Salvatore
Methodology
2026-01-01

Abstract

Model selection criteria constitute one of the most critical issues in data clustering via mixture modeling. In this framework, criteria based on penalized log-likelihood (e.g. AIC and BIC) are usually adopted and one single model is finally selected. In this paper, an approach taking into account uncertainty in model selection is proposed in the maximum likelihood framework yielding a conditional probability distribution on the number of components, given a sample. The assessment of the number of population components, which are referred to as underlying populations, is also proposed based on employing a non-parametric bootstrap sampling of the observed data sample. Finally, a novel clustering approach relying on the developed concepts is presented. The proposal is illustrated on the ground of a numerical study based on both simulated and real data.
2026
Cluster analysis
Finite mixture model
Mixture order
Model selection criteria
Uncertainty
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.11769/704750
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact