Clustering is a fundamental task in statistics and machine learning aimed at identifying homogeneous groups within data. Among the available approaches, fuzzy clustering (FC) is particularly suited to poorly separated boundaries, as it allows partial memberships and captures uncertainty and overlapping structures. We introduce Power Fuzzy Clustering (PFC), a general framework that unifies several FC methods, including fuzzy K-means and probabilistic distance clustering. By complementing the classical fuzzifier with a power parameter on distances, PFC enhances model flexibility while retaining interpretability. To accommodate clusters with different shapes and scales, we incorporate a volume parameter and alternative distance metrics (e.g., Mahalanobis and Minkowski). We further propose the Power Fuzzy Cluster-wise Regression (PFCR) model, a generalization of PFC that reduces to it when only intercepts are included. Simulation studies investigate the impact of the power and fuzzification parameters, and a real-world application illustrates the method's practical relevance.

Power fuzzy clustering: flexible distance metrics and inclusion of covariates

Punzo, Antonio
2026-01-01

Abstract

Clustering is a fundamental task in statistics and machine learning aimed at identifying homogeneous groups within data. Among the available approaches, fuzzy clustering (FC) is particularly suited to poorly separated boundaries, as it allows partial memberships and captures uncertainty and overlapping structures. We introduce Power Fuzzy Clustering (PFC), a general framework that unifies several FC methods, including fuzzy K-means and probabilistic distance clustering. By complementing the classical fuzzifier with a power parameter on distances, PFC enhances model flexibility while retaining interpretability. To accommodate clusters with different shapes and scales, we incorporate a volume parameter and alternative distance metrics (e.g., Mahalanobis and Minkowski). We further propose the Power Fuzzy Cluster-wise Regression (PFCR) model, a generalization of PFC that reduces to it when only intercepts are included. Simulation studies investigate the impact of the power and fuzzification parameters, and a real-world application illustrates the method's practical relevance.
2026
clus tering with covariates
cluster-wise regression
Fuzzy clustering
fuzzy K-means
probabilistic distance clustering
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.11769/717949
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact