9dcdb25289213c43.tex
1: \begin{abstract}
2: A  model-based approach is developed for clustering categorical data with no natural ordering. The proposed method exploits the Hamming distance to define a family of probability mass functions to model the data. The elements of this family are then considered as kernels of a finite mixture model with unknown number of components.
3:  Conjugate Bayesian inference has been derived for the parameters of the Hamming distribution model. The mixture  is framed in a Bayesian nonparametric setting and a  transdimensional blocked Gibbs sampler is developed to provide 
4: full Bayesian inference on the number of clusters, their structure and the group-specific parameters, facilitating the computation with respect to customary reversible jump algorithms. 
5: The proposed model encompasses a parsimonious latent class model as a special case, when the number of components is fixed. 
6: Model performances are assessed via a simulation study and reference datasets, showing improvements in clustering recovery over existing approaches. 
7: \end{abstract}
8: