Bayesian variable selection in clustering via dirichlet process mixture models

dc.contributorVannucci, Marina
dc.creatorKim, Sinae
dc.date.accessioned2007-09-17T19:36:48Z
dc.date.accessioned2017-04-07T19:53:28Z
dc.date.available2007-09-17T19:36:48Z
dc.date.available2017-04-07T19:53:28Z
dc.date.created2003-05
dc.date.issued2007-09-17
dc.description.abstractThe increased collection of high-dimensional data in various fields has raised a strong interest in clustering algorithms and variable selection procedures. In this disserta- tion, I propose a model-based method that addresses the two problems simultane- ously. I use Dirichlet process mixture models to define the cluster structure and to introduce in the model a latent binary vector to identify discriminating variables. I update the variable selection index using a Metropolis algorithm and obtain inference on the cluster structure via a split-merge Markov chain Monte Carlo technique. I evaluate the method on simulated data and illustrate an application with a DNA microarray study. I also show that the methodology can be adapted to the problem of clustering functional high-dimensional data. There I employ wavelet thresholding methods in order to reduce the dimension of the data and to remove noise from the observed curves. I then apply variable selection and sample clustering methods in the wavelet domain. Thus my methodology is wavelet-based and aims at clustering the curves while identifying wavelet coefficients describing discriminating local features. I exemplify the method on high-dimensional and high-frequency tidal volume traces measured under an induced panic attack model in normal humans.
dc.identifier.urihttp://hdl.handle.net/1969.1/5888
dc.language.isoen_US
dc.publisherTexas A&M University
dc.subjectBayesian inference
dc.subjectClustering
dc.subjectDirichlet process mixture model
dc.subjectDNA microarray data analysis
dc.subjectvariable selection
dc.subjectwavelet shrinkage
dc.titleBayesian variable selection in clustering via dirichlet process mixture models
dc.typeBook
dc.typeThesis

Files