Browsing by Subject "Mixture models"

Bayesian Semiparametric Density Deconvolution and Regression in the Presence of Measurement Errors
(2014-06-24) Sarkar, Abhra
Although the literature on measurement error problems is quite extensive, solutions to even the most fundamental measurement error problems, such as density deconvolution and regression with errors-in-covariates, are available only under numerous simplifying and unrealistic assumptions. This dissertation demonstrates that Bayesian methods, by accommodating measurement errors through natural hierarchies, can provide a very powerful framework for solving these important measurement error problems under more realistic scenarios. However, the very presence of measurement errors often renders techniques that are successful in measurement-error-free scenarios inefficient, numerically unstable, computationally challenging, or intractable. Additionally, measurement error problems often have unique features that compound modeling and computational challenges. In this dissertation, we develop novel Bayesian semiparametric approaches that cater to these unique challenges and allow us to break free from many restrictive parametric assumptions of previously existing approaches. We first consider the problem of univariate density deconvolution when replicated proxies are available for each unknown value of the variable of interest. Existing deconvolution methods often make restrictive and unrealistic assumptions about the density of interest and the distribution of measurement errors, e.g., normality and homoscedasticity, and thus independence from the variable of interest. We relax these assumptions and develop robust and efficient deconvolution approaches based on Dirichlet process mixture models and mixtures of B-splines in the presence of conditionally heteroscedastic measurement errors.
We then extend the methodology to nonlinear univariate regression with errors-in-covariates when the densities of the covariate, the regression errors, and the measurement errors are all unknown, and the regression and measurement errors are conditionally heteroscedastic. The final section of this dissertation is devoted to the development of flexible multivariate density deconvolution approaches. The methods available in the sparse existing literature all assume the measurement error density to be fully specified. In contrast, we develop multivariate deconvolution approaches for scenarios in which the measurement error density is unknown but replicated proxies are available for each subject. We consider scenarios in which the measurement errors are distributed independently of the vector-valued variable of interest as well as scenarios in which they are conditionally heteroscedastic. To meet the significantly harder modeling and computational challenges of the multivariate problem, we exploit properties of finite mixture models, multivariate normal kernels, latent factor models, and exchangeable priors in many novel ways. We provide theoretical results showing the flexibility of the proposed models. In simulation experiments, the proposed semiparametric methods vastly outperform previously existing approaches. Our methods also significantly outperform theoretically more flexible nonparametric alternatives, even when the true data-generating process closely conforms to these alternatives. The methods automatically encompass a variety of simplified parametric scenarios as special cases and often outperform their competitors even in those special scenarios for which the competitors were specifically designed. We illustrate the practical usefulness of the proposed methodology by successfully applying the methods to problems in nutritional epidemiology. The methods can be readily adapted and applied to similar problems from other areas of applied research. They also provide the foundation for many interesting extensions and analyses.
Efficient approaches in network inference
(2016-12) Ray, Avik; Sanghavi, Sujay Rajendra, 1979-; Shakkottai, Sanjay; Baccelli, Francois; de Veciana, Gustavo; Caramanis, Constantine; Ravikumar, Pradeep
Network-based inference is almost ubiquitous in modern machine learning applications. In this dissertation we investigate several such problems motivated by applications in social networks, biological networks, recommendation systems, targeted advertising, etc. Unavailability of the graph, presence of latent factors, and large network size often make these inference tasks challenging. We develop both generative models and efficient algorithms to solve such problems. We provide analytical guarantees, in terms of accuracy and computation time, for all our algorithms and demonstrate their applicability on many real datasets. This dissertation consists mainly of two parts. In the first part we consider three different problems. We first consider the task of learning the Markov network structure in a discrete graphical model. We develop three fast greedy algorithms to solve this problem, which succeed even in graphs with strong non-neighbor interactions where previous methods based on convex optimization fail. Next we consider the problem of learning latent user interests in different topics, using cascades that spread over a network. Our new algorithm infers both user interests and topics in large cascades better than standard topic modeling algorithms, which do not consider the network structure. In the third problem we develop a novel recursive algorithm based on convex relaxation to detect overlapping communities in a graph. The second part of the dissertation develops a mathematical framework to handle different sources of side information and use it to improve inference in networks. First, however, we demonstrate a much more general technique for incorporating a variety of side information in estimating a single component of a mixture model, e.g., a Gaussian mixture model, latent Dirichlet allocation, subspace clustering, or mixed linear regression.
We then use a similar technique to solve the problem of identifying a single target community in a graph, using reference nodes or biased node weights as side information. Our algorithms are based on a variant of the method of moments, and are much faster and more accurate than other unsupervised and semi-supervised algorithms.
