Scalable algorithms for latent variable models in machine learning

dc.contributor.advisorDhillon, Inderjit S.
dc.contributor.committeeMemberPingali, Keshav
dc.contributor.committeeMemberQiu, Lili
dc.contributor.committeeMemberVishwananthan, S.V.N.
dc.contributor.committeeMemberLin, Chih-Jen
dc.creatorYu, Hsiang-Fu
dc.date.accessioned2016-10-13T19:22:45Z
dc.date.accessioned2018-01-22T22:30:46Z
dc.date.available2016-10-13T19:22:45Z
dc.date.available2018-01-22T22:30:46Z
dc.date.issued2016-08
dc.date.submittedAugust 2016
dc.date.updated2016-10-13T19:22:45Z
dc.description.abstractLatent variable modeling (LVM) is a popular approach in many machine learning applications, such as recommender systems and topic modeling, due to its ability to succinctly represent data, even in the presence of several missing entries. Existing learning methods for LVMs, while attractive, are infeasible for the large-scale datasets required in modern big data applications. In addition, such applications often come with various types of side information such as the text description of items and the social network among users in a recommender system. In this thesis, we present scalable learning algorithms for a wide range of latent variable models such as low-rank matrix factorization and latent Dirichlet allocation. We also develop simple but effective techniques to extend existing LVMs to exploit various types of side information and make better predictions in many machine learning applications such as recommender systems, multi-label learning, and high-dimensional time-series prediction. In addition, we also propose a novel approach for the maximum inner product search problem to accelerate the prediction phase of many latent variable models.
dc.description.departmentComputer Sciences
dc.format.mimetypeapplication/pdf
dc.identifierdoi:10.15781/T2J09W589
dc.identifier.urihttp://hdl.handle.net/2152/41635
dc.subjectLatent variable modeling
dc.subjectMatrix factorization
dc.subjectAlgorithms
dc.subjectData
dc.titleScalable algorithms for latent variable models in machine learning
dc.typeThesis
dc.type.materialtext

Files