Show simple item record

dc.contributor.advisorMarcotte, Edward M.
dc.creatorWoods, John Oates, IIIen
dc.date.accessioned2014-02-21T21:44:30Zen
dc.date.accessioned2017-05-11T22:45:39Z
dc.date.available2017-05-11T22:45:39Z
dc.date.issued2013-12en
dc.date.submittedDecember 2013en
dc.identifier.urihttp://hdl.handle.net/2152/23292en
dc.descriptiontexten
dc.description.abstractPhenotypes and diseases may be related by seemingly dissimilar phenotypes in other species by means of the orthology of underlying genes. Such "orthologous phenotypes," or "phenologs," are examples of deep homology, and one member of the orthology relationship may be used to predict candidate genes for its counterpart. (There exists evidence of "paralogous phenotypes" as well, but validation is non-trivial.) In Chapter 2, I demonstrate the utility of including plant phenotypes in our database, and provide as an example the prediction of mammalian neural crest defects from an Arabidopsis thaliana phenotype, negative gravitropism defective. In the third chapter, I describe the incorporation of additional phenotypes into our database (including chicken, zebrafish, E. coli, and new C. elegans datasets). I present a method, developed in coordination with Martin Singh-Blom, for ranking predicted candidate genes by way of a k nearest neighbors naïve Bayes classifier drawing phenolog information from a variety of species. The fourth chapter relates to a computational method and application for identifying shared and overlapping pathways which contribute to phenotypes. I describe a method for rapidly querying a database of phenotype--gene associations for Boolean combinations of phenotypes which yields improved predictions. This method offers insight into the divergence of orthologous pathways in evolution. I demonstrate connections between breast cancer and zebrafish methylmercury response (through oxidative stress and apoptosis); human myopathy and plant red light response genes, minus those involved in water deprivation response (via autophagy); and holoprosencephaly and an array of zebrafish phenotypes. In the first appendix, I present the SciRuby Project, which I co-founded in order to bring scientific libraries to the Ruby programming language. I describe the motivation behind SciRuby and my role in its creation. Finally in Appendix B, I discuss the first beta release of NMatrix, a dense and sparse matrix library for the Ruby language, which I developed in part to facilitate and validate rapid phenolog searches. In this work, I describe the concept of phenologs as well as the development of the necessary computational tools for discovering phenotype orthology relationships, for predicting associated genes, and for statistically validating the discovered relationships and predicted associations.en
dc.format.mimetypeapplication/pdfen
dc.subjectDeep homologyen
dc.subjectPhenologsen
dc.subjectPhenotype orthologyen
dc.subjectPhenotype paralogyen
dc.subjectHomologyen
dc.subjectGene--phenotype associationsen
dc.subjectRubyen
dc.subjectScirubyen
dc.subjectNmatrixen
dc.subjectk nearest neighborsen
dc.titlePredicting gene–phenotype associations in humans and other species from orthologous and paralogous phenotypesen
dc.typeThesisen
dc.description.departmentCellular and Molecular Biologyen
dc.date.updated2014-02-21T21:44:30Zen


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record