Applying sequence-to-sequence RNN models to IR-based bug localization

dc.contributor.advisorKhurshid, Sarfraz
dc.contributor.committeeMemberSaha, Ripon K
dc.creatorLemons, Clayton Lindsay
dc.creator.orcid0000-0002-4731-9372
dc.date.accessioned2016-11-10T18:51:18Z
dc.date.accessioned2018-01-22T22:31:04Z
dc.date.available2016-11-10T18:51:18Z
dc.date.available2018-01-22T22:31:04Z
dc.date.issued2016-08
dc.date.submittedAugust 2016
dc.date.updated2016-11-10T18:51:18Z
dc.description.abstractBug localization is the resource intensive process of finding bugs. A considerable amount of time, effort, and money could be saved if this process was automated. Bug localization based on information retrieval (IR) is a static approach to automation that represents source code files as documents in a database and bug reports as queries. The bug localization approach described in this report is centered around the mental model that evolves in the minds of software developers as they work with a codebase. Using a sequence-to-sequence recurrent neural network (RNN), it may be possible to approximate this mental model by mapping the comments in source code (written in a natural language) to the source code itself (written in a programming language). The model can then be used to convert bug reports (also written in a natural language) to source token keywords for use in IR-based bug localization. The results of experimenting with several approaches to defining the mapping are presented. Although not up to par with the current state-of-the-art, the results show that there is potential in using a sequence-to-sequence RNN for IR-based bug localization.
dc.description.departmentElectrical and Computer Engineering
dc.format.mimetypeapplication/pdf
dc.identifierdoi:10.15781/T23775Z2N
dc.identifier.urihttp://hdl.handle.net/2152/43727
dc.language.isoen
dc.subjectSequence-to-sequence
dc.subjectRecurrent neural network
dc.subjectRNN
dc.subjectBug localization
dc.subjectFault localization
dc.subjectSoftware-fault localization
dc.subjectBLUiR
dc.titleApplying sequence-to-sequence RNN models to IR-based bug localization
dc.typeThesis
dc.type.materialtext

Files