Using sentence-level classification to predict sentiment at the document-level

dc.contributor.advisorRavikumar, Pradeepen
dc.contributor.committeeMemberLiu, Alexanderen
dc.creatorHutton, Amanda Rachelen
dc.date.accessioned2012-08-21T17:37:43Zen
dc.date.accessioned2017-05-11T22:27:12Z
dc.date.available2012-08-21T17:37:43Zen
dc.date.available2017-05-11T22:27:12Z
dc.date.issued2012-05en
dc.date.submittedMay 2012en
dc.date.updated2012-08-21T17:37:50Zen
dc.descriptiontexten
dc.description.abstractThis report explores various aspects of sentiment mining. The two research goals for the report were: (1) to determine useful methods in increasing recall of negative sentences and (2) to determine the best method for applying sentence level classification to the document level. The methods in this report were applied to the Movie Reviews corpus at both the document and sentence level. The basic approach was to first identify polar and neutral sentences within the text and then classify the polar sentences as either positive or negative. The Maximum Entropy classifier was used as the baseline system in which the application of further methods was explored. Part-of-speech tagging was used for its effectiveness to determine if its inclusion increased recall of negative sentences. It was also used to aid in the handling of negations within sentences at the sentence level. Smoothing was investigated and various metrics to describe the sentiment composition were explored to address goal (2). Negative recall was shown to increase with the adjustment of the classification threshold and was also seen to increase through the methods used to address goal (2). Overall, classifying at the sentence level using bigrams and a cutoff value of one was observed to result in the highest evaluation scores.en
dc.description.departmentStatisticsen
dc.format.mimetypeapplication/pdfen
dc.identifier.slug2152/ETD-UT-2012-05-5553en
dc.identifier.urihttp://hdl.handle.net/2152/ETD-UT-2012-05-5553en
dc.language.isoengen
dc.subjectSentiment miningen
dc.subjectSentence-level classificationen
dc.subjectText classificationen
dc.subjectRecallen
dc.titleUsing sentence-level classification to predict sentiment at the document-levelen
dc.type.genrethesisen

Files