Three Essays on Opinion Mining of Social Media Texts

dc.contributor.advisorAtish Sinha
dc.contributor.advisorHuimin Zhao
dc.contributor.committeememberStephen France
dc.contributor.committeememberZhijian Huang
dc.contributor.committeememberMark Srite
dc.creatorDeng, Shuyuan
dc.date.accessioned2025-01-16T19:49:39Z
dc.date.issued2014-12-01
dc.description.abstractThis dissertation research is a collection of three essays on opinion mining of social media texts. I explore different theoretical and methodological perspectives in this inquiry. The first essay focuses on improving lexicon-based sentiment classification. I propose a method to automatically generate a sentiment lexicon that incorporates knowledge from both the language domain and the content domain. This method learns word associations from a large unannotated corpus. These associations are used to identify new sentiment words. Using a Twitter data set containing 743,069 tweets related to the stock market, I show that the sentiment lexicons generated using the proposed method significantly outperforms existing sentiment lexicons in sentiment classification. As sentiment analysis is being applied to different types of documents to solve different problems, the proposed method provides a useful tool to improve sentiment classification. The second essay focuses on improving supervised sentiment classification. In previous work on sentiment classification, a document was typically represented as a collection of single words. This method of feature representation suffers from severe ambiguity, especially in classifying short texts, such as microblog messages. I propose the use of dependency features in sentiment classification. A dependency describes the relationship between a pair of words even when they are distant. I compare the sentiment classification performance of dependency features with a few commonly used features in different experiment settings. The results show that dependency features significantly outperform existing feature representations. In the third essay, I examine the relationship between social media sentiment and stock returns. This is the first study to test the bidirectional effects in this relationship. Based on theories in behavioral finance research, I speculate that social media sentiment does not predict stock return, but rather that stock return predicts social media sentiment. I empirically test a set of research hypotheses by applying the vector autoregression (VAR) model on a social media data set, which is much larger than those used in previous studies. The hypotheses are supported by the results. The findings have significant implications for both theory and practice.
dc.description.embargo2017-03-09
dc.embargo.liftdate2017-03-09
dc.identifier.urihttp://digital.library.wisc.edu/1793/88554
dc.relation.replaceshttps://dc.uwm.edu/etd/679
dc.subjectLexicon
dc.subjectNatural Language Processing
dc.subjectOpinion Mining
dc.subjectSentiment Analysis
dc.subjectSocial Media
dc.subjectStock Return
dc.titleThree Essays on Opinion Mining of Social Media Texts
dc.typedissertation
thesis.degree.disciplineManagement Science
thesis.degree.grantorUniversity of Wisconsin-Milwaukee
thesis.degree.nameDoctor of Philosophy

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Deng_uwm_0263D_10918.pdf
Size:
1.65 MB
Format:
Adobe Portable Document Format
Description:
Main File