A New Semantic Approach on Yelp Review-star Rating Classification
| dc.contributor.author | Wu, Shuang | |
| dc.contributor.author | Wang, Xiaodong | |
| dc.contributor.author | Qi, Bozhao | |
| dc.date.accessioned | 2020-03-03T20:10:40Z | |
| dc.date.available | 2020-03-03T20:10:40Z | |
| dc.date.issued | 2020-02-26 | |
| dc.description.abstract | This paper introduces a new semantic approach for yelp review star rating prediction. Our approach extracts feature vectors from user reviews to develop star prediction models. User review text contains detailed information about reviewers’ experience, and directly reflects reviewer’s satisfaction level. Our approach can extract sentimental words from review text, and convert these information into different feature vectors. Reviewer’s personal preference may be extremely skewed from each other, to eliminate these effects, we use belief propagation methods to calculate review star probability distributions for different types of reviewers. Our machine learning algorithm predicts review star based on reviewers’ preference and voting habit. We extract different feature vectors and apply them to several machine learning algorithms. To evaluate all the 2.2 million user reviews, we build spark system on three laptops. To achieve a better prediction accuracy, we perform sentiment analysis of reviews in terms of the number of positive, negative, negation words, and apply belief propagation methods to get rid of personal preference effects. Our system can evaluate 2.2 million data entries in less than two minutes and achieve an accuracy of 55%. | en_US |
| dc.identifier.uri | http://digital.library.wisc.edu/1793/79890 | |
| dc.relation.ispartofseries | TR1860; | |
| dc.subject | semantic approach | en_US |
| dc.subject | review rating and classification | en_US |
| dc.subject | big data | en_US |
| dc.subject | machine learning | en_US |
| dc.title | A New Semantic Approach on Yelp Review-star Rating Classification | en_US |
| dc.type | Technical Report | en_US |