![Скачать презентацию Opinion Observer Analyzing and Comparing Opinions on the Скачать презентацию Opinion Observer Analyzing and Comparing Opinions on the](https://present5.com/wp-content/plugins/kama-clic-counter/icons/ppt.jpg)
82f25670144a410662263ecfd49caba0.ppt
- Количество слайдов: 18
Opinion Observer: Analyzing and Comparing Opinions on the Web Bing Liu, Minqing Hu, Junsheng Cheng Paper Presentation: Vinay Goel
Introduction l l l Web: excellent source of consumer opinions Online customer reviews of products Useful information to customers and product manufacturers Novel framework for analyzing and comparing customer opinions Technique based on language pattern matching to extract product features
Opinion Observer
Technical Tasks Identify product features that customers have expressed their opinions on l For each feature, identify whether the opinion is positive or negative l Review Format (2) - Pros, Cons and detailed review l The paper proposes a technique to identify product features from pros and cons in this format l
Problem Statement l Let P={P 1, P 2 … Pn} be a set of products that the user is interested in l Each product Pi has a set of reviews Ri ={r 1, r 2 … rk} l Each review rj is a sequence of sentences rj= {sj 1, sj 2 … sjm}
Product Feature l A product feature f in rj is an attribute/component of the product that has been commented on in rj l If f appears in rj, explicit feature ¡ l “The battery life of this camera is too short” If f does not appear in rj but is implied, implicit feature ¡ “This camera is too large” (size)
Opinions and features l Opinion segment of a feature ¡ ¡ l Set of consecutive sentences that expresses a positive or negative opinion on f “The picture quality is good, but the battery life is short” Positive opinion set of a feature (Pset) ¡ ¡ Set of opinion segments of f that expresses positive opinions about f from all the reviews of the product Nset can be defined similarly
Visualizing Opinion Comparison
Automated opinion analysis Explicit and implicit features Synonyms Granularity of features
Extracting Product Features Labeling l Perform POS tagging and remove digits ¡ l Replace actual feature words with [feature] ¡ l ¡ “<V>included<N>[feature]<V>is” “<N>[feature]<V>is<Adj>stingy” Distinguish duplicate tags ¡ l “<V>included<N>[feature]<V>is<Adj>stingy” Use n-gram to produce shorter segments ¡ l “<V>included<N>MB<V>is<Adj>stingy” “<N 1>[feature]<N 2>usage” Perform word stemming
Rule Generation l Association Rule Mining l Only need rules that have [feature] on the righthand-side (<N 1>, <N 2> --> [feature]) l Consider the sequence of items in the conditional part (left-hand-side) of each rule l Generate language patterns (<N 1>[feature]<N 2>)
Feature Refinement strategies l There may be a more likely feature in the sentence segment but not extracted by any pattern ¡ l Frequent-Noun ¡ l “slight hum from subwoofer when not in use” Only a noun replaces another noun Frequent-Term ¡ Any type replacement
Semi-Automated Tagging of Reviews
Extracting Reviews from Web Pages l Non trivial task l MDR-2 ¡ System finds patterns from page containing reviews ¡ System uses these patterns to extract reviews from other pages of the site
System Architecture
Experimental Results
Experimental Results l Amount of time saved by Semi-automatic tagging is around 45% l Group synonyms using Word. Net (52% recall and 100% precision) ¡ Does not handle context dependent synonyms
Conclusion l Novel visual analysis system l Supervised pattern discovery method l Interactive correction of errors of the automatic system l Improve techniques, study strength of opinions
82f25670144a410662263ecfd49caba0.ppt