What are the most challenging issues in Sentiment Analysis (opinion mining)?

Ramy Ghaly January 28, 2011

Hossein Said:

Opinion mining/sentiment analysis is a relatively recent subtask of natural language processing. Some compare it to text classification; others treat it as a deeper problem. What do you think are the most challenging issues in sentiment analysis (opinion mining)? Can you name a few?

Hightechrider Said:

The key challenges for sentiment analysis are:

1) Named Entity Recognition – What is the person actually talking about, e.g. is 300 Spartans a group of Greeks or a movie?

2) Anaphora Resolution – the problem of resolving what a pronoun or noun phrase refers to. “We watched the movie and went to dinner; it was awful.” What does “it” refer to?

3) Parsing – What is the subject and object of the sentence, which one does the verb and/or adjective actually refer to?

4) Sarcasm – If you don’t know the author you have no idea whether ‘bad’ means bad or good.

5) Twitter – abbreviations, lack of capitals, poor spelling, poor punctuation, poor grammar, …
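To make point 5 concrete, here is a minimal sketch of the kind of normalization that tweet-style text tends to need before any sentiment scoring. The abbreviation table and the `normalize_tweet` helper are hypothetical, made up for illustration; a real system would need far broader coverage.

```python
import re

# Hypothetical abbreviation table, for illustration only.
ABBREVIATIONS = {"u": "you", "gr8": "great", "luv": "love", "2moro": "tomorrow"}

def normalize_tweet(text):
    """Lowercase, drop URLs and @mentions, squeeze long letter runs, expand abbreviations."""
    text = text.lower()
    text = re.sub(r"https?://\S+", " ", text)   # drop URLs
    text = re.sub(r"@\w+", " ", text)           # drop @mentions
    text = re.sub(r"(.)\1{2,}", r"\1\1", text)  # "soooo" becomes "soo"
    tokens = re.findall(r"[a-z0-9']+", text)    # keep word-like tokens only
    return [ABBREVIATIONS.get(tok, tok) for tok in tokens]

tokens = normalize_tweet("@bob u r gr8!!! luv this movie soooo much http://t.co/x")
print(tokens)
```

Note that "r" and "soo" pass through unchanged: anything outside the table is left as-is, which is exactly why noisy text remains hard.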

ealdent Said:

I agree with Hightechrider that those are areas where sentiment analysis accuracy can improve. I would also add that sentiment analysis is mostly done on closed-domain text. Attempts to do it on open-domain text usually wind up with very poor accuracy, F1, or whatever measure you choose, or else they are pseudo-open-domain because they only look at certain grammatical constructions. So I would say topic-sensitive sentiment analysis that can identify context and make decisions based on it is an exciting area for research (and industry products).
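A small sketch of why topic matters: the same word can carry opposite polarity in different domains. The lexicons and the `score` function below are assumptions invented for this example, not an established resource or method.

```python
# Hypothetical per-domain polarity lexicons (+1 positive, -1 negative).
DOMAIN_LEXICONS = {
    "movies": {"unpredictable": 1, "gripping": 1, "boring": -1},
    "cars": {"unpredictable": -1, "quiet": 1, "noisy": -1},
}

def score(text, domain):
    """Sum word polarities using the lexicon chosen for the given domain."""
    lexicon = DOMAIN_LEXICONS[domain]
    words = text.lower().replace(".", " ").replace(",", " ").split()
    return sum(lexicon.get(word, 0) for word in words)

movie_score = score("The plot was unpredictable.", "movies")
car_score = score("The steering was unpredictable.", "cars")
print(movie_score, car_score)
```

A single open-domain lexicon would have to pick one polarity for "unpredictable" and would get one of these two sentences wrong.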

I’d also expand his 5th point from Twitter to other social media sites (e.g. Facebook, YouTube), where short, ungrammatical utterances are commonplace.

Skarab Said:

I think the answer is language complexity, along with mistakes in grammar and spelling. There is a vast number of ways people express their opinions; e.g., sarcasm could be wrongly interpreted as extremely positive sentiment.
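The sarcasm problem is easy to reproduce with a naive word-counting scorer. The word lists and `lexicon_polarity` function here are hypothetical and deliberately simplistic; the point is only to show the failure mode.

```python
import re

# Tiny hypothetical word lists, for illustration only.
POSITIVE = {"great", "love", "wonderful", "best"}
NEGATIVE = {"awful", "hate", "terrible", "worst"}

def lexicon_polarity(text):
    """Count positive words minus negative words, ignoring context entirely."""
    words = re.findall(r"[a-z']+", text.lower())
    return sum((word in POSITIVE) - (word in NEGATIVE) for word in words)

sarcastic = "Oh great, another two hour delay. I just love waiting."
sarcasm_score = lexicon_polarity(sarcastic)
print(sarcasm_score)
```

The scorer counts "great" and "love" and rates the complaint as clearly positive, because nothing in a bag of words signals that the author means the opposite.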

What do you think? Do you agree? Would you like to ask a question and get an answer? Try out: Q&A for professional and enthusiast programmers

 


About Ramy Ghaly
Ramy Ghaly is a Marketing Technologist at Pragmatech. He's in charge of Pragmatech's online marketing strategy and implementation. Ramy has launched several successful online and social media marketing campaigns, including a 6-month community and awareness building campaign that led to a listing in the reputable multi-client study of the Gilbane Group. Pragmatech's flagship product 'ctrl', a semantic search engine, was the only technology coming out of the Middle East that was recognized in the report. The report, entitled "Semantic Technologies: Landscape of High Value Applications for the Enterprise", cited, in addition to Pragmatech's semantic engine, some of the largest software companies and leading research labs in the world.
