What are the most powerful open-source sentiment-analysis tools?

Note to readers: There have been many developments since I posted this article in 2012! I do plan to update the article.

I took a stab at a Quora question, What are the most powerful open-source sentiment-analysis tools?. Here’s my response:

Join us at THE event for consumer, media, social & finance sentiment analysis.
Join us at THE event for consumer, media, social & finance sentiment analysis.

I know of no open-source (software) tools dedicated to sentiment analysis. Instead, a variety of open-source text-analytics tools — natural-language processing for information extraction and classification — can be applied for sentiment analysis. They include —

– Python NLTK (Natural Language Toolkit), http://www.nltk.org/, but see also http://text-processing.com/demo/sentiment/

– R, TM (text mining) module, http://cran.r-project.org/web/packages/tm/index.html, including tm.plugin.sentiment.

– RapidMiner, http://rapid-i.com/content/view/184/196/.

– GATE, the General Architecture for Text Engineering, http://gate.ac.uk/sentiment/.

I’m sure you can also find UIMA-plug-in annotators for sentiment — Apache UIMA is the Unstructured Information Management Architecture, http://uima.apache.org/ — also sentiment classifiers for the WEKA data-mining workbench, http://www.cs.waikato.ac.nz/ml/weka/. See http://www.unal.edu.co/diracad/einternacional/Weka.pdf for one example.

I bet someone’s doing sentiment with the Stanford NLP tools, http://www-nlp.stanford.edu/software/, although my understanding is the maximum-entropy classification isn’t the best approach for sentiment. I’m no scientist so I won’t go into this.

Then there’s LingPipe, which can be characterized as pseudo-open source. See http://alias-i.com/lingpipe/demos/tutorial/sentiment/read-me.html.

Powerful, I can’t say. Where machine learning is involved, a lot will depend on your training set.

Note that the tools above work on textual sources. There may be open-source tools out there for information extraction from non-textual, sentiment-bearing sources such as speech (with the outputs fed into a classification engine such as some fo the above), but I haven’t looked into them. If you know of any, or have additions for my list above, please send me a note (grimes(at)altaplana.com).

9 thoughts on “What are the most powerful open-source sentiment-analysis tools?

  1. Thanks for the nice summary, Seth! The Apache Mahout library includes a Latent Dirichlet Allocation algorithm that can be used for topic identification, and might be used as part of a sentiment analysis process.

    1. Hi Karimkhanp maybe it’s too late to answer your question but it will be usefull for others who asks the same question; Actually there is some tools, I know two of them and tested only one, there is Tashaphyne, and ISRI python packages you can find documentation for the two packages.

  2. What is your review of MeTA langiage? there is a course in coursera about information retrieval which uses this language. Thanks!

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Google+ photo

You are commenting using your Google+ account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )


Connecting to %s