What are the most powerful open-source sentiment-analysis tools?

I took a stab at a Quora question, What are the most powerful open-source sentiment-analysis tools?. Here’s my response:

I know of no open-source (software) tools dedicated to sentiment analysis. Instead, a variety of open-source text-analytics tools — natural-language processing for information extraction and classification — can be applied for sentiment analysis. They include —

– Python NLTK (Natural Language Toolkit), http://www.nltk.org/, but see also http://text-processing.com/demo/sentiment/

– R, TM (text mining) module, http://cran.r-project.org/web/packages/tm/index.html, including tm.plugin.sentiment.

– RapidMiner, http://rapid-i.com/content/view/184/196/.

– GATE, the General Architecture for Text Engineering, http://gate.ac.uk/sentiment/.

I’m sure you can also find UIMA-plug-in annotators for sentiment — Apache UIMA is the Unstructured Information Management Architecture, http://uima.apache.org/ — also sentiment classifiers for the WEKA data-mining workbench, http://www.cs.waikato.ac.nz/ml/weka/. See http://www.unal.edu.co/diracad/einternacional/Weka.pdf for one example.

I bet someone’s doing sentiment with the Stanford NLP tools, http://www-nlp.stanford.edu/software/, although my understanding is the maximum-entropy classification isn’t the best approach for sentiment. I’m no scientist so I won’t go into this.

Then there’s LingPipe, which can be characterized as pseudo-open source. See http://alias-i.com/lingpipe/demos/tutorial/sentiment/read-me.html.

Powerful, I can’t say. Where machine learning is involved, a lot will depend on your training set.

Note that the tools above work on textual sources. There may be open-source tools out there for information extraction from non-textual, sentiment-bearing sources such as speech (with the outputs fed into a classification engine such as some fo the above), but I haven’t looked into them. If you know of any, or have additions for my list above, please send me a note (grimes(at)altaplana.com).


Want to catch up… or stay ahead? Check out LT-Accelerate, the premier European text, speech, and social analytics conference, taking place 23-24 November, 2015 in Brussels.

9 thoughts on “What are the most powerful open-source sentiment-analysis tools?

  1. Thanks for the nice summary, Seth! The Apache Mahout library includes a Latent Dirichlet Allocation algorithm that can be used for topic identification, and might be used as part of a sentiment analysis process.

    1. Hi Karimkhanp maybe it’s too late to answer your question but it will be usefull for others who asks the same question; Actually there is some tools, I know two of them and tested only one, there is Tashaphyne, and ISRI python packages you can find documentation for the two packages.

  2. What is your review of MeTA langiage? there is a course in coursera about information retrieval which uses this language. Thanks!

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s