What are the most powerful open-source sentiment-analysis tools?

I took a stab at a Quora question, “What are the most powerful open-source sentiment-analysis tools?” Here’s my response:

I know of no open-source (software) tools dedicated to sentiment analysis. Instead, a variety of open-source text-analytics tools (natural-language processing for information extraction and classification) can be applied for sentiment analysis. They include:

- Python NLTK (Natural Language Toolkit), http://www.nltk.org/, but see also http://text-processing.com/demo/sentiment/

- R, tm (text mining) package, http://cran.r-project.org/web/packages/tm/index.html, along with the tm.plugin.sentiment plug-in.

- RapidMiner, http://rapid-i.com/content/view/184/196/.

- GATE, the General Architecture for Text Engineering, http://gate.ac.uk/sentiment/.

I’m sure you can also find UIMA plug-in annotators for sentiment (Apache UIMA is the Unstructured Information Management Architecture, http://uima.apache.org/), as well as sentiment classifiers for the WEKA data-mining workbench, http://www.cs.waikato.ac.nz/ml/weka/. See http://www.unal.edu.co/diracad/einternacional/Weka.pdf for one example.

I bet someone’s doing sentiment with the Stanford NLP tools, http://www-nlp.stanford.edu/software/, although my understanding is that maximum-entropy classification isn’t the best approach for sentiment. I’m no scientist, so I won’t go into this.

Then there’s LingPipe, which can be characterized as pseudo-open source. See http://alias-i.com/lingpipe/demos/tutorial/sentiment/read-me.html.

Powerful, I can’t say. Where machine learning is involved, a lot will depend on your training set.
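To make the training-set point concrete, here is a minimal sketch in plain Python. It is not taken from any of the tools listed above. It is a toy naive Bayes classifier I'm assuming purely for illustration, and the function names (`tokenize`, `train`, `classify`) and the two tiny training sets are made up. The same word, "unpredictable", comes out positive when the classifier has learned from movie reviews and negative when it has learned from car reviews.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Deliberately naive: lowercase and split on whitespace.
    return text.lower().split()


def train(examples):
    """Count words per label from a list of (text, label) pairs."""
    word_counts = defaultdict(Counter)
    label_counts = Counter()
    vocab = set()
    for text, label in examples:
        label_counts[label] += 1
        for word in tokenize(text):
            word_counts[label][word] += 1
            vocab.add(word)
    return word_counts, label_counts, vocab


def classify(model, text):
    """Return the label with the highest naive Bayes log-score."""
    word_counts, label_counts, vocab = model
    total_docs = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label in sorted(label_counts):
        score = math.log(label_counts[label] / total_docs)
        label_total = sum(word_counts[label].values())
        for word in tokenize(text):
            if word in vocab:  # words never seen in training are ignored
                # Add-one (Laplace) smoothing avoids log(0).
                count = word_counts[label][word] + 1
                score += math.log(count / (label_total + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Two hypothetical, hand-made training sets from different domains.
movie_reviews = [
    ("a gripping and unpredictable plot", "pos"),
    ("dull and boring film", "neg"),
]
car_reviews = [
    ("the steering is unpredictable and unsafe", "neg"),
    ("a smooth and reliable ride", "pos"),
]

sentence = "it was unpredictable"
print(classify(train(movie_reviews), sentence))  # pos
print(classify(train(car_reviews), sentence))    # neg
```

Real training sets are of course far larger, but the effect is the same in kind: a classifier trained on text from one domain can mislabel text from another.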

Note that the tools above work on textual sources. There may be open-source tools out there for information extraction from non-textual, sentiment-bearing sources such as speech (with the outputs fed into a classification engine such as some of the above), but I haven’t looked into them. If you know of any, or have additions for my list above, please send me a note (grimes(at)altaplana.com).


Want to catch up… or stay ahead? Check out the Sentiment Analysis Symposium, May 8, 2013 in New York; also the May 7 sentiment analysis Research & Innovation session and the Practical Sentiment Analysis tutorial, to be presented by Prof. Ronen Feldman.

7 comments

  1. Pingback: Slightly-more-than-basic sentiment analysis | House of Stones
  2. Lynd Bacon

    Thanks for the nice summary, Seth! The Apache Mahout library includes a Latent Dirichlet Allocation algorithm that can be used for topic identification, and might be used as part of a sentiment analysis process.

