Skip to main content

Posts

Showing posts from November, 2012

WordNet JWNL hangs / block for some Strings

This may be a bug for JWNL WordNet. When using the following method, MorphologicalProcessor.lookupBaseForm(POS.NOUN, token) if the input token is some long alphanumeric string (e.g. "l0sfool0o0cnews0cinvesting0c20a120c0a50c0a20cplanning0efor" ), The method may run for a long time. Reference: http://stackoverflow.com/questions/10599966/jwnljava-wordnet-library-hangs-while-processing-large-alphanumeric-data

Monitor memory usage in Java

public static void printMemoryUsage() { int MegaBytes = 10241024; long freeMemory = Runtime.getRuntime().freeMemory() / MegaBytes; long totalMemory = Runtime.getRuntime().totalMemory() / MegaBytes; long maxMemory = Runtime.getRuntime().maxMemory() / MegaBytes; logger.warn("JVM freeMemory: " + freeMemory); logger.warn("JVM totalMemory(initial heap size of JVM):" + totalMemory); logger.warn("JVM maxMemory (maximum heap size of JVM): " + maxMemory); }

Reading list for opinion mining or sentiment analysis (keeping update)

The following are papers for opinion mining or sentiment analysis. Recommend a good latest survey paper. (Click to download) A Survey of Opinion Mining and Sentiment Analysis Kluwer Academic Publishers 2012 1. Survey papers    1.   Liu, B.  Sentiment analysis and subjectivity .    In Handbook of Natural Language Processing, Second Edition, N. Indurkhya and F.J. Damerau, Editors. 2010.    2.   Pang, B. and L. Lee.  Opinion mining and sentiment analysis.  Foundations and Trends in Information Retrieval, 2008, 2(1-2): p. 1-135.     2. Sentiment analysis method    1.  Pang, B., L. Lee, and S. Vaithyanathan.  Thumbs up?: sentiment classification using machine learning techniques . In Proceedings of Conference on Empirical Methods in Natural Language Processing (EMNLP-2002), 2002    2.  Ding, X., B. Liu, and P. Yu.  A holistic lexicon-based approach to opinion mining . In Proceedings of the Conference on Web Search and Web Data Mining (WSDM-2008), 2008.    3.  Jin,