Pradeep Pujari is the Senior Architect at AngiesList, where he is responsible for the design and implementation of text analytics, search, and recommendation systems. Prior to AngiesList, he was with Walmart, where he looked after mid-tier applications, web search, and data mining. His core domain expertise is in information retrieval, machine learning, and natural language processing. In his off hours, he loves exploring new technologies and enjoys reading and mentoring.
Solr is a popular, widely used open source information retrieval (IR) engine. It can also serve as a simple sentiment analysis and sentiment retrieval tool. Its multi-language analyzers, together with the UIMA (Unstructured Information Management Architecture) framework, can be extended for sentiment extraction. Each sentence passes through a series of pluggable annotators, which detect the entities in the sentence and the polarity associated with each. The polarity of each sentence is then stored in the Solr index. Persistent model files can be created from training data and accessed at run time.
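To make the flow concrete, here is a minimal sketch of the sentence-by-sentence annotator chain in plain Python. It is not UIMA and does not talk to Solr: the lexicon, the entity list, the function names, and the field names (`id`, `sentence`, `entities`, `polarity`) are all assumptions made for illustration, and the tiny word lexicon stands in for a model trained from data.

```python
import json
import re

# Hypothetical toy lexicon; a real system would load a persisted, trained model.
POLARITY_LEXICON = {
    "great": 1, "friendly": 1, "excellent": 1,
    "rude": -1, "late": -1, "terrible": -1,
}
# Hypothetical entity vocabulary, for illustration only.
KNOWN_ENTITIES = {"plumber", "electrician", "roofer"}


def tokenize(sentence):
    """Lowercase a sentence and split it into word tokens."""
    return re.findall(r"[a-z']+", sentence.lower())


def split_sentences(text):
    """Naive sentence splitter on '.', '!' and '?'."""
    return [s.strip() for s in re.split(r"[.!?]+", text) if s.strip()]


def entity_annotator(annotation):
    """Attach every known entity mentioned in the sentence."""
    tokens = tokenize(annotation["sentence"])
    annotation["entities"] = [t for t in tokens if t in KNOWN_ENTITIES]
    return annotation


def polarity_annotator(annotation):
    """Attach a polarity label from the summed lexicon scores."""
    score = sum(POLARITY_LEXICON.get(t, 0) for t in tokenize(annotation["sentence"]))
    if score > 0:
        annotation["polarity"] = "positive"
    elif score < 0:
        annotation["polarity"] = "negative"
    else:
        annotation["polarity"] = "neutral"
    return annotation


def annotate(text, annotators):
    """Run each sentence through the pluggable annotators, in order."""
    docs = []
    for i, sentence in enumerate(split_sentences(text)):
        annotation = {"id": f"sent-{i}", "sentence": sentence}
        for annotator in annotators:
            annotation = annotator(annotation)
        docs.append(annotation)
    return docs


docs = annotate(
    "The plumber was friendly and great. The roofer was rude and late.",
    [entity_annotator, polarity_annotator],
)
# One JSON object per sentence, shaped like documents ready for indexing.
print(json.dumps(docs, indent=2))
```

Each resulting dictionary corresponds to one sentence with its entities and polarity. In a real deployment these would be indexed as Solr documents, with the fields declared in the schema so that polarity can be used in queries and filters.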