Skip to content

Latest commit

 

History

History
28 lines (14 loc) · 840 Bytes

README.md

File metadata and controls

28 lines (14 loc) · 840 Bytes

bigdataproject

Analyzing yelp reviews using topic modelling and aspect mining

Dependencies for the project:

The project in this repository was successfully executed on python 2.7 and spark 1.6

Install nltk for stemming #Run nltk.download() on pyspark for installing nltk. Type d (for download) on the prompt and then type all to download all nltk libraries.

Run LDA to find out the aspects from the reviews

Install text blob for finding the polarity of your aspects

pip install -U textblob

The summary of our analysis can be found at

https://github.com/nhemanthrao23/bigdataproject/blob/master/Analysis%20Summary.md

Presentation to our project

https://prezi.com/p/wkcbwtuwy3hf/

methodology