Sentiment-ambiguity

The main objective of the project was to predict customer sentiment based on drug reviews and to identify ambiguous reviews to better serve drug manufacturers and new customers. The dataset for the project was collected from Kaggle 2018 University Club Hackathon and consisted of customer provided ratings for a drug and its review. To strike the right balance between the vocabulary size and the model accuracy, we used a custom stop words list along with various parameters in the vectorization process. We conducted a comparative study of various supervised machine learning classifiers such as SVM and Naive Bayes models to better predict the sentiment of a customer. Based on the evaluation parameters such as Precision, Recall, F-score, Accuracy, and extreme misclassification errors, we concluded that LinearSVC classifiers performed better than Naive Bayes models for predicting sentiment on the given dataset. We hypothesized that the number of conjunctions used in a review is directly proportional to the ambiguity of a review. Therefore, to identify the ambiguous reviews, we used a combination of misclassification errors of LinearSVC with a high number of conjunctions.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
Dataset		Dataset
Project_736.ipynb		Project_736.ipynb
README.md		README.md
Sentiment Analysis of Drug Reviews- Report.pdf		Sentiment Analysis of Drug Reviews- Report.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sentiment-ambiguity

About

Releases

Packages

Languages

pratt-datar/Sentiment-ambiguity

Folders and files

Latest commit

History

Repository files navigation

Sentiment-ambiguity

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages