NLP Series : Sentiment Analysis on COVID-19 Tweets using Python


Details
Sentiment analysis (or opinion mining) is a natural language processing technique used to determine whether data is positive, negative or neutral. Sentiment analysis is often performed on textual data to help businesses monitor brand and product sentiment in customer feedback, and understand customer needs.Sentiment analysis is extremely important because it helps businesses quickly understand the overall opinions of their customers.
With the current covid-19 pandemic, we see a lot of stuides and research going on in the world today around covid-19. In this article, using data sets of tweets about covid-19 obtained from Kaggle, A text classifier was built to classify sentiments from the tweets. The kaggle data set from curated from tweeter by scraping the tweeter API for all mentions of covid-19.
To build a text classifier, we need to vectorize the text that will be used for training. There are different ways text can be vectorized. Some of the commom techniques used are bag of words, Term Frequency-Inverse Document Frequency(TF-IDF), and Word Embeddings- Word2Vec and GLOVE.
In this session, we will use Bag of Words and TF-IDF to create features, we will then train models using Logistic Regression, Naive Bayes, and Deep Neural Network.


NLP Series : Sentiment Analysis on COVID-19 Tweets using Python