Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Applications of NLP like Topic Modeling, Sentiment Analysis, Word Cloud along with Web Scraping.

NotificationsYou must be signed in to change notification settings

vishakha-b18/Text-Analysis-and-Web-Scraping

Repository files navigation

This repository consists of the following anlaysis and applications of Natural Language Processing (NLP) techniques on several books and their reviews. The books were procured in an image format in pdf, and later were connvereted using OCR to textual information. The book reviews were scraped from the internet:

  1. Topic Modelling: Used Latent Dirichlet Allocation (LDA) for topic modelling on a dataset of books to indetify 5 topics present in each book. Before performing the topic modelling, cleaned the textual data by removing links, special characters, stop words and followed it with Lemmatization.

  2. Web Scraping & Sentiment Analysis: Used BeautifulSoup to scrape book reviews from Goodread and Librarything. Then performed the VADER sentiment analysis, TextBlob sentiment analysis, and the NRC sentiment analysis to understand the emotions, polarity and the subjectivity of the reviews.

  3. Word Cloud: Created Word Clouds on the above data using the WordCloud library


Vishakha Bhattacharjee
MS in Business Analytics, Columbia University

About

Applications of NLP like Topic Modeling, Sentiment Analysis, Word Cloud along with Web Scraping.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

[8]ページ先頭

©2009-2025 Movatter.jp