Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Detects fake product reviews using supervised ML algorithms like SVM, Random Forest, and XGBoost. Uses NLP techniques (tokenization, lemmatization, TF-IDF) for preprocessing. SVM achieved the highest accuracy and F1-score. Aims to enhance trust in online review systems.

License

NotificationsYou must be signed in to change notification settings

sakshi01coder/Fake-Product-Review-Detection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 

Repository files navigation

This project aims to detectfake product reviews usingNatural Language Processing (NLP) andSupervised Machine Learning algorithms. It leverages text-based features and evaluates multiple classifiers to identify deceptive or spam-like reviews from genuine ones.

🧠 Algorithms Used

  • Support Vector Machine (SVM) ✅Best performing model
  • Random Forest
  • XGBoost
  • Naive Bayes
  • K-Nearest Neighbors (KNN)
  • Decision Tree
  • Stochastic Gradient Descent (SGD)

🛠️ Tech Stack

  • Python 🐍
  • Scikit-learn
  • Pandas
  • Numpy
  • Matplotlib / Seaborn
  • Natural Language Toolkit (NLTK)

📝 Features

  • Preprocessing of textual data (tokenization, lemmatization, stop-word removal)
  • TF-IDF Vectorization
  • Comparative analysis of model performance (Accuracy, Precision, Recall, F1 Score, ROC-AUC)
  • Real-time user review input testing

📊 Results

ModelAccuracyPrecisionRecallF1 ScoreROC-AUC
SVM85.6%0.8610.8520.8570.856
XGBoost83.2%0.8190.8550.8370.832
Naive Bayes83.4%0.8570.8040.8300.834
Random Forest82.9%0.8600.7880.8230.829
KNN55.0%0.8560.1270.2220.552

🧪 Sample Flow

  1. User inputs a product review
  2. Preprocessing is applied (cleaning + TF-IDF)
  3. Model predicts whether the review isFake orGenuine

🧾 Dataset

A labeled dataset of product reviews (genuine/fake) was used. Each review includes:

  • Review_Text
  • Label (1 = Genuine, 0 = Fake)

Note: Dataset is not included due to licensing. Please use public datasets or your own for replication.

🏗️ Project Setup

✅ Prerequisites

  • Python 3.8+
  • pip (Python package manager)

🔧 Setup Commands

# Clone the repogit clone https://github.com/yourusername/FakeReviewDetection.gitcd FakeReviewDetection

About

Detects fake product reviews using supervised ML algorithms like SVM, Random Forest, and XGBoost. Uses NLP techniques (tokenization, lemmatization, TF-IDF) for preprocessing. SVM achieved the highest accuracy and F1-score. Aims to enhance trust in online review systems.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

[8]ページ先頭

©2009-2025 Movatter.jp