Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
NotificationsYou must be signed in to change notification settings

dianamonroe/Spam_Detection_NLP_model

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

System that is able to automatically detect whether a web page contains spam or not based on its URL.

Step 1: Loaded the dataset

The dataset can be found in this project folder under the nameurl_spam.csv. You can load it into the code directly from the link:

https://raw.githubusercontent.com/4GeeksAcademy/NLP-project-tutorial/main/url_spam.csv

Or download it and add it by hand in your repository.

Step 2: Preprocessed the links

Transformed the data to make it compatible with the model we want to train.Segmented the URLs into parts according to their punctuation marks, remove stopwords, lemmatize, and so on.

Make sure to conveniently split the dataset intotrain andtest.

Step 3: Built an SVM

Start solving the problem by implementing an SVM with the default parameters. Trained it and analyzed its results.

Step 4: Optimized the previous model

After training the SVM, I optimized its hyperparameters using a grid search or a random search.

Step 5: Saved the model

Store the model in the corresponding folder.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages


[8]ページ先頭

©2009-2025 Movatter.jp