dianamonroe/Spam_Detection_NLP_modelPublic

generated from4GeeksAcademy/Spam_Detection_NLP_ProjectDianaM

NotificationsYou must be signed in to change notification settings
Fork0
Star0

You must be signed in to change notification settings

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.devcontainer		.devcontainer
.vscode		.vscode
data		data
models		models
src		src
.env.example		.env.example
.gitignore		.gitignore
README.es.md		README.es.md
README.md		README.md
requirements.txt		requirements.txt

Repository files navigation

Spam link detection system

System that is able to automatically detect whether a web page contains spam or not based on its URL.

Step 1: Loaded the dataset

The dataset can be found in this project folder under the nameurl_spam.csv. You can load it into the code directly from the link:

https://raw.githubusercontent.com/4GeeksAcademy/NLP-project-tutorial/main/url_spam.csv

Or download it and add it by hand in your repository.

Step 2: Preprocessed the links

Transformed the data to make it compatible with the model we want to train.Segmented the URLs into parts according to their punctuation marks, remove stopwords, lemmatize, and so on.

Make sure to conveniently split the dataset intotrain andtest.

Step 3: Built an SVM

Start solving the problem by implementing an SVM with the default parameters. Trained it and analyzed its results.

Step 4: Optimized the previous model

After training the SVM, I optimized its hyperparameters using a grid search or a random search.

Step 5: Saved the model

Store the model in the corresponding folder.

About

No description, website, or topics provided.

Releases

No releases published

Packages

No packages published

Languages

Jupyter Notebook99.7%
Other0.3%

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Spam link detection system

Step 1: Loaded the dataset

Step 2: Preprocessed the links

Step 3: Built an SVM

Step 4: Optimized the previous model

Step 5: Saved the model

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages

Uh oh!

Languages

Movatterモバイル変換

dianamonroe/Spam_Detection_NLP_model

Folders and files

Latest commit

History

Repository files navigation

Spam link detection system

Step 1: Loaded the dataset

Step 2: Preprocessed the links

Step 3: Built an SVM

Step 4: Optimized the previous model

Step 5: Saved the model

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages0

Uh oh!

Languages

Packages