datapreprocessing
Here are 566 public repositories matching this topic...
Language:All
Sort:Most stars
Simple tool to split COCO annotations into train/test datasets.
- Updated
Aug 15, 2023 - Python
Roadmap for Data Engineering
- Updated
Jun 20, 2024 - Java
⚡ GUI for editing LLM vector embeddings. No more blind chunking. Upload content in any file extension, join and split chunks, edit metadata and embedding tokens + remove stop-words and punctuation with one click, add images, and download in .veml to share it with your team.
- Updated
Nov 21, 2023 - PHP
Wind Power Forecasting Based on Hybrid CEEMDAN-EWT Deep Learning Method
- Updated
Sep 28, 2023 - Python
Analyzing the HR Criteria of a Company and how they promote their Employees and keep Balance between them using Data Analytics, Data Visualizations, and Machine Learning Models for Classification Purposes.
- Updated
May 23, 2019 - Jupyter Notebook
Utilizes a Convolutional-based Transformer architecture for accurate and efficient PV power forecasting.
- Updated
Jan 22, 2024 - Python
Cereja is a bundle of useful functions we don't want to rewrite and .. just pure fun!
- Updated
Oct 15, 2025 - Python
The project provides a real-world dataset focusing on supply chain analytics
- Updated
Aug 26, 2023 - Jupyter Notebook
Data Science RoadMap
- Updated
Apr 30, 2022
Monotonic Optimal Binning algorithm is a statistical approach to transform continuous variables into optimal and monotonic categorical variables.
- Updated
Nov 6, 2025 - Python
Power BI exercises for courses on DataCamp's Data Analyst in Power BI Career Track
- Updated
Jan 24, 2024
A helper package for Machine Learning and Deep Learning Algorithms
- Updated
Feb 14, 2023 - Jupyter Notebook
⚒️ Data preprocessing is the process of transforming raw data into an understandable format. It is also an important step in data mining as we cannot work with raw data. The quality of the data should be checked before applying machine learning or data mining algorithms
- Updated
Nov 15, 2021 - Jupyter Notebook
Data Visualization, EDA , Model Building and Deployment etc..
- Updated
Nov 21, 2022 - Jupyter Notebook
My learnings on different algorithms of Machine Learning with Python .
- Updated
Oct 25, 2021 - Jupyter Notebook
Predicting whether a person who has applied for a loan in a bank would get his/her loan approved or not using Classification Algorithms in Machine Learning, by looking at some common and useful attributes.
- Updated
Mar 31, 2019 - Jupyter Notebook
This repository contains a basic fraud detection system utilising supervised learning techniques to identify potentially fraudulent credit card transactions. The project establishes a baseline model that addresses the challenges of credit card fraud in financial institutions.
- Updated
Nov 13, 2024 - Jupyter Notebook
In this project we try to predict home credit default risk for clients. We try to predict, if the client will have payment difficulties or not.
- Updated
Apr 2, 2022 - Python
Automating the process of Data Preprocessing for Data Science
- Updated
Jun 2, 2021 - Python
Improve this page
Add a description, image, and links to thedatapreprocessing topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thedatapreprocessing topic, visit your repo's landing page and select "manage topics."