dataprocessing
Here are 260 public repositories matching this topic...
Language:All
Sort:Most stars
Learning to create Machine Learning Algorithms
- Updated
Jun 15, 2021 - Python
Classification of Breast Cancer diagnosis Using Support Vector Machines
- Updated
Oct 15, 2022 - Jupyter Notebook
A day to day plan for this challenge (50 Days of Machine Learning) . Covers both theoretical and practical aspects
- Updated
Dec 12, 2019 - Jupyter Notebook
Open source bioinformatics and computational biology toolbox written in F#. This is the core package containing type models and parsers/writers.
- Updated
Apr 25, 2025 - F#
Tool for creating efficient data pipelines in a JavaScript environment
- Updated
Mar 3, 2025 - TypeScript
Native Delta Lake Implementation in Go
- Updated
Sep 5, 2024 - Go
Weather Forecasting report over the Jaipur Dataset for Rain Prediction
- Updated
Oct 25, 2019 - Jupyter Notebook
Stochastic Testing and Input Manipulation for Unbiased Learning Systems
- Updated
Oct 16, 2025 - Nextflow
Dataprocessing framework for JavaScript 🎂
- Updated
Jan 7, 2023 - TypeScript
Build a Flask web application to help users retrieve key restaurant information and feature-based reviews (generated by applying market-basket model – Apriori algorithm and NLP on user reviews).
- Updated
May 22, 2020 - Python
Process tardis.dev cryptocurrency data, reconstructing the market depth and computing imbalance.
- Updated
May 3, 2021 - Jupyter Notebook
Machine Learning project to predict popularity of Instagram posts
- Updated
Sep 7, 2017 - Python
The python module can be used to scrape data and process data from different sources. The python module can output data as either as a dataframe in the country year format or it will output data in excel files This module has primarily been created for processing data for the International Futures (IFs) Project however, it can be used to process…
- Updated
Jan 11, 2020 - Python
Creating an Inverted Index of words occurring in a large set of documents extracted from web pages using Hadoop MapReduce and Google Dataproc
- Updated
Oct 28, 2019 - Java
This notebook presents a pipeline to process raw data files of battery cycling and the prediction of their useful life before the degradation starts.
- Updated
Apr 20, 2021 - Jupyter Notebook
A versatile pipelining library created with media organization in mind.
- Updated
Jul 20, 2022 - C#
Can we tell if a house is abandoned based on aerial imagery?
- Updated
Mar 8, 2021 - Jupyter Notebook
A data engineering project with dbt, Docker, Kestra, Terraform, GCP and Looker.
- Updated
Jul 18, 2025 - HCL
List of all my AI Projects
- Updated
Aug 8, 2021 - Jupyter Notebook
A graphical batch data processing tool for protein crystallography
- Updated
May 31, 2024 - Python
Improve this page
Add a description, image, and links to thedataprocessing topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thedataprocessing topic, visit your repo's landing page and select "manage topics."