etl-process
Here are 97 public repositories matching this topic...
Language:All
Sort:Most stars
Regular practice on Data Science, Machien Learning, Deep Learning, Solving ML Project problem, Analytical Issue. Regular boost up my knowledge. The goal is to help learner with learning resource on Data Science filed.
- Updated
Jan 29, 2023 - Jupyter Notebook
python ETL framework
- Updated
Sep 8, 2021 - Python
For this project I am creating an ETL (Extract, Transform, and Load) pipeline using Python, RegEx, and SQL Database. The goal is to retrieve data from different sources, clean and transform it into a useful format and finally load the data into an SQL database where the data is ready for further analysis. The result is an established automated p…
- Updated
Feb 9, 2021 - Jupyter Notebook
Implementation of an ETL process for real-time sentiment analysis of tweets with Docker, Apache Kafka, Spark Streaming, MongoDB and Delta Lake
- Updated
May 6, 2023 - Python
3NF-normalize Yelp data on S3 with Spark and load it into Redshift - automate the whole thing with Apache Airflow
- Updated
Aug 17, 2019 - Jupyter Notebook
This is a PHP project which combines ETL with different strategies to extract data from multiple databases, files, and services, transform it and load it into multiple destinations.
- Updated
Apr 3, 2025 - PHP
Sugar candy for data scientist. Easy manipulation in time-series data analytics works.
- Updated
Feb 23, 2024 - Python
This project repository provides a headless module to enrich location data in a database table using the Google Maps Geocode API.
- Updated
Dec 7, 2021 - Python
This is a sentimental analysis project that aims to provide a better insight on customers' satisfaction based on comments gathered (scrapped) from social media using google's Bert classification model.
- Updated
Sep 5, 2024 - Jupyter Notebook
a data warehouse for an online course shop
- Updated
Sep 12, 2021 - TSQL
Scraping BooksToScrape (P2 OC D-A Python) : Utiliser les bases de Python pour l'analyse de marché
- Updated
Jun 16, 2022 - Python
Extractor of Ethereum data to Dgraph format, utilities to analyse the indexed data.
- Updated
Nov 7, 2024 - Rust
Dynamic website scraper and email notifier.
- Updated
Dec 22, 2023 - Python
I made various data normalization operations with python scripts. Target data in CSV format
- Updated
Jul 19, 2021 - Python
We examine two data sets relate with the music Industry. We Extract, transform and load the data sets in order to create a data base and identify insides and trends about the music Industry.
- Updated
May 8, 2021 - Jupyter Notebook
This project automates ETL for gym exercise data, predicting safety scores using KNN and optimizing with GridSearchCV. It generates recommendations, statistical summaries, and visualizations to improve gym safety and client retention. Logging ensures transparency.
- Updated
Feb 25, 2025 - Python
An ETL process for a fictitious streaming service, Amazing Prime, was developed in Jupyter Notebook. The code was then refactored into a Python script to automate the ETL process.
- Updated
Jul 24, 2020 - Jupyter Notebook
This repository contains OLTP, ETL process (using Pentaho Data Integration), and OLAP of credit card dataset. The dataset is taken from Kaggle (https://www.kaggle.com/rikdifos/credit-card-approval-prediction) and part of author Capstone Project.
- Updated
Apr 8, 2022
This project is a comprehensive data engineering solution that extracts HR data from a GitHub repository, performs data transformations using Azure services, and creates an interactive HR dashboard using Power BI. The goal is to enable HR professionals and decision-makers to gain insights from the HR data for better workforce management.
- Updated
Sep 29, 2023 - Jupyter Notebook
Improve this page
Add a description, image, and links to theetl-process topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with theetl-process topic, visit your repo's landing page and select "manage topics."