glue-job
Here are 34 public repositories matching this topic...
Glue scripts for converting AWS Service Logs for use in Athena
- Updated Feb 1, 2024 - Python
The Sensitive Data Protection on AWS solution allows enterprise customers to create data catalogs and to discover, protect, and visualize sensitive data across multiple AWS accounts. The solution eliminates the need for manual tagging to track sensitive data such as Personally Identifiable Information (PII) and classified information.
- Updated Feb 6, 2025 - TypeScript
Build and deploy a serverless data pipeline on AWS with no effort.
- Updated Feb 8, 2023 - Python
Extract, transform, and load data for analytic processing using AWS Glue
- Updated May 2, 2021 - Python
This is a data pipeline built with the purpose of serving a business team.
- Updated Feb 28, 2023 - Python
Terraform configuration that creates several AWS services, uploads data to S3, and starts the Glue Crawler and Glue Job.
- Updated Feb 10, 2022 - Python
Terraform module which creates Glue Job resources on AWS.
- Updated Jun 4, 2022 - HCL
A cloud-based ETL pipeline on AWS for automating airline flight data ingestion, transformation, and storage using S3, Glue, Redshift, EventBridge, Step Functions, and SNS.
- Updated Feb 17, 2025 - Python
This project outlines the final project requirements for Information Architectures, focusing on group assignments, scoring criteria, topic selection, core requirements, and project components such as design, development, visualization, and executive presentation.
- Updated Jan 6, 2025 - HTML
- Updated Oct 28, 2023 - Python
Terraform module to create and manage an AWS Glue job
- Updated Jan 22, 2025 - HCL
ETL pipeline on AWS
- Updated Jul 28, 2024 - Python
Data engineering project that streams data produced by Python applications, runs an ETL process, and makes the results available for ad-hoc SQL queries in the AWS cloud
- Updated Jul 5, 2024 - Jupyter Notebook
This project creates a serverless data pipeline to extract data from the Colombo Stock Market ASI Index API using AWS Lambda, Kinesis Firehose, and S3. An AWS Glue workflow processes and transforms the data, storing it in an Apache Iceberg table via Athena and Glue ETL jobs.
- Updated Jul 2, 2024 - Python
IMDB Movie Data ETL Pipeline using S3, Glue, Redshift, EventBridge, SNS
- Updated Aug 6, 2024 - Python
DeepLearning.AI & AWS Data Engineering Course Exercises
- Updated Jan 5, 2025 - Jupyter Notebook
This project is an end-to-end, fully automated warehouse management solution designed to tackle real-world inventory challenges in the FMCG sector. From real-time data ingestion and predictive analytics to interactive dashboards, this project combines cutting-edge technologies and an event-driven architecture to simulate a business-ready system.
- Updated Dec 28, 2024 - Python