# amazon-emr-cluster
Here are 2 public repositories matching this topic...
A quick start for using JuiceFS as the storage backend for an Amazon EMR cluster.
- Updated Aug 18, 2022 - Shell
This is an end-to-end AWS Cloud ETL project. The data pipeline uses an Amazon EMR cluster managed by Apache Airflow, which runs on an AWS EC2 instance. It demonstrates how to build orchestration that performs data transformation on Amazon EMR and automatically ingests the results into Snowflake via Snowpipe. It also features Power BI.
apache-spark, aws-s3, power-bi, snowflake, data-visualization, orchestration, pyspark, business-intelligence, sqs-queue, aws-ec2, apache-airflow, etl-pipeline, dags, google-colab-notebook, redfin, snowpipe, amazon-emr-cluster
- Updated Apr 10, 2025 - Jupyter Notebook