Uh oh!
There was an error while loading.Please reload this page.
- Notifications
You must be signed in to change notification settings - Fork124
Practical Data Engineering: A Hands-On Real-Estate Project Guide
NotificationsYou must be signed in to change notification settings
ssp-data/practical-data-engineering
Folders and files
| Name | Name | Last commit message | Last commit date | |
|---|---|---|---|---|
Repository files navigation
This is a practical example of a data engineering project with real-estates. The connected blog post aboutBuilding a Data Engineering Project in 20 Minutes you can find on mywebsite. Topics are:
- Getting the Data – Scraping withBeautifulSoup
- Storing on S3-MinIO
- Custom Change Data Capture (CDC)
- Adding Database features to S3 –Delta Lake &Spark
- Machine Learning part –Jupyter Notebook
- Ingesting Data Warehouse for low latency –Apache Druid
- The UI with Dashboards and more –Apache Superset
- Orchestrating everything together –Dagster
- DevOps engine –Kubernetes
The Status of the project you findhere.
To get MinIO, Spark, Kubernetes, etc. ready, check the representive folder inhere.
- MinIO started
- Kubernetes ready
- Spark image and role and namespaces ready
- cd
src/pipelines/real-estateand start dagit withdagit
About
Practical Data Engineering: A Hands-On Real-Estate Project Guide
Topics
Resources
Uh oh!
There was an error while loading.Please reload this page.
Stars
Watchers
Forks
Releases
No releases published
Sponsor this project
Uh oh!
There was an error while loading.Please reload this page.
Packages0
No packages published

