🎯
Focusing
Data Full Stack (Scientist/Engineer/Analyst), build end-to-end pipeline with AI/ML models
Welcome to Norton's Data World
- Full demo of end-to-end data pipeline automation run
- Open Data Event link:https://2025.open-data.nyc/event/nyc-resident-housing-property-sale-2019-2023-analysis-and-insight/
- Platform: Google Colab
- Language: Python
- Data Manipulation: Pandas
- Machine Learning Model Demo: Linear Regression, Random Forest
- Data Visualization: Jupyter Notebook: seaborn -> (pairplot, boxplot), plotly -> (3D plot)
- View the data visualization report of group project at (https://app.luminpdf.com/viewer/5ecc6da18124240012ae0885)
- Tableau Dashboard for Philadelphia House Price Data Visualization
- Language: Python
- Data Pipeline and Automation: Apache Airflow
- Data Cleaning: Pandas
- Data Storage: MySQL
- Data Transformation (Join Table): Apache Spark (SQL) - Cluster @ Databricks
- Data Visualization: Jupyter Notebook: Matplotlib -> (heatmap), seaborn -> (pairplot, boxplot), plotly -> (3D plot), Tableau Public-> dashboard
- Language: Python
- Apache Airflow
- Pandas
- Matplotlib
- MySQL / PostgreSQL
- Jupyter Notebook Report
- AWS (S3, EC2, RDS)
- Language: R
- NoSQL Database: MongoDB
- Visualization Tools: ggplot2, plotly -> (3D plot)
- In this project (LEGO dataset), I applied both R-Studio and jupyter notebook to import and export data from MongoDB, then used ggplot2 and plotly to demonstrate the analytical reslut with data visualization, respectively.
- Language: Python
- Class
- Unit Testing
- Object Oriented Programming
- Terminal
- In this project, I applied unit-test tool to verify the classes and modules in the Black Jack Application.
In this repo, Apache Kafka is used for tracking the route of the designed buslines. When we run the three different busdata producers python files, you will see three different moving spots in the map.
Environment & tools:
- Python 3.70 (pykafka, flask, JSON)
- Apache Kafka
- Javascript (Leaflet.JS)
- html
Thank you for visiting. More projects are coming
PinnedLoading
- norton_portfolio
norton_portfolio Public - ZCW.DataGroupProject
ZCW.DataGroupProject PublicJupyter Notebook 2
- DataEngineering.Labs.AirflowProject
DataEngineering.Labs.AirflowProject Public - Week9-ResearchProjects
Week9-ResearchProjects PublicForked fromZipcoder/Week9-ResearchProjects
DataEngineering projects
Jupyter Notebook
- Kafka_Live_Map
Kafka_Live_Map PublicPython
- PythonFundamentals.Labs.BlackJack
PythonFundamentals.Labs.BlackJack PublicForked fromZipCodeCore/PythonFundamentals.Labs.BlackJack
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check theGitHub status page orcontact support.
If the problem persists, check theGitHub status page orcontact support.
Uh oh!
There was an error while loading.Please reload this page.