Data Science
Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
Here are 56,503 public repositories matching this topic...
Language:All
Sort:Most stars
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
- Updated
Oct 3, 2025 - Jupyter Notebook
Apache Superset is a Data Visualization and Data Exploration Platform
- Updated
Oct 7, 2025 - TypeScript
scikit-learn: machine learning in Python
- Updated
Oct 7, 2025 - Python
Deep Learning for humans
- Updated
Oct 6, 2025 - Python
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
- Updated
Oct 7, 2025 - Python
Learn how to design, develop, deploy and iterate on production-grade ML applications.
- Updated
Aug 18, 2024 - Jupyter Notebook
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
- Updated
Oct 7, 2025 - Python
Streamlit — A faster way to build and share data apps.
- Updated
Oct 7, 2025 - Python
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
- Updated
Oct 7, 2025 - Python
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
- Updated
Oct 7, 2025 - Python
💫 Industrial-strength Natural Language Processing (NLP) in Python
- Updated
May 28, 2025 - Python
10 Weeks, 20 Lessons, Data Science for All!
- Updated
Oct 3, 2025 - Jupyter Notebook
Roadmap to becoming an Artificial Intelligence Expert in 2022
- Updated
Sep 12, 2025 - JavaScript
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
- Updated
Oct 7, 2025 - Python
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.
- Updated
Oct 15, 2023 - Python
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
- Updated
Mar 20, 2024 - Python
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
- Updated
Jul 18, 2024
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
- Updated
Jun 25, 2024 - Jupyter Notebook
500 AI Machine learning Deep learning Computer vision NLP Projects with code
- Updated
Aug 1, 2025
📝 An awesome Data Science repository to learn and apply for real world problems.
- Updated
Oct 6, 2025
- Followers
- 4.3k followers
- Website
- github.com/topics/data-science
- Wikipedia
- Wikipedia