This is alist of data science software and platforms used indata science, which includesprogramming languages, programming environments,machine learning frameworks,data engineering tools,statistical software,data analysis,plotting,MLOps systems, and more.
Programming languages
[edit]Development environments
[edit]Theseinteractive notebooks,IDEs, and platforms provide specialised development environments.
Machine and deep learning software
[edit]TheMachine learning /deep learning tools support development in those fields.
Examples ofData engineering tools.
Examples ofData mining tools.
Free and open-source
[edit]Database management
[edit]Data warehouse environments include:
Data lake environments include:
Statistical software
[edit]Tools forData processing and analysis:
Data and information visualization
[edit]Software forData visualization:
Software forplotting data to support processing and visualise resuls.
Maps and geospatial visualization
[edit]MLOps and model deployment:
Educational data science software
[edit]- Kaggle – online platform for data science education, competitions, datasets, and collaborative learning.
- KNIME – open-source data analytics platform used for teaching data science, machine learning, and workflow-based analysis.
- RapidMiner – used in academic research and education for data mining and machine learning.
- Statistics Online Computational Resource (SOCR) – online tools and instructional resources for statistics education.
- Tanagra (machine learning) – data mining software developed for research and teaching purposes.
- TinkerPlots – explore and analyze data through visual modeling.
- ^"Top 10 Java Libraries for Data Science".GeeksforGeeks. September 22, 2024.
- ^"Swift for Data Science: An Introduction - Alibaba Cloud".www.alibabacloud.com.
- ^"Top 12 Data Science Programming Languages | MDS@Rice".csweb.rice.edu.
- ^"5 Types of Programming Languages for Data Scientists".
- ^"The Role of Programming Languages in Data Science".New York Tech Online College of Engineering & Computer Sciences.
- ^"Apache Zeppelin 0.10.0 Documentation".zeppelin.apache.org.
- ^Monaco, Michael A.; Dexter, Marie; Tamburro, Jennifer."Introduction to SAS® Studio"(PDF).
- ^"6 Best Python IDEs for Data Science in 2025".www.datacamp.com.
- ^"8 Best Machine Learning Software To Use in 2025".
- ^Hiter, Shelby (April 25, 2023)."10 Best Data Mining Tools & Software".
- ^"Cloud Data Warehouse Comparison: Amazon Redshift, Google BigQuery, Azure Synapse, Snowflake, and Databricks".www.linkedin.com.
- ^Darley, James (September 24, 2025)."Top 10: AI Data Lakes".aimagazine.com.
- ^"Top 10 Algorithms for Data Science".
- ^"Machine Learning Algorithms". 17 August 2023.
- ^Staff, Coursera (May 9, 2025)."15 Data Analysis Tools and When to Use Them".Coursera.
- ^"BentoML".GitHub.
- ^"MLflow".mlflow.org.
- ^Zaharia, Matei A.; Chen, Andrew; Davidson, Aaron; Ghodsi, Ali; Hong, Sue Ann; Konwinski, Andy; Murching, Siddharth; Nykodym, Tomas; Ogilvie, Paul; Parkhe, Mani; Xie, Fen; Zumar, Corey (September 28, 2018)."Accelerating the Machine Learning Lifecycle with MLflow"(PDF).Bulletin of the IEEE Computer Society Technical Committee on Data Engineering:39–45.
- ^"Production-ready ML Serving Framework | Seldon Core 2".docs.seldon.ai.
- ^"Streamlit/Streamlit".GitHub.
- ^https://docs.streamlit.io
- ^"Serving Models | TFX".TensorFlow.
- ^"tensorflow/serving". September 27, 2025 – via GitHub.
- ^"wandb/wandb". September 28, 2025 – via GitHub.
- ^"Find Open Datasets and Machine Learning Projects | Kaggle".
- ^"OpenML".
- ^https://archive.ics.uci.edu