large-data
Here are 14 public repositories matching this topic...
C++ DataFrame for statistical, financial, and ML analysis in modern C++
- Updated Nov 5, 2025 (C++)
A Kafka Serde that reads and writes records from and to Blob storage (S3, Azure, Google) transparently.
- Updated Nov 1, 2025 (Java)
The official repo for [ACM CSUR'24] "Empowering Agrifood System with Artificial Intelligence: A Survey of the Progress, Challenges and Opportunities"
- Updated Dec 6, 2024
Tabular Data Viewer 🀄 VSCode extension for viewing very large local and remote CSV and TSV data files with Tabulator Table, Perspective Datagrid and D3FC Chart Views 📊📈
- Updated Apr 16, 2023 (TypeScript)
Optimized Python IPC: Uses shared memory to bypass multiprocessing queue I/O bottlenecks, ideal for large data (1MB+) in scientific computing, RL, etc. Reduces system load and improves latency
- Updated Apr 25, 2025 (Jupyter Notebook)
Wrappers that adapt single-instance learning algorithms so they can be fitted to multiple-instance learning data
- Updated Jul 7, 2025 (Jupyter Notebook)
💯 A Ruby on Rails app to generate FizzBuzz numbers up to 100,000,000,000
- Updated Dec 14, 2022 (Ruby)
File Search-Sorting Algorithms for DBMS
- Updated Jun 24, 2024 (C)
This repository contains an introduction to pandas, a software library written for the Python programming language for data manipulation and analysis.
- Updated Dec 20, 2022 (Jupyter Notebook)
Analyzes Grid Engine log files and converts them into a format suitable for time series analysis using Grafana.
- Updated May 8, 2024 (Java)
Using Python to analyze large datasets
- Updated Jun 26, 2023 (Python)
Windows VSS (Volume Shadow Copy) path detector for 7-Zip and backup tools
- Updated Aug 20, 2025 (Batchfile)