Pull requests: huggingface/datasets
Pull requests list
fix: prevent duplicate keywords in load_dataset_builder (#4910)
#8008 opened Feb 16, 2026 by DhyeyTeraiya
fix save_to_disk/load_from_disk with pathlib.Path input
#8004 opened Feb 13, 2026 by Mr-Neutr0n
Fix Dataset.map writer initialization when early examples return None
#7996 opened Feb 8, 2026 by veeceey
✨ Add 'SparseCsv' builder and 'sparse_collate_fn' for efficient high-dimensional sparse data loading
#7993 opened Feb 4, 2026 by Ebraheem1
Fix index out of bound error with original_shard_lengths.
#7987 opened Feb 4, 2026 by jonathanasdf
Fix unstable tokenizer fingerprinting (enables map cache reuse)
#7982 opened Feb 2, 2026 by KOKOSde
feat: implement iter_arrow for skip, take and step iterables
#7972 opened Jan 30, 2026 by Edge-Explorer
Issue 7756 Fix - multiprocessing hang issue with start method check
#7967 opened Jan 28, 2026 by vedanta777
Use Sequence instead of list in Dataset.from_parquet type hints
#7962 opened Jan 26, 2026 by Mukundtimbadiya20
#5354: replace list with Sequence in from_parquet type hints
#7953 opened Jan 19, 2026 by ashmi8
feat: Add GenBank file format support for biological sequence data
#7951 opened Jan 19, 2026 by behroozazarkhalili
2 tasks done
Remove Python 3.7 and Python 2 code paths from _dill.py
#7941 opened Jan 13, 2026 by tboerstad
Improve readability and documentation of indexing integration tests
#7940 opened Jan 13, 2026 by DeeptiAgarwal16
3 tasks done
Fix duplicate log messages by disabling log propagation by default
#7937 opened Jan 12, 2026 by tboerstad
Fix duplicate keyword conflict in load_dataset_builder
#7932 opened Jan 3, 2026 by Ashish570raj