Skip to content

Pull requests: huggingface/datasets

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Introduce pdf support (#7318)
#7325 opened Dec 12, 2024 by yabramuvdi Loading…
Resolved for empty datafiles
#7314 opened Dec 9, 2024 by sahillihas Loading…
refactor: remove unnecessary else
#7307 opened Dec 5, 2024 by HarikrishnanBalagopal Loading…
Remove upper version limit of fsspec[http]
#7296 opened Nov 20, 2024 by cyyever Loading…
Remove aiohttp from direct dependencies
#7294 opened Nov 18, 2024 by akx Loading…
Let soundfile directly read local audio files
#7278 opened Nov 4, 2024 by fawazahmed0 Loading…
1 task done
[MINOR:TYPO] Fix typo in exception text
#7274 opened Nov 1, 2024 by cakiki Loading…
fast array extraction
#7227 opened Oct 14, 2024 by alex-hh Loading…
Add with_rank to Dataset.from_generator
#7199 opened Oct 4, 2024 by muthissar Loading…
Add repeat method to datasets
#7198 opened Oct 4, 2024 by alex-hh Loading…
fix grammar in fingerprint.py
#7176 opened Sep 26, 2024 by jxmorris12 Loading…
Do not consume unnecessary memory during sharding
#7136 opened Sep 4, 2024 by janEbert Loading…
Fix data file module inference
#7132 opened Aug 29, 2024 by HennerM Loading…
Add Arabic Docs to Datasets
#7094 opened Aug 7, 2024 by AhmedAlmaghz Loading…
Make BufferShuffledExamplesIterable resumable
#7056 opened Jul 22, 2024 by yzhangcs Loading…
Support folder-based datasets with large metadata.jsonl
#6859 opened May 2, 2024 by gbenson Loading…
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.