How to get the original dataset name with username? #7311

npuichigo · 2024-12-08T07:18:14Z

Feature request

The issue is related to ray data ray-project/ray#49008 which it requires to check if the dataset is the original one just after load_dataset and parquet files are already available on hf hub.

The solution used now is to get the dataset name, config and split, then load_dataset again and check the fingerprint. But it's unable to get the correct dataset name if it contains username. So how to get the dataset name with username prefix, or is there another way to query if a dataset is the original one with parquet available?

@lhoestq

Motivation

ray-project/ray#49008

Your contribution

Would like to fix that.

The text was updated successfully, but these errors were encountered:

npuichigo added the enhancement New feature or request label Dec 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to get the original dataset name with username? #7311

How to get the original dataset name with username? #7311

npuichigo commented Dec 8, 2024 •

edited

Loading

How to get the original dataset name with username? #7311

How to get the original dataset name with username? #7311

Comments

npuichigo commented Dec 8, 2024 • edited Loading

Feature request

Motivation

Your contribution

npuichigo commented Dec 8, 2024 •

edited

Loading