Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[EVAL] Add JGLUE Test #455

Open
ryan-minato opened this issue Dec 18, 2024 · 2 comments
Open

[EVAL] Add JGLUE Test #455

ryan-minato opened this issue Dec 18, 2024 · 2 comments
Labels

Comments

@ryan-minato
Copy link
Contributor

JGLUE is a widely used test set in the Japanese LLM research community, consisting of five sub-tests (with MARC-ja removed due to a request from Amazon):

  • MARC-ja
  • JSTS
  • JNLI
  • JSQuAD
  • JCommonsenseQA

Most of these sub-tests have English counterparts, and all of these are available for use under the CC-BY-SA-4.0 license.

JGLUE does not have an official dataset mirror on Hugging Face, and some of the tests lack community mirrors as well. I am currently processing and uploading the four sub-tests that remain accessible.

Due to performance issues with llm-jp-eval on my device, I am working on integrating the test sets used by llm-jp-eval into lighteval. If successful, this integration could greatly improve the evaluation of Japanese LLMs.

Evaluation metadata

Provide all available

@ryan-minato ryan-minato changed the title [EVAL] Adding the JGLUE Test Set [EVAL] Add JGLUE Test Dec 18, 2024
@clefourrier
Copy link
Member

Hi! This sounds good! You might encounter specific issues related to tokenization, so it would be good to double check this when it's added :)

@NathanHB
Copy link
Member

Great ! Don't hesitate to refer to the documentation to add your custom tasks.
https://huggingface.co/docs/lighteval/adding-a-custom-task

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants