[EVAL] Add JGLUE Test #455

ryan-minato · 2024-12-18T10:08:19Z

JGLUE is a widely used test set in the Japanese LLM research community, consisting of five sub-tests (with MARC-ja removed due to a request from Amazon):

MARC-ja
JSTS
JNLI
JSQuAD
JCommonsenseQA

Most of these sub-tests have English counterparts, and all of these are available for use under the CC-BY-SA-4.0 license.

JGLUE does not have an official dataset mirror on Hugging Face, and some of the tests lack community mirrors as well. I am currently processing and uploading the four sub-tests that remain accessible.

Due to performance issues with llm-jp-eval on my device, I am working on integrating the test sets used by llm-jp-eval into lighteval. If successful, this integration could greatly improve the evaluation of Japanese LLMs.

Evaluation metadata

Provide all available

Paper url:
Github url: https://github.com/yahoojapan/JGLUE
Dataset url:

The text was updated successfully, but these errors were encountered:

clefourrier · 2024-12-18T10:23:00Z

Hi! This sounds good! You might encounter specific issues related to tokenization, so it would be good to double check this when it's added :)

NathanHB · 2024-12-19T12:25:27Z

Great ! Don't hesitate to refer to the documentation to add your custom tasks.
https://huggingface.co/docs/lighteval/adding-a-custom-task

ryan-minato added the new task label Dec 18, 2024

ryan-minato changed the title ~~[EVAL] Adding the JGLUE Test Set~~ [EVAL] Add JGLUE Test Dec 18, 2024

ryan-minato mentioned this issue Dec 19, 2024

feat: add JGLUE tasks #469

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[EVAL] Add JGLUE Test #455

[EVAL] Add JGLUE Test #455

ryan-minato commented Dec 18, 2024

clefourrier commented Dec 18, 2024

NathanHB commented Dec 19, 2024

[EVAL] Add JGLUE Test #455

[EVAL] Add JGLUE Test #455

Comments

ryan-minato commented Dec 18, 2024

Evaluation metadata

clefourrier commented Dec 18, 2024

NathanHB commented Dec 19, 2024