☄️ Update Comet integration to include LogCompletionsCallback and Trainer.evaluation_loop() #2501
Conversation
```python
if "comet_ml" in self.args.report_to:
    log_table_to_comet_experiment(
        name="game_log.csv",
```
can you save it in the `output_dir` instead?
The Comet SDK posts submitted files to the server in the background. The intermediate copy of the file is stored in a temporary directory until the upload to the Comet server completes.
Could you please elaborate on your idea so I can understand it better?
When you run `experiment.log_table(tabular_data=table, filename=name)`, does it save something locally in a file named `name`?
Yes, it indeed saves the table data into a temporary file in a temporary directory. That file lives until its upload to the Comet server is complete. After that it is cleaned up automatically, either by the Comet SDK or by the OS if something goes wrong during the Python script's execution.
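For illustration only, here is a minimal sketch of what a helper like the one this PR adds could look like, using only public `comet_ml` calls; the assumption is that `comet_ml.get_global_experiment()` returns the experiment created by the `transformers` Comet callback, and the actual TRL implementation may differ:

```python
import comet_ml
import pandas as pd


def log_table_to_comet_experiment(name: str, table: pd.DataFrame) -> None:
    # Assumed lookup of the already-running experiment; returns None if no
    # Comet experiment has been started for this process.
    experiment = comet_ml.get_global_experiment()
    if experiment is not None:
        # log_table() writes the DataFrame to a temporary CSV file and uploads
        # it in the background, as described in the comment above.
        experiment.log_table(tabular_data=table, filename=name)


# Example with the same table shape as the game_log discussed in this thread.
table = pd.DataFrame(columns=["Prompt", "Policy"], data=[["2 + 2 =", " 4"]])
log_table_to_comet_experiment(name="game_log.csv", table=table)
```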
Can you screenshot a result?
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
And this is a DataFrame encoded as CSV.
The script I was using to test the DPO trainer integration:

```python
import os

from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

os.environ["TOKENIZERS_PARALLELISM"] = "false"


def main():
    output_dir = "models/minimal/dpo_my"
    model_id = "trl-internal-testing/tiny-Qwen2ForCausalLM-2.5"
    # model_id = "Qwen/Qwen2-0.5B-Instruct"

    model = AutoModelForCausalLM.from_pretrained(model_id)
    ref_model = AutoModelForCausalLM.from_pretrained(model_id)
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    tokenizer.pad_token = tokenizer.eos_token

    training_args = DPOConfig(
        output_dir=output_dir,
        per_device_train_batch_size=2,
        max_steps=1,
        remove_unused_columns=False,
        gradient_accumulation_steps=8,
        precompute_ref_log_probs=False,
        learning_rate=5.0e-7,
        eval_strategy="steps",
        eval_steps=1,
        report_to="all",
        generate_during_eval=True,
        max_length=1024,
    )

    # dummy_dataset = load_dataset("trl-internal-testing/zen", "standard_preference")
    dummy_dataset = load_dataset("trl-lib/ultrafeedback_binarized", "default")
    dummy_dataset["train"] = dummy_dataset["train"].select(range(20))
    dummy_dataset["test"] = dummy_dataset["test"].select(range(40))

    trainer = DPOTrainer(
        model=model,
        ref_model=ref_model,
        args=training_args,
        processing_class=tokenizer,
        train_dataset=dummy_dataset["train"],
        eval_dataset=dummy_dataset["test"],
    )
    trainer.train()
    trainer.evaluate()


if __name__ == "__main__":
    main()
```

Do not forget to set …
```diff
@@ -24,6 +24,7 @@
 from typing import TYPE_CHECKING, Any, Callable, Literal, Optional, Union

 import numpy as np
+import pandas as pd
```
`pandas` is indeed installed: it comes in transitively via `trl` -> `datasets` -> `pandas`.
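If you want to double-check that transitive dependency locally, a quick stdlib-only sanity check (nothing TRL-specific is assumed here):

```python
import importlib.util

# pandas arrives via trl -> datasets -> pandas, so all three should resolve
# in any environment where trl is installed.
for pkg in ("trl", "datasets", "pandas"):
    print(pkg, "->", "found" if importlib.util.find_spec(pkg) else "missing")
```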
```python
table = pd.DataFrame(
    columns=["Prompt", "Policy"],
    data=[
        [prompt, pol[len(prompt) :]] for prompt, pol in zip(random_batch["prompt"], policy_output_decoded)
    ],
```
For the record, this doesn't work, because `pol` can be over-truncated. It can be reproduced with:
```python
from datasets import load_dataset
from trl import CPOConfig, CPOTrainer
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2-0.5B-Instruct")
tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen2-0.5B-Instruct")

train_dataset = load_dataset("trl-lib/ultrafeedback_binarized", split="train[:1%]")
eval_dataset = load_dataset("trl-lib/ultrafeedback_binarized", split="test[:1%]")

training_args = CPOConfig(output_dir="Qwen2-0.5B-CPO", logging_steps=10, generate_during_eval=True, eval_steps=2, eval_strategy="steps")
trainer = CPOTrainer(model=model, args=training_args, processing_class=tokenizer, train_dataset=train_dataset, eval_dataset=eval_dataset)
trainer.train()
```
But this is out of the scope of this PR.
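For context, a common way to sidestep that over-truncation pitfall is to slice in token space before decoding, rather than stripping the prompt string afterwards. This is only a hedged sketch with example model and prompt names, not what the trainers currently do:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2-0.5B-Instruct")
tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen2-0.5B-Instruct")

prompts = ["The capital of France is"]
inputs = tokenizer(prompts, return_tensors="pt", padding=True)
output_ids = model.generate(**inputs, max_new_tokens=32)

# Everything after the prompt length along the sequence dimension is newly
# generated, so decoding only that slice cannot eat into the completion even
# when the decoded prompt prefix does not match the original string exactly.
generated_ids = output_ids[:, inputs["input_ids"].shape[1]:]
completions = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)
print(completions)
```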
very nice, thanks @yaricom!
What does this PR do?
Updated the Comet integration to include the following (a brief usage sketch follows this list):
- `LogCompletionsCallback`
- `CPOTrainer.evaluation_loop()`
- `DPOTrainer.evaluation_loop()`
- `BCOTrainer.evaluation_loop()`
- `KTOTrainer.evaluation_loop()`
- `ORPOTrainer.evaluation_loop()`
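As a rough usage sketch (not part of this PR's diff): the callback is attached to an existing TRL trainer whose eval dataset has a "prompt" column, for example the DPO script shown earlier in this thread. The constructor arguments below follow TRL's callback docs and should be treated as an assumption rather than something verified against this PR:

```python
from transformers import GenerationConfig
from trl import LogCompletionsCallback

# Assumes `trainer` was created with report_to="comet_ml" (or "all") and an
# eval dataset containing a "prompt" column, as in the DPO script above.
generation_config = GenerationConfig(max_new_tokens=64)
completions_callback = LogCompletionsCallback(
    trainer=trainer,
    generation_config=generation_config,
    num_prompts=8,  # how many eval prompts to sample for the completions table
)
trainer.add_callback(completions_callback)
trainer.train()  # completion tables should then appear in the Comet experiment
```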
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag members/contributors who may be interested in your PR.