Actions: huggingface/trl
Actions
722 workflow runs
722 workflow runs
model_args
(#2442)
Build documentation
#1004:
Commit 460e780
pushed
by
qgallouedec
ref_model
in OnlineDPOTrainer
(#2417)
Build documentation
#1003:
Commit 7ba118a
pushed
by
qgallouedec
max_steps
calculation in RLOOTrainer
(#2433)
Build documentation
#998:
Commit 52201d3
pushed
by
qgallouedec
DPOTrainer
(#2413)
Build documentation
#994:
Commit 8d9cfaa
pushed
by
qgallouedec
AutoModelForCausalLMWithValueHead
(#2398)
Build documentation
#993:
Commit 94e4135
pushed
by
qgallouedec
SmolVLM
models via standalone script `sft_…
Build documentation
#990:
Commit e1d7813
pushed
by
qgallouedec
KTOTrainer
(#2394)
Build documentation
#985:
Commit baee06f
pushed
by
qgallouedec