Actions: huggingface/trl
Actions
Showing runs from all workflows
2,500+ workflow runs
2,500+ workflow runs
AsyncGRPOTrainer: add PEFT/LoRA support
Build PR Documentation
#16577:
Pull request #5896
opened
by
rycerzes
AsyncGRPOTrainer: add PEFT/LoRA support
Tests (experimental)
#1772:
Pull request #5896
opened
by
rycerzes
AsyncGRPOTrainer: add PEFT/LoRA support
PR Template Check
#960:
Pull request #5896
opened
by
rycerzes
AsyncGRPOTrainer: add ProcessorMixin handling
Build PR Documentation
#16576:
Pull request #5895
opened
by
rycerzes
AsyncGRPOTrainer: add ProcessorMixin handling
Tests (experimental)
#1771:
Pull request #5895
opened
by
rycerzes
AsyncGRPOTrainer: add ProcessorMixin handling
PR Template Check
#959:
Pull request #5895
opened
by
rycerzes
AsyncGRPOTrainer: add sampling parameters (top_p, top_k, min_p, repetition_penalty)
Build PR Documentation
#16575:
Pull request #5894
opened
by
rycerzes
AsyncGRPOTrainer: add sampling parameters (top_p, top_k, min_p, repetition_penalty)
Tests (experimental)
#1770:
Pull request #5894
opened
by
rycerzes
AsyncGRPOTrainer: add sampling parameters (top_p, top_k, min_p, repetition_penalty)
PR Template Check
#958:
Pull request #5894
opened
by
rycerzes
AsyncGRPOTrainer: add model_init_kwargs support
Build PR Documentation
#16574:
Pull request #5893
opened
by
rycerzes
AsyncGRPOTrainer: add model_init_kwargs support
Tests (experimental)
#1769:
Pull request #5893
opened
by
rycerzes
AsyncGRPOTrainer: add model_init_kwargs support
PR Template Check
#957:
Pull request #5893
opened
by
rycerzes