maxent_grpo.training.seed_paper_eval_callback¶
Trainer callback for official SEED paper-style eval against the live vLLM server.
Functions
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Classes
|
Run the official SEED paper-style eval against the live vLLM server. |
- class maxent_grpo.training.seed_paper_eval_callback.SeedPaperEvalCallback(training_args)[source]¶
Bases: <class ‘TrainerCallback’>
Run the official SEED paper-style eval against the live vLLM server.
- Parameters:
training_args (Any)