maxent_grpo.training.patches

Training-time integration patches.

Modules

vllm

Robust helpers for a vLLM /generate server.