maxent_grpo.training.patches

Training-time integration patches.

Modules

vllm: Robust helpers for a vLLM /generate server.