maxent_grpo.training.generation.vllm_state

State containers used by the vLLM generation helper.

Classes

_VLLMGenerationState(prompts, target_counts, ...)

Track state shared across multiple vLLM retries.
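
The summary above does not list the fields behind `prompts` and `target_counts`, so the following is only a minimal sketch of how a retry-spanning state container could work. It is not the actual `_VLLMGenerationState` implementation. The class name `RetryStateSketch`, the `completions` field, and the `pending_indices`, `remaining`, and `record` methods are all hypothetical and introduced for illustration. The assumption is that `target_counts[i]` is the number of completions wanted for `prompts[i]`.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class RetryStateSketch:
    """Hypothetical stand-in; not the real _VLLMGenerationState."""

    prompts: List[str]
    target_counts: List[int]
    # Assumed field: completions collected so far, one list per prompt.
    completions: List[List[str]] = field(default_factory=list)

    def __post_init__(self) -> None:
        if len(self.prompts) != len(self.target_counts):
            raise ValueError("prompts and target_counts must have equal length")
        if not self.completions:
            self.completions = [[] for _ in self.prompts]

    def remaining(self, index: int) -> int:
        """Number of completions still missing for one prompt."""
        return max(0, self.target_counts[index] - len(self.completions[index]))

    def pending_indices(self) -> List[int]:
        """Indices of prompts that a further retry would need to resubmit."""
        return [i for i in range(len(self.prompts)) if self.remaining(i) > 0]

    def record(self, index: int, new: List[str]) -> None:
        """Store new completions, dropping any beyond the target count."""
        room = self.remaining(index)
        self.completions[index].extend(new[:room])


# Simulated first attempt: prompt 0 gets one of two completions,
# prompt 1 gets more than it asked for.
state = RetryStateSketch(prompts=["a", "b"], target_counts=[2, 1])
state.record(0, ["x"])
state.record(1, ["y", "z"])
```

Keeping the per-prompt results and targets in one object means each retry only has to ask `pending_indices()` which prompts are still short, rather than recomputing that from scratch.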