maxent_grpo.training.runtime.ops
Runtime operational helpers.
- class maxent_grpo.training.runtime.ops.StartupStatus(value)
High-level startup state derived from vLLM log lines.
- STARTING = 'starting'
- HEALTHY = 'healthy'
- CORE_ENGINE_STALL = 'core_engine_stall'
- ERROR = 'error'
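Each member carries a plain string value, so a status can be recovered from its string form, for example when it is read back from a log line or a JSON record. The sketch below re-declares the enum locally from the documented values so that it runs on its own; the real class may use a base class or mixin that is not shown on this page.

```python
from enum import Enum


class StartupStatus(Enum):
    # Local re-declaration for illustration; values copied from the listing above.
    STARTING = "starting"
    HEALTHY = "healthy"
    CORE_ENGINE_STALL = "core_engine_stall"
    ERROR = "error"


# Standard Enum lookup by value returns the matching member.
status = StartupStatus("core_engine_stall")
print(status.name, status.value)
```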
- maxent_grpo.training.runtime.ops.classify_vllm_startup_log(log_text, stall_threshold=3)
Classify startup progress using marker patterns in log_text.
- Parameters:
- Return type:
- maxent_grpo.training.runtime.ops.should_trigger_v0_fallback(log_text, attempt, min_attempts=20, stall_threshold=3)
Return True when vLLM startup appears stuck and should be relaunched in V0 mode.
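The two functions above could plausibly fit together as in the following sketch. It is not the library's implementation. The marker strings are hypothetical placeholders rather than real vLLM log lines. The check order is assumed, as is the rule that a stall means the wait marker appears at least `stall_threshold` times, and so is the rule that fallback requires both `attempt >= min_attempts` and a detected stall. The `_sketch` names are illustrative only.

```python
import re
from enum import Enum


class StartupStatus(Enum):
    # Local re-declaration; values copied from the documented enum members.
    STARTING = "starting"
    HEALTHY = "healthy"
    CORE_ENGINE_STALL = "core_engine_stall"
    ERROR = "error"


# HYPOTHETICAL markers: placeholders, not actual vLLM log output.
HEALTHY_RE = re.compile(r"EXAMPLE_SERVER_READY")
ERROR_RE = re.compile(r"EXAMPLE_FATAL_ERROR")
STALL_RE = re.compile(r"EXAMPLE_WAITING_FOR_CORE_ENGINE")


def classify_startup_log_sketch(log_text, stall_threshold=3):
    """Map log text to a StartupStatus using the placeholder markers above."""
    if ERROR_RE.search(log_text):
        return StartupStatus.ERROR
    if HEALTHY_RE.search(log_text):
        return StartupStatus.HEALTHY
    # Assumption: a stall is reported once the wait marker repeats often enough.
    if len(STALL_RE.findall(log_text)) >= stall_threshold:
        return StartupStatus.CORE_ENGINE_STALL
    return StartupStatus.STARTING


def should_trigger_fallback_sketch(log_text, attempt, min_attempts=20, stall_threshold=3):
    """Assumed rule: fall back only after enough attempts and a detected stall."""
    if attempt < min_attempts:
        return False
    status = classify_startup_log_sketch(log_text, stall_threshold)
    return status is StartupStatus.CORE_ENGINE_STALL


stalled_log = "EXAMPLE_WAITING_FOR_CORE_ENGINE\n" * 3
print(classify_startup_log_sketch(stalled_log))
print(should_trigger_fallback_sketch(stalled_log, attempt=20))
```

Under these assumptions, a caller polling the server log would pass the accumulated text and its current poll count, and relaunch in V0 mode only when the second function returns True.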
Modules
Detect and classify vLLM server startup stalls from log text.