maxent\_grpo.training.baseline ============================== .. automodule:: maxent_grpo.training.baseline .. rubric:: Functions .. autosummary:: _builtin_weight_transfer_trainer_init _canonical_eval_benchmark_label _clear_vllm_client_buffer _collect_dataset_columns _encode_vllm_batched_update _ensure_split_mapping _force_vllm_dtype _get_column_names _guided_decoding_kwargs _import_builtin_vllm_weight_transfer _loopback_host _main_process_first _maybe_align_model_tokenizer_vocab _normalize_vllm_generate_url _patch_trl_vllm_client_init _patch_vllm_guided_decoding_compat _resolve_eval_dataset_preset _resolve_eval_dataset_spec _resolve_prompt_column _resolve_vllm_client_generate_boundary _resolve_vllm_group_port _split_eval_dataset_specs _temporary_env _tensor_nbytes _validate_dataset_columns _validate_vllm_completion_ids _vllm_client_nccl_overrides _vllm_sync_chunk_bytes run_baseline_training .. rubric:: Classes .. autosummary:: ChatTemplate _LazyModuleProxy