maxent_grpo.core
Core domain helpers (data, evaluation, hub, model access).
Modules
Dataset loading utilities with support for mixtures.
LightEval task registration and Slurm launch utilities.
Helpers for working with the Hugging Face Hub.
Copyright 2025 Liv d'Aliberti