amazon/gpt-oss-120b-p-eagle
Updated
•
26
Scalable Artificial Intelligence
Approximation of Log-Partition Function in Policy Mirror Descent Induces Implicit Regularization for LLM Post-Training
LikeBench: Evaluating Subjective Likability in LLMs for Personalization