news
| Date | News |
|---|---|
| May 08, 2026 | Personalized Benchmarking: Evaluating LLMs by Individual Preferences is accepted to ACL Findings 2026 |
| May 19, 2025 | New paper HyPerAlign: Interpretable Personalized LLM Alignment via Hypothesis Generation |
| May 15, 2025 | New paper Evaluating the Goal-Directedness of Large Language Models |
| May 01, 2025 | RATE: Causal Explainability of Reward Models with Imperfect Counterfactuals is accepted to ICML 2025 |
| Jan 20, 2025 | Why is constrained neural language generation particularly challenging? is accepted to TMLR 2025 |
| Sep 25, 2024 | BoNBoN Alignment for Large Language Models and the Sweetness of Best-of-n Sampling is accepted to NeurIPS 2024 |