Publications
Please check my Google Scholar page for the latest updates.
2026
- Personalized Benchmarking: Evaluating LLMs by Individual PreferencesarXiv preprint arXiv:2604.18943, 2026
2025
- Hyperalign: Interpretable Personalized LLM Alignment via Hypothesis GenerationarXiv preprint arXiv:2505.00038, 2025
- Evaluating the Goal-Directedness of Large Language ModelsarXiv preprint arXiv:2504.11844, 2025
2024
2023
- Neural language generation for content adaptation: Explainable, efficient low-resource text simplification and evaluationUniversity of Michigan, 2023
- What’s your Use Case? A Taxonomy of Causal Evaluations of Post-hoc InterpretabilityIn Causal Representation Learning Workshop at NeurIPS 2023, 2023
2022
- GEM v2: Multilingual NLG benchmarking in a single line of codeIn Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2022
- Speech coding using content latent embedding vectors and speaker latent embedding vectors2022US Patent 11,257,507
2021
- The GEM benchmark: Natural language generation, its evaluation and metricsIn Proceedings of the 1st Workshop on Natural Language Generation, Evaluation, and Metrics (GEM 2021), 2021
- Explainable prediction of text complexity: The missing preliminaries for text simplificationIn Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), 2021
2020
- Neural language generation: Formulation, methods, and evaluationarXiv preprint arXiv:2007.15780, 2020
- UMSIForeseer at SemEval-2020 Task 11: Propaganda detection by fine-tuning BERT with resampling and ensemble learningIn Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2020
2019
2017
- A Systematic Analysis of Sentence Update Detection for Temporal SummarizationIn European Conference on Information Retrieval, 2017
2016
- Temporal Summarization of News StreamsUniversity of Amsterdam, 2016
2015
- Supporting exploration of historical perspectives across collectionsIn International Conference on Theory and Practice of Digital Libraries, 2015
- Combining Multiple Signals for Semanticizing Tweets: University of Amsterdam at# Microposts2015.In # MSM, 2015
- The University of Amsterdam (ILPS. UvA) at TREC 2015 Temporal Summarization Track.In TREC, 2015
2014
-
- Feature Selection and Data Sampling Methods for Learning Reputation Dimensions.In CLEF (Working Notes), 2014