Aida Nematzadeh
banner
aidanematzadeh.bsky.social
Aida Nematzadeh
@aidanematzadeh.bsky.social
Research scientist at Google DeepMind.🦎
She/her.
http://www.aidanematzadeh.me/
This result is particularly interesting for capabilities that are harder for models (like numerical reasoning or text rendering) as prompt-aware guidance boosts performance on these failure modes without retraining.
September 30, 2025 at 4:00 PM
Most diffusion-based models use a fixed (model-tuned) guidance schedule. We show that picking the guidance value during inference, conditioned on the prompt/capability, significantly improves performance.

arxiv.org/abs/2509.16131
September 30, 2025 at 4:00 PM
We design 3 main tasks with varying degrees of difficulty and evaluate 13 models across different families. Models show rudimentary numerical reasoning skills, limited to small numbers and simple prompt formats; many models are affected by non-numerical prompt manipulations.
December 9, 2024 at 7:08 PM