I am excited by the idea of using AI to help people manage illness and health conditions. This isn't very sexy, but I think there is real potential to improve health outcomes and quality of life.
ehudreiter.com/2026/01/19/l...
www.theguardian.com/uk-news/2026...
Google: Okay, we'll block "AI" overviews on that query.
The product is fundamentally flawed and cannot be "fixed" by patching query by query.
A short 🧵>>
I hope to retire soon, and many people are asking about my plans. Basically I want to do lots of travel, stay involved in academia, and perhaps do some writing.
ehudreiter.com/2026/01/06/r...
Researchers should do a “sanity” check on experiments. That is, manually inspect some (A) test/train data, (B) model/system output, and (C) evaluation output, looking for anything that seems strange.
ehudreiter.com/2025/12/22/d...
data contamination, reward hacking, saturation; ensure construct validity; rigorously test and validate, etc.
Unfortunately, the community places little value on the above. People want to beat SOTA or competitors, and don't care if the benchmark used means anything...
LLMs often “cheat” on benchmarks via data contamination and reward hacking. This problem is getting worse, perhaps because of perverse incentives. Need to move beyond benchmarks and start measuring real-world impact.
ehudreiter.com/2025/12/08/d...
Research culture is very important but also very hard to change. I suspect this is one reason why it is so difficult to get people to do more rigorous and meaningful experiments.
ehudreiter.com/2025/11/24/h...
Closing 28 Nov
www.abdn.ac.uk/jobs/vacanci...
When building an NLG system, it really helps to understand what users want; this came up several times at the recent INLG conference. I discuss some of our work in this space, and give a few suggestions.
ehudreiter.com/2025/11/06/u...
Data on usage of AI in healthcare suggests that the most common uses in 2025 are probably (A) giving personalised health information to patients and (B) helping clinicians write documents.
ehudreiter.com/2025/10/21/m...
I've seen a number of diagrams recently which are too complicated and difficult to understand. I explain some of the problems I see and give advice.
ehudreiter.com/2025/10/08/g...
dl.acm.org/doi/10.1145/...