Lukas Aichberger
banner
aichberger.bsky.social
Lukas Aichberger
@aichberger.bsky.social
Machine Learning ELLIS PhD at Johannes Kepler University Linz and University of Oxford
🏛️ This work was made possible with OATML and TVG at the University of Oxford (@ox.ac.uk). Special thanks to @yaringal.bsky.social, @adelbibi.bsky.social, @philiptorr.bsky.social, and @alasdair-p.bsky.social for their contributions.

📖 Read the paper: www.arxiv.org/abs/2503.10809
Attacking Multimodal OS Agents with Malicious Image Patches
Recent advances in operating system (OS) agents enable vision-language models to interact directly with the graphical user interface of an OS. These multimodal OS agents autonomously perform computer-...
www.arxiv.org
March 18, 2025 at 6:25 PM
💀 Harmful actions could include engaging with the malicious social media post to amplify its spread, navigating to a malicious website, or causing a memory overflow to crash your computer. Preventing such harmful actions remains an open challenge. [6/6]
March 18, 2025 at 6:25 PM
🎯 Once an OS agent – among those the MIP was optimised for – encounters the MIP during the execution of everyday tasks, empirical results indicate harmful actions are triggered in at least 9 out of 10 cases, regardless of the original task or screenshot layout. [5/6]
March 18, 2025 at 6:25 PM
🚨 The real danger? Attackers can simply embed MIPs in social media posts, wallpapers, or ads and spread them across the internet. Unlike text-based attacks, MIPs are hard to detect, allowing them to spread unnoticed. [4/6]
March 18, 2025 at 6:25 PM
🔓 Our work reveals that OS agents are not ready for safe integration into everyday life. Attackers can craft Malicious Image Patches (MIPs), subtle modifications to an image on the screen that, once encountered by an OS agent, deceive it into carrying out harmful actions. [3/6]
March 18, 2025 at 6:25 PM
💻 AI assistants, known as OS agents, autonomously control computers just like humans do. They navigate by analysing the screen and take actions via mouse and keyboard. OS agents could soon take over everyday tasks, saving users time and effort. [2/6]
March 18, 2025 at 6:25 PM
🙋‍♂️
November 19, 2024 at 9:59 PM