The agent autonomously operates over long time horizons and does not seek user involvement at all. The user passively monitors the agent via activity logs and has access to an emergency off switch.
Example: Sakana AI's AI Scientist.
The agent autonomously operates over long time horizons and does not seek user involvement at all. The user passively monitors the agent via activity logs and has access to an emergency off switch.
Example: Sakana AI's AI Scientist.
The agent only requests user involvement when it needs approval for a high-risk action (e.g., writing to a database) or when it fails and needs user assistance.
Example: most coding agents (Cursor, Devin, GH Copilot Agent, etc.).
The agent only requests user involvement when it needs approval for a high-risk action (e.g., writing to a database) or when it fails and needs user assistance.
Example: most coding agents (Cursor, Devin, GH Copilot Agent, etc.).
The agent takes the lead in task planning and execution, but actively consults the user to elicit rich preferences and feedback. Unlike L1 & L2, the user can no longer directly control the agent's workflow.
Example: deep research systems.
The agent takes the lead in task planning and execution, but actively consults the user to elicit rich preferences and feedback. Unlike L1 & L2, the user can no longer directly control the agent's workflow.
Example: deep research systems.
The user and the agent collaboratively plan and execute tasks, handing off information to each other and leveraging shared environments and representations to create common ground.
Example: Cocoa (arxiv.org/abs/2412.10999).
The user and the agent collaboratively plan and execute tasks, handing off information to each other and leveraging shared environments and representations to create common ground.
Example: Cocoa (arxiv.org/abs/2412.10999).
The user is in charge of high-level planning to steer the agent. The agent acts when directed, providing on-demand assistance.
Example: your average "copilot" that drafts your emails when you ask it to.
The user is in charge of high-level planning to steer the agent. The agent acts when directed, providing on-demand assistance.
Example: your average "copilot" that drafts your emails when you ask it to.
The agent autonomously operates over long time horizons and does not seek user involvement at all. The user passively monitors the agent via activity logs and has access to an emergency off switch.
Example: Sakana AI's AI Scientist.
The agent autonomously operates over long time horizons and does not seek user involvement at all. The user passively monitors the agent via activity logs and has access to an emergency off switch.
Example: Sakana AI's AI Scientist.
The agent only requests user involvement when it needs approval for a high-risk action (e.g., writing to a database) or when it fails and needs user assistance.
Example: most coding agents (Cursor, Devin, GH Copilot Agent, etc.).
The agent only requests user involvement when it needs approval for a high-risk action (e.g., writing to a database) or when it fails and needs user assistance.
Example: most coding agents (Cursor, Devin, GH Copilot Agent, etc.).
The agent takes the lead in task planning and execution, but actively consults the user to elicit rich preferences and feedback. Unlike L1 & L2, the user can no longer directly control the agent's workflow.
Example: deep research systems.
The agent takes the lead in task planning and execution, but actively consults the user to elicit rich preferences and feedback. Unlike L1 & L2, the user can no longer directly control the agent's workflow.
Example: deep research systems.
The user and the agent collaboratively plan and execute tasks, handing off information to each other and leveraging shared environments and representations to create common ground.
Example: Cocoa (arxiv.org/abs/2412.10999).
The user and the agent collaboratively plan and execute tasks, handing off information to each other and leveraging shared environments and representations to create common ground.
Example: Cocoa (arxiv.org/abs/2412.10999).
The user is in charge of high-level planning to steer the agent. The agent acts when directed, providing on-demand assistance.
Example: your average "copilot" that drafts your emails when you ask it to.
The user is in charge of high-level planning to steer the agent. The agent acts when directed, providing on-demand assistance.
Example: your average "copilot" that drafts your emails when you ask it to.
We often talk about AI agents augmenting vs. automating work, but how exactly can different configurations of human-agent interaction look like? We introduce a 5-level framework for AI agent autonomy to unpack this.
🧵👇
We often talk about AI agents augmenting vs. automating work, but how exactly can different configurations of human-agent interaction look like? We introduce a 5-level framework for AI agent autonomy to unpack this.
🧵👇
In our #chi2025 paper, we empower designers to think about this via 🎨designerly adaptation🎨 of LLMs and built a Figma widget to help!
📜 arxiv.org/abs/2401.09051
🧵👇
In our #chi2025 paper, we empower designers to think about this via 🎨designerly adaptation🎨 of LLMs and built a Figma widget to help!
📜 arxiv.org/abs/2401.09051
🧵👇
Cocoa embeds AI agents into docs with a new design pattern—interactive plans. They’re “computational notebooks” for humans + agents, breaking tasks down into interactive steps. We designed Cocoa for scientific research, an important & challenging domain for AI.
Cocoa embeds AI agents into docs with a new design pattern—interactive plans. They’re “computational notebooks” for humans + agents, breaking tasks down into interactive steps. We designed Cocoa for scientific research, an important & challenging domain for AI.
🍫Introducing Cocoa, our new interaction paradigm for balancing human & AI agency in complex human-AI workflows. 🧵
🍫Introducing Cocoa, our new interaction paradigm for balancing human & AI agency in complex human-AI workflows. 🧵
🌐 Website: chi-staig.github.io
🗓️ Submit your work by: Feb 17, 2025
🌐 Website: chi-staig.github.io
🗓️ Submit your work by: Feb 17, 2025