Tejas Srinivasan
@tejassrinivasan.bsky.social
CS PhD student at USC. Former research intern at AI2 Mosaic. Interested in human-AI interaction and language grounding.
I'm trying to make "bleet" a thing
May 30, 2025 at 4:36 PM
I'm trying to make "bleet" a thing
The only silver lining of my ACL rejection is that I have something to submit to EMNLP
May 16, 2025 at 7:59 PM
The only silver lining of my ACL rejection is that I have something to submit to EMNLP
Ty for the plug 🙏
Model confidence is a good decision aid (arxiv.org/pdf/2001.02114), while explanations are less useful and can cause over-reliance (arxiv.org/abs/2310.12558, arxiv.org/pdf/2406.19170). Other interaction cues like AI warmth can also make a difference (arxiv.org/abs/2407.07950).
Model confidence is a good decision aid (arxiv.org/pdf/2001.02114), while explanations are less useful and can cause over-reliance (arxiv.org/abs/2310.12558, arxiv.org/pdf/2406.19170). Other interaction cues like AI warmth can also make a difference (arxiv.org/abs/2407.07950).
March 13, 2025 at 1:03 AM
Ty for the plug 🙏
Model confidence is a good decision aid (arxiv.org/pdf/2001.02114), while explanations are less useful and can cause over-reliance (arxiv.org/abs/2310.12558, arxiv.org/pdf/2406.19170). Other interaction cues like AI warmth can also make a difference (arxiv.org/abs/2407.07950).
Model confidence is a good decision aid (arxiv.org/pdf/2001.02114), while explanations are less useful and can cause over-reliance (arxiv.org/abs/2310.12558, arxiv.org/pdf/2406.19170). Other interaction cues like AI warmth can also make a difference (arxiv.org/abs/2407.07950).
What do you mean by core capabilities, for VLMS? IMO core capabilities should be determined by the applications we care about, and I'd argue medical use cases are as important (if not more) as MSCOCO-style images/scenes
March 10, 2025 at 4:05 PM
What do you mean by core capabilities, for VLMS? IMO core capabilities should be determined by the applications we care about, and I'd argue medical use cases are as important (if not more) as MSCOCO-style images/scenes
What are you using o1pro for? And in what aspects do you think it's better than other LLMs?
February 28, 2025 at 7:40 PM
What are you using o1pro for? And in what aspects do you think it's better than other LLMs?
Is this advice you reserve for a particular class of problems, or is it just generally applicable because we still don't know the full breadth of LLM capabilities?
February 28, 2025 at 7:39 PM
Is this advice you reserve for a particular class of problems, or is it just generally applicable because we still don't know the full breadth of LLM capabilities?
I'm always three days away from being three days away
February 28, 2025 at 5:24 PM
I'm always three days away from being three days away
We hope our work inspires the community to more closely consider how user characteristics, including but not limited to trust, affect how people rely on AI assistance.
Work done with the always-awesome @thomason.bsky.social!
Work done with the always-awesome @thomason.bsky.social!
February 27, 2025 at 6:02 PM
We hope our work inspires the community to more closely consider how user characteristics, including but not limited to trust, affect how people rely on AI assistance.
Work done with the always-awesome @thomason.bsky.social!
Work done with the always-awesome @thomason.bsky.social!
Improving AI reliability is more important than ever as AI systems are increasingly deployed in real-world settings with high stakes. We believe it is important for AI researchers to think about the user-AI dyad 🧑🤖, rather than just the AI in a vacuum.
February 27, 2025 at 6:01 PM
Improving AI reliability is more important than ever as AI systems are increasingly deployed in real-world settings with high stakes. We believe it is important for AI researchers to think about the user-AI dyad 🧑🤖, rather than just the AI in a vacuum.
These findings show that being able to estimate users’ trust levels can enhance human-AI collaboration 💪 but we also find that modeling user trust is very challenging! 😓 Our work reveals promising new directions for user modeling that extend beyond merely learning user preferences.
February 27, 2025 at 6:01 PM
These findings show that being able to estimate users’ trust levels can enhance human-AI collaboration 💪 but we also find that modeling user trust is very challenging! 😓 Our work reveals promising new directions for user modeling that extend beyond merely learning user preferences.
We show that adapting AI behavior to user trust levels, by showing AI explanations during moments of low trust and counter-explanations during high trust, effectively mitigates inappropriate reliance and improves decision accuracy! These improvements are also seen with other intervention strategies.
February 27, 2025 at 6:00 PM
We show that adapting AI behavior to user trust levels, by showing AI explanations during moments of low trust and counter-explanations during high trust, effectively mitigates inappropriate reliance and improves decision accuracy! These improvements are also seen with other intervention strategies.
In two decision-making tasks, we find that low and high user trust levels worsen under-reliance and over-reliance on AI recommendations, respectively 💀💀💀
Can the AI assistant do something differently when user trust is low/high to prevent such inappropriate reliance? Yes!
Can the AI assistant do something differently when user trust is low/high to prevent such inappropriate reliance? Yes!
February 27, 2025 at 5:59 PM
In two decision-making tasks, we find that low and high user trust levels worsen under-reliance and over-reliance on AI recommendations, respectively 💀💀💀
Can the AI assistant do something differently when user trust is low/high to prevent such inappropriate reliance? Yes!
Can the AI assistant do something differently when user trust is low/high to prevent such inappropriate reliance? Yes!
Do each of these correspond to a particular conf deadline? I'm guessing
May: EMNLP
July: AACL?
Oct: EACL/NAACL
Feb: ACL
May: EMNLP
July: AACL?
Oct: EACL/NAACL
Feb: ACL
February 19, 2025 at 6:44 PM
Do each of these correspond to a particular conf deadline? I'm guessing
May: EMNLP
July: AACL?
Oct: EACL/NAACL
Feb: ACL
May: EMNLP
July: AACL?
Oct: EACL/NAACL
Feb: ACL