Sweta Karlekar
@swetakar.bsky.social
2.6K followers
1.2K following
31 posts
Machine learning PhD student @ Blei Lab at Columbia University
Working in mechanistic interpretability, NLP, causal inference, and probabilistic modeling!
Previously at Meta for ~3 years on the Bayesian Modeling & Generative AI teams.
🔗 www.sweta.dev
Reposted by Sweta Karlekar
Andrew Jesson
@anndvision.bsky.social
· Dec 13
Hello!
We will be presenting Estimating the Hallucination Rate of Generative AI at NeurIPS. Come if you'd like to chat about epistemic uncertainty for In-Context Learning, or uncertainty more generally. :)
Location: East Exhibit Hall A-C #2703
Time: Friday @ 4:30
Paper: arxiv.org/abs/2406.07457
Reposted by Sweta Karlekar
Blei Lab
@bleilab.bsky.social
· Dec 2
I am very excited to share our new NeurIPS 2024 paper + package, Treeffuser! 🌳 We combine gradient-boosted trees with diffusion models for fast, flexible probabilistic predictions and well-calibrated uncertainty.
paper: arxiv.org/abs/2406.07658
repo: github.com/blei-lab/tre...
🧵(1/8)
Reposted by Sweta Karlekar
Blei Lab
@bleilab.bsky.social
· Dec 10
The circuit hypothesis proposes that LLM capabilities emerge from small subnetworks within the model. But how can we actually test this? 🤔
joint work with @velezbeltran.bsky.social @maggiemakar.bsky.social @anndvision.bsky.social @bleilab.bsky.social Adria @far.ai Achille and Caro
Reposted by Sweta Karlekar
Sweta Karlekar
@swetakar.bsky.social
· Dec 2
I am very excited to share our new NeurIPS 2024 paper + package, Treeffuser! 🌳 We combine gradient-boosted trees with diffusion models for fast, flexible probabilistic predictions and well-calibrated uncertainty.
paper: arxiv.org/abs/2406.07658
repo: github.com/blei-lab/tre...
🧵(1/8)
Sweta Karlekar
@swetakar.bsky.social
· Nov 29
GitHub - andrewyng/aisuite: Simple, unified interface to multiple Generative AI providers
github.com
Reposted by Sweta Karlekar
Sweta Karlekar
@swetakar.bsky.social
· Nov 22
Why Can GPT Learn In-Context? Language Models Implicitly Perform Gradient Descent as Meta-Optimizers
Large pretrained language models have shown surprising in-context learning (ICL) ability. With a few demonstration input-label pairs, they can predict the label for an unseen input without parameter u...
arxiv.org
Reposted by Sweta Karlekar
Sweta Karlekar
@swetakar.bsky.social
· Nov 20
An Extremely Opinionated Annotated List of My Favourite Mechanistic Interpretability Papers v2 — AI Alignment Forum
This post represents my personal hot takes, not the opinions of my team or employer. This is a massively updated version of a similar list I made two…
www.alignmentforum.org