rohit
banner
rnair.bsky.social
rohit
@rnair.bsky.social
agi whisperer
Reposted by rohit
of all the benchmarks being sent around for Claude 3.7 this is the one i'm paying the most attention to. they're cheating a little bit by giving it the oldest, original pokemon game (red/blue) which is more than 20 years old and will have plenty of info online to learn from.
February 24, 2025 at 7:39 PM
Reposted by rohit
sheesh! what a day for AI. QwQ-Max, sonnet-3.7 AND open source of FlashMLA
February 24, 2025 at 9:31 PM
the only interview question i need to ask is:

how many sqlite databases do you have on your machine and what do you use them for?

if you're able to answer with a specific number, you're not ready yet unless you have a custom cron job cleaning up unused sqlite files.
February 25, 2025 at 2:38 AM
i have a hoarding problem
February 25, 2025 at 2:27 AM
i don't check social media for 8h and people are already playing with claude 3.7 sonnet
February 25, 2025 at 2:19 AM
edtech companies who've amassed a bank of their own content in the last decade are sitting on a goldmine.
February 15, 2025 at 7:29 PM
saving all of my generated deepseek r1's reasoning traces to a USB drive so when the time comes i can send it back to my 12 year old self to prevent him from bashing his head against the desk for being stuck on the 2nd problem of an AMC 12.
February 15, 2025 at 7:22 PM
Reposted by rohit
DeepSeek, a LLM trained for a fraction of the cost of GPT-Xx models, in 2 months for 6 million, on limited GPUs due to export restrictions, and competing head to head. This is crazy.

It's not the AI part I'm excited about, it's the level of efficiency. github.com/deepseek-ai/...
GitHub - deepseek-ai/DeepSeek-V3
Contribute to deepseek-ai/DeepSeek-V3 development by creating an account on GitHub.
github.com
December 31, 2024 at 5:07 PM
engineering blogs and white papers from the heyday of pre-AI tech are a treasure trove of insights and decision making processes without the burden of validating AI slop.

don't discount them merely on their being of outdated.
December 31, 2024 at 6:06 PM
if you're a cs student aspiring to become a software engineer in 2025, make sure to hone adjacent skills like writing, marketing, & sales.

in the age of agents, establishing human connection & building an audience organically will make set you apart from the rest.
December 31, 2024 at 5:55 PM
Reposted by rohit
Gonna think more thoughts in 2025
December 31, 2024 at 5:04 PM
Reposted by rohit
supercharge your LLM apps with smolagents 🔥

however cool your LLM is, without being agentic it can only go so far

enter smolagents: a new agent library by @hf.co to make the LLM write code, do analysis and automate boring stuff! huggingface.co/blog/smolage...
December 31, 2024 at 3:32 PM
the intersection between hardcore biotech, precision therapeutics, & AI is an interesting one.

i'm particularly bullish about the work being done with organoid intelligence - mainly because of more energy efficient computing as well as possible contributions to neurodegenerative disease research
December 13, 2024 at 4:21 AM
every PR is another step towards the dream of having a universal near zero latency jarvis+second brain ai mesh

seemed like a gargantuan task back when i discovered obsidian/roam in college but current tech & capabilities make it easier to bootstrap a decent representation of this.
December 13, 2024 at 4:03 AM
the smol.ai newsletter is truly a godsend (thanks @swyx.io)

with all of the model releases this week, neurips, and discord chats popping off, having a single place to start from really helps

highly recommend.
smol.ai
News and Hackathons for AI Engineers!
smol.ai
December 13, 2024 at 3:53 AM

when you've been on the internet as long as i have, you would understand that everything can be used for anything and leaves a trail.

the difference today is a lot more security theatre & forced transparency

it's why i've always written stuff envisioning that they'd be immortalized by a super AGI
November 27, 2024 at 3:27 AM
Reposted by rohit
annoy an ML engineer with these simple phrases:

"cosine distance"
"L2 similarity"
"but did you ship it?"
November 26, 2024 at 10:38 PM
Reposted by rohit
My deep learning course at the University of Geneva is available on-line. 1000+ slides, ~20h of screen-casts. Full of examples in PyTorch.

fleuret.org/dlc/

And my "Little Book of Deep Learning" is available as a phone-formatted pdf (nearing 700k downloads!)

fleuret.org/lbdl/
November 26, 2024 at 6:15 AM
wanna become a prolific open source contributor?

just try using open source llm agent frameworks with rough asynchronous programming patterns in prod
November 26, 2024 at 1:43 AM
Reposted by rohit
@georgehotz.bsky.social is here! Bluesky is going to be so much fun.
November 24, 2024 at 4:41 AM
deploying llm apps on modal labs at night is a much better experience than my daytime woes with terraform, cloud build, k8s, and docker
November 24, 2024 at 4:42 AM
lol nah I think i'll stick to modal
November 24, 2024 at 4:32 AM
Reposted by rohit
distributed systems man. why do we do this to ourselves
November 24, 2024 at 4:05 AM
the good thing about this new generation of voice assistants is that everyone can have their own hunter s thompson like narrators for the most mundane parts of their daily routines
November 23, 2024 at 6:57 PM
i finally asked chatgpt to generate an image given what it knew about me.

spot on
November 23, 2024 at 6:50 PM