Florent Daudens
@fdaudens.bsky.social
Passionate about AI & Journalism / Previously @hf.co @radiocanadainfo @ledevoir & others
AudioRAG is becoming real! Just built a demo with ColQwen-Omni that does semantic search on raw audio, no transcription needed.
What’s exciting: it skips transcription, making it faster and better at capturing emotion, ambient sound, and tone, surfacing results text search would miss.
What’s exciting: it skips transcription, making it faster and better at capturing emotion, ambient sound, and tone, surfacing results text search would miss.
July 18, 2025 at 3:10 PM
AudioRAG is becoming real! Just built a demo with ColQwen-Omni that does semantic search on raw audio, no transcription needed.
What’s exciting: it skips transcription, making it faster and better at capturing emotion, ambient sound, and tone, surfacing results text search would miss.
What’s exciting: it skips transcription, making it faster and better at capturing emotion, ambient sound, and tone, surfacing results text search would miss.
Three big AI copyright updates this week alone. Tracking it all is getting almost impossible!
That’s why @brigittetousi.hf.co and I built this interactive tracker to keep you up to date: huggingface.co/spaces/fdaud...
(Prototyped in minutes with DeepSite!)
That’s why @brigittetousi.hf.co and I built this interactive tracker to keep you up to date: huggingface.co/spaces/fdaud...
(Prototyped in minutes with DeepSite!)
June 27, 2025 at 6:31 PM
Three big AI copyright updates this week alone. Tracking it all is getting almost impossible!
That’s why @brigittetousi.hf.co and I built this interactive tracker to keep you up to date: huggingface.co/spaces/fdaud...
(Prototyped in minutes with DeepSite!)
That’s why @brigittetousi.hf.co and I built this interactive tracker to keep you up to date: huggingface.co/spaces/fdaud...
(Prototyped in minutes with DeepSite!)
Gemma 3n just dropped - a natively multimodal model that runs entirely on your device. No cloud. No API calls.
🧠 Text, image, audio, and video
⚡️Only needs 2B in GPU memory to run
🤯 First sub-10B model to hit 1300+ Elo
✅ Plug-and-play with Hugging Face, MLX, llama.cpp...
🧠 Text, image, audio, and video
⚡️Only needs 2B in GPU memory to run
🤯 First sub-10B model to hit 1300+ Elo
✅ Plug-and-play with Hugging Face, MLX, llama.cpp...
June 26, 2025 at 6:33 PM
Gemma 3n just dropped - a natively multimodal model that runs entirely on your device. No cloud. No API calls.
🧠 Text, image, audio, and video
⚡️Only needs 2B in GPU memory to run
🤯 First sub-10B model to hit 1300+ Elo
✅ Plug-and-play with Hugging Face, MLX, llama.cpp...
🧠 Text, image, audio, and video
⚡️Only needs 2B in GPU memory to run
🤯 First sub-10B model to hit 1300+ Elo
✅ Plug-and-play with Hugging Face, MLX, llama.cpp...
ASMR Shiba has something to say 🐾
June 24, 2025 at 12:47 PM
ASMR Shiba has something to say 🐾
🎵 Dream come true for content creators! TIGER AI can extract voice, effects & music from ANY audio file 🤯
This lightweight model uses frequency band-split technology to separate speech like magic. Kudos to @fffiloni.bsky.social for the amazing demo!
This lightweight model uses frequency band-split technology to separate speech like magic. Kudos to @fffiloni.bsky.social for the amazing demo!
May 27, 2025 at 9:33 PM
🎵 Dream come true for content creators! TIGER AI can extract voice, effects & music from ANY audio file 🤯
This lightweight model uses frequency band-split technology to separate speech like magic. Kudos to @fffiloni.bsky.social for the amazing demo!
This lightweight model uses frequency band-split technology to separate speech like magic. Kudos to @fffiloni.bsky.social for the amazing demo!