merve
@merve.bsky.social
proud mediterranean 🧿 open-sourceress at hugging face 🤗 multimodality, zero-shot vision, vision language models, transformers
here's a good blog post on the successful DSE model MCDSE, covering compression and more huggingface.co/blog/marco/a...
Visually Multilingual: Introducing mcdse-2b
A Blog post by Marco Cimolai on Hugging Face
huggingface.co
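For intuition on the compression angle, here is a minimal numpy sketch of binary quantization, one common trick for shrinking DSE-style retrieval embeddings; the array shapes are illustrative stand-ins, not actual mcdse-2b outputs.

# Minimal sketch of binary quantization for embedding compression.
# The embeddings below are random stand-ins, not mcdse-2b outputs.
import numpy as np

rng = np.random.default_rng(0)
doc_embeddings = rng.standard_normal((1000, 1536)).astype(np.float32)

# Keep only the sign of each dimension and pack 8 dims per byte:
# 1536 float32 values (6144 bytes) become 192 bytes per document.
binary = np.packbits((doc_embeddings > 0).astype(np.uint8), axis=1)

# At query time, score with Hamming distance (fewer differing bits
# means more similar), e.g. against the first document:
query = binary[0]
hamming = np.unpackbits(binary ^ query, axis=1).sum(axis=1)
print(binary.shape, hamming[:5])  # (1000, 192); doc 0 has distance 0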
April 15, 2025 at 4:27 PM
the model also has impressive OCR capabilities ⬇️
April 11, 2025 at 7:10 PM
we'll give this model a test on agentic capabilities, but here's an example from the paper:
April 11, 2025 at 7:09 PM
This model consists of a MoonViT encoder with dynamic resolution handling, a projection layer, and a 16B MoE decoder (with 2.8B active params)
the paper introduces an interesting pre-training pipeline for handling long context, and the model saw 4.4T tokens arxiv.org/pdf/2504.07491
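For intuition on the "16B total, 2.8B active" split, here is a hedged top-k MoE routing sketch in PyTorch; the dimensions and expert count are made up for illustration, not this model's actual config.

# Sketch of top-k expert routing: each token only activates k of
# the experts, so only a fraction of the parameters runs per token.
import torch
import torch.nn as nn

class TopKMoE(nn.Module):
    def __init__(self, dim=512, n_experts=8, k=2):
        super().__init__()
        self.k = k
        self.router = nn.Linear(dim, n_experts)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(dim, 4 * dim), nn.GELU(), nn.Linear(4 * dim, dim))
            for _ in range(n_experts)
        )

    def forward(self, x):  # x: (tokens, dim)
        # Route each token to its k highest-scoring experts.
        weights, idx = self.router(x).softmax(-1).topk(self.k, dim=-1)
        out = torch.zeros_like(x)
        for token, (w, e) in enumerate(zip(weights, idx)):
            for wi, ei in zip(w, e):
                out[token] += wi * self.experts[ei](x[token])
        return out

x = torch.randn(4, 512)
print(TopKMoE()(x).shape)  # torch.Size([4, 512])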
April 11, 2025 at 7:08 PM
Reposted by merve
Smol but mighty:
• 256M delivers 80% of the performance of our 2.2B model.
• 500M hits 90%.
Both beat our SOTA 80B model from 17 months ago! 🎉
Efficiency 🤝 Performance
Explore the collection here: huggingface.co/collections/...
Blog: huggingface.co/blog/smolervlm
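If you want to try the smallest one, here is a minimal transformers sketch, assuming the HuggingFaceTB/SmolVLM-256M-Instruct repo id; double-check the exact ids against the collection linked above.

# Quick sketch of running SmolVLM-256M on a local image.
from transformers import AutoProcessor, AutoModelForVision2Seq
from PIL import Image

model_id = "HuggingFaceTB/SmolVLM-256M-Instruct"  # assumed repo id
processor = AutoProcessor.from_pretrained(model_id)
model = AutoModelForVision2Seq.from_pretrained(model_id)

image = Image.open("example.jpg")  # any local image
messages = [{"role": "user", "content": [
    {"type": "image"},
    {"type": "text", "text": "Describe this image briefly."},
]}]
prompt = processor.apply_chat_template(messages, add_generation_prompt=True)
inputs = processor(text=prompt, images=[image], return_tensors="pt")
out = model.generate(**inputs, max_new_tokens=64)
print(processor.batch_decode(out, skip_special_tokens=True)[0])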
January 23, 2025 at 1:33 PM
Learn more from their blog post here huggingface.co/blog/vdr-2b-... 📖
Visual Document Retrieval Goes Multilingual
huggingface.co
January 13, 2025 at 11:12 AM