deep-diver.bsky.social
@deep-diver.bsky.social
A simple summary of DeepSeek-R1

RL is key
↳ but RL alone struggles to make the model helpful.
↳ 4-stage pipeline (good start + reasoning RL + SFT + safety RL) = o1-level performance.
↳ Distilling R1-Zero outputs = o1-mini level.

Model: huggingface.co/deepseek-ai
Paper: github.com/deepseek-ai/...
January 21, 2025 at 1:03 PM
Google's Vertex AI RAG Engine

Google has launched a RAG-specific service called "Vertex AI RAG Engine." It provides infrastructure for RAG on Google Cloud Platform, along with easy-to-use supporting libraries.

developers.googleblog.com/en/vertex-ai...
Vertex AI RAG Engine: A developers tool
Build robust and grounded generative AI applications with Vertex AI RAG Engine, reducing hallucinations and enhancing accuracy.
January 20, 2025 at 1:29 AM
updates on ai-paper-reviewer!

core
✦ support for the open-source layout parsing model from
@OpenDataLab_AI

✦ scraping papers from
@openreviewnet

blog
✦ papers are displayed by the date they were added to
@huggingface
Daily Papers. Only the 3 most recent days are kept active; older ones are archived

link 👇
January 17, 2025 at 8:12 AM
I am Chansung. I love collaborating with others to build cool AI projects and write papers

Recent ones 👇
1. Paper on @arxiv

LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs

2. OSS
AI Paper Reviewer: generates text and podcasts from papers

Find links below 🔗
November 20, 2024 at 12:30 PM