RL is key
↳ but hard to make it helpful alone.
↳ 4 stage pipeline (good start + reasoning RL + SFT + safety RL) = o1 level performance.
↳ Distilling R1-Zero outputs = o1-mini level.
Model: huggingface.co/deepseek-ai
Paper: github.com/deepseek-ai/...
RL is key
↳ but hard to make it helpful alone.
↳ 4 stage pipeline (good start + reasoning RL + SFT + safety RL) = o1 level performance.
↳ Distilling R1-Zero outputs = o1-mini level.
Model: huggingface.co/deepseek-ai
Paper: github.com/deepseek-ai/...
Google launched a RAG-specific service called "Vertex AI RAG Engine." It can be understood as providing infrastructure for RAG on the Google Cloud Platform and supporting libraries that can be easily utilized.
developers.googleblog.com/en/vertex-ai...
Google launched a RAG-specific service called "Vertex AI RAG Engine." It can be understood as providing infrastructure for RAG on the Google Cloud Platform and supporting libraries that can be easily utilized.
developers.googleblog.com/en/vertex-ai...
core
✦ supporting open source Layout Parsing model from
@OpenDataLab_AI
✦ scrapping papers from
@openreviewnet
blog
✦ display papers by the dates added in
@huggingface
Daily Papers. Up to 3 latest days are managed, then archived
link 👇
core
✦ supporting open source Layout Parsing model from
@OpenDataLab_AI
✦ scrapping papers from
@openreviewnet
blog
✦ display papers by the dates added in
@huggingface
Daily Papers. Up to 3 latest days are managed, then archived
link 👇
Recent ones 👇
1. Paper on @arxiv
LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs
2. OSS
AI Paper Reviewer: gen text and poscast of papers
Find links below 🔗
Recent ones 👇
1. Paper on @arxiv
LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs
2. OSS
AI Paper Reviewer: gen text and poscast of papers
Find links below 🔗