DeepSeek-R1 & DeepSeek-R1-Zero: two 660B reasoning models are here, alongside 6 distilled dense models (based on Llama & Qwen) for the community!
huggingface.co/deepseek-ai
huggingface.co/deepseek-ai/...
* HF: huggingface.co/collections/...
* ModelScope: modelscope.cn/models/Qwen/...
* Kaggle: kaggle.com/models/qwen-...
* Demo: huggingface.co/spaces/Qwen/...