DeepSeek 部署

更新时间：2026年6月9日 16:45 浏览：1640

DeepSeek 主要模型数据

https://modelscope.cn/organization/deepseek-ai

模型名	模型大小	显卡要求
DeepSeek-R1-Distill-Qwen-1.5B	3.4G	RTX 4090 24G x 1张
DeepSeek-R1-Distill-Qwen-7B	15G	RTX 4090 24G x 1张
DeepSeek-R1-Distill-Qwen-14B	28G	RTX 4090 24G x 2张
DeepSeek-R1-Distill-Qwen-32B	62G	A100/A800/H100/H800 80G x 1张或 RTX 4090 24G x 4张
DeepSeek-R1-Distill-Llama-70B	132G	A100/A800/H100/H800 80G x 2张或 RTX 4090 24G x 8张
DeepSeek-R1 满血版 671B	642G	A100/A800/H100/H800 80G x 8张 x 2台或 H200 141G x 8 张
DeepSeek-V3 满血版 671B	642G	A100/A800/H100/H800 80G x 8张 x 2台或 H200 141G x 8 张
DeepSeek-V3 满血版 671B BF16	1.3T	A100/A800/H100/H800 80G x 8张 x 4台
DeepSeek-R1 满血版 671B Q4量化	377G	A100/A800/H100/H800 80G x 8张

模型下载地址

DeepSeek-R1-Distill-Qwen-1.5B（3.4G）
https://modelscope.cn/models/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
DeepSeek-R1-Distill-Qwen-7B（15G）
https://modelscope.cn/models/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
DeepSeek-R1-Distill-Qwen-14B（28G）
https://modelscope.cn/models/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
DeepSeek-R1-Distill-Qwen-32B（62G）
https://modelscope.cn/models/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
DeepSeek-R1-Distill-Llama-70B（132G）
https://modelscope.cn/models/deepseek-ai/DeepSeek-R1-Distill-Llama-70B
DeepSeek-R1 满血版 671B（642G）
https://modelscope.cn/models/deepseek-ai/DeepSeek-R1
DeepSeek-V3 满血版 671B（642G）
https://modelscope.cn/models/deepseek-ai/DeepSeek-V3
DeepSeek-V3 满血版 671B BF16 （1.3T）
https://modelscope.cn/unsloth/DeepSeek-R1-GGUF
DeepSeek-R1 满血版 671B Q4量化（377G）
https://modelscope.cn/unsloth/DeepSeek-R1-GGUF

分布式部署

DeepSeek V3/R1 采用多头注意力（128头）显卡张数需要能被 128 整除