LLM | 2026/06/20 08:46 UTC

DeepSeek V4 architecture: Sparse attention cuts million-token inference costs by 73%

Source: Tech Times

DeepSeek V4 uses sparse attention to cut inference costs by 73% at million-token context lengths. However, an evaluation by NIST, a US government agency, found that it lags frontier US models by eight months on cross-domain reasoning, suggesting that Chinese AI models lead in efficiency but still trail in overall capability. Reported by Tech Times on June 20.

Read the original article:

https://www.techtimes.com/articles/318725/20260619/deepseek-v4-architecture-how-sparse-attention-cuts-inference-costs-what-nist-found.htm

#DeepSeek #SparseAttention #Inference #NIST
