Introducing DeepSeek-V3.2-Exp
Introducing DeepSeek-V3.2-Exp, our latest experimental model!
Built on V3.1-Terminus, it debuts DeepSeek Sparse Attention (DSA) for faster, more efficient training & inference on long context.
Now live on App, Web, and API
API prices cut by 50%+!

Efficiency Gains
DSA achieves fine-grained sparse attention with minimal impact on output quality, boosting long-context performance & reducing compute cost.
Benchmarks show V3.2-Exp performs on par with V3.1-Terminus.
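
To make the idea of sparse attention concrete, here is a minimal sketch of a generic top-k scheme in pure Python, where each query attends only to its k highest-scoring keys. This is an illustration under assumptions, not DeepSeek's implementation: the function names, the scoring rule, and the selection step are all hypothetical, and the tech report linked in this post describes the actual DSA design.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def softmax(scores):
    # Subtract the max for numerical stability.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def topk_sparse_attention(query, keys, values, k):
    """Attend only to the k keys with the highest scaled dot-product score.

    Hypothetical helper for illustration; not the DSA algorithm itself.
    Returns the attention output and the sorted indices that were kept.
    """
    scale = 1.0 / math.sqrt(len(query))
    # For simplicity every key is scored here; a practical system would
    # need a cheaper selection step for the sparsity to save compute.
    scores = [dot(query, key) * scale for key in keys]
    k = min(k, len(keys))
    selected = sorted(range(len(keys)), key=lambda i: scores[i], reverse=True)[:k]
    # Softmax and value aggregation run over the selected positions only.
    weights = softmax([scores[i] for i in selected])
    out = [0.0] * len(values[0])
    for w, i in zip(weights, selected):
        for d, v in enumerate(values[i]):
            out[d] += w * v
    return out, sorted(selected)

if __name__ == "__main__":
    q = [1.0, 0.0]
    ks = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0], [2.0, 0.0]]
    vs = [[1.0, 0.0], [0.0, 1.0], [5.0, 5.0], [0.0, 2.0]]
    output, kept = topk_sparse_attention(q, ks, vs, k=2)
    print(kept, output)
```

With k equal to the number of keys, the sketch reduces to ordinary dense attention, which makes it easy to compare sparse and dense outputs on the same inputs.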


API Update
Lower costs, same access!
DeepSeek API prices drop 50%+, effective immediately.
For comparison testing, V3.1-Terminus remains available via a temporary API until Oct 15th, 2025, 15:59 (UTC). Details: https://api-docs.deepseek.com/guides/comparison_testing
Feedback welcome: https://feedback.deepseek.com/dsa
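
If you script comparison runs against the temporary V3.1-Terminus API, a small guard on the stated cutoff can save failed requests. The sketch below only encodes the date from this post; the function name is hypothetical, and treating 15:59 UTC itself as still available is an assumption. It does not call the API, so see the comparison testing guide for the actual endpoint details.

```python
from datetime import datetime, timedelta, timezone

# Cutoff stated in the announcement: Oct 15th, 2025, 15:59 UTC.
TERMINUS_CUTOFF = datetime(2025, 10, 15, 15, 59, tzinfo=timezone.utc)

def terminus_comparison_available(now=None):
    """Return True while the temporary V3.1-Terminus API is scheduled to be up.

    Hypothetical helper. `now` must be a timezone-aware datetime; it
    defaults to the current time in UTC. The cutoff minute is treated
    as inclusive, which is an assumption.
    """
    if now is None:
        now = datetime.now(timezone.utc)
    return now <= TERMINUS_CUTOFF

if __name__ == "__main__":
    # Aware datetimes in other zones compare correctly: 23:58 at UTC+8
    # is 15:58 UTC, one minute before the cutoff.
    utc_plus_8 = timezone(timedelta(hours=8))
    print(terminus_comparison_available(datetime(2025, 10, 15, 23, 58, tzinfo=utc_plus_8)))
    print(terminus_comparison_available())
```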

Open Source Release
Model: https://huggingface.co/deepseek-ai/DeepSeek-V3.2-Exp
Tech report: https://github.com/deepseek-ai/DeepSeek-V3.2-Exp/blob/main/DeepSeek_V3_2.pdf
Key GPU kernels in TileLang & CUDA (use TileLang for rapid research prototyping!)