DeepSeek V3.2 Exp

An experimental model hosted in the US introducing DeepSeek Sparse Attention for training and inference efficiency in long-context scenarios.

Model Specifications

Context window 163,840 tokens
Max output 32,000 tokens
Knowledge cutoff September 2025
Multipart messages No
Vision capabilities No

Default Parameters

Temperature 0.3
Top P 1.0
Frequency penalty 0.0
Presence penalty 0.0
Top K 50

More models from DeepSeek