DeepSeek V3.2 Exp

An experimental model hosted in the US introducing DeepSeek Sparse Attention for training and inference efficiency in long-context scenarios.

Model Specifications

Context window 163,840 tokens
Max output 32,000 tokens
Knowledge cutoff September 2025
Multipart messages No
Vision capabilities No

Default Parameters

Temperature 0.3
Top P 1.0
Frequency penalty 0.0
Presence penalty 0.0
Top K 50

Supported Parameters

Temperature

Supported

Controls randomness: Lower values make output more deterministic, higher values more creative.

Range: 0.0 - 2.0 Default: 0.3

More models from DeepSeek