DeepSeek V3.1 Terminus

A large hybrid reasoning model (671B parameters, 37B active) that supports both thinking and non-thinking modes. USA hosted, it improves tool use, code generation, and reasoning efficiency.

Model Specifications

Context window 163,840 tokens
Max output 32,000 tokens
Knowledge cutoff September 2025
Multipart messages No
Vision capabilities No

Default Parameters

Temperature 0.3
Top P 1.0
Frequency penalty 0.0
Presence penalty 0.0
Top K 50

Supported Parameters

Temperature

Supported

Controls randomness: Lower values make output more deterministic, higher values more creative.

Range: 0.0 - 2.0 Default: 0.3

More models from DeepSeek