f6867e65bd
Z.ai's current GLM-5.2 coding-agent docs confirm actual thinking efforts high and max, with ultracode mapped to max. Its Chat Completion API documents thinking.type, thinking.clear_thinking, and reasoning_content rather than a raw reasoning_effort scalar, so direct Z.ai requests now use the documented thinking object and preserve reasoning_content for coding-agent continuity. Also accepts ultracode as a max-effort alias across settings, routing, and Responses mapping. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>