GLM-5V-Turbo is Z.ai’s first native multimodal agent foundation model, built for vision-based coding and agent-driven tasks. It natively handles image, video, and text inputs, excels at long-horizon planning, and performs complex coding tasks.
Vision | Reasoning | Proprietary Model
Knowledge Cutoff
Unknown
Input → Output Format
Context Memory
203K in, 131K out
Source: Official Docs