Vision API (Legacy)
Standalone Grok Vision API — use Chat API or Gateway API instead
Legacy Endpoint
This standalone Vision API is legacy. We recommend using vision through the main APIs instead:
- Gateway API — Send images via OpenAI multimodal format across GPT, Claude, Gemini, Grok
- Chat API — Image analysis in chat, battle, and fight modes
The standalone /api/v1/vision endpoint remains available for Grok-specific image analysis with base64 support, but will not receive new features.
Analyze Image (URL)
POST /api/v1/vision
Response
Analyze Image (Base64)
POST /api/v1/vision
Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
model | string | Yes | "grok" or "grok-4-1-fast" |
image | object | Yes | Image object |
image.type | string | Yes | "image_url" or "image_base64" |
image.image_url | string | Conditional | URL of the image |
image.image_base64.data | string | Conditional | Base64-encoded image data |
image.image_base64.media_type | string | Conditional | MIME type (image/png, image/jpeg, image/gif, image/webp) |
prompt | string | Yes | Question about the image |
stream | boolean | No | Enable streaming (default: false) |
List Vision Models
GET /api/v1/vision
Returns available vision models.