Qwen3-Embedding-4B
The Qwen3 Embedding 4b model is a model from the Qwen family, specifically designed for text embedding tasks. The model inherits the multilingual capabilities skills of its foundational model.
Model ID
qwen3-embedding-4b
Source
Modality
- Input: text
- Output: embedding vector
Context limit
- Embedding size: Up to 2560, supports user-defined output dimensions ranging from 32 to 2560
- Maximum input size: 32000 tokens
Endpoints
Rate limits
This model has a rate limit multiplier of 0.1. The effective rate limit for the Free and Standard tier is 1,000,000 prompt tokens/minute. The effective monthly quota for the Free tier is 10,000,000 tokens/month.