Version: 1.32

Qwen3-Embedding-4B

Qwen3-Embedding-4B is a model from the Qwen family designed specifically for text embedding tasks. It inherits the multilingual capabilities of its foundation model.

Model ID

qwen3-embedding-4b

Source

Hugging Face

Modality

  • Input: text
  • Output: embedding vector

Context limit

  • Embedding size: up to 2560; user-defined output dimensions from 32 to 2560 are supported
  • Maximum input size: 32,000 tokens
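
As a minimal sketch of how a client might check the output dimension against the range above before sending a request, the helper below validates the value and assembles a payload. The payload field names (`model`, `input`, `dimensions`) follow the common OpenAI-style embeddings convention and are an assumption, not something this page confirms; `build_embedding_request` is a hypothetical helper name.

```python
MODEL_ID = "qwen3-embedding-4b"
MIN_DIMENSIONS = 32      # smallest user-defined output dimension
MAX_DIMENSIONS = 2560    # full embedding size


def build_embedding_request(text, dimensions=MAX_DIMENSIONS):
    """Validate the output dimension and return a request payload dict.

    Hypothetical helper: the field names below are assumed, not documented here.
    """
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(dimensions, bool) or not isinstance(dimensions, int):
        raise TypeError("dimensions must be an integer")
    if not MIN_DIMENSIONS <= dimensions <= MAX_DIMENSIONS:
        raise ValueError(
            f"dimensions must be between {MIN_DIMENSIONS} and {MAX_DIMENSIONS}"
        )
    return {"model": MODEL_ID, "input": text, "dimensions": dimensions}
```

For example, `build_embedding_request("hello world", dimensions=256)` returns a payload asking for a 256-dimensional vector, while a value such as 16 raises `ValueError` before any network call is made.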

Endpoints

Rate limits

This model has a rate limit multiplier of 0.1. The effective rate limit for the Free and Standard tiers is 1,000,000 prompt tokens/minute. The effective monthly quota for the Free tier is 10,000,000 tokens/month.
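
To make these figures concrete, the sketch below does the arithmetic a client might use for rough capacity planning. It assumes the limits are counted purely in prompt tokens and that no separate per-request cap applies, which this page does not state; the function names are illustrative.

```python
TOKENS_PER_MINUTE = 1_000_000       # effective rate limit, Free and Standard tiers
FREE_TOKENS_PER_MONTH = 10_000_000  # effective monthly quota, Free tier
MAX_INPUT_TOKENS = 32_000           # maximum input size per request


def max_requests_per_minute(tokens_per_request):
    """Whole requests of a given size that fit in one minute's token budget."""
    if not 0 < tokens_per_request <= MAX_INPUT_TOKENS:
        raise ValueError(f"tokens_per_request must be in 1..{MAX_INPUT_TOKENS}")
    return TOKENS_PER_MINUTE // tokens_per_request


def minutes_to_exhaust_free_quota(tokens_per_minute=TOKENS_PER_MINUTE):
    """Minutes of sustained traffic before the Free tier monthly quota runs out."""
    if tokens_per_minute <= 0:
        raise ValueError("tokens_per_minute must be positive")
    return FREE_TOKENS_PER_MONTH / tokens_per_minute
```

Under these assumptions, 31 maximum-size requests (32,000 tokens each) fit in one minute, and a Free tier client running at the full rate limit would use its monthly quota in 10 minutes.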