vllm.entrypoints.pooling.embed.protocol ¶
Embedding API protocol models for OpenAI and Cohere formats.
OpenAI: https://platform.openai.com/docs/api-reference/embeddings Cohere: https://docs.cohere.com/reference/embed
_encode_base64_embeddings ¶
Encode float embeddings as base64 (little-endian float32).
Source code in vllm/entrypoints/pooling/embed/protocol.py
_pack_binary_embeddings ¶
Bit-pack float embeddings: positive -> 1, negative -> 0.
Each bit is shifted left by 7 - idx%8, and every 8 bits are packed into one byte.
Source code in vllm/entrypoints/pooling/embed/protocol.py
build_typed_embeddings ¶
build_typed_embeddings(
float_embeddings: list[list[float]],
embedding_types: Sequence[str],
) -> CohereEmbedByTypeEmbeddings
Convert float embeddings to all requested Cohere embedding types.