vllm.model_executor.models.skyworkr1v ¶
SkyworkR1VImageEmbeddingInputs ¶
Bases: TensorSchema
Dimensions
- ni: Number of images
- ifs: Image feature size
- hs: Hidden size (must match the hidden size of language model backbone)
Source code in vllm/model_executor/models/skyworkr1v.py
SkyworkR1VImagePixelInputs ¶
Bases: TensorSchema
Dimensions
- bnp: Batch size * number of images * (1 + num_patches)
- c: Number of channels (3)
- h: Height
- w: Width
- bn: Batch size * number of images