Disambiguation evidence for ViT via surface form

"Vision Transformer"


As object (1)

Triples in which another entity is the subject and the object is the literal "Vision Transformer", which refers to this entity.

CLIP imageEncoderType
"Vision Transformer"
↳ resolves to ViT
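
The resolution step shown above can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced this page: the Triple type, the SURFACE_FORMS index, and the normalize, resolve, and as_object_evidence helpers are all hypothetical names, and the case- and whitespace-insensitive matching is an assumption. The only data taken from this page is the CLIP triple and its resolution to ViT.

```python
from typing import NamedTuple, Optional


class Triple(NamedTuple):
    subject: str
    predicate: str
    obj: str  # object literal, kept exactly as written in the source


# Hypothetical surface-form index: normalized literal -> canonical entity.
SURFACE_FORMS = {"vision transformer": "ViT"}


def normalize(literal: str) -> str:
    """Case-fold and collapse whitespace (an assumed matching rule)."""
    return " ".join(literal.casefold().split())


def resolve(literal: str) -> Optional[str]:
    """Return the entity a literal resolves to, or None if it is unknown."""
    return SURFACE_FORMS.get(normalize(literal))


def as_object_evidence(triples, entity, surface_form):
    """Collect triples from other subjects whose object literal matches
    the surface form and resolves to the given entity."""
    target = normalize(surface_form)
    return [
        t
        for t in triples
        if t.subject != entity
        and normalize(t.obj) == target
        and resolve(t.obj) == entity
    ]


triples = [
    # The triple listed on this page.
    Triple("CLIP", "imageEncoderType", "Vision Transformer"),
    # Made-up triple, included only to show a literal that does not match.
    Triple("ExampleModel", "encoderType", "Transformer"),
]

evidence = as_object_evidence(triples, "ViT", "Vision Transformer")
for t in evidence:
    print(f'{t.subject} {t.predicate} "{t.obj}" resolves to {resolve(t.obj)}')
```

Under these assumptions, only the CLIP triple is returned, which matches the count of 1 in the "As object" heading.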