Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
Encoder[[cv-encoder]]
The Vision Transformer (ViT) opened the door to computer vision tasks without convolutions.