No that I am aware (but I also haven't looked too deeply). I would probably use the original ViT in Torchvision as a baseline: https://
pytorch.org/vision/main/_m
odules/torchvision/models/vision_transformer.html#vit_b_16
… I have a companion article here showing how to accelerate the training on that model:
Using PyTorch Vision ViT as Baseline for Model Training
By
–
Leave a Reply