Well, technically, my first paper on Joint Embedding Architectures (AKA Siamese nets) is from NIPS 1994.
but that paper and many subsequent works use sample-contrastive learning and no predictor.
The JEPA idea trained non sample-contrastive losses (Barlow Twins, VICReg, MCR2) or
Joint Embedding Architectures and Non-Contrastive Learning Methods
By
–
Leave a Reply