Code for BeVIT
The idea is to investigate if it is possible for a network to learn a functional bregman divergence which can be used as a distance measure for self supervised learning.
Relevant papers:
MoCo-v3: https://arxiv.org/abs/2104.02057 Deep Divergence Learning: https://arxiv.org/abs/2005.02612