In a paper by Chuang et al. 2020 (Debiased Contrastive Learning), it has been shown that false negative samples can deteriorate the performance of contrastive learning. Are you aware of a CUT implementation which uses the labels from a semantic segmentation to guide the patch sampling?