Code for the paper "On the Value of Cross-Modal Misalignment in Multimodal Representation Learning".
Code for the paper "CLAP: Isolating Content from Style through Contrastive Learning with Augmented Prompts".
PyTorch implementations of several classical CNN backbones.