Yichao Cai
Understanding the structure and identifiability of learned representations.
yichao.cai@adelaide.edu.au
I am a PhD student in Computer Science at the Australian Institute for Machine Learning (AIML), Adelaide University, advised by Prof. Javen Qinfeng Shi.
My research examines how supervision—particularly language supervision—shapes learned representations and under what conditions the resulting representations identify latent structure. In particular, I investigate the equivalence classes of representations induced by learning objectives, how cross-modal supervision shapes the geometry of vision-language models, and how learned representations relate to human-interpretable concepts.
I received my M.Sc. and B.Eng. degrees from Wuhan University of Technology and spent five months as a visiting student researcher at California PATH, UC Berkeley.
News
| Jun 12, 2026 | New essay: The Coverage Lock—why scaling cannot teach a multimodal model what its training questions never ask about. |
|---|---|
| May 01, 2026 | We had 3 papers on representation learning (contrastive learning theory, AI4Science, and graphical modeling) accepted to ICML 2026. |
| Feb 10, 2026 | I attended MLSS Melbourne 2026 and enjoyed learning from world-class speakers and connecting with the community. |
| Jan 28, 2026 | Check out our new preprint: The Geometric Mechanics of Contrastive Representation Learning. |
| Oct 15, 2025 | I served as a guest lecturer in Statistical Machine Learning and presented recent advances in vision-language modeling. Slides. |
| Sep 19, 2025 | Our work On the Value of Cross-Modal Misalignment in Multimodal Representation Learning was selected as a Spotlight at NeurIPS 2025. |
| Apr 14, 2025 | We released the preprint On the Value of Cross-Modal Misalignment in Multimodal Representation Learning. |
| Jul 02, 2024 | Our work CLAP: Isolating Content from Style through Contrastive Learning with Augmented Prompts was accepted at ECCV 2024. |
Research
Selected publications are highlighted.