Yichao Cai
蔡逸超

"A good half of the art of living is resilience." — Alain de Botton
I am a second-year PhD candidate at the Australian Institute for Machine Learning (AIML), University of Adelaide, advised by Prof. Javen Qinfeng Shi, Dr. Zhen Zhang, and Dr. Yuhang Liu. I received my M.S. and B.Eng. degrees in Instrument Science from Wuhan University of Technology.
My research lies at the intersection of representation learning, multimodal modeling, and latent variable modeling, where I aim to develop principled machine learning models that robustly infer and represent underlying semantic structure—even when data is noisy, biased, or incomplete. At the heart of my work is a central question: What makes a representation meaningful, controllable, and aligned with how humans understand the world? I believe building reliable and interpretable AI requires more than scale—it calls for deeper integration with human conceptual frameworks.
Looking ahead, I envision models that reason over structure, adapt across modalities, and remain accountable to the values encoded in the data they learn from.
news
Apr 16, 2025 | Check out our new preprint, On the Value of Cross-Modal Misalignment in Multimodal Representation Learning! |
---|---|
Jul 02, 2024 | Our work, CLAP: Isolating Content from Style through Contrastive Learning with Augmented Prompts, has been accepted to ECCV 2024. |
latest posts
Feb 23, 2025 | The Generalization–Specialization Dilemma |
---|---|
Jan 08, 2025 | Language and the Art of Modeling the World |
May 25, 2024 | Three Weekly Self-Introspections |