Yichao Cai

蔡逸超

me.png

"A good half of the art of living is resilience." — Alain de Botton

I am a second-year PhD candidate at the Australian Institute for Machine Learning (AIML), University of Adelaide, advised by Prof. Javen Qinfeng Shi, Dr. Zhen Zhang, and Dr. Yuhang Liu. I received my M.S. and B.Eng. degrees in Instrument Science from Wuhan University of Technology.

My research lies at the intersection of representation learning, multimodal modeling, and latent variable modeling, where I aim to develop principled machine learning models that robustly infer and represent underlying semantic structure—even when data is noisy, biased, or incomplete. At the heart of my work is a central question: What makes a representation meaningful, controllable, and aligned with how humans understand the world? I believe building reliable and interpretable AI requires more than scale—it calls for deeper integration with human conceptual frameworks.

Looking ahead, I envision models that reason over structure, adapt across modalities, and remain accountable to the values encoded in the data they learn from.

news

Apr 16, 2025 Check out our new preprint—On the Value of Cross-Modal Misalignment in Multimodal Representation Learning !
Jul 02, 2024 Our work, CLAP: Isolating Content from Style through Contrastive Learning with Augmented Prompts, is accepted to appear at ECCV 2024.

latest posts

selected publications

  1. A Preprint
    LLM_LVM.png
    I Predict Therefore I Am: Is Next Token Prediction Enough to Learn Human-Interpretable Concepts from Data?
    Yuhang Liu, Dong Gong, Yichao Cai, Erdun Gao, Zhen Zhang, Biwei Huang, Mingming Gong, Anton Hengel, and Javen Qinfeng Shi
    arXiv e-prints, 2025
  2. A Preprint
    negate_or_embrace.png
    On the Value of Cross-Modal Misalignment in Multimodal Representation Learning
    Yichao Cai, Yuhang Liu, Erdun Gao, Tianjiao Jiang, Zhen Zhang, Anton Hengel, and Javen Qinfeng Shi
    2025
  3. ECCV 2024
    clap_preview.png
    CLAP: Isolating Content from Style through Contrastive Learning with Augmented Prompts
    Yichao Cai, Yuhang Liu, Zhen Zhang, and Javen Qinfeng Shi
    In Computer Vision - ECCV 2024: 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XXI, Milan, Italy, 2024