Ken Chen
Contact details
Thesis title
Large Foundation Models for Multi-modal Generation and Prediction
Research overview
Large foundation models and generative AI mark a paradigm shift in machine learning: they generalize well enough to model complex data distributions. This research applies these capabilities to multi-modal generation and prediction, arguing that integrating visual, textual, and temporal modalities is essential to capture nuanced dynamics that unimodal approaches miss. The methods are applied to conditional video generation, time-series forecasting, and agentic AI to demonstrate how cross-modal synergy improves performance.
Research group
Supervisors
Dr Richard Wang
Qualifications
B.Eng. Coastal Engineering, Zhejiang University, China (2019)
M.Eng. Energy Engineering, Zhejiang University, China (2022)