I am a Ph.D. student in Computer Science Program at King Abdullah University of Science and Technology (KAUST), under the supervision of Prof. Bernard Ghanem. Prior to that, I obtained my master degree from Shanghai Jiao Tong University, and bachelor degree from Xi’an JiaoTong University.

My research interests focus on

  • Multimodal Learning, such as video-language pre-training, video-language foundation models.
  • Long-Form Video Understanding, such as temporal action detection, action recognition, and video grounding.