I am a Ph.D. student in the Computer Science program at King Abdullah University of Science and Technology (KAUST), under the supervision of Prof. Bernard Ghanem. Prior to that, I obtained my master's degree from Shanghai Jiao Tong University and my bachelor's degree from Xi'an JiaoTong University.
My research focuses on long-form video understanding, including temporal action detection, action recognition, and video language grounding.
News
- 2024.07: One co-authored paper is accepted by ECCV 2024.
- 2024.06: Using OpenTAD, we rank 1st in the Action Recognition, Action Detection, and Audio-Based Interaction Detection tasks of the EPIC-KITCHENS-100 2024 Challenge, and 1st in the Moment Queries task of the Ego4D 2024 Challenge!
- 2024.06: I am awarded the Dean's List Award for 2024.
- 2024.05: We release OpenTAD, which is currently the largest TAD codebase.
- 2024.02: Two papers are accepted by CVPR 2024.
- 2024.01: One paper is accepted by ICLR 2024.
- 2023.02: One paper is accepted by CVPR 2023 and one paper is accepted by CVPRW 2023.
Publications
OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection
Shuming Liu, Chen Zhao, Bernard Ghanem, et al. [Code]
Harnessing Temporal Causality for Advanced Temporal Action Detection
Shuming Liu, Lin Sui, Chen-Lin Zhang, Fangzhou Mu, Chen Zhao, Bernard Ghanem [Code]
End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames
Shuming Liu, Chen-Lin Zhang, Chen Zhao, Bernard Ghanem [Code]
ETAD: Training Action Detection End to End on a Laptop
Shuming Liu, Mengmeng Xu, Chen Zhao, Xu Zhao, Bernard Ghanem [Code]
- ECCV 2024
  ColorMAE: Exploring data-independent masking strategies in Masked AutoEncoders, Carlos Hinojosa, Shuming Liu, Bernard Ghanem
- CVPR 2024
  Dr2Net: Dynamic Reversible Dual-Residual Networks for Memory-Efficient Finetuning, Chen Zhao, Shuming Liu, Karttikeya Mangalam, Guocheng Qian, Fatimah Zohra, Abdulmohsen Alghannam, Jitendra Malik, Bernard Ghanem [Code]
- CVPRW 2024
  Look, Listen, and Attack: Backdoor Attacks Against Video Action Recognition, Hasan Hammoud, Shuming Liu, Mohammed Alkhrashi, Fahad AlBalawi, Bernard Ghanem
- ICLR 2024
  Boundary-Denoising for Video Activity Localization, Mengmeng Xu, Mattia Soldan, Jialin Gao, Shuming Liu, Juan-Manuel Perez-Rua, Bernard Ghanem [Code]
- CVPR 2023
  Re2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization, Chen Zhao, Shuming Liu, Karttikeya Mangalam, Bernard Ghanem [Code]
- TMM 2020
  Transferable Knowledge Based Multi-Granularity Fusion Network for Weakly Supervised Temporal Action Detection, Haisheng Su, Xu Zhao, Tianwei Lin, Shuming Liu, Zhilan Hu
- ACCV 2020
  TSI: Temporal Scale Invariant Network for Action Proposal Generation, Shuming Liu, Xu Zhao, Haisheng Su, Zhilan Hu [Code]
Education
- 2021.09 - now, Ph.D., King Abdullah University of Science and Technology (KAUST), Saudi Arabia.
- 2018.09 - 2021.04, Master, Shanghai Jiao Tong University (SJTU), China.
- 2014.09 - 2018.06, Bachelor, Xi'an JiaoTong University (XJTU), China.
Honors and Awards
- 2024.06 Dean's List Award of KAUST (20%)
- 2021.03 Outstanding Graduate of SJTU
- 2019.12 Scholarship of SJTU (5%)
- 2018.06 Outstanding Undergraduate of XJTU
- 2017.12 Scholarship of XJTU (5%)
Service
Conference Reviewer: CVPR 2022/2023/2024, ICCV 2023, ECCV 2022/2024, ICLR 2023, AAAI 2023.
Journal Reviewer: TPAMI, IJCV, TIP, TMM.
Teaching Assistant: Introduction to Computer Vision (KAUST), Computer Vision (SJTU).