Delong Chen

Delong Chen

Ph.D. Candidate at HKUST


Delong Chen (陈德龙) is a third-year Ph.D. student at the Hong Kong University of Science and Technology (HKUST), advised by Prof. Pascale Fung. He was a visiting researcher at Meta FAIR in Paris from 2024 to 2026. Prior to that, he interned at MEGVII (Face++) Research and Xiaobing.AI (Microsoft Xiaoice) in Beijing from 2021 to 2023. He received his bachelor’s degree in 2021 from Hohai University, where he worked with Prof. Fan Liu. He is now working on VL-JEPA and vision-language and world modeling.

Awards
  • Best Paper at AAAI 2023 Inaugural Summer Symposium Series - AI x Metaverse
  • Best Dataset Paper at Long-Tailed Distribution Learning Workshop, IJCAI 2021
  • Best Demo at IEEE ICME 2021
  • 江苏省优秀本科毕业论文一等奖
  • 河海大学2021届本科“优秀毕业生”荣誉称号
  • 2020江苏省大学生网络文化节校园歌曲作品征集一等奖
  • “江苏省优秀共青团员”称号
  • “2019江苏省大学生年度人物”提名奖
  • 2020年河海大学“海韵风华大学生年度人物”称号
Reviewer / Program Committee
  • ICLR, NeurIPS, CVPR, ICCV, ICML, ACL Rolling Review (ARR), AAAI, ACMMM
  • IEEE TPAMI, ACM TIST, Artificial Intelligence Review
Volunteer
  • AAAI 2024 (Vancouver, Canada)
  • ACL 2024 (Bangkok, Thailand)
Teaching Assistant
  • ELEC 1200 A System View of Communications (2024 Spring, HKUST)

Selected first & co-first aurthor papers. See full publication list in Google Scholar


Action100M: A Large-scale Video Action Dataset
Delong Chen, Tejaswi Kasarla, Yejin Bang, Mustafa Shukor, Willy Chung, Jade Yu, Allen Bolourchi, Theo Moutakanni, Pascale Fung
World Modeling Workshop
[ facebookresearch/Action100M (300+ stars)]
VL-JEPA: Joint Embedding Predictive Architecture for Vision-language
Delong Chen*, Mustafa Shukor*, Theo Moutakanni*, Willy Chung*, Jade Yu, Tejaswi Kasarla, Yejin Bang, Allen Bolourchi, Yann LeCun, Pascale Fung
ICLR 2026 & World Modeling Workshop (Oral Presentation)
[ Presentation] (bilibili)
Planning with Reasoning using Vision Language World Model
Delong Chen*, Theo Moutakanni*, Willy Chung, Yejin Bang, Ziwei Ji, Allen Bolourchi, Pascale Fung
World Modeling Workshop
WorldPrediction: A Benchmark for High-level World Modeling and Long-horizon Procedural Planning
Delong Chen*, Willy Chung*, Yejin Bang, Ziwei Ji, Pascale Fung
ICML 2025 Workshop on Assessing World Models
[ facebookresearch/WorldPrediction]
Subobject-level Image Tokenization
Delong Chen, Samuel Cahyawijaya, Jianfeng Liu, Baoyuan Wang, Pascale Fung
ICML 2025
[ ChenDelong1999/subobjects] [🤗 AK's Huggingface Daily Paper] [ Demo]
What Makes for Good Image Captions?
Delong Chen, Samuel Cahyawijaya, Etsuko Ishii, Ho Shu Chan, Yejin Bang, Pascale Fung
EMNLP 2025 Findings & NeurIPS 2024 Workshop on Machine Learning and Compression
RemoteCLIP: A Vision Language Foundation Model for Remote Sensing
Fan Liu*, Delong Chen*, Zhangqingyun Guan, Xiaocong Zhou, Jiale Zhu, Jun Zhou
IEEE Transactions on Geoscience and Remote Sensing, 2024
[ ChenDelong1999/RemoteCLIP (500+ stars)] [ Paperswithcode Leaderboard]
ProtoCLIP: Prototypical Contrastive Language Image Pretraining
Delong Chen, Zhao Wu, Fan Liu, Zaiquan Yang, Huaxi Huang, Ying Tan, Erjin Zhou
IEEE Transactions on Neural Networks and Learning Systems, 2023
[ megvii-research/protoclip] [ ITRA codebase]

* Equal Contribution

🎻


Delong was awarded a violin performance diploma from the Central Conservatory of Music (中央音乐学院).
He served as the concert master of the Hohai University Symphony Orchestra during 2019-2020. He is also at bilibili with 20k+ followers.

Internationale ☭

Internationale ☭

Piano: Qiwen Zhang (张启文). Violin: Delong Chen & Haolin Ouyang

南京-武汉11高校云合奏《汉阳门花园》

南京-武汉11高校云合奏《汉阳门花园》

Cloud Symphony: Hanyang Gate Garden. Organized an 11-university symphony orchestra cloud performance – composition, audio mixing, and video editing. Media coverage Xinhua News Agency (新华社), People’s Daily (人民日报)