Chuofan Ma (马逴凡)

I'm a Ph.D. student from CVMI Lab, The University of Hong Kong (HKU), under the supervision of Prof. Xiaojuan Qi. Before that, I obtained my bachelor degree in computer science from HKU.

My research interest primarily lies in open-world visual intelligence and multi-modal fundation models. Please feel free to drop me an email if you are interested in what I do and seek for possible collaborations.

Email  /  Google Scholar  /  Github


Research

Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models
Chuofan Ma, Yi Jiang, Jiannan Wu, Zehuan Yuan, Xiaojuan Qi
arxiv preprint, April, 2024
Paper / Code / Page

CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
Chuofan Ma, Yi Jiang, Xin Wen, Zehuan Yuan, Xiaojuan Qi
Conference on Neural Information Processing Systems (NeurIPS), 2023
Paper / Code / Page

Recognize Any Regions
Haosen Yang, Chuofan Ma, Bin Wen, Yi Jiang2, Zehuan Yuan, Xiatian Zhu
arxiv preprint, Nov, 2023
Paper / Code

EGC: Image Generation and Classification via a Diffusion Energy-Based Model
Qiushan Guo, Chuofan Ma, Yi Jiang, Zehuan Yuan, Yizhou Yu, Ping Luo
International Conference on Computer Vision (ICCV), 2023
Paper / Code / Page

Rethinking Resolution in the Context of Efficient Video Recognition
Chuofan Ma, Qiushan Guo, Yi Jiang, Ping Luo, Zehuan Yuan, Xiaojuan Qi
Conference on Neural Information Processing Systems (NeurIPS), 2022
Paper / Code


Website template from Jon Barron