Ph.D Candidate at Nanjing University
About Me
Iām pursuing a Ph.D. in Computer Science at Nanjing University, advised by Prof. Limin Wang and Prof. Tong Lu.
My research interests are General visual perception and human-computer and multimodal interaction system. I am focusing on video understanding, egocentric vision perception and user-centric visual computing.
News
- [2023-10-10] In the first Perception Test challenge, We obtain the best performance in Temporal Sound Localisation & runner-up in Temporal Action Localisation. The code of solution is here.
- [2023-08-16] Code of MAT is released in here.
- [2023-07-14] Our paper MAT is accepted by ICCV.
- [2023-05-22] We present a novel Video Sequence Understanding Framework VideoLLM.
- [2023-04-03] BasicTAD is accepted by CVIU.
- [2023-01-17] Our team wins the champion of WSDM Cup 2023 Toloka VQA Challenge.
- [2022-11-17] š We provide the final Ego4D report and the code.
- [2022-09-19] Our team wins Top-1 rankings in 7 tracks of Ego4D ECCV2022 Challenge.
- [2022-09-15] We have released the source code of BasicTAD.
- [2022-06-21] Code of DCAN is released in here.
- [2022-05-05] We present the BasicTAD, an end-to-end TAD baseline method.
- [2021-12-01] Our paper DCAN is accepted by AAAI.
Education & Experiences
- Nanjing University, Nanjing, China
Sept 2020 - present - University of South China, Hengyang, China
Sept 2015 - Jun 2019
Publication
Memory-and-Anticipation Transformer for Online Action Understanding
Jiahao Wang*, Guo Chen*, Yifei Huang, Limin Wang, Tong Lu#
International Conference on Computer Vision (ICCV), 2023
Introduction: This work presents a memory-anticipation-based method for online action understanding.
BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection
Min Yang*, Guo Chen*, Yin-Dong Zheng, Tong Lu, Limin Wang#
Computer Vision and Image Understanding (CVIU)
Introduction: This work presents a simple yet effective end-to-end training framework for temporal action detection.
DCAN: Improving Temporal Action Detection via Dual Context Aggregation
Guo Chen, Yin-Dong Zheng, Limin Wang, Tong Lu#
Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI), 2022
This work explored boundary-based methods for temporal action detection and proposed a novel network, termed DCAN, to improve temporal action detection via temporal-level and proposal-level context aggregation.
Projects
InternVideo: General Video Foundation Models via Generative and Discriminative Learning
Yi Wang, Kunchang Li, Yizhuo Li, Yinan He, Bingkun Huang, Zhiyu Zhao, Hongjie Zhang, Jilan Xu, Yi Liu, Zun Wang, Sen Xing, Guo Chen, Junting Pan, Jiashuo Yu, Yali Wang, Limin Wang, Yu Qiao#
Arxiv, 2022
InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges
Guo Chen*, Sen Xing*, Zhe Chen*, Yi Wang*, Kunchang Li, Yizhuo Li, Yi Liu, Jiahao Wang, Yin-Dong Zheng, Bingkun Huang, Zhiyu Zhao, Junting Pan, Yifei Huang, Zun Wang, Jiashuo Yu, Yinan He, Hongjie Zhang, Tong Lu, Yali Wang, Limin Wang, Yu Qiao#
Arxiv, 2022
Introduction: This work presents our champion solutions to five tracks at Ego4D challenge.
Contests and Awards
- [2023-10] 1st Perception Test Challenge, Top-1 and Top-2 Rankings
- [2023-01] WSDM Cup 2023 Toloka VQA Challenge, WSDM2023, Top-1 Ranking
- [2022-10] 2nd Ego4D Challenge, ECCV2022, 7 Top-1 Rankings
- [2017-12] CCPC Final Contest, Bronze Medal
- [2017-10] CCPC Regional Contest, Bronze Medal
- [2017-10] ACM-ICPC Asia Regional Contest, Silver Medal