Guo Chen

Guo Chen (陈果)

Ph.D Candidate at Nanjing University

chenguo1177@gmail.com

About Me

I‘m pursuing a Ph.D. in Computer Science at Nanjing University, advised by Prof. Limin Wang and Prof. Tong Lu.

My research interests are General visual perception and human-computer and multimodal interaction system. I am focusing on video understanding, egocentric vision perception and user-centric visual computing.

News

[2023-10-10] In the first Perception Test challenge, We obtain the best performance in Temporal Sound Localisation & runner-up in Temporal Action Localisation. The code of solution is here.
[2023-08-16] Code of MAT is released in here.
[2023-07-14] Our paper MAT is accepted by ICCV.
[2023-05-22] We present a novel Video Sequence Understanding Framework VideoLLM.
[2023-04-03] BasicTAD is accepted by CVIU.
[2023-01-17] Our team wins the champion of WSDM Cup 2023 Toloka VQA Challenge.
[2022-11-17] 🎂 We provide the final Ego4D report and the code.
[2022-09-19] Our team wins Top-1 rankings in 7 tracks of Ego4D ECCV2022 Challenge.
[2022-09-15] We have released the source code of BasicTAD.
[2022-06-21] Code of DCAN is released in here.
[2022-05-05] We present the BasicTAD, an end-to-end TAD baseline method.
[2021-12-01] Our paper DCAN is accepted by AAAI.

Education & Experiences

Nanjing University, Nanjing, China
Sept 2020 - present
University of South China, Hengyang, China
Sept 2015 - Jun 2019

Publication

Memory-and-Anticipation Transformer for Online Action Understanding

Jiahao Wang*, Guo Chen*, Yifei Huang, Limin Wang, Tong Lu#

International Conference on Computer Vision (ICCV), 2023

Introduction: This work presents a memory-anticipation-based method for online action understanding.

[Home Page]

BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection

Min Yang*, Guo Chen*, Yin-Dong Zheng, Tong Lu, Limin Wang#

Computer Vision and Image Understanding (CVIU)

Introduction: This work presents a simple yet effective end-to-end training framework for temporal action detection.

[PDF] [bibtex] [code]

DCAN: Improving Temporal Action Detection via Dual Context Aggregation

Guo Chen, Yin-Dong Zheng, Limin Wang, Tong Lu#

Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI), 2022

This work explored boundary-based methods for temporal action detection and proposed a novel network, termed DCAN, to improve temporal action detection via temporal-level and proposal-level context aggregation.

[PDF] [bibtex] [code]

Projects

InternVideo: General Video Foundation Models via Generative and Discriminative Learning

Yi Wang, Kunchang Li, Yizhuo Li, Yinan He, Bingkun Huang, Zhiyu Zhao, Hongjie Zhang, Jilan Xu, Yi Liu, Zun Wang, Sen Xing, Guo Chen, Junting Pan, Jiashuo Yu, Yali Wang, Limin Wang, Yu Qiao#

Arxiv, 2022

[PDF] [bibtex] [code]

InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges

Guo Chen*, Sen Xing*, Zhe Chen*, Yi Wang*, Kunchang Li, Yizhuo Li, Yi Liu, Jiahao Wang, Yin-Dong Zheng, Bingkun Huang, Zhiyu Zhao, Junting Pan, Yifei Huang, Zun Wang, Jiashuo Yu, Yinan He, Hongjie Zhang, Tong Lu, Yali Wang, Limin Wang, Yu Qiao#

Arxiv, 2022

Introduction: This work presents our champion solutions to five tracks at Ego4D challenge.

[PDF] [bibtex] [code]

Contests and Awards

[2023-10] 1st Perception Test Challenge, Top-1 and Top-2 Rankings
[2023-01] WSDM Cup 2023 Toloka VQA Challenge, WSDM2023, Top-1 Ranking
[2022-10] 2nd Ego4D Challenge, ECCV2022, 7 Top-1 Rankings
[2017-12] CCPC Final Contest, Bronze Medal
[2017-10] CCPC Regional Contest, Bronze Medal
[2017-10] ACM-ICPC Asia Regional Contest, Silver Medal