Geuntaek Lim

Seoul, South Korea

My research centers on multimodal AI with a particular emphasis on video-centric modeling. My goal is to empower AI systems to assist humans in interpreting complex video content, facilitating high-level reasoning across applications such as sports analytics, surveillance, and media content analysis. I am deeply interested in exploring three key areas for a comprehensive understanding of videos:

Efficient Video Representation: Addressing the high computational demands of video data by developing efficient learning approaches.
Perception and Reasoning in Videos: Tackling the challenge of understanding temporal dynamics, including frame continuity, causality, and long-term dependencies.
Multimodal Learning: Leveraging audio encoded in video streams to extract semantic information that complements, yet remains distinct from, visual semantics.

news

Oct 27, 2025	🏢 I am now working as a research intern at Naver Cloud (Video Understanding Team).
Mar 19, 2025	💻 I am now working as a research intern at SNU Machine Perception and Reasoning Lab.
Dec 16, 2024	🎓 I successfully completed my Master's degree defense.
Jul 15, 2024	📃 A paper on weakly supervised temporal action localization got accepted to ACM MM 2024.
Dec 09, 2023	📃 A paper on content-based video retrieval got accepted to AAAI 2024.

education

Mar 02, 2026	Ph.D. Dept. of AI Robotics in Sejong University.
Mar 02, 2023	M.S. Dept. of AI Robotics in Sejong University.
Mar 02, 2017	B.S. Dept. of AI Robotics in Sejong University.

selected publications

arXiv

Visual Semantic Hierarchical Learning for Open-Vocabulary Scene Graph Generation

In arXiv preprint, 2026

MM

Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization

Geuntaek Lim, Hyunwoo Kim, Joonsoo Kim, and Yukyung Choi^†

In ACM Multimedia (MM), 2024

AAAI

VVS: Video-to-video retrieval with irrelevant frame suppression

Won Jo, Geuntaek Lim, Gwangjin Lee, Hyunwoo Kim, Byungsoo Ko, and Yukyung Choi^†

In Association for the Advancement of Artificial Intelligence (AAAI), 2024

IEEE Access

Simultaneous Video Retrieval and Alignment

Won Jo, Geuntaek Lim, Yujin Hwang, Gwangjin Lee, Joonsoo Kim, Joungil Yun, Jiyoung Jung, and Yukyung Choi^†

IEEE Access, 2023

IEEE Access

Exploring the temporal cues to enhance video retrieval on standardized CDVA

Won Jo, Geuntaek Lim, Joonsoo Kim, Joungil Yun, and Yukyung Choi^†

IEEE Access, 2022
