Geuntaek Lim

I’m a Ph.D. student at the University of Sejong, advised by Prof. Yukyung Choi, working on LMMs/Agents for Video Understanding.

gtlim.jpg

Seoul, South Korea

My research centers on multimodal AI with a particular emphasis on video-centric modeling. My goal is to empower AI systems to assist humans in interpreting complex video content, facilitating high-level reasoning across applications such as sports analytics, surveillance, and media content analysis. I am deeply interested in exploring three key areas for a comprehensive understanding of videos:

  1. Efficient Video Representation: Addressing the high computational demands of video data by developing efficient learning approaches..

  2. Perception and Reasoning in Videos: Tackling the challenge of understanding temporal dynamics, including frame continuity, causality, and long-term dependencies.

  3. Multi-modal Learning: Leveraging audio encoded in video streams to extract semantic information that complements, yet remains distinct from, visual semantics.

news

Oct 27, 2025 🏢 I am now working as a research intern at Naver Cloud (Video Understanding Team).
Mar 19, 2025 💻 I am now working as a research intern at SNU Machine Perception and Reasoning Lab.
Dec 16, 2024 🎓 I successfully completed my Master Degree defense.
Jul 15, 2024 📃 A paper on weakly supervised temporal action localization got accepted to ACM MM 2024.
Dec 09, 2023 📃 A paper on content-based video retrieval got accepted to AAAI 2024.

education

Mar 02, 2026 Ph.D. Dept. of AI Robotics in Sejong University.
Mar 02, 2023 M.S. Dept. of AI Robotics in Sejong University.
Mar 02, 2017 B.S. Dept. of AI Robotics in Sejong University.

selected publications

  1. arXiv
    fig_HypSGG.png
    Visual Semantic Hierarchical Learning for Open-Vocabulary Scene Graph Generation
    In arXiv preprint, , 2026
  2. MM
    fig_PVLR.png
    Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization
    Geuntaek Lim, Hyunwoo Kim, Joonsoo Kim, and Yukyung Choi
    In ACM Multimedia (MM) , 2024
  3. AAAI
    fig_VVS.png
    VVS: Video-to-video retrieval with irrelevant frame suppression
    Won Jo, Geuntaek Lim, Gwangjin Lee, Hyunwoo Kim, Byungsoo Ko, and Yukyung Choi
    In Association for the Advancement of Artificial Intelligence (AAAI) , 2024
  4. IEEE Access
    fig_SRA.png
    Simultaneous Video Retrieval and Alignment
    Won Jo, Geuntaek Lim, Yujin Hwang, Gwangjin Lee, Joonsoo Kim, Joungil Yun, Jiyoung Jung, and Yukyung Choi
    IEEE Access, 2023
  5. IEEE Access
    fig_tnip.png
    Exploring the temporal cues to enhance video retrieval on standardized cdva
    Won Jo, Geuntaek Lim, Joonsoo Kim, Joungil Yun, and Yukyung Choi
    IEEE Access, 2022