Meng Qianke

Multimodal AI Research

Hi, I'm Meng Qianke

Focusing on multimodal large models, agents, and video question answering, with an emphasis on vision-language generative intelligence.

Meng Qianke

Research Focus

Multimodal Understanding & Generation

Researching multimodal large models, agents, and video question answering, with a focus on vision-language generative intelligence and cutting-edge methods in cross-modal understanding, autonomous reasoning, and long-form video analysis.

Multimodal Large Models

Investigate robust alignment mechanisms across vision, language, and speech, combining contrastive learning with instruction tuning to enhance cross-modal semantic understanding and generation.

Agent Systems

Develop autonomous agents powered by large models, focusing on tool invocation, reasoning, planning, and environmental interaction for automated task execution.

Video Question Answering

Design temporal modeling and keyframe selection strategies for long-video understanding, integrating parameter-efficient tuning and few-shot learning to improve QA generalization and real-time performance.

Research Output

View research →
  • Paper2024

    VideoARM: Agentic Reasoning over Hierarchical Memory for Long-Form Video Understanding (Under review at CVPR)

    Proposing an agentic reasoning framework with hierarchical memory for long-form video understanding. Achieved significant improvements on multiple long-form video QA benchmarks.

    Video UnderstandingAgentsMultimodalLong Video
  • Contest2024

    Aesthetic Feature Modeling of Classical Jiangnan Gardens (National First Prize, China Graduate Mathematical Modeling Contest)

    Mathematical modeling approach to analyze aesthetic features and spatial layout patterns of classical Jiangnan gardens.

    Mathematical ModelingMultimodalAesthetic Analysis

Featured Projects

View all

Experience

  1. Hangzhou Dianzi University logo

    Graduate Student (Master's) · Hangzhou Dianzi University

    • Computer Technology
    • Research focus on multimodal large models
    2024 - Present
  2. Henan University logo

    Undergraduate (Bachelor's) · Henan University

    • Computer Science and Technology
    • Bachelor of Engineering
    2020 - 2024

Get in touch

Let's connect around multimodal research, collaborations, or personal projects.

Scan to Connect

WeChat QR Code
WeChat
Scan to add
Xiaohongshu QR Code
Xiaohongshu
Scan to add
X QR Code
X
Scan to add