Research
Research Projects & Papers
A living log of multimodal papers, experiments, and studies that translate hands-on research into reusable methodologies.
- Paper•2024
VideoARM: Agentic Reasoning over Hierarchical Memory for Long-Form Video Understanding (Under review at CVPR)
Proposing an agentic reasoning framework with hierarchical memory for long-form video understanding. Achieved significant improvements on multiple long-form video QA benchmarks.
Video UnderstandingAgentsMultimodalLong Video - Contest•2024
Aesthetic Feature Modeling of Classical Jiangnan Gardens (National First Prize, China Graduate Mathematical Modeling Contest)
Mathematical modeling approach to analyze aesthetic features and spatial layout patterns of classical Jiangnan gardens.
Mathematical ModelingMultimodalAesthetic Analysis - Paper•2024
Undergraduate Thesis: Analysis and Practice of PDE Solving Methods Based on Fourier Neural Operator
Research on deep learning methods for solving partial differential equations using Fourier Neural Operator (FNO), achieving a balance between high accuracy and efficiency on multiple classic PDE benchmarks, exploring a new data-driven paradigm for scientific computing.
Neural OperatorPDEScientific ComputingDeep Learning