Zixin Zhang's picture

3 11 3

Zixin Zhang

zhangzixin02

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 20 days ago

StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors

upvoted a paper 20 days ago

Vector Prism: Animating Vector Graphics by Stratifying Semantic Structure

authored a paper 21 days ago

A4-Agent: An Agentic Framework for Zero-Shot Affordance Reasoning

View all activity

Organizations

None yet

upvoted 2 papers 20 days ago

StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors

Paper • 2512.16915 • Published 20 days ago • 37

Vector Prism: Animating Vector Graphics by Stratifying Semantic Structure

Paper • 2512.14336 • Published 22 days ago • 28

upvoted a paper 22 days ago

A4-Agent: An Agentic Framework for Zero-Shot Affordance Reasoning

Paper • 2512.14442 • Published 22 days ago • 10

upvoted 2 papers about 1 month ago

Accelerating Streaming Video Large Language Models via Hierarchical Token Compression

Paper • 2512.00891 • Published Nov 30, 2025 • 14

DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation

Paper • 2511.23127 • Published Nov 28, 2025 • 43

upvoted a paper about 2 months ago

TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models

Paper • 2511.13704 • Published Nov 17, 2025 • 42

upvoted a paper 2 months ago

Multimodal Spatial Reasoning in the Large Model Era: A Survey and Benchmarks

Paper • 2510.25760 • Published Oct 29, 2025 • 16

upvoted 3 papers 3 months ago

How to Teach Large Multimodal Models New Skills

Paper • 2510.08564 • Published Oct 9, 2025 • 2

PhysToolBench: Benchmarking Physical Tool Understanding for MLLMs

Paper • 2510.09507 • Published Oct 10, 2025 • 10

LongLive: Real-time Interactive Long Video Generation

Paper • 2509.22622 • Published Sep 26, 2025 • 184

upvoted a paper 9 months ago

DiMeR: Disentangled Mesh Reconstruction Model

Paper • 2504.17670 • Published Apr 24, 2025 • 24