Publications

(2025). Video Active Perception: Efficient Inference-Time Long-Form Video Understanding with Vision-Language Models. At ICLR 2025 (under review).

Cite

(2024). Latent Representation Matters: Human-like Sketches in One-shot Drawing Tasks. At NeurIPS 2024.

PDF Cite Abstract

(2024). Large Content And Behavior Models To Understand, Simulate, And Optimize Content And Behavior. At ICLR 2024.

PDF Cite Abstract

(2022). Neural Feature-Adaptation for Symbolic Predictions Using Pre-Training and Semantic Loss. On ArXiv (preprint).

PDF Cite Abstract