publications

2025

  1. VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction
    Zhiwen Fan*, Jian Zhang*, Renjie Li, and 8 more authors
    arXiv preprint, 2025
  2. DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling
    Kairun Wen, Runyu Chen, Hui Zheng, and 8 more authors
    In NeurIPS, 2025

2024

  1. Large Spatial Model: End-to-End Unposed Images to Semantic 3D
    Zhiwen Fan*, Jian Zhang*, Wenyan Cong, and 8 more authors
    In NeurIPS, 2024
  2. InstantSplat: Unbounded Sparse-View Pose-Free Gaussian Splatting in 40 Seconds
    Zhiwen Fan, Wenyan Cong, Kairun Wen, and 8 more authors
    arXiv preprint, 2024