Xu Lab

Publications

Date
TitleSubtitle
First Authors Venue
Publication thumbnail

LingBot-Map Geometric Context Transformer for Streaming 3D Reconstruction

Chen, Gao arXiv
Publication thumbnail

LingBot-VA Causal World Modeling for Robot Control

Li arXiv
Publication thumbnail

LingBot-World Advancing Open-source World Models

Gao arXiv
Publication thumbnail

LingBot-Depth Masked Depth Modeling for Spatial Perception

Tan arXiv
Publication thumbnail

Mixture of Contexts for Long Video Generation

Cai ICLR
Publication thumbnail

Video World Models with Long-term Spatial Memory

Wu NeurIPS
Publication thumbnail

Interspatial Attention for Efficient 4D Human Video Generation

Shao, Xu SIGGRAPH
Publication thumbnail

CameraCtrl II Dynamic Scene Exploration via Camera-controlled Video Diffusion Models

He ICCV
Publication thumbnail

GroomLight Hybrid Inverse Rendering for Relightable Human Hair Appearance Modeling

Zheng CVPR
Publication thumbnail

FLARE Feed-forward Geometry, Appearance and Camera Estimation from Uncalibrated Sparse Views

Zhang, Wang, Xu CVPR
Publication thumbnail

Edicho Consistent Image Editing in the Wild

Bai ICCV
Publication thumbnail

Representing Long Volumetric Video with Temporal Gaussian Hierarchy

Xu, Xu, Yu SIGGRAPH Asia
Publication thumbnail

FiVA Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models

Wu NeurIPS
Publication thumbnail

Flow as the Cross-domain Manipulation Interface

Xu CoRL
Publication thumbnail

3DitScene Editing Any Scene via Language-guided Disentangled Gaussian Splatting

Zhang ICLR
Publication thumbnail

Collaborative Video Diffusion Consistent Multi-video Generation with Camera Control

Kuang, Cai NeurIPS
Publication thumbnail

CameraCtrl Enabling Camera Control for Video Diffusion Models

He ICLR
Publication thumbnail

GRM Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation

Xu, Shi ECCV
Publication thumbnail

Real-time 3D-aware Portrait Editing from a Single Image

Bai ECCV
Publication thumbnail

BerfScene Bev-conditioned Equivariant Radiance Fields for Infinite 3D Scene Generation

Zhang CVPR
Publication thumbnail

SceneWiz3D Towards Text-guided 3D Scene Composition

Zhang CVPR
Publication thumbnail

Neural Body Implicit Neural Representations with Structured Latent Codes for Novel View Synthesis of Dynamic Humans

Peng TPAMI
Publication thumbnail

Gaussian Shell Maps for Efficient 3D Human Generation

Abdal, Yifan, Shi CVPR
Publication thumbnail

DMV3D:Denoising Multi-View Diffusion using 3D Large Reconstruction Model

Xu ICLR
Publication thumbnail

PF-LRM Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction

Wang ICLR
Publication thumbnail

Instant3D Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model

Li ICLR
Publication thumbnail

Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis

Zhu arXiv
Publication thumbnail

Learning Modulated Transformation in GANs

Yang NeurIPS
Publication thumbnail

Efficient 3D Articulated Human Generation with Layered Surface Volumes

Xu 3DV
Publication thumbnail

Benchmarking and Analyzing 3D-aware Image Synthesis with a Modularized Codebase

Wang NeurIPS
Publication thumbnail

3D Generation on ImageNet

Skorokhodov ICLR
Publication thumbnail

GH-Feat Learning Versatile Generative Hierarchical Features from GANs

Xu TPAMI
Publication thumbnail

Learning 3D-aware Image Synthesis with Unknown Pose Distribution

Shi, Shen CVPR
Publication thumbnail

DiscoScene Spatially Disentangled Generative Radiance Field for Controllable 3D-aware Scene Synthesis

Xu CVPR
Publication thumbnail

GLeaD Improving GANs with A Generator-Leading Task

Bai CVPR
Publication thumbnail

Towards Smooth Video Composition

Zhang ICLR
Publication thumbnail

3D Generative Models A Survey

Shi, Peng, Xu arXiv
Publication thumbnail

Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator

Shi NeurIPS
Publication thumbnail

Improving GANs with A Dynamic Discriminator

Yang, Shen NeurIPS
Publication thumbnail

High-fidelity GAN Inversion with Padding Space

Bai, Xu ECCV
Publication thumbnail

Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation

Liu CVPR
Publication thumbnail

Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation

Liu ECCV
Publication thumbnail

Region-Based Semantic Factorization in GANs

Zhu ICML
Publication thumbnail

3D-aware Image Synthesis via Learning Structural and Textural Representations

Xu CVPR
Publication thumbnail

Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition

Xu CVPR
Publication thumbnail

Improving GAN Equilibrium by Raising Spatial Awareness

Wang CVPR
Publication thumbnail

One-Shot Generative Domain Adaptation

Yang, Shen ICCV
Publication thumbnail

Data-Efficient Instance Generation from Instance Discrimination

Yang NeurIPS
Publication thumbnail

Learning Object-Compositional Neural Radiance Field for Editable Scene Rendering

Yang NeurIPS
Publication thumbnail

CompConv A Compact Convolution Module for Efficient Feature Learning

Zhang CVPRW
Publication thumbnail

Decorating Your Own Bedroom Locally Controlling Image Generation with Generative Adversarial Networks

Zhang CVPRW
Publication thumbnail

Generative Hierarchical Features from Synthesizing Images

Xu, Shen CVPR
Publication thumbnail

Unsupervised Landmark Learning from Unpaired Data

Xu arXiv
Publication thumbnail

Video Representation Learning with Visual Tempo Consistency

Yang arXiv
Publication thumbnail

Temporal Pyramid Network for Action Recognition

Yang, Xu CVPR
Publication thumbnail

Dense RepPoints Representing Visual Objects with Dense Point Sets

Xu, Yang, Xue ECCV
Publication thumbnail

A Main/Subsidiary Network Framework for Simplifying Binary Networks

Xu CVPR

LingBot-Map: Geometric Context Transformer for Streaming 3D Reconstruction

arXiv Apr. 2026 Chen, Gao

LingBot-VA: Causal World Modeling for Robot Control

arXiv Jan. 2026 Li

LingBot-World: Advancing Open-source World Models

arXiv Jan. 2026 Gao

LingBot-Depth: Masked Depth Modeling for Spatial Perception

arXiv Jan. 2026 Tan

Mixture of Contexts for Long Video Generation

ICLR Aug. 2025 Cai
Video World Models with Long-term Spatial Memory

Video World Models with Long-term Spatial Memory

NeurIPS Jun. 2025 Wu

Interspatial Attention for Efficient 4D Human Video Generation

SIGGRAPH May 2025 Shao, Xu

CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models

ICCV Mar. 2025 He

GroomLight: Hybrid Inverse Rendering for Relightable Human Hair Appearance Modeling

CVPR Mar. 2025 Zheng
FLARE

FLARE: Feed-forward Geometry, Appearance and Camera Estimation from Uncalibrated Sparse Views

CVPR Feb. 2025 Zhang, Wang, Xu
Edicho

Edicho: Consistent Image Editing in the Wild

ICCV Dec. 2024 Bai

Representing Long Volumetric Video with Temporal Gaussian Hierarchy

SIGGRAPH Asia Dec. 2024 Xu, Xu, Yu
FiVA

FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models

NeurIPS Dec. 2024 Wu

Flow as the Cross-domain Manipulation Interface

CoRL Jul. 2024 Xu

3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting

ICLR May 2024 Zhang

Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control

NeurIPS May 2024 Kuang, Cai

CameraCtrl: Enabling Camera Control for Video Diffusion Models

ICLR Apr. 2024 He

GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation

ECCV Mar. 2024 Xu, Shi
Real-time 3D-aware Portrait Editing from a Single Image

Real-time 3D-aware Portrait Editing from a Single Image

ECCV Feb. 2024 Bai

BerfScene: Bev-conditioned Equivariant Radiance Fields for Infinite 3D Scene Generation

CVPR Dec. 2023 Zhang
SceneWiz3D

SceneWiz3D: Towards Text-guided 3D Scene Composition

CVPR Dec. 2023 Zhang
Neural Body

Neural Body: Implicit Neural Representations with Structured Latent Codes for Novel View Synthesis of Dynamic Humans

TPAMI Dec. 2023 Peng

Gaussian Shell Maps for Efficient 3D Human Generation

CVPR Nov. 2023 Abdal, Yifan, Shi

DMV3D:Denoising Multi-View Diffusion using 3D Large Reconstruction Model

ICLR Nov. 2023 Xu
PF-LRM

PF-LRM: Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction

ICLR Nov. 2023 Wang

Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model

ICLR Nov. 2023 Li
Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis

Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis

arXiv Sep. 2023 Zhu
Learning Modulated Transformation in GANs

Learning Modulated Transformation in GANs

NeurIPS Aug. 2023 Yang
Efficient 3D Articulated Human Generation with Layered Surface Volumes

Efficient 3D Articulated Human Generation with Layered Surface Volumes

3DV Jul. 2023 Xu
Benchmarking and Analyzing 3D-aware Image Synthesis with a Modularized Codebase

Benchmarking and Analyzing 3D-aware Image Synthesis with a Modularized Codebase

NeurIPS Jun. 2023 Wang
3D Generation on ImageNet

3D Generation on ImageNet

ICLR Mar. 2023 Skorokhodov
GH-Feat

GH-Feat: Learning Versatile Generative Hierarchical Features from GANs

TPAMI Jan. 2023 Xu
Learning 3D-aware Image Synthesis with Unknown Pose Distribution

Learning 3D-aware Image Synthesis with Unknown Pose Distribution

CVPR Jan. 2023 Shi, Shen
DiscoScene

DiscoScene: Spatially Disentangled Generative Radiance Field for Controllable 3D-aware Scene Synthesis

CVPR Dec. 2022 Xu
GLeaD

GLeaD: Improving GANs with A Generator-Leading Task

CVPR Dec. 2022 Bai

Towards Smooth Video Composition

ICLR Dec. 2022 Zhang
3D Generative Models

3D Generative Models: A Survey

arXiv Oct. 2022 Shi, Peng, Xu
Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator

Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator

NeurIPS Sep. 2022 Shi
Improving GANs with A Dynamic Discriminator

Improving GANs with A Dynamic Discriminator

NeurIPS Sep. 2022 Yang, Shen
High-fidelity GAN Inversion with Padding Space

High-fidelity GAN Inversion with Padding Space

ECCV Mar. 2022 Bai, Xu
Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation

Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation

CVPR Mar. 2022 Liu
Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation

Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation

ECCV Jan. 2022 Liu
Region-Based Semantic Factorization in GANs

Region-Based Semantic Factorization in GANs

ICML Dec. 2021 Zhu
3D-aware Image Synthesis via Learning Structural and Textural Representations

3D-aware Image Synthesis via Learning Structural and Textural Representations

CVPR Dec. 2021 Xu
Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition

Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition

CVPR Dec. 2021 Xu
Improving GAN Equilibrium by Raising Spatial Awareness

Improving GAN Equilibrium by Raising Spatial Awareness

CVPR Dec. 2021 Wang
One-Shot Generative Domain Adaptation

One-Shot Generative Domain Adaptation

ICCV Nov. 2021 Yang, Shen
Data-Efficient Instance Generation from Instance Discrimination

Data-Efficient Instance Generation from Instance Discrimination

NeurIPS Jun. 2021 Yang
Learning Object-Compositional Neural Radiance Field for Editable Scene Rendering

Learning Object-Compositional Neural Radiance Field for Editable Scene Rendering

NeurIPS Jun. 2021 Yang
CompConv

CompConv: A Compact Convolution Module for Efficient Feature Learning

CVPRW Jun. 2021 Zhang
Decorating Your Own Bedroom

Decorating Your Own Bedroom: Locally Controlling Image Generation with Generative Adversarial Networks

CVPRW May 2021 Zhang
Generative Hierarchical Features from Synthesizing Images

Generative Hierarchical Features from Synthesizing Images

CVPR Jul. 2020 Xu, Shen
Unsupervised Landmark Learning from Unpaired Data

Unsupervised Landmark Learning from Unpaired Data

arXiv Jul. 2020 Xu
Video Representation Learning with Visual Tempo Consistency

Video Representation Learning with Visual Tempo Consistency

arXiv Jul. 2020 Yang
Temporal Pyramid Network for Action Recognition

Temporal Pyramid Network for Action Recognition

CVPR Apr. 2020 Yang, Xu
Dense RepPoints

Dense RepPoints: Representing Visual Objects with Dense Point Sets

ECCV Dec. 2019 Xu, Yang, Xue
A Main/Subsidiary Network Framework for Simplifying Binary Networks

A Main/Subsidiary Network Framework for Simplifying Binary Networks

CVPR Dec. 2018 Xu