Publications

Featured
List
Previews

LingBot-Map: Geometric Context Transformer for Streaming 3D Reconstruction

arXiv Apr. 2026 Chen, Gao

LingBot-VA: Causal World Modeling for Robot Control

arXiv Jan. 2026 Li

LingBot-World: Advancing Open-source World Models

arXiv Jan. 2026 Gao

LingBot-Depth: Masked Depth Modeling for Spatial Perception

arXiv Jan. 2026 Tan

Date

TitleSubtitle

First Authors Venue

LingBot-Map Geometric Context Transformer for Streaming 3D Reconstruction

Chen, Gao arXiv Jan. 2026

LingBot-VA Causal World Modeling for Robot Control

Li arXiv Jan. 2026

LingBot-World Advancing Open-source World Models

Gao arXiv Jan. 2026

LingBot-Depth Masked Depth Modeling for Spatial Perception

Tan arXiv Aug. 2025

Mixture of Contexts for Long Video Generation

Cai ICLR Jun. 2025

Video World Models with Long-term Spatial Memory

Wu NeurIPS May 2025

Interspatial Attention for Efficient 4D Human Video Generation

Shao, Xu SIGGRAPH Mar. 2025

CameraCtrl II Dynamic Scene Exploration via Camera-controlled Video Diffusion Models

He ICCV Mar. 2025

GroomLight Hybrid Inverse Rendering for Relightable Human Hair Appearance Modeling

Zheng CVPR Feb. 2025

FLARE Feed-forward Geometry, Appearance and Camera Estimation from Uncalibrated Sparse Views

Zhang, Wang, Xu CVPR Dec. 2024

Edicho Consistent Image Editing in the Wild

Bai ICCV Dec. 2024

Representing Long Volumetric Video with Temporal Gaussian Hierarchy

Xu, Xu, Yu SIGGRAPH Asia Dec. 2024

FiVA Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models

Wu NeurIPS Jul. 2024

Flow as the Cross-domain Manipulation Interface

Xu ✨ CoRL May 2024

3DitScene Editing Any Scene via Language-guided Disentangled Gaussian Splatting

Zhang ICLR May 2024

Collaborative Video Diffusion Consistent Multi-video Generation with Camera Control

Kuang, Cai NeurIPS Apr. 2024

CameraCtrl Enabling Camera Control for Video Diffusion Models

He ICLR Mar. 2024

GRM Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation

Xu, Shi ECCV Feb. 2024

Real-time 3D-aware Portrait Editing from a Single Image

Bai ECCV Dec. 2023

BerfScene Bev-conditioned Equivariant Radiance Fields for Infinite 3D Scene Generation

Zhang CVPR Dec. 2023

SceneWiz3D Towards Text-guided 3D Scene Composition

Zhang CVPR Dec. 2023

Neural Body Implicit Neural Representations with Structured Latent Codes for Novel View Synthesis of Dynamic Humans

Peng ✨ TPAMI Nov. 2023

Gaussian Shell Maps for Efficient 3D Human Generation

Abdal, Yifan, Shi CVPR Nov. 2023

DMV3D:Denoising Multi-View Diffusion using 3D Large Reconstruction Model

Xu ✨ ICLR Nov. 2023

PF-LRM Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction

Wang ✨ ICLR Nov. 2023

Instant3D Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model

Li ICLR Sep. 2023

Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis

Zhu arXiv Aug. 2023

Learning Modulated Transformation in GANs

Yang NeurIPS Jul. 2023

Efficient 3D Articulated Human Generation with Layered Surface Volumes

Xu ✨ 3DV Jun. 2023

Benchmarking and Analyzing 3D-aware Image Synthesis with a Modularized Codebase

Wang NeurIPS Mar. 2023

3D Generation on ImageNet

Skorokhodov ✨ ICLR Jan. 2023

GH-Feat Learning Versatile Generative Hierarchical Features from GANs

Xu TPAMI Jan. 2023

Learning 3D-aware Image Synthesis with Unknown Pose Distribution

Shi, Shen CVPR Dec. 2022

DiscoScene Spatially Disentangled Generative Radiance Field for Controllable 3D-aware Scene Synthesis

Xu ✨ CVPR Dec. 2022

GLeaD Improving GANs with A Generator-Leading Task

Bai CVPR Dec. 2022

Towards Smooth Video Composition

Zhang ICLR Oct. 2022

3D Generative Models A Survey

Shi, Peng, Xu arXiv Sep. 2022

Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator

Shi ✨ NeurIPS Sep. 2022

Improving GANs with A Dynamic Discriminator

Yang, Shen NeurIPS Mar. 2022

High-fidelity GAN Inversion with Padding Space

Bai, Xu ECCV Jan. 2022

Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation

Liu ✨ ECCV Dec. 2021

Region-Based Semantic Factorization in GANs

Zhu ICML Dec. 2021

3D-aware Image Synthesis via Learning Structural and Textural Representations

Xu CVPR Dec. 2021

Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition

Xu ✨ CVPR Dec. 2021

Improving GAN Equilibrium by Raising Spatial Awareness

Wang CVPR Nov. 2021

One-Shot Generative Domain Adaptation

Yang, Shen ICCV Jun. 2021

Data-Efficient Instance Generation from Instance Discrimination

Yang NeurIPS Jun. 2021

Learning Object-Compositional Neural Radiance Field for Editable Scene Rendering

Yang NeurIPS Jun. 2021

CompConv A Compact Convolution Module for Efficient Feature Learning

Zhang CVPRW May 2021

Decorating Your Own Bedroom Locally Controlling Image Generation with Generative Adversarial Networks

Zhang CVPRW Jul. 2020

Generative Hierarchical Features from Synthesizing Images

Xu, Shen ✨ CVPR Jul. 2020

Unsupervised Landmark Learning from Unpaired Data

Xu arXiv Jul. 2020

Video Representation Learning with Visual Tempo Consistency

Yang arXiv Apr. 2020

Temporal Pyramid Network for Action Recognition

Yang, Xu CVPR Dec. 2019

Dense RepPoints Representing Visual Objects with Dense Point Sets

Xu, Yang, Xue ECCV Dec. 2018

A Main/Subsidiary Network Framework for Simplifying Binary Networks

LingBot-Map: Geometric Context Transformer for Streaming 3D Reconstruction

arXiv Apr. 2026 Chen, Gao

LingBot-VA: Causal World Modeling for Robot Control

arXiv Jan. 2026 Li

LingBot-World: Advancing Open-source World Models

arXiv Jan. 2026 Gao

LingBot-Depth: Masked Depth Modeling for Spatial Perception

arXiv Jan. 2026 Tan

Mixture of Contexts for Long Video Generation

ICLR Aug. 2025 Cai

Video World Models with Long-term Spatial Memory — Video World Models with Long-term Spatial Memory

NeurIPS Jun. 2025 Wu

Interspatial Attention for Efficient 4D Human Video Generation

SIGGRAPH May 2025 Shao, Xu

CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models

ICCV Mar. 2025 He

GroomLight: Hybrid Inverse Rendering for Relightable Human Hair Appearance Modeling

CVPR Mar. 2025 Zheng

FLARE — FLARE: Feed-forward Geometry, Appearance and Camera Estimation from Uncalibrated Sparse Views

CVPR Feb. 2025 Zhang, Wang, Xu

Edicho — Edicho: Consistent Image Editing in the Wild

ICCV Dec. 2024 Bai

Representing Long Volumetric Video with Temporal Gaussian Hierarchy

SIGGRAPH Asia Dec. 2024 Xu, Xu, Yu

FiVA — FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models

NeurIPS Dec. 2024 Wu

Flow as the Cross-domain Manipulation Interface

CoRL Jul. 2024 Xu

3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting

ICLR May 2024 Zhang

Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control

NeurIPS May 2024 Kuang, Cai

CameraCtrl: Enabling Camera Control for Video Diffusion Models

ICLR Apr. 2024 He

GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation

ECCV Mar. 2024 Xu, Shi

Real-time 3D-aware Portrait Editing from a Single Image — Real-time 3D-aware Portrait Editing from a Single Image

ECCV Feb. 2024 Bai

BerfScene: Bev-conditioned Equivariant Radiance Fields for Infinite 3D Scene Generation

CVPR Dec. 2023 Zhang

SceneWiz3D — SceneWiz3D: Towards Text-guided 3D Scene Composition

CVPR Dec. 2023 Zhang

Neural Body — Neural Body: Implicit Neural Representations with Structured Latent Codes for Novel View Synthesis of Dynamic Humans

TPAMI Dec. 2023 Peng

Gaussian Shell Maps for Efficient 3D Human Generation

CVPR Nov. 2023 Abdal, Yifan, Shi

DMV3D:Denoising Multi-View Diffusion using 3D Large Reconstruction Model

ICLR Nov. 2023 Xu

PF-LRM — PF-LRM: Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction

ICLR Nov. 2023 Wang

Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model

ICLR Nov. 2023 Li

Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis — Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis

arXiv Sep. 2023 Zhu

Learning Modulated Transformation in GANs — Learning Modulated Transformation in GANs

NeurIPS Aug. 2023 Yang

Efficient 3D Articulated Human Generation with Layered Surface Volumes — Efficient 3D Articulated Human Generation with Layered Surface Volumes

3DV Jul. 2023 Xu

Benchmarking and Analyzing 3D-aware Image Synthesis with a Modularized Codebase — Benchmarking and Analyzing 3D-aware Image Synthesis with a Modularized Codebase

NeurIPS Jun. 2023 Wang

3D Generation on ImageNet — 3D Generation on ImageNet

ICLR Mar. 2023 Skorokhodov

GH-Feat — GH-Feat: Learning Versatile Generative Hierarchical Features from GANs

TPAMI Jan. 2023 Xu

Learning 3D-aware Image Synthesis with Unknown Pose Distribution — Learning 3D-aware Image Synthesis with Unknown Pose Distribution

CVPR Jan. 2023 Shi, Shen

DiscoScene — DiscoScene: Spatially Disentangled Generative Radiance Field for Controllable 3D-aware Scene Synthesis

CVPR Dec. 2022 Xu

GLeaD — GLeaD: Improving GANs with A Generator-Leading Task

CVPR Dec. 2022 Bai

Towards Smooth Video Composition

ICLR Dec. 2022 Zhang

3D Generative Models — 3D Generative Models: A Survey

arXiv Oct. 2022 Shi, Peng, Xu

Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator — Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator

NeurIPS Sep. 2022 Shi

Improving GANs with A Dynamic Discriminator — Improving GANs with A Dynamic Discriminator

NeurIPS Sep. 2022 Yang, Shen

High-fidelity GAN Inversion with Padding Space — High-fidelity GAN Inversion with Padding Space

ECCV Mar. 2022 Bai, Xu

Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation — Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation

CVPR Mar. 2022 Liu

Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation — Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation

ECCV Jan. 2022 Liu

Region-Based Semantic Factorization in GANs — Region-Based Semantic Factorization in GANs

ICML Dec. 2021 Zhu

3D-aware Image Synthesis via Learning Structural and Textural Representations — 3D-aware Image Synthesis via Learning Structural and Textural Representations

CVPR Dec. 2021 Xu

Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition — Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition

CVPR Dec. 2021 Xu

Improving GAN Equilibrium by Raising Spatial Awareness — Improving GAN Equilibrium by Raising Spatial Awareness

CVPR Dec. 2021 Wang

One-Shot Generative Domain Adaptation — One-Shot Generative Domain Adaptation

ICCV Nov. 2021 Yang, Shen

Data-Efficient Instance Generation from Instance Discrimination — Data-Efficient Instance Generation from Instance Discrimination

NeurIPS Jun. 2021 Yang

Learning Object-Compositional Neural Radiance Field for Editable Scene Rendering — Learning Object-Compositional Neural Radiance Field for Editable Scene Rendering

NeurIPS Jun. 2021 Yang

CompConv — CompConv: A Compact Convolution Module for Efficient Feature Learning

CVPRW Jun. 2021 Zhang

Decorating Your Own Bedroom — Decorating Your Own Bedroom: Locally Controlling Image Generation with Generative Adversarial Networks

CVPRW May 2021 Zhang

Generative Hierarchical Features from Synthesizing Images — Generative Hierarchical Features from Synthesizing Images

CVPR Jul. 2020 Xu, Shen

Unsupervised Landmark Learning from Unpaired Data — Unsupervised Landmark Learning from Unpaired Data

arXiv Jul. 2020 Xu

Video Representation Learning with Visual Tempo Consistency — Video Representation Learning with Visual Tempo Consistency

arXiv Jul. 2020 Yang

Temporal Pyramid Network for Action Recognition — Temporal Pyramid Network for Action Recognition

CVPR Apr. 2020 Yang, Xu

Dense RepPoints — Dense RepPoints: Representing Visual Objects with Dense Point Sets

ECCV Dec. 2019 Xu, Yang, Xue

A Main/Subsidiary Network Framework for Simplifying Binary Networks — A Main/Subsidiary Network Framework for Simplifying Binary Networks

CVPR Dec. 2018 Xu