REPARO: Compositional 3D Assets Generation
with Differentiable 3D Layout Alignment

Haonan Han1,*, Rui Yang2,*, Huan Liao1,*, Jiankai Xing1, Zunnan Xu1,
Xiaoming Yu3, Junwei Zha3, Xiu Li1,†, Wanhua Li4,†

1 Tsinghua University, 2 The University of Hong Kong

3 Tencent Meeting, 4 Harvard University

Abstract

This paper presents REPARO, an innovative approach designed to address the challenges of compositional 3D assets generation from single images. Traditional image-to-3D models often struggle with scenes containing multiple objects due to biases and occlusion complexities.

To address this challenge, we present REPARO, a novel approach for compositional 3D asset generation from single images. REPARO employs a two-step process: first, it extracts individual objects from the scene and reconstructs their 3D meshes using off-the-shelf image-to-3D models; then, it optimizes the layout of these meshes through differentiable rendering techniques, ensuring coherent scene composition.

By integrating advanced techniques such as Optimal Transport for layout alignment and high-level semantic integration for contextual coherence, REPARO significantly enhances object independence, detail accuracy, and overall scene coherence. Extensive evaluation on multi-object scenes demonstrates REPARO's effectiveness, establishing it as a robust solution for multi-object 3D scene generation. With promising applications in augmented reality, virtual reality, gaming, and beyond, REPARO offers a comprehensive approach to address the complexities of multi-object 3D scene generation from single images.

Pipeline

Pipeline Image Pipeline Image

Main results

Quantitative experiment

We conducted a quantitative experiment to evaluate the performance of REPARO compared to existing methods. The results demonstrate that REPARO achieves superior accuracy and coherence in generating multi-object 3D scenes from single images.

Quantitative Experiment Results

Qualitative experiment

Reference

Qualitative Experiment Image 1

DreamGaussian

Qualitative Experiment GIF 1

LRM

Qualitative Experiment GIF 2

Wonder3D

Qualitative Experiment GIF 3

REPARO(ours)

Qualitative Experiment GIF 4
Qualitative Experiment Image 2
Qualitative Experiment GIF 5
Qualitative Experiment GIF 6
Qualitative Experiment GIF 7
Qualitative Experiment GIF 8
Qualitative Experiment Image 3
Qualitative Experiment GIF 9
Qualitative Experiment GIF 10
Qualitative Experiment GIF 11
Qualitative Experiment GIF 12
Qualitative Experiment Image 4
Qualitative Experiment GIF 13
Qualitative Experiment GIF 14
Qualitative Experiment GIF 15
Qualitative Experiment GIF 16
Qualitative Experiment Image 5
Qualitative Experiment GIF 17
Qualitative Experiment GIF 18
Qualitative Experiment GIF 19
Qualitative Experiment GIF 20
Qualitative Experiment Image 6
Qualitative Experiment GIF 21
Qualitative Experiment GIF 22
Qualitative Experiment GIF 23
Qualitative Experiment GIF 24
Qualitative Experiment Image 7
Qualitative Experiment GIF 25
Qualitative Experiment GIF 26
Qualitative Experiment GIF 27
Qualitative Experiment GIF 28

Layout Alignment

Qualitative Experiment GIF
Qualitative Experiment GIF
Qualitative Experiment GIF
Qualitative Experiment GIF
Qualitative Experiment GIF
Qualitative Experiment GIF
Qualitative Experiment GIF

BibTeX

        @misc{han2024reparo,
        title={REPARO: Compositional 3D Assets Generation with Differentiable 3D Layout Alignment}, 
        author={Haonan Han and Rui Yang and Huan Liao and Jiankai Xing and Zunnan Xu and Xiaoming Yu and Junwei Zha and Xiu Li and Wanhua Li},
        year={2024},
        eprint={2405.18525},
        archivePrefix={arXiv},
        primaryClass={cs.CV}
        }