SplatR: Experience Goal Visual Rearrangement with 3D Gaussian Splatting and Dense Feature Matching

1IIT(ISM) Dhanbad, 2University of Bremen 3IIIT Allahabad

SplatR Overview

Abstract

We present a novel approach that uses 3D Gaussian Splatting for experience goal visual rearrangement.

Experience Goal Visual Rearrangement task stands as a foundational challenge within Embodied AI, requiring an agent to construct a robust world model that accurately captures the goal state. The agent uses this world model to restore a shuffled scene to its original configuration, making an accurate representation of the world essential for successfully completing the task. In this work, we present a novel framework that leverages on 3D Gaussian Splatting as a 3D scene representation for experience goal visual rearrangement task. Recent advances in volumetric scene representation like 3D Gaussian Splatting, offer fast rendering of high quality and photo-realistic novel views. Our approach enables the agent to have consistent views of the current and the goal setting of the rearrangement task, which enables the agent to directly compare the goal state and the shuffled state of the world in image space. To compare these views, we propose to use a dense feature matching method with visual features extracted from a foundation model, leveraging its advantages of a more universal feature representation, which facilitates robustness, and generalization. We validate our approach on the AI2-THOR rearrangement challenge benchmark and demonstrate improvements over the current state-of-the-art methods.

Scene Change Detection

Dataset_id: FloorPlan429__test__49

Dataset_id: FloorPlan328__test__49

Dataset_id: FloorPlan324__val__38

Dataset_id: FloorPlan324__val__26

Dataset_id: FloorPlan225__val__33

Dataset_id: FloorPlan24__val__25

BibTeX

@misc{s2024splatrexperiencegoal,
      title={SplatR : Experience Goal Visual Rearrangement with 3D Gaussian Splatting and Dense Feature Matching}, 
      author={Arjun P S and Andrew Melnik and Gora Chand Nandi},
      year={2024},
      eprint={2411.14322},
      archivePrefix={arXiv},
      primaryClass={cs.RO},
      url={https://arxiv.org/abs/2411.14322}, 
      }