Structure-Aware Indoor RGB-D SLAM via Manhattan-Constrained 2D Gaussian Splatting
Abstract
Accurate and layout-consistent reconstruction remains a key challenge in indoor simultaneous localization and mapping (SLAM) due to the prevalence of planar and axis-aligned structures. Traditional visual and RGB-D SLAM methods often suffer from incomplete geometry and weak structural reasoning, while NeRF-based SLAM improves fidelity but is computationally expensive and unsuitable for real-time use. 3D Gaussian splatting offers improved efficiency but lacks structural priors, often resulting in distortions in structured scenes. To address these issues, we propose a structure-aware SLAM framework based on 2D Gaussian splatting, which provides efficient, view-consistent mapping. We introduce a lightweight regularization scheme under the Manhattan-world assumption to align Gaussian orientations and positions with dominant axes, improving layout consistency and geometric fidelity. Extensive experiments on Replica and TUM-RGBD datasets demonstrate that our method consistently outperforms existing SLAM baselines in terms of geometric accuracy and edge preservation across multiple indoor scenes.