CoIn Enhances 2D-3D Inpainting with Gaussian Splatting

Hana Kim, Minje Kim, Tae-Kyun Kim · June 29, 2026

Summary

CoIn is a novel framework for comprehensive 2D-3D scene inpainting that leverages Gaussian Splatting (GS) guidance through a multi-stage consistency pipeline. It integrates 2D diffusion models for initial inpainting and uses GS-based feature warping to ensure multi-view consistency, achieving state-of-the-art results for object removal and insertion.

3D inpainting is the task of reconstructing areas of a 3D scene that are corrupted by occlusions or limited viewpoints. Recent methods based on Gaussian Splatting (GS) make 3D editing efficient, but they are often limited to object removal and require precise segmentation masks. CoIn is a new framework that addresses these limitations by creating a bidirectional information flow between 2D inpainting models and 3D Gaussian Splatting (3DGS). It begins by using a diffusion model to generate initial inpainted images, which allows flexible mask shapes and tasks beyond removal, such as object insertion. The framework then reconstructs a coarse 3D scene using Reference Adaptive GS with Feature Attention, which prioritizes a reference 2D view. This 3D representation in turn guides the 2D diffusion process via GS-based Reference Feature Warping, keeping the inpainted content consistent across views. Finally, a Texture-Enhancing Discriminator refines the 3D scene for photometric realism. According to the authors' experiments, CoIn achieves state-of-the-art performance in both object removal and insertion.
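To make the warping step more concrete, here is a minimal sketch of depth-based feature warping between two views. It is not the authors' implementation: this summary does not give the details of GS-based Reference Feature Warping, so the sketch swaps in plain backward reprojection with nearest-neighbor sampling. It assumes pinhole cameras with shared intrinsics and a target-view depth map rendered from the coarse 3DGS scene. The function name `warp_reference_features` and all parameter names are hypothetical.

```python
import numpy as np

def warp_reference_features(ref_feat, tgt_depth, K, T_tgt_to_ref):
    """Backward-warp reference-view features into a target view (illustrative only).

    ref_feat:     (H, W, C) feature map of the reference view
    tgt_depth:    (H, W) target-view depth (assumed to be rendered from the coarse 3DGS)
    K:            (3, 3) pinhole intrinsics, assumed shared by both views
    T_tgt_to_ref: (4, 4) rigid transform from target-camera to reference-camera coordinates
    Returns (warped, valid): (H, W, C) warped features and an (H, W) boolean mask.
    """
    H, W, C = ref_feat.shape

    # Homogeneous pixel coordinates (u, v, 1) for every target pixel.
    v, u = np.meshgrid(np.arange(H), np.arange(W), indexing="ij")
    pix = np.stack([u, v, np.ones_like(u)], axis=-1).reshape(-1, 3).astype(np.float64)

    # Unproject to 3D points in the target camera frame: X = depth * K^-1 [u, v, 1].
    depth = tgt_depth.reshape(-1).astype(np.float64)
    pts_tgt = (pix @ np.linalg.inv(K).T) * depth[:, None]

    # Move the points into the reference camera frame.
    R, t = T_tgt_to_ref[:3, :3], T_tgt_to_ref[:3, 3]
    pts_ref = pts_tgt @ R.T + t

    # Keep only points with positive, finite depth in both views.
    z = pts_ref[:, 2]
    valid = np.isfinite(depth) & (depth > 0) & np.isfinite(z) & (z > 1e-6)
    z_safe = np.where(valid, z, 1.0)

    # Project into the reference image and round to the nearest pixel.
    proj = pts_ref @ K.T
    u_ref = np.where(valid, proj[:, 0] / z_safe, -1.0)
    v_ref = np.where(valid, proj[:, 1] / z_safe, -1.0)
    ui = np.rint(u_ref).astype(np.int64)
    vi = np.rint(v_ref).astype(np.int64)
    valid &= (ui >= 0) & (ui < W) & (vi >= 0) & (vi < H)

    # Gather reference features; pixels without a valid source stay zero.
    warped = np.zeros((H * W, C), dtype=ref_feat.dtype)
    warped[valid] = ref_feat[vi[valid], ui[valid]]
    return warped.reshape(H, W, C), valid.reshape(H, W)
```

With an identity transform the function returns the reference features unchanged. Target pixels that reproject outside the reference image are flagged in the mask, and in a pipeline like the one described those would be the regions left for the diffusion model to fill. The sketch does no occlusion test, so a practical version would also compare the reprojected depth against the reference view's own depth.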

Why it matters

For professionals in computer graphics, virtual reality, augmented reality, and 3D content creation, CoIn offers a flexible way to edit and reconstruct 3D scenes: it handles object insertion as well as removal, accepts flexible mask shapes, and keeps the edited result consistent across views.

How to implement this in your domain

  1. Explore CoIn's capabilities for 3D asset creation and scene editing in virtual production pipelines.
  2. Integrate the framework into existing 3D modeling software for advanced inpainting functionalities.
  3. Experiment with different diffusion models and mask inputs to achieve desired editing effects.
  4. Develop workflows for leveraging CoIn in architectural visualization or product design.
  5. Train artists and designers on using this advanced 2D-3D inpainting technique for creative projects.

Who benefits

Media & Entertainment, Gaming, Architecture, Product Design, E-commerce

Key takeaways

  • CoIn offers a comprehensive 2D-3D inpainting solution using Gaussian Splatting guidance.
  • It supports flexible mask inputs and diverse tasks like object insertion, not just removal.
  • A multi-stage consistency pipeline ensures high photometric realism and multi-view consistency.
  • The framework reports state-of-the-art performance in both object removal and object insertion.

Original post by Hana Kim, Minje Kim, Tae-Kyun Kim

"arXiv:2606.27584v1 Announce Type: cross Abstract: 3D scene inpainting is essential for reconstructing areas corrupted by occlusions or limited viewpoints. While recent methods leverage Gaussian Splatting (GS) for efficient 3D editing, they often depend on precise multi-view segme…"
