1: \begin{abstract}
2: The rising availability of commercial \360 cameras that democratize indoor scanning, has increased the interest for novel applications, such as interior space re-design.
3: Diminished Reality (DR) fulfills the requirement of such applications, to remove existing objects in the scene, essentially translating this to a counterfactual inpainting task.
4: While recent advances in data-driven inpainting have shown significant progress in generating realistic samples, they are not constrained to produce results with reality mapped structures.
5: To preserve the `reality' in indoor (re-)planning applications, the scene's structure preservation is crucial.
6: To ensure structure-aware counterfactual inpainting, we propose a model that initially predicts the structure of a indoor scene and then uses it to guide the reconstruction of an empty -- background only -- representation of the same scene.
7: We train and compare against other state-of-the-art methods on a version of the Structured3D dataset \cite{Structured3D} modified for DR, showing superior results in both quantitative metrics and qualitative results, but more interestingly, our approach exhibits a much faster convergence rate.
8: Code and models are available at \href{https://vcl3d.github.io/PanoDR/}{vcl3d.github.io/PanoDR/}.
9: \end{abstract}
10: