abstract:bb36b53faa967c52.tex

1: \begin{abstract}

2: Recent research has highlighted improvements in high-quality imaging guided by event cameras, with most of these efforts concentrating on the RGB domain.

3: However, these advancements frequently neglect the unique challenges introduced by the inherent flaws in the sensor design of event cameras in the RAW domain.

4: Specifically, this sensor design results in the partial loss of pixel values, posing new challenges for RAW domain processes like demosaicing.

5: The challenge intensifies as most research in the RAW domain is based on the premise that each pixel contains a value, making the straightforward adaptation of these methods to event camera demosaicing problematic.

6: To end this, we present a Swin-Transformer-based backbone and a pixel-focus loss function for demosaicing with missing pixel values in RAW domain processing.

7: Our core motivation is to refine a general and widely applicable foundational model from the RGB domain for RAW domain processing, thereby broadening the model's applicability within the entire imaging process.

8: Our method harnesses multi-scale processing and space-to-depth techniques to ensure efficiency and reduce computing complexity.

9: We also proposed the Pixel-focus Loss function for network fine-tuning to improve network convergence based on our discovery of a long-tailed distribution in training loss.

10: Our method has undergone validation on the MIPI Demosaic Challenge dataset, with subsequent analytical experimentation confirming its efficacy.

11: All code and trained models are released here: \href{https://github.com/yunfanLu/ev-demosaic}{https://github.com/yunfanLu/ev-demosaic.}

12: \end{abstract}

13: