abstract:f289fb67585e1263.tex

1: \begin{abstract}

2: Conformal prediction {(CP)} is a distribution-free framework for achieving probabilistic guarantees on black-box models. {CP} is generally applied to a model post-training. Recent research efforts, on the other hand, have focused on optimizing CP efficiency \emph{during training}. We formalize this concept as the problem of \emph{conformal risk minimization} (CRM). In this direction, conformal training (\texttt{ConfTr}) by~\cite{stutz2022learning} is a technique that seeks to minimize the expected prediction set size of a model by simulating {CP} in-between training updates. Despite its potential, we identify a strong source of sample inefficiency in \texttt{ConfTr} that leads to overly noisy estimated gradients, introducing training instability and limiting practical use. To address this challenge, we propose \emph{variance-reduced conformal training} \texttt{(VR-ConfTr)}, a CRM method that incorporates a variance reduction technique in the gradient estimation of the \texttt{ConfTr} objective function. Through extensive experiments\footnote{We release our code at \href{https://github.com/nooranisima/conformal-risk-minimization-w-variance-reduction-code}{https://github.com/nooranisima/conformal-risk-minimization-w-variance-reduction-code}} on various benchmark datasets, we demonstrate that \texttt{VR-ConfTr} consistently achieves faster convergence and smaller prediction sets compared to baselines.

3: \end{abstract}

4: