abstract:8177d7450b594877.tex

1: \begin{abstract}

2: Mining large datasets and obtaining calibrated predictions from them is of immediate relevance and utility in reliable deep learning.  In our work, we develop methods for Deep Neural Network based inferences in such datasets like the gene expression.  However, unlike typical Deep learning methods, our inferential technique, while achieving state-of-the-art performance in terms of accuracy, can also provide explanations, and report uncertainty estimates. We adopt the quantile regression framework to predict full conditional quantiles for a given set of housekeeping gene expressions. In addition to being useful in providing rich interpretations of the predictions, conditional quantiles are also robust to measurement noise. Our technique is particularly consequential in High-throughput Genomics, an area that is ushering in a new era in personalized health care, and targeted drug design and delivery. However, check loss, used in quantile regression to drive the estimation process, is not differentiable. We propose $log-cosh$ as a smooth alternative to the check loss. We apply our methods to the GEO microarray dataset. We also extend the method to the binary classification setting. Furthermore, we investigate other consequences of the smoothness of the loss in faster convergence. We further apply the classification framework to other healthcare inference tasks, such as heart disease, breast cancer, diabetes, etc. As a test of the generalization ability of our framework, other non-healthcare related data sets for regression and classification tasks are also evaluated.

3: \end{abstract}

4: