\begin{abstract}
We provide methods for in-database support of decision making under uncertainty.
Many important decision problems correspond
to selecting a \emph{package} (bag of tuples in a relational database)
that jointly satisfy a set of constraints
while minimizing some overall cost function;
in most real-world problems, the data is uncertain.
We provide methods for specifying---via a \sql extension---and
processing \emph{stochastic package queries (\spq{s})}, in order to
solve optimization problems over uncertain data, right where the data resides.
Prior work in stochastic programming uses Monte Carlo
methods where the original stochastic optimization problem is
approximated by a large deterministic optimization problem
that incorporates many \emph{scenarios}, i.e.,
sample realizations of the uncertain data values.
For large database tables, however, a huge number of scenarios is required,
leading to poor performance and, often, failure of the solver software.
We therefore provide a novel \sss algorithm that,
instead of trying to solve a large deterministic problem,
seamlessly approximates it via a sequence of smaller problems
defined over carefully crafted \emph{summaries} of the scenarios
that accelerate convergence to a feasible and near-optimal solution.
Experimental results on our prototype system
show that \sss can be orders of magnitude faster than
prior methods at finding feasible and high-quality packages.
\end{abstract}
