abstract:cfacd6bf5329b096.tex

1: \begin{abstract}

2: We provide methods for in-database support of decision making under uncertainty.

3: Many important decision problems correspond

4: to selecting a \emph{package} (bag of tuples in a relational database)

5: that jointly satisfy a set of constraints

6: while minimizing some overall cost function;

7: in most real-world problems, the data is uncertain.

8: We provide methods for specifying---via a \sql extension---and

9: processing \emph{stochastic package queries (\spq{s})}, in order to

10: solve optimization problems over uncertain data, right where the data resides.

11: Prior work in stochastic programming uses Monte Carlo

12: methods where the original stochastic optimization problem is

13: approximated by a large deterministic optimization problem

14: that incorporates many \emph{scenarios}, i.e.,

15: sample realizations of the uncertain data values.

16: For large database tables, however, a huge number of scenarios is required,

17: leading to poor performance and, often, failure of the solver software.

18: We therefore provide a novel \sss algorithm that,

19: instead of trying to solve a large deterministic problem,

20: seamlessly approximates it via a sequence of smaller problems

21: defined over carefully crafted \emph{summaries} of the scenarios

22: that accelerate convergence to a feasible and near-optimal solution.

23: Experimental results on our prototype system

24: show that \sss can be orders of magnitude faster than

25: prior methods at finding feasible and high-quality packages.

26: \end{abstract}

27: