44c46ec57b4d45a1.tex
1: \begin{abstract}
2: How can we explore the unknown properties of high-dimensional sensitive relational data while preserving privacy?
3: We study how to construct an explorable privacy-preserving materialized view under differential privacy.
4: No existing state-of-the-art methods simultaneously satisfy the following essential properties in data exploration: workload independence, analytical reliability (i.e., providing error bound for each search query), applicability to high-dimensional data, and space efficiency.
5: To solve the above issues, we propose \method, which creates a differentially private materialized view by well-designed recursive bisected partitioning on an original data cube, i.e., count tensor.
6: Our method searches for block partitioning to minimize the error for the counting query, in addition to randomizing the convergence, by choosing the effective cutting points in a differentially private way, resulting in a less noisy and compact view.
7: Furthermore, we ensure formal privacy guarantee and analytical reliability by providing the error bound for arbitrary counting queries on the materialized views.
8: \method has the following desirable properties:
9: (a) \textit{Workload independence},
10: (b) \textit{Analytical reliability},
11: (c) \textit{Noise resistance on high-dimensional data},
12: (d) \textit{Space efficiency}.
13: To demonstrate the above properties and the suitability for data exploration, we conduct extensive experiments with eight types of range counting queries on eight real datasets.
14: \method outperforms the state-of-the-art methods in these evaluations.
15: % Specifically, in our experiments, the data size of the materialized view is $5\times10^{3}$ times smaller on average than the state-of-the-art data-aware segmentation method while improving the utility.
16: % Our proposed method helps us to explore relational data while preserving data privacy and utility.
17: \end{abstract}
18: