0c1cf95b5abedf0e.tex
1: \begin{abstract}
2: 
3: Single individual haplotyping is an NP-hard problem that emerges when
4: attempting to reconstruct an organism's inherited genetic variations using data typically 
5: generated by high-throughput DNA sequencing platforms. Genomes of diploid organisms, 
6: including humans, are organized into homologous pairs of chromosomes that differ from
7: each other in a relatively small number of variant positions. Haplotypes are ordered 
8: sequences of the nucleotides in the variant positions of the chromosomes in a homologous 
9: pair; for diploids, haplotypes associated with a pair of chromosomes may be 
10: conveniently represented by means of complementary binary sequences. In this paper, we 
11: consider a binary matrix factorization formulation of the single individual haplotyping problem 
12: and efficiently solve it by means of alternating minimization. We analyze the convergence 
13: properties of the alternating minimization algorithm and establish theoretical bounds for the
14: achievable haplotype reconstruction error. The proposed technique is shown to outperform 
15: existing methods when applied to synthetic as well as real-world Fosmid-based HapMap 
16: NA12878 datasets.
17: \end{abstract}
18: