\begin{abstract}

Single individual haplotyping is an NP-hard problem that emerges when
attempting to reconstruct an organism's inherited genetic variations using data typically
generated by high-throughput DNA sequencing platforms. Genomes of diploid organisms,
including humans, are organized into homologous pairs of chromosomes that differ from
each other in a relatively small number of variant positions. Haplotypes are ordered
sequences of the nucleotides in the variant positions of the chromosomes in a homologous
pair; for diploids, haplotypes associated with a pair of chromosomes may be
conveniently represented by means of complementary binary sequences. In this paper, we
consider a binary matrix factorization formulation of the single individual haplotyping problem
and efficiently solve it by means of alternating minimization. We analyze the convergence
properties of the alternating minimization algorithm and establish theoretical bounds for the
achievable haplotype reconstruction error. The proposed technique is shown to outperform
existing methods when applied to synthetic as well as real-world Fosmid-based HapMap
NA12878 datasets.
\end{abstract}
