SeqHBase: a big data toolset for family based sequencing data analysis
Min He Thomas N Person Scott J Hebbring Ethan Heinzen Zhan Ye,
Steven J Schrodi Elizabeth W McPherson Simon M Lin Peggy L Peissig,
Murray H Brilliant Jason O ’ Rawe Reid J Robison Gholson J Lyon,
Kai Wang
Methods Ha doop is a framework for reliable, scalable,distributed processing of large data set s using MapReduce programming mode ls. Based on Hadoop and HBase, we developed SeqHBase, a big data-based toolset for analysing family based sequenci ng data to detect denovo, inherited homozyg ous, or compo und heterozy gous mutations that may contribut e to disease manifestations.
SeqHBase: a big data toolset for family based sequencing data analysis