FIXING GENOMIC DATA DECOMPRESSION ERRORS
The problem of genomic data compression and decompression is considered. Unlike current approaches, which depend on a reference sequence or template, a new hash-based methodology that does not depend on a reference sequence is proposed. By applying the hashing formula proposed by Reneker and Shyu, a compression gain of 9 bytes per zipped read is possible. However, the main emphasis of this paper is on correcting the errors in genomic data decompression that occur when the Reneker-Shyu formula is applied.