[关闭]
@Rumia 2017-03-27T07:44:37.000000Z 字数 590 阅读 873

Comp Schemes 记录

compression



metadata (fq)

  1. incremental coding (aka. front compression, which performs better with sorted data)

quality (fq)

  1. run-length-limited coding
  2. ...

reference-based

  1. reference不压缩,直接根据原来的文件进行reference-based diff & compression (压缩一次)

  2. 先把reference用fqzcomp等压缩,再把其他fastq文件也用同一种压缩程序压缩,然后再根据这些压缩后的文件进行reference-based diff & compression (压缩两次)

  3. 在1的基础上对除了reference之外的被压缩过一次的文件进行再次压 (重新找一个reference进行reference-based diff & compression)

suffix tree

others

  1. HDF5
  2. Burrows–Wheeler transform (BWT)
  3. de bruijn graph

websites

  1. Sequence Squeeze
  2. Data Compression Conference
添加新批注
在作者公开此批注前,只有你和作者可见。
回复批注