Estimating Genome Size with GenomeScope
(2017-10-16 13:12:58) Category: Bioinformatics
Reference: Vurture GW, Sedlazeck FJ, Nattestad M, Underwood CJ, Fang H, Gurtowski J, Schatz MC. GenomeScope: fast reference-free genome profiling from short reads. Bioinformatics. 2017 Jul 15;33(14):2202-2204. doi: 10.1093/bioinformatics/btx153. PubMed PMID: 28369201.
Website: https://github.com/schatzlab/genomescope
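1: First, run quality control on the second-generation (short-read) sequencing data; trimmomatic is recommended.
For parameter settings, see: http://cbsu.tc.cornell.edu/lab/doc/assembly_workshop_20150420_lecture2.pdf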
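A minimal sketch of a paired-end trimmomatic run is given here for orientation. The trimming steps follow the generic example in the Trimmomatic manual, not the settings from the slides linked above. The function name `trim_pe`, the jar path, the adapter file, and the `_R1`/`_R2` file naming are all assumptions to be replaced with your own.

```shell
# Hypothetical paths; override them through the environment or edit them here.
TRIMMOMATIC_JAR="${TRIMMOMATIC_JAR:-/PATH/TO/trimmomatic.jar}"
ADAPTERS="${ADAPTERS:-/PATH/TO/adapters/TruSeq3-PE.fa}"

# trim_pe SAMPLE
# Assumes the inputs are named SAMPLE_R1.fastq.gz and SAMPLE_R2.fastq.gz.
# Output names ending in .gz are written gzip-compressed by trimmomatic.
trim_pe() {
    sample="$1"
    java -jar "$TRIMMOMATIC_JAR" PE -threads 10 -phred33 \
        "${sample}_R1.fastq.gz" "${sample}_R2.fastq.gz" \
        "${sample}_R1.paired.fastq.gz" "${sample}_R1.unpaired.fastq.gz" \
        "${sample}_R2.paired.fastq.gz" "${sample}_R2.unpaired.fastq.gz" \
        "ILLUMINACLIP:${ADAPTERS}:2:30:10" \
        LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:36
}
```

Calling `trim_pe sample` would then produce the paired and unpaired output files for that sample; only the paired files are normally carried forward to k-mer counting.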
2: For k-mer counting, jellyfish is recommended rather than gce. The read files from the step above are best kept as compressed fq files.
jellyfish count -C -m 21 -s 1000000000 -t 10 *.fastq -o reads.jf
jellyfish histo -t 10 reads.jf > reads.histo
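Jellyfish reads plain FASTA/FASTQ, so gzip-compressed reads need to be decompressed on the fly. The sketch below streams them to `jellyfish count` on standard input through `/dev/fd/0`, with the same counting options as above. The function name `count_kmers_gz` is hypothetical, and it assumes a system where `/dev/fd/0` is available (Linux, macOS).

```shell
# count_kmers_gz OUTPUT.jf FILE.fastq.gz [FILE.fastq.gz ...]
# Decompresses every input file to one stream and counts canonical 21-mers.
count_kmers_gz() {
    jf_out="$1"
    shift
    gzip -dc "$@" | jellyfish count -C -m 21 -s 1000000000 -t 10 -o "$jf_out" /dev/fd/0
}
```

For example, `count_kmers_gz reads.jf *.fastq.gz` would write `reads.jf` without leaving uncompressed copies of the reads on disk; the `jellyfish histo` step is unchanged.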
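/PATH/TO/Rscript /PATH/TO/genomescope.R reads.histo 21 150 output
The arguments are, in order: the histogram file, the k-mer length (21), the read length (150), and the output directory.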
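As a rough cross-check on the GenomeScope result, genome size can be approximated directly from the histogram as the total number of k-mers divided by the depth of the main coverage peak. The sketch below is not what GenomeScope does (GenomeScope fits a statistical model to the whole k-mer profile). The function name `estimate_genome_size` is hypothetical, and the valley-finding heuristic assumes a clean histogram with a single dominant peak; for a highly heterozygous or repeat-rich genome the number can be well off.

```shell
# estimate_genome_size HISTO_FILE
# HISTO_FILE is the two-column output of `jellyfish histo`:
# column 1 = k-mer multiplicity, column 2 = number of distinct k-mers.
estimate_genome_size() {
    awk '
        { mult[NR] = $1; cnt[NR] = $2 }
        END {
            # Walk down the low-multiplicity error slope to the first local minimum.
            valley = 1
            while (valley < NR && cnt[valley + 1] < cnt[valley]) valley++
            # Take the tallest bin from the valley onward as the coverage peak.
            peak = valley
            for (i = valley; i <= NR; i++) if (cnt[i] > cnt[peak]) peak = i
            # Sum all k-mer occurrences from the valley onward.
            total = 0
            for (i = valley; i <= NR; i++) total += mult[i] * cnt[i]
            printf "peak_depth=%d genome_size=%.0f\n", mult[peak], total / mult[peak]
        }
    ' "$1"
}
```

Running `estimate_genome_size reads.histo` prints the detected peak depth and the approximate genome size in bases; a large disagreement with the GenomeScope summary is a hint to inspect the k-mer profile plot by eye.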
3: For other software, see: https://github.com/josephryan/estimate_genome_size.pl