fastxtoolkit的安装与使用
(2014-09-13 18:01:03)
标签:
fastxtoolkit |
分类: biology |
下载:http://hannonlab.cshl.edu/fastx_toolkit/
一。主要内容与安装如下:
- FASTQ-to-FASTA converter
Convert FASTQ files to FASTA files.(fastq转换为fasta) - FASTQ Information
Chart Quality Statistics and Nucleotide Distribution(fastq信息即测序中reads质量和核酸分布) - FASTQ/A Collapser
Collapsing identical sequences in a FASTQ/A file into a single sequence (while maintaining reads counts)(合并FASTQ/A文件中相同的reads为一个reads) - FASTQ/A Trimmer
Shortening reads in a FASTQ or FASTQ files (removing barcodes or noise).(去除fasta、fastq文件中barcodes或noise,具体参考下文) - FASTQ/A Renamer
Renames the sequence identifiers in FASTQ/A file.(重新命名FASTQ/A中序列的名字) - FASTQ/A Clipper
Removing sequencing adapters / linkers(去除FASTQ/A文件中的接头序列或引物) - FASTQ/A Reverse-Complement
Producing the Reverse-complement of each sequence in a FASTQ/FASTA file.(在FASTQ/A中产生与每个reads反向互补的序列) - FASTQ/A Barcode splitter
Splitting a FASTQ/FASTA files containning multiple samples(删除包含多个样本的FASTQ/A文件中barcode) - FASTA Formatter
changes the width of sequences line in a FASTA file - FASTA Nucleotide Changer
Convets FASTA sequences from/to RNA/DNA - FASTQ Quality Filter
Filters sequences based on quality(去除文件中低碱基质量分数的reads) - FASTQ Quality Trimmer
Trims (cuts) sequences based on quality(去除文件reads中低质量碱基) - FASTQ Masker
Masks nucleotides with 'N' (or other character) based on quality
它在linux下安装需要安装库,具体例子:可以直接网站下载,也可以如下下载:
## Download pre-compiled binaries, put them in /usr/local/bin
## $ mkdir fastx_bin
# cd fastx_bin
$ wget http://hannonlab.cshl.edu/fastx_toolkit/fastx_toolkit_0.0.13_binaries_Linux_2.6_amd64.tar.bz2
$ tar -xjf fastx_toolkit_0.0.13_binaries_Linux_2.6_amd64.tar.bz2
$ sudo cp ./bin/* /usr/local/bin
解压后:
无论是编译还是make或安装make install都要注意查看是否有error)
接下来,安装fastxtoolkit,一定要先设置pkg-config(它的含义参考其他博文)
小技巧:
编译时,往往不知道是否成功编译通过,而编译没有通过就去make install
必然会出错,增加了解决问题的复杂性,可以通过下面方法检查是否编译成功:
一:编译完成后,输入echo $? 如果返回结果为0,则表示正常结束,否则就出错了 :(
echo $? 表示 检查上一条命令的退出状态,程序正常退出 返回0,错误退出返回非0。
二:编译时,可以用&&连接命令, && 表示"当前一条命令正常结束,后面的命令才会执行",就是"与"啦。
这个办法很好,即节省时间,又可防止出错。例:
./configure --prefix=/usr && make && make install
一定要先安装libgtextutils,否者会出现以下错误
checking for GTEXTUTILS... configure: error: Package requirements (gtextutils) were not met:
No package 'gtextutils' found
二。fastaxtoolkit使用
1.
2.
3.
4.
5.
若用-r和-n,则
@SRR1174233.392 HWI-1116:76:D0LVDACXX:3:1101:14764:2007
length=100
NCTACTGAGTTTCCACACTGCATA
+SRR1174233.392 HWI-1116:76:D0LVDACXX:3:1101:14764:2007
length=100
#1=DFFFFHHHHHJIJJJJJJIJJJIJ
@SRR1174233.404 HWI-1116:76:D0LVDACXX:3:1101:15379:2013
length=100
NTCCAGACATAGTAAGGATTTGTC
+SRR1174233.404 HWI-1116:76:D0LVDACXX:3:1101:15379:2013
length=100
#4=DDFFFHHHHHJIJJIJIJJIGJJJ
结果是:
>1
NCTACTGAGTTTCCACACTGCATA
>2
NTCCAGACATAGTAAGGATTTGTC
若只用-n,则结果是:
>SRR1174233.392 HWI-1116:76:D0LVDACXX:3:1101:14764:2007
length=100
NCTACTGAGTTTCCACACTGCATA
>SRR1174233.404 HWI-1116:76:D0LVDACXX:3:1101:15379:2013
length=100
NTCCAGACATAGTAAGGATTTGTC
6.
7.
老格式输出文件:下面一行代表输出文件的一列