获得重复区域repeat(SINE\LINE)等区域的bed文件
(2014-11-28 13:29:07)分类: 生物信息学 |
- Short interspersed nuclear elements (SINE), which include ALUs
- Long interspersed nuclear elements (LINE)
- Long terminal repeat elements (LTR), which include retroposons
- DNA repeat elements (DNA)
- Simple repeats (micro-satellites)
- Low complexity repeats
- Satellite repeats
- RNA repeats (including RNA, tRNA, rRNA, snRNA, scRNA, srpRNA)
- Other repeats, which includes class RC (Rolling Circle)
- Unknown
grep LINE rmsk.txt | awk -F"\t" '{print $6,$7,$8,$10}' |sed 's/ /\t/g'>LINE.bed
grep Simple_repeat rmsk.txt |awk
grep SINE rmsk.txt |awk
grep LTR rmsk.txt |awk
grep DNA rmsk.txt |awk
grep Satellite rmsk.txt |awk
grep Low_complexity rmsk.txt |awk