数据库的建立"/>
circRNA数据库的建立
- circRNA数据库的建立
wget .fa.gz
gunzip human_hg19_circRNAs_putative_spliced_sequence.fa.gz
wget .fa.gz
gunzip mouse_mm9_circRNAs_putative_spliced_sequence.fa.gz
wget .txt
wget .txt
perl extract_circBase_seq.pl mouse_mm9_circRNAs_putative_spliced_sequence.fa mmu_mm9_circRNA.txt mmu_mm9_circRNA.fa
- 这个脚本是为了从可能的circRNA序列里面提取已知的circRNA序列
- 下载坐标转换工具
wget .x86_64/liftOver
wget .over.chain.gz
gunzip hg19ToHg38.over.chain.gz
wget .over.chain.gz
gunzip mm9ToMm10.over.chain.gz
cut -f1,2,3 mmu_mm9_circRNA.txt >mmu_mm9_circRNA_pos.txt
les mmu_mm9_circRNA.txt |perl -F"\t" -lane 'print join("\t",@F[3..12])' >mmu_mm9_circRNA.info.txt
# trans mmu_mm9_circRNA_pos.txt to mmu_mm10_circRNA_pos.txt
#使用网页版的转换,由于只提供了位置信息,不知道属于那一条链,会有一些bug
paste mmu_mm10_circRNA_pos.txt mmu_mm9_circRNA.info.txt >mmu_mm10_circRNA.txt#use liftOver
les mmu_mm9_circRNA.txt|sed '1d'|awk '{print $1"\t"$2"\t"$3"\t"$5"\t"0"\t"$4"\t"$2"\t"$3"\t255,0,0"}' >mmu_mm9_circRNA.pre2mm10.id.txt
./liftOver mmu_mm9_circRNA.pre2mm10.id.txt mm9ToMm10.over.chain mmu_mm10_circRNA.bed unmap
perl -e '$cir=shift;$bed=shift;open IN,$cir;while(<IN>){chomp;if(/^#/){print"$_\n";}else{@a=(split/\t/,$_);$info=join("\t",@a[5..12]);$hash{$a[4]}=$info;}};close IN;open IN2,$bed;while(<IN2>){chomp;@b=(split/\t/,$_);if(exists $hash{$b[3]}){print qq{$b[0]\t$b[1]\t$b[2]\t$b[5]\t$b[3]\t$hash{$b[3]}\n}}};close IN2;' mmu_mm9_circRNA.txt mmu_mm10_circRNA.bed >mmu_mm10_circRNA2.txt
转载于:.html
更多推荐
circRNA数据库的建立
发布评论