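Download the genome annotation file for Synechococcus elongatus UTEX 2973 (accession no. GCA_000817325.1) and generate a table mapping each locus_tag to its Name. Requirements: use only a single-line shell command, and the resulting table must be tab-delimited. Write the shell command and the number of genes in the answer.
Open the NCBI website: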
https://www.ncbi.nlm.nih.gov/


Navigate to the directory genomes/all/GCA/000/817/325



wget https://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/000/817/325/GCA_000817325.1_ASM81732v1/GCA_000817325.1_ASM81732v1_genomic.gff.gz  # download the annotation file using the copied link


gunzip GCA_000817325.1_ASM81732v1_genomic.gff.gz

grep $'RefSeq\t' GCA_000817325.1_ASM81732v1_genomic.gff> test.tsv

less test.tsv

Result of viewing the file:

grep $'RefSeq\t' GCA_000817325.1_ASM81732v1_genomic.gff | less  # view the grep output: no matches, so this attempt failed
less GCA_000817325.1_ASM81732v1_genomic.gff  # view the file contents
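Because the RefSeq pattern matched nothing, it can help to list which values actually occur in the source column (column 2) before choosing a new grep pattern. The following is a minimal sketch; the function name `list_sources` is hypothetical and not part of the original notes.

```shell
# list_sources FILE
# Hypothetical helper: count how often each value appears in column 2
# (the "source" column) of a GFF file, most frequent first.
list_sources() {
  awk -F'\t' '
    # Skip comment/directive lines and lines without tab-separated fields.
    !/^#/ && NF > 1 { n[$2]++ }
    END { for (s in n) printf "%d\t%s\n", n[s], s }
  ' "$1" | sort -rn
}

# Usage (file name taken from the steps above):
#   list_sources GCA_000817325.1_ASM81732v1_genomic.gff
```

Each output line is a count, a tab, and the source value, which shows directly which strings are available to match on.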

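grep $'\tProtein' GCA_000817325.1_ASM81732v1_genomic.gff | head  # preview rows that contain a tab-delimited field starting with "Protein"

grep $'\tProtein' GCA_000817325.1_ASM81732v1_genomic.gff | sed 's/^.*;Name=//g' | sed 's/;.*;locus_tag=/\t/g' | sed 's/;.*$//g' | head  # keep only Name and locus_tag, tab-separated (\t in the sed replacement requires GNU sed)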
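The sed chain above depends on Name appearing before locus_tag in the attributes column and prints Name first. As an alternative sketch, the attributes can be split on `;` with awk so that the order does not matter and locus_tag is printed first. The function name `locus_name_table` and the output file name are hypothetical, and filtering on feature type `gene` by default is an assumption; pass another type such as `CDS` as the second argument if that is the row type of interest.

```shell
# locus_name_table FILE [TYPE]
# Hypothetical helper: print "locus_tag<TAB>Name" for every row whose
# feature type (column 3) equals TYPE (assumed default: gene).
locus_name_table() {
  awk -F'\t' -v OFS='\t' -v type="${2:-gene}" '
    !/^#/ && $3 == type {
      tag = ""; name = ""
      n = split($9, attr, ";")          # column 9 holds key=value pairs
      for (i = 1; i <= n; i++) {
        if (attr[i] ~ /^locus_tag=/) tag = substr(attr[i], 11)
        else if (attr[i] ~ /^Name=/) name = substr(attr[i], 6)
      }
      if (tag != "") print tag, name    # skip rows without a locus_tag
    }
  ' "$1"
}

# Usage: write the table, then count its rows.
#   locus_name_table GCA_000817325.1_ASM81732v1_genomic.gff > locus_tag_name.tsv
#   wc -l < locus_tag_name.tsv
```

The body of the function is a single awk command, so it can also be pasted as one line if the exercise requires that form.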
