2023-07-11 crossMap不同版本基因組位點坐標(biāo)轉(zhuǎn)換

CrossMap

installation

pip3 install CrossMap

download chain files

A chain file describes a pairwise alignment between two reference assemblies. UCSC and Ensembl chain files are available:

UCSC chain files

Ensembl chain files

User Input file

CrossMap supports the following file formats.

  1. BAM, CRAM, or SAM

  2. BED or BED-like. (BED file must have at least ‘chrom’, ‘start’, ‘end’)

  3. Wiggle (“variableStep”, “fixedStep” and “bedGraph” formats are supported)

  4. BigWig

  5. GFF or GTF

  6. VCF

  7. GVCF

  8. MAF

usage

CrossMap.py bed hg18ToHg19.over.chain.gz test.hg18.bed3

$ CrossMap.py -h

usage: CrossMap.py [-h] [-v] {bed,bam,gff,wig,bigwig,vcf,gvcf,maf,region,viewchain} ...

CrossMap (v0.6.0) is a program to convert (liftover) genome coordinates between different reference
assemblies (e.g., from human GRCh37/hg19 to GRCh38/hg38 or vice versa). Supported file formats: BAM,
BED, BigWig, CRAM, GFF, GTF, GVCF, MAF (mutation annotation format), SAM, Wiggle, and VCF.

positional arguments:
  {bed,bam,gff,wig,bigwig,vcf,gvcf,maf,region,viewchain}
                        sub-command help
    bed                 converts BED, bedGraph or other BED-like files. Only genome coordinates
                        (i.e., the first 3 columns) will be updated. Regions mapped to multiple
                        locations to the new assembly will be split. Use the "region" command to
                        liftover large genomic regions. Use the "wig" command if you need
                        bedGraph/bigWig output.
    bam                 converts BAM, CRAM, or SAM format file. Genome coordinates, header section,
                        all SAM flags, insert size will be updated.
    gff                 converts GFF or GTF format file. Genome coordinates will be updated.
    wig                 converts Wiggle or bedGraph format file. Genome coordinates will be updated.
    bigwig              converts BigWig file. Genome coordinates will be updated.
    vcf                 converts VCF file. Genome coordinates, header section, reference alleles will
                        be updated.
    gvcf                converts GVCF file. Genome coordinates, header section, reference alleles
                        will be updated.
    maf                 converts MAF (mutation annotation format) file. Genome coordinates and
                        reference alleles will be updated.
    region              converts big genomic regions (in BED format) such as CNV blocks. Genome
                        coordinates will be updated.
    viewchain           prints out the content of a chain file into a human readable, block-to-block
                        format.

optional arguments:
  -h, --help            show this help message and exit
  -v, --version         show program's version number and exit

https://crossmap.readthedocs.io/en/latest/
?著作權(quán)歸作者所有,轉(zhuǎn)載或內(nèi)容合作請聯(lián)系作者
【社區(qū)內(nèi)容提示】社區(qū)部分內(nèi)容疑似由AI輔助生成,瀏覽時請結(jié)合常識與多方信息審慎甄別。
平臺聲明:文章內(nèi)容(如有圖片或視頻亦包括在內(nèi))由作者上傳并發(fā)布,文章內(nèi)容僅代表作者本人觀點,簡書系信息發(fā)布平臺,僅提供信息存儲服務(wù)。

相關(guān)閱讀更多精彩內(nèi)容

友情鏈接更多精彩內(nèi)容