VarScan使用示例,如下:
samtools mpileup -q35 -d8000 -f GBS_A909.fa M008_alignmen_F4.sort.bam >M008_alignmen_F4.sort.mpileup
java -jar VarScan.v2.4.0.jar mpileup2indel M008_alignmen_F4.sort.mpileup --min-var-freq 0 --p-value 0.99 --min-avg-qual 1 --min-coverage 1 --output-vcf 1 > M008_alignmen_F4.sort.mpileup.indel.txt
 
得到的文件M008_alignmen_F4.sort.mpileup.indel.txt的格式如下所示:

Tab-delimited SNP calls with the following columns:

Chrom           chromosome name

Position   position (1-based)

Ref                reference allele at this position

Var                variant allele observed

PoolCall  Cross-sample call using all data (Cons:Cov:Reads1:Reads2:Freq:P-value)

Cons - consensus genotype in IUPAC format

Cov - total depth of coverage

Reads1 - number of reads supporting reference

Reads2 - number of reads supporting variant

Freq - the variant allele frequency by read count

P-value - FET p-value of observed reads vs expected non-variant

StrandFilt Information to look for strand bias using all reads, format R1+:R1-:R2+:R2-:pval

R1+ = reference supporting reads on forward strand

R1- = reference supporting reads on reverse strand

R2+ = variant supporting reads on forward strand

R2- = variant supporting reads on reverse strand

pval = FET p-value for strand distribution, R1 versus R2

SamplesRef    Number of samples called reference (wildtype)

SamplesHet    Number of samples called heterozygous-variant

SamplesHom  Number of samples called homozygous-variant

SamplesNC     Number of samples not covered / not called

SampleCalls    The calls for each sample in the mpileup, space-delimited

Each sample has six values separated by colons:

Cons - consensus genotype in IUPAC format

Cov - total depth of coverage

Reads1 - number of reads supporting reference

Reads2 - number of reads supporting variant

Freq - the variant allele frequency by read count

P-value - FET p-value of observed reads vs expected non-variant

以下是对varscan2的使用说明:

Commands

Three VarScan subcommands will invoke the germline variant calling model. These work with single-sample and multi-sample mpileup input:

mpileup2snp - calls single nucleotide polymorphisms (SNPs)
mpileup2indel - calls insertions and deletions (indels)
mpileup2cns - calls a consensus genotype (reference, SNP, or indel)

The first two (mpileup2snp and mpileup2indel) report *only* positions at which a variant of the given type (SNP and indel) was called. The third command (mpileup2cns) reports all positions that met the miniumum coverage, or (with the -v parameter), all positions at which a SNP or an indel was called. Use the --output-vcf 1 argument to get VCF 4.1 output. (能够得到vcf格式的文件)

The following commands still work, but only with single-sample pileup and they do NOT include full VCF output support.

pileup2snp - calls single nucleotide polymorphisms (SNPs)
pileup2indel - calls insertions and deletions (indels)
pileup2cns - calls a consensus genotype (reference, SNP, or indel)

The first two (pileup2snp and pileup2indel) report *only* positions at which a variant of the given type (SNP and indel) was called. The third command (pileup2cns) reports all positions that met the miniumum coverage, or (with the -v parameter), all positions at which a SNP or an indel was called.

转载于:https://www.cnblogs.com/xiaofeiIDO/p/6857130.html

变异检测VarScan软件使用说明相关推荐

  1. 突变检测软件 测试数据库,合作文章|变异检测软件技能大PK,谁才是Battle King?...

    DNA变异是个体间遗传变异的重要来源之一.第二代测序技术(NGS)和第三代测序技术(TGS)都在遗传变异研究中大放异彩.许多变异检测工具可以用来解析二代或三代数据,但是目前没有软件能兼顾灵敏性和特异性 ...

  2. [原]基因组变异检测概述

    考虑到cnblog不适合基因组领域这种类型的文章,进过多番折腾,终于用jekyll+github搭了个独立博客www.huangshujia.me,现在博客已经搬迁! 首先,在开始之前我觉得有必要稍微 ...

  3. 基因组变异检测概述(SNP、InDel、SV)

    首先,在开始之前我觉得有必要稍微科普缓冲一下,以便不使得不熟悉生物信息或基因组的客官们疑惑.O(∩_∩)O! 1.基因组:每个人都有一个基因组,这里的"基因组"并不只是" ...

  4. 推荐阅读:变异检测到底应该用什么软件?

    原文见:Validating generalized incremental joint variant calling with GATK HaplotypeCaller, FreeBayes, P ...

  5. 突变检测软件 测试数据库,测序数据比对和变异检测

    全基因组重测序是对已知基因组序列的物种进行不同个体的基因组测序,并在此基础上对个体或群体进行差异性分析.所以首先我们需要某个物种的全基因组序列和该物种的某个个体的基因组序列. 重测序数据分析流程 重测 ...

  6. 在超算上用sentieon快速完成NGS的变异检测

    sentieon 很久之前就听说sentieon在跑calling variants的速度非常快,能甩GATK 不知道多少条街,但是一直缺少一次机会去进行测试.这里感谢sentieon软件公司的张春风 ...

  7. 基因组大数据变异检测算法的并行优化

    基因组大数据变异检测算法的并行优化 基因组大数据变异检测算法的并行优化 崔英博1, 黄春1, 唐滔1, 杨灿群1, 廖湘科1, 彭绍亮2,3 1 国防科技大学计算机学院,湖南 长沙 410073 2 ...

  8. WGS完整流程介绍(原始数据质控、数据预处理、变异检测、数据注释)

    一.原始数据质控 1.原始测序数据(也是reads)       从测序仪中直接取下来的数据,它包括了所有的碱基,无论是测序质量低的,还有可能包含测错的,可能还会包含实验误差. 2.数据质控      ...

  9. biostar handbook(十)|如何进行变异检测

    变异检测流程 什么是基因组变异 基因组变异是一个定义比较模糊的概念. 所谓的变异是相对于一个完美的"参考基因组"而言.但是其实完美的"参考基因组"并不存在,因为 ...

  10. 基因-高通量测序-变异检测

    基因变异检测 ​ 本周的任务是弄清楚基因检测是什么,本来以为挺简单的,后来发现自己基本的染色体都忘干净了. ​ 基因变异检测看名字就知道是针对基因的,但是基因是什么呢?感觉基因像是一种抽象的概念,那么 ...

最新文章

  1. GEO数据挖掘(2)-GEO数据库
  2. react部署之页面空白
  3. linux——管理系统设备之磁盘的加密、加密磁盘的挂载及磁盘阵列、配额
  4. 从严治码-别人在项目中下毒,我该怎么治?
  5. 集成CDI和WebSockets
  6. Pat乙级1011题:A+B和C
  7. python剑指offer面试题_剑指offer面试题Q10 斐波那契数列 python解法
  8. 一道Python面试题,设置一个动态变量名
  9. COJS 1752. [BOI2007]摩基亚Mokia
  10. SAP License:一句话让你明白FICO
  11. 常用的前端框架有哪些?
  12. 感应加热计算机仿真软件,一种新型感应加热电源调功方式的研究与计算机仿真...
  13. 虎牙直播怎么换html5,虎牙直播怎么换徽章-虎牙直播更换粉丝徽章的方法 - 河东软件园...
  14. linux raid5 性能,RAID级别及各个级别特性
  15. Splunk Filed Alias 字段改名
  16. python爬虫学习
  17. 成都盛迈坤电商:评价怎么有效回复
  18. MatLab 画图方法
  19. 多旋翼无人机组成(小白上路)
  20. openstack restful api 使用

热门文章

  1. 2.7 HDFS的使用
  2. 回归的正则化模型:岭回归和Lasso回归(套索回归)
  3. windowsf进程管理器中进程解释
  4. Matlab括号使用详解
  5. 压力测试网站_JMeter简单压力测试教程
  6. hbase中为何不能向表中插入数据_Python自带的数据库,用起来真方便!
  7. 力扣-451 根据字符出现频率排序
  8. 【安装包】Dev-cpp
  9. 【安装包】VC++6.0
  10. 铁大Facebook——十天冲刺(4)