1. art:
art_illumina -ss HS20 -i GRCH38BWAindex/GRCH38chr1L3556522.fna -p -l 100 -m 200 -s 10  -c 10000000 -o g38L100c10000000Nhs20Paired

2.fastq上传到hdfs

spark-submit  --class cs.ucla.edu.bwaspark.BWAMEMSpark --master spark://219.219.220.149:7077  /home/hadoop/xubo/tools/cloud-scale-bwamem-0.2.1/target/cloud-scale-bwamem-0.2.0-assembly.jar upload-fastq   1 21 g38L100c10000000Nhs20Paired1.fq g38L100c10000000Nhs20Paired2.fq /xubo/data/alignment/cs-bwamem/fastq/g38L100c10000000Nhs20Paired12.fastq

3.配置:
需要将fasta dispatch到各个节点,每个节点需要有cs-bwamem的jar包,jni搞全局路径

4.align:

spark-submit --executor-memory 4g --class cs.ucla.edu.bwaspark.BWAMEMSpark --total-executor-cores 20 --master spark://219.219.220.149:7077  --conf spark.driver.host=219.219.220.149 --conf spark.driver.cores=4 --conf spark.driver.maxResultSize=4g --conf spark.storage.memoryFraction=0.7  --conf spark.akka.threads=2 --conf spark.akka.frameSize=1024 /home/hadoop/xubo/tools/cloud-scale-bwamem-0.2.1/target/cloud-scale-bwamem-0.2.0-assembly.jar cs-bwamem -bfn 1 -bPSW 1 -sbatch 10 -bPSWJNI 1  -oChoice 2 -oPath hdfs://219.219.220.149:9000/xubo/data/alignment/cs-bwamem/fastq/g38L100c10000000Nhs20Paired12F2.adam -localRef 1 -jniPath /home/hadoop/xubo/tools/cloud-scale-bwamem-0.2.1/target/jniNative.so -isSWExtBatched 1  1 /home/hadoop/cloud/adam/xubo/data/GRCH38Sub/cs-bwamem/GRCH38BWAindex/GRCH38chr1L3556522.fasta  /xubo/data/alignment/cs-bwamem/fastq/g38L100c10000000Nhs20Paired12.fastq

5.merge:

spark-submit --executor-memory 6g --class cs.ucla.edu.bwaspark.BWAMEMSpark --total-executor-cores 20 --master spark://219.219.220.149:7077  --conf spark.driver.host=219.219.220.149 --conf spark.driver.cores=4 --conf spark.driver.maxResultSize=6g --conf spark.storage.memoryFraction=0.7  --conf spark.akka.threads=2 --conf spark.akka.frameSize=1024 /home/hadoop/xubo/tools/cloud-scale-bwamem-0.2.1/target/cloud-scale-bwamem-0.2.0-assembly.jar merge hdfs://219.219.220.149:9000 /xubo/data/alignment/cs-bwamem/fastq/g38L100c10000000Nhs20Paired12F2.adam /xubo/data/alignment/cs-bwamem/fastq/g38L100c10000000Nhs20Paired12F2.merge.adam

6.统计:

package org.bdgenomics.avocado.cliimport org.apache.spark.sql.SQLContextimport org.apache.spark.{SparkConf, SparkContext}/*** Created by xubo on 2016/5/27.* 从hdfs下载经过avocado匹配好的数据* run:success*/object parquetRead2csbwamemDup {def main(args: Array[String]) {val conf = new SparkConf().setMaster("local[4]").setAppName(this.getClass().getSimpleName().filter(!_.equals('$')))val sc = new SparkContext(conf)val sqlContext = new SQLContext(sc)println("start:")//    val file = "hdfs://219.219.220.149:9000/xubo/16.adam/0"//    val file = "hdfs://219.219.220.149:9000/xubo/data/alignment/cs-bwamem/fastq/g38L100c10000000Nhs20.merge.adam/"val file = "hdfs://219.219.220.149:9000/xubo/data/alignment/cs-bwamem/fastq/g38L100c10000000Nhs20Paired12F2.merge.adam/"val df3 = sqlContext.read.option("mergeSchema", "true").parquet(file)//    df3.printSchema()df3.show()println(df3.count())var j = 1//    for (i <- df3) {//      println(j + ":" + i)//      j += 1//    }//    println("distinct:" + df3.distinct.count())sc.stopprintln("end")}}

结果:

scala>     println(df3.count())18512773scala> println("distinct:" + df3.distinct.count())distinct:18512773   

附录:
部分记录:

analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.56094168004282, 9.939863106970265)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.56094168004282 9.939863106970265pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.3929171883384, 9.895888368589208)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.3929171883384 9.895888368589208pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.5008221332571, 9.893101752855983)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.5008221332571 9.893101752855983pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.50341742708892, 9.938821490084942)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.50341742708892 9.938821490084942pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.43662172257996, 9.956950171714343)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.43662172257996 9.956950171714343pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.5294684266857, 9.903683785238687)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.5294684266857 9.903683785238687pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.40190347187567, 9.937420104198324)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.40190347187567 9.937420104198324pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.45073240443014, 10.004935473704805)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.45073240443014 10.004935473704805pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.48925135028793, 9.872713569624056)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.48925135028793 9.872713569624056pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.40273561658512, 9.90951049687331)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.40273561658512 9.90951049687331pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.46782169411932, 9.994092930915187)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.46782169411932 9.994092930915187pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.4832529043789, 9.916498387414416)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.4832529043789 9.916498387414416pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.51942615912995, 9.980650734352077)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.51942615912995 9.980650734352077pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.42283366088975, 10.025277342223173)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.42283366088975 10.025277342223173pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.56863795443184, 9.884954572476323)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.56863795443184 9.884954572476323pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.54418222714128, 9.937606729429495)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.54418222714128 9.937606729429495pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.51789788933257, 9.908230182966792)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.51789788933257 9.908230182966792pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.49175824175825, 9.953633581247528)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.49175824175825 9.953633581247528pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.57901688501767, 9.95075215935561)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.57901688501767 9.95075215935561pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.47238496713348, 9.984373782742304)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.47238496713348 9.984373782742304pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.46635524016688, 9.968761617488683)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.46635524016688 9.968761617488683pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.38825547732228, 9.92027510738216)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.38825547732228 9.92027510738216pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.4851967207246, 9.937352774317741)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.4851967207246 9.937352774317741pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.55340706386016, 9.817553555623133)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.55340706386016 9.817553555623133pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.44619450694668, 9.920089825716484)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.44619450694668 9.920089825716484pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.5452329749104, 9.926594228354224)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.5452329749104 9.926594228354224pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.45703727621714, 9.948262102053539)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.45703727621714 9.948262102053539pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.4996774887121, 9.889445679734042)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.4996774887121 9.889445679734042pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.44845544695727, 10.04637179033811)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.44845544695727 10.04637179033811pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.54115079081086, 9.861927310088358)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.54115079081086 9.861927310088358pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.50392688847637, 9.923422645866905)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.50392688847637 9.923422645866905pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.47992413398225, 9.938359869833343)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.47992413398225 9.938359869833343pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.58786034019695, 9.881771439959131)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.58786034019695 9.881771439959131pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.53944973771544, 9.928751173123029)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.53944973771544 9.928751173123029pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.51932833154697, 9.921047819212657)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.51932833154697 9.921047819212657pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.5051575828961, 9.955724349633446)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.5051575828961 9.955724349633446pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.45987665323875, 9.934862525129335)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.45987665323875 9.934862525129335pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.41697601370646, 9.924344174968864)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.41697601370646 9.924344174968864pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.4504008016032, 10.037075657682921)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.4504008016032 10.037075657682921pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.53644939965696, 9.99170757810556)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.53644939965696 9.99170757810556pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.48251923421006, 9.977411795227688)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.48251923421006 9.977411795227688pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.50750643408637, 9.996386292381906)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.50750643408637 9.996386292381906pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.43163108941235, 9.93712833366825)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.43163108941235 9.93712833366825pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.4462753788962, 9.969547663168125)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.4462753788962 9.969547663168125pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.564805511698, 9.981653834837397)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.564805511698 9.981653834837397pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.36601983305985, 9.935551164661065)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.36601983305985 9.935551164661065pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.5214522629195, 9.948307577425197)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.5214522629195 9.948307577425197pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.4983700519434, 9.945836975580079)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.4983700519434 9.945836975580079pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.55142989753293, 9.949033183252514)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.55142989753293 9.949033183252514pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.57046907934694, 9.951195165671601)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.57046907934694 9.951195165671601pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.5627628483855, 9.940990579395123)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.5627628483855 9.940990579395123pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.57557661362418, 9.88907098626934)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.57557661362418 9.88907098626934pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.43170531280145, 9.97153529406552)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.43170531280145 9.97153529406552pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.50119476443524, 9.993324406109274)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.50119476443524 9.993324406109274pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.53636818766986, 9.949249012517377)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.53636818766986 9.949249012517377pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.56565548290277, 9.982952754471368)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.56565548290277 9.982952754471368pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.44116064703562, 9.922618947362455)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.44116064703562 9.922618947362455pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.48944921316166, 9.932084984684073)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.48944921316166 9.932084984684073pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.3957828750534, 9.897098539777856)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.3957828750534 9.897098539777856pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.46895465769603, 9.819260659759172)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.46895465769603 9.819260659759172pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.5120634807163, 9.905862404052492)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.5120634807163 9.905862404052492pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.50274353310056, 9.979749125168556)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.50274353310056 9.979749125168556pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.49695820211852, 9.940759507891329)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.49695820211852 9.940759507891329pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.42262032467997, 9.919691202862072)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.42262032467997 9.919691202862072pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.49617638650656, 9.973742669699334)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.49617638650656 9.973742669699334pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.57427937915742, 9.952728233473406)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.57427937915742 9.952728233473406pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 198, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.48755438271164, 9.944608296251676)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.48755438271164 9.944608296251676pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.52546792398914, 10.000808728139058)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.52546792398914 10.000808728139058pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (192, 199, 205)low and high boundaries for computing mean and std.dev: (166, 231)mean and std.dev: (198.49446902654867, 9.999977060170599)low and high boundaries for proper pairs: (153, 244)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 153 244 0 198.49446902654867 9.999977060170599pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Load FASTQ files                                                                @Worker1skip orientation FF as there are not enough pairs                               analyzing insert size distribution for orientation(25, 50, 75) percentile: (191, 198, 205)low and high boundaries for computing mean and std.dev: (163, 233)mean and std.dev: (198.28641975308642, 10.159216233717126)low and high boundaries for proper pairs: (149, 247)skip orientation RF as there are not enough pairsskip orientation RR as there are not enough pairs@MemPeStatpes(0): 0 0 1 0.0 0.0pes(1): 149 247 0 198.28641975308642 10.159216233717126pes(2): 0 0 1 0.0 0.0pes(3): 0 0 1 0.0 0.0Summary:Worker1 Time: 815792Calculate Metrics Time: 2093Worker2 Time: 68846316/06/03 18:07:19 ERROR LiveListenerBus: SparkListenerBus has already stopped! Dropping event SparkListenerExecutorMetricsUpdate(4,WrappedArray())16/06/03 18:07:19 WARN QueuedThreadPool: 1 threads could not be stoppedCS-BWAMEM Finished!!!Jun 3, 2016 5:41:47 PM INFO: parquet.hadoop.ParquetInputFormat: Total input paths to process : 21Jun 3, 2016 5:41:57 PM INFO: parquet.hadoop.ParquetInputFormat: Total input paths to process : 21hadoop@Master:~/cloud/adam/xubo/data/GRCH38Sub/cs-bwamem$

参考

【1】https://github.com/xubo245/AdamLearning
【2】https://github.com/bigdatagenomics/adam/
【3】https://github.com/xubo245/SparkLearning
【4】http://spark.apache.org

研究成果:

【1】 [BIBM] Bo Xu, Changlong Li, Hang Zhuang, Jiali Wang, Qingfeng Wang, Chao Wang, and Xuehai Zhou, "Distributed Gene Clinical Decision Support System Based on Cloud Computing", in IEEE International Conference on Bioinformatics and Biomedicine. (BIBM 2017, CCF B)
【2】 [IEEE CLOUD] Bo Xu, Changlong Li, Hang Zhuang, Jiali Wang, Qingfeng Wang, Xuehai Zhou. Efficient Distributed Smith-Waterman Algorithm Based on Apache Spark (CLOUD 2017, CCF-C).
【3】 [CCGrid] Bo Xu, Changlong Li, Hang Zhuang, Jiali Wang, Qingfeng Wang, Jinhong Zhou, Xuehai Zhou. DSA: Scalable Distributed Sequence Alignment System Using SIMD Instructions. (CCGrid 2017, CCF-C).
【4】more: https://github.com/xubo245/Publications

Help

If you have any questions or suggestions, please write it in the issue of this project or send an e-mail to me: xubo245@mail.ustc.edu.cn
Wechat: xu601450868
QQ: 601450868

基因数据处理53之cs-bwamem集群版运行paird-end(1千万条100bp的reads)相关推荐

  1. 基因数据处理54之bwa-mem运行paird-end(1千万条100bp的reads)

    指令: ```hadoop@Master:~/cloud/adam/xubo/data/GRCH38Sub/cs-bwamem$ bwa mem GRCH38BWAindex/GRCH38chr1L3 ...

  2. 基因数据处理106之bwa-mem运行paird-end(1千万条100bp的reads g38L100c10000000Nhs20Paired12)

    脚本: hadoop@Master:~/xubo/project/alignment/sparkBWA$ cat g38L100c10000000Nhs20Paired12Bwamem.sh echo ...

  3. 基因数据处理56之bwa运行paird-end(1千万条100bp的reads).md

    (1)pair1.fq>sai bwa aln GRCH38BWAindex/GRCH38chr1L3556522.fasta g38L100c10000000Nhs20Paired1.fq & ...

  4. 基因数据处理59之snap运行single-end(1千万条100bp的reads)

    记录: hadoop@Master:~/cloud/adam/xubo/data/GRCH38Sub/cs-bwamem$ snap-aligner single snap/snapindex g38 ...

  5. 基因数据处理51之cs-bwamem集群版运行*

    将master的local改为集群就可以了. 集群运行结果: 问题: 匹配50条的时候,bwa和snap都是生成50条.但是cs-bwamem会生成492条,其中25和50条重读的很多,匹配位置不同. ...

  6. 基因数据处理52之cs-bwamem集群版运行(1千万条100bp的reads)

    1.art生成模拟序列: art_illumina -ss HS20 -i GRCH38BWAindex/GRCH38chr1L3556522.fna -l 100 -c 10000000 -o g3 ...

  7. pythonspark集群模式运行_有关python numpy pandas scipy 等 能在YARN集群上 运行PySpark

    有关这个问题,似乎这个在某些时候,用python写好,且spark没有响应的算法支持, 能否能在YARN集群上 运行PySpark方式, 将python分析程序提交上去? Spark Applicat ...

  8. 基于YARN集群构建运行PySpark Application

    文章转载:https://www.tuicool.com/articles/eaYVN3v Spark Application可以直接运行在YARN集群上,这种运行模式,会将资源的管理与协调统一交给Y ...

  9. 小白学习Spark03-在集群上运行Spark

    03 在集群上运行Spark 3.1 Spark运行架构 3.1.1 驱动器节点 3.1.2 执行器节点 3.1.3 集群管理器 3.1.4 启动Spark程序 3.1.5 小结 3.2 使用spar ...

最新文章

  1. xml 和android脚本之家,Android利用Document实现xml读取和写入操作
  2. 【深度学习】EfficientNetV2分析总结和flops的开源库
  3. 【Groovy】MOP 元对象协议与元编程 ( 方法注入 | 使用 @Category 注解进行方法注入 | 分类注入方法查找优先级 )
  4. 深度学习(九)caffe预测、特征可视化python接口调用
  5. spring-注入list集合对象(值是对象)
  6. activiti搭建
  7. Android接入百度自动更新SDK
  8. MyBatisPlus怎么忽略映射字段
  9. WPF 如何流畅地滚动ScrollViewer
  10. 无setup.exe情况下安装mysql5.7.28(win10)
  11. D3D 光照和材料 小样例
  12. 如何更改Lollipop上最新Chrome版本的标题栏和地址栏的颜色?
  13. 2020云栖大会-达摩院
  14. 如何远程控制您的PC(即使崩溃)
  15. 【FinE】单利与复利
  16. elasticsearch xpack license过期
  17. RuntimeError: NEOS requires a valid email address. Please set the ‘NEOS_EMAIL‘ environment variable.
  18. stata统计分析及行业应用案例分析_描述性统计分析的应用——基于描述性统计分析识别优质股票...
  19. 《计算机网络——自顶向下方法》学习笔记——应用层
  20. transforms.Normalize,计算数据量大数据集的像素均值(mean)和标准差(std)

热门文章

  1. Discuz论坛发帖可以提高我们Discuz论坛的人气
  2. 2021-08-30备货赛灵思Xilinx公司的7系列FPGA选择参考分类
  3. matlab fsolve实例,转 Matlab非线性方程求解器fsolve总结(含实例)
  4. 【ULR #2】Picks loves segment tree IX
  5. python俄罗斯方块教程_python tkinter实现俄罗斯方块 基础版
  6. kk音标导读(下):赖世雄老师26个英语字母导读示范 (附我备注)
  7. 日研发能跟踪气味信息的小型机器人
  8. 物理机centos7配置静态IP和网关
  9. 用Adobe Acrobat将Visio图转EPS图片
  10. 简易版走迷宫(c语言)