说明:

环境如上篇

对BWASW数据处理的时候pattern需要修改,由于有很多这样的段:

[bsw2_aln] read 17598 sequences/pairs (10000016 bp) ...
[bsw2_aln] read 17644 sequences/pairs (10000056 bp) ...
[bsw2_aln] read 17650 sequences/pairs (10001068 bp) ...
[bsw2_aln] read 17660 sequences/pairs (10001404 bp) ...

需要进行第二次pattern,将其进行求和

另外将pattern结果和total结果写在一段代码中,写入两个文件

代码:

package test
import scala.io.Source
import java.io.File._
import java.io.PrintWriter
import scala.collection.mutable.ArrayBuffer
object logPatternBwaswAll extends App {
val directory="file/allbwasw"
//val directory0="file/bwaResult"val filename=directory+"result/allbwasw.txt"
val filename2=directory+"result/allbwaswTotal.txt"
val out=new PrintWriter(filename)
val out2=new PrintWriter(filename2)
val files = (new java.io.File(directory)).listFiles()
for (ifile <- files) {
val source = Source.fromFile(ifile).mkString
val pattern = """(bwa bwasw)[^\:]+\:\s*([0-9]*.[0-9]*)[^\:]+\:\s*([0-9]*.[0-9]*)""".r
val pattern2="""(\[bsw2_aln\]\s*read\s*[0-9]+\s+sequences[^\:]+)\:""".r
val pattern3="""\[bsw2_aln\]\s*read\s*([0-9]+)\s+sequences""".rval b1=for(pattern(s1,num1,num2)<-pattern.findAllIn(source)) yield (s1,num1,num2)
val b2=for(pattern2(seq)<-pattern2.findAllIn(source)) yield (seq)
val b22=b2.toArray
var b3=new ArrayBuffer[String]()
//println(b2.length+" "+b22.length);
for(i<-0 until b22.length){var sump3=0;for (pattern3(num) <- pattern3.findAllIn(b22(i))) {sump3=sump3+num.toInt;}b3.insert(i, sump3.toString())
}val b11=b1.toArray
//val b222=b3.toArray
// println("b11.length:"+b11.length+" b3.length:"+b3.length+" b222.length:"+b222.length)val reads=b3.distinct
// var array2=new ArrayBuffer[ArrayBuffer[String]](reads.length,4)var array2=Array.ofDim[String](reads.length, 5)var readsi=0var arr1=0.0var arr2=0.0for(k<-0 until b11.length) {println(b3(k)+","+b11(k)._1+","+b11(k)._2+","+b11(k)._3)out.println(b3(k)+","+b11(k)._1+","+b11(k)._2+","+b11(k)._3)  }for(j<-0 until reads.length){for(k<-0 until b11.length) {if(reads(j)==b3(k)) array2(j)(0)=reads(j)array2(j)(1)=b11(k)._1array2(j)(2)=(b11(k)._2.toDouble+array2(j)(2).toDouble).toStringarray2(j)(3)=(b11(k)._3.toDouble+array2(j)(3).toDouble).toStringarray2(j)(3)=(array2(j)(3).toInt+1).toString}}
}out.close()}

文件:

hadoop@Mcnode2:~/cloud/adam/xubo/data/test20160310/bwasw$ ./bwasw.sh
[M::bwa_idx_load_from_disk] read 261 ALT contigs
[bsw2_aln] read 5 sequences/pairs (2691 bp) ...
[main] Version: 0.7.12-r1039
[main] CMD: bwa bwasw ../GCA_000001405.15_GRCh38/GCA_000001405.15_GRCh38_full_analysis_set.fna ../SRR003161h20.fastq
[main] Real time: 112.694 sec; CPU: 6.378 sec
[M::bwa_idx_load_from_disk] read 261 ALT contigs
[bsw2_aln] read 250 sequences/pairs (161179 bp) ...
[main] Version: 0.7.12-r1039
[main] CMD: bwa bwasw ../GCA_000001405.15_GRCh38/GCA_000001405.15_GRCh38_full_analysis_set.fna ../SRR003161h1000.fastq
[main] Real time: 485.574 sec; CPU: 12.060 sec
[M::bwa_idx_load_from_disk] read 261 ALT contigs
[bsw2_aln] read 2500 sequences/pairs (1499370 bp) ...
[main] Version: 0.7.12-r1039
[main] CMD: bwa bwasw ../GCA_000001405.15_GRCh38/GCA_000001405.15_GRCh38_full_analysis_set.fna ../SRR003161h10000.fastq
[main] Real time: 2127.204 sec; CPU: 40.981 sec
[M::bwa_idx_load_from_disk] read 261 ALT contigs
[bsw2_aln] read 17040 sequences/pairs (10000385 bp) ...
[bsw2_aln] read 7960 sequences/pairs (4469697 bp) ...
[main] Version: 0.7.12-r1039
[main] CMD: bwa bwasw ../GCA_000001405.15_GRCh38/GCA_000001405.15_GRCh38_full_analysis_set.fna ../SRR003161h100000.fastq
[main] Real time: 3489.448 sec; CPU: 214.049 sec
[M::bwa_idx_load_from_disk] read 261 ALT contigs
[bsw2_aln] read 5 sequences/pairs (2691 bp) ...
[main] Version: 0.7.12-r1039
[main] CMD: bwa bwasw ../GCA_000001405.15_GRCh38/GCA_000001405.15_GRCh38_full_analysis_set.fna ../SRR003161h20.fastq
[main] Real time: 132.476 sec; CPU: 7.228 sec
[M::bwa_idx_load_from_disk] read 261 ALT contigs
[bsw2_aln] read 250 sequences/pairs (161179 bp) ...
[main] Version: 0.7.12-r1039
[main] CMD: bwa bwasw ../GCA_000001405.15_GRCh38/GCA_000001405.15_GRCh38_full_analysis_set.fna ../SRR003161h1000.fastq
[main] Real time: 520.267 sec; CPU: 11.940 sec
[M::bwa_idx_load_from_disk] read 261 ALT contigs
[bsw2_aln] read 2500 sequences/pairs (1499370 bp) ...
[main] Version: 0.7.12-r1039
[main] CMD: bwa bwasw ../GCA_000001405.15_GRCh38/GCA_000001405.15_GRCh38_full_analysis_set.fna ../SRR003161h10000.fastq
[main] Real time: 1972.161 sec; CPU: 39.276 sec
[M::bwa_idx_load_from_disk] read 261 ALT contigs
[bsw2_aln] read 17040 sequences/pairs (10000385 bp) ...
[bsw2_aln] read 7960 sequences/pairs (4469697 bp) ...
[main] Version: 0.7.12-r1039
[main] CMD: bwa bwasw ../GCA_000001405.15_GRCh38/GCA_000001405.15_GRCh38_full_analysis_set.fna ../SRR003161h100000.fastq
[main] Real time: 3474.798 sec; CPU: 213.719 sec
[M::bwa_idx_load_from_disk] read 261 ALT contigs
[bsw2_aln] read 5 sequences/pairs (2691 bp) ...
[main] Version: 0.7.12-r1039
[main] CMD: bwa bwasw ../GCA_000001405.15_GRCh38/GCA_000001405.15_GRCh38_full_analysis_set.fna ../SRR003161h20.fastq
[main] Real time: 115.312 sec; CPU: 7.209 sec
[M::bwa_idx_load_from_disk] read 261 ALT contigs
[bsw2_aln] read 250 sequences/pairs (161179 bp) ...
[main] Version: 0.7.12-r1039
[main] CMD: bwa bwasw ../GCA_000001405.15_GRCh38/GCA_000001405.15_GRCh38_full_analysis_set.fna ../SRR003161h1000.fastq
[main] Real time: 426.709 sec; CPU: 11.335 sec
[M::bwa_idx_load_from_disk] read 261 ALT contigs
[bsw2_aln] read 2500 sequences/pairs (1499370 bp) ...
[main] Version: 0.7.12-r1039
[main] CMD: bwa bwasw ../GCA_000001405.15_GRCh38/GCA_000001405.15_GRCh38_full_analysis_set.fna ../SRR003161h10000.fastq
[main] Real time: 2190.078 sec; CPU: 40.916 sec
[M::bwa_idx_load_from_disk] read 261 ALT contigs
[bsw2_aln] read 17040 sequences/pairs (10000385 bp) ...
[bsw2_aln] read 7960 sequences/pairs (4469697 bp) ...
[main] Version: 0.7.12-r1039
[main] CMD: bwa bwasw ../GCA_000001405.15_GRCh38/GCA_000001405.15_GRCh38_full_analysis_set.fna ../SRR003161h100000.fastq
[main] Real time: 3346.748 sec; CPU: 212.718 sec
[M::bwa_idx_load_from_disk] read 261 ALT contigs
[bsw2_aln] read 17040 sequences/pairs (10000385 bp) ...
[bsw2_aln] read 17736 sequences/pairs (10000450 bp) ...
[bsw2_aln] read 17632 sequences/pairs (10000617 bp) ...
[bsw2_aln] read 17598 sequences/pairs (10000016 bp) ...
[bsw2_aln] read 17644 sequences/pairs (10000056 bp) ...
[bsw2_aln] read 17650 sequences/pairs (10001068 bp) ...
[bsw2_aln] read 17660 sequences/pairs (10001404 bp) ...
[bsw2_aln] read 17658 sequences/pairs (10000239 bp) ...
[bsw2_aln] read 17756 sequences/pairs (10000562 bp) ...
[bsw2_aln] read 17168 sequences/pairs (10000899 bp) ...
[bsw2_aln] read 17230 sequences/pairs (10000389 bp) ...
[bsw2_aln] read 17668 sequences/pairs (10001160 bp) ...
[bsw2_aln] read 17684 sequences/pairs (10000797 bp) ...
[bsw2_aln] read 17668 sequences/pairs (10000303 bp) ...
[bsw2_aln] read 17772 sequences/pairs (10000460 bp) ...
[bsw2_aln] read 17722 sequences/pairs (10000941 bp) ...
[bsw2_aln] read 17670 sequences/pairs (10000403 bp) ...
[bsw2_aln] read 17692 sequences/pairs (10000495 bp) ...
[bsw2_aln] read 17732 sequences/pairs (10000515 bp) ...
[bsw2_aln] read 17268 sequences/pairs (10000233 bp) ...
[bsw2_aln] read 16986 sequences/pairs (10001479 bp) ...
[bsw2_aln] read 17376 sequences/pairs (10000021 bp) ...
[bsw2_aln] read 17592 sequences/pairs (10001063 bp) ...
[bsw2_aln] read 17608 sequences/pairs (10000532 bp) ...
[bsw2_aln] read 17634 sequences/pairs (10000966 bp) ...
[bsw2_aln] read 17610 sequences/pairs (10000375 bp) ...
[bsw2_aln] read 17630 sequences/pairs (10000393 bp) ...
[bsw2_aln] read 17688 sequences/pairs (10001395 bp) ...
[bsw2_aln] read 17672 sequences/pairs (10000206 bp) ...
[bsw2_aln] read 17246 sequences/pairs (10000227 bp) ...
[bsw2_aln] read 16678 sequences/pairs (10000476 bp) ...
[bsw2_aln] read 16782 sequences/pairs (10000709 bp) ...
[bsw2_aln] read 17316 sequences/pairs (10000968 bp) ...
[bsw2_aln] read 17358 sequences/pairs (10000936 bp) ...
[bsw2_aln] read 17578 sequences/pairs (10000630 bp) ...
[bsw2_aln] read 17546 sequences/pairs (10000372 bp) ...
[bsw2_aln] read 17478 sequences/pairs (10000575 bp) ...
[bsw2_aln] read 17420 sequences/pairs (10001079 bp) ...
[bsw2_aln] read 17424 sequences/pairs (10002025 bp) ...
[bsw2_aln] read 16508 sequences/pairs (10000430 bp) ...
[bsw2_aln] read 17426 sequences/pairs (10001030 bp) ...
[bsw2_aln] read 17766 sequences/pairs (10000476 bp) ...
[bsw2_aln] read 17664 sequences/pairs (10001067 bp) ...
[bsw2_aln] read 17482 sequences/pairs (10000317 bp) ...
[bsw2_aln] read 17564 sequences/pairs (10000063 bp) ...
[bsw2_aln] read 17446 sequences/pairs (10000263 bp) ...
[bsw2_aln] read 17466 sequences/pairs (10000042 bp) ...
[bsw2_aln] read 17566 sequences/pairs (10000825 bp) ...
[bsw2_aln] read 17366 sequences/pairs (10000771 bp) ...
[bsw2_aln] read 17296 sequences/pairs (10001904 bp) ...
[bsw2_aln] read 17650 sequences/pairs (10000280 bp) ...
[bsw2_aln] read 17648 sequences/pairs (10000709 bp) ...
[bsw2_aln] read 17658 sequences/pairs (10000390 bp) ...
[bsw2_aln] read 17562 sequences/pairs (10000598 bp) ...
[bsw2_aln] read 17576 sequences/pairs (10000441 bp) ...
[bsw2_aln] read 17598 sequences/pairs (10000038 bp) ...
[bsw2_aln] read 17558 sequences/pairs (10001083 bp) ...
[bsw2_aln] read 17486 sequences/pairs (10000213 bp) ...
[bsw2_aln] read 17428 sequences/pairs (10000555 bp) ...
[bsw2_aln] read 17316 sequences/pairs (10000565 bp) ...
[bsw2_aln] read 17376 sequences/pairs (10000634 bp) ...
[bsw2_aln] read 17554 sequences/pairs (10000555 bp) ...
[bsw2_aln] read 17544 sequences/pairs (10000358 bp) ...
[bsw2_aln] read 17546 sequences/pairs (10000017 bp) ...
[bsw2_aln] read 17452 sequences/pairs (10000587 bp) ...
[bsw2_aln] read 17522 sequences/pairs (10000559 bp) ...
[bsw2_aln] read 17480 sequences/pairs (10001210 bp) ...
[bsw2_aln] read 17406 sequences/pairs (10000246 bp) ...
[bsw2_aln] read 17394 sequences/pairs (10000655 bp) ...
[bsw2_aln] read 17132 sequences/pairs (10000531 bp) ...
[bsw2_aln] read 17070 sequences/pairs (10000705 bp) ...
[bsw2_aln] read 17280 sequences/pairs (10000702 bp) ...
[bsw2_aln] read 17504 sequences/pairs (10000584 bp) ...
[bsw2_aln] read 17480 sequences/pairs (10000908 bp) ...
[bsw2_aln] read 17484 sequences/pairs (10000456 bp) ...
[bsw2_aln] read 17420 sequences/pairs (10000394 bp) ...
[bsw2_aln] read 17324 sequences/pairs (10000472 bp) ...
[bsw2_aln] read 17152 sequences/pairs (10000658 bp) ...
[bsw2_aln] read 14281 sequences/pairs (8410932 bp) ...
[main] Version: 0.7.12-r1039
[main] CMD: bwa bwasw ../GCA_000001405.15_GRCh38/GCA_000001405.15_GRCh38_full_analysis_set.fna ../SRR003161.fastq
[main] Real time: 73953.156 sec; CPU: 10089.564 sec
[M::bwa_idx_load_from_disk] read 261 ALT contigs
[bsw2_aln] read 17040 sequences/pairs (10000385 bp) ...
[bsw2_aln] read 17736 sequences/pairs (10000450 bp) ...
[bsw2_aln] read 17632 sequences/pairs (10000617 bp) ...
[bsw2_aln] read 17598 sequences/pairs (10000016 bp) ...
[bsw2_aln] read 17644 sequences/pairs (10000056 bp) ...
[bsw2_aln] read 17650 sequences/pairs (10001068 bp) ...
[bsw2_aln] read 17660 sequences/pairs (10001404 bp) ...
[bsw2_aln] read 17658 sequences/pairs (10000239 bp) ...
[bsw2_aln] read 17756 sequences/pairs (10000562 bp) ...
[bsw2_aln] read 17168 sequences/pairs (10000899 bp) ...
[bsw2_aln] read 17230 sequences/pairs (10000389 bp) ...
[bsw2_aln] read 17668 sequences/pairs (10001160 bp) ...
[bsw2_aln] read 17684 sequences/pairs (10000797 bp) ...
[bsw2_aln] read 17668 sequences/pairs (10000303 bp) ...
[bsw2_aln] read 17772 sequences/pairs (10000460 bp) ...
[bsw2_aln] read 17722 sequences/pairs (10000941 bp) ...
[bsw2_aln] read 17670 sequences/pairs (10000403 bp) ...
[bsw2_aln] read 17692 sequences/pairs (10000495 bp) ...
[bsw2_aln] read 17732 sequences/pairs (10000515 bp) ...
[bsw2_aln] read 17268 sequences/pairs (10000233 bp) ...
[bsw2_aln] read 16986 sequences/pairs (10001479 bp) ...
[bsw2_aln] read 17376 sequences/pairs (10000021 bp) ...
[bsw2_aln] read 17592 sequences/pairs (10001063 bp) ...
[bsw2_aln] read 17608 sequences/pairs (10000532 bp) ...
[bsw2_aln] read 17634 sequences/pairs (10000966 bp) ...
[bsw2_aln] read 17610 sequences/pairs (10000375 bp) ...
[bsw2_aln] read 17630 sequences/pairs (10000393 bp) ...
[bsw2_aln] read 17688 sequences/pairs (10001395 bp) ...
[bsw2_aln] read 17672 sequences/pairs (10000206 bp) ...
[bsw2_aln] read 17246 sequences/pairs (10000227 bp) ...
[bsw2_aln] read 16678 sequences/pairs (10000476 bp) ...
[bsw2_aln] read 16782 sequences/pairs (10000709 bp) ...
[bsw2_aln] read 17316 sequences/pairs (10000968 bp) ...
[bsw2_aln] read 17358 sequences/pairs (10000936 bp) ...
[bsw2_aln] read 17578 sequences/pairs (10000630 bp) ...
[bsw2_aln] read 17546 sequences/pairs (10000372 bp) ...
[bsw2_aln] read 17478 sequences/pairs (10000575 bp) ...
[bsw2_aln] read 17420 sequences/pairs (10001079 bp) ...
[bsw2_aln] read 17424 sequences/pairs (10002025 bp) ...
[bsw2_aln] read 16508 sequences/pairs (10000430 bp) ...
[bsw2_aln] read 17426 sequences/pairs (10001030 bp) ...
[bsw2_aln] read 17766 sequences/pairs (10000476 bp) ...
[bsw2_aln] read 17664 sequences/pairs (10001067 bp) ...
[bsw2_aln] read 17482 sequences/pairs (10000317 bp) ...
[bsw2_aln] read 17564 sequences/pairs (10000063 bp) ...
[bsw2_aln] read 17446 sequences/pairs (10000263 bp) ...
[bsw2_aln] read 17466 sequences/pairs (10000042 bp) ...
[bsw2_aln] read 17566 sequences/pairs (10000825 bp) ...
[bsw2_aln] read 17366 sequences/pairs (10000771 bp) ...
[bsw2_aln] read 17296 sequences/pairs (10001904 bp) ...
[bsw2_aln] read 17650 sequences/pairs (10000280 bp) ...
[bsw2_aln] read 17648 sequences/pairs (10000709 bp) ...
[bsw2_aln] read 17658 sequences/pairs (10000390 bp) ...
[bsw2_aln] read 17562 sequences/pairs (10000598 bp) ...
[bsw2_aln] read 17576 sequences/pairs (10000441 bp) ...
[bsw2_aln] read 17598 sequences/pairs (10000038 bp) ...
[bsw2_aln] read 17558 sequences/pairs (10001083 bp) ...
[bsw2_aln] read 17486 sequences/pairs (10000213 bp) ...
[bsw2_aln] read 17428 sequences/pairs (10000555 bp) ...
[bsw2_aln] read 17316 sequences/pairs (10000565 bp) ...
[bsw2_aln] read 17376 sequences/pairs (10000634 bp) ...
[bsw2_aln] read 17554 sequences/pairs (10000555 bp) ...
[bsw2_aln] read 17544 sequences/pairs (10000358 bp) ...
[bsw2_aln] read 17546 sequences/pairs (10000017 bp) ...
[bsw2_aln] read 17452 sequences/pairs (10000587 bp) ...
[bsw2_aln] read 17522 sequences/pairs (10000559 bp) ...
[bsw2_aln] read 17480 sequences/pairs (10001210 bp) ...
[bsw2_aln] read 17406 sequences/pairs (10000246 bp) ...
[bsw2_aln] read 17394 sequences/pairs (10000655 bp) ...
[bsw2_aln] read 17132 sequences/pairs (10000531 bp) ...
[bsw2_aln] read 17070 sequences/pairs (10000705 bp) ...
[bsw2_aln] read 17280 sequences/pairs (10000702 bp) ...
[bsw2_aln] read 17504 sequences/pairs (10000584 bp) ...
[bsw2_aln] read 17480 sequences/pairs (10000908 bp) ...
[bsw2_aln] read 17484 sequences/pairs (10000456 bp) ...
[bsw2_aln] read 17420 sequences/pairs (10000394 bp) ...
[bsw2_aln] read 17324 sequences/pairs (10000472 bp) ...
[bsw2_aln] read 17152 sequences/pairs (10000658 bp) ...
[bsw2_aln] read 14281 sequences/pairs (8410932 bp) ...
[main] Version: 0.7.12-r1039
[main] CMD: bwa bwasw ../GCA_000001405.15_GRCh38/GCA_000001405.15_GRCh38_full_analysis_set.fna ../SRR003161.fastq
[main] Real time: 72603.284 sec; CPU: 10059.902 sec
[M::bwa_idx_load_from_disk] read 261 ALT contigs
[bsw2_aln] read 17040 sequences/pairs (10000385 bp) ...
[bsw2_aln] read 17736 sequences/pairs (10000450 bp) ...
[bsw2_aln] read 17632 sequences/pairs (10000617 bp) ...
[bsw2_aln] read 17598 sequences/pairs (10000016 bp) ...
[bsw2_aln] read 17644 sequences/pairs (10000056 bp) ...
[bsw2_aln] read 17650 sequences/pairs (10001068 bp) ...
[bsw2_aln] read 17660 sequences/pairs (10001404 bp) ...
[bsw2_aln] read 17658 sequences/pairs (10000239 bp) ...
[bsw2_aln] read 17756 sequences/pairs (10000562 bp) ...
[bsw2_aln] read 17168 sequences/pairs (10000899 bp) ...
[bsw2_aln] read 17230 sequences/pairs (10000389 bp) ...
[bsw2_aln] read 17668 sequences/pairs (10001160 bp) ...
[bsw2_aln] read 17684 sequences/pairs (10000797 bp) ...
[bsw2_aln] read 17668 sequences/pairs (10000303 bp) ...
[bsw2_aln] read 17772 sequences/pairs (10000460 bp) ...
[bsw2_aln] read 17722 sequences/pairs (10000941 bp) ...
[bsw2_aln] read 17670 sequences/pairs (10000403 bp) ...
[bsw2_aln] read 17692 sequences/pairs (10000495 bp) ...
[bsw2_aln] read 17732 sequences/pairs (10000515 bp) ...
[bsw2_aln] read 17268 sequences/pairs (10000233 bp) ...
[bsw2_aln] read 16986 sequences/pairs (10001479 bp) ...
[bsw2_aln] read 17376 sequences/pairs (10000021 bp) ...
[bsw2_aln] read 17592 sequences/pairs (10001063 bp) ...
[bsw2_aln] read 17608 sequences/pairs (10000532 bp) ...
[bsw2_aln] read 17634 sequences/pairs (10000966 bp) ...
[bsw2_aln] read 17610 sequences/pairs (10000375 bp) ...
[bsw2_aln] read 17630 sequences/pairs (10000393 bp) ...
[bsw2_aln] read 17688 sequences/pairs (10001395 bp) ...
[bsw2_aln] read 17672 sequences/pairs (10000206 bp) ...
[bsw2_aln] read 17246 sequences/pairs (10000227 bp) ...
[bsw2_aln] read 16678 sequences/pairs (10000476 bp) ...
[bsw2_aln] read 16782 sequences/pairs (10000709 bp) ...
[bsw2_aln] read 17316 sequences/pairs (10000968 bp) ...
[bsw2_aln] read 17358 sequences/pairs (10000936 bp) ...
[bsw2_aln] read 17578 sequences/pairs (10000630 bp) ...
[bsw2_aln] read 17546 sequences/pairs (10000372 bp) ...
[bsw2_aln] read 17478 sequences/pairs (10000575 bp) ...
[bsw2_aln] read 17420 sequences/pairs (10001079 bp) ...
[bsw2_aln] read 17424 sequences/pairs (10002025 bp) ...
[bsw2_aln] read 16508 sequences/pairs (10000430 bp) ...
[bsw2_aln] read 17426 sequences/pairs (10001030 bp) ...
[bsw2_aln] read 17766 sequences/pairs (10000476 bp) ...
[bsw2_aln] read 17664 sequences/pairs (10001067 bp) ...
[bsw2_aln] read 17482 sequences/pairs (10000317 bp) ...
[bsw2_aln] read 17564 sequences/pairs (10000063 bp) ...
[bsw2_aln] read 17446 sequences/pairs (10000263 bp) ...
[bsw2_aln] read 17466 sequences/pairs (10000042 bp) ...
[bsw2_aln] read 17566 sequences/pairs (10000825 bp) ...
[bsw2_aln] read 17366 sequences/pairs (10000771 bp) ...
[bsw2_aln] read 17296 sequences/pairs (10001904 bp) ...
[bsw2_aln] read 17650 sequences/pairs (10000280 bp) ...
[bsw2_aln] read 17648 sequences/pairs (10000709 bp) ...
[bsw2_aln] read 17658 sequences/pairs (10000390 bp) ...
[bsw2_aln] read 17562 sequences/pairs (10000598 bp) ...
[bsw2_aln] read 17576 sequences/pairs (10000441 bp) ...
[bsw2_aln] read 17598 sequences/pairs (10000038 bp) ...
[bsw2_aln] read 17558 sequences/pairs (10001083 bp) ...
[bsw2_aln] read 17486 sequences/pairs (10000213 bp) ...
[bsw2_aln] read 17428 sequences/pairs (10000555 bp) ...
[bsw2_aln] read 17316 sequences/pairs (10000565 bp) ...
[bsw2_aln] read 17376 sequences/pairs (10000634 bp) ...
[bsw2_aln] read 17554 sequences/pairs (10000555 bp) ...
[bsw2_aln] read 17544 sequences/pairs (10000358 bp) ...
[bsw2_aln] read 17546 sequences/pairs (10000017 bp) ...
[bsw2_aln] read 17452 sequences/pairs (10000587 bp) ...
[bsw2_aln] read 17522 sequences/pairs (10000559 bp) ...
[bsw2_aln] read 17480 sequences/pairs (10001210 bp) ...
[bsw2_aln] read 17406 sequences/pairs (10000246 bp) ...
[bsw2_aln] read 17394 sequences/pairs (10000655 bp) ...
[bsw2_aln] read 17132 sequences/pairs (10000531 bp) ...
[bsw2_aln] read 17070 sequences/pairs (10000705 bp) ...
[bsw2_aln] read 17280 sequences/pairs (10000702 bp) ...
[bsw2_aln] read 17504 sequences/pairs (10000584 bp) ...
[bsw2_aln] read 17480 sequences/pairs (10000908 bp) ...
[bsw2_aln] read 17484 sequences/pairs (10000456 bp) ...
[bsw2_aln] read 17420 sequences/pairs (10000394 bp) ...
[bsw2_aln] read 17324 sequences/pairs (10000472 bp) ...
[bsw2_aln] read 17152 sequences/pairs (10000658 bp) ...
[bsw2_aln] read 14281 sequences/pairs (8410932 bp) ...
[main] Version: 0.7.12-r1039
[main] CMD: bwa bwasw ../GCA_000001405.15_GRCh38/GCA_000001405.15_GRCh38_full_analysis_set.fna ../SRR003161.fastq
[main] Real time: 70031.888 sec; CPU: 10062.826 sec

运行结果:

5,bwa bwasw,120.16066666666667,6.938333333333333,3
250,bwa bwasw,477.5166666666667,11.778333333333334,3
2500,bwa bwasw,2096.4809999999998,40.391,3
25000,bwa bwasw,3436.9979999999996,213.49533333333332,3
1376701,bwa bwasw,72196.10933333334,10070.764000000001,3
5,120.16066666666667,6.938333333333333,3
250,477.5166666666667,11.778333333333334,3
2500,2096.4809999999998,40.391,3
25000,3436.9979999999996,213.49533333333332,3
1376701,72196.10933333334,10070.764000000001,3

文件1:

reads,name,RealTime,CPUTime,number
5,bwa bwasw,120.16066666666667,6.938333333333333,3
250,bwa bwasw,477.5166666666667,11.778333333333334,3
2500,bwa bwasw,2096.4809999999998,40.391,3
25000,bwa bwasw,3436.9979999999996,213.49533333333332,3
1376701,bwa bwasw,72196.10933333334,10070.764000000001,3

文件2:

reads,RealTime,CPUTime,number
5,120.16066666666667,6.938333333333333,3
250,477.5166666666667,11.778333333333334,3
2500,2096.4809999999998,40.391,3
25000,3436.9979999999996,213.49533333333332,3
1376701,72196.10933333334,10070.764000000001,3

基因数据处理16之scala对BWASW运行结果进行时间统计相关推荐

  1. 基因数据处理120之scala调用SSW在linux下运行

    更多代码请见:https://github.com/xubo245 基因数据处理系列 1.解释 先有java提供转换,使用jni调用c 然后scala调用java 2.代码: 2.1 java: pa ...

  2. 基因数据处理119之java调用SSW在linux下运行

    更多代码请见:https://github.com/xubo245 基因数据处理系列 1.解释 测试自带Example: xubo@xubo:~/xubo/tools/Complete-Striped ...

  3. 基因数据处理118之SSW运行

    更多代码请见:https://github.com/xubo245 基因数据处理系列 1.解释 SSW是一个更快的SW算法,并且提供了c语言lib和java的调用 代码: https://github ...

  4. 基因数据处理1之mapping_to_cram

    基因数据处理1之mapping_to_cram 参考资料: A Worked Example Obtain some public data We will use the first 100,000 ...

  5. 基因数据处理123之SSW代码不正确,到时比SparkSW时间长

    更多代码请见:https://github.com/xubo245 基因数据处理系列 1.解释 由于要生成新的score matrix:blosum50,第一次使用静态方法,直接传给align,到时每 ...

  6. 基因数据处理121之SSW的score matrix调整,使得与SparkSW评分一致

    更多代码请见:https://github.com/xubo245 基因数据处理系列 1.解释 SSW的评分矩阵是128*128的,是按char的int值来进行计算的.而blosum50是蛋白质的,而 ...

  7. 基因数据处理12之samtool的tview来查看sam的匹配文件

    基因数据处理12之samtool的tview来查看sam的匹配文件 具体的之前有文章讲过:http://blog.csdn.net/xubo245/article/details/50836185 记 ...

  8. 基因数据处理122之SSW和SparkSW评分不一致,query为Q9

    更多代码请见:https://github.com/xubo245 基因数据处理系列 1.解释 RT,但是顺序一致 2.代码: hadoop@Master:~/disk2/xubo/project/a ...

  9. Ubuntu 16.04下用Wine运行的软件出现方块的解决思路(应该是兼容现在所有平台的Wine碰到这个的问题)

    Ubuntu 16.04下用Wine运行的软件出现方块的解决思路(应该是兼容现在所有平台的Wine碰到这个的问题) 参考文章: (1)Ubuntu 16.04下用Wine运行的软件出现方块的解决思路( ...

最新文章

  1. 5GS 协议栈 — PFCP 协议 — PDR 报文检测规则
  2. UVA11892 ENimEN —— 博弈
  3. 假笨说-协助美团kafka团队定位到的一个JVM Crash问题
  4. 4 关卡流 进阶_儿童桌游要不要鸡血的过关?关卡制儿童桌游介绍与方法论
  5. go插件 vscode 报错_MacOS中 VSCode 安装 GO 插件失败问题的快速解决方法
  6. IOS开发之日期时间格式化字符说明
  7. 51单片机ALE引脚的控制(摘录)
  8. 给刚做网站不久的草根站长们
  9. VMware虚拟机中 启动Windows XP系统黑屏 的解决
  10. sqluldr2用法
  11. unity三维地球实现方法
  12. ansys mechanical 脚本编写
  13. ps 去除gif水印
  14. 【时间之外】浏览器分屏使用技巧
  15. android字符串加删除线,android textview 添加上划线 中划线 删除线
  16. 腾讯-腾讯云citybase产品白皮书
  17. Python3的函数的详解
  18. C/C++时间字符串和时间戳的相互转化
  19. 模拟退火算法——仿真篇
  20. 【系统分析师之路】2018年上系统分析师综合知识真题

热门文章

  1. java i 非原子性_java i++ 非原子操作
  2. wifi质量强度测试
  3. 用户,角色,权限配置表
  4. 长假出游学习指南——碎片化学习的公众号推荐
  5. linux下php安装pathinfo
  6. 利用opencv结合mfc实现识别圆形标记点并计算多个圆形标记点的三维坐标,拟合平面并计算法向量
  7. 如何在Visual Studio中调整代码字体的大小和颜色
  8. 【UTF-8编码透析】神奇的“联通”乱码现象
  9. pytest+allure实战
  10. android 功能防抖,Android 功能防抖