1.下载:

hadoop@Master:~/xubo/tools$  git clone  https://github.com/BenLangmead/bowtie2.git
Cloning into 'bowtie2'...
remote: Counting objects: 7503, done.
remote: Total 7503 (delta 0), reused 0 (delta 0), pack-reused 7503
Receiving objects: 100% (7503/7503), 143.80 MiB | 403.00 KiB/s, done.
Resolving deltas: 100% (4949/4949), done.
Checking connectivity... done.

2.安装:

hadoop@Master:~/xubo/tools/bowtie2$ make
g++ -O3 -m64 -msse2  -funroll-loops -g3 -DCOMPILER_OPTIONS="\"-O3 -m64 -msse2  -funroll-loops -g3 -DPOPCNT_CAPABILITY\"" -DPOPCNT_CAPABILITY \-fno-strict-aliasing -DBOWTIE2_VERSION="\"`cat VERSION`\"" -DBUILD_HOST="\"`hostname`\"" -DBUILD_TIME="\"`date`\"" -DCOMPILER_VERSION="\"`g++ -v 2>&1 | tail -1`\"" -D_LARGEFILE_SOURCE -D_FILE_OFFSET_BITS=64 -D_GNU_SOURCE  -DBOWTIE_MM  -DBOWTIE2 -DNDEBUG -Wall \-I third_party \-o bowtie2-build-s bt2_build.cpp \ccnt_lut.cpp ref_read.cpp alphabet.cpp shmem.cpp edit.cpp bt2_idx.cpp bt2_io.cpp bt2_util.cpp reference.cpp ds.cpp multikey_qsort.cpp limit.cpp random_source.cpp tinythread.cpp diff_sample.cpp bowtie_build_main.cpp \-lpthread
g++ -O3 -m64 -msse2  -funroll-loops -g3 -DCOMPILER_OPTIONS="\"-O3 -m64 -msse2  -funroll-loops -g3 -DPOPCNT_CAPABILITY\"" -DPOPCNT_CAPABILITY \-fno-strict-aliasing -DBOWTIE2_VERSION="\"`cat VERSION`\"" -DBUILD_HOST="\"`hostname`\"" -DBUILD_TIME="\"`date`\"" -DCOMPILER_VERSION="\"`g++ -v 2>&1 | tail -1`\"" -D_LARGEFILE_SOURCE -D_FILE_OFFSET_BITS=64 -D_GNU_SOURCE  -DBOWTIE_MM  -DBOWTIE2 -DBOWTIE_64BIT_INDEX -DNDEBUG -Wall \-I third_party \-o bowtie2-build-l bt2_build.cpp \ccnt_lut.cpp ref_read.cpp alphabet.cpp shmem.cpp edit.cpp bt2_idx.cpp bt2_io.cpp bt2_util.cpp reference.cpp ds.cpp multikey_qsort.cpp limit.cpp random_source.cpp tinythread.cpp diff_sample.cpp bowtie_build_main.cpp \-lpthread
g++ -O3 -m64 -msse2  -funroll-loops -g3 -DCOMPILER_OPTIONS="\"-O3 -m64 -msse2  -funroll-loops -g3 -DPOPCNT_CAPABILITY\"" -DPOPCNT_CAPABILITY \-fno-strict-aliasing -DBOWTIE2_VERSION="\"`cat VERSION`\"" -DBUILD_HOST="\"`hostname`\"" -DBUILD_TIME="\"`date`\"" -DCOMPILER_VERSION="\"`g++ -v 2>&1 | tail -1`\"" -D_LARGEFILE_SOURCE -D_FILE_OFFSET_BITS=64 -D_GNU_SOURCE  -DBOWTIE_MM  -DBOWTIE2 -DNDEBUG -Wall \-I third_party \-o bowtie2-align-s bt2_search.cpp \ccnt_lut.cpp ref_read.cpp alphabet.cpp shmem.cpp edit.cpp bt2_idx.cpp bt2_io.cpp bt2_util.cpp reference.cpp ds.cpp multikey_qsort.cpp limit.cpp random_source.cpp tinythread.cpp qual.cpp pat.cpp sam.cpp read_qseq.cpp aligner_seed_policy.cpp aligner_seed.cpp aligner_seed2.cpp aligner_sw.cpp aligner_sw_driver.cpp aligner_cache.cpp aligner_result.cpp ref_coord.cpp mask.cpp pe.cpp aln_sink.cpp dp_framer.cpp scoring.cpp presets.cpp unique.cpp simple_func.cpp random_util.cpp aligner_bt.cpp sse_util.cpp aligner_swsse.cpp outq.cpp aligner_swsse_loc_i16.cpp aligner_swsse_ee_i16.cpp aligner_swsse_loc_u8.cpp aligner_swsse_ee_u8.cpp aligner_driver.cpp bowtie_main.cpp \-lpthread
g++ -O3 -m64 -msse2  -funroll-loops -g3 -DCOMPILER_OPTIONS="\"-O3 -m64 -msse2  -funroll-loops -g3 -DPOPCNT_CAPABILITY\"" -DPOPCNT_CAPABILITY \-fno-strict-aliasing -DBOWTIE2_VERSION="\"`cat VERSION`\"" -DBUILD_HOST="\"`hostname`\"" -DBUILD_TIME="\"`date`\"" -DCOMPILER_VERSION="\"`g++ -v 2>&1 | tail -1`\"" -D_LARGEFILE_SOURCE -D_FILE_OFFSET_BITS=64 -D_GNU_SOURCE  -DBOWTIE_MM  -DBOWTIE2 -DBOWTIE_64BIT_INDEX -DNDEBUG -Wall \-I third_party \-o bowtie2-align-l bt2_search.cpp \ccnt_lut.cpp ref_read.cpp alphabet.cpp shmem.cpp edit.cpp bt2_idx.cpp bt2_io.cpp bt2_util.cpp reference.cpp ds.cpp multikey_qsort.cpp limit.cpp random_source.cpp tinythread.cpp qual.cpp pat.cpp sam.cpp read_qseq.cpp aligner_seed_policy.cpp aligner_seed.cpp aligner_seed2.cpp aligner_sw.cpp aligner_sw_driver.cpp aligner_cache.cpp aligner_result.cpp ref_coord.cpp mask.cpp pe.cpp aln_sink.cpp dp_framer.cpp scoring.cpp presets.cpp unique.cpp simple_func.cpp random_util.cpp aligner_bt.cpp sse_util.cpp aligner_swsse.cpp outq.cpp aligner_swsse_loc_i16.cpp aligner_swsse_ee_i16.cpp aligner_swsse_loc_u8.cpp aligner_swsse_ee_u8.cpp aligner_driver.cpp bowtie_main.cpp \-lpthread
g++ -O3 -m64 -msse2  -funroll-loops -g3 \-DCOMPILER_OPTIONS="\"-O3 -m64 -msse2  -funroll-loops -g3 -DPOPCNT_CAPABILITY\"" -DPOPCNT_CAPABILITY \-fno-strict-aliasing -DBOWTIE2_VERSION="\"`cat VERSION`\"" -DBUILD_HOST="\"`hostname`\"" -DBUILD_TIME="\"`date`\"" -DCOMPILER_VERSION="\"`g++ -v 2>&1 | tail -1`\"" -D_LARGEFILE_SOURCE -D_FILE_OFFSET_BITS=64 -D_GNU_SOURCE  -DBOWTIE_MM  -DBOWTIE2 -DBOWTIE_INSPECT_MAIN -Wall \-I third_party -I . \-o bowtie2-inspect-s bt2_inspect.cpp \ccnt_lut.cpp ref_read.cpp alphabet.cpp shmem.cpp edit.cpp bt2_idx.cpp bt2_io.cpp bt2_util.cpp reference.cpp ds.cpp multikey_qsort.cpp limit.cpp random_source.cpp tinythread.cpp \-lpthread
g++ -O3 -m64 -msse2  -funroll-loops -g3 \-DCOMPILER_OPTIONS="\"-O3 -m64 -msse2  -funroll-loops -g3 -DPOPCNT_CAPABILITY\"" -DPOPCNT_CAPABILITY \-fno-strict-aliasing -DBOWTIE2_VERSION="\"`cat VERSION`\"" -DBUILD_HOST="\"`hostname`\"" -DBUILD_TIME="\"`date`\"" -DCOMPILER_VERSION="\"`g++ -v 2>&1 | tail -1`\"" -D_LARGEFILE_SOURCE -D_FILE_OFFSET_BITS=64 -D_GNU_SOURCE  -DBOWTIE_MM  -DBOWTIE2 -DBOWTIE_INSPECT_MAIN  -DBOWTIE_64BIT_INDEX -Wall \-I third_party -I . \-o bowtie2-inspect-l bt2_inspect.cpp \ccnt_lut.cpp ref_read.cpp alphabet.cpp shmem.cpp edit.cpp bt2_idx.cpp bt2_io.cpp bt2_util.cpp reference.cpp ds.cpp multikey_qsort.cpp limit.cpp random_source.cpp tinythread.cpp \-lpthread
hadoop@Master:~/xubo/tools/bowtie2$ make install
mkdir -p /usr/local/bin
for file in bowtie2-build-s bowtie2-build-l bowtie2-align-s bowtie2-align-l bowtie2-inspect-s bowtie2-inspect-l bowtie2-inspect bowtie2-build bowtie2 ; do \cp -f $file /usr/local/bin ; \done
cp: cannot create regular file ‘/usr/local/bin/bowtie2-build-s’: Permission denied
cp: cannot create regular file ‘/usr/local/bin/bowtie2-build-l’: Permission denied
cp: cannot create regular file ‘/usr/local/bin/bowtie2-align-s’: Permission denied
cp: cannot create regular file ‘/usr/local/bin/bowtie2-align-l’: Permission denied
cp: cannot create regular file ‘/usr/local/bin/bowtie2-inspect-s’: Permission denied
cp: cannot create regular file ‘/usr/local/bin/bowtie2-inspect-l’: Permission denied
cp: cannot create regular file ‘/usr/local/bin/bowtie2-inspect’: Permission denied
cp: cannot create regular file ‘/usr/local/bin/bowtie2-build’: Permission denied
cp: cannot create regular file ‘/usr/local/bin/bowtie2’: Permission denied
make: *** [install] Error 1
hadoop@Master:~/xubo/tools/bowtie2$ sudo make install
[sudo] password for hadoop:
mkdir -p /usr/local/bin
for file in bowtie2-build-s bowtie2-build-l bowtie2-align-s bowtie2-align-l bowtie2-inspect-s bowtie2-inspect-l bowtie2-inspect bowtie2-build bowtie2 ; do \cp -f $file /usr/local/bin ; \done

3.使用:
(1)建立索引:

hadoop@Master:~/cloud/adam/xubo/data/GRCH38Sub/cs-bwamem$ bowtie2-build GRCH38BWAindex/GRCH38chr1L3556522.fasta bowtie2/index
Settings:Output files: "bowtie2/index.*.bt2"Line rate: 6 (line is 64 bytes)Lines per side: 1 (side is 64 bytes)Offset rate: 4 (one in 16)FTable chars: 10Strings: unpackedMax bucket size: defaultMax bucket size, sqrt multiplier: defaultMax bucket size, len divisor: 4Difference-cover sample period: 1024Endianness: littleActual local endianness: littleSanity checking: disabledAssertions: disabledRandom seed: 0Sizeofs: void*:8, int:4, long:8, size_t:8
Input files DNA, FASTA:GRCH38BWAindex/GRCH38chr1L3556522.fasta
Building a SMALL index
Reading reference sizesTime reading reference sizes: 00:00:03
Calculating joined length
Writing header
Reserving space for joined string
Joining reference sequencesTime to join reference sequences: 00:00:03
bmax according to bmaxDivN setting: 57620253
Using parameters --bmax 43215190 --dcv 1024Doing ahead-of-time memory usage testPassed!  Constructing with these parameters: --bmax 43215190 --dcv 1024
Constructing suffix-array element generator
Building DifferenceCoverSampleBuilding sPrimeBuilding sPrimeOrderV-Sorting samplesV-Sorting samples time: 00:00:05Allocating rank arrayRanking v-sort outputRanking v-sort output time: 00:00:01Invoking Larsson-Sadakane on ranksInvoking Larsson-Sadakane on ranks time: 00:00:02Sanity-checking and returning
Building samples
Reserving space for 12 sample suffixes
Generating random suffixes
QSorting 12 sample offsets, eliminating duplicates
QSorting sample offsets, eliminating duplicates time: 00:00:00
Multikey QSorting 12 samples(Using difference cover)Multikey QSorting samples time: 00:00:00
Calculating bucket sizes
Splitting and mergingSplitting and merging time: 00:00:00
Avg bucket size: 3.29259e+07 (target: 43215189)
Converting suffix-array elements to index image
Allocating ftab, absorbFtab
Entering Ebwt loop
Getting block 1 of 7Reserving size (43215190) for bucket 1Calculating Z arrays for bucket 1Entering block accumulator loop for bucket 1:bucket 1: 10%bucket 1: 20%bucket 1: 30%bucket 1: 40%bucket 1: 50%bucket 1: 60%bucket 1: 70%bucket 1: 80%bucket 1: 90%bucket 1: 100%Sorting block of length 41559493 for bucket 1(Using difference cover)Sorting block time: 00:00:16
Returning block of 41559494 for bucket 1
Getting block 2 of 7Reserving size (43215190) for bucket 2Calculating Z arrays for bucket 2Entering block accumulator loop for bucket 2:bucket 2: 10%bucket 2: 20%bucket 2: 30%bucket 2: 40%bucket 2: 50%bucket 2: 60%bucket 2: 70%bucket 2: 80%bucket 2: 90%bucket 2: 100%Sorting block of length 36821901 for bucket 2(Using difference cover)Sorting block time: 00:00:13
Returning block of 36821902 for bucket 2
Getting block 3 of 7Reserving size (43215190) for bucket 3Calculating Z arrays for bucket 3Entering block accumulator loop for bucket 3:bucket 3: 10%bucket 3: 20%bucket 3: 30%bucket 3: 40%bucket 3: 50%bucket 3: 60%bucket 3: 70%bucket 3: 80%bucket 3: 90%bucket 3: 100%Sorting block of length 41919124 for bucket 3(Using difference cover)Sorting block time: 00:00:16
Returning block of 41919125 for bucket 3
Getting block 4 of 7Reserving size (43215190) for bucket 4Calculating Z arrays for bucket 4Entering block accumulator loop for bucket 4:bucket 4: 10%bucket 4: 20%bucket 4: 30%bucket 4: 40%bucket 4: 50%bucket 4: 60%bucket 4: 70%bucket 4: 80%bucket 4: 90%bucket 4: 100%Sorting block of length 24681605 for bucket 4(Using difference cover)Sorting block time: 00:00:09
Returning block of 24681606 for bucket 4
Getting block 5 of 7Reserving size (43215190) for bucket 5Calculating Z arrays for bucket 5Entering block accumulator loop for bucket 5:bucket 5: 10%bucket 5: 20%bucket 5: 30%bucket 5: 40%bucket 5: 50%bucket 5: 60%bucket 5: 70%bucket 5: 80%bucket 5: 90%bucket 5: 100%Sorting block of length 41918922 for bucket 5(Using difference cover)Sorting block time: 00:00:15
Returning block of 41918923 for bucket 5
Getting block 6 of 7Reserving size (43215190) for bucket 6Calculating Z arrays for bucket 6Entering block accumulator loop for bucket 6:bucket 6: 10%bucket 6: 20%bucket 6: 30%bucket 6: 40%bucket 6: 50%bucket 6: 60%bucket 6: 70%bucket 6: 80%bucket 6: 90%bucket 6: 100%Sorting block of length 8244387 for bucket 6(Using difference cover)Sorting block time: 00:00:03
Returning block of 8244388 for bucket 6
Getting block 7 of 7Reserving size (43215190) for bucket 7Calculating Z arrays for bucket 7Entering block accumulator loop for bucket 7:bucket 7: 10%bucket 7: 20%bucket 7: 30%bucket 7: 40%bucket 7: 50%bucket 7: 60%bucket 7: 70%bucket 7: 80%bucket 7: 90%bucket 7: 100%Sorting block of length 35335574 for bucket 7(Using difference cover)Sorting block time: 00:00:13
Returning block of 35335575 for bucket 7
Exited Ebwt loop
fchr[A]: 0
fchr[C]: 67070277
fchr[G]: 115125320
fchr[T]: 163236848
fchr[$]: 230481012
Exiting Ebwt::buildToDisk()
Returning from initFromVector
Wrote 81023576 bytes to primary EBWT file: bowtie2/index.1.bt2
Wrote 57620260 bytes to secondary EBWT file: bowtie2/index.2.bt2
Re-opening _in1 and _in2 as input streams
Returning from Ebwt constructor
Headers:len: 230481012bwtLen: 230481013sz: 57620253bwtSz: 57620254lineRate: 6offRate: 4offMask: 0xfffffff0ftabChars: 10eftabLen: 20eftabSz: 80ftabLen: 1048577ftabSz: 4194308offsLen: 14405064offsSz: 57620256lineSz: 64sideSz: 64sideBwtSz: 48sideBwtLen: 192numSides: 1200422numLines: 1200422ebwtTotLen: 76827008ebwtTotSz: 76827008color: 0reverse: 0
Total time for call to driver() for forward index: 00:02:50
Reading reference sizesTime reading reference sizes: 00:00:02
Calculating joined length
Writing header
Reserving space for joined string
Joining reference sequencesTime to join reference sequences: 00:00:02Time to reverse reference sequence: 00:00:01
bmax according to bmaxDivN setting: 57620253
Using parameters --bmax 43215190 --dcv 1024Doing ahead-of-time memory usage testPassed!  Constructing with these parameters: --bmax 43215190 --dcv 1024
Constructing suffix-array element generator
Building DifferenceCoverSampleBuilding sPrimeBuilding sPrimeOrderV-Sorting samplesV-Sorting samples time: 00:00:05Allocating rank arrayRanking v-sort outputRanking v-sort output time: 00:00:01Invoking Larsson-Sadakane on ranksInvoking Larsson-Sadakane on ranks time: 00:00:02Sanity-checking and returning
Building samples
Reserving space for 12 sample suffixes
Generating random suffixes
QSorting 12 sample offsets, eliminating duplicates
QSorting sample offsets, eliminating duplicates time: 00:00:00
Multikey QSorting 12 samples(Using difference cover)Multikey QSorting samples time: 00:00:00
Calculating bucket sizes
Splitting and mergingSplitting and merging time: 00:00:00
Split 2, merged 6; iterating...
Splitting and mergingSplitting and merging time: 00:00:00
Split 1, merged 1; iterating...
Splitting and mergingSplitting and merging time: 00:00:00
Split 1, merged 1; iterating...
Splitting and mergingSplitting and merging time: 00:00:00
Split 1, merged 1; iterating...
Splitting and mergingSplitting and merging time: 00:00:00
Avg bucket size: 2.88101e+07 (target: 43215189)
Converting suffix-array elements to index image
Allocating ftab, absorbFtab
Entering Ebwt loop
Getting block 1 of 8Reserving size (43215190) for bucket 1Calculating Z arrays for bucket 1Entering block accumulator loop for bucket 1:bucket 1: 10%bucket 1: 20%bucket 1: 30%bucket 1: 40%bucket 1: 50%bucket 1: 60%bucket 1: 70%bucket 1: 80%bucket 1: 90%bucket 1: 100%Sorting block of length 28292839 for bucket 1(Using difference cover)Sorting block time: 00:00:10
Returning block of 28292840 for bucket 1
Getting block 2 of 8Reserving size (43215190) for bucket 2Calculating Z arrays for bucket 2Entering block accumulator loop for bucket 2:bucket 2: 10%bucket 2: 20%bucket 2: 30%bucket 2: 40%bucket 2: 50%bucket 2: 60%bucket 2: 70%bucket 2: 80%bucket 2: 90%bucket 2: 100%Sorting block of length 39729933 for bucket 2(Using difference cover)Sorting block time: 00:00:15
Returning block of 39729934 for bucket 2
Getting block 3 of 8Reserving size (43215190) for bucket 3Calculating Z arrays for bucket 3Entering block accumulator loop for bucket 3:bucket 3: 10%bucket 3: 20%bucket 3: 30%bucket 3: 40%bucket 3: 50%bucket 3: 60%bucket 3: 70%bucket 3: 80%bucket 3: 90%bucket 3: 100%Sorting block of length 14119472 for bucket 3(Using difference cover)Sorting block time: 00:00:05
Returning block of 14119473 for bucket 3
Getting block 4 of 8Reserving size (43215190) for bucket 4Calculating Z arrays for bucket 4Entering block accumulator loop for bucket 4:bucket 4: 10%bucket 4: 20%bucket 4: 30%bucket 4: 40%bucket 4: 50%bucket 4: 60%bucket 4: 70%bucket 4: 80%bucket 4: 90%bucket 4: 100%Sorting block of length 41568163 for bucket 4(Using difference cover)Sorting block time: 00:00:15
Returning block of 41568164 for bucket 4
Getting block 5 of 8Reserving size (43215190) for bucket 5Calculating Z arrays for bucket 5Entering block accumulator loop for bucket 5:bucket 5: 10%bucket 5: 20%bucket 5: 30%bucket 5: 40%bucket 5: 50%bucket 5: 60%bucket 5: 70%bucket 5: 80%bucket 5: 90%bucket 5: 100%Sorting block of length 37733383 for bucket 5(Using difference cover)Sorting block time: 00:00:14
Returning block of 37733384 for bucket 5
Getting block 6 of 8Reserving size (43215190) for bucket 6Calculating Z arrays for bucket 6Entering block accumulator loop for bucket 6:bucket 6: 10%bucket 6: 20%bucket 6: 30%bucket 6: 40%bucket 6: 50%bucket 6: 60%bucket 6: 70%bucket 6: 80%bucket 6: 90%bucket 6: 100%Sorting block of length 23633964 for bucket 6(Using difference cover)Sorting block time: 00:00:09
Returning block of 23633965 for bucket 6
Getting block 7 of 8Reserving size (43215190) for bucket 7Calculating Z arrays for bucket 7Entering block accumulator loop for bucket 7:bucket 7: 10%bucket 7: 20%bucket 7: 30%bucket 7: 40%bucket 7: 50%bucket 7: 60%bucket 7: 70%bucket 7: 80%bucket 7: 90%bucket 7: 100%Sorting block of length 34621654 for bucket 7(Using difference cover)Sorting block time: 00:00:13
Returning block of 34621655 for bucket 7
Getting block 8 of 8Reserving size (43215190) for bucket 8Calculating Z arrays for bucket 8Entering block accumulator loop for bucket 8:bucket 8: 10%bucket 8: 20%bucket 8: 30%bucket 8: 40%bucket 8: 50%bucket 8: 60%bucket 8: 70%bucket 8: 80%bucket 8: 90%bucket 8: 100%Sorting block of length 10781597 for bucket 8(Using difference cover)Sorting block time: 00:00:03
Returning block of 10781598 for bucket 8
Exited Ebwt loop
fchr[A]: 0
fchr[C]: 67070277
fchr[G]: 115125320
fchr[T]: 163236848
fchr[$]: 230481012
Exiting Ebwt::buildToDisk()
Returning from initFromVector
Wrote 81023576 bytes to primary EBWT file: bowtie2/index.rev.1.bt2
Wrote 57620260 bytes to secondary EBWT file: bowtie2/index.rev.2.bt2
Re-opening _in1 and _in2 as input streams
Returning from Ebwt constructor
Headers:len: 230481012bwtLen: 230481013sz: 57620253bwtSz: 57620254lineRate: 6offRate: 4offMask: 0xfffffff0ftabChars: 10eftabLen: 20eftabSz: 80ftabLen: 1048577ftabSz: 4194308offsLen: 14405064offsSz: 57620256lineSz: 64sideSz: 64sideBwtSz: 48sideBwtLen: 192numSides: 1200422numLines: 1200422ebwtTotLen: 76827008ebwtTotSz: 76827008color: 0reverse: 1
Total time for backward call to driver() for mirror index: 00:03:36

(2)匹配:

hadoop@Master:~/cloud/adam/xubo/data/GRCH38Sub/cs-bwamem$ bowtie2 -x bowtie2/index -U g38L100c10000000Nhs20.fq -S bowtie2/g38L100c10000000Nhs20.bowtie2.sam
9257464 reads; of these:9257464 (100.00%) were unpaired; of these:4401 (0.05%) aligned 0 times7741394 (83.62%) aligned exactly 1 time1511669 (16.33%) aligned >1 times
99.95% overall alignment rate

居然没有统计时间。。。。
统计结果:

hadoop@Master:~/cloud/adam/xubo/data/GRCH38Sub/cs-bwamem$ samtools flagstat bowtie2/g38L100c10000000Nhs20.bowtie2.sam
9257464 + 0 in total (QC-passed reads + QC-failed reads)
0 + 0 secondary
0 + 0 supplementary
0 + 0 duplicates
9253063 + 0 mapped (99.95% : N/A)
0 + 0 paired in sequencing
0 + 0 read1
0 + 0 read2
0 + 0 properly paired (N/A : N/A)
0 + 0 with itself and mate mapped
0 + 0 singletons (N/A : N/A)
0 + 0 with mate mapped to a different chr
0 + 0 with mate mapped to a different chr (mapQ>=5)

参考:
【1】http://bowtie-bio.sourceforge.net/bowtie2/manual.shtml#getting-started-with-bowtie-2-lambda-phage-example
【2】http://blog.sciencenet.cn/blog-830496-750216.html

参考片段in 【1】:

Getting started with Bowtie 2: Lambda phage exampleBowtie 2 comes with some example files to get you started. The example files are not scientifically significant; we use the Lambda phage reference genome simply because it's short, and the reads were generated by a computer program, not a sequencer. However, these files will let you start running Bowtie 2 and downstream tools right away.First follow the manual instructions to obtain Bowtie 2. Set the BT2_HOME environment variable to point to the new Bowtie 2 directory containing the bowtie2, bowtie2-build and bowtie2-inspect binaries. This is important, as the BT2_HOME variable is used in the commands below to refer to that directory.
Indexing a reference genomeTo create an index for the Lambda phage reference genome included with Bowtie 2, create a new temporary directory (it doesn't matter where), change into that directory, and run:$BT2_HOME/bowtie2-build $BT2_HOME/example/reference/lambda_virus.fa lambda_virusThe command should print many lines of output then quit. When the command completes, the current directory will contain four new files that all start with lambda_virus and end with .1.bt2, .2.bt2, .3.bt2, .4.bt2, .rev.1.bt2, and .rev.2.bt2. These files constitute the index - you're done!You can use bowtie2-build to create an index for a set of FASTA files obtained from any source, including sites such as UCSC, NCBI, and Ensembl. When indexing multiple FASTA files, specify all the files using commas to separate file names. For more details on how to create an index with bowtie2-build, see the manual section on index building. You may also want to bypass this process by obtaining a pre-built index. See using a pre-built index below for an example.
Aligning example readsStay in the directory created in the previous step, which now contains the lambda_virus index files. Next, run:$BT2_HOME/bowtie2 -x lambda_virus -U $BT2_HOME/example/reads/reads_1.fq -S eg1.samThis runs the Bowtie 2 aligner, which aligns a set of unpaired reads to the Lambda phage reference genome using the index generated in the previous step. The alignment results in SAM format are written to the file eg1.sam, and a short alignment summary is written to the console. (Actually, the summary is written to the "standard error" or "stderr" filehandle, which is typically printed to the console.)To see the first few lines of the SAM output, run:head eg1.sam

时间比较:

hadoop@Master:~/cloud/adam/xubo/data/GRCH38Sub/cs-bwamem/bowtie2$ ./run.sh
9257464 reads; of these:9257464 (100.00%) were unpaired; of these:4401 (0.05%) aligned 0 times7741394 (83.62%) aligned exactly 1 time1511669 (16.33%) aligned >1 times
99.95% overall alignment rate
1=> RunTime:2480.267404 s
********************************************
9257464 reads; of these:9257464 (100.00%) were unpaired; of these:4401 (0.05%) aligned 0 times7741394 (83.62%) aligned exactly 1 time1511669 (16.33%) aligned >1 times
99.95% overall alignment rate
2=> RunTime:2405.534448 s
********************************************
9257464 reads; of these:9257464 (100.00%) were unpaired; of these:4401 (0.05%) aligned 0 times7741394 (83.62%) aligned exactly 1 time1511669 (16.33%) aligned >1 times
99.95% overall alignment rate
3=> RunTime:2396.183261 s
********************************************

参考

【1】https://github.com/xubo245/AdamLearning
【2】https://github.com/bigdatagenomics/adam/
【3】https://github.com/xubo245/SparkLearning
【4】http://spark.apache.org
【5】http://stackoverflow.com/questions/28166667/how-to-pass-d-parameter-or-environment-variable-to-spark-job
【6】http://stackoverflow.com/questions/28840438/how-to-override-sparks-log4j-properties-per-driver

研究成果:

【1】 [BIBM] Bo Xu, Changlong Li, Hang Zhuang, Jiali Wang, Qingfeng Wang, Chao Wang, and Xuehai Zhou, "Distributed Gene Clinical Decision Support System Based on Cloud Computing", in IEEE International Conference on Bioinformatics and Biomedicine. (BIBM 2017, CCF B)
【2】 [IEEE CLOUD] Bo Xu, Changlong Li, Hang Zhuang, Jiali Wang, Qingfeng Wang, Xuehai Zhou. Efficient Distributed Smith-Waterman Algorithm Based on Apache Spark (CLOUD 2017, CCF-C).
【3】 [CCGrid] Bo Xu, Changlong Li, Hang Zhuang, Jiali Wang, Qingfeng Wang, Jinhong Zhou, Xuehai Zhou. DSA: Scalable Distributed Sequence Alignment System Using SIMD Instructions. (CCGrid 2017, CCF-C).
【4】more: https://github.com/xubo245/Publications

Help

If you have any questions or suggestions, please write it in the issue of this project or send an e-mail to me: xubo245@mail.ustc.edu.cn
Wechat: xu601450868
QQ: 601450868

基因数据处理69之bowtie安装与使用相关推荐

  1. 基因数据处理72之GATK安装成功

    1.下载: git clone https://github.com/broadgsa/gatk-protected.git 2.安装: git checkout 3.5 mvn clean pack ...

  2. 基因数据处理44之cloud-scale-bwamem安装

    cloud-scale-bwamem是在spark等云环境上实现bwa-mem算法,加速对全基因组匹配的处理 1.下载: git clone https://github.com/ytchen0323 ...

  3. 基因数据处理70之Picard安装没成功

    1.下载: https://github.com/broadinstitute/picard.git 2.安装: hadoop@Master:~/xubo/tools/picard$ ant clon ...

  4. 基因数据处理8之BWA_MEM小数据集处理(成功)

    基因数据处理8之BWA_MEM小数据集处理 环境:ubuntu14.04 6G内存 参考基因:GRCH38 来源请参考[1] 1.fastq数据:SRR003161.fastq 的头20行,即5条re ...

  5. 基因数据处理1之mapping_to_cram

    基因数据处理1之mapping_to_cram 参考资料: A Worked Example Obtain some public data We will use the first 100,000 ...

  6. 基因数据处理123之SSW代码不正确,到时比SparkSW时间长

    更多代码请见:https://github.com/xubo245 基因数据处理系列 1.解释 由于要生成新的score matrix:blosum50,第一次使用静态方法,直接传给align,到时每 ...

  7. 基因数据处理12之samtool的tview来查看sam的匹配文件

    基因数据处理12之samtool的tview来查看sam的匹配文件 具体的之前有文章讲过:http://blog.csdn.net/xubo245/article/details/50836185 记 ...

  8. 基因数据处理118之SSW运行

    更多代码请见:https://github.com/xubo245 基因数据处理系列 1.解释 SSW是一个更快的SW算法,并且提供了c语言lib和java的调用 代码: https://github ...

  9. 基因数据处理120之scala调用SSW在linux下运行

    更多代码请见:https://github.com/xubo245 基因数据处理系列 1.解释 先有java提供转换,使用jni调用c 然后scala调用java 2.代码: 2.1 java: pa ...

最新文章

  1. 方法革新:8个宏基因组分析新工具 | 热心肠日报
  2. 日常运维管理技巧十五(htop使用说明top增强版)(转载)
  3. 把广告投给你要几步,一文带你了解广告业务
  4. Python编程语言学习:for循环中常用方法经验技巧(利用enumerate函数对列表实现自带索引等)之详细攻略
  5. MyBatis拦截器有哪些以及分析
  6. linux eclipse报错日志,centos6.8命令行启动eclipse报org.eclipse.swt.SWTError错误
  7. oracle中存储过程可见权限,Oracle数据库存储过程与权限
  8. python递归函数详解-讲解Python中的递归函数
  9. python2020未来前景_2020年Python发展前景如何呢?
  10. python视频 神经网络 Tensorflow
  11. ios开发网络学习九:NSURLSessionDownloadTask实现大文件下载
  12. C#将十进制转二进制
  13. 转换大师swf转html,iPixSoft SWF to HTML5 Converter
  14. 老毛子(Padavan)路由器解锁网易云灰色音乐
  15. mt6735通用recovery_mt6735刷机包下载
  16. 车上控制器中英文对照
  17. 重言式判别 (数据结构课程设计)
  18. 企业要如何利用360评估法做好人才盘点?
  19. 一切成功源于积累——20140928 认识货币——英镑
  20. 这些面试技巧,必须要掌握!

热门文章

  1. 观察 | 当红DEFI:在暗礁与浅滩之间
  2. ArrayList不为人知的小秘密
  3. iview-admin源码分析(三):登录页面及逻辑实现
  4. Flutter插件汇总
  5. oracle pga 1407,ogg replicat OGG-00869 OCI Error ORA-01407 OGG-01154 SQL error 1407
  6. 1407:笨小猴(C C++)
  7. SpringBoot资料合集-03【Spring Boot 热部署与日志】
  8. 大数据预科班作业10
  9. 4G模块DTU网关接入平台系列教程(一):OneNET平台接入教程详解
  10. 恒玄BES调试笔记-BES2500如何Dump Anc Audio数据