一 。how Lordec maps the long reads to DeBruijn Graph

hello,

I would like how LorDEC maps long reads to debruijn graph. After construction of deBruijn graph of short reads ,how map the set of erroneous long reads to DeBuijn graph. it's not clear in the paper.

can you help me please?

Thank you for your response.

As far as I understand, after forming DBG graphs from short reads (SR),the use and store data in GATB library which allows to traverse any path in the graph and to get the sequence of any node. also PacBio long reads (LR) are divided to k-mer which will be compared (traversed) to the k-mer in the DBG of SR.

They used solid k-mer as anchor to correct PacBio reads

We found that if we additionally require that for a k-mer to be considered solid, it must also have at least one incoming and at least one outgoing arc   我们发现,如果我们额外要求一个k-mer被认为是强的 ,它也必须至少有一个传入和至少一个传出弧

how they correct the reads?

Consider the k-mers of a long read starting at position 1,2,3, … : some k-mers belong to the graph and are solid, while others do not and are weak. Basically, solid k-mers are expected to be correct, while weak ones suspectedly include sequencing errors and require a correction. Solid k-mers are entry points in the DBG, and LoRDEC corrects a region made of weak k-mers by finding the best path in the DBG between the solid k-mers bordering this region. Sometimes, an LR has no solid k-mer, in which case, LoRDEC marks it as such in the output and skips it.

The idea here is traversing not mapping (as I understand ) for more about traversing graph traverse

and here explanation about how it works

http://ivory.idyll.org/blog/2015-wok-error-correction.html

and here is a paper talking about the mapping of sequence to DBG

Read mapping on de Bruijn graphs

二。And LORMA? have you an explanation about LORMA?

It is divided into two phases :

  1. first using lordec with different k-mer to self-coorect the long
  2. reads second further correction with the multiple alignment technique by LoRMA

how Lordec maps the long reads to DeBruijn Graph相关推荐

  1. GATB=The Genome Analysis Toolbox with de-Bruijn graph 带有de-Bruijn图的基因组分析工具箱

    带有de-Bruijn图的基因组分析工具箱 Software Depending on your needs, you can choose one the following software co ...

  2. LoRDEC: a tool for correcting errors in long sequencing reads 纠正长序列读取错误的工具

    LoRDEC: a tool for correcting errors in long sequencing reads Eric Rivals 21st August 2015 Abstract ...

  3. LoRDEC的使用(LoRDEC-0.6.tar.gz)

    参考 Laurent Bouri, Dominique Lavenier. Evaluation of long read error correction software. [Research R ...

  4. Bi-level error correction for PacBio long reads

    Bi-level error correction for PacBio long reads 双级错误校正PacBio长read 最新的测序技术,如太平洋生物科学公司(PacBio)和牛津纳米孔机器 ...

  5. 三代测序纠错软件汇总篇

    三代测序纠错软件汇总篇 原创: 李海滨 诺禾科服 2017-12-21 在之前推出的一篇微信中,已经介绍过了三代测序下机数据"三代全长转录组测序常见问题说明".那么我们拿到数据后是 ...

  6. Bioinformatics: Assembling Genomes (week 1-2)

    Bioinformatics: Assembling Genomes (week 1-2) 本文为Coursera课程 Assembling Genomes and Sequencing Antibi ...

  7. Denoising DNA deep sequencing data—high-throughput sequencing errors and their correction

    去噪DNA深度测序数据,高通量测序误差及其校正 Abstract 描述常见高通量测序平台产生的错误并从技术人工制品中识别出真正的遗传变异是两个相互依赖的步骤,对许多分析如单核苷酸变异调用.单倍型推断. ...

  8. 一文详解基因组denovo组装原理和实战

    关于更多生物医疗大数据分析工具和软件的介绍和使用请看六点了官网[1]. 图文:心如止水  编辑 marple 目录 1.基因组组装 2.基于De-Bruijn Graph的组装算法 3.SOAPden ...

  9. Trinity简介(1)--用于无参考基因组的转录组de novo组装

    一. Trinity简介 Trinity,是由 the Broad Institute 开发的转录组de novo组装软件,由三个独立的软件模块组成: Inchworm,Chrysalis和Butte ...

最新文章

  1. String.Format()方法
  2. 解决Mysql错误Too many connections的方法
  3. c语言中24个字母,菜鸟求助,写一个随机输出26个英文字母的程序
  4. blog搬家通知---------------------------------------
  5. shell 登录mysql 然后quit_使用工具Xshell实现在linux上登录mysql和退出mysql的相关操作讲解...
  6. VirtualBox虚拟机网络环境解析和搭建-NAT、桥接、Host-Only、Internal、端口映射
  7. ivew 的ajax,iView-Upload组件分析
  8. CSS——Fonts(字体)
  9. ssm+maven+eclipse框架搭建
  10. redis 数据类型详解 以及 redis适用场景场合
  11. SQLite学习笔记(二)--VC调用环境搭建
  12. imx6ull的boot, 之我的理解
  13. unity全栈开发是什么意思_unity3D用什么语言开发好?
  14. SpringMVC学习----请求与响应
  15. OpenCV基础学习
  16. 《风之旅人》游戏设计思想二
  17. 存储技术(一)-基本概念和技术发展
  18. MySQL Day05 子查询、函数、MD5加密、SELECT小结
  19. 二叉树的前中后序遍历(栈)(C++)
  20. mysql 表情符号_解决mysql存储特殊文字(表情符号)utf8mb4

热门文章

  1. GICP:基于体素泛化ICP方式的准确快速点云配准方法
  2. VoVNet、VarGNet、PeleeNet:轻量级实时网络(嵌入端)
  3. 图像算法可以稳定处理视频了!港科大开源通用算法,解决视频处理时域不稳定问题|NeurIPS 2020...
  4. mysql5.5.9_mysql5.5.9字符集设置
  5. 状态码302.。。。
  6. 剑指 Offer 17. 打印从1到最大的n位数
  7. 边缘检测、霍夫变换、重映射、仿射变换、直方图均衡化
  8. RDKit | 基于RDKit绘制黑白颜色的分子
  9. 第十八课.支持向量机
  10. 东京奥运会73枚动态图标刷爆朋友圈,中国网友怒赞:不愧是设计大国!