Tools for fast and flexible genome assembly scaffolding and improvement

martinjvickers/RagTag

 
 

RagTag

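RagTag is a collection of software tools for scaffolding and improving modern genome assemblies. Tasks include:

  • correcting a query assembly against a reference (correct)
  • scaffolding a query assembly against a reference (scaffold)
  • merging scaffolds built from multiple references or maps, optionally using Hi-C to resolve conflicts (merge)
  • making joins and filling gaps in a target assembly using sequences from a query assembly (patch)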

RagTag also provides command line utilities for working with common genome assembly file formats.
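One such format is AGP, which the merge commands in Getting Started take as input. The sketch below is not RagTag code; it is a minimal illustration of what an AGP record contains, assuming the standard tab-delimited, nine-column AGP layout in which component types N and U denote gaps. The function name `parse_agp` and the example lines are hypothetical.

```python
from typing import Dict, Iterable, Iterator, Union

# Assumption: in the AGP format, component types "N" and "U" mark gap lines;
# every other component type marks a sequence component.
GAP_TYPES = {"N", "U"}


def parse_agp(lines: Iterable[str]) -> Iterator[Dict[str, Union[str, int]]]:
    """Yield one dict per AGP record, skipping comments and blank lines."""
    for raw in lines:
        line = raw.rstrip("\n")
        if not line or line.startswith("#"):
            continue
        fields = line.split("\t")
        if len(fields) != 9:
            raise ValueError(f"expected 9 tab-separated columns, got {len(fields)}")
        record: Dict[str, Union[str, int]] = {
            "object": fields[0],
            "object_beg": int(fields[1]),
            "object_end": int(fields[2]),
            "part_number": int(fields[3]),
            "component_type": fields[4],
        }
        if fields[4] in GAP_TYPES:
            # Gap line: columns 6-9 describe the gap itself.
            record.update(
                gap_length=int(fields[5]),
                gap_type=fields[6],
                linkage=fields[7],
                linkage_evidence=fields[8],
            )
        else:
            # Sequence line: columns 6-9 describe the placed component.
            record.update(
                component_id=fields[5],
                component_beg=int(fields[6]),
                component_end=int(fields[7]),
                orientation=fields[8],
            )
        yield record


# Hypothetical AGP lines, made up for illustration only.
example = [
    "# hypothetical scaffold built from two contigs",
    "scaf1\t1\t1000\t1\tW\tctgA\t1\t1000\t+",
    "scaf1\t1001\t1100\t2\tN\t100\tscaffold\tyes\talign_genus",
    "scaf1\t1101\t1600\t3\tW\tctgB\t1\t500\t-",
]
records = list(parse_agp(example))
for rec in records:
    print(rec["object"], rec["part_number"], rec["component_type"])
```

Each sequence record says where a contig sits in a scaffold and in which orientation, which is the information a scaffold-merging step needs from each input map.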

Getting Started

# install with conda
conda install -c bioconda ragtag

# correct a query assembly
ragtag.py correct ref.fasta query.fasta

# scaffold a query assembly
ragtag.py scaffold ref.fasta query.fasta

# scaffold with multiple references/maps
ragtag.py scaffold -o out_1 ref1.fasta query.fasta
ragtag.py scaffold -o out_2 ref2.fasta query.fasta
ragtag.py merge query.fasta out_*/*.agp other.map.agp

# use Hi-C to resolve conflicts
ragtag.py merge -b hic.bam query.fasta out_*/*.agp other.map.agp

# make joins and fill gaps in target.fa using sequences from query.fa
ragtag.py patch target.fa query.fa
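The multi-reference workflow above can also be driven from a script. The following is a minimal sketch, not part of RagTag: the helper names `scaffold_cmd` and `merge_cmd` are hypothetical, and only the subcommands and the `-o` and `-b` options shown in the commands above are used. It builds and prints the command lines as a dry run and does not execute anything.

```python
import shlex
from typing import List, Optional, Sequence


def scaffold_cmd(ref: str, query: str, out_dir: Optional[str] = None) -> List[str]:
    """Build a 'ragtag.py scaffold' command as an argument list."""
    cmd = ["ragtag.py", "scaffold"]
    if out_dir is not None:
        cmd += ["-o", out_dir]
    return cmd + [ref, query]


def merge_cmd(query: str, agp_files: Sequence[str],
              hic_bam: Optional[str] = None) -> List[str]:
    """Build a 'ragtag.py merge' command, optionally with a Hi-C BAM file."""
    cmd = ["ragtag.py", "merge"]
    if hic_bam is not None:
        cmd += ["-b", hic_bam]
    return cmd + [query] + list(agp_files)


# Dry run: one scaffolding command per reference.
references = ["ref1.fasta", "ref2.fasta"]
commands = [
    scaffold_cmd(ref, "query.fasta", out_dir=f"out_{i}")
    for i, ref in enumerate(references, start=1)
]
for cmd in commands:
    print(shlex.join(cmd))

# The merge step needs the AGP files written by the scaffolding runs. An
# argument list is not passed through a shell, so a pattern such as
# out_*/*.agp must be expanded first, for example with
# sorted(glob.glob("out_*/*.agp")), once those runs have finished.
```

To execute a command rather than print it, pass the list to `subprocess.run(cmd, check=True)`, so that a failed step stops the script instead of feeding incomplete output to the merge.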

Docs

Please see the Wiki for detailed documentation.

Dependencies

  • Minimap2, Unimap, or Nucmer
  • Python 3 (with the following auto-installed packages)
    • numpy
    • intervaltree
    • pysam
    • networkx
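A quick way to confirm that an environment meets these requirements is sketched below. It is an illustration rather than a RagTag feature, and it assumes that the aligners are installed under the executable names `minimap2`, `unimap`, and `nucmer` and that each Python package is importable under the name listed above. The function name `check_dependencies` is hypothetical.

```python
import importlib.util
import shutil
from typing import Dict

# Assumed executable names for the three supported aligners.
ALIGNERS = ("minimap2", "unimap", "nucmer")
# Assumed import names, identical to the package names listed above.
PY_PACKAGES = ("numpy", "intervaltree", "pysam", "networkx")


def check_dependencies() -> Dict[str, object]:
    """Report which aligners are on PATH and which packages are importable."""
    aligners = {name: shutil.which(name) is not None for name in ALIGNERS}
    packages = {
        name: importlib.util.find_spec(name) is not None for name in PY_PACKAGES
    }
    return {
        "aligners": aligners,
        "packages": packages,
        # At least one aligner is needed, but every package is required.
        "ok": any(aligners.values()) and all(packages.values()),
    }


report = check_dependencies()
for group in ("aligners", "packages"):
    for name, found in report[group].items():
        print(f"{group[:-1]:8s} {name:14s} {'found' if found else 'missing'}")
print("environment ready" if report["ok"] else "environment incomplete")
```

The check only tests for presence; it does not verify tool or package versions.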

Citation

RagTag supersedes RaGOO:

Acknowledgments

Many of the major algorithmic improvements relative to RaGOO's first release were provided by Aleksey Zimin, lead developer of the MaSuRCA assembler. Luca Venturini suggested and initially implemented many feature enhancements, such as pysam integration. RagTag "merge" was inspired by CAMSA. The developer of CAMSA, Sergey Aganezov, helped review relevant RagTag code. RagTag "patch" was inspired by Grafter, a scaffolding tool written by Melanie Kirsche. Melanie provided guidance for the RagTag implementation. Michael Schatz has provided guidance for the whole project.
