How to evaluate an assembly
VirAmp uses QUAST (Quality Assessment Tools for Genome Assemblies) to evaluate the newly assembled genome.
Basic statistics such as contig lengths, N50, NG50, # misassemblies etc. are given on screen, and more detailed report is provided for download on the right "history" panel.
Two important statistics here are N50 and NG50.
What to input
What is shown on screen:
Assembly viramp # contigs (>= 0 bp) 5 # contigs (>= 1000 bp) 5 Total length (>= 0 bp) 135722 Total length (>= 1000 bp) 135722 # contigs 5 Largest contig 108094 Total length 135722 Reference length 136376 GC (%) 67.36 Reference GC (%) 67.53 N50 108094 NG50 108094 N75 108094 NG75 108094 # misassemblies 0 # local misassemblies 3 # unaligned contigs 0 + 1 part Unaligned contigs length 95 Genome fraction (%) 98.610 Duplication ratio 1.009 # N's per 100 kbp 106.84 # mismatches per 100 kbp 472.19 # indels per 100 kbp 72.87 Largest alignment 107807 NA50 107807 NGA50 107807 NA75 107807 NGA75 107807
Featured graphs in the download version
Reference
QUAST: quality assessment tool for genome assemblies. Alexey Gurevich, Vladislav Saveliev, Nikolay Vyahhi and Glenn Tesler. Bioinformatics (2013)29(8):1072-1075.