Assembly Statistics



Sample Number of Contigs Total Length of Contigs N50 Max Length Min Length Avg. Length
Show Description

    Genome assembly refers to aligning and merging fragments of DNA sequences, in order to construct draft genome map of the targeted sample. Assembly is needed due to the limitation of current sequencing technology, although it has been drastically improved, still has no choice but to produce short read sequences compare to entire genome size. Once the assembly is accomplished, constructed map is considered as a reference genome, based on which, gene prediction, annotation and taxon classification can be carried out. In contrast to general assembly, which uses read sequence from single sample to construct independent map for each species, co-assembly uses read sequences from pooled samples, including multiples species. Co-assembly analysis has advantage on providing higher read depth to capture more diversity within samples, and allows you to identify specific genetic information from metagenomes, that is not normally available due to its lack of proportion. This co-assembly analysis was conducted using MEGAHIT $version. After the assembly has been completed, read sequences used for co-assembly were mapped to contigs. If the mapping depth of the corresponding contig appears to be 5X or higher, it is determined to be derived from the species within sequenced samples.


  •     · Number of Contigs : The number of contigs identified
  •     · Total Length of Contigs : The total length of contigs
  •     · N50 : An N50 means that half of all bases reside in contigs of this size or longer
  •     · Max Length : The length of the longest contig
  •     · Min Length : The length of the shortest contig
  •     · Avg. Length : The average length of contigs


Scroll to zoom in/out & Drag to move focus



Sample Total Reads Mapped Reads Coverage (%) Ins. Size (Std.)
Show Description

  •     · Total Reads : The total number of reads
  •     · Mapped Reads : The total number of mapped reads
  •     · Coverage (%) : The percentage of mapped sited
  •     · Ins. Size (Std.) : The average and standard deviation of length for paired and mapped reads