Download Data



File name File size md5sum

Your data will be retained in our server for 3 months. Should you wish to extend the retention period, please email or contact our sales team.



Download file information
File name Description
Rawdata.zip
  • *.fastq.gz : Gzip compressed raw sequencing data in FASTQ file format.
  • *_filtered*.fastq.gz : Gzip compressed and pre-processed sequencing data in FASTQ file format.
Analysis_Result.zip
  • ReadBasedAnalysis
    • TaxonomyClassification : Directory containing read classification results in Excel format.
    • PathwayAnalysis : Directory containing pathway analysis based on the reads in Excel format.
  • ContigBasedAnalysis
    • Assembly : Directory containing fasta files of co-assembly results and contigs of each sample which are co-assembly contigs have mapping depth more than 5 or equal for each sample.
    • TaxonomyClassification : Directory containing contig classification results in Excel format. In the results, 0 means very small amount of contigs were assigned to the taxon, '-' means there are no contigs assigned to the taxon and 'NA' from the abundance means that abundance cannot be calculated from the taxonomic rank. (Abundance can be calculated from genus, species, subspecies levels.)
      • Unclassified : Directory containing binning results and gene prediction results from unclassfied contigs for each sample.
      • Unclassified/Co-assembly : Directory containing binning results of the unclassified co-assembly contigs.
        • Unclassified.bin.txt : Bin name for each contig
        • Unclassified.bin.txt.jpg : Bar graph depicting the number of contigs for each bin.
    • Annotation : Directory containing gene prediction and functional annotation results from the contigs for each sample.
      • File format
        • *.fasta : Nucleotide sequences in FASTA format
        • *.faa : Amino acid sequences predicted from the contigs in FASTA format.
        • *.ffn : Nucleotide sequences of transcripts predicted from the contigs in FASTA format.
        • *.gff : Gene feature format files containing gene information predicted from the contigs.
        • *.tbl : Tab separation formality of NCBI.
        • *.gbk : GenBank format.
    • PanGenomeAnalysis : Directory containing the results of pangenome analysis.
      • gene_presence_absence.xlsx : Excel files containing presence/absence information of the predicted genes among the samples.
      • gene_presence_absence.csv.upset.jpg : UpSet Plot image which is same with that in the Pangenome analysis page of this report.