Genome Annotation Service

Catalog # 8010 (basic service)

Catalog # 8011 (extended service)

Accurate and complete genome annotation is what makes an assembly usable. Don’t shortchange yourself on this final, critical step. At Dovetail®, dedicated annotation bioinformaticians run the latest published tools in a highly efficient, parallelized compute workflow, drawing evidence from a variety of sources. The best gene-calling evidence is RNA-seq or Iso-Seq data from the species of interest. We can generate this data for you or use existing transcriptome data to call genes. Choose between the Basic and Extended packages: the Basic package includes 5 genes manually curated for optimal accuracy, and the Extended package includes 55.

Why Use Our Genome Annotation Service?

  • Fully leverage our complete sample-to-publishable assembly workflow.
  • Trust our qualified annotation bioinformaticians to deliver on this crucial final step in genome assembly.
  • Benefit from a high-throughput, bespoke annotation pipeline built to maximize speed and performance.

How It Works

Maximize your genome assembly investment with accurate and complete gene annotation. Gene annotation errors can derail your downstream analyses. The experts at Dovetail can provide you with an annotated assembly you can trust. To call genes confidently, we draw evidence from as many trustworthy sources as possible, which may include publicly available transcriptome data and protein databases. However, the best evidence is RNA-seq or Iso-Seq data from a variety of tissues and/or life stages of the species under study. If transcriptome data is unavailable, we can generate it for you as part of our complete service offering. Genes are called, identified, and functionally annotated. A qualified bioinformatician then manually curates up to 5 genes (Basic package) or 55 genes (Extended package) of your choice.

The annotation workflow includes:

  • Repeat masking
  • Model preparation
  • Evidence collection
  • High-throughput genome annotation
  • Manual curation
  • Functional annotation

Ab initio annotation

Following ab initio annotation, a trained bioinformatician will manually curate the genes most relevant to your research. In this example, Iso-Seq data (A) clearly shows the presence of an exon that the ab initio pipeline missed. Similarly, Iso-Seq data (B) does not support an exon that the ab initio pipeline erroneously called in the genome. Manual curation ensures optimal annotation accuracy.
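
As a rough illustration of the comparison a curator makes, the following Python sketch flags exons that transcript evidence supports but the prediction lacks, and predicted exons with no transcript support. The function name `compare_exons`, the simple overlap rule, and all coordinates are hypothetical and chosen only for illustration; this is not Dovetail's pipeline, and real curation weighs far more evidence than interval overlap.

```python
from typing import List, Tuple

# 1-based inclusive (start, end) coordinates, as used in GFF3.
Interval = Tuple[int, int]


def overlaps(a: Interval, b: Interval) -> bool:
    """Return True if the two closed intervals share at least one base."""
    return a[0] <= b[1] and b[0] <= a[1]


def compare_exons(
    predicted: List[Interval], evidence: List[Interval]
) -> Tuple[List[Interval], List[Interval]]:
    """Compare predicted exons with transcript-supported exons.

    Returns (missed, unsupported):
      missed      - evidence exons that overlap no predicted exon
      unsupported - predicted exons that overlap no evidence exon
    """
    missed = [e for e in evidence if not any(overlaps(e, p) for p in predicted)]
    unsupported = [p for p in predicted if not any(overlaps(p, e) for e in evidence)]
    return missed, unsupported


# Hypothetical coordinates for one gene model.
predicted = [(100, 200), (400, 500), (700, 800)]   # ab initio exons
evidence = [(100, 200), (250, 320), (700, 800)]    # exons seen in transcripts

missed, unsupported = compare_exons(predicted, evidence)
print("Missed by prediction:", missed)
print("Lacking transcript support:", unsupported)
```

Exons flagged either way are candidates for a closer look, not automatic corrections; a curator would still inspect the underlying alignments before changing the gene model.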

Myrtle Warbler

Dr. Brelsford and I have been working on warbler genomics together for nearly a decade now! Dovetail’s Chicago and Hi-C libraries are finally allowing us to put together our small scaffolds from our first-generation assembly towards a near chromosome-level genome. We are excited to see what advances this will bring us in our understanding of avian genome evolution!

David Toews, Penn State

Specifications

Delivery Time: Inquire
Sample Requirements: Dovetail Genome Assembly (>70% BUSCO completeness), RNA-seq and/or Iso-Seq transcriptome data
Library: NA
Sequencing Platform: NA
Analysis Platform: Dovetail Genomics Annotation Suite
Project Deliverables: A comprehensive report (see example below) that includes:

  • Genome annotation file in GFF3 format
  • Predicted gene CDS sequences in FASTA format
  • Predicted gene peptide sequences in FASTA format
  • BAM alignment files from RNA-seq and Iso-Seq data (if evidence data is provided)
  • Repeat annotation file in GFF3 format
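
For a quick sanity check of a delivered annotation file, a short script can tally the feature types it contains. The sketch below assumes only the standard GFF3 layout (nine tab-separated columns, with the feature type in column 3); the function name `count_feature_types` and the sample records are hypothetical and do not come from a Dovetail deliverable.

```python
from collections import Counter
from typing import Iterable


def count_feature_types(lines: Iterable[str]) -> Counter:
    """Count GFF3 records by feature type (column 3)."""
    counts: Counter = Counter()
    for line in lines:
        line = line.rstrip("\n")
        if line.startswith("##FASTA"):
            break  # embedded sequence section follows; no more features
        if not line or line.startswith("#"):
            continue  # skip blank lines, directives, and comments
        cols = line.split("\t")
        if len(cols) != 9:
            continue  # not a well-formed feature line
        counts[cols[2]] += 1
    return counts


# Hypothetical records illustrating the format.
example = [
    "##gff-version 3",
    "scaffold_1\texample\tgene\t100\t900\t.\t+\t.\tID=gene1",
    "scaffold_1\texample\tmRNA\t100\t900\t.\t+\t.\tID=mrna1;Parent=gene1",
    "scaffold_1\texample\texon\t100\t300\t.\t+\t.\tID=exon1;Parent=mrna1",
    "scaffold_1\texample\texon\t600\t900\t.\t+\t.\tID=exon2;Parent=mrna1",
    "scaffold_1\texample\tCDS\t150\t300\t.\t+\t0\tID=cds1;Parent=mrna1",
]

counts = count_feature_types(example)
for feature_type, n in sorted(counts.items()):
    print(feature_type, n)
```

To run it on a real file, pass an open file handle, for example `count_feature_types(open("annotation.gff3"))`, where the filename is a placeholder.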