Reduce cost & duration by 50% or more on whole-genome sequencing projects

Updated: View “Reduce cost and duration by 50% or more on whole-genome sequencing projects” webinar.

Although sequencing technology and price performance per base-pair-sequenced continue to advance at an impressive rate, Finished whole genome sequencing projects are still costly and lengthy endeavors. Next Gen sequencing technology (and even next-next gen technology) isn’t addressing some of the common issues faced with creating a “Finished” quality genome, namely Contig Placement, Gap Closure and Validation.  Addressing these issues takes several months and a substantial amount of the budget in a sequencing project.

Consider the current workflow for generating a Finished whole genome in the figure below.

As you can see, generating the initial sequence data is no longer the bottle neck. Small genomes can be sequenced using shot gun methods in a couple of days. After the initial assembly the hard part starts: Closing gaps between your contigs, navigating regions with a high number of repeats, resequencing for validation etc. These tasks can represent over 50% of the length of a sequencing project and over 50% the cost!

I wanted to see if other researchers had found novel and/or more cost effective ways of dealing with these challenges. Especially labs that are resource constrained. I came across an interesting paper titled Finishing genomes with limited resources:  lessons from an ensemble of microbial genomes that was published last year in BMC Genomics1. It discusses how using Whole Genome Mapping technology, also called Optical Mapping, can significantly reduce the length of sequencing projects. Before we get into what the paper presents let’s learn more about Whole Genome (Optical) Mapping.

Whole Genome (Optical) Mapping is a de novo process that generates whole genome, ordered, restriction maps with no requirement for previous sequence information & provides a comprehensive view of genomic architecture. An Optical Map or Whole Genome Map (WGM) is displayed in the unique MapCode™ pattern below where the vertical lines indicate the locations of restriction sites, and the distance between the lines represent the fragment size.

The WGM acts as a scaffold for your sequencing project. How? The contigs generated from your sequence assembly are converted to Optical Maps in silico and then are aligned and assembled to the de novo WGM. The WGM acts as an independent validation tool for contig placement and length of repeat regions while also helping to easily identify gaps in your assembly. By taking unordered sequence contigs and aligning them to an ordered WGM you quickly orient the contigs.  When aligned, you can then identify any possible misassemblies that may have occurred in the initial assembly portion of your project.

You might be wondering how the scaffold concept as it applies to Whole Genome Mapping is different from scaffolds obtained with mate-pairs. To quote Nagarajan et al in the paper referenced above “It should be noted that unlike scaffolds obtained with mate-pairs, the scaffolds here are genome-wide and one per genome and therefore well suited for finishing efforts.”(p3) Additionally “While paired-end reads can be invaluable to scaffold contigs, they provide local order information [only] and using them to recreate a genome wide ordering of contigs is computationally challenging.”(p7) Finally “In addition, for time-critical applications in a biodefense or clinical setting, the time to construct paired-end libraries can be a limiting factor. In such settings, Optical Restriction Mapping [22], a form of ordered restriction maps (see Figure 5), can be a promising alternative as it can quickly provide genome wide restriction site information that can be used to order and orient contigs [8].”(p7)

We are starting to get a picture of how using just one single WGM can save time and reduce the need for computationally intensive bioinformatic steps thereby saving money. Let’s look in more detail about how these time savings are gained.

Contig Placement and Validation

With shotgun sequencing, genomic rearrangements, like inversions, can be missed due to incorrect reconstruction of repeats. A WGM can help you validate your whole genome and identify any possible inversions, insertions, translocations and deletions that sequencing may not have identified.  In the example below the contiguous map in the middle was generated de novo using Whole Genome (Optical) Mapping technology. The contigs were generated in silico. Notice the missassemblies for example in Contig980. You can see an example of an inversion in Contig1253. You can also see examples of insertions, deletions and run of the mill gaps that will have to be spanned in resequencing efforts.

Gap Closure

Another example Nagarajan et al describe is using WGMs to reduce the number of PCR experiments needed.  “Working with the original assembly (59 large contigs) could have necessitated on the order of 592 ≈ 3000 PCR experiments.” (p4) That’s a lot of PCR kits and a lot of time.  Using WGMs as scaffolds, they were able to finish the genome using only 43 PCR experiments and 26 sequencing reactions to close 33 of the gaps.  “From a finishing perspective, these (Optical Mapping) scaffolds are particularly useful, as for a set of n contigs, they help reduce the number of PCR experiments needed from roughly n2 to n.” (p7)

Let’s go back to our original figure describing the steps and average time to complete a sequencing project, this time comparing current methods to a workflow that uses a WGM.

As you can see, using a WGM as a scaffold reduces the time significantly by eliminating or greatly reducing the dependence and cost of generating paired-end libraries not to mention the bioinformatics muscle that is required with that approach. Plus having an accurate understanding of the gaps that need to be spanned in resequencing efforts reduces the number of PCR reactions thereby reducing the time and cost of gap closure. Finally the nature of having one whole, ordered contiguous scaffold makes validation inherently easier.

Currently there are many limitations when doing whole-genome sequencing projects. These issues include, but are not limited to: fragmented output of genomes, misassemblies of repeat regions, and limited resources to run these experiments. I’m confident that someday in the future sequencing technology will advance to address these issues. In the meantime Whole Genome (Optical) Mapping acts as a complementary technology to significantly reduce the time and cost associated with the issues discussed in this article.

Learn more about Whole Genome (Optical) Mapping and how to obtain a WGM for your sequencing project.

1 Finishing genomes with limited resources:  lessons from an ensemble of microbial genomes. Nagarajan et al. BMC Genomics 2010, 11;242. Pubmed link

LinkedInShare

Optical Mapping as an alternative to Pulsed-Field Gel Electrophoresis (PFGE) for Strain Typing

Are you looking for a faster and more accurate method of strain typing? Optical Mapping could be your answer.

Optical Mapping produces high coverage, ordered, restriction maps based on hundreds of markers across the entire genome. An Optical Map offers increased accuracy and provides more genomic information than strain typing with alternative methods such as PFGE.

Below you can see a comparison of Optical Mapping and PFGE. Because Optical Maps contain hundreds of markers across the entire genome researchers obtain a much higher resolution compared to other technologies. This enables better strain discrimination among other advantages.

How are researchers using Optical Mapping?

  • High resolution epidemiology

Several clusters of Salmonella Typhimurium infections appeared in Denmark in 2008 and 2009. The paper Molecular characterization of salmonella typhimurium highly successful outbreak strains published in the Foodborne Pathogens and Disease journal discusses how Optical Mapping was able to show that the strain in the largest cluster did not contain an increase content of virulence genes. However Optical Mapping did find a large insert, which was most likely a prophage, in one of the strains. The knowledge of this insert, which may confer a competitive advantage for that strain, is valuable information for epidemiologists.

  • Characterizing and monitoring strain stability

In a paper titled A sustained hospital outbreak of vancomycin-resistant Enterococcus faecium bacteremia due to emergence of vanB E. faecium sequence type 203 published in the Journal of infectious Disease, researchers, with the aid of Optical Mapping, analyzed samples of Enterococcus faecium collected over a 12 year period. The results showed that over this time the strain acquired the vanB locus which resulted in an epidemic clone that exhibits vancomycin resistance.

  • Tracing and linking outbreak and contamination strains to the source

In the figure below you can see the Optical Map similarity cluster of the German Enterohemorrhagic Escherichia coli O104:H4 outbreak of May 2011.

Figure. Optical Map similarity cluster of German EHEC O104:H4 outbreak.
Optical Mapping played a critical role in identifying and tracking strains in this outbreak. The paper Prospective Genomic Characterization of the German Enterohemorrhagic Escherichia coli O104:H4 Outbreak by Rapid Next Generation Sequencing Technology was published about this outbreak and discusses how Optical Mapping accelerated the characterization of the various strains and isolates collected.

  • Distinguishing strains and determining genetic relatedness between multiple isolates

The paper Optical genetic mapping defines regions of chromosomal variation in serovars of S. enterica subsp. enterica of concern for human and animal health published in the journal of Epidemiology and Infection discusses how optical mapping was used to establish 2 geographical lineages (based on the presence of prophage sequences) of strains of Salmonella enterica subsp. enteric.

  • Analyzing microbial mixtures for quality control and process manufacturing

Some of the differentiating technical advantages of Optical Mapping are:

  • Faster time to results than gel electrophoresis and sequence based methods
  • Ability to analyze “unculturable” or fastidious microbes
  • Easy and quick recognition of genomic shifts
  • Much lower cost than whole genome sequencing
  • Easy interpretation though visual analysis using MapSolver™ software
  • High precision epidemiology due to high marker density

How can you access the Optical Mapping technology?

Recently Hitachi Solutions America, Ltd. partnered with OpGen Inc. to offer the MapIt® Optical Mapping Service to its customers. Click here to learn more about Optical Mapping and the MapIt Optical Mapping Service. You can also view a webinar that describes Optical Mapping and its key application areas.

LinkedInShare

Hitachi Solutions America, Inc. Launches MapIt Optical Mapping Service in US

Hitachi Solutions has expanded its agreement with OpGen Inc. to offer MapIt® Optical Mapping Services to its US customer base. Hitachi Solutions has already launched the MapIt service in Japan and this move shows increased confidence in OpGen’s Optical Mapping technology.

Optical Mapping is a novel and de novo process that generates whole genome, ordered, restriction maps with no requirement for previous sequence information & provides a comprehensive view of genomic architecture.

Optical Mapping applications include:

  • Strain Typing
  • Comparative Genomics
  • Whole-genome Sequence Assembly

“We are very excited to bring this new technology to our customers. We feel Optical Mapping offers great value, in terms of time and money saved in its key application areas, along with increased accuracy of results.” said Robert Lynde, Deputy Director, MiraiBio Group of Hitachi Solutions America.

Optical Mapping is currently limited to smaller bacterial, yeast and fungi genomes however OpGen is rolling out large genome support later in 2011.

Learn more Optical Mapping and the MapIt service.

LinkedInShare

Free Live Webinar: “How can Optical Mapping accelerate my research?”

 

 

How can Optical Mapping accelerate my research?

Optical Mapping is a de novo process that generates whole genome, ordered, restriction maps with no requirement for previous sequence information & provides a comprehensive view of genomic architecture. An Optical Map is displayed in the unique MapCode™ pattern above where the vertical lines indicate the locations of restriction sites, and the distance between the lines represent the fragment size.

Join us for a Webinar on August 10
Space is limited.
Reserve your Webinar seat now at:
https://www1.gotomeeting.com/register/367952841
Hitachi Solutions has partnered with OpGen to offer MapIt® Optical Mapping Services to our customers. Trevor Wagner, Senior Applications Scientist Manager from OpGen will be our guest presenter. Trevor was part of the team that developed, tested, and released OpGen’s first major product, the Argus Optical Mapping System in 2010.

This webinar will describe:

1. How Optical Mapping technology will benefit you in the following application areas:

-Strain Typing

-Comparative Genomics

-Whole-genome Sequence Assembly

2. How the MapIt Service works.

Title: How can Optical Mapping accelerate my research?
Date: Wednesday, August 10, 2011
Time: 11:00 AM – 12:00 PM PDT
After registering you will receive a confirmation email containing information about joining the Webinar.
System Requirements

PC-based attendees

Required: Windows® 7, Vista, XP or 2003 Server

Macintosh®-based attendees

Required: Mac OS® X 10.5 or newer


 

LinkedInShare

View MiraiBio Webinar Series Recordings

The following webinar recordings have been made available for viewing at your own leisure:

LinkedInShare