The Hitachi Software MiraiBio Group Blog

What’s going on at MiraiBio

Subscribe to The Hitachi Software MiraiBio Group Blog
Posted by aliu under SmartNote

NCBI BLAST, the Basic Local Alignment Search Tool (BLAST) is a suite of programs designed to search all available sequence databases for similarities between a protein or DNA query and known sequences. BLAST allows quick matching of near and distant sequence relationships, providing scores that allow the user to distinguish real matches from background hits with a high degree of statistical accuracy.

Focusing on local alignments, BLAST uses a heuristic algorithm to detect relationships between sequences that may only share isolated regions of similarity. BLAST results take sequence length and the nucleotide/peptide compositions of the query into account when assigning alignment scores. For sequences shorter than 200 residues, an effective length is used to compensate for “edge effects”. Sequence alignment scores are reported by BLAST programs as E-values that reflect the strength of alignment between a given sequence in the database and a query. E-values are reported instead of the traditional P-value, to improve resolution between low scoring alignments, but for closely related sequences (P < 0.01), these values are nearly equal.

For more detailed information on how BLAST scores are calculated, visit:

http://www.ncbi.nlm.nih.gov/BLAST/tutorial/Altschul-1.html

For most first-time users of BLAST, choosing the right sub-program may be difficult. BLAST offers a variety of search tools for different types of queries. In general, the best choice of program depends upon the sequence length, the database being searched, and the information requested in the search.

Nucleotide BLAST is a collection of programs allowing users to compare a query sequence against other nucleotides in the database. BLAST accepts sequences in a variety of formats, including FASTA, GenBank, and Accession/GI numbers, and compares these with the NCBI databases. MEGABLAST is a concatenating algorithm for quickly aligning sequences longer than 28 residues. For shorter sequences, such as primers, standard nucleotide-nucleotide BLAST offers automatic parameter settings suited to these queries.

Protein BLAST is a collection of programs used to find protein sequences similar to a query. These programs accept sequences in the same file formats as Nucleotide BLAST. PSI-BLAST is a position specific, iterating algorithm that searches sequences from each round as the basis for scoring sequences searched in the next round. It distinguishes between highly and weakly conserved positions in the sequence, resulting in increased sensitivity with each iteration. PSI BLAST also offers the option of including regular expression patterns in the search, allowing users to identify sequences that include a pattern and are homologous to the query protein sequence. As with Nucleotide BLAST, Protein BLAST includes automatic parameter settings for shorter sequences.

Translating BLAST operates in a similar fashion to both the nucleotide and protein search routines. BLASTX translates nucleotide sequences into protein sequences in each of the 6 reading frames, prior to comparing the query to the protein databases. TBLASTN compares a protein sequence query against a database of nucleotide sequences previously translated in each of the 6 reading frames.

Users can refer to the NCBI BLAST program selection guide for more information:
http://www.ncbi.nlm.nih.gov/blast/producttable.shtml
.

Users can access BLAST tools directly through the web, or through a variety of software applications, such as MiraiBio’s DNASIS SmartNote, which helps users find and organize sequences, and automatically submit them to the BLAST programs. DNASIS SmartNote has the additional ability to BLAST multiple sequences “in batch” without tediously copying/pasting each sequence and waiting for each result to come back.

To learn more about DNASIS SmartNote, visit http://smartnote.miraibio.com.

Posted by aliu under SmartNote

Since PCR primer design is one of the most widely used features of DNASIS SmartNote, we did some research and put together a list of the top 10 tips for designing PCR primers that work. When designing oligonucleotide primers for PCR, it is helpful to keep some considerations in mind to optimize the output and specificity of your experiment. Here are some tips gathered from experts to get you started:

1. Design your PCR primers to be 18-30 oligo nucleotides in length. The longer end of this range allows higher specificity and gives you space to add restriction enzyme sites to the primer end for cloning.

2. Make sure the melting temperature (Tm) of the primers used are not more than 5°C different from each other. You can calculate Tm with this formula: Tm = 4(G + C) + 2(A + T)°C

3. Aim for a Tm between 65 and 70°C for each primer over the region of hybridization

4. Use an annealing temperature (Ta) of 10 to 15°C lower than the Tm.

5. The GC content of each primer should be in the range of 40-60% for optimum PCR efficiency.

6. Try to have uniform distribution of G and C nucleotides, as clusters of G’s or C’s can cause non-specific priming.

7. Avoid long runs of the same nucleotide.

8. Check that primers are not self-complementary or complementary to the other primer in the reaction mixture, as this will encourage formation of hairpins and primer dimers and will compete with the template for the use of primer and reagent.

9. If you can, make the 3′ end terminate in C or A, as the 3′ is the end which extends and neither the C or A nucleotide wobbles. This will increases the specificity.

10. You can avoid mispriming by making the 3′ end slightly AT rich.

11. Use the right software. OK, so it’s 11 tips. Using the right software is a great way to automate these steps and minimize errors, especially when you have to design primers for many sequences. DNASIS SmartNote includes several primer design tools and is also a lab notebook that automatically keeps a record of your analysis results. You should definitely give it a try. Click here to sign up a free account.

If you prefer traditional desktop software, take a look at DNASIS Max instead.

References:
http://rothlab.ucdavis.edu/protocols/PrimerDesign.html
http://www.biochem.ucl.ac.uk/bsm/nmr/protocols/protocols/oligo.html
http://www.protocol-online.org/prot/Molecular_Biology/PCR/PCR_Primer/
http://www.mcb.uct.ac.za//pcroptim.htm

Posted by aliu under SmartNote

New this week:

1. By popular demand, you can now import GenBank and multi-GenBank files into SmartNote, including those exported from other programs, such as DNASIS Max and Vector NTI.

2. We have added a new option for displaying the predicted amino acid sequence aligned with a DNA sequence. Use the ExPASy Translate tool and specify “Includes Nucleotide Sequence” for the “Output Format” option. Thank you, Jessica Min for this suggestion. This should help users who need to design primers for mutagenesis.

3. Support for IUPAC codes in sequences is also being added this week.

Please keep the feedback coming!