digplanet beta 1: Athena
Share digplanet:


Applied sciences






















In molecular biology and bioinformatics, the consensus sequence is the calculated order of most frequent residues, either nucleotide or amino acid, found at each position in a sequence alignment. It represents the results of a multiple sequence alignments in which related sequences are compared to each other and similar sequence motifs are calculated. Such information is important when considering sequence-dependent enzymes such as RNA polymerase.[1]

Biological significance[edit]

A protein binding site, represented by a consensus sequence, may be a short sequence of nucleotides which is found several times in the genome and is thought to play the same role in its different locations. For example, many transcription factors recognize particular patterns in the promoters of the genes they regulate. In the same way restriction enzymes usually have palindromic consensus sequences, usually corresponding to the site where they cut the DNA. Transposons act in much the same manner in their identification of target sequences for transposition. Finally splice sites (sequences immediately surrounding the exon-intron boundaries) can also be considered as consensus sequences.

Thus a consensus sequence is a model for a putative DNA binding site: it is obtained by aligning all known examples of a certain recognition site and defined as the idealized sequence that represents the predominant base at each position. All the actual examples shouldn't differ from the consensus by more than a few substitutions, but counting mismatches in this way can lead to inconsistencies.[2]

Any mutation allowing a mutated nucleotide in the core promoter sequence to look more like the consensus sequence is known as an up mutation. This kind of mutation will generally make the promoter stronger, and thus the RNA polymerase forms a tighter bind to the DNA it wishes to transcribe and transcription is up-regulated. On the contrary, mutations that destroy conserved nucleotides in the consensus sequence are known as down mutations. These types of mutations down-regulate transcription since RNA polymerase can no longer bind as tightly to the core promoter sequence.

Sequence analysis[edit]

Developing software for pattern recognition is a major topic in genetics, molecular biology, and bioinformatics. Specific sequence motifs can function as regulatory sequences controlling biosynthesis, or as signal sequences that direct a molecule to a specific site within the cell or regulate its maturation. Since the regulatory function of these sequences is important, they are thought to be conserved across long periods of evolution. In some cases, evolutionary relatedness can be estimated by the amount of conservation of these sites.


The conserved sequence motifs are called consensus sequences and they show which residues are conserved and which residues are variable. Consider the following example DNA sequence:


In this notation, A means that an A is always found in that position; [CT] stands for either C or T; N stands for any base; and {A} means any base except A. Y represents any pyrimidine, and R indicates any purine.

In this example, the notation [CT] does not give any indication of the relative frequency of C or T occurring at that position. An alternative method of representing a consensus sequence uses a sequence logo. This is a graphical representation of the consensus sequence, in which the size of a symbol is related to the frequency that a given nucleotide (or amino acid) occurs at a certain position. In sequence logos the more conserved the residue, the larger the symbol for that residue is drawn; the less frequent, the smaller the symbol. Sequence logos can be generated using WebLogo, or using the Gestalt Workbench, a publicly available visualization tool written by Gustavo Glusman at the Institute for Systems Biology.[2]


Bioinformatics tools are able to calculate and visualize consensus sequences. Examples of the tools are JalView and UGENE.

See also[edit]


  1. ^ Pierce, Benjamin A. 2002. Genetics : A Conceptual Approach. 1st ed. New York: W.H. Freeman and Co.
  2. ^ a b Schneider TD (2002). "Consensus Sequence Zen". Appl Bioinformatics 1 (3): 111–119. PMC 1852464. PMID 15130839. 

Original courtesy of Wikipedia: http://en.wikipedia.org/wiki/Consensus_sequence — Please support Wikipedia.
This page uses Creative Commons Licensed content from Wikipedia. A portion of the proceeds from advertising on Digplanet goes to supporting Wikipedia.
3688 videos foundNext > 

What is DNA consensus sequence?

In molecular biology and bioinformatics, the consensus sequence is the calculated order of most frequent residues, either nucleotide or amino acid, found at each position in a sequence alignment....

12 l Consensus Sequence

Prokaryotic Transcription and Promoters

DNASTAR - Creating a New Consensus for an Alignment

DNASTAR's MegAlign application allows you to define multiple consensus sequences from the alignment report, enabling the comparison of the sequences in your alignment to multiple templates.

Kozak consensus sequence

The Kozak consensus sequence, Kozak consensus or Kozak sequence, is a sequence which occurs on eukaryotic mRNA and has the consensus (gcc)gccRccAUGG. The Kozak consensus sequence ...

Lecture 11. Unit 1 Sequence Consensus and Multi-Paxos

This lecture is about sequence consensus and multi-Paxos. In fact about a variant that is extremely efficient that we call Sequence Paxos. This version is also easy to understand. The lecture...

What is Phasing?

Whole-genome sequencing conventionally generates a single consensus sequence without distinguishing between variants on homologous chromosomes. Genome phasing addresses this limitation ...

Consensus Sequence - Tales from the Genome

This video is part of an online course, Tales from the Genome. Check out the course here: https://www.udacity.com/course/bio110.

BioEdit Input Output sequence Files

This Video tutorial to demonstrates different ways of getting sequences in and out of BioEdit. BioEdit can read many formats including most commonly used fasta (.fa, .fas, .fasta), Genbank...

DNASTAR - Assembling and Analyzing Sanger Sequencing Data

In this video, we show you how to assemble Sanger trace data in SeqMan Pro. After assembly, you can use SeqMan Pro's integrated views to see read alignment, assembly coverage, and base quality.

3688 videos foundNext > 

78 news items

Fri, 17 Apr 2015 06:52:30 -0700

Nature Clim Change 3: 979–984. | Article | ISI | CAS |; Warren G, Corotto L. (1989). The consensus sequence of ice nucleation proteins from Erwinia herbicolaPseudomonas fluorescens and Pseudomonas syringae. Gene 85: 239–242. | Article | PubMed | ISI |.

BMC Blogs Network (blog)

BMC Blogs Network (blog)
Fri, 17 Apr 2015 03:07:30 -0700

“At the time, I was fortunate that Alan Scott was developing a consensus sequence of human L1 elements just two floors above my lab. What's more, Jef Boeke in Molecular Biology at Hopkins, was an expert who coined the term retrotransposition during ...
Watching the Watchers.org
Thu, 16 Apr 2015 21:30:00 -0700

The conclusion is that the virus population produced in HeLa cells consists mainly of genomes with the consensus sequence, and small amounts of many variant genomes. These variants are only those that give rise to viable viruses; lethal mutations are ...
Thu, 22 Jan 2015 11:30:00 -0800

For the second, they allowed multiple alignments to the reference for each read, enabling them to "gather all reads for a single gene in a single pileup to any of the eight HLA-A/B loci to generate a consensus sequence." Several variant and haplotype ...
Science Careers Blog (subscription)
Thu, 11 Sep 2014 11:07:04 -0700

Stephen K. Gire, Augustine Goba, Kristian G. Andersen, Rachel S. G. Sealfon, Daniel J. Park, Lansana Kanneh, Simbirie Jalloh, Mambu Momoh, Mohamed Fullah, Gytis Dudas, Shirlee Wohl, Lina M. Moses, Nathan L. Yozwiak, Sarah Winnicki, Christian B.
Mon, 23 Feb 2015 12:37:30 -0800

Then they error-correct these reads using the partial order aligner or POA software, which applies a directed acyclic graph to compute a multiple alignment to determine a consensus sequence. The corrected reads are then fed into the Celera assembler.


The Daily Galaxy (blog)
Mon, 29 Sep 2014 09:07:30 -0700

One method is called consensus-sequence engineering, and is the more simplistic of the two. Scientists just plug the sequence of a protein of interest into a protein database. The query returns a large number of “hits,” or analogous sequences.
Emory University News and Events
Fri, 11 Jul 2014 03:52:30 -0700

A "fitness bottleneck" explains why viruses that emerge after male-to-female transmission have more mutations than those that emerge after female-to-male transmission. The findings suggest that there is a window of opportunity for drugs and vaccines to ...

Oops, we seem to be having trouble contacting Twitter

Support Wikipedia

A portion of the proceeds from advertising on Digplanet goes to supporting Wikipedia. Please add your support for Wikipedia!

Searchlight Group

Digplanet also receives support from Searchlight Group. Visit Searchlight