digplanet beta 1: Athena
Share digplanet:

Agriculture

Applied sciences

Arts

Belief

Business

Chronology

Culture

Education

Environment

Geography

Health

History

Humanities

Language

Law

Life

Mathematics

Nature

People

Politics

Science

Society

Technology

The Ensembl genome database project.
Ensembl release58 sgcb screenshot.png
Content
Description Ensembl
Contact
Research center
Primary citation Hubbard, et al. (2002)[1]
Access
Website www.ensembl.org
Tools
Miscellaneous

Ensembl is a joint scientific project between the European Bioinformatics Institute and the Wellcome Trust Sanger Institute, which was launched in 1999 in response to the imminent completion of the Human Genome Project.[1] After 10 years in existence,[2] Ensembl's aim remains to provide a centralized resource for geneticists, molecular biologists and other researchers studying the genomes of our own species and other vertebrates and model organisms. Ensembl is one of several well known genome browsers for the retrieval of genomic information.

Similar databases and browsers are found at NCBI and the University of California, Santa Cruz (UCSC).

Contents

Background [edit]

The human genome consists of three billion base pairs, which code for approximately 20,000–25,000 genes. However the genome alone is of little use, unless the locations and relationships of individual genes can be identified. One option is manual annotation, whereby a team of scientists tries to locate genes using experimental data from scientific journals and public databases. However this is a slow, painstaking task. The alternative, known as automated annotation, is to use the power of computers to do the complex pattern-matching of protein to DNA.

In the Ensembl project, sequence data are fed into the gene annotation system (a collection of software "pipelines" written in Perl) which creates a set of predicted gene locations and saves them in a MySQL database for subsequent analysis and display. Ensembl makes these data freely accessible to the world research community. All the data and code produced by the Ensembl project is available to download, and there is also a publicly accessible database server allowing remote access. In addition, the Ensembl website provides computer-generated visual displays of much of the data.

Over time the project has expanded to include additional species (including key model organisms such as mouse, fruitfly and zebrafish) as well as a wider range of genomic data, including genetic variations and regulatory features. Since April 2009, a sister project, Ensembl Genomes, has extended the scope of Ensembl into invertebrate metazoa, plants, fungi, bacteria, and protists, whilst the original project continues to focus on vertebrates.

Displaying genomic data [edit]

Gene SGCB aligned to the human genome

Central to the Ensembl concept is the ability to automatically generate graphical views of the alignment of genes and other genomic data against a reference genome. These are shown as data tracks, and individual tracks can be turned on and off, allowing the user to customise the display to suit their research interests. The interface also enables the user to zoom in to a region or move along the genome in either direction.

Other displays show data at varying levels of resolution, from whole karyotypes down to text-based representations of DNA and amino acid sequences, or present other types of display such as trees of similar genes (homologues) across a range of species. The graphics are complemented by tabular displays, and in many cases data can be exported directly from the page in a variety of standard file formats such as FASTA.

Externally produced data can also be added to the display, either via a DAS (Distributed Annotation System) server on the internet, or by uploading a suitable file in one of the supported formats, such as BAM, BED, or PSL.

Graphics are generated using a suite of custom Perl modules based on GD, the standard Perl graphics display library.

Alternative access methods [edit]

In addition to its website, Ensembl provides a Perl API[3] (Application Programming Interface) that models biological objects such as genes and proteins, allowing simple scripts to be written to retrieve data of interest. The same API is used internally by the web interface to display the data. It is divided in sections like the core API, the compara API (for comparative genomics data), the variation API (for accessing SNPs, SNVs, CNVs..), and the functional genomics API (to access regulatory data). The Ensembl website provides extensive information on how to install and use the API.

This software can be used to access the public MySQL database, avoiding the need to download enormous datasets. The users could even choose to retrieve data from the MySQL with direct SQL queries, but this requires an extensive knowledge of the current database schema.

Large datasets can be retrieved using the BioMart data-mining tool. It provides a web interface for downloading datasets using complex queries.

Last, there is an FTP server which can be used to download entire MySQL databases as well some selected data sets in other formats.

Current species [edit]

The annotated genomes include most fully sequenced vertebrates and selected model organisms. All of them are eukaryotes, there are no prokaryotes. As of 2008, this includes:

See also [edit]

References [edit]

  1. ^ a b Flicek P, Amode MR, Barrell D, et al. (November 2010). "Ensembl 2011". Nucleic Acids Res 39 (Database issue): D800–D806. doi:10.1093/nar/gkq1064. PMC 3013672. PMID 21045057. 
  2. ^ Flicek P, Aken BL, Ballester B, et al. (January 2010). "Ensembl's 10th year". Nucleic Acids Res. 38 (Database issue): D557–62. doi:10.1093/nar/gkp972. PMC 2808936. PMID 19906699. 
  3. ^ Stabenau A, McVicker G, Melsopp C, Proctor G, Clamp M, and Birney E (February 2004). "The Ensembl Core Software Libraries". Genome Research 14 (5): 929–933. doi:10.1101/gr.1857204. PMC 479122. PMID 15123588. 

External links [edit]


Original courtesy of Wikipedia: http://en.wikipedia.org/wiki/Ensembl — Please support Wikipedia.
A portion of the proceeds from advertising on Digplanet goes to supporting Wikipedia.
3441 videos foundNext > 

Ensembl Genome Browser

Learn how to find a gene and browse a region of the genome in www.ensembl.org.

Browsing SNPs and Copy Number Variation in Ensembl

Short sequence variants such as Single Nucleotide Polymorphisms (SNPs) and larger structural variants like Copy Number Variation (CNVs) can be viewed in the ...

How to Install the Ensembl Perl APIs in 4 minutes

This is a quick "how-to" guide for installing the Ensembl Perl APIs. Installation by both cvs and ftp will be demonstrated. All commands in this video can be...

Demo 4: Using BLAST/BLAT in Ensembl

We demonstrate the BLAST/BLAT tool in Ensembl. Search for a sequence in Ensembl, and identify hits to the genome, or to genes, with this tool.

Jean Jacques Goldman Ensemble

Ensembl overview at the Erasmus MC (Sept 2011)

This is an overview of the data available in the Ensembl Genome Browser with a focus on the gene set and sequence variation. Four demos follow this presentat...

Viewing Ensembl Regulation & ENCODE Using the Matrix

This tutorial demonstrates how to view sequences potentially involved in gene regulation. These sequences are analysed by Ensembl Regulation based mostly on ...

Jean jacques goldman ensemble paroles (souvient toi)

Parole de la musique de Jean Jacques Goldman ensemble (souvient toi) :)

EnsemblGenomes: Extending Ensembl

A quick overview of the EnsemblGenomes browser, which was released in April 2009 and is designed to extend the Ensembl browser to cover metazoans, protists, ...

View Your Data in Ensembl

Draw your data along the genome through either quick upload of a file to Ensembl, or attaching a url. We will explore the following examples: 1) Viewing a BA...

3441 videos foundNext > 

6 news items

 
Science Codex
Sun, 28 Apr 2013 10:34:31 -0700

The consortium also investigated the association of embryonic gene expression profiles (GXP) and their morphological evolution pattern, based on ENSEMBL soft-shell turtle gene-set. By integrating RNA-seq technology, comparative genomics method, and ...
 
El País.com (España)
Thu, 23 May 2013 10:52:04 -0700

Primeros abonos a 30 euros). Primera edición de un festival dedicado a la música balcánica, al swing y al ska con bandas como New York Ska Jazz Ensembl, Dunkelbunt, Shantel, Mahala Rai Banda, Sonido Vegetal y Bohemian Betyards, entre las primeras ...
 
Tygodnik Wałbrzyski
Tue, 14 May 2013 23:22:04 -0700

Wałbrzyski Ośrodek Kultury włączy się w organizację jubileuszowego Międzynarodowego Festiwalu Kameralistyki Ensembl im. Księżnej Daisy, na co otrzymał 100 000 zł, miasto dołoży drugie 100 000 zł. Zespół Pieśni i Tańca „Wałbrzych” będzie promował ...
 
Ressources Solidaires
Mon, 13 May 2013 06:08:07 -0700

Je ne peux évidemment que me réjouir pour cette con stance et cette volonté commune d'apporter, ensembl e, une nouvelle pierre à la connaissance de ce mouveme nt social, maintenant assez bien connu... mais presqu e toujours aussi mal reconnu, qu'est ...
 
生物通
Wed, 01 May 2013 18:35:57 -0700

随后,研究人员还基于ENSEMBL注释基因集,对中华鳖胚胎发生基因调控的变化进行了研究。现有研究发现在一些模式动物胚胎发生中会呈现出沙漏模型,即各种分类群在胚胎早期阶段开始出现不同,然后到胚胎发生中期,趋向于 ...
 
Le Zapping du PAF
Sat, 04 May 2013 02:03:33 -0700

... soirée sur TF1, le jeu Money Drop présenté par Laurence Boccolini et produit par Endemol Productions, s'est placé largement en tête des audiences en réunissant 5.2 millions de téléspectateurs pour des Parts d'Audience de 24% sur l'ensembl du public ...
Loading

Oops, we seem to be having trouble contacting Twitter

Talk About Ensembl

You can talk about Ensembl with people all over the world in our discussions.

Support Wikipedia

A portion of the proceeds from advertising on Digplanet goes to supporting Wikipedia. Please add your support for Wikipedia!