![]() |
![]() |
| Elements | Source |
|---|---|
| Primary transcripts Exons Introns |
Transcripts and their exon/intron structure are determined by the mapping
of cDNA sequences from different sources (e.g. RefSeq, GenBank, Ensembl). Transcripts
annotated in ElDorado are divided into 3 quality levels:
|
| UTRs | calculated by determining the longest open reading frame (ORF) for the transcript |
| PromoterInspector-Predictions | calculated by PromoterInspector |
| Promoter regions | Promoters available in ElDorado are evaluated in a 3 step process:
|
| TSR (transcriptional start regions) |
TSRs are defined as regions of genomic sequence for which experimental evidence for transcription initiation is available. Information about transcription initiation is derived from individual full-length cDNAs and from CAGE tags. Both data sources make use of the oligo-capping method. The 5' ends of full-length transcripts and CAGE tags are taken as experimentally verified transcription start sites (TSS). TSSs separated by less than 40bp are grouped in a TSR. |
| Conserved Regions | Conserved regions are calculated by a proprietary algorithm for genome wide sequence alignments. Conserved regions have a minimum length of 50 basepairs and a minimum similarity of 80%. The algorithms accounts for point mutations only, but not insertions/deletions. The elements annotated in ElDorado are based on comparisons of genomes from distant species only, i.e. closely related species like rat/mouse are not considered. Note: Conserved regions are available up to ElDorado 12-2008. |
| MicroRNAs | microRNAs are based on the sequences available in the miRBase at the Sanger Institute. |
| Probes | Each single probe from gene expression arrays from Affymetrix and Illumina is mapped against the corresponding genome. All perfect matches are annotated. |
| SNPs | derived from dbSNP (NCBI) |
| SMARs | calculated by SMARTest |
| Repeats | The following genomic repeats are calculated by ModelInspector: ALUs, L1 elements, THEs, and B1 elements. |
| Modules | calculated for promoter regions by ModelInspector (Promoter Module Library) |
Descriptive information about genetic loci was derived from NCBI's Entrez
Gene.
Bibliosphere data is based on the analysis of abstracts from NCBI's PubMed.
| © 1998-2010 Genomatix Software GmbH - All rights reserved |