This task searches for transcription factor binding site models that are conserved in orthologous promoter sequences of several vertebrate species. Conservation adds evidence for the functionality of promoter models, because TF models are more likely to be functional when they have been preserved during evolution.
The Genomatix homology groups (defined by a proprietary algorithm, see Comparative Genomics) include 16 vertebrate species (Homo sapiens, Pan troglodytes, Macaca mulatta, Mus musculus, Oryctolagus cuniculus, Rattus norvegicus, Equus caballus, Canis lupus familiaris, Bos taurus, Sus scrofa, Monodelphis domestica, Ornithorhynchus anatinus, Xenopus tropicalis, Danio rerio, Gallus gallus, Taeniopygia guttata). A promoter model is defined to be conserved if it is found in at least one alternative promoter sequence of the genes in a homology group for all selected organisms.
| Homology Group Selection | |
|---|---|
| Select organism | Select the organisms for which the orthologous promoter sequences should
be analyzed. The homology groups always correspond to the latest ElDorado
release.
You can select all vertebrates or at least two individual organisms from the following list:
|
| Constraints | You can set one or several mandatory organisms in which promoter model matches have to be present. Please note that all mandatory organisms must be among the organisms selected for the search. The second constraint is the minimum number of organisms for which promoter model matches have to be found in the same homology group. This number has to be equal to or lower than the number of organisms selected. |
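The two constraints can be read as a simple filter applied to each homology group. The following is a minimal sketch of that logic, not the Genomatix implementation: the function name `is_conserved` and its parameters are hypothetical, and the default of requiring all selected organisms is an assumption based on the definition of a conserved promoter model given above.

```python
from typing import Iterable, Optional, Set


def is_conserved(
    organisms_with_match: Set[str],
    selected: Set[str],
    mandatory: Iterable[str] = (),
    min_organisms: Optional[int] = None,
) -> bool:
    """Check whether a model match in one homology group passes the constraints.

    organisms_with_match: organisms in which at least one promoter of the
    homology group contains a match of the model (hypothetical input).
    """
    mandatory_set = set(mandatory)
    # Mandatory organisms must be part of the selection.
    if not mandatory_set <= selected:
        raise ValueError("mandatory organisms must be among the selected organisms")
    # Assumption: without an explicit minimum, all selected organisms are required.
    if min_organisms is None:
        min_organisms = len(selected)
    # The minimum may not exceed the number of selected organisms.
    if min_organisms > len(selected):
        raise ValueError("minimum number exceeds the number of selected organisms")
    hits = organisms_with_match & selected
    return mandatory_set <= hits and len(hits) >= min_organisms
```

For example, with three selected organisms, Homo sapiens as mandatory organism, and a minimum of two, a homology group with matches only in Mus musculus and Rattus norvegicus would be rejected, because the mandatory organism is missing.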
| Parameters | |
|---|---|
| Model groups | Please choose one or several of the available Genomatix model libraries. If you have created your own models with FastM or FrameWorker, they can be found in the "User-defined models" library. You can decide if you want to
In the third case, there will be a separate page with a list of all models in the chosen libraries and you can select your model subset by clicking the checkboxes for each model. |
| Further parameters |
All further parameters (e.g. "Max. number of matches") correspond to the ModelInspector search and output parameters. |
| Email address | Here you can choose between two methods for receiving
the results:
The results will be available for a limited time on our server. For details of how long your results will be kept please see the result-email. After that period they will be deleted unless protected in the project management! |
Three output files are generated: the match overview, the detailed output, and the statistics file.
The first output file contains:
| Sequence file: | Vertebrate homologous promoters |
| Selected organisms: | Homo sapiens, Mus musculus, Rattus norvegicus, Gallus gallus, Canis lupus familiaris, Bos taurus |
| Models: | Vertebrate_Modules/CEBP_HNF1_02.model |
| Strand(s) searched: | both strands |
| Threshold for number of elements: | 100.0 % (2 of 2 elements) |
| Output sorted by: | match positions on the sequences |
| Matches have to occur in: | orthologous promoters of at least 3 organisms |
| Maximum number of matches: | 1000 |
| Organism | Gene | GeneID | Sequence | Model Name | Position | Strand | Select Match |
|---|---|---|---|---|---|---|---|
| Genomatix homology group: 13535 | |||||||
| Rattus norvegicus | Alb (albumin) | 24186 | GXP_294008 | CEBP_HNF1_02 | 412 - 474 | (+) | |
| Mus musculus | Alb1 (albumin 1) | 11657 | GXP_152152 | CEBP_HNF1_02 | 412 - 474 | (+) | |
| Bos taurus | ALB (albumin) | 280717 | GXP_1324534 | CEBP_HNF1_02 | 398 - 461 | (+) | |
| Canis lupus familiaris | ALB (albumin) | 403550 | GXP_208626 | CEBP_HNF1_02 | 391 - 454 | (+) | |
| Homo sapiens | ALB (albumin) | 213 | GXP_40794 | CEBP_HNF1_02 | 391 - 454 | (+) | |
| Pan troglodytes | ALB (albumin) | 461260 | GXP_1440806 | CEBP_HNF1_02 | 391 - 454 | (+) | |
| Macaca mulatta | ALB (albumin) | 704892 | GXP_1078447 | CEBP_HNF1_02 | 392 - 455 | (+) | |
| Gallus gallus | ALB (albumin) | 396197 | GXP_1140646 | CEBP_HNF1_02 | 351 - 418 | (+) | |
A total of 8 matches was found in 1 homology groups.
Homology groups searched: 26190.
Sequences searched: 410256 (270320278 bp).
| Extraction Options | |
|---|---|
| Sequence Extraction | You can extract the
|
| GeneID Extraction | The button "Extract GeneIDs" extracts
the GeneIDs of the matching sequences for each model separately. The
extracted GeneIDs can be used, for example, as input for GePS. The button "Extract GeneIDs by Chromosome" extracts the GeneIDs of the matching sequences for each chromosome separately. |
| Excel Extraction | The button "Export matches to EXCEL format" exports all information available in the match overview (such as model name, sequence name, and position of the model match) to a tab-delimited file. This file is saved to your local disk and can be opened directly with Microsoft Excel. |
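Because the export is tab-delimited, it can also be processed outside of Excel. The following is a minimal sketch using Python's standard `csv` module. It assumes that the first line of the file is a header row; the column names in the sample are hypothetical and only mirror the match overview shown above, so the real file may use different names.

```python
import csv
import io
from typing import Dict, List, TextIO


def read_match_export(handle: TextIO) -> List[Dict[str, str]]:
    """Parse a tab-delimited export whose first line is a header row (assumption)."""
    return list(csv.DictReader(handle, delimiter="\t"))


# Hypothetical sample; the real export may use different column names.
sample = (
    "Organism\tGeneID\tSequence\tModel Name\tPosition\tStrand\n"
    "Homo sapiens\t213\tGXP_40794\tCEBP_HNF1_02\t391 - 454\t(+)\n"
    "Mus musculus\t11657\tGXP_152152\tCEBP_HNF1_02\t412 - 474\t(+)\n"
)
rows = read_match_export(io.StringIO(sample))

# Collect the distinct GeneIDs of all matches.
gene_ids = sorted({row["GeneID"] for row in rows})
```

To read a file saved to disk, pass an open file handle instead of the `io.StringIO` object, for example `open(path, newline="")`.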
| Compare results | |
|---|---|
| Enter GeneIDs |
Enter a list of GeneIDs separated by spaces, returns, or commas. |
When you press the "Compare" button, you will see which GeneIDs are common to both lists and which are specific to either your input list or your result.
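The comparison amounts to set operations on the two GeneID lists. The following sketch illustrates the idea offline; the function names `parse_gene_ids` and `compare_gene_ids` are hypothetical and do not correspond to any Genomatix interface.

```python
import re
from typing import Dict, Set


def parse_gene_ids(text: str) -> Set[str]:
    """Split a list of GeneIDs separated by spaces, returns, or commas."""
    return {token for token in re.split(r"[\s,]+", text) if token}


def compare_gene_ids(input_ids: Set[str], result_ids: Set[str]) -> Dict[str, Set[str]]:
    """Return the common GeneIDs and those specific to each list."""
    return {
        "common": input_ids & result_ids,
        "input_only": input_ids - result_ids,
        "result_only": result_ids - input_ids,
    }


# GeneIDs taken from the example match overview, plus one made-up ID (999).
user_input = parse_gene_ids("213, 11657\n999")
result = {"213", "11657", "24186"}
comparison = compare_gene_ids(user_input, result)
```

Here `comparison["common"]` holds the GeneIDs found in both lists, while `input_only` and `result_only` hold the GeneIDs specific to one side.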
The second output file contains detailed information for each individual element of the model:
| Matrix element / Model element | Position | Str | Sequence | Core sim. / --- | Mat. sim. / Model sim. | Distance to next element |
|---|---|---|---|---|---|---|
| V$CEBP/CEBPB.01 | 410 - 424 | (+) | ATGATTTTGTAATGG | 0.940 | 0.959 | 47 bp |
| V$HNF1/HNF1.02 | 456 - 472 | (+) | GGTTAATGATCTACAGT | 1.000 | 0.923 | --- |
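The value in the "Distance to next element" column of this example equals the difference between the midpoints of the two elements. Whether the program actually measures the distance from center to center is an assumption derived only from these example figures; it is not stated on this page. The helper names below are hypothetical.

```python
from typing import Tuple

Element = Tuple[int, int]  # (start, end) position of a matched element


def midpoint(start: int, end: int) -> int:
    """Center position of an element (exact for odd-length elements)."""
    return (start + end) // 2


def center_distance(first: Element, second: Element) -> int:
    """Distance in bp between the centers of two elements (assumed measure)."""
    return midpoint(*second) - midpoint(*first)


cebpb = (410, 424)  # V$CEBP/CEBPB.01, center at 417
hnf1 = (456, 472)   # V$HNF1/HNF1.02, center at 464
distance = center_distance(cebpb, hnf1)  # 464 - 417 = 47
```

Both elements in the example have an odd length (15 and 17 bp), so the midpoint is unambiguous; for elements of even length the sketch rounds down, which may differ from the program's convention.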
The third output file contains statistics on the model matches and detailed information for your own models:
| Model Name | # matches | in # homology groups | in # organisms | | | | | |
|---|---|---|---|---|---|---|---|---|
| | | | Homo sapiens | Mus musculus | Rattus norvegicus | Gallus gallus | Canis familiaris | Bos taurus |
| CEBP_HNF1_02 | 6 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| © 1998-2011 Genomatix Software GmbH - All rights reserved |