FrameWorker is a complex software tool that allows users to extract a common framework of elements from a set of DNA sequences. These elements are usually transcription factor binding sites since this tool is designed for the comparative analysis of promoter sequences (e.g. inter-species analysis).
![]() |
-> | ![]() |
|---|---|---|
| From single elements | to common framework |
FrameWorker returns the most complex models that are common to the input sequences (satisfying the user parameters). Models/frameworks are defined as all elements (TF sites) that occur in the same order and in a certain distance range in all (or a subset of) the input sequences. The resulting models can be saved in the user-directory and subsequently can be used to scan any set of sequences.
Note that FrameWorker uses strand-specific TF sites for building models: i.e. if the number of sequences with a TF site on one strand (sense OR antisense) is above quorum, then the TF is considered a "common element" and can be used to build potential frameworks.
If no frameworks are found in a set of sequences, the
quorum constraint or the distance constraints should be lowered (see parameter
section below)
If too many frameworks are found, the quorum constraint or the distance
constraints could be raised or mandatory elements should be selected (e.g.
if known from biological data).
New in FrameWorker 5.4 (May 2007):
Please see below for a history of changes.
| Warning: If the majority of input sequences is highly homologous the resulting models will not be significant! E.g. two promoter sequences from mouse and human that are 95% homologous will probably yield frameworks that do not necessarily contain the relevant (i.e. functional) sites. In this case, FrameWorker will display a warning in the output. |
Please note that framework and model are used synonymous here.
| Sequence Selection | |
|---|---|
| Sequence data | The program expects a set of DNA sequences which are the basis for extraction of common elements. These sequences can be supplied in either one of the following formats: You can enter your correctly formatted sequence(s) directly into the form, e.g. with copy and paste, or upload a file containing the sequences. |
| Library selection | |
| Matrix group | The selection here corresponds to the MatInspector
settings. The sequences are scanned for matches to the selected MatInspector matrices. The matches found are used as basic elements for the extraction of a common framework. Note: The lower the core and matrix similarities the more basic elements can be used for extraction of a framework. The trade off may be very unspecific resulting models. |
| Matrix filters | Matrices used for the analysis can be filtered by tissues as described here |
| FrameWorker Parameters | |
| Quorum constraint for framework |
The lower limit of sequences within the input set that has to contain the common framework. Default is the absolute number of sequences that corresponds to at least 85% of the input sequences. |
| Distance constraints for framework |
Minimum and maximum distance between
two elements:
These values denote the minimum and maximum distances between the anchors of two elements of a resulting model. Only basic elements that occur in the given distance range are considered for inclusion in a framework. New in FrameWorker 5.0 (Nov. 2006):Maximum distance variance between two elements: This parameter sets the maximum possible variance of distances between the anchors of the model elements. A framework satisfies the parameter if the distances between the instances of the model in the input sequences do not differ more than the distance variance parameter. For promoters, a distance variance of 20-30 basepairs is usually sufficient. Example: If the minimum distance is set to 5, maximum distance is set to 200, and the distance variance is set to 20, the following models might be possible:
|
| New in FrameWorker 5.4 (May 2007):
Restrictive model search: By default, all models satisfying the
given distance variance are listed (even if there are more specific models
satisfying the remaining parameters). If the restrictive search option
is selected, FrameWorker selects only those models for output having
a minimum variation in the distance between elements. El.1 ← 30-35 → El.2 (distance variance: 6 bps, common to 6 sequences ( 6 matches, 6 non-overlapping)the regular search option might find the following model instead which is less specific (more matches in the sequences) El.1 ← 26-35 → El.2 (distance variance: 10 bps, common to 6 sequences (14 matches, 6 non-overlapping) These parameters are hidden by default. You can use the
|
|
| Element constraints | Minimum and maximum number of elements
in models:
FrameWorker lists only models with the minimum number of elements and stops after models with the maximum number of elements are determined. The default is a minimum of 2 elements and a maximum of 4 elements per model. Show intermediate models: Usually FrameWorker shows only the most complex models that are common
to the input sequences. If this option is checked, all intermediate (i.e.
shorter) models are also listed in the output. Example: If the most complex model consists of 5 elements all models with 2, 3 and 4 elements are also listed. |
| Mandatory Elements:
FrameWorker will only show models that contain at least the element(s) selected here. The selected elements can be combined with "and" (ALL elements must be present in models) or "or" (any of the selected elements must be present). Note: Internally, FrameWorker will search ALL possible element combination and will filter in a second step for the selected mandatory elements. Example: If the transcription factor families "V$AP1F" and "V$CAAT" are selected here, the result will contain only common frameworks that contain an AP1 site as well as a CCAAT binding factor site (if there are any at all). |
|
| Options | Maximum number of models:
The number of different models per model length which will be listed in the output can be set by the user (default: 10, maximum: 100). These parameters are hidden by default. You can use the
|
| Show detailed model matches:
Usually only the model parameters of the frameworks/models found in the sequences are displayed in the FrameWorker results. If you wish to see all matches to the models (i.e. positions of the elements) in your input sequences check this option. The number of matches to each of the FrameWorker derived models in each input sequence is limited (default: 10). You can also display the sequences of the binding sites included in the models. The sequences are always displayed in 5'->3' direction. If the binding sites are located on the antisense strand the displayed reverse complement sequence begins at the end position of the binding site. These parameters are hidden by default. You can use the
|
|
| Determine the p-value of models:
If this option is set, a background promoter sequence set of 5000 human promoters is scanned with the models generated by FrameWorker. The results of this search are used to check whether the models can also be found with a set of randomly selected promoters. The p-value is the probability to obtain an equal or greater number of sequences with a model match in a randomly drawn sample of the same size as the input sequence set. The lower this probability the higher is the specificity of the model. Note: Determination of the p-value increases the computing time. If many models are found, the result may not be finished before the server timeout is reached and the email option has to be used. |
|
| Email address | Here you can choose between two methods for receiving
the results:
The results will be available for a limited time on our server. For details of how long your results will be kept please see the result-email. After that period they will be deleted unless protected in the project management! |
As an example we use a set of 11 promoters for orthologous rhodopsin genes
from 10 species. There is evidence from literature that two transcription factors
(NRLF/V$NRLF and CRX/V$HOXF) act synergistically to regulate rhodopsin transcription
(MEDLINE: 8939891,
MEDLINE: 10887186,
MEDLINE: 10984472)
The 11 input sequences from 10 species were selected from Gene2Promoter, where "rhodopsin, rod pigment" was entered as search term and all orthologous genes were selected. The mouse rhodopsin gene has two alternative promoters annotated in ElDorado.
FrameWorker first automatically detects that there are two alternative promoters from mouse, and asks the user to continue with the the regular FrameWorker or the exhaustive analysis. If there are promoter sets (only when the sequences are from Gene2Promoter or Comparative Genomics in ElDorado), an automatic analysis of all promoter sets is also possible).

When choosing the exhaustive FrameWorker (checking option 2), selecting "70% of loci" as quorum constraint, the following overview will return:

Doing the same analysis with the additional parameter of "V$NRLF" and "V$HOXF" as mandatory elements, the output is reduced to

Please note two main facts:
Here is the example FrameWorker output with a few explanations:
| Models consisting of | # of different models containing mandatory element(s) |
# of models checked |
|---|---|---|
| single element | 44 common elements found | - |
| 2 elements | 1 model found | 62 models checked |
| 3 elements | 11 models found | 141 models checked |
| 4 elements | 38 models found | 272 models checked |
| 5 elements | 65 models found | 371 models checked |
| 6 elements | 49 models found | 257 models checked |
| Element | Strand | Matrix sim. | Common to | |
|---|---|---|---|---|
| V$AP1F | - | Optimized | 7 matches in 7 seq. (70 %) | |
| V$IKRS | - | Optimized | 7 matches in 7 seq. (70 %) | |
| V$NRLF | - | Optimized | 7 matches in 7 seq. (70 %) | |
| V$NRSF | - | Optimized | 7 matches in 7 seq. (70 %) | |
| ... | ||||
In addition, the common elements are displayed graphically, similar to the task "Search for common TF sites in multiple sequences".
For details on the graphics, please also see the common TF binding site graphics.
Note, that this display differs from the task "Search for common TF sites in multiple sequences", because FrameWorker uses strand-specific TF sites, i.e. the sum of TF sites from a family for sense/antisense strand must be above quorum to be considered.
In our example the most complex models consist of six elements, here is the first model as example:
| Element | Strand | Matrix sim. | Distance to next element | Common to | |
|---|---|---|---|---|---|
| 1 | V$SF1F | - | Optimized (min. 0.94) | 21 - 22 bp | 7 matches in 7 seq. (70 %), 7 non-overlapping |
| 2 | V$HOMF | - | Optimized (min. 1.00) | 22 - 24 bp | |
| 3 | V$HOXF | - | Optimized (min. 0.96) | 34 - 42 bp | |
| 4 | V$NRLF | - | Optimized (min. 0.95) | 34 - 34 bp | |
| 5 | V$SRFF | - | Optimized (min. 0.70) | 13 - 16 bp | |
| 6 | V$MAZF | + | Optimized (min. 0.90) | --- |
This model can be saved by clicking the checkbox and supplying a name for the model. It can then later be used with other Genomatix programs (e.g. to search complete databases for matches to this user-defined model).
For a detailed description of the general features of Genomatix graphics please check the graphics overview.
The graphic shows the input sequences as patterned lines with the model matches.
| Each model match is represented by a light gray box containing the individual matrix matches. Matrix matches (i.e. TF sites) are represented by half round symbols. Matches which were found on the positive or negative strand are displayed on top or below the sequence line. There is one color for each matrix family, i.e. matches from matrices of the same family are painted in the same color. |
| The arrow symbol on the sequence | stands for a transcription start site (TSS) or putative TSS. Please note that there can be several (or none) TSS for one sequence. |
![]() |
Left-clicking on a model match symbol will show a small display window containing information on the specified model match: The position of the model match and the p-value (if available). Moving the mouse pointer out of the model match symbol will close the information display. |
| Left-clicking on a matrix match symbol will show a tool tip window containing information on the specified matrix match: The name of the matrix, its family, the position of the match, the matrix similarity and the nucleotide sequence which was matched. The annotation window is also a hyperlink. Following the link will open a new browser window showing further information on the matrix family from MatBase. | ![]() |
![]() |
You can easily customize the graphics according to your needs via the selection tree to the left. If there are several model matches for a sequence, it is possible to select / deselect model matches to be displayed in the graphics. The tree below "Matrix Families" will show a legend of the colors used for Clicking on the matrix family names also provides a link to MatBase with all relevant information on the transcription factor. |
| To change the alignment of the sequences, drag the red symbol to the desired position and press the "Align" button in the toolbar. While dragging, the current position on the sequence is displayed in a small window. | ![]() |
In this case, 7 of 10 input sequences contain exactly one match to the framework/model consisting of 6 elements.
| Sequence | El. 1 | Dist. | El. 2 | Dist. | El. 3 | Dist. | El. 4 | Dist. | El. 5 | Dist. | El. 6 | Overlap | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| GXP_141965 Homo sapiens (725 bp) |
V$FTF.01 375-387 (381/0.94/-) |
22 | V$S8.01 397-409 (403/1.00/-) |
24 | V$CRX.01 419-435 (427/0.99/-) |
42 | V$NRL.01 460-478 (469/0.99/-) |
34 | V$SRF.01 494-512 (503/0.72/-) |
13 | V$MAZR.01 510-522 (516/0.92/+) |
- | |
| GXP_1043413 Macaca mulatta (734 bp) |
V$FTF.01 385-397 (391/0.94/-) |
22 | V$S8.01 407-419 (413/1.00/-) |
23 | V$OTX2.01 428-444 (436/0.99/-) |
42 | V$NRL.01 469-487 (478/0.99/-) |
34 | V$SRF.01 503-521 (512/0.72/-) |
13 | V$MAZR.01 519-531 (525/0.90/+) |
- | |
| GXP_1215006 Pan troglodytes (703 bp) |
V$FTF.01 353-365 (359/0.94/-) |
22 | V$S8.01 375-387 (381/1.00/-) |
24 | V$CRX.01 397-413 (405/0.99/-) |
42 | V$NRL.01 438-456 (447/0.99/-) |
34 | V$SRF.01 472-490 (481/0.72/-) |
13 | V$MAZR.01 488-500 (494/0.92/+) |
- | |
| GXP_68427 Mus musculus (704 bp) |
V$FTF.01 362-374 (368/0.94/-) |
21 | V$S8.01 383-395 (389/1.00/-) |
22 | V$CRX.01 403-419 (411/0.99/-) |
34 | V$NRL.01 436-454 (445/0.97/-) |
34 | V$SRF.01 470-488 (479/0.72/-) |
14 | V$MAZR.01 487-499 (493/0.94/+) |
- | |
| GXP_19602 Rattus norvegicus (704 bp) |
V$FTF.01 362-374 (368/0.94/-) |
21 | V$S8.01 383-395 (389/1.00/-) |
22 | V$CRX.01 403-419 (411/0.99/-) |
34 | V$NRL.01 436-454 (445/0.97/-) |
34 | V$SRF.01 470-488 (479/0.72/-) |
14 | V$MAZR.01 487-499 (493/0.94/+) |
- | |
| GXP_950626 Monodelphis domestica (830 bp) |
--- | ||||||||||||
| GXP_231880 Canis familiaris (710 bp) | V$FTF.01 356-368 (362/0.94/-) |
22 | V$S8.01 378-390 (384/1.00/-) |
24 | V$CRX.01 400-416 (408/0.96/-) |
42 | V$NRL.01 441-459 (450/0.96/-) |
34 | V$SRF.01 475-493 (484/0.70/-) |
16 | V$MAZR.01 494-506 (500/0.98/+) |
- | |
| GXP_982299 Bos taurus (709 bp) |
V$FTF.01 356-368 (362/0.94/-) |
22 | V$S8.01 378-390 (384/1.00/-) |
24 | V$CRX.01 400-416 (408/0.98/-) |
42 | V$NRL.01 441-459 (450/0.95/-) |
34 | V$SRF.01 475-493 (484/0.72/-) |
14 | V$MAZR.01 492-504 (498/0.93/+) |
- | |
| GXP_1135009 Gallus gallus (700 bp) |
--- | ||||||||||||
| GXP_617604 Danio rerio (601 bp) |
--- | ||||||||||||
New in FrameWorker 5.1 (Dec. 2006):
Introduced a change to avoid double or overlapping models in the output.
This may reduce the total number of models compared to FrameWorker 5.0, but
the result is essentially the same.
New in FrameWorker 5.0 (Nov. 2006):
FrameWorker now features an additional distance parameter, the maximum
distance variation between two elements within a model. This allows setting
a higher maximum distance between two elements, but keeping the variation of
distances between the sequences within a certain "distance band".
This way, generated frameworks are more likely to be biologically relevant
because the variation in the distance between the elements is set to an upper
limit.
New in FrameWorker 4.6 (Apr. 2006):
FrameWorker allows mandatory elements, i.e. the user can specify
one or several elements (i.e. transcription factor binding sites) that must
be part of all models in the FrameWorker output.
This requirement is very helpful if it is known from experiments that a certain
transcription factor(s) plays a role in the regulation of the input sequences,
since it filters the output of FrameWorker.
New in FrameWorker 4.5 (Jan. 2006):
When Genomatix-annotated promoters from Gene2Promoter are
submitted to FrameWorker,
| © 1998-2010 Genomatix Software GmbH - All rights
reserved Corporate Information • Privacy Policy • Trademarks |