![]() |
![]() |
DiAlign TF displays transcription factor (TF) binding site matches within a multiple alignment. It is possible to display all TF binding site matches, TF binding site matches common to all or subset of the input sequences, or common TF binding site matches that are located in aligned regions. The TF binding sites are visualized in the alignment as colored boxes.
The input sequences are aligned with the multiple alignment program DiAlign. TF binding site matches are identified by MatInspector.
| Sequence Input | |
|---|---|
| Choose from your previously uploaded sequences | Select a sequence file from the list of your personal sequence files. |
| or enter the formatted DNA sequence(s) | Enter your correctly formatted sequence(s) directly into the
form, e.g. with copy and paste. The following formats are accepted: There should be only IUPAC characters in the sequence, any other characters will be skipped! |
| or upload a file containing sequence(s) (max. 100 MB) | If your browser supports this option, a sequence file can be uploaded. If you use this option, the file should contain the sequence(s) in either one of the following formats: Please note, that the size for uploaded files is limited to 100MB. If you want to analyze larger sequences please contact support@genomatix.de. For whole chromosomes you can use the accession number option below (e.g. 'NC_000001' for human chromosome 1). |
| or enter accession number(s) |
If you are interested in one or several special
sequences from a database section, you can supply a list of correct accession
numbers in the form. If you want to select more than one accession number,
please separate the accession numbers by commas or spaces.
On the Genomatix server accession numbers from the following databases can be entered:
|
| Alignment Parameters | |
|---|---|
| Threshold T | DiAlign uses diagonals to construct an alignment. The threshold T influences the set of used diagonals: with T > 0, a diagonal is considered for alignment only if its weight exceeds this threshold. Regions of lower similarity are not aligned. DiAlign usually produces reasonable alignments without a threshold,
i.e. with T = 0. These parameters are hidden by default. You can use the
|
| Output Parameters | |
| Display of alignment | '*' signs below alignment
'*' characters are used in the DiAlign output to create a pseudo-graphical representation indicating
In the first two cases, the user can specify the maximum number of '*' characters per column in the program output thus changing the resolution of the graphics. In the other two cases, one '*' signs denotes identical or variable positions, respectively. The latter two options are especially suited for very similar sequences where one is interested only in the mismatches within an alignment. These parameters are hidden by default. You can use the
|
| Number of nucleic/amino acids per line
The default number of nucleic/amino acids per line in the alignment output is 50. It can be set to 0 (= unlimited) so that the complete alignment is shown in one line. These parameters are hidden by default. You can use the
|
|
| TF search parameters | |
| MatInspector parameters | The parameters here correspond to the following MatInspector parameters: |
| Display of TFs | By default, TF binding sites are displayed if they are common to at least 85 % of the input sequences and are located at the same position within the alignment. It is also possible to display all TF binding sites identified by MatInspector or all TF binding sites common to a user-definable number of sequences. |
| Email address | Here you can choose between two methods for receiving
the results:
The results will be available for a limited time on our server. For details of how long your results will be kept please see the result-email. After that period they will be deleted unless protected in the project management! |
| V$RORA | V$HMTB | V$NFAT | V$ZFHX |
User-defined regions of the alignment (e.g. conserved regions of the alignment without TF binding site matches) can be extracted to a sequence file. This sequence file can then be used as input for MatDefine to define a new matrix for a putative TF binding site.
| Extract alignment region | |
|---|---|
| Positions in alignment | Enter the start and end position of the alignment region that should be extracted. |
| Output file | Enter the name of the sequence file. The sequences will be saved in your personal sequence directory. |
| © 1998-2013 Genomatix Software GmbH - All rights reserved |