nextupprevious
Next: Variable to Axes Up: DESIGN Previous: DESIGN

Variables

Our multivariate data come from alignment reports generated by biological sequence similarity algorithms. Each report consists of an input sequence and many alignments. Each alignment is a match between a subsequence of the input sequence and a subsequence of a sequence from the database. Thus, an alignment indicates a region of similarity between two sequences. Such alignments are the basic elements in our visualization system. Associated with each alignment is the following information:

The PAM and BLOSUM Evolutionary Distances are not given by the alignment report, rather AV computes these as needed. The entry date is also not given in the report, but is retrieved from a separate database as the report is read into the system.

For an alignment, in addition to the above variables, there is the matching vector itself. This vector, represented by an array of integers, contains the residue pair scores of the matches starting from the first matching position. Therefore, an alignment can be viewed as a twelve dimensional point (for the above twelve variables), plus a matching vector.



nextupprevious
Next: Variable to Axes Up: DESIGN Previous: DESIGN



Ed H. Chi
Thu Jul 11 10:52:57 CDT 1996

The views and opinions expressed in this page are strictly those of the page author.
The contents of this page have not been reviewed or approved by the University of Minnesota.