Each unigene build series has 3 files: .seq - FASTA file containing the unigene consensus sequence .qual - FASTA file containing the composite "base quality" scores derived from the phred scores of the ESTs that were aligned -membership.tdv - Tab delimited file indicating which SGN ESTs (by SGN-E#) are "contained" in which SGN unigenes (SGN-U#). Columns are defined as follows: SGN-U# SGN-E# start stop qstart qend direction SGN-U# and SGN-E# are SGN identifiers for the unigene and est respectively. start & stop are the starting and ending (respectively) base positions relative to the consensus sequence. If these are different from qstart and qend, the difference between qstart and start, as well as qend and end, indicates that the leading and/or trailing portion of the EST is not aligned with the consensus sequence and did not influence the consensus bases. qstart & qend are the "quality" starting and ending positions. This means that the EST is aligned with the consensus sequence over the indicated region and the EST bases influenced the consensus bases. direction inidicates whether the given EST sequence was assembled ("+"), or its reverse complement sequence ("-"). By "given sequence" we simply mean that found when reading the chromatogram, or the direction given by SGN's database if queried with the SGN-E# for the EST in question. The directory also contains a file with a conversion table from tigr tc # to sgn unigene #.