GeneSeqer. Version of March 12, 2006.
Date run: Mon Aug 28 22:14:55 2006

(Bayesian) Splice site model (species): Arabidopsis thaliana
Fast search parameters: MinMatchLen 16, MinQualityHSP 30, MinQualityCHAIN 50.

Total number of ESTs: 34829
Total sequence length: 35392868
Minimum sequence length: 63
Maximum sequence length: 5381
Length distribution (number of sequences of specified length):
 <  100: 4
 <  200: 53
 <  300: 143
 <  400: 353
 <  500: 791
 <  600: 1674
 <  700: 2583
 <  800: 4023
 <  900: 6481
 < 1000: 5706
 >=1000: 13018

Input file : /tmp/bac-submission-temp-Kw6uM/C06HBa0054K13/C06HBa0054K13.seq.screen

________________________________________________________________________________
Sequence 1: C06HBa0054K13.1-1, from 1 to 10775, both strands analyzed.
... started at: Mon Aug 28 22:23:28 2006

EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA +strand ...
... found all matches, elapsed seconds = 1
... matches indexed, elapsed seconds = 1
HitsTableSize = 3
EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA -strand ...
... found all matches, elapsed seconds = 1
...
matches indexed, elapsed seconds = 2 HitsTableSize = 0 ******************************************************************************** EST sequence 2 +strand 1849 n (File: SGN-U312832+) 1 TTCAGACAAA AATGGCAAAG AGTGGCATTT TGGTAATTGT TTCAGCTCTT GTTGTTCTTG 61 CAGTTTGTGG TGTTTTTGCT GAGGAGAACG AATATGTGTT GACTTTGGAC CATTCTAACC 121 TCACTGAGAC TGTTGCTAAG CACAACTTCA TTGTTGTTGA ATTCTATGCA CCTTGGTGTG 181 GACACTGTAA GAGTCTTGCT CCTGAGTATG AAAAAGCTGC CTCAGAGCTG AGTAGTCATG 241 ACCCTCCAAT TGTTCTAGCT AAGTATGATG CAAATGATGA AGCCAATAGA GAACTTTCAA 301 AACAGTACGA GATCCAGGGT TTCCCAACTA TTAAGATATT GAGAGATGGA GGAAAGAAAG 361 TTCAAGACTA TAACGGTCCT CGTGAAGCAG CTGGTATTGT ATCCTACTTG AAGAAACAAG 421 TGGGTCCTGC ATCTGCTGAA ATCAAGTCGA AGGAAGATGC CACAAACCTT ATTGATGAGA 481 AAAGTATCTT TGTTGTTGGT ATATTTCCAG ACCCCTCCGG AGAGAAATTC GAGAACTATT 541 TAACGCTAGC TGAAAAACTG CGAGGCGAGT TCGATTTTGC TCACACTGTT GATGCTAAAC 601 ACCTCCCTCG GGGTGGACCA GTCAACAAGC CCACTCTTCG TCTTCTAAAG CCATTTGATG 661 AACTCTTTGT TGATTTTGAG GACTTTGATG TCGATGCAAT GGAGAAGTTC ATCTCAGAAT 721 CTAGTATTCC TGTTGTTACT ATTTTTGACA ATGACCCAAA CAACCATCCT TATGTTAACA 781 AGTTCTTCGA AGGCACCAAC GCCAAGGCAT TGCTATTTGT GAACTTTAGC TCTGAATTTG 841 ATGCTTTTAA GTCCAAGTAC AACGATGTTG CTGTGATTTA CAAAGGGGAT GGGGTGAGCT 901 TTCTCTTGGG TGATGTTGAG GCTGGTCAAG GTGCTTTTGA GTACTTCGGA CTGAAGCCGG 961 AACAGGCACC TGTGATCATC ATAATGGACG CTGATGAACA AAAGTATATT AAGGACCATG 1021 TGGAACCTGA TGCCATTGCT GCTTACTTGA AGGATTACAA GGAAGGAAAA CTGAAGCCAC 1081 ATGTGAAGTC AGAGCCCATC CCTGAAGTCA ATGACGAACC TGTTAAGGTG GTTGTTAGGG 1141 ATACCCTCCA GGATATGGTT TACAAATCGG GAAAAAATGT GCTGTTAGAG TTCTATGCAC 1201 CTTGGTGTGG CCACTGCAAG AGTCTGGCTC CAATTTTGGA TGAAGTGGCT GTATCATTTG 1261 AAAGCGATCC TGATGTTCTC ATTGCAAAAC TGGACGCAAC CGCAAATGAT CTCCCGAAAG 1321 GTGACTTTGA TGTTCAGGGA TTCCCTACTA TGTACTTCAG ATCCGCCTCT GGTAACTTGT 1381 CACAGTACAA TGGTGAGAGA ACAAAAGAGG CTATCATCGA ATTCATCGAG AAGAATCGTG 1441 GCAAGCCTGC TCAGTCAGAC TCTGCCAAAG TCGATTCAGC AAAGGATGAA CTTTAGAGGA 1501 CTCTAGGAAC ATTGTGTACT GGTGGATTCA AGTTTTGTTG 
GAAGCATTGT GTTTCTGGTG 1561 AATTCGACCT CCCACCGGAC TACTGCTCTC TCCCAACGCT TCCGTTGTAG TTTTTGGAGC 1621 TTTTCAGCGC CAATAAAACG GTTGTATGTA TCCATTTTGT GTATCGAATG TAGCTGGATT 1681 ATGAGTTTAT ATTATATCTA TTGAGAAGTT CTCCAACTTT ATAGTAAAAA AAAAAAAAAA 1741 AATCTTCTTT GTCCCAGAAT GCTACTGTTG GAATGTATGC CTCCTGTTGC AGTAAATAAT 1801 GATACACAAT AATTAAAGAA GTTCAAAAAA AAAAAAAAAA AAAAACTCG Predicted gene structure (within gDNA segment 1 to 1891): Exon 1 1 108 ( 108 n); cDNA 1185 1292 ( 108 n); score: 1.000 Intron 1 109 428 ( 320 n); Pd: 1.000 (s: 1.00), Pa: 0.935 (s: 1.00) Exon 2 429 634 ( 206 n); cDNA 1293 1498 ( 206 n); score: 1.000 Intron 2 635 732 ( 98 n); Pd: 0.988 (s: 1.00), Pa: 0.846 (s: 1.00) Exon 3 733 1060 ( 328 n); cDNA 1499 1826 ( 328 n); score: 0.966 PPA cDNA 1827 1846 MATCH C06HBa0054K13.1-1+ SGN-U312832+ 0.983 642 0.347 C PGS_C06HBa0054K13.1-1+_SGN-U312832+ (1 108,429 634,733 1060) Alignment (genomic DNA sequence = upper lines): TTAGAGTTCT ATGCACCTTG GTGTGGCCAC TGCAAGAGTC TGGCTCCAAT TTTGGATGAA 60 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTAGAGTTCT ATGCACCTTG GTGTGGCCAC TGCAAGAGTC TGGCTCCAAT TTTGGATGAA 1244 GTGGCTGTAT CATTTGAAAG CGATCCTGAT GTTCTCATTG CAAAACTGGT AAGAACTTTT 120 |||||||||| |||||||||| |||||||||| |||||||||| |||||||| GTGGCTGTAT CATTTGAAAG CGATCCTGAT GTTCTCATTG CAAAACTG.. .......... 1292 TTCATAAACT AAACATGGAT ATTTGATGCA TGTATCTACT TCATCCTTTG TTACATTAAG 180 .......... .......... .......... .......... .......... .......... 1292 TAACGGCATA ATAATCAAGA AAGAATGTGT AGAATTAGAA TCGGGTTTTT AGTATAAAAA 240 .......... .......... .......... .......... .......... .......... 1292 TCAGAAAGAA CGGGGTGGAC ATTTTATCTT TGATGTTATG GATTTCGTTG TTAGAGAGAT 300 .......... .......... .......... .......... .......... .......... 1292 TTTTAGTAGA AAACATCTGT AAAGTGTATC AAAAAAGAAG GTTTGTATGG CTTAAACACG 360 .......... .......... .......... .......... .......... .......... 1292 ATTAGCAGTT GTAATGCACA GGACATGTAT TTGTTGTCAT TTCCTCTCAC CTGCTTCCTC 420 .......... .......... .......... 
.......... .......... .......... 1292 ATTTACAGGA CGCAACCGCA AATGATCTCC CGAAAGGTGA CTTTGATGTT CAGGGATTCC 480 || |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ........GA CGCAACCGCA AATGATCTCC CGAAAGGTGA CTTTGATGTT CAGGGATTCC 1344 CTACTATGTA CTTCAGATCC GCCTCTGGTA ACTTGTCACA GTACAATGGT GAGAGAACAA 540 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTACTATGTA CTTCAGATCC GCCTCTGGTA ACTTGTCACA GTACAATGGT GAGAGAACAA 1404 AAGAGGCTAT CATCGAATTC ATCGAGAAGA ATCGTGGCAA GCCTGCTCAG TCAGACTCTG 600 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAGAGGCTAT CATCGAATTC ATCGAGAAGA ATCGTGGCAA GCCTGCTCAG TCAGACTCTG 1464 CCAAAGTCGA TTCAGCAAAG GATGAACTTT AGAGGTTTGG TTCTCTTCCT TAAAGCATCA 660 |||||||||| |||||||||| |||||||||| |||| CCAAAGTCGA TTCAGCAAAG GATGAACTTT AGAG...... .......... .......... 1498 ATTGTGATAT CTGCCATTAT TGGTTCTTTG TGCTACAGTA ATTGTAATTT ATATTGATCT 720 .......... .......... .......... .......... .......... .......... 1498 CCTCTTATTC AGGACTCTAG GAACATTGTG TACTGGTGGA TTCAAGTTTT GTTGGAAGCA 780 |||||||| |||||||||| |||||||||| |||||||||| |||||||||| .......... 
..GACTCTAG GAACATTGTG TACTGGTGGA TTCAAGTTTT GTTGGAAGCA 1546 TTGTGTTTCT GGTGAATTCG ACCTCCCACC GGACTACTGC TCTCTCCCAA CGCTTCCGTT 840 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGTGTTTCT GGTGAATTCG ACCTCCCACC GGACTACTGC TCTCTCCCAA CGCTTCCGTT 1606 GTAGTTTTTG GAGCTTTTCA GCGCCAATAA AACGGTTGTA TGTATCCATT TTGTGTATCG 900 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTAGTTTTTG GAGCTTTTCA GCGCCAATAA AACGGTTGTA TGTATCCATT TTGTGTATCG 1666 AATGTAGCTG GATTATGAGT TTATATTATA TCTATTGAGA AGTTCTCCAA CTTTATAGTA 960 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AATGTAGCTG GATTATGAGT TTATATTATA TCTATTGAGA AGTTCTCCAA CTTTATAGTA 1726 TCAGCAGTTG AAGTTATCTT CTTTGTCCCA GAATGCTACT GTTGGAATGT ATGCCTCCTG 1020 | | || ||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAAAAAAAA AAAAAATCTT CTTTGTCCCA GAATGCTACT GTTGGAATGT ATGCCTCCTG 1786 TTGCAGTAAA TAATGATACA CAATAATTAA AGAAGTTCAA 1060 |||||||||| |||||||||| |||||||||| |||||||||| TTGCAGTAAA TAATGATACA CAATAATTAA AGAAGTTCAA 1826 hqPGS_C06HBa0054K13.1-1+_SGN-U312832+ (1 108,429 634,733 1060) ******************************************************************************** EST sequence 1 +strand 767 n (File: SGN-U312831+) 1 TTTGGAATTG CAGTACTTCG GACTGAAGCC GGAACAGGCA CCTGTGATCA TCATAATGGA 61 CGCTGATGAA CAAAAGTATA TTAAGGACCA TGTGGAACCT GATGCCATTG CTGCTTACTT 121 GAAGGATTAC AAGGTGATGC ATCCTCTTCT TTTTCTTGGA TATATTTTGG AATGAAAGCG 181 TTGTTAATTG TGAACAATAT TCTGCACGGA AGGAAAGGCT TACTAACCTT TTTCCTCTAA 241 TTTTCCACAT TGTTACAGGA AGGAAAACTG AAGCCACATG TGAAGTCAGA GCCCATCCCT 301 GAAGTCAATG ACGAACCTGT TAAGGTGGTT GTTAGGGATA CCCTCCAGGA TATGGTTTAC 361 AAATCGGGAA AAAATGGTGC GTGTCTGTCA ATATTTTAAT CTTTATTCGA TGTCTTGGTT 421 AAGAAAGAGT TGTTTCTTTG TTGTACCATT TCGTTCTCCC TCTTTGGTGT TTACTTGCAT 481 TTCTTTACTA GTGTGTAAGG TCGTGATGAG GAGTTGGATG TCGTAAATTA ACTGATTGTG 541 GAACTATTAT ATGTAGTAGG AGAAAGAGGG AGCAGTCATC AGCTAATTGC CCGCGGGTTT 601 GACTTTAACA ATAGTTGTGC TCTCCTAAAT TATTATTCTT TTTGTCTGCT CAGTGCTGTT 661 AGAGTTCTAT 
GCACCTTGGT GTGGCCACTG CAAGAGTCTG GCTCCAATTT TGGATGAAGT 721 GGCTGTATCA TTTGAAAGCG ATCCTGATGT TCTCATTGCA AAACTGG Predicted gene structure (within gDNA segment 1 to 719): Exon 1 1 108 ( 108 n); cDNA 659 766 ( 108 n); score: 1.000 MATCH C06HBa0054K13.1-1+ SGN-U312831+ 1.000 108 0.150 G PGS_C06HBa0054K13.1-1+_SGN-U312831+ (1 108) Alignment (genomic DNA sequence = upper lines): TTAGAGTTCT ATGCACCTTG GTGTGGCCAC TGCAAGAGTC TGGCTCCAAT TTTGGATGAA 60 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTAGAGTTCT ATGCACCTTG GTGTGGCCAC TGCAAGAGTC TGGCTCCAAT TTTGGATGAA 718 GTGGCTGTAT CATTTGAAAG CGATCCTGAT GTTCTCATTG CAAAACTG 108 |||||||||| |||||||||| |||||||||| |||||||||| |||||||| GTGGCTGTAT CATTTGAAAG CGATCCTGAT GTTCTCATTG CAAAACTG 766 hqPGS_C06HBa0054K13.1-1+_SGN-U312831+ (1 108) ******************************************************************************** EST sequence 3 +strand 773 n (File: SGN-U312833+) 1 AGAAAGAATG TGTAGAATTA GAATCGGGTT TTTAGTATAA AAATCAGAAA GAACGGGGTG 61 GACATTTTAT CTTTGATGTT ATGGATTTCG TTGTTAGAGA GATTTTTAGT AGAAAACATC 121 TGTAAAGTGT ATCAAAAAAG AAGGTTTGTA TGGCTTAAAC ACGATTAGCA GTTGTAATGC 181 ACAGGACATG TATTTGTTGT CATTTCCTCT CACCTGCTTC CTCATTTACA GGACGCAACC 241 GCAAATGATC TCCCGAAAGG TGACTTTGAT GTTCAGGGAT TCCCTACTAT GTACTTCAGA 301 TCCGCCTCTG GTAACTTGTC ACAGTACAAT GGTGAGAGAA CAAAAGAGGC TATCATCGAA 361 TTCATCGAGA AGAATCGTGG CAAGCCTGCT CAGTCAGACT CTGCCAAAGT CGATTCAGCA 421 AAGGATGAAC TTTAGAGGTT TGGTTCTCTT CCTTAAAGCA TCAATTGTGA TATCTGCCAT 481 TATTGGTTCT TTGTGCTACA GTAATTGTAA TTTATATTGA TCTCCTCTTA TTCAGGACTC 541 TAGGAACATT GTGTACTGGT GGATTCAAGT TTTGTTGGAA GCATTGTGTT TCTGGTGAAT 601 TCGACCTCCC ACCGGACTAC TGCTCTCTCC CAACGCTTCC GTTGTAGTTT TTGGAGCTTT 661 TCAGCGCCAA TAAAACGGTT GTATGTATCC ATTTTGTGTA TCGAATGTAG CTGGATTATG 721 AGTTTATATT ATATCTATTG AGAAGTTCTC CAACTTTATA GTAAAAAAAA AAA Predicted gene structure (within gDNA segment 1 to 1670): Exon 1 198 960 ( 763 n); cDNA 1 763 ( 763 n); score: 1.000 PPA cDNA 764 773 MATCH C06HBa0054K13.1-1+ SGN-U312833+ 
1.000 763 0.987 C PGS_C06HBa0054K13.1-1+_SGN-U312833+ (198 960) Alignment (genomic DNA sequence = upper lines): AGAAAGAATG TGTAGAATTA GAATCGGGTT TTTAGTATAA AAATCAGAAA GAACGGGGTG 257 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGAAAGAATG TGTAGAATTA GAATCGGGTT TTTAGTATAA AAATCAGAAA GAACGGGGTG 60 GACATTTTAT CTTTGATGTT ATGGATTTCG TTGTTAGAGA GATTTTTAGT AGAAAACATC 317 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GACATTTTAT CTTTGATGTT ATGGATTTCG TTGTTAGAGA GATTTTTAGT AGAAAACATC 120 TGTAAAGTGT ATCAAAAAAG AAGGTTTGTA TGGCTTAAAC ACGATTAGCA GTTGTAATGC 377 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGTAAAGTGT ATCAAAAAAG AAGGTTTGTA TGGCTTAAAC ACGATTAGCA GTTGTAATGC 180 ACAGGACATG TATTTGTTGT CATTTCCTCT CACCTGCTTC CTCATTTACA GGACGCAACC 437 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACAGGACATG TATTTGTTGT CATTTCCTCT CACCTGCTTC CTCATTTACA GGACGCAACC 240 GCAAATGATC TCCCGAAAGG TGACTTTGAT GTTCAGGGAT TCCCTACTAT GTACTTCAGA 497 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCAAATGATC TCCCGAAAGG TGACTTTGAT GTTCAGGGAT TCCCTACTAT GTACTTCAGA 300 TCCGCCTCTG GTAACTTGTC ACAGTACAAT GGTGAGAGAA CAAAAGAGGC TATCATCGAA 557 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCCGCCTCTG GTAACTTGTC ACAGTACAAT GGTGAGAGAA CAAAAGAGGC TATCATCGAA 360 TTCATCGAGA AGAATCGTGG CAAGCCTGCT CAGTCAGACT CTGCCAAAGT CGATTCAGCA 617 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCATCGAGA AGAATCGTGG CAAGCCTGCT CAGTCAGACT CTGCCAAAGT CGATTCAGCA 420 AAGGATGAAC TTTAGAGGTT TGGTTCTCTT CCTTAAAGCA TCAATTGTGA TATCTGCCAT 677 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAGGATGAAC TTTAGAGGTT TGGTTCTCTT CCTTAAAGCA TCAATTGTGA TATCTGCCAT 480 TATTGGTTCT TTGTGCTACA GTAATTGTAA TTTATATTGA TCTCCTCTTA TTCAGGACTC 737 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TATTGGTTCT TTGTGCTACA GTAATTGTAA TTTATATTGA TCTCCTCTTA TTCAGGACTC 540 TAGGAACATT GTGTACTGGT GGATTCAAGT 
TTTGTTGGAA GCATTGTGTT TCTGGTGAAT 797 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TAGGAACATT GTGTACTGGT GGATTCAAGT TTTGTTGGAA GCATTGTGTT TCTGGTGAAT 600 TCGACCTCCC ACCGGACTAC TGCTCTCTCC CAACGCTTCC GTTGTAGTTT TTGGAGCTTT 857 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCGACCTCCC ACCGGACTAC TGCTCTCTCC CAACGCTTCC GTTGTAGTTT TTGGAGCTTT 660 TCAGCGCCAA TAAAACGGTT GTATGTATCC ATTTTGTGTA TCGAATGTAG CTGGATTATG 917 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCAGCGCCAA TAAAACGGTT GTATGTATCC ATTTTGTGTA TCGAATGTAG CTGGATTATG 720 AGTTTATATT ATATCTATTG AGAAGTTCTC CAACTTTATA GTA 960 |||||||||| |||||||||| |||||||||| |||||||||| ||| AGTTTATATT ATATCTATTG AGAAGTTCTC CAACTTTATA GTA 763 hqPGS_C06HBa0054K13.1-1+_SGN-U312833+ (198 960) Total number of EST alignments reported: 3 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 10775: PGL 1 (+ strand): 1 1060 AGS-1 (1 108,429 634,733 1060) SCR (e 1.000 d 1.000 a 0.935,e 1.000 d 0.988 a 0.846,e 0.966) Exon 1 1 108 ( 108 n); score: 1.000 Intron 1 109 428 ( 320 n); Pd: 1.000 Pa: 0.935 Exon 2 429 634 ( 206 n); score: 1.000 Intron 2 635 732 ( 98 n); Pd: 0.988 Pa: 0.846 Exon 3 733 1060 ( 328 n); score: 0.966 PGS (1 108,429 634,733 1060) SGN-U312832+ PGS (1 108) SGN-U312831+ 3-phase translation of AGS-1 (+strand): . . . . . . 1 TTAGAGTTCTATGCACCTTGGTGTGGCCACTGCAAGAGTCTGGCTCCAATTTTGGATGAA L E F Y A P W C G H C K S L A P I L D E - S S M H L G V A T A R V W L Q F W M K R V L C T L V W P L Q E S G S N F G - . . . . . : . 61 GTGGCTGTATCATTTGAAAGCGATCCTGATGTTCTCATTGCAAAACTG : GACGCAACCGCA V A V S F E S D P D V L I A K L : D A T A W L Y H L K A I L M F S L Q N W : T Q P Q S G C I I - K R S - C S H C K T : G R N R . . . . . . 441 AATGATCTCCCGAAAGGTGACTTTGATGTTCAGGGATTCCCTACTATGTACTTCAGATCC N D L P K G D F D V Q G F P T M Y F R S M I S R K V T L M F R D S L L C T S D P K - S P E R - L - C S G I P Y Y V L Q I . . . . . 
. 501 GCCTCTGGTAACTTGTCACAGTACAATGGTGAGAGAACAAAAGAGGCTATCATCGAATTC A S G N L S Q Y N G E R T K E A I I E F P L V T C H S T M V R E Q K R L S S N S R L W - L V T V Q W - E N K R G Y H R I . . . . . . 561 ATCGAGAAGAATCGTGGCAAGCCTGCTCAGTCAGACTCTGCCAAAGTCGATTCAGCAAAG I E K N R G K P A Q S D S A K V D S A K S R R I V A S L L S Q T L P K S I Q Q R H R E E S W Q A C S V R L C Q S R F S K . . : . . . . 621 GATGAACTTTAGAG : GACTCTAGGAACATTGTGTACTGGTGGATTCAAGTTTTGTTGGAAG D E L - R : T L G T L C T G G F K F C W K M N F R : G L - E H C V L V D S S F V G S G - T L E : D S R N I V Y W W I Q V L L E . . . . . . 779 CATTGTGTTTCTGGTGAATTCGACCTCCCACCGGACTACTGCTCTCTCCCAACGCTTCCG H C V S G E F D L P P D Y C S L P T L P I V F L V N S T S H R T T A L S Q R F R A L C F W - I R P P T G L L L S P N A S . . . . . . 839 TTGTAGTTTTTGGAGCTTTTCAGCGCCAATAAAACGGTTGTATGTATCCATTTTGTGTAT L - F L E L F S A N K T V V C I H F V Y C S F W S F S A P I K R L Y V S I L C I V V V F G A F Q R Q - N G C M Y P F C V . . . . . . 899 CGAATGTAGCTGGATTATGAGTTTATATTATATCTATTGAGAAGTTCTCCAACTTTATAG R M - L D Y E F I L Y L L R S S P T L - E C S W I M S L Y Y I Y - E V L Q L Y S S N V A G L - V Y I I S I E K F S N F I . . . . . . 959 TATCAGCAGTTGAAGTTATCTTCTTTGTCCCAGAATGCTACTGTTGGAATGTATGCCTCC Y Q Q L K L S S L S Q N A T V G M Y A S I S S - S Y L L C P R M L L L E C M P P V S A V E V I F F V P E C Y C W N V C L . . . . . 
1019 TGTTGCAGTAAATAATGATACACAATAATTAAAGAAGTTCAA C C S K - - Y T I I K E V Q V A V N N D T Q - L K K F L L Q - I M I H N N - R S S Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-1+_PGL-1_AGS-1_PPS_1 (5 108,429 634,733 740) (frame '2'; 315 bp, 105 residues) 1 SSMHLGVATA RVWLQFWMKW LYHLKAILMF SLQNWTQPQM ISRKVTLMFR DSLLCTSDPP 61 LVTCHSTMVR EQKRLSSNSS RRIVASLLSQ TLPKSIQQRM NFRGL- >C06HBa0054K13.1-1+_PGL-1_AGS-1_PPS_2 (1 108,429 632) (frame '1'; 309 bp, 103 residues) 1 LEFYAPWCGH CKSLAPILDE VAVSFESDPD VLIAKLDATA NDLPKGDFDV QGFPTMYFRS 61 ASGNLSQYNG ERTKEAIIEF IEKNRGKPAQ SDSAKVDSAK DEL- >C06HBa0054K13.1-1+_PGL-1_AGS-1_PPS_3 (741 938) (frame '2'; 195 bp, 65 residues) 1 EHCVLVDSSF VGSIVFLVNS TSHRTTALSQ RFRCSFWSFS APIKRLYVSI LCIECSWIMS 61 LYYIY- AGS-2 (198 960) SCR (e 1.000) Exon 1 198 960 ( 763 n); score: 1.000 PGS (198 960) SGN-U312833+ 3-phase translation of AGS-2 (+strand): . . . . . . 198 AGAAAGAATGTGTAGAATTAGAATCGGGTTTTTAGTATAAAAATCAGAAAGAACGGGGTG R K N V - N - N R V F S I K I R K N G V E R M C R I R I G F L V - K S E R T G W K E C V E L E S G F - Y K N Q K E R G . . . . . . 258 GACATTTTATCTTTGATGTTATGGATTTCGTTGTTAGAGAGATTTTTAGTAGAAAACATC D I L S L M L W I S L L E R F L V E N I T F Y L - C Y G F R C - R D F - - K T S G H F I F D V M D F V V R E I F S R K H . . . . . . 318 TGTAAAGTGTATCAAAAAAGAAGGTTTGTATGGCTTAAACACGATTAGCAGTTGTAATGC C K V Y Q K R R F V W L K H D - Q L - C V K C I K K E G L Y G L N T I S S C N A L - S V S K K K V C M A - T R L A V V M . . . . . . 378 ACAGGACATGTATTTGTTGTCATTTCCTCTCACCTGCTTCCTCATTTACAGGACGCAACC T G H V F V V I S S H L L P H L Q D A T Q D M Y L L S F P L T C F L I Y R T Q P H R T C I C C H F L S P A S S F T G R N . . . . . . 438 GCAAATGATCTCCCGAAAGGTGACTTTGATGTTCAGGGATTCCCTACTATGTACTTCAGA A N D L P K G D F D V Q G F P T M Y F R Q M I S R K V T L M F R D S L L C T S D R K - S P E R - L - C S G I P Y Y V L Q . . . . . . 
498 TCCGCCTCTGGTAACTTGTCACAGTACAATGGTGAGAGAACAAAAGAGGCTATCATCGAA S A S G N L S Q Y N G E R T K E A I I E P P L V T C H S T M V R E Q K R L S S N I R L W - L V T V Q W - E N K R G Y H R . . . . . . 558 TTCATCGAGAAGAATCGTGGCAAGCCTGCTCAGTCAGACTCTGCCAAAGTCGATTCAGCA F I E K N R G K P A Q S D S A K V D S A S S R R I V A S L L S Q T L P K S I Q Q I H R E E S W Q A C S V R L C Q S R F S . . . . . . 618 AAGGATGAACTTTAGAGGTTTGGTTCTCTTCCTTAAAGCATCAATTGTGATATCTGCCAT K D E L - R F G S L P - S I N C D I C H R M N F R G L V L F L K A S I V I S A I K G - T L E V W F S S L K H Q L - Y L P . . . . . . 678 TATTGGTTCTTTGTGCTACAGTAATTGTAATTTATATTGATCTCCTCTTATTCAGGACTC Y W F F V L Q - L - F I L I S S Y S G L I G S L C Y S N C N L Y - S P L I Q D S L L V L C A T V I V I Y I D L L L F R T . . . . . . 738 TAGGAACATTGTGTACTGGTGGATTCAAGTTTTGTTGGAAGCATTGTGTTTCTGGTGAAT - E H C V L V D S S F V G S I V F L V N R N I V Y W W I Q V L L E A L C F W - I L G T L C T G G F K F C W K H C V S G E . . . . . . 798 TCGACCTCCCACCGGACTACTGCTCTCTCCCAACGCTTCCGTTGTAGTTTTTGGAGCTTT S T S H R T T A L S Q R F R C S F W S F R P P T G L L L S P N A S V V V F G A F F D L P P D Y C S L P T L P L - F L E L . . . . . . 858 TCAGCGCCAATAAAACGGTTGTATGTATCCATTTTGTGTATCGAATGTAGCTGGATTATG S A P I K R L Y V S I L C I E C S W I M Q R Q - N G C M Y P F C V S N V A G L - F S A N K T V V C I H F V Y R M - L D Y . . . . . 
918 AGTTTATATTATATCTATTGAGAAGTTCTCCAACTTTATAGTA S L Y Y I Y - E V L Q L Y S V Y I I S I E K F S N F I V E F I L Y L L R S S P T L - Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-1+_PGL-1_AGS-2_PPS_1 (310 717) (frame '2'; 405 bp, 135 residues) 1 KTSVKCIKKE GLYGLNTISS CNAQDMYLLS FPLTCFLIYR TQPQMISRKV TLMFRDSLLC 61 TSDPPLVTCH STMVREQKRL SSNSSRRIVA SLLSQTLPKS IQQRMNFRGL VLFLKASIVI 121 SAIIGSLCYS NCNLY- >C06HBa0054K13.1-1+_PGL-1_AGS-2_PPS_2 (741 938) (frame '1'; 195 bp, 65 residues) 1 EHCVLVDSSF VGSIVFLVNS TSHRTTALSQ RFRCSFWSFS APIKRLYVSI LCIECSWIMS 61 LYYIY- 3-phase translation of AGS-2 (-strand): . . . . . . 960 TACTATAAAGTTGGAGAACTTCTCAATAGATATAATATAAACTCATAATCCAGCTACATT Y Y K V G E L L N R Y N I N S - S S Y I T I K L E N F S I D I I - T H N P A T F L - S W R T S Q - I - Y K L I I Q L H . . . . . . 900 CGATACACAAAATGGATACATACAACCGTTTTATTGGCGCTGAAAAGCTCCAAAAACTAC R Y T K W I H T T V L L A L K S S K N Y D T Q N G Y I Q P F Y W R - K A P K T T S I H K M D T Y N R F I G A E K L Q K L . . . . . . 840 AACGGAAGCGTTGGGAGAGAGCAGTAGTCCGGTGGGAGGTCGAATTCACCAGAAACACAA N G S V G R E Q - S G G R S N S P E T Q T E A L G E S S S P V G G R I H Q K H N Q R K R W E R A V V R W E V E F T R N T . . . . . . 780 TGCTTCCAACAAAACTTGAATCCACCAGTACACAATGTTCCTAGAGTCCTGAATAAGAGG C F Q Q N L N P P V H N V P R V L N K R A S N K T - I H Q Y T M F L E S - I R G M L P T K L E S T S T Q C S - S P E - E . . . . . . 720 AGATCAATATAAATTACAATTACTGTAGCACAAAGAACCAATAATGGCAGATATCACAAT R S I - I T I T V A Q R T N N G R Y H N D Q Y K L Q L L - H K E P I M A D I T I E I N I N Y N Y C S T K N Q - W Q I S Q . . . . . . 660 TGATGCTTTAAGGAAGAGAACCAAACCTCTAAAGTTCATCCTTTGCTGAATCGACTTTGG - C F K E E N Q T S K V H P L L N R L W D A L R K R T K P L K F I L C - I D F G L M L - G R E P N L - S S S F A E S T L . . . . . . 
600 CAGAGTCTGACTGAGCAGGCTTGCCACGATTCTTCTCGATGAATTCGATGATAGCCTCTT Q S L T E Q A C H D S S R - I R - - P L R V - L S R L A T I L L D E F D D S L F A E S D - A G L P R F F S M N S M I A S . . . . . . 540 TTGTTCTCTCACCATTGTACTGTGACAAGTTACCAGAGGCGGATCTGAAGTACATAGTAG L F S H H C T V T S Y Q R R I - S T - - C S L T I V L - Q V T R G G S E V H S R F V L S P L Y C D K L P E A D L K Y I V . . . . . . 480 GGAATCCCTGAACATCAAAGTCACCTTTCGGGAGATCATTTGCGGTTGCGTCCTGTAAAT G I P E H Q S H L S G D H L R L R P V N E S L N I K V T F R E I I C G C V L - M G N P - T S K S P F G R S F A V A S C K . . . . . . 420 GAGGAAGCAGGTGAGAGGAAATGACAACAAATACATGTCCTGTGCATTACAACTGCTAAT E E A G E R K - Q Q I H V L C I T T A N R K Q V R G N D N K Y M S C A L Q L L I - G S R - E E M T T N T C P V H Y N C - . . . . . . 360 CGTGTTTAAGCCATACAAACCTTCTTTTTTGATACACTTTACAGATGTTTTCTACTAAAA R V - A I Q T F F F D T L Y R C F L L K V F K P Y K P S F L I H F T D V F Y - K S C L S H T N L L F - Y T L Q M F S T K . . . . . . 300 ATCTCTCTAACAACGAAATCCATAACATCAAAGATAAAATGTCCACCCCGTTCTTTCTGA I S L T T K S I T S K I K C P P R S F - S L - Q R N P - H Q R - N V H P V L S D N L S N N E I H N I K D K M S T P F F L . . . . . 240 TTTTTATACTAAAAACCCGATTCTAATTCTACACATTCTTTCT F L Y - K P D S N S T H S F F Y T K N P I L I L H I L S I F I L K T R F - F Y T F F Maximal non-overlapping open reading frames (>= 64 codons): none ... finished at: Mon Aug 28 22:23:33 2006 ________________________________________________________________________________ Sequence 2: C06HBa0054K13.1-2, from 1 to 3478, both strands analyzed. ... started at: Mon Aug 28 22:23:33 2006 EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA +strand ... ... found all matches, elapsed seconds = 1 ... matches indexed, elapsed seconds = 1 HitsTableSize = 0 EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA -strand ... ... found all matches, elapsed seconds = 2 ... 
matches indexed, elapsed seconds = 2 HitsTableSize = 0 No significant EST matches were found. ... finished at: Mon Aug 28 22:23:36 2006 ________________________________________________________________________________ Sequence 3: C06HBa0054K13.1-3, from 1 to 11062, both strands analyzed. ... started at: Mon Aug 28 22:23:36 2006 EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA +strand ... ... found all matches, elapsed seconds = 2 ... matches indexed, elapsed seconds = 2 HitsTableSize = 1 EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA -strand ... ... found all matches, elapsed seconds = 2 ... matches indexed, elapsed seconds = 2 HitsTableSize = 1 ******************************************************************************** EST sequence 2 -strand 925 n (File: SGN-U340222-) 1 ATCTTACCGA AAGGTCTACG GTGGTTTATG GCCCGCAAAA AGAACTACAA AATGGGCAAT 61 TCANCAGCNT ACATCTNTAG TATGTAGGGT GCTAACTACT AGAAGCACCC TATATAAGTT 121 TTACTTGTAT TAATATTGAT CTTTTTAAAA TATTGTCAGA TATAGGATTT TTAAAATTTT 181 AACGCTATTG TTCCACTAAA TGACATTCAA GTAAAGTCGG GGCTCATAAA ATTCAGAATT 241 TATAAAATTT AAATTTTAAA TTCGTTCGTA CATTAAAAAA AAGGCCAAGT ATATAAACAA 301 ATTTATAAAC TTGTTGGGTT TTTTCTCAAA TACTTTAACT AGGCTATTTT CCTATTAAAT 361 TATTGAATCA CCCATAATTT TGTTCCTTTT AAACACCGTT GATCGATGTG GATCAGATTT 421 TGTTAGGGGT ATTGGTAGAT ATGCATATGT TGTTCTAGAA CTCATTAGAC CAATTGATAG 481 TGAAATTGTT CGAGAATAAT TGTATTTACT CGAAATCATT TGAACACATT AGAAAACAAA 541 TGAGACTAAC AAACAATTTG TCTTCATTTC TTCAATCAAA ATTATTACAA TACAAACACA 601 GTTGAATACA TTTACAATAG GAAAAATAAT CAAAATCAGT TAGCAGTGTT TTAAAGGAAT 661 AAAATATGAG TAGTTTATGA TTCAATAGAA AAATGGCGTA GGTAAAATAT CTAAAGAAAA 721 AAAACCTAAC AAGAAATCTG TTTATGAGTT TGACCTAAAA AAACGAACAC AAACTCAAGT 781 AAATAAGTAA ATCTCTTTAC TCACCTGGAA TTTCTGTTTT CAACACATAA TCTGTCTCAG 841 TTCCTCGTGC CGAAATCCTG CAGCCCGGGG GATCCACTAG TTCTAGAGCG GCNCGCCAAA 901 AAGGTGGAGN TNCCAGCCNC AAGTT Predicted gene 
structure (within gDNA segment 4999 to 11062): Exon 1 5833 5855 ( 23 n); cDNA 295 318 ( 24 n); score: 0.543 Intron 1 5856 10041 (4186 n); Pd: 0.900 (s: 0), Pa: 0.000 (s: 0) Exon 2 10042 10052 ( 11 n); cDNA 319 329 ( 11 n); score: 0.909 Intron 2 10053 10448 ( 396 n); Pd: 0.000 (s: 0), Pa: 0.774 (s: 0) Exon 3 10449 10455 ( 7 n); cDNA 330 336 ( 7 n); score: 0.571 Intron 3 10456 10761 ( 306 n); Pd: 0.900 (s: 0), Pa: 0.000 (s: 0.66) Exon 4 10762 11062 ( 301 n); cDNA 337 635 ( 299 n); score: 0.782 MATCH C06HBa0054K13.1-3+ SGN-U340222- 0.782 342 0.370 C PGS_C06HBa0054K13.1-3+_SGN-U340222- (5833 5855,10042 10052,10449 10455,10762 11062) Alignment (genomic DNA sequence = upper lines): AAACATATTC A-ACACTGTC TGCAATATCT TTGCTAAGAG AATTAAGGAT CCAAGATATT 5891 ||||| ||| | | ||| || AAACAAATTT ATAAACTTGT TGGG...... .......... .......... .......... 318 ACCATGTTAT CACATCTCTC CCATTGGCAA TATTGTGGAG ATGTTGAAGA AGGACGTTTG 5951 .......... .......... .......... .......... .......... .......... 318 AATTCTCCAG TAATGAAGCC TAATTTATTC TTCACAGACA ACGATCTCAT TACACCTCGT 6011 .......... .......... .......... .......... .......... .......... 318 CGCCATGATC GATAAACAAC TCCTTCAAAA ACTGCTGGAA CTAAGATCAG ACTTGGATTG 6071 .......... .......... .......... .......... .......... .......... 318 TCAGATGGAT GTATATAGAG AGGGTTACTA ATATCGATGG TGTTCCCTCC TGTTCCTGAT 6131 .......... .......... .......... .......... .......... .......... 318 GTGGAAACTT GACCAGAAGT AATCTCAACC ATGGTGAAAT CGATTCACAG GAAACAATTA 6191 .......... .......... .......... .......... .......... .......... 318 CAGAAGACAA AAAAACAGAA CAATACCTTC AGAGATAAGC TCAGAAATTG ATTGTAATAT 6251 .......... .......... .......... .......... .......... .......... 318 GAACTCGATT TCTTCAATTG ACAAAAAAGA ACAGTGATCT ACACCCGCTC TGATACCATG 6311 .......... .......... .......... .......... .......... .......... 318 TTGAATAAAG AAACCATGAA TTGTATAGCT ATATCTCCAT TGATGAAGCT TGAAGAAGAA 6371 .......... .......... .......... .......... .......... .......... 
318 GAAGCAGATT AGGTAGAAGA AGAAGGAGAT GTGAAGAAGA GAAAGATCAC CCTTCTTACA 6431 .......... .......... .......... .......... .......... .......... 318 TGTATATTAA CAGAGAATGG TATAAAACCA TGAATGTAAA GAATATGTTT ACAGATGAGT 6491 .......... .......... .......... .......... .......... .......... 318 TTGCATATCT ATATATATAC ACACTACTAA CTCAATTAAC TAACTATCTA CAACTAACTA 6551 .......... .......... .......... .......... .......... .......... 318 ATGGAAATGT AAGTTGTTAT GTAACTAACT AAGTTGTTAT GTAACTAACT AATTGAAGTG 6611 .......... .......... .......... .......... .......... .......... 318 TACCTAACTA ATTGAATGTT TATACATAAA TCAATAGTTG CAATTGTCCT TGTTTGGACA 6671 .......... .......... .......... .......... .......... .......... 318 AAAGCCTCCC CTGCACCTAA CATTTTTGCA ATGCCAAAAT CACTTACATG ACCAACCATT 6731 .......... .......... .......... .......... .......... .......... 318 TCTTCATCGA GCAAGACATT GCTTGGCTTC AAGTCACAAT GCACCACAGG CGTTGAATAG 6791 .......... .......... .......... .......... .......... .......... 318 CCATTGTGGA GATAATCTAT TGCAGATGCA ACATCTATCA TTATATCCAA TCTCTGCAAT 6851 .......... .......... .......... .......... .......... .......... 318 AAGTTCAGGA ACAAGTTGTG AGAGTATAAC CATTTGTCAA GTGTCCCGTT GGGCATGTAT 6911 .......... .......... .......... .......... .......... .......... 318 TCCAACACTA GGGCCTTGAA ATCAAGATTG GAGCAGCTGG TGATGACTTT GGTCAGATTT 6971 .......... .......... .......... .......... .......... .......... 318 CTGTGGCGAA GGTTGCGGAG CATCTCACAT TCCGTGTCAA AACTTTTGAA TGCACCCTCC 7031 .......... .......... .......... .......... .......... .......... 318 AATTGCACAT TGAATACCTT TGCTGCAAAA ATGATACCAT CCTTAAGTAT CCCTTTATAA 7091 .......... .......... .......... .......... .......... .......... 318 ACCCTGCTGA AACTCCCATT ACCAAGCAAG TTGGCTTCGT TGAATCCTTC AGTTGCCTGC 7151 .......... .......... .......... .......... .......... .......... 318 TCAAGTTCAT AATATGACAT TCTTTCATGC CCTTTTACGA GAAACTCATC TTTTTGACTT 7211 .......... .......... .......... .......... .......... .......... 
318 GCATTCTTCT TTGTGTTTCT CAATCTTAAC ACCACAAATC CAACAGACAA CGTGAAGAGT 7271 .......... .......... .......... .......... .......... .......... 318 GATCCAATCC CTAATAGAAT ATATAAACCT GTAAGCACTT GCTTTCTTCT TGACTTCTTT 7331 .......... .......... .......... .......... .......... .......... 318 GTAGATTTGG TTGGGCATGG TTTTACATTA AATCGAGAGT CACCACAAAG TGCATCATTG 7391 .......... .......... .......... .......... .......... .......... 318 AATAGGAAGG ACTGGCTAGT GATGTTTGCA AAAGGACCTC CAATAGGAAT TTCTCCACTG 7451 .......... .......... .......... .......... .......... .......... 318 AGCTTATTGA ATGAGAAATT CATGTATTTG AGATACACAA GAGCTTCTAA TGACTTTGGA 7511 .......... .......... .......... .......... .......... .......... 318 ATTTCACCAC TAAGATTGTT ATCGCACAAA TCCAAGAATT CTAATGCTAA CATTTTGCCA 7571 .......... .......... .......... .......... .......... .......... 318 AATGAATCTG GAATAGACCC CTCTAATCTA TTATGTGCCA AAGAAAGATT AATCAAGTTA 7631 .......... .......... .......... .......... .......... .......... 318 TCTAGACCCC CTAGCGTGCT AGGAATTTTA CCAGAAAAAT CATTTTTTGA CAGATCAATG 7691 .......... .......... .......... .......... .......... .......... 318 AGTGTTGCAG CCTTCAAGTT TCCAATCTCG AAGGGAATTT CCCCACTCAA TAAATTTGAT 7751 .......... .......... .......... .......... .......... .......... 318 GAAATATTTA ATTCTATGAG ATCACGAAGC CCCCCCAAGC TTGCAGGTAA TCTTGAATTA 7811 .......... .......... .......... .......... .......... .......... 318 AGCCTATTAT AAGCTAGATA AAGTGTCCTC AAACTAGTAA CACTCCCTAA GCATGGTGGC 7871 .......... .......... .......... .......... .......... .......... 318 ACTGAGCCAG AAAACTGATT TCTTGACAAG TCTAATGCAC CAAGATTGTT TAAACCGCAC 7931 .......... .......... .......... .......... .......... .......... 318 ATAACATCTG GTATGCGTCC TTCTATCTTG TTTCTTTGTA GGTAAAGTTC TTGAAGGCTC 7991 .......... .......... .......... .......... .......... .......... 318 AACATTCCTT GGACAGTATT TGGAATATGT CCCGTCAATT CGTTGTATTG CAGAGCTATG 8051 .......... .......... .......... .......... .......... .......... 
318 CTTATCACTC CTGTAAGATT ACCAATTTCT CGAGGAATGG TGCCCTTTAA TTTACAACCA 8111 .......... .......... .......... .......... .......... .......... 318 TTTCCTGCAA AATTTTTCAG TGAGTTTGAG AAATTACCAA CTGATGCAGG TAAGACACCA 8171 .......... .......... .......... .......... .......... .......... 318 TCCAATGGAT TTCCATAAAA CCAGAGTGAT CTTAGCTTCC TACACTTTGT CAATGATGTG 8231 .......... .......... .......... .......... .......... .......... 318 AGGAAGCTCA ATGTTGAATC ACTAACAAAG TTACTTCCCC ACAAGCTCAG AACCTCAAGG 8291 .......... .......... .......... .......... .......... .......... 318 TACATTAAGT TACCAAGTGA TTCTGGAATC GGACCTGTGA AACTATTGCC TGAGAGCTCA 8351 .......... .......... .......... .......... .......... .......... 318 AGTATTCTGA GTCTTGAAGA ATTTGAGATA GCAGGAGAGA TAAAACCACT AAAATTATTT 8411 .......... .......... .......... .......... .......... .......... 318 CCTCCGCAGA TGAGTACTTC TATATTGGGC ATTGCACAAC CTAAATCTGA TGGTAGAGTA 8471 .......... .......... .......... .......... .......... .......... 318 CCTGAAAGTT TGTTATGTGC AATATCTAGG ACCAGCAGTG ATGACATGTT GAAAATATTT 8531 .......... .......... .......... .......... .......... .......... 318 GCAGGGACAG AACCACTAAA TCGACTATCT GATAATCCTA GTGCTTGTAG TTTCTTCAGA 8591 .......... .......... .......... .......... .......... .......... 318 TTACCGAGCT CCACTGGTAT CTCTCCTGTC ATTGATTTTA GGTAAATATA TGAGTTACCA 8651 .......... .......... .......... .......... .......... .......... 318 CTTTTGCAGA AGAACATTTT CGTTTTTCTA GTAAGAAATT GTTGTATTAC CTTCCAAATG 8711 .......... .......... .......... .......... .......... .......... 318 AAGACTTCCG ATATATAATA TTGTAAGAGC TGTTAAGTTG GCTAGCTCTC TTGGTACAGT 8771 .......... .......... .......... .......... .......... .......... 318 TCCAGTAAAC TCATTGTAAT TCAATGACAA GACTTGAAGT TTTCTGCATT TTTCTATGTT 8831 .......... .......... .......... .......... .......... .......... 318 TGGTGGAATA ACACCCCCGA GGTAGTTTTT TGAGAGGTAA AGCCCTTCCA AATTTGGAAG 8891 .......... .......... .......... .......... .......... .......... 
318 ATGGTCGCAT ATAGTTGTTG GAAGCTGATC AGTGAGATTG TTATAGGTAA GACCAATGTT 8951 .......... .......... .......... .......... .......... .......... 318 TTTCATAGTT GTAATGTTGA AGATTGATAC TGGTATAGAG CCACTAAGTT CATTAAATTG 9011 .......... .......... .......... .......... .......... .......... 318 CAGATCTAAA ATTGTCAGGT AGCAAAGATC ACCGAGTTCT CGAGGGATCT CTCCTTGTAG 9071 .......... .......... .......... .......... .......... .......... 318 AAAATTCTTT TGCATACTCA ACACTTCCAT CTTTGTTATG TTGGAAATGA AAGAAGGAAT 9131 .......... .......... .......... .......... .......... .......... 318 TTCTCCTGAA AATTGATTCA TCGAAAGGTA AGCAACCCGT AGATTCGGTA ACAAACTTAA 9191 .......... .......... .......... .......... .......... .......... 318 AGATGATAAA ATGGCTCCAG TGAAGTTATT ATTTGTGACA CTTATAAATT TCAGCCTCTG 9251 .......... .......... .......... .......... .......... .......... 318 CAAACGAGCC AGCTCTACTG GTAGATCTCC ATAGAGAGCA TTGTAACTTA TGTTGAGGGA 9311 .......... .......... .......... .......... .......... .......... 318 AACAAGAAAT GAGAGATTTC CAAGGCATGG AGAAATGGTA CCACGAAGTT GCATGCTAGA 9371 .......... .......... .......... .......... .......... .......... 318 TATGTTTAAA GCGGTGACTC GGTGGTGGCG GGAGCTACAA GTGATTCCAA TCCAGCTGCA 9431 .......... .......... .......... .......... .......... .......... 318 AACCGGGCTG GAAGAAAATT TCAGTGCAAG AAGAGCGGCT TCATCGGTGG AAATATTAGC 9491 .......... .......... .......... .......... .......... .......... 318 AAGTGAAGAA GTATGGCGAT AAAGCGTAAT GAAAACTGCA AGAGAAAACA GAAGATTGCA 9551 .......... .......... .......... .......... .......... .......... 318 AGTTCTTCCC ATAGCTATAA CAAGAGTACT TGATAGTAAT GATCTTAGAA GATATATTTA 9611 .......... .......... .......... .......... .......... .......... 318 AAGATTAATG TATACCTATA TACTGTTTTT AGATTGGAAT GACTTTTAGT CAATGGGGTC 9671 .......... .......... .......... .......... .......... .......... 318 AATTTAATTA GTTGGATAAA ACAACTAGTT AAACTTTGTA AAAAGTTTGG GTTGTTAAAT 9731 .......... .......... .......... .......... .......... .......... 
318 TGTTTTAACA TAGTTTTTTA TCCTTCACCT TTGAACTTTA TCTTGTCAAG ACTTGTCTTG 9791 .......... .......... .......... .......... .......... .......... 318 GCTTGACTAA CAATTAACAT GATCTTTTTC TCCACTTAAC TAACAAATAA AGTTGTTAAC 9851 .......... .......... .......... .......... .......... .......... 318 CTGTATTCAT CTTCAACTAG AGATTCTGAT TTCTGTGTTC GAATCCTAAA ATGAGTTCGC 9911 .......... .......... .......... .......... .......... .......... 318 TCACTCCCTG ATGCGCTCAA AGTTGTTGTA TCACCAGTAA CAGATGATGA ACTTATTGTT 9971 .......... .......... .......... .......... .......... .......... 318 AGGATTTTAA GAGAAATGGG ATTCTTCTTT GAGTTTTGAA GAGTTGTTTC ACAAGTTCAC 10031 .......... .......... .......... .......... .......... .......... 318 AGATCATGAG CTTTTTCTCA AGCACAAATC ATGGAATCAT CGACAAAAGT GATAAATAAC 10091 ||||||||| | .......... TTTTTTCTCA A......... .......... .......... .......... 329 CTATCGAAAC AACAGGACAT GGTCCACAAA TATCTGAATG AACTAACTCA AAATGTCGAT 10151 .......... .......... .......... .......... .......... .......... 329 TCAACTATAA TAAACTTTAT CGATCATTTC ATAACTCTGA GGTCTTGCAT TTGTGTACCC 10211 .......... .......... .......... .......... .......... .......... 329 TCTTCTTCCC TCAATTGATT CATAACTCTC CAAAGAGCAA CGCTTCCCTG AGTAGGCACA 10271 .......... .......... .......... .......... .......... .......... 329 CATTTCTAGT GGCCTTAGAT CATGTTCAAA ATTGATACGC TACTCTCTAT TCAAAAAGTG 10331 .......... .......... .......... .......... .......... .......... 329 TCTGTATATT ATATGCACAT AAAATACTAG TATTTCTTAA AGATAATGTG TTTTATGTAC 10391 .......... .......... .......... .......... .......... .......... 329 CCCTTTTTTC ATTTTAAGGG GGTAAGGTAA ATTTTTTTTT TTTTTTGTTT TATTCAGATT 10451 || .......... .......... .......... .......... .......... .......ATA 332 ATTCATATCT TTGGAATTTG ACAAAATGTA TTCAAATTTT CTCCCCAATA TTGGAGGCCA 10511 || CTTT...... .......... .......... .......... .......... .......... 336 TTTCTCTTTC CTAACTTACT CCCATTGTTC TAAAATATTT GTCACAATTT TTTTTTTGGA 10571 .......... .......... .......... .......... .......... 
.......... 336 GAATTAAACG ATATAAATTT TGATCGATAT TTTAAGATTA TAATTGATAT GAGAAAAAAA 10631 .......... .......... .......... .......... .......... .......... 336 TATAGTTTGT AGTATCCTAC GTGTAATATT GTGTTCATTT ATTCAATTTT ATACAAATTT 10691 .......... .......... .......... .......... .......... .......... 336 AAATGTTTGT TTGTATGTGT ACACCAAAAT TTGAAGGGCA TAAATTGACA GCTAAAGCAA 10751 .......... .......... .......... .......... .......... .......... 336 AGTTAAGAAC ACACTTATTC ACTTTCCTAT TGAATCATTG AAT-TCTCAT AATTTGTTCC 10810 | ||| | |||||||| | ||| |||| ||| | ||| ||||| | | .......... A-ACTAGGCT ATTTTCCTAT TAAATTATTG AATCACCCAT AATTTTGTTC 385 CTTTAAACAA ACACCGTTGT CTGATGTGGA ACCAGAT-TT ATGAGGGGTA TTGATAGAGG 10869 |||| || ||||||||| |||||||| | |||| || | ||||||| ||| |||| CTTT---TAA ACACCGTTGA TCGATGTGGA TC-AGATTTT GTTAGGGGTA TTGGTAGA-T 440 ATACATATGT TGTTCCACAA CTCATTAGTC CAATTGATAG TGAAAATGTT CGAGAATAAT 10929 || ||||||| ||||| | || |||||||| | |||||||||| ||||| |||| |||||||||| ATGCATATGT TGTTCTAGAA CTCATTAGAC CAATTGATAG TGAAATTGTT CGAGAATAAT 500 TGTGTTTTAC T-TAAGTCAT TTGAAAACAT TAGAAAACAA ATGAGACTAA C--ACGGTTT 10986 ||| ||||| | || |||| ||||| |||| |||||||||| |||||||||| | || ||| TGT-ATTTAC TCGAAATCAT TTGAACACAT TAGAAAACAA ATGAGACTAA CAAACAATTT 559 GTTTTCATTC CTTCAATCAA AATTATTACG ATTCGAACAC AATTAAAAGC ATTTACAATA 11046 || |||||| |||||||||| ||||||||| || | ||||| | || || | |||||||||| GTCTTCATTT CTTCAATCAA AATTATTACA ATACAAACAC AGTTGAATAC ATTTACAATA 619 GAAAAGCTGA TCAAAA 11062 | ||| | | |||||| GGAAAAATAA TCAAAA 635 hqPGS_C06HBa0054K13.1-3+_SGN-U340222- (10762 11062) ******************************************************************************** EST sequence 1 +strand 874 n (File: SGN-U315517+) 1 ATGAGATTTC TAAATTTGTT GAGTTTTTTC CCTCAGGTAT CTCAACTATG TCATTTTCTT 61 ATTGAATCAT TGAACCCCCA TAATCTGTTC CGTTTAAACA AACACTGTTG TCTGATGTGG 121 AACCATATTC GTGAGGGGTA TTGATAGAAG ATACATATGT TGTTCCTGAA CTCATTAGTC 181 CAATTGATAG TGAAAATGTT CGAGAATAAT TGTGTTTCAC TTAAGTGATT TGAAGACATT 241 AGAAAACAAA TGAGATTAAC 
ACGGTTTGTG TTCATTCGTT CAATCAAAAT TATTACAATT 301 CGAACACAAT TGGAGACATT TACAATAGAA AAGTCGATCA AAATCAACCA ACAGTATTTT 361 AAAGGAATAA ATTATGGGTG GTTTAATGAT GAGGAAAAAA ATCCAACTAA TTTAAGGATC 421 TGTTTATGTA TTTGACCTTA ATATATATAT ATATGTTATT AATGAGCGTA GGCATGGAGT 481 AATGGAAGAA TATATTCACC AAGACACCAA GTGTTGATGA AAGAGGATAA GCAAAGGTAT 541 ACATTGGCGT TATTCACATT CAATAAAGGA ATAACAGATA TACCTGAGGA ATTGGTGGAT 601 GAAACACATC CTTTACAATA CAAGCCTTTT GATAATTTTG ATTTAGTTAT GTACTACAGT 661 ACTGGTGCTA GCCCCATGGC TTATGGCACT GCTAAACCTT ACTGTGGTAT TAATGCTCAA 721 TAGATACCTA TTTTATTATA AATAATATGA GTTCAAACTA GTTTTACGAT GAACTCTATG 781 TGTAATTTCA ACTATCTATT TCAAATTCAA TATAATATAA GATTGTTACA TTTATGAAAA 841 AAAAAAAAAA AAAAAAAGAA AAAAAAAAAC TCGA Predicted gene structure (within gDNA segment 9166 to 11062): Exon 1 10774 11062 ( 289 n); cDNA 55 343 ( 289 n); score: 0.910 PPA cDNA 837 869 MATCH C06HBa0054K13.1-3+ SGN-U315517+ 0.910 289 0.331 C PGS_C06HBa0054K13.1-3+_SGN-U315517+ (10774 11062) Alignment (genomic DNA sequence = upper lines): TTTCCTATTG AATCATTGAA TTCTCATAAT TTGTTCCCTT TAAACAAACA CCGTTGTCTG 10833 |||| ||||| |||||||||| | |||||| |||||| || |||||||||| | |||||||| TTTCTTATTG AATCATTGAA CCCCCATAAT CTGTTCCGTT TAAACAAACA CTGTTGTCTG 114 ATGTGGAACC AGATTTATGA GGGGTATTGA TAGAGGATAC ATATGTTGTT CCACAACTCA 10893 |||||||||| | ||| ||| |||||||||| |||| ||||| |||||||||| || |||||| ATGTGGAACC ATATTCGTGA GGGGTATTGA TAGAAGATAC ATATGTTGTT CCTGAACTCA 174 TTAGTCCAAT TGATAGTGAA AATGTTCGAG AATAATTGTG TTTTACTTAA GTCATTTGAA 10953 |||||||||| |||||||||| |||||||||| |||||||||| ||| |||||| || ||||||| TTAGTCCAAT TGATAGTGAA AATGTTCGAG AATAATTGTG TTTCACTTAA GTGATTTGAA 234 AACATTAGAA AACAAATGAG ACTAACACGG TTTGTTTTCA TTCCTTCAAT CAAAATTATT 11013 ||||||||| |||||||||| | |||||||| ||||| |||| ||| |||||| |||||||||| GACATTAGAA AACAAATGAG ATTAACACGG TTTGTGTTCA TTCGTTCAAT CAAAATTATT 294 ACGATTCGAA CACAATTAAA AGCATTTACA ATAGAAAAGC TGATCAAAA 11062 || ||||||| ||||||| | |||||||| ||||||||| |||||||| ACAATTCGAA CACAATTGGA GACATTTACA ATAGAAAAGT 
CGATCAAAA 343 hqPGS_C06HBa0054K13.1-3+_SGN-U315517+ (10774 11062) Total number of EST alignments reported: 2 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 11062: PGL 1 (+ strand): 10762 11062 AGS-1 (10762 11062) SCR (e 0.910) Exon 1 10762 11062 ( 301 n); score: 0.910 PGS (10762 11062) SGN-U340222- PGS (10774 11062) SGN-U315517+ 3-phase translation of AGS-1 (+strand): . . . . . . 10762 ACACTTATTCACTTTCCTATTGAATCATTGAATTCTCATAATTTGTTCCCTTTAAACAAA T L I H F P I E S L N S H N L F P L N K H L F T F L L N H - I L I I C S L - T N T Y S L S Y - I I E F S - F V P F K Q . . . . . . 10822 CACCGTTGTCTGATGTGGAACCAGATTTATGAGGGGTATTGATAGAGGATACATATGTTG H R C L M W N Q I Y E G Y - - R I H M L T V V - C G T R F M R G I D R G Y I C C T P L S D V E P D L - G V L I E D T Y V . . . . . . 10882 TTCCACAACTCATTAGTCCAATTGATAGTGAAAATGTTCGAGAATAATTGTGTTTTACTT F H N S L V Q L I V K M F E N N C V L L S T T H - S N - - - K C S R I I V F Y L V P Q L I S P I D S E N V R E - L C F T . . . . . . 10942 AAGTCATTTGAAAACATTAGAAAACAAATGAGACTAACACGGTTTGTTTTCATTCCTTCA K S F E N I R K Q M R L T R F V F I P S S H L K T L E N K - D - H G L F S F L Q - V I - K H - K T N E T N T V C F H S F . . . . . . 11002 ATCAAAATTATTACGATTCGAACACAATTAAAAGCATTTACAATAGAAAAGCTGATCAAA I K I I T I R T Q L K A F T I E K L I K S K L L R F E H N - K H L Q - K S - S K N Q N Y Y D S N T I K S I Y N R K A D Q . 11062 A Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-3+_PGL-1_AGS-1_PPS_1 (10867 11061) (frame '1'; 195 bp, 65 residues) 1 RIHMLFHNSL VQLIVKMFEN NCVLLKSFEN IRKQMRLTRF VFIPSIKIIT IRTQLKAFTI 61 EKLIK 3-phase translation of AGS-1 (-strand): . . . . . . 11062 TTTTGATCAGCTTTTCTATTGTAAATGCTTTTAATTGTGTTCGAATCGTAATAATTTTGA F - S A F L L - M L L I V F E S - - F - F D Q L F Y C K C F - L C S N R N N F D L I S F S I V N A F N C V R I V I I L . . . . . . 
11002 TTGAAGGAATGAAAACAAACCGTGTTAGTCTCATTTGTTTTCTAATGTTTTCAAATGACT L K E - K Q T V L V S F V F - C F Q M T - R N E N K P C - S H L F S N V F K - L I E G M K T N R V S L I C F L M F S N D . . . . . . 10942 TAAGTAAAACACAATTATTCTCGAACATTTTCACTATCAATTGGACTAATGAGTTGTGGA - V K H N Y S R T F S L S I G L M S C G K - N T I I L E H F H Y Q L D - - V V E L S K T Q L F S N I F T I N W T N E L W . . . . . . 10882 ACAACATATGTATCCTCTATCAATACCCCTCATAAATCTGGTTCCACATCAGACAACGGT T T Y V S S I N T P H K S G S T S D N G Q H M Y P L S I P L I N L V P H Q T T V N N I C I L Y Q Y P S - I W F H I R Q R . . . . . . 10822 GTTTGTTTAAAGGGAACAAATTATGAGAATTCAATGATTCAATAGGAAAGTGAATAAGTG V C L K G T N Y E N S M I Q - E S E - V F V - R E Q I M R I Q - F N R K V N K C C L F K G N K L - E F N D S I G K - I S . 10762 T Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-3-_PGL-1_AGS-1_PPS_1 (11060 10848) (frame '0'; 210 bp, 70 residues) 1 LISFSIVNAF NCVRIVIILI EGMKTNRVSL ICFLMFSNDL SKTQLFSNIF TINWTNELWN 61 NICILYQYPS - ... finished at: Mon Aug 28 22:23:46 2006 ________________________________________________________________________________ Sequence 4: C06HBa0054K13.1-4, from 1 to 14213, both strands analyzed. ... started at: Mon Aug 28 22:23:46 2006 EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA +strand ... ... found all matches, elapsed seconds = 2 ... matches indexed, elapsed seconds = 2 HitsTableSize = 1 EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA -strand ... ... found all matches, elapsed seconds = 1 ... 
matches indexed, elapsed seconds = 1 HitsTableSize = 3 ******************************************************************************** EST sequence 3 +strand 1442 n (File: SGN-U318765+) 1 TACAAATGAA AATTGAGTTA CACATTTAAG GAGTCTCATG GGAAAAGAAG AAACAAACTA 61 TGTTGTTTGA ACTCTCCTAA AATATCCCTC GGTGTTTTTG AAGAGTCTGA ACAATATAGA 121 AAACAAGTGC ATGTTAACTA TCCCCTTATT AGTCTTTAGC CTTCTCTTTA CAATCTCTTT 181 TTTCACTATG TGTTTCCTTT AATGCTGCAG AATTACCCTT ATCTGAAGTT TGTTCACTTG 241 CACTAATAGG AGTGACTAAA ATTTCATTCT TGTCAGTTGA ACTGGAAGCA AAATTTGTAA 301 CATCACTTGT CTTTTCTTGT TTGTTGTTGT TGGTGGTGTT GGAAGACACC ATAGTGGTCT 361 TGTTTGGTGA TTCTACAGAG ACTTCTTTGA GGCTCATTTT AGACTCAACT AATGAAGTTC 421 TAGACATTCT TGGACTTGGA CTTTCACCTA TTTGTTCCTC CCTGTCAAAG TTCAACAGAG 481 GGCCTTGACC TGTTGAGGAC TTTGGTTTGA GTACGGACTT TGGTGATGAT GAACAACTCG 541 AGGCTGTAGG GACTTCTCGG ATTAAAGATT GTAAGCTGAT CAAGTCTTGA TTGACAGAGC 601 TCTCCGATGG AATTCCAGAA GTCGAGGAGC CTTCACTTGC TGAGAACTTT GGTTGAATCA 661 TAGTGTTGGG AGAGGGGGAT GAAGTAGAGG CCCTCGGAAC TTGTTCAACC ACCAAATCAT 721 GTGGTGATGA TGATGATGAT GCTTGTGAAC CATCAATACT CTGGCTAATT GTAGAAAAAA 781 CTTCTTCAAT AGGCTGTTTA CCACCATTAT CGTGAAGTGC TTCATGAATT TCATTCCTAG 841 GGAGAGACTT TGAATTTTCT GCAGACTTCT GTGTCTTCGG ATGTGGTACT GAACTTTGCA 901 CTGTCGGATT TCTTAACGTA AGTTTTGAGC TCCGAGTACT GGTTGGATCA CAACTTTCAG 961 CAGTTACTTC ATCTGTCAGC TTATTTTTTA GTTTGCGCCT ATCTACTTGT TGAGCCTCCA 1021 AATTTGAACT AGATCCTTTA TCAATGGAAA ATGTTGGTTG CGAAACAGAC TTAGACGATG 1081 ATGTACTAGA AGAGGCTTCA AATGAGTGAT GTCCTTGTCC AATAGTCGAA AATGCCACAG 1141 GTTCACTATG TTGTTCGCTG GCACTGTCTA GAACGAGTGC AGCGACATCG TTCTCTTTGG 1201 CTGAAATTGA ATTTTCTGTT TCTGAAGGTA ATTCAGAAAC TTGTAATGTT AAGTTGCTTA 1261 AACTAAGATC TGATTCCTGT GAAGCAATTG AATCTGATCT TTCACTAGAT ATGTTCACTG 1321 ACCTATCATT TTGTAGCTGA TCAATACTCG AAGAGGATCC ATGTTCAATC AAACAATTTG 1381 GTTGTAGAAC AGATGTCGGT GATACTAATG ACGTTAATGC GAAAGGGAGT TGTTGTGCTG 1441 GA Predicted gene structure (within gDNA segment 2849 to 1): Exon 1 2249 1848 ( 402 n); cDNA 1 402 ( 402 n); score: 1.000 
Intron 1 1847 1349 ( 499 n); Pd: 0.999 (s: 1.00), Pa: 0.909 (s: 1.00) Exon 2 1348 1133 ( 216 n); cDNA 403 618 ( 216 n); score: 1.000 Intron 2 1132 599 ( 534 n); Pd: 0.898 (s: 1.00), Pa: 0.983 (s: 1.00) Exon 3 598 1 ( 598 n); cDNA 619 1216 ( 598 n); score: 1.000 MATCH C06HBa0054K13.1-4- SGN-U318765+ 1.000 1216 0.843 C PGS_C06HBa0054K13.1-4-_SGN-U318765+ (2249 1848,1348 1133,598 1) Alignment (genomic DNA sequence = upper lines): TACAAATGAA AATTGAGTTA CACATTTAAG GAGTCTCATG GGAAAAGAAG AAACAAACTA 2190 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TACAAATGAA AATTGAGTTA CACATTTAAG GAGTCTCATG GGAAAAGAAG AAACAAACTA 60 TGTTGTTTGA ACTCTCCTAA AATATCCCTC GGTGTTTTTG AAGAGTCTGA ACAATATAGA 2130 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGTTGTTTGA ACTCTCCTAA AATATCCCTC GGTGTTTTTG AAGAGTCTGA ACAATATAGA 120 AAACAAGTGC ATGTTAACTA TCCCCTTATT AGTCTTTAGC CTTCTCTTTA CAATCTCTTT 2070 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAACAAGTGC ATGTTAACTA TCCCCTTATT AGTCTTTAGC CTTCTCTTTA CAATCTCTTT 180 TTTCACTATG TGTTTCCTTT AATGCTGCAG AATTACCCTT ATCTGAAGTT TGTTCACTTG 2010 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTCACTATG TGTTTCCTTT AATGCTGCAG AATTACCCTT ATCTGAAGTT TGTTCACTTG 240 CACTAATAGG AGTGACTAAA ATTTCATTCT TGTCAGTTGA ACTGGAAGCA AAATTTGTAA 1950 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CACTAATAGG AGTGACTAAA ATTTCATTCT TGTCAGTTGA ACTGGAAGCA AAATTTGTAA 300 CATCACTTGT CTTTTCTTGT TTGTTGTTGT TGGTGGTGTT GGAAGACACC ATAGTGGTCT 1890 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CATCACTTGT CTTTTCTTGT TTGTTGTTGT TGGTGGTGTT GGAAGACACC ATAGTGGTCT 360 TGTTTGGTGA TTCTACAGAG ACTTCTTTGA GGCTCATTTT AGGTAAGTCC TCCTTCTTGA 1830 |||||||||| |||||||||| |||||||||| |||||||||| || TGTTTGGTGA TTCTACAGAG ACTTCTTTGA GGCTCATTTT AG........ .......... 402 CATCATTGTG ATTAGTGGCT TCTTCATGAA ACTTCTCTTC ATCATTGGAT GGTTTCTGTA 1770 .......... .......... .......... .......... .......... .......... 
402 TAACGTTGCA ATCAAAGACA AAGTATGAAT TTCGTTAAGG ATGTTCTAGA TTCAACATAC 1710 .......... .......... .......... .......... .......... .......... 402 ACGTATAGTT TATTATATAT GCGGTTTATT TTTCCTGGCA AAGGATGTTC AACTAACCAC 1650 .......... .......... .......... .......... .......... .......... 402 CGTTGAGCCT ATGTGACTCC GTATTACATT ATTTTCACAA GCACAAAAAT ATGATATAAT 1590 .......... .......... .......... .......... .......... .......... 402 GAATGTAACA AGAAAAGACC ATCTATTTAT GCTGTTTCAA AGAGAGGGTA AGTTTTATGT 1530 .......... .......... .......... .......... .......... .......... 402 ACTGATAAGT GCGAAAATAG GAGAACCTAT TCTCTAAAGG GTGGTTTGGT TTATCATACT 1470 .......... .......... .......... .......... .......... .......... 402 AACAAAAATA ATCCTACCAT ACTGTCAGTA TATATAACTT AAGTAATTGA AGGTATCATT 1410 .......... .......... .......... .......... .......... .......... 402 TGATCCAGTC TATATAAACG TACCGTTTTA GCATCTGTAT GATGAAGTTG GTCTGATGCA 1350 .......... .......... .......... .......... .......... .......... 402 GACTCAACTA ATGAAGTTCT AGACATTCTT GGACTTGGAC TTTCACCTAT TTGTTCCTCC 1290 ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| .ACTCAACTA ATGAAGTTCT AGACATTCTT GGACTTGGAC TTTCACCTAT TTGTTCCTCC 461 CTGTCAAAGT TCAACAGAGG GCCTTGACCT GTTGAGGACT TTGGTTTGAG TACGGACTTT 1230 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTGTCAAAGT TCAACAGAGG GCCTTGACCT GTTGAGGACT TTGGTTTGAG TACGGACTTT 521 GGTGATGATG AACAACTCGA GGCTGTAGGG ACTTCTCGGA TTAAAGATTG TAAGCTGATC 1170 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGTGATGATG AACAACTCGA GGCTGTAGGG ACTTCTCGGA TTAAAGATTG TAAGCTGATC 581 AAGTCTTGAT TGACAGAGCT CTCCGATGGA ATTCCAGGTA TCGGATCCTA AGTTACATTA 1110 |||||||||| |||||||||| |||||||||| ||||||| AAGTCTTGAT TGACAGAGCT CTCCGATGGA ATTCCAG... .......... .......... 618 AAAAGAAAAA ACAATACTTG AAATTTCAAA CACTTCTATA AGTGAGAAAC TGGAAAGAAA 1050 .......... .......... .......... .......... .......... .......... 
618 ATGTTGTTTC TATTAGGTTT TGTTATGCAG AACTTGAAAA TGCAGGTGCA ATTACATTTT 990 .......... .......... .......... .......... .......... .......... 618 CCTGTCTGCC AAAAACACTG CTTTACTGAC ACATTATAAT TTTATTACAA ACTTATGATA 930 .......... .......... .......... .......... .......... .......... 618 CACATTCATT ATTTTTAACT ATACACGATT ATACATCTTT TCTGCCTTTA ACAAGTCGAT 870 .......... .......... .......... .......... .......... .......... 618 ACACAACTAA ACTTCACAAA AAGTATTGTT TGCATGCTGG ATTTCAACTT ATATCAATTA 810 .......... .......... .......... .......... .......... .......... 618 ATTTAACATG GAGTTGATTT CTTATATGGT CACTTGGATC GTAGTTATTA TCTCATAAAG 750 .......... .......... .......... .......... .......... .......... 618 TCACTTTCTT TTTGTTGAAG AAAGGTTAAT TTCTAAGATG ATAACTATTA GTTGAGTAAC 690 .......... .......... .......... .......... .......... .......... 618 CATCAGAGAA ATCAACTCGT TTAACTTGCA TAATCTAATC ATTGTCAAAT TCATTAAAAT 630 .......... .......... .......... .......... .......... .......... 618 CACCAAACCT GCATTCTTTT CTGTGTTTCA GAAGTCGAGG AGCCTTCACT TGCTGAGAAC 570 ||||||||| |||||||||| |||||||||| .......... .......... .......... 
.AAGTCGAGG AGCCTTCACT TGCTGAGAAC 647 TTTGGTTGAA TCATAGTGTT GGGAGAGGGG GATGAAGTAG AGGCCCTCGG AACTTGTTCA 510 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTGGTTGAA TCATAGTGTT GGGAGAGGGG GATGAAGTAG AGGCCCTCGG AACTTGTTCA 707 ACCACCAAAT CATGTGGTGA TGATGATGAT GATGCTTGTG AACCATCAAT ACTCTGGCTA 450 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACCACCAAAT CATGTGGTGA TGATGATGAT GATGCTTGTG AACCATCAAT ACTCTGGCTA 767 ATTGTAGAAA AAACTTCTTC AATAGGCTGT TTACCACCAT TATCGTGAAG TGCTTCATGA 390 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTGTAGAAA AAACTTCTTC AATAGGCTGT TTACCACCAT TATCGTGAAG TGCTTCATGA 827 ATTTCATTCC TAGGGAGAGA CTTTGAATTT TCTGCAGACT TCTGTGTCTT CGGATGTGGT 330 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTTCATTCC TAGGGAGAGA CTTTGAATTT TCTGCAGACT TCTGTGTCTT CGGATGTGGT 887 ACTGAACTTT GCACTGTCGG ATTTCTTAAC GTAAGTTTTG AGCTCCGAGT ACTGGTTGGA 270 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACTGAACTTT GCACTGTCGG ATTTCTTAAC GTAAGTTTTG AGCTCCGAGT ACTGGTTGGA 947 TCACAACTTT CAGCAGTTAC TTCATCTGTC AGCTTATTTT TTAGTTTGCG CCTATCTACT 210 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCACAACTTT CAGCAGTTAC TTCATCTGTC AGCTTATTTT TTAGTTTGCG CCTATCTACT 1007 TGTTGAGCCT CCAAATTTGA ACTAGATCCT TTATCAATGG AAAATGTTGG TTGCGAAACA 150 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGTTGAGCCT CCAAATTTGA ACTAGATCCT TTATCAATGG AAAATGTTGG TTGCGAAACA 1067 GACTTAGACG ATGATGTACT AGAAGAGGCT TCAAATGAGT GATGTCCTTG TCCAATAGTC 90 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GACTTAGACG ATGATGTACT AGAAGAGGCT TCAAATGAGT GATGTCCTTG TCCAATAGTC 1127 GAAAATGCCA CAGGTTCACT ATGTTGTTCG CTGGCACTGT CTAGAACGAG TGCAGCGACA 30 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAAAATGCCA CAGGTTCACT ATGTTGTTCG CTGGCACTGT CTAGAACGAG TGCAGCGACA 1187 TCGTTCTCTT TGGCTGAAAT TGAATTTTC 1 |||||||||| |||||||||| ||||||||| TCGTTCTCTT TGGCTGAAAT TGAATTTTC 1216 
hqPGS_C06HBa0054K13.1-4-_SGN-U318765+ (2249 1848,1348 1133,598 1) ******************************************************************************** EST sequence 4 +strand 1165 n (File: SGN-U341543+) 1 GAATATGTCA TACGACTCAC TATAGGGCGA ATTGGGTACC TGGGCCCCCC CTCGAGACTG 61 ACTCCGTATT ACATTATTTT CACAAGCACA AAAATATGAT ATAATGAATG TAACAAGAAA 121 AGACCATCTA TTTATGCTGT TTCAAAGAGA GGGTAAGTTT TATGTACTGA TAAGTGCGAA 181 AATAGGAGAA CCTATTCTCT AAAGGGTGGT TTGGTTTATC ATACTAACAA AAATAATCCT 241 ACCATACTGT CAGTATATAT AACTTAAGTA ATTGAAGGTA TCATTTGATC CAGTCTATAT 301 AAACGTACCG TTTTAGCATC TGTATGATGA AGTTGGTCTG ATGCAGACTC AACTAATGAA 361 GTTCTAGACA TTTCTGGACT TGGGACTTTC ACTATTTGTT CTTCCTTGTC AAAGTTCACA 421 AAGGGGCTTG ACTTGTTGAG GACTTGGTTT GAATACGGGC TTTGGGGATA ATAACAACTC 481 AGGTTTAAGG GATTTTTGGG ATAAAAATTG AAACCTGAAA AATCTTGATT GAAAAATTTT 541 CCAGGGAATT TCAAAGATGG GGGACCTTCT TTTTTAAAAT TTTGTTAAAT TTTTTGGTGG 601 GAGAGGGGAG ATAAAAAAGG CCCTCAAAAT TTTCACACCC CAAATTTTGG GGGATAAATA 661 ATAATTTTTG AACTCCAAAT ATTTCTTTTT TTAAGAAAAC TTTTTAATGG GTGTCTCCCC 721 CTCTTTGGGA GGGTGTTATT TTTCTTTGGA GAGAAATTTT TATTCTCCAC TTTTTTGTTG 781 TGGGGGGGCG CCTTCTCTCC GCGGGGTTTT AAAAGGAGGG GCCCCGCGCG CGGGGGGAGC 841 ACCCTCTCCC CTTTTCTTGT TTTTTTTTTT TTGCCCCCTT TTTTCGGGCC CCGCAATAAC 901 ACCTTCTAAA GAGGGGGGGC ACAACAAAAT AACAATAAAA AGAGCGGAGA GGTGGTTTTA 961 CAAAACAACC CCCTTTTTGT TTGTGCCGTA AAAAGCACAC CTTTTTTTTT TTTTTTTTTT 1021 TTGTAAAAAA ATATTTATAA AAAAATACAG AAAACTATTC TTCCCCCCCG GCGTCCTTCG 1081 CCGCGGGGGT TTTTTTTTTT TTTGTTTGTG TGTTTGGTTT GTGACTCCCA CACCACAGAG 1141 GGTGGGCGTG TTCATTAAGT CCCCT Predicted gene structure (within gDNA segment 2816 to 1): Exon 1 1636 1133 ( 504 n); cDNA 59 553 ( 495 n); score: 0.920 Intron 1 1132 599 ( 534 n); Pd: 0.898 (s: 0.72), Pa: 0.983 (s: 0.62) Exon 2 598 473 ( 126 n); cDNA 554 669 ( 116 n); score: 0.655 MATCH C06HBa0054K13.1-4- SGN-U341543+ 0.867 630 0.541 C PGS_C06HBa0054K13.1-4-_SGN-U341543+ (1636 1133,598 473) Alignment (genomic DNA sequence = upper lines): TGACTCCGTA TTACATTATT 
TTCACAAGCA CAAAAATATG ATATAATGAA TGTAACAAGA 1577 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGACTCCGTA TTACATTATT TTCACAAGCA CAAAAATATG ATATAATGAA TGTAACAAGA 118 AAAGACCATC TATTTATGCT GTTTCAAAGA GAGGGTAAGT TTTATGTACT GATAAGTGCG 1517 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAGACCATC TATTTATGCT GTTTCAAAGA GAGGGTAAGT TTTATGTACT GATAAGTGCG 178 AAAATAGGAG AACCTATTCT CTAAAGGGTG GTTTGGTTTA TCATACTAAC AAAAATAATC 1457 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAATAGGAG AACCTATTCT CTAAAGGGTG GTTTGGTTTA TCATACTAAC AAAAATAATC 238 CTACCATACT GTCAGTATAT ATAACTTAAG TAATTGAAGG TATCATTTGA TCCAGTCTAT 1397 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTACCATACT GTCAGTATAT ATAACTTAAG TAATTGAAGG TATCATTTGA TCCAGTCTAT 298 ATAAACGTAC CGTTTTAGCA TCTGTATGAT GAAGTTGGTC TGATGCAGAC TCAACTAATG 1337 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATAAACGTAC CGTTTTAGCA TCTGTATGAT GAAGTTGGTC TGATGCAGAC TCAACTAATG 358 AAGTTCTAGA CATTCTTGGA CTT-GGACTT TCACCTATTT GTTCCTCCCT GTCAAAGTTC 1278 |||||||||| |||| |||| ||| |||||| ||| |||||| |||| ||| | |||||||||| AAGTTCTAGA CATTTCTGGA CTTGGGACTT TCA-CTATTT GTTCTTCCTT GTCAAAGTTC 417 AACAGAGGGC CTTGACCTGT TGAGGACTTT GGTTTGAGTA CGGACTTTGG TGATGATGAA 1218 ||| |||| |||||| ||| ||||||| || ||||||| || ||| |||||| ||| || || -ACAAAGGGG CTTGACTTGT TGAGGAC-TT GGTTTGAATA CGGGCTTTGG GGATAAT-AA 474 CAACTCGAGG CTGTAGGGAC TTCTCGGATT AAAGATTGTA AGCTGATCAA GTCTTGATTG 1158 |||||| ||| | ||||| || | ||| | ||| |||| | | |||| || ||||||||| CAACTC-AGG TTTAAGGGAT TTTTGGGA-T AAAAATTGAA ACCTGA-AAA ATCTTGATTG 531 ACAGAGCTCT CCGATGGAAT TCCAGGTATC GGATCCTAAG TTACATTAAA AAGAAAAAAC 1098 | | | | | || | ||||| | || A-AAAATTTT CC-AGGGAAT TTCA-..... .......... .......... .......... 553 AATACTTGAA ATTTCAAACA CTTCTATAAG TGAGAAACTG GAAAGAAAAT GTTGTTTCTA 1038 .......... .......... .......... .......... .......... .......... 
553 TTAGGTTTTG TTATGCAGAA CTTGAAAATG CAGGTGCAAT TACATTTTCC TGTCTGCCAA 978 .......... .......... .......... .......... .......... .......... 553 AAACACTGCT TTACTGACAC ATTATAATTT TATTACAAAC TTATGATACA CATTCATTAT 918 .......... .......... .......... .......... .......... .......... 553 TTTTAACTAT ACACGATTAT ACATCTTTTC TGCCTTTAAC AAGTCGATAC ACAACTAAAC 858 .......... .......... .......... .......... .......... .......... 553 TTCACAAAAA GTATTGTTTG CATGCTGGAT TTCAACTTAT ATCAATTAAT TTAACATGGA 798 .......... .......... .......... .......... .......... .......... 553 GTTGATTTCT TATATGGTCA CTTGGATCGT AGTTATTATC TCATAAAGTC ACTTTCTTTT 738 .......... .......... .......... .......... .......... .......... 553 TGTTGAAGAA AGGTTAATTT CTAAGATGAT AACTATTAGT TGAGTAACCA TCAGAGAAAT 678 .......... .......... .......... .......... .......... .......... 553 CAACTCGTTT AACTTGCATA ATCTAATCAT TGTCAAATTC ATTAAAATCA CCAAACCTGC 618 .......... .......... .......... .......... .......... .......... 553 ATTCTTTTCT GTGTTTCAGA AGTCGAGGAG CCTTCACTTG CTGAGAACTT TGGTTGAATC 558 | || | || |||| ||| | | || || | ||| ||| .......... 
.........A AGATGGGGGA CCTT--CTTT TTTA-AAATT TTGTTAAAT- 590 ATAGTGTTGG GAGAGGGG-G ATGAAGTAGA GGCCCTCGGA ACTTGTTCAA CCACCAAATC 499 | || ||| |||||||| | || || | | ||||||| | | || |||| | |||||| TTTTTGGTGG GAGAGGGGAG AT-AA-AAAA GGCCCTC-AA AATT-TTCAC ACCCCAAATT 646 ATGTGGTGAT GATGATGATG ATGCTT 473 || || ||| | || || || || TTG-GGGGAT -A-AATAATA ATTTTT 669 hqPGS_C06HBa0054K13.1-4-_SGN-U341543+ (1636 1133) ******************************************************************************** EST sequence 2 +strand 825 n (File: SGN-U336622+) 1 GCTCCCTCGA ATTACCCTCA CTAAAGGGAC AAAAGCTGGA GCTCCACCGC GGTGGCGGCC 61 GCTCTAGAAC TAGTGGATCC CCCGGGCTGC AGGTTACAAA TGAAAATTGA GTTACACATT 121 TAAGGAGTCT CATGGGAAAA GAAGAAACAA ACTATGTTGT TTGAACTCTC CTAAAATATC 181 CCTCGGTGTT TTTGAAGAGT CTGAACAATA TAGAAAACAA GTGCATGTTA ACTATCCCCT 241 TATTAGTCTT TAGCCTTCTC TTTACAATCT CTTTTTTCAC TATGTGTTTC CTTTAATGCT 301 GCAGAATTAC CCTTATCTGA AGTTTGTTCA CTTGCACTAA TAGGAGTGAC TAAAATTTCA 361 TTCTTGTCAG TTGAACTGGA AGCAAAATTT GTAACATCAC TTGTCTTTTC TTGTTTGTTG 421 TTGTTGGTGG TGTTGGAAGA CACCATAGTG GTCTTGTTTG GTGATTCTAC AGAGACTTCT 481 TTGAGGCTCA TTTTAGACTC AACTAATGGG GGGGGGGGGG GGGCTTGAAC TTGAACTTTC 541 ACCTATTTGT TCCTCCCTGT CAAAGTTCAA CAGAAGGCCT TGACCTGTTG AGGACTTTTG 601 TTTTGAGTTA CGAAACTTGC TGAAGAATAA CCGACCGAGA GTTGTAAGGA CTTTAAAAAT 661 AACGAATGCA AGCCAGAATA ATCATGATTA ATTATAGCCA CCTCAACAAT CATAGGGAAG 721 ATGATAACAG ATAAGATGGG ACGTTAGGTA TGGAATCGAT TGTCGACGGC GGCACTTCGA 781 ACTTGCAGCG ATATGCGAGT GTGGTATGCT GTATGTGAGG AAGAA Predicted gene structure (within gDNA segment 3780 to 1): Exon 1 2250 1848 ( 403 n); cDNA 94 496 ( 403 n); score: 1.000 Intron 1 1847 1349 ( 499 n); Pd: 0.999 (s: 1.00), Pa: 0.909 (s: 0.66) Exon 2 1348 1133 ( 216 n); cDNA 497 715 ( 219 n); score: 0.676 MATCH C06HBa0054K13.1-4- SGN-U336622+ 0.887 619 0.750 C PGS_C06HBa0054K13.1-4-_SGN-U336622+ (2250 1848,1348 1133) Alignment (genomic DNA sequence = upper lines): TTACAAATGA AAATTGAGTT ACACATTTAA GGAGTCTCAT GGGAAAAGAA GAAACAAACT 2191 |||||||||| |||||||||| 
|||||||||| |||||||||| |||||||||| |||||||||| TTACAAATGA AAATTGAGTT ACACATTTAA GGAGTCTCAT GGGAAAAGAA GAAACAAACT 153 ATGTTGTTTG AACTCTCCTA AAATATCCCT CGGTGTTTTT GAAGAGTCTG AACAATATAG 2131 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATGTTGTTTG AACTCTCCTA AAATATCCCT CGGTGTTTTT GAAGAGTCTG AACAATATAG 213 AAAACAAGTG CATGTTAACT ATCCCCTTAT TAGTCTTTAG CCTTCTCTTT ACAATCTCTT 2071 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAACAAGTG CATGTTAACT ATCCCCTTAT TAGTCTTTAG CCTTCTCTTT ACAATCTCTT 273 TTTTCACTAT GTGTTTCCTT TAATGCTGCA GAATTACCCT TATCTGAAGT TTGTTCACTT 2011 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTTCACTAT GTGTTTCCTT TAATGCTGCA GAATTACCCT TATCTGAAGT TTGTTCACTT 333 GCACTAATAG GAGTGACTAA AATTTCATTC TTGTCAGTTG AACTGGAAGC AAAATTTGTA 1951 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCACTAATAG GAGTGACTAA AATTTCATTC TTGTCAGTTG AACTGGAAGC AAAATTTGTA 393 ACATCACTTG TCTTTTCTTG TTTGTTGTTG TTGGTGGTGT TGGAAGACAC CATAGTGGTC 1891 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACATCACTTG TCTTTTCTTG TTTGTTGTTG TTGGTGGTGT TGGAAGACAC CATAGTGGTC 453 TTGTTTGGTG ATTCTACAGA GACTTCTTTG AGGCTCATTT TAGGTAAGTC CTCCTTCTTG 1831 |||||||||| |||||||||| |||||||||| |||||||||| ||| TTGTTTGGTG ATTCTACAGA GACTTCTTTG AGGCTCATTT TAG....... .......... 496 ACATCATTGT GATTAGTGGC TTCTTCATGA AACTTCTCTT CATCATTGGA TGGTTTCTGT 1771 .......... .......... .......... .......... .......... .......... 496 ATAACGTTGC AATCAAAGAC AAAGTATGAA TTTCGTTAAG GATGTTCTAG ATTCAACATA 1711 .......... .......... .......... .......... .......... .......... 496 CACGTATAGT TTATTATATA TGCGGTTTAT TTTTCCTGGC AAAGGATGTT CAACTAACCA 1651 .......... .......... .......... .......... .......... .......... 496 CCGTTGAGCC TATGTGACTC CGTATTACAT TATTTTCACA AGCACAAAAA TATGATATAA 1591 .......... .......... .......... .......... .......... .......... 496 TGAATGTAAC AAGAAAAGAC CATCTATTTA TGCTGTTTCA AAGAGAGGGT AAGTTTTATG 1531 .......... .......... 
.......... .......... .......... .......... 496 TACTGATAAG TGCGAAAATA GGAGAACCTA TTCTCTAAAG GGTGGTTTGG TTTATCATAC 1471 .......... .......... .......... .......... .......... .......... 496 TAACAAAAAT AATCCTACCA TACTGTCAGT ATATATAACT TAAGTAATTG AAGGTATCAT 1411 .......... .......... .......... .......... .......... .......... 496 TTGATCCAGT CTATATAAAC GTACCGTTTT AGCATCTGTA TGATGAAGTT GGTCTGATGC 1351 .......... .......... .......... .......... .......... .......... 496 AGACTCAACT AAT-GAAGTT CTAGACATTC TTGGACTTGG ACTTTCACCT ATTTGTTCCT 1292 |||||||| ||| | | | | ||| ||||| |||||||||| |||||||||| ..ACTCAACT AATGGGGGGG GGGGGGGGGC TTGAACTTGA ACTTTCACCT ATTTGTTCCT 554 CCCTGTCAAA GTTCAACAGA GGGCCTTGAC CTGTTGAGGA C-TTTGGTTT GAG-TACGGA 1234 |||||||||| |||||||||| ||||||||| |||||||||| | |||| ||| ||| |||| | CCCTGTCAAA GTTCAACAGA AGGCCTTGAC CTGTTGAGGA CTTTTGTTTT GAGTTACGAA 614 CTTTGGTGAT GATGAACAAC TCGAG-GCTG TAGGGACTTC TCGGATTAAA GATTGTAAG- 1176 ||| ||| || ||| |||| | || || |||||| | | ||| || || ||| ACTTGCTGAA GAATAACCGA CCGAGAGTTG TAAGGACTT- TAAAAATAAC GAATGCAAGC 673 CTGATCAAGT CTTGATTGA- CAGAGCTCTC CGATGGAATT CCAG 1133 | || || | | ||||| | | ||| | | | ||| || CAGAATAA-T CATGATTAAT TATAGC-CAC CTCAACAATC ATAG 715 hqPGS_C06HBa0054K13.1-4-_SGN-U336622+ (2250 1848) ******************************************************************************** EST sequence 1 +strand 2193 n (File: SGN-U316037+) 1 CTATTCCACT TATCACCATT TTCTTCATTT CTCAATAATC CCACTTTTAA GTTCTTTCCC 61 AGTTACTATT AACACAAATA TTTAAGAAAA ATCTCAACTT TTTTGTTGAA TACTTGTAAT 121 AAGCAGAAAT GGATTTTGTG TACAAGAACC CATCAGCTCT TATTGAGGAA AGAGTAAAAG 181 ATTTGTTGTC TCGGATGACA CTTGAAGAAA AAATAGGCCA AATGACTCAG ATCGAACGCA 241 GTGTTGCTAC CCCCTCTGTC ATTACTGACC TTTCTATAGG GAGTATACTC AGTGTTGGAG 301 GCAGTGCGCC ATTTGAGGAT GCTCCATCGG AAGCTTGGGC AGATATGGTT GACGGATTTC 361 AAAAGGCTGC GCTGGAATCA CGGCTTGGGA TTCCGCTTCT TTATGGAGTT GACGCTATTC 421 ATGGCAATAA CAATGTTTAT GGTGCTACCG TTTTTCCACA AAATGTGGGC CTTGGAGCCA 481 CCAGAGATGC AGACTTGGTT CAGAAGATTG GGATTGTGAC 
TGCTCTTGAA GTCAGGGCTT 541 GTGGCATTAA CTATACTTTT GCTCCCTGTG TTGCTGTATG TAGAGATCCC AGGTGGGGAA 601 GATGCTATGA GAGTTATGGC GAAGACACCG AACTTATTAG GAAGATGACC TCAATTGTCA 661 CAGGCTTGCA AGGGCAACCA CCTCCTGGAT ACCCCCAAAA CTATCCTTTT CTAGCTGGAA 721 GAGACAAGGT TGTTGCCTGT GCAAAGCACT TTGTTGGAGA TGGGGGTACT GACCGAGGTA 781 TAAATGAGGG AAATACCATA TCATCGTATG AAGATCTAGA GAGAATACAT ATTCCCCCAT 841 ATATTGACTG TATTTCTCAG GGAGTTTGCA CAGTAATGGC ATCCTACTCT AAATGGAATG 901 GAAGCCACCT GCATTCTAGC CACTTTCTTC TTACTGAAGT TTTGAAAGGG AAGCTCGGAT 961 TTAAGGGCTT TGTTATTTCT GATTCCGAAG GAATTGACCG ATTTTTCCAT CCTCATGGAT 1021 CTAACTATGA CCAAAGTATT TTGGCAGCAA TCAATGCAGG GATTGACATG GTGATGGTTC 1081 CTTTTCGGTA TCAATTATTT CTCGATCATT TGAAATATCT TGTGGAATCT GGGAATATTC 1141 CAATGACCAG AATTGATGAT GCTGTTGAAA GGATCCTGAG AGTTAAGTTT GTTTCCGGAG 1201 CTTTTGAGAA CCCTCTGAGT GATAGGTCAT TGTTGGATAC CGTTGGTTGT CATCAACATC 1261 GCGAATTAGC ACGTGAAGCA GTTCGCAAAT CACTGGTTCT TCTAAAGAAT GGGAAGGATG 1321 TAACAAAACC ATTTCTTCCG CTAGATAGGA AGGCAAAGAG AATTCTTGTA GCAGGAAAAC 1381 ATGCTGACGA CCTTGGATTC CAATGTGGAG GGTGGACTAA AACATGGGAA GGAATGGGCG 1441 GAAGAATCAC GATTGGAACA ACTATTCTGG AAGCTATTAA AGATGCTGTT GGAGGGGAAA 1501 CAGAACTGGT ATATGAAGAA AATCCTTCAC CAGACACCTT TGCGAGTCAA GACTTCTCTT 1561 ATTGCATTGT AGTTGTTGGT GAACCTCCCT ATTGTGAAAG CGGTGGAGAC AGCCAAGACC 1621 TCAGAATTCC TCTTGGCGGA GAAGAACTAA TAAACTTGGT TGCAGACAGA GTTCCAACGT 1681 TGGTGATATT GATCTCCGGA AGGCCTTTAC ATATAGAGCC TTCGATTCTG GAGAAAATGG 1741 ATGCCTTCGT TGCTGCATGG TTACCGGGCA CTGAGGGAAC TGGTATCACT GATGTCATAT 1801 TCGGAGATTT TGAATTCCAT GGAACCCTCC CTATGACATG GTTTAAGAGT GTAGATCAAT 1861 TACCCCTGCA TCAAGAACAG AACTCCTATG AACCTCTCTT TCCATTCGGC TACGGATTAA 1921 CAAGTAAAAA CAAGGTGATC TAGACGAGAT GTGTCGAAGA AAGGTTTAGT GAAATAGACA 1981 TGTCTATAAA GATGCGGACT ATCAGAAAGA CTGTATGAAG CTTTATGCAA CCTCTTGTGT 2041 ACTTTGTACC ATTTGGTCTC TGGCATATGA TGTTACATTA AGTCCAAATA ATTATATATT 2101 ATGTACTATT ATTACCAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAATTT CATTTAATAA 2161 CGATATAAAA TATTCAACTT ATTTTAAAAA AAA Predicted gene structure 
(within gDNA segment 3754 to 12233): Exon 1 4354 4632 ( 279 n); cDNA 1 279 ( 279 n); score: 1.000 Intron 1 4633 6083 (1451 n); Pd: 1.000 (s: 1.00), Pa: 0.998 (s: 1.00) Exon 2 6084 6288 ( 205 n); cDNA 280 484 ( 205 n); score: 1.000 Intron 2 6289 6381 ( 93 n); Pd: 0.992 (s: 1.00), Pa: 0.884 (s: 1.00) Exon 3 6382 6472 ( 91 n); cDNA 485 575 ( 91 n); score: 1.000 Intron 3 6473 7169 ( 697 n); Pd: 0.977 (s: 1.00), Pa: 1.000 (s: 1.00) Exon 4 7170 7315 ( 146 n); cDNA 576 721 ( 146 n); score: 1.000 Intron 4 7316 7657 ( 342 n); Pd: 0.974 (s: 1.00), Pa: 0.953 (s: 1.00) Exon 5 7658 7901 ( 244 n); cDNA 722 965 ( 244 n); score: 1.000 Intron 5 7902 8692 ( 791 n); Pd: 0.998 (s: 1.00), Pa: 0.587 (s: 1.00) Exon 6 8693 8797 ( 105 n); cDNA 966 1070 ( 105 n); score: 1.000 Intron 6 8798 8876 ( 79 n); Pd: 0.998 (s: 1.00), Pa: 0.990 (s: 1.00) Exon 7 8877 9059 ( 183 n); cDNA 1071 1253 ( 183 n); score: 1.000 Intron 7 9060 9794 ( 735 n); Pd: 0.921 (s: 1.00), Pa: 0.825 (s: 1.00) Exon 8 9795 9996 ( 202 n); cDNA 1254 1455 ( 202 n); score: 1.000 Intron 8 9997 10824 ( 828 n); Pd: 1.000 (s: 1.00), Pa: 0.999 (s: 0.98) Exon 9 10825 11553 ( 729 n); cDNA 1456 2186 ( 731 n); score: 0.970 MATCH C06HBa0054K13.1-4+ SGN-U316037+ 0.990 2184 0.996 C PGS_C06HBa0054K13.1-4+_SGN-U316037+ (4354 4632,6084 6288,6382 6472,7170 7315,7658 7901,8693 8797,8877 9059,9795 9996,10825 11553) Alignment (genomic DNA sequence = upper lines): CTATTCCACT TATCACCATT TTCTTCATTT CTCAATAATC CCACTTTTAA GTTCTTTCCC 4413 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTATTCCACT TATCACCATT TTCTTCATTT CTCAATAATC CCACTTTTAA GTTCTTTCCC 60 AGTTACTATT AACACAAATA TTTAAGAAAA ATCTCAACTT TTTTGTTGAA TACTTGTAAT 4473 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGTTACTATT AACACAAATA TTTAAGAAAA ATCTCAACTT TTTTGTTGAA TACTTGTAAT 120 AAGCAGAAAT GGATTTTGTG TACAAGAACC CATCAGCTCT TATTGAGGAA AGAGTAAAAG 4533 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAGCAGAAAT GGATTTTGTG TACAAGAACC CATCAGCTCT 
TATTGAGGAA AGAGTAAAAG 180 ATTTGTTGTC TCGGATGACA CTTGAAGAAA AAATAGGCCA AATGACTCAG ATCGAACGCA 4593 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTTGTTGTC TCGGATGACA CTTGAAGAAA AAATAGGCCA AATGACTCAG ATCGAACGCA 240 GTGTTGCTAC CCCCTCTGTC ATTACTGACC TTTCTATAGG TATGCCTTTT TTTCCCTCTT 4653 |||||||||| |||||||||| |||||||||| ||||||||| GTGTTGCTAC CCCCTCTGTC ATTACTGACC TTTCTATAG. .......... .......... 279 TGCCCTAAAA ATAATAATGA AGGTTTCTGA GTTGGAGTCG GGATTTGAAG CTTATGAATT 4713 .......... .......... .......... .......... .......... .......... 279 TGGGAAAATT TAGTTCATTT TAAGTTTTTG AGTTTGTAGG TTAGTAGGTA GTTGAGAATT 4773 .......... .......... .......... .......... .......... .......... 279 GCCTTGTTTT TAAGGGTTTA GGTTTATTTG TGTCTGTCGT TGTATTAGCC GTATTAGCAG 4833 .......... .......... .......... .......... .......... .......... 279 TTTTAGTTTT TGATATCTAT CATCATTTGT TTGTTGTGTT TGTGTTACTG CACTGCTTTT 4893 .......... .......... .......... .......... .......... .......... 279 CTCGTATTAG TGTTATGTTC TCTTGTTCTT TACATCTGAA ATTGGTGAGG GTCTTTCGGG 4953 .......... .......... .......... .......... .......... .......... 279 AACAACCTCT CTATCTTCAC GGGACAGTGA TGAAGTATGC TTATACTCTA CTCTCCCTAA 5013 .......... .......... .......... .......... .......... .......... 279 ACCCCATGAG TGGGATTTCA TTCGGTATGT TGTTGGTGTT GTTAAAGTAT TGAGTTCTAA 5073 .......... .......... .......... .......... .......... .......... 279 ACGAAATTTA CACATATTCA TCAGACAAAA ACGAAATTTG GATAAAAGCT ATTGAGTTAG 5133 .......... .......... .......... .......... .......... .......... 279 AACCAAAGTG CTAGGTCCAC GCCAGTCAAG GTGATAGATT AAATTTGTTG TTGGAGTAGA 5193 .......... .......... .......... .......... .......... .......... 279 TATAAGTGTG ATATTTAGAG CACTTTATTT AGCATTGGAA TGAAATCAAC CTGGATAGAG 5253 .......... .......... .......... .......... .......... .......... 279 AATAATCAAG ATAAATAATA GGTTTAAGTT ATATACACTG ACGAGGTAAC ATCAGTGTAG 5313 .......... .......... .......... .......... .......... .......... 
279 TTTAACCTGT TACATTAGGT TGCTTACCAT TTGGTATAAC TTATAAGTTA CCAATGAATG 5373 .......... .......... .......... .......... .......... .......... 279 ATTATTAAGT AGAGTTACAT GCAATGACCT GGTTGTGCAA CTATTCTATT CCATGCCTGT 5433 .......... .......... .......... .......... .......... .......... 279 GTATAGAACG AGAGCTCATA AATGGATTCA TTTAGGCGTT ATTAACTACC TTATAAGCAT 5493 .......... .......... .......... .......... .......... .......... 279 TGTGTTTAGC AGGCTTTATT TAAGCGTTTA TCGGAAACAT CTCTATCTCT ACGAGGTAGG 5553 .......... .......... .......... .......... .......... .......... 279 GTAAGGTCTG TATACACTCT ACCCTTTCGA GATCCACTTG GTGGGATTGC ATTTGGTATG 5613 .......... .......... .......... .......... .......... .......... 279 TTGTTGTTGT AATTGCCTTA TAAAAACTAA GGTGGTTCTA ACTAGTTGGG TAAGTGTAAG 5673 .......... .......... .......... .......... .......... .......... 279 ATTTAGTTAA CCCCTTACTC AGCATTGGAT TGAAATGGAC TTGAATAGAG TGGATGGGTA 5733 .......... .......... .......... .......... .......... .......... 279 TATAGGATGC ATTTAGGTGA TTCTAATTAG TTGGGATAAG CTAGTTATCC ATGTAGAGGG 5793 .......... .......... .......... .......... .......... .......... 279 ACAAGTGTGA GATTCAGAGA ATCCTTTATT TTGAATTGAA TTGAAATGAG ATGAAATGAA 5853 .......... .......... .......... .......... .......... .......... 279 CTGAATAGAG TGGAATGCAT ATAGATGACT CTTTTAGTTG ATACTAACTG GTTGCTGTTA 5913 .......... .......... .......... .......... .......... .......... 279 AACTAGCTAT GTATTTAGGA TAAGATTGGG ACTTTGAGAA CACTTATAAG ACGAAATGGA 5973 .......... .......... .......... .......... .......... .......... 279 CCTAAATGGA GAGGAACAGG TATAGAGGAT TCTATTCCCG GTCCTAAGAT GTTCTGGAAT 6033 .......... .......... .......... .......... .......... .......... 279 TGTGACATAT TGCTTTGCTG AAATTTATTT GAACCTTTTT CGTGGCGCAG GGAGTATACT 6093 |||||||||| .......... .......... .......... .......... .......... 
GGAGTATACT 289 CAGTGTTGGA GGCAGTGCGC CATTTGAGGA TGCTCCATCG GAAGCTTGGG CAGATATGGT 6153 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAGTGTTGGA GGCAGTGCGC CATTTGAGGA TGCTCCATCG GAAGCTTGGG CAGATATGGT 349 TGACGGATTT CAAAAGGCTG CGCTGGAATC ACGGCTTGGG ATTCCGCTTC TTTATGGAGT 6213 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGACGGATTT CAAAAGGCTG CGCTGGAATC ACGGCTTGGG ATTCCGCTTC TTTATGGAGT 409 TGACGCTATT CATGGCAATA ACAATGTTTA TGGTGCTACC GTTTTTCCAC AAAATGTGGG 6273 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGACGCTATT CATGGCAATA ACAATGTTTA TGGTGCTACC GTTTTTCCAC AAAATGTGGG 469 CCTTGGAGCC ACCAGGTTGA ATTAATGTCA AGCTTTGATC TTTATTACCA ATTCTTATTG 6333 |||||||||| ||||| CCTTGGAGCC ACCAG..... .......... .......... .......... .......... 484 ATAACTGGCA TTTAAGATCA ATACCTAAAT TAGATAAATA TCGAGCAGAG ATGCAGACTT 6393 || |||||||||| .......... .......... .......... .......... ........AG ATGCAGACTT 496 GGTTCAGAAG ATTGGGATTG TGACTGCTCT TGAAGTCAGG GCTTGTGGCA TTAACTATAC 6453 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGTTCAGAAG ATTGGGATTG TGACTGCTCT TGAAGTCAGG GCTTGTGGCA TTAACTATAC 556 TTTTGCTCCC TGTGTTGCTG TAAGTTGAAC GTGTTACTTG TCCCTTTATT GTCTTTGTTT 6513 |||||||||| ||||||||| TTTTGCTCCC TGTGTTGCT. .......... .......... .......... .......... 575 TAGGTCAATT TCATGGATGA ATGAAGTAAA GGAAACATAA GACGTTGGAC ATATGTACAC 6573 .......... .......... .......... .......... .......... .......... 575 TTAGAGCAAG ATGGAAATAG GATACCCTAA GTATCACATC AAAATTTATA ATTTGAATTC 6633 .......... .......... .......... .......... .......... .......... 575 TTAATTGGTC CTTCCGTTAC AAATTTTTTG TCCGGTTTTG ACTGGGCATT GAGTTAAAGA 6693 .......... .......... .......... .......... .......... .......... 575 AAGTTGCGAA AACTTACAAA TCTTGTGGTC TTAAGCTATA AATATGTAGA ATTTACCAAA 6753 .......... .......... .......... .......... .......... .......... 575 ATGCCCTTTA ATATTGTGTT CTTAAACATG TCATGTGAGT GACAATTAAA GAGTTGTCAA 6813 .......... .......... .......... .......... 
.......... .......... 575 AAAAGAAAAG ATACATTCTT TTTGAAACGG ACTAAAAAGA AAAGTAGGAC AAACAAATTG 6873 .......... .......... .......... .......... .......... .......... 575 AAACAGAGGG AGTAGTATGT TGCTAACTGA GACAAGAGAA ACATGATTTG TCGTTTTTCT 6933 .......... .......... .......... .......... .......... .......... 575 TTTTGGGAAT TCTTGGTTAA AATTTTTATG TTCCCAGTAC CTCGTAACTG CTATCTCTCT 6993 .......... .......... .......... .......... .......... .......... 575 AGAAGTAGTA CTTGTTTTTT TTTTGTTTGT TTGAGAAGAT AAGCAGCTTG TATATTGATA 7053 .......... .......... .......... .......... .......... .......... 575 AAGAGAAACT ATGTAGTACC AAGCTGATGC TGAGCAATAC CTCTCTAGTT GTTATCTCTT 7113 .......... .......... .......... .......... .......... .......... 575 TTTAACTCTA TTGTCTTGAG GCATCTTGAC TTCTTCATTT AAATAATGTG AAACAGGTAT 7173 |||| .......... .......... .......... .......... .......... ......GTAT 579 GTAGAGATCC CAGGTGGGGA AGATGCTATG AGAGTTATGG CGAAGACACC GAACTTATTA 7233 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTAGAGATCC CAGGTGGGGA AGATGCTATG AGAGTTATGG CGAAGACACC GAACTTATTA 639 GGAAGATGAC CTCAATTGTC ACAGGCTTGC AAGGGCAACC ACCTCCTGGA TACCCCCAAA 7293 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGAAGATGAC CTCAATTGTC ACAGGCTTGC AAGGGCAACC ACCTCCTGGA TACCCCCAAA 699 ACTATCCTTT TCTAGCTGGA AGGTAGTAGA AGTTGCCATA AAACAAGTTC CAAAATTTTG 7353 |||||||||| |||||||||| || ACTATCCTTT TCTAGCTGGA AG........ .......... .......... .......... 721 TATGCTTCTC TGAATATATA TAAAATTTAT TTCTCTTGTA TTTTACAACC TATAATCTTA 7413 .......... .......... .......... .......... .......... .......... 721 AGTCTTATGT GCATTCAAAT GCTATTCCAC ATGTTGCAAT TTTCAGCAGA AAAGTCATTA 7473 .......... .......... .......... .......... .......... .......... 721 GGGAAGGAAA AGTAGAAGCA TAGCTAGAGT TGAAAATAAA AAAATCAGTG TTGAAAAGTG 7533 .......... .......... .......... .......... .......... .......... 721 TGTCTTAGCT ATTATTACCT TATTGTTGAT CATTCGCCAG ATACTTCTAC AGCTTTCTAC 7593 .......... .......... .......... .......... 
.......... .......... 721 TTTTGTATTT TTTGTTCTCC TAATGGCAAG GATCTCTTTG GACTGATTGA TGAATTTACT 7653 .......... .......... .......... .......... .......... .......... 721 CCAGAGACAA GGTTGTTGCC TGTGCAAAGC ACTTTGTTGG AGATGGGGGT ACTGACCGAG 7713 |||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ....AGACAA GGTTGTTGCC TGTGCAAAGC ACTTTGTTGG AGATGGGGGT ACTGACCGAG 777 GTATAAATGA GGGAAATACC ATATCATCGT ATGAAGATCT AGAGAGAATA CATATTCCCC 7773 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTATAAATGA GGGAAATACC ATATCATCGT ATGAAGATCT AGAGAGAATA CATATTCCCC 837 CATATATTGA CTGTATTTCT CAGGGAGTTT GCACAGTAAT GGCATCCTAC TCTAAATGGA 7833 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CATATATTGA CTGTATTTCT CAGGGAGTTT GCACAGTAAT GGCATCCTAC TCTAAATGGA 897 ATGGAAGCCA CCTGCATTCT AGCCACTTTC TTCTTACTGA AGTTTTGAAA GGGAAGCTCG 7893 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATGGAAGCCA CCTGCATTCT AGCCACTTTC TTCTTACTGA AGTTTTGAAA GGGAAGCTCG 957 GATTTAAGGT AAGGATTGCC TGTTCTCTTA ATGCTCTATG GTTCATTTGA AGTCCAAATT 7953 |||||||| GATTTAAG.. .......... .......... .......... .......... .......... 965 TTGGACCATT AGTTCAACAT AAAAATCCTC TAGATTGTTA GATGCTTTTC TCATTTCCTT 8013 .......... .......... .......... .......... .......... .......... 965 ACCATGCTTA GTAGGGGAAG TGATTGAACA TATTGCGGTT CCAAAATGCT GAGTTTCAGG 8073 .......... .......... .......... .......... .......... .......... 965 GCCCATTTTT GATCGGTATT GTTCTTTAAA ATAATGTAGA AAACTAATCA AATTGGTCTG 8133 .......... .......... .......... .......... .......... .......... 965 AAGCCTGCAC ATGTTTGGAC TTTGTATTCA GCAAACACAA AAGATTCTCT CAGCAAAGAG 8193 .......... .......... .......... .......... .......... .......... 965 CTAGAGAAGT TCATGTGGCT TGGAGTTTGT TCTACTTAGT TTAAACTTCA CCAAGCCGAA 8253 .......... .......... .......... .......... .......... .......... 965 AGTGTAAAAA CTTGTTTCGA TGTGCTTGCT TCAACTTACA AAGGAGTCGG TAAGGCAAAG 8313 .......... .......... .......... .......... .......... .......... 
965 AGAGTGGTAA GTACTCCTTT GTCCTTAACA AGAGGTCTAG GGTTCGAGTC CCCTTGGGTA 8373 .......... .......... .......... .......... .......... .......... 965 CGGAGTCGCC TTTGTTAGGG AGTGCTTTAC CCCAATGTGG GACTTTCCGA CGCGAATCCG 8433 .......... .......... .......... .......... .......... .......... 965 AATTTAGTCG GGCTCCAATG TGGGTAGGGG ACACTGGATG GGAAACCAAA AAAAAAAGGT 8493 .......... .......... .......... .......... .......... .......... 965 ATGGTTAAGG GGATTAAGAA ACAATGTCTC TCTTTTATCT GAATCTGGTT GCGTGGAATT 8553 .......... .......... .......... .......... .......... .......... 965 TTGTTATTGT TTATTTATAT TTTACCTCGA TTTGAATGAA ATGCTATGCA CTAATGTTCT 8613 .......... .......... .......... .......... .......... .......... 965 ATTTGTTTGT CAAGTTGCTA GAATATGATT GGCTTAAAGA TCACATACAC TGCAAGAATC 8673 .......... .......... .......... .......... .......... .......... 965 TGAATGCTAA CTCTTGTAGG GCTTTGTTAT TTCTGATTCC GAAGGAATTG ACCGATTTTT 8733 | |||||||||| |||||||||| |||||||||| |||||||||| .......... .........G GCTTTGTTAT TTCTGATTCC GAAGGAATTG ACCGATTTTT 1006 CCATCCTCAT GGATCTAACT ATGACCAAAG TATTTTGGCA GCAATCAATG CAGGGATTGA 8793 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCATCCTCAT GGATCTAACT ATGACCAAAG TATTTTGGCA GCAATCAATG CAGGGATTGA 1066 CATGGTACAT AAAACCTTGT TTTAAGAGTT ATTTTGTCTT TGCATGAGTT ATTTTTGTTC 8853 |||| CATG...... .......... .......... .......... .......... .......... 1070 TTACTGTGTT AGATTTTTTG CAGGTGATGG TTCCTTTTCG GTATCAATTA TTTCTCGATC 8913 ||||||| |||||||||| |||||||||| |||||||||| .......... .......... 
...GTGATGG TTCCTTTTCG GTATCAATTA TTTCTCGATC 1107 ATTTGAAATA TCTTGTGGAA TCTGGGAATA TTCCAATGAC CAGAATTGAT GATGCTGTTG 8973 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTTGAAATA TCTTGTGGAA TCTGGGAATA TTCCAATGAC CAGAATTGAT GATGCTGTTG 1167 AAAGGATCCT GAGAGTTAAG TTTGTTTCCG GAGCTTTTGA GAACCCTCTG AGTGATAGGT 9033 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAGGATCCT GAGAGTTAAG TTTGTTTCCG GAGCTTTTGA GAACCCTCTG AGTGATAGGT 1227 CATTGTTGGA TACCGTTGGT TGTCATGTAA GTCCGTCTTG TCCATTTTGA CAGATATTGG 9093 |||||||||| |||||||||| |||||| CATTGTTGGA TACCGTTGGT TGTCAT.... .......... .......... .......... 1253 CAAAATATTT ATCTTGAATT TTGACATTAA ATAAGACCTT CATAAGTTTC ATTCTATTCT 9153 .......... .......... .......... .......... .......... .......... 1253 GCCTTTTATC TCAACAACAA AGTGAACATC TGCTTCACAT CCATGATGCA CGGACTTCTC 9213 .......... .......... .......... .......... .......... .......... 1253 AAAATGATTG CAAGCACCCT TGTTGATGTG ACATTGGTGT GGGTGTGGGA TCCACGTGGG 9273 .......... .......... .......... .......... .......... .......... 1253 ATTCGGTCAA CTTATCTTGG ATGCTTTGAC TACATCTATG ACCAAGAAGT ATAGATTGAT 9333 .......... .......... .......... .......... .......... .......... 1253 TGGCTAATTC ATGAGATCTC TGAGTAAAAC CCTATGTTTA TAAAGAAATA AATTTTATTA 9393 .......... .......... .......... .......... .......... .......... 1253 GTTTTAATTC AATACCTTTA ATTAATAGAA ATAAATACTT TGTATAATGT TAAAATATAC 9453 .......... .......... .......... .......... .......... .......... 1253 GTTTATTGGC CTTATCCCAC TTCCCGTATC CGCTCTCGGA TTCATACCCC CAAATCCTAA 9513 .......... .......... .......... .......... .......... .......... 1253 AAGTTCTGTG ACGAAGAGTC CGACCTCTAG ATGCGCACCC ATAGTGGGCA CCCAAACCCG 9573 .......... .......... .......... .......... .......... .......... 1253 AGTCTGAGCA ACATAGCTTT TGCATTGCCC TTACAAAAGT TAAATTATGT GAGTTGGCAA 9633 .......... .......... .......... .......... .......... .......... 1253 ACTGTTTTAC CTGTTTTTAC AATGTATATA CTGCTAATTC TGTTTGTTTT GTAAAATTAT 9693 .......... 
.......... .......... .......... .......... .......... 1253 ACATTTTTAA AGAATGTAGG AGAGAGAGAG AGAGGAGTTA TGTGCAATAT ATCAGTGCAG 9753 .......... .......... .......... .......... .......... .......... 1253 AGGCTTCATA GTAGTTTTCA CCATTCTGAT AATTGTTTCA GCAACATCGC GAATTAGCAC 9813 ||||||||| |||||||||| .......... .......... .......... .......... .CAACATCGC GAATTAGCAC 1272 GTGAAGCAGT TCGCAAATCA CTGGTTCTTC TAAAGAATGG GAAGGATGTA ACAAAACCAT 9873 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTGAAGCAGT TCGCAAATCA CTGGTTCTTC TAAAGAATGG GAAGGATGTA ACAAAACCAT 1332 TTCTTCCGCT AGATAGGAAG GCAAAGAGAA TTCTTGTAGC AGGAAAACAT GCTGACGACC 9933 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCTTCCGCT AGATAGGAAG GCAAAGAGAA TTCTTGTAGC AGGAAAACAT GCTGACGACC 1392 TTGGATTCCA ATGTGGAGGG TGGACTAAAA CATGGGAAGG AATGGGCGGA AGAATCACGA 9993 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGGATTCCA ATGTGGAGGG TGGACTAAAA CATGGGAAGG AATGGGCGGA AGAATCACGA 1452 TTGGTAAGTT ATGCTTCCTC TGTTCTAATT ATATGACACT CATTCATATT TACTACATGA 10053 ||| TTG....... .......... .......... .......... .......... .......... 1455 AAAAAACAAT TTCAACTTTC TATATTGTGA AAACTCTGAC TTTAGGTGTA CCATTTTATT 10113 .......... .......... .......... .......... .......... .......... 1455 CTTAGTGACA TACTCCTATA GATATAGAAA TATTTGACAT GTTTTAGTTC ACGAGTTTTA 10173 .......... .......... .......... .......... .......... .......... 1455 TGGGTACTTT TGGTATGTGT AGAAAGTCTT TGGTTTTTCT TAAATTTTGT GCCCAGCCAA 10233 .......... .......... .......... .......... .......... .......... 1455 GCACCGCCAT ATAAAATGGA GAGAGTAATG GAAAGATGGC ATCAGTCGTT ACTTATACTT 10293 .......... .......... .......... .......... .......... .......... 1455 TCATACAACT AGAGCATTAT AGTTTAGAGT AGCATCTTTC TTTCCTTCTT TCTTACATGG 10353 .......... .......... .......... .......... .......... .......... 1455 ACTCAAATTT ACCACAAGTT TAGGAACAAT ATCTCAAGAG TTCCGCTCCC ATTTCTTGTG 10413 .......... .......... .......... .......... .......... .......... 
1455 GGTGATTTGT AAAAACCCTG TATGTTTGAT TGTTTTCTCG CAAGAAATCT TAAGCTATAA 10473 .......... .......... .......... .......... .......... .......... 1455 TGTCATAAAA TGCCTGATAT CGCCAGCTGA TAGCATAATT TTGTATGACA TATGTAAGCT 10533 .......... .......... .......... .......... .......... .......... 1455 AGTGCATACT CTTGTTAGCC TTGCACAAAG ATCTTGATGT ACTAGGAACC TCAATAGACC 10593 .......... .......... .......... .......... .......... .......... 1455 CAATGTAGTC CGGTCCTTCT GCGTACCATG CTCAAAACAG AAACTTAGTG CACTTGGCTG 10653 .......... .......... .......... .......... .......... .......... 1455 CCCTTAGGAA CCTCAATAAG GATGAGGATT GGGATTAGAC GTGCGTTGAG TCCTTTTGAT 10713 .......... .......... .......... .......... .......... .......... 1455 AATGAGCTCA TTTCTCTCAA TAGGCCGATG ATAATATAAT GCATCTCTGA ACTTAGTTTA 10773 .......... .......... .......... .......... .......... .......... 1455 CCACTATTAA TCACGGATCA TTTACTACTT AAAACGGATA ATTTGTTGTA GGTACAACTA 10833 | ||||||| .......... .......... .......... .......... .......... .GAACAACTA 1464 TTCTGGAAGC TATTAAAGAT GCTGTTGGAG GGGAAACAGA ACTGGTATAT GAAGAAAATC 10893 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCTGGAAGC TATTAAAGAT GCTGTTGGAG GGGAAACAGA ACTGGTATAT GAAGAAAATC 1524 CTTCACCAGA CACCTTTGCG AGTCAAGACT TCTCTTATTG CATTGTAGTT GTTGGTGAAC 10953 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTCACCAGA CACCTTTGCG AGTCAAGACT TCTCTTATTG CATTGTAGTT GTTGGTGAAC 1584 CTCCCTATTG TGAAAGCGGT GGAGACAGCC AAGACCTCAG AATTCCTCTT GGCGGAGAAG 11013 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTCCCTATTG TGAAAGCGGT GGAGACAGCC AAGACCTCAG AATTCCTCTT GGCGGAGAAG 1644 AACTAATAAG CTTGGTTGCA GACAGAGTTC CAACGTTGGT GATATTGATC TCCGGAAGGC 11073 ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AACTAATAAA CTTGGTTGCA GACAGAGTTC CAACGTTGGT GATATTGATC TCCGGAAGGC 1704 CTTTACATAT AGAGCCTTCG ATTCTGGAGA AAATGGATGC CTTCGTTGCT GCATGGTTAC 11133 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTTACATAT 
AGAGCCTTCG ATTCTGGAGA AAATGGATGC CTTCGTTGCT GCATGGTTAC 1764 CGGGCACTGA GGGAACTGGT ATCACTGATG TCATATTCGG AGATTTTGAA TTCCATGGAA 11193 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CGGGCACTGA GGGAACTGGT ATCACTGATG TCATATTCGG AGATTTTGAA TTCCATGGAA 1824 CCCTCCCTAT GACATGGTTT AAGAGTGTAG ATCAATTACC CCTGCATCAA GAACAGAACT 11253 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCCTCCCTAT GACATGGTTT AAGAGTGTAG ATCAATTACC CCTGCATCAA GAACAGAACT 1884 CCTATGAACC TCTCTTTCCA TTCGGCTACG GATTAACAAG TAAAAACAAG GTGATCTAGA 11313 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCTATGAACC TCTCTTTCCA TTCGGCTACG GATTAACAAG TAAAAACAAG GTGATCTAGA 1944 CGAGATGTGT CGAAGAAAGG TTTAGTGAAA TAGACATGTC TATAAAGATG CGGACTATCA 11373 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CGAGATGTGT CGAAGAAAGG TTTAGTGAAA TAGACATGTC TATAAAGATG CGGACTATCA 2004 GAAAGACTGT ATGAAGCTTT ATGCAACCTC TTGTGTACTT TGTACCATTT GGTCTCTGGC 11433 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAAAGACTGT ATGAAGCTTT ATGCAACCTC TTGTGTACTT TGTACCATTT GGTCTCTGGC 2064 ATATGATGTT ACATTAAGTC CAAATAATTA TATATTATGT ACTATTATTA CC--ATATGT 11491 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| || | | ATATGATGTT ACATTAAGTC CAAATAATTA TATATTATGT ACTATTATTA CCAAAAAAAA 2124 GAACAAGGAA AAATGCGTAA CAGTTTCATT TAATAACGAT ATAAAATATT CAACTTATTT 11551 || || || ||| || | ||||||| |||||||||| |||||||||| |||||||||| AAAAAAAAAA AAAAAAAAAA AAATTTCATT TAATAACGAT ATAAAATATT CAACTTATTT 2184 TA 11553 || TA 2186 hqPGS_C06HBa0054K13.1-4+_SGN-U316037+ (4354 4632,6084 6288,6382 6472,7170 7315,7658 7901,8693 8797,8877 9059,9795 9996,10825 11553) Total number of EST alignments reported: 4 ________________________________________________________________________________ Predicted gene locations (2) in segment 1 to 14213: PGL 1 (- strand): 2250 1 AGS-1 (2250 1848,1348 1133,598 1) SCR (e 1.000 d 0.999 a 0.909,e 1.000 d 0.898 a 0.983,e 1.000) Exon 1 2250 1848 ( 403 n); score: 
1.000 Intron 1 1847 1349 ( 499 n); Pd: 0.999 Pa: 0.909 Exon 2 1348 1133 ( 216 n); score: 1.000 Intron 2 1132 599 ( 534 n); Pd: 0.898 Pa: 0.983 Exon 3 598 1 ( 598 n); score: 1.000 PGS (2249 1848,1348 1133,598 1) SGN-U318765+ PGS (2250 1848) SGN-U336622+ 3-phase translation of AGS-1 (-strand): . . . . . . 2250 TTACAAATGAAAATTGAGTTACACATTTAAGGAGTCTCATGGGAAAAGAAGAAACAAACT L Q M K I E L H I - G V S W E K K K Q T Y K - K L S Y T F K E S H G K R R N K L T N E N - V T H L R S L M G K E E T N . . . . . . 2190 ATGTTGTTTGAACTCTCCTAAAATATCCCTCGGTGTTTTTGAAGAGTCTGAACAATATAG M L F E L S - N I P R C F - R V - T I - C C L N S P K I S L G V F E E S E Q Y R Y V V - T L L K Y P S V F L K S L N N I . . . . . . 2130 AAAACAAGTGCATGTTAACTATCCCCTTATTAGTCTTTAGCCTTCTCTTTACAATCTCTT K T S A C - L S P Y - S L A F S L Q S L K Q V H V N Y P L I S L - P S L Y N L F E N K C M L T I P L L V F S L L F T I S . . . . . . 2070 TTTTCACTATGTGTTTCCTTTAATGCTGCAGAATTACCCTTATCTGAAGTTTGTTCACTT F S L C V S F N A A E L P L S E V C S L F H Y V F P L M L Q N Y P Y L K F V H L F F T M C F L - C C R I T L I - S L F T . . . . . . 2010 GCACTAATAGGAGTGACTAAAATTTCATTCTTGTCAGTTGAACTGGAAGCAAAATTTGTA A L I G V T K I S F L S V E L E A K F V H - - E - L K F H S C Q L N W K Q N L - C T N R S D - N F I L V S - T G S K I C . . . . . . 1950 ACATCACTTGTCTTTTCTTGTTTGTTGTTGTTGGTGGTGTTGGAAGACACCATAGTGGTC T S L V F S C L L L L V V L E D T I V V H H L S F L V C C C W W C W K T P - W S N I T C L F L F V V V G G V G R H H S G . . . . . : . 1890 TTGTTTGGTGATTCTACAGAGACTTCTTTGAGGCTCATTTTAG : ACTCAACTAATGAAGTT L F G D S T E T S L R L I L : D S T N E V C L V I L Q R L L - G S F - : T Q L M K F L V W - F Y R D F F E A H F R : L N - - S . . . . . . 1331 CTAGACATTCTTGGACTTGGACTTTCACCTATTTGTTCCTCCCTGTCAAAGTTCAACAGA L D I L G L G L S P I C S S L S K F N R - T F L D L D F H L F V P P C Q S S T E S R H S W T W T F T Y L F L P V K V Q Q . . . . . . 
1271 GGGCCTTGACCTGTTGAGGACTTTGGTTTGAGTACGGACTTTGGTGATGATGAACAACTC G P - P V E D F G L S T D F G D D E Q L G L D L L R T L V - V R T L V M M N N S R A L T C - G L W F E Y G L W - - - T T . . . . . . 1211 GAGGCTGTAGGGACTTCTCGGATTAAAGATTGTAAGCTGATCAAGTCTTGATTGACAGAG E A V G T S R I K D C K L I K S - L T E R L - G L L G L K I V S - S S L D - Q S R G C R D F S D - R L - A D Q V L I D R . . : . . . . 1151 CTCTCCGATGGAATTCCAG : AAGTCGAGGAGCCTTCACTTGCTGAGAACTTTGGTTGAATC L S D G I P : E V E E P S L A E N F G - I S P M E F Q : K S R S L H L L R T L V E S A L R W N S R : S R G A F T C - E L W L N . . . . . . 557 ATAGTGTTGGGAGAGGGGGATGAAGTAGAGGCCCTCGGAACTTGTTCAACCACCAAATCA I V L G E G D E V E A L G T C S T T K S - C W E R G M K - R P S E L V Q P P N H H S V G R G G - S R G P R N L F N H Q I . . . . . . 497 TGTGGTGATGATGATGATGATGCTTGTGAACCATCAATACTCTGGCTAATTGTAGAAAAA C G D D D D D A C E P S I L W L I V E K V V M M M M M L V N H Q Y S G - L - K K M W - - - - - C L - T I N T L A N C R K . . . . . . 437 ACTTCTTCAATAGGCTGTTTACCACCATTATCGTGAAGTGCTTCATGAATTTCATTCCTA T S S I G C L P P L S - S A S - I S F L L L Q - A V Y H H Y R E V L H E F H S - N F F N R L F T T I I V K C F M N F I P . . . . . . 377 GGGAGAGACTTTGAATTTTCTGCAGACTTCTGTGTCTTCGGATGTGGTACTGAACTTTGC G R D F E F S A D F C V F G C G T E L C G E T L N F L Q T S V S S D V V L N F A R E R L - I F C R L L C L R M W Y - T L . . . . . . 317 ACTGTCGGATTTCTTAACGTAAGTTTTGAGCTCCGAGTACTGGTTGGATCACAACTTTCA T V G F L N V S F E L R V L V G S Q L S L S D F L T - V L S S E Y W L D H N F Q H C R I S - R K F - A P S T G W I T T F . . . . . . 257 GCAGTTACTTCATCTGTCAGCTTATTTTTTAGTTTGCGCCTATCTACTTGTTGAGCCTCC A V T S S V S L F F S L R L S T C - A S Q L L H L S A Y F L V C A Y L L V E P P S S Y F I C Q L I F - F A P I Y L L S L . . . . . . 197 AAATTTGAACTAGATCCTTTATCAATGGAAAATGTTGGTTGCGAAACAGACTTAGACGAT K F E L D P L S M E N V G C E T D L D D N L N - I L Y Q W K M L V A K Q T - T M Q I - T R S F I N G K C W L R N R L R R . . . . . . 
137 GATGTACTAGAAGAGGCTTCAAATGAGTGATGTCCTTGTCCAATAGTCGAAAATGCCACA D V L E E A S N E - C P C P I V E N A T M Y - K R L Q M S D V L V Q - S K M P Q - C T R R G F K - V M S L S N S R K C H . . . . . . 77 GGTTCACTATGTTGTTCGCTGGCACTGTCTAGAACGAGTGCAGCGACATCGTTCTCTTTG G S L C C S L A L S R T S A A T S F S L V H Y V V R W H C L E R V Q R H R S L W R F T M L F A G T V - N E C S D I V L F . . 17 GCTGAAATTGAATTTTC A E I E F L K L N F G - N - I F Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-4-_PGL-1_AGS-1_PPS_1 (2097 1848,1348 1263) (frame '1'; 333 bp, 111 residues) 1 SLAFSLQSLF SLCVSFNAAE LPLSEVCSLA LIGVTKISFL SVELEAKFVT SLVFSCLLLL 61 VVLEDTIVVL FGDSTETSLR LILDSTNEVL DILGLGLSPI CSSLSKFNRG P- AGS-2 (1636 1133) SCR (e 0.920) Exon 1 1636 1133 ( 504 n); score: 0.920 PGS (1636 1133) SGN-U341543+ 3-phase translation of AGS-2 (-strand): . . . . . . 1636 TGACTCCGTATTACATTATTTTCACAAGCACAAAAATATGATATAATGAATGTAACAAGA - L R I T L F S Q A Q K Y D I M N V T R D S V L H Y F H K H K N M I - - M - Q E T P Y Y I I F T S T K I - Y N E C N K . . . . . . 1576 AAAGACCATCTATTTATGCTGTTTCAAAGAGAGGGTAAGTTTTATGTACTGATAAGTGCG K D H L F M L F Q R E G K F Y V L I S A K T I Y L C C F K E R V S F M Y - - V R K R P S I Y A V S K R G - V L C T D K C . . . . . . 1516 AAAATAGGAGAACCTATTCTCTAAAGGGTGGTTTGGTTTATCATACTAACAAAAATAATC K I G E P I L - R V V W F I I L T K I I K - E N L F S K G W F G L S Y - Q K - S E N R R T Y S L K G G L V Y H T N K N N . . . . . . 1456 CTACCATACTGTCAGTATATATAACTTAAGTAATTGAAGGTATCATTTGATCCAGTCTAT L P Y C Q Y I - L K - L K V S F D P V Y Y H T V S I Y N L S N - R Y H L I Q S I P T I L S V Y I T - V I E G I I - S S L . . . . . . 1396 ATAAACGTACCGTTTTAGCATCTGTATGATGAAGTTGGTCTGATGCAGACTCAACTAATG I N V P F - H L Y D E V G L M Q T Q L M - T Y R F S I C M M K L V - C R L N - - Y K R T V L A S V - - S W S D A D S T N . . . . . . 
1336 AAGTTCTAGACATTCTTGGACTTGGACTTTCACCTATTTGTTCCTCCCTGTCAAAGTTCA K F - T F L D L D F H L F V P P C Q S S S S R H S W T W T F T Y L F L P V K V Q E V L D I L G L G L S P I C S S L S K F . . . . . . 1276 ACAGAGGGCCTTGACCTGTTGAGGACTTTGGTTTGAGTACGGACTTTGGTGATGATGAAC T E G L D L L R T L V - V R T L V M M N Q R A L T C - G L W F E Y G L W - - - T N R G P - P V E D F G L S T D F G D D E . . . . . . 1216 AACTCGAGGCTGTAGGGACTTCTCGGATTAAAGATTGTAAGCTGATCAAGTCTTGATTGA N S R L - G L L G L K I V S - S S L D - T R G C R D F S D - R L - A D Q V L I D Q L E A V G T S R I K D C K L I K S - L . . . 1156 CAGAGCTCTCCGATGGAATTCCAG Q S S P M E F Q R A L R W N S T E L S D G I P Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-2 (+strand): . . . . . . 1133 CTGGAATTCCATCGGAGAGCTCTGTCAATCAAGACTTGATCAGCTTACAATCTTTAATCC L E F H R R A L S I K T - S A Y N L - S W N S I G E L C Q S R L D Q L T I F N P G I P S E S S V N Q D L I S L Q S L I . . . . . . 1193 GAGAAGTCCCTACAGCCTCGAGTTGTTCATCATCACCAAAGTCCGTACTCAAACCAAAGT E K S L Q P R V V H H H Q S P Y S N Q S R S P Y S L E L F I I T K V R T Q T K V R E V P T A S S C S S S P K S V L K P K . . . . . . 1253 CCTCAACAGGTCAAGGCCCTCTGTTGAACTTTGACAGGGAGGAACAAATAGGTGAAAGTC P Q Q V K A L C - T L T G R N K - V K V L N R S R P S V E L - Q G G T N R - K S S S T G Q G P L L N F D R E E Q I G E S . . . . . . 1313 CAAGTCCAAGAATGTCTAGAACTTCATTAGTTGAGTCTGCATCAGACCAACTTCATCATA Q V Q E C L E L H - L S L H Q T N F I I K S K N V - N F I S - V C I R P T S S Y P S P R M S R T S L V E S A S D Q L H H . . . . . . 1373 CAGATGCTAAAACGGTACGTTTATATAGACTGGATCAAATGATACCTTCAATTACTTAAG Q M L K R Y V Y I D W I K - Y L Q L L K R C - N G T F I - T G S N D T F N Y L S T D A K T V R L Y R L D Q M I P S I T - . . . . . . 1433 TTATATATACTGACAGTATGGTAGGATTATTTTTGTTAGTATGATAAACCAAACCACCCT L Y I L T V W - D Y F C - Y D K P N H P Y I Y - Q Y G R I I F V S M I N Q T T L V I Y T D S M V G L F L L V - - T K P P . . . . . . 
1493 TTAGAGAATAGGTTCTCCTATTTTCGCACTTATCAGTACATAAAACTTACCCTCTCTTTG L E N R F S Y F R T Y Q Y I K L T L S L - R I G S P I F A L I S T - N L P S L - F R E - V L L F S H L S V H K T Y P L F . . . . . . 1553 AAACAGCATAAATAGATGGTCTTTTCTTGTTACATTCATTATATCATATTTTTGTGCTTG K Q H K - M V F S C Y I H Y I I F L C L N S I N R W S F L V T F I I S Y F C A C E T A - I D G L F L L H S L Y H I F V L . . . 1613 TGAAAATAATGTAATACGGAGTCA - K - C N T E S E N N V I R S V K I M - Y G V Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-4+_PGL-1_AGS-2_PPS_1 (1135 1431) (frame '0'; 294 bp, 98 residues) 1 GIPSESSVNQ DLISLQSLIR EVPTASSCSS SPKSVLKPKS STGQGPLLNF DREEQIGESP 61 SPRMSRTSLV ESASDQLHHT DAKTVRLYRL DQMIPSIT- PGL 2 (+ strand): 4354 11553 AGS-1 (4354 4632,6084 6288,6382 6472,7170 7315,7658 7901,8693 8797,8877 9059,9795 9996,10825 11553) SCR (e 1.000 d 1.000 a 0.998,e 1.000 d 0.992 a 0.884,e 1.000 d 0.977 a 1.000,e 1.000 d 0.974 a 0.953,e 1.000 d 0.998 a 0.587,e 1.000 d 0.998 a 0.990,e 1.000 d 0.921 a 0.825,e 1.000 d 1.000 a 0.999,e 0.970) Exon 1 4354 4632 ( 279 n); score: 1.000 Intron 1 4633 6083 (1451 n); Pd: 1.000 Pa: 0.998 Exon 2 6084 6288 ( 205 n); score: 1.000 Intron 2 6289 6381 ( 93 n); Pd: 0.992 Pa: 0.884 Exon 3 6382 6472 ( 91 n); score: 1.000 Intron 3 6473 7169 ( 697 n); Pd: 0.977 Pa: 1.000 Exon 4 7170 7315 ( 146 n); score: 1.000 Intron 4 7316 7657 ( 342 n); Pd: 0.974 Pa: 0.953 Exon 5 7658 7901 ( 244 n); score: 1.000 Intron 5 7902 8692 ( 791 n); Pd: 0.998 Pa: 0.587 Exon 6 8693 8797 ( 105 n); score: 1.000 Intron 6 8798 8876 ( 79 n); Pd: 0.998 Pa: 0.990 Exon 7 8877 9059 ( 183 n); score: 1.000 Intron 7 9060 9794 ( 735 n); Pd: 0.921 Pa: 0.825 Exon 8 9795 9996 ( 202 n); score: 1.000 Intron 8 9997 10824 ( 828 n); Pd: 1.000 Pa: 0.999 Exon 9 10825 11553 ( 729 n); score: 0.970 PGS (4354 4632,6084 6288,6382 6472,7170 7315,7658 7901,8693 8797,8877 9059,9795 9996,10825 11553) SGN-U316037+ 3-phase translation of AGS-1 (+strand): . . . . . . 
4354 CTATTCCACTTATCACCATTTTCTTCATTTCTCAATAATCCCACTTTTAAGTTCTTTCCC L F H L S P F S S F L N N P T F K F F P Y S T Y H H F L H F S I I P L L S S F P I P L I T I F F I S Q - S H F - V L S . . . . . . 4414 AGTTACTATTAACACAAATATTTAAGAAAAATCTCAACTTTTTTGTTGAATACTTGTAAT S Y Y - H K Y L R K I S T F L L N T C N V T I N T N I - E K S Q L F C - I L V I Q L L L T Q I F K K N L N F F V E Y L - . . . . . . 4474 AAGCAGAAATGGATTTTGTGTACAAGAACCCATCAGCTCTTATTGAGGAAAGAGTAAAAG K Q K W I L C T R T H Q L L L R K E - K S R N G F C V Q E P I S S Y - G K S K R - A E M D F V Y K N P S A L I E E R V K . . . . . . 4534 ATTTGTTGTCTCGGATGACACTTGAAGAAAAAATAGGCCAAATGACTCAGATCGAACGCA I C C L G - H L K K K - A K - L R S N A F V V S D D T - R K N R P N D S D R T Q D L L S R M T L E E K I G Q M T Q I E R . . . . : . . 4594 GTGTTGCTACCCCCTCTGTCATTACTGACCTTTCTATAG : GGAGTATACTCAGTGTTGGAG V L L P P L S L L T F L - : G V Y S V L E C C Y P L C H Y - P F Y R : E Y T Q C W R S V A T P S V I T D L S I : G S I L S V G . . . . . . 6105 GCAGTGCGCCATTTGAGGATGCTCCATCGGAAGCTTGGGCAGATATGGTTGACGGATTTC A V R H L R M L H R K L G Q I W L T D F Q C A I - G C S I G S L G R Y G - R I S G S A P F E D A P S E A W A D M V D G F . . . . . . 6165 AAAAGGCTGCGCTGGAATCACGGCTTGGGATTCCGCTTCTTTATGGAGTTGACGCTATTC K R L R W N H G L G F R F F M E L T L F K G C A G I T A W D S A S L W S - R Y S Q K A A L E S R L G I P L L Y G V D A I . . . . . . 6225 ATGGCAATAACAATGTTTATGGTGCTACCGTTTTTCCACAAAATGTGGGCCTTGGAGCCA M A I T M F M V L P F F H K M W A L E P W Q - Q C L W C Y R F S T K C G P W S H H G N N N V Y G A T V F P Q N V G L G A . : . . . . . 6285 CCAG : AGATGCAGACTTGGTTCAGAAGATTGGGATTGTGACTGCTCTTGAAGTCAGGGCTT P : E M Q T W F R R L G L - L L L K S G L Q : R C R L G S E D W D C D C S - S Q G L T R : D A D L V Q K I G I V T A L E V R A . . . . : . . 
6438 GTGGCATTAACTATACTTTTGCTCCCTGTGTTGCT : GTATGTAGAGATCCCAGGTGGGGAA V A L T I L L L P V L L : Y V E I P G G E W H - L Y F C S L C C : C M - R S Q V G K C G I N Y T F A P C V A : V C R D P R W G . . . . . . 7195 GATGCTATGAGAGTTATGGCGAAGACACCGAACTTATTAGGAAGATGACCTCAATTGTCA D A M R V M A K T P N L L G R - P Q L S M L - E L W R R H R T Y - E D D L N C H R C Y E S Y G E D T E L I R K M T S I V . . . . . . 7255 CAGGCTTGCAAGGGCAACCACCTCCTGGATACCCCCAAAACTATCCTTTTCTAGCTGGAA Q A C K G N H L L D T P K T I L F - L E R L A R A T T S W I P P K L S F S S W K T G L Q G Q P P P G Y P Q N Y P F L A G . : . . . . . 7315 G : AGACAAGGTTGTTGCCTGTGCAAAGCACTTTGTTGGAGATGGGGGTACTGACCGAGGTA : E T R L L P V Q S T L L E M G V L T E V : R Q G C C L C K A L C W R W G Y - P R Y R : D K V V A C A K H F V G D G G T D R G . . . . . . 7717 TAAATGAGGGAAATACCATATCATCGTATGAAGATCTAGAGAGAATACATATTCCCCCAT - M R E I P Y H R M K I - R E Y I F P H K - G K Y H I I V - R S R E N T Y S P I I N E G N T I S S Y E D L E R I H I P P . . . . . . 7777 ATATTGACTGTATTTCTCAGGGAGTTTGCACAGTAATGGCATCCTACTCTAAATGGAATG I L T V F L R E F A Q - W H P T L N G M Y - L Y F S G S L H S N G I L L - M E W Y I D C I S Q G V C T V M A S Y S K W N . . . . . . 7837 GAAGCCACCTGCATTCTAGCCACTTTCTTCTTACTGAAGTTTTGAAAGGGAAGCTCGGAT E A T C I L A T F F L L K F - K G S S D K P P A F - P L S S Y - S F E R E A R I G S H L H S S H F L L T E V L K G K L G . : . . . . . 7897 TTAAG : GGCTTTGTTATTTCTGATTCCGAAGGAATTGACCGATTTTTCCATCCTCATGGAT L R : A L L F L I P K E L T D F S I L M D - : G L C Y F - F R R N - P I F P S S W I F K : G F V I S D S E G I D R F F H P H G . . . . . : . 8748 CTAACTATGACCAAAGTATTTTGGCAGCAATCAATGCAGGGATTGACATG : GTGATGGTTC L T M T K V F W Q Q S M Q G L T W : - W F - L - P K Y F G S N Q C R D - H : G D G S S N Y D Q S I L A A I N A G I D M : V M V . . . . . . 
8887 CTTTTCGGTATCAATTATTTCTCGATCATTTGAAATATCTTGTGGAATCTGGGAATATTC L F G I N Y F S I I - N I L W N L G I F F S V S I I S R S F E I S C G I W E Y S P F R Y Q L F L D H L K Y L V E S G N I . . . . . . 8947 CAATGACCAGAATTGATGATGCTGTTGAAAGGATCCTGAGAGTTAAGTTTGTTTCCGGAG Q - P E L M M L L K G S - E L S L F P E N D Q N - - C C - K D P E S - V C F R S P M T R I D D A V E R I L R V K F V S G . . . . . . : 9007 CTTTTGAGAACCCTCTGAGTGATAGGTCATTGTTGGATACCGTTGGTTGTCAT : CAACATC L L R T L - V I G H C W I P L V V I : N I F - E P S E - - V I V G Y R W L S : S T S A F E N P L S D R S L L D T V G C H : Q H . . . . . . 9802 GCGAATTAGCACGTGAAGCAGTTCGCAAATCACTGGTTCTTCTAAAGAATGGGAAGGATG A N - H V K Q F A N H W F F - R M G R M R I S T - S S S Q I T G S S K E W E G C R E L A R E A V R K S L V L L K N G K D . . . . . . 9862 TAACAAAACCATTTCTTCCGCTAGATAGGAAGGCAAAGAGAATTCTTGTAGCAGGAAAAC - Q N H F F R - I G R Q R E F L - Q E N N K T I S S A R - E G K E N S C S R K T V T K P F L P L D R K A K R I L V A G K . . . . . . 9922 ATGCTGACGACCTTGGATTCCAATGTGGAGGGTGGACTAAAACATGGGAAGGAATGGGCG M L T T L D S N V E G G L K H G K E W A C - R P W I P M W R V D - N M G R N G R H A D D L G F Q C G G W T K T W E G M G . . : . . . . 9982 GAAGAATCACGATTG : GTACAACTATTCTGGAAGCTATTAAAGATGCTGTTGGAGGGGAAA E E S R L : V Q L F W K L L K M L L E G K K N H D W : Y N Y S G S Y - R C C W R G N G R I T I : G T T I L E A I K D A V G G E . . . . . . 10870 CAGAACTGGTATATGAAGAAAATCCTTCACCAGACACCTTTGCGAGTCAAGACTTCTCTT Q N W Y M K K I L H Q T P L R V K T S L R T G I - R K S F T R H L C E S R L L L T E L V Y E E N P S P D T F A S Q D F S . . . . . . 10930 ATTGCATTGTAGTTGTTGGTGAACCTCCCTATTGTGAAAGCGGTGGAGACAGCCAAGACC I A L - L L V N L P I V K A V E T A K T L H C S C W - T S L L - K R W R Q P R P Y C I V V V G E P P Y C E S G G D S Q D . . . . . . 10990 TCAGAATTCCTCTTGGCGGAGAAGAACTAATAAGCTTGGTTGCAGACAGAGTTCCAACGT S E F L L A E K N - - A W L Q T E F Q R Q N S S W R R R T N K L G C R Q S S N V L R I P L G G E E L I S L V A D R V P T . . . 
. . . 11050 TGGTGATATTGATCTCCGGAAGGCCTTTACATATAGAGCCTTCGATTCTGGAGAAAATGG W - Y - S P E G L Y I - S L R F W R K W G D I D L R K A F T Y R A F D S G E N G L V I L I S G R P L H I E P S I L E K M . . . . . . 11110 ATGCCTTCGTTGCTGCATGGTTACCGGGCACTGAGGGAACTGGTATCACTGATGTCATAT M P S L L H G Y R A L R E L V S L M S Y C L R C C M V T G H - G N W Y H - C H I D A F V A A W L P G T E G T G I T D V I . . . . . . 11170 TCGGAGATTTTGAATTCCATGGAACCCTCCCTATGACATGGTTTAAGAGTGTAGATCAAT S E I L N S M E P S L - H G L R V - I N R R F - I P W N P P Y D M V - E C R S I F G D F E F H G T L P M T W F K S V D Q . . . . . . 11230 TACCCCTGCATCAAGAACAGAACTCCTATGAACCTCTCTTTCCATTCGGCTACGGATTAA Y P C I K N R T P M N L S F H S A T D - T P A S R T E L L - T S L S I R L R I N L P L H Q E Q N S Y E P L F P F G Y G L . . . . . . 11290 CAAGTAAAAACAAGGTGATCTAGACGAGATGTGTCGAAGAAAGGTTTAGTGAAATAGACA Q V K T R - S R R D V S K K G L V K - T K - K Q G D L D E M C R R K V - - N R H T S K N K V I - T R C V E E R F S E I D . . . . . . 11350 TGTCTATAAAGATGCGGACTATCAGAAAGACTGTATGAAGCTTTATGCAACCTCTTGTGT C L - R C G L S E R L Y E A L C N L L C V Y K D A D Y Q K D C M K L Y A T S C V M S I K M R T I R K T V - S F M Q P L V . . . . . . 11410 ACTTTGTACCATTTGGTCTCTGGCATATGATGTTACATTAAGTCCAAATAATTATATATT T L Y H L V S G I - C Y I K S K - L Y I L C T I W S L A Y D V T L S P N N Y I L Y F V P F G L W H M M L H - V Q I I I Y . . . . . . 11470 ATGTACTATTATTACCATATGTGAACAAGGAAAAATGCGTAACAGTTTCATTTAATAACG M Y Y Y Y H M - T R K N A - Q F H L I T C T I I T I C E Q G K M R N S F I - - R Y V L L L P Y V N K E K C V T V S F N N . . . 
11530 ATATAAAATATTCAACTTATTTTA I - N I Q L I L Y K I F N L F D I K Y S T Y F Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-4+_PGL-2_AGS-1_PPS_1 (4476 4632,6084 6288,6382 6472,7170 7315,7658 7901,8693 8797,8877 9059,9795 9996,10825 11312) (frame '0'; 1818 bp, 606 residues) 1 AEMDFVYKNP SALIEERVKD LLSRMTLEEK IGQMTQIERS VATPSVITDL SIGSILSVGG 61 SAPFEDAPSE AWADMVDGFQ KAALESRLGI PLLYGVDAIH GNNNVYGATV FPQNVGLGAT 121 RDADLVQKIG IVTALEVRAC GINYTFAPCV AVCRDPRWGR CYESYGEDTE LIRKMTSIVT 181 GLQGQPPPGY PQNYPFLAGR DKVVACAKHF VGDGGTDRGI NEGNTISSYE DLERIHIPPY 241 IDCISQGVCT VMASYSKWNG SHLHSSHFLL TEVLKGKLGF KGFVISDSEG IDRFFHPHGS 301 NYDQSILAAI NAGIDMVMVP FRYQLFLDHL KYLVESGNIP MTRIDDAVER ILRVKFVSGA 361 FENPLSDRSL LDTVGCHQHR ELAREAVRKS LVLLKNGKDV TKPFLPLDRK AKRILVAGKH 421 ADDLGFQCGG WTKTWEGMGG RITIGTTILE AIKDAVGGET ELVYEENPSP DTFASQDFSY 481 CIVVVGEPPY CESGGDSQDL RIPLGGEELI SLVADRVPTL VILISGRPLH IEPSILEKMD 541 AFVAAWLPGT EGTGITDVIF GDFEFHGTLP MTWFKSVDQL PLHQEQNSYE PLFPFGYGLT 601 SKNKVI- ... finished at: Mon Aug 28 22:23:57 2006 ________________________________________________________________________________ Sequence 5: C06HBa0054K13.1-5, from 1 to 5340, both strands analyzed. ... started at: Mon Aug 28 22:23:57 2006 EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA +strand ... ... found all matches, elapsed seconds = 2 ... matches indexed, elapsed seconds = 2 HitsTableSize = 2 EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA -strand ... ... found all matches, elapsed seconds = 2 ... 
matches indexed, elapsed seconds = 2
HitsTableSize = 0
********************************************************************************
EST sequence 1 +strand 1196 n (File: SGN-U318431+)

1 ATGAAGCAGG GCACGCAGTA GTTGGGACTG CTGTTGCAAA TCTTCTTTCC GGGCAGCCAC
61 GGGTTGAGAA GCTGAGCATA TTGCCAAGAT CTGGAGGGGC TTTAGGATTT ACTTATATTC
121 CTCCAACCAA TGAGGACAGA TATCTGCTCT TTGTTGATGA ATTGCGTGGG AGGTTGGTCA
181 CTCTTCTTGG TGGACGTGCT GCGGAAGAAG TATTGTACTC TGGACGTGTA TCTACTGGTG
241 CTCTTGATGA TATACGCAGA GCTACAGACA TGGCCTACAA GGCAGTGGCT GAATATGGTC
301 TTAGTCAGAC CATTGGCCCT ATCTCAGTGG CAACACTTTC TGGAGGTGGG ATGGATGATG
361 GTGGATCCAT GTCCTGGGGA AGGGACCAGG GGCATCTTGT TGATCTTGTT CAAAGAGAGG
421 TCAAGGCTTT ACTGCAGTCT GCACTTGATA TTGCACTCTG TGTTGTACGT GCTAATCCCA
481 AAGTTCTTGA GGGGTTGGGA GCTCAATTGG AAGAGAATGA GAAAGTAGAA GGTGAGCAAC
541 TACAAGAATG GTTGAGCATG GTCGTTGCAC CAGCAGAACT TAATTTTTTC ATAAAGGGCA
601 AAGAAGGATC TTTACTTCCA CTGCAAGCAG GCTCTGGATG ACAGAAAGTG CAAGCTTAAC
661 CCAGTCAACA GAGTACAAAA ACGAATTTCG AATAAGGATT TCCACGCCTT CATCTGTGCT
721 CTCTAGGGAA ACAAAGCTAT ATTCCTGAAC CAGATGGGCA TCACTTGCTG GCATCCAGAC
781 AGCGTGATTT TGCATGTGTT AGTAAGGTAA ATTGACAGCT ACTTAACTCC TACAAACACC
841 ATTGTTATAT ACCACGATCA CAGATTGTTA ACTCGAAGAG AAGGCATGCG CAAAAGATCT
901 AGCCAAGAGT TTCTTGGTTC TCTAGAGGTG AAATGAAAAT AGGCTCGTGA TCAGAGTGAT
961 GTCTGTTGGT ATTTTCTTCT ACATCACTTG TACATGTGCC CAATTATTCT TAGTAATTAC
1021 ACAAGAATCA ATACCTTAAT ATTGTAATAT ACATCTGTAC TGCAGCTTAC AAAGAACTAA
1081 TCCATGTCTA ATGTTGTGGG AGAAGGTGTG AAACTGTGTA TACAATGTAA TCTTTCTGAT
1141 TATGAGACAA AGTTTCAGTA GGTTGCCATA AAAAAAAAAA ATTGACTTGT GTAAAA

Predicted gene structure (within gDNA segment 1 to 4409):

Exon 1 443 510 ( 68 n); cDNA 1 68 ( 68 n); score: 1.000
Intron 1 511 1827 (1317 n); Pd: 0.996 (s: 1.00), Pa: 0.000 (s: 1.00)
Exon 2 1828 2148 ( 321 n); cDNA 69 389 ( 321 n); score: 1.000
Intron 2 2149 2282 ( 134 n); Pd: 0.987 (s: 1.00), Pa: 0.710 (s: 1.00)
Exon 3 2283 2406 ( 124 n); cDNA 390 513 ( 124 n); score: 1.000
Intron 3 2407 2891 ( 485 n); Pd: 0.975
(s: 1.00), Pa: 0.998 (s: 1.00) Exon 4 2892 3548 ( 657 n); cDNA 514 1170 ( 657 n); score: 1.000 PPA cDNA 1171 1182 MATCH C06HBa0054K13.1-5+ SGN-U318431+ 1.000 1170 0.978 C PGS_C06HBa0054K13.1-5+_SGN-U318431+ (443 510,1828 2148,2283 2406,2892 3548) Alignment (genomic DNA sequence = upper lines): ATGAAGCAGG GCACGCAGTA GTTGGGACTG CTGTTGCAAA TCTTCTTTCC GGGCAGCCAC 502 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATGAAGCAGG GCACGCAGTA GTTGGGACTG CTGTTGCAAA TCTTCTTTCC GGGCAGCCAC 60 GGGTTGAGGT GATCATTATT ATCACTTTCT CTGAGAAATT CTTGTTCTTC CTGATTTACT 562 |||||||| GGGTTGAG.. .......... .......... .......... .......... .......... 68 GCAAGTTCGT GTTATTTGAT TTGTGTTCCA CTGGATAACA TCTGACATCT AAAAAGAATT 622 .......... .......... .......... .......... .......... .......... 68 GATATGATAG AACTACTCTA GAAATAGAAT TTCTTAAGTT TGTCATGTTT CAGAACAGGA 682 .......... .......... .......... .......... .......... .......... 68 AAATCAAATG TCTGCATGTC TTGAACAGCC TGGTTTTGAC CTTTTCCAGG GACATTATGC 742 .......... .......... .......... .......... .......... .......... 68 ATCAACAAAA CCACTCCCCT GCATTTGGTA TGGGACCTCT AATAGTGTTT AAATTGGGTT 802 .......... .......... .......... .......... .......... .......... 68 CACAGTTTGT TCTTGGGGTG ATGGATTATG GAGAAATATT TGGATCATTG ATCAATGCAG 862 .......... .......... .......... .......... .......... .......... 68 TCATCTGTGA CCTATCCTTC TCAATGTAGT AGCACTGTTA TTGAAGCAGA ATAAAATAAA 922 .......... .......... .......... .......... .......... .......... 68 ATATAAAAAA TAGAAAATGG AGAAGTAGTG GCTTATTGGT GTCCTGTTGA TTGATAAGAA 982 .......... .......... .......... .......... .......... .......... 68 TGTCCAAAAT GGAAAAATGC CTGTGTTTGG ATCTAATACT TTATTCTAAT AAAAAATGGG 1042 .......... .......... .......... .......... .......... .......... 68 TGTCTGGATT TGATAAGGAA TAGGAAATGA CTTGGTTCTG ATGTCAGACT TGGGTACAAC 1102 .......... .......... .......... .......... .......... .......... 68 TCAAATACGC TACTGATTGT TTGACCTTTA TTTTCCCTAG AAAGGCAACT CTGGAAGAAT 1162 .......... .......... 
.......... .......... .......... .......... 68 GTTATATGTA CATAATGACC TCACTTATCA GGAAAAAACA ATATAAGGTT CTTTCTCCCT 1222 .......... .......... .......... .......... .......... .......... 68 ATCTCTCTCT TTTCTGGTGT TCTCCAGATT CATGGAAATA TATTTTCTGG GTAAAAGGGT 1282 .......... .......... .......... .......... .......... .......... 68 TGTATGCCTT CCTTAATGTT GTTTCTCCTC AACAAAAAAA GAAGAAATAG TATCATAGTT 1342 .......... .......... .......... .......... .......... .......... 68 CTATATTTCT GGCTTGAGTA GGTGAGGTAT TTTATTCCAT GATTTTGGTG TTGGTAAATA 1402 .......... .......... .......... .......... .......... .......... 68 GAGAGCGAGC GACTCACTCT CCAACCCCCC AACTTTGTTT TTCTCAATGT GAAAGTTGTT 1462 .......... .......... .......... .......... .......... .......... 68 ACTAATGAGA AATCTCAAGT GATTTTCCTT GTTATTTCCC CTTCCTCTCT TTCTTAATTT 1522 .......... .......... .......... .......... .......... .......... 68 TGACGGCACC AGTTCATTTC TTTTACCTCC CCCAACTATC TGTTTCAAAT GAACCATACC 1582 .......... .......... .......... .......... .......... .......... 68 AACATGGTTC TGGAAGTGGT CTTCCTGCAA GTTCTCTCAA TATCCTGATC CTTTTCATGG 1642 .......... .......... .......... .......... .......... .......... 68 GAAAGTTTGC AAGTGAGGTC AGCCATTTCT ATATTGGTAC AGATGCTTAA AGTCTTTTGC 1702 .......... .......... .......... .......... .......... .......... 68 TAAAATCAGT ACTTTGCTTC TACTCAACCT TGCTTTTCTC AAACTATGAT TGTTGCTTTT 1762 .......... .......... .......... .......... .......... .......... 68 GTTGTACATA ATTGGGATTA GTGCAGTTAT GGTATTAATA ACAGTTCTCT TTTTTCGCTA 1822 .......... .......... .......... .......... .......... .......... 
68 ACCAGAAGCT GAGCATATTG CCAAGATCTG GAGGGGCTTT AGGATTTACT TATATTCCTC 1882 ||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| .....AAGCT GAGCATATTG CCAAGATCTG GAGGGGCTTT AGGATTTACT TATATTCCTC 123 CAACCAATGA GGACAGATAT CTGCTCTTTG TTGATGAATT GCGTGGGAGG TTGGTCACTC 1942 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAACCAATGA GGACAGATAT CTGCTCTTTG TTGATGAATT GCGTGGGAGG TTGGTCACTC 183 TTCTTGGTGG ACGTGCTGCG GAAGAAGTAT TGTACTCTGG ACGTGTATCT ACTGGTGCTC 2002 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCTTGGTGG ACGTGCTGCG GAAGAAGTAT TGTACTCTGG ACGTGTATCT ACTGGTGCTC 243 TTGATGATAT ACGCAGAGCT ACAGACATGG CCTACAAGGC AGTGGCTGAA TATGGTCTTA 2062 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGATGATAT ACGCAGAGCT ACAGACATGG CCTACAAGGC AGTGGCTGAA TATGGTCTTA 303 GTCAGACCAT TGGCCCTATC TCAGTGGCAA CACTTTCTGG AGGTGGGATG GATGATGGTG 2122 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTCAGACCAT TGGCCCTATC TCAGTGGCAA CACTTTCTGG AGGTGGGATG GATGATGGTG 363 GATCCATGTC CTGGGGAAGG GACCAGGTCT TTCCGCTTTT GCCCCTTTCT TGTTTTCTGA 2182 |||||||||| |||||||||| |||||| GATCCATGTC CTGGGGAAGG GACCAG.... .......... .......... .......... 389 ATTCTCTGCT TTTTTCTGCT CTAACATTTA ATCCCAACTT ATGTTTCTAA CTTGAAAATA 2242 .......... .......... .......... .......... .......... .......... 389 CCACATCTTT AGCAGAACTG AAAGTAAAAT TATTCCACAG GGGCATCTTG TTGATCTTGT 2302 |||||||||| |||||||||| .......... .......... .......... .......... GGGCATCTTG TTGATCTTGT 409 TCAAAGAGAG GTCAAGGCTT TACTGCAGTC TGCACTTGAT ATTGCACTCT GTGTTGTACG 2362 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCAAAGAGAG GTCAAGGCTT TACTGCAGTC TGCACTTGAT ATTGCACTCT GTGTTGTACG 469 TGCTAATCCC AAAGTTCTTG AGGGGTTGGG AGCTCAATTG GAAGGTAATA TGTTCAATGT 2422 |||||||||| |||||||||| |||||||||| |||||||||| |||| TGCTAATCCC AAAGTTCTTG AGGGGTTGGG AGCTCAATTG GAAG...... .......... 513 CACTTTAGTA GTCAAATTTT TGGTATTCTT TAGTATTGCT TTCTTTCAAT TAACTTCACA 2482 .......... .......... 
.......... .......... .......... .......... 513 AGGTAATGAT TTTGCATTAA AGATAAATAG AAGAATCTCA TTTTATTTTT AATATGTCAT 2542 .......... .......... .......... .......... .......... .......... 513 ATTTTTAATA TGTTAATTAA TAACAAAGAT CTCTTTATGA TATGTGATGT GTCTTTTCCT 2602 .......... .......... .......... .......... .......... .......... 513 TTCAATCTCT TTACCTTTTT CTTTGTGGAA GGTGGATACT CTTTCTATTC CATTTTATGA 2662 .......... .......... .......... .......... .......... .......... 513 CATTGTTTTG GTTTAGTATG TGTTAAAAGT ATGGCACTCT TTTATGTTTG GGAACTCTCT 2722 .......... .......... .......... .......... .......... .......... 513 AATTCTATAC TTCCTATTGT ACTGTCAATG TGATGCTCTT TTCACTACAA AAATGACATG 2782 .......... .......... .......... .......... .......... .......... 513 TTTAAGACCA CAAGTTTTTA TGGGTACTAT ATGCTCAGAC CGACATATAA AATGAAACTG 2842 .......... .......... .......... .......... .......... .......... 513 GAGTTTTATA TTTGGTACTC TTTTATGGAT TGATATTTCT TCCTCTCAGA GAATGAGAAA 2902 | |||||||||| .......... .......... .......... .......... 
.........A GAATGAGAAA 524 GTAGAAGGTG AGCAACTACA AGAATGGTTG AGCATGGTCG TTGCACCAGC AGAACTTAAT 2962 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTAGAAGGTG AGCAACTACA AGAATGGTTG AGCATGGTCG TTGCACCAGC AGAACTTAAT 584 TTTTTCATAA AGGGCAAAGA AGGATCTTTA CTTCCACTGC AAGCAGGCTC TGGATGACAG 3022 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTTTCATAA AGGGCAAAGA AGGATCTTTA CTTCCACTGC AAGCAGGCTC TGGATGACAG 644 AAAGTGCAAG CTTAACCCAG TCAACAGAGT ACAAAAACGA ATTTCGAATA AGGATTTCCA 3082 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAGTGCAAG CTTAACCCAG TCAACAGAGT ACAAAAACGA ATTTCGAATA AGGATTTCCA 704 CGCCTTCATC TGTGCTCTCT AGGGAAACAA AGCTATATTC CTGAACCAGA TGGGCATCAC 3142 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CGCCTTCATC TGTGCTCTCT AGGGAAACAA AGCTATATTC CTGAACCAGA TGGGCATCAC 764 TTGCTGGCAT CCAGACAGCG TGATTTTGCA TGTGTTAGTA AGGTAAATTG ACAGCTACTT 3202 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGCTGGCAT CCAGACAGCG TGATTTTGCA TGTGTTAGTA AGGTAAATTG ACAGCTACTT 824 AACTCCTACA AACACCATTG TTATATACCA CGATCACAGA TTGTTAACTC GAAGAGAAGG 3262 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AACTCCTACA AACACCATTG TTATATACCA CGATCACAGA TTGTTAACTC GAAGAGAAGG 884 CATGCGCAAA AGATCTAGCC AAGAGTTTCT TGGTTCTCTA GAGGTGAAAT GAAAATAGGC 3322 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CATGCGCAAA AGATCTAGCC AAGAGTTTCT TGGTTCTCTA GAGGTGAAAT GAAAATAGGC 944 TCGTGATCAG AGTGATGTCT GTTGGTATTT TCTTCTACAT CACTTGTACA TGTGCCCAAT 3382 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCGTGATCAG AGTGATGTCT GTTGGTATTT TCTTCTACAT CACTTGTACA TGTGCCCAAT 1004 TATTCTTAGT AATTACACAA GAATCAATAC CTTAATATTG TAATATACAT CTGTACTGCA 3442 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TATTCTTAGT AATTACACAA GAATCAATAC CTTAATATTG TAATATACAT CTGTACTGCA 1064 GCTTACAAAG AACTAATCCA TGTCTAATGT TGTGGGAGAA GGTGTGAAAC TGTGTATACA 3502 |||||||||| |||||||||| |||||||||| 
|||||||||| |||||||||| |||||||||| GCTTACAAAG AACTAATCCA TGTCTAATGT TGTGGGAGAA GGTGTGAAAC TGTGTATACA 1124 ATGTAATCTT TCTGATTATG AGACAAAGTT TCAGTAGGTT GCCATA 3548 |||||||||| |||||||||| |||||||||| |||||||||| |||||| ATGTAATCTT TCTGATTATG AGACAAAGTT TCAGTAGGTT GCCATA 1170 hqPGS_C06HBa0054K13.1-5+_SGN-U318431+ (443 510,1828 2148,2283 2406,2892 3548) ******************************************************************************** EST sequence 2 +strand 569 n (File: SGN-U318432+) 1 TTCGAATAAG GATCTGTACG CCTTCATCTG TGCTCTCTAG GGAAACAAAG CTATATTCCT 61 GAACCAGATG GGCATCACTT GCTGGCATCC AGACAGCGTG ATTTTGCATG TAATTGACAG 121 CTACGTAACT CCTACAAACA CCATTGTTAT ATACCACGAT CACAGATTGT TAACTCGAAG 181 AGAAGGCATG CGCAAAAGAT CTAGCCAAGA GTTTCTTGGT TCTAGAGGTG AAATGAAAAT 241 AGTTTAGGCT CGTGATCAGA GTGATGTCTG TTGGTATTTT CTTCTACATC ACTTGTACAT 301 GTGCCCAATT ATTCTTAGTA ATTACACAAG AATCAATACC TTAATATTGT AATATACATC 361 TGTACTGCAG CTTACAAAGA ACTAATCCAT GTCTAATGTT GTGGGAGAAG GTGTGAAACT 421 GTGTATACAA TGTAATCTTT CTGATTATGA GACAAAGTTT CAGTAGGTTG CCATAATCTT 481 TGACTTGTGT AAAATTACAA GTATTGGATG AGGTAATAGT GACAATTCCT TGTGCTTGTG 541 GTTAAAAAAA AAAAAAAAAA AAAAAAAAA Predicted gene structure (within gDNA segment 2312 to 4468): Exon 1 3065 3618 ( 554 n); cDNA 1 545 ( 545 n); score: 0.945 PPA cDNA 546 569 MATCH C06HBa0054K13.1-5+ SGN-U318432+ 0.945 554 0.974 C PGS_C06HBa0054K13.1-5+_SGN-U318432+ (3065 3618) Alignment (genomic DNA sequence = upper lines): TTCGAATAAG GATTTCCACG CCTTCATCTG TGCTCTCTAG GGAAACAAAG CTATATTCCT 3124 |||||||||| ||| | ||| |||||||||| |||||||||| |||||||||| |||||||||| TTCGAATAAG GATCTGTACG CCTTCATCTG TGCTCTCTAG GGAAACAAAG CTATATTCCT 60 GAACCAGATG GGCATCACTT GCTGGCATCC AGACAGCGTG ATTTTGCATG TGTTAGTAAG 3184 |||||||||| |||||||||| |||||||||| |||||||||| ||||||| | | GAACCAGATG GGCATCACTT GCTGGCATCC AGACAGCGTG ATTTTGC--- ----A-T--- 109 GTAAATTGAC AGCTACTTAA CTCCTACAAA CACCATTGTT ATATACCACG ATCACAGATT 3244 ||| |||||| |||||| ||| |||||||||| |||||||||| |||||||||| |||||||||| GTA-ATTGAC AGCTACGTAA 
CTCCTACAAA CACCATTGTT ATATACCACG ATCACAGATT 168 GTTAACTCGA AGAGAAGGCA TGCGCAAAAG ATCTAGCCAA GAGTTTCTTG GTTCTCTAGA 3304 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| | | |||||| GTTAACTCGA AGAGAAGGCA TGCGCAAAAG ATCTAGCCAA GAGTTTCTTG G-T-TCTAGA 226 GGTGAAATG- AAA-A---TA GGCTCGTGAT CAGAGTGATG TCTGTTGGTA TTTTCTTCTA 3359 ||||||||| ||| | || |||||||||| |||||||||| |||||||||| |||||||||| GGTGAAATGA AAATAGTTTA GGCTCGTGAT CAGAGTGATG TCTGTTGGTA TTTTCTTCTA 286 CATCACTTGT ACATGTGCCC AATTATTCTT AGTAATTACA CAAGAATCAA TACCTTAATA 3419 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CATCACTTGT ACATGTGCCC AATTATTCTT AGTAATTACA CAAGAATCAA TACCTTAATA 346 TTGTAATATA CATCTGTACT GCAGCTTACA AAGAACTAAT CCATGTCTAA TGTTGTGGGA 3479 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGTAATATA CATCTGTACT GCAGCTTACA AAGAACTAAT CCATGTCTAA TGTTGTGGGA 406 GAAGGTGTGA AACTGTGTAT ACAATGTAAT CTTTCTGATT ATGAGACAAA GTTTCAGTAG 3539 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAAGGTGTGA AACTGTGTAT ACAATGTAAT CTTTCTGATT ATGAGACAAA GTTTCAGTAG 466 GTTGCCATAA TCTTTGACTT GTGTAAAATT ACAAGTATTG GATGAGGTAA TAGTGACAAT 3599 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTTGCCATAA TCTTTGACTT GTGTAAAATT ACAAGTATTG GATGAGGTAA TAGTGACAAT 526 TCCTTGTGCT TGTGGTTAA 3618 |||||||||| ||||||||| TCCTTGTGCT TGTGGTTAA 545 hqPGS_C06HBa0054K13.1-5+_SGN-U318432+ (3065 3618) Total number of EST alignments reported: 2 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 5340: PGL 1 (+ strand): 443 3618 AGS-1 (443 510,1828 2148,2283 2406,2892 3618) SCR (e 1.000 d 0.996 a 0.000,e 1.000 d 0.987 a 0.710,e 1.000 d 0.975 a 0.998,e 1.000) Exon 1 443 510 ( 68 n); score: 1.000 Intron 1 511 1827 (1317 n); Pd: 0.996 Pa: 0.000 Exon 2 1828 2148 ( 321 n); score: 1.000 Intron 2 2149 2282 ( 134 n); Pd: 0.987 Pa: 0.710 Exon 3 2283 2406 ( 124 n); score: 1.000 Intron 3 2407 2891 ( 485 n); Pd: 0.975 Pa: 
0.998 Exon 4 2892 3618 ( 727 n); score: 1.000 PGS (443 510,1828 2148,2283 2406,2892 3548) SGN-U318431+ PGS (3065 3618) SGN-U318432+ 3-phase translation of AGS-1 (+strand): . . . . . . 443 ATGAAGCAGGGCACGCAGTAGTTGGGACTGCTGTTGCAAATCTTCTTTCCGGGCAGCCAC M K Q G T Q - L G L L L Q I F F P G S H - S R A R S S W D C C C K S S F R A A T E A G H A V V G T A V A N L L S G Q P . : . . . . . 503 GGGTTGAG : AAGCTGAGCATATTGCCAAGATCTGGAGGGGCTTTAGGATTTACTTATATTC G L R : S - A Y C Q D L E G L - D L L I F G - : E A E H I A K I W R G F R I Y L Y S R V E : K L S I L P R S G G A L G F T Y I . . . . . . 1880 CTCCAACCAATGAGGACAGATATCTGCTCTTTGTTGATGAATTGCGTGGGAGGTTGGTCA L Q P M R T D I C S L L M N C V G G W S S N Q - G Q I S A L C - - I A W E V G H P P T N E D R Y L L F V D E L R G R L V . . . . . . 1940 CTCTTCTTGGTGGACGTGCTGCGGAAGAAGTATTGTACTCTGGACGTGTATCTACTGGTG L F L V D V L R K K Y C T L D V Y L L V S S W W T C C G R S I V L W T C I Y W C T L L G G R A A E E V L Y S G R V S T G . . . . . . 2000 CTCTTGATGATATACGCAGAGCTACAGACATGGCCTACAAGGCAGTGGCTGAATATGGTC L L M I Y A E L Q T W P T R Q W L N M V S - - Y T Q S Y R H G L Q G S G - I W S A L D D I R R A T D M A Y K A V A E Y G . . . . . . 2060 TTAGTCAGACCATTGGCCCTATCTCAGTGGCAACACTTTCTGGAGGTGGGATGGATGATG L V R P L A L S Q W Q H F L E V G W M M - S D H W P Y L S G N T F W R W D G - W L S Q T I G P I S V A T L S G G G M D D . . . : . . . 2120 GTGGATCCATGTCCTGGGGAAGGGACCAG : GGGCATCTTGTTGATCTTGTTCAAAGAGAGG V D P C P G E G T R : G I L L I L F K E R W I H V L G K G P : G A S C - S C S K R G G G S M S W G R D Q : G H L V D L V Q R E . . . . . . 2314 TCAAGGCTTTACTGCAGTCTGCACTTGATATTGCACTCTGTGTTGTACGTGCTAATCCCA S R L Y C S L H L I L H S V L Y V L I P Q G F T A V C T - Y C T L C C T C - S Q V K A L L Q S A L D I A L C V V R A N P . . . . : . . 2374 AAGTTCTTGAGGGGTTGGGAGCTCAATTGGAAG : AGAATGAGAAAGTAGAAGGTGAGCAAC K F L R G W E L N W K : R M R K - K V S N S S - G V G S S I G R : E - E S R R - A T K V L E G L G A Q L E : E N E K V E G E Q . . . . . . 
2919 TACAAGAATGGTTGAGCATGGTCGTTGCACCAGCAGAACTTAATTTTTTCATAAAGGGCA Y K N G - A W S L H Q Q N L I F S - R A T R M V E H G R C T S R T - F F H K G Q L Q E W L S M V V A P A E L N F F I K G . . . . . . 2979 AAGAAGGATCTTTACTTCCACTGCAAGCAGGCTCTGGATGACAGAAAGTGCAAGCTTAAC K K D L Y F H C K Q A L D D R K C K L N R R I F T S T A S R L W M T E S A S L T K E G S L L P L Q A G S G - Q K V Q A - . . . . . . 3039 CCAGTCAACAGAGTACAAAAACGAATTTCGAATAAGGATTTCCACGCCTTCATCTGTGCT P V N R V Q K R I S N K D F H A F I C A Q S T E Y K N E F R I R I S T P S S V L P S Q Q S T K T N F E - G F P R L H L C . . . . . . 3099 CTCTAGGGAAACAAAGCTATATTCCTGAACCAGATGGGCATCACTTGCTGGCATCCAGAC L - G N K A I F L N Q M G I T C W H P D S R E T K L Y S - T R W A S L A G I Q T S L G K Q S Y I P E P D G H H L L A S R . . . . . . 3159 AGCGTGATTTTGCATGTGTTAGTAAGGTAAATTGACAGCTACTTAACTCCTACAAACACC S V I L H V L V R - I D S Y L T P T N T A - F C M C - - G K L T A T - L L Q T P Q R D F A C V S K V N - Q L L N S Y K H . . . . . . 3219 ATTGTTATATACCACGATCACAGATTGTTAACTCGAAGAGAAGGCATGCGCAAAAGATCT I V I Y H D H R L L T R R E G M R K R S L L Y T T I T D C - L E E K A C A K D L H C Y I P R S Q I V N S K R R H A Q K I . . . . . . 3279 AGCCAAGAGTTTCTTGGTTCTCTAGAGGTGAAATGAAAATAGGCTCGTGATCAGAGTGAT S Q E F L G S L E V K - K - A R D Q S D A K S F L V L - R - N E N R L V I R V M - P R V S W F S R G E M K I G S - S E - . . . . . . 3339 GTCTGTTGGTATTTTCTTCTACATCACTTGTACATGTGCCCAATTATTCTTAGTAATTAC V C W Y F L L H H L Y M C P I I L S N Y S V G I F F Y I T C T C A Q L F L V I T C L L V F S S T S L V H V P N Y S - - L . . . . . . 3399 ACAAGAATCAATACCTTAATATTGTAATATACATCTGTACTGCAGCTTACAAAGAACTAA T R I N T L I L - Y T S V L Q L T K N - Q E S I P - Y C N I H L Y C S L Q R T N H K N Q Y L N I V I Y I C T A A Y K E L . . . . . . 3459 TCCATGTCTAATGTTGTGGGAGAAGGTGTGAAACTGTGTATACAATGTAATCTTTCTGAT S M S N V V G E G V K L C I Q C N L S D P C L M L W E K V - N C V Y N V I F L I I H V - C C G R R C E T V Y T M - S F - . . . . . . 
3519 TATGAGACAAAGTTTCAGTAGGTTGCCATAATCTTTGACTTGTGTAAAATTACAAGTATT Y E T K F Q - V A I I F D L C K I T S I M R Q S F S R L P - S L T C V K L Q V L L - D K V S V G C H N L - L V - N Y K Y . . . . 3579 GGATGAGGTAATAGTGACAATTCCTTGTGCTTGTGGTTAA G - G N S D N S L C L W L D E V I V T I P C A C G - W M R - - - Q F L V L V V Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-5+_PGL-1_AGS-1_PPS_1 (445 510,1828 2148,2283 2406,2892 3019) (frame '0'; 636 bp, 212 residues) 1 EAGHAVVGTA VANLLSGQPR VEKLSILPRS GGALGFTYIP PTNEDRYLLF VDELRGRLVT 61 LLGGRAAEEV LYSGRVSTGA LDDIRRATDM AYKAVAEYGL SQTIGPISVA TLSGGGMDDG 121 GSMSWGRDQG HLVDLVQREV KALLQSALDI ALCVVRANPK VLEGLGAQLE ENEKVEGEQL 181 QEWLSMVVAP AELNFFIKGK EGSLLPLQAG SG- ... finished at: Mon Aug 28 22:24:02 2006 ________________________________________________________________________________ Sequence 6: C06HBa0054K13.1-6, from 1 to 7598, both strands analyzed. ... started at: Mon Aug 28 22:24:02 2006 EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA +strand ... ... found all matches, elapsed seconds = 2 ... matches indexed, elapsed seconds = 2 HitsTableSize = 1 EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA -strand ... ... found all matches, elapsed seconds = 2 ... 
matches indexed, elapsed seconds = 2
HitsTableSize = 3
********************************************************************************
EST sequence 1 +strand 1298 n (File: SGN-U322835+)

1 CAATTCCCCT TTCAATGAAT TTCCCAAATC CCCATTCAAT TTTACTTCTT TTTGGATAAA
61 AAATGATAAG TGTTTGTGTG TGTTTGCAGC CCACAATAAT GGCGAAGATG AAGGTGGTGA
121 GGAGTGAAAT TGCTGCGAAA CAAGTGGTTG TGATCGAGGA AAATGAGGAG ATACATTGGT
181 ATGCTTCTTG AATTTCGTCC GGAGGACACA GCTACTCAAA TCCAAATGCA AAGGATCTTC
241 AAGAGTAAAG GAATAAAACA AGACCTTGGT TCATTGTGAA AGTTATGTGT GATCCATAGC
301 TGAAAGTTTG GCATTTATCT CGGTTTTCCC TTCCTATTTG ATTTTTTTTC AATAATCGCC
361 GTTAGGTCAA TTCTGCTATT TAAGTCAAGT TTCAACCAGC TTTTGGGAAT TCTGGAAGTC
421 ACGTGCTCAT CAAAGTAAAT TTTATATACA CTGATGCAAC GGTCTATCTC CACAATGGGT
481 GTTCAAATCC AGTGGTGCAT TGTGACTTGA AGCCAAGTGT CTTGCTTGAT CAAGACATGG
541 TTGGCCATGT CAGTGATTTT GGCATTGCAA AATTGTTAGG TGCAGGGGAG AGTTTTGTTC
601 AAACAAGGAC AATAGCAACC ATTGGATAAT TGCTCCAGAG TATGGACAAG ATGGAATCGT
661 ATCCACAAGC TGCGATGTTT ATAGTTTCGG TATCCTGATT GATGGAGACG TTTACAAGAA
721 TCAGACCAGG TGATGAAAGA TTTACTGGAG AGTTGAGCAT ACGACGTTGG GTTAGTGATT
781 CTTTTCCAGA TGAGATTCAT AAGGTGGTGG ATGCTAATTT GGTACAGCCT AGGGGATGAA
841 CGAATTGACG CAAAGATGCA GTGTCTGTTG TCTATTATAG AGTTAGCTTT GAGCTGTACT
901 TTAGCAACAC CTGATGCAAG AATTAGTATG GAAGATTCTC TTTCAACACT TCAAAATATC
961 AGGCTCCAGT TTGTCAATAG TCGCCACCGA AAAAAGCAAC TGAAGGATTT AGTACCGAAA
1021 AAAGCAACTT GCTTGGTAAT GGCAAGGTCT ACAATGTACA ATTGGAGGGT GCATTCAAAA
1081 GTTTTGATAC AGAAGAATGT GAAATCTGAC CAAAGTCATC AAAGCCTTAA TGTTAGAATA
1141 CATGTCTAGT GGGACACTTG ATAAATGGCT GTACTCTCAC AAGTTGTTCT TGGATTTACT
1201 TCATATTATG TACTCTTTCA CTTTCAGTGC CAGCTGGAAT GGTGATTTTC TAGCTACTGG
1261 AGTTCACAAT TCTAATCCAT AAAAAAAAAA AAAAAAAA

Predicted gene structure (within gDNA segment 1 to 7598):

Exon 1 52 239 ( 188 n); cDNA 455 638 ( 184 n); score: 0.872
Intron 1 240 336 ( 97 n); Pd: 0.999 (s: 0.90), Pa: 0.976 (s: 0.90)
Exon 2 337 683 ( 347 n); cDNA 639 987 ( 349 n); score: 0.818
PPA cDNA 1279 1298
MATCH C06HBa0054K13.1-6+
SGN-U322835+ 0.837 535 0.412 C PGS_C06HBa0054K13.1-6+_SGN-U322835+ (52 239,337 683) Alignment (genomic DNA sequence = upper lines): TGCAATGGAT TATCTCCACA ATGGCTATTC AACGCCTGTG GTGCATTGTG ACTTGAAGCC 111 ||||| || |||||||||| |||| | ||| || || ||| |||||||||| |||||||||| TGCAACGGTC TATCTCCACA ATGGGTGTTC AAATCCAGTG GTGCATTGTG ACTTGAAGCC 514 AAGTAATGTC TTGTTAGATG AAGAAATGGT TGCTCATGTA AGTGATTTTG GCATTGCAAA 171 ||| |||| ||| | ||| |||| ||||| || ||||| |||||||||| |||||||||| AAG---TGTC TTGCTTGATC AAGACATGGT TGGCCATGTC AGTGATTTTG GCATTGCAAA 571 AATGTTAGGT GCAGGGGAGG CTTTTGTTCA AACAAGGACA GTTGCAACCA TTGGATATAT 231 | |||||||| ||||||||| ||||||||| |||||||||| | ||||||| ||||||| || ATTGTTAGGT GCAGGGGAGA GTTTTGTTCA AACAAGGACA ATAGCAACCA TTGGATA-AT 630 TGCTCCAGGT ATACTTTAAG TTTTCTCGTA TCGCTTTAAA TACTCAAAAC AATTATTTCC 291 |||||||| TGCTCCAG.. .......... .......... .......... .......... .......... 638 CCTAGGTATA AATTGATTTA TGATCATTTT CTACCATGAT TGCAGAGTAT GGACAAGATG 351 ||||| |||||||||| .......... .......... .......... .......... 
.....AGTAT GGACAAGATG 653 GAATAGTATC CACGAGTTGT GATGTTTATA GTTTTGGCAT CCTGA-TGAT GGAGACGTTC 410 |||| ||||| ||| || || |||||||||| |||| || || ||||| |||| ||||||||| GAATCGTATC CACAAGCTGC GATGTTTATA GTTTCGGTAT CCTGATTGAT GGAGACGTTT 713 ACACGAACAA GACCAAGTGA TGAGATATTT ACTGGAGACT TGAGCATACA ACGTTGGATT 470 ||| ||| | ||||| |||| ||| | |||| |||||||| | ||||||||| ||||||| || ACAAGAATCA GACCAGGTGA TGAAAGATTT ACTGGAGAGT TGAGCATACG ACGTTGGGTT 773 AGTGATTCCT TTCCGGGGGA ACTTCACAAG GTGGTAGATT CTAATTTGGT ACAGCC-AGG 529 |||||||| | |||| | || |||| ||| ||||| ||| |||||||||| |||||| ||| AGTGATTCTT TTCCAGATGA GATTCATAAG GTGGTGGATG CTAATTTGGT ACAGCCTAGG 833 AGAAGAACAA ATCGCTGCAA AGATGCAATG TTTGTTATCT ATCATGGAAT TAGCTTTGAA 589 || |||| | || | |||| ||||||| || | |||| ||| || || || | ||||||||| GGATGAACGA ATTGACGCAA AGATGCAGTG TCTGTTGTCT ATTATAGAGT TAGCTTTGAG 893 CTGCACTTTA GTGAGACCTG ATGAAAGAAT TAGCATGAAT GATGCTCTTT CAGCACTCAA 649 ||| |||||| | | ||||| ||| |||||| ||| ||| | ||| |||||| || |||| | CTGTACTTTA GCAACACCTG ATGCAAGAAT TAGTATGGAA GATTCTCTTT CAACACTTCA 953 AAAGATTAGA CTACAGCTTG TTAGTAGTCG GCAC 683 ||| || || || ||| ||| | | |||||| ||| AAATATCAGG CTCCAGTTTG TCAATAGTCG CCAC 987 hqPGS_C06HBa0054K13.1-6+_SGN-U322835+ (52 239,337 683) ******************************************************************************** EST sequence 2 -strand 1754 n (File: SGN-U314435-) 1 ATATGTAAAT TGTAATGAAT AATAAAATGT AACAATTACA AAATTCAAAG ACATACGACA 61 ATAAATCATT CATAGAAAAC TAAATTAAGA AGTCTAACAG AGTGAAAAAA AAGACTAAAA 121 GAAACAGAAT ATTTAAGAAA AAAATACAAT ATAGACATTA TTCTTATAAG ATTGATGTAT 181 ATATAGATGA AAAAATAAGT AAAAATTAGT ACATAACTCT CTGAGTTTTT ATCGACTCCT 241 TTCATATCCT ATCAAAATTC AAGAGTTTTC AGCATCTATT TTTACTATAG AATAAAAATA 301 ATAATCTTGA TAGGGGTTTC ATGAGATAAA GAAGTTGAAT TTATACAAAT AGAATAGAAG 361 GAATACCAAA AGATATCTTA AAGTTTAATT TAAATATTTG GAGGTTATGA ATATGAAAAG 421 ATGATAGTTA TGAATATTAA GAGTGTTGTG CGAAATTTGA GATAATACGA GAAAATATAT 481 AAACGCGAAA AACAAGACAA CAGATTTACG TGGTTCACCA ATAAATTGGC TACGTCCACG 541 GGAAGAGAGG 
GAGCAATTTT ATTATGGAGA GGCAAAAACA GAATTACAGA ATAGGGTTTG 601 CCATAGCGTC TATATATAGT GCTAAGCTAC GCCCTAAAAG GCTTGGGCCC AACATACAGA 661 ATCAACAGAA AATTAAGGGC CCAATACAAC AACATTGTAT ACCGTCGGCG CCGCGTCTCC 721 GCCCCCCCGG ACCCCCAGGC CAGCGCGTCG CCCCCCTGGA CCCCCCGACT CGCTGACCGG 781 GCAGCGAGAC CCCCGTCCTT TCTGTTTGTA GCGGGTCCGA TTCAAGGCAT TCAACATCCG 841 GCTTCACTTC TAACTCATTG GTAGATCTCA GTTGCCTCTT CACCAAGGCT GCCTGCCTCA 901 AAACCACATC ATAGACCATC TGTTCCGATG TCATCGTCCG TTCTCCAGAT GGAGTAGCCA 961 AAATAGCAGA CCGTACAGAA AATTTCCGTC CATTATTAGT TTGCTTTCCA CCACCTCTAT 1021 TGATTCTCTC ATTGGACACC AAATTCCTAT GCCTCGATGA ATCAAAAAAA CGGTTTCCCT 1081 CCCGGACTGA TTCCATGAAA CTTGTCCCAT TTGAGACGTC ACAAGGAGAA ACAACCCATA 1141 ACAAGGCAAC AGACATTCTG AGCAAGAAAA CCTTGGTTGT CTAAAACAAT TCCAAGATTT 1201 TTGTCTTTCT TTTTGTCACC TTCCAAACTT TACTTAGTTT ATAGAAAATA AATTACTCCT 1261 AGTAGATTCC CACTAGAGAA TATTTCCACT CTCAAACAAA ATTACCCACT TTCTCTTCTG 1321 TAGAAAAAGA TTATAAAAAG ACCACAGACA GGCAAACCAA CTTTTCCTCA CTCAAACAGG 1381 AGTAACCTGT TTCTTGACTT ACAAATTCCA CCCTCTCTTT CAACAATGGC CACTCTGAAT 1441 TTCTGTAATT GTATAGTCAC ACAAAGCGAA GTCAAAATTT AAAATTTAAA GTTTATGAAT 1501 TCTGAAAATT TCTAATATAT TTTAACTTAC TAAATTATTA CGAAGTTTGA CTAAAGTTAC 1561 GACTAAATGC GTAACTTACC CTCTAACTCC ACCCCCTGTA TATAACAAAA ACACTAGCAA 1621 AGCAATAGAA AGTTATAAAG ATTGCAACTT TACATATCAA GAAGTGAAGG GAAACAAACC 1681 TTTGTTTCAA GAAATCAAGA ATGGCCACTT CCAAGAATGG AGAATGCAAT TAGTTGGTTT 1741 TTTCTGCACT TTTT Predicted gene structure (within gDNA segment 1 to 7598): Exon 1 240 258 ( 19 n); cDNA 383 400 ( 18 n); score: 0.684 Intron 1 259 812 ( 554 n); Pd: 0.759 (s: 0), Pa: 0.000 (s: 0.64) Exon 2 813 1095 ( 283 n); cDNA 401 675 ( 275 n); score: 0.866 Intron 2 1096 2753 (1658 n); Pd: 0.426 (s: 0.82), Pa: 0.246 (s: 0) Exon 3 2754 2761 ( 8 n); cDNA 676 683 ( 8 n); score: 0.750 Intron 3 2762 2894 ( 133 n); Pd: 0.000 (s: 0), Pa: 0.784 (s: 0.52) Exon 4 2895 2942 ( 48 n); cDNA 684 730 ( 47 n); score: 0.521 Intron 4 2943 3114 ( 172 n); Pd: 0.901 (s: 0.52), Pa: 0.000 (s: 0.52) Exon 5 3115 
3164 ( 50 n); cDNA 731 778 ( 48 n); score: 0.520 Intron 5 3165 5325 (2161 n); Pd: 0.510 (s: 0.52), Pa: 0.000 (s: 0.94) Exon 6 5326 5384 ( 59 n); cDNA 779 837 ( 59 n); score: 0.949 MATCH C06HBa0054K13.1-6+ SGN-U314435- 0.834 467 0.266 C PGS_C06HBa0054K13.1-6+_SGN-U314435- (240 258,813 1095,2754 2761,2895 2942,3115 3164,5326 5384) Alignment (genomic DNA sequence = upper lines): GTATACTTTA AGTTTTCTCG TATCGCTTTA AATACTCAAA ACAATTATTT CCCCTAGGTA 299 || || |||| | | || | GTTTAATTTA AATATT-TG. .......... .......... .......... .......... 400 TAAATTGATT TATGATCATT TTCTACCATG ATTGCAGAGT ATGGACAAGA TGGAATAGTA 359 .......... .......... .......... .......... .......... .......... 400 TCCACGAGTT GTGATGTTTA TAGTTTTGGC ATCCTGATGA TGGAGACGTT CACACGAACA 419 .......... .......... .......... .......... .......... .......... 400 AGACCAAGTG ATGAGATATT TACTGGAGAC TTGAGCATAC AACGTTGGAT TAGTGATTCC 479 .......... .......... .......... .......... .......... .......... 400 TTTCCGGGGG AACTTCACAA GGTGGTAGAT TCTAATTTGG TACAGCCAGG AGAAGAACAA 539 .......... .......... .......... .......... .......... .......... 400 ATCGCTGCAA AGATGCAATG TTTGTTATCT ATCATGGAAT TAGCTTTGAA CTGCACTTTA 599 .......... .......... .......... .......... .......... .......... 400 GTGAGACCTG ATGAAAGAAT TAGCATGAAT GATGCTCTTT CAGCACTCAA AAAGATTAGA 659 .......... .......... .......... .......... .......... .......... 400 CTACAGCTTG TTAGTAGTCG GCACTAGGTG GAATCATTAC CAATCTTCTC TTGTATGTTA 719 .......... .......... .......... .......... .......... .......... 400 TTTAGTTACC AGTCTGCTCT CTTATGGAAT TTAGTTTCGC GCTAGGTGTA TTTTGTGTTG 779 .......... .......... .......... .......... .......... .......... 400 GTTGTTGGTG CTTGATATAT GGAAGTTGAG AATGAGATTT AAAATGCTCA AAACAAGATT 839 ||| || ||| | | ||| | ||| .......... .......... .......... 
...GAGGTTA TGAAT-ATGA AAAGATGATA 426 GTCTTTTCTT ATTTATACTG TTGTGCGGAA TTTGAGATAA TACGAGAAAA TATATAAACG 899 || | | | ||| | | || ||||||| || |||||||||| |||||||||| |||||||||| GT-TATGAAT ATTAAGAGTG TTGTGCGAAA TTTGAGATAA TACGAGAAAA TATATAAACG 485 CGAAAAACAA GACAACATAT TTACGTGGTT CACCAATAAA TTGGCTACGT CCACGGGAAG 959 |||||||||| ||||||| || |||||||||| |||||||||| |||||||||| |||||||||| CGAAAAACAA GACAACAGAT TTACGTGGTT CACCAATAAA TTGGCTACGT CCACGGGAAG 545 AGAGGGAGCA GTTTTATTAT GGAGAGGCAA AAACAGAATT ACAAAATAGG GTTTGCCATA 1019 |||||||||| ||||||||| |||||||||| |||||||||| ||| |||||| |||||||||| AGAGGGAGCA ATTTTATTAT GGAGAGGCAA AAACAGAATT ACAGAATAGG GTTTGCCATA 605 GCGTCTATAT ATATATATAG TGTTAAGCTA CGCCCTAACA GGCTTGGGCC CAACATACAG 1079 ||||| ||||||||| || ||||||| |||||||| | |||||||||| |||||||||| GCGTC----- -TATATATAG TGCTAAGCTA CGCCCTAAAA GGCTTGGGCC CAACATACAG 659 AATTGACAGA TTCAAGGCAT TCAACAAATC TCCACCTTGA CTTGAATTCT CCGAACAGAT 1139 ||| ||||| AATCAACAGA AAATTA.... .......... .......... .......... .......... 675 TCTTCAGACG CACTATGATA GTGCCAGGCC TCCCCCTCTT CCTCAGAGTT GCCCCGCAGG 1199 .......... .......... .......... .......... .......... .......... 675 GCAATTAACA GCTTCTGATG TTGAGCAAGT CCAAACAGTG TTGAAACTTG CTCTGTGGAA 1259 .......... .......... .......... .......... .......... .......... 675 CCGGCTTTGT GAACATATCA GCAGGATTAT CAGCAGTTCC TACTTTCTTC ACCTTGATTC 1319 .......... .......... .......... .......... .......... .......... 675 TCTTCTCACT TCTCAGAAAA TGATACCTTA CGTCAATATG CTTGGTTCTC TCATGATGGA 1379 .......... .......... .......... .......... .......... .......... 675 CTTGATCCTT GGCTAGACAA ATTGCGCTCA AACTGTCACA ATACACCGTA GCCTGATCAT 1439 .......... .......... .......... .......... .......... .......... 675 GATGCAGACC AAGATCACTA ACCAGCCCTT TAAACCAAAT CCCTTCTTTT GTCACCTCTG 1499 .......... .......... .......... .......... .......... .......... 675 TCAAGGCCAT GTACTCCGCT TCCGTAGTAG ACAAAGTCAC TGTAGGTTGC AAAGTTACCT 1559 .......... .......... .......... .......... .......... .......... 
675 TCCAACTGAC GACAGATCCT CCAAGGGTAA ACACATAGCC AGTCATCGAT CTTCTTGTGT 1619 .......... .......... .......... .......... .......... .......... 675 CAACATCTCC AGCATAGTCT GAATCAGAAT AGCCAGTAAC CAAGCACTGA GTATCACCTC 1679 .......... .......... .......... .......... .......... .......... 675 CATAAATGAG ACCAACGTCA GATGTACCTC TAAGGTACCG GAAAATTCTC TTCACAGCCT 1739 .......... .......... .......... .......... .......... .......... 675 GCCAATGTTC TCTCCCTGGT TGTCCCATGA ATCTGCTCAC TACACTGACT GCATGTGCTA 1799 .......... .......... .......... .......... .......... .......... 675 AATCTGGCCT TGTACAGACC ATAGCATACA TCAAACTTCC TACGACACTG GCATAAGGGA 1859 .......... .......... .......... .......... .......... .......... 675 CTCGTGACAT ATACTCCTTC TCTTCTTCTG ACTGTGGAGC GAACATGGCA GTGAGGTGGA 1919 .......... .......... .......... .......... .......... .......... 675 TATTGGCAGC ACTAGGGTTA TCAATAGGCT TAGATGAAGA CATGCCAAAC CTCGCCAAGA 1979 .......... .......... .......... .......... .......... .......... 675 CCTTCTGAAT GTAGCTTCTC TGTGACAAGA AAAGTTTCCT TCTCTCTCTG TCTCTAATGA 2039 .......... .......... .......... .......... .......... .......... 675 TCTCCATCCC TAGAATCTTC CGAGCAGCTC CCAGATCCTT CATCTCAAAC TCCGCACTAA 2099 .......... .......... .......... .......... .......... .......... 675 GTAAACCCTT CAGCTTATGA ATGTCATACT TCTTCTTTGC AGCTATCAGC ATATCATCTA 2159 .......... .......... .......... .......... .......... .......... 675 CATAAAGCAC CAGATAGATG AACGAATCAT CCTTGAGCCT ATTGTAGTAG ACACAACAAT 2219 .......... .......... .......... .......... .......... .......... 675 CATATGAGCT CCGAGTATAG CCCAACTTCG CCATATAGCT GTCAAACCTT TTGTACCACT 2279 .......... .......... .......... .......... .......... .......... 675 GCCTTGGAGA CTGCTTAAGT CCATATAAGG ACTTCTTCAA CTTGCAGACG TGATTTTCCT 2339 .......... .......... .......... .......... .......... .......... 675 TCCCTGGAAC TTGGAAACCA TCCGGCTGAG TCATGTATAT CTCTTCCTCC AACTCTCCAT 2399 .......... .......... .......... .......... .......... .......... 
675 GTAGAAACGC TGTCTTCACA TCAAGTTGTT CAAGCTCCAG ATTCTGATGT GTAACTATCG 2459 .......... .......... .......... .......... .......... .......... 675 CTAGTAACAC TCGGATGGAA GTATGTCTGA CCACTGGTGA GAAGATCTCA TTGTAGTCCA 2519 .......... .......... .......... .......... .......... .......... 675 CTCCCTCTCT TTGGTTGAAA CCTCTGGCAA CAACCCTGGC TATATACTTG ACTCCTTCTG 2579 .......... .......... .......... .......... .......... .......... 675 CTGGTGATAT CCCTTCCTTC TTCTTGAAAA CCCATTTGCA AGTAATAATC TTTCTCCCCG 2639 .......... .......... .......... .......... .......... .......... 675 AAGGCTGTAT GACCAGATCC CATGTCTGAT TCTTGTGTAG GGACTCCATC TCATCTCCCA 2699 .......... .......... .......... .......... .......... .......... 675 TAGCAGCAAA CCATTTTTCA GAATCAGGAC TTAAAATGGC TTCTTTGTAA GTAGACGGCT 2759 | ||| .......... .......... .......... .......... .......... ....AGGGCC 681 CAGATGCATC TACCTCTTCA GCAACCTGCA GTGCATAACC CACCATGTCC TCAAAACCAC 2819 || CA........ .......... .......... .......... .......... .......... 683 ACCTCGTAGG TGGCCGAACT CCAACCCTCC TTGGCCGATC TTGAGCTATA CTCTGATGGA 2879 .......... .......... .......... .......... .......... .......... 683 TATCTGATGG CATAGATTCT GGAATATCAG TTTCTGTCTG TGGCTCTTGA TCCTCCTCTT 2939 || | || || | | ||| | | | | | ||| || | .......... .....ATACA ACAACATTGT ATACCGTCGG CGCCGCGT-C TCCGCCCCCC 727 CAGGTTCCTT TAAATCGCTC TCGTTCTGAA TGACTTGAAA CTCCACCTGT TTGTCAAGAC 2999 | | CGG....... .......... .......... .......... .......... .......... 730 TCCCAGTTTC TGACGTAGTT GTAGGCTTCA CAATGGTTCT AAGCAGAGAA CTTTCATCAA 3059 .......... .......... .......... .......... .......... .......... 730 AGAGAACGTT CCTGCTCATA ATAACCCTCT TTTCTGCTGG AGACCAGATT CTGAAACCTT 3119 ||| .......... .......... .......... .......... .......... .....ACC-C 734 TCACTCCATC TCCGTAGCCC ACAAATACTC CCTTTTTAGC TCTTGGTTCT AACTTACCTT 3179 || ||| | ||| |||| | || | || | || | CCAGGCCAGC -GCGTCGCCC CCCTGGACCC CCCGACTCGC TGACC..... .......... 
778 CACTGACGTG ATAGTAAGCC GTACAACCAA AAGCTTTCAG ATTTGAATAA TCAGCAGCTT 3239 .......... .......... .......... .......... .......... .......... 778 TTCCTGACCA CATCTCCCTA GGTGTCTTGC ACTGTATGCC TATATGTGGT TCGCGGTTAA 3299 .......... .......... .......... .......... .......... .......... 778 CCAAGTAGCA AGCTGTACTA ACCGCTTCTG CCCAGAATCT TCTATCTAGC CCAGCATTAG 3359 .......... .......... .......... .......... .......... .......... 778 AGAGCATGCA CCTTGCTCTC TCCAGAAGTG TTTGATTCAT CCGCTCAGCT ACACCGTTCT 3419 .......... .......... .......... .......... .......... .......... 778 ACTGTGGTGT ATTTCTGACT GTACGATGTC GAGCAATCCC TTCATCCTTA CAGAATTGAT 3479 .......... .......... .......... .......... .......... .......... 778 CAAATTCAGA CCAACAGAAT TCCAGCCCAT TATCAGTTCG TAACCTCTTG ATCTTCTTCC 3539 .......... .......... .......... .......... .......... .......... 778 CTGTTTGATT TTCCATCAAA ATTTTCCTCT CCTTGAACTT CTGGAAAGCT TCACTTTTAT 3599 .......... .......... .......... .......... .......... .......... 778 GCTTCATCAT GTACACCCAA GTCATCCTTG AGTAGTCATC AATAATGGAC ACAAAAAATC 3659 .......... .......... .......... .......... .......... .......... 778 TGCAGCCTCC CAAAGACTCA ACACGGCATG GACCCCAGCA ATCAGAATGG ATATAATCAA 3719 .......... .......... .......... .......... .......... .......... 778 GTGTGCCTTT TGTTCTATGA ATGGCCTTTG GAAACTTGTT GCGATGTAGT TTTCCAAAAA 3779 .......... .......... .......... .......... .......... .......... 778 CACAATGTTG ACAAAACTCT AGGCTCTTAA CCTTATGACC AGCAAGAAAA TCCTCCTTTG 3839 .......... .......... .......... .......... .......... .......... 778 ACAGAATTTG CATCCCTCTT TCACCCATAT GACCAAGTCT CATGTGCCAT AACTTAGTCA 3899 .......... .......... .......... .......... .......... .......... 778 TATCTTTCTG GTGAAATTCT GACGATGCAA CATGGGCTGA ACCTGTAACC GTGGAACCTT 3959 .......... .......... .......... .......... .......... .......... 778 GTAGAAAATA CAAAGTACCA CGCATGACAC CTTTCAGAAT CAAATTTGAA CCCTTCCAGA 4019 .......... .......... .......... .......... .......... .......... 
778 CCCGCAAGAC TCCATCTTTT CCCGACCAGC TGAATCCCTT GCTGTCCAAA AGACTGAGAG 4079 .......... .......... .......... .......... .......... .......... 778 ATATCAGATT TTTCGTCATC AATGGAACGT GCCTGACCTC GTTCAATGTG CAGAAGCTAC 4139 .......... .......... .......... .......... .......... .......... 778 CGTCATGTGT CCTTATCTTG ATCGAGCCTG TCCCAACCAC CTTGCAGACA GAACTGTTGG 4199 .......... .......... .......... .......... .......... .......... 778 CCATCGAGAT GCTGCCTCCG TCTACCTGCT CATAAGTCGT GAACCACTCT CTCCTAGGAC 4259 .......... .......... .......... .......... .......... .......... 778 AGATGTGATA GGATGCCCCA GAATCGAGAA CCCACACATC TAAATGATGA GTGTGCTCAT 4319 .......... .......... .......... .......... .......... .......... 778 CCGCAACTAG GGAAATGTCT TCTTCGGAAT TGGTGTCTTC TTCAGCAACA GCAGCAGACA 4379 .......... .......... .......... .......... .......... .......... 778 CTGATTGTTT TTCCGATTGC TTCTTCTTCT TCGGACAATC AAATTTCCAA TGTCCCTTCT 4439 .......... .......... .......... .......... .......... .......... 778 CCTTGCAGTA ATTACAAACA TCATCCGTCT TTGCACCCTT CGACATCGGC TTATTTTTCT 4499 .......... .......... .......... .......... .......... .......... 778 TTCCGCCGTT TTTCCTTCCT TTTCCGCTAC TGGTGAACAG ACCGGAGGGC TGTATGTCTG 4559 .......... .......... .......... .......... .......... .......... 778 TACTTGTGCC GTTAGCCTTA TGCCGTAATT CCCTGCTATG AAGGGCCGAT CTGACTTCTT 4619 .......... .......... .......... .......... .......... .......... 778 CCAGTGACAC AGTATCTTTC CCAACAATGA ACGATTGAAC AAAATTCTCA AACGACATTG 4679 .......... .......... .......... .......... .......... .......... 778 GGAGAGATAC TAACAGAATC AGGGCAGCAT CTTCATCCTC GATCTTCACA TCGATATTAC 4739 .......... .......... .......... .......... .......... .......... 778 GCAATTCTAA TAACAAAGTA TTCAATTACT CTAAGTGTTC CCTGAGTTGT GTACCTTCAG 4799 .......... .......... .......... .......... .......... .......... 778 CCATTCGTAA ACCGAATAGA CGTTGTTTCA GAAGCAGCTT GTTGGTTAGA GATTTTGTCA 4859 .......... .......... .......... .......... .......... .......... 
778 TGTACAACTC TCCAGCTTCA ACCACAGACC AGCAGCAGTC TCTTCATCCG AGACCTCCGT 4919 .......... .......... .......... .......... .......... .......... 778 GATGACGTCA TCCGCGAGAC ACAGCATGAT CGTCGAGTGC GCCTTTTCCT CCAGAATCGC 4979 .......... .......... .......... .......... .......... .......... 778 CATCTCAGGA GTAACGACGG CGTTCTTGTC TTTCGACAAC GGCGCCCAGA AGCCTTGCTG 5039 .......... .......... .......... .......... .......... .......... 778 TTTCAACAAA GTCCGCATCT TGATCTGCCA TAAACTGAAA CTGTTCCTCC CTGTGAATTT 5099 .......... .......... .......... .......... .......... .......... 778 GTCGATTTTC ACGTTCAAAG CAGACATCTC GAATTCTCTA AGAACACCGA TTAACCGAGA 5159 .......... .......... .......... .......... .......... .......... 778 GGCTCTGATA CCAATTTGTT TTGCGGAATT TGAGATAATA CGAGAAAATA TATAAACGCG 5219 .......... .......... .......... .......... .......... .......... 778 AAAAATAAGA CAACAGATTT ACGTGGTTCA CCAATAAATT GGCTACGTCC ACGGGAAGAG 5279 .......... .......... .......... .......... .......... .......... 778 AGGGAGCAGT TTTATTATGG AGAGGCAAAA ACAGAATTAC AAAATAGGGC AGCGAGACCC 5339 |||| |||||||||| .......... .......... .......... .......... 
......GGGC AGCGAGACCC 792 CCGTCCTTTC TGTTTGTAAC GTGTCCGATT CAAGACATTC AACAT 5384 |||||||||| |||||||| | | |||||||| |||| ||||| ||||| CCGTCCTTTC TGTTTGTAGC GGGTCCGATT CAAGGCATTC AACAT 837 hqPGS_C06HBa0054K13.1-6+_SGN-U314435- (813 1095,2754 2761,2895 2942,3115 3164,5326 5384) ******************************************************************************** EST sequence 3 -strand 946 n (File: SGN-U345812-) 1 TTCTGTCAAG GCCCATGTAC TCGGCTTCCG TAGTAGACAA AGTCACTGTA GGTGCAAAAG 61 TTGCCTTCCA ACTGACGACA GATCCTCCCA GGGTAAACAC ATAGCCAGTC ATCGATCTTC 121 TTGTGTCAAC ATCTCCAGCA TAGTCTGAAT CAGAATAGCC AGTAACCAAG CACTGAGTAT 181 CACCTCCATA AATGAGACCA ACGTCAGATG TACCTCTAAG GTACCGGAAA ATTCTCTTCA 241 CAGCCTGCCA ATGTTCTCTC CCTGGTTGTC CCATGAATCT GCTCACTACA CTGACTGCAT 301 GTGCTAAATC TGGCCTTGTA CAGACCATAG CATACATCAA ACTTCTTACG GCACTGGCAT 361 AAGGGACTCG TGACATATAC TCCTTCTCTT CTTCTGACTG TGGAGCGAAC ATGGCAGTGA 421 GATGGATATT GGCAGCACTG GGGGTATCAA TGGGCTTAGA TGAAGACATG CCAAACCTCG 481 CCAAGACCTT CTGAATGTAG CTTCTCTGTG ACAAGAAAAG TTTCCTTCTC TCTCTGTCTC 541 TAATGATCTC CATCCCTAAA ATCTTCCGAG CGGCTCCCAG ATCCTTCATC TCAAACTCAG 601 CACTAAGTAA ACCCTTCAGC TTCTGAATGT CATACTTCTT CTTTGCAGCT ATCAACATAT 661 CATCTACATA AAGCACCAGA TAGATGAATG AATCATCATT GAGCCTATTG TAGTAGACAC 721 AACAATCATA TGAGCTCCGA GTATAGCCCA ACTTCACCAT ATAGCTGTCA AACCTTTTAT 781 ACCACTGCCT TGGAGACTGC TTAAGTCCAT ATAAGGACTT CTTCAACTTG CAGACGTGAT 841 TTTCCTTCCC TGGAACTTGG AAACCCCTCG TGCCGAATTC CTGCAGCCCG GGGGATCCAC 901 TAGTTCTAGA GCGGCCNGCC ACCGCGGTGG AGCTCCNGCC CCCAAA Predicted gene structure (within gDNA segment 677 to 3768): Exon 1 1496 2358 ( 863 n); cDNA 2 865 ( 864 n); score: 0.973 MATCH C06HBa0054K13.1-6+ SGN-U345812- 0.973 863 0.912 C PGS_C06HBa0054K13.1-6+_SGN-U345812- (1496 2358) Alignment (genomic DNA sequence = upper lines): TCTGTCAAGG -CCATGTACT CCGCTTCCGT AGTAGACAAA GTCACTGTAG GTTGCAAAGT 1554 |||||||||| ||||||||| | |||||||| |||||||||| |||||||||| || ||||| TCTGTCAAGG CCCATGTACT CGGCTTCCGT AGTAGACAAA GTCACTGTAG GTGCAAAAGT 61 TACCTTCCAA 
CTGACGACAG ATCCTCCAAG GGTAAACACA TAGCCAGTCA TCGATCTTCT 1614 | |||||||| |||||||||| ||||||| || |||||||||| |||||||||| |||||||||| TGCCTTCCAA CTGACGACAG ATCCTCCCAG GGTAAACACA TAGCCAGTCA TCGATCTTCT 121 TGTGTCAACA TCTCCAGCAT AGTCTGAATC AGAATAGCCA GTAACCAAGC ACTGAGTATC 1674 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGTGTCAACA TCTCCAGCAT AGTCTGAATC AGAATAGCCA GTAACCAAGC ACTGAGTATC 181 ACCTCCATAA ATGAGACCAA CGTCAGATGT ACCTCTAAGG TACCGGAAAA TTCTCTTCAC 1734 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACCTCCATAA ATGAGACCAA CGTCAGATGT ACCTCTAAGG TACCGGAAAA TTCTCTTCAC 241 AGCCTGCCAA TGTTCTCTCC CTGGTTGTCC CATGAATCTG CTCACTACAC TGACTGCATG 1794 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGCCTGCCAA TGTTCTCTCC CTGGTTGTCC CATGAATCTG CTCACTACAC TGACTGCATG 301 TGCTAAATCT GGCCTTGTAC AGACCATAGC ATACATCAAA CTTCCTACGA CACTGGCATA 1854 |||||||||| |||||||||| |||||||||| |||||||||| |||| |||| |||||||||| TGCTAAATCT GGCCTTGTAC AGACCATAGC ATACATCAAA CTTCTTACGG CACTGGCATA 361 AGGGACTCGT GACATATACT CCTTCTCTTC TTCTGACTGT GGAGCGAACA TGGCAGTGAG 1914 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGGGACTCGT GACATATACT CCTTCTCTTC TTCTGACTGT GGAGCGAACA TGGCAGTGAG 421 GTGGATATTG GCAGCACTAG GGTTATCAAT AGGCTTAGAT GAAGACATGC CAAACCTCGC 1974 ||||||||| |||||||| | || ||||||| ||||||||| |||||||||| |||||||||| ATGGATATTG GCAGCACTGG GGGTATCAAT GGGCTTAGAT GAAGACATGC CAAACCTCGC 481 CAAGACCTTC TGAATGTAGC TTCTCTGTGA CAAGAAAAGT TTCCTTCTCT CTCTGTCTCT 2034 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAAGACCTTC TGAATGTAGC TTCTCTGTGA CAAGAAAAGT TTCCTTCTCT CTCTGTCTCT 541 AATGATCTCC ATCCCTAGAA TCTTCCGAGC AGCTCCCAGA TCCTTCATCT CAAACTCCGC 2094 |||||||||| ||||||| || |||||||||| ||||||||| |||||||||| ||||||| || AATGATCTCC ATCCCTAAAA TCTTCCGAGC GGCTCCCAGA TCCTTCATCT CAAACTCAGC 601 ACTAAGTAAA CCCTTCAGCT TATGAATGTC ATACTTCTTC TTTGCAGCTA TCAGCATATC 2154 |||||||||| |||||||||| | |||||||| |||||||||| |||||||||| ||| |||||| ACTAAGTAAA 
CCCTTCAGCT TCTGAATGTC ATACTTCTTC TTTGCAGCTA TCAACATATC 661 ATCTACATAA AGCACCAGAT AGATGAACGA ATCATCCTTG AGCCTATTGT AGTAGACACA 2214 |||||||||| |||||||||| ||||||| || |||||| ||| |||||||||| |||||||||| ATCTACATAA AGCACCAGAT AGATGAATGA ATCATCATTG AGCCTATTGT AGTAGACACA 721 ACAATCATAT GAGCTCCGAG TATAGCCCAA CTTCGCCATA TAGCTGTCAA ACCTTTTGTA 2274 |||||||||| |||||||||| |||||||||| |||| ||||| |||||||||| ||||||| || ACAATCATAT GAGCTCCGAG TATAGCCCAA CTTCACCATA TAGCTGTCAA ACCTTTTATA 781 CCACTGCCTT GGAGACTGCT TAAGTCCATA TAAGGACTTC TTCAACTTGC AGACGTGATT 2334 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCACTGCCTT GGAGACTGCT TAAGTCCATA TAAGGACTTC TTCAACTTGC AGACGTGATT 841 TTCCTTCCCT GGAACTTGGA AACC 2358 |||||||||| |||||||||| |||| TTCCTTCCCT GGAACTTGGA AACC 865 hqPGS_C06HBa0054K13.1-6+_SGN-U345812- (1496 2358) ******************************************************************************** EST sequence 4 -strand 649 n (File: SGN-U330105-) 1 AGTCATGTAT ATCTCTTCCT CCAACTCTCC ATGTAGAAAC GCTGTCTTCA CATCAAGTGT 61 TTCAAGCTCC AGATTCTGAT GTGCAACTAT CGCTAGTAAC ACTCGGATGG AAGTATGTCT 121 GACCACTGGT GAGAAGATCT CATTGTAGTC CACTCCCTCT CTTTGGTTGA AACCTCTAGC 181 AACAACCCTG ACTTTATACT TGACTCCTTC TGCTGGTGAT ATCCCTTCCT TCTTCTTGAA 241 AACCCATTTG CAAGTAATAA TCTTTCTCCC CGAAGGCTGT ATGACCAGAT CCCATGTCTG 301 ATTCTTGTGT AGGGACTCCA TCTCATCTCC CATAGCGGCA AACCATTTTT CAGAATCAGA 361 ACTTAAAATG GCTTCTTTGT AAGTAGACGG CTCAGATGTA TCTACCTCTT CAGCAACCTG 421 CAGTGCATAA CCCACCATGT CCTCAAAACC ATACCTCGTA GGTGGCCGAA CTCCAACCCT 481 CCTTGGCCGA TCTTGAGCTA TACTCTGATG GATATCTGAT GGCATAGATT CTGGAATATC 541 AGTTTCAGTC TGTGGCTCTT GATCCTCCTC TTCAGGTTCC TTTAAATCGC TCTCGTTCTG 601 AATGACTTGA AACTCCACCT GTTTGTCAAG ACTCCTAGTT TCTGACGTA Predicted gene structure (within gDNA segment 1758 to 3742): Exon 1 2368 3016 ( 649 n); cDNA 1 649 ( 649 n); score: 0.982 MATCH C06HBa0054K13.1-6+ SGN-U330105- 0.982 649 1.000 C PGS_C06HBa0054K13.1-6+_SGN-U330105- (2368 3016) Alignment (genomic DNA sequence = upper lines): AGTCATGTAT 
ATCTCTTCCT CCAACTCTCC ATGTAGAAAC GCTGTCTTCA CATCAAGTTG 2427 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||| AGTCATGTAT ATCTCTTCCT CCAACTCTCC ATGTAGAAAC GCTGTCTTCA CATCAAGTGT 60 TTCAAGCTCC AGATTCTGAT GTGTAACTAT CGCTAGTAAC ACTCGGATGG AAGTATGTCT 2487 |||||||||| |||||||||| ||| |||||| |||||||||| |||||||||| |||||||||| TTCAAGCTCC AGATTCTGAT GTGCAACTAT CGCTAGTAAC ACTCGGATGG AAGTATGTCT 120 GACCACTGGT GAGAAGATCT CATTGTAGTC CACTCCCTCT CTTTGGTTGA AACCTCTGGC 2547 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||| || GACCACTGGT GAGAAGATCT CATTGTAGTC CACTCCCTCT CTTTGGTTGA AACCTCTAGC 180 AACAACCCTG GCTATATACT TGACTCCTTC TGCTGGTGAT ATCCCTTCCT TCTTCTTGAA 2607 |||||||||| || |||||| |||||||||| |||||||||| |||||||||| |||||||||| AACAACCCTG ACTTTATACT TGACTCCTTC TGCTGGTGAT ATCCCTTCCT TCTTCTTGAA 240 AACCCATTTG CAAGTAATAA TCTTTCTCCC CGAAGGCTGT ATGACCAGAT CCCATGTCTG 2667 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AACCCATTTG CAAGTAATAA TCTTTCTCCC CGAAGGCTGT ATGACCAGAT CCCATGTCTG 300 ATTCTTGTGT AGGGACTCCA TCTCATCTCC CATAGCAGCA AACCATTTTT CAGAATCAGG 2727 |||||||||| |||||||||| |||||||||| |||||| ||| |||||||||| ||||||||| ATTCTTGTGT AGGGACTCCA TCTCATCTCC CATAGCGGCA AACCATTTTT CAGAATCAGA 360 ACTTAAAATG GCTTCTTTGT AAGTAGACGG CTCAGATGCA TCTACCTCTT CAGCAACCTG 2787 |||||||||| |||||||||| |||||||||| |||||||| | |||||||||| |||||||||| ACTTAAAATG GCTTCTTTGT AAGTAGACGG CTCAGATGTA TCTACCTCTT CAGCAACCTG 420 CAGTGCATAA CCCACCATGT CCTCAAAACC ACACCTCGTA GGTGGCCGAA CTCCAACCCT 2847 |||||||||| |||||||||| |||||||||| | |||||||| |||||||||| |||||||||| CAGTGCATAA CCCACCATGT CCTCAAAACC ATACCTCGTA GGTGGCCGAA CTCCAACCCT 480 CCTTGGCCGA TCTTGAGCTA TACTCTGATG GATATCTGAT GGCATAGATT CTGGAATATC 2907 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCTTGGCCGA TCTTGAGCTA TACTCTGATG GATATCTGAT GGCATAGATT CTGGAATATC 540 AGTTTCTGTC TGTGGCTCTT GATCCTCCTC TTCAGGTTCC TTTAAATCGC TCTCGTTCTG 2967 |||||| ||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGTTTCAGTC 
TGTGGCTCTT GATCCTCCTC TTCAGGTTCC TTTAAATCGC TCTCGTTCTG 600 AATGACTTGA AACTCCACCT GTTTGTCAAG ACTCCCAGTT TCTGACGTA 3016 |||||||||| |||||||||| |||||||||| ||||| |||| ||||||||| AATGACTTGA AACTCCACCT GTTTGTCAAG ACTCCTAGTT TCTGACGTA 649 hqPGS_C06HBa0054K13.1-6+_SGN-U330105- (2368 3016) Total number of EST alignments reported: 4 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 7598: PGL 1 (+ strand): 52 5384 AGS-1 (52 239,337 683) SCR (e 0.872 d 0.999 a 0.976,e 0.818) Exon 1 52 239 ( 188 n); score: 0.872 Intron 1 240 336 ( 97 n); Pd: 0.999 Pa: 0.976 Exon 2 337 683 ( 347 n); score: 0.818 PGS (52 239,337 683) SGN-U322835+ 3-phase translation of AGS-1 (+strand): . . . . . . 52 TGCAATGGATTATCTCCACAATGGCTATTCAACGCCTGTGGTGCATTGTGACTTGAAGCC C N G L S P Q W L F N A C G A L - L E A A M D Y L H N G Y S T P V V H C D L K P Q W I I S T M A I Q R L W C I V T - S . . . . . . 112 AAGTAATGTCTTGTTAGATGAAGAAATGGTTGCTCATGTAAGTGATTTTGGCATTGCAAA K - C L V R - R N G C S C K - F W H C K S N V L L D E E M V A H V S D F G I A K Q V M S C - M K K W L L M - V I L A L Q . . . . . . 172 AATGTTAGGTGCAGGGGAGGCTTTTGTTCAAACAAGGACAGTTGCAACCATTGGATATAT N V R C R G G F C S N K D S C N H W I Y M L G A G E A F V Q T R T V A T I G Y I K C - V Q G R L L F K Q G Q L Q P L D I . : . . . . . 232 TGCTCCAG : AGTATGGACAAGATGGAATAGTATCCACGAGTTGTGATGTTTATAGTTTTGG C S R : V W T R W N S I H E L - C L - F W A P : E Y G Q D G I V S T S C D V Y S F G L L Q : S M D K M E - Y P R V V M F I V L . . . . . . 389 CATCCTGATGATGGAGACGTTCACACGAACAAGACCAAGTGATGAGATATTTACTGGAGA H P D D G D V H T N K T K - - D I Y W R I L M M E T F T R T R P S D E I F T G D A S - - W R R S H E Q D Q V M R Y L L E . . . . . . 449 CTTGAGCATACAACGTTGGATTAGTGATTCCTTTCCGGGGGAACTTCACAAGGTGGTAGA L E H T T L D - - F L S G G T S Q G G R L S I Q R W I S D S F P G E L H K V V D T - A Y N V G L V I P F R G N F T R W - . . . . . . 
509 TTCTAATTTGGTACAGCCAGGAGAAGAACAAATCGCTGCAAAGATGCAATGTTTGTTATC F - F G T A R R R T N R C K D A M F V I S N L V Q P G E E Q I A A K M Q C L L S I L I W Y S Q E K N K S L Q R C N V C Y . . . . . . 569 TATCATGGAATTAGCTTTGAACTGCACTTTAGTGAGACCTGATGAAAGAATTAGCATGAA Y H G I S F E L H F S E T - - K N - H E I M E L A L N C T L V R P D E R I S M N L S W N - L - T A L - - D L M K E L A - . . . . . . 629 TGATGCTCTTTCAGCACTCAAAAAGATTAGACTACAGCTTGTTAGTAGTCGGCAC - C S F S T Q K D - T T A C - - S A D A L S A L K K I R L Q L V S S R H M M L F Q H S K R L D Y S L L V V G Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-6+_PGL-1_AGS-1_PPS_1 (53 239,337 683) (frame '2'; 534 bp, 178 residues) 1 AMDYLHNGYS TPVVHCDLKP SNVLLDEEMV AHVSDFGIAK MLGAGEAFVQ TRTVATIGYI 61 APEYGQDGIV STSCDVYSFG ILMMETFTRT RPSDEIFTGD LSIQRWISDS FPGELHKVVD 121 SNLVQPGEEQ IAAKMQCLLS IMELALNCTL VRPDERISMN DALSALKKIR LQLVSSRH AGS-2 (813 1095,2754 2761,2895 2942,3115 3164,5326 5384) SCR (e 0.866 d 0.426 a 0.246,e 0.750 d 0.000 a 0.784,e 0.521 d 0.901 a 0.000,e 0.520 d 0.510 a 0.000,e 0.949) Exon 1 813 1095 ( 283 n); score: 0.866 Intron 1 1096 2753 (1658 n); Pd: 0.426 Pa: 0.246 Exon 2 2754 2761 ( 8 n); score: 0.750 Intron 2 2762 2894 ( 133 n); Pd: 0.000 Pa: 0.784 Exon 3 2895 2942 ( 48 n); score: 0.521 Intron 3 2943 3114 ( 172 n); Pd: 0.901 Pa: 0.000 Exon 4 3115 3164 ( 50 n); score: 0.520 Intron 4 3165 5325 (2161 n); Pd: 0.510 Pa: 0.000 Exon 5 5326 5384 ( 59 n); score: 0.949 PGS (813 1095,2754 2761,2895 2942,3115 3164,5326 5384) SGN-U314435- 3-phase translation of AGS-2 (+strand): . . . . . . 813 GAGATTTAAAATGCTCAAAACAAGATTGTCTTTTCTTATTTATACTGTTGTGCGGAATTT E I - N A Q N K I V F S Y L Y C C A E F R F K M L K T R L S F L I Y T V V R N L D L K C S K Q D C L F L F I L L C G I . . . . . . 873 GAGATAATACGAGAAAATATATAAACGCGAAAAACAAGACAACATATTTACGTGGTTCAC E I I R E N I - T R K T R Q H I Y V V H R - Y E K I Y K R E K Q D N I F T W F T - D N T R K Y I N A K N K T T Y L R G S . . . . . . 
933 CAATAAATTGGCTACGTCCACGGGAAGAGAGGGAGCAGTTTTATTATGGAGAGGCAAAAA Q - I G Y V H G K R G S S F I M E R Q K N K L A T S T G R E G A V L L W R G K N P I N W L R P R E E R E Q F Y Y G E A K . . . . . . 993 CAGAATTACAAAATAGGGTTTGCCATAGCGTCTATATATATATATAGTGTTAAGCTACGC Q N Y K I G F A I A S I Y I Y S V K L R R I T K - G L P - R L Y I Y I V L S Y A T E L Q N R V C H S V Y I Y I - C - A T . . . . . : . : 1053 CCTAACAGGCTTGGGCCCAACATACAGAATTGACAGATTCAAG : ACGGCTCA : ATTCTGGAA P N R L G P N I Q N - Q I Q : D G S : I L E L T G L G P T Y R I D R F K : T A Q : F W N P - Q A W A Q H T E L T D S R : R L : N S G . . . . : . . 2904 TATCAGTTTCTGTCTGTGGCTCTTGATCCTCCTCTTCAG : ACCTTTCACTCCATCTCCGTA Y Q F L S V A L D P P L Q : T F H S I S V I S F C L W L L I L L F R : P F T P S P - I S V S V C G S - S S S S : D L S L H L R . . . : . . . 3136 GCCCACAAATACTCCCTTTTTAGCTCTTG : GGGCAGCGAGACCCCCGTCCTTTCTGTTTGT A H K Y S L F S S W : G S E T P V L S V C P T N T P F L A L : G A A R P P S F L F V S P Q I L P F - L L : G Q R D P R P F C L . . . 5357 AACGTGTCCGATTCAAGACATTCAACAT N V S D S R H S T T C P I Q D I Q H - R V R F K T F N Maximal non-overlapping open reading frames (>= 64 codons): none AGS-3 (1496 2358) SCR (e 0.973) Exon 1 1496 2358 ( 863 n); score: 0.973 PGS (1496 2358) SGN-U345812- 3-phase translation of AGS-3 (+strand): . . . . . . 1496 TCTGTCAAGGCCATGTACTCCGCTTCCGTAGTAGACAAAGTCACTGTAGGTTGCAAAGTT S V K A M Y S A S V V D K V T V G C K V L S R P C T P L P - - T K S L - V A K L C Q G H V L R F R S R Q S H C R L Q S . . . . . . 1556 ACCTTCCAACTGACGACAGATCCTCCAAGGGTAAACACATAGCCAGTCATCGATCTTCTT T F Q L T T D P P R V N T - P V I D L L P S N - R Q I L Q G - T H S Q S S I F L Y L P T D D R S S K G K H I A S H R S S . . . . . . 1616 GTGTCAACATCTCCAGCATAGTCTGAATCAGAATAGCCAGTAACCAAGCACTGAGTATCA V S T S P A - S E S E - P V T K H - V S C Q H L Q H S L N Q N S Q - P S T E Y H C V N I S S I V - I R I A S N Q A L S I . . . . . . 
1676 CCTCCATAAATGAGACCAACGTCAGATGTACCTCTAAGGTACCGGAAAATTCTCTTCACA P P - M R P T S D V P L R Y R K I L F T L H K - D Q R Q M Y L - G T G K F S S Q T S I N E T N V R C T S K V P E N S L H . . . . . . 1736 GCCTGCCAATGTTCTCTCCCTGGTTGTCCCATGAATCTGCTCACTACACTGACTGCATGT A C Q C S L P G C P M N L L T T L T A C P A N V L S L V V P - I C S L H - L H V S L P M F S P W L S H E S A H Y T D C M . . . . . . 1796 GCTAAATCTGGCCTTGTACAGACCATAGCATACATCAAACTTCCTACGACACTGGCATAA A K S G L V Q T I A Y I K L P T T L A - L N L A L Y R P - H T S N F L R H W H K C - I W P C T D H S I H Q T S Y D T G I . . . . . . 1856 GGGACTCGTGACATATACTCCTTCTCTTCTTCTGACTGTGGAGCGAACATGGCAGTGAGG G T R D I Y S F S S S D C G A N M A V R G L V T Y T P S L L L T V E R T W Q - G R D S - H I L L L F F - L W S E H G S E . . . . . . 1916 TGGATATTGGCAGCACTAGGGTTATCAATAGGCTTAGATGAAGACATGCCAAACCTCGCC W I L A A L G L S I G L D E D M P N L A G Y W Q H - G Y Q - A - M K T C Q T S P V D I G S T R V I N R L R - R H A K P R . . . . . . 1976 AAGACCTTCTGAATGTAGCTTCTCTGTGACAAGAAAAGTTTCCTTCTCTCTCTGTCTCTA K T F - M - L L C D K K S F L L S L S L R P S E C S F S V T R K V S F S L C L - Q D L L N V A S L - Q E K F P S L S V S . . . . . . 2036 ATGATCTCCATCCCTAGAATCTTCCGAGCAGCTCCCAGATCCTTCATCTCAAACTCCGCA M I S I P R I F R A A P R S F I S N S A - S P S L E S S E Q L P D P S S Q T P H N D L H P - N L P S S S Q I L H L K L R . . . . . . 2096 CTAAGTAAACCCTTCAGCTTATGAATGTCATACTTCTTCTTTGCAGCTATCAGCATATCA L S K P F S L - M S Y F F F A A I S I S - V N P S A Y E C H T S S L Q L S A Y H T K - T L Q L M N V I L L L C S Y Q H I . . . . . . 2156 TCTACATAAAGCACCAGATAGATGAACGAATCATCCTTGAGCCTATTGTAGTAGACACAA S T - S T R - M N E S S L S L L - - T Q L H K A P D R - T N H P - A Y C S R H N I Y I K H Q I D E R I I L E P I V V D T . . . . . . 2216 CAATCATATGAGCTCCGAGTATAGCCCAACTTCGCCATATAGCTGTCAAACCTTTTGTAC Q S Y E L R V - P N F A I - L S N L L Y N H M S S E Y S P T S P Y S C Q T F C T T I I - A P S I A Q L R H I A V K P F V . . . . . . 
2276 CACTGCCTTGGAGACTGCTTAAGTCCATATAAGGACTTCTTCAACTTGCAGACGTGATTT H C L G D C L S P Y K D F F N L Q T - F T A L E T A - V H I R T S S T C R R D F P L P W R L L K S I - G L L Q L A D V I . . . 2336 TCCTTCCCTGGAACTTGGAAACC S F P G T W K P S L E L G N F L P W N L E T Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-3 (-strand): . . . . . . 2358 GGTTTCCAAGTTCCAGGGAAGGAAAATCACGTCTGCAAGTTGAAGAAGTCCTTATATGGA G F Q V P G K E N H V C K L K K S L Y G V S K F Q G R K I T S A S - R S P Y M D F P S S R E G K S R L Q V E E V L I W . . . . . . 2298 CTTAAGCAGTCTCCAAGGCAGTGGTACAAAAGGTTTGACAGCTATATGGCGAAGTTGGGC L K Q S P R Q W Y K R F D S Y M A K L G L S S L Q G S G T K G L T A I W R S W A T - A V S K A V V Q K V - Q L Y G E V G . . . . . . 2238 TATACTCGGAGCTCATATGATTGTTGTGTCTACTACAATAGGCTCAAGGATGATTCGTTC Y T R S S Y D C C V Y Y N R L K D D S F I L G A H M I V V S T T I G S R M I R S L Y S E L I - L L C L L Q - A Q G - F V . . . . . . 2178 ATCTATCTGGTGCTTTATGTAGATGATATGCTGATAGCTGCAAAGAAGAAGTATGACATT I Y L V L Y V D D M L I A A K K K Y D I S I W C F M - M I C - - L Q R R S M T F H L S G A L C R - Y A D S C K E E V - H . . . . . . 2118 CATAAGCTGAAGGGTTTACTTAGTGCGGAGTTTGAGATGAAGGATCTGGGAGCTGCTCGG H K L K G L L S A E F E M K D L G A A R I S - R V Y L V R S L R - R I W E L L G S - A E G F T - C G V - D E G S G S C S . . . . . . 2058 AAGATTCTAGGGATGGAGATCATTAGAGACAGAGAGAGAAGGAAACTTTTCTTGTCACAG K I L G M E I I R D R E R R K L F L S Q R F - G W R S L E T E R E G N F S C H R E D S R D G D H - R Q R E K E T F L V T . . . . . . 1998 AGAAGCTACATTCAGAAGGTCTTGGCGAGGTTTGGCATGTCTTCATCTAAGCCTATTGAT R S Y I Q K V L A R F G M S S S K P I D E A T F R R S W R G L A C L H L S L L I E K L H S E G L G E V W H V F I - A Y - . . . . . . 1938 AACCCTAGTGCTGCCAATATCCACCTCACTGCCATGTTCGCTCCACAGTCAGAAGAAGAG N P S A A N I H L T A M F A P Q S E E E T L V L P I S T S L P C S L H S Q K K R - P - C C Q Y P P H C H V R S T V R R R . . . . . . 
1878 AAGGAGTATATGTCACGAGTCCCTTATGCCAGTGTCGTAGGAAGTTTGATGTATGCTATG K E Y M S R V P Y A S V V G S L M Y A M R S I C H E S L M P V S - E V - C M L W E G V Y V T S P L C Q C R R K F D V C Y . . . . . . 1818 GTCTGTACAAGGCCAGATTTAGCACATGCAGTCAGTGTAGTGAGCAGATTCATGGGACAA V C T R P D L A H A V S V V S R F M G Q S V Q G Q I - H M Q S V - - A D S W D N G L Y K A R F S T C S Q C S E Q I H G T . . . . . . 1758 CCAGGGAGAGAACATTGGCAGGCTGTGAAGAGAATTTTCCGGTACCTTAGAGGTACATCT P G R E H W Q A V K R I F R Y L R G T S Q G E N I G R L - R E F S G T L E V H L T R E R T L A G C E E N F P V P - R Y I . . . . . . 1698 GACGTTGGTCTCATTTATGGAGGTGATACTCAGTGCTTGGTTACTGGCTATTCTGATTCA D V G L I Y G G D T Q C L V T G Y S D S T L V S F M E V I L S A W L L A I L I Q - R W S H L W R - Y S V L G Y W L F - F . . . . . . 1638 GACTATGCTGGAGATGTTGACACAAGAAGATCGATGACTGGCTATGTGTTTACCCTTGGA D Y A G D V D T R R S M T G Y V F T L G T M L E M L T Q E D R - L A M C L P L E R L C W R C - H K K I D D W L C V Y P W . . . . . . 1578 GGATCTGTCGTCAGTTGGAAGGTAACTTTGCAACCTACAGTGACTTTGTCTACTACGGAA G S V V S W K V T L Q P T V T L S T T E D L S S V G R - L C N L Q - L C L L R K R I C R Q L E G N F A T Y S D F V Y Y G . . . 1518 GCGGAGTACATGGCCTTGACAGA A E Y M A L T R S T W P - Q S G V H G L D R Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-6-_PGL-1_AGS-3_PPS_1 (2358 1498) (frame '1'; 861 bp, 287 residues) 1 GFQVPGKENH VCKLKKSLYG LKQSPRQWYK RFDSYMAKLG YTRSSYDCCV YYNRLKDDSF 61 IYLVLYVDDM LIAAKKKYDI HKLKGLLSAE FEMKDLGAAR KILGMEIIRD RERRKLFLSQ 121 RSYIQKVLAR FGMSSSKPID NPSAANIHLT AMFAPQSEEE KEYMSRVPYA SVVGSLMYAM 181 VCTRPDLAHA VSVVSRFMGQ PGREHWQAVK RIFRYLRGTS DVGLIYGGDT QCLVTGYSDS 241 DYAGDVDTRR SMTGYVFTLG GSVVSWKVTL QPTVTLSTTE AEYMALT AGS-4 (2368 3016) SCR (e 0.982) Exon 1 2368 3016 ( 649 n); score: 0.982 PGS (2368 3016) SGN-U330105- 3-phase translation of AGS-4 (+strand): . . . . . . 
2368 AGTCATGTATATCTCTTCCTCCAACTCTCCATGTAGAAACGCTGTCTTCACATCAAGTTG S H V Y L F L Q L S M - K R C L H I K L V M Y I S S S N S P C R N A V F T S S C S C I S L P P T L H V E T L S S H Q V . . . . . . 2428 TTCAAGCTCCAGATTCTGATGTGTAACTATCGCTAGTAACACTCGGATGGAAGTATGTCT F K L Q I L M C N Y R - - H S D G S M S S S S R F - C V T I A S N T R M E V C L V Q A P D S D V - L S L V T L G W K Y V . . . . . . 2488 GACCACTGGTGAGAAGATCTCATTGTAGTCCACTCCCTCTCTTTGGTTGAAACCTCTGGC D H W - E D L I V V H S L S L V E T S G T T G E K I S L - S T P S L W L K P L A - P L V R R S H C S P L P L F G - N L W . . . . . . 2548 AACAACCCTGGCTATATACTTGACTCCTTCTGCTGGTGATATCCCTTCCTTCTTCTTGAA N N P G Y I L D S F C W - Y P F L L L E T T L A I Y L T P S A G D I P S F F L K Q Q P W L Y T - L L L L V I S L P S S - . . . . . . 2608 AACCCATTTGCAAGTAATAATCTTTCTCCCCGAAGGCTGTATGACCAGATCCCATGTCTG N P F A S N N L S P R R L Y D Q I P C L T H L Q V I I F L P E G C M T R S H V - K P I C K - - S F S P K A V - P D P M S . . . . . . 2668 ATTCTTGTGTAGGGACTCCATCTCATCTCCCATAGCAGCAAACCATTTTTCAGAATCAGG I L V - G L H L I S H S S K P F F R I R F L C R D S I S S P I A A N H F S E S G D S C V G T P S H L P - Q Q T I F Q N Q . . . . . . 2728 ACTTAAAATGGCTTCTTTGTAAGTAGACGGCTCAGATGCATCTACCTCTTCAGCAACCTG T - N G F F V S R R L R C I Y L F S N L L K M A S L - V D G S D A S T S S A T C D L K W L L C K - T A Q M H L P L Q Q P . . . . . . 2788 CAGTGCATAACCCACCATGTCCTCAAAACCACACCTCGTAGGTGGCCGAACTCCAACCCT Q C I T H H V L K T T P R R W P N S N P S A - P T M S S K P H L V G G R T P T L A V H N P P C P Q N H T S - V A E L Q P . . . . . . 2848 CCTTGGCCGATCTTGAGCTATACTCTGATGGATATCTGATGGCATAGATTCTGGAATATC P W P I L S Y T L M D I - W H R F W N I L G R S - A I L - W I S D G I D S G I S S L A D L E L Y S D G Y L M A - I L E Y . . . . . . 2908 AGTTTCTGTCTGTGGCTCTTGATCCTCCTCTTCAGGTTCCTTTAAATCGCTCTCGTTCTG S F C L W L L I L L F R F L - I A L V L V S V C G S - S S S S G S F K S L S F - Q F L S V A L D P P L Q V P L N R S R S . . . . . 
2968 AATGACTTGAAACTCCACCTGTTTGTCAAGACTCCCAGTTTCTGACGTA N D L K L H L F V K T P S F - R M T - N S T C L S R L P V S D V E - L E T P P V C Q D S Q F L T Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-4 (-strand): . . . . . . 3016 TACGTCAGAAACTGGGAGTCTTGACAAACAGGTGGAGTTTCAAGTCATTCAGAACGAGAG Y V R N W E S - Q T G G V S S H S E R E T S E T G S L D K Q V E F Q V I Q N E S R Q K L G V L T N R W S F K S F R T R . . . . . . 2956 CGATTTAAAGGAACCTGAAGAGGAGGATCAAGAGCCACAGACAGAAACTGATATTCCAGA R F K G T - R G G S R A T D R N - Y S R D L K E P E E E D Q E P Q T E T D I P E A I - R N L K R R I K S H R Q K L I F Q . . . . . . 2896 ATCTATGCCATCAGATATCCATCAGAGTATAGCTCAAGATCGGCCAAGGAGGGTTGGAGT I Y A I R Y P S E Y S S R S A K E G W S S M P S D I H Q S I A Q D R P R R V G V N L C H Q I S I R V - L K I G Q G G L E . . . . . . 2836 TCGGCCACCTACGAGGTGTGGTTTTGAGGACATGGTGGGTTATGCACTGCAGGTTGCTGA S A T Y E V W F - G H G G L C T A G C - R P P T R C G F E D M V G Y A L Q V A E F G H L R G V V L R T W W V M H C R L L . . . . . . 2776 AGAGGTAGATGCATCTGAGCCGTCTACTTACAAAGAAGCCATTTTAAGTCCTGATTCTGA R G R C I - A V Y L Q R S H F K S - F - E V D A S E P S T Y K E A I L S P D S E K R - M H L S R L L T K K P F - V L I L . . . . . . 2716 AAAATGGTTTGCTGCTATGGGAGATGAGATGGAGTCCCTACACAAGAATCAGACATGGGA K M V C C Y G R - D G V P T Q E S D M G K W F A A M G D E M E S L H K N Q T W D K N G L L L W E M R W S P Y T R I R H G . . . . . . 2656 TCTGGTCATACAGCCTTCGGGGAGAAAGATTATTACTTGCAAATGGGTTTTCAAGAAGAA S G H T A F G E K D Y Y L Q M G F Q E E L V I Q P S G R K I I T C K W V F K K K I W S Y S L R G E R L L L A N G F S R R . . . . . . 2596 GGAAGGGATATCACCAGCAGAAGGAGTCAAGTATATAGCCAGGGTTGTTGCCAGAGGTTT G R D I T S R R S Q V Y S Q G C C Q R F E G I S P A E G V K Y I A R V V A R G F R K G Y H Q Q K E S S I - P G L L P E V . . . . . . 
2536 CAACCAAAGAGAGGGAGTGGACTACAATGAGATCTTCTCACCAGTGGTCAGACATACTTC Q P K R G S G L Q - D L L T S G Q T Y F N Q R E G V D Y N E I F S P V V R H T S S T K E R E W T T M R S S H Q W S D I L . . . . . . 2476 CATCCGAGTGTTACTAGCGATAGTTACACATCAGAATCTGGAGCTTGAACAACTTGATGT H P S V T S D S Y T S E S G A - T T - C I R V L L A I V T H Q N L E L E Q L D V P S E C Y - R - L H I R I W S L N N L M . . . . . 2416 GAAGACAGCGTTTCTACATGGAGAGTTGGAGGAAGAGATATACATGACT E D S V S T W R V G G R D I H D K T A F L H G E L E E E I Y M T - R Q R F Y M E S W R K R Y T - Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-6-_PGL-1_AGS-4_PPS_1 (3015 2368) (frame '2'; 648 bp, 216 residues) 1 TSETGSLDKQ VEFQVIQNES DLKEPEEEDQ EPQTETDIPE SMPSDIHQSI AQDRPRRVGV 61 RPPTRCGFED MVGYALQVAE EVDASEPSTY KEAILSPDSE KWFAAMGDEM ESLHKNQTWD 121 LVIQPSGRKI ITCKWVFKKK EGISPAEGVK YIARVVARGF NQREGVDYNE IFSPVVRHTS 181 IRVLLAIVTH QNLELEQLDV KTAFLHGELE EEIYMT ... finished at: Mon Aug 28 22:24:16 2006 ________________________________________________________________________________ Sequence 7: C06HBa0054K13.1-7, from 1 to 5432, both strands analyzed. ... started at: Mon Aug 28 22:24:16 2006 EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA +strand ... ... found all matches, elapsed seconds = 2 ... matches indexed, elapsed seconds = 2 HitsTableSize = 0 EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA -strand ... ... found all matches, elapsed seconds = 1 ... 
matches indexed, elapsed seconds = 2 HitsTableSize = 2 ******************************************************************************** EST sequence 2 +strand 2271 n (File: SGN-U320935+) 1 AAAGACCTGC TTTCTTCTTT CATCAGCTCT AATTCTGGAA ATAGTACTCA AAAAAGGGCA 61 AAGACTCCTA TGAAGAAGAG AATTACTGTA AAAAAATAGT TGCAGATTCT GACACCCTTT 121 TGGGAAATTC GAATTGGGTT TGGGTTTTCT TGGGAGATGG CGTTAAATCC TAGTAGTAGA 181 ATGGACTCAC AAACGGCTGC TGAAAAAGCC GTCTCTGTGA TTGGGTTAGG TTATGATTTG 241 ACAAGTGATA TCCGGTTGAC GGCCTGTAAA ATCGGACCGA ATGGGTATGG TTTGATAGAG 301 ATTGATCAGA AGTCGACCAA GGACCTGCTG GTGCCTGGTG GGGTGGTGGT TTCTAATGTT 361 TCCACCTCTA TCAAGTGCGA TAAAGGGGAA CGAACTAGGT TTCGCTCTGA TGCTCTTTCT 421 TTTAGTCAGA TGTCAGAGCA ATTAAATCAG GAGCTGTCCC TGTCTGGTAA GATACCTTCT 481 GGGTTGTTTA ATGCGATGTT TGGTCACAAG GGATGCTGGC AAAAAGATGC ATCTTCGACA 541 AAGCTTCTCG CTTTTGATGG TTGGTTCATT ACCTTGTACA ATATTGAGTT GGTGAGATCC 601 CATCTAACAC TGTCTGAGCA AGTAAAACAA GACGTGCCTT CTTCTTGGGA TCCTCCTGCA 661 CTTGCAGAGT TTATTGAAAA ATATGGCACC CATATTGTTG TCGGGGTAAA GATGGGAGGT 721 AAAGATGTAA TTCACATAAA GCAACTGCAA AATTCCGTTC TTCAGCCAAT GGAGGTGCAG 781 AAATTACTCA AGCAATTAGC TGATGAAAAA TTTTCAGAAG ACATAAATGG ATGCCAGATA 841 GCAAAACCTG TTAGATCTGT TGAAAAATCA AAGGGCGAAA AATCAATATT CTCAGATCCG 901 CATCTACCAT TTGCAAACTC AATGAGACCA TCTATTATGT CTTACTCCAA AGCTGATGAT 961 CTACTAAGTA TTCATATCCG ACGAGGAGGT CTTGATTTTG GTCAAAGTCA TAGCCAGTGG 1021 CTTCCTACTG TATCACAGTC CCCCAATGCT ATATCAATGT CGTTTGTGCC AATCGCTTCA 1081 CTTTTGAGTG GTGTAAGGGG CAGTGGATTT TTAAGCCATG CAATTAATCT TTACCTGCGA 1141 TATAAACCAC CAATCGAGGA ACTTGAACAA TTTTTAGAAT TTCAGTTACC TCGGCAGTGG 1201 GCTCCAGCAT ATGGTGATCT TCCTCTTGGT CATCGCCACA GAAAACAGGC CTCTCCATCC 1261 TTGCAGTTCA CTTTGATGGG TCCAAAGCTA TACGTTAACA CTGTAAAGGT TGACTCTGGA 1321 AACAGGCCAG TAACTGGAAT TCGGTTATAC TTGGAAGGTA AAAGGAGTGA TCACCTGGCT 1381 ATCCATCTTC AACATCTATC AGCACTTCCT CAAAGTATCC AACTAACAGA TGATCTTAGC 1441 TATGAGCCTG TCGATGAACC AGTTGAACGA GGGTATCTTG AACCTGTCAA ATGGAGCATA 1501 TTTTCACATG TTTGCACCGC CCCAGTAGAA TACCGTGGAA 
CACGGATCGA TGACTCTGCT 1561 TCTATTGTGA CCAAGGCCTG GTTTGAGGTG AAGGTTATTG GAATGAAGAA GGTCCTCTTC 1621 CTGAGGCTGG GATTCTCGAT GGTCGCCTCA GCGAAGATCC GTCGGTCAGA GTGGGAAGGA 1681 CCAGCAACCA CATGTCGAAA ATCAGGTTTG ATCTCAATGC TGATCACTAC CCCATTCAGT 1741 ACAAAGCTAA ACCAACCCCA GAAGCCAACA AAGGTGGACT TGAATTCCGC GGTTTATCCT 1801 GGTGGTCCAC CTTCACCAGC AAGAGCGCCA AAGATGTCGC ACTTTGTTGA TACAACAGAA 1861 ATGGTAAGAG GTCCCGAGGA GTCCCCTGGT TACTGGGTGG TTACTGGTGC AAAGCTGTGT 1921 GTAGAAGATA GTAGGATAAG AATGAAAGTG AAGTACTCCC TCTTAACCAT ACTTACAGAG 1981 GAGTCATTGT TGATATAAAC GATTGAAAGA TTTCATTTCA AGCTTTATCC ATCTTTCTCT 2041 ATAGGACATT GTGTATATTT CATCTGATAT TTGCGCAGAT ATGCAGAAAG TGTTATAGCA 2101 AGGTCAAATT GACCCAGTTA ACAGTTGAGT GTGTAAAGAA ATTATGAAAC TTTTGTTGAA 2161 AGAAAACGAA GAGTTTCTGA ATGTACATTT AACGTTCCTA AAAAAAAAAA AAAAAAAAAA 2221 AAAAAAAAAA AAAAAAAAAA TTTGGGGGGG GGGGGGGGGA CCCATTTCCC C Predicted gene structure (within gDNA segment 5432 to 286): Exon 1 1665 1366 ( 300 n); cDNA 1903 2201 ( 299 n); score: 0.977 PPA cDNA 2221 2241 MATCH C06HBa0054K13.1-7- SGN-U320935+ 0.977 300 0.132 C PGS_C06HBa0054K13.1-7-_SGN-U320935+ (1665 1366) Alignment (genomic DNA sequence = upper lines): ACTGGTGCAA AGCTGTGTGT AGAAGATAGT AGGATAAGAA TGAAAGTGAA GTACTCCCTC 1606 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACTGGTGCAA AGCTGTGTGT AGAAGATAGT AGGATAAGAA TGAAAGTGAA GTACTCCCTC 1962 TTAACCATAC TTACAGAGGA GTCATTGTTG ATATAAACGA TCGAAAGATT TCATTTCAAG 1546 |||||||||| |||||||||| |||||||||| |||||||||| | |||||||| |||||||||| TTAACCATAC TTACAGAGGA GTCATTGTTG ATATAAACGA TTGAAAGATT TCATTTCAAG 2022 CTTTATCTAT CTTTCTCTAT AGGACATTGT GTAAACTTCA TCTGATATTT GGGCAGATAT 1486 ||||||| || |||||||||| |||||||||| ||| | |||| |||||||||| | |||||||| CTTTATCCAT CTTTCTCTAT AGGACATTGT GTATATTTCA TCTGATATTT GCGCAGATAT 2082 GCAGAAAGTG TTATAGCAAA GGTCAAATTG ACCCAGTTAA CAGTTGAGTG TGTAAAGAAA 1426 |||||||||| ||||||| || |||||||||| |||||||||| |||||||||| |||||||||| GCAGAAAGTG TTATAGC-AA GGTCAAATTG ACCCAGTTAA CAGTTGAGTG TGTAAAGAAA 2141 
TTATGAAACT TTTGTTGAAA GAAAATGAAG AGTTTCTGAA TGTACATTTA ACGTTCCTAA 1366 |||||||||| |||||||||| ||||| |||| |||||||||| |||||||||| |||||||||| TTATGAAACT TTTGTTGAAA GAAAACGAAG AGTTTCTGAA TGTACATTTA ACGTTCCTAA 2201 hqPGS_C06HBa0054K13.1-7-_SGN-U320935+ (1665 1366) ******************************************************************************** EST sequence 1 -strand 1339 n (File: SGN-U345363-) 1 GAGCTGTCTT ATACAGTATT GAATTGCCTT AATGTCTTGA TAAGTTAAGG TAATTCTCCA 61 TGTTGTATTC TGTGTACGTA TTGTTAGTAT TAATTATCTG TGTTATGGAG GAGGATGAAG 121 TATATATTGA CGATTTAATG TATAAATAGC GTGGAGGTGG ATATTATATT GAGTGTATGT 181 GTAGATGTGT TTGAAATTGT TTTTGGAGGA AGATGTGTTA AATAAATTAT TTACGTTATA 241 GATTATATAT ATGAGTGTGT TGGATGGACT ACAGTATTGT TAATTGTGGC TAAAGTTCAT 301 GTATATACTT TCGGGTGGAG GCAAGAGACA ACAAGAACGA AAGCGGAGCG ACAGGGCAAA 361 ATAAAGGAGG GGAACAGAGG GAGGAGAGAG GGGGGGGGAA AAGGACAAGA AAGGACAGCA 421 CAAGCGGAGA GGAACAAGAG AAGAGAGGAG GAGGGCCAGG GAAAGACGAG ACGCCACAAA 481 AGAAAAGAGC GGAGGNCGGG GGNGNNNNNA NNNNNNNANN NNNNNNNNNN NNNNNNNNNN 541 NNNNNNNNNN CNNNNNNNNN GNNNNGNNNN NNNNNNNNNN NNNNNTTTTT TTTTTTGGGG 601 AACAAACCCC CCCCCGAAAT TTTTTTTTCT TTTCTTTCCC CCGGTTTTTT GGGGGCCAAA 661 TTAAAAAAAT AAAGCCCCCC CTTAGGGGGG GAATTTTTTT TTTTTTGTTT TTCCCCCCCT 721 TTTTCCCCCC TTTTTTTTTT ATTTCATAAA ATAAATTATT TCTAAACATT CACCATTTTT 781 ATACAAAGTG ATTGTTGATA AAGGTTGTAT ATTGACTTTT CGGAAAACTA CTCTAGAGAT 841 GGACTTAAGA CACTGCTTTT TTCGTCCGGA ATTGTTTATT ATGTTACGCT TTTTAAAAGT 901 TAATTTGATT AATTTTTAAA GTTAAATTAG ATCATATTAA TTTGATATTT TTGTAGAAAA 961 AATTAGATAT TCTAAAACTA TAGAAAAAAT ACTATAAAAT TACAATTTTT TACGTATCAA 1021 TATGATAAAA AATTACATCT TAAAATGTTA GTCAAAATTT TCGTAATTTG ACTCTAAAAA 1081 TAGAAATCTT GACAAACAAT TTCGGGCAAA AAAAGTAATT TACATCAATG CATCGAGTGC 1141 CAATGATTTC AATATCGGAT TTGTGCAGTT TAACTTGGGA AAGCACTTAC AAACTGCATT 1201 TTCTTTTTGC CATTTCGCCG ATTAGATTGA GAGTATCGAT TTTTTTTAAG GATTTTCGGT 1261 GTTTATGGAT AAAACTTTTA TAGATTTTTT TAAGGGGTGT GGAGAGGTGG TGGTATCGCG 1321 CTGATAAGCT TTTTAAAGT Predicted gene structure (within gDNA 
segment 1 to 5432): Exon 1 1781 1829 ( 49 n); cDNA 790 837 ( 48 n); score: 0.612 Intron 1 1830 1889 ( 60 n); Pd: 0.940 (s: 0.61), Pa: 0.000 (s: 0) Exon 2 1890 1895 ( 6 n); cDNA 838 843 ( 6 n); score: 0.667 Intron 2 1896 2222 ( 327 n); Pd: 0.512 (s: 0), Pa: 0.000 (s: 0.70) Exon 3 2223 2346 ( 124 n); cDNA 844 966 ( 123 n); score: 0.778 Intron 3 2347 2511 ( 165 n); Pd: 0.900 (s: 0.85), Pa: 0.000 (s: 0.63) Exon 4 2512 2552 ( 41 n); cDNA 967 1006 ( 40 n); score: 0.634 MATCH C06HBa0054K13.1-7+ SGN-U345363- 0.778 220 0.164 C PGS_C06HBa0054K13.1-7+_SGN-U345363- (1781 1829,1890 1895,2223 2346,2512 2552) Alignment (genomic DNA sequence = upper lines): GATTGGAGTC AGTGGCCCGT GATTGCATTC TCCAAAAACG TACTCTTGAG TAACGTTTTA 1840 ||||| | | || |||| || || ||||| |||||| || GATTGTTGAT AAAGGTTGTA TATTGACTTT TCGGAAAAC- TACTCTAGA. .......... 837 CACCTAAAAC GTTACTCACA GTTACGTTTA TGCGTTTAAC GTCGTCAACC ATGAAGTACA 1900 ||| | .......... .......... .......... .......... .........G ATGGA..... 843 TTTATACTAG TTTTGTGGAA AAAATTAAAA ACGTATTATT TTGGTAAATT GGTTTTAAAA 1960 .......... .......... .......... .......... .......... .......... 843 ATAGCTCATT TTGGTCAAAA ACTCCTTTTT ATAATGAAAG ACTCTTGAAA GCAATTGAGT 2020 .......... .......... .......... .......... .......... .......... 843 CTATTTAAAT GTTATTTTAT TCTACCTTTT ATTTTCTTCC TTTAACTTCT TGTTCAAGTA 2080 .......... .......... .......... .......... .......... .......... 843 AAAAAAGTTT TTTCAATTTT GAAACATAAA AGAATAATAT CATAACTTAT AGTTATTAGT 2140 .......... .......... .......... .......... .......... .......... 843 TATATATAAC ATTTTCACAT AATAATATTA TTATTATTCA CCTTTATCGT TCTAATTTAT 2200 .......... .......... .......... .......... .......... .......... 843 ATTTCATTTC TAGATTATAT CTCTCATACA TATATACTTT CTCCGTCCGA AATTGTTTGT 2260 || | | | | | |||| | |||||| |||||||| | .......... .......... 
..CTTA-AGA CA-CTGCTTT TTTCGTCCGG AATTGTTTAT 879 TTAATTGCGC TTTTCGAAAG TCAATTAGAC TAATTTTTAA AGTTAAATTG GATCACATTA 2320 | || ||| |||| |||| | |||| || |||||||||| ||||||||| ||||| |||| TATGTTACGC TTTTTAAAAG TTAATTTGAT TAATTTTTAA AGTTAAATTA GATCATATTA 939 ATTTGATA-T TTTAAACAAA AAATTAGATA TTCTTTTTAT TAGCGTACCT ATTTAATATC 2379 |||||||| | ||| | ||| ||||||| ATTTGATATT TTTGTAGAAA AAATTAG... .......... .......... .......... 966 TACAACTAGT TATATATCAG ATTAGATGAC TGATAAGTGT CAAAGATAGA CTTTCCACGA 2439 .......... .......... .......... .......... .......... .......... 966 CATATGAAAG AAAGTAGTCA ATATATTGAA TGAAGGGAGG GAGCCTTAAT TAGATAAGGT 2499 .......... .......... .......... .......... .......... .......... 966 TGTTGGAGAA AAAAACTTTG AAATGTTACA ATAATTTCGA TAAAAAATAG AAT 2552 | | | | ||| || | | || | | | | |||| || ||| .......... ..ATATTCTA AAACTATAGA AAAAATACTA T-AAAATTAC AAT 1006 hqPGS_C06HBa0054K13.1-7+_SGN-U345363- (2223 2346) Total number of EST alignments reported: 2 ________________________________________________________________________________ Predicted gene locations (2) in segment 1 to 5432: PGL 1 (- strand): 1665 1366 AGS-1 (1665 1366) SCR (e 0.977) Exon 1 1665 1366 ( 300 n); score: 0.977 PGS (1665 1366) SGN-U320935+ 3-phase translation of AGS-1 (-strand): . . . . . . 1665 ACTGGTGCAAAGCTGTGTGTAGAAGATAGTAGGATAAGAATGAAAGTGAAGTACTCCCTC T G A K L C V E D S R I R M K V K Y S L L V Q S C V - K I V G - E - K - S T P S W C K A V C R R - - D K N E S E V L P . . . . . . 1605 TTAACCATACTTACAGAGGAGTCATTGTTGATATAAACGATCGAAAGATTTCATTTCAAG L T I L T E E S L L I - T I E R F H F K - P Y L Q R S H C - Y K R S K D F I S S L N H T Y R G V I V D I N D R K I S F Q . . . . . . 1545 CTTTATCTATCTTTCTCTATAGGACATTGTGTAAACTTCATCTGATATTTGGGCAGATAT L Y L S F S I G H C V N F I - Y L G R Y F I Y L S L - D I V - T S S D I W A D M A L S I F L Y R T L C K L H L I F G Q I . . . . . . 
1485 GCAGAAAGTGTTATAGCAAAGGTCAAATTGACCCAGTTAACAGTTGAGTGTGTAAAGAAA A E S V I A K V K L T Q L T V E C V K K Q K V L - Q R S N - P S - Q L S V - R N C R K C Y S K G Q I D P V N S - V C K E . . . . . . 1425 TTATGAAACTTTTGTTGAAAGAAAATGAAGAGTTTCTGAATGTACATTTAACGTTCCTAA L - N F C - K K M K S F - M Y I - R S - Y E T F V E R K - R V S E C T F N V P I M K L L L K E N E E F L N V H L T F L Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-7-_PGL-1_AGS-1_PPS_1 (1633 1439) (frame '0'; 192 bp, 64 residues) 1 DKNESEVLPL NHTYRGVIVD INDRKISFQA LSIFLYRTLC KLHLIFGQIC RKCYSKGQID 61 PVNS- 3-phase translation of AGS-1 (+strand): . . . . . . 1366 TTAGGAACGTTAAATGTACATTCAGAAACTCTTCATTTTCTTTCAACAAAAGTTTCATAA L G T L N V H S E T L H F L S T K V S - - E R - M Y I Q K L F I F F Q Q K F H N R N V K C T F R N S S F S F N K S F I . . . . . . 1426 TTTCTTTACACACTCAACTGTTAACTGGGTCAATTTGACCTTTGCTATAACACTTTCTGC F L Y T L N C - L G Q F D L C Y N T F C F F T H S T V N W V N L T F A I T L S A I S L H T Q L L T G S I - P L L - H F L . . . . . . 1486 ATATCTGCCCAAATATCAGATGAAGTTTACACAATGTCCTATAGAGAAAGATAGATAAAG I S A Q I S D E V Y T M S Y R E R - I K Y L P K Y Q M K F T Q C P I E K D R - S H I C P N I R - S L H N V L - R K I D K . . . . . . 1546 CTTGAAATGAAATCTTTCGATCGTTTATATCAACAATGACTCCTCTGTAAGTATGGTTAA L E M K S F D R L Y Q Q - L L C K Y G - L K - N L S I V Y I N N D S S V S M V K A - N E I F R S F I S T M T P L - V W L . . . . . . 1606 GAGGGAGTACTTCACTTTCATTCTTATCCTACTATCTTCTACACACAGCTTTGCACCAGT E G V L H F H S Y P T I F Y T Q L C T S R E Y F T F I L I L L S S T H S F A P R G S T S L S F L S Y Y L L H T A L H Q Maximal non-overlapping open reading frames (>= 64 codons): none PGL 2 (+ strand): 2223 2346 AGS-1 (2223 2346) SCR (e 0.778) Exon 1 2223 2346 ( 124 n); score: 0.778 PGS (2223 2346) SGN-U345363- 3-phase translation of AGS-1 (+strand): . . . . . . 
2223 CTCATACATATATACTTTCTCCGTCCGAAATTGTTTGTTTAATTGCGCTTTTCGAAAGTC L I H I Y F L R P K L F V - L R F S K V S Y I Y T F S V R N C L F N C A F R K S H T Y I L S P S E I V C L I A L F E S . . . . . . 2283 AATTAGACTAATTTTTAAAGTTAAATTGGATCACATTAATTTGATATTTTAAACAAAAAA N - T N F - S - I G S H - F D I L N K K I R L I F K V K L D H I N L I F - T K N Q L D - F L K L N W I T L I - Y F K Q K . 2343 TTAG L - I Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-1 (-strand): . . . . . . 2346 CTAATTTTTTGTTTAAAATATCAAATTAATGTGATCCAATTTAACTTTAAAAATTAGTCT L I F C L K Y Q I N V I Q F N F K N - S - F F V - N I K L M - S N L T L K I S L N F L F K I S N - C D P I - L - K L V . . . . . . 2286 AATTGACTTTCGAAAAGCGCAATTAAACAAACAATTTCGGACGGAGAAAGTATATATGTA N - L S K S A I K Q T I S D G E S I Y V I D F R K A Q L N K Q F R T E K V Y M Y - L T F E K R N - T N N F G R R K Y I C . 2226 TGAG - E M Maximal non-overlapping open reading frames (>= 64 codons): none ... finished at: Mon Aug 28 22:24:26 2006 ________________________________________________________________________________ Sequence 8: C06HBa0054K13.1-8, from 1 to 2791, both strands analyzed. ... started at: Mon Aug 28 22:24:26 2006 EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA +strand ... ... found all matches, elapsed seconds = 2 ... matches indexed, elapsed seconds = 2 HitsTableSize = 0 EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA -strand ... ... found all matches, elapsed seconds = 1 ... 
matches indexed, elapsed seconds = 1 HitsTableSize = 1 ******************************************************************************** EST sequence 1 +strand 888 n (File: SGN-U340965+) 1 ATTAGCTGGA GCTCCCCGCG GTGGCGGCCG CTCTAGAACT AGTGGATCCC CCGGGCTGCA 61 GGAATTCGGC ACGAGGGGGA AAGATAATTG GTGGTGGTCC AAGGGCAGGA AATTGAGGTG 121 GGAACCGATA GTTCAAGCTC AAGAAATTGG GGTTTTGTTA TTGCAATTGG GGATTGTGAT 181 GTTTGTTATG AGATTACTAA GACCCGGTTT GCCATTACCC GGGTCGGATC CTAGAGCTCC 241 GACAATGTTT GTAACTGTTC CATATAGTGA GTTTTTGAGT AAGATAAATA GTAATCAGGT 301 GCAGAAAGTT GAGGTTGATG GTGTACATAT AATGTTCAAA TTGAAGAGTG AAGTGAGTAG 361 TAGTGTAATA GAGACTGAGG TTGTGAATGT GAATGAAAAT GGAAATAGTA AGTTGCAAGA 421 TTCTGAGGCA GTGATAAGGA GTGTAACTCC TACAAAGAAA ATTGTGTATA CTACCACGAG 481 GCCGAGTGAT ATAAAGACCC CTTATGAGAA AATGCTTGAG AATGATGTTG AGTTTGGTTC 541 TCCCGATAAA CGGTCTGGTG GATTCATGAA CTCTGCACTG ATAACATTAT TTTATATTGC 601 TGTACTAGCG GGGCTTCTTC ATCGCTTCCC AGAGAATTTT TCTCAGAGCA CAGCTGGCCA 661 ACTCAAAAAT CGCAAGTCAG GGGGTTCAAG TGGCACAAAA GTGTCTGAAC TAAGGGAAAA 721 TATCACATTT GCTGATGTTG CCGGCGTTGA CTAAAGCTAA TGAGGAACCT AAAAGAAAAT 781 GTGGAAATTC CTTAAAAATT CAGAAAAATA TGTACCGCTT TGGTGCACGT CCTTCCCCCG 841 GGGGGTTCTA CTGGGTGGGC CTCCCGGGGG AACGGAAAAA AACTTCTN Predicted gene structure (within gDNA segment 2769 to 1): Exon 1 1408 906 ( 503 n); cDNA 78 580 ( 503 n); score: 1.000 MATCH C06HBa0054K13.1-8- SGN-U340965+ 1.000 503 0.566 C PGS_C06HBa0054K13.1-8-_SGN-U340965+ (1408 906) Alignment (genomic DNA sequence = upper lines): GGAAAGATAA TTGGTGGTGG TCCAAGGGCA GGAAATTGAG GTGGGAACCG ATAGTTCAAG 1349 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGAAAGATAA TTGGTGGTGG TCCAAGGGCA GGAAATTGAG GTGGGAACCG ATAGTTCAAG 137 CTCAAGAAAT TGGGGTTTTG TTATTGCAAT TGGGGATTGT GATGTTTGTT ATGAGATTAC 1289 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTCAAGAAAT TGGGGTTTTG TTATTGCAAT TGGGGATTGT GATGTTTGTT ATGAGATTAC 197 TAAGACCCGG TTTGCCATTA CCCGGGTCGG ATCCTAGAGC TCCGACAATG TTTGTAACTG 1229 |||||||||| |||||||||| 
|||||||||| |||||||||| |||||||||| |||||||||| TAAGACCCGG TTTGCCATTA CCCGGGTCGG ATCCTAGAGC TCCGACAATG TTTGTAACTG 257 TTCCATATAG TGAGTTTTTG AGTAAGATAA ATAGTAATCA GGTGCAGAAA GTTGAGGTTG 1169 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCCATATAG TGAGTTTTTG AGTAAGATAA ATAGTAATCA GGTGCAGAAA GTTGAGGTTG 317 ATGGTGTACA TATAATGTTC AAATTGAAGA GTGAAGTGAG TAGTAGTGTA ATAGAGACTG 1109 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATGGTGTACA TATAATGTTC AAATTGAAGA GTGAAGTGAG TAGTAGTGTA ATAGAGACTG 377 AGGTTGTGAA TGTGAATGAA AATGGAAATA GTAAGTTGCA AGATTCTGAG GCAGTGATAA 1049 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGGTTGTGAA TGTGAATGAA AATGGAAATA GTAAGTTGCA AGATTCTGAG GCAGTGATAA 437 GGAGTGTAAC TCCTACAAAG AAAATTGTGT ATACTACCAC GAGGCCGAGT GATATAAAGA 989 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGAGTGTAAC TCCTACAAAG AAAATTGTGT ATACTACCAC GAGGCCGAGT GATATAAAGA 497 CCCCTTATGA GAAAATGCTT GAGAATGATG TTGAGTTTGG TTCTCCCGAT AAACGGTCTG 929 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCCCTTATGA GAAAATGCTT GAGAATGATG TTGAGTTTGG TTCTCCCGAT AAACGGTCTG 557 GTGGATTCAT GAACTCTGCA CTG 906 |||||||||| |||||||||| ||| GTGGATTCAT GAACTCTGCA CTG 580 hqPGS_C06HBa0054K13.1-8-_SGN-U340965+ (1408 906) Total number of EST alignments reported: 1 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 2791: PGL 1 (- strand): 1408 906 AGS-1 (1408 906) SCR (e 1.000) Exon 1 1408 906 ( 503 n); score: 1.000 PGS (1408 906) SGN-U340965+ 3-phase translation of AGS-1 (-strand): . . . . . . 1408 GGAAAGATAATTGGTGGTGGTCCAAGGGCAGGAAATTGAGGTGGGAACCGATAGTTCAAG G K I I G G G P R A G N - G G N R - F K E R - L V V V Q G Q E I E V G T D S S S K D N W W W S K G R K L R W E P I V Q . . . . . . 
1348 CTCAAGAAATTGGGGTTTTGTTATTGCAATTGGGGATTGTGATGTTTGTTATGAGATTAC L K K L G F C Y C N W G L - C L L - D Y S R N W G F V I A I G D C D V C Y E I T A Q E I G V L L L Q L G I V M F V M R L . . . . . . 1288 TAAGACCCGGTTTGCCATTACCCGGGTCGGATCCTAGAGCTCCGACAATGTTTGTAACTG - D P V C H Y P G R I L E L R Q C L - L K T R F A I T R V G S - S S D N V C N C L R P G L P L P G S D P R A P T M F V T . . . . . . 1228 TTCCATATAGTGAGTTTTTGAGTAAGATAAATAGTAATCAGGTGCAGAAAGTTGAGGTTG F H I V S F - V R - I V I R C R K L R L S I - - V F E - D K - - S G A E S - G - V P Y S E F L S K I N S N Q V Q K V E V . . . . . . 1168 ATGGTGTACATATAATGTTCAAATTGAAGAGTGAAGTGAGTAGTAGTGTAATAGAGACTG M V Y I - C S N - R V K - V V V - - R L W C T Y N V Q I E E - S E - - C N R D - D G V H I M F K L K S E V S S S V I E T . . . . . . 1108 AGGTTGTGAATGTGAATGAAAATGGAAATAGTAAGTTGCAAGATTCTGAGGCAGTGATAA R L - M - M K M E I V S C K I L R Q - - G C E C E - K W K - - V A R F - G S D K E V V N V N E N G N S K L Q D S E A V I . . . . . . 1048 GGAGTGTAACTCCTACAAAGAAAATTGTGTATACTACCACGAGGCCGAGTGATATAAAGA G V - L L Q R K L C I L P R G R V I - R E C N S Y K E N C V Y Y H E A E - Y K D R S V T P T K K I V Y T T T R P S D I K . . . . . . 988 CCCCTTATGAGAAAATGCTTGAGAATGATGTTGAGTTTGGTTCTCCCGATAAACGGTCTG P L M R K C L R M M L S L V L P I N G L P L - E N A - E - C - V W F S R - T V W T P Y E K M L E N D V E F G S P D K R S . . . 928 GTGGATTCATGAACTCTGCACTG V D S - T L H W I H E L C T G G F M N S A L Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-8-_PGL-1_AGS-1_PPS_1 (1406 906) (frame '0'; 501 bp, 167 residues) 1 KDNWWWSKGR KLRWEPIVQA QEIGVLLLQL GIVMFVMRLL RPGLPLPGSD PRAPTMFVTV 61 PYSEFLSKIN SNQVQKVEVD GVHIMFKLKS EVSSSVIETE VVNVNENGNS KLQDSEAVIR 121 SVTPTKKIVY TTTRPSDIKT PYEKMLENDV EFGSPDKRSG GFMNSAL 3-phase translation of AGS-1 (+strand): . . . . . . 
906 CAGTGCAGAGTTCATGAATCCACCAGACCGTTTATCGGGAGAACCAAACTCAACATCATT Q C R V H E S T R P F I G R T K L N I I S A E F M N P P D R L S G E P N S T S F V Q S S - I H Q T V Y R E N Q T Q H H . . . . . . 966 CTCAAGCATTTTCTCATAAGGGGTCTTTATATCACTCGGCCTCGTGGTAGTATACACAAT L K H F L I R G L Y I T R P R G S I H N S S I F S - G V F I S L G L V V V Y T I S Q A F S H K G S L Y H S A S W - Y T Q . . . . . . 1026 TTTCTTTGTAGGAGTTACACTCCTTATCACTGCCTCAGAATCTTGCAACTTACTATTTCC F L C R S Y T P Y H C L R I L Q L T I S F F V G V T L L I T A S E S C N L L F P F S L - E L H S L S L P Q N L A T Y Y F . . . . . . 1086 ATTTTCATTCACATTCACAACCTCAGTCTCTATTACACTACTACTCACTTCACTCTTCAA I F I H I H N L S L Y Y T T T H F T L Q F S F T F T T S V S I T L L L T S L F N H F H S H S Q P Q S L L H Y Y S L H S S . . . . . . 1146 TTTGAACATTATATGTACACCATCAACCTCAACTTTCTGCACCTGATTACTATTTATCTT F E H Y M Y T I N L N F L H L I T I Y L L N I I C T P S T S T F C T - L L F I L I - T L Y V H H Q P Q L S A P D Y Y L S . . . . . . 1206 ACTCAAAAACTCACTATATGGAACAGTTACAAACATTGTCGGAGCTCTAGGATCCGACCC T Q K L T I W N S Y K H C R S S R I R P L K N S L Y G T V T N I V G A L G S D P Y S K T H Y M E Q L Q T L S E L - D P T . . . . . . 1266 GGGTAATGGCAAACCGGGTCTTAGTAATCTCATAACAAACATCACAATCCCCAATTGCAA G - W Q T G S - - S H N K H H N P Q L Q G N G K P G L S N L I T N I T I P N C N R V M A N R V L V I S - Q T S Q S P I A . . . . . . 1326 TAACAAAACCCCAATTTCTTGAGCTTGAACTATCGGTTCCCACCTCAATTTCCTGCCCTT - Q N P N F L S L N Y R F P P Q F P A L N K T P I S - A - T I G S H L N F L P L I T K P Q F L E L E L S V P T S I S C P . . . 1386 GGACCACCACCAATTATCTTTCC G P P P I I F D H H Q L S F W T T T N Y L S Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-8+_PGL-1_AGS-1_PPS_1 (906 1271) (frame '1'; 363 bp, 121 residues) 1 QCRVHESTRP FIGRTKLNII LKHFLIRGLY ITRPRGSIHN FLCRSYTPYH CLRILQLTIS 61 IFIHIHNLSL YYTTTHFTLQ FEHYMYTINL NFLHLITIYL TQKLTIWNSY KHCRSSRIRP 121 G- ... 
finished at: Mon Aug 28 22:24:30 2006 ________________________________________________________________________________ Sequence 9: C06HBa0054K13.1-9, from 1 to 2289, both strands analyzed. ... started at: Mon Aug 28 22:24:30 2006 EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA +strand ... ... found all matches, elapsed seconds = 1 ... matches indexed, elapsed seconds = 1 HitsTableSize = 0 EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA -strand ... ... found all matches, elapsed seconds = 2 ... matches indexed, elapsed seconds = 2 HitsTableSize = 0 No significant EST matches were found. ... finished at: Mon Aug 28 22:24:33 2006 ________________________________________________________________________________ Sequence 10: C06HBa0054K13.1-10, from 1 to 3166, both strands analyzed. ... started at: Mon Aug 28 22:24:33 2006 EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA +strand ... ... found all matches, elapsed seconds = 2 ... matches indexed, elapsed seconds = 2 HitsTableSize = 2 EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA -strand ... ... found all matches, elapsed seconds = 1 ... 
matches indexed, elapsed seconds = 1 HitsTableSize = 0 ******************************************************************************** EST sequence 2 +strand 888 n (File: SGN-U340965+) 1 ATTAGCTGGA GCTCCCCGCG GTGGCGGCCG CTCTAGAACT AGTGGATCCC CCGGGCTGCA 61 GGAATTCGGC ACGAGGGGGA AAGATAATTG GTGGTGGTCC AAGGGCAGGA AATTGAGGTG 121 GGAACCGATA GTTCAAGCTC AAGAAATTGG GGTTTTGTTA TTGCAATTGG GGATTGTGAT 181 GTTTGTTATG AGATTACTAA GACCCGGTTT GCCATTACCC GGGTCGGATC CTAGAGCTCC 241 GACAATGTTT GTAACTGTTC CATATAGTGA GTTTTTGAGT AAGATAAATA GTAATCAGGT 301 GCAGAAAGTT GAGGTTGATG GTGTACATAT AATGTTCAAA TTGAAGAGTG AAGTGAGTAG 361 TAGTGTAATA GAGACTGAGG TTGTGAATGT GAATGAAAAT GGAAATAGTA AGTTGCAAGA 421 TTCTGAGGCA GTGATAAGGA GTGTAACTCC TACAAAGAAA ATTGTGTATA CTACCACGAG 481 GCCGAGTGAT ATAAAGACCC CTTATGAGAA AATGCTTGAG AATGATGTTG AGTTTGGTTC 541 TCCCGATAAA CGGTCTGGTG GATTCATGAA CTCTGCACTG ATAACATTAT TTTATATTGC 601 TGTACTAGCG GGGCTTCTTC ATCGCTTCCC AGAGAATTTT TCTCAGAGCA CAGCTGGCCA 661 ACTCAAAAAT CGCAAGTCAG GGGGTTCAAG TGGCACAAAA GTGTCTGAAC TAAGGGAAAA 721 TATCACATTT GCTGATGTTG CCGGCGTTGA CTAAAGCTAA TGAGGAACCT AAAAGAAAAT 781 GTGGAAATTC CTTAAAAATT CAGAAAAATA TGTACCGCTT TGGTGCACGT CCTTCCCCCG 841 GGGGGTTCTA CTGGGTGGGC CTCCCGGGGG AACGGAAAAA AACTTCTN Predicted gene structure (within gDNA segment 1 to 3166): Exon 1 1573 1638 ( 66 n); cDNA 581 646 ( 66 n); score: 0.970 Intron 1 1639 1735 ( 97 n); Pd: 0.999 (s: 0.98), Pa: 0.697 (s: 0.96) Exon 2 1736 1870 ( 135 n); cDNA 647 783 ( 137 n); score: 0.881 Intron 2 1871 2003 ( 133 n); Pd: 0.995 (s: 0.75), Pa: 0.998 (s: 0.71) Exon 3 2004 2069 ( 66 n); cDNA 784 854 ( 71 n); score: 0.674 Intron 3 2070 2408 ( 339 n); Pd: 0.981 (s: 0.65), Pa: 1.000 (s: 0) Exon 4 2409 2440 ( 32 n); cDNA 855 887 ( 33 n); score: 0.703 MATCH C06HBa0054K13.1-10+ SGN-U340965+ 0.852 299 0.337 C PGS_C06HBa0054K13.1-10+_SGN-U340965+ (1573 1638,1736 1870,2004 2069,2409 2440) Alignment (genomic DNA sequence = upper lines): ATAGCATTAT TTTATATTGC TGTACTAGCG GGGCTTCTTC ATCGCTTCCC AGTGAATTTT 1632 ||| |||||| 
|||||||||| |||||||||| |||||||||| |||||||||| || ||||||| ATAACATTAT TTTATATTGC TGTACTAGCG GGGCTTCTTC ATCGCTTCCC AGAGAATTTT 640 TCTCAGGTAC ATACAATTAT TATTCAAAAT ATGTGTGTGA TTATTGTATG TCCGTGATAT 1692 |||||| TCTCAG.... .......... .......... .......... .......... .......... 646 TTCTGAATAA CTCACCAACT TCATTGTGGC TGCTTAATAA CAGAGCACAG CTGGCCAACT 1752 ||||||| |||||||||| .......... .......... .......... .......... ...AGCACAG CTGGCCAACT 663 CAGAAATCGC AAGTCAGGGG GTTCAGGTGG CACAAAAGTG TCTGAACTAG GGGAAACTAT 1812 || ||||||| |||||||||| ||||| |||| |||||||||| ||||||||| |||||| ||| CAAAAATCGC AAGTCAGGGG GTTCAAGTGG CACAAAAGTG TCTGAACTAA GGGAAAATAT 723 CACATTTGCT GATGTTGCAG GCGTTGAC-G AGGCTAAGGA GG-AGCTAGA AGAAATTGTG 1870 |||||||||| |||||||| | |||||||| | ||||| || || | ||| | ||||| |||| CACATTTGCT GATGTTGCCG GCGTTGACTA AAGCTAATGA GGAACCTAAA AGAAAATGTG 783 GTATTTTTTA TTTTCATTGG TGAACGTTTC TTAACAACAT ATCTGCTATA AGTGCTGCAT 1930 .......... .......... .......... .......... .......... .......... 783 TTTGATACCA TAAGAAGTTG ATGAACAGTT AATAATTCAT TGTTGTGGTG TTTTTAATTG 1990 .......... .......... .......... .......... .......... .......... 783 CTTTTTTCCC CAGG-AATTT CTTAGAAATC CAGATAAGTA TGTACGGC-T TGGTGCACGT 2048 | |||| |||| |||| |||| || || ||||| || | |||||||||| .......... ...GAAATTC CTTAAAAATT CAGAAAAATA TGTACCGCTT TGGTGCACGT 830 CC-T-CCTCG TGGTGTTCTA CT-GGTGAGT TTCAAATTTG ACGTAGTAAT GTGAATGCTA 2105 || | || || || |||||| || | CCTTCCCCCG GGGGGTTCTA CTGG...... .......... .......... .......... 854 GGTGGTTATC AAAGTGAGAA AGCTTTGAGA ATAATGTACA ATAATTGGAG GGTTCGAATA 2165 .......... .......... .......... .......... .......... .......... 854 GTAACTCATT GAAACTTCTG ATCTTTTGTT CTTGGTTTTT TTCCTGTTTC TGAAGGGATT 2225 .......... .......... .......... .......... .......... .......... 854 CTCTCTCTAT CCCTTCATGT CACTTTGTGA TGCCTCTATG GTCAAATCAC ATCATTATTT 2285 .......... .......... .......... .......... .......... .......... 854 CCTGCTAGTT TTGTTATGCG ACTATTGACT GTCAGACCAT TCTACTTGGG GACACCATTT 2345 .......... .......... 
.......... .......... .......... .......... 854 GATTAGTTTT ATGCTTTGTG ATTCTCAATG TTTAGATTAT AAAAATGGCT TTAATTGTTA 2405 .......... .......... .......... .......... .......... .......... 854 CAGGTTGGTC TCCC-CGGGA CAGGAAAGAC ACTTCT 2440 || || | |||| |||| ||||| | |||||| ...GTGGGCC TCCCGGGGGA ACGGAAAAAA ACTTCT 887 hqPGS_C06HBa0054K13.1-10+_SGN-U340965+ (1573 1638,1736 1870,2004 2069,2409 2440) ******************************************************************************** EST sequence 1 +strand 800 n (File: SGN-U330578+) 1 TACTAGCGGG GCTTCTTCAT CGCTTCCCAG TGAATTTTTC TCAGAGCACA GCTGGCCAAC 61 TCAGAAATCG CAAGTCAGGG GGTTCAGGTG GCACAAAAGT GTCTGAACTA GGGGAAACTA 121 TCACATTTGC TGATGTTGCA GGCGTTGACG AGGCTAAGGA GGAGCTAGAA GAAATTGTGG 181 AATTTCTTAG AAATCCAGAT AAGTATGTAC GGCTTGGTGC ACGTCCTCCT CGTGGTGTTC 241 TACTGGTTGG TCTCCCCGGG ACAGGAAAGA CACTTCTAGC AAAGGCTGTT GCTGGGGAAG 301 CTGAGGTTCC TTTTATCAGT TGTTCTGCAA GTGAGTTTGT AGAATTGTAT GTAGGAATGG 361 GAGCATCACG CGTCCGTGAC CTGTTTGCAC GGGCAAAGAA GGAGGCACCT TCAATAATTT 421 TTATCGATGA GATAGATGCT GTGGCAAAAA GCCGTGATGG AAAATTCCGC ATTGTAAGCA 481 ATGATGAAAG AGAGCAGACA TTGAACCAAC TACTCACTGA GATGGACGGA TTTGACAGTA 541 ATTCTGCTGT AATTGTTCTT GGAGCAACAA ATCGCTCTGA TGTCTTAGAC CCTGCTCTTC 601 GCCGACCTGG GAGATTTGAC CGTGTTGTAA TGGTGGAAGC GCCTGATAGG TGTGGAAGAG 661 AAGCTATCTT AAAGGTACAT GTCTCCAAGA AAGAACTTCC CTTGGCACAA GATGTTGATC 721 TTGGTAACAT CGCTTCTATG ACTACTGGTT TTACGGGGGC AGATCTTGCA AATCTGGTGA 781 ATGAAGCTGC TCTGTTGGCA Predicted gene structure (within gDNA segment 995 to 3166): Exon 1 1595 1638 ( 44 n); cDNA 1 44 ( 44 n); score: 1.000 Intron 1 1639 1735 ( 97 n); Pd: 0.999 (s: 1.00), Pa: 0.697 (s: 1.00) Exon 2 1736 1870 ( 135 n); cDNA 45 179 ( 135 n); score: 1.000 Intron 2 1871 2003 ( 133 n); Pd: 0.995 (s: 1.00), Pa: 0.998 (s: 1.00) Exon 3 2004 2069 ( 66 n); cDNA 180 245 ( 66 n); score: 1.000 Intron 3 2070 2408 ( 339 n); Pd: 0.981 (s: 1.00), Pa: 1.000 (s: 1.00) Exon 4 2409 2594 ( 186 n); cDNA 246 431 ( 186 n); score: 1.000 Intron 4 2595 2748 ( 154 n); 
Pd: 0.950 (s: 1.00), Pa: 0.998 (s: 1.00) Exon 5 2749 2835 ( 87 n); cDNA 432 518 ( 87 n); score: 1.000 Intron 5 2836 2912 ( 77 n); Pd: 0.882 (s: 1.00), Pa: 0.992 (s: 1.00) Exon 6 2913 3026 ( 114 n); cDNA 519 632 ( 114 n); score: 1.000 MATCH C06HBa0054K13.1-10+ SGN-U330578+ 1.000 632 0.790 C PGS_C06HBa0054K13.1-10+_SGN-U330578+ (1595 1638,1736 1870,2004 2069,2409 2594,2749 2835,2913 3026) Alignment (genomic DNA sequence = upper lines): TACTAGCGGG GCTTCTTCAT CGCTTCCCAG TGAATTTTTC TCAGGTACAT ACAATTATTA 1654 |||||||||| |||||||||| |||||||||| |||||||||| |||| TACTAGCGGG GCTTCTTCAT CGCTTCCCAG TGAATTTTTC TCAG...... .......... 44 TTCAAAATAT GTGTGTGATT ATTGTATGTC CGTGATATTT CTGAATAACT CACCAACTTC 1714 .......... .......... .......... .......... .......... .......... 44 ATTGTGGCTG CTTAATAACA GAGCACAGCT GGCCAACTCA GAAATCGCAA GTCAGGGGGT 1774 ||||||||| |||||||||| |||||||||| |||||||||| .......... .......... .AGCACAGCT GGCCAACTCA GAAATCGCAA GTCAGGGGGT 83 TCAGGTGGCA CAAAAGTGTC TGAACTAGGG GAAACTATCA CATTTGCTGA TGTTGCAGGC 1834 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCAGGTGGCA CAAAAGTGTC TGAACTAGGG GAAACTATCA CATTTGCTGA TGTTGCAGGC 143 GTTGACGAGG CTAAGGAGGA GCTAGAAGAA ATTGTGGTAT TTTTTATTTT CATTGGTGAA 1894 |||||||||| |||||||||| |||||||||| |||||| GTTGACGAGG CTAAGGAGGA GCTAGAAGAA ATTGTG.... .......... .......... 179 CGTTTCTTAA CAACATATCT GCTATAAGTG CTGCATTTTG ATACCATAAG AAGTTGATGA 1954 .......... .......... .......... .......... .......... .......... 179 ACAGTTAATA ATTCATTGTT GTGGTGTTTT TAATTGCTTT TTTCCCCAGG AATTTCTTAG 2014 | |||||||||| .......... .......... .......... .......... .........G AATTTCTTAG 190 AAATCCAGAT AAGTATGTAC GGCTTGGTGC ACGTCCTCCT CGTGGTGTTC TACTGGTGAG 2074 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||| AAATCCAGAT AAGTATGTAC GGCTTGGTGC ACGTCCTCCT CGTGGTGTTC TACTG..... 245 TTTCAAATTT GACGTAGTAA TGTGAATGCT AGGTGGTTAT CAAAGTGAGA AAGCTTTGAG 2134 .......... .......... .......... .......... .......... .......... 
245 AATAATGTAC AATAATTGGA GGGTTCGAAT AGTAACTCAT TGAAACTTCT GATCTTTTGT 2194 .......... .......... .......... .......... .......... .......... 245 TCTTGGTTTT TTTCCTGTTT CTGAAGGGAT TCTCTCTCTA TCCCTTCATG TCACTTTGTG 2254 .......... .......... .......... .......... .......... .......... 245 ATGCCTCTAT GGTCAAATCA CATCATTATT TCCTGCTAGT TTTGTTATGC GACTATTGAC 2314 .......... .......... .......... .......... .......... .......... 245 TGTCAGACCA TTCTACTTGG GGACACCATT TGATTAGTTT TATGCTTTGT GATTCTCAAT 2374 .......... .......... .......... .......... .......... .......... 245 GTTTAGATTA TAAAAATGGC TTTAATTGTT ACAGGTTGGT CTCCCCGGGA CAGGAAAGAC 2434 |||||| |||||||||| |||||||||| .......... .......... .......... ....GTTGGT CTCCCCGGGA CAGGAAAGAC 271 ACTTCTAGCA AAGGCTGTTG CTGGGGAAGC TGAGGTTCCT TTTATCAGTT GTTCTGCAAG 2494 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACTTCTAGCA AAGGCTGTTG CTGGGGAAGC TGAGGTTCCT TTTATCAGTT GTTCTGCAAG 331 TGAGTTTGTA GAATTGTATG TAGGAATGGG AGCATCACGC GTCCGTGACC TGTTTGCACG 2554 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGAGTTTGTA GAATTGTATG TAGGAATGGG AGCATCACGC GTCCGTGACC TGTTTGCACG 391 GGCAAAGAAG GAGGCACCTT CAATAATTTT TATCGATGAG GTCACCTGTG CTTTCCTCCT 2614 |||||||||| |||||||||| |||||||||| |||||||||| GGCAAAGAAG GAGGCACCTT CAATAATTTT TATCGATGAG .......... .......... 431 CTTCTTCTTC CTCTACAAAC TCAAAATATT CCAATAAAAA GTGTCATCAT CATCTGGATG 2674 .......... .......... .......... .......... .......... .......... 431 TGCAGACCTC CTCTTCTTTC TCTACAAACT CAAAATATTC TAAAAAAAAG TGTCATCATT 2734 .......... .......... .......... .......... .......... .......... 431 TTCTGGATGT GCAGATAGAT GCTGTGGCAA AAAGCCGTGA TGGAAAATTC CGCATTGTAA 2794 |||||| |||||||||| |||||||||| |||||||||| |||||||||| .......... ....ATAGAT GCTGTGGCAA AAAGCCGTGA TGGAAAATTC CGCATTGTAA 477 GCAATGATGA AAGAGAGCAG ACATTGAACC AACTACTCAC TGTAAGACAT ACTGGTTTTG 2854 |||||||||| |||||||||| |||||||||| |||||||||| | GCAATGATGA AAGAGAGCAG ACATTGAACC AACTACTCAC T......... .......... 
518 TGGTGCTGCA TCACTTCAGT TTTTTCATAA GAAACTAATA TCTTACAATA ATCTGCAGGA 2914 || .......... .......... .......... .......... .......... ........GA 520 GATGGACGGA TTTGACAGTA ATTCTGCTGT AATTGTTCTT GGAGCAACAA ATCGCTCTGA 2974 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GATGGACGGA TTTGACAGTA ATTCTGCTGT AATTGTTCTT GGAGCAACAA ATCGCTCTGA 580 TGTCTTAGAC CCTGCTCTTC GCCGACCTGG GAGATTTGAC CGTGTTGTAA TG 3026 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| || TGTCTTAGAC CCTGCTCTTC GCCGACCTGG GAGATTTGAC CGTGTTGTAA TG 632 hqPGS_C06HBa0054K13.1-10+_SGN-U330578+ (1595 1638,1736 1870,2004 2069,2409 2594,2749 2835,2913 3026) Total number of EST alignments reported: 2 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 3166: PGL 1 (+ strand): 1573 3026 AGS-1 (1573 1638,1736 1870,2004 2069,2409 2594,2749 2835,2913 3026) SCR (e 1.000 d 0.999 a 0.697,e 1.000 d 0.995 a 0.998,e 1.000 d 0.981 a 1.000,e 1.000 d 0.950 a 0.998,e 1.000 d 0.882 a 0.992,e 1.000) Exon 1 1573 1638 ( 66 n); score: 1.000 Intron 1 1639 1735 ( 97 n); Pd: 0.999 Pa: 0.697 Exon 2 1736 1870 ( 135 n); score: 1.000 Intron 2 1871 2003 ( 133 n); Pd: 0.995 Pa: 0.998 Exon 3 2004 2069 ( 66 n); score: 1.000 Intron 3 2070 2408 ( 339 n); Pd: 0.981 Pa: 1.000 Exon 4 2409 2594 ( 186 n); score: 1.000 Intron 4 2595 2748 ( 154 n); Pd: 0.950 Pa: 0.998 Exon 5 2749 2835 ( 87 n); score: 1.000 Intron 5 2836 2912 ( 77 n); Pd: 0.882 Pa: 0.992 Exon 6 2913 3026 ( 114 n); score: 1.000 PGS (1573 1638,1736 1870,2004 2069,2409 2440) SGN-U340965+ PGS (1595 1638,1736 1870,2004 2069,2409 2594,2749 2835,2913 3026) SGN-U330578+ 3-phase translation of AGS-1 (+strand): . . . . . . 1573 ATAGCATTATTTTATATTGCTGTACTAGCGGGGCTTCTTCATCGCTTCCCAGTGAATTTT I A L F Y I A V L A G L L H R F P V N F - H Y F I L L Y - R G F F I A S Q - I F S I I L Y C C T S G A S S S L P S E F . : . . . . . 
1633 TCTCAG : AGCACAGCTGGCCAACTCAGAAATCGCAAGTCAGGGGGTTCAGGTGGCACAAAA S Q : S T A G Q L R N R K S G G S G G T K L R : A Q L A N S E I A S Q G V Q V A Q K F S : E H S W P T Q K S Q V R G F R W H K . . . . . . 1790 GTGTCTGAACTAGGGGAAACTATCACATTTGCTGATGTTGCAGGCGTTGACGAGGCTAAG V S E L G E T I T F A D V A G V D E A K C L N - G K L S H L L M L Q A L T R L R S V - T R G N Y H I C - C C R R - R G - . . . : . . . 1850 GAGGAGCTAGAAGAAATTGTG : GAATTTCTTAGAAATCCAGATAAGTATGTACGGCTTGGT E E L E E I V : E F L R N P D K Y V R L G R S - K K L W : N F L E I Q I S M Y G L V G G A R R N C : G I S - K S R - V C T A W . . . : . . . 2043 GCACGTCCTCCTCGTGGTGTTCTACTG : GTTGGTCTCCCCGGGACAGGAAAGACACTTCTA A R P P R G V L L : V G L P G T G K T L L H V L L V V F Y W : L V S P G Q E R H F - C T S S S W C S T : G W S P R D R K D T S . . . . . . 2442 GCAAAGGCTGTTGCTGGGGAAGCTGAGGTTCCTTTTATCAGTTGTTCTGCAAGTGAGTTT A K A V A G E A E V P F I S C S A S E F Q R L L L G K L R F L L S V V L Q V S L S K G C C W G S - G S F Y Q L F C K - V . . . . . . 2502 GTAGAATTGTATGTAGGAATGGGAGCATCACGCGTCCGTGACCTGTTTGCACGGGCAAAG V E L Y V G M G A S R V R D L F A R A K - N C M - E W E H H A S V T C L H G Q R C R I V C R N G S I T R P - P V C T G K . . . . : . . 2562 AAGGAGGCACCTTCAATAATTTTTATCGATGAG : ATAGATGCTGTGGCAAAAAGCCGTGAT K E A P S I I F I D E : I D A V A K S R D R R H L Q - F L S M R : - M L W Q K A V M E G G T F N N F Y R - : D R C C G K K P - . . . . . . : 2776 GGAAAATTCCGCATTGTAAGCAATGATGAAAGAGAGCAGACATTGAACCAACTACTCACT : G K F R I V S N D E R E Q T L N Q L L T : E N S A L - A M M K E S R H - T N Y S L : W K I P H C K Q - - K R A D I E P T T H : . . . . . . 2913 GAGATGGACGGATTTGACAGTAATTCTGCTGTAATTGTTCTTGGAGCAACAAATCGCTCT E M D G F D S N S A V I V L G A T N R S R W T D L T V I L L - L F L E Q Q I A L - D G R I - Q - F C C N C S W S N K S L . . . . . . 
2973 GATGTCTTAGACCCTGCTCTTCGCCGACCTGGGAGATTTGACCGTGTTGTAATG D V L D P A L R R P G R F D R V V M M S - T L L F A D L G D L T V L - - C L R P C S S P T W E I - P C C N Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-10+_PGL-1_AGS-1_PPS_1 (1573 1638,1736 1870,2004 2069,2409 2594,2749 2835,2913 3026) (frame '1'; 654 bp, 218 residues) 1 IALFYIAVLA GLLHRFPVNF SQSTAGQLRN RKSGGSGGTK VSELGETITF ADVAGVDEAK 61 EELEEIVEFL RNPDKYVRLG ARPPRGVLLV GLPGTGKTLL AKAVAGEAEV PFISCSASEF 121 VELYVGMGAS RVRDLFARAK KEAPSIIFID EIDAVAKSRD GKFRIVSNDE REQTLNQLLT 181 EMDGFDSNSA VIVLGATNRS DVLDPALRRP GRFDRVVM ... finished at: Mon Aug 28 22:24:38 2006 ________________________________________________________________________________ Sequence 11: C06HBa0054K13.1-11, from 1 to 4982, both strands analyzed. ... started at: Mon Aug 28 22:24:38 2006 EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA +strand ... ... found all matches, elapsed seconds = 2 ... matches indexed, elapsed seconds = 2 HitsTableSize = 5 EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA -strand ... ... found all matches, elapsed seconds = 1 ... 
matches indexed, elapsed seconds = 1 HitsTableSize = 0 ******************************************************************************** EST sequence 2 +strand 1849 n (File: SGN-U312832+) 1 TTCAGACAAA AATGGCAAAG AGTGGCATTT TGGTAATTGT TTCAGCTCTT GTTGTTCTTG 61 CAGTTTGTGG TGTTTTTGCT GAGGAGAACG AATATGTGTT GACTTTGGAC CATTCTAACC 121 TCACTGAGAC TGTTGCTAAG CACAACTTCA TTGTTGTTGA ATTCTATGCA CCTTGGTGTG 181 GACACTGTAA GAGTCTTGCT CCTGAGTATG AAAAAGCTGC CTCAGAGCTG AGTAGTCATG 241 ACCCTCCAAT TGTTCTAGCT AAGTATGATG CAAATGATGA AGCCAATAGA GAACTTTCAA 301 AACAGTACGA GATCCAGGGT TTCCCAACTA TTAAGATATT GAGAGATGGA GGAAAGAAAG 361 TTCAAGACTA TAACGGTCCT CGTGAAGCAG CTGGTATTGT ATCCTACTTG AAGAAACAAG 421 TGGGTCCTGC ATCTGCTGAA ATCAAGTCGA AGGAAGATGC CACAAACCTT ATTGATGAGA 481 AAAGTATCTT TGTTGTTGGT ATATTTCCAG ACCCCTCCGG AGAGAAATTC GAGAACTATT 541 TAACGCTAGC TGAAAAACTG CGAGGCGAGT TCGATTTTGC TCACACTGTT GATGCTAAAC 601 ACCTCCCTCG GGGTGGACCA GTCAACAAGC CCACTCTTCG TCTTCTAAAG CCATTTGATG 661 AACTCTTTGT TGATTTTGAG GACTTTGATG TCGATGCAAT GGAGAAGTTC ATCTCAGAAT 721 CTAGTATTCC TGTTGTTACT ATTTTTGACA ATGACCCAAA CAACCATCCT TATGTTAACA 781 AGTTCTTCGA AGGCACCAAC GCCAAGGCAT TGCTATTTGT GAACTTTAGC TCTGAATTTG 841 ATGCTTTTAA GTCCAAGTAC AACGATGTTG CTGTGATTTA CAAAGGGGAT GGGGTGAGCT 901 TTCTCTTGGG TGATGTTGAG GCTGGTCAAG GTGCTTTTGA GTACTTCGGA CTGAAGCCGG 961 AACAGGCACC TGTGATCATC ATAATGGACG CTGATGAACA AAAGTATATT AAGGACCATG 1021 TGGAACCTGA TGCCATTGCT GCTTACTTGA AGGATTACAA GGAAGGAAAA CTGAAGCCAC 1081 ATGTGAAGTC AGAGCCCATC CCTGAAGTCA ATGACGAACC TGTTAAGGTG GTTGTTAGGG 1141 ATACCCTCCA GGATATGGTT TACAAATCGG GAAAAAATGT GCTGTTAGAG TTCTATGCAC 1201 CTTGGTGTGG CCACTGCAAG AGTCTGGCTC CAATTTTGGA TGAAGTGGCT GTATCATTTG 1261 AAAGCGATCC TGATGTTCTC ATTGCAAAAC TGGACGCAAC CGCAAATGAT CTCCCGAAAG 1321 GTGACTTTGA TGTTCAGGGA TTCCCTACTA TGTACTTCAG ATCCGCCTCT GGTAACTTGT 1381 CACAGTACAA TGGTGAGAGA ACAAAAGAGG CTATCATCGA ATTCATCGAG AAGAATCGTG 1441 GCAAGCCTGC TCAGTCAGAC TCTGCCAAAG TCGATTCAGC AAAGGATGAA CTTTAGAGGA 1501 CTCTAGGAAC ATTGTGTACT GGTGGATTCA AGTTTTGTTG 
GAAGCATTGT GTTTCTGGTG 1561 AATTCGACCT CCCACCGGAC TACTGCTCTC TCCCAACGCT TCCGTTGTAG TTTTTGGAGC 1621 TTTTCAGCGC CAATAAAACG GTTGTATGTA TCCATTTTGT GTATCGAATG TAGCTGGATT 1681 ATGAGTTTAT ATTATATCTA TTGAGAAGTT CTCCAACTTT ATAGTAAAAA AAAAAAAAAA 1741 AATCTTCTTT GTCCCAGAAT GCTACTGTTG GAATGTATGC CTCCTGTTGC AGTAAATAAT 1801 GATACACAAT AATTAAAGAA GTTCAAAAAA AAAAAAAAAA AAAAACTCG Predicted gene structure (within gDNA segment 1018 to 4982): Exon 1 1618 1792 ( 175 n); cDNA 1 175 ( 175 n); score: 1.000 Intron 1 1793 2968 (1176 n); Pd: 1.000 (s: 1.00), Pa: 0.997 (s: 0) Exon 2 2969 2999 ( 31 n); cDNA 176 206 ( 31 n); score: 1.000 Intron 2 3000 3122 ( 123 n); Pd: 0.917 (s: 0), Pa: 0.997 (s: 1.00) Exon 3 3123 3410 ( 288 n); cDNA 207 494 ( 288 n); score: 1.000 Intron 3 3411 3508 ( 98 n); Pd: 0.907 (s: 1.00), Pa: 0.997 (s: 1.00) Exon 4 3509 3694 ( 186 n); cDNA 495 680 ( 186 n); score: 1.000 Intron 4 3695 3809 ( 115 n); Pd: 0.988 (s: 1.00), Pa: 0.999 (s: 1.00) Exon 5 3810 3935 ( 126 n); cDNA 681 806 ( 126 n); score: 1.000 Intron 5 3936 4129 ( 194 n); Pd: 0.981 (s: 1.00), Pa: 0.965 (s: 1.00) Exon 6 4130 4264 ( 135 n); cDNA 807 941 ( 135 n); score: 1.000 Intron 6 4265 4359 ( 95 n); Pd: 0.997 (s: 1.00), Pa: 0.858 (s: 1.00) Exon 7 4360 4479 ( 120 n); cDNA 942 1061 ( 120 n); score: 1.000 Intron 7 4480 4604 ( 125 n); Pd: 1.000 (s: 1.00), Pa: 0.984 (s: 1.00) Exon 8 4605 4722 ( 118 n); cDNA 1062 1179 ( 118 n); score: 1.000 PPA cDNA 1825 1846 MATCH C06HBa0054K13.1-11+ SGN-U312832+ 1.000 1179 0.638 C PGS_C06HBa0054K13.1-11+_SGN-U312832+ (1618 1792,2969 2999,3123 3410,3509 3694,3810 3935,4130 4264,4360 4479,4605 4722) Alignment (genomic DNA sequence = upper lines): TTCAGACAAA AATGGCAAAG AGTGGCATTT TGGTAATTGT TTCAGCTCTT GTTGTTCTTG 1677 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCAGACAAA AATGGCAAAG AGTGGCATTT TGGTAATTGT TTCAGCTCTT GTTGTTCTTG 60 CAGTTTGTGG TGTTTTTGCT GAGGAGAACG AATATGTGTT GACTTTGGAC CATTCTAACC 1737 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| 
CAGTTTGTGG TGTTTTTGCT GAGGAGAACG AATATGTGTT GACTTTGGAC CATTCTAACC 120 TCACTGAGAC TGTTGCTAAG CACAACTTCA TTGTTGTTGA ATTCTATGCA CCTTGGTAAG 1797 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||| TCACTGAGAC TGTTGCTAAG CACAACTTCA TTGTTGTTGA ATTCTATGCA CCTTG..... 175 TCATCTGCTG TTTTTTTTTC TATTTATTTT CTTGAAAAAA GTTCATTTTT TTTGCTTGTT 1857 .......... .......... .......... .......... .......... .......... 175 GAATCTGTAT GTGCAGAATG TAATGAATGA TTGGATGTAA GGGGTTTTGT TTTTGATCTG 1917 .......... .......... .......... .......... .......... .......... 175 TTTTTTTTTT GTTATTTTAG TGAATTTTTG CTGTAATTTT TTTGTTTTCA TAAGCTGTGT 1977 .......... .......... .......... .......... .......... .......... 175 GAGGAATGCA TCAAGATCAG TTCTTTATGT TAATTTGTCT TAAAGGGTTC ATCAAGGACC 2037 .......... .......... .......... .......... .......... .......... 175 CCTTTGGCTC TACTTTGCTG TTCCTGTTTT TTTTTTTTTT TTGGAGTTTT ATCTTTTTGC 2097 .......... .......... .......... .......... .......... .......... 175 TGGTTTCTCT GGCTTTAGAA ACTGCATGTG CAGAAGATAA TCAATGACTG GTTCTTTTTC 2157 .......... .......... .......... .......... .......... .......... 175 TTTTGGGGAA TTTAAGGTTT TGTTATGTAA ATGGTTTTTT TTTCTGATTT TTCTGCTGTA 2217 .......... .......... .......... .......... .......... .......... 175 ATTTTTTATT TTACACATTG ACTTGAATTT GTCTGAAAGG GTTTTTCAAG GATCCTTTTG 2277 .......... .......... .......... .......... .......... .......... 175 TTGATGGATG TATTTTGATT TGTGGATTGA TTGAGGTTTT AAGGCTTTTT CTGTAAAAAG 2337 .......... .......... .......... .......... .......... .......... 175 CTTCTTCTTT TAATTCATGG TTGATGCTTC TAGAATCTGG ATTTGCTTGA TGAGTTCTAG 2397 .......... .......... .......... .......... .......... .......... 175 GTTATAAGGT TTCCTCTTTT TATTTAATTT GCTGCTTGTT TTTTGGTATT TAAAATCTCT 2457 .......... .......... .......... .......... .......... .......... 175 GTGTCCAGAA TTTTACAAAA ATATTAATAA ATCTGGATTT GTCTTTGTGT TCTGTGTTGC 2517 .......... .......... .......... .......... .......... .......... 
175 TGCAAGTTAA TGGTTTTTCC ATTTCTGTGA CAGTCTAGTC AATTGGAATG TAAAATCTGT 2577 .......... .......... .......... .......... .......... .......... 175 TTTTCTATTT TTGTGTATGT ATTTGGATCT ATGTCAAGAG GTTCCAGTGA TCTGTGTTAA 2637 .......... .......... .......... .......... .......... .......... 175 TAGAAGTGAT TCTTTTAGCC AAATTGTCTT CTTGTTATGT GTAGTGGTGG ATCCAATGTA 2697 .......... .......... .......... .......... .......... .......... 175 ATGGTACCAG GTTCATCTGA ACCTAATAGT TTCGACTCGG AGCATAAATT TATGTGTAGG 2757 .......... .......... .......... .......... .......... .......... 175 AATTCACTAA AATTGCAATA AATAGTAGAC ATGAATGATG TTGATGACGA CATGCTTGAG 2817 .......... .......... .......... .......... .......... .......... 175 ATGGTTGTAT TAATAGCTTG CATTAGTTTC GATCTTTTCT ATGGTATCAA TTCCCATAGT 2877 .......... .......... .......... .......... .......... .......... 175 TATACGTTGT TGTCGAAATG TTTATTTAGT TATTCTTTGA CATGAAATTT TTTTACTTTA 2937 .......... .......... .......... .......... .......... .......... 175 TGAGTAACTG ATAATATACT TGCAATTACA GGTGTGGACA CTGTAAGAGT CTTGCTCCTG 2997 ||||||||| |||||||||| |||||||||| .......... .......... .......... .GTGTGGACA CTGTAAGAGT CTTGCTCCTG 204 AGGTACACTT TCTTCTATAC AGTGCTTAAT GAAGGAAGGA ATATCATTTT TCTCTGTCAT 3057 || AG........ .......... .......... .......... .......... .......... 206 TATGTTTTTT CTTGGCATTT TCAATCTATG CAACTGATTT AATGTTAAAT TATGTTTCTT 3117 .......... .......... .......... .......... .......... .......... 
206 TCCAGTATGA AAAAGCTGCC TCAGAGCTGA GTAGTCATGA CCCTCCAATT GTTCTAGCTA 3177 ||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| .....TATGA AAAAGCTGCC TCAGAGCTGA GTAGTCATGA CCCTCCAATT GTTCTAGCTA 261 AGTATGATGC AAATGATGAA GCCAATAGAG AACTTTCAAA ACAGTACGAG ATCCAGGGTT 3237 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGTATGATGC AAATGATGAA GCCAATAGAG AACTTTCAAA ACAGTACGAG ATCCAGGGTT 321 TCCCAACTAT TAAGATATTG AGAGATGGAG GAAAGAAAGT TCAAGACTAT AACGGTCCTC 3297 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCCCAACTAT TAAGATATTG AGAGATGGAG GAAAGAAAGT TCAAGACTAT AACGGTCCTC 381 GTGAAGCAGC TGGTATTGTA TCCTACTTGA AGAAACAAGT GGGTCCTGCA TCTGCTGAAA 3357 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTGAAGCAGC TGGTATTGTA TCCTACTTGA AGAAACAAGT GGGTCCTGCA TCTGCTGAAA 441 TCAAGTCGAA GGAAGATGCC ACAAACCTTA TTGATGAGAA AAGTATCTTT GTTGTAAGTG 3417 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||| TCAAGTCGAA GGAAGATGCC ACAAACCTTA TTGATGAGAA AAGTATCTTT GTT....... 494 GTGATACTTT TGATGTTTTA CTTATCATTA CTGAAGTTTA TTTTCAGCAT GCCATAGCTT 3477 .......... .......... .......... .......... .......... .......... 494 CTTGCTGTCT AAAATCTTTC TGTTTCTTTA GGTTGGTATA TTTCCAGACC CCTCCGGAGA 3537 ||||||||| |||||||||| |||||||||| .......... .......... .......... .GTTGGTATA TTTCCAGACC CCTCCGGAGA 523 GAAATTCGAG AACTATTTAA CGCTAGCTGA AAAACTGCGA GGCGAGTTCG ATTTTGCTCA 3597 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAAATTCGAG AACTATTTAA CGCTAGCTGA AAAACTGCGA GGCGAGTTCG ATTTTGCTCA 583 CACTGTTGAT GCTAAACACC TCCCTCGGGG TGGACCAGTC AACAAGCCCA CTCTTCGTCT 3657 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CACTGTTGAT GCTAAACACC TCCCTCGGGG TGGACCAGTC AACAAGCCCA CTCTTCGTCT 643 TCTAAAGCCA TTTGATGAAC TCTTTGTTGA TTTTGAGGTA TTACAGTTAT CTGTGTATAT 3717 |||||||||| |||||||||| |||||||||| ||||||| TCTAAAGCCA TTTGATGAAC TCTTTGTTGA TTTTGAG... .......... .......... 
680 TTTTATCTTT GATACTCATT GTATTATGGT TATGAGGCTC AACTCTGTCC TATGTTATCT 3777 .......... .......... .......... .......... .......... .......... 680 TTTAATTGGC CATTGATTGT ACATACTTGC AGGACTTTGA TGTCGATGCA ATGGAGAAGT 3837 |||||||| |||||||||| |||||||||| .......... .......... .......... ..GACTTTGA TGTCGATGCA ATGGAGAAGT 708 TCATCTCAGA ATCTAGTATT CCTGTTGTTA CTATTTTTGA CAATGACCCA AACAACCATC 3897 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCATCTCAGA ATCTAGTATT CCTGTTGTTA CTATTTTTGA CAATGACCCA AACAACCATC 768 CTTATGTTAA CAAGTTCTTC GAAGGCACCA ACGCCAAGGT ACTTGATTCT GCGAGCTGTA 3957 |||||||||| |||||||||| |||||||||| |||||||| CTTATGTTAA CAAGTTCTTC GAAGGCACCA ACGCCAAG.. .......... .......... 806 ATGCTTCAAT TTGACCTTAT ATTTCAACGT CAAGCAATTC TAGGATCTTC TCATGAATTT 4017 .......... .......... .......... .......... .......... .......... 806 TACAGTTATA TGCTTGTACT TGTATTATTT TCTTTTCATT CACTAATATT TGACAATTAA 4077 .......... .......... .......... .......... .......... .......... 806 CTTCAAAGTT CTAAGAGATT GATTTGTTTT TTTGACATTT GCAAATCTAC AGGCATTGCT 4137 |||||||| .......... .......... .......... .......... .......... ..GCATTGCT 814 ATTTGTGAAC TTTAGCTCTG AATTTGATGC TTTTAAGTCC AAGTACAACG ATGTTGCTGT 4197 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTTGTGAAC TTTAGCTCTG AATTTGATGC TTTTAAGTCC AAGTACAACG ATGTTGCTGT 874 GATTTACAAA GGGGATGGGG TGAGCTTTCT CTTGGGTGAT GTTGAGGCTG GTCAAGGTGC 4257 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GATTTACAAA GGGGATGGGG TGAGCTTTCT CTTGGGTGAT GTTGAGGCTG GTCAAGGTGC 934 TTTTGAGGTT GGTTAAAATT TTCAACTATC TGATTACAGT GTCATTGTAT CCATGATCAC 4317 ||||||| TTTTGAG... .......... .......... .......... .......... .......... 941 GTAAAAAGGC GCGTTGAACA TTCGTTTTAT TTGGAATTGC AGTACTTCGG ACTGAAGCCG 4377 |||||||| |||||||||| .......... .......... .......... .......... 
..TACTTCGG ACTGAAGCCG 959 GAACAGGCAC CTGTGATCAT CATAATGGAC GCTGATGAAC AAAAGTATAT TAAGGACCAT 4437 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAACAGGCAC CTGTGATCAT CATAATGGAC GCTGATGAAC AAAAGTATAT TAAGGACCAT 1019 GTGGAACCTG ATGCCATTGC TGCTTACTTG AAGGATTACA AGGTGATGCA TCCTCTTCTT 4497 |||||||||| |||||||||| |||||||||| |||||||||| || GTGGAACCTG ATGCCATTGC TGCTTACTTG AAGGATTACA AG........ .......... 1061 TTTCTTGGAT ATATTTTGGA ATGAAAGCGT TGTTAATTGT GAACAATATT CTGCACGGAA 4557 .......... .......... .......... .......... .......... .......... 1061 GGAAAGGCTT ACTAACCTTT TTCCTCTAAT TTTCCACATT GTTACAGGAA GGAAAACTGA 4617 ||| |||||||||| .......... .......... .......... .......... .......GAA GGAAAACTGA 1074 AGCCACATGT GAAGTCAGAG CCCATCCCTG AAGTCAATGA CGAACCTGTT AAGGTGGTTG 4677 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGCCACATGT GAAGTCAGAG CCCATCCCTG AAGTCAATGA CGAACCTGTT AAGGTGGTTG 1134 TTAGGGATAC CCTCCAGGAT ATGGTTTACA AATCGGGAAA AAATG 4722 |||||||||| |||||||||| |||||||||| |||||||||| ||||| TTAGGGATAC CCTCCAGGAT ATGGTTTACA AATCGGGAAA AAATG 1179 hqPGS_C06HBa0054K13.1-11+_SGN-U312832+ (1618 1792,2969 2999,3123 3410,3509 3694,3810 3935,4130 4264,4360 4479,4605 4722) ******************************************************************************** EST sequence 3 +strand 871 n (File: SGN-U312834+) 1 CAAAAATGGC AAAGAGTGGC ATTTTGGTAA TTGTTTCAGC TCTTGTTGTT CTTGCAGTTT 61 GTGGTGTTTT TGCTGAGGAG AACGAATATG TGTTGACTTT GGACCATTCT AACCTCACTG 121 AGACTGTTGC TAAGCACAAC TTCATTGTTG TTGAATTCTA TGCACCTTGG TGTGGACACT 181 GTAAGAGTCT TGCTCCTGAG TATGAAAAAG CTGCCTCAGA GCTGAGTAGT CATGACCCTC 241 CAATTGTTCT AGCTAAGTAT GATGCAAATG ATGAAGCCAA TAGAGAACTT TCAAAACAGT 301 ACGAGATCCA GGGTTTCCCA ACTATTAAGA TATTGAGAGA TGGAGGAAAG AAAGTTCAAG 361 ACTATAACGG TCCTCGTGAA GCAGCTGGTA TTGTATCCTA CTTGAAGAAA CAAGTGGGTC 421 CTGCATCTGC TGAAATCAAG TCGAAGGAAG ATGCCACAAA CCTTATTGAT GAGAAAAGTA 481 TCTTTGTTGT TGGTATATTT CCAGACCCCT CCGGAGAGAA ATTCGAGAAC TATTTAACGC 541 TAGCTGAAAA ACTGCGAGGC 
GAGTTCGATT TTGCTCACAC TGTTGATGCT AAACACCTCC 601 CTCGGGGTGG ACCAGTCAAC AAGCCCACTC TTCGTCTTCT AAAGCCATTT GATGAACTCT 661 TTGTTGATTT TGAGGACTTT GATGTCGATG CAATGGAGAA GTTCATCTCA GAATCTAGTA 721 TTCCTGGTTG TACTATTTTT GACAATGACC CAAAGCACCA TCCTTTATGT AAACAGTTCT 781 TTCGAGGCAC CAAACGCCAG GCATTGCTAT TTGTGAACTT TAGCTCTGAA GTTGATGCTT 841 TTAAGTCAAG NTACACGATG TTGCTGTGAT T Predicted gene structure (within gDNA segment 1024 to 4811): Exon 1 1624 1792 ( 169 n); cDNA 1 169 ( 169 n); score: 1.000 Intron 1 1793 2968 (1176 n); Pd: 1.000 (s: 1.00), Pa: 0.997 (s: 0) Exon 2 2969 2999 ( 31 n); cDNA 170 200 ( 31 n); score: 1.000 Intron 2 3000 3122 ( 123 n); Pd: 0.917 (s: 0), Pa: 0.997 (s: 1.00) Exon 3 3123 3410 ( 288 n); cDNA 201 488 ( 288 n); score: 1.000 Intron 3 3411 3508 ( 98 n); Pd: 0.907 (s: 1.00), Pa: 0.997 (s: 1.00) Exon 4 3509 3694 ( 186 n); cDNA 489 674 ( 186 n); score: 1.000 Intron 4 3695 3809 ( 115 n); Pd: 0.988 (s: 1.00), Pa: 0.999 (s: 1.00) Exon 5 3810 3935 ( 126 n); cDNA 675 800 ( 126 n); score: 0.869 Intron 5 3936 4129 ( 194 n); Pd: 0.981 (s: 0.72), Pa: 0.965 (s: 0.94) Exon 6 4130 4201 ( 72 n); cDNA 801 871 ( 71 n); score: 0.931 MATCH C06HBa0054K13.1-11+ SGN-U312834+ 0.974 872 1.001 C PGS_C06HBa0054K13.1-11+_SGN-U312834+ (1624 1792,2969 2999,3123 3410,3509 3694,3810 3935,4130 4201) Alignment (genomic DNA sequence = upper lines): CAAAAATGGC AAAGAGTGGC ATTTTGGTAA TTGTTTCAGC TCTTGTTGTT CTTGCAGTTT 1683 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAAAAATGGC AAAGAGTGGC ATTTTGGTAA TTGTTTCAGC TCTTGTTGTT CTTGCAGTTT 60 GTGGTGTTTT TGCTGAGGAG AACGAATATG TGTTGACTTT GGACCATTCT AACCTCACTG 1743 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTGGTGTTTT TGCTGAGGAG AACGAATATG TGTTGACTTT GGACCATTCT AACCTCACTG 120 AGACTGTTGC TAAGCACAAC TTCATTGTTG TTGAATTCTA TGCACCTTGG TAAGTCATCT 1803 |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| AGACTGTTGC TAAGCACAAC TTCATTGTTG TTGAATTCTA TGCACCTTG. .......... 
169 GCTGTTTTTT TTTCTATTTA TTTTCTTGAA AAAAGTTCAT TTTTTTTGCT TGTTGAATCT 1863 .......... .......... .......... .......... .......... .......... 169 GTATGTGCAG AATGTAATGA ATGATTGGAT GTAAGGGGTT TTGTTTTTGA TCTGTTTTTT 1923 .......... .......... .......... .......... .......... .......... 169 TTTTGTTATT TTAGTGAATT TTTGCTGTAA TTTTTTTGTT TTCATAAGCT GTGTGAGGAA 1983 .......... .......... .......... .......... .......... .......... 169 TGCATCAAGA TCAGTTCTTT ATGTTAATTT GTCTTAAAGG GTTCATCAAG GACCCCTTTG 2043 .......... .......... .......... .......... .......... .......... 169 GCTCTACTTT GCTGTTCCTG TTTTTTTTTT TTTTTTGGAG TTTTATCTTT TTGCTGGTTT 2103 .......... .......... .......... .......... .......... .......... 169 CTCTGGCTTT AGAAACTGCA TGTGCAGAAG ATAATCAATG ACTGGTTCTT TTTCTTTTGG 2163 .......... .......... .......... .......... .......... .......... 169 GGAATTTAAG GTTTTGTTAT GTAAATGGTT TTTTTTTCTG ATTTTTCTGC TGTAATTTTT 2223 .......... .......... .......... .......... .......... .......... 169 TATTTTACAC ATTGACTTGA ATTTGTCTGA AAGGGTTTTT CAAGGATCCT TTTGTTGATG 2283 .......... .......... .......... .......... .......... .......... 169 GATGTATTTT GATTTGTGGA TTGATTGAGG TTTTAAGGCT TTTTCTGTAA AAAGCTTCTT 2343 .......... .......... .......... .......... .......... .......... 169 CTTTTAATTC ATGGTTGATG CTTCTAGAAT CTGGATTTGC TTGATGAGTT CTAGGTTATA 2403 .......... .......... .......... .......... .......... .......... 169 AGGTTTCCTC TTTTTATTTA ATTTGCTGCT TGTTTTTTGG TATTTAAAAT CTCTGTGTCC 2463 .......... .......... .......... .......... .......... .......... 169 AGAATTTTAC AAAAATATTA ATAAATCTGG ATTTGTCTTT GTGTTCTGTG TTGCTGCAAG 2523 .......... .......... .......... .......... .......... .......... 169 TTAATGGTTT TTCCATTTCT GTGACAGTCT AGTCAATTGG AATGTAAAAT CTGTTTTTCT 2583 .......... .......... .......... .......... .......... .......... 169 ATTTTTGTGT ATGTATTTGG ATCTATGTCA AGAGGTTCCA GTGATCTGTG TTAATAGAAG 2643 .......... .......... .......... .......... .......... .......... 
169 TGATTCTTTT AGCCAAATTG TCTTCTTGTT ATGTGTAGTG GTGGATCCAA TGTAATGGTA 2703 .......... .......... .......... .......... .......... .......... 169 CCAGGTTCAT CTGAACCTAA TAGTTTCGAC TCGGAGCATA AATTTATGTG TAGGAATTCA 2763 .......... .......... .......... .......... .......... .......... 169 CTAAAATTGC AATAAATAGT AGACATGAAT GATGTTGATG ACGACATGCT TGAGATGGTT 2823 .......... .......... .......... .......... .......... .......... 169 GTATTAATAG CTTGCATTAG TTTCGATCTT TTCTATGGTA TCAATTCCCA TAGTTATACG 2883 .......... .......... .......... .......... .......... .......... 169 TTGTTGTCGA AATGTTTATT TAGTTATTCT TTGACATGAA ATTTTTTTAC TTTATGAGTA 2943 .......... .......... .......... .......... .......... .......... 169 ACTGATAATA TACTTGCAAT TACAGGTGTG GACACTGTAA GAGTCTTGCT CCTGAGGTAC 3003 ||||| |||||||||| |||||||||| |||||| .......... .......... .....GTGTG GACACTGTAA GAGTCTTGCT CCTGAG.... 200 ACTTTCTTCT ATACAGTGCT TAATGAAGGA AGGAATATCA TTTTTCTCTG TCATTATGTT 3063 .......... .......... .......... .......... .......... .......... 200 TTTTCTTGGC ATTTTCAATC TATGCAACTG ATTTAATGTT AAATTATGTT TCTTTCCAGT 3123 | .......... .......... .......... .......... .......... 
.........T 201 ATGAAAAAGC TGCCTCAGAG CTGAGTAGTC ATGACCCTCC AATTGTTCTA GCTAAGTATG 3183 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATGAAAAAGC TGCCTCAGAG CTGAGTAGTC ATGACCCTCC AATTGTTCTA GCTAAGTATG 261 ATGCAAATGA TGAAGCCAAT AGAGAACTTT CAAAACAGTA CGAGATCCAG GGTTTCCCAA 3243 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATGCAAATGA TGAAGCCAAT AGAGAACTTT CAAAACAGTA CGAGATCCAG GGTTTCCCAA 321 CTATTAAGAT ATTGAGAGAT GGAGGAAAGA AAGTTCAAGA CTATAACGGT CCTCGTGAAG 3303 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTATTAAGAT ATTGAGAGAT GGAGGAAAGA AAGTTCAAGA CTATAACGGT CCTCGTGAAG 381 CAGCTGGTAT TGTATCCTAC TTGAAGAAAC AAGTGGGTCC TGCATCTGCT GAAATCAAGT 3363 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAGCTGGTAT TGTATCCTAC TTGAAGAAAC AAGTGGGTCC TGCATCTGCT GAAATCAAGT 441 CGAAGGAAGA TGCCACAAAC CTTATTGATG AGAAAAGTAT CTTTGTTGTA AGTGGTGATA 3423 |||||||||| |||||||||| |||||||||| |||||||||| ||||||| CGAAGGAAGA TGCCACAAAC CTTATTGATG AGAAAAGTAT CTTTGTT... .......... 488 CTTTTGATGT TTTACTTATC ATTACTGAAG TTTATTTTCA GCATGCCATA GCTTCTTGCT 3483 .......... .......... .......... .......... .......... .......... 488 GTCTAAAATC TTTCTGTTTC TTTAGGTTGG TATATTTCCA GACCCCTCCG GAGAGAAATT 3543 ||||| |||||||||| |||||||||| |||||||||| .......... .......... .....GTTGG TATATTTCCA GACCCCTCCG GAGAGAAATT 523 CGAGAACTAT TTAACGCTAG CTGAAAAACT GCGAGGCGAG TTCGATTTTG CTCACACTGT 3603 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CGAGAACTAT TTAACGCTAG CTGAAAAACT GCGAGGCGAG TTCGATTTTG CTCACACTGT 583 TGATGCTAAA CACCTCCCTC GGGGTGGACC AGTCAACAAG CCCACTCTTC GTCTTCTAAA 3663 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGATGCTAAA CACCTCCCTC GGGGTGGACC AGTCAACAAG CCCACTCTTC GTCTTCTAAA 643 GCCATTTGAT GAACTCTTTG TTGATTTTGA GGTATTACAG TTATCTGTGT ATATTTTTAT 3723 |||||||||| |||||||||| |||||||||| | GCCATTTGAT GAACTCTTTG TTGATTTTGA G......... .......... .......... 
674 CTTTGATACT CATTGTATTA TGGTTATGAG GCTCAACTCT GTCCTATGTT ATCTTTTAAT 3783 .......... .......... .......... .......... .......... .......... 674 TGGCCATTGA TTGTACATAC TTGCAGGACT TTGATGTCGA TGCAATGGAG AAGTTCATCT 3843 |||| |||||||||| |||||||||| |||||||||| .......... .......... ......GACT TTGATGTCGA TGCAATGGAG AAGTTCATCT 708 CAGAATCTAG TATTCCTGTT GTTACTATTT TTGACAATGA CCCAAACAAC CATCC-TTAT 3902 |||||||||| |||||||| | |||||||| |||||||||| |||||| || ||||| |||| CAGAATCTAG TATTCCTGGT TGTACTATTT TTGACAATGA CCCAAAGCAC CATCCTTTAT 768 GTTAACAAGT TCTTCGAAGG CACCAACGCC AAGGTACTTG ATTCTGCGAG CTGTAATGCT 3962 || ||| ||| |||| ||| |||||| | || GTAAAC-AGT TCTTTCGAGG CACCAAACGC CAG....... .......... .......... 800 TCAATTTGAC CTTATATTTC AACGTCAAGC AATTCTAGGA TCTTCTCATG AATTTTACAG 4022 .......... .......... .......... .......... .......... .......... 800 TTATATGCTT GTACTTGTAT TATTTTCTTT TCATTCACTA ATATTTGACA ATTAACTTCA 4082 .......... .......... .......... .......... .......... .......... 800 AAGTTCTAAG AGATTGATTT GTTTTTTTGA CATTTGCAAA TCTACAGGCA TTGCTATTTG 4142 ||| |||||||||| .......... .......... .......... .......... 
.......GCA TTGCTATTTG 813 TGAACTTTAG CTCTGAATTT GATGCTTTTA AGTCCAAGTA CAACGATGTT GCTGTGATT 4201 |||||||||| ||||||| || |||||||||| |||| | || | |||||||| ||||||||| TGAACTTTAG CTCTGAAGTT GATGCTTTTA AGTCAAGNTA C-ACGATGTT GCTGTGATT 871 hqPGS_C06HBa0054K13.1-11+_SGN-U312834+ (1624 1792,2969 2999,3123 3410,3509 3694,3810 3935,4130 4201) ******************************************************************************** EST sequence 4 +strand 888 n (File: SGN-U333955+) 1 AGCGGTGGAT TCCGCTCTAG AACTAGTGGA TCCCGGGGGC TGCAGGAATT CGGCACGAGG 61 TTTTGGCATT TAGTGGCATT AAGGTAATTG GGGGAGCTTT GGGTGTTCTT GCAGGTCGCG 121 CGTGTTTTCG CTGAGGAGAA CGAATATGTG TCGACTTTGG ACCATTCTAA CCTCACTGAG 181 ACTGTTGCTA AGCACAACTT TATTGTTGTT GAATTCTATG CACCTTGGTG TGGACACTGT 241 AAGAGTCTTG CTCCTGAGTA TGAAAAAGCT GCCTCAGATC TGAGTTGTCA TGACCCTCCA 301 ATTGTTCTAG CTAAGTATGA TGCAAATGAT GAAGCCAATA GAGAACTTTC AAAACAGTTC 361 GAGATCCATG GTTTCCCAAC TATTAAGATA TTGAGAGATG GATGAAAGAA AGTTCAAGAC 421 TATAACGGTC CTCGTGAAGC TAGCTGGTAT TGTATCCTAC TTGAAGAAAC AAGTGGGTTC 481 TGCATCTGCT TAATTCAAGT CGAAGGAAGA TGCCCCCTTA CCTTTATTGA TGAGAAAAGT 541 ATCTTTGTTG TTTGGTCTAT ATCCAGACCC CTTCCCGGAG AGAAAATTCG AGAACCATTT 601 AACGCCTATC TGAAAAACTG CCAGGTGAGT TCTGATTTTG CTCCACACTG TTGATGGCTA 661 AACACCTATC TCGGGGTGGG ATCTTGTTAT CCAAAGCCCA CTCTTTCGGC TTCTATAACC 721 CTTTTGATGA AACTTTCTGT TGGATCTTGT AGGACCTTTT TATGCCCATT GCCATAGGAG 781 CAATGTATTA TTTTCATAAA TTCCAAGTAT TTCCATGATT GTTACTCTTT TTTTTTACAA 841 GGTAACCCAA AAAACATCCT ATTCTTTATG TTGTACCAAA ATTTGTCG Predicted gene structure (within gDNA segment 1 to 4982): Exon 1 1638 1792 ( 155 n); cDNA 72 227 ( 156 n); score: 0.887 Intron 1 1793 2968 (1176 n); Pd: 1.000 (s: 0.98), Pa: 0.997 (s: 0) Exon 2 2969 2999 ( 31 n); cDNA 228 258 ( 31 n); score: 1.000 Intron 2 3000 3122 ( 123 n); Pd: 0.917 (s: 0), Pa: 0.997 (s: 0.96) Exon 3 3123 3410 ( 288 n); cDNA 259 549 ( 291 n); score: 0.936 Intron 3 3411 3508 ( 98 n); Pd: 0.907 (s: 0.83), Pa: 0.997 (s: 0.72) Exon 4 3509 3694 ( 186 n); cDNA 550 752 ( 203 n); score: 
0.675 Intron 4 3695 3809 ( 115 n); Pd: 0.988 (s: 0.59), Pa: 0.999 (s: 0) Exon 5 3810 3815 ( 6 n); cDNA 753 758 ( 6 n); score: 0.833 MATCH C06HBa0054K13.1-11+ SGN-U333955+ 0.847 666 0.750 C PGS_C06HBa0054K13.1-11+_SGN-U333955+ (1638 1792,2969 2999,3123 3410,3509 3694,3810 3815) Alignment (genomic DNA sequence = upper lines): AGTGGCATTT TGGTAATTGT TTCAGCTCTT GTTGTTCTTG CAGTTTGTG- GTGTTTTTGC 1696 ||||||||| |||||||| |||| | | |||||||| ||| | | | ||||||| || AGTGGCATTA AGGTAATTGG GGGAGCTTTG GGTGTTCTTG CAGGTCGCGC GTGTTTTCGC 131 TGAGGAGAAC GAATATGTGT TGACTTTGGA CCATTCTAAC CTCACTGAGA CTGTTGCTAA 1756 |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| TGAGGAGAAC GAATATGTGT CGACTTTGGA CCATTCTAAC CTCACTGAGA CTGTTGCTAA 191 GCACAACTTC ATTGTTGTTG AATTCTATGC ACCTTGGTAA GTCATCTGCT GTTTTTTTTT 1816 ||||||||| |||||||||| |||||||||| |||||| GCACAACTTT ATTGTTGTTG AATTCTATGC ACCTTG.... .......... .......... 227 CTATTTATTT TCTTGAAAAA AGTTCATTTT TTTTGCTTGT TGAATCTGTA TGTGCAGAAT 1876 .......... .......... .......... .......... .......... .......... 227 GTAATGAATG ATTGGATGTA AGGGGTTTTG TTTTTGATCT GTTTTTTTTT TGTTATTTTA 1936 .......... .......... .......... .......... .......... .......... 227 GTGAATTTTT GCTGTAATTT TTTTGTTTTC ATAAGCTGTG TGAGGAATGC ATCAAGATCA 1996 .......... .......... .......... .......... .......... .......... 227 GTTCTTTATG TTAATTTGTC TTAAAGGGTT CATCAAGGAC CCCTTTGGCT CTACTTTGCT 2056 .......... .......... .......... .......... .......... .......... 227 GTTCCTGTTT TTTTTTTTTT TTTGGAGTTT TATCTTTTTG CTGGTTTCTC TGGCTTTAGA 2116 .......... .......... .......... .......... .......... .......... 227 AACTGCATGT GCAGAAGATA ATCAATGACT GGTTCTTTTT CTTTTGGGGA ATTTAAGGTT 2176 .......... .......... .......... .......... .......... .......... 227 TTGTTATGTA AATGGTTTTT TTTTCTGATT TTTCTGCTGT AATTTTTTAT TTTACACATT 2236 .......... .......... .......... .......... .......... .......... 227 GACTTGAATT TGTCTGAAAG GGTTTTTCAA GGATCCTTTT GTTGATGGAT GTATTTTGAT 2296 .......... .......... 
.......... .......... .......... .......... 227 TTGTGGATTG ATTGAGGTTT TAAGGCTTTT TCTGTAAAAA GCTTCTTCTT TTAATTCATG 2356 .......... .......... .......... .......... .......... .......... 227 GTTGATGCTT CTAGAATCTG GATTTGCTTG ATGAGTTCTA GGTTATAAGG TTTCCTCTTT 2416 .......... .......... .......... .......... .......... .......... 227 TTATTTAATT TGCTGCTTGT TTTTTGGTAT TTAAAATCTC TGTGTCCAGA ATTTTACAAA 2476 .......... .......... .......... .......... .......... .......... 227 AATATTAATA AATCTGGATT TGTCTTTGTG TTCTGTGTTG CTGCAAGTTA ATGGTTTTTC 2536 .......... .......... .......... .......... .......... .......... 227 CATTTCTGTG ACAGTCTAGT CAATTGGAAT GTAAAATCTG TTTTTCTATT TTTGTGTATG 2596 .......... .......... .......... .......... .......... .......... 227 TATTTGGATC TATGTCAAGA GGTTCCAGTG ATCTGTGTTA ATAGAAGTGA TTCTTTTAGC 2656 .......... .......... .......... .......... .......... .......... 227 CAAATTGTCT TCTTGTTATG TGTAGTGGTG GATCCAATGT AATGGTACCA GGTTCATCTG 2716 .......... .......... .......... .......... .......... .......... 227 AACCTAATAG TTTCGACTCG GAGCATAAAT TTATGTGTAG GAATTCACTA AAATTGCAAT 2776 .......... .......... .......... .......... .......... .......... 227 AAATAGTAGA CATGAATGAT GTTGATGACG ACATGCTTGA GATGGTTGTA TTAATAGCTT 2836 .......... .......... .......... .......... .......... .......... 227 GCATTAGTTT CGATCTTTTC TATGGTATCA ATTCCCATAG TTATACGTTG TTGTCGAAAT 2896 .......... .......... .......... .......... .......... .......... 227 GTTTATTTAG TTATTCTTTG ACATGAAATT TTTTTACTTT ATGAGTAACT GATAATATAC 2956 .......... .......... .......... .......... .......... .......... 227 TTGCAATTAC AGGTGTGGAC ACTGTAAGAG TCTTGCTCCT GAGGTACACT TTCTTCTATA 3016 |||||||| |||||||||| |||||||||| ||| .......... ..GTGTGGAC ACTGTAAGAG TCTTGCTCCT GAG....... .......... 258 CAGTGCTTAA TGAAGGAAGG AATATCATTT TTCTCTGTCA TTATGTTTTT TCTTGGCATT 3076 .......... .......... .......... .......... .......... .......... 
258 TTCAATCTAT GCAACTGATT TAATGTTAAA TTATGTTTCT TTCCAGTATG AAAAAGCTGC 3136 |||| |||||||||| .......... .......... .......... .......... ......TATG AAAAAGCTGC 272 CTCAGAGCTG AGTAGTCATG ACCCTCCAAT TGTTCTAGCT AAGTATGATG CAAATGATGA 3196 |||||| ||| ||| |||||| |||||||||| |||||||||| |||||||||| |||||||||| CTCAGATCTG AGTTGTCATG ACCCTCCAAT TGTTCTAGCT AAGTATGATG CAAATGATGA 332 AGCCAATAGA GAACTTTCAA AACAGTACGA GATCCAGGGT TTCCCAACTA TTAAGATATT 3256 |||||||||| |||||||||| |||||| ||| |||||| ||| |||||||||| |||||||||| AGCCAATAGA GAACTTTCAA AACAGTTCGA GATCCATGGT TTCCCAACTA TTAAGATATT 392 GAGAGATGGA GGAAAGAAAG TTCAAGACTA TAACGGTCCT CGTGAAGC-A GCTGGTATTG 3315 |||||||||| ||||||||| |||||||||| |||||||||| |||||||| | |||||||||| GAGAGATGGA TGAAAGAAAG TTCAAGACTA TAACGGTCCT CGTGAAGCTA GCTGGTATTG 452 TATCCTACTT GAAGAAACAA GTGGGTCCTG CATCTGCTGA AATCAAGTCG AAGGAAGATG 3375 |||||||||| |||||||||| |||||| ||| |||||||| | | |||||||| |||||||||| TATCCTACTT GAAGAAACAA GTGGGTTCTG CATCTGCTTA ATTCAAGTCG AAGGAAGATG 512 -CCACAAACC -TTATTGATG AGAAAAGTAT CTTTGTTGTA AGTGGTGATA CTTTTGATGT 3433 || | ||| ||||||||| |||||||||| ||||||| CCCCCTTACC TTTATTGATG AGAAAAGTAT CTTTGTT... .......... .......... 549 TTTACTTATC ATTACTGAAG TTTATTTTCA GCATGCCATA GCTTCTTGCT GTCTAAAATC 3493 .......... .......... .......... .......... .......... .......... 549 TTTCTGTTTC TTTAGG-TTG GTATATTTCC AGACCCC-T- CCGGAGAG-A AATTCGAGAA 3549 | ||| || ||| ||| ||||||| | |||||||| | |||||||||| .......... 
.....GTTTG GTCTATATCC AGACCCCTTC CCGGAGAGAA AATTCGAGAA 594 CTATTTAACG -CTAGCTGAA AAACTGCGAG GCGAGTTC-G ATTTTGCT-C ACACTGTTGA 3606 | |||||||| ||| ||||| ||||||| || | |||||| | |||||||| | |||||||||| CCATTTAACG CCTATCTGAA AAACTGCCAG GTGAGTTCTG ATTTTGCTCC ACACTGTTGA 654 T-GCTAAACA CCTCCCTCGG GGT-GGA-CC AGTCA-AC-A AGCCCACTC- TTCGTCTTCT 3660 | |||||||| ||| ||||| ||| ||| | || | | | ||||||||| |||| ||||| TGGCTAAACA CCTATCTCGG GGTGGGATCT TGTTATCCAA AGCCCACTCT TTCGGCTTCT 714 A-AAGCCATT TGATG-AACT CTTTGTT-GA TTTTG-AGGT ATTACAGTTA TCTGTGTATA 3716 | || || || ||||| |||| | |||| || | ||| || ATAACCCTTT TGATGAAACT TTCTGTTGGA TCTTGTAG.. .......... .......... 752 TTTTTATCTT TGATACTCAT TGTATTATGG TTATGAGGCT CAACTCTGTC CTATGTTATC 3776 .......... .......... .......... .......... .......... .......... 752 TTTTAATTGG CCATTGATTG TACATACTTG CAGGACTTT 3815 ||| || .......... .......... .......... ...GACCTT 758 hqPGS_C06HBa0054K13.1-11+_SGN-U333955+ (1638 1792,2969 2999,3123 3410,3509 3694,3810 3815) ******************************************************************************** EST sequence 5 +strand 843 n (File: SGN-U344686+) 1 GGAGATTNNN NNGGGTGTTN CCTATATNAC CATGCTTGAG CTCCACCGCG GTGGCGGCCG 61 CTCTAGAACT AGTGGATCCC CCGGGCTGCA GGAATTGTTG AGCTTTATCA AGGAGTGAAG 121 GAAACACAAA CTCAAATCCA CTAGTCATGT TTACAACTTC CCCTCCCTCT AGCTTCCTTA 181 AGTTCTCTTT GATAAACATC ACTCCTTTGT TTCTTTTATG AAGGCAAGTG TTCCATAAGG 241 TTAATGCAAC CACACATGCT AGTGTATTCA AGAGCCTATC ATATATGCAA AATACAAGCT 301 CTTCTCCCCA TGAACCATCA TCAAGTTGAT TGTCTATAAT CCATTGAAGA CAAGATGGAA 361 ATAGGGGCCT CTTGCTTGTT CCATTAATAT TAGTATTGGT ATCTTCAATA AAGGAAACCC 421 AAGCTGTGTC ATAGGGTGAG ACACTTGACC TTCCATCTCC CATGGAACTC AACATATTTT 481 TTATCTCTTC TCTCAAATTC CTCATATGTA TTCAAAGACG AATTCAAAAT TTAAAATTTG 541 TGAGTTTTTA AAACAATTTC AAATTTTCTA ATACATATTT ATTTCCTGTG CCGAATTCGG 601 CACGAGGCAA AAATGGCAAA GAGTGGCATT TTGGTAATTG GTTCAGCTCT TGTTGTTCTT 661 GCAGNTTGTG GTGTTTTGCT GAAGAAACCA ATTTGGTTAC TTTGGACCTT TTAACCTCAC 721 GGAACGGTGT TAACACAACT TTATTGTTGG GAATCATTCC 
ACCCTGGGTG GCACGCCAAA 781 ATAGGTCTCT GTTTTAAAAA GAACACCCGC CTAAATACTG ATACCCCTCA CCTTCAATTG 841 TAT Predicted gene structure (within gDNA segment 1 to 4080): Exon 1 1624 1792 ( 169 n); cDNA 608 766 ( 159 n); score: 0.852 MATCH C06HBa0054K13.1-11+ SGN-U344686+ 0.852 169 0.200 C PGS_C06HBa0054K13.1-11+_SGN-U344686+ (1624 1792) Alignment (genomic DNA sequence = upper lines): CAAAAATGGC AAAGAGTGGC ATTTTGGTAA TTGTTTCAGC TCTTGTTGTT CTTGCAGTTT 1683 |||||||||| |||||||||| |||||||||| ||| |||||| |||||||||| ||||||| || CAAAAATGGC AAAGAGTGGC ATTTTGGTAA TTGGTTCAGC TCTTGTTGTT CTTGCAGNTT 667 GTGGTGTTTT TGCTGAGGAG AACGAATATG TGTTGACTTT GGACCATTCT AACCTCACTG 1743 |||||| ||| |||||| || ||| ||| || ||| ||||| ||||| || | |||||||| | GTGGTG-TTT TGCTGAAGA- AACCAATTTG -GTT-ACTTT GGACC-TTTT AACCTCACGG 722 AGACTGTTGC TAAGCACAAC TTCATTGTTG TTGAATTCTA TGCACCTTG 1792 | || | || ||| |||||| || ||||||| ||| || | |||| || A-AC-GGTGT TAA-CACAAC TTTATTGTTG -GGAA-TCAT TCCACCCTG 766 hqPGS_C06HBa0054K13.1-11+_SGN-U344686+ (1624 1792) ******************************************************************************** EST sequence 1 +strand 767 n (File: SGN-U312831+) 1 TTTGGAATTG CAGTACTTCG GACTGAAGCC GGAACAGGCA CCTGTGATCA TCATAATGGA 61 CGCTGATGAA CAAAAGTATA TTAAGGACCA TGTGGAACCT GATGCCATTG CTGCTTACTT 121 GAAGGATTAC AAGGTGATGC ATCCTCTTCT TTTTCTTGGA TATATTTTGG AATGAAAGCG 181 TTGTTAATTG TGAACAATAT TCTGCACGGA AGGAAAGGCT TACTAACCTT TTTCCTCTAA 241 TTTTCCACAT TGTTACAGGA AGGAAAACTG AAGCCACATG TGAAGTCAGA GCCCATCCCT 301 GAAGTCAATG ACGAACCTGT TAAGGTGGTT GTTAGGGATA CCCTCCAGGA TATGGTTTAC 361 AAATCGGGAA AAAATGGTGC GTGTCTGTCA ATATTTTAAT CTTTATTCGA TGTCTTGGTT 421 AAGAAAGAGT TGTTTCTTTG TTGTACCATT TCGTTCTCCC TCTTTGGTGT TTACTTGCAT 481 TTCTTTACTA GTGTGTAAGG TCGTGATGAG GAGTTGGATG TCGTAAATTA ACTGATTGTG 541 GAACTATTAT ATGTAGTAGG AGAAAGAGGG AGCAGTCATC AGCTAATTGC CCGCGGGTTT 601 GACTTTAACA ATAGTTGTGC TCTCCTAAAT TATTATTCTT TTTGTCTGCT CAGTGCTGTT 661 AGAGTTCTAT GCACCTTGGT GTGGCCACTG CAAGAGTCTG GCTCCAATTT TGGATGAAGT 721 GGCTGTATCA TTTGAAAGCG 
ATCCTGATGT TCTCATTGCA AAACTGG Predicted gene structure (within gDNA segment 3747 to 4982): Exon 1 4347 4982 ( 636 n); cDNA 1 636 ( 636 n); score: 0.998 MATCH C06HBa0054K13.1-11+ SGN-U312831+ 0.998 636 0.829 C PGS_C06HBa0054K13.1-11+_SGN-U312831+ (4347 4982) Alignment (genomic DNA sequence = upper lines): TTTGGAATTG CAGTACTTCG GACTGAAGCC GGAACAGGCA CCTGTGATCA TCATAATGGA 4406 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTGGAATTG CAGTACTTCG GACTGAAGCC GGAACAGGCA CCTGTGATCA TCATAATGGA 60 CGCTGATGAA CAAAAGTATA TTAAGGACCA TGTGGAACCT GATGCCATTG CTGCTTACTT 4466 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CGCTGATGAA CAAAAGTATA TTAAGGACCA TGTGGAACCT GATGCCATTG CTGCTTACTT 120 GAAGGATTAC AAGGTGATGC ATCCTCTTCT TTTTCTTGGA TATATTTTGG AATGAAAGCG 4526 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAAGGATTAC AAGGTGATGC ATCCTCTTCT TTTTCTTGGA TATATTTTGG AATGAAAGCG 180 TTGTTAATTG TGAACAATAT TCTGCACGGA AGGAAAGGCT TACTAACCTT TTTCCTCTAA 4586 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGTTAATTG TGAACAATAT TCTGCACGGA AGGAAAGGCT TACTAACCTT TTTCCTCTAA 240 TTTTCCACAT TGTTACAGGA AGGAAAACTG AAGCCACATG TGAAGTCAGA GCCCATCCCT 4646 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTTCCACAT TGTTACAGGA AGGAAAACTG AAGCCACATG TGAAGTCAGA GCCCATCCCT 300 GAAGTCAATG ACGAACCTGT TAAGGTGGTT GTTAGGGATA CCCTCCAGGA TATGGTTTAC 4706 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAAGTCAATG ACGAACCTGT TAAGGTGGTT GTTAGGGATA CCCTCCAGGA TATGGTTTAC 360 AAATCGGGAA AAAATGGTGC GTGTCTGTCA ATATTTTAAT CTTTATTCGA TGTCTTGGTT 4766 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAATCGGGAA AAAATGGTGC GTGTCTGTCA ATATTTTAAT CTTTATTCGA TGTCTTGGTT 420 AAGAAAGAGT TGTTTCTTTG TTGTACCATT TCGTTCTCCC TCTTTGGTGT TTACTTGCAT 4826 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAGAAAGAGT TGTTTCTTTG TTGTACCATT TCGTTCTCCC TCTTTGGTGT TTACTTGCAT 480 TTCTTTACTA GTGTGTAAGG TCGTGATGAG 
GAGTTGGATG TCGTAAATTA ACTGATTGTG 4886 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCTTTACTA GTGTGTAAGG TCGTGATGAG GAGTTGGATG TCGTAAATTA ACTGATTGTG 540 GAACTATTAT ACGTAGTAGG AGAAAGAGGG AGCAGTCATC AGCTAATTGC CCGCGGGTTT 4946 |||||||||| | |||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAACTATTAT ATGTAGTAGG AGAAAGAGGG AGCAGTCATC AGCTAATTGC CCGCGGGTTT 600 GACTTTAACA ATAGTTGTGC TCTCCTAAAT TATTAT 4982 |||||||||| |||||||||| |||||||||| |||||| GACTTTAACA ATAGTTGTGC TCTCCTAAAT TATTAT 636 hqPGS_C06HBa0054K13.1-11+_SGN-U312831+ (4347 4982) Total number of EST alignments reported: 5 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 4982: PGL 1 (+ strand): 1618 4982 AGS-1 (1618 1792,2969 2999,3123 3410,3509 3694,3810 3935,4130 4264,4360 4479,4605 4722) SCR (e 1.000 d 1.000 a 0.997,e 1.000 d 0.917 a 0.997,e 1.000 d 0.907 a 0.997,e 1.000 d 0.988 a 0.999,e 1.000 d 0.981 a 0.965,e 1.000 d 0.997 a 0.858,e 1.000 d 1.000 a 0.984,e 1.000) Exon 1 1618 1792 ( 175 n); score: 1.000 Intron 1 1793 2968 (1176 n); Pd: 1.000 Pa: 0.997 Exon 2 2969 2999 ( 31 n); score: 1.000 Intron 2 3000 3122 ( 123 n); Pd: 0.917 Pa: 0.997 Exon 3 3123 3410 ( 288 n); score: 1.000 Intron 3 3411 3508 ( 98 n); Pd: 0.907 Pa: 0.997 Exon 4 3509 3694 ( 186 n); score: 1.000 Intron 4 3695 3809 ( 115 n); Pd: 0.988 Pa: 0.999 Exon 5 3810 3935 ( 126 n); score: 1.000 Intron 5 3936 4129 ( 194 n); Pd: 0.981 Pa: 0.965 Exon 6 4130 4264 ( 135 n); score: 1.000 Intron 6 4265 4359 ( 95 n); Pd: 0.997 Pa: 0.858 Exon 7 4360 4479 ( 120 n); score: 1.000 Intron 7 4480 4604 ( 125 n); Pd: 1.000 Pa: 0.984 Exon 8 4605 4722 ( 118 n); score: 1.000 PGS (1618 1792,2969 2999,3123 3410,3509 3694,3810 3935,4130 4264,4360 4479,4605 4722) SGN-U312832+ PGS (1624 1792,2969 2999,3123 3410,3509 3694,3810 3935,4130 4201) SGN-U312834+ PGS (1624 1792) SGN-U344686+ PGS (1638 1792,2969 2999,3123 3410,3509 3694,3810 3815) SGN-U333955+ 3-phase translation of AGS-1 
(+strand): . . . . . . 1618 TTCAGACAAAAATGGCAAAGAGTGGCATTTTGGTAATTGTTTCAGCTCTTGTTGTTCTTG F R Q K W Q R V A F W - L F Q L L L F L S D K N G K E W H F G N C F S S C C S C Q T K M A K S G I L V I V S A L V V L . . . . . . 1678 CAGTTTGTGGTGTTTTTGCTGAGGAGAACGAATATGTGTTGACTTTGGACCATTCTAACC Q F V V F L L R R T N M C - L W T I L T S L W C F C - G E R I C V D F G P F - P A V C G V F A E E N E Y V L T L D H S N . . . . . . : 1738 TCACTGAGACTGTTGCTAAGCACAACTTCATTGTTGTTGAATTCTATGCACCTTG : GTGTG S L R L L L S T T S L L L N S M H L : G V H - D C C - A Q L H C C - I L C T L : V W L T E T V A K H N F I V V E F Y A P W : C . . . : . . . 2974 GACACTGTAAGAGTCTTGCTCCTGAG : TATGAAAAAGCTGCCTCAGAGCTGAGTAGTCATG D T V R V L L L S : M K K L P Q S - V V M T L - E S C S - : V - K S C L R A E - S - G H C K S L A P E : Y E K A A S E L S S H . . . . . . 3157 ACCCTCCAATTGTTCTAGCTAAGTATGATGCAAATGATGAAGCCAATAGAGAACTTTCAA T L Q L F - L S M M Q M M K P I E N F Q P S N C S S - V - C K - - S Q - R T F K D P P I V L A K Y D A N D E A N R E L S . . . . . . 3217 AACAGTACGAGATCCAGGGTTTCCCAACTATTAAGATATTGAGAGATGGAGGAAAGAAAG N S T R S R V S Q L L R Y - E M E E R K T V R D P G F P N Y - D I E R W R K E S K Q Y E I Q G F P T I K I L R D G G K K . . . . . . 3277 TTCAAGACTATAACGGTCCTCGTGAAGCAGCTGGTATTGTATCCTACTTGAAGAAACAAG F K T I T V L V K Q L V L Y P T - R N K S R L - R S S - S S W Y C I L L E E T S V Q D Y N G P R E A A G I V S Y L K K Q . . . . . . 3337 TGGGTCCTGCATCTGCTGAAATCAAGTCGAAGGAAGATGCCACAAACCTTATTGATGAGA W V L H L L K S S R R K M P Q T L L M R G S C I C - N Q V E G R C H K P Y - - E V G P A S A E I K S K E D A T N L I D E . . : . . . . 3397 AAAGTATCTTTGTT : GTTGGTATATTTCCAGACCCCTCCGGAGAGAAATTCGAGAACTATT K V S L L : L V Y F Q T P P E R N S R T I K Y L C : C W Y I S R P L R R E I R E L F K S I F V : V G I F P D P S G E K F E N Y . . . . . . 
3555 TAACGCTAGCTGAAAAACTGCGAGGCGAGTTCGATTTTGCTCACACTGTTGATGCTAAAC - R - L K N C E A S S I L L T L L M L N N A S - K T A R R V R F C S H C - C - T L T L A E K L R G E F D F A H T V D A K . . . . . . 3615 ACCTCCCTCGGGGTGGACCAGTCAACAAGCCCACTCTTCGTCTTCTAAAGCCATTTGATG T S L G V D Q S T S P L F V F - S H L M P P S G W T S Q Q A H S S S S K A I - - H L P R G G P V N K P T L R L L K P F D . . : . . . . 3675 AACTCTTTGTTGATTTTGAG : GACTTTGATGTCGATGCAATGGAGAAGTTCATCTCAGAAT N S L L I L R : T L M S M Q W R S S S Q N T L C - F - : G L - C R C N G E V H L R I E L F V D F E : D F D V D A M E K F I S E . . . . . . 3850 CTAGTATTCCTGTTGTTACTATTTTTGACAATGACCCAAACAACCATCCTTATGTTAACA L V F L L L L F L T M T Q T T I L M L T - Y S C C Y Y F - Q - P K Q P S L C - Q S S I P V V T I F D N D P N N H P Y V N . . . : . . . 3910 AGTTCTTCGAAGGCACCAACGCCAAG : GCATTGCTATTTGTGAACTTTAGCTCTGAATTTG S S S K A P T P R : H C Y L - T L A L N L V L R R H Q R Q : G I A I C E L - L - I - K F F E G T N A K : A L L F V N F S S E F . . . . . . 4164 ATGCTTTTAAGTCCAAGTACAACGATGTTGCTGTGATTTACAAAGGGGATGGGGTGAGCT M L L S P S T T M L L - F T K G M G - A C F - V Q V Q R C C C D L Q R G W G E L D A F K S K Y N D V A V I Y K G D G V S . . . . . : . 4224 TTCTCTTGGGTGATGTTGAGGCTGGTCAAGGTGCTTTTGAG : TACTTCGGACTGAAGCCGG F S W V M L R L V K V L L S : T S D - S R S L G - C - G W S R C F - : V L R T E A G F L L G D V E A G Q G A F E : Y F G L K P . . . . . . 4379 AACAGGCACCTGTGATCATCATAATGGACGCTGATGAACAAAAGTATATTAAGGACCATG N R H L - S S - W T L M N K S I L R T M T G T C D H H N G R - - T K V Y - G P C E Q A P V I I I M D A D E Q K Y I K D H . . . . . : . 4439 TGGAACCTGATGCCATTGCTGCTTACTTGAAGGATTACAAG : GAAGGAAAACTGAAGCCAC W N L M P L L L T - R I T R : K E N - S H G T - C H C C L L E G L Q : G R K T E A T V E P D A I A A Y L K D Y K : E G K L K P . . . . . . 
4624 ATGTGAAGTCAGAGCCCATCCCTGAAGTCAATGACGAACCTGTTAAGGTGGTTGTTAGGG M - S Q S P S L K S M T N L L R W L L G C E V R A H P - S Q - R T C - G G C - G H V K S E P I P E V N D E P V K V V V R . . . . 4684 ATACCCTCCAGGATATGGTTTACAAATCGGGAAAAAATG I P S R I W F T N R E K M Y P P G Y G L Q I G K K D T L Q D M V Y K S G K N Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-11+_PGL-1_AGS-1_PPS_1 (1620 1792,2969 2999,3123 3410,3509 3694,3810 3935,4130 4264,4360 4479,4605 4721) (frame '0'; 1176 bp, 392 residues) 1 QTKMAKSGIL VIVSALVVLA VCGVFAEENE YVLTLDHSNL TETVAKHNFI VVEFYAPWCG 61 HCKSLAPEYE KAASELSSHD PPIVLAKYDA NDEANRELSK QYEIQGFPTI KILRDGGKKV 121 QDYNGPREAA GIVSYLKKQV GPASAEIKSK EDATNLIDEK SIFVVGIFPD PSGEKFENYL 181 TLAEKLRGEF DFAHTVDAKH LPRGGPVNKP TLRLLKPFDE LFVDFEDFDV DAMEKFISES 241 SIPVVTIFDN DPNNHPYVNK FFEGTNAKAL LFVNFSSEFD AFKSKYNDVA VIYKGDGVSF 301 LLGDVEAGQG AFEYFGLKPE QAPVIIIMDA DEQKYIKDHV EPDAIAAYLK DYKEGKLKPH 361 VKSEPIPEVN DEPVKVVVRD TLQDMVYKSG KN AGS-2 (4347 4982) SCR (e 0.998) Exon 1 4347 4982 ( 636 n); score: 0.998 PGS (4347 4982) SGN-U312831+ 3-phase translation of AGS-2 (+strand): . . . . . . 4347 TTTGGAATTGCAGTACTTCGGACTGAAGCCGGAACAGGCACCTGTGATCATCATAATGGA F G I A V L R T E A G T G T C D H H N G L E L Q Y F G L K P E Q A P V I I I M D W N C S T S D - S R N R H L - S S - W . . . . . . 4407 CGCTGATGAACAAAAGTATATTAAGGACCATGTGGAACCTGATGCCATTGCTGCTTACTT R - - T K V Y - G P C G T - C H C C L L A D E Q K Y I K D H V E P D A I A A Y L T L M N K S I L R T M W N L M P L L L T . . . . . . 4467 GAAGGATTACAAGGTGATGCATCCTCTTCTTTTTCTTGGATATATTTTGGAATGAAAGCG E G L Q G D A S S S F S W I Y F G M K A K D Y K V M H P L L F L G Y I L E - K R - R I T R - C I L F F F L D I F W N E S . . . . . . 4527 TTGTTAATTGTGAACAATATTCTGCACGGAAGGAAAGGCTTACTAACCTTTTTCCTCTAA L L I V N N I L H G R K G L L T F F L - C - L - T I F C T E G K A Y - P F S S N V V N C E Q Y S A R K E R L T N L F P L . . . . . . 
4587 TTTTCCACATTGTTACAGGAAGGAAAACTGAAGCCACATGTGAAGTCAGAGCCCATCCCT F S T L L Q E G K L K P H V K S E P I P F P H C Y R K E N - S H M - S Q S P S L I F H I V T G R K T E A T C E V R A H P . . . . . . 4647 GAAGTCAATGACGAACCTGTTAAGGTGGTTGTTAGGGATACCCTCCAGGATATGGTTTAC E V N D E P V K V V V R D T L Q D M V Y K S M T N L L R W L L G I P S R I W F T - S Q - R T C - G G C - G Y P P G Y G L . . . . . . 4707 AAATCGGGAAAAAATGGTGCGTGTCTGTCAATATTTTAATCTTTATTCGATGTCTTGGTT K S G K N G A C L S I F - S L F D V L V N R E K M V R V C Q Y F N L Y S M S W L Q I G K K W C V S V N I L I F I R C L G . . . . . . 4767 AAGAAAGAGTTGTTTCTTTGTTGTACCATTTCGTTCTCCCTCTTTGGTGTTTACTTGCAT K K E L F L C C T I S F S L F G V Y L H R K S C F F V V P F R S P S L V F T C I - E R V V S L L Y H F V L P L W C L L A . . . . . . 4827 TTCTTTACTAGTGTGTAAGGTCGTGATGAGGAGTTGGATGTCGTAAATTAACTGATTGTG F F T S V - G R D E E L D V V N - L I V S L L V C K V V M R S W M S - I N - L W F L Y - C V R S - - G V G C R K L T D C . . . . . . 4887 GAACTATTATACGTAGTAGGAGAAAGAGGGAGCAGTCATCAGCTAATTGCCCGCGGGTTT E L L Y V V G E R G S S H Q L I A R G F N Y Y T - - E K E G A V I S - L P A G L G T I I R S R R K R E Q S S A N C P R V . . . . 4947 GACTTTAACAATAGTTGTGCTCTCCTAAATTATTAT D F N N S C A L L N Y Y T L T I V V L S - I I - L - Q - L C S P K L L Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-11+_PGL-1_AGS-2_PPS_1 (4630 4872) (frame '2'; 240 bp, 80 residues) 1 SQSPSLKSMT NLLRWLLGIP SRIWFTNREK MVRVCQYFNL YSMSWLRKSC FFVVPFRSPS 61 LVFTCISLLV CKVVMRSWMS - 3-phase translation of AGS-2 (-strand): . . . . . . 4982 ATAATAATTTAGGAGAGCACAACTATTGTTAAAGTCAAACCCGCGGGCAATTAGCTGATG I I I - E S T T I V K V K P A G N - L M - - F R R A Q L L L K S N P R A I S - - N N L G E H N Y C - S Q T R G Q L A D . . . . . . 4922 ACTGCTCCCTCTTTCTCCTACTACGTATAATAGTTCCACAATCAGTTAATTTACGACATC T A P S F S Y Y V - - F H N Q L I Y D I L L P L S P T T Y N S S T I S - F T T S D C S L F L L L R I I V P Q S V N L R H . . . . . . 
4862 CAACTCCTCATCACGACCTTACACACTAGTAAAGAAATGCAAGTAAACACCAAAGAGGGA Q L L I T T L H T S K E M Q V N T K E G N S S S R P Y T L V K K C K - T P K R E P T P H H D L T H - - R N A S K H Q R G . . . . . . 4802 GAACGAAATGGTACAACAAAGAAACAACTCTTTCTTAACCAAGACATCGAATAAAGATTA E R N G T T K K Q L F L N Q D I E - R L N E M V Q Q R N N S F L T K T S N K D - R T K W Y N K E T T L S - P R H R I K I . . . . . . 4742 AAATATTGACAGACACGCACCATTTTTTCCCGATTTGTAAACCATATCCTGGAGGGTATC K Y - Q T R T I F S R F V N H I L E G I N I D R H A P F F P D L - T I S W R V S K I L T D T H H F F P I C K P Y P G G Y . . . . . . 4682 CCTAACAACCACCTTAACAGGTTCGTCATTGACTTCAGGGATGGGCTCTGACTTCACATG P N N H L N R F V I D F R D G L - L H M L T T T L T G S S L T S G M G S D F T C P - Q P P - Q V R H - L Q G W A L T S H . . . . . . 4622 TGGCTTCAGTTTTCCTTCCTGTAACAATGTGGAAAATTAGAGGAAAAAGGTTAGTAAGCC W L Q F S F L - Q C G K L E E K G - - A G F S F P S C N N V E N - R K K V S K P V A S V F L P V T M W K I R G K R L V S . . . . . . 4562 TTTCCTTCCGTGCAGAATATTGTTCACAATTAACAACGCTTTCATTCCAAAATATATCCA F P S V Q N I V H N - Q R F H S K I Y P F L P C R I L F T I N N A F I P K Y I Q L S F R A E Y C S Q L T T L S F Q N I S . . . . . . 4502 AGAAAAAGAAGAGGATGCATCACCTTGTAATCCTTCAAGTAAGCAGCAATGGCATCAGGT R K R R G C I T L - S F K - A A M A S G E K E E D A S P C N P S S K Q Q W H Q V K K K K R M H H L V I L Q V S S N G I R . . . . . . 4442 TCCACATGGTCCTTAATATACTTTTGTTCATCAGCGTCCATTATGATGATCACAGGTGCC S T W S L I Y F C S S A S I M M I T G A P H G P - Y T F V H Q R P L - - S Q V P F H M V L N I L L F I S V H Y D D H R C . . . . 4382 TGTTCCGGCTTCAGTCCGAAGTACTGCAATTCCAAA C S G F S P K Y C N S K V P A S V R S T A I P L F R L Q S E V L Q F Q Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-11-_PGL-1_AGS-2_PPS_1 (4650 4348) (frame '0'; 303 bp, 101 residues) 1 LQGWALTSHV ASVFLPVTMW KIRGKRLVSL SFRAEYCSQL TTLSFQNISK KKKRMHHLVI 61 LQVSSNGIRF HMVLNILLFI SVHYDDHRCL FRLQSEVLQF Q ... 
finished at: Mon Aug 28 22:24:46 2006 ________________________________________________________________________________ Sequence 12: C06HBa0054K13.1-12, from 1 to 43325, both strands analyzed. ... started at: Mon Aug 28 22:24:46 2006 EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA +strand ... ... found all matches, elapsed seconds = 2 ... matches indexed, elapsed seconds = 2 HitsTableSize = 4 EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA -strand ... ... found all matches, elapsed seconds = 3 ... matches indexed, elapsed seconds = 3 HitsTableSize = 5 ******************************************************************************** EST sequence 6 -strand 1990 n (File: SGN-U318979-) 1 TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT CCAAAAAAAA AAAAAATATT AAAAAATATA 61 TCATTTCAAA GTGTGTGTTT ATATGTTTCA TATAGTGGCT ATTTATAGCC ATACATTAAT 121 GCATACAAGG TCATTAGTGA TGTCATATTC CACATGACAC TACAATTATT TTACACTACA 181 ATTATTTTAC AACACTCCCC CTTGGATATC TATGTAAAAG AAAGTACATA GATATACATA 241 ATACGCTTTA TTTGCTGCCT CATTAAAAAC CTTACCAGGA AAACCCAACT TGGGACAAAA 301 CCATAGTTAA GGAAAAGAGT ACAACGCGTA TTTCACTCCC CCTGATGAAA ACTTTACTTG 361 ATATCTTGGA GACGGCGCAT TCCAATCTTG TATCTCAACT TCTCAAACGT TGATGTTGGC 421 AATGCCTTTG TGAATAAATC AGCAAGATTA TCACTTGAAC GAATTTGTTG AACTTCTATC 481 TCACCATTTT GTTGAAGATC ATGCGTGAAA AAGAACTTTG GTGAGATATG CTTTGTCCGG 541 TCTCCTTTGA TGTATCCTCC TTTCAATTGA GCTATACATG CAGCATTATC TTCGTACATT 601 GTGGTTGGTA TATTCTTTTT CAAAGAAAAA CCACACATTT CCTGAATATG ATGGGTCATT 661 GATCTCAACC AGACGCACTC TCGACTTGCT TCATGGATGG CTATTATTTC TGCATGATTT 721 GAAGAAGTGG CTACCAACGT TTGCTTCATT GATCGCCAAG ATATTGCCGT GTCTCCACAT 781 GTAAACAAAT AGCCTGTTTG TGATCGGGCT TTATGCGGAT CTGATAAATA CCCTGCATCT 841 GCGTAACCAA TCAGTTCTGA TTTGGATTCA TTGGAATAGA ATAAACCCAT GTCCATGGTC 901 CCTCGAAGAT ATCGAAGTAT GTGTTTAACA CCATTCCAAT GTCTTTTTGT TGGGGAGGAA 961 CTGAATCTTG CCAGTAGACT TACTGCAAAA CAGATATCTG GTCGAGTATT GTTAGCAAGG 1021 
TACATTAGTG CCCCGATCGC ACTAAGATAA GGAGTTTCAT CACCAAGAAG CTCTTCATCA 1081 TTCTCTTGAG GTCGAAATGG ATCTGTATTG ATGTCAAGCG ATCTTACCAC CATTGGAGTA 1141 CTCAATGGAT GTGAGTTATC CATGTAAAAA CGCTTTAGTA TCTTTTCTGT GTACGTTGAT 1201 TGATGAACAA GTATTCCATT TGACAAATTC TCAATCTGTA GGCCAAGACA AAATTTTGTC 1261 TTGCCGAGAT CTTTCATTTC AAATTCTTTT TTCAGACACT CAATAGCTTC TAAAAGCTCT 1321 TTACCAGTGC CAATGATGTT CAAATCATCA ACATACACAG CTATTATTAC AAATTCAGAC 1381 CCCGACCGTT TAATAAAAAT GCAGGGACAA ATCGGGTCAT TTTTGTACCC TTTCTTTAAC 1441 AAATATTCAC TCAGACGATT GTACCACATC CTTCCTGATT GTTTCAATCC ATACAGAGAT 1501 TTCTGAAGTT TTATTGAACA AGTTTCTCTT GAATCTTTGT ATGCTTCAGG AACTTTGAAT 1561 GCTTCAGGAA TTTTCATGAA AATGTTGTGG TCCAATGAGC CATATAGATA GGCTGTGACA 1621 ACGTCCATTA GACGCATTTC AAGTTTTTCA TGAACTGCCA AATTTATGAG ATATCTGAAG 1681 GTGATTGCAT CTACCACTGG AGAATATGTC TCCATATAAT CAATGCCAGG TCTTTGAGAA 1741 AAACCTTGAG CAACGAGTCG GGCCTTATAT CTCATGACTT CATCTTTCTC ATTTCTTTTC 1801 CGTACAAAAA CCCATTTGTA CCCCACTGGC TTGATACCTT CAGGTGTTCG GATTATCGGT 1861 CCAAAAACTT CACGTATTTC TAGTGAAGCC AATTCAGCTT GAATTGCATC CTTCCATTTT 1921 GGCCAATCAT TTCTCTGTCT ACATTCGTGA ACAGATTTTG GCTCAAAATC TTCATATTGT 1981 TGCATTATTT Predicted gene structure (within gDNA segment 11461 to 17297): Exon 1 12896 12904 ( 9 n); cDNA 156 164 ( 9 n); score: 0.667 Intron 1 12905 14678 (1774 n); Pd: 0.069 (s: 0), Pa: 0.376 (s: 0.76) Exon 2 14679 14729 ( 51 n); cDNA 165 215 ( 51 n); score: 0.745 Intron 2 14730 14788 ( 59 n); Pd: 0.000 (s: 0.74), Pa: 0.000 (s: 0.72) Exon 3 14789 16562 (1774 n); cDNA 216 1990 (1775 n); score: 0.966 PPA cDNA 30 1 MATCH C06HBa0054K13.1-12+ SGN-U318979- 0.960 1834 0.922 C PGS_C06HBa0054K13.1-12+_SGN-U318979- (12896 12904,14679 14729,14789 16562) Alignment (genomic DNA sequence = upper lines): GATAATATAG TAAAAGAAAA ATACTCAAAT ACATTGAAAT TTAAGAAAAA TATCATCTAC 12955 || | || | GACACTACA. .......... .......... .......... .......... .......... 164 ATCGTTCGTA AAAAACTTGG TTCATCGATA TCATTTTCAT TAGAGAAAAA ATTCATCTAT 13015 .......... .......... .......... 
.......... .......... .......... 164 TCCGTTATTT GTTAAATGAA AAAGGTTTCA ATTTCGCAAA ATCATTTTTT ATCTGTGGAC 13075 .......... .......... .......... .......... .......... .......... 164 AATTATAATT TGACATTGCT ATTAAGTAAA TCATTTTTTT ATAAAAAAAA ATGTCCAAAT 13135 .......... .......... .......... .......... .......... .......... 164 ATGATATCAA ACTTTGAAAA ATAACTCATT TATGTCATCT GCTTAAAGTA TGGTTAATCT 13195 .......... .......... .......... .......... .......... .......... 164 ATATAATTTC AATTAACAAA CAATGGCATG GATAAGTCTT TACTTAAACA GATATGATAT 13255 .......... .......... .......... .......... .......... .......... 164 AAATAATGTA TAATATAAAA AGTATTCTTT TAACGCTGGA TAACCTATTT AAGCAGTTTT 13315 .......... .......... .......... .......... .......... .......... 164 TCAATAATAA AATAATATAA CAATTATTAT TTTCTTAATA AATACGTCAA ATTTAGATAT 13375 .......... .......... .......... .......... .......... .......... 164 AACTAAAAGT ACACTAAAAA AGTAGTAACT AATTTAACTG CGACCGACTC CCAAATAATT 13435 .......... .......... .......... .......... .......... .......... 164 GTGACAACTG TTCCTTTTTG CAGATAGGAT AATTTATTAT TTTGTTTTAA ATTTCTTAAC 13495 .......... .......... .......... .......... .......... .......... 164 CTACTGGAAT ACTACTATTA ACTCAAAACT GCTATTTGCA TCATTGCCAT AAATGAGGAA 13555 .......... .......... .......... .......... .......... .......... 164 TTTATGACGT GCAGGCAACC GTTAAATGAC CACAATTATG GGACTAAAAT ATGAAAATAA 13615 .......... .......... .......... .......... .......... .......... 164 ATATTACCTT TTAAAAAGGA TGATTTAATT TAATTTAAAA TGGAGTTTAA AAAATGAGAG 13675 .......... .......... .......... .......... .......... .......... 164 AATTTTTTTT AATTTTGTGG TTCAAAAGTA AAATTATATC AAATGTATCA AAATGTCCTT 13735 .......... .......... .......... .......... .......... .......... 164 CAATTTTGTG GTCTTAAACA TGTCACGTGA AAAGTTAAAG TGTTGCCAAA TAAAAAAAGA 13795 .......... .......... .......... .......... .......... .......... 164 GGTCATTATT TTTGAAACAA AATAAAAAGA AAATAAAGAC ATTTTTTTTT AAAACGGAGG 13855 .......... .......... .......... .......... 
.......... .......... 164 GATTATAAAA AAATCAATAT GATCCTTTTT TCTCCCAAAC AGAGGCATTC AATGTTTATA 13915 .......... .......... .......... .......... .......... .......... 164 TTAAATTAAT TTAATTTCGG GAAAATATAT AAGTACCTTC TTGACAATGA TCAAAATCTC 13975 .......... .......... .......... .......... .......... .......... 164 AGAAACACAT CTTAACTAAA CTAAGATCCT ACTACCTCTG ATTTTTTTTT TTTTTGTAAT 14035 .......... .......... .......... .......... .......... .......... 164 TTATGCACCT TTTTGGCTTA CGTGACATCC AAATATTTTC TAGCGCCTCA ATTGCGTGGG 14095 .......... .......... .......... .......... .......... .......... 164 GTCACGAAGA GTGTCATGTA AGACAAAAGA TGTATATTAT TACAAAAATA GAAGTTCAGG 14155 .......... .......... .......... .......... .......... .......... 164 GGTAATAAAA TCTTACTTTA ATTAAGGTGT CTTTCTGAAA TTCGGTCATA ATCTAGGTGG 14215 .......... .......... .......... .......... .......... .......... 164 TACTTGTGCT TTTTCCCTTT AATTTCTATT ACAATGTAAT TTAAGGAAGT AAAATTTATG 14275 .......... .......... .......... .......... .......... .......... 164 TTATAAAATT TAAACATATG ACAAATATGT ATTGAATTGA TATAAATATT TTGTGTGAAT 14335 .......... .......... .......... .......... .......... .......... 164 AAATTTTATC CCAGATAAAA CTAAGTCATT AATATTCCTT TAAATTCTAA AAGTTCATAT 14395 .......... .......... .......... .......... .......... .......... 164 TACTTTACTC AATGGAAAAA TAAATAAATA TATTTTTGTT GCAAAGAAAG CTCAGAATAT 14455 .......... .......... .......... .......... .......... .......... 164 AAGATGAAAA AGACAAAAAG TATAAGATAG AGGAGATAAT TTTCTTATTC AAGTATATCA 14515 .......... .......... .......... .......... .......... .......... 164 TACAATGGTG AATGATATCT CTATTTATAG TGTTGAGATA TCATATTCAA AGGTCACCTT 14575 .......... .......... .......... .......... .......... .......... 164 GAAATTTTAC ATAGTTATCA TCAAGGTTCA TATCACCTTG ATATCTTATC ACATTGGAAA 14635 .......... .......... .......... .......... .......... .......... 164 CACTAATACA TGAATTCAAT TAATTGTTGG ATAACTCTAA TAGATCATCC AGATATACAA 14695 || || ||||| .......... .......... .......... .......... 
...ATTATTT TACACTACAA 181 TGAATTTACA ACACTCCCCC TTGGATATCC ATAGATTATG TGCCTCATTA AAACCTTACT 14755 | | |||||| |||||||||| ||||||||| || TTATTTTACA ACACTCCCCC TTGGATATCT ATGT...... .......... .......... 215 AGGAAAAACC CAGTGGGAAA AAGTCTAGTG AAGGAAAAAG AGTACACATA TCT-CATAAT 14814 ||| | |||||| | | | | |||||| .......... .......... .......... ...AAAAGAA AGTACATAGA TATACATAAT 242 ACGCTTTGAA TGTTGCCTCG TTAAAAACCT TACCATGAAA ACCCAACTTG GGACAAAACC 14874 ||||||| || |||||| |||||||||| ||||| |||| |||||||||| |||||||||| ACGCTTTATT TGCTGCCTCA TTAAAAACCT TACCAGGAAA ACCCAACTTG GGACAAAACC 302 ATAGTTAAGG AAAAGAGTAC AACGCGTATT TCGCTCCCCC TGATGAAAAC TTTACTTGAT 14934 |||||||||| |||||||||| |||||||||| || ||||||| |||||||||| |||||||||| ATAGTTAAGG AAAAGAGTAC AACGCGTATT TCACTCCCCC TGATGAAAAC TTTACTTGAT 362 ATCTCGGAGA CGGCGCATTC CAATCTTGTA TCTCAACTTC TCAAATGTTG ATGTTGGCAA 14994 |||| ||||| |||||||||| |||||||||| |||||||||| ||||| |||| |||||||||| ATCTTGGAGA CGGCGCATTC CAATCTTGTA TCTCAACTTC TCAAACGTTG ATGTTGGCAA 422 TGCCTTAGTG AATAAATCAG CAAGATTATC ACCTGAACGA ATTTGTTGAA CTTCTATCTC 15054 |||||| ||| |||||||||| |||||||||| || ||||||| |||||||||| |||||||||| TGCCTTTGTG AATAAATCAG CAAGATTATC ACTTGAACGA ATTTGTTGAA CTTCTATCTC 482 ACCATTTTGT TGAAGATCAT ACGTGAAAAA GAACTTTGGT GAGATATGCT TTGTCCGGTC 15114 |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| ACCATTTTGT TGAAGATCAT GCGTGAAAAA GAACTTTGGT GAGATATGCT TTGTCCGGTC 542 TCCTTTGATG TATCCTCCTT TCAATTGAAC TATACATGCA GCATTATCTT CGTACATTGT 15174 |||||||||| |||||||||| |||||||| | |||||||||| |||||||||| |||||||||| TCCTTTGATG TATCCTCCTT TCAATTGAGC TATACATGCA GCATTATCTT CGTACATTGT 602 GGTTGGTATA TTCTTTTTCA AAGAAAAACC ACACATTTTC TGAATATGAT GGGTCATTGA 15234 |||||||||| |||||||||| |||||||||| |||||||| | |||||||||| |||||||||| GGTTGGTATA TTCTTTTTCA AAGAAAAACC ACACATTTCC TGAATATGAT GGGTCATTGA 662 TCTCAACCAG ACGCACTCAC GACTTGCTTC ATGGATGGCT ATTATTTCTG CATGATTTGA 15294 |||||||||| |||||||| | |||||||||| |||||||||| |||||||||| |||||||||| TCTCAACCAG ACGCACTCTC GACTTGCTTC 
ATGGATGGCT ATTATTTCTG CATGATTTGA 722 AGAAGTGGCT ACCAACGTTT GCTTCATTGA TCGCCAAGAT ATTGTCATGT CTCCACATGT 15354 |||||||||| |||||||||| |||||||||| |||||||||| |||| | ||| |||||||||| AGAAGTGGCT ACCAACGTTT GCTTCATTGA TCGCCAAGAT ATTGCCGTGT CTCCACATGT 782 AAACAAATAT CCTGTTTGTG ATCAAGCTTT ATGAGGATCT GATAAATACC GTGCATCTGC 15414 ||||||||| |||||||||| ||| ||||| ||| |||||| |||||||||| ||||||||| AAACAAATAG CCTGTTTGTG ATCGGGCTTT ATGCGGATCT GATAAATACC CTGCATCTGC 842 GTAACCAATC AGTTCTGATT TGGATTCATT GGAATAGAAT AAACCCATGT GCATGGTCCC 15474 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| GTAACCAATC AGTTCTGATT TGGATTCATT GGAATAGAAT AAACCCATGT CCATGGTCCC 902 TCGAAGATAT CGAAATATGT GTTTAACACC ATTCCAATGT CTTTTTGTTG GGGAGGAACT 15534 |||||||||| |||| ||||| |||||||||| |||||||||| |||||||||| |||||||||| TCGAAGATAT CGAAGTATGT GTTTAACACC ATTCCAATGT CTTTTTGTTG GGGAGGAACT 962 GAATCTTGCT AGTAGACTTA CTGCAAAATA GATATCTGGT CGAGTATTGT TAGCAAGGTA 15594 ||||||||| |||||||||| |||||||| | |||||||||| |||||||||| |||||||||| GAATCTTGCC AGTAGACTTA CTGCAAAACA GATATCTGGT CGAGTATTGT TAGCAAGGTA 1022 CATTAGTGCC CCGATCGCAC TAAGATAAGG AGTTTCATCA CCAAGAAGCT CTTCATCATT 15654 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CATTAGTGCC CCGATCGCAC TAAGATAAGG AGTTTCATCA CCAAGAAGCT CTTCATCATT 1082 CTCTTGAGGT CGAAATGGAT CTGTATTGAT GTCAAGCGAT CTTACCACCA TTGGAGTACT 15714 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTCTTGAGGT CGAAATGGAT CTGTATTGAT GTCAAGCGAT CTTACCACCA TTGGAGTACT 1142 CAATGGATGT GAGTTATCCA TGTAAAAACG CTTTAGTATC TTTTCTGTGT ACGTTGATTG 15774 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAATGGATGT GAGTTATCCA TGTAAAAACG CTTTAGTATC TTTTCTGTGT ACGTTGATTG 1202 ATGAACAAGT ATTCCATTTG ACAAATTCTC AATCTGTAGG CCAAGATAAA ATTTTGTCTT 15834 |||||||||| |||||||||| |||||||||| |||||||||| |||||| ||| |||||||||| ATGAACAAGT ATTCCATTTG ACAAATTCTC AATCTGTAGG CCAAGACAAA ATTTTGTCTT 1262 GCCGAGATCT TTCATTTCAA ATTCTTTTTT CAGACACTCA ACAGCTTTTA AAAGCTCTTT 15894 |||||||||| 
|||||||||| |||||||||| |||||||||| | ||||| || |||||||||| GCCGAGATCT TTCATTTCAA ATTCTTTTTT CAGACACTCA ATAGCTTCTA AAAGCTCTTT 1322 ATGAGTGCCA ATGATGTTCA AACCATCAAC ATACACAACT ATTATTACAA ATTAAGACCC 15954 | ||||||| |||||||||| || ||||||| ||||||| || |||||||||| ||| |||||| ACCAGTGCCA ATGATGTTCA AATCATCAAC ATACACAGCT ATTATTACAA ATTCAGACCC 1382 CGACTGTTTA ATGAAAATGC AGGGACAAAT CGGGTCATTT TTGTACCCTT TCTTTAACAA 16014 |||| ||||| || ||||||| |||||||||| |||||||||| |||||||||| |||||||||| CGACCGTTTA ATAAAAATGC AGGGACAAAT CGGGTCATTT TTGTACCCTT TCTTTAACAA 1442 ATATTCACTC AGGCGATTGT ACCACATCCT TCCTGATTGT TTCAATCCAT ACAGAGATTT 16074 |||||||||| || ||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATATTCACTC AGACGATTGT ACCACATCCT TCCTGATTGT TTCAATCCAT ACAGAGATTT 1502 CTGAAGTTTT ATTGAACAAG TTTCTCTTGA ATCTTTGTAT GCTTCAGGCA CTTTGAATGC 16134 |||||||||| |||||||||| |||||||||| |||||||||| |||||||| | |||||||||| CTGAAGTTTT ATTGAACAAG TTTCTCTTGA ATCTTTGTAT GCTTCAGGAA CTTTGAATGC 1562 TTTAGGAATT TTCATGAAAA TGTTGTGGTC CAATGAGCCA TATAGATAGG CTGTGACAAC 16194 || ||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCAGGAATT TTCATGAAAA TGTTGTGGTC CAATGAGCCA TATAGATAGG CTGTGACAAC 1622 GTCGATTAGA CGCATTTCAA GTTTTTCATG AACTACCAGA TTTATGAGAT ATCTGAAGGT 16254 ||| |||||| |||||||||| |||||||||| |||| ||| | |||||||||| |||||||||| GTCCATTAGA CGCATTTCAA GTTTTTCATG AACTGCCAAA TTTATGAGAT ATCTGAAGGT 1682 GATTGCATCT ACCACTGGAG AATATGTCTC CATATAATAA ATGTTAGGTC TTTGAGAAAA 16314 |||||||||| |||||||||| |||||||||| |||||||| | ||| ||||| |||||||||| GATTGCATCT ACCACTGGAG AATATGTCTC CATATAATCA ATGCCAGGTC TTTGAGAAAA 1742 ACCTTGAGCA ACGAGTCGGG CCTTATATCT CATGACTTCA CCTTTCTCAT TTCTTTTTCG 16374 |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| ||||||| || ACCTTGAGCA ACGAGTCGGG CCTTATATCT CATGACTTCA TCTTTCTCAT TTCTTTTCCG 1802 TACAAAAACC CATTTGTACC CCACTGGCTT GATACCTTCA GGTGTTTGGA TTATCGGTCC 16434 |||||||||| |||||||||| |||||||||| |||||||||| |||||| ||| |||||||||| TACAAAAACC CATTTGTACC CCACTGGCTT GATACCTTCA GGTGTTCGGA TTATCGGTCC 
1862 AAAAACTTCA CATTTTTCTA GTGAAGACAA TTCAGCTTGA ATTGCATCCT TCCATTTTGG 16494 |||||||||| | | |||||| |||||| ||| |||||||||| |||||||||| |||||||||| AAAAACTTCA CGTATTTCTA GTGAAGCCAA TTCAGCTTGA ATTGCATCCT TCCATTTTGG 1922 CCAATCATTT CTCTGTCTAC ATTCGTGAAC AGATTTTGGC TCAAAATCTT CATCTTGTTG 16554 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||| |||||| CCAATCATTT CTCTGTCTAC ATTCGTGAAC AGATTTTGGC TCAAAATCTT CATATTGTTG 1982 CATTATTT 16562 |||||||| CATTATTT 1990 hqPGS_C06HBa0054K13.1-12+_SGN-U318979- (14679 14729,14789 16562) ******************************************************************************** EST sequence 5 -strand 919 n (File: SGN-U336683-) 1 GGATAGGGGA TCATGATTCA ACCAGAGGCC TCTCGACTGC TCAGGATAGT TATTTTTCGC 61 AGAATTGAAG AAGTGATTCC GAGGTTGCTT CAAGATGGTC AAGATATTGT GTGTTTCCAC 121 ATGTAACAAA TTAGCCTGTT GCGGACTGGC TTTATGTGGA TCAAATAAAT ACCCTGTATT 181 TGTGTAATCA ATTAGTCATG ACTGGGATTC GTTGGAATAT ACCCTATTTC ATGGTCTCCG 241 AAGATATCGA AGTATGTGTT TAACACTATT TTCATGTCTT TTTGTTGGGG AGGAACTGAA 301 TATTGCCAGT AGACTTACTG CAAAATAGAT ATTTGGTCGA GTATTATTAG CAAGGTACAT 361 TAGTACCCTT ACTGCACTAA GGTAAGGAGT TTCATCACCA AGAAGCTTTT CATCCTTCTC 421 TTGAGGTCGA AATGAACTTG TATTGATGTC AAGTAATCTT ACCACCATTG GGGTATTCAA 481 TGAATGTGAG TTATCAATGT AATATCATTT TTATATCTTT TCAATGTATG TTGATTGATG 541 AACAAGTATT TCATTTGACA AATTCTTAAT TTGTAGGTTA AGATAAAATT TTATCTTGCC 601 GAGATCTTTC ATTTTAAATT CTCTTTTCAG GCACTCAACG ACTTCTAAAA AATCTTTACG 661 AGTGTCGATG ATGTTCAAAT TATCAACATA CACAACTACT ATAACAAATT CAGACTCCAA 721 TTGTGTAATA AAAAATGTAG GGACAAATCG TGTCATTTTT GTACCCTTTC TTTAACAAAT 781 ATTCACTTAG ACGATTGTAC CAAGCCTCCG TGACTGTTTC AATTCATACA AAGATTTTTA 841 AAGCTTTATC GAACTAGTTC TCTCTCTCCT CGTGCCGAAA CCTGCAGCCC GGGGGATCCA 901 CTATTGAAAG CCGGCCGCC Predicted gene structure (within gDNA segment 12171 to 17943): Exon 1 15220 16091 ( 872 n); cDNA 3 854 ( 852 n); score: 0.834 MATCH C06HBa0054K13.1-12+ SGN-U336683- 0.834 872 0.949 C PGS_C06HBa0054K13.1-12+_SGN-U336683- (15220 16091) Alignment (genomic DNA sequence = 
upper lines): ATGATGGGTC ATTGATCTCA ACCAGACGCA CTCACGACTT GCTTCATGGA TGGCTATTAT 15279 || || || | |||| ||| |||||| || ||| ||||| || ||| ||| | | |||| | ATAGGGGATC A-TGAT-TCA ACCAGAGGC- CTCTCGACT- GC-TCA-GGA TAGTTATT-T 55 TTCTGCATGA TTTGAAGAAG TGGCTACCAA CGTTTGCTTC ATTGATCGCC AAGATATTGT 15339 ||| ||| || ||||||||| | | | || | | ||||||| | ||| | | |||||||||| TTC-GCA-GA ATTGAAGAAG T-GATTCCGA -GGTTGCTTC AA-GATGGTC AAGATATTGT 110 CATGTCTCCA CATGTAAACA AATATCCTGT TTGTGATCAA GCTTTATGAG GATCTGATAA 15399 ||| |||| ||||||| | | || |||| ||| | | |||||||| | |||| |||| -GTGTTTCCA CATGTAACAA ATTAGCCTG- TTGCGGACTG GCTTTATGTG GATCAAATAA 168 ATACCGTGCA TCTGCGTAAC CAATCAGTTC TGATTTGGAT TCATTGGAAT AGAATAAACC 15459 ||||| || | | || |||| |||| ||| ||| | |||| || ||||||| || | || ATACCCTGTA TTTGTGTAAT CAATTAGTCA TGACTGGGAT TCGTTGGAAT ---AT-ACCC 224 CATGTGCATG GTCCCTCGAA GATATCGAAA TATGTGTTTA ACACCATTCC AATGTCTTTT 15519 || | |||| ||| | |||| ||||||||| |||||||||| |||| ||| ||||||||| TAT-TTCATG GTCTC-CGAA GATATCGAAG TATGTGTTTA ACACTATTTT CATGTCTTTT 282 TGTTGGGGAG GAACTGAATC TTGCTAGTAG ACTTACTGCA AAATAGATAT CTGGTCGAGT 15579 |||||||||| ||||||||| |||| ||||| |||||||||| |||||||||| ||||||||| TGTTGGGGAG GAACTGAATA TTGCCAGTAG ACTTACTGCA AAATAGATAT TTGGTCGAGT 342 ATTGTTAGCA AGGTACATTA GTGCCCCGAT CGCACTAAGA TAAGGAGTTT CATCACCAAG 15639 ||| |||||| |||||||||| || ||| | |||||||| |||||||||| |||||||||| ATTATTAGCA AGGTACATTA GTACCCTTAC TGCACTAAGG TAAGGAGTTT CATCACCAAG 402 AAGCTCTTCA TCATTCTCTT GAGGTCGAAA TGGATCTGTA TTGATGTCAA GCGATCTTAC 15699 ||||| |||| || ||||||| |||||||||| || | |||| |||||||||| | ||||||| AAGCTTTTCA TCCTTCTCTT GAGGTCGAAA TGAACTTGTA TTGATGTCAA GTAATCTTAC 462 CACCATTGGA GTACTCAATG GATGTGAGTT ATCCATGTAA AAACGCTTTA GTATCTTTTC 15759 ||||||||| ||| |||||| ||||||||| ||| |||||| | | ||| ||||||||| CACCATTGGG GTATTCAATG AATGTGAGTT ATCAATGTAA TATCATTTTT ATATCTTTTC 522 TGTGTACGTT GATTGATGAA CAAGTATTCC ATTTGACAAA TTCTCAATCT GTAGGCCAAG 15819 |||| ||| |||||||||| |||||||| | |||||||||| |||| ||| | ||||| ||| AATGTATGTT 
GATTGATGAA CAAGTATTTC ATTTGACAAA TTCTTAATTT GTAGGTTAAG 582 ATAAAATTTT GTCTTGCCGA GATCTTTCAT TTCAAATTCT TTTTTCAGAC ACTCAACAGC 15879 |||||||||| ||||||||| |||||||||| || ||||||| ||||||| | ||||||| | ATAAAATTTT ATCTTGCCGA GATCTTTCAT TTTAAATTCT CTTTTCAGGC ACTCAACGAC 642 TTTTAAAAGC TCTTTATGAG TGCCAATGAT GTTCAAACCA TCAACATACA CAACTATTAT 15939 || ||||| |||||| ||| || | ||||| ||||||| | |||||||||| |||||| ||| TTCTAAAAAA TCTTTACGAG TGTCGATGAT GTTCAAATTA TCAACATACA CAACTACTAT 702 TACAAATTAA GACCCCGACT GTTTAAT-GA AAATGCAGGG ACAAATCGGG TCATTTTTGT 15998 ||||||| | ||| || | | || |||| | ||||| |||| |||||||| | |||||||||| AACAAATTCA GACTCCAATT GTGTAATAAA AAATGTAGGG ACAAATCGTG TCATTTTTGT 762 ACCCTTTCTT TAACAAATAT TCACTCAGGC GATTGTACCA CATCCTTCCT GATTGTTTCA 16058 |||||||||| |||||||||| ||||| || | |||||||||| | ||| | | || ||||||| ACCCTTTCTT TAACAAATAT TCACTTAGAC GATTGTACCA -AGCCTCCGT GACTGTTTCA 821 ATCCATACAG AGATTTCTGA AGTTTTATTG AAC 16091 || |||||| |||||| | | || ||||| | ||| ATTCATACAA AGATTTTTAA AGCTTTATCG AAC 854 hqPGS_C06HBa0054K13.1-12+_SGN-U336683- (15220 16091) ******************************************************************************** EST sequence 8 +strand 848 n (File: SGN-U339613+) 1 NATGAAACCT ATGGTCGAGC TCCACCGCGG TGGCGGCCGC TCTAGAACTA GTGGATCCCC 61 CGGGCTGCAG GAATTCGGCA CGAGGTACGA ATTTGCTACT TGCATAGAAA TCTCCAATTG 121 TCAGTTGGAT TCAAGATGAG TAATGGAGAT GTATACCTTT TTCATAGTGC TACTACACAT 181 ACAATATTAA AAGAAAAGAA AAGAAATACT TTTCTAATTT GGTTATGAAA ATGACATATG 241 TCAACACAAT ATCAGGTAGT ACAAAATTAA TTGAGAGCTC TGGAAGAGCG ACCTTATTAC 301 AACCTGGAGA GACAATATTG GACATTTGTA ATGCATTATA CCGTAGTAAG TCACAAAGAA 361 ACTTATTAAG TTTCAGAGTA ATTCGCCAAA ATGGCTATCA CGTTGAGACG ACTAATGAAG 421 GAAAGGATGA ATGCCTTTAC ATTACTACAA TTAATGTACA GAAGAAAATT GTGCATGAAA 481 AATTACGTGC ATTTTCTTCT AGGTTCTACT ATACAAGTGT AAGTACAGTT GAATCACATG 541 TCGTAGTAAA CAAAAGGTTT ACTAATTTTA ATGATTCTAT TATTTGGTAT GACCGGTTGG 601 GCCATCCCGG ATTTAATATG ATGCACAGAA TCATTGAGAA TTCACGTGGG CACACCTTAG 661 AGAGTCCAAA TATGCTTCAA TCAAAGGAAT 
TCTCTTGTGT TGCTTGTTCT CAAGGAAAGT 721 TGATCATTAA CCATCAACAG TTGAAGTAAA AATTGAATTC CCTGCGTGTC TGGAACATAT 781 ACAGGGTGGA ATGTTTTGGA CACATTCCAC CTGCATGTGG GACCCTGTAA ATTTAGTTGA 841 ACTTGATG Predicted gene structure (within gDNA segment 20778 to 16011): Exon 1 19702 19682 ( 21 n); cDNA 87 107 ( 21 n); score: 0.905 Intron 1 19681 18419 (1263 n); Pd: 1.000 (s: 0), Pa: 0.899 (s: 0.94) Exon 2 18418 17684 ( 735 n); cDNA 108 847 ( 740 n); score: 0.886 MATCH C06HBa0054K13.1-12- SGN-U339613+ 0.886 756 0.892 C PGS_C06HBa0054K13.1-12-_SGN-U339613+ (19702 19682,18418 17684) Alignment (genomic DNA sequence = upper lines): ACGAAATTGC TACTTGCAAA GGTACTATAT AAATATTTTT ATTTAATTCA CATATTCTCT 19643 ||||| |||| |||||||| | | ACGAATTTGC TACTTGCATA G......... .......... .......... .......... 107 CATAATTTTC ATTATGTCGA ATTTATCCAA ACTTGAGTTT GTGGCATTAG ATATTTCTGG 19583 .......... .......... .......... .......... .......... .......... 107 AAAGAATTAT CTTTCATGGG TACTCGATGC TGAGATTCAC TTGGCTGCTA AAGGTCTTGA 19523 .......... .......... .......... .......... .......... .......... 107 TGCCACTATT ACTCAGGGAA AAGAAGCATC CAGTCAAGAT AAGGCGAAGG CTATGATTTT 19463 .......... .......... .......... .......... .......... .......... 107 CCTTCGTTAT CATCTTGATG AGGGCCTGAA GATTGAATAT CTGACGGTGA AATATCCACT 19403 .......... .......... .......... .......... .......... .......... 107 TGAATAGTGG ACTGATTTAA AGGGGAGATA TGACCACCGA AAGGCAACAG TGTTGCCAAG 19343 .......... .......... .......... .......... .......... .......... 107 AGCTCGTTAT GAGTGGATGC ATTTATGGTT TCAAGATTTT AAGACCGTAA TTGAATACAA 19283 .......... .......... .......... .......... .......... .......... 107 CAGAGTTGTA TTCAGGATAA CCTCCCAGTT GAAATTATGT GGGGAGACTA TAAAAGATGA 19223 .......... .......... .......... .......... .......... .......... 107 GAACATGTTG GAAAAGACAC TTACTACTTT TCATGCCTCG AATGTGATAT TGCAGTAGCA 19163 .......... .......... .......... .......... .......... .......... 107 ATATCGTGAA AAGGGTTTTC AGAAATATTC TGAACTAATC TCATGTCTTT TGGTGGCTGA 19103 .......... 
.......... .......... .......... .......... .......... 107 ACAACATAAT GCTCTTTTAA TGAAAAATCA TGAAGCTCGT CCCACTGGAG CTGCTCCATT 19043 .......... .......... .......... .......... .......... .......... 107 ACCGGAGGCA AATGTGGTGG AAGCACGTGA TCAATCTGAA GTAAAAAGAG ATGATCATCG 18983 .......... .......... .......... .......... .......... .......... 107 GGGATATAAT AATGTATGGG GACGTGGCAA AGATAAAAGA CGATACACTA ATCGTCAAGA 18923 .......... .......... .......... .......... .......... .......... 107 TGGTGATCAT AATAAAAGGG AGAACAACAT GAGTTCTCAA AATAACCCCT CAAAAAGTAA 18863 .......... .......... .......... .......... .......... .......... 107 TTGTCGTCAT TGTGGCATGA AAGGCCATTG GAAGAATGAA TGTCGCACAC ATGAACATTT 18803 .......... .......... .......... .......... .......... .......... 107 TGTAAGGCTC TATCAAAATT CCTTCAAAAA GAAAGGAAAT AAAAGTGGTG CTTCCTCTTC 18743 .......... .......... .......... .......... .......... .......... 107 CAATGCTCGA GCTGAGTCAC ATCTGACTCT TAAAGATGGT GATAAGCCGG GAACATCTCA 18683 .......... .......... .......... .......... .......... .......... 107 GAAATATGAT AAAGATGTTG AAGAAAATTT GGCTTTAAGG GATGATGTTT TTGATGGCCT 18623 .......... .......... .......... .......... .......... .......... 107 TGGTGACATT ACTCATATGG AAGTTGATGA CTTCTTTGGA GATCGAAACT AATGTTTGAT 18563 .......... .......... .......... .......... .......... .......... 107 CTTTTAGCTG GGGAATGAAA TTTGTTAATA TTTTACTTAT GTATTTTTAA TTATTATGTT 18503 .......... .......... .......... .......... .......... .......... 107 ATTCATGTTG AAGTATTTAA ATTTCAGTTG TTAATTTTGT TTCTTCCTCC TTTTGATGTA 18443 .......... .......... .......... .......... .......... .......... 107 TTTTATTTTA ATGAAAATTA ATAGAAATCC CCAGTTGTCA GTTGGATTCA AGATGAGTAA 18383 ||||| ||| |||||| |||||||||| |||||||||| .......... .......... 
....AAATCT CCAATTGTCA GTTGGATTCA AGATGAGTAA 143 TGGAGATGTA TGCCTTCTTG ATAGTGCTAC AACGCATACA ATATTAAAAG AAAAG---A- 18327 |||||||||| | |||| || |||||||||| || |||||| |||||||||| ||||| | TGGAGATGTA TACCTTTTTC ATAGTGCTAC TACACATACA ATATTAAAAG AAAAGAAAAG 203 AA-TACTTTT CTAATTTGGT TATGAAAATG GCATATGTCA ACACAATATC AGGTAGTACA 18268 || ||||||| |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| AAATACTTTT CTAATTTGGT TATGAAAATG ACATATGTCA ACACAATATC AGGTAGTACA 263 AAATTAATTG AGGGCTCTGG AAGAGCGACC TTATTACTAC CTGGAGGGAC AATATTAAGC 18208 |||||||||| || ||||||| |||||||||| ||||||| || |||||| ||| |||||| | AAATTAATTG AGAGCTCTGG AAGAGCGACC TTATTACAAC CTGGAGAGAC AATATTGGAC 323 ATTGATAATG CATTATATTG TAGTAAGTCT CAAAGAAACT TATTAAGTTT CAAAGTTATT 18148 ||| ||||| ||||||| | ||||||||| |||||||||| |||||||||| || ||| ||| ATTTGTAATG CATTATACCG TAGTAAGTCA CAAAGAAACT TATTAAGTTT CAGAGTAATT 383 CGCCAAAATG GCTATCATGT TGAGACGACT AATGAAGGAA AGGTTGAATA CCTTTACATT 18088 |||||||||| ||||||| || |||||||||| |||||||||| ||| ||||| |||||||||| CGCCAAAATG GCTATCACGT TGAGACGACT AATGAAGGAA AGGATGAATG CCTTTACATT 443 ACTACAATTA ATGTAGAGAA GAAAATTGTG CATGAAAAAT CACCTGCATT TTCTTCTGGG 18028 |||||||||| ||||| |||| |||||||||| |||||||||| || |||||| ||||||| || ACTACAATTA ATGTACAGAA GAAAATTGTG CATGAAAAAT TACGTGCATT TTCTTCTAGG 503 TTGTACTATA CAAGTATAAG TACAGTCGAA TCACATGCCG TAGTAAACAA AAGGTTTACT 17968 || ||||||| ||||| |||| |||||| ||| ||||||| || |||||||||| |||||||||| TTCTACTATA CAAGTGTAAG TACAGTTGAA TCACATGTCG TAGTAAACAA AAGGTTTACT 563 AATTTTAATG ATTTTATTAT TTGGCATGAC CAGTTGGGCC ATCCCAGATT TAATATGATG 17908 |||||||||| ||| |||||| |||| ||||| | |||||||| ||||| |||| |||||||||| AATTTTAATG ATTCTATTAT TTGGTATGAC CGGTTGGGCC ATCCCGGATT TAATATGATG 623 CGCAAAATCA TTGAGAATTC ACATGGGCAC ACCTTAAAGA GCCCAAATAT CCTTCAATCA 17848 | || ||||| |||||||||| || ||||||| |||||| ||| | |||||||| ||||||||| CACAGAATCA TTGAGAATTC ACGTGGGCAC ACCTTAGAGA GTCCAAATAT GCTTCAATCA 683 AAGGAATTCT CTTGTGCTGC TTGTTCTCAA GGAAAGTTGA TCATTAAACC ATCAACAGTT 17788 |||||||||| |||||| ||| 
|||||||||| |||||||||| ||||| |||| |||||||||| AAGGAATTCT CTTGTGTTGC TTGTTCTCAA GGAAAGTTGA TCATT-AACC ATCAACAGTT 742 AAAGTTGGAA TTGAATCCCC TGCGTTTCTG GAACGTATAC AGGATGATAT -ATGTGGACC 17729 |||| || |||||| ||| ||||| |||| |||| ||||| ||| || || | ||||| GAAGTAAAAA TTGAATTCCC TGCGTGTCTG GAACATATAC AGGGTGGAAT GTTTTGGACA 802 AATTCAACCT GCATGT-GGA CCATTTAAAT ATTATATGGT CTTGAT 17684 |||| |||| |||||| ||| || | ||||| ||| || |||||| CATTCCACCT GCATGTGGGA CCCTGTAAAT -TTAGTTGAA CTTGAT 847 hqPGS_C06HBa0054K13.1-12-_SGN-U339613+ (19702 19682,18418 17684) ******************************************************************************** EST sequence 7 +strand 842 n (File: SGN-U339612+) 1 AGCTGGAGCT CCCCGCGGTG GCGGCCGCTC TTGAACTAGT GGATCCCCCG GGCTGCAGGA 61 ATTCGGCACG AGGCTTTTCT AATTTGGTTA TGAAAATGGC ATATGTCAAC ACAAAATTAA 121 TTGAGGGCTC TGGAAGAGCG ACCTTATTAC TACCTGGAGG GACAATATTA AGCATTGATA 181 ATGCATTATA TTGTAGTAAG TCTCAAAGAA ACTTATTAAG TTTCAAAGTT ATTCGCCAAA 241 ATGGCTATCA TGTTGAGACG GCTAATGAAG GAAAGGTTGA ATACCTTTAC ATTACTACAA 301 TTAATGTTGA GAAGAAAATT GTGCATGAAA AATTACCTGC ATTTTCTTCT GGGTTGTACT 361 ATACAAGTAT AAGTACAGTT GAATCACATG CCGTAGTAAA CAAAAGGTTT ACTAATTTTA 421 ATGATTTTAT CATTTGGCAT GACCCGGTTG GGGCCATCCT GGATTTAATA TGAAGCGCCA 481 AATCTTTGGA AATTCCCATG GAACCCCCTT AAAAAACCCA AATTCCCTTC ATCAAAGAAA 541 TTTCCTAAGG GCGCTTGGTC CCCCGGGAAA TGTGTTCTTT TAACCCCTCC CCCTTAAGGG 601 GGGGAAATAA ACCCCCCCGC GTTTTGGAAA AAAAAAAAAA AGGATAATTT GGGGGCCCAC 661 TATCACCCCG CGGGGGGGCC CCCCTCTAAA AAAAAAATAG CGCGCCCTAA AAAAAACCCC 721 CCCCCAAAAA AGGGGCCCCC TGGGTGTTTT TTTTTTACAC CCCCCCAACA CCTTTTTTTT 781 AAAAAAAAGG GCCCCCACCC GAAAAAAAAA AGAAGGAAGA CCCCCCCTCC CCCCCCCCCC 841 AC Predicted gene structure (within gDNA segment 19652 to 13221): Exon 1 18321 17757 ( 565 n); cDNA 75 627 ( 553 n); score: 0.854 Intron 1 17756 14031 (3726 n); Pd: 0.000 (s: 0.60), Pa: 0.000 (s: 0) Exon 2 14030 14017 ( 14 n); cDNA 628 641 ( 14 n); score: 1.000 PPA cDNA 802 814 MATCH C06HBa0054K13.1-12- SGN-U339612+ 0.854 579 0.688 C 
PGS_C06HBa0054K13.1-12-_SGN-U339612+ (18321 17757,14030 14017) Alignment (genomic DNA sequence = upper lines): TTTTCTAATT TGGTTATGAA AATGGCATAT GTCAACACAA TATCAGGTAG TACAAAATTA 18262 |||||||||| |||||||||| |||||||||| || | | | | | |||||||| TTTTCTAATT TGGTTATGAA AATGGCATAT GT-----C-A -A-C----A- --CAAAATTA 119 ATTGAGGGCT CTGGAAGAGC GACCTTATTA CTACCTGGAG GGACAATATT AAGCATTGAT 18202 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTGAGGGCT CTGGAAGAGC GACCTTATTA CTACCTGGAG GGACAATATT AAGCATTGAT 179 AATGCATTAT ATTGTAGTAA GTCTCAAAGA AACTTATTAA GTTTCAAAGT TATTCGCCAA 18142 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AATGCATTAT ATTGTAGTAA GTCTCAAAGA AACTTATTAA GTTTCAAAGT TATTCGCCAA 239 AATGGCTATC ATGTTGAGAC GACTAATGAA GGAAAGGTTG AATACCTTTA CATTACTACA 18082 |||||||||| |||||||||| | |||||||| |||||||||| |||||||||| |||||||||| AATGGCTATC ATGTTGAGAC GGCTAATGAA GGAAAGGTTG AATACCTTTA CATTACTACA 299 ATTAATGTAG AGAAGAAAAT TGTGCATGAA AAATCACCTG CATTTTCTTC TGGGTTGTAC 18022 |||||||| | |||||||||| |||||||||| |||| ||||| |||||||||| |||||||||| ATTAATGTTG AGAAGAAAAT TGTGCATGAA AAATTACCTG CATTTTCTTC TGGGTTGTAC 359 TATACAAGTA TAAGTACAGT CGAATCACAT GCCGTAGTAA ACAAAAGGTT TACTAATTTT 17962 |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| TATACAAGTA TAAGTACAGT TGAATCACAT GCCGTAGTAA ACAAAAGGTT TACTAATTTT 419 AATGATTTTA TTATTTGGCA TGA-CCAGTT -GGGCCATCC CAGATTTAAT ATGATGCGCA 17904 |||||||||| | |||||||| ||| || ||| ||||||||| |||||||| |||| |||| AATGATTTTA TCATTTGGCA TGACCCGGTT GGGGCCATCC TGGATTTAAT ATGAAGCGCC 479 AAATCATTGA GAATTCACAT GGGCACACCT TAAAGAGCCC AAATATCCTT CAATCAAAGG 17844 ||||| ||| ||||| ||| || | ||| |||| | ||| |||| |||| | ||||||| AAATCTTTGG AAATTCCCAT GGAACCCCCT TAAAAAACCC AAATTCCCTT C-ATCAAAGA 538 AATTCTCTTG TGCTGCTT-G TTCTCAAGGA AAGTTGATC- ATTAAACCAT CAACAGTTAA 17786 |||| || | |||| | | | | ||| || || || |||| || | | | | | AATTTCCTAA GGGCGCTTGG TCCCCCGGGA AATGTGTTCT TTTAACCCCT CCCCCTTAAG 598 AGTTGGAATT GAA-TCCCCT GCGTTTCTGG AACGTATACA 
GGATGATATA TGTGGACCAA 17727 | |||| | || |||| |||||| ||| GGGGGGAAAT AAACCCCCCC GCGTTT-TGG .......... .......... .......... 627 TTCAACCTGC ATGTGGACCA TTTAAATATT ATATGGTCTT GATAGATGTT TCTACAAGAT 17667 .......... .......... .......... .......... .......... .......... 627 GGTCACATGT GTGTTTATTA TCAACTCGCA ACATGGCTTT TGCGAGATTG CTGGCTCAAA 17607 .......... .......... .......... .......... .......... .......... 627 TAATAAGATT GAAAGCACAA TTTCCAGACT ATACAATAAA GACAATCCGT CTAGATAATG 17547 .......... .......... .......... .......... .......... .......... 627 ATGGTGAGTT TACATCTCAA GCATTTAATG ATTATTGTAT GTCTACTGGT ATAACAGTTG 17487 .......... .......... .......... .......... .......... .......... 627 AACATCCAGT TGCTCATGTT CACACTCAAA ACGGTCTAGC AGAATCATTG ATTAAACGTC 17427 .......... .......... .......... .......... .......... .......... 627 TACAATTGAT AGTTAGACCA TTACTAGTGA GAACAAAGTT ATCTGTGTCG ATGTGGGGGC 17367 .......... .......... .......... .......... .......... .......... 627 ATGTTATTTT GCATGCAGCA GCACTTGTGC GCATAAGGCC GACCAATTAT CATGAATTCT 17307 .......... .......... .......... .......... .......... .......... 627 CCCCATTACA ATTAACTTTT GGTCAAGAAC CAAACATTTC CCATCTCAGA GTTTTTGAAT 17247 .......... .......... .......... .......... .......... .......... 627 GTGCGGTGTA TGTCCCAATT GCTCCACCAC AATGCACAAA GATGGGGCCC CAAAGAAGGT 17187 .......... .......... .......... .......... .......... .......... 627 TGGCGATATA TGTTGGGTAT GAATCTCCTT CAATCATAAA ATATTTGGAG CCTATGACTG 17127 .......... .......... .......... .......... .......... .......... 627 GAGATTTATT TAAGGCAAGA TTTGCTGATT GTCATTTTGA TGAATCAGTA TACCCAACAT 17067 .......... .......... .......... .......... .......... .......... 627 TAGGGGGAGA ACATAAGTCA TTGGGAAAAG AGATAGATTG GAATTCATCA TCTCTATCTC 17007 .......... .......... .......... .......... .......... .......... 627 ATCTGGATCC TCGAACAAAC CAATGTGAGC AAGAAGTTCA AAGAATAATT TATTTGCAGA 16947 .......... .......... .......... .......... .......... .......... 
627 ACATTGCAAA TCAGCTACCA GATGCATTTA CTAATCTTCC AAGGATTACT AAATCGCATA 16887 .......... .......... .......... .......... .......... .......... 627 TTCCAGCTGT TAATGCTCCA GTTTGAGTTG ATATCCCGAC GGGACAAATA GTTAAGGCAA 16827 .......... .......... .......... .......... .......... .......... 627 ATGAGTCTAG ACCACATTTG AAGCGTGGTA GACCAATTGG TTCCAAGGAT AAAAATCCTC 16767 .......... .......... .......... .......... .......... .......... 627 GAAAGAGAAA AGGAATAAAT GATCAATATG ATCATGGTTG AAAGAAATTT CTCAAGATGA 16707 .......... .......... .......... .......... .......... .......... 627 GACCCAAGTC ATAACACATG ATGATGAGGA GGTTCCGAAC TTCTGAAAAT AATGAAATTT 16647 .......... .......... .......... .......... .......... .......... 627 CAATGAATTA TGTCTCGACG AGAAAGTTGT GGAACCGAAA TAATGTTGTG ATTGACAACA 16587 .......... .......... .......... .......... .......... .......... 627 TATTTGCCTA TAATGTTGCT ATTGAAATAA TGCAACAAGA TGAAGATTTT GAGCCAAAAT 16527 .......... .......... .......... .......... .......... .......... 627 CTGTTCACGA ATGTAGACAG AGAAATGATT GGCCAAAATG GAAGGATGCA ATTCAAGCTG 16467 .......... .......... .......... .......... .......... .......... 627 AATTGTCTTC ACTAGAAAAA TGTGAAGTTT TTGGACCGAT AATCCAAACA CCTGAAGGTA 16407 .......... .......... .......... .......... .......... .......... 627 TCAAGCCAGT GGGGTACAAA TGGGTTTTTG TACGAAAAAG AAATGAGAAA GGTGAAGTCA 16347 .......... .......... .......... .......... .......... .......... 627 TGAGATATAA GGCCCGACTC GTTGCTCAAG GTTTTTCTCA AAGACCTAAC ATTTATTATA 16287 .......... .......... .......... .......... .......... .......... 627 TGGAGACATA TTCTCCAGTG GTAGATGCAA TCACCTTCAG ATATCTCATA AATCTGGTAG 16227 .......... .......... .......... .......... .......... .......... 627 TTCATGAAAA ACTTGAAATG CGTCTAATCG ACGTTGTCAC AGCCTATCTA TATGGCTCAT 16167 .......... .......... .......... .......... .......... .......... 627 TGGACCACAA CATTTTCATG AAAATTCCTA AAGCATTCAA AGTGCCTGAA GCATACAAAG 16107 .......... .......... .......... .......... .......... .......... 
627 ATTCAAGAGA AACTTGTTCA ATAAAACTTC AGAAATCTCT GTATGGATTG AAACAATCAG 16047 .......... .......... .......... .......... .......... .......... 627 GAAGGATGTG GTACAATCGC CTGAGTGAAT ATTTGTTAAA GAAAGGGTAC AAAAATGACC 15987 .......... .......... .......... .......... .......... .......... 627 CGATTTGTCC CTGCATTTTC ATTAAACAGT CGGGGTCTTA ATTTGTAATA ATAGTTGTGT 15927 .......... .......... .......... .......... .......... .......... 627 ATGTTGATGG TTTGAACATC ATTGGCACTC ATAAAGAGCT TTTAAAAGCT GTTGAGTGTC 15867 .......... .......... .......... .......... .......... .......... 627 TGAAAAAAGA ATTTGAAATG AAAGATCTCG GCAAGACAAA ATTTTATCTT GGCCTACAGA 15807 .......... .......... .......... .......... .......... .......... 627 TTGAGAATTT GTCAAATGGA ATACTTGTTC ATCAATCAAC GTACACAGAA AAGATACTAA 15747 .......... .......... .......... .......... .......... .......... 627 AGCGTTTTTA CATGGATAAC TCACATCCAT TGAGTACTCC AATGGTGGTA AGATCGCTTG 15687 .......... .......... .......... .......... .......... .......... 627 ACATCAATAC AGATCCATTT CGACCTCAAG AGAATGATGA AGAGCTTCTT GGTGATGAAA 15627 .......... .......... .......... .......... .......... .......... 627 CTCCTTATCT TAGTGCGATC GGGGCACTAA TGTACCTTGC TAACAATACT CGACCAGATA 15567 .......... .......... .......... .......... .......... .......... 627 TCTATTTTGC AGTAAGTCTA CTAGCAAGAT TCAGTTCCTC CCCAACAAAA AGACATTGGA 15507 .......... .......... .......... .......... .......... .......... 627 ATGGTGTTAA ACACATATTT CGATATCTTC GAGGGACCAT GCACATGGGT TTATTCTATT 15447 .......... .......... .......... .......... .......... .......... 627 CCAATGAATC CAAATCAGAA CTGATTGGTT ACGCAGATGC ACGGTATTTA TCAGATCCTC 15387 .......... .......... .......... .......... .......... .......... 627 ATAAAGCTTG ATCACAAACA GGATATTTGT TTACATGTGG AGACATGACA ATATCTTGGC 15327 .......... .......... .......... .......... .......... .......... 627 GATCAATGAA GCAAACGTTG GTAGCCACTT CTTCAAATCA TGCAGAAATA ATAGCCATCC 15267 .......... .......... .......... .......... .......... .......... 
627 ATGAAGCAAG TCGTGAGTGC GTCTGGTTGA GATCAATGAC CCATCATATT CAGAAAATGT 15207 .......... .......... .......... .......... .......... .......... 627 GTGGTTTTTC TTTGAAAAAG AATATACCAA CCACAATGTA CGAAGATAAT GCTGCATGTA 15147 .......... .......... .......... .......... .......... .......... 627 TAGTTCAATT GAAAGGAGGA TACATCAAAG GAGACCGGAC AAAGCATATC TCACCAAAGT 15087 .......... .......... .......... .......... .......... .......... 627 TCTTTTTCAC GTATGATCTT CAACAAAATG GTGAGATAGA AGTTCAACAA ATTCGTTCAG 15027 .......... .......... .......... .......... .......... .......... 627 GTGATAATCT TGCTGATTTA TTCACTAAGG CATTGCCAAC ATCAACATTT GAGAAGTTGA 14967 .......... .......... .......... .......... .......... .......... 627 GATACAAGAT TGGAATGCGC CGTCTCCGAG ATATCAAGTA AAGTTTTCAT CAGGGGGAGC 14907 .......... .......... .......... .......... .......... .......... 627 GAAATACGCG TTGTACTCTT TTCCTTAACT ATGGTTTTGT CCCAAGTTGG GTTTTCATGG 14847 .......... .......... .......... .......... .......... .......... 627 TAAGGTTTTT AACGAGGCAA CATTCAAAGC GTATTATGAG ATATGTGTAC TCTTTTTCCT 14787 .......... .......... .......... .......... .......... .......... 627 TCACTAGACT TTTTCCCACT GGGTTTTTCC TAGTAAGGTT TTAATGAGGC ACATAATCTA 14727 .......... .......... .......... .......... .......... .......... 627 TGGATATCCA AGGGGGAGTG TTGTAAATTC ATTGTATATC TGGATGATCT ATTAGAGTTA 14667 .......... .......... .......... .......... .......... .......... 627 TCCAACAATT AATTGAATTC ATGTATTAGT GTTTCCAATG TGATAAGATA TCAAGGTGAT 14607 .......... .......... .......... .......... .......... .......... 627 ATGAACCTTG ATGATAACTA TGTAAAATTT CAAGGTGACC TTTGAATATG ATATCTCAAC 14547 .......... .......... .......... .......... .......... .......... 627 ACTATAAATA GAGATATCAT TCACCATTGT ATGATATACT TGAATAAGAA AATTATCTCC 14487 .......... .......... .......... .......... .......... .......... 627 TCTATCTTAT ACTTTTTGTC TTTTTCATCT TATATTCTGA GCTTTCTTTG CAACAAAAAT 14427 .......... .......... .......... .......... .......... .......... 
627 ATATTTATTT ATTTTTCCAT TGAGTAAAGT AATATGAACT TTTAGAATTT AAAGGAATAT 14367 .......... .......... .......... .......... .......... .......... 627 TAATGACTTA GTTTTATCTG GGATAAAATT TATTCACACA AAATATTTAT ATCAATTCAA 14307 .......... .......... .......... .......... .......... .......... 627 TACATATTTG TCATATGTTT AAATTTTATA ACATAAATTT TACTTCCTTA AATTACATTG 14247 .......... .......... .......... .......... .......... .......... 627 TAATAGAAAT TAAAGGGAAA AAGCACAAGT ACCACCTAGA TTATGACCGA ATTTCAGAAA 14187 .......... .......... .......... .......... .......... .......... 627 GACACCTTAA TTAAAGTAAG ATTTTATTAC CCCTGAACTT CTATTTTTGT AATAATATAC 14127 .......... .......... .......... .......... .......... .......... 627 ATCTTTTGTC TTACATGACA CTCTTCGTGA CCCCACGCAA TTGAGGCGCT AGAAAATATT 14067 .......... .......... .......... .......... .......... .......... 627 TGGATGTCAC GTAAGCCAAA AAGGTGCATA AATTACAAAA AAAAAAAAAA 14017 |||| |||||||||| .......... .......... .......... ......AAAA AAAAAAAAAA 641 hqPGS_C06HBa0054K13.1-12-_SGN-U339612+ (18321 17757) ******************************************************************************** EST sequence 1 -strand 911 n (File: SGN-U328267-) 1 TTTTTTTTTT CTCATAAACA TAAAGTACAC TAATATTATT ATTATAAAAT TCTCCAGTCT 61 TATACAAAAC AACATATGAC TTCACTTGAA CTATAATTAA AGAACAATAA AGGGATAATG 121 CACAAGTACC CCCTCAACCT ATGCCCGAAA TTCCAGAGAC ACACTTATAC TATACTAAGG 181 TCCTATTACC CCCCTGAACT TATTTTATAA GTAATTTTCT ACCCCTTTTT AGCCTACGTG 241 GCACTAGTTT AAAAAAAAAG TCAACAACCA TTGGGCCCAC AAGATAGTGC CACGTAGGTC 301 TAAAAGGGGT AAAAAATTAT TAATAAATAA GTTCAGGGGG TAATAAGATC TTAGTATGGT 361 ATAAGTGTAT CTCTGAGATT TTGGACATAG GCTAAGGGGG TACTTGGACA TTATCCCAAC 421 AATAAATAAA GGATATAACA TGATTCAAAA GACAACACGT GATAACCACT TCTACAACTT 481 GTGCATGATC AATTGAGCAC CATATTGAGG TTGAAGCAAC AATGGGTGAG GAGCATGAGC 541 ATAAGATGGA GAGAGTTCAA ATGCATAATG TTTTAGAATC ATGACCATTG CCATTTTTGC 601 CTCTAACATA GCAAAATTTT GCCCAATACA TATTCTTGGA CCCCAACTAA ATGGAAAAAA 661 TACAACTTGT CCTTTTGTTG CTTTTGATAT TCCTTCACTA 
AATCTCTCTG GCATAAACTC 721 CATTGCATCA TCTCCCCATA TTTCAGTATC ATGATGCACT AACATTGTTG CCAATATGAG 781 TTGGACCCCA GAGGGTAAAC ACAAATCCCC TAACTTTGTT TCTGTATTCA CCATGCGATT 841 AATCGCGTAT ACTGATGTAT ACAACCTTAA GACCTCGTTT AAGATCATTG TAACCACTTT 901 TAGTTGATTC A Predicted gene structure (within gDNA segment 23320 to 14539): Exon 1 21227 20908 ( 320 n); cDNA 106 424 ( 319 n); score: 0.902 Intron 1 20907 19235 (1673 n); Pd: 0.000 (s: 0.82), Pa: 0.000 (s: 0) Exon 2 19234 19205 ( 30 n); cDNA 425 453 ( 29 n); score: 0.767 Intron 2 19204 15411 (3794 n); Pd: 0.000 (s: 0), Pa: 0.733 (s: 0) Exon 3 15410 15404 ( 7 n); cDNA 454 459 ( 6 n); score: 0.714 Intron 3 15403 15309 ( 95 n); Pd: 0.874 (s: 0), Pa: 0.000 (s: 0.66) Exon 4 15308 15265 ( 44 n); cDNA 460 503 ( 44 n); score: 0.659 PPA cDNA 12 1 MATCH C06HBa0054K13.1-12- SGN-U328267- 0.902 401 0.440 C PGS_C06HBa0054K13.1-12-_SGN-U328267- (21227 20908,19234 19205,15410 15404,15308 15265) Alignment (genomic DNA sequence = upper lines): AAAAATGGGA TAATGCACAA GTA-CCCCTC AATCTATGCC CGAAATTTCA GAGACACACT 21169 || || |||| |||||||||| ||| |||||| || ||||||| ||||||| || |||||||||| AATAAAGGGA TAATGCACAA GTACCCCCTC AACCTATGCC CGAAATTCCA GAGACACACT 165 TATACTATAC TAAGGTCCTA TTACCTCCCT GAACTTATTT TATAAGTAAT TTTCTACCCC 21109 |||||||||| |||||||||| ||||| |||| |||||||||| |||||||||| |||||||||| TATACTATAC TAAGGTCCTA TTACCCCCCT GAACTTATTT TATAAGTAAT TTTCTACCCC 225 TTTTTAGCCT ACGTGGCACT GGTTTGGAAC AAAAAGTCAA CCATCGTTGG ACCCACAAGA 21049 |||||||||| |||||||||| |||| || |||||||||| | | | |||| ||||||||| TTTTTAGCCT ACGTGGCACT AGTTT-AAAA AAAAAGTCAA CAACCATTGG GCCCACAAGA 284 TAGTGCCACG TAGGTCGAAA AGGGGTAAAA AATTATTAAT AAAATAATTT CAGGGGGTAA 20989 |||||||||| |||||| ||| |||||||||| |||||||||| |||||| || |||||||||| TAGTGCCACG TAGGTCTAAA AGGGGTAAAA AATTATTAAT -AAATAAGTT CAGGGGGTAA 343 TAGGACCTTA GTATAGTATA AGTGTGTCTC TGAGATTTCG GGTATAGGTT GAGGAGGTAC 20929 || || |||| |||| ||||| ||||| |||| |||||||| | | ||||| | ||| ||||| TAAGATCTTA GTATGGTATA AGTGTATCTC TGAGATTTTG GACATAGGCT 
AAGGGGGTAC 403 TTGGACATTA TCCCTATAAA AATTAAGAAA GAGTTTACCA TGAATATATT CAAATTATAA 20869 |||||||||| |||| | || | TTGGACATTA TCCCAACAAT A......... .......... .......... .......... 424 AAAATTGTGG CCGAGTACCA TTATCTCCGA ATCTATGCCT AGAAACACAT TTATACTATA 20809 .......... .......... .......... .......... .......... .......... 424 TCAAGGTCTT ATTACCCTCT AAACTTATTT TATAAATGAT TTTCAATTTT TTTTTGGCCT 20749 .......... .......... .......... .......... .......... .......... 424 ACGTGGTACT ATCCTGTGGG TCAAACGCGT GTTGACATTT TATTTTAAGC TAATACAACG 20689 .......... .......... .......... .......... .......... .......... 424 TAGGCTGAAA AAAGAATAGA AAATTATTTA TAAAATAAGT TCAGGAGGTA ATAAAATCAT 20629 .......... .......... .......... .......... .......... .......... 424 CTTAATATAG TATTCAATAT ATCTTTGGAA TTTCAAAAAT AGATTGAACG TACTCGCGCA 20569 .......... .......... .......... .......... .......... .......... 424 TTTTACCAAA AAAACTAATC AAAGAACACC AATATTGTTC TAATTGGTGT ACAGATTACA 20509 .......... .......... .......... .......... .......... .......... 424 TTAATATCGG GGAATAAAAG CATCAATGAT GGCATTAAAA CATACAACAC CTCAACATTT 20449 .......... .......... .......... .......... .......... .......... 424 AGTAGACCTA CGATGTTTAT TTATTTCATC AACTTAAATT GACATACTTA ATTTAATTAG 20389 .......... .......... .......... .......... .......... .......... 424 ATTTTATTAC TTTAATTATG CTATAGATTA GTCTTTTGCT TTTTTGCTCT TCCTCTCAAA 20329 .......... .......... .......... .......... .......... .......... 424 TTTAAATTTT CTCATTTAGT TATTTCTCAA TACATGTGGA ATGGACAAAT AATTTAAATA 20269 .......... .......... .......... .......... .......... .......... 424 AATGAATTTT AGTTGGGAAC ATCAAAGGTT GCCTAGTTTT TTTTTAGTTT TTTTTTTTTG 20209 .......... .......... .......... .......... .......... .......... 424 CACAATAAGA ATAACAATAA TAATAAAAAT GTTAATTGAG TAGTTATTTC CTAATTTAAT 20149 .......... .......... .......... .......... .......... .......... 424 TTGTCATTTT TAAGCAATTA ATATTTTGTT AAAAGATTCA AAACTTCTAT TTTAATTGGC 20089 .......... .......... .......... .......... 
.......... .......... 424 TGAAATATAA TAAATATGCA TCTTAAGTTG AACTATAGAC ATATATCATA ATTTTTTGAG 20029 .......... .......... .......... .......... .......... .......... 424 TAAAAGATAT ATTCACTCTA AATCGCAAAA TTAAAAAATT GTTGTAAATC CATTGTATAT 19969 .......... .......... .......... .......... .......... .......... 424 GTAGATGATC CACTAGAGTT ATCCAACAAT TAATTGGATT CATGTATTAG TGTTTCCAAT 19909 .......... .......... .......... .......... .......... .......... 424 GTGATAAGAT ATCAAGGTGA TATGAACCTT GATGATAACT ATGTAAAATT TCAAGGTGAC 19849 .......... .......... .......... .......... .......... .......... 424 CTTTGACTAT GATATCTCAA CACTATAAAT AGAGATATCA TTCACCATTG TATGATATAC 19789 .......... .......... .......... .......... .......... .......... 424 TTGAATAAGA AAATTATCTC CTCTATCTTA TACTTCTTGT CTTTTTCATC TTATATTCTG 19729 .......... .......... .......... .......... .......... .......... 424 AGCTTTCTTT ACAACACGTT ATCAGCACGA AATTGCTACT TGCAAAGGTA CTATATAAAT 19669 .......... .......... .......... .......... .......... .......... 424 ATTTTTATTT AATTCACATA TTCTCTCATA ATTTTCATTA TGTCGAATTT ATCCAAACTT 19609 .......... .......... .......... .......... .......... .......... 424 GAGTTTGTGG CATTAGATAT TTCTGGAAAG AATTATCTTT CATGGGTACT CGATGCTGAG 19549 .......... .......... .......... .......... .......... .......... 424 ATTCACTTGG CTGCTAAAGG TCTTGATGCC ACTATTACTC AGGGAAAAGA AGCATCCAGT 19489 .......... .......... .......... .......... .......... .......... 424 CAAGATAAGG CGAAGGCTAT GATTTTCCTT CGTTATCATC TTGATGAGGG CCTGAAGATT 19429 .......... .......... .......... .......... .......... .......... 424 GAATATCTGA CGGTGAAATA TCCACTTGAA TAGTGGACTG ATTTAAAGGG GAGATATGAC 19369 .......... .......... .......... .......... .......... .......... 424 CACCGAAAGG CAACAGTGTT GCCAAGAGCT CGTTATGAGT GGATGCATTT ATGGTTTCAA 19309 .......... .......... .......... .......... .......... .......... 424 GATTTTAAGA CCGTAATTGA ATACAACAGA GTTGTATTCA GGATAACCTC CCAGTTGAAA 19249 .......... .......... .......... .......... .......... 
.......... 424 TTATGTGGGG AGACTATAAA AGATGAGAAC ATGTTGGAAA AGACACTTAC TACTTTTCAT 19189 ||||| ||| | ||| ||| | ||| |||| .......... ....AATAAA GGAT-ATAAC ATGATTCAAA AGAC...... .......... 453 GCCTCGAATG TGATATTGCA GTAGCAATAT CGTGAAAAGG GTTTTCAGAA ATATTCTGAA 19129 .......... .......... .......... .......... .......... .......... 453 CTAATCTCAT GTCTTTTGGT GGCTGAACAA CATAATGCTC TTTTAATGAA AAATCATGAA 19069 .......... .......... .......... .......... .......... .......... 453 GCTCGTCCCA CTGGAGCTGC TCCATTACCG GAGGCAAATG TGGTGGAAGC ACGTGATCAA 19009 .......... .......... .......... .......... .......... .......... 453 TCTGAAGTAA AAAGAGATGA TCATCGGGGA TATAATAATG TATGGGGACG TGGCAAAGAT 18949 .......... .......... .......... .......... .......... .......... 453 AAAAGACGAT ACACTAATCG TCAAGATGGT GATCATAATA AAAGGGAGAA CAACATGAGT 18889 .......... .......... .......... .......... .......... .......... 453 TCTCAAAATA ACCCCTCAAA AAGTAATTGT CGTCATTGTG GCATGAAAGG CCATTGGAAG 18829 .......... .......... .......... .......... .......... .......... 453 AATGAATGTC GCACACATGA ACATTTTGTA AGGCTCTATC AAAATTCCTT CAAAAAGAAA 18769 .......... .......... .......... .......... .......... .......... 453 GGAAATAAAA GTGGTGCTTC CTCTTCCAAT GCTCGAGCTG AGTCACATCT GACTCTTAAA 18709 .......... .......... .......... .......... .......... .......... 453 GATGGTGATA AGCCGGGAAC ATCTCAGAAA TATGATAAAG ATGTTGAAGA AAATTTGGCT 18649 .......... .......... .......... .......... .......... .......... 453 TTAAGGGATG ATGTTTTTGA TGGCCTTGGT GACATTACTC ATATGGAAGT TGATGACTTC 18589 .......... .......... .......... .......... .......... .......... 453 TTTGGAGATC GAAACTAATG TTTGATCTTT TAGCTGGGGA ATGAAATTTG TTAATATTTT 18529 .......... .......... .......... .......... .......... .......... 453 ACTTATGTAT TTTTAATTAT TATGTTATTC ATGTTGAAGT ATTTAAATTT CAGTTGTTAA 18469 .......... .......... .......... .......... .......... .......... 453 TTTTGTTTCT TCCTCCTTTT GATGTATTTT ATTTTAATGA AAATTAATAG AAATCCCCAG 18409 .......... .......... .......... 
.......... .......... .......... 453 TTGTCAGTTG GATTCAAGAT GAGTAATGGA GATGTATGCC TTCTTGATAG TGCTACAACG 18349 .......... .......... .......... .......... .......... .......... 453 CATACAATAT TAAAAGAAAA GAAATACTTT TCTAATTTGG TTATGAAAAT GGCATATGTC 18289 .......... .......... .......... .......... .......... .......... 453 AACACAATAT CAGGTAGTAC AAAATTAATT GAGGGCTCTG GAAGAGCGAC CTTATTACTA 18229 .......... .......... .......... .......... .......... .......... 453 CCTGGAGGGA CAATATTAAG CATTGATAAT GCATTATATT GTAGTAAGTC TCAAAGAAAC 18169 .......... .......... .......... .......... .......... .......... 453 TTATTAAGTT TCAAAGTTAT TCGCCAAAAT GGCTATCATG TTGAGACGAC TAATGAAGGA 18109 .......... .......... .......... .......... .......... .......... 453 AAGGTTGAAT ACCTTTACAT TACTACAATT AATGTAGAGA AGAAAATTGT GCATGAAAAA 18049 .......... .......... .......... .......... .......... .......... 453 TCACCTGCAT TTTCTTCTGG GTTGTACTAT ACAAGTATAA GTACAGTCGA ATCACATGCC 17989 .......... .......... .......... .......... .......... .......... 453 GTAGTAAACA AAAGGTTTAC TAATTTTAAT GATTTTATTA TTTGGCATGA CCAGTTGGGC 17929 .......... .......... .......... .......... .......... .......... 453 CATCCCAGAT TTAATATGAT GCGCAAAATC ATTGAGAATT CACATGGGCA CACCTTAAAG 17869 .......... .......... .......... .......... .......... .......... 453 AGCCCAAATA TCCTTCAATC AAAGGAATTC TCTTGTGCTG CTTGTTCTCA AGGAAAGTTG 17809 .......... .......... .......... .......... .......... .......... 453 ATCATTAAAC CATCAACAGT TAAAGTTGGA ATTGAATCCC CTGCGTTTCT GGAACGTATA 17749 .......... .......... .......... .......... .......... .......... 453 CAGGATGATA TATGTGGACC AATTCAACCT GCATGTGGAC CATTTAAATA TTATATGGTC 17689 .......... .......... .......... .......... .......... .......... 453 TTGATAGATG TTTCTACAAG ATGGTCACAT GTGTGTTTAT TATCAACTCG CAACATGGCT 17629 .......... .......... .......... .......... .......... .......... 453 TTTGCGAGAT TGCTGGCTCA AATAATAAGA TTGAAAGCAC AATTTCCAGA CTATACAATA 17569 .......... .......... .......... .......... 
.......... .......... 453 AAGACAATCC GTCTAGATAA TGATGGTGAG TTTACATCTC AAGCATTTAA TGATTATTGT 17509 .......... .......... .......... .......... .......... .......... 453 ATGTCTACTG GTATAACAGT TGAACATCCA GTTGCTCATG TTCACACTCA AAACGGTCTA 17449 .......... .......... .......... .......... .......... .......... 453 GCAGAATCAT TGATTAAACG TCTACAATTG ATAGTTAGAC CATTACTAGT GAGAACAAAG 17389 .......... .......... .......... .......... .......... .......... 453 TTATCTGTGT CGATGTGGGG GCATGTTATT TTGCATGCAG CAGCACTTGT GCGCATAAGG 17329 .......... .......... .......... .......... .......... .......... 453 CCGACCAATT ATCATGAATT CTCCCCATTA CAATTAACTT TTGGTCAAGA ACCAAACATT 17269 .......... .......... .......... .......... .......... .......... 453 TCCCATCTCA GAGTTTTTGA ATGTGCGGTG TATGTCCCAA TTGCTCCACC ACAATGCACA 17209 .......... .......... .......... .......... .......... .......... 453 AAGATGGGGC CCCAAAGAAG GTTGGCGATA TATGTTGGGT ATGAATCTCC TTCAATCATA 17149 .......... .......... .......... .......... .......... .......... 453 AAATATTTGG AGCCTATGAC TGGAGATTTA TTTAAGGCAA GATTTGCTGA TTGTCATTTT 17089 .......... .......... .......... .......... .......... .......... 453 GATGAATCAG TATACCCAAC ATTAGGGGGA GAACATAAGT CATTGGGAAA AGAGATAGAT 17029 .......... .......... .......... .......... .......... .......... 453 TGGAATTCAT CATCTCTATC TCATCTGGAT CCTCGAACAA ACCAATGTGA GCAAGAAGTT 16969 .......... .......... .......... .......... .......... .......... 453 CAAAGAATAA TTTATTTGCA GAACATTGCA AATCAGCTAC CAGATGCATT TACTAATCTT 16909 .......... .......... .......... .......... .......... .......... 453 CCAAGGATTA CTAAATCGCA TATTCCAGCT GTTAATGCTC CAGTTTGAGT TGATATCCCG 16849 .......... .......... .......... .......... .......... .......... 453 ACGGGACAAA TAGTTAAGGC AAATGAGTCT AGACCACATT TGAAGCGTGG TAGACCAATT 16789 .......... .......... .......... .......... .......... .......... 453 GGTTCCAAGG ATAAAAATCC TCGAAAGAGA AAAGGAATAA ATGATCAATA TGATCATGGT 16729 .......... .......... .......... .......... .......... 
.......... 453 TGAAAGAAAT TTCTCAAGAT GAGACCCAAG TCATAACACA TGATGATGAG GAGGTTCCGA 16669 .......... .......... .......... .......... .......... .......... 453 ACTTCTGAAA ATAATGAAAT TTCAATGAAT TATGTCTCGA CGAGAAAGTT GTGGAACCGA 16609 .......... .......... .......... .......... .......... .......... 453 AATAATGTTG TGATTGACAA CATATTTGCC TATAATGTTG CTATTGAAAT AATGCAACAA 16549 .......... .......... .......... .......... .......... .......... 453 GATGAAGATT TTGAGCCAAA ATCTGTTCAC GAATGTAGAC AGAGAAATGA TTGGCCAAAA 16489 .......... .......... .......... .......... .......... .......... 453 TGGAAGGATG CAATTCAAGC TGAATTGTCT TCACTAGAAA AATGTGAAGT TTTTGGACCG 16429 .......... .......... .......... .......... .......... .......... 453 ATAATCCAAA CACCTGAAGG TATCAAGCCA GTGGGGTACA AATGGGTTTT TGTACGAAAA 16369 .......... .......... .......... .......... .......... .......... 453 AGAAATGAGA AAGGTGAAGT CATGAGATAT AAGGCCCGAC TCGTTGCTCA AGGTTTTTCT 16309 .......... .......... .......... .......... .......... .......... 453 CAAAGACCTA ACATTTATTA TATGGAGACA TATTCTCCAG TGGTAGATGC AATCACCTTC 16249 .......... .......... .......... .......... .......... .......... 453 AGATATCTCA TAAATCTGGT AGTTCATGAA AAACTTGAAA TGCGTCTAAT CGACGTTGTC 16189 .......... .......... .......... .......... .......... .......... 453 ACAGCCTATC TATATGGCTC ATTGGACCAC AACATTTTCA TGAAAATTCC TAAAGCATTC 16129 .......... .......... .......... .......... .......... .......... 453 AAAGTGCCTG AAGCATACAA AGATTCAAGA GAAACTTGTT CAATAAAACT TCAGAAATCT 16069 .......... .......... .......... .......... .......... .......... 453 CTGTATGGAT TGAAACAATC AGGAAGGATG TGGTACAATC GCCTGAGTGA ATATTTGTTA 16009 .......... .......... .......... .......... .......... .......... 453 AAGAAAGGGT ACAAAAATGA CCCGATTTGT CCCTGCATTT TCATTAAACA GTCGGGGTCT 15949 .......... .......... .......... .......... .......... .......... 453 TAATTTGTAA TAATAGTTGT GTATGTTGAT GGTTTGAACA TCATTGGCAC TCATAAAGAG 15889 .......... .......... .......... .......... .......... .......... 
453 CTTTTAAAAG CTGTTGAGTG TCTGAAAAAA GAATTTGAAA TGAAAGATCT CGGCAAGACA 15829 .......... .......... .......... .......... .......... .......... 453 AAATTTTATC TTGGCCTACA GATTGAGAAT TTGTCAAATG GAATACTTGT TCATCAATCA 15769 .......... .......... .......... .......... .......... .......... 453 ACGTACACAG AAAAGATACT AAAGCGTTTT TACATGGATA ACTCACATCC ATTGAGTACT 15709 .......... .......... .......... .......... .......... .......... 453 CCAATGGTGG TAAGATCGCT TGACATCAAT ACAGATCCAT TTCGACCTCA AGAGAATGAT 15649 .......... .......... .......... .......... .......... .......... 453 GAAGAGCTTC TTGGTGATGA AACTCCTTAT CTTAGTGCGA TCGGGGCACT AATGTACCTT 15589 .......... .......... .......... .......... .......... .......... 453 GCTAACAATA CTCGACCAGA TATCTATTTT GCAGTAAGTC TACTAGCAAG ATTCAGTTCC 15529 .......... .......... .......... .......... .......... .......... 453 TCCCCAACAA AAAGACATTG GAATGGTGTT AAACACATAT TTCGATATCT TCGAGGGACC 15469 .......... .......... .......... .......... .......... .......... 453 ATGCACATGG GTTTATTCTA TTCCAATGAA TCCAAATCAG AACTGATTGG TTACGCAGAT 15409 | .......... .......... .......... .......... .......... ........A- 454 GCACGGTATT TATCAGATCC TCATAAAGCT TGATCACAAA CAGGATATTT GTTTACATGT 15349 |||| ACACG..... .......... .......... .......... .......... .......... 459 GGAGACATGA CAATATCTTG GCGATCAATG AAGCAAACGT TGGTAGCCAC TTCTTCAAAT 15289 || || |||| |||| ||| | .......... .......... .......... .......... 
TGATAACCAC TTCTACAACT 479 CATGCAGAAA TAATAGCCAT CCAT 15265 |||| | ||| | |||| TGTGCATGAT CAATTGAGCA CCAT 503 hqPGS_C06HBa0054K13.1-12-_SGN-U328267- (21227 20908,19234 19205) ******************************************************************************** EST sequence 2 -strand 904 n (File: SGN-U339339-) 1 GCCCAAACCA TCAAAAATCC AGAAGTAGGT TAAAAGGAAT GACAAAAGTA CCAAAACCAA 61 ACGGCTGAAA TGTATCAAAA AGGCTTCCTG GCAGCTAAAA TGAGATTGCG GAAAAGGCAA 121 CACATCACTC ATACGATAAG TTTCATATTG TTTCCTGAAG TTATACTATG GATATTGGAG 181 AAAATTTGAT GCTGCTCGAC AGAAAAGTAA TCAAGAATGT GAGCCATGTA GTCCATGATA 241 ACTAGGCCAA TCCAGCGTAT ATCCATGAGT AGGTCGAGAC TGATGACCCA AGTACCAAAT 301 CAAACACCTG AATATTATCA AGAAGCTTCC TGGCAGCCAA AATGAGATTG CGTGAAATGC 361 AACTCATCAC TCATACGATG AGTCTCGTAT GCGCCCACAT ATTTAATTTC CATGTGCGGC 421 CGTTATGATG GATTTACGAA TACAAGTATC TGTGTGCTTT TAGATTTAAA GAATTTGGAT 481 AGACATATCT GTGGGTCATG CTGGGAATTG GGGAAACCAT GTGCTGCTTT TTAATTTGAT 541 AATCTCGGAT TCTATTTTTT ATTTTACTTT TTAAAGAAAA AAAGTCTGAA TATCCTTTAA 601 TTTTATTTTA TTGTTATAAA TCCATTGAAT TGTGTGGATT GTCCTTTAGA TTTACTCAAG 661 ACTTAATTGT ATTCATGTAT ACATGTTCCC AAGGTGATAA GAATCATTTG GATAACTATG 721 TACCTACTAA TTGGACATTT ATCATGGGGG CTTTTGGGGT GATATCTCAA CTCTATAAAT 781 AGAGATATCA TTCACCTTTT GTAATACATA CTTGAATAAG AAAATTATCT CCTCTAAAAA 841 AAAAAAAAAA AAACTCGAGG GGGGGCCCGG TACCCAATTC GCCCTATAAG GAGTCGATTA 901 CAAC Predicted gene structure (within gDNA segment 28156 to 18484): Exon 1 22678 22663 ( 16 n); cDNA 568 583 ( 16 n); score: 0.938 Intron 1 22662 20804 (1859 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0) Exon 2 20803 20778 ( 26 n); cDNA 584 609 ( 26 n); score: 0.769 Intron 2 20777 19992 ( 786 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.72) Exon 3 19991 19765 ( 227 n); cDNA 610 835 ( 226 n); score: 0.738 MATCH C06HBa0054K13.1-12- SGN-U339339- 0.738 269 0.298 C PGS_C06HBa0054K13.1-12-_SGN-U339339- (22678 22663,20803 20778,19991 19765) Alignment (genomic DNA sequence = upper lines): TTTTTAAAAA AAAAAAACAA AAACAAATGA ACTTTAATAT GTTGAATCTG AATTTGTTGA 
22619 |||||||| | |||||| TTTTTAAAGA AAAAAA.... .......... .......... .......... .......... 583 GTGCAGACAA ATGAGGCCTA AATATTAAGA CAACCTAATG ACTTTTGACT TCTTAATCCT 22559 .......... .......... .......... .......... .......... .......... 583 TTTCATGTCA TGTATACATA TATAGTGTGT GTGATCATGT TTCATTATGT CATTTTATAT 22499 .......... .......... .......... .......... .......... .......... 583 ATAACACAAT TTAATATCTA CTTATATTCT TTCTGTTAGG GGTATTATGT ATTTTAAATG 22439 .......... .......... .......... .......... .......... .......... 583 ATGCAGACTA ATGCATGTAT CTAGACACAC CGATTAAGCA ACACGTTAAA GTTTTCATAT 22379 .......... .......... .......... .......... .......... .......... 583 CCTACTTTGG ATGAGCCCCC ACCCTGCTTT CAAGGGTTCC CTAACTCCTA TCTTTCAGCA 22319 .......... .......... .......... .......... .......... .......... 583 CATTAGATTA TTTCTCTTAA GATGTGTCGA CAGAGACGGA TCTACATGAA TATGAGGGGT 22259 .......... .......... .......... .......... .......... .......... 583 TCATTCGATC CCCCTTTATT GGAAAGGTTA TATAGTATAC ATAGGCTTAA TTCTTTTATT 22199 .......... .......... .......... .......... .......... .......... 583 TATATATACA CTAAGTGATG AACCCTTTGG ACAAGTGCTT GCGACTTCAA CTCCACATAT 22139 .......... .......... .......... .......... .......... .......... 583 AACAATGTTT TTTCTCCATT GTTTCAACTG ATTTTCATTT TTATTTCTAA ATAAATGAAA 22079 .......... .......... .......... .......... .......... .......... 583 GATTTGTTTT GTTGGAAACC TATTGGGACT AATAATATTT TATTAGACAT TAAAAAATAA 22019 .......... .......... .......... .......... .......... .......... 583 ATTTATTTTC TATTTTTATA GAATTAAGTA AAAGTTATAA CGTTTTTTAA GTTTTCAGAA 21959 .......... .......... .......... .......... .......... .......... 583 TTAAAATTAA AAAATACATT CTTTCATATT CATTTTAAGT CAAATGCTTG ATTTATATTT 21899 .......... .......... .......... .......... .......... .......... 583 AATGAAAAAA TAATATAAAT ATAAGTAAGA TTAAATAAAT AAGGATTATT TAAATTTATA 21839 .......... .......... .......... .......... .......... .......... 
583 AATAAATAAG AGTAGTTATA TAAAATTTAT TAATAAATAT GATGGTCGCT ATATTTTTCA 21779 .......... .......... .......... .......... .......... .......... 583 TTCTTGATAA TTATTTTTTC GACACGGATT TTATTGCTAC AATATGATTG GATTACTGAA 21719 .......... .......... .......... .......... .......... .......... 583 TAATGTTTAT ATATCACAAT GACACAGACA AAACTCATAA TTATAGTTTT AAAAAATACA 21659 .......... .......... .......... .......... .......... .......... 583 TTCTCTCGTA CTCAATTTAA TGTTATGTCA AAAATATATT TATTTAATTT TATTGTTTGT 21599 .......... .......... .......... .......... .......... .......... 583 GTTAAGGTAG AGTTGAATTT TTTTAATTAC TTATATCTGA AACTCATTTA TATTTAACCC 21539 .......... .......... .......... .......... .......... .......... 583 GCCGCGTCCG CGTTGCTTCT ACAAATACTA TTAGGATACT TAGTTATTTT TTGAACCTCC 21479 .......... .......... .......... .......... .......... .......... 583 TCGATAAAAA TTCTGCCTCC ATCATTAAAT ATTATTAGCC GTTGTATTGT TGTATGATAA 21419 .......... .......... .......... .......... .......... .......... 583 TTAATACAAA TTATCTATTT TATAAGAAAT TTCCTCTCAA ATGCTTGAAA TTATGTTTTC 21359 .......... .......... .......... .......... .......... .......... 583 CGTTTATGTC TAAAAAAATA AATGTGCACA CTATATTATG TCATGCTATG GATACGTTTG 21299 .......... .......... .......... .......... .......... .......... 583 GCTAACAATG GTCATGAGAT TTTACCACTA TAATGGATGG ACTCAAATTA TAATGTAAAA 21239 .......... .......... .......... .......... .......... .......... 583 TCTTGATATA TAAAAATGGG ATAATGCACA AGTACCCCTC AATCTATGCC CGAAATTTCA 21179 .......... .......... .......... .......... .......... .......... 583 GAGACACACT TATACTATAC TAAGGTCCTA TTACCTCCCT GAACTTATTT TATAAGTAAT 21119 .......... .......... .......... .......... .......... .......... 583 TTTCTACCCC TTTTTAGCCT ACGTGGCACT GGTTTGGAAC AAAAAGTCAA CCATCGTTGG 21059 .......... .......... .......... .......... .......... .......... 583 ACCCACAAGA TAGTGCCACG TAGGTCGAAA AGGGGTAAAA AATTATTAAT AAAATAATTT 20999 .......... .......... .......... .......... .......... .......... 
583 CAGGGGGTAA TAGGACCTTA GTATAGTATA AGTGTGTCTC TGAGATTTCG GGTATAGGTT 20939 .......... .......... .......... .......... .......... .......... 583 GAGGAGGTAC TTGGACATTA TCCCTATAAA AATTAAGAAA GAGTTTACCA TGAATATATT 20879 .......... .......... .......... .......... .......... .......... 583 CAAATTATAA AAAATTGTGG CCGAGTACCA TTATCTCCGA ATCTATGCCT AGAAACACAT 20819 .......... .......... .......... .......... .......... .......... 583 TTATACTATA TCAAGGTCTT ATTACCCTCT AAACTTATTT TATAAATGAT TTTCAATTTT 20759 |||| | || ||| | || |||||| | .......... .....GTCTG AATATCCTTT AATTTTATTT T......... .......... 609 TTTTTGGCCT ACGTGGTACT ATCCTGTGGG TCAAACGCGT GTTGACATTT TATTTTAAGC 20699 .......... .......... .......... .......... .......... .......... 609 TAATACAACG TAGGCTGAAA AAAGAATAGA AAATTATTTA TAAAATAAGT TCAGGAGGTA 20639 .......... .......... .......... .......... .......... .......... 609 ATAAAATCAT CTTAATATAG TATTCAATAT ATCTTTGGAA TTTCAAAAAT AGATTGAACG 20579 .......... .......... .......... .......... .......... .......... 609 TACTCGCGCA TTTTACCAAA AAAACTAATC AAAGAACACC AATATTGTTC TAATTGGTGT 20519 .......... .......... .......... .......... .......... .......... 609 ACAGATTACA TTAATATCGG GGAATAAAAG CATCAATGAT GGCATTAAAA CATACAACAC 20459 .......... .......... .......... .......... .......... .......... 609 CTCAACATTT AGTAGACCTA CGATGTTTAT TTATTTCATC AACTTAAATT GACATACTTA 20399 .......... .......... .......... .......... .......... .......... 609 ATTTAATTAG ATTTTATTAC TTTAATTATG CTATAGATTA GTCTTTTGCT TTTTTGCTCT 20339 .......... .......... .......... .......... .......... .......... 609 TCCTCTCAAA TTTAAATTTT CTCATTTAGT TATTTCTCAA TACATGTGGA ATGGACAAAT 20279 .......... .......... .......... .......... .......... .......... 609 AATTTAAATA AATGAATTTT AGTTGGGAAC ATCAAAGGTT GCCTAGTTTT TTTTTAGTTT 20219 .......... .......... .......... .......... .......... .......... 609 TTTTTTTTTG CACAATAAGA ATAACAATAA TAATAAAAAT GTTAATTGAG TAGTTATTTC 20159 .......... .......... .......... .......... 
.......... .......... 609 CTAATTTAAT TTGTCATTTT TAAGCAATTA ATATTTTGTT AAAAGATTCA AAACTTCTAT 20099 .......... .......... .......... .......... .......... .......... 609 TTTAATTGGC TGAAATATAA TAAATATGCA TCTTAAGTTG AACTATAGAC ATATATCATA 20039 .......... .......... .......... .......... .......... .......... 609 ATTTTTTGAG TAAAAGATAT ATTCACTCTA AATCGCAAAA TTAAAAAATT GTTGTAAATC 19979 ||| ||| |||||| .......... .......... .......... .......... .......ATT GTTATAAATC 622 CATTGTA-TA TGTAGATGAT CCACTAGAGT TATCCAACAA TTAATTGGAT TCATGTATTA 19920 ||||| | | ||| ||| | || |||| | || ||| | ||||||| || |||||||| CATTGAATTG TGTGGATTGT CCTTTAGATT TACTCAAGAC TTAATTGTAT TCATGTATAC 682 GTGTTTCCAA TGTGATAAGA TATCAAGGTG ATATGAACCT TGATGATAAC TATGTAAAAT 19860 |||| |||| ||||||||| |||| | ||| | | | ||| | || | | | ATGTTCCCAA GGTGATAAGA -ATCATTTGG ATAACTATGT ACCTACTAA- T-TGGACATT 739 T-TCAAGGTG ACCTTTGACT ATGATATCTC AACACTATAA ATAGAGATAT CATTCACC-A 19802 | ||| || | | |||| ||||||||| ||| |||||| |||||||||| |||||||| TATCATGGGG GCTTTTG-GG GTGATATCTC AACTCTATAA ATAGAGATAT CATTCACCTT 798 TTGTATGATA TACTTGAATA AGAAAATTAT CTCCTCT 19765 ||||| | | |||||||||| |||||||||| ||||||| TTGTAATACA TACTTGAATA AGAAAATTAT CTCCTCT 835 hqPGS_C06HBa0054K13.1-12-_SGN-U339339- (20803 20778,19991 19765) ******************************************************************************** EST sequence 4 +strand 828 n (File: SGN-U332943+) 1 ATTTTCTTTC TTTTAGAATT CTAGTAAGTT CTTTTCTAAC AAGAATTTTA GTAAGTTCTT 61 TTATAACACG TTATCAGCAC GAAATTGCTA CTTGCAAAGG TACTATATAA ATATTTTTAT 121 TTAATTCACA TATTCTCTCA TAATTTTCAT TATGTCGAAT TTATCCAAAC TTGAGTTTGT 181 GGCATTAGAT ATTTCTGGAA AGAATTATCT TTCATGGGTA CTCGATGCTG AGATTCACTT 241 GGCTGCTAAA GGTCTTGATG CCACTATTAC TCAGGGAAAT GAAGCATCGA GTCAAGATAA 301 GGCGAAGGCT ATGATTTTCC TTCGTCATCA TCTTGATGAG GGCCTGAAGA TTGAATATCT 361 GACAGTGAAA GATCCACTTG AATTGTGGAC TGATTTAAAG GGGAGATATG ACCACCTAAA 421 GGCAACAGTG TTGCCAAGAG CTCGTTATGA GTGGATGCAT TTACGGTTTC AAGATTTTAA 481 GACCGTAATT GAATACAACT CTGCTGTATT CAGGATAACC 
TCCCAGTTGA AATTATGTGG 541 GGAGACTATA AAAGATGAGG ACATGTTGGA AAAGACACTT ACTACTTTTC ATGCCTCGAA 601 TGTGATATTG CAGCAGCAAT ATCGTGAAAA GGGTTTTCAG AAATATTCTG AACTAATCTC 661 ATGTCTTTTG GTGGCTGAGC AACATAATGC TCTTTTAATG AAAAATCATG AAGCTCGTCC 721 CACTGGAGCT GCTCCATTAC CGGAGGCAAA TGCGGTGGAA GCACGTGATC AATCTGAAGT 781 AAAAAGAAAT GATCATCGGG GATATAATAA TGCACGGGGA CGTGGCAA Predicted gene structure (within gDNA segment 20956 to 18199): Exon 1 19716 18953 ( 764 n); cDNA 65 828 ( 764 n); score: 0.975 MATCH C06HBa0054K13.1-12- SGN-U332943+ 0.975 764 0.923 C PGS_C06HBa0054K13.1-12-_SGN-U332943+ (19716 18953) Alignment (genomic DNA sequence = upper lines): AACACGTTAT CAGCACGAAA TTGCTACTTG CAAAGGTACT ATATAAATAT TTTTATTTAA 19657 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AACACGTTAT CAGCACGAAA TTGCTACTTG CAAAGGTACT ATATAAATAT TTTTATTTAA 124 TTCACATATT CTCTCATAAT TTTCATTATG TCGAATTTAT CCAAACTTGA GTTTGTGGCA 19597 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCACATATT CTCTCATAAT TTTCATTATG TCGAATTTAT CCAAACTTGA GTTTGTGGCA 184 TTAGATATTT CTGGAAAGAA TTATCTTTCA TGGGTACTCG ATGCTGAGAT TCACTTGGCT 19537 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTAGATATTT CTGGAAAGAA TTATCTTTCA TGGGTACTCG ATGCTGAGAT TCACTTGGCT 244 GCTAAAGGTC TTGATGCCAC TATTACTCAG GGAAAAGAAG CATCCAGTCA AGATAAGGCG 19477 |||||||||| |||||||||| |||||||||| ||||| |||| |||| ||||| |||||||||| GCTAAAGGTC TTGATGCCAC TATTACTCAG GGAAATGAAG CATCGAGTCA AGATAAGGCG 304 AAGGCTATGA TTTTCCTTCG TTATCATCTT GATGAGGGCC TGAAGATTGA ATATCTGACG 19417 |||||||||| |||||||||| | |||||||| |||||||||| |||||||||| ||||||||| AAGGCTATGA TTTTCCTTCG TCATCATCTT GATGAGGGCC TGAAGATTGA ATATCTGACA 364 GTGAAATATC CACTTGAATA GTGGACTGAT TTAAAGGGGA GATATGACCA CCGAAAGGCA 19357 |||||| ||| ||||||||| |||||||||| |||||||||| |||||||||| || ||||||| GTGAAAGATC CACTTGAATT GTGGACTGAT TTAAAGGGGA GATATGACCA CCTAAAGGCA 424 ACAGTGTTGC CAAGAGCTCG TTATGAGTGG ATGCATTTAT GGTTTCAAGA TTTTAAGACC 19297 |||||||||| |||||||||| |||||||||| 
||||||||| |||||||||| |||||||||| ACAGTGTTGC CAAGAGCTCG TTATGAGTGG ATGCATTTAC GGTTTCAAGA TTTTAAGACC 484 GTAATTGAAT ACAACAGAGT TGTATTCAGG ATAACCTCCC AGTTGAAATT ATGTGGGGAG 19237 |||||||||| ||||| | |||||||||| |||||||||| |||||||||| |||||||||| GTAATTGAAT ACAACTCTGC TGTATTCAGG ATAACCTCCC AGTTGAAATT ATGTGGGGAG 544 ACTATAAAAG ATGAGAACAT GTTGGAAAAG ACACTTACTA CTTTTCATGC CTCGAATGTG 19177 |||||||||| ||||| |||| |||||||||| |||||||||| |||||||||| |||||||||| ACTATAAAAG ATGAGGACAT GTTGGAAAAG ACACTTACTA CTTTTCATGC CTCGAATGTG 604 ATATTGCAGT AGCAATATCG TGAAAAGGGT TTTCAGAAAT ATTCTGAACT AATCTCATGT 19117 ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATATTGCAGC AGCAATATCG TGAAAAGGGT TTTCAGAAAT ATTCTGAACT AATCTCATGT 664 CTTTTGGTGG CTGAACAACA TAATGCTCTT TTAATGAAAA ATCATGAAGC TCGTCCCACT 19057 |||||||||| |||| ||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTTTGGTGG CTGAGCAACA TAATGCTCTT TTAATGAAAA ATCATGAAGC TCGTCCCACT 724 GGAGCTGCTC CATTACCGGA GGCAAATGTG GTGGAAGCAC GTGATCAATC TGAAGTAAAA 18997 |||||||||| |||||||||| |||||||| | |||||||||| |||||||||| |||||||||| GGAGCTGCTC CATTACCGGA GGCAAATGCG GTGGAAGCAC GTGATCAATC TGAAGTAAAA 784 AGAGATGATC ATCGGGGATA TAATAATGTA TGGGGACGTG GCAA 18953 ||| |||||| |||||||||| |||||||| | ||||||||| |||| AGAAATGATC ATCGGGGATA TAATAATGCA CGGGGACGTG GCAA 828 hqPGS_C06HBa0054K13.1-12-_SGN-U332943+ (19716 18953) ******************************************************************************** EST sequence 3 +strand 625 n (File: SGN-U330986+) 1 GATCCCCCGG GCTGCAGGGT GGAATATAAC TGGTGGTAGT GATCTAACAT TATTTGAGGT 61 ACCCGACTAT TTTTTCTCTT GCTCTCTCTC TCTCTTCCGA TACACTTACT AATCATCCTT 121 AGTATGCACC TTTTGTTGAC TGAAAAAGGC AGAAATGTAT TCCATAACGT AGAAAAGATT 181 TTGGCAGTAG AAGCCTTTTA CTTTTTTCAA TGTAGAGATT TTTCTTATTG AAGATGGCAT 241 ATATGAATGT GTGAATTTCC TTGTATGATC GTCTCTTACA AAGAAAAATA AGAGAGAAAC 301 TTTATCTAAT GGTGCATGAG GGTTGTGAAA AGCTATGTTG CTTAGACTCT TCAAAAATGT 361 CGACGGGTGC GTGTCAGATC CTCCAAAAAT AGTGTATTTT TGAAGGATCT GACACGGGTG 421 CGGCAACATT TTTGGAGAGT 
CCAAGCAACT TAGGTGAAAA GATCAAACAA CCTATGCTTC 481 TGGTTCATCC TGTTTCCTTA TCTGCTGTTT CTCAGCATTT ATGTATGCCA ATGTTGTCAT 541 TTTTAAAGTG CGCTTAAGCC TTGAAGCGTG GTACAAAACA TGTTGAGCAC TTTGCCTCGC 601 TTTATGCACG CTTCAGTGTC GTCAT Predicted gene structure (within gDNA segment 27080 to 33919): Exon 1 28029 28044 ( 16 n); cDNA 313 326 ( 14 n); score: 0.688 Intron 1 28045 29647 (1603 n); Pd: 0.000 (s: 0), Pa: 0.928 (s: 0) Exon 2 29648 29653 ( 6 n); cDNA 327 332 ( 6 n); score: 0.500 Intron 2 29654 30999 (1346 n); Pd: 0.738 (s: 0), Pa: 0.000 (s: 0.96) Exon 3 31000 31116 ( 117 n); cDNA 333 453 ( 121 n); score: 0.833 Intron 3 31117 31780 ( 664 n); Pd: 0.000 (s: 0.70), Pa: 0.375 (s: 0) Exon 4 31781 31810 ( 30 n); cDNA 454 482 ( 29 n); score: 0.633 Intron 4 31811 32518 ( 708 n); Pd: 0.000 (s: 0), Pa: 0.987 (s: 0) Exon 5 32519 32528 ( 10 n); cDNA 483 492 ( 10 n); score: 0.700 MATCH C06HBa0054K13.1-12+ SGN-U330986+ 0.833 179 0.286 C PGS_C06HBa0054K13.1-12+_SGN-U330986+ (28029 28044,29648 29653,31000 31116,31781 31810,32519 32528) Alignment (genomic DNA sequence = upper lines): TGGATAAAGG GATTGGATAA AGCGATTTTA TGTTGATGAC AATGAAGGAA ATGGAGACGA 28088 || || ||| | ||| TGCAT-GAGG G-TTGT.... .......... .......... .......... .......... 326 GATTGCACAT GTGCTTTATG CTGATGATAC CCTATTGCAA TGTGAAGTAG ACAGGGTGCA 28148 .......... .......... .......... .......... .......... .......... 326 AGCTCTATAT TTGAGGGTCA TTTTTTCGGT GTCCAAAGTT GTGCCAGGGC TGAAATTTAA 28208 .......... .......... .......... .......... .......... .......... 326 CTTATCCAAA GGTACTGTTT TCTGTATTTA TCCTGCGGAT TCTATTGCTC AGTATGCTGA 28268 .......... .......... .......... .......... .......... .......... 326 GATCCTGCAA TGTAAAGTAG AAGTTTTTCT CACTACACAT TTGGGAATTC CGTTAGGGGT 28328 .......... .......... .......... .......... .......... .......... 326 GGCAATTAGG CGGGTTGGGT CAAATTTAGA CGGGTTATAA TGGGTTAAGA TAACAATTGG 28388 .......... .......... .......... .......... .......... .......... 
326 GTCAAGACCC AACCCAACCC AAAATTGCTC GAGTCAAAAT GGGTTACATT ATGGGTCAAG 28448 .......... .......... .......... .......... .......... .......... 326 ACCCAACACA ACCCAACCCA ATTTTTACCA AGCTTAATTG TTTTATTTGT TCTTTTAAAC 28508 .......... .......... .......... .......... .......... .......... 326 CTTTTTAGTA CTTAATAAAA TCATTATTTT CTTTATTATG GCTATATACA ACATAAGAAA 28568 .......... .......... .......... .......... .......... .......... 326 TTGGAAAAAG AGTTCTATAA AAATATTTTG AGCAGATTTT TCATGAGCCA ATTTGGGTTA 28628 .......... .......... .......... .......... .......... .......... 326 CATATCAACC CATTTTTTAA TGGGTTGGTC TAATGATTGG ACGGGTCAAT TACCCGCCCA 28688 .......... .......... .......... .......... .......... .......... 326 AACATAAACG GTTTGGGGGG GTTGGTCTTG ATTTTGCCAC CCCTAACTCC TGTGGGTGCT 28748 .......... .......... .......... .......... .......... .......... 326 AGAAAGAATG ATGTGGGGTT GCAGGAAGGA TAGTTTTGAT AATTAGCATC ATGGTCTCTC 28808 .......... .......... .......... .......... .......... .......... 326 TACTTATGTT ATGTCACTTT TCCCCTCATC TCCTAAGGTG AAAAAACGAT TGGATTCACT 28868 .......... .......... .......... .......... .......... .......... 326 TAGAAGTAAC TTCATAGGGA AAGGAAACGA AGAGAACAGA TGTTCACCTT GTGGGATGGA 28928 .......... .......... .......... .......... .......... .......... 326 AATTTTTGAT CTCTAGTTAA ATACTGGTGG AAATGCCGAA AGGACAAAAT TTCATCTGGT 28988 .......... .......... .......... .......... .......... .......... 326 TAAATGGGAG TCTGTGAATA GGAGTAAAAT GCAAGAGGAC TTGGAGTCAG GAACTCAAAT 29048 .......... .......... .......... .......... .......... .......... 326 TAGATAATAG AAGTCTTCTG GAAATTGTTG TGGATGTTGA ATGATGGGAC GGATGCTTTG 29108 .......... .......... .......... .......... .......... .......... 326 TGGTAGAGGG TCATTGTTAC TAAATATGGC AAAAAATAGA TATTAAGATC CCAGAATAGT 29168 .......... .......... .......... .......... .......... .......... 326 TGGTTCTCTC TTTGGGTTGC AGCTCGGAAG CAGATCATGA CTCTATGGGG TGATTTTGAA 29228 .......... .......... .......... .......... .......... .......... 
326 ATAGCATTAA GTTGAAATTT GGTGATGAGT GAAAAAACAA GAATTGGACA GATTCCCAAG 29288 .......... .......... .......... .......... .......... .......... 326 CAAGCGGGGC TGAGTTTGAA GAGGATCATT TTGGGGGACA ATGATGAACT TCTTAAGGGT 29348 .......... .......... .......... .......... .......... .......... 326 CAGTTCCTAG CACTTTTAGC CTAGTTATGG ACGAGGAATG CTTAATTGCT GCCTGCTTTT 29408 .......... .......... .......... .......... .......... .......... 326 CTAATTTAGG GGGGGGGGGG TTTAGATCAA GGTGAGGACA GAAGAAACTG TGGATGTAAA 29468 .......... .......... .......... .......... .......... .......... 326 TACCCGTCAG TATTGCATGG ATAGTTTTCT CGAAAGAAGT CAGAGATGTT TTGAAGACAG 29528 .......... .......... .......... .......... .......... .......... 326 AAAGGATTCC ACTTCTTATG TTAAGCCATC TATGTATTCA GAATTTAGTC TTTTGTGTTG 29588 .......... .......... .......... .......... .......... .......... 326 AAAGAGAATG TAACTAGGAT TCTAAAAACA ATGGACTTGT TTGTTTCTTT ATGATGTAGG 29648 | .......... .......... .......... .......... .......... .........G 327 ACTGGGCATG TTCTTTGCTT GTACTGAATA TTAGTACCAT CTTGGTACCT TTATTGATAA 29708 | | AAAAG..... .......... .......... .......... .......... .......... 332 TACTTCTTAC CTTTTAGAAA AAAGTGCTCT CTTTGACATG TTAAGCTTTT AGATGTGATG 29768 .......... .......... .......... .......... .......... .......... 332 TCCACATAAT TCAACTATGA GTAAGTAAGC ATGGAAGTTA AAGGAGATGT TGAATTGTGA 29828 .......... .......... .......... .......... .......... .......... 332 ATAACATATG CATAGGTGAG ACTTTTTCTA AACTTTATTT TACAAGAAGT GATGCTTTGA 29888 .......... .......... .......... .......... .......... .......... 332 AGATAAATCA AGTCCTATCA AGCAGATAAA GTTGATTTTT TTGGGTACTT TTGTATTTTT 29948 .......... .......... .......... .......... .......... .......... 332 GGTGTAAAGG AACATGTGTA GAAAGATCAT GATTTAAGTT TTGATGTTTT AGACTCCTTG 30008 .......... .......... .......... .......... .......... .......... 332 TAGATGATAG AGGAGCTATA AGTTTGGTCT CTTTACAATT CTGTAATCAG GGAGGATTAT 30068 .......... .......... .......... .......... .......... .......... 
332 ACTTTCTTTG TATAGTATTG CTATTTATAT AAGATATGTT AACTGTTTCA CAAAAGAAGT 30128 .......... .......... .......... .......... .......... .......... 332 GATGCTTCTA GTTAAACATT GGGGAGACCG GAAGATTATA GCAAAGCTAT CAAACATTGA 30188 .......... .......... .......... .......... .......... .......... 332 TATATATTTG AACTATGGGC TATTCTGAAA TAAATTAATT ATTAGAGAGC ATTATCTTCT 30248 .......... .......... .......... .......... .......... .......... 332 CTATTTGTTC TCTAAGAAGA AAACACTGAG GATGCTATCC AGTTGGTAGA CTTTTTAGGA 30308 .......... .......... .......... .......... .......... .......... 332 TATCTGTAAT TAGTTTTTTT CCATTTCATT GTTGATATCT TTTTGGAGAT GGCCAGCATA 30368 .......... .......... .......... .......... .......... .......... 332 ACCCTTAATT CTGAGGAATA CGATTACCAG TTTAAAAAAA AAAAGATTAT TCCTCTCATG 30428 .......... .......... .......... .......... .......... .......... 332 TAGGTCATAC GTCTGAATTG CAATCTGAAG CGGAACAATT GCCATCTGTG AAGCCAGAGC 30488 .......... .......... .......... .......... .......... .......... 332 TACTTCAGGT TGAAGCTGAA GTAAACAAGT CTGAAGTGGC TGACAAGGGT CTAGACACTG 30548 .......... .......... .......... .......... .......... .......... 332 AGATATCTGT CTTGGATGAG ATTTTGTCCG TTGAAGCAGA AGGATCAATA TCAAGATTGG 30608 .......... .......... .......... .......... .......... .......... 332 ATGTGGATAA TGATGGTGCA CGTCAAGAGA ATGACGTAAC AGACACATTT CTTGCTTAAC 30668 .......... .......... .......... .......... .......... .......... 332 ACAGTTCTAG ATTTCCTTCA TTTGATAAGT TAAAGTGGAG TTATCCATTT TTTTGAATAC 30728 .......... .......... .......... .......... .......... .......... 332 TGCATCAAGT ATAGTGATCT TTTTTATGCT TTTTTTCTAA ACATCTGATG CAGGGATGGG 30788 .......... .......... .......... .......... .......... .......... 332 CTGTTACTGG TGGCGGTGAA GTAATTGTAG AACGATTTCA TGATCTCATC CCTGATATGG 30848 .......... .......... .......... .......... .......... .......... 332 CTCTTACTTT CCCATTTGAA CTCGACCCCT TTCAAAAGGA GGTATAAATC TATGTTTTAT 30908 .......... .......... .......... .......... .......... .......... 
332 GTTGTGTTTT TTAATGCTGA TGTGGAACTT GGTATCTCTG CCTTATTTAA TCTTATGTCC 30968 .......... .......... .......... .......... .......... .......... 332 CTTAATTGTT GGGTAAATAC ATGCTTTCAA TCTATGTTGC TTAGACTCTT CAAAAATGTC 31028 ||||||||| |||||||||| |||||||||| .......... .......... .......... .CTATGTTGC TTAGACTCTT CAAAAATGTC 361 GACGGGTGTG TGTCGGATCC TCCAAAAAAT AGTGTATTTT T-AAAGATCC GAC----GTG 31083 |||||||| | |||| ||||| ||| |||||| |||||||||| | || |||| ||| ||| GACGGGTGCG TGTCAGATCC TCC-AAAAAT AGTGTATTTT TGAAGGATCT GACACGGGTG 420 CAGCAACATT TTTGGAGAGT CCGAGCAACT TAGCTTTCAA TCATGGGCCC CATGTTTTAG 31143 | |||||||| |||||||||| || ||||||| ||| CGGCAACATT TTTGGAGAGT CCAAGCAACT TAG....... .......... .......... 453 TCGGATGATG TTCCATTGGC ATGAATGGCG AAGTGTTAAT TTTGAACATA ATAGGCTGAG 31203 .......... .......... .......... .......... .......... .......... 453 TTCTGTTTGC TTTTTTATTA ACTTGAAACG ATTTGTTGTG TTTCTTTTTC CTTTTCATTT 31263 .......... .......... .......... .......... .......... .......... 453 CAGGCATATA TATGCCTGTC GAGCTGTTAA AAAATTATGG TCCTTGTGAT TGCATTAATT 31323 .......... .......... .......... .......... .......... .......... 453 GCAGGCCATC TATCATCTTG AAAAAGGGAA CTCTGTCTTT GTTGCTGCTC ATACATCTGC 31383 .......... .......... .......... .......... .......... .......... 453 TGGGAAGACT GTTGTGGCTG AATATGCATT TGCTTTGGCA GCTAAAGTAT GGATTCTCTT 31443 .......... .......... .......... .......... .......... .......... 453 TATATCATGG ATTTTATACT ATTTGACTCT TGTTTTGGTT ATAAAAGAAT ATATTAAAAA 31503 .......... .......... .......... .......... .......... .......... 453 TTTTATATTG ATATCTAAAG TATAATCTTC TTCTTTGCTC ATAAATAAAA TGTCTTCTCT 31563 .......... .......... .......... .......... .......... .......... 453 TTTACATATC CTATTTATAA GTTCCTTTAC AACTTTTTGA GTCAATACTG TCCAAATTTT 31623 .......... .......... .......... .......... .......... .......... 453 CTCGTCTTTA TTTGTCAACA GGTCTGCACC AAATTTATGT TTTTTTATTG TGTGCTGCAG 31683 .......... .......... .......... .......... .......... .......... 
453 CATTGTACCA GGGCTGTATA TACAGCTCCA ATCAAAACAA TCAGCAATCA GAAATACAGG 31743 .......... .......... .......... .......... .......... .......... 453 GATTTTTGTG GGAAGTTTGA TGTTGGCCTT CTCACAGGTG ATATAAGCAT AAGACCTGAG 31803 ||| | | | || | |||| | .......... .......... .......... .......GTG AAAAGATCAA ACAACCT-AT 475 GCCTCTTGTC TTATAATGAC CACTGAGATA TTAAGGTCAA TGCTTTATAG AGGGGCTGAT 31863 || ||| GCTTCTG... .......... .......... .......... .......... .......... 482 ATGATTCGGG ATATAGAATG GGTAATTATT AAATATTTTG CTCTGCTCAT TTCACTTCTA 31923 .......... .......... .......... .......... .......... .......... 482 CAACCTTTCC AGTACTTGAG ATAATTTGGT TGGTTACTGG TATTTTTTTG TTGCGAGTGT 31983 .......... .......... .......... .......... .......... .......... 482 AGGTTATATT TGATGAAGTG CATTATGTCA ATGATGTTGA AAGAGGTGTT GTTTGGGAAG 32043 .......... .......... .......... .......... .......... .......... 482 AAGTTATCAT TATGCTCCCA AGACATATCA ACTTTGTCCT CCTTTCAGCT ACGGTGGGTC 32103 .......... .......... .......... .......... .......... .......... 482 TTGATATTTT AGTCACTATA CTTTGTGCTT TGGATCAGTA CTCTTAATGA AGGTATGAGA 32163 .......... .......... .......... .......... .......... .......... 482 GTTAAAGACA CAACAAAAAG ATGTGTAAGG ATGCATGAGA CAAATGTCTT GCACCTGCAC 32223 .......... .......... .......... .......... .......... .......... 482 TTTCTTATAC AAACTTTTCT ATGTATAACT TCACATTTGA TCAACACAGG ACACTGATAT 32283 .......... .......... .......... .......... .......... .......... 482 GTAGCGTAGA ACAACTTCTA ACTATTAAGC TTAGCAAATC TAACATTCTA CCTGAAGTCA 32343 .......... .......... .......... .......... .......... .......... 482 GTGCCCCCTG GTCTAATGTC GATGCAACTT ATTCGATGAT GGACGTGGTA AATATGTTTG 32403 .......... .......... .......... .......... .......... .......... 482 GATAATCATT TGACATTATG AATCTAGTGA CAATGTCTCT TGAGAGTATC TTTTCTGTGA 32463 .......... .......... .......... .......... .......... .......... 482 GAATCTTGTA TGTATTCGAT TCTCTCTTTA AACTCTGAAC TTGATGCAAT ATCAGGTGCA 32523 || || .......... .......... .......... 
.......... .......... .....GTTCA 487 GCTTG 32528 | || TCCTG 492 hqPGS_C06HBa0054K13.1-12+_SGN-U330986+ (31000 31116) Total number of EST alignments reported: 8 ________________________________________________________________________________ Predicted gene locations (3) in segment 1 to 43325: PGL 1 (+ strand): 14679 16562 AGS-1 (14679 14729,14789 16562) SCR (e 0.745 d 0.000 a 0.000,e 0.966) Exon 1 14679 14729 ( 51 n); score: 0.745 Intron 1 14730 14788 ( 59 n); Pd: 0.000 Pa: 0.000 Exon 2 14789 16562 (1774 n); score: 0.966 PGS (14679 14729,14789 16562) SGN-U318979- PGS (15220 16091) SGN-U336683- 3-phase translation of AGS-1 (+strand): . . . . . . : 14679 ATCATCCAGATATACAATGAATTTACAACACTCCCCCTTGGATATCCATAG : GAAAAAGAG I I Q I Y N E F T T L P L G Y P - : E K E S S R Y T M N L Q H S P L D I H R : K K S H P D I Q - I Y N T P P W I S I : G K R . . . . . . 14798 TACACATATCTCATAATACGCTTTGAATGTTGCCTCGTTAAAAACCTTACCATGAAAACC Y T Y L I I R F E C C L V K N L T M K T T H I S - Y A L N V A S L K T L P - K P V H I S H N T L - M L P R - K P Y H E N . . . . . . 14858 CAACTTGGGACAAAACCATAGTTAAGGAAAAGAGTACAACGCGTATTTCGCTCCCCCTGA Q L G T K P - L R K R V Q R V F R S P - N L G Q N H S - G K E Y N A Y F A P P D P T W D K T I V K E K S T T R I S L P L . . . . . . 14918 TGAAAACTTTACTTGATATCTCGGAGACGGCGCATTCCAATCTTGTATCTCAACTTCTCA - K L Y L I S R R R R I P I L Y L N F S E N F T - Y L G D G A F Q S C I S T S Q M K T L L D I S E T A H S N L V S Q L L . . . . . . 14978 AATGTTGATGTTGGCAATGCCTTAGTGAATAAATCAGCAAGATTATCACCTGAACGAATT N V D V G N A L V N K S A R L S P E R I M L M L A M P - - I N Q Q D Y H L N E F K C - C W Q C L S E - I S K I I T - T N . . . . . . 15038 TGTTGAACTTCTATCTCACCATTTTGTTGAAGATCATACGTGAAAAAGAACTTTGGTGAG C - T S I S P F C - R S Y V K K N F G E V E L L S H H F V E D H T - K R T L V R L L N F Y L T I L L K I I R E K E L W - . . . . . . 
15098 ATATGCTTTGTCCGGTCTCCTTTGATGTATCCTCCTTTCAATTGAACTATACATGCAGCA I C F V R S P L M Y P P F N - T I H A A Y A L S G L L - C I L L S I E L Y M Q H D M L C P V S F D V S S F Q L N Y T C S . . . . . . 15158 TTATCTTCGTACATTGTGGTTGGTATATTCTTTTTCAAAGAAAAACCACACATTTTCTGA L S S Y I V V G I F F F K E K P H I F - Y L R T L W L V Y S F S K K N H T F S E I I F V H C G W Y I L F Q R K T T H F L . . . . . . 15218 ATATGATGGGTCATTGATCTCAACCAGACGCACTCACGACTTGCTTCATGGATGGCTATT I - W V I D L N Q T H S R L A S W M A I Y D G S L I S T R R T H D L L H G W L L N M M G H - S Q P D A L T T C F M D G Y . . . . . . 15278 ATTTCTGCATGATTTGAAGAAGTGGCTACCAACGTTTGCTTCATTGATCGCCAAGATATT I S A - F E E V A T N V C F I D R Q D I F L H D L K K W L P T F A S L I A K I L Y F C M I - R S G Y Q R L L H - S P R Y . . . . . . 15338 GTCATGTCTCCACATGTAAACAAATATCCTGTTTGTGATCAAGCTTTATGAGGATCTGAT V M S P H V N K Y P V C D Q A L - G S D S C L H M - T N I L F V I K L Y E D L I C H V S T C K Q I S C L - S S F M R I - . . . . . . 15398 AAATACCGTGCATCTGCGTAACCAATCAGTTCTGATTTGGATTCATTGGAATAGAATAAA K Y R A S A - P I S S D L D S L E - N K N T V H L R N Q S V L I W I H W N R I N - I P C I C V T N Q F - F G F I G I E - . . . . . . 15458 CCCATGTGCATGGTCCCTCGAAGATATCGAAATATGTGTTTAACACCATTCCAATGTCTT P M C M V P R R Y R N M C L T P F Q C L P C A W S L E D I E I C V - H H S N V F T H V H G P S K I S K Y V F N T I P M S . . . . . . 15518 TTTGTTGGGGAGGAACTGAATCTTGCTAGTAGACTTACTGCAAAATAGATATCTGGTCGA F V G E E L N L A S R L T A K - I S G R L L G R N - I L L V D L L Q N R Y L V E F C W G G T E S C - - T Y C K I D I W S . . . . . . 15578 GTATTGTTAGCAAGGTACATTAGTGCCCCGATCGCACTAAGATAAGGAGTTTCATCACCA V L L A R Y I S A P I A L R - G V S S P Y C - Q G T L V P R S H - D K E F H H Q S I V S K V H - C P D R T K I R S F I T . . . . . . 15638 AGAAGCTCTTCATCATTCTCTTGAGGTCGAAATGGATCTGTATTGATGTCAAGCGATCTT R S S S S F S - G R N G S V L M S S D L E A L H H S L E V E M D L Y - C Q A I L K K L F I I L L R S K W I C I D V K R S . . . . . . 
15698 ACCACCATTGGAGTACTCAATGGATGTGAGTTATCCATGTAAAAACGCTTTAGTATCTTT T T I G V L N G C E L S M - K R F S I F P P L E Y S M D V S Y P C K N A L V S F Y H H W S T Q W M - V I H V K T L - Y L . . . . . . 15758 TCTGTGTACGTTGATTGATGAACAAGTATTCCATTTGACAAATTCTCAATCTGTAGGCCA S V Y V D - - T S I P F D K F S I C R P L C T L I D E Q V F H L T N S Q S V G Q F C V R - L M N K Y S I - Q I L N L - A . . . . . . 15818 AGATAAAATTTTGTCTTGCCGAGATCTTTCATTTCAAATTCTTTTTTCAGACACTCAACA R - N F V L P R S F I S N S F F R H S T D K I L S C R D L S F Q I L F S D T Q Q K I K F C L A E I F H F K F F F Q T L N . . . . . . 15878 GCTTTTAAAAGCTCTTTATGAGTGCCAATGATGTTCAAACCATCAACATACACAACTATT A F K S S L - V P M M F K P S T Y T T I L L K A L Y E C Q - C S N H Q H T Q L L S F - K L F M S A N D V Q T I N I H N Y . . . . . . 15938 ATTACAAATTAAGACCCCGACTGTTTAATGAAAATGCAGGGACAAATCGGGTCATTTTTG I T N - D P D C L M K M Q G Q I G S F L L Q I K T P T V - - K C R D K S G H F C Y Y K L R P R L F N E N A G T N R V I F . . . . . . 15998 TACCCTTTCTTTAACAAATATTCACTCAGGCGATTGTACCACATCCTTCCTGATTGTTTC Y P F F N K Y S L R R L Y H I L P D C F T L S L T N I H S G D C T T S F L I V S V P F L - Q I F T Q A I V P H P S - L F . . . . . . 16058 AATCCATACAGAGATTTCTGAAGTTTTATTGAACAAGTTTCTCTTGAATCTTTGTATGCT N P Y R D F - S F I E Q V S L E S L Y A I H T E I S E V L L N K F L L N L C M L Q S I Q R F L K F Y - T S F S - I F V C . . . . . . 16118 TCAGGCACTTTGAATGCTTTAGGAATTTTCATGAAAATGTTGTGGTCCAATGAGCCATAT S G T L N A L G I F M K M L W S N E P Y Q A L - M L - E F S - K C C G P M S H I F R H F E C F R N F H E N V V V Q - A I . . . . . . 16178 AGATAGGCTGTGACAACGTCGATTAGACGCATTTCAAGTTTTTCATGAACTACCAGATTT R - A V T T S I R R I S S F S - T T R F D R L - Q R R L D A F Q V F H E L P D L - I G C D N V D - T H F K F F M N Y Q I . . . . . . 16238 ATGAGATATCTGAAGGTGATTGCATCTACCACTGGAGAATATGTCTCCATATAATAAATG M R Y L K V I A S T T G E Y V S I - - M - D I - R - L H L P L E N M S P Y N K C Y E I S E G D C I Y H W R I C L H I I N . . . . . . 
16298 TTAGGTCTTTGAGAAAAACCTTGAGCAACGAGTCGGGCCTTATATCTCATGACTTCACCT L G L - E K P - A T S R A L Y L M T S P - V F E K N L E Q R V G P Y I S - L H L V R S L R K T L S N E S G L I S H D F T . . . . . . 16358 TTCTCATTTCTTTTTCGTACAAAAACCCATTTGTACCCCACTGGCTTGATACCTTCAGGT F S F L F R T K T H L Y P T G L I P S G S H F F F V Q K P I C T P L A - Y L Q V F L I S F S Y K N P F V P H W L D T F R . . . . . . 16418 GTTTGGATTATCGGTCCAAAAACTTCACATTTTTCTAGTGAAGACAATTCAGCTTGAATT V W I I G P K T S H F S S E D N S A - I F G L S V Q K L H I F L V K T I Q L E L C L D Y R S K N F T F F - - R Q F S L N . . . . . . 16478 GCATCCTTCCATTTTGGCCAATCATTTCTCTGTCTACATTCGTGAACAGATTTTGGCTCA A S F H F G Q S F L C L H S - T D F G S H P S I L A N H F S V Y I R E Q I L A Q C I L P F W P I I S L S T F V N R F W L . . . 16538 AAATCTTCATCTTGTTGCATTATTT K S S S C C I I N L H L V A L F K I F I L L H Y Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-12+_PGL-1_AGS-1_PPS_1 (16204 16455) (frame '0'; 249 bp, 83 residues) 1 THFKFFMNYQ IYEISEGDCI YHWRICLHII NVRSLRKTLS NESGLISHDF TFLISFSYKN 61 PFVPHWLDTF RCLDYRSKNF TFF- >C06HBa0054K13.1-12+_PGL-1_AGS-1_PPS_2 (15123 15356) (frame '2'; 231 bp, 77 residues) 1 CILLSIELYM QHYLRTLWLV YSFSKKNHTF SEYDGSLIST RRTHDLLHGW LLFLHDLKKW 61 LPTFASLIAK ILSCLHM- >C06HBa0054K13.1-12+_PGL-1_AGS-1_PPS_3 (15684 15908) (frame '2'; 222 bp, 74 residues) 1 CQAILPPLEY SMDVSYPCKN ALVSFLCTLI DEQVFHLTNS QSVGQDKILS CRDLSFQILF 61 SDTQQLLKAL YECQ- PGL 2 (- strand): 21227 17684 AGS-1 (19702 19682,18418 17684) SCR (e 0.905 d 1.000 a 0.899,e 0.886) Exon 1 19702 19682 ( 21 n); score: 0.905 Intron 1 19681 18419 (1263 n); Pd: 1.000 Pa: 0.899 Exon 2 18418 17684 ( 735 n); score: 0.886 PGS (19702 19682,18418 17684) SGN-U339613+ PGS (18321 17757) SGN-U339612+ 3-phase translation of AGS-1 (-strand): . . . : . . . 
19702 ACGAAATTGCTACTTGCAAAG : AAATCCCCAGTTGTCAGTTGGATTCAAGATGAGTAATGG T K L L L A K : K S P V V S W I Q D E - W R N C Y L Q R : N P Q L S V G F K M S N G E I A T C K : E I P S C Q L D S R - V M . . . . . . 18379 AGATGTATGCCTTCTTGATAGTGCTACAACGCATACAATATTAAAAGAAAAGAAATACTT R C M P S - - C Y N A Y N I K R K E I L D V C L L D S A T T H T I L K E K K Y F E M Y A F L I V L Q R I Q Y - K K R N T . . . . . . 18319 TTCTAATTTGGTTATGAAAATGGCATATGTCAACACAATATCAGGTAGTACAAAATTAAT F - F G Y E N G I C Q H N I R - Y K I N S N L V M K M A Y V N T I S G S T K L I F L I W L - K W H M S T Q Y Q V V Q N - . . . . . . 18259 TGAGGGCTCTGGAAGAGCGACCTTATTACTACCTGGAGGGACAATATTAAGCATTGATAA - G L W K S D L I T T W R D N I K H - - E G S G R A T L L L P G G T I L S I D N L R A L E E R P Y Y Y L E G Q Y - A L I . . . . . . 18199 TGCATTATATTGTAGTAAGTCTCAAAGAAACTTATTAAGTTTCAAAGTTATTCGCCAAAA C I I L - - V S K K L I K F Q S Y S P K A L Y C S K S Q R N L L S F K V I R Q N M H Y I V V S L K E T Y - V S K L F A K . . . . . . 18139 TGGCTATCATGTTGAGACGACTAATGAAGGAAAGGTTGAATACCTTTACATTACTACAAT W L S C - D D - - R K G - I P L H Y Y N G Y H V E T T N E G K V E Y L Y I T T I M A I M L R R L M K E R L N T F T L L Q . . . . . . 18079 TAATGTAGAGAAGAAAATTGTGCATGAAAAATCACCTGCATTTTCTTCTGGGTTGTACTA - C R E E N C A - K I T C I F F W V V L N V E K K I V H E K S P A F S S G L Y Y L M - R R K L C M K N H L H F L L G C T . . . . . . 18019 TACAAGTATAAGTACAGTCGAATCACATGCCGTAGTAAACAAAAGGTTTACTAATTTTAA Y K Y K Y S R I T C R S K Q K V Y - F - T S I S T V E S H A V V N K R F T N F N I Q V - V Q S N H M P - - T K G L L I L . . . . . . 17959 TGATTTTATTATTTGGCATGACCAGTTGGGCCATCCCAGATTTAATATGATGCGCAAAAT - F Y Y L A - P V G P S Q I - Y D A Q N D F I I W H D Q L G H P R F N M M R K I M I L L F G M T S W A I P D L I - C A K . . . . . . 17899 CATTGAGAATTCACATGGGCACACCTTAAAGAGCCCAAATATCCTTCAATCAAAGGAATT H - E F T W A H L K E P K Y P S I K G I I E N S H G H T L K S P N I L Q S K E F S L R I H M G T P - R A Q I S F N Q R N . . . . . . 
17839 CTCTTGTGCTGCTTGTTCTCAAGGAAAGTTGATCATTAAACCATCAACAGTTAAAGTTGG L L C C L F S R K V D H - T I N S - S W S C A A C S Q G K L I I K P S T V K V G S L V L L V L K E S - S L N H Q Q L K L . . . . . . 17779 AATTGAATCCCCTGCGTTTCTGGAACGTATACAGGATGATATATGTGGACCAATTCAACC N - I P C V S G T Y T G - Y M W T N S T I E S P A F L E R I Q D D I C G P I Q P E L N P L R F W N V Y R M I Y V D Q F N . . . . 17719 TGCATGTGGACCATTTAAATATTATATGGTCTTGAT C M W T I - I L Y G L D A C G P F K Y Y M V L L H V D H L N I I W S - Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-12-_PGL-2_AGS-1_PPS_1 (19701 19682,18418 17686) (frame '2'; 753 bp, 251 residues) 1 RNCYLQRNPQ LSVGFKMSNG DVCLLDSATT HTILKEKKYF SNLVMKMAYV NTISGSTKLI 61 EGSGRATLLL PGGTILSIDN ALYCSKSQRN LLSFKVIRQN GYHVETTNEG KVEYLYITTI 121 NVEKKIVHEK SPAFSSGLYY TSISTVESHA VVNKRFTNFN DFIIWHDQLG HPRFNMMRKI 181 IENSHGHTLK SPNILQSKEF SCAACSQGKL IIKPSTVKVG IESPAFLERI QDDICGPIQP 241 ACGPFKYYMV L AGS-2 (19716 18953) SCR (e 0.975) Exon 1 19716 18953 ( 764 n); score: 0.975 PGS (19716 18953) SGN-U332943+ 3-phase translation of AGS-2 (-strand): . . . . . . 19716 AACACGTTATCAGCACGAAATTGCTACTTGCAAAGGTACTATATAAATATTTTTATTTAA N T L S A R N C Y L Q R Y Y I N I F I - T R Y Q H E I A T C K G T I - I F L F N H V I S T K L L L A K V L Y K Y F Y L . . . . . . 19656 TTCACATATTCTCTCATAATTTTCATTATGTCGAATTTATCCAAACTTGAGTTTGTGGCA F T Y S L I I F I M S N L S K L E F V A S H I L S - F S L C R I Y P N L S L W H I H I F S H N F H Y V E F I Q T - V C G . . . . . . 19596 TTAGATATTTCTGGAAAGAATTATCTTTCATGGGTACTCGATGCTGAGATTCACTTGGCT L D I S G K N Y L S W V L D A E I H L A - I F L E R I I F H G Y S M L R F T W L I R Y F W K E L S F M G T R C - D S L G . . . . . . 19536 GCTAAAGGTCTTGATGCCACTATTACTCAGGGAAAAGAAGCATCCAGTCAAGATAAGGCG A K G L D A T I T Q G K E A S S Q D K A L K V L M P L L L R E K K H P V K I R R C - R S - C H Y Y S G K R S I Q S R - G . . . . . . 
19476 AAGGCTATGATTTTCCTTCGTTATCATCTTGATGAGGGCCTGAAGATTGAATATCTGACG K A M I F L R Y H L D E G L K I E Y L T R L - F S F V I I L M R A - R L N I - R E G Y D F P S L S S - - G P E D - I S D . . . . . . 19416 GTGAAATATCCACTTGAATAGTGGACTGATTTAAAGGGGAGATATGACCACCGAAAGGCA V K Y P L E - W T D L K G R Y D H R K A - N I H L N S G L I - R G D M T T E R Q G E I S T - I V D - F K G E I - P P K G . . . . . . 19356 ACAGTGTTGCCAAGAGCTCGTTATGAGTGGATGCATTTATGGTTTCAAGATTTTAAGACC T V L P R A R Y E W M H L W F Q D F K T Q C C Q E L V M S G C I Y G F K I L R P N S V A K S S L - V D A F M V S R F - D . . . . . . 19296 GTAATTGAATACAACAGAGTTGTATTCAGGATAACCTCCCAGTTGAAATTATGTGGGGAG V I E Y N R V V F R I T S Q L K L C G E - L N T T E L Y S G - P P S - N Y V G R R N - I Q Q S C I Q D N L P V E I M W G . . . . . . 19236 ACTATAAAAGATGAGAACATGTTGGAAAAGACACTTACTACTTTTCATGCCTCGAATGTG T I K D E N M L E K T L T T F H A S N V L - K M R T C W K R H L L L F M P R M - D Y K R - E H V G K D T Y Y F S C L E C . . . . . . 19176 ATATTGCAGTAGCAATATCGTGAAAAGGGTTTTCAGAAATATTCTGAACTAATCTCATGT I L Q - Q Y R E K G F Q K Y S E L I S C Y C S S N I V K R V F R N I L N - S H V D I A V A I S - K G F S E I F - T N L M . . . . . . 19116 CTTTTGGTGGCTGAACAACATAATGCTCTTTTAATGAAAAATCATGAAGCTCGTCCCACT L L V A E Q H N A L L M K N H E A R P T F W W L N N I M L F - - K I M K L V P L S F G G - T T - C S F N E K S - S S S H . . . . . . 19056 GGAGCTGCTCCATTACCGGAGGCAAATGTGGTGGAAGCACGTGATCAATCTGAAGTAAAA G A A P L P E A N V V E A R D Q S E V K E L L H Y R R Q M W W K H V I N L K - K W S C S I T G G K C G G S T - S I - S K . . . . . 
18996 AGAGATGATCATCGGGGATATAATAATGTATGGGGACGTGGCAA R D D H R G Y N N V W G R G E M I I G D I I M Y G D V A K R - S S G I - - C M G T W Q Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-12-_PGL-2_AGS-2_PPS_1 (19656 19396) (frame '1'; 258 bp, 86 residues) 1 FTYSLIIFIM SNLSKLEFVA LDISGKNYLS WVLDAEIHLA AKGLDATITQ GKEASSQDKA 61 KAMIFLRYHL DEGLKIEYLT VKYPLE- >C06HBa0054K13.1-12-_PGL-2_AGS-2_PPS_2 (19395 19165) (frame '1'; 228 bp, 76 residues) 1 WTDLKGRYDH RKATVLPRAR YEWMHLWFQD FKTVIEYNRV VFRITSQLKL CGETIKDENM 61 LEKTLTTFHA SNVILQ- >C06HBa0054K13.1-12-_PGL-2_AGS-2_PPS_3 (19164 18955) (frame '1'; 210 bp, 70 residues) 1 QYREKGFQKY SELISCLLVA EQHNALLMKN HEARPTGAAP LPEANVVEAR DQSEVKRDDH 61 RGYNNVWGRG 3-phase translation of AGS-2 (+strand): . . . . . . 18953 TTGCCACGTCCCCATACATTATTATATCCCCGATGATCATCTCTTTTTACTTCAGATTGA L P R P H T L L Y P R - S S L F T S D - C H V P I H Y Y I P D D H L F L L Q I D A T S P Y I I I S P M I I S F Y F R L . . . . . . 19013 TCACGTGCTTCCACCACATTTGCCTCCGGTAATGGAGCAGCTCCAGTGGGACGAGCTTCA S R A S T T F A S G N G A A P V G R A S H V L P P H L P P V M E Q L Q W D E L H I T C F H H I C L R - W S S S S G T S F . . . . . . 19073 TGATTTTTCATTAAAAGAGCATTATGTTGTTCAGCCACCAAAAGACATGAGATTAGTTCA - F F I K R A L C C S A T K R H E I S S D F S L K E H Y V V Q P P K D M R L V Q M I F H - K S I M L F S H Q K T - D - F . . . . . . 19133 GAATATTTCTGAAAACCCTTTTCACGATATTGCTACTGCAATATCACATTCGAGGCATGA E Y F - K P F S R Y C Y C N I T F E A - N I S E N P F H D I A T A I S H S R H E R I F L K T L F T I L L L Q Y H I R G M . . . . . . 19193 AAAGTAGTAAGTGTCTTTTCCAACATGTTCTCATCTTTTATAGTCTCCCCACATAATTTC K V V S V F S N M F S S F I V S P H N F K - - V S F P T C S H L L - S P H I I S K S S K C L F Q H V L I F Y S L P T - F . . . . . . 19253 AACTGGGAGGTTATCCTGAATACAACTCTGTTGTATTCAATTACGGTCTTAAAATCTTGA N W E V I L N T T L L Y S I T V L K S - T G R L S - I Q L C C I Q L R S - N L E Q L G G Y P E Y N S V V F N Y G L K I L . . . . . . 
19313 AACCATAAATGCATCCACTCATAACGAGCTCTTGGCAACACTGTTGCCTTTCGGTGGTCA N H K C I H S - R A L G N T V A F R W S T I N A S T H N E L L A T L L P F G G H K P - M H P L I T S S W Q H C C L S V V . . . . . . 19373 TATCTCCCCTTTAAATCAGTCCACTATTCAAGTGGATATTTCACCGTCAGATATTCAATC Y L P F K S V H Y S S G Y F T V R Y S I I S P L N Q S T I Q V D I S P S D I Q S I S P L - I S P L F K W I F H R Q I F N . . . . . . 19433 TTCAGGCCCTCATCAAGATGATAACGAAGGAAAATCATAGCCTTCGCCTTATCTTGACTG F R P S S R - - R R K I I A F A L S - L S G P H Q D D N E G K S - P S P Y L D W L Q A L I K M I T K E N H S L R L I L T . . . . . . 19493 GATGCTTCTTTTCCCTGAGTAATAGTGGCATCAAGACCTTTAGCAGCCAAGTGAATCTCA D A S F P - V I V A S R P L A A K - I S M L L F P E - - W H Q D L - Q P S E S Q G C F F S L S N S G I K T F S S Q V N L . . . . . . 19553 GCATCGAGTACCCATGAAAGATAATTCTTTCCAGAAATATCTAATGCCACAAACTCAAGT A S S T H E R - F F P E I S N A T N S S H R V P M K D N S F Q K Y L M P Q T Q V S I E Y P - K I I L S R N I - C H K L K . . . . . . 19613 TTGGATAAATTCGACATAATGAAAATTATGAGAGAATATGTGAATTAAATAAAAATATTT L D K F D I M K I M R E Y V N - I K I F W I N S T - - K L - E N M - I K - K Y L F G - I R H N E N Y E R I C E L N K N I . . . . . 19673 ATATAGTACCTTTGCAAGTAGCAATTTCGTGCTGATAACGTGTT I - Y L C K - Q F R A D N V Y S T F A S S N F V L I T C Y I V P L Q V A I S C - - R V Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-12+_PGL-2_AGS-2_PPS_1 (18954 19199) (frame '2'; 243 bp, 81 residues) 1 CHVPIHYYIP DDHLFLLQID HVLPPHLPPV MEQLQWDELH DFSLKEHYVV QPPKDMRLVQ 61 NISENPFHDI ATAISHSRHE K- AGS-3 (21227 20908,19234 19205) SCR (e 0.902 d 0.000 a 0.000,e 0.767) Exon 1 21227 20908 ( 320 n); score: 0.902 Intron 1 20907 19235 (1673 n); Pd: 0.000 Pa: 0.000 Exon 2 19234 19205 ( 30 n); score: 0.767 PGS (21227 20908,19234 19205) SGN-U328267- 3-phase translation of AGS-3 (-strand): . . . . . . 
21227 AAAAATGGGATAATGCACAAGTACCCCTCAATCTATGCCCGAAATTTCAGAGACACACTT K N G I M H K Y P S I Y A R N F R D T L K M G - C T S T P Q S M P E I S E T H L K W D N A Q V P L N L C P K F Q R H T . . . . . . 21167 ATACTATACTAAGGTCCTATTACCTCCCTGAACTTATTTTATAAGTAATTTTCTACCCCT I L Y - G P I T S L N L F Y K - F S T P Y Y T K V L L P P - T Y F I S N F L P L Y T I L R S Y Y L P E L I L - V I F Y P . . . . . . 21107 TTTTAGCCTACGTGGCACTGGTTTGGAACAAAAAGTCAACCATCGTTGGACCCACAAGAT F - P T W H W F G T K S Q P S L D P Q D F S L R G T G L E Q K V N H R W T H K I F L A Y V A L V W N K K S T I V G P T R . . . . . . 21047 AGTGCCACGTAGGTCGAAAAGGGGTAAAAAATTATTAATAAAATAATTTCAGGGGGTAAT S A T - V E K G - K I I N K I I S G G N V P R R S K R G K K L L I K - F Q G V I - C H V G R K G V K N Y - - N N F R G - . . . . . . 20987 AGGACCTTAGTATAGTATAAGTGTGTCTCTGAGATTTCGGGTATAGGTTGAGGAGGTACT R T L V - Y K C V S E I S G I G - G G T G P - Y S I S V S L R F R V - V E E V L - D L S I V - V C L - D F G Y R L R R Y . . : . . . 20927 TGGACATTATCCCTATAAAA : TATAAAAGATGAGAACATGTTGGAAAAGAC W T L S L - N : I K D E N M L E K G H Y P Y K : I - K M R T C W K R L D I I P I K : Y K R - E H V G K D Maximal non-overlapping open reading frames (>= 64 codons): none AGS-4 (20803 20778,19991 19765) SCR (e 0.769 d 0.000 a 0.000,e 0.738) Exon 1 20803 20778 ( 26 n); score: 0.769 Intron 1 20777 19992 ( 786 n); Pd: 0.000 Pa: 0.000 Exon 2 19991 19765 ( 227 n); score: 0.738 PGS (20803 20778,19991 19765) SGN-U339339- 3-phase translation of AGS-4 (-strand): . . . : . . . 20803 GTCTTATTACCCTCTAAACTTATTTT : ATTGTTGTAAATCCATTGTATATGTAGATGATCC V L L P S K L I L : L L - I H C I C R - S S Y Y P L N L F : Y C C K S I V Y V D D P L I T L - T Y F : I V V N P L Y M - M I . . . . . . 19957 ACTAGAGTTATCCAACAATTAATTGGATTCATGTATTAGTGTTTCCAATGTGATAAGATA T R V I Q Q L I G F M Y - C F Q C D K I L E L S N N - L D S C I S V S N V I R Y H - S Y P T I N W I H V L V F P M - - D . . . . . . 
19897 TCAAGGTGATATGAACCTTGATGATAACTATGTAAAATTTCAAGGTGACCTTTGACTATG S R - Y E P - - - L C K I S R - P L T M Q G D M N L D D N Y V K F Q G D L - L - I K V I - T L M I T M - N F K V T F D Y . . . . . . 19837 ATATCTCAACACTATAAATAGAGATATCATTCACCATTGTATGATATACTTGAATAAGAA I S Q H Y K - R Y H S P L Y D I L E - E Y L N T I N R D I I H H C M I Y L N K K D I S T L - I E I S F T I V - Y T - I R . . 19777 AATTATCTCCTCT N Y L L I I S S K L S P Maximal non-overlapping open reading frames (>= 64 codons): none PGL 3 (+ strand): 31000 31116 AGS-1 (31000 31116) SCR (e 0.833) Exon 1 31000 31116 ( 117 n); score: 0.833 PGS (31000 31116) SGN-U330986+ 3-phase translation of AGS-1 (+strand): . . . . . . 31000 CTATGTTGCTTAGACTCTTCAAAAATGTCGACGGGTGTGTGTCGGATCCTCCAAAAAATA L C C L D S S K M S T G V C R I L Q K I Y V A - T L Q K C R R V C V G S S K K - M L L R L F K N V D G C V S D P P K N . . . . . . 31060 GTGTATTTTTAAAGATCCGACGTGCAGCAACATTTTTGGAGAGTCCGAGCAACTTAG V Y F - R S D V Q Q H F W R V R A T - C I F K D P T C S N I F G E S E Q L S V F L K I R R A A T F L E S P S N L Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-1 (-strand): . . . . . . 31116 CTAAGTTGCTCGGACTCTCCAAAAATGTTGCTGCACGTCGGATCTTTAAAAATACACTAT L S C S D S P K M L L H V G S L K I H Y - V A R T L Q K C C C T S D L - K Y T I K L L G L S K N V A A R R I F K N T L . . . . . . 31056 TTTTTGGAGGATCCGACACACACCCGTCGACATTTTTGAAGAGTCTAAGCAACATAG F L E D P T H T R R H F - R V - A T - F W R I R H T P V D I F E E S K Q H F F G G S D T H P S T F L K S L S N I Maximal non-overlapping open reading frames (>= 64 codons): none ... finished at: Mon Aug 28 22:25:24 2006 ________________________________________________________________________________ Sequence 13: C06HBa0054K13.1-13, from 1 to 1574, both strands analyzed. ... started at: Mon Aug 28 22:25:24 2006 EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA +strand ... ... 
found all matches, elapsed seconds = 2
... matches indexed, elapsed seconds = 2
HitsTableSize = 0
EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA -strand ...
... found all matches, elapsed seconds = 3
... matches indexed, elapsed seconds = 3
HitsTableSize = 0
No significant EST matches were found.
... finished at: Mon Aug 28 22:25:29 2006
________________________________________________________________________________
Sequence 14: C06HBa0054K13.1-14, from 1 to 7572, both strands analyzed.
... started at: Mon Aug 28 22:25:29 2006
EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA +strand ...
... found all matches, elapsed seconds = 2
... matches indexed, elapsed seconds = 2
HitsTableSize = 2
EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA -strand ...
... found all matches, elapsed seconds = 2
... matches indexed, elapsed seconds = 2
HitsTableSize = 0
********************************************************************************
EST sequence 2 +strand 989 n (File: SGN-U319960+)
1 GACCTACCAC CAACTCCCAA TAATCTCAAA ATATCTTTGA AGGAAAAAAT ATCTACCCCA
61 AAAGATATCT AACTACTTTA CAACATTTGT AGCACATGAT ATTATTACAT TGATGAACCC
121 CTTCAACTAA CATCAAACAA ATAATATTAC CCTCTTTTTT TTTCAAGATT TTTTACAAGA
181 AGAAGAAGAA GAAAATGGAA GTGAATGGAA ATATTAGACC AAATAGAAGT GATGTTCATT
241 TATCAAAAGA GGAAGAAACA AAGATAGAAG AGGAAACAAG AGAGTATTTT GATGGCATTG
301 CACCAAAAAG ACACACTAAA CCTCAAAGAA GTGATTATTC TTCAACTTAT GTTGATCACA
361 TCAATCTCTA CCCTTCTTCT CATGACACAA TTCCCGAAAA TCTCGAATTC CAACGTCTCG
421 AAAATGATCC TCAGAAATTG GTTTACAATG GCAGCCAAGT GACAGAGGAA TTTATTGAAA
481 CAGAATATTA CAAAGATCTT AATTGCATTG ACAAGCAGCA TCACACGACA GGAACAGGAT
541 TTATCAAAGT GGAGAATAAT GAGAACACTT TTAATATAGG AGCTGATTAT ACTACTGATC
601 CAAGCCATGT TTACAAGGGG AATCCAGCTA CTAATGATTG GATTCCTTCT GCTGTTGATG
661 AGGTTAATTT TATCTCAGGA AAACCACACA GAAGTGATAA CTGAAATTGA
GTCTTTGATA 721 AAGACAGTAT CTGCTTATTT TGTATTTCTG TGAAGCTTGG CTTCAACTTG TGTCTCTTTT 781 GTTTGGTTGT ATTTGAATAA CATGCTATAT GAATTATTTC AAACGATTTT AATGAAGAAT 841 TTATATTAGC CAAATATCAG CTAAGGGGGT GTGATGGAAT GGTTCGAATA ACCTAAACCA 901 TTAACCAGAG GTCTCGAATT TAAACTGTTG TGAACGAAAA ATAGCATTAT AAAGGATATC 961 CTTTTCTTTC TGTGTGAAAA AAAAAAAAA Predicted gene structure (within gDNA segment 1 to 4100): Exon 1 70 502 ( 433 n); cDNA 2 434 ( 433 n); score: 0.993 Intron 1 503 1688 (1186 n); Pd: 0.995 (s: 1.00), Pa: 1.000 (s: 1.00) Exon 2 1689 1781 ( 93 n); cDNA 435 527 ( 93 n); score: 1.000 Intron 2 1782 2509 ( 728 n); Pd: 0.978 (s: 1.00), Pa: 0.994 (s: 1.00) Exon 3 2510 2644 ( 135 n); cDNA 528 662 ( 135 n); score: 1.000 Intron 3 2645 2974 ( 330 n); Pd: 1.000 (s: 1.00), Pa: 0.998 (s: 1.00) Exon 4 2975 3298 ( 324 n); cDNA 663 986 ( 324 n); score: 0.985 MATCH C06HBa0054K13.1-14+ SGN-U319960+ 0.992 985 0.996 C PGS_C06HBa0054K13.1-14+_SGN-U319960+ (70 502,1689 1781,2510 2644,2975 3298) Alignment (genomic DNA sequence = upper lines): ACCTACCACC AACTTCCAAA ACTCTCAAAA TATCTTTGAA GGAAAAAATA TCTACCCCAA 129 |||||||||| |||| |||| | |||||||| |||||||||| |||||||||| |||||||||| ACCTACCACC AACTCCCAAT AATCTCAAAA TATCTTTGAA GGAAAAAATA TCTACCCCAA 61 AAGATATCTA ACTACTTTAC AACATTTGTA GCACATGATA TTATTACATT GATGAACCCC 189 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAGATATCTA ACTACTTTAC AACATTTGTA GCACATGATA TTATTACATT GATGAACCCC 121 TTCAACTAAC ATCAAACAAA TAATATTACC CTCTTTTTTT TTCAAGATTT TTTACAAGAA 249 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCAACTAAC ATCAAACAAA TAATATTACC CTCTTTTTTT TTCAAGATTT TTTACAAGAA 181 GAAGAAGAAG AAAATGGAAG TGAATGGAAA TATTAGACCA AATAGAAGTG ATGTTCATTT 309 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAAGAAGAAG AAAATGGAAG TGAATGGAAA TATTAGACCA AATAGAAGTG ATGTTCATTT 241 ATCAAAAGAG GAAGAAACAA AGATAGAAGA GGAAACAAGA GAGTATTTTG ATGGCATTGC 369 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATCAAAAGAG GAAGAAACAA 
AGATAGAAGA GGAAACAAGA GAGTATTTTG ATGGCATTGC 301 ACCAAAAAGA CACACTAAAC CTCAAAGAAG TGATTATTCT TCAACTTATG TTGATCACAT 429 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACCAAAAAGA CACACTAAAC CTCAAAGAAG TGATTATTCT TCAACTTATG TTGATCACAT 361 CAATCTCTAC CCTTCTTCTC ATGACACAAT TCCCGAAAAT CTCGAATTCC AACGTCTCGA 489 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAATCTCTAC CCTTCTTCTC ATGACACAAT TCCCGAAAAT CTCGAATTCC AACGTCTCGA 421 AAATGATCCT CAGGTCTACC TCCTTCTCTG CATATGTCTG TTTTGTTTTA ATCAGTAGAG 549 |||||||||| ||| AAATGATCCT CAG....... .......... .......... .......... .......... 434 GCGGATTCAG GATTTGAACT ATTAATATAT TTAAACTCCA TGCCAAGCAT TGAATCATAA 609 .......... .......... .......... .......... .......... .......... 434 TAGTTTTTGT ATTAGTAGGT TCCGAGTAGA TTATTTGTAC ATATTAGATA TACACATATG 669 .......... .......... .......... .......... .......... .......... 434 ATGTTTGAGC TAAAGTTAAT GAATTTTGTC CAACTCTTAA CTATGCCTTG AAAATGATTT 729 .......... .......... .......... .......... .......... .......... 434 TCATGTTAAT TAAGCTCCCC CCCACCCCCC ACCCCCACCC CTCATTTTTC ACTTTTGGGG 789 .......... .......... .......... .......... .......... .......... 434 TTTCTCCTTG TTAGTGAATA ATATATCAGT AGAAAATTTA GATTTACAAT TTCTAGTGAT 849 .......... .......... .......... .......... .......... .......... 434 ACAGGAACTA CTTCCCACCA AAATAGTGAG AAACTTCATA AGTTTTTCAT AGGGATAAGG 909 .......... .......... .......... .......... .......... .......... 434 TGCAATTAGC CTTAGACTAT ACCCGAAATC CCAAAAACAT ATCTTAACTA AACTAAGATC 969 .......... .......... .......... .......... .......... .......... 434 ATATTATGAT GAACTTGTTT TTTTGTATTT TTGTACACCT TTTTGTCTTA CGTGGTATTC 1029 .......... .......... .......... .......... .......... .......... 434 AAATATCTCT CATGTGTCTC AACTGCGTGA AGTCACGGAG TATACCAAGT AAGTTAAAAG 1089 .......... .......... .......... .......... .......... .......... 434 GAGTATAAAA TTACAAAAAT AAAATGAGTA CAGAGGTAAT AGGACTTAAG TTTAGTTAAA 1149 .......... .......... .......... .......... 
.......... .......... 434 AGTGTAGTAC TTGTACATGC CCTTTTTTCA TATTAAAACA TTACTATTTT ATTCTTTTCT 1209 .......... .......... .......... .......... .......... .......... 434 CGCTGATTTA ATCAAAATAT TTGTGGTCTT AGCCACTTAA CAATGACTTT TACTAGCTTG 1269 .......... .......... .......... .......... .......... .......... 434 GAATTGAAGT GTAGTTGATC CATTTGAATC ACACTTTTGT TGATCCAATG GCTCAGAGAT 1329 .......... .......... .......... .......... .......... .......... 434 AGCACTTCGC TTTAAGTGTT GAGTCATGCA CAATTTTTTT TCTCGATGTA TCAAAGAGTA 1389 .......... .......... .......... .......... .......... .......... 434 CCTATGAACT ATGTTCCGAT TTACTCATAA GGAAGAAAAC GGGGGACGGG GTTGGGGACC 1449 .......... .......... .......... .......... .......... .......... 434 GGACTATATG ATGTGATAGT AAACAGGAAC CATTATGTAA AAGACTTTGG TATCTATGTT 1509 .......... .......... .......... .......... .......... .......... 434 TGGTTGGATT CTTCCAAAAA CATCGCCAGG TGTGTGTCGA ATCCTACAAA ATGTATGTAT 1569 .......... .......... .......... .......... .......... .......... 434 TTTTGAAGGA TTTGACACGG AAACGGAAAC ATTTTGGAGG GTCTAAGTAA CATAGTCGCG 1629 .......... .......... .......... .......... .......... .......... 434 AATTATAAAT CACATTAATG TCAATCAGAT TCATATATCT TTTGGTTTTG GTTTTGCAGA 1689 | .......... .......... .......... .......... .......... .........A 435 AATTGGTTTA CAATGGCAGC CAAGTGACAG AGGAATTTAT TGAAACAGAA TATTACAAAG 1749 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AATTGGTTTA CAATGGCAGC CAAGTGACAG AGGAATTTAT TGAAACAGAA TATTACAAAG 495 ATCTTAATTG CATTGACAAG CAGCATCACA CGGTTACTAC TCTCACTCAC TCTTATAACA 1809 |||||||||| |||||||||| |||||||||| || ATCTTAATTG CATTGACAAG CAGCATCACA CG........ .......... .......... 527 CTCTTATTAC CTAAAAAATC CAATTACAGT CGAACCTCTC TATGACGACA TCGTTTGTCT 1869 .......... .......... .......... .......... .......... .......... 527 GTATAATTTT TGGTTGCTAA ACTAAACATG TTGTTATAAA GAGCATAAAA GAGTTAATAG 1929 .......... .......... .......... .......... .......... .......... 
527 TAAAAAAATA CACTTACACA CTTTTTCGCG GTTTCATACG CCAACTATCA TTGTTCTCTT 1989 .......... .......... .......... .......... .......... .......... 527 TATCTAAACT ATCACTTTTT TTTGTATTAA AGCACACATC AATTCTGAGT TGGCCAAATA 2049 .......... .......... .......... .......... .......... .......... 527 TTTGTCCACA ATCAACGATA GGCATGTGTA GGCCAACTCA GCATCGAGAT ATGTTTTAAT 2109 .......... .......... .......... .......... .......... .......... 527 ACAAAGGTAA TGATACACGG TTGAGATGTG AAACTCGCGA GAAATTGATA GATTAGGTAG 2169 .......... .......... .......... .......... .......... .......... 527 GTAAATAGAA GAATAGATAT TTGAGGTGTG AAACACGCGA AAAAATGATG TTTTGTGTGT 2229 .......... .......... .......... .......... .......... .......... 527 GCTTTTTTAC CATTAACTCA ACATGTATTA TAATGTAATG TGAAAACTCG GTTCTGAAGA 2289 .......... .......... .......... .......... .......... .......... 527 AAACTTAGTT GTTAGAGTGA AATGTTGTTA TTGAGGATGG ATGTTACAGA GAGGTTTGAT 2349 .......... .......... .......... .......... .......... .......... 527 TGTGTTCTTT TAGTGTCTAT GTTGCTCGGG TTCTCTAAAA ATAATGTAAT TTCGGAGGAT 2409 .......... .......... .......... .......... .......... .......... 527 CTGACATGCT CGTGGCTGTA TTTTTATTGG ATCCAAGAGC AATGTAGCTT TTGGTTTAAA 2469 .......... .......... .......... .......... .......... .......... 527 AAATAAGGTT GATGAACTTT TTTTTTGTGG GAAAATGCAG ACAGGAACAG GATTTATCAA 2529 |||||||||| |||||||||| .......... .......... .......... .......... ACAGGAACAG GATTTATCAA 547 AGTGGAGAAT AATGAGAACA CTTTTAATAT AGGAGCTGAT TATACTACTG ATCCAAGCCA 2589 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGTGGAGAAT AATGAGAACA CTTTTAATAT AGGAGCTGAT TATACTACTG ATCCAAGCCA 607 TGTTTACAAG GGGAATCCAG CTACTAATGA TTGGATTCCT TCTGCTGTTG ATGAGGTATG 2649 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||| TGTTTACAAG GGGAATCCAG CTACTAATGA TTGGATTCCT TCTGCTGTTG ATGAG..... 662 ACCAAATGAA ACTTTTTGTA TATGTGACTA AATGTTTGCG ATAACTTGTT GAGAGGCGCG 2709 .......... .......... .......... .......... .......... .......... 
662 TGAGATGAAA AAGTCTTAGA AGTGATCTCA CATGACACAA TGAGTTCATG GTGAACACCT 2769 .......... .......... .......... .......... .......... .......... 662 GATAGTTTAG GTTTGAAACT CGAAAGACAA AAATAGTTTT AGGGAGGGGG TTGGGGGTTA 2829 .......... .......... .......... .......... .......... .......... 662 TTTTATTCAC TCAAATTGAT TTCCACATGT ATTGCCACTT GGAATTCTTG TGTAATATGT 2889 .......... .......... .......... .......... .......... .......... 662 CAACCCAAAC TGTTTTATTT TATTTCTTTG GTTATTGTAA TTCTCAAAAT TAATGTCTAA 2949 .......... .......... .......... .......... .......... .......... 662 TTGTCTGTCA TTTTTGGTGG ATCAGGTTAA TTTTATCTCA GGAAAACCAC ACAGAAGTGA 3009 ||||| |||||||||| |||||||||| |||||||||| .......... .......... .....GTTAA TTTTATCTCA GGAAAACCAC ACAGAAGTGA 697 TAACTGAAAT TGAGTCTTTG ATAAAGACAG TATCTGCTTA TTTTGTATTT CTGTGAAGCT 3069 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TAACTGAAAT TGAGTCTTTG ATAAAGACAG TATCTGCTTA TTTTGTATTT CTGTGAAGCT 757 TGGCTTCAAC TTGTGTCTCT TTTGTTTGGT TGTATTTGAA TAACATGCTA TATGAATTAT 3129 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGGCTTCAAC TTGTGTCTCT TTTGTTTGGT TGTATTTGAA TAACATGCTA TATGAATTAT 817 TTCAAACGAT TTTAATGAAG AATTTATATT AGCCAAATAT CAGCTAAGGG GGTGTGATGG 3189 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCAAACGAT TTTAATGAAG AATTTATATT AGCCAAATAT CAGCTAAGGG GGTGTGATGG 877 AATGGTTCGA ATAACCTAAA CCATTAACCA GAGGTCTCGA ATTTAAACTG TTGTGAACGA 3249 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AATGGTTCGA ATAACCTAAA CCATTAACCA GAGGTCTCGA ATTTAAACTG TTGTGAACGA 937 AAAATAGCAT TATAAAGGAT ATCCTTTTCT TCCTGTGTGA AATCTTAAA 3298 |||||||||| |||||||||| |||||||||| | |||||||| || ||| AAAATAGCAT TATAAAGGAT ATCCTTTTCT TTCTGTGTGA AAAAAAAAA 986 hqPGS_C06HBa0054K13.1-14+_SGN-U319960+ (70 502,1689 1781,2510 2644,2975 3298) ******************************************************************************** EST sequence 1 -strand 913 n (File: SGN-U344643-) 1 AATGGTTAAA AGAAATTGCT GCATAGATGC TGATCTGTAG 
TCAAATCAAA GGTCGTGTGC 61 TTGCGAAATG AATTCCGGAG AGGAGCTGAT ATGCACAGAA TGTTTATTGA GATTCAGCTA 121 GATGACCTGA ACCAGAAGAG GCAGTGGCGA TCATGTCTCT TTNGTGTTTC AGCAAAAGGA 181 GACTTCTGAA TCTTTCCTCA CGCCAAAGCT TTCTCAGGCT AAAAAGAGAT TACATGAAAC 241 TGCAATAAGA CTTGGGGAGC TTCAGGCTCA GTTTAAGCTG CCGATTGACC CAAAGGAGTA 301 TGCCCAAGAG AACCTTAAGT TTGGTTTGGT TGAAGTGGTA TATGAATGGG CAAAGGGGAC 361 CCCATTTGCT GAGATATGTG AACTCACAGA TGTCCCTGAA GGTGTGATAG TGAGGACTAT 421 TGTTAGATTA GACGAGACTT GTCGTGAATT TAGAAATGCT GCTGCAATTA TGGGCAACTC 481 TGCACTGTAC AAGAAAATGG AAACTGCATC TAATGTGATC AAGCGCGATA TCGTGTTTGC 541 AGCTAGCTTG TATATTACCG GAGTTTGATT AGAGGTCAGG AAGCCTTGAT TTCATTTGCA 601 ATTCTAAAGG ACCTGACGAT ACACATGTTG TACCCTAGAA TATGTTCGCC AAGGTGTGGA 661 CTTTGAGCAA GGCTACCCTA CAATTTGGCT AATTTGTACA GAAATTCAAG TCTTTCAAAT 721 GTACATTTAG CAATTGACAG TAGATTACAC CCTGATTAGC ACAGAGAGAT ACATGAACTT 781 AAAAGGATTT ACTGTATGAC ATTCTCTTAA AATTTAGGTA TGATTTTGAA GTTCTGCATC 841 TAAAAAAAAA AAACCTGCAG CCCGGGGGAT CCACTAGTTA TAGAGCGGCC GCCACCGCGG 901 GGAGCTCCAG TTT Predicted gene structure (within gDNA segment 7114 to 2171): Exon 1 6143 5913 ( 231 n); cDNA 5 228 ( 224 n); score: 0.944 Intron 1 5912 4246 (1667 n); Pd: 0.989 (s: 1.00), Pa: 0.996 (s: 1.00) Exon 2 4245 4119 ( 127 n); cDNA 229 355 ( 127 n); score: 1.000 Intron 2 4118 3965 ( 154 n); Pd: 0.097 (s: 1.00), Pa: 0.989 (s: 1.00) Exon 3 3964 3482 ( 483 n); cDNA 356 841 ( 486 n); score: 0.970 MATCH C06HBa0054K13.1-14- SGN-U344643- 0.967 841 0.921 C PGS_C06HBa0054K13.1-14-_SGN-U344643- (6143 5913,4245 4119,3964 3482) Alignment (genomic DNA sequence = upper lines): GTTAAAAGAA ATTGGCTGCA TAGATGCTGA TCTTGTAGTT CAAATCAAAG GTCGTGTTGC 6084 |||||||||| ||| |||||| |||||||||| || ||||| | |||||||||| |||||| ||| GTTAAAAGAA ATT-GCTGCA TAGATGCTGA TC-TGTAG-T CAAATCAAAG GTCGTG-TGC 60 TTGCGAAATG AATTCCGTAG AGGAGCTGAT ATGCACAGAA TGTTTATTTG AGAATCAGCT 6024 |||||||||| ||||||| || |||||||||| |||||||||| |||||| ||| ||| |||||| TTGCGAAATG AATTCCGGAG AGGAGCTGAT ATGCACAGAA TGTTTA-TTG AGATTCAGCT 119 
TGATGACCTG GAGCCAGAAG AGGCAGTGGC GATCATGTCT TCCTTTGTGT TTCAGCAAAA 5964 |||||||| || ||||||| |||||||||| ||||||||| || || |||| |||||||||| AGATGACCT- GAACCAGAAG AGGCAGTGGC GATCATGTC- TCTTTNGTGT TTCAGCAAAA 177 GGAGACTTCT GAATCTTTCC TCACGCCAAA GCTTTCTCAG GCTAAAAAGA GGTTATTTTC 5904 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| | GGAGACTTCT GAATCTTTCC TCACGCCAAA GCTTTCTCAG GCTAAAAAGA G......... 228 TTCTCAAATA TTTGGTGTGA TATGCTCATG TTGACGTGTT GTCATTTTTC CTGTTCATCT 5844 .......... .......... .......... .......... .......... .......... 228 ATGATATATA TATATTCAAT TGTTTTTGGT ATTTTTGGAT AAAAGTTTTT ATTGAAGATT 5784 .......... .......... .......... .......... .......... .......... 228 TTTGATAAGA TTTTGTGTTT GCGGTGTCTG AGTTTTTGAG GGATCAAACC TGTATATCAG 5724 .......... .......... .......... .......... .......... .......... 228 CCATGTATAT TTCTTGTAAT GTTTCTTTAA TGGATGGGCT TAAAAGACTG TGAAGACTTC 5664 .......... .......... .......... .......... .......... .......... 228 TCCTCGTTAT ACATTAGGTT TAATGATAGA GAACTCATCA AAATTAATTG ATGAGATGAT 5604 .......... .......... .......... .......... .......... .......... 228 TGGAATATGA TATCTATAAA AGATGTACTT TCCTTTTTAT CACTGCTTTG TTGGACAATT 5544 .......... .......... .......... .......... .......... .......... 228 TAGGATTCTA ACTTGGTATT TATATCTAGG ATGGAATAAA ATACATCATT CTTGTTTTTT 5484 .......... .......... .......... .......... .......... .......... 228 ATGAGAACAG AACATATCAT GTTAAATGTA TTCTCATGTT TCACAGAATT GCATATGCAA 5424 .......... .......... .......... .......... .......... .......... 228 GTTGTACGGG ATGCTAGTTA CTAACTTGGG TGTATCTCAA TAATTGGATT ATAATATATC 5364 .......... .......... .......... .......... .......... .......... 228 ACATTTTTCA CGGAATTGCA CATCACAAGG TCACAGATCA AAGAGCTATG ACACCTTTTC 5304 .......... .......... .......... .......... .......... .......... 228 TTGATATAAT AAAGTGGGTG CTGTGAAAGT AGAGATAAGA GTAAATGTGT GGTAAGCAAT 5244 .......... .......... .......... .......... .......... .......... 
228 TTTTGTTTAG TGTCCCACGT TGATGGTGGA AATGGGCTGT TGTTTCTTAT ATGGTTTTGG 5184 .......... .......... .......... .......... .......... .......... 228 GAAATCCTCA TTGCATGAAT TAACTTTTGG GGTTTAGTTA GGCCAAAGGT CCATTTCTCG 5124 .......... .......... .......... .......... .......... .......... 228 TCATGGTATT AAAGCCAGAC CTGTTCCTGT TCTTGGTTCA CTCAGCATTG GGCCCCACTG 5064 .......... .......... .......... .......... .......... .......... 228 TTTGTTGTTC GCTCTCCAAT TATCCAGTCC TGAGCGTGTG AGTAATGTTA AATGTTCCAC 5004 .......... .......... .......... .......... .......... .......... 228 ATTGGTGGTG AAAATGTGTT GTTGTCTCCT TATATGGTCC TGTGCGATTC TTACTTCATG 4944 .......... .......... .......... .......... .......... .......... 228 AGTTTTGGAG TTGATTTAGG CTCAAGACTC ATTTCTTATC AAGTTGTTTC TTCTTTATTA 4884 .......... .......... .......... .......... .......... .......... 228 TAGTGAATAC ACTTTTTAGA GGGAAACGAA TTTTCTGGCC AGCTGCTAAT TCTCTTATAT 4824 .......... .......... .......... .......... .......... .......... 228 TGTCAGTGTG TACTCTGCTT TTCCTTTATT GAAAGATTTT TTGAAAATGG GATTTGGTCA 4764 .......... .......... .......... .......... .......... .......... 228 GATTTACGAA TCAAGCATGT AGTGGTCTAA GACCTGGTTG CTGCTTTTGG GTCTTTCTCT 4704 .......... .......... .......... .......... .......... .......... 228 GAATGCACAA CTGCTAGTGG GATGTCTTTT GACTGCAAAA TATCCAAATT TCCTTCTGTC 4644 .......... .......... .......... .......... .......... .......... 228 AAAAAGTTAC TGAACAACAG AGAATAGAGT TTTACACACT TCAGATCTTC CACAGAACAT 4584 .......... .......... .......... .......... .......... .......... 228 TTCTGTCAGT ATTTGTGCCA ACAAAGGCTG TTACTGTTTA AACGGTGTAT GGCCAATTGG 4524 .......... .......... .......... .......... .......... .......... 228 CTCTCTTTTA CTTCTGAAGT CATGTCATCT CCTCCTTGCA CTTGTCAATG TTGTCTACTT 4464 .......... .......... .......... .......... .......... .......... 228 GTTTTTATTT GACTAGTGCA ACTGTAGTTG TTGATTACGG TAATTTCATT CTTATAACTT 4404 .......... .......... .......... .......... .......... .......... 
228 CATTTGTACT TTTTCTTATT GTTTCAAGTC ATTCTGATGT ATTCTGTAGG TATTTTCTCA 4344 .......... .......... .......... .......... .......... .......... 228 TTAATTACCA TGAGCTGTAA CTTAACCCTG TTCTTTGATA TTATAGTTTT TCTTTTTACT 4284 .......... .......... .......... .......... .......... .......... 228 TTGTTCTCAA TGACAATATC ATTCTCGTGG CATTTCAGAT TACATGAAAC TGCAATAAGA 4224 || |||||||||| |||||||||| .......... .......... .......... ........AT TACATGAAAC TGCAATAAGA 250 CTTGGGGAGC TTCAGGCTCA GTTTAAGCTG CCGATTGACC CAAAGGAGTA TGCCCAAGAG 4164 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTGGGGAGC TTCAGGCTCA GTTTAAGCTG CCGATTGACC CAAAGGAGTA TGCCCAAGAG 310 AACCTTAAGT TTGGTTTGGT TGAAGTGGTA TATGAATGGG CAAAGGTTCT CTCTCTCTCT 4104 |||||||||| |||||||||| |||||||||| |||||||||| ||||| AACCTTAAGT TTGGTTTGGT TGAAGTGGTA TATGAATGGG CAAAG..... .......... 355 CACTCACACA CGCACACAAA AAAAATTAAG GATTTGCATG TGTGGTTGCG TATGACTTAA 4044 .......... .......... .......... .......... .......... .......... 355 TTATCATAAT ATCCTACCTG TTGCTTGATT CTTCCGTTCT TGAGTGTGGG GTGTCTCTAA 3984 .......... .......... .......... .......... .......... .......... 355 TTCTGCTTCC GTCTTGCAGG GGACCCCATT TGCTGAGATA TGTGAACTCA CAGATGTCCC 3924 | |||||||||| |||||||||| |||||||||| |||||||||| .......... 
.........G GGACCCCATT TGCTGAGATA TGTGAACTCA CAGATGTCCC 396 TGAAGGTGTT ATAGTGAGGA CTATTGTTAG ATTAGACGAG ACTTGTCGTG AATTTAGAAA 3864 ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGAAGGTGTG ATAGTGAGGA CTATTGTTAG ATTAGACGAG ACTTGTCGTG AATTTAGAAA 456 TGCTGCTGCA ATTATGGGCA ACTCTGCACT GTACAAGAAA ATGGAAACTG CATCTAATGT 3804 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGCTGCTGCA ATTATGGGCA ACTCTGCACT GTACAAGAAA ATGGAAACTG CATCTAATGT 516 GATCAAGCGC GATATCGTGT TTGCAGCTAG CTTATATATT ACCGGAGTTT GATT--AGGT 3746 |||||||||| |||||||||| |||||||||| ||| |||||| |||||||||| |||| |||| GATCAAGCGC GATATCGTGT TTGCAGCTAG CTTGTATATT ACCGGAGTTT GATTAGAGGT 576 CCGGAAGCCT TGATTTCATT TGCAATTCTA AAGGACCTGA CGATACACAT GTTGTACACT 3686 | |||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||| || CAGGAAGCCT TGATTTCATT TGCAATTCTA AAGGACCTGA CGATACACAT GTTGTACCCT 636 AGAATATGTT CGCCAAGGTG TGGACTTTGA GCAAGGCTAC CCTACAATTT GGCTAATTTG 3626 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGAATATGTT CGCCAAGGTG TGGACTTTGA GCAAGGCTAC CCTACAATTT GGCTAATTTG 696 TACAGAAATT CAAGTCTTTC AAATGTACAT TTAGCAATCG ACAGTAGATT ACACCATGAT 3566 |||||||||| |||||||||| |||||||||| |||||||| | |||||||||| ||||| |||| TACAGAAATT CAAGTCTTTC AAATGTACAT TTAGCAATTG ACAGTAGATT ACACCCTGAT 756 TAGCACAGAG AGATACATAA ACTT-AAAGG ATTTACTGTA TGACATTCTC TTAAAATTTA 3507 |||||||||| |||||||| | |||| ||||| |||||||||| |||||||||| |||||||||| TAGCACAGAG AGATACATGA ACTTAAAAGG ATTTACTGTA TGACATTCTC TTAAAATTTA 816 GGTATGATTT TGAAGTTCTG CATCT 3482 |||||||||| |||||||||| ||||| GGTATGATTT TGAAGTTCTG CATCT 841 hqPGS_C06HBa0054K13.1-14-_SGN-U344643- (6143 5913,4245 4119,3964 3482) Total number of EST alignments reported: 2 ________________________________________________________________________________ Predicted gene locations (2) in segment 1 to 7572: PGL 1 (+ strand): 70 3298 AGS-1 (70 502,1689 1781,2510 2644,2975 3298) SCR (e 0.993 d 0.995 a 1.000,e 1.000 d 0.978 a 0.994,e 1.000 d 1.000 a 0.998,e 
0.985) Exon 1 70 502 ( 433 n); score: 0.993 Intron 1 503 1688 (1186 n); Pd: 0.995 Pa: 1.000 Exon 2 1689 1781 ( 93 n); score: 1.000 Intron 2 1782 2509 ( 728 n); Pd: 0.978 Pa: 0.994 Exon 3 2510 2644 ( 135 n); score: 1.000 Intron 3 2645 2974 ( 330 n); Pd: 1.000 Pa: 0.998 Exon 4 2975 3298 ( 324 n); score: 0.985 PGS (70 502,1689 1781,2510 2644,2975 3298) SGN-U319960+ 3-phase translation of AGS-1 (+strand): . . . . . . 70 ACCTACCACCAACTTCCAAAACTCTCAAAATATCTTTGAAGGAAAAAATATCTACCCCAA T Y H Q L P K L S K Y L - R K K Y L P Q P T T N F Q N S Q N I F E G K N I Y P K L P P T S K T L K I S L K E K I S T P . . . . . . 130 AAGATATCTAACTACTTTACAACATTTGTAGCACATGATATTATTACATTGATGAACCCC K I S N Y F T T F V A H D I I T L M N P R Y L T T L Q H L - H M I L L H - - T P K D I - L L Y N I C S T - Y Y Y I D E P . . . . . . 190 TTCAACTAACATCAAACAAATAATATTACCCTCTTTTTTTTTCAAGATTTTTTACAAGAA F N - H Q T N N I T L F F F Q D F L Q E S T N I K Q I I L P S F F F K I F Y K K L Q L T S N K - Y Y P L F F S R F F T R . . . . . . 250 GAAGAAGAAGAAAATGGAAGTGAATGGAAATATTAGACCAAATAGAAGTGATGTTCATTT E E E E N G S E W K Y - T K - K - C S F K K K K M E V N G N I R P N R S D V H L R R R R K W K - M E I L D Q I E V M F I . . . . . . 310 ATCAAAAGAGGAAGAAACAAAGATAGAAGAGGAAACAAGAGAGTATTTTGATGGCATTGC I K R G R N K D R R G N K R V F - W H C S K E E E T K I E E E T R E Y F D G I A Y Q K R K K Q R - K R K Q E S I L M A L . . . . . . 370 ACCAAAAAGACACACTAAACCTCAAAGAAGTGATTATTCTTCAACTTATGTTGATCACAT T K K T H - T S K K - L F F N L C - S H P K R H T K P Q R S D Y S S T Y V D H I H Q K D T L N L K E V I I L Q L M L I T . . . . . . 430 CAATCTCTACCCTTCTTCTCATGACACAATTCCCGAAAATCTCGAATTCCAACGTCTCGA Q S L P F F S - H N S R K S R I P T S R N L Y P S S H D T I P E N L E F Q R L E S I S T L L L M T Q F P K I S N S N V S . . : . . . . 490 AAATGATCCTCAG : AAATTGGTTTACAATGGCAGCCAAGTGACAGAGGAATTTATTGAAAC K - S S : E I G L Q W Q P S D R G I Y - N N D P Q : K L V Y N G S Q V T E E F I E T K M I L R : N W F T M A A K - Q R N L L K . . . . . 
: . 1736 AGAATATTACAAAGATCTTAATTGCATTGACAAGCAGCATCACACG : ACAGGAACAGGATT R I L Q R S - L H - Q A A S H : D R N R I E Y Y K D L N C I D K Q H H T : T G T G F Q N I T K I L I A L T S S I T R : Q E Q D . . . . . . 2524 TATCAAAGTGGAGAATAATGAGAACACTTTTAATATAGGAGCTGATTATACTACTGATCC Y Q S G E - - E H F - Y R S - L Y Y - S I K V E N N E N T F N I G A D Y T T D P L S K W R I M R T L L I - E L I I L L I . . . . . . 2584 AAGCCATGTTTACAAGGGGAATCCAGCTACTAATGATTGGATTCCTTCTGCTGTTGATGA K P C L Q G E S S Y - - L D S F C C - - S H V Y K G N P A T N D W I P S A V D E Q A M F T R G I Q L L M I G F L L L L M . : . . . . . 2644 G : GTTAATTTTATCTCAGGAAAACCACACAGAAGTGATAACTGAAATTGAGTCTTTGATAA : G - F Y L R K T T Q K - - L K L S L - - : V N F I S G K P H R S D N - N - V F D K R : L I L S Q E N H T E V I T E I E S L I . . . . . . 3034 AGACAGTATCTGCTTATTTTGTATTTCTGTGAAGCTTGGCTTCAACTTGTGTCTCTTTTG R Q Y L L I L Y F C E A W L Q L V S L L D S I C L F C I S V K L G F N L C L F C K T V S A Y F V F L - S L A S T C V S F . . . . . . 3094 TTTGGTTGTATTTGAATAACATGCTATATGAATTATTTCAAACGATTTTAATGAAGAATT F G C I - I T C Y M N Y F K R F - - R I L V V F E - H A I - I I S N D F N E E F V W L Y L N N M L Y E L F Q T I L M K N . . . . . . 3154 TATATTAGCCAAATATCAGCTAAGGGGGTGTGATGGAATGGTTCGAATAACCTAAACCAT Y I S Q I S A K G V - W N G S N N L N H I L A K Y Q L R G C D G M V R I T - T I L Y - P N I S - G G V M E W F E - P K P . . . . . . 3214 TAACCAGAGGTCTCGAATTTAAACTGTTGTGAACGAAAAATAGCATTATAAAGGATATCC - P E V S N L N C C E R K I A L - R I S N Q R S R I - T V V N E K - H Y K G Y P L T R G L E F K L L - T K N S I I K D I . . . 
3274 TTTTCTTCCTGTGTGAAATCTTAAA F S S C V K S - F L P V - N L K L F F L C E I L Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-14+_PGL-1_AGS-1_PPS_1 (185 502,1689 1781,2510 2644,2975 3016) (frame '2'; 585 bp, 195 residues) 1 TPSTNIKQII LPSFFFKIFY KKKKKKMEVN GNIRPNRSDV HLSKEEETKI EEETREYFDG 61 IAPKRHTKPQ RSDYSSTYVD HINLYPSSHD TIPENLEFQR LENDPQKLVY NGSQVTEEFI 121 ETEYYKDLNC IDKQHHTTGT GFIKVENNEN TFNIGADYTT DPSHVYKGNP ATNDWIPSAV 181 DEVNFISGKP HRSDN- PGL 2 (- strand): 6143 3482 AGS-1 (6143 5913,4245 4119,3964 3482) SCR (e 0.944 d 0.989 a 0.996,e 1.000 d 0.097 a 0.989,e 0.970) Exon 1 6143 5913 ( 231 n); score: 0.944 Intron 1 5912 4246 (1667 n); Pd: 0.989 Pa: 0.996 Exon 2 4245 4119 ( 127 n); score: 1.000 Intron 2 4118 3965 ( 154 n); Pd: 0.097 Pa: 0.989 Exon 3 3964 3482 ( 483 n); score: 0.970 PGS (6143 5913,4245 4119,3964 3482) SGN-U344643- 3-phase translation of AGS-1 (-strand): . . . . . . 6143 GTTAAAAGAAATTGGCTGCATAGATGCTGATCTTGTAGTTCAAATCAAAGGTCGTGTTGC V K R N W L H R C - S C S S N Q R S C C L K E I G C I D A D L V V Q I K G R V A - K K L A A - M L I L - F K S K V V L . . . . . . 6083 TTGCGAAATGAATTCCGTAGAGGAGCTGATATGCACAGAATGTTTATTTGAGAATCAGCT L R N E F R R G A D M H R M F I - E S A C E M N S V E E L I C T E C L F E N Q L L A K - I P - R S - Y A Q N V Y L R I S . . . . . . 6023 TGATGACCTGGAGCCAGAAGAGGCAGTGGCGATCATGTCTTCCTTTGTGTTTCAGCAAAA - - P G A R R G S G D H V F L C V S A K D D L E P E E A V A I M S S F V F Q Q K L M T W S Q K R Q W R S C L P L C F S K . . . . . . : 5963 GGAGACTTCTGAATCTTTCCTCACGCCAAAGCTTTCTCAGGCTAAAAAGAG : ATTACATGA G D F - I F P H A K A F S G - K E : I T - E T S E S F L T P K L S Q A K K R : L H E R R L L N L S S R Q S F L R L K R : D Y M . . . . . . 4236 AACTGCAATAAGACTTGGGGAGCTTCAGGCTCAGTTTAAGCTGCCGATTGACCCAAAGGA N C N K T W G A S G S V - A A D - P K G T A I R L G E L Q A Q F K L P I D P K E K L Q - D L G S F R L S L S C R L T Q R . . . . . . 
: 4176 GTATGCCCAAGAGAACCTTAAGTTTGGTTTGGTTGAAGTGGTATATGAATGGGCAAAG : GG V C P R E P - V W F G - S G I - M G K : G Y A Q E N L K F G L V E V V Y E W A K : G S M P K R T L S L V W L K W Y M N G Q R : . . . . . . 3962 GACCCCATTTGCTGAGATATGTGAACTCACAGATGTCCCTGAAGGTGTTATAGTGAGGAC D P I C - D M - T H R C P - R C Y S E D T P F A E I C E L T D V P E G V I V R T G P H L L R Y V N S Q M S L K V L - - G . . . . . . 3902 TATTGTTAGATTAGACGAGACTTGTCGTGAATTTAGAAATGCTGCTGCAATTATGGGCAA Y C - I R R D L S - I - K C C C N Y G Q I V R L D E T C R E F R N A A A I M G N L L L D - T R L V V N L E M L L Q L W A . . . . . . 3842 CTCTGCACTGTACAAGAAAATGGAAACTGCATCTAATGTGATCAAGCGCGATATCGTGTT L C T V Q E N G N C I - C D Q A R Y R V S A L Y K K M E T A S N V I K R D I V F T L H C T R K W K L H L M - S S A I S C . . . . . . 3782 TGCAGCTAGCTTATATATTACCGGAGTTTGATTAGGTCCGGAAGCCTTGATTTCATTTGC C S - L I Y Y R S L I R S G S L D F I C A A S L Y I T G V - L G P E A L I S F A L Q L A Y I L P E F D - V R K P - F H L . . . . . . 3722 AATTCTAAAGGACCTGACGATACACATGTTGTACACTAGAATATGTTCGCCAAGGTGTGG N S K G P D D T H V V H - N M F A K V W I L K D L T I H M L Y T R I C S P R C G Q F - R T - R Y T C C T L E Y V R Q G V . . . . . . 3662 ACTTTGAGCAAGGCTACCCTACAATTTGGCTAATTTGTACAGAAATTCAAGTCTTTCAAA T L S K A T L Q F G - F V Q K F K S F K L - A R L P Y N L A N L Y R N S S L S N D F E Q G Y P T I W L I C T E I Q V F Q . . . . . . 3602 TGTACATTTAGCAATCGACAGTAGATTACACCATGATTAGCACAGAGAGATACATAAACT C T F S N R Q - I T P - L A Q R D T - T V H L A I D S R L H H D - H R E I H K L M Y I - Q S T V D Y T M I S T E R Y I N . . . . . . 3542 TAAAGGATTTACTGTATGACATTCTCTTAAAATTTAGGTATGATTTTGAAGTTCTGCATC - R I Y C M T F S - N L G M I L K F C I K G F T V - H S L K I - V - F - S S A S L K D L L Y D I L L K F R Y D F E V L H . 
3482 T Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-14-_PGL-2_AGS-1_PPS_1 (6142 5913,4245 4119,3964 3752) (frame '2'; 567 bp, 189 residues) 1 LKEIGCIDAD LVVQIKGRVA CEMNSVEELI CTECLFENQL DDLEPEEAVA IMSSFVFQQK 61 ETSESFLTPK LSQAKKRLHE TAIRLGELQA QFKLPIDPKE YAQENLKFGL VEVVYEWAKG 121 TPFAEICELT DVPEGVIVRT IVRLDETCRE FRNAAAIMGN SALYKKMETA SNVIKRDIVF 181 AASLYITGV- ... finished at: Mon Aug 28 22:25:37 2006 ________________________________________________________________________________ Sequence 15: C06HBa0054K13.1-15, from 1 to 4628, both strands analyzed. ... started at: Mon Aug 28 22:25:37 2006 EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA +strand ... ... found all matches, elapsed seconds = 2 ... matches indexed, elapsed seconds = 2 HitsTableSize = 1 EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA -strand ... ... found all matches, elapsed seconds = 2 ... 
matches indexed, elapsed seconds = 2 HitsTableSize = 1 ******************************************************************************** EST sequence 2 +strand 1406 n (File: SGN-U345983+) 1 NAGGATCACG CGGGCGGCGC TCAGAATAGT GGATCCCCCG GGCTGCAGGA ATTCGGCACG 61 AGGCTTTTCT CCATTCTCTT GGCATTATTC ATCACACATC TTGCCCCTAT ACTCCTGAAC 121 AAAATGGTGT TGCCGAACGC AAACACCGTC ACCTCATTGA AACCGTCGTC ACTCTCCTTC 181 ATGAGTCTCA TCTCCCTGCA TGAGGCGTTG GCCACTGCCA ACTATCTCAT CAACCACATG 241 CCCAAACCCT CCCTCTCTAA CACATCTCCT TATGAATGTC TCTACACACA CCGTTCGGAT 301 TACACTCACC TTCGCCCCTT TGGTTGTCTT GCCTATCCTT GGCTACGCCC TCACACTCAA 361 CACAAACTTC AACTCCGGTC CACACCATGC ATCTTTCTAG GCTACCATCC CTCCATAAAA 421 GGATACAAAT GTTTTGAACC CATCTCTCAA AAAACATATG TCTCTCGTCA TATTCGTTTT 481 ATCGAGGAGG ACTTTCCCTA TCCCCGGTTA AGTTCTGTCA CTGCCACGTC CAACCCACTC 541 GGTATTACTC CAATCTTTCC ACCGAGCTCA CATGGCCTTG TACCAGCACC CATTAACCCA 601 TCCATCATAC ACACACCACC CAATCCCAAC ACTCTCATCC CATCTGGATC ACCCAGCACC 661 CACAATTTAC CCATCCCACC AATTTTCAAC CCTCCTCATT AACAACCTCC ACCAGCTGAA 721 ACAGCACCAC CTCCACCCCC ACCCCCATCC CCTTTACATC ACATGCAAAC CTATTCCAAA 781 TCCGGCATTT TCAAGCCCAA AGCCTACACC ACCACTTTAC CTGCTCCCTC TCCCTCTGAA 841 CCTATTACCT ACAAACAAGC ATCTACCAAC CCCCTTTTGG TGTCAGGCTA AGGATGATGA 901 ATACAGGTCT CTTATTAATC AACATACTTG GCAATTGGTA CCAGCACCCA GACATCGAAA 961 ACAATTGGTT GCAAGTGGGT GCACCGCATC AAACGTAATG CCGATGGATC CATCTCTCGT 1021 TACAAAGCTC GACTAGTTGC AAAGGGTTAT CATCAAGAGA TGACATTGAC TAATAAGAAA 1081 CTTTCCGCCC ATTCGTCAAG CAGCAGACTA TCCTCTAAGC TTGGCCCCGC TCTTCAAAAG 1141 AATGGCCCCT ACCCGCTTAA GTGAACATGC TTTTTCTCCT GGCCTCTCAA CAGGAATTTA 1201 TTGGAACAAC CCCGGGTTAG TTATATCCCC CCCCTTTTGT TGCAATTAAA AAATCCTCTG 1261 GTCTCAAAGG CCCCCGGCTG AAAAACCTTC CCTTCTTTGA GAATTGGGTG CCACGGCCCC 1321 TTAACAATTA GGAGGACAAA TTCTCCGGTT AATTTGTGGC AATAAAATCC AAAAAGAGGT 1381 TGAGAAAATC TCCGCGTTTT TGTGGG Predicted gene structure (within gDNA segment 2325 to 1): Exon 1 879 1 ( 879 n); cDNA 64 950 ( 887 n); score: 0.872 MATCH C06HBa0054K13.1-15- SGN-U345983+ 0.872 879 
0.625 C PGS_C06HBa0054K13.1-15-_SGN-U345983+ (879 1) Alignment (genomic DNA sequence = upper lines): CTTTTCTCCA ATCTCTTGGT ATTGTTCATC ACACATCTTG CCCCTATACT CCTGAACAAA 820 |||||||||| |||||||| ||| |||||| |||||||||| |||||||||| |||||||||| CTTTTCTCCA TTCTCTTGGC ATTATTCATC ACACATCTTG CCCCTATACT CCTGAACAAA 123 ATGGTGTTGC CGAACGCAAA CACCGTCACC TCATTGAAAC CATCGTCACT CTTCTTCACG 760 |||||||||| |||||||||| |||||||||| |||||||||| | |||||||| || ||||| | ATGGTGTTGC CGAACGCAAA CACCGTCACC TCATTGAAAC CGTCGTCACT CTCCTTCATG 183 AGTCTCATCT CCCTGCACCT TTCTGGGTCG AGGCGTTGGC CACTGCCAAC TATCTCATCA 700 |||||||||| ||||||| | | |||||||||| |||||||||| |||||||||| AGTCTCATCT CCCTGCA--- -------T-G AGGCGTTGGC CACTGCCAAC TATCTCATCA 232 ACCGCATGCC CACACCTTCC CTCTCTAACA CATCTCCTTA AGAATGTCTA TACACACATC 640 ||| |||||| || ||| ||| |||||||||| |||||||||| |||||||| |||||||| | ACCACATGCC CAAACCCTCC CTCTCTAACA CATCTCCTTA TGAATGTCTC TACACACACC 292 GTCCAGATTA CACTCACCTT CGACCATTTG GTTGTCTTGC CTATCCTTGG CTACGCCCTC 580 || | ||||| |||||||||| || || |||| |||||||||| |||||||||| |||||||||| GTTCGGATTA CACTCACCTT CGCCCCTTTG GTTGTCTTGC CTATCCTTGG CTACGCCCTC 352 ACACTCAACA CAAACTTTAA CCCCGGTCTA CACCATGCAT TTTTCTAGGC TACCATCCCA 520 |||||||||| ||||||| || | |||||| | |||||||||| ||||||||| ||||||||| ACACTCAACA CAAACTTCAA CTCCGGTCCA CACCATGCAT CTTTCTAGGC TACCATCCCT 412 CCATAAAAGG ATACAAATGT TTTGAACCCA TCTCTCAAAA AACATATGTC TCTCATCATG 460 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||| |||| CCATAAAAGG ATACAAATGT TTTGAACCCA TCTCTCAAAA AACATATGTC TCTCGTCATA 472 TTCGTTTTAT CGAGGAGGAC TTTCCCTATC CCCGGTTAAG CTCCGTCACT GCCACGTCCA 400 |||||||||| |||||||||| |||||||||| |||||||||| || |||||| |||||||||| TTCGTTTTAT CGAGGAGGAC TTTCCCTATC CCCGGTTAAG TTCTGTCACT GCCACGTCCA 532 ACCCACTTGA TCTTATTCCA ATCTTTCCAC CGAGCTCACA TGGCCTTATA CCGACATCCA 340 ||||||| | | ||| |||| |||||||||| |||||||||| ||||||| || || || ||| ACCCACTCGG TATTACTCCA ATCTTTCCAC CGAGCTCACA TGGCCTTGTA CCAGCACCCA 592 TTAACCCATC TATCATACAC ACACCACCCA ATCCCAACAC TCTCATCCCA 
TCTGGAT--- 283 |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| ||||||| TTAACCCATC CATCATACAC ACACCACCCA ATCCCAACAC TCTCATCCCA TCTGGATCAC 652 ----CA-CC- C-A--T-CCC A---CA--AA CTTTCAACCC TCCTCACCAA CAACCTCCAC 238 || || | | | ||| | || || ||||||||| |||||| || |||||||||| CCAGCACCCA CAATTTACCC ATCCCACCAA TTTTCAACCC TCCTCATTAA CAACCTCCAC 712 CAGCCGAAAC AACACCTCCT CCACCCCCAC CCCCACCCCC TTTACATCAC ATGCAAACCC 178 |||| ||||| | |||| ||| |||||||||| ||||| |||| |||||||||| ||||||||| CAGCTGAAAC AGCACCACCT CCACCCCCAC CCCCATCCCC TTTACATCAC ATGCAAACCT 772 GTTCCAAATC CGGCATTTTC AAGCCCAAAG TCTACACCAC CACTTCACTT GCTCCCTCTC 118 ||||||||| |||||||||| |||||||||| ||||||||| ||||| || | |||||||||| ATTCCAAATC CGGCATTTTC AAGCCCAAAG CCTACACCAC CACTTTACCT GCTCCCTCTC 832 CCTCTGAACC TACTACCTAT AAACAAGCAT CTACCAACCT TC-TTTGGTG TCAGGCTATG 59 |||||||||| || |||||| |||||||||| ||||||||| | ||||||| |||||||| | CCTCTGAACC TATTACCTAC AAACAAGCAT CTACCAACCC CCTTTTGGTG TCAGGCTAAG 892 GATGATGAAT ACAGGTCTCT TATTAATCAG CATACTTGGG AATTGGTGCC AGCACCCA 1 |||||||||| |||||||||| ||||||||| ||||||||| ||||||| || |||||||| GATGATGAAT ACAGGTCTCT TATTAATCAA CATACTTGGC AATTGGTACC AGCACCCA 950 hqPGS_C06HBa0054K13.1-15-_SGN-U345983+ (879 1) ******************************************************************************** EST sequence 1 -strand 890 n (File: SGN-U335137-) 1 GTAAAACTAT GTAGNATGAC CATTCTTTTC TTCGATACCA AAAATTAAAT TCCATATAGA 61 CATAAAAAAT GTTTTAAATT TTTTTCTTAC ACTANGGGAA TGNAAGAAAA AAAACAAGAT 121 TAATNAACTC AAATAATTAT AATAAATAAG TCAAAAAAAT AATTTATGTA TTAAAAAAAT 181 TTGAAATATA CCTTGAACTT TGAAAAAAGA ATCATATATG CCCCTAAATA TATTTTTTTT 241 TAAAATTAAA GTAAAATTAT AAATTTAAAA GTAATTTTTT CACTTTCGTT AAATGAAGGG 301 TATATATGAG CTCATTTTGT AACGGCAGAG GTATATGTGA ACCATTTGTA TAACGGTAAG 361 GGTATATATG AGCCACTTTC ATAACGAGGG GTATATCAGT TTCAAATGAC AAAGTTGAGG 421 GGTATATCAT ACCCTTTTCC CATAATATTA TTCATTTTTG GGTTGACGGG TCAAACCTTG 481 GGCTGCTTAG GACTTGATTA GACCGCTATT TTATTGACTC TTTAATTAAT GGGCAACTTT 541 CACATATAAC AAACAAAAAA TTCATATTTG 
TATGCTATAA CAAAGTTTGC ATAATTGCGC 601 TCCATAGCAA ACATAAAATT GTATAATTCG CTGACCTAAA TTGTATAATT CGCTGGCCTA 661 TTTCGCTGCA ATTGTATAAT TCGCTATCCT ATTTAACTAC AATTGTATAA TTCGCTGCCT 721 ATTTCGCTGC AATATTATTA TAAAATTTGC TTTGCATATA ATTGAACCGA ATTAAAATGT 781 ATGTATATTG CATAATTATA AGTGTATAGC AATAAGATAT ATGTTTTTCC CTGCAGCCCG 841 GGGGATCCAC TAGTTCTAGA GCGGCCGCCA CCGCGGGGAG CTCCAGCTCT Predicted gene structure (within gDNA segment 4628 to 1): Exon 1 4624 4604 ( 21 n); cDNA 285 305 ( 21 n); score: 0.762 Intron 1 4603 3671 ( 933 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.80) Exon 2 3670 3534 ( 137 n); cDNA 306 442 ( 137 n); score: 0.876 Intron 2 3533 2571 ( 963 n); Pd: 0.900 (s: 0.92), Pa: 0.000 (s: 0) Exon 3 2570 2545 ( 26 n); cDNA 443 468 ( 26 n); score: 0.731 MATCH C06HBa0054K13.1-15- SGN-U335137- 0.876 184 0.207 C PGS_C06HBa0054K13.1-15-_SGN-U335137- (4624 4604,3670 3534,2570 2545) Alignment (genomic DNA sequence = upper lines): TTCTTGAAAT TCAAGGTATA TTGATTAATC CGAGTATTTC TATAACGCAG TTTTAATAGG 4565 ||| | |||| | |||||| | TTCGTTAAAT GAAGGGTATA T......... .......... .......... .......... 305 ATATTCACTG GGTTTTTTCC TTTCTCTATT TTATTAGTCT TTATTTGTTC CTTTTTATCC 4505 .......... .......... .......... .......... .......... .......... 305 CTTTTACTTT TTTTTCTTAG TTTGTTTTTC ATTTTTTAAA AAGTGAAATT TCAACAATAA 4445 .......... .......... .......... .......... .......... .......... 305 ATATGATGAA AATATATAAT TTACAGGTAA TAGTAAAATA ATTTTACTTT AAGAATCACA 4385 .......... .......... .......... .......... .......... .......... 305 CATGGTATTT TTCGCAGCTA TAATTTTTTT ATACAATATA TTTATTCTCT TATTTTCACA 4325 .......... .......... .......... .......... .......... .......... 305 TATAGGCTTT TAAATTGTAT ACATCATTAT TTCTCTTACA AGAGTCTTTC CTTTTCCTAT 4265 .......... .......... .......... .......... .......... .......... 305 ATAATTTTTG GGTGATTGAA GATTAAGATC TTAGTTAGAG AACATATTTT AATCTTAGTA 4205 .......... .......... .......... .......... .......... .......... 
305 TGTCTTAGAT ATTATTTTCT CTCCAGTTTG AATTTGTAAT ATTTTTTCCT TCAAATTATA 4145 .......... .......... .......... .......... .......... .......... 305 TATGACTTAT CATAAGCATG TTTATAATGT TATTATTGGT TCATCTGTTT TTTTTTGTTA 4085 .......... .......... .......... .......... .......... .......... 305 ATTTATTTTC TGTTTATTAA AAATTAAATT TGAAAACTAA AAGATATTAA TTATTTTTAC 4025 .......... .......... .......... .......... .......... .......... 305 TATTTGAATA GAGTGATGAT TTTTTCATGA ATAATGAAAC ATCATATCAT CAAAAACTAA 3965 .......... .......... .......... .......... .......... .......... 305 CACAAAATAG TGCTATATTT CATGTAAATA CAATAAACTA TACTCATATA TAGATCCATG 3905 .......... .......... .......... .......... .......... .......... 305 GACTTATCAC AAGTAATTAC ATTCTCAATA CAAAATGTGT AATTGAAATG TATATTTCTG 3845 .......... .......... .......... .......... .......... .......... 305 ATAAATTTGT CTTGCGCCTT TGTTATTTTA TTATAGGCAT TAGATACATT TATTATTATT 3785 .......... .......... .......... .......... .......... .......... 305 TTTCTATCAT GAATTCATAG ACATATTTTA CAGTCATGAT TCACATATTT CTGGTTTGCT 3725 .......... .......... .......... .......... .......... .......... 305 AGCAAATTCA CCATTCTATT TTATGATCTA TTTTTAATGA ATAAAAATAA ATAGAAAAAT 3665 | | .......... .......... .......... .......... .......... ....ATGAGC 311 TAATTTTGTA ATGGCAGGGA TATATGTGAG CCGTTTGTAT AACGGTAAGG GCATATGTGA 3605 | |||||||| | ||||| | ||||||||| || ||||||| |||||||||| | |||| ||| TCATTTTGTA ACGGCAGAGG TATATGTGAA CCATTTGTAT AACGGTAAGG GTATATATGA 371 GCCACTTTTA TAACGAGGGG TATATCAGCT CCAAATGACA AAGTTGAGGG GTATATCAAA 3545 |||||||| | |||||||||| |||||||| | ||||||||| |||||||||| |||||||| | GCCACTTTCA TAACGAGGGG TATATCAGTT TCAAATGACA AAGTTGAGGG GTATATCATA 431 CCCTTTTCCC GATATCATTT GATGGTTGAT GCAAGTCTTT CATTAGTTGC TTTAGCCATG 3485 |||||||||| CCCTTTTCCC A......... .......... .......... .......... .......... 442 TGCTCTCTTG TGATGCCATC GTTCCTGCTC GATATTCTGC TTAGTTTAAA GTGAAACAGT 3425 .......... .......... .......... .......... .......... .......... 
442 TGGTTGTCTT CTGCTGCACC ATGATATTGC GGCTGATCCA AGATTGAAAT ATATTCGGTG 3365 .......... .......... .......... .......... .......... .......... 442 GTTGATCTTC GAGTATCATG GTCACCAGCA TAATCCGCAT CATAGTATCC GATTATCTTG 3305 .......... .......... .......... .......... .......... .......... 442 CAGGGAGCTC CTTTCATATA GAATAGTCCA TAGTCAAGAG TTCCTTTCAT ATACCTTAAT 3245 .......... .......... .......... .......... .......... .......... 442 ATCTTCCTTA TTGCTTCCAA TGAGGCTTCT TTGGTTTCTG CATAAATCGG CTAACAATTC 3185 .......... .......... .......... .......... .......... .......... 442 TAACAGCGAA TGCAATATCT GCTCTGGTCA ATGTTAAGTA GATAAGACTA CCGACAATTT 3125 .......... .......... .......... .......... .......... .......... 442 GACAATACAT TGTTTCTTCC TCCAAGTCTT TGCCTTCACG GGAACATAGT TTTAAGTTAG 3065 .......... .......... .......... .......... .......... .......... 442 GCTCCATCGG TGTAGAAAAT GTGTTGCAAC CCATCATTCC ATATTTTTGG ATTAGATCTC 3005 .......... .......... .......... .......... .......... .......... 442 TCGCATACTC TTGTTGTCCA AGGAATAATC CATCCTTTGT TTTTTCAATC TCAATAATAG 2945 .......... .......... .......... .......... .......... .......... 442 GAGTAACTAT AATTAGCACT AATTATGTGA CTCATGTGTA ATTATTTATT CATGTTTTCT 2885 .......... .......... .......... .......... .......... .......... 442 CTATATATAT CGGATGTACA ACCCTTAATC AATAAGACTC ATTATTATAT CAGGGTATCA 2825 .......... .......... .......... .......... .......... .......... 442 GAGCTAAGGT TAAACTCTTC CAAAACCTAA AAAAAACCCT AACCTTCTTC TTCTTCTTTG 2765 .......... .......... .......... .......... .......... .......... 442 TTGCTGCCGA TCAACAACAA AACAAAAAAA AAATTCGGCT GCCTCTCTTC ATCAACAATG 2705 .......... .......... .......... .......... .......... .......... 442 GCAGCCGATC CCCTTGCCAT CCTTCCCTCA GGTGTGAAAT TGTTCCTTCG CAATCTTCAC 2645 .......... .......... .......... .......... .......... .......... 442 AATCTCATCC CAGAAAAATT AACCGATCAT AATTATCCAG CCTGGTCGAA TAGCGTCAAG 2585 .......... .......... .......... .......... .......... .......... 
442 ATGGCGCTCT CCACAAATCT TCTTCTTGGT TGGGTCGACG 2545 ||| | | ||| | | ||||| |||| .......... ....TAATAT TATTCATTTT TGGGTTGACG 468 hqPGS_C06HBa0054K13.1-15-_SGN-U335137- (4624 4604,3670 3534,2570 2545) Total number of EST alignments reported: 2 ________________________________________________________________________________ Predicted gene locations (2) in segment 1 to 4628: PGL 1 (- strand): 879 1 AGS-1 (879 1) SCR (e 0.872) Exon 1 879 1 ( 879 n); score: 0.872 PGS (879 1) SGN-U345983+ 3-phase translation of AGS-1 (-strand): . . . . . . 879 CTTTTCTCCAATCTCTTGGTATTGTTCATCACACATCTTGCCCCTATACTCCTGAACAAA L F S N L L V L F I T H L A P I L L N K F S P I S W Y C S S H I L P L Y S - T K F L Q S L G I V H H T S C P Y T P E Q . . . . . . 819 ATGGTGTTGCCGAACGCAAACACCGTCACCTCATTGAAACCATCGTCACTCTTCTTCACG M V L P N A N T V T S L K P S S L F F T W C C R T Q T P S P H - N H R H S S S R N G V A E R K H R H L I E T I V T L L H . . . . . . 759 AGTCTCATCTCCCTGCACCTTTCTGGGTCGAGGCGTTGGCCACTGCCAACTATCTCATCA S L I S L H L S G S R R W P L P T I S S V S S P C T F L G R G V G H C Q L S H Q E S H L P A P F W V E A L A T A N Y L I . . . . . . 699 ACCGCATGCCCACACCTTCCCTCTCTAACACATCTCCTTAAGAATGTCTATACACACATC T A C P H L P S L T H L L K N V Y T H I P H A H T F P L - H I S L R M S I H T S N R M P T P S L S N T S P - E C L Y T H . . . . . . 639 GTCCAGATTACACTCACCTTCGACCATTTGGTTGTCTTGCCTATCCTTGGCTACGCCCTC V Q I T L T F D H L V V L P I L G Y A L S R L H S P S T I W L S C L S L A T P S R P D Y T H L R P F G C L A Y P W L R P . . . . . . 579 ACACTCAACACAAACTTTAACCCCGGTCTACACCATGCATTTTTCTAGGCTACCATCCCA T L N T N F N P G L H H A F F - A T I P H S T Q T L T P V Y T M H F S R L P S H H T Q H K L - P R S T P C I F L G Y H P . . . . . . 519 CCATAAAAGGATACAAATGTTTTGAACCCATCTCTCAAAAAACATATGTCTCTCATCATG P - K D T N V L N P S L K K H M S L I M H K R I Q M F - T H L S K N I C L S S C T I K G Y K C F E P I S Q K T Y V S H H . . . . . . 
459 TTCGTTTTATCGAGGAGGACTTTCCCTATCCCCGGTTAAGCTCCGTCACTGCCACGTCCA F V L S R R T F P I P G - A P S L P R P S F Y R G G L S L S P V K L R H C H V Q V R F I E E D F P Y P R L S S V T A T S . . . . . . 399 ACCCACTTGATCTTATTCCAATCTTTCCACCGAGCTCACATGGCCTTATACCGACATCCA T H L I L F Q S F H R A H M A L Y R H P P T - S Y S N L S T E L T W P Y T D I H N P L D L I P I F P P S S H G L I P T S . . . . . . 339 TTAACCCATCTATCATACACACACCACCCAATCCCAACACTCTCATCCCATCTGGATCAC L T H L S Y T H H P I P T L S S H L D H - P I Y H T H T T Q S Q H S H P I W I T I N P S I I H T P P N P N T L I P S G S . . . . . . 279 CCATCCCACAAACTTTCAACCCTCCTCACCAACAACCTCCACCAGCCGAAACAACACCTC P S H K L S T L L T N N L H Q P K Q H L H P T N F Q P S S P T T S T S R N N T S P I P Q T F N P P H Q Q P P P A E T T P . . . . . . 219 CTCCACCCCCACCCCCACCCCCTTTACATCACATGCAAACCCGTTCCAAATCCGGCATTT L H P H P H P L Y I T C K P V P N P A F S T P T P T P F T S H A N P F Q I R H F P P P P P P P P L H H M Q T R S K S G I . . . . . . 159 TCAAGCCCAAAGTCTACACCACCACTTCACTTGCTCCCTCTCCCTCTGAACCTACTACCT S S P K S T P P L H L L P L P L N L L P Q A Q S L H H H F T C S L S L - T Y Y L F K P K V Y T T T S L A P S P S E P T T . . . . . . 99 ATAAACAAGCATCTACCAACCTTCTTTGGTGTCAGGCTATGGATGATGAATACAGGTCTC I N K H L P T F F G V R L W M M N T G L - T S I Y Q P S L V S G Y G - - I Q V S Y K Q A S T N L L W C Q A M D D E Y R S . . . . 
39 TTATTAATCAGCATACTTGGGAATTGGTGCCAGCACCCA L L I S I L G N W C Q H P Y - S A Y L G I G A S T L I N Q H T W E L V P A P Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-15-_PGL-1_AGS-1_PPS_1 (559 2) (frame '0'; 558 bp, 186 residues) 1 PRSTPCIFLG YHPTIKGYKC FEPISQKTYV SHHVRFIEED FPYPRLSSVT ATSNPLDLIP 61 IFPPSSHGLI PTSINPSIIH TPPNPNTLIP SGSPIPQTFN PPHQQPPPAE TTPPPPPPPP 121 PLHHMQTRSK SGIFKPKVYT TTSLAPSPSE PTTYKQASTN LLWCQAMDDE YRSLINQHTW 181 ELVPAP >C06HBa0054K13.1-15-_PGL-1_AGS-1_PPS_2 (879 532) (frame '1'; 345 bp, 115 residues) 1 LFSNLLVLFI THLAPILLNK MVLPNANTVT SLKPSSLFFT SLISLHLSGS RRWPLPTISS 61 TACPHLPSLT HLLKNVYTHI VQITLTFDHL VVLPILGYAL TLNTNFNPGL HHAFF- 3-phase translation of AGS-1 (+strand): . . . . . . 1 TGGGTGCTGGCACCAATTCCCAAGTATGCTGATTAATAAGAGACCTGTATTCATCATCCA W V L A P I P K Y A D - - E T C I H H P G C W H Q F P S M L I N K R P V F I I H G A G T N S Q V C - L I R D L Y S S S . . . . . . 61 TAGCCTGACACCAAAGAAGGTTGGTAGATGCTTGTTTATAGGTAGTAGGTTCAGAGGGAG - P D T K E G W - M L V Y R - - V Q R E S L T P K K V G R C L F I G S R F R G R I A - H Q R R L V D A C L - V V G S E G . . . . . . 121 AGGGAGCAAGTGAAGTGGTGGTGTAGACTTTGGGCTTGAAAATGCCGGATTTGGAACGGG R E Q V K W W C R L W A - K C R I W N G G S K - S G G V D F G L E N A G F G T G E G A S E V V V - T L G L K M P D L E R . . . . . . 181 TTTGCATGTGATGTAAAGGGGGTGGGGGTGGGGGTGGAGGAGGTGTTGTTTCGGCTGGTG F A C D V K G V G V G V E E V L F R L V L H V M - R G W G W G W R R C C F G W W V C M - C K G G G G G G G G G V V S A G . . . . . . 241 GAGGTTGTTGGTGAGGAGGGTTGAAAGTTTGTGGGATGGGTGATCCAGATGGGATGAGAG E V V G E E G - K F V G W V I Q M G - E R L L V R R V E S L W D G - S R W D E S G G C W - G G L K V C G M G D P D G M R . . . . . . 301 TGTTGGGATTGGGTGGTGTGTGTATGATAGATGGGTTAATGGATGTCGGTATAAGGCCAT C W D W V V C V - - M G - W M S V - G H V G I G W C V Y D R W V N G C R Y K A M V L G L G G V C M I D G L M D V G I R P . . . . . . 
361 GTGAGCTCGGTGGAAAGATTGGAATAAGATCAAGTGGGTTGGACGTGGCAGTGACGGAGC V S S V E R L E - D Q V G W T W Q - R S - A R W K D W N K I K W V G R G S D G A C E L G G K I G I R S S G L D V A V T E . . . . . . 421 TTAACCGGGGATAGGGAAAGTCCTCCTCGATAAAACGAACATGATGAGAGACATATGTTT L T G D R E S P P R - N E H D E R H M F - P G I G K V L L D K T N M M R D I C F L N R G - G K S S S I K R T - - E T Y V . . . . . . 481 TTTGAGAGATGGGTTCAAAACATTTGTATCCTTTTATGGTGGGATGGTAGCCTAGAAAAA F E R W V Q N I C I L L W W D G S L E K L R D G F K T F V S F Y G G M V A - K N F - E M G S K H L Y P F M V G W - P R K . . . . . . 541 TGCATGGTGTAGACCGGGGTTAAAGTTTGTGTTGAGTGTGAGGGCGTAGCCAAGGATAGG C M V - T G V K V C V E C E G V A K D R A W C R P G L K F V L S V R A - P R I G M H G V D R G - S L C - V - G R S Q G - . . . . . . 601 CAAGACAACCAAATGGTCGAAGGTGAGTGTAATCTGGACGATGTGTGTATAGACATTCTT Q D N Q M V E G E C N L D D V C I D I L K T T K W S K V S V I W T M C V - T F L A R Q P N G R R - V - S G R C V Y R H S . . . . . . 661 AAGGAGATGTGTTAGAGAGGGAAGGTGTGGGCATGCGGTTGATGAGATAGTTGGCAGTGG K E M C - R G K V W A C G - - D S W Q W R R C V R E G R C G H A V D E I V G S G - G D V L E R E G V G M R L M R - L A V . . . . . . 721 CCAACGCCTCGACCCAGAAAGGTGCAGGGAGATGAGACTCGTGAAGAAGAGTGACGATGG P T P R P R K V Q G D E T R E E E - R W Q R L D P E R C R E M R L V K K S D D G A N A S T Q K G A G R - D S - R R V T M . . . . . . 781 TTTCAATGAGGTGACGGTGTTTGCGTTCGGCAACACCATTTTGTTCAGGAGTATAGGGGC F Q - G D G V C V R Q H H F V Q E Y R G F N E V T V F A F G N T I L F R S I G A V S M R - R C L R S A T P F C S G V - G . . . . 
841 AAGATGTGTGATGAACAATACCAAGAGATTGGAGAAAAG K M C D E Q Y Q E I G E K R C V M N N T K R L E K Q D V - - T I P R D W R K Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-15+_PGL-1_AGS-1_PPS_1 (653 877) (frame '2'; 225 bp, 75 residues) 1 TFLRRCVREG RCGHAVDEIV GSGQRLDPER CREMRLVKKS DDGFNEVTVF AFGNTILFRS 61 IGARCVMNNT KRLEK PGL 2 (- strand): 4624 2545 AGS-1 (4624 4604,3670 3534,2570 2545) SCR (e 0.762 d 0.000 a 0.000,e 0.876 d 0.900 a 0.000,e 0.731) Exon 1 4624 4604 ( 21 n); score: 0.762 Intron 1 4603 3671 ( 933 n); Pd: 0.000 Pa: 0.000 Exon 2 3670 3534 ( 137 n); score: 0.876 Intron 2 3533 2571 ( 963 n); Pd: 0.900 Pa: 0.000 Exon 3 2570 2545 ( 26 n); score: 0.731 PGS (4624 4604,3670 3534,2570 2545) SGN-U335137- 3-phase translation of AGS-1 (-strand): . . . : . . . 4624 TTCTTGAAATTCAAGGTATAT : AAAAATTAATTTTGTAATGGCAGGGATATATGTGAGCCG F L K F K V Y : K N - F C N G R D I C E P S - N S R Y I : K I N F V M A G I Y V S R L E I Q G I : - K L I L - W Q G Y M - A . . . . . . 3631 TTTGTATAACGGTAAGGGCATATGTGAGCCACTTTTATAACGAGGGGTATATCAGCTCCA F V - R - G H M - A T F I T R G I S A P L Y N G K G I C E P L L - R G V Y Q L Q V C I T V R A Y V S H F Y N E G Y I S S . . . . : . . 3571 AATGACAAAGTTGAGGGGTATATCAAACCCTTTTCCCG : AAATCTTCTTCTTGGTTGGGTC N D K V E G Y I K P F S R : N L L L G W V M T K L R G I S N P F P : E I F F L V G S K - Q S - G V Y Q T L F P : K S S S W L G . 2548 GACG D T R Maximal non-overlapping open reading frames (>= 64 codons): none ... finished at: Mon Aug 28 22:25:45 2006 ________________________________________________________________________________ Sequence 16: C06HBa0054K13.1-16, from 1 to 28288, both strands analyzed. ... started at: Mon Aug 28 22:25:45 2006 EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA +strand ... ... found all matches, elapsed seconds = 4 ... 
matches indexed, elapsed seconds = 4 HitsTableSize = 15 EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA -strand ... ... found all matches, elapsed seconds = 2 ... matches indexed, elapsed seconds = 2 HitsTableSize = 3 ******************************************************************************** EST sequence 15 +strand 898 n (File: SGN-U341961+) 1 GGGGNNNGGT TTGAAAACCT TGGACAACGC TGGAGCTCAC CGCGGTGGCG GCCGCTCTAG 61 AACTAGTGGA TCCCCCGGGC TGCAGGAATT CGGCACGAGC TTGCTCGTTT CTCAGTTTGC 121 TGCACACAAG AAGACAAGAA GCACAAATAA TCAATAGCAA ATGTACGCAG CAAAAGAAGC 181 CAAGATCGAG TCCAAGATCG AGTAAAATGG GTCTGTCAAG TAATCAAGAA AATAGAGTGA 241 TTTGTCAAAG CCAAAAATAA TATATTAAAG GTATAAGTTG ATACAAATAA GGAAAACCTT 301 AAATATATTA GTAAGGAAAG AAGTTGTATA TAATAAGGAT TGTGCTTTAA AAGGGAATCC 361 TTGTTTAATA ATAACTTCCT TATCTTATCT GATAAGGACT GAAGGCAAAA GGAACTCTAT 421 AAGAAGAAGA AGACGCTGAT GAAGAAAATG AGGACTTCAT GCAGAGAACA AGGGAGAATT 481 ATTCATCAAA TTGAAGTCTT CAAGGTTGAT AGGATTACTA CGTGTAAAAC ATTCTTGAGT 541 AAGAAGGTTT TATCGTGTAG AACTATTTGT CTTGTTTTTG TTCTTAATTG TAGTTTACAG 601 TTTATTGTAC AAAGGGTGGG TTTGGCTCTT TGTAGGGTTG AGTGTTAGTA AGAGTTGTAA 661 CAAAAAGTGG GTTTGGCCTT TTGGAGAGAT CGATTGGAGT CAATCGNAGG AGTAGTAGAG 721 ATAAGACTTT TGATTATTGA GTTGTAATCA CAAAATCTTA TAGTTGAAAT AAATAAAACG 781 AGGTTTTTTC TTTCTTGAGT AAAGGAAGGT TTTTAATTTC CCCACCTAAC NNCCNCCCTC 841 NNCNNCCNNT CTTTTTTTTT CTTTCCAATT CTTTCCCACC CCCTTTTTTC CTTTCTTA Predicted gene structure (within gDNA segment 931 to 5677): Exon 1 2966 2975 ( 10 n); cDNA 124 132 ( 9 n); score: 0.900 Intron 1 2976 3129 ( 154 n); Pd: 0.796 (s: 0), Pa: 0.000 (s: 0.94) Exon 2 3130 3816 ( 687 n); cDNA 133 819 ( 687 n); score: 0.881 MATCH C06HBa0054K13.1-16+ SGN-U341961+ 0.881 697 0.776 C PGS_C06HBa0054K13.1-16+_SGN-U341961+ (2966 2975,3130 3816) Alignment (genomic DNA sequence = upper lines): ACACATAGAA GCAACACACC GAAGAACTTG GTCCTTTTAC AGATTTTATG TGTTATGAAT 3025 ||||| |||| ACACA-AGAA .......... .......... .......... 
.......... .......... 132 TTTTCTTTGT GGTTAAGTCG TGATGTAATC TCAGTTCAAT GAGTTATAAA CTAAATCAGG 3085 .......... .......... .......... .......... .......... .......... 132 TCTGATAGAA TATGATAGAA TCGATAATTC ATGTTTTGAT GATGGACAAG AAGCGCAAAT 3145 |||||| |||| ||||| .......... .......... .......... .......... ....GACAAG AAGCACAAAT 148 AATCAGTAGC AAATATACGC AGCAAAAGAA GCCAAGCTCG AGTCCAAGAT CGAGTAAAAT 3205 ||||| |||| |||| ||||| |||||||||| |||||| ||| |||||||||| |||||||||| AATCAATAGC AAATGTACGC AGCAAAAGAA GCCAAGATCG AGTCCAAGAT CGAGTAAAAT 208 GGATCAGCCA AGTAATCAAG AAAATATAGT TATTTGTCAA AGCCAAAAAT AATATATTAA 3265 || || | || |||||||||| |||||| ||| ||||||||| |||||||||| |||||||||| GGGTCTGTCA AGTAATCAAG AAAATAGAGT GATTTGTCAA AGCCAAAAAT AATATATTAA 268 AGGTATAAGT TGATACACAT AAGGAAAACC TTAAATATAT TATTAAGGAA GGAAGTTGTA 3325 |||||||||| ||||||| || |||||||||| |||||||||| || ||||||| ||||||||| AGGTATAAGT TGATACAAAT AAGGAAAACC TTAAATATAT TAGTAAGGAA AGAAGTTGTA 328 TATAATAAGG ATTGTACTTT AAAAGGGAAT CCTTGTTTAA CAATAAGTTC CTTACCTTAT 3385 |||||||||| ||||| |||| |||||||||| |||||||||| ||||| ||| |||| ||||| TATAATAAGG ATTGTGCTTT AAAAGGGAAT CCTTGTTTAA TAATAACTTC CTTATCTTAT 388 CTGATAAGGA TTTAAGGCAA AAGAAACTCT ATAAGAAGAG GACGATGTTG ATGAAGAATC 3445 |||||||||| | ||||||| ||| |||||| ||||||||| || || | || |||||||| CTGATAAGGA CTGAAGGCAA AAGGAACTCT ATAAGAAGAA GAAGACGCTG ATGAAGAAAA 448 CCATGACTTC ACATTGAGAG AAAAGTGATA ATTATTCATC AAACTGAAGT CTTCAAGATC 3505 | |||||| | |||| | ||| || | |||||||||| ||| |||||| ||||||| | TGAGGACTTC ATGCAGAGA- ACAAGGGAGA ATTATTCATC AAATTGAAGT CTTCAAGGTT 507 GATAGGTTTG ATACGTTTAA AACGTTCTTG AGTGAGAAGG TTTTTAACGC GTAGAACTAT 3565 |||||| || ||||| ||| ||| |||||| ||| |||||| ||||| || |||||||||| GATAGGATTA CTACGTGTAA AACATTCTTG AGTAAGAAGG -TTTTATCGT GTAGAACTAT 566 CT-TCTTG-T TTTGTTCTTG ATTGTAATTT ACAGTTTATT ATACAAAGGG TGGGTTTGGC 3623 | ||||| | ||||||||| |||||| ||| |||||||||| ||||||||| |||||||||| TTGTCTTGTT TTTGTTCTTA ATTGTAGTTT ACAGTTTATT GTACAAAGGG TGGGTTTGGC 626 TCTTTGTAGG GTTGAGTTTT CGTGAGGATT 
GTAACAAAAG GTGGGTTTGG CCTTTTGGAG 3683 |||||||||| ||||||| || || || || ||||||||| |||||||||| |||||||||| TCTTTGTAGG GTTGAGTGTT AGTAAGAGTT GTAACAAAAA GTGGGTTTGG CCTTTTGGAG 686 AGATCAATTG TAGTCAATCG AGAGAGTTAG TAGAGATAAG GCTTTTTGAT TATTGAGTTG 3743 ||||| |||| ||||||||| ||| ||| |||||||||| | ||||||| |||||||||| AGATCGATTG GAGTCAATCG NAGGAG-TAG TAGAGATAAG AC-TTTTGAT TATTGAGTTG 744 TAATCACAAA ATCTTATAGT TG-AATTAAT AAAATGAGGT TTTTCCTTCC TTGAGT-GCG 3801 |||||||||| |||||||||| || ||| ||| |||| ||||| |||| ||| | |||||| | TAATCACAAA ATCTTATAGT TGAAATAAAT AAAACGAGGT TTTTTCTTTC TTGAGTAAAG 804 GAAGGTTTTT AATTT 3816 |||||||||| ||||| GAAGGTTTTT AATTT 819 hqPGS_C06HBa0054K13.1-16+_SGN-U341961+ (2966 2975,3130 3816) ******************************************************************************** EST sequence 14 -strand 898 n (File: SGN-U341961-) 1 TAAGAAAGGA AAAAAGGGGG TGGGAAAGAA TTGGAAAGAA AAAAAAAGAN NGGNNGNNGA 61 GGGNGGNNGT TAGGTGGGGA AATTAAAAAC CTTCCTTTAC TCAAGAAAGA AAAAACCTCG 121 TTTTATTTAT TTCAACTATA AGATTTTGTG ATTACAACTC AATAATCAAA AGTCTTATCT 181 CTACTACTCC TNCGATTGAC TCCAATCGAT CTCTCCAAAA GGCCAAACCC ACTTTTTGTT 241 ACAACTCTTA CTAACACTCA ACCCTACAAA GAGCCAAACC CACCCTTTGT ACAATAAACT 301 GTAAACTACA ATTAAGAACA AAAACAAGAC AAATAGTTCT ACACGATAAA ACCTTCTTAC 361 TCAAGAATGT TTTACACGTA GTAATCCTAT CAACCTTGAA GACTTCAATT TGATGAATAA 421 TTCTCCCTTG TTCTCTGCAT GAAGTCCTCA TTTTCTTCAT CAGCGTCTTC TTCTTCTTAT 481 AGAGTTCCTT TTGCCTTCAG TCCTTATCAG ATAAGATAAG GAAGTTATTA TTAAACAAGG 541 ATTCCCTTTT AAAGCACAAT CCTTATTATA TACAACTTCT TTCCTTACTA ATATATTTAA 601 GGTTTTCCTT ATTTGTATCA ACTTATACCT TTAATATATT ATTTTTGGCT TTGACAAATC 661 ACTCTATTTT CTTGATTACT TGACAGACCC ATTTTACTCG ATCTTGGACT CGATCTTGGC 721 TTCTTTTGCT GCGTACATTT GCTATTGATT ATTTGTGCTT CTTGTCTTCT TGTGTGCAGC 781 AAACTGAGAA ACGAGCAAGC TCGTGCCGAA TTCCTGCAGC CCGGGGGATC CACTAGTTCT 841 AGAGCGGCCG CCACCGCGGT GAGCTCCAGC GTTGTCCAAG GTTTTCAAAC CNNNCCCC Predicted gene structure (within gDNA segment 13280 to 9076): Exon 1 11880 11197 ( 684 n); cDNA 80 763 ( 684 n); score: 
0.892 MATCH C06HBa0054K13.1-16- SGN-U341961- 0.892 684 0.762 C PGS_C06HBa0054K13.1-16-_SGN-U341961- (11880 11197) Alignment (genomic DNA sequence = upper lines): AAATTAAAAA CCTTCC-GCA CTCAAGGAAG GAAAAACCTC GTTTTA-TTA ATTCAACTAT 11823 |||||||||| |||||| | |||||| ||| ||||||||| |||||| ||| ||||||||| AAATTAAAAA CCTTCCTTTA CTCAAGAAAG AAAAAACCTC GTTTTATTTA TTTCAACTAT 139 AAGATTTTGT GATTACAACT CAATAATCAA AAAGCCTTAT CTCTACTACC TCTCTCGATT 11763 |||||||||| |||||||||| |||||||| | |||| ||||| |||||||| | || ||||| AAGATTTTGT GATTACAACT CAATAATC-A AAAGTCTTAT CTCTACTA-C TCCTNCGATT 197 GACTACAATC GATCTCTCCA AAAGGCCAAA CCCACCTTTT GTTACAATCC TCACGAAAAC 11703 |||| ||||| |||||||||| |||||||||| ||||| |||| ||||||| | | || || || GACTCCAATC GATCTCTCCA AAAGGCCAAA CCCACTTTTT GTTACAACTC TTACTAACAC 257 TCAACCCTAC AAAGAGCCAA ACCCACCCTT TGTATAATAA ACTGTAAATT ACAATCAAGA 11643 |||||||||| |||||||||| |||||||||| |||| ||||| |||||||| | ||||| |||| TCAACCCTAC AAAGAGCCAA ACCCACCCTT TGTACAATAA ACTGTAAACT ACAATTAAGA 317 AC-AAAACAA GA-AGATAGT TCTACACGTT AAAAACCTTC TCACTCAAGA ACGTTTTAAA 11585 || ||||||| || | ||||| |||||||| | ||||||||| | |||||||| | |||||| | ACAAAAACAA GACAAATAGT TCTACACGAT -AAAACCTTC TTACTCAAGA ATGTTTTACA 376 CGTAGCAAAC CTATTGATCT TGAAGACTTT AGTTTGATGA ATAATTCTCA CTTTTCTCTC 11525 ||||| || | |||| | || ||||||||| | |||||||| ||||||||| ||| | |||| CGTAGTAATC CTATCAACCT TGAAGACTTC AATTTGATGA ATAATTCTCC CTTGT-TCTC 435 TATGTGAAGT CGTGGGATTC TTCATCAGCA TCGTCCTCTT CTTATAGAGT TTCTTTTGCC 11465 | |||||| | | ||| ||||||||| || || |||| |||||||||| | |||||||| TGCATGAAGT CCTCATTTTC TTCATCAGCG TCTTCTTCTT CTTATAGAGT TCCTTTTGCC 495 TTAAGTCCTT ATCAGATGAG GTAAGAAAGT TATTGTTAAA CAAGGATTCC CTTTTAAAGT 11405 || ||||||| ||||||| || |||| |||| |||| ||||| |||||||||| ||||||||| TTCAGTCCTT ATCAGATAAG ATAAGGAAGT TATTATTAAA CAAGGATTCC CTTTTAAAGC 555 ACAATCCTTA TTATATACAA CTTCCTTCCT TAATAATATA TTTAAGGTTT TCCTTATTTG 11345 |||||||||| |||||||||| |||| ||||| || ||||||| |||||||||| |||||||||| ACAATCCTTA TTATATACAA CTTCTTTCCT TACTAATATA TTTAAGGTTT 
TCCTTATTTG 615 TATCAACTTA TACCTTTAAT ATATTATTTT TGGCTTTGAC AAATAACTCT ATTTTCTTGA 11285 |||||||||| |||||||||| |||||||||| |||||||||| |||| ||||| |||||||||| TATCAACTTA TACCTTTAAT ATATTATTTT TGGCTTTGAC AAATCACTCT ATTTTCTTGA 675 TTACTTGGCT GACCCATTTT ACTCGATCAT GGACTCGAGC TTGGCTTCTT TTGCTGCGTA 11225 ||||||| | |||||||||| |||||||| | |||||||| | |||||||||| |||||||||| TTACTTGACA GACCCATTTT ACTCGATCTT GGACTCGATC TTGGCTTCTT TTGCTGCGTA 735 CATTTGCTAC TGATTATTTG CGCTTCTT 11197 ||||||||| |||||||||| ||||||| CATTTGCTAT TGATTATTTG TGCTTCTT 763 hqPGS_C06HBa0054K13.1-16-_SGN-U341961- (11880 11197) ******************************************************************************** EST sequence 11 +strand 904 n (File: SGN-U345542+) 1 AGCTGGAGCT CACCGCGGTG GCGGCCGCTC TAGAACTAGT GGATCCCCCG GGCTGCAGGA 61 TTCGGCACGA GTAGGGATCA CTTGCAATGG GCGCCATAGA GTCACTGCTT TAGACATTTC 121 TAGCATGCAA CTTCATGGTA CCATTCCTCC ACACCTTGGA AATCTCTCAT TTCTCGTATC 181 CCTTGACATT AGTAACAACA CTTTCCATGG ACATCTCTCC CAAGAGTTGA CTCACTTGCG 241 GAGGTTGAAA TTGATTGATG TTACAAGAAA CAATTTTAGT GGTGCCATCC CATCGTTCTT 301 AAGTTCGTTA CCTAATCTTC AATTCCTTTA CCTTTCAAAC AACCAATATT CAGGGGAAAT 361 TCCATCTTCT CTTTCCAATC TAACAAATCT TCAAGAGTTG AGAATACAGA GGAATTTTCT 421 TCAAGGAAAA ATCCCTCCTG AAATTGGTAA TCTTCGTTAC TTGACTTTTT TAGACCTGCA 481 AGGTAATAGA CTTACTGGCT CTATACCGCC GTCAATCTTC AACATGACTT CATTGACAAG 541 ACTTGCTATA ATACACAATC GTCTTGTCGG GAAACTTCCT GTTGATATTT GTGACAATCT 601 ACCAAATCTT CAAGTGCTTT TACTCTCATC AAACAACCTG AATGGACGAA ATACACAAAT 661 TTACAGAAAT GCTCAAACCT ACAACTGNTG ACATTGTCTG GCAATGAGTT CACTGGACCC 721 TATACCAGAG AACTTNGGAA CTTACGATGC TCACAATCTT ACATCTGGNA GAGAACATTT 781 TAGAAAGTGT NGATCGATGA TTCTCACATC GNTCTAGGAT ATGTTTGTTN ANGGTGCAAA 841 AGAGAAAGAA GAATTCAGGT CCACAGATGC ATCTTTGTTG AAAGACATGA ANGAATTACN 901 TATA Predicted gene structure (within gDNA segment 11525 to 21198): Exon 1 13077 13797 ( 721 n); cDNA 74 789 ( 716 n); score: 0.712 Intron 1 13798 15201 (1404 n); Pd: 0.835 (s: 0.54), Pa: 0.000 (s: 0.58) Exon 2 15202 15321 ( 120 n); 
cDNA 790 903 ( 114 n); score: 0.633 MATCH C06HBa0054K13.1-16+ SGN-U345542+ 0.700 841 0.930 C PGS_C06HBa0054K13.1-16+_SGN-U345542+ (13077 13797,15202 15321) Alignment (genomic DNA sequence = upper lines): GGAATCACTT GCAGCTCCCG TCACCATCGA GTCACTGCTT TAGACATTTC AAGCATGCAA 13136 || ||||||| ||| | | | |||| || |||||||||| |||||||||| ||||||||| GGGATCACTT GCAA-T--GG GCGCCATAGA GTCACTGCTT TAGACATTTC TAGCATGCAA 130 CTTTATGGTA CCATTCCTCC ACACCTTGGA AACCTCTCAT TTATTTCATC GCTTGACATC 13196 ||| |||||| |||||||||| |||||||||| || ||||||| || | ||| |||||||| CTTCATGGTA CCATTCCTCC ACACCTTGGA AATCTCTCAT TTCTCGTATC CCTTGACATT 190 AGTAACAACA CTTTCCATGG AGAGTTGCCA CTAGAGTTGG TTCGTTTGCA GAGGTTGAAA 13256 |||||||||| |||||||||| | | | | | ||||||| || |||| |||||||||| AGTAACAACA CTTTCCATGG ACATCTCTCC CAAGAGTTGA CTCACTTGCG GAGGTTGAAA 250 TTCTTTAATA CTAAAAACAA TAACTTCACC GGAGCCATTC CATCATTTTT AAGTTTGTTA 13316 || || || || || || || || | || ||||| | |||| || || ||||| |||| TTGATTGATG TTACAAGAAA CAATTTTAGT GGTGCCATCC CATCGTTCTT AAGTTCGTTA 310 CCAAACCTAC GCTTTCTGTA CCTATCGAAT AACCAATTTT CGGGTAAAAT TCCATCCTCC 13376 || || || | || || || ||| || || ||||||| || | || |||| |||||| || CCTAATCTTC AATTCCTTTA CCTTTCAAAC AACCAATATT CAGGGGAAAT TCCATCTTCT 370 CTTTCCAATC TGACAAAACT GCAAGTGTTG TCAATACAGA GTAATTATAT TGAAGGAGAG 13436 |||||||||| | ||||| || |||| |||| |||||||| | |||| | | | ||||| | CTTTCCAATC TAACAAATCT TCAAGAGTTG AGAATACAGA GGAATTTTCT TCAAGGAAAA 430 ATCCCTCAAG AACTCGGTGA TCTTCGTTCC TTGATTATCC TAAACCTGCA ATATAATCAG 13496 ||||||| | || | ||| | |||||||| | |||| | | || ||||||| | |||| ATCCCTCCTG AAATTGGTAA TCTTCGTTAC TTGACTTTTT TAGACCTGCA AGGTAATAGA 490 CTTAGTGGCT CTATACCATC TTCAATCTTT GACATCACTA CAATG-CAAG TAATTGCTCT 13555 |||| ||||| ||||||| | |||||||| |||| ||| || || |||| | ||||| | CTTACTGGCT CTATACCGCC GTCAATCTTC AACATGACTT CATTGACAAG -ACTTGCTAT 549 TAGTGGCAAC AATCTTACTG GAAAGATTCC AATCACGATA TGTGATCATC TTCCAGACTT 13615 | ||| |||| | | || |||| | || ||||| ||| | ||| | | AATACACAAT CGTCTTGTCG GGAAACTTCC TGTTGATATT TGTGACAATC 
TACCAAATCT 609 GGAAGGACTT TACCTCGGCA GAAACTCCCT TGATGGAGTT ATTCCACCAA ACCTGGAGAA 13675 ||| ||| | ||| |||| ||| ||||| | | || ||| | | |||| TCAAGTGCTT TTACTCTCAT CAAACAACCT GAATGGACGA AATACA-CAA ATTTACAGAA 668 ATGCAGAAAG CTTCAAATAT TGGAATTGAC TGAAAATGAG ATTGCTGGA- ACTGTACCAA 13734 |||| ||| || ||| | || |||| | || |||||| | ||||| || |||| | ATGCTCAAAC CTACAACTGN TGACATTGTC TGGCAATGAG TTCACTGGAC CCTATACC-A 727 GAGAGTTAGC CAACTTAACA ACTCTTACAG GACTATATCT TATGGATCTG CATTTGGAAG 13794 |||| | ||||| || | || ||| || |||| || ||| ||| GAGAACTTNG GAACTT-ACG ATGCTCACAA TCTTACATCT GGNAGAGAAC ATTTTAGAAA 786 GTAGTATGAA ATTTTCTTCC TTTTTCTCTT TTCTGCCAAA ACTTGTGCTC ATATATATTC 13854 || GTG....... .......... .......... .......... .......... .......... 789 CCAAAATTGA CAGGAGAGAT ACCAATGGCG CTCGCTAATC TTAAGAAACT TCAAACATTA 13914 .......... .......... .......... .......... .......... .......... 789 GTATTATCAC TGAATGAGCT AACTGGCTCT ATCCCTGACA GCATTTTCAA CATGTCAACA 13974 .......... .......... .......... .......... .......... .......... 789 CTGCAGAAAA TAGATTTTGG ACAAAACAAG CTTACAGGTA CTCTGCCTTC AGATTTAGGT 14034 .......... .......... .......... .......... .......... .......... 789 CGTGGAATGC CCGACCTACA AGTATTTTAT TGTGGAGGAA ATAATCTGAG TGGTTTTATC 14094 .......... .......... .......... .......... .......... .......... 789 TCTGATTCAA TCTCTAATTC TTCAAGACTC ACAATGTTAG ACCTCTCCAG CAACAGTTTC 14154 .......... .......... .......... .......... .......... .......... 789 ACAGGTCTAA TTTCAAAATC ACTTGGTAAC TTAGAATACC TTGAGGTTCT CAACTTGTGG 14214 .......... .......... .......... .......... .......... .......... 789 GGGAATAATT TTGTCAGCGA TTCAACATTG AGCTTCCTTG AATCATTGAC AAACTGTAGG 14274 .......... .......... .......... .......... .......... .......... 789 AATCTAAGAG TACTCACGCT TGGTGGTAAT CCGTTGGATG GTGTTTTGCC TGCATCTGTT 14334 .......... .......... .......... .......... .......... .......... 789 GGTAATTTCT CAAACTCCTT GCAAATTTTT GAAGCATCTA AATGTAAACT GAAGGGTGTC 14394 .......... .......... .......... .......... .......... 
.......... 789 ATTTCAAAAC AAATTACTAA TCTTACTGGA TTGACAAGGA TGAGTCTGTC GAACAATCAG 14454 .......... .......... .......... .......... .......... .......... 789 TTGATAGGTC ATATTCCAAA AACAGTGCAA GGAATGCTGA ACCTTCAAGA ACTTTACCTA 14514 .......... .......... .......... .......... .......... .......... 789 GGAAGCAACA AGTTAGAAGG AGCCATACCA GATGTTATCT GCAGTTTACA GTATCTTGGT 14574 .......... .......... .......... .......... .......... .......... 789 GCATTAGAAT TGTCAGAAAA TCAATTTTCT AGTTCCGTTC CACCATGCTT AGGGAATGTT 14634 .......... .......... .......... .......... .......... .......... 789 ACTAGTTTGA GGACACTCTA TCTAGATAAC AACAAGCTGG ATTCTAGATT ACCTGCAAGA 14694 .......... .......... .......... .......... .......... .......... 789 TTGGGGGGAC TTCAAAACAT CATAGAGTTC AATATTTCAT CCAATTATTT GAGTGGAGAA 14754 .......... .......... .......... .......... .......... .......... 789 ATTCCGCTAG AGAGCGGAAA CTTGAAGGGT GCAACACTGA TTGATCTGTC AAATAATTAT 14814 .......... .......... .......... .......... .......... .......... 789 TTTTCTGGTA AGATTCCTAG TACTCTAGGG GGCCTAGATA AATTAATTTA TCTTTCTCTA 14874 .......... .......... .......... .......... .......... .......... 789 GCACATAATA GATTAGAAGG GCCTATTCCT GAATCATTTG ACAAATTGTT GGCATTGGAA 14934 .......... .......... .......... .......... .......... .......... 789 TACTTGGATT TGTCCTATAA CAATCTTAGC GGTGAAATTC CAAAGTCATT AGAAGCTCTT 14994 .......... .......... .......... .......... .......... .......... 789 GTGTATCTCA AATACCTAAA TTTCTCTTTC AATGAACTCA GTGGAGAAAT TCCCACTGAT 15054 .......... .......... .......... .......... .......... .......... 789 GGTCCCTTTG CAAATGTAAC CAGTCAGTCT TTCTTGTCCA ATGATGCACT TTGTGGTGAC 15114 .......... .......... .......... .......... .......... .......... 789 TCCCGGTTTA ACGTAAAACC ATGCCCAACC AAATCTACAA AGAAATCAAG AAGAAAAAGA 15174 .......... .......... .......... .......... .......... .......... 789 GTGCTTACAG GTTTATATAT TCTATTAGGG ATAGGATCAC TCTTCATGTT GACTGTTGGA 15234 | || ||| | || ||| | | | | ||| .......... .......... 
.......TNG AT-CGATGAT TC-TCACATC G-NTCTAGGA 819 TTTGTCGTGT TAAGATTGAG AAACACAAAG AAGAATGCTA GTCAAAAGGA TCTGTCTCTC 15294 | ||| ||| | | || ||| | |||| |||||| | ||| | | || | ||| | TATGT-TTGT TNANGGTGCA AAAGAGAAAG AAGAATTCAG GTCCACA-GA TGCATCT-TT 876 GTAAGAGGGC ATGAAAGAAT TTCCTAT 15321 || | | | ||||| |||| | | ||| GTTGAAAGAC ATGAANGAAT TACNTAT 903 hqPGS_C06HBa0054K13.1-16+_SGN-U345542+ (13077 13797) ******************************************************************************** EST sequence 5 +strand 737 n (File: SGN-U313613+) 1 CATAAACATG CTTAATTAGT TCTTGTTCAT CCATTTTCTT GTTTATTTGA GCCAATAATT 61 AATCAAATCA CAATTTACTA TGGAGGAAAT TTCTGAAGGA ATTCGAGCAA CTTCGTTAAG 121 GATCAATGAT TGTCCATCAC CATTACCATC CGCTATTGCT TCGTCTGCTA CTCCTCAAAA 181 ACACTTAAAA TGCTTTATCA GCGTACATTT CGTGCAAAAG GTATTTAAAT TCACAAAATT 241 CATCTTAATG ATTAAATTTG AGTCTCGAAT TTTCAAGTGT CAGATGAGCA GGTTGAGTCA 301 ATTTCGACGA GTTTTAGTTA TATAGATGGT GATGTTTGAG CTCAAACCAA AATGGAATCA 361 TTCAAGATTT TGGATCCATG AACCATTGTC TGATGATATG AATTGTATAA TTACATCATT 421 TATGTTGGAT TTTATTTTGT CCTAAATTTC TTATCATAAA TAGGTTTTTC TTTTAGGAAA 481 AAAGTTTTGA ATTGACTAAT TCTTTTTCTG GTAGGAAAAT GTTCCCGTGT ATTCTTAGTG 541 AATTGGTTGA GGTTGTTTCT CTCTGTATTT TGTACTCTCA TATTAATAGT GGATTGCTCA 601 TCTCCATTGT GGACGTAGGT CGATTGACCG AACCACGTTA AATCTTTGTG TCTTTTGGTA 661 TATTTCTCGT TGTCTTTCAT ACTCGTGGTC TTTTGAGGTT TGCTTTGCTA GCTTTCCGCG 721 TTTACCCTGC TTATTTC Predicted gene structure (within gDNA segment 11616 to 18592): Exon 1 14234 14264 ( 31 n); cDNA 360 390 ( 31 n); score: 0.742 Intron 1 14265 15639 (1375 n); Pd: 0.000 (s: 0), Pa: 0.999 (s: 0) Exon 2 15640 15668 ( 29 n); cDNA 391 417 ( 27 n); score: 0.621 Intron 2 15669 16000 ( 332 n); Pd: 0.000 (s: 0), Pa: 0.716 (s: 0) Exon 3 16001 16005 ( 5 n); cDNA 418 422 ( 5 n); score: 0.600 Intron 3 16006 17329 (1324 n); Pd: 0.900 (s: 0), Pa: 0.000 (s: 0.89) Exon 4 17330 17426 ( 97 n); cDNA 423 523 ( 101 n); score: 0.794 Intron 4 17427 17562 ( 136 n); Pd: 0.000 (s: 0.69), Pa: 0.000 (s: 0.94) Exon 5 17563 17774 ( 212 
n); cDNA 524 736 ( 213 n); score: 0.929 MATCH C06HBa0054K13.1-16+ SGN-U313613+ 0.887 374 0.507 C PGS_C06HBa0054K13.1-16+_SGN-U313613+ (14234 14264,15640 15668,16001 16005,17330 17426,17563 17774) Alignment (genomic DNA sequence = upper lines): ATTCAACATT GAGCTTCCTT GAATCATTGA CAAACTGTAG GAATCTAAGA GTACTCACGC 14293 |||||| ||| | ||| | ||| ||||| | ATTCAAGATT TTGGATCCAT GAACCATTGT C......... .......... .......... 390 TTGGTGGTAA TCCGTTGGAT GGTGTTTTGC CTGCATCTGT TGGTAATTTC TCAAACTCCT 14353 .......... .......... .......... .......... .......... .......... 390 TGCAAATTTT TGAAGCATCT AAATGTAAAC TGAAGGGTGT CATTTCAAAA CAAATTACTA 14413 .......... .......... .......... .......... .......... .......... 390 ATCTTACTGG ATTGACAAGG ATGAGTCTGT CGAACAATCA GTTGATAGGT CATATTCCAA 14473 .......... .......... .......... .......... .......... .......... 390 AAACAGTGCA AGGAATGCTG AACCTTCAAG AACTTTACCT AGGAAGCAAC AAGTTAGAAG 14533 .......... .......... .......... .......... .......... .......... 390 GAGCCATACC AGATGTTATC TGCAGTTTAC AGTATCTTGG TGCATTAGAA TTGTCAGAAA 14593 .......... .......... .......... .......... .......... .......... 390 ATCAATTTTC TAGTTCCGTT CCACCATGCT TAGGGAATGT TACTAGTTTG AGGACACTCT 14653 .......... .......... .......... .......... .......... .......... 390 ATCTAGATAA CAACAAGCTG GATTCTAGAT TACCTGCAAG ATTGGGGGGA CTTCAAAACA 14713 .......... .......... .......... .......... .......... .......... 390 TCATAGAGTT CAATATTTCA TCCAATTATT TGAGTGGAGA AATTCCGCTA GAGAGCGGAA 14773 .......... .......... .......... .......... .......... .......... 390 ACTTGAAGGG TGCAACACTG ATTGATCTGT CAAATAATTA TTTTTCTGGT AAGATTCCTA 14833 .......... .......... .......... .......... .......... .......... 390 GTACTCTAGG GGGCCTAGAT AAATTAATTT ATCTTTCTCT AGCACATAAT AGATTAGAAG 14893 .......... .......... .......... .......... .......... .......... 390 GGCCTATTCC TGAATCATTT GACAAATTGT TGGCATTGGA ATACTTGGAT TTGTCCTATA 14953 .......... .......... .......... .......... .......... .......... 
390 ACAATCTTAG CGGTGAAATT CCAAAGTCAT TAGAAGCTCT TGTGTATCTC AAATACCTAA 15013 .......... .......... .......... .......... .......... .......... 390 ATTTCTCTTT CAATGAACTC AGTGGAGAAA TTCCCACTGA TGGTCCCTTT GCAAATGTAA 15073 .......... .......... .......... .......... .......... .......... 390 CCAGTCAGTC TTTCTTGTCC AATGATGCAC TTTGTGGTGA CTCCCGGTTT AACGTAAAAC 15133 .......... .......... .......... .......... .......... .......... 390 CATGCCCAAC CAAATCTACA AAGAAATCAA GAAGAAAAAG AGTGCTTACA GGTTTATATA 15193 .......... .......... .......... .......... .......... .......... 390 TTCTATTAGG GATAGGATCA CTCTTCATGT TGACTGTTGG ATTTGTCGTG TTAAGATTGA 15253 .......... .......... .......... .......... .......... .......... 390 GAAACACAAA GAAGAATGCT AGTCAAAAGG ATCTGTCTCT CGTAAGAGGG CATGAAAGAA 15313 .......... .......... .......... .......... .......... .......... 390 TTTCCTATTA TGAACTTGAA CAGGCAACTG AAGGATTCAA CGAAACCAAC TTGCTTGGTA 15373 .......... .......... .......... .......... .......... .......... 390 ATGGGAGTTT CAGCAGGGTT TATAAAGGGG TACTTAAGGA TGGTATCATT TTTGCAGCAA 15433 .......... .......... .......... .......... .......... .......... 390 AGGTATTCAA TGTGCAATTG GAGGGTGCAT TCAAAAGTTT TGACACGGAA TGTGAGATAC 15493 .......... .......... .......... .......... .......... .......... 390 TTCGCAATCT TCGCCACAGA AATCTTGCCA AAGTCATTAC CAGCTGCTCC AATCTTGATT 15553 .......... .......... .......... .......... .......... .......... 390 TCAAGGCCCT AGTGTTGGAA TACATGCCCA ACGGGACACT TGATAAATGG TTATACTCTC 15613 .......... .......... .......... .......... .......... .......... 390 ACAATTTGTT CTTGAACTTA TTGCAGAGAT TGGATGTAAT GATAGATGTT GCATCTGCAA 15673 ||| ||| ||| || | || |||| .......... .......... ......TGAT GATATG-AAT TGTATA-ATT ACATC..... 417 TGAACTATCT CCACAATGGC TATTCAACGC CTGTAGTGCA TTGTGACTTG AAACCAAGTA 15733 .......... .......... .......... .......... .......... .......... 417 ATGTCTTGTT AGATGAAGAA ATGGTTGCTC ATGTAAGTGA TTTTGGCATT GCAAAAATGT 15793 .......... .......... .......... .......... 
.......... .......... 417 TAGGTGCAGG GGAGGCTTTT GTTCAAACAA GGACAGTTGC AACCATTGGA TATATTGCTC 15853 .......... .......... .......... .......... .......... .......... 417 CAGGTATATT TTAAGTTTTC TCGTATCGCC TTTAAATACT CAAAACAATT CTTTCCCCTA 15913 .......... .......... .......... .......... .......... .......... 417 TGTATAAATT GATTTGTGGT CATTTTCGAC CATGAATACA GAGTATGGAC AAGATGGAAT 15973 .......... .......... .......... .......... .......... .......... 417 AGTATCCACG AGTTGTGATG TTTATAGTTT TGGTATCCTG ATGATGGAGA CGTTCACACG 16033 || | .......... .......... .......ATT TA........ .......... .......... 422 AACAAGACCA AGTGATGACA TATTTACTGG AGACTTGAGC ATACAAAGCT GGATTAGTGA 16093 .......... .......... .......... .......... .......... .......... 422 TTCCTTTCCG GGTGAACTTC ACAAGGTGGT GGATTCTAAT TTGGTACAGC CCGGAGATGA 16153 .......... .......... .......... .......... .......... .......... 422 ACAAATCGCT GCAAAGATGC AATGTTTGTC ATCTGTCATG GAATTAGCTT TGAAGTGCAC 16213 .......... .......... .......... .......... .......... .......... 422 TTTAGTGAGA CCTGATGCAA GAATTAGCAT GAAGGATGCT CTTTCAACAC TCAAAAAGAT 16273 .......... .......... .......... .......... .......... .......... 422 GAGGCTACAG CTTGTTAGTA GTCGGCATTA GGTGGAATCA TTACCAACCT TCTCTTGTAT 16333 .......... .......... .......... .......... .......... .......... 422 GTTATTTAGT TACCAGTTTT CTCTCTTATG TAATTCAATT TCGCACGAGT GTATTTTATC 16393 .......... .......... .......... .......... .......... .......... 422 TTAGTTGTTG GCTTGATTTA TGGAAGTTGA AAATGAGATT TAAAATGCTC AACACAAGAT 16453 .......... .......... .......... .......... .......... .......... 422 TTTCTTTTCG TATTTACACA ACTTTGAGAA TGCTTCCAAT TATATCACCA CAACTGTAAC 16513 .......... .......... .......... .......... .......... .......... 422 AGTTTTATCA AGTCCTCTCC TTCCAGCCAT CTAAGTGTAG TCTCCGGGAG TAGCTGCCTA 16573 .......... .......... .......... .......... .......... .......... 422 AATTCCTTGT CGTCAAACTT ACATAAGAAT CAAAAGCAAC TTGCCTCCAA AATTTGATGC 16633 .......... .......... .......... .......... 
.......... .......... 422 AGTTTAAACC TTTTAGTGTT CAATGAGTTT GCTGTGCGCC CATTGGGTGA GGGAAGAGCT 16693 .......... .......... .......... .......... .......... .......... 422 TTAACTCGTG CAGGTTCCTA TTGCATCTCC GCGGGCTTCT GGACCTGTTA CTCGTCTACT 16753 .......... .......... .......... .......... .......... .......... 422 AATTTGGGAT CTTAAGATTG CAAGGGAAAA GTGTGAAAAT CTACTGGCTG AAAATGCATA 16813 .......... .......... .......... .......... .......... .......... 422 CTGATTTAAC TGCATCTAAA GAAGAAGTTG GTTGATTGAA GGATCAGTTA GTACAGAAGA 16873 .......... .......... .......... .......... .......... .......... 422 AGCTTGATAG CAACGCTAGA GTGGACTGGA TCCTCCAGTT ACTTGCTTCT TCATCCTGTC 16933 .......... .......... .......... .......... .......... .......... 422 CTCCAAACCC CAATCATTCC TCTTCTTGAT CCCTGCTTTC TGGTCTAGTT TCTTATACTT 16993 .......... .......... .......... .......... .......... .......... 422 TTGGACTGAA GACATTGTTA TGTTCTTTTG AGTTTTGTGC TTAGACTGAA TATTTTGTGA 17053 .......... .......... .......... .......... .......... .......... 422 GCTTAAAATG TTGGTGTATC TCGGATCCTC TGCTTCCTTC GTTCTTGCAT GTTTTTGCAT 17113 .......... .......... .......... .......... .......... .......... 422 ATGTTTACAG ATGGCTAATA TCATTAGATT GGAGGTATCT TGTTGTGTTC TACGCGATGT 17173 .......... .......... .......... .......... .......... .......... 422 TGCTCTTCCG GAGCTCTGCG TTTTCGTTGG CTTGGTTTAC TGTTTTAACG CTTTTCTCTT 17233 .......... .......... .......... .......... .......... .......... 422 CTTTTTGATA TGTATTGCTC TTGTGTACTG TGTGTATTGA CCTTATACAG GAAATAGGAG 17293 .......... .......... .......... .......... .......... .......... 422 TAGACGATTT GTCATCATCA AAAAAGAGGG AAAATGTGTT GGGTTTTATT TT-CCCTAAA 17352 |||| || ||||||| || |||||| .......... .......... .......... 
......TGTT GGATTTTATT TTGTCCTAAA 446 TTTCTTATCA TAAATAGGTT TTCCTTTAAG G-GGAAGGTT TTG-ATTGAC TAA-TCATTT 17409 |||||||||| |||||||||| || |||| || | || ||| ||| |||||| ||| || ||| TTTCTTATCA TAAATAGGTT TTTCTTTTAG GAAAAAAGTT TTGAATTGAC TAATTCTTTT 506 TCTTGTAGGA AAAGGTTTAG GACTCTATAA ATAGAGAAAT GTTCCTTCTA ACTTAGTCAG 17469 ||| |||||| ||| ||| TCTGGTAGGA AAATGTT... .......... .......... .......... .......... 523 CATTCACAAT GTAGTCTTAA GAGCTTTGAG AGTTTTGGTT AGGGAGAGAA TTTATGGGTC 17529 .......... .......... .......... .......... .......... .......... 523 ACAAGTTGGA TACATTATCA CTTGTGTGAA CCTCCCATGT ATTCCGAGTG AATTGGTTGA 17589 ||| ||| |||| |||| |||||||||| .......... .......... .......... ...CCCGTGT ATTCTTAGTG AATTGGTTGA 550 GGTTGTTTCT CTCTGTATTT TGTACTCTCA TATTTATAGT GGATTGCTCA TCTCCTTTGT 17649 |||||||||| |||||||||| |||||||||| |||| ||||| |||||||||| ||||| |||| GGTTGTTTCT CTCTGTATTT TGTACTCTCA TATTAATAGT GGATTGCTCA TCTCCATTGT 610 GGACGTAGGT CGATTGACCG AACCACATTA AATCTTTGTG TGTTTTGGTA TATTTCTCGT 17709 |||||||||| |||||||||| |||||| ||| |||||||||| | |||||||| |||||||||| GGACGTAGGT CGATTGACCG AACCACGTTA AATCTTTGTG TCTTTTGGTA TATTTCTCGT 670 TGTC-TTCTT ACTCGTGGTC TTTTGAGGTT TGCTTTGCTA GC-TTCCGCG TTTACACCTG 17767 |||| ||| | |||||||||| |||||||||| |||||||||| || ||||||| ||||| |||| TGTCTTTCAT ACTCGTGGTC TTTTGAGGTT TGCTTTGCTA GCTTTCCGCG TTTAC-CCTG 729 CTGATTT 17774 || |||| CTTATTT 736 hqPGS_C06HBa0054K13.1-16+_SGN-U313613+ (14234 14264,15640 15668,16001 16005,17330 17426,17563 17774) ******************************************************************************** EST sequence 9 +strand 842 n (File: SGN-U345275+) 1 ATTGACAAAC GCTGGAGCTC CACCGCGGTG GCGGCCGCTC TAGAACTAGT GGATCCCCCG 61 GGCTGCAGGC TCAACTTGTG GGGGAATAAT TTTGACATCG ATTCAACATT GAGCTTCCTT 121 GAATCATTGA CAAACTGTAG GAATCTAAGA GTACTCACGC TTGGTGGTAA TCCGTTGGAT 181 GGTGTTTTGC CTGCATCTGT TGGGAATTTC TCAAACTGCT TGCAAATATG TGAAGCATCT 241 AAATGTAAAC TGAATGGTGT CATTTCAAAA CAAATTACTA ATCTTACTGG ATTGACAAGG 301 ATGAGTCTGT CGAACAATCA GTTGATAGGC 
CATATTCCAA CAACAGTGCA AGGAATGCTG 361 AACCTTCAAG AACTTTACCT ATGAAGCAAC AAGTTAGAAG GAGCCATACC AGATGTTATC 421 TGCAGTTGAC AGTATCTTGG TGCATTAGAA TTGTCAGAGA ATCAATTTTC TAGTTTCGTT 481 CCACCATGCT TAGGGAATGT TACTAGTTTG AGGACACTCT ATCTAGATAA CAACAAGCTG 541 GATTCTAGAT TACCTGCAAG ATTGGGGGGA CTTCAAAACA TCATAGAGTT CAATATTTCA 601 TCCAATTATT TGAGTGGAGA AATTCCGCTA GAGAGCGGAA ACTTGAATGG TGCAACACTG 661 ATTGATCTGT CAAATAATTA TTTTTCTGGG TAGATTCCTA GTACTCTAGG GGGCCTAGAT 721 AAATTAAATT AACTTTCTCT AGCACATAGT GGATTACAAG GGCCTATTTC TGAATCATTT 781 GACAAATTGC GGGCCTTGGA ATAACTGGGA TTTGGCCTAT TACAAATCTT AGGGGTGAAA 841 AG Predicted gene structure (within gDNA segment 12652 to 16431): Exon 1 14203 14971 ( 769 n); cDNA 70 840 ( 771 n); score: 0.956 MATCH C06HBa0054K13.1-16+ SGN-U345275+ 0.956 769 0.913 C PGS_C06HBa0054K13.1-16+_SGN-U345275+ (14203 14971) Alignment (genomic DNA sequence = upper lines): CTCAACTTGT GGGGGAATAA TTTTGTCAGC GATTCAACAT TGAGCTTCCT TGAATCATTG 14262 |||||||||| |||||||||| ||||| || | |||||||||| |||||||||| |||||||||| CTCAACTTGT GGGGGAATAA TTTTGACATC GATTCAACAT TGAGCTTCCT TGAATCATTG 129 ACAAACTGTA GGAATCTAAG AGTACTCACG CTTGGTGGTA ATCCGTTGGA TGGTGTTTTG 14322 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACAAACTGTA GGAATCTAAG AGTACTCACG CTTGGTGGTA ATCCGTTGGA TGGTGTTTTG 189 CCTGCATCTG TTGGTAATTT CTCAAACTCC TTGCAAATTT TTGAAGCATC TAAATGTAAA 14382 |||||||||| |||| ||||| |||||||| | |||||||| | ||||||||| |||||||||| CCTGCATCTG TTGGGAATTT CTCAAACTGC TTGCAAATAT GTGAAGCATC TAAATGTAAA 249 CTGAAGGGTG TCATTTCAAA ACAAATTACT AATCTTACTG GATTGACAAG GATGAGTCTG 14442 ||||| |||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTGAATGGTG TCATTTCAAA ACAAATTACT AATCTTACTG GATTGACAAG GATGAGTCTG 309 TCGAACAATC AGTTGATAGG TCATATTCCA AAAACAGTGC AAGGAATGCT GAACCTTCAA 14502 |||||||||| |||||||||| ||||||||| | |||||||| |||||||||| |||||||||| TCGAACAATC AGTTGATAGG CCATATTCCA ACAACAGTGC AAGGAATGCT GAACCTTCAA 369 GAACTTTACC TAGGAAGCAA CAAGTTAGAA GGAGCCATAC CAGATGTTAT CTGCAGTTTA 14562 
|||||||||| || ||||||| |||||||||| |||||||||| |||||||||| |||||||| | GAACTTTACC TATGAAGCAA CAAGTTAGAA GGAGCCATAC CAGATGTTAT CTGCAGTTGA 429 CAGTATCTTG GTGCATTAGA ATTGTCAGAA AATCAATTTT CTAGTTCCGT TCCACCATGC 14622 |||||||||| |||||||||| ||||||||| |||||||||| |||||| ||| |||||||||| CAGTATCTTG GTGCATTAGA ATTGTCAGAG AATCAATTTT CTAGTTTCGT TCCACCATGC 489 TTAGGGAATG TTACTAGTTT GAGGACACTC TATCTAGATA ACAACAAGCT GGATTCTAGA 14682 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTAGGGAATG TTACTAGTTT GAGGACACTC TATCTAGATA ACAACAAGCT GGATTCTAGA 549 TTACCTGCAA GATTGGGGGG ACTTCAAAAC ATCATAGAGT TCAATATTTC ATCCAATTAT 14742 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTACCTGCAA GATTGGGGGG ACTTCAAAAC ATCATAGAGT TCAATATTTC ATCCAATTAT 609 TTGAGTGGAG AAATTCCGCT AGAGAGCGGA AACTTGAAGG GTGCAACACT GATTGATCTG 14802 |||||||||| |||||||||| |||||||||| |||||||| | |||||||||| |||||||||| TTGAGTGGAG AAATTCCGCT AGAGAGCGGA AACTTGAATG GTGCAACACT GATTGATCTG 669 TCAAATAATT ATTTTTCTGG TAAGATTCCT AGTACTCTAG GGGGCCTAGA TAAATTAATT 14862 |||||||||| |||||||||| |||||||| |||||||||| |||||||||| |||||||| | TCAAATAATT ATTTTTCTGG GTAGATTCCT AGTACTCTAG GGGGCCTAGA TAAATTAAAT 729 TATCTTTCTC TAGCACATAA TAGATTAGAA GGGCCTATTC CTGAATCATT TGACAAATTG 14922 || ||||||| ||||||||| | ||||| || ||||||||| |||||||||| |||||||||| TAACTTTCTC TAGCACATAG TGGATTACAA GGGCCTATTT CTGAATCATT TGACAAATTG 789 TTGGCATTGG AAT-ACTTGG ATTTGTCCTA TAAC-AATCT TAGCGGTGAA A 14971 ||| |||| ||| ||| || ||||| |||| | || ||||| ||| |||||| | CGGGCCTTGG AATAACTGGG ATTTGGCCTA TTACAAATCT TAGGGGTGAA A 840 hqPGS_C06HBa0054K13.1-16+_SGN-U345275+ (14203 14971) ******************************************************************************** EST sequence 3 +strand 1298 n (File: SGN-U322835+) 1 CAATTCCCCT TTCAATGAAT TTCCCAAATC CCCATTCAAT TTTACTTCTT TTTGGATAAA 61 AAATGATAAG TGTTTGTGTG TGTTTGCAGC CCACAATAAT GGCGAAGATG AAGGTGGTGA 121 GGAGTGAAAT TGCTGCGAAA CAAGTGGTTG TGATCGAGGA AAATGAGGAG ATACATTGGT 181 ATGCTTCTTG AATTTCGTCC GGAGGACACA GCTACTCAAA 
TCCAAATGCA AAGGATCTTC 241 AAGAGTAAAG GAATAAAACA AGACCTTGGT TCATTGTGAA AGTTATGTGT GATCCATAGC 301 TGAAAGTTTG GCATTTATCT CGGTTTTCCC TTCCTATTTG ATTTTTTTTC AATAATCGCC 361 GTTAGGTCAA TTCTGCTATT TAAGTCAAGT TTCAACCAGC TTTTGGGAAT TCTGGAAGTC 421 ACGTGCTCAT CAAAGTAAAT TTTATATACA CTGATGCAAC GGTCTATCTC CACAATGGGT 481 GTTCAAATCC AGTGGTGCAT TGTGACTTGA AGCCAAGTGT CTTGCTTGAT CAAGACATGG 541 TTGGCCATGT CAGTGATTTT GGCATTGCAA AATTGTTAGG TGCAGGGGAG AGTTTTGTTC 601 AAACAAGGAC AATAGCAACC ATTGGATAAT TGCTCCAGAG TATGGACAAG ATGGAATCGT 661 ATCCACAAGC TGCGATGTTT ATAGTTTCGG TATCCTGATT GATGGAGACG TTTACAAGAA 721 TCAGACCAGG TGATGAAAGA TTTACTGGAG AGTTGAGCAT ACGACGTTGG GTTAGTGATT 781 CTTTTCCAGA TGAGATTCAT AAGGTGGTGG ATGCTAATTT GGTACAGCCT AGGGGATGAA 841 CGAATTGACG CAAAGATGCA GTGTCTGTTG TCTATTATAG AGTTAGCTTT GAGCTGTACT 901 TTAGCAACAC CTGATGCAAG AATTAGTATG GAAGATTCTC TTTCAACACT TCAAAATATC 961 AGGCTCCAGT TTGTCAATAG TCGCCACCGA AAAAAGCAAC TGAAGGATTT AGTACCGAAA 1021 AAAGCAACTT GCTTGGTAAT GGCAAGGTCT ACAATGTACA ATTGGAGGGT GCATTCAAAA 1081 GTTTTGATAC AGAAGAATGT GAAATCTGAC CAAAGTCATC AAAGCCTTAA TGTTAGAATA 1141 CATGTCTAGT GGGACACTTG ATAAATGGCT GTACTCTCAC AAGTTGTTCT TGGATTTACT 1201 TCATATTATG TACTCTTTCA CTTTCAGTGC CAGCTGGAAT GGTGATTTTC TAGCTACTGG 1261 AGTTCACAAT TCTAATCCAT AAAAAAAAAA AAAAAAAA Predicted gene structure (within gDNA segment 9659 to 23308): Exon 1 15647 15856 ( 210 n); cDNA 439 638 ( 200 n); score: 0.829 Intron 1 15857 15954 ( 98 n); Pd: 0.999 (s: 0.90), Pa: 0.805 (s: 0.90) Exon 2 15955 16304 ( 350 n); cDNA 639 990 ( 352 n); score: 0.811 Intron 2 16305 16803 ( 499 n); Pd: 0.694 (s: 0.74), Pa: 0.000 (s: 0.57) Exon 3 16804 16850 ( 47 n); cDNA 991 1038 ( 48 n); score: 0.574 Intron 3 16851 17123 ( 273 n); Pd: 0.000 (s: 0.57), Pa: 0.778 (s: 0) Exon 4 17124 17130 ( 7 n); cDNA 1039 1045 ( 7 n); score: 0.857 Intron 4 17131 17283 ( 153 n); Pd: 0.900 (s: 0), Pa: 1.000 (s: 0) Exon 5 17284 17301 ( 18 n); cDNA 1046 1062 ( 17 n); score: 0.500 Intron 5 17302 17628 ( 327 n); Pd: 0.000 (s: 0), Pa: 0.915 (s: 0) Exon 6 
17629 17638 ( 10 n); cDNA 1063 1072 ( 10 n); score: 0.600 Intron 6 17639 18159 ( 521 n); Pd: 0.900 (s: 0), Pa: 0.000 (s: 0) Exon 7 18160 18194 ( 35 n); cDNA 1073 1105 ( 33 n); score: 0.686 Intron 7 18195 19533 (1339 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.66) Exon 8 19534 19632 ( 99 n); cDNA 1106 1203 ( 98 n); score: 0.737 PPA cDNA 1279 1298 MATCH C06HBa0054K13.1-16+ SGN-U322835+ 0.806 776 0.598 C PGS_C06HBa0054K13.1-16+_SGN-U322835+ (15647 15856,15955 16304,16804 16850,17124 17130,17284 17301,17629 17638,18160 18194,19534 19632) Alignment (genomic DNA sequence = upper lines): ATGTAATGAT AGATGTTGCA TCTGCAATGA ACTATCTCCA CAATGGCTAT TCAACGCCTG 15706 || | || || | | || | ||||| | ||||||||| |||||| | | |||| || | ATTTTAT-AT ACA--CTG-A --TGCAACGG TCTATCTCCA CAATGGGTGT TCAAATCCAG 492 TAGTGCATTG TGACTTGAAA CCAAGTAATG TCTTGTTAGA TGAAGAAATG GTTGCTCATG 15766 | |||||||| ||||||||| ||||| || ||||| | || | |||| ||| |||| |||| TGGTGCATTG TGACTTGAAG CCAAG---TG TCTTGCTTGA TCAAGACATG GTTGGCCATG 549 TAAGTGATTT TGGCATTGCA AAAATGTTAG GTGCAGGGGA GGCTTTTGTT CAAACAAGGA 15826 | |||||||| |||||||||| ||| |||||| |||||||||| | ||||||| |||||||||| TCAGTGATTT TGGCATTGCA AAATTGTTAG GTGCAGGGGA GAGTTTTGTT CAAACAAGGA 609 CAGTTGCAAC CATTGGATAT ATTGCTCCAG GTATATTTTA AGTTTTCTCG TATCGCCTTT 15886 || | ||||| ||||||||| |||||||||| CAATAGCAAC CATTGGATA- ATTGCTCCAG .......... .......... .......... 638 AAATACTCAA AACAATTCTT TCCCCTATGT ATAAATTGAT TTGTGGTCAT TTTCGACCAT 15946 .......... .......... .......... .......... .......... .......... 
638 GAATACAGAG TATGGACAAG ATGGAATAGT ATCCACGAGT TGTGATGTTT ATAGTTTTGG 16006 || |||||||||| ||||||| || |||||| || || ||||||| ||||||| || ........AG TATGGACAAG ATGGAATCGT ATCCACAAGC TGCGATGTTT ATAGTTTCGG 690 TATCCTGA-T GATGGAGACG TTCACACGAA CAAGACCAAG TGATGACATA TTTACTGGAG 16065 |||||||| | |||||||||| || ||| ||| |||||| | |||||| | | |||||||||| TATCCTGATT GATGGAGACG TTTACAAGAA TCAGACCAGG TGATGAAAGA TTTACTGGAG 750 ACTTGAGCAT ACAAAGCTGG ATTAGTGATT CCTTTCCGGG TGAACTTCAC AAGGTGGTGG 16125 | |||||||| || | | ||| ||||||||| | ||||| | ||| |||| |||||||||| AGTTGAGCAT ACGACGTTGG GTTAGTGATT CTTTTCCAGA TGAGATTCAT AAGGTGGTGG 810 ATTCTAATTT GGTACAGCC- CGGAGATGAA CAAATCGCTG CAAAGATGCA ATGTTTGTCA 16184 || ||||||| ||||||||| || |||||| | ||| | | |||||||||| ||| ||| ATGCTAATTT GGTACAGCCT AGGGGATGAA CGAATTGACG CAAAGATGCA GTGTCTGTTG 870 TCTGTCATGG AATTAGCTTT GAAGTGCACT TTAGTGAGAC CTGATGCAAG AATTAGCATG 16244 ||| | || | | |||||||| || || ||| |||| | || |||||||||| |||||| ||| TCTATTATAG AGTTAGCTTT GAGCTGTACT TTAGCAACAC CTGATGCAAG AATTAGTATG 930 AAGGATGCTC TTTCAACACT CAAAAAGATG AGGCTACAGC TTGTTAGTAG TCGGCATTAG 16304 | ||| ||| |||||||||| |||| || ||||| ||| |||| | ||| ||| || GAAGATTCTC TTTCAACACT TCAAAATATC AGGCTCCAGT TTGTCAATAG TCGCCACCGA 990 GTGGAATCAT TACCAACCTT CTCTTGTATG TTATTTAGTT ACCAGTTTTC TCTCTTATGT 16364 .......... .......... .......... .......... .......... .......... 990 AATTCAATTT CGCACGAGTG TATTTTATCT TAGTTGTTGG CTTGATTTAT GGAAGTTGAA 16424 .......... .......... .......... .......... .......... .......... 990 AATGAGATTT AAAATGCTCA ACACAAGATT TTCTTTTCGT ATTTACACAA CTTTGAGAAT 16484 .......... .......... .......... .......... .......... .......... 990 GCTTCCAATT ATATCACCAC AACTGTAACA GTTTTATCAA GTCCTCTCCT TCCAGCCATC 16544 .......... .......... .......... .......... .......... .......... 990 TAAGTGTAGT CTCCGGGAGT AGCTGCCTAA ATTCCTTGTC GTCAAACTTA CATAAGAATC 16604 .......... .......... .......... .......... .......... .......... 
990 AAAAGCAACT TGCCTCCAAA ATTTGATGCA GTTTAAACCT TTTAGTGTTC AATGAGTTTG 16664 .......... .......... .......... .......... .......... .......... 990 CTGTGCGCCC ATTGGGTGAG GGAAGAGCTT TAACTCGTGC AGGTTCCTAT TGCATCTCCG 16724 .......... .......... .......... .......... .......... .......... 990 CGGGCTTCTG GACCTGTTAC TCGTCTACTA ATTTGGGATC TTAAGATTGC AAGGGAAAAG 16784 .......... .......... .......... .......... .......... .......... 990 TGTGAAAATC TACTGGCTGA AAATGCATAC TGATTTAACT -GCATCTAAA GAAG-AAGTT 16842 | ||| ||| || ||| | | | | | ||| ||| || || .......... .........A AAAAGCA-AC TGAAGGATTT AGTACCGAAA AAAGCAACTT 1030 GGTTGATTGA AGGATCAGTT AGTACAGAAG AAGCTTGATA GCAACGCTAG AGTGGACTGG 16902 | ||| | GCTTGGTA.. .......... .......... .......... .......... .......... 1038 ATCCTCCAGT TACTTGCTTC TTCATCCTGT CCTCCAAACC CCAATCATTC CTCTTCTTGA 16962 .......... .......... .......... .......... .......... .......... 1038 TCCCTGCTTT CTGGTCTAGT TTCTTATACT TTTGGACTGA AGACATTGTT ATGTTCTTTT 17022 .......... .......... .......... .......... .......... .......... 1038 GAGTTTTGTG CTTAGACTGA ATATTTTGTG AGCTTAAAAT GTTGGTGTAT CTCGGATCCT 17082 .......... .......... .......... .......... .......... .......... 1038 CTGCTTCCTT CGTTCTTGCA TGTTTTTGCA TATGTTTACA GATGGCTAAT ATCATTAGAT 17142 ||||| | .......... .......... .......... .......... .ATGGCAA.. .......... 1045 TGGAGGTATC TTGTTGTGTT CTACGCGATG TTGCTCTTCC GGAGCTCTGC GTTTTCGTTG 17202 .......... .......... .......... .......... .......... .......... 1045 GCTTGGTTTA CTGTTTTAAC GCTTTTCTCT TCTTTTTGAT ATGTATTGCT CTTGTGTACT 17262 .......... .......... .......... .......... .......... .......... 1045 GTGTGTATTG ACCTTATACA GGAAATAGGA GTAGACGATT TGTCATCATC AAAAAAGAGG 17322 | || | | || || .......... .......... .GGTCTA-CA ATGTACAAT. .......... .......... 1062 GAAAATGTGT TGGGTTTTAT TTTCCCTAAA TTTCTTATCA TAAATAGGTT TTCCTTTAAG 17382 .......... .......... .......... .......... .......... .......... 
1062 GGGAAGGTTT TGATTGACTA ATCATTTTCT TGTAGGAAAA GGTTTAGGAC TCTATAAATA 17442 .......... .......... .......... .......... .......... .......... 1062 GAGAAATGTT CCTTCTAACT TAGTCAGCAT TCACAATGTA GTCTTAAGAG CTTTGAGAGT 17502 .......... .......... .......... .......... .......... .......... 1062 TTTGGTTAGG GAGAGAATTT ATGGGTCACA AGTTGGATAC ATTATCACTT GTGTGAACCT 17562 .......... .......... .......... .......... .......... .......... 1062 CCCATGTATT CCGAGTGAAT TGGTTGAGGT TGTTTCTCTC TGTATTTTGT ACTCTCATAT 17622 .......... .......... .......... .......... .......... .......... 1062 TTATAGTGGA TTGCTCATCT CCTTTGTGGA CGTAGGTCGA TTGACCGAAC CACATTAAAT 17682 |||| | | ......TGGA GGGTGC.... .......... .......... .......... .......... 1072 CTTTGTGTGT TTTGGTATAT TTCTCGTTGT CTTCTTACTC GTGGTCTTTT GAGGTTTGCT 17742 .......... .......... .......... .......... .......... .......... 1072 TTGCTAGCTT CCGCGTTTAC ACCTGCTGAT TTTCGGTCCT AACAAAATGA TGGTTTACGA 17802 .......... .......... .......... .......... .......... .......... 1072 AGTTGACGAT AGACATGCAA TCAGATTTCA AAAAGAGCTG GGAGAAAAGA TTTCAAAAAC 17862 .......... .......... .......... .......... .......... .......... 1072 GAGAGAAAGA AAAGAGAGCT AACACACAAA GGCCAATCAA TTTGAGAGAT CATAGAAATA 17922 .......... .......... .......... .......... .......... .......... 1072 TAAGTTAAGA AGTTCTATAG TTAGAATTTC TTTTGTCATG AGTAGAGTGT TTTGATTCGT 17982 .......... .......... .......... .......... .......... .......... 1072 ACTTAGAGTC ATTATCGTAG GAGGAGTGGA TTTGGCTTCT TGTAGAGTTG AGTCTTTGAG 18042 .......... .......... .......... .......... .......... .......... 1072 AGGTTTGTGA CAAGGAGTGG ATTTGACTTC TTGGAGAGTA GAGATAGTCG ATTATAGTTA 18102 .......... .......... .......... .......... .......... .......... 1072 ATCGAAAGTT TTTCTGTGAG TTCATTGATG AGTTAATAAC TTTGATAGCC ACTCAACTAT 18162 | .......... .......... .......... .......... .......... 
.......ATT 1075 CAATTGTTTT CACAGATATA GACATCTGAA ATGCTCTGTT GTTTGTGACT TTATAAACTT 18222 ||| ||||| | | | | | || || |||| || CAAAAGTTTT GATACAGA-A GA-ATGTGAA AT........ .......... .......... 1105 GTTGATTTTA GTATATATCC TCAAAATGTA AGTTGTACTC CTACATTCCA AGTCTGCATA 18282 .......... .......... .......... .......... .......... .......... 1105 ATATGGCCAT TTTCAATGCA TGACTGTGCT ACATGTGTGT GACTTTGTTT ATCAATTCTC 18342 .......... .......... .......... .......... .......... .......... 1105 TGCTGGAACG CATATGACAA CAATCTATGT TCCGAGATTA TCAACAACTT GAATATCCCT 18402 .......... .......... .......... .......... .......... .......... 1105 GCTAACCAGA CTGCAACTAG TTGTTGCCTG CAATCACATA TCTGCATATT GTTGCACAGG 18462 .......... .......... .......... .......... .......... .......... 1105 ACTGTGATTG TCTTTATTGG TTGCTAAATG TACGTTCCCT CATTCACTTA GCTGCATAGA 18522 .......... .......... .......... .......... .......... .......... 1105 GTGCTGACTA TTAGAGTTTA GGGCCTGATG ACATGATTAT CTCCAACCTA AAATTCGTAA 18582 .......... .......... .......... .......... .......... .......... 1105 ACACAAACCA AGCTAACCAA AACTAGTTTA GTTCATAATG ACAAGGAATT GTACAGAAAT 18642 .......... .......... .......... .......... .......... .......... 1105 AAAGGAGGAA TTTTAATCTT TCTGAAATGA TCATAATGCA CAACTACCTG AACTTTCTGT 18702 .......... .......... .......... .......... .......... .......... 1105 TCCTAATAAA CAACAACTTA AATAATATTT AAGCTGAAAG ACAAGAAAAA CGGAAAGTCC 18762 .......... .......... .......... .......... .......... .......... 1105 AGAAGTTGTA TTTAACATAG AGGAAATCAA GGAGGAACAT TATTCCTTGT CAACTTTTTC 18822 .......... .......... .......... .......... .......... .......... 1105 CTTCTGAAAC TTCGCATTCC CAACCTCTCC AAATTACAAC TTCCTCTCGC TGTTTTTAGT 18882 .......... .......... .......... .......... .......... .......... 1105 AATTGAGTGG GAGTGTTAAA ATGTTGAATA ATATCATGCT TGTGACTATA TTTTGTCCAT 18942 .......... .......... .......... .......... .......... .......... 1105 AAATATAGGT AAAAAAAGTT TTCTTCCTAT TATTTCTTTT TACTTAGGAT CTTTTCTTTT 19002 .......... 
.......... .......... .......... .......... .......... 1105 TCCTTTTATA TAATATTTTT TTTTATCTAA TTAGAAGAAT AATTAACTGA ATAGAAGTAT 19062 .......... .......... .......... .......... .......... .......... 1105 GGAAACTATA GATAACCTAC CGTATGTACT GGTATCACCC CATTCTTAAT CTTATTATGA 19122 .......... .......... .......... .......... .......... .......... 1105 AATTAAAATT AGTTTGTCAA AATGGTTTGA TTATCTCTTG GTAATTTATT AATCACGGAC 19182 .......... .......... .......... .......... .......... .......... 1105 CAATATATAC CTTTAACAAA AGATTACAAT AAAAAATATT ATATAAAAGA AAAATAAAAA 19242 .......... .......... .......... .......... .......... .......... 1105 AAAGATCCTA AGTAAAAAAA ATAATAGGAA GAAAACTTTT TTACATAAAT AATATGGACA 19302 .......... .......... .......... .......... .......... .......... 1105 AAATAATATG TTGCTAAAAT TGTGAGATTA TAATAATTTT GATACTTGAT ATATAATATA 19362 .......... .......... .......... .......... .......... .......... 1105 TTTATTAACT AAATCATGCT TAGAGGTTTA GTTTGAAAAA TTAAATCTTA ATCATTTATT 19422 .......... .......... .......... .......... .......... .......... 1105 TTAACCTATG ACATGTGTCA TTAATCCAAA TAGAAACTTG GGGAAAATGG AGAGAAGCCG 19482 .......... .......... .......... .......... .......... .......... 1105 AATCCCTCCG GAACTTCGCC ACAAAAATCT GACCAAAGTC ATAACAAGCT GCTGCAACCT 19542 ||| | | .......... .......... .......... .......... .......... 
.CTG-ACCAA 1113 TGATTTCAAG GTCCTGGTGT TGGAATACAT GCCCAATGGG ACACTTGATA AATGGTTATA 19602 | |||| | | | ||| | |||||||| | | | |||| |||||||||| ||||| | || AGTCATCAAA GCCTTAATGT TAGAATACAT GTCTAGTGGG ACACTTGATA AATGGCTGTA 1173 TTCTCACAAC TTGTTCTTAA ACTTATTGCA 19632 |||||||| |||||||| | ||| | || CTCTCACAAG TTGTTCTTGG ATTTACTTCA 1203 hqPGS_C06HBa0054K13.1-16+_SGN-U322835+ (15647 15856,15955 16304,16804 16850,17124 17130,17284 17301,17629 17638,18160 18194,19534 19632) ******************************************************************************** EST sequence 1 +strand 1190 n (File: SGN-U322569+) 1 CTTTTGGGGT AATTTTTTAT TCTTTTATTT TTCGGTAACA AGTTAACCAG AGTTCCGCCG 61 GCATCCTCTT AGTTTTTTTT CCGGGGAAAC CAGCTTCCAA CTCCACCGCT ATAGCAGCAA 121 ATGACCAGAA CACCTTCCCC TTTTCCATTG TTTCTAGCAA TACCCTCGTC CCCCTTTTCA 181 TCGTTGCAAC CAGATCAGCG AACCAGAACC GAAATCAACT GTTCCGATGA AAAACCAAGT 241 CTTCACGAGT CCTACAAAAA AAACCTATAA TCGACTTCCA CAACTAAAAA CCCGCTGGAA 301 ACCACCACCT CACGCCAGAC AGACTACAAC CAGACCAAAC CCCTCTTCAT TTTTCTCCCT 361 TTTCCGGTGA AATAGTCACT GAATATAACA CCAAAATCGC TGTCAGATCC ATTCCTCTTC 421 CCCTTTCACG ATTCCTCTTG TATTTTTGCA GGTCATGACT GAAACCCATG AGCTCGGAAC 481 CATGAAACAA CCAGCGCCAA CAATGGCAGA AATCAATCAG ATCTCACAAC CAGTAGCTCC 541 AAAGTCGGAG CAAACACGAA TACCGATCCA AACCAGCAAC CTAAAGTTCA AAATTCCAGG 601 AAATTTCAAT GTTTTCATCC AATATATGGT AGAAATTTCT AACAATTTTA TAACTTTTCG 661 ATGATTTGGT TCACTGCTAG TGTTTGAACT TTGTATTTGA TTGAAGATTT TGTGTTGTTT 721 ACAAATATCC TCTTTTAAGT TTATCTATTT TACGGAAAAA GTTTGATATA TTTCTCAATT 781 TTGTCATTTG CAGCACTGTC GTTACAAAAT GGCTAATATA TATTAAAAAA TTAATTTTAA 841 ATTTATATTT ATTACTTTTG ATTGTTTTTA AAAAAATTAT TTAAAGGTAT ATATTATTCT 901 TCTATCAAAG TTCGAGGTTA TATTTAATTT TTTTTATACA TAAATTATTG TTTGACTTCT 961 TTTATTATAA TTATTTGAGT TTGTTATTCT AATTTTTTTT CTTTCATTCC TTAGTTTAAA 1021 GAGAAAAAAA ACTAAACTAT TTTTTTTGTG TGTATTGTAA TTTAATTTGG TATTCAAAGA 1081 AAAATTTTGG TCATCTACAA TAAGTTTTAC AAGAATATTA GTGAAATATA AATAAATTTG 1141 ATTATCAAAA TAATAATTAT AAATTAGTCA TTAACCAAAA AAAAAGAAAA Predicted gene 
structure (within gDNA segment 11265 to 22476): Exon 1 13017 13034 ( 18 n); cDNA 635 652 ( 18 n); score: 0.889 Intron 1 13035 13119 ( 85 n); Pd: 0.063 (s: 0), Pa: 0.000 (s: 0) Exon 2 13120 13128 ( 9 n); cDNA 653 661 ( 9 n); score: 0.778 Intron 2 13129 15639 (2511 n); Pd: 0.932 (s: 0), Pa: 0.999 (s: 0) Exon 3 15640 15663 ( 24 n); cDNA 662 684 ( 23 n); score: 0.667 Intron 3 15664 17021 (1358 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0) Exon 4 17022 17052 ( 31 n); cDNA 685 714 ( 30 n); score: 0.774 Intron 4 17053 17123 ( 71 n); Pd: 0.000 (s: 0), Pa: 0.778 (s: 0) Exon 5 17124 17130 ( 7 n); cDNA 715 721 ( 7 n); score: 0.571 Intron 5 17131 17283 ( 153 n); Pd: 0.900 (s: 0), Pa: 1.000 (s: 0) Exon 6 17284 17290 ( 7 n); cDNA 722 728 ( 7 n); score: 0.714 Intron 6 17291 20601 (3311 n); Pd: 0.000 (s: 0), Pa: 0.501 (s: 0) Exon 7 20602 20607 ( 6 n); cDNA 729 734 ( 6 n); score: 0.500 Intron 7 20608 21158 ( 551 n); Pd: 0.975 (s: 0), Pa: 0.000 (s: 0.72) Exon 8 21159 21254 ( 96 n); cDNA 735 821 ( 87 n); score: 0.719 Intron 8 21255 21333 ( 79 n); Pd: 0.000 (s: 0.74), Pa: 0.000 (s: 0.90) Exon 9 21334 21700 ( 367 n); cDNA 822 1185 ( 364 n); score: 0.884 MATCH C06HBa0054K13.1-16+ SGN-U322569+ 0.850 565 0.475 C PGS_C06HBa0054K13.1-16+_SGN-U322569+ (13017 13034,13120 13128,15640 15663,17022 17052,17124 17130,17284 17290,20602 20607,21159 21254,21334 21700) Alignment (genomic DNA sequence = upper lines): ATTTCTAACA ATATCATAGC AACAAACTGG TCTTCTTCCG TCTCCGTTTG CAGCTGGATT 13076 |||||||||| || | ||| ATTTCTAACA ATTTTATA.. .......... .......... .......... .......... 652 GGAATCACTT GCAGCTCCCG TCACCATCGA GTCACTGCTT TAGACATTTC AAGCATGCAA 13136 || |||| | .......... .......... .......... .......... ...ACTTTTC GA........ 661 CTTTATGGTA CCATTCCTCC ACACCTTGGA AACCTCTCAT TTATTTCATC GCTTGACATC 13196 .......... .......... .......... .......... .......... .......... 661 AGTAACAACA CTTTCCATGG AGAGTTGCCA CTAGAGTTGG TTCGTTTGCA GAGGTTGAAA 13256 .......... .......... .......... .......... .......... .......... 
661 TTCTTTAATA CTAAAAACAA TAACTTCACC GGAGCCATTC CATCATTTTT AAGTTTGTTA 13316 .......... .......... .......... .......... .......... .......... 661 CCAAACCTAC GCTTTCTGTA CCTATCGAAT AACCAATTTT CGGGTAAAAT TCCATCCTCC 13376 .......... .......... .......... .......... .......... .......... 661 CTTTCCAATC TGACAAAACT GCAAGTGTTG TCAATACAGA GTAATTATAT TGAAGGAGAG 13436 .......... .......... .......... .......... .......... .......... 661 ATCCCTCAAG AACTCGGTGA TCTTCGTTCC TTGATTATCC TAAACCTGCA ATATAATCAG 13496 .......... .......... .......... .......... .......... .......... 661 CTTAGTGGCT CTATACCATC TTCAATCTTT GACATCACTA CAATGCAAGT AATTGCTCTT 13556 .......... .......... .......... .......... .......... .......... 661 AGTGGCAACA ATCTTACTGG AAAGATTCCA ATCACGATAT GTGATCATCT TCCAGACTTG 13616 .......... .......... .......... .......... .......... .......... 661 GAAGGACTTT ACCTCGGCAG AAACTCCCTT GATGGAGTTA TTCCACCAAA CCTGGAGAAA 13676 .......... .......... .......... .......... .......... .......... 661 TGCAGAAAGC TTCAAATATT GGAATTGACT GAAAATGAGA TTGCTGGAAC TGTACCAAGA 13736 .......... .......... .......... .......... .......... .......... 661 GAGTTAGCCA ACTTAACAAC TCTTACAGGA CTATATCTTA TGGATCTGCA TTTGGAAGGT 13796 .......... .......... .......... .......... .......... .......... 661 AGTATGAAAT TTTCTTCCTT TTTCTCTTTT CTGCCAAAAC TTGTGCTCAT ATATATTCCC 13856 .......... .......... .......... .......... .......... .......... 661 AAAATTGACA GGAGAGATAC CAATGGCGCT CGCTAATCTT AAGAAACTTC AAACATTAGT 13916 .......... .......... .......... .......... .......... .......... 661 ATTATCACTG AATGAGCTAA CTGGCTCTAT CCCTGACAGC ATTTTCAACA TGTCAACACT 13976 .......... .......... .......... .......... .......... .......... 661 GCAGAAAATA GATTTTGGAC AAAACAAGCT TACAGGTACT CTGCCTTCAG ATTTAGGTCG 14036 .......... .......... .......... .......... .......... .......... 661 TGGAATGCCC GACCTACAAG TATTTTATTG TGGAGGAAAT AATCTGAGTG GTTTTATCTC 14096 .......... .......... .......... .......... .......... .......... 
661 TGATTCAATC TCTAATTCTT CAAGACTCAC AATGTTAGAC CTCTCCAGCA ACAGTTTCAC 14156 .......... .......... .......... .......... .......... .......... 661 AGGTCTAATT TCAAAATCAC TTGGTAACTT AGAATACCTT GAGGTTCTCA ACTTGTGGGG 14216 .......... .......... .......... .......... .......... .......... 661 GAATAATTTT GTCAGCGATT CAACATTGAG CTTCCTTGAA TCATTGACAA ACTGTAGGAA 14276 .......... .......... .......... .......... .......... .......... 661 TCTAAGAGTA CTCACGCTTG GTGGTAATCC GTTGGATGGT GTTTTGCCTG CATCTGTTGG 14336 .......... .......... .......... .......... .......... .......... 661 TAATTTCTCA AACTCCTTGC AAATTTTTGA AGCATCTAAA TGTAAACTGA AGGGTGTCAT 14396 .......... .......... .......... .......... .......... .......... 661 TTCAAAACAA ATTACTAATC TTACTGGATT GACAAGGATG AGTCTGTCGA ACAATCAGTT 14456 .......... .......... .......... .......... .......... .......... 661 GATAGGTCAT ATTCCAAAAA CAGTGCAAGG AATGCTGAAC CTTCAAGAAC TTTACCTAGG 14516 .......... .......... .......... .......... .......... .......... 661 AAGCAACAAG TTAGAAGGAG CCATACCAGA TGTTATCTGC AGTTTACAGT ATCTTGGTGC 14576 .......... .......... .......... .......... .......... .......... 661 ATTAGAATTG TCAGAAAATC AATTTTCTAG TTCCGTTCCA CCATGCTTAG GGAATGTTAC 14636 .......... .......... .......... .......... .......... .......... 661 TAGTTTGAGG ACACTCTATC TAGATAACAA CAAGCTGGAT TCTAGATTAC CTGCAAGATT 14696 .......... .......... .......... .......... .......... .......... 661 GGGGGGACTT CAAAACATCA TAGAGTTCAA TATTTCATCC AATTATTTGA GTGGAGAAAT 14756 .......... .......... .......... .......... .......... .......... 661 TCCGCTAGAG AGCGGAAACT TGAAGGGTGC AACACTGATT GATCTGTCAA ATAATTATTT 14816 .......... .......... .......... .......... .......... .......... 661 TTCTGGTAAG ATTCCTAGTA CTCTAGGGGG CCTAGATAAA TTAATTTATC TTTCTCTAGC 14876 .......... .......... .......... .......... .......... .......... 661 ACATAATAGA TTAGAAGGGC CTATTCCTGA ATCATTTGAC AAATTGTTGG CATTGGAATA 14936 .......... .......... .......... .......... .......... .......... 
661 CTTGGATTTG TCCTATAACA ATCTTAGCGG TGAAATTCCA AAGTCATTAG AAGCTCTTGT 14996 .......... .......... .......... .......... .......... .......... 661 GTATCTCAAA TACCTAAATT TCTCTTTCAA TGAACTCAGT GGAGAAATTC CCACTGATGG 15056 .......... .......... .......... .......... .......... .......... 661 TCCCTTTGCA AATGTAACCA GTCAGTCTTT CTTGTCCAAT GATGCACTTT GTGGTGACTC 15116 .......... .......... .......... .......... .......... .......... 661 CCGGTTTAAC GTAAAACCAT GCCCAACCAA ATCTACAAAG AAATCAAGAA GAAAAAGAGT 15176 .......... .......... .......... .......... .......... .......... 661 GCTTACAGGT TTATATATTC TATTAGGGAT AGGATCACTC TTCATGTTGA CTGTTGGATT 15236 .......... .......... .......... .......... .......... .......... 661 TGTCGTGTTA AGATTGAGAA ACACAAAGAA GAATGCTAGT CAAAAGGATC TGTCTCTCGT 15296 .......... .......... .......... .......... .......... .......... 661 AAGAGGGCAT GAAAGAATTT CCTATTATGA ACTTGAACAG GCAACTGAAG GATTCAACGA 15356 .......... .......... .......... .......... .......... .......... 661 AACCAACTTG CTTGGTAATG GGAGTTTCAG CAGGGTTTAT AAAGGGGTAC TTAAGGATGG 15416 .......... .......... .......... .......... .......... .......... 661 TATCATTTTT GCAGCAAAGG TATTCAATGT GCAATTGGAG GGTGCATTCA AAAGTTTTGA 15476 .......... .......... .......... .......... .......... .......... 661 CACGGAATGT GAGATACTTC GCAATCTTCG CCACAGAAAT CTTGCCAAAG TCATTACCAG 15536 .......... .......... .......... .......... .......... .......... 661 CTGCTCCAAT CTTGATTTCA AGGCCCTAGT GTTGGAATAC ATGCCCAACG GGACACTTGA 15596 .......... .......... .......... .......... .......... .......... 661 TAAATGGTTA TACTCTCACA ATTTGTTCTT GAACTTATTG CAGAGATTGG ATGTAATGAT 15656 |||| | | | || | .......... .......... .......... .......... ...TGATTTG GTTCACTGCT 678 AGATGTTGCA TCTGCAATGA ACTATCTCCA CAATGGCTAT TCAACGCCTG TAGTGCATTG 15716 || |||| AG-TGTT... .......... .......... .......... .......... .......... 684 TGACTTGAAA CCAAGTAATG TCTTGTTAGA TGAAGAAATG GTTGCTCATG TAAGTGATTT 15776 .......... .......... .......... .......... 
.......... .......... 684 TGGCATTGCA AAAATGTTAG GTGCAGGGGA GGCTTTTGTT CAAACAAGGA CAGTTGCAAC 15836 .......... .......... .......... .......... .......... .......... 684 CATTGGATAT ATTGCTCCAG GTATATTTTA AGTTTTCTCG TATCGCCTTT AAATACTCAA 15896 .......... .......... .......... .......... .......... .......... 684 AACAATTCTT TCCCCTATGT ATAAATTGAT TTGTGGTCAT TTTCGACCAT GAATACAGAG 15956 .......... .......... .......... .......... .......... .......... 684 TATGGACAAG ATGGAATAGT ATCCACGAGT TGTGATGTTT ATAGTTTTGG TATCCTGATG 16016 .......... .......... .......... .......... .......... .......... 684 ATGGAGACGT TCACACGAAC AAGACCAAGT GATGACATAT TTACTGGAGA CTTGAGCATA 16076 .......... .......... .......... .......... .......... .......... 684 CAAAGCTGGA TTAGTGATTC CTTTCCGGGT GAACTTCACA AGGTGGTGGA TTCTAATTTG 16136 .......... .......... .......... .......... .......... .......... 684 GTACAGCCCG GAGATGAACA AATCGCTGCA AAGATGCAAT GTTTGTCATC TGTCATGGAA 16196 .......... .......... .......... .......... .......... .......... 684 TTAGCTTTGA AGTGCACTTT AGTGAGACCT GATGCAAGAA TTAGCATGAA GGATGCTCTT 16256 .......... .......... .......... .......... .......... .......... 684 TCAACACTCA AAAAGATGAG GCTACAGCTT GTTAGTAGTC GGCATTAGGT GGAATCATTA 16316 .......... .......... .......... .......... .......... .......... 684 CCAACCTTCT CTTGTATGTT ATTTAGTTAC CAGTTTTCTC TCTTATGTAA TTCAATTTCG 16376 .......... .......... .......... .......... .......... .......... 684 CACGAGTGTA TTTTATCTTA GTTGTTGGCT TGATTTATGG AAGTTGAAAA TGAGATTTAA 16436 .......... .......... .......... .......... .......... .......... 684 AATGCTCAAC ACAAGATTTT CTTTTCGTAT TTACACAACT TTGAGAATGC TTCCAATTAT 16496 .......... .......... .......... .......... .......... .......... 684 ATCACCACAA CTGTAACAGT TTTATCAAGT CCTCTCCTTC CAGCCATCTA AGTGTAGTCT 16556 .......... .......... .......... .......... .......... .......... 684 CCGGGAGTAG CTGCCTAAAT TCCTTGTCGT CAAACTTACA TAAGAATCAA AAGCAACTTG 16616 .......... .......... .......... .......... .......... 
.......... 684 CCTCCAAAAT TTGATGCAGT TTAAACCTTT TAGTGTTCAA TGAGTTTGCT GTGCGCCCAT 16676 .......... .......... .......... .......... .......... .......... 684 TGGGTGAGGG AAGAGCTTTA ACTCGTGCAG GTTCCTATTG CATCTCCGCG GGCTTCTGGA 16736 .......... .......... .......... .......... .......... .......... 684 CCTGTTACTC GTCTACTAAT TTGGGATCTT AAGATTGCAA GGGAAAAGTG TGAAAATCTA 16796 .......... .......... .......... .......... .......... .......... 684 CTGGCTGAAA ATGCATACTG ATTTAACTGC ATCTAAAGAA GAAGTTGGTT GATTGAAGGA 16856 .......... .......... .......... .......... .......... .......... 684 TCAGTTAGTA CAGAAGAAGC TTGATAGCAA CGCTAGAGTG GACTGGATCC TCCAGTTACT 16916 .......... .......... .......... .......... .......... .......... 684 TGCTTCTTCA TCCTGTCCTC CAAACCCCAA TCATTCCTCT TCTTGATCCC TGCTTTCTGG 16976 .......... .......... .......... .......... .......... .......... 684 TCTAGTTTCT TATACTTTTG GACTGAAGAC ATTGTTATGT TCTTTTGAGT TTTGTGCTTA 17036 ||| ||||| || .......... .......... .......... .......... .....TGAAC TTTGT-ATTT 698 GACTGAATAT TTTGTGAGCT TAAAATGTTG GTGTATCTCG GATCCTCTGC TTCCTTCGTT 17096 || |||| || |||||| GATTGAAGAT TTTGTG.... .......... .......... .......... .......... 714 CTTGCATGTT TTTGCATATG TTTACAGATG GCTAATATCA TTAGATTGGA GGTATCTTGT 17156 || || .......... .......... .......TTG TTTA...... .......... .......... 721 TGTGTTCTAC GCGATGTTGC TCTTCCGGAG CTCTGCGTTT TCGTTGGCTT GGTTTACTGT 17216 .......... .......... .......... .......... .......... .......... 721 TTTAACGCTT TTCTCTTCTT TTTGATATGT ATTGCTCTTG TGTACTGTGT GTATTGACCT 17276 .......... .......... .......... .......... .......... .......... 721 TATACAGGAA ATAGGAGTAG ACGATTTGTC ATCATCAAAA AAGAGGGAAA ATGTGTTGGG 17336 || ||| .......CAA ATAT...... .......... .......... .......... .......... 728 TTTTATTTTC CCTAAATTTC TTATCATAAA TAGGTTTTCC TTTAAGGGGA AGGTTTTGAT 17396 .......... .......... .......... .......... .......... .......... 728 TGACTAATCA TTTTCTTGTA GGAAAAGGTT TAGGACTCTA TAAATAGAGA AATGTTCCTT 17456 .......... .......... 
.......... .......... .......... .......... 728 CTAACTTAGT CAGCATTCAC AATGTAGTCT TAAGAGCTTT GAGAGTTTTG GTTAGGGAGA 17516 .......... .......... .......... .......... .......... .......... 728 GAATTTATGG GTCACAAGTT GGATACATTA TCACTTGTGT GAACCTCCCA TGTATTCCGA 17576 .......... .......... .......... .......... .......... .......... 728 GTGAATTGGT TGAGGTTGTT TCTCTCTGTA TTTTGTACTC TCATATTTAT AGTGGATTGC 17636 .......... .......... .......... .......... .......... .......... 728 TCATCTCCTT TGTGGACGTA GGTCGATTGA CCGAACCACA TTAAATCTTT GTGTGTTTTG 17696 .......... .......... .......... .......... .......... .......... 728 GTATATTTCT CGTTGTCTTC TTACTCGTGG TCTTTTGAGG TTTGCTTTGC TAGCTTCCGC 17756 .......... .......... .......... .......... .......... .......... 728 GTTTACACCT GCTGATTTTC GGTCCTAACA AAATGATGGT TTACGAAGTT GACGATAGAC 17816 .......... .......... .......... .......... .......... .......... 728 ATGCAATCAG ATTTCAAAAA GAGCTGGGAG AAAAGATTTC AAAAACGAGA GAAAGAAAAG 17876 .......... .......... .......... .......... .......... .......... 728 AGAGCTAACA CACAAAGGCC AATCAATTTG AGAGATCATA GAAATATAAG TTAAGAAGTT 17936 .......... .......... .......... .......... .......... .......... 728 CTATAGTTAG AATTTCTTTT GTCATGAGTA GAGTGTTTTG ATTCGTACTT AGAGTCATTA 17996 .......... .......... .......... .......... .......... .......... 728 TCGTAGGAGG AGTGGATTTG GCTTCTTGTA GAGTTGAGTC TTTGAGAGGT TTGTGACAAG 18056 .......... .......... .......... .......... .......... .......... 728 GAGTGGATTT GACTTCTTGG AGAGTAGAGA TAGTCGATTA TAGTTAATCG AAAGTTTTTC 18116 .......... .......... .......... .......... .......... .......... 728 TGTGAGTTCA TTGATGAGTT AATAACTTTG ATAGCCACTC AACTATCAAT TGTTTTCACA 18176 .......... .......... .......... .......... .......... .......... 728 GATATAGACA TCTGAAATGC TCTGTTGTTT GTGACTTTAT AAACTTGTTG ATTTTAGTAT 18236 .......... .......... .......... .......... .......... .......... 728 ATATCCTCAA AATGTAAGTT GTACTCCTAC ATTCCAAGTC TGCATAATAT GGCCATTTTC 18296 .......... .......... .......... 
.......... .......... .......... 728 AATGCATGAC TGTGCTACAT GTGTGTGACT TTGTTTATCA ATTCTCTGCT GGAACGCATA 18356 .......... .......... .......... .......... .......... .......... 728 TGACAACAAT CTATGTTCCG AGATTATCAA CAACTTGAAT ATCCCTGCTA ACCAGACTGC 18416 .......... .......... .......... .......... .......... .......... 728 AACTAGTTGT TGCCTGCAAT CACATATCTG CATATTGTTG CACAGGACTG TGATTGTCTT 18476 .......... .......... .......... .......... .......... .......... 728 TATTGGTTGC TAAATGTACG TTCCCTCATT CACTTAGCTG CATAGAGTGC TGACTATTAG 18536 .......... .......... .......... .......... .......... .......... 728 AGTTTAGGGC CTGATGACAT GATTATCTCC AACCTAAAAT TCGTAAACAC AAACCAAGCT 18596 .......... .......... .......... .......... .......... .......... 728 AACCAAAACT AGTTTAGTTC ATAATGACAA GGAATTGTAC AGAAATAAAG GAGGAATTTT 18656 .......... .......... .......... .......... .......... .......... 728 AATCTTTCTG AAATGATCAT AATGCACAAC TACCTGAACT TTCTGTTCCT AATAAACAAC 18716 .......... .......... .......... .......... .......... .......... 728 AACTTAAATA ATATTTAAGC TGAAAGACAA GAAAAACGGA AAGTCCAGAA GTTGTATTTA 18776 .......... .......... .......... .......... .......... .......... 728 ACATAGAGGA AATCAAGGAG GAACATTATT CCTTGTCAAC TTTTTCCTTC TGAAACTTCG 18836 .......... .......... .......... .......... .......... .......... 728 CATTCCCAAC CTCTCCAAAT TACAACTTCC TCTCGCTGTT TTTAGTAATT GAGTGGGAGT 18896 .......... .......... .......... .......... .......... .......... 728 GTTAAAATGT TGAATAATAT CATGCTTGTG ACTATATTTT GTCCATAAAT ATAGGTAAAA 18956 .......... .......... .......... .......... .......... .......... 728 AAAGTTTTCT TCCTATTATT TCTTTTTACT TAGGATCTTT TCTTTTTCCT TTTATATAAT 19016 .......... .......... .......... .......... .......... .......... 728 ATTTTTTTTT ATCTAATTAG AAGAATAATT AACTGAATAG AAGTATGGAA ACTATAGATA 19076 .......... .......... .......... .......... .......... .......... 728 ACCTACCGTA TGTACTGGTA TCACCCCATT CTTAATCTTA TTATGAAATT AAAATTAGTT 19136 .......... .......... .......... .......... 
.......... .......... 728 TGTCAAAATG GTTTGATTAT CTCTTGGTAA TTTATTAATC ACGGACCAAT ATATACCTTT 19196 .......... .......... .......... .......... .......... .......... 728 AACAAAAGAT TACAATAAAA AATATTATAT AAAAGAAAAA TAAAAAAAAG ATCCTAAGTA 19256 .......... .......... .......... .......... .......... .......... 728 AAAAAAATAA TAGGAAGAAA ACTTTTTTAC ATAAATAATA TGGACAAAAT AATATGTTGC 19316 .......... .......... .......... .......... .......... .......... 728 TAAAATTGTG AGATTATAAT AATTTTGATA CTTGATATAT AATATATTTA TTAACTAAAT 19376 .......... .......... .......... .......... .......... .......... 728 CATGCTTAGA GGTTTAGTTT GAAAAATTAA ATCTTAATCA TTTATTTTAA CCTATGACAT 19436 .......... .......... .......... .......... .......... .......... 728 GTGTCATTAA TCCAAATAGA AACTTGGGGA AAATGGAGAG AAGCCGAATC CCTCCGGAAC 19496 .......... .......... .......... .......... .......... .......... 728 TTCGCCACAA AAATCTGACC AAAGTCATAA CAAGCTGCTG CAACCTTGAT TTCAAGGTCC 19556 .......... .......... .......... .......... .......... .......... 728 TGGTGTTGGA ATACATGCCC AATGGGACAC TTGATAAATG GTTATATTCT CACAACTTGT 19616 .......... .......... .......... .......... .......... .......... 728 TCTTAAACTT ATTGCAGAGA TTGGATATAA TGATAGATGT TGCATCTGCA ATGGACTATC 19676 .......... .......... .......... .......... .......... .......... 728 TCCACAATGG CTATTCAACG CCTGTGGTGC ATTGTGACTT GAAGCCAAGC AATATGTTGC 19736 .......... .......... .......... .......... .......... .......... 728 TAGATCAAGA AATGGTTGGT CAAGTTAGTG ATTTTGGCAT TGCAAAATTG TTAGATGTAG 19796 .......... .......... .......... .......... .......... .......... 728 GGGAGGCTTT CGTTCAAACA AGAACAACTG CAACCATCGG ATATATTGCT CCAGTATATT 19856 .......... .......... .......... .......... .......... .......... 728 AGAAACTTCT ATAGTTAAGC AATTCTTTCC CCTATAAATG GATTGACACT CTCAGTTTTG 19916 .......... .......... .......... .......... .......... .......... 728 GCATATTTCA TTTTAACTAG GAAGCTTTCA TGATCATTTT TTACCAAGAT TACAGTGTAT 19976 .......... .......... .......... .......... .......... 
.......... 728 GGACAAGATG GAATAGTATC CACGAGTTGT GATGTTTATA GTTTTGGCAT CCTAATGATG 20036 .......... .......... .......... .......... .......... .......... 728 GAGAGGTTCA CAAGAAGGAA ACCAAGTGAT GAAATATTTA CAGGAGAAAC AATCATAAAA 20096 .......... .......... .......... .......... .......... .......... 728 TAAGTTCAGG GAAAAAATGC ACAAGTACCC CTCAACCTAT GTCCGATAGG TCGAAAAGGG 20156 .......... .......... .......... .......... .......... .......... 728 ATAGAAAATT ACTTATAAAA TAAGTTGAGG GGGTAATAGG ACCTTAGTGT TGTATAAGTG 20216 .......... .......... .......... .......... .......... .......... 728 TGTCTCTGAA ATTTCGGACA TAGGTTGAGG GGGTACCTGT GCATTTTCCC TCAGAATTAT 20276 .......... .......... .......... .......... .......... .......... 728 TTACATCTTA ACTTTATCAG TTTTTAAGTT TTATATTTGA ACTATTGAAA GTATGAGTTT 20336 .......... .......... .......... .......... .......... .......... 728 TCGACCTAAA CAATCACCTA ATGATTAGTG AAACACACCT TGACTAATTT ATAATGTTGT 20396 .......... .......... .......... .......... .......... .......... 728 GGTTAATACA CTCTCTTATC TATTTAAAAT TTTTGTCATG TAGCTCTCCA CATGAAAAAA 20456 .......... .......... .......... .......... .......... .......... 728 TAATTTCACT GTGACAAAGC TAAATAAATT AATATTACGT TAAATGTTAG GATTAAAGAT 20516 .......... .......... .......... .......... .......... .......... 728 ACATATTATC ACCATATAAA TTAATTTATT CAGTTTTGTG CCTGATATCT TCTTCTCCAC 20576 .......... .......... .......... .......... .......... .......... 728 CTTATTAGTC GTGGTCTCAT CTCAGCCGCG GGTAAATATC TCATATACTT GGAAGGGGAG 20636 || | .......... .......... .....CCTCT T......... .......... .......... 734 AATTTAGATT ATATATATTC ATTCTCTAAT ATTAGTAATC AATGTGGTGA GATTAGTGAA 20696 .......... .......... .......... .......... .......... .......... 734 TTTCTTTCTT CTAAAATTTG AACACTTGAA ACGAATTTTT GTAATATAAA CTATTTATAA 20756 .......... .......... .......... .......... .......... .......... 734 CCAACACAAA ATTGTCTTAT TTGAACTAGT GAATTCCAAA TAATAAAGGT TAATAACCAA 20816 .......... .......... .......... .......... .......... 
.......... 734 GTGGCAACTA ACCATTATAA GGTCACACAA TTTTAAGATA ATACTGTAAA GCATTCTTTG 20876 .......... .......... .......... .......... .......... .......... 734 ATTATCTGCC TTGACCTTGC TTTGTGCCTT TGGTCTTTGG CTCAACAATA AGATAGTATA 20936 .......... .......... .......... .......... .......... .......... 734 TTGGAAAGTA AATATTTTTG CTTTACTGAT TGCTTTGCTT TTTTGCAGAA AAGGGGATAT 20996 .......... .......... .......... .......... .......... .......... 734 TATTTGTTTT TTTCCACTTC CCTTTTTATA ATGGAAGACT CTTGAAAGCA ATTGAGTCTA 21056 .......... .......... .......... .......... .......... .......... 734 TTTAAATGTT ATTTTATTCT ACCTTTTATT TTCTTCCTTT AACTTCTTGT TCAAGTAAAA 21116 .......... .......... .......... .......... .......... .......... 734 AAAAGTTTTT GAATTTTGAA AGATAAAAGA ATAATATCAT AGTTATAGTT ATATATAAAA 21176 ||| || || ||| | .......... .......... .......... .......... ..TTAAGTTT ATCTATTTTA 752 GGGAAAAAGG TCTGATATAC CCCTCAACTT TGTCATTTGG AGCTCATATA CCCCTCGTTA 21236 ||||||| | | ||||||| ||||| || ||||||||| || | | | |||||| CGGAAAAA-G TTTGATATAT TTCTCAATTT TGTCATTTGC AG--C----A -CTGTCGTTA 804 TAAAAGTGGC TCATATATGC CCTTACCGTT ATACAAACGG CTCACATTTA CCCCTGCTGT 21296 |||| |||| | |||||| CAAAA-TGGC TAATATAT.. .......... .......... .......... .......... 821 TATAAAATGA CTCACATATA CCCTTCATTT AACGGAAGTT AAAAAATTTA TTTTAAATTT 21356 || |||||||| | |||||||||| .......... .......... .......... 
.......ATT AAAAAATTAA TTTTAAATTT 844 ATATTTATTA CTTTTAATTT TTTTTAAAAA AAATTATTTA GAGATATATA TGATTCTTCT 21416 |||||||||| ||||| ||| ||||| |||| |||||||||| || |||||| | |||||||| ATATTTATTA CTTTTGATTG TTTTT-AAAA AAATTATTTA AAGGTATATA TTATTCTTCT 903 ATCAAAGTTC AATG-TATAT TTTAATTTTT TTCATACATA AATTATTTTT TGACTTCATT 21475 |||||||||| | | |||| |||||||||| || ||||||| ||||||| || ||||||| || ATCAAAGTTC GAGGTTATA- TTTAATTTTT TTTATACATA AATTATTGTT TGACTTCTTT 962 TATTATAATT ATTTGAGTTT CTTATTCTTA TTTTGTTTTT TTCTTTCATT CCTTAGTTTA 21535 |||||||||| |||||||||| ||||||| | |||| ||| | ||| ||| || || | TATTATAATT ATTTGAGTTT GTTATTCTAA TTTTTTTTCT TTCATTCCTT AGTTTAAAGA 1022 AATAAAAAAA TTAAACTATT TTTTTACTGT GTATTGTAAT TTAATTTCGT ATTCGAAGAA 21595 | ||||||| ||||||||| ||||| ||| |||||||||| ||||||| || |||| ||||| GA-AAAAAAA CTAAACTATT TTTTTTGTGT GTATTGTAAT TTAATTTGGT ATTCAAAGAA 1081 AAAATTTGGT CATCTACAAT AAGTTTTACA AGAATATTAG TGAAACATAA ATAAATTTGA 21655 ||| |||||| |||||||||| |||||||||| |||||||||| ||||| |||| |||||||||| AAATTTTGGT CATCTACAAT AAGTTTTACA AGAATATTAG TGAAATATAA ATAAATTTGA 1141 TTATCAAAAT AATAATTATA AATTAGTCAT TGAAATACAA AAAAA 21700 |||||||||| |||||||||| |||||||||| | || | || ||||| TTATCAAAAT AATAATTATA AATTAGTCAT T-AACCAAAA AAAAA 1185 hqPGS_C06HBa0054K13.1-16+_SGN-U322569+ (17022 17052,17124 17130,17284 17290,20602 20607,21159 21254,21334 21700) ******************************************************************************** EST sequence 18 +strand 965 n (File: SGN-U334483+) 1 NNNNNNNNGG GGGAGGGGGG GCTTTTTNAA AGAAGAAGAA CGGGAGAATA CGCCGGGCCG 61 GCTGCTCTAG AACTAGTGGA TCCCCCCGGC TGCAGGAATT CGGCACCGGG ATTAGTCAAT 121 CCAAAATTTT TCCCTAAAAG GAAAACCTAT TTATGTTAAG AAATTATGGT AAATAAAATC 181 CAACAAATCT CCTCCCCCTT GGCCTGTATT TCTGACCAAA ATAAATTTCT CCACATTCTT 241 CATTTAATCT TCAACAACTT GCTTCTCTTC TTCTTAATCT CCTTTGTAAA ATTTATGTCT 301 CAACCATAGA AAACCTCTCT GAAACAATTT CTCCAACAAA ATCTTCATTA CTGTCAAAAA 361 GATTGCGGCT AGAACCTACC ACCTGTCAAG ATGAACCACC ACCCTCTTTC TAACCTGGTC 421 CAATAATCGA TTATCGAACC ACTGAACCTG 
ACCTCTGTCA TTAAATGGCT CTAATACCCA 481 CTTGTTAGGA TCGAAATAAG CAGGTGTAAA TGCGGAAGCT AGCAAAGCAA ACTTCGAAAG 541 ACCACGAGTA AGAAGACAAC GAGAAATATA CCAAAAGACA CCAAAGATTT AACGTGGTTC 601 GGTCAATCGA CCTACGTTCA CAAAGGAGAT GAGCAATCCA CTATAAATAT GAGAGTACAA 661 TATACAGAGA GAAACAACAT CAACCAATTC ATTCGGAATA TATGGGAGGT TCACATAAGT 721 GATAACATAT CTAGCTNGTG ACCTATAAAT TCGTCTATTT ATTGAATCAA GGCAAAATAA 781 ACCCTCGAGA GATATCGAAG GCCTCAAAAT TGTCTCTCAC TTTATTAGTG TCCTCTAAAA 841 TAACTTGCAT TCAAAAATTG GTGATTATTG TGTGAACCTA ATGATTTAAT GGACCATGAA 901 TTAAAATTAT AATTTTACCT CCTATAAGTA TTGCCCTCAA AAAAAAAAAA AAAAACTCAA 961 GGGGG Predicted gene structure (within gDNA segment 24291 to 13511): Exon 1 23209 23196 ( 14 n); cDNA 450 463 ( 14 n); score: 0.643 Intron 1 23195 17996 (5200 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0) Exon 2 17995 17985 ( 11 n); cDNA 464 474 ( 11 n); score: 0.909 Intron 2 17984 17795 ( 190 n); Pd: 0.908 (s: 0), Pa: 0.000 (s: 0.82) Exon 3 17794 17517 ( 278 n); cDNA 475 752 ( 278 n); score: 0.901 Intron 3 17516 14512 (3005 n); Pd: 0.000 (s: 0.86), Pa: 0.162 (s: 0) Exon 4 14511 14498 ( 14 n); cDNA 753 766 ( 14 n); score: 0.714 PPA cDNA 939 956 MATCH C06HBa0054K13.1-16- SGN-U334483+ 0.901 317 0.328 C PGS_C06HBa0054K13.1-16-_SGN-U334483+ (23209 23196,17995 17985,17794 17517,14511 14498) Alignment (genomic DNA sequence = upper lines): GAAATATATC ATGAGTACTA TGTTTGGCGG AAAAGAGAAA AAGGAAGAAA ATTTCATACT 23150 || | | || || | GACCTCTGTC ATTA...... .......... .......... .......... .......... 463 ACCTTCCAAA TGCAATTCTG AAAGATATAA ATATGTAAGA GCTGTTAAGT TGGCTAGCTC 23090 .......... .......... .......... .......... .......... .......... 463 TCTTGGTAAA GTTCCAATAA ACTTATTTTC ACCCAATTGC AATTTTTGAA GCTTTCTGCA 23030 .......... .......... .......... .......... .......... .......... 463 TTTCTCCAGG TTTGGTGGAA TAACTCCATC TAGGGAGTTT TTACTGAGGT AAAGTCCTTC 22970 .......... .......... .......... .......... .......... .......... 463 CAAGTCTCGA AGATGATCAC ATATCGTTTT TGGAAGATTT CCAGTAAGAT TGTTGCCCGT 22910 .......... 
.......... .......... .......... .......... .......... 463 AAGAGCAATC ACATGCATTG TAGTAATGTT AAAAATGGAT GGTGGTATAG AGCCACTAAG 22850 .......... .......... .......... .......... .......... .......... 463 CTGATTAATT TGCAGGTCTA GGATAGTCAA GTAACGAAGA TCACCGATTT CTCGAGGGAT 22790 .......... .......... .......... .......... .......... .......... 463 CTCTCCTTCA AGAAAATTTC TATCCAAGTA TAACCTTTGC AGCTTTGTTA TATTGGAAAG 22730 .......... .......... .......... .......... .......... .......... 463 GGAGGATGGA ATTTTCCCAG AAAATTGGTT GCTTGATAGG TGCACAAAGC GTAGGTTTGG 22670 .......... .......... .......... .......... .......... .......... 463 TAACAAACTT AAAAATGATG GAATGGCTCC GGTGAAGTTA TTGCTTGTGA CATTAATCGA 22610 .......... .......... .......... .......... .......... .......... 463 TTTCAACCTC TGCAGACGAG CCAATTCTTG TGGCAAATCT CCATGAAAAG TGTTGTTACT 22550 .......... .......... .......... .......... .......... .......... 463 GATGTCAAGG GAAGAAAGAA ATGACAGGTT TCCGAGGTGT GGAGGAATGG TACCATGAAG 22490 .......... .......... .......... .......... .......... .......... 463 TTGCATGCTT GAAATGTCTA AAGCAGTGAC TCGATGGTGT CGAGAGCTGC AAGTGATTCC 22430 .......... .......... .......... .......... .......... .......... 463 AATCCAACTA CAAACGGGGA CGGAAGAAGA CCAGTTTGTT GCTAAGATAT CGTTAGAAAT 22370 .......... .......... .......... .......... .......... .......... 463 ATGTGATTTA AGTGCAAGAA GAGCAGCTTC ATCAGTGCTA ATATTATTAG CATTGGCATG 22310 .......... .......... .......... .......... .......... .......... 463 TAGTAATAGT AGAATGAAAA TTATAAGAAA AAAGAGGAGA GTGGAACTTC TGTCCATAAC 22250 .......... .......... .......... .......... .......... .......... 463 TAATATAGAT ATCAAATTTA ATGATACCTA TAATGTTGTG GTATTCTATT TATAACACTA 22190 .......... .......... .......... .......... .......... .......... 463 CCATGTGCGT TGCACGTGAT TTCTAATTTC TGAAAATCAA TTTAAGTACT TTGTTTTTCT 22130 .......... .......... .......... .......... .......... .......... 463 CCCCTTGATT TCACGAAACC AATTACGATA AGTAAAACTT TTTAGAGAAT TTGTAATTTT 22070 .......... .......... 
.......... .......... .......... .......... 463 TTAAAATAGG GAAAAGGATC TAAAATATAT TCTAATTTTG ATCGAAATTG TTGTAACATT 22010 .......... .......... .......... .......... .......... .......... 463 CGAAAACTTT TTTCTCCAAC AACCTTATCT AATTAAGGCT CCCTCCCTTC ATTCAATATA 21950 .......... .......... .......... .......... .......... .......... 463 TTGACTACTT TCTTTCATAT GTCGTGGAAA GTCTATCTTT GACACTTATC AGTCATCTAA 21890 .......... .......... .......... .......... .......... .......... 463 TCTGATAAAA AACTAGTTGT ATGTGTTATA GATAATAAAA AAATATGAAA AAAAAATGGA 21830 .......... .......... .......... .......... .......... .......... 463 AAGGTTGAAA TGAATGAATA CATAAAATTG TAGAAAGCTA AAAAGAAAAT AAATTCATAC 21770 .......... .......... .......... .......... .......... .......... 463 CATGCAAATA ATTGAATCCA ATAAGGGTGA GGGCCTCCAT ATGAGTAAAT TAATTCATCG 21710 .......... .......... .......... .......... .......... .......... 463 TCAAACATAT TTTTTTGTAT TTCAATGACT AATTTATAAT TATTATTTTG ATAATCAAAT 21650 .......... .......... .......... .......... .......... .......... 463 TTATTTATGT TTCACTAATA TTCTTGTAAA ACTTATTGTA GATGACCAAA TTTTTTCTTC 21590 .......... .......... .......... .......... .......... .......... 463 GAATACGAAA TTAAATTACA ATACACAGTA AAAAAATAGT TTAATTTTTT TATTTAAACT 21530 .......... .......... .......... .......... .......... .......... 463 AAGGAATGAA AGAAAAAAAC AAAATAAGAA TAAGAAACTC AAATAATTAT AATAAATGAA 21470 .......... .......... .......... .......... .......... .......... 463 GTCAAAAAAT AATTTATGTA TGAAAAAAAT TAAAATATAC ATTGAACTTT GATAGAAGAA 21410 .......... .......... .......... .......... .......... .......... 463 TCATATATAT CTCTAAATAA TTTTTTTTAA AAAAAATTAA AAGTAATAAA TATAAATTTA 21350 .......... .......... .......... .......... .......... .......... 463 AAATAAATTT TTTAACTTCC GTTAAATGAA GGGTATATGT GAGTCATTTT ATAACAGCAG 21290 .......... .......... .......... .......... .......... .......... 463 GGGTAAATGT GAGCCGTTTG TATAACGGTA AGGGCATATA TGAGCCACTT TTATAACGAG 21230 .......... .......... .......... 
.......... .......... .......... 463 GGGTATATGA GCTCCAAATG ACAAAGTTGA GGGGTATATC AGACCTTTTT CCCTTTTATA 21170 .......... .......... .......... .......... .......... .......... 463 TATAACTATA ACTATGATAT TATTCTTTTA TCTTTCAAAA TTCAAAAACT TTTTTTTACT 21110 .......... .......... .......... .......... .......... .......... 463 TGAACAAGAA GTTAAAGGAA GAAAATAAAA GGTAGAATAA AATAACATTT AAATAGACTC 21050 .......... .......... .......... .......... .......... .......... 463 AATTGCTTTC AAGAGTCTTC CATTATAAAA AGGGAAGTGG AAAAAAACAA ATAATATCCC 20990 .......... .......... .......... .......... .......... .......... 463 CTTTTCTGCA AAAAAGCAAA GCAATCAGTA AAGCAAAAAT ATTTACTTTC CAATATACTA 20930 .......... .......... .......... .......... .......... .......... 463 TCTTATTGTT GAGCCAAAGA CCAAAGGCAC AAAGCAAGGT CAAGGCAGAT AATCAAAGAA 20870 .......... .......... .......... .......... .......... .......... 463 TGCTTTACAG TATTATCTTA AAATTGTGTG ACCTTATAAT GGTTAGTTGC CACTTGGTTA 20810 .......... .......... .......... .......... .......... .......... 463 TTAACCTTTA TTATTTGGAA TTCACTAGTT CAAATAAGAC AATTTTGTGT TGGTTATAAA 20750 .......... .......... .......... .......... .......... .......... 463 TAGTTTATAT TACAAAAATT CGTTTCAAGT GTTCAAATTT TAGAAGAAAG AAATTCACTA 20690 .......... .......... .......... .......... .......... .......... 463 ATCTCACCAC ATTGATTACT AATATTAGAG AATGAATATA TATAATCTAA ATTCTCCCCT 20630 .......... .......... .......... .......... .......... .......... 463 TCCAAGTATA TGAGATATTT ACCCGCGGCT GAGATGAGAC CACGACTAAT AAGGTGGAGA 20570 .......... .......... .......... .......... .......... .......... 463 AGAAGATATC AGGCACAAAA CTGAATAAAT TAATTTATAT GGTGATAATA TGTATCTTTA 20510 .......... .......... .......... .......... .......... .......... 463 ATCCTAACAT TTAACGTAAT ATTAATTTAT TTAGCTTTGT CACAGTGAAA TTATTTTTTC 20450 .......... .......... .......... .......... .......... .......... 463 ATGTGGAGAG CTACATGACA AAAATTTTAA ATAGATAAGA GAGTGTATTA ACCACAACAT 20390 .......... .......... .......... .......... 
.......... .......... 463 TATAAATTAG TCAAGGTGTG TTTCACTAAT CATTAGGTGA TTGTTTAGGT CGAAAACTCA 20330 .......... .......... .......... .......... .......... .......... 463 TACTTTCAAT AGTTCAAATA TAAAACTTAA AAACTGATAA AGTTAAGATG TAAATAATTC 20270 .......... .......... .......... .......... .......... .......... 463 TGAGGGAAAA TGCACAGGTA CCCCCTCAAC CTATGTCCGA AATTTCAGAG ACACACTTAT 20210 .......... .......... .......... .......... .......... .......... 463 ACAACACTAA GGTCCTATTA CCCCCTCAAC TTATTTTATA AGTAATTTTC TATCCCTTTT 20150 .......... .......... .......... .......... .......... .......... 463 CGACCTATCG GACATAGGTT GAGGGGTACT TGTGCATTTT TTCCCTGAAC TTATTTTATG 20090 .......... .......... .......... .......... .......... .......... 463 ATTGTTTCTC CTGTAAATAT TTCATCACTT GGTTTCCTTC TTGTGAACCT CTCCATCATT 20030 .......... .......... .......... .......... .......... .......... 463 AGGATGCCAA AACTATAAAC ATCACAACTC GTGGATACTA TTCCATCTTG TCCATACACT 19970 .......... .......... .......... .......... .......... .......... 463 GTAATCTTGG TAAAAAATGA TCATGAAAGC TTCCTAGTTA AAATGAAATA TGCCAAAACT 19910 .......... .......... .......... .......... .......... .......... 463 GAGAGTGTCA ATCCATTTAT AGGGGAAAGA ATTGCTTAAC TATAGAAGTT TCTAATATAC 19850 .......... .......... .......... .......... .......... .......... 463 TGGAGCAATA TATCCGATGG TTGCAGTTGT TCTTGTTTGA ACGAAAGCCT CCCCTACATC 19790 .......... .......... .......... .......... .......... .......... 463 TAACAATTTT GCAATGCCAA AATCACTAAC TTGACCAACC ATTTCTTGAT CTAGCAACAT 19730 .......... .......... .......... .......... .......... .......... 463 ATTGCTTGGC TTCAAGTCAC AATGCACCAC AGGCGTTGAA TAGCCATTGT GGAGATAGTC 19670 .......... .......... .......... .......... .......... .......... 463 CATTGCAGAT GCAACATCTA TCATTATATC CAATCTCTGC AATAAGTTTA AGAACAAGTT 19610 .......... .......... .......... .......... .......... .......... 463 GTGAGAATAT AACCATTTAT CAAGTGTCCC ATTGGGCATG TATTCCAACA CCAGGACCTT 19550 .......... .......... .......... .......... .......... 
.......... 463 GAAATCAAGG TTGCAGCAGC TTGTTATGAC TTTGGTCAGA TTTTTGTGGC GAAGTTCCGG 19490 .......... .......... .......... .......... .......... .......... 463 AGGGATTCGG CTTCTCTCCA TTTTCCCCAA GTTTCTATTT GGATTAATGA CACATGTCAT 19430 .......... .......... .......... .......... .......... .......... 463 AGGTTAAAAT AAATGATTAA GATTTAATTT TTCAAACTAA ACCTCTAAGC ATGATTTAGT 19370 .......... .......... .......... .......... .......... .......... 463 TAATAAATAT ATTATATATC AAGTATCAAA ATTATTATAA TCTCACAATT TTAGCAACAT 19310 .......... .......... .......... .......... .......... .......... 463 ATTATTTTGT CCATATTATT TATGTAAAAA AGTTTTCTTC CTATTATTTT TTTTACTTAG 19250 .......... .......... .......... .......... .......... .......... 463 GATCTTTTTT TTATTTTTCT TTTATATAAT ATTTTTTATT GTAATCTTTT GTTAAAGGTA 19190 .......... .......... .......... .......... .......... .......... 463 TATATTGGTC CGTGATTAAT AAATTACCAA GAGATAATCA AACCATTTTG ACAAACTAAT 19130 .......... .......... .......... .......... .......... .......... 463 TTTAATTTCA TAATAAGATT AAGAATGGGG TGATACCAGT ACATACGGTA GGTTATCTAT 19070 .......... .......... .......... .......... .......... .......... 463 AGTTTCCATA CTTCTATTCA GTTAATTATT CTTCTAATTA GATAAAAAAA AATATTATAT 19010 .......... .......... .......... .......... .......... .......... 463 AAAAGGAAAA AGAAAAGATC CTAAGTAAAA AGAAATAATA GGAAGAAAAC TTTTTTTACC 18950 .......... .......... .......... .......... .......... .......... 463 TATATTTATG GACAAAATAT AGTCACAAGC ATGATATTAT TCAACATTTT AACACTCCCA 18890 .......... .......... .......... .......... .......... .......... 463 CTCAATTACT AAAAACAGCG AGAGGAAGTT GTAATTTGGA GAGGTTGGGA ATGCGAAGTT 18830 .......... .......... .......... .......... .......... .......... 463 TCAGAAGGAA AAAGTTGACA AGGAATAATG TTCCTCCTTG ATTTCCTCTA TGTTAAATAC 18770 .......... .......... .......... .......... .......... .......... 463 AACTTCTGGA CTTTCCGTTT TTCTTGTCTT TCAGCTTAAA TATTATTTAA GTTGTTGTTT 18710 .......... .......... .......... .......... .......... .......... 
463 ATTAGGAACA GAAAGTTCAG GTAGTTGTGC ATTATGATCA TTTCAGAAAG ATTAAAATTC 18650 .......... .......... .......... .......... .......... .......... 463 CTCCTTTATT TCTGTACAAT TCCTTGTCAT TATGAACTAA ACTAGTTTTG GTTAGCTTGG 18590 .......... .......... .......... .......... .......... .......... 463 TTTGTGTTTA CGAATTTTAG GTTGGAGATA ATCATGTCAT CAGGCCCTAA ACTCTAATAG 18530 .......... .......... .......... .......... .......... .......... 463 TCAGCACTCT ATGCAGCTAA GTGAATGAGG GAACGTACAT TTAGCAACCA ATAAAGACAA 18470 .......... .......... .......... .......... .......... .......... 463 TCACAGTCCT GTGCAACAAT ATGCAGATAT GTGATTGCAG GCAACAACTA GTTGCAGTCT 18410 .......... .......... .......... .......... .......... .......... 463 GGTTAGCAGG GATATTCAAG TTGTTGATAA TCTCGGAACA TAGATTGTTG TCATATGCGT 18350 .......... .......... .......... .......... .......... .......... 463 TCCAGCAGAG AATTGATAAA CAAAGTCACA CACATGTAGC ACAGTCATGC ATTGAAAATG 18290 .......... .......... .......... .......... .......... .......... 463 GCCATATTAT GCAGACTTGG AATGTAGGAG TACAACTTAC ATTTTGAGGA TATATACTAA 18230 .......... .......... .......... .......... .......... .......... 463 AATCAACAAG TTTATAAAGT CACAAACAAC AGAGCATTTC AGATGTCTAT ATCTGTGAAA 18170 .......... .......... .......... .......... .......... .......... 463 ACAATTGATA GTTGAGTGGC TATCAAAGTT ATTAACTCAT CAATGAACTC ACAGAAAAAC 18110 .......... .......... .......... .......... .......... .......... 463 TTTCGATTAA CTATAATCGA CTATCTCTAC TCTCCAAGAA GTCAAATCCA CTCCTTGTCA 18050 .......... .......... .......... .......... .......... .......... 463 CAAACCTCTC AAAGACTCAA CTCTACAAGA AGCCAAATCC ACTCCTCCTA CGATAATGAC 17990 |||| | .......... .......... .......... .......... .......... ....AATGGC 469 TCTAAGTACG AATCAAAACA CTCTACTCAT GACAAAAGAA ATTCTAACTA TAGAACTTCT 17930 ||||| TCTAA..... .......... .......... .......... .......... .......... 474 TAACTTATAT TTCTATGATC TCTCAAATTG ATTGGCCTTT GTGTGTTAGC TCTCTTTTCT 17870 .......... .......... .......... .......... .......... 
.......... 474 TTCTCTCGTT TTTGAAATCT TTTCTCCCAG CTCTTTTTGA AATCTGATTG CATGTCTATC 17810 .......... .......... .......... .......... .......... .......... 474 GTCAACTTCG TAAACCATCA TTTTGTTAGG ACCGAAAATC AGCAGGTGTA AACGCGGAAG 17750 | | |||||||| | || |||| |||||||||| || ||||||| .......... .....TACCC ACTTGTTAGG ATCG-AAATA AGCAGGTGTA AATGCGGAAG 518 CTAGCAAAGC AAACCTCAAA AGACCACGAG TAAGAAGACA ACGAGAAATA TACCAAAACA 17690 |||||||||| |||| || || |||||||||| |||||||||| |||||||||| |||||||| | CTAGCAAAGC AAACTTCGAA AGACCACGAG TAAGAAGACA ACGAGAAATA TACCAAAAGA 578 CA-CAAAGAT TTAATGTGGT TCGGTCAATC GACCTACGTC CACAAAGGAG ATGAGCAATC 17631 || ||||||| |||| ||||| |||||||||| ||||||||| |||||||||| |||||||||| CACCAAAGAT TTAACGTGGT TCGGTCAATC GACCTACGTT CACAAAGGAG ATGAGCAATC 638 CACTATAAAT ATGAGAGTAC AAAATACAGA GAGAAACAAC CTCAACCAAT TCACTCGGAA 17571 |||||||||| |||||||||| || ||||||| |||||||||| ||||||||| ||| |||||| CACTATAAAT ATGAGAGTAC AATATACAGA GAGAAACAAC ATCAACCAAT TCATTCGGAA 698 TACATGGGAG GTTCACACAA GTGATAATGT ATCCAACTTG TGACCCATAA ATTCTCTCCC 17511 || ||||||| ||||||| || ||||||| | ||| | || | ||||| |||| |||| TATATGGGAG GTTCACATAA GTGATAACAT ATCTAGCTNG TGACCTATAA ATTC...... 752 TAACCAAAAC TCTCAAAGCT CTTAAGACTA CATTGTGAAT GCTGACTAAG TTAGAAGGAA 17451 .......... .......... .......... .......... .......... .......... 752 CATTTCTCTA TTTATAGAGT CCTAAACCTT TTCCTACAAG AAAATGATTA GTCAATCAAA 17391 .......... .......... .......... .......... .......... .......... 752 ACCTTCCCCT TAAAGGAAAA CCTATTTATG ATAAGAAATT TAGGGAAAAT AAAACCCAAC 17331 .......... .......... .......... .......... .......... .......... 752 ACATTTTCCC TCTTTTTTGA TGATGACAAA TCGTCTACTC CTATTTCCTG TATAAGGTCA 17271 .......... .......... .......... .......... .......... .......... 752 ATACACACAG TACACAAGAG CAATACATAT CAAAAAGAAG AGAAAAGCGT TAAAACAGTA 17211 .......... .......... .......... .......... .......... .......... 752 AACCAAGCCA ACGAAAACGC AGAGCTCCGG AAGAGCAACA TCGCGTAGAA CACAACAAGA 17151 .......... .......... .......... .......... 
.......... .......... 752 TACCTCCAAT CTAATGATAT TAGCCATCTG TAAACATATG CAAAAACATG CAAGAACGAA 17091 .......... .......... .......... .......... .......... .......... 752 GGAAGCAGAG GATCCGAGAT ACACCAACAT TTTAAGCTCA CAAAATATTC AGTCTAAGCA 17031 .......... .......... .......... .......... .......... .......... 752 CAAAACTCAA AAGAACATAA CAATGTCTTC AGTCCAAAAG TATAAGAAAC TAGACCAGAA 16971 .......... .......... .......... .......... .......... .......... 752 AGCAGGGATC AAGAAGAGGA ATGATTGGGG TTTGGAGGAC AGGATGAAGA AGCAAGTAAC 16911 .......... .......... .......... .......... .......... .......... 752 TGGAGGATCC AGTCCACTCT AGCGTTGCTA TCAAGCTTCT TCTGTACTAA CTGATCCTTC 16851 .......... .......... .......... .......... .......... .......... 752 AATCAACCAA CTTCTTCTTT AGATGCAGTT AAATCAGTAT GCATTTTCAG CCAGTAGATT 16791 .......... .......... .......... .......... .......... .......... 752 TTCACACTTT TCCCTTGCAA TCTTAAGATC CCAAATTAGT AGACGAGTAA CAGGTCCAGA 16731 .......... .......... .......... .......... .......... .......... 752 AGCCCGCGGA GATGCAATAG GAACCTGCAC GAGTTAAAGC TCTTCCCTCA CCCAATGGGC 16671 .......... .......... .......... .......... .......... .......... 752 GCACAGCAAA CTCATTGAAC ACTAAAAGGT TTAAACTGCA TCAAATTTTG GAGGCAAGTT 16611 .......... .......... .......... .......... .......... .......... 752 GCTTTTGATT CTTATGTAAG TTTGACGACA AGGAATTTAG GCAGCTACTC CCGGAGACTA 16551 .......... .......... .......... .......... .......... .......... 752 CACTTAGATG GCTGGAAGGA GAGGACTTGA TAAAACTGTT ACAGTTGTGG TGATATAATT 16491 .......... .......... .......... .......... .......... .......... 752 GGAAGCATTC TCAAAGTTGT GTAAATACGA AAAGAAAATC TTGTGTTGAG CATTTTAAAT 16431 .......... .......... .......... .......... .......... .......... 752 CTCATTTTCA ACTTCCATAA ATCAAGCCAA CAACTAAGAT AAAATACACT CGTGCGAAAT 16371 .......... .......... .......... .......... .......... .......... 752 TGAATTACAT AAGAGAGAAA ACTGGTAACT AAATAACATA CAAGAGAAGG TTGGTAATGA 16311 .......... .......... .......... .......... .......... 
.......... 752 TTCCACCTAA TGCCGACTAC TAACAAGCTG TAGCCTCATC TTTTTGAGTG TTGAAAGAGC 16251 .......... .......... .......... .......... .......... .......... 752 ATCCTTCATG CTAATTCTTG CATCAGGTCT CACTAAAGTG CACTTCAAAG CTAATTCCAT 16191 .......... .......... .......... .......... .......... .......... 752 GACAGATGAC AAACATTGCA TCTTTGCAGC GATTTGTTCA TCTCCGGGCT GTACCAAATT 16131 .......... .......... .......... .......... .......... .......... 752 AGAATCCACC ACCTTGTGAA GTTCACCCGG AAAGGAATCA CTAATCCAGC TTTGTATGCT 16071 .......... .......... .......... .......... .......... .......... 752 CAAGTCTCCA GTAAATATGT CATCACTTGG TCTTGTTCGT GTGAACGTCT CCATCATCAG 16011 .......... .......... .......... .......... .......... .......... 752 GATACCAAAA CTATAAACAT CACAACTCGT GGATACTATT CCATCTTGTC CATACTCTGT 15951 .......... .......... .......... .......... .......... .......... 752 ATTCATGGTC GAAAATGACC ACAAATCAAT TTATACATAG GGGAAAGAAT TGTTTTGAGT 15891 .......... .......... .......... .......... .......... .......... 752 ATTTAAAGGC GATACGAGAA AACTTAAAAT ATACCTGGAG CAATATATCC AATGGTTGCA 15831 .......... .......... .......... .......... .......... .......... 752 ACTGTCCTTG TTTGAACAAA AGCCTCCCCT GCACCTAACA TTTTTGCAAT GCCAAAATCA 15771 .......... .......... .......... .......... .......... .......... 752 CTTACATGAG CAACCATTTC TTCATCTAAC AAGACATTAC TTGGTTTCAA GTCACAATGC 15711 .......... .......... .......... .......... .......... .......... 752 ACTACAGGCG TTGAATAGCC ATTGTGGAGA TAGTTCATTG CAGATGCAAC ATCTATCATT 15651 .......... .......... .......... .......... .......... .......... 752 ACATCCAATC TCTGCAATAA GTTCAAGAAC AAATTGTGAG AGTATAACCA TTTATCAAGT 15591 .......... .......... .......... .......... .......... .......... 752 GTCCCGTTGG GCATGTATTC CAACACTAGG GCCTTGAAAT CAAGATTGGA GCAGCTGGTA 15531 .......... .......... .......... .......... .......... .......... 752 ATGACTTTGG CAAGATTTCT GTGGCGAAGA TTGCGAAGTA TCTCACATTC CGTGTCAAAA 15471 .......... .......... .......... .......... .......... .......... 
752 CTTTTGAATG CACCCTCCAA TTGCACATTG AATACCTTTG CTGCAAAAAT GATACCATCC 15411 .......... .......... .......... .......... .......... .......... 752 TTAAGTACCC CTTTATAAAC CCTGCTGAAA CTCCCATTAC CAAGCAAGTT GGTTTCGTTG 15351 .......... .......... .......... .......... .......... .......... 752 AATCCTTCAG TTGCCTGTTC AAGTTCATAA TAGGAAATTC TTTCATGCCC TCTTACGAGA 15291 .......... .......... .......... .......... .......... .......... 752 GACAGATCCT TTTGACTAGC ATTCTTCTTT GTGTTTCTCA ATCTTAACAC GACAAATCCA 15231 .......... .......... .......... .......... .......... .......... 752 ACAGTCAACA TGAAGAGTGA TCCTATCCCT AATAGAATAT ATAAACCTGT AAGCACTCTT 15171 .......... .......... .......... .......... .......... .......... 752 TTTCTTCTTG ATTTCTTTGT AGATTTGGTT GGGCATGGTT TTACGTTAAA CCGGGAGTCA 15111 .......... .......... .......... .......... .......... .......... 752 CCACAAAGTG CATCATTGGA CAAGAAAGAC TGACTGGTTA CATTTGCAAA GGGACCATCA 15051 .......... .......... .......... .......... .......... .......... 752 GTGGGAATTT CTCCACTGAG TTCATTGAAA GAGAAATTTA GGTATTTGAG ATACACAAGA 14991 .......... .......... .......... .......... .......... .......... 752 GCTTCTAATG ACTTTGGAAT TTCACCGCTA AGATTGTTAT AGGACAAATC CAAGTATTCC 14931 .......... .......... .......... .......... .......... .......... 752 AATGCCAACA ATTTGTCAAA TGATTCAGGA ATAGGCCCTT CTAATCTATT ATGTGCTAGA 14871 .......... .......... .......... .......... .......... .......... 752 GAAAGATAAA TTAATTTATC TAGGCCCCCT AGAGTACTAG GAATCTTACC AGAAAAATAA 14811 .......... .......... .......... .......... .......... .......... 752 TTATTTGACA GATCAATCAG TGTTGCACCC TTCAAGTTTC CGCTCTCTAG CGGAATTTCT 14751 .......... .......... .......... .......... .......... .......... 752 CCACTCAAAT AATTGGATGA AATATTGAAC TCTATGATGT TTTGAAGTCC CCCCAATCTT 14691 .......... .......... .......... .......... .......... .......... 752 GCAGGTAATC TAGAATCCAG CTTGTTGTTA TCTAGATAGA GTGTCCTCAA ACTAGTAACA 14631 .......... .......... .......... .......... .......... .......... 
752 TTCCCTAAGC ATGGTGGAAC GGAACTAGAA AATTGATTTT CTGACAATTC TAATGCACCA 14571 .......... .......... .......... .......... .......... .......... 752 AGATACTGTA AACTGCAGAT AACATCTGGT ATGGCTCCTT CTAACTTGTT GCTTCCTAGG 14511 | .......... .......... .......... .......... .......... .........G 753 TAAAGTTCTT GAA 14498 | | || || ||| TCTATTTATT GAA 766 hqPGS_C06HBa0054K13.1-16-_SGN-U334483+ (17995 17985,17794 17517) ******************************************************************************** EST sequence 4 -strand 785 n (File: SGN-U313611-) 1 CTTGATTCAG AAGTTAGCCC CACCAGCGGC AAAATTGACG ATAATCGAGT CACTGGGAAA 61 ACAAAAACGC ATAACAAACA TTCCCAATGT ACCCAAGCCT CAATGCAACA CTTCTAAACT 121 TCGCCATCCA AACTTTTTCG GCGGGATTAT GCATATGAAG ATTACCATAG CTGATGTAGA 181 TGTAGTTGGC TAAAGACCAA ATAAGAAGAA CGGTAAACAT CGCAGCGAAA ATAAGTTCAA 241 CCGCGTTTAC AATTCCAAGA GATTCAATTA CAAGCCGTGG CCGTCTAAAA TATTCCCGCA 301 CTTTATTACT ACTCCTAACA ATATAATTGT ATGTTAGTTA AAAGAGATAT ATCAAAGATA 361 GATCATTAAT AATTAGTATG TAATGTTAGG ATCGAAAATA AGCAGGTGTA AACGCGGAAG 421 CTAGCAAAGC AAACCTCAAA AGACTACGAG TAAGAAGACA ACGAGAAATA TACCCAAAGA 481 CACAAAGATT TAACGTGGTT GCAAATAAAA CTCAACATGT AATTGATTAA CCTGTGATTA 541 GAAGTTGACT TGTTGTAAAG ATGAAGATAA ACACAGCTCA CAGCAGCTAT CAACATTATG 601 GGAAATGTAA ATAAAAGAAG ATTTATCCCT TGTTCCCTGA AATATGTAGA GTTGAGTTTG 661 ATTAGTAGAT GTGGAGTCCA TGAATTTTTG TAAGTAGGTG TAGGTAACAT TACCCATATG 721 AAAAGCCATC CAACAAACAC CAAAACCACA AAACCATTCA AAATTGTCTT GCTCCCCATA 781 TTTTT Predicted gene structure (within gDNA segment 22379 to 12786): Exon 1 20786 20741 ( 46 n); cDNA 308 351 ( 44 n); score: 0.717 Intron 1 20740 20380 ( 361 n); Pd: 0.000 (s: 0.72), Pa: 0.000 (s: 0) Exon 2 20379 20354 ( 26 n); cDNA 352 376 ( 25 n); score: 0.692 Intron 2 20353 18444 (1910 n); Pd: 0.312 (s: 0), Pa: 0.908 (s: 0) Exon 3 18443 18430 ( 14 n); cDNA 377 389 ( 13 n); score: 0.643 Intron 3 18429 17781 ( 649 n); Pd: 0.123 (s: 0), Pa: 0.978 (s: 0.96) Exon 4 17780 17670 ( 111 n); cDNA 390 500 ( 111 n); score: 0.946 Intron 4 17669 17347 
( 323 n); Pd: 0.000 (s: 0.94), Pa: 0.000 (s: 0) Exon 5 17346 17327 ( 20 n); cDNA 501 520 ( 20 n); score: 0.800 Intron 5 17326 15196 (2131 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0) Exon 6 15195 15183 ( 13 n); cDNA 521 533 ( 13 n); score: 0.769 Intron 6 15182 14687 ( 496 n); Pd: 0.978 (s: 0), Pa: 0.938 (s: 0) Exon 7 14686 14677 ( 10 n); cDNA 534 542 ( 9 n); score: 0.800 Intron 7 14676 13975 ( 702 n); Pd: 0.000 (s: 0), Pa: 0.976 (s: 0) Exon 8 13974 13938 ( 37 n); cDNA 543 575 ( 33 n); score: 0.676 MATCH C06HBa0054K13.1-16- SGN-U313611- 0.946 277 0.353 C PGS_C06HBa0054K13.1-16-_SGN-U313611- (20786 20741,20379 20354,18443 18430,17780 17670,17346 17327,15195 15183,14686 14677,13974 13938) Alignment (genomic DNA sequence = upper lines): ACTAGTTCAA ATAAGACAAT TTTGTGTTGG TTATAAATAG TTTATATTAC AAAAATTCGT 20727 |||| | | | | || | ||| | | |||| | ||| ||| || |||| ACTACTCCTA ACAATATAAT TGTATGTTAG TTA-AAAGAG -ATATA.... .......... 351 TTCAAGTGTT CAAATTTTAG AAGAAAGAAA TTCACTAATC TCACCACATT GATTACTAAT 20667 .......... .......... .......... .......... .......... .......... 351 ATTAGAGAAT GAATATATAT AATCTAAATT CTCCCCTTCC AAGTATATGA GATATTTACC 20607 .......... .......... .......... .......... .......... .......... 351 CGCGGCTGAG ATGAGACCAC GACTAATAAG GTGGAGAAGA AGATATCAGG CACAAAACTG 20547 .......... .......... .......... .......... .......... .......... 351 AATAAATTAA TTTATATGGT GATAATATGT ATCTTTAATC CTAACATTTA ACGTAATATT 20487 .......... .......... .......... .......... .......... .......... 351 AATTTATTTA GCTTTGTCAC AGTGAAATTA TTTTTTCATG TGGAGAGCTA CATGACAAAA 20427 .......... .......... .......... .......... .......... .......... 351 ATTTTAAATA GATAAGAGAG TGTATTAACC ACAACATTAT AAATTAGTCA AGGTGTGTTT 20367 ||| | | | | .......... .......... .......... .......... .......TCA AAG-ATAGAT 363 CACTAATCAT TAGGTGATTG TTTAGGTCGA AAACTCATAC TTTCAATAGT TCAAATATAA 20307 || |||| || ||| CATTAATAAT TAG....... .......... .......... .......... .......... 
376 AACTTAAAAA CTGATAAAGT TAAGATGTAA ATAATTCTGA GGGAAAATGC ACAGGTACCC 20247 .......... .......... .......... .......... .......... .......... 376 CCTCAACCTA TGTCCGAAAT TTCAGAGACA CACTTATACA ACACTAAGGT CCTATTACCC 20187 .......... .......... .......... .......... .......... .......... 376 CCTCAACTTA TTTTATAAGT AATTTTCTAT CCCTTTTCGA CCTATCGGAC ATAGGTTGAG 20127 .......... .......... .......... .......... .......... .......... 376 GGGTACTTGT GCATTTTTTC CCTGAACTTA TTTTATGATT GTTTCTCCTG TAAATATTTC 20067 .......... .......... .......... .......... .......... .......... 376 ATCACTTGGT TTCCTTCTTG TGAACCTCTC CATCATTAGG ATGCCAAAAC TATAAACATC 20007 .......... .......... .......... .......... .......... .......... 376 ACAACTCGTG GATACTATTC CATCTTGTCC ATACACTGTA ATCTTGGTAA AAAATGATCA 19947 .......... .......... .......... .......... .......... .......... 376 TGAAAGCTTC CTAGTTAAAA TGAAATATGC CAAAACTGAG AGTGTCAATC CATTTATAGG 19887 .......... .......... .......... .......... .......... .......... 376 GGAAAGAATT GCTTAACTAT AGAAGTTTCT AATATACTGG AGCAATATAT CCGATGGTTG 19827 .......... .......... .......... .......... .......... .......... 376 CAGTTGTTCT TGTTTGAACG AAAGCCTCCC CTACATCTAA CAATTTTGCA ATGCCAAAAT 19767 .......... .......... .......... .......... .......... .......... 376 CACTAACTTG ACCAACCATT TCTTGATCTA GCAACATATT GCTTGGCTTC AAGTCACAAT 19707 .......... .......... .......... .......... .......... .......... 376 GCACCACAGG CGTTGAATAG CCATTGTGGA GATAGTCCAT TGCAGATGCA ACATCTATCA 19647 .......... .......... .......... .......... .......... .......... 376 TTATATCCAA TCTCTGCAAT AAGTTTAAGA ACAAGTTGTG AGAATATAAC CATTTATCAA 19587 .......... .......... .......... .......... .......... .......... 376 GTGTCCCATT GGGCATGTAT TCCAACACCA GGACCTTGAA ATCAAGGTTG CAGCAGCTTG 19527 .......... .......... .......... .......... .......... .......... 376 TTATGACTTT GGTCAGATTT TTGTGGCGAA GTTCCGGAGG GATTCGGCTT CTCTCCATTT 19467 .......... .......... .......... .......... .......... .......... 
376 TCCCCAAGTT TCTATTTGGA TTAATGACAC ATGTCATAGG TTAAAATAAA TGATTAAGAT 19407 .......... .......... .......... .......... .......... .......... 376 TTAATTTTTC AAACTAAACC TCTAAGCATG ATTTAGTTAA TAAATATATT ATATATCAAG 19347 .......... .......... .......... .......... .......... .......... 376 TATCAAAATT ATTATAATCT CACAATTTTA GCAACATATT ATTTTGTCCA TATTATTTAT 19287 .......... .......... .......... .......... .......... .......... 376 GTAAAAAAGT TTTCTTCCTA TTATTTTTTT TACTTAGGAT CTTTTTTTTA TTTTTCTTTT 19227 .......... .......... .......... .......... .......... .......... 376 ATATAATATT TTTTATTGTA ATCTTTTGTT AAAGGTATAT ATTGGTCCGT GATTAATAAA 19167 .......... .......... .......... .......... .......... .......... 376 TTACCAAGAG ATAATCAAAC CATTTTGACA AACTAATTTT AATTTCATAA TAAGATTAAG 19107 .......... .......... .......... .......... .......... .......... 376 AATGGGGTGA TACCAGTACA TACGGTAGGT TATCTATAGT TTCCATACTT CTATTCAGTT 19047 .......... .......... .......... .......... .......... .......... 376 AATTATTCTT CTAATTAGAT AAAAAAAAAT ATTATATAAA AGGAAAAAGA AAAGATCCTA 18987 .......... .......... .......... .......... .......... .......... 376 AGTAAAAAGA AATAATAGGA AGAAAACTTT TTTTACCTAT ATTTATGGAC AAAATATAGT 18927 .......... .......... .......... .......... .......... .......... 376 CACAAGCATG ATATTATTCA ACATTTTAAC ACTCCCACTC AATTACTAAA AACAGCGAGA 18867 .......... .......... .......... .......... .......... .......... 376 GGAAGTTGTA ATTTGGAGAG GTTGGGAATG CGAAGTTTCA GAAGGAAAAA GTTGACAAGG 18807 .......... .......... .......... .......... .......... .......... 376 AATAATGTTC CTCCTTGATT TCCTCTATGT TAAATACAAC TTCTGGACTT TCCGTTTTTC 18747 .......... .......... .......... .......... .......... .......... 376 TTGTCTTTCA GCTTAAATAT TATTTAAGTT GTTGTTTATT AGGAACAGAA AGTTCAGGTA 18687 .......... .......... .......... .......... .......... .......... 376 GTTGTGCATT ATGATCATTT CAGAAAGATT AAAATTCCTC CTTTATTTCT GTACAATTCC 18627 .......... .......... .......... .......... .......... .......... 
376 TTGTCATTAT GAACTAAACT AGTTTTGGTT AGCTTGGTTT GTGTTTACGA ATTTTAGGTT 18567 .......... .......... .......... .......... .......... .......... 376 GGAGATAATC ATGTCATCAG GCCCTAAACT CTAATAGTCA GCACTCTATG CAGCTAAGTG 18507 .......... .......... .......... .......... .......... .......... 376 AATGAGGGAA CGTACATTTA GCAACCAATA AAGACAATCA CAGTCCTGTG CAACAATATG 18447 .......... .......... .......... .......... .......... .......... 376 CAGATATGTG ATTGCAGGCA ACAACTAGTT GCAGTCTGGT TAGCAGGGAT ATTCAAGTTG 18387 ||||| || || ...-TATGTA ATGTTAG... .......... .......... .......... .......... 389 TTGATAATCT CGGAACATAG ATTGTTGTCA TATGCGTTCC AGCAGAGAAT TGATAAACAA 18327 .......... .......... .......... .......... .......... .......... 389 AGTCACACAC ATGTAGCACA GTCATGCATT GAAAATGGCC ATATTATGCA GACTTGGAAT 18267 .......... .......... .......... .......... .......... .......... 389 GTAGGAGTAC AACTTACATT TTGAGGATAT ATACTAAAAT CAACAAGTTT ATAAAGTCAC 18207 .......... .......... .......... .......... .......... .......... 389 AAACAACAGA GCATTTCAGA TGTCTATATC TGTGAAAACA ATTGATAGTT GAGTGGCTAT 18147 .......... .......... .......... .......... .......... .......... 389 CAAAGTTATT AACTCATCAA TGAACTCACA GAAAAACTTT CGATTAACTA TAATCGACTA 18087 .......... .......... .......... .......... .......... .......... 389 TCTCTACTCT CCAAGAAGTC AAATCCACTC CTTGTCACAA ACCTCTCAAA GACTCAACTC 18027 .......... .......... .......... .......... .......... .......... 389 TACAAGAAGC CAAATCCACT CCTCCTACGA TAATGACTCT AAGTACGAAT CAAAACACTC 17967 .......... .......... .......... .......... .......... .......... 389 TACTCATGAC AAAAGAAATT CTAACTATAG AACTTCTTAA CTTATATTTC TATGATCTCT 17907 .......... .......... .......... .......... .......... .......... 389 CAAATTGATT GGCCTTTGTG TGTTAGCTCT CTTTTCTTTC TCTCGTTTTT GAAATCTTTT 17847 .......... .......... .......... .......... .......... .......... 389 CTCCCAGCTC TTTTTGAAAT CTGATTGCAT GTCTATCGTC AACTTCGTAA ACCATCATTT 17787 .......... .......... .......... .......... .......... .......... 
389 TGTTAGGACC GAAAATCAGC AGGTGTAAAC GCGGAAGCTA GCAAAGCAAA CCTCAAAAGA 17727 || | |||||| ||| |||||||||| |||||||||| |||||||||| |||||||||| ......GATC GAAAATAAGC AGGTGTAAAC GCGGAAGCTA GCAAAGCAAA CCTCAAAAGA 443 CCACGAGTAA GAAGACAACG AGAAATATAC CAAAACACAC AAAGATTTAA TGTGGTTCGG 17667 | |||||||| |||||||||| |||||||||| | ||| |||| |||||||||| |||||| CTACGAGTAA GAAGACAACG AGAAATATAC CCAAAGACAC AAAGATTTAA CGTGGTT... 500 TCAATCGACC TACGTCCACA AAGGAGATGA GCAATCCACT ATAAATATGA GAGTACAAAA 17607 .......... .......... .......... .......... .......... .......... 500 TACAGAGAGA AACAACCTCA ACCAATTCAC TCGGAATACA TGGGAGGTTC ACACAAGTGA 17547 .......... .......... .......... .......... .......... .......... 500 TAATGTATCC AACTTGTGAC CCATAAATTC TCTCCCTAAC CAAAACTCTC AAAGCTCTTA 17487 .......... .......... .......... .......... .......... .......... 500 AGACTACATT GTGAATGCTG ACTAAGTTAG AAGGAACATT TCTCTATTTA TAGAGTCCTA 17427 .......... .......... .......... .......... .......... .......... 500 AACCTTTTCC TACAAGAAAA TGATTAGTCA ATCAAAACCT TCCCCTTAAA GGAAAACCTA 17367 .......... .......... .......... .......... .......... .......... 500 TTTATGATAA GAAATTTAGG GAAAATAAAA CCCAACACAT TTTCCCTCTT TTTTGATGAT 17307 | |||||||| | ||||| | .......... .......... GCAAATAAAA CTCAACATGT .......... .......... 520 GACAAATCGT CTACTCCTAT TTCCTGTATA AGGTCAATAC ACACAGTACA CAAGAGCAAT 17247 .......... .......... .......... .......... .......... .......... 520 ACATATCAAA AAGAAGAGAA AAGCGTTAAA ACAGTAAACC AAGCCAACGA AAACGCAGAG 17187 .......... .......... .......... .......... .......... .......... 520 CTCCGGAAGA GCAACATCGC GTAGAACACA ACAAGATACC TCCAATCTAA TGATATTAGC 17127 .......... .......... .......... .......... .......... .......... 520 CATCTGTAAA CATATGCAAA AACATGCAAG AACGAAGGAA GCAGAGGATC CGAGATACAC 17067 .......... .......... .......... .......... .......... .......... 520 CAACATTTTA AGCTCACAAA ATATTCAGTC TAAGCACAAA ACTCAAAAGA ACATAACAAT 17007 .......... .......... .......... .......... .......... .......... 
520 GTCTTCAGTC CAAAAGTATA AGAAACTAGA CCAGAAAGCA GGGATCAAGA AGAGGAATGA 16947 .......... .......... .......... .......... .......... .......... 520 TTGGGGTTTG GAGGACAGGA TGAAGAAGCA AGTAACTGGA GGATCCAGTC CACTCTAGCG 16887 .......... .......... .......... .......... .......... .......... 520 TTGCTATCAA GCTTCTTCTG TACTAACTGA TCCTTCAATC AACCAACTTC TTCTTTAGAT 16827 .......... .......... .......... .......... .......... .......... 520 GCAGTTAAAT CAGTATGCAT TTTCAGCCAG TAGATTTTCA CACTTTTCCC TTGCAATCTT 16767 .......... .......... .......... .......... .......... .......... 520 AAGATCCCAA ATTAGTAGAC GAGTAACAGG TCCAGAAGCC CGCGGAGATG CAATAGGAAC 16707 .......... .......... .......... .......... .......... .......... 520 CTGCACGAGT TAAAGCTCTT CCCTCACCCA ATGGGCGCAC AGCAAACTCA TTGAACACTA 16647 .......... .......... .......... .......... .......... .......... 520 AAAGGTTTAA ACTGCATCAA ATTTTGGAGG CAAGTTGCTT TTGATTCTTA TGTAAGTTTG 16587 .......... .......... .......... .......... .......... .......... 520 ACGACAAGGA ATTTAGGCAG CTACTCCCGG AGACTACACT TAGATGGCTG GAAGGAGAGG 16527 .......... .......... .......... .......... .......... .......... 520 ACTTGATAAA ACTGTTACAG TTGTGGTGAT ATAATTGGAA GCATTCTCAA AGTTGTGTAA 16467 .......... .......... .......... .......... .......... .......... 520 ATACGAAAAG AAAATCTTGT GTTGAGCATT TTAAATCTCA TTTTCAACTT CCATAAATCA 16407 .......... .......... .......... .......... .......... .......... 520 AGCCAACAAC TAAGATAAAA TACACTCGTG CGAAATTGAA TTACATAAGA GAGAAAACTG 16347 .......... .......... .......... .......... .......... .......... 520 GTAACTAAAT AACATACAAG AGAAGGTTGG TAATGATTCC ACCTAATGCC GACTACTAAC 16287 .......... .......... .......... .......... .......... .......... 520 AAGCTGTAGC CTCATCTTTT TGAGTGTTGA AAGAGCATCC TTCATGCTAA TTCTTGCATC 16227 .......... .......... .......... .......... .......... .......... 520 AGGTCTCACT AAAGTGCACT TCAAAGCTAA TTCCATGACA GATGACAAAC ATTGCATCTT 16167 .......... .......... .......... .......... .......... .......... 
520 TGCAGCGATT TGTTCATCTC CGGGCTGTAC CAAATTAGAA TCCACCACCT TGTGAAGTTC 16107 .......... .......... .......... .......... .......... .......... 520 ACCCGGAAAG GAATCACTAA TCCAGCTTTG TATGCTCAAG TCTCCAGTAA ATATGTCATC 16047 .......... .......... .......... .......... .......... .......... 520 ACTTGGTCTT GTTCGTGTGA ACGTCTCCAT CATCAGGATA CCAAAACTAT AAACATCACA 15987 .......... .......... .......... .......... .......... .......... 520 ACTCGTGGAT ACTATTCCAT CTTGTCCATA CTCTGTATTC ATGGTCGAAA ATGACCACAA 15927 .......... .......... .......... .......... .......... .......... 520 ATCAATTTAT ACATAGGGGA AAGAATTGTT TTGAGTATTT AAAGGCGATA CGAGAAAACT 15867 .......... .......... .......... .......... .......... .......... 520 TAAAATATAC CTGGAGCAAT ATATCCAATG GTTGCAACTG TCCTTGTTTG AACAAAAGCC 15807 .......... .......... .......... .......... .......... .......... 520 TCCCCTGCAC CTAACATTTT TGCAATGCCA AAATCACTTA CATGAGCAAC CATTTCTTCA 15747 .......... .......... .......... .......... .......... .......... 520 TCTAACAAGA CATTACTTGG TTTCAAGTCA CAATGCACTA CAGGCGTTGA ATAGCCATTG 15687 .......... .......... .......... .......... .......... .......... 520 TGGAGATAGT TCATTGCAGA TGCAACATCT ATCATTACAT CCAATCTCTG CAATAAGTTC 15627 .......... .......... .......... .......... .......... .......... 520 AAGAACAAAT TGTGAGAGTA TAACCATTTA TCAAGTGTCC CGTTGGGCAT GTATTCCAAC 15567 .......... .......... .......... .......... .......... .......... 520 ACTAGGGCCT TGAAATCAAG ATTGGAGCAG CTGGTAATGA CTTTGGCAAG ATTTCTGTGG 15507 .......... .......... .......... .......... .......... .......... 520 CGAAGATTGC GAAGTATCTC ACATTCCGTG TCAAAACTTT TGAATGCACC CTCCAATTGC 15447 .......... .......... .......... .......... .......... .......... 520 ACATTGAATA CCTTTGCTGC AAAAATGATA CCATCCTTAA GTACCCCTTT ATAAACCCTG 15387 .......... .......... .......... .......... .......... .......... 520 CTGAAACTCC CATTACCAAG CAAGTTGGTT TCGTTGAATC CTTCAGTTGC CTGTTCAAGT 15327 .......... .......... .......... .......... .......... .......... 
520 TCATAATAGG AAATTCTTTC ATGCCCTCTT ACGAGAGACA GATCCTTTTG ACTAGCATTC 15267 .......... .......... .......... .......... .......... .......... 520 TTCTTTGTGT TTCTCAATCT TAACACGACA AATCCAACAG TCAACATGAA GAGTGATCCT 15207 .......... .......... .......... .......... .......... .......... 520 ATCCCTAATA GAATATATAA ACCTGTAAGC ACTCTTTTTC TTCTTGATTT CTTTGTAGAT 15147 ||| || | |||| .......... .AATTGATTA ACCT...... .......... .......... .......... 533 TTGGTTGGGC ATGGTTTTAC GTTAAACCGG GAGTCACCAC AAAGTGCATC ATTGGACAAG 15087 .......... .......... .......... .......... .......... .......... 533 AAAGACTGAC TGGTTACATT TGCAAAGGGA CCATCAGTGG GAATTTCTCC ACTGAGTTCA 15027 .......... .......... .......... .......... .......... .......... 533 TTGAAAGAGA AATTTAGGTA TTTGAGATAC ACAAGAGCTT CTAATGACTT TGGAATTTCA 14967 .......... .......... .......... .......... .......... .......... 533 CCGCTAAGAT TGTTATAGGA CAAATCCAAG TATTCCAATG CCAACAATTT GTCAAATGAT 14907 .......... .......... .......... .......... .......... .......... 533 TCAGGAATAG GCCCTTCTAA TCTATTATGT GCTAGAGAAA GATAAATTAA TTTATCTAGG 14847 .......... .......... .......... .......... .......... .......... 533 CCCCCTAGAG TACTAGGAAT CTTACCAGAA AAATAATTAT TTGACAGATC AATCAGTGTT 14787 .......... .......... .......... .......... .......... .......... 533 GCACCCTTCA AGTTTCCGCT CTCTAGCGGA ATTTCTCCAC TCAAATAATT GGATGAAATA 14727 .......... .......... .......... .......... .......... .......... 533 TTGAACTCTA TGATGTTTTG AAGTCCCCCC AATCTTGCAG GTAATCTAGA ATCCAGCTTG 14667 || || |||| .......... .......... .......... .......... GTGAT-TAGA .......... 542 TTGTTATCTA GATAGAGTGT CCTCAAACTA GTAACATTCC CTAAGCATGG TGGAACGGAA 14607 .......... .......... .......... .......... .......... .......... 542 CTAGAAAATT GATTTTCTGA CAATTCTAAT GCACCAAGAT ACTGTAAACT GCAGATAACA 14547 .......... .......... .......... .......... .......... .......... 542 TCTGGTATGG CTCCTTCTAA CTTGTTGCTT CCTAGGTAAA GTTCTTGAAG GTTCAGCATT 14487 .......... .......... .......... .......... 
.......... .......... 542 CCTTGCACTG TTTTTGGAAT ATGACCTATC AACTGATTGT TCGACAGACT CATCCTTGTC 14427 .......... .......... .......... .......... .......... .......... 542 AATCCAGTAA GATTAGTAAT TTGTTTTGAA ATGACACCCT TCAGTTTACA TTTAGATGCT 14367 .......... .......... .......... .......... .......... .......... 542 TCAAAAATTT GCAAGGAGTT TGAGAAATTA CCAACAGATG CAGGCAAAAC ACCATCCAAC 14307 .......... .......... .......... .......... .......... .......... 542 GGATTACCAC CAAGCGTGAG TACTCTTAGA TTCCTACAGT TTGTCAATGA TTCAAGGAAG 14247 .......... .......... .......... .......... .......... .......... 542 CTCAATGTTG AATCGCTGAC AAAATTATTC CCCCACAAGT TGAGAACCTC AAGGTATTCT 14187 .......... .......... .......... .......... .......... .......... 542 AAGTTACCAA GTGATTTTGA AATTAGACCT GTGAAACTGT TGCTGGAGAG GTCTAACATT 14127 .......... .......... .......... .......... .......... .......... 542 GTGAGTCTTG AAGAATTAGA GATTGAATCA GAGATAAAAC CACTCAGATT ATTTCCTCCA 14067 .......... .......... .......... .......... .......... .......... 542 CAATAAAATA CTTGTAGGTC GGGCATTCCA CGACCTAAAT CTGAAGGCAG AGTACCTGTA 14007 .......... .......... .......... .......... .......... .......... 542 AGCTTGTTTT GTCCAAAATC TATTTTCTGC AGTGTTGACA TGTTGAAAAT GCTGTCAGGG 13947 |||||| ||||| ||| | || | | .......... .......... .......... 
..AGTTGACT TGTTGTAAA- GATG--A-AG 566 ATAGAGCCA 13938 ||| | || ATAAACACA 575 hqPGS_C06HBa0054K13.1-16-_SGN-U313611- (20786 20741,20379 20354,18443 18430,17780 17670) ******************************************************************************** EST sequence 13 -strand 901 n (File: SGN-U341191-) 1 AATTTCCCTT AAGGGTTTTG GGCAAATCCT GGATCTTTCC CCTTTTCCCT TCTTCTCTTA 61 AAATTTTCTT CCCAATTCTT AAGCAAAAAA AAAATTTTCA ATTATGATCT TAAACTAGTT 121 TTTACCCCTG ATAAATCCAT AAAATGAAAT AGAAATTTAT TGGGTGAAAA GACTAGTTTT 181 CCTTTCCTTA ATTCTGGATT AGAACCTTAC TCATTCTAAC AACCCAACTT CGAATAGACA 241 TATCTCCATC CTACTATATT GAAAATGTGC AAGCTTGACG GTGTTGGAAA TATCTCTCCA 301 TGGGCTTTCC AACCATAATA ATAACTAGCA CTAATCTTGA ATCAGAAGTT ATGACCGTTT 361 GAAAATGACC GAATCTCACT TTTTTAACTT AAGAAATTTT CTTGATTTTT CCTTTTCTTT 421 CAAAAAATAA TTGGTTTTAG TTTCTTTGCT ATTTCAGGTT ACGAGATGTC ACAGTTTTAC 481 TAATATTCAT GACCTCTATC ATGACCCAAT GTGTTTGTTT TTTTACAGAG TAGGTTCTCT 541 TACATTTGAA TCCATGAATC TTGTTTACTT TGTTGCAGTT AGTCCAAACG ATAACTACTT 601 TGAAAGTGCA TCTGATAAAT GGATGAATAT ATGATCAGTG AAGAAGGCCA AGATTGTAAA 661 AACTTATAAT GGCGATGACA GTACTTTTAA AGTGTGTTGG AACAAAGAAG ATAACAAGGT 721 TTCAACAGTT AGGATCGGAA ATAAGCAGGT GTAACACGGA AGCTAGCAAA GCAAACCTTG 781 AAAGACCACG AGTAAGAACA CAACGAGAAA TATACCAAAA GCCTCGTGCC GAATTCTTGC 841 AGCCCGGGGG ATCCACTAGT TCTAGAGCGG CCGCCACCAC GGGTGGAGTG CATATATTCT 901 A Predicted gene structure (within gDNA segment 25925 to 16282): Exon 1 23207 23196 ( 12 n); cDNA 626 637 ( 12 n); score: 0.833 Intron 1 23195 22691 ( 505 n); Pd: 0.000 (s: 0), Pa: 0.091 (s: 0) Exon 2 22690 22656 ( 35 n); cDNA 638 669 ( 32 n); score: 0.714 Intron 2 22655 19103 (3553 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0) Exon 3 19102 19083 ( 20 n); cDNA 670 689 ( 20 n); score: 0.600 Intron 3 19082 18444 ( 639 n); Pd: 0.968 (s: 0), Pa: 0.908 (s: 0) Exon 4 18443 18430 ( 14 n); cDNA 690 702 ( 13 n); score: 0.643 Intron 4 18429 18345 ( 85 n); Pd: 0.123 (s: 0), Pa: 0.818 (s: 0) Exon 5 18344 18311 ( 34 n); cDNA 703 732 ( 30 n); score: 0.706 Intron 5 
18310 17781 ( 530 n); Pd: 0.000 (s: 0), Pa: 0.978 (s: 0.86) Exon 6 17780 17692 ( 89 n); cDNA 733 820 ( 88 n); score: 0.910 MATCH C06HBa0054K13.1-16- SGN-U341191- 0.910 204 0.226 C PGS_C06HBa0054K13.1-16-_SGN-U341191- (23207 23196,22690 22656,19102 19083,18443 18430,18344 18311,17780 17692) Alignment (genomic DNA sequence = upper lines): AATATATCAT GAGTACTATG TTTGGCGGAA AAGAGAAAAA GGAAGAAAAT TTCATACTAC 23148 ||||||| || | AATATATGAT CA........ .......... .......... .......... .......... 637 CTTCCAAATG CAATTCTGAA AGATATAAAT ATGTAAGAGC TGTTAAGTTG GCTAGCTCTC 23088 .......... .......... .......... .......... .......... .......... 637 TTGGTAAAGT TCCAATAAAC TTATTTTCAC CCAATTGCAA TTTTTGAAGC TTTCTGCATT 23028 .......... .......... .......... .......... .......... .......... 637 TCTCCAGGTT TGGTGGAATA ACTCCATCTA GGGAGTTTTT ACTGAGGTAA AGTCCTTCCA 22968 .......... .......... .......... .......... .......... .......... 637 AGTCTCGAAG ATGATCACAT ATCGTTTTTG GAAGATTTCC AGTAAGATTG TTGCCCGTAA 22908 .......... .......... .......... .......... .......... .......... 637 GAGCAATCAC ATGCATTGTA GTAATGTTAA AAATGGATGG TGGTATAGAG CCACTAAGCT 22848 .......... .......... .......... .......... .......... .......... 637 GATTAATTTG CAGGTCTAGG ATAGTCAAGT AACGAAGATC ACCGATTTCT CGAGGGATCT 22788 .......... .......... .......... .......... .......... .......... 637 CTCCTTCAAG AAAATTTCTA TCCAAGTATA ACCTTTGCAG CTTTGTTATA TTGGAAAGGG 22728 .......... .......... .......... .......... .......... .......... 637 AGGATGGAAT TTTCCCAGAA AATTGGTTGC TTGATAGGTG CACAAAGCGT AGGTTTGGTA 22668 ||| | || || | | || ||| .......... .......... .......... .......GTG AAGAAGGC-C AAGATT-GTA 658 ACAAACTTAA AAATGATGGA ATGGCTCCGG TGAAGTTATT GCTTGTGACA TTAATCGATT 22608 | ||||||| || A-AAACTTAT AA........ .......... .......... .......... .......... 669 TCAACCTCTG CAGACGAGCC AATTCTTGTG GCAAATCTCC ATGAAAAGTG TTGTTACTGA 22548 .......... .......... .......... .......... .......... .......... 
669 TGTCAAGGGA AGAAAGAAAT GACAGGTTTC CGAGGTGTGG AGGAATGGTA CCATGAAGTT 22488 .......... .......... .......... .......... .......... .......... 669 GCATGCTTGA AATGTCTAAA GCAGTGACTC GATGGTGTCG AGAGCTGCAA GTGATTCCAA 22428 .......... .......... .......... .......... .......... .......... 669 TCCAACTACA AACGGGGACG GAAGAAGACC AGTTTGTTGC TAAGATATCG TTAGAAATAT 22368 .......... .......... .......... .......... .......... .......... 669 GTGATTTAAG TGCAAGAAGA GCAGCTTCAT CAGTGCTAAT ATTATTAGCA TTGGCATGTA 22308 .......... .......... .......... .......... .......... .......... 669 GTAATAGTAG AATGAAAATT ATAAGAAAAA AGAGGAGAGT GGAACTTCTG TCCATAACTA 22248 .......... .......... .......... .......... .......... .......... 669 ATATAGATAT CAAATTTAAT GATACCTATA ATGTTGTGGT ATTCTATTTA TAACACTACC 22188 .......... .......... .......... .......... .......... .......... 669 ATGTGCGTTG CACGTGATTT CTAATTTCTG AAAATCAATT TAAGTACTTT GTTTTTCTCC 22128 .......... .......... .......... .......... .......... .......... 669 CCTTGATTTC ACGAAACCAA TTACGATAAG TAAAACTTTT TAGAGAATTT GTAATTTTTT 22068 .......... .......... .......... .......... .......... .......... 669 AAAATAGGGA AAAGGATCTA AAATATATTC TAATTTTGAT CGAAATTGTT GTAACATTCG 22008 .......... .......... .......... .......... .......... .......... 669 AAAACTTTTT TCTCCAACAA CCTTATCTAA TTAAGGCTCC CTCCCTTCAT TCAATATATT 21948 .......... .......... .......... .......... .......... .......... 669 GACTACTTTC TTTCATATGT CGTGGAAAGT CTATCTTTGA CACTTATCAG TCATCTAATC 21888 .......... .......... .......... .......... .......... .......... 669 TGATAAAAAA CTAGTTGTAT GTGTTATAGA TAATAAAAAA ATATGAAAAA AAAATGGAAA 21828 .......... .......... .......... .......... .......... .......... 669 GGTTGAAATG AATGAATACA TAAAATTGTA GAAAGCTAAA AAGAAAATAA ATTCATACCA 21768 .......... .......... .......... .......... .......... .......... 669 TGCAAATAAT TGAATCCAAT AAGGGTGAGG GCCTCCATAT GAGTAAATTA ATTCATCGTC 21708 .......... .......... .......... .......... .......... .......... 
669 AAACATATTT TTTTGTATTT CAATGACTAA TTTATAATTA TTATTTTGAT AATCAAATTT 21648 .......... .......... .......... .......... .......... .......... 669 ATTTATGTTT CACTAATATT CTTGTAAAAC TTATTGTAGA TGACCAAATT TTTTCTTCGA 21588 .......... .......... .......... .......... .......... .......... 669 ATACGAAATT AAATTACAAT ACACAGTAAA AAAATAGTTT AATTTTTTTA TTTAAACTAA 21528 .......... .......... .......... .......... .......... .......... 669 GGAATGAAAG AAAAAAACAA AATAAGAATA AGAAACTCAA ATAATTATAA TAAATGAAGT 21468 .......... .......... .......... .......... .......... .......... 669 CAAAAAATAA TTTATGTATG AAAAAAATTA AAATATACAT TGAACTTTGA TAGAAGAATC 21408 .......... .......... .......... .......... .......... .......... 669 ATATATATCT CTAAATAATT TTTTTTAAAA AAAATTAAAA GTAATAAATA TAAATTTAAA 21348 .......... .......... .......... .......... .......... .......... 669 ATAAATTTTT TAACTTCCGT TAAATGAAGG GTATATGTGA GTCATTTTAT AACAGCAGGG 21288 .......... .......... .......... .......... .......... .......... 669 GTAAATGTGA GCCGTTTGTA TAACGGTAAG GGCATATATG AGCCACTTTT ATAACGAGGG 21228 .......... .......... .......... .......... .......... .......... 669 GTATATGAGC TCCAAATGAC AAAGTTGAGG GGTATATCAG ACCTTTTTCC CTTTTATATA 21168 .......... .......... .......... .......... .......... .......... 669 TAACTATAAC TATGATATTA TTCTTTTATC TTTCAAAATT CAAAAACTTT TTTTTACTTG 21108 .......... .......... .......... .......... .......... .......... 669 AACAAGAAGT TAAAGGAAGA AAATAAAAGG TAGAATAAAA TAACATTTAA ATAGACTCAA 21048 .......... .......... .......... .......... .......... .......... 669 TTGCTTTCAA GAGTCTTCCA TTATAAAAAG GGAAGTGGAA AAAAACAAAT AATATCCCCT 20988 .......... .......... .......... .......... .......... .......... 669 TTTCTGCAAA AAAGCAAAGC AATCAGTAAA GCAAAAATAT TTACTTTCCA ATATACTATC 20928 .......... .......... .......... .......... .......... .......... 669 TTATTGTTGA GCCAAAGACC AAAGGCACAA AGCAAGGTCA AGGCAGATAA TCAAAGAATG 20868 .......... .......... .......... .......... .......... .......... 
669 CTTTACAGTA TTATCTTAAA ATTGTGTGAC CTTATAATGG TTAGTTGCCA CTTGGTTATT 20808 .......... .......... .......... .......... .......... .......... 669 AACCTTTATT ATTTGGAATT CACTAGTTCA AATAAGACAA TTTTGTGTTG GTTATAAATA 20748 .......... .......... .......... .......... .......... .......... 669 GTTTATATTA CAAAAATTCG TTTCAAGTGT TCAAATTTTA GAAGAAAGAA ATTCACTAAT 20688 .......... .......... .......... .......... .......... .......... 669 CTCACCACAT TGATTACTAA TATTAGAGAA TGAATATATA TAATCTAAAT TCTCCCCTTC 20628 .......... .......... .......... .......... .......... .......... 669 CAAGTATATG AGATATTTAC CCGCGGCTGA GATGAGACCA CGACTAATAA GGTGGAGAAG 20568 .......... .......... .......... .......... .......... .......... 669 AAGATATCAG GCACAAAACT GAATAAATTA ATTTATATGG TGATAATATG TATCTTTAAT 20508 .......... .......... .......... .......... .......... .......... 669 CCTAACATTT AACGTAATAT TAATTTATTT AGCTTTGTCA CAGTGAAATT ATTTTTTCAT 20448 .......... .......... .......... .......... .......... .......... 669 GTGGAGAGCT ACATGACAAA AATTTTAAAT AGATAAGAGA GTGTATTAAC CACAACATTA 20388 .......... .......... .......... .......... .......... .......... 669 TAAATTAGTC AAGGTGTGTT TCACTAATCA TTAGGTGATT GTTTAGGTCG AAAACTCATA 20328 .......... .......... .......... .......... .......... .......... 669 CTTTCAATAG TTCAAATATA AAACTTAAAA ACTGATAAAG TTAAGATGTA AATAATTCTG 20268 .......... .......... .......... .......... .......... .......... 669 AGGGAAAATG CACAGGTACC CCCTCAACCT ATGTCCGAAA TTTCAGAGAC ACACTTATAC 20208 .......... .......... .......... .......... .......... .......... 669 AACACTAAGG TCCTATTACC CCCTCAACTT ATTTTATAAG TAATTTTCTA TCCCTTTTCG 20148 .......... .......... .......... .......... .......... .......... 669 ACCTATCGGA CATAGGTTGA GGGGTACTTG TGCATTTTTT CCCTGAACTT ATTTTATGAT 20088 .......... .......... .......... .......... .......... .......... 669 TGTTTCTCCT GTAAATATTT CATCACTTGG TTTCCTTCTT GTGAACCTCT CCATCATTAG 20028 .......... .......... .......... .......... .......... .......... 
669 GATGCCAAAA CTATAAACAT CACAACTCGT GGATACTATT CCATCTTGTC CATACACTGT 19968 .......... .......... .......... .......... .......... .......... 669 AATCTTGGTA AAAAATGATC ATGAAAGCTT CCTAGTTAAA ATGAAATATG CCAAAACTGA 19908 .......... .......... .......... .......... .......... .......... 669 GAGTGTCAAT CCATTTATAG GGGAAAGAAT TGCTTAACTA TAGAAGTTTC TAATATACTG 19848 .......... .......... .......... .......... .......... .......... 669 GAGCAATATA TCCGATGGTT GCAGTTGTTC TTGTTTGAAC GAAAGCCTCC CCTACATCTA 19788 .......... .......... .......... .......... .......... .......... 669 ACAATTTTGC AATGCCAAAA TCACTAACTT GACCAACCAT TTCTTGATCT AGCAACATAT 19728 .......... .......... .......... .......... .......... .......... 669 TGCTTGGCTT CAAGTCACAA TGCACCACAG GCGTTGAATA GCCATTGTGG AGATAGTCCA 19668 .......... .......... .......... .......... .......... .......... 669 TTGCAGATGC AACATCTATC ATTATATCCA ATCTCTGCAA TAAGTTTAAG AACAAGTTGT 19608 .......... .......... .......... .......... .......... .......... 669 GAGAATATAA CCATTTATCA AGTGTCCCAT TGGGCATGTA TTCCAACACC AGGACCTTGA 19548 .......... .......... .......... .......... .......... .......... 669 AATCAAGGTT GCAGCAGCTT GTTATGACTT TGGTCAGATT TTTGTGGCGA AGTTCCGGAG 19488 .......... .......... .......... .......... .......... .......... 669 GGATTCGGCT TCTCTCCATT TTCCCCAAGT TTCTATTTGG ATTAATGACA CATGTCATAG 19428 .......... .......... .......... .......... .......... .......... 669 GTTAAAATAA ATGATTAAGA TTTAATTTTT CAAACTAAAC CTCTAAGCAT GATTTAGTTA 19368 .......... .......... .......... .......... .......... .......... 669 ATAAATATAT TATATATCAA GTATCAAAAT TATTATAATC TCACAATTTT AGCAACATAT 19308 .......... .......... .......... .......... .......... .......... 669 TATTTTGTCC ATATTATTTA TGTAAAAAAG TTTTCTTCCT ATTATTTTTT TTACTTAGGA 19248 .......... .......... .......... .......... .......... .......... 669 TCTTTTTTTT ATTTTTCTTT TATATAATAT TTTTTATTGT AATCTTTTGT TAAAGGTATA 19188 .......... .......... .......... .......... .......... .......... 
669 TATTGGTCCG TGATTAATAA ATTACCAAGA GATAATCAAA CCATTTTGAC AAACTAATTT 19128 .......... .......... .......... .......... .......... .......... 669 TAATTTCATA ATAAGATTAA GAATGGGGTG ATACCAGTAC ATACGGTAGG TTATCTATAG 19068 || | || |||||| | .......... .......... .....TGGCG ATGACAGTAC TTTTA..... .......... 689 TTTCCATACT TCTATTCAGT TAATTATTCT TCTAATTAGA TAAAAAAAAA TATTATATAA 19008 .......... .......... .......... .......... .......... .......... 689 AAGGAAAAAG AAAAGATCCT AAGTAAAAAG AAATAATAGG AAGAAAACTT TTTTTACCTA 18948 .......... .......... .......... .......... .......... .......... 689 TATTTATGGA CAAAATATAG TCACAAGCAT GATATTATTC AACATTTTAA CACTCCCACT 18888 .......... .......... .......... .......... .......... .......... 689 CAATTACTAA AAACAGCGAG AGGAAGTTGT AATTTGGAGA GGTTGGGAAT GCGAAGTTTC 18828 .......... .......... .......... .......... .......... .......... 689 AGAAGGAAAA AGTTGACAAG GAATAATGTT CCTCCTTGAT TTCCTCTATG TTAAATACAA 18768 .......... .......... .......... .......... .......... .......... 689 CTTCTGGACT TTCCGTTTTT CTTGTCTTTC AGCTTAAATA TTATTTAAGT TGTTGTTTAT 18708 .......... .......... .......... .......... .......... .......... 689 TAGGAACAGA AAGTTCAGGT AGTTGTGCAT TATGATCATT TCAGAAAGAT TAAAATTCCT 18648 .......... .......... .......... .......... .......... .......... 689 CCTTTATTTC TGTACAATTC CTTGTCATTA TGAACTAAAC TAGTTTTGGT TAGCTTGGTT 18588 .......... .......... .......... .......... .......... .......... 689 TGTGTTTACG AATTTTAGGT TGGAGATAAT CATGTCATCA GGCCCTAAAC TCTAATAGTC 18528 .......... .......... .......... .......... .......... .......... 689 AGCACTCTAT GCAGCTAAGT GAATGAGGGA ACGTACATTT AGCAACCAAT AAAGACAATC 18468 .......... .......... .......... .......... .......... .......... 689 ACAGTCCTGT GCAACAATAT GCAGATATGT GATTGCAGGC AACAACTAGT TGCAGTCTGG 18408 | ||| | ||| | .......... .......... ....AAGTGT G-TTGGAA.. .......... .......... 702 TTAGCAGGGA TATTCAAGTT GTTGATAATC TCGGAACATA GATTGTTGTC ATATGCGTTC 18348 .......... .......... .......... .......... 
.......... .......... 702 CAGCAGAGAA TTGATAAACA AAGTCACACA CATGTAGCAC AGTCATGCAT TGAAAATGGC 18288 || |||| ||| |||| | || || | || ||| ...CAAAGAA --GAT-AACA AGGTTTCA-A CAGTTAG... .......... .......... 732 CATATTATGC AGACTTGGAA TGTAGGAGTA CAACTTACAT TTTGAGGATA TATACTAAAA 18228 .......... .......... .......... .......... .......... .......... 732 TCAACAAGTT TATAAAGTCA CAAACAACAG AGCATTTCAG ATGTCTATAT CTGTGAAAAC 18168 .......... .......... .......... .......... .......... .......... 732 AATTGATAGT TGAGTGGCTA TCAAAGTTAT TAACTCATCA ATGAACTCAC AGAAAAACTT 18108 .......... .......... .......... .......... .......... .......... 732 TCGATTAACT ATAATCGACT ATCTCTACTC TCCAAGAAGT CAAATCCACT CCTTGTCACA 18048 .......... .......... .......... .......... .......... .......... 732 AACCTCTCAA AGACTCAACT CTACAAGAAG CCAAATCCAC TCCTCCTACG ATAATGACTC 17988 .......... .......... .......... .......... .......... .......... 732 TAAGTACGAA TCAAAACACT CTACTCATGA CAAAAGAAAT TCTAACTATA GAACTTCTTA 17928 .......... .......... .......... .......... .......... .......... 732 ACTTATATTT CTATGATCTC TCAAATTGAT TGGCCTTTGT GTGTTAGCTC TCTTTTCTTT 17868 .......... .......... .......... .......... .......... .......... 732 CTCTCGTTTT TGAAATCTTT TCTCCCAGCT CTTTTTGAAA TCTGATTGCA TGTCTATCGT 17808 .......... .......... .......... .......... .......... .......... 732 CAACTTCGTA AACCATCATT TTGTTAGGAC CGAAAATCAG CAGGTGTAAA CGCGGAAGCT 17748 || || |||| || ||||||| || | |||||||| .......... .......... 
.......GAT CGGAAATAAG CAGGTGT-AA CACGGAAGCT 764 AGCAAAGCAA ACCTCAAAAG ACCACGAGTA AGAAGACAAC GAGAAATATA CCAAAA 17692 |||||||||| |||| |||| |||||||||| |||| ||||| |||||||||| |||||| AGCAAAGCAA ACCTTGAAAG ACCACGAGTA AGAACACAAC GAGAAATATA CCAAAA 820 hqPGS_C06HBa0054K13.1-16-_SGN-U341191- (22690 22656,19102 19083,18443 18430,18344 18311,17780 17692) ******************************************************************************** EST sequence 16 +strand 542 n (File: SGN-U313612+) 1 GAAAAGAAAC TAAGTAGAGC TATGTATATT TTGTCTGCAG AAGCAATGGC TTAATTTACA 61 ACTATAGGTG GCGTACGTAT AATGCATGGA CCGAAAAATA AGCAGGTGTA AACGCGGAAG 121 CTAGCAATGC TAACATCGAA AGACCACGAG TAAGAAGACA ACGAGAAATA TACCAAAAGA 181 CACAAAGATT TAACGTGGTT CGGTCAATCG ACCTATGTCC ACAAAGGAGA TGAGCAATCC 241 ACTATAAATA TGAGAGTACA AAATACAGAG GGAAACAATC TCAACCAATT CACTCAGAAT 301 ACATGGGAGG TTCACACAAG TAATAACGTA TCAAGCTTGT GACCCACAAA TTCTCCCTCT 361 AACCAAAACT CTCAAAGCCC TTAAGACTAC ATTGTGAATG TTAACTAAGT TAAAAGGAAC 421 ATGCCTCTAT TTATAGAATC CTAAACCTTT CCCTACAAGA AAAGGATTAG TCAATCCAAA 481 ACCTTTTCCT ACAAGGAAAA TCTATTTATG GTAAGAAATT TAGAGCAAAT AAAACCCAAC 541 AA Predicted gene structure (within gDNA segment 19369 to 15440): Exon 1 17780 17330 ( 451 n); cDNA 89 541 ( 453 n); score: 0.911 MATCH C06HBa0054K13.1-16- SGN-U313612+ 0.911 451 0.832 C PGS_C06HBa0054K13.1-16-_SGN-U313612+ (17780 17330) Alignment (genomic DNA sequence = upper lines): GACCG-AAAA TCAGCAGGTG TAAACGCGGA AGCTAGCAAA GCAAACCTCA AAAGACCACG 17722 ||||| |||| | |||||||| |||||||||| ||||||||| || ||| || |||||||||| GACCGAAAAA TAAGCAGGTG TAAACGCGGA AGCTAGCAAT GCTAACATCG AAAGACCACG 148 AGTAAGAAGA CAACGAGAAA TATACCAAAA CACACAAAGA TTTAATGTGG TTCGGTCAAT 17662 |||||||||| |||||||||| |||||||||| ||||||||| ||||| |||| |||||||||| AGTAAGAAGA CAACGAGAAA TATACCAAAA GACACAAAGA TTTAACGTGG TTCGGTCAAT 208 CGACCTACGT CCACAAAGGA GATGAGCAAT CCACTATAAA TATGAGAGTA CAAAATACAG 17602 ||||||| || |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CGACCTATGT CCACAAAGGA GATGAGCAAT CCACTATAAA TATGAGAGTA 
CAAAATACAG 268 AGAGAAACAA CCTCAACCAA TTCACTCGGA ATACATGGGA GGTTCACACA AGTGATAATG 17542 || ||||||| ||||||||| ||||||| || |||||||||| |||||||||| ||| |||| | AGGGAAACAA TCTCAACCAA TTCACTCAGA ATACATGGGA GGTTCACACA AGTAATAACG 328 TATCCAACTT GTGACCCATA AATTCTCTCC CTAACCAAAA CTCTCAAAGC TCTTAAGACT 17482 |||| | ||| |||||||| | ||||||| | |||||||||| |||||||||| ||||||||| TATCAAGCTT GTGACCCACA AATTCTCCCT CTAACCAAAA CTCTCAAAGC CCTTAAGACT 388 ACATTGTGAA TGCTGACTAA GTTAGAAGGA ACATTTCTCT ATTTATAGAG TCCTAAACCT 17422 |||||||||| || | ||||| |||| ||||| |||| |||| ||||||||| |||||||||| ACATTGTGAA TGTTAACTAA GTTAAAAGGA ACATGCCTCT ATTTATAGAA TCCTAAACCT 448 TTTCCTACAA GAAAATGATT AGTCAAT-CA AAACCTTCCC CTTAAAGGAA AACCTATTTA 17363 || ||||||| ||||| |||| ||||||| || ||||||| | || |||||| || ||||||| TTCCCTACAA GAAAAGGATT AGTCAATCCA AAACCTTTTC CTACAAGGAA AATCTATTTA 508 TGATAAGAAA TTTAGGGAAA ATAAAACCCA ACA 17330 || ||||||| ||||| | || |||||||||| ||| TGGTAAGAAA TTTAGAGCAA ATAAAACCCA ACA 541 hqPGS_C06HBa0054K13.1-16-_SGN-U313612+ (17780 17330) ******************************************************************************** EST sequence 6 -strand 845 n (File: SGN-U313614-) 1 CCCATATCAA CGGCTGGATT CAAGCTCTGC CCACAGTCGT AAATAATATC AGATTGCAAA 61 ATGGTAGTCT CTCCGGTCAG GATATCAATC TCCACCTCAC TGACAGCAGC ACCAAAGTTC 121 AAATAACTCG TAAAATCAGA TTCTGGTACA TAATAAGAAT TTGCTGCTAA GTTTACTGAT 181 TCCATTTGTG CCTGCAAAAA GAAGGGGGTG GAGGGACAGA ACTGCAAACT CAGATATGTT 241 AGGACCGAAA ATAAGCAGGT GTAAACGCGG AAGCTAGCAA TGCAAACCTC AAAAGACCAC 301 GAGTAAGAAG ACAACGAGAA ATATATCAAA AGACACAAAG ATTTAACGTG GTTCGGTCAA 361 TCGACCTACG TCCACAAAGG AGATGAGCAA TCCACTATAA ATATGAGAGT ACAAAATACA 421 GAGAGAAACA ACCTCAACCA ATTCACTCAG AATACATGGG AGGTTCACAC AAGTGATAAC 481 ATATCAAGCT TGTGACCCAC AGATTCTCCC TCTAACCAAA ACTCTCAAAG CCTGTAAGAC 541 TACATTGTGA ATGCTGATTA AGTTAAAAGG AATATTCATC TATTTATAGA GTCCTAAACC 601 TTTTCCTACA AGAAAAGGAT TAGTCAATTC AAAACCTTTT CCTAAAAGGA AAAGGATTAG 661 TCAATCCAAA ACCTTTTCCT ACAAGGAAAA CCTATTTATG GTAAGAAATT TAGGGCAAAT 721 AAAACCCAAC 
AAGTCTCCCC CTTGGCCTGA ATTTCTGACA AATAAACTTG TCCACCTTCT 781 TCACTTAATC TTCAACAACT TGCTTCTCCT CTCCATAATC TCCTTTGCAA AATTTATGTC 841 TCAAC Predicted gene structure (within gDNA segment 20756 to 14517): Exon 1 18704 18687 ( 18 n); cDNA 219 236 ( 18 n); score: 0.722 Intron 1 18686 17787 ( 900 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.96) Exon 2 17786 17371 ( 416 n); cDNA 237 653 ( 417 n); score: 0.931 MATCH C06HBa0054K13.1-16- SGN-U313614- 0.931 434 0.514 C PGS_C06HBa0054K13.1-16-_SGN-U313614- (18704 18687,17786 17371) Alignment (genomic DNA sequence = upper lines): GAACAGAAAG TTCAGGTAGT TGTGCATTAT GATCATTTCA GAAAGATTAA AATTCCTCCT 18645 |||| | || |||| || GAACTGCAAA CTCAGATA.. .......... .......... .......... .......... 236 TTATTTCTGT ACAATTCCTT GTCATTATGA ACTAAACTAG TTTTGGTTAG CTTGGTTTGT 18585 .......... .......... .......... .......... .......... .......... 236 GTTTACGAAT TTTAGGTTGG AGATAATCAT GTCATCAGGC CCTAAACTCT AATAGTCAGC 18525 .......... .......... .......... .......... .......... .......... 236 ACTCTATGCA GCTAAGTGAA TGAGGGAACG TACATTTAGC AACCAATAAA GACAATCACA 18465 .......... .......... .......... .......... .......... .......... 236 GTCCTGTGCA ACAATATGCA GATATGTGAT TGCAGGCAAC AACTAGTTGC AGTCTGGTTA 18405 .......... .......... .......... .......... .......... .......... 236 GCAGGGATAT TCAAGTTGTT GATAATCTCG GAACATAGAT TGTTGTCATA TGCGTTCCAG 18345 .......... .......... .......... .......... .......... .......... 236 CAGAGAATTG ATAAACAAAG TCACACACAT GTAGCACAGT CATGCATTGA AAATGGCCAT 18285 .......... .......... .......... .......... .......... .......... 236 ATTATGCAGA CTTGGAATGT AGGAGTACAA CTTACATTTT GAGGATATAT ACTAAAATCA 18225 .......... .......... .......... .......... .......... .......... 236 ACAAGTTTAT AAAGTCACAA ACAACAGAGC ATTTCAGATG TCTATATCTG TGAAAACAAT 18165 .......... .......... .......... .......... .......... .......... 236 TGATAGTTGA GTGGCTATCA AAGTTATTAA CTCATCAATG AACTCACAGA AAAACTTTCG 18105 .......... .......... .......... .......... .......... .......... 
236 ATTAACTATA ATCGACTATC TCTACTCTCC AAGAAGTCAA ATCCACTCCT TGTCACAAAC 18045 .......... .......... .......... .......... .......... .......... 236 CTCTCAAAGA CTCAACTCTA CAAGAAGCCA AATCCACTCC TCCTACGATA ATGACTCTAA 17985 .......... .......... .......... .......... .......... .......... 236 GTACGAATCA AAACACTCTA CTCATGACAA AAGAAATTCT AACTATAGAA CTTCTTAACT 17925 .......... .......... .......... .......... .......... .......... 236 TATATTTCTA TGATCTCTCA AATTGATTGG CCTTTGTGTG TTAGCTCTCT TTTCTTTCTC 17865 .......... .......... .......... .......... .......... .......... 236 TCGTTTTTGA AATCTTTTCT CCCAGCTCTT TTTGAAATCT GATTGCATGT CTATCGTCAA 17805 .......... .......... .......... .......... .......... .......... 236 CTTCGTAAAC CATCATTTTG TTAGGACCGA AAATCAGCAG GTGTAAACGC GGAAGCTAGC 17745 || |||||||||| |||| ||||| |||||||||| |||||||||| .......... ........TG TTAGGACCGA AAATAAGCAG GTGTAAACGC GGAAGCTAGC 278 AAAGCAAACC TCAAAAGACC ACGAGTAAGA AGACAACGAG AAATATACCA AAACACACAA 17685 || ||||||| |||||||||| |||||||||| |||||||||| ||||||| || ||| |||||| AATGCAAACC TCAAAAGACC ACGAGTAAGA AGACAACGAG AAATATATCA AAAGACACAA 338 AGATTTAATG TGGTTCGGTC AATCGACCTA CGTCCACAAA GGAGATGAGC AATCCACTAT 17625 |||||||| | |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGATTTAACG TGGTTCGGTC AATCGACCTA CGTCCACAAA GGAGATGAGC AATCCACTAT 398 AAATATGAGA GTACAAAATA CAGAGAGAAA CAACCTCAAC CAATTCACTC GGAATACATG 17565 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| AAATATGAGA GTACAAAATA CAGAGAGAAA CAACCTCAAC CAATTCACTC AGAATACATG 458 GGAGGTTCAC ACAAGTGATA ATGTATCCAA CTTGTGACCC ATAAATTCTC TCCCTAACCA 17505 |||||||||| |||||||||| | |||| | |||||||||| | | |||||| | ||||||| GGAGGTTCAC ACAAGTGATA ACATATCAAG CTTGTGACCC ACAGATTCTC CCTCTAACCA 518 AAACTCTCAA AGCTCTTAAG ACTACATTGT GAATGCTGAC TAAGTTAGAA GGAACATTTC 17445 |||||||||| ||| |||| |||||||||| ||||||||| ||||||| || |||| ||| AAACTCTCAA AGCCTGTAAG ACTACATTGT GAATGCTGAT TAAGTTAAAA GGAATATTCA 578 TCTATTTATA GAGTCCTAAA CCTTTTCCTA CAAGAAAATG ATTAGTCAA- TCAAAACCTT 
17386 |||||||||| |||||||||| |||||||||| |||||||| | ||||||||| |||||||||| TCTATTTATA GAGTCCTAAA CCTTTTCCTA CAAGAAAAGG ATTAGTCAAT TCAAAACCTT 638 CCCCTTAAAG GAAAA 17371 ||| |||| ||||| TTCCTAAAAG GAAAA 653 hqPGS_C06HBa0054K13.1-16-_SGN-U313614- (17786 17371) ******************************************************************************** EST sequence 12 -strand 911 n (File: SGN-U328267-) 1 TTTTTTTTTT CTCATAAACA TAAAGTACAC TAATATTATT ATTATAAAAT TCTCCAGTCT 61 TATACAAAAC AACATATGAC TTCACTTGAA CTATAATTAA AGAACAATAA AGGGATAATG 121 CACAAGTACC CCCTCAACCT ATGCCCGAAA TTCCAGAGAC ACACTTATAC TATACTAAGG 181 TCCTATTACC CCCCTGAACT TATTTTATAA GTAATTTTCT ACCCCTTTTT AGCCTACGTG 241 GCACTAGTTT AAAAAAAAAG TCAACAACCA TTGGGCCCAC AAGATAGTGC CACGTAGGTC 301 TAAAAGGGGT AAAAAATTAT TAATAAATAA GTTCAGGGGG TAATAAGATC TTAGTATGGT 361 ATAAGTGTAT CTCTGAGATT TTGGACATAG GCTAAGGGGG TACTTGGACA TTATCCCAAC 421 AATAAATAAA GGATATAACA TGATTCAAAA GACAACACGT GATAACCACT TCTACAACTT 481 GTGCATGATC AATTGAGCAC CATATTGAGG TTGAAGCAAC AATGGGTGAG GAGCATGAGC 541 ATAAGATGGA GAGAGTTCAA ATGCATAATG TTTTAGAATC ATGACCATTG CCATTTTTGC 601 CTCTAACATA GCAAAATTTT GCCCAATACA TATTCTTGGA CCCCAACTAA ATGGAAAAAA 661 TACAACTTGT CCTTTTGTTG CTTTTGATAT TCCTTCACTA AATCTCTCTG GCATAAACTC 721 CATTGCATCA TCTCCCCATA TTTCAGTATC ATGATGCACT AACATTGTTG CCAATATGAG 781 TTGGACCCCA GAGGGTAAAC ACAAATCCCC TAACTTTGTT TCTGTATTCA CCATGCGATT 841 AATCGCGTAT ACTGATGTAT ACAACCTTAA GACCTCGTTT AAGATCATTG TAACCACTTT 901 TAGTTGATTC A Predicted gene structure (within gDNA segment 22112 to 12058): Exon 1 21115 21079 ( 37 n); cDNA 81 112 ( 32 n); score: 0.757 Intron 1 21078 20266 ( 813 n); Pd: 0.696 (s: 0), Pa: 0.000 (s: 0.92) Exon 2 20265 20125 ( 141 n); cDNA 113 255 ( 143 n); score: 0.823 Intron 2 20124 17733 (2392 n); Pd: 0.644 (s: 0.68), Pa: 0.000 (s: 0) Exon 3 17732 17721 ( 12 n); cDNA 256 267 ( 12 n); score: 0.750 Intron 3 17720 16680 (1041 n); Pd: 0.858 (s: 0), Pa: 0.000 (s: 0) Exon 4 16679 16666 ( 14 n); cDNA 268 281 ( 14 n); score: 0.857 Intron 4 16665 13949 (2717 n); 
Pd: 0.000 (s: 0), Pa: 0.895 (s: 0) Exon 5 13948 13938 ( 11 n); cDNA 282 292 ( 11 n); score: 0.818 PPA cDNA 12 1 MATCH C06HBa0054K13.1-16- SGN-U328267- 0.823 215 0.236 C PGS_C06HBa0054K13.1-16-_SGN-U328267- (21115 21079,20265 20125,17732 17721,16679 16666,13948 13938) Alignment (genomic DNA sequence = upper lines): TTTACTTGAA CAAGAAGTTA AAGGAAGAAA ATAAAAGGTA GAATAAAATA ACATTTAAAT 21056 || ||||||| | | || ||| || ||| || | |||| TTCACTTGAA CTATAA-TTA AA-GAA-CAA -T-AAAG... .......... .......... 112 AGACTCAATT GCTTTCAAGA GTCTTCCATT ATAAAAAGGG AAGTGGAAAA AAACAAATAA 20996 .......... .......... .......... .......... .......... .......... 112 TATCCCCTTT TCTGCAAAAA AGCAAAGCAA TCAGTAAAGC AAAAATATTT ACTTTCCAAT 20936 .......... .......... .......... .......... .......... .......... 112 ATACTATCTT ATTGTTGAGC CAAAGACCAA AGGCACAAAG CAAGGTCAAG GCAGATAATC 20876 .......... .......... .......... .......... .......... .......... 112 AAAGAATGCT TTACAGTATT ATCTTAAAAT TGTGTGACCT TATAATGGTT AGTTGCCACT 20816 .......... .......... .......... .......... .......... .......... 112 TGGTTATTAA CCTTTATTAT TTGGAATTCA CTAGTTCAAA TAAGACAATT TTGTGTTGGT 20756 .......... .......... .......... .......... .......... .......... 112 TATAAATAGT TTATATTACA AAAATTCGTT TCAAGTGTTC AAATTTTAGA AGAAAGAAAT 20696 .......... .......... .......... .......... .......... .......... 112 TCACTAATCT CACCACATTG ATTACTAATA TTAGAGAATG AATATATATA ATCTAAATTC 20636 .......... .......... .......... .......... .......... .......... 112 TCCCCTTCCA AGTATATGAG ATATTTACCC GCGGCTGAGA TGAGACCACG ACTAATAAGG 20576 .......... .......... .......... .......... .......... .......... 112 TGGAGAAGAA GATATCAGGC ACAAAACTGA ATAAATTAAT TTATATGGTG ATAATATGTA 20516 .......... .......... .......... .......... .......... .......... 112 TCTTTAATCC TAACATTTAA CGTAATATTA ATTTATTTAG CTTTGTCACA GTGAAATTAT 20456 .......... .......... .......... .......... .......... .......... 112 TTTTTCATGT GGAGAGCTAC ATGACAAAAA TTTTAAATAG ATAAGAGAGT GTATTAACCA 20396 .......... 
.......... .......... .......... .......... .......... 112 CAACATTATA AATTAGTCAA GGTGTGTTTC ACTAATCATT AGGTGATTGT TTAGGTCGAA 20336 .......... .......... .......... .......... .......... .......... 112 AACTCATACT TTCAATAGTT CAAATATAAA ACTTAAAAAC TGATAAAGTT AAGATGTAAA 20276 .......... .......... .......... .......... .......... .......... 112 TAATTCTGAG GGAAAATGCA CAGGTACCCC CTCAACCTAT GTCCGAAATT TCAGAGACAC 20216 ||| |||||| || ||||||| |||||||||| | |||||||| ||||||||| .......... GGATAATGCA CAAGTACCCC CTCAACCTAT GCCCGAAATT CCAGAGACAC 162 ACTTATACAA CACTAAGGTC CTATTA-CCC CCTCAACTTA TTTTATAAGT AATTTTCTAT 20157 |||||||| | ||||||||| |||||| ||| ||| |||||| |||||||||| ||||||||| ACTTATACTA TACTAAGGTC CTATTACCCC CCTGAACTTA TTTTATAAGT AATTTTCTAC 222 CCCTTTTCGA CCTATCGGAC A-TAGGTTGA GGGGTACTTG TGCATTTTTT CCCTGAACTT 20098 ||||||| |||| | | | ||| || | CCCTTTTTAG CCTACGTGGC ACTAGTTTAA AAA....... .......... .......... 255 ATTTTATGAT TGTTTCTCCT GTAAATATTT CATCACTTGG TTTCCTTCTT GTGAACCTCT 20038 .......... .......... .......... .......... .......... .......... 255 CCATCATTAG GATGCCAAAA CTATAAACAT CACAACTCGT GGATACTATT CCATCTTGTC 19978 .......... .......... .......... .......... .......... .......... 255 CATACACTGT AATCTTGGTA AAAAATGATC ATGAAAGCTT CCTAGTTAAA ATGAAATATG 19918 .......... .......... .......... .......... .......... .......... 255 CCAAAACTGA GAGTGTCAAT CCATTTATAG GGGAAAGAAT TGCTTAACTA TAGAAGTTTC 19858 .......... .......... .......... .......... .......... .......... 255 TAATATACTG GAGCAATATA TCCGATGGTT GCAGTTGTTC TTGTTTGAAC GAAAGCCTCC 19798 .......... .......... .......... .......... .......... .......... 255 CCTACATCTA ACAATTTTGC AATGCCAAAA TCACTAACTT GACCAACCAT TTCTTGATCT 19738 .......... .......... .......... .......... .......... .......... 255 AGCAACATAT TGCTTGGCTT CAAGTCACAA TGCACCACAG GCGTTGAATA GCCATTGTGG 19678 .......... .......... .......... .......... .......... .......... 255 AGATAGTCCA TTGCAGATGC AACATCTATC ATTATATCCA ATCTCTGCAA TAAGTTTAAG 19618 .......... 
.......... .......... .......... .......... .......... 255 AACAAGTTGT GAGAATATAA CCATTTATCA AGTGTCCCAT TGGGCATGTA TTCCAACACC 19558 .......... .......... .......... .......... .......... .......... 255 AGGACCTTGA AATCAAGGTT GCAGCAGCTT GTTATGACTT TGGTCAGATT TTTGTGGCGA 19498 .......... .......... .......... .......... .......... .......... 255 AGTTCCGGAG GGATTCGGCT TCTCTCCATT TTCCCCAAGT TTCTATTTGG ATTAATGACA 19438 .......... .......... .......... .......... .......... .......... 255 CATGTCATAG GTTAAAATAA ATGATTAAGA TTTAATTTTT CAAACTAAAC CTCTAAGCAT 19378 .......... .......... .......... .......... .......... .......... 255 GATTTAGTTA ATAAATATAT TATATATCAA GTATCAAAAT TATTATAATC TCACAATTTT 19318 .......... .......... .......... .......... .......... .......... 255 AGCAACATAT TATTTTGTCC ATATTATTTA TGTAAAAAAG TTTTCTTCCT ATTATTTTTT 19258 .......... .......... .......... .......... .......... .......... 255 TTACTTAGGA TCTTTTTTTT ATTTTTCTTT TATATAATAT TTTTTATTGT AATCTTTTGT 19198 .......... .......... .......... .......... .......... .......... 255 TAAAGGTATA TATTGGTCCG TGATTAATAA ATTACCAAGA GATAATCAAA CCATTTTGAC 19138 .......... .......... .......... .......... .......... .......... 255 AAACTAATTT TAATTTCATA ATAAGATTAA GAATGGGGTG ATACCAGTAC ATACGGTAGG 19078 .......... .......... .......... .......... .......... .......... 255 TTATCTATAG TTTCCATACT TCTATTCAGT TAATTATTCT TCTAATTAGA TAAAAAAAAA 19018 .......... .......... .......... .......... .......... .......... 255 TATTATATAA AAGGAAAAAG AAAAGATCCT AAGTAAAAAG AAATAATAGG AAGAAAACTT 18958 .......... .......... .......... .......... .......... .......... 255 TTTTTACCTA TATTTATGGA CAAAATATAG TCACAAGCAT GATATTATTC AACATTTTAA 18898 .......... .......... .......... .......... .......... .......... 255 CACTCCCACT CAATTACTAA AAACAGCGAG AGGAAGTTGT AATTTGGAGA GGTTGGGAAT 18838 .......... .......... .......... .......... .......... .......... 255 GCGAAGTTTC AGAAGGAAAA AGTTGACAAG GAATAATGTT CCTCCTTGAT TTCCTCTATG 18778 .......... .......... 
.......... .......... .......... .......... 255 TTAAATACAA CTTCTGGACT TTCCGTTTTT CTTGTCTTTC AGCTTAAATA TTATTTAAGT 18718 .......... .......... .......... .......... .......... .......... 255 TGTTGTTTAT TAGGAACAGA AAGTTCAGGT AGTTGTGCAT TATGATCATT TCAGAAAGAT 18658 .......... .......... .......... .......... .......... .......... 255 TAAAATTCCT CCTTTATTTC TGTACAATTC CTTGTCATTA TGAACTAAAC TAGTTTTGGT 18598 .......... .......... .......... .......... .......... .......... 255 TAGCTTGGTT TGTGTTTACG AATTTTAGGT TGGAGATAAT CATGTCATCA GGCCCTAAAC 18538 .......... .......... .......... .......... .......... .......... 255 TCTAATAGTC AGCACTCTAT GCAGCTAAGT GAATGAGGGA ACGTACATTT AGCAACCAAT 18478 .......... .......... .......... .......... .......... .......... 255 AAAGACAATC ACAGTCCTGT GCAACAATAT GCAGATATGT GATTGCAGGC AACAACTAGT 18418 .......... .......... .......... .......... .......... .......... 255 TGCAGTCTGG TTAGCAGGGA TATTCAAGTT GTTGATAATC TCGGAACATA GATTGTTGTC 18358 .......... .......... .......... .......... .......... .......... 255 ATATGCGTTC CAGCAGAGAA TTGATAAACA AAGTCACACA CATGTAGCAC AGTCATGCAT 18298 .......... .......... .......... .......... .......... .......... 255 TGAAAATGGC CATATTATGC AGACTTGGAA TGTAGGAGTA CAACTTACAT TTTGAGGATA 18238 .......... .......... .......... .......... .......... .......... 255 TATACTAAAA TCAACAAGTT TATAAAGTCA CAAACAACAG AGCATTTCAG ATGTCTATAT 18178 .......... .......... .......... .......... .......... .......... 255 CTGTGAAAAC AATTGATAGT TGAGTGGCTA TCAAAGTTAT TAACTCATCA ATGAACTCAC 18118 .......... .......... .......... .......... .......... .......... 255 AGAAAAACTT TCGATTAACT ATAATCGACT ATCTCTACTC TCCAAGAAGT CAAATCCACT 18058 .......... .......... .......... .......... .......... .......... 255 CCTTGTCACA AACCTCTCAA AGACTCAACT CTACAAGAAG CCAAATCCAC TCCTCCTACG 17998 .......... .......... .......... .......... .......... .......... 255 ATAATGACTC TAAGTACGAA TCAAAACACT CTACTCATGA CAAAAGAAAT TCTAACTATA 17938 .......... .......... .......... 
.......... .......... .......... 255 GAACTTCTTA ACTTATATTT CTATGATCTC TCAAATTGAT TGGCCTTTGT GTGTTAGCTC 17878 .......... .......... .......... .......... .......... .......... 255 TCTTTTCTTT CTCTCGTTTT TGAAATCTTT TCTCCCAGCT CTTTTTGAAA TCTGATTGCA 17818 .......... .......... .......... .......... .......... .......... 255 TGTCTATCGT CAACTTCGTA AACCATCATT TTGTTAGGAC CGAAAATCAG CAGGTGTAAA 17758 .......... .......... .......... .......... .......... .......... 255 CGCGGAAGCT AGCAAAGCAA ACCTCAAAAG ACCACGAGTA AGAAGACAAC GAGAAATATA 17698 ||||| | || | .......... .......... .....AAAAG TCAACAA... .......... .......... 267 CCAAAACACA CAAAGATTTA ATGTGGTTCG GTCAATCGAC CTACGTCCAC AAAGGAGATG 17638 .......... .......... .......... .......... .......... .......... 267 AGCAATCCAC TATAAATATG AGAGTACAAA ATACAGAGAG AAACAACCTC AACCAATTCA 17578 .......... .......... .......... .......... .......... .......... 267 CTCGGAATAC ATGGGAGGTT CACACAAGTG ATAATGTATC CAACTTGTGA CCCATAAATT 17518 .......... .......... .......... .......... .......... .......... 267 CTCTCCCTAA CCAAAACTCT CAAAGCTCTT AAGACTACAT TGTGAATGCT GACTAAGTTA 17458 .......... .......... .......... .......... .......... .......... 267 GAAGGAACAT TTCTCTATTT ATAGAGTCCT AAACCTTTTC CTACAAGAAA ATGATTAGTC 17398 .......... .......... .......... .......... .......... .......... 267 AATCAAAACC TTCCCCTTAA AGGAAAACCT ATTTATGATA AGAAATTTAG GGAAAATAAA 17338 .......... .......... .......... .......... .......... .......... 267 ACCCAACACA TTTTCCCTCT TTTTTGATGA TGACAAATCG TCTACTCCTA TTTCCTGTAT 17278 .......... .......... .......... .......... .......... .......... 267 AAGGTCAATA CACACAGTAC ACAAGAGCAA TACATATCAA AAAGAAGAGA AAAGCGTTAA 17218 .......... .......... .......... .......... .......... .......... 267 AACAGTAAAC CAAGCCAACG AAAACGCAGA GCTCCGGAAG AGCAACATCG CGTAGAACAC 17158 .......... .......... .......... .......... .......... .......... 267 AACAAGATAC CTCCAATCTA ATGATATTAG CCATCTGTAA ACATATGCAA AAACATGCAA 17098 .......... .......... 
.......... .......... .......... .......... 267 GAACGAAGGA AGCAGAGGAT CCGAGATACA CCAACATTTT AAGCTCACAA AATATTCAGT 17038 .......... .......... .......... .......... .......... .......... 267 CTAAGCACAA AACTCAAAAG AACATAACAA TGTCTTCAGT CCAAAAGTAT AAGAAACTAG 16978 .......... .......... .......... .......... .......... .......... 267 ACCAGAAAGC AGGGATCAAG AAGAGGAATG ATTGGGGTTT GGAGGACAGG ATGAAGAAGC 16918 .......... .......... .......... .......... .......... .......... 267 AAGTAACTGG AGGATCCAGT CCACTCTAGC GTTGCTATCA AGCTTCTTCT GTACTAACTG 16858 .......... .......... .......... .......... .......... .......... 267 ATCCTTCAAT CAACCAACTT CTTCTTTAGA TGCAGTTAAA TCAGTATGCA TTTTCAGCCA 16798 .......... .......... .......... .......... .......... .......... 267 GTAGATTTTC ACACTTTTCC CTTGCAATCT TAAGATCCCA AATTAGTAGA CGAGTAACAG 16738 .......... .......... .......... .......... .......... .......... 267 GTCCAGAAGC CCGCGGAGAT GCAATAGGAA CCTGCACGAG TTAAAGCTCT TCCCTCACCC 16678 || .......... .......... .......... .......... .......... ........CC 269 AATGGGCGCA CAGCAAACTC ATTGAACACT AAAAGGTTTA AACTGCATCA AATTTTGGAG 16618 | ||||| || || ATTGGGCCCA CA........ .......... .......... .......... .......... 281 GCAAGTTGCT TTTGATTCTT ATGTAAGTTT GACGACAAGG AATTTAGGCA GCTACTCCCG 16558 .......... .......... .......... .......... .......... .......... 281 GAGACTACAC TTAGATGGCT GGAAGGAGAG GACTTGATAA AACTGTTACA GTTGTGGTGA 16498 .......... .......... .......... .......... .......... .......... 281 TATAATTGGA AGCATTCTCA AAGTTGTGTA AATACGAAAA GAAAATCTTG TGTTGAGCAT 16438 .......... .......... .......... .......... .......... .......... 281 TTTAAATCTC ATTTTCAACT TCCATAAATC AAGCCAACAA CTAAGATAAA ATACACTCGT 16378 .......... .......... .......... .......... .......... .......... 281 GCGAAATTGA ATTACATAAG AGAGAAAACT GGTAACTAAA TAACATACAA GAGAAGGTTG 16318 .......... .......... .......... .......... .......... .......... 281 GTAATGATTC CACCTAATGC CGACTACTAA CAAGCTGTAG CCTCATCTTT TTGAGTGTTG 16258 .......... 
.......... .......... .......... .......... .......... 281 AAAGAGCATC CTTCATGCTA ATTCTTGCAT CAGGTCTCAC TAAAGTGCAC TTCAAAGCTA 16198 .......... .......... .......... .......... .......... .......... 281 ATTCCATGAC AGATGACAAA CATTGCATCT TTGCAGCGAT TTGTTCATCT CCGGGCTGTA 16138 .......... .......... .......... .......... .......... .......... 281 CCAAATTAGA ATCCACCACC TTGTGAAGTT CACCCGGAAA GGAATCACTA ATCCAGCTTT 16078 .......... .......... .......... .......... .......... .......... 281 GTATGCTCAA GTCTCCAGTA AATATGTCAT CACTTGGTCT TGTTCGTGTG AACGTCTCCA 16018 .......... .......... .......... .......... .......... .......... 281 TCATCAGGAT ACCAAAACTA TAAACATCAC AACTCGTGGA TACTATTCCA TCTTGTCCAT 15958 .......... .......... .......... .......... .......... .......... 281 ACTCTGTATT CATGGTCGAA AATGACCACA AATCAATTTA TACATAGGGG AAAGAATTGT 15898 .......... .......... .......... .......... .......... .......... 281 TTTGAGTATT TAAAGGCGAT ACGAGAAAAC TTAAAATATA CCTGGAGCAA TATATCCAAT 15838 .......... .......... .......... .......... .......... .......... 281 GGTTGCAACT GTCCTTGTTT GAACAAAAGC CTCCCCTGCA CCTAACATTT TTGCAATGCC 15778 .......... .......... .......... .......... .......... .......... 281 AAAATCACTT ACATGAGCAA CCATTTCTTC ATCTAACAAG ACATTACTTG GTTTCAAGTC 15718 .......... .......... .......... .......... .......... .......... 281 ACAATGCACT ACAGGCGTTG AATAGCCATT GTGGAGATAG TTCATTGCAG ATGCAACATC 15658 .......... .......... .......... .......... .......... .......... 281 TATCATTACA TCCAATCTCT GCAATAAGTT CAAGAACAAA TTGTGAGAGT ATAACCATTT 15598 .......... .......... .......... .......... .......... .......... 281 ATCAAGTGTC CCGTTGGGCA TGTATTCCAA CACTAGGGCC TTGAAATCAA GATTGGAGCA 15538 .......... .......... .......... .......... .......... .......... 281 GCTGGTAATG ACTTTGGCAA GATTTCTGTG GCGAAGATTG CGAAGTATCT CACATTCCGT 15478 .......... .......... .......... .......... .......... .......... 281 GTCAAAACTT TTGAATGCAC CCTCCAATTG CACATTGAAT ACCTTTGCTG CAAAAATGAT 15418 .......... .......... 
.......... .......... .......... .......... 281 ACCATCCTTA AGTACCCCTT TATAAACCCT GCTGAAACTC CCATTACCAA GCAAGTTGGT 15358 .......... .......... .......... .......... .......... .......... 281 TTCGTTGAAT CCTTCAGTTG CCTGTTCAAG TTCATAATAG GAAATTCTTT CATGCCCTCT 15298 .......... .......... .......... .......... .......... .......... 281 TACGAGAGAC AGATCCTTTT GACTAGCATT CTTCTTTGTG TTTCTCAATC TTAACACGAC 15238 .......... .......... .......... .......... .......... .......... 281 AAATCCAACA GTCAACATGA AGAGTGATCC TATCCCTAAT AGAATATATA AACCTGTAAG 15178 .......... .......... .......... .......... .......... .......... 281 CACTCTTTTT CTTCTTGATT TCTTTGTAGA TTTGGTTGGG CATGGTTTTA CGTTAAACCG 15118 .......... .......... .......... .......... .......... .......... 281 GGAGTCACCA CAAAGTGCAT CATTGGACAA GAAAGACTGA CTGGTTACAT TTGCAAAGGG 15058 .......... .......... .......... .......... .......... .......... 281 ACCATCAGTG GGAATTTCTC CACTGAGTTC ATTGAAAGAG AAATTTAGGT ATTTGAGATA 14998 .......... .......... .......... .......... .......... .......... 281 CACAAGAGCT TCTAATGACT TTGGAATTTC ACCGCTAAGA TTGTTATAGG ACAAATCCAA 14938 .......... .......... .......... .......... .......... .......... 281 GTATTCCAAT GCCAACAATT TGTCAAATGA TTCAGGAATA GGCCCTTCTA ATCTATTATG 14878 .......... .......... .......... .......... .......... .......... 281 TGCTAGAGAA AGATAAATTA ATTTATCTAG GCCCCCTAGA GTACTAGGAA TCTTACCAGA 14818 .......... .......... .......... .......... .......... .......... 281 AAAATAATTA TTTGACAGAT CAATCAGTGT TGCACCCTTC AAGTTTCCGC TCTCTAGCGG 14758 .......... .......... .......... .......... .......... .......... 281 AATTTCTCCA CTCAAATAAT TGGATGAAAT ATTGAACTCT ATGATGTTTT GAAGTCCCCC 14698 .......... .......... .......... .......... .......... .......... 281 CAATCTTGCA GGTAATCTAG AATCCAGCTT GTTGTTATCT AGATAGAGTG TCCTCAAACT 14638 .......... .......... .......... .......... .......... .......... 281 AGTAACATTC CCTAAGCATG GTGGAACGGA ACTAGAAAAT TGATTTTCTG ACAATTCTAA 14578 .......... .......... .......... 
.......... .......... .......... 281 TGCACCAAGA TACTGTAAAC TGCAGATAAC ATCTGGTATG GCTCCTTCTA ACTTGTTGCT 14518 .......... .......... .......... .......... .......... .......... 281 TCCTAGGTAA AGTTCTTGAA GGTTCAGCAT TCCTTGCACT GTTTTTGGAA TATGACCTAT 14458 .......... .......... .......... .......... .......... .......... 281 CAACTGATTG TTCGACAGAC TCATCCTTGT CAATCCAGTA AGATTAGTAA TTTGTTTTGA 14398 .......... .......... .......... .......... .......... .......... 281 AATGACACCC TTCAGTTTAC ATTTAGATGC TTCAAAAATT TGCAAGGAGT TTGAGAAATT 14338 .......... .......... .......... .......... .......... .......... 281 ACCAACAGAT GCAGGCAAAA CACCATCCAA CGGATTACCA CCAAGCGTGA GTACTCTTAG 14278 .......... .......... .......... .......... .......... .......... 281 ATTCCTACAG TTTGTCAATG ATTCAAGGAA GCTCAATGTT GAATCGCTGA CAAAATTATT 14218 .......... .......... .......... .......... .......... .......... 281 CCCCCACAAG TTGAGAACCT CAAGGTATTC TAAGTTACCA AGTGATTTTG AAATTAGACC 14158 .......... .......... .......... .......... .......... .......... 281 TGTGAAACTG TTGCTGGAGA GGTCTAACAT TGTGAGTCTT GAAGAATTAG AGATTGAATC 14098 .......... .......... .......... .......... .......... .......... 281 AGAGATAAAA CCACTCAGAT TATTTCCTCC ACAATAAAAT ACTTGTAGGT CGGGCATTCC 14038 .......... .......... .......... .......... .......... .......... 281 ACGACCTAAA TCTGAAGGCA GAGTACCTGT AAGCTTGTTT TGTCCAAAAT CTATTTTCTG 13978 .......... .......... .......... .......... .......... .......... 281 CAGTGTTGAC ATGTTGAAAA TGCTGTCAGG GATAGAGCCA 13938 ||||| |||| .......... .......... 
.........A GATAGTGCCA 292 hqPGS_C06HBa0054K13.1-16-_SGN-U328267- (21115 21079,20265 20125) ******************************************************************************** EST sequence 17 +strand 826 n (File: SGN-U344226+) 1 ATTGAAACAA AAGCTGGAGC TCCACCGCGG TGGCGGCCGC TCTAGAACTA GTGGATCCCC 61 CGGGCTGCAG GAATTCGGCA CGAGCTTATT GTAGATGACC AATTTTTTCT TCGAATACGA 121 AATTAAATTA CAATACACAC AAAAAAAATA TTTGAATTTT TTTTATTTAA ACTAAGGAAT 181 GAAAGAAAAA AACAAAATAA GAATAAGAAA CTCAAATTAT TATAATAAAA GAAGTCAAAA 241 AATAATTTTT GTATGAAAAA ATTAAAATAT ACCTTGAACT TTGATAGAAG AATCATATAT 301 ATCCCTAAAT ATTTTTTTTA AAAAAAAATT AGAAGTAACA AATATAAATT TAAAACTAAT 361 TTTTTAACTT TCGTTAAATG AAGGGTATAT GTGAGCCATT TTCTAACGGC AGGGGTATAT 421 GTGAGCCGTT TGTATAACGA TAAGGGCATA TATGAACCAC TTTTATTACG AGGGATATAT 481 CAGCTCTAAA TGACAAAGTT GAGAGGTATA TCAGACCCTT TTCCCTATTT TTTAAAATTT 541 CATACCATTT GAAAAAAAAT CCTATGTTCT TCTCTTTACT ATTTTTGGAC ATATCATTCA 601 TGTATCGGTA CGAGATATAT CATTGGTGGT GTACCTTGTA TTAAACAAAA TAATGGATCG 661 AAATGTGAAT GTTTCGGAAC ATAAACAACG TTTCGAGAAG CATTGTATCA AAATATGGAG 721 GGATCATTTG GGTAGGTTTG TCTTGTTTTA ACAAGGGAAT GTATTGAGAC ACAATATATA 781 CATAACGTTT AAAGGAGGGT CGAAAACCCT AAATTTTAAT TTTTAG Predicted gene structure (within gDNA segment 23058 to 15559): Exon 1 21617 21163 ( 455 n); cDNA 86 538 ( 453 n); score: 0.918 Intron 1 21162 19824 (1339 n); Pd: 0.000 (s: 0.86), Pa: 0.771 (s: 0.48) Exon 2 19823 19780 ( 44 n); cDNA 539 582 ( 44 n); score: 0.477 Intron 2 19779 19454 ( 326 n); Pd: 0.753 (s: 0.48), Pa: 0.000 (s: 0) Exon 3 19453 19428 ( 26 n); cDNA 583 607 ( 25 n); score: 0.654 Intron 3 19427 18570 ( 858 n); Pd: 0.595 (s: 0), Pa: 0.980 (s: 0) Exon 4 18569 18563 ( 7 n); cDNA 608 614 ( 7 n); score: 0.714 Intron 4 18562 18444 ( 119 n); Pd: 0.000 (s: 0), Pa: 0.908 (s: 0) Exon 5 18443 18433 ( 11 n); cDNA 615 625 ( 11 n); score: 0.818 MATCH C06HBa0054K13.1-16- SGN-U344226+ 0.918 543 0.657 C PGS_C06HBa0054K13.1-16-_SGN-U344226+ (21617 21163,19823 19780,19453 19428,18569 18563,18443 18433) 
Alignment (genomic DNA sequence = upper lines): TTATTGTAGA TGACCAAATT TTTTCTTCGA ATACGAAATT AAATTACAAT ACACAGTAAA 21558 |||||||||| ||||| |||| |||||||||| |||||||||| |||||||||| ||||| ||| TTATTGTAGA TGACC-AATT TTTTCTTCGA ATACGAAATT AAATTACAAT ACACACAAAA 144 AAAATAGTTT AA-TTTTTTT ATTTAAACTA AGGAATGAAA GAAAAAAACA AAATAAGAAT 21499 |||||| || || ||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAATATTTG AATTTTTTTT ATTTAAACTA AGGAATGAAA GAAAAAAACA AAATAAGAAT 204 AAGAAACTCA AATAATTATA ATAAATGAAG TCAAAAAATA ATTTATGTAT GAAAAAAATT 21439 |||||||||| ||| |||||| ||||| |||| |||||||||| |||| ||||| | |||||||| AAGAAACTCA AATTATTATA ATAAAAGAAG TCAAAAAATA ATTTTTGTAT G-AAAAAATT 263 AAAATATACA TTGAACTTTG ATAGAAGAAT CATATATATC TCTAAATAAT TTTTTTTAAA 21379 ||||||||| |||||||||| |||||||||| |||||||||| ||||||| | |||||| ||| AAAATATACC TTGAACTTTG ATAGAAGAAT CATATATATC CCTAAATATT TTTTTTAAAA 323 AAAAATTAAA AGTAATAAAT ATAAATTTAA AATAAATTTT TTAACTTCCG TTAAATGAAG 21319 |||||||| | ||||| |||| |||||||||| || |||||| ||||||| || |||||||||| AAAAATTAGA AGTAACAAAT ATAAATTTAA AACTAATTTT TTAACTTTCG TTAAATGAAG 383 GGTATATGTG AGTCATTTTA TAACAGCAGG GGTAAATGTG AGCCGTTTGT ATAACGGTAA 21259 |||||||||| || |||||| |||| ||||| |||| ||||| |||||||||| |||||| ||| GGTATATGTG AGCCATTTTC TAACGGCAGG GGTATATGTG AGCCGTTTGT ATAACGATAA 443 GGGCATATAT GAGCCACTTT TATAACGAGG GGTATATGAG CTCCAAATGA CAAAGTTGAG 21199 |||||||||| || ||||||| ||| |||||| | ||||| || ||| |||||| |||||||||| GGGCATATAT GAACCACTTT TATTACGAGG GATATATCAG CTCTAAATGA CAAAGTTGAG 503 GGGTATATCA GACCTTTTTC CCTTTTATAT ATAACTATAA CTATGATATT ATTCTTTTAT 21139 ||||||||| |||| ||||| ||| || | | | || | AGGTATATCA GACCCTTTTC CCTATTTTTT A-AAAT.... .......... .......... 538 CTTTCAAAAT TCAAAAACTT TTTTTTACTT GAACAAGAAG TTAAAGGAAG AAAATAAAAG 21079 .......... .......... .......... .......... .......... .......... 538 GTAGAATAAA ATAACATTTA AATAGACTCA ATTGCTTTCA AGAGTCTTCC ATTATAAAAA 21019 .......... .......... .......... .......... .......... .......... 
538 GGGAAGTGGA AAAAAACAAA TAATATCCCC TTTTCTGCAA AAAAGCAAAG CAATCAGTAA 20959 .......... .......... .......... .......... .......... .......... 538 AGCAAAAATA TTTACTTTCC AATATACTAT CTTATTGTTG AGCCAAAGAC CAAAGGCACA 20899 .......... .......... .......... .......... .......... .......... 538 AAGCAAGGTC AAGGCAGATA ATCAAAGAAT GCTTTACAGT ATTATCTTAA AATTGTGTGA 20839 .......... .......... .......... .......... .......... .......... 538 CCTTATAATG GTTAGTTGCC ACTTGGTTAT TAACCTTTAT TATTTGGAAT TCACTAGTTC 20779 .......... .......... .......... .......... .......... .......... 538 AAATAAGACA ATTTTGTGTT GGTTATAAAT AGTTTATATT ACAAAAATTC GTTTCAAGTG 20719 .......... .......... .......... .......... .......... .......... 538 TTCAAATTTT AGAAGAAAGA AATTCACTAA TCTCACCACA TTGATTACTA ATATTAGAGA 20659 .......... .......... .......... .......... .......... .......... 538 ATGAATATAT ATAATCTAAA TTCTCCCCTT CCAAGTATAT GAGATATTTA CCCGCGGCTG 20599 .......... .......... .......... .......... .......... .......... 538 AGATGAGACC ACGACTAATA AGGTGGAGAA GAAGATATCA GGCACAAAAC TGAATAAATT 20539 .......... .......... .......... .......... .......... .......... 538 AATTTATATG GTGATAATAT GTATCTTTAA TCCTAACATT TAACGTAATA TTAATTTATT 20479 .......... .......... .......... .......... .......... .......... 538 TAGCTTTGTC ACAGTGAAAT TATTTTTTCA TGTGGAGAGC TACATGACAA AAATTTTAAA 20419 .......... .......... .......... .......... .......... .......... 538 TAGATAAGAG AGTGTATTAA CCACAACATT ATAAATTAGT CAAGGTGTGT TTCACTAATC 20359 .......... .......... .......... .......... .......... .......... 538 ATTAGGTGAT TGTTTAGGTC GAAAACTCAT ACTTTCAATA GTTCAAATAT AAAACTTAAA 20299 .......... .......... .......... .......... .......... .......... 538 AACTGATAAA GTTAAGATGT AAATAATTCT GAGGGAAAAT GCACAGGTAC CCCCTCAACC 20239 .......... .......... .......... .......... .......... .......... 538 TATGTCCGAA ATTTCAGAGA CACACTTATA CAACACTAAG GTCCTATTAC CCCCTCAACT 20179 .......... .......... .......... .......... .......... .......... 
538 TATTTTATAA GTAATTTTCT ATCCCTTTTC GACCTATCGG ACATAGGTTG AGGGGTACTT 20119 .......... .......... .......... .......... .......... .......... 538 GTGCATTTTT TCCCTGAACT TATTTTATGA TTGTTTCTCC TGTAAATATT TCATCACTTG 20059 .......... .......... .......... .......... .......... .......... 538 GTTTCCTTCT TGTGAACCTC TCCATCATTA GGATGCCAAA ACTATAAACA TCACAACTCG 19999 .......... .......... .......... .......... .......... .......... 538 TGGATACTAT TCCATCTTGT CCATACACTG TAATCTTGGT AAAAAATGAT CATGAAAGCT 19939 .......... .......... .......... .......... .......... .......... 538 TCCTAGTTAA AATGAAATAT GCCAAAACTG AGAGTGTCAA TCCATTTATA GGGGAAAGAA 19879 .......... .......... .......... .......... .......... .......... 538 TTGCTTAACT ATAGAAGTTT CTAATATACT GGAGCAATAT ATCCGATGGT TGCAGTTGTT 19819 || | .......... .......... .......... .......... .......... .....TTCAT 543 CTTGTTTGAA CGAAAGCCTC CCCTACATCT AACAATTTTG CAATGCCAAA ATCACTAACT 19759 |||||| ||| | | | ||| | | | ACCATTTGAA AAAAAATCCT ATGTTCTTCT CTTTACTAT. .......... .......... 582 TGACCAACCA TTTCTTGATC TAGCAACATA TTGCTTGGCT TCAAGTCACA ATGCACCACA 19699 .......... .......... .......... .......... .......... .......... 582 GGCGTTGAAT AGCCATTGTG GAGATAGTCC ATTGCAGATG CAACATCTAT CATTATATCC 19639 .......... .......... .......... .......... .......... .......... 582 AATCTCTGCA ATAAGTTTAA GAACAAGTTG TGAGAATATA ACCATTTATC AAGTGTCCCA 19579 .......... .......... .......... .......... .......... .......... 582 TTGGGCATGT ATTCCAACAC CAGGACCTTG AAATCAAGGT TGCAGCAGCT TGTTATGACT 19519 .......... .......... .......... .......... .......... .......... 582 TTGGTCAGAT TTTTGTGGCG AAGTTCCGGA GGGATTCGGC TTCTCTCCAT TTTCCCCAAG 19459 .......... .......... .......... .......... .......... .......... 582 TTTCTATTTG GATTAATGAC ACATGTCATA GGTTAAAATA AATGATTAAG ATTTAATTTT 19399 |||| || || | ||||| || | .....TTTTG GACATATCAT TCATGT-ATC G......... .......... .......... 607 TCAAACTAAA CCTCTAAGCA TGATTTAGTT AATAAATATA TTATATATCA AGTATCAAAA 19339 .......... .......... 
.......... .......... .......... .......... 607 TTATTATAAT CTCACAATTT TAGCAACATA TTATTTTGTC CATATTATTT ATGTAAAAAA 19279 .......... .......... .......... .......... .......... .......... 607 GTTTTCTTCC TATTATTTTT TTTACTTAGG ATCTTTTTTT TATTTTTCTT TTATATAATA 19219 .......... .......... .......... .......... .......... .......... 607 TTTTTTATTG TAATCTTTTG TTAAAGGTAT ATATTGGTCC GTGATTAATA AATTACCAAG 19159 .......... .......... .......... .......... .......... .......... 607 AGATAATCAA ACCATTTTGA CAAACTAATT TTAATTTCAT AATAAGATTA AGAATGGGGT 19099 .......... .......... .......... .......... .......... .......... 607 GATACCAGTA CATACGGTAG GTTATCTATA GTTTCCATAC TTCTATTCAG TTAATTATTC 19039 .......... .......... .......... .......... .......... .......... 607 TTCTAATTAG ATAAAAAAAA ATATTATATA AAAGGAAAAA GAAAAGATCC TAAGTAAAAA 18979 .......... .......... .......... .......... .......... .......... 607 GAAATAATAG GAAGAAAACT TTTTTTACCT ATATTTATGG ACAAAATATA GTCACAAGCA 18919 .......... .......... .......... .......... .......... .......... 607 TGATATTATT CAACATTTTA ACACTCCCAC TCAATTACTA AAAACAGCGA GAGGAAGTTG 18859 .......... .......... .......... .......... .......... .......... 607 TAATTTGGAG AGGTTGGGAA TGCGAAGTTT CAGAAGGAAA AAGTTGACAA GGAATAATGT 18799 .......... .......... .......... .......... .......... .......... 607 TCCTCCTTGA TTTCCTCTAT GTTAAATACA ACTTCTGGAC TTTCCGTTTT TCTTGTCTTT 18739 .......... .......... .......... .......... .......... .......... 607 CAGCTTAAAT ATTATTTAAG TTGTTGTTTA TTAGGAACAG AAAGTTCAGG TAGTTGTGCA 18679 .......... .......... .......... .......... .......... .......... 607 TTATGATCAT TTCAGAAAGA TTAAAATTCC TCCTTTATTT CTGTACAATT CCTTGTCATT 18619 .......... .......... .......... .......... .......... .......... 607 ATGAACTAAA CTAGTTTTGG TTAGCTTGGT TTGTGTTTAC GAATTTTAGG TTGGAGATAA 18559 | | ||| .......... .......... .......... .......... .........G TACGAG.... 614 TCATGTCATC AGGCCCTAAA CTCTAATAGT CAGCACTCTA TGCAGCTAAG TGAATGAGGG 18499 .......... .......... 
.......... .......... .......... .......... 614 AACGTACATT TAGCAACCAA TAAAGACAAT CACAGTCCTG TGCAACAATA TGCAGATATG 18439 |||| .......... .......... .......... .......... .......... .....ATATA 619 TGATTG 18433 | |||| TCATTG 625 hqPGS_C06HBa0054K13.1-16-_SGN-U344226+ (21617 21163) ******************************************************************************** EST sequence 7 -strand 890 n (File: SGN-U335137-) 1 GTAAAACTAT GTAGNATGAC CATTCTTTTC TTCGATACCA AAAATTAAAT TCCATATAGA 61 CATAAAAAAT GTTTTAAATT TTTTTCTTAC ACTANGGGAA TGNAAGAAAA AAAACAAGAT 121 TAATNAACTC AAATAATTAT AATAAATAAG TCAAAAAAAT AATTTATGTA TTAAAAAAAT 181 TTGAAATATA CCTTGAACTT TGAAAAAAGA ATCATATATG CCCCTAAATA TATTTTTTTT 241 TAAAATTAAA GTAAAATTAT AAATTTAAAA GTAATTTTTT CACTTTCGTT AAATGAAGGG 301 TATATATGAG CTCATTTTGT AACGGCAGAG GTATATGTGA ACCATTTGTA TAACGGTAAG 361 GGTATATATG AGCCACTTTC ATAACGAGGG GTATATCAGT TTCAAATGAC AAAGTTGAGG 421 GGTATATCAT ACCCTTTTCC CATAATATTA TTCATTTTTG GGTTGACGGG TCAAACCTTG 481 GGCTGCTTAG GACTTGATTA GACCGCTATT TTATTGACTC TTTAATTAAT GGGCAACTTT 541 CACATATAAC AAACAAAAAA TTCATATTTG TATGCTATAA CAAAGTTTGC ATAATTGCGC 601 TCCATAGCAA ACATAAAATT GTATAATTCG CTGACCTAAA TTGTATAATT CGCTGGCCTA 661 TTTCGCTGCA ATTGTATAAT TCGCTATCCT ATTTAACTAC AATTGTATAA TTCGCTGCCT 721 ATTTCGCTGC AATATTATTA TAAAATTTGC TTTGCATATA ATTGAACCGA ATTAAAATGT 781 ATGTATATTG CATAATTATA AGTGTATAGC AATAAGATAT ATGTTTTTCC CTGCAGCCCG 841 GGGGATCCAC TAGTTCTAGA GCGGCCGCCA CCGCGGGGAG CTCCAGCTCT Predicted gene structure (within gDNA segment 23354 to 15979): Exon 1 21624 21169 ( 456 n); cDNA 1 448 ( 448 n); score: 0.808 MATCH C06HBa0054K13.1-16- SGN-U335137- 0.808 456 0.512 C PGS_C06HBa0054K13.1-16-_SGN-U335137- (21624 21169) Alignment (genomic DNA sequence = upper lines): GTAAAACTTA TTGTAGATGA CCAAATTTTT TCTTCGAATA CGAAATTAAA TTACAATACA 21565 ||||||| || | |||| || | | ||| ||||||| | ||| | || | | ||| | GTAAAAC-TA TGTAGNATGA CC-ATTCTTT TCTTCGATAC CAAAAATTAA ATTCCATATA 58 CAGTAAAAAA ATAGTTTAAT TTTTTTATTT AAACTAAGGA ATGAAAGAAA AAAACAAAAT 21505 
| ||||| || ||||| |||||| || | |||| || ||| | || ||| |||| GACATAAAAA ATGTTTTAAA TTTTTTTCTT ACACTANGG- --GAATGNAA GAAAAAAAAC 115 AAGAATAAGA AACTCAAATA ATTATAATAA ATGAAGTC-A AAAAATAATT TATGTATGAA 21446 |||| ||| |||||||||| |||||||||| || ||||| | |||||||||| ||||||| || AAGATTAATN AACTCAAATA ATTATAATAA AT-AAGTCAA AAAAATAATT TATGTATTAA 174 AAAAA-TTAA AATATACATT GAACTTTGAT AGAAGAATCA TATATATCTC TAAATAATTT 21387 ||||| || | ||||||| || ||||||||| | |||||||| ||||| | | ||||| || | AAAAATTTGA AATATACCTT GAACTTTGAA AAAAGAATCA TATATGCCCC TAAAT-ATAT 233 TTTTTAAAAA AAATTAAAAG TAATAAATAT AAATTTAAAA TAAATTTTTT AACTTCCGTT 21327 ||||| | ||||| |||| ||| || ||| |||||||||| |||||||| |||| |||| TTTTT-TTTA AAATT-AAAG TAA-AATTAT AAATTTAAAA GTAATTTTTT CACTTTCGTT 290 AAATGAAGGG TATATGTGAG -TCATTTTAT AACAGCAGGG GTAAATGTGA GCCGTTTGTA 21268 |||||||||| ||||| |||| ||||||| | ||| |||| | ||| |||||| || |||||| AAATGAAGGG TATATATGAG CTCATTTTGT AACGGCAGAG GTATATGTGA ACCATTTGTA 350 TAACGGTAAG GGCATATATG AGCCACTTTT ATAACGAGGG GTATATGAGC TCCAAATGAC 21208 |||||||||| || ||||||| ||||||||| |||||||||| |||||| || | |||||||| TAACGGTAAG GGTATATATG AGCCACTTTC ATAACGAGGG GTATATCAGT TTCAAATGAC 410 AAAGTTGAGG GGTATATCAG ACCTTTTTCC CTTTTATAT 21169 |||||||||| ||||||||| ||| |||||| | | |||| AAAGTTGAGG GGTATATCAT ACCCTTTTCC C-ATAATAT 448 hqPGS_C06HBa0054K13.1-16-_SGN-U335137- (21624 21169) ******************************************************************************** EST sequence 10 -strand 904 n (File: SGN-U345542-) 1 TATANGTAAT TCNTTCATGT CTTTCAACAA AGATGCATCT GTGGACCTGA ATTCTTCTTT 61 CTCTTTTGCA CCNTNAACAA ACATATCCTA GANCGATGTG AGAATCATCG ATCNACACTT 121 TCTAAAATGT TCTCTNCCAG ATGTAAGATT GTGAGCATCG TAAGTTCCNA AGTTCTCTGG 181 TATAGGGTCC AGTGAACTCA TTGCCAGACA ATGTCANCAG TTGTAGGTTT GAGCATTTCT 241 GTAAATTTGT GTATTTCGTC CATTCAGGTT GTTTGATGAG AGTAAAAGCA CTTGAAGATT 301 TGGTAGATTG TCACAAATAT CAACAGGAAG TTTCCCGACA AGACGATTGT GTATTATAGC 361 AAGTCTTGTC AATGAAGTCA TGTTGAAGAT TGACGGCGGT ATAGAGCCAG TAAGTCTATT 421 ACCTTGCAGG TCTAAAAAAG TCAAGTAACG 
AAGATTACCA ATTTCAGGAG GGATTTTTCC 481 TTGAAGAAAA TTCCTCTGTA TTCTCAACTC TTGAAGATTT GTTAGATTGG AAAGAGAAGA 541 TGGAATTTCC CCTGAATATT GGTTGTTTGA AAGGTAAAGG AATTGAAGAT TAGGTAACGA 601 ACTTAAGAAC GATGGGATGG CACCACTAAA ATTGTTTCTT GTAACATCAA TCAATTTCAA 661 CCTCCGCAAG TGAGTCAACT CTTGGGAGAG ATGTCCATGG AAAGTGTTGT TACTAATGTC 721 AAGGGATACG AGAAATGAGA GATTTCCAAG GTGTGGAGGA ATGGTACCAT GAAGTTGCAT 781 GCTAGAAATG TCTAAAGCAG TGACTCTATG GCGCCCATTG CAAGTGATCC CTACTCGTGC 841 CGAATCCTGC AGCCCGGGGG ATCCACTAGT TCTAGAGCGG CCGCCACCGC GGTGAGCTCC 901 AGCT Predicted gene structure (within gDNA segment 28288 to 20878): Exon 1 24677 24563 ( 115 n); cDNA 2 111 ( 110 n); score: 0.643 Intron 1 24562 23157 (1406 n); Pd: 0.000 (s: 0.60), Pa: 0.000 (s: 0.56) Exon 2 23156 22454 ( 703 n); cDNA 112 810 ( 699 n); score: 0.713 MATCH C06HBa0054K13.1-16- SGN-U345542- 0.703 818 0.905 C PGS_C06HBa0054K13.1-16-_SGN-U345542- (24677 24563,23156 22454) Alignment (genomic DNA sequence = upper lines): ATATGAAATT CTTTCATGCC CTTTTACGAG GGACACATCC TTTTGAGTAG CATTCTTCTT 24618 ||| | |||| | |||||| |||| | | || ||| | | | || | ||||||||| ATANGTAATT CNTTCATG-T CTTTCAACAA AGATGCAT-C TGTGGACCTG AATTCTTCTT 59 TGTGTTTCTC AATCTTAACA CCACAATTCC AACAGTCAAC GTGAAGAGTG ATCCAATCCC 24558 | | ||| | | | |||| ||| ||| | | | | ||| ||| | ||| | TCTCTTTTGC ACCNTNAACA A-ACATATCC TAGA-NCGAT GTG-AGAATC ATCGA..... 111 TAATAGAATA TATAAAGCCA TAAGCACTCG TTTTCTTTTG GACTTCTTTG TCGATTTGGT 24498 .......... .......... .......... .......... .......... .......... 111 TAGGCATGGT TTTACGTTAA ATCGGGAGTC ACCACAAAGT GCATCATCGG ACAAGAAAGA 24438 .......... .......... .......... .......... .......... .......... 111 CTTACTGGTT ACATTTGCAA AGGGACCACC AGTGGGAATT TCTCCACTGA GTTCATTGAA 24378 .......... .......... .......... .......... .......... .......... 111 AGAGAAATTT AGGTATTTGA GATACACAAG AGCTTCTAAT GACTTTGGAA TTTGACCACT 24318 .......... .......... .......... .......... .......... .......... 
111 AATATTATTA TAGGACAAAT CCAAGTATTC CAAAGACAAC ATTTTGCCAA ATGATTCAGG 24258 .......... .......... .......... .......... .......... .......... 111 AATAGGCCCC TCTAATCTAT TATGTGTTAG AGAAAGATGA ATCAATTTAT CTAGACCCCC 24198 .......... .......... .......... .......... .......... .......... 111 TAGAGTACTA GGCATCTTAC CAGAAAAACA ATTATTTGAC AAATCAATGT GTGTTGCAGC 24138 .......... .......... .......... .......... .......... .......... 111 CTTCAAGTTT CCACTCTCCA TTGGAATTTC CCCACTAAAT AAATTGAATG AAACATCTAA 24078 .......... .......... .......... .......... .......... .......... 111 TTCTATGAGA TCTTGAAGGT TCCCCAAGTT TGAAGGTAAT GTAGAATCCA GCTTGTTGTT 24018 .......... .......... .......... .......... .......... .......... 111 ATATAGATGA AGTATCCTCA AACTAGTAAT GTTCCCTAAG CAGAAGGGTA CCGAACCAGA 23958 .......... .......... .......... .......... .......... .......... 111 AAAATGATTT TTTGACAAGA GTAATGCACC AAGTCTCTTT AAATTACAGA CAACATCTGG 23898 .......... .......... .......... .......... .......... .......... 111 TATGGTTCCT TCTATCTTGT TGTTCAATAA GTAAAGTTCT TGAAGGATCG ACATGCCATG 23838 .......... .......... .......... .......... .......... .......... 111 TACAGTATTT GGAATATGTC CAGTTAATGT ATTGTTAAAC AGACTCATCC TTGTCAGTCC 23778 .......... .......... .......... .......... .......... .......... 111 AGTAAGATTA CCAATTTCTC GAGGAATGAC ACCCTTCAGT GTACAATTAT ATGCTTCAAA 23718 .......... .......... .......... .......... .......... .......... 111 AATTTGCAAG GAGTTTGAGA AATTACCAAC AGATGCAGGC AATACACCAT CCAACGGATT 23658 .......... .......... .......... .......... .......... .......... 111 ACCTGCTAAC GTGAGTGCTC TTAGATTCCT ACAGTTTGTC AATGATGCAA GGAAGCTCAA 23598 .......... .......... .......... .......... .......... .......... 111 TGTAGAATCG CTGACAAAAT TATTCCACAG CAAGTCAAGT AACTCAAGGT ATTCTAAGTT 23538 .......... .......... .......... .......... .......... .......... 111 ACCAAGTGAT TTAGGAATTG AACCTGTGAA ACTGTTTTGT GAGAGATCAA ATTCTCTGAG 23478 .......... .......... .......... .......... .......... .......... 
111 TTTTGATGAA TTTGAGATTG AATCAGAGAT AAAACCACTC AGATCATTTC CTCCACAATA 23418 .......... .......... .......... .......... .......... .......... 111 AAATATTTCT AGGTTGGGCA TTCCACGACC TAAATCTGAA GGTAGAGTAC CTGAAAGGTT 23358 .......... .......... .......... .......... .......... .......... 111 ATTTTCTCCA AAATCTATAT ACTGCAGTGC TGACATGTTG AAAATGCTGT GAGGAACAGA 23298 .......... .......... .......... .......... .......... .......... 111 GCCAGTTAGC TCATTCAGTG CTAAATCCAA CTCCTGAAGT TTTTGAAGAT TACCTAACTC 23238 .......... .......... .......... .......... .......... .......... 111 CATTGGTATC TCACCTGTCA ATTTTCAGGA AATATATCAT GAGTACTATG TTTGGCGGAA 23178 .......... .......... .......... .......... .......... .......... 111 AAGAGAAAAA GGAAGAAAAT TTCATACTAC CTTCCAAATG CAATTCTGAA AGATATAAAT 23118 || || || ||| ||| ||| |||| ||| .......... .......... .TC-NAC-AC TTTCTAAAAT GTTCTCTNCC AGATGTAAGA 148 ATGTAAGAGC TGTTAAGTTG GCTAGCTCTC TTGGTAAAGT -TCCAATAAA CTTATTTTCA 23059 ||| || | |||||| || |||| ||||| || |||| | || || ||| || TTGTGAG-CA TCGTAAGTTC CNAAGTTCTC -TGGTATAGG GTCCAGTGAA CTCATTGCCA 206 CCCAATTGCA ATTTTTGAAG CTTTCTGCAT TTCTCCAGGT TTGGTGGAAT AACTCCATCT 22999 |||| || ||| || ||| |||| |||| | | || ||| | | ||||| GACAATGTCA NCAGTTGTAG GTTTGAGCAT TTCTGTAAAT TT-GTGTATT TCGTCCATTC 265 AGGGAGTTTT TACTGAGGTA AAGTCCTTCC AAGTCTCGAA GATGATCACA TATCGTTTTT 22939 ||| |||| ||| | ||| ||| | | | | | ||| ||||| || AGGTTGTTTG ATGAGAGTAA AAGCACTTGA AGATTTGGTA GATTGTCACA AATATCAACA 325 GGAAGATTTC CAGTAAGATT GTTGCCCGTA AGAGCAA-TC ACATGCATTG TAGTAATGTT 22880 ||||| || | | |||| ||| | | ||||| || | || || ||| ||||| GGAAGTTTCC CGACAAGACG ATTGTGTATT ATAGCAAGTC TTGT-CAATG AAGTCATGTT 384 AAAAATGGAT GGTGGTATAG AGCCACTAAG CTGATTAATT TGCAGGTCTA GGATAGTCAA 22820 || || || || ||||||| ||||| |||| |||| | |||||||||| | |||||| GAAGATTGAC GGCGGTATAG AGCCAGTAAG TCTATTACCT TGCAGGTCTA AAAAAGTCAA 444 GTAACGAAGA TCACCGATTT CTCGAGGGAT CTCTCCTTCA AGAAAATTTC TATCCAAGTA 22760 |||||||||| | ||| |||| | ||||||| | ||||| | |||||||| | | | | GTAACGAAGA 
TTACCAATTT CAGGAGGGAT TTTTCCTTGA AGAAAATTCC TCTGTATTCT 504 TAACCTTTGC AGCTTTGTTA TATTGGAAAG GGAGGATGGA ATTTTCCCAG AAAATTGGTT 22700 ||| ||| || ||||||| ||||||||| || |||||| |||| ||| | || ||||||| CAACTCTTGA AGATTTGTTA GATTGGAAAG AGAAGATGGA ATTTCCCCTG AATATTGGTT 564 GCTTGATAGG TGCACAAAGC GTAGGTTTGG TAACAAACTT AAAAATGATG GAATGGCTCC 22640 | |||| ||| | | || | || || || |||| ||||| || || |||| | ||||| || GTTTGAAAGG TAAAGGAATT GAAGATTAGG TAACGAACTT AAGAACGATG GGATGGCACC 624 GGTGAAGTTA TTGCTTGTGA CATTAATCGA TTTCAACCTC TGCAGACGAG CCAATTCTTG 22580 | || || || ||||| | ||| |||| | |||||||||| ||| ||| ||| ||||| ACTAAAATTG TTTCTTGTAA CATCAATCAA TTTCAACCTC CGCAAGTGAG TCAACTCTTG 684 TGGCAAATCT CCATGAAAAG TGTTGTTACT GATGTCAAGG GAAGAAAGAA ATGACAGGTT 22520 | | || | ||||| |||| |||||||||| ||||||||| || |||| |||| || || GGAGAGATGT CCATGGAAAG TGTTGTTACT AATGTCAAGG GATACGAGAA ATGAGAGATT 744 TCCGAGGTGT GGAGGAATGG TACCATGAAG TTGCATGCTT GAAATGTCTA AAGCAGTGAC 22460 ||| |||||| |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| TCCAAGGTGT GGAGGAATGG TACCATGAAG TTGCATGCTA GAAATGTCTA AAGCAGTGAC 804 TCGATG 22454 || ||| TCTATG 810 hqPGS_C06HBa0054K13.1-16-_SGN-U345542- (23156 22454) ******************************************************************************** EST sequence 8 +strand 842 n (File: SGN-U345275+) 1 ATTGACAAAC GCTGGAGCTC CACCGCGGTG GCGGCCGCTC TAGAACTAGT GGATCCCCCG 61 GGCTGCAGGC TCAACTTGTG GGGGAATAAT TTTGACATCG ATTCAACATT GAGCTTCCTT 121 GAATCATTGA CAAACTGTAG GAATCTAAGA GTACTCACGC TTGGTGGTAA TCCGTTGGAT 181 GGTGTTTTGC CTGCATCTGT TGGGAATTTC TCAAACTGCT TGCAAATATG TGAAGCATCT 241 AAATGTAAAC TGAATGGTGT CATTTCAAAA CAAATTACTA ATCTTACTGG ATTGACAAGG 301 ATGAGTCTGT CGAACAATCA GTTGATAGGC CATATTCCAA CAACAGTGCA AGGAATGCTG 361 AACCTTCAAG AACTTTACCT ATGAAGCAAC AAGTTAGAAG GAGCCATACC AGATGTTATC 421 TGCAGTTGAC AGTATCTTGG TGCATTAGAA TTGTCAGAGA ATCAATTTTC TAGTTTCGTT 481 CCACCATGCT TAGGGAATGT TACTAGTTTG AGGACACTCT ATCTAGATAA CAACAAGCTG 541 GATTCTAGAT TACCTGCAAG ATTGGGGGGA CTTCAAAACA TCATAGAGTT CAATATTTCA 601 
TCCAATTATT TGAGTGGAGA AATTCCGCTA GAGAGCGGAA ACTTGAATGG TGCAACACTG 661 ATTGATCTGT CAAATAATTA TTTTTCTGGG TAGATTCCTA GTACTCTAGG GGGCCTAGAT 721 AAATTAAATT AACTTTCTCT AGCACATAGT GGATTACAAG GGCCTATTTC TGAATCATTT 781 GACAAATTGC GGGCCTTGGA ATAACTGGGA TTTGGCCTAT TACAAATCTT AGGGGTGAAA 841 AG Predicted gene structure (within gDNA segment 21945 to 28288): Exon 1 23565 24327 ( 763 n); cDNA 76 840 ( 765 n); score: 0.814 MATCH C06HBa0054K13.1-16+ SGN-U345275+ 0.814 763 0.906 C PGS_C06HBa0054K13.1-16+_SGN-U345275+ (23565 24327) Alignment (genomic DNA sequence = upper lines): TTGCTGTGGA ATAATTTTGT CAGCGATTCT ACATTGAGCT TCCTTGCATC ATTGACAAAC 23624 ||| | ||| ||||||||| || |||||| |||||||||| |||||| ||| |||||||||| TTGTGGGGGA ATAATTTTGA CATCGATTCA ACATTGAGCT TCCTTGAATC ATTGACAAAC 135 TGTAGGAATC TAAGAGCACT CACGTTAGCA GGTAATCCGT TGGATGGTGT ATTGCCTGCA 23684 |||||||||| |||||| ||| |||| | | |||||||||| |||||||||| ||||||||| TGTAGGAATC TAAGAGTACT CACGCTTGGT GGTAATCCGT TGGATGGTGT TTTGCCTGCA 195 TCTGTTGGTA ATTTCTCAAA CTCCTTGCAA ATTTTTGAAG CATATAATTG TACACTGAAG 23744 |||||||| | |||||||||| || ||||||| || | ||||| ||| ||| || || |||||| TCTGTTGGGA ATTTCTCAAA CTGCTTGCAA ATATGTGAAG CATCTAAATG TAAACTGAAT 255 GGTGTCATTC CTCGAGAAAT TGGTAATCTT ACTGGACTGA CAAGGATGAG TCTGTTTAAC 23804 ||||||||| | | |||| | ||||||| |||||| ||| |||||||||| ||||| ||| GGTGTCATTT CAAAACAAAT TACTAATCTT ACTGGATTGA CAAGGATGAG TCTGTCGAAC 315 AATACATTAA CTGGACATAT TCCAAATACT GTACATGGCA TGTCGATCCT TCAAGAACTT 23864 ||| || | || ||||| ||||| || || || || | || || ||| |||||||||| AATCAGTTGA TAGGCCATAT TCCAACAACA GTGCAAGGAA TGCTGAACCT TCAAGAACTT 375 TACTTATTGA ACAACAAGAT AGAAGGAACC ATACCAGATG TTGTCTGTAA TTTAAAGAGA 23924 ||| ||| | ||||||| | ||||||| || |||||||||| || |||| | || | || TACCTATGAA GCAACAAGTT AGAAGGAGCC ATACCAGATG TTATCTGCAG TTGACAGTAT 435 CTTGGTGCAT TACTCTTGTC AAAAAATCAT TTTTCTGGTT CGGTACCCTT CTGCTTAGGG 23984 |||||||||| || ||||| | | ||||| |||||| ||| || || ||||||||| CTTGGTGCAT TAGAATTGTC AGAGAATCAA TTTTCTAGTT TCGTTCCACC ATGCTTAGGG 495 AACATTACTA 
GTTTGAGGAT ACTTCATCTA TATAACAACA AGCTGGATTC TACATTACCT 24044 || |||||| ||||||||| ||| ||||| ||||||||| |||||||||| || ||||||| AATGTTACTA GTTTGAGGAC ACTCTATCTA GATAACAACA AGCTGGATTC TAGATTACCT 555 TCAAACTTGG GGAACCTTCA AGATCTCATA GAATTAGATG TTTCATTCAA TTTATTTAGT 24104 ||| |||| || ||||| | | ||||| || || || |||||| ||| || || ||| GCAAGATTGG GGGGACTTCA AAACATCATA GAGTTCAATA TTTCATCCAA TTATTTGAGT 615 GGGGAAATTC CAATGGAGAG TGGAAACTTG AAGGCTGCAA CACACATTGA TTTGTCAAAT 24164 || ||||||| | | ||||| ||||||||| || | ||||| ||| ||||| | |||||||| GGAGAAATTC CGCTAGAGAG CGGAAACTTG AATGGTGCAA CACTGATTGA TCTGTCAAAT 675 AATTGTTTTT CTGGTAAGAT GCCTAGTACT CTAGGGGGTC TAGATAAATT GATTCATCTT 24224 |||| ||||| |||| |||| ||||||||| |||||||| | |||||||||| | | | ||| AATTATTTTT CTGGGTAGAT TCCTAGTACT CTAGGGGGCC TAGATAAATT AAATTAACTT 735 TCTCTAACAC ATAATAGATT AGAGGGGCCT ATTCCTGAAT CATTTGGCAA AATGTTGTCT 24284 |||||| ||| ||| | |||| | | |||||| ||| |||||| |||||| ||| | || | | TCTCTAGCAC ATAGTGGATT ACAAGGGCCT ATTTCTGAAT CATTTGACAA ATTGCGGGCC 795 TTGGAAT-AC TTGGATTTGT CCTA-TAATA ATATTAGTGG TCAAA 24327 ||||||| || | ||||||| |||| || | || |||| || | ||| TTGGAATAAC TGGGATTTGG CCTATTACAA ATCTTAGGGG TGAAA 840 hqPGS_C06HBa0054K13.1-16+_SGN-U345275+ (23565 24327) ******************************************************************************** EST sequence 2 +strand 1298 n (File: SGN-U322835+) 1 CAATTCCCCT TTCAATGAAT TTCCCAAATC CCCATTCAAT TTTACTTCTT TTTGGATAAA 61 AAATGATAAG TGTTTGTGTG TGTTTGCAGC CCACAATAAT GGCGAAGATG AAGGTGGTGA 121 GGAGTGAAAT TGCTGCGAAA CAAGTGGTTG TGATCGAGGA AAATGAGGAG ATACATTGGT 181 ATGCTTCTTG AATTTCGTCC GGAGGACACA GCTACTCAAA TCCAAATGCA AAGGATCTTC 241 AAGAGTAAAG GAATAAAACA AGACCTTGGT TCATTGTGAA AGTTATGTGT GATCCATAGC 301 TGAAAGTTTG GCATTTATCT CGGTTTTCCC TTCCTATTTG ATTTTTTTTC AATAATCGCC 361 GTTAGGTCAA TTCTGCTATT TAAGTCAAGT TTCAACCAGC TTTTGGGAAT TCTGGAAGTC 421 ACGTGCTCAT CAAAGTAAAT TTTATATACA CTGATGCAAC GGTCTATCTC CACAATGGGT 481 GTTCAAATCC AGTGGTGCAT TGTGACTTGA AGCCAAGTGT CTTGCTTGAT CAAGACATGG 541 TTGGCCATGT 
CAGTGATTTT GGCATTGCAA AATTGTTAGG TGCAGGGGAG AGTTTTGTTC 601 AAACAAGGAC AATAGCAACC ATTGGATAAT TGCTCCAGAG TATGGACAAG ATGGAATCGT 661 ATCCACAAGC TGCGATGTTT ATAGTTTCGG TATCCTGATT GATGGAGACG TTTACAAGAA 721 TCAGACCAGG TGATGAAAGA TTTACTGGAG AGTTGAGCAT ACGACGTTGG GTTAGTGATT 781 CTTTTCCAGA TGAGATTCAT AAGGTGGTGG ATGCTAATTT GGTACAGCCT AGGGGATGAA 841 CGAATTGACG CAAAGATGCA GTGTCTGTTG TCTATTATAG AGTTAGCTTT GAGCTGTACT 901 TTAGCAACAC CTGATGCAAG AATTAGTATG GAAGATTCTC TTTCAACACT TCAAAATATC 961 AGGCTCCAGT TTGTCAATAG TCGCCACCGA AAAAAGCAAC TGAAGGATTT AGTACCGAAA 1021 AAAGCAACTT GCTTGGTAAT GGCAAGGTCT ACAATGTACA ATTGGAGGGT GCATTCAAAA 1081 GTTTTGATAC AGAAGAATGT GAAATCTGAC CAAAGTCATC AAAGCCTTAA TGTTAGAATA 1141 CATGTCTAGT GGGACACTTG ATAAATGGCT GTACTCTCAC AAGTTGTTCT TGGATTTACT 1201 TCATATTATG TACTCTTTCA CTTTCAGTGC CAGCTGGAAT GGTGATTTTC TAGCTACTGG 1261 AGTTCACAAT TCTAATCCAT AAAAAAAAAA AAAAAAAA Predicted gene structure (within gDNA segment 19015 to 28288): Exon 1 20241 20248 ( 8 n); cDNA 401 408 ( 8 n); score: 0.625 Intron 1 20249 22160 (1912 n); Pd: 0.516 (s: 0), Pa: 0.105 (s: 0) Exon 2 22161 22181 ( 21 n); cDNA 409 428 ( 20 n); score: 0.667 Intron 2 22182 24995 (2814 n); Pd: 0.036 (s: 0), Pa: 0.997 (s: 0.60) Exon 3 24996 25212 ( 217 n); cDNA 429 638 ( 210 n); score: 0.811 Intron 3 25213 25306 ( 94 n); Pd: 1.000 (s: 0.90), Pa: 0.953 (s: 0.88) Exon 4 25307 25653 ( 347 n); cDNA 639 987 ( 349 n); score: 0.813 PPA cDNA 1279 1298 MATCH C06HBa0054K13.1-16+ SGN-U322835+ 0.812 593 0.457 C PGS_C06HBa0054K13.1-16+_SGN-U322835+ (20241 20248,22161 22181,24996 25212,25307 25653) Alignment (genomic DNA sequence = upper lines): TTGAGGGGGT ACCTGTGCAT TTTCCCTCAG AATTATTTAC ATCTTAACTT TATCAGTTTT 20300 || ||| TTTTGGGA.. .......... .......... .......... .......... .......... 408 TAAGTTTTAT ATTTGAACTA TTGAAAGTAT GAGTTTTCGA CCTAAACAAT CACCTAATGA 20360 .......... .......... .......... .......... .......... .......... 408 TTAGTGAAAC ACACCTTGAC TAATTTATAA TGTTGTGGTT AATACACTCT CTTATCTATT 20420 .......... .......... 
.......... .......... .......... .......... 408 TAAAATTTTT GTCATGTAGC TCTCCACATG AAAAAATAAT TTCACTGTGA CAAAGCTAAA 20480 .......... .......... .......... .......... .......... .......... 408 TAAATTAATA TTACGTTAAA TGTTAGGATT AAAGATACAT ATTATCACCA TATAAATTAA 20540 .......... .......... .......... .......... .......... .......... 408 TTTATTCAGT TTTGTGCCTG ATATCTTCTT CTCCACCTTA TTAGTCGTGG TCTCATCTCA 20600 .......... .......... .......... .......... .......... .......... 408 GCCGCGGGTA AATATCTCAT ATACTTGGAA GGGGAGAATT TAGATTATAT ATATTCATTC 20660 .......... .......... .......... .......... .......... .......... 408 TCTAATATTA GTAATCAATG TGGTGAGATT AGTGAATTTC TTTCTTCTAA AATTTGAACA 20720 .......... .......... .......... .......... .......... .......... 408 CTTGAAACGA ATTTTTGTAA TATAAACTAT TTATAACCAA CACAAAATTG TCTTATTTGA 20780 .......... .......... .......... .......... .......... .......... 408 ACTAGTGAAT TCCAAATAAT AAAGGTTAAT AACCAAGTGG CAACTAACCA TTATAAGGTC 20840 .......... .......... .......... .......... .......... .......... 408 ACACAATTTT AAGATAATAC TGTAAAGCAT TCTTTGATTA TCTGCCTTGA CCTTGCTTTG 20900 .......... .......... .......... .......... .......... .......... 408 TGCCTTTGGT CTTTGGCTCA ACAATAAGAT AGTATATTGG AAAGTAAATA TTTTTGCTTT 20960 .......... .......... .......... .......... .......... .......... 408 ACTGATTGCT TTGCTTTTTT GCAGAAAAGG GGATATTATT TGTTTTTTTC CACTTCCCTT 21020 .......... .......... .......... .......... .......... .......... 408 TTTATAATGG AAGACTCTTG AAAGCAATTG AGTCTATTTA AATGTTATTT TATTCTACCT 21080 .......... .......... .......... .......... .......... .......... 408 TTTATTTTCT TCCTTTAACT TCTTGTTCAA GTAAAAAAAA GTTTTTGAAT TTTGAAAGAT 21140 .......... .......... .......... .......... .......... .......... 408 AAAAGAATAA TATCATAGTT ATAGTTATAT ATAAAAGGGA AAAAGGTCTG ATATACCCCT 21200 .......... .......... .......... .......... .......... .......... 408 CAACTTTGTC ATTTGGAGCT CATATACCCC TCGTTATAAA AGTGGCTCAT ATATGCCCTT 21260 .......... .......... .......... 
.......... .......... .......... 408 ACCGTTATAC AAACGGCTCA CATTTACCCC TGCTGTTATA AAATGACTCA CATATACCCT 21320 .......... .......... .......... .......... .......... .......... 408 TCATTTAACG GAAGTTAAAA AATTTATTTT AAATTTATAT TTATTACTTT TAATTTTTTT 21380 .......... .......... .......... .......... .......... .......... 408 TAAAAAAAAT TATTTAGAGA TATATATGAT TCTTCTATCA AAGTTCAATG TATATTTTAA 21440 .......... .......... .......... .......... .......... .......... 408 TTTTTTTCAT ACATAAATTA TTTTTTGACT TCATTTATTA TAATTATTTG AGTTTCTTAT 21500 .......... .......... .......... .......... .......... .......... 408 TCTTATTTTG TTTTTTTCTT TCATTCCTTA GTTTAAATAA AAAAATTAAA CTATTTTTTT 21560 .......... .......... .......... .......... .......... .......... 408 ACTGTGTATT GTAATTTAAT TTCGTATTCG AAGAAAAAAT TTGGTCATCT ACAATAAGTT 21620 .......... .......... .......... .......... .......... .......... 408 TTACAAGAAT ATTAGTGAAA CATAAATAAA TTTGATTATC AAAATAATAA TTATAAATTA 21680 .......... .......... .......... .......... .......... .......... 408 GTCATTGAAA TACAAAAAAA TATGTTTGAC GATGAATTAA TTTACTCATA TGGAGGCCCT 21740 .......... .......... .......... .......... .......... .......... 408 CACCCTTATT GGATTCAATT ATTTGCATGG TATGAATTTA TTTTCTTTTT AGCTTTCTAC 21800 .......... .......... .......... .......... .......... .......... 408 AATTTTATGT ATTCATTCAT TTCAACCTTT CCATTTTTTT TTCATATTTT TTTATTATCT 21860 .......... .......... .......... .......... .......... .......... 408 ATAACACATA CAACTAGTTT TTTATCAGAT TAGATGACTG ATAAGTGTCA AAGATAGACT 21920 .......... .......... .......... .......... .......... .......... 408 TTCCACGACA TATGAAAGAA AGTAGTCAAT ATATTGAATG AAGGGAGGGA GCCTTAATTA 21980 .......... .......... .......... .......... .......... .......... 408 GATAAGGTTG TTGGAGAAAA AAGTTTTCGA ATGTTACAAC AATTTCGATC AAAATTAGAA 22040 .......... .......... .......... .......... .......... .......... 408 TATATTTTAG ATCCTTTTCC CTATTTTAAA AAATTACAAA TTCTCTAAAA AGTTTTACTT 22100 .......... .......... .......... .......... 
.......... .......... 408 ATCGTAATTG GTTTCGTGAA ATCAAGGGGA GAAAAACAAA GTACTTAAAT TGATTTTCAG 22160 .......... .......... .......... .......... .......... .......... 408 AAATTAGAAA TCACGTGCAA CGCACATGGT AGTGTTATAA ATAGAATACC ACAACATTAT 22220 | | ||| |||||||| | ATTCTGGAAG TCACGTGC-T C......... .......... .......... .......... 428 AGGTATCATT AAATTTGATA TCTATATTAG TTATGGACAG AAGTTCCACT CTCCTCTTTT 22280 .......... .......... .......... .......... .......... .......... 428 TTCTTATAAT TTTCATTCTA CTATTACTAC ATGCCAATGC TAATAATATT AGCACTGATG 22340 .......... .......... .......... .......... .......... .......... 428 AAGCTGCTCT TCTTGCACTT AAATCACATA TTTCTAACGA TATCTTAGCA ACAAACTGGT 22400 .......... .......... .......... .......... .......... .......... 428 CTTCTTCCGT CCCCGTTTGT AGTTGGATTG GAATCACTTG CAGCTCTCGA CACCATCGAG 22460 .......... .......... .......... .......... .......... .......... 428 TCACTGCTTT AGACATTTCA AGCATGCAAC TTCATGGTAC CATTCCTCCA CACCTCGGAA 22520 .......... .......... .......... .......... .......... .......... 428 ACCTGTCATT TCTTTCTTCC CTTGACATCA GTAACAACAC TTTTCATGGA GATTTGCCAC 22580 .......... .......... .......... .......... .......... .......... 428 AAGAATTGGC TCGTCTGCAG AGGTTGAAAT CGATTAATGT CACAAGCAAT AACTTCACCG 22640 .......... .......... .......... .......... .......... .......... 428 GAGCCATTCC ATCATTTTTA AGTTTGTTAC CAAACCTACG CTTTGTGCAC CTATCAAGCA 22700 .......... .......... .......... .......... .......... .......... 428 ACCAATTTTC TGGGAAAATT CCATCCTCCC TTTCCAATAT AACAAAGCTG CAAAGGTTAT 22760 .......... .......... .......... .......... .......... .......... 428 ACTTGGATAG AAATTTTCTT GAAGGAGAGA TCCCTCGAGA AATCGGTGAT CTTCGTTACT 22820 .......... .......... .......... .......... .......... .......... 428 TGACTATCCT AGACCTGCAA ATTAATCAGC TTAGTGGCTC TATACCACCA TCCATTTTTA 22880 .......... .......... .......... .......... .......... .......... 428 ACATTACTAC AATGCATGTG ATTGCTCTTA CGGGCAACAA TCTTACTGGA AATCTTCCAA 22940 .......... .......... .......... 
.......... .......... .......... 428 AAACGATATG TGATCATCTT CGAGACTTGG AAGGACTTTA CCTCAGTAAA AACTCCCTAG 23000 .......... .......... .......... .......... .......... .......... 428 ATGGAGTTAT TCCACCAAAC CTGGAGAAAT GCAGAAAGCT TCAAAAATTG CAATTGGGTG 23060 .......... .......... .......... .......... .......... .......... 428 AAAATAAGTT TATTGGAACT TTACCAAGAG AGCTAGCCAA CTTAACAGCT CTTACATATT 23120 .......... .......... .......... .......... .......... .......... 428 TATATCTTTC AGAATTGCAT TTGGAAGGTA GTATGAAATT TTCTTCCTTT TTCTCTTTTC 23180 .......... .......... .......... .......... .......... .......... 428 CGCCAAACAT AGTACTCATG ATATATTTCC TGAAAATTGA CAGGTGAGAT ACCAATGGAG 23240 .......... .......... .......... .......... .......... .......... 428 TTAGGTAATC TTCAAAAACT TCAGGAGTTG GATTTAGCAC TGAATGAGCT AACTGGCTCT 23300 .......... .......... .......... .......... .......... .......... 428 GTTCCTCACA GCATTTTCAA CATGTCAGCA CTGCAGTATA TAGATTTTGG AGAAAATAAC 23360 .......... .......... .......... .......... .......... .......... 428 CTTTCAGGTA CTCTACCTTC AGATTTAGGT CGTGGAATGC CCAACCTAGA AATATTTTAT 23420 .......... .......... .......... .......... .......... .......... 428 TGTGGAGGAA ATGATCTGAG TGGTTTTATC TCTGATTCAA TCTCAAATTC ATCAAAACTC 23480 .......... .......... .......... .......... .......... .......... 428 AGAGAATTTG ATCTCTCACA AAACAGTTTC ACAGGTTCAA TTCCTAAATC ACTTGGTAAC 23540 .......... .......... .......... .......... .......... .......... 428 TTAGAATACC TTGAGTTACT TGACTTGCTG TGGAATAATT TTGTCAGCGA TTCTACATTG 23600 .......... .......... .......... .......... .......... .......... 428 AGCTTCCTTG CATCATTGAC AAACTGTAGG AATCTAAGAG CACTCACGTT AGCAGGTAAT 23660 .......... .......... .......... .......... .......... .......... 428 CCGTTGGATG GTGTATTGCC TGCATCTGTT GGTAATTTCT CAAACTCCTT GCAAATTTTT 23720 .......... .......... .......... .......... .......... .......... 428 GAAGCATATA ATTGTACACT GAAGGGTGTC ATTCCTCGAG AAATTGGTAA TCTTACTGGA 23780 .......... .......... .......... .......... 
.......... .......... 428 CTGACAAGGA TGAGTCTGTT TAACAATACA TTAACTGGAC ATATTCCAAA TACTGTACAT 23840 .......... .......... .......... .......... .......... .......... 428 GGCATGTCGA TCCTTCAAGA ACTTTACTTA TTGAACAACA AGATAGAAGG AACCATACCA 23900 .......... .......... .......... .......... .......... .......... 428 GATGTTGTCT GTAATTTAAA GAGACTTGGT GCATTACTCT TGTCAAAAAA TCATTTTTCT 23960 .......... .......... .......... .......... .......... .......... 428 GGTTCGGTAC CCTTCTGCTT AGGGAACATT ACTAGTTTGA GGATACTTCA TCTATATAAC 24020 .......... .......... .......... .......... .......... .......... 428 AACAAGCTGG ATTCTACATT ACCTTCAAAC TTGGGGAACC TTCAAGATCT CATAGAATTA 24080 .......... .......... .......... .......... .......... .......... 428 GATGTTTCAT TCAATTTATT TAGTGGGGAA ATTCCAATGG AGAGTGGAAA CTTGAAGGCT 24140 .......... .......... .......... .......... .......... .......... 428 GCAACACACA TTGATTTGTC AAATAATTGT TTTTCTGGTA AGATGCCTAG TACTCTAGGG 24200 .......... .......... .......... .......... .......... .......... 428 GGTCTAGATA AATTGATTCA TCTTTCTCTA ACACATAATA GATTAGAGGG GCCTATTCCT 24260 .......... .......... .......... .......... .......... .......... 428 GAATCATTTG GCAAAATGTT GTCTTTGGAA TACTTGGATT TGTCCTATAA TAATATTAGT 24320 .......... .......... .......... .......... .......... .......... 428 GGTCAAATTC CAAAGTCATT AGAAGCTCTT GTGTATCTCA AATACCTAAA TTTCTCTTTC 24380 .......... .......... .......... .......... .......... .......... 428 AATGAACTCA GTGGAGAAAT TCCCACTGGT GGTCCCTTTG CAAATGTAAC CAGTAAGTCT 24440 .......... .......... .......... .......... .......... .......... 428 TTCTTGTCCG ATGATGCACT TTGTGGTGAC TCCCGATTTA ACGTAAAACC ATGCCTAACC 24500 .......... .......... .......... .......... .......... .......... 428 AAATCGACAA AGAAGTCCAA AAGAAAACGA GTGCTTATGG CTTTATATAT TCTATTAGGG 24560 .......... .......... .......... .......... .......... .......... 428 ATTGGATCAC TCTTCACGTT GACTGTTGGA ATTGTGGTGT TAAGATTGAG AAACACAAAG 24620 .......... .......... .......... .......... .......... 
.......... 428 AAGAATGCTA CTCAAAAGGA TGTGTCCCTC GTAAAAGGGC ATGAAAGAAT TTCATATTAC 24680 .......... .......... .......... .......... .......... .......... 428 GAACTTGAAC AGGCAACTGA AGGGTTCAAC GAAGCCAACT TGCTTGGTAA TGGGAGTTTC 24740 .......... .......... .......... .......... .......... .......... 428 AGCAGGGTTT ATAAAGGGAT ACTTAAGGAT GGTATCATTT TTGCAGCAAA GGTATTCAAT 24800 .......... .......... .......... .......... .......... .......... 428 GTGCAATTGG AGGGTGCATT CAAAAGCTTT GACACAGAAT GTGAGGTACT CCGGAACCTT 24860 .......... .......... .......... .......... .......... .......... 428 CGCCACAGAA ATCTGACCAA AGTCATCACC AGCTGCTCCA ATCTTTATTT CAAGGCCCTA 24920 .......... .......... .......... .......... .......... .......... 428 GTGTTGGAAT ACATGCCCAA TGGGACACTT GATAAATGGT TATACTCTCA CAATTTGTTC 24980 .......... .......... .......... .......... .......... .......... 428 TTGAACTTAT TGCAGAGATT GGATATAATG ATAGATGTTG CATCTGCAAT GGATTATCTC 25040 | | || ||| || || | ||||| || |||||| .......... .....ATCAA AG-TA-AATT TTATAT-ACA CTGATGCAAC GGTCTATCTC 470 CACAATGGCT ATTCAACGCC TGTGGTGCAT TGTGACTTGA AGCCAAGTAA TGTCTTGTTA 25100 |||||||| | ||||| || ||||||||| |||||||||| ||||||| ||||||| | CACAATGGGT GTTCAAATCC AGTGGTGCAT TGTGACTTGA AGCCAAG--- TGTCTTGCTT 527 GATGAAGAAA TGGTTGCTCA TGTAAGTGAT TTTGGCATTG CAAAAATGTT AGGTGCAGGG 25160 ||| |||| | |||||| || ||| |||||| |||||||||| ||||| |||| |||||||||| GATCAAGACA TGGTTGGCCA TGTCAGTGAT TTTGGCATTG CAAAATTGTT AGGTGCAGGG 587 GAGGCTTTTG TTCAAACAAG GACAGTTGCA ACCATTGGAT ATATTGCTCC AGGTATATTT 25220 ||| ||||| |||||||||| |||| | ||| |||||||||| | |||||||| || GAGAGTTTTG TTCAAACAAG GACAATAGCA ACCATTGGAT A-ATTGCTCC AG........ 638 CAACTTTTTA AAGTTCTCAT ATCATGTAAA TACTCAAAAC AGTTAGGTAT AAATTGATTT 25280 .......... .......... .......... .......... .......... .......... 638 GTGATCATTT TCGACCATGA TTGCAGAGTA TGGACAAGAT GGCATAGTAT CCACGAGTTG 25340 |||| |||||||||| || || |||| |||| || || .......... .......... 
......AGTA TGGACAAGAT GGAATCGTAT CCACAAGCTG 672 TGATGTTTAT AGTTTTGGCA TCGTGA-TGA TGGAGACGTT CACACGAACA AGACCAAGTG 25399 ||||||||| ||||| || | || ||| ||| |||||||||| ||| ||| |||||| ||| CGATGTTTAT AGTTTCGGTA TCCTGATTGA TGGAGACGTT TACAAGAATC AGACCAGGTG 732 ATGAGATATT TACTGGAGAC TTGAGCATAC AGCGTTGGGT TAATGATTCC TTTCCGGGTG 25459 |||| | ||| ||||||||| |||||||||| |||||||| || |||||| ||||| | || ATGAAAGATT TACTGGAGAG TTGAGCATAC GACGTTGGGT TAGTGATTCT TTTCCAGATG 792 AAATTCACAA GGTGGTGGAT TCGAATTTGG TACAGCC-AG GAGATGAACA AATCGCTGCA 25518 | ||||| || |||||||||| | ||||||| ||||||| || | ||||||| ||| | ||| AGATTCATAA GGTGGTGGAT GCTAATTTGG TACAGCCTAG GGGATGAACG AATTGACGCA 852 AAGATGCAAT GTTTGTTATC TATCATGGAA TTAGCTTTGA AGTGCACTTT AGTGAGACCT 25578 |||||||| | || |||| || ||| || || |||||||||| || ||||| || | |||| AAGATGCAGT GTCTGTTGTC TATTATAGAG TTAGCTTTGA GCTGTACTTT AGCAACACCT 912 GATGAAAGAA TTAGCATGAA TGATGCTCTT TCAGCACTCA AAAAGATTAG ACGACAGCTT 25638 |||| ||||| |||| ||| | ||| ||||| ||| |||| |||| || || | ||| || GATGCAAGAA TTAGTATGGA AGATTCTCTT TCAACACTTC AAAATATCAG GCTCCAGTTT 972 GTTAGTAGTC GGCAC 25653 || | ||||| | ||| GTCAATAGTC GCCAC 987 hqPGS_C06HBa0054K13.1-16+_SGN-U322835+ (24996 25212,25307 25653) Total number of EST alignments reported: 18 ________________________________________________________________________________ Predicted gene locations (7) in segment 1 to 28288: PGL 1 (+ strand): 2966 3816 AGS-1 (2966 2975,3130 3816) SCR (e 0.900 d 0.796 a 0.000,e 0.881) Exon 1 2966 2975 ( 10 n); score: 0.900 Intron 1 2976 3129 ( 154 n); Pd: 0.796 Pa: 0.000 Exon 2 3130 3816 ( 687 n); score: 0.881 PGS (2966 2975,3130 3816) SGN-U341961+ 3-phase translation of AGS-1 (+strand): . : . . . . . 2966 ACACATAGAA : GACAAGAAGCGCAAATAATCAGTAGCAAATATACGCAGCAAAAGAAGCCA T H R : R Q E A Q I I S S K Y T Q Q K K P H I E : D K K R K - S V A N I R S K R S Q T - K : T R S A N N Q - Q I Y A A K E A . . . . . . 
3180 AGCTCGAGTCCAAGATCGAGTAAAATGGATCAGCCAAGTAATCAAGAAAATATAGTTATT S S S P R S S K M D Q P S N Q E N I V I A R V Q D R V K W I S Q V I K K I - L F K L E S K I E - N G S A K - S R K Y S Y . . . . . . 3240 TGTCAAAGCCAAAAATAATATATTAAAGGTATAAGTTGATACACATAAGGAAAACCTTAA C Q S Q K - Y I K G I S - Y T - G K P - V K A K N N I L K V - V D T H K E N L K L S K P K I I Y - R Y K L I H I R K T L . . . . . . 3300 ATATATTATTAAGGAAGGAAGTTGTATATAATAAGGATTGTACTTTAAAAGGGAATCCTT I Y Y - G R K L Y I I R I V L - K G I L Y I I K E G S C I - - G L Y F K R E S L N I L L R K E V V Y N K D C T L K G N P . . . . . . 3360 GTTTAACAATAAGTTCCTTACCTTATCTGATAAGGATTTAAGGCAAAAGAAACTCTATAA V - Q - V P Y L I - - G F K A K E T L - F N N K F L T L S D K D L R Q K K L Y K C L T I S S L P Y L I R I - G K R N S I . . . . . . 3420 GAAGAGGACGATGTTGATGAAGAATCCCATGACTTCACATTGAGAGAAAAGTGATAATTA E E D D V D E E S H D F T L R E K - - L K R T M L M K N P M T S H - E K S D N Y R R G R C - - R I P - L H I E R K V I I . . . . . . 3480 TTCATCAAACTGAAGTCTTCAAGATCGATAGGTTTGATACGTTTAAAACGTTCTTGAGTG F I K L K S S R S I G L I R L K R S - V S S N - S L Q D R - V - Y V - N V L E - I H Q T E V F K I D R F D T F K T F L S . . . . . . 3540 AGAAGGTTTTTAACGCGTAGAACTATCTTCTTGTTTTGTTCTTGATTGTAATTTACAGTT R R F L T R R T I F L F C S - L - F T V E G F - R V E L S S C F V L D C N L Q F E K V F N A - N Y L L V L F L I V I Y S . . . . . . 3600 TATTATACAAAGGGTGGGTTTGGCTCTTTGTAGGGTTGAGTTTTCGTGAGGATTGTAACA Y Y T K G G F G S L - G - V F V R I V T I I Q R V G L A L C R V E F S - G L - Q L L Y K G W V W L F V G L S F R E D C N . . . . . . 3660 AAAGGTGGGTTTGGCCTTTTGGAGAGATCAATTGTAGTCAATCGAGAGAGTTAGTAGAGA K G G F G L L E R S I V V N R E S - - R K V G L A F W R D Q L - S I E R V S R D K R W V W P F G E I N C S Q S R E L V E . . . . . . 3720 TAAGGCTTTTTGATTATTGAGTTGTAATCACAAAATCTTATAGTTGAATTAATAAAATGA - G F L I I E L - S Q N L I V E L I K - K A F - L L S C N H K I L - L N - - N E I R L F D Y - V V I T K S Y S - I N K M . . . . 
3780 GGTTTTTCCTTCCTTGAGTGCGGAAGGTTTTTAATTT G F S F L E C G R F L I V F P S L S A E G F - F R F F L P - V R K V F N Maximal non-overlapping open reading frames (>= 64 codons): none PGL 2 (- strand): 11880 11197 AGS-1 (11880 11197) SCR (e 0.892) Exon 1 11880 11197 ( 684 n); score: 0.892 PGS (11880 11197) SGN-U341961- 3-phase translation of AGS-1 (-strand): . . . . . . 11880 AAATTAAAAACCTTCCGCACTCAAGGAAGGAAAAACCTCGTTTTATTAATTCAACTATAA K L K T F R T Q G R K N L V L L I Q L - N - K P S A L K E G K T S F Y - F N Y K I K N L P H S R K E K P R F I N S T I . . . . . . 11820 GATTTTGTGATTACAACTCAATAATCAAAAAGCCTTATCTCTACTACCTCTCTCGATTGA D F V I T T Q - S K S L I S T T S L D - I L - L Q L N N Q K A L S L L P L S I D R F C D Y N S I I K K P Y L Y Y L S R L . . . . . . 11760 CTACAATCGATCTCTCCAAAAGGCCAAACCCACCTTTTGTTACAATCCTCACGAAAACTC L Q S I S P K G Q T H L L L Q S S R K L Y N R S L Q K A K P T F C Y N P H E N S T T I D L S K R P N P P F V T I L T K T . . . . . . 11700 AACCCTACAAAGAGCCAAACCCACCCTTTGTATAATAAACTGTAAATTACAATCAAGAAC N P T K S Q T H P L Y N K L - I T I K N T L Q R A K P T L C I I N C K L Q S R T Q P Y K E P N P P F V - - T V N Y N Q E . . . . . . 11640 AAAACAAGAAGATAGTTCTACACGTTAAAAACCTTCTCACTCAAGAACGTTTTAAACGTA K T R R - F Y T L K T F S L K N V L N V K Q E D S S T R - K P S H S R T F - T - Q N K K I V L H V K N L L T Q E R F K R . . . . . . 11580 GCAAACCTATTGATCTTGAAGACTTTAGTTTGATGAATAATTCTCACTTTTCTCTCTATG A N L L I L K T L V - - I I L T F L S M Q T Y - S - R L - F D E - F S L F S L C S K P I D L E D F S L M N N S H F S L Y . . . . . . 11520 TGAAGTCGTGGGATTCTTCATCAGCATCGTCCTCTTCTTATAGAGTTTCTTTTGCCTTAA - S R G I L H Q H R P L L I E F L L P - E V V G F F I S I V L F L - S F F C L K V K S W D S S S A S S S S Y R V S F A L . . . . . . 11460 GTCCTTATCAGATGAGGTAAGAAAGTTATTGTTAAACAAGGATTCCCTTTTAAAGTACAA V L I R - G K K V I V K Q G F P F K V Q S L S D E V R K L L L N K D S L L K Y N S P Y Q M R - E S Y C - T R I P F - S T . . . . . . 
11400 TCCTTATTATATACAACTTCCTTCCTTAATAATATATTTAAGGTTTTCCTTATTTGTATC S L L Y T T S F L N N I F K V F L I C I P Y Y I Q L P S L I I Y L R F S L F V S I L I I Y N F L P - - Y I - G F P Y L Y . . . . . . 11340 AACTTATACCTTTAATATATTATTTTTGGCTTTGACAAATAACTCTATTTTCTTGATTAC N L Y L - Y I I F G F D K - L Y F L D Y T Y T F N I L F L A L T N N S I F L I T Q L I P L I Y Y F W L - Q I T L F S - L . . . . . . 11280 TTGGCTGACCCATTTTACTCGATCATGGACTCGAGCTTGGCTTCTTTTGCTGCGTACATT L A D P F Y S I M D S S L A S F A A Y I W L T H F T R S W T R A W L L L L R T F L G - P I L L D H G L E L G F F C C V H . . . 11220 TGCTACTGATTATTTGCGCTTCTT C Y - L F A L L A T D Y L R F L L L I I C A S Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-16-_PGL-2_AGS-1_PPS_1 (11477 11199) (frame '2'; 279 bp, 93 residues) 1 SFFCLKSLSD EVRKLLLNKD SLLKYNPYYI QLPSLIIYLR FSLFVSTYTF NILFLALTNN 61 SIFLITWLTH FTRSWTRAWL LLLRTFATDY LRF >C06HBa0054K13.1-16-_PGL-2_AGS-1_PPS_2 (11878 11666) (frame '0'; 210 bp, 70 residues) 1 IKNLPHSRKE KPRFINSTIR FCDYNSIIKK PYLYYLSRLT TIDLSKRPNP PFVTILTKTQ 61 PYKEPNPPFV - 3-phase translation of AGS-1 (+strand): . . . . . . 11197 AAGAAGCGCAAATAATCAGTAGCAAATGTACGCAGCAAAAGAAGCCAAGCTCGAGTCCAT K K R K - S V A N V R S K R S Q A R V H R S A N N Q - Q M Y A A K E A K L E S M E A Q I I S S K C T Q Q K K P S S S P . . . . . . 11257 GATCGAGTAAAATGGGTCAGCCAAGTAATCAAGAAAATAGAGTTATTTGTCAAAGCCAAA D R V K W V S Q V I K K I E L F V K A K I E - N G S A K - S R K - S Y L S K P K - S S K M G Q P S N Q E N R V I C Q S Q . . . . . . 11317 AATAATATATTAAAGGTATAAGTTGATACAAATAAGGAAAACCTTAAATATATTATTAAG N N I L K V - V D T N K E N L K Y I I K I I Y - R Y K L I Q I R K T L N I L L R K - Y I K G I S - Y K - G K P - I Y Y - . . . . . . 11377 GAAGGAAGTTGTATATAATAAGGATTGTACTTTAAAAGGGAATCCTTGTTTAACAATAAC E G S C I - - G L Y F K R E S L F N N N K E V V Y N K D C T L K G N P C L T I T G R K L Y I I R I V L - K G I L V - Q - . . . . . . 
11437 TTTCTTACCTCATCTGATAAGGACTTAAGGCAAAAGAAACTCTATAAGAAGAGGACGATG F L T S S D K D L R Q K K L Y K K R T M F L P H L I R T - G K R N S I R R G R C L S Y L I - - G L K A K E T L - E E D D . . . . . . 11497 CTGATGAAGAATCCCACGACTTCACATAGAGAGAAAAGTGAGAATTATTCATCAAACTAA L M K N P T T S H R E K S E N Y S S N - - - R I P R L H I E R K V R I I H Q T K A D E E S H D F T - R E K - E L F I K L . . . . . . 11557 AGTCTTCAAGATCAATAGGTTTGCTACGTTTAAAACGTTCTTGAGTGAGAAGGTTTTTAA S L Q D Q - V C Y V - N V L E - E G F - V F K I N R F A T F K T F L S E K V F N K S S R S I G L L R L K R S - V R R F L . . . . . . 11617 CGTGTAGAACTATCTTCTTGTTTTGTTCTTGATTGTAATTTACAGTTTATTATACAAAGG R V E L S S C F V L D C N L Q F I I Q R V - N Y L L V L F L I V I Y S L L Y K G T C R T I F L F C S - L - F T V Y Y T K . . . . . . 11677 GTGGGTTTGGCTCTTTGTAGGGTTGAGTTTTCGTGAGGATTGTAACAAAAGGTGGGTTTG V G L A L C R V E F S - G L - Q K V G L W V W L F V G L S F R E D C N K R W V W G G F G S L - G - V F V R I V T K G G F . . . . . . 11737 GCCTTTTGGAGAGATCGATTGTAGTCAATCGAGAGAGGTAGTAGAGATAAGGCTTTTTGA A F W R D R L - S I E R G S R D K A F - P F G E I D C S Q S R E V V E I R L F D G L L E R S I V V N R E R - - R - G F L . . . . . . 11797 TTATTGAGTTGTAATCACAAAATCTTATAGTTGAATTAATAAAACGAGGTTTTTCCTTCC L L S C N H K I L - L N - - N E V F P S Y - V V I T K S Y S - I N K T R F F L P I I E L - S Q N L I V E L I K R G F S F . . . 11857 TTGAGTGCGGAAGGTTTTTAATTT L S A E G F - F - V R K V F N L E C G R F L I Maximal non-overlapping open reading frames (>= 64 codons): none PGL 3 (+ strand): 13077 13797 AGS-1 (13077 13797) SCR (e 0.712) Exon 1 13077 13797 ( 721 n); score: 0.712 PGS (13077 13797) SGN-U345542+ 3-phase translation of AGS-1 (+strand): . . . . . . 13077 GGAATCACTTGCAGCTCCCGTCACCATCGAGTCACTGCTTTAGACATTTCAAGCATGCAA G I T C S S R H H R V T A L D I S S M Q E S L A A P V T I E S L L - T F Q A C N N H L Q L P S P S S H C F R H F K H A . . . . . . 
13137 CTTTATGGTACCATTCCTCCACACCTTGGAAACCTCTCATTTATTTCATCGCTTGACATC L Y G T I P P H L G N L S F I S S L D I F M V P F L H T L E T S H L F H R L T S T L W Y H S S T P W K P L I Y F I A - H . . . . . . 13197 AGTAACAACACTTTCCATGGAGAGTTGCCACTAGAGTTGGTTCGTTTGCAGAGGTTGAAA S N N T F H G E L P L E L V R L Q R L K V T T L S M E S C H - S W F V C R G - N Q - Q H F P W R V A T R V G S F A E V E . . . . . . 13257 TTCTTTAATACTAAAAACAATAACTTCACCGGAGCCATTCCATCATTTTTAAGTTTGTTA F F N T K N N N F T G A I P S F L S L L S L I L K T I T S P E P F H H F - V C Y I L - Y - K Q - L H R S H S I I F K F V . . . . . . 13317 CCAAACCTACGCTTTCTGTACCTATCGAATAACCAATTTTCGGGTAAAATTCCATCCTCC P N L R F L Y L S N N Q F S G K I P S S Q T Y A F C T Y R I T N F R V K F H P P T K P T L S V P I E - P I F G - N S I L . . . . . . 13377 CTTTCCAATCTGACAAAACTGCAAGTGTTGTCAATACAGAGTAATTATATTGAAGGAGAG L S N L T K L Q V L S I Q S N Y I E G E F P I - Q N C K C C Q Y R V I I L K E R P F Q S D K T A S V V N T E - L Y - R R . . . . . . 13437 ATCCCTCAAGAACTCGGTGATCTTCGTTCCTTGATTATCCTAAACCTGCAATATAATCAG I P Q E L G D L R S L I I L N L Q Y N Q S L K N S V I F V P - L S - T C N I I S D P S R T R - S S F L D Y P K P A I - S . . . . . . 13497 CTTAGTGGCTCTATACCATCTTCAATCTTTGACATCACTACAATGCAAGTAATTGCTCTT L S G S I P S S I F D I T T M Q V I A L L V A L Y H L Q S L T S L Q C K - L L L A - W L Y T I F N L - H H Y N A S N C S . . . . . . 13557 AGTGGCAACAATCTTACTGGAAAGATTCCAATCACGATATGTGATCATCTTCCAGACTTG S G N N L T G K I P I T I C D H L P D L V A T I L L E R F Q S R Y V I I F Q T W - W Q Q S Y W K D S N H D M - S S S R L . . . . . . 13617 GAAGGACTTTACCTCGGCAGAAACTCCCTTGATGGAGTTATTCCACCAAACCTGGAGAAA E G L Y L G R N S L D G V I P P N L E K K D F T S A E T P L M E L F H Q T W R N G R T L P R Q K L P - W S Y S T K P G E . . . . . . 13677 TGCAGAAAGCTTCAAATATTGGAATTGACTGAAAATGAGATTGCTGGAACTGTACCAAGA C R K L Q I L E L T E N E I A G T V P R A E S F K Y W N - L K M R L L E L Y Q E M Q K A S N I G I D - K - D C W N C T K . . . . . . 
13737 GAGTTAGCCAACTTAACAACTCTTACAGGACTATATCTTATGGATCTGCATTTGGAAGGT E L A N L T T L T G L Y L M D L H L E G S - P T - Q L L Q D Y I L W I C I W K V R V S Q L N N S Y R T I S Y G S A F G R . 13797 A Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-16+_PGL-3_AGS-1_PPS_1 (13077 13796) (frame '1'; 720 bp, 240 residues) 1 GITCSSRHHR VTALDISSMQ LYGTIPPHLG NLSFISSLDI SNNTFHGELP LELVRLQRLK 61 FFNTKNNNFT GAIPSFLSLL PNLRFLYLSN NQFSGKIPSS LSNLTKLQVL SIQSNYIEGE 121 IPQELGDLRS LIILNLQYNQ LSGSIPSSIF DITTMQVIAL SGNNLTGKIP ITICDHLPDL 181 EGLYLGRNSL DGVIPPNLEK CRKLQILELT ENEIAGTVPR ELANLTTLTG LYLMDLHLEG 3-phase translation of AGS-1 (-strand): . . . . . . 13797 TACCTTCCAAATGCAGATCCATAAGATATAGTCCTGTAAGAGTTGTTAAGTTGGCTAACT Y L P N A D P - D I V L - E L L S W L T T F Q M Q I H K I - S C K S C - V G - L P S K C R S I R Y S P V R V V K L A N . . . . . . 13737 CTCTTGGTACAGTTCCAGCAATCTCATTTTCAGTCAATTCCAATATTTGAAGCTTTCTGC L L V Q F Q Q S H F Q S I P I F E A F C S W Y S S S N L I F S Q F Q Y L K L S A S L G T V P A I S F S V N S N I - S F L . . . . . . 13677 ATTTCTCCAGGTTTGGTGGAATAACTCCATCAAGGGAGTTTCTGCCGAGGTAAAGTCCTT I S P G L V E - L H Q G S F C R G K V L F L Q V W W N N S I K G V S A E V K S F H F S R F G G I T P S R E F L P R - S P . . . . . . 13617 CCAAGTCTGGAAGATGATCACATATCGTGATTGGAATCTTTCCAGTAAGATTGTTGCCAC P S L E D D H I S - L E S F Q - D C C H Q V W K M I T Y R D W N L S S K I V A T S K S G R - S H I V I G I F P V R L L P . . . . . . 13557 TAAGAGCAATTACTTGCATTGTAGTGATGTCAAAGATTGAAGATGGTATAGAGCCACTAA - E Q L L A L - - C Q R L K M V - S H - K S N Y L H C S D V K D - R W Y R A T K L R A I T C I V V M S K I E D G I E P L . . . . . . 13497 GCTGATTATATTGCAGGTTTAGGATAATCAAGGAACGAAGATCACCGAGTTCTTGAGGGA A D Y I A G L G - S R N E D H R V L E G L I I L Q V - D N Q G T K I T E F L R D S - L Y C R F R I I K E R R S P S S - G . . . . . . 
13437 TCTCTCCTTCAATATAATTACTCTGTATTGACAACACTTGCAGTTTTGTCAGATTGGAAA S L L Q Y N Y S V L T T L A V L S D W K L S F N I I T L Y - Q H L Q F C Q I G K I S P S I - L L C I D N T C S F V R L E . . . . . . 13377 GGGAGGATGGAATTTTACCCGAAAATTGGTTATTCGATAGGTACAGAAAGCGTAGGTTTG G R M E F Y P K I G Y S I G T E S V G L G G W N F T R K L V I R - V Q K A - V W R E D G I L P E N W L F D R Y R K R R F . . . . . . 13317 GTAACAAACTTAAAAATGATGGAATGGCTCCGGTGAAGTTATTGTTTTTAGTATTAAAGA V T N L K M M E W L R - S Y C F - Y - R - Q T - K - W N G S G E V I V F S I K E G N K L K N D G M A P V K L L F L V L K . . . . . . 13257 ATTTCAACCTCTGCAAACGAACCAACTCTAGTGGCAACTCTCCATGGAAAGTGTTGTTAC I S T S A N E P T L V A T L H G K C C Y F Q P L Q T N Q L - W Q L S M E S V V T N F N L C K R T N S S G N S P W K V L L . . . . . . 13197 TGATGTCAAGCGATGAAATAAATGAGAGGTTTCCAAGGTGTGGAGGAATGGTACCATAAA - C Q A M K - M R G F Q G V E E W Y H K D V K R - N K - E V S K V W R N G T I K L M S S D E I N E R F P R C G G M V P - . . . . . . 13137 GTTGCATGCTTGAAATGTCTAAAGCAGTGACTCGATGGTGACGGGAGCTGCAAGTGATTC V A C L K C L K Q - L D G D G S C K - F L H A - N V - S S D S M V T G A A S D S S C M L E M S K A V T R W - R E L Q V I . 13077 C Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-16-_PGL-3_AGS-1_PPS_1 (13420 13139) (frame '0'; 279 bp, 93 residues) 1 LLCIDNTCSF VRLEREDGIL PENWLFDRYR KRRFGNKLKN DGMAPVKLLF LVLKNFNLCK 61 RTNSSGNSPW KVLLLMSSDE INERFPRCGG MVP- >C06HBa0054K13.1-16-_PGL-3_AGS-1_PPS_2 (13739 13518) (frame '2'; 219 bp, 73 residues) 1 LSWYSSSNLI FSQFQYLKLS AFLQVWWNNS IKGVSAEVKS FQVWKMITYR DWNLSSKIVA 61 TKSNYLHCSD VKD- PGL 4 (+ strand): 14203 21700 AGS-1 (14203 14971) SCR (e 0.956) Exon 1 14203 14971 ( 769 n); score: 0.956 PGS (14203 14971) SGN-U345275+ 3-phase translation of AGS-1 (+strand): . . . . . . 14203 CTCAACTTGTGGGGGAATAATTTTGTCAGCGATTCAACATTGAGCTTCCTTGAATCATTG L N L W G N N F V S D S T L S F L E S L S T C G G I I L S A I Q H - A S L N H - Q L V G E - F C Q R F N I E L P - I I . . . 
. . . 14263 ACAAACTGTAGGAATCTAAGAGTACTCACGCTTGGTGGTAATCCGTTGGATGGTGTTTTG T N C R N L R V L T L G G N P L D G V L Q T V G I - E Y S R L V V I R W M V F C D K L - E S K S T H A W W - S V G W C F . . . . . . 14323 CCTGCATCTGTTGGTAATTTCTCAAACTCCTTGCAAATTTTTGAAGCATCTAAATGTAAA P A S V G N F S N S L Q I F E A S K C K L H L L V I S Q T P C K F L K H L N V N A C I C W - F L K L L A N F - S I - M - . . . . . . 14383 CTGAAGGGTGTCATTTCAAAACAAATTACTAATCTTACTGGATTGACAAGGATGAGTCTG L K G V I S K Q I T N L T G L T R M S L - R V S F Q N K L L I L L D - Q G - V C T E G C H F K T N Y - S Y W I D K D E S . . . . . . 14443 TCGAACAATCAGTTGATAGGTCATATTCCAAAAACAGTGCAAGGAATGCTGAACCTTCAA S N N Q L I G H I P K T V Q G M L N L Q R T I S - - V I F Q K Q C K E C - T F K V E Q S V D R S Y S K N S A R N A E P S . . . . . . 14503 GAACTTTACCTAGGAAGCAACAAGTTAGAAGGAGCCATACCAGATGTTATCTGCAGTTTA E L Y L G S N K L E G A I P D V I C S L N F T - E A T S - K E P Y Q M L S A V Y R T L P R K Q Q V R R S H T R C Y L Q F . . . . . . 14563 CAGTATCTTGGTGCATTAGAATTGTCAGAAAATCAATTTTCTAGTTCCGTTCCACCATGC Q Y L G A L E L S E N Q F S S S V P P C S I L V H - N C Q K I N F L V P F H H A T V S W C I R I V R K S I F - F R S T M . . . . . . 14623 TTAGGGAATGTTACTAGTTTGAGGACACTCTATCTAGATAACAACAAGCTGGATTCTAGA L G N V T S L R T L Y L D N N K L D S R - G M L L V - G H S I - I T T S W I L D L R E C Y - F E D T L S R - Q Q A G F - . . . . . . 14683 TTACCTGCAAGATTGGGGGGACTTCAAAACATCATAGAGTTCAATATTTCATCCAATTAT L P A R L G G L Q N I I E F N I S S N Y Y L Q D W G D F K T S - S S I F H P I I I T C K I G G T S K H H R V Q Y F I Q L . . . . . . 14743 TTGAGTGGAGAAATTCCGCTAGAGAGCGGAAACTTGAAGGGTGCAACACTGATTGATCTG L S G E I P L E S G N L K G A T L I D L - V E K F R - R A E T - R V Q H - L I C F E W R N S A R E R K L E G C N T D - S . . . . . . 14803 TCAAATAATTATTTTTCTGGTAAGATTCCTAGTACTCTAGGGGGCCTAGATAAATTAATT S N N Y F S G K I P S T L G G L D K L I Q I I I F L V R F L V L - G A - I N - F V K - L F F W - D S - Y S R G P R - I N . . . . . . 
14863 TATCTTTCTCTAGCACATAATAGATTAGAAGGGCCTATTCCTGAATCATTTGACAAATTG Y L S L A H N R L E G P I P E S F D K L I F L - H I I D - K G L F L N H L T N C L S F S S T - - I R R A Y S - I I - Q I . . . . . 14923 TTGGCATTGGAATACTTGGATTTGTCCTATAACAATCTTAGCGGTGAAA L A L E Y L D L S Y N N L S G E W H W N T W I C P I T I L A V K V G I G I L G F V L - Q S - R - Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-16+_PGL-4_AGS-1_PPS_1 (14203 14970) (frame '1'; 768 bp, 256 residues) 1 LNLWGNNFVS DSTLSFLESL TNCRNLRVLT LGGNPLDGVL PASVGNFSNS LQIFEASKCK 61 LKGVISKQIT NLTGLTRMSL SNNQLIGHIP KTVQGMLNLQ ELYLGSNKLE GAIPDVICSL 121 QYLGALELSE NQFSSSVPPC LGNVTSLRTL YLDNNKLDSR LPARLGGLQN IIEFNISSNY 181 LSGEIPLESG NLKGATLIDL SNNYFSGKIP STLGGLDKLI YLSLAHNRLE GPIPESFDKL 241 LALEYLDLSY NNLSGE 3-phase translation of AGS-1 (-strand): . . . . . . 14971 TTTCACCGCTAAGATTGTTATAGGACAAATCCAAGTATTCCAATGCCAACAATTTGTCAA F H R - D C Y R T N P S I P M P T I C Q F T A K I V I G Q I Q V F Q C Q Q F V K S P L R L L - D K S K Y S N A N N L S . . . . . . 14911 ATGATTCAGGAATAGGCCCTTCTAATCTATTATGTGCTAGAGAAAGATAAATTAATTTAT M I Q E - A L L I Y Y V L E K D K L I Y - F R N R P F - S I M C - R K I N - F I N D S G I G P S N L L C A R E R - I N L . . . . . . 14851 CTAGGCCCCCTAGAGTACTAGGAATCTTACCAGAAAAATAATTATTTGACAGATCAATCA L G P L E Y - E S Y Q K N N Y L T D Q S - A P - S T R N L T R K I I I - Q I N Q S R P P R V L G I L P E K - L F D R S I . . . . . . 14791 GTGTTGCACCCTTCAAGTTTCCGCTCTCTAGCGGAATTTCTCCACTCAAATAATTGGATG V L H P S S F R S L A E F L H S N N W M C C T L Q V S A L - R N F S T Q I I G - S V A P F K F P L S S G I S P L K - L D . . . . . . 14731 AAATATTGAACTCTATGATGTTTTGAAGTCCCCCCAATCTTGCAGGTAATCTAGAATCCA K Y - T L - C F E V P P I L Q V I - N P N I E L Y D V L K S P Q S C R - S R I Q E I L N S M M F - S P P N L A G N L E S . . . . . . 
14671 GCTTGTTGTTATCTAGATAGAGTGTCCTCAAACTAGTAACATTCCCTAAGCATGGTGGAA A C C Y L D R V S S N - - H S L S M V E L V V I - I E C P Q T S N I P - A W W N S L L L S R - S V L K L V T F P K H G G . . . . . . 14611 CGGAACTAGAAAATTGATTTTCTGACAATTCTAATGCACCAAGATACTGTAAACTGCAGA R N - K I D F L T I L M H Q D T V N C R G T R K L I F - Q F - C T K I L - T A D T E L E N - F S D N S N A P R Y C K L Q . . . . . . 14551 TAACATCTGGTATGGCTCCTTCTAACTTGTTGCTTCCTAGGTAAAGTTCTTGAAGGTTCA - H L V W L L L T C C F L G K V L E G S N I W Y G S F - L V A S - V K F L K V Q I T S G M A P S N L L L P R - S S - R F . . . . . . 14491 GCATTCCTTGCACTGTTTTTGGAATATGACCTATCAACTGATTGTTCGACAGACTCATCC A F L A L F L E Y D L S T D C S T D S S H S L H C F W N M T Y Q L I V R Q T H P S I P C T V F G I - P I N - L F D R L I . . . . . . 14431 TTGTCAATCCAGTAAGATTAGTAATTTGTTTTGAAATGACACCCTTCAGTTTACATTTAG L S I Q - D - - F V L K - H P S V Y I - C Q S S K I S N L F - N D T L Q F T F R L V N P V R L V I C F E M T P F S L H L . . . . . . 14371 ATGCTTCAAAAATTTGCAAGGAGTTTGAGAAATTACCAACAGATGCAGGCAAAACACCAT M L Q K F A R S L R N Y Q Q M Q A K H H C F K N L Q G V - E I T N R C R Q N T I D A S K I C K E F E K L P T D A G K T P . . . . . . 14311 CCAACGGATTACCACCAAGCGTGAGTACTCTTAGATTCCTACAGTTTGTCAATGATTCAA P T D Y H Q A - V L L D S Y S L S M I Q Q R I T T K R E Y S - I P T V C Q - F K S N G L P P S V S T L R F L Q F V N D S . . . . . 
14251 GGAAGCTCAATGTTGAATCGCTGACAAAATTATTCCCCCACAAGTTGAG G S S M L N R - Q N Y S P T S - E A Q C - I A D K I I P P Q V E R K L N V E S L T K L F P H K L Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-16-_PGL-4_AGS-1_PPS_1 (14450 14205) (frame '0'; 246 bp, 82 residues) 1 LFDRLILVNP VRLVICFEMT PFSLHLDASK ICKEFEKLPT DAGKTPSNGL PPSVSTLRFL 61 QFVNDSRKLN VESLTKLFPH KL AGS-2 (14234 14264,15640 15668,16001 16005,17330 17426,17563 17774) SCR (e 0.742 d 0.000 a 0.999,e 0.621 d 0.000 a 0.716,e 0.600 d 0.900 a 0.000,e 0.794 d 0.000 a 0.000,e 0.929) Exon 1 14234 14264 ( 31 n); score: 0.742 Intron 1 14265 15639 (1375 n); Pd: 0.000 Pa: 0.999 Exon 2 15640 15668 ( 29 n); score: 0.621 Intron 2 15669 16000 ( 332 n); Pd: 0.000 Pa: 0.716 Exon 3 16001 16005 ( 5 n); score: 0.600 Intron 3 16006 17329 (1324 n); Pd: 0.900 Pa: 0.000 Exon 4 17330 17426 ( 97 n); score: 0.794 Intron 4 17427 17562 ( 136 n); Pd: 0.000 Pa: 0.000 Exon 5 17563 17774 ( 212 n); score: 0.929 PGS (14234 14264,15640 15668,16001 16005,17330 17426,17563 17774) SGN-U313613+ 3-phase translation of AGS-2 (+strand): . . . . : . . : 14234 ATTCAACATTGAGCTTCCTTGAATCATTGAC : AGATTGGATGTAATGATAGATGTTGCATC : I Q H - A S L N H - : Q I G C N D R C C I : F N I E L P - I I D : R L D V M I D V A S : S T L S F L E S L T : D W M - - - M L H : . : . . . . . 16001 TTTTG : TGTTGGGTTTTATTTTCCCTAAATTTCTTATCATAAATAGGTTTTCCTTTAAGGG F C : V G F Y F P - I S Y H K - V F L - G F : V L G F I F P K F L I I N R F S F K G L L : C W V L F S L N F L S - I G F P L R . . . . . : . 17385 GAAGGTTTTGATTGACTAATCATTTTCTTGTAGGAAAAGGTT : CCCATGTATTCCGAGTGA E G F D - L I I F L - E K V : P M Y S E - K V L I D - S F S C R K R F : P C I P S E G R F - L T N H F L V G K G : S H V F R V . . . . . . 17581 ATTGGTTGAGGTTGTTTCTCTCTGTATTTTGTACTCTCATATTTATAGTGGATTGCTCAT I G - G C F S L Y F V L S Y L - W I A H L V E V V S L C I L Y S H I Y S G L L I N W L R L F L S V F C T L I F I V D C S . . . . . . 
17641 CTCCTTTGTGGACGTAGGTCGATTGACCGAACCACATTAAATCTTTGTGTGTTTTGGTAT L L C G R R S I D R T T L N L C V F W Y S F V D V G R L T E P H - I F V C F G I S P L W T - V D - P N H I K S L C V L V . . . . . . 17701 ATTTCTCGTTGTCTTCTTACTCGTGGTCTTTTGAGGTTTGCTTTGCTAGCTTCCGCGTTT I S R C L L T R G L L R F A L L A S A F F L V V F L L V V F - G L L C - L P R L Y F S L S S Y S W S F E V C F A S F R V . . 17761 ACACCTGCTGATTT T P A D H L L I Y T C - F Maximal non-overlapping open reading frames (>= 64 codons): none AGS-3 (15647 15856,15955 16304,16804 16850,17124 17130,17284 17301,17629 17638,18160 18194,19534 19632) SCR (e 0.829 d 0.999 a 0.805,e 0.811 d 0.694 a 0.000,e 0.574 d 0.000 a 0.778,e 0.857 d 0.900 a 1.000,e 0.500 d 0.000 a 0.915,e 0.600 d 0.900 a 0.000,e 0.686 d 0.000 a 0.000,e 0.737) Exon 1 15647 15856 ( 210 n); score: 0.829 Intron 1 15857 15954 ( 98 n); Pd: 0.999 Pa: 0.805 Exon 2 15955 16304 ( 350 n); score: 0.811 Intron 2 16305 16803 ( 499 n); Pd: 0.694 Pa: 0.000 Exon 3 16804 16850 ( 47 n); score: 0.574 Intron 3 16851 17123 ( 273 n); Pd: 0.000 Pa: 0.778 Exon 4 17124 17130 ( 7 n); score: 0.857 Intron 4 17131 17283 ( 153 n); Pd: 0.900 Pa: 1.000 Exon 5 17284 17301 ( 18 n); score: 0.500 Intron 5 17302 17628 ( 327 n); Pd: 0.000 Pa: 0.915 Exon 6 17629 17638 ( 10 n); score: 0.600 Intron 6 17639 18159 ( 521 n); Pd: 0.900 Pa: 0.000 Exon 7 18160 18194 ( 35 n); score: 0.686 Intron 7 18195 19533 (1339 n); Pd: 0.000 Pa: 0.000 Exon 8 19534 19632 ( 99 n); score: 0.737 PGS (15647 15856,15955 16304,16804 16850,17124 17130,17284 17301,17629 17638,18160 18194,19534 19632) SGN-U322835+ 3-phase translation of AGS-3 (+strand): . . . . . . 15647 ATGTAATGATAGATGTTGCATCTGCAATGAACTATCTCCACAATGGCTATTCAACGCCTG M - - - M L H L Q - T I S T M A I Q R L C N D R C C I C N E L S P Q W L F N A C V M I D V A S A M N Y L H N G Y S T P . . . . . . 
15707 TAGTGCATTGTGACTTGAAACCAAGTAATGTCTTGTTAGATGAAGAAATGGTTGCTCATG - C I V T - N Q V M S C - M K K W L L M S A L - L E T K - C L V R - R N G C S C V V H C D L K P S N V L L D E E M V A H . . . . . . 15767 TAAGTGATTTTGGCATTGCAAAAATGTTAGGTGCAGGGGAGGCTTTTGTTCAAACAAGGA - V I L A L Q K C - V Q G R L L F K Q G K - F W H C K N V R C R G G F C S N K D V S D F G I A K M L G A G E A F V Q T R . . . : . . . 15827 CAGTTGCAACCATTGGATATATTGCTCCAG : AGTATGGACAAGATGGAATAGTATCCACGA Q L Q P L D I L L Q : S M D K M E - Y P R S C N H W I Y C S R : V W T R W N S I H E T V A T I G Y I A P : E Y G Q D G I V S T . . . . . . 15985 GTTGTGATGTTTATAGTTTTGGTATCCTGATGATGGAGACGTTCACACGAACAAGACCAA V V M F I V L V S - - W R R S H E Q D Q L - C L - F W Y P D D G D V H T N K T K S C D V Y S F G I L M M E T F T R T R P . . . . . . 16045 GTGATGACATATTTACTGGAGACTTGAGCATACAAAGCTGGATTAGTGATTCCTTTCCGG V M T Y L L E T - A Y K A G L V I P F R - - H I Y W R L E H T K L D - - F L S G S D D I F T G D L S I Q S W I S D S F P . . . . . . 16105 GTGAACTTCACAAGGTGGTGGATTCTAATTTGGTACAGCCCGGAGATGAACAAATCGCTG V N F T R W W I L I W Y S P E M N K S L - T S Q G G G F - F G T A R R - T N R C G E L H K V V D S N L V Q P G D E Q I A . . . . . . 16165 CAAAGATGCAATGTTTGTCATCTGTCATGGAATTAGCTTTGAAGTGCACTTTAGTGAGAC Q R C N V C H L S W N - L - S A L - - D K D A M F V I C H G I S F E V H F S E T A K M Q C L S S V M E L A L K C T L V R . . . . . . 16225 CTGATGCAAGAATTAGCATGAAGGATGCTCTTTCAACACTCAAAAAGATGAGGCTACAGC L M Q E L A - R M L F Q H S K R - G Y S - C K N - H E G C S F N T Q K D E A T A P D A R I S M K D A L S T L K K M R L Q . . : . . . . 16285 TTGTTAGTAGTCGGCATTAG : AAAATGCATACTGATTTAACTGCATCTAAAGAAGAAGTTG L L V V G I R : K C I L I - L H L K K K L C - - S A L : E N A Y - F N C I - R R S W L V S S R H - : K M H T D L T A S K E E V . : . : . . : . : . 
16844 GTTGATT : ATGGCTA : GAAATAGGAGTAGACGAT : TGGATTGCTC : TATCAATTGTTTTCACAG V D : Y G - : K - E - T I : G L L : Y Q L F S Q L I : M A : R N R S R R : L D C S : I N C F H R G - L : W L : E I G V D D : W I A : L S I V F T . . : . . . . 18178 ATATAGACATCTGAAAT : CTGCAACCTTGATTTCAAGGTCCTGGTGTTGGAATACATGCCC I - T S E I : C N L D F K V L V L E Y M P Y R H L K : S A T L I S R S W C W N T C P D I D I - N : L Q P - F Q G P G V G I H A . . . . . . 19577 AATGGGACACTTGATAAATGGTTATATTCTCACAACTTGTTCTTAAACTTATTGCA N G T L D K W L Y S H N L F L N L L M G H L I N G Y I L T T C S - T Y C Q W D T - - M V I F S Q L V L K L I A Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-16+_PGL-4_AGS-3_PPS_1 (15649 15856,15955 16304) (frame '0'; 555 bp, 185 residues) 1 VMIDVASAMN YLHNGYSTPV VHCDLKPSNV LLDEEMVAHV SDFGIAKMLG AGEAFVQTRT 61 VATIGYIAPE YGQDGIVSTS CDVYSFGILM METFTRTRPS DDIFTGDLSI QSWISDSFPG 121 ELHKVVDSNL VQPGDEQIAA KMQCLSSVME LALKCTLVRP DARISMKDAL STLKKMRLQL 181 VSSRH- AGS-4 (17022 17052,17124 17130,17284 17290,20602 20607,21159 21254,21334 21700) SCR (e 0.774 d 0.000 a 0.778,e 0.571 d 0.900 a 1.000,e 0.714 d 0.000 a 0.501,e 0.500 d 0.975 a 0.000,e 0.719 d 0.000 a 0.000,e 0.884) Exon 1 17022 17052 ( 31 n); score: 0.774 Intron 1 17053 17123 ( 71 n); Pd: 0.000 Pa: 0.778 Exon 2 17124 17130 ( 7 n); score: 0.571 Intron 2 17131 17283 ( 153 n); Pd: 0.900 Pa: 1.000 Exon 3 17284 17290 ( 7 n); score: 0.714 Intron 3 17291 20601 (3311 n); Pd: 0.000 Pa: 0.501 Exon 4 20602 20607 ( 6 n); score: 0.500 Intron 4 20608 21158 ( 551 n); Pd: 0.975 Pa: 0.000 Exon 5 21159 21254 ( 96 n); score: 0.719 Intron 5 21255 21333 ( 79 n); Pd: 0.000 Pa: 0.000 Exon 6 21334 21700 ( 367 n); score: 0.884 PGS (17022 17052,17124 17130,17284 17290,20602 20607,21159 21254,21334 21700) SGN-U322569+ 3-phase translation of AGS-4 (+strand): . . . . : : . : . 
: 17022 TGAGTTTTGTGCTTAGACTGAATATTTTGTG : ATGGCTA : GAAATAG : CCGCGG : TTATAGTTA - V L C L D - I F C : D G - : K - : P R : L - L E F C A - T E Y F V : M A : R N S : R G : Y S Y S F V L R L N I L - : W L : E I : A A : V I V . . . . . . 21168 TATATAAAAGGGAAAAAGGTCTGATATACCCCTCAACTTTGTCATTTGGAGCTCATATAC Y I K G K K V - Y T P Q L C H L E L I Y I - K G K R S D I P L N F V I W S S Y T I Y K R E K G L I Y P S T L S F G A H I . . . : . . . 21228 CCCTCGTTATAAAAGTGGCTCATATAT : GTTAAAAAATTTATTTTAAATTTATATTTATTA P S L - K W L I Y : V K K F I L N L Y L L P R Y K S G S Y M : L K N L F - I Y I Y Y P L V I K V A H I : C - K I Y F K F I F I . . . . . . 21367 CTTTTAATTTTTTTTAAAAAAAATTATTTAGAGATATATATGATTCTTCTATCAAAGTTC L L I F F K K N Y L E I Y M I L L S K F F - F F L K K I I - R Y I - F F Y Q S S T F N F F - K K L F R D I Y D S S I K V . . . . . . 21427 AATGTATATTTTAATTTTTTTCATACATAAATTATTTTTTGACTTCATTTATTATAATTA N V Y F N F F H T - I I F - L H L L - L M Y I L I F F I H K L F F D F I Y Y N Y Q C I F - F F S Y I N Y F L T S F I I I . . . . . . 21487 TTTGAGTTTCTTATTCTTATTTTGTTTTTTTCTTTCATTCCTTAGTTTAAATAAAAAAAT F E F L I L I L F F S F I P - F K - K N L S F L F L F C F F L S F L S L N K K I I - V S Y S Y F V F F F H S L V - I K K . . . . . . 21547 TAAACTATTTTTTTACTGTGTATTGTAATTTAATTTCGTATTCGAAGAAAAAATTTGGTC - T I F L L C I V I - F R I R R K N L V K L F F Y C V L - F N F V F E E K I W S L N Y F F T V Y C N L I S Y S K K K F G . . . . . . 21607 ATCTACAATAAGTTTTACAAGAATATTAGTGAAACATAAATAAATTTGATTATCAAAATA I Y N K F Y K N I S E T - I N L I I K I S T I S F T R I L V K H K - I - L S K - H L Q - V L Q E Y - - N I N K F D Y Q N . . . . 
21667 ATAATTATAAATTAGTCATTGAAATACAAAAAAA I I I N - S L K Y K K - L - I S H - N T K K N N Y K L V I E I Q K Maximal non-overlapping open reading frames (>= 64 codons): none PGL 5 (- strand): 23156 17330 AGS-1 (17995 17985,17794 17330) SCR (e 0.909 d 0.908 a 0.000,e 0.911) Exon 1 17995 17985 ( 11 n); score: 0.909 Intron 1 17984 17795 ( 190 n); Pd: 0.908 Pa: 0.000 Exon 2 17794 17330 ( 465 n); score: 0.911 PGS (17780 17330) SGN-U313612+ PGS (17786 17371) SGN-U313614- PGS (17995 17985,17794 17517) SGN-U334483+ 3-phase translation of AGS-1 (-strand): . . : . . . . 17995 AATGACTCTAA : CATCATTTTGTTAGGACCGAAAATCAGCAGGTGTAAACGCGGAAGCTAG N D S N : I I L L G P K I S R C K R G S - M T L : T S F C - D R K S A G V N A E A S - L - : H H F V R T E N Q Q V - T R K L . . . . . . 17745 CAAAGCAAACCTCAAAAGACCACGAGTAAGAAGACAACGAGAAATATACCAAAACACACA Q S K P Q K T T S K K T T R N I P K H T K A N L K R P R V R R Q R E I Y Q N T Q A K Q T S K D H E - E D N E K Y T K T H . . . . . . 17685 AAGATTTAATGTGGTTCGGTCAATCGACCTACGTCCACAAAGGAGATGAGCAATCCACTA K I - C G S V N R P T S T K E M S N P L R F N V V R S I D L R P Q R R - A I H Y K D L M W F G Q S T Y V H K G D E Q S T . . . . . . 17625 TAAATATGAGAGTACAAAATACAGAGAGAAACAACCTCAACCAATTCACTCGGAATACAT - I - E Y K I Q R E T T S T N S L G I H K Y E S T K Y R E K Q P Q P I H S E Y M I N M R V Q N T E R N N L N Q F T R N T . . . . . . 17565 GGGAGGTTCACACAAGTGATAATGTATCCAACTTGTGACCCATAAATTCTCTCCCTAACC G R F T Q V I M Y P T C D P - I L S L T G G S H K - - C I Q L V T H K F S P - P W E V H T S D N V S N L - P I N S L P N . . . . . . 17505 AAAACTCTCAAAGCTCTTAAGACTACATTGTGAATGCTGACTAAGTTAGAAGGAACATTT K T L K A L K T T L - M L T K L E G T F K L S K L L R L H C E C - L S - K E H F Q N S Q S S - D Y I V N A D - V R R N I . . . . . . 17445 CTCTATTTATAGAGTCCTAAACCTTTTCCTACAAGAAAATGATTAGTCAATCAAAACCTT L Y L - S P K P F P T R K - L V N Q N L S I Y R V L N L F L Q E N D - S I K T F S L F I E S - T F S Y K K M I S Q S K P . . . . . . 
17385 CCCCTTAAAGGAAAACCTATTTATGATAAGAAATTTAGGGAAAATAAAACCCAACA P L K G K P I Y D K K F R E N K T Q P L K E N L F M I R N L G K I K P N S P - R K T Y L - - E I - G K - N P T Maximal non-overlapping open reading frames (>= 64 codons): none AGS-2 (20786 20741,20379 20354,18443 18430,17780 17670) SCR (e 0.717 d 0.000 a 0.000,e 0.692 d 0.312 a 0.908,e 0.643 d 0.123 a 0.978,e 0.946) Exon 1 20786 20741 ( 46 n); score: 0.717 Intron 1 20740 20380 ( 361 n); Pd: 0.000 Pa: 0.000 Exon 2 20379 20354 ( 26 n); score: 0.692 Intron 2 20353 18444 (1910 n); Pd: 0.312 Pa: 0.908 Exon 3 18443 18430 ( 14 n); score: 0.643 Intron 3 18429 17781 ( 649 n); Pd: 0.123 Pa: 0.978 Exon 4 17780 17670 ( 111 n); score: 0.946 PGS (20786 20741,20379 20354,18443 18430,17780 17670) SGN-U313611- 3-phase translation of AGS-2 (-strand): . . . . . : . 20786 ACTAGTTCAAATAAGACAATTTTGTGTTGGTTATAAATAGTTTATA : TCAAGGTGTGTTTC T S S N K T I L C W L - I V Y : I K V C F L V Q I R Q F C V G Y K - F I : S R C V S - F K - D N F V L V I N S L Y : Q G V F . . : . : . . . 20365 ACTAATCATTAG : ATATGTGATTGCAG : GACCGAAAATCAGCAGGTGTAAACGCGGAAGCTA T N H - : I C D C R : T E N Q Q V - T R K L L I I R : Y V I A : G P K I S R C K R G S - H - S L : D M - L Q : D R K S A G V N A E A . . . . . . 17746 GCAAAGCAAACCTCAAAAGACCACGAGTAAGAAGACAACGAGAAATATACCAAAACACAC A K Q T S K D H E - E D N E K Y T K T H Q S K P Q K T T S K K T T R N I P K H T S K A N L K R P R V R R Q R E I Y Q N T . . 
17686 AAAGATTTAATGTGGTT K D L M W K I - C G Q R F N V V Maximal non-overlapping open reading frames (>= 64 codons): none AGS-3 (22690 22656,19102 19083,18443 18430,18344 18311,17780 17692) SCR (e 0.714 d 0.000 a 0.000,e 0.600 d 0.968 a 0.908,e 0.643 d 0.123 a 0.818,e 0.706 d 0.000 a 0.978,e 0.910) Exon 1 22690 22656 ( 35 n); score: 0.714 Intron 1 22655 19103 (3553 n); Pd: 0.000 Pa: 0.000 Exon 2 19102 19083 ( 20 n); score: 0.600 Intron 2 19082 18444 ( 639 n); Pd: 0.968 Pa: 0.908 Exon 3 18443 18430 ( 14 n); score: 0.643 Intron 3 18429 18345 ( 85 n); Pd: 0.123 Pa: 0.818 Exon 4 18344 18311 ( 34 n); score: 0.706 Intron 4 18310 17781 ( 530 n); Pd: 0.000 Pa: 0.978 Exon 5 17780 17692 ( 89 n); score: 0.910 PGS (22690 22656,19102 19083,18443 18430,18344 18311,17780 17692) SGN-U341191- 3-phase translation of AGS-3 (-strand): . . . . : . . : 22690 GTGCACAAAGCGTAGGTTTGGTAACAAACTTAAAA : GGGTGATACCAGTACATACG : ATATG V H K A - V W - Q T - K : G D T S T Y : D M C T K R R F G N K L K : R V I P V H T : I C A Q S V G L V T N L K : G - Y Q Y I R : Y . : . . . . : . 18438 TGATTGCAG : CAGAGAATTGATAAACAAAGTCACACACATGTAG : GACCGAAAATCAGCAGG - L Q : Q R I D K Q S H T H V : G P K I S R D C S : R E L I N K V T H M - : D R K S A G V I A : A E N - - T K S H T C R : T E N Q Q . . . . . . 17763 TGTAAACGCGGAAGCTAGCAAAGCAAACCTCAAAAGACCACGAGTAAGAAGACAACGAGA C K R G S - Q S K P Q K T T S K K T T R V N A E A S K A N L K R P R V R R Q R E V - T R K L A K Q T S K D H E - E D N E . . 17703 AATATACCAAAA N I P K I Y Q K Y T K Maximal non-overlapping open reading frames (>= 64 codons): none AGS-4 (21115 21079,20265 20125) SCR (e 0.757 d 0.696 a 0.000,e 0.823) Exon 1 21115 21079 ( 37 n); score: 0.757 Intron 1 21078 20266 ( 813 n); Pd: 0.696 Pa: 0.000 Exon 2 20265 20125 ( 141 n); score: 0.823 PGS (21115 21079,20265 20125) SGN-U328267- 3-phase translation of AGS-4 (-strand): . . . . : . . 
21115 TTTACTTGAACAAGAAGTTAAAGGAAGAAAATAAAAG : GGAAAATGCACAGGTACCCCCTC F T - T R S - R K K I K : G K M H R Y P L L L E Q E V K G R K - K : G K C T G T P S Y L N K K L K E E N K R : E N A Q V P P . . . . . . 20242 AACCTATGTCCGAAATTTCAGAGACACACTTATACAACACTAAGGTCCTATTACCCCCTC N L C P K F Q R H T Y T T L R S Y Y P L T Y V R N F R D T L I Q H - G P I T P S Q P M S E I S E T H L Y N T K V L L P P . . . . . . 20182 AACTTATTTTATAAGTAATTTTCTATCCCTTTTCGACCTATCGGACATAGGTTGAGGG N L F Y K - F S I P F R P I G H R L R T Y F I S N F L S L F D L S D I G - G Q L I L - V I F Y P F S T Y R T - V E Maximal non-overlapping open reading frames (>= 64 codons): none AGS-5 (21624 21163) SCR (e 0.918) Exon 1 21624 21163 ( 462 n); score: 0.918 PGS (21617 21163) SGN-U344226+ PGS (21624 21169) SGN-U335137- 3-phase translation of AGS-5 (-strand): . . . . . . 21624 GTAAAACTTATTGTAGATGACCAAATTTTTTCTTCGAATACGAAATTAAATTACAATACA V K L I V D D Q I F S S N T K L N Y N T - N L L - M T K F F L R I R N - I T I H K T Y C R - P N F F F E Y E I K L Q Y . . . . . . 21564 CAGTAAAAAAATAGTTTAATTTTTTTATTTAAACTAAGGAATGAAAGAAAAAAACAAAAT Q - K N S L I F L F K L R N E R K K Q N S K K I V - F F Y L N - G M K E K N K I T V K K - F N F F I - T K E - K K K T K . . . . . . 21504 AAGAATAAGAAACTCAAATAATTATAATAAATGAAGTCAAAAAATAATTTATGTATGAAA K N K K L K - L - - M K S K N N L C M K R I R N S N N Y N K - S Q K I I Y V - K - E - E T Q I I I I N E V K K - F M Y E . . . . . . 21444 AAAATTAAAATATACATTGAACTTTGATAGAAGAATCATATATATCTCTAAATAATTTTT K I K I Y I E L - - K N H I Y L - I I F K L K Y T L N F D R R I I Y I S K - F F K N - N I H - T L I E E S Y I S L N N F . . . . . . 21384 TTTAAAAAAAATTAAAAGTAATAAATATAAATTTAAAATAAATTTTTTAACTTCCGTTAA F K K N - K - - I - I - N K F F N F R - L K K I K S N K Y K F K I N F L T S V K F - K K L K V I N I N L K - I F - L P L . . . . . . 
21324 ATGAAGGGTATATGTGAGTCATTTTATAACAGCAGGGGTAAATGTGAGCCGTTTGTATAA M K G I C E S F Y N S R G K C E P F V - - R V Y V S H F I T A G V N V S R L Y N N E G Y M - V I L - Q Q G - M - A V C I . . . . . . 21264 CGGTAAGGGCATATATGAGCCACTTTTATAACGAGGGGTATATGAGCTCCAAATGACAAA R - G H I - A T F I T R G I - A P N D K G K G I Y E P L L - R G V Y E L Q M T K T V R A Y M S H F Y N E G Y M S S K - Q . . . . . 21204 GTTGAGGGGTATATCAGACCTTTTTCCCTTTTATATATAACT V E G Y I R P F S L L Y I T L R G I S D L F P F Y I - S - G V Y Q T F F P F I Y N Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-5 (+strand): . . . . . . 21163 AGTTATATATAAAAGGGAAAAAGGTCTGATATACCCCTCAACTTTGTCATTTGGAGCTCA S Y I - K G K R S D I P L N F V I W S S V I Y K R E K G L I Y P S T L S F G A H L Y I K G K K V - Y T P Q L C H L E L . . . . . . 21223 TATACCCCTCGTTATAAAAGTGGCTCATATATGCCCTTACCGTTATACAAACGGCTCACA Y T P R Y K S G S Y M P L P L Y K R L T I P L V I K V A H I C P Y R Y T N G S H I Y P S L - K W L I Y A L T V I Q T A H . . . . . . 21283 TTTACCCCTGCTGTTATAAAATGACTCACATATACCCTTCATTTAACGGAAGTTAAAAAA F T P A V I K - L T Y T L H L T E V K K L P L L L - N D S H I P F I - R K L K N I Y P C C Y K M T H I Y P S F N G S - K . . . . . . 21343 TTTATTTTAAATTTATATTTATTACTTTTAATTTTTTTTAAAAAAAATTATTTAGAGATA F I L N L Y L L L L I F F K K N Y L E I L F - I Y I Y Y F - F F L K K I I - R Y I Y F K F I F I T F N F F - K K L F R D . . . . . . 21403 TATATGATTCTTCTATCAAAGTTCAATGTATATTTTAATTTTTTTCATACATAAATTATT Y M I L L S K F N V Y F N F F H T - I I I - F F Y Q S S M Y I L I F F I H K L F I Y D S S I K V Q C I F - F F S Y I N Y . . . . . . 21463 TTTTGACTTCATTTATTATAATTATTTGAGTTTCTTATTCTTATTTTGTTTTTTTCTTTC F - L H L L - L F E F L I L I L F F S F F D F I Y Y N Y L S F L F L F C F F L S F L T S F I I I I - V S Y S Y F V F F F . . . . . . 
21523 ATTCCTTAGTTTAAATAAAAAAATTAAACTATTTTTTTACTGTGTATTGTAATTTAATTT I P - F K - K N - T I F L L C I V I - F F L S L N K K I K L F F Y C V L - F N F H S L V - I K K L N Y F F T V Y C N L I . . . . . 21583 CGTATTCGAAGAAAAAATTTGGTCATCTACAATAAGTTTTAC R I R R K N L V I Y N K F Y V F E E K I W S S T I S F S Y S K K K F G H L Q - V L Maximal non-overlapping open reading frames (>= 64 codons): none AGS-6 (23156 22454) SCR (e 0.713) Exon 1 23156 22454 ( 703 n); score: 0.713 PGS (23156 22454) SGN-U345542- 3-phase translation of AGS-6 (-strand): . . . . . . 23156 TCATACTACCTTCCAAATGCAATTCTGAAAGATATAAATATGTAAGAGCTGTTAAGTTGG S Y Y L P N A I L K D I N M - E L L S W H T T F Q M Q F - K I - I C K S C - V G I L P S K C N S E R Y K Y V R A V K L . . . . . . 23096 CTAGCTCTCTTGGTAAAGTTCCAATAAACTTATTTTCACCCAATTGCAATTTTTGAAGCT L A L L V K F Q - T Y F H P I A I F E A - L S W - S S N K L I F T Q L Q F L K L A S S L G K V P I N L F S P N C N F - S . . . . . . 23036 TTCTGCATTTCTCCAGGTTTGGTGGAATAACTCCATCTAGGGAGTTTTTACTGAGGTAAA F C I S P G L V E - L H L G S F Y - G K S A F L Q V W W N N S I - G V F T E V K F L H F S R F G G I T P S R E F L L R - . . . . . . 22976 GTCCTTCCAAGTCTCGAAGATGATCACATATCGTTTTTGGAAGATTTCCAGTAAGATTGT V L P S L E D D H I S F L E D F Q - D C S F Q V S K M I T Y R F W K I S S K I V S P S K S R R - S H I V F G R F P V R L . . . . . . 22916 TGCCCGTAAGAGCAATCACATGCATTGTAGTAATGTTAAAAATGGATGGTGGTATAGAGC C P - E Q S H A L - - C - K W M V V - S A R K S N H M H C S N V K N G W W Y R A L P V R A I T C I V V M L K M D G G I E . . . . . . 22856 CACTAAGCTGATTAATTTGCAGGTCTAGGATAGTCAAGTAACGAAGATCACCGATTTCTC H - A D - F A G L G - S S N E D H R F L T K L I N L Q V - D S Q V T K I T D F S P L S - L I C R S R I V K - R R S P I S . . . . . . 22796 GAGGGATCTCTCCTTCAAGAAAATTTCTATCCAAGTATAACCTTTGCAGCTTTGTTATAT E G S L L Q E N F Y P S I T F A A L L Y R D L S F K K I S I Q V - P L Q L C Y I R G I S P S R K F L S K Y N L C S F V I . . . . . . 
22736 TGGAAAGGGAGGATGGAATTTTCCCAGAAAATTGGTTGCTTGATAGGTGCACAAAGCGTA W K G R M E F S Q K I G C L I G A Q S V G K G G W N F P R K L V A - - V H K A - L E R E D G I F P E N W L L D R C T K R . . . . . . 22676 GGTTTGGTAACAAACTTAAAAATGATGGAATGGCTCCGGTGAAGTTATTGCTTGTGACAT G L V T N L K M M E W L R - S Y C L - H V W - Q T - K - W N G S G E V I A C D I R F G N K L K N D G M A P V K L L L V T . . . . . . 22616 TAATCGATTTCAACCTCTGCAGACGAGCCAATTCTTGTGGCAAATCTCCATGAAAAGTGT - S I S T S A D E P I L V A N L H E K C N R F Q P L Q T S Q F L W Q I S M K S V L I D F N L C R R A N S C G K S P - K V . . . . . . 22556 TGTTACTGATGTCAAGGGAAGAAAGAAATGACAGGTTTCCGAGGTGTGGAGGAATGGTAC C Y - C Q G K K E M T G F R G V E E W Y V T D V K G R K K - Q V S E V W R N G T L L L M S R E E R N D R F P R C G G M V . . . . . 22496 CATGAAGTTGCATGCTTGAAATGTCTAAAGCAGTGACTCGATG H E V A C L K C L K Q - L D M K L H A - N V - S S D S M P - S C M L E M S K A V T R Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-16-_PGL-5_AGS-6_PPS_1 (22815 22564) (frame '0'; 249 bp, 83 residues) 1 RRSPISRGIS PSRKFLSKYN LCSFVILERE DGIFPENWLL DRCTKRRFGN KLKNDGMAPV 61 KLLLVTLIDF NLCRRANSCG KSP- 3-phase translation of AGS-6 (+strand): . . . . . . 22454 CATCGAGTCACTGCTTTAGACATTTCAAGCATGCAACTTCATGGTACCATTCCTCCACAC H R V T A L D I S S M Q L H G T I P P H I E S L L - T F Q A C N F M V P F L H T S S H C F R H F K H A T S W Y H S S T . . . . . . 22514 CTCGGAAACCTGTCATTTCTTTCTTCCCTTGACATCAGTAACAACACTTTTCATGGAGAT L G N L S F L S S L D I S N N T F H G D S E T C H F F L P L T S V T T L F M E I P R K P V I S F F P - H Q - Q H F S W R . . . . . . 22574 TTGCCACAAGAATTGGCTCGTCTGCAGAGGTTGAAATCGATTAATGTCACAAGCAATAAC L P Q E L A R L Q R L K S I N V T S N N C H K N W L V C R G - N R L M S Q A I T F A T R I G S S A E V E I D - C H K Q - . . . . . . 
22634 TTCACCGGAGCCATTCCATCATTTTTAAGTTTGTTACCAAACCTACGCTTTGTGCACCTA F T G A I P S F L S L L P N L R F V H L S P E P F H H F - V C Y Q T Y A L C T Y L H R S H S I I F K F V T K P T L C A P . . . . . . 22694 TCAAGCAACCAATTTTCTGGGAAAATTCCATCCTCCCTTTCCAATATAACAAAGCTGCAA S S N Q F S G K I P S S L S N I T K L Q Q A T N F L G K F H P P F P I - Q S C K I K Q P I F W E N S I L P F Q Y N K A A . . . . . . 22754 AGGTTATACTTGGATAGAAATTTTCTTGAAGGAGAGATCCCTCGAGAAATCGGTGATCTT R L Y L D R N F L E G E I P R E I G D L G Y T W I E I F L K E R S L E K S V I F K V I L G - K F S - R R D P S R N R - S . . . . . . 22814 CGTTACTTGACTATCCTAGACCTGCAAATTAATCAGCTTAGTGGCTCTATACCACCATCC R Y L T I L D L Q I N Q L S G S I P P S V T - L S - T C K L I S L V A L Y H H P S L L D Y P R P A N - S A - W L Y T T I . . . . . . 22874 ATTTTTAACATTACTACAATGCATGTGATTGCTCTTACGGGCAACAATCTTACTGGAAAT I F N I T T M H V I A L T G N N L T G N F L T L L Q C M - L L L R A T I L L E I H F - H Y Y N A C D C S Y G Q Q S Y W K . . . . . . 22934 CTTCCAAAAACGATATGTGATCATCTTCGAGACTTGGAAGGACTTTACCTCAGTAAAAAC L P K T I C D H L R D L E G L Y L S K N F Q K R Y V I I F E T W K D F T S V K T S S K N D M - S S S R L G R T L P Q - K . . . . . . 22994 TCCCTAGATGGAGTTATTCCACCAAACCTGGAGAAATGCAGAAAGCTTCAAAAATTGCAA S L D G V I P P N L E K C R K L Q K L Q P - M E L F H Q T W R N A E S F K N C N L P R W S Y S T K P G E M Q K A S K I A . . . . . . 23054 TTGGGTGAAAATAAGTTTATTGGAACTTTACCAAGAGAGCTAGCCAACTTAACAGCTCTT L G E N K F I G T L P R E L A N L T A L W V K I S L L E L Y Q E S - P T - Q L L I G - K - V Y W N F T K R A S Q L N S S . . . . . 
23114 ACATATTTATATCTTTCAGAATTGCATTTGGAAGGTAGTATGA T Y L Y L S E L H L E G S M H I Y I F Q N C I W K V V - Y I F I S F R I A F G R - Y Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-16+_PGL-5_AGS-6_PPS_1 (22454 23155) (frame '1'; 702 bp, 234 residues) 1 HRVTALDISS MQLHGTIPPH LGNLSFLSSL DISNNTFHGD LPQELARLQR LKSINVTSNN 61 FTGAIPSFLS LLPNLRFVHL SSNQFSGKIP SSLSNITKLQ RLYLDRNFLE GEIPREIGDL 121 RYLTILDLQI NQLSGSIPPS IFNITTMHVI ALTGNNLTGN LPKTICDHLR DLEGLYLSKN 181 SLDGVIPPNL EKCRKLQKLQ LGENKFIGTL PRELANLTAL TYLYLSELHL EGSM PGL 6 (+ strand): 23565 24327 AGS-1 (23565 24327) SCR (e 0.814) Exon 1 23565 24327 ( 763 n); score: 0.814 PGS (23565 24327) SGN-U345275+ 3-phase translation of AGS-1 (+strand): . . . . . . 23565 TTGCTGTGGAATAATTTTGTCAGCGATTCTACATTGAGCTTCCTTGCATCATTGACAAAC L L W N N F V S D S T L S F L A S L T N C C G I I L S A I L H - A S L H H - Q T A V E - F C Q R F Y I E L P C I I D K . . . . . . 23625 TGTAGGAATCTAAGAGCACTCACGTTAGCAGGTAATCCGTTGGATGGTGTATTGCCTGCA C R N L R A L T L A G N P L D G V L P A V G I - E H S R - Q V I R W M V Y C L H L - E S K S T H V S R - S V G W C I A C . . . . . . 23685 TCTGTTGGTAATTTCTCAAACTCCTTGCAAATTTTTGAAGCATATAATTGTACACTGAAG S V G N F S N S L Q I F E A Y N C T L K L L V I S Q T P C K F L K H I I V H - R I C W - F L K L L A N F - S I - L Y T E . . . . . . 23745 GGTGTCATTCCTCGAGAAATTGGTAATCTTACTGGACTGACAAGGATGAGTCTGTTTAAC G V I P R E I G N L T G L T R M S L F N V S F L E K L V I L L D - Q G - V C L T G C H S S R N W - S Y W T D K D E S V - . . . . . . 23805 AATACATTAACTGGACATATTCCAAATACTGTACATGGCATGTCGATCCTTCAAGAACTT N T L T G H I P N T V H G M S I L Q E L I H - L D I F Q I L Y M A C R S F K N F Q Y I N W T Y S K Y C T W H V D P S R T . . . . . . 23865 TACTTATTGAACAACAAGATAGAAGGAACCATACCAGATGTTGTCTGTAATTTAAAGAGA Y L L N N K I E G T I P D V V C N L K R T Y - T T R - K E P Y Q M L S V I - R D L L I E Q Q D R R N H T R C C L - F K E . . . . . . 
23925 CTTGGTGCATTACTCTTGTCAAAAAATCATTTTTCTGGTTCGGTACCCTTCTGCTTAGGG L G A L L L S K N H F S G S V P F C L G L V H Y S C Q K I I F L V R Y P S A - G T W C I T L V K K S F F W F G T L L L R . . . . . . 23985 AACATTACTAGTTTGAGGATACTTCATCTATATAACAACAAGCTGGATTCTACATTACCT N I T S L R I L H L Y N N K L D S T L P T L L V - G Y F I Y I T T S W I L H Y L E H Y - F E D T S S I - Q Q A G F Y I T . . . . . . 24045 TCAAACTTGGGGAACCTTCAAGATCTCATAGAATTAGATGTTTCATTCAATTTATTTAGT S N L G N L Q D L I E L D V S F N L F S Q T W G T F K I S - N - M F H S I Y L V F K L G E P S R S H R I R C F I Q F I - . . . . . . 24105 GGGGAAATTCCAATGGAGAGTGGAAACTTGAAGGCTGCAACACACATTGATTTGTCAAAT G E I P M E S G N L K A A T H I D L S N G K F Q W R V E T - R L Q H T L I C Q I W G N S N G E W K L E G C N T H - F V K . . . . . . 24165 AATTGTTTTTCTGGTAAGATGCCTAGTACTCTAGGGGGTCTAGATAAATTGATTCATCTT N C F S G K M P S T L G G L D K L I H L I V F L V R C L V L - G V - I N - F I F - L F F W - D A - Y S R G S R - I D S S . . . . . . 24225 TCTCTAACACATAATAGATTAGAGGGGCCTATTCCTGAATCATTTGGCAAAATGTTGTCT S L T H N R L E G P I P E S F G K M L S L - H I I D - R G L F L N H L A K C C L F S N T - - I R G A Y S - I I W Q N V V . . . . . 24285 TTGGAATACTTGGATTTGTCCTATAATAATATTAGTGGTCAAA L E Y L D L S Y N N I S G Q W N T W I C P I I I L V V K F G I L G F V L - - Y - W S Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-16+_PGL-6_AGS-1_PPS_1 (23565 24326) (frame '1'; 762 bp, 254 residues) 1 LLWNNFVSDS TLSFLASLTN CRNLRALTLA GNPLDGVLPA SVGNFSNSLQ IFEAYNCTLK 61 GVIPREIGNL TGLTRMSLFN NTLTGHIPNT VHGMSILQEL YLLNNKIEGT IPDVVCNLKR 121 LGALLLSKNH FSGSVPFCLG NITSLRILHL YNNKLDSTLP SNLGNLQDLI ELDVSFNLFS 181 GEIPMESGNL KAATHIDLSN NCFSGKMPST LGGLDKLIHL SLTHNRLEGP IPESFGKMLS 241 LEYLDLSYNN ISGQ 3-phase translation of AGS-1 (-strand): . . . . . . 
24327 TTTGACCACTAATATTATTATAGGACAAATCCAAGTATTCCAAAGACAACATTTTGCCAA F D H - Y Y Y R T N P S I P K T T F C Q L T T N I I I G Q I Q V F Q R Q H F A K - P L I L L - D K S K Y S K D N I L P . . . . . . 24267 ATGATTCAGGAATAGGCCCCTCTAATCTATTATGTGTTAGAGAAAGATGAATCAATTTAT M I Q E - A P L I Y Y V L E K D E S I Y - F R N R P L - S I M C - R K M N Q F I N D S G I G P S N L L C V R E R - I N L . . . . . . 24207 CTAGACCCCCTAGAGTACTAGGCATCTTACCAGAAAAACAATTATTTGACAAATCAATGT L D P L E Y - A S Y Q K N N Y L T N Q C - T P - S T R H L T R K T I I - Q I N V S R P P R V L G I L P E K Q L F D K S M . . . . . . 24147 GTGTTGCAGCCTTCAAGTTTCCACTCTCCATTGGAATTTCCCCACTAAATAAATTGAATG V L Q P S S F H S P L E F P H - I N - M C C S L Q V S T L H W N F P T K - I E - C V A A F K F P L S I G I S P L N K L N . . . . . . 24087 AAACATCTAATTCTATGAGATCTTGAAGGTTCCCCAAGTTTGAAGGTAATGTAGAATCCA K H L I L - D L E G S P S L K V M - N P N I - F Y E I L K V P Q V - R - C R I Q E T S N S M R S - R F P K F E G N V E S . . . . . . 24027 GCTTGTTGTTATATAGATGAAGTATCCTCAAACTAGTAATGTTCCCTAAGCAGAAGGGTA A C C Y I D E V S S N - - C S L S R R V L V V I - M K Y P Q T S N V P - A E G Y S L L L Y R - S I L K L V M F P K Q K G . . . . . . 23967 CCGAACCAGAAAAATGATTTTTTGACAAGAGTAATGCACCAAGTCTCTTTAAATTACAGA P N Q K N D F L T R V M H Q V S L N Y R R T R K M I F - Q E - C T K S L - I T D T E P E K - F F D K S N A P S L F K L Q . . . . . . 23907 CAACATCTGGTATGGTTCCTTCTATCTTGTTGTTCAATAAGTAAAGTTCTTGAAGGATCG Q H L V W F L L S C C S I S K V L E G S N I W Y G S F Y L V V Q - V K F L K D R T T S G M V P S I L L F N K - S S - R I . . . . . . 23847 ACATGCCATGTACAGTATTTGGAATATGTCCAGTTAATGTATTGTTAAACAGACTCATCC T C H V Q Y L E Y V Q L M Y C - T D S S H A M Y S I W N M S S - C I V K Q T H P D M P C T V F G I C P V N V L L N R L I . . . . . . 23787 TTGTCAGTCCAGTAAGATTACCAATTTCTCGAGGAATGACACCCTTCAGTGTACAATTAT L S V Q - D Y Q F L E E - H P S V Y N Y C Q S S K I T N F S R N D T L Q C T I I L V S P V R L P I S R G M T P F S V Q L . . . . . . 
23727 ATGCTTCAAAAATTTGCAAGGAGTTTGAGAAATTACCAACAGATGCAGGCAATACACCAT M L Q K F A R S L R N Y Q Q M Q A I H H C F K N L Q G V - E I T N R C R Q Y T I Y A S K I C K E F E K L P T D A G N T P . . . . . . 23667 CCAACGGATTACCTGCTAACGTGAGTGCTCTTAGATTCCTACAGTTTGTCAATGATGCAA P T D Y L L T - V L L D S Y S L S M M Q Q R I T C - R E C S - I P T V C Q - C K S N G L P A N V S A L R F L Q F V N D A . . . . . 23607 GGAAGCTCAATGTAGAATCGCTGACAAAATTATTCCACAGCAA G S S M - N R - Q N Y S T A E A Q C R I A D K I I P Q Q R K L N V E S L T K L F H S Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-16-_PGL-6_AGS-1_PPS_1 (23854 23567) (frame '0'; 288 bp, 96 residues) 1 RIDMPCTVFG ICPVNVLLNR LILVSPVRLP ISRGMTPFSV QLYASKICKE FEKLPTDAGN 61 TPSNGLPANV SALRFLQFVN DARKLNVESL TKLFHS PGL 7 (+ strand): 24996 25653 AGS-1 (24996 25212,25307 25653) SCR (e 0.811 d 1.000 a 0.953,e 0.813) Exon 1 24996 25212 ( 217 n); score: 0.811 Intron 1 25213 25306 ( 94 n); Pd: 1.000 Pa: 0.953 Exon 2 25307 25653 ( 347 n); score: 0.813 PGS (24996 25212,25307 25653) SGN-U322835+ 3-phase translation of AGS-1 (+strand): . . . . . . 24996 AGATTGGATATAATGATAGATGTTGCATCTGCAATGGATTATCTCCACAATGGCTATTCA R L D I M I D V A S A M D Y L H N G Y S D W I - - - M L H L Q W I I S T M A I Q I G Y N D R C C I C N G L S P Q W L F . . . . . . 25056 ACGCCTGTGGTGCATTGTGACTTGAAGCCAAGTAATGTCTTGTTAGATGAAGAAATGGTT T P V V H C D L K P S N V L L D E E M V R L W C I V T - S Q V M S C - M K K W L N A C G A L - L E A K - C L V R - R N G . . . . . . 25116 GCTCATGTAAGTGATTTTGGCATTGCAAAAATGTTAGGTGCAGGGGAGGCTTTTGTTCAA A H V S D F G I A K M L G A G E A F V Q L M - V I L A L Q K C - V Q G R L L F K C S C K - F W H C K N V R C R G G F C S . . . . : . . 25176 ACAAGGACAGTTGCAACCATTGGATATATTGCTCCAG : AGTATGGACAAGATGGCATAGTA T R T V A T I G Y I A P : E Y G Q D G I V Q G Q L Q P L D I L L Q : S M D K M A - Y N K D S C N H W I Y C S R : V W T R W H S . . . . . . 
25330 TCCACGAGTTGTGATGTTTATAGTTTTGGCATCGTGATGATGGAGACGTTCACACGAACA S T S C D V Y S F G I V M M E T F T R T P R V V M F I V L A S - - W R R S H E Q I H E L - C L - F W H R D D G D V H T N . . . . . . 25390 AGACCAAGTGATGAGATATTTACTGGAGACTTGAGCATACAGCGTTGGGTTAATGATTCC R P S D E I F T G D L S I Q R W V N D S D Q V M R Y L L E T - A Y S V G L M I P K T K - - D I Y W R L E H T A L G - - F . . . . . . 25450 TTTCCGGGTGAAATTCACAAGGTGGTGGATTCGAATTTGGTACAGCCAGGAGATGAACAA F P G E I H K V V D S N L V Q P G D E Q F R V K F T R W W I R I W Y S Q E M N K L S G - N S Q G G G F E F G T A R R - T . . . . . . 25510 ATCGCTGCAAAGATGCAATGTTTGTTATCTATCATGGAATTAGCTTTGAAGTGCACTTTA I A A K M Q C L L S I M E L A L K C T L S L Q R C N V C Y L S W N - L - S A L - N R C K D A M F V I Y H G I S F E V H F . . . . . . 25570 GTGAGACCTGATGAAAGAATTAGCATGAATGATGCTCTTTCAGCACTCAAAAAGATTAGA V R P D E R I S M N D A L S A L K K I R - D L M K E L A - M M L F Q H S K R L D S E T - - K N - H E - C S F S T Q K D - . . . 25630 CGACAGCTTGTTAGTAGTCGGCAC R Q L V S S R H D S L L V V G T T A C - - S A Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-16+_PGL-7_AGS-1_PPS_1 (24996 25212,25307 25653) (frame '1'; 564 bp, 188 residues) 1 RLDIMIDVAS AMDYLHNGYS TPVVHCDLKP SNVLLDEEMV AHVSDFGIAK MLGAGEAFVQ 61 TRTVATIGYI APEYGQDGIV STSCDVYSFG IVMMETFTRT RPSDEIFTGD LSIQRWVNDS 121 FPGEIHKVVD SNLVQPGDEQ IAAKMQCLLS IMELALKCTL VRPDERISMN DALSALKKIR 181 RQLVSSRH ... finished at: Mon Aug 28 22:27:17 2006 ________________________________________________________________________________ Sequence 17: C06HBa0054K13.1-17, from 1 to 3517, both strands analyzed. ... started at: Mon Aug 28 22:27:17 2006 EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA +strand ... ... found all matches, elapsed seconds = 2 ... 
matches indexed, elapsed seconds = 2 HitsTableSize = 0 EST library file: /tmp/cxgn-bacpublish-resources-LF1h7Z/lycopersicum_combined_unigene_seqs; matching gDNA -strand ... ... found all matches, elapsed seconds = 2 ... matches indexed, elapsed seconds = 2 HitsTableSize = 2 ******************************************************************************** EST sequence 1 +strand 1754 n (File: SGN-U314435+) 1 AAAAAGTGCA GAAAAAACCA ACTAATTGCA TTCTCCATTC TTGGAAGTGG CCATTCTTGA 61 TTTCTTGAAA CAAAGGTTTG TTTCCCTTCA CTTCTTGATA TGTAAAGTTG CAATCTTTAT 121 AACTTTCTAT TGCTTTGCTA GTGTTTTTGT TATATACAGG GGGTGGAGTT AGAGGGTAAG 181 TTACGCATTT AGTCGTAACT TTAGTCAAAC TTCGTAATAA TTTAGTAAGT TAAAATATAT 241 TAGAAATTTT CAGAATTCAT AAACTTTAAA TTTTAAATTT TGACTTCGCT TTGTGTGACT 301 ATACAATTAC AGAAATTCAG AGTGGCCATT GTTGAAAGAG AGGGTGGAAT TTGTAAGTCA 361 AGAAACAGGT TACTCCTGTT TGAGTGAGGA AAAGTTGGTT TGCCTGTCTG TGGTCTTTTT 421 ATAATCTTTT TCTACAGAAG AGAAAGTGGG TAATTTTGTT TGAGAGTGGA AATATTCTCT 481 AGTGGGAATC TACTAGGAGT AATTTATTTT CTATAAACTA AGTAAAGTTT GGAAGGTGAC 541 AAAAAGAAAG ACAAAAATCT TGGAATTGTT TTAGACAACC AAGGTTTTCT TGCTCAGAAT 601 GTCTGTTGCC TTGTTATGGG TTGTTTCTCC TTGTGACGTC TCAAATGGGA CAAGTTTCAT 661 GGAATCAGTC CGGGAGGGAA ACCGTTTTTT TGATTCATCG AGGCATAGGA ATTTGGTGTC 721 CAATGAGAGA ATCAATAGAG GTGGTGGAAA GCAAACTAAT AATGGACGGA AATTTTCTGT 781 ACGGTCTGCT ATTTTGGCTA CTCCATCTGG AGAACGGACG ATGACATCGG AACAGATGGT 841 CTATGATGTG GTTTTGAGGC AGGCAGCCTT GGTGAAGAGG CAACTGAGAT CTACCAATGA 901 GTTAGAAGTG AAGCCGGATG TTGAATGCCT TGAATCGGAC CCGCTACAAA CAGAAAGGAC 961 GGGGGTCTCG CTGCCCGGTC AGCGAGTCGG GGGGTCCAGG GGGGCGACGC GCTGGCCTGG 1021 GGGTCCGGGG GGGCGGAGAC GCGGCGCCGA CGGTATACAA TGTTGTTGTA TTGGGCCCTT 1081 AATTTTCTGT TGATTCTGTA TGTTGGGCCC AAGCCTTTTA GGGCGTAGCT TAGCACTATA 1141 TATAGACGCT ATGGCAAACC CTATTCTGTA ATTCTGTTTT TGCCTCTCCA TAATAAAATT 1201 GCTCCCTCTC TTCCCGTGGA CGTAGCCAAT TTATTGGTGA ACCACGTAAA TCTGTTGTCT 1261 TGTTTTTCGC GTTTATATAT TTTCTCGTAT TATCTCAAAT TTCGCACAAC ACTCTTAATA 1321 TTCATAACTA TCATCTTTTC ATATTCATAA CCTCCAAATA 
TTTAAATTAA ACTTTAAGAT 1381 ATCTTTTGGT ATTCCTTCTA TTCTATTTGT ATAAATTCAA CTTCTTTATC TCATGAAACC 1441 CCTATCAAGA TTATTATTTT TATTCTATAG TAAAAATAGA TGCTGAAAAC TCTTGAATTT 1501 TGATAGGATA TGAAAGGAGT CGATAAAAAC TCAGAGAGTT ATGTACTAAT TTTTACTTAT 1561 TTTTTCATCT ATATATACAT CAATCTTATA AGAATAATGT CTATATTGTA TTTTTTTCTT 1621 AAATATTCTG TTTCTTTTAG TCTTTTTTTT CACTCTGTTA GACTTCTTAA TTTAGTTTTC 1681 TATGAATGAT TTATTGTCGT ATGTCTTTGA ATTTTGTAAT TGTTACATTT TATTATTCAT 1741 TACAATTTAC ATAT Predicted gene structure (within gDNA segment 3517 to 1): Exon 1 1837 1594 ( 244 n); cDNA 1066 1311 ( 246 n); score: 0.914 MATCH C06HBa0054K13.1-17- SGN-U314435+ 0.914 244 0.139 C PGS_C06HBa0054K13.1-17-_SGN-U314435+ (1837 1594) Alignment (genomic DNA sequence = upper lines): TTGAATTGGG CCCTTAATTA TCTGTCAATT CTGTATTTTG GGCCCAAGCC TGTTAGGGCG 1778 ||| |||||| ||||||||| ||||| ||| |||||| ||| |||||||||| | |||||||| TTGTATTGGG CCCTTAATTT TCTGTTGATT CTGTATGTTG GGCCCAAGCC TTTTAGGGCG 1125 TAGCTTAGCA CTATATATAG ACGCTATGGC AAACCCTATT CTGTAATTCT ATTCTTGCCT 1718 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| || |||||| TAGCTTAGCA CTATATATAG ACGCTATGGC AAACCCTATT CTGTAATTCT GTTTTTGCCT 1185 CTCCATAATA AAACTGCTCC CCCTCATCCC GTGGACGTAG TCAATTTATT GGTAAACCAC 1658 |||||||||| ||| |||||| | ||| |||| |||||||||| ||||||||| ||| |||||| CTCCATAATA AAATTGCTCC CTCTCTTCCC GTGGACGTAG CCAATTTATT GGTGAACCAC 1245 GTAAATCTGT TGTCTTGTTT TTCGCGTTTA TAT-TTTT-T CGTATTATAT CAAATTCCGC 1600 |||||||||| |||||||||| |||||||||| ||| |||| | |||||||| | |||||| ||| GTAAATCTGT TGTCTTGTTT TTCGCGTTTA TATATTTTCT CGTATTATCT CAAATTTCGC 1305 ATAACA 1594 | |||| ACAACA 1311 hqPGS_C06HBa0054K13.1-17-_SGN-U314435+ (1837 1594) ******************************************************************************** EST sequence 2 +strand 842 n (File: SGN-U345275+) 1 ATTGACAAAC GCTGGAGCTC CACCGCGGTG GCGGCCGCTC TAGAACTAGT GGATCCCCCG 61 GGCTGCAGGC TCAACTTGTG GGGGAATAAT TTTGACATCG ATTCAACATT GAGCTTCCTT 121 GAATCATTGA CAAACTGTAG GAATCTAAGA GTACTCACGC TTGGTGGTAA TCCGTTGGAT 181 
GGTGTTTTGC CTGCATCTGT TGGGAATTTC TCAAACTGCT TGCAAATATG TGAAGCATCT 241 AAATGTAAAC TGAATGGTGT CATTTCAAAA CAAATTACTA ATCTTACTGG ATTGACAAGG 301 ATGAGTCTGT CGAACAATCA GTTGATAGGC CATATTCCAA CAACAGTGCA AGGAATGCTG 361 AACCTTCAAG AACTTTACCT ATGAAGCAAC AAGTTAGAAG GAGCCATACC AGATGTTATC 421 TGCAGTTGAC AGTATCTTGG TGCATTAGAA TTGTCAGAGA ATCAATTTTC TAGTTTCGTT 481 CCACCATGCT TAGGGAATGT TACTAGTTTG AGGACACTCT ATCTAGATAA CAACAAGCTG 541 GATTCTAGAT TACCTGCAAG ATTGGGGGGA CTTCAAAACA TCATAGAGTT CAATATTTCA 601 TCCAATTATT TGAGTGGAGA AATTCCGCTA GAGAGCGGAA ACTTGAATGG TGCAACACTG 661 ATTGATCTGT CAAATAATTA TTTTTCTGGG TAGATTCCTA GTACTCTAGG GGGCCTAGAT 721 AAATTAAATT AACTTTCTCT AGCACATAGT GGATTACAAG GGCCTATTTC TGAATCATTT 781 GACAAATTGC GGGCCTTGGA ATAACTGGGA TTTGGCCTAT TACAAATCTT AGGGGTGAAA 841 AG Predicted gene structure (within gDNA segment 3517 to 69): Exon 1 2489 1849 ( 641 n); cDNA 85 725 ( 641 n); score: 0.805 MATCH C06HBa0054K13.1-17- SGN-U345275+ 0.805 641 0.761 C PGS_C06HBa0054K13.1-17-_SGN-U345275+ (2489 1849) Alignment (genomic DNA sequence = upper lines): AATAATTTTA TCAGCGATTC ATCGTTGAGT TTCCTTACAT CATTAACAAA CTGTAGAAAA 2430 ||||||||| || |||||| | | ||||| |||||| || |||| ||||| |||||| || AATAATTTTG ACATCGATTC AACATTGAGC TTCCTTGAAT CATTGACAAA CTGTAGGAAT 144 CTAAGAGTAC TCCTGTTTAG TGAGAATGCA TTGGATGGGG CTTTATCAGT GTCTGTTGGT 2370 |||||||||| || | || | || ||| | |||||||| | ||| | | |||||||| CTAAGAGTAC TCACGCTTGG TGGTAATCCG TTGGATGGTG TTTTGCCTGC ATCTGTTGGG 204 AATTTCTCAA ACTCTCTGCA AAATTTTGAA GGAAATGGTT GTAAGCTAAA GGGCATCATT 2310 |||||||||| ||| |||| || | |||| | | | | |||| || || || ||||| AATTTCTCAA ACTGCTTGCA AATATGTGAA GCATCTAAAT GTAAACTGAA TGGTGTCATT 264 CCTACAGAAA TTGGTAATCT TACTGGTGTG ATATATATGA GTTTGTATGA CAATAAGTTG 2250 | | | ||| || |||||| |||||| || | | |||| || ||| | |||| ||||| TCAAAACAAA TTACTAATCT TACTGGATTG ACAAGGATGA GTCTGTCGAA CAATCAGTTG 324 ACTGGACATA TTCCAAATAC TGTTCAAGAC ATGTTGAACC TACAAGAATT TTACCTAACA 2190 | || |||| |||||| || || |||| ||| |||||| | |||||| | ||||||| | ATAGGCCATA 
TTCCAACAAC AGTGCAAGGA ATGCTGAACC TTCAAGAACT TTACCTATGA 384 AGCAACAAGA TAGAAGGAAC CATACCAAAT GCTTTATGCA GTTTAATGAA TCTTGGCGCA 2130 ||||||||| |||||||| | ||||||| || | | | |||| ||| | | | |||||| ||| AGCAACAAGT TAGAAGGAGC CATACCAGAT GTTATCTGCA GTTGACAGTA TCTTGGTGCA 444 TTAGACTTGT CAGGAAATCA TTTTTCTGGT TCAGTGCCCT CATGCTTAGG GAATGTTACG 2070 ||||| |||| ||| ||||| |||||| || | || || |||||||||| ||||||||| TTAGAATTGT CAGAGAATCA ATTTTCTAGT TTCGTTCCAC CATGCTTAGG GAATGTTACT 504 AGTTTGAGGT ATCTTAATCT AGCTTACAAC AGGCTGAATT CAAGATTACC TGCAAACTTA 2010 ||||||||| || |||| || | ||||| | |||| ||| | |||||||| ||||| || AGTTTGAGGA CACTCTATCT AGATAACAAC AAGCTGGATT CTAGATTACC TGCAAGATTG 564 GGGAGCCTTC AAGATCTCAT AACATTCAAT ATTTCATCCA ATTTATTGAG TGGGGAAATT 1950 ||| | |||| || | |||| | |||||| |||||||||| ||| ||||| ||| |||||| GGGGGACTTC AAAACATCAT AGAGTTCAAT ATTTCATCCA ATTATTTGAG TGGAGAAATT 624 CCGCTAGAGA GCGGAAACTT GAAGGCTGCA ACACTGATTG ATCTGTCAAA TAATTATTTT 1890 |||||||||| |||||||||| ||| | |||| |||||||||| |||||||||| |||||||||| CCGCTAGAGA GCGGAAACTT GAATGGTGCA ACACTGATTG ATCTGTCAAA TAATTATTTT 684 TCTGGTAAGA TTCCTAGTAC TCTAGGGGGC CTAGATAAAT T 1849 ||||| ||| |||||||||| |||||||||| |||||||||| | TCTGGGTAGA TTCCTAGTAC TCTAGGGGGC CTAGATAAAT T 725 hqPGS_C06HBa0054K13.1-17-_SGN-U345275+ (2489 1849) Total number of EST alignments reported: 2 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 3517: PGL 1 (- strand): 2489 1594 AGS-1 (1837 1594) SCR (e 0.914) Exon 1 1837 1594 ( 244 n); score: 0.914 PGS (1837 1594) SGN-U314435+ 3-phase translation of AGS-1 (-strand): . . . . . . 1837 TTGAATTGGGCCCTTAATTATCTGTCAATTCTGTATTTTGGGCCCAAGCCTGTTAGGGCG L N W A L N Y L S I L Y F G P K P V R A - I G P L I I C Q F C I L G P S L L G R E L G P - L S V N S V F W A Q A C - G . . . . . . 
1777 TAGCTTAGCACTATATATAGACGCTATGGCAAACCCTATTCTGTAATTCTATTCTTGCCT - L S T I Y R R Y G K P Y S V I L F L P S L A L Y I D A M A N P I L - F Y S C L V A - H Y I - T L W Q T L F C N S I L A . . . . . . 1717 CTCCATAATAAAACTGCTCCCCCTCATCCCGTGGACGTAGTCAATTTATTGGTAAACCAC L H N K T A P P H P V D V V N L L V N H S I I K L L P L I P W T - S I Y W - T T S P - - N C S P S S R G R S Q F I G K P . . . . . . 1657 GTAAATCTGTTGTCTTGTTTTTCGCGTTTATATTTTTTCGTATTATATCAAATTCCGCAT V N L L S C F S R L Y F F V L Y Q I P H - I C C L V F R V Y I F S Y Y I K F R I R K S V V L F F A F I F F R I I S N S A . 1597 AACA N T - Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-1 (+strand): . . . . . . 1594 TGTTATGCGGAATTTGATATAATACGAAAAAATATAAACGCGAAAAACAAGACAACAGAT C Y A E F D I I R K N I N A K N K T T D V M R N L I - Y E K I - T R K T R Q Q I L C G I - Y N T K K Y K R E K Q D N R . . . . . . 1654 TTACGTGGTTTACCAATAAATTGACTACGTCCACGGGATGAGGGGGAGCAGTTTTATTAT L R G L P I N - L R P R D E G E Q F Y Y Y V V Y Q - I D Y V H G M R G S S F I M F T W F T N K L T T S T G - G G A V L L . . . . . . 1714 GGAGAGGCAAGAATAGAATTACAGAATAGGGTTTGCCATAGCGTCTATATATAGTGCTAA G E A R I E L Q N R V C H S V Y I - C - E R Q E - N Y R I G F A I A S I Y S A K W R G K N R I T E - G L P - R L Y I V L . . . . . . 1774 GCTACGCCCTAACAGGCTTGGGCCCAAAATACAGAATTGACAGATAATTAAGGGCCCAAT A T P - Q A W A Q N T E L T D N - G P N L R P N R L G P K I Q N - Q I I K G P I S Y A L T G L G P K Y R I D R - L R A Q . 1834 TCAA S Q F Maximal non-overlapping open reading frames (>= 64 codons): none AGS-2 (2489 1849) SCR (e 0.805) Exon 1 2489 1849 ( 641 n); score: 0.805 PGS (2489 1849) SGN-U345275+ 3-phase translation of AGS-2 (-strand): . . . . . . 2489 AATAATTTTATCAGCGATTCATCGTTGAGTTTCCTTACATCATTAACAAACTGTAGAAAA N N F I S D S S L S F L T S L T N C R K I I L S A I H R - V S L H H - Q T V E N - F Y Q R F I V E F P Y I I N K L - K . . . . . . 
2429 CTAAGAGTACTCCTGTTTAGTGAGAATGCATTGGATGGGGCTTTATCAGTGTCTGTTGGT L R V L L F S E N A L D G A L S V S V G - E Y S C L V R M H W M G L Y Q C L L V T K S T P V - - E C I G W G F I S V C W . . . . . . 2369 AATTTCTCAAACTCTCTGCAAAATTTTGAAGGAAATGGTTGTAAGCTAAAGGGCATCATT N F S N S L Q N F E G N G C K L K G I I I S Q T L C K I L K E M V V S - R A S F - F L K L S A K F - R K W L - A K G H H . . . . . . 2309 CCTACAGAAATTGGTAATCTTACTGGTGTGATATATATGAGTTTGTATGACAATAAGTTG P T E I G N L T G V I Y M S L Y D N K L L Q K L V I L L V - Y I - V C M T I S - S Y R N W - S Y W C D I Y E F V - Q - V . . . . . . 2249 ACTGGACATATTCCAAATACTGTTCAAGACATGTTGAACCTACAAGAATTTTACCTAACA T G H I P N T V Q D M L N L Q E F Y L T L D I F Q I L F K T C - T Y K N F T - Q D W T Y S K Y C S R H V E P T R I L P N . . . . . . 2189 AGCAACAAGATAGAAGGAACCATACCAAATGCTTTATGCAGTTTAATGAATCTTGGCGCA S N K I E G T I P N A L C S L M N L G A A T R - K E P Y Q M L Y A V - - I L A H K Q Q D R R N H T K C F M Q F N E S W R . . . . . . 2129 TTAGACTTGTCAGGAAATCATTTTTCTGGTTCAGTGCCCTCATGCTTAGGGAATGTTACG L D L S G N H F S G S V P S C L G N V T - T C Q E I I F L V Q C P H A - G M L R I R L V R K S F F W F S A L M L R E C Y . . . . . . 2069 AGTTTGAGGTATCTTAATCTAGCTTACAACAGGCTGAATTCAAGATTACCTGCAAACTTA S L R Y L N L A Y N R L N S R L P A N L V - G I L I - L T T G - I Q D Y L Q T - E F E V S - S S L Q Q A E F K I T C K L . . . . . . 2009 GGGAGCCTTCAAGATCTCATAACATTCAATATTTCATCCAATTTATTGAGTGGGGAAATT G S L Q D L I T F N I S S N L L S G E I G A F K I S - H S I F H P I Y - V G K F R E P S R S H N I Q Y F I Q F I E W G N . . . . . . 1949 CCGCTAGAGAGCGGAAACTTGAAGGCTGCAACACTGATTGATCTGTCAAATAATTATTTT P L E S G N L K A A T L I D L S N N Y F R - R A E T - R L Q H - L I C Q I I I F S A R E R K L E G C N T D - S V K - L F . . . . . 
1889 TCTGGTAAGATTCCTAGTACTCTAGGGGGCCTAGATAAATT S G K I P S T L G G L D K L V R F L V L - G A - I N F W - D S - Y S R G P R - I Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-17-_PGL-1_AGS-2_PPS_1 (2489 1851) (frame '1'; 639 bp, 213 residues) 1 NNFISDSSLS FLTSLTNCRK LRVLLFSENA LDGALSVSVG NFSNSLQNFE GNGCKLKGII 61 PTEIGNLTGV IYMSLYDNKL TGHIPNTVQD MLNLQEFYLT SNKIEGTIPN ALCSLMNLGA 121 LDLSGNHFSG SVPSCLGNVT SLRYLNLAYN RLNSRLPANL GSLQDLITFN ISSNLLSGEI 181 PLESGNLKAA TLIDLSNNYF SGKIPSTLGG LDK 3-phase translation of AGS-2 (+strand): . . . . . . 1849 AATTTATCTAGGCCCCCTAGAGTACTAGGAATCTTACCAGAAAAATAATTATTTGACAGA N L S R P P R V L G I L P E K - L F D R I Y L G P L E Y - E S Y Q K N N Y L T D F I - A P - S T R N L T R K I I I - Q . . . . . . 1909 TCAATCAGTGTTGCAGCCTTCAAGTTTCCGCTCTCTAGCGGAATTTCCCCACTCAATAAA S I S V A A F K F P L S S G I S P L N K Q S V L Q P S S F R S L A E F P H S I N I N Q C C S L Q V S A L - R N F P T Q - . . . . . . 1969 TTGGATGAAATATTGAATGTTATGAGATCTTGAAGGCTCCCTAAGTTTGCAGGTAATCTT L D E I L N V M R S - R L P K F A G N L W M K Y - M L - D L E G S L S L Q V I L I G - N I E C Y E I L K A P - V C R - S . . . . . . 2029 GAATTCAGCCTGTTGTAAGCTAGATTAAGATACCTCAAACTCGTAACATTCCCTAAGCAT E F S L L - A R L R Y L K L V T F P K H N S A C C K L D - D T S N S - H S L S M - I Q P V V S - I K I P Q T R N I P - A . . . . . . 2089 GAGGGCACTGAACCAGAAAAATGATTTCCTGACAAGTCTAATGCGCCAAGATTCATTAAA E G T E P E K - F P D K S N A P R F I K R A L N Q K N D F L T S L M R Q D S L N - G H - T R K M I S - Q V - C A K I H - . . . . . . 2149 CTGCATAAAGCATTTGGTATGGTTCCTTCTATCTTGTTGCTTGTTAGGTAAAATTCTTGT L H K A F G M V P S I L L L V R - N S C C I K H L V W F L L S C C L L G K I L V T A - S I W Y G S F Y L V A C - V K F L . . . . . . 2209 AGGTTCAACATGTCTTGAACAGTATTTGGAATATGTCCAGTCAACTTATTGTCATACAAA R F N M S - T V F G I C P V N L L S Y K G S T C L E Q Y L E Y V Q S T Y C H T N - V Q H V L N S I W N M S S Q L I V I Q . . . . . . 
2269 CTCATATATATCACACCAGTAAGATTACCAATTTCTGTAGGAATGATGCCCTTTAGCTTA L I Y I T P V R L P I S V G M M P F S L S Y I S H Q - D Y Q F L - E - C P L A Y T H I Y H T S K I T N F C R N D A L - L . . . . . . 2329 CAACCATTTCCTTCAAAATTTTGCAGAGAGTTTGAGAAATTACCAACAGACACTGATAAA Q P F P S K F C R E F E K L P T D T D K N H F L Q N F A E S L R N Y Q Q T L I K T T I S F K I L Q R V - E I T N R H - - . . . . . . 2389 GCCCCATCCAATGCATTCTCACTAAACAGGAGTACTCTTAGTTTTCTACAGTTTGTTAAT A P S N A F S L N R S T L S F L Q F V N P H P M H S H - T G V L L V F Y S L L M S P I Q C I L T K Q E Y S - F S T V C - . . . . . 2449 GATGTAAGGAAACTCAACGATGAATCGCTGATAAAATTATT D V R K L N D E S L I K L M - G N S T M N R - - N Y - C K E T Q R - I A D K I I Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0054K13.1-17+_PGL-1_AGS-2_PPS_1 (2227 2487) (frame '1'; 261 bp, 87 residues) 1 TVFGICPVNL LSYKLIYITP VRLPISVGMM PFSLQPFPSK FCREFEKLPT DTDKAPSNAF 61 SLNRSTLSFL QFVNDVRKLN DESLIKL ... finished at: Mon Aug 28 22:27:24 2006
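The coordinate spans and 3-phase translations reported above can be spot-checked by hand. The following is a minimal sketch, not part of GeneSeqer itself: the function names (`span_length`, `reverse_complement`, `translate`) are hypothetical helpers introduced only for illustration, and the standard genetic code is assumed. The log prints stop codons as "-", whereas this sketch uses "*".

```python
# Hypothetical helpers for spot-checking the report; not GeneSeqer code.

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C", "N": "N"}


def span_length(start, end):
    """Length in nucleotides of an inclusive span; works for either strand."""
    return abs(start - end) + 1


def reverse_complement(dna):
    """Return the opposite-strand sequence, read 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(dna.upper()))


def translate(dna, frame=0):
    """Translate one reading frame (0, 1, or 2); unknown codons become 'X'."""
    dna = dna.upper()
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X")
        for i in range(frame, len(dna) - 2, 3)
    )


if __name__ == "__main__":
    # Exon 1 of AGS-2 is reported as 2489 1849 (641 n).
    print(span_length(2489, 1849))
    # The minus-strand ORF is reported as (2489 1851), 639 bp, 213 residues.
    print(span_length(2489, 1851) // 3)
    # First 30 bases of the AGS-2 minus-strand translation block.
    print(translate("AATAATTTTATCAGCGATTCATCGTTGAGT"))
```

Under these assumptions, the first frame of the AGS-2 minus-strand sequence starts with NNFISDSSLS, which agrees with the open reading frame listed in the report, and the plus-strand block starting at 1849 is the reverse complement of the end of the minus-strand block.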