GeneSeqer. Version of March 12, 2006. Date run: Tue Jul 25 01:31:22 2006 (Bayesian) Splice site model (species): Arabidopsis thaliana Fast search parameters: MinMatchLen 16, MinQualityHSP 30, MinQualityCHAIN 50. Total number of ESTs: 34829 Total sequence length: 35392868 Minimum sequence length: 63 Maximum sequence length: 5381 Length distribution (number of sequences of specified length): < 100: 4 < 200: 53 < 300: 143 < 400: 353 < 500: 791 < 600: 1674 < 700: 2583 < 800: 4023 < 900: 6481 < 1000: 5706 >=1000: 13018 Input file : /tmp/bac-submission-temp-r4tk8/C09HBa0099P03/C09HBa0099P03.seq.screen ________________________________________________________________________________ Sequence 1: C09HBa0099P03.1-1, from 1 to 45342, both strands analyzed. ... started at: Tue Jul 25 01:38:52 2006 EST library file: /tmp/cxgn-bacpublish-resources-mxQHWy/lycopersicum_combined_unigene_seqs; matching gDNA +strand ... ... found all matches, elapsed seconds = 1 ... matches indexed, elapsed seconds = 2 HitsTableSize = 6 EST library file: /tmp/cxgn-bacpublish-resources-mxQHWy/lycopersicum_combined_unigene_seqs; matching gDNA -strand ... ... found all matches, elapsed seconds = 2 ... matches indexed, elapsed seconds = 2 HitsTableSize = 8 ******************************************************************************** EST sequence 14 +strand 542 n (File: SGN-U331997+) 1 CCGTCTTGAT AGCTCAAGTA ATGATGAAAT TGATAAACTT GCTATTAATG AACTCTTTGA 61 GGTTGCTTTA AAAGAAAGTA AAAGTGGCCC TCTTGTTTTG TTCATCAAAG ACATTGAGAA 121 GTCTATGGTG GGTAATCCTG AGGCCTATGC TGCTTTCAAG ATTAAGCTCG AGCATTTGCC 181 AGAGAATGTT GTTGCCATAG CTTCCCATGC CCAGTCGGAC AGCCGAAAGG AGAAATCGCA 241 TCCTGGTGGC TTGCTATTTA CAAAATTTGG AAGTAACCAA ACAGCATTGC TTGACCTTGC 301 CTTCCCAGAT AATTTTGGTA GGTTGCATGA TAGAAGCAAA GAAACCCCCA AGACGATGAA 361 GCAGCTCACA CGACTGTTTC CCAACAAAGT TACCATACAG ATCCCTCAGG ACGAAACCTT 421 ATTATCCGAC TGGAAGCAAA AGTTAGATCG GGATATGGAA ACTATGAAAT CTCAGTCAAA 481 CATTGCAAGC ATTCGCAATG TTTTGAATCG ATTCAAAATT AATTGCGATG ACCTTGAAAT 541 TC Predicted gene structure (within gDNA segment 2289 to 1): Exon 1 1689 1629 ( 61 n); cDNA 1 61 ( 61 n); score: 1.000 Intron 1 1628 1546 ( 83 n); Pd: 0.997 (s: 1.00), Pa: 0.988 (s: 1.00) Exon 2 1545 1372 ( 174 n); cDNA 62 235 ( 174 n); score: 1.000 Intron 2 1371 1287 ( 85 n); Pd: 0.877 (s: 1.00), Pa: 0.929 (s: 1.00) Exon 3 1286 1215 ( 72 n); cDNA 236 307 ( 72 n); score: 1.000 MATCH C09HBa0099P03.1-1- SGN-U331997+ 1.000 307 0.566 C PGS_C09HBa0099P03.1-1-_SGN-U331997+ (1689 1629,1545 1372,1286 1215) Alignment (genomic DNA sequence = upper lines): CCGTCTTGAT AGCTCAAGTA ATGATGAAAT TGATAAACTT GCTATTAATG AACTCTTTGA 1630 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCGTCTTGAT AGCTCAAGTA ATGATGAAAT TGATAAACTT GCTATTAATG AACTCTTTGA 60 GGTATTTTCT AATATTCCTT TCCATGATAC TATTCCACTT CTTAACCGAG TGTGATAGCT 1570 | G......... .......... .......... .......... .......... .......... 61 TATTTATATG TTTTATCTGT ATAGGTTGCT TTAAAAGAAA GTAAAAGTGG CCCTCTTGTT 1510 |||||| |||||||||| |||||||||| |||||||||| .......... .......... ....GTTGCT TTAAAAGAAA GTAAAAGTGG CCCTCTTGTT 97 TTGTTCATCA AAGACATTGA GAAGTCTATG GTGGGTAATC CTGAGGCCTA TGCTGCTTTC 1450 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGTTCATCA AAGACATTGA GAAGTCTATG GTGGGTAATC CTGAGGCCTA TGCTGCTTTC 157 AAGATTAAGC TCGAGCATTT GCCAGAGAAT GTTGTTGCCA TAGCTTCCCA TGCCCAGTCG 1390 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAGATTAAGC TCGAGCATTT GCCAGAGAAT GTTGTTGCCA TAGCTTCCCA TGCCCAGTCG 217 GACAGCCGAA AGGAGAAAGT ATGCAAGGCA TAGTTATTCT GCATTTCTTT CTGATAAAAA 1330 |||||||||| |||||||| GACAGCCGAA AGGAGAAA.. .......... .......... .......... .......... 235 ATTGCCTTGC GATGCTGATA GCCTTACTTT TTATCTGTTA CAGTCGCATC CTGGTGGCTT 1270 ||||||| |||||||||| .......... .......... .......... .......... ...TCGCATC CTGGTGGCTT 252 GCTATTTACA AAATTTGGAA GTAACCAAAC AGCATTGCTT GACCTTGCCT TCCCA 1215 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||| GCTATTTACA AAATTTGGAA GTAACCAAAC AGCATTGCTT GACCTTGCCT TCCCA 307 hqPGS_C09HBa0099P03.1-1-_SGN-U331997+ (1689 1629,1545 1372,1286 1215) ******************************************************************************** EST sequence 8 +strand 884 n (File: SGN-U347100+) 1 ATAGCTGGAG CTCCCCGCGG TGGCGGCCGC TCTAGAACTA GTGGATCCCC CGGGCTGCAG 61 GAATTCGGCA CGAGGGAATT ACTCAGAGAT CTTGACCCGC CAATTTTGAC ATCAACCAGG 121 CGCCAAGCTT TTAAAGATGC ATTACAACAA GGAATACTTG ATTCAAAAAG TATTGAGGTC 181 TCGTTTGAAA ATTTTCCGTA CTATTTGAGT GAAACCACGA AGAATGTTCT GATTTCTTCC 241 ACTTACGTTC ATTTGAAGTG TCACAAATTC ATAAAATATG CACCTGATCT CCCCACGTTG 301 TGCCCTAGGA TTTTGTTATC AGGTCCAGCA GGTTCAGAAA TCTATCAGGA GACATTGGCC 361 AAGGCTCTTG CAAAATACTT TGGTGTCAGG CTACTGATAG TTGATTCTCT TTTACTGCCT 421 GGTGGGTCAA TTGCCAAAGA CATTGACTCT GTGAAGGAAA GTTCAAAGCC TGAGAGAGCA 481 AGTGTCTTTG CTAAACGTGC TGCCCAGGTG GCTGCACTAC ACCTTAATAA GAAGCCAGCT 541 TCAAGTGTTG AGGCTGATAT AACTGGGGGT TCTACAGTAA GTTCTCATGC TCAGCCTAAG 601 CAGGAGGCAT CTACTGCATC ATCCAAAAAC TACACTTTTA AGAAAGGTGA CAGAGTGAAG 661 TATGTTGGTC CGTTACAGTC AGGCTTTTCT CCCTTTGCAG CACCTTTGAA GGGGTCAACA 721 TATGGTTACA GGGGCAAAGG GGGTCTGCAT TTGAAAGATA TGAGTCCTCT AAGATGGGGA 781 TCAGGTTGAT AGATCATTCC CAAGGGCATG ATCTTTGTGG GCCCTGCCAG GAGAACATGG 841 GTTTTTTGTG CTGGCCACTT TTACCCCTTG ATGCTCAACA GGGA Predicted gene structure (within gDNA segment 7432 to 1): Exon 1 4246 4114 ( 133 n); cDNA 77 209 ( 133 n); score: 0.752 Intron 1 4113 3942 ( 172 n); Pd: 0.958 (s: 0.76), Pa: 0.859 (s: 0.84) Exon 2 3941 3820 ( 122 n); cDNA 210 331 ( 122 n); score: 0.820 Intron 2 3819 3735 ( 85 n); Pd: 0.956 (s: 0.88), Pa: 0.729 (s: 0.88) Exon 3 3734 3643 ( 92 n); cDNA 332 423 ( 92 n); score: 0.859 Intron 3 3642 2710 ( 933 n); Pd: 0.206 (s: 0.84), Pa: 0.997 (s: 0.70) Exon 4 2709 2487 ( 223 n); cDNA 424 646 ( 223 n); score: 0.830 Intron 4 2486 2341 ( 146 n); Pd: 0.973 (s: 0.90), Pa: 0.842 (s: 0.74) Exon 5 2340 2277 ( 64 n); cDNA 647 711 ( 65 n); score: 0.695 Intron 5 2276 1967 ( 310 n); Pd: 0.988 (s: 0.62), Pa: 0.998 (s: 0.84) Exon 6 1966 1818 ( 149 n); cDNA 712 853 ( 142 n); score: 0.805 MATCH C09HBa0099P03.1-1- SGN-U347100+ 0.803 783 0.886 C PGS_C09HBa0099P03.1-1-_SGN-U347100+ (4246 4114,3941 3820,3734 3643,2709 2487,2340 2277,1966 1818) Alignment (genomic DNA sequence = upper lines): AACTACTCAA GGATTTTGAT CGTCCAGTTT CTGCTTTAAC TAGGCGCCAA ACATTTAAAA 4187 || |||||| ||| |||| | ||| ||| | | ||| ||||||||| | |||||| AATTACTCAG AGATCTTGAC CCGCCAATTT TGACATCAAC CAGGCGCCAA GCTTTTAAAG 136 ATGCCTTACA GCAAGGAGTA GTTGATTTCA ACACTATTGA TGTCACATTT GAAAATTTTC 4127 |||| ||||| |||||| || |||||| | | | |||||| ||| | ||| |||||||||| ATGCATTACA ACAAGGAATA CTTGATTCAA AAAGTATTGA GGTCTCGTTT GAAAATTTTC 196 CATATTACTT ATGGTAAAAA TTTGATCTTA CCCCCCTTCC CCCCCTCTTT CCAGTTTTCT 4067 | || || || | CGTACTATTT GAG....... .......... .......... .......... .......... 209 GGTTGGTATT GCTTTCTTTT CTGTCTTCTC TCGCACCAAA ATTATGTTTG TTCAAGTTGA 4007 .......... .......... .......... .......... .......... .......... 209 AATTGGCACT TGTGTATTTC TGGAATGTGT TCCATTGATT TCTATACTAC ATTTTTGTCT 3947 .......... .......... .......... .......... .......... .......... 209 GACAGTGAAA ATACAAAGAA TGTTCTGATT GCTTCCACTT ATATACACTT GAAGTGTAAC 3887 ||||| || ||||| |||||||||| ||||||||| | | || || ||||||| || .....TGAAA CCACGAAGAA TGTTCTGATT TCTTCCACTT ACGTTCATTT GAAGTGTCAC 264 GGGTTTGCAA AATTTGCATC AGATCTTCCC ACAGTGTGCC CTAGGATTTT GCTATCAGGT 3827 || || ||| |||| | ||||| ||| || |||||| |||||||||| | |||||||| AAATTCATAA AATATGCACC TGATCTCCCC ACGTTGTGCC CTAGGATTTT GTTATCAGGT 324 CCAGCAGGTA ACTACTGCTC CATAGAAATC ACTTATCTGG CTTTGTTCAG TTCTCCTGCC 3767 ||||||| CCAGCAG... .......... .......... .......... .......... .......... 331 CTCGAAATCA CATTTTGACT TGCTGCTCAC AGGTTCAGAG ATTTATCAGG AGACATTGGC 3707 ||||||| || ||||||| |||||||||| .......... .......... .......... ..GTTCAGAA ATCTATCAGG AGACATTGGC 359 CAAAGCACTT GCTAAGTACT TTTGTGCTAA GCTAATGATA GTTGATTCTC TCTTGCTGCC 3647 ||| || ||| || || |||| || ||| | |||| ||||| |||||||||| | || ||||| CAAGGCTCTT GCAAAATACT TTGGTGTCAG GCTACTGATA GTTGATTCTC TTTTACTGCC 419 TGGTGTAAGT GTCTTCTATT ACTTGGCTCA TGTGTCCTCA TTTGAGGGGC TCATACTTTT 3587 |||| TGGT...... .......... .......... .......... .......... .......... 423 TTACCACTGG TTACACCCAA TATGCCTCTT TATATTAGTT ATCTTGCTTC CTATCGCCTC 3527 .......... .......... .......... .......... .......... .......... 423 ATTTCATGGG AAGTTAGAAA TGTAGTTGAA AGTTTTGAAA GTTATACGAG AAATACGCGA 3467 .......... .......... .......... .......... .......... .......... 423 CAAGCTAAGA CTGTTCGGAT TACAGGCCAT TTCTGGTTTT CCAATTCCTT TTTTAGTAAT 3407 .......... .......... .......... .......... .......... .......... 423 AAATTGAATA GGAAAGTCTG TTTATTGGGG TTGAAAAAAG GAAACTTTGT ATCTAAAATT 3347 .......... .......... .......... .......... .......... .......... 423 GTGAGACCAA TAGCTCCTTT GCATAATAGC CAAATTTTTC TTTTTTCTAG TGTAACCATC 3287 .......... .......... .......... .......... .......... .......... 423 AAAAGATAAA CTATTACATA TATTAATGTT CATTTCATCT AACATTTACT GACAAAATAT 3227 .......... .......... .......... .......... .......... .......... 423 ATGCAAAATG GCTTGAAAAT AAGGAAAAAA CAGCTTTGAG TAAGGGGGGA AAGAAAGGAA 3167 .......... .......... .......... .......... .......... .......... 423 AACATTTTGG TTTGAGCCTT GGCCTCGAGA ACAATAGGTA TATTACTTCA ATGGGGTATT 3107 .......... .......... .......... .......... .......... .......... 423 ACAGAATGGG ACAGAAACTT TGGCTGCAAA TTTGTTAGTT TGCTTGTGTG AATCAACAAT 3047 .......... .......... .......... .......... .......... .......... 423 TTTCCTTGTT TCATTTGGGA TTTTGTCAGT ATGCTCATAT AAATCAACAC TGATCATCCA 2987 .......... .......... .......... .......... .......... .......... 423 CACTGCCCAC TGCAAGCCAG ATTAGAAGTA AATAGGGAAG TGCTTGACGG CTGGCTCAGG 2927 .......... .......... .......... .......... .......... .......... 423 GCAATGCAAT CTAAAGTTTT GCAAAATTTC TAATAATCAT ATGGATGGTG GTCAGACTAT 2867 .......... .......... .......... .......... .......... .......... 423 TGAACAGCTG CCCACCTGAG AATTATTGGG CAGTGTTAAG TGAAATGGGA GTGATTGTAG 2807 .......... .......... .......... .......... .......... .......... 423 TTTGAACATT TCTGTTATTG TTGGTGTAAG AACAGGTTAA TATGATCGAA TTTTTATTAC 2747 .......... .......... .......... .......... .......... .......... 423 CCTCCAATTC ATTTCCTTTT TCTTTAATAT TATGTAGGTT TCAAGTTCCA AAGATGTCGA 2687 | |||| | ||| |||| | || .......... .......... .......... .......GGG TCAATTGCCA AAGACATTGA 446 GCCTGTTAAA GTAAGCTCAA AACCAGAGAG AGCTAGTGTA TTTGCTAAAC GTGCGGCGCA 2627 |||| || | ||| |||| | || ||||| ||| ||||| |||||||||| |||| || || CTCTGTGAAG GAAAGTTCAA AGCCTGAGAG AGCAAGTGTC TTTGCTAAAC GTGCTGCCCA 506 AGCGGCAGCA TTGCATCTGA ATAAAAAGCC GGCTTCAAGT GTTGAGGCTG ATATAACTGG 2567 | ||| ||| | || || | |||| ||||| ||||||||| |||||||||| |||||||||| GGTGGCTGCA CTACACCTTA ATAAGAAGCC AGCTTCAAGT GTTGAGGCTG ATATAACTGG 566 TGGTTCAATT TTAAGTTCTC ATGCTCAGCC CAAGCAGGAG GCATCGACTG CCTCATCAAA 2507 ||||| | ||||||||| |||||||||| ||||||||| ||||| |||| | ||||| || GGGTTCTACA GTAAGTTCTC ATGCTCAGCC TAAGCAGGAG GCATCTACTG CATCATCCAA 626 AAACTATACT TTTAAGAAAG GTAATTGTTT CTTTGATAGT GAGAACTCCC TCTCTCTCTC 2447 |||||| ||| |||||||||| AAACTACACT TTTAAGAAAG .......... .......... .......... .......... 646 TCTCTATATA TATATATATA TATATATATA TATATATATG TCAGGCTAGA AAATATCCTT 2387 .......... .......... .......... .......... .......... .......... 646 TGTAGGTGGC AGTTTTGTGC TTAATGCAAA ACCGTCATAT TGACAGGTGA TAGAGTGAAG 2327 |||| ||||||||| .......... .......... .......... .......... ......GTGA CAGAGTGAAG 660 TACATCGGAT CTTTAACATC AAGCTTTTCT CCGTTGCAAT CACCTAT-AA GGTAACTGCA 2268 || | || | ||| || | |||||||| || || | ||||| | || | TATGTTGGTC CGTTACAGTC AGGCTTTTCT CCCTTTGCAG CACCTTTGAA G......... 711 GTAAGTCTGT TAATTTATTT TCCAGTGATA TTAGTTCAGT AAGCTCATTC AATCTCATAC 2208 .......... .......... .......... .......... .......... .......... 711 ATAATATGAT TGATGATGCA TTTTGAATCA CTTCTGAAAA CTTGCTGTTT GAATAATGAT 2148 .......... .......... .......... .......... .......... .......... 711 CTCTGTTAGG ATTGAAAAAC AAGCTATTGG TCTACTTGTT ATTCTCTGTG AATAAAATGC 2088 .......... .......... .......... .......... .......... .......... 711 TCTCTGTCAG GACTAACTTC TTGCTTAGTT CATTTTCTTG TGAACAAAAT GCTGTCTCTA 2028 .......... .......... .......... .......... .......... .......... 711 TTGAGATATG TCGGTCAGTC ATTTCTTTAT TTTAAAATCA CTATAACATG CATGATTGCA 1968 .......... .......... .......... .......... .......... .......... 711 GGGGTCCAAC ATATGGTTAC AGGGGCAAAG TGGTTCTTGC ATTTGAGGAA AATGGGTCCT 1908 |||| |||| |||||||||| |||||||||| || || ||| |||||| | ||| ||||| .GGGT-CAAC ATATGGTTAC AGGGGCAAAG GGGGTC-TGC ATTTGAAAGA TATGAGTCCT 768 CTAAAATTGG TGTCAGATTT GATAGATCAA TTCCTGAGGG TAATGATCTT GGTGGCCTGT 1848 |||| || || |||| || |||||||| | |||| |||| |||||||| |||| | | CTAAGATGGG GATCAG-GTT GATAGATC-A TTCCCAAGGG -CATGATCTT TGTGGGCCCT 825 GCGATGAAGA TCATGGGTTC TTTTGTGCTG 1818 || | | ||| |||||||| |||||||||| GCCA-GGAGA ACATGGGTT- TTTTGTGCTG 853 hqPGS_C09HBa0099P03.1-1-_SGN-U347100+ (4246 4114,3941 3820,3734 3643,2709 2487,2340 2277,1966 1818) ******************************************************************************** EST sequence 5 -strand 1077 n (File: SGN-U340386-) 1 CAACCTGAAA AGGGGGGGAT TAGTTTAGGG TAGGTCCATG TCTTTTTTTT TAAGAATTTG 61 CCGGTGACCC CAATAAAAGA CCTCCTTTAT TTTTTTTTTA CCCTCATGCA CCCTGGTTTT 121 AAACAAATTC CAGTTCCAGC CGCCGTATCC TTTGGCCCCT TTCCACCTTT AAGAGGCTAG 181 GGTTCATCTT TCTTGGTCTT CTCCTCCTAA GAGATCACCC CCCCTACCCC TTTTTTAAAG 241 GTAAGATTTA AAACCAGAAC CCCAGCAATA AGCTTGTTTT GTACAGAGCA CTAACACTAT 301 GTTTGGATCA TTGTTATCCA TTGTATTGTA TCGTATTGTT ACTATACCTA TAATGTTTGT 361 TTTGAGTGTT ACTTAAAATG TATAGTACTG TATTGTTAAA TTTCGTTGTT ACTTAACAAT 421 GAAACCCCTG TTTTATGGAC AACCAATTTG GTGTGTTCCA ATTGTTACTT AATTTCTTTT 481 TTCAATTATA TCTTTATATT ATATTTTTAA ATACTATTTT ACCCTTTACC TTAATTATTT 541 AAATCTAGTC AAACCTCCTA CCCTAGAATA TTTAAGGACG TTTAAGTAAA TTTATAAATT 601 ACAATACAGT ACGCTATAAT CAAACAAAGC AATTAAAATG TTATTAAACA ACAACAAACA 661 ACGCAATCTA GCCAAACATT GTATATATCA TACAATACAG TACAATACAA TACAATACAT 721 TATGAAACAA TGAGTAACAA TGATCCAAAC AAAGAGTAAA GGTAAGCAAA AAAAGCAAAA 781 GTATATCCCT TTTCTAATCT TGAAGGCATT CTATGAGTTC TAACACAGAT TTTGTCTCAT 841 TTAGAAGTGT TATTTTACAA GACAAAAAAT GCAATTGACT TGTATCGTCC AAGAGGAGTT 901 GGATTTGGCT TTAAAACATC TTAATCGAGG AATGTAAAAA TTGTCTGACT AAATAGGCAA 961 TATGAGCTAT TCAAGATGAA TGAAGGAGAA TCCCTTAAAA GAGATGTTTA ATAGATCCAC 1021 TGACATCACC AATATTCCAC CCAATAAACG TTCTCCTCGT GCCGAATTCC TGCAACC Predicted gene structure (within gDNA segment 22660 to 9385): Exon 1 21610 21603 ( 8 n); cDNA 281 288 ( 8 n); score: 0.750 Intron 1 21602 15381 (6222 n); Pd: 0.998 (s: 0), Pa: 0.000 (s: 0.87) Exon 2 15380 14945 ( 436 n); cDNA 289 744 ( 456 n); score: 0.733 Intron 2 14944 10843 (4102 n); Pd: 0.245 (s: 0.64), Pa: 0.000 (s: 0) Exon 3 10842 10833 ( 10 n); cDNA 745 754 ( 10 n); score: 0.800 Intron 3 10832 10592 ( 241 n); Pd: 0.135 (s: 0), Pa: 0.559 (s: 0) Exon 4 10591 10565 ( 27 n); cDNA 755 781 ( 27 n); score: 0.667 MATCH C09HBa0099P03.1-1- SGN-U340386- 0.733 481 0.447 C PGS_C09HBa0099P03.1-1-_SGN-U340386- (21610 21603,15380 14945,10842 10833,10591 10565) Alignment (genomic DNA sequence = upper lines): GTCCAAAGGC ATCAATTGAT GTTTTTTCCT CTCCAAAGAT AGAGTTTATG GTAGCGCTCT 21551 || || || GTACAGAG.. .......... .......... .......... .......... .......... 288 TACCCACCCC AGATTTGCCA ATAACCAGGA TATTCACAGA GAAGTCCAAA TCATCTTTCC 21491 .......... .......... .......... .......... .......... .......... 288 CCTCTGCTTC AAGCTGGAAA GCTTTCATCT TGGCAGCCTC AACGCTAAAA AGTGGACTGT 21431 .......... .......... .......... .......... .......... .......... 288 TTTGCCTTCG TGCAATAAGT GTCATCCGGT ACAAAACTTG TGCAGCTATG GGCTCATCAG 21371 .......... .......... .......... .......... .......... .......... 288 AAGATAGACC CAACCTGTGA ATAAGCCTCA AAAACTTGAC CCTGATCTGC TGTAGTGTGT 21311 .......... .......... .......... .......... .......... .......... 288 CCAGTTTCTT CTTCTCTTCT TCACTCAAGT TGTTCTCAGA TTCTCCACTG TTCTGGAGAT 21251 .......... .......... .......... .......... .......... .......... 288 TGGAATGAGT AAAAAGATTG GGTTGGCTTG GTCGGGGAGC TGGCCTCAGT GACCGGAGTG 21191 .......... .......... .......... .......... .......... .......... 288 ATGACCCAAG ACCAGCAGGA CGTTCAACAG AGAACAGTCT TGATCCATCT TGAGACGTGA 21131 .......... .......... .......... .......... .......... .......... 288 CTGTTATGTT GCCACCATCA GAATCACCAC CTGTTGCTGC TTTTAAAAGA GCAGCTAAGG 21071 .......... .......... .......... .......... .......... .......... 288 CAGCTGAATC AAACAACTCC TTCACATCTC CTTCTTCATC AGTATCAGCC TCCTCATCTG 21011 .......... .......... .......... .......... .......... .......... 288 AGTCGGTGAC AATCTGACCA TCAATATCCT GAGAAACCTC AGCACCAGCA TAGGAGCCAC 20951 .......... .......... .......... .......... .......... .......... 288 CACCAGATTC CCTTTCCAGC TCCTCCATAA ACTGTTTGGC AGCTTCAGAG CTTCCAAAAA 20891 .......... .......... .......... .......... .......... .......... 288 TCATACCATC TGTCTCTCCA TCTGAAACTG AGCCTTCAAG GTTAGCTTCT TCATCTATAT 20831 .......... .......... .......... .......... .......... .......... 288 GATCTTTAGC TTCTCCTTCT TGCTCTGAGC CAGTGATTGA CCTTGATATA GCAGATGTTT 20771 .......... .......... .......... .......... .......... .......... 288 GACCAGAGAC TTCAACTTCC ACTACATCTC GCTCGACGCT TTCTCCGGAA TGGTCACCAT 20711 .......... .......... .......... .......... .......... .......... 288 TGGCATATAT AGTTCCAGAC ACTGCCTGCT CAGGTTCCAC TTCAACATCC TTGGTGTCCA 20651 .......... .......... .......... .......... .......... .......... 288 GACTTTTATT AACAGTTTCA GGATCAGCTT CTTTAATTTC CTCAGTAACA GCTACCACAT 20591 .......... .......... .......... .......... .......... .......... 288 CACCAGTCTG AACAACTCCA GAATCAAGTT TTCCAGCACC AACAATGTTA TCCACAGGCT 20531 .......... .......... .......... .......... .......... .......... 288 TTGATTCTGC AGTTGCAACA GTATCATCCA CCACAGGCTT AGGGTTCTGT GCATCTACTT 20471 .......... .......... .......... .......... .......... .......... 288 CATCAACTGT CATTTTAGCC ACTTCTTCAA TAAGTTGTCT GGTCTCACCA ACATCATTTA 20411 .......... .......... .......... .......... .......... .......... 288 CTGATGTCAC ATTTTCATCA TGGGTACCTT CAATATGTTC TTCCACTTCC TTGCTCTCGT 20351 .......... .......... .......... .......... .......... .......... 288 CCACATCTCC AACAACAGCA ACCCCTGGCC CTGAGACATT GACTTCAATA GCATCCACAA 20291 .......... .......... .......... .......... .......... .......... 288 CTGCATCTCC TTCGGAAGTA AATTTCTTAC TACCTGTTTC AGCCAACAAA GATTCCGACG 20231 .......... .......... .......... .......... .......... .......... 288 GTTGTTCAAC AACCACCTTA TTGAGTTTTT CGGCATCATC ACTATGTATA GCTCTCTCTT 20171 .......... .......... .......... .......... .......... .......... 288 CAATGACCGA GGTAGGTTCT TCGGCCTCCT TCAAATTGGA TGAAGCAATA GCAACCTCTT 20111 .......... .......... .......... .......... .......... .......... 288 CAAAGACCGA GGTAGGTTCC TCAGACTCTT TCAAATTGGA TGAAGCAATA GCACTCTCTT 20051 .......... .......... .......... .......... .......... .......... 288 CGATGACCGA GGTAGGTTCC TCAGCCTCCT TTAAATTAGA TGAAGCAACA CTAGTTAAAG 19991 .......... .......... .......... .......... .......... .......... 288 CTTCATTCTT TTCATCCTCC ACAATCTCCT TCACCTCCTC CTGAATGGAC CTATCCTCAC 19931 .......... .......... .......... .......... .......... .......... 288 CCCCTTGCAA TTCTATCTTT TCATCCTCCA CAGTCTCCTT CACCTCCTCC TCGATAGACC 19871 .......... .......... .......... .......... .......... .......... 288 TATCCCCGCC CTCTTTCGGT TCCATCTTTT CATCCTCCAC AGTTTCCTTC ACCTCCTCCT 19811 .......... .......... .......... .......... .......... .......... 288 GAGTGGACCT ATCCTCACCC TCTTTCAATC CTGCACCAAC CGCAACCTCA CCACCATTAT 19751 .......... .......... .......... .......... .......... .......... 288 CTTTCACCTG CTCAATTGAA TTCAACTTAT CAACCGAGTC AAACTCTTCA GTATTACCCT 19691 .......... .......... .......... .......... .......... .......... 288 CCGAACCCTC GATTCCCGTC CCCAGTGATT TGGTAGCATC AGAGTCAGTA GAATCAGGTA 19631 .......... .......... .......... .......... .......... .......... 288 AACTATTCCC AATAACAGGA ACAGAACCAC CAACATCTCC CCCTAAAGCC TCAACATTTT 19571 .......... .......... .......... .......... .......... .......... 288 CAACACCCTC ACTCATAGAA ACCTCAGAAA CAGGCTTCTC AATTTCCGCA GAACCCACAA 19511 .......... .......... .......... .......... .......... .......... 288 CGGAATCATC CAACTTCTCT TCCCCAATAG TTTTTTCTAG GGTTCCCTCA TCTGGGTCAG 19451 .......... .......... .......... .......... .......... .......... 288 CAGGAATTGG CAATTCTTGC TGACCACCCA CAATAGTTAC ATCACTAGCG CTTTTACCCT 19391 .......... .......... .......... .......... .......... .......... 288 CACTATTGCT ATTACTATTA ATATCGGAAT CATTGATTTC TACATTAATT TTGGAGACAT 19331 .......... .......... .......... .......... .......... .......... 288 TTTCAGTTTC AGTATGATTA GAAGAAGAAT TGTTGATGGG AGAAGAACCT GGAGAAGAAG 19271 .......... .......... .......... .......... .......... .......... 288 AAACAGCAGG AGGCGAAAAC GTCGCTTCTT CAGAATCCAT GGCGCCCAAC AAACAACAAC 19211 .......... .......... .......... .......... .......... .......... 288 AACTCCGCTA CTCAACTGTG AAGAATTGAA AAAAGTGTAA AGATAACAAA GTTCTCTGTG 19151 .......... .......... .......... .......... .......... .......... 288 GAGAGAACAA AGAGGGGTTT GGTTCTTGTG AGATAAGAGT AGACGATAAG GTTTATACTA 19091 .......... .......... .......... .......... .......... .......... 288 CTAGCTGCAT TTTGGTATTG GCACACTATG GGCTTTGGGG TTTTTGAGAT TTTTCGGGTA 19031 .......... .......... .......... .......... .......... .......... 288 TTTCGGTTCG TGAATTAAAT TATTCGAAAT TTTTATTTGA TACACGTTAA TTAAATTTAA 18971 .......... .......... .......... .......... .......... .......... 288 ACTATGCATT ATAAACTATA TTTGAGAATG ACGTTTTCAA TAAAAAAAAT TTCATATTCA 18911 .......... .......... .......... .......... .......... .......... 288 ATATAATATA AGATATATTG AGATTATAAA ATTTGAGGTA TAAAATTATG AAATTTATCA 18851 .......... .......... .......... .......... .......... .......... 288 TTAGTAAAAT GATTAATTAT ATTATTAATT AAATAAAGTA TATGATTTCA ATTAATTTTG 18791 .......... .......... .......... .......... .......... .......... 288 ATTACTTGTA GAAAATGTGG AAGAAAAAGA ATTAACTAAA AATACTAGAA ATGTCTGTTA 18731 .......... .......... .......... .......... .......... .......... 288 AATTTCACAG ATTATTTATT ACAAAAGAAA TTATATTAAA CACCAAAAAA GATACTTTTT 18671 .......... .......... .......... .......... .......... .......... 288 TACTTCATAT TGTAAATTAA TGAATAAAAA ATATAGTTCA AAATAAGTAA ATTTTTCGAA 18611 .......... .......... .......... .......... .......... .......... 288 ATTTTAGAAT ATTATTAACA CTTTTTGCTA ATATTTATTC TTAATTTAAT GAGATTGTTA 18551 .......... .......... .......... .......... .......... .......... 288 ATTATGTTTC TTTTTCTAAT AGAATAATTC GAAAATAATC AAAAAGATAA TAGTAAAATA 18491 .......... .......... .......... .......... .......... .......... 288 TAACTTTTTA TTTATGTTCT TAAATATTTT TTCGATAAAT GTATCAAATC CATATCATTC 18431 .......... .......... .......... .......... .......... .......... 288 ACTTTTAATA TGAACTAGAA AGAGTATATA TAATGGCCTA TTATTTTTAA ATATTTTCTG 18371 .......... .......... .......... .......... .......... .......... 288 AATGTTTTTA ATTATAGAAA TCATGATCTA TAGTAATTTT ATTTACTTAG TGAGTGTGTT 18311 .......... .......... .......... .......... .......... .......... 288 ATAAAAAAAA AAGATTGAAC TATTAAGGAA TGAACATAGA ATTTTCTCAT TGGTTAGAAA 18251 .......... .......... .......... .......... .......... .......... 288 TTTTGACAAA ACACCACAAA AGGGTACACT TCTAAGTGTT ATATAAGTTT TGAAGAAGTG 18191 .......... .......... .......... .......... .......... .......... 288 AGATGAATTT CAAATATAAA AAGTGAATTT AAATCAAATA ATAACTTCAA AAAAATTATT 18131 .......... .......... .......... .......... .......... .......... 288 CTTCATTTTG AACATATGTG TCTTAATATT AAAAAACGCA TGCATATACT TTCTTCGTCC 18071 .......... .......... .......... .......... .......... .......... 288 AATTTTAATT ATTATGATTT CTTTTTTAAA TCAAATATTA AAAACTTTAA CTTACATGTT 18011 .......... .......... .......... .......... .......... .......... 288 ACGATATATT CTTTCATAAT ATTAATTTGT AAAAAGTTGT AACTTATAAT ATCTTTTAAT 17951 .......... .......... .......... .......... .......... .......... 288 ATATTTTTGA ATATTTAATT CTTTTTTTTA AAAAAAATAT CAAATCAATA TGATCTAATT 17891 .......... .......... .......... .......... .......... .......... 288 TAACTTCAGA ATTAGTCAAA ATCCATTTTC GTAAAATATA GTATGACAAC TAAAATAAAA 17831 .......... .......... .......... .......... .......... .......... 288 TAAAGAAAGT AGAGGTTGGG CCTTAGAAAA CATTAATAAC AAACCAAAAA AAACATTAAT 17771 .......... .......... .......... .......... .......... .......... 288 GACAAACCAA ATTTCCAATG TATATGTCGA AAGTCTACTT GTAGAAGAAA AAATGCCTAA 17711 .......... .......... .......... .......... .......... .......... 288 CTTCAAAAAT AAAATTTATA ACTTTGTTTT GAAATGGAAA AAAAAATGTG ACTTGTGAAT 17651 .......... .......... .......... .......... .......... .......... 288 TAAATGCACA TTAGGAGTTT GAATTTGACA TGAACTAAAG CCTAATGAGG TAGTAGGACA 17591 .......... .......... .......... .......... .......... .......... 288 AACTTAAGTT AGGTTATTAG CATTTTAATA TTAATATTTA ACCTATTAAA TTGCTTTATT 17531 .......... .......... .......... .......... .......... .......... 288 TTGAAGTTAT AAGAATGGAT GACTTCTACA TCATCTATTA GCATTGGTTA TCCATCAATG 17471 .......... .......... .......... .......... .......... .......... 288 TATTTCATTC TTAATTCCTC CATTACTTCC TTTGTAACCT ATGTAACATA TGTAACTCCC 17411 .......... .......... .......... .......... .......... .......... 288 TTTGTAACAT ATGTAACTCC CATTGTAACC TATGTAACTC CTATTGTAAC CTATGTAACT 17351 .......... .......... .......... .......... .......... .......... 288 CCTAAGGTCA TTATCTTCAC TATTTACTAT CTCGTAGTAT AAATAGTGGT AGTCTTCATT 17291 .......... .......... .......... .......... .......... .......... 288 TGGTTTTAGA GACACACAAT TCATAAGAGA AAACAAAGAG TGAAAGAGTT AGTCTAAAGA 17231 .......... .......... .......... .......... .......... .......... 288 GAGTTCTTAT TAGTTGAAGG GAGGTGTTCT TTTTTTGTGG AGCTTTGGAC TCAACTCCTG 17171 .......... .......... .......... .......... .......... .......... 288 TCCAGAGTTG TTGAGTTATA CTTTGTAAAG GCTGTTGTAT CCTGGAGGGG ACAAGTCAAA 17111 .......... .......... .......... .......... .......... .......... 288 GAGGACTACT GCTGGACCGG TGAAAACATT TGCTGCAGTG GGCTTGAATC TCCTTAAAGA 17051 .......... .......... .......... .......... .......... .......... 288 GAGCGAGATA TCCGCGCCTC AGCCTGAAGA GATTACTTTC TTCATTTTAT TTTCAATTGT 16991 .......... .......... .......... .......... .......... .......... 288 AATCTTGCAA TTTTATTATC TTGTAAGTTT TTTTTCACTA ACACTTAATT TATTATTTTA 16931 .......... .......... .......... .......... .......... .......... 288 CACATACTGA AGTTTTGGAT TTAGTTTAAG CATAAATTTT ATTAGTAACT ATTTAGTCAA 16871 .......... .......... .......... .......... .......... .......... 288 TATTTAAAAA GTGCATTTAG GAAAAATATA GTATTAAAAA AAAACAATTT TGCATGCTAA 16811 .......... .......... .......... .......... .......... .......... 288 TTTATGAATC ATTTTTTTTT CTTATTAAAA ACATGTTCAA CGTCTTTAAT TCTTTGAAAT 16751 .......... .......... .......... .......... .......... .......... 288 AAATAGTTTT TTTTTTACTT TGAAAATTTT GAAAATCGAA CAAAGTATAA CAAGATAGAG 16691 .......... .......... .......... .......... .......... .......... 288 TGGTTGTGAT GGATTTAGGA TGGGGCGAGA GTATTCACCT GAACCCTTTT GATAAAAAAT 16631 .......... .......... .......... .......... .......... .......... 288 TAACTATTTA TATAAAGTAT TTTTTTTTCA AAATTTTAGT GAATATATAG GATTTAAATA 16571 .......... .......... .......... .......... .......... .......... 288 CCTAAGCATA AGACTGAAAG TTACCTTAGT AATTTAGGGA CTTCATAATT CATTTTTAAA 16511 .......... .......... .......... .......... .......... .......... 288 TAGATATATC ATATTTATTT TTCTAACCTA CTTAATAAAA ATTCTGAATC AGTATTTAAA 16451 .......... .......... .......... .......... .......... .......... 288 AATACAATTT AATTTCACAA ACTTACTGAA TAATTTGAAT TGCATATTTG TTTGGTTGTA 16391 .......... .......... .......... .......... .......... .......... 288 ACTGCAGGCT TTGGATGGAA TGATGATAAA TGATAATATT TTTGGTCTGA CTAAGTTACT 16331 .......... .......... .......... .......... .......... .......... 288 TTTTTTAAAA AAAAATTTTT ATAATATCTT GAGGGATTTT AATTCCCCAC AAAACCCTTT 16271 .......... .......... .......... .......... .......... .......... 288 TTTTTTTTTT TTAGAGTATA TTCCATTTGC TTTTTAAAAT TTTGTCAAAC ATTGGGTTTT 16211 .......... .......... .......... .......... .......... .......... 288 TACTGATTTT ATGTGGACCT CAACTTCCTC ATCTTTAATG TATGGCCTTC TCTTTTTTTA 16151 .......... .......... .......... .......... .......... .......... 288 GGCTTCCATG ATATTCCAAT TCCCTAAATT ATCTATAAAA CTATATATTA TATATTTTAA 16091 .......... .......... .......... .......... .......... .......... 288 ATAATTTGAT TTTAAATTTT AAAAAAAGAA TTTAGCAATT GATCTTATGT TTAGTGAACT 16031 .......... .......... .......... .......... .......... .......... 288 CTATAATTTT TGGATATGAT GATATGAATA TGTACTTATG TCCTCCTAAA AATTATTTTA 15971 .......... .......... .......... .......... .......... .......... 288 TTTGTCTCAA TTTGTATGAC ATAATTAAAT TTTTGAGAGT TGAATGATTT AGTTATATCT 15911 .......... .......... .......... .......... .......... .......... 288 GTGAAATTAG ACATGTAATA TGTAAATTTA TCAAAGTAAA TTTTACATAT TTGAAAACTA 15851 .......... .......... .......... .......... .......... .......... 288 CGTATAAATA CTATAAGACA TTATAATTGA TATTTGAAAG AGGGAAAATG GTCGGAAAAA 15791 .......... .......... .......... .......... .......... .......... 288 TACTTCAACA TTGGCCAAAA TAGTTGTTTC GATATCAAAT TTTATGGAGA ACCTTTTACC 15731 .......... .......... .......... .......... .......... .......... 288 TCATGCACTT TTTAAAAGTA TATTTTAAAG ATATATATGT GCCCACGTGG ACACATTACT 15671 .......... .......... .......... .......... .......... .......... 288 TTTCAAAATG ATGCAATATT TATGATGTTC ACGTGGACAC ATATATATCT TTAAAATACA 15611 .......... .......... .......... .......... .......... .......... 288 TTTATTAAAT AGTGCAAGGG GTAAAAGATC ATGCATGAAG TTTGTATTGT AGCAACAATT 15551 .......... .......... .......... .......... .......... .......... 288 GTGGTCAAGC TTCGAATATA TTTAAGACCT TTTTTCTTTT AAAAATATAT AAAATTATTG 15491 .......... .......... .......... .......... .......... .......... 288 AAATCTCAAA ATTTGCAAAG TATCACATAA CTTAGGACAA AAAAAAAAAT ATTTGTCACG 15431 .......... .......... .......... .......... .......... .......... 288 AAGGCATAAA TTAAAAACCT AACACAAAAT ATGGGTAAAA GTGCAAGTGA CCCTAACACT 15371 | |||||||| .......... .......... .......... .......... .......... CACTAACACT 298 -TGTTTGGAT CATTGTTATC AATTGTATTG TATTGTATCG TTATTATACC TACAATGGTT 15312 ||||||||| |||||||||| ||||||||| ||| |||| | ||| |||||| || |||| || ATGTTTGGAT CATTGTTATC CATTGTATTG TATCGTATTG TTACTATACC TATAATGTTT 358 GTTTTGATTG TTACTTAAAA TATATTGTAC TGTATTGTTA AATTTCGTTG TTACGTAACA 15252 ||||||| || |||||||||| | ||| |||| |||||||||| |||||||||| |||| ||||| GTTTTGAGTG TTACTTAAAA TGTATAGTAC TGTATTGTTA AATTTCGTTG TTACTTAACA 418 ATGGAAATCC CTATTTTATG AAACAACTAA TTTGGTGTGT TCCCATTGTT AC-T-A---- 15198 || |||| || || ||||||| ||||| || |||||||||| ||| |||||| || | | AT-GAAACCC CTGTTTTATG -GACAACCAA TTTGGTGTGT TCCAATTGTT ACTTAATTTC 476 --GTTTC--T TA-ATCTTTA CATAATATTT CAAAATACTA TTTTACCCTT TACCTTAATT 15143 |||| | || ||||||| || |||||| |||||||| |||||||||| |||||||||| TTTTTTCAAT TATATCTTTA TATTATATTT TTAAATACTA TTTTACCCTT TACCTTAATT 536 ATTTAAACCT AGTCAAACCT CCTACCCT-G AA-A--T-A- -A---TTAAG GATATTTATA 15093 ||||||| || |||||||||| |||||||| | || | | | | ||||| | ||||||| ATTTAAATCT AGTCAAACCT CCTACCCTAG AATATTTAAG GACGTTTAAG TAAATTTATA 596 AATTACATTA CTGTATGATA CAGTCAAACC AAACAATTAA AATGTTACTA AACAA-AACA 15034 ||||||| || | ||| | || | |||||| || ||||||| ||||||| || ||||| |||| AATTACAATA CAGTACGCTA TAATCAAACA AAGCAATTAA AATGTTATTA AACAACAACA 656 AACAATACAA TCTAACCAAA CATTGTATCT ACCATACATT ACGATACAAT ACATTATGAA 14974 ||||| ||| |||| ||||| |||||||| | | |||||| | || |||||| ||| || | AACAACGCAA TCTAGCCAAA CATTGTATAT ATCATACAAT ACAGTACAAT ACAATACAAT 716 ACAATAGATA ACAATAATCC AAACAAAATG TAAAATATTT TCTTGAACGG AAAGAAAAAA 14914 ||| || | ||||| | || | || ACATTATGAA ACAATGA-GT AACAATGAT. .......... .......... .......... 744 TCTTCAACTT TCCCCCTTTT TGGGCTTTCT AAAGAAAACA TCTTCACATT TTAATTGGGC 14854 .......... .......... .......... .......... .......... .......... 744 TTTGCAATCA TGTCTTTTGG TTTGGGCCTT GCATCGTGAG GCTTCTTTTC TTTTATAAAA 14794 .......... .......... .......... .......... .......... .......... 744 GTGACTTTTG ATATAAAATC GAAACATTTT GTGAGGTAGA TTATTTTGAC ATTATTATTT 14734 .......... .......... .......... .......... .......... .......... 744 AATTTATATT TTTTTAATGT ATATTGTATG CACATTTTAG ACACTTTATT TGACATTAGG 14674 .......... .......... .......... .......... .......... .......... 744 TTTTGGTACG TCTATTGTCT AATACTGCAT ACGAGTGATG TTTATGCATT TATAACTGAA 14614 .......... .......... .......... .......... .......... .......... 744 GTTTTAAAAA AAAATGGTTG AATTTTAGTC ATATATAATC AAAGTATTTT TATCTATCAA 14554 .......... .......... .......... .......... .......... .......... 744 TTGTTGCTTG TACTGTTATC GCATGTATAT AAACTTCATA CATAAATAGT TGAAATTAGA 14494 .......... .......... .......... .......... .......... .......... 744 CAAGTTATTT TTCCAACTCT TTAATAAAAT CAATACGCTC CGACCATAAC ATGCTGGATA 14434 .......... .......... .......... .......... .......... .......... 744 CAATAAGGTA TCATCACCTA AGTCAAGTTT TAAAAATATA TCATAATCTA CCTTAAATGT 14374 .......... .......... .......... .......... .......... .......... 744 CAAACAAAAA TATAACAAAC TTTCATAAAT ACATATTTTC TTTTTGAACA GATTCAAAAC 14314 .......... .......... .......... .......... .......... .......... 744 AATAATTTTC ATTTTAAAAT ATCATTCATA CACCATAGAT GCTAAGTCTA GAAATAAGTT 14254 .......... .......... .......... .......... .......... .......... 744 AATTTCATCC TGAGTTAATA CTCCAATAAA AAAAATCATT TCTTCATCCT CAAACTTAAA 14194 .......... .......... .......... .......... .......... .......... 744 ATCTCACATT TATACGATCA TATCATGGAT AATACGTTAA CAAATTTTTT TAAAAAAAAT 14134 .......... .......... .......... .......... .......... .......... 744 CATAAAATGA GATATCATTG AAAATCTTTC TACCTCACAA AAATCGATGT TTAGTGAATG 14074 .......... .......... .......... .......... .......... .......... 744 ATATGAAATT AGTTTTTTTA TATAGAGAGA TCTTCACATT TGCGATAAAA CTTTTGTAAC 14014 .......... .......... .......... .......... .......... .......... 744 TACGAGTCCA TCAGAAGAAC AATACTTCAC ACTAAAAAAG CATAACTTTT TCGTACAATT 13954 .......... .......... .......... .......... .......... .......... 744 TCAACAATTC ATTTCGCTAT GATTTTGAAA TTACTATGCA AAATTAATTT TTTCAATCAA 13894 .......... .......... .......... .......... .......... .......... 744 ACACAATTGG GAGCTTACTT GCTTCAGGGT ACCATTTGCT AAAACTAGGT CAAAACATCA 13834 .......... .......... .......... .......... .......... .......... 744 AATTTTTAAT CAAGCACTCA AAATTCATTT GAGACCTCCC AAAATATATA TCAAATATGT 13774 .......... .......... .......... .......... .......... .......... 744 AAATCAATCA TGTTTTTATG AACATGCTGA AGTCATCAAA TCTTCTTTTC GAGGTCATGT 13714 .......... .......... .......... .......... .......... .......... 744 TATTCAAATA TGGATCAGGG TCGATTCTAG ATTTTAAAAA AAATTCACTA AAAATCAACT 13654 .......... .......... .......... .......... .......... .......... 744 TAATGATTTA AACCACTCTT TGGAGTGTCT GGAAAATCAT CCTTCAAACC TAATAGAGCA 13594 .......... .......... .......... .......... .......... .......... 744 TCAAAACTCA AATTCAAATG CATTTAATCA AAATATATAT TGGCTTTTGT CAAACTTCTC 13534 .......... .......... .......... .......... .......... .......... 744 GTTTTCAAAC TTTTTGAAAT AAAATTTCTT CTCCAAAACA CATCTAATCA CTTCAAAAAC 13474 .......... .......... .......... .......... .......... .......... 744 TATGTTGCTC GTACGTGTAA GTAGTAACTA ACATAATAAA ACTATGACAA ATATCAAAAT 13414 .......... .......... .......... .......... .......... .......... 744 TGAAGAAAAA TATATATTCA ATAGCATAAA ACGATCATAC ATATCGCAAC AAAATCTAAA 13354 .......... .......... .......... .......... .......... .......... 744 AGTTATAATT AAGATCAAAG AAATTTCAAC TCTTTCGTCG ATCAACCATA CCACTCCTCA 13294 .......... .......... .......... .......... .......... .......... 744 ACTAAAATAA TTTTACAAAT TATGATGATT ACTTTAATAA TTATAGTCTA AGAAAAAGTC 13234 .......... .......... .......... .......... .......... .......... 744 TTACAAATGT TACAATTAAC TTTATTTTCT ATTTTTGTTA TGAATATGAA TAAACGATAT 13174 .......... .......... .......... .......... .......... .......... 744 CCAATCAATC TCTTTTCGAT GATAATAAAA ACAAAACATA TATTATTCAC CGATGATGAA 13114 .......... .......... .......... .......... .......... .......... 744 AAGAGGAATT TCACATAATC ATTCTAAATT GTTGGTCACA TTAGAAAAAA AAATACATCA 13054 .......... .......... .......... .......... .......... .......... 744 CAAAATACAA TTCTTCCTTA ATCTTTCACA CAGAGCAAGT TGTAATACTA ATAGTTTAAA 12994 .......... .......... .......... .......... .......... .......... 744 AATAGGTTTC AATATACAAT GATTTTGACC ATAATTAATA TTATGTATCA TAATATAATG 12934 .......... .......... .......... .......... .......... .......... 744 TATAATATTG TATGCATGTT TATACTAAAC AAGAGGAAGG AAAGTACAAA GAATAAAATG 12874 .......... .......... .......... .......... .......... .......... 744 AAGTAGTTAT ACTAAAAAAT AAATAGGTAT GAGAATCTTT TAATTTACAA AATTCAAAAA 12814 .......... .......... .......... .......... .......... .......... 744 ATAATTTTAA TTTATATAAA TTTATTGTTG GTTGTTTCTT CCTTAACCAT TCATTGTTCC 12754 .......... .......... .......... .......... .......... .......... 744 CAAGGAATAG CAAAAATAGA ATAAAATAAT GGTAAAAGTG TGTGGTTAAA TAAAGGCAAA 12694 .......... .......... .......... .......... .......... .......... 744 AAAAATAATT AAAATTATAA AATAATAACA GAAGTGTATA TAGTGCCCAA TAGTTGTGTT 12634 .......... .......... .......... .......... .......... .......... 744 TTCCATCTTT AACAACAAAC AAACTAATAA AATATTCCAA AAGAGGATAA GACAATAAAA 12574 .......... .......... .......... .......... .......... .......... 744 ATTGGATACA ATATAATATT TTAGCAAATA GAATTTTTTT TAACTAAAAT AATAATTCTA 12514 .......... .......... .......... .......... .......... .......... 744 TATTAAAATT TAGGATAAAA AATTAAATTT TTATTTAAAT TATAATAAAA TAAGTGTATC 12454 .......... .......... .......... .......... .......... .......... 744 ACAAATTTGT GTAAAAGATA AGAATAATAT AATCATAGAT TGAAGTTGGA AGTTAAATTG 12394 .......... .......... .......... .......... .......... .......... 744 TTCGTGTTAA TTACTAACTT CATTTCAAAC AATTTGATAA AAGAGAATGA GAGATATATA 12334 .......... .......... .......... .......... .......... .......... 744 AGATTATTGA TAACAAAAAA AGAGGTGGAT ATGAGATAAT TAAAACTTAA ATTTAACTAA 12274 .......... .......... .......... .......... .......... .......... 744 GGTTAAATTA TAAATTTAAT TAAATTAATA AAGTTAAAAA TATAATTTCA AAATTGAATA 12214 .......... .......... .......... .......... .......... .......... 744 GCAAAGACAT TCATTACTAA GAATGTCAAG AGTTAAATTG CCCAACTTTT TAATAAATGA 12154 .......... .......... .......... .......... .......... .......... 744 AAACTTTTAT GCAGAATTTT AGAGATATGT ACCAATGATT CTTTCTAAAG AAATAGATTA 12094 .......... .......... .......... .......... .......... .......... 744 AGATTTCAAT TGAAAATTAT ATGATCGATT GAGTTCAATT GATTCGTTGA AGAAGCAATG 12034 .......... .......... .......... .......... .......... .......... 744 TTATTAAAGA ATTCAATTCT CAATAAAATG TTAAGTTCAA TGTATTATAA CATTATTTTT 11974 .......... .......... .......... .......... .......... .......... 744 GTGACTCGTT AAAACGTAAG TTTATGAATT TTCTTTTCAC TCAATATCAG TATGAGCATA 11914 .......... .......... .......... .......... .......... .......... 744 AATTTTTTGT GCTCAAGCAT AATTATTTTT TATTAGCTCT CTAATTATCT CTTTACAAAT 11854 .......... .......... .......... .......... .......... .......... 744 TTATCCTCGT AACTAACAAA CAAAACCTTC TAATAACAAA TCATTAATAA TATATTTGAT 11794 .......... .......... .......... .......... .......... .......... 744 AAAAATTGCA AAATTTCAAA AGCTACAAAT GTTTAGGATG CAAAATGCTA AGTATTTATT 11734 .......... .......... .......... .......... .......... .......... 744 AAATTTTGAT GACTAAAAAA ATGTCTTAAA TATACTAACA ACCTAAACCT ACGTCCTTCA 11674 .......... .......... .......... .......... .......... .......... 744 ATTATCCATA TGAACAAAAA TTATTTTAAA TTTGTTTAAA ATGTGATAAT TTACGTTATA 11614 .......... .......... .......... .......... .......... .......... 744 TCGTTTAAAC ATATTAATGA ACTAACAATT CCCATCAGTT CTCCAACGAA GTCTCCAAAT 11554 .......... .......... .......... .......... .......... .......... 744 TTTACATTAT TAATCACATT TTTTGCACTT AGACACACAA AGGAATTAGT ACTAGTGATA 11494 .......... .......... .......... .......... .......... .......... 744 CTCGAGAAGA TATTAAAAGT GATTTATATA CAATTATCAA ATAAACATGA ATCAGAAGAT 11434 .......... .......... .......... .......... .......... .......... 744 GAAAGTTAAG TTAGGTGCAT AAGAATCATA TGTGACAAAA GATTATAAAT CACATTCAAA 11374 .......... .......... .......... .......... .......... .......... 744 TTAGAACTCT AACGATTTAT TTAATAATAA ATTTACGTTC ATCGTTTGAC TGATATTTTA 11314 .......... .......... .......... .......... .......... .......... 744 AATCATATGA TTGGATGTAT CAAATACCAC TTCCTTTATT AGACTAAGAA GCAGAAGAAT 11254 .......... .......... .......... .......... .......... .......... 744 AAAAAAGAAG TCTAGGAAAC AATAAGAGAA AATAATAAAT TACATTATCA TTGAAAGCAA 11194 .......... .......... .......... .......... .......... .......... 744 TCTCCGCAAA AAGAAAAAGA ATTAACTCTA TCATCTCAAA TGTCATGTTC AAATCGTATA 11134 .......... .......... .......... .......... .......... .......... 744 TCAAAAGATG CTTTGATTTC ATGCACTTCA ATATTATCCT TTTTAACTTT CAATCTTTTT 11074 .......... .......... .......... .......... .......... .......... 744 CTATTTCATA TTGATACCAT TATACGTATT TGTGAAATTT AGGTTCTTTA GAAAAAAAAA 11014 .......... .......... .......... .......... .......... .......... 744 AAAATTGAAT GATAACATTG AGAGATTATA ATCATTTAAT TATTTGTGCA TGTCAATCAT 10954 .......... .......... .......... .......... .......... .......... 744 AGTGGTTCAA TTGATTGAAT ACATAAAATT TTACCTTGTT AGTAAGAATT TTATTTTCAA 10894 .......... .......... .......... .......... .......... .......... 744 ATTTTATTAT TGCACATCTG GCCTATGAAT GTCTATTTAG TGATTTAAAA GCCAAAGAAA 10834 ||||| ||| .......... .......... .......... .......... .......... .CCAAACAAA 753 AGCAAATAAT CAATTCAAAT ATGAAATACA ATGTTGTTAT GAGGTGATAC TAAATAGTAA 10774 G......... .......... .......... .......... .......... .......... 754 TAATAATAAG AAATTGAATT TTATTATTGG TCCGTCCAAT CTATATACGT CTCATTTAGG 10714 .......... .......... .......... .......... .......... .......... 754 AATTTATAAG TCAATTCGAG TATGGATCAT AACAATGTTA TTATGAGCTA ATAGACATGT 10654 .......... .......... .......... .......... .......... .......... 754 AGTAATGTAA GAAACAAAGT ATTGTTACTG GTCCGTCCAA CCTATGTATG TGTCATTTAT 10594 .......... .......... .......... .......... .......... .......... 754 AGAGTTAAAC GACAAAAAAA AAAAAAAAG 10565 ||| || | ||||| || ||||| ..AGTAAAGG TAAGCAAAAA AAGCAAAAG 781 hqPGS_C09HBa0099P03.1-1-_SGN-U340386- (15380 14945) ******************************************************************************** EST sequence 11 +strand 507 n (File: SGN-U330540+) 1 GAAAAATTAG TCAAAAAGAG AGTTTTTATT AGTTGAAGGG AGGTGTTTTT TTATGGAGCT 61 TTGGACTCAA CCCTTTTGCG GAGTTGTTGA GTTATACTTT GTGAAGTTTG TTGAATCCTG 121 GATGGGAAAG TCAAAGAGAA CTATTGCTGG ACCGGTGAAA ACAGTTGTTC CACTAGGCTT 181 GAATCTCCTT ATAGAGAGCG AGATATTCAC GCCTCTGCCT GAACAGATTT TTTTTCTTCA 241 TTTCATTTTA AGATGGATTC TTGTAATTTT TTGTTATCTT ATAAGTTTTC ACTAAAAAAT 301 TCAAGGAGAT TCATTATAGG TACAATTGAA AAAATCACGG ATGTCTAGAT GTGAGACGTC 361 AACAAGCCAT TTTGGTTCAA TAGTAACAAT TTTAAGAGGT GAAAGAGTAA AGTACTTTTC 421 TACTTAAGTT TTCTCAATGT TTCTTATGTG TTAACTGAGA AGCCCAATAA TGAAGAAATC 481 ACCTCTATGA GTGATGGTGA AACTGTC Predicted gene structure (within gDNA segment 18074 to 13285): Exon 1 17242 16950 ( 293 n); cDNA 7 295 ( 289 n); score: 0.819 Intron 1 16949 15584 (1366 n); Pd: 0.000 (s: 0.68), Pa: 0.000 (s: 0) Exon 2 15583 15550 ( 34 n); cDNA 296 328 ( 33 n); score: 0.676 MATCH C09HBa0099P03.1-1- SGN-U330540+ 0.819 327 0.645 C PGS_C09HBa0099P03.1-1-_SGN-U330540+ (17242 16950,15583 15550) Alignment (genomic DNA sequence = upper lines): TTAGTCTAA- AGAGAGTTCT TATTAGTTGA AGGGAGGTGT TCTTTTTTTG TGGAGCTTTG 17184 |||||| || |||||||| | |||||||||| ||||||||| ||||||| |||||||||| TTAGTCAAAA AGAGAGTTTT TATTAGTTGA AGGGAGGTG- --TTTTTTTA TGGAGCTTTG 63 GACTCAACTC CTGTCCAGAG TTGTTGAGTT ATACTTTGTA AAGGCTGTTG TATCCTGGAG 17124 |||||||| | | | | ||| |||||||||| ||||||||| ||| ||||| |||||||| GACTCAACCC TTTTGCGGAG TTGTTGAGTT ATACTTTGTG AAGTTTGTTG AATCCTGGAT 123 GGGACAAGTC AAAGAGGACT ACTGCTGGAC CGGTGAAAAC ATTTGCTGCA GTGGGCTTGA 17064 |||| ||||| |||||| ||| | |||||||| |||||||||| | ||| | || | ||||||| GGGA-AAGTC AAAGAGAACT ATTGCTGGAC CGGTGAAAAC AGTTGTTCCA CTAGGCTTGA 182 ATCTCCTTAA AGAGAGCGAG ATATCCGCGC CTCAGCCTGA AGAGA-TTAC TTTCTTCATT 17005 ||||||||| |||||||||| |||| | ||| ||| |||||| | ||| || |||||||||| ATCTCCTTAT AGAGAGCGAG ATATTCACGC CTCTGCCTGA ACAGATTTTT TTTCTTCATT 242 TTATTTTCAA TTGTAATCTT GCAATTTTAT TATCTTGTAA GTTTTTTTTC ACTAACACTT 16945 | ||||| | || | |||| | |||||| | | | | | | | ||||| ||||| TCATTTTAAG ATGGATTCTT GTAATTTT-T TGTTATCTTA -TAAGTTTTC ACTAA..... 295 AATTTATTAT TTTACACATA CTGAAGTTTT GGATTTAGTT TAAGCATAAA TTTTATTAGT 16885 .......... .......... .......... .......... .......... .......... 295 AACTATTTAG TCAATATTTA AAAAGTGCAT TTAGGAAAAA TATAGTATTA AAAAAAAACA 16825 .......... .......... .......... .......... .......... .......... 295 ATTTTGCATG CTAATTTATG AATCATTTTT TTTTCTTATT AAAAACATGT TCAACGTCTT 16765 .......... .......... .......... .......... .......... .......... 295 TAATTCTTTG AAATAAATAG TTTTTTTTTT ACTTTGAAAA TTTTGAAAAT CGAACAAAGT 16705 .......... .......... .......... .......... .......... .......... 295 ATAACAAGAT AGAGTGGTTG TGATGGATTT AGGATGGGGC GAGAGTATTC ACCTGAACCC 16645 .......... .......... .......... .......... .......... .......... 295 TTTTGATAAA AAATTAACTA TTTATATAAA GTATTTTTTT TTCAAAATTT TAGTGAATAT 16585 .......... .......... .......... .......... .......... .......... 295 ATAGGATTTA AATACCTAAG CATAAGACTG AAAGTTACCT TAGTAATTTA GGGACTTCAT 16525 .......... .......... .......... .......... .......... .......... 295 AATTCATTTT TAAATAGATA TATCATATTT ATTTTTCTAA CCTACTTAAT AAAAATTCTG 16465 .......... .......... .......... .......... .......... .......... 295 AATCAGTATT TAAAAATACA ATTTAATTTC ACAAACTTAC TGAATAATTT GAATTGCATA 16405 .......... .......... .......... .......... .......... .......... 295 TTTGTTTGGT TGTAACTGCA GGCTTTGGAT GGAATGATGA TAAATGATAA TATTTTTGGT 16345 .......... .......... .......... .......... .......... .......... 295 CTGACTAAGT TACTTTTTTT AAAAAAAAAT TTTTATAATA TCTTGAGGGA TTTTAATTCC 16285 .......... .......... .......... .......... .......... .......... 295 CCACAAAACC CTTTTTTTTT TTTTTTAGAG TATATTCCAT TTGCTTTTTA AAATTTTGTC 16225 .......... .......... .......... .......... .......... .......... 295 AAACATTGGG TTTTTACTGA TTTTATGTGG ACCTCAACTT CCTCATCTTT AATGTATGGC 16165 .......... .......... .......... .......... .......... .......... 295 CTTCTCTTTT TTTAGGCTTC CATGATATTC CAATTCCCTA AATTATCTAT AAAACTATAT 16105 .......... .......... .......... .......... .......... .......... 295 ATTATATATT TTAAATAATT TGATTTTAAA TTTTAAAAAA AGAATTTAGC AATTGATCTT 16045 .......... .......... .......... .......... .......... .......... 295 ATGTTTAGTG AACTCTATAA TTTTTGGATA TGATGATATG AATATGTACT TATGTCCTCC 15985 .......... .......... .......... .......... .......... .......... 295 TAAAAATTAT TTTATTTGTC TCAATTTGTA TGACATAATT AAATTTTTGA GAGTTGAATG 15925 .......... .......... .......... .......... .......... .......... 295 ATTTAGTTAT ATCTGTGAAA TTAGACATGT AATATGTAAA TTTATCAAAG TAAATTTTAC 15865 .......... .......... .......... .......... .......... .......... 295 ATATTTGAAA ACTACGTATA AATACTATAA GACATTATAA TTGATATTTG AAAGAGGGAA 15805 .......... .......... .......... .......... .......... .......... 295 AATGGTCGGA AAAATACTTC AACATTGGCC AAAATAGTTG TTTCGATATC AAATTTTATG 15745 .......... .......... .......... .......... .......... .......... 295 GAGAACCTTT TACCTCATGC ACTTTTTAAA AGTATATTTT AAAGATATAT ATGTGCCCAC 15685 .......... .......... .......... .......... .......... .......... 295 GTGGACACAT TACTTTTCAA AATGATGCAA TATTTATGAT GTTCACGTGG ACACATATAT 15625 .......... .......... .......... .......... .......... .......... 295 ATCTTTAAAA TACATTTATT AAATAGTGCA AGGGGTAAAA GATCATGCAT GAAGTTTGTA 15565 | || || | || || | .......... .......... .......... .......... .AAAATTCAA GGAGATT-CA 313 TTGTAGCAAC AATTG 15550 || ||| || ||||| TTATAGGTAC AATTG 328 hqPGS_C09HBa0099P03.1-1-_SGN-U330540+ (17242 16950) ******************************************************************************** EST sequence 6 +strand 1149 n (File: SGN-U331580+) 1 GTCTGGACAC CAAGGATGTT GAAGTGGAAC CTGAGCAGGC AGTGTCTGGA ACTATATATG 61 CCAATGGTGA CCATTCCGGA GAAAGCGTCT AGCGAGATGT ATTGGAAGTT GAAGTCTCTG 121 GCCAAACATC TGCTATATCA AGGTCAATCA CTGGCTCAGA GCAAGAAGGA GAAGCTAAAG 181 ATCATATAGA TGAAGAAGCT AACCTTGAAG GCTCAGTTTC TGATGGAGAG ACAGATGGTA 241 TGATTTTTGG AAGCTCTGAA GCTGCCAAAC AGTTTATGGA GGAGCTGGAA AGGGAATCTG 301 GTGGTGGCTC CTATGCTGGT GCTGAAGTTT CTCAGGATAT TGATGGTCAG ATTGTCACCG 361 ACTCCGATGA AGAGGCTGAT ACTGATGAAG AAGGAGATGT GAAGGAGTTG TTTGATTCGA 421 CTGCCTTAGC TGCTCTTTTA AAAACAGCAA CAGGTGGTGA TTCTGATGGT GGCAACATAA 481 CAGTCACGTC TCAAGATGGA TCAAGACTGT TCTCTGTTGA ACGTCCTGCT GGTCTTGGGT 541 CATCACTCCG GTCACTGAGG CCAGCTCCCC GACCAAGCCA ACCCAATCTT TTTACTCATT 601 CCAATCTCCA GAACAGTGGA GAATCTGAGA ACAACTTGAG TGAAGAAGAG AAGAAGAAAC 661 TGGACACACT ACAGCAGATC AGGGTCAAGT TTTTGAGGCT TATTCACAGG TTGGGTCTAT 721 CTTCTGATGA GCCCATAGCT GCACAAGTTT TGTACCGGAT GACACTTATT GCACGAAGGC 781 AAAACAGTCC ACTTTTTAGC GTTGAGGCTG CCAAGATGAA AGCTTTCCAG CTTGAAGCAG 841 AGGGGAAAGA TGATTTGGAC TTCTCTGTGA ATATCCTGGT TATTGGCAAA TCTGGGGTGG 901 GTAAGAGCGC TACCATAAAC TCTATCTTTG GAGAGGAAAA AACATCAATT GATGCCTTTG 961 GACCTGCTAC CACCAGTGTG AAAGAGATCA GTGGTGTTGT AGATGGTGTT AAGATTCGGG 1021 TGTTTGATAC ACCTGGCCTC AAGTCCTCTG CGATGGAACA NGGNTTTCAT CGCAGTGTCT 1081 TGTCTTCAGT AAAGAAGTTG ACTAAGAAGA ATCCCCCTGA TATTTACCTC TATGTCGATC 1141 GGTTGGATG Predicted gene structure (within gDNA segment 20048 to 22406): Exon 1 20648 21796 (1149 n); cDNA 1 1149 (1149 n); score: 0.988 MATCH C09HBa0099P03.1-1+ SGN-U331580+ 0.988 1149 1.000 C PGS_C09HBa0099P03.1-1+_SGN-U331580+ (20648 21796) Alignment (genomic DNA sequence = upper lines): GTCTGGACAC CAAGGATGTT GAAGTGGAAC CTGAGCAGGC AGTGTCTGGA ACTATATATG 20707 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTCTGGACAC CAAGGATGTT GAAGTGGAAC CTGAGCAGGC AGTGTCTGGA ACTATATATG 60 CCAATGGTGA CCATTCCGGA GAAAGCGTCG AGCGAGATGT AGTGGAAGTT GAAGTCTCTG 20767 |||||||||| |||||||||| ||||||||| |||||||||| | |||||||| |||||||||| CCAATGGTGA CCATTCCGGA GAAAGCGTCT AGCGAGATGT ATTGGAAGTT GAAGTCTCTG 120 GTCAAACATC TGCTATATCA AGGTCAATCA CTGGCTCAGA GCAAGAAGGA GAAGCTAAAG 20827 | |||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCCAAACATC TGCTATATCA AGGTCAATCA CTGGCTCAGA GCAAGAAGGA GAAGCTAAAG 180 ATCATATAGA TGAAGAAGCT AACCTTGAAG GCTCAGTTTC AGATGGAGAG ACAGATGGTA 20887 |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| |||||||||| ATCATATAGA TGAAGAAGCT AACCTTGAAG GCTCAGTTTC TGATGGAGAG ACAGATGGTA 240 TGATTTTTGG AAGCTCTGAA GCTGCCAAAC AGTTTATGGA GGAGCTGGAA AGGGAATCTG 20947 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGATTTTTGG AAGCTCTGAA GCTGCCAAAC AGTTTATGGA GGAGCTGGAA AGGGAATCTG 300 GTGGTGGCTC CTATGCTGGT GCTGAGGTTT CTCAGGATAT TGATGGTCAG ATTGTCACCG 21007 |||||||||| |||||||||| ||||| |||| |||||||||| |||||||||| |||||||||| GTGGTGGCTC CTATGCTGGT GCTGAAGTTT CTCAGGATAT TGATGGTCAG ATTGTCACCG 360 ACTCAGATGA GGAGGCTGAT ACTGATGAAG AAGGAGATGT GAAGGAGTTG TTTGATTCAG 21067 |||| ||||| ||||||||| |||||||||| |||||||||| |||||||||| |||||||| ACTCCGATGA AGAGGCTGAT ACTGATGAAG AAGGAGATGT GAAGGAGTTG TTTGATTCGA 420 CTGCCTTAGC TGCTCTTTTA AAAGCAGCAA CAGGTGGTGA TTCTGATGGT GGCAACATAA 21127 |||||||||| |||||||||| ||| |||||| |||||||||| |||||||||| |||||||||| CTGCCTTAGC TGCTCTTTTA AAAACAGCAA CAGGTGGTGA TTCTGATGGT GGCAACATAA 480 CAGTCACGTC TCAAGATGGA TCAAGACTGT TCTCTGTTGA ACGTCCTGCT GGTCTTGGGT 21187 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAGTCACGTC TCAAGATGGA TCAAGACTGT TCTCTGTTGA ACGTCCTGCT GGTCTTGGGT 540 CATCACTCCG GTCACTGAGG CCAGCTCCCC GACCAAGCCA ACCCAATCTT TTTACTCATT 21247 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CATCACTCCG GTCACTGAGG CCAGCTCCCC GACCAAGCCA ACCCAATCTT TTTACTCATT 600 CCAATCTCCA GAACAGTGGA GAATCTGAGA ACAACTTGAG TGAAGAAGAG AAGAAGAAAC 21307 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCAATCTCCA GAACAGTGGA GAATCTGAGA ACAACTTGAG TGAAGAAGAG AAGAAGAAAC 660 TGGACACACT ACAGCAGATC AGGGTCAAGT TTTTGAGGCT TATTCACAGG TTGGGTCTAT 21367 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGGACACACT ACAGCAGATC AGGGTCAAGT TTTTGAGGCT TATTCACAGG TTGGGTCTAT 720 CTTCTGATGA GCCCATAGCT GCACAAGTTT TGTACCGGAT GACACTTATT GCACGAAGGC 21427 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTCTGATGA GCCCATAGCT GCACAAGTTT TGTACCGGAT GACACTTATT GCACGAAGGC 780 AAAACAGTCC ACTTTTTAGC GTTGAGGCTG CCAAGATGAA AGCTTTCCAG CTTGAAGCAG 21487 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAACAGTCC ACTTTTTAGC GTTGAGGCTG CCAAGATGAA AGCTTTCCAG CTTGAAGCAG 840 AGGGGAAAGA TGATTTGGAC TTCTCTGTGA ATATCCTGGT TATTGGCAAA TCTGGGGTGG 21547 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGGGGAAAGA TGATTTGGAC TTCTCTGTGA ATATCCTGGT TATTGGCAAA TCTGGGGTGG 900 GTAAGAGCGC TACCATAAAC TCTATCTTTG GAGAGGAAAA AACATCAATT GATGCCTTTG 21607 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTAAGAGCGC TACCATAAAC TCTATCTTTG GAGAGGAAAA AACATCAATT GATGCCTTTG 960 GACCTGCTAC CACCAGTGTG AAAGAGATCA GTGGTGTTGT AGATGGTGTT AAGATTCGGG 21667 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GACCTGCTAC CACCAGTGTG AAAGAGATCA GTGGTGTTGT AGATGGTGTT AAGATTCGGG 1020 TGTTTGATAC ACCTGGCCTC AAGTCCTCTG CGATGGAACA GGGTTTCAAT CGCAGTGTCT 21727 |||||||||| |||||||||| |||||||||| |||||||||| || || || |||||||||| TGTTTGATAC ACCTGGCCTC AAGTCCTCTG CGATGGAACA NGGNTTTCAT CGCAGTGTCT 1080 TGTCTTCAGT AAAGAAGTTG ACTAAGAAGA ATCCCCCTGA TATTTACCTC TATGTCGATC 21787 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGTCTTCAGT AAAGAAGTTG ACTAAGAAGA ATCCCCCTGA TATTTACCTC TATGTCGATC 1140 GGTTGGATG 21796 ||||||||| GGTTGGATG 1149 hqPGS_C09HBa0099P03.1-1+_SGN-U331580+ (20648 21796) ******************************************************************************** EST sequence 4 +strand 1673 n (File: SGN-U318403+) 1 AATGATGAGT CCAAGTCTGA TGAATCCCGT CTCTCTGGTA GAAAATCATC CATCGTGCAG 61 GAGGAATAGG GATGGACATA AGATACTACC TAATGGCCAG AGCTGGAGGC CTCAATTACT 121 ACTATTAAGC TACTCAATGA AGATCTTATC TGAAGCAAGT GCACTTTCAA AGCCTGAAGA 181 TCCATTTGAT CACCGTAAGC TCTTTGGTTT CCGCACACGC TCACCACCTC TTCCCTACAT 241 GCTTTCTTCA ATGTTGCAGT CACGTGCGCA TCCAAAGCTT TCTGCTGAGC AGGGTGGTGA 301 CAACGGTGAT TCAGACATTG ACTTAGATGA TTTGTCAGAC TCTGACCAAG AAGAAGAAGA 361 TGAGTATGAC CAGCTTCCTC CCTTCAAGCC TCTTCGGAAG GCTCAGCTTG CTAAGCTCAG 421 CAAAGAACAG AGGAAGGCGT ACTTTGAGGA GTATGACTAC AGGGTCAAGC TCCTTCAGAA 481 GAAACAGTTG AGAGAAGATT TAAAAAGAAT GAAAGAGATG AAAAGTAAGG GAAAAGAGGC 541 TGCAATTGAC AATGGTTATG CAGAGGAAGA AGCTGATGCA GGTGCAGCAG CTCCCGTAGC 601 AGTTCCCCTT CCTGACATGG CCCTTCCACC TTCTTTTGAT AGTGATAATC CCGCCTATAG 661 GTACCGCTTC TTGGAGCCCA CATCACAGTT CCTTGCAAGG CCTGTTCTGG ACACGCATGG 721 TTGGGATCAT GATTGTGGCT ATGATGGTGT TAACGTGGAA CAAAGTTTAG CCATTGCCAG 781 TCGTTTCCCT GCTGCAGTTA CTGTGCAAAT CACCAAAGAT AAGAAGGATT TCAGTATCAA 841 TTTGGACTCT TCGATTGCTG CTAAGCACGG AGAAAATGGA TCAACCATGG CTGGCTTTGA 901 TATTCAAAGC ATAGGGAAGC AACTTGCCTA TATTGTCCGA GGAGAAACCA AATTCAAAAG 961 CTTGAAGAAG AACAAGACTG CTTGCGGAAT TTCTGTTACA TTTCTAGGTG AAAATATGGT 1021 CACTGGACTT AAAGTTGAAG ATCAAATCAT CTTAGGCAAG CAATACGTTC TAGTTGGCAG 1081 TGCTGGCACT GTTCGATCTC AGAGTGACAC AGCTTATGGG GCTAACTTTG AACTGCAGAG 1141 GAGGGAGGCA GATTTCCCAA TCGGTCAGGT GCAATCTACA TTGTCTATGT CCGTCATAAA 1201 GTGGAGAGGT GATTTGGCTC TAGGTTTCAA CAGTATGGCG CAATTCGCTG TGGGACGCAA 1261 TTCGAAGGTA GCTGTTCGAG CAGGAATCAA TAACAAGCTC AGTGGGCAAG TAACCGTGAG 1321 GACAAGCAGT TCAGACCATC TCTCTCTTGC ACTTACTGCT ATTATTCCAA CTGCAATTGG 1381 CATCTACAGG AAGCTTTGGC CGGATGCTGG CGAGAAGTAC TCAATCTACT AAATTTCATT 1441 TCCATATCAG CATTGCATTT TTGGTTCATT AGACCTTACA TGATGACATA TTGTCTTTGT 1501 CAGTTCATTG AATAATGCTT CTGTTAATTT CCCATCTATT TAGGATTCCT ACTGTTATGA 1561 GTTATAAGTC AATTTTGAGT GATTGAATGT ACTTTTTTGC CAGATGAAAT GAAGAGGTTT 1621 GTGTTGCTTT AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA CTCAGACTAG TCT Predicted gene structure (within gDNA segment 21411 to 24671): Exon 1 22011 23640 (1630 n); cDNA 1 1630 (1630 n); score: 0.999 PPA cDNA 1631 1661 MATCH C09HBa0099P03.1-1+ SGN-U318403+ 0.999 1630 0.974 C PGS_C09HBa0099P03.1-1+_SGN-U318403+ (22011 23640) Alignment (genomic DNA sequence = upper lines): AATGATGAGT CCAAGTCTGA TGAATCCCGT CTCTCTGGTA GAAAATCATC CATCTTGCAG 22070 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||| ||||| AATGATGAGT CCAAGTCTGA TGAATCCCGT CTCTCTGGTA GAAAATCATC CATCGTGCAG 60 GAGGAATAGG GATGGACATA AGATACTACC TAATGGCCAG AGCTGGAGGC CTCAATTACT 22130 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAGGAATAGG GATGGACATA AGATACTACC TAATGGCCAG AGCTGGAGGC CTCAATTACT 120 ACTATTAAGC TACTCAATGA AGATCTTATC TGAAGCAAGT GCACTTTCAA AGCCTGAAGA 22190 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACTATTAAGC TACTCAATGA AGATCTTATC TGAAGCAAGT GCACTTTCAA AGCCTGAAGA 180 TCCATTTGAT CACCGTAAGC TCTTTGGTTT CCGCACACGC TCACCACCTC TTCCCTACAT 22250 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCCATTTGAT CACCGTAAGC TCTTTGGTTT CCGCACACGC TCACCACCTC TTCCCTACAT 240 GCTTTCTTCA ATGTTGCAGT CACGTGCGCA TCCAAAGCTT TCTGCTGAGC AGGGTGGTGA 22310 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCTTTCTTCA ATGTTGCAGT CACGTGCGCA TCCAAAGCTT TCTGCTGAGC AGGGTGGTGA 300 CAACGGTGAT TCAGACATTG ACTTAGATGA TTTGTCAGAC TCTGACCAAG AAGAAGAAGA 22370 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAACGGTGAT TCAGACATTG ACTTAGATGA TTTGTCAGAC TCTGACCAAG AAGAAGAAGA 360 TGAGTATGAC CAGCTTCCTC CCTTCAAGCC TCTTCGGAAG GCTCAGCTTG CTAAGCTCAG 22430 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGAGTATGAC CAGCTTCCTC CCTTCAAGCC TCTTCGGAAG GCTCAGCTTG CTAAGCTCAG 420 CAAAGAACAG AGGAAGGCGT ACTTTGAGGA GTATGACTAC AGGGTCAAGC TCCTTCAGAA 22490 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAAAGAACAG AGGAAGGCGT ACTTTGAGGA GTATGACTAC AGGGTCAAGC TCCTTCAGAA 480 GAAACAGTTG AGAGAAGATT TAAAAAGAAT GAAAGAGATG AAAAGTAAGG GAAAAGAGGC 22550 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAAACAGTTG AGAGAAGATT TAAAAAGAAT GAAAGAGATG AAAAGTAAGG GAAAAGAGGC 540 TGCAATTGAC AATGGTTATG CAGAGGAAGA AGCTGATGCA GGTGCAGCAG CTCCCGTAGC 22610 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGCAATTGAC AATGGTTATG CAGAGGAAGA AGCTGATGCA GGTGCAGCAG CTCCCGTAGC 600 AGTTCCCCTT CCTGACATGG CCCTTCCACC TTCTTTTGAT AGTGATAATC CCGCCTATAG 22670 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGTTCCCCTT CCTGACATGG CCCTTCCACC TTCTTTTGAT AGTGATAATC CCGCCTATAG 660 GTACCGCTTC TTGGAGCCCA CATCACAGTT CCTTGCAAGG CCTGTTCTGG ACACGCATGG 22730 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTACCGCTTC TTGGAGCCCA CATCACAGTT CCTTGCAAGG CCTGTTCTGG ACACGCATGG 720 TTGGGATCAT GATTGTGGCT ATGATGGTGT TAACGTGGAA CAAAGTTTAG CCATTGCCAG 22790 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGGGATCAT GATTGTGGCT ATGATGGTGT TAACGTGGAA CAAAGTTTAG CCATTGCCAG 780 TCGTTTCCCT GCTGCAGTTA CTGTGCAAAT CACCAAAGAT AAGAAGGATT TCAGTATCAA 22850 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCGTTTCCCT GCTGCAGTTA CTGTGCAAAT CACCAAAGAT AAGAAGGATT TCAGTATCAA 840 TTTGGACTCT TCGATTGCTG CTAAGCACGG AGAAAATGGA TCAACCATGG CTGGCTTTGA 22910 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTGGACTCT TCGATTGCTG CTAAGCACGG AGAAAATGGA TCAACCATGG CTGGCTTTGA 900 TATTCAAAGC ATAGGGAAGC AACTTGCCTA TATTGTCCGA GGAGAAACCA AATTCAAAAG 22970 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TATTCAAAGC ATAGGGAAGC AACTTGCCTA TATTGTCCGA GGAGAAACCA AATTCAAAAG 960 CTTGAAGAAG AACAAGACTG CTTGCGGAAT TTCTGTTACA TTTCTAGGTG AAAATATGGT 23030 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTGAAGAAG AACAAGACTG CTTGCGGAAT TTCTGTTACA TTTCTAGGTG AAAATATGGT 1020 CACTGGACTT AAAGTTGAAG ATCAAATCAT CTTAGGCAAG CAATACGTTC TAGTTGGCAG 23090 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CACTGGACTT AAAGTTGAAG ATCAAATCAT CTTAGGCAAG CAATACGTTC TAGTTGGCAG 1080 TGCTGGCACT GTTCGATCTC AGAGTGACAC AGCTTATGGG GCTAACTTTG AACTGCAGAG 23150 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGCTGGCACT GTTCGATCTC AGAGTGACAC AGCTTATGGG GCTAACTTTG AACTGCAGAG 1140 GAGGGAGGCA GATTTCCCAA TCGGTCAGGT GCAATCTACA TTGTCTATGT CCGTCATAAA 23210 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAGGGAGGCA GATTTCCCAA TCGGTCAGGT GCAATCTACA TTGTCTATGT CCGTCATAAA 1200 GTGGAGAGGT GATTTGGCTC TAGGTTTCAA CAGTATGGCG CAATTCGCTG TGGGACGCAA 23270 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTGGAGAGGT GATTTGGCTC TAGGTTTCAA CAGTATGGCG CAATTCGCTG TGGGACGCAA 1260 TTCGAAGGTA GCTGTTCGAG CAGGAATCAA TAACAAGCTC AGTGGGCAAG TAACCGTGAG 23330 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCGAAGGTA GCTGTTCGAG CAGGAATCAA TAACAAGCTC AGTGGGCAAG TAACCGTGAG 1320 GACAAGCAGT TCAGACCATC TCTCTCTTGC ACTTACTGCT ATTATTCCAA CTGCAATTGG 23390 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GACAAGCAGT TCAGACCATC TCTCTCTTGC ACTTACTGCT ATTATTCCAA CTGCAATTGG 1380 CATCTACAGG AAGCTTTGGC CGGATGCTGG CGAGAAGTAC TCAATCTACT AAATTTCATT 23450 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CATCTACAGG AAGCTTTGGC CGGATGCTGG CGAGAAGTAC TCAATCTACT AAATTTCATT 1440 TCCATATCAG CATTGCATTT TTGGTTCATT AGACCTTACA TGATGACATA TTGTCTTTGT 23510 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCCATATCAG CATTGCATTT TTGGTTCATT AGACCTTACA TGATGACATA TTGTCTTTGT 1500 CAGTTCATTG AATAATGCTT CTGTTAATTT CCCATCTATT TAGGATTCCT ACTGTTATGA 23570 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAGTTCATTG AATAATGCTT CTGTTAATTT CCCATCTATT TAGGATTCCT ACTGTTATGA 1560 GTTATAAGTC AATTTTGAGT GATTGAATGT ACTTTTTTGC CAGATGAAAT GAAGAGGTTT 23630 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTTATAAGTC AATTTTGAGT GATTGAATGT ACTTTTTTGC CAGATGAAAT GAAGAGGTTT 1620 GTGTTGCTTT 23640 |||||||||| GTGTTGCTTT 1630 hqPGS_C09HBa0099P03.1-1+_SGN-U318403+ (22011 23640) ******************************************************************************** EST sequence 1 +strand 1043 n (File: SGN-U336521+) 1 TAAGATNCGT GATGGATCCT CGCGCTGCAC CGTGGTGGCG GCCGCTCTAG AACTAGTGGA 61 TCCCCCGGGC TGCAGGAATT CGGCACGAGG AAAGGGATGG TTTTTGGAGA GAATTTTTTT 121 TTTTTACATC TTGTGATAGA AAAGAATTAT TGAAAGAGTT TAGTCATAAG AAGATTTTAC 181 AATAGTTTAA GCACACCTAT ATATCTATTG AAGTTGAATT AAAAACAAAA GGATTGCGGG 241 GTTTTTGATT ATCATTTGAA CTCGAAGTTG TGTAGTGGAA GGTCATCCCC TTGCTCGTTT 301 CGTCTCCAGC TCGCTGCCCC GAAGAAGTTG AGTTTTAGGC TGAATTGTCT TGGCAAGATA 361 AGACGAGTGC CATCAAGTCC ATTTTTGGTT CGTGACATCC GTTTGTATAT TTTGTTTGCA 421 AATATACAAA CAAGGTAAGT ATATACAAAT TCTGCATTTA TACATATAAG AAATTTTCAG 481 AACTATTGTG CTTGTCAATA TACAAACATA GTAATTTATT ACTCCCTTCG TTTCAAAAGG 541 ATGACCTAGT TTGACTTGGA ACAGAGTGTA AGAAAAGAAA GAAAACTCTA ATCTTGTGGT 601 TCTAAATTAA AGTTATGTCA AATGTACCAA AATGCTCTTT AATCTTGTGG TCTTAAATAT 661 GACACGTGGA AAGTTAAAGT TTAAGTGTTG CCAAAAAAAG AAAGGGGTCA TTCTTTTTTA 721 AACAAACTAA AAAGGGAAAT AAGAACATTC TTTTTGAAAC GGAGGGAGTG GATTTATATA 781 TATATATATA TATATATATA TTTATTTATT AATAAATTGA ATATAAATAG CTTGGGGTAG 841 GCTTTTACAA ATTCTAATTT TAAATTAAAG ACTGGATTTT CCAATTTTGG GTTTCCCTAA 901 TTGGTACCCC ACCCTATTGG GGGCTGGTTT TAATGGGAAC ATACTTCAAT AATCAATAGT 961 CTAAAGTTAA AATTTCCAAT TGGCATAAAT TTAAAATAAA TTTTAGGGGG GATTTGGGGT 1021 GGGATCCCCC GGGCATTTTT TTT Predicted gene structure (within gDNA segment 22414 to 34032): Exon 1 24419 24455 ( 37 n); cDNA 453 489 ( 37 n); score: 0.568 Intron 1 24456 24879 ( 424 n); Pd: 0.900 (s: 0), Pa: 0.000 (s: 0) Exon 2 24880 24904 ( 25 n); cDNA 490 511 ( 22 n); score: 0.800 Intron 2 24905 28813 (3909 n); Pd: 0.996 (s: 0), Pa: 0.000 (s: 0.78) Exon 3 28814 29015 ( 202 n); cDNA 512 710 ( 199 n); score: 0.802 Intron 3 29016 29637 ( 622 n); Pd: 0.000 (s: 0.66), Pa: 0.528 (s: 0.54) Exon 4 29638 29697 ( 60 n); cDNA 711 768 ( 58 n); score: 0.550 Intron 4 29698 33027 (3330 n); Pd: 0.986 (s: 0.54), Pa: 0.000 (s: 0.74) Exon 5 33028 33095 ( 68 n); cDNA 769 827 ( 59 n); score: 0.735 MATCH C09HBa0099P03.1-1+ SGN-U336521+ 0.742 392 0.376 C PGS_C09HBa0099P03.1-1+_SGN-U336521+ (24419 24455,24880 24904,28814 29015,29638 29697,33028 33095) Alignment (genomic DNA sequence = upper lines): TGCATAGAAG CAATTCATCA TTTTTTAGTA CGAGTAGATA TCATTAGCTA TTTGCATATT 24478 ||||| | || | | | |||| || | | | | TGCATTTATA CATATAAGAA ATTTTCAGAA CTATTGT... .......... .......... 489 GTAAGCAAAT ATGGAGAAAC AACTAACGAT CATGATGGAT CTTGAGTCAT TTAAAAAAAT 24538 .......... .......... .......... .......... .......... .......... 489 GTATATTGTT GCTACTTCTT CGTGTCAGAT AAAAATAAAA TAGGGTGGTG GTTAGAGAAA 24598 .......... .......... .......... .......... .......... .......... 489 ATTACGTAAT AAAATAAATT TATACTATTT GATTACATAT CATAGTTGTA GTTTGCTATT 24658 .......... .......... .......... .......... .......... .......... 489 ATTATCTTTC GCGACAAACA TTGTATATTA ATTACATGGG CCGACTTCGA ATTTGTATAA 24718 .......... .......... .......... .......... .......... .......... 489 TTAGTCACGT TTGTATATGT ATAATTCGCC AGAATATACA AATACATATG TATAATATAC 24778 .......... .......... .......... .......... .......... .......... 489 AATTATTTAA CCTACATATG TATACAGTCA CCTCTCTTCC TAGTCTCGCT CGCCTCTCTC 24838 .......... .......... .......... .......... .......... .......... 489 CTCCCTTTTC CAATCTCCTC TCTCCTCTCT CTCCCAATCT CTCTTGCCAT ATATACAAAT 24898 |||| || |||||||| .......... .......... .......... .......... .GCTTGTCA- ATATACAA-- 505 ACATAGGTAT AATATACAAA TATCTAACCA ATATACATAT ATAATTCACC TTTCTCCCAC 24958 |||||| ACATAG.... .......... .......... .......... .......... .......... 511 TCTTTTCTCC CCTCTCTCGC CTCTCTCGTC ACTCTCCTAG TCTCGCTCAT CTCTTTTCTT 25018 .......... .......... .......... .......... .......... .......... 511 CATATAACAT GTAGCTACAA ATTATAATTA TCAAACTATA TCTATGGAGA GTAATTAATT 25078 .......... .......... .......... .......... .......... .......... 511 ATATTTAAGT GGCTATATGT GAAAATTTCT TCGATTTAAA CACGTGAGGC TTAAAAATAT 25138 .......... .......... .......... .......... .......... .......... 511 GATGGAAATG ATCTCGGGAC AATGTACAAA TATTTTCCTA GACTGATCGA AATCTAATAA 25198 .......... .......... .......... .......... .......... .......... 511 ACGCACGTCT TATTCAACTT TGTGAAAAGT TACAAAATCA AATCATTTCT TTTGATGTTC 25258 .......... .......... .......... .......... .......... .......... 511 TTTACCATCG ATACTCCCTA AAACCTCTTT ATTCTTGTAT TTTGTTTCAC CTGCTTGATT 25318 .......... .......... .......... .......... .......... .......... 511 AAAAGTTGAT ACAATATGAT ATGTTCAAGA ATTTAAATTT GGTGATTTAG GATAATTACT 25378 .......... .......... .......... .......... .......... .......... 511 AAATACAATA TATTAAGGAA CTTAAGAAGT GGCTCACGAT ATAAAGATGA ACAATTACTA 25438 .......... .......... .......... .......... .......... .......... 511 CCGACAAACA TAAACCAAGT TATTAACCTG ACGGCCCATT AAGCCCCCCC CCCCCCCCCC 25498 .......... .......... .......... .......... .......... .......... 511 CCCCAGTACA GTAATGTAAA ATGGTCATTG CGCCGGAGCA ACAGTGCCGT CGGTCAAGGG 25558 .......... .......... .......... .......... .......... .......... 511 AACGGACTGC TCGTTTTCAG CGGTGGCGCC GCCTTCAAAC ACCTTCGCGA TCCTCTCTAC 25618 .......... .......... .......... .......... .......... .......... 511 GGGAATAATC GGTAAATAAT GCGCCGCCGT ATAGCCGTTC TCCGACGCAT ATGTCACCGC 25678 .......... .......... .......... .......... .......... .......... 511 TTGATTGTAT TTCTCACTCC AATAAGCAGC CGTCGGCACC AAAATCTGAG CCACTTGAGG 25738 .......... .......... .......... .......... .......... .......... 511 GAACAAAGGA AGCTTATTCA GAGATCTCCA TGCCAAAACC GCATTTTTCT CGATCACTGG 25798 .......... .......... .......... .......... .......... .......... 511 CTCGTATTTT GTGTATAGCT CCTTGACTGT GGGTTCGTAC TTTGTGTAGA GTGTTTTAGC 25858 .......... .......... .......... .......... .......... .......... 511 TACGTTGCTC GCTGTGTCCA CTAAGCCATC GTGCTGTACC TCGCCGGCGA GATCTCGAGC 25918 .......... .......... .......... .......... .......... .......... 511 TAATTCTGGA GCCTTCTGAG CTATTAACAG AGCTTTAGAT GATGTCTGCT TTAGTAGAGA 25978 .......... .......... .......... .......... .......... .......... 511 AGGCACATGG CTTTCAACTT CTGTCATCAA GTCTGCAACC TGCATAGATT CAAACTTTCA 26038 .......... .......... .......... .......... .......... .......... 511 AATCCTTGCC ACTGAAATTC GAGTTCAAAG TTTACAACAT TCGTTTCTAT GTTAGTGAAT 26098 .......... .......... .......... .......... .......... .......... 511 TATCATGTGC ATGTCCATGC ACATTCTTCC AGATTGAGCT AGATCTAGGA TTTCGATTAG 26158 .......... .......... .......... .......... .......... .......... 511 TTGGAGAAGA CAATTTCTGA ATGCCATACA AATACCTTCT CAATTCCAGC AAACTTTCAG 26218 .......... .......... .......... .......... .......... .......... 511 CTGCAAAATA AGTATGGTGT AGACAGTATG GGTAAAATTC ATTTTGGAGA AAGAAGGTGA 26278 .......... .......... .......... .......... .......... .......... 511 ACCTTTAGGT CGATGAACTT GAGGAGATTG AAAGGAACGT TATGGAACTT CTCATAAACC 26338 .......... .......... .......... .......... .......... .......... 511 GGTCCGATAA CAGTTTTAAC AGTGGCTTCT ACGGCCTGTA CACCAGGTTT CAACGGACCG 26398 .......... .......... .......... .......... .......... .......... 511 GAGTTTTCTT TGCCGTATTC ATACAAAGTT GAGAAGCAAA CAATCACATA GATCGCTGCA 26458 .......... .......... .......... .......... .......... .......... 511 ACTTGAACGA AATCCAGATA TTTAAGTTTC CTCTCATCTT CTGCTACCTT CAATTATAAA 26518 .......... .......... .......... .......... .......... .......... 511 CACAAATCAG GTACAGATTT AAAACTCAAA AATTCTGAAT CTCACCTATG TTGAGAAAAC 26578 .......... .......... .......... .......... .......... .......... 511 ACATCTCGAA TTAAATTCAA CAAATTATAC TAGTATTCTA CCTGTCAATG CATTTAATCA 26638 .......... .......... .......... .......... .......... .......... 511 AATCTAACTA CTCTTTCTCG ATATATTTTT TATCAGTCGA GTGACCATAT GTATTGCGAA 26698 .......... .......... .......... .......... .......... .......... 511 GATGAAATTG GCAACCATGA AGGCATTGCA CAACAACTCA TTCCTAATTT AACACCTCAA 26758 .......... .......... .......... .......... .......... .......... 511 AATAAGAAGA TAACAATTCA TTCTGGCAAT TAAATTGAGT AATTGAACAA CTTAGATCTA 26818 .......... .......... .......... .......... .......... .......... 511 ATATGTGAAG CCTGAATAAC AGAATATGAA GTTACAACCA CGCTACAACG GTGTGTGTTT 26878 .......... .......... .......... .......... .......... .......... 511 GAAAATTAAA ATTGATTTAA CTGTAATATC ATGATAAACG TAGAAGACAT TGTATACTTA 26938 .......... .......... .......... .......... .......... .......... 511 CTGGATCAGC TGGATCAGTA CTAGTAGGCG GCGTCGTGTT TGCTGGATCA GTACTAGCAG 26998 .......... .......... .......... .......... .......... .......... 511 GCGGCGTCGT GCTTGCTGGA TCAGTCGTAG CAGGTGGCGC CGTGCTTGCT GGATCAGTAG 27058 .......... .......... .......... .......... .......... .......... 511 GCGGCGTTGC AGCTGCGTCA GCCATGGTAG AGATGGAGAA AAACTAAGTA ATGGAGTTTG 27118 .......... .......... .......... .......... .......... .......... 511 TACAGGAGAG CTTCGAAAAG GCAGGCAGTT GGCAAGCCCT CTTTTATAGC GGACGGGGCA 27178 .......... .......... .......... .......... .......... .......... 511 CGTAATAATC GTGTGCGGAG AAGTCGATGA AAATTTTACA GGTTTTCAAT TAATGTTTAA 27238 .......... .......... .......... .......... .......... .......... 511 AACACAAAAA TAAAGTGAAT AAATATTCTT AATTAAAGTG ATTAGTATTC ATGACCATAA 27298 .......... .......... .......... .......... .......... .......... 511 ATAGAAAGAA ATTATTTAAT AATATTTTTT TGTTAATTTT TAAATAAGGA AAATTACACA 27358 .......... .......... .......... .......... .......... .......... 511 AATTGTCCTA AGTTAAGACT TTACTCAATT TATTTACTTT GATGGAACAC ATAATCCAAA 27418 .......... .......... .......... .......... .......... .......... 511 TTATTTACCT TACTTTCCTC TTGTTCAATC ACTAAATTTT TTTTTAATTC TTGATACGAT 27478 .......... .......... .......... .......... .......... .......... 511 TTAAATACAC TTCGATGATA TTAATCAGTA ATTGAGCAGT TTCTTCTTTG ATGTCATCAG 27538 .......... .......... .......... .......... .......... .......... 511 AATGATATTA AAGAGTAACT GAGTAGAGTT TTCACTGAAA CACTGTCTCT TTTTCTAAAA 27598 .......... .......... .......... .......... .......... .......... 511 TTTTACTCTA ATGGTATCAA AGAATAAGTG AATCACTGCA CTTTTCTTCC TCTTTGATAC 27658 .......... .......... .......... .......... .......... .......... 511 CACGAGTGTG TGTCTATCTA TATATATACT TTGATGGTAT CGAATATTAA GGACACATTA 27718 .......... .......... .......... .......... .......... .......... 511 GTAGTTAAGA ATAGGGGTAT GAGAACAGTG GGGTAGCCAC ATTATGGTCA GAGTGTCCAA 27778 .......... .......... .......... .......... .......... .......... 511 ATTGACACGA TTCGTCGGGA AAAAAAAATA CAAGTAAAAT GACATACAAA ATTATTAAAT 27838 .......... .......... .......... .......... .......... .......... 511 AACATATTTT GAACACTCTT AACCTAACAA GTTGTTATTG CCTAGTGATT TCGCCTCTTT 27898 .......... .......... .......... .......... .......... .......... 511 GGAATGAGTA ACTGCTTATA TGTTCGAATC TCATTAGTTC TACTTTTAAA AGAGTTCTTA 27958 .......... .......... .......... .......... .......... .......... 511 TTGTGCTTAA TCTGGACACT CTTTGTGAAA TTGTTGATTT CGCCACTAAT ATGAAAATAA 28018 .......... .......... .......... .......... .......... .......... 511 GGAGGATAAC CACTTATAAA TAGTTTTTCT TTTTTGATAT AAATGTTGTT TTCTCTTTAA 28078 .......... .......... .......... .......... .......... .......... 511 AATATTACTC TCTTCGTTTA ATAATAAAGA ATAGCCTACT CTTTTATTTG GTCTGTTTAA 28138 .......... .......... .......... .......... .......... .......... 511 ATAAAAAAGT CCCCTTTTTT TACAATTCTT TAAATTCAAC TTTTCACGTG ATATATTTAA 28198 .......... .......... .......... .......... .......... .......... 511 CACCATACAA TCAATTTTTT AATACATTTG ATACAACTTT AATTTAAAAC TAAAAAAATT 28258 .......... .......... .......... .......... .......... .......... 511 ATTTTATTTT AATTTTTTTA AACCAACATT TTTATAAAAT GAAAGGAGTA TAATTCTTTA 28318 .......... .......... .......... .......... .......... .......... 511 ACCCTTTTAA TTATTATCGG TTACACTTTA ATGCATGTCA CTTTTTCAGT AAGTAGGTTA 28378 .......... .......... .......... .......... .......... .......... 511 TTCTCACGTG ATTAGCTAGC TGTTCTAGAA TCTCGACAGA ATCACGTATA TAACCTGTTT 28438 .......... .......... .......... .......... .......... .......... 511 ATTTTTATTT TTATTTTGTT TATTAATAAA AATAATTAAA TTTAATAGGA TCGGATCCCG 28498 .......... .......... .......... .......... .......... .......... 511 GGTATTAACT TTTATGCGAA GATTCATTTT AGATTGTCTA GATTTATAGT TGTCCTCAAT 28558 .......... .......... .......... .......... .......... .......... 511 TATTTTTTAT TTTTTTCTTT AAAAAGGTAA TTATTTTAAT TAATTCACTT GAAAATATAA 28618 .......... .......... .......... .......... .......... .......... 511 ATAATTATGT AAAATTCAAA AAGAGTTTTT TAAAGTATAA ATTAGTAAAA GTAACATTTT 28678 .......... .......... .......... .......... .......... .......... 511 TATTTATGGT TTTTAAAGAG ACGTATAAAA AAAAAATAGA CAATTACTCT TCAATGCATT 28738 .......... .......... .......... .......... .......... .......... 511 ACTTTTTTGT GTGAATTTAG ATTAGTCAGG TTCTAATATA AATATTAAAA ATGAGATAGT 28798 .......... .......... .......... .......... .......... .......... 511 ATATTGTATA TTCTCTTATT TCATATTTTC TCCGTTTAAA AAAGAATGAA CTAGTTTGAC 28858 | ||| | || | | | ||||| | |||| |||| |||||||||| .......... .....TAATT TATTACTCCC TTCGTTT-CA AAAGGATGAC CTAGTTTGAC 555 TTGGAATGAA GTTTAAGAAA AGAAAGAAGA CTTTTTAATC TTGTGGTTCT AAATTAAAGT 28918 |||||| | || ||||||| |||||||| | | | ||||| |||||||||| |||||||||| TTGGAACAGA GTGTAAGAAA AGAAAGAAAA C--TCTAATC TTGTGGTTCT AAATTAAAGT 613 TATGTCAAAT GTATCAAAAT GTTCTTAAAT CTTGTGGTCT TAAACATGTC ACGTGAAAAG 28978 |||||||||| ||| |||||| | |||| ||| |||||||||| |||| ||| | ||||| |||| TATGTCAAAT GTACCAAAAT GCTCTTTAAT CTTGTGGTCT TAAATATGAC ACGTGGAAAG 673 TTAAAATTAA ATTCTTTTTA AAAAAATTAA ATAAAAAATA AGAATATTCT TTTTTGAAAC 29038 ||||| || | | | || | |||||| || | TTAAAGTTTA AGTGTTGCCA AAAAAAGAAA GGGGTCA... .......... .......... 710 ATAGGAAATA TTATTTATTC CTTAAAGTTT TTTTTTAATA GTATTTTACC TATCGATTTT 29098 .......... .......... .......... .......... .......... .......... 710 AGTAAATCAA AGGTACAATT CTTATTATTA GATGACTGCA TGTATTAACA ATTAAATTGA 29158 .......... .......... .......... .......... .......... .......... 710 AGTTTTCAAA ATATAATTAA TTTAACAACA ATATACCTAT GATAAATATT ATTTTAAAAG 29218 .......... .......... .......... .......... .......... .......... 710 GAATGTCTGG TTAAAATGGA TCATCATCAA AGTTATAGTA TTAATAATAT TAAAATAAAG 29278 .......... .......... .......... .......... .......... .......... 710 AAAGATGGAA TACTTTATAT TATCACTTAC ATTGTTCGGT ACCTACTCAT TGATGAGAAA 29338 .......... .......... .......... .......... .......... .......... 710 ATTGAAAAAT TGGTAGAATT TATAGAGGTG CGTTTGAATT TTGAAATTTA AATAAATTTA 29398 .......... .......... .......... .......... .......... .......... 710 TTTTAATTTA TAAAAATATT TAAATTATTA AATATTGTGA TTTGTATAAA TATTTAAATT 29458 .......... .......... .......... .......... .......... .......... 710 ATTAAATATT GTGATTTGTA TAAATATATT TATAAATTTA AATAAATCTA TTTTAATTTG 29518 .......... .......... .......... .......... .......... .......... 710 TAACTATTTT ATAAAAAAAA TTAAATATTT TAAACTATTA AATATTGTGA CTTGTACAAA 29578 .......... .......... .......... .......... .......... .......... 710 TATATTTATG TAGTTTATAA ATATAAAAAA AATTAAAAAT TTCATATATA AATTTTCAGT 29638 | .......... .......... .......... .......... .......... .........T 711 TAAATTTAAA ATAATTTGAC TCTCGAAAAA TGAAACTAGT ACATGACACG GTAATCGAGG 29698 | ||| || | || | | |||| | ||| | | ||| ||| | | ||| TCTTTTTTAA ACAAACTAAA AAGGGAAATA AGAACATTCT TTTTGAAACG G--AGGGAG. 768 TTGGTGTATG TCTCCCCATC TTCTTGATTT TTTATTTGAA TTATGCCCTT CTGGAGAGTT 29758 .......... .......... .......... .......... .......... .......... 768 ATGGGCTCAT TTTGATCATT TCATACTTTT CTTTGCTCAT AAGATAGCGA TAAGGTTCTC 29818 .......... .......... .......... .......... .......... .......... 768 TCTTTATTTG TCTATGTCTT GTACACTTCA CATTTTTTTT TAGTAATGTC GATAGGTTAA 29878 .......... .......... .......... .......... .......... .......... 768 AATATCATCA TTGTTGGTGG GCTAAAACCC ATAAGGGTGG AAGGGGTAAT TTTTTTTAAT 29938 .......... .......... .......... .......... .......... .......... 768 AATTTGGTAT ATTAAATTTT CGTTAGGTGT AAAATTTAAG AAAAAATTAA ATATAATATA 29998 .......... .......... .......... .......... .......... .......... 768 AAATAAAATA TAAAATAAGA GGTTATAATT TATAATTTAA ACTGTGACTT TTTAATTACA 30058 .......... .......... .......... .......... .......... .......... 768 TGATAATAAC TTTATTAGTT ATCATAACAG CGGATGAAAG TTCGTTTTTT AATGTGTCGA 30118 .......... .......... .......... .......... .......... .......... 768 TATAAGGGTG CAAGGGGTTG AACTCTTTAA TTTTGTTGAT AGGTAAAAAC AATTGGGGTT 30178 .......... .......... .......... .......... .......... .......... 768 CAATTCTTTA GTTGTGTTTG ATAATAGGGC AAAAAATAAT TAGACTTTTC ATATGTATAT 30238 .......... .......... .......... .......... .......... .......... 768 AAGTTTTTTT TTCTCTTATT TAAATTTCTA ATAACACCAT TCGTTTGATT ATTACTTTAT 30298 .......... .......... .......... .......... .......... .......... 768 ATTAAATATT GGGGTTATTG GCATACTAAT TTGATTTTTG ATAATTGTGT TGAATTATGC 30358 .......... .......... .......... .......... .......... .......... 768 ATGTGCACCT AACTCCTAAG TTAGCAAAGA ATTGAAAAAT ATTTTGCAAG GAAGCTATTT 30418 .......... .......... .......... .......... .......... .......... 768 TCACAATTTA AATTTTTAAT CTTTCGATTA CAATAATAAC TTTATATATT ATAACGATAG 30478 .......... .......... .......... .......... .......... .......... 768 AGAAATTTCA ATTTTTTTAA TTATGTTGAT AGGTATAAGG TTGTAGGGGT CAATACTTTA 30538 .......... .......... .......... .......... .......... .......... 768 AATTTTGTTG ATAGGGTTAA ACTTACCAAG GTTTGTGTTT CTAATAACGC AATTCGTTGG 30598 .......... .......... .......... .......... .......... .......... 768 GTTATTAGTA ACACTAACTT AAAATTGGTT TATCATATTG AATTATTTAC TCAAATATTT 30658 .......... .......... .......... .......... .......... .......... 768 AGCCAATATT AGGTACCATA ATATTATTCC CTTCGTTCAT TTTATGTGAC ATCATTTCTT 30718 .......... .......... .......... .......... .......... .......... 768 TTTTTGTCAG TCCCAAAAAG AATGTCACAT TTTCTTATAT GATAACTATT TAAAGGTAAT 30778 .......... .......... .......... .......... .......... .......... 768 ATCCTTATTG GTCCCACTTC ATTTTAAAAA TAGTACTCTA ATAAGAGAAA AAAAAGAAGT 30838 .......... .......... .......... .......... .......... .......... 768 AAGTTTACTA AGAATAATTT AGTAAACATT TTAAAATAAT CTTGCCACAT AAAATAAAAT 30898 .......... .......... .......... .......... .......... .......... 768 AAAGGAAGTA TAATTTAAGA AAATAATAAA CAGTTAAAAT AACTTTCAAA GTAATTTAAC 30958 .......... .......... .......... .......... .......... .......... 768 CACATTTATT GGAACTAAGA GAGTATAATT TAAGAAAATT GTAACAGTTA AAATGACTTT 31018 .......... .......... .......... .......... .......... .......... 768 CAAAGTAATC TTAACCACAT TTTTTCGATT TCCACAGTAA TTTAACAACA TTTAGAGAGT 31078 .......... .......... .......... .......... .......... .......... 768 CTTAAACTTA CTTTATAAAC AACCCAGAAA ATTTTGTTAT TAGTATTGTT GGGATCTTCA 31138 .......... .......... .......... .......... .......... .......... 768 GTTAAGAAAA TCCCTCCAAA AATTAGCGTA TGCCACTAAT TGCAACGCGC ACACGGTATT 31198 .......... .......... .......... .......... .......... .......... 768 CTTTAGTAAA AGGCGAAAGA ATAGTTCGCT ATGTGCTCAC TGCGTGGCTG CGTAAATTAT 31258 .......... .......... .......... .......... .......... .......... 768 ATGATATTAG CACGAGTCTC TTATATGTTG CTTGTAGCTG TTGGCTTACT CACCTACATT 31318 .......... .......... .......... .......... .......... .......... 768 ACGATGTGGT ACTTTATCAA CAACACTTCC ATTTATGAAA TTCCAGTATA TGGGCTTTGT 31378 .......... .......... .......... .......... .......... .......... 768 GCCTTTGTTG GACCCAGTAC TTACAAGTTT GGGAGGGCCA CTTTCTGTAT TAGCCCACTT 31438 .......... .......... .......... .......... .......... .......... 768 TTTCATTGTA ATGTAAACAA TTCTATAATG ATTAGAAAAA CAGGTCAAAT GTGTTTGGGA 31498 .......... .......... .......... .......... .......... .......... 768 TGAAAAATAA GTTTTTCCCA GCAAATGTCA TCTTGAAAAA CAATGTTTTT CAATAATTTT 31558 .......... .......... .......... .......... .......... .......... 768 TTAGTGGTAG ATAATTAAGA AAGAAAATAT TATTCAAAAG AGAATCTATA TGTTGTTTAA 31618 .......... .......... .......... .......... .......... .......... 768 CAAAATATGA TAGAGGTTGA ATGGATTGGT GTATGGGGGT GGGGGGAGGG GATGATCAAG 31678 .......... .......... .......... .......... .......... .......... 768 GGGTGAGAGT TGAGGAGATT GAATGGGCGG GAAGGAGACG ATACATTTGG AATAACACTT 31738 .......... .......... .......... .......... .......... .......... 768 ATGTAATTGT TTTTTTTTTT CATTAAAGAC CTTATTTTTC TAAATATTTT GACCAGCTAA 31798 .......... .......... .......... .......... .......... .......... 768 ACATAAAAAA ATAATAATAG AAAAACATTT TTTTTTGAAT ATTTTCCTCC ATGCTGAACA 31858 .......... .......... .......... .......... .......... .......... 768 TAAAATTGTA AGCTATTCTC TATTATACTA ATCATAAGAT TGTAAAATAT CTTTTGATTT 31918 .......... .......... .......... .......... .......... .......... 768 TTCGAACATA ATAAGTAAAA GTCGAAATAT AAATGTACGT ATTTGAATCT ACATTTAAGA 31978 .......... .......... .......... .......... .......... .......... 768 CAATGGATTA AAACCTCATT TGTTTTCACT AAGATGAAAA ATGACTGAAT ATAAATGTAC 32038 .......... .......... .......... .......... .......... .......... 768 TTGTTAGAAT AATTATGATC ATGTATATAG ATTTGAATAC TGAATAATAT ATTTTAATAT 32098 .......... .......... .......... .......... .......... .......... 768 GTCGCTTTGA GCCTACACAT TGAAAATGAC CAACACTCGA TATCATTGTT GCTCTAAGCG 32158 .......... .......... .......... .......... .......... .......... 768 TGTACCCATA ACGTGACATT GGACATGAGC ATACATCAAG AATGAGTGAA GGTGCAGTTT 32218 .......... .......... .......... .......... .......... .......... 768 AAAAGCATTT AAAGAGACAA GTTCAGTAAA AGAAAACACT TGTATAACAC ATAAGTTGTG 32278 .......... .......... .......... .......... .......... .......... 768 AAATTGGAAT CGTTAAAAAA AGACACGAAA ATTCAACCTG ACACATAACC ATGTATCTCT 32338 .......... .......... .......... .......... .......... .......... 768 CTATTAAACC TCTAAACATG ATCGAGAAGT ATCTATTAGG ACAAGGCCCT TGGTATTACC 32398 .......... .......... .......... .......... .......... .......... 768 TTAAACACAT AATTATAAAG TACTAGAAAT ATGTATATTA AGTCTTTTGA GAAACAGAGA 32458 .......... .......... .......... .......... .......... .......... 768 AGGGCTCCCC AATTAGCAGA AAATATGGGA CCCAGTGAGC GACTTGATCA CATGTATGTG 32518 .......... .......... .......... .......... .......... .......... 768 TATATGTAAC ACTCTCGATA AAAGTGATGT AATTACATAT TAAATAGTAT TTGTATCTAA 32578 .......... .......... .......... .......... .......... .......... 768 ACATATCGGT ACATGCCAGA TCAATACAAA CATATAGCAT AGTATAACAT ATGAAGATTA 32638 .......... .......... .......... .......... .......... .......... 768 AGAGATAAGG TCGTTAATAC TACCTTAATG GTACATCTTT TTATCATATT TTTTTTTATC 32698 .......... .......... .......... .......... .......... .......... 768 AATAATAAGA TTCTGAAAAT ATAATAAACG ATACAAATAC TATGTGAATG CATGAAGTTT 32758 .......... .......... .......... .......... .......... .......... 768 GATCTACCCA ATCAACTAAG GTGGAGACAT ATGTGTACAT ATCATGTGGA TCTATAAGTA 32818 .......... .......... .......... .......... .......... .......... 768 TAACTATCCC TTATCAGAAG CATAAAAGAA CAATTCCCCC AAATCAACAA GATATTATCC 32878 .......... .......... .......... .......... .......... .......... 768 TTAACAGTGT TGGCAAATGA AGTTTTATAC GTTCATCATT TAGATTCATG CACTTTCGAT 32938 .......... .......... .......... .......... .......... .......... 768 CAACTATAAT TGTCTCTAGG TCATATGTTC ACAGATTAGC CCAAAAATTT ATATTCATGA 32998 .......... .......... .......... .......... .......... .......... 768 CTTTTAATGA ACGTAGTTCA TAACTTAAGT TCATCTTATC ATATCATGAA ATTATATTTA 33058 | || |||| |||| || | | ||||| || .......... .......... .........T GGAT-TTAT- ATAT-AT-AT A-TATATATA 794 TATATATTAT AATTGTTTAA CAATTTTGCC ATATAAA 33095 |||||||| | ||| |||| || ||| ||||||| TATATATT-T -ATTTATTAA TAA-ATTG-A ATATAAA 827 hqPGS_C09HBa0099P03.1-1+_SGN-U336521+ (24880 24904,28814 29015,29638 29697,33028 33095) ******************************************************************************** EST sequence 12 +strand 1381 n (File: SGN-U321959+) 1 ATTACTTAGT TTTCTCCATC TCTACCATGG CTGACGCAGC TGCAACGCCG CCTACTGATC 61 CAGCAAGCAC GGCGCCACCT GCTACGACTG ATCCAGCAAG CACGACGCCG CCTGCTAGTA 121 CTGATCCGCC TACTAGTACT GATCCAGCAA GCACGACGCC GCCGACTAGT ACTGATCCAG 181 TAGCAGAAGA TGAGAGGAAA CTCGAATATC TGGATTTCGT TCAAGTTGCA GCGATCTATA 241 TGATTGTTTG CTTCTCAACT TTGTATGAAT ACGGCAAAGA AAACTCCGGC CCGTTGAAAC 301 CTGGTGTACA GGCCGTAGAA GCCACTGTTA AAACTGTTAT CGGACCGGTT TATGAGAAGT 361 TCCATAACGT TCCTTTCAAT CTCCTCAAGT TCATCGACCA AAAGGTTGCA GACTTGATGA 421 CAGAAGTTGA AAGCCATGTG CCTTCTCTAC TAAAGCAGAC ATCATCTAAA GCTCTGTTAA 481 TAGCTCAGAA GGCTCCAGAA TTAGCTCGAG ATCTCGCCGG CGAGGTACAT CACGATGGCT 541 TAGTGGACAC AGCGAGCAAC GTAGCTAAAA CACTCTACAC AAAGTACGAA CCCACAGTCA 601 AGGAGCTATA CACAAAATAC GAGCCAGTGA TCGAGAAAAA CGCGGTTTTG GCATGGAGAT 661 CTCTGAATAA GCTTCCTTTG TTCCCTCAAG TGGCTCAGAT TTTGGTGCCG ACGGCTGCTT 721 ATTGGAGTGA GAAATACAAT CAAGCGGTGA CATATGCGTC GGAGAACGGC TATGCCGCGG 781 CGCATTATTT TCCGATTATT CCCGCAGAGA GGATCGCGAA GGTGTTTGAA GGCGGCGCCA 841 CCGCTGAGAA CGAGCAGTCC GTTCCCTTGA CCGACGGCAC TGTTGCTCCG GCGCAATGAC 901 CATTTTACAT TACTGTACTG GGGGTTGGGC TTAATGGGGC CGCCAGGTTA ATAACTTGGT 961 TTATGTTTGT CGGTAGTAAT TGTTCATCTT TATATCGTAA GCCACTTCTT AAGTTCCTTA 1021 ATATATTGTA TTTAGTAATT ATCCTAAATC CCCAAATTTA AATTCGTGAA AAAACCATAT 1081 TGTATCAACT TTCAATCAAG CAGGGGAAAC AAAATCCAAG AATAAAGAGG TTTTAGGGAG 1141 TATCGATGGT AAAGAACATC AAAAGAAAAG ATTTGATTTT GTGCCCCTTC CCAAAGTTAT 1201 TGGAATAAGA CGTGTGTTTT TTAAAAAAAA AAAAAAAAAA AAAAAAAAAA AACTCGAGAG 1261 GAGAGAGGAG AGAGAACTAG TCTCTGATAA CGTGCATTGC ACGTTTGTCC CTTACTATTA 1321 TTATAATTTT TTTTATTTAA AAGTGAGTTA TAAAATATGA AATTACAAAA AAAAAAAAAA 1381 A Predicted gene structure (within gDNA segment 27790 to 22504): Exon 1 27110 26940 ( 171 n); cDNA 1 168 ( 168 n); score: 0.865 Intron 1 26939 26630 ( 310 n); Pd: 1.000 (s: 0.56), Pa: 0.000 (s: 0) Exon 2 26629 26620 ( 10 n); cDNA 169 179 ( 11 n); score: 0.350 Intron 2 26619 26506 ( 114 n); Pd: 0.762 (s: 0), Pa: 0.996 (s: 0.96) Exon 3 26505 26281 ( 225 n); cDNA 180 404 ( 225 n); score: 0.978 Intron 3 26280 26018 ( 263 n); Pd: 0.994 (s: 0.98), Pa: 0.958 (s: 1.00) Exon 4 26017 25193 ( 825 n); cDNA 405 1223 ( 819 n); score: 0.939 Intron 4 25192 25016 ( 177 n); Pd: 0.000 (s: 0.63), Pa: 0.978 (s: 0.56) Exon 5 25015 24950 ( 66 n); cDNA 1224 1290 ( 67 n); score: 0.538 PPA cDNA 1365 1381 MATCH C09HBa0099P03.1-1- SGN-U321959+ 0.915 1297 0.939 C PGS_C09HBa0099P03.1-1-_SGN-U321959+ (27110 26940,26629 26620,26505 26281,26017 25193,25015 24950) Alignment (genomic DNA sequence = upper lines): ATTACTTAGT TTTTCTCCAT CTCTACCATG GCTGACGCAG CTGCAACGCC GCCTACTGAT 27051 ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTACTTAG- TTTTCTCCAT CTCTACCATG GCTGACGCAG CTGCAACGCC GCCTACTGAT 59 CCAGCAAGCA CGGCGCCACC TGCTACGACT GATCCAGCAA GCACGACGCC GCCTGCTAGT 26991 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCAGCAAGCA CGGCGCCACC TGCTACGACT GATCCAGCAA GCACGACGCC GCCTGCTAGT 119 ACTGATCCAG CAAACACGAC GCCGCCTACT AGTACTGATC CAGCTGATCC AGTAAGTATA 26931 |||||||| | | | || || | || || || | || || | | ACTGATCCGC CTACTAGTAC TGATCCAGCA AGCAC-GACG CCGCCGA-CT A......... 168 CAATGTCTTC TACGTTTATC ATGATATTAC AGTTAAATCA ATTTTAATTT TCAAACACAC 26871 .......... .......... .......... .......... .......... .......... 168 ACCGTTGTAG CGTGGTTGTA ACTTCATATT CTGTTATTCA GGCTTCACAT ATTAGATCTA 26811 .......... .......... .......... .......... .......... .......... 168 AGTTGTTCAA TTACTCAATT TAATTGCCAG AATGAATTGT TATCTTCTTA TTTTGAGGTG 26751 .......... .......... .......... .......... .......... .......... 168 TTAAATTAGG AATGAGTTGT TGTGCAATGC CTTCATGGTT GCCAATTTCA TCTTCGCAAT 26691 .......... .......... .......... .......... .......... .......... 168 ACATATGGTC ACTCGACTGA TAAAAAATAT ATCGAGAAAG AGTAGTTAGA TTTGATTAAA 26631 .......... .......... .......... .......... .......... .......... 168 TGCATTGA-C AGGTAGAATA CTAGTATAAT TTGTTGAATT TAATTCGAGA TGTGTTTTCT 26572 | | ||| | .GTACTGATC CA........ .......... .......... .......... .......... 179 CAACATAGGT GAGATTCAGA ATTTTTGAGT TTTAAATCTG TACCTGATTT GTGTTTATAA 26512 .......... .......... .......... .......... .......... .......... 179 TTGAAGGTAG CAGAAGATGA GAGGAAACTT AAATATCTGG ATTTCGTTCA AGTTGCAGCG 26452 |||| |||||||||| ||||||||| ||||||||| |||||||||| |||||||||| ......GTAG CAGAAGATGA GAGGAAACTC GAATATCTGG ATTTCGTTCA AGTTGCAGCG 233 ATCTATGTGA TTGTTTGCTT CTCAACTTTG TATGAATACG GCAAAGAAAA CTCCGGTCCG 26392 |||||| ||| |||||||||| |||||||||| |||||||||| |||||||||| |||||| ||| ATCTATATGA TTGTTTGCTT CTCAACTTTG TATGAATACG GCAAAGAAAA CTCCGGCCCG 293 TTGAAACCTG GTGTACAGGC CGTAGAAGCC ACTGTTAAAA CTGTTATCGG ACCGGTTTAT 26332 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGAAACCTG GTGTACAGGC CGTAGAAGCC ACTGTTAAAA CTGTTATCGG ACCGGTTTAT 353 GAGAAGTTCC ATAACGTTCC TTTCAATCTC CTCAAGTTCA TCGACCTAAA GGTTCACCTT 26272 |||||||||| |||||||||| |||||||||| |||||||||| |||||| ||| | GAGAAGTTCC ATAACGTTCC TTTCAATCTC CTCAAGTTCA TCGACCAAAA G......... 404 CTTTCTCCAA AATGAATTTT ACCCATACTG TCTACACCAT ACTTATTTTG CAGCTGAAAG 26212 .......... .......... .......... .......... .......... .......... 404 TTTGCTGGAA TTGAGAAGGT ATTTGTATGG CATTCAGAAA TTGTCTTCTC CAACTAATCG 26152 .......... .......... .......... .......... .......... .......... 404 AAATCCTAGA TCTAGCTCAA TCTGGAAGAA TGTGCATGGA CATGCACATG ATAATTCACT 26092 .......... .......... .......... .......... .......... .......... 404 AACATAGAAA CGAATGTTGT AAACTTTGAA CTCGAATTTC AGTGGCAAGG ATTTGAAAGT 26032 .......... .......... .......... .......... .......... .......... 404 TTGAATCTAT GCAGGTTGCA GACTTGATGA CAGAAGTTGA AAGCCATGTG CCTTCTCTAC 25972 |||||| |||||||||| |||||||||| |||||||||| |||||||||| .......... ....GTTGCA GACTTGATGA CAGAAGTTGA AAGCCATGTG CCTTCTCTAC 450 TAAAGCAGAC ATCATCTAAA GCTCTGTTAA TAGCTCAGAA GGCTCCAGAA TTAGCTCGAG 25912 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TAAAGCAGAC ATCATCTAAA GCTCTGTTAA TAGCTCAGAA GGCTCCAGAA TTAGCTCGAG 510 ATCTCGCCGG CGAGGTACAG CACGATGGCT TAGTGGACAC AGCGAGCAAC GTAGCTAAAA 25852 |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATCTCGCCGG CGAGGTACAT CACGATGGCT TAGTGGACAC AGCGAGCAAC GTAGCTAAAA 570 CACTCTACAC AAAGTACGAA CCCACAGTCA AGGAGCTATA CACAAAATAC GAGCCAGTGA 25792 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CACTCTACAC AAAGTACGAA CCCACAGTCA AGGAGCTATA CACAAAATAC GAGCCAGTGA 630 TCGAGAAAAA TGCGGTTTTG GCATGGAGAT CTCTGAATAA GCTTCCTTTG TTCCCTCAAG 25732 |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCGAGAAAAA CGCGGTTTTG GCATGGAGAT CTCTGAATAA GCTTCCTTTG TTCCCTCAAG 690 TGGCTCAGAT TTTGGTGCCG ACGGCTGCTT ATTGGAGTGA GAAATACAAT CAAGCGGTGA 25672 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGGCTCAGAT TTTGGTGCCG ACGGCTGCTT ATTGGAGTGA GAAATACAAT CAAGCGGTGA 750 CATATGCGTC GGAGAACGGC TATACGGCGG CGCATTATTT ACCGATTATT CCCGTAGAGA 25612 |||||||||| |||||||||| ||| | |||| |||||||||| ||||||||| |||| ||||| CATATGCGTC GGAGAACGGC TATGCCGCGG CGCATTATTT TCCGATTATT CCCGCAGAGA 810 GGATCGCGAA GGTGTTTGAA GGCGGCGCCA CCGCTGAAAA CGAGCAGTCC GTTCCCTTGA 25552 |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| |||||||||| GGATCGCGAA GGTGTTTGAA GGCGGCGCCA CCGCTGAGAA CGAGCAGTCC GTTCCCTTGA 870 CCGACGGCAC TGTTGCTCCG GCGCAATGAC CATTTTACAT TACTGTACTG GGGGGGGGGG 25492 |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| CCGACGGCAC TGTTGCTCCG GCGCAATGAC CATTTTACAT TACTGTACT- ---------- 919 GGGGGGGGGG CTTAAT-GGG CCGTCAGGTT AATAACTTGG TTTATGTTTG TCGGTAGTAA 25433 ||||| ||| |||||| ||| ||| |||||| |||||||||| |||||||||| |||||||||| GGGGGTTGGG CTTAATGGGG CCGCCAGGTT AATAACTTGG TTTATGTTTG TCGGTAGTAA 979 TTGTTCATCT TTATATCGTG AGCCACTTCT TAAGTTCCTT AATATATTGT ATTTAGTAAT 25373 |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGTTCATCT TTATATCGTA AGCCACTTCT TAAGTTCCTT AATATATTGT ATTTAGTAAT 1039 TATCCTAAAT CACCAAATTT AAATTCTTGA ACATATCATA TTGTATCAAC TTTTAATCAA 25313 |||||||||| | |||||||| |||||| ||| | | | |||| |||||||||| ||| |||||| TATCCTAAAT CCCCAAATTT AAATTCGTGA AAAAACCATA TTGTATCAAC TTTCAATCAA 1099 GCAGGTGAAA CAAAATACAA GAATAAAGAG GTTTTAGGGA GTATCGATGG TAAAGAACAT 25253 ||||| |||| |||||| ||| |||||||||| |||||||||| |||||||||| |||||||||| GCAGGGGAAA CAAAATCCAA GAATAAAGAG GTTTTAGGGA GTATCGATGG TAAAGAACAT 1159 CAAAAGAAAT GATTTGATTT TGTAACTTTT CACAAAG--- TT-GAATAAG ACGTGCGTTT 25197 ||||||||| |||||||||| ||| | || | ||||| || ||||||| ||||| |||| CAAAAGAAAA GATTTGATTT TGTGCCCCTT CCCAAAGTTA TTGGAATAAG ACGTGTGTTT 1219 ATTAGATTTC GATCAGTCTA GGAAAATATT TGTACATTGT CCCGAGATCA TTTCCATCAT 25137 ||| TTTA...... .......... .......... .......... .......... .......... 1223 ATTTTTAAGC CTCACGTGTT TAAATCGAAG AAATTTTCAC ATATAGCCAC TTAAATATAA 25077 .......... .......... .......... .......... .......... .......... 1223 TTAATTACTC TCCATAGATA TAGTTTGATA ATTATAATTT GTAGCTACAT GTTATATGAA 25017 .......... .......... .......... .......... .......... .......... 1223 GAAAAGAGAT GAGCGAGACT AGGAGAGTGA CGAGAGAGGC GAGAGAGGGG AGAA-AAGAG 24958 |||| | | | | | | | | | | |||||| ||||| | | |||| || .AAAAAAAAA AAAAAAAAAA AAAAAAAAAA CTCGAGAGGA GAGAGGAGAG AGAACTAGTC 1282 TGGGAGAA 24950 | || || TCTGATAA 1290 hqPGS_C09HBa0099P03.1-1-_SGN-U321959+ (27110 26940,26629 26620,26505 26281,26017 25193,25015 24950) ******************************************************************************** EST sequence 13 +strand 819 n (File: SGN-U321960+) 1 TCGGCCGAAT TGGAAGCTCT CCTGTAAAAC TCCATTACTT AGTTTTTCTC CATCTCTACC 61 ATGGCTGACG CAGCTGCAAC GCCGCCTACT GATCCAGCAA GCACGGCGCC ACCTGCTACG 121 ACTGATCCAG CAAGCACGAC GCCGCCTGCT AGTACTGATC CAGCAAACAC GACGCCGCCT 181 ACTAGTACTG ATCCAGCTGA TCCAGTAGCA GAAGATGAGA GGAAACTTAA ATATCTGGAT 241 TTCGTTCAAG TTGCAGCGAT CTATGTGATT GTTTGCTTCT CAACTTTGTA TGAATACGGC 301 AAAGAAAACT CCGGTCCGTT GAAACCTGGT GTACAGGCCG TAGAAGCCAC TGTTAAAACT 361 GTTATCGGAC CGGTTTATGA GAAGTTCCAT AACGTTCCTT TCAATCTCCT CAAGTTCATC 421 GACCTAAAGG TTGCAGACTT GATGACAGAA GTTGAAAGCC ATGTGCCTTC TCTACTAAAG 481 CAGACATCAT CTAAAGCTCT GTTAATAGCT CAGAAGGCTC CAGAATTAGC TCGAGATCTC 541 GCCGGCGAGG TACAGCACGA TGGCTTAGTG GACACAGCGA GCAACGTAGC TAAAACACTC 601 TACACAAAGT ACGAACCCAC AGTCAAGGAG CTATACACAA AATACGAGCC AGTGATCGAG 661 AAAAATGCGG TTTTGGCATG GAGATCTCTG AATAAGCTTC CTTTGTTCCC TCAAGTGGCT 721 CAGATTTTAG TGCCGACGGC TGCTTATTGG AGTGAGAAAT ACAATCAAGC GGTGACATAT 781 GCGTCGGAGA ACGGCTATAC GGCGGCGCAT TATTTACCG Predicted gene structure (within gDNA segment 27977 to 25018): Exon 1 27135 26940 ( 196 n); cDNA 10 204 ( 195 n); score: 0.990 Intron 1 26939 26506 ( 434 n); Pd: 1.000 (s: 1.00), Pa: 0.996 (s: 1.00) Exon 2 26505 26281 ( 225 n); cDNA 205 429 ( 225 n); score: 1.000 Intron 2 26280 26018 ( 263 n); Pd: 0.994 (s: 1.00), Pa: 0.958 (s: 1.00) Exon 3 26017 25628 ( 390 n); cDNA 430 819 ( 390 n); score: 0.997 MATCH C09HBa0099P03.1-1- SGN-U321960+ 0.996 811 0.990 C PGS_C09HBa0099P03.1-1-_SGN-U321960+ (27135 26940,26505 26281,26017 25628) Alignment (genomic DNA sequence = upper lines): TTCGAAGCTC TCCTGTACAA ACTCCATTAC TTAGTTTTTC TCCATCTCTA CCATGGCTGA 27076 || ||||||| ||||||| || |||||||||| |||||||||| |||||||||| |||||||||| TTGGAAGCTC TCCTGTA-AA ACTCCATTAC TTAGTTTTTC TCCATCTCTA CCATGGCTGA 68 CGCAGCTGCA ACGCCGCCTA CTGATCCAGC AAGCACGGCG CCACCTGCTA CGACTGATCC 27016 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CGCAGCTGCA ACGCCGCCTA CTGATCCAGC AAGCACGGCG CCACCTGCTA CGACTGATCC 128 AGCAAGCACG ACGCCGCCTG CTAGTACTGA TCCAGCAAAC ACGACGCCGC CTACTAGTAC 26956 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGCAAGCACG ACGCCGCCTG CTAGTACTGA TCCAGCAAAC ACGACGCCGC CTACTAGTAC 188 TGATCCAGCT GATCCAGTAA GTATACAATG TCTTCTACGT TTATCATGAT ATTACAGTTA 26896 |||||||||| |||||| TGATCCAGCT GATCCA.... .......... .......... .......... .......... 204 AATCAATTTT AATTTTCAAA CACACACCGT TGTAGCGTGG TTGTAACTTC ATATTCTGTT 26836 .......... .......... .......... .......... .......... .......... 204 ATTCAGGCTT CACATATTAG ATCTAAGTTG TTCAATTACT CAATTTAATT GCCAGAATGA 26776 .......... .......... .......... .......... .......... .......... 204 ATTGTTATCT TCTTATTTTG AGGTGTTAAA TTAGGAATGA GTTGTTGTGC AATGCCTTCA 26716 .......... .......... .......... .......... .......... .......... 204 TGGTTGCCAA TTTCATCTTC GCAATACATA TGGTCACTCG ACTGATAAAA AATATATCGA 26656 .......... .......... .......... .......... .......... .......... 204 GAAAGAGTAG TTAGATTTGA TTAAATGCAT TGACAGGTAG AATACTAGTA TAATTTGTTG 26596 .......... .......... .......... .......... .......... .......... 204 AATTTAATTC GAGATGTGTT TTCTCAACAT AGGTGAGATT CAGAATTTTT GAGTTTTAAA 26536 .......... .......... .......... .......... .......... .......... 204 TCTGTACCTG ATTTGTGTTT ATAATTGAAG GTAGCAGAAG ATGAGAGGAA ACTTAAATAT 26476 |||||||||| |||||||||| |||||||||| .......... .......... .......... GTAGCAGAAG ATGAGAGGAA ACTTAAATAT 234 CTGGATTTCG TTCAAGTTGC AGCGATCTAT GTGATTGTTT GCTTCTCAAC TTTGTATGAA 26416 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTGGATTTCG TTCAAGTTGC AGCGATCTAT GTGATTGTTT GCTTCTCAAC TTTGTATGAA 294 TACGGCAAAG AAAACTCCGG TCCGTTGAAA CCTGGTGTAC AGGCCGTAGA AGCCACTGTT 26356 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TACGGCAAAG AAAACTCCGG TCCGTTGAAA CCTGGTGTAC AGGCCGTAGA AGCCACTGTT 354 AAAACTGTTA TCGGACCGGT TTATGAGAAG TTCCATAACG TTCCTTTCAA TCTCCTCAAG 26296 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAACTGTTA TCGGACCGGT TTATGAGAAG TTCCATAACG TTCCTTTCAA TCTCCTCAAG 414 TTCATCGACC TAAAGGTTCA CCTTCTTTCT CCAAAATGAA TTTTACCCAT ACTGTCTACA 26236 |||||||||| ||||| TTCATCGACC TAAAG..... .......... .......... .......... .......... 429 CCATACTTAT TTTGCAGCTG AAAGTTTGCT GGAATTGAGA AGGTATTTGT ATGGCATTCA 26176 .......... .......... .......... .......... .......... .......... 429 GAAATTGTCT TCTCCAACTA ATCGAAATCC TAGATCTAGC TCAATCTGGA AGAATGTGCA 26116 .......... .......... .......... .......... .......... .......... 429 TGGACATGCA CATGATAATT CACTAACATA GAAACGAATG TTGTAAACTT TGAACTCGAA 26056 .......... .......... .......... .......... .......... .......... 429 TTTCAGTGGC AAGGATTTGA AAGTTTGAAT CTATGCAGGT TGCAGACTTG ATGACAGAAG 25996 || |||||||||| |||||||||| .......... .......... .......... ........GT TGCAGACTTG ATGACAGAAG 451 TTGAAAGCCA TGTGCCTTCT CTACTAAAGC AGACATCATC TAAAGCTCTG TTAATAGCTC 25936 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGAAAGCCA TGTGCCTTCT CTACTAAAGC AGACATCATC TAAAGCTCTG TTAATAGCTC 511 AGAAGGCTCC AGAATTAGCT CGAGATCTCG CCGGCGAGGT ACAGCACGAT GGCTTAGTGG 25876 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGAAGGCTCC AGAATTAGCT CGAGATCTCG CCGGCGAGGT ACAGCACGAT GGCTTAGTGG 571 ACACAGCGAG CAACGTAGCT AAAACACTCT ACACAAAGTA CGAACCCACA GTCAAGGAGC 25816 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACACAGCGAG CAACGTAGCT AAAACACTCT ACACAAAGTA CGAACCCACA GTCAAGGAGC 631 TATACACAAA ATACGAGCCA GTGATCGAGA AAAATGCGGT TTTGGCATGG AGATCTCTGA 25756 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TATACACAAA ATACGAGCCA GTGATCGAGA AAAATGCGGT TTTGGCATGG AGATCTCTGA 691 ATAAGCTTCC TTTGTTCCCT CAAGTGGCTC AGATTTTGGT GCCGACGGCT GCTTATTGGA 25696 |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| |||||||||| ATAAGCTTCC TTTGTTCCCT CAAGTGGCTC AGATTTTAGT GCCGACGGCT GCTTATTGGA 751 GTGAGAAATA CAATCAAGCG GTGACATATG CGTCGGAGAA CGGCTATACG GCGGCGCATT 25636 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTGAGAAATA CAATCAAGCG GTGACATATG CGTCGGAGAA CGGCTATACG GCGGCGCATT 811 ATTTACCG 25628 |||||||| ATTTACCG 819 hqPGS_C09HBa0099P03.1-1-_SGN-U321960+ (27135 26940,26505 26281,26017 25628) ******************************************************************************** EST sequence 7 -strand 1415 n (File: SGN-U344324-) 1 AATAAAAAAA AAAAAAAAAA AAGGTTTTTT TTTTTTTTTT TTTTTTAAAA CAAGTTGGGT 61 AAAGAAAAAC ATCCTTTTCA AGAGAAAAGA TAATTCTTTT TCGGGGGGTT CACAAAATTT 121 TTTTGGCGGA GGGGCTGGCC TTCCGAATAG GAAAGGGTCA GAGGTGTCGG GTTCGGGTTT 181 TCGTTTTAGA AATTTGGTAC AATGGAATCG AAGGGTTGCT TCTGGTACCA ATCGGGTTCA 241 AATGAGGAGA AACCTGAAAG ATCCTTTCCG GTTCGGGGCA GCACGGACGA TCCGTATGAA 301 ATCCGCTTTT TTGAAGACTA GCAATGAGGA GAAATGGTAA AGAGAGGTAT TAAATATTTC 361 AAATTTTAAT TAATATATTC TAAAATTCGT AAAAGGAAGT AGTTTTATGA TTATAAAAAA 421 TTTAGACATA ATTTAACGGA TCAAAAAAGA GTTAACGTGT GATAAGGTAC CATGTGAGCA 481 TGCGTTGCCA ACTTTTTCTG TTAACTAACG CAAGGGAACA GGGGATTACT TATCTTATTG 541 GACTTCCTTT TGTTTCAATT TCTCATTAAT TGATAAAAGA AAAAAAAACT ATTTTAATAT 601 TTTTAAACCC CCCAAAAAGG CAATATTGAT TGAATGTTTT TATATTACTA TTCAAAACGT 661 GTAATCTTTC TTTTCTTATT ATGTTCATTT TGAGTCCAAT GTAAAGTTAT ACTTTCTCGA 721 GTTAAAATTT ATTTAGCCTA ACCAAACATT TTTAACTTTT CAGAAATACT TTTATAAAGT 781 TGTACTTTCT TTATACTCTC TCTGTTTTAA AAAGAATGAC CTGATTTGAC TTGAAACAGT 841 ATTTAAAAAA AGGAAGAAGA CTTTTTAATC TTGTGGTTCT AAATTAAAGT TATGTCAAAT 901 ATACCAAAAC GCCTTTTAAT CTTGTGGTCT TAAACATGCC ACATGAAAAG TTAAAGTTGA 961 AGTGTCATCA AAACAGGAAA TGAGTCATTC TTTTTGAAAT AAACTAAAAA GAAAACATGA 1021 ACATTCTTTT TGAAACAGAG GAAGTACTAT ACTTCTTCCG CATGTGACAT AGTTTGAATT 1081 GACGAAGAGT TTAACAAAGA AATGATTTTT GAAAATTTTG ATATAAAACA TGACATAACA 1141 TTTGTGTGTG ACTATATATA AAAGTATTGA AACTTGCGAT CTTAAACATG TCATAACATT 1201 TGTGCGATAA TGAAATCAAA TTTCTCAATA AGAATAAAAT AAGAAGCATG AAATTAACAA 1261 TACAATATGT TACATTATAA AATAATACGT AATAATTTCT CCAAACTACT ATCAGGTTGA 1321 CAAAACCATG GAGAGTAGGA GAGGGAATGA TACCTATCTA ACTGGAATTC GGCCGATTTG 1381 GCCGCCAATC GCCTTTATAG GGTATACTCC TGCCN Predicted gene structure (within gDNA segment 19742 to 34336): Exon 1 26922 26929 ( 8 n); cDNA 765 772 ( 8 n); score: 0.750 Intron 1 26930 28797 (1868 n); Pd: 0.900 (s: 0), Pa: 0.000 (s: 0.76) Exon 2 28798 28983 ( 186 n); cDNA 773 955 ( 183 n); score: 0.828 Intron 2 28984 29873 ( 890 n); Pd: 0.000 (s: 0.86), Pa: 0.927 (s: 0) Exon 3 29874 29887 ( 14 n); cDNA 956 969 ( 14 n); score: 0.786 Intron 3 29888 33720 (3833 n); Pd: 0.000 (s: 0), Pa: 0.536 (s: 0) Exon 4 33721 33735 ( 15 n); cDNA 970 981 ( 12 n); score: 0.800 Intron 4 33736 33957 ( 222 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0) Exon 5 33958 33978 ( 21 n); cDNA 982 1002 ( 21 n); score: 0.714 PPA cDNA 46 24 MATCH C09HBa0099P03.1-1+ SGN-U344324- 0.828 244 0.172 C PGS_C09HBa0099P03.1-1+_SGN-U344324- (26922 26929,28798 28983,29874 29887,33721 33735,33958 33978) Alignment (genomic DNA sequence = upper lines): AAGACATTGT ATACTTACTG GATCAGCTGG ATCAGTACTA GTAGGCGGCG TCGTGTTTGC 26981 || || || AATACTTT.. .......... .......... .......... .......... .......... 772 TGGATCAGTA CTAGCAGGCG GCGTCGTGCT TGCTGGATCA GTCGTAGCAG GTGGCGCCGT 27041 .......... .......... .......... .......... .......... .......... 772 GCTTGCTGGA TCAGTAGGCG GCGTTGCAGC TGCGTCAGCC ATGGTAGAGA TGGAGAAAAA 27101 .......... .......... .......... .......... .......... .......... 772 CTAAGTAATG GAGTTTGTAC AGGAGAGCTT CGAAAAGGCA GGCAGTTGGC AAGCCCTCTT 27161 .......... .......... .......... .......... .......... .......... 772 TTATAGCGGA CGGGGCACGT AATAATCGTG TGCGGAGAAG TCGATGAAAA TTTTACAGGT 27221 .......... .......... .......... .......... .......... .......... 772 TTTCAATTAA TGTTTAAAAC ACAAAAATAA AGTGAATAAA TATTCTTAAT TAAAGTGATT 27281 .......... .......... .......... .......... .......... .......... 772 AGTATTCATG ACCATAAATA GAAAGAAATT ATTTAATAAT ATTTTTTTGT TAATTTTTAA 27341 .......... .......... .......... .......... .......... .......... 772 ATAAGGAAAA TTACACAAAT TGTCCTAAGT TAAGACTTTA CTCAATTTAT TTACTTTGAT 27401 .......... .......... .......... .......... .......... .......... 772 GGAACACATA ATCCAAATTA TTTACCTTAC TTTCCTCTTG TTCAATCACT AAATTTTTTT 27461 .......... .......... .......... .......... .......... .......... 772 TTAATTCTTG ATACGATTTA AATACACTTC GATGATATTA ATCAGTAATT GAGCAGTTTC 27521 .......... .......... .......... .......... .......... .......... 772 TTCTTTGATG TCATCAGAAT GATATTAAAG AGTAACTGAG TAGAGTTTTC ACTGAAACAC 27581 .......... .......... .......... .......... .......... .......... 772 TGTCTCTTTT TCTAAAATTT TACTCTAATG GTATCAAAGA ATAAGTGAAT CACTGCACTT 27641 .......... .......... .......... .......... .......... .......... 772 TTCTTCCTCT TTGATACCAC GAGTGTGTGT CTATCTATAT ATATACTTTG ATGGTATCGA 27701 .......... .......... .......... .......... .......... .......... 772 ATATTAAGGA CACATTAGTA GTTAAGAATA GGGGTATGAG AACAGTGGGG TAGCCACATT 27761 .......... .......... .......... .......... .......... .......... 772 ATGGTCAGAG TGTCCAAATT GACACGATTC GTCGGGAAAA AAAAATACAA GTAAAATGAC 27821 .......... .......... .......... .......... .......... .......... 772 ATACAAAATT ATTAAATAAC ATATTTTGAA CACTCTTAAC CTAACAAGTT GTTATTGCCT 27881 .......... .......... .......... .......... .......... .......... 772 AGTGATTTCG CCTCTTTGGA ATGAGTAACT GCTTATATGT TCGAATCTCA TTAGTTCTAC 27941 .......... .......... .......... .......... .......... .......... 772 TTTTAAAAGA GTTCTTATTG TGCTTAATCT GGACACTCTT TGTGAAATTG TTGATTTCGC 28001 .......... .......... .......... .......... .......... .......... 772 CACTAATATG AAAATAAGGA GGATAACCAC TTATAAATAG TTTTTCTTTT TTGATATAAA 28061 .......... .......... .......... .......... .......... .......... 772 TGTTGTTTTC TCTTTAAAAT ATTACTCTCT TCGTTTAATA ATAAAGAATA GCCTACTCTT 28121 .......... .......... .......... .......... .......... .......... 772 TTATTTGGTC TGTTTAAATA AAAAAGTCCC CTTTTTTTAC AATTCTTTAA ATTCAACTTT 28181 .......... .......... .......... .......... .......... .......... 772 TCACGTGATA TATTTAACAC CATACAATCA ATTTTTTAAT ACATTTGATA CAACTTTAAT 28241 .......... .......... .......... .......... .......... .......... 772 TTAAAACTAA AAAAATTATT TTATTTTAAT TTTTTTAAAC CAACATTTTT ATAAAATGAA 28301 .......... .......... .......... .......... .......... .......... 772 AGGAGTATAA TTCTTTAACC CTTTTAATTA TTATCGGTTA CACTTTAATG CATGTCACTT 28361 .......... .......... .......... .......... .......... .......... 772 TTTCAGTAAG TAGGTTATTC TCACGTGATT AGCTAGCTGT TCTAGAATCT CGACAGAATC 28421 .......... .......... .......... .......... .......... .......... 772 ACGTATATAA CCTGTTTATT TTTATTTTTA TTTTGTTTAT TAATAAAAAT AATTAAATTT 28481 .......... .......... .......... .......... .......... .......... 772 AATAGGATCG GATCCCGGGT ATTAACTTTT ATGCGAAGAT TCATTTTAGA TTGTCTAGAT 28541 .......... .......... .......... .......... .......... .......... 772 TTATAGTTGT CCTCAATTAT TTTTTATTTT TTTCTTTAAA AAGGTAATTA TTTTAATTAA 28601 .......... .......... .......... .......... .......... .......... 772 TTCACTTGAA AATATAAATA ATTATGTAAA ATTCAAAAAG AGTTTTTTAA AGTATAAATT 28661 .......... .......... .......... .......... .......... .......... 772 AGTAAAAGTA ACATTTTTAT TTATGGTTTT TAAAGAGACG TATAAAAAAA AAATAGACAA 28721 .......... .......... .......... .......... .......... .......... 772 TTACTCTTCA ATGCATTACT TTTTTGTGTG AATTTAGATT AGTCAGGTTC TAATATAAAT 28781 .......... .......... .......... .......... .......... .......... 772 ATTAAAAATG AGATAGTATA TTGTATATTC TCTTATTTCA TATTTTCTCC GTTTAAAAAA 28841 |||| || | | | | || ||| | || | |||| |||| ||||| .......... ......TATA AAGT-TGTAC T-TTCTTT-A TACTCTCTCT GTTTTAAAAA 813 GAATGAACTA GTTTGACTTG GAATGAAGTT TAAGAAAAGA AAGAAGACTT TTTAATCTTG 28901 |||||| || ||||||||| || || ||| ||||| |||||||||| |||||||||| GAATGACCTG ATTTGACTTG AAACAGTATT TAAAAAAAGG AAGAAGACTT TTTAATCTTG 873 TGGTTCTAAA TTAAAGTTAT GTCAAATGTA TCAAAATGTT CTTAAATCTT GTGGTCTTAA 28961 |||||||||| |||||||||| ||||||| || ||||| | || |||||| |||||||||| TGGTTCTAAA TTAAAGTTAT GTCAAATATA CCAAAACGCC TTTTAATCTT GTGGTCTTAA 933 ACATGTCACG TGAAAAGTTA AAATTAAATT CTTTTTAAAA AAATTAAATA AAAAATAAGA 29021 ||||| ||| |||||||||| || ACATGCCACA TGAAAAGTTA AA........ .......... .......... .......... 955 ATATTCTTTT TTGAAACATA GGAAATATTA TTTATTCCTT AAAGTTTTTT TTTAATAGTA 29081 .......... .......... .......... .......... .......... .......... 955 TTTTACCTAT CGATTTTAGT AAATCAAAGG TACAATTCTT ATTATTAGAT GACTGCATGT 29141 .......... .......... .......... .......... .......... .......... 955 ATTAACAATT AAATTGAAGT TTTCAAAATA TAATTAATTT AACAACAATA TACCTATGAT 29201 .......... .......... .......... .......... .......... .......... 955 AAATATTATT TTAAAAGGAA TGTCTGGTTA AAATGGATCA TCATCAAAGT TATAGTATTA 29261 .......... .......... .......... .......... .......... .......... 955 ATAATATTAA AATAAAGAAA GATGGAATAC TTTATATTAT CACTTACATT GTTCGGTACC 29321 .......... .......... .......... .......... .......... .......... 955 TACTCATTGA TGAGAAAATT GAAAAATTGG TAGAATTTAT AGAGGTGCGT TTGAATTTTG 29381 .......... .......... .......... .......... .......... .......... 955 AAATTTAAAT AAATTTATTT TAATTTATAA AAATATTTAA ATTATTAAAT ATTGTGATTT 29441 .......... .......... .......... .......... .......... .......... 955 GTATAAATAT TTAAATTATT AAATATTGTG ATTTGTATAA ATATATTTAT AAATTTAAAT 29501 .......... .......... .......... .......... .......... .......... 955 AAATCTATTT TAATTTGTAA CTATTTTATA AAAAAAATTA AATATTTTAA ACTATTAAAT 29561 .......... .......... .......... .......... .......... .......... 955 ATTGTGACTT GTACAAATAT ATTTATGTAG TTTATAAATA TAAAAAAAAT TAAAAATTTC 29621 .......... .......... .......... .......... .......... .......... 955 ATATATAAAT TTTCAGTTAA ATTTAAAATA ATTTGACTCT CGAAAAATGA AACTAGTACA 29681 .......... .......... .......... .......... .......... .......... 955 TGACACGGTA ATCGAGGTTG GTGTATGTCT CCCCATCTTC TTGATTTTTT ATTTGAATTA 29741 .......... .......... .......... .......... .......... .......... 955 TGCCCTTCTG GAGAGTTATG GGCTCATTTT GATCATTTCA TACTTTTCTT TGCTCATAAG 29801 .......... .......... .......... .......... .......... .......... 955 ATAGCGATAA GGTTCTCTCT TTATTTGTCT ATGTCTTGTA CACTTCACAT TTTTTTTTAG 29861 .......... .......... .......... .......... .......... .......... 955 TAATGTCGAT AGGTTAAAAT ATCATCATTG TTGGTGGGCT AAAACCCATA AGGGTGGAAG 29921 ||| || | ||||| .......... ..GTTGAAGT GTCATC.... .......... .......... .......... 969 GGGTAATTTT TTTTAATAAT TTGGTATATT AAATTTTCGT TAGGTGTAAA ATTTAAGAAA 29981 .......... .......... .......... .......... .......... .......... 969 AAATTAAATA TAATATAAAA TAAAATATAA AATAAGAGGT TATAATTTAT AATTTAAACT 30041 .......... .......... .......... .......... .......... .......... 969 GTGACTTTTT AATTACATGA TAATAACTTT ATTAGTTATC ATAACAGCGG ATGAAAGTTC 30101 .......... .......... .......... .......... .......... .......... 969 GTTTTTTAAT GTGTCGATAT AAGGGTGCAA GGGGTTGAAC TCTTTAATTT TGTTGATAGG 30161 .......... .......... .......... .......... .......... .......... 969 TAAAAACAAT TGGGGTTCAA TTCTTTAGTT GTGTTTGATA ATAGGGCAAA AAATAATTAG 30221 .......... .......... .......... .......... .......... .......... 969 ACTTTTCATA TGTATATAAG TTTTTTTTTC TCTTATTTAA ATTTCTAATA ACACCATTCG 30281 .......... .......... .......... .......... .......... .......... 969 TTTGATTATT ACTTTATATT AAATATTGGG GTTATTGGCA TACTAATTTG ATTTTTGATA 30341 .......... .......... .......... .......... .......... .......... 969 ATTGTGTTGA ATTATGCATG TGCACCTAAC TCCTAAGTTA GCAAAGAATT GAAAAATATT 30401 .......... .......... .......... .......... .......... .......... 969 TTGCAAGGAA GCTATTTTCA CAATTTAAAT TTTTAATCTT TCGATTACAA TAATAACTTT 30461 .......... .......... .......... .......... .......... .......... 969 ATATATTATA ACGATAGAGA AATTTCAATT TTTTTAATTA TGTTGATAGG TATAAGGTTG 30521 .......... .......... .......... .......... .......... .......... 969 TAGGGGTCAA TACTTTAAAT TTTGTTGATA GGGTTAAACT TACCAAGGTT TGTGTTTCTA 30581 .......... .......... .......... .......... .......... .......... 969 ATAACGCAAT TCGTTGGGTT ATTAGTAACA CTAACTTAAA ATTGGTTTAT CATATTGAAT 30641 .......... .......... .......... .......... .......... .......... 969 TATTTACTCA AATATTTAGC CAATATTAGG TACCATAATA TTATTCCCTT CGTTCATTTT 30701 .......... .......... .......... .......... .......... .......... 969 ATGTGACATC ATTTCTTTTT TTGTCAGTCC CAAAAAGAAT GTCACATTTT CTTATATGAT 30761 .......... .......... .......... .......... .......... .......... 969 AACTATTTAA AGGTAATATC CTTATTGGTC CCACTTCATT TTAAAAATAG TACTCTAATA 30821 .......... .......... .......... .......... .......... .......... 969 AGAGAAAAAA AAGAAGTAAG TTTACTAAGA ATAATTTAGT AAACATTTTA AAATAATCTT 30881 .......... .......... .......... .......... .......... .......... 969 GCCACATAAA ATAAAATAAA GGAAGTATAA TTTAAGAAAA TAATAAACAG TTAAAATAAC 30941 .......... .......... .......... .......... .......... .......... 969 TTTCAAAGTA ATTTAACCAC ATTTATTGGA ACTAAGAGAG TATAATTTAA GAAAATTGTA 31001 .......... .......... .......... .......... .......... .......... 969 ACAGTTAAAA TGACTTTCAA AGTAATCTTA ACCACATTTT TTCGATTTCC ACAGTAATTT 31061 .......... .......... .......... .......... .......... .......... 969 AACAACATTT AGAGAGTCTT AAACTTACTT TATAAACAAC CCAGAAAATT TTGTTATTAG 31121 .......... .......... .......... .......... .......... .......... 969 TATTGTTGGG ATCTTCAGTT AAGAAAATCC CTCCAAAAAT TAGCGTATGC CACTAATTGC 31181 .......... .......... .......... .......... .......... .......... 969 AACGCGCACA CGGTATTCTT TAGTAAAAGG CGAAAGAATA GTTCGCTATG TGCTCACTGC 31241 .......... .......... .......... .......... .......... .......... 969 GTGGCTGCGT AAATTATATG ATATTAGCAC GAGTCTCTTA TATGTTGCTT GTAGCTGTTG 31301 .......... .......... .......... .......... .......... .......... 969 GCTTACTCAC CTACATTACG ATGTGGTACT TTATCAACAA CACTTCCATT TATGAAATTC 31361 .......... .......... .......... .......... .......... .......... 969 CAGTATATGG GCTTTGTGCC TTTGTTGGAC CCAGTACTTA CAAGTTTGGG AGGGCCACTT 31421 .......... .......... .......... .......... .......... .......... 969 TCTGTATTAG CCCACTTTTT CATTGTAATG TAAACAATTC TATAATGATT AGAAAAACAG 31481 .......... .......... .......... .......... .......... .......... 969 GTCAAATGTG TTTGGGATGA AAAATAAGTT TTTCCCAGCA AATGTCATCT TGAAAAACAA 31541 .......... .......... .......... .......... .......... .......... 969 TGTTTTTCAA TAATTTTTTA GTGGTAGATA ATTAAGAAAG AAAATATTAT TCAAAAGAGA 31601 .......... .......... .......... .......... .......... .......... 969 ATCTATATGT TGTTTAACAA AATATGATAG AGGTTGAATG GATTGGTGTA TGGGGGTGGG 31661 .......... .......... .......... .......... .......... .......... 969 GGGAGGGGAT GATCAAGGGG TGAGAGTTGA GGAGATTGAA TGGGCGGGAA GGAGACGATA 31721 .......... .......... .......... .......... .......... .......... 969 CATTTGGAAT AACACTTATG TAATTGTTTT TTTTTTTCAT TAAAGACCTT ATTTTTCTAA 31781 .......... .......... .......... .......... .......... .......... 969 ATATTTTGAC CAGCTAAACA TAAAAAAATA ATAATAGAAA AACATTTTTT TTTGAATATT 31841 .......... .......... .......... .......... .......... .......... 969 TTCCTCCATG CTGAACATAA AATTGTAAGC TATTCTCTAT TATACTAATC ATAAGATTGT 31901 .......... .......... .......... .......... .......... .......... 969 AAAATATCTT TTGATTTTTC GAACATAATA AGTAAAAGTC GAAATATAAA TGTACGTATT 31961 .......... .......... .......... .......... .......... .......... 969 TGAATCTACA TTTAAGACAA TGGATTAAAA CCTCATTTGT TTTCACTAAG ATGAAAAATG 32021 .......... .......... .......... .......... .......... .......... 969 ACTGAATATA AATGTACTTG TTAGAATAAT TATGATCATG TATATAGATT TGAATACTGA 32081 .......... .......... .......... .......... .......... .......... 969 ATAATATATT TTAATATGTC GCTTTGAGCC TACACATTGA AAATGACCAA CACTCGATAT 32141 .......... .......... .......... .......... .......... .......... 969 CATTGTTGCT CTAAGCGTGT ACCCATAACG TGACATTGGA CATGAGCATA CATCAAGAAT 32201 .......... .......... .......... .......... .......... .......... 969 GAGTGAAGGT GCAGTTTAAA AGCATTTAAA GAGACAAGTT CAGTAAAAGA AAACACTTGT 32261 .......... .......... .......... .......... .......... .......... 969 ATAACACATA AGTTGTGAAA TTGGAATCGT TAAAAAAAGA CACGAAAATT CAACCTGACA 32321 .......... .......... .......... .......... .......... .......... 969 CATAACCATG TATCTCTCTA TTAAACCTCT AAACATGATC GAGAAGTATC TATTAGGACA 32381 .......... .......... .......... .......... .......... .......... 969 AGGCCCTTGG TATTACCTTA AACACATAAT TATAAAGTAC TAGAAATATG TATATTAAGT 32441 .......... .......... .......... .......... .......... .......... 969 CTTTTGAGAA ACAGAGAAGG GCTCCCCAAT TAGCAGAAAA TATGGGACCC AGTGAGCGAC 32501 .......... .......... .......... .......... .......... .......... 969 TTGATCACAT GTATGTGTAT ATGTAACACT CTCGATAAAA GTGATGTAAT TACATATTAA 32561 .......... .......... .......... .......... .......... .......... 969 ATAGTATTTG TATCTAAACA TATCGGTACA TGCCAGATCA ATACAAACAT ATAGCATAGT 32621 .......... .......... .......... .......... .......... .......... 969 ATAACATATG AAGATTAAGA GATAAGGTCG TTAATACTAC CTTAATGGTA CATCTTTTTA 32681 .......... .......... .......... .......... .......... .......... 969 TCATATTTTT TTTTATCAAT AATAAGATTC TGAAAATATA ATAAACGATA CAAATACTAT 32741 .......... .......... .......... .......... .......... .......... 969 GTGAATGCAT GAAGTTTGAT CTACCCAATC AACTAAGGTG GAGACATATG TGTACATATC 32801 .......... .......... .......... .......... .......... .......... 969 ATGTGGATCT ATAAGTATAA CTATCCCTTA TCAGAAGCAT AAAAGAACAA TTCCCCCAAA 32861 .......... .......... .......... .......... .......... .......... 969 TCAACAAGAT ATTATCCTTA ACAGTGTTGG CAAATGAAGT TTTATACGTT CATCATTTAG 32921 .......... .......... .......... .......... .......... .......... 969 ATTCATGCAC TTTCGATCAA CTATAATTGT CTCTAGGTCA TATGTTCACA GATTAGCCCA 32981 .......... .......... .......... .......... .......... .......... 969 AAAATTTATA TTCATGACTT TTAATGAACG TAGTTCATAA CTTAAGTTCA TCTTATCATA 33041 .......... .......... .......... .......... .......... .......... 969 TCATGAAATT ATATTTATAT ATATTATAAT TGTTTAACAA TTTTGCCATA TAAAATGAGA 33101 .......... .......... .......... .......... .......... .......... 969 CGAAGGAATA TAATTTAAGA AAATAATAAC AGTTAAAATG ACTTCCAAAA TAATTTAACT 33161 .......... .......... .......... .......... .......... .......... 969 AACATTTTTT CGACTTCCAC ATTAATTTAA CCACATTTAT TGGGACGAAA GGAGCATAAT 33221 .......... .......... .......... .......... .......... .......... 969 TTAAGAAAAC TATGACAGTT GAAATAACTT TCAAAGTAAT CTTAACCACA TTTTTTCGAT 33281 .......... .......... .......... .......... .......... .......... 969 TTCCACAGTA ATTTAACCAC ATTTAGAGGG TCTTAAACTT ACTTTATAAA CAACCCAGAA 33341 .......... .......... .......... .......... .......... .......... 969 AATTTTGTTA TTAGTATTGT TGGGATCTTC AGTTAAGAAA ATCCCTCCAA AAATTAGCGT 33401 .......... .......... .......... .......... .......... .......... 969 ATGCCACTAA TTGCAGCGCG CGCACGGTAT TCTTTAGTAA AAGGCGAAAG AATAGTTCGC 33461 .......... .......... .......... .......... .......... .......... 969 TATGTGCTCA CTGCGTGGCT GCGTGAATCT TTTGTTAGTA GCACAAGTCT CTTATATATT 33521 .......... .......... .......... .......... .......... .......... 969 TCTTGTAGCT GTTGGCTTAC TCACGTACAT TACGATGTGG TATTTTTATC AACAACACTT 33581 .......... .......... .......... .......... .......... .......... 969 GCATTTATGA AACCATAAAT TCCAGTATAT GGGCTTTGTG CCTTTGTTGG ACCCAATACT 33641 .......... .......... .......... .......... .......... .......... 969 TACAAAGTTT GGGAGGGCCT CTTTCTGTAT TAGCCCACTT TTTCATTGTA ATGTATATAA 33701 .......... .......... .......... .......... .......... .......... 969 ACACTTCTAT AATGATTAGA AAAACAGGTC AAATGTGTTT GGTATGAAAA ATAAGTTTTT 33761 |||||||| |||| .......... .........- AAAACAGG-- AAAT...... .......... .......... 981 CCCAGCAAAT GTCATCTTGA AAAACAAAGT GTTTTTTAAT TACTTTTTAG TAGATAACTA 33821 .......... .......... .......... .......... .......... .......... 981 AGCAAGAAAA TATTATTCAA GAGAATCTAC ATGTTGTTTA ACAAAATATG ATAGAGGTTG 33881 .......... .......... .......... .......... .......... .......... 981 AATCGATTGG TGTATGGGGG GTGGGGGGAG GGGATAATCA AGGGGTGGGA GTTGAGGAGA 33941 .......... .......... .......... .......... .......... .......... 981 TTGAATGGGC GGGAAGGAGA CAATAAATTT GGAATAA 33978 ||| || | ||| | ||||| .......... ......GAGT CATTCTTTTT GAAATAA 1002 hqPGS_C09HBa0099P03.1-1+_SGN-U344324- (28798 28983,29874 29887,33721 33735,33958 33978) ******************************************************************************** EST sequence 10 -strand 697 n (File: SGN-U330025-) 1 AAGTTTAGGG TCCAATATAT GTATTATCCC TAATTAATTT GTACTCTTTA GCTCCTTGCA 61 GGAAGTATAG GGAGCTGGTG TATTTTGTAT CCACTTTGAA GAGGTACTTG GTGCTAATGT 121 AGCCAATGTA ATGAAATTCT TTTACTTGTC AAAAGAAAGC AGAAATCAAA AGGCTCTAGA 181 AGTTCTGATA AGAAAGTAAG TATATATGAT AAACACATGT GAAGGCAACA TGCTGCAAGT 241 TACAAATCCA GAGATTTGAT AATTATGTGA ATATTATAAA AAAGGTAAGC ATCACTATAT 301 GATGCAAGAT CTTTCGTACA ACTAGATATA TTAGAAATAT TAGAATACAC TAAGGTGAAA 361 ATTCTGAAAA GCTTCCCCAG CATATTACCA AGATAATGAT TCGGCAACCA TTTCAAGTTG 421 GACAAGAAAA AAAGATCTCA TAATTCTATG GTGTACACTT TCACCATCTC ATTGAGAAAA 481 CAAAACCAAC TCTAAATTTC CAAAATAGTA CTCCCTTCAT TTAAAAAAGA ATGACCTAGT 541 TTGACTTGGA ACGGAGTTTA AGAAAAGAAG AAGACTTTTT AATTTTGTGG TTCTAAATTA 601 AAGTTATGTC AAATGTACCA AAATGTCGTC TTAAACATGT CACTTGGAAA GTTAAAACAA 661 AATGAAAGGA GTCATTCTTT TTTAAACAAA CTAAAAA Predicted gene structure (within gDNA segment 22889 to 30331): Exon 1 23616 23626 ( 11 n); cDNA 426 436 ( 11 n); score: 0.727 Intron 1 23627 27755 (4129 n); Pd: 0.722 (s: 0), Pa: 0.000 (s: 0.62) Exon 2 27756 27811 ( 56 n); cDNA 437 489 ( 53 n); score: 0.625 Intron 2 27812 28801 ( 990 n); Pd: 0.987 (s: 0.62), Pa: 0.000 (s: 0.68) Exon 3 28802 28938 ( 137 n); cDNA 490 624 ( 135 n); score: 0.847 MATCH C09HBa0099P03.1-1+ SGN-U330025- 0.782 204 0.293 C PGS_C09HBa0099P03.1-1+_SGN-U330025- (23616 23626,27756 27811,28802 28938) Alignment (genomic DNA sequence = upper lines): GAAATGAAGA GGTTTGTGTT GCTTTATATG CTTCCTACAA TGTATTGAAC AACTGTGATT 23675 |||| |||| GAAAAAAAGA T......... .......... .......... .......... .......... 436 CTTTTTCTCA TTTAATGTAC TTCTATTCAA CAACATACAT CTTACAAACA AGGGTCCACA 23735 .......... .......... .......... .......... .......... .......... 436 AGGCAGAGTT TCCTTGATTG TTGATATTTA TTGTTTTATA ACATGATATT AATTGAACCG 23795 .......... .......... .......... .......... .......... .......... 436 ACTGATACTT CTTGAGTGGT GATATTACAA GGTTCAATAT TGGAGTAGAA TTTGGTTAAG 23855 .......... .......... .......... .......... .......... .......... 436 GTCACTAATG GCTTATGCTT GGGCAAGGAC ATAATAGATG TACTATCTCA TAGTAGTAGA 23915 .......... .......... .......... .......... .......... .......... 436 AGTATTTGCT TGAAAATTTT TCTTATCGAG TTTGTGAAGT CATTTTTTGA TCTCTTTTTG 23975 .......... .......... .......... .......... .......... .......... 436 GAGAAAATGA TTGTTTTACC CGAGTCGAGA AACAACTCTC TACTCTCACA AGATAGAGAT 24035 .......... .......... .......... .......... .......... .......... 436 AAGGTCTGCG TACGTAACTA TTCTCATATC CTATTTGAGA TTACATTGAA TATGTTGTTG 24095 .......... .......... .......... .......... .......... .......... 436 AGTAAATGTT TGTTTGAGTT GTTTGTTGAT TTTCATTTTT GGTGCCAATC TTATCTCAAA 24155 .......... .......... .......... .......... .......... .......... 436 AGGTGTTTAT GCTGTGTTAC ATAGAGGACA AGGAAGTAAA GAGGGGGGGG GGGAGGGGGG 24215 .......... .......... .......... .......... .......... .......... 436 GGTTGGGTTG AATCTGTCAT TTCACTCTTT TGTAAAGCAT GAACTTTAGA AAGCATTTAG 24275 .......... .......... .......... .......... .......... .......... 436 AATTATGTAC GAACTATTGC AAAAATGTTT CAAGTAGAAG TGATAGATAT TATCAATGTA 24335 .......... .......... .......... .......... .......... .......... 436 AAAATTATAC CTTATCATTT TATTTTAAAA TGTTACTTCA CCATTTTTAC CAATTAATCA 24395 .......... .......... .......... .......... .......... .......... 436 TTCATATTTT AATTTTCTAG ATATGCATAG AAGCAATTCA TCATTTTTTA GTACGAGTAG 24455 .......... .......... .......... .......... .......... .......... 436 ATATCATTAG CTATTTGCAT ATTGTAAGCA AATATGGAGA AACAACTAAC GATCATGATG 24515 .......... .......... .......... .......... .......... .......... 436 GATCTTGAGT CATTTAAAAA AATGTATATT GTTGCTACTT CTTCGTGTCA GATAAAAATA 24575 .......... .......... .......... .......... .......... .......... 436 AAATAGGGTG GTGGTTAGAG AAAATTACGT AATAAAATAA ATTTATACTA TTTGATTACA 24635 .......... .......... .......... .......... .......... .......... 436 TATCATAGTT GTAGTTTGCT ATTATTATCT TTCGCGACAA ACATTGTATA TTAATTACAT 24695 .......... .......... .......... .......... .......... .......... 436 GGGCCGACTT CGAATTTGTA TAATTAGTCA CGTTTGTATA TGTATAATTC GCCAGAATAT 24755 .......... .......... .......... .......... .......... .......... 436 ACAAATACAT ATGTATAATA TACAATTATT TAACCTACAT ATGTATACAG TCACCTCTCT 24815 .......... .......... .......... .......... .......... .......... 436 TCCTAGTCTC GCTCGCCTCT CTCCTCCCTT TTCCAATCTC CTCTCTCCTC TCTCTCCCAA 24875 .......... .......... .......... .......... .......... .......... 436 TCTCTCTTGC CATATATACA AATACATAGG TATAATATAC AAATATCTAA CCAATATACA 24935 .......... .......... .......... .......... .......... .......... 436 TATATAATTC ACCTTTCTCC CACTCTTTTC TCCCCTCTCT CGCCTCTCTC GTCACTCTCC 24995 .......... .......... .......... .......... .......... .......... 436 TAGTCTCGCT CATCTCTTTT CTTCATATAA CATGTAGCTA CAAATTATAA TTATCAAACT 25055 .......... .......... .......... .......... .......... .......... 436 ATATCTATGG AGAGTAATTA ATTATATTTA AGTGGCTATA TGTGAAAATT TCTTCGATTT 25115 .......... .......... .......... .......... .......... .......... 436 AAACACGTGA GGCTTAAAAA TATGATGGAA ATGATCTCGG GACAATGTAC AAATATTTTC 25175 .......... .......... .......... .......... .......... .......... 436 CTAGACTGAT CGAAATCTAA TAAACGCACG TCTTATTCAA CTTTGTGAAA AGTTACAAAA 25235 .......... .......... .......... .......... .......... .......... 436 TCAAATCATT TCTTTTGATG TTCTTTACCA TCGATACTCC CTAAAACCTC TTTATTCTTG 25295 .......... .......... .......... .......... .......... .......... 436 TATTTTGTTT CACCTGCTTG ATTAAAAGTT GATACAATAT GATATGTTCA AGAATTTAAA 25355 .......... .......... .......... .......... .......... .......... 436 TTTGGTGATT TAGGATAATT ACTAAATACA ATATATTAAG GAACTTAAGA AGTGGCTCAC 25415 .......... .......... .......... .......... .......... .......... 436 GATATAAAGA TGAACAATTA CTACCGACAA ACATAAACCA AGTTATTAAC CTGACGGCCC 25475 .......... .......... .......... .......... .......... .......... 436 ATTAAGCCCC CCCCCCCCCC CCCCCCCAGT ACAGTAATGT AAAATGGTCA TTGCGCCGGA 25535 .......... .......... .......... .......... .......... .......... 436 GCAACAGTGC CGTCGGTCAA GGGAACGGAC TGCTCGTTTT CAGCGGTGGC GCCGCCTTCA 25595 .......... .......... .......... .......... .......... .......... 436 AACACCTTCG CGATCCTCTC TACGGGAATA ATCGGTAAAT AATGCGCCGC CGTATAGCCG 25655 .......... .......... .......... .......... .......... .......... 436 TTCTCCGACG CATATGTCAC CGCTTGATTG TATTTCTCAC TCCAATAAGC AGCCGTCGGC 25715 .......... .......... .......... .......... .......... .......... 436 ACCAAAATCT GAGCCACTTG AGGGAACAAA GGAAGCTTAT TCAGAGATCT CCATGCCAAA 25775 .......... .......... .......... .......... .......... .......... 436 ACCGCATTTT TCTCGATCAC TGGCTCGTAT TTTGTGTATA GCTCCTTGAC TGTGGGTTCG 25835 .......... .......... .......... .......... .......... .......... 436 TACTTTGTGT AGAGTGTTTT AGCTACGTTG CTCGCTGTGT CCACTAAGCC ATCGTGCTGT 25895 .......... .......... .......... .......... .......... .......... 436 ACCTCGCCGG CGAGATCTCG AGCTAATTCT GGAGCCTTCT GAGCTATTAA CAGAGCTTTA 25955 .......... .......... .......... .......... .......... .......... 436 GATGATGTCT GCTTTAGTAG AGAAGGCACA TGGCTTTCAA CTTCTGTCAT CAAGTCTGCA 26015 .......... .......... .......... .......... .......... .......... 436 ACCTGCATAG ATTCAAACTT TCAAATCCTT GCCACTGAAA TTCGAGTTCA AAGTTTACAA 26075 .......... .......... .......... .......... .......... .......... 436 CATTCGTTTC TATGTTAGTG AATTATCATG TGCATGTCCA TGCACATTCT TCCAGATTGA 26135 .......... .......... .......... .......... .......... .......... 436 GCTAGATCTA GGATTTCGAT TAGTTGGAGA AGACAATTTC TGAATGCCAT ACAAATACCT 26195 .......... .......... .......... .......... .......... .......... 436 TCTCAATTCC AGCAAACTTT CAGCTGCAAA ATAAGTATGG TGTAGACAGT ATGGGTAAAA 26255 .......... .......... .......... .......... .......... .......... 436 TTCATTTTGG AGAAAGAAGG TGAACCTTTA GGTCGATGAA CTTGAGGAGA TTGAAAGGAA 26315 .......... .......... .......... .......... .......... .......... 436 CGTTATGGAA CTTCTCATAA ACCGGTCCGA TAACAGTTTT AACAGTGGCT TCTACGGCCT 26375 .......... .......... .......... .......... .......... .......... 436 GTACACCAGG TTTCAACGGA CCGGAGTTTT CTTTGCCGTA TTCATACAAA GTTGAGAAGC 26435 .......... .......... .......... .......... .......... .......... 436 AAACAATCAC ATAGATCGCT GCAACTTGAA CGAAATCCAG ATATTTAAGT TTCCTCTCAT 26495 .......... .......... .......... .......... .......... .......... 436 CTTCTGCTAC CTTCAATTAT AAACACAAAT CAGGTACAGA TTTAAAACTC AAAAATTCTG 26555 .......... .......... .......... .......... .......... .......... 436 AATCTCACCT ATGTTGAGAA AACACATCTC GAATTAAATT CAACAAATTA TACTAGTATT 26615 .......... .......... .......... .......... .......... .......... 436 CTACCTGTCA ATGCATTTAA TCAAATCTAA CTACTCTTTC TCGATATATT TTTTATCAGT 26675 .......... .......... .......... .......... .......... .......... 436 CGAGTGACCA TATGTATTGC GAAGATGAAA TTGGCAACCA TGAAGGCATT GCACAACAAC 26735 .......... .......... .......... .......... .......... .......... 436 TCATTCCTAA TTTAACACCT CAAAATAAGA AGATAACAAT TCATTCTGGC AATTAAATTG 26795 .......... .......... .......... .......... .......... .......... 436 AGTAATTGAA CAACTTAGAT CTAATATGTG AAGCCTGAAT AACAGAATAT GAAGTTACAA 26855 .......... .......... .......... .......... .......... .......... 436 CCACGCTACA ACGGTGTGTG TTTGAAAATT AAAATTGATT TAACTGTAAT ATCATGATAA 26915 .......... .......... .......... .......... .......... .......... 436 ACGTAGAAGA CATTGTATAC TTACTGGATC AGCTGGATCA GTACTAGTAG GCGGCGTCGT 26975 .......... .......... .......... .......... .......... .......... 436 GTTTGCTGGA TCAGTACTAG CAGGCGGCGT CGTGCTTGCT GGATCAGTCG TAGCAGGTGG 27035 .......... .......... .......... .......... .......... .......... 436 CGCCGTGCTT GCTGGATCAG TAGGCGGCGT TGCAGCTGCG TCAGCCATGG TAGAGATGGA 27095 .......... .......... .......... .......... .......... .......... 436 GAAAAACTAA GTAATGGAGT TTGTACAGGA GAGCTTCGAA AAGGCAGGCA GTTGGCAAGC 27155 .......... .......... .......... .......... .......... .......... 436 CCTCTTTTAT AGCGGACGGG GCACGTAATA ATCGTGTGCG GAGAAGTCGA TGAAAATTTT 27215 .......... .......... .......... .......... .......... .......... 436 ACAGGTTTTC AATTAATGTT TAAAACACAA AAATAAAGTG AATAAATATT CTTAATTAAA 27275 .......... .......... .......... .......... .......... .......... 436 GTGATTAGTA TTCATGACCA TAAATAGAAA GAAATTATTT AATAATATTT TTTTGTTAAT 27335 .......... .......... .......... .......... .......... .......... 436 TTTTAAATAA GGAAAATTAC ACAAATTGTC CTAAGTTAAG ACTTTACTCA ATTTATTTAC 27395 .......... .......... .......... .......... .......... .......... 436 TTTGATGGAA CACATAATCC AAATTATTTA CCTTACTTTC CTCTTGTTCA ATCACTAAAT 27455 .......... .......... .......... .......... .......... .......... 436 TTTTTTTTAA TTCTTGATAC GATTTAAATA CACTTCGATG ATATTAATCA GTAATTGAGC 27515 .......... .......... .......... .......... .......... .......... 436 AGTTTCTTCT TTGATGTCAT CAGAATGATA TTAAAGAGTA ACTGAGTAGA GTTTTCACTG 27575 .......... .......... .......... .......... .......... .......... 436 AAACACTGTC TCTTTTTCTA AAATTTTACT CTAATGGTAT CAAAGAATAA GTGAATCACT 27635 .......... .......... .......... .......... .......... .......... 436 GCACTTTTCT TCCTCTTTGA TACCACGAGT GTGTGTCTAT CTATATATAT ACTTTGATGG 27695 .......... .......... .......... .......... .......... .......... 436 TATCGAATAT TAAGGACACA TTAGTAGTTA AGAATAGGGG TATGAGAACA GTGGGGTAGC 27755 .......... .......... .......... .......... .......... .......... 436 CACATTATGG TCAGAGTGTC CAAATTGACA CGATTCGTCG GGAAAAAAAA ATACAAGTAA 27815 | ||| || | | |||| || || || | || | | ||||| ||| | ||| CTCATAATTC T-ATGGTGTA CACTTTCAC- CATCTCATTG AGAAAACAAA A-CCAA.... 489 AATGACATAC AAAATTATTA AATAACATAT TTTGAACACT CTTAACCTAA CAAGTTGTTA 27875 .......... .......... .......... .......... .......... .......... 489 TTGCCTAGTG ATTTCGCCTC TTTGGAATGA GTAACTGCTT ATATGTTCGA ATCTCATTAG 27935 .......... .......... .......... .......... .......... .......... 489 TTCTACTTTT AAAAGAGTTC TTATTGTGCT TAATCTGGAC ACTCTTTGTG AAATTGTTGA 27995 .......... .......... .......... .......... .......... .......... 489 TTTCGCCACT AATATGAAAA TAAGGAGGAT AACCACTTAT AAATAGTTTT TCTTTTTTGA 28055 .......... .......... .......... .......... .......... .......... 489 TATAAATGTT GTTTTCTCTT TAAAATATTA CTCTCTTCGT TTAATAATAA AGAATAGCCT 28115 .......... .......... .......... .......... .......... .......... 489 ACTCTTTTAT TTGGTCTGTT TAAATAAAAA AGTCCCCTTT TTTTACAATT CTTTAAATTC 28175 .......... .......... .......... .......... .......... .......... 489 AACTTTTCAC GTGATATATT TAACACCATA CAATCAATTT TTTAATACAT TTGATACAAC 28235 .......... .......... .......... .......... .......... .......... 489 TTTAATTTAA AACTAAAAAA ATTATTTTAT TTTAATTTTT TTAAACCAAC ATTTTTATAA 28295 .......... .......... .......... .......... .......... .......... 489 AATGAAAGGA GTATAATTCT TTAACCCTTT TAATTATTAT CGGTTACACT TTAATGCATG 28355 .......... .......... .......... .......... .......... .......... 489 TCACTTTTTC AGTAAGTAGG TTATTCTCAC GTGATTAGCT AGCTGTTCTA GAATCTCGAC 28415 .......... .......... .......... .......... .......... .......... 489 AGAATCACGT ATATAACCTG TTTATTTTTA TTTTTATTTT GTTTATTAAT AAAAATAATT 28475 .......... .......... .......... .......... .......... .......... 489 AAATTTAATA GGATCGGATC CCGGGTATTA ACTTTTATGC GAAGATTCAT TTTAGATTGT 28535 .......... .......... .......... .......... .......... .......... 489 CTAGATTTAT AGTTGTCCTC AATTATTTTT TATTTTTTTC TTTAAAAAGG TAATTATTTT 28595 .......... .......... .......... .......... .......... .......... 489 AATTAATTCA CTTGAAAATA TAAATAATTA TGTAAAATTC AAAAAGAGTT TTTTAAAGTA 28655 .......... .......... .......... .......... .......... .......... 489 TAAATTAGTA AAAGTAACAT TTTTATTTAT GGTTTTTAAA GAGACGTATA AAAAAAAAAT 28715 .......... .......... .......... .......... .......... .......... 489 AGACAATTAC TCTTCAATGC ATTACTTTTT TGTGTGAATT TAGATTAGTC AGGTTCTAAT 28775 .......... .......... .......... .......... .......... .......... 489 ATAAATATTA AAAATGAGAT AGTATATTGT ATATTCTCTT ATTTCATATT TTCTCCGTTT 28835 | | | ||| || | | || | || | ||| .......... .......... ......CTCT AAATT-TCCA AAATAGTACT CCCTTCATTT 522 AAAAAAGAAT GAACTAGTTT GACTTGGAAT GAAGTTTAAG AAAAGAAAGA AGACTTTTTA 28895 |||||||||| || ||||||| ||||||||| | |||||||| ||||| |||| |||||||||| AAAAAAGAAT GACCTAGTTT GACTTGGAAC GGAGTTTAAG AAAAG-AAGA AGACTTTTTA 581 ATCTTGTGGT TCTAAATTAA AGTTATGTCA AATGTATCAA AAT 28938 || ||||||| |||||||||| |||||||||| |||||| ||| ||| ATTTTGTGGT TCTAAATTAA AGTTATGTCA AATGTACCAA AAT 624 hqPGS_C09HBa0099P03.1-1+_SGN-U330025- (28802 28938) ******************************************************************************** EST sequence 3 +strand 911 n (File: SGN-U328267+) 1 TGAATCAACT AAAAGTGGTT ACAATGATCT TAAACGAGGT CTTAAGGTTG TATACATCAG 61 TATACGCGAT TAATCGCATG GTGAATACAG AAACAAAGTT AGGGGATTTG TGTTTACCCT 121 CTGGGGTCCA ACTCATATTG GCAACAATGT TAGTGCATCA TGATACTGAA ATATGGGGAG 181 ATGATGCAAT GGAGTTTATG CCAGAGAGAT TTAGTGAAGG AATATCAAAA GCAACAAAAG 241 GACAAGTTGT ATTTTTTCCA TTTAGTTGGG GTCCAAGAAT ATGTATTGGG CAAAATTTTG 301 CTATGTTAGA GGCAAAAATG GCAATGGTCA TGATTCTAAA ACATTATGCA TTTGAACTCT 361 CTCCATCTTA TGCTCATGCT CCTCACCCAT TGTTGCTTCA ACCTCAATAT GGTGCTCAAT 421 TGATCATGCA CAAGTTGTAG AAGTGGTTAT CACGTGTTGT CTTTTGAATC ATGTTATATC 481 CTTTATTTAT TGTTGGGATA ATGTCCAAGT ACCCCCTTAG CCTATGTCCA AAATCTCAGA 541 GATACACTTA TACCATACTA AGATCTTATT ACCCCCTGAA CTTATTTATT AATAATTTTT 601 TACCCCTTTT AGACCTACGT GGCACTATCT TGTGGGCCCA ATGGTTGTTG ACTTTTTTTT 661 TAAACTAGTG CCACGTAGGC TAAAAAGGGG TAGAAAATTA CTTATAAAAT AAGTTCAGGG 721 GGGTAATAGG ACCTTAGTAT AGTATAAGTG TGTCTCTGGA ATTTCGGGCA TAGGTTGAGG 781 GGGTACTTGT GCATTATCCC TTTATTGTTC TTTAATTATA GTTCAAGTGA AGTCATATGT 841 TGTTTTGTAT AAGACTGGAG AATTTTATAA TAATAATATT AGTGTACTTT ATGTTTATGA 901 GAAAAAAAAA A Predicted gene structure (within gDNA segment 28646 to 37786): Exon 1 35718 36030 ( 313 n); cDNA 496 809 ( 314 n); score: 0.847 PPA cDNA 900 911 MATCH C09HBa0099P03.1-1+ SGN-U328267+ 0.847 313 0.344 C PGS_C09HBa0099P03.1-1+_SGN-U328267+ (35718 36030) Alignment (genomic DNA sequence = upper lines): GGATAATGCA CAAGTA-CCC CTCAACCTAT GCCCGAAATT TCAGAAACAA ACTTGTACTA 35776 |||||||| |||||| ||| || | ||||| | || |||| ||||| | | |||| ||| | GGATAATGTC CAAGTACCCC CTTAGCCTAT GTCCAAAATC TCAGAGATAC ACTTATACCA 555 TACTAAGGTC CTATTATCCC CTGAACTTAT TTTATTAATA ATTTTCTACC CCTTTTCGGC 35836 ||||||| || ||||| ||| ||||||||| |||||||||| ||||| |||| |||||| | | TACTAAGATC TTATTACCCC CTGAACTTA- TTTATTAATA ATTTTTTACC CCTTTTAGAC 614 TTACGTGACA CTATTTTGTG GGCCCAACGC TGGTT-ATTT TTTTTTCAAG CTAGTGCCAC 35895 |||||| || |||| ||||| ||||||| | | ||| | || |||||| || |||||||||| CTACGTGGCA CTATCTTGTG GGCCCAATGG TTGTTGACTT TTTTTTTAAA CTAGTGCCAC 674 GTAGGCCAAA AAAGTGTAGA AAATTACTTA TAAAATAAGT TCAGGGGGGT CATGGGACCT 35955 |||||| ||| || | ||||| |||||||||| |||||||||| |||||||||| || |||||| GTAGGCTAAA AAGGGGTAGA AAATTACTTA TAAAATAAGT TCAGGGGGGT AATAGGACCT 734 TGGTATAGTA TAAGTGTGTC TCTGAGATTT CAGACATAGG TTGAGGGGGT ACTTGTGCAT 36015 | |||||||| |||||||||| |||| |||| | | |||||| |||||||||| |||||||||| TAGTATAGTA TAAGTGTGTC TCTGGAATTT CGGGCATAGG TTGAGGGGGT ACTTGTGCAT 794 TTTCCTTATT TTTTT 36030 | ||| | | || || TATCCCTTTA TTGTT 809 hqPGS_C09HBa0099P03.1-1+_SGN-U328267+ (35718 36030) ******************************************************************************** EST sequence 9 -strand 911 n (File: SGN-U328267-) 1 TTTTTTTTTT CTCATAAACA TAAAGTACAC TAATATTATT ATTATAAAAT TCTCCAGTCT 61 TATACAAAAC AACATATGAC TTCACTTGAA CTATAATTAA AGAACAATAA AGGGATAATG 121 CACAAGTACC CCCTCAACCT ATGCCCGAAA TTCCAGAGAC ACACTTATAC TATACTAAGG 181 TCCTATTACC CCCCTGAACT TATTTTATAA GTAATTTTCT ACCCCTTTTT AGCCTACGTG 241 GCACTAGTTT AAAAAAAAAG TCAACAACCA TTGGGCCCAC AAGATAGTGC CACGTAGGTC 301 TAAAAGGGGT AAAAAATTAT TAATAAATAA GTTCAGGGGG TAATAAGATC TTAGTATGGT 361 ATAAGTGTAT CTCTGAGATT TTGGACATAG GCTAAGGGGG TACTTGGACA TTATCCCAAC 421 AATAAATAAA GGATATAACA TGATTCAAAA GACAACACGT GATAACCACT TCTACAACTT 481 GTGCATGATC AATTGAGCAC CATATTGAGG TTGAAGCAAC AATGGGTGAG GAGCATGAGC 541 ATAAGATGGA GAGAGTTCAA ATGCATAATG TTTTAGAATC ATGACCATTG CCATTTTTGC 601 CTCTAACATA GCAAAATTTT GCCCAATACA TATTCTTGGA CCCCAACTAA ATGGAAAAAA 661 TACAACTTGT CCTTTTGTTG CTTTTGATAT TCCTTCACTA AATCTCTCTG GCATAAACTC 721 CATTGCATCA TCTCCCCATA TTTCAGTATC ATGATGCACT AACATTGTTG CCAATATGAG 781 TTGGACCCCA GAGGGTAAAC ACAAATCCCC TAACTTTGTT TCTGTATTCA CCATGCGATT 841 AATCGCGTAT ACTGATGTAT ACAACCTTAA GACCTCGTTT AAGATCATTG TAACCACTTT 901 TAGTTGATTC A Predicted gene structure (within gDNA segment 33406 to 43252): Exon 1 35718 36023 ( 306 n); cDNA 113 419 ( 307 n); score: 0.776 Intron 1 36024 36555 ( 532 n); Pd: 0.000 (s: 0.82), Pa: 0.000 (s: 0) Exon 2 36556 36564 ( 9 n); cDNA 420 428 ( 9 n); score: 0.778 Intron 2 36565 38633 (2069 n); Pd: 0.543 (s: 0), Pa: 0.993 (s: 0) Exon 3 38634 38639 ( 6 n); cDNA 429 434 ( 6 n); score: 1.000 Intron 3 38640 39949 (1310 n); Pd: 0.000 (s: 0), Pa: 0.043 (s: 0) Exon 4 39950 39963 ( 14 n); cDNA 435 448 ( 14 n); score: 0.786 PPA cDNA 12 1 MATCH C09HBa0099P03.1-1+ SGN-U328267- 0.776 335 0.368 C PGS_C09HBa0099P03.1-1+_SGN-U328267- (35718 36023,36556 36564,38634 38639,39950 39963) Alignment (genomic DNA sequence = upper lines): GGATAATGCA CAAGTA-CCC CTCAACCTAT GCCCGAAATT TCAGAAACAA ACTTGTACTA 35776 |||||||||| |||||| ||| |||||||||| |||||||||| |||| ||| |||| ||||| GGATAATGCA CAAGTACCCC CTCAACCTAT GCCCGAAATT CCAGAGACAC ACTTATACTA 172 TACTAAGGTC CTATTA-TCC CCTGAACTTA TTTTATTAAT AATTTTCTAC CCCTTTTCGG 35835 |||||||||| |||||| || |||||||||| |||||| | | |||||||||| ||||||| | TACTAAGGTC CTATTACCCC CCTGAACTTA TTTTATAAGT AATTTTCTAC CCCTTTTTAG 232 CTTACGTGAC ACTATTTTGT GGGCCCA-AC GCTGGTTATT TTTTTTTCAA GCTAGTGCCA 35894 | |||||| | |||| ||| | | ||| ||| | |||||||| CCTACGTGGC ACTAGTTTAA AAAAAAAGTC AACAACCATT GGGCCCACAA GATAGTGCCA 292 CGTAGGCCAA AAAAGTGTAG AAAATTACTT ATAAAATAAG TTCAGGGGGG TCATGGGACC 35954 |||||| | | ||| | ||| ||||||| || | ||||||| |||| ||||| | || || | CGTAGGTCTA AAAGGGGTAA AAAATTA-TT AATAAATAAG TTCA-GGGGG TAATAAGATC 350 TTGGTATAGT ATAAGTGTGT CTCTGAGATT TCAGACATAG GTTGAGGGGG TACTTGTGCA 36014 || |||| || |||||||| | |||||||||| | ||||||| | | |||||| |||||| || TTAGTATGGT ATAAGTGTAT CTCTGAGATT TTGGACATAG GCTAAGGGGG TACTTGGACA 410 TTTTCCTTAT TTTTTTAAAA GAAATTCAAC CCTTCTAATT TTCAAGAAAC TCACTGTTGT 36074 || ||| | TTATCCCAA. .......... .......... .......... .......... .......... 419 CCTCTCCACC CCACCCATCC CAACCCAACC ACTCCTCCAC CCCCATTAGA GCATCCATCC 36134 .......... .......... .......... .......... .......... .......... 419 TCTTTACTCC ACCCACTCCA ACTTCAATCG TTTTCACTCT ATCGTCTTTT AAAGAAACAC 36194 .......... .......... .......... .......... .......... .......... 419 TTTCAAAAGA ATCTTAATTA CATTGAAAAA AATAATTTAT AAAAGTTCAC TTTTTCCTCT 36254 .......... .......... .......... .......... .......... .......... 419 TCTTTACTTT AAAAAAGAAT CAATTATAAC TAACAACGGA AGAATTTTTT GCTGGATCTA 36314 .......... .......... .......... .......... .......... .......... 419 AAGAAACAAA ATGAGAAGAA GATAATGGTT GTGGTGATTT TTAGATTTGC AAAATTAAAA 36374 .......... .......... .......... .......... .......... .......... 419 ATTATTTTGA TTTTAACAAG ATTAATGGCG ATGTTGCAGT ACCCTAATTG ATGGTGTATA 36434 .......... .......... .......... .......... .......... .......... 419 GTGGATGAAA TTATCACAAA GATTGAGGGA AGAAGATGAA CAGTGTTTAA AAATATTTTT 36494 .......... .......... .......... .......... .......... .......... 419 TGGCCTGAAA ATAAGCTCCT AACGGGCTGA CTTTGAGTGT ATTACACTCA CCATGTCTTG 36554 .......... .......... .......... .......... .......... .......... 419 TCAACAAAAA GTATCAAAAT AACACAATGA AATTCTACAT GATAGGGTTT AAAATAAATA 36614 ||| ||| | .CAATAAATA .......... .......... .......... .......... .......... 428 TTGCACAATT AAAATGTCTA AATAAAATTT CATACCAAGT TTAAGGATTC CATGGGTTAG 36674 .......... .......... .......... .......... .......... .......... 428 ATATATAATA AAAGAAGCAT GAGATAGCAT CATAAGATTG AGATTTAGGT AATTTCACAA 36734 .......... .......... .......... .......... .......... .......... 428 TTCATAATTC TAGGATTCAT AGCATAAGAA TTTTCAAGGC AAAATTGAAC TAGAAAGAGC 36794 .......... .......... .......... .......... .......... .......... 428 CGCAGATTAT ACAACAATAT GGATGAGCTG AAGATGATTC TAACAGAAGC ATTTGGCGAT 36854 .......... .......... .......... .......... .......... .......... 428 TCATCAAACA GCGAAGGCGA AGAAGAAGAA CAATTTCTCC ATGTTCATTC TGTTGAGAAT 36914 .......... .......... .......... .......... .......... .......... 428 AAGGTCAATG GAAAAGCCCT AATCAGGTCA GTGTTCGGAG AAACTCATAA TTGGGAAAGA 36974 .......... .......... .......... .......... .......... .......... 428 ATCAGTGAAA TTGATGGCCT TTGGCTGTGC AAAGACTTTT TATCTCCTGA TCAACAATCA 37034 .......... .......... .......... .......... .......... .......... 428 AAGTTGTTAT CATCAATCCA ACAAGGTAAT AGCCCCTCTC TTTGTTGCTC ATATTCATAC 37094 .......... .......... .......... .......... .......... .......... 428 TGTTTCAAAG CAGATTTAGG GCGGATTTTT TGGTTCAATT TTTCGAATAT ATATCAAATT 37154 .......... .......... .......... .......... .......... .......... 428 GAAGTAAAAT GTATGCGTAT AATTAGATAG CACTCCTTCT GAATCCATTA TTATACAATG 37214 .......... .......... .......... .......... .......... .......... 428 ACCTAGAATT TTGAATAATT CATTTACTAC TCTTCTAGAC TACCTACGCG GGCCTTGCAT 37274 .......... .......... .......... .......... .......... .......... 428 ATACACAGGG CGGAGCAACA AGGCGATTAC GGGTTCTGAG AAACGTAGTA GCTTTAGCCT 37334 .......... .......... .......... .......... .......... .......... 428 AGACACTGTA TTTGTGTTTG TATGGGGAAA AATCGGACTT AATAAAGAAG CGAAATCATG 37394 .......... .......... .......... .......... .......... .......... 428 TGGCTACACA TGGTAAGTCC AAATATGAAG GAACGTTTGT GGGCGAATGG TGAGGCTCTC 37454 .......... .......... .......... .......... .......... .......... 428 CCCGGTATGT CGGAGGAAGC CTTGAGCTCC TGGTTGGGAC TGACTTGCAA CCGAACAACC 37514 .......... .......... .......... .......... .......... .......... 428 AAAAGAAGCC TTGCAATCAG CTTCAAGTGA GAAGTTAGAA CATACACACT TGCTATGAAA 37574 .......... .......... .......... .......... .......... .......... 428 TAGTCTTTTC AATATCCAAG AGTGCTATCG TTTCACATTG AGGTGACGGT TCAGTTTTTT 37634 .......... .......... .......... .......... .......... .......... 428 GGTCTATAAA GAAGGTTTTG CAGGCAATAG AAGGCATGGT TTCTTAGAAA ACACCAAGAA 37694 .......... .......... .......... .......... .......... .......... 428 GTTTTAGTTA TTTGATAGTT CTCCCCCATT TTAGCCTCTC CATGAAAAAG GATTTAGAAC 37754 .......... .......... .......... .......... .......... .......... 428 CTCTCCCACT TGCACAATTC ATTTCCCATT TTGGAAGTTA TGTACTTGTA AAAAAATCAC 37814 .......... .......... .......... .......... .......... .......... 428 GACCTTGACG TTTTAGTCAT AGGGGAAATG AACTTGATTG GCCTTGACTC ACTTGCCTTT 37874 .......... .......... .......... .......... .......... .......... 428 GACCCCTTTT ACAAATTCAA TTATTAGATT GAACGCCAGT TCAATTTCCA TGTTGGCGGT 37934 .......... .......... .......... .......... .......... .......... 428 CTCAAATTTG AAACCCCTTG CCAGTGAAAG CAAGGGGTTT GATTTCCGAG TCAAGCTCGC 37994 .......... .......... .......... .......... .......... .......... 428 AACGGGCTTG CTTTAGTGCG GTTATCTCTC TTGTGTCGCA TGCGAGCTAT TGCATAGAAT 38054 .......... .......... .......... .......... .......... .......... 428 CAATTGTCTT ACCCTGTCCA CACCAAAGGG TAGCGGCGGC GGGTTTTCCT GGTCATTAAA 38114 .......... .......... .......... .......... .......... .......... 428 AAAAAAAGAT TGAATGCCGT TTTTTCCGGA AAACGATTTT TAAAAGTCCA CTTATTATGT 38174 .......... .......... .......... .......... .......... .......... 428 ATAAATAATT TATTCAGAAC CCAACAAACT CGTCTTTTCC AGAACCTGGA ACCCACACAC 38234 .......... .......... .......... .......... .......... .......... 428 TCGAAATTCT AGCCTTGCCT CTAAATTTAC TTATGTACAA ACATATATAC TCACACATTT 38294 .......... .......... .......... .......... .......... .......... 428 TAAAGATGAT TTGGCCTATA TATATATATA GGTATTCATA TACATATAAA GTTTGCATTT 38354 .......... .......... .......... .......... .......... .......... 428 TAAAGATTCT TTGCATATAG AAATTTTCAA CTATATGTAT ATAAAGTTTC AGCATAATAA 38414 .......... .......... .......... .......... .......... .......... 428 TGTAGTGAAT GGCCTTTTGT TTCAATTATA GCTCAATGCT TTTGAGAGAA CAGTTTAAGC 38474 .......... .......... .......... .......... .......... .......... 428 AGTCATCAGG GAGCTCCTTA TCACGGGTTT CCCTGTAAAT TTACTACTGC GCATGTATTA 38534 .......... .......... .......... .......... .......... .......... 428 TCAGTGATAT TTCAATGGTT TCATTTGGCT AAGGAGTAAG ACAAATGTAT TTGATCAGGC 38594 .......... .......... .......... .......... .......... .......... 428 AGAGCTCTTA TATTTGCTTC TTGATGGTTG TGGTTGCAGA AGGATGGTTT GCTGAGTCTT 38654 | ||||| .......... .......... .......... .........A AGGAT..... .......... 434 CAAGCAATCA GGTGTTGTTG GATCCTCAAA CCTTTGGGTA TTCTTGTATA ATACACTTAT 38714 .......... .......... .......... .......... .......... .......... 434 TCCCTTTAAC TTGTTATATC TTATTCCCTT ACATATCAAC TTGAGAATCA TGCATTCATC 38774 .......... .......... .......... .......... .......... .......... 434 AAGTGTGATC AATAATCTTG TGTCATTTGG CTGAAGGATA TATTACCCGA CGACCACTAT 38834 .......... .......... .......... .......... .......... .......... 434 ATTCATGTTA AATCTTTAGT TTTCATACTT ATGCCCTTGA GTTGTGTAAG AGCATTTGAA 38894 .......... .......... .......... .......... .......... .......... 434 ATAAGATGAT TGTTGTAGTA GTAATTTTTA TGTCAAAATG TAACTTCTCT ATTTACTGGC 38954 .......... .......... .......... .......... .......... .......... 434 GCTAAATCTT GAGAATACTA TGACTACCAC TTGAGGCAGA AGATGAACTT ATGACATTTC 39014 .......... .......... .......... .......... .......... .......... 434 TCTCTGAATA TTGTGAATGG AATAAATGTT TTTCATTTAT TTGTCTGCCG TTTATAATTG 39074 .......... .......... .......... .......... .......... .......... 434 TACGTTAGTA TTCATTCTGG GGAGACTTAT CAAAACAAGG ACTTGAATAC TGAGGGGTTG 39134 .......... .......... .......... .......... .......... .......... 434 TTTGGTATGT GAGATTTTAT GTGTTTGGAT AGGGGTATTA GTTCGTTTCA GAATTATTTA 39194 .......... .......... .......... .......... .......... .......... 434 TCCCACCATT TATACTGCTA TGATGGGATA AGTTATCACA TTATATGGTG GGATAACTTT 39254 .......... .......... .......... .......... .......... .......... 434 GTCAATCCCA TCCCCTACCA AACAACCCCT AAGAGATTTT GGATTAAAAA AACATATATA 39314 .......... .......... .......... .......... .......... .......... 434 AGCCAAAAGG GCTGGATGCT CAAGAATTTA TTTTTTTTTA AAAAAAAAAG AGATTCTGAA 39374 .......... .......... .......... .......... .......... .......... 434 AGTACTAATC ATCACGATGT TGATTCGTAC GCTATTTCGA CCGTGTGCAT CAAACCAATA 39434 .......... .......... .......... .......... .......... .......... 434 TCCTTCGTTC AAAGTTTAAA AACTTTACAC TCTGCTTTCA GCTTTGTTTT TACCCTATAG 39494 .......... .......... .......... .......... .......... .......... 434 TTGTTAATTA ACCTTCGAAT GAGCATACTG CAACGCATAT CTGTCATGTA CTTGAACAAT 39554 .......... .......... .......... .......... .......... .......... 434 GGAAAAGTTC ATCCTGCAGT TCTCTAAGAA AGACAGCAGG GCTGTTGGTT CTGTTTAACA 39614 .......... .......... .......... .......... .......... .......... 434 TTTTCTTTTC CTTTGTTTAA GTGACTAGTT TAGCATTTTT GTAAACCAAC TACAAATTTT 39674 .......... .......... .......... .......... .......... .......... 434 TGAATTTCTC ATAAATCATA ATTAGATCAC CCTGTTCGAC TCTTTTAACC TGTAGTCATT 39734 .......... .......... .......... .......... .......... .......... 434 TATCAATGCA CTTGATCACA TGTATTTTCA TCTCATACAG GCTATGAGAT TTGGCGACCT 39794 .......... .......... .......... .......... .......... .......... 434 ACCAGGATGG GCGGTTGAGC TCTCCAGGTC TATCCATGAG GTGATTCTTT TCGGTAGTTA 39854 .......... .......... .......... .......... .......... .......... 434 TGCTGCAGAG TTGGAAAACT GTGAAAAGGG CAAGGAAGCA TGTATTTTTC CACAAGATCT 39914 .......... .......... .......... .......... .......... .......... 434 GTTATGGAGG GAACCTCTAT TTGATCAACT TATAGCTAAC ATGTATCAA 39963 |||| ||| |||| .......... .......... .......... .....ATAAC ATGATTCAA 448 hqPGS_C09HBa0099P03.1-1+_SGN-U328267- (35718 36023) ******************************************************************************** EST sequence 2 -strand 844 n (File: SGN-U327561-) 1 TTCCTCCTGA CCCTTCTTCT TCTTCTTCTC CAATTCTACT TCCTTTTACC TTTTGCACTC 61 AAAAAACTCT TCAGTTTCCT AAAAAAGTAC ATTTCGATAA TGGCTTCTCA GCAAGATGAG 121 CTAAAACACA GAAGTACTAC GAAATCACAA CAAACAGAGC AATACACAAA ATCTGCTCAC 181 GATAAGGATT CAAAATCGAA CAAAAACATC AACAGATCAA CAAGAAAACA GATCGCTAAA 241 CGAGGCGTCA AATCATTGAC AATCGCTTTA TCAATTCCAC TTCTATTAAC CCTAATTGAC 301 ATTTCTCTAT TCGGATCAAG TTACCAGTAC GTTTCAATGG AGAAGCCTTT CTGGTTTCCG 361 CGTCTATGGG CTTTACATTT AGCCTGTTTA GGTTCTTCTC TTCTAATGGG TCTTTCTGCT 421 TGGCTTGTTT GGGCTGAAGG TGGGTTTCAT CGTCAACCTA TGGCTATAAT TTTGTATTTA 481 GCTCAATTAG GGTTGAGTTT GGCTTGGGAT CCAGTTGTGT TCAAAGCAGG TGCTACTAGA 541 ATTGGGTTAG TGTTATGTGT GGCTTTGTTT GGAGTGTTGA TTGGTTGTTT TAGGGCTTTT 601 AAAAATGTGA ATCCTATTGC TGGGGATTTG GTTAAACCTT GTTTTGGATG GGCTGTGCTT 661 TTGAGTTTAG CAAATCTTAA GCTTGTGTAT CATTAGGAAG AAATAAATAA TATGCACTTG 721 TTTTTTGTTT TGTTTTGGCT TGTGAGGCTT ATGTACTGTA AATTTACATG TTCTATATTT 781 ATGTTTTAAT TAAATTAAAT TTACATGTGT GTTCTATAAA AAAAAAAAAA AAAAAAACTC 841 GAGA Predicted gene structure (within gDNA segment 42970 to 40683): Exon 1 42360 41544 ( 817 n); cDNA 1 817 ( 817 n); score: 0.991 PPA cDNA 818 838 MATCH C09HBa0099P03.1-1- SGN-U327561- 0.991 817 0.968 C PGS_C09HBa0099P03.1-1-_SGN-U327561- (42360 41544) Alignment (genomic DNA sequence = upper lines): TTCCTCCTGA CCCTTCTTCT TCTTCTTCTC CAATTCTACT TCCTTTTACC TTTTGCACTC 42301 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCCTCCTGA CCCTTCTTCT TCTTCTTCTC CAATTCTACT TCCTTTTACC TTTTGCACTC 60 AAAAAACTCT CTCACTTTCC TA-AAAAGTA CATTTTGATA ATGGCTTCTC AACAAGATGA 42242 |||||||||| ||| ||||| || ||||||| ||||| |||| |||||||||| | |||||||| AAAAAACTCT -TCAGTTTCC TAAAAAAGTA CATTTCGATA ATGGCTTCTC AGCAAGATGA 119 GCTAAAACAC AGAAGTACTA CGAAATCACA ACAAACAGAG CAATACACAA AATCTGCTCA 42182 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCTAAAACAC AGAAGTACTA CGAAATCACA ACAAACAGAG CAATACACAA AATCTGCTCA 179 CGATAAGGAT TCAAAATCGA ACAAAAACAT CAACAGATCA ACAAGAAAAC AGATCGCTAA 42122 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CGATAAGGAT TCAAAATCGA ACAAAAACAT CAACAGATCA ACAAGAAAAC AGATCGCTAA 239 ACGAGGCGTC AAATCATTGA CAATCGCTTT ATCAATTCCA CTTCTATTAA CCCTAATTGA 42062 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACGAGGCGTC AAATCATTGA CAATCGCTTT ATCAATTCCA CTTCTATTAA CCCTAATTGA 299 CATCTCTCTA TTCGGATCAA GTTACCAGTA CGTTTCAATG GAGAAGCCTT TCTGGTTTCC 42002 ||| |||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CATTTCTCTA TTCGGATCAA GTTACCAGTA CGTTTCAATG GAGAAGCCTT TCTGGTTTCC 359 GCGTCTATGG GCTTTACATT TAGCCTGTTT AGGTTCTTCT CTTCTAATGG GTCTTTCTGC 41942 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCGTCTATGG GCTTTACATT TAGCCTGTTT AGGTTCTTCT CTTCTAATGG GTCTTTCTGC 419 TTGGCTTGTT TGGGCTGAAG GTGGGTTTCA TCGTCAACCT ATGGCTATAA TTTTGTATTT 41882 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGGCTTGTT TGGGCTGAAG GTGGGTTTCA TCGTCAACCT ATGGCTATAA TTTTGTATTT 479 AGCTCAATTA GGGTTGAGTT TGGCTTGGGA TCCAGTTGTG TTCAAAGCAG GTGCTACTAG 41822 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGCTCAATTA GGGTTGAGTT TGGCTTGGGA TCCAGTTGTG TTCAAAGCAG GTGCTACTAG 539 AATTGGGTTA GTGTTATGTG TGGCTTTGTT TGGAGTGTTG ATTGGTTGTT TTAGGGCTTT 41762 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AATTGGGTTA GTGTTATGTG TGGCTTTGTT TGGAGTGTTG ATTGGTTGTT TTAGGGCTTT 599 TAAAAATGTG AATCCTATTG CTGGGGATTT GGTTAAACCT TGTTTTGGAT GGGCTGTGCT 41702 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TAAAAATGTG AATCCTATTG CTGGGGATTT GGTTAAACCT TGTTTTGGAT GGGCTGTGCT 659 TTTGAGTTTA GCAAATCTTA AGCTTGTGTA TCATTAGGAA GAAATAAATA ATATGCACTT 41642 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTGAGTTTA GCAAATCTTA AGCTTGTGTA TCATTAGGAA GAAATAAATA ATATGCACTT 719 GTTTTTTGTT TTGTTTTGGC TTGTGAGGCT TATGTACTGT AAATTTACAT GTTCTATATT 41582 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTTTTTTGTT TTGTTTTGGC TTGTGAGGCT TATGTACTGT AAATTTACAT GTTCTATATT 779 TATGTTTTAA TTAAATTAAA TTTACATGTG TGTTCTAT 41544 |||||||||| |||||||||| |||||||||| |||||||| TATGTTTTAA TTAAATTAAA TTTACATGTG TGTTCTAT 817 hqPGS_C09HBa0099P03.1-1-_SGN-U327561- (42360 41544) Total number of EST alignments reported: 14 ________________________________________________________________________________ Predicted gene locations (8) in segment 1 to 45342: PGL 1 (- strand): 4246 1215 AGS-1 (1689 1629,1545 1372,1286 1215) SCR (e 1.000 d 0.997 a 0.988,e 1.000 d 0.877 a 0.929,e 1.000) Exon 1 1689 1629 ( 61 n); score: 1.000 Intron 1 1628 1546 ( 83 n); Pd: 0.997 Pa: 0.988 Exon 2 1545 1372 ( 174 n); score: 1.000 Intron 2 1371 1287 ( 85 n); Pd: 0.877 Pa: 0.929 Exon 3 1286 1215 ( 72 n); score: 1.000 PGS (1689 1629,1545 1372,1286 1215) SGN-U331997+ 3-phase translation of AGS-1 (-strand): . . . . . . 1689 CCGTCTTGATAGCTCAAGTAATGATGAAATTGATAAACTTGCTATTAATGAACTCTTTGA P S - - L K - - - N - - T C Y - - T L - R L D S S S N D E I D K L A I N E L F E V L I A Q V M M K L I N L L L M N S L . : . . . . . 1629 G : GTTGCTTTAAAAGAAAGTAAAAGTGGCCCTCTTGTTTTGTTCATCAAAGACATTGAGAA : G C F K R K - K W P S C F V H Q R H - E : V A L K E S K S G P L V L F I K D I E K R : L L - K K V K V A L L F C S S K T L R . . . . . . 1486 GTCTATGGTGGGTAATCCTGAGGCCTATGCTGCTTTCAAGATTAAGCTCGAGCATTTGCC V Y G G - S - G L C C F Q D - A R A F A S M V G N P E A Y A A F K I K L E H L P S L W W V I L R P M L L S R L S S S I C . . . . . . : 1426 AGAGAATGTTGTTGCCATAGCTTCCCATGCCCAGTCGGACAGCCGAAAGGAGAAA : TCGCA R E C C C H S F P C P V G Q P K G E : I A E N V V A I A S H A Q S D S R K E K : S H Q R M L L P - L P M P S R T A E R R N : R . . . . . . 1281 TCCTGGTGGCTTGCTATTTACAAAATTTGGAAGTAACCAAACAGCATTGCTTGACCTTGC S W W L A I Y K I W K - P N S I A - P C P G G L L F T K F G S N Q T A L L D L A I L V A C Y L Q N L E V T K Q H C L T L . 1221 CTTCCCA L P F P P S Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-1-_PGL-1_AGS-1_PPS_1 (1688 1629,1545 1372,1286 1215) (frame '2'; 306 bp, 102 residues) 1 RLDSSSNDEI DKLAINELFE VALKESKSGP LVLFIKDIEK SMVGNPEAYA AFKIKLEHLP 61 ENVVAIASHA QSDSRKEKSH PGGLLFTKFG SNQTALLDLA FP AGS-2 (4246 4114,3941 3820,3734 3643,2709 2487,2340 2277,1966 1818) SCR (e 0.752 d 0.958 a 0.859,e 0.820 d 0.956 a 0.729,e 0.859 d 0.206 a 0.997,e 0.830 d 0.973 a 0.842,e 0.695 d 0.988 a 0.998,e 0.805) Exon 1 4246 4114 ( 133 n); score: 0.752 Intron 1 4113 3942 ( 172 n); Pd: 0.958 Pa: 0.859 Exon 2 3941 3820 ( 122 n); score: 0.820 Intron 2 3819 3735 ( 85 n); Pd: 0.956 Pa: 0.729 Exon 3 3734 3643 ( 92 n); score: 0.859 Intron 3 3642 2710 ( 933 n); Pd: 0.206 Pa: 0.997 Exon 4 2709 2487 ( 223 n); score: 0.830 Intron 4 2486 2341 ( 146 n); Pd: 0.973 Pa: 0.842 Exon 5 2340 2277 ( 64 n); score: 0.695 Intron 5 2276 1967 ( 310 n); Pd: 0.988 Pa: 0.998 Exon 6 1966 1818 ( 149 n); score: 0.805 PGS (4246 4114,3941 3820,3734 3643,2709 2487,2340 2277,1966 1818) SGN-U347100+ 3-phase translation of AGS-2 (-strand): . . . . . . 4246 AACTACTCAAGGATTTTGATCGTCCAGTTTCTGCTTTAACTAGGCGCCAAACATTTAAAA N Y S R I L I V Q F L L - L G A K H L K T T Q G F - S S S F C F N - A P N I - K L L K D F D R P V S A L T R R Q T F K . . . . . . 4186 ATGCCTTACAGCAAGGAGTAGTTGATTTCAACACTATTGATGTCACATTTGAAAATTTTC M P Y S K E - L I S T L L M S H L K I F C L T A R S S - F Q H Y - C H I - K F S N A L Q Q G V V D F N T I D V T F E N F . . : . . . . 4126 CATATTACTTATG : TGAAAATACAAAGAATGTTCTGATTGCTTCCACTTATATACACTTGA H I T Y : V K I Q R M F - L L P L I Y T - I L L M : - K Y K E C S D C F H L Y T L E P Y Y L C : E N T K N V L I A S T Y I H L . . . . . . 3894 AGTGTAACGGGTTTGCAAAATTTGCATCAGATCTTCCCACAGTGTGCCCTAGGATTTTGC S V T G L Q N L H Q I F P Q C A L G F C V - R V C K I C I R S S H S V P - D F A K C N G F A K F A S D L P T V C P R I L . . : . . . . 3834 TATCAGGTCCAGCAG : GTTCAGAGATTTATCAGGAGACATTGGCCAAAGCACTTGCTAAGT Y Q V Q Q : V Q R F I R R H W P K H L L S I R S S R : F R D L S G D I G Q S T C - V L S G P A : G S E I Y Q E T L A K A L A K . . . . . : . 3689 ACTTTTGTGCTAAGCTAATGATAGTTGATTCTCTCTTGCTGCCTGGT : GTTTCAAGTTCCA T F V L S - - - L I L S C C L V : F Q V P L L C - A N D S - F S L A A W : C F K F Q Y F C A K L M I V D S L L L P G : V S S S . . . . . . 2696 AAGATGTCGAGCCTGTTAAAGTAAGCTCAAAACCAGAGAGAGCTAGTGTATTTGCTAAAC K M S S L L K - A Q N Q R E L V Y L L N R C R A C - S K L K T R E S - C I C - T K D V E P V K V S S K P E R A S V F A K . . . . . . 2636 GTGCGGCGCAAGCGGCAGCATTGCATCTGAATAAAAAGCCGGCTTCAAGTGTTGAGGCTG V R R K R Q H C I - I K S R L Q V L R L C G A S G S I A S E - K A G F K C - G - R A A Q A A A L H L N K K P A S S V E A . . . . . . 2576 ATATAACTGGTGGTTCAATTTTAAGTTCTCATGCTCAGCCCAAGCAGGAGGCATCGACTG I - L V V Q F - V L M L S P S R R H R L Y N W W F N F K F S C S A Q A G G I D C D I T G G S I L S S H A Q P K Q E A S T . . . : . . . 2516 CCTCATCAAAAAACTATACTTTTAAGAAAG : GTGATAGAGTGAAGTACATCGGATCTTTAA P H Q K T I L L R K : V I E - S T S D L - L I K K L Y F - E R : - - S E V H R I F N A S S K N Y T F K K : G D R V K Y I G S L . . . . : . . 2310 CATCAAGCTTTTCTCCGTTGCAATCACCTATAAG : GGGTCCAACATATGGTTACAGGGGCA H Q A F L R C N H L - : G V Q H M V T G A I K L F S V A I T Y K : G S N I W L Q G Q T S S F S P L Q S P I R : G P T Y G Y R G . . . . . . 1940 AAGTGGTTCTTGCATTTGAGGAAAATGGGTCCTCTAAAATTGGTGTCAGATTTGATAGAT K W F L H L R K M G P L K L V S D L I D S G S C I - G K W V L - N W C Q I - - I K V V L A F E E N G S S K I G V R F D R . . . . . . 1880 CAATTCCTGAGGGTAATGATCTTGGTGGCCTGTGCGATGAAGATCATGGGTTCTTTTGTG Q F L R V M I L V A C A M K I M G S F V N S - G - - S W W P V R - R S W V L L C S I P E G N D L G G L C D E D H G F F C . 1820 CTG L A Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-1-_PGL-1_AGS-2_PPS_1 (4244 4114,3941 3820,3734 3643,2709 2487,2340 2277,1966 1819) (frame '0'; 780 bp, 260 residues) 1 LLKDFDRPVS ALTRRQTFKN ALQQGVVDFN TIDVTFENFP YYLCENTKNV LIASTYIHLK 61 CNGFAKFASD LPTVCPRILL SGPAGSEIYQ ETLAKALAKY FCAKLMIVDS LLLPGVSSSK 121 DVEPVKVSSK PERASVFAKR AAQAAALHLN KKPASSVEAD ITGGSILSSH AQPKQEASTA 181 SSKNYTFKKG DRVKYIGSLT SSFSPLQSPI RGPTYGYRGK VVLAFEENGS SKIGVRFDRS 241 IPEGNDLGGL CDEDHGFFCA PGL 2 (- strand): 15380 14945 AGS-1 (15380 14945) SCR (e 0.733) Exon 1 15380 14945 ( 436 n); score: 0.733 PGS (15380 14945) SGN-U340386- 3-phase translation of AGS-1 (-strand): . . . . . . 15380 CCCTAACACTTGTTTGGATCATTGTTATCAATTGTATTGTATTGTATCGTTATTATACCT P - H L F G S L L S I V L Y C I V I I P P N T C L D H C Y Q L Y C I V S L L Y L L T L V W I I V I N C I V L Y R Y Y T . . . . . . 15320 ACAATGGTTGTTTTGATTGTTACTTAAAATATATTGTACTGTATTGTTAAATTTCGTTGT T M V V L I V T - N I L Y C I V K F R C Q W L F - L L L K I Y C T V L L N F V V Y N G C F D C Y L K Y I V L Y C - I S L . . . . . . 15260 TACGTAACAATGGAAATCCCTATTTTATGAAACAACTAATTTGGTGTGTTCCCATTGTTA Y V T M E I P I L - N N - F G V F P L L T - Q W K S L F Y E T T N L V C S H C Y L R N N G N P Y F M K Q L I W C V P I V . . . . . . 15200 CTAGTTTCTTAATCTTTACATAATATTTCAAAATACTATTTTACCCTTTACCTTAATTAT L V S - S L H N I S K Y Y F T L Y L N Y - F L N L Y I I F Q N T I L P F T L I I T S F L I F T - Y F K I L F Y P L P - L . . . . . . 15140 TTAAACCTAGTCAAACCTCCTACCCTGAAATAATTAAGGATATTTATAAATTACATTACT L N L V K P P T L K - L R I F I N Y I T - T - S N L L P - N N - G Y L - I T L L F K P S Q T S Y P E I I K D I Y K L H Y . . . . . . 15080 GTATGATACAGTCAAACCAAACAATTAAAATGTTACTAAACAAAACAAACAATACAATCT V - Y S Q T K Q L K C Y - T K Q T I Q S Y D T V K P N N - N V T K Q N K Q Y N L C M I Q S N Q T I K M L L N K T N N T I . . . . . . 15020 AACCAAACATTGTATCTACCATACATTACGATACAATACATTATGAAACAATAGATAACA N Q T L Y L P Y I T I Q Y I M K Q - I T T K H C I Y H T L R Y N T L - N N R - Q - P N I V S T I H Y D T I H Y E T I D N . . 14960 ATAATCCAAACAAAAT I I Q T K - S K Q N N N P N K Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-1 (+strand): . . . . . . 14945 ATTTTGTTTGGATTATTGTTATCTATTGTTTCATAATGTATTGTATCGTAATGTATGGTA I L F G L L L S I V S - C I V S - C M V F C L D Y C Y L L F H N V L Y R N V W - F V W I I V I Y C F I M Y C I V M Y G . . . . . . 15005 GATACAATGTTTGGTTAGATTGTATTGTTTGTTTTGTTTAGTAACATTTTAATTGTTTGG D T M F G - I V L F V L F S N I L I V W I Q C L V R L Y C L F C L V T F - L F G R Y N V W L D C I V C F V - - H F N C L . . . . . . 15065 TTTGACTGTATCATACAGTAATGTAATTTATAAATATCCTTAATTATTTCAGGGTAGGAG F D C I I Q - C N L - I S L I I S G - E L T V S Y S N V I Y K Y P - L F Q G R R V - L Y H T V M - F I N I L N Y F R V G . . . . . . 15125 GTTTGACTAGGTTTAAATAATTAAGGTAAAGGGTAAAATAGTATTTTGAAATATTATGTA V - L G L N N - G K G - N S I L K Y Y V F D - V - I I K V K G K I V F - N I M - G L T R F K - L R - R V K - Y F E I L C . . . . . . 15185 AAGATTAAGAAACTAGTAACAATGGGAACACACCAAATTAGTTGTTTCATAAAATAGGGA K I K K L V T M G T H Q I S C F I K - G R L R N - - Q W E H T K L V V S - N R D K D - E T S N N G N T P N - L F H K I G . . . . . . 15245 TTTCCATTGTTACGTAACAACGAAATTTAACAATACAGTACAATATATTTTAAGTAACAA F P L L R N N E I - Q Y S T I Y F K - Q F H C Y V T T K F N N T V Q Y I L S N N I S I V T - Q R N L T I Q Y N I F - V T . . . . . . 15305 TCAAAACAACCATTGTAGGTATAATAACGATACAATACAATACAATTGATAACAATGATC S K Q P L - V - - R Y N T I Q L I T M I Q N N H C R Y N N D T I Q Y N - - Q - S I K T T I V G I I T I Q Y N T I D N N D . . 15365 CAAACAAGTGTTAGGG Q T S V R K Q V L G P N K C - Maximal non-overlapping open reading frames (>= 64 codons): none PGL 3 (- strand): 17242 16950 AGS-1 (17242 16950) SCR (e 0.819) Exon 1 17242 16950 ( 293 n); score: 0.819 PGS (17242 16950) SGN-U330540+ 3-phase translation of AGS-1 (-strand): . . . . . . 17242 TTAGTCTAAAGAGAGTTCTTATTAGTTGAAGGGAGGTGTTCTTTTTTTGTGGAGCTTTGG L V - R E F L L V E G R C S F F V E L W - S K E S S Y - L K G G V L F L W S F G S L K R V L I S - R E V F F F C G A L . . . . . . 17182 ACTCAACTCCTGTCCAGAGTTGTTGAGTTATACTTTGTAAAGGCTGTTGTATCCTGGAGG T Q L L S R V V E L Y F V K A V V S W R L N S C P E L L S Y T L - R L L Y P G G D S T P V Q S C - V I L C K G C C I L E . . . . . . 17122 GGACAAGTCAAAGAGGACTACTGCTGGACCGGTGAAAACATTTGCTGCAGTGGGCTTGAA G Q V K E D Y C W T G E N I C C S G L E D K S K R T T A G P V K T F A A V G L N G T S Q R G L L L D R - K H L L Q W A - . . . . . . 17062 TCTCCTTAAAGAGAGCGAGATATCCGCGCCTCAGCCTGAAGAGATTACTTTCTTCATTTT S P - R E R D I R A S A - R D Y F L H F L L K E S E I S A P Q P E E I T F F I L I S L K R A R Y P R L S L K R L L S S F . . . . . . 17002 ATTTTCAATTGTAATCTTGCAATTTTATTATCTTGTAAGTTTTTTTTCACTAA I F N C N L A I L L S C K F F F T F S I V I L Q F Y Y L V S F F S L Y F Q L - S C N F I I L - V F F H - Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-1-_PGL-3_AGS-1_PPS_1 (17142 16951) (frame '2'; 192 bp, 64 residues) 1 RLLYPGGDKS KRTTAGPVKT FAAVGLNLLK ESEISAPQPE EITFFILFSI VILQFYYLVS 61 FFSL 3-phase translation of AGS-1 (+strand): . . . . . . 16950 TTAGTGAAAAAAAACTTACAAGATAATAAAATTGCAAGATTACAATTGAAAATAAAATGA L V K K N L Q D N K I A R L Q L K I K - - - K K T Y K I I K L Q D Y N - K - N E S E K K L T R - - N C K I T I E N K M . . . . . . 17010 AGAAAGTAATCTCTTCAGGCTGAGGCGCGGATATCTCGCTCTCTTTAAGGAGATTCAAGC R K - S L Q A E A R I S R S L - G D S S E S N L F R L R R G Y L A L F K E I Q A K K V I S S G - G A D I S L S L R R F K . . . . . . 17070 CCACTGCAGCAAATGTTTTCACCGGTCCAGCAGTAGTCCTCTTTGACTTGTCCCCTCCAG P L Q Q M F S P V Q Q - S S L T C P L Q H C S K C F H R S S S S P L - L V P S R P T A A N V F T G P A V V L F D L S P P . . . . . . 17130 GATACAACAGCCTTTACAAAGTATAACTCAACAACTCTGGACAGGAGTTGAGTCCAAAGC D T T A F T K Y N S T T L D R S - V Q S I Q Q P L Q S I T Q Q L W T G V E S K A G Y N S L Y K V - L N N S G Q E L S P K . . . . . . 17190 TCCACAAAAAAAGAACACCTCCCTTCAACTAATAAGAACTCTCTTTAGACTAA S T K K E H L P S T N K N S L - T P Q K K N T S L Q L I R T L F R L L H K K R T P P F N - - E L S L D - Maximal non-overlapping open reading frames (>= 64 codons): none PGL 4 (+ strand): 20648 23640 AGS-1 (20648 21796) SCR (e 0.988) Exon 1 20648 21796 (1149 n); score: 0.988 PGS (20648 21796) SGN-U331580+ 3-phase translation of AGS-1 (+strand): . . . . . . 20648 GTCTGGACACCAAGGATGTTGAAGTGGAACCTGAGCAGGCAGTGTCTGGAACTATATATG V W T P R M L K W N L S R Q C L E L Y M S G H Q G C - S G T - A G S V W N Y I C L D T K D V E V E P E Q A V S G T I Y . . . . . . 20708 CCAATGGTGACCATTCCGGAGAAAGCGTCGAGCGAGATGTAGTGGAAGTTGAAGTCTCTG P M V T I P E K A S S E M - W K L K S L Q W - P F R R K R R A R C S G S - S L W A N G D H S G E S V E R D V V E V E V S . . . . . . 20768 GTCAAACATCTGCTATATCAAGGTCAATCACTGGCTCAGAGCAAGAAGGAGAAGCTAAAG V K H L L Y Q G Q S L A Q S K K E K L K S N I C Y I K V N H W L R A R R R S - R G Q T S A I S R S I T G S E Q E G E A K . . . . . . 20828 ATCATATAGATGAAGAAGCTAACCTTGAAGGCTCAGTTTCAGATGGAGAGACAGATGGTA I I - M K K L T L K A Q F Q M E R Q M V S Y R - R S - P - R L S F R W R D R W Y D H I D E E A N L E G S V S D G E T D G . . . . . . 20888 TGATTTTTGGAAGCTCTGAAGCTGCCAAACAGTTTATGGAGGAGCTGGAAAGGGAATCTG - F L E A L K L P N S L W R S W K G N L D F W K L - S C Q T V Y G G A G K G I W M I F G S S E A A K Q F M E E L E R E S . . . . . . 20948 GTGGTGGCTCCTATGCTGGTGCTGAGGTTTCTCAGGATATTGATGGTCAGATTGTCACCG V V A P M L V L R F L R I L M V R L S P W W L L C W C - G F S G Y - W S D C H R G G G S Y A G A E V S Q D I D G Q I V T . . . . . . 21008 ACTCAGATGAGGAGGCTGATACTGATGAAGAAGGAGATGTGAAGGAGTTGTTTGATTCAG T Q M R R L I L M K K E M - R S C L I Q L R - G G - Y - - R R R C E G V V - F S D S D E E A D T D E E G D V K E L F D S . . . . . . 21068 CTGCCTTAGCTGCTCTTTTAAAAGCAGCAACAGGTGGTGATTCTGATGGTGGCAACATAA L P - L L F - K Q Q Q V V I L M V A T - C L S C S F K S S N R W - F - W W Q H N A A L A A L L K A A T G G D S D G G N I . . . . . . 21128 CAGTCACGTCTCAAGATGGATCAAGACTGTTCTCTGTTGAACGTCCTGCTGGTCTTGGGT Q S R L K M D Q D C S L L N V L L V L G S H V S R W I K T V L C - T S C W S W V T V T S Q D G S R L F S V E R P A G L G . . . . . . 21188 CATCACTCCGGTCACTGAGGCCAGCTCCCCGACCAAGCCAACCCAATCTTTTTACTCATT H H S G H - G Q L P D Q A N P I F L L I I T P V T E A S S P T K P T Q S F Y S F S S L R S L R P A P R P S Q P N L F T H . . . . . . 21248 CCAATCTCCAGAACAGTGGAGAATCTGAGAACAACTTGAGTGAAGAAGAGAAGAAGAAAC P I S R T V E N L R T T - V K K R R R N Q S P E Q W R I - E Q L E - R R E E E T S N L Q N S G E S E N N L S E E E K K K . . . . . . 21308 TGGACACACTACAGCAGATCAGGGTCAAGTTTTTGAGGCTTATTCACAGGTTGGGTCTAT W T H Y S R S G S S F - G L F T G W V Y G H T T A D Q G Q V F E A Y S Q V G S I L D T L Q Q I R V K F L R L I H R L G L . . . . . . 21368 CTTCTGATGAGCCCATAGCTGCACAAGTTTTGTACCGGATGACACTTATTGCACGAAGGC L L M S P - L H K F C T G - H L L H E G F - - A H S C T S F V P D D T Y C T K A S S D E P I A A Q V L Y R M T L I A R R . . . . . . 21428 AAAACAGTCCACTTTTTAGCGTTGAGGCTGCCAAGATGAAAGCTTTCCAGCTTGAAGCAG K T V H F L A L R L P R - K L S S L K Q K Q S T F - R - G C Q D E S F P A - S R Q N S P L F S V E A A K M K A F Q L E A . . . . . . 21488 AGGGGAAAGATGATTTGGACTTCTCTGTGAATATCCTGGTTATTGGCAAATCTGGGGTGG R G K M I W T S L - I S W L L A N L G W G E R - F G L L C E Y P G Y W Q I W G G E G K D D L D F S V N I L V I G K S G V . . . . . . 21548 GTAAGAGCGCTACCATAAACTCTATCTTTGGAGAGGAAAAAACATCAATTGATGCCTTTG V R A L P - T L S L E R K K H Q L M P L - E R Y H K L Y L W R G K N I N - C L W G K S A T I N S I F G E E K T S I D A F . . . . . . 21608 GACCTGCTACCACCAGTGTGAAAGAGATCAGTGGTGTTGTAGATGGTGTTAAGATTCGGG D L L P P V - K R S V V L - M V L R F G T C Y H Q C E R D Q W C C R W C - D S G G P A T T S V K E I S G V V D G V K I R . . . . . . 21668 TGTTTGATACACCTGGCCTCAAGTCCTCTGCGATGGAACAGGGTTTCAATCGCAGTGTCT C L I H L A S S P L R W N R V S I A V S V - Y T W P Q V L C D G T G F Q S Q C L V F D T P G L K S S A M E Q G F N R S V . . . . . . 21728 TGTCTTCAGTAAAGAAGTTGACTAAGAAGAATCCCCCTGATATTTACCTCTATGTCGATC C L Q - R S - L R R I P L I F T S M S I V F S K E V D - E E S P - Y L P L C R S L S S V K K L T K K N P P D I Y L Y V D . 21788 GGTTGGATG G W M V G R L D Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-1+_PGL-4_AGS-1_PPS_1 (20650 21795) (frame '0'; 1146 bp, 382 residues) 1 LDTKDVEVEP EQAVSGTIYA NGDHSGESVE RDVVEVEVSG QTSAISRSIT GSEQEGEAKD 61 HIDEEANLEG SVSDGETDGM IFGSSEAAKQ FMEELERESG GGSYAGAEVS QDIDGQIVTD 121 SDEEADTDEE GDVKELFDSA ALAALLKAAT GGDSDGGNIT VTSQDGSRLF SVERPAGLGS 181 SLRSLRPAPR PSQPNLFTHS NLQNSGESEN NLSEEEKKKL DTLQQIRVKF LRLIHRLGLS 241 SDEPIAAQVL YRMTLIARRQ NSPLFSVEAA KMKAFQLEAE GKDDLDFSVN ILVIGKSGVG 301 KSATINSIFG EEKTSIDAFG PATTSVKEIS GVVDGVKIRV FDTPGLKSSA MEQGFNRSVL 361 SSVKKLTKKN PPDIYLYVDR LD 3-phase translation of AGS-1 (-strand): . . . . . . 21796 CATCCAACCGATCGACATAGAGGTAAATATCAGGGGGATTCTTCTTAGTCAACTTCTTTA H P T D R H R G K Y Q G D S S - S T S L I Q P I D I E V N I R G I L L S Q L L Y S N R S T - R - I S G G F F L V N F F . . . . . . 21736 CTGAAGACAAGACACTGCGATTGAAACCCTGTTCCATCGCAGAGGACTTGAGGCCAGGTG L K T R H C D - N P V P S Q R T - G Q V - R Q D T A I E T L F H R R G L E A R C T E D K T L R L K P C S I A E D L R P G . . . . . . 21676 TATCAAACACCCGAATCTTAACACCATCTACAACACCACTGATCTCTTTCACACTGGTGG Y Q T P E S - H H L Q H H - S L S H W W I K H P N L N T I Y N T T D L F H T G G V S N T R I L T P S T T P L I S F T L V . . . . . . 21616 TAGCAGGTCCAAAGGCATCAATTGATGTTTTTTCCTCTCCAAAGATAGAGTTTATGGTAG - Q V Q R H Q L M F F P L Q R - S L W - S R S K G I N - C F F L S K D R V Y G S V A G P K A S I D V F S S P K I E F M V . . . . . . 21556 CGCTCTTACCCACCCCAGATTTGCCAATAACCAGGATATTCACAGAGAAGTCCAAATCAT R S Y P P Q I C Q - P G Y S Q R S P N H A L T H P R F A N N Q D I H R E V Q I I A L L P T P D L P I T R I F T E K S K S . . . . . . 21496 CTTTCCCCTCTGCTTCAAGCTGGAAAGCTTTCATCTTGGCAGCCTCAACGCTAAAAAGTG L S P L L Q A G K L S S W Q P Q R - K V F P L C F K L E S F H L G S L N A K K W S F P S A S S W K A F I L A A S T L K S . . . . . . 21436 GACTGTTTTGCCTTCGTGCAATAAGTGTCATCCGGTACAAAACTTGTGCAGCTATGGGCT D C F A F V Q - V S S G T K L V Q L W A T V L P S C N K C H P V Q N L C S Y G L G L F C L R A I S V I R Y K T C A A M G . . . . . . 21376 CATCAGAAGATAGACCCAACCTGTGAATAAGCCTCAAAAACTTGACCCTGATCTGCTGTA H Q K I D P T C E - A S K T - P - S A V I R R - T Q P V N K P Q K L D P D L L - S S E D R P N L - I S L K N L T L I C C . . . . . . 21316 GTGTGTCCAGTTTCTTCTTCTCTTCTTCACTCAAGTTGTTCTCAGATTCTCCACTGTTCT V C P V S S S L L H S S C S Q I L H C S C V Q F L L L F F T Q V V L R F S T V L S V S S F F F S S S L K L F S D S P L F . . . . . . 21256 GGAGATTGGAATGAGTAAAAAGATTGGGTTGGCTTGGTCGGGGAGCTGGCCTCAGTGACC G D W N E - K D W V G L V G E L A S V T E I G M S K K I G L A W S G S W P Q - P W R L E - V K R L G W L G R G A G L S D . . . . . . 21196 GGAGTGATGACCCAAGACCAGCAGGACGTTCAACAGAGAACAGTCTTGATCCATCTTGAG G V M T Q D Q Q D V Q Q R T V L I H L E E - - P K T S R T F N R E Q S - S I L R R S D D P R P A G R S T E N S L D P S - . . . . . . 21136 ACGTGACTGTTATGTTGCCACCATCAGAATCACCACCTGTTGCTGCTTTTAAAAGAGCAG T - L L C C H H Q N H H L L L L L K E Q R D C Y V A T I R I T T C C C F - K S S D V T V M L P P S E S P P V A A F K R A . . . . . . 21076 CTAAGGCAGCTGAATCAAACAACTCCTTCACATCTCCTTCTTCATCAGTATCAGCCTCCT L R Q L N Q T T P S H L L L H Q Y Q P P - G S - I K Q L L H I S F F I S I S L L A K A A E S N N S F T S P S S S V S A S . . . . . . 21016 CATCTGAGTCGGTGACAATCTGACCATCAATATCCTGAGAAACCTCAGCACCAGCATAGG H L S R - Q S D H Q Y P E K P Q H Q H R I - V G D N L T I N I L R N L S T S I G S S E S V T I - P S I S - E T S A P A - . . . . . . 20956 AGCCACCACCAGATTCCCTTTCCAGCTCCTCCATAAACTGTTTGGCAGCTTCAGAGCTTC S H H Q I P F P A P P - T V W Q L Q S F A T T R F P F Q L L H K L F G S F R A S E P P P D S L S S S S I N C L A A S E L . . . . . . 20896 CAAAAATCATACCATCTGTCTCTCCATCTGAAACTGAGCCTTCAAGGTTAGCTTCTTCAT Q K S Y H L S L H L K L S L Q G - L L H K N H T I C L S I - N - A F K V S F F I P K I I P S V S P S E T E P S R L A S S . . . . . . 20836 CTATATGATCTTTAGCTTCTCCTTCTTGCTCTGAGCCAGTGATTGACCTTGATATAGCAG L Y D L - L L L L A L S Q - L T L I - Q Y M I F S F S F L L - A S D - P - Y S R S I - S L A S P S C S E P V I D L D I A . . . . . . 20776 ATGTTTGACCAGAGACTTCAACTTCCACTACATCTCGCTCGACGCTTTCTCCGGAATGGT M F D Q R L Q L P L H L A R R F L R N G C L T R D F N F H Y I S L D A F S G M V D V - P E T S T S T T S R S T L S P E W . . . . . . 20716 CACCATTGGCATATATAGTTCCAGACACTGCCTGCTCAGGTTCCACTTCAACATCCTTGG H H W H I - F Q T L P A Q V P L Q H P W T I G I Y S S R H C L L R F H F N I L G S P L A Y I V P D T A C S G S T S T S L . 20656 TGTCCAGAC C P D V Q V S R Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-1-_PGL-4_AGS-1_PPS_1 (21770 21351) (frame '0'; 417 bp, 139 residues) 1 ISGGFFLVNF FTEDKTLRLK PCSIAEDLRP GVSNTRILTP STTPLISFTL VVAGPKASID 61 VFSSPKIEFM VALLPTPDLP ITRIFTEKSK SSFPSASSWK AFILAASTLK SGLFCLRAIS 121 VIRYKTCAAM GSSEDRPNL- AGS-2 (22011 23640) SCR (e 0.999) Exon 1 22011 23640 (1630 n); score: 0.999 PGS (22011 23640) SGN-U318403+ 3-phase translation of AGS-2 (+strand): . . . . . . 22011 AATGATGAGTCCAAGTCTGATGAATCCCGTCTCTCTGGTAGAAAATCATCCATCTTGCAG N D E S K S D E S R L S G R K S S I L Q M M S P S L M N P V S L V E N H P S C R - - V Q V - - I P S L W - K I I H L A . . . . . . 22071 GAGGAATAGGGATGGACATAAGATACTACCTAATGGCCAGAGCTGGAGGCCTCAATTACT E E - G W T - D T T - W P E L E A S I T R N R D G H K I L P N G Q S W R P Q L L G G I G M D I R Y Y L M A R A G G L N Y . . . . . . 22131 ACTATTAAGCTACTCAATGAAGATCTTATCTGAAGCAAGTGCACTTTCAAAGCCTGAAGA T I K L L N E D L I - S K C T F K A - R L L S Y S M K I L S E A S A L S K P E D Y Y - A T Q - R S Y L K Q V H F Q S L K . . . . . . 22191 TCCATTTGATCACCGTAAGCTCTTTGGTTTCCGCACACGCTCACCACCTCTTCCCTACAT S I - S P - A L W F P H T L T T S S L H P F D H R K L F G F R T R S P P L P Y M I H L I T V S S L V S A H A H H L F P T . . . . . . 22251 GCTTTCTTCAATGTTGCAGTCACGTGCGCATCCAAAGCTTTCTGCTGAGCAGGGTGGTGA A F F N V A V T C A S K A F C - A G W - L S S M L Q S R A H P K L S A E Q G G D C F L Q C C S H V R I Q S F L L S R V V . . . . . . 22311 CAACGGTGATTCAGACATTGACTTAGATGATTTGTCAGACTCTGACCAAGAAGAAGAAGA Q R - F R H - L R - F V R L - P R R R R N G D S D I D L D D L S D S D Q E E E D T T V I Q T L T - M I C Q T L T K K K K . . . . . . 22371 TGAGTATGACCAGCTTCCTCCCTTCAAGCCTCTTCGGAAGGCTCAGCTTGCTAAGCTCAG - V - P A S S L Q A S S E G S A C - A Q E Y D Q L P P F K P L R K A Q L A K L S M S M T S F L P S S L F G R L S L L S S . . . . . . 22431 CAAAGAACAGAGGAAGGCGTACTTTGAGGAGTATGACTACAGGGTCAAGCTCCTTCAGAA Q R T E E G V L - G V - L Q G Q A P S E K E Q R K A Y F E E Y D Y R V K L L Q K A K N R G R R T L R S M T T G S S S F R . . . . . . 22491 GAAACAGTTGAGAGAAGATTTAAAAAGAATGAAAGAGATGAAAAGTAAGGGAAAAGAGGC E T V E R R F K K N E R D E K - G K R G K Q L R E D L K R M K E M K S K G K E A R N S - E K I - K E - K R - K V R E K R . . . . . . 22551 TGCAATTGACAATGGTTATGCAGAGGAAGAAGCTGATGCAGGTGCAGCAGCTCCCGTAGC C N - Q W L C R G R S - C R C S S S R S A I D N G Y A E E E A D A G A A A P V A L Q L T M V M Q R K K L M Q V Q Q L P - . . . . . . 22611 AGTTCCCCTTCCTGACATGGCCCTTCCACCTTCTTTTGATAGTGATAATCCCGCCTATAG S S P S - H G P S T F F - - - - S R L - V P L P D M A L P P S F D S D N P A Y R Q F P F L T W P F H L L L I V I I P P I . . . . . . 22671 GTACCGCTTCTTGGAGCCCACATCACAGTTCCTTGCAAGGCCTGTTCTGGACACGCATGG V P L L G A H I T V P C K A C S G H A W Y R F L E P T S Q F L A R P V L D T H G G T A S W S P H H S S L Q G L F W T R M . . . . . . 22731 TTGGGATCATGATTGTGGCTATGATGGTGTTAACGTGGAACAAAGTTTAGCCATTGCCAG L G S - L W L - W C - R G T K F S H C Q W D H D C G Y D G V N V E Q S L A I A S V G I M I V A M M V L T W N K V - P L P . . . . . . 22791 TCGTTTCCCTGCTGCAGTTACTGTGCAAATCACCAAAGATAAGAAGGATTTCAGTATCAA S F P C C S Y C A N H Q R - E G F Q Y Q R F P A A V T V Q I T K D K K D F S I N V V S L L Q L L C K S P K I R R I S V S . . . . . . 22851 TTTGGACTCTTCGATTGCTGCTAAGCACGGAGAAAATGGATCAACCATGGCTGGCTTTGA F G L F D C C - A R R K W I N H G W L - L D S S I A A K H G E N G S T M A G F D I W T L R L L L S T E K M D Q P W L A L . . . . . . 22911 TATTCAAAGCATAGGGAAGCAACTTGCCTATATTGTCCGAGGAGAAACCAAATTCAAAAG Y S K H R E A T C L Y C P R R N Q I Q K I Q S I G K Q L A Y I V R G E T K F K S I F K A - G S N L P I L S E E K P N S K . . . . . . 22971 CTTGAAGAAGAACAAGACTGCTTGCGGAATTTCTGTTACATTTCTAGGTGAAAATATGGT L E E E Q D C L R N F C Y I S R - K Y G L K K N K T A C G I S V T F L G E N M V A - R R T R L L A E F L L H F - V K I W . . . . . . 23031 CACTGGACTTAAAGTTGAAGATCAAATCATCTTAGGCAAGCAATACGTTCTAGTTGGCAG H W T - S - R S N H L R Q A I R S S W Q T G L K V E D Q I I L G K Q Y V L V G S S L D L K L K I K S S - A S N T F - L A . . . . . . 23091 TGCTGGCACTGTTCGATCTCAGAGTGACACAGCTTATGGGGCTAACTTTGAACTGCAGAG C W H C S I S E - H S L W G - L - T A E A G T V R S Q S D T A Y G A N F E L Q R V L A L F D L R V T Q L M G L T L N C R . . . . . . 23151 GAGGGAGGCAGATTTCCCAATCGGTCAGGTGCAATCTACATTGTCTATGTCCGTCATAAA E G G R F P N R S G A I Y I V Y V R H K R E A D F P I G Q V Q S T L S M S V I K G G R Q I S Q S V R C N L H C L C P S - . . . . . . 23211 GTGGAGAGGTGATTTGGCTCTAGGTTTCAACAGTATGGCGCAATTCGCTGTGGGACGCAA V E R - F G S R F Q Q Y G A I R C G T Q W R G D L A L G F N S M A Q F A V G R N S G E V I W L - V S T V W R N S L W D A . . . . . . 23271 TTCGAAGGTAGCTGTTCGAGCAGGAATCAATAACAAGCTCAGTGGGCAAGTAACCGTGAG F E G S C S S R N Q - Q A Q W A S N R E S K V A V R A G I N N K L S G Q V T V R I R R - L F E Q E S I T S S V G K - P - . . . . . . 23331 GACAAGCAGTTCAGACCATCTCTCTCTTGCACTTACTGCTATTATTCCAACTGCAATTGG D K Q F R P S L S C T Y C Y Y S N C N W T S S S D H L S L A L T A I I P T A I G G Q A V Q T I S L L H L L L L F Q L Q L . . . . . . 23391 CATCTACAGGAAGCTTTGGCCGGATGCTGGCGAGAAGTACTCAATCTACTAAATTTCATT H L Q E A L A G C W R E V L N L L N F I I Y R K L W P D A G E K Y S I Y - I S F A S T G S F G R M L A R S T Q S T K F H . . . . . . 23451 TCCATATCAGCATTGCATTTTTGGTTCATTAGACCTTACATGATGACATATTGTCTTTGT S I S A L H F W F I R P Y M M T Y C L C P Y Q H C I F G S L D L T - - H I V F V F H I S I A F L V H - T L H D D I L S L . . . . . . 23511 CAGTTCATTGAATAATGCTTCTGTTAATTTCCCATCTATTTAGGATTCCTACTGTTATGA Q F I E - C F C - F P I Y L G F L L L - S S L N N A S V N F P S I - D S Y C Y E S V H - I M L L L I S H L F R I P T V M . . . . . . 23571 GTTATAAGTCAATTTTGAGTGATTGAATGTACTTTTTTGCCAGATGAAATGAAGAGGTTT V I S Q F - V I E C T F L P D E M K R F L - V N F E - L N V L F C Q M K - R G L S Y K S I L S D - M Y F F A R - N E E V . 23631 GTGTTGCTTT V L L C C F C V A Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-1+_PGL-4_AGS-2_PPS_1 (22012 23442) (frame '2'; 1428 bp, 476 residues) 1 MMSPSLMNPV SLVENHPSCR RNRDGHKILP NGQSWRPQLL LLSYSMKILS EASALSKPED 61 PFDHRKLFGF RTRSPPLPYM LSSMLQSRAH PKLSAEQGGD NGDSDIDLDD LSDSDQEEED 121 EYDQLPPFKP LRKAQLAKLS KEQRKAYFEE YDYRVKLLQK KQLREDLKRM KEMKSKGKEA 181 AIDNGYAEEE ADAGAAAPVA VPLPDMALPP SFDSDNPAYR YRFLEPTSQF LARPVLDTHG 241 WDHDCGYDGV NVEQSLAIAS RFPAAVTVQI TKDKKDFSIN LDSSIAAKHG ENGSTMAGFD 301 IQSIGKQLAY IVRGETKFKS LKKNKTACGI SVTFLGENMV TGLKVEDQII LGKQYVLVGS 361 AGTVRSQSDT AYGANFELQR READFPIGQV QSTLSMSVIK WRGDLALGFN SMAQFAVGRN 421 SKVAVRAGIN NKLSGQVTVR TSSSDHLSLA LTAIIPTAIG IYRKLWPDAG EKYSIY- 3-phase translation of AGS-2 (-strand): . . . . . . 23640 AAAGCAACACAAACCTCTTCATTTCATCTGGCAAAAAAGTACATTCAATCACTCAAAATT K A T Q T S S F H L A K K Y I Q S L K I K Q H K P L H F I W Q K S T F N H S K L S N T N L F I S S G K K V H S I T Q N . . . . . . 23580 GACTTATAACTCATAACAGTAGGAATCCTAAATAGATGGGAAATTAACAGAAGCATTATT D L - L I T V G I L N R W E I N R S I I T Y N S - Q - E S - I D G K L T E A L F - L I T H N S R N P K - M G N - Q K H Y . . . . . . 23520 CAATGAACTGACAAAGACAATATGTCATCATGTAAGGTCTAATGAACCAAAAATGCAATG Q - T D K D N M S S C K V - - T K N A M N E L T K T I C H H V R S N E P K M Q C S M N - Q R Q Y V I M - G L M N Q K C N . . . . . . 23460 CTGATATGGAAATGAAATTTAGTAGATTGAGTACTTCTCGCCAGCATCCGGCCAAAGCTT L I W K - N L V D - V L L A S I R P K L - Y G N E I - - I E Y F S P A S G Q S F A D M E M K F S R L S T S R Q H P A K A . . . . . . 23400 CCTGTAGATGCCAATTGCAGTTGGAATAATAGCAGTAAGTGCAAGAGAGAGATGGTCTGA P V D A N C S W N N S S K C K R E M V - L - M P I A V G I I A V S A R E R W S E S C R C Q L Q L E - - Q - V Q E R D G L . . . . . . 23340 ACTGCTTGTCCTCACGGTTACTTGCCCACTGAGCTTGTTATTGATTCCTGCTCGAACAGC T A C P H G Y L P T E L V I D S C S N S L L V L T V T C P L S L L L I P A R T A N C L S S R L L A H - A C Y - F L L E Q . . . . . . 23280 TACCTTCGAATTGCGTCCCACAGCGAATTGCGCCATACTGTTGAAACCTAGAGCCAAATC Y L R I A S H S E L R H T V E T - S Q I T F E L R P T A N C A I L L K P R A K S L P S N C V P Q R I A P Y C - N L E P N . . . . . . 23220 ACCTCTCCACTTTATGACGGACATAGACAATGTAGATTGCACCTGACCGATTGGGAAATC T S P L Y D G H R Q C R L H L T D W E I P L H F M T D I D N V D C T - P I G K S H L S T L - R T - T M - I A P D R L G N . . . . . . 23160 TGCCTCCCTCCTCTGCAGTTCAAAGTTAGCCCCATAAGCTGTGTCACTCTGAGATCGAAC C L P P L Q F K V S P I S C V T L R S N A S L L C S S K L A P - A V S L - D R T L P P S S A V Q S - P H K L C H S E I E . . . . . . 23100 AGTGCCAGCACTGCCAACTAGAACGTATTGCTTGCCTAAGATGATTTGATCTTCAACTTT S A S T A N - N V L L A - D D L I F N F V P A L P T R T Y C L P K M I - S S T L Q C Q H C Q L E R I A C L R - F D L Q L . . . . . . 23040 AAGTCCAGTGACCATATTTTCACCTAGAAATGTAACAGAAATTCCGCAAGCAGTCTTGTT K S S D H I F T - K C N R N S A S S L V S P V T I F S P R N V T E I P Q A V L F - V Q - P Y F H L E M - Q K F R K Q S C . . . . . . 22980 CTTCTTCAAGCTTTTGAATTTGGTTTCTCCTCGGACAATATAGGCAAGTTGCTTCCCTAT L L Q A F E F G F S S D N I G K L L P Y F F K L L N L V S P R T I - A S C F P M S S S S F - I W F L L G Q Y R Q V A S L . . . . . . 22920 GCTTTGAATATCAAAGCCAGCCATGGTTGATCCATTTTCTCCGTGCTTAGCAGCAATCGA A L N I K A S H G - S I F S V L S S N R L - I S K P A M V D P F S P C L A A I E C F E Y Q S Q P W L I H F L R A - Q Q S . . . . . . 22860 AGAGTCCAAATTGATACTGAAATCCTTCTTATCTTTGGTGATTTGCACAGTAACTGCAGC R V Q I D T E I L L I F G D L H S N C S E S K L I L K S F L S L V I C T V T A A K S P N - Y - N P S Y L W - F A Q - L Q . . . . . . 22800 AGGGAAACGACTGGCAATGGCTAAACTTTGTTCCACGTTAACACCATCATAGCCACAATC R E T T G N G - T L F H V N T I I A T I G K R L A M A K L C S T L T P S - P Q S Q G N D W Q W L N F V P R - H H H S H N . . . . . . 22740 ATGATCCCAACCATGCGTGTCCAGAACAGGCCTTGCAAGGAACTGTGATGTGGGCTCCAA M I P T M R V Q N R P C K E L - C G L Q - S Q P C V S R T G L A R N C D V G S K H D P N H A C P E Q A L Q G T V M W A P . . . . . . 22680 GAAGCGGTACCTATAGGCGGGATTATCACTATCAAAAGAAGGTGGAAGGGCCATGTCAGG E A V P I G G I I T I K R R W K G H V R K R Y L - A G L S L S K E G G R A M S G R S G T Y R R D Y H Y Q K K V E G P C Q . . . . . . 22620 AAGGGGAACTGCTACGGGAGCTGCTGCACCTGCATCAGCTTCTTCCTCTGCATAACCATT K G N C Y G S C C T C I S F F L C I T I R G T A T G A A A P A S A S S S A - P L E G E L L R E L L H L H Q L L P L H N H . . . . . . 22560 GTCAATTGCAGCCTCTTTTCCCTTACTTTTCATCTCTTTCATTCTTTTTAAATCTTCTCT V N C S L F S L T F H L F H S F - I F S S I A A S F P L L F I S F I L F K S S L C Q L Q P L F P Y F S S L S F F L N L L . . . . . . 22500 CAACTGTTTCTTCTGAAGGAGCTTGACCCTGTAGTCATACTCCTCAAAGTACGCCTTCCT Q L F L L K E L D P V V I L L K V R L P N C F F - R S L T L - S Y S S K Y A F L S T V S S E G A - P C S H T P Q S T P S . . . . . . 22440 CTGTTCTTTGCTGAGCTTAGCAAGCTGAGCCTTCCGAAGAGGCTTGAAGGGAGGAAGCTG L F F A E L S K L S L P K R L E G R K L C S L L S L A S - A F R R G L K G G S W S V L C - A - Q A E P S E E A - R E E A . . . . . . 22380 GTCATACTCATCTTCTTCTTCTTGGTCAGAGTCTGACAAATCATCTAAGTCAATGTCTGA V I L I F F F L V R V - Q I I - V N V - S Y S S S S S W S E S D K S S K S M S E G H T H L L L L G Q S L T N H L S Q C L . . . . . . 22320 ATCACCGTTGTCACCACCCTGCTCAGCAGAAAGCTTTGGATGCGCACGTGACTGCAACAT I T V V T T L L S R K L W M R T - L Q H S P L S P P C S A E S F G C A R D C N I N H R C H H P A Q Q K A L D A H V T A T . . . . . . 22260 TGAAGAAAGCATGTAGGGAAGAGGTGGTGAGCGTGTGCGGAAACCAAAGAGCTTACGGTG - R K H V G K R W - A C A E T K E L T V E E S M - G R G G E R V R K P K S L R - L K K A C R E E V V S V C G N Q R A Y G . . . . . . 22200 ATCAAATGGATCTTCAGGCTTTGAAAGTGCACTTGCTTCAGATAAGATCTTCATTGAGTA I K W I F R L - K C T C F R - D L H - V S N G S S G F E S A L A S D K I F I E - D Q M D L Q A L K V H L L Q I R S S L S . . . . . . 22140 GCTTAATAGTAGTAATTGAGGCCTCCAGCTCTGGCCATTAGGTAGTATCTTATGTCCATC A - - - - L R P P A L A I R - Y L M S I L N S S N - G L Q L W P L G S I L C P S S L I V V I E A S S S G H - V V S Y V H . . . . . . 22080 CCTATTCCTCCTGCAAGATGGATGATTTTCTACCAGAGAGACGGGATTCATCAGACTTGG P I P P A R W M I F Y Q R D G I H Q T W L F L L Q D G - F S T R E T G F I R L G P Y S S C K M D D F L P E R R D S S D L . 22020 ACTCATCATT T H H L I I D S S Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-1-_PGL-4_AGS-2_PPS_1 (22393 22100) (frame '0'; 291 bp, 97 residues) 1 REEAGHTHLL LLGQSLTNHL SQCLNHRCHH PAQQKALDAH VTATLKKACR EEVVSVCGNQ 61 RAYGDQMDLQ ALKVHLLQIR SSLSSLIVVI EASSSGH- >C09HBa0099P03.1-1-_PGL-4_AGS-2_PPS_2 (22759 22475) (frame '0'; 282 bp, 94 residues) 1 HHHSHNHDPN HACPEQALQG TVMWAPRSGT YRRDYHYQKK VEGPCQEGEL LRELLHLHQL 61 LPLHNHCQLQ PLFPYFSSLS FFLNLLSTVS SEGA- >C09HBa0099P03.1-1-_PGL-4_AGS-2_PPS_3 (23393 23175) (frame '2'; 216 bp, 72 residues) 1 MPIAVGIIAV SARERWSELL VLTVTCPLSL LLIPARTATF ELRPTANCAI LLKPRAKSPL 61 HFMTDIDNVD CT- PGL 5 (+ strand): 24880 33978 AGS-1 (24880 24904,28814 29015,29638 29697,33028 33095) SCR (e 0.800 d 0.996 a 0.000,e 0.802 d 0.000 a 0.528,e 0.550 d 0.986 a 0.000,e 0.735) Exon 1 24880 24904 ( 25 n); score: 0.800 Intron 1 24905 28813 (3909 n); Pd: 0.996 Pa: 0.000 Exon 2 28814 29015 ( 202 n); score: 0.802 Intron 2 29016 29637 ( 622 n); Pd: 0.000 Pa: 0.528 Exon 3 29638 29697 ( 60 n); score: 0.550 Intron 3 29698 33027 (3330 n); Pd: 0.986 Pa: 0.000 Exon 4 33028 33095 ( 68 n); score: 0.735 PGS (24880 24904,28814 29015,29638 29697,33028 33095) SGN-U336521+ PGS (28802 28938) SGN-U330025- 3-phase translation of AGS-1 (+strand): . . . : . . . 24880 TCTTGCCATATATACAAATACATAG : TTATTTCATATTTTCTCCGTTTAAAAAAGAATGAA S C H I Y K Y I : V I S Y F L R L K K N E L A I Y T N T - : L F H I F S V - K R M N L P Y I Q I H S : Y F I F S P F K K E - . . . . . . 28849 CTAGTTTGACTTGGAATGAAGTTTAAGAAAAGAAAGAAGACTTTTTAATCTTGTGGTTCT L V - L G M K F K K R K K T F - S C G S - F D L E - S L R K E R R L F N L V V L T S L T W N E V - E K K E D F L I L W F . . . . . . 28909 AAATTAAAGTTATGTCAAATGTATCAAAATGTTCTTAAATCTTGTGGTCTTAAACATGTC K L K L C Q M Y Q N V L K S C G L K H V N - S Y V K C I K M F L N L V V L N M S - I K V M S N V S K C S - I L W S - T C . . . . . : . 28969 ACGTGAAAAGTTAAAATTAAATTCTTTTTAAAAAAATTAAATAAAAA : TTAAATTTAAAAT T - K V K I K F F L K K L N K N : - I - N R E K L K L N S F - K N - I K : I K F K I H V K S - N - I L F K K I K - K : L N L K . . . . . : . 29651 AATTTGACTCTCGAAAAATGAAACTAGTACATGACACGGTAATCGAG : TTCATCTTATCAT N L T L E K - N - Y M T R - S S : S S Y H I - L S K N E T S T - H G N R : V H L I I - F D S R K M K L V H D T V I E : F I L S . . . . . . 33041 ATCATGAAATTATATTTATATATATTATAATTGTTTAACAATTTTGCCATATAAA I M K L Y L Y I L - L F N N F A I - S - N Y I Y I Y Y N C L T I L P Y K Y H E I I F I Y I I I V - Q F C H I Maximal non-overlapping open reading frames (>= 64 codons): none AGS-2 (28798 28983,29874 29887,33721 33735,33958 33978) SCR (e 0.828 d 0.000 a 0.927,e 0.786 d 0.000 a 0.536,e 0.800 d 0.000 a 0.000,e 0.714) Exon 1 28798 28983 ( 186 n); score: 0.828 Intron 1 28984 29873 ( 890 n); Pd: 0.000 Pa: 0.927 Exon 2 29874 29887 ( 14 n); score: 0.786 Intron 2 29888 33720 (3833 n); Pd: 0.000 Pa: 0.536 Exon 3 33721 33735 ( 15 n); score: 0.800 Intron 3 33736 33957 ( 222 n); Pd: 0.000 Pa: 0.000 Exon 4 33958 33978 ( 21 n); score: 0.714 PGS (28798 28983,29874 29887,33721 33735,33958 33978) SGN-U344324- 3-phase translation of AGS-2 (+strand): . . . . . . 28798 TATATTGTATATTCTCTTATTTCATATTTTCTCCGTTTAAAAAAGAATGAACTAGTTTGA Y I V Y S L I S Y F L R L K K N E L V - I L Y I L L F H I F S V - K R M N - F D Y C I F S Y F I F S P F K K E - T S L . . . . . . 28858 CTTGGAATGAAGTTTAAGAAAAGAAAGAAGACTTTTTAATCTTGTGGTTCTAAATTAAAG L G M K F K K R K K T F - S C G S K L K L E - S L R K E R R L F N L V V L N - S T W N E V - E K K E D F L I L W F - I K . . . . . . 28918 TTATGTCAAATGTATCAAAATGTTCTTAAATCTTGTGGTCTTAAACATGTCACGTGAAAA L C Q M Y Q N V L K S C G L K H V T - K Y V K C I K M F L N L V V L N M S R E K V M S N V S K C S - I L W S - T C H V K . : . : . . : . . 28978 GTTAAA : GTTAAAATATCATC : AAAAACAGGTCAAAT : GAGACAATAAATTTGGAATAA V K : V K I S S : K T G Q M : R Q - I W N L K : L K Y H : Q K Q V K : - D N K F G I S - : S - N I I : K N R S N : E T I N L E - Maximal non-overlapping open reading frames (>= 64 codons): none PGL 6 (- strand): 27135 24950 AGS-1 (27110 26940,26629 26620,26505 26281,26017 25193,25015 24950) SCR (e 0.865 d 1.000 a 0.000,e 0.350 d 0.762 a 0.996,e 0.978 d 0.994 a 0.958,e 0.939 d 0.000 a 0.978,e 0.538) Exon 1 27110 26940 ( 171 n); score: 0.865 Intron 1 26939 26630 ( 310 n); Pd: 1.000 Pa: 0.000 Exon 2 26629 26620 ( 10 n); score: 0.350 Intron 2 26619 26506 ( 114 n); Pd: 0.762 Pa: 0.996 Exon 3 26505 26281 ( 225 n); score: 0.978 Intron 3 26280 26018 ( 263 n); Pd: 0.994 Pa: 0.958 Exon 4 26017 25193 ( 825 n); score: 0.939 Intron 4 25192 25016 ( 177 n); Pd: 0.000 Pa: 0.978 Exon 5 25015 24950 ( 66 n); score: 0.538 PGS (27110 26940,26629 26620,26505 26281,26017 25193,25015 24950) SGN-U321959+ 3-phase translation of AGS-1 (-strand): . . . . . . 27110 ATTACTTAGTTTTTCTCCATCTCTACCATGGCTGACGCAGCTGCAACGCCGCCTACTGAT I T - F F S I S T M A D A A A T P P T D L L S F S P S L P W L T Q L Q R R L L I Y L V F L H L Y H G - R S C N A A Y - . . . . . . 27050 CCAGCAAGCACGGCGCCACCTGCTACGACTGATCCAGCAAGCACGACGCCGCCTGCTAGT P A S T A P P A T T D P A S T T P P A S Q Q A R R H L L R L I Q Q A R R R L L V S S K H G A T C Y D - S S K H D A A C - . . . . . . : 26990 ACTGATCCAGCAAACACGACGCCGCCTACTAGTACTGATCCAGCTGATCCA : GCATTGACA T D P A N T T P P T S T D P A D P : A L T L I Q Q T R R R L L V L I Q L I Q : H - Q Y - S S K H D A A Y - Y - S S - S : S I D . : . . . . . 26620 G : GTAGCAGAAGATGAGAGGAAACTTAAATATCTGGATTTCGTTCAAGTTGCAGCGATCTA : G S R R - E E T - I S G F R S S C S D L : V A E D E R K L K Y L D F V Q V A A I Y R : - Q K M R G N L N I W I S F K L Q R S . . . . . . 26446 TGTGATTGTTTGCTTCTCAACTTTGTATGAATACGGCAAAGAAAACTCCGGTCCGTTGAA C D C L L L N F V - I R Q R K L R S V E V I V C F S T L Y E Y G K E N S G P L K M - L F A S Q L C M N T A K K T P V R - . . . . . . 26386 ACCTGGTGTACAGGCCGTAGAAGCCACTGTTAAAACTGTTATCGGACCGGTTTATGAGAA T W C T G R R S H C - N C Y R T G L - E P G V Q A V E A T V K T V I G P V Y E K N L V Y R P - K P L L K L L S D R F M R . . . . . : . 26326 GTTCCATAACGTTCCTTTCAATCTCCTCAAGTTCATCGACCTAAAG : GTTGCAGACTTGAT V P - R S F Q S P Q V H R P K : G C R L D F H N V P F N L L K F I D L K : V A D L M S S I T F L S I S S S S S T - R : L Q T - . . . . . . 26003 GACAGAAGTTGAAAGCCATGTGCCTTCTCTACTAAAGCAGACATCATCTAAAGCTCTGTT D R S - K P C A F S T K A D I I - S S V T E V E S H V P S L L K Q T S S K A L L - Q K L K A M C L L Y - S R H H L K L C . . . . . . 25943 AATAGCTCAGAAGGCTCCAGAATTAGCTCGAGATCTCGCCGGCGAGGTACAGCACGATGG N S S E G S R I S S R S R R R G T A R W I A Q K A P E L A R D L A G E V Q H D G - - L R R L Q N - L E I S P A R Y S T M . . . . . . 25883 CTTAGTGGACACAGCGAGCAACGTAGCTAAAACACTCTACACAAAGTACGAACCCACAGT L S G H S E Q R S - N T L H K V R T H S L V D T A S N V A K T L Y T K Y E P T V A - W T Q R A T - L K H S T Q S T N P Q . . . . . . 25823 CAAGGAGCTATACACAAAATACGAGCCAGTGATCGAGAAAAATGCGGTTTTGGCATGGAG Q G A I H K I R A S D R E K C G F G M E K E L Y T K Y E P V I E K N A V L A W R S R S Y T Q N T S Q - S R K M R F W H G . . . . . . 25763 ATCTCTGAATAAGCTTCCTTTGTTCCCTCAAGTGGCTCAGATTTTGGTGCCGACGGCTGC I S E - A S F V P S S G S D F G A D G C S L N K L P L F P Q V A Q I L V P T A A D L - I S F L C S L K W L R F W C R R L . . . . . . 25703 TTATTGGAGTGAGAAATACAATCAAGCGGTGACATATGCGTCGGAGAACGGCTATACGGC L L E - E I Q S S G D I C V G E R L Y G Y W S E K Y N Q A V T Y A S E N G Y T A L I G V R N T I K R - H M R R R T A I R . . . . . . 25643 GGCGCATTATTTACCGATTATTCCCGTAGAGAGGATCGCGAAGGTGTTTGAAGGCGGCGC G A L F T D Y S R R E D R E G V - R R R A H Y L P I I P V E R I A K V F E G G A R R I I Y R L F P - R G S R R C L K A A . . . . . . 25583 CACCGCTGAAAACGAGCAGTCCGTTCCCTTGACCGACGGCACTGTTGCTCCGGCGCAATG H R - K R A V R S L D R R H C C S G A M T A E N E Q S V P L T D G T V A P A Q - P P L K T S S P F P - P T A L L L R R N . . . . . . 25523 ACCATTTTACATTACTGTACTGGGGGGGGGGGGGGGGGGGGGCTTAATGGGCCGTCAGGT T I L H Y C T G G G G G G G L N G P S G P F Y I T V L G G G G G G G L M G R Q V D H F T L L Y W G G G G G G A - W A V R . . . . . . 25463 TAATAACTTGGTTTATGTTTGTCGGTAGTAATTGTTCATCTTTATATCGTGAGCCACTTC - - L G L C L S V V I V H L Y I V S H F N N L V Y V C R - - L F I F I S - A T S L I T W F M F V G S N C S S L Y R E P L . . . . . . 25403 TTAAGTTCCTTAATATATTGTATTTAGTAATTATCCTAAATCACCAAATTTAAATTCTTG L S S L I Y C I - - L S - I T K F K F L - V P - Y I V F S N Y P K S P N L N S - L K F L N I L Y L V I I L N H Q I - I L . . . . . . 25343 AACATATCATATTGTATCAACTTTTAATCAAGCAGGTGAAACAAAATACAAGAATAAAGA N I S Y C I N F - S S R - N K I Q E - R T Y H I V S T F N Q A G E T K Y K N K E E H I I L Y Q L L I K Q V K Q N T R I K . . . . . . 25283 GGTTTTAGGGAGTATCGATGGTAAAGAACATCAAAAGAAATGATTTGATTTTGTAACTTT G F R E Y R W - R T S K E M I - F C N F V L G S I D G K E H Q K K - F D F V T F R F - G V S M V K N I K R N D L I L - L . . . . : . . 25223 TCACAAAGTTGAATAAGACGTGCGTTTATTA : AAAAGAGATGAGCGAGACTAGGAGAGTGA S Q S - I R R A F I : K K R - A R L G E - H K V E - D V R L L : K R D E R D - E S D F T K L N K T C V Y - : K E M S E T R R V . . . . 24986 CGAGAGAGGCGAGAGAGGGGAGAAAAGAGTGGGAGAA R E R R E R G E K S G R E R G E R G E K R V G E T R E A R E G R K E W E Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-1-_PGL-6_AGS-1_PPS_1 (26622 26620,26505 26281,26017 25523) (frame '2'; 720 bp, 240 residues) 1 QVAEDERKLK YLDFVQVAAI YVIVCFSTLY EYGKENSGPL KPGVQAVEAT VKTVIGPVYE 61 KFHNVPFNLL KFIDLKVADL MTEVESHVPS LLKQTSSKAL LIAQKAPELA RDLAGEVQHD 121 GLVDTASNVA KTLYTKYEPT VKELYTKYEP VIEKNAVLAW RSLNKLPLFP QVAQILVPTA 181 AYWSEKYNQA VTYASENGYT AAHYLPIIPV ERIAKVFEGG ATAENEQSVP LTDGTVAPAQ 241 - AGS-2 (27135 26940,26505 26281,26017 25628) SCR (e 0.990 d 1.000 a 0.996,e 1.000 d 0.994 a 0.958,e 0.997) Exon 1 27135 26940 ( 196 n); score: 0.990 Intron 1 26939 26506 ( 434 n); Pd: 1.000 Pa: 0.996 Exon 2 26505 26281 ( 225 n); score: 1.000 Intron 2 26280 26018 ( 263 n); Pd: 0.994 Pa: 0.958 Exon 3 26017 25628 ( 390 n); score: 0.997 PGS (27135 26940,26505 26281,26017 25628) SGN-U321960+ 3-phase translation of AGS-2 (-strand): . . . . . . 27135 TTCGAAGCTCTCCTGTACAAACTCCATTACTTAGTTTTTCTCCATCTCTACCATGGCTGA F E A L L Y K L H Y L V F L H L Y H G - S K L S C T N S I T - F F S I S T M A D R S S P V Q T P L L S F S P S L P W L . . . . . . 27075 CGCAGCTGCAACGCCGCCTACTGATCCAGCAAGCACGGCGCCACCTGCTACGACTGATCC R S C N A A Y - S S K H G A T C Y D - S A A A T P P T D P A S T A P P A T T D P T Q L Q R R L L I Q Q A R R H L L R L I . . . . . . 27015 AGCAAGCACGACGCCGCCTGCTAGTACTGATCCAGCAAACACGACGCCGCCTACTAGTAC S K H D A A C - Y - S S K H D A A Y - Y A S T T P P A S T D P A N T T P P T S T Q Q A R R R L L V L I Q Q T R R R L L V . . : . . . . 26955 TGATCCAGCTGATCCA : GTAGCAGAAGATGAGAGGAAACTTAAATATCTGGATTTCGTTCA - S S - S : S S R R - E E T - I S G F R S D P A D P : V A E D E R K L K Y L D F V Q L I Q L I Q : - Q K M R G N L N I W I S F . . . . . . 26461 AGTTGCAGCGATCTATGTGATTGTTTGCTTCTCAACTTTGTATGAATACGGCAAAGAAAA S C S D L C D C L L L N F V - I R Q R K V A A I Y V I V C F S T L Y E Y G K E N K L Q R S M - L F A S Q L C M N T A K K . . . . . . 26401 CTCCGGTCCGTTGAAACCTGGTGTACAGGCCGTAGAAGCCACTGTTAAAACTGTTATCGG L R S V E T W C T G R R S H C - N C Y R S G P L K P G V Q A V E A T V K T V I G T P V R - N L V Y R P - K P L L K L L S . . . . . . 26341 ACCGGTTTATGAGAAGTTCCATAACGTTCCTTTCAATCTCCTCAAGTTCATCGACCTAAA T G L - E V P - R S F Q S P Q V H R P K P V Y E K F H N V P F N L L K F I D L K D R F M R S S I T F L S I S S S S S T - . : . . . . . 26281 G : GTTGCAGACTTGATGACAGAAGTTGAAAGCCATGTGCCTTCTCTACTAAAGCAGACATC : G C R L D D R S - K P C A F S T K A D I : V A D L M T E V E S H V P S L L K Q T S R : L Q T - - Q K L K A M C L L Y - S R H . . . . . . 25958 ATCTAAAGCTCTGTTAATAGCTCAGAAGGCTCCAGAATTAGCTCGAGATCTCGCCGGCGA I - S S V N S S E G S R I S S R S R R R S K A L L I A Q K A P E L A R D L A G E H L K L C - - L R R L Q N - L E I S P A . . . . . . 25898 GGTACAGCACGATGGCTTAGTGGACACAGCGAGCAACGTAGCTAAAACACTCTACACAAA G T A R W L S G H S E Q R S - N T L H K V Q H D G L V D T A S N V A K T L Y T K R Y S T M A - W T Q R A T - L K H S T Q . . . . . . 25838 GTACGAACCCACAGTCAAGGAGCTATACACAAAATACGAGCCAGTGATCGAGAAAAATGC V R T H S Q G A I H K I R A S D R E K C Y E P T V K E L Y T K Y E P V I E K N A S T N P Q S R S Y T Q N T S Q - S R K M . . . . . . 25778 GGTTTTGGCATGGAGATCTCTGAATAAGCTTCCTTTGTTCCCTCAAGTGGCTCAGATTTT G F G M E I S E - A S F V P S S G S D F V L A W R S L N K L P L F P Q V A Q I L R F W H G D L - I S F L C S L K W L R F . . . . . . 25718 GGTGCCGACGGCTGCTTATTGGAGTGAGAAATACAATCAAGCGGTGACATATGCGTCGGA G A D G C L L E - E I Q S S G D I C V G V P T A A Y W S E K Y N Q A V T Y A S E W C R R L L I G V R N T I K R - H M R R . . . . 25658 GAACGGCTATACGGCGGCGCATTATTTACCG E R L Y G G A L F T N G Y T A A H Y L P R T A I R R R I I Y Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-1-_PGL-6_AGS-2_PPS_1 (27101 26940,26505 26281,26017 25628) (frame '2'; 777 bp, 259 residues) 1 FFSISTMADA AATPPTDPAS TAPPATTDPA STTPPASTDP ANTTPPTSTD PADPVAEDER 61 KLKYLDFVQV AAIYVIVCFS TLYEYGKENS GPLKPGVQAV EATVKTVIGP VYEKFHNVPF 121 NLLKFIDLKV ADLMTEVESH VPSLLKQTSS KALLIAQKAP ELARDLAGEV QHDGLVDTAS 181 NVAKTLYTKY EPTVKELYTK YEPVIEKNAV LAWRSLNKLP LFPQVAQILV PTAAYWSEKY 241 NQAVTYASEN GYTAAHYLP PGL 7 (+ strand): 35718 36030 AGS-1 (35718 36030) SCR (e 0.847) Exon 1 35718 36030 ( 313 n); score: 0.847 PGS (35718 36030) SGN-U328267+ PGS (35718 36023) SGN-U328267- 3-phase translation of AGS-1 (+strand): . . . . . . 35718 GGATAATGCACAAGTACCCCTCAACCTATGCCCGAAATTTCAGAAACAAACTTGTACTAT G - C T S T P Q P M P E I S E T N L Y Y D N A Q V P L N L C P K F Q K Q T C T I I M H K Y P S T Y A R N F R N K L V L . . . . . . 35778 ACTAAGGTCCTATTATCCCCTGAACTTATTTTATTAATAATTTTCTACCCCTTTTCGGCT T K V L L S P E L I L L I I F Y P F S A L R S Y Y P L N L F Y - - F S T P F R L Y - G P I I P - T Y F I N N F L P L F G . . . . . . 35838 TACGTGACACTATTTTGTGGGCCCAACGCTGGTTATTTTTTTTTCAAGCTAGTGCCACGT Y V T L F C G P N A G Y F F F K L V P R T - H Y F V G P T L V I F F S S - C H V L R D T I L W A Q R W L F F F Q A S A T . . . . . . 35898 AGGCCAAAAAAGTGTAGAAAATTACTTATAAAATAAGTTCAGGGGGGTCATGGGACCTTG R P K K C R K L L I K - V Q G G H G T L G Q K S V E N Y L - N K F R G V M G P W - A K K V - K I T Y K I S S G G S W D L . . . . . . 35958 GTATAGTATAAGTGTGTCTCTGAGATTTCAGACATAGGTTGAGGGGGTACTTGTGCATTT V - Y K C V S E I S D I G - G G T C A F Y S I S V S L R F Q T - V E G V L V H F G I V - V C L - D F R H R L R G Y L C I . . 36018 TCCTTATTTTTTT S L F F P Y F F F L I F Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-1+_PGL-7_AGS-1_PPS_1 (35724 35933) (frame '1'; 207 bp, 69 residues) 1 CTSTPQPMPE ISETNLYYTK VLLSPELILL IIFYPFSAYV TLFCGPNAGY FFFKLVPRRP 61 KKCRKLLIK- 3-phase translation of AGS-1 (-strand): . . . . . . 36030 AAAAAAATAAGGAAAATGCACAAGTACCCCCTCAACCTATGTCTGAAATCTCAGAGACAC K K I R K M H K Y P L N L C L K S Q R H K K - G K C T S T P S T Y V - N L R D T K N K E N A Q V P P Q P M S E I S E T . . . . . . 35970 ACTTATACTATACCAAGGTCCCATGACCCCCCTGAACTTATTTTATAAGTAATTTTCTAC T Y T I P R S H D P P E L I L - V I F Y L I L Y Q G P M T P L N L F Y K - F S T H L Y Y T K V P - P P - T Y F I S N F L . . . . . . 35910 ACTTTTTTGGCCTACGTGGCACTAGCTTGAAAAAAAAATAACCAGCGTTGGGCCCACAAA T F L A Y V A L A - K K N N Q R W A H K L F W P T W H - L E K K I T S V G P T K H F F G L R G T S L K K K - P A L G P Q . . . . . . 35850 ATAGTGTCACGTAAGCCGAAAAGGGGTAGAAAATTATTAATAAAATAAGTTCAGGGGATA I V S R K P K R G R K L L I K - V Q G I - C H V S R K G V E N Y - - N K F R G - N S V T - A E K G - K I I N K I S S G D . . . . . . 35790 ATAGGACCTTAGTATAGTACAAGTTTGTTTCTGAAATTTCGGGCATAGGTTGAGGGGTAC I G P - Y S T S L F L K F R A - V E G Y - D L S I V Q V C F - N F G H R L R G T N R T L V - Y K F V S E I S G I G - G V . . 35730 TTGTGCATTATCC L C I I C A L S L V H Y Maximal non-overlapping open reading frames (>= 64 codons): none PGL 8 (- strand): 42360 41544 AGS-1 (42360 41544) SCR (e 0.991) Exon 1 42360 41544 ( 817 n); score: 0.991 PGS (42360 41544) SGN-U327561- 3-phase translation of AGS-1 (-strand): . . . . . . 42360 TTCCTCCTGACCCTTCTTCTTCTTCTTCTCCAATTCTACTTCCTTTTACCTTTTGCACTC F L L T L L L L L L Q F Y F L L P F A L S S - P F F F F F S N S T S F Y L L H S P P D P S S S S S P I L L P F T F C T . . . . . . 42300 AAAAAACTCTCTCACTTTCCTAAAAAGTACATTTTGATAATGGCTTCTCAACAAGATGAG K K L S H F P K K Y I L I M A S Q Q D E K N S L T F L K S T F - - W L L N K M S Q K T L S L S - K V H F D N G F S T R - . . . . . . 42240 CTAAAACACAGAAGTACTACGAAATCACAACAAACAGAGCAATACACAAAATCTGCTCAC L K H R S T T K S Q Q T E Q Y T K S A H - N T E V L R N H N K Q S N T Q N L L T A K T Q K Y Y E I T T N R A I H K I C S . . . . . . 42180 GATAAGGATTCAAAATCGAACAAAAACATCAACAGATCAACAAGAAAACAGATCGCTAAA D K D S K S N K N I N R S T R K Q I A K I R I Q N R T K T S T D Q Q E N R S L N R - G F K I E Q K H Q Q I N K K T D R - . . . . . . 42120 CGAGGCGTCAAATCATTGACAATCGCTTTATCAATTCCACTTCTATTAACCCTAATTGAC R G V K S L T I A L S I P L L L T L I D E A S N H - Q S L Y Q F H F Y - P - L T T R R Q I I D N R F I N S T S I N P N - . . . . . . 42060 ATCTCTCTATTCGGATCAAGTTACCAGTACGTTTCAATGGAGAAGCCTTTCTGGTTTCCG I S L F G S S Y Q Y V S M E K P F W F P S L Y S D Q V T S T F Q W R S L S G F R H L S I R I K L P V R F N G E A F L V S . . . . . . 42000 CGTCTATGGGCTTTACATTTAGCCTGTTTAGGTTCTTCTCTTCTAATGGGTCTTTCTGCT R L W A L H L A C L G S S L L M G L S A V Y G L Y I - P V - V L L F - W V F L L A S M G F T F S L F R F F S S N G S F C . . . . . . 41940 TGGCTTGTTTGGGCTGAAGGTGGGTTTCATCGTCAACCTATGGCTATAATTTTGTATTTA W L V W A E G G F H R Q P M A I I L Y L G L F G L K V G F I V N L W L - F C I - L A C L G - R W V S S S T Y G Y N F V F . . . . . . 41880 GCTCAATTAGGGTTGAGTTTGGCTTGGGATCCAGTTGTGTTCAAAGCAGGTGCTACTAGA A Q L G L S L A W D P V V F K A G A T R L N - G - V W L G I Q L C S K Q V L L E S S I R V E F G L G S S C V Q S R C Y - . . . . . . 41820 ATTGGGTTAGTGTTATGTGTGGCTTTGTTTGGAGTGTTGATTGGTTGTTTTAGGGCTTTT I G L V L C V A L F G V L I G C F R A F L G - C Y V W L C L E C - L V V L G L L N W V S V M C G F V W S V D W L F - G F . . . . . . 41760 AAAAATGTGAATCCTATTGCTGGGGATTTGGTTAAACCTTGTTTTGGATGGGCTGTGCTT K N V N P I A G D L V K P C F G W A V L K M - I L L L G I W L N L V L D G L C F - K C E S Y C W G F G - T L F W M G C A . . . . . . 41700 TTGAGTTTAGCAAATCTTAAGCTTGTGTATCATTAGGAAGAAATAAATAATATGCACTTG L S L A N L K L V Y H - E E I N N M H L - V - Q I L S L C I I R K K - I I C T C F E F S K S - A C V S L G R N K - Y A L . . . . . . 41640 TTTTTTGTTTTGTTTTGGCTTGTGAGGCTTATGTACTGTAAATTTACATGTTCTATATTT F F V L F W L V R L M Y C K F T C S I F F L F C F G L - G L C T V N L H V L Y L V F C F V L A C E A Y V L - I Y M F Y I . . . . 41580 ATGTTTTAATTAAATTAAATTTACATGTGTGTTCTAT M F - L N - I Y M C V L C F N - I K F T C V F Y Y V L I K L N L H V C S Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-1-_PGL-8_AGS-1_PPS_1 (42360 41665) (frame '1'; 693 bp, 231 residues) 1 FLLTLLLLLL QFYFLLPFAL KKLSHFPKKY ILIMASQQDE LKHRSTTKSQ QTEQYTKSAH 61 DKDSKSNKNI NRSTRKQIAK RGVKSLTIAL SIPLLLTLID ISLFGSSYQY VSMEKPFWFP 121 RLWALHLACL GSSLLMGLSA WLVWAEGGFH RQPMAIILYL AQLGLSLAWD PVVFKAGATR 181 IGLVLCVALF GVLIGCFRAF KNVNPIAGDL VKPCFGWAVL LSLANLKLVY H- 3-phase translation of AGS-1 (+strand): . . . . . . 41544 ATAGAACACACATGTAAATTTAATTTAATTAAAACATAAATATAGAACATGTAAATTTAC I E H T C K F N L I K T - I - N M - I Y - N T H V N L I - L K H K Y R T C K F T R T H M - I - F N - N I N I E H V N L . . . . . . 41604 AGTACATAAGCCTCACAAGCCAAAACAAAACAAAAAACAAGTGCATATTATTTATTTCTT S T - A S Q A K T K Q K T S A Y Y L F L V H K P H K P K Q N K K Q V H I I Y F F Q Y I S L T S Q N K T K N K C I L F I S . . . . . . 41664 CCTAATGATACACAAGCTTAAGATTTGCTAAACTCAAAAGCACAGCCCATCCAAAACAAG P N D T Q A - D L L N S K A Q P I Q N K L M I H K L K I C - T Q K H S P S K T R S - - Y T S L R F A K L K S T A H P K Q . . . . . . 41724 GTTTAACCAAATCCCCAGCAATAGGATTCACATTTTTAAAAGCCCTAAAACAACCAATCA V - P N P Q Q - D S H F - K P - N N Q S F N Q I P S N R I H I F K S P K T T N Q G L T K S P A I G F T F L K A L K Q P I . . . . . . 41784 ACACTCCAAACAAAGCCACACATAACACTAACCCAATTCTAGTAGCACCTGCTTTGAACA T L Q T K P H I T L T Q F - - H L L - T H S K Q S H T - H - P N S S S T C F E H N T P N K A T H N T N P I L V A P A L N . . . . . . 41844 CAACTGGATCCCAAGCCAAACTCAACCCTAATTGAGCTAAATACAAAATTATAGCCATAG Q L D P K P N S T L I E L N T K L - P - N W I P S Q T Q P - L S - I Q N Y S H R T T G S Q A K L N P N - A K Y K I I A I . . . . . . 41904 GTTGACGATGAAACCCACCTTCAGCCCAAACAAGCCAAGCAGAAAGACCCATTAGAAGAG V D D E T H L Q P K Q A K Q K D P L E E L T M K P T F S P N K P S R K T H - K R G - R - N P P S A Q T S Q A E R P I R R . . . . . . 41964 AAGAACCTAAACAGGCTAAATGTAAAGCCCATAGACGCGGAAACCAGAAAGGCTTCTCCA K N L N R L N V K P I D A E T R K A S P R T - T G - M - S P - T R K P E R L L H E E P K Q A K C K A H R R G N Q K G F S . . . . . . 42024 TTGAAACGTACTGGTAACTTGATCCGAATAGAGAGATGTCAATTAGGGTTAATAGAAGTG L K R T G N L I R I E R C Q L G L I E V - N V L V T - S E - R D V N - G - - K W I E T Y W - L D P N R E M S I R V N R S . . . . . . 42084 GAATTGATAAAGCGATTGTCAATGATTTGACGCCTCGTTTAGCGATCTGTTTTCTTGTTG E L I K R L S M I - R L V - R S V F L L N - - S D C Q - F D A S F S D L F S C - G I D K A I V N D L T P R L A I C F L V . . . . . . 42144 ATCTGTTGATGTTTTTGTTCGATTTTGAATCCTTATCGTGAGCAGATTTTGTGTATTGCT I C - C F C S I L N P Y R E Q I L C I A S V D V F V R F - I L I V S R F C V L L D L L M F L F D F E S L S - A D F V Y C . . . . . . 42204 CTGTTTGTTGTGATTTCGTAGTACTTCTGTGTTTTAGCTCATCTTGTTGAGAAGCCATTA L F V V I S - Y F C V L A H L V E K P L C L L - F R S T S V F - L I L L R S H Y S V C C D F V V L L C F S S S C - E A I . . . . . . 42264 TCAAAATGTACTTTTTAGGAAAGTGAGAGAGTTTTTTGAGTGCAAAAGGTAAAAGGAAGT S K C T F - E S E R V F - V Q K V K G S Q N V L F R K V R E F F E C K R - K E V I K M Y F L G K - E S F L S A K G K R K . . . . 42324 AGAATTGGAGAAGAAGAAGAAGAAGGGTCAGGAGGAA R I G E E E E E G S G G E L E K K K K K G Q E E - N W R R R R R R V R R Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-1+_PGL-8_AGS-1_PPS_1 (41904 42113) (frame '1'; 207 bp, 69 residues) 1 VDDETHLQPK QAKQKDPLEE KNLNRLNVKP IDAETRKASP LKRTGNLIRI ERCQLGLIEV 61 ELIKRLSMI- >C09HBa0099P03.1-1+_PGL-8_AGS-1_PPS_2 (41672 41878) (frame '0'; 204 bp, 68 residues) 1 YTSLRFAKLK STAHPKQGLT KSPAIGFTFL KALKQPINTP NKATHNTNPI LVAPALNTTG 61 SQAKLNPN- ... finished at: Tue Jul 25 01:40:00 2006 ________________________________________________________________________________ Sequence 2: C09HBa0099P03.1-2, from 1 to 80423, both strands analyzed. ... started at: Tue Jul 25 01:40:00 2006 EST library file: /tmp/cxgn-bacpublish-resources-mxQHWy/lycopersicum_combined_unigene_seqs; matching gDNA +strand ... ... found all matches, elapsed seconds = 2 ... matches indexed, elapsed seconds = 2 HitsTableSize = 16 EST library file: /tmp/cxgn-bacpublish-resources-mxQHWy/lycopersicum_combined_unigene_seqs; matching gDNA -strand ... ... found all matches, elapsed seconds = 2 ... matches indexed, elapsed seconds = 2 HitsTableSize = 19 ******************************************************************************** EST sequence 7 +strand 864 n (File: SGN-U346148+) 1 AATAGCTGGA GCTCCCCGCG GTGGCGGCCG CTCTAGAACT AGTGGATCCC CCGGGCTGCA 61 GGTTTTCGCC GAGATCGTCA GTTCTCCTGC TCCGGCAGCC ATGGCCGCCA AAGCAGCTGC 121 TATACGCAAG AGTAATAGAT CGGAGAATTT CATTCAAAAA CTCGTTAAAA ATCCTAAAAT 181 ACCTTTTGCT ATTGCAATAC TCATTGCTGA TGCCATCCTC GTTGCGTTGA TTATCGCTTA 241 CGTTCCATAT ACGAAAATTG ATTGGGATGC TTATATGTCT CAGGTTACTG GTTTTCTCGA 301 AGGAGAGAGG GATTATAGTA ACTTGAAAGG TGACACGGGG CCTCTAGTTT ACCCAGCAGG 361 CTTTCTTTAT ATTTACTCTG CTATACAATA TGTTACTGGA GGTCAAGTCT ATCCTGCTCA 421 GATTCTTTTT GGCTTTCTCT ACGTGCTGGA TCTTGCAATT GTCTTGTTCA TCTACTTGAA 481 GACTGATGTG GTACCTTGGT GGGCTCTCTC CTTGCTTTCT CTGTCGAAAA GAGTTCACTC 541 TATCTTTGTT CTTCGATTAT TTAATGATTG TTTTGCCACT ACTCTCCTCC ATGCTGCATT 601 GGTCTCAATT ATCTGCCAAA AATGGCATCT AGGGTTGGTA ATTTTCAGCG GAGCTGTTTC 661 CATAAAGATG AATGTGCTCC TGTATGCACC ACCTCTGTTG CTCCTCATGG TGAAGGCAAT 721 GGATATTGNT GGGAGTATAT CTGCTTTAGC AGGGGCTGCA TTAGTGCAGA TTCTCATAGG 781 GGCTTCTTTT TATCCTGTCA CATCCAGCTN CATATTTATC AAACGCTTTC CATCTTGGNT 841 CGGGTTTCAT CCACTTCTTG TCTG Predicted gene structure (within gDNA segment 2680 to 8426): Exon 1 3900 4085 ( 186 n); cDNA 63 248 ( 186 n); score: 1.000 Intron 1 4086 4190 ( 105 n); Pd: 1.000 (s: 1.00), Pa: 0.871 (s: 0) Exon 2 4191 4225 ( 35 n); cDNA 249 283 ( 35 n); score: 1.000 Intron 2 4226 4432 ( 207 n); Pd: 0.972 (s: 0), Pa: 1.000 (s: 1.00) Exon 3 4433 4570 ( 138 n); cDNA 284 421 ( 138 n); score: 1.000 Intron 3 4571 5379 ( 809 n); Pd: 0.998 (s: 1.00), Pa: 0.439 (s: 1.00) Exon 4 5380 5448 ( 69 n); cDNA 422 490 ( 69 n); score: 1.000 Intron 4 5449 6472 (1024 n); Pd: 0.843 (s: 1.00), Pa: 0.935 (s: 1.00) Exon 5 6473 6630 ( 158 n); cDNA 491 648 ( 158 n); score: 1.000 Intron 5 6631 6739 ( 109 n); Pd: 0.987 (s: 1.00), Pa: 0.805 (s: 1.00) Exon 6 6740 6806 ( 67 n); cDNA 649 715 ( 67 n); score: 1.000 Intron 6 6807 6907 ( 101 n); Pd: 0.988 (s: 1.00), Pa: 0.998 (s: 0.92) Exon 7 6908 6961 ( 54 n); cDNA 716 769 ( 54 n); score: 0.926 Intron 7 6962 7416 ( 455 n); Pd: 0.949 (s: 0.92), Pa: 0.963 (s: 0.91) Exon 8 7417 7505 ( 89 n); cDNA 770 859 ( 90 n); score: 0.882 Intron 8 7506 8200 ( 695 n); Pd: 0.549 (s: 0.86), Pa: 0.996 (s: 0) Exon 9 8201 8205 ( 5 n); cDNA 860 864 ( 5 n); score: 1.000 MATCH C09HBa0099P03.1-2+ SGN-U346148+ 0.981 801 0.927 C PGS_C09HBa0099P03.1-2+_SGN-U346148+ (3900 4085,4191 4225,4433 4570,5380 5448,6473 6630,6740 6806,6908 6961,7417 7505,8201 8205) Alignment (genomic DNA sequence = upper lines): TTTTCGCCGA GATCGTCAGT TCTCCTGCTC CGGCAGCCAT GGCCGCCAAA GCAGCTGCTA 3959 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTTCGCCGA GATCGTCAGT TCTCCTGCTC CGGCAGCCAT GGCCGCCAAA GCAGCTGCTA 122 TACGCAAGAG TAATAGATCG GAGAATTTCA TTCAAAAACT CGTTAAAAAT CCTAAAATAC 4019 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TACGCAAGAG TAATAGATCG GAGAATTTCA TTCAAAAACT CGTTAAAAAT CCTAAAATAC 182 CTTTTGCTAT TGCAATACTC ATTGCTGATG CCATCCTCGT TGCGTTGATT ATCGCTTACG 4079 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTTTGCTAT TGCAATACTC ATTGCTGATG CCATCCTCGT TGCGTTGATT ATCGCTTACG 242 TTCCATGTAA GTTTCAGTTT TCCACACTTC ATTTCCATTT TTTTTTCTTA TGTGTGTATG 4139 |||||| TTCCAT.... .......... .......... .......... .......... .......... 248 AAATCATAAA ATTCATAAAT TGTGAAATTC CTCAATTTCT CCTTTTTTTA GATACGAAAA 4199 ||||||||| .......... .......... .......... .......... .......... .ATACGAAAA 257 TTGATTGGGA TGCTTATATG TCTCAGGTGA GAGTTCAACA TTTTGATCTG TTTGTTTGTC 4259 |||||||||| |||||||||| |||||| TTGATTGGGA TGCTTATATG TCTCAG.... .......... .......... .......... 283 TTGTTCAGTA ATGTAGCAAC GGTTTGAGTT GAATTCAGTC TGATCAGTAA TTTGGCTTAA 4319 .......... .......... .......... .......... .......... .......... 283 ATGACAAATT AAGGCAAAAT ATTTACCAGT ATTTGACATT ATAGTGTGAT TTATTGATAT 4379 .......... .......... .......... .......... .......... .......... 283 CTTACTTATA TAGTTTTGAT TGATTGATTT TTTATTTTTT TTTGGTTTAA TAGGTTACTG 4439 ||||||| .......... .......... .......... .......... .......... ...GTTACTG 290 GTTTTCTCGA AGGAGAGAGG GATTATAGTA ACTTGAAAGG TGACACGGGG CCTCTAGTTT 4499 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTTTTCTCGA AGGAGAGAGG GATTATAGTA ACTTGAAAGG TGACACGGGG CCTCTAGTTT 350 ACCCAGCAGG CTTTCTTTAT ATTTACTCTG CTATACAATA TGTTACTGGA GGTCAAGTCT 4559 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACCCAGCAGG CTTTCTTTAT ATTTACTCTG CTATACAATA TGTTACTGGA GGTCAAGTCT 410 ATCCTGCTCA GGTAATTTTC CTACTTTCTC CATTTCCGTG AGGTGCCTTT TTTTGCGGTG 4619 |||||||||| | ATCCTGCTCA G......... .......... .......... .......... .......... 421 ATAGCCAGTG AACTTGCGTA AGGAGTTGTT AGATTGATTG TGATTGTCCA CTTTCAAATT 4679 .......... .......... .......... .......... .......... .......... 421 TGATTAGATT TGAAAGAGCA CGTAAATGCA GTTAATTGGT TGTCCCCTTC CAAATTTCAT 4739 .......... .......... .......... .......... .......... .......... 421 AGATTTTGCT ACTGAATTTT AAGTAAGTGA GACAATAGTT GTTATGTGTT AACAAATAAT 4799 .......... .......... .......... .......... .......... .......... 421 AAAATGGAAG CATAGCATCT GTATTGATGA TGTGAACTAG TAAAGTAGGT GTTTTCTTAT 4859 .......... .......... .......... .......... .......... .......... 421 GCAGGCTTTG ATTTCTAAGC AGCTACATTA ACAATCTTAA CTCTTGCTTT GCTTCGACTC 4919 .......... .......... .......... .......... .......... .......... 421 TGTGTCATAA TTTTTGGAAC TCCAGCTAGA TATTCATGCC CGTGAGGCGT GATGCACTTT 4979 .......... .......... .......... .......... .......... .......... 421 TCTTTGAAGA TGCTACTACT CTTGGAAGTA TACCATCTTC TAGAGTGTTT CTTTTCCAAT 5039 .......... .......... .......... .......... .......... .......... 421 AATGACAATC TGAAGTAAAT ATTTTCTTTT CCTCTAACTT AATACTAACT AGAAAAAATT 5099 .......... .......... .......... .......... .......... .......... 421 GCAATGGTAC ACTTTCTATG CTTGTGTGGC TGGTGTAAGC TTAAAGTAGA TTCCAGCATG 5159 .......... .......... .......... .......... .......... .......... 421 AGAAAAAAAA TTAGTATCCT TTTTCTGGGT TTTGCATGTT CTGGAAGGTT CATACATACA 5219 .......... .......... .......... .......... .......... .......... 421 TAAGGTGTAG TCACTTGTTT AGCTAACTAG TTGTGACTTT TGTATAATGT GGGCATATTC 5279 .......... .......... .......... .......... .......... .......... 421 AGCTCGTTCA TTGGATAATC CGTACACTAT TATGTCCTTC GGTATATGGT CACGTTACTT 5339 .......... .......... .......... .......... .......... .......... 421 GCTGCCAGTT GTTGATTTCT ATTTTCTCAT GCCAGTGCAG ATTCTTTTTG GCTTTCTCTA 5399 |||||||||| |||||||||| .......... .......... .......... .......... ATTCTTTTTG GCTTTCTCTA 441 CGTGCTGGAT CTTGCAATTG TCTTGTTCAT CTACTTGAAG ACTGATGTGG TAGGCGTCCT 5459 |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| CGTGCTGGAT CTTGCAATTG TCTTGTTCAT CTACTTGAAG ACTGATGTG. .......... 490 CTTCATCTGA ATTTTATGGA TAACTTTTTT GGAACTGGTA ATTCTGAATT TTATGGATGA 5519 .......... .......... .......... .......... .......... .......... 490 TGTTTTAGGC TAATGTATTC TTGAAAAAGA GTTGTATTGT GACTGCTGTA AGACTGGATC 5579 .......... .......... .......... .......... .......... .......... 490 CAGTGGTGAA ATTTGTTAGT GTTGACTACA ACAACATACC TAGTGTATTA CCACAAAGTG 5639 .......... .......... .......... .......... .......... .......... 490 GGGTTTGGGG AGGGTAGAGT GTTTAGAGCT AAGGTCTGCA TACACTTTCA AAAGATTTGA 5699 .......... .......... .......... .......... .......... .......... 490 GTTTCTAGGT CCCTATATAT AGTTTTATAC ATGGATATAC TTTATCTTCT CACCTTTGCC 5759 .......... .......... .......... .......... .......... .......... 490 ACTATAGACT GAACTCTAGC ATAATCTTAG TGCTCTTATG AATGAAACCT TATCATTTTA 5819 .......... .......... .......... .......... .......... .......... 490 TAAAGAATAA AATTCTAGAC TTCAATTTGA GACCTGGTGA CAAAATGCAC CTTCTTTAAG 5879 .......... .......... .......... .......... .......... .......... 490 CTTACACAGA TGAGCTTTAT TTAAAGGAGG TATCTTGGTA AACGATAAAT CAAAATTGAA 5939 .......... .......... .......... .......... .......... .......... 490 CTTCAACGGG ATTTATGGAG GAAAAATATG ATGAGGCTGA CAAGTTATTA CGTCTCTGGG 5999 .......... .......... .......... .......... .......... .......... 490 AGGTATTTCT AAATAAAGAA GTTAAAGAAC AGTGAATCTT GGGACTGTAG TTAGACCATT 6059 .......... .......... .......... .......... .......... .......... 490 TATCAATATC ATTCAAATCT TATGCTAAGG AGATCAAGAA TTTTTGCTCA AATAATTTCC 6119 .......... .......... .......... .......... .......... .......... 490 ACAAATAATT ATGTGTGGTT CAGAGGAAGT TGTAATGTGT AATTGAGGAC ATCTGTAAAA 6179 .......... .......... .......... .......... .......... .......... 490 ATTTGTCTGA ACATCTCTAA TAAGACGGAG TCTCGACCAA AACTTTTGGG AATTTGTATA 6239 .......... .......... .......... .......... .......... .......... 490 TCTTTTCACA CTGAATCAAG GTTGGCTTTC AACTGTAGCA AGAGGAGAAT GTCCGCAATA 6299 .......... .......... .......... .......... .......... .......... 490 TGTCTTACTT TCTTCTGATA ACCATTTCTC TTTCTGATTT AGCTTTAGCC TTGAAACTTC 6359 .......... .......... .......... .......... .......... .......... 490 TAATAAGTTC TTTCAGTAAA TATTCAGAAT TCAACCACTT TGTACTAAGT AGAAAATCTT 6419 .......... .......... .......... .......... .......... .......... 490 TATTCCTTGG AGTTAAGATA TGCTATGAGT AATTGGAAAC ATATTTTTTC CAGGTACCTT 6479 ||||||| .......... .......... .......... .......... .......... ...GTACCTT 497 GGTGGGCTCT CTCCTTGCTT TCTCTGTCGA AAAGAGTTCA CTCTATCTTT GTTCTTCGAT 6539 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGTGGGCTCT CTCCTTGCTT TCTCTGTCGA AAAGAGTTCA CTCTATCTTT GTTCTTCGAT 557 TATTTAATGA TTGTTTTGCC ACTACTCTCC TCCATGCTGC ATTGGTCTCA ATTATCTGCC 6599 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TATTTAATGA TTGTTTTGCC ACTACTCTCC TCCATGCTGC ATTGGTCTCA ATTATCTGCC 617 AAAAATGGCA TCTAGGGTTG GTAATTTTCA GGTATATTCT TGCTTGCTTA GACATTTTAA 6659 |||||||||| |||||||||| |||||||||| | AAAAATGGCA TCTAGGGTTG GTAATTTTCA G......... .......... .......... 648 TCGTTGGATT AAATTTTTCT CTATGTCATT GATTAAAAAC CATCAGTTAA GATTGATCAA 6719 .......... .......... .......... .......... .......... .......... 648 TGGTCTTCTC TTTCGTACAG CGGAGCTGTT TCCATAAAGA TGAATGTGCT CCTGTATGCA 6779 |||||||||| |||||||||| |||||||||| |||||||||| .......... .......... CGGAGCTGTT TCCATAAAGA TGAATGTGCT CCTGTATGCA 688 CCACCTCTGT TGCTCCTCAT GGTGAAGGTA TACTATTGGG ACAGTATAGG TCATTAATAA 6839 |||||||||| |||||||||| ||||||| CCACCTCTGT TGCTCCTCAT GGTGAAG... .......... .......... .......... 715 TCTCTGTAGG ACGTTGAATT ACATCTCTGA TTAGCTTCTG TGTTATATGG TGGGAACTTG 6899 .......... .......... .......... .......... .......... .......... 715 AAATACAGGC AATGGATATT GTTGGAGTTA TATCTGCTTT AGCAGGGGCT GCATTAGTGC 6959 || |||||||||| | ||| || |||||||||| |||||||||| |||||||||| ........GC AATGGATATT GNTGGGAGTA TATCTGCTTT AGCAGGGGCT GCATTAGTGC 767 AGGTGGGTAA TCTCCAAGTT CCCTTTCTAA TTTCTAATGC AATGATGTTC AGCGGGCGGC 7019 || AG........ .......... .......... .......... .......... .......... 769 ATGATTTTCT TATCTTTACT TTTGTTATTG TGTTCATGAC TGCAATTGTG TTCTTTGTAA 7079 .......... .......... .......... .......... .......... .......... 769 ATGTTGTTAT GGAAAATAGT TTAAGCATGT TTCGTTATTT GGGGAGGGGA GGTTTGTGGT 7139 .......... .......... .......... .......... .......... .......... 769 TGCTCCAGGA TCAAGGCATC GCTTTCTTCG GCGTACTCAT GAACAAAACT TGATGTTTTA 7199 .......... .......... .......... .......... .......... .......... 769 ACTTTTGTGT GATCTATATG TTCCATCTTC AATGCTCTTG GGCATTCTTT TAATTGTTAC 7259 .......... .......... .......... .......... .......... .......... 769 AATATTATTT CTATTCTTTC CTCAGAGATC GACTGATGAG TTTGTGGTTC ATATGCATAG 7319 .......... .......... .......... .......... .......... .......... 769 GTGAGTCGGT TACGAGTATA CTTCTGTTGA ATGGAGCTTC TTCTGATCAA ATTCTTTACC 7379 .......... .......... .......... .......... .......... .......... 769 AATGATCATG GTGATTGGTG ATTGGTGATT GGTGCAGATT CTCATA-GGG CTTCCTTTTA 7438 ||| |||||| ||| |||| ||||| .......... .......... .......... .......ATT CTCATAGGGG CTTCTTTTTA 792 TCCTGTCACA TCCAGCTTCA TATTTATCAA ACGCTTTCAA TCTTGGTCGG GTTTTCATCC 7498 |||||||||| ||||||| || |||||||||| |||||||| | |||||| | | |||||||| TCCTGTCACA TCCAGCTNCA TATTTATCAA ACGCTTTCCA TCTTGGNTCG GGTTTCATCC 852 ACTTCTGGTA CTTTCTTAAA CACAATCTGT GCATAATGGG TACTCTGAGG CACCATATCA 7558 |||||| ACTTCTT... .......... .......... .......... .......... .......... 859 AAAATATCAT GCAACTTTTG AGTCTTTAAC ATTGCCGTCA TCCTTCTTTC TGGATTTATT 7618 .......... .......... .......... .......... .......... .......... 859 TATGTTTCAG TAAAATTGTT TTAATGTGCA AAATGAATAA ACATTATACC CCTGGATATT 7678 .......... .......... .......... .......... .......... .......... 859 TATGAAATCA GCATTAATAT GGTGATAAGA GATTGTGTAT CTATTCACTT GACACACTGT 7738 .......... .......... .......... .......... .......... .......... 859 TATATAATCG TGCATTATCA TTATATGAAA CAACATTATT GTATTTGTCA TGCACTTTTT 7798 .......... .......... .......... .......... .......... .......... 859 ACCTTAAATG TCATCAAACA ATACACTGGT ATTTGGTGGT GTCTGCTAAA TGTAGGAGAT 7858 .......... .......... .......... .......... .......... .......... 859 GCATGTTCTT ACTGCTGGCT GTAGTTTGTT CTCTGTCTCG TCCCTTGGGA GTTCGGGTGG 7918 .......... .......... .......... .......... .......... .......... 859 GTAGTGGGCT CGTTTCTTTC AGTGGACGTA GACAGCAAAG TACTTTTTTT CCCCATATCC 7978 .......... .......... .......... .......... .......... .......... 859 CTTGAATATC ACTCTTACCC TAAGCCTACT CTGATCTTCT AACTAAAGTT CAGTCAAATC 8038 .......... .......... .......... .......... .......... .......... 859 CTTATAAATT TTAAGTTGTA TTTGGTATGC CTTTTAGAAT AATCTTATTC GCTAAGAACT 8098 .......... .......... .......... .......... .......... .......... 859 GTATTTCTAT CAACTGTGTT GGTGCAACAT TATGCTCATG TCATGATGTT GTGCTATCCT 8158 .......... .......... .......... .......... .......... .......... 859 TTACAAAGAA CAATTTATCG TAACTCTCTT CTTTTGTTGC AGGTCTG 8205 ||||| .......... .......... .......... .......... ..GTCTG 864 hqPGS_C09HBa0099P03.1-2+_SGN-U346148+ (3900 4085,4191 4225,4433 4570,5380 5448,6473 6630,6740 6806,6908 6961,7417 7505) ******************************************************************************** EST sequence 13 +strand 1082 n (File: SGN-U328710+) 1 TGGAGTTATA TCTGCTTTAG CAGGGGCTGC ATTAGTGCAG GTGAGTCGGT TACGAGTATA 61 CTTCTGTTGA ATGGAGCTTC TTCTGATCAA ATTCTTTACC AATGATCATG GTGATTGGTG 121 ATTGGTGATT GGTGCAGATT CTCATAGGGC TTCCTTTTAT CCTGTCACAT CCAGCTTCAT 181 ATTTATCAAA CGCTTTCAAT CTTGGTCGGG TTTTCATCCA CTTCTGGTCT GTCAACTTCA 241 AATTTGTTCC TGAAGACATC TTTGTTTCTA AAGCTTGTGC TCTCTCTTTG CTAGTTGCTC 301 ATCTCAGTCT GCTATTGGTG TTTGCTCATT ACAGATGGTG CAGGCATGAA GGAGGACTGT 361 TTGCTGTTGT GCGTTCTAAA ATCATTCAAC TGAAGCTCAG AGTTTCTCAG AGAAATCCTT 421 CCTCAACCAA GAAAGTCCTT CAAGCTGACC ATATTGTGAC GACTATGTTT GTTGGGAATT 481 TCATTGGCAT TATATGTGCC CGATCCCTCC ATTACCAATT TTATTCTTGG TACTTCTATT 541 GCTTACCATA TTTATTGTGG AAAGCACCAT TTCCAACCCT CCTACGTTTA TTCTTGTTCG 601 CAGCTGTAGA GTTTTGCTGG AACGTCTTCC CCTCCAACAC TTGCTCATCA CTTGTCCTCC 661 TCTGTGTCCA TTTGATCATA TTGGCCGGTC TATGGATAAG TTCACCAGAA TATCCGTACG 721 TCGAAGAAAA AACAACTTAT AAATCTACAC CTAAGAAGAA GGCCAGATAA AGCACTTNCT 781 GGTTATGCAT GTGAATGGCA GATAAAGAAA AAACAACTGA TAAAACAAAG TTTTTGTTTT 841 TCTGTTTCTT TTCGTAGTGT TAATGCTTAC AGTTTTGTTA GATGGTATAC AAAACCAGAA 901 AGGTGGTAAC AACCACACGA AGATCATCCA ATGGCATCAA GGATGAATAT TTTCTGGGGG 961 GTTTTCAACA TTTGAGGATT TCATTAGCTC AAATCTGAAT TAGACTTGAA TATTTTAGAA 1021 GGTCAACCTA TTTTATTTAT CTAAATCTGT ATGGTGTACT ATTTATTGTC AAGATTTAAG 1081 GA Predicted gene structure (within gDNA segment 6322 to 10868): Exon 1 6922 6961 ( 40 n); cDNA 1 40 ( 40 n); score: 1.000 Intron 1 6962 7319 ( 358 n); Pd: 0.949 (s: 1.00), Pa: 0.977 (s: 1.00) Exon 2 7320 7505 ( 186 n); cDNA 41 226 ( 186 n); score: 1.000 Intron 2 7506 8200 ( 695 n); Pd: 0.549 (s: 1.00), Pa: 0.996 (s: 1.00) Exon 3 8201 8317 ( 117 n); cDNA 227 343 ( 117 n); score: 0.991 Intron 3 8318 8488 ( 171 n); Pd: 0.996 (s: 1.00), Pa: 0.993 (s: 1.00) Exon 4 8489 8595 ( 107 n); cDNA 344 450 ( 107 n); score: 1.000 Intron 4 8596 8731 ( 136 n); Pd: 0.976 (s: 1.00), Pa: 0.967 (s: 1.00) Exon 5 8732 8810 ( 79 n); cDNA 451 529 ( 79 n); score: 1.000 Intron 5 8811 9623 ( 813 n); Pd: 0.994 (s: 1.00), Pa: 0.776 (s: 1.00) Exon 6 9624 9680 ( 57 n); cDNA 530 586 ( 57 n); score: 1.000 Intron 6 9681 9763 ( 83 n); Pd: 0.996 (s: 1.00), Pa: 0.988 (s: 1.00) Exon 7 9764 10258 ( 495 n); cDNA 587 1082 ( 496 n); score: 0.991 MATCH C09HBa0099P03.1-2+ SGN-U328710+ 0.995 1081 0.999 C PGS_C09HBa0099P03.1-2+_SGN-U328710+ (6922 6961,7320 7505,8201 8317,8489 8595,8732 8810,9624 9680,9764 10258) Alignment (genomic DNA sequence = upper lines): TGGAGTTATA TCTGCTTTAG CAGGGGCTGC ATTAGTGCAG GTGGGTAATC TCCAAGTTCC 6981 |||||||||| |||||||||| |||||||||| |||||||||| TGGAGTTATA TCTGCTTTAG CAGGGGCTGC ATTAGTGCAG .......... .......... 40 CTTTCTAATT TCTAATGCAA TGATGTTCAG CGGGCGGCAT GATTTTCTTA TCTTTACTTT 7041 .......... .......... .......... .......... .......... .......... 40 TGTTATTGTG TTCATGACTG CAATTGTGTT CTTTGTAAAT GTTGTTATGG AAAATAGTTT 7101 .......... .......... .......... .......... .......... .......... 40 AAGCATGTTT CGTTATTTGG GGAGGGGAGG TTTGTGGTTG CTCCAGGATC AAGGCATCGC 7161 .......... .......... .......... .......... .......... .......... 40 TTTCTTCGGC GTACTCATGA ACAAAACTTG ATGTTTTAAC TTTTGTGTGA TCTATATGTT 7221 .......... .......... .......... .......... .......... .......... 40 CCATCTTCAA TGCTCTTGGG CATTCTTTTA ATTGTTACAA TATTATTTCT ATTCTTTCCT 7281 .......... .......... .......... .......... .......... .......... 40 CAGAGATCGA CTGATGAGTT TGTGGTTCAT ATGCATAGGT GAGTCGGTTA CGAGTATACT 7341 || |||||||||| |||||||||| .......... .......... .......... ........GT GAGTCGGTTA CGAGTATACT 62 TCTGTTGAAT GGAGCTTCTT CTGATCAAAT TCTTTACCAA TGATCATGGT GATTGGTGAT 7401 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCTGTTGAAT GGAGCTTCTT CTGATCAAAT TCTTTACCAA TGATCATGGT GATTGGTGAT 122 TGGTGATTGG TGCAGATTCT CATAGGGCTT CCTTTTATCC TGTCACATCC AGCTTCATAT 7461 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGGTGATTGG TGCAGATTCT CATAGGGCTT CCTTTTATCC TGTCACATCC AGCTTCATAT 182 TTATCAAACG CTTTCAATCT TGGTCGGGTT TTCATCCACT TCTGGTACTT TCTTAAACAC 7521 |||||||||| |||||||||| |||||||||| |||||||||| |||| TTATCAAACG CTTTCAATCT TGGTCGGGTT TTCATCCACT TCTG...... .......... 226 AATCTGTGCA TAATGGGTAC TCTGAGGCAC CATATCAAAA ATATCATGCA ACTTTTGAGT 7581 .......... .......... .......... .......... .......... .......... 226 CTTTAACATT GCCGTCATCC TTCTTTCTGG ATTTATTTAT GTTTCAGTAA AATTGTTTTA 7641 .......... .......... .......... .......... .......... .......... 226 ATGTGCAAAA TGAATAAACA TTATACCCCT GGATATTTAT GAAATCAGCA TTAATATGGT 7701 .......... .......... .......... .......... .......... .......... 226 GATAAGAGAT TGTGTATCTA TTCACTTGAC ACACTGTTAT ATAATCGTGC ATTATCATTA 7761 .......... .......... .......... .......... .......... .......... 226 TATGAAACAA CATTATTGTA TTTGTCATGC ACTTTTTACC TTAAATGTCA TCAAACAATA 7821 .......... .......... .......... .......... .......... .......... 226 CACTGGTATT TGGTGGTGTC TGCTAAATGT AGGAGATGCA TGTTCTTACT GCTGGCTGTA 7881 .......... .......... .......... .......... .......... .......... 226 GTTTGTTCTC TGTCTCGTCC CTTGGGAGTT CGGGTGGGTA GTGGGCTCGT TTCTTTCAGT 7941 .......... .......... .......... .......... .......... .......... 226 GGACGTAGAC AGCAAAGTAC TTTTTTTCCC CATATCCCTT GAATATCACT CTTACCCTAA 8001 .......... .......... .......... .......... .......... .......... 226 GCCTACTCTG ATCTTCTAAC TAAAGTTCAG TCAAATCCTT ATAAATTTTA AGTTGTATTT 8061 .......... .......... .......... .......... .......... .......... 226 GGTATGCCTT TTAGAATAAT CTTATTCGCT AAGAACTGTA TTTCTATCAA CTGTGTTGGT 8121 .......... .......... .......... .......... .......... .......... 226 GCAACATTAT GCTCATGTCA TGATGTTGTG CTATCCTTTA CAAAGAACAA TTTATCGTAA 8181 .......... .......... .......... .......... .......... .......... 226 CTCTCTTCTT TTGTTGCAGG TCTGTCAACT TCAAATTTGT TCCTGAAGAC ATCTTTGTTT 8241 | |||||||||| |||||||||| |||||||||| |||||||||| .......... .........G TCTGTCAACT TCAAATTTGT TCCTGAAGAC ATCTTTGTTT 267 CTAAAGCTTT TGCTCTCTCT TTGCTAGTTG CTCATCTCAG TCTGCTATTG GTGTTTGCTC 8301 ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTAAAGCTTG TGCTCTCTCT TTGCTAGTTG CTCATCTCAG TCTGCTATTG GTGTTTGCTC 327 ATTACAGATG GTGCAGGTGA GTATTTCTCA TATTATCATT ACATCTAGTG TAGTGGCGTT 8361 |||||||||| |||||| ATTACAGATG GTGCAG.... .......... .......... .......... .......... 343 GATATTCCAG TACTGAAGTA GGAAAAGAAA AGTCATAATA TGATAACTAT ATCATTTTCT 8421 .......... .......... .......... .......... .......... .......... 343 TCTCTACATA CAAAGGCAAT AAGTTAGCGG CTACTGGACT GCATTTTTCC TTTTTCATCT 8481 .......... .......... .......... .......... .......... .......... 343 TTTGCAGGCA TGAAGGAGGA CTGTTTGCTG TTGTGCGTTC TAAAATCATT CAACTGAAGC 8541 ||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| .......GCA TGAAGGAGGA CTGTTTGCTG TTGTGCGTTC TAAAATCATT CAACTGAAGC 396 TCAGAGTTTC TCAGAGAAAT CCTTCCTCAA CCAAGAAAGT CCTTCAAGCT GACCGTAAGC 8601 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||| TCAGAGTTTC TCAGAGAAAT CCTTCCTCAA CCAAGAAAGT CCTTCAAGCT GACC...... 450 AACATGCACA ATTTATTTTG TGATTTTTCT ATACTCAAAA AGGCTGTCAA GTGTTATACT 8661 .......... .......... .......... .......... .......... .......... 450 GTCACTAGTT TTGTCTTGAT TTGACATGAT CTTTGGTTGC TGAATTGGTC TGTGTCTATT 8721 .......... .......... .......... .......... .......... .......... 450 TCATTCTCAG ATATTGTGAC GACTATGTTT GTTGGGAATT TCATTGGCAT TATATGTGCC 8781 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| .......... ATATTGTGAC GACTATGTTT GTTGGGAATT TCATTGGCAT TATATGTGCC 500 CGATCCCTCC ATTACCAATT TTATTCTTGG TAAGTTTTTG CAGCTTATTT CTTTCTTTAG 8841 |||||||||| |||||||||| ||||||||| CGATCCCTCC ATTACCAATT TTATTCTTG. .......... .......... .......... 529 GTCATGGATT CGATCTCATA CTCCTCACCT CTTAACACAA ATGTCTTGAT GTGTTATTAA 8901 .......... .......... .......... .......... .......... .......... 529 GGGTTAGCTG AACTAATTTT CAGAGTGAAT TAGAAGAAGA TTTTAATTGT CCACATTAAT 8961 .......... .......... .......... .......... .......... .......... 529 TTAAATATGC AACTTTCCAT ATATTTAAAC ATTTGAAAAG CTTTAGTAAA TGATAATTTT 9021 .......... .......... .......... .......... .......... .......... 529 GCTTTGATTC TTGGAATAGT TAACAATAAT ATTCAGCTTG AGATAGATGC ATGCTTTTAT 9081 .......... .......... .......... .......... .......... .......... 529 GATCTTATGT TTGTATTGAG CTCAGTTCTG GTGGTAACTG CTCATAAAAG CACAACATAT 9141 .......... .......... .......... .......... .......... .......... 529 ATTTCAAGAG GACATAATTG AAATGTTTAA TCAGAAATCA GAATGATAAA CTGTTTTATT 9201 .......... .......... .......... .......... .......... .......... 529 CTAACTTAGT AATTGTGCCA CACTAAAATA AATAATGCGT TTCAGACAGA CATAGTAATA 9261 .......... .......... .......... .......... .......... .......... 529 AATAGAGATG CTTCAGCATC TCATTGAAGA ACAGAGAATT TTGGGGTGCA TTTTGTTAGT 9321 .......... .......... .......... .......... .......... .......... 529 AGTTCACGGG ACTGCAGCAT GTACTTAAGG GAGAAGTGTA TTTAAATATA AAATGCTCAT 9381 .......... .......... .......... .......... .......... .......... 529 TGAGTAGTTG ACCACCTCTC TGTGTGTAAC TTTTTCCCCG GTGAGTCGGT GAAGATCACA 9441 .......... .......... .......... .......... .......... .......... 529 AAAAATCAGC TCCTCTCCCC TGCTAGCATG GCACCAAACA AGACATAAAG GCCTTGACTA 9501 .......... .......... .......... .......... .......... .......... 529 ACATTTGCCA AGCAAATGGA AATTTGGAAG ATTACTCGCT CTTTCACTGC GTTCCTGGGT 9561 .......... .......... .......... .......... .......... .......... 529 TCTGTGTCAA AGTTTTAATT GAATTAAGAT AAAACCATTG ATGTCCTAGA GCTTGTTTGC 9621 .......... .......... .......... .......... .......... .......... 529 AGGTACTTCT ATTGCTTACC ATATTTATTG TGGAAAGCAC CATTTCCAAC CCTCCTACGG 9681 |||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| ..GTACTTCT ATTGCTTACC ATATTTATTG TGGAAAGCAC CATTTCCAAC CCTCCTACG. 586 TATACAACTT AATACTCAAA CTTCTCTGTT TGCTTTAGTT TTTTGGTATT TTCTGTTGAC 9741 .......... .......... .......... .......... .......... .......... 586 AAGTTTTCGT GTTGTACTGC AGTTTATTCT TGTTCGCAGC TGTAGAGTTT TGCTGGAACG 9801 |||||||| |||||||||| |||||||||| |||||||||| .......... .......... ..TTTATTCT TGTTCGCAGC TGTAGAGTTT TGCTGGAACG 624 TCTTCCCCTC CAACACTTGC TCATCACTTG TCCTCCTCTG TGTCCATTTG ATCATATTGG 9861 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCTTCCCCTC CAACACTTGC TCATCACTTG TCCTCCTCTG TGTCCATTTG ATCATATTGG 684 CCGGTCTATG GATAAGTTCA CCAGAATATC CGTACGTCGA AGAAAAAACA ACTTATAAAT 9921 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCGGTCTATG GATAAGTTCA CCAGAATATC CGTACGTCGA AGAAAAAACA ACTTATAAAT 744 CTACACCTAA GAAGAAGGCC AGATAAAGCA CTATCTGGTT ATGCATGTGA ATGGCAGATA 9981 |||||||||| |||||||||| |||||||||| || |||||| |||||||||| |||||||||| CTACACCTAA GAAGAAGGCC AGATAAAGCA CTTNCTGGTT ATGCATGTGA ATGGCAGATA 804 AAGAAAAAAC AACTGATAAA ACAAAGTTTT TGTTTTTCTG TTTCTTTTCG TAGTGTTAAT 10041 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAGAAAAAAC AACTGATAAA ACAAAGTTTT TGTTTTTCTG TTTCTTTTCG TAGTGTTAAT 864 GCTTACAGTT TTGTTAGATG GTATACAAAA CCAGAAAGGT GGTAACAACC ACACGAAGAT 10101 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCTTACAGTT TTGTTAGATG GTATACAAAA CCAGAAAGGT GGTAACAACC ACACGAAGAT 924 CATCCAATGG CATCAAGGAT GAATATTTTC T-GGGGGTTT TCAACATTTG AGGATTTCAT 10160 |||||||||| |||||||||| |||||||||| | |||||||| |||||||||| |||||||||| CATCCAATGG CATCAAGGAT GAATATTTTC TGGGGGGTTT TCAACATTTG AGGATTTCAT 984 TAGCTCAAAT CTGAATTAGA CTTGAATATT TTAGAAGGTC AACCTATTTT ATTTATCTAA 10220 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TAGCTCAAAT CTGAATTAGA CTTGAATATT TTAGAAGGTC AACCTATTTT ATTTATCTAA 1044 ATCTGTATGG TGTACTATTT ATTGTCAAGA TTTAAGGA 10258 |||||||||| |||||||||| |||||||||| |||||||| ATCTGTATGG TGTACTATTT ATTGTCAAGA TTTAAGGA 1082 hqPGS_C09HBa0099P03.1-2+_SGN-U328710+ (6922 6961,7320 7505,8201 8317,8489 8595,8732 8810,9624 9680,9764 10258) ******************************************************************************** EST sequence 17 +strand 663 n (File: SGN-U323147+) 1 CAAACGATGC CATTTTTTAA ACAAGATATT GTTTTCTGAT TTCAAAGTAG AGAGGACAAG 61 AAAGCCAAAT ATGTTTTGTT ACAACAAACC GACTTTATAG CCCTTTTTCC ACCGATCAGC 121 ATTGAAAAAT TTATCAACAA TGGCTGCGAA TTCCTTTTGT TCCATTTTCA TCATCTCTTC 181 ATTATTGATC GCAGCTTTGA TCATCTCCGG CGATGCTACC GGCGGCGATT TCGACGTGAG 241 CGGTTGGATT CCGATGAAAT CCGCCGATAG CTGTGAAGGT TCGATAGCGG AGTGTATGGC 301 TGCCGGAGAA TTCGAAATGG ATTCGGAGAG CAACAGGCGT ATATTAGCAA CTACTGATTA 361 TATAAGCTAT GGTGCGCTGC AGAGTAACAG TGTTCCGTGT TCTAGAAGAG GTGCGTCGTA 421 TTATAACTGC AAAACAGGTG CTGAAGCTAA TCCGTATACA CGTGGTTGCA GTGCTATTAC 481 TCGTTGCCGG AGTTAAATTA ATTAAAGATC GAATTAATCG ATGTTAATTA ATTATTAGTA 541 AGTGTAATTG TTTTGAATAA TTTCGTAGTG TTTATATTGT ATACTTTAAG TAGGAGTATT 601 TTTCTTTTCA GTTGCAATTT CAAATAAAGT GACAGTGGTG CTTTGGCAGT GGGTTATGGT 661 TAA Predicted gene structure (within gDNA segment 13147 to 10195): Exon 1 11457 10904 ( 554 n); cDNA 110 663 ( 554 n); score: 0.998 MATCH C09HBa0099P03.1-2- SGN-U323147+ 0.998 554 0.836 C PGS_C09HBa0099P03.1-2-_SGN-U323147+ (11457 10904) Alignment (genomic DNA sequence = upper lines): CACCGATCAG CATTGAAAAA TTTATCAACA ATGGCTGCGA ATTCCTTTTG TTCCATTTTC 11398 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CACCGATCAG CATTGAAAAA TTTATCAACA ATGGCTGCGA ATTCCTTTTG TTCCATTTTC 169 ATCATCTCTT CATTATTGAT CGCAGCTTTG ATCATCTCCG GCGATGCTAC CGGCGGCGAT 11338 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATCATCTCTT CATTATTGAT CGCAGCTTTG ATCATCTCCG GCGATGCTAC CGGCGGCGAT 229 TTCGACGTGA GCGGTTGGAT TCCGATGAAA TCCGCCGATA GCTGTGAAGG TTCGATAGCG 11278 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCGACGTGA GCGGTTGGAT TCCGATGAAA TCCGCCGATA GCTGTGAAGG TTCGATAGCG 289 GAGTGTATGG CTGCCGGAGA ATTCGAAATG GATTCGGAGA GCAACAGGCG TATATTAGCA 11218 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAGTGTATGG CTGCCGGAGA ATTCGAAATG GATTCGGAGA GCAACAGGCG TATATTAGCA 349 ACTACTGATT ATATAAGCTA TGGTGCGCTG CAGAGTAACA GTGTTCCGTG TTCTAGAAGA 11158 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACTACTGATT ATATAAGCTA TGGTGCGCTG CAGAGTAACA GTGTTCCGTG TTCTAGAAGA 409 GGTGCGTCGT ATTATAACTG CAAAACAGGT GCTGAAGCTA ATCCGTATAC ACGTGGTTGC 11098 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGTGCGTCGT ATTATAACTG CAAAACAGGT GCTGAAGCTA ATCCGTATAC ACGTGGTTGC 469 AGTGCTATTA CTCGTTGCCG GAGTTAAATT AATTAAAGAT CGAATTAATC GATGTTAATT 11038 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGTGCTATTA CTCGTTGCCG GAGTTAAATT AATTAAAGAT CGAATTAATC GATGTTAATT 529 AATTATTAGT AAGTGTAATT GTTTTGAATA ATTTCGTAGT GTTTATATTG TATACTTTAA 10978 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AATTATTAGT AAGTGTAATT GTTTTGAATA ATTTCGTAGT GTTTATATTG TATACTTTAA 589 GTAGGAGTAT TTTTCTTTTC AGTTGCAATT TCAAATAAAG TGACAGTGGT GCTTTGGCAG 10918 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTAGGAGTAT TTTTCTTTTC AGTTGCAATT TCAAATAAAG TGACAGTGGT GCTTTGGCAG 649 TGGTTTATGG TTAA 10904 ||| |||||| |||| TGGGTTATGG TTAA 663 hqPGS_C09HBa0099P03.1-2-_SGN-U323147+ (11457 10904) ******************************************************************************** EST sequence 20 -strand 756 n (File: SGN-U326156-) 1 ATATATTATA GATCCAAGAG ACATGTCTTC CTAACTTTAT ACGACTGATA GAAGTACCAA 61 ACGGACTTGA TAGTGAAAAT CATAAATTGA TGAAAGGAGC CACCTCCCTA CTGAGGCACC 121 ACAATGTAAA ACAAGCACAA GTCATACTAG GGCTGTGTAT CGGCCGGTTC GGTTCGATTT 181 TAAAGTTTAT CGGGTTGGCT TATTAGTTAT CGGTTTGTAG TGATGCTAAA CCATTATAGA 241 ACCATTAAGA TATTGGCTTA TCGGTTATTG GTTTATCGAT TTTTGATCGT TATCTGTTCG 301 GTTATCGGTT TAACCGTTAA GATTTGACCC AAAAGAAAAA ATATTGAAAA TCACGTAGAA 361 ACAAGGTGAC AAACCAAATA AACCATGCAC TTGAGTTCAC AAGTTACATC TTGCTCAAAA 421 GCAAGCACTT TTACATCGTA GAATAATCAA GTGTTTGAGA CAACCAAAAA TAAAAGTAGG 481 AAATTAAACT CCAAGTCGAG AACTTTATAT ACAAAAATGG TATAAATATA AATATTTAAT 541 TTACTATCGG GTTATCGGTT AATCCGTTAA AAAAAAAAAC TTTAAACCGT TAAGAATCGA 601 TAACCCGATA ACAAAAAAAA ATCAAAACCA TTATCAAAAT CACTAAACCA ATAACCCAAT 661 ACTATAAACC AATAACTTTT TTATCGATTC AACTTATCGA TTTTGATTCG ATTTTAAACA 721 GCCTTACTTA ATAGNACTTT TTTTTGGTTT ATTTTG Predicted gene structure (within gDNA segment 9840 to 15636): Exon 1 13807 14395 ( 589 n); cDNA 148 751 ( 604 n); score: 0.818 MATCH C09HBa0099P03.1-2+ SGN-U326156- 0.818 589 0.779 C PGS_C09HBa0099P03.1-2+_SGN-U326156- (13807 14395) Alignment (genomic DNA sequence = upper lines): TAGGGCTATG TATCGATTGG TTCGATTTGA TTTTAAAGTC TATCGAATTG GCTTATTGAT 13866 ||||||| || ||||| || |||| || || ||||||||| ||||| ||| ||||||| | TAGGGCTGTG TATCGGCCGG TTCGGTTCGA TTTTAAAGTT TATCGGGTTG GCTTATTAGT 207 TATCAGTTTG TAGAGATGCT AAACCGTGAT AGAACCATTA AGATATTGAG TTATCAG-T- 13924 |||| ||||| ||| |||||| ||||| | || |||||||||| |||||||| ||||| | | TATCGGTTTG TAGTGATGCT AAACCATTAT AGAACCATTA AGATATTGGC TTATCGGTTA 267 TT--TTTATC G----TT-AT CG------GT TCGGCTATCG ATTTAATCGT TAAGATTTGA 13971 || |||||| | || || || || |||| ||||| ||||| ||| |||||||||| TTGGTTTATC GATTTTTGAT CGTTATCTGT TCGGTTATCG GTTTAACCGT TAAGATTTGA 327 CACAAACAAA AAAATATTAA AAATCACTTA GAAACAAGGT GACAAACCAA ATAAACCATG 14031 | |||| || |||||||| | ||||||| || |||||||||| |||||||||| |||||||||| CCCAAAAGAA AAAATATTGA AAATCACGTA GAAACAAGGT GACAAACCAA ATAAACCATG 387 TACTTGAGTT CACAAGTTAC ATCTCGCTCA AAAGCAAACA CTTTCACATT GTAGAATAAT 14091 ||||||||| |||||||||| |||| ||||| ||||||| || |||| |||| |||||||||| CACTTGAGTT CACAAGTTAC ATCTTGCTCA AAAGCAAGCA CTTTTACATC GTAGAATAAT 447 CAAGTGTTTG AGACAATTAA AAATAAAAGT AGGAAATTAA ACTCTAAGTC GAGAACTTTA 14151 |||||||||| |||||| || |||||||||| |||||||||| |||| ||||| |||||||||| CAAGTGTTTG AGACAACCAA AAATAAAAGT AGGAAATTAA ACTCCAAGTC GAGAACTTTA 507 TATAC-AAAA TGGTATAAAT ATAATTATTT AATTTACTAT CGAGTTATCG ATTAACCCGT 14210 ||||| |||| |||||||||| |||| ||||| |||||||||| || ||||||| |||| |||| TATACAAAAA TGGTATAAAT ATAAATATTT AATTTACTAT CGGGTTATCG GTTAATCCGT 567 T--AAGAAAA AACTTTAAAC CGTTAAGAAC CGATAACCCG ATAACAAAAA AAAATCAAAA 14268 | || |||| |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| TAAAAAAAAA AACTTTAAAC CGTTAAGAAT CGATAACCCG ATAACAAAAA AAAATCAAAA 627 CCGTTATCAA AACCACTAAA CCAATAACCC AATACTATAA ATCAATAACT TTTTTATCAG 14328 || ||||||| || ||||||| |||||||||| |||||||||| | |||||||| |||||||| CCATTATCAA AATCACTAAA CCAATAACCC AATACTATAA ACCAATAACT TTTTTATCGA 687 TTCGACTTAT CGGTTTCAAT TCAATTTTGA ACGGCCCTAG TAGTATAGTA CTTTTTATAT 14388 ||| |||||| || ||| || || ||||| | || ||| || | |||| | |||||| | | TTCAACTTAT CGATTTTGAT TCGATTTTAA ACAGCCTTAC T-TAATAGNA CTTTTT-T-T 744 AGGCTTA 14395 || ||| TGGTTTA 751 hqPGS_C09HBa0099P03.1-2+_SGN-U326156- (13807 14395) ******************************************************************************** EST sequence 18 -strand 1140 n (File: SGN-U324134-) 1 TTTTTTTTTT AAACTCTAAG TCGAAAACTT TATATACAAA ATGGTATAAA TATTATTATT 61 TAATTTACTA TCAGGTTATC GGTTAACCCA TTAAGAAAAA ACTTTAAACC GTTAAGAACC 121 GATACCCGAT AACAAAAAAA ATCAAAACCG ACATCAAAAC CACTAAACCA ATAACCCAAT 181 GATATAAACC AATAACTTTT TTATCGGTTC GGCTTATCGG TTTAGATTCG GTTTTGAACA 241 GCCCTACTCA AGTATATAAC CCTCAGAGAT AGGATGAGCT CTGTGAAAGA AACTTGAAAA 301 ACTCCTCTTT CCTTATTTCA AAGTTGTTCT TTGCATTGGT CAGCAATGTT TGCTCTCTCT 361 CTAAATCTTC CACACATCTT ACCCTTTCTC ACTTCAGCTC CTTCATCTGG ATCTCACTTC 421 CAAAATAGCT TTCACGTAGA TACAACCTTC CAGAACTCTT TGGAAATTTG ATTTGTACTT 481 CCCTTTTGCC TAGGGAAAAT ATGTCAGGAC TCTTTATGAT CCCACCATCC AATGATATGG 541 CTCCATCAAT TATTGGACTT ATATGTTCAA TAAATAAAAA AGCTCAACAA CGAAGCAAAT 601 AAGTACAATA ATACCTTTAC ACAACATTTA GTAAATTACT ATATTACCCC CGCCTAGAAA 661 AGTTGGAACG TTGACGAATA ATTGCTTTCC TAGCATCTTA TATATAGTTA TTAAAGTAAT 721 ATATATATAT ATATATATAT ATATATATAT ATATATATAT ATATAAAAGT TAAAAATGGT 781 TGAAAGTTTG AAATGGGCCA TTTCAATTAT AAATTTGAAA TGTATTTCTT TTTTGGGCAG 841 AGAAAGGAAA TATATATAAT TTTAGCTAAA TTTTATATTT AAGAAATTTA TAAATAATAT 901 AAATTCATAA TTTTATATTT ATGAATAAAA ATTTTAAATC ATAAATTTTA TATTTTAAAT 961 TAATCTATAA AAAATGCACA AGTTAACTAG GCATATGGGA TGATAACGAA TTTCACATTA 1021 TCAATGGAAT ATACTTTTTT CAAAAATTAA AATAAATAAA TCTGTTTAGA ATGCAAACAT 1081 GCCAAGCAGC TGGTTTAAAA TAGGCATGGT TCATGATGCA CATAACATTG TTTTTTGATT Predicted gene structure (within gDNA segment 13438 to 24276): Exon 1 14128 14367 ( 240 n); cDNA 9 246 ( 238 n); score: 0.917 Intron 1 14368 20799 (6432 n); Pd: 0.000 (s: 0.86), Pa: 0.933 (s: 0.56) Exon 2 20800 20850 ( 51 n); cDNA 247 293 ( 47 n); score: 0.569 MATCH C09HBa0099P03.1-2+ SGN-U324134- 0.856 291 0.255 C PGS_C09HBa0099P03.1-2+_SGN-U324134- (14128 14367,20800 20850) Alignment (genomic DNA sequence = upper lines): TTAAACTCTA AGTCGAGAAC TTTATATACA AAATGGTATA AATATAATTA TTTAATTTAC 14187 |||||||||| |||||| ||| |||||||||| |||||||||| ||||| |||| |||||||||| TTAAACTCTA AGTCGAAAAC TTTATATACA AAATGGTATA AATATTATTA TTTAATTTAC 68 TATCGAGTTA TCGATTAACC CGTTAAGAAA AAACTTTAAA CCGTTAAGAA CCGATAACCC 14247 |||| |||| ||| |||||| | |||||||| |||||||||| |||||||||| ||||| |||| TATCAGGTTA TCGGTTAACC CATTAAGAAA AAACTTTAAA CCGTTAAGAA CCGAT-ACCC 127 GATAACAAAA AAAAATCAAA ACCGTTATCA AAACCACTAA ACCAATAACC CAATACTATA 14307 |||||| ||| |||||||||| |||| |||| |||||||||| |||||||||| |||| |||| GATAAC-AAA AAAAATCAAA ACCGACATCA AAACCACTAA ACCAATAACC CAATGATATA 186 AATCAATAAC TTTTTTATCA GTTCGACTTA TCGGTTTCAA TTCAATTTTG AACGGCCCTA 14367 || ||||||| ||||||||| ||||| |||| ||||||| | ||| ||||| ||| |||||| AACCAATAAC TTTTTTATCG GTTCGGCTTA TCGGTTTAGA TTCGGTTTTG AACAGCCCTA 246 GTAGTATAGT ACTTTTTATA TAGGCTTAAA ATATGTCAAA TGTTACATAA GCAAAAATAA 14427 .......... .......... .......... .......... .......... .......... 246 GAAATTAACA CCATGGTTGC TCTTTAGAAT AAAGTCCCGG GAGAAATACT AGAAATAAAA 14487 .......... .......... .......... .......... .......... .......... 246 TATGTCTAGG TGCCTTGTAC AATAGTAAAT CATTACATTT ATTCTCGAGA TTTTAAAATA 14547 .......... .......... .......... .......... .......... .......... 246 TTCATAATCT AATACTAAAA ATTACACAAA TATTTGAGTA CTGAATTTTT TTTTTTAAAT 14607 .......... .......... .......... .......... .......... .......... 246 AAATGCATCA AGCAGAGTGA CCGTGCAGGC AAAGGAAGGA CTCAAATTTT AAAAGGAATA 14667 .......... .......... .......... .......... .......... .......... 246 GAATAATGCA CAATCATCTT GGTCTCTTCG ACCAAGACAG TTCAACCAAG ACGGTTCGAC 14727 .......... .......... .......... .......... .......... .......... 246 AAAATTTAGT AATTTTGATT TCAAGCAATT TATTAATACG TATAGATTAT TAATTTAGAA 14787 .......... .......... .......... .......... .......... .......... 246 TTCGATAACA CTCAAAAAAG ACTAAAATTT AATATACATG AACTTCAAAT CATGGTCTGC 14847 .......... .......... .......... .......... .......... .......... 246 CTTCACCTTA TACTGTAGTT TAATATGAAA AAGTCTTCCT GGCAAAGTTA TTGGTATTTT 14907 .......... .......... .......... .......... .......... .......... 246 TTTATTGGAG TAGGTTCAGA ATATCTTTCT TGTTGGAACC ATTCCAGCTT ATATAGTTTC 14967 .......... .......... .......... .......... .......... .......... 246 TTATTTTCTC GTTTTTTAAT AAAAAAGACT TAGGTTAATA TTAAAACAGA TCGATGTACT 15027 .......... .......... .......... .......... .......... .......... 246 AACAAATTAT AAACTAAAAC TCAGTTTGAA CGTCTGCAAA TAAATAAATA ATAATAATAT 15087 .......... .......... .......... .......... .......... .......... 246 TTCTCAAAGA AGTCAACTAC ATAGCAGAAA TTACTTCCTA GCCCATTAGA TATATTTTCT 15147 .......... .......... .......... .......... .......... .......... 246 CTTGTTTTTT CCTTCTTTGT CCAGTTGATT GTCCCAACCT TCAATAAGGT AACAAAAACC 15207 .......... .......... .......... .......... .......... .......... 246 AAGACTTCAA GCACAGGTGG ATTCACTTGT ATGCACTTGC GAAGCCAGAA ATTTCACTAA 15267 .......... .......... .......... .......... .......... .......... 246 GTATATTCAA GGTTTAATGT GTGTTTTTGT GTGAGTGAGA TTGATAATTT GACCTATATA 15327 .......... .......... .......... .......... .......... .......... 246 TGCATGCAAT ATAGTTTTCC GCCAAAGGTT GTTCAACAAA CCAACCTTCA ATGATAAAAG 15387 .......... .......... .......... .......... .......... .......... 246 TGCTAACTAA AGCTAAATAT CAGATCCAGT GTCATAAAAA GTGTCATATA GTGATAATTC 15447 .......... .......... .......... .......... .......... .......... 246 GTATATATTG AATTAAAATC ATGGATCCAC CTCTAGCTTC AGCAGTAGAT GAATAACTAT 15507 .......... .......... .......... .......... .......... .......... 246 GAGTTCCATT GTTTGGATTT CTATTTCCTA TGGGATTTCA TATCCTTTGG ATATTCATCT 15567 .......... .......... .......... .......... .......... .......... 246 GGCAAATATC AATATCAATT TATAGAATTA TAGTACAGTA TATGTTCGTG AAAAAATGGA 15627 .......... .......... .......... .......... .......... .......... 246 AACGTAAAAC AGACAAGCTC AACAGGCATC AATTAGTGCC TTTTATATTT CAAGTAAGGA 15687 .......... .......... .......... .......... .......... .......... 246 GACTGACCGA AAATACACCA GGTAGTGAAA AGATTTTAGC ACTCAGCAAG AGGAACTTGT 15747 .......... .......... .......... .......... .......... .......... 246 ATTATCTTTC TAGCTTGCTT AATATTGCTT CCATTGGAGG CAGTTCCTTT AAAATTTGAG 15807 .......... .......... .......... .......... .......... .......... 246 CCCAGTCAGG CATCCCCTGC AGAGTTGGTC GATATTAAAA TCGAAGAATT AACCCGTATG 15867 .......... .......... .......... .......... .......... .......... 246 AGTAGTAAAT TATACTTTTA TGGATTCAGT CATCTATGTT ATGTCATGAT CAATTTATGT 15927 .......... .......... .......... .......... .......... .......... 246 GCTGATTATC TATGCATATA AGTATATAAC TGAGTCAGGA TTAATGGACC AAAGAAGAGA 15987 .......... .......... .......... .......... .......... .......... 246 ATTGCAGCAA CAAGTACTGA CCTTTCCTGA TCGTCGAACA CGACCATCTA GACGAAGTAT 16047 .......... .......... .......... .......... .......... .......... 246 AATGTAGACA TCTTCACCAG GAGTAACACC TTCAGACTTC TGTTGATCCC TTATCCACCT 16107 .......... .......... .......... .......... .......... .......... 246 GAAATAGATT TGATGTCAGA CAATAGATCT GGTACAAGGG AAAACAATTC AGTCGATCAA 16167 .......... .......... .......... .......... .......... .......... 246 ACTGTGTAAC GGGATGTACA ATAAATTAAA GTGGCACAAA CATAAATGCA CATGACCATG 16227 .......... .......... .......... .......... .......... .......... 246 ACGGGATGAG TAGTTTCAAG AGTATTTGGA ACAGGTAGAT ATACCTTTCC CACTCTGCAG 16287 .......... .......... .......... .......... .......... .......... 246 GTGAAACGAC CTCCGCCTTG AACCGGATCT CTGACTTCAG TTTTGATTTT GCAATTATTG 16347 .......... .......... .......... .......... .......... .......... 246 ACTGAGTTCT TTGGTCAAAA TCTTCCTACA GATTGGACAA AAATAAGAAC CTGAATGAGA 16407 .......... .......... .......... .......... .......... .......... 246 TAAATGCACC ATTAAGATTA TCGTCTTATG CTTGTTGAGC AACAAGAAAC GACTTTCAAA 16467 .......... .......... .......... .......... .......... .......... 246 AATCCTGTAG CATGTAAAAA AAATATATAT TTTGCACTTT GAGACTTGAT GTGCATGCAA 16527 .......... .......... .......... .......... .......... .......... 246 TAGCATCATG TTGTTTAAAA TGATCATTAA GTGCTACTAA AAATCTTCAA TCTTACCCCT 16587 .......... .......... .......... .......... .......... .......... 246 ATGGATGGGA GGGAAGCGAC TGCCTTTTGC GAAACACCAA AACCTTTCTT CTCTATTTTT 16647 .......... .......... .......... .......... .......... .......... 246 GTTTCCCTAC CTTCACCCCA AATAACAGGA ACTAGAAGGA CCCCGCGTTT AAGAAGCTCA 16707 .......... .......... .......... .......... .......... .......... 246 GTGCGGAATT TCTCAGCATT TCTCATAGCC AGAGAGACTG TCTCCTTTTT TCCAGCCAAA 16767 .......... .......... .......... .......... .......... .......... 246 ATAACCTAAC AAGCAAATTG GTCAAACTTC AAATACAAGG AGAGAATATG TTCAAGTCCG 16827 .......... .......... .......... .......... .......... .......... 246 GAATTTTTAA TAAGGAAAAA GAGAGACAGA GAGAACAAAT ACGTAATGGG ATAGCTTTTT 16887 .......... .......... .......... .......... .......... .......... 246 GCTCATGAAA CAAAGACATT TGTTGCAAAC CTAGCAAAAT ATGTTACTGA CTTTTTCTCT 16947 .......... .......... .......... .......... .......... .......... 246 TTAATTATCC TCCGACAATG TAAATGTGAT GGAAATGGAT TCTTCAAACG AAGGAGGTAA 17007 .......... .......... .......... .......... .......... .......... 246 ACAATATCTT TTGATCTGTT AAGCAGGATA CATGGCAACG AGAACTAGAC TAGTGACAGG 17067 .......... .......... .......... .......... .......... .......... 246 ACATCTCATA CAGGACATCT CATTCTTCAA GGTAAAATTG AAGGACTCCG GACGTTTTCC 17127 .......... .......... .......... .......... .......... .......... 246 ACTCAACTTC TAGAAATTGA CAGGAAGAGT AAAGGGGAAA TTGTCTCACT AACAGGCCTC 17187 .......... .......... .......... .......... .......... .......... 246 ACGGTGTCTC TTAGCTGGAC GAGTTCAACA ATCCTATCAG TTGAAAGGCG CAAAGGCAGC 17247 .......... .......... .......... .......... .......... .......... 246 CTTGATAGTG TTTCATCACG CAATATTTGT GCAAGCTGTT CCTCCTCTTT TTTGTTGTCC 17307 .......... .......... .......... .......... .......... .......... 246 CAAAACAACA AAGCCACAAA AACAGCAATA CCTGCAGAGT GCTGGTTCAA GAGGTTAATC 17367 .......... .......... .......... .......... .......... .......... 246 ATTTTTGAGA AACAGTTCTT TTCATTTAAA AGCGCAAGTT AATATAGCCA GCAAAAATAA 17427 .......... .......... .......... .......... .......... .......... 246 TTTCTAGAAC TTTAGAAGTG AACTAAATGA TTCGTAGCAA AGGTAGTCCT TATTTACCAG 17487 .......... .......... .......... .......... .......... .......... 246 CGACATTTAT AGCAGCATTT CCTGCAGTCG CCCCTAAATC AGGAGCACCA TCTCCACCTT 17547 .......... .......... .......... .......... .......... .......... 246 GAATTGCACG GATTAACCTG GGTATAGTGA AAAATGTAGA AATTCCAGCA GCAGCAATGA 17607 .......... .......... .......... .......... .......... .......... 246 ATGCCACATA AAAGAATCTT CTGACCCCCC TAAACGGTGC TTGTACCTCA CTGATGAGCT 17667 .......... .......... .......... .......... .......... .......... 246 TAAGGTCTCT ACGGAAACTG TAACCTATGT CTTCCCCTCC TAACCTAGCC TGTGAGGTAA 17727 .......... .......... .......... .......... .......... .......... 246 ATAGAAGACT ATGAGATTCA GAAACAATGA AGCAAAGCTA GAGCATTACG AAGTTCTGGA 17787 .......... .......... .......... .......... .......... .......... 246 GATACTGTAA CATAGTCAAT GTGGTAAGGA GCTGAATTAA ATTCAGACAT CCAAGCATGA 17847 .......... .......... .......... .......... .......... .......... 246 CGAAAACTAA AGAACTATAT CAACAGCTGG AAACAAACCT CTTCCTGTAG TTCCTTAAAT 17907 .......... .......... .......... .......... .......... .......... 246 TCAGGCAATG CTCTGAAGGA AGCCAAATCT GGATCATTTA AGATTGTGCC AAACTTAAGG 17967 .......... .......... .......... .......... .......... .......... 246 TTATACTCTT TCAAAGCAGT GCGTAAACAC TCAGCAGCTT TCTTTCCTTC CCCCCTTGAG 18027 .......... .......... .......... .......... .......... .......... 246 TTAATAAAGA AAAATAAATT ATTAAACAAA TAGAGAAATA ATAAAAGAGT GAAGAAGAAT 18087 .......... .......... .......... .......... .......... .......... 246 AAATAAGTAT AGTAGACGCT CATTGAAGAT TCAACATTGG TTTTAGTGAT CCTTCCAATG 18147 .......... .......... .......... .......... .......... .......... 246 AAACAATTAA GCTAAGAACT GTGATTAAGA AACTACAGGT GCCTCAACTT ACAGTTCACA 18207 .......... .......... .......... .......... .......... .......... 246 AAGCTAAAAC TGTGATTAAG AAATTTTAAA TTCCTTTACT TGCATTTTAC ACGAGAAGAA 18267 .......... .......... .......... .......... .......... .......... 246 AGTCGAAAAT ATGTTTTTTT TCTCAATGTC TCACATGATT TCAAACAAAT AACCGTTTAT 18327 .......... .......... .......... .......... .......... .......... 246 AAGATCCAGC TTAATCAATT CAACCATGAA AGAGGCAAAG GAGAGATATA CCTGTAGGCA 18387 .......... .......... .......... .......... .......... .......... 246 TGGCAACATG CTTTGTTATA AAATGCAGCT TGAGCCTCCT CAGGGTTGGG ATTTAAAGTA 18447 .......... .......... .......... .......... .......... .......... 246 AGTGCTGTGT CGAATAGCAC AAGAGCATCT TTCACCTGGA AAGATCAAAT TTAGTGAACA 18507 .......... .......... .......... .......... .......... .......... 246 ATAAGGTGCA TGTCGCCCAT ATTGCAGATT GAAATTGTAA CCAACTGCAA GAGGTTTGTA 18567 .......... .......... .......... .......... .......... .......... 246 CACAAACCAA AATGCAGATT ATTGGAAACA TAATTAAGTG AGAAATAACC AAAAAAAAAA 18627 .......... .......... .......... .......... .......... .......... 246 AAGCCAACTG AATAGATTTT AAATAGGTAA TTTCACAAAA GACAAATTGT TGATTAAAAG 18687 .......... .......... .......... .......... .......... .......... 246 CAAAAAGAAG ATTTGATCAA TGACATTAAC AAATTGATAG CTTTTCCAGT AATAATTGTG 18747 .......... .......... .......... .......... .......... .......... 246 ATTCAACCTA CTAGATACTA TCCTTAAGCT TGCTATGCAG GATCAAATAG AGATACACAA 18807 .......... .......... .......... .......... .......... .......... 246 GAAACTATAA CTCAGACATT CAAGCTTCTG ATGTGCACAA GATTTAACAG AAAAACTTGT 18867 .......... .......... .......... .......... .......... .......... 246 TGAGTAGGAA AATCTTTGAA TCATCAATCT TTTGGTACAC TTTCAGTTTT AGACAGAAAT 18927 .......... .......... .......... .......... .......... .......... 246 TTACTTTCCA ATTATACTCC TTCCTAGTCC ATCTATCTCA ATTCATGTGA CACATGTTCT 18987 .......... .......... .......... .......... .......... .......... 246 TGTTTGATGT CAAATTTACA ATCTTAGCAT TATTATAAAG TTCCAGTCAT ACGCATTTTG 19047 .......... .......... .......... .......... .......... .......... 246 AAATCTAGTT GTTGTTACAG ATAAAATCTG AAAGTACCTT CCTATTTTTC AAGTCACTGT 19107 .......... .......... .......... .......... .......... .......... 246 CAAAAGTCCA CTCACGACCA ATTTTCAATA AAGCATGATT AGGGTGGGTG TTAAGTGTAG 19167 .......... .......... .......... .......... .......... .......... 246 GAGCAGAGCC ACCCCTTCAG CCATTTGGTT CAACAAAACC CTGTAGCTTT GGCTCAAACC 19227 .......... .......... .......... .......... .......... .......... 246 TCGTAGTTAT GTTTCAAAAA GTTCATTTGC TATGTGTAAA ATGATATATT CAAAACTCAT 19287 .......... .......... .......... .......... .......... .......... 246 AAACTAAAAA TCCTAGATAC ACCTCTGAGT ACAACTGTTA CCATAAGATT TACTCAAAAC 19347 .......... .......... .......... .......... .......... .......... 246 AGGAAAATTT TGAATAGACA TGTCAAACTA GTAAACTTGC TCAAGTTGCA GATTCTTGGA 19407 .......... .......... .......... .......... .......... .......... 246 GAACACTTGC CCTAACCAAA CTCTCAAAGC CCCCTATGAC TACATTGTGA ATGTTGATTA 19467 .......... .......... .......... .......... .......... .......... 246 AGTTAGAAGG AACGAGCCTC AATTTATAGA GTACTAAACT TTTTTCTACA AGAAAATGAC 19527 .......... .......... .......... .......... .......... .......... 246 TACTCAAAAA CGGAAACTTT ATAACCTATT TATGCTAAGA AACTCAGGTC AAATACCCAA 19587 .......... .......... .......... .......... .......... .......... 246 CAACAAACTC TTGGAGAACA CTTGAGGAAA ATTAGGTAAC AAATAGAAGG GAATAATATG 19647 .......... .......... .......... .......... .......... .......... 246 ACTGCAAATG ATCAAATGAT GGGGATATTA AAACTCGAAA TCAAAACCTC TATTGTCATT 19707 .......... .......... .......... .......... .......... .......... 246 TATCAAATTA AATTGCATAA ATTAAAACGA ATCCCTTTTA TTTACTTACT TAACTTATTT 19767 .......... .......... .......... .......... .......... .......... 246 GATATCTACA ACTAGTGGCG TAGCTACAAT TGAAGTATAT CTTTGTTAAA AAAATAAATA 19827 .......... .......... .......... .......... .......... .......... 246 TATATAAATA AACAAAAATT TTATGGTCAC GAACCTCAAT CGACACACTC TCTGTCTAAA 19887 .......... .......... .......... .......... .......... .......... 246 TCAACAAAAA TAACAATACA ACAACAATAC TGAGGAGTTT CATAAAGCAT AATGTATATG 19947 .......... .......... .......... .......... .......... .......... 246 CAGACCTTAT CATTGTTTAT AATAGACCTG AAAAACAATG CAATCACATA ATAATTAAGG 20007 .......... .......... .......... .......... .......... .......... 246 AATTTCTTCT GCCATGAAAT TGAAACAGAA TTAAGATAAT ACCCGTCCTT TAGAGAAGAG 20067 .......... .......... .......... .......... .......... .......... 246 GTCGAGGCCG GCATTGACAC AGGACTCAGC AGTGGGAGTG GGTTCACTGA CTTGGGGCGA 20127 .......... .......... .......... .......... .......... .......... 246 CGACGATGAT GGCGGAGGGG GCGAAGTAGA AGAAGAACAA ATGACGAGTG AAATACGATT 20187 .......... .......... .......... .......... .......... .......... 246 CTTCGGGGGA AAATTTAGGC ATAATTGTGT AGTGATTGAG AATTTGGAAG ATGATGAAGA 20247 .......... .......... .......... .......... .......... .......... 246 GAAGAAGCAG AAATGGAGGT GATGAGAGGA GGGAAGAGTT GTAATGGCCA TGTCAAGATT 20307 .......... .......... .......... .......... .......... .......... 246 TATCCATTCG CACCACACCT TACACTTCTT GTGGTTTTCA TAAAAGGCTT TAATTCTAAA 20367 .......... .......... .......... .......... .......... .......... 246 ATTTGGTTTT ATTAAGGGAT TATTATTACT TCCTCACTCT CAAAATATTT GTAATGTTTA 20427 .......... .......... .......... .......... .......... .......... 246 ATCATTTTTA AAAATATTAA TTCAAAAATG TTTGATTATT TCCTTAACAT TTCTTCTATA 20487 .......... .......... .......... .......... .......... .......... 246 GGTGTTTATT TTATACATAA AAAAAAATTC GTTTAATTTT ATTTCTTCAA TTTAAACTTT 20547 .......... .......... .......... .......... .......... .......... 246 GCATACTTCA AAAAAAAGAG AAATGATATG TTATTTTATC ATAATACACA TATTAATTGG 20607 .......... .......... .......... .......... .......... .......... 246 GGTTTAATAT TAAATATTGA AATATGATTT GAAGAATAAG TAATTAATGT TAAGAATAAA 20667 .......... .......... .......... .......... .......... .......... 246 ATGATTTTTT CTTGGTTTAT ACAAAGTGTC AAAGGAAAAG TGAAATTTTT CTTATAAAAT 20727 .......... .......... .......... .......... .......... .......... 246 GATTATGGTT TAGGCTTAAT TAGAGATTTA TAGTTATAGT TCGCTTAATT ATAAATTATA 20787 .......... .......... .......... .......... .......... .......... 246 GATACGTATT AGGGGGAGGA GAGATGCGAG CGAGATCAGA AATTGAGCTG TTCAAAATGA 20847 || | | | | ||||| || | |||||| | ||| || .......... ..CTCAAGTA TATAACCCTC AGAGAT-AG- GA-TGAGCTC TGTGAAA-GA 290 AAC 20850 ||| AAC 293 hqPGS_C09HBa0099P03.1-2+_SGN-U324134- (14128 14367) ******************************************************************************** EST sequence 12 +strand 930 n (File: SGN-U337965+) 1 GGGTTGTATA CGACTATTAT ATGGGCGAAT TGGGTACCGG GCCCCCCCTC GAGTTTTTTT 61 TTTTTTTTTT TTATTTTTTT AGTTGAAAAC TTTATATACA AAATGGTATA AATATTATTA 121 TTTAATTTAC TATCAGGTTA TCGGTTAACC CATTAAGAAA AAACTTTAAA CCGTTAAGAA 181 CCGATACCCG ATAACAAAAA AAATCAAAAC CGACATCAAA ACCACTAAAC CAATAACCCA 241 ATGATATAAA CCAATAACTT TTTTATCGGT TTGGCTTATC GGTTTAGATT CGGGTTTGAA 301 CAGCCCTACT CAAGTATATA ACCCTCAGAG ATAGGATGAG CTCTGTGAAA GAAACTTGAA 361 AAACTCCTCT TTCCTTATTT CAAAGTTGGT CTTTGCATTG GGCAGCAATG GTTGCTCTCT 421 TTTTTAATTT TCCACACATC TTACCCTTTC TTACTTTAGC TCCTTTATTT GGATCTCACT 481 TCCAAAATAG CTTTCACGTA GATACCACCT TTCAGAACTC TTTGGAAATT TGATTTGTAC 541 TTTCCTTTTT TCTAGGGGAA ATATGTTAGG ACTCTTTATG ATTCCACCCA TTCCATGATA 601 TGGGCTCCCT CAATTATTGG GCTTTATTAA TGGTTTAATA AAAAAAAAGC TCTACAACGA 661 AGCAAATTAG TACCATAATT CCTTTAACAC ACATTTTAGT AAATTACTAT ATTACCCCTG 721 CCTAGAAAAG TTGGAACGTT GACAAATAAT TGCTTTCCTA GCATCTTATT ATAGTTATTA 781 AGTAATATTT ATATATATAT ATATATATAT ATATATATAT TTATATATTT AAAAGGTAAA 841 ATGGGAGGAA GGTTTGAAAG GGCCCTTTCT ATTAAAATTT TAAGTGTTTT GTTTTTTTGG 901 CAATGTGGGT ATAACTTGGA TTTTGCTTAG Predicted gene structure (within gDNA segment 12675 to 21566): Exon 1 14138 14367 ( 230 n); cDNA 81 308 ( 228 n); score: 0.900 Intron 1 14368 20799 (6432 n); Pd: 0.000 (s: 0.82), Pa: 0.933 (s: 0.56) Exon 2 20800 20850 ( 51 n); cDNA 309 355 ( 47 n); score: 0.569 MATCH C09HBa0099P03.1-2+ SGN-U337965+ 0.840 281 0.302 C PGS_C09HBa0099P03.1-2+_SGN-U337965+ (14138 14367,20800 20850) Alignment (genomic DNA sequence = upper lines): AGTCGAGAAC TTTATATACA AAATGGTATA AATATAATTA TTTAATTTAC TATCGAGTTA 14197 ||| || ||| |||||||||| |||||||||| ||||| |||| |||||||||| |||| |||| AGTTGAAAAC TTTATATACA AAATGGTATA AATATTATTA TTTAATTTAC TATCAGGTTA 140 TCGATTAACC CGTTAAGAAA AAACTTTAAA CCGTTAAGAA CCGATAACCC GATAACAAAA 14257 ||| |||||| | |||||||| |||||||||| |||||||||| ||||| |||| |||||| ||| TCGGTTAACC CATTAAGAAA AAACTTTAAA CCGTTAAGAA CCGAT-ACCC GATAAC-AAA 198 AAAAATCAAA ACCGTTATCA AAACCACTAA ACCAATAACC CAATACTATA AATCAATAAC 14317 |||||||||| |||| |||| |||||||||| |||||||||| |||| |||| || ||||||| AAAAATCAAA ACCGACATCA AAACCACTAA ACCAATAACC CAATGATATA AACCAATAAC 258 TTTTTTATCA GTTCGACTTA TCGGTTTCAA TTCAATTTTG AACGGCCCTA GTAGTATAGT 14377 ||||||||| ||| | |||| ||||||| | ||| |||| ||| |||||| TTTTTTATCG GTTTGGCTTA TCGGTTTAGA TTCGGGTTTG AACAGCCCTA .......... 308 ACTTTTTATA TAGGCTTAAA ATATGTCAAA TGTTACATAA GCAAAAATAA GAAATTAACA 14437 .......... .......... .......... .......... .......... .......... 308 CCATGGTTGC TCTTTAGAAT AAAGTCCCGG GAGAAATACT AGAAATAAAA TATGTCTAGG 14497 .......... .......... .......... .......... .......... .......... 308 TGCCTTGTAC AATAGTAAAT CATTACATTT ATTCTCGAGA TTTTAAAATA TTCATAATCT 14557 .......... .......... .......... .......... .......... .......... 308 AATACTAAAA ATTACACAAA TATTTGAGTA CTGAATTTTT TTTTTTAAAT AAATGCATCA 14617 .......... .......... .......... .......... .......... .......... 308 AGCAGAGTGA CCGTGCAGGC AAAGGAAGGA CTCAAATTTT AAAAGGAATA GAATAATGCA 14677 .......... .......... .......... .......... .......... .......... 308 CAATCATCTT GGTCTCTTCG ACCAAGACAG TTCAACCAAG ACGGTTCGAC AAAATTTAGT 14737 .......... .......... .......... .......... .......... .......... 308 AATTTTGATT TCAAGCAATT TATTAATACG TATAGATTAT TAATTTAGAA TTCGATAACA 14797 .......... .......... .......... .......... .......... .......... 308 CTCAAAAAAG ACTAAAATTT AATATACATG AACTTCAAAT CATGGTCTGC CTTCACCTTA 14857 .......... .......... .......... .......... .......... .......... 308 TACTGTAGTT TAATATGAAA AAGTCTTCCT GGCAAAGTTA TTGGTATTTT TTTATTGGAG 14917 .......... .......... .......... .......... .......... .......... 308 TAGGTTCAGA ATATCTTTCT TGTTGGAACC ATTCCAGCTT ATATAGTTTC TTATTTTCTC 14977 .......... .......... .......... .......... .......... .......... 308 GTTTTTTAAT AAAAAAGACT TAGGTTAATA TTAAAACAGA TCGATGTACT AACAAATTAT 15037 .......... .......... .......... .......... .......... .......... 308 AAACTAAAAC TCAGTTTGAA CGTCTGCAAA TAAATAAATA ATAATAATAT TTCTCAAAGA 15097 .......... .......... .......... .......... .......... .......... 308 AGTCAACTAC ATAGCAGAAA TTACTTCCTA GCCCATTAGA TATATTTTCT CTTGTTTTTT 15157 .......... .......... .......... .......... .......... .......... 308 CCTTCTTTGT CCAGTTGATT GTCCCAACCT TCAATAAGGT AACAAAAACC AAGACTTCAA 15217 .......... .......... .......... .......... .......... .......... 308 GCACAGGTGG ATTCACTTGT ATGCACTTGC GAAGCCAGAA ATTTCACTAA GTATATTCAA 15277 .......... .......... .......... .......... .......... .......... 308 GGTTTAATGT GTGTTTTTGT GTGAGTGAGA TTGATAATTT GACCTATATA TGCATGCAAT 15337 .......... .......... .......... .......... .......... .......... 308 ATAGTTTTCC GCCAAAGGTT GTTCAACAAA CCAACCTTCA ATGATAAAAG TGCTAACTAA 15397 .......... .......... .......... .......... .......... .......... 308 AGCTAAATAT CAGATCCAGT GTCATAAAAA GTGTCATATA GTGATAATTC GTATATATTG 15457 .......... .......... .......... .......... .......... .......... 308 AATTAAAATC ATGGATCCAC CTCTAGCTTC AGCAGTAGAT GAATAACTAT GAGTTCCATT 15517 .......... .......... .......... .......... .......... .......... 308 GTTTGGATTT CTATTTCCTA TGGGATTTCA TATCCTTTGG ATATTCATCT GGCAAATATC 15577 .......... .......... .......... .......... .......... .......... 308 AATATCAATT TATAGAATTA TAGTACAGTA TATGTTCGTG AAAAAATGGA AACGTAAAAC 15637 .......... .......... .......... .......... .......... .......... 308 AGACAAGCTC AACAGGCATC AATTAGTGCC TTTTATATTT CAAGTAAGGA GACTGACCGA 15697 .......... .......... .......... .......... .......... .......... 308 AAATACACCA GGTAGTGAAA AGATTTTAGC ACTCAGCAAG AGGAACTTGT ATTATCTTTC 15757 .......... .......... .......... .......... .......... .......... 308 TAGCTTGCTT AATATTGCTT CCATTGGAGG CAGTTCCTTT AAAATTTGAG CCCAGTCAGG 15817 .......... .......... .......... .......... .......... .......... 308 CATCCCCTGC AGAGTTGGTC GATATTAAAA TCGAAGAATT AACCCGTATG AGTAGTAAAT 15877 .......... .......... .......... .......... .......... .......... 308 TATACTTTTA TGGATTCAGT CATCTATGTT ATGTCATGAT CAATTTATGT GCTGATTATC 15937 .......... .......... .......... .......... .......... .......... 308 TATGCATATA AGTATATAAC TGAGTCAGGA TTAATGGACC AAAGAAGAGA ATTGCAGCAA 15997 .......... .......... .......... .......... .......... .......... 308 CAAGTACTGA CCTTTCCTGA TCGTCGAACA CGACCATCTA GACGAAGTAT AATGTAGACA 16057 .......... .......... .......... .......... .......... .......... 308 TCTTCACCAG GAGTAACACC TTCAGACTTC TGTTGATCCC TTATCCACCT GAAATAGATT 16117 .......... .......... .......... .......... .......... .......... 308 TGATGTCAGA CAATAGATCT GGTACAAGGG AAAACAATTC AGTCGATCAA ACTGTGTAAC 16177 .......... .......... .......... .......... .......... .......... 308 GGGATGTACA ATAAATTAAA GTGGCACAAA CATAAATGCA CATGACCATG ACGGGATGAG 16237 .......... .......... .......... .......... .......... .......... 308 TAGTTTCAAG AGTATTTGGA ACAGGTAGAT ATACCTTTCC CACTCTGCAG GTGAAACGAC 16297 .......... .......... .......... .......... .......... .......... 308 CTCCGCCTTG AACCGGATCT CTGACTTCAG TTTTGATTTT GCAATTATTG ACTGAGTTCT 16357 .......... .......... .......... .......... .......... .......... 308 TTGGTCAAAA TCTTCCTACA GATTGGACAA AAATAAGAAC CTGAATGAGA TAAATGCACC 16417 .......... .......... .......... .......... .......... .......... 308 ATTAAGATTA TCGTCTTATG CTTGTTGAGC AACAAGAAAC GACTTTCAAA AATCCTGTAG 16477 .......... .......... .......... .......... .......... .......... 308 CATGTAAAAA AAATATATAT TTTGCACTTT GAGACTTGAT GTGCATGCAA TAGCATCATG 16537 .......... .......... .......... .......... .......... .......... 308 TTGTTTAAAA TGATCATTAA GTGCTACTAA AAATCTTCAA TCTTACCCCT ATGGATGGGA 16597 .......... .......... .......... .......... .......... .......... 308 GGGAAGCGAC TGCCTTTTGC GAAACACCAA AACCTTTCTT CTCTATTTTT GTTTCCCTAC 16657 .......... .......... .......... .......... .......... .......... 308 CTTCACCCCA AATAACAGGA ACTAGAAGGA CCCCGCGTTT AAGAAGCTCA GTGCGGAATT 16717 .......... .......... .......... .......... .......... .......... 308 TCTCAGCATT TCTCATAGCC AGAGAGACTG TCTCCTTTTT TCCAGCCAAA ATAACCTAAC 16777 .......... .......... .......... .......... .......... .......... 308 AAGCAAATTG GTCAAACTTC AAATACAAGG AGAGAATATG TTCAAGTCCG GAATTTTTAA 16837 .......... .......... .......... .......... .......... .......... 308 TAAGGAAAAA GAGAGACAGA GAGAACAAAT ACGTAATGGG ATAGCTTTTT GCTCATGAAA 16897 .......... .......... .......... .......... .......... .......... 308 CAAAGACATT TGTTGCAAAC CTAGCAAAAT ATGTTACTGA CTTTTTCTCT TTAATTATCC 16957 .......... .......... .......... .......... .......... .......... 308 TCCGACAATG TAAATGTGAT GGAAATGGAT TCTTCAAACG AAGGAGGTAA ACAATATCTT 17017 .......... .......... .......... .......... .......... .......... 308 TTGATCTGTT AAGCAGGATA CATGGCAACG AGAACTAGAC TAGTGACAGG ACATCTCATA 17077 .......... .......... .......... .......... .......... .......... 308 CAGGACATCT CATTCTTCAA GGTAAAATTG AAGGACTCCG GACGTTTTCC ACTCAACTTC 17137 .......... .......... .......... .......... .......... .......... 308 TAGAAATTGA CAGGAAGAGT AAAGGGGAAA TTGTCTCACT AACAGGCCTC ACGGTGTCTC 17197 .......... .......... .......... .......... .......... .......... 308 TTAGCTGGAC GAGTTCAACA ATCCTATCAG TTGAAAGGCG CAAAGGCAGC CTTGATAGTG 17257 .......... .......... .......... .......... .......... .......... 308 TTTCATCACG CAATATTTGT GCAAGCTGTT CCTCCTCTTT TTTGTTGTCC CAAAACAACA 17317 .......... .......... .......... .......... .......... .......... 308 AAGCCACAAA AACAGCAATA CCTGCAGAGT GCTGGTTCAA GAGGTTAATC ATTTTTGAGA 17377 .......... .......... .......... .......... .......... .......... 308 AACAGTTCTT TTCATTTAAA AGCGCAAGTT AATATAGCCA GCAAAAATAA TTTCTAGAAC 17437 .......... .......... .......... .......... .......... .......... 308 TTTAGAAGTG AACTAAATGA TTCGTAGCAA AGGTAGTCCT TATTTACCAG CGACATTTAT 17497 .......... .......... .......... .......... .......... .......... 308 AGCAGCATTT CCTGCAGTCG CCCCTAAATC AGGAGCACCA TCTCCACCTT GAATTGCACG 17557 .......... .......... .......... .......... .......... .......... 308 GATTAACCTG GGTATAGTGA AAAATGTAGA AATTCCAGCA GCAGCAATGA ATGCCACATA 17617 .......... .......... .......... .......... .......... .......... 308 AAAGAATCTT CTGACCCCCC TAAACGGTGC TTGTACCTCA CTGATGAGCT TAAGGTCTCT 17677 .......... .......... .......... .......... .......... .......... 308 ACGGAAACTG TAACCTATGT CTTCCCCTCC TAACCTAGCC TGTGAGGTAA ATAGAAGACT 17737 .......... .......... .......... .......... .......... .......... 308 ATGAGATTCA GAAACAATGA AGCAAAGCTA GAGCATTACG AAGTTCTGGA GATACTGTAA 17797 .......... .......... .......... .......... .......... .......... 308 CATAGTCAAT GTGGTAAGGA GCTGAATTAA ATTCAGACAT CCAAGCATGA CGAAAACTAA 17857 .......... .......... .......... .......... .......... .......... 308 AGAACTATAT CAACAGCTGG AAACAAACCT CTTCCTGTAG TTCCTTAAAT TCAGGCAATG 17917 .......... .......... .......... .......... .......... .......... 308 CTCTGAAGGA AGCCAAATCT GGATCATTTA AGATTGTGCC AAACTTAAGG TTATACTCTT 17977 .......... .......... .......... .......... .......... .......... 308 TCAAAGCAGT GCGTAAACAC TCAGCAGCTT TCTTTCCTTC CCCCCTTGAG TTAATAAAGA 18037 .......... .......... .......... .......... .......... .......... 308 AAAATAAATT ATTAAACAAA TAGAGAAATA ATAAAAGAGT GAAGAAGAAT AAATAAGTAT 18097 .......... .......... .......... .......... .......... .......... 308 AGTAGACGCT CATTGAAGAT TCAACATTGG TTTTAGTGAT CCTTCCAATG AAACAATTAA 18157 .......... .......... .......... .......... .......... .......... 308 GCTAAGAACT GTGATTAAGA AACTACAGGT GCCTCAACTT ACAGTTCACA AAGCTAAAAC 18217 .......... .......... .......... .......... .......... .......... 308 TGTGATTAAG AAATTTTAAA TTCCTTTACT TGCATTTTAC ACGAGAAGAA AGTCGAAAAT 18277 .......... .......... .......... .......... .......... .......... 308 ATGTTTTTTT TCTCAATGTC TCACATGATT TCAAACAAAT AACCGTTTAT AAGATCCAGC 18337 .......... .......... .......... .......... .......... .......... 308 TTAATCAATT CAACCATGAA AGAGGCAAAG GAGAGATATA CCTGTAGGCA TGGCAACATG 18397 .......... .......... .......... .......... .......... .......... 308 CTTTGTTATA AAATGCAGCT TGAGCCTCCT CAGGGTTGGG ATTTAAAGTA AGTGCTGTGT 18457 .......... .......... .......... .......... .......... .......... 308 CGAATAGCAC AAGAGCATCT TTCACCTGGA AAGATCAAAT TTAGTGAACA ATAAGGTGCA 18517 .......... .......... .......... .......... .......... .......... 308 TGTCGCCCAT ATTGCAGATT GAAATTGTAA CCAACTGCAA GAGGTTTGTA CACAAACCAA 18577 .......... .......... .......... .......... .......... .......... 308 AATGCAGATT ATTGGAAACA TAATTAAGTG AGAAATAACC AAAAAAAAAA AAGCCAACTG 18637 .......... .......... .......... .......... .......... .......... 308 AATAGATTTT AAATAGGTAA TTTCACAAAA GACAAATTGT TGATTAAAAG CAAAAAGAAG 18697 .......... .......... .......... .......... .......... .......... 308 ATTTGATCAA TGACATTAAC AAATTGATAG CTTTTCCAGT AATAATTGTG ATTCAACCTA 18757 .......... .......... .......... .......... .......... .......... 308 CTAGATACTA TCCTTAAGCT TGCTATGCAG GATCAAATAG AGATACACAA GAAACTATAA 18817 .......... .......... .......... .......... .......... .......... 308 CTCAGACATT CAAGCTTCTG ATGTGCACAA GATTTAACAG AAAAACTTGT TGAGTAGGAA 18877 .......... .......... .......... .......... .......... .......... 308 AATCTTTGAA TCATCAATCT TTTGGTACAC TTTCAGTTTT AGACAGAAAT TTACTTTCCA 18937 .......... .......... .......... .......... .......... .......... 308 ATTATACTCC TTCCTAGTCC ATCTATCTCA ATTCATGTGA CACATGTTCT TGTTTGATGT 18997 .......... .......... .......... .......... .......... .......... 308 CAAATTTACA ATCTTAGCAT TATTATAAAG TTCCAGTCAT ACGCATTTTG AAATCTAGTT 19057 .......... .......... .......... .......... .......... .......... 308 GTTGTTACAG ATAAAATCTG AAAGTACCTT CCTATTTTTC AAGTCACTGT CAAAAGTCCA 19117 .......... .......... .......... .......... .......... .......... 308 CTCACGACCA ATTTTCAATA AAGCATGATT AGGGTGGGTG TTAAGTGTAG GAGCAGAGCC 19177 .......... .......... .......... .......... .......... .......... 308 ACCCCTTCAG CCATTTGGTT CAACAAAACC CTGTAGCTTT GGCTCAAACC TCGTAGTTAT 19237 .......... .......... .......... .......... .......... .......... 308 GTTTCAAAAA GTTCATTTGC TATGTGTAAA ATGATATATT CAAAACTCAT AAACTAAAAA 19297 .......... .......... .......... .......... .......... .......... 308 TCCTAGATAC ACCTCTGAGT ACAACTGTTA CCATAAGATT TACTCAAAAC AGGAAAATTT 19357 .......... .......... .......... .......... .......... .......... 308 TGAATAGACA TGTCAAACTA GTAAACTTGC TCAAGTTGCA GATTCTTGGA GAACACTTGC 19417 .......... .......... .......... .......... .......... .......... 308 CCTAACCAAA CTCTCAAAGC CCCCTATGAC TACATTGTGA ATGTTGATTA AGTTAGAAGG 19477 .......... .......... .......... .......... .......... .......... 308 AACGAGCCTC AATTTATAGA GTACTAAACT TTTTTCTACA AGAAAATGAC TACTCAAAAA 19537 .......... .......... .......... .......... .......... .......... 308 CGGAAACTTT ATAACCTATT TATGCTAAGA AACTCAGGTC AAATACCCAA CAACAAACTC 19597 .......... .......... .......... .......... .......... .......... 308 TTGGAGAACA CTTGAGGAAA ATTAGGTAAC AAATAGAAGG GAATAATATG ACTGCAAATG 19657 .......... .......... .......... .......... .......... .......... 308 ATCAAATGAT GGGGATATTA AAACTCGAAA TCAAAACCTC TATTGTCATT TATCAAATTA 19717 .......... .......... .......... .......... .......... .......... 308 AATTGCATAA ATTAAAACGA ATCCCTTTTA TTTACTTACT TAACTTATTT GATATCTACA 19777 .......... .......... .......... .......... .......... .......... 308 ACTAGTGGCG TAGCTACAAT TGAAGTATAT CTTTGTTAAA AAAATAAATA TATATAAATA 19837 .......... .......... .......... .......... .......... .......... 308 AACAAAAATT TTATGGTCAC GAACCTCAAT CGACACACTC TCTGTCTAAA TCAACAAAAA 19897 .......... .......... .......... .......... .......... .......... 308 TAACAATACA ACAACAATAC TGAGGAGTTT CATAAAGCAT AATGTATATG CAGACCTTAT 19957 .......... .......... .......... .......... .......... .......... 308 CATTGTTTAT AATAGACCTG AAAAACAATG CAATCACATA ATAATTAAGG AATTTCTTCT 20017 .......... .......... .......... .......... .......... .......... 308 GCCATGAAAT TGAAACAGAA TTAAGATAAT ACCCGTCCTT TAGAGAAGAG GTCGAGGCCG 20077 .......... .......... .......... .......... .......... .......... 308 GCATTGACAC AGGACTCAGC AGTGGGAGTG GGTTCACTGA CTTGGGGCGA CGACGATGAT 20137 .......... .......... .......... .......... .......... .......... 308 GGCGGAGGGG GCGAAGTAGA AGAAGAACAA ATGACGAGTG AAATACGATT CTTCGGGGGA 20197 .......... .......... .......... .......... .......... .......... 308 AAATTTAGGC ATAATTGTGT AGTGATTGAG AATTTGGAAG ATGATGAAGA GAAGAAGCAG 20257 .......... .......... .......... .......... .......... .......... 308 AAATGGAGGT GATGAGAGGA GGGAAGAGTT GTAATGGCCA TGTCAAGATT TATCCATTCG 20317 .......... .......... .......... .......... .......... .......... 308 CACCACACCT TACACTTCTT GTGGTTTTCA TAAAAGGCTT TAATTCTAAA ATTTGGTTTT 20377 .......... .......... .......... .......... .......... .......... 308 ATTAAGGGAT TATTATTACT TCCTCACTCT CAAAATATTT GTAATGTTTA ATCATTTTTA 20437 .......... .......... .......... .......... .......... .......... 308 AAAATATTAA TTCAAAAATG TTTGATTATT TCCTTAACAT TTCTTCTATA GGTGTTTATT 20497 .......... .......... .......... .......... .......... .......... 308 TTATACATAA AAAAAAATTC GTTTAATTTT ATTTCTTCAA TTTAAACTTT GCATACTTCA 20557 .......... .......... .......... .......... .......... .......... 308 AAAAAAAGAG AAATGATATG TTATTTTATC ATAATACACA TATTAATTGG GGTTTAATAT 20617 .......... .......... .......... .......... .......... .......... 308 TAAATATTGA AATATGATTT GAAGAATAAG TAATTAATGT TAAGAATAAA ATGATTTTTT 20677 .......... .......... .......... .......... .......... .......... 308 CTTGGTTTAT ACAAAGTGTC AAAGGAAAAG TGAAATTTTT CTTATAAAAT GATTATGGTT 20737 .......... .......... .......... .......... .......... .......... 308 TAGGCTTAAT TAGAGATTTA TAGTTATAGT TCGCTTAATT ATAAATTATA GATACGTATT 20797 .......... .......... .......... .......... .......... .......... 308 AGGGGGAGGA GAGATGCGAG CGAGATCAGA AATTGAGCTG TTCAAAATGA AAC 20850 || | | | | ||||| || | |||||| | ||| || ||| ..CTCAAGTA TATAACCCTC AGAGAT-AG- GA-TGAGCTC TGTGAAA-GA AAC 355 hqPGS_C09HBa0099P03.1-2+_SGN-U337965+ (14138 14367) ******************************************************************************** EST sequence 24 +strand 531 n (File: SGN-U329066+) 1 CAAAAGGCAG TCGCTTCCCT CCCATCCATA GGGGAAGATT TTGACCAAAG AACTCAGTCA 61 ATAATTGCAA AATCAAAACT GAAGTCAGAG ATCCGGTTCA AGGCGGAGGT CGTTTCACCT 121 GCAGAGTGGG AAAGGTGGAT AAGGGATCAA CAGAAGTCTG AAGGTGTTAC TCCTGGTGAA 181 GATGTCTACA TTATACTTCG TCTAGATGGT CGTGTTCGAC GATCAGGAAA GGGGATGCCT 241 GACTGGGCTC AAATTTTAAA GGAACTGCCT CCAATGGAAG CAATATTAAG CAAGCTAGAA 301 AGATAATACA AGTTCCTCTT GCTGAGTGCT AAAATCTTTT CACTACCTGG TGTATTTTCG 361 GTCAGTCTCC TTACTTGAAA TATAAAAGGC ACTAATTGAT GCCTGTTGAG CTTGTCTGTT 421 TTACGTTTCC ATTTTTTCAC GAACATATAC TGTACTATAA TTCTATAAAT TGATATTGAT 481 ATTTGCCAGA TGAATATCCA AAGGATATGA AATCCCATAG GAAAAAAAAA A Predicted gene structure (within gDNA segment 17216 to 14851): Exon 1 16616 16584 ( 33 n); cDNA 1 33 ( 33 n); score: 1.000 Intron 1 16583 16373 ( 211 n); Pd: 0.996 (s: 0), Pa: 0.982 (s: 1.00) Exon 2 16372 16272 ( 101 n); cDNA 34 134 ( 101 n); score: 1.000 Intron 2 16271 16106 ( 166 n); Pd: 0.930 (s: 1.00), Pa: 0.997 (s: 1.00) Exon 3 16105 16009 ( 97 n); cDNA 135 231 ( 97 n); score: 1.000 Intron 3 16008 15824 ( 185 n); Pd: 0.940 (s: 1.00), Pa: 0.993 (s: 1.00) Exon 4 15823 15525 ( 299 n); cDNA 232 530 ( 299 n); score: 0.993 MATCH C09HBa0099P03.1-2- SGN-U329066+ 0.996 530 0.998 C PGS_C09HBa0099P03.1-2-_SGN-U329066+ (16616 16584,16372 16272,16105 16009,15823 15525) Alignment (genomic DNA sequence = upper lines): CAAAAGGCAG TCGCTTCCCT CCCATCCATA GGGGTAAGAT TGAAGATTTT TAGTAGCACT 16557 |||||||||| |||||||||| |||||||||| ||| CAAAAGGCAG TCGCTTCCCT CCCATCCATA GGG....... .......... .......... 33 TAATGATCAT TTTAAACAAC ATGATGCTAT TGCATGCACA TCAAGTCTCA AAGTGCAAAA 16497 .......... .......... .......... .......... .......... .......... 33 TATATATTTT TTTTACATGC TACAGGATTT TTGAAAGTCG TTTCTTGTTG CTCAACAAGC 16437 .......... .......... .......... .......... .......... .......... 33 ATAAGACGAT AATCTTAATG GTGCATTTAT CTCATTCAGG TTCTTATTTT TGTCCAATCT 16377 .......... .......... .......... .......... .......... .......... 33 GTAGGAAGAT TTTGACCAAA GAACTCAGTC AATAATTGCA AAATCAAAAC TGAAGTCAGA 16317 |||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ....GAAGAT TTTGACCAAA GAACTCAGTC AATAATTGCA AAATCAAAAC TGAAGTCAGA 89 GATCCGGTTC AAGGCGGAGG TCGTTTCACC TGCAGAGTGG GAAAGGTATA TCTACCTGTT 16257 |||||||||| |||||||||| |||||||||| |||||||||| ||||| GATCCGGTTC AAGGCGGAGG TCGTTTCACC TGCAGAGTGG GAAAG..... .......... 134 CCAAATACTC TTGAAACTAC TCATCCCGTC ATGGTCATGT GCATTTATGT TTGTGCCACT 16197 .......... .......... .......... .......... .......... .......... 134 TTAATTTATT GTACATCCCG TTACACAGTT TGATCGACTG AATTGTTTTC CCTTGTACCA 16137 .......... .......... .......... .......... .......... .......... 134 GATCTATTGT CTGACATCAA ATCTATTTCA GGTGGATAAG GGATCAACAG AAGTCTGAAG 16077 ||||||||| |||||||||| |||||||||| .......... .......... .......... .GTGGATAAG GGATCAACAG AAGTCTGAAG 163 GTGTTACTCC TGGTGAAGAT GTCTACATTA TACTTCGTCT AGATGGTCGT GTTCGACGAT 16017 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTGTTACTCC TGGTGAAGAT GTCTACATTA TACTTCGTCT AGATGGTCGT GTTCGACGAT 223 CAGGAAAGGT CAGTACTTGT TGCTGCAATT CTCTTCTTTG GTCCATTAAT CCTGACTCAG 15957 |||||||| CAGGAAAG.. .......... .......... .......... .......... .......... 231 TTATATACTT ATATGCATAG ATAATCAGCA CATAAATTGA TCATGACATA ACATAGATGA 15897 .......... .......... .......... .......... .......... .......... 231 CTGAATCCAT AAAAGTATAA TTTACTACTC ATACGGGTTA ATTCTTCGAT TTTAATATCG 15837 .......... .......... .......... .......... .......... .......... 231 ACCAACTCTG CAGGGGATGC CTGACTGGGC TCAAATTTTA AAGGAACTGC CTCCAATGGA 15777 ||||||| |||||||||| |||||||||| |||||||||| |||||||||| .......... ...GGGATGC CTGACTGGGC TCAAATTTTA AAGGAACTGC CTCCAATGGA 278 AGCAATATTA AGCAAGCTAG AAAGATAATA CAAGTTCCTC TTGCTGAGTG CTAAAATCTT 15717 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGCAATATTA AGCAAGCTAG AAAGATAATA CAAGTTCCTC TTGCTGAGTG CTAAAATCTT 338 TTCACTACCT GGTGTATTTT CGGTCAGTCT CCTTACTTGA AATATAAAAG GCACTAATTG 15657 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCACTACCT GGTGTATTTT CGGTCAGTCT CCTTACTTGA AATATAAAAG GCACTAATTG 398 ATGCCTGTTG AGCTTGTCTG TTTTACGTTT CCATTTTTTC ACGAACATAT ACTGTACTAT 15597 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATGCCTGTTG AGCTTGTCTG TTTTACGTTT CCATTTTTTC ACGAACATAT ACTGTACTAT 458 AATTCTATAA ATTGATATTG ATATTTGCCA GATGAATATC CAAAGGATAT GAAATCCCAT 15537 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AATTCTATAA ATTGATATTG ATATTTGCCA GATGAATATC CAAAGGATAT GAAATCCCAT 518 AGGAAATAGA AA 15525 |||||| | | || AGGAAAAAAA AA 530 hqPGS_C09HBa0099P03.1-2-_SGN-U329066+ (16616 16584,16372 16272,16105 16009,15823 15525) ******************************************************************************** EST sequence 19 +strand 1257 n (File: SGN-U324334+) 1 TTGACATGGC CATTGCAACT CTTCCCTCCT CTCATCACCT CCATTTCTGC TTCTTCTCTT 61 CATCATCTTC CAAATTCTCA ATCACTACAC AATTATGCCT AAATTTTCCC CCGAAGAATC 121 GTATTTCACT CGTCATTTGT TCTTCTTCTA CTTCGCCCCC TCCGCCATCA TCGTCGTCGC 181 CCCAAGTCAG TGAACCCACT CCCACTGCTG AGTCCTGTGT CAATGCCGGC CTCGACCTCT 241 TCTCTAAAGG ACGGGTGAAA GATGCTCTTG TGCTATTCGA CACAGCACTT ACTTTAAATC 301 CCAACCCTGA GGAGGCTCAA GCTGCATTTT ATAACAAAGC ATGTTGCCAT GCCTACAGGG 361 GGGAAGGAAA GAAAGCTGCT GAGTGTTTAC GCACTGCTTT GAAAGAGTAT AACCTTAAGT 421 TTGGCACAAT CTTAAATGAT CCAGATTTGG CTTCCTTCAG AGCATTGCCT GAATTTAAGG 481 AACTACAGGA AGAGGCTAGG TTAGGAGGGG AAGACATAGG TTACAGTTTC CGTAGAGACC 541 TTAAGCTCAT CAGTGAGGTA CAAGCACCGT TTAGGGGGGT CAGAAGATTC TTTTATGTGG 601 CATTCATTGC TGCTGCTGGA ATTTCTACAT TTTTCACTAT ACCCAGGTTA ATCCGTGCAA 661 TTCAAGGTGG AGATGGTGCT CCTGATTTAG GGGCGACTGC AGGAAATGCT GCTATAAATG 721 TCGCTGGTAT TGCTGTTTTT GTGGCTTTGT TGTTTTGGGA CAACAAAAAA GAGGAGGAAC 781 AGCTTGCACA AATATTGCGT GATGAAACAC TATCAAGGCT GCCTTTGCGC CTTTCAACTG 841 ATAGGATTGT TGAACTCGTC CAGCTAAGAG ACACCGTGAG GCCTGTTAGT GAGACAACTT 901 CCCCTTTACT CTTCCTGTTT TCAATTTCCA GAAGTTGAGA TGAAAACGTC AGGAGTCCTT 961 CAATTTTACC TTGAAGAATG AGAAGTCCTG TATAACTTTT ACTGGTGTCA CTTGTCTAGT 1021 TCTCTTTGCC CTGTATTCTG TGTAAGAGAT CAAAACTTAT GGTTTACCTC CCTCATTTGA 1081 AGAATCCATT TCCATCACAT TTACATTGTC GGAGGGATGA TAAAGACAGA AAAGTTAGTA 1141 ACATCTTTTG CTAGGTTTGC AACAAATGCG TTATGAGCAA AAAGATATCC CTTTATGTAT 1201 TTGGTTTCTC TGTATCTCTA TTCTTTTTCT AAAATGCCTG ACTGACTCAA ATATATT Predicted gene structure (within gDNA segment 21038 to 14926): Exon 1 20303 20050 ( 254 n); cDNA 1 254 ( 254 n); score: 0.996 Intron 1 20049 18483 (1567 n); Pd: 0.998 (s: 1.00), Pa: 0.988 (s: 1.00) Exon 2 18482 18379 ( 104 n); cDNA 255 358 ( 104 n); score: 1.000 Intron 2 18378 18022 ( 357 n); Pd: 0.998 (s: 1.00), Pa: 0.999 (s: 1.00) Exon 3 18021 17886 ( 136 n); cDNA 359 494 ( 136 n); score: 1.000 Intron 3 17885 17717 ( 169 n); Pd: 0.996 (s: 1.00), Pa: 0.974 (s: 1.00) Exon 4 17716 17485 ( 232 n); cDNA 495 726 ( 232 n); score: 1.000 Intron 4 17484 17339 ( 146 n); Pd: 0.994 (s: 1.00), Pa: 0.175 (s: 1.00) Exon 5 17338 16813 ( 526 n); cDNA 727 1252 ( 526 n); score: 0.861 MATCH C09HBa0099P03.1-2- SGN-U324334+ 0.941 1252 0.996 C PGS_C09HBa0099P03.1-2-_SGN-U324334+ (20303 20050,18482 18379,18021 17886,17716 17485,17338 16813) Alignment (genomic DNA sequence = upper lines): TTGACATGGC CATTACAACT CTTCCCTCCT CTCATCACCT CCATTTCTGC TTCTTCTCTT 20244 |||||||||| |||| ||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGACATGGC CATTGCAACT CTTCCCTCCT CTCATCACCT CCATTTCTGC TTCTTCTCTT 60 CATCATCTTC CAAATTCTCA ATCACTACAC AATTATGCCT AAATTTTCCC CCGAAGAATC 20184 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CATCATCTTC CAAATTCTCA ATCACTACAC AATTATGCCT AAATTTTCCC CCGAAGAATC 120 GTATTTCACT CGTCATTTGT TCTTCTTCTA CTTCGCCCCC TCCGCCATCA TCGTCGTCGC 20124 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTATTTCACT CGTCATTTGT TCTTCTTCTA CTTCGCCCCC TCCGCCATCA TCGTCGTCGC 180 CCCAAGTCAG TGAACCCACT CCCACTGCTG AGTCCTGTGT CAATGCCGGC CTCGACCTCT 20064 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCCAAGTCAG TGAACCCACT CCCACTGCTG AGTCCTGTGT CAATGCCGGC CTCGACCTCT 240 TCTCTAAAGG ACGGGTATTA TCTTAATTCT GTTTCAATTT CATGGCAGAA GAAATTCCTT 20004 |||||||||| |||| TCTCTAAAGG ACGG...... .......... .......... .......... .......... 254 AATTATTATG TGATTGCATT GTTTTTCAGG TCTATTATAA ACAATGATAA GGTCTGCATA 19944 .......... .......... .......... .......... .......... .......... 254 TACATTATGC TTTATGAAAC TCCTCAGTAT TGTTGTTGTA TTGTTATTTT TGTTGATTTA 19884 .......... .......... .......... .......... .......... .......... 254 GACAGAGAGT GTGTCGATTG AGGTTCGTGA CCATAAAATT TTTGTTTATT TATATATATT 19824 .......... .......... .......... .......... .......... .......... 254 TATTTTTTTA ACAAAGATAT ACTTCAATTG TAGCTACGCC ACTAGTTGTA GATATCAAAT 19764 .......... .......... .......... .......... .......... .......... 254 AAGTTAAGTA AGTAAATAAA AGGGATTCGT TTTAATTTAT GCAATTTAAT TTGATAAATG 19704 .......... .......... .......... .......... .......... .......... 254 ACAATAGAGG TTTTGATTTC GAGTTTTAAT ATCCCCATCA TTTGATCATT TGCAGTCATA 19644 .......... .......... .......... .......... .......... .......... 254 TTATTCCCTT CTATTTGTTA CCTAATTTTC CTCAAGTGTT CTCCAAGAGT TTGTTGTTGG 19584 .......... .......... .......... .......... .......... .......... 254 GTATTTGACC TGAGTTTCTT AGCATAAATA GGTTATAAAG TTTCCGTTTT TGAGTAGTCA 19524 .......... .......... .......... .......... .......... .......... 254 TTTTCTTGTA GAAAAAAGTT TAGTACTCTA TAAATTGAGG CTCGTTCCTT CTAACTTAAT 19464 .......... .......... .......... .......... .......... .......... 254 CAACATTCAC AATGTAGTCA TAGGGGGCTT TGAGAGTTTG GTTAGGGCAA GTGTTCTCCA 19404 .......... .......... .......... .......... .......... .......... 254 AGAATCTGCA ACTTGAGCAA GTTTACTAGT TTGACATGTC TATTCAAAAT TTTCCTGTTT 19344 .......... .......... .......... .......... .......... .......... 254 TGAGTAAATC TTATGGTAAC AGTTGTACTC AGAGGTGTAT CTAGGATTTT TAGTTTATGA 19284 .......... .......... .......... .......... .......... .......... 254 GTTTTGAATA TATCATTTTA CACATAGCAA ATGAACTTTT TGAAACATAA CTACGAGGTT 19224 .......... .......... .......... .......... .......... .......... 254 TGAGCCAAAG CTACAGGGTT TTGTTGAACC AAATGGCTGA AGGGGTGGCT CTGCTCCTAC 19164 .......... .......... .......... .......... .......... .......... 254 ACTTAACACC CACCCTAATC ATGCTTTATT GAAAATTGGT CGTGAGTGGA CTTTTGACAG 19104 .......... .......... .......... .......... .......... .......... 254 TGACTTGAAA AATAGGAAGG TACTTTCAGA TTTTATCTGT AACAACAACT AGATTTCAAA 19044 .......... .......... .......... .......... .......... .......... 254 ATGCGTATGA CTGGAACTTT ATAATAATGC TAAGATTGTA AATTTGACAT CAAACAAGAA 18984 .......... .......... .......... .......... .......... .......... 254 CATGTGTCAC ATGAATTGAG ATAGATGGAC TAGGAAGGAG TATAATTGGA AAGTAAATTT 18924 .......... .......... .......... .......... .......... .......... 254 CTGTCTAAAA CTGAAAGTGT ACCAAAAGAT TGATGATTCA AAGATTTTCC TACTCAACAA 18864 .......... .......... .......... .......... .......... .......... 254 GTTTTTCTGT TAAATCTTGT GCACATCAGA AGCTTGAATG TCTGAGTTAT AGTTTCTTGT 18804 .......... .......... .......... .......... .......... .......... 254 GTATCTCTAT TTGATCCTGC ATAGCAAGCT TAAGGATAGT ATCTAGTAGG TTGAATCACA 18744 .......... .......... .......... .......... .......... .......... 254 ATTATTACTG GAAAAGCTAT CAATTTGTTA ATGTCATTGA TCAAATCTTC TTTTTGCTTT 18684 .......... .......... .......... .......... .......... .......... 254 TAATCAACAA TTTGTCTTTT GTGAAATTAC CTATTTAAAA TCTATTCAGT TGGCTTTTTT 18624 .......... .......... .......... .......... .......... .......... 254 TTTTTTGGTT ATTTCTCACT TAATTATGTT TCCAATAATC TGCATTTTGG TTTGTGTACA 18564 .......... .......... .......... .......... .......... .......... 254 AACCTCTTGC AGTTGGTTAC AATTTCAATC TGCAATATGG GCGACATGCA CCTTATTGTT 18504 .......... .......... .......... .......... .......... .......... 254 CACTAAATTT GATCTTTCCA GGTGAAAGAT GCTCTTGTGC TATTCGACAC AGCACTTACT 18444 ||||||||| |||||||||| |||||||||| |||||||||| .......... .......... .GTGAAAGAT GCTCTTGTGC TATTCGACAC AGCACTTACT 293 TTAAATCCCA ACCCTGAGGA GGCTCAAGCT GCATTTTATA ACAAAGCATG TTGCCATGCC 18384 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTAAATCCCA ACCCTGAGGA GGCTCAAGCT GCATTTTATA ACAAAGCATG TTGCCATGCC 353 TACAGGTATA TCTCTCCTTT GCCTCTTTCA TGGTTGAATT GATTAAGCTG GATCTTATAA 18324 ||||| TACAG..... .......... .......... .......... .......... .......... 358 ACGGTTATTT GTTTGAAATC ATGTGAGACA TTGAGAAAAA AAACATATTT TCGACTTTCT 18264 .......... .......... .......... .......... .......... .......... 358 TCTCGTGTAA AATGCAAGTA AAGGAATTTA AAATTTCTTA ATCACAGTTT TAGCTTTGTG 18204 .......... .......... .......... .......... .......... .......... 358 AACTGTAAGT TGAGGCACCT GTAGTTTCTT AATCACAGTT CTTAGCTTAA TTGTTTCATT 18144 .......... .......... .......... .......... .......... .......... 358 GGAAGGATCA CTAAAACCAA TGTTGAATCT TCAATGAGCG TCTACTATAC TTATTTATTC 18084 .......... .......... .......... .......... .......... .......... 358 TTCTTCACTC TTTTATTATT TCTCTATTTG TTTAATAATT TATTTTTCTT TATTAACTCA 18024 .......... .......... .......... .......... .......... .......... 358 AGGGGGGAAG GAAAGAAAGC TGCTGAGTGT TTACGCACTG CTTTGAAAGA GTATAACCTT 17964 |||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ..GGGGGAAG GAAAGAAAGC TGCTGAGTGT TTACGCACTG CTTTGAAAGA GTATAACCTT 416 AAGTTTGGCA CAATCTTAAA TGATCCAGAT TTGGCTTCCT TCAGAGCATT GCCTGAATTT 17904 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAGTTTGGCA CAATCTTAAA TGATCCAGAT TTGGCTTCCT TCAGAGCATT GCCTGAATTT 476 AAGGAACTAC AGGAAGAGGT TTGTTTCCAG CTGTTGATAT AGTTCTTTAG TTTTCGTCAT 17844 |||||||||| |||||||| AAGGAACTAC AGGAAGAG.. .......... .......... .......... .......... 494 GCTTGGATGT CTGAATTTAA TTCAGCTCCT TACCACATTG ACTATGTTAC AGTATCTCCA 17784 .......... .......... .......... .......... .......... .......... 494 GAACTTCGTA ATGCTCTAGC TTTGCTTCAT TGTTTCTGAA TCTCATAGTC TTCTATTTAC 17724 .......... .......... .......... .......... .......... .......... 494 CTCACAGGCT AGGTTAGGAG GGGAAGACAT AGGTTACAGT TTCCGTAGAG ACCTTAAGCT 17664 ||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| .......GCT AGGTTAGGAG GGGAAGACAT AGGTTACAGT TTCCGTAGAG ACCTTAAGCT 547 CATCAGTGAG GTACAAGCAC CGTTTAGGGG GGTCAGAAGA TTCTTTTATG TGGCATTCAT 17604 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CATCAGTGAG GTACAAGCAC CGTTTAGGGG GGTCAGAAGA TTCTTTTATG TGGCATTCAT 607 TGCTGCTGCT GGAATTTCTA CATTTTTCAC TATACCCAGG TTAATCCGTG CAATTCAAGG 17544 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGCTGCTGCT GGAATTTCTA CATTTTTCAC TATACCCAGG TTAATCCGTG CAATTCAAGG 667 TGGAGATGGT GCTCCTGATT TAGGGGCGAC TGCAGGAAAT GCTGCTATAA ATGTCGCTGG 17484 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| TGGAGATGGT GCTCCTGATT TAGGGGCGAC TGCAGGAAAT GCTGCTATAA ATGTCGCTG. 726 TAAATAAGGA CTACCTTTGC TACGAATCAT TTAGTTCACT TCTAAAGTTC TAGAAATTAT 17424 .......... .......... .......... .......... .......... .......... 726 TTTTGCTGGC TATATTAACT TGCGCTTTTA AATGAAAAGA ACTGTTTCTC AAAAATGATT 17364 .......... .......... .......... .......... .......... .......... 726 AACCTCTTGA ACCAGCACTC TGCAGGTATT GCTGTTTTTG TGGCTTTGTT GTTTTGGGAC 17304 ||||| |||||||||| |||||||||| |||||||||| .......... .......... .....GTATT GCTGTTTTTG TGGCTTTGTT GTTTTGGGAC 761 AACAAAAAAG AGGAGGAACA GCTTGCACAA ATATTGCGTG ATGAAACACT ATCAAGGCTG 17244 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AACAAAAAAG AGGAGGAACA GCTTGCACAA ATATTGCGTG ATGAAACACT ATCAAGGCTG 821 CCTTTGCGCC TTTCAACTGA TAGGATTGTT GAACTCGTCC AGCTAAGAGA CACCGTGAGG 17184 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCTTTGCGCC TTTCAACTGA TAGGATTGTT GAACTCGTCC AGCTAAGAGA CACCGTGAGG 881 CCTGTTAGTG AGACAATTTC CCCTTTACTC TTCCTG---T CAATTTCTAG AAGTTGAGTG 17127 |||||||||| |||||| ||| |||||||||| |||||| | ||||||| || |||||||| CCTGTTAGTG AGACAACTTC CCCTTTACTC TTCCTGTTTT CAATTTCCAG AAGTTGAGAT 941 GAAAACGTCC GGAGTCCTTC AATTTTACCT TGAAGAATGA GATGTCCTGT ATGAGATGT- 17068 ||||||||| |||||||||| |||||||||| |||||||||| || ||||||| || | | | GAAAACGTCA GGAGTCCTTC AATTTTACCT TGAAGAATGA GAAGTCCTGT ATAACTTTTA 1001 C--CTGTCAC TAGTCTAGTT CTCGTTGCCA TGTATCCTGC TTAACAGATC AAAAGATATT 17010 | |||||| | |||||||| ||| ||||| ||||| ||| ||| ||||| |||| ||| CTGGTGTCAC TTGTCTAGTT CTCTTTGCCC TGTATTCTGT GTAAGAGATC AAAACTTATG 1061 GTTTACCTCC TTCGTTTGAA GAATCCATTT CCATCACATT TACATTGTCG GA-GGATAAT 16951 |||||||||| || |||||| |||||||||| |||||||||| |||||||||| || |||| | GTTTACCTCC CTCATTTGAA GAATCCATTT CCATCACATT TACATTGTCG GAGGGATGA- 1120 TAAAGAGA-A AAAGTCAGTA ACATATTTTG CTAGGTTTGC AACAAATGTC TTTGTTTCAT 16892 |||||| | | ||||| |||| |||| ||||| |||||||||| |||||||| | | || || TAAAGACAGA AAAGTTAGTA ACATCTTTTG CTAGGTTTGC AACAAATG-C ---G-TT-AT 1174 GAGCAAAAAG CTATCCCATT ACGTATTTGT TCTCTCTGTC TCTCTTTTTC CTTATTAAAA 16832 |||||||||| |||||| || | ||||||| | ||||||| |||| | ||| || | ||| GAGCAAAAAG ATATCCCTTT ATGTATTTGG TTTCTCTGTA TCTC-TATTC TTTTTCTAAA 1233 ATTCCGGACT TGAACATAT 16813 || || |||| || || ATGCCTGACT GACTCAAAT 1252 hqPGS_C09HBa0099P03.1-2-_SGN-U324334+ (20303 20050,18482 18379,18021 17886,17716 17485,17338 16813) ******************************************************************************** EST sequence 23 +strand 1102 n (File: SGN-U338003+) 1 NGGGGAAGAT GTGAGGGATA GCGCTCTGAA TAGTGGATCC CCCGGGCTGC AGGAATTCGG 61 CACGAGGCTT GACATGGCCA TTACAACTCT TCCCTCCTCT CATCACCTCC ATTTCTGCTT 121 CTTCTCTTCA TCATCTTCCA AATTCTCAAT CACTACACAA TTATGCCTAA ATTTTCCCCC 181 GAAGAATCGT ATTTCACTCG TCATTTGTTC TTCTTCTACT TCGCCCCCTC CGCCATCATC 241 GTCGTCGCCC CAAGTCAGTG AACCCACTCC CACTGCTGAG TCCTGTGTCA ATGCCGGCCT 301 CGACCTCTTC TCTAAAGGAC GGGTATTATC TTAATTCTGT TTCAATTTCA TGGCAGAAGA 361 AATTCCTTAA TTATTATGTG ATTGCATTGT TTTTCAGGTC TATTATAAAC AATGATAAGG 421 TCTGCATATA CATTATGCTT TATGAAACTC CTCAGTATTG TTGTTGTATT GTTATTTTTG 481 TTGATTTAGA CAGAGAGTGT GTCGATTGAG GTTCGTGACC ATAAAATTTT TGTTTATTTA 541 TATATATTTA TTTTTTTAAC AAAGATATAC TTCAATTGTA GCTACGCCAC TAGTTGTAGA 601 TATCAAATAA GTTAAGTAAG TAAATAAAAG GGATTCGTTT TAATTTATGC AATTTAATTT 661 GATAAATGAC AATAGAGGTT TTGATTTCGA GTTTTAATAT CCCCATCATT TGATCATTTG 721 CAGTCATATT ATTCCCTTCT ATTTGTTACC TAATTTTCCT CAAGTGTTCT CCAAGAGTTT 781 GTTGTTGGGT ATTTGACCTG AGTTTCTTAG CATAAATAGG TTATAAAGTT TCCGTTTTTG 841 AGTAGTCATT TTCTTGTAGA AAAAAGTTTA GTACTCTATA AATTGAGGCT CGTTCCTTCT 901 AACTTAATCA ACATTCACAA TGTAGTCATA GGGGGGCTTT GAAAGTTTGG TTAGGGGCAG 961 TGTTCTCCAA GAATCTGCAA ATTGAGCCAG TTTACTAGTT TGACATGTCT AATTCAAATT 1021 TTTCTTGTTT TGAGTAAATC TTAAGGGTAA CAGTTTGTAC TCAGAGGGGT ATCTAGGAAT 1081 TTTTAGTTTA GGAAGTTTTG AA Predicted gene structure (within gDNA segment 21574 to 18131): Exon 1 20304 19276 (1029 n); cDNA 68 1102 (1035 n); score: 0.976 MATCH C09HBa0099P03.1-2- SGN-U338003+ 0.976 1029 0.934 C PGS_C09HBa0099P03.1-2-_SGN-U338003+ (20304 19276) Alignment (genomic DNA sequence = upper lines): CTTGACATGG CCATTACAAC TCTTCCCTCC TCTCATCACC TCCATTTCTG CTTCTTCTCT 20245 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTGACATGG CCATTACAAC TCTTCCCTCC TCTCATCACC TCCATTTCTG CTTCTTCTCT 127 TCATCATCTT CCAAATTCTC AATCACTACA CAATTATGCC TAAATTTTCC CCCGAAGAAT 20185 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCATCATCTT CCAAATTCTC AATCACTACA CAATTATGCC TAAATTTTCC CCCGAAGAAT 187 CGTATTTCAC TCGTCATTTG TTCTTCTTCT ACTTCGCCCC CTCCGCCATC ATCGTCGTCG 20125 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CGTATTTCAC TCGTCATTTG TTCTTCTTCT ACTTCGCCCC CTCCGCCATC ATCGTCGTCG 247 CCCCAAGTCA GTGAACCCAC TCCCACTGCT GAGTCCTGTG TCAATGCCGG CCTCGACCTC 20065 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCCCAAGTCA GTGAACCCAC TCCCACTGCT GAGTCCTGTG TCAATGCCGG CCTCGACCTC 307 TTCTCTAAAG GACGGGTATT ATCTTAATTC TGTTTCAATT TCATGGCAGA AGAAATTCCT 20005 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCTCTAAAG GACGGGTATT ATCTTAATTC TGTTTCAATT TCATGGCAGA AGAAATTCCT 367 TAATTATTAT GTGATTGCAT TGTTTTTCAG GTCTATTATA AACAATGATA AGGTCTGCAT 19945 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TAATTATTAT GTGATTGCAT TGTTTTTCAG GTCTATTATA AACAATGATA AGGTCTGCAT 427 ATACATTATG CTTTATGAAA CTCCTCAGTA TTGTTGTTGT ATTGTTATTT TTGTTGATTT 19885 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATACATTATG CTTTATGAAA CTCCTCAGTA TTGTTGTTGT ATTGTTATTT TTGTTGATTT 487 AGACAGAGAG TGTGTCGATT GAGGTTCGTG ACCATAAAAT TTTTGTTTAT TTATATATAT 19825 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGACAGAGAG TGTGTCGATT GAGGTTCGTG ACCATAAAAT TTTTGTTTAT TTATATATAT 547 TTATTTTTTT AACAAAGATA TACTTCAATT GTAGCTACGC CACTAGTTGT AGATATCAAA 19765 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTATTTTTTT AACAAAGATA TACTTCAATT GTAGCTACGC CACTAGTTGT AGATATCAAA 607 TAAGTTAAGT AAGTAAATAA AAGGGATTCG TTTTAATTTA TGCAATTTAA TTTGATAAAT 19705 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TAAGTTAAGT AAGTAAATAA AAGGGATTCG TTTTAATTTA TGCAATTTAA TTTGATAAAT 667 GACAATAGAG GTTTTGATTT CGAGTTTTAA TATCCCCATC ATTTGATCAT TTGCAGTCAT 19645 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GACAATAGAG GTTTTGATTT CGAGTTTTAA TATCCCCATC ATTTGATCAT TTGCAGTCAT 727 ATTATTCCCT TCTATTTGTT ACCTAATTTT CCTCAAGTGT TCTCCAAGAG TTTGTTGTTG 19585 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTATTCCCT TCTATTTGTT ACCTAATTTT CCTCAAGTGT TCTCCAAGAG TTTGTTGTTG 787 GGTATTTGAC CTGAGTTTCT TAGCATAAAT AGGTTATAAA GTTTCCGTTT TTGAGTAGTC 19525 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGTATTTGAC CTGAGTTTCT TAGCATAAAT AGGTTATAAA GTTTCCGTTT TTGAGTAGTC 847 ATTTTCTTGT AGAAAAAAGT TTAGTACTCT ATAAATTGAG GCTCGTTCCT TCTAACTTAA 19465 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTTTCTTGT AGAAAAAAGT TTAGTACTCT ATAAATTGAG GCTCGTTCCT TCTAACTTAA 907 TCAACATTCA CAATGTAGTC ATA-GGGGGC TTTGAGAGTT TGGTTAGGGC AAGTGTTCTC 19406 |||||||||| |||||||||| ||| |||||| ||||| |||| ||||||||| ||||||||| TCAACATTCA CAATGTAGTC ATAGGGGGGC TTTGAAAGTT TGGTTAGGGG CAGTGTTCTC 967 CAAGAATCTG CAACTTGAGC AAGTTTACTA GTTTGACATG TCT-ATTCAA AATTTTCCTG 19347 |||||||||| ||| |||||| ||||||||| |||||||||| ||| |||||| | ||||| || CAAGAATCTG CAAATTGAGC CAGTTTACTA GTTTGACATG TCTAATTCAA ATTTTTCTTG 1027 TTTTGAGTAA ATCTT-ATGG TAACAG-TTG TACTCAGAGG TGTATCTAGG -ATTTTTAGT 19290 |||||||||| ||||| | || |||||| ||| |||||||||| ||||||||| ||||||||| TTTTGAGTAA ATCTTAAGGG TAACAGTTTG TACTCAGAGG GGTATCTAGG AATTTTTAGT 1087 TTATG-AGTT TTGAA 19276 ||| | |||| ||||| TTAGGAAGTT TTGAA 1102 hqPGS_C09HBa0099P03.1-2-_SGN-U338003+ (20304 19276) ******************************************************************************** EST sequence 9 +strand 563 n (File: SGN-U316772+) 1 GGCGATTGGC GGCCAATCGG CCGAATTCCT CTTCGCCATC CGCAGCTCAT TCTAGAACAG 61 TCAGGTGATT TTGCAGAATC GAAGTCTTCG TCTCGAATAT CTCTTTGCCA TGTCGGACGA 121 GGAAGTTGTT GACCCAAAGG CGACATTAGA AGTAAGTTGC AAGCCTAAGT GTGTAAGGCA 181 ACTAAAGGAG TATCAGGCAT GTACTAAAAG GATAGAAGGT GATGAATCAG GGCACAAACA 241 TTGCACTGGA CAGTATTTTG ATTATTGGCA CTGCATCGAC AAATGTGTTG CTGCGAAGTT 301 GTTTGACCAT CTCAAGTAAC AAGGATATAA GTTGTTGATC CCTTGCAATT TATCTTCTTT 361 TTGGTTGTTG AACAAGTCAT TACCATATTA TTCCTCACTG TGCTGAAGAC TTGTAACCCT 421 TTCAATCAAC TTGGTTGCTG CATGGAAAAT TTTGAACTAT GCACATCTTA AAAAGTGATT 481 AATAAATCAT ACTCGTGGGT TGAATTGGAC CCTTTTATTC GTTCGAAAAA AAAAAAAAAA 541 AAAAAAAAAA AAAAAAAAAA AAA Predicted gene structure (within gDNA segment 23515 to 31951): Exon 1 24375 24412 ( 38 n); cDNA 27 64 ( 38 n); score: 1.000 Intron 1 24413 24589 ( 177 n); Pd: 0.978 (s: 0), Pa: 0.916 (s: 1.00) Exon 2 24590 24636 ( 47 n); cDNA 65 111 ( 47 n); score: 1.000 Intron 2 24637 27190 (2554 n); Pd: 0.987 (s: 1.00), Pa: 0.998 (s: 1.00) Exon 3 27191 27275 ( 85 n); cDNA 112 196 ( 85 n); score: 1.000 Intron 3 27276 30523 (3248 n); Pd: 0.656 (s: 1.00), Pa: 0.996 (s: 1.00) Exon 4 30524 30613 ( 90 n); cDNA 197 286 ( 90 n); score: 1.000 Intron 4 30614 30740 ( 127 n); Pd: 0.940 (s: 1.00), Pa: 0.995 (s: 1.00) Exon 5 30741 30980 ( 240 n); cDNA 287 526 ( 240 n); score: 1.000 PPA cDNA 527 563 MATCH C09HBa0099P03.1-2+ SGN-U316772+ 1.000 500 0.888 C PGS_C09HBa0099P03.1-2+_SGN-U316772+ (24375 24412,24590 24636,27191 27275,30524 30613,30741 30980) Alignment (genomic DNA sequence = upper lines): TCCTCTTCGC CATCCGCAGC TCATTCTAGA ACAGTCAGGT TTACCCGTTG CTCCTCTTCT 24434 |||||||||| |||||||||| |||||||||| |||||||| TCCTCTTCGC CATCCGCAGC TCATTCTAGA ACAGTCAG.. .......... .......... 64 CTGCATTTTG ATCTGTTTAC TCTTTACTTT TTTCTTCCGA TCTGCATATT TAGGTTTTGT 24494 .......... .......... .......... .......... .......... .......... 64 TATTCAATAT TTTTGTTTAT TTTGTACATA TGGAGGATTT TTGGTGTTTC AGCTGGTTAA 24554 .......... .......... .......... .......... .......... .......... 64 TTTTGTATTA ATTAGTATAA AAGTTGCACT TGCAGGTGAT TTTGCAGAAT CGAAGTCTTC 24614 ||||| |||||||||| |||||||||| .......... .......... .......... .....GTGAT TTTGCAGAAT CGAAGTCTTC 89 GTCTCGAATA TCTCTTTGCC ATGTGAGTAA CTGTTGATCT ATTGATAATG AATCTATTCC 24674 |||||||||| |||||||||| || GTCTCGAATA TCTCTTTGCC AT........ .......... .......... .......... 111 CTCTTCATTA GTAATTTGAT TTTAGTTTTA CTTTTGATGT GATTTATTTG TCTCATTTAG 24734 .......... .......... .......... .......... .......... .......... 111 TTTGTAAAAT CGATTCACTC TACTAGTAAG TTACAAATAG TATGTAACTT GTGTTCTCCA 24794 .......... .......... .......... .......... .......... .......... 111 TACCATATGT CTTAAATAGT TTTTTCGATT CTAAACTGTG GGTGTGCGGG TTTTAGGTTG 24854 .......... .......... .......... .......... .......... .......... 111 TTAAAAATTG AGATTGCAAT AAAAATAATG CAGAAGACTT TGATCTGGGG CAATTAAAAA 24914 .......... .......... .......... .......... .......... .......... 111 GATAACAAAC TCGTCATCAT TAGTTCATCT TTATTTTTCT AAAGAGCGAA GTGCTTGACT 24974 .......... .......... .......... .......... .......... .......... 111 TAACCAACTT GTGGTTTGTT CTTGATTGGA ACTATAAGTT GAGAGAAGTG AAGTTTTTCG 25034 .......... .......... .......... .......... .......... .......... 111 GAAATTTAAT AATATTATAA AATTTTGTGA TTTTTTCCCA TTGTTTATAT AATATAGCAA 25094 .......... .......... .......... .......... .......... .......... 111 AATCAACAGT TTGATTGTAC TGATTCGTTG TTAGTTTTCC AATGTGCAAA TACTTACCAG 25154 .......... .......... .......... .......... .......... .......... 111 GTTAATACAT TTATTTAAGG CTTTACTAGT GATCTAATAT TCCTAAGTAA GAAAAACAAC 25214 .......... .......... .......... .......... .......... .......... 111 ATACCCCGTG TAATCCCCTA AGTAGTGTAA TCCCATGGGT GTTGTCTGGG AGGGTGATGT 25274 .......... .......... .......... .......... .......... .......... 111 TTACACAACC TTATCAAAGC AAGTATATAA GGGAAATACA ATAGCAAAAA AAGTCATGCT 25334 .......... .......... .......... .......... .......... .......... 111 GAAAAACAAT TAAGAAGAGA AAAAGTGACA ACAACAATCA ATATGATAAC TCAAGCAAAG 25394 .......... .......... .......... .......... .......... .......... 111 GAAATAACAA TAATACCACA ATTGAAAAAT AAAATAGTAG GAAAGTAATA GGAACAATAC 25454 .......... .......... .......... .......... .......... .......... 111 GGATAAGGAA ATTAGACAAC GTTAGACCAC CTACTTACCT TCTGCCCTTA ATCTCATATC 25514 .......... .......... .......... .......... .......... .......... 111 TTCCTATCTA GAGACATGTC ATGTTCTTTG TAAGCAGTAG AAGTGCCAAG CCTAAGTAGA 25574 .......... .......... .......... .......... .......... .......... 111 TGGATGAATT ATGTTCTCTG TCATGGCTGG AAATTGTTTA AGTAGAGAGT GAGTGTGAGT 25634 .......... .......... .......... .......... .......... .......... 111 ATCCATACCC GAACCTATCC TGTTATATTT GGATTGTTAC TAGTTTTAGG ACTTCACCAT 25694 .......... .......... .......... .......... .......... .......... 111 AGCCTTCAAC AATTAGAAGC TTAACTTACT TTACCTTTAA TCCTGCACCT CGCGTTCATG 25754 .......... .......... .......... .......... .......... .......... 111 ATATCTATTC AACCGTGCGT CAGAGACACG GTCGAAGCAC GCCCGTCCAG TGTGCTTCTT 25814 .......... .......... .......... .......... .......... .......... 111 TCACGACATT CTGTAGTATA ACTATCAGCT TATACGTTGT GACTGTATTG ACTAACCTGA 25874 .......... .......... .......... .......... .......... .......... 111 TGTTATCTGT AAATGTTAGA AGTCAATTCT TTTTCAAGTT TCCACTTTCA ATATCAATGT 25934 .......... .......... .......... .......... .......... .......... 111 CATCGGGGTA AAATCTCTCA ACCATGGTAG CATGCTCCCA TTTCTTTCTT GGCATAGAAC 25994 .......... .......... .......... .......... .......... .......... 111 TTAAAATTTC AACTTGTTCT AAACGCGTAG CCATTTATTT TTGAAAATGA AACATACTTA 26054 .......... .......... .......... .......... .......... .......... 111 ATAGATTTTT TAACAATATG CACAGAAGGG TGCCACAAAC TCCAGCGCAT CTGTATGTAT 26114 .......... .......... .......... .......... .......... .......... 111 GTATGTATGT ATGTATACAT ATATCAGGAA ATATGGTGCA CTGCTACTTA CTGAAAATAC 26174 .......... .......... .......... .......... .......... .......... 111 AACAAGTTCT ACTTAATTTC ATTTATTAAA TAGAAGTGCT TTGAGATAAT CCTTATCCTT 26234 .......... .......... .......... .......... .......... .......... 111 CCTCTGATGC AACTGCTTAT TTTTAAACGT ATCCTTTCTT TGGTGCAACT TCATGTATTA 26294 .......... .......... .......... .......... .......... .......... 111 AAAAATACTT TCATGTCTCA ATTTTAATAT TTTGGACACA GCTGCATATT ATTCTATAGA 26354 .......... .......... .......... .......... .......... .......... 111 TATTATTATA TTTATTGGTT TTTAGTGAAA AGCAACTTTT CTAGTGTACA CTCCATGTTG 26414 .......... .......... .......... .......... .......... .......... 111 AAGCTTCCGA TAAATCTATT CTGTCAGAGT CTTCTTCTTC TTGGTGCTTT CCTGATAACT 26474 .......... .......... .......... .......... .......... .......... 111 AGTGTCAGAA GAATCTTATA TTATTCTTAG AGCCTGTTTG AATTGCTTAG TTGATGTGCT 26534 .......... .......... .......... .......... .......... .......... 111 TCTAAGCGCT TTTGTTGTGT TTCAGTAAAA TAGTCCGTCA GTTAGAATTC CTAAATTGTG 26594 .......... .......... .......... .......... .......... .......... 111 ATTTTTGGCT TATAAGACTA AGCCCAAACA AACAGGCTCT TAGACTAATC ATTAAAATGA 26654 .......... .......... .......... .......... .......... .......... 111 TTTACAAATA TTCAGCCGTT GTTAAAATGG CAGTTAGACT GCGTTTTGGT AATTAATGAC 26714 .......... .......... .......... .......... .......... .......... 111 AAGTGACAAC TCAGTCACTA CTGAATTTGT TTTGGTAAGA TACGTGAAAA AGCTTGAATG 26774 .......... .......... .......... .......... .......... .......... 111 CAAATTTGAT TAGTCTACAA TCCCCTAATA CCACTTCTCT AGTTGTATCT GTAATTTTAT 26834 .......... .......... .......... .......... .......... .......... 111 ATGGGCAAGT AATTTTTGAA GTATGCTTTA CTCTTCAAGT TTTGGTGTAT ATTGTGATTT 26894 .......... .......... .......... .......... .......... .......... 111 ACATTCAGGA TATTTTAATT TACACCCAAA AAAATAATTC TGCATGAAAG AAAAAAATAC 26954 .......... .......... .......... .......... .......... .......... 111 TTCTCTAGTT GTATCTGTAA TTTTATATGG GTAAGTAATT TTTGAAGTAT GCTTTACTCT 27014 .......... .......... .......... .......... .......... .......... 111 TCAAGTTTTG GTGTATATTG TGATTTACAT TCAGGATATT TTAATTTACA CCCAAAAAAA 27074 .......... .......... .......... .......... .......... .......... 111 GTAATTCTGC ATGAAAGAAA AAAAATACTT GTAGGATAGT TTCTTCCAAG TTTGAGCTTT 27134 .......... .......... .......... .......... .......... .......... 111 TGCTAATATT ATCATCTCTT AGAAGATATA TTGCTGATTT TTGTTTCTTG TTCAAGGTCG 27194 |||| .......... .......... .......... .......... .......... ......GTCG 115 GACGAGGAAG TTGTTGACCC AAAGGCGACA TTAGAAGTAA GTTGCAAGCC TAAGTGTGTA 27254 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GACGAGGAAG TTGTTGACCC AAAGGCGACA TTAGAAGTAA GTTGCAAGCC TAAGTGTGTA 175 AGGCAACTAA AGGAGTATCA GGTGATTCCC AATGCAATGA AATAGCCTTT CAGATATTTC 27314 |||||||||| |||||||||| | AGGCAACTAA AGGAGTATCA G......... .......... .......... .......... 196 AAAAGCTTTT GGTTCTTATT TTAATTATTG GAACTATTAC TTATCTCCTT TATTTTGGGT 27374 .......... .......... .......... .......... .......... .......... 196 AGTGTGTAAG GCAACTAAAG GGGTATCAGT TGATTCCCAA TGCAATGAAA TAACCTTTCA 27434 .......... .......... .......... .......... .......... .......... 196 GTTATTTCAA AAGCTTTTGG TTCTAATTTA ATTATTGGAA CTTTTATCTC CTTTATTTTG 27494 .......... .......... .......... .......... .......... .......... 196 GGTGTTCAAT ACTTCCACTG ATTAAACCAA TATCAGGATG AGCTATGACA TCGTTTATGT 27554 .......... .......... .......... .......... .......... .......... 196 TAGTGAAAGG CTATTTCTTT TTTTGTAATT TTTAGCATCA TCATACACTG TTTCATAGAT 27614 .......... .......... .......... .......... .......... .......... 196 GGGTATAGCA GTTGAATCTT CCTTTAGTTT GAGAATTCGA GACATTCTAG TGATAAGTGC 27674 .......... .......... .......... .......... .......... .......... 196 CTTCAAATGG AACACAATGG TATGTTTCTT ACCAATGCAT CTGATCTTTT CCTACAGTTC 27734 .......... .......... .......... .......... .......... .......... 196 AAAATTCAAT GAAAAAAATT TGAAAATGAG AGATTTTCTG AAAAATTAGT AAAAAGTAGA 27794 .......... .......... .......... .......... .......... .......... 196 ATGAAAGACA ATCCTATATT GACAAGGACA AACTTTTTTA CAATTTAGTA ATGCCCACAC 27854 .......... .......... .......... .......... .......... .......... 196 TCTTTCTTGT GTATCAAAGA TCATCGAAAT AAGCAACAAA ACTATTTCAA TAAAAAAATA 27914 .......... .......... .......... .......... .......... .......... 196 CTATTTACAA GGATTTCAAT CAAGATACCT TAGCAAGATT CTAGTTCTTT CGTAAGTTCT 27974 .......... .......... .......... .......... .......... .......... 196 TGTATTTAGT GTTGTGGGAA TTTATCTCGT CGTCTTTTTG GGGTTGCTTT TTTCAATAAA 28034 .......... .......... .......... .......... .......... .......... 196 TACATTTCTT TACATAAGTT GTATTAGTAA TCTGGGTTTA TGGATGACTG CTCTTGGTAT 28094 .......... .......... .......... .......... .......... .......... 196 TGTTAGTCTA GCCTTAAACC ATCGTTTATA TAGAGACATC AACATAATGT TTTAGCTTGG 28154 .......... .......... .......... .......... .......... .......... 196 TGGAAATGAA TTCTTTTTCC TCAGGATCCT ATTAGATAGA AACAAAGAAC AAAATAACTA 28214 .......... .......... .......... .......... .......... .......... 196 GAAAGGTTGT AAGAATACCC CTCTTCTAGA TGGATCGTCT ACAAAGCTAT TCGTTTAAAG 28274 .......... .......... .......... .......... .......... .......... 196 TGTATTCAGA TCAAAAGCTC GAATGGCAAA AGAGGGAGGA TTTGTGGCTT AGATCGTAGG 28334 .......... .......... .......... .......... .......... .......... 196 TTCAAGCTCC ACACCATGCA AAAGGAAGCC CCGTATTTAA GTGGTGAAGG GTAGAGTAAT 28394 .......... .......... .......... .......... .......... .......... 196 GATCATATGT CTGTAGTGCA TTGTATGTCC CATGATAGGT CCCATTCTTC TTCTCTTCTC 28454 .......... .......... .......... .......... .......... .......... 196 CAAACAAAGT CTATGATTAT TCATATCTAT GTTACCTTGA TATCAAAGAA GAGTTCAATC 28514 .......... .......... .......... .......... .......... .......... 196 AATCCTATGT TTCCATTCCA ACCTTCCCAT TTATCTGACT GTTGCTGAGC AGTTTATGGA 28574 .......... .......... .......... .......... .......... .......... 196 TATCACACGG ACAGCCCCAG ATGGTAAATT TTAATGTGGA CAAATTTTCG TTGATCATGC 28634 .......... .......... .......... .......... .......... .......... 196 GACACTCGGA TTTGAACAGG GAGAAAAGGA TTTGCAGTCC TCGCAGCTCT GGGCTCGCTG 28694 .......... .......... .......... .......... .......... .......... 196 TCGTAAGTGG GCCTGTTTAT GTCAACTACC CTCTAGGTTT TTTAAGTAGA AGACAACAAA 28754 .......... .......... .......... .......... .......... .......... 196 CTTATTAAAT CTTTTTTTCT GCCTCAATTT TTTTTAATCA GTTCTTAGTT CACTTTTCCC 28814 .......... .......... .......... .......... .......... .......... 196 TTTTCCCTTA AAACCATTTC TTCTGAACTA ATTGAAATAC ACATAAATTT TTCCAAGTAA 28874 .......... .......... .......... .......... .......... .......... 196 GAACCACAAA ACCATTATTT TCCATAAATC AGACCTAAAA CAATCCTCCC TAAAACATTG 28934 .......... .......... .......... .......... .......... .......... 196 AAAACAAAGT GTTTTCTTAC AAAAATACAA ATCTGGAACT AAGTAAGTTA ATACCTTTTC 28994 .......... .......... .......... .......... .......... .......... 196 TATCTTTCAT GAATCTAGTG TAGTTTCATA TTTTATTTCT TCTGTACTGT AATTATGATC 29054 .......... .......... .......... .......... .......... .......... 196 TTCACAAGAA CTCAGAGTGG CACAAAAATG TCTTGCATTA TGTTATGTTA TGTTAGACCT 29114 .......... .......... .......... .......... .......... .......... 196 GAAAATTTTA TATTTGGAAA ATATCAAATA TTATGAACAT AATATGTACT GATTCCAGGA 29174 .......... .......... .......... .......... .......... .......... 196 CCTACTACTC TAAGTATGTA CTGATTCCAG GATCTACTTC AAATCAATTA AAATAGCAGG 29234 .......... .......... .......... .......... .......... .......... 196 TCAAGAGAAT TTTCTTTGCA TGTTACTCTG ATGGGATGAA CCAATATAGG GACTTCATCT 29294 .......... .......... .......... .......... .......... .......... 196 GTGATTATTT GGTCTCTCTA AGAAGCCCCC CAAAAAAACA TGTTTCTGGC CTCCAAATAC 29354 .......... .......... .......... .......... .......... .......... 196 TTCTCGCTCT TTAGATTAGG CACTCTAGAT TTCAGCTGGT TAGCGGTTGC TGGTATTTGT 29414 .......... .......... .......... .......... .......... .......... 196 ATGATTTTTT GTTTCTTTGT TTTGTGTGTA GCATGTTCTT AGATCCAAGA GGAATTGCAA 29474 .......... .......... .......... .......... .......... .......... 196 GACATCAATC AGATTTTCTC TTCAACCTTG CCTTTTATTC TGTTTGATTG CTTTCTATGG 29534 .......... .......... .......... .......... .......... .......... 196 GGTCCTTTTG TTGTTTGTAT CGCCAACAGT TTGCTGTAAG AGTTTTTGAT GATGAGGAGT 29594 .......... .......... .......... .......... .......... .......... 196 GTTACTTCAA GGTTTTCTTT CTCCAAGACT CTGATTTCTA ATATAATTTT GATGCCATTA 29654 .......... .......... .......... .......... .......... .......... 196 AACGTTTTCC AATTTATTTT TTATTTTTCT ACTTGTTCTT CATATACTAC CTCTGCAAAC 29714 .......... .......... .......... .......... .......... .......... 196 ATCTTTGTAA CCAACTTAAT TATTTGTGCA ACGAATGTGC TTTGGTTGGG GAGTCTGAGT 29774 .......... .......... .......... .......... .......... .......... 196 TTCTAACTTT TGGAAGGATT TTGGTCTCTC TTTCTGCGAG AAGTGAAAGA TTTTTTTTTT 29834 .......... .......... .......... .......... .......... .......... 196 TGGAAGGATT CCAGAACAAT GATGTCTCGT TTAGTTGTGC TAACTGTGCA CCTCATATAA 29894 .......... .......... .......... .......... .......... .......... 196 CAATACAGTA ACATATTTGA TCTTCAGAAC AATTGCGCAT GATCTGTTTT TTTTTTTAAT 29954 .......... .......... .......... .......... .......... .......... 196 TTTAACTAAT GGATGTGGAA GTATTGCACC TTAGAATGAA GGTGATATTC CTTCACTAAC 30014 .......... .......... .......... .......... .......... .......... 196 CTCCCTGGGA AGTATATATT ACTTTCATCA AGATATACAT GTATATTTTG TGTTTTTGTT 30074 .......... .......... .......... .......... .......... .......... 196 TTGTCCGGCA GAGGGGACTT CATTTGTGGA TTAAGTTTGA TGGATATTTA GTCCACGAGG 30134 .......... .......... .......... .......... .......... .......... 196 TGTATCATTT TCTATCTTGA AAAACACATT AAGTGTTAGT TAGGGTTGGT TATTTATCGG 30194 .......... .......... .......... .......... .......... .......... 196 GTGCACATCG TTCCCACTGC TCTATGATGG TTAACTGTTG ACCTGCTTTA TTTATTGATC 30254 .......... .......... .......... .......... .......... .......... 196 TGACAATTCT TTTTGGCAAA TGTCACATTA ATGTTTATTA AACAGGACAA GATTACTTGC 30314 .......... .......... .......... .......... .......... .......... 196 ACTTTGTCAT TAGTTGCAGT CCCAAAAACT TTTAACTTGC AATATTTGAT TATTTTCATC 30374 .......... .......... .......... .......... .......... .......... 196 TTTCTGTTTG TCATATCCCC CTTCTTTCCC TTTTTCCTGT CTTTTGGCGG GGTGGGGGTG 30434 .......... .......... .......... .......... .......... .......... 196 GATAATGAAT TTCTGAAGAA GGCCAGCTTT GTTGCTTCAA AATTAGTAGT GGTTTTCTTA 30494 .......... .......... .......... .......... .......... .......... 196 CTTTATGGGT CTCATTCTGC ATCCTACAGG CATGTACTAA AAGGATAGAA GGTGATGAAT 30554 | |||||||||| |||||||||| |||||||||| .......... .......... .........G CATGTACTAA AAGGATAGAA GGTGATGAAT 227 CAGGGCACAA ACATTGCACT GGACAGTATT TTGATTATTG GCACTGCATC GACAAATGTG 30614 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| CAGGGCACAA ACATTGCACT GGACAGTATT TTGATTATTG GCACTGCATC GACAAATGT. 286 TAAGCTCCCT TTTCCGGTTG TTTGGCATTT TTGTGTTGGG GTTTTCAGGG TGTGCTGCCT 30674 .......... .......... .......... .......... .......... .......... 286 CTTGATTCTT TTCTCCGTTG TTGTCGTTTG TTGTTTTCTT ACCTTTTCGT GGAACTGTTT 30734 .......... .......... .......... .......... .......... .......... 286 GCACAGGTTG CTGCGAAGTT GTTTGACCAT CTCAAGTAAC AAGGATATAA GTTGTTGATC 30794 |||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ......GTTG CTGCGAAGTT GTTTGACCAT CTCAAGTAAC AAGGATATAA GTTGTTGATC 340 CCTTGCAATT TATCTTCTTT TTGGTTGTTG AACAAGTCAT TACCATATTA TTCCTCACTG 30854 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCTTGCAATT TATCTTCTTT TTGGTTGTTG AACAAGTCAT TACCATATTA TTCCTCACTG 400 TGCTGAAGAC TTGTAACCCT TTCAATCAAC TTGGTTGCTG CATGGAAAAT TTTGAACTAT 30914 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGCTGAAGAC TTGTAACCCT TTCAATCAAC TTGGTTGCTG CATGGAAAAT TTTGAACTAT 460 GCACATCTTA AAAAGTGATT AATAAATCAT ACTCGTGGGT TGAATTGGAC CCTTTTATTC 30974 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCACATCTTA AAAAGTGATT AATAAATCAT ACTCGTGGGT TGAATTGGAC CCTTTTATTC 520 GTTCGA 30980 |||||| GTTCGA 526 hqPGS_C09HBa0099P03.1-2+_SGN-U316772+ (24375 24412,24590 24636,27191 27275,30524 30613,30741 30980) ******************************************************************************** EST sequence 11 +strand 694 n (File: SGN-U316773+) 1 GGGCGATTGG CGGCCAATCG GCCGAATTGC TCTTCTGTCT TCTCTTCACC ATTCGCACCT 61 CATTCAGAGA AAAGTCAGGT TTTTTTGTAG AATCAAAGTC TTCGTATCCA ATATCTCGTT 121 GCCATGTCGG ACGAGGAAGT TGTTGACCCA AAGGCGACAA TGGAAGTATC TTGCAAGCCT 181 AAGTGTGTAA GGCAACTAAA GGATTATCAG GCATGTACTA GAAGGATAGA AGGTGATGAA 241 TCAGGGAGCA AGCATTGCAC TGGACAGTAT TTTGATTATT GGCAATGCAT TGACAAATGT 301 GTTGCCCCAA AGCTATTTGA AAAACTCAAG TAACATGGAG ATAAGTGTTC ATCCATTACG 361 ATTTTATCTT GCTTTTTCAT TGTTGAAGCC TGTGCTACAG AATTGCAATC CCTTCAATCA 421 ACTTAGTTGC CGCATGGAAA ATTTTGTACT ATGCACATTA TTAAGAAGTG ATTAATAAAG 481 GATACTTATG GGTTAAATTT TGTGGACACT TTTATTTGGT CAGATGATTC AAAATCCGGA 541 GGATATTATT CCCAGATTTT TGTTCTCTCT TGCGTTGTTA TATGAGCTCC CAGTTACCTA 601 ATTTCTTTAT GGGAGGAAAT ATCATAAATC AGTGATTTAT ATAAAAAAAA AAAAAAAAAA 661 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAA Predicted gene structure (within gDNA segment 25341 to 36507): Exon 1 27191 27275 ( 85 n); cDNA 126 210 ( 85 n); score: 0.941 Intron 1 27276 30523 (3248 n); Pd: 0.656 (s: 0.92), Pa: 0.996 (s: 0.92) Exon 2 30524 30613 ( 90 n); cDNA 211 300 ( 90 n); score: 0.933 Intron 2 30614 30740 ( 127 n); Pd: 0.940 (s: 0.94), Pa: 0.995 (s: 0.78) Exon 3 30741 30832 ( 92 n); cDNA 301 395 ( 95 n); score: 0.685 Intron 3 30833 30862 ( 30 n); Pd: 0.000 (s: 0.61), Pa: 0.000 (s: 0.61) ?? Exon 4 30863 31010 ( 148 n); cDNA 396 553 ( 158 n); score: 0.655 Intron 4 31011 35440 (4430 n); Pd: 0.000 (s: 0.74), Pa: 0.558 (s: 0) Exon 5 35441 35445 ( 5 n); cDNA 554 558 ( 5 n); score: 0.800 PPA cDNA 644 694 MATCH C09HBa0099P03.1-2+ SGN-U316773+ 0.781 420 0.605 C PGS_C09HBa0099P03.1-2+_SGN-U316773+ (27191 27275,30524 30613,30741 30832,30863 31010,35441 35445) Alignment (genomic DNA sequence = upper lines): GTCGGACGAG GAAGTTGTTG ACCCAAAGGC GACATTAGAA GTAAGTTGCA AGCCTAAGTG 27250 |||||||||| |||||||||| |||||||||| |||| | ||| ||| ||||| |||||||||| GTCGGACGAG GAAGTTGTTG ACCCAAAGGC GACAATGGAA GTATCTTGCA AGCCTAAGTG 185 TGTAAGGCAA CTAAAGGAGT ATCAGGTGAT TCCCAATGCA ATGAAATAGC CTTTCAGATA 27310 |||||||||| |||||||| | ||||| TGTAAGGCAA CTAAAGGATT ATCAG..... .......... .......... .......... 210 TTTCAAAAGC TTTTGGTTCT TATTTTAATT ATTGGAACTA TTACTTATCT CCTTTATTTT 27370 .......... .......... .......... .......... .......... .......... 210 GGGTAGTGTG TAAGGCAACT AAAGGGGTAT CAGTTGATTC CCAATGCAAT GAAATAACCT 27430 .......... .......... .......... .......... .......... .......... 210 TTCAGTTATT TCAAAAGCTT TTGGTTCTAA TTTAATTATT GGAACTTTTA TCTCCTTTAT 27490 .......... .......... .......... .......... .......... .......... 210 TTTGGGTGTT CAATACTTCC ACTGATTAAA CCAATATCAG GATGAGCTAT GACATCGTTT 27550 .......... .......... .......... .......... .......... .......... 210 ATGTTAGTGA AAGGCTATTT CTTTTTTTGT AATTTTTAGC ATCATCATAC ACTGTTTCAT 27610 .......... .......... .......... .......... .......... .......... 210 AGATGGGTAT AGCAGTTGAA TCTTCCTTTA GTTTGAGAAT TCGAGACATT CTAGTGATAA 27670 .......... .......... .......... .......... .......... .......... 210 GTGCCTTCAA ATGGAACACA ATGGTATGTT TCTTACCAAT GCATCTGATC TTTTCCTACA 27730 .......... .......... .......... .......... .......... .......... 210 GTTCAAAATT CAATGAAAAA AATTTGAAAA TGAGAGATTT TCTGAAAAAT TAGTAAAAAG 27790 .......... .......... .......... .......... .......... .......... 210 TAGAATGAAA GACAATCCTA TATTGACAAG GACAAACTTT TTTACAATTT AGTAATGCCC 27850 .......... .......... .......... .......... .......... .......... 210 ACACTCTTTC TTGTGTATCA AAGATCATCG AAATAAGCAA CAAAACTATT TCAATAAAAA 27910 .......... .......... .......... .......... .......... .......... 210 AATACTATTT ACAAGGATTT CAATCAAGAT ACCTTAGCAA GATTCTAGTT CTTTCGTAAG 27970 .......... .......... .......... .......... .......... .......... 210 TTCTTGTATT TAGTGTTGTG GGAATTTATC TCGTCGTCTT TTTGGGGTTG CTTTTTTCAA 28030 .......... .......... .......... .......... .......... .......... 210 TAAATACATT TCTTTACATA AGTTGTATTA GTAATCTGGG TTTATGGATG ACTGCTCTTG 28090 .......... .......... .......... .......... .......... .......... 210 GTATTGTTAG TCTAGCCTTA AACCATCGTT TATATAGAGA CATCAACATA ATGTTTTAGC 28150 .......... .......... .......... .......... .......... .......... 210 TTGGTGGAAA TGAATTCTTT TTCCTCAGGA TCCTATTAGA TAGAAACAAA GAACAAAATA 28210 .......... .......... .......... .......... .......... .......... 210 ACTAGAAAGG TTGTAAGAAT ACCCCTCTTC TAGATGGATC GTCTACAAAG CTATTCGTTT 28270 .......... .......... .......... .......... .......... .......... 210 AAAGTGTATT CAGATCAAAA GCTCGAATGG CAAAAGAGGG AGGATTTGTG GCTTAGATCG 28330 .......... .......... .......... .......... .......... .......... 210 TAGGTTCAAG CTCCACACCA TGCAAAAGGA AGCCCCGTAT TTAAGTGGTG AAGGGTAGAG 28390 .......... .......... .......... .......... .......... .......... 210 TAATGATCAT ATGTCTGTAG TGCATTGTAT GTCCCATGAT AGGTCCCATT CTTCTTCTCT 28450 .......... .......... .......... .......... .......... .......... 210 TCTCCAAACA AAGTCTATGA TTATTCATAT CTATGTTACC TTGATATCAA AGAAGAGTTC 28510 .......... .......... .......... .......... .......... .......... 210 AATCAATCCT ATGTTTCCAT TCCAACCTTC CCATTTATCT GACTGTTGCT GAGCAGTTTA 28570 .......... .......... .......... .......... .......... .......... 210 TGGATATCAC ACGGACAGCC CCAGATGGTA AATTTTAATG TGGACAAATT TTCGTTGATC 28630 .......... .......... .......... .......... .......... .......... 210 ATGCGACACT CGGATTTGAA CAGGGAGAAA AGGATTTGCA GTCCTCGCAG CTCTGGGCTC 28690 .......... .......... .......... .......... .......... .......... 210 GCTGTCGTAA GTGGGCCTGT TTATGTCAAC TACCCTCTAG GTTTTTTAAG TAGAAGACAA 28750 .......... .......... .......... .......... .......... .......... 210 CAAACTTATT AAATCTTTTT TTCTGCCTCA ATTTTTTTTA ATCAGTTCTT AGTTCACTTT 28810 .......... .......... .......... .......... .......... .......... 210 TCCCTTTTCC CTTAAAACCA TTTCTTCTGA ACTAATTGAA ATACACATAA ATTTTTCCAA 28870 .......... .......... .......... .......... .......... .......... 210 GTAAGAACCA CAAAACCATT ATTTTCCATA AATCAGACCT AAAACAATCC TCCCTAAAAC 28930 .......... .......... .......... .......... .......... .......... 210 ATTGAAAACA AAGTGTTTTC TTACAAAAAT ACAAATCTGG AACTAAGTAA GTTAATACCT 28990 .......... .......... .......... .......... .......... .......... 210 TTTCTATCTT TCATGAATCT AGTGTAGTTT CATATTTTAT TTCTTCTGTA CTGTAATTAT 29050 .......... .......... .......... .......... .......... .......... 210 GATCTTCACA AGAACTCAGA GTGGCACAAA AATGTCTTGC ATTATGTTAT GTTATGTTAG 29110 .......... .......... .......... .......... .......... .......... 210 ACCTGAAAAT TTTATATTTG GAAAATATCA AATATTATGA ACATAATATG TACTGATTCC 29170 .......... .......... .......... .......... .......... .......... 210 AGGACCTACT ACTCTAAGTA TGTACTGATT CCAGGATCTA CTTCAAATCA ATTAAAATAG 29230 .......... .......... .......... .......... .......... .......... 210 CAGGTCAAGA GAATTTTCTT TGCATGTTAC TCTGATGGGA TGAACCAATA TAGGGACTTC 29290 .......... .......... .......... .......... .......... .......... 210 ATCTGTGATT ATTTGGTCTC TCTAAGAAGC CCCCCAAAAA AACATGTTTC TGGCCTCCAA 29350 .......... .......... .......... .......... .......... .......... 210 ATACTTCTCG CTCTTTAGAT TAGGCACTCT AGATTTCAGC TGGTTAGCGG TTGCTGGTAT 29410 .......... .......... .......... .......... .......... .......... 210 TTGTATGATT TTTTGTTTCT TTGTTTTGTG TGTAGCATGT TCTTAGATCC AAGAGGAATT 29470 .......... .......... .......... .......... .......... .......... 210 GCAAGACATC AATCAGATTT TCTCTTCAAC CTTGCCTTTT ATTCTGTTTG ATTGCTTTCT 29530 .......... .......... .......... .......... .......... .......... 210 ATGGGGTCCT TTTGTTGTTT GTATCGCCAA CAGTTTGCTG TAAGAGTTTT TGATGATGAG 29590 .......... .......... .......... .......... .......... .......... 210 GAGTGTTACT TCAAGGTTTT CTTTCTCCAA GACTCTGATT TCTAATATAA TTTTGATGCC 29650 .......... .......... .......... .......... .......... .......... 210 ATTAAACGTT TTCCAATTTA TTTTTTATTT TTCTACTTGT TCTTCATATA CTACCTCTGC 29710 .......... .......... .......... .......... .......... .......... 210 AAACATCTTT GTAACCAACT TAATTATTTG TGCAACGAAT GTGCTTTGGT TGGGGAGTCT 29770 .......... .......... .......... .......... .......... .......... 210 GAGTTTCTAA CTTTTGGAAG GATTTTGGTC TCTCTTTCTG CGAGAAGTGA AAGATTTTTT 29830 .......... .......... .......... .......... .......... .......... 210 TTTTTGGAAG GATTCCAGAA CAATGATGTC TCGTTTAGTT GTGCTAACTG TGCACCTCAT 29890 .......... .......... .......... .......... .......... .......... 210 ATAACAATAC AGTAACATAT TTGATCTTCA GAACAATTGC GCATGATCTG TTTTTTTTTT 29950 .......... .......... .......... .......... .......... .......... 210 TAATTTTAAC TAATGGATGT GGAAGTATTG CACCTTAGAA TGAAGGTGAT ATTCCTTCAC 30010 .......... .......... .......... .......... .......... .......... 210 TAACCTCCCT GGGAAGTATA TATTACTTTC ATCAAGATAT ACATGTATAT TTTGTGTTTT 30070 .......... .......... .......... .......... .......... .......... 210 TGTTTTGTCC GGCAGAGGGG ACTTCATTTG TGGATTAAGT TTGATGGATA TTTAGTCCAC 30130 .......... .......... .......... .......... .......... .......... 210 GAGGTGTATC ATTTTCTATC TTGAAAAACA CATTAAGTGT TAGTTAGGGT TGGTTATTTA 30190 .......... .......... .......... .......... .......... .......... 210 TCGGGTGCAC ATCGTTCCCA CTGCTCTATG ATGGTTAACT GTTGACCTGC TTTATTTATT 30250 .......... .......... .......... .......... .......... .......... 210 GATCTGACAA TTCTTTTTGG CAAATGTCAC ATTAATGTTT ATTAAACAGG ACAAGATTAC 30310 .......... .......... .......... .......... .......... .......... 210 TTGCACTTTG TCATTAGTTG CAGTCCCAAA AACTTTTAAC TTGCAATATT TGATTATTTT 30370 .......... .......... .......... .......... .......... .......... 210 CATCTTTCTG TTTGTCATAT CCCCCTTCTT TCCCTTTTTC CTGTCTTTTG GCGGGGTGGG 30430 .......... .......... .......... .......... .......... .......... 210 GGTGGATAAT GAATTTCTGA AGAAGGCCAG CTTTGTTGCT TCAAAATTAG TAGTGGTTTT 30490 .......... .......... .......... .......... .......... .......... 210 CTTACTTTAT GGGTCTCATT CTGCATCCTA CAGGCATGTA CTAAAAGGAT AGAAGGTGAT 30550 ||||||| ||| |||||| |||||||||| .......... .......... .......... ...GCATGTA CTAGAAGGAT AGAAGGTGAT 237 GAATCAGGGC ACAAACATTG CACTGGACAG TATTTTGATT ATTGGCACTG CATCGACAAA 30610 ||||||||| ||| ||||| |||||||||| |||||||||| ||||||| || ||| |||||| GAATCAGGGA GCAAGCATTG CACTGGACAG TATTTTGATT ATTGGCAATG CATTGACAAA 297 TGTGTAAGCT CCCTTTTCCG GTTGTTTGGC ATTTTTGTGT TGGGGTTTTC AGGGTGTGCT 30670 ||| TGT....... .......... .......... .......... .......... .......... 300 GCCTCTTGAT TCTTTTCTCC GTTGTTGTCG TTTGTTGTTT TCTTACCTTT TCGTGGAACT 30730 .......... .......... .......... .......... .......... .......... 300 GTTTGCACAG GTTGCTGCGA AGTTGTTTGA CCATCTCAAG TAACAAGGAT ATAAGTTGTT 30790 ||||| | | || | ||||| | |||||| ||||| ||| ||||| |||| .......... GTTGCCCCAA AGCTATTTGA AAAACTCAAG TAACATGGAG ATAAG-TGTT 349 GATCCCTTGC -AATTTATCT T-CTTTTTGG TTGTTGAA-C AAGT-CATTA CCATATTATT 30846 |||| || | | ||||||| | |||||| |||||||| | || | CATCCATTAC GATTTTATCT TGCTTTTTCA TTGTTGAAGC CTGTGC.... .......... 395 CCTCACTGTG CTGAAG-AC- ---TTGTAAC CCTTTCAATC AACTTGGTTG CTGCATGGAA 30901 || ||| || || ||||||| ||||| |||| | |||||||| .......... ......TACA GAATTGCAAT CCCTTCAATC AACTTAGTTG CCGCATGGAA 439 AATTTTGAAC TATGCACA-T CTTAAAAAGT GATTAATAAA TCATACTCGT GGGTTGAA-- 30958 ||||||| || |||||||| | |||| |||| |||||||||| ||||| | ||||| || AATTTTGTAC TATGCACATT ATTAAGAAGT GATTAATAAA GGATACTTAT GGGTTAAATT 499 -T-TGGACCC TTTTATTCGT TCGAATTATT GAGAATCTCC TGGATGTTAT TCCCTGATGG 31016 | ||||| | ||||||| | || || ||| | |||| |||| |||| |||| TTGTGGACAC TTTTATTTGG TCAGATGATT CAAAATCCGG AGGATATTAT TCCC...... 553 CTTCATGGGG GAAAACGTCT CGTCTTGTGC TCACTGAATC CTGCGCATAT ATTGATAAAT 31076 .......... .......... .......... .......... .......... .......... 553 GAGTTGCTTT GAGTAGACAG AAGTGCAAAT GCAGTCAGCT GCTTGTTCTG TCCACTGTAT 31136 .......... .......... .......... .......... .......... .......... 553 TTGTGCGTTT TTGTTTGCAC TTTACTCGTT ATACCGTATG TTCCTTCTTA TTTGGTCGAT 31196 .......... .......... .......... .......... .......... .......... 553 GGAGAGGTAA CAAGTATCGA GTGAAATGAT TATATTCTCA CTATGCTGCG GGAGGGGTAA 31256 .......... .......... .......... .......... .......... .......... 553 TAGCCATTAG AGATGAAGAC AGAGGAGACG TCTTATCCAA AATGAATCGG TTGTCTTGTC 31316 .......... .......... .......... .......... .......... .......... 553 TCAAAAATGG CTAAAAATAT GGTCATGCAA ATAGAAAGTT TAGCCTACGT ATTTACTTTA 31376 .......... .......... .......... .......... .......... .......... 553 TTGATGTTGG GAAGGAAAAT ATTCAAGTAA TGGTTGAAAA GTGCAAATTG ATAGGAAGTA 31436 .......... .......... .......... .......... .......... .......... 553 CATAAAACGG CCTAAGAAGT ACCAAATTTT CAATGGACAT CACCACCTTA CCCTCACATG 31496 .......... .......... .......... .......... .......... .......... 553 ATAGGCTTTC AAATGCAATT GTGGCAAATC AATAAATTAA ATTGTTTAAT CTCTAATTGA 31556 .......... .......... .......... .......... .......... .......... 553 TAGAATAATT TTAATCTTTG GACTATTGAT GTAATTTAAC ATCGTTAGGA CAATGCAAAT 31616 .......... .......... .......... .......... .......... .......... 553 GATTTTAGAA CGAGTGAGCC TACATTTTAA CAATCCGAAT ATGGGCACTA TTGTCTTTCA 31676 .......... .......... .......... .......... .......... .......... 553 TATCATGCTT TCTACTTTGG ATATCACACT TTTTACTTTG GAGATGGACT GATAACAACT 31736 .......... .......... .......... .......... .......... .......... 553 ACAATAGTCA TCAATCTGGT CTTTTAATAT CCGGTAAAGG ATTAATTTCA CAATGATATC 31796 .......... .......... .......... .......... .......... .......... 553 GTAACGTGTT CTTGATTATT TTAACCCCAT ATAATAAGAA GTTCTGAAAA AACTAGTTCT 31856 .......... .......... .......... .......... .......... .......... 553 AGAAAATTCG TTAGACGCGC AGGTCTTGTT GCTCATCGGG CATAAATTAG TTTTTACTTT 31916 .......... .......... .......... .......... .......... .......... 553 TTACTATTAA ATATAATTTT ATTATATTTG AAGACAACGA AATTAAATAT ATCACAAGTT 31976 .......... .......... .......... .......... .......... .......... 553 GTACCTCCCA GACAAAAGTT AATACATATA GATGAAACTT TTCTTGTCCA TCAAAGTTCC 32036 .......... .......... .......... .......... .......... .......... 553 TTAACTTTTC TTTTTCCTAC AAGTCAAAAA TTTAAAATAT TTTGAGATAG TGAACGTACT 32096 .......... .......... .......... .......... .......... .......... 553 CGATTGACCA TAAATTCGAC TTTATAGGCA ACACAAATTT TTTCTTGCAA TTTATTTTCA 32156 .......... .......... .......... .......... .......... .......... 553 ACTTAAAATT ATATTTTTAA GAATTATCGA AAATACACAA AATTGGACAC TTTTACATGA 32216 .......... .......... .......... .......... .......... .......... 553 CCGGATTGGT TTGGGTTTTT CAAATATTAA ATCAAATTAT TTGTGTCGGA TTTTTGAATT 32276 .......... .......... .......... .......... .......... .......... 553 TACAAACCAA ATCAAAGCAA TAAAACTCGA GGTTTTCAAC TTCGGATTTT TTCAGTAAAG 32336 .......... .......... .......... .......... .......... .......... 553 TATTCATACA AACATATAAT TTACTTGTAC TTTAAATATT TCTTTATTTC TACTAAAATG 32396 .......... .......... .......... .......... .......... .......... 553 CAACTATCTA AGTTATTTCT CAAGAAAATA ACACAAAATA TTATATGATT AATGACACTA 32456 .......... .......... .......... .......... .......... .......... 553 AAATATCCAA CAAAAAAATA AATAATAAAA TCGCGTAAAA CAATATTGCA AATTAATAAG 32516 .......... .......... .......... .......... .......... .......... 553 TCATAATGAA ATTGATTATA ATTTAAAGTA CTAAATCATG CTAAAAATAA GTTTAGTAAG 32576 .......... .......... .......... .......... .......... .......... 553 TACTAGTTAC ATGATTAAAT ATTAAAAGAA AGTAAAATTA GATCATGTAT TTTAATTGTC 32636 .......... .......... .......... .......... .......... .......... 553 TAAATCTATG TATAACTAAA AAATAAATAT TCAATATTAT TGTCATTCTT AGTGTTGAAT 32696 .......... .......... .......... .......... .......... .......... 553 TGATTTTCTT TTTGTATTAG TATTAATTTG ATTTTTATTT AAACTTTATT ATATTTACCA 32756 .......... .......... .......... .......... .......... .......... 553 ACATGTATGG ACTATAATCT TTATTAGATC ATTAAGAATT TTAACTTCCA AACATGAAAT 32816 .......... .......... .......... .......... .......... .......... 553 AAATATATTA AAAGATTAAA ACTATGTAAA AGCATAAGAG ATAATTTAAA AATTATATCA 32876 .......... .......... .......... .......... .......... .......... 553 AAGTAGTTAT TTTACGTATA AAATAAAAAT TTTAAAATTA TATATATAAT GTCGGGTTGG 32936 .......... .......... .......... .......... .......... .......... 553 TTTGGTCCCG GGTTGTGTTT CTTTTAGCTG AAATCAAACC AACCCAAATA TAATCGATTT 32996 .......... .......... .......... .......... .......... .......... 553 TTTTCAACAC TAAATCACTA GTCAATTTTT TTTTAAAGAT TTGACTCAAT TTACAATTCG 33056 .......... .......... .......... .......... .......... .......... 553 ATTCGATTTT GTACAACCTA CTCCATACCT CATTGAAAAA TTACATAAAT TAATATATTT 33116 .......... .......... .......... .......... .......... .......... 553 TAAAAAATAG TTACTGATTT TAGCGATATT TTTTGTTTAT TACTATTTAT AACAATATTG 33176 .......... .......... .......... .......... .......... .......... 553 TGATAAATCT GTAATATGTA TTAAAAATGA ATTATGTATG CAATATATTT GAATTATAAT 33236 .......... .......... .......... .......... .......... .......... 553 TATTTTTGAA ATATATTGTG TTTGTTTGGT AAAAATTATC ATATTGTATT ATAAATATAT 33296 .......... .......... .......... .......... .......... .......... 553 TAAAATATGT GATAAATGTA TTATTTATTA TTATAACTTG TTTTATAAAT AAACTAAAAA 33356 .......... .......... .......... .......... .......... .......... 553 TAATCAAATA AAAAAATATT ATTATTAATA TAAATGATAA ATATTTTTTT ATTATCATAT 33416 .......... .......... .......... .......... .......... .......... 553 ATGTACATAA GTTTTCTACT TCCTTTAATG GGTAAAGCTA GAAGACCAAT TCAAAAAGCC 33476 .......... .......... .......... .......... .......... .......... 553 CAATAAAAAA GCCTTTTTAA TTTAACTCTC TCTCTCTTCT GTCTTCTCTT CACCATTCGC 33536 .......... .......... .......... .......... .......... .......... 553 ACCTCATTCA GAGAAAAGTC AGGTTGCTCC TCTTCTACGC ATTTGATCTG CTTCCTCTTT 33596 .......... .......... .......... .......... .......... .......... 553 ATGTGTTCAT TCGATCTGCA TAATATTAGG TTTTGCTATT TTATCTTGTT TTGTTGCATT 33656 .......... .......... .......... .......... .......... .......... 553 CGCACACATG GAGAATTGTG TTTTTTTTGC TTGATAGCTG GTTAATTTTG TATTGATTAG 33716 .......... .......... .......... .......... .......... .......... 553 TGTAAAAATT GCATTTGCTG GTGAATTCGT AGAATCAATG TCTTCGTATC CAAAATTGAT 33776 .......... .......... .......... .......... .......... .......... 553 TAGTGTAAAA ATTGCATTTG CAGGTTTTTT TGTAGAATCA AAGTCTTCGT ATCCAATATC 33836 .......... .......... .......... .......... .......... .......... 553 TCGTTGCCAT GTGAGTACTG TTGATCTATT TATAATGAAA CTATTCTCTC TTTATTAAAG 33896 .......... .......... .......... .......... .......... .......... 553 TAATTAAATT TTGATGTGAT TTATGTGTTT CGTTTCGTTT GTATAATCCA TTTATTCTAA 33956 .......... .......... .......... .......... .......... .......... 553 TAGGATTTCA CTGTTTATGA GTAATTTATT CTCCTTACCA TATATGTGAG TGATTTTTAA 34016 .......... .......... .......... .......... .......... .......... 553 ATAGTTTTTC AATTATACAC TGTAGAAATT GAGATTCCCT AAATAGTTTA TATAGAATTT 34076 .......... .......... .......... .......... .......... .......... 553 TGATCTGGAA CAGTAAAAAA ATAGAAATAA AATCGTCATC TTTATTTTGT CCAAGAGCAA 34136 .......... .......... .......... .......... .......... .......... 553 AGTGTTAGTT GGACCCTCTA TGTTGAGAGA AATGAAGTTG ATGGAAATTC TATCTAGAAC 34196 .......... .......... .......... .......... .......... .......... 553 AGTAAAAAAA CACAAATAGA ATGGTCATCT TTATTCTGTC AAAGAGCAAA GTGTTAGTTG 34256 .......... .......... .......... .......... .......... .......... 553 GACCTCTATG TTGAGAGAAA TGAAGTTGCT GGAAATCCTA TTGTAATATT ATTATTTTTT 34316 .......... .......... .......... .......... .......... .......... 553 TGTGATTTTT CCCATTGTTT ATATAATGTA GAAAAATAGC AGTTCAATTA TGCTTGCAGC 34376 .......... .......... .......... .......... .......... .......... 553 CACCAAAAAG AGAGGAAAAA ATAGTTTGAT TTTACTCAAA ATGTTTTTGC TTTCTAATGT 34436 .......... .......... .......... .......... .......... .......... 553 GCAAAGAATT ACCATTTTTT GTATGTCTGT TTAAAAAGTA TTGACTAGTG AGCTAATATT 34496 .......... .......... .......... .......... .......... .......... 553 CCGAAGTAGA AGGATGAATT ATGTATCTCT TCAAGAAGCA GTACGTCATT TATTGAAATA 34556 .......... .......... .......... .......... .......... .......... 553 CCTTCATTGG ATTTGATTGG AGAGTGGGTG CGAGTGTCCT AGTACCAATT TATCCTGTTA 34616 .......... .......... .......... .......... .......... .......... 553 TTTTGGATTC GGTTAATAAA GTCCAACATC AATGTCATGG GACTAAAGTC TTCAACCATG 34676 .......... .......... .......... .......... .......... .......... 553 TTAGCTTGCT ATCGTCTATT TCTTTTGTTG CATAAAACAT TTTAAAATTT CAATTCTCTA 34736 .......... .......... .......... .......... .......... .......... 553 GATCTTAGAC ATTTATCTTT AAAAATGAAA AATACGTGGA AGTTCCTTAA ATAATATGCA 34796 .......... .......... .......... .......... .......... .......... 553 CAGACTTCAA GTGTGCCACA GATTCCAACA GATATATATG TATTTATGCG CATATCAGGC 34856 .......... .......... .......... .......... .......... .......... 553 AATTAGCATG TATATGGCGC TATGCTGAGC AAATGCACAC TTACCGAACT TCTACAGATA 34916 .......... .......... .......... .......... .......... .......... 553 CTACTTGATT TCATGTGTTA ATCAGAAGTG CTTTGAGCCT TTGACATAAT CCTTATCCTC 34976 .......... .......... .......... .......... .......... .......... 553 CCTCTGATGC AACAGCTAAT TATTTTTACC TTATTCATTC ATTGGTGTAA CTTCATGTAT 35036 .......... .......... .......... .......... .......... .......... 553 TAAAGATTAC TTTCATGTGT CAATTCTAAT AATTTGTACG CAACTATATA TTCTACCCTA 35096 .......... .......... .......... .......... .......... .......... 553 TTTGATTGGT TTACAACTCT ATCCATTTAT TAGCCGTAGT GGATAGTATT ATATTGCTTT 35156 .......... .......... .......... .......... .......... .......... 553 TAGTGTGAAG TAAAGTTTTG GAGTGTACAT TCCGTGTTGA AACCTTCCAT AAATGCTCTG 35216 .......... .......... .......... .......... .......... .......... 553 TCAAAGGCTC AACTTCTTTG TGATTGATTT TTTAAAAATT TGTTTTATGC ACAAGTATTT 35276 .......... .......... .......... .......... .......... .......... 553 GCTTGTGATA TACCATGTTC TCAGTGTGTT TGTTGATGTA ACTAGTTGGC GAGAAACACC 35336 .......... .......... .......... .......... .......... .......... 553 ATTGTCCATT TTTACTTAGC TTACATTAAG TTAATCGAAG GAGCACTGCA ATGAGATGTT 35396 .......... .......... .......... .......... .......... .......... 553 TGTGAATAGT GTTGGCCCTC TTCGTAAGCT TGTATTCCTT GCAGACATT 35445 | ||| .......... .......... .......... .......... ....AGATT 558 hqPGS_C09HBa0099P03.1-2+_SGN-U316773+ (27191 27275,30524 30613,30741 30832) ******************************************************************************** EST sequence 6 +strand 1139 n (File: SGN-U345971+) 1 NNNNTTATGA CCACGCGGNG GCGGCCGCTC TGAACTAGTG GATCCCCCGG GCTGCAGGAA 61 TTCGGCACGA GCCAATAACT CAGAGTGGCA CAAAAATGTC TTGCATTATG TTATGTTATG 121 TTAGACCTGA AAATTTTATA TTTGGAAAAT ATCAAATATT ATGAACATAA TATGTACTGA 181 TTCCAGGACC TACTACTCTT AAGTATGTAC TGATTCCAGG ATCTACTTCA AATCAATTAA 241 AATAGCAGGT CAAGAGAATT TTCTTTGCAT GTTACTCTGA TGGGATGAAC CAATATAGGG 301 ACTTCATCTG TGATTATTTG GTCTCTCTAA GAAGCCCCCC AAAAAAACAT GTTTCTGGCC 361 TCCAAATACT TCTCGCTCTT TAGATTAGGC ACTCTAGATT TCAGCTGGTT AGCGGTTGCT 421 AGTATTTGTA TGATTTTTTG TTTCTTTGTT TTGTGTGTAG CATGTTCTTA GATCCAAGAG 481 GAATTGCAAG ACATCAATCA GATTTTCTCT TCAACCTTGC CTTTTATTCT GTTTGATTGC 541 TTTCTATGGG GTCCTTTTGT TGTTTGTATC GCCAACAGTT TGCTGTAAGA GTTTTTGATG 601 ATGAGGAGTG TTGCTTCAAG GTTTTCTTTT TCCAAGACTC TGATTTCTAA TATAATTTTG 661 ATGCCATTAA ACATTTTCCA ATTTATTTTT TATTTTTCTA CTTGTTCTTC ATATACTACC 721 TCTGCAAACA TCTTTGTAAC CAACTTAATT ATTTGTGCAA CGAATGTGCT TTGGTTGGGG 781 AGTCTGAGTT TCTAACTTTT GGAAGGATTT TGGTCTCTCT TTCTGCAAGA AGTGAAAGAT 841 TTTTTTTTTT TGGAAGGATT CCAAAACAAT GATGTCCCGT TTAGTTGTGC CAACTGTGCA 901 CCTCATTTAC CAATCCCTAA CCTTTTTGAT CTTCAAAAAC ATTGCGTAAG ATCCGTTTTT 961 TTTTTTTTTA ATTTAACCAA GGGAGGGGAA AAATTTGCCC CCTTAAAAAA AAGGGGAAAA 1021 TTCCTTCCCA AACCCCCCGG GGAAAACATA TTTAATTTTC TCCCAAAAAA ACACGGAAAA 1081 ATTTTGAGGT TTTTTTTTTT TCCCGCCAAA GGGGAATTCC TTTTGGGGGA AAAATTTTA Predicted gene structure (within gDNA segment 27703 to 33217): Exon 1 29063 30025 ( 963 n); cDNA 77 1044 ( 968 n); score: 0.942 MATCH C09HBa0099P03.1-2+ SGN-U345971+ 0.942 963 0.845 C PGS_C09HBa0099P03.1-2+_SGN-U345971+ (29063 30025) Alignment (genomic DNA sequence = upper lines): AACTCAGAGT GGCACAAAAA TGTCTTGCAT TATGTTATGT TATGTTAGAC CTGAAAATTT 29122 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AACTCAGAGT GGCACAAAAA TGTCTTGCAT TATGTTATGT TATGTTAGAC CTGAAAATTT 136 TATATTTGGA AAATATCAAA TATTATGAAC ATAATATGTA CTGATTCCAG GACCTACTAC 29182 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TATATTTGGA AAATATCAAA TATTATGAAC ATAATATGTA CTGATTCCAG GACCTACTAC 196 TC-TAAGTAT GTACTGATTC CAGGATCTAC TTCAAATCAA TTAAAATAGC AGGTCAAGAG 29241 || ||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCTTAAGTAT GTACTGATTC CAGGATCTAC TTCAAATCAA TTAAAATAGC AGGTCAAGAG 256 AATTTTCTTT GCATGTTACT CTGATGGGAT GAACCAATAT AGGGACTTCA TCTGTGATTA 29301 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AATTTTCTTT GCATGTTACT CTGATGGGAT GAACCAATAT AGGGACTTCA TCTGTGATTA 316 TTTGGTCTCT CTAAGAAGCC CCCCAAAAAA ACATGTTTCT GGCCTCCAAA TACTTCTCGC 29361 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTGGTCTCT CTAAGAAGCC CCCCAAAAAA ACATGTTTCT GGCCTCCAAA TACTTCTCGC 376 TCTTTAGATT AGGCACTCTA GATTTCAGCT GGTTAGCGGT TGCTGGTATT TGTATGATTT 29421 |||||||||| |||||||||| |||||||||| |||||||||| |||| ||||| |||||||||| TCTTTAGATT AGGCACTCTA GATTTCAGCT GGTTAGCGGT TGCTAGTATT TGTATGATTT 436 TTTGTTTCTT TGTTTTGTGT GTAGCATGTT CTTAGATCCA AGAGGAATTG CAAGACATCA 29481 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTGTTTCTT TGTTTTGTGT GTAGCATGTT CTTAGATCCA AGAGGAATTG CAAGACATCA 496 ATCAGATTTT CTCTTCAACC TTGCCTTTTA TTCTGTTTGA TTGCTTTCTA TGGGGTCCTT 29541 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATCAGATTTT CTCTTCAACC TTGCCTTTTA TTCTGTTTGA TTGCTTTCTA TGGGGTCCTT 556 TTGTTGTTTG TATCGCCAAC AGTTTGCTGT AAGAGTTTTT GATGATGAGG AGTGTTACTT 29601 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||| ||| TTGTTGTTTG TATCGCCAAC AGTTTGCTGT AAGAGTTTTT GATGATGAGG AGTGTTGCTT 616 CAAGGTTTTC TTTCTCCAAG ACTCTGATTT CTAATATAAT TTTGATGCCA TTAAACGTTT 29661 |||||||||| ||| |||||| |||||||||| |||||||||| |||||||||| |||||| ||| CAAGGTTTTC TTTTTCCAAG ACTCTGATTT CTAATATAAT TTTGATGCCA TTAAACATTT 676 TCCAATTTAT TTTTTATTTT TCTACTTGTT CTTCATATAC TACCTCTGCA AACATCTTTG 29721 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCCAATTTAT TTTTTATTTT TCTACTTGTT CTTCATATAC TACCTCTGCA AACATCTTTG 736 TAACCAACTT AATTATTTGT GCAACGAATG TGCTTTGGTT GGGGAGTCTG AGTTTCTAAC 29781 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TAACCAACTT AATTATTTGT GCAACGAATG TGCTTTGGTT GGGGAGTCTG AGTTTCTAAC 796 TTTTGGAAGG ATTTTGGTCT CTCTTTCTGC GAGAAGTGAA AGA-TTTTTT TTTTTGGAAG 29840 |||||||||| |||||||||| |||||||||| ||||||||| ||| |||||| |||||||||| TTTTGGAAGG ATTTTGGTCT CTCTTTCTGC AAGAAGTGAA AGATTTTTTT TTTTTGGAAG 856 GATTCCAGAA CAATGATGTC TCGTTTAGTT GTGCTAACTG TGCACCTCAT ATAACAATAC 29900 ||||||| || |||||||||| ||||||||| |||| ||||| |||||||||| || |||| | GATTCCAAAA CAATGATGTC CCGTTTAGTT GTGCCAACTG TGCACCTCAT TTACCAAT-C 915 AGTAACATAT TTGATCTTCA GAACAATTGC GCATGAT-C- TGTTTTTTTT TTTAATTTTA 29958 |||| | | |||||||||| || ||||| | | ||| | | |||||||| ||| | |||| CCTAACCTTT TTGATCTTCA AAAACATTGC GTAAGATCCG TTTTTTTTTT TTTTAATTTA 975 ACTAATGGAT GTGGAAGTAT TG-CACCTTA GAATGAAGGT G-ATATTCCT TCACTAACCT 30016 || || ||| | | || | || | ||||| || |||| | | |||||| || | |||| ACCAAGGGAG GGGAAAAATT TGCCCCCTTA AAAAAAAGGG GAAAATTCCT TCCCAAACCC 1035 CCCTGGGAA 30025 ||| ||||| CCCGGGGAA 1044 hqPGS_C09HBa0099P03.1-2+_SGN-U345971+ (29063 30025) ******************************************************************************** EST sequence 10 +strand 694 n (File: SGN-U316773+) 1 GGGCGATTGG CGGCCAATCG GCCGAATTGC TCTTCTGTCT TCTCTTCACC ATTCGCACCT 61 CATTCAGAGA AAAGTCAGGT TTTTTTGTAG AATCAAAGTC TTCGTATCCA ATATCTCGTT 121 GCCATGTCGG ACGAGGAAGT TGTTGACCCA AAGGCGACAA TGGAAGTATC TTGCAAGCCT 181 AAGTGTGTAA GGCAACTAAA GGATTATCAG GCATGTACTA GAAGGATAGA AGGTGATGAA 241 TCAGGGAGCA AGCATTGCAC TGGACAGTAT TTTGATTATT GGCAATGCAT TGACAAATGT 301 GTTGCCCCAA AGCTATTTGA AAAACTCAAG TAACATGGAG ATAAGTGTTC ATCCATTACG 361 ATTTTATCTT GCTTTTTCAT TGTTGAAGCC TGTGCTACAG AATTGCAATC CCTTCAATCA 421 ACTTAGTTGC CGCATGGAAA ATTTTGTACT ATGCACATTA TTAAGAAGTG ATTAATAAAG 481 GATACTTATG GGTTAAATTT TGTGGACACT TTTATTTGGT CAGATGATTC AAAATCCGGA 541 GGATATTATT CCCAGATTTT TGTTCTCTCT TGCGTTGTTA TATGAGCTCC CAGTTACCTA 601 ATTTCTTTAT GGGAGGAAAT ATCATAAATC AGTGATTTAT ATAAAAAAAA AAAAAAAAAA 661 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAA Predicted gene structure (within gDNA segment 32620 to 38720): Exon 1 33510 33558 ( 49 n); cDNA 30 78 ( 49 n); score: 1.000 Intron 1 33559 33799 ( 241 n); Pd: 0.771 (s: 1.00), Pa: 0.950 (s: 1.00) Exon 2 33800 33846 ( 47 n); cDNA 79 125 ( 47 n); score: 1.000 Intron 2 33847 35977 (2131 n); Pd: 0.956 (s: 1.00), Pa: 0.999 (s: 1.00) Exon 3 35978 36062 ( 85 n); cDNA 126 210 ( 85 n); score: 1.000 Intron 3 36063 37092 (1030 n); Pd: 0.828 (s: 1.00), Pa: 0.999 (s: 1.00) Exon 4 37093 37182 ( 90 n); cDNA 211 300 ( 90 n); score: 1.000 Intron 4 37183 37266 ( 84 n); Pd: 0.994 (s: 1.00), Pa: 0.998 (s: 1.00) Exon 5 37267 37610 ( 344 n); cDNA 301 644 ( 344 n); score: 0.994 PPA cDNA 645 694 MATCH C09HBa0099P03.1-2+ SGN-U316773+ 0.996 615 0.886 C PGS_C09HBa0099P03.1-2+_SGN-U316773+ (33510 33558,33800 33846,35978 36062,37093 37182,37267 37610) Alignment (genomic DNA sequence = upper lines): CTCTTCTGTC TTCTCTTCAC CATTCGCACC TCATTCAGAG AAAAGTCAGG TTGCTCCTCT 33569 |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| CTCTTCTGTC TTCTCTTCAC CATTCGCACC TCATTCAGAG AAAAGTCAG. .......... 78 TCTACGCATT TGATCTGCTT CCTCTTTATG TGTTCATTCG ATCTGCATAA TATTAGGTTT 33629 .......... .......... .......... .......... .......... .......... 78 TGCTATTTTA TCTTGTTTTG TTGCATTCGC ACACATGGAG AATTGTGTTT TTTTTGCTTG 33689 .......... .......... .......... .......... .......... .......... 78 ATAGCTGGTT AATTTTGTAT TGATTAGTGT AAAAATTGCA TTTGCTGGTG AATTCGTAGA 33749 .......... .......... .......... .......... .......... .......... 78 ATCAATGTCT TCGTATCCAA AATTGATTAG TGTAAAAATT GCATTTGCAG GTTTTTTTGT 33809 |||||||||| .......... .......... .......... .......... .......... GTTTTTTTGT 88 AGAATCAAAG TCTTCGTATC CAATATCTCG TTGCCATGTG AGTACTGTTG ATCTATTTAT 33869 |||||||||| |||||||||| |||||||||| ||||||| AGAATCAAAG TCTTCGTATC CAATATCTCG TTGCCAT... .......... .......... 125 AATGAAACTA TTCTCTCTTT ATTAAAGTAA TTAAATTTTG ATGTGATTTA TGTGTTTCGT 33929 .......... .......... .......... .......... .......... .......... 125 TTCGTTTGTA TAATCCATTT ATTCTAATAG GATTTCACTG TTTATGAGTA ATTTATTCTC 33989 .......... .......... .......... .......... .......... .......... 125 CTTACCATAT ATGTGAGTGA TTTTTAAATA GTTTTTCAAT TATACACTGT AGAAATTGAG 34049 .......... .......... .......... .......... .......... .......... 125 ATTCCCTAAA TAGTTTATAT AGAATTTTGA TCTGGAACAG TAAAAAAATA GAAATAAAAT 34109 .......... .......... .......... .......... .......... .......... 125 CGTCATCTTT ATTTTGTCCA AGAGCAAAGT GTTAGTTGGA CCCTCTATGT TGAGAGAAAT 34169 .......... .......... .......... .......... .......... .......... 125 GAAGTTGATG GAAATTCTAT CTAGAACAGT AAAAAAACAC AAATAGAATG GTCATCTTTA 34229 .......... .......... .......... .......... .......... .......... 125 TTCTGTCAAA GAGCAAAGTG TTAGTTGGAC CTCTATGTTG AGAGAAATGA AGTTGCTGGA 34289 .......... .......... .......... .......... .......... .......... 125 AATCCTATTG TAATATTATT ATTTTTTTGT GATTTTTCCC ATTGTTTATA TAATGTAGAA 34349 .......... .......... .......... .......... .......... .......... 125 AAATAGCAGT TCAATTATGC TTGCAGCCAC CAAAAAGAGA GGAAAAAATA GTTTGATTTT 34409 .......... .......... .......... .......... .......... .......... 125 ACTCAAAATG TTTTTGCTTT CTAATGTGCA AAGAATTACC ATTTTTTGTA TGTCTGTTTA 34469 .......... .......... .......... .......... .......... .......... 125 AAAAGTATTG ACTAGTGAGC TAATATTCCG AAGTAGAAGG ATGAATTATG TATCTCTTCA 34529 .......... .......... .......... .......... .......... .......... 125 AGAAGCAGTA CGTCATTTAT TGAAATACCT TCATTGGATT TGATTGGAGA GTGGGTGCGA 34589 .......... .......... .......... .......... .......... .......... 125 GTGTCCTAGT ACCAATTTAT CCTGTTATTT TGGATTCGGT TAATAAAGTC CAACATCAAT 34649 .......... .......... .......... .......... .......... .......... 125 GTCATGGGAC TAAAGTCTTC AACCATGTTA GCTTGCTATC GTCTATTTCT TTTGTTGCAT 34709 .......... .......... .......... .......... .......... .......... 125 AAAACATTTT AAAATTTCAA TTCTCTAGAT CTTAGACATT TATCTTTAAA AATGAAAAAT 34769 .......... .......... .......... .......... .......... .......... 125 ACGTGGAAGT TCCTTAAATA ATATGCACAG ACTTCAAGTG TGCCACAGAT TCCAACAGAT 34829 .......... .......... .......... .......... .......... .......... 125 ATATATGTAT TTATGCGCAT ATCAGGCAAT TAGCATGTAT ATGGCGCTAT GCTGAGCAAA 34889 .......... .......... .......... .......... .......... .......... 125 TGCACACTTA CCGAACTTCT ACAGATACTA CTTGATTTCA TGTGTTAATC AGAAGTGCTT 34949 .......... .......... .......... .......... .......... .......... 125 TGAGCCTTTG ACATAATCCT TATCCTCCCT CTGATGCAAC AGCTAATTAT TTTTACCTTA 35009 .......... .......... .......... .......... .......... .......... 125 TTCATTCATT GGTGTAACTT CATGTATTAA AGATTACTTT CATGTGTCAA TTCTAATAAT 35069 .......... .......... .......... .......... .......... .......... 125 TTGTACGCAA CTATATATTC TACCCTATTT GATTGGTTTA CAACTCTATC CATTTATTAG 35129 .......... .......... .......... .......... .......... .......... 125 CCGTAGTGGA TAGTATTATA TTGCTTTTAG TGTGAAGTAA AGTTTTGGAG TGTACATTCC 35189 .......... .......... .......... .......... .......... .......... 125 GTGTTGAAAC CTTCCATAAA TGCTCTGTCA AAGGCTCAAC TTCTTTGTGA TTGATTTTTT 35249 .......... .......... .......... .......... .......... .......... 125 AAAAATTTGT TTTATGCACA AGTATTTGCT TGTGATATAC CATGTTCTCA GTGTGTTTGT 35309 .......... .......... .......... .......... .......... .......... 125 TGATGTAACT AGTTGGCGAG AAACACCATT GTCCATTTTT ACTTAGCTTA CATTAAGTTA 35369 .......... .......... .......... .......... .......... .......... 125 ATCGAAGGAG CACTGCAATG AGATGTTTGT GAATAGTGTT GGCCCTCTTC GTAAGCTTGT 35429 .......... .......... .......... .......... .......... .......... 125 ATTCCTTGCA GACATTATAG GTGGAGAGAA ATTTTCATGC TATTTTTCCT AATACTCCCT 35489 .......... .......... .......... .......... .......... .......... 125 CCGTCCGATT TTATGTGGCA CAATTTGACC TAGCACGGAG TTTTAAAAAA ATGAAAACTT 35549 .......... .......... .......... .......... .......... .......... 125 TGAATTGTTT ACTAAATTGT CCTTCAAAAG TAGACTCACT TTTCTCTTTT CTCATAAACG 35609 .......... .......... .......... .......... .......... .......... 125 TATTAGAGTA CTATGTAAAA TTAAGTGGGA CCAACAAGGG TAAAAGAGGA ATTGCACCTT 35669 .......... .......... .......... .......... .......... .......... 125 TAAATACTTA CCATATAAGA AAATGTGACA TTCTTTTTGG GACTGAGACA AAAAGGAAAT 35729 .......... .......... .......... .......... .......... .......... 125 GGTGCCACAT AAAATGAGAC GGAGGGAGTA ACTATTATGT CAGAAGCATC TGATATTATT 35789 .......... .......... .......... .......... .......... .......... 125 CTTACACGAT AGTCCTCAAT GACTTACAAA TATCGAGCTC ATGTTACAAA CACAGAGTTA 35849 .......... .......... .......... .......... .......... .......... 125 GACTGTTGTA ATTAATGACA AGTGACACTC AGTGACTCCC CCCCCCCCCA CCACCACCAC 35909 .......... .......... .......... .......... .......... .......... 125 CACACACACA CACACACAAG TATTGCCATC TCTTTGAAGA TAAATTGCTG TTGGTTTCCT 35969 .......... .......... .......... .......... .......... .......... 125 TTGTTCAGGT CGGACGAGGA AGTTGTTGAC CCAAAGGCGA CAATGGAAGT ATCTTGCAAG 36029 || |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ........GT CGGACGAGGA AGTTGTTGAC CCAAAGGCGA CAATGGAAGT ATCTTGCAAG 177 CCTAAGTGTG TAAGGCAACT AAAGGATTAT CAGGTGATTC CCACCATAAC AAAACAGCCT 36089 |||||||||| |||||||||| |||||||||| ||| CCTAAGTGTG TAAGGCAACT AAAGGATTAT CAG....... .......... .......... 210 TTCAGTTATA TCAAAGGGAT TTTCTTGTTC TTATATAATT GTTATTGGGT GAGCTGTAAC 36149 .......... .......... .......... .......... .......... .......... 210 TTGGACTTGA TTGTTTGCGG TGGTGGAAGG CTAATTTTTA TTATTGTTAA CATTATCCTG 36209 .......... .......... .......... .......... .......... .......... 210 TACGGTCTCC ACAAGAACTC AGAGTGACAC AAAAATGTCT TGGCAATGTC ATGTTAGACC 36269 .......... .......... .......... .......... .......... .......... 210 TGAAAATTGT AAATTTGGAA AACATCAAGT ACTATGGCCG ACATATGACC TAACATAGTA 36329 .......... .......... .......... .......... .......... .......... 210 TGTATCACTT GAACTTTCAG CTACTCTAAT TACCAAATCA ACTGATATAG TAAGCCAGGA 36389 .......... .......... .......... .......... .......... .......... 210 GAATTTTCTT GTTATGTCAC TCTAATGGGA TGAACCAACA TGTGACGAAG AGACTTAATC 36449 .......... .......... .......... .......... .......... .......... 210 TGTAACTGCT TGGTCTTTCC AAGTAGCCAA AAAGGCATGT TTATGGGCCT CCAAGTACAC 36509 .......... .......... .......... .......... .......... .......... 210 CCACCGACCT CCTCTCATAT TGGCTCTCCA GATTCAACTG GTCAGCAGAA TGCTAGTATT 36569 .......... .......... .......... .......... .......... .......... 210 TCTATGATCT TTTTTGTTTC CTCTGCTCAT TTTTCTGTAT CATATGTGTC TATCGGGTCC 36629 .......... .......... .......... .......... .......... .......... 210 ATTGTTCTTT CTATTGCCAA CAATTTGCTG TAACAATTTT TTATGAAGAA CACAGTGCTG 36689 .......... .......... .......... .......... .......... .......... 210 TAAGGTTTTG ATTATCAAAG ACTCAGAGTT TTAATTTCGA TCCCGTTAAA CCTTTCCGAG 36749 .......... .......... .......... .......... .......... .......... 210 TTAGTTTTGG CTTTCAACTT CTTGTTATTC ATCTACTCCC TCTGCCTAAA TCTTTTAATC 36809 .......... .......... .......... .......... .......... .......... 210 AACTTAATGG TTTGTGCAAA GGATGTGCTT TGGACATGGG GAATCTGGGA TTTTTGTGTG 36869 .......... .......... .......... .......... .......... .......... 210 TTATGCTTGA AGGATGATAT CTAATTGATT TGTGACAACT ACCACCTTAG ATAACTGTAC 36929 .......... .......... .......... .......... .......... .......... 210 AGTAGCATTC GTTTGATCTT CAGAACAATT GCATTTAACT GTTCTGTGCA CTTCCTGAAC 36989 .......... .......... .......... .......... .......... .......... 210 ATTTATGTCT CGTTCATTTG TTCTGAATTT GTGCAAATGT ATTTAAAATT TGTACCGACT 37049 .......... .......... .......... .......... .......... .......... 210 TTCATACTTT GTGGCTCTTA TTTGGGTATT TTGCATCCTA CAGGCATGTA CTAGAAGGAT 37109 ||||||| |||||||||| .......... .......... .......... .......... ...GCATGTA CTAGAAGGAT 227 AGAAGGTGAT GAATCAGGGA GCAAGCATTG CACTGGACAG TATTTTGATT ATTGGCAATG 37169 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGAAGGTGAT GAATCAGGGA GCAAGCATTG CACTGGACAG TATTTTGATT ATTGGCAATG 287 CATTGACAAA TGTGTAAGCT CTATTTCTCT ATTCTCCGTT GTTGCTCTAT TTAGCATTTC 37229 |||||||||| ||| CATTGACAAA TGT....... .......... .......... .......... .......... 300 GTTTTGTTTC TAAACTTTTT ATGGAATTGT TACACAGGTT GCCCCAAAGC TATTTGAAAA 37289 ||| |||||||||| |||||||||| .......... .......... .......... .......GTT GCCCCAAAGC TATTTGAAAA 323 ACTCAAGTAA CATGGAGATA AGTGTTCATC CATTACGATT TTATCTTGCT TTTTCATTGT 37349 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACTCAAGTAA CATGGAGATA AGTGTTCATC CATTACGATT TTATCTTGCT TTTTCATTGT 383 TGAAGCCTGT GCTACAGACT TGCAATCCCT TCAATCAACT TAGTTGCCGC ATGGAAAATT 37409 |||||||||| |||||||| | |||||||||| |||||||||| |||||||||| |||||||||| TGAAGCCTGT GCTACAGAAT TGCAATCCCT TCAATCAACT TAGTTGCCGC ATGGAAAATT 443 TTGTACTATG CACATTATTA AGAAGTGATT AATAAAGGAT ACTTATGGGT TAAATTTTGT 37469 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGTACTATG CACATTATTA AGAAGTGATT AATAAAGGAT ACTTATGGGT TAAATTTTGT 503 GGACACTTTT ATTTGGTCAG ATGATTCAAA ATCCGGAGGA CATTATTCCC AGATTTTTGT 37529 |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| |||||||||| GGACACTTTT ATTTGGTCAG ATGATTCAAA ATCCGGAGGA TATTATTCCC AGATTTTTGT 563 TCTCTCTTGC GTTGTTATAT GAGCTCCCAG TTACCTAATT TCTTTATGGG AGGAAATATC 37589 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCTCTCTTGC GTTGTTATAT GAGCTCCCAG TTACCTAATT TCTTTATGGG AGGAAATATC 623 ATAAATCAGT GATTTATATA A 37610 |||||||||| |||||||||| | ATAAATCAGT GATTTATATA A 644 hqPGS_C09HBa0099P03.1-2+_SGN-U316773+ (33510 33558,33800 33846,35978 36062,37093 37182,37267 37610) ******************************************************************************** EST sequence 8 +strand 563 n (File: SGN-U316772+) 1 GGCGATTGGC GGCCAATCGG CCGAATTCCT CTTCGCCATC CGCAGCTCAT TCTAGAACAG 61 TCAGGTGATT TTGCAGAATC GAAGTCTTCG TCTCGAATAT CTCTTTGCCA TGTCGGACGA 121 GGAAGTTGTT GACCCAAAGG CGACATTAGA AGTAAGTTGC AAGCCTAAGT GTGTAAGGCA 181 ACTAAAGGAG TATCAGGCAT GTACTAAAAG GATAGAAGGT GATGAATCAG GGCACAAACA 241 TTGCACTGGA CAGTATTTTG ATTATTGGCA CTGCATCGAC AAATGTGTTG CTGCGAAGTT 301 GTTTGACCAT CTCAAGTAAC AAGGATATAA GTTGTTGATC CCTTGCAATT TATCTTCTTT 361 TTGGTTGTTG AACAAGTCAT TACCATATTA TTCCTCACTG TGCTGAAGAC TTGTAACCCT 421 TTCAATCAAC TTGGTTGCTG CATGGAAAAT TTTGAACTAT GCACATCTTA AAAAGTGATT 481 AATAAATCAT ACTCGTGGGT TGAATTGGAC CCTTTTATTC GTTCGAAAAA AAAAAAAAAA 541 AAAAAAAAAA AAAAAAAAAA AAA Predicted gene structure (within gDNA segment 34434 to 40706): Exon 1 35978 36062 ( 85 n); cDNA 112 196 ( 85 n); score: 0.941 Intron 1 36063 37092 (1030 n); Pd: 0.828 (s: 0.92), Pa: 0.999 (s: 0.92) Exon 2 37093 37182 ( 90 n); cDNA 197 286 ( 90 n); score: 0.933 Intron 2 37183 37266 ( 84 n); Pd: 0.994 (s: 0.94), Pa: 0.998 (s: 0.74) Exon 3 37267 37370 ( 104 n); cDNA 287 388 ( 102 n); score: 0.697 Intron 3 37371 38791 (1421 n); Pd: 0.352 (s: 0.64), Pa: 0.498 (s: 0) Exon 4 38792 38800 ( 9 n); cDNA 389 396 ( 8 n); score: 0.778 PPA cDNA 526 563 MATCH C09HBa0099P03.1-2+ SGN-U316772+ 0.848 288 0.512 C PGS_C09HBa0099P03.1-2+_SGN-U316772+ (35978 36062,37093 37182,37267 37370,38792 38800) Alignment (genomic DNA sequence = upper lines): GTCGGACGAG GAAGTTGTTG ACCCAAAGGC GACAATGGAA GTATCTTGCA AGCCTAAGTG 36037 |||||||||| |||||||||| |||||||||| |||| | ||| ||| ||||| |||||||||| GTCGGACGAG GAAGTTGTTG ACCCAAAGGC GACATTAGAA GTAAGTTGCA AGCCTAAGTG 171 TGTAAGGCAA CTAAAGGATT ATCAGGTGAT TCCCACCATA ACAAAACAGC CTTTCAGTTA 36097 |||||||||| |||||||| | ||||| TGTAAGGCAA CTAAAGGAGT ATCAG..... .......... .......... .......... 196 TATCAAAGGG ATTTTCTTGT TCTTATATAA TTGTTATTGG GTGAGCTGTA ACTTGGACTT 36157 .......... .......... .......... .......... .......... .......... 196 GATTGTTTGC GGTGGTGGAA GGCTAATTTT TATTATTGTT AACATTATCC TGTACGGTCT 36217 .......... .......... .......... .......... .......... .......... 196 CCACAAGAAC TCAGAGTGAC ACAAAAATGT CTTGGCAATG TCATGTTAGA CCTGAAAATT 36277 .......... .......... .......... .......... .......... .......... 196 GTAAATTTGG AAAACATCAA GTACTATGGC CGACATATGA CCTAACATAG TATGTATCAC 36337 .......... .......... .......... .......... .......... .......... 196 TTGAACTTTC AGCTACTCTA ATTACCAAAT CAACTGATAT AGTAAGCCAG GAGAATTTTC 36397 .......... .......... .......... .......... .......... .......... 196 TTGTTATGTC ACTCTAATGG GATGAACCAA CATGTGACGA AGAGACTTAA TCTGTAACTG 36457 .......... .......... .......... .......... .......... .......... 196 CTTGGTCTTT CCAAGTAGCC AAAAAGGCAT GTTTATGGGC CTCCAAGTAC ACCCACCGAC 36517 .......... .......... .......... .......... .......... .......... 196 CTCCTCTCAT ATTGGCTCTC CAGATTCAAC TGGTCAGCAG AATGCTAGTA TTTCTATGAT 36577 .......... .......... .......... .......... .......... .......... 196 CTTTTTTGTT TCCTCTGCTC ATTTTTCTGT ATCATATGTG TCTATCGGGT CCATTGTTCT 36637 .......... .......... .......... .......... .......... .......... 196 TTCTATTGCC AACAATTTGC TGTAACAATT TTTTATGAAG AACACAGTGC TGTAAGGTTT 36697 .......... .......... .......... .......... .......... .......... 196 TGATTATCAA AGACTCAGAG TTTTAATTTC GATCCCGTTA AACCTTTCCG AGTTAGTTTT 36757 .......... .......... .......... .......... .......... .......... 196 GGCTTTCAAC TTCTTGTTAT TCATCTACTC CCTCTGCCTA AATCTTTTAA TCAACTTAAT 36817 .......... .......... .......... .......... .......... .......... 196 GGTTTGTGCA AAGGATGTGC TTTGGACATG GGGAATCTGG GATTTTTGTG TGTTATGCTT 36877 .......... .......... .......... .......... .......... .......... 196 GAAGGATGAT ATCTAATTGA TTTGTGACAA CTACCACCTT AGATAACTGT ACAGTAGCAT 36937 .......... .......... .......... .......... .......... .......... 196 TCGTTTGATC TTCAGAACAA TTGCATTTAA CTGTTCTGTG CACTTCCTGA ACATTTATGT 36997 .......... .......... .......... .......... .......... .......... 196 CTCGTTCATT TGTTCTGAAT TTGTGCAAAT GTATTTAAAA TTTGTACCGA CTTTCATACT 37057 .......... .......... .......... .......... .......... .......... 196 TTGTGGCTCT TATTTGGGTA TTTTGCATCC TACAGGCATG TACTAGAAGG ATAGAAGGTG 37117 ||||| ||||| |||| |||||||||| .......... .......... .......... .....GCATG TACTAAAAGG ATAGAAGGTG 221 ATGAATCAGG GAGCAAGCAT TGCACTGGAC AGTATTTTGA TTATTGGCAA TGCATTGACA 37177 |||||||||| | ||| ||| |||||||||| |||||||||| ||||||||| ||||| |||| ATGAATCAGG GCACAAACAT TGCACTGGAC AGTATTTTGA TTATTGGCAC TGCATCGACA 281 AATGTGTAAG CTCTATTTCT CTATTCTCCG TTGTTGCTCT ATTTAGCATT TCGTTTTGTT 37237 ||||| AATGT..... .......... .......... .......... .......... .......... 286 TCTAAACTTT TTATGGAATT GTTACACAGG TTGCCCCAAA GCTATTTGAA AAACTCAAGT 37297 | |||| | || | | ||||| | ||||||| .......... .......... .........G TTGCTGCGAA GTTGTTTGAC CATCTCAAGT 317 AACATGGAGA TAAG-TGTTC ATCCATTACG ATTTTATCTT GCTTTTTCAT TGTTGAAGCC 37356 |||| ||| | |||| |||| |||| || | | |||||||| |||||| | ||||||| | AACAAGGATA TAAGTTGTTG ATCCCTTGC- AATTTATCTT -CTTTTTGGT TGTTGAA-CA 374 TGTGCTACAG ACTTGCAATC CCTTCAATCA ACTTAGTTGC CGCATGGAAA ATTTTGTACT 37416 || | | | AGTCATTACC ATAT...... .......... .......... .......... .......... 388 ATGCACATTA TTAAGAAGTG ATTAATAAAG GATACTTATG GGTTAAATTT TGTGGACACT 37476 .......... .......... .......... .......... .......... .......... 388 TTTATTTGGT CAGATGATTC AAAATCCGGA GGACATTATT CCCAGATTTT TGTTCTCTCT 37536 .......... .......... .......... .......... .......... .......... 388 TGCGTTGTTA TATGAGCTCC CAGTTACCTA ATTTCTTTAT GGGAGGAAAT ATCATAAATC 37596 .......... .......... .......... .......... .......... .......... 388 AGTGATTTAT ATAAGCCCAT TTCTTCTTAG TTTTGGTTGT TTTGTGGTTT ATAGTCCTGA 37656 .......... .......... .......... .......... .......... .......... 388 ATTAGTGTGG CATTAAAATT TGCATGCAAC CCTTTAGGTC GAGAATATTG CTAAATGAGT 37716 .......... .......... .......... .......... .......... .......... 388 CATAGAGTAG ACAAAAGTGC AAGGTACAGT CAACTGCTAG AGTGATGTTC CTCACTGTAT 37776 .......... .......... .......... .......... .......... .......... 388 TTGTGGGTTT TCTTCTGGGC TGTCTTCTTA TTTAGCCGAT GGAGAGGTAA TAGGTATCCT 37836 .......... .......... .......... .......... .......... .......... 388 GTGAAATTGT TGATGGAGTG TTCACCTAAC ACTATAACAT GTTAATCTAT GCGTAATCAA 37896 .......... .......... .......... .......... .......... .......... 388 ACCATGTGGC AACTTTATCA TAAAAACAAG ACAAGTAAAT TGAAATGGAT GGAGTAATTT 37956 .......... .......... .......... .......... .......... .......... 388 TTATGGAATA ATTAGGTTTT TGTTTTAGTT GTTGACTTGT GATGACAAAT CAAAGGTTTC 38016 .......... .......... .......... .......... .......... .......... 388 TTATCTTCTT TAGGTGCTTT GTTTTTAATC TGTGGATATC TTATTGATTT TTTAGAGCTT 38076 .......... .......... .......... .......... .......... .......... 388 CCTGATTCTG TATGCTTATT CACTTGTTTG TCAAGCAGGA TGAGAAGTAT ATATGTTCTC 38136 .......... .......... .......... .......... .......... .......... 388 TATATAATGA AATGACTAAA GATTTTTTTT TGAATTGAAA ATTACAGAGA AAGAGTCTGT 38196 .......... .......... .......... .......... .......... .......... 388 TTTTAAGAAA TATAGTTTTA CTGTACAAAA TAGATATGGG AGTACATATT TTGAATACAG 38256 .......... .......... .......... .......... .......... .......... 388 GGAATACGAT TCTTTTTGGT TTTCACCGTC CTCCGAAAAT AATTATAACT TATAAATATA 38316 .......... .......... .......... .......... .......... .......... 388 CATAAATTAT ACACTAATGA TGTATAATTT TGCGTATATT ATACATCCGT AGATAATTAT 38376 .......... .......... .......... .......... .......... .......... 388 TTTGGACGGT GGCGATACAG TGTGAAAATC CCCTTTTAAT GTTTAAGAAT GGGTCCAAAA 38436 .......... .......... .......... .......... .......... .......... 388 CCCCAAGAAC ATGATAGTTC TAATGGGATC TTCCATGTTT GAACTCTTTC TTCTTTGATT 38496 .......... .......... .......... .......... .......... .......... 388 TTGATTTTCT TTGGAATTTT GTTTTGATTT TGAGAGCTTT TCTCTGGTTG TTCTCTTTGA 38556 .......... .......... .......... .......... .......... .......... 388 TTTGTGAATC AAAGATTTAT CATCTTCAGA GAAATGGTGA GGCTTCATCC GTGTGTTGAA 38616 .......... .......... .......... .......... .......... .......... 388 ATGTTTTACT TAAGATATTG AAGCTTCTTA TAAATTGTGG ACAAAATACA AACAAATTTA 38676 .......... .......... .......... .......... .......... .......... 388 TAGAAAAAAA CTTCATGAAA TGAGAAAAGC ATTAGAGATG GAATATATGA TAGATATCTC 38736 .......... .......... .......... .......... .......... .......... 388 CATTTATTAA GCATATATTC CTTATGTCTG TTGATGTGGA TGGTGTATAT ATCAGTATTT 38796 || || .......... .......... .......... .......... .......... .....TA-TT 392 GCTC 38800 ||| CCTC 396 hqPGS_C09HBa0099P03.1-2+_SGN-U316772+ (35978 36062,37093 37182,37267 37370) ******************************************************************************** EST sequence 34 +strand 1167 n (File: SGN-U342268+) 1 NGTGCAGGGA TGATCGACTC CTAAGGGCGA ATTGGCGGCC AATCGGCCGA ATTGATAAGA 61 AGCTTCAATA TCTTAAGTAA AACATTTCAA CACACCAAAA TATAGGATGA AGCCTCACCA 121 TTTCTCTGAA GATGATAAAT CTTTGATTCA CAAATCAAAG AGAACAACCA GAGAAAAGCT 181 CTCAAAATCA AAACAAAATT CCAAAGAAAA TCAAAATCAA AGAAGAAAGA GTTCAAACAT 241 GGAAGATCCC ATTAGAACTA TCATGTTCTT GGGGTTTTGG ACCCATTCTT AAACATTAAA 301 AGGGGATTTT CACACTGTAT CGCCACCGTC CAAAATAATT ATCTACGGAT GTATAATATA 361 CGCTAAATTA TACATCATTA GTGTATAATT TATGTATATT TATAAGTTAT AATTATTTTC 421 GGAGGACGGT GAAAACCAAA AAGAATCGTA TTCCCTGTAT TCAAAATATG TACTCCCATA 481 TCTATTTTGT ACAGTAAAAC TATATTTCTT AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 541 AAAAAAAAAA AAAAAAAATT CCCGGCCAAA GGGGGCCCCT GGGATTTTTT TAAGGGGCCC 601 CAAAAAAGGG GGGGGAAAAC GGGGGAAAAA GTTTTTCCGG GGGGAATGTT TCCCCCAAAA 661 AAANNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 721 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 781 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 841 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 961 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1021 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1081 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1141 NNNNNNNNNN NNNNNNNNNN NNNNNNN Predicted gene structure (within gDNA segment 39787 to 31066): Exon 1 38606 38192 ( 415 n); cDNA 105 519 ( 415 n); score: 0.993 Intron 1 38191 38136 ( 56 n); Pd: 0.000 (s: 0.96), Pa: 0.169 (s: 0) Exon 2 38135 38129 ( 7 n); cDNA 520 526 ( 7 n); score: 0.571 Intron 2 38128 32053 (6076 n); Pd: 0.900 (s: 0), Pa: 0.968 (s: 0) Exon 3 32052 32042 ( 11 n); cDNA 527 537 ( 11 n); score: 0.818 Intron 3 32041 31920 ( 122 n); Pd: 0.000 (s: 0), Pa: 0.250 (s: 0) Exon 4 31919 31893 ( 27 n); cDNA 538 564 ( 27 n); score: 0.667 MATCH C09HBa0099P03.1-2- SGN-U342268+ 0.993 460 0.394 C PGS_C09HBa0099P03.1-2-_SGN-U342268+ (38606 38192,38135 38129,32052 32042,31919 31893) Alignment (genomic DNA sequence = upper lines): GGATGAAGCC TCACCATTTC TCTGAAGATG ATAAATCTTT GATTCACAAA TCAAAGAGAA 38547 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGATGAAGCC TCACCATTTC TCTGAAGATG ATAAATCTTT GATTCACAAA TCAAAGAGAA 164 CAACCAGAGA AAAGCTCTCA AAATCAAAAC AAAATTCCAA AGAAAATCAA AATCAAAGAA 38487 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAACCAGAGA AAAGCTCTCA AAATCAAAAC AAAATTCCAA AGAAAATCAA AATCAAAGAA 224 GAAAGAGTTC AAACATGGAA GATCCCATTA GAACTATCAT GTTCTTGGGG TTTTGGACCC 38427 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAAAGAGTTC AAACATGGAA GATCCCATTA GAACTATCAT GTTCTTGGGG TTTTGGACCC 284 ATTCTTAAAC ATTAAAAGGG GATTTTCACA CTGTATCGCC ACCGTCCAAA ATAATTATCT 38367 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTCTTAAAC ATTAAAAGGG GATTTTCACA CTGTATCGCC ACCGTCCAAA ATAATTATCT 344 ACGGATGTAT AATATACGCA AAATTATACA TCATTAGTGT ATAATTTATG TATATTTATA 38307 |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACGGATGTAT AATATACGCT AAATTATACA TCATTAGTGT ATAATTTATG TATATTTATA 404 AGTTATAATT ATTTTCGGAG GACGGTGAAA ACCAAAAAGA ATCGTATTCC CTGTATTCAA 38247 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGTTATAATT ATTTTCGGAG GACGGTGAAA ACCAAAAAGA ATCGTATTCC CTGTATTCAA 464 AATATGTACT CCCATATCTA TTTTGTACAG TAAAACTATA TTTCTTAAAA ACAGACTCTT 38187 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| | | | AATATGTACT CCCATATCTA TTTTGTACAG TAAAACTATA TTTCTTAAAA AAAAA..... 519 TCTCTGTAAT TTTCAATTCA AAAAAAAATC TTTAGTCATT TCATTATATA GAGAACATAT 38127 | || | .......... .......... .......... .......... .......... .AAAAAAA.. 526 ATACTTCTCA TCCTGCTTGA CAAACAAGTG AATAAGCATA CAGAATCAGG AAGCTCTAAA 38067 .......... .......... .......... .......... .......... .......... 526 AAATCAATAA GATATCCACA GATTAAAAAC AAAGCACCTA AAGAAGATAA GAAACCTTTG 38007 .......... .......... .......... .......... .......... .......... 526 ATTTGTCATC ACAAGTCAAC AACTAAAACA AAAACCTAAT TATTCCATAA AAATTACTCC 37947 .......... .......... .......... .......... .......... .......... 526 ATCCATTTCA ATTTACTTGT CTTGTTTTTA TGATAAAGTT GCCACATGGT TTGATTACGC 37887 .......... .......... .......... .......... .......... .......... 526 ATAGATTAAC ATGTTATAGT GTTAGGTGAA CACTCCATCA ACAATTTCAC AGGATACCTA 37827 .......... .......... .......... .......... .......... .......... 526 TTACCTCTCC ATCGGCTAAA TAAGAAGACA GCCCAGAAGA AAACCCACAA ATACAGTGAG 37767 .......... .......... .......... .......... .......... .......... 526 GAACATCACT CTAGCAGTTG ACTGTACCTT GCACTTTTGT CTACTCTATG ACTCATTTAG 37707 .......... .......... .......... .......... .......... .......... 526 CAATATTCTC GACCTAAAGG GTTGCATGCA AATTTTAATG CCACACTAAT TCAGGACTAT 37647 .......... .......... .......... .......... .......... .......... 526 AAACCACAAA ACAACCAAAA CTAAGAAGAA ATGGGCTTAT ATAAATCACT GATTTATGAT 37587 .......... .......... .......... .......... .......... .......... 526 ATTTCCTCCC ATAAAGAAAT TAGGTAACTG GGAGCTCATA TAACAACGCA AGAGAGAACA 37527 .......... .......... .......... .......... .......... .......... 526 AAAATCTGGG AATAATGTCC TCCGGATTTT GAATCATCTG ACCAAATAAA AGTGTCCACA 37467 .......... .......... .......... .......... .......... .......... 526 AAATTTAACC CATAAGTATC CTTTATTAAT CACTTCTTAA TAATGTGCAT AGTACAAAAT 37407 .......... .......... .......... .......... .......... .......... 526 TTTCCATGCG GCAACTAAGT TGATTGAAGG GATTGCAAGT CTGTAGCACA GGCTTCAACA 37347 .......... .......... .......... .......... .......... .......... 526 ATGAAAAAGC AAGATAAAAT CGTAATGGAT GAACACTTAT CTCCATGTTA CTTGAGTTTT 37287 .......... .......... .......... .......... .......... .......... 526 TCAAATAGCT TTGGGGCAAC CTGTGTAACA ATTCCATAAA AAGTTTAGAA ACAAAACGAA 37227 .......... .......... .......... .......... .......... .......... 526 ATGCTAAATA GAGCAACAAC GGAGAATAGA GAAATAGAGC TTACACATTT GTCAATGCAT 37167 .......... .......... .......... .......... .......... .......... 526 TGCCAATAAT CAAAATACTG TCCAGTGCAA TGCTTGCTCC CTGATTCATC ACCTTCTATC 37107 .......... .......... .......... .......... .......... .......... 526 CTTCTAGTAC ATGCCTGTAG GATGCAAAAT ACCCAAATAA GAGCCACAAA GTATGAAAGT 37047 .......... .......... .......... .......... .......... .......... 526 CGGTACAAAT TTTAAATACA TTTGCACAAA TTCAGAACAA ATGAACGAGA CATAAATGTT 36987 .......... .......... .......... .......... .......... .......... 526 CAGGAAGTGC ACAGAACAGT TAAATGCAAT TGTTCTGAAG ATCAAACGAA TGCTACTGTA 36927 .......... .......... .......... .......... .......... .......... 526 CAGTTATCTA AGGTGGTAGT TGTCACAAAT CAATTAGATA TCATCCTTCA AGCATAACAC 36867 .......... .......... .......... .......... .......... .......... 526 ACAAAAATCC CAGATTCCCC ATGTCCAAAG CACATCCTTT GCACAAACCA TTAAGTTGAT 36807 .......... .......... .......... .......... .......... .......... 526 TAAAAGATTT AGGCAGAGGG AGTAGATGAA TAACAAGAAG TTGAAAGCCA AAACTAACTC 36747 .......... .......... .......... .......... .......... .......... 526 GGAAAGGTTT AACGGGATCG AAATTAAAAC TCTGAGTCTT TGATAATCAA AACCTTACAG 36687 .......... .......... .......... .......... .......... .......... 526 CACTGTGTTC TTCATAAAAA ATTGTTACAG CAAATTGTTG GCAATAGAAA GAACAATGGA 36627 .......... .......... .......... .......... .......... .......... 526 CCCGATAGAC ACATATGATA CAGAAAAATG AGCAGAGGAA ACAAAAAAGA TCATAGAAAT 36567 .......... .......... .......... .......... .......... .......... 526 ACTAGCATTC TGCTGACCAG TTGAATCTGG AGAGCCAATA TGAGAGGAGG TCGGTGGGTG 36507 .......... .......... .......... .......... .......... .......... 526 TACTTGGAGG CCCATAAACA TGCCTTTTTG GCTACTTGGA AAGACCAAGC AGTTACAGAT 36447 .......... .......... .......... .......... .......... .......... 526 TAAGTCTCTT CGTCACATGT TGGTTCATCC CATTAGAGTG ACATAACAAG AAAATTCTCC 36387 .......... .......... .......... .......... .......... .......... 526 TGGCTTACTA TATCAGTTGA TTTGGTAATT AGAGTAGCTG AAAGTTCAAG TGATACATAC 36327 .......... .......... .......... .......... .......... .......... 526 TATGTTAGGT CATATGTCGG CCATAGTACT TGATGTTTTC CAAATTTACA ATTTTCAGGT 36267 .......... .......... .......... .......... .......... .......... 526 CTAACATGAC ATTGCCAAGA CATTTTTGTG TCACTCTGAG TTCTTGTGGA GACCGTACAG 36207 .......... .......... .......... .......... .......... .......... 526 GATAATGTTA ACAATAATAA AAATTAGCCT TCCACCACCG CAAACAATCA AGTCCAAGTT 36147 .......... .......... .......... .......... .......... .......... 526 ACAGCTCACC CAATAACAAT TATATAAGAA CAAGAAAATC CCTTTGATAT AACTGAAAGG 36087 .......... .......... .......... .......... .......... .......... 526 CTGTTTTGTT ATGGTGGGAA TCACCTGATA ATCCTTTAGT TGCCTTACAC ACTTAGGCTT 36027 .......... .......... .......... .......... .......... .......... 526 GCAAGATACT TCCATTGTCG CCTTTGGGTC AACAACTTCC TCGTCCGACC TGAACAAAGG 35967 .......... .......... .......... .......... .......... .......... 526 AAACCAACAG CAATTTATCT TCAAAGAGAT GGCAATACTT GTGTGTGTGT GTGTGTGGTG 35907 .......... .......... .......... .......... .......... .......... 526 GTGGTGGTGG GGGGGGGGGG AGTCACTGAG TGTCACTTGT CATTAATTAC AACAGTCTAA 35847 .......... .......... .......... .......... .......... .......... 526 CTCTGTGTTT GTAACATGAG CTCGATATTT GTAAGTCATT GAGGACTATC GTGTAAGAAT 35787 .......... .......... .......... .......... .......... .......... 526 AATATCAGAT GCTTCTGACA TAATAGTTAC TCCCTCCGTC TCATTTTATG TGGCACCATT 35727 .......... .......... .......... .......... .......... .......... 526 TCCTTTTTGT CTCAGTCCCA AAAAGAATGT CACATTTTCT TATATGGTAA GTATTTAAAG 35667 .......... .......... .......... .......... .......... .......... 526 GTGCAATTCC TCTTTTACCC TTGTTGGTCC CACTTAATTT TACATAGTAC TCTAATACGT 35607 .......... .......... .......... .......... .......... .......... 526 TTATGAGAAA AGAGAAAAGT GAGTCTACTT TTGAAGGACA ATTTAGTAAA CAATTCAAAG 35547 .......... .......... .......... .......... .......... .......... 526 TTTTCATTTT TTTAAAACTC CGTGCTAGGT CAAATTGTGC CACATAAAAT CGGACGGAGG 35487 .......... .......... .......... .......... .......... .......... 526 GAGTATTAGG AAAAATAGCA TGAAAATTTC TCTCCACCTA TAATGTCTGC AAGGAATACA 35427 .......... .......... .......... .......... .......... .......... 526 AGCTTACGAA GAGGGCCAAC ACTATTCACA AACATCTCAT TGCAGTGCTC CTTCGATTAA 35367 .......... .......... .......... .......... .......... .......... 526 CTTAATGTAA GCTAAGTAAA AATGGACAAT GGTGTTTCTC GCCAACTAGT TACATCAACA 35307 .......... .......... .......... .......... .......... .......... 526 AACACACTGA GAACATGGTA TATCACAAGC AAATACTTGT GCATAAAACA AATTTTTAAA 35247 .......... .......... .......... .......... .......... .......... 526 AAATCAATCA CAAAGAAGTT GAGCCTTTGA CAGAGCATTT ATGGAAGGTT TCAACACGGA 35187 .......... .......... .......... .......... .......... .......... 526 ATGTACACTC CAAAACTTTA CTTCACACTA AAAGCAATAT AATACTATCC ACTACGGCTA 35127 .......... .......... .......... .......... .......... .......... 526 ATAAATGGAT AGAGTTGTAA ACCAATCAAA TAGGGTAGAA TATATAGTTG CGTACAAATT 35067 .......... .......... .......... .......... .......... .......... 526 ATTAGAATTG ACACATGAAA GTAATCTTTA ATACATGAAG TTACACCAAT GAATGAATAA 35007 .......... .......... .......... .......... .......... .......... 526 GGTAAAAATA ATTAGCTGTT GCATCAGAGG GAGGATAAGG ATTATGTCAA AGGCTCAAAG 34947 .......... .......... .......... .......... .......... .......... 526 CACTTCTGAT TAACACATGA AATCAAGTAG TATCTGTAGA AGTTCGGTAA GTGTGCATTT 34887 .......... .......... .......... .......... .......... .......... 526 GCTCAGCATA GCGCCATATA CATGCTAATT GCCTGATATG CGCATAAATA CATATATATC 34827 .......... .......... .......... .......... .......... .......... 526 TGTTGGAATC TGTGGCACAC TTGAAGTCTG TGCATATTAT TTAAGGAACT TCCACGTATT 34767 .......... .......... .......... .......... .......... .......... 526 TTTCATTTTT AAAGATAAAT GTCTAAGATC TAGAGAATTG AAATTTTAAA ATGTTTTATG 34707 .......... .......... .......... .......... .......... .......... 526 CAACAAAAGA AATAGACGAT AGCAAGCTAA CATGGTTGAA GACTTTAGTC CCATGACATT 34647 .......... .......... .......... .......... .......... .......... 526 GATGTTGGAC TTTATTAACC GAATCCAAAA TAACAGGATA AATTGGTACT AGGACACTCG 34587 .......... .......... .......... .......... .......... .......... 526 CACCCACTCT CCAATCAAAT CCAATGAAGG TATTTCAATA AATGACGTAC TGCTTCTTGA 34527 .......... .......... .......... .......... .......... .......... 526 AGAGATACAT AATTCATCCT TCTACTTCGG AATATTAGCT CACTAGTCAA TACTTTTTAA 34467 .......... .......... .......... .......... .......... .......... 526 ACAGACATAC AAAAAATGGT AATTCTTTGC ACATTAGAAA GCAAAAACAT TTTGAGTAAA 34407 .......... .......... .......... .......... .......... .......... 526 ATCAAACTAT TTTTTCCTCT CTTTTTGGTG GCTGCAAGCA TAATTGAACT GCTATTTTTC 34347 .......... .......... .......... .......... .......... .......... 526 TACATTATAT AAACAATGGG AAAAATCACA AAAAAATAAT AATATTACAA TAGGATTTCC 34287 .......... .......... .......... .......... .......... .......... 526 AGCAACTTCA TTTCTCTCAA CATAGAGGTC CAACTAACAC TTTGCTCTTT GACAGAATAA 34227 .......... .......... .......... .......... .......... .......... 526 AGATGACCAT TCTATTTGTG TTTTTTTACT GTTCTAGATA GAATTTCCAT CAACTTCATT 34167 .......... .......... .......... .......... .......... .......... 526 TCTCTCAACA TAGAGGGTCC AACTAACACT TTGCTCTTGG ACAAAATAAA GATGACGATT 34107 .......... .......... .......... .......... .......... .......... 526 TTATTTCTAT TTTTTTACTG TTCCAGATCA AAATTCTATA TAAACTATTT AGGGAATCTC 34047 .......... .......... .......... .......... .......... .......... 526 AATTTCTACA GTGTATAATT GAAAAACTAT TTAAAAATCA CTCACATATA TGGTAAGGAG 33987 .......... .......... .......... .......... .......... .......... 526 AATAAATTAC TCATAAACAG TGAAATCCTA TTAGAATAAA TGGATTATAC AAACGAAACG 33927 .......... .......... .......... .......... .......... .......... 526 AAACACATAA ATCACATCAA AATTTAATTA CTTTAATAAA GAGAGAATAG TTTCATTATA 33867 .......... .......... .......... .......... .......... .......... 526 AATAGATCAA CAGTACTCAC ATGGCAACGA GATATTGGAT ACGAAGACTT TGATTCTACA 33807 .......... .......... .......... .......... .......... .......... 526 AAAAAACCTG CAAATGCAAT TTTTACACTA ATCAATTTTG GATACGAAGA CATTGATTCT 33747 .......... .......... .......... .......... .......... .......... 526 ACGAATTCAC CAGCAAATGC AATTTTTACA CTAATCAATA CAAAATTAAC CAGCTATCAA 33687 .......... .......... .......... .......... .......... .......... 526 GCAAAAAAAA CACAATTCTC CATGTGTGCG AATGCAACAA AACAAGATAA AATAGCAAAA 33627 .......... .......... .......... .......... .......... .......... 526 CCTAATATTA TGCAGATCGA ATGAACACAT AAAGAGGAAG CAGATCAAAT GCGTAGAAGA 33567 .......... .......... .......... .......... .......... .......... 526 GGAGCAACCT GACTTTTCTC TGAATGAGGT GCGAATGGTG AAGAGAAGAC AGAAGAGAGA 33507 .......... .......... .......... .......... .......... .......... 526 GAGAGTTAAA TTAAAAAGGC TTTTTTATTG GGCTTTTTGA ATTGGTCTTC TAGCTTTACC 33447 .......... .......... .......... .......... .......... .......... 526 CATTAAAGGA AGTAGAAAAC TTATGTACAT ATATGATAAT AAAAAAATAT TTATCATTTA 33387 .......... .......... .......... .......... .......... .......... 526 TATTAATAAT AATATTTTTT TATTTGATTA TTTTTAGTTT ATTTATAAAA CAAGTTATAA 33327 .......... .......... .......... .......... .......... .......... 526 TAATAAATAA TACATTTATC ACATATTTTA ATATATTTAT AATACAATAT GATAATTTTT 33267 .......... .......... .......... .......... .......... .......... 526 ACCAAACAAA CACAATATAT TTCAAAAATA ATTATAATTC AAATATATTG CATACATAAT 33207 .......... .......... .......... .......... .......... .......... 526 TCATTTTTAA TACATATTAC AGATTTATCA CAATATTGTT ATAAATAGTA ATAAACAAAA 33147 .......... .......... .......... .......... .......... .......... 526 AATATCGCTA AAATCAGTAA CTATTTTTTA AAATATATTA ATTTATGTAA TTTTTCAATG 33087 .......... .......... .......... .......... .......... .......... 526 AGGTATGGAG TAGGTTGTAC AAAATCGAAT CGAATTGTAA ATTGAGTCAA ATCTTTAAAA 33027 .......... .......... .......... .......... .......... .......... 526 AAAAATTGAC TAGTGATTTA GTGTTGAAAA AAATCGATTA TATTTGGGTT GGTTTGATTT 32967 .......... .......... .......... .......... .......... .......... 526 CAGCTAAAAG AAACACAACC CGGGACCAAA CCAACCCGAC ATTATATATA TAATTTTAAA 32907 .......... .......... .......... .......... .......... .......... 526 ATTTTTATTT TATACGTAAA ATAACTACTT TGATATAATT TTTAAATTAT CTCTTATGCT 32847 .......... .......... .......... .......... .......... .......... 526 TTTACATAGT TTTAATCTTT TAATATATTT ATTTCATGTT TGGAAGTTAA AATTCTTAAT 32787 .......... .......... .......... .......... .......... .......... 526 GATCTAATAA AGATTATAGT CCATACATGT TGGTAAATAT AATAAAGTTT AAATAAAAAT 32727 .......... .......... .......... .......... .......... .......... 526 CAAATTAATA CTAATACAAA AAGAAAATCA ATTCAACACT AAGAATGACA ATAATATTGA 32667 .......... .......... .......... .......... .......... .......... 526 ATATTTATTT TTTAGTTATA CATAGATTTA GACAATTAAA ATACATGATC TAATTTTACT 32607 .......... .......... .......... .......... .......... .......... 526 TTCTTTTAAT ATTTAATCAT GTAACTAGTA CTTACTAAAC TTATTTTTAG CATGATTTAG 32547 .......... .......... .......... .......... .......... .......... 526 TACTTTAAAT TATAATCAAT TTCATTATGA CTTATTAATT TGCAATATTG TTTTACGCGA 32487 .......... .......... .......... .......... .......... .......... 526 TTTTATTATT TATTTTTTTG TTGGATATTT TAGTGTCATT AATCATATAA TATTTTGTGT 32427 .......... .......... .......... .......... .......... .......... 526 TATTTTCTTG AGAAATAACT TAGATAGTTG CATTTTAGTA GAAATAAAGA AATATTTAAA 32367 .......... .......... .......... .......... .......... .......... 526 GTACAAGTAA ATTATATGTT TGTATGAATA CTTTACTGAA AAAATCCGAA GTTGAAAACC 32307 .......... .......... .......... .......... .......... .......... 526 TCGAGTTTTA TTGCTTTGAT TTGGTTTGTA AATTCAAAAA TCCGACACAA ATAATTTGAT 32247 .......... .......... .......... .......... .......... .......... 526 TTAATATTTG AAAAACCCAA ACCAATCCGG TCATGTAAAA GTGTCCAATT TTGTGTATTT 32187 .......... .......... .......... .......... .......... .......... 526 TCGATAATTC TTAAAAATAT AATTTTAAGT TGAAAATAAA TTGCAAGAAA AAATTTGTGT 32127 .......... .......... .......... .......... .......... .......... 526 TGCCTATAAA GTCGAATTTA TGGTCAATCG AGTACGTTCA CTATCTCAAA ATATTTTAAA 32067 .......... .......... .......... .......... .......... .......... 526 TTTTTGACTT GTAGGAAAAA GAAAAGTTAA GGAACTTTGA TGGACAAGAA AAGTTTCATC 32007 ||||| |||| .......... ....AAAAAA AAAAA..... .......... .......... .......... 537 TATATGTATT AACTTTTGTC TGGGAGGTAC AACTTGTGAT ATATTTAATT TCGTTGTCTT 31947 .......... .......... .......... .......... .......... .......... 537 CAAATATAAT AAAATTATAT TTAATAGTAA AAAGTAAAAA CTAATTTATG CCCG 31893 || ||| ||||| || || |||| .......... .......... .......AAA AAAAAAAAAA AAAAAAAATT CCCG 564 hqPGS_C09HBa0099P03.1-2-_SGN-U342268+ (38606 38192) ******************************************************************************** EST sequence 32 +strand 915 n (File: SGN-U320670+) 1 TCTATCATAA TCCTAATTTA TTATTCTTGA AATAATGGCC ATCAAAGTCC ATGGTATCCC 61 CTTGTCAACT GCAACCATGA GAGTTATTTC TTGCCTTATT GAGAAGGATT TGGATTTTGA 121 GTTTGTCTTT GTTGATATGG CCAAAGAAGA ACACAAGAGG CCCCCTTTCC TCTCACTCAA 181 TCCTTTTGCT CAAGTACCAG CATTTGAAGA TGGAGACTTG AAGCTCTTTG AATCAAGGGC 241 AATCACTCAA TACATTGCTC AGGTTTATGC TAGCAATGGC ATTCAACTAA TACTCCAAGA 301 TCCAATGAAA ATGGCCATTA TGTCAGTATG GATGGAAGTA GAAGGCCAAA AATTTGAACC 361 ACCAGCTTCA AAATTAACAT GGGAGCTAGT CATAAAACCA ATGATTGGCT TGGGCAGTAC 421 CGATGATGTT ATTGTGAAGG AAAGTGAAGA ACAATTGTCT AAGGTTCTTG ACATCTACGA 481 AACTCGATTG ACAGAGTCAA AATACTTGGG TGGCGACTCC TTTACACTTG TTGATTTGCA 541 TCATATACCA AATATATACC ATCTGATGAA TACAAAAGCT AAGGCACTGT TTGATTCGCG 601 CCCTCGTGTG AGTGTATGGT GTGCTGATAT ATTGGCTAGG CCAGCTTGGG TGAAGGGGTT 661 GGAGAAGATG CAAAAATGAA AAAAAGTCGT GAATTAATGG ATGATCATAA TTCATATATA 721 TGTTTTTGTT TTGAAGCATT TGTGTCTTAA TATGTTGTGT TTCTTGTCTG AAGATGTTTG 781 TCTTGCAATA CAATAAACAG TGATCTATAT CTATGTGATT TTACTAATTG TACTGATGTA 841 AAATATGCTA TGTTCCGGTC ATTTATAAAA TAATTGCGCG CTATATTTTT GTGAAAAAAA 901 AAAAAAAAAA AAAAA Predicted gene structure (within gDNA segment 42656 to 38834): Exon 1 42056 41876 ( 181 n); cDNA 1 181 ( 181 n); score: 0.989 Intron 1 41875 41285 ( 591 n); Pd: 1.000 (s: 0.98), Pa: 1.000 (s: 1.00) Exon 2 41284 41236 ( 49 n); cDNA 182 230 ( 49 n); score: 1.000 Intron 2 41235 40318 ( 918 n); Pd: 0.954 (s: 1.00), Pa: 0.996 (s: 1.00) Exon 3 40317 39655 ( 663 n); cDNA 231 893 ( 663 n); score: 1.000 PPA cDNA 894 915 MATCH C09HBa0099P03.1-2- SGN-U320670+ 0.998 893 0.976 C PGS_C09HBa0099P03.1-2-_SGN-U320670+ (42056 41876,41284 41236,40317 39655) Alignment (genomic DNA sequence = upper lines): TCTATCATAA TCCTAATTTA TTATTCTTGA AATAATGGCA ATCAAAGTCC ATGGTATCCC 41997 |||||||||| |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| TCTATCATAA TCCTAATTTA TTATTCTTGA AATAATGGCC ATCAAAGTCC ATGGTATCCC 60 CTTGTCAACT GCAACCATGA GAGTTATTTC TTGCCTTATT GAGAAGGATT TGGATTTTGA 41937 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTGTCAACT GCAACCATGA GAGTTATTTC TTGCCTTATT GAGAAGGATT TGGATTTTGA 120 GTTTGTCTTT GTTGATATGG CCAAAGAAGA ACACAAGAGG CACCCTTTCC TCTCACTCAA 41877 |||||||||| |||||||||| |||||||||| |||||||||| | |||||||| |||||||||| GTTTGTCTTT GTTGATATGG CCAAAGAAGA ACACAAGAGG CCCCCTTTCC TCTCACTCAA 180 TGTAAGCATA AATAATTACT CCCTCTGTAC AGCTACATAA ATATTTAAGA ATTGTTTGAC 41817 | T......... .......... .......... .......... .......... .......... 181 CATAAGTTTC AAAAGTTCTT TTCTTTAAAC ATGGTGTCAA GTCAAATGGT GTCAAATAAA 41757 .......... .......... .......... .......... .......... .......... 181 ATGGGACGGT TGAAGTACAT TATTAGCATG TTTGATCAAG TTTTGGAGAA GCTAAAAGTA 41697 .......... .......... .......... .......... .......... .......... 181 TTTCTTTTTA AATGTTTATT TTAGAAATTT GAGATGTTCA GTGTTTTTTA GTAGCAGCAT 41637 .......... .......... .......... .......... .......... .......... 181 AAACTGAACT AAAAACACTT TTTTGAAACT TTGGTCAAAC ACAAATGTTG CAAAAACATT 41577 .......... .......... .......... .......... .......... .......... 181 TGTCATATTG ATGGCAAATA CAAATTGTCA TCGTCCAAAA TACTCTTTAA ATACAACACT 41517 .......... .......... .......... .......... .......... .......... 181 TTTTGAAATT AATAATTTCT AAAATGCTAA ACAAACTATA AATTTATTTG GTAGTGAATG 41457 .......... .......... .......... .......... .......... .......... 181 TTACTAGTGA AAAAAAATAA ATCTATTTGG TAGTGATTTC TACACAGATT TATATTTGTC 41397 .......... .......... .......... .......... .......... .......... 181 AAAAATATTT TAATTGGTTC ACTTGTACTT GCTATCAATC CATTTTTTTC CTTTAATTTA 41337 .......... .......... .......... .......... .......... .......... 181 AAATCATGAA TTCATATAGT ATATTAAAAT GATATTTTCT CAATGGGTGC AGCCTTTTGC 41277 |||||||| .......... .......... .......... .......... .......... ..CCTTTTGC 189 TCAAGTACCA GCATTTGAAG ATGGAGACTT GAAGCTCTTT GGTAAAGTGT TTTAGCTAAT 41217 |||||||||| |||||||||| |||||||||| |||||||||| | TCAAGTACCA GCATTTGAAG ATGGAGACTT GAAGCTCTTT G......... .......... 230 CTTACAATTT GTAGTATATG TCTGCACCAA GGTGTTAAAT TTGAAAAGAT TTAAGTTACA 41157 .......... .......... .......... .......... .......... .......... 230 TATACATAAA CATTATTAAA TATTTTTTAT ACTATCAATG TAATCTATCA TGTTATAGTA 41097 .......... .......... .......... .......... .......... .......... 230 GGTTACTCAT TATATTTACT AATTATTAGT GCTTATAATA GAATATTTAA AATATAACAC 41037 .......... .......... .......... .......... .......... .......... 230 AAAAAAATGC TCTTTAACCT TACCTCATCT GACATATATG CTCACCAACT TAGGATTTGC 40977 .......... .......... .......... .......... .......... .......... 230 ACGAGTGAAC AATTAAACTT GTATATAATT GAATAAGTGG ACACACATCC TACATGACAA 40917 .......... .......... .......... .......... .......... .......... 230 TTTGCAATCT TACATGGTGT CCTACATGTA TTAAGTCATT TTAGACATGC GTGTCTACGT 40857 .......... .......... .......... .......... .......... .......... 230 GTTCAACTTT ATATAAATTT AAATGTCTAC TTATGTACAT ATAAAATTGG ACAGATGTCA 40797 .......... .......... .......... .......... .......... .......... 230 GCTAAGTTCA AATTAAAAAA CTATTTTTAT GTATTGTGCC AAATTTTTTT ACATTACCAT 40737 .......... .......... .......... .......... .......... .......... 230 ACAAGTTGGA TCAGAGTTAG TAATACAGGA GTTAAACTAA ATCCTCTCCG CCGAATAATT 40677 .......... .......... .......... .......... .......... .......... 230 ATATTTATGC AAATAGAGCA AGACCAACTG TTATATCCAA CCTTTTCCCC CTTGACATAA 40617 .......... .......... .......... .......... .......... .......... 230 AAGGAAGATT TTACTAGTGA TAAAGGGGTT CATAAGTTCA ATCTTAACTT TGACATCCTA 40557 .......... .......... .......... .......... .......... .......... 230 ATATATATTT TTGAACCCTT TTTATATATA TAAAGAAGAA TTAACTCAAC GAGTCACCAC 40497 .......... .......... .......... .......... .......... .......... 230 CCAATCGTTC AACACATGTA TAATTTATGT TTATCAGTTG GGAAATGTAA ACAAGTGAAT 40437 .......... .......... .......... .......... .......... .......... 230 CCATACAGTT AATTATGTAA AGATTCCTTA TATGAATTCT TGTTTCTACC ATTGCTGATA 40377 .......... .......... .......... .......... .......... .......... 230 AGTGACCTTT TCTTTAAAAC TTATCAATTT GAATGACAAT ATGTGCACTT GTAATGCAGA 40317 | .......... .......... .......... .......... .......... .........A 231 ATCAAGGGCA ATCACTCAAT ACATTGCTCA GGTTTATGCT AGCAATGGCA TTCAACTAAT 40257 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATCAAGGGCA ATCACTCAAT ACATTGCTCA GGTTTATGCT AGCAATGGCA TTCAACTAAT 291 ACTCCAAGAT CCAATGAAAA TGGCCATTAT GTCAGTATGG ATGGAAGTAG AAGGCCAAAA 40197 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACTCCAAGAT CCAATGAAAA TGGCCATTAT GTCAGTATGG ATGGAAGTAG AAGGCCAAAA 351 ATTTGAACCA CCAGCTTCAA AATTAACATG GGAGCTAGTC ATAAAACCAA TGATTGGCTT 40137 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTTGAACCA CCAGCTTCAA AATTAACATG GGAGCTAGTC ATAAAACCAA TGATTGGCTT 411 GGGCAGTACC GATGATGTTA TTGTGAAGGA AAGTGAAGAA CAATTGTCTA AGGTTCTTGA 40077 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGGCAGTACC GATGATGTTA TTGTGAAGGA AAGTGAAGAA CAATTGTCTA AGGTTCTTGA 471 CATCTACGAA ACTCGATTGA CAGAGTCAAA ATACTTGGGT GGCGACTCCT TTACACTTGT 40017 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CATCTACGAA ACTCGATTGA CAGAGTCAAA ATACTTGGGT GGCGACTCCT TTACACTTGT 531 TGATTTGCAT CATATACCAA ATATATACCA TCTGATGAAT ACAAAAGCTA AGGCACTGTT 39957 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGATTTGCAT CATATACCAA ATATATACCA TCTGATGAAT ACAAAAGCTA AGGCACTGTT 591 TGATTCGCGC CCTCGTGTGA GTGTATGGTG TGCTGATATA TTGGCTAGGC CAGCTTGGGT 39897 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGATTCGCGC CCTCGTGTGA GTGTATGGTG TGCTGATATA TTGGCTAGGC CAGCTTGGGT 651 GAAGGGGTTG GAGAAGATGC AAAAATGAAA AAAAGTCGTG AATTAATGGA TGATCATAAT 39837 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAAGGGGTTG GAGAAGATGC AAAAATGAAA AAAAGTCGTG AATTAATGGA TGATCATAAT 711 TCATATATAT GTTTTTGTTT TGAAGCATTT GTGTCTTAAT ATGTTGTGTT TCTTGTCTGA 39777 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCATATATAT GTTTTTGTTT TGAAGCATTT GTGTCTTAAT ATGTTGTGTT TCTTGTCTGA 771 AGATGTTTGT CTTGCAATAC AATAAACAGT GATCTATATC TATGTGATTT TACTAATTGT 39717 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGATGTTTGT CTTGCAATAC AATAAACAGT GATCTATATC TATGTGATTT TACTAATTGT 831 ACTGATGTAA AATATGCTAT GTTCCGGTCA TTTATAAAAT AATTGCGCGC TATATTTTTG 39657 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACTGATGTAA AATATGCTAT GTTCCGGTCA TTTATAAAAT AATTGCGCGC TATATTTTTG 891 TG 39655 || TG 893 hqPGS_C09HBa0099P03.1-2-_SGN-U320670+ (42056 41876,41284 41236,40317 39655) ******************************************************************************** EST sequence 31 +strand 808 n (File: SGN-U320669+) 1 AAATATAGAG TCACTGGATG ACTTATTGGA GAATTATGGT GGAAATGAAG TACAACAGTT 61 CGAGGAGAAT TTGGTCTCAT CTGAAGTAGC AGTTGTACAT GATCCAAATG AGCATTCCAT 121 GGCTGAGGTT CTGGATCACT TTCAGCATAC AAGTTCCTCA CGAGGCAATC CTAAAATGCT 181 GCAAACAAAA ATACCTGGAT CACGGTTCTT ACGGAAGAGA AACTTATTGC TGCTTGGTGA 241 CAGAAACATG AGCAATGGCG AACAACCTGA GGAACTAGAT AGTGATCCAT CTAGTGATGA 301 GGATGTAAAT GAAGTTCCCC AGATTCTGAA GTCTGCTATA CCTCAGAGGA CCATGGCTGA 361 CCAATTTCAT CTAGCGTTAG GAGCTGTATC CACAAATGAG AGGCTATGTA TTGCAAGGCC 421 TAAGCAATTT GGTTTATCTG GAAGGTTGCA GCACGTGATG CAATGTGAAA AGGACAGAGA 481 TACATATTTT TTGGAGAAGT CACAAACACA TGCTGCTTCA AGTGGTGCAG AAAGCTTCAT 541 TGATGTGAGA ATTTTGTCAA GTTCTTTGGA GGCCAAGCTG ACTGTTTGTT TTTGTGCTTT 601 ACATGGAGAT GAAGAGGAAG GAGGTACATG TGAACGAGAA AGACGAGGCC ATCATTTTGT 661 GTGCATATTT CTCTCAGATT TAGTCTTTGG ATCACATTTT ATACCAAGTC CATGGTATCC 721 CCTTGTCAAC TGCAACCATG AGAGTTATTT CTTGCCTTAT TGAGAAGGAT TTGGATTTTG 781 AGTTTGTCTT TGTTGATATG GCCAAAGA Predicted gene structure (within gDNA segment 46088 to 41300): Exon 1 45488 45311 ( 178 n); cDNA 1 178 ( 178 n); score: 1.000 Intron 1 45310 45195 ( 116 n); Pd: 0.933 (s: 1.00), Pa: 0.823 (s: 1.00) Exon 2 45194 45063 ( 132 n); cDNA 179 310 ( 132 n); score: 1.000 Intron 2 45062 44985 ( 78 n); Pd: 0.829 (s: 1.00), Pa: 0.994 (s: 1.00) Exon 3 44984 44863 ( 122 n); cDNA 311 432 ( 122 n); score: 1.000 Intron 3 44862 44743 ( 120 n); Pd: 0.900 (s: 1.00), Pa: 0.982 (s: 1.00) Exon 4 44742 44651 ( 92 n); cDNA 433 524 ( 92 n); score: 1.000 Intron 4 44650 44582 ( 69 n); Pd: 0.275 (s: 1.00), Pa: 0.953 (s: 1.00) Exon 5 44581 44490 ( 92 n); cDNA 525 616 ( 92 n); score: 1.000 Intron 5 44489 42795 (1695 n); Pd: 0.987 (s: 1.00), Pa: 0.999 (s: 1.00) Exon 6 42794 42703 ( 92 n); cDNA 617 708 ( 92 n); score: 1.000 Intron 6 42702 42010 ( 693 n); Pd: 0.884 (s: 1.00), Pa: 0.000 (s: 1.00) Exon 7 42009 41910 ( 100 n); cDNA 709 808 ( 100 n); score: 1.000 MATCH C09HBa0099P03.1-2- SGN-U320669+ 1.000 808 1.000 C PGS_C09HBa0099P03.1-2-_SGN-U320669+ (45488 45311,45194 45063,44984 44863,44742 44651,44581 44490,42794 42703,42009 41910) Alignment (genomic DNA sequence = upper lines): AAATATAGAG TCACTGGATG ACTTATTGGA GAATTATGGT GGAAATGAAG TACAACAGTT 45429 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAATATAGAG TCACTGGATG ACTTATTGGA GAATTATGGT GGAAATGAAG TACAACAGTT 60 CGAGGAGAAT TTGGTCTCAT CTGAAGTAGC AGTTGTACAT GATCCAAATG AGCATTCCAT 45369 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CGAGGAGAAT TTGGTCTCAT CTGAAGTAGC AGTTGTACAT GATCCAAATG AGCATTCCAT 120 GGCTGAGGTT CTGGATCACT TTCAGCATAC AAGTTCCTCA CGAGGCAATC CTAAAATGGT 45309 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||| GGCTGAGGTT CTGGATCACT TTCAGCATAC AAGTTCCTCA CGAGGCAATC CTAAAATG.. 178 TTGCTTTCTT CTTCTCCAGA TCTCTTTTCT TAAAGATCTC AGCAGCGCTT CCGTATTACC 45249 .......... .......... .......... .......... .......... .......... 178 ATTTTTACTA TTTCAATTAT GTTTTTGAAC GTAAGAATCA TGCCTATCCA ACAGCTGCAA 45189 |||||| .......... .......... .......... .......... .......... ....CTGCAA 184 ACAAAAATAC CTGGATCACG GTTCTTACGG AAGAGAAACT TATTGCTGCT TGGTGACAGA 45129 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACAAAAATAC CTGGATCACG GTTCTTACGG AAGAGAAACT TATTGCTGCT TGGTGACAGA 244 AACATGAGCA ATGGCGAACA ACCTGAGGAA CTAGATAGTG ATCCATCTAG TGATGAGGAT 45069 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AACATGAGCA ATGGCGAACA ACCTGAGGAA CTAGATAGTG ATCCATCTAG TGATGAGGAT 304 GTAAATGTAG TTAATCTTTT ACTGCTTAAT CATCACGATC AATTGCCATA TGTAGTTCTC 45009 |||||| GTAAAT.... .......... .......... .......... .......... .......... 310 TCAATTCCTT TCTTTTTGTT TCAGGAAGTT CCCCAGATTC TGAAGTCTGC TATACCTCAG 44949 |||||| |||||||||| |||||||||| |||||||||| .......... .......... ....GAAGTT CCCCAGATTC TGAAGTCTGC TATACCTCAG 346 AGGACCATGG CTGACCAATT TCATCTAGCG TTAGGAGCTG TATCCACAAA TGAGAGGCTA 44889 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGGACCATGG CTGACCAATT TCATCTAGCG TTAGGAGCTG TATCCACAAA TGAGAGGCTA 406 TGTATTGCAA GGCCTAAGCA ATTTGGGTGA GTTATGGAAC TAGAATGTAT CAGTTATGAG 44829 |||||||||| |||||||||| |||||| TGTATTGCAA GGCCTAAGCA ATTTGG.... .......... .......... .......... 432 CATATTCTGC ATGTTTTCTA TGAACAGAAA CAGAAAAATA TTCTTCAAAT TTTATCTTTT 44769 .......... .......... .......... .......... .......... .......... 432 CTTACTTGTG TTGATAAGTC ATGCAGTTTA TCTGGAAGGT TGCAGCACGT GATGCAATGT 44709 |||| |||||||||| |||||||||| |||||||||| .......... .......... ......TTTA TCTGGAAGGT TGCAGCACGT GATGCAATGT 466 GAAAAGGACA GAGATACATA TTTTTTGGAG AAGTCACAAA CACATGCTGC TTCAAGTGGT 44649 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||| GAAAAGGACA GAGATACATA TTTTTTGGAG AAGTCACAAA CACATGCTGC TTCAAGTG.. 524 AAAAGGCCAA ACTGTGTATC TCCTGTCTCT TGCTTGTTTC TGACCTTCTC CTCTGCTGTT 44589 .......... .......... .......... .......... .......... .......... 524 AATTCAGGTG CAGAAAGCTT CATTGATGTG AGAATTTTGT CAAGTTCTTT GGAGGCCAAG 44529 ||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| .......GTG CAGAAAGCTT CATTGATGTG AGAATTTTGT CAAGTTCTTT GGAGGCCAAG 577 CTGACTGTTT GTTTTTGTGC TTTACATGGA GATGAAGAGG TACTTGAACT AACTATTGTA 44469 |||||||||| |||||||||| |||||||||| ||||||||| CTGACTGTTT GTTTTTGTGC TTTACATGGA GATGAAGAG. .......... .......... 616 CATTTGCTAA ACAGAAGGTT AAAAGTATGA GCATGTCTAC TTTATTGAGG ATACAACGTT 44409 .......... .......... .......... .......... .......... .......... 616 GCTTTCAACT TTTTGAAGGT CCTTCGCGTG CTTTTCTTGC TCTCACTAGT TTCTTAAGTA 44349 .......... .......... .......... .......... .......... .......... 616 AAAATTCCCA GTACTTGTTT TGATGATAGC CAATTTTTAC TTGGTGCCTG GAACATTTAC 44289 .......... .......... .......... .......... .......... .......... 616 CATTCTCATA TAAATGGACA ACTTTTAGAA ATTCAGCATG TTGCATGGCA CAAAAAGTCG 44229 .......... .......... .......... .......... .......... .......... 616 GGTAGGGTGG CCTAGGACCG TGGATACTGA GGGTAAAGAA CTTTTACTTT CCATTATAGG 44169 .......... .......... .......... .......... .......... .......... 616 TGGTAGCTTT TGGGTGAAGT GTCGTAGTTT CTTAACTTAC TATACACCAT TTCCCTCCGA 44109 .......... .......... .......... .......... .......... .......... 616 AGCACTGTAA GAAATAAGCC CTGCAAATTA AACAAACAAT GTGGTAGTTA GTTTTTTTTT 44049 .......... .......... .......... .......... .......... .......... 616 TTTTTTGAAA TGTGTAATCA GTAAGGAACT CCCAAAATTA ATTTATCACT GTGTTCAATA 43989 .......... .......... .......... .......... .......... .......... 616 ATTGCATATT TCACTTTTTT CCCCAATAAT TGCAGTCGTT TAATAATATG GCATTTACTG 43929 .......... .......... .......... .......... .......... .......... 616 TGATCTGTAT AAACTTTTTC AGAAATGTGT TCCTTGTAGT ATCTACGACT TGACTCTTAG 43869 .......... .......... .......... .......... .......... .......... 616 GAGCTTCAGA TAATTGAAGG ACCATGGAGA CGTTATATTT GTTCTATGAT TTCTTTCTTC 43809 .......... .......... .......... .......... .......... .......... 616 TTTTGTTTTA GATCAATATT TTTCATCCTT TGTGCTTAAA CCACTTCTTG GTTTCTTCTT 43749 .......... .......... .......... .......... .......... .......... 616 TACCTCGTCT GGATCAATAT TATTTCTTAA TTCACAAAGA AGCTTGCAAA TTTAGGGTCA 43689 .......... .......... .......... .......... .......... .......... 616 GAATATGCCC ATGATCCAAA TGTCTAACCA AGGATGTTCG CTGTCACTTA CGATCTATAT 43629 .......... .......... .......... .......... .......... .......... 616 ACGTAGTTTG TACTGGAGGA TTATCTAATG CAAGATTGAA TAAATTGATA TATATTTAGG 43569 .......... .......... .......... .......... .......... .......... 616 TTTTATCTGC TTTTGTTTAA ATACAGAAGG AGGGAAAAAA GTGATCTCGA CAAACACAAA 43509 .......... .......... .......... .......... .......... .......... 616 TCAGTACTCA TTCCCCAAGC ACATAGGTCT AGAAGGGAAA AAAAGAAGTT TCATACCATC 43449 .......... .......... .......... .......... .......... .......... 616 TGGCTAGTCA TTTCGTCATA ATGGAAATTT TTAAGAATGA AAGTTGGGGT CTTAACATTT 43389 .......... .......... .......... .......... .......... .......... 616 ATTAACTGTT TATTTTACTC TAACTGGTGT TGATGGTTTC ATTTACCAAA CAGAATAAAT 43329 .......... .......... .......... .......... .......... .......... 616 CTGTGGTGTT GGGTGATGCG ATTGTCAATT TTATTTTTCT TTGCAGGGTT CTGAGTGCCT 43269 .......... .......... .......... .......... .......... .......... 616 GAGCAATCCT CGAGAAAGGA AGGGTACTGG TAGAAGGGAG TTTACTATCA TTTTTAACTC 43209 .......... .......... .......... .......... .......... .......... 616 AAGAATTTGT AAAGATGTCG AACTTGAAAT AGGGAATGTT ATCCGCATAC ATCAACCTTG 43149 .......... .......... .......... .......... .......... .......... 616 GTAGGAGGCG CTAAAATATT TTCTTTTTTC AATATTTGTT GTTTTTGTTT GTTTGGATAA 43089 .......... .......... .......... .......... .......... .......... 616 TACTCTTTTC TCGGTACCGG GTTGAGTATG ATGGCTCTTG GCACCCAACC TAAATTATTG 43029 .......... .......... .......... .......... .......... .......... 616 TGATTGAAAT GCCTAGAAGG CTAGAGCACC CTTTTGGTAG AAGTGGAGTA ACTCTATACA 42969 .......... .......... .......... .......... .......... .......... 616 TTAAATTTTC TAAAAACAAG GAGACAAAGC TAGATTGATT TTAGCCCCTC ATAGCATGGA 42909 .......... .......... .......... .......... .......... .......... 616 GGAAAGTTCT GATTCTGTCC TTGTAAATTA TTAACTGTAC CTTTTACTGA ACTTTCTTGC 42849 .......... .......... .......... .......... .......... .......... 616 ATTGCTTAAC AAATACTAAG TGATTCTTTA ACAGACTGAT TTTTTCTGGT GCAGGAAGGA 42789 |||||| .......... .......... .......... .......... .......... ....GAAGGA 622 GGTACATGTG AACGAGAAAG ACGAGGCCAT CATTTTGTGT GCATATTTCT CTCAGATTTA 42729 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGTACATGTG AACGAGAAAG ACGAGGCCAT CATTTTGTGT GCATATTTCT CTCAGATTTA 682 GTCTTTGGAT CACATTTTAT ACCAAGGTAA TAGCAAGGTT CCTTGAGATA ACTCTATTGT 42669 |||||||||| |||||||||| |||||| GTCTTTGGAT CACATTTTAT ACCAAG.... .......... .......... .......... 708 TGTACTTTGC ACTTGCCTAT ATTTTCTAAA TAGATAAAAA TATAGAAGTC TAATTTATCC 42609 .......... .......... .......... .......... .......... .......... 708 ATTTTCGCCA TAAATATTAC TCATGACTAC TGCAAAATGT ATGAATGCCT TGAAACAAAT 42549 .......... .......... .......... .......... .......... .......... 708 TGGTTGCATC TGCAAGTTTC CTGGTACATC CCCATGATGT ATCCAAGATC CTATAGTTTA 42489 .......... .......... .......... .......... .......... .......... 708 AAAGGAATTT TTATATTTTA GGAAAGAATA GAGATGGAAT GTAATTAACT CTAAACACTG 42429 .......... .......... .......... .......... .......... .......... 708 TAGGATTTAT GTAATTTTTA CAGAAAAATA AAAATAATTC TGAGTGTTGA AGTTTACACC 42369 .......... .......... .......... .......... .......... .......... 708 TCCCATAGTT TGAAGTAGTC AGTCTGTACT ATCCAGCCTG TTTGTTCAAC TTTAAAATGC 42309 .......... .......... .......... .......... .......... .......... 708 ACTTAACAAA TTGGTTTAGT CAACATATAA GGATAGTTGT GAGGTCATTA ATTACCTAAT 42249 .......... .......... .......... .......... .......... .......... 708 TAGATTATTA ATCTGTTCTT TCTCCATCTT ATAATCATGG TATTACTTAG CCTCTAATCT 42189 .......... .......... .......... .......... .......... .......... 708 TAAAGCATTT AAAAAAGGTT TGCTTAGGAT TCAAGACTTT TTGGATATAT TGATTTTTTT 42129 .......... .......... .......... .......... .......... .......... 708 AAGAATGATA TCAAGAAAAT TTTACCCTAT ATATACCCCT CCATATAACT TCATTTCATC 42069 .......... .......... .......... .......... .......... .......... 708 AACTTGGAGC AATCTATCAT AATCCTAATT TATTATTCTT GAAATAATGG CAATCAAAGT 42009 | .......... .......... .......... .......... .......... .........T 709 CCATGGTATC CCCTTGTCAA CTGCAACCAT GAGAGTTATT TCTTGCCTTA TTGAGAAGGA 41949 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCATGGTATC CCCTTGTCAA CTGCAACCAT GAGAGTTATT TCTTGCCTTA TTGAGAAGGA 769 TTTGGATTTT GAGTTTGTCT TTGTTGATAT GGCCAAAGA 41910 |||||||||| |||||||||| |||||||||| ||||||||| TTTGGATTTT GAGTTTGTCT TTGTTGATAT GGCCAAAGA 808 hqPGS_C09HBa0099P03.1-2-_SGN-U320669+ (45488 45311,45194 45063,44984 44863,44742 44651,44581 44490,42794 42703,42009 41910) ******************************************************************************** EST sequence 21 +strand 874 n (File: SGN-U337182+) 1 AACTAGTCGA TCCCCCGGGC TGCAGGAATT CGGCACGAGG TTAATTCAGG TGCAGAATTC 61 TTCATTGATG TGAGAATTTT GTCAAGTTCT TTGGAGGCCA AGCTGACTGT TTGTTTTTGT 121 GCTTTACATG GAGATGAAGA GGGTTCTGAG TGCCTGAGCA ATCCTCGAGA AAGGAAGGGT 181 ACTGGCAGAA GGAAGTTTAT TATCATTTTT AACTCAAGAA TTTGTAAAGA TGTCGAACTT 241 ACAATAGGGA ATGTTATCCG CATACATCAA CCTTGGAAGG AGGTACATGT GAACGAGAAA 301 GACGAGGCCA TCATTTTGTG TGCATATTTC TCTCAGATTT AGTCTTTGGA TCACATTTTA 361 TACCAAGGTA ATAGCAAGGT TCCTTGAGAT AACTCTATTG TTGTACTTTG CACTTGCCCT 421 ATATTGTCTA AATAGATAAA AATATAGAAG TCTAATTTAT CCATTTTCGC CATAAATATT 481 ACTCATGACT ACTGCAAAAT GTATGAATGC CTTAAAACAA ATTGGTATGC ATCTGCAGGT 541 TTCCTGGTAC ATCCCCATGA TGTATCCAAG ATCCTATAAA GGAATTTTTA TATTTAGGAA 601 AGAATAGATA TGGCATGTAA TTAACTCTAA ACACCGGAGG ATTTATGTTA TTTTTACAGA 661 AAAAATAAAA ATAATTCTGA GTGTTGAAGT TTACACCCTT CCATAGTTTG AAATAGCCAA 721 TCTGGTCTAT TCCACCTGGT TGGTCAACTT TTAAATGCAC CTTTACAAAT GGGTTTAGTC 781 CACCTATTAA GGTTGCTGGG GAGGGGCATT TGTTACCCCA ACTTGAATAA GAAAATGGGC 841 CTTTCTTCAT CCTATAAACA CGGGGTTTCT TAAC Predicted gene structure (within gDNA segment 45581 to 39989): Exon 1 44581 44490 ( 92 n); cDNA 50 141 ( 92 n); score: 0.978 Intron 1 44489 43283 (1207 n); Pd: 0.987 (s: 1.00), Pa: 0.999 (s: 0.98) Exon 2 43282 43149 ( 134 n); cDNA 142 275 ( 134 n); score: 0.963 Intron 2 43148 42795 ( 354 n); Pd: 0.987 (s: 0.96), Pa: 0.999 (s: 1.00) Exon 3 42794 42266 ( 529 n); cDNA 276 803 ( 528 n); score: 0.904 MATCH C09HBa0099P03.1-2- SGN-U337182+ 0.923 755 0.864 C PGS_C09HBa0099P03.1-2-_SGN-U337182+ (44581 44490,43282 43149,42794 42266) Alignment (genomic DNA sequence = upper lines): GTGCAGAAAG CTTCATTGAT GTGAGAATTT TGTCAAGTTC TTTGGAGGCC AAGCTGACTG 44522 |||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTGCAGAATT CTTCATTGAT GTGAGAATTT TGTCAAGTTC TTTGGAGGCC AAGCTGACTG 109 TTTGTTTTTG TGCTTTACAT GGAGATGAAG AGGTACTTGA ACTAACTATT GTACATTTGC 44462 |||||||||| |||||||||| |||||||||| || TTTGTTTTTG TGCTTTACAT GGAGATGAAG AG........ .......... .......... 141 TAAACAGAAG GTTAAAAGTA TGAGCATGTC TACTTTATTG AGGATACAAC GTTGCTTTCA 44402 .......... .......... .......... .......... .......... .......... 141 ACTTTTTGAA GGTCCTTCGC GTGCTTTTCT TGCTCTCACT AGTTTCTTAA GTAAAAATTC 44342 .......... .......... .......... .......... .......... .......... 141 CCAGTACTTG TTTTGATGAT AGCCAATTTT TACTTGGTGC CTGGAACATT TACCATTCTC 44282 .......... .......... .......... .......... .......... .......... 141 ATATAAATGG ACAACTTTTA GAAATTCAGC ATGTTGCATG GCACAAAAAG TCGGGTAGGG 44222 .......... .......... .......... .......... .......... .......... 141 TGGCCTAGGA CCGTGGATAC TGAGGGTAAA GAACTTTTAC TTTCCATTAT AGGTGGTAGC 44162 .......... .......... .......... .......... .......... .......... 141 TTTTGGGTGA AGTGTCGTAG TTTCTTAACT TACTATACAC CATTTCCCTC CGAAGCACTG 44102 .......... .......... .......... .......... .......... .......... 141 TAAGAAATAA GCCCTGCAAA TTAAACAAAC AATGTGGTAG TTAGTTTTTT TTTTTTTTTG 44042 .......... .......... .......... .......... .......... .......... 141 AAATGTGTAA TCAGTAAGGA ACTCCCAAAA TTAATTTATC ACTGTGTTCA ATAATTGCAT 43982 .......... .......... .......... .......... .......... .......... 141 ATTTCACTTT TTTCCCCAAT AATTGCAGTC GTTTAATAAT ATGGCATTTA CTGTGATCTG 43922 .......... .......... .......... .......... .......... .......... 141 TATAAACTTT TTCAGAAATG TGTTCCTTGT AGTATCTACG ACTTGACTCT TAGGAGCTTC 43862 .......... .......... .......... .......... .......... .......... 141 AGATAATTGA AGGACCATGG AGACGTTATA TTTGTTCTAT GATTTCTTTC TTCTTTTGTT 43802 .......... .......... .......... .......... .......... .......... 141 TTAGATCAAT ATTTTTCATC CTTTGTGCTT AAACCACTTC TTGGTTTCTT CTTTACCTCG 43742 .......... .......... .......... .......... .......... .......... 141 TCTGGATCAA TATTATTTCT TAATTCACAA AGAAGCTTGC AAATTTAGGG TCAGAATATG 43682 .......... .......... .......... .......... .......... .......... 141 CCCATGATCC AAATGTCTAA CCAAGGATGT TCGCTGTCAC TTACGATCTA TATACGTAGT 43622 .......... .......... .......... .......... .......... .......... 141 TTGTACTGGA GGATTATCTA ATGCAAGATT GAATAAATTG ATATATATTT AGGTTTTATC 43562 .......... .......... .......... .......... .......... .......... 141 TGCTTTTGTT TAAATACAGA AGGAGGGAAA AAAGTGATCT CGACAAACAC AAATCAGTAC 43502 .......... .......... .......... .......... .......... .......... 141 TCATTCCCCA AGCACATAGG TCTAGAAGGG AAAAAAAGAA GTTTCATACC ATCTGGCTAG 43442 .......... .......... .......... .......... .......... .......... 141 TCATTTCGTC ATAATGGAAA TTTTTAAGAA TGAAAGTTGG GGTCTTAACA TTTATTAACT 43382 .......... .......... .......... .......... .......... .......... 141 GTTTATTTTA CTCTAACTGG TGTTGATGGT TTCATTTACC AAACAGAATA AATCTGTGGT 43322 .......... .......... .......... .......... .......... .......... 141 GTTGGGTGAT GCGATTGTCA ATTTTATTTT TCTTTGCAGG GTTCTGAGTG CCTGAGCAAT 43262 | |||||||||| |||||||||| .......... .......... .......... .........G GTTCTGAGTG CCTGAGCAAT 162 CCTCGAGAAA GGAAGGGTAC TGGTAGAAGG GAGTTTACTA TCATTTTTAA CTCAAGAATT 43202 |||||||||| |||||||||| ||| |||||| |||||| || |||||||||| |||||||||| CCTCGAGAAA GGAAGGGTAC TGGCAGAAGG AAGTTTATTA TCATTTTTAA CTCAAGAATT 222 TGTAAAGATG TCGAACTTGA AATAGGGAAT GTTATCCGCA TACATCAACC TTGGTAGGAG 43142 |||||||||| |||||||| |||||||||| |||||||||| |||||||||| ||| TGTAAAGATG TCGAACTTAC AATAGGGAAT GTTATCCGCA TACATCAACC TTG....... 275 GCGCTAAAAT ATTTTCTTTT TTCAATATTT GTTGTTTTTG TTTGTTTGGA TAATACTCTT 43082 .......... .......... .......... .......... .......... .......... 275 TTCTCGGTAC CGGGTTGAGT ATGATGGCTC TTGGCACCCA ACCTAAATTA TTGTGATTGA 43022 .......... .......... .......... .......... .......... .......... 275 AATGCCTAGA AGGCTAGAGC ACCCTTTTGG TAGAAGTGGA GTAACTCTAT ACATTAAATT 42962 .......... .......... .......... .......... .......... .......... 275 TTCTAAAAAC AAGGAGACAA AGCTAGATTG ATTTTAGCCC CTCATAGCAT GGAGGAAAGT 42902 .......... .......... .......... .......... .......... .......... 275 TCTGATTCTG TCCTTGTAAA TTATTAACTG TACCTTTTAC TGAACTTTCT TGCATTGCTT 42842 .......... .......... .......... .......... .......... .......... 275 AACAAATACT AAGTGATTCT TTAACAGACT GATTTTTTCT GGTGCAGGAA GGAGGTACAT 42782 ||| |||||||||| .......... .......... .......... .......... .......GAA GGAGGTACAT 288 GTGAACGAGA AAGACGAGGC CATCATTTTG TGTGCATATT TCTCTCAGAT TTAGTCTTTG 42722 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTGAACGAGA AAGACGAGGC CATCATTTTG TGTGCATATT TCTCTCAGAT TTAGTCTTTG 348 GATCACATTT TATACCAAGG TAATAGCAAG GTTCCTTGAG ATAACTCTAT TGTTGTACTT 42662 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GATCACATTT TATACCAAGG TAATAGCAAG GTTCCTTGAG ATAACTCTAT TGTTGTACTT 408 TGCACTTG-C CTATATTTTC TAAATAGATA AAAATATAGA AGTCTAATTT ATCCATTTTC 42603 |||||||| | ||||||| || |||||||||| |||||||||| |||||||||| |||||||||| TGCACTTGCC CTATATTGTC TAAATAGATA AAAATATAGA AGTCTAATTT ATCCATTTTC 468 GCCATAAATA TTACTCATGA CTACTGCAAA ATGTATGAAT GCCTTGAAAC AAATTGGT-T 42544 |||||||||| |||||||||| |||||||||| |||||||||| ||||| |||| |||||||| | GCCATAAATA TTACTCATGA CTACTGCAAA ATGTATGAAT GCCTTAAAAC AAATTGGTAT 528 GCATCTGCAA GTTTCCTGGT ACATCCCCAT GATGTATCCA AGATCCTATA GTTTAAAAGG 42484 ||||||||| |||||||||| |||||||||| |||||||||| |||||| || | ||||| GCATCTGCAG GTTTCCTGGT ACATCCCCAT GATGTATCCA AGATCC--TA ---T-AAAGG 582 AATTTTTATA TTTTAGGAAA GAATAGAGAT GGAATGTAAT TAACTCTAAA CACTGTAGGA 42424 |||||||||| ||||||||| ||||||| || || ||||||| |||||||||| ||| | |||| AATTTTTATA -TTTAGGAAA GAATAGATAT GGCATGTAAT TAACTCTAAA CACCGGAGGA 641 TTTATGTAAT TTTTACAG-A AAAATAAAAA TAATTCTGAG TGTTGAAGTT TACA-CCTCC 42366 ||||||| || |||||||| | |||||||||| |||||||||| |||||||||| |||| ||| | TTTATGTTAT TTTTACAGAA AAAATAAAAA TAATTCTGAG TGTTGAAGTT TACACCCTTC 701 CATAGTTTGA AGTAGTCAGT CTGTACTATC CAGCCTGTTT GTTCAACTTT AAAATGCA-C 42307 |||||||||| | ||| || | ||| |||| | |||| || | |||||||| ||||||| | CATAGTTTGA AATAGCCAAT CTGGTCTATT CCACCTGGTT GGTCAACTTT TAAATGCACC 761 TTAACAAATT GGTTTAGTCA ACATA-TAAG GATAGTTGTG AG 42266 || |||||| ||||||||| || || |||| | | | | | || TTTACAAATG GGTTTAGTCC ACCTATTAAG GTTGCTGGGG AG 803 hqPGS_C09HBa0099P03.1-2-_SGN-U337182+ (44581 44490,43282 43149,42794 42266) ******************************************************************************** EST sequence 22 +strand 853 n (File: SGN-U337183+) 1 AACTAGTGGA TCCCCCGGGC TGCAGGAAAA AAGGTTTGCT TAGGATTCAA GACTTTTTGG 61 ATATATTGAT TTTTTTAAGA ATGATATCAA GAAAATTTTA CCCTATATAT ACCCCTCCAT 121 ATAACTTCAT TTCATCAACT TGGAGCAATC TATCATAATC CTAATTTATT ATTCTTGAAA 181 TAATGGCAAT CAAAGTCCAT GGTATCCCCT TGTCAACTGC AACCATGAGA GTTATTTCTT 241 GCCTTATTGA GAAGGATTTG GATTTTGAGT TTGTCTTTGT TGATATGGCC AAAGAAGAAC 301 ACAAGAGGCA CCCTTTCCTC TCACTCAATG TAAGCATAAA TAATTACTCC CTCTGTACAG 361 CTACATAAAT ATTTAAGAAT TGTTTGACCA TAAGTTTCAA AAGTTCTTTT CTTTAAACAT 421 GGTGTCAAGT CAAATGGTGT CAAATAAAAT GGGACGGTTG AAGTACATTA TTAGCATGTT 481 TGATCAAGTT TTGGAGAAGC TAAAAGTATT TCTTTTTAAA TGTTTATTTT AGAAATTTGA 541 GATGTTCAGT GTTTTTTAGT AGCAGCATAA ACTGAACTAA AAACACTTTT TTGAAACTTT 601 GGTCAAACAC AAATGTTGCA AAAACATTTG TCATATTGAT GGCAAATACA AATTGTCATC 661 GTCCAAAATA CTCTTTAAAT ACAACACTTT TTGAAATTAA TAATTTCTAA AATGCTAAAC 721 AAACTATAAA TTTATTTGGT AGTGAATGTT ACTAGTGAAA AAAAAATAAA TCTAATTTGG 781 TAGTGATTTC TACACAGATT TATTTTTGGC AAAAAATATT TTAAATGGTT TCAACTTGAC 841 TTGCTATCCA ATC Predicted gene structure (within gDNA segment 43038 to 40294): Exon 1 42178 41361 ( 818 n); cDNA 27 848 ( 822 n); score: 0.979 MATCH C09HBa0099P03.1-2- SGN-U337183+ 0.979 818 0.959 C PGS_C09HBa0099P03.1-2-_SGN-U337183+ (42178 41361) Alignment (genomic DNA sequence = upper lines): AAAAAAGGTT TGCTTAGGAT TCAAGACTTT TTGGATATAT TGATTTTTTT AAGAATGATA 42119 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAAAAGGTT TGCTTAGGAT TCAAGACTTT TTGGATATAT TGATTTTTTT AAGAATGATA 86 TCAAGAAAAT TTTACCCTAT ATATACCCCT CCATATAACT TCATTTCATC AACTTGGAGC 42059 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCAAGAAAAT TTTACCCTAT ATATACCCCT CCATATAACT TCATTTCATC AACTTGGAGC 146 AATCTATCAT AATCCTAATT TATTATTCTT GAAATAATGG CAATCAAAGT CCATGGTATC 41999 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AATCTATCAT AATCCTAATT TATTATTCTT GAAATAATGG CAATCAAAGT CCATGGTATC 206 CCCTTGTCAA CTGCAACCAT GAGAGTTATT TCTTGCCTTA TTGAGAAGGA TTTGGATTTT 41939 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCCTTGTCAA CTGCAACCAT GAGAGTTATT TCTTGCCTTA TTGAGAAGGA TTTGGATTTT 266 GAGTTTGTCT TTGTTGATAT GGCCAAAGAA GAACACAAGA GGCACCCTTT CCTCTCACTC 41879 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAGTTTGTCT TTGTTGATAT GGCCAAAGAA GAACACAAGA GGCACCCTTT CCTCTCACTC 326 AATGTAAGCA TAAATAATTA CTCCCTCTGT ACAGCTACAT AAATATTTAA GAATTGTTTG 41819 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AATGTAAGCA TAAATAATTA CTCCCTCTGT ACAGCTACAT AAATATTTAA GAATTGTTTG 386 ACCATAAGTT TCAAAAGTTC TTTTCTTTAA ACATGGTGTC AAGTCAAATG GTGTCAAATA 41759 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACCATAAGTT TCAAAAGTTC TTTTCTTTAA ACATGGTGTC AAGTCAAATG GTGTCAAATA 446 AAATGGGACG GTTGAAGTAC ATTATTAGCA TGTTTGATCA AGTTTTGGAG AAGCTAAAAG 41699 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAATGGGACG GTTGAAGTAC ATTATTAGCA TGTTTGATCA AGTTTTGGAG AAGCTAAAAG 506 TATTTCTTTT TAAATGTTTA TTTTAGAAAT TTGAGATGTT CAGTGTTTTT TAGTAGCAGC 41639 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TATTTCTTTT TAAATGTTTA TTTTAGAAAT TTGAGATGTT CAGTGTTTTT TAGTAGCAGC 566 ATAAACTGAA CTAAAAACAC TTTTTTGAAA CTTTGGTCAA ACACAAATGT TGCAAAAACA 41579 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATAAACTGAA CTAAAAACAC TTTTTTGAAA CTTTGGTCAA ACACAAATGT TGCAAAAACA 626 TTTGTCATAT TGATGGCAAA TACAAATTGT CATCGTCCAA AATACTCTTT AAATACAACA 41519 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTGTCATAT TGATGGCAAA TACAAATTGT CATCGTCCAA AATACTCTTT AAATACAACA 686 CTTTTTGAAA TTAATAATTT CTAAAATGCT AAACAAACTA TAAATTTATT TGGTAGTGAA 41459 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTTTTGAAA TTAATAATTT CTAAAATGCT AAACAAACTA TAAATTTATT TGGTAGTGAA 746 TGTTACTAGT G-AAAAAAAA TAAATCT-AT TTGGTAGTGA TTTCTACACA GATTTATATT 41401 |||||||||| | |||||||| ||||||| || |||||||||| |||||||||| ||||||| || TGTTACTAGT GAAAAAAAAA TAAATCTAAT TTGGTAGTGA TTTCTACACA GATTTATTTT 806 TGTC-AAAAA TATTTTAATT GG-TTCACTT GTACTTGCTA TC 41361 || | ||||| |||||||| | || |||| | |||||||| || TGGCAAAAAA TATTTTAAAT GGTTTCAACT TGACTTGCTA TC 848 hqPGS_C09HBa0099P03.1-2-_SGN-U337183+ (42178 41361) ******************************************************************************** EST sequence 30 +strand 773 n (File: SGN-U330064+) 1 GCACGAGCCC AATTAGCTCC GCAAAATTGC GTCTCTGCTT GTTACCAGCT ATGGAAGAGA 61 AAGGTTCAGC TGCTAGTAGA GTATTTCTTC AGGAAAAGGA AGATTCGAAC CAGAGTTTTT 121 CCGAAGAGGA AGATATGGAC GATGACGAAT GGATGACAAA TGACAATTGT TCCTTAGAAA 181 ACAAGGGAGG TTTAGGAGTC CTTTCCCAGC TTGAACGGCT CACAGATGTC AAAAGACTTC 241 ATCATTCAAC CGATACAGTG AACTCTGATC AGCTGGTCAG AGGCGGACAG GTTTATGCGA 301 AGAAGATGAC GTTGAAGTTC CTTTGTTTAA GAGTCAGGAT GGTAGCCTGA TCAACAAAAA 361 TGATCAGGAT GGTAGCTTCA TCAACAAAAA TGATCATTGG AAGGCGTTGT CCTGCTCCTT 421 AGATGATGAA TTTTGTCACG TCACTAGAAT TACATCCACT TGTAATTCAG AAGAGGAAAT 481 TATGTCTGAT GATGAGATGA GGCCTTCTAC TGATGGAAAA TTCAAAAGAG ATGGTAAAAG 541 TACAATGCTT AAAGTAAGCG CAGATTGCAA ATCAGGAGCT TTCTTCAATA AAGATGCTGG 601 GTGTTCATCG GTATACGGGG CCTCATCAAA ATTGAACAGA TCATCTAAAG GAAGCCCGGG 661 CAAATCTAAG GCCAAATTTT TGTTCCAATC CCGGCCACAG AAGAAAGACT ATGCTTTGGT 721 TGTCCATGAT AGTTGTGAAA CCTGCATGCC CTTATCTGTG CTTCCACTAA ATG Predicted gene structure (within gDNA segment 48906 to 44901): Exon 1 48236 48152 ( 85 n); cDNA 8 92 ( 85 n); score: 1.000 Intron 1 48151 47977 ( 175 n); Pd: 0.870 (s: 1.00), Pa: 0.950 (s: 0) Exon 2 47976 47946 ( 31 n); cDNA 93 123 ( 31 n); score: 1.000 Intron 2 47945 47839 ( 107 n); Pd: 0.582 (s: 0), Pa: 0.995 (s: 1.00) Exon 3 47838 47777 ( 62 n); cDNA 124 185 ( 62 n); score: 1.000 Intron 3 47776 47323 ( 454 n); Pd: 0.998 (s: 1.00), Pa: 0.978 (s: 1.00) Exon 4 47322 47283 ( 40 n); cDNA 186 225 ( 40 n); score: 1.000 Intron 4 47282 47173 ( 110 n); Pd: 0.899 (s: 1.00), Pa: 0.954 (s: 1.00) Exon 5 47172 47117 ( 56 n); cDNA 226 280 ( 55 n); score: 0.982 Intron 5 47116 46478 ( 639 n); Pd: 0.998 (s: 0.98), Pa: 0.011 (s: 1.00) Exon 6 46477 46262 ( 216 n); cDNA 281 496 ( 216 n); score: 1.000 Intron 6 46261 46157 ( 105 n); Pd: 0.996 (s: 1.00), Pa: 0.949 (s: 1.00) Exon 7 46156 46003 ( 154 n); cDNA 497 650 ( 154 n); score: 1.000 Intron 7 46002 45634 ( 369 n); Pd: 0.998 (s: 1.00), Pa: 0.971 (s: 1.00) Exon 8 45633 45511 ( 123 n); cDNA 651 773 ( 123 n); score: 1.000 MATCH C09HBa0099P03.1-2- SGN-U330064+ 0.999 767 0.992 C PGS_C09HBa0099P03.1-2-_SGN-U330064+ (48236 48152,47976 47946,47838 47777,47322 47283,47172 47117,46477 46262,46156 46003,45633 45511) Alignment (genomic DNA sequence = upper lines): CCCAATTAGC TCCGCAAAAT TGCGTCTCTG CTTGTTACCA GCTATGGAAG AGAAAGGTTC 48177 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCCAATTAGC TCCGCAAAAT TGCGTCTCTG CTTGTTACCA GCTATGGAAG AGAAAGGTTC 67 AGCTGCTAGT AGAGTATTTC TTCAGGTATT TCGAGATTTT CCTAACAATT TTCATCATTT 48117 |||||||||| |||||||||| ||||| AGCTGCTAGT AGAGTATTTC TTCAG..... .......... .......... .......... 92 TGCTCAAAAA TCGGAGATTA TCTCATCAGT ATTGTAGTTT TAGGTTACCT GCAATTGTTC 48057 .......... .......... .......... .......... .......... .......... 92 AGACGATTTT TTGTGTTGAT TTTGAATGAA TTTCGATGTT TTCTGTAAAG TTGAACTTGA 47997 .......... .......... .......... .......... .......... .......... 92 TTTCCTCAAT TATAATGCAG GAAAAGGAAG ATTCGAACCA GAGTTTTTCC GGTACCTTTC 47937 |||||||||| |||||||||| |||||||||| | .......... .......... GAAAAGGAAG ATTCGAACCA GAGTTTTTCC G......... 123 TTCTCCTTGA AAGAAAGCTT CACAGTAATA ACCTAGGTCT GTTTGTTTGT TTGTTTTAAT 47877 .......... .......... .......... .......... .......... .......... 123 TGAATCTGCT TTGCTTTGTT TTAAATTGTT ATTGTTAGAA GAGGAAGATA TGGACGATGA 47817 || |||||||||| |||||||||| .......... .......... .......... ........AA GAGGAAGATA TGGACGATGA 145 CGAATGGATG ACAAATGACA ATTGTTCCTT AGAAAACAAG GTAATGCCTA GTTAATTTTC 47757 |||||||||| |||||||||| |||||||||| |||||||||| CGAATGGATG ACAAATGACA ATTGTTCCTT AGAAAACAAG .......... .......... 185 TTGCTTACAT TTTGAAATTC CTCGAGTTAA AACGTCAGCT GCCTTCTATA ATGTTCTAGG 47697 .......... .......... .......... .......... .......... .......... 185 TAAGCTGTTA ATTTATTTTT AGTCTAAGAC AAAATTTGTT TACTGTTAGC AGAATGGATA 47637 .......... .......... .......... .......... .......... .......... 185 TAGAGGAGTC GTATAGCCAA TCCCAACTAA TCTTATTGGC TGATTGATTG TTAAAATGTT 47577 .......... .......... .......... .......... .......... .......... 185 ATCACATAAT TGTCTTGCTT ACTATTCGTA GAAAAACAAA TCTCAAAGTG GTTTGGAGGG 47517 .......... .......... .......... .......... .......... .......... 185 TTAAGGTTTA TGTTTTGGCT TTGATCGTTA GCAATTTAAT TATAGTTGTG AGATTGAGTG 47457 .......... .......... .......... .......... .......... .......... 185 TTTAGTGTTT ATTCACAATG CAGGCATGTT CACTGCACAC ATTTCCTAGT TATCAGTGAA 47397 .......... .......... .......... .......... .......... .......... 185 CCCCTTATTT TGTATATGAA TTTGCTGATG AATACTTTAT AATGCCCAAG ACTTGGATGT 47337 .......... .......... .......... .......... .......... .......... 185 TTCTTGTTTG TCAGGGAGGT TTAGGAGTCC TTTCCCAGCT TGAACGGCTC ACAGGTATTC 47277 |||||| |||||||||| |||||||||| |||||||||| |||| .......... ....GGAGGT TTAGGAGTCC TTTCCCAGCT TGAACGGCTC ACAG...... 225 CCAATGAAAC TTCTCCACTT GATTTTTCAT TCATCAGCAC CCCCCTGCAA AATATTCAAA 47217 .......... .......... .......... .......... .......... .......... 225 AAGATAAGGA CATTTGGGAA GTTATCCATG TTAATCTTCT GCAGATGTCA AAAGACTTCA 47157 |||||| |||||||||| .......... .......... .......... .......... ....ATGTCA AAAGACTTCA 241 TCATTCAACC GATACAGTGA ACTCTGATCA GCTGGTACAG GTTTGAATTC ACCTCCTTTT 47097 |||||||||| |||||||||| |||||||||| |||||| ||| TCATTCAACC GATACAGTGA ACTCTGATCA GCTGGT-CAG .......... .......... 280 TTTTGTCATA TGACCTACCT TCATTCCTTC TTTCTATTGT TCCTTTGGTC CTCGTCTTTT 47037 .......... .......... .......... .......... .......... .......... 280 CCTCTCTGTT TTTTTTTTTT TTGGGGTTAG ATATGTCTCA AATAGTTCCT AGTTGACTTT 46977 .......... .......... .......... .......... .......... .......... 280 AATGGGTCAT GGTGGTTAAG AATTCAACAT GGAAATTGAC TTTTTGATGG GGATTCCAAG 46917 .......... .......... .......... .......... .......... .......... 280 AAAAAGTTGC TCTCCACTTC TCCCGTTTCT TTTGAATCGT TCAGTTTAAG GAACAACATG 46857 .......... .......... .......... .......... .......... .......... 280 TTGGGCTAAA ATAACCTGGA GATACTCTTA GCATGGTGTA ATATCGTCCG CTTTGGGCCA 46797 .......... .......... .......... .......... .......... .......... 280 AAAGTCGCAC AGTTTTCCTA ATAAGGGCTC AAACATTAAG AGTATCCATC CCTATACCTT 46737 .......... .......... .......... .......... .......... .......... 280 TTATGTAGTT TCTTTTTCCT TTCAGCTACC AATGTGTGAC TACCTTATCA GAAAAAAGAA 46677 .......... .......... .......... .......... .......... .......... 280 AAAAAAATAC CAACGTGGAA CTTTGTTCAT ATACGCAACA CATAGCTTAT TAATTGATGG 46617 .......... .......... .......... .......... .......... .......... 280 GGGAGGCCTG TATTAACATT ATGTGTACCC CATCTACAAA AGGGATCTAT TAGTAAGTGG 46557 .......... .......... .......... .......... .......... .......... 280 AACTAGACAA AAATGATAAG CAGGAAAAAG TTCAGATGAC AACTTCCTCT TACAGTTGCG 46497 .......... .......... .......... .......... .......... .......... 280 GTTGCATTTT TGTCAGCAGA GGCGGACAGG TTTATGCGAA GAAGATGACG TTGAAGTTCC 46437 | |||||||||| |||||||||| |||||||||| |||||||||| .......... .........A GGCGGACAGG TTTATGCGAA GAAGATGACG TTGAAGTTCC 321 TTTGTTTAAG AGTCAGGATG GTAGCCTGAT CAACAAAAAT GATCAGGATG GTAGCTTCAT 46377 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTGTTTAAG AGTCAGGATG GTAGCCTGAT CAACAAAAAT GATCAGGATG GTAGCTTCAT 381 CAACAAAAAT GATCATTGGA AGGCGTTGTC CTGCTCCTTA GATGATGAAT TTTGTCACGT 46317 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAACAAAAAT GATCATTGGA AGGCGTTGTC CTGCTCCTTA GATGATGAAT TTTGTCACGT 441 CACTAGAATT ACATCCACTT GTAATTCAGA AGAGGAAATT ATGTCTGATG ATGAGGTATG 46257 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||| CACTAGAATT ACATCCACTT GTAATTCAGA AGAGGAAATT ATGTCTGATG ATGAG..... 496 ACTGCTGATC TCTATGTAGA TCGATATCAC TAAAACTTGA ATAATTAATA TTTTGTCCAA 46197 .......... .......... .......... .......... .......... .......... 496 CCTATACTAA GTGGATCTTT GGAGGTTAAA AATCTGGCAG ATGAGGCCTT CTACTGATGG 46137 |||||||||| |||||||||| .......... .......... .......... .......... ATGAGGCCTT CTACTGATGG 516 AAAATTCAAA AGAGATGGTA AAAGTACAAT GCTTAAAGTA AGCGCAGATT GCAAATCAGG 46077 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAATTCAAA AGAGATGGTA AAAGTACAAT GCTTAAAGTA AGCGCAGATT GCAAATCAGG 576 AGCTTTCTTC AATAAAGATG CTGGGTGTTC ATCGGTATAC GGGGCCTCAT CAAAATTGAA 46017 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGCTTTCTTC AATAAAGATG CTGGGTGTTC ATCGGTATAC GGGGCCTCAT CAAAATTGAA 636 CAGATCATCT AAAGGTATTA GTATTTCTGT TGACTTCATT TCCCTATTTT GCATTCTTTG 45957 |||||||||| |||| CAGATCATCT AAAG...... .......... .......... .......... .......... 650 TCTTGATGCT AAAGATGTCG TCAACTCCCC CAACGAGAAG AATGATCTTA ATCAAGCATT 45897 .......... .......... .......... .......... .......... .......... 650 GTTCATCACA GTTGGTTAAG TTGTCTTGGA CTAAGCTGTT ACTGACAAGC ATAACAATGT 45837 .......... .......... .......... .......... .......... .......... 650 AATCCCCCCC CCCCAGCATA ACATGTGTAA TTGTGGTGTT TCTCAGGAAC AAGTGAACTC 45777 .......... .......... .......... .......... .......... .......... 650 CAAGTAAAAT AGAAAAGAAA TGGATGCATC TCAATATTGA ATCCTCGAGT CTGACATCTA 45717 .......... .......... .......... .......... .......... .......... 650 TGACGGCTCT TTTTTTTTAC TTGAGATCTG TGATTGAGTT TTCAAGCTGT AAATTTTTGA 45657 .......... .......... .......... .......... .......... .......... 650 ACGACTTTTG ATATCAACTG CAGGAAGCCC GGGCAAATCT AAGGCCAAAT TTTTGTTCCA 45597 ||||||| |||||||||| |||||||||| |||||||||| .......... .......... ...GAAGCCC GGGCAAATCT AAGGCCAAAT TTTTGTTCCA 687 ATCCCGGCCA CAGAAGAAAG ACTATGCTTT GGTTGTCCAT GATAGTTGTG AAACCTGCAT 45537 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATCCCGGCCA CAGAAGAAAG ACTATGCTTT GGTTGTCCAT GATAGTTGTG AAACCTGCAT 747 GCCCTTATCT GTGCTTCCAC TAAATG 45511 |||||||||| |||||||||| |||||| GCCCTTATCT GTGCTTCCAC TAAATG 773 hqPGS_C09HBa0099P03.1-2-_SGN-U330064+ (48236 48152,47976 47946,47838 47777,47322 47283,47172 47117,46477 46262,46156 46003,45633 45511) ******************************************************************************** EST sequence 5 +strand 2020 n (File: SGN-U324287+) 1 ATATATGAAA TTTTTCCAAA ATATAATATG AACCCAATTA TATAAACCCG CTCAAATTTG 61 AAATCTCCCC AAATCCCCAT TTCTGACACC ATTACAGAGG TGAAGCACCG AGCTCGAACT 121 CTCTCCAGAT TCTCTCTACA AGAAGATCCA TTTCCAGCTG ATGGAGAGGA AAACTCCAAA 181 TAGAACAAGG AGGAAGCAGA GAAGCAACCA AAAAAGCAAG AAGAGGATGA ACAAGGGTTC 241 ACTGTCTAGG CATTTCACTG TTGGGATTGC AAAACCTCCG CTTCCTAATC AACAACAGCT 301 TCATTCCTCA CTTTCTAATG TTACTTTACC GAATCCTTCT AGATTCCAGA AACTTCTGGA 361 TTCTGATGAC CTTCCGCCAG CTCAATCTCA GTTCTCTTCA GTTTTGCCGT TGAATCTCGA 421 TGCTGATGAT GATGCCGATG TTGCCGATGT TGCTGAAAAG GACTTCATTC TCAGTCAAGA 481 TTTCTTCTGT ACCCCGGATT ATCTAACGCC AGATGCACCT GCAATTTGTA ATGGGCTTGA 541 TGGTGATAAG GATGATTATA CTCCTTGTCC CAAATCACCC GAGAAGCTTC TAAGTGTATC 601 AAGAAAGAGG CCGCGACTAG CGTCGGTAAG GCCTTTTAGT TCCGATTTAT CTGGACAGCA 661 GCAGCCAGTA GATATTCCTA CAGATACTTT TGGGACAGAC GAAATGAAAT CAGAAAAGAT 721 AAGCGAGTCA GAAAAGGGTC CCAGTTATGT GTCACAATCT GCTATTGCTT TAAGATATCG 781 AGTCATGCCT CCTCCGTGCA TTAGAAACCC TTATCTCGGG GATGCTTCCG AGATAGATGC 841 TGATCCTTTT GGTAACAGGA GATCCAAGTA CCCAGGTTTT AACCCTGCAA TTTCTGGTAA 901 TGATGGTCTG TCACGGTATC GTACTGATTT CCACGAAATT GAGCAAATCG GTAGTGGGAA 961 CTTCAGCCGT GTTTTCAAAG TCTTTAAGAG AATTGATGGA TGTATGTATG CAGTGAAACA 1021 TAGCACTAAA CAGTTACATC AAGACACAGA TAGGAGACAG GCTTTGATGG AAGTGCAAGC 1081 ATTGGCTGCT TTAGGACCTC ATGAGAACGT AGTTGGTTAT TATTCATCTT GGTTTGAAAA 1141 TGAACACCTT TACATCCAAA TGGAGCTCTG TGACCACAGC TTATCCAATA AAAAATATTG 1201 TAAACTATTT TCGGAGGTAG AAGTTTTGGA AGCAATGTAT CAGGTAGCCA ACGCATTGCA 1261 GTTTATACAT CAGAGAGGGG TCGCTCATTT AGATGTAAAG CCAGATAATA TTTATGTGAA 1321 AAATGGTGTA TATAAGCTTG GTGATTTTGG ATGTGCAACT CTTCTTGATA AGAGCCAGCC 1381 AATTGAAGAG GGTGATGCAC GTTATATGCC CCAAGAAATA CTTAATGAGA ACTATGATCA 1441 TCTTGACAAA GTTGACATAT TCTCCTTGGG CGCTGCAATA TATGAACTTA TTAGAGGGTC 1501 TTCACTGCCA GAATCAGGGC CTCATTTTCT AAACCTCAGG GAGGGGAAAT TGCCTCTTCT 1561 TCCGGGTCAC TCCTTGCAAT TTCAGAATCT ACTCAAGGCA ATGATGGACC CAGATCCAAC 1621 ACGTCGTCCT TCTGCAAAAG GCGTTGTGGA TAATCCAATC TTTGAAAGAT GGCAAAGAAA 1681 TTCCAACAAG TAGATATCCA TGTAAATCAC TGTTTTCTGG GATTTGTCGA TTGCTACTTT 1741 TGCCAAAGAT CCAGAATTCA AAGCTGCAGT ATCTCATGCA GCATTCTGGT GTTGACCTAT 1801 AGCCATTTCT GTAAATAGAG ATAAAGCTTC ATGCACCAAA TTTTCCCATT TTGATGGTGC 1861 CACTTTTGCC AAATATTATA TCAAGAATGT AATGTTGTAT CCTAATTGAC CTGCAAACTG 1921 TGGTTTGTAT GTCAAAATAT GGTATTTGGT GGCTTTTAAA AAAAGTAATT CAGTATTGGT 1981 TTTTGGTTTT GGTAAAAAAA AAAAAGAAAA AAAAAAAAAA Predicted gene structure (within gDNA segment 49329 to 55686): Exon 1 49929 50417 ( 489 n); cDNA 1 489 ( 489 n); score: 1.000 Intron 1 50418 50831 ( 414 n); Pd: 0.995 (s: 1.00), Pa: 0.999 (s: 1.00) Exon 2 50832 50892 ( 61 n); cDNA 490 550 ( 61 n); score: 1.000 Intron 2 50893 50993 ( 101 n); Pd: 0.999 (s: 1.00), Pa: 0.896 (s: 1.00) Exon 3 50994 51063 ( 70 n); cDNA 551 620 ( 70 n); score: 1.000 Intron 3 51064 51517 ( 454 n); Pd: 0.998 (s: 1.00), Pa: 0.915 (s: 1.00) Exon 4 51518 51772 ( 255 n); cDNA 621 875 ( 255 n); score: 1.000 Intron 4 51773 51931 ( 159 n); Pd: 0.861 (s: 1.00), Pa: 0.996 (s: 1.00) Exon 5 51932 51999 ( 68 n); cDNA 876 943 ( 68 n); score: 1.000 Intron 5 52000 52086 ( 87 n); Pd: 1.000 (s: 1.00), Pa: 0.579 (s: 1.00) Exon 6 52087 52196 ( 110 n); cDNA 944 1053 ( 110 n); score: 1.000 Intron 6 52197 52625 ( 429 n); Pd: 0.965 (s: 1.00), Pa: 0.999 (s: 1.00) Exon 7 52626 52666 ( 41 n); cDNA 1054 1094 ( 41 n); score: 1.000 Intron 7 52667 52758 ( 92 n); Pd: 0.970 (s: 1.00), Pa: 0.970 (s: 1.00) Exon 8 52759 52907 ( 149 n); cDNA 1095 1243 ( 149 n); score: 1.000 Intron 8 52908 53803 ( 896 n); Pd: 0.045 (s: 1.00), Pa: 0.890 (s: 1.00) Exon 9 53804 54157 ( 354 n); cDNA 1244 1597 ( 354 n); score: 1.000 Intron 9 54158 54428 ( 271 n); Pd: 1.000 (s: 1.00), Pa: 0.972 (s: 1.00) Exon 10 54429 54842 ( 414 n); cDNA 1598 2011 ( 414 n); score: 0.961 MATCH C09HBa0099P03.1-2+ SGN-U324287+ 0.992 2011 0.996 C PGS_C09HBa0099P03.1-2+_SGN-U324287+ (49929 50417,50832 50892,50994 51063,51518 51772,51932 51999,52087 52196,52626 52666,52759 52907,53804 54157,54429 54842) Alignment (genomic DNA sequence = upper lines): ATATATGAAA TTTTTCCAAA ATATAATATG AACCCAATTA TATAAACCCG CTCAAATTTG 49988 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATATATGAAA TTTTTCCAAA ATATAATATG AACCCAATTA TATAAACCCG CTCAAATTTG 60 AAATCTCCCC AAATCCCCAT TTCTGACACC ATTACAGAGG TGAAGCACCG AGCTCGAACT 50048 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAATCTCCCC AAATCCCCAT TTCTGACACC ATTACAGAGG TGAAGCACCG AGCTCGAACT 120 CTCTCCAGAT TCTCTCTACA AGAAGATCCA TTTCCAGCTG ATGGAGAGGA AAACTCCAAA 50108 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTCTCCAGAT TCTCTCTACA AGAAGATCCA TTTCCAGCTG ATGGAGAGGA AAACTCCAAA 180 TAGAACAAGG AGGAAGCAGA GAAGCAACCA AAAAAGCAAG AAGAGGATGA ACAAGGGTTC 50168 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TAGAACAAGG AGGAAGCAGA GAAGCAACCA AAAAAGCAAG AAGAGGATGA ACAAGGGTTC 240 ACTGTCTAGG CATTTCACTG TTGGGATTGC AAAACCTCCG CTTCCTAATC AACAACAGCT 50228 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACTGTCTAGG CATTTCACTG TTGGGATTGC AAAACCTCCG CTTCCTAATC AACAACAGCT 300 TCATTCCTCA CTTTCTAATG TTACTTTACC GAATCCTTCT AGATTCCAGA AACTTCTGGA 50288 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCATTCCTCA CTTTCTAATG TTACTTTACC GAATCCTTCT AGATTCCAGA AACTTCTGGA 360 TTCTGATGAC CTTCCGCCAG CTCAATCTCA GTTCTCTTCA GTTTTGCCGT TGAATCTCGA 50348 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCTGATGAC CTTCCGCCAG CTCAATCTCA GTTCTCTTCA GTTTTGCCGT TGAATCTCGA 420 TGCTGATGAT GATGCCGATG TTGCCGATGT TGCTGAAAAG GACTTCATTC TCAGTCAAGA 50408 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGCTGATGAT GATGCCGATG TTGCCGATGT TGCTGAAAAG GACTTCATTC TCAGTCAAGA 480 TTTCTTCTGG TAAATCTTTT TTCCCTTTTT TTTTCGATTA GAAGGATATG GGTTTGAATG 50468 ||||||||| TTTCTTCTG. .......... .......... .......... .......... .......... 489 GAATAAGATG TAGAGTAGCC ACTGAGATTT GAACTGGTTC CGATTGTGCC CTTTTTTTGG 50528 .......... .......... .......... .......... .......... .......... 489 TCCCTTGATG TTCCATATGA GACCGGGAAC TGATTAAGGT GTAATAAGTT GATTGCTTAC 50588 .......... .......... .......... .......... .......... .......... 489 TGTTCAATTT TATATTGGTT AGGTTTTGTT TTTTAACTTT TACCCCCAAT TCCTGATCGT 50648 .......... .......... .......... .......... .......... .......... 489 ATCTTGGTAT TGGTGAATCT TAGTAGCTTA GTTGGTTCAC TAACTTAACT TGCACCTTGT 50708 .......... .......... .......... .......... .......... .......... 489 TGGTAAGGGT TTGATTCCCA ACTTGTAATC TCCTCCCCCG ATTTTATAAA TATAAAAGAT 50768 .......... .......... .......... .......... .......... .......... 489 CATATATTCA CATTGTATCC TCTAGTTTTG TTTTCTGCTT ATAAATAATT TTTTGGAATG 50828 .......... .......... .......... .......... .......... .......... 489 CAGTACCCCG GATTATCTAA CGCCAGATGC ACCTGCAATT TGTAATGGGC TTGATGGTGA 50888 ||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ...TACCCCG GATTATCTAA CGCCAGATGC ACCTGCAATT TGTAATGGGC TTGATGGTGA 546 TAAGGTAATT TCTTTACCAG AGTGTTAAAA ATTGTGACTT TTTTTCTCTA TGACAGGCTG 50948 |||| TAAG...... .......... .......... .......... .......... .......... 550 TAGTTTCAGT ATGTCATACT GTGCGTTTCC CTCTATTTCT TTTAGGATGA TTATACTCCT 51008 ||||| |||||||||| .......... .......... .......... .......... .....GATGA TTATACTCCT 565 TGTCCCAAAT CACCCGAGAA GCTTCTAAGT GTATCAAGAA AGAGGCCGCG ACTAGGTATG 51068 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||| TGTCCCAAAT CACCCGAGAA GCTTCTAAGT GTATCAAGAA AGAGGCCGCG ACTAG..... 620 AAATATTTCG TACTTTGGTG TCCATTTTAC TTCATTTTTC ATTTGTGGAA TCGTTACTGG 51128 .......... .......... .......... .......... .......... .......... 620 CTCTGTCAAG TTTTTAACTT TTATCATGTA AATTCTCTTC TTTCTTTTCT TATATTTGTT 51188 .......... .......... .......... .......... .......... .......... 620 CTACTATTGA GGCATGACAA AGGCGTCGAG TAAGGTAAGG GGACAAACTA ATAGCACATT 51248 .......... .......... .......... .......... .......... .......... 620 CGGATATTGA AGAAAATTAG AACAGAATAA TCTCTGAAAG TTTAAGAAAC TATATTGTGT 51308 .......... .......... .......... .......... .......... .......... 620 ATGAATATCC CTTGCATTAT CAAGCTTCTA TTTTTAGCCG TTCTTTTTGT TACTGTTGGA 51368 .......... .......... .......... .......... .......... .......... 620 CACCTTTGAA TCTGGAGATC TCTTTCTCTC TCTCTCTCCA CTTTAGCATG TTTTGAAGTT 51428 .......... .......... .......... .......... .......... .......... 620 TGTTGTTTGC ACAATTATCT TCCATAATTA ATCGTAAAGT TGTAAATTAA CTTTCTTATG 51488 .......... .......... .......... .......... .......... .......... 620 TGGTCCTTTC GTGATGCCCA TCTTTACAGC GTCGGTAAGG CCTTTTAGTT CCGATTTATC 51548 | |||||||||| |||||||||| |||||||||| .......... .......... .........C GTCGGTAAGG CCTTTTAGTT CCGATTTATC 651 TGGACAGCAG CAGCCAGTAG ATATTCCTAC AGATACTTTT GGGACAGACG AAATGAAATC 51608 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGGACAGCAG CAGCCAGTAG ATATTCCTAC AGATACTTTT GGGACAGACG AAATGAAATC 711 AGAAAAGATA AGCGAGTCAG AAAAGGGTCC CAGTTATGTG TCACAATCTG CTATTGCTTT 51668 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGAAAAGATA AGCGAGTCAG AAAAGGGTCC CAGTTATGTG TCACAATCTG CTATTGCTTT 771 AAGATATCGA GTCATGCCTC CTCCGTGCAT TAGAAACCCT TATCTCGGGG ATGCTTCCGA 51728 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAGATATCGA GTCATGCCTC CTCCGTGCAT TAGAAACCCT TATCTCGGGG ATGCTTCCGA 831 GATAGATGCT GATCCTTTTG GTAACAGGAG ATCCAAGTAC CCAGGTACTG GTCAGTGAGT 51788 |||||||||| |||||||||| |||||||||| |||||||||| |||| GATAGATGCT GATCCTTTTG GTAACAGGAG ATCCAAGTAC CCAG...... .......... 875 TTGAATAATT GGTGGATTCT ATTTTCTTTG GAACAATTTA GTACTTCAAA AGAAAATTAG 51848 .......... .......... .......... .......... .......... .......... 875 AGTGTGATCT TTTGTTCTTC ATGGCAGAAT TATTGTGTTG TTTTCTGTTA AGTTCCTGAT 51908 .......... .......... .......... .......... .......... .......... 875 GTTACTCTTT CTTTGATCTG TAGGTTTTAA CCCTGCAATT TCTGGTAATG ATGGTCTGTC 51968 ||||||| |||||||||| |||||||||| |||||||||| .......... .......... ...GTTTTAA CCCTGCAATT TCTGGTAATG ATGGTCTGTC 912 ACGGTATCGT ACTGATTTCC ACGAAATTGA GGCATGTTTT GATGCGTGTA TTCTTGACTC 52028 |||||||||| |||||||||| |||||||||| | ACGGTATCGT ACTGATTTCC ACGAAATTGA G......... .......... .......... 943 CTACTAATAC TTTCTTAAAT TTTTTTCTGG ACCTATCTAA GACCCTTCAT TCTTTCAGCA 52088 || .......... .......... .......... .......... .......... ........CA 945 AATCGGTAGT GGGAACTTCA GCCGTGTTTT CAAAGTCTTT AAGAGAATTG ATGGATGTAT 52148 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AATCGGTAGT GGGAACTTCA GCCGTGTTTT CAAAGTCTTT AAGAGAATTG ATGGATGTAT 1005 GTATGCAGTG AAACATAGCA CTAAACAGTT ACATCAAGAC ACAGATAGGT GATATTTCCT 52208 |||||||||| |||||||||| |||||||||| |||||||||| |||||||| GTATGCAGTG AAACATAGCA CTAAACAGTT ACATCAAGAC ACAGATAG.. .......... 1053 TTCTTTCAAC ACTCTGGCCT TTATGGTATG ACACTACAGA GTGTGCATAT TTTTCTTGAC 52268 .......... .......... .......... .......... .......... .......... 1053 ATATCATATA ACTAAAAGAA ACATTTCCTT AGTTATTTTC TGATATCCTG TTGGCCATAT 52328 .......... .......... .......... .......... .......... .......... 1053 AATTGGTAGC ATCTAATATC AGTGGAAGTT TGGGGTGAGG CTTTAGTGGT ATTGCTATGG 52388 .......... .......... .......... .......... .......... .......... 1053 GGTTTGCATG ATATACGAAT TAAGAATGCA ATAGGAAATG TTCTACATCT ATAATTTATG 52448 .......... .......... .......... .......... .......... .......... 1053 TGTACTGTTT CTTACATTTG GCATACTGTG TTTTCTCCAA GTCAAACATT TTCAATAGAT 52508 .......... .......... .......... .......... .......... .......... 1053 TTGAGCACGT ATTTTTGGAC AATGTCTTCA TAAGGTGAAT CTTATGTGAT GCCAAGCATG 52568 .......... .......... .......... .......... .......... .......... 1053 CTGTTTATAT TCAATATAGT TCAGTCCTAC TTATGCATTT AAATTGTGTT CTTCCAGGAG 52628 ||| .......... .......... .......... .......... .......... .......GAG 1056 ACAGGCTTTG ATGGAAGTGC AAGCATTGGC TGCTTTAGGT AGCATGCTGA TCTACTTCTG 52688 |||||||||| |||||||||| |||||||||| |||||||| ACAGGCTTTG ATGGAAGTGC AAGCATTGGC TGCTTTAG.. .......... .......... 1094 TTCCTGATAT TATTATTTGA ACTACCCCAG AGATTATTGT TCAAACATGT TTTGTTATTT 52748 .......... .......... .......... .......... .......... .......... 1094 ATACTTACAG GACCTCATGA GAACGTAGTT GGTTATTATT CATCTTGGTT TGAAAATGAA 52808 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| .......... GACCTCATGA GAACGTAGTT GGTTATTATT CATCTTGGTT TGAAAATGAA 1144 CACCTTTACA TCCAAATGGA GCTCTGTGAC CACAGCTTAT CCAATAAAAA ATATTGTAAA 52868 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CACCTTTACA TCCAAATGGA GCTCTGTGAC CACAGCTTAT CCAATAAAAA ATATTGTAAA 1204 CTATTTTCGG AGGTAGAAGT TTTGGAAGCA ATGTATCAGG TGCGGCTCTT GTTGGAATCT 52928 |||||||||| |||||||||| |||||||||| ||||||||| CTATTTTCGG AGGTAGAAGT TTTGGAAGCA ATGTATCAG. .......... .......... 1243 TTAAGAGCTA CTTTGTGTCA TGTACTGAGT ATGACTGTCT TATAAGTCTG AAATAAAGTT 52988 .......... .......... .......... .......... .......... .......... 1243 AGAAGGAAGA TATATATCTT GCAAACCAAT TATAATTCAT CGACCAAAGA GATAATTTCT 53048 .......... .......... .......... .......... .......... .......... 1243 TACTATATAC ATAACCAATG AGCATATCAT CAATCATGAA ACTTCAACTG CTAAAATAAC 53108 .......... .......... .......... .......... .......... .......... 1243 TGTGATTTCT ATCTGTTATT TAGGGAAAAC AACTGTAGCA AAGTCAGAAC CTAATCCACC 53168 .......... .......... .......... .......... .......... .......... 1243 CTCCTAATAC TTGAAACATG GAGCACACAA GATGCAACAC TTCATAATGT TTTAAGGTTT 53228 .......... .......... .......... .......... .......... .......... 1243 TCATGGAAAT ACAATATTCA GATCTAGTTC CATTTTGTTA TTTACATTGG TAGGTTGTAC 53288 .......... .......... .......... .......... .......... .......... 1243 TTTTGTAACT TGATTCTCAT TCAAATGCAT ATAATATTAC CCAAAAATCC TGCATTGAGT 53348 .......... .......... .......... .......... .......... .......... 1243 TGGTGCTTGT TGCTTTCACA CGAGAAAAGA GGATCAGACT ATAGGCTGTA TAAAAAAAGA 53408 .......... .......... .......... .......... .......... .......... 1243 AAAAGAGAGG ATCAGTTTTA TATCTGTTAT AGAAGCTGAT ACCATTGACT GTATATGTGA 53468 .......... .......... .......... .......... .......... .......... 1243 TCCAAATAAC CCTCTTTGAG GTGGTAGGGG AGCTGTTTTA TATCATGTAT CTTAGGATTT 53528 .......... .......... .......... .......... .......... .......... 1243 ATTACTATCC TCTAGCTTGA GATATGACTG GAACTGGCTT TCAATTTTGA GAGGCCGAAG 53588 .......... .......... .......... .......... .......... .......... 1243 ACCTGTATAG TAAACTATTT TGAGCTTTTA GTCGTGATCT TCTTCAAAAT GCTTAAGAGT 53648 .......... .......... .......... .......... .......... .......... 1243 TCACCCTCTA CTAACTTCAT TTGCTTTTGA CTTCCCTTTG CTTGTTTTTG TTCGTGTATC 53708 .......... .......... .......... .......... .......... .......... 1243 TGGGTATTTT ATTTGTGGGG TGGGAAGGCT GAGGAAACAT GGTTGTTCAA ATTGAGGTCA 53768 .......... .......... .......... .......... .......... .......... 1243 AAATATTGAT ACTTGAGATG CTTTCTGATA TGCAGGTAGC CAACGCATTG CAGTTTATAC 53828 ||||| |||||||||| |||||||||| .......... .......... .......... .....GTAGC CAACGCATTG CAGTTTATAC 1268 ATCAGAGAGG GGTCGCTCAT TTAGATGTAA AGCCAGATAA TATTTATGTG AAAAATGGTG 53888 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATCAGAGAGG GGTCGCTCAT TTAGATGTAA AGCCAGATAA TATTTATGTG AAAAATGGTG 1328 TATATAAGCT TGGTGATTTT GGATGTGCAA CTCTTCTTGA TAAGAGCCAG CCAATTGAAG 53948 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TATATAAGCT TGGTGATTTT GGATGTGCAA CTCTTCTTGA TAAGAGCCAG CCAATTGAAG 1388 AGGGTGATGC ACGTTATATG CCCCAAGAAA TACTTAATGA GAACTATGAT CATCTTGACA 54008 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGGGTGATGC ACGTTATATG CCCCAAGAAA TACTTAATGA GAACTATGAT CATCTTGACA 1448 AAGTTGACAT ATTCTCCTTG GGCGCTGCAA TATATGAACT TATTAGAGGG TCTTCACTGC 54068 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAGTTGACAT ATTCTCCTTG GGCGCTGCAA TATATGAACT TATTAGAGGG TCTTCACTGC 1508 CAGAATCAGG GCCTCATTTT CTAAACCTCA GGGAGGGGAA ATTGCCTCTT CTTCCGGGTC 54128 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAGAATCAGG GCCTCATTTT CTAAACCTCA GGGAGGGGAA ATTGCCTCTT CTTCCGGGTC 1568 ACTCCTTGCA ATTTCAGAAT CTACTCAAGG TATTAACTTC TTCCCTTTTA CTATGTTAAA 54188 |||||||||| |||||||||| ||||||||| ACTCCTTGCA ATTTCAGAAT CTACTCAAG. .......... .......... .......... 1597 AAGTCTCTGG ATTTGTATTT TCCTTTATGC TTCATGTTCA TATGCCTTGA TCATCTATCA 54248 .......... .......... .......... .......... .......... .......... 1597 TCTTGTACAA CCAGCACGCT ATCTGAAGCA CAATATTCCA TATATTTTAA TCCGAGTATA 54308 .......... .......... .......... .......... .......... .......... 1597 TCTAAACTGT AGTTAGATGA TATATTTCGT TTGTCTGTGG CTATGTATGA TAACGCTAAT 54368 .......... .......... .......... .......... .......... .......... 1597 CCATAGTCAT CTGTAGAGGC AAATGCTAAT TCTCTATGTC CAAATTGTAA ATGTTCTCAG 54428 .......... .......... .......... .......... .......... .......... 1597 GCAATGATGG ACCCAGATCC AACACGTCGT CCTTCTGCAA AAGGCGTTGT GGATAATCCA 54488 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCAATGATGG ACCCAGATCC AACACGTCGT CCTTCTGCAA AAGGCGTTGT GGATAATCCA 1657 ATCTTTGAAA GATGGCAAAG AAATTCCAAC AAGTAGATAT CCATGTAAAT CACTGTTTTC 54548 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATCTTTGAAA GATGGCAAAG AAATTCCAAC AAGTAGATAT CCATGTAAAT CACTGTTTTC 1717 TGGGATTTGT CGATTGCTAC TTTTGCCAAA GATCCAGAAT TCAAAGCTGC AGTATCTCAT 54608 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGGGATTTGT CGATTGCTAC TTTTGCCAAA GATCCAGAAT TCAAAGCTGC AGTATCTCAT 1777 GCAGCATTCT GGTGTTGACC TATAGCCATT TCTGTAAATA GAGATAAAGC TTCATGCACC 54668 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCAGCATTCT GGTGTTGACC TATAGCCATT TCTGTAAATA GAGATAAAGC TTCATGCACC 1837 AAATTTTCCC ATTTTGATGG TGCCACTTTT GCCAAATATT ATATCAAGAA TGTAATGTTG 54728 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAATTTTCCC ATTTTGATGG TGCCACTTTT GCCAAATATT ATATCAAGAA TGTAATGTTG 1897 TATCCTAATT GACCTGCAAA CTGTGGTTTG TATGTCAAAA TATGGTATTT GGTGGCTTTT 54788 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TATCCTAATT GACCTGCAAA CTGTGGTTTG TATGTCAAAA TATGGTATTT GGTGGCTTTT 1957 AAGCTGATTA ATTCAGTATT GGTTTTTGGT TTTGGTAACT CAGATTTGTA TTAA 54842 || | || |||||||||| |||||||||| |||||||| | | | || AAAAAAAGTA ATTCAGTATT GGTTTTTGGT TTTGGTAAAA AAAAAAAAGA AAAA 2011 hqPGS_C09HBa0099P03.1-2+_SGN-U324287+ (49929 50417,50832 50892,50994 51063,51518 51772,51932 51999,52087 52196,52626 52666,52759 52907,53804 54157,54429 54842) ******************************************************************************** EST sequence 33 +strand 858 n (File: SGN-U341178+) 1 GTNAAACTTC GATCTGATCG ACCCACTTGA GGAAAATCCT GGGCCCGGGC CGCTCTAGAA 61 GTACTCTCGA GTTTTTTTTT TATCTTTTGG GAAAATTACG CGGCTAAGCA AATTTATACT 121 ATTTAATTAC TCATCATAGC TATAGTTTGC TATAATTAGA ACTCATGACT AACATTATAC 181 ATTAATTACG TGGGCTGACT TCGAATTTGT ATTATTAGTC ACTTTCGCCA GGATATACAA 241 ATACACATGT ATAATATACA ATTATTTAAC CTATATACAT ATACAATTCA CCTCTCTCCC 301 ACTCTCTGCC CTCTCTCGCT CGCCTCTCTC CTCCCTCTCT CAATTTCACT CGCTTCTCTC 361 CTCCTTCTCC CAATCTCGCT CGCCATACAT ACAAATGCAT ATGTATAATA CACAATTATA 421 TACATATACA ATTCACCTTT CTCCCACTCT TTGCCCCCTT TTTTTCGCCT TTCGCCTCCC 481 TTTCTCGATC TCTCTTGCCA TATTTACAAA ATCATATGGA TAATATACAA TTATCTAACC 541 AATATACATT TACCGGTCAC CTTTTACCAC TTTTTCCCGC TTTTGTCTTT TTCCTTTTTT 601 TAAAATTGGT GGGGTTTTTC TTCTTATAAC AGGCACTCTA ATTGGAATAA TAAACTTTAG 661 ATCTGATAGT TAATAAGCAA TTTTTAGGGG GCCCGGGCCC AATTTTTGGA TCACTAGGGT 721 GACCTGGAGG CGCCCGAGCC CCAAGTTTTG TTCCCTTTAG GAGGGTTAAT TTAGGCTTGG 781 TGAATAAAGG GAAATCTGTT TCCTGGTGAA ATGGTTTTCC TCCAATATCC CCCAATTTCT 841 ATCGGAGTAT AAATGTAG Predicted gene structure (within gDNA segment 58940 to 50884): Exon 1 56449 56327 ( 123 n); cDNA 90 212 ( 123 n); score: 0.886 Intron 1 56326 55915 ( 412 n); Pd: 0.000 (s: 0.88), Pa: 0.000 (s: 0.82) Exon 2 55914 55485 ( 430 n); cDNA 213 624 ( 412 n); score: 0.766 MATCH C09HBa0099P03.1-2- SGN-U341178+ 0.793 553 0.645 C PGS_C09HBa0099P03.1-2-_SGN-U341178+ (56449 56327,55914 55485) Alignment (genomic DNA sequence = upper lines): GGTAAATTAT GTGGCTAAGC AAACTTATAC TATTTAATTA CTCATCATAG TTATAGTTTA 56390 || |||||| | |||||||| ||| |||||| |||||||||| |||||||||| |||||||| GGAAAATTAC GCGGCTAAGC AAATTTATAC TATTTAATTA CTCATCATAG CTATAGTTTG 149 CTATAATTAC CACCCACGAC TAACATTATA CATTAATTAT ATGGGCTGAC CTCGAGTTTG 56330 ||||||||| || || ||| |||||||||| ||||||||| ||||||||| |||| |||| CTATAATTAG AACTCATGAC TAACATTATA CATTAATTAC GTGGGCTGAC TTCGAATTTG 209 TATAAGAAAA AGTTACATGA ATTAATACAT TTTAAAAAAT AATTACTGAT TTTAGCGATA 56270 ||| TAT....... .......... .......... .......... .......... .......... 212 TTTTGTTTAT GACCATTTAT AGCAATACTG TGATAAATCT GTAATATGTA TTAAAAGTGA 56210 .......... .......... .......... .......... .......... .......... 212 ATTGTTTATG CAATATATTT GAATTATAAT TGTTTTTTAA ATATATTGTA TTTGTTTGAT 56150 .......... .......... .......... .......... .......... .......... 212 AAAAAAATAG TCACGTTGTA TTATAAGTGT ATTAAAATGC GTGATAAGTG TATTATCCAT 56090 .......... .......... .......... .......... .......... .......... 212 CATTAAAACT TGTATTATAT GTAAATAATA AATTATTCTT TGTAATATGT ATTAAACTTG 56030 .......... .......... .......... .......... .......... .......... 212 TATTATAAAT CAATTAAAAG TGATCAAGTG AAAAAATGTC ATTGCTATAA ATGGTAAATA 55970 .......... .......... .......... .......... .......... .......... 212 TTTTTTATTA TAGCACATTT ATATAAGTTT CTCTTTGTAT AATTAGTCAA GTTTGTATAT 55910 ||| .......... .......... .......... .......... .......... .....TATTA 217 ATATAATTCG CCAAGATATA CAAATACATA TGTATAATAT ACAATTATTT AACCTACATA 55850 | |||| ||| |||||| |||||||| | |||||||||| |||||||||| |||||| ||| GTCACTTTCG CCAGGATATA CAAATACACA TGTATAATAT ACAATTATTT AACCTATATA 277 CATATACAAT TTGCCTCTCT CCCACTCTCT GCCCTCTCTC ACTCGCATCT CTCCTCCCTC 55790 |||||||||| | ||||||| |||||||||| |||||||||| ||||| ||| |||||||||| CATATACAAT TCACCTCTCT CCCACTCTCT GCCCTCTCTC GCTCGCCTCT CTCCTCCCTC 337 TCTCAATCTC GCTCATCTCT CTCCTCCCTC TCCTATTCTC GCTTGCCATA TATACAAATG 55730 ||||||| || ||| ||| ||||||| || ||| | |||| ||| |||||| ||||||||| TCTCAATTTC ACTCGCTTCT CTCCTCCTTC TCCCAATCTC GCTCGCCATA CATACAAATG 397 CATATG---- -TACACAATT ATATACATAT ACAATTCACC TCTCTCCCAC CCTTTGCCCT 55675 |||||| ||||||||| |||||||||| |||||||||| | |||||||| |||||||| CATATGTATA ATACACAATT ATATACATAT ACAATTCACC TTTCTCCCAC TCTTTGCCC- 456 CTCTCCTCCC TCTCCTAGTC TCGCTCGCCT TCTCCTCCCT CTCTCAATAT CTCTTTCCAT 55615 | || | || ||| || | ||| ||| | | ||| | ||||| | | | | || C-CT-TTTTT TCGCCT-TTC GC-CTC-CCT T-T-CTCGAT CTCTC--T-T -GCCAT--AT 503 ATACAAAAAT ATATTTATAA TATACAATTA TCTAATCAAT ATACTTATAC AATTCACCTT 55555 ||||||| |||| |||| |||||||||| ||||| |||| |||| | ||| ||||||| TTACAAAATC ATATGGATAA TATACAATTA TCTAACCAAT ATACATTTAC CGGTCACCTT 563 TCTCCTACTC TTTTCCCCCT TTCTCTCACT TCTCTCCTCT CTCTCCCAAT CTCGCTCGCT 55495 | | | ||| ||||||| || || | | || | | |||| | | | ||| | | | | T-TACCACT- TTTTCCCGCT TT-TGT--CT T-TTTCCT-T TTTTTAAAAT -T-GGTGGGG 614 TCTCTCTTCT 55485 | | |||||| TTTTTCTTCT 624 hqPGS_C09HBa0099P03.1-2-_SGN-U341178+ (56449 56327,55914 55485) ******************************************************************************** EST sequence 14 +strand 1163 n (File: SGN-U319443+) 1 ATCAACTTCA TAGATTGGTG TAATAAGAGA ATTGTAGTGT TGGGAAACGC TTTTGCGAAA 61 ATTTTGTAAT TGATTGAGTT TTGAGGATTT TGATCTCGAA CTCAATCAAT CATACAAATT 121 TCGTAGTTGA TTGAGTTCTG TGGTGTTTTG ATCTCGAACT CAATCAATCA TTCATTTGCA 181 TTTCGTTGGT GTCTCCCTCC TACGTTTCTC TTCGTTTTTC TTCTCTTTTT TTAAGCAACC 241 AATTCATTGG TGATTAAGTT TACATGTCAT GGAACTCAAC CAATTTTTAT TTTGCTTGAT 301 TTGATCAGCT TTTTTCAAAA CAAACAAACA AAAGATAAAA TTAAATTCGT TTTCATAAAT 361 TAAAAAAAAA AAAAAGATGG GGAGCAACAT TGAGGATAAC CAAGATGATG TTCCCATGGA 421 GCTACAACTC AAGGGAAAGA AGCCTTCGAG CCAAAAGTTG AAACGACATG ACTCTTTGGA 481 TGTCGAAGCA AGCAAAATGC CCGATGCCAA AAAGGTAATC GGAACGTCTG TGCTACTAAA 541 ACTTGCATTC CAAAGCATAG GAGTGGTGTA TGGAGATATT GGAACGTCAC CATTGTACGT 601 GTTTTCAACC ATCTTTCTCG AAGGAGTAAA ACACGAAGAT GATATACTTG GTGCTCTATC 661 TCTCATCTTG TATACGATCA CCTTGATCCC TGTCATCAAG TACGTATTCA TCGTTCTCCA 721 AGCTAATGAC AACGGAGATG GTGGTACGTT CGCCTTATAT TCATTGATAT GCCGACATTC 781 CAAGGTGGGA TTGATTCCGA GTACAATGGC AGAAGACAGC GATGTCTCGA CTTTTAAACT 841 TGATATGCCT GATAGACGTA CACGTAGGGC ATCACAACTT AAGTCGGTGC TAGAAAACAG 901 CCAATTCGCG AAGTTCTTTC TGCTAATTGC AACAATGCTT GGTACTTCCA TGGTTATCGG 961 TGATGGTGTC CTAACGCCCT GTATTTCAGT TTTGTCTGCA ATTGGAGGAG TTAAAGCAGC 1021 TGCTCCAGAG GCAATGACTG AAGACAGGAT CGTTTGGCTT GCAGTAGCCA TCTTGATACT 1081 TCTGTTCATG TTTCAAAGAT TTGGAACTGA AAAAGTTGGT TACACATTTG CACCTATACT 1141 TTGCTTATGG TTTGTATTGA TTG Predicted gene structure (within gDNA segment 59404 to 62302): Exon 1 60085 60594 ( 510 n); cDNA 1 514 ( 514 n); score: 0.950 Intron 1 60595 60768 ( 174 n); Pd: 0.996 (s: 0.98), Pa: 0.955 (s: 0.96) Exon 2 60769 60994 ( 226 n); cDNA 515 740 ( 226 n); score: 0.978 Intron 2 60995 61071 ( 77 n); Pd: 1.000 (s: 0.98), Pa: 0.999 (s: 0.98) Exon 3 61072 61320 ( 249 n); cDNA 741 989 ( 249 n); score: 0.988 Intron 3 61321 61426 ( 106 n); Pd: 0.989 (s: 0.98), Pa: 0.997 (s: 0.98) Exon 4 61427 61480 ( 54 n); cDNA 990 1043 ( 54 n); score: 0.981 Intron 4 61481 61572 ( 92 n); Pd: 1.000 (s: 0.98), Pa: 0.977 (s: 1.00) Exon 5 61573 61692 ( 120 n); cDNA 1044 1163 ( 120 n); score: 1.000 MATCH C09HBa0099P03.1-2+ SGN-U319443+ 0.970 1159 0.997 C PGS_C09HBa0099P03.1-2+_SGN-U319443+ (60085 60594,60769 60994,61072 61320,61427 61480,61573 61692) Alignment (genomic DNA sequence = upper lines): ATCAACTTTA TAGATTGGTG TAATAAGAGA ATTGTAGTGT TGGGAAACGC TTTTGCGAAA 60144 |||||||| | |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATCAACTTCA TAGATTGGTG TAATAAGAGA ATTGTAGTGT TGGGAAACGC TTTTGCGAAA 60 ATTTTGTAAT TGATTGAGTT TTGAGGTTTT TGATCTCGAA CTCAATCAAT CATACAAATT 60204 |||||||||| |||||||||| |||||| ||| |||||||||| |||||||||| |||||||||| ATTTTGTAAT TGATTGAGTT TTGAGGATTT TGATCTCGAA CTCAATCAAT CATACAAATT 120 TCGTAGTTGA TTGAGTTCTG AGGTGTTTTG ATTTCGAATT CAATCAATCA TTCATTTGCA 60264 |||||||||| |||||||||| ||||||||| || ||||| | |||||||||| |||||||||| TCGTAGTTGA TTGAGTTCTG TGGTGTTTTG ATCTCGAACT CAATCAATCA TTCATTTGCA 180 TTTCGTTGGT GTCTCCCACC TACGTTTCTC TTCGTTTTTC TTCTCTTTTT TTAAGCAACC 60324 |||||||||| ||||||| || |||||||||| |||||||||| |||||||||| |||||||||| TTTCGTTGGT GTCTCCCTCC TACGTTTCTC TTCGTTTTTC TTCTCTTTTT TTAAGCAACC 240 AATTCATTGG TGATTAAGTT TACATGTCAA TGGAACTCAA CCAATTTTAA TTTTGCTTGA 60384 |||||||||| |||||||||| |||||||| | |||||||||| |||||||| | |||||||||| AATTCATTGG TGATTAAGTT TACATGTC-A TGGAACTCAA CCAATTTTTA TTTTGCTTGA 299 TTTGATCAGC TTTTTTCAAA ACA---AAAC AAAA-A-AAA ATTAAAATCG TTTTCATAAA 60439 |||||||||| |||||||||| ||| |||| |||| | ||| |||||| ||| |||||||||| TTTGATCAGC TTTTTTCAAA ACAAACAAAC AAAAGATAAA ATTAAATTCG TTTTCATAAA 359 TTAAAAAAAA GAAGAAGATG GGGAGCAACA TTGAGGATAA CCAAGAAGAT GTTCCCATGG 60499 |||||||||| || |||||| |||||||||| |||||||||| |||||| ||| |||||||||| TTAAAAAAAA AAAAAAGATG GGGAGCAACA TTGAGGATAA CCAAGATGAT GTTCCCATGG 419 AGCTACAACT CAAGGGAAAG AAGCCTTCGA GCCAAAAGTT GAAACGACAT GACTCTTTGG 60559 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGCTACAACT CAAGGGAAAG AAGCCTTCGA GCCAAAAGTT GAAACGACAT GACTCTTTGG 479 ATGTCGAAGC AAGCAAATTG CCCGATGCCA AAAAGGTGTG TTAGCTAGCT ACCTCAATTA 60619 |||||||||| ||||||| || |||||||||| ||||| ATGTCGAAGC AAGCAAAATG CCCGATGCCA AAAAG..... .......... .......... 514 AATACGAGAC ATTGTATTGT AAGTAGGAGA GTGCCTTGCT GTTATGATCC ACCTCATTTA 60679 .......... .......... .......... .......... .......... .......... 514 AATACGAGAT ATGGTATTGT AAGTAGCAGA GTTGCCTTGC TGTTATGATC CACTGAGATT 60739 .......... .......... .......... .......... .......... .......... 514 TAGCCCTTCA TGTTTTTGTG GTTGAGCAGG TAGTCGGAAT GTCTGTGCTA CTAAAACTTG 60799 | || |||||| |||||||||| |||||||||| .......... .......... .........G TAATCGGAAC GTCTGTGCTA CTAAAACTTG 545 CATTCCAAAG CATAGGAGTG GTGTATGGAG ATATTGGAAC GTCACCATTG TACGTGTTTT 60859 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CATTCCAAAG CATAGGAGTG GTGTATGGAG ATATTGGAAC GTCACCATTG TACGTGTTTT 605 CAACCATCTT TCTCGAAGGT GTAAAACACG AAGAGGATAT ACTTGGTGCT CTATCTCTCA 60919 |||||||||| ||||||||| |||||||||| |||| ||||| |||||||||| |||||||||| CAACCATCTT TCTCGAAGGA GTAAAACACG AAGATGATAT ACTTGGTGCT CTATCTCTCA 665 TCTTGTATAC GATCACCTTG ATCCCTGTCG TCAAGTACGT ATTCATCGTT CTCCAAGCTA 60979 |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| TCTTGTATAC GATCACCTTG ATCCCTGTCA TCAAGTACGT ATTCATCGTT CTCCAAGCTA 725 ATGACAACGG AGATGGTAAG TACTTTTCAC CTACACACTT TTACTATGCT TTAATTTCCT 61039 |||||||||| ||||| ATGACAACGG AGATG..... .......... .......... .......... .......... 740 TTTTAACTCA TGATTTTGTC TTTTAATTTC AGGTGGTACG TTCGCCTTAT ATTCATTGAT 61099 |||||||| |||||||||| |||||||||| .......... .......... .......... ..GTGGTACG TTCGCCTTAT ATTCATTGAT 768 ATGCCGATAT TCCAAGGTGG GATTGATTCC GAGTACAATG GCAGAAGACA GCGATGTCTC 61159 ||||||| || |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATGCCGACAT TCCAAGGTGG GATTGATTCC GAGTACAATG GCAGAAGACA GCGATGTCTC 828 GACTTTTAAA CTTGATATGC CTGATAGACG TACACGTAGG GCATCACAAC TTAAGTCGAT 61219 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||| | GACTTTTAAA CTTGATATGC CTGATAGACG TACACGTAGG GCATCACAAC TTAAGTCGGT 888 GCTAGAAAAC AGCCAATTCG CGAAGTTCTT TCTGCTAATT GCAACAATGC TTGGTACTTC 61279 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCTAGAAAAC AGCCAATTCG CGAAGTTCTT TCTGCTAATT GCAACAATGC TTGGTACTTC 948 CATGGTTATC GGTGATGGTG TCCTAACACC CTGTATTTCA GGCACGTTGT TGATTTCCCT 61339 |||||||||| |||||||||| ||||||| || |||||||||| | CATGGTTATC GGTGATGGTG TCCTAACGCC CTGTATTTCA G......... .......... 989 CTCTTTTATT CTAACAGAAA TAATACGAAT TAGAACACTG GAGATTAACA GAGTTTATGA 61399 .......... .......... .......... .......... .......... .......... 989 TACTAATAAG CTAATTTATG TGATCAGTTT TGTCTGCAAT TGGAGGAGTT AAAGCAGCTG 61459 ||| |||||||||| |||||||||| |||||||||| .......... .......... .......TTT TGTCTGCAAT TGGAGGAGTT AAAGCAGCTG 1022 CTCCAGACGC AATGACTGAA GGTAAATTTT GCATCTGTTA ATGAAAATAT CAACTTCTTT 61519 ||||||| || |||||||||| | CTCCAGAGGC AATGACTGAA G......... .......... .......... .......... 1043 ATCTGTTTTA ATTCTCGATT TACTAATCGT TTGTTTGTTT TCGTGGTATA CAGACAGGAT 61579 ||||||| .......... .......... .......... .......... .......... ...ACAGGAT 1050 CGTTTGGCTT GCAGTAGCCA TCTTGATACT TCTGTTCATG TTTCAAAGAT TTGGAACTGA 61639 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CGTTTGGCTT GCAGTAGCCA TCTTGATACT TCTGTTCATG TTTCAAAGAT TTGGAACTGA 1110 AAAAGTTGGT TACACATTTG CACCTATACT TTGCTTATGG TTTGTATTGA TTG 61692 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||| AAAAGTTGGT TACACATTTG CACCTATACT TTGCTTATGG TTTGTATTGA TTG 1163 hqPGS_C09HBa0099P03.1-2+_SGN-U319443+ (60085 60594,60769 60994,61072 61320,61427 61480,61573 61692) ******************************************************************************** EST sequence 15 +strand 1579 n (File: SGN-U320704+) 1 GTTTGTCATA GCTGTATTCG CAGCCATCAT TGCAAGTCAA GCTTTGATTT CCGGGACATT 61 CGCTATAATC CAGCAATCTC TGGCTTTAGG ATGCTTTCCT CGTGTTAAAA TCGTGCATAC 121 ATCAAAGAAA CATCATGGAC AAATCTACAT TCCTGAAATC AATAACCTTC TCATGGTCGC 181 TTGTGTTCTT ACCATTATCG GATTCAAGAC TACTGAAAAG CTTAGCAATG CTTATGGAAT 241 GGCAGTGGTG TTTGTGATGT TCCTAACATC GTGCTTCCTC ATACTAGTCA TGATCTTGAT 301 ATGGAAAACC AACATTCTTC TTATTATCGT CTATATTCTA ATCATTGTTT CGGTTGAGCT 361 TGTATTCCTA AGCGCAGTCC TTTACAAGTT TGAACAAGGT GGTTACCTCC CTGTGGCTTT 421 AGCTCTGTTC CTAATGTTTA TCATGTACGT ATGGAACTAT GTGTACCGTA AGAAGTATCA 481 CTACGAGCTA GAACACAAGA TCTCTCCCGA AAAAGTTAAA GAAACATTGG ATGCAACCAG 541 TTCACATCGC CTTCCAGGTC TTGCCATTTT CTACTCTGAA CTAGTCCACG GAATCCCCCC 601 AATCTTCAAG CATTATGTTG AGAATGTACC TGCTTTACAC TCTGTCCTCG TGTTCGCTTC 661 TGTCAAATCA CTTCCCATAA GCAAAGTTCC ACTAGAAGAA AGGTTCCTCT TCAGAAGGGT 721 GAAACCATAT GACCTCTATG TGTTCCGTTG TGTGATACGT TATGGATACA ATGAAATGCG 781 CAATGAGGAA GAGCCTATTG AGAAGTTATT GGTAGAAAGG CTAAAGAACT ACATCAAGGA 841 AGATTACATG TTCTCAGTTG CAGCAAATGG AGACAATCAA GGAGAAACTG CCTCCTTGAT 901 TGAGAAAGAC GTCGAAGTAC TTGAGAGAGC TTCCAACATG GGAGTGGTTC ATTTGGTTGG 961 AGAACAAGAC GTCGTCGCGT GCAAGGGGTC TGGTGTAACC AAAAGAATGG TGATCAACTA 1021 TGCATACAAT TTCCTCAAGA GGAACTTAAG ACAGAGTAGT AACAAAGTAT TCGATATCCC 1081 AACGAAACGA ATGCTCAAAG TTGGAATGAC ATGTGAGCTT TAGGGTGATT ATTTTCTCTA 1141 AAAATTTCTT TTTAAGGTTT AAAATAAATG TGCATTGAGT TGAAGGGAAC AAGGGAGAAG 1201 GCACCATACA TGTTTCAACT TCCATCATCA AAAGGTGCTT GCACAAGAAA AAATAATGTC 1261 TTTTTTTTTA ATTTCTTAGT TTTCAGTTTG TTTTGTTTGT TTGGATTTTG TATGAGATAA 1321 GCTAAGCTAA GCCTGATTTT GTAAGAGAAT AAGCTAAGTA TTTGTATTTT GTTTTTTCTT 1381 GTTTCAAAGA AAAGAGAAGA TCAAGATATG TCTAGATAAA AAAACAACAT AGGTGTAGTT 1441 GTTTTTGACT AGCAAAACAT TTTTGTTATT ACCTTGTAAT GTCTTGTAAA AGATTGTCAG 1501 TTTATACATG ATAATAATAA CTAATCTAAG AATAAATAGT TTATAGAAAA AAAAAAAAAA 1561 AAAACTCGAG GGGGGCCCG Predicted gene structure (within gDNA segment 61621 to 65216): Exon 1 62221 62456 ( 236 n); cDNA 1 236 ( 236 n); score: 0.992 Intron 1 62457 62571 ( 115 n); Pd: 0.962 (s: 0.98), Pa: 0.996 (s: 0.98) Exon 2 62572 63882 (1311 n); cDNA 237 1548 (1312 n); score: 0.978 PPA cDNA 1549 1565 MATCH C09HBa0099P03.1-2+ SGN-U320704+ 0.980 1547 0.980 C PGS_C09HBa0099P03.1-2+_SGN-U320704+ (62221 62456,62572 63882) Alignment (genomic DNA sequence = upper lines): GTTTGTCATA GCTGTATTCG CAGCCATCAT TGCAAGTCAA GCTTTGATTT CCGGGACATT 62280 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTTTGTCATA GCTGTATTCG CAGCCATCAT TGCAAGTCAA GCTTTGATTT CCGGGACATT 60 CGCTATAATC CAGCAATCTC TGGCTTTAGG ATGCTTTCCT CGTGTTAAAA TCGTGCATAC 62340 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CGCTATAATC CAGCAATCTC TGGCTTTAGG ATGCTTTCCT CGTGTTAAAA TCGTGCATAC 120 ATCAAAGAAA CATCATGGAC AAATCTACAT TCCTGAAATC AATAACCTTC TCATGATCGC 62400 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||| |||| ATCAAAGAAA CATCATGGAC AAATCTACAT TCCTGAAATC AATAACCTTC TCATGGTCGC 180 TTGTGTTCTT ACCACTATCG GATTCAAGAC TACTGAAAAG CTTAGCAATG CTTATGGTAA 62460 |||||||||| |||| ||||| |||||||||| |||||||||| |||||||||| |||||| TTGTGTTCTT ACCATTATCG GATTCAAGAC TACTGAAAAG CTTAGCAATG CTTATG.... 236 AGGCCACTTA ACTCTATTTC TTCTACGGAG TGAAATAAAC ATGTAAAAAG TGCAGGGGAC 62520 .......... .......... .......... .......... .......... .......... 236 TTGCCTTCTT TTCTATATTA TTAACTCGAT CACTCTATTG CTTGATTACA GGAATAGCAG 62580 |||| |||| .......... .......... .......... .......... .......... .GAATGGCAG 245 TGGTGTTTGT GATGTTCCTA ACATCGTGCT TCCTCATACT AGTCATGATC TTGATATGGA 62640 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGGTGTTTGT GATGTTCCTA ACATCGTGCT TCCTCATACT AGTCATGATC TTGATATGGA 305 AAACCAACAT TCTTCTTATT ATCGTCTATA TTCTAATCAT TGTTTCGGTT GAGCTTGTAT 62700 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAACCAACAT TCTTCTTATT ATCGTCTATA TTCTAATCAT TGTTTCGGTT GAGCTTGTAT 365 ACCTAAGCGC AGTCCTTTAC AAGTTTGAAC AAGGTGGTTA CCTCCCTGTG GCTTTAGCTC 62760 ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCCTAAGCGC AGTCCTTTAC AAGTTTGAAC AAGGTGGTTA CCTCCCTGTG GCTTTAGCTC 425 TGTTCCTAAT GTTTATCATG TACGTATGGA ACTATGTGTA CCGTAAGAAG TATCACTACG 62820 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGTTCCTAAT GTTTATCATG TACGTATGGA ACTATGTGTA CCGTAAGAAG TATCACTACG 485 AGCTAGAACA CAAGATCTCT CCCGAAAAAG TTAAAGAAAC ATTGGATGCA ACCAGTTCAC 62880 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGCTAGAACA CAAGATCTCT CCCGAAAAAG TTAAAGAAAC ATTGGATGCA ACCAGTTCAC 545 ATCGCCTTCC AGGTCTTGCC ATTTTCTACT CTGAACTAGT CCACGGAATC CCCCCAATCT 62940 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATCGCCTTCC AGGTCTTGCC ATTTTCTACT CTGAACTAGT CCACGGAATC CCCCCAATCT 605 TCAAGCATTA TGTTGAGAAT GTACCTGCTT TACACTCTGT CCTCGTGTTC GCTTCTGTCA 63000 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCAAGCATTA TGTTGAGAAT GTACCTGCTT TACACTCTGT CCTCGTGTTC GCTTCTGTCA 665 AATCACTTCC CATAAGCAAA GTTCCACTAG AAGAAAGGTT CCTCTTCAGA AGGGTGAAAC 63060 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AATCACTTCC CATAAGCAAA GTTCCACTAG AAGAAAGGTT CCTCTTCAGA AGGGTGAAAC 725 CATATGACCT CTATGTGTTC CGTTGTGTGA TACGTTATGG ATACAATGAA ATGCGCAATG 63120 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CATATGACCT CTATGTGTTC CGTTGTGTGA TACGTTATGG ATACAATGAA ATGCGCAATG 785 AGGAAGAGCC TATCGAGAAG TTATTGGTAG AAAGGCTAAA GAACTACATC AAGGAAGATT 63180 |||||||||| ||| |||||| |||||||||| |||||||||| |||||||||| |||||||||| AGGAAGAGCC TATTGAGAAG TTATTGGTAG AAAGGCTAAA GAACTACATC AAGGAAGATT 845 ACATGTTCTC AGTTGCAGCA AATGGAGACA ATCAAGGAGA AACTGCTTCC TTGATTGAGA 63240 |||||||||| |||||||||| |||||||||| |||||||||| |||||| ||| |||||||||| ACATGTTCTC AGTTGCAGCA AATGGAGACA ATCAAGGAGA AACTGCCTCC TTGATTGAGA 905 AAGACGTCGA AGTACTTGAG AGAGCTTCCA ACATGGGAGT GGTTCATTTG GTTGGAGAAC 63300 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAGACGTCGA AGTACTTGAG AGAGCTTCCA ACATGGGAGT GGTTCATTTG GTTGGAGAAC 965 AAGACGTTGT CGCGTGCAAG GGGTCTGGTG TAACCAAAAG AATGGTGATC AACTACGCAT 63360 ||||||| || |||||||||| |||||||||| |||||||||| |||||||||| ||||| |||| AAGACGTCGT CGCGTGCAAG GGGTCTGGTG TAACCAAAAG AATGGTGATC AACTATGCAT 1025 ACAATTTCCT CAAGAGGAAC TTAAGACAGA GTAGTAACAA AGTATTCGAT ATCCCAACGA 63420 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACAATTTCCT CAAGAGGAAC TTAAGACAGA GTAGTAACAA AGTATTCGAT ATCCCAACGA 1085 AGCGAATGCT CAAAGTTGGA ATGACATGTG AGCTTTAGGG TGATTATTTT CTCTAAAAA- 63479 | |||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| AACGAATGCT CAAAGTTGGA ATGACATGTG AGCTTTAGGG TGATTATTTT CTCTAAAAAT 1145 AAAATTGTAA GGTTTAAAAT AAATGTGCAT AGAGTTGAAG GGAACAAGGG AGAAGGCACC 63539 || ||| |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| TTCTTTTTAA GGTTTAAAAT AAATGTGCAT TGAGTTGAAG GGAACAAGGG AGAAGGCACC 1205 ATATATGTTT CAACTTCCAT CATCAAAAGG TGCTTGCACA AGAAAAAATA ATGTTCTTTT 63599 ||| |||||| |||||||||| |||||||||| |||||||||| |||||||||| |||| |||| ATACATGTTT CAACTTCCAT CATCAAAAGG TGCTTGCACA AGAAAAAATA ATGTCTTTTT 1265 TGTTAATTTC TTAGTTTTCA ATTTG-TTTG TTTGTTTGGA TTTTGTATGA GATAAGCTAA 63658 | |||||||| |||||||||| |||| |||| |||||||||| |||||||||| |||||||||| TTTTAATTTC TTAGTTTTCA GTTTGTTTTG TTTGTTTGGA TTTTGTATGA GATAAGCTAA 1325 GCTAAGCCTG ATTTTGTAAG AGAATAAGCT AAGTATTTGT ATTTTGTTTT TTTCTTGTTT 63718 |||||||||| |||||||||| |||||||||| |||||||||| |||||| ||| |||||||||| GCTAAGCCTG ATTTTGTAAG AGAATAAGCT AAGTATTTGT ATTTTG-TTT TTTCTTGTTT 1384 CAAAGAAAAG AGAAGATCAA GATATGTCTA GATAAAAAAA CAACATAGGT GTAGTTGTTT 63778 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAAAGAAAAG AGAAGATCAA GATATGTCTA GATAAAAAAA CAACATAGGT GTAGTTGTTT 1444 TTGACTAGCA AAACATTTTT GTTTTTACCT TGTAATATCT TGTAAAAGAT TGTCAGTTCA 63838 |||||||||| |||||||||| ||| |||||| |||||| ||| |||||||||| |||||||| | TTGACTAGCA AAACATTTTT GTTATTACCT TGTAATGTCT TGTAAAAGAT TGTCAGTTTA 1504 TACATAATAA TAATAATTAA TCTAAGAATA AATAGTTTAT AGAA 63882 ||||| |||| |||||| ||| |||||||||| |||||||||| |||| TACATGATAA TAATAACTAA TCTAAGAATA AATAGTTTAT AGAA 1548 hqPGS_C09HBa0099P03.1-2+_SGN-U320704+ (62221 62456,62572 63882) ******************************************************************************** EST sequence 3 +strand 1062 n (File: SGN-U313537+) 1 TCACACAAAG AACACACTCA ATTTGTGAGT TATCTTCTTC AGAATTCTCT CTGTTCATCA 61 ATCATGGTTG TTGAAGTTTG TGTCAAGGCT GCTGTGGGTG CCCCTGATGT CCTTGGAGAC 121 TGTCCATTTA GCCAAAGGGT ACTTCTGACA TTGGAGGAAA AGAAAGTGAC TTACAAGAAG 181 CACTTGATCA ATGTTAGTGA CAAGCCCAAA TGGTTCTTGG AGGTGAACCC TGAAGGGAAA 241 GTTCCCGTGA TCAATTTTGG TGACAAATGG ATCCCAGATT CTGATGTCAT TGTTGGGATT 301 ATTGAAGAGA AATACCCAAA TCCCTCTCTC ATTGCTCCCC CTGAATTTGC CTCTGTGGGC 361 TCGAAAATAT TTCCTACCTT CGTCTCATTT CTGAAGAGCA AGGATTCTAG TGACAGTACT 421 GAGCAGGCTC TCCTTGATGA ACTAAAGGCT TTGGAAGAGC ATCTCAAGGC TCATGGACCA 481 TATATCAATG GGCAGAATGT TTGTTCAGTT GATATGAGCT TGGCTCCAAA ACTGTACCAT 541 CTCGAGGTGG CTCTTGGACA CTTCAAGAAG TGGAGTGTGC CTGAAAGCTT GAGTCATGTG 601 CGTAACTACA TGAAGCTGCT CTTCGAGCGA GAGTCGTTCC AGAAAACCAA GGCTGAAGAG 661 AAGTACGTCA TCGCAGGGTG GGCTCCAAAG GTTTAACGTA TGACTGACTC GAAACTATAA 721 TCATGTTTTG TCTCGAGTAG TTAGCAGTTC GACGAGTGCA GCTTTATGAA CGTGATCTAT 781 ATAGTATGAT CTAAATAAAA ACGTTCATCG TTGGTCTGTG TATCGATTAA TCCTTGTACT 841 GTTTGTGTCT AATAAACTAA GTTATCTTCT GAAACTTTTA TTAGTTAATG TAAGGTGTGT 901 TATAAAATCA AGCCCATAAG AATATAATCA TAAAGAGAAG TAGATAGAAG AATGAGAATT 961 TTCTTCTTAT TCAAAAGTAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA GTGAATGAAC 1021 AACCCTATTT ATAGAGTGAG AAATCACTCC AAAAAAAAAA AA Predicted gene structure (within gDNA segment 63832 to 76059): Exon 1 65401 65426 ( 26 n); cDNA 779 803 ( 25 n); score: 0.769 Intron 1 65427 72133 (6707 n); Pd: 0.000 (s: 0), Pa: 0.218 (s: 0) Exon 2 72134 72166 ( 33 n); cDNA 804 832 ( 29 n); score: 0.697 Intron 2 72167 72797 ( 631 n); Pd: 0.000 (s: 0), Pa: 0.975 (s: 0) Exon 3 72798 72823 ( 26 n); cDNA 833 857 ( 25 n); score: 0.769 Intron 3 72824 73336 ( 513 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0) Exon 4 73337 73365 ( 29 n); cDNA 858 886 ( 29 n); score: 0.621 Intron 4 73366 73874 ( 509 n); Pd: 0.810 (s: 0), Pa: 0.317 (s: 0) Exon 5 73875 73884 ( 10 n); cDNA 887 896 ( 10 n); score: 0.500 Intron 5 73885 74591 ( 707 n); Pd: 0.056 (s: 0), Pa: 0.000 (s: 0.96) Exon 6 74592 74647 ( 56 n); cDNA 897 952 ( 56 n); score: 0.929 Intron 6 74648 75230 ( 583 n); Pd: 0.000 (s: 0.92), Pa: 0.000 (s: 0) Exon 7 75231 75256 ( 26 n); cDNA 953 979 ( 27 n); score: 0.635 PPA cDNA 1051 1062 MATCH C09HBa0099P03.1-2+ SGN-U313537+ 0.929 206 0.194 C PGS_C09HBa0099P03.1-2+_SGN-U313537+ (65401 65426,72134 72166,72798 72823,73337 73365,73875 73884,74592 74647,75231 75256) Alignment (genomic DNA sequence = upper lines): ATATAATAAA ATATGAAATA AAAACTATCA TGCCATTTTT GAAATTTATA ATTAAAAGAA 65460 ||||| || || | ||||| ||||| ATATAGTATG ATCT-AAATA AAAACG.... .......... .......... .......... 803 TTGGATATTT TTCACGTAAA AAAATTGTGT TTTTACTTTA ATGCACTTCA TAATTCTACA 65520 .......... .......... .......... .......... .......... .......... 803 TTTTAGAACC AAGGAACAAA TAAAGCAACC TAGTCCATTG CTTAAGAGAA AATCAGGCTA 65580 .......... .......... .......... .......... .......... .......... 803 ACCCAATTAC CCCCCTTGTG TTATTATGAT GTACATAATA TCTAACTCTT TTTTCGTTTT 65640 .......... .......... .......... .......... .......... .......... 803 GTCTCATATT TCTTAGTTTG GCTGAACACA AAAAATTATG AATATAAAGA AAATTTTAAA 65700 .......... .......... .......... .......... .......... .......... 803 TAATTTTGTG ACATTATAAA CTATAATCAT GTATAACACA TCATATATAT TTTGAATCTT 65760 .......... .......... .......... .......... .......... .......... 803 ATAGTATAAA ACATCTCAAT TGTCATGTCA TTTCTATGTT TAGAGCCATT TGATGTAGAA 65820 .......... .......... .......... .......... .......... .......... 803 TGAAAGCTTT GAATTAATAA TTTAACATAA AAATGTGATT ATTTTAAATA CACAGAACCG 65880 .......... .......... .......... .......... .......... .......... 803 AAATATAAAT AAAATATATA AAAATTAAAG ACATTGGATT TAGCGATGGA TAAATTTATA 65940 .......... .......... .......... .......... .......... .......... 803 TTGTTAAATA GAAAAATTCG TTGCTAATCC TATTTAAAGA TATATTTTAC AAAGACGGAT 66000 .......... .......... .......... .......... .......... .......... 803 CAACGATGAA ATATATAGCT TCTTTTTAGT AGTATAAATT GAAACAAAAA AGAGTATATT 66060 .......... .......... .......... .......... .......... .......... 803 TTACTAATAT TTTAAATACT AATAACCAAA CTATTATTTG CAAGATTGAA ATATATAAAT 66120 .......... .......... .......... .......... .......... .......... 803 ATGTATATTA ATTTTTTCTT ACCTGACATT GGATCCTGAT CAAAGTCCTT TCATCTTTTT 66180 .......... .......... .......... .......... .......... .......... 803 TGGAATTATC ACAACTAACC ACATTTAGCC CTTGTTTAAG TAGCCATCTC AACACTTTGG 66240 .......... .......... .......... .......... .......... .......... 803 ATAAAGAAAA TATATCCTCT TTATTGCTAT TATTGACATT AATCAGAATT TCCATACCAT 66300 .......... .......... .......... .......... .......... .......... 803 CTTCACATTT ATTCACTGTG ACACAATTAT TATTTTGTTG AAAAATATCA ACAATTGTTG 66360 .......... .......... .......... .......... .......... .......... 803 AACTTTTATT TTGATTATCC AATGAAATAA TACATTTTCT TGTATTATCC AACTCTTGTA 66420 .......... .......... .......... .......... .......... .......... 803 TATTCTTCTG AAGATACTTT ATATAATTCA CGGCTGCATG CATATGATCT GATGCTGAAC 66480 .......... .......... .......... .......... .......... .......... 803 GTTTACCCTA TTTTTATAAT AAAAATTCAA ATCATGTTAT AATTTAAAAA TAAAAAAGTT 66540 .......... .......... .......... .......... .......... .......... 803 CATGGTGTGT GTACTAACGA AATAAAAGAT ATTTATACAA TCAAGTCATT TAAAAAGTAA 66600 .......... .......... .......... .......... .......... .......... 803 TTGCAAATAA TTTTTGCAGT AAGCATTGAT AAAAAATGAT AAATAACATA GTATAATTAG 66660 .......... .......... .......... .......... .......... .......... 803 AAGGTAATTG CAAATAATTT TAGTAGTAAG CATTGATAAA AAAAATTGAT AAATAACATG 66720 .......... .......... .......... .......... .......... .......... 803 TTAAATTATA GAGATAATGT AAAAATTTAT ATTATCAATG CAAAAGTTCT TCAATTTTTT 66780 .......... .......... .......... .......... .......... .......... 803 TGTACTCAAT ATAATGAAAC CCTTACAAAA AAAAATAGAA AGAAAGACAT ATAGAGAAAC 66840 .......... .......... .......... .......... .......... .......... 803 TAGAATTAAT AAATAACACA GGCTGATATA TATACTTTAT CTTGGAAACA ATTTTTTTTT 66900 .......... .......... .......... .......... .......... .......... 803 TGTCAATGAT CATATATTCA ACATTTTTTT TTTCAAAAAA AGAAAAAAAG AGGAGAGGAA 66960 .......... .......... .......... .......... .......... .......... 803 AATAAAACTA TGTCCCGTAT TTTAAAAACA AATTTTAATT TTTACTTGTC GATATATTAA 67020 .......... .......... .......... .......... .......... .......... 803 AAACATTTTT TTTACCTTAA TATTAATTAC TTATTTCCAA ATCATTTCTT GAAACCAAAA 67080 .......... .......... .......... .......... .......... .......... 803 AGTATACATC AATTAATATG AATTTTATGG TAAAATACCT ATATTAATCA TTGTTTGTTG 67140 .......... .......... .......... .......... .......... .......... 803 ACGAACGTAA GGCCAGAGTC TAAAAGGAGT AGTTATCTCA CCTTGATGAA CTCAATGGGA 67200 .......... .......... .......... .......... .......... .......... 803 ATTAGAGATC TGAGAGAAGT AACAAGATTT GCCATTTCTT GCCTTCTATG CCTTTCAATA 67260 .......... .......... .......... .......... .......... .......... 803 TTTCTATGCA CAGCTTTCTT GAGCTTATTT GATTCATCTT GACTTCTTAA CCCCTCCTTA 67320 .......... .......... .......... .......... .......... .......... 803 GTAAAAGATT TATATAGTCG ATCACAACAA CTTGTCTGGA GTTGACCAGG CATAATAATT 67380 .......... .......... .......... .......... .......... .......... 803 GATGTACTAC TTCTATTTAC TGAAGATTTA TACAGCCGAT CATAACTTGT GTGGGACTGA 67440 .......... .......... .......... .......... .......... .......... 803 GGCGTAATAA TTGAAGTACT TTTCGTACTA ATCAAAGATT TACATAAATA ATCACTACTT 67500 .......... .......... .......... .......... .......... .......... 803 CTCTGAAACT GAGGCATAAT AATTGAAGTA TTTGTGCTGA CGGTTGAAGA TTTATACGGC 67560 .......... .......... .......... .......... .......... .......... 803 TGATCACAAC TTGTTTGGAA CTGAGACGTA ATAATTGACG AACCAACATG ATCCTCTATG 67620 .......... .......... .......... .......... .......... .......... 803 AGATCTTGAT TATCTTGTTG GGAAAATCCA TTTAATTGGA AAGTAGGAAT CTGAATCTGA 67680 .......... .......... .......... .......... .......... .......... 803 AGTAGGTGGT GGTTATGATC AACATGATCT TCACTAATAT TTTGTTTGTT ATCAGTGAAG 67740 .......... .......... .......... .......... .......... .......... 803 ATCTCCCAAA GATAATTATT GTCCATTGAA CTAATTATGC TCTAATGGAT ATATTTTGAT 67800 .......... .......... .......... .......... .......... .......... 803 CTTCTTTTTG GTATTTTAAT TGAAAGGTGA TAAAGAAGAA GATACGTTAG TTAGGGTTTT 67860 .......... .......... .......... .......... .......... .......... 803 GAGATCTCCC TAATATCTTC ATAAACAAGA AGATAATACA CTAATTGTTT ATTAGATTTT 67920 .......... .......... .......... .......... .......... .......... 803 TTAAAATTTA ATAGTACCTA TTCATATCTA CGCTTATGGT CCCACCATTT GCTTCCACAC 67980 .......... .......... .......... .......... .......... .......... 803 TATCCTTTTA ATTTATTTAG CGTCACTCTT TTAATACTTT ATAGATGTTG ACGGAGATTG 68040 .......... .......... .......... .......... .......... .......... 803 ATTTTACGTA TTATTAATTT GATTAATTAA TTTTTAATTT TTAAATGTGT AAAATTAATA 68100 .......... .......... .......... .......... .......... .......... 803 ATTTTTCATG TATTTTTGGT TTAAGAGTTT TTGGTTTAAT TAATCAGAAG ATACTTATAA 68160 .......... .......... .......... .......... .......... .......... 803 AATAAATATA TAATTTTTCT AATAAATTTG AGGCGAAAAC ACAATAATGT AATTTTACAA 68220 .......... .......... .......... .......... .......... .......... 803 ATATTCATAA AATAGAAACA ATAATAATAA ATATGAAAAG AACTATAAAT GTGTAACACG 68280 .......... .......... .......... .......... .......... .......... 803 GAGGGAAATG ATTAAGAAGA ATAAGGTTCT TACTTTAGAT TTTAACGTTT TGTATTATGT 68340 .......... .......... .......... .......... .......... .......... 803 GAATTTGTGA ACTCAAAGTC ATTATGAAGT GTAAATTGAA GGTTAAATGA CAAATATAGT 68400 .......... .......... .......... .......... .......... .......... 803 AACTAGTAAG ATATTGAAAT TAATATATAA TATTATGTAC AAGTAAAAGT AGTAAATTAC 68460 .......... .......... .......... .......... .......... .......... 803 TAGTCTTAAT GGGTTATCGA TTTACCCAAT ATTAAGAACC AATACCGAAC TGATAATTCA 68520 .......... .......... .......... .......... .......... .......... 803 ATAATTTTTT TATAAAATTA TTAAAAACCC GTTGAAAAAA TCGTTAACAC TTTTATGGGT 68580 .......... .......... .......... .......... .......... .......... 803 TTGATTTTTG TACACCCCTA CGTATAATAA TATTTATATC ATCAGATTAT CTAAAAAGTA 68640 .......... .......... .......... .......... .......... .......... 803 AATAAAAAAT ATAATCAAAG TGTGAAAAAT TGATATGACC TCGAATTGAT CGAACTCCAA 68700 .......... .......... .......... .......... .......... .......... 803 GTTGGTCTGA CCGGTCCAGA CCCTTTTCTG GATGAGCAAT TTTCACATAT AGCAAACATG 68760 .......... .......... .......... .......... .......... .......... 803 AAAATTATAT TTGTATATTA TATCTATAGT TTATATAATT GCGTTTCATA GCAAATTTTA 68820 .......... .......... .......... .......... .......... .......... 803 CGTTTGATAT GACTATTAAT CTTTATATTT TGATATACAT ATACACGAAA AAATTGTATA 68880 .......... .......... .......... .......... .......... .......... 803 TCGATCTATT TATATATTTC GGTATATAAA TGGATCCTGT AATGGGTCCT GTATTTGTAT 68940 .......... .......... .......... .......... .......... .......... 803 ATTTTTGTAC ACAAATGAAT CTTGCATTAG TATATTTCAA TACAAATGAG ATGACAAAAA 69000 .......... .......... .......... .......... .......... .......... 803 AAAGATATTT ACTGTGAATC ACAAATAAAA TAAACTATAA CTGTAATATT TAATTTGAAT 69060 .......... .......... .......... .......... .......... .......... 803 TAATAATTTA TTATTTTATG TAATTTTTCA TCTCGGAATA TCTTCCGGCC AAACCTTCCA 69120 .......... .......... .......... .......... .......... .......... 803 AATTGGGCCG GGCCGAGTTG GGCTTTATCG GTTTTAATGA GCTAGATCCG GGCTGATCCG 69180 .......... .......... .......... .......... .......... .......... 803 GGCCAGTTGA CAAGTTTAGT TGTGAGGTTG TCTCTTTATT TTATTTAGTC ATATATTTTG 69240 .......... .......... .......... .......... .......... .......... 803 GTTAGTTATT TGCACCTAAA TTGGAATAAA CTTCCTAGTG GTCAAAGATT CTTAATTTAT 69300 .......... .......... .......... .......... .......... .......... 803 TTTATGACTA TGGTGCTTAA GTAAGCATGT TAAATATAAA TATCAGTTAA TTTTATATAT 69360 .......... .......... .......... .......... .......... .......... 803 TAAAACTTTT TTTTTTTTTT GAAAAAAAGC ATTTATTATA TACTTTTTTC CCTATTAAAA 69420 .......... .......... .......... .......... .......... .......... 803 TTTAAATTGA GATATCATAA TTTTCTATTC ACTTCATTAG AGGTGTATAT GATCGGGTTG 69480 .......... .......... .......... .......... .......... .......... 803 GTTTGGGTTT TTCAAATATC AAATCAAACT ATTTGTATCG AATTTTTAAA TATATAAACC 69540 .......... .......... .......... .......... .......... .......... 803 AAACCAAAGC ATAAACTCAA GGTTTTCAAC CTTGGGTATT TTTGAATTTT TTCGCTAAAG 69600 .......... .......... .......... .......... .......... .......... 803 TATTTATATA AAAATATAAT TTATTTGTAA TTCAAATATT TCTTCAGTCC TATCAAAATA 69660 .......... .......... .......... .......... .......... .......... 803 TAACTATCTA AGGTGTTTCT GAATATAACA TAAACAATGG TATGATTAAT GATACTAAAA 69720 .......... .......... .......... .......... .......... .......... 803 TATCAAATGT GTGTGTGTGT GTGTATATAT ATATATATAT ATATATATAT ATATATATAT 69780 .......... .......... .......... .......... .......... .......... 803 ATATAAATTA AAATTGCGTA AAATAAATAT TGCAAATTAA CAAGTCATAA TGAAAATGAT 69840 .......... .......... .......... .......... .......... .......... 803 CATAATTTAA AAGTACTAAA AAATGCTGAA ACAAGTTTTG TTATAAAATC AAACTCATAA 69900 .......... .......... .......... .......... .......... .......... 803 GAATATAATC ATAAAGAGAA GAAGACAAGA GAAACATATA GAAGAAGAGG AATTTTCTTC 69960 .......... .......... .......... .......... .......... .......... 803 TTATTCAAGT GTATTCAAGT CTCATATGTA TATATTACAA TAGTGAATGA ACAACCCTAT 70020 .......... .......... .......... .......... .......... .......... 803 TTATAGAGTG AGAAATCACT CCAAAGGTCA CCATATTAAG TATCATAATA AATAGATACA 70080 .......... .......... .......... .......... .......... .......... 803 TTCTTATCTA AAAAGGTTCA TGTAACCTAG TGAATATGAA TGGTTGTTCA TGTACCAACC 70140 .......... .......... .......... .......... .......... .......... 803 TTATGGACTA TCCACTCATT CAATGAATTT ATAACACTCC CCCTTGGATG TCCATAGATA 70200 .......... .......... .......... .......... .......... .......... 803 ATGTGCCTCG TTAAAACCTT ACTAGGAAAA ACCCTGTGGA AAAAAATTCT AGTGAAGGAA 70260 .......... .......... .......... .......... .......... .......... 803 AAAGAGTACA CATATCTTTT GATACGCACC ATTTGTTGCC TCATTAAAAA CCTTGCCAGG 70320 .......... .......... .......... .......... .......... .......... 803 AAAACCCAGT GGGAAAAAAA CCTCGATCAA GGGAAAAAGA GTGCAACGCG TATTGTACTC 70380 .......... .......... .......... .......... .......... .......... 803 CCCCTGATTA AAACATCACT TAATTTCTTG TGATGATGTC TCCAGTCTTT GTACATTGTG 70440 .......... .......... .......... .......... .......... .......... 803 GTTGGTATAT TCTTTTTTTT AAAGAAAAAC CACACATTTC TTGAATATGG TGTGTCATTT 70500 .......... .......... .......... .......... .......... .......... 803 ATCTCAACCA GACGCACTTT TGACTTGCTT CATGGAGGAC TTTTATTTCT GCAGAGCAAC 70560 .......... .......... .......... .......... .......... .......... 803 ATTTGCTTCA TTGATCCCCA GGATATTATT GTGCCTCCAC ATGCAAACAC ATAGCGTGCT 70620 .......... .......... .......... .......... .......... .......... 803 TGAGATAGAG CTTTATGCGG ATCAAATAAA TATTATGCAT CTGCGTAATC AATCAATTTT 70680 .......... .......... .......... .......... .......... .......... 803 GTCTTGGATT CCTCGGGATA GAATAAACTC ATAACTATGG TCCTTCGAGG ATATTCAATC 70740 .......... .......... .......... .......... .......... .......... 803 ATGTGCTCAA CACCATTTCA ATGTCCTTTT ATCGGGGAGA AACTGAATCT TGCCAGTAAA 70800 .......... .......... .......... .......... .......... .......... 803 TTTACTGCAA AACAAATATC TGGTCAAATA TTGTTAGTAA GATGCACTAG TGCCCTGATT 70860 .......... .......... .......... .......... .......... .......... 803 GCACCAAGAT AGAGTTTCAT CACCAAGAAA CTGTTCATCC TTCTTTTGAG ATCAAACTGA 70920 .......... .......... .......... .......... .......... .......... 803 ATCATCATTT ATGTCAAGTG ATCTCATAAT CATTGGGGTA CTCAATGAAT GCGAGATATC 70980 .......... .......... .......... .......... .......... .......... 803 CACATAAAAT TGCATCAAAA TTTTTCAGTG TATGTGCACT GTTAGACAAA TATTTTATTT 71040 .......... .......... .......... .......... .......... .......... 803 GTCAAATTAA CAATCTGTAG GTCTAGACAA AATTTTGTCT TACCAAGACC TTTTCATAAA 71100 .......... .......... .......... .......... .......... .......... 803 AATACAAGGA AAATTAGGGT CATTCTTTTG TACCCTTTTT CTTCCTTGAC TATTTCAATC 71160 .......... .......... .......... .......... .......... .......... 803 CATATAAGGA TTCTTGAAGC TTTATTGGAA AAGTTTCTTT CAAACTCTTA TATGCTTCAG 71220 .......... .......... .......... .......... .......... .......... 803 ACACCTCGAT TGTTTCAAGA ATTTTCATAT AACCTTCATT GTCTAGCTAG ACATTACAAT 71280 .......... .......... .......... .......... .......... .......... 803 TATTCATTTA CATGCATTCA AGTTTTCATA TGTTGCCAGA TCAAAACAAA CCTCATTTCA 71340 .......... .......... .......... .......... .......... .......... 803 TCCACCAAAG GAGAAAACAT CTCCACTAAT GTCAGGATTT TTGCGATAAA CCTTTATGCC 71400 .......... .......... .......... .......... .......... .......... 803 CCAAGGCACA AACCGTATTT GATATCTTAC GGTTATACTA TCGCATAATA ATTTATTTGT 71460 .......... .......... .......... .......... .......... .......... 803 ACCCCACTGA CATTATATCT TGATTTATGG ACTATCAGTC CAAAATGTCA CTTTTCTAAG 71520 .......... .......... .......... .......... .......... .......... 803 TGAAACAAAA TATAATTGAA TTGTGCATTT TACTTGGCCA ACCATTTATC TGTTCACACT 71580 .......... .......... .......... .......... .......... .......... 803 ATATGACAGA TTTGAATTCA AGATCCTCAT CACAATTTAT ATCATTGAGC GCTACCTCAT 71640 .......... .......... .......... .......... .......... .......... 803 ATCAAAGATA CCTTCGACGG TTATTTGATA TCGATTCCAA CAGGACATAA CTTATCGAGA 71700 .......... .......... .......... .......... .......... .......... 803 TCTCTTCATT TTTCATTATT TCAGGTACCT GAACCTTTTT CAAAATTTTA TGAAGTGTTA 71760 .......... .......... .......... .......... .......... .......... 803 TGTCAATGTG CTCTTTTATA GCACTTGCCT CATTATCATG ACCATATTGA TCATTTGCTC 71820 .......... .......... .......... .......... .......... .......... 803 CTTCCTTCTT CAAGGAATTT TATATTTGGA ATCGATTGGT CTATTACGCT TCCGGCGTAT 71880 .......... .......... .......... .......... .......... .......... 803 CATAGACTCT GTCCTTCAAG GACTTCACAT TGGAGCACTT GCAGCTGAAT TATATCATAA 71940 .......... .......... .......... .......... .......... .......... 803 GATCAGTGCA TTCATCTGGC AACTGATTTT CAACATTATG CAACTGAATT ATCTCTTGAA 72000 .......... .......... .......... .......... .......... .......... 803 CTTTAAGTTC ACATATTTAT TTTAACGAGG ATCTAGATAA TTCATAATTA ACTTCACACA 72060 .......... .......... .......... .......... .......... .......... 803 TAATTTTCAG CTGTCAATCA TTTCTCCCCC TAATGTTAGA AAATCTAACA TTCATTCCAA 72120 .......... .......... .......... .......... .......... .......... 803 TCTTATTTGG AAGTTCATCT TTGTGCGTGG TGGAGCAATT AAAATCATAA TCGCACATTC 72180 |||||| ||| | | | || | | ||| |||| .......... ...TTCATCG TTG-GTCT-G TGTATCGATT --AATC.... .......... 832 AACATTTTAG ATGAAAATTA TATGGTTCCT GACCCTGAAC CAATTTTAAT GGGGAAACCT 72240 .......... .......... .......... .......... .......... .......... 832 TTTTTGTTTG ATGCATACAT GTGTTGCTAC ATGTTAAACA AAATTATCTC ATACCAAATT 72300 .......... .......... .......... .......... .......... .......... 832 TTATTTGGGA GTTTTGTTCT CATAACCATG ATTTAGCTAT AATTGGAGGC ATACAATTTC 72360 .......... .......... .......... .......... .......... .......... 832 TGCTAAATCA ACCACTATTA TCAATATAAT TTCATAATTT GAAACTATGC TATTAATTTT 72420 .......... .......... .......... .......... .......... .......... 832 AACAATTCGA GCAAACAACT TTACAAAAAC CAATTTGTAA GTTGACAACA AAATACATGT 72480 .......... .......... .......... .......... .......... .......... 832 GACCATCGCA TAGATGTATT AATCATATAA TTGCAAATGA TCCACATGAC AGGTAAACAG 72540 .......... .......... .......... .......... .......... .......... 832 ACCCATATTC ACCCTTTTAT ATGTTCCAAA AAATTTTACG GGATTACAGT CCCGACCTTA 72600 .......... .......... .......... .......... .......... .......... 832 GCTGGTACAA TGATCATTTT ATCAATAGAA CAAGCAACAC AAGAGAATTC TTGAAGAATC 72660 .......... .......... .......... .......... .......... .......... 832 TTTATTTCTT AATCATATAA ATCACTTGAA TTCTCAATCA CGTTTGCATT ACATTTAATC 72720 .......... .......... .......... .......... .......... .......... 832 GGAATGGTCA ATCAGTCATG CCAACTGATC TTTATTTTAG TACACTTATA ATTTACTTTT 72780 .......... .......... .......... .......... .......... .......... 832 ACATGTGATT TCATCAGCAT GTACATGTTT GTATGGAACA AACAGGAGGA AAAGTGGGGT 72840 | | |||| ||||| || | || | ||| .......... .......CTT GTAC-TGTTT GTGTCTAATA AAC....... .......... 857 AATCTTTTAC ATAAACATTA TAACCCTCTT CGGTTGTAGT GATTTATAGT CTCAATATTT 72900 .......... .......... .......... .......... .......... .......... 857 ATTTTCATTC ATCCGAAACT TAAAAAGTTT CTTTTGAGAC TTACTACAAC CCCAATATTA 72960 .......... .......... .......... .......... .......... .......... 857 TTCCTCTGAT AATAGCACAA CTCGTTAGAG CTTACAATTA ATTTTGTGTA CTATGACATA 73020 .......... .......... .......... .......... .......... .......... 857 TTGCATGACA CCATCCCTAT GGTGAGCATC TCCTTCAATG AGGAAATTAT ATTATTATTG 73080 .......... .......... .......... .......... .......... .......... 857 TCATGATCAA AATCATTACG AGCAAGACGT CTTTCTTGCT TTACTTTTGT TGTACCATAG 73140 .......... .......... .......... .......... .......... .......... 857 GTACACAATC AATGACAAAT TTGTATTGCC ACACAATAAT GACCACAACC CTTATAGGCA 73200 .......... .......... .......... .......... .......... .......... 857 AGAACTATTT CTTTCTTTTG CCCTTTCGGT AGTTTATATC ATAGTGACGT ATAAAACGTA 73260 .......... .......... .......... .......... .......... .......... 857 CAATACAATT AGCCATTTTT ACCAAATAAA ATTTATTTTC TCTCAAATCT CCCGTAGAGA 73320 .......... .......... .......... .......... .......... .......... 857 TGAGAAGTGA CATGTATCAT TTTTTTTCTC AAACCTTCTA TAGAGGTGAG TTGTGACATA 73380 | | || | |||| |||| || ||| .......... ......TAAG TTATCTTCTG AAACTTTTAT TAGTT..... .......... 886 TATCCAACCC ATAATGATTT GCTCACATCT GGACCCATTC TCTTATAGAA TGGTCGAGCA 73440 .......... .......... .......... .......... .......... .......... 886 TTCACGATAC TCATCATAAG TCACATTCTC TTCAAGGAAT GAACTAATTA TTTATATGTA 73500 .......... .......... .......... .......... .......... .......... 886 CTTCATCAAT TTGCATTATA AGCACATTGT CATGACTTTA AAATTCATCA TATTTCCATG 73560 .......... .......... .......... .......... .......... .......... 886 ACACACCTCA ACATCATTTT GAATGATCAA GTGTACCATC ACTTTATTTT CTTGTATCAT 73620 .......... .......... .......... .......... .......... .......... 886 GATAGAGGAT TTATAAACTT GTCAAATTAT TGGGTTGACA ACATTCACAA GTCTAATGAC 73680 .......... .......... .......... .......... .......... .......... 886 CTTTCGTACC ATTTTGATAA TGAAAACTGT CTTCACATTT CGAAGGACTA TTTGGAGAAC 73740 .......... .......... .......... .......... .......... .......... 886 CCATAGTGTT CTTTCTTTTA TAATTACCAC AATGATGACT ATTATAATTC CGTCCCTTCA 73800 .......... .......... .......... .......... .......... .......... 886 TGCCTTTTTT CATATCGTTA ATCATTTGTC ATCTTTCAGA CTAACTATTT ACTGCTACCA 73860 .......... .......... .......... .......... .......... .......... 886 CATTCATACG ATAGAATGGA GCAAGTGAAC ATGCAATTTT AAAATTATCG TATGTTGCTA 73920 |||| | .......... ....AATGTA AGGT...... .......... .......... .......... 896 CCACATTCTT TTCAAGGAAT GGAGCAAATT CAACAGGACG AATTTCAAGA CTTTTCATCA 73980 .......... .......... .......... .......... .......... .......... 896 AAGCCTTAGT CATCATTATA TCATCACTAT GGTGCATTGA CTCAAACCCA ATGTCTTATC 74040 .......... .......... .......... .......... .......... .......... 896 CATATAAGGC GTGTTACGCC ATAATTCTAT GGGACTTGAA CCCAATCGCG GTGCATCAAA 74100 .......... .......... .......... .......... .......... .......... 896 ACTCAAATTG ATGTCTTACC CTTTTGGTGC GATAGAACTT GAATCTATCG TCTTATCCTC 74160 .......... .......... .......... .......... .......... .......... 896 AAATATTTTT CATTATGGTG CATCCGGACT TTAACCTGAT GTCTTACCCT TTTGGTGCGA 74220 .......... .......... .......... .......... .......... .......... 896 TGAAACTTGA ATTCATCGTC TTACCCTTAA CCAACGGTAT AATAAACCGA AGTACAACAT 74280 .......... .......... .......... .......... .......... .......... 896 GACTCACCCT TTGAGTCATT CAAAACAATC AAAATAACAT TCAAATTCAT GCTATTAAAA 74340 .......... .......... .......... .......... .......... .......... 896 TTTTATACCT TCTTCTATTT AAGAAGTTTT GTATAGCATG TCATATTCAA CCAATAGTAG 74400 .......... .......... .......... .......... .......... .......... 896 TATATGTCTT CGGTTCCTGA TAATTATTAG TGATTGAATA CTATTTTATT CTAAATCATA 74460 .......... .......... .......... .......... .......... .......... 896 AAATATTTTA GAAAATACAT ACCTGATTTT ATGCATATAA TACTTTGATC TTCATAACGA 74520 .......... .......... .......... .......... .......... .......... 896 GATCAACCAA AATATGAGCT AAAGAAAAAC AAGAACCCAC TCTCAATTAG ATAGAGACTC 74580 .......... .......... .......... .......... .......... .......... 896 GTGCTGATAA CGTGTTATAA AATCAAGCTC ATAAGAATAT AATCATAAAG AGAAGAAGAT 74640 ||||||||| |||||||| | |||||||||| |||||||||| ||||| |||| .......... .GTGTTATAA AATCAAGCCC ATAAGAATAT AATCATAAAG AGAAGTAGAT 945 AAGAGAAGTA TATAGAAGAA GAGGAATTTT CTTCTTATTC AAGTGTATTC AAGTCTCATA 74700 | |||| AGAAGAA... .......... .......... .......... .......... .......... 952 TGTATCTATT ACAATACTGA ATGAACAACC CTATTTATAG AGTGAGAAAT CACTCCAAAG 74760 .......... .......... .......... .......... .......... .......... 952 GTTACCATAT TAAGTATCAT AGTAAATAGA TACATTCTTA TCCAAAAAGG TTCATGTAAC 74820 .......... .......... .......... .......... .......... .......... 952 CTAGTGAATA TGAATGGTTG TTCATGTACC AACCTTATGG ACTATCCACT CATTCAATGA 74880 .......... .......... .......... .......... .......... .......... 952 ATTTATAACA AATAAGCATT AGTTACATGA CTAAATATTA AAGAAAATTA AAATTAGATT 74940 .......... .......... .......... .......... .......... .......... 952 ATGTATTTCA ATTGTCTAAA TCAACGTAAA ATCAAAGAAC AAATATTTAA TATTATTGTC 75000 .......... .......... .......... .......... .......... .......... 952 ATTCTTAATG CTAAATAGAT TTCCTTTTTG CATCAATATT AATTTAATTT TGATTTGAGT 75060 .......... .......... .......... .......... .......... .......... 952 TTTATTATAA TTACCAACAC ATGTGGACTA TAATATTTAT GGAACCATCC AAAATTCTAA 75120 .......... .......... .......... .......... .......... .......... 952 GTTTCAAACT TAAATATAAG AAATATTTAA AAATTATATC AAAGTAAATA TTTTTATACG 75180 .......... .......... .......... .......... .......... .......... 952 CAAATAATAC CTCTTTATTT AAAAATTATA TATTTAAGCG GTGATTCTCG TGCCTAGTTT 75240 || | ||| .......... .......... .......... .......... .......... TGAGAATTTT 962 -TTCTTAATC AACAATA 75256 |||||| || || | || CTTCTTATTC AAAAGTA 979 hqPGS_C09HBa0099P03.1-2+_SGN-U313537+ (65401 65426,72134 72166,72798 72823,73337 73365,73875 73884,74592 74647) ******************************************************************************** EST sequence 16 -strand 1099 n (File: SGN-U322792-) 1 AGTAAATTTA CTGCAAAACA AATATCTGGA ATAATATTGT TAGTAAGATG CACTAGTGCC 61 CTGATTGCAC CAAGATAGAG TTTCATCACC AAGAAACTCA TCATCCTTCT TTTGAGATCA 121 AACTGAATCA TCATTTATGT CAAGTAATCT CATAATCATT GGGGTACTCA ATGAATGCGA 181 TTTATTCACA TAAAATTGCA TCAAAATTTT TCAGTGTATG TGCACTGTTA GACAAATATT 241 TTATTTGTCA AATTAACAAT CTGTAGGTCA AGACAAAATT TTGTCTTACC AAGACCTTTT 301 CATAAAAATA CAAGGACAAT TAGGATCATT CTTTTGTACC CTTTTCTTCC TTGACTATTT 361 CAATCCATAT AAGGATTCTT GAAGCTTTAT TGAAAAAGTT TCTTTCAAAC TCTTGTATGC 421 TTCAGACACC TCGATTGTTT CAAGGATTTT CATATAACCT TCATTGTCTA GCTAGACATT 481 ACAATTATTC ATTACATGCA TTCAAGCTTT TCATATGTTG CCAGATCAAA ACAAACCTCA 541 TTGCATCCAC CAAAGGAGAA AACATCTCCA ATAATGTCAG GATTTTTGCG ATAAACCTTT 601 ATGCCCCAAG ACACAAACCG TATTTGATAT CTTACGGTTA TATTATTGCA TAATAATTTA 661 TTTGTACCCC ACTGACATTA TACCTTGATT TATGGACTAC CAGTCCAAAA TGTCACTTTT 721 CTAAGTGAAA CAAAATATAC TTGAATTGTG CATTTTACTT GGTCAACCAT TAATCTGTCC 781 ACACTATATG ACAGATTTGA ATTCAAGATC CTCATCACAA TTTATATCAT TGAGCGCTAC 841 CTCATATCAA AGATACCGTC GACGGTTATT TGATATCGAT TCCAACAAGA CATAACTTAT 901 CGAGATCTCT TCAATTTTTT TCATTATTTC AGGTACCTGA ACCTTTTTCA AGATTTTATG 961 AAGTGTTATG TCAATGTGCT CCTTTATAGC ACTTGCCTCA TTATCATGAC CATATTGATC 1021 ATTTGCTCCT TCCTTCTTTA AGGAATTTTA TATTTGGAAT CAATTGGTCT ATTACGCTTT 1081 CGGCGTATCA TAAACTCTG Predicted gene structure (within gDNA segment 70185 to 72671): Exon 1 70795 71891 (1097 n); cDNA 1 1099 (1099 n); score: 0.958 MATCH C09HBa0099P03.1-2+ SGN-U322792- 0.958 1097 0.998 C PGS_C09HBa0099P03.1-2+_SGN-U322792- (70795 71891) Alignment (genomic DNA sequence = upper lines): AGTAAATTTA CTGCAAAACA AATATCTGGT CAAATATTGT TAGTAAGATG CACTAGTGCC 70854 |||||||||| |||||||||| ||||||||| |||||||| |||||||||| |||||||||| AGTAAATTTA CTGCAAAACA AATATCTGGA ATAATATTGT TAGTAAGATG CACTAGTGCC 60 CTGATTGCAC CAAGATAGAG TTTCATCACC AAGAAACTGT TCATCCTTCT TTTGAGATCA 70914 |||||||||| |||||||||| |||||||||| |||||||| |||||||||| |||||||||| CTGATTGCAC CAAGATAGAG TTTCATCACC AAGAAACTCA TCATCCTTCT TTTGAGATCA 120 AACTGAATCA TCATTTATGT CAAGTGATCT CATAATCATT GGGGTACTCA ATGAATGCGA 70974 |||||||||| |||||||||| ||||| |||| |||||||||| |||||||||| |||||||||| AACTGAATCA TCATTTATGT CAAGTAATCT CATAATCATT GGGGTACTCA ATGAATGCGA 180 GATATCCACA TAAAATTGCA TCAAAATTTT TCAGTGTATG TGCACTGTTA GACAAATATT 71034 ||| |||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTATTCACA TAAAATTGCA TCAAAATTTT TCAGTGTATG TGCACTGTTA GACAAATATT 240 TTATTTGTCA AATTAACAAT CTGTAGGTCT AGACAAAATT TTGTCTTACC AAGACCTTTT 71094 |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| TTATTTGTCA AATTAACAAT CTGTAGGTCA AGACAAAATT TTGTCTTACC AAGACCTTTT 300 CATAAAAATA CAAGGAAAAT TAGGGTCATT CTTTTGTACC CTTTTTCTTC CTTGACTATT 71154 |||||||||| |||||| ||| |||| ||||| |||||||||| | |||||||| |||||||||| CATAAAAATA CAAGGACAAT TAGGATCATT CTTTTGTACC C-TTTTCTTC CTTGACTATT 359 TCAATCCATA TAAGGATTCT TGAAGCTTTA TTGGAAAAGT TTCTTTCAAA CTCTTATATG 71214 |||||||||| |||||||||| |||||||||| ||| |||||| |||||||||| ||||| |||| TCAATCCATA TAAGGATTCT TGAAGCTTTA TTGAAAAAGT TTCTTTCAAA CTCTTGTATG 419 CTTCAGACAC CTCGATTGTT TCAAGAATTT TCATATAACC TTCATTGTCT AGCTAGACAT 71274 |||||||||| |||||||||| ||||| |||| |||||||||| |||||||||| |||||||||| CTTCAGACAC CTCGATTGTT TCAAGGATTT TCATATAACC TTCATTGTCT AGCTAGACAT 479 TACAATTATT CATTTACATG CATTCAAG-T TTTCATATGT TGCCAGATCA AAACAAACCT 71333 |||||||||| || ||||||| |||||||| | |||||||||| |||||||||| |||||||||| TACAATTATT CA-TTACATG CATTCAAGCT TTTCATATGT TGCCAGATCA AAACAAACCT 538 CATTTCATCC ACCAAAGGAG AAAACATCTC CACTAATGTC AGGATTTTTG CGATAAACCT 71393 |||| ||||| |||||||||| |||||||||| || ||||||| |||||||||| |||||||||| CATTGCATCC ACCAAAGGAG AAAACATCTC CAATAATGTC AGGATTTTTG CGATAAACCT 598 TTATGCCCCA AGGCACAAAC CGTATTTGAT ATCTTACGGT TATACTATCG CATAATAATT 71453 |||||||||| || ||||||| |||||||||| |||||||||| |||| ||| | |||||||||| TTATGCCCCA AGACACAAAC CGTATTTGAT ATCTTACGGT TATATTATTG CATAATAATT 658 TATTTGTACC CCACTGACAT TATATCTTGA TTTATGGACT ATCAGTCCAA AATGTCACTT 71513 |||||||||| |||||||||| |||| ||||| |||||||||| | |||||||| |||||||||| TATTTGTACC CCACTGACAT TATACCTTGA TTTATGGACT ACCAGTCCAA AATGTCACTT 718 TTCTAAGTGA AACAAAATAT AATTGAATTG TGCATTTTAC TTGGCCAACC ATTTATCTGT 71573 |||||||||| |||||||||| | |||||||| |||||||||| |||| ||||| ||| |||||| TTCTAAGTGA AACAAAATAT ACTTGAATTG TGCATTTTAC TTGGTCAACC ATTAATCTGT 778 TCACACTATA TGACAGATTT GAATTCAAGA TCCTCATCAC AATTTATATC ATTGAGCGCT 71633 ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCACACTATA TGACAGATTT GAATTCAAGA TCCTCATCAC AATTTATATC ATTGAGCGCT 838 ACCTCATATC AAAGATACCT TCGACGGTTA TTTGATATCG ATTCCAACAG GACATAACTT 71693 |||||||||| ||||||||| |||||||||| |||||||||| ||||||||| |||||||||| ACCTCATATC AAAGATACCG TCGACGGTTA TTTGATATCG ATTCCAACAA GACATAACTT 898 ATCGAGATCT CTTC-A--TT TTTCATTATT TCAGGTACCT GAACCTTTTT CAAAATTTTA 71750 |||||||||| |||| | || |||||||||| |||||||||| |||||||||| ||| |||||| ATCGAGATCT CTTCAATTTT TTTCATTATT TCAGGTACCT GAACCTTTTT CAAGATTTTA 958 TGAAGTGTTA TGTCAATGTG CTCTTTTATA GCACTTGCCT CATTATCATG ACCATATTGA 71810 |||||||||| |||||||||| ||| |||||| |||||||||| |||||||||| |||||||||| TGAAGTGTTA TGTCAATGTG CTCCTTTATA GCACTTGCCT CATTATCATG ACCATATTGA 1018 TCATTTGCTC CTTCCTTCTT CAAGGAATTT TATATTTGGA ATCGATTGGT CTATTACGCT 71870 |||||||||| |||||||||| ||||||||| |||||||||| ||| |||||| |||||||||| TCATTTGCTC CTTCCTTCTT TAAGGAATTT TATATTTGGA ATCAATTGGT CTATTACGCT 1078 TCCGGCGTAT CATAGACTCT G 71891 | |||||||| |||| ||||| | TTCGGCGTAT CATAAACTCT G 1099 hqPGS_C09HBa0099P03.1-2+_SGN-U322792- (70795 71891) ******************************************************************************** EST sequence 2 -strand 917 n (File: SGN-U312404-) 1 AGAATATTGG TGTAAATCCA TTTCGACTAC CAGAAATTGT AGCATCTGCG GAATTAATAT 61 ACTCAATTTT TGAGAGGGTG CTAACGTAAC TTCTCCTTGT TATCTGAACG CCACAAGACA 121 TTGGACATCC CAAGAATTTG TGCCCTGAAA TTGAAATACT TCCAATTGGT TTCTTGAAGG 181 TAATTTTTTT TGCATGTTTG ATAAATGGGA GAATTAGCCC ACATAATGCT GCATCGCAAT 241 GGATATAATA ATTGTCATTT GAATAACCAC AATTTTCAAG TGTTTGTATG ACGAAATCGA 301 GGTCATCAAT AGCTCCTTTG AAGGTTGTTC CAATATTGAT ATTGATGATA GCTGGTTTGT 361 TCTTGTTGAC AAGTAACTTT GATTGTAAAT CTTCATAATC AATTTCCCCA TTAACTAAAG 421 TGTTGATAGT TTGTAGCTCC ATTCGATACA TTCTTGCTGC TTTGAAAATC GAGTAATGTG 481 AATCTTTTGA TGCATATAAT ATCCATTAGG AAGTAGCTCT CTTCTGCACA AATATATAAT 541 ATTGTTCAAA TGCTTTATGA AAGTGATGAA GCACTTGCAT CATTTAACCA TAAGAAAAAA 601 ACTCAAATAT GTTATCAAAT TTGTTTAAAA AAGACTCATT TATGTCATTC ATTATGAAAA 661 ATATTTGAGG GTAAGACGGT AGATTCAAGT TCTATCGCAC CAAGAGGGTA AGACATCAAT 721 TCGAGTTTTG ATGCACCATA ATGATAAGGT ATTGGGTTCA AGTCCCATAG AATTATGGCC 781 TAATGTGCCT TGTATGGATA AGACATTGAG TTTGAGTCTC CATGCACCAA ATTGATGATA 841 AGATGATGAC TGAGGCAAAA TAATATATTT TTGATGAAAA GTATTGAAAT TCATCATATT 901 AAATTTGCTC CATTCCT Predicted gene structure (within gDNA segment 80423 to 72339): Exon 1 80136 80114 ( 23 n); cDNA 441 461 ( 21 n); score: 0.652 Intron 1 80113 78532 (1582 n); Pd: 0.465 (s: 0), Pa: 0.470 (s: 0) Exon 2 78531 78519 ( 13 n); cDNA 462 473 ( 12 n); score: 0.615 Intron 2 78518 78092 ( 427 n); Pd: 0.380 (s: 0), Pa: 0.094 (s: 0) Exon 3 78091 78087 ( 5 n); cDNA 474 478 ( 5 n); score: 1.000 Intron 3 78086 76725 (1362 n); Pd: 0.535 (s: 0), Pa: 0.000 (s: 0.64) Exon 4 76724 76683 ( 42 n); cDNA 479 519 ( 41 n); score: 0.643 Intron 4 76682 76261 ( 422 n); Pd: 0.000 (s: 0.64), Pa: 0.051 (s: 0.52) Exon 5 76260 76175 ( 86 n); cDNA 520 606 ( 87 n); score: 0.570 Intron 5 76174 75605 ( 570 n); Pd: 0.000 (s: 0.60), Pa: 0.646 (s: 0) Exon 6 75604 75569 ( 36 n); cDNA 607 640 ( 34 n); score: 0.694 Intron 6 75568 74795 ( 774 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0) Exon 7 74794 74766 ( 29 n); cDNA 641 667 ( 27 n); score: 0.655 Intron 7 74765 74248 ( 518 n); Pd: 0.900 (s: 0), Pa: 0.000 (s: 0.88) Exon 8 74247 74167 ( 81 n); cDNA 668 747 ( 80 n); score: 0.827 Intron 8 74166 74090 ( 77 n); Pd: 0.000 (s: 0.82), Pa: 0.000 (s: 0.86) Exon 9 74089 73985 ( 105 n); cDNA 748 854 ( 107 n); score: 0.819 MATCH C09HBa0099P03.1-2- SGN-U312404- 0.743 420 0.458 C PGS_C09HBa0099P03.1-2-_SGN-U312404- (80136 80114,78531 78519,78091 78087,76724 76683,76260 76175,75604 75569,74794 74766,74247 74167,74089 73985) Alignment (genomic DNA sequence = upper lines): ATTTGTGACA CACTTAGCAA GCTGTACTTT AACCCATCCA CATGGTAAAC ATTGTCAATT 80077 ||| | ||| ||| || ||| ATTCGATACA TTCTT-GC-T GCT....... .......... .......... .......... 461 GAATCTTCAA GAGATCTTCC TACTTTGCCA ACTCCCAAAA TGTACCCCTT CTTGCCATCA 80017 .......... .......... .......... .......... .......... .......... 461 CCAAATGAGA CACCTCCTCC TTGGAGGGTC TTGAGTGAGA GGAAGTTCTT TACATCACCA 79957 .......... .......... .......... .......... .......... .......... 461 GTCATATGTT TAGAGCAGCC ACTATCCATA TACCAACATT GTCTGCTGCT CCTCTCACTC 79897 .......... .......... .......... .......... .......... .......... 461 ACCTGCATCA AAAATCACTT GTTAAGCTTG GGAACTCATT TCAGCTTGAG TTCCCAGTAG 79837 .......... .......... .......... .......... .......... .......... 461 GCAAACAAAG GAGTGATCAG ATTGTACTTG GTCCAATATG GTAGACTTTG AATCTTTCTA 79777 .......... .......... .......... .......... .......... .......... 461 ACAAAGGACC TAGGAGCAGG AACAGATTTT TTCTTTGAAA ATCTGTGAGT TGAGACAGGA 79717 .......... .......... .......... .......... .......... .......... 461 TCTGTAGGAC CAGGTCTCTC ATTTGGTACA TTTTTTCTTT CAGCATACTT AAGAGTATCT 79657 .......... .......... .......... .......... .......... .......... 461 CTCACATGAG TTTCTCCAGC TAACACACTC ATTCTTCAGA TGTCCATTCT TACCACAATG 79597 .......... .......... .......... .......... .......... .......... 461 CAGACACAAT AGATTGTCAG ACACAAACAC ATACTTACTG TGAGGATTAT AAAGAGGACT 79537 .......... .......... .......... .......... .......... .......... 461 TATGTTCAGA CTTCTTAGTC CTTTCTTATT GAAATTACTC TGATTTGTCA CATTTGACAG 79477 .......... .......... .......... .......... .......... .......... 461 CAACTTTGAG GATTTGATCC ACTTAAGAGA ATTTTCAAGC TCTTTCTTAA ATTTCACAAC 79417 .......... .......... .......... .......... .......... .......... 461 ATCCTGTTCT AATTTGTTAC TCTTTTCAAG TGACAAACTA AGATTTTTCT CAGAGTTTTT 79357 .......... .......... .......... .......... .......... .......... 461 CAATTTTTCT TGAATTTCAG CTTGCAGACT ATTTGGCTTT CCATTCAGAT TTTCAGCTTC 79297 .......... .......... .......... .......... .......... .......... 461 TTCAGTAATC TGACACAACT GGTTCTTAAG TTCGGAGTTT TTTAACTCTA GAGACACCAT 79237 .......... .......... .......... .......... .......... .......... 461 TCTTGACATT GTGTCTTCAA ATTGACCTTT GTTTTCAGTT AAAATTTCAA GTTCAGCAAT 79177 .......... .......... .......... .......... .......... .......... 461 TATGGTATCT TTTTCAGATG TTAACTCTAT CACAGAATCT AACATGACTT TTGACAAGGT 79117 .......... .......... .......... .......... .......... .......... 461 TCTCAATTTT TTAAGAGAAT ACTTATCCAA GTCAATTTTC ATGTCAAGAA GAGTTACCTG 79057 .......... .......... .......... .......... .......... .......... 461 ATTGTCTTCT TCTTCATTTT CTGTGTGTGC CATAAGAGCA AACATTTCAT TGAAGACAGT 78997 .......... .......... .......... .......... .......... .......... 461 TTCCTCCTCA TGCACAGCAA CCATTGACAC ATCTTTTGGC TCATCAGGAT CTTCTGAGTC 78937 .......... .......... .......... .......... .......... .......... 461 ACTAGAAGAA TCCCCTCATG CAGCAAGAGC CCTTTTGACA ACCATATCAG CAGCAACTTT 78877 .......... .......... .......... .......... .......... .......... 461 ACGATCTCTG TTACCAAGTA CCAGGTCCCT TCTGTTTTCT TTGTCACCCC TGTGTTTTTG 78817 .......... .......... .......... .......... .......... .......... 461 ATGTTCCTTG TTTTCATTCT TGAGTAAAGG ACACTCTCTG ATGAAGTGCC CAGACTTTCC 78757 .......... .......... .......... .......... .......... .......... 461 ACACTTGTAG CAAGTATCCC CTTGAGCAGC ATTGCGAGTC CCATTTGTTC CTCTTTTATA 78697 .......... .......... .......... .......... .......... .......... 461 CATTTTGTTT TTTCTCACAA TCTTTTGAAA TCTACTGATG AGATAAGCCA TATCATCATC 78637 .......... .......... .......... .......... .......... .......... 461 ATCACTTGAA TCTTCATCTG ATTTATACTT CAACATCAAT GACTTGTCCT TCTTGGCTTC 78577 .......... .......... .......... .......... .......... .......... 461 CTTTTTTGAC AAATCATAGT TTCGATTCAT CTCATGTGTT TTAAGATTAC CAATCAAGGC 78517 | | |||| || .......... .......... .......... .......... .....TTGAA -AATCGAG.. 473 ATCCATGGTC AGCACCTTCA AGTCCTTGGC TTCTGTAATG GCATCAACTT TGCTCTCCCA 78457 .......... .......... .......... .......... .......... .......... 473 GGACTTTGGA AGAATTCAAA GCACTTTCCT GACTTGTTTG GTCATGCTTA TAGGTTCACC 78397 .......... .......... .......... .......... .......... .......... 473 CAAACTTCGT AGCTCTTTTG TAATGGAAGA CAACTTGGTG AACATGTCAT GTATAGTTTC 78337 .......... .......... .......... .......... .......... .......... 473 TCCTTCCTTC ATTTTGAAGT TCTCATATCG TGAGGTGAGC ATGTCAATCT TGGATTCTTT 78277 .......... .......... .......... .......... .......... .......... 473 GACTTGTTCA GTTCCTTCAT GTGCAGTCAA CAAGCAATCC CAAATTTCTT TAGCAGACTC 78217 .......... .......... .......... .......... .......... .......... 473 ACAGGCTGAC ACTCTGTTGT ACTCATCAGG TCCTATCCCA CAGACCAGAA GAGTTTTAGC 78157 .......... .......... .......... .......... .......... .......... 473 TTTGAAACCC TTTTCAATCT TTTTCCTGTC AGCATCATCA TATTTCTGCC TGGGCTTTGG 78097 .......... .......... .......... .......... .......... .......... 473 AACAGTAATG GTCTTTTCTC CATCCTTTAC TTCCATCATT GGAACAAAGG GTCCATCTAG 78037 ||||| .....TAATG .......... .......... .......... .......... .......... 478 TATAATATCC CATAACTCGC TATCTTCAGC CATGAGATAG TCGTGCATTC TAACTTTCCA 77977 .......... .......... .......... .......... .......... .......... 478 CCAACTGTAG AAATGTCCAT TGAAACGAGG AGGTCTGTGT GATGACTGAC CTTCTTCGAG 77917 .......... .......... .......... .......... .......... .......... 478 GTTAAGTGGA GCTGCTATTC TTAGAACAAA ATCACTTCCT TGGTGTTAAC CAAATAGGTA 77857 .......... .......... .......... .......... .......... .......... 478 GTGCCTGCTC TGATACCACT TGATAGAATT TATGCCTTCA CTTATTAGTA ATGGACCAGG 77797 .......... .......... .......... .......... .......... .......... 478 TCTCTTACTT TACTTCAGAA AATTACCAGA AAGTTAAATG CAGTAAAATT AACACAATGA 77737 .......... .......... .......... .......... .......... .......... 478 TTTTACGTGG AAACCTCCTT GCTTAAGGGA GTAAAACCAC GACCTATCTC ACAGGATTTT 77677 .......... .......... .......... .......... .......... .......... 478 CAATCGTTTT CACTAATCTT CAAAAGCAAA AGCGAAACAC GATTACACCA AACGTAAGAA 77617 .......... .......... .......... .......... .......... .......... 478 AGAGTTATCA ATCTTACCGT TAAGCAATAG ACCTCTATTG CTCAACAAGC CAAAGTAGAA 77557 .......... .......... .......... .......... .......... .......... 478 AAACAATCTA CCCACTAAGC TATCCCACCT GGACAACCTA GACTTTTAAC ACAACACACC 77497 .......... .......... .......... .......... .......... .......... 478 AATTCCTTTA TAGATTTAGG AGTGGTTTAC AATTTAAGAA CAAGAGAATA AATTCCTAAA 77437 .......... .......... .......... .......... .......... .......... 478 CAACTAGACG AAAAGCTCCA GATGTTGCTG TTGTCCTAGG AATGATTCTG CCTTTGCTTT 77377 .......... .......... .......... .......... .......... .......... 478 GTGTAGCCTT TGCAAGAGTT CTTGAAAAGT TTTTATCAAG TTGCAAAAAC TGAAAGACAA 77317 .......... .......... .......... .......... .......... .......... 478 ATGTTTAGGA AAGTGCCTTT TATATGGGCA AGTCACTTTC CTAAACTTCT TTGCCATTGG 77257 .......... .......... .......... .......... .......... .......... 478 TTGGAAAGGT CACACTTTTC TGACGCCATC GAGAAGTGTG CACCTACTTT CTGTACGTCT 77197 .......... .......... .......... .......... .......... .......... 478 CCAATTGGCA GTTGACTGGC TCATCATCAT GAGGGCCTGG TACCTCTACT AGGTCCCTGA 77137 .......... .......... .......... .......... .......... .......... 478 GTTTGTTTCA TCTGCAATAC TTCAAACAAG ACACCTGCAA TACTCAAGTA GAAAACCCGG 77077 .......... .......... .......... .......... .......... .......... 478 TACCTTTATG AGGTCCCTAA GTTTGTCAAA TTATCAAAAC TACAAATAAC AAATATGTAT 77017 .......... .......... .......... .......... .......... .......... 478 GATGTTTCTA TAATAGCGTG AGAAGGTGAT TAAAAATTTA GATGTTTTAA TTTGTATTAC 76957 .......... .......... .......... .......... .......... .......... 478 TTTGACTTGA CATGGTAATA TTCACAATCT ATTGTAAAAA AATATATATA TTAGAGTGAC 76897 .......... .......... .......... .......... .......... .......... 478 GCTTTCATAA TTGTGTTTTA GTTTTTGAAG AAGAACTCAT TTATTTGTAA AGCCCAAGTT 76837 .......... .......... .......... .......... .......... .......... 478 AATGATTCAG TCCAAATGAA AATGAATTTA ATTCAAATTT TGTATGAATT TGATCTAATT 76777 .......... .......... .......... .......... .......... .......... 478 ACAATGTATG TGTCTACAAT ATTAACGCTA CTCCCCTATC TTTATATATA GATGATCCAT 76717 ||| | | .......... .......... .......... .......... .......... ..TGAATCTT 486 TTGGTTCATA TGTATATGTA TATAGAAATA ACTATGAAGA GAGTATTTTC AATATTGGTG 76657 ||| | |||| | |||| | | ||| || || TTGATGCATA T-AATATCCA TTAGGAAGTA GCTC...... .......... .......... 519 GACTCAAGTA TAGTACACAA CATTTTATGA TGACGGAACT ATTTACTACT ATTAAGAAGG 76597 .......... .......... .......... .......... .......... .......... 519 GAGTGACGAT CCATGGGACT ACGATAAACT ATTGAGAAAG GGTGAATTGA TCATAATATT 76537 .......... .......... .......... .......... .......... .......... 519 ATGGTGGAAT TAGAATGTAA GTTGACAATT TTATATCCGA TTATCAGTTT ATTCATACGT 76477 .......... .......... .......... .......... .......... .......... 519 ATTTTTTGAA TCGGAATCGA AGGACTATGT GAGTTTCGTT TCCAACTCAT TAAACTGTTC 76417 .......... .......... .......... .......... .......... .......... 519 GTTTTTATGA CTTCAAAATA GAGAGATATT CACGTTTGTG CCTCTTCAAG AGCTTTGGAC 76357 .......... .......... .......... .......... .......... .......... 519 CGGCCATAAC ATTCGAAATT GGAGAAAAAA ATTAAGGAAA CTCTTATTTT ATCTGTAGAA 76297 .......... .......... .......... .......... .......... .......... 519 TAATACTTAA TTTTTTAGCC ACATTTTTTT CATCAGTCTT CTGCACTACT AATGAAGAAA 76237 |||| |||||| | | | || | .......... .......... .......... ......TCTT CTGCACAAAT ATATAATATT 543 AGG-GATTAC TTTTTGTGTG AGTTGGAGAA AGTTGAATGA ATTTACTATA A-AAAAAAAC 76179 | | | ||| || | | || || | ||| || | || || ||| | |||||||| GTTCAAATGC TTTATGAAAG TGATGAAG-C ACTTGCATCA TTTAACCATA AGAAAAAAAC 602 AAATATCCAA TTCCCATTAT TTACTTGTTA AATTACGAGT GAACAATGAA CCCAATTGAA 76119 | TCAA...... .......... .......... .......... .......... .......... 606 TAATTTAACA AAATAAATTT GAGAAAAATG TTTGAAAAAC GTTTTAATTT TGGTCGAAAT 76059 .......... .......... .......... .......... .......... .......... 606 TGATGTTACG AAATCAAATT TTATAGATGA TCTTTTACGT ATTTTAAAGG TATATATATG 75999 .......... .......... .......... .......... .......... .......... 606 TGCCCATGTG GACACATTAA TATTTATAAT GATACAATAT TTGTGATGTC CATGTGAATA 75939 .......... .......... .......... .......... .......... .......... 606 TATATATATA TATTTAAAAT ACACTATCAA ATAGAATAGG GATAAAAAAT TATTTTCAAA 75879 .......... .......... .......... .......... .......... .......... 606 ATTCAATATT ATGACAATAA TCTCAATTAA AATTTGAATA TATTTCAAAT TTTTTTTTTC 75819 .......... .......... .......... .......... .......... .......... 606 AATAAATTTT AAAAGTTGGA TCAAACGTGT TACTGTTAAT ATTAACAAAG AAAAGAGTTC 75759 .......... .......... .......... .......... .......... .......... 606 GTCGTAATAC TACTGAATTA TTGTCCTAAA AAAATGGAGA AAAATTATTG TAATGCATTA 75699 .......... .......... .......... .......... .......... .......... 606 ATTACGGACA GATATTAGAT AATGTAGACC CACCATTATA TAAATTGACT ATGTGAATTG 75639 .......... .......... .......... .......... .......... .......... 606 AAACAAAATG TCTAAAAAGG TTGTCAAATT CCAGGTATTT GATCTCATCT TATTATAAAA 75579 ||| | ||| || | | || ||||| .......... .......... .......... ....ATATGT TATCAAAT-T TGTT-TAAAA 630 AAAACTAATA ATTGTTCCAG ATATTCGTGG CATAACAAAA TAATTGATAC ACATGCAATG 75519 || ||| || AAGACTCATT .......... .......... .......... .......... .......... 640 TTTTATCTTC TAGAAAACAC TTGAAATTAT TTAATAAAAT CTTCTTAACT TGTTGAAGTG 75459 .......... .......... .......... .......... .......... .......... 640 TATTGATTAG AAGTGATATG AAAAAGAAAA AAGAACATAT GATAACACCC ATAACTCTTT 75399 .......... .......... .......... .......... .......... .......... 640 TGCACCTCAC CTATATATTT TACTAATTTC ACAATTTTAT TCTTTTAGTG TTTTTTTATT 75339 .......... .......... .......... .......... .......... .......... 640 TTTATTTCTA CTTGAGCAAA AAATGATATA AGTTTCTGAA AGAAAAAAAA ATAAAACTTT 75279 .......... .......... .......... .......... .......... .......... 640 ATTAAAGAGG TATTTTTTAT ACTATTGTTG ATTAAGAAAA ACTAGGCACG AGAATCACCG 75219 .......... .......... .......... .......... .......... .......... 640 CTTAAATATA TAATTTTTAA ATAAAGAGGT ATTATTTGCG TATAAAAATA TTTACTTTGA 75159 .......... .......... .......... .......... .......... .......... 640 TATAATTTTT AAATATTTCT TATATTTAAG TTTGAAACTT AGAATTTTGG ATGGTTCCAT 75099 .......... .......... .......... .......... .......... .......... 640 AAATATTATA GTCCACATGT GTTGGTAATT ATAATAAAAC TCAAATCAAA ATTAAATTAA 75039 .......... .......... .......... .......... .......... .......... 640 TATTGATGCA AAAAGGAAAT CTATTTAGCA TTAAGAATGA CAATAATATT AAATATTTGT 74979 .......... .......... .......... .......... .......... .......... 640 TCTTTGATTT TACGTTGATT TAGACAATTG AAATACATAA TCTAATTTTA ATTTTCTTTA 74919 .......... .......... .......... .......... .......... .......... 640 ATATTTAGTC ATGTAACTAA TGCTTATTTG TTATAAATTC ATTGAATGAG TGGATAGTCC 74859 .......... .......... .......... .......... .......... .......... 640 ATAAGGTTGG TACATGAACA ACCATTCATA TTCACTAGGT TACATGAACC TTTTTGGATA 74799 .......... .......... .......... .......... .......... .......... 640 AGAATGTATC TATTTACTAT GATACTTAAT ATGGTAACCT TTGGAGTGAT TTCTCACTCT 74739 | | || ||| | ||| || | | || || ....TATGTC -ATTCATTAT GAAAAAT-AT TTG....... .......... .......... 667 ATAAATAGGG TTGTTCATTC AGTATTGTAA TAGATACATA TGAGACTTGA ATACACTTGA 74679 .......... .......... .......... .......... .......... .......... 667 ATAAGAAGAA AATTCCTCTT CTTCTATATA CTTCTCTTAT CTTCTTCTCT TTATGATTAT 74619 .......... .......... .......... .......... .......... .......... 667 ATTCTTATGA GCTTGATTTT ATAACACGTT ATCAGCACGA GTCTCTATCT AATTGAGAGT 74559 .......... .......... .......... .......... .......... .......... 667 GGGTTCTTGT TTTTCTTTAG CTCATATTTT GGTTGATCTC GTTATGAAGA TCAAAGTATT 74499 .......... .......... .......... .......... .......... .......... 667 ATATGCATAA AATCAGGTAT GTATTTTCTA AAATATTTTA TGATTTAGAA TAAAATAGTA 74439 .......... .......... .......... .......... .......... .......... 667 TTCAATCACT AATAATTATC AGGAACCGAA GACATATACT ACTATTGGTT GAATATGACA 74379 .......... .......... .......... .......... .......... .......... 667 TGCTATACAA AACTTCTTAA ATAGAAGAAG GTATAAAATT TTAATAGCAT GAATTTGAAT 74319 .......... .......... .......... .......... .......... .......... 667 GTTATTTTGA TTGTTTTGAA TGACTCAAAG GGTGAGTCAT GTTGTACTTC GGTTTATTAT 74259 .......... .......... .......... .......... .......... .......... 667 ACCGTTGGTT AAGGGTAAGA CGATGAATTC AAGTTTCATC GCACCAAAAG GGTAAGACAT 74199 ||||||||| || | |||| ||||| ||| ||||||| || |||||||||| .......... .AGGGTAAGA CGGTAGATTC AAGTTCTATC GCACCAAGAG GGTAAGACAT 716 CAGGTTAAAG TCCGGATGCA CCATAATGAA AAATATTTGA GGATAAGACG ATAGATTCAA 74139 || || || | |||||| ||||||||| || CA-ATTCGAG TTTTGATGCA CCATAATGAT AA........ .......... .......... 747 GTTCTATCGC ACCAAAAGGG TAAGACATCA ATTTGAGTTT TGATGCACCG CGATTGGGTT 74079 | |||||||| .......... .......... .......... .......... .........G GTATTGGGTT 758 CAAGTCCCAT AGAATTATGG CGTAACACGC CTTATATGGA TAAGACATTG GGTTTGAG-- 74021 |||||||||| |||||||||| | ||| || ||| |||||| |||||||||| ||||||| CAAGTCCCAT AGAATTATGG CCTAATGTGC CTTGTATGGA TAAGACATTG AGTTTGAGTC 818 TCAATGCACC ATAGTGATGA TATAATGATG ACTAAG 73985 || ||||||| | | |||||| || |||||| ||| || TCCATGCACC AAATTGATGA TAAGATGATG ACTGAG 854 hqPGS_C09HBa0099P03.1-2-_SGN-U312404- (74247 74167,74089 73985) ******************************************************************************** EST sequence 25 +strand 844 n (File: SGN-U329457+) 1 GATAAGACGA TAGATTCAAG TTCTATCTCA CCAAAAGGGT AAGACATCAA TTAGAGTTTT 61 GATACACCGT GATTGGGTTC AAGTTCCATA GAATTATGGC GTAACACGCC TTACATGGGT 121 AAGACATTGA GTTTGATCCA ATGCATAAAA TGATGACTAA GAAAAATAAT ATATTTTTGA 181 TGAAATGTCT TTAAATTCAT CATATTGAAT TTGCTCCATT TCTTGAAAAG AATATGGTAG 241 CAACATGTGA TAACTCTGAA ATCCACCTGT TGACTTGCTC CATTCAATCA TATGAATGTG 301 GTAGCAGTGA ATGATTAGTT TGAAAGATGA CAAATAATTA ATGATATAGA ATAAGGGAGT 361 GGAGGGAAAG AATTATAATA GTCATCATTG TGGTAATGAT AAAAGGAAGA ACACTATGAG 421 TTCTTCAAAT AGTCCTTCAA AATGTGAAGA CAACTTTTAT TATGAAAATG GTATGAAAAG 481 TCATTAGACT TGTGAATGTT GTCAACCCAA TAACTTGACA AGTTTATAAA TCCTCTATCA 541 TGATATAAGA AGATAAAGTG ATGGTACACT TTATCTTTCA AAGTGATGTT GAGGTGTGTC 601 ATGGAAATAT GATGAATTTT AAAGTCATGA CAATGTGCTT ATAATGCAAA TTTATAAAGT 661 ACATATGAAT AATTAGTCCA ATCTTTGAAG AGAATGTGAC TTGTAATGAG TATCGTGAAT 721 GCTCGACCAT TCTATAAGAG AATGGGTCCA GATGTGATAA AATCATTATG GGTTGGATAT 781 ATGTCACAAC TCACCTCTAT AGAAGGTTTT GAAAAAAAAA AAAAAAAAAC TCGGACTAGT 841 TCTC Predicted gene structure (within gDNA segment 74757 to 72391): Exon 1 74157 73355 ( 803 n); cDNA 1 805 ( 805 n); score: 0.887 PPA cDNA 812 830 MATCH C09HBa0099P03.1-2- SGN-U329457+ 0.887 803 0.951 C PGS_C09HBa0099P03.1-2-_SGN-U329457+ (74157 73355) Alignment (genomic DNA sequence = upper lines): GATAAGACGA TAGATTCAAG TTCTATCGCA CCAAAAGGGT AAGACATCAA TTTGAGTTTT 74098 |||||||||| |||||||||| ||||||| || |||||||||| |||||||||| || ||||||| GATAAGACGA TAGATTCAAG TTCTATCTCA CCAAAAGGGT AAGACATCAA TTAGAGTTTT 60 GATGCACCGC GATTGGGTTC AAGTCCCATA GAATTATGGC GTAACACGCC TTATATGGAT 74038 ||| ||||| |||||||||| |||| ||||| |||||||||| |||||||||| ||| |||| | GATACACCGT GATTGGGTTC AAGTTCCATA GAATTATGGC GTAACACGCC TTACATGGGT 120 AAGACATTGG GTTTGAGTCA ATGCACCATA GTGATGA-TA TAATGATGAC TAAGGCTTTG 73979 ||||||||| |||||| || ||||| | | |||||| || | | | || |||| AAGACATTGA GTTTGATCCA ATGCA-TAAA ATGATGACTA AGAAAAATAA TATATTTTTG 179 ATGAAAAGTC TTGAAATTCG TCCTGTTGAA TTTGCTCCAT TCCTTGAAAA GAATGTGGTA 73919 |||||| ||| || |||||| || | ||||| |||||||||| | |||||||| |||| ||||| ATGAAATGTC TTTAAATTCA TCATATTGAA TTTGCTCCAT TTCTTGAAAA GAATATGGTA 239 GCAACATACG ATAATTTTAA AAT-TGCATG TTCACTTGCT CCATTCTATC GTATGAATGT 73860 ||||||| | |||| | | | ||| | || || ||||||| |||||| ||| ||||||||| GCAACATGTG ATAACTCTGA AATCCACCTG TTGACTTGCT CCATTCAATC ATATGAATGT 299 GGTAGCAGTA AATAGTTAGT CTGAAAGATG ACAAATGATT AACGATAT-G AAAAAAGGCA 73801 ||||||||| ||| ||||| ||||||||| |||||| ||| || ||||| | || || || GGTAGCAGTG AATGATTAGT TTGAAAGATG ACAAATAATT AATGATATAG AATAAGGGAG 359 TGAAGGGACG GAATTATAAT AGTCATCATT GTGGTAATTA TAAAAGAAAG AACACTATGG 73741 || ||||| |||||||||| |||||||||| |||||||| | |||||| ||| ||||||||| TGGAGGGAAA GAATTATAAT AGTCATCATT GTGGTAATGA TAAAAGGAAG AACACTATGA 419 GTTCTCCAAA TAGTCCTTCG AAATGTGAAG ACAGTTTTCA TTATCAAAAT GGTACGAAAG 73681 ||||| |||| ||||||||| |||||||||| ||| ||| | |||| ||||| |||| |||| GTTCTTCAAA TAGTCCTTCA AAATGTGAAG ACAACTTTTA TTATGAAAAT GGTATGAAAA 479 GTCATTAGAC TTGTGAATGT TGTCAACCCA ATAATTTGAC AAGTTTATAA ATCCTCTATC 73621 |||||||||| |||||||||| |||||||||| |||| ||||| |||||||||| |||||||||| GTCATTAGAC TTGTGAATGT TGTCAACCCA ATAACTTGAC AAGTTTATAA ATCCTCTATC 539 ATGATACAAG AAAATAAAGT GATGGTACAC TTGATCATTC AAAATGATGT TGAGGTGTGT 73561 |||||| ||| || ||||||| |||||||||| || ||| ||| ||| |||||| |||||||||| ATGATATAAG AAGATAAAGT GATGGTACAC TTTATCTTTC AAAGTGATGT TGAGGTGTGT 599 CATGGAAATA TGATGAATTT TAAAGTCATG ACAATGTGCT TATAATGCAA ATTGATGAAG 73501 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||| || ||| CATGGAAATA TGATGAATTT TAAAGTCATG ACAATGTGCT TATAATGCAA ATTTATAAAG 659 TACATATAAA TAATTAGTTC ATTCCTTGAA GAGAATGTGA CTTATGATGA GTATCGTGAA 73441 ||||||| || |||||||| | | || ||||| |||||||||| ||| | |||| |||||||||| TACATATGAA TAATTAGTCC AATCTTTGAA GAGAATGTGA CTTGTAATGA GTATCGTGAA 719 TGCTCGACCA TTCTATAAGA GAATGGGTCC AGATGTGAGC AAATCATTAT GGGTTGGATA 73381 |||||||||| |||||||||| |||||||||| |||||||| |||||||||| |||||||||| TGCTCGACCA TTCTATAAGA GAATGGGTCC AGATGTGATA AAATCATTAT GGGTTGGATA 779 TATGTCACAA CTCACCTCTA TAGAAG 73355 |||||||||| |||||||||| |||||| TATGTCACAA CTCACCTCTA TAGAAG 805 hqPGS_C09HBa0099P03.1-2-_SGN-U329457+ (74157 73355) ******************************************************************************** EST sequence 1 -strand 1371 n (File: SGN-U312402-) 1 TTTTTTTTTA GTTATGCATC TTTTTATTGG AGCACATACA ATTTTTAGAG CCAAAATCAT 61 CAGCTAGGCA AGGAGGCAGA GCTTTTACAT CTTGGTACCA CTTATAGTTC CTCTCTTGCA 121 TTAGATCTTT GAAGAAACTG TCTATAGTTT CTCTTGTAAT ACCTGGCATA ATCACAACAT 181 GTGCCATGCC TCTTAAGTAA CACAAGTTCC AACGATGAAT GAATTTATGG TCACAAGATC 241 GTTCAAAAAT AACAGTAATA CTAAACTCAT TCAGCATAAC ACTAATTCCT GCTTCAAGAA 301 GTCGATCTTT CAAATACCGA GCATTTTCAA TGCATGTTAT GGAATCTTGT TGCAATCTAG 361 CATGTCCTTT CTTGCTTAAA CAGTACCATA AGAATATTGG TGTAAATCCA TTTCGACTAC 421 CAGAAATTGT AGCATCTGTG GAATTAATAT ACTCAATTTT TGAGTGGGTA CTAACATAAG 481 TTTTCCTTGT TATCTGAATG CCACAAGGCA TTGGACATCC CAAGAATTTG TGCCCTGAAA 541 TTGAAATACT ACCAATTGGC TTCTTGAAGG TAATTTTTTT TGCCTATTGT CAAATACCAT 601 AAAAAGAAAC TATGTCATTG TAACTTTGAT CAATAAACGT TGAAGGAAAA TTGAATAAAA 661 ATAAGCTTAC ATGTTTGATA AATGGGAGAA TTAGCCCACA TAATGCTGCA TCGCAATGGA 721 TATAATAATT GTCATTTGAA TAACCACAAT TTTCAAGTGT TTGTATGACG AAATCGAGGT 781 CATCAATAGC TCCTTTGAAG GTTGTTCCTA TAAACACATA TATACTAATT TACCAAAAGT 841 GATGTTATAA GATGTAAGAA TATATATGTA TGTATTTTTA CCAATATTGA TATTGATGAT 901 AGCTGGTTTG TTCTTGTTGA CAAGTAACTT TGATTGTAAA TCTTCATAAT CAATTTCCCC 961 ATTAACTAAA GTGTTGATAG TTTGTAGCTC CATTCGATAC ATTCTTGCTG CTTTGAAAAT 1021 CGAGTAATGT GAATCTTTTG ATGCATATAA TATCCATTAG GAAGTAGCTC TCTTCTGCAC 1081 AAATATATAA TATTGTTCAA ATGCTTTATG AAAGTGATGA AGCACTTGCA TCATTTAACC 1141 ATAAGAAAAA AACTCAAATA TGTTATCAAA TTTGTTTAAA AAAGACTCAT TTATGTCATT 1201 CATTATGAAA AATATTTGAG GGTAAGACGG TAGATTCAAG TTCTATCGCA CCAAGAGGGT 1261 AAGACATCAA TTCGAGTTTT GATGCACCAT AATGATAAGG TATTGGGTTC AAGTCCCATA 1321 GAATTATGGC CTAATGTGCC TTGTATGGAT AAGACATTGA GTTTGAGTCT C Predicted gene structure (within gDNA segment 80423 to 73309): Exon 1 80136 80114 ( 23 n); cDNA 992 1012 ( 21 n); score: 0.652 Intron 1 80113 78532 (1582 n); Pd: 0.465 (s: 0), Pa: 0.470 (s: 0) Exon 2 78531 78519 ( 13 n); cDNA 1013 1024 ( 12 n); score: 0.615 Intron 2 78518 78092 ( 427 n); Pd: 0.380 (s: 0), Pa: 0.094 (s: 0) Exon 3 78091 78087 ( 5 n); cDNA 1025 1029 ( 5 n); score: 1.000 Intron 3 78086 76725 (1362 n); Pd: 0.535 (s: 0), Pa: 0.000 (s: 0.64) Exon 4 76724 76683 ( 42 n); cDNA 1030 1070 ( 41 n); score: 0.643 Intron 4 76682 76261 ( 422 n); Pd: 0.000 (s: 0.64), Pa: 0.051 (s: 0.52) Exon 5 76260 76175 ( 86 n); cDNA 1071 1157 ( 87 n); score: 0.570 Intron 5 76174 75605 ( 570 n); Pd: 0.000 (s: 0.60), Pa: 0.646 (s: 0.63) Exon 6 75604 75559 ( 46 n); cDNA 1158 1200 ( 43 n); score: 0.630 Intron 6 75558 74178 (1381 n); Pd: 0.000 (s: 0.63), Pa: 0.000 (s: 0.94) Exon 7 74177 74090 ( 88 n); cDNA 1201 1288 ( 88 n); score: 0.943 MATCH C09HBa0099P03.1-2- SGN-U312402- 0.759 303 0.221 C PGS_C09HBa0099P03.1-2-_SGN-U312402- (80136 80114,78531 78519,78091 78087,76724 76683,76260 76175,75604 75559,74177 74090) Alignment (genomic DNA sequence = upper lines): ATTTGTGACA CACTTAGCAA GCTGTACTTT AACCCATCCA CATGGTAAAC ATTGTCAATT 80077 ||| | ||| ||| || ||| ATTCGATACA TTCTT-GC-T GCT....... .......... .......... .......... 1012 GAATCTTCAA GAGATCTTCC TACTTTGCCA ACTCCCAAAA TGTACCCCTT CTTGCCATCA 80017 .......... .......... .......... .......... .......... .......... 1012 CCAAATGAGA CACCTCCTCC TTGGAGGGTC TTGAGTGAGA GGAAGTTCTT TACATCACCA 79957 .......... .......... .......... .......... .......... .......... 1012 GTCATATGTT TAGAGCAGCC ACTATCCATA TACCAACATT GTCTGCTGCT CCTCTCACTC 79897 .......... .......... .......... .......... .......... .......... 1012 ACCTGCATCA AAAATCACTT GTTAAGCTTG GGAACTCATT TCAGCTTGAG TTCCCAGTAG 79837 .......... .......... .......... .......... .......... .......... 1012 GCAAACAAAG GAGTGATCAG ATTGTACTTG GTCCAATATG GTAGACTTTG AATCTTTCTA 79777 .......... .......... .......... .......... .......... .......... 1012 ACAAAGGACC TAGGAGCAGG AACAGATTTT TTCTTTGAAA ATCTGTGAGT TGAGACAGGA 79717 .......... .......... .......... .......... .......... .......... 1012 TCTGTAGGAC CAGGTCTCTC ATTTGGTACA TTTTTTCTTT CAGCATACTT AAGAGTATCT 79657 .......... .......... .......... .......... .......... .......... 1012 CTCACATGAG TTTCTCCAGC TAACACACTC ATTCTTCAGA TGTCCATTCT TACCACAATG 79597 .......... .......... .......... .......... .......... .......... 1012 CAGACACAAT AGATTGTCAG ACACAAACAC ATACTTACTG TGAGGATTAT AAAGAGGACT 79537 .......... .......... .......... .......... .......... .......... 1012 TATGTTCAGA CTTCTTAGTC CTTTCTTATT GAAATTACTC TGATTTGTCA CATTTGACAG 79477 .......... .......... .......... .......... .......... .......... 1012 CAACTTTGAG GATTTGATCC ACTTAAGAGA ATTTTCAAGC TCTTTCTTAA ATTTCACAAC 79417 .......... .......... .......... .......... .......... .......... 1012 ATCCTGTTCT AATTTGTTAC TCTTTTCAAG TGACAAACTA AGATTTTTCT CAGAGTTTTT 79357 .......... .......... .......... .......... .......... .......... 1012 CAATTTTTCT TGAATTTCAG CTTGCAGACT ATTTGGCTTT CCATTCAGAT TTTCAGCTTC 79297 .......... .......... .......... .......... .......... .......... 1012 TTCAGTAATC TGACACAACT GGTTCTTAAG TTCGGAGTTT TTTAACTCTA GAGACACCAT 79237 .......... .......... .......... .......... .......... .......... 1012 TCTTGACATT GTGTCTTCAA ATTGACCTTT GTTTTCAGTT AAAATTTCAA GTTCAGCAAT 79177 .......... .......... .......... .......... .......... .......... 1012 TATGGTATCT TTTTCAGATG TTAACTCTAT CACAGAATCT AACATGACTT TTGACAAGGT 79117 .......... .......... .......... .......... .......... .......... 1012 TCTCAATTTT TTAAGAGAAT ACTTATCCAA GTCAATTTTC ATGTCAAGAA GAGTTACCTG 79057 .......... .......... .......... .......... .......... .......... 1012 ATTGTCTTCT TCTTCATTTT CTGTGTGTGC CATAAGAGCA AACATTTCAT TGAAGACAGT 78997 .......... .......... .......... .......... .......... .......... 1012 TTCCTCCTCA TGCACAGCAA CCATTGACAC ATCTTTTGGC TCATCAGGAT CTTCTGAGTC 78937 .......... .......... .......... .......... .......... .......... 1012 ACTAGAAGAA TCCCCTCATG CAGCAAGAGC CCTTTTGACA ACCATATCAG CAGCAACTTT 78877 .......... .......... .......... .......... .......... .......... 1012 ACGATCTCTG TTACCAAGTA CCAGGTCCCT TCTGTTTTCT TTGTCACCCC TGTGTTTTTG 78817 .......... .......... .......... .......... .......... .......... 1012 ATGTTCCTTG TTTTCATTCT TGAGTAAAGG ACACTCTCTG ATGAAGTGCC CAGACTTTCC 78757 .......... .......... .......... .......... .......... .......... 1012 ACACTTGTAG CAAGTATCCC CTTGAGCAGC ATTGCGAGTC CCATTTGTTC CTCTTTTATA 78697 .......... .......... .......... .......... .......... .......... 1012 CATTTTGTTT TTTCTCACAA TCTTTTGAAA TCTACTGATG AGATAAGCCA TATCATCATC 78637 .......... .......... .......... .......... .......... .......... 1012 ATCACTTGAA TCTTCATCTG ATTTATACTT CAACATCAAT GACTTGTCCT TCTTGGCTTC 78577 .......... .......... .......... .......... .......... .......... 1012 CTTTTTTGAC AAATCATAGT TTCGATTCAT CTCATGTGTT TTAAGATTAC CAATCAAGGC 78517 | | |||| || .......... .......... .......... .......... .....TTGAA -AATCGAG.. 1024 ATCCATGGTC AGCACCTTCA AGTCCTTGGC TTCTGTAATG GCATCAACTT TGCTCTCCCA 78457 .......... .......... .......... .......... .......... .......... 1024 GGACTTTGGA AGAATTCAAA GCACTTTCCT GACTTGTTTG GTCATGCTTA TAGGTTCACC 78397 .......... .......... .......... .......... .......... .......... 1024 CAAACTTCGT AGCTCTTTTG TAATGGAAGA CAACTTGGTG AACATGTCAT GTATAGTTTC 78337 .......... .......... .......... .......... .......... .......... 1024 TCCTTCCTTC ATTTTGAAGT TCTCATATCG TGAGGTGAGC ATGTCAATCT TGGATTCTTT 78277 .......... .......... .......... .......... .......... .......... 1024 GACTTGTTCA GTTCCTTCAT GTGCAGTCAA CAAGCAATCC CAAATTTCTT TAGCAGACTC 78217 .......... .......... .......... .......... .......... .......... 1024 ACAGGCTGAC ACTCTGTTGT ACTCATCAGG TCCTATCCCA CAGACCAGAA GAGTTTTAGC 78157 .......... .......... .......... .......... .......... .......... 1024 TTTGAAACCC TTTTCAATCT TTTTCCTGTC AGCATCATCA TATTTCTGCC TGGGCTTTGG 78097 .......... .......... .......... .......... .......... .......... 1024 AACAGTAATG GTCTTTTCTC CATCCTTTAC TTCCATCATT GGAACAAAGG GTCCATCTAG 78037 ||||| .....TAATG .......... .......... .......... .......... .......... 1029 TATAATATCC CATAACTCGC TATCTTCAGC CATGAGATAG TCGTGCATTC TAACTTTCCA 77977 .......... .......... .......... .......... .......... .......... 1029 CCAACTGTAG AAATGTCCAT TGAAACGAGG AGGTCTGTGT GATGACTGAC CTTCTTCGAG 77917 .......... .......... .......... .......... .......... .......... 1029 GTTAAGTGGA GCTGCTATTC TTAGAACAAA ATCACTTCCT TGGTGTTAAC CAAATAGGTA 77857 .......... .......... .......... .......... .......... .......... 1029 GTGCCTGCTC TGATACCACT TGATAGAATT TATGCCTTCA CTTATTAGTA ATGGACCAGG 77797 .......... .......... .......... .......... .......... .......... 1029 TCTCTTACTT TACTTCAGAA AATTACCAGA AAGTTAAATG CAGTAAAATT AACACAATGA 77737 .......... .......... .......... .......... .......... .......... 1029 TTTTACGTGG AAACCTCCTT GCTTAAGGGA GTAAAACCAC GACCTATCTC ACAGGATTTT 77677 .......... .......... .......... .......... .......... .......... 1029 CAATCGTTTT CACTAATCTT CAAAAGCAAA AGCGAAACAC GATTACACCA AACGTAAGAA 77617 .......... .......... .......... .......... .......... .......... 1029 AGAGTTATCA ATCTTACCGT TAAGCAATAG ACCTCTATTG CTCAACAAGC CAAAGTAGAA 77557 .......... .......... .......... .......... .......... .......... 1029 AAACAATCTA CCCACTAAGC TATCCCACCT GGACAACCTA GACTTTTAAC ACAACACACC 77497 .......... .......... .......... .......... .......... .......... 1029 AATTCCTTTA TAGATTTAGG AGTGGTTTAC AATTTAAGAA CAAGAGAATA AATTCCTAAA 77437 .......... .......... .......... .......... .......... .......... 1029 CAACTAGACG AAAAGCTCCA GATGTTGCTG TTGTCCTAGG AATGATTCTG CCTTTGCTTT 77377 .......... .......... .......... .......... .......... .......... 1029 GTGTAGCCTT TGCAAGAGTT CTTGAAAAGT TTTTATCAAG TTGCAAAAAC TGAAAGACAA 77317 .......... .......... .......... .......... .......... .......... 1029 ATGTTTAGGA AAGTGCCTTT TATATGGGCA AGTCACTTTC CTAAACTTCT TTGCCATTGG 77257 .......... .......... .......... .......... .......... .......... 1029 TTGGAAAGGT CACACTTTTC TGACGCCATC GAGAAGTGTG CACCTACTTT CTGTACGTCT 77197 .......... .......... .......... .......... .......... .......... 1029 CCAATTGGCA GTTGACTGGC TCATCATCAT GAGGGCCTGG TACCTCTACT AGGTCCCTGA 77137 .......... .......... .......... .......... .......... .......... 1029 GTTTGTTTCA TCTGCAATAC TTCAAACAAG ACACCTGCAA TACTCAAGTA GAAAACCCGG 77077 .......... .......... .......... .......... .......... .......... 1029 TACCTTTATG AGGTCCCTAA GTTTGTCAAA TTATCAAAAC TACAAATAAC AAATATGTAT 77017 .......... .......... .......... .......... .......... .......... 1029 GATGTTTCTA TAATAGCGTG AGAAGGTGAT TAAAAATTTA GATGTTTTAA TTTGTATTAC 76957 .......... .......... .......... .......... .......... .......... 1029 TTTGACTTGA CATGGTAATA TTCACAATCT ATTGTAAAAA AATATATATA TTAGAGTGAC 76897 .......... .......... .......... .......... .......... .......... 1029 GCTTTCATAA TTGTGTTTTA GTTTTTGAAG AAGAACTCAT TTATTTGTAA AGCCCAAGTT 76837 .......... .......... .......... .......... .......... .......... 1029 AATGATTCAG TCCAAATGAA AATGAATTTA ATTCAAATTT TGTATGAATT TGATCTAATT 76777 .......... .......... .......... .......... .......... .......... 1029 ACAATGTATG TGTCTACAAT ATTAACGCTA CTCCCCTATC TTTATATATA GATGATCCAT 76717 ||| | | .......... .......... .......... .......... .......... ..TGAATCTT 1037 TTGGTTCATA TGTATATGTA TATAGAAATA ACTATGAAGA GAGTATTTTC AATATTGGTG 76657 ||| | |||| | |||| | | ||| || || TTGATGCATA T-AATATCCA TTAGGAAGTA GCTC...... .......... .......... 1070 GACTCAAGTA TAGTACACAA CATTTTATGA TGACGGAACT ATTTACTACT ATTAAGAAGG 76597 .......... .......... .......... .......... .......... .......... 1070 GAGTGACGAT CCATGGGACT ACGATAAACT ATTGAGAAAG GGTGAATTGA TCATAATATT 76537 .......... .......... .......... .......... .......... .......... 1070 ATGGTGGAAT TAGAATGTAA GTTGACAATT TTATATCCGA TTATCAGTTT ATTCATACGT 76477 .......... .......... .......... .......... .......... .......... 1070 ATTTTTTGAA TCGGAATCGA AGGACTATGT GAGTTTCGTT TCCAACTCAT TAAACTGTTC 76417 .......... .......... .......... .......... .......... .......... 1070 GTTTTTATGA CTTCAAAATA GAGAGATATT CACGTTTGTG CCTCTTCAAG AGCTTTGGAC 76357 .......... .......... .......... .......... .......... .......... 1070 CGGCCATAAC ATTCGAAATT GGAGAAAAAA ATTAAGGAAA CTCTTATTTT ATCTGTAGAA 76297 .......... .......... .......... .......... .......... .......... 1070 TAATACTTAA TTTTTTAGCC ACATTTTTTT CATCAGTCTT CTGCACTACT AATGAAGAAA 76237 |||| |||||| | | | || | .......... .......... .......... ......TCTT CTGCACAAAT ATATAATATT 1094 AGG-GATTAC TTTTTGTGTG AGTTGGAGAA AGTTGAATGA ATTTACTATA A-AAAAAAAC 76179 | | | ||| || | | || || | ||| || | || || ||| | |||||||| GTTCAAATGC TTTATGAAAG TGATGAAG-C ACTTGCATCA TTTAACCATA AGAAAAAAAC 1153 AAATATCCAA TTCCCATTAT TTACTTGTTA AATTACGAGT GAACAATGAA CCCAATTGAA 76119 | TCAA...... .......... .......... .......... .......... .......... 1157 TAATTTAACA AAATAAATTT GAGAAAAATG TTTGAAAAAC GTTTTAATTT TGGTCGAAAT 76059 .......... .......... .......... .......... .......... .......... 1157 TGATGTTACG AAATCAAATT TTATAGATGA TCTTTTACGT ATTTTAAAGG TATATATATG 75999 .......... .......... .......... .......... .......... .......... 1157 TGCCCATGTG GACACATTAA TATTTATAAT GATACAATAT TTGTGATGTC CATGTGAATA 75939 .......... .......... .......... .......... .......... .......... 1157 TATATATATA TATTTAAAAT ACACTATCAA ATAGAATAGG GATAAAAAAT TATTTTCAAA 75879 .......... .......... .......... .......... .......... .......... 1157 ATTCAATATT ATGACAATAA TCTCAATTAA AATTTGAATA TATTTCAAAT TTTTTTTTTC 75819 .......... .......... .......... .......... .......... .......... 1157 AATAAATTTT AAAAGTTGGA TCAAACGTGT TACTGTTAAT ATTAACAAAG AAAAGAGTTC 75759 .......... .......... .......... .......... .......... .......... 1157 GTCGTAATAC TACTGAATTA TTGTCCTAAA AAAATGGAGA AAAATTATTG TAATGCATTA 75699 .......... .......... .......... .......... .......... .......... 1157 ATTACGGACA GATATTAGAT AATGTAGACC CACCATTATA TAAATTGACT ATGTGAATTG 75639 .......... .......... .......... .......... .......... .......... 1157 AAACAAAATG TCTAAAAAGG TTGTCAAATT CCAGGTATTT GATCTCATCT TATTATAAAA 75579 ||| | ||| || | | || ||||| .......... .......... .......... ....ATATGT TATCAAAT-T TGTT-TAAAA 1181 AAAACTAATA ATTGTTCCAG ATATTCGTGG CATAACAAAA TAATTGATAC ACATGCAATG 75519 || ||| || ||| | AAGACTCATT TATGT-CATT .......... .......... .......... .......... 1200 TTTTATCTTC TAGAAAACAC TTGAAATTAT TTAATAAAAT CTTCTTAACT TGTTGAAGTG 75459 .......... .......... .......... .......... .......... .......... 1200 TATTGATTAG AAGTGATATG AAAAAGAAAA AAGAACATAT GATAACACCC ATAACTCTTT 75399 .......... .......... .......... .......... .......... .......... 1200 TGCACCTCAC CTATATATTT TACTAATTTC ACAATTTTAT TCTTTTAGTG TTTTTTTATT 75339 .......... .......... .......... .......... .......... .......... 1200 TTTATTTCTA CTTGAGCAAA AAATGATATA AGTTTCTGAA AGAAAAAAAA ATAAAACTTT 75279 .......... .......... .......... .......... .......... .......... 1200 ATTAAAGAGG TATTTTTTAT ACTATTGTTG ATTAAGAAAA ACTAGGCACG AGAATCACCG 75219 .......... .......... .......... .......... .......... .......... 1200 CTTAAATATA TAATTTTTAA ATAAAGAGGT ATTATTTGCG TATAAAAATA TTTACTTTGA 75159 .......... .......... .......... .......... .......... .......... 1200 TATAATTTTT AAATATTTCT TATATTTAAG TTTGAAACTT AGAATTTTGG ATGGTTCCAT 75099 .......... .......... .......... .......... .......... .......... 1200 AAATATTATA GTCCACATGT GTTGGTAATT ATAATAAAAC TCAAATCAAA ATTAAATTAA 75039 .......... .......... .......... .......... .......... .......... 1200 TATTGATGCA AAAAGGAAAT CTATTTAGCA TTAAGAATGA CAATAATATT AAATATTTGT 74979 .......... .......... .......... .......... .......... .......... 1200 TCTTTGATTT TACGTTGATT TAGACAATTG AAATACATAA TCTAATTTTA ATTTTCTTTA 74919 .......... .......... .......... .......... .......... .......... 1200 ATATTTAGTC ATGTAACTAA TGCTTATTTG TTATAAATTC ATTGAATGAG TGGATAGTCC 74859 .......... .......... .......... .......... .......... .......... 1200 ATAAGGTTGG TACATGAACA ACCATTCATA TTCACTAGGT TACATGAACC TTTTTGGATA 74799 .......... .......... .......... .......... .......... .......... 1200 AGAATGTATC TATTTACTAT GATACTTAAT ATGGTAACCT TTGGAGTGAT TTCTCACTCT 74739 .......... .......... .......... .......... .......... .......... 1200 ATAAATAGGG TTGTTCATTC AGTATTGTAA TAGATACATA TGAGACTTGA ATACACTTGA 74679 .......... .......... .......... .......... .......... .......... 1200 ATAAGAAGAA AATTCCTCTT CTTCTATATA CTTCTCTTAT CTTCTTCTCT TTATGATTAT 74619 .......... .......... .......... .......... .......... .......... 1200 ATTCTTATGA GCTTGATTTT ATAACACGTT ATCAGCACGA GTCTCTATCT AATTGAGAGT 74559 .......... .......... .......... .......... .......... .......... 1200 GGGTTCTTGT TTTTCTTTAG CTCATATTTT GGTTGATCTC GTTATGAAGA TCAAAGTATT 74499 .......... .......... .......... .......... .......... .......... 1200 ATATGCATAA AATCAGGTAT GTATTTTCTA AAATATTTTA TGATTTAGAA TAAAATAGTA 74439 .......... .......... .......... .......... .......... .......... 1200 TTCAATCACT AATAATTATC AGGAACCGAA GACATATACT ACTATTGGTT GAATATGACA 74379 .......... .......... .......... .......... .......... .......... 1200 TGCTATACAA AACTTCTTAA ATAGAAGAAG GTATAAAATT TTAATAGCAT GAATTTGAAT 74319 .......... .......... .......... .......... .......... .......... 1200 GTTATTTTGA TTGTTTTGAA TGACTCAAAG GGTGAGTCAT GTTGTACTTC GGTTTATTAT 74259 .......... .......... .......... .......... .......... .......... 1200 ACCGTTGGTT AAGGGTAAGA CGATGAATTC AAGTTTCATC GCACCAAAAG GGTAAGACAT 74199 .......... .......... .......... .......... .......... .......... 1200 CAGGTTAAAG TCCGGATGCA CCATAATGAA AAATATTTGA GGATAAGACG ATAGATTCAA 74139 ||| ||||| |||||||||| || ||||||| ||||||||| .......... .......... .CATTATGAA AAATATTTGA GGGTAAGACG GTAGATTCAA 1239 GTTCTATCGC ACCAAAAGGG TAAGACATCA ATTTGAGTTT TGATGCACC 74090 |||||||||| ||||| |||| |||||||||| ||| |||||| ||||||||| GTTCTATCGC ACCAAGAGGG TAAGACATCA ATTCGAGTTT TGATGCACC 1288 hqPGS_C09HBa0099P03.1-2-_SGN-U312402- (74177 74090) ******************************************************************************** EST sequence 26 +strand 430 n (File: SGN-U329525+) 1 CTAGTCTCGA GTTTTTTTTT TTTTTTTAGA ATTTTAGTAA GTTCTTTTAT AACACGTTAT 61 CAGCACGAGT CTCTAGCTAA TTGAGAGTGA GTTTTAGTTT TTTCTTTAAC TCATGTTTTG 121 GTTGATCTCG ATATGAGAAT CAAGGTATTA TTTGCATAAG ATCAGGTATG TATTTTCTAA 181 AATATTTTAT GATTTAGAAT AAAATAGTAT TCAGTCACTA GCAATTATCA GGAACCGAAG 241 ACATATACTA CTATTGGTTG AATATGACAT GCTATATAAA ACTTATTAAA TGGAAGAAGG 301 TATAAAATTT TAATAGCATG AGTTTGAATG TTATTTTGAT TGTTTTGAAA GACTCAAAGG 361 GTGAGTCGTG TTGTACTTCG GTCTATTATA CCGTTGGTCA AGGGTAAGAC GATTGATTCA 421 AGTTTCATCG Predicted gene structure (within gDNA segment 75642 to 73041): Exon 1 74602 74218 ( 385 n); cDNA 45 430 ( 386 n); score: 0.929 PPA cDNA 27 11 MATCH C09HBa0099P03.1-2- SGN-U329525+ 0.929 385 0.895 C PGS_C09HBa0099P03.1-2-_SGN-U329525+ (74602 74218) Alignment (genomic DNA sequence = upper lines): TTTTATAACA CGTTATCAGC ACGAGTCTCT ATCTAATTGA GAGTGGGTTC TTG-TTTTTC 74544 |||||||||| |||||||||| |||||||||| | |||||||| ||||| ||| | | |||||| TTTTATAACA CGTTATCAGC ACGAGTCTCT AGCTAATTGA GAGTGAGTTT TAGTTTTTTC 104 TTTAGCTCAT ATTTTGGTTG ATCTCGTTAT GAAGATCAAA GTATTATATG CATAAAATCA 74484 |||| ||||| ||||||||| |||||| ||| || ||||| ||||||| || ||||| |||| TTTAACTCAT GTTTTGGTTG ATCTCGATAT GAGAATCAAG GTATTATTTG CATAAGATCA 164 GGTATGTATT TTCTAAAATA TTTTATGATT TAGAATAAAA TAGTATTCAA TCACTAATAA 74424 |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| |||||| || GGTATGTATT TTCTAAAATA TTTTATGATT TAGAATAAAA TAGTATTCAG TCACTAGCAA 224 TTATCAGGAA CCGAAGACAT ATACTACTAT TGGTTGAATA TGACATGCTA TACAAAACTT 74364 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| || ||||||| TTATCAGGAA CCGAAGACAT ATACTACTAT TGGTTGAATA TGACATGCTA TATAAAACTT 284 CTTAAATAGA AGAAGGTATA AAATTTTAAT AGCATGAATT TGAATGTTAT TTTGATTGTT 74304 |||||| || |||||||||| |||||||||| ||||||| || |||||||||| |||||||||| ATTAAATGGA AGAAGGTATA AAATTTTAAT AGCATGAGTT TGAATGTTAT TTTGATTGTT 344 TTGAATGACT CAAAGGGTGA GTCATGTTGT ACTTCGGTTT ATTATACCGT TGGTTAAGGG 74244 ||||| |||| |||||||||| ||| |||||| |||||||| | |||||||||| |||| ||||| TTGAAAGACT CAAAGGGTGA GTCGTGTTGT ACTTCGGTCT ATTATACCGT TGGTCAAGGG 404 TAAGACGATG AATTCAAGTT TCATCG 74218 ||||||||| ||||||||| |||||| TAAGACGATT GATTCAAGTT TCATCG 430 hqPGS_C09HBa0099P03.1-2-_SGN-U329525+ (74602 74218) ******************************************************************************** EST sequence 27 +strand 805 n (File: SGN-U329700+) 1 TTCTAATCAT TTCATATATA TTCAACTTTT CTTCTACTTA AGTATTTCGA ATTAATAGCG 61 TCGATTCAAA ATTATAGTTA AGGCATTCTA AATAATATTG AGTTTCCAAA TAATATATAT 121 ATATATAAAC TTTTGAACAA AAGCTATATA TTCACTTGAA CCCGACAATA GGCTAGAGCT 181 TAAATTCATT ATTGGGAGGA AATAGATATC ATCTTAATCT ATATTATTCT AAGCCCTTGA 241 CCTATATATA CGAGATTCTA TATTTTTCTT ATAGTCCACA TTAGGATCGT ACAAAATCGA 301 ACCTGAAATT AAATCAAAAC TCCAATTGAG TCAAATTGGA AAAAGAATCC GACTAGTGTT 361 TGGTTTGACT TGGTTTGATG TTGAAAAAAG AACCCGATTA TATTTGGGTT GGGTTGGTTT 421 TAACTTTAAA AGAAATAACC CGAGATCAAA CCAACCCGAC ATTATATATA ATTTTAAAAT 481 TTTATATTAT ACATAAAATA TTTACTTTGA TATAATTTTT AAATAGTTCT TATATTTTTC 541 ATCTTTTAAT ATATTATTTC AAGTTTGAAA TATAGTCCAC AAATGTTGGT AATTATAATA 601 AAGTTTAAAT CAAAATCAAA TTAATAATAA AGCAAAAAGA AAATCAATTC AACACTAATA 661 ATGACAATAA TCTTAAATGT TTGTTCTTTA GCTTTACATT GGTCTAGACA ATTAAAATAG 721 ANCATTAATA CATACTCTAA TTTTAATTTT CTTTAATATT TAGTCATGGT ACTGATACTT 781 ATAAAACTTA TTTAGCATTA TTTAG Predicted gene structure (within gDNA segment 80423 to 73926): Exon 1 75913 75863 ( 51 n); cDNA 445 493 ( 49 n); score: 0.686 Intron 1 75862 75176 ( 687 n); Pd: 0.000 (s: 0.68), Pa: 0.000 (s: 0.88) Exon 2 75175 74938 ( 238 n); cDNA 494 725 ( 232 n); score: 0.779 MATCH C09HBa0099P03.1-2- SGN-U329700+ 0.763 289 0.359 C PGS_C09HBa0099P03.1-2-_SGN-U329700+ (75913 75863,75175 74938) Alignment (genomic DNA sequence = upper lines): ATCAAATAGA ATAGGGATAA AAAATTATTT TCAAAATTCA ATATTATGAC AATAATCTCA 75854 |||||| | | || | | || |||| | |||||| ||||||| || | ATCAAACCAA CCCGACATTA TATATAATTT T-AAAATTTT ATATTAT-AC A......... 493 ATTAAAATTT GAATATATTT CAAATTTTTT TTTTCAATAA ATTTTAAAAG TTGGATCAAA 75794 .......... .......... .......... .......... .......... .......... 493 CGTGTTACTG TTAATATTAA CAAAGAAAAG AGTTCGTCGT AATACTACTG AATTATTGTC 75734 .......... .......... .......... .......... .......... .......... 493 CTAAAAAAAT GGAGAAAAAT TATTGTAATG CATTAATTAC GGACAGATAT TAGATAATGT 75674 .......... .......... .......... .......... .......... .......... 493 AGACCCACCA TTATATAAAT TGACTATGTG AATTGAAACA AAATGTCTAA AAAGGTTGTC 75614 .......... .......... .......... .......... .......... .......... 493 AAATTCCAGG TATTTGATCT CATCTTATTA TAAAAAAAAC TAATAATTGT TCCAGATATT 75554 .......... .......... .......... .......... .......... .......... 493 CGTGGCATAA CAAAATAATT GATACACATG CAATGTTTTA TCTTCTAGAA AACACTTGAA 75494 .......... .......... .......... .......... .......... .......... 493 ATTATTTAAT AAAATCTTCT TAACTTGTTG AAGTGTATTG ATTAGAAGTG ATATGAAAAA 75434 .......... .......... .......... .......... .......... .......... 493 GAAAAAAGAA CATATGATAA CACCCATAAC TCTTTTGCAC CTCACCTATA TATTTTACTA 75374 .......... .......... .......... .......... .......... .......... 493 ATTTCACAAT TTTATTCTTT TAGTGTTTTT TTATTTTTAT TTCTACTTGA GCAAAAAATG 75314 .......... .......... .......... .......... .......... .......... 493 ATATAAGTTT CTGAAAGAAA AAAAAATAAA ACTTTATTAA AGAGGTATTT TTTATACTAT 75254 .......... .......... .......... .......... .......... .......... 493 TGTTGATTAA GAAAAACTAG GCACGAGAAT CACCGCTTAA ATATATAATT TTTAAATAAA 75194 .......... .......... .......... .......... .......... .......... 493 GAGGTATTAT TTGCGTATAA AAATATTTAC TTTGATATAA TTTTTAAATA TTTCTTATAT 75134 | |||||||||| |||||||||| |||||||||| |||||||| .......... ........TA AAATATTTAC TTTGATATAA TTTTTAAATA GTTCTTATA- 534 TTAAGTTTGA AACTTAGAAT TTTGGATGGT TCCA-TAAAT ATTATAGTCC ACATGTGTTG 75075 || ||| | ||| ||| | || | || | | | |||||||| ||| ||||| TT---TTT-C ATCTTTTAAT ATATTAT--T TCAAGTTTGA AATATAGTCC ACAAATGTTG 588 GTAATTATAA TAAAACTCAA ATCAAAATTA AATTAATATT GATGCAAAAA GGAAATCTAT 75015 |||||||||| |||| | || |||||||| | |||||||| | | ||||||| | ||||| || GTAATTATAA TAAAGTTTAA ATCAAAATCA AATTAATAAT AAAGCAAAAA GAAAATCAAT 648 TTAGCATTAA GAATGACAAT AATATTAAAT ATTTGTTCTT TGATTTTACG TTGATTTAGA 74955 | | || ||| ||||||||| ||| |||||| ||||||||| | ||||| ||| | |||| TCAACACTAA TAATGACAAT AATCTTAAAT GTTTGTTCTT TAGCTTTACA TTGGTCTAGA 708 CAATTGAAAT ACATAAT 74938 ||||| |||| | | || CAATTAAAAT AGANCAT 725 hqPGS_C09HBa0099P03.1-2-_SGN-U329700+ (75175 74938) ******************************************************************************** EST sequence 29 -strand 925 n (File: SGN-U339745-) 1 CCAAACCTAG GCCCTTTGGA GAAAATAAAG GCTATTTTTG TTTAAACTAC ACCTGGTCTA 61 TCCACCATAA TTGAGAGGAC TTAAAGGTAA TGTGATTTTT TCTTGACACG ATGTCATAGA 121 TCGTCACTCA ATTTAATTAT AGACTCTGTT CTCACAAAGT TAACTCAATT TTGATTTTTA 181 CCTCAAAAGT CACTCAATTA TGAATTTTTT CTCAGGAAAT CACTCTAACT ATACTCTTCA 241 CTCGATTTTA TTTTAACTCA AATGTTACTT TAACTGTAAA TGTTATTTGT AGTTTTGATG 301 ATTTGACAAA CAATGGGACC TCATAAAGGT ATCAGGTTCT CTAATTGTGT ATTGCAGGTG 361 TCTTGTTTGA GGTATTGCAG AAGAAACAAA ACCAGGGACC TAGTAGAGGT ACCAGGCTCT 421 CATGAAAATG AGTCAGTCGA CTGCCAGCTG GAGATGGTAC AGAAAGTAGG TGCACACTTC 481 CCGATGGCGT CAGAAAAGTG TGACCTTTTC AGCAAATGGC AAAGGAGTTT AGGAAAGTGA 541 CTTGCCCACA TAAAAGGCAC TTTCCAAAAC ATTTGTTTTT TAGTTTTTGC AACTTGATAA 601 AAACTTTTCA AGAACACTTG CAAAGGCTAC ACAAAAGGTA GAATCATTCC AAGAACAACA 661 GCAACATCTT GAGCTTGTCG TATAGTTGTT TAGGAATTTA TTCTCTTGTT GTTAAATTGT 721 AAACCACTGC TAAATCTATA AAGGAATTGG TGTGTTGTGT AAAAGTCTAG GTTGTCCAAG 781 TGGGATAGCT TAGTGGGTAG ATTGTTTTTC TACTTAGGCT TGTTGAGCAA TAGAGGACTA 841 TTGTTTAACG GTAAGATTGA TAACTCTTTC TTACGTTTGG TGAAAAAAAA AAAAAAAAAA 901 ACTCGAGGGG GGCCCGGTCC CAATC Predicted gene structure (within gDNA segment 73616 to 78661): Exon 1 75305 75309 ( 5 n); cDNA 272 276 ( 5 n); score: 0.800 Intron 1 75310 77021 (1712 n); Pd: 0.900 (s: 0), Pa: 0.000 (s: 0.88) Exon 2 77022 77630 ( 609 n); cDNA 277 881 ( 605 n); score: 0.914 PPA cDNA 883 902 MATCH C09HBa0099P03.1-2+ SGN-U339745- 0.914 614 0.664 C PGS_C09HBa0099P03.1-2+_SGN-U339745- (75305 75309,77022 77630) Alignment (genomic DNA sequence = upper lines): AACTTATATC ATTTTTTGCT CAAGTAGAAA TAAAAATAAA AAAACACTAA AAGAATAAAA 75364 |||| AACTG..... .......... .......... .......... .......... .......... 276 TTGTGAAATT AGTAAAATAT ATAGGTGAGG TGCAAAAGAG TTATGGGTGT TATCATATGT 75424 .......... .......... .......... .......... .......... .......... 276 TCTTTTTTCT TTTTCATATC ACTTCTAATC AATACACTTC AACAAGTTAA GAAGATTTTA 75484 .......... .......... .......... .......... .......... .......... 276 TTAAATAATT TCAAGTGTTT TCTAGAAGAT AAAACATTGC ATGTGTATCA ATTATTTTGT 75544 .......... .......... .......... .......... .......... .......... 276 TATGCCACGA ATATCTGGAA CAATTATTAG TTTTTTTTAT AATAAGATGA GATCAAATAC 75604 .......... .......... .......... .......... .......... .......... 276 CTGGAATTTG ACAACCTTTT TAGACATTTT GTTTCAATTC ACATAGTCAA TTTATATAAT 75664 .......... .......... .......... .......... .......... .......... 276 GGTGGGTCTA CATTATCTAA TATCTGTCCG TAATTAATGC ATTACAATAA TTTTTCTCCA 75724 .......... .......... .......... .......... .......... .......... 276 TTTTTTTAGG ACAATAATTC AGTAGTATTA CGACGAACTC TTTTCTTTGT TAATATTAAC 75784 .......... .......... .......... .......... .......... .......... 276 AGTAACACGT TTGATCCAAC TTTTAAAATT TATTGAAAAA AAAAATTTGA AATATATTCA 75844 .......... .......... .......... .......... .......... .......... 276 AATTTTAATT GAGATTATTG TCATAATATT GAATTTTGAA AATAATTTTT TATCCCTATT 75904 .......... .......... .......... .......... .......... .......... 276 CTATTTGATA GTGTATTTTA AATATATATA TATATATTCA CATGGACATC ACAAATATTG 75964 .......... .......... .......... .......... .......... .......... 276 TATCATTATA AATATTAATG TGTCCACATG GGCACATATA TATACCTTTA AAATACGTAA 76024 .......... .......... .......... .......... .......... .......... 276 AAGATCATCT ATAAAATTTG ATTTCGTAAC ATCAATTTCG ACCAAAATTA AAACGTTTTT 76084 .......... .......... .......... .......... .......... .......... 276 CAAACATTTT TCTCAAATTT ATTTTGTTAA ATTATTCAAT TGGGTTCATT GTTCACTCGT 76144 .......... .......... .......... .......... .......... .......... 276 AATTTAACAA GTAAATAATG GGAATTGGAT ATTTGTTTTT TTTTATAGTA AATTCATTCA 76204 .......... .......... .......... .......... .......... .......... 276 ACTTTCTCCA ACTCACACAA AAAGTAATCC CTTTTCTTCA TTAGTAGTGC AGAAGACTGA 76264 .......... .......... .......... .......... .......... .......... 276 TGAAAAAAAT GTGGCTAAAA AATTAAGTAT TATTCTACAG ATAAAATAAG AGTTTCCTTA 76324 .......... .......... .......... .......... .......... .......... 276 ATTTTTTTCT CCAATTTCGA ATGTTATGGC CGGTCCAAAG CTCTTGAAGA GGCACAAACG 76384 .......... .......... .......... .......... .......... .......... 276 TGAATATCTC TCTATTTTGA AGTCATAAAA ACGAACAGTT TAATGAGTTG GAAACGAAAC 76444 .......... .......... .......... .......... .......... .......... 276 TCACATAGTC CTTCGATTCC GATTCAAAAA ATACGTATGA ATAAACTGAT AATCGGATAT 76504 .......... .......... .......... .......... .......... .......... 276 AAAATTGTCA ACTTACATTC TAATTCCACC ATAATATTAT GATCAATTCA CCCTTTCTCA 76564 .......... .......... .......... .......... .......... .......... 276 ATAGTTTATC GTAGTCCCAT GGATCGTCAC TCCCTTCTTA ATAGTAGTAA ATAGTTCCGT 76624 .......... .......... .......... .......... .......... .......... 276 CATCATAAAA TGTTGTGTAC TATACTTGAG TCCACCAATA TTGAAAATAC TCTCTTCATA 76684 .......... .......... .......... .......... .......... .......... 276 GTTATTTCTA TATACATATA CATATGAACC AAATGGATCA TCTATATATA AAGATAGGGG 76744 .......... .......... .......... .......... .......... .......... 276 AGTAGCGTTA ATATTGTAGA CACATACATT GTAATTAGAT CAAATTCATA CAAAATTTGA 76804 .......... .......... .......... .......... .......... .......... 276 ATTAAATTCA TTTTCATTTG GACTGAATCA TTAACTTGGG CTTTACAAAT AAATGAGTTC 76864 .......... .......... .......... .......... .......... .......... 276 TTCTTCAAAA ACTAAAACAC AATTATGAAA GCGTCACTCT AATATATATA TTTTTTTACA 76924 .......... .......... .......... .......... .......... .......... 276 ATAGATTGTG AATATTACCA TGTCAAGTCA AAGTAATACA AATTAAAACA TCTAAATTTT 76984 .......... .......... .......... .......... .......... .......... 276 TAATCACCTT CTCACGCTAT TATAGAAACA TCATACATAT TTGTTATTTG TAGTTTTGAT 77044 || ||||||||| |||||||||| .......... .......... .......... .......TAA ATGTTATTTG TAGTTTTGAT 299 AATTTGACAA ACTTAGGGAC CTCATAAAGG TACCGGGTTT TCTACTTGAG TATTGCAGGT 77104 ||||||||| || ||||| |||||||||| || | |||| |||| ||| | |||||||||| GATTTGACAA ACAATGGGAC CTCATAAAGG TATCAGGTTC TCTAATTGTG TATTGCAGGT 359 GTCTTGTTTG AAGTATTGCA GATGAAACAA ACTCAGGGAC CTAGTAGAGG TACCAGGCCC 77164 |||||||||| | |||||||| || ||||||| | ||||||| |||||||||| |||||||| | GTCTTGTTTG AGGTATTGCA GAAGAAACAA AACCAGGGAC CTAGTAGAGG TACCAGGCTC 419 TCATGATGAT GAGCCAGTCA ACTGCCAATT GGAGA-CGTA CAGAAAGTAG GTGCACACTT 77223 |||||| || ||| ||||| ||||||| | ||||| ||| |||||||||| |||||||||| TCATGAAAAT GAGTCAGTCG ACTGCCAGCT GGAGATGGTA CAGAAAGTAG GTGCACACTT 479 CTCGATGGCG TCAGAAAAGT GTGACCTTTC CAACCAATGG CAAAGAAGTT TAGGAAAGTG 77283 | |||||||| |||||||||| ||||||||| || | ||||| ||||| |||| |||||||||| CCCGATGGCG TCAGAAAAGT GTGACCTTTT CAGCAAATGG CAAAGGAGTT TAGGAAAGTG 539 ACTTGCCCAT ATAAAAGGCA CTTTCCTAAA CATTTGTCTT TCAGTTTTTG CAACTTGATA 77343 ||||||||| |||||||||| |||||| ||| ||||||| || | |||||||| |||||||||| ACTTGCCCAC ATAAAAGGCA CTTTCCAAAA CATTTGTTTT TTAGTTTTTG CAACTTGATA 599 AAAACTTTTC AAGAACTCTT GCAAAGGCTA CACAAAGCAA AGGCAGAATC ATTCCTAGGA 77403 |||||||||| |||||| ||| |||||||||| ||| | || ||| |||||| ||||| || | AAAACTTTTC AAGAACACTT GCAAAGGCTA CAC--A--AA AGGTAGAATC ATTCCAAGAA 655 CAACAGCAAC ATCTGGAGCT TTTCGTCTAG TTGTTTAGGA ATTTATTCTC TTGTTCTTAA 77463 |||||||||| |||| ||||| | |||| ||| |||||||||| |||||||||| ||||| |||| CAACAGCAAC ATCTTGAGCT TGTCGTATAG TTGTTTAGGA ATTTATTCTC TTGTTGTTAA 715 ATTGTAAACC ACTCCTAAAT CTATAAAGGA ATTGGTGTGT TGTGTTAAAA GTCTAGGTTG 77523 |||||||||| ||| |||||| |||||||||| |||||||||| |||| ||||| |||||||||| ATTGTAAACC ACTGCTAAAT CTATAAAGGA ATTGGTGTGT TGTG-TAAAA GTCTAGGTTG 774 TCCAGGTGGG ATAGCTTAGT GGGTAGATTG TTTTTCTACT TTGGCTTGTT GAGCAATAGA 77583 |||| ||||| |||||||||| |||||||||| |||||||||| | |||||||| |||||||||| TCCAAGTGGG ATAGCTTAGT GGGTAGATTG TTTTTCTACT TAGGCTTGTT GAGCAATAGA 834 GGTCTATTGC TTAACGGTAA GATTGATAAC TCTTTCTTAC GTTTGGT 77630 || |||||| |||||||||| |||||||||| |||||||||| ||||||| GGACTATTGT TTAACGGTAA GATTGATAAC TCTTTCTTAC GTTTGGT 881 hqPGS_C09HBa0099P03.1-2+_SGN-U339745- (77022 77630) ******************************************************************************** EST sequence 28 -strand 588 n (File: SGN-U339744-) 1 AACTGTCAGC TGGAGATGGT GCAGAAAGTA GGTGCACACT TCCCGATGGC GTCAGAAAGT 61 GACCCTTCCA ACCAATGGCA AAGAAGTTTA GGAAAGTGAC TTGCCCATAT AAAAGGCACT 121 TTCCTAAACA CTTTTCTTCA GTTTTTGCAA CTTGATAAAA ACTTTTCAAG AACTCTTGCA 181 AAGGCTCTAC AAAGTAAAGG CAGAATTATT CCAAGAACAA CAGCAATATC TTGAGCTTTT 241 CGTCTAGTTG TTTAGGAATA TATTCTCTTG TTCTTAAACT GTAAACCATT CCTGAATCTA 301 TAAAGGATTT GGTGTGTTGT GTTAAAAGTC TAGGTTGTCC AGGTGGGATA GCTTAGTGGG 361 TAGATTGTTT TCTACTTAGG CTTGTTAAGC AATAGAGGAC TATTGCTTAA CGGTAAGATT 421 CATAACTCTT TCTTACGTTT GGTGTAATCG TGTTTTACTT TTGCTTTTGA AGATTAGTGA 481 AAACGATTGA AAAAAAAAAA AAAAAAAAAA CTCGAGAGTA CTTCTAGAGC GGCCGCGGGC 541 CCATCGATTT TCCACCCGGG TGGGGTACTG CGTAAGAGTG CTCAACAA Predicted gene structure (within gDNA segment 75991 to 79230): Exon 1 77184 77679 ( 496 n); cDNA 1 492 ( 492 n); score: 0.928 MATCH C09HBa0099P03.1-2+ SGN-U339744- 0.928 496 0.844 C PGS_C09HBa0099P03.1-2+_SGN-U339744- (77184 77679) Alignment (genomic DNA sequence = upper lines): AACTGCCAAT TGGAGACG-T ACAGAAAGTA GGTGCACACT TCTCGATGGC GTCAGAAAAG 77242 ||||| || |||||| | | ||||||||| |||||||||| || ||||||| ||||| ||| AACTGTCAGC TGGAGATGGT GCAGAAAGTA GGTGCACACT TCCCGATGGC GTCAG-AAA- 58 TGTGACCTTT CCAACCAATG GCAAAGAAGT TTAGGAAAGT GACTTGCCCA TATAAAAGGC 77302 |||||| || |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| -GTGACCCTT CCAACCAATG GCAAAGAAGT TTAGGAAAGT GACTTGCCCA TATAAAAGGC 117 ACTTTCCTAA ACATTTGTCT TTCAGTTTTT GCAACTTGAT AAAAACTTTT CAAGAACTCT 77362 |||||||||| ||| || || |||||||||| |||||||||| |||||||||| |||||||||| ACTTTCCTAA ACACTTTTC- TTCAGTTTTT GCAACTTGAT AAAAACTTTT CAAGAACTCT 176 TGCAAAGGCT ACACAAAGCA AAGGCAGAAT CATTCCTAGG ACAACAGCAA CATCTGGAGC 77422 |||||||||| |||||| | |||||||||| ||||| || |||||||||| |||| |||| TGCAAAGGCT CTACAAAGTA AAGGCAGAAT TATTCCAAGA ACAACAGCAA TATCTTGAGC 236 TTTTCGTCTA GTTGTTTAGG AATTTATTCT CTTGTTCTTA AATTGTAAAC CACTCCTAAA 77482 |||||||||| |||||||||| ||| |||||| |||||||||| || ||||||| || |||| || TTTTCGTCTA GTTGTTTAGG AATATATTCT CTTGTTCTTA AACTGTAAAC CATTCCTGAA 296 TCTATAAAGG AATTGGTGTG TTGTGTTAAA AGTCTAGGTT GTCCAGGTGG GATAGCTTAG 77542 |||||||||| | |||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCTATAAAGG ATTTGGTGTG TTGTGTTAAA AGTCTAGGTT GTCCAGGTGG GATAGCTTAG 356 TGGGTAGATT GTTTTTCTAC TTTGGCTTGT TGAGCAATAG AGGTCTATTG CTTAACGGTA 77602 |||||||||| | |||||||| || ||||||| | |||||||| ||| |||||| |||||||||| TGGGTAGATT G-TTTTCTAC TTAGGCTTGT TAAGCAATAG AGGACTATTG CTTAACGGTA 415 AGATTGATAA CTCTTTCTTA CGTTTGGTGT AATCGTGTTT CGCTTTTGCT TTTGAAGATT 77662 ||||| |||| |||||||||| |||||||||| |||||||||| |||||||| |||||||||| AGATTCATAA CTCTTTCTTA CGTTTGGTGT AATCGTGTTT TACTTTTGCT TTTGAAGATT 475 AGTGAAAACG ATTGAAA 77679 |||||||||| ||||||| AGTGAAAACG ATTGAAA 492 hqPGS_C09HBa0099P03.1-2+_SGN-U339744- (77184 77679) ******************************************************************************** EST sequence 4 +strand 889 n (File: SGN-U343802+) 1 CTGGAGCTCA CCGCGGTGGC GGCCGCTCTA GAACTAGTGG ATCCCCCGGG CTGCAGGAAT 61 TCGGCACGAG GGTTGCTTAG GAATATAATC TCTTGGTCTT ACATTATAAA CTATTCCTGA 121 AACTATAAAG GAATCAGTGG GTAGTGTTTA AATTGTAAGT TGTCTATGTG GTATAGCTTA 181 GTGGGTAGAT TCATTTCTAC TTAGGCTTGT AAAGCAATAG AGAATATTGC TTGAACGTGA 241 GATTAAAAAC TTTTTCTCAT ATTTGTTGTA ATCGTGTTTC ACTTTTGCTT GTGAAGATTA 301 GTGAAAACGA TTGAAAATCC TGTGAGACAA GTCGTGGTTT TACTCCCTTA AGCAAGGAGG 361 TTTCCACGTA AAATGTTGTG TTGTTTAAAC TGCATTTACT TTCTGTTCAT ACTTTTTAGT 421 GTGTCAAGTG AACTGGGTCA TTGACTTATA AGTGAAGGCA TTTATTCTAT CAGGCACCAA 481 GCAACACATT AGAGGGAAGC ATTTTATAAA GTATTTGTTT GTATTTTATT TATATTTTAT 541 CTACTATGTT CAATTTTTAA ACTGGAGTCT AACTGATTCA CATATTCTCC ATCGAGAAAC 601 ACATCAGAGG GAAGCACTCT ATGAAGTATT TTATTTAATT TTTTACCTGA TCTTTTCAAT 661 ATTTATATTA GAGTTCAAAT TATTTAATTA GATTTTGACA ACAAACAACA TTGTCGAGGG 721 AAGCACACTA TGAAGNNATT GNTTGNAATT TATATATTTT TCATCTATTG GATGGTCGAA 781 TTTATATTGA AGTCTAAATT TTAAGACGTC GGGGAAGTAC TCATGAAATA TTTATTTTTG 841 TATATACCTA GGCACTNCAT ATTTTAGCAC TTTATATTCT ACTTTCGCA Predicted gene structure (within gDNA segment 74145 to 80423): Exon 1 77434 77836 ( 403 n); cDNA 73 472 ( 400 n); score: 0.818 MATCH C09HBa0099P03.1-2+ SGN-U343802+ 0.818 403 0.453 C PGS_C09HBa0099P03.1-2+_SGN-U343802+ (77434 77836) Alignment (genomic DNA sequence = upper lines): TTGTTTAGGA ATTTATTCTC TTGTTCTTAA ATTGTAAACC ACTCCTAAAT CTATAAAGGA 77493 ||| |||||| || || |||| ||| ||||| ||| ||||| | |||| || |||||||||| TTGCTTAGGA ATATAATCTC TTGGTCTTAC ATTATAAACT ATTCCTGAAA CTATAAAGGA 132 ATTGGTGTGT TGTGTTAAAA GTCTAGGTTG TCCAGGTGGG ATAGCTTAGT GGGTAGATTG 77553 || ||| || ||||| ||| | || |||| || | |||| |||||||||| ||||||||| ATCAGTGGGT AGTGTTTAAA TTGTAAGTTG TCTATGTGGT ATAGCTTAGT GGGTAGATT- 191 TTTTTCTACT TTGGCTTGTT GAGCAATAGA GGTCTATTGC TTAACGGTAA GATTGATAAC 77613 |||||||| | ||||||| ||||||||| | |||||| || | || | |||| | ||| CATTTCTACT TAGGCTTGTA AAGCAATAGA -GAATATTGC TTGAACGTGA GATTAAAAAC 250 TCTTTCTTAC GTTTGGTGTA ATCGTGTTTC GCTTTTGCTT TTGAAGATTA GTGAAAACGA 77673 | ||||| | |||| |||| |||||||||| ||||||||| ||||||||| |||||||||| TTTTTCTCAT ATTTGTTGTA ATCGTGTTTC ACTTTTGCTT GTGAAGATTA GTGAAAACGA 310 TTGAAAATCC TGTGAGATAG GTCGTGGTTT TACTCCCTTA AGCAAGGAGG TTTCCACGTA 77733 |||||||||| ||||||| | |||||||||| |||||||||| |||||||||| |||||||||| TTGAAAATCC TGTGAGACAA GTCGTGGTTT TACTCCCTTA AGCAAGGAGG TTTCCACGTA 370 AAATCATTGT GTTAATTTTA CTGCATTTAA CTTTCTGGTA ATTTTCTGAA GTAAAGTAAG 77793 |||| |||| ||| || | |||||||| | ||||||| | || | | | || ||| AAAT-GTTGT GTTGTTTAAA CTGCATTT-A CTTTCTGTTC ATACTTTTTA GTGTGTCAAG 428 AGACCTGGTC CATT-ACTAA TAAGTGAAGG CATAAATTCT ATCA 77836 || |||| |||| ||| | |||||||||| ||| ||||| |||| TGAACTGGGT CATTGACTTA TAAGTGAAGG CATTTATTCT ATCA 472 hqPGS_C09HBa0099P03.1-2+_SGN-U343802+ (77434 77836) Total number of EST alignments reported: 34 ________________________________________________________________________________ Predicted gene locations (16) in segment 1 to 80423: PGL 1 (+ strand): 3900 10258 AGS-1 (3900 4085,4191 4225,4433 4570,5380 5448,6473 6630,6740 6806,6908 6961,7417 7505) SCR (e 1.000 d 1.000 a 0.871,e 1.000 d 0.972 a 1.000,e 1.000 d 0.998 a 0.439,e 1.000 d 0.843 a 0.935,e 1.000 d 0.987 a 0.805,e 1.000 d 0.988 a 0.998,e 0.926 d 0.949 a 0.963,e 0.882) Exon 1 3900 4085 ( 186 n); score: 1.000 Intron 1 4086 4190 ( 105 n); Pd: 1.000 Pa: 0.871 Exon 2 4191 4225 ( 35 n); score: 1.000 Intron 2 4226 4432 ( 207 n); Pd: 0.972 Pa: 1.000 Exon 3 4433 4570 ( 138 n); score: 1.000 Intron 3 4571 5379 ( 809 n); Pd: 0.998 Pa: 0.439 Exon 4 5380 5448 ( 69 n); score: 1.000 Intron 4 5449 6472 (1024 n); Pd: 0.843 Pa: 0.935 Exon 5 6473 6630 ( 158 n); score: 1.000 Intron 5 6631 6739 ( 109 n); Pd: 0.987 Pa: 0.805 Exon 6 6740 6806 ( 67 n); score: 1.000 Intron 6 6807 6907 ( 101 n); Pd: 0.988 Pa: 0.998 Exon 7 6908 6961 ( 54 n); score: 0.926 Intron 7 6962 7416 ( 455 n); Pd: 0.949 Pa: 0.963 Exon 8 7417 7505 ( 89 n); score: 0.882 PGS (3900 4085,4191 4225,4433 4570,5380 5448,6473 6630,6740 6806,6908 6961,7417 7505) SGN-U346148+ 3-phase translation of AGS-1 (+strand): . . . . . . 3900 TTTTCGCCGAGATCGTCAGTTCTCCTGCTCCGGCAGCCATGGCCGCCAAAGCAGCTGCTA F S P R S S V L L L R Q P W P P K Q L L F R R D R Q F S C S G S H G R Q S S C Y F A E I V S S P A P A A M A A K A A A . . . . . . 3960 TACGCAAGAGTAATAGATCGGAGAATTTCATTCAAAAACTCGTTAAAAATCCTAAAATAC Y A R V I D R R I S F K N S L K I L K Y T Q E - - I G E F H S K T R - K S - N T I R K S N R S E N F I Q K L V K N P K I . . . . . . 4020 CTTTTGCTATTGCAATACTCATTGCTGATGCCATCCTCGTTGCGTTGATTATCGCTTACG L L L L Q Y S L L M P S S L R - L S L T F C Y C N T H C - C H P R C V D Y R L R P F A I A I L I A D A I L V A L I I A Y . : . . . . : . 4080 TTCCAT : ATACGAAAATTGATTGGGATGCTTATATGTCTCAG : GTTACTGGTTTTCTCGAAG F H : I R K L I G M L I C L R : L L V F S K S I : Y E N - L G C L Y V S : G Y W F S R R V P : Y T K I D W D A Y M S Q : V T G F L E . . . . . . 4452 GAGAGAGGGATTATAGTAACTTGAAAGGTGACACGGGGCCTCTAGTTTACCCAGCAGGCT E R G I I V T - K V T R G L - F T Q Q A R E G L - - L E R - H G A S S L P S R L G E R D Y S N L K G D T G P L V Y P A G . . . . . . : 4512 TTCTTTATATTTACTCTGCTATACAATATGTTACTGGAGGTCAAGTCTATCCTGCTCAG : A F F I F T L L Y N M L L E V K S I L L R : S L Y L L C Y T I C Y W R S S L S C S : D F L Y I Y S A I Q Y V T G G Q V Y P A Q : . . . . . . 5381 TTCTTTTTGGCTTTCTCTACGTGCTGGATCTTGCAATTGTCTTGTTCATCTACTTGAAGA F F L A F S T C W I L Q L S C S S T - R S F W L S L R A G S C N C L V H L L E D I L F G F L Y V L D L A I V L F I Y L K . : . . . . . 5441 CTGATGTG : GTACCTTGGTGGGCTCTCTCCTTGCTTTCTCTGTCGAAAAGAGTTCACTCTA L M W : Y L G G L S P C F L C R K E F T L - C : G T L V G S L L A F S V E K S S L Y T D V : V P W W A L S L L S L S K R V H S . . . . . . 6525 TCTTTGTTCTTCGATTATTTAATGATTGTTTTGCCACTACTCTCCTCCATGCTGCATTGG S L F F D Y L M I V L P L L S S M L H W L C S S I I - - L F C H Y S P P C C I G I F V L R L F N D C F A T T L L H A A L . . . . . : . 6585 TCTCAATTATCTGCCAAAAATGGCATCTAGGGTTGGTAATTTTCAG : CGGAGCTGTTTCCA S Q L S A K N G I - G W - F S : A E L F P L N Y L P K M A S R V G N F Q : R S C F H V S I I C Q K W H L G L V I F S : G A V S . . . . . . : 6754 TAAAGATGAATGTGCTCCTGTATGCACCACCTCTGTTGCTCCTCATGGTGAAG : GCAATGG - R - M C S C M H H L C C S S W - R : Q W K D E C A P V C T T S V A P H G E : G N G I K M N V L L Y A P P L L L L M V K : A M . . . . . : . 6915 ATATTGTTGGAGTTATATCTGCTTTAGCAGGGGCTGCATTAGTGCAG : ATTCTCATAGGGC I L L E L Y L L - Q G L H - C R : F S - G Y C W S Y I C F S R G C I S A : D S H R A D I V G V I S A L A G A A L V Q : I L I G . . . . . . 7430 TTCCTTTTATCCTGTCACATCCAGCTTCATATTTATCAAACGCTTTCAATCTTGGTCGGG F L L S C H I Q L H I Y Q T L S I L V G S F Y P V T S S F I F I K R F Q S W S G L P F I L S H P A S Y L S N A F N L G R . . 7490 TTTTCATCCACTTCTG F S S T S F H P L L V F I H F Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-2+_PGL-1_AGS-1_PPS_1 (3902 4085,4191 4225,4433 4570,5380 5448,6473 6630,6740 6806,6908 6961,7417 7503) (frame '0'; 792 bp, 264 residues) 1 FAEIVSSPAP AAMAAKAAAI RKSNRSENFI QKLVKNPKIP FAIAILIADA ILVALIIAYV 61 PYTKIDWDAY MSQVTGFLEG ERDYSNLKGD TGPLVYPAGF LYIYSAIQYV TGGQVYPAQI 121 LFGFLYVLDL AIVLFIYLKT DVVPWWALSL LSLSKRVHSI FVLRLFNDCF ATTLLHAALV 181 SIICQKWHLG LVIFSGAVSI KMNVLLYAPP LLLLMVKAMD IVGVISALAG AALVQILIGL 241 PFILSHPASY LSNAFNLGRV FIHF AGS-2 (6922 6961,7320 7505,8201 8317,8489 8595,8732 8810,9624 9680,9764 10258) SCR (e 1.000 d 0.949 a 0.977,e 1.000 d 0.549 a 0.996,e 0.991 d 0.996 a 0.993,e 1.000 d 0.976 a 0.967,e 1.000 d 0.994 a 0.776,e 1.000 d 0.996 a 0.988,e 0.991) Exon 1 6922 6961 ( 40 n); score: 1.000 Intron 1 6962 7319 ( 358 n); Pd: 0.949 Pa: 0.977 Exon 2 7320 7505 ( 186 n); score: 1.000 Intron 2 7506 8200 ( 695 n); Pd: 0.549 Pa: 0.996 Exon 3 8201 8317 ( 117 n); score: 0.991 Intron 3 8318 8488 ( 171 n); Pd: 0.996 Pa: 0.993 Exon 4 8489 8595 ( 107 n); score: 1.000 Intron 4 8596 8731 ( 136 n); Pd: 0.976 Pa: 0.967 Exon 5 8732 8810 ( 79 n); score: 1.000 Intron 5 8811 9623 ( 813 n); Pd: 0.994 Pa: 0.776 Exon 6 9624 9680 ( 57 n); score: 1.000 Intron 6 9681 9763 ( 83 n); Pd: 0.996 Pa: 0.988 Exon 7 9764 10258 ( 495 n); score: 0.991 PGS (6922 6961,7320 7505,8201 8317,8489 8595,8732 8810,9624 9680,9764 10258) SGN-U328710+ 3-phase translation of AGS-2 (+strand): . . . . : . . 6922 TGGAGTTATATCTGCTTTAGCAGGGGCTGCATTAGTGCAG : GTGAGTCGGTTACGAGTATA W S Y I C F S R G C I S A : G E S V T S I G V I S A L A G A A L V Q : V S R L R V Y E L Y L L - Q G L H - C R : - V G Y E Y . . . . . . 7340 CTTCTGTTGAATGGAGCTTCTTCTGATCAAATTCTTTACCAATGATCATGGTGATTGGTG L L L N G A S S D Q I L Y Q - S W - L V F C - M E L L L I K F F T N D H G D W - T S V E W S F F - S N S L P M I M V I G . . . . . . 7400 ATTGGTGATTGGTGCAGATTCTCATAGGGCTTCCTTTTATCCTGTCACATCCAGCTTCAT I G D W C R F S - G F L L S C H I Q L H L V I G A D S H R A S F Y P V T S S F I D W - L V Q I L I G L P F I L S H P A S . . . . . : . 7460 ATTTATCAAACGCTTTCAATCTTGGTCGGGTTTTCATCCACTTCTG : GTCTGTCAACTTCA I Y Q T L S I L V G F S S T S : G L S T S F I K R F Q S W S G F H P L L : V C Q L Q Y L S N A F N L G R V F I H F W : S V N F . . . . . . 8215 AATTTGTTCCTGAAGACATCTTTGTTTCTAAAGCTTTTGCTCTCTCTTTGCTAGTTGCTC N L F L K T S L F L K L L L S L C - L L I C S - R H L C F - S F C S L F A S C S K F V P E D I F V S K A F A L S L L V A . . . . . : . 8275 ATCTCAGTCTGCTATTGGTGTTTGCTCATTACAGATGGTGCAG : GCATGAAGGAGGACTGT I S V C Y W C L L I T D G A : G M K E D C S Q S A I G V C S L Q M V Q : A - R R T V H L S L L L V F A H Y R W C R : H E G G L . . . . . . 8506 TTGCTGTTGTGCGTTCTAAAATCATTCAACTGAAGCTCAGAGTTTCTCAGAGAAATCCTT L L L C V L K S F N - S S E F L R E I L C C C A F - N H S T E A Q S F S E K S F F A V V R S K I I Q L K L R V S Q R N P . . . : . . . 8566 CCTCAACCAAGAAAGTCCTTCAAGCTGACC : ATATTGTGACGACTATGTTTGTTGGGAATT P Q P R K S F K L T : I L - R L C L L G I L N Q E S P S S - P : Y C D D Y V C W E F S S T K K V L Q A D : H I V T T M F V G N . . . . . : . 8762 TCATTGGCATTATATGTGCCCGATCCCTCCATTACCAATTTTATTCTTG : GTACTTCTATT S L A L Y V P D P S I T N F I L : G T S I H W H Y M C P I P P L P I L F L : V L L L F I G I I C A R S L H Y Q F Y S W : Y F Y . . . . . : . 9635 GCTTACCATATTTATTGTGGAAAGCACCATTTCCAACCCTCCTACG : TTTATTCTTGTTCG A Y H I Y C G K H H F Q P S Y : V Y S C S L T I F I V E S T I S N P P T : F I L V R C L P Y L L W K A P F P T L L R : L F L F . . . . . . 9778 CAGCTGTAGAGTTTTGCTGGAACGTCTTCCCCTCCAACACTTGCTCATCACTTGTCCTCC Q L - S F A G T S S P P T L A H H L S S S C R V L L E R L P L Q H L L I T C P P A A V E F C W N V F P S N T C S S L V L . . . . . . 9838 TCTGTGTCCATTTGATCATATTGGCCGGTCTATGGATAAGTTCACCAGAATATCCGTACG S V S I - S Y W P V Y G - V H Q N I R T L C P F D H I G R S M D K F T R I S V R L C V H L I I L A G L W I S S P E Y P Y . . . . . . 9898 TCGAAGAAAAAACAACTTATAAATCTACACCTAAGAAGAAGGCCAGATAAAGCACTATCT S K K K Q L I N L H L R R R P D K A L S R R K N N L - I Y T - E E G Q I K H Y L V E E K T T Y K S T P K K K A R - S T I . . . . . . 9958 GGTTATGCATGTGAATGGCAGATAAAGAAAAAACAACTGATAAAACAAAGTTTTTGTTTT G Y A C E W Q I K K K Q L I K Q S F C F V M H V N G R - R K N N - - N K V F V F W L C M - M A D K E K T T D K T K F L F . . . . . . 10018 TCTGTTTCTTTTCGTAGTGTTAATGCTTACAGTTTTGTTAGATGGTATACAAAACCAGAA S V S F R S V N A Y S F V R W Y T K P E L F L F V V L M L T V L L D G I Q N Q K F C F F S - C - C L Q F C - M V Y K T R . . . . . . 10078 AGGTGGTAACAACCACACGAAGATCATCCAATGGCATCAAGGATGAATATTTTCTGGGGG R W - Q P H E D H P M A S R M N I F W G G G N N H T K I I Q W H Q G - I F S G G K V V T T T R R S S N G I K D E Y F L G . . . . . . 10138 TTTTCAACATTTGAGGATTTCATTAGCTCAAATCTGAATTAGACTTGAATATTTTAGAAG F S T F E D F I S S N L N - T - I F - K F Q H L R I S L A Q I - I R L E Y F R R V F N I - G F H - L K S E L D L N I L E . . . . . . 10198 GTCAACCTATTTTATTTATCTAAATCTGTATGGTGTACTATTTATTGTCAAGATTTAAGG V N L F Y L S K S V W C T I Y C Q D L R S T Y F I Y L N L Y G V L F I V K I - G G Q P I L F I - I C M V Y Y L L S R F K . 10258 A Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-2+_PGL-1_AGS-2_PPS_1 (7408 7505,8201 8317,8489 8595,8732 8810,9624 9680,9764 9947) (frame '0'; 639 bp, 213 residues) 1 LVQILIGLPF ILSHPASYLS NAFNLGRVFI HFWSVNFKFV PEDIFVSKAF ALSLLVAHLS 61 LLLVFAHYRW CRHEGGLFAV VRSKIIQLKL RVSQRNPSST KKVLQADHIV TTMFVGNFIG 121 IICARSLHYQ FYSWYFYCLP YLLWKAPFPT LLRLFLFAAV EFCWNVFPSN TCSSLVLLCV 181 HLIILAGLWI SSPEYPYVEE KTTYKSTPKK KAR- PGL 2 (- strand): 11457 10904 AGS-1 (11457 10904) SCR (e 0.998) Exon 1 11457 10904 ( 554 n); score: 0.998 PGS (11457 10904) SGN-U323147+ 3-phase translation of AGS-1 (-strand): . . . . . . 11457 CACCGATCAGCATTGAAAAATTTATCAACAATGGCTGCGAATTCCTTTTGTTCCATTTTC H R S A L K N L S T M A A N S F C S I F T D Q H - K I Y Q Q W L R I P F V P F S P I S I E K F I N N G C E F L L F H F . . . . . . 11397 ATCATCTCTTCATTATTGATCGCAGCTTTGATCATCTCCGGCGATGCTACCGGCGGCGAT I I S S L L I A A L I I S G D A T G G D S S L H Y - S Q L - S S P A M L P A A I H H L F I I D R S F D H L R R C Y R R R . . . . . . 11337 TTCGACGTGAGCGGTTGGATTCCGATGAAATCCGCCGATAGCTGTGAAGGTTCGATAGCG F D V S G W I P M K S A D S C E G S I A S T - A V G F R - N P P I A V K V R - R F R R E R L D S D E I R R - L - R F D S . . . . . . 11277 GAGTGTATGGCTGCCGGAGAATTCGAAATGGATTCGGAGAGCAACAGGCGTATATTAGCA E C M A A G E F E M D S E S N R R I L A S V W L P E N S K W I R R A T G V Y - Q G V Y G C R R I R N G F G E Q Q A Y I S . . . . . . 11217 ACTACTGATTATATAAGCTATGGTGCGCTGCAGAGTAACAGTGTTCCGTGTTCTAGAAGA T T D Y I S Y G A L Q S N S V P C S R R L L I I - A M V R C R V T V F R V L E E N Y - L Y K L W C A A E - Q C S V F - K . . . . . . 11157 GGTGCGTCGTATTATAACTGCAAAACAGGTGCTGAAGCTAATCCGTATACACGTGGTTGC G A S Y Y N C K T G A E A N P Y T R G C V R R I I T A K Q V L K L I R I H V V A R C V V L - L Q N R C - S - S V Y T W L . . . . . . 11097 AGTGCTATTACTCGTTGCCGGAGTTAAATTAATTAAAGATCGAATTAATCGATGTTAATT S A I T R C R S - I N - R S N - S M L I V L L L V A G V K L I K D R I N R C - L Q C Y Y S L P E L N - L K I E L I D V N . . . . . . 11037 AATTATTAGTAAGTGTAATTGTTTTGAATAATTTCGTAGTGTTTATATTGTATACTTTAA N Y - - V - L F - I I S - C L Y C I L - I I S K C N C F E - F R S V Y I V Y F K - L L V S V I V L N N F V V F I L Y T L . . . . . . 10977 GTAGGAGTATTTTTCTTTTCAGTTGCAATTTCAAATAAAGTGACAGTGGTGCTTTGGCAG V G V F F F S V A I S N K V T V V L W Q - E Y F S F Q L Q F Q I K - Q W C F G S S R S I F L F S C N F K - S D S G A L A . . 10917 TGGTTTATGGTTAA W F M V G L W L V V Y G - Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-2-_PGL-2_AGS-1_PPS_1 (11457 11071) (frame '1'; 384 bp, 128 residues) 1 HRSALKNLST MAANSFCSIF IISSLLIAAL IISGDATGGD FDVSGWIPMK SADSCEGSIA 61 ECMAAGEFEM DSESNRRILA TTDYISYGAL QSNSVPCSRR GASYYNCKTG AEANPYTRGC 121 SAITRCRS- 3-phase translation of AGS-1 (+strand): . . . . . . 10904 TTAACCATAAACCACTGCCAAAGCACCACTGTCACTTTATTTGAAATTGCAACTGAAAAG L T I N H C Q S T T V T L F E I A T E K - P - T T A K A P L S L Y L K L Q L K R N H K P L P K H H C H F I - N C N - K . . . . . . 10964 AAAAATACTCCTACTTAAAGTATACAATATAAACACTACGAAATTATTCAAAACAATTAC K N T P T - S I Q Y K H Y E I I Q N N Y K I L L L K V Y N I N T T K L F K T I T E K Y S Y L K Y T I - T L R N Y S K Q L . . . . . . 11024 ACTTACTAATAATTAATTAACATCGATTAATTCGATCTTTAATTAATTTAACTCCGGCAA T Y - - L I N I D - F D L - L I - L R Q L T N N - L T S I N S I F N - F N S G N H L L I I N - H R L I R S L I N L T P A . . . . . . 11084 CGAGTAATAGCACTGCAACCACGTGTATACGGATTAGCTTCAGCACCTGTTTTGCAGTTA R V I A L Q P R V Y G L A S A P V L Q L E - - H C N H V Y T D - L Q H L F C S Y T S N S T A T T C I R I S F S T C F A V . . . . . . 11144 TAATACGACGCACCTCTTCTAGAACACGGAACACTGTTACTCTGCAGCGCACCATAGCTT - Y D A P L L E H G T L L L C S A P - L N T T H L F - N T E H C Y S A A H H S L I I R R T S S R T R N T V T L Q R T I A . . . . . . 11204 ATATAATCAGTAGTTGCTAATATACGCCTGTTGCTCTCCGAATCCATTTCGAATTCTCCG I - S V V A N I R L L L S E S I S N S P Y N Q - L L I Y A C C S P N P F R I L R Y I I S S C - Y T P V A L R I H F E F S . . . . . . 11264 GCAGCCATACACTCCGCTATCGAACCTTCACAGCTATCGGCGGATTTCATCGGAATCCAA A A I H S A I E P S Q L S A D F I G I Q Q P Y T P L S N L H S Y R R I S S E S N G S H T L R Y R T F T A I G G F H R N P . . . . . . 11324 CCGCTCACGTCGAAATCGCCGCCGGTAGCATCGCCGGAGATGATCAAAGCTGCGATCAAT P L T S K S P P V A S P E M I K A A I N R S R R N R R R - H R R R - S K L R S I T A H V E I A A G S I A G D D Q S C D Q . . . . . . 11384 AATGAAGAGATGATGAAAATGGAACAAAAGGAATTCGCAGCCATTGTTGATAAATTTTTC N E E M M K M E Q K E F A A I V D K F F M K R - - K W N K R N S Q P L L I N F S - - R D D E N G T K G I R S H C - - I F . . 11444 AATGCTGATCGGTG N A D R M L I G Q C - S V Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-2+_PGL-2_AGS-1_PPS_1 (11210 11455) (frame '1'; 246 bp, 82 residues) 1 SVVANIRLLL SESISNSPAA IHSAIEPSQL SADFIGIQPL TSKSPPVASP EMIKAAINNE 61 EMMKMEQKEF AAIVDKFFNA DR PGL 3 (+ strand): 13807 14395 AGS-1 (13807 14395) SCR (e 0.818) Exon 1 13807 14395 ( 589 n); score: 0.818 PGS (13807 14395) SGN-U326156- PGS (14128 14367) SGN-U324134- PGS (14138 14367) SGN-U337965+ 3-phase translation of AGS-1 (+strand): . . . . . . 13807 TAGGGCTATGTATCGATTGGTTCGATTTGATTTTAAAGTCTATCGAATTGGCTTATTGAT - G Y V S I G S I - F - S L S N W L I D R A M Y R L V R F D F K V Y R I G L L I G L C I D W F D L I L K S I E L A Y - . . . . . . 13867 TATCAGTTTGTAGAGATGCTAAACCGTGATAGAACCATTAAGATATTGAGTTATCAGTTT Y Q F V E M L N R D R T I K I L S Y Q F I S L - R C - T V I E P L R Y - V I S F L S V C R D A K P - - N H - D I E L S V . . . . . . 13927 TTTATCGTTATCGGTTCGGCTATCGATTTAATCGTTAAGATTTGACACAAACAAAAAAAT F I V I G S A I D L I V K I - H K Q K N L S L S V R L S I - S L R F D T N K K I F Y R Y R F G Y R F N R - D L T Q T K K . . . . . . 13987 ATTAAAAATCACTTAGAAACAAGGTGACAAACCAAATAAACCATGTACTTGAGTTCACAA I K N H L E T R - Q T K - T M Y L S S Q L K I T - K Q G D K P N K P C T - V H K Y - K S L R N K V T N Q I N H V L E F T . . . . . . 14047 GTTACATCTCGCTCAAAAGCAAACACTTTCACATTGTAGAATAATCAAGTGTTTGAGACA V T S R S K A N T F T L - N N Q V F E T L H L A Q K Q T L S H C R I I K C L R Q S Y I S L K S K H F H I V E - S S V - D . . . . . . 14107 ATTAAAAATAAAAGTAGGAAATTAAACTCTAAGTCGAGAACTTTATATACAAAATGGTAT I K N K S R K L N S K S R T L Y T K W Y L K I K V G N - T L S R E L Y I Q N G I N - K - K - E I K L - V E N F I Y K M V . . . . . . 14167 AAATATAATTATTTAATTTACTATCGAGTTATCGATTAACCCGTTAAGAAAAAACTTTAA K Y N Y L I Y Y R V I D - P V K K K L - N I I I - F T I E L S I N P L R K N F K - I - L F N L L S S Y R L T R - E K T L . . . . . . 14227 ACCGTTAAGAACCGATAACCCGATAACAAAAAAAAATCAAAACCGTTATCAAAACCACTA T V K N R - P D N K K K S K P L S K P L P L R T D N P I T K K N Q N R Y Q N H - N R - E P I T R - Q K K I K T V I K T T . . . . . . 14287 AACCAATAACCCAATACTATAAATCAATAACTTTTTTATCAGTTCGACTTATCGGTTTCA N Q - P N T I N Q - L F Y Q F D L S V S T N N P I L - I N N F F I S S T Y R F Q K P I T Q Y Y K S I T F L S V R L I G F . . . . . 14347 ATTCAATTTTGAACGGCCCTAGTAGTATAGTACTTTTTATATAGGCTTA I Q F - T A L V V - Y F L Y R L F N F E R P - - Y S T F Y I G L N S I L N G P S S I V L F I - A Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-1 (-strand): . . . . . . 14395 TAAGCCTATATAAAAAGTACTATACTACTAGGGCCGTTCAAAATTGAATTGAAACCGATA - A Y I K S T I L L G P F K I E L K P I K P I - K V L Y Y - G R S K L N - N R - S L Y K K Y Y T T R A V Q N - I E T D . . . . . . 14335 AGTCGAACTGATAAAAAAGTTATTGATTTATAGTATTGGGTTATTGGTTTAGTGGTTTTG S R T D K K V I D L - Y W V I G L V V L V E L I K K L L I Y S I G L L V - W F - K S N - - K S Y - F I V L G Y W F S G F . . . . . . 14275 ATAACGGTTTTGATTTTTTTTTGTTATCGGGTTATCGGTTCTTAACGGTTTAAAGTTTTT I T V L I F F C Y R V I G S - R F K V F - R F - F F F V I G L S V L N G L K F F D N G F D F F L L S G Y R F L T V - S F . . . . . . 14215 TCTTAACGGGTTAATCGATAACTCGATAGTAAATTAAATAATTATATTTATACCATTTTG S - R V N R - L D S K L N N Y I Y T I L L N G L I D N S I V N - I I I F I P F C F L T G - S I T R - - I K - L Y L Y H F . . . . . . 14155 TATATAAAGTTCTCGACTTAGAGTTTAATTTCCTACTTTTATTTTTAATTGTCTCAAACA Y I K F S T - S L I S Y F Y F - L S Q T I - S S R L R V - F P T F I F N C L K H V Y K V L D L E F N F L L L F L I V S N . . . . . . 14095 CTTGATTATTCTACAATGTGAAAGTGTTTGCTTTTGAGCGAGATGTAACTTGTGAACTCA L D Y S T M - K C L L L S E M - L V N S L I I L Q C E S V C F - A R C N L - T Q T - L F Y N V K V F A F E R D V T C E L . . . . . . 14035 AGTACATGGTTTATTTGGTTTGTCACCTTGTTTCTAAGTGATTTTTAATATTTTTTTGTT S T W F I W F V T L F L S D F - Y F F V V H G L F G L S P C F - V I F N I F L F K Y M V Y L V C H L V S K - F L I F F C . . . . . . 13975 TGTGTCAAATCTTAACGATTAAATCGATAGCCGAACCGATAACGATAAAAAACTGATAAC C V K S - R L N R - P N R - R - K T D N V S N L N D - I D S R T D N D K K L I T L C Q I L T I K S I A E P I T I K N - - . . . . . . 13915 TCAATATCTTAATGGTTCTATCACGGTTTAGCATCTCTACAAACTGATAATCAATAAGCC S I S - W F Y H G L A S L Q T D N Q - A Q Y L N G S I T V - H L Y K L I I N K P L N I L M V L S R F S I S T N - - S I S . . . . . 13855 AATTCGATAGACTTTAAAATCAAATCGAACCAATCGATACATAGCCCTA N S I D F K I K S N Q S I H S P I R - T L K S N R T N R Y I A L Q F D R L - N Q I E P I D T - P Maximal non-overlapping open reading frames (>= 64 codons): none PGL 4 (- strand): 20304 15525 AGS-1 (16616 16584,16372 16272,16105 16009,15823 15525) SCR (e 1.000 d 0.996 a 0.982,e 1.000 d 0.930 a 0.997,e 1.000 d 0.940 a 0.993,e 0.993) Exon 1 16616 16584 ( 33 n); score: 1.000 Intron 1 16583 16373 ( 211 n); Pd: 0.996 Pa: 0.982 Exon 2 16372 16272 ( 101 n); score: 1.000 Intron 2 16271 16106 ( 166 n); Pd: 0.930 Pa: 0.997 Exon 3 16105 16009 ( 97 n); score: 1.000 Intron 3 16008 15824 ( 185 n); Pd: 0.940 Pa: 0.993 Exon 4 15823 15525 ( 299 n); score: 0.993 PGS (16616 16584,16372 16272,16105 16009,15823 15525) SGN-U329066+ 3-phase translation of AGS-1 (-strand): . . . . : . . 16616 CAAAAGGCAGTCGCTTCCCTCCCATCCATAGGG : GAAGATTTTGACCAAAGAACTCAGTCA Q K A V A S L P S I G : E D F D Q R T Q S K R Q S L P S H P - G : K I L T K E L S Q K G S R F P P I H R : G R F - P K N S V . . . . . . 16345 ATAATTGCAAAATCAAAACTGAAGTCAGAGATCCGGTTCAAGGCGGAGGTCGTTTCACCT I I A K S K L K S E I R F K A E V V S P - L Q N Q N - S Q R S G S R R R S F H L N N C K I K T E V R D P V Q G G G R F T . . : . . . . 16285 GCAGAGTGGGAAAG : GTGGATAAGGGATCAACAGAAGTCTGAAGGTGTTACTCCTGGTGAA A E W E R : W I R D Q Q K S E G V T P G E Q S G K : G G - G I N R S L K V L L L V K C R V G K : V D K G S T E V - R C Y S W - . . . . . . : 16059 GATGTCTACATTATACTTCGTCTAGATGGTCGTGTTCGACGATCAGGAAAG : GGGATGCCT D V Y I I L R L D G R V R R S G K : G M P M S T L Y F V - M V V F D D Q E R : G C L R C L H Y T S S R W S C S T I R K : G D A . . . . . . 15814 GACTGGGCTCAAATTTTAAAGGAACTGCCTCCAATGGAAGCAATATTAAGCAAGCTAGAA D W A Q I L K E L P P M E A I L S K L E T G L K F - R N C L Q W K Q Y - A S - K - L G S N F K G T A S N G S N I K Q A R . . . . . . 15754 AGATAATACAAGTTCCTCTTGCTGAGTGCTAAAATCTTTTCACTACCTGGTGTATTTTCG R - Y K F L L L S A K I F S L P G V F S D N T S S S C - V L K S F H Y L V Y F R K I I Q V P L A E C - N L F T T W C I F . . . . . . 15694 GTCAGTCTCCTTACTTGAAATATAAAAGGCACTAATTGATGCCTGTTGAGCTTGTCTGTT V S L L T - N I K G T N - C L L S L S V S V S L L E I - K A L I D A C - A C L F G Q S P Y L K Y K R H - L M P V E L V C . . . . . . 15634 TTACGTTTCCATTTTTTCACGAACATATACTGTACTATAATTCTATAAATTGATATTGAT L R F H F F T N I Y C T I I L - I D I D Y V S I F S R T Y T V L - F Y K L I L I F T F P F F H E H I L Y Y N S I N - Y - . . . . . 15574 ATTTGCCAGATGAATATCCAAAGGATATGAAATCCCATAGGAAATAGAAA I C Q M N I Q R I - N P I G N R F A R - I S K G Y E I P - E I E Y L P D E Y P K D M K S H R K - K Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-2-_PGL-4_AGS-1_PPS_1 (16616 16584,16372 16272,16105 16009,15823 15749) (frame '1'; 303 bp, 101 residues) 1 QKAVASLPSI GEDFDQRTQS IIAKSKLKSE IRFKAEVVSP AEWERWIRDQ QKSEGVTPGE 61 DVYIILRLDG RVRRSGKGMP DWAQILKELP PMEAILSKLE R- AGS-2 (20303 20050,18482 18379,18021 17886,17716 17485,17338 16813) SCR (e 0.996 d 0.998 a 0.988,e 1.000 d 0.998 a 0.999,e 1.000 d 0.996 a 0.974,e 1.000 d 0.994 a 0.175,e 0.861) Exon 1 20303 20050 ( 254 n); score: 0.996 Intron 1 20049 18483 (1567 n); Pd: 0.998 Pa: 0.988 Exon 2 18482 18379 ( 104 n); score: 1.000 Intron 2 18378 18022 ( 357 n); Pd: 0.998 Pa: 0.999 Exon 3 18021 17886 ( 136 n); score: 1.000 Intron 3 17885 17717 ( 169 n); Pd: 0.996 Pa: 0.974 Exon 4 17716 17485 ( 232 n); score: 1.000 Intron 4 17484 17339 ( 146 n); Pd: 0.994 Pa: 0.175 Exon 5 17338 16813 ( 526 n); score: 0.861 PGS (20303 20050,18482 18379,18021 17886,17716 17485,17338 16813) SGN-U324334+ 3-phase translation of AGS-2 (-strand): . . . . . . 20303 TTGACATGGCCATTACAACTCTTCCCTCCTCTCATCACCTCCATTTCTGCTTCTTCTCTT L T W P L Q L F P P L I T S I S A S S L - H G H Y N S S L L S S P P F L L L L F D M A I T T L P S S H H L H F C F F S . . . . . . 20243 CATCATCTTCCAAATTCTCAATCACTACACAATTATGCCTAAATTTTCCCCCGAAGAATC H H L P N S Q S L H N Y A - I F P R R I I I F Q I L N H Y T I M P K F S P E E S S S S S K F S I T T Q L C L N F P P K N . . . . . . 20183 GTATTTCACTCGTCATTTGTTCTTCTTCTACTTCGCCCCCTCCGCCATCATCGTCGTCGC V F H S S F V L L L L R P L R H H R R R Y F T R H L F F F Y F A P S A I I V V A R I S L V I C S S S T S P P P P S S S S . . . . . . 20123 CCCAAGTCAGTGAACCCACTCCCACTGCTGAGTCCTGTGTCAATGCCGGCCTCGACCTCT P K S V N P L P L L S P V S M P A S T S P S Q - T H S H C - V L C Q C R P R P L P Q V S E P T P T A E S C V N A G L D L . . : . . . . 20063 TCTCTAAAGGACGG : GTGAAAGATGCTCTTGTGCTATTCGACACAGCACTTACTTTAAATC S L K D G : - K M L L C Y S T Q H L L - I L - R T : G E R C S C A I R H S T Y F K S F S K G R : V K D A L V L F D T A L T L N . . . . . . : 18436 CCAACCCTGAGGAGGCTCAAGCTGCATTTTATAACAAAGCATGTTGCCATGCCTACAG : GG P T L R R L K L H F I T K H V A M P T : G Q P - G G S S C I L - Q S M L P C L Q : G P N P E E A Q A A F Y N K A C C H A Y R : . . . . . . 18019 GGGAAGGAAAGAAAGCTGCTGAGTGTTTACGCACTGCTTTGAAAGAGTATAACCTTAAGT G K E R K L L S V Y A L L - K S I T L S G R K E S C - V F T H C F E R V - P - V G E G K K A A E C L R T A L K E Y N L K . . . . . . 17959 TTGGCACAATCTTAAATGATCCAGATTTGGCTTCCTTCAGAGCATTGCCTGAATTTAAGG L A Q S - M I Q I W L P S E H C L N L R W H N L K - S R F G F L Q S I A - I - G F G T I L N D P D L A S F R A L P E F K . . : . . . . 17899 AACTACAGGAAGAG : GCTAGGTTAGGAGGGGAAGACATAGGTTACAGTTTCCGTAGAGACC N Y R K R : L G - E G K T - V T V S V E T T T G R : G - V R R G R H R L Q F P - R P E L Q E E : A R L G G E D I G Y S F R R D . . . . . . 17670 TTAAGCTCATCAGTGAGGTACAAGCACCGTTTAGGGGGGTCAGAAGATTCTTTTATGTGG L S S S V R Y K H R L G G S E D S F M W - A H Q - G T S T V - G G Q K I L L C G L K L I S E V Q A P F R G V R R F F Y V . . . . . . 17610 CATTCATTGCTGCTGCTGGAATTTCTACATTTTTCACTATACCCAGGTTAATCCGTGCAA H S L L L L E F L H F S L Y P G - S V Q I H C C C W N F Y I F H Y T Q V N P C N A F I A A A G I S T F F T I P R L I R A . . . . . . 17550 TTCAAGGTGGAGATGGTGCTCCTGATTTAGGGGCGACTGCAGGAAATGCTGCTATAAATG F K V E M V L L I - G R L Q E M L L - M S R W R W C S - F R G D C R K C C Y K C I Q G G D G A P D L G A T A G N A A I N . : . . . . . 17490 TCGCTG : GTATTGCTGTTTTTGTGGCTTTGTTGTTTTGGGACAACAAAAAAGAGGAGGAAC S L : V L L F L W L C C F G T T K K R R N R W : Y C C F C G F V V L G Q Q K R G G T V A : G I A V F V A L L F W D N K K E E E . . . . . . 17284 AGCTTGCACAAATATTGCGTGATGAAACACTATCAAGGCTGCCTTTGCGCCTTTCAACTG S L H K Y C V M K H Y Q G C L C A F Q L A C T N I A - - N T I K A A F A P F N - Q L A Q I L R D E T L S R L P L R L S T . . . . . . 17224 ATAGGATTGTTGAACTCGTCCAGCTAAGAGACACCGTGAGGCCTGTTAGTGAGACAATTT I G L L N S S S - E T P - G L L V R Q F - D C - T R P A K R H R E A C - - D N F D R I V E L V Q L R D T V R P V S E T I . . . . . . 17164 CCCCTTTACTCTTCCTGTCAATTTCTAGAAGTTGAGTGGAAAACGTCCGGAGTCCTTCAA P L Y S S C Q F L E V E W K T S G V L Q P F T L P V N F - K L S G K R P E S F N S P L L F L S I S R S - V E N V R S P S . . . . . . 17104 TTTTACCTTGAAGAATGAGATGTCCTGTATGAGATGTCCTGTCACTAGTCTAGTTCTCGT F Y L E E - D V L Y E M S C H - S S S R F T L K N E M S C M R C P V T S L V L V I L P - R M R C P V - D V L S L V - F S . . . . . . 17044 TGCCATGTATCCTGCTTAACAGATCAAAAGATATTGTTTACCTCCTTCGTTTGAAGAATC C H V S C L T D Q K I L F T S F V - R I A M Y P A - Q I K R Y C L P P S F E E S L P C I L L N R S K D I V Y L L R L K N . . . . . . 16984 CATTTCCATCACATTTACATTGTCGGAGGATAATTAAAGAGAAAAAGTCAGTAACATATT H F H H I Y I V G G - L K R K S Q - H I I S I T F T L S E D N - R E K V S N I F P F P S H L H C R R I I K E K K S V T Y . . . . . . 16924 TTGCTAGGTTTGCAACAAATGTCTTTGTTTCATGAGCAAAAAGCTATCCCATTACGTATT L L G L Q Q M S L F H E Q K A I P L R I C - V C N K C L C F M S K K L S H Y V F F A R F A T N V F V S - A K S Y P I T Y . . . . . . 16864 TGTTCTCTCTGTCTCTCTTTTTCCTTATTAAAAATTCCGGACTTGAACATAT C S L C L S F S L L K I P D L N I V L S V S L F P Y - K F R T - T Y L F S L S L F F L I K N S G L E H Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-2-_PGL-4_AGS-2_PPS_1 (20301 20050,18482 18379,18021 17886,17716 17485,17338 17130) (frame '0'; 930 bp, 310 residues) 1 DMAITTLPSS HHLHFCFFSS SSSKFSITTQ LCLNFPPKNR ISLVICSSST SPPPPSSSSP 61 QVSEPTPTAE SCVNAGLDLF SKGRVKDALV LFDTALTLNP NPEEAQAAFY NKACCHAYRG 121 EGKKAAECLR TALKEYNLKF GTILNDPDLA SFRALPEFKE LQEEARLGGE DIGYSFRRDL 181 KLISEVQAPF RGVRRFFYVA FIAAAGISTF FTIPRLIRAI QGGDGAPDLG ATAGNAAINV 241 AGIAVFVALL FWDNKKEEEQ LAQILRDETL SRLPLRLSTD RIVELVQLRD TVRPVSETIS 301 PLLFLSISRS - AGS-3 (20304 19276) SCR (e 0.976) Exon 1 20304 19276 (1029 n); score: 0.976 PGS (20304 19276) SGN-U338003+ 3-phase translation of AGS-3 (-strand): . . . . . . 20304 CTTGACATGGCCATTACAACTCTTCCCTCCTCTCATCACCTCCATTTCTGCTTCTTCTCT L D M A I T T L P S S H H L H F C F F S L T W P L Q L F P P L I T S I S A S S L - H G H Y N S S L L S S P P F L L L L . . . . . . 20244 TCATCATCTTCCAAATTCTCAATCACTACACAATTATGCCTAAATTTTCCCCCGAAGAAT S S S S K F S I T T Q L C L N F P P K N H H L P N S Q S L H N Y A - I F P R R I F I I F Q I L N H Y T I M P K F S P E E . . . . . . 20184 CGTATTTCACTCGTCATTTGTTCTTCTTCTACTTCGCCCCCTCCGCCATCATCGTCGTCG R I S L V I C S S S T S P P P P S S S S V F H S S F V L L L L R P L R H H R R R S Y F T R H L F F F Y F A P S A I I V V . . . . . . 20124 CCCCAAGTCAGTGAACCCACTCCCACTGCTGAGTCCTGTGTCAATGCCGGCCTCGACCTC P Q V S E P T P T A E S C V N A G L D L P K S V N P L P L L S P V S M P A S T S A P S Q - T H S H C - V L C Q C R P R P . . . . . . 20064 TTCTCTAAAGGACGGGTATTATCTTAATTCTGTTTCAATTTCATGGCAGAAGAAATTCCT F S K G R V L S - F C F N F M A E E I P S L K D G Y Y L N S V S I S W Q K K F L L L - R T G I I L I L F Q F H G R R N S . . . . . . 20004 TAATTATTATGTGATTGCATTGTTTTTCAGGTCTATTATAAACAATGATAAGGTCTGCAT - L L C D C I V F Q V Y Y K Q - - G L H N Y Y V I A L F F R S I I N N D K V C I L I I M - L H C F S G L L - T M I R S A . . . . . . 19944 ATACATTATGCTTTATGAAACTCCTCAGTATTGTTGTTGTATTGTTATTTTTGTTGATTT I H Y A L - N S S V L L L Y C Y F C - F Y I M L Y E T P Q Y C C C I V I F V D L Y T L C F M K L L S I V V V L L F L L I . . . . . . 19884 AGACAGAGAGTGTGTCGATTGAGGTTCGTGACCATAAAATTTTTGTTTATTTATATATAT R Q R V C R L R F V T I K F L F I Y I Y D R E C V D - G S - P - N F C L F I Y I - T E S V S I E V R D H K I F V Y L Y I . . . . . . 19824 TTATTTTTTTAACAAAGATATACTTCAATTGTAGCTACGCCACTAGTTGTAGATATCAAA L F F - Q R Y T S I V A T P L V V D I K Y F F N K D I L Q L - L R H - L - I S N F I F L T K I Y F N C S Y A T S C R Y Q . . . . . . 19764 TAAGTTAAGTAAGTAAATAAAAGGGATTCGTTTTAATTTATGCAATTTAATTTGATAAAT - V K - V N K R D S F - F M Q F N L I N K L S K - I K G I R F N L C N L I - - M I S - V S K - K G F V L I Y A I - F D K . . . . . . 19704 GACAATAGAGGTTTTGATTTCGAGTTTTAATATCCCCATCATTTGATCATTTGCAGTCAT D N R G F D F E F - Y P H H L I I C S H T I E V L I S S F N I P I I - S F A V I - Q - R F - F R V L I S P S F D H L Q S . . . . . . 19644 ATTATTCCCTTCTATTTGTTACCTAATTTTCCTCAAGTGTTCTCCAAGAGTTTGTTGTTG I I P F Y L L P N F P Q V F S K S L L L L F P S I C Y L I F L K C S P R V C C W Y Y S L L F V T - F S S S V L Q E F V V . . . . . . 19584 GGTATTTGACCTGAGTTTCTTAGCATAAATAGGTTATAAAGTTTCCGTTTTTGAGTAGTC G I - P E F L S I N R L - S F R F - V V V F D L S F L A - I G Y K V S V F E - S G Y L T - V S - H K - V I K F P F L S S . . . . . . 19524 ATTTTCTTGTAGAAAAAAGTTTAGTACTCTATAAATTGAGGCTCGTTCCTTCTAACTTAA I F L - K K V - Y S I N - G S F L L T - F S C R K K F S T L - I E A R S F - L N H F L V E K S L V L Y K L R L V P S N L . . . . . . 19464 TCAACATTCACAATGTAGTCATAGGGGGCTTTGAGAGTTTGGTTAGGGCAAGTGTTCTCC S T F T M - S - G A L R V W L G Q V F S Q H S Q C S H R G L - E F G - G K C S P I N I H N V V I G G F E S L V R A S V L . . . . . . 19404 AAGAATCTGCAACTTGAGCAAGTTTACTAGTTTGACATGTCTATTCAAAATTTTCCTGTT K N L Q L E Q V Y - F D M S I Q N F P V R I C N L S K F T S L T C L F K I F L F Q E S A T - A S L L V - H V Y S K F S C . . . . . . 19344 TTGAGTAAATCTTATGGTAACAGTTGTACTCAGAGGTGTATCTAGGATTTTTAGTTTATG L S K S Y G N S C T Q R C I - D F - F M - V N L M V T V V L R G V S R I F S L - F E - I L W - Q L Y S E V Y L G F L V Y . 19284 AGTTTTGAA S F E V L E F - Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-2-_PGL-4_AGS-3_PPS_1 (20201 19863) (frame '2'; 336 bp, 112 residues) 1 IFPRRIVFHS SFVLLLLRPL RHHRRRPKSV NPLPLLSPVS MPASTSSLKD GYYLNSVSIS 61 WQKKFLNYYV IALFFRSIIN NDKVCIYIML YETPQYCCCI VIFVDLDREC VD- 3-phase translation of AGS-3 (+strand): . . . . . . 19276 TTCAAAACTCATAAACTAAAAATCCTAGATACACCTCTGAGTACAACTGTTACCATAAGA F K T H K L K I L D T P L S T T V T I R S K L I N - K S - I H L - V Q L L P - D Q N S - T K N P R Y T S E Y N C Y H K . . . . . . 19336 TTTACTCAAAACAGGAAAATTTTGAATAGACATGTCAAACTAGTAAACTTGCTCAAGTTG F T Q N R K I L N R H V K L V N L L K L L L K T G K F - I D M S N - - T C S S C I Y S K Q E N F E - T C Q T S K L A Q V . . . . . . 19396 CAGATTCTTGGAGAACACTTGCCCTAACCAAACTCTCAAAGCCCCCTATGACTACATTGT Q I L G E H L P - P N S Q S P L - L H C R F L E N T C P N Q T L K A P Y D Y I V A D S W R T L A L T K L S K P P M T T L . . . . . . 19456 GAATGTTGATTAAGTTAGAAGGAACGAGCCTCAATTTATAGAGTACTAAACTTTTTTCTA E C - L S - K E R A S I Y R V L N F F L N V D - V R R N E P Q F I E Y - T F F Y - M L I K L E G T S L N L - S T K L F S . . . . . . 19516 CAAGAAAATGACTACTCAAAAACGGAAACTTTATAACCTATTTATGCTAAGAAACTCAGG Q E N D Y S K T E T L - P I Y A K K L R K K M T T Q K R K L Y N L F M L R N S G T R K - L L K N G N F I T Y L C - E T Q . . . . . . 19576 TCAAATACCCAACAACAAACTCTTGGAGAACACTTGAGGAAAATTAGGTAACAAATAGAA S N T Q Q Q T L G E H L R K I R - Q I E Q I P N N K L L E N T - G K L G N K - K V K Y P T T N S W R T L E E N - V T N R . . . . . . 19636 GGGAATAATATGACTGCAAATGATCAAATGATGGGGATATTAAAACTCGAAATCAAAACC G N N M T A N D Q M M G I L K L E I K T G I I - L Q M I K - W G Y - N S K S K P R E - Y D C K - S N D G D I K T R N Q N . . . . . . 19696 TCTATTGTCATTTATCAAATTAAATTGCATAAATTAAAACGAATCCCTTTTATTTACTTA S I V I Y Q I K L H K L K R I P F I Y L L L S F I K L N C I N - N E S L L F T Y L Y C H L S N - I A - I K T N P F Y L L . . . . . . 19756 CTTAACTTATTTGATATCTACAACTAGTGGCGTAGCTACAATTGAAGTATATCTTTGTTA L N L F D I Y N - W R S Y N - S I S L L L T Y L I S T T S G V A T I E V Y L C - T - L I - Y L Q L V A - L Q L K Y I F V . . . . . . 19816 AAAAAATAAATATATATAAATAAACAAAAATTTTATGGTCACGAACCTCAATCGACACAC K K - I Y I N K Q K F Y G H E P Q S T H K N K Y I - I N K N F M V T N L N R H T K K I N I Y K - T K I L W S R T S I D T . . . . . . 19876 TCTCTGTCTAAATCAACAAAAATAACAATACAACAACAATACTGAGGAGTTTCATAAAGC S L S K S T K I T I Q Q Q Y - G V S - S L C L N Q Q K - Q Y N N N T E E F H K A L S V - I N K N N N T T T I L R S F I K . . . . . . 19936 ATAATGTATATGCAGACCTTATCATTGTTTATAATAGACCTGAAAAACAATGCAATCACA I M Y M Q T L S L F I I D L K N N A I T - C I C R P Y H C L - - T - K T M Q S H H N V Y A D L I I V Y N R P E K Q C N H . . . . . . 19996 TAATAATTAAGGAATTTCTTCTGCCATGAAATTGAAACAGAATTAAGATAATACCCGTCC - - L R N F F C H E I E T E L R - Y P S N N - G I S S A M K L K Q N - D N T R P I I I K E F L L P - N - N R I K I I P V . . . . . . 20056 TTTAGAGAAGAGGTCGAGGCCGGCATTGACACAGGACTCAGCAGTGGGAGTGGGTTCACT F R E E V E A G I D T G L S S G S G F T L E K R S R P A L T Q D S A V G V G S L L - R R G R G R H - H R T Q Q W E W V H . . . . . . 20116 GACTTGGGGCGACGACGATGATGGCGGAGGGGGCGAAGTAGAAGAAGAACAAATGACGAG D L G R R R - W R R G R S R R R T N D E T W G D D D D G G G G E V E E E Q M T S - L G A T T M M A E G A K - K K N K - R . . . . . . 20176 TGAAATACGATTCTTCGGGGGAAAATTTAGGCATAATTGTGTAGTGATTGAGAATTTGGA - N T I L R G K I - A - L C S D - E F G E I R F F G G K F R H N C V V I E N L E V K Y D S S G E N L G I I V - - L R I W . . . . . . 20236 AGATGATGAAGAGAAGAAGCAGAAATGGAGGTGATGAGAGGAGGGAAGAGTTGTAATGGC R - - R E E A E M E V M R G G K S C N G D D E E K K Q K W R - - E E G R V V M A K M M K R R S R N G G D E R R E E L - W . 20296 CATGTCAAG H V K M S P C Q Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-2+_PGL-4_AGS-3_PPS_1 (20042 20269) (frame '2'; 225 bp, 75 residues) 1 DNTRPLEKRS RPALTQDSAV GVGSLTWGDD DDGGGGEVEE EQMTSEIRFF GGKFRHNCVV 61 IENLEDDEEK KQKWR- PGL 5 (+ strand): 24375 30980 AGS-1 (24375 24412,24590 24636,27191 27275,30524 30613,30741 30980) SCR (e 1.000 d 0.978 a 0.916,e 1.000 d 0.987 a 0.998,e 1.000 d 0.656 a 0.996,e 1.000 d 0.940 a 0.995,e 1.000) Exon 1 24375 24412 ( 38 n); score: 1.000 Intron 1 24413 24589 ( 177 n); Pd: 0.978 Pa: 0.916 Exon 2 24590 24636 ( 47 n); score: 1.000 Intron 2 24637 27190 (2554 n); Pd: 0.987 Pa: 0.998 Exon 3 27191 27275 ( 85 n); score: 1.000 Intron 3 27276 30523 (3248 n); Pd: 0.656 Pa: 0.996 Exon 4 30524 30613 ( 90 n); score: 1.000 Intron 4 30614 30740 ( 127 n); Pd: 0.940 Pa: 0.995 Exon 5 30741 30980 ( 240 n); score: 1.000 PGS (24375 24412,24590 24636,27191 27275,30524 30613,30741 30980) SGN-U316772+ PGS (27191 27275,30524 30613,30741 30832) SGN-U316773+ 3-phase translation of AGS-1 (+strand): . . . . : . . 24375 TCCTCTTCGCCATCCGCAGCTCATTCTAGAACAGTCAG : GTGATTTTGCAGAATCGAAGTC S S S P S A A H S R T V R : - F C R I E V P L R H P Q L I L E Q S : G D F A E S K S L F A I R S S F - N S Q : V I L Q N R S . . . : . . . 24612 TTCGTCTCGAATATCTCTTTGCCAT : GTCGGACGAGGAAGTTGTTGACCCAAAGGCGACAT F V S N I S L P : C R T R K L L T Q R R H S S R I S L C H : V G R G S C - P K G D I L R L E Y L F A M : S D E E V V D P K A T . . . . . : . 27226 TAGAAGTAAGTTGCAAGCCTAAGTGTGTAAGGCAACTAAAGGAGTATCAG : GCATGTACTA - K - V A S L S V - G N - R S I R : H V L R S K L Q A - V C K A T K G V S : G M Y - L E V S C K P K C V R Q L K E Y Q : A C T . . . . . . 30534 AAAGGATAGAAGGTGATGAATCAGGGCACAAACATTGCACTGGACAGTATTTTGATTATT K G - K V M N Q G T N I A L D S I L I I K D R R - - I R A Q T L H W T V F - L L K R I E G D E S G H K H C T G Q Y F D Y . . : . . . . 30594 GGCACTGCATCGACAAATGT : GTTGCTGCGAAGTTGTTTGACCATCTCAAGTAACAAGGAT G T A S T N V : L L R S C L T I S S N K D A L H R Q M : C C C E V V - P S Q V T R I W H C I D K C : V A A K L F D H L K - Q G . . . . . . 30781 ATAAGTTGTTGATCCCTTGCAATTTATCTTCTTTTTGGTTGTTGAACAAGTCATTACCAT I S C - S L A I Y L L F G C - T S H Y H - V V D P L Q F I F F L V V E Q V I T I Y K L L I P C N L S S F W L L N K S L P . . . . . . 30841 ATTATTCCTCACTGTGCTGAAGACTTGTAACCCTTTCAATCAACTTGGTTGCTGCATGGA I I P H C A E D L - P F Q S T W L L H G L F L T V L K T C N P F N Q L G C C M E Y Y S S L C - R L V T L S I N L V A A W . . . . . . 30901 AAATTTTGAACTATGCACATCTTAAAAAGTGATTAATAAATCATACTCGTGGGTTGAATT K F - T M H I L K S D - - I I L V G - I N F E L C T S - K V I N K S Y S W V E L K I L N Y A H L K K - L I N H T R G L N . . 30961 GGACCCTTTTATTCGTTCGA G P F Y S F D P F I R S W T L L F V R Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-2+_PGL-5_AGS-1_PPS_1 (24404 24412,24590 24636,27191 27275,30524 30613,30741 30773) (frame '0'; 261 bp, 87 residues) 1 NSQVILQNRS LRLEYLFAMS DEEVVDPKAT LEVSCKPKCV RQLKEYQACT KRIEGDESGH 61 KHCTGQYFDY WHCIDKCVAA KLFDHLK- AGS-2 (29063 30025) SCR (e 0.942) Exon 1 29063 30025 ( 963 n); score: 0.942 PGS (29063 30025) SGN-U345971+ 3-phase translation of AGS-2 (+strand): . . . . . . 29063 AACTCAGAGTGGCACAAAAATGTCTTGCATTATGTTATGTTATGTTAGACCTGAAAATTT N S E W H K N V L H Y V M L C - T - K F T Q S G T K M S C I M L C Y V R P E N F L R V A Q K C L A L C Y V M L D L K I . . . . . . 29123 TATATTTGGAAAATATCAAATATTATGAACATAATATGTACTGATTCCAGGACCTACTAC Y I W K I S N I M N I I C T D S R T Y Y I F G K Y Q I L - T - Y V L I P G P T T L Y L E N I K Y Y E H N M Y - F Q D L L . . . . . . 29183 TCTAAGTATGTACTGATTCCAGGATCTACTTCAAATCAATTAAAATAGCAGGTCAAGAGA S K Y V L I P G S T S N Q L K - Q V K R L S M Y - F Q D L L Q I N - N S R S R E L - V C T D S R I Y F K S I K I A G Q E . . . . . . 29243 ATTTTCTTTGCATGTTACTCTGATGGGATGAACCAATATAGGGACTTCATCTGTGATTAT I F F A C Y S D G M N Q Y R D F I C D Y F S L H V T L M G - T N I G T S S V I I N F L C M L L - W D E P I - G L H L - L . . . . . . 29303 TTGGTCTCTCTAAGAAGCCCCCCAAAAAAACATGTTTCTGGCCTCCAAATACTTCTCGCT L V S L R S P P K K H V S G L Q I L L A W S L - E A P Q K N M F L A S K Y F S L F G L S K K P P K K T C F W P P N T S R . . . . . . 29363 CTTTAGATTAGGCACTCTAGATTTCAGCTGGTTAGCGGTTGCTGGTATTTGTATGATTTT L - I R H S R F Q L V S G C W Y L Y D F F R L G T L D F S W L A V A G I C M I F S L D - A L - I S A G - R L L V F V - F . . . . . . 29423 TTGTTTCTTTGTTTTGTGTGTAGCATGTTCTTAGATCCAAGAGGAATTGCAAGACATCAA L F L C F V C S M F L D P R G I A R H Q C F F V L C V A C S - I Q E E L Q D I N F V S L F C V - H V L R S K R N C K T S . . . . . . 29483 TCAGATTTTCTCTTCAACCTTGCCTTTTATTCTGTTTGATTGCTTTCTATGGGGTCCTTT S D F L F N L A F Y S V - L L S M G S F Q I F S S T L P F I L F D C F L W G P F I R F S L Q P C L L F C L I A F Y G V L . . . . . . 29543 TGTTGTTTGTATCGCCAACAGTTTGCTGTAAGAGTTTTTGATGATGAGGAGTGTTACTTC C C L Y R Q Q F A V R V F D D E E C Y F V V C I A N S L L - E F L M M R S V T S L L F V S P T V C C K S F - - - G V L L . . . . . . 29603 AAGGTTTTCTTTCTCCAAGACTCTGATTTCTAATATAATTTTGATGCCATTAAACGTTTT K V F F L Q D S D F - Y N F D A I K R F R F S F S K T L I S N I I L M P L N V F Q G F L S P R L - F L I - F - C H - T F . . . . . . 29663 CCAATTTATTTTTTATTTTTCTACTTGTTCTTCATATACTACCTCTGCAAACATCTTTGT P I Y F L F F Y L F F I Y Y L C K H L C Q F I F Y F S T C S S Y T T S A N I F V S N L F F I F L L V L H I L P L Q T S L . . . . . . 29723 AACCAACTTAATTATTTGTGCAACGAATGTGCTTTGGTTGGGGAGTCTGAGTTTCTAACT N Q L N Y L C N E C A L V G E S E F L T T N L I I C A T N V L W L G S L S F - L - P T - L F V Q R M C F G W G V - V S N . . . . . . 29783 TTTGGAAGGATTTTGGTCTCTCTTTCTGCGAGAAGTGAAAGATTTTTTTTTTTGGAAGGA F G R I L V S L S A R S E R F F F L E G L E G F W S L F L R E V K D F F F W K D F W K D F G L S F C E K - K I F F F G R . . . . . . 29843 TTCCAGAACAATGATGTCTCGTTTAGTTGTGCTAACTGTGCACCTCATATAACAATACAG F Q N N D V S F S C A N C A P H I T I Q S R T M M S R L V V L T V H L I - Q Y S I P E Q - C L V - L C - L C T S Y N N T . . . . . . 29903 TAACATATTTGATCTTCAGAACAATTGCGCATGATCTGTTTTTTTTTTTAATTTTAACTA - H I - S S E Q L R M I C F F F - F - L N I F D L Q N N C A - S V F F F N F N - V T Y L I F R T I A H D L F F F L I L T . . . . . . 29963 ATGGATGTGGAAGTATTGCACCTTAGAATGAAGGTGATATTCCTTCACTAACCTCCCTGG M D V E V L H L R M K V I F L H - P P W W M W K Y C T L E - R - Y S F T N L P G N G C G S I A P - N E G D I P S L T S L . 30023 GAA E G Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-2+_PGL-5_AGS-2_PPS_1 (29636 29905) (frame '1'; 267 bp, 89 residues) 1 YNFDAIKRFP IYFLFFYLFF IYYLCKHLCN QLNYLCNECA LVGESEFLTF GRILVSLSAR 61 SERFFFLEGF QNNDVSFSCA NCAPHITIQ- 3-phase translation of AGS-2 (-strand): . . . . . . 30025 TTCCCAGGGAGGTTAGTGAAGGAATATCACCTTCATTCTAAGGTGCAATACTTCCACATC F P G R L V K E Y H L H S K V Q Y F H I S Q G G - - R N I T F I L R C N T S T S P R E V S E G I S P S F - G A I L P H . . . . . . 29965 CATTAGTTAAAATTAAAAAAAAAAACAGATCATGCGCAATTGTTCTGAAGATCAAATATG H - L K L K K K T D H A Q L F - R S N M I S - N - K K K Q I M R N C S E D Q I C P L V K I K K K N R S C A I V L K I K Y . . . . . . 29905 TTACTGTATTGTTATATGAGGTGCACAGTTAGCACAACTAAACGAGACATCATTGTTCTG L L Y C Y M R C T V S T T K R D I I V L Y C I V I - G A Q L A Q L N E T S L F W V T V L L Y E V H S - H N - T R H H C S . . . . . . 29845 GAATCCTTCCAAAAAAAAAAATCTTTCACTTCTCGCAGAAAGAGAGACCAAAATCCTTCC E S F Q K K K S F T S R R K R D Q N P S N P S K K K N L S L L A E R E T K I L P G I L P K K K I F H F S Q K E R P K S F . . . . . . 29785 AAAAGTTAGAAACTCAGACTCCCCAACCAAAGCACATTCGTTGCACAAATAATTAAGTTG K S - K L R L P N Q S T F V A Q I I K L K V R N S D S P T K A H S L H K - L S W Q K L E T Q T P Q P K H I R C T N N - V . . . . . . 29725 GTTACAAAGATGTTTGCAGAGGTAGTATATGAAGAACAAGTAGAAAAATAAAAAATAAAT V T K M F A E V V Y E E Q V E K - K I N L Q R C L Q R - Y M K N K - K N K K - I G Y K D V C R G S I - R T S R K I K N K . . . . . . 29665 TGGAAAACGTTTAATGGCATCAAAATTATATTAGAAATCAGAGTCTTGGAGAAAGAAAAC W K T F N G I K I I L E I R V L E K E N G K R L M A S K L Y - K S E S W R K K T L E N V - W H Q N Y I R N Q S L G E R K . . . . . . 29605 CTTGAAGTAACACTCCTCATCATCAAAAACTCTTACAGCAAACTGTTGGCGATACAAACA L E V T L L I I K N S Y S K L L A I Q T L K - H S S S S K T L T A N C W R Y K Q P - S N T P H H Q K L L Q Q T V G D T N . . . . . . 29545 ACAAAAGGACCCCATAGAAAGCAATCAAACAGAATAAAAGGCAAGGTTGAAGAGAAAATC T K G P H R K Q S N R I K G K V E E K I Q K D P I E S N Q T E - K A R L K R K S N K R T P - K A I K Q N K R Q G - R E N . . . . . . 29485 TGATTGATGTCTTGCAATTCCTCTTGGATCTAAGAACATGCTACACACAAAACAAAGAAA - L M S C N S S W I - E H A T H K T K K D - C L A I P L G S K N M L H T K Q R N L I D V L Q F L L D L R T C Y T Q N K E . . . . . . 29425 CAAAAAATCATACAAATACCAGCAACCGCTAACCAGCTGAAATCTAGAGTGCCTAATCTA Q K I I Q I P A T A N Q L K S R V P N L K K S Y K Y Q Q P L T S - N L E C L I - T K N H T N T S N R - P A E I - S A - S . . . . . . 29365 AAGAGCGAGAAGTATTTGGAGGCCAGAAACATGTTTTTTTGGGGGGCTTCTTAGAGAGAC K S E K Y L E A R N M F F W G A S - R D R A R S I W R P E T C F F G G L L R E T K E R E V F G G Q K H V F L G G F L E R . . . . . . 29305 CAAATAATCACAGATGAAGTCCCTATATTGGTTCATCCCATCAGAGTAACATGCAAAGAA Q I I T D E V P I L V H P I R V T C K E K - S Q M K S L Y W F I P S E - H A K K P N N H R - S P Y I G S S H Q S N M Q R . . . . . . 29245 AATTCTCTTGACCTGCTATTTTAATTGATTTGAAGTAGATCCTGGAATCAGTACATACTT N S L D L L F - L I - S R S W N Q Y I L I L L T C Y F N - F E V D P G I S T Y L K F S - P A I L I D L K - I L E S V H T . . . . . . 29185 AGAGTAGTAGGTCCTGGAATCAGTACATATTATGTTCATAATATTTGATATTTTCCAAAT R V V G P G I S T Y Y V H N I - Y F P N E - - V L E S V H I M F I I F D I F Q I - S S R S W N Q Y I L C S - Y L I F S K . . . . . . 29125 ATAAAATTTTCAGGTCTAACATAACATAACATAATGCAAGACATTTTTGTGCCACTCTGA I K F S G L T - H N I M Q D I F V P L - - N F Q V - H N I T - C K T F L C H S E Y K I F R S N I T - H N A R H F C A T L . 29065 GTT V S Maximal non-overlapping open reading frames (>= 64 codons): none PGL 6 (+ strand): 33510 37610 AGS-1 (33510 33558,33800 33846,35978 36062,37093 37182,37267 37610) SCR (e 1.000 d 0.771 a 0.950,e 1.000 d 0.956 a 0.999,e 1.000 d 0.828 a 0.999,e 1.000 d 0.994 a 0.998,e 0.994) Exon 1 33510 33558 ( 49 n); score: 1.000 Intron 1 33559 33799 ( 241 n); Pd: 0.771 Pa: 0.950 Exon 2 33800 33846 ( 47 n); score: 1.000 Intron 2 33847 35977 (2131 n); Pd: 0.956 Pa: 0.999 Exon 3 35978 36062 ( 85 n); score: 1.000 Intron 3 36063 37092 (1030 n); Pd: 0.828 Pa: 0.999 Exon 4 37093 37182 ( 90 n); score: 1.000 Intron 4 37183 37266 ( 84 n); Pd: 0.994 Pa: 0.998 Exon 5 37267 37610 ( 344 n); score: 0.994 PGS (33510 33558,33800 33846,35978 36062,37093 37182,37267 37610) SGN-U316773+ PGS (35978 36062,37093 37182,37267 37370) SGN-U316772+ 3-phase translation of AGS-1 (+strand): . . . . . : . 33510 CTCTTCTGTCTTCTCTTCACCATTCGCACCTCATTCAGAGAAAAGTCAG : GTTTTTTTGTA L F C L L F T I R T S F R E K S : G F F V S S V F S S P F A P H S E K S Q : V F L - L L S S L H H S H L I Q R K V R : F F C . . . . : . . 33811 GAATCAAAGTCTTCGTATCCAATATCTCGTTGCCAT : GTCGGACGAGGAAGTTGTTGACCC E S K S S Y P I S R C H : V G R G S C - P N Q S L R I Q Y L V A M : S D E E V V D P R I K V F V S N I S L P : C R T R K L L T . . . . . . 36002 AAAGGCGACAATGGAAGTATCTTGCAAGCCTAAGTGTGTAAGGCAACTAAAGGATTATCA K G D N G S I L Q A - V C K A T K G L S K A T M E V S C K P K C V R Q L K D Y Q Q R R Q W K Y L A S L S V - G N - R I I . : . . . . . 36062 G : GCATGTACTAGAAGGATAGAAGGTGATGAATCAGGGAGCAAGCATTGCACTGGACAGTA : G M Y - K D R R - - I R E Q A L H W T V : A C T R R I E G D E S G S K H C T G Q Y R : H V L E G - K V M N Q G A S I A L D S . . . . : . . 37152 TTTTGATTATTGGCAATGCATTGACAAATGT : GTTGCCCCAAAGCTATTTGAAAAACTCAA F - L L A M H - Q M : C C P K A I - K T Q F D Y W Q C I D K C : V A P K L F E K L K I L I I G N A L T N V : L P Q S Y L K N S . . . . . . 37296 GTAACATGGAGATAAGTGTTCATCCATTACGATTTTATCTTGCTTTTTCATTGTTGAAGC V T W R - V F I H Y D F I L L F H C - S - H G D K C S S I T I L S C F F I V E A S N M E I S V H P L R F Y L A F S L L K . . . . . . 37356 CTGTGCTACAGACTTGCAATCCCTTCAATCAACTTAGTTGCCGCATGGAAAATTTTGTAC L C Y R L A I P S I N L V A A W K I L Y C A T D L Q S L Q S T - L P H G K F C T P V L Q T C N P F N Q L S C R M E N F V . . . . . . 37416 TATGCACATTATTAAGAAGTGATTAATAAAGGATACTTATGGGTTAAATTTTGTGGACAC Y A H Y - E V I N K G Y L W V K F C G H M H I I K K - L I K D T Y G L N F V D T L C T L L R S D - - R I L M G - I L W T . . . . . . 37476 TTTTATTTGGTCAGATGATTCAAAATCCGGAGGACATTATTCCCAGATTTTTGTTCTCTC F Y L V R - F K I R R T L F P D F C S L F I W S D D S K S G G H Y S Q I F V L S L L F G Q M I Q N P E D I I P R F L F S . . . . . . 37536 TTGCGTTGTTATATGAGCTCCCAGTTACCTAATTTCTTTATGGGAGGAAATATCATAAAT L R C Y M S S Q L P N F F M G G N I I N C V V I - A P S Y L I S L W E E I S - I L A L L Y E L P V T - F L Y G R K Y H K . . 37596 CAGTGATTTATATAA Q - F I - S D L Y S V I Y I Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-2+_PGL-6_AGS-1_PPS_1 (37112 37182,37267 37441) (frame '0'; 243 bp, 81 residues) 1 KVMNQGASIA LDSILIIGNA LTNVLPQSYL KNSSNMEISV HPLRFYLAFS LLKPVLQTCN 61 PFNQLSCRME NFVLCTLLRS D- >C09HBa0099P03.1-2+_PGL-6_AGS-1_PPS_2 (33812 33846,35978 36062,37093 37182,37267 37299) (frame '2'; 240 bp, 80 residues) 1 NQSLRIQYLV AMSDEEVVDP KATMEVSCKP KCVRQLKDYQ ACTRRIEGDE SGSKHCTGQY 61 FDYWQCIDKC VAPKLFEKLK - PGL 7 (- strand): 38606 38192 AGS-1 (38606 38192) SCR (e 0.993) Exon 1 38606 38192 ( 415 n); score: 0.993 PGS (38606 38192) SGN-U342268+ 3-phase translation of AGS-1 (-strand): . . . . . . 38606 GGATGAAGCCTCACCATTTCTCTGAAGATGATAAATCTTTGATTCACAAATCAAAGAGAA G - S L T I S L K M I N L - F T N Q R E D E A S P F L - R - - I F D S Q I K E N M K P H H F S E D D K S L I H K S K R . . . . . . 38546 CAACCAGAGAAAAGCTCTCAAAATCAAAACAAAATTCCAAAGAAAATCAAAATCAAAGAA Q P E K S S Q N Q N K I P K K I K I K E N Q R K A L K I K T K F Q R K S K S K K T T R E K L S K S K Q N S K E N Q N Q R . . . . . . 38486 GAAAGAGTTCAAACATGGAAGATCCCATTAGAACTATCATGTTCTTGGGGTTTTGGACCC E R V Q T W K I P L E L S C S W G F G P K E F K H G R S H - N Y H V L G V L D P R K S S N M E D P I R T I M F L G F W T . . . . . . 38426 ATTCTTAAACATTAAAAGGGGATTTTCACACTGTATCGCCACCGTCCAAAATAATTATCT I L K H - K G I F T L Y R H R P K - L S F L N I K R G F S H C I A T V Q N N Y L H S - T L K G D F H T V S P P S K I I I . . . . . . 38366 ACGGATGTATAATATACGCAAAATTATACATCATTAGTGTATAATTTATGTATATTTATA T D V - Y T Q N Y T S L V Y N L C I F I R M Y N I R K I I H H - C I I Y V Y L - Y G C I I Y A K L Y I I S V - F M Y I Y . . . . . . 38306 AGTTATAATTATTTTCGGAGGACGGTGAAAACCAAAAAGAATCGTATTCCCTGTATTCAA S Y N Y F R R T V K T K K N R I P C I Q V I I I F G G R - K P K R I V F P V F K K L - L F S E D G E N Q K E S Y S L Y S . . . . . . 38246 AATATGTACTCCCATATCTATTTTGTACAGTAAAACTATATTTCTTAAAAACAGA N M Y S H I Y F V Q - N Y I S - K Q I C T P I S I L Y S K T I F L K N R K Y V L P Y L F C T V K L Y F L K T Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-1 (+strand): . . . . . . 38192 TCTGTTTTTAAGAAATATAGTTTTACTGTACAAAATAGATATGGGAGTACATATTTTGAA S V F K K Y S F T V Q N R Y G S T Y F E L F L R N I V L L Y K I D M G V H I L N C F - E I - F Y C T K - I W E Y I F - . . . . . . 38252 TACAGGGAATACGATTCTTTTTGGTTTTCACCGTCCTCCGAAAATAATTATAACTTATAA Y R E Y D S F W F S P S S E N N Y N L - T G N T I L F G F H R P P K I I I T Y K I Q G I R F F L V F T V L R K - L - L I . . . . . . 38312 ATATACATAAATTATACACTAATGATGTATAATTTTGCGTATATTATACATCCGTAGATA I Y I N Y T L M M Y N F A Y I I H P - I Y T - I I H - - C I I L R I L Y I R R - N I H K L Y T N D V - F C V Y Y T S V D . . . . . . 38372 ATTATTTTGGACGGTGGCGATACAGTGTGAAAATCCCCTTTTAATGTTTAAGAATGGGTC I I L D G G D T V - K S P F N V - E W V L F W T V A I Q C E N P L L M F K N G S N Y F G R W R Y S V K I P F - C L R M G . . . . . . 38432 CAAAACCCCAAGAACATGATAGTTCTAATGGGATCTTCCATGTTTGAACTCTTTCTTCTT Q N P K N M I V L M G S S M F E L F L L K T P R T - - F - W D L P C L N S F F F P K P Q E H D S S N G I F H V - T L S S . . . . . . 38492 TGATTTTGATTTTCTTTGGAATTTTGTTTTGATTTTGAGAGCTTTTCTCTGGTTGTTCTC - F - F S L E F C F D F E S F S L V V L D F D F L W N F V L I L R A F L W L F S L I L I F F G I L F - F - E L F S G C S . . . . . . 38552 TTTGATTTGTGAATCAAAGATTTATCATCTTCAGAGAAATGGTGAGGCTTCATCC F D L - I K D L S S S E K W - G F I L I C E S K I Y H L Q R N G E A S S L - F V N Q R F I I F R E M V R L H Maximal non-overlapping open reading frames (>= 64 codons): none PGL 8 (- strand): 48236 39655 AGS-1 (42056 41876,41284 41236,40317 39655) SCR (e 0.989 d 1.000 a 1.000,e 1.000 d 0.954 a 0.996,e 1.000) Exon 1 42056 41876 ( 181 n); score: 0.989 Intron 1 41875 41285 ( 591 n); Pd: 1.000 Pa: 1.000 Exon 2 41284 41236 ( 49 n); score: 1.000 Intron 2 41235 40318 ( 918 n); Pd: 0.954 Pa: 0.996 Exon 3 40317 39655 ( 663 n); score: 1.000 PGS (42056 41876,41284 41236,40317 39655) SGN-U320670+ 3-phase translation of AGS-1 (-strand): . . . . . . 42056 TCTATCATAATCCTAATTTATTATTCTTGAAATAATGGCAATCAAAGTCCATGGTATCCC S I I I L I Y Y S - N N G N Q S P W Y P L S - S - F I I L E I M A I K V H G I P Y H N P N L L F L K - W Q S K S M V S . . . . . . 41996 CTTGTCAACTGCAACCATGAGAGTTATTTCTTGCCTTATTGAGAAGGATTTGGATTTTGA L V N C N H E S Y F L P Y - E G F G F - L S T A T M R V I S C L I E K D L D F E P C Q L Q P - E L F L A L L R R I W I L . . . . . . 41936 GTTTGTCTTTGTTGATATGGCCAAAGAAGAACACAAGAGGCACCCTTTCCTCTCACTCAA V C L C - Y G Q R R T Q E A P F P L T Q F V F V D M A K E E H K R H P F L S L N S L S L L I W P K K N T R G T L S S H S . : . . . . : . 41876 T : CCTTTTGCTCAAGTACCAGCATTTGAAGATGGAGACTTGAAGCTCTTTG : AATCAAGGGC : S F C S S T S I - R W R L E A L - : I K G : P F A Q V P A F E D G D L K L F : E S R A I : L L L K Y Q H L K M E T - S S L : N Q G . . . . . . 40307 AATCACTCAATACATTGCTCAGGTTTATGCTAGCAATGGCATTCAACTAATACTCCAAGA N H S I H C S G L C - Q W H S T N T P R I T Q Y I A Q V Y A S N G I Q L I L Q D Q S L N T L L R F M L A M A F N - Y S K . . . . . . 40247 TCCAATGAAAATGGCCATTATGTCAGTATGGATGGAAGTAGAAGGCCAAAAATTTGAACC S N E N G H Y V S M D G S R R P K I - T P M K M A I M S V W M E V E G Q K F E P I Q - K W P L C Q Y G W K - K A K N L N . . . . . . 40187 ACCAGCTTCAAAATTAACATGGGAGCTAGTCATAAAACCAATGATTGGCTTGGGCAGTAC T S F K I N M G A S H K T N D W L G Q Y P A S K L T W E L V I K P M I G L G S T H Q L Q N - H G S - S - N Q - L A W A V . . . . . . 40127 CGATGATGTTATTGTGAAGGAAAGTGAAGAACAATTGTCTAAGGTTCTTGACATCTACGA R - C Y C E G K - R T I V - G S - H L R D D V I V K E S E E Q L S K V L D I Y E P M M L L - R K V K N N C L R F L T S T . . . . . . 40067 AACTCGATTGACAGAGTCAAAATACTTGGGTGGCGACTCCTTTACACTTGTTGATTTGCA N S I D R V K I L G W R L L Y T C - F A T R L T E S K Y L G G D S F T L V D L H K L D - Q S Q N T W V A T P L H L L I C . . . . . . 40007 TCATATACCAAATATATACCATCTGATGAATACAAAAGCTAAGGCACTGTTTGATTCGCG S Y T K Y I P S D E Y K S - G T V - F A H I P N I Y H L M N T K A K A L F D S R I I Y Q I Y T I - - I Q K L R H C L I R . . . . . . 39947 CCCTCGTGTGAGTGTATGGTGTGCTGATATATTGGCTAGGCCAGCTTGGGTGAAGGGGTT P S C E C M V C - Y I G - A S L G E G V P R V S V W C A D I L A R P A W V K G L A L V - V Y G V L I Y W L G Q L G - R G . . . . . . 39887 GGAGAAGATGCAAAAATGAAAAAAAGTCGTGAATTAATGGATGATCATAATTCATATATA G E D A K M K K S R E L M D D H N S Y I E K M Q K - K K V V N - W M I I I H I Y W R R C K N E K K S - I N G - S - F I Y . . . . . . 39827 TGTTTTTGTTTTGAAGCATTTGTGTCTTAATATGTTGTGTTTCTTGTCTGAAGATGTTTG C F C F E A F V S - Y V V F L V - R C L V F V L K H L C L N M L C F L S E D V C M F L F - S I C V L I C C V S C L K M F . . . . . . 39767 TCTTGCAATACAATAAACAGTGATCTATATCTATGTGATTTTACTAATTGTACTGATGTA S C N T I N S D L Y L C D F T N C T D V L A I Q - T V I Y I Y V I L L I V L M - V L Q Y N K Q - S I S M - F Y - L Y - C . . . . . . 39707 AAATATGCTATGTTCCGGTCATTTATAAAATAATTGCGCGCTATATTTTTGTG K Y A M F R S F I K - L R A I F L N M L C S G H L - N N C A L Y F C K I C Y V P V I Y K I I A R Y I F V Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-2-_PGL-8_AGS-1_PPS_1 (42040 41876,41284 41236,40317 39869) (frame '2'; 660 bp, 220 residues) 1 FIILEIMAIK VHGIPLSTAT MRVISCLIEK DLDFEFVFVD MAKEEHKRHP FLSLNPFAQV 61 PAFEDGDLKL FESRAITQYI AQVYASNGIQ LILQDPMKMA IMSVWMEVEG QKFEPPASKL 121 TWELVIKPMI GLGSTDDVIV KESEEQLSKV LDIYETRLTE SKYLGGDSFT LVDLHHIPNI 181 YHLMNTKAKA LFDSRPRVSV WCADILARPA WVKGLEKMQK - AGS-2 (42178 41361) SCR (e 0.979) Exon 1 42178 41361 ( 818 n); score: 0.979 PGS (42178 41361) SGN-U337183+ 3-phase translation of AGS-2 (-strand): . . . . . . 42178 AAAAAAGGTTTGCTTAGGATTCAAGACTTTTTGGATATATTGATTTTTTTAAGAATGATA K K G L L R I Q D F L D I L I F L R M I K K V C L G F K T F W I Y - F F - E - Y K R F A - D S R L F G Y I D F F K N D . . . . . . 42118 TCAAGAAAATTTTACCCTATATATACCCCTCCATATAACTTCATTTCATCAACTTGGAGC S R K F Y P I Y T P P Y N F I S S T W S Q E N F T L Y I P L H I T S F H Q L G A I K K I L P Y I Y P S I - L H F I N L E . . . . . . 42058 AATCTATCATAATCCTAATTTATTATTCTTGAAATAATGGCAATCAAAGTCCATGGTATC N L S - S - F I I L E I M A I K V H G I I Y H N P N L L F L K - W Q S K S M V S Q S I I I L I Y Y S - N N G N Q S P W Y . . . . . . 41998 CCCTTGTCAACTGCAACCATGAGAGTTATTTCTTGCCTTATTGAGAAGGATTTGGATTTT P L S T A T M R V I S C L I E K D L D F P C Q L Q P - E L F L A L L R R I W I L P L V N C N H E S Y F L P Y - E G F G F . . . . . . 41938 GAGTTTGTCTTTGTTGATATGGCCAAAGAAGAACACAAGAGGCACCCTTTCCTCTCACTC E F V F V D M A K E E H K R H P F L S L S L S L L I W P K K N T R G T L S S H S - V C L C - Y G Q R R T Q E A P F P L T . . . . . . 41878 AATGTAAGCATAAATAATTACTCCCTCTGTACAGCTACATAAATATTTAAGAATTGTTTG N V S I N N Y S L C T A T - I F K N C L M - A - I I T P S V Q L H K Y L R I V - Q C K H K - L L P L Y S Y I N I - E L F . . . . . . 41818 ACCATAAGTTTCAAAAGTTCTTTTCTTTAAACATGGTGTCAAGTCAAATGGTGTCAAATA T I S F K S S F L - T W C Q V K W C Q I P - V S K V L F F K H G V K S N G V K - D H K F Q K F F S L N M V S S Q M V S N . . . . . . 41758 AAATGGGACGGTTGAAGTACATTATTAGCATGTTTGATCAAGTTTTGGAGAAGCTAAAAG K W D G - S T L L A C L I K F W R S - K N G T V E V H Y - H V - S S F G E A K S K M G R L K Y I I S M F D Q V L E K L K . . . . . . 41698 TATTTCTTTTTAAATGTTTATTTTAGAAATTTGAGATGTTCAGTGTTTTTTAGTAGCAGC Y F F L N V Y F R N L R C S V F F S S S I S F - M F I L E I - D V Q C F L V A A V F L F K C L F - K F E M F S V F - - Q . . . . . . 41638 ATAAACTGAACTAAAAACACTTTTTTGAAACTTTGGTCAAACACAAATGTTGCAAAAACA I N - T K N T F L K L W S N T N V A K T - T E L K T L F - N F G Q T Q M L Q K H H K L N - K H F F E T L V K H K C C K N . . . . . . 41578 TTTGTCATATTGATGGCAAATACAAATTGTCATCGTCCAAAATACTCTTTAAATACAACA F V I L M A N T N C H R P K Y S L N T T L S Y - W Q I Q I V I V Q N T L - I Q H I C H I D G K Y K L S S S K I L F K Y N . . . . . . 41518 CTTTTTGAAATTAATAATTTCTAAAATGCTAAACAAACTATAAATTTATTTGGTAGTGAA L F E I N N F - N A K Q T I N L F G S E F L K L I I S K M L N K L - I Y L V V N T F - N - - F L K C - T N Y K F I W - - . . . . . . 41458 TGTTACTAGTGAAAAAAAATAAATCTATTTGGTAGTGATTTCTACACAGATTTATATTTG C Y - - K K I N L F G S D F Y T D L Y L V T S E K K - I Y L V V I S T Q I Y I C M L L V K K N K S I W - - F L H R F I F . . . . 41398 TCAAAAATATTTTAATTGGTTCACTTGTACTTGCTATC S K I F - L V H L Y L L Q K Y F N W F T C T C Y V K N I L I G S L V L A I Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-2-_PGL-8_AGS-2_PPS_1 (42040 41837) (frame '1'; 201 bp, 67 residues) 1 FIILEIMAIK VHGIPLSTAT MRVISCLIEK DLDFEFVFVD MAKEEHKRHP FLSLNVSINN 61 YSLCTAT- 3-phase translation of AGS-2 (+strand): . . . . . . 41361 GATAGCAAGTACAAGTGAACCAATTAAAATATTTTTGACAAATATAAATCTGTGTAGAAA D S K Y K - T N - N I F D K Y K S V - K I A S T S E P I K I F L T N I N L C R N - Q V Q V N Q L K Y F - Q I - I C V E . . . . . . 41421 TCACTACCAAATAGATTTATTTTTTTTCACTAGTAACATTCACTACCAAATAAATTTATA S L P N R F I F F H - - H S L P N K F I H Y Q I D L F F F T S N I H Y Q I N L - I T T K - I Y F F S L V T F T T K - I Y . . . . . . 41481 GTTTGTTTAGCATTTTAGAAATTATTAATTTCAAAAAGTGTTGTATTTAAAGAGTATTTT V C L A F - K L L I S K S V V F K E Y F F V - H F R N Y - F Q K V L Y L K S I L S L F S I L E I I N F K K C C I - R V F . . . . . . 41541 GGACGATGACAATTTGTATTTGCCATCAATATGACAAATGTTTTTGCAACATTTGTGTTT G R - Q F V F A I N M T N V F A T F V F D D D N L Y L P S I - Q M F L Q H L C L W T M T I C I C H Q Y D K C F C N I C V . . . . . . 41601 GACCAAAGTTTCAAAAAAGTGTTTTTAGTTCAGTTTATGCTGCTACTAAAAAACACTGAA D Q S F K K V F L V Q F M L L L K N T E T K V S K K C F - F S L C C Y - K T L N - P K F Q K S V F S S V Y A A T K K H - . . . . . . 41661 CATCTCAAATTTCTAAAATAAACATTTAAAAAGAAATACTTTTAGCTTCTCCAAAACTTG H L K F L K - T F K K K Y F - L L Q N L I S N F - N K H L K R N T F S F S K T - T S Q I S K I N I - K E I L L A S P K L . . . . . . 41721 ATCAAACATGCTAATAATGTACTTCAACCGTCCCATTTTATTTGACACCATTTGACTTGA I K H A N N V L Q P S H F I - H H L T - S N M L I M Y F N R P I L F D T I - L D D Q T C - - C T S T V P F Y L T P F D L . . . . . . 41781 CACCATGTTTAAAGAAAAGAACTTTTGAAACTTATGGTCAAACAATTCTTAAATATTTAT H H V - R K E L L K L M V K Q F L N I Y T M F K E K N F - N L W S N N S - I F M T P C L K K R T F E T Y G Q T I L K Y L . . . . . . 41841 GTAGCTGTACAGAGGGAGTAATTATTTATGCTTACATTGAGTGAGAGGAAAGGGTGCCTC V A V Q R E - L F M L T L S E R K G C L - L Y R G S N Y L C L H - V R G K G A S C S C T E G V I I Y A Y I E - E E R V P . . . . . . 41901 TTGTGTTCTTCTTTGGCCATATCAACAAAGACAAACTCAAAATCCAAATCCTTCTCAATA L C S S L A I S T K T N S K S K S F S I C V L L W P Y Q Q R Q T Q N P N P S Q - L V F F F G H I N K D K L K I Q I L L N . . . . . . 41961 AGGCAAGAAATAACTCTCATGGTTGCAGTTGACAAGGGGATACCATGGACTTTGATTGCC R Q E I T L M V A V D K G I P W T L I A G K K - L S W L Q L T R G Y H G L - L P K A R N N S H G C S - Q G D T M D F D C . . . . . . 42021 ATTATTTCAAGAATAATAAATTAGGATTATGATAGATTGCTCCAAGTTGATGAAATGAAG I I S R I I N - D Y D R L L Q V D E M K L F Q E - - I R I M I D C S K L M K - S H Y F K N N K L G L - - I A P S - - N E . . . . . . 42081 TTATATGGAGGGGTATATATAGGGTAAAATTTTCTTGATATCATTCTTAAAAAAATCAAT L Y G G V Y I G - N F L D I I L K K I N Y M E G Y I - G K I F L I S F L K K S I V I W R G I Y R V K F S - Y H S - K N Q . . . . 42141 ATATCCAAAAAGTCTTGAATCCTAAGCAAACCTTTTTT I S K K S - I L S K P F Y P K S L E S - A N L F Y I Q K V L N P K Q T F F Maximal non-overlapping open reading frames (>= 64 codons): none AGS-3 (45488 45311,45194 45063,44984 44863,44742 44651,44581 44490,42794 42703,42009 41910) SCR (e 1.000 d 0.933 a 0.823,e 1.000 d 0.829 a 0.994,e 1.000 d 0.900 a 0.982,e 1.000 d 0.275 a 0.953,e 1.000 d 0.987 a 0.999,e 1.000 d 0.884 a 0.000,e 1.000) Exon 1 45488 45311 ( 178 n); score: 1.000 Intron 1 45310 45195 ( 116 n); Pd: 0.933 Pa: 0.823 Exon 2 45194 45063 ( 132 n); score: 1.000 Intron 2 45062 44985 ( 78 n); Pd: 0.829 Pa: 0.994 Exon 3 44984 44863 ( 122 n); score: 1.000 Intron 3 44862 44743 ( 120 n); Pd: 0.900 Pa: 0.982 Exon 4 44742 44651 ( 92 n); score: 1.000 Intron 4 44650 44582 ( 69 n); Pd: 0.275 Pa: 0.953 Exon 5 44581 44490 ( 92 n); score: 1.000 Intron 5 44489 42795 (1695 n); Pd: 0.987 Pa: 0.999 Exon 6 42794 42703 ( 92 n); score: 1.000 Intron 6 42702 42010 ( 693 n); Pd: 0.884 Pa: 0.000 Exon 7 42009 41910 ( 100 n); score: 1.000 PGS (45488 45311,45194 45063,44984 44863,44742 44651,44581 44490,42794 42703,42009 41910) SGN-U320669+ 3-phase translation of AGS-3 (-strand): . . . . . . 45488 AAATATAGAGTCACTGGATGACTTATTGGAGAATTATGGTGGAAATGAAGTACAACAGTT K Y R V T G - L I G E L W W K - S T T V N I E S L D D L L E N Y G G N E V Q Q F I - S H W M T Y W R I M V E M K Y N S . . . . . . 45428 CGAGGAGAATTTGGTCTCATCTGAAGTAGCAGTTGTACATGATCCAAATGAGCATTCCAT R G E F G L I - S S S C T - S K - A F H E E N L V S S E V A V V H D P N E H S M S R R I W S H L K - Q L Y M I Q M S I P . . . . . . : 45368 GGCTGAGGTTCTGGATCACTTTCAGCATACAAGTTCCTCACGAGGCAATCCTAAAATG : CT G - G S G S L S A Y K F L T R Q S - N : A A E V L D H F Q H T S S S R G N P K M : L W L R F W I T F S I Q V P H E A I L K C : . . . . . . 45192 GCAAACAAAAATACCTGGATCACGGTTCTTACGGAAGAGAAACTTATTGCTGCTTGGTGA A N K N T W I T V L T E E K L I A A W - Q T K I P G S R F L R K R N L L L L G D C K Q K Y L D H G S Y G R E T Y C C L V . . . . . . 45132 CAGAAACATGAGCAATGGCGAACAACCTGAGGAACTAGATAGTGATCCATCTAGTGATGA Q K H E Q W R T T - G T R - - S I - - - R N M S N G E Q P E E L D S D P S S D E T E T - A M A N N L R N - I V I H L V M . : . . . . . 45072 GGATGTAAAT : GAAGTTCCCCAGATTCTGAAGTCTGCTATACCTCAGAGGACCATGGCTGA G C K : - S S P D S E V C Y T S E D H G - D V N : E V P Q I L K S A I P Q R T M A D R M - M : K F P R F - S L L Y L R G P W L . . . . . . 44934 CCAATTTCATCTAGCGTTAGGAGCTGTATCCACAAATGAGAGGCTATGTATTGCAAGGCC P I S S S V R S C I H K - E A M Y C K A Q F H L A L G A V S T N E R L C I A R P T N F I - R - E L Y P Q M R G Y V L Q G . . : . . . . 44874 TAAGCAATTTGG : TTTATCTGGAAGGTTGCAGCACGTGATGCAATGTGAAAAGGACAGAGA - A I W : F I W K V A A R D A M - K G Q R K Q F G : L S G R L Q H V M Q C E K D R D L S N L : V Y L E G C S T - C N V K R T E . . . . . : . 44694 TACATATTTTTTGGAGAAGTCACAAACACATGCTGCTTCAAGTG : GTGCAGAAAGCTTCAT Y I F F G E V T N T C C F K W : C R K L H T Y F L E K S Q T H A A S S : G A E S F I I H I F W R S H K H M L L Q V : V Q K A S . . . . . . 44565 TGATGTGAGAATTTTGTCAAGTTCTTTGGAGGCCAAGCTGACTGTTTGTTTTTGTGCTTT - C E N F V K F F G G Q A D C L F L C F D V R I L S S S L E A K L T V C F C A L L M - E F C Q V L W R P S - L F V F V L . . : . . . . 44505 ACATGGAGATGAAGAG : GAAGGAGGTACATGTGAACGAGAAAGACGAGGCCATCATTTTGT T W R - R : G R R Y M - T R K T R P S F C H G D E E : E G G T C E R E R R G H H F V Y M E M K R : K E V H V N E K D E A I I L . . . . . : . 42750 GTGCATATTTCTCTCAGATTTAGTCTTTGGATCACATTTTATACCAAG : TCCATGGTATCC V H I S L R F S L W I T F Y T K : S M V S C I F L S D L V F G S H F I P S : P W Y P C A Y F S Q I - S L D H I L Y Q : V H G I . . . . . . 41997 CCTTGTCAACTGCAACCATGAGAGTTATTTCTTGCCTTATTGAGAAGGATTTGGATTTTG P C Q L Q P - E L F L A L L R R I W I L L V N C N H E S Y F L P Y - E G F G F - P L S T A T M R V I S C L I E K D L D F . . . 41937 AGTTTGTCTTTGTTGATATGGCCAAAGA S L S L L I W P K V C L C - Y G Q R E F V F V D M A K Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-2-_PGL-8_AGS-3_PPS_1 (45487 45311,45194 45063,44984 44863,44742 44651,44581 44490,42794 42703,42009 41955) (frame '2'; 759 bp, 253 residues) 1 NIESLDDLLE NYGGNEVQQF EENLVSSEVA VVHDPNEHSM AEVLDHFQHT SSSRGNPKML 61 QTKIPGSRFL RKRNLLLLGD RNMSNGEQPE ELDSDPSSDE DVNEVPQILK SAIPQRTMAD 121 QFHLALGAVS TNERLCIARP KQFGLSGRLQ HVMQCEKDRD TYFLEKSQTH AASSGAESFI 181 DVRILSSSLE AKLTVCFCAL HGDEEEGGTC ERERRGHHFV CIFLSDLVFG SHFIPSPWYP 241 LVNCNHESYF LPY- AGS-4 (44581 44490,43282 43149,42794 42266) SCR (e 0.978 d 0.987 a 0.999,e 0.963 d 0.987 a 0.999,e 0.904) Exon 1 44581 44490 ( 92 n); score: 0.978 Intron 1 44489 43283 (1207 n); Pd: 0.987 Pa: 0.999 Exon 2 43282 43149 ( 134 n); score: 0.963 Intron 2 43148 42795 ( 354 n); Pd: 0.987 Pa: 0.999 Exon 3 42794 42266 ( 529 n); score: 0.904 PGS (44581 44490,43282 43149,42794 42266) SGN-U337182+ 3-phase translation of AGS-4 (-strand): . . . . . . 44581 GTGCAGAAAGCTTCATTGATGTGAGAATTTTGTCAAGTTCTTTGGAGGCCAAGCTGACTG V Q K A S L M - E F C Q V L W R P S - L C R K L H - C E N F V K F F G G Q A D C A E S F I D V R I L S S S L E A K L T . . . . : . . 44521 TTTGTTTTTGTGCTTTACATGGAGATGAAGAG : GGTTCTGAGTGCCTGAGCAATCCTCGAG F V F V L Y M E M K R : V L S A - A I L E L F L C F T W R - R : G F - V P E Q S S R V C F C A L H G D E E : G S E C L S N P R . . . . . . 43254 AAAGGAAGGGTACTGGTAGAAGGGAGTTTACTATCATTTTTAACTCAAGAATTTGTAAAG K G R V L V E G S L L S F L T Q E F V K K E G Y W - K G V Y Y H F - L K N L - R E R K G T G R R E F T I I F N S R I C K . . . . . : . 43194 ATGTCGAACTTGAAATAGGGAATGTTATCCGCATACATCAACCTTG : GAAGGAGGTACATG M S N L K - G M L S A Y I N L : G R R Y M C R T - N R E C Y P H T S T L : E G G T C D V E L E I G N V I R I H Q P W : K E V H . . . . . . 42780 TGAACGAGAAAGACGAGGCCATCATTTTGTGTGCATATTTCTCTCAGATTTAGTCTTTGG - T R K T R P S F C V H I S L R F S L W E R E R R G H H F V C I F L S D L V F G V N E K D E A I I L C A Y F S Q I - S L . . . . . . 42720 ATCACATTTTATACCAAGGTAATAGCAAGGTTCCTTGAGATAACTCTATTGTTGTACTTT I T F Y T K V I A R F L E I T L L L Y F S H F I P R - - Q G S L R - L Y C C T L D H I L Y Q G N S K V P - D N S I V V L . . . . . . 42660 GCACTTGCCTATATTTTCTAAATAGATAAAAATATAGAAGTCTAATTTATCCATTTTCGC A L A Y I F - I D K N I E V - F I H F R H L P I F S K - I K I - K S N L S I F A C T C L Y F L N R - K Y R S L I Y P F S . . . . . . 42600 CATAAATATTACTCATGACTACTGCAAAATGTATGAATGCCTTGAAACAAATTGGTTGCA H K Y Y S - L L Q N V - M P - N K L V A I N I T H D Y C K M Y E C L E T N W L H P - I L L M T T A K C M N A L K Q I G C . . . . . . 42540 TCTGCAAGTTTCCTGGTACATCCCCATGATGTATCCAAGATCCTATAGTTTAAAAGGAAT S A S F L V H P H D V S K I L - F K R N L Q V S W Y I P M M Y P R S Y S L K G I I C K F P G T S P - C I Q D P I V - K E . . . . . . 42480 TTTTATATTTTAGGAAAGAATAGAGATGGAATGTAATTAACTCTAAACACTGTAGGATTT F Y I L G K N R D G M - L T L N T V G F F I F - E R I E M E C N - L - T L - D L F L Y F R K E - R W N V I N S K H C R I . . . . . . 42420 ATGTAATTTTTACAGAAAAATAAAAATAATTCTGAGTGTTGAAGTTTACACCTCCCATAG M - F L Q K N K N N S E C - S L H L P - C N F Y R K I K I I L S V E V Y T S H S Y V I F T E K - K - F - V L K F T P P I . . . . . . 42360 TTTGAAGTAGTCAGTCTGTACTATCCAGCCTGTTTGTTCAACTTTAAAATGCACTTAACA F E V V S L Y Y P A C L F N F K M H L T L K - S V C T I Q P V C S T L K C T - Q V - S S Q S V L S S L F V Q L - N A L N . . . . 42300 AATTGGTTTAGTCAACATATAAGGATAGTTGTGAG N W F S Q H I R I V V I G L V N I - G - L - K L V - S T Y K D S C E Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-2-_PGL-8_AGS-4_PPS_1 (44579 44490,43282 43149,42794 42728) (frame '0'; 288 bp, 96 residues) 1 AESFIDVRIL SSSLEAKLTV CFCALHGDEE GSECLSNPRE RKGTGRREFT IIFNSRICKD 61 VELEIGNVIR IHQPWKEVHV NEKDEAIILC AYFSQI- AGS-5 (48236 48152,47976 47946,47838 47777,47322 47283,47172 47117,46477 46262,46156 46003,45633 45511) SCR (e 1.000 d 0.870 a 0.950,e 1.000 d 0.582 a 0.995,e 1.000 d 0.998 a 0.978,e 1.000 d 0.899 a 0.954,e 0.982 d 0.998 a 0.011,e 1.000 d 0.996 a 0.949,e 1.000 d 0.998 a 0.971,e 1.000) Exon 1 48236 48152 ( 85 n); score: 1.000 Intron 1 48151 47977 ( 175 n); Pd: 0.870 Pa: 0.950 Exon 2 47976 47946 ( 31 n); score: 1.000 Intron 2 47945 47839 ( 107 n); Pd: 0.582 Pa: 0.995 Exon 3 47838 47777 ( 62 n); score: 1.000 Intron 3 47776 47323 ( 454 n); Pd: 0.998 Pa: 0.978 Exon 4 47322 47283 ( 40 n); score: 1.000 Intron 4 47282 47173 ( 110 n); Pd: 0.899 Pa: 0.954 Exon 5 47172 47117 ( 56 n); score: 0.982 Intron 5 47116 46478 ( 639 n); Pd: 0.998 Pa: 0.011 Exon 6 46477 46262 ( 216 n); score: 1.000 Intron 6 46261 46157 ( 105 n); Pd: 0.996 Pa: 0.949 Exon 7 46156 46003 ( 154 n); score: 1.000 Intron 7 46002 45634 ( 369 n); Pd: 0.998 Pa: 0.971 Exon 8 45633 45511 ( 123 n); score: 1.000 PGS (48236 48152,47976 47946,47838 47777,47322 47283,47172 47117,46477 46262,46156 46003,45633 45511) SGN-U330064+ 3-phase translation of AGS-5 (-strand): . . . . . . 48236 CCCAATTAGCTCCGCAAAATTGCGTCTCTGCTTGTTACCAGCTATGGAAGAGAAAGGTTC P N - L R K I A S L L V T S Y G R E R F P I S S A K L R L C L L P A M E E K G S Q L A P Q N C V S A C Y Q L W K R K V . . . : . . . : 48176 AGCTGCTAGTAGAGTATTTCTTCAG : GAAAAGGAAGATTCGAACCAGAGTTTTTCCG : AAGA S C - - S I S S : G K G R F E P E F F R : R A A S R V F L Q : E K E D S N Q S F S : E E Q L L V E Y F F R : K R K I R T R V F P : K . . . . . . : 47834 GGAAGATATGGACGATGACGAATGGATGACAAATGACAATTGTTCCTTAGAAAACAAG : GG G R Y G R - R M D D K - Q L F L R K Q : G E D M D D D E W M T N D N C S L E N K : G R K I W T M T N G - Q M T I V P - K T R : . . . . : . . 47320 AGGTTTAGGAGTCCTTTCCCAGCTTGAACGGCTCACAG : ATGTCAAAAGACTTCATCATTC R F R S P F P A - T A H R : C Q K T S S F G L G V L S Q L E R L T : D V K R L H H S E V - E S F P S L N G S Q : M S K D F I I . . . . : . . 47150 AACCGATACAGTGAACTCTGATCAGCTGGTACAG : AGGCGGACAGGTTTATGCGAAGAAGA N R Y S E L - S A G T : E A D R F M R R R T D T V N S D Q L V Q : R R T G L C E E D Q P I Q - T L I S W Y R : G G Q V Y A K K . . . . . . 46451 TGACGTTGAAGTTCCTTTGTTTAAGAGTCAGGATGGTAGCCTGATCAACAAAAATGATCA - R - S S F V - E S G W - P D Q Q K - S D V E V P L F K S Q D G S L I N K N D Q M T L K F L C L R V R M V A - S T K M I . . . . . . 46391 GGATGGTAGCTTCATCAACAAAAATGATCATTGGAAGGCGTTGTCCTGCTCCTTAGATGA G W - L H Q Q K - S L E G V V L L L R - D G S F I N K N D H W K A L S C S L D D R M V A S S T K M I I G R R C P A P - M . . . . . . 46331 TGAATTTTGTCACGTCACTAGAATTACATCCACTTGTAATTCAGAAGAGGAAATTATGTC - I L S R H - N Y I H L - F R R G N Y V E F C H V T R I T S T C N S E E E I M S M N F V T S L E L H P L V I Q K R K L C . : . . . . . 46271 TGATGATGAG : ATGAGGCCTTCTACTGATGGAAAATTCAAAAGAGATGGTAAAAGTACAAT - - - : D E A F Y - W K I Q K R W - K Y N D D E : M R P S T D G K F K R D G K S T M L M M R : - G L L L M E N S K E M V K V Q . . . . . . 46106 GCTTAAAGTAAGCGCAGATTGCAAATCAGGAGCTTTCTTCAATAAAGATGCTGGGTGTTC A - S K R R L Q I R S F L Q - R C W V F L K V S A D C K S G A F F N K D A G C S C L K - A Q I A N Q E L S S I K M L G V . . . . . : . 46046 ATCGGTATACGGGGCCTCATCAAAATTGAACAGATCATCTAAAG : GAAGCCCGGGCAAATC I G I R G L I K I E Q I I - R : K P G Q I S V Y G A S S K L N R S S K : G S P G K S H R Y T G P H Q N - T D H L K : E A R A N . . . . . . 45617 TAAGGCCAAATTTTTGTTCCAATCCCGGCCACAGAAGAAAGACTATGCTTTGGTTGTCCA - G Q I F V P I P A T E E R L C F G C P K A K F L F Q S R P Q K K D Y A L V V H L R P N F C S N P G H R R K T M L W L S . . . . . 45557 TGATAGTTGTGAAACCTGCATGCCCTTATCTGTGCTTCCACTAAATG - - L - N L H A L I C A S T K D S C E T C M P L S V L P L N M I V V K P A C P Y L C F H - M Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-2-_PGL-8_AGS-5_PPS_1 (48235 48152,47976 47946,47838 47777,47322 47283,47172 47117,46477 46262,46156 46003,45633 45512) (frame '2'; 765 bp, 255 residues) 1 PISSAKLRLC LLPAMEEKGS AASRVFLQEK EDSNQSFSEE EDMDDDEWMT NDNCSLENKG 61 GLGVLSQLER LTDVKRLHHS TDTVNSDQLV QRRTGLCEED DVEVPLFKSQ DGSLINKNDQ 121 DGSFINKNDH WKALSCSLDD EFCHVTRITS TCNSEEEIMS DDEMRPSTDG KFKRDGKSTM 181 LKVSADCKSG AFFNKDAGCS SVYGASSKLN RSSKGSPGKS KAKFLFQSRP QKKDYALVVH 241 DSCETCMPLS VLPLN PGL 9 (+ strand): 49929 54842 AGS-1 (49929 50417,50832 50892,50994 51063,51518 51772,51932 51999,52087 52196,52626 52666,52759 52907,53804 54157,54429 54842) SCR (e 1.000 d 0.995 a 0.999,e 1.000 d 0.999 a 0.896,e 1.000 d 0.998 a 0.915,e 1.000 d 0.861 a 0.996,e 1.000 d 1.000 a 0.579,e 1.000 d 0.965 a 0.999,e 1.000 d 0.970 a 0.970,e 1.000 d 0.045 a 0.890,e 1.000 d 1.000 a 0.972,e 0.961) Exon 1 49929 50417 ( 489 n); score: 1.000 Intron 1 50418 50831 ( 414 n); Pd: 0.995 Pa: 0.999 Exon 2 50832 50892 ( 61 n); score: 1.000 Intron 2 50893 50993 ( 101 n); Pd: 0.999 Pa: 0.896 Exon 3 50994 51063 ( 70 n); score: 1.000 Intron 3 51064 51517 ( 454 n); Pd: 0.998 Pa: 0.915 Exon 4 51518 51772 ( 255 n); score: 1.000 Intron 4 51773 51931 ( 159 n); Pd: 0.861 Pa: 0.996 Exon 5 51932 51999 ( 68 n); score: 1.000 Intron 5 52000 52086 ( 87 n); Pd: 1.000 Pa: 0.579 Exon 6 52087 52196 ( 110 n); score: 1.000 Intron 6 52197 52625 ( 429 n); Pd: 0.965 Pa: 0.999 Exon 7 52626 52666 ( 41 n); score: 1.000 Intron 7 52667 52758 ( 92 n); Pd: 0.970 Pa: 0.970 Exon 8 52759 52907 ( 149 n); score: 1.000 Intron 8 52908 53803 ( 896 n); Pd: 0.045 Pa: 0.890 Exon 9 53804 54157 ( 354 n); score: 1.000 Intron 9 54158 54428 ( 271 n); Pd: 1.000 Pa: 0.972 Exon 10 54429 54842 ( 414 n); score: 0.961 PGS (49929 50417,50832 50892,50994 51063,51518 51772,51932 51999,52087 52196,52626 52666,52759 52907,53804 54157,54429 54842) SGN-U324287+ 3-phase translation of AGS-1 (+strand): . . . . . . 49929 ATATATGAAATTTTTCCAAAATATAATATGAACCCAATTATATAAACCCGCTCAAATTTG I Y E I F P K Y N M N P I I - T R S N L Y M K F F Q N I I - T Q L Y K P A Q I - I - N F S K I - Y E P N Y I N P L K F . . . . . . 49989 AAATCTCCCCAAATCCCCATTTCTGACACCATTACAGAGGTGAAGCACCGAGCTCGAACT K S P Q I P I S D T I T E V K H R A R T N L P K S P F L T P L Q R - S T E L E L E I S P N P H F - H H Y R G E A P S S N . . . . . . 50049 CTCTCCAGATTCTCTCTACAAGAAGATCCATTTCCAGCTGATGGAGAGGAAAACTCCAAA L S R F S L Q E D P F P A D G E E N S K S P D S L Y K K I H F Q L M E R K T P N S L Q I L S T R R S I S S - W R G K L Q . . . . . . 50109 TAGAACAAGGAGGAAGCAGAGAAGCAACCAAAAAAGCAAGAAGAGGATGAACAAGGGTTC - N K E E A E K Q P K K Q E E D E Q G F R T R R K Q R S N Q K S K K R M N K G S I E Q G G S R E A T K K A R R G - T R V . . . . . . 50169 ACTGTCTAGGCATTTCACTGTTGGGATTGCAAAACCTCCGCTTCCTAATCAACAACAGCT T V - A F H C W D C K T S A S - S T T A L S R H F T V G I A K P P L P N Q Q Q L H C L G I S L L G L Q N L R F L I N N S . . . . . . 50229 TCATTCCTCACTTTCTAATGTTACTTTACCGAATCCTTCTAGATTCCAGAAACTTCTGGA S F L T F - C Y F T E S F - I P E T S G H S S L S N V T L P N P S R F Q K L L D F I P H F L M L L Y R I L L D S R N F W . . . . . . 50289 TTCTGATGACCTTCCGCCAGCTCAATCTCAGTTCTCTTCAGTTTTGCCGTTGAATCTCGA F - - P S A S S I S V L F S F A V E S R S D D L P P A Q S Q F S S V L P L N L D I L M T F R Q L N L S S L Q F C R - I S . . . . . . 50349 TGCTGATGATGATGCCGATGTTGCCGATGTTGCTGAAAAGGACTTCATTCTCAGTCAAGA C - - - C R C C R C C - K G L H S Q S R A D D D A D V A D V A E K D F I L S Q D M L M M M P M L P M L L K R T S F S V K . : . . . . . 50409 TTTCTTCTG : TACCCCGGATTATCTAACGCCAGATGCACCTGCAATTTGTAATGGGCTTGA F L L : Y P G L S N A R C T C N L - W A - F F C : T P D Y L T P D A P A I C N G L D I S S : V P R I I - R Q M H L Q F V M G L . : . . . . . 50883 TGGTGATAAG : GATGATTATACTCCTTGTCCCAAATCACCCGAGAAGCTTCTAAGTGTATC W - - : G - L Y S L S Q I T R E A S K C I G D K : D D Y T P C P K S P E K L L S V S M V I R : M I I L L V P N H P R S F - V Y . . : . . . . 51044 AAGAAAGAGGCCGCGACTAG : CGTCGGTAAGGCCTTTTAGTTCCGATTTATCTGGACAGCA K K E A A T S : V G K A F - F R F I W T A R K R P R L : A S V R P F S S D L S G Q Q Q E R G R D - : R R - G L L V P I Y L D S . . . . . . 51558 GCAGCCAGTAGATATTCCTACAGATACTTTTGGGACAGACGAAATGAAATCAGAAAAGAT A A S R Y S Y R Y F W D R R N E I R K D Q P V D I P T D T F G T D E M K S E K I S S Q - I F L Q I L L G Q T K - N Q K R . . . . . . 51618 AAGCGAGTCAGAAAAGGGTCCCAGTTATGTGTCACAATCTGCTATTGCTTTAAGATATCG K R V R K G S Q L C V T I C Y C F K I S S E S E K G P S Y V S Q S A I A L R Y R - A S Q K R V P V M C H N L L L L - D I . . . . . . 51678 AGTCATGCCTCCTCCGTGCATTAGAAACCCTTATCTCGGGGATGCTTCCGAGATAGATGC S H A S S V H - K P L S R G C F R D R C V M P P P C I R N P Y L G D A S E I D A E S C L L R A L E T L I S G M L P R - M . . . . : . . 51738 TGATCCTTTTGGTAACAGGAGATCCAAGTACCCAG : GTTTTAACCCTGCAATTTCTGGTAA - S F W - Q E I Q V P R : F - P C N F W - D P F G N R R S K Y P : G F N P A I S G N L I L L V T G D P S T Q : V L T L Q F L V . . . . . : . 51957 TGATGGTCTGTCACGGTATCGTACTGATTTCCACGAAATTGAG : CAAATCGGTAGTGGGAA - W S V T V S Y - F P R N - : A N R - W E D G L S R Y R T D F H E I E : Q I G S G N M M V C H G I V L I S T K L S : K S V V G . . . . . . 52104 CTTCAGCCGTGTTTTCAAAGTCTTTAAGAGAATTGATGGATGTATGTATGCAGTGAAACA L Q P C F Q S L - E N - W M Y V C S E T F S R V F K V F K R I D G C M Y A V K H T S A V F S K S L R E L M D V C M Q - N . . . . : . . 52164 TAGCACTAAACAGTTACATCAAGACACAGATAG : GAGACAGGCTTTGATGGAAGTGCAAGC - H - T V T S R H R - : E T G F D G S A S S T K Q L H Q D T D R : R Q A L M E V Q A I A L N S Y I K T Q I : G D R L - W K C K . . : . . . . 52653 ATTGGCTGCTTTAG : GACCTCATGAGAACGTAGTTGGTTATTATTCATCTTGGTTTGAAAA I G C F R : T S - E R S W L L F I L V - K L A A L : G P H E N V V G Y Y S S W F E N H W L L - : D L M R T - L V I I H L G L K . . . . . . 52805 TGAACACCTTTACATCCAAATGGAGCTCTGTGACCACAGCTTATCCAATAAAAAATATTG - T P L H P N G A L - P Q L I Q - K I L E H L Y I Q M E L C D H S L S N K K Y C M N T F T S K W S S V T T A Y P I K N I . . . . . : . 52865 TAAACTATTTTCGGAGGTAGAAGTTTTGGAAGCAATGTATCAG : GTAGCCAACGCATTGCA - T I F G G R S F G S N V S : G S Q R I A K L F S E V E V L E A M Y Q : V A N A L Q V N Y F R R - K F W K Q C I R : - P T H C . . . . . . 53821 GTTTATACATCAGAGAGGGGTCGCTCATTTAGATGTAAAGCCAGATAATATTTATGTGAA V Y T S E R G R S F R C K A R - Y L C E F I H Q R G V A H L D V K P D N I Y V K S L Y I R E G S L I - M - S Q I I F M - . . . . . . 53881 AAATGGTGTATATAAGCTTGGTGATTTTGGATGTGCAACTCTTCTTGATAAGAGCCAGCC K W C I - A W - F W M C N S S - - E P A N G V Y K L G D F G C A T L L D K S Q P K M V Y I S L V I L D V Q L F L I R A S . . . . . . 53941 AATTGAAGAGGGTGATGCACGTTATATGCCCCAAGAAATACTTAATGAGAACTATGATCA N - R G - C T L Y A P R N T - - E L - S I E E G D A R Y M P Q E I L N E N Y D H Q L K R V M H V I C P K K Y L M R T M I . . . . . . 54001 TCTTGACAAAGTTGACATATTCTCCTTGGGCGCTGCAATATATGAACTTATTAGAGGGTC S - Q S - H I L L G R C N I - T Y - R V L D K V D I F S L G A A I Y E L I R G S I L T K L T Y S P W A L Q Y M N L L E G . . . . . . 54061 TTCACTGCCAGAATCAGGGCCTCATTTTCTAAACCTCAGGGAGGGGAAATTGCCTCTTCT F T A R I R A S F S K P Q G G E I A S S S L P E S G P H F L N L R E G K L P L L L H C Q N Q G L I F - T S G R G N C L F . . . . : . . 54121 TCCGGGTCACTCCTTGCAATTTCAGAATCTACTCAAG : GCAATGATGGACCCAGATCCAAC S G S L L A I S E S T Q : G N D G P R S N P G H S L Q F Q N L L K : A M M D P D P T F R V T P C N F R I Y S R : Q - W T Q I Q . . . . . . 54452 ACGTCGTCCTTCTGCAAAAGGCGTTGTGGATAATCCAATCTTTGAAAGATGGCAAAGAAA T S S F C K R R C G - S N L - K M A K K R R P S A K G V V D N P I F E R W Q R N H V V L L Q K A L W I I Q S L K D G K E . . . . . . 54512 TTCCAACAAGTAGATATCCATGTAAATCACTGTTTTCTGGGATTTGTCGATTGCTACTTT F Q Q V D I H V N H C F L G F V D C Y F S N K - I S M - I T V F W D L S I A T F I P T S R Y P C K S L F S G I C R L L L . . . . . . 54572 TGCCAAAGATCCAGAATTCAAAGCTGCAGTATCTCATGCAGCATTCTGGTGTTGACCTAT C Q R S R I Q S C S I S C S I L V L T Y A K D P E F K A A V S H A A F W C - P I L P K I Q N S K L Q Y L M Q H S G V D L . . . . . . 54632 AGCCATTTCTGTAAATAGAGATAAAGCTTCATGCACCAAATTTTCCCATTTTGATGGTGC S H F C K - R - S F M H Q I F P F - W C A I S V N R D K A S C T K F S H F D G A - P F L - I E I K L H A P N F P I L M V . . . . . . 54692 CACTTTTGCCAAATATTATATCAAGAATGTAATGTTGTATCCTAATTGACCTGCAAACTG H F C Q I L Y Q E C N V V S - L T C K L T F A K Y Y I K N V M L Y P N - P A N C P L L P N I I S R M - C C I L I D L Q T . . . . . . 54752 TGGTTTGTATGTCAAAATATGGTATTTGGTGGCTTTTAAGCTGATTAATTCAGTATTGGT W F V C Q N M V F G G F - A D - F S I G G L Y V K I W Y L V A F K L I N S V L V V V C M S K Y G I W W L L S - L I Q Y W . . . . 54812 TTTTGGTTTTGGTAACTCAGATTTGTATTAA F W F W - L R F V L F G F G N S D L Y - F L V L V T Q I C I Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-2+_PGL-9_AGS-1_PPS_1 (50032 50417,50832 50892,50994 51063,51518 51772,51932 51999,52087 52196,52626 52666,52759 52907,53804 54157,54429 54524) (frame '2'; 1587 bp, 529 residues) 1 STELELSPDS LYKKIHFQLM ERKTPNRTRR KQRSNQKSKK RMNKGSLSRH FTVGIAKPPL 61 PNQQQLHSSL SNVTLPNPSR FQKLLDSDDL PPAQSQFSSV LPLNLDADDD ADVADVAEKD 121 FILSQDFFCT PDYLTPDAPA ICNGLDGDKD DYTPCPKSPE KLLSVSRKRP RLASVRPFSS 181 DLSGQQQPVD IPTDTFGTDE MKSEKISESE KGPSYVSQSA IALRYRVMPP PCIRNPYLGD 241 ASEIDADPFG NRRSKYPGFN PAISGNDGLS RYRTDFHEIE QIGSGNFSRV FKVFKRIDGC 301 MYAVKHSTKQ LHQDTDRRQA LMEVQALAAL GPHENVVGYY SSWFENEHLY IQMELCDHSL 361 SNKKYCKLFS EVEVLEAMYQ VANALQFIHQ RGVAHLDVKP DNIYVKNGVY KLGDFGCATL 421 LDKSQPIEEG DARYMPQEIL NENYDHLDKV DIFSLGAAIY ELIRGSSLPE SGPHFLNLRE 481 GKLPLLPGHS LQFQNLLKAM MDPDPTRRPS AKGVVDNPIF ERWQRNSNK- PGL 10 (- strand): 56449 55485 AGS-1 (56449 56327,55914 55485) SCR (e 0.886 d 0.000 a 0.000,e 0.766) Exon 1 56449 56327 ( 123 n); score: 0.886 Intron 1 56326 55915 ( 412 n); Pd: 0.000 Pa: 0.000 Exon 2 55914 55485 ( 430 n); score: 0.766 PGS (56449 56327,55914 55485) SGN-U341178+ 3-phase translation of AGS-1 (-strand): . . . . . . 56449 GGTAAATTATGTGGCTAAGCAAACTTATACTATTTAATTACTCATCATAGTTATAGTTTA G K L C G - A N L Y Y L I T H H S Y S L V N Y V A K Q T Y T I - L L I I V I V Y - I M W L S K L I L F N Y S S - L - F . . . . . . 56389 CTATAATTACCACCCACGACTAACATTATACATTAATTATATGGGCTGACCTCGAGTTTG L - L P P T T N I I H - L Y G L T S S L Y N Y H P R L T L Y I N Y M G - P R V C T I I T T H D - H Y T L I I W A D L E F . : . . . . . 56329 TAT : TATATATATAATTCGCCAAGATATACAAATACATATGTATAATATACAATTATTTAA Y : Y I Y N S P R Y T N T Y V - Y T I I - I : I Y I I R Q D I Q I H M Y N I Q L F N V : L Y I - F A K I Y K Y I C I I Y N Y L . . . . . . 55857 CCTACATACATATACAATTTGCCTCTCTCCCACTCTCTGCCCTCTCTCACTCGCATCTCT P T Y I Y N L P L S H S L P S L T R I S L H T Y T I C L S P T L C P L S L A S L T Y I H I Q F A S L P L S A L S H S H L . . . . . . 55797 CCTCCCTCTCTCAATCTCGCTCATCTCTCTCCTCCCTCTCCTATTCTCGCTTGCCATATA P P S L N L A H L S P P S P I L A C H I L P L S I S L I S L L P L L F S L A I Y S S L S Q S R S S L S S L S Y S R L P Y . . . . . . 55737 TACAAATGCATATGTACACAATTATATACATATACAATTCACCTCTCTCCCACCCTTTGC Y K C I C T Q L Y T Y T I H L S P T L C T N A Y V H N Y I H I Q F T S L P P F A I Q M H M Y T I I Y I Y N S P L S H P L . . . . . . 55677 CCTCTCTCCTCCCTCTCCTAGTCTCGCTCGCCTTCTCCTCCCTCTCTCAATATCTCTTTC P L S S L S - S R S P S P P S L N I S F L S P P S P S L A R L L L P L S I S L S P S L L P L L V S L A F S S L S Q Y L F . . . . . . 55617 CATATACAAAAATATATTTATAATATACAATTATCTAATCAATATACTTATACAATTCAC H I Q K Y I Y N I Q L S N Q Y T Y T I H I Y K N I F I I Y N Y L I N I L I Q F T P Y T K I Y L - Y T I I - S I Y L Y N S . . . . . . 55557 CTTTCTCCTACTCTTTTCCCCCTTTCTCTCACTTCTCTCCTCTCTCTCCCAATCTCGCTC L S P T L F P L S L T S L L S L P I S L F L L L F S P F L S L L S S L S Q S R S P F S Y S F P P F S H F S P L S P N L A . . 55497 GCTTCTCTCTTCT A S L F L L S S R F S L Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-2-_PGL-10_AGS-1_PPS_1 (56340 56327,55914 55485) (frame '2'; 444 bp, 148 residues) 1 PRVCIIYIIR QDIQIHMYNI QLFNLHTYTI CLSPTLCPLS LASLLPLSIS LISLLPLLFS 61 LAIYTNAYVH NYIHIQFTSL PPFALSPPSP SLARLLLPLS ISLSIYKNIF IIYNYLINIL 121 IQFTFLLLFS PFLSLLSSLS QSRSLLSS PGL 11 (+ strand): 60085 61692 AGS-1 (60085 60594,60769 60994,61072 61320,61427 61480,61573 61692) SCR (e 0.950 d 0.996 a 0.955,e 0.978 d 1.000 a 0.999,e 0.988 d 0.989 a 0.997,e 0.981 d 1.000 a 0.977,e 1.000) Exon 1 60085 60594 ( 510 n); score: 0.950 Intron 1 60595 60768 ( 174 n); Pd: 0.996 Pa: 0.955 Exon 2 60769 60994 ( 226 n); score: 0.978 Intron 2 60995 61071 ( 77 n); Pd: 1.000 Pa: 0.999 Exon 3 61072 61320 ( 249 n); score: 0.988 Intron 3 61321 61426 ( 106 n); Pd: 0.989 Pa: 0.997 Exon 4 61427 61480 ( 54 n); score: 0.981 Intron 4 61481 61572 ( 92 n); Pd: 1.000 Pa: 0.977 Exon 5 61573 61692 ( 120 n); score: 1.000 PGS (60085 60594,60769 60994,61072 61320,61427 61480,61573 61692) SGN-U319443+ 3-phase translation of AGS-1 (+strand): . . . . . . 60085 ATCAACTTTATAGATTGGTGTAATAAGAGAATTGTAGTGTTGGGAAACGCTTTTGCGAAA I N F I D W C N K R I V V L G N A F A K S T L - I G V I R E L - C W E T L L R K Q L Y R L V - - E N C S V G K R F C E . . . . . . 60145 ATTTTGTAATTGATTGAGTTTTGAGGTTTTTGATCTCGAACTCAATCAATCATACAAATT I L - L I E F - G F - S R T Q S I I Q I F C N - L S F E V F D L E L N Q S Y K F N F V I D - V L R F L I S N S I N H T N . . . . . . 60205 TCGTAGTTGATTGAGTTCTGAGGTGTTTTGATTTCGAATTCAATCAATCATTCATTTGCA S - L I E F - G V L I S N S I N H S F A R S - L S S E V F - F R I Q S I I H L H F V V D - V L R C F D F E F N Q S F I C . . . . . . 60265 TTTCGTTGGTGTCTCCCACCTACGTTTCTCTTCGTTTTTCTTCTCTTTTTTTAAGCAACC F R W C L P P T F L F V F L L F F - A T F V G V S H L R F S S F F F S F F K Q P I S L V S P T Y V S L R F S S L F L S N . . . . . . 60325 AATTCATTGGTGATTAAGTTTACATGTCAATGGAACTCAACCAATTTTAATTTTGCTTGA N S L V I K F T C Q W N S T N F N F A - I H W - L S L H V N G T Q P I L I L L D Q F I G D - V Y M S M E L N Q F - F C L . . . . . . 60385 TTTGATCAGCTTTTTTCAAAACAAAACAAAAAAAAATTAAAATCGTTTTCATAAATTAAA F D Q L F S K Q N K K K L K S F S - I K L I S F F Q N K T K K N - N R F H K L K I - S A F F K T K Q K K I K I V F I N - . . . . . . 60445 AAAAAGAAGAAGATGGGGAGCAACATTGAGGATAACCAAGAAGATGTTCCCATGGAGCTA K K K K M G S N I E D N Q E D V P M E L K R R R W G A T L R I T K K M F P W S Y K K E E D G E Q H - G - P R R C S H G A . . . . . . 60505 CAACTCAAGGGAAAGAAGCCTTCGAGCCAAAAGTTGAAACGACATGACTCTTTGGATGTC Q L K G K K P S S Q K L K R H D S L D V N S R E R S L R A K S - N D M T L W M S T T Q G K E A F E P K V E T T - L F G C . . . : . . . 60565 GAAGCAAGCAAATTGCCCGATGCCAAAAAG : GTAGTCGGAATGTCTGTGCTACTAAAACTT E A S K L P D A K K : V V G M S V L L K L K Q A N C P M P K R : - S E C L C Y - N L R S K Q I A R C Q K : G S R N V C A T K T . . . . . . 60799 GCATTCCAAAGCATAGGAGTGGTGTATGGAGATATTGGAACGTCACCATTGTACGTGTTT A F Q S I G V V Y G D I G T S P L Y V F H S K A - E W C M E I L E R H H C T C F C I P K H R S G V W R Y W N V T I V R V . . . . . . 60859 TCAACCATCTTTCTCGAAGGTGTAAAACACGAAGAGGATATACTTGGTGCTCTATCTCTC S T I F L E G V K H E E D I L G A L S L Q P S F S K V - N T K R I Y L V L Y L S F N H L S R R C K T R R G Y T W C S I S . . . . . . 60919 ATCTTGTATACGATCACCTTGATCCCTGTCGTCAAGTACGTATTCATCGTTCTCCAAGCT I L Y T I T L I P V V K Y V F I V L Q A S C I R S P - S L S S S T Y S S F S K L H L V Y D H L D P C R Q V R I H R S P S . . : . . . . 60979 AATGACAACGGAGATG : GTGGTACGTTCGCCTTATATTCATTGATATGCCGATATTCCAAG N D N G D : G G T F A L Y S L I C R Y S K M T T E M : V V R S P Y I H - Y A D I P R - - Q R R W : W Y V R L I F I D M P I F Q . . . . . . 61116 GTGGGATTGATTCCGAGTACAATGGCAGAAGACAGCGATGTCTCGACTTTTAAACTTGAT V G L I P S T M A E D S D V S T F K L D W D - F R V Q W Q K T A M S R L L N L I G G I D S E Y N G R R Q R C L D F - T - . . . . . . 61176 ATGCCTGATAGACGTACACGTAGGGCATCACAACTTAAGTCGATGCTAGAAAACAGCCAA M P D R R T R R A S Q L K S M L E N S Q C L I D V H V G H H N L S R C - K T A N Y A - - T Y T - G I T T - V D A R K Q P . . . . . . 61236 TTCGCGAAGTTCTTTCTGCTAATTGCAACAATGCTTGGTACTTCCATGGTTATCGGTGAT F A K F F L L I A T M L G T S M V I G D S R S S F C - L Q Q C L V L P W L S V M I R E V L S A N C N N A W Y F H G Y R - . . . : . . . 61296 GGTGTCCTAACACCCTGTATTTCAG : TTTTGTCTGCAATTGGAGGAGTTAAAGCAGCTGCT G V L T P C I S : V L S A I G G V K A A A V S - H P V F Q : F C L Q L E E L K Q L L W C P N T L Y F S : F V C N W R S - S S C . . : . . . . 61462 CCAGACGCAATGACTGAAG : ACAGGATCGTTTGGCTTGCAGTAGCCATCTTGATACTTCTG P D A M T E : D R I V W L A V A I L I L L Q T Q - L K : T G S F G L Q - P S - Y F C S R R N D - R : Q D R L A C S S H L D T S . . . . . . 61614 TTCATGTTTCAAAGATTTGGAACTGAAAAAGTTGGTTACACATTTGCACCTATACTTTGC F M F Q R F G T E K V G Y T F A P I L C S C F K D L E L K K L V T H L H L Y F A V H V S K I W N - K S W L H I C T Y T L . . 61674 TTATGGTTTGTATTGATTG L W F V L I Y G L Y - L L M V C I D Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-2+_PGL-11_AGS-1_PPS_1 (60439 60594,60769 60994,61072 61320,61427 61480,61573 61691) (frame '1'; 804 bp, 268 residues) 1 IKKKKKMGSN IEDNQEDVPM ELQLKGKKPS SQKLKRHDSL DVEASKLPDA KKVVGMSVLL 61 KLAFQSIGVV YGDIGTSPLY VFSTIFLEGV KHEEDILGAL SLILYTITLI PVVKYVFIVL 121 QANDNGDGGT FALYSLICRY SKVGLIPSTM AEDSDVSTFK LDMPDRRTRR ASQLKSMLEN 181 SQFAKFFLLI ATMLGTSMVI GDGVLTPCIS VLSAIGGVKA AAPDAMTEDR IVWLAVAILI 241 LLFMFQRFGT EKVGYTFAPI LCLWFVLI PGL 12 (+ strand): 62221 63882 AGS-1 (62221 62456,62572 63882) SCR (e 0.992 d 0.962 a 0.996,e 0.978) Exon 1 62221 62456 ( 236 n); score: 0.992 Intron 1 62457 62571 ( 115 n); Pd: 0.962 Pa: 0.996 Exon 2 62572 63882 (1311 n); score: 0.978 PGS (62221 62456,62572 63882) SGN-U320704+ 3-phase translation of AGS-1 (+strand): . . . . . . 62221 GTTTGTCATAGCTGTATTCGCAGCCATCATTGCAAGTCAAGCTTTGATTTCCGGGACATT V C H S C I R S H H C K S S F D F R D I F V I A V F A A I I A S Q A L I S G T F L S - L Y S Q P S L Q V K L - F P G H . . . . . . 62281 CGCTATAATCCAGCAATCTCTGGCTTTAGGATGCTTTCCTCGTGTTAAAATCGTGCATAC R Y N P A I S G F R M L S S C - N R A Y A I I Q Q S L A L G C F P R V K I V H T S L - S S N L W L - D A F L V L K S C I . . . . . . 62341 ATCAAAGAAACATCATGGACAAATCTACATTCCTGAAATCAATAACCTTCTCATGATCGC I K E T S W T N L H S - N Q - P S H D R S K K H H G Q I Y I P E I N N L L M I A H Q R N I M D K S T F L K S I T F S - S . . . . . . : 62401 TTGTGTTCTTACCACTATCGGATTCAAGACTACTGAAAAGCTTAGCAATGCTTATG : GAAT L C S Y H Y R I Q D Y - K A - Q C L W : N C V L T T I G F K T T E K L S N A Y : G I L V F L P L S D S R L L K S L A M L M : E . . . . . . 62576 AGCAGTGGTGTTTGTGATGTTCCTAACATCGTGCTTCCTCATACTAGTCATGATCTTGAT S S G V C D V P N I V L P H T S H D L D A V V F V M F L T S C F L I L V M I L I - Q W C L - C S - H R A S S Y - S - S - . . . . . . 62636 ATGGAAAACCAACATTCTTCTTATTATCGTCTATATTCTAATCATTGTTTCGGTTGAGCT M E N Q H S S Y Y R L Y S N H C F G - A W K T N I L L I I V Y I L I I V S V E L Y G K P T F F L L S S I F - S L F R L S . . . . . . 62696 TGTATACCTAAGCGCAGTCCTTTACAAGTTTGAACAAGGTGGTTACCTCCCTGTGGCTTT C I P K R S P L Q V - T R W L P P C G F V Y L S A V L Y K F E Q G G Y L P V A L L Y T - A Q S F T S L N K V V T S L W L . . . . . . 62756 AGCTCTGTTCCTAATGTTTATCATGTACGTATGGAACTATGTGTACCGTAAGAAGTATCA S S V P N V Y H V R M E L C V P - E V S A L F L M F I M Y V W N Y V Y R K K Y H - L C S - C L S C T Y G T M C T V R S I . . . . . . 62816 CTACGAGCTAGAACACAAGATCTCTCCCGAAAAAGTTAAAGAAACATTGGATGCAACCAG L R A R T Q D L S R K S - R N I G C N Q Y E L E H K I S P E K V K E T L D A T S T T S - N T R S L P K K L K K H W M Q P . . . . . . 62876 TTCACATCGCCTTCCAGGTCTTGCCATTTTCTACTCTGAACTAGTCCACGGAATCCCCCC F T S P S R S C H F L L - T S P R N P P S H R L P G L A I F Y S E L V H G I P P V H I A F Q V L P F S T L N - S T E S P . . . . . . 62936 AATCTTCAAGCATTATGTTGAGAATGTACCTGCTTTACACTCTGTCCTCGTGTTCGCTTC N L Q A L C - E C T C F T L C P R V R F I F K H Y V E N V P A L H S V L V F A S Q S S S I M L R M Y L L Y T L S S C S L . . . . . . 62996 TGTCAAATCACTTCCCATAAGCAAAGTTCCACTAGAAGAAAGGTTCCTCTTCAGAAGGGT C Q I T S H K Q S S T R R K V P L Q K G V K S L P I S K V P L E E R F L F R R V L S N H F P - A K F H - K K G S S S E G . . . . . . 63056 GAAACCATATGACCTCTATGTGTTCCGTTGTGTGATACGTTATGGATACAATGAAATGCG E T I - P L C V P L C D T L W I Q - N A K P Y D L Y V F R C V I R Y G Y N E M R - N H M T S M C S V V - Y V M D T M K C . . . . . . 63116 CAATGAGGAAGAGCCTATCGAGAAGTTATTGGTAGAAAGGCTAAAGAACTACATCAAGGA Q - G R A Y R E V I G R K A K E L H Q G N E E E P I E K L L V E R L K N Y I K E A M R K S L S R S Y W - K G - R T T S R . . . . . . 63176 AGATTACATGTTCTCAGTTGCAGCAAATGGAGACAATCAAGGAGAAACTGCTTCCTTGAT R L H V L S C S K W R Q S R R N C F L D D Y M F S V A A N G D N Q G E T A S L I K I T C S Q L Q Q M E T I K E K L L P - . . . . . . 63236 TGAGAAAGACGTCGAAGTACTTGAGAGAGCTTCCAACATGGGAGTGGTTCATTTGGTTGG - E R R R S T - E S F Q H G S G S F G W E K D V E V L E R A S N M G V V H L V G L R K T S K Y L R E L P T W E W F I W L . . . . . . 63296 AGAACAAGACGTTGTCGCGTGCAAGGGGTCTGGTGTAACCAAAAGAATGGTGATCAACTA R T R R C R V Q G V W C N Q K N G D Q L E Q D V V A C K G S G V T K R M V I N Y E N K T L S R A R G L V - P K E W - S T . . . . . . 63356 CGCATACAATTTCCTCAAGAGGAACTTAAGACAGAGTAGTAACAAAGTATTCGATATCCC R I Q F P Q E E L K T E - - Q S I R Y P A Y N F L K R N L R Q S S N K V F D I P T H T I S S R G T - D R V V T K Y S I S . . . . . . 63416 AACGAAGCGAATGCTCAAAGTTGGAATGACATGTGAGCTTTAGGGTGATTATTTTCTCTA N E A N A Q S W N D M - A L G - L F S L T K R M L K V G M T C E L - G D Y F L - Q R S E C S K L E - H V S F R V I I F S . . . . . . 63476 AAAAAAAATTGTAAGGTTTAAAATAAATGTGCATAGAGTTGAAGGGAACAAGGGAGAAGG K K N C K V - N K C A - S - R E Q G R R K K I V R F K I N V H R V E G N K G E G K K K L - G L K - M C I E L K G T R E K . . . . . . 63536 CACCATATATGTTTCAACTTCCATCATCAAAAGGTGCTTGCACAAGAAAAAATAATGTTC H H I C F N F H H Q K V L A Q E K I M F T I Y V S T S I I K R C L H K K K - C S A P Y M F Q L P S S K G A C T R K N N V . . . . . . 63596 TTTTTGTTAATTTCTTAGTTTTCAATTTGTTTGTTTGTTTGGATTTTGTATGAGATAAGC F L L I S - F S I C L F V W I L Y E I S F C - F L S F Q F V C L F G F C M R - A L F V N F L V F N L F V C L D F V - D K . . . . . . 63656 TAAGCTAAGCCTGATTTTGTAAGAGAATAAGCTAAGTATTTGTATTTTGTTTTTTTCTTG - A K P D F V R E - A K Y L Y F V F F L K L S L I L - E N K L S I C I L F F S C L S - A - F C K R I S - V F V F C F F L . . . . . . 63716 TTTCAAAGAAAAGAGAAGATCAAGATATGTCTAGATAAAAAAACAACATAGGTGTAGTTG F Q R K E K I K I C L D K K T T - V - L F K E K R R S R Y V - I K K Q H R C S C V S K K R E D Q D M S R - K N N I G V V . . . . . . 63776 TTTTTGACTAGCAAAACATTTTTGTTTTTACCTTGTAATATCTTGTAAAAGATTGTCAGT F L T S K T F L F L P C N I L - K I V S F - L A K H F C F Y L V I S C K R L S V V F D - Q N I F V F T L - Y L V K D C Q . . . . . 63836 TCATACATAATAATAATAATTAATCTAAGAATAAATAGTTTATAGAA S Y I I I I I N L R I N S L - H T - - - - L I - E - I V Y R F I H N N N N - S K N K - F I E Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-2+_PGL-12_AGS-1_PPS_1 (62222 62456,62572 63458) (frame '2'; 1119 bp, 373 residues) 1 FVIAVFAAII ASQALISGTF AIIQQSLALG CFPRVKIVHT SKKHHGQIYI PEINNLLMIA 61 CVLTTIGFKT TEKLSNAYGI AVVFVMFLTS CFLILVMILI WKTNILLIIV YILIIVSVEL 121 VYLSAVLYKF EQGGYLPVAL ALFLMFIMYV WNYVYRKKYH YELEHKISPE KVKETLDATS 181 SHRLPGLAIF YSELVHGIPP IFKHYVENVP ALHSVLVFAS VKSLPISKVP LEERFLFRRV 241 KPYDLYVFRC VIRYGYNEMR NEEEPIEKLL VERLKNYIKE DYMFSVAANG DNQGETASLI 301 EKDVEVLERA SNMGVVHLVG EQDVVACKGS GVTKRMVINY AYNFLKRNLR QSSNKVFDIP 361 TKRMLKVGMT CEL- PGL 13 (+ strand): 65401 74647 AGS-1 (65401 65426,72134 72166,72798 72823,73337 73365,73875 73884,74592 74647) SCR (e 0.769 d 0.000 a 0.218,e 0.697 d 0.000 a 0.975,e 0.769 d 0.000 a 0.000,e 0.621 d 0.810 a 0.317,e 0.500 d 0.056 a 0.000,e 0.929) Exon 1 65401 65426 ( 26 n); score: 0.769 Intron 1 65427 72133 (6707 n); Pd: 0.000 Pa: 0.218 Exon 2 72134 72166 ( 33 n); score: 0.697 Intron 2 72167 72797 ( 631 n); Pd: 0.000 Pa: 0.975 Exon 3 72798 72823 ( 26 n); score: 0.769 Intron 3 72824 73336 ( 513 n); Pd: 0.000 Pa: 0.000 Exon 4 73337 73365 ( 29 n); score: 0.621 Intron 4 73366 73874 ( 509 n); Pd: 0.810 Pa: 0.317 Exon 5 73875 73884 ( 10 n); score: 0.500 Intron 5 73885 74591 ( 707 n); Pd: 0.056 Pa: 0.000 Exon 6 74592 74647 ( 56 n); score: 0.929 PGS (65401 65426,72134 72166,72798 72823,73337 73365,73875 73884,74592 74647) SGN-U313537+ 3-phase translation of AGS-1 (+strand): . . . : . . . : 65401 ATATAATAAAATATGAAATAAAAACT : TTCATCTTTGTGCGTGGTGGAGCAATTAAAATC : C I - - N M K - K L : S S L C V V E Q L K S : Y N K I - N K N : F H L C A W W S N - N : P I I K Y E I K T : F I F V R G G A I K I : . . . : . . . : 72799 ATGTACATGTTTGTATGGAACAAAC : TCATTTTTTTTCTCAAACCTTCTATAGAG : AATGGA M Y M F V W N K : L I F F L K P S I E : N G C T C L Y G T N : S F F F S N L L - R : M E H V H V C M E Q T : H F F S Q T F Y R : E W . : . . . . . 73881 GCAA : GTGTTATAAAATCAAGCTCATAAGAATATAATCATAAAGAGAAGAAGATAAGAGAA A : S V I K S S S - E Y N H K E K K I R E Q : V L - N Q A H K N I I I K R R R - E S K : C Y K I K L I R I - S - R E E D K R Maximal non-overlapping open reading frames (>= 64 codons): none AGS-2 (70795 71891) SCR (e 0.958) Exon 1 70795 71891 (1097 n); score: 0.958 PGS (70795 71891) SGN-U322792- 3-phase translation of AGS-2 (+strand): . . . . . . 70795 AGTAAATTTACTGCAAAACAAATATCTGGTCAAATATTGTTAGTAAGATGCACTAGTGCC S K F T A K Q I S G Q I L L V R C T S A V N L L Q N K Y L V K Y C - - D A L V P - I Y C K T N I W S N I V S K M H - C . . . . . . 70855 CTGATTGCACCAAGATAGAGTTTCATCACCAAGAAACTGTTCATCCTTCTTTTGAGATCA L I A P R - S F I T K K L F I L L L R S - L H Q D R V S S P R N C S S F F - D Q P D C T K I E F H H Q E T V H P S F E I . . . . . . 70915 AACTGAATCATCATTTATGTCAAGTGATCTCATAATCATTGGGGTACTCAATGAATGCGA N - I I I Y V K - S H N H W G T Q - M R T E S S F M S S D L I I I G V L N E C E K L N H H L C Q V I S - S L G Y S M N A . . . . . . 70975 GATATCCACATAAAATTGCATCAAAATTTTTCAGTGTATGTGCACTGTTAGACAAATATT D I H I K L H Q N F S V Y V H C - T N I I S T - N C I K I F Q C M C T V R Q I F R Y P H K I A S K F F S V C A L L D K Y . . . . . . 71035 TTATTTGTCAAATTAACAATCTGTAGGTCTAGACAAAATTTTGTCTTACCAAGACCTTTT L F V K L T I C R S R Q N F V L P R P F Y L S N - Q S V G L D K I L S Y Q D L F F I C Q I N N L - V - T K F C L T K T F . . . . . . 71095 CATAAAAATACAAGGAAAATTAGGGTCATTCTTTTGTACCCTTTTTCTTCCTTGACTATT H K N T R K I R V I L L Y P F S S L T I I K I Q G K L G S F F C T L F L P - L F S - K Y K E N - G H S F V P F F F L D Y . . . . . . 71155 TCAATCCATATAAGGATTCTTGAAGCTTTATTGGAAAAGTTTCTTTCAAACTCTTATATG S I H I R I L E A L L E K F L S N S Y M Q S I - G F L K L Y W K S F F Q T L I C F N P Y K D S - S F I G K V S F K L L Y . . . . . . 71215 CTTCAGACACCTCGATTGTTTCAAGAATTTTCATATAACCTTCATTGTCTAGCTAGACAT L Q T P R L F Q E F S Y N L H C L A R H F R H L D C F K N F H I T F I V - L D I A S D T S I V S R I F I - P S L S S - T . . . . . . 71275 TACAATTATTCATTTACATGCATTCAAGTTTTCATATGTTGCCAGATCAAAACAAACCTC Y N Y S F T C I Q V F I C C Q I K T N L T I I H L H A F K F S Y V A R S K Q T S L Q L F I Y M H S S F H M L P D Q N K P . . . . . . 71335 ATTTCATCCACCAAAGGAGAAAACATCTCCACTAATGTCAGGATTTTTGCGATAAACCTT I S S T K G E N I S T N V R I F A I N L F H P P K E K T S P L M S G F L R - T F H F I H Q R R K H L H - C Q D F C D K P . . . . . . 71395 TATGCCCCAAGGCACAAACCGTATTTGATATCTTACGGTTATACTATCGCATAATAATTT Y A P R H K P Y L I S Y G Y T I A - - F M P Q G T N R I - Y L T V I L S H N N L L C P K A Q T V F D I L R L Y Y R I I I . . . . . . 71455 ATTTGTACCCCACTGACATTATATCTTGATTTATGGACTATCAGTCCAAAATGTCACTTT I C T P L T L Y L D L W T I S P K C H F F V P H - H Y I L I Y G L S V Q N V T F Y L Y P T D I I S - F M D Y Q S K M S L . . . . . . 71515 TCTAAGTGAAACAAAATATAATTGAATTGTGCATTTTACTTGGCCAACCATTTATCTGTT S K - N K I - L N C A F Y L A N H L S V L S E T K Y N - I V H F T W P T I Y L F F - V K Q N I I E L C I L L G Q P F I C . . . . . . 71575 CACACTATATGACAGATTTGAATTCAAGATCCTCATCACAATTTATATCATTGAGCGCTA H T I - Q I - I Q D P H H N L Y H - A L T L Y D R F E F K I L I T I Y I I E R Y S H Y M T D L N S R S S S Q F I S L S A . . . . . . 71635 CCTCATATCAAAGATACCTTCGACGGTTATTTGATATCGATTCCAACAGGACATAACTTA P H I K D T F D G Y L I S I P T G H N L L I S K I P S T V I - Y R F Q Q D I T Y T S Y Q R Y L R R L F D I D S N R T - L . . . . . . 71695 TCGAGATCTCTTCATTTTTCATTATTTCAGGTACCTGAACCTTTTTCAAAATTTTATGAA S R S L H F S L F Q V P E P F S K F Y E R D L F I F H Y F R Y L N L F Q N F M K I E I S S F F I I S G T - T F F K I L - . . . . . . 71755 GTGTTATGTCAATGTGCTCTTTTATAGCACTTGCCTCATTATCATGACCATATTGATCAT V L C Q C A L L - H L P H Y H D H I D H C Y V N V L F Y S T C L I I M T I L I I S V M S M C S F I A L A S L S - P Y - S . . . . . . 71815 TTGCTCCTTCCTTCTTCAAGGAATTTTATATTTGGAATCGATTGGTCTATTACGCTTCCG L L L P S S R N F I F G I D W S I T L P C S F L L Q G I L Y L E S I G L L R F R F A P S F F K E F Y I W N R L V Y Y A S . . 71875 GCGTATCATAGACTCTG A Y H R L R I I D S G V S - T L Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-2+_PGL-13_AGS-2_PPS_1 (71026 71448) (frame '1'; 420 bp, 140 residues) 1 TNILFVKLTI CRSRQNFVLP RPFHKNTRKI RVILLYPFSS LTISIHIRIL EALLEKFLSN 61 SYMLQTPRLF QEFSYNLHCL ARHYNYSFTC IQVFICCQIK TNLISSTKGE NISTNVRIFA 121 INLYAPRHKP YLISYGYTIA - >C09HBa0099P03.1-2+_PGL-13_AGS-2_PPS_2 (71669 71890) (frame '2'; 222 bp, 74 residues) 1 YRFQQDITYR DLFIFHYFRY LNLFQNFMKC YVNVLFYSTC LIIMTILIIC SFLLQGILYL 61 ESIGLLRFRR IIDS 3-phase translation of AGS-2 (-strand): . . . . . . 71891 CAGAGTCTATGATACGCCGGAAGCGTAATAGACCAATCGATTCCAAATATAAAATTCCTT Q S L - Y A G S V I D Q S I P N I K F L R V Y D T P E A - - T N R F Q I - N S L E S M I R R K R N R P I D S K Y K I P . . . . . . 71831 GAAGAAGGAAGGAGCAAATGATCAATATGGTCATGATAATGAGGCAAGTGCTATAAAAGA E E G R S K - S I W S - - - G K C Y K R K K E G A N D Q Y G H D N E A S A I K E - R R K E Q M I N M V M I M R Q V L - K . . . . . . 71771 GCACATTGACATAACACTTCATAAAATTTTGAAAAAGGTTCAGGTACCTGAAATAATGAA A H - H N T S - N F E K G S G T - N N E H I D I T L H K I L K K V Q V P E I M K S T L T - H F I K F - K R F R Y L K - - . . . . . . 71711 AAATGAAGAGATCTCGATAAGTTATGTCCTGTTGGAATCGATATCAAATAACCGTCGAAG K - R D L D K L C P V G I D I K - P S K N E E I S I S Y V L L E S I S N N R R R K M K R S R - V M S C W N R Y Q I T V E . . . . . . 71651 GTATCTTTGATATGAGGTAGCGCTCAATGATATAAATTGTGATGAGGATCTTGAATTCAA V S L I - G S A Q - Y K L - - G S - I Q Y L - Y E V A L N D I N C D E D L E F K G I F D M R - R S M I - I V M R I L N S . . . . . . 71591 ATCTGTCATATAGTGTGAACAGATAAATGGTTGGCCAAGTAAAATGCACAATTCAATTAT I C H I V - T D K W L A K - N A Q F N Y S V I - C E Q I N G W P S K M H N S I I N L S Y S V N R - M V G Q V K C T I Q L . . . . . . 71531 ATTTTGTTTCACTTAGAAAAGTGACATTTTGGACTGATAGTCCATAAATCAAGATATAAT I L F H L E K - H F G L I V H K S R Y N F C F T - K S D I L D - - S I N Q D I M Y F V S L R K V T F W T D S P - I K I - . . . . . . 71471 GTCAGTGGGGTACAAATAAATTATTATGCGATAGTATAACCGTAAGATATCAAATACGGT V S G V Q I N Y Y A I V - P - D I K Y G S V G Y K - I I M R - Y N R K I S N T V C Q W G T N K L L C D S I T V R Y Q I R . . . . . . 71411 TTGTGCCTTGGGGCATAAAGGTTTATCGCAAAAATCCTGACATTAGTGGAGATGTTTTCT L C L G A - R F I A K I L T L V E M F S C A L G H K G L S Q K S - H - W R C F L F V P W G I K V Y R K N P D I S G D V F . . . . . . 71351 CCTTTGGTGGATGAAATGAGGTTTGTTTTGATCTGGCAACATATGAAAACTTGAATGCAT P L V D E M R F V L I W Q H M K T - M H L W W M K - G L F - S G N I - K L E C M S F G G - N E V C F D L A T Y E N L N A . . . . . . 71291 GTAAATGAATAATTGTAATGTCTAGCTAGACAATGAAGGTTATATGAAAATTCTTGAAAC V N E - L - C L A R Q - R L Y E N S - N - M N N C N V - L D N E G Y M K I L E T C K - I I V M S S - T M K V I - K F L K . . . . . . 71231 AATCGAGGTGTCTGAAGCATATAAGAGTTTGAAAGAAACTTTTCCAATAAAGCTTCAAGA N R G V - S I - E F E R N F S N K A S R I E V S E A Y K S L K E T F P I K L Q E Q S R C L K H I R V - K K L F Q - S F K . . . . . . 71171 ATCCTTATATGGATTGAAATAGTCAAGGAAGAAAAAGGGTACAAAAGAATGACCCTAATT I L I W I E I V K E E K G Y K R M T L I S L Y G L K - S R K K K G T K E - P - F N P Y M D - N S Q G R K R V Q K N D P N . . . . . . 71111 TTCCTTGTATTTTTATGAAAAGGTCTTGGTAAGACAAAATTTTGTCTAGACCTACAGATT F L V F L - K G L G K T K F C L D L Q I S L Y F Y E K V L V R Q N F V - T Y R L F P C I F M K R S W - D K I L S R P T D . . . . . . 71051 GTTAATTTGACAAATAAAATATTTGTCTAACAGTGCACATACACTGAAAAATTTTGATGC V N L T N K I F V - Q C T Y T E K F - C L I - Q I K Y L S N S A H T L K N F D A C - F D K - N I C L T V H I H - K I L M . . . . . . 70991 AATTTTATGTGGATATCTCGCATTCATTGAGTACCCCAATGATTATGAGATCACTTGACA N F M W I S R I H - V P Q - L - D H L T I L C G Y L A F I E Y P N D Y E I T - H Q F Y V D I S H S L S T P M I M R S L D . . . . . . 70931 TAAATGATGATTCAGTTTGATCTCAAAAGAAGGATGAACAGTTTCTTGGTGATGAAACTC - M M I Q F D L K R R M N S F L V M K L K - - F S L I S K E G - T V S W - - N S I N D D S V - S Q K K D E Q F L G D E T . . . . . . 70871 TATCTTGGTGCAATCAGGGCACTAGTGCATCTTACTAACAATATTTGACCAGATATTTGT Y L G A I R A L V H L T N N I - P D I C I L V Q S G H - C I L L T I F D Q I F V L S W C N Q G T S A S Y - Q Y L T R Y L . . 70811 TTTGCAGTAAATTTACT F A V N L L Q - I Y F C S K F T Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-2-_PGL-13_AGS-2_PPS_1 (71839 71642) (frame '2'; 195 bp, 65 residues) 1 NSLKKEGAND QYGHDNEASA IKEHIDITLH KILKKVQVPE IMKNEEISIS YVLLESISNN 61 RRRYL- PGL 14 (- strand): 74602 73355 AGS-1 (74177 73355) SCR (e 0.887) Exon 1 74177 73355 ( 823 n); score: 0.887 PGS (74157 73355) SGN-U329457+ PGS (74177 74090) SGN-U312402- 3-phase translation of AGS-1 (-strand): . . . . . . 74177 CATAATGAAAAATATTTGAGGATAAGACGATAGATTCAAGTTCTATCGCACCAAAAGGGT H N E K Y L R I R R - I Q V L S H Q K G I M K N I - G - D D R F K F Y R T K R V - - K I F E D K T I D S S S I A P K G . . . . . . 74117 AAGACATCAATTTGAGTTTTGATGCACCGCGATTGGGTTCAAGTCCCATAGAATTATGGC K T S I - V L M H R D W V Q V P - N Y G R H Q F E F - C T A I G F K S H R I M A - D I N L S F D A P R L G S S P I E L W . . . . . . 74057 GTAACACGCCTTATATGGATAAGACATTGGGTTTGAGTCAATGCACCATAGTGATGATAT V T R L I W I R H W V - V N A P - - - Y - H A L Y G - D I G F E S M H H S D D I R N T P Y M D K T L G L S Q C T I V M I . . . . . . 73997 AATGATGACTAAGGCTTTGATGAAAAGTCTTGAAATTCGTCCTGTTGAATTTGCTCCATT N D D - G F D E K S - N S S C - I C S I M M T K A L M K S L E I R P V E F A P F - - - L R L - - K V L K F V L L N L L H . . . . . . 73937 CCTTGAAAAGAATGTGGTAGCAACATACGATAATTTTAAAATTGCATGTTCACTTGCTCC P - K E C G S N I R - F - N C M F T C S L E K N V V A T Y D N F K I A C S L A P S L K R M W - Q H T I I L K L H V H L L . . . . . . 73877 ATTCTATCGTATGAATGTGGTAGCAGTAAATAGTTAGTCTGAAAGATGACAAATGATTAA I L S Y E C G S S K - L V - K M T N D - F Y R M N V V A V N S - S E R - Q M I N H S I V - M W - Q - I V S L K D D K - L . . . . . . 73817 CGATATGAAAAAAGGCATGAAGGGACGGAATTATAATAGTCATCATTGTGGTAATTATAA R Y E K R H E G T E L - - S S L W - L - D M K K G M K G R N Y N S H H C G N Y K T I - K K A - R D G I I I V I I V V I I . . . . . . 73757 AAGAAAGAACACTATGGGTTCTCCAAATAGTCCTTCGAAATGTGAAGACAGTTTTCATTA K K E H Y G F S K - S F E M - R Q F S L R K N T M G S P N S P S K C E D S F H Y K E R T L W V L Q I V L R N V K T V F I . . . . . . 73697 TCAAAATGGTACGAAAGGTCATTAGACTTGTGAATGTTGTCAACCCAATAATTTGACAAG S K W Y E R S L D L - M L S T Q - F D K Q N G T K G H - T C E C C Q P N N L T S I K M V R K V I R L V N V V N P I I - Q . . . . . . 73637 TTTATAAATCCTCTATCATGATACAAGAAAATAAAGTGATGGTACACTTGATCATTCAAA F I N P L S - Y K K I K - W Y T - S F K L - I L Y H D T R K - S D G T L D H S K V Y K S S I M I Q E N K V M V H L I I Q . . . . . . 73577 ATGATGTTGAGGTGTGTCATGGAAATATGATGAATTTTAAAGTCATGACAATGTGCTTAT M M L R C V M E I - - I L K S - Q C A Y - C - G V S W K Y D E F - S H D N V L I N D V E V C H G N M M N F K V M T M C L . . . . . . 73517 AATGCAAATTGATGAAGTACATATAAATAATTAGTTCATTCCTTGAAGAGAATGTGACTT N A N - - S T Y K - L V H S L K R M - L M Q I D E V H I N N - F I P - R E C D L - C K L M K Y I - I I S S F L E E N V T . . . . . . 73457 ATGATGAGTATCGTGAATGCTCGACCATTCTATAAGAGAATGGGTCCAGATGTGAGCAAA M M S I V N A R P F Y K R M G P D V S K - - V S - M L D H S I R E W V Q M - A N Y D E Y R E C S T I L - E N G S R C E Q . . . . . 73397 TCATTATGGGTTGGATATATGTCACAACTCACCTCTATAGAAG S L W V G Y M S Q L T S I E H Y G L D I C H N S P L - K I I M G W I Y V T T H L Y R Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-2-_PGL-14_AGS-1_PPS_1 (74035 73841) (frame '2'; 192 bp, 64 residues) 1 DIGFESMHHS DDIMMTKALM KSLEIRPVEF APFLEKNVVA TYDNFKIACS LAPFYRMNVV 61 AVNS- 3-phase translation of AGS-1 (+strand): . . . . . . 73355 CTTCTATAGAGGTGAGTTGTGACATATATCCAACCCATAATGATTTGCTCACATCTGGAC L L - R - V V T Y I Q P I M I C S H L D F Y R G E L - H I S N P - - F A H I W T S I E V S C D I Y P T H N D L L T S G . . . . . . 73415 CCATTCTCTTATAGAATGGTCGAGCATTCACGATACTCATCATAAGTCACATTCTCTTCA P F S Y R M V E H S R Y S S - V T F S S H S L I E W S S I H D T H H K S H S L Q P I L L - N G R A F T I L I I S H I L F . . . . . . 73475 AGGAATGAACTAATTATTTATATGTACTTCATCAATTTGCATTATAAGCACATTGTCATG R N E L I I Y M Y F I N L H Y K H I V M G M N - L F I C T S S I C I I S T L S - K E - T N Y L Y V L H Q F A L - A H C H . . . . . . 73535 ACTTTAAAATTCATCATATTTCCATGACACACCTCAACATCATTTTGAATGATCAAGTGT T L K F I I F P - H T S T S F - M I K C L - N S S Y F H D T P Q H H F E - S S V D F K I H H I S M T H L N I I L N D Q V . . . . . . 73595 ACCATCACTTTATTTTCTTGTATCATGATAGAGGATTTATAAACTTGTCAAATTATTGGG T I T L F S C I M I E D L - T C Q I I G P S L Y F L V S - - R I Y K L V K L L G Y H H F I F L Y H D R G F I N L S N Y W . . . . . . 73655 TTGACAACATTCACAAGTCTAATGACCTTTCGTACCATTTTGATAATGAAAACTGTCTTC L T T F T S L M T F R T I L I M K T V F - Q H S Q V - - P F V P F - - - K L S S V D N I H K S N D L S Y H F D N E N C L . . . . . . 73715 ACATTTCGAAGGACTATTTGGAGAACCCATAGTGTTCTTTCTTTTATAATTACCACAATG T F R R T I W R T H S V L S F I I T T M H F E G L F G E P I V F F L L - L P Q - H I S K D Y L E N P - C S F F Y N Y H N . . . . . . 73775 ATGACTATTATAATTCCGTCCCTTCATGCCTTTTTTCATATCGTTAATCATTTGTCATCT M T I I I P S L H A F F H I V N H L S S - L L - F R P F M P F F I S L I I C H L D D Y Y N S V P S C L F S Y R - S F V I . . . . . . 73835 TTCAGACTAACTATTTACTGCTACCACATTCATACGATAGAATGGAGCAAGTGAACATGC F R L T I Y C Y H I H T I E W S K - T C S D - L F T A T T F I R - N G A S E H A F Q T N Y L L L P H S Y D R M E Q V N M . . . . . . 73895 AATTTTAAAATTATCGTATGTTGCTACCACATTCTTTTCAAGGAATGGAGCAAATTCAAC N F K I I V C C Y H I L F K E W S K F N I L K L S Y V A T T F F S R N G A N S T Q F - N Y R M L L P H S F Q G M E Q I Q . . . . . . 73955 AGGACGAATTTCAAGACTTTTCATCAAAGCCTTAGTCATCATTATATCATCACTATGGTG R T N F K T F H Q S L S H H Y I I T M V G R I S R L F I K A L V I I I S S L W C Q D E F Q D F S S K P - S S L Y H H Y G . . . . . . 74015 CATTGACTCAAACCCAATGTCTTATCCATATAAGGCGTGTTACGCCATAATTCTATGGGA H - L K P N V L S I - G V L R H N S M G I D S N P M S Y P Y K A C Y A I I L W D A L T Q T Q C L I H I R R V T P - F Y G . . . . . . 74075 CTTGAACCCAATCGCGGTGCATCAAAACTCAAATTGATGTCTTACCCTTTTGGTGCGATA L E P N R G A S K L K L M S Y P F G A I L N P I A V H Q N S N - C L T L L V R - T - T Q S R C I K T Q I D V L P F W C D . . . . . 74135 GAACTTGAATCTATCGTCTTATCCTCAAATATTTTTCATTATG E L E S I V L S S N I F H Y N L N L S S Y P Q I F F I M R T - I Y R L I L K Y F S L Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-2+_PGL-14_AGS-1_PPS_1 (73637 73888) (frame '1'; 249 bp, 83 residues) 1 TCQIIGLTTF TSLMTFRTIL IMKTVFTFRR TIWRTHSVLS FIITTMMTII IPSLHAFFHI 61 VNHLSSFRLT IYCYHIHTIE WSK- >C09HBa0099P03.1-2+_PGL-14_AGS-1_PPS_2 (73875 74111) (frame '2'; 234 bp, 78 residues) 1 NGASEHAILK LSYVATTFFS RNGANSTGRI SRLFIKALVI IISSLWCIDS NPMSYPYKAC 61 YAIILWDLNP IAVHQNSN- >C09HBa0099P03.1-2+_PGL-14_AGS-1_PPS_3 (73522 73746) (frame '0'; 222 bp, 74 residues) 1 AHCHDFKIHH ISMTHLNIIL NDQVYHHFIF LYHDRGFINL SNYWVDNIHK SNDLSYHFDN 61 ENCLHISKDY LENP- AGS-2 (74602 74167,74089 73985) SCR (e 0.929 d 0.000 a 0.000,e 0.819) Exon 1 74602 74167 ( 436 n); score: 0.929 Intron 1 74166 74090 ( 77 n); Pd: 0.000 Pa: 0.000 Exon 2 74089 73985 ( 105 n); score: 0.819 PGS (74247 74167,74089 73985) SGN-U312404- PGS (74602 74218) SGN-U329525+ 3-phase translation of AGS-2 (-strand): . . . . . . 74602 TTTTATAACACGTTATCAGCACGAGTCTCTATCTAATTGAGAGTGGGTTCTTGTTTTTCT F Y N T L S A R V S I - L R V G S C F S F I T R Y Q H E S L S N - E W V L V F L L - H V I S T S L Y L I E S G F L F F . . . . . . 74542 TTAGCTCATATTTTGGTTGATCTCGTTATGAAGATCAAAGTATTATATGCATAAAATCAG L A H I L V D L V M K I K V L Y A - N Q - L I F W L I S L - R S K Y Y M H K I R F S S Y F G - S R Y E D Q S I I C I K S . . . . . . 74482 GTATGTATTTTCTAAAATATTTTATGATTTAGAATAAAATAGTATTCAATCACTAATAAT V C I F - N I L - F R I K - Y S I T N N Y V F S K I F Y D L E - N S I Q S L I I G M Y F L K Y F M I - N K I V F N H - - . . . . . . 74422 TATCAGGAACCGAAGACATATACTACTATTGGTTGAATATGACATGCTATACAAAACTTC Y Q E P K T Y T T I G - I - H A I Q N F I R N R R H I L L L V E Y D M L Y K T S L S G T E D I Y Y Y W L N M T C Y T K L . . . . . . 74362 TTAAATAGAAGAAGGTATAAAATTTTAATAGCATGAATTTGAATGTTATTTTGATTGTTT L N R R R Y K I L I A - I - M L F - L F - I E E G I K F - - H E F E C Y F D C F L K - K K V - N F N S M N L N V I L I V . . . . . . 74302 TGAATGACTCAAAGGGTGAGTCATGTTGTACTTCGGTTTATTATACCGTTGGTTAAGGGT - M T Q R V S H V V L R F I I P L V K G E - L K G - V M L Y F G L L Y R W L R V L N D S K G E S C C T S V Y Y T V G - G . . . . . . 74242 AAGACGATGAATTCAAGTTTCATCGCACCAAAAGGGTAAGACATCAGGTTAAAGTCCGGA K T M N S S F I A P K G - D I R L K S G R R - I Q V S S H Q K G K T S G - S P D - D D E F K F H R T K R V R H Q V K V R . . : . . . . 74182 TGCACCATAATGAAAA : GCGATTGGGTTCAAGTCCCATAGAATTATGGCGTAACACGCCTT C T I M K : S D W V Q V P - N Y G V T R L A P - - K : A I G F K S H R I M A - H A L M H H N E K : R L G S S P I E L W R N T P . . . . . . 74045 ATATGGATAAGACATTGGGTTTGAGTCAATGCACCATAGTGATGATATAATGATGACTAA I W I R H W V - V N A P - - - Y N D D - Y G - D I G F E S M H H S D D I M M T K Y M D K T L G L S Q C T I V M I - - - L . 73985 G Maximal non-overlapping open reading frames (>= 64 codons): none PGL 15 (- strand): 75175 74938 AGS-1 (75175 74938) SCR (e 0.779) Exon 1 75175 74938 ( 238 n); score: 0.779 PGS (75175 74938) SGN-U329700+ 3-phase translation of AGS-1 (-strand): . . . . . . 75175 AAAAATATTTACTTTGATATAATTTTTAAATATTTCTTATATTTAAGTTTGAAACTTAGA K N I Y F D I I F K Y F L Y L S L K L R K I F T L I - F L N I S Y I - V - N L E K Y L L - Y N F - I F L I F K F E T - . . . . . . 75115 ATTTTGGATGGTTCCATAAATATTATAGTCCACATGTGTTGGTAATTATAATAAAACTCA I L D G S I N I I V H M C W - L - - N S F W M V P - I L - S T C V G N Y N K T Q N F G W F H K Y Y S P H V L V I I I K L . . . . . . 75055 AATCAAAATTAAATTAATATTGATGCAAAAAGGAAATCTATTTAGCATTAAGAATGACAA N Q N - I N I D A K R K S I - H - E - Q I K I K L I L M Q K G N L F S I K N D N K S K L N - Y - C K K E I Y L A L R M T . . . . . . 74995 TAATATTAAATATTTGTTCTTTGATTTTACGTTGATTTAGACAATTGAAATACATAAT - Y - I F V L - F Y V D L D N - N T - N I K Y L F F D F T L I - T I E I H N I I L N I C S L I L R - F R Q L K Y I Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-1 (+strand): . . . . . . 74938 ATTATGTATTTCAATTGTCTAAATCAACGTAAAATCAAAGAACAAATATTTAATATTATT I M Y F N C L N Q R K I K E Q I F N I I L C I S I V - I N V K S K N K Y L I L L Y V F Q L S K S T - N Q R T N I - Y Y . . . . . . 74998 GTCATTCTTAATGCTAAATAGATTTCCTTTTTGCATCAATATTAATTTAATTTTGATTTG V I L N A K - I S F L H Q Y - F N F D L S F L M L N R F P F C I N I N L I L I - C H S - C - I D F L F A S I L I - F - F . . . . . . 75058 AGTTTTATTATAATTACCAACACATGTGGACTATAATATTTATGGAACCATCCAAAATTC S F I I I T N T C G L - Y L W N H P K F V L L - L P T H V D Y N I Y G T I Q N S E F Y Y N Y Q H M W T I I F M E P S K I . . . . . . 75118 TAAGTTTCAAACTTAAATATAAGAAATATTTAAAAATTATATCAAAGTAAATATTTTT - V S N L N I R N I - K L Y Q S K Y F K F Q T - I - E I F K N Y I K V N I F L S F K L K Y K K Y L K I I S K - I F Maximal non-overlapping open reading frames (>= 64 codons): none PGL 16 (+ strand): 77022 77836 AGS-1 (77022 77836) SCR (e 0.914) Exon 1 77022 77836 ( 815 n); score: 0.914 PGS (77022 77630) SGN-U339745- PGS (77184 77679) SGN-U339744- PGS (77434 77836) SGN-U343802+ 3-phase translation of AGS-1 (+strand): . . . . . . 77022 TATTTGTTATTTGTAGTTTTGATAATTTGACAAACTTAGGGACCTCATAAAGGTACCGGG Y L L F V V L I I - Q T - G P H K G T G I C Y L - F - - F D K L R D L I K V P G F V I C S F D N L T N L G T S - R Y R . . . . . . 77082 TTTTCTACTTGAGTATTGCAGGTGTCTTGTTTGAAGTATTGCAGATGAAACAAACTCAGG F S T - V L Q V S C L K Y C R - N K L R F L L E Y C R C L V - S I A D E T N S G V F Y L S I A G V L F E V L Q M K Q T Q . . . . . . 77142 GACCTAGTAGAGGTACCAGGCCCTCATGATGATGAGCCAGTCAACTGCCAATTGGAGACG D L V E V P G P H D D E P V N C Q L E T T - - R Y Q A L M M M S Q S T A N W R R G P S R G T R P S - - - A S Q L P I G D . . . . . . 77202 TACAGAAAGTAGGTGCACACTTCTCGATGGCGTCAGAAAAGTGTGACCTTTCCAACCAAT Y R K - V H T S R W R Q K S V T F P T N T E S R C T L L D G V R K V - P F Q P M V Q K V G A H F S M A S E K C D L S N Q . . . . . . 77262 GGCAAAGAAGTTTAGGAAAGTGACTTGCCCATATAAAAGGCACTTTCCTAAACATTTGTC G K E V - E S D L P I - K A L S - T F V A K K F R K V T C P Y K R H F P K H L S W Q R S L G K - L A H I K G T F L N I C . . . . . . 77322 TTTCAGTTTTTGCAACTTGATAAAAACTTTTCAAGAACTCTTGCAAAGGCTACACAAAGC F Q F L Q L D K N F S R T L A K A T Q S F S F C N L I K T F Q E L L Q R L H K A L S V F A T - - K L F K N S C K G Y T K . . . . . . 77382 AAAGGCAGAATCATTCCTAGGACAACAGCAACATCTGGAGCTTTTCGTCTAGTTGTTTAG K G R I I P R T T A T S G A F R L V V - K A E S F L G Q Q Q H L E L F V - L F R Q R Q N H S - D N S N I W S F S S S C L . . . . . . 77442 GAATTTATTCTCTTGTTCTTAAATTGTAAACCACTCCTAAATCTATAAAGGAATTGGTGT E F I L L F L N C K P L L N L - R N W C N L F S C S - I V N H S - I Y K G I G V G I Y S L V L K L - T T P K S I K E L V . . . . . . 77502 GTTGTGTTAAAAGTCTAGGTTGTCCAGGTGGGATAGCTTAGTGGGTAGATTGTTTTTCTA V V L K V - V V Q V G - L S G - I V F L L C - K S R L S R W D S L V G R L F F Y C C V K S L G C P G G I A - W V D C F S . . . . . . 77562 CTTTGGCTTGTTGAGCAATAGAGGTCTATTGCTTAACGGTAAGATTGATAACTCTTTCTT L W L V E Q - R S I A - R - D - - L F L F G L L S N R G L L L N G K I D N S F L T L A C - A I E V Y C L T V R L I T L S . . . . . . 77622 ACGTTTGGTGTAATCGTGTTTCGCTTTTGCTTTTGAAGATTAGTGAAAACGATTGAAAAT T F G V I V F R F C F - R L V K T I E N R L V - S C F A F A F E D - - K R L K I Y V W C N R V S L L L L K I S E N D - K . . . . . . 77682 CCTGTGAGATAGGTCGTGGTTTTACTCCCTTAAGCAAGGAGGTTTCCACGTAAAATCATT P V R - V V V L L P - A R R F P R K I I L - D R S W F Y S L K Q G G F H V K S L S C E I G R G F T P L S K E V S T - N H . . . . . . 77742 GTGTTAATTTTACTGCATTTAACTTTCTGGTAATTTTCTGAAGTAAAGTAAGAGACCTGG V L I L L H L T F W - F S E V K - E T W C - F Y C I - L S G N F L K - S K R P G C V N F T A F N F L V I F - S K V R D L . . . . 77802 TCCATTACTAATAAGTGAAGGCATAAATTCTATCA S I T N K - R H K F Y P L L I S E G I N S I V H Y - - V K A - I L S Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-1 (-strand): . . . . . . 77836 TGATAGAATTTATGCCTTCACTTATTAGTAATGGACCAGGTCTCTTACTTTACTTCAGAA - - N L C L H L L V M D Q V S Y F T S E D R I Y A F T Y - - W T R S L T L L Q K I E F M P S L I S N G P G L L L Y F R . . . . . . 77776 AATTACCAGAAAGTTAAATGCAGTAAAATTAACACAATGATTTTACGTGGAAACCTCCTT N Y Q K V K C S K I N T M I L R G N L L I T R K L N A V K L T Q - F Y V E T S L K L P E S - M Q - N - H N D F T W K P P . . . . . . 77716 GCTTAAGGGAGTAAAACCACGACCTATCTCACAGGATTTTCAATCGTTTTCACTAATCTT A - G S K T T T Y L T G F S I V F T N L L K G V K P R P I S Q D F Q S F S L I F C L R E - N H D L S H R I F N R F H - S . . . . . . 77656 CAAAAGCAAAAGCGAAACACGATTACACCAAACGTAAGAAAGAGTTATCAATCTTACCGT Q K Q K R N T I T P N V R K S Y Q S Y R K S K S E T R L H Q T - E R V I N L T V S K A K A K H D Y T K R K K E L S I L P . . . . . . 77596 TAAGCAATAGACCTCTATTGCTCAACAAGCCAAAGTAGAAAAACAATCTACCCACTAAGC - A I D L Y C S T S Q S R K T I Y P L S K Q - T S I A Q Q A K V E K Q S T H - A L S N R P L L L N K P K - K N N L P T K . . . . . . 77536 TATCCCACCTGGACAACCTAGACTTTTAACACAACACACCAATTCCTTTATAGATTTAGG Y P T W T T - T F N T T H Q F L Y R F R I P P G Q P R L L T Q H T N S F I D L G L S H L D N L D F - H N T P I P L - I - . . . . . . 77476 AGTGGTTTACAATTTAAGAACAAGAGAATAAATTCCTAAACAACTAGACGAAAAGCTCCA S G L Q F K N K R I N S - T T R R K A P V V Y N L R T R E - I P K Q L D E K L Q E W F T I - E Q E N K F L N N - T K S S . . . . . . 77416 GATGTTGCTGTTGTCCTAGGAATGATTCTGCCTTTGCTTTGTGTAGCCTTTGCAAGAGTT D V A V V L G M I L P L L C V A F A R V M L L L S - E - F C L C F V - P L Q E F R C C C C P R N D S A F A L C S L C K S . . . . . . 77356 CTTGAAAAGTTTTTATCAAGTTGCAAAAACTGAAAGACAAATGTTTAGGAAAGTGCCTTT L E K F L S S C K N - K T N V - E S A F L K S F Y Q V A K T E R Q M F R K V P F S - K V F I K L Q K L K D K C L G K C L . . . . . . 77296 TATATGGGCAAGTCACTTTCCTAAACTTCTTTGCCATTGGTTGGAAAGGTCACACTTTTC Y M G K S L S - T S L P L V G K V T L F I W A S H F P K L L C H W L E R S H F S L Y G Q V T F L N F F A I G W K G H T F . . . . . . 77236 TGACGCCATCGAGAAGTGTGCACCTACTTTCTGTACGTCTCCAATTGGCAGTTGACTGGC - R H R E V C T Y F L Y V S N W Q L T G D A I E K C A P T F C T S P I G S - L A L T P S R S V H L L S V R L Q L A V D W . . . . . . 77176 TCATCATCATGAGGGCCTGGTACCTCTACTAGGTCCCTGAGTTTGTTTCATCTGCAATAC S S S - G P G T S T R S L S L F H L Q Y H H H E G L V P L L G P - V C F I C N T L I I M R A W Y L Y - V P E F V S S A I . . . . . . 77116 TTCAAACAAGACACCTGCAATACTCAAGTAGAAAACCCGGTACCTTTATGAGGTCCCTAA F K Q D T C N T Q V E N P V P L - G P - S N K T P A I L K - K T R Y L Y E V P K L Q T R H L Q Y S S R K P G T F M R S L . . . . 77056 GTTTGTCAAATTATCAAAACTACAAATAACAAATA V C Q I I K T T N N K F V K L S K L Q I T N S L S N Y Q N Y K - Q I Maximal non-overlapping open reading frames (>= 64 codons): >C09HBa0099P03.1-2-_PGL-16_AGS-1_PPS_1 (77351 77145) (frame '0'; 204 bp, 68 residues) 1 KVFIKLQKLK DKCLGKCLLY GQVTFLNFFA IGWKGHTFLT PSRSVHLLSV RLQLAVDWLI 61 IMRAWYLY- ... finished at: Tue Jul 25 01:41:36 2006