GeneSeqer. Version of March 12, 2006. Date run: Mon Jul 24 22:49:23 2006 (Bayesian) Splice site model (species): Arabidopsis thaliana Fast search parameters: MinMatchLen 16, MinQualityHSP 30, MinQualityCHAIN 50. Total number of ESTs: 175665 Total sequence length: 93213537 Minimum sequence length: 89 Maximum sequence length: 1082 Length distribution (number of sequences of specified length): < 100: 1 < 200: 2188 < 300: 8544 < 400: 20465 < 500: 39499 < 600: 49432 < 700: 32872 < 800: 19308 < 900: 3155 < 1000: 193 >=1000: 8 Input file : /tmp/bac-submission-temp-N7Z1D/C06HBa0057J04/C06HBa0057J04.seq.screen ________________________________________________________________________________ Sequence 1: C06HBa0057J04.1-1, from 1 to 2049, both strands analyzed. ... started at: Mon Jul 24 23:12:47 2006 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 4 HitsTableSize = 0 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 4 HitsTableSize = 0 No significant EST matches were found. ... finished at: Mon Jul 24 23:12:57 2006 ________________________________________________________________________________ Sequence 2: C06HBa0057J04.1-2, from 1 to 1052, both strands analyzed. ... started at: Mon Jul 24 23:12:57 2006 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 5 HitsTableSize = 0 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 3 ... matches indexed, elapsed seconds = 4 HitsTableSize = 0 No significant EST matches were found. ... finished at: Mon Jul 24 23:13:05 2006 ________________________________________________________________________________ Sequence 3: C06HBa0057J04.1-3, from 1 to 6814, both strands analyzed. ... started at: Mon Jul 24 23:13:05 2006 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 4 HitsTableSize = 8 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 4 HitsTableSize = 13 ******************************************************************************** EST sequence 3 -strand 730 n (File: SGN-E379982-) 1 AAAGGTAAGT TCATTTCATA CTTCAAGGCC GGGAAGATGT TTAGAAAAGG CTATATTTAC 61 CATCTGATTC GGGTGCATGA CATAAAGGCA GAGGCACTGA CTCTTCAATC AGTCTCGGTA 121 GTTAATGAAT TTCCTGATGT ATTCCCCGAG GAACTTCCAG GCCTTCCTCC AGAACGGGAG 181 ATAGAGTTTA CTATAGATGT ACTGCCAGAT ACCCAACCTA TATCTATACC TCCTTATAGA 241 ATGGCACCTG CTGAGTTGAA AGAATTGAAA GAGCAATTGA GGGATTTACT AGAAAAGGGC 301 TTCATCAGGC CTAGTACGTC ACCTTGGGGA GCACCGGTAC TATTTGTGAG GAAGAAGGAT 361 GGGTCGCTCC GGATGTGCAT TGATTATAGG CAGCTGAACA AAGTAACAAT AATGAACAGG 421 TATCCCCTCC CAAGGATTGA CGATCTATTT GACCAGTTGC AGGGTGCAAA GGGTTTTTCA 481 AAGATAGACT TGCGGTCAGG TTATCATCAG GTACGGGTTA GGGAGGCAGA TATCCCAATG 541 ACGGCATTCC GGACCCGATA TGGGCATTAT GTGTTTAGAG TGTTGTCTTT TGGGCTGACT 601 ATTGCTCCAG CGGTATTCAT GGATTTAATG AATTGAGTAT TTAATCCATT CCTTGATATG 661 TTTGTTATTG GATTTATAGA CGATATTCTG GTCTATTCAC GTTCAGAAGA GGAGCATGAA 721 GACTATTTAA Predicted gene structure (within gDNA segment 3584 to 1177): Exon 1 2614 1885 ( 730 n); cDNA 1 730 ( 730 n); score: 0.953 MATCH C06HBa0057J04.1-3- SGN-E379982- 0.953 730 1.000 C PGS_C06HBa0057J04.1-3-_SGN-E379982- (2614 1885) Alignment (genomic DNA sequence = upper lines): AAAGGTAAGT TCATTTAATA CCTCAAGGCC GGTAAGATGG TTAGAAAAGG CTATATTTAC 2555 |||||||||| |||||| ||| | |||||||| || |||||| |||||||||| |||||||||| AAAGGTAAGT TCATTTCATA CTTCAAGGCC GGGAAGATGT TTAGAAAAGG CTATATTTAC 60 CATCTGATTC GAGTGCATGA CATAAAGGCA GAGGCACCGA CTCTTCAATC AGTCCCGGTA 2495 |||||||||| | |||||||| |||||||||| ||||||| || |||||||||| |||| ||||| CATCTGATTC GGGTGCATGA CATAAAGGCA GAGGCACTGA CTCTTCAATC AGTCTCGGTA 120 GTTAATGAAT TTCCTGATGT ATTCCCCGAG GAACTTCCAG GCCTTCCTCC AGAACGGGAG 2435 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTTAATGAAT TTCCTGATGT ATTCCCCGAG GAACTTCCAG GCCTTCCTCC AGAACGGGAG 180 ATAGAGTTTA CTATAGATGT ACTGCCAGAT ACCCAGCCTA TATCTATACC TCCTTATAGA 2375 |||||||||| |||||||||| |||||||||| ||||| |||| |||||||||| |||||||||| ATAGAGTTTA CTATAGATGT ACTGCCAGAT ACCCAACCTA TATCTATACC TCCTTATAGA 240 ATGGCACCTG CTGAGTTGAA AGAATTGAAA GAGCAATTGA GGGATTTGCT AGAAAAGGGC 2315 |||||||||| |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| ATGGCACCTG CTGAGTTGAA AGAATTGAAA GAGCAATTGA GGGATTTACT AGAAAAGGGC 300 TTCATCAGGC CTAGTACGTC ACCTTGGGGA TCACCAGTAC TGTTTGTGAG GAAGAAGGAT 2255 |||||||||| |||||||||| |||||||||| |||| |||| | |||||||| |||||||||| TTCATCAGGC CTAGTACGTC ACCTTGGGGA GCACCGGTAC TATTTGTGAG GAAGAAGGAT 360 GGGTCGCTGC GGATGTGCAT TGATTATAGG CAGTTGAACA AAGTAACAAT AAAGAACAGG 2195 |||||||| | |||||||||| |||||||||| ||| |||||| |||||||||| || ||||||| GGGTCGCTCC GGATGTGCAT TGATTATAGG CAGCTGAACA AAGTAACAAT AATGAACAGG 420 TATCCCCTCC CAAGGATTGA CGATCTACTT GACCGGTTGC AGGGTGCAAA GTGTTTTTCA 2135 |||||||||| |||||||||| ||||||| || |||| ||||| |||||||||| | |||||||| TATCCCCTCC CAAGGATTGA CGATCTATTT GACCAGTTGC AGGGTGCAAA GGGTTTTTCA 480 AAGATAGACT TGCGGTCAGG TTATCATTAG GTGCGGGTAA GGGAGGCAGA TATTCCAAAG 2075 |||||||||| |||||||||| ||||||| || || ||||| | |||||||||| ||| |||| | AAGATAGACT TGCGGTCAGG TTATCATCAG GTACGGGTTA GGGAGGCAGA TATCCCAATG 540 ACAGCATTCC GGACCCGATA TGGGCATTAT GAGTTTAGAG TGCTGTCTTT TGGGCTGACT 2015 || ||||||| |||||||||| |||||||||| | |||||||| || ||||||| |||||||||| ACGGCATTCC GGACCCGATA TGGGCATTAT GTGTTTAGAG TGTTGTCTTT TGGGCTGACT 600 AATGCTCCAG CGGTATTCAT GGATTTAATG AATCGAGTAT TTAAACCATT CCTTGATATG 1955 | |||||||| |||||||||| |||||||||| ||| |||||| |||| ||||| |||||||||| ATTGCTCCAG CGGTATTCAT GGATTTAATG AATTGAGTAT TTAATCCATT CCTTGATATG 660 TTTGTTATTG TATTTATAGA CGATATTCTA GTCTATTCAC GTTCAGAAGA GGAGCATGCA 1895 |||||||||| ||||||||| ||||||||| |||||||||| |||||||||| |||||||| | TTTGTTATTG GATTTATAGA CGATATTCTG GTCTATTCAC GTTCAGAAGA GGAGCATGAA 720 GATCATTTAA 1885 || |||||| GACTATTTAA 730 hqPGS_C06HBa0057J04.1-3-_SGN-E379982- (2614 1885) ******************************************************************************** EST sequence 8 -strand 521 n (File: SGN-E201553-) 1 TACCCAACCT ATATCTATAC CTCCTTATAG AATGGCACCT GCTGAGTTGA AAGAATTGAA 61 AGAGCAATTG AGGGATTTAC TAGAAAAGGG CTTCATCAGG CCTAGTACGT CACCTTGGGG 121 AGCACCGGTA CTATTTGTGA GGAAGAAGGA TGGGTCGCTC CGGATGTGCA TTGATTATAG 181 GCAGCTGAAC AAAGTAACAA TAATGAACAG GTATCCCCTC CCAAGGATTG ACGATCTATT 241 TGACCAGTTG CAGGGTGCAA AGGGTTTTTC AAAGATAGAC TTGCGGTCAG GTTATCATCA 301 GGTACGGGTT AGGGAGGCAG ATATCCCAAT GACGGCATTC CGGACCCGAT ATGGGCATTA 361 TGTGTTTAGA GTGTTGTCTT TTGGGCTGAC TATTGCTCCA GCGGTATTCA TGGATTTAAT 421 GAATTGAGTA TTTAATCCAT TCCTTGATAT GTTTGTTATT GGATTTATAG CCCCTATTAT 481 GGTCTATTCA CGTTCAGAAA AGGAGCATGA AGACTATTTA A Predicted gene structure (within gDNA segment 3078 to 745): Exon 1 2405 1885 ( 521 n); cDNA 1 521 ( 521 n); score: 0.939 MATCH C06HBa0057J04.1-3- SGN-E201553- 0.939 521 1.000 C PGS_C06HBa0057J04.1-3-_SGN-E201553- (2405 1885) Alignment (genomic DNA sequence = upper lines): TACCCAGCCT ATATCTATAC CTCCTTATAG AATGGCACCT GCTGAGTTGA AAGAATTGAA 2346 |||||| ||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TACCCAACCT ATATCTATAC CTCCTTATAG AATGGCACCT GCTGAGTTGA AAGAATTGAA 60 AGAGCAATTG AGGGATTTGC TAGAAAAGGG CTTCATCAGG CCTAGTACGT CACCTTGGGG 2286 |||||||||| |||||||| | |||||||||| |||||||||| |||||||||| |||||||||| AGAGCAATTG AGGGATTTAC TAGAAAAGGG CTTCATCAGG CCTAGTACGT CACCTTGGGG 120 ATCACCAGTA CTGTTTGTGA GGAAGAAGGA TGGGTCGCTG CGGATGTGCA TTGATTATAG 2226 | |||| ||| || ||||||| |||||||||| ||||||||| |||||||||| |||||||||| AGCACCGGTA CTATTTGTGA GGAAGAAGGA TGGGTCGCTC CGGATGTGCA TTGATTATAG 180 GCAGTTGAAC AAAGTAACAA TAAAGAACAG GTATCCCCTC CCAAGGATTG ACGATCTACT 2166 |||| ||||| |||||||||| ||| |||||| |||||||||| |||||||||| |||||||| | GCAGCTGAAC AAAGTAACAA TAATGAACAG GTATCCCCTC CCAAGGATTG ACGATCTATT 240 TGACCGGTTG CAGGGTGCAA AGTGTTTTTC AAAGATAGAC TTGCGGTCAG GTTATCATTA 2106 ||||| |||| |||||||||| || ||||||| |||||||||| |||||||||| |||||||| | TGACCAGTTG CAGGGTGCAA AGGGTTTTTC AAAGATAGAC TTGCGGTCAG GTTATCATCA 300 GGTGCGGGTA AGGGAGGCAG ATATTCCAAA GACAGCATTC CGGACCCGAT ATGGGCATTA 2046 ||| ||||| |||||||||| |||| |||| ||| |||||| |||||||||| |||||||||| GGTACGGGTT AGGGAGGCAG ATATCCCAAT GACGGCATTC CGGACCCGAT ATGGGCATTA 360 TGAGTTTAGA GTGCTGTCTT TTGGGCTGAC TAATGCTCCA GCGGTATTCA TGGATTTAAT 1986 || ||||||| ||| |||||| |||||||||| || ||||||| |||||||||| |||||||||| TGTGTTTAGA GTGTTGTCTT TTGGGCTGAC TATTGCTCCA GCGGTATTCA TGGATTTAAT 420 GAATCGAGTA TTTAAACCAT TCCTTGATAT GTTTGTTATT GTATTTATAG ACGATATTCT 1926 |||| ||||| ||||| |||| |||||||||| |||||||||| | |||||||| | |||| | GAATTGAGTA TTTAATCCAT TCCTTGATAT GTTTGTTATT GGATTTATAG CCCCTATTAT 480 AGTCTATTCA CGTTCAGAAG AGGAGCATGC AGATCATTTA A 1885 ||||||||| ||||||||| ||||||||| ||| ||||| | GGTCTATTCA CGTTCAGAAA AGGAGCATGA AGACTATTTA A 521 hqPGS_C06HBa0057J04.1-3-_SGN-E201553- (2405 1885) ******************************************************************************** EST sequence 1 -strand 598 n (File: SGN-E350824-) 1 AGCATGTTAG ATTTTCTTCC CAGCCAGCAC AGAGTGCACC CCCACGTTTC ATGGGTAGGG 61 GGTTCGATCG TATGGGATAT TCAGAAGCTG GTCAGAGCTC TAGGGCGTTA GGGTCACAGA 121 TGGGCAGGAG TTTGAGCCAG TCGAGGCCAC CTTTGCCTCA GTGTTCTCAT TGTGGTAAGT 181 CCCATCCTGG GGAATGTCGT TGGGCTACAG GTGCGTGTTT TTCTTGCGGC CGTCAGGGCC 241 ATACTATGAG GGAGTGTCAC CTTAGAGGTA GTGCAGGTGG TATGGCACAG CCTACAGGGT 301 CCGTTGCTGG TTCATTTTCT TCTGTGGCTA TGCGCCCTAC GGGGCAGGGT ATTCAGGCGC 361 CAGCAGGCCA TGGTAGAGGA CGTGGTGGAG CTTCCAGTTC TAGCAGTGCC TCGAACCGTA 421 TATATGCTTT GACTAATAGG CAGGATCAGG GGGTGTCACC TAATGTGATC ACAGGTATAT 481 TATCACTATT CTCCCGAAGT GTGTATACAT TGATAGACCC AGGTTCCACC TTATCATATA 541 TATCTCCCTT TGTTGCTAGT AGGATCGGAA TAGAGTATGA GTTGATAGAA CCATTTGA Predicted gene structure (within gDNA segment 4066 to 2259): Exon 1 3456 2859 ( 598 n); cDNA 1 598 ( 598 n); score: 0.965 MATCH C06HBa0057J04.1-3- SGN-E350824- 0.965 598 1.000 C PGS_C06HBa0057J04.1-3-_SGN-E350824- (3456 2859) Alignment (genomic DNA sequence = upper lines): AGCATGTTAG ATTTTCTTCC CAGCCAGCAC AGAGTGCACC CCCACGTTTC ATGGGTAGGG 3397 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGCATGTTAG ATTTTCTTCC CAGCCAGCAC AGAGTGCACC CCCACGTTTC ATGGGTAGGG 60 GGTTTGATCG TATGGGATAT TCGGAACCTG GTCAGAGCTC TAGGGCGTCA AGGTCACAGA 3337 |||| ||||| |||||||||| || ||| ||| |||||||||| |||||||| | ||||||||| GGTTCGATCG TATGGGATAT TCAGAAGCTG GTCAGAGCTC TAGGGCGTTA GGGTCACAGA 120 TGGGCAGGGG TTTGAGCCAG TCGAGGCCAC CTTTGCCTCG GTGTTCTCGT TGTGGTAAGT 3277 |||||||| | |||||||||| |||||||||| ||||||||| |||||||| | |||||||||| TGGGCAGGAG TTTGAGCCAG TCGAGGCCAC CTTTGCCTCA GTGTTCTCAT TGTGGTAAGT 180 CCCATCCTGG GGAATGTCGT TGGGCTACAG GTGCGTGTTT TTCTTGCGGC CGTCAGGGCC 3217 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCCATCCTGG GGAATGTCGT TGGGCTACAG GTGCGTGTTT TTCTTGCGGC CGTCAGGGCC 240 ATACTATGAG GGAGTGTCAC CTTAGAGGTA GTGCAGGTGG TATGGCACAG CCTACAGGGT 3157 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATACTATGAG GGAGTGTCAC CTTAGAGGTA GTGCAGGTGG TATGGCACAG CCTACAGGGT 300 CCGTTGCTGG TTCATCTTCT TCTGTGGCTA TGCGCCCTAC GGGGCAGGGT ATTCAGGCAC 3097 |||||||||| ||||| |||| |||||||||| |||||||||| |||||||||| |||||||| | CCGTTGCTGG TTCATTTTCT TCTGTGGCTA TGCGCCCTAC GGGGCAGGGT ATTCAGGCGC 360 CAGCCGGCCG TGGTAGAGGA CGTGGTGGAG CTTCCAGTTC TAGCGGTCCC TCAAACCGTA 3037 |||| |||| |||||||||| |||||||||| |||||||||| |||| || || || ||||||| CAGCAGGCCA TGGTAGAGGA CGTGGTGGAG CTTCCAGTTC TAGCAGTGCC TCGAACCGTA 420 TATATGCTTT GACTAATAGG CAAGATCAAG AGGCGTCACC TAATGTGATC ACAGGTATAT 2977 |||||||||| |||||||||| || ||||| | || |||||| |||||||||| |||||||||| TATATGCTTT GACTAATAGG CAGGATCAGG GGGTGTCACC TAATGTGATC ACAGGTATAT 480 TATCACTATT CTCCCGAAGT GTGTATGCAT TGATAGACCC AGGTTCCACC TTATCATATA 2917 |||||||||| |||||||||| |||||| ||| |||||||||| |||||||||| |||||||||| TATCACTATT CTCCCGAAGT GTGTATACAT TGATAGACCC AGGTTCCACC TTATCATATA 540 TATCTCCCTT TGTTGCTAGT AGGATCGGAA TAGAGTCTGA GTTGATAGAA CCATTTGA 2859 |||||||||| |||||||||| |||||||||| |||||| ||| |||||||||| |||||||| TATCTCCCTT TGTTGCTAGT AGGATCGGAA TAGAGTATGA GTTGATAGAA CCATTTGA 598 hqPGS_C06HBa0057J04.1-3-_SGN-E350824- (3456 2859) ******************************************************************************** EST sequence 9 +strand 196 n (File: SGN-E379248+) 1 CATCGTTATG TGATGGGATT GGATGGTTAT CTGATTGACA TTCCTATGGC AGTGACTCTT 61 CATCCAGGTA TGGACATTGC TCGGGTGCAG GCATATGCAC AGGGGGTAGA GGATCGGCAC 121 CGGGGACGTT AGCCAGATAG AGATTATAAT AGAGGGCCCC ATAAGAGGGC TAGATCACCA 181 GGTTATCTTG ACGAGT Predicted gene structure (within gDNA segment 4669 to 2499): Exon 1 3673 3478 ( 196 n); cDNA 1 196 ( 196 n); score: 0.918 MATCH C06HBa0057J04.1-3- SGN-E379248+ 0.918 196 1.000 C PGS_C06HBa0057J04.1-3-_SGN-E379248+ (3673 3478) Alignment (genomic DNA sequence = upper lines): CATCGTTATG TGATGGGATT GGATCGTTAT ATGATTGACG GTTGTATGGC AGTGACTCTT 3614 |||||||||| |||||||||| |||| ||||| |||||||| | |||||| |||||||||| CATCGTTATG TGATGGGATT GGATGGTTAT CTGATTGACA TTCCTATGGC AGTGACTCTT 60 CAGCCAGGTA TGGACATCGC TCGGGTGCAG GCATTTGCAC AGGGGGTAGA GGATCGGCAC 3554 || ||||||| ||||||| || |||||||||| |||| ||||| |||||||||| |||||||||| CATCCAGGTA TGGACATTGC TCGGGTGCAG GCATATGCAC AGGGGGTAGA GGATCGGCAC 120 CGGGGACGTC AGCCAGATAG AGATTATAAT AGAGGCCAGC ATAAGAGGGC TAGATCAGCA 3494 ||||||||| |||||||||| |||||||||| ||||| | | |||||||||| ||||||| || CGGGGACGTT AGCCAGATAG AGATTATAAT AGAGGGCCCC ATAAGAGGGC TAGATCACCA 180 CGTTATCCTG ACGAGT 3478 |||||| || |||||| GGTTATCTTG ACGAGT 196 hqPGS_C06HBa0057J04.1-3-_SGN-E379248+ (3673 3478) ******************************************************************************** EST sequence 6 -strand 586 n (File: SGN-E543103-) 1 GGCAGCCATG GAAATGGAGA AAATCAACCA TGCAACTCTT GACCAGCAGC TGCAAAGAAT 61 TTGGTTTGAT TAATAGCTTG AGCAGTAAAT ATTGGACGTG CGGCTCGACT ATACGGTGTA 121 GACGCTCAGT TCGGTGATCC TCCCGCCTAG GATATCTACT TTGCTGAGTG GGAGAGCTCC 181 ACTGTTTCGT AGCCCAGTCA TTTTGGTACA TAACTTTTTG TGTAGTCTTT TGCTTGTCTA 241 TGGGTATGGT GGGGCCCTGT CCCGTCGAGT TTCACTACTA TACTCTTAGA GGTCCATAGA 301 CATCGCGTGG GTTGTATATA TATGTTTTGG ATAATGGTCT GGACATGGTT TGTTTGGGAT 361 GTCCACTTGT ACAAGGGCAG CCTTGTCAGC TGCGTACATC TTTGTGTATT GTGTAGTGGC 421 AGCCTTGTCG GCTGCGTATG CTATTATGCT TTGGATAGTG GCGGCCTTGT CGGCTCGCGT 481 ATGTTGTTAC GGTTGAATGG TTATGACTCC TTATGAGACA GATCCACTTT ATATATATAT 541 ATATGGCGTT GGGGTTGGCG TGATTGATAA AAAAAAGGGG GGCCCG Predicted gene structure (within gDNA segment 6814 to 3528): Exon 1 6533 6465 ( 69 n); cDNA 1 68 ( 68 n); score: 0.862 Intron 1 6464 5911 ( 554 n); Pd: 0.900 (s: 0.86), Pa: 0.868 (s: 0.98) Exon 2 5910 5864 ( 47 n); cDNA 69 115 ( 47 n); score: 0.979 Intron 2 5863 5141 ( 723 n); Pd: 0.994 (s: 0.98), Pa: 0.000 (s: 0.96) Exon 3 5140 4808 ( 333 n); cDNA 116 448 ( 333 n); score: 0.896 Intron 3 4807 4690 ( 118 n); Pd: 0.000 (s: 0.82), Pa: 0.973 (s: 0) Exon 4 4689 4682 ( 8 n); cDNA 449 456 ( 8 n); score: 0.750 MATCH C06HBa0057J04.1-3- SGN-E543103- 0.891 457 0.780 C PGS_C06HBa0057J04.1-3-_SGN-E543103- (6533 6465,5910 5864,5140 4808,4689 4682) Alignment (genomic DNA sequence = upper lines): GGCAGCCATG GAAATGGAG- AAACCAACCC TGCAACTCTT GGCCAGCAGC TGCAAATAAT 6475 |||||||||| ||||||||| ||| ||||| |||||||||| | |||||||| |||||| ||| GGCAGCCATG GAAATGGAGA AAATCAACCA TGCAACTCTT GACCAGCAGC TGCAAAGAAT 60 TTGGTTAGTA ATCTCCTTGT TTGGTGTGTT AATTCTTTAG AATACCCTTG TTAATTATCC 6415 |||||| | TTGGTT--TG .......... .......... .......... .......... .......... 68 ATTAATTTTA AGAAGGGGGC GTGACCAGTA GCTTAGGAAG TTTGTTTTAG TTATTGAATG 6355 .......... .......... .......... .......... .......... .......... 68 TACTAAGTAT GAATGGAAAC CATAATCGGA TTATTAGTGG TGTCGTGTTG GTGCTTGGGC 6295 .......... .......... .......... .......... .......... .......... 68 TGTTTTGATT AAAGCAAACT GCAGGAAAAT TATGTTTTGG CATTATGTAT ATGTTGAATG 6235 .......... .......... .......... .......... .......... .......... 68 TGATTATGAG TATATACTCC AAAGGATGAA TACGATAAGG TAGATGTGTT ACGAATTATA 6175 .......... .......... .......... .......... .......... .......... 68 AAACGAGTTA TCACTCGGTG TGTCGTTGCT TCGCTGATAT AGTTGCCGAG ATGGAACTGT 6115 .......... .......... .......... .......... .......... .......... 68 TTTGGGGAGG GGGCTGTTTA ATATGATTCT TTGGGTTATA TGTGTTATTG GTATTGTTGT 6055 .......... .......... .......... .......... .......... .......... 68 GGATAATTTG GATTGTTGTC GGATTGGGAC GAAGTAAGGA AAATAGGGGA GGTGCTGCCG 5995 .......... .......... .......... .......... .......... .......... 68 AATTTTCGTT AGATTATTAG CTAGCTTACA AGAAAGTAAA GCACGATGTT TATCTAATTG 5935 .......... .......... .......... .......... .......... .......... 68 CGGCACGATT GTTGCTTGTT ATAGATTAAT AGCTTGAGCA GTAAATATTG GACGTGCGGC 5875 |||||| |||||||||| |||||||||| |||||||||| .......... .......... ....ATTAAT AGCTTGAGCA GTAAATATTG GACGTGCGGC 104 TCGATTATAC GGTATGTAAC GCTGTCCCTT CTTTCTTTGT TTGGCATGAC TTTTAAAAAT 5815 |||| ||||| | TCGACTATAC G......... .......... .......... .......... .......... 115 AAGCGAATAA CGGACAGATT TGATACTTAC CTCTAAAGCG TCTAGGTGAT GTATATTCTT 5755 .......... .......... .......... .......... .......... .......... 115 GCTTCCACAA TTATTCCTCT ATATATCGGT TATGTCTAAG GATATGATGA TCTCTAATAT 5695 .......... .......... .......... .......... .......... .......... 115 CTATGGTAAT GCTTCTTAGA GTCATTGAAA TTTTACGTTT TCATATCGTA TTAAAGGTTC 5635 .......... .......... .......... .......... .......... .......... 115 ATAATCTTGA TAAAACATTA ATCTTTGGTA ATACTCCTTG CTGGTTCACG TTGATTGTTC 5575 .......... .......... .......... .......... .......... .......... 115 TATTGAGTTA TAAGAAATGA TTTTAATTGC ATATGGTTGC TCATAATATT CTGCTCGTGC 5515 .......... .......... .......... .......... .......... .......... 115 ATAGAGTCAT TTATCATTTC ACCAAGTCCC GGGCCGGGTA ATGTTCGTGC GGAGTTTCTT 5455 .......... .......... .......... .......... .......... .......... 115 GCATATGTCA CCGAGTTCCT CACTAGAGGG CCGGGTATGT ATATTATATA TATGATTGGT 5395 .......... .......... .......... .......... .......... .......... 115 GATGAGGATG GTTATGATGA TGATGATGAC GGAGATGACG TGATGATTAT TTTGCCGAGC 5335 .......... .......... .......... .......... .......... .......... 115 CCCTTACTAG GGAAGCTGGG CACCTTAAAT GTTAAATATA TGCATGATTT TCACTTAAAA 5275 .......... .......... .......... .......... .......... .......... 115 AGTATATGTG TAGCGATATT TTTTTTCGAG TTGCCACATT GGTATCCTGT CATCTTTACC 5215 .......... .......... .......... .......... .......... .......... 115 TTATGCTTTA CATACTCAGT ACATTGTTCG TACTGACCCC CCTTTCCTCG GGGGGCTGCG 5155 .......... .......... .......... .......... .......... .......... 115 TTTCATGCCC GCAGGTGTAG ACGCGCAGTT CGGTGATCCT CCCGCCTAGG ATATCTACTC 5095 |||||| |||| ||||| |||||||||| |||||||||| ||||||||| .......... ....GTGTAG ACGCTCAGTT CGGTGATCCT CCCGCCTAGG ATATCTACTT 161 TGCTGATTGG GAGAGCTCCA CTGTTCCGGA GCCCATTCGT TTTGGTACAT AAC-TTTTGT 5036 |||||| ||| |||||||||| ||||| || | ||||| || | |||||||||| ||| |||||| TGCTGAGTGG GAGAGCTCCA CTGTTTCGTA GCCCAGTCAT TTTGGTACAT AACTTTTTGT 221 GTAGTCTTTT GCTCGTCTAT GGGTATGGCG GGGCCCTGTC CCGTCGAGTT TCACTAATGT 4976 |||||||||| ||| |||||| |||||||| | |||||||||| |||||||||| |||||| | | GTAGTCTTTT GCTTGTCTAT GGGTATGGTG GGGCCCTGTC CCGTCGAGTT TCACTACTAT 281 ACCCTTAGAG GTCTGTGGGC ATTATGTGGG TTGTATATAT ATGTTTTGGA TAATGGTCTG 4916 || ||||||| ||| | | | || ||||| |||||||||| |||||||||| |||||||||| ACTCTTAGAG GTCCATAGAC ATCGCGTGGG TTGTATATAT ATGTTTTGGA TAATGGTCTG 341 GACATGGTTT GTTTGGGATG TCCGCTTGTA CAGGGGCAGC CTTGTCGGCT GTGTACATCA 4856 |||||||||| |||||||||| ||| |||||| || ||||||| |||||| ||| | ||||||| GACATGGTTT GTTTGGGATG TCCACTTGTA CAAGGGCAGC CTTGTCAGCT GCGTACATCT 401 TTATGCTTTG AATAGTGGCG GCCTTGTCGG CTCGCGTATG CTGTTATGGT TGAATGGTTA 4796 || || ||| ||||||| |||||||||| || ||||||| || ||||| TTGTGTATTG TGTAGTGGCA GCCTTGTCGG CT-GCGTATG CTATTATG.. .......... 448 TGACTCCTTA TGAGACAGGT CCTCTTATAT ATATATATGA CGTTGGGGTT GGCTTGATTT 4736 .......... .......... .......... .......... .......... .......... 448 GATTAAATTC CATATTGTCT TAGTTTCAGT TGGTCATACT TAGCAGGTTT GTAT 4682 ||| | || .......... .......... .......... .......... ......CTTT GGAT 456 hqPGS_C06HBa0057J04.1-3-_SGN-E543103- (6533 6465,5910 5864,5140 4808,4689 4682) ******************************************************************************** EST sequence 18 +strand 577 n (File: SGN-E543104+) 1 GGCAGCCATG GAAATGGAGA AAATCAACCA TGCAACTCTT GACCAGCAGC TGCAAAGAAT 61 TTGGTTTGAT TAATAGCTTG AGCAGTAAAT ATTGGACGTG CGGCTCGACT ATACGGTGTA 121 GACGCTCAGT TCGGTGATCC TCCCGCCTAG GATATCTACT TTGCTGAGTG GGAGAGCTCC 181 ACTGTTTCGT AGCCCAGTCA TTTTGGTACA TAACTTTTTG TGTAGTCTTT TGCTTGTCTA 241 TGGGTATGGT GGGGCCCTGT CCCGTCGAGT TTCACTACTA TACTCTTAGA GGTCCATAGA 301 CATCGCGTGG GTTGTATATA TATGTTTTGG ATAATGGTCT GGACATGGTT TGTTTGGGAT 361 GTCCACTTGT ACAAGGGCAG CCTTGTCAGC TGCGTACATC TTTGTGTATT GTGTAGTGGC 421 AGCCTTGTCG GCTGCGTATG CTATTATGCT TTGGATAGTG GCGGCCTTGT CGGCTCGCGT 481 ATGTTGTTAC GGTTGAATGG TTATGACTCC TTATGAGACA GATCCACTTT ATATATATAT 541 ATATGGCGAT GGGGTTGGCT TGATTTGATT AAAAAAA Predicted gene structure (within gDNA segment 6814 to 3608): Exon 1 6533 6465 ( 69 n); cDNA 1 68 ( 68 n); score: 0.862 Intron 1 6464 5911 ( 554 n); Pd: 0.900 (s: 0.86), Pa: 0.868 (s: 0.98) Exon 2 5910 5864 ( 47 n); cDNA 69 115 ( 47 n); score: 0.979 Intron 2 5863 5141 ( 723 n); Pd: 0.994 (s: 0.98), Pa: 0.000 (s: 0.96) Exon 3 5140 4808 ( 333 n); cDNA 116 448 ( 333 n); score: 0.896 Intron 3 4807 4690 ( 118 n); Pd: 0.000 (s: 0.82), Pa: 0.973 (s: 0) Exon 4 4689 4682 ( 8 n); cDNA 449 456 ( 8 n); score: 0.750 MATCH C06HBa0057J04.1-3- SGN-E543104+ 0.891 457 0.792 C PGS_C06HBa0057J04.1-3-_SGN-E543104+ (6533 6465,5910 5864,5140 4808,4689 4682) Alignment (genomic DNA sequence = upper lines): GGCAGCCATG GAAATGGAG- AAACCAACCC TGCAACTCTT GGCCAGCAGC TGCAAATAAT 6475 |||||||||| ||||||||| ||| ||||| |||||||||| | |||||||| |||||| ||| GGCAGCCATG GAAATGGAGA AAATCAACCA TGCAACTCTT GACCAGCAGC TGCAAAGAAT 60 TTGGTTAGTA ATCTCCTTGT TTGGTGTGTT AATTCTTTAG AATACCCTTG TTAATTATCC 6415 |||||| | TTGGTT--TG .......... .......... .......... .......... .......... 68 ATTAATTTTA AGAAGGGGGC GTGACCAGTA GCTTAGGAAG TTTGTTTTAG TTATTGAATG 6355 .......... .......... .......... .......... .......... .......... 68 TACTAAGTAT GAATGGAAAC CATAATCGGA TTATTAGTGG TGTCGTGTTG GTGCTTGGGC 6295 .......... .......... .......... .......... .......... .......... 68 TGTTTTGATT AAAGCAAACT GCAGGAAAAT TATGTTTTGG CATTATGTAT ATGTTGAATG 6235 .......... .......... .......... .......... .......... .......... 68 TGATTATGAG TATATACTCC AAAGGATGAA TACGATAAGG TAGATGTGTT ACGAATTATA 6175 .......... .......... .......... .......... .......... .......... 68 AAACGAGTTA TCACTCGGTG TGTCGTTGCT TCGCTGATAT AGTTGCCGAG ATGGAACTGT 6115 .......... .......... .......... .......... .......... .......... 68 TTTGGGGAGG GGGCTGTTTA ATATGATTCT TTGGGTTATA TGTGTTATTG GTATTGTTGT 6055 .......... .......... .......... .......... .......... .......... 68 GGATAATTTG GATTGTTGTC GGATTGGGAC GAAGTAAGGA AAATAGGGGA GGTGCTGCCG 5995 .......... .......... .......... .......... .......... .......... 68 AATTTTCGTT AGATTATTAG CTAGCTTACA AGAAAGTAAA GCACGATGTT TATCTAATTG 5935 .......... .......... .......... .......... .......... .......... 68 CGGCACGATT GTTGCTTGTT ATAGATTAAT AGCTTGAGCA GTAAATATTG GACGTGCGGC 5875 |||||| |||||||||| |||||||||| |||||||||| .......... .......... ....ATTAAT AGCTTGAGCA GTAAATATTG GACGTGCGGC 104 TCGATTATAC GGTATGTAAC GCTGTCCCTT CTTTCTTTGT TTGGCATGAC TTTTAAAAAT 5815 |||| ||||| | TCGACTATAC G......... .......... .......... .......... .......... 115 AAGCGAATAA CGGACAGATT TGATACTTAC CTCTAAAGCG TCTAGGTGAT GTATATTCTT 5755 .......... .......... .......... .......... .......... .......... 115 GCTTCCACAA TTATTCCTCT ATATATCGGT TATGTCTAAG GATATGATGA TCTCTAATAT 5695 .......... .......... .......... .......... .......... .......... 115 CTATGGTAAT GCTTCTTAGA GTCATTGAAA TTTTACGTTT TCATATCGTA TTAAAGGTTC 5635 .......... .......... .......... .......... .......... .......... 115 ATAATCTTGA TAAAACATTA ATCTTTGGTA ATACTCCTTG CTGGTTCACG TTGATTGTTC 5575 .......... .......... .......... .......... .......... .......... 115 TATTGAGTTA TAAGAAATGA TTTTAATTGC ATATGGTTGC TCATAATATT CTGCTCGTGC 5515 .......... .......... .......... .......... .......... .......... 115 ATAGAGTCAT TTATCATTTC ACCAAGTCCC GGGCCGGGTA ATGTTCGTGC GGAGTTTCTT 5455 .......... .......... .......... .......... .......... .......... 115 GCATATGTCA CCGAGTTCCT CACTAGAGGG CCGGGTATGT ATATTATATA TATGATTGGT 5395 .......... .......... .......... .......... .......... .......... 115 GATGAGGATG GTTATGATGA TGATGATGAC GGAGATGACG TGATGATTAT TTTGCCGAGC 5335 .......... .......... .......... .......... .......... .......... 115 CCCTTACTAG GGAAGCTGGG CACCTTAAAT GTTAAATATA TGCATGATTT TCACTTAAAA 5275 .......... .......... .......... .......... .......... .......... 115 AGTATATGTG TAGCGATATT TTTTTTCGAG TTGCCACATT GGTATCCTGT CATCTTTACC 5215 .......... .......... .......... .......... .......... .......... 115 TTATGCTTTA CATACTCAGT ACATTGTTCG TACTGACCCC CCTTTCCTCG GGGGGCTGCG 5155 .......... .......... .......... .......... .......... .......... 115 TTTCATGCCC GCAGGTGTAG ACGCGCAGTT CGGTGATCCT CCCGCCTAGG ATATCTACTC 5095 |||||| |||| ||||| |||||||||| |||||||||| ||||||||| .......... ....GTGTAG ACGCTCAGTT CGGTGATCCT CCCGCCTAGG ATATCTACTT 161 TGCTGATTGG GAGAGCTCCA CTGTTCCGGA GCCCATTCGT TTTGGTACAT AAC-TTTTGT 5036 |||||| ||| |||||||||| ||||| || | ||||| || | |||||||||| ||| |||||| TGCTGAGTGG GAGAGCTCCA CTGTTTCGTA GCCCAGTCAT TTTGGTACAT AACTTTTTGT 221 GTAGTCTTTT GCTCGTCTAT GGGTATGGCG GGGCCCTGTC CCGTCGAGTT TCACTAATGT 4976 |||||||||| ||| |||||| |||||||| | |||||||||| |||||||||| |||||| | | GTAGTCTTTT GCTTGTCTAT GGGTATGGTG GGGCCCTGTC CCGTCGAGTT TCACTACTAT 281 ACCCTTAGAG GTCTGTGGGC ATTATGTGGG TTGTATATAT ATGTTTTGGA TAATGGTCTG 4916 || ||||||| ||| | | | || ||||| |||||||||| |||||||||| |||||||||| ACTCTTAGAG GTCCATAGAC ATCGCGTGGG TTGTATATAT ATGTTTTGGA TAATGGTCTG 341 GACATGGTTT GTTTGGGATG TCCGCTTGTA CAGGGGCAGC CTTGTCGGCT GTGTACATCA 4856 |||||||||| |||||||||| ||| |||||| || ||||||| |||||| ||| | ||||||| GACATGGTTT GTTTGGGATG TCCACTTGTA CAAGGGCAGC CTTGTCAGCT GCGTACATCT 401 TTATGCTTTG AATAGTGGCG GCCTTGTCGG CTCGCGTATG CTGTTATGGT TGAATGGTTA 4796 || || ||| ||||||| |||||||||| || ||||||| || ||||| TTGTGTATTG TGTAGTGGCA GCCTTGTCGG CT-GCGTATG CTATTATG.. .......... 448 TGACTCCTTA TGAGACAGGT CCTCTTATAT ATATATATGA CGTTGGGGTT GGCTTGATTT 4736 .......... .......... .......... .......... .......... .......... 448 GATTAAATTC CATATTGTCT TAGTTTCAGT TGGTCATACT TAGCAGGTTT GTAT 4682 ||| | || .......... .......... .......... .......... ......CTTT GGAT 456 hqPGS_C06HBa0057J04.1-3-_SGN-E543104+ (6533 6465,5910 5864,5140 4808,4689 4682) ******************************************************************************** EST sequence 5 -strand 542 n (File: SGN-E374134-) 1 CAGCCATGGA AATGGAGAAA ACCAGCCATT AAACTCTTGG ACAGCAGCTG CAAAGAAATT 61 GATTTATACC TTGAGCAGTA AATATTGGAC GTACGGCTCG ACTATTCGGT GTAGACGCTC 121 AGTTCGGTGA TCCTCCCGCC TAGGATATCT ACTCTGCTTT TTGGGAGAGC TCCACTGTTC 181 CGGAGCCCAG TCGTTTTGGT ACATAACTTC TTATGTAGTC TTTTGCTTGT CTATGGGTAT 241 GGCGGGGCCC TGTCCCGTCA AGTTTCACTA CTATACTCTT AGAGGTCTGT AGACATCGTG 301 TGGGTTGTAT AATTATGTTT TGGATAATGG TCTGGACATG GTTTGTTTGG GATGTCCATT 361 TGTACAAGTG CAGCCTTGTC GGTTGTGAAC ATCATTGTGT ATTGTGTAGT GGCAGCCTCG 421 TCGGCTGCGT ATGCTATTAT GTTTTGGATA GTGGCGGCCT TGTCGGCTCG CATATGTTGT 481 TACGATTTAA TGGTTATGAC TCTTTATGAG ATAGATCCAC TTTATATATA AAAAAAAAAA 541 AA Predicted gene structure (within gDNA segment 6814 to 2443): Exon 1 6531 6472 ( 60 n); cDNA 1 61 ( 61 n); score: 0.842 Intron 1 6471 5911 ( 561 n); Pd: 0.997 (s: 0.81), Pa: 0.868 (s: 0.89) Exon 2 5910 5864 ( 47 n); cDNA 62 108 ( 47 n); score: 0.894 Intron 2 5863 5141 ( 723 n); Pd: 0.994 (s: 0.89), Pa: 0.000 (s: 0.98) Exon 3 5140 4808 ( 333 n); cDNA 109 441 ( 333 n); score: 0.899 Intron 3 4807 4690 ( 118 n); Pd: 0.000 (s: 0.82), Pa: 0.973 (s: 0) Exon 4 4689 4682 ( 8 n); cDNA 442 449 ( 8 n); score: 0.750 PPA cDNA 528 542 MATCH C06HBa0057J04.1-3- SGN-E374134- 0.891 448 0.827 C PGS_C06HBa0057J04.1-3-_SGN-E374134- (6531 6472,5910 5864,5140 4808,4689 4682) Alignment (genomic DNA sequence = upper lines): CAGCCATGGA AATGGAG-AA ACCAACCCTG CAACTCTTGG CCAGCAGCTG CAAATAATTT 6473 |||||||||| ||||||| || |||| || | ||||||||| ||||||||| |||| || || CAGCCATGGA AATGGAGAAA ACCAGCCATT AAACTCTTGG ACAGCAGCTG CAAAGAAATT 60 GGTTAGTAAT CTCCTTGTTT GGTGTGTTAA TTCTTTAGAA TACCCTTGTT AATTATCCAT 6413 | G......... .......... .......... .......... .......... .......... 61 TAATTTTAAG AAGGGGGCGT GACCAGTAGC TTAGGAAGTT TGTTTTAGTT ATTGAATGTA 6353 .......... .......... .......... .......... .......... .......... 61 CTAAGTATGA ATGGAAACCA TAATCGGATT ATTAGTGGTG TCGTGTTGGT GCTTGGGCTG 6293 .......... .......... .......... .......... .......... .......... 61 TTTTGATTAA AGCAAACTGC AGGAAAATTA TGTTTTGGCA TTATGTATAT GTTGAATGTG 6233 .......... .......... .......... .......... .......... .......... 61 ATTATGAGTA TATACTCCAA AGGATGAATA CGATAAGGTA GATGTGTTAC GAATTATAAA 6173 .......... .......... .......... .......... .......... .......... 61 ACGAGTTATC ACTCGGTGTG TCGTTGCTTC GCTGATATAG TTGCCGAGAT GGAACTGTTT 6113 .......... .......... .......... .......... .......... .......... 61 TGGGGAGGGG GCTGTTTAAT ATGATTCTTT GGGTTATATG TGTTATTGGT ATTGTTGTGG 6053 .......... .......... .......... .......... .......... .......... 61 ATAATTTGGA TTGTTGTCGG ATTGGGACGA AGTAAGGAAA ATAGGGGAGG TGCTGCCGAA 5993 .......... .......... .......... .......... .......... .......... 61 TTTTCGTTAG ATTATTAGCT AGCTTACAAG AAAGTAAAGC ACGATGTTTA TCTAATTGCG 5933 .......... .......... .......... .......... .......... .......... 61 GCACGATTGT TGCTTGTTAT AGATTAATAG CTTGAGCAGT AAATATTGGA CGTGCGGCTC 5873 ||| ||| |||||||||| |||||||||| ||| |||||| .......... .......... ..ATTTATAC CTTGAGCAGT AAATATTGGA CGTACGGCTC 99 GATTATACGG TATGTAACGC TGTCCCTTCT TTCTTTGTTT GGCATGACTT TTAAAAATAA 5813 || ||| || GACTATTCG. .......... .......... .......... .......... .......... 108 GCGAATAACG GACAGATTTG ATACTTACCT CTAAAGCGTC TAGGTGATGT ATATTCTTGC 5753 .......... .......... .......... .......... .......... .......... 108 TTCCACAATT ATTCCTCTAT ATATCGGTTA TGTCTAAGGA TATGATGATC TCTAATATCT 5693 .......... .......... .......... .......... .......... .......... 108 ATGGTAATGC TTCTTAGAGT CATTGAAATT TTACGTTTTC ATATCGTATT AAAGGTTCAT 5633 .......... .......... .......... .......... .......... .......... 108 AATCTTGATA AAACATTAAT CTTTGGTAAT ACTCCTTGCT GGTTCACGTT GATTGTTCTA 5573 .......... .......... .......... .......... .......... .......... 108 TTGAGTTATA AGAAATGATT TTAATTGCAT ATGGTTGCTC ATAATATTCT GCTCGTGCAT 5513 .......... .......... .......... .......... .......... .......... 108 AGAGTCATTT ATCATTTCAC CAAGTCCCGG GCCGGGTAAT GTTCGTGCGG AGTTTCTTGC 5453 .......... .......... .......... .......... .......... .......... 108 ATATGTCACC GAGTTCCTCA CTAGAGGGCC GGGTATGTAT ATTATATATA TGATTGGTGA 5393 .......... .......... .......... .......... .......... .......... 108 TGAGGATGGT TATGATGATG ATGATGACGG AGATGACGTG ATGATTATTT TGCCGAGCCC 5333 .......... .......... .......... .......... .......... .......... 108 CTTACTAGGG AAGCTGGGCA CCTTAAATGT TAAATATATG CATGATTTTC ACTTAAAAAG 5273 .......... .......... .......... .......... .......... .......... 108 TATATGTGTA GCGATATTTT TTTTCGAGTT GCCACATTGG TATCCTGTCA TCTTTACCTT 5213 .......... .......... .......... .......... .......... .......... 108 ATGCTTTACA TACTCAGTAC ATTGTTCGTA CTGACCCCCC TTTCCTCGGG GGGCTGCGTT 5153 .......... .......... .......... .......... .......... .......... 108 TCATGCCCGC AGGTGTAGAC GCGCAGTTCG GTGATCCTCC CGCCTAGGAT ATCTACTCTG 5093 |||||||| || ||||||| |||||||||| |||||||||| |||||||||| .......... ..GTGTAGAC GCTCAGTTCG GTGATCCTCC CGCCTAGGAT ATCTACTCTG 156 CTGATTGGGA GAGCTCCACT GTTCCGGAGC CCATTCGTTT TGGTACATAA CTT-TTGTGT 5034 || |||||| |||||||||| |||||||||| ||| |||||| |||||||||| ||| || ||| CTTTTTGGGA GAGCTCCACT GTTCCGGAGC CCAGTCGTTT TGGTACATAA CTTCTTATGT 216 AGTCTTTTGC TCGTCTATGG GTATGGCGGG GCCCTGTCCC GTCGAGTTTC ACTAATGTAC 4974 |||||||||| | |||||||| |||||||||| |||||||||| ||| |||||| |||| | ||| AGTCTTTTGC TTGTCTATGG GTATGGCGGG GCCCTGTCCC GTCAAGTTTC ACTACTATAC 276 CCTTAGAGGT CTGTGGGCAT TATGTGGGTT GTATATATAT GTTTTGGATA ATGGTCTGGA 4914 ||||||||| |||| | ||| |||||||| ||||| ||| |||||||||| |||||||||| TCTTAGAGGT CTGTAGACAT CGTGTGGGTT GTATAATTAT GTTTTGGATA ATGGTCTGGA 336 CATGGTTTGT TTGGGATGTC CGCTTGTACA GGGGCAGCCT TGTCGGCTGT GTACATCATT 4854 |||||||||| |||||||||| | ||||||| | ||||||| |||||| ||| | |||||||| CATGGTTTGT TTGGGATGTC CATTTGTACA AGTGCAGCCT TGTCGGTTGT GAACATCATT 396 ATGCTTTGAA TAGTGGCGGC CTTGTCGGCT CGCGTATGCT GTTATGGTTG AATGGTTATG 4794 || ||| ||||||| || || ||||||| ||||||||| ||||| GTGTATTGTG TAGTGGCAGC CTCGTCGGCT -GCGTATGCT ATTATG.... .......... 441 ACTCCTTATG AGACAGGTCC TCTTATATAT ATATATGACG TTGGGGTTGG CTTGATTTGA 4734 .......... .......... .......... .......... .......... .......... 441 TTAAATTCCA TATTGTCTTA GTTTCAGTTG GTCATACTTA GCAGGTTTGT AT 4682 |||| || .......... .......... .......... .......... ....TTTTGG AT 449 hqPGS_C06HBa0057J04.1-3-_SGN-E374134- (6531 6472,5910 5864,5140 4808,4689 4682) ******************************************************************************** EST sequence 13 +strand 547 n (File: SGN-E305738+) 1 AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 61 ATTTATACCT TGAGCAGTAA ATATTGGACG TACGGCTCGA CTATTCGGTG TAGACGCTCA 121 GTTCGGTGAT CCTCCCGCCT AGGATATCTA CTCTGCTTTT TGGGAGAGCT CCACTGTTCC 181 GGAGCCCAGT CGTTTTGGTA CATAACTTCT TATGTAGTCT TTTGCTTGTC TATGGGTATG 241 GCGGGGCCCT GTCCCGTCAA GTTTCACTAC TATACTCTTA GAGGTCTGTA GACATCGTGT 301 GGGTTGTATA ATTATGTTTT GGATAATGGT CTGGACATGG TTTGTTTGGG ATGTCCATTT 361 GTACAAGTGC AGCCTTGTCG GTTGTGAACA TCATTGTGTA TTGTGTAGTG GCAGCCTCGT 421 CGGCTGCGTA TGCTATTATG TTTTGGATAG TGGCGGCCTT GTCGGCTCGC ATATGTTGTT 481 ACGATTTAAT GGTTATGACT CTTTATGAGA TAGATCCACT TTATATATGA AATGAATGGA 541 CTAACTA Predicted gene structure (within gDNA segment 6814 to 2373): Exon 1 6530 6472 ( 59 n); cDNA 1 60 ( 60 n); score: 0.839 Intron 1 6471 5911 ( 561 n); Pd: 0.997 (s: 0.81), Pa: 0.868 (s: 0.89) Exon 2 5910 5864 ( 47 n); cDNA 61 107 ( 47 n); score: 0.894 Intron 2 5863 5141 ( 723 n); Pd: 0.994 (s: 0.89), Pa: 0.000 (s: 0.98) Exon 3 5140 4808 ( 333 n); cDNA 108 440 ( 333 n); score: 0.899 Intron 3 4807 4690 ( 118 n); Pd: 0.000 (s: 0.82), Pa: 0.973 (s: 0) Exon 4 4689 4682 ( 8 n); cDNA 441 448 ( 8 n); score: 0.750 MATCH C06HBa0057J04.1-3- SGN-E305738+ 0.890 447 0.817 C PGS_C06HBa0057J04.1-3-_SGN-E305738+ (6530 6472,5910 5864,5140 4808,4689 4682) Alignment (genomic DNA sequence = upper lines): AGCCATGGAA ATGGAGA-AA CCAACCCTGC AACTCTTGGC CAGCAGCTGC AAATAATTTG 6472 |||||||||| ||||||| || ||| || | ||||||||| |||||||||| ||| || ||| AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 60 GTTAGTAATC TCCTTGTTTG GTGTGTTAAT TCTTTAGAAT ACCCTTGTTA ATTATCCATT 6412 .......... .......... .......... .......... .......... .......... 60 AATTTTAAGA AGGGGGCGTG ACCAGTAGCT TAGGAAGTTT GTTTTAGTTA TTGAATGTAC 6352 .......... .......... .......... .......... .......... .......... 60 TAAGTATGAA TGGAAACCAT AATCGGATTA TTAGTGGTGT CGTGTTGGTG CTTGGGCTGT 6292 .......... .......... .......... .......... .......... .......... 60 TTTGATTAAA GCAAACTGCA GGAAAATTAT GTTTTGGCAT TATGTATATG TTGAATGTGA 6232 .......... .......... .......... .......... .......... .......... 60 TTATGAGTAT ATACTCCAAA GGATGAATAC GATAAGGTAG ATGTGTTACG AATTATAAAA 6172 .......... .......... .......... .......... .......... .......... 60 CGAGTTATCA CTCGGTGTGT CGTTGCTTCG CTGATATAGT TGCCGAGATG GAACTGTTTT 6112 .......... .......... .......... .......... .......... .......... 60 GGGGAGGGGG CTGTTTAATA TGATTCTTTG GGTTATATGT GTTATTGGTA TTGTTGTGGA 6052 .......... .......... .......... .......... .......... .......... 60 TAATTTGGAT TGTTGTCGGA TTGGGACGAA GTAAGGAAAA TAGGGGAGGT GCTGCCGAAT 5992 .......... .......... .......... .......... .......... .......... 60 TTTCGTTAGA TTATTAGCTA GCTTACAAGA AAGTAAAGCA CGATGTTTAT CTAATTGCGG 5932 .......... .......... .......... .......... .......... .......... 60 CACGATTGTT GCTTGTTATA GATTAATAGC TTGAGCAGTA AATATTGGAC GTGCGGCTCG 5872 ||| ||| | |||||||||| |||||||||| || ||||||| .......... .......... .ATTTATACC TTGAGCAGTA AATATTGGAC GTACGGCTCG 99 ATTATACGGT ATGTAACGCT GTCCCTTCTT TCTTTGTTTG GCATGACTTT TAAAAATAAG 5812 | ||| || ACTATTCG.. .......... .......... .......... .......... .......... 107 CGAATAACGG ACAGATTTGA TACTTACCTC TAAAGCGTCT AGGTGATGTA TATTCTTGCT 5752 .......... .......... .......... .......... .......... .......... 107 TCCACAATTA TTCCTCTATA TATCGGTTAT GTCTAAGGAT ATGATGATCT CTAATATCTA 5692 .......... .......... .......... .......... .......... .......... 107 TGGTAATGCT TCTTAGAGTC ATTGAAATTT TACGTTTTCA TATCGTATTA AAGGTTCATA 5632 .......... .......... .......... .......... .......... .......... 107 ATCTTGATAA AACATTAATC TTTGGTAATA CTCCTTGCTG GTTCACGTTG ATTGTTCTAT 5572 .......... .......... .......... .......... .......... .......... 107 TGAGTTATAA GAAATGATTT TAATTGCATA TGGTTGCTCA TAATATTCTG CTCGTGCATA 5512 .......... .......... .......... .......... .......... .......... 107 GAGTCATTTA TCATTTCACC AAGTCCCGGG CCGGGTAATG TTCGTGCGGA GTTTCTTGCA 5452 .......... .......... .......... .......... .......... .......... 107 TATGTCACCG AGTTCCTCAC TAGAGGGCCG GGTATGTATA TTATATATAT GATTGGTGAT 5392 .......... .......... .......... .......... .......... .......... 107 GAGGATGGTT ATGATGATGA TGATGACGGA GATGACGTGA TGATTATTTT GCCGAGCCCC 5332 .......... .......... .......... .......... .......... .......... 107 TTACTAGGGA AGCTGGGCAC CTTAAATGTT AAATATATGC ATGATTTTCA CTTAAAAAGT 5272 .......... .......... .......... .......... .......... .......... 107 ATATGTGTAG CGATATTTTT TTTCGAGTTG CCACATTGGT ATCCTGTCAT CTTTACCTTA 5212 .......... .......... .......... .......... .......... .......... 107 TGCTTTACAT ACTCAGTACA TTGTTCGTAC TGACCCCCCT TTCCTCGGGG GGCTGCGTTT 5152 .......... .......... .......... .......... .......... .......... 107 CATGCCCGCA GGTGTAGACG CGCAGTTCGG TGATCCTCCC GCCTAGGATA TCTACTCTGC 5092 ||||||||| | |||||||| |||||||||| |||||||||| |||||||||| .......... .GTGTAGACG CTCAGTTCGG TGATCCTCCC GCCTAGGATA TCTACTCTGC 156 TGATTGGGAG AGCTCCACTG TTCCGGAGCC CATTCGTTTT GGTACATAAC TT-TTGTGTA 5033 | ||||||| |||||||||| |||||||||| || ||||||| |||||||||| || || |||| TTTTTGGGAG AGCTCCACTG TTCCGGAGCC CAGTCGTTTT GGTACATAAC TTCTTATGTA 216 GTCTTTTGCT CGTCTATGGG TATGGCGGGG CCCTGTCCCG TCGAGTTTCA CTAATGTACC 4973 |||||||||| ||||||||| |||||||||| |||||||||| || ||||||| ||| | ||| GTCTTTTGCT TGTCTATGGG TATGGCGGGG CCCTGTCCCG TCAAGTTTCA CTACTATACT 276 CTTAGAGGTC TGTGGGCATT ATGTGGGTTG TATATATATG TTTTGGATAA TGGTCTGGAC 4913 |||||||||| ||| | ||| ||||||||| |||| |||| |||||||||| |||||||||| CTTAGAGGTC TGTAGACATC GTGTGGGTTG TATAATTATG TTTTGGATAA TGGTCTGGAC 336 ATGGTTTGTT TGGGATGTCC GCTTGTACAG GGGCAGCCTT GTCGGCTGTG TACATCATTA 4853 |||||||||| |||||||||| ||||||| | |||||||| ||||| |||| |||||||| ATGGTTTGTT TGGGATGTCC ATTTGTACAA GTGCAGCCTT GTCGGTTGTG AACATCATTG 396 TGCTTTGAAT AGTGGCGGCC TTGTCGGCTC GCGTATGCTG TTATGGTTGA ATGGTTATGA 4793 || ||| | |||||| ||| | ||||||| ||||||||| ||||| TGTATTGTGT AGTGGCAGCC TCGTCGGCT- GCGTATGCTA TTATG..... .......... 440 CTCCTTATGA GACAGGTCCT CTTATATATA TATATGACGT TGGGGTTGGC TTGATTTGAT 4733 .......... .......... .......... .......... .......... .......... 440 TAAATTCCAT ATTGTCTTAG TTTCAGTTGG TCATACTTAG CAGGTTTGTA T 4682 |||| | | .......... .......... .......... .......... ...TTTTGGA T 448 hqPGS_C06HBa0057J04.1-3-_SGN-E305738+ (6530 6472,5910 5864,5140 4808,4689 4682) ******************************************************************************** EST sequence 16 +strand 542 n (File: SGN-E374135+) 1 AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 61 ATTTATACCT TGAGCAGTAA ATATTGGACG TACGGCTCGA CTATTCGGTG TAGACGCTCA 121 GTTCGGTGAT CCTCCCGCCT AGGATATCTA CTCTGCTTTT TGGGAGAGCT CCACTGTTCC 181 GGAGCCCAGT CGTTTTGGTA CATAACTTCT TATGTAGTCT TTTGCTTGTC TATGGGTATG 241 GCGGGGCCCT GTCCCGTCAA GTTTCACTAC TATACTCTTA GAGGTCTGTA GACATCGTGT 301 GGGTTGTATA ATTATGTTTT GGATAATGGT CTGGACATGG TTTGTTTGGG ATGTCCATTT 361 GTACAAGTGC AGCCTTGTCG GTTGTGAACA TCATTGTGTA TTGTGTAGTG GCAGCCTCGT 421 CGGCTGCGTA TGCTATTATG TTTTGGATAG TGGCGGCCTT GTCGGCTCGC ATATGTTGTT 481 ACGATTTAAT GGTTATGACT CTTTATGAGA TAGATCCACT TTATATATAA AAAAAAAAAA 541 AA Predicted gene structure (within gDNA segment 6814 to 2423): Exon 1 6530 6472 ( 59 n); cDNA 1 60 ( 60 n); score: 0.839 Intron 1 6471 5911 ( 561 n); Pd: 0.997 (s: 0.81), Pa: 0.868 (s: 0.89) Exon 2 5910 5864 ( 47 n); cDNA 61 107 ( 47 n); score: 0.894 Intron 2 5863 5141 ( 723 n); Pd: 0.994 (s: 0.89), Pa: 0.000 (s: 0.98) Exon 3 5140 4808 ( 333 n); cDNA 108 440 ( 333 n); score: 0.899 Intron 3 4807 4690 ( 118 n); Pd: 0.000 (s: 0.82), Pa: 0.973 (s: 0) Exon 4 4689 4682 ( 8 n); cDNA 441 448 ( 8 n); score: 0.750 PPA cDNA 527 542 MATCH C06HBa0057J04.1-3- SGN-E374135+ 0.890 447 0.825 C PGS_C06HBa0057J04.1-3-_SGN-E374135+ (6530 6472,5910 5864,5140 4808,4689 4682) Alignment (genomic DNA sequence = upper lines): AGCCATGGAA ATGGAGA-AA CCAACCCTGC AACTCTTGGC CAGCAGCTGC AAATAATTTG 6472 |||||||||| ||||||| || ||| || | ||||||||| |||||||||| ||| || ||| AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 60 GTTAGTAATC TCCTTGTTTG GTGTGTTAAT TCTTTAGAAT ACCCTTGTTA ATTATCCATT 6412 .......... .......... .......... .......... .......... .......... 60 AATTTTAAGA AGGGGGCGTG ACCAGTAGCT TAGGAAGTTT GTTTTAGTTA TTGAATGTAC 6352 .......... .......... .......... .......... .......... .......... 60 TAAGTATGAA TGGAAACCAT AATCGGATTA TTAGTGGTGT CGTGTTGGTG CTTGGGCTGT 6292 .......... .......... .......... .......... .......... .......... 60 TTTGATTAAA GCAAACTGCA GGAAAATTAT GTTTTGGCAT TATGTATATG TTGAATGTGA 6232 .......... .......... .......... .......... .......... .......... 60 TTATGAGTAT ATACTCCAAA GGATGAATAC GATAAGGTAG ATGTGTTACG AATTATAAAA 6172 .......... .......... .......... .......... .......... .......... 60 CGAGTTATCA CTCGGTGTGT CGTTGCTTCG CTGATATAGT TGCCGAGATG GAACTGTTTT 6112 .......... .......... .......... .......... .......... .......... 60 GGGGAGGGGG CTGTTTAATA TGATTCTTTG GGTTATATGT GTTATTGGTA TTGTTGTGGA 6052 .......... .......... .......... .......... .......... .......... 60 TAATTTGGAT TGTTGTCGGA TTGGGACGAA GTAAGGAAAA TAGGGGAGGT GCTGCCGAAT 5992 .......... .......... .......... .......... .......... .......... 60 TTTCGTTAGA TTATTAGCTA GCTTACAAGA AAGTAAAGCA CGATGTTTAT CTAATTGCGG 5932 .......... .......... .......... .......... .......... .......... 60 CACGATTGTT GCTTGTTATA GATTAATAGC TTGAGCAGTA AATATTGGAC GTGCGGCTCG 5872 ||| ||| | |||||||||| |||||||||| || ||||||| .......... .......... .ATTTATACC TTGAGCAGTA AATATTGGAC GTACGGCTCG 99 ATTATACGGT ATGTAACGCT GTCCCTTCTT TCTTTGTTTG GCATGACTTT TAAAAATAAG 5812 | ||| || ACTATTCG.. .......... .......... .......... .......... .......... 107 CGAATAACGG ACAGATTTGA TACTTACCTC TAAAGCGTCT AGGTGATGTA TATTCTTGCT 5752 .......... .......... .......... .......... .......... .......... 107 TCCACAATTA TTCCTCTATA TATCGGTTAT GTCTAAGGAT ATGATGATCT CTAATATCTA 5692 .......... .......... .......... .......... .......... .......... 107 TGGTAATGCT TCTTAGAGTC ATTGAAATTT TACGTTTTCA TATCGTATTA AAGGTTCATA 5632 .......... .......... .......... .......... .......... .......... 107 ATCTTGATAA AACATTAATC TTTGGTAATA CTCCTTGCTG GTTCACGTTG ATTGTTCTAT 5572 .......... .......... .......... .......... .......... .......... 107 TGAGTTATAA GAAATGATTT TAATTGCATA TGGTTGCTCA TAATATTCTG CTCGTGCATA 5512 .......... .......... .......... .......... .......... .......... 107 GAGTCATTTA TCATTTCACC AAGTCCCGGG CCGGGTAATG TTCGTGCGGA GTTTCTTGCA 5452 .......... .......... .......... .......... .......... .......... 107 TATGTCACCG AGTTCCTCAC TAGAGGGCCG GGTATGTATA TTATATATAT GATTGGTGAT 5392 .......... .......... .......... .......... .......... .......... 107 GAGGATGGTT ATGATGATGA TGATGACGGA GATGACGTGA TGATTATTTT GCCGAGCCCC 5332 .......... .......... .......... .......... .......... .......... 107 TTACTAGGGA AGCTGGGCAC CTTAAATGTT AAATATATGC ATGATTTTCA CTTAAAAAGT 5272 .......... .......... .......... .......... .......... .......... 107 ATATGTGTAG CGATATTTTT TTTCGAGTTG CCACATTGGT ATCCTGTCAT CTTTACCTTA 5212 .......... .......... .......... .......... .......... .......... 107 TGCTTTACAT ACTCAGTACA TTGTTCGTAC TGACCCCCCT TTCCTCGGGG GGCTGCGTTT 5152 .......... .......... .......... .......... .......... .......... 107 CATGCCCGCA GGTGTAGACG CGCAGTTCGG TGATCCTCCC GCCTAGGATA TCTACTCTGC 5092 ||||||||| | |||||||| |||||||||| |||||||||| |||||||||| .......... .GTGTAGACG CTCAGTTCGG TGATCCTCCC GCCTAGGATA TCTACTCTGC 156 TGATTGGGAG AGCTCCACTG TTCCGGAGCC CATTCGTTTT GGTACATAAC TT-TTGTGTA 5033 | ||||||| |||||||||| |||||||||| || ||||||| |||||||||| || || |||| TTTTTGGGAG AGCTCCACTG TTCCGGAGCC CAGTCGTTTT GGTACATAAC TTCTTATGTA 216 GTCTTTTGCT CGTCTATGGG TATGGCGGGG CCCTGTCCCG TCGAGTTTCA CTAATGTACC 4973 |||||||||| ||||||||| |||||||||| |||||||||| || ||||||| ||| | ||| GTCTTTTGCT TGTCTATGGG TATGGCGGGG CCCTGTCCCG TCAAGTTTCA CTACTATACT 276 CTTAGAGGTC TGTGGGCATT ATGTGGGTTG TATATATATG TTTTGGATAA TGGTCTGGAC 4913 |||||||||| ||| | ||| ||||||||| |||| |||| |||||||||| |||||||||| CTTAGAGGTC TGTAGACATC GTGTGGGTTG TATAATTATG TTTTGGATAA TGGTCTGGAC 336 ATGGTTTGTT TGGGATGTCC GCTTGTACAG GGGCAGCCTT GTCGGCTGTG TACATCATTA 4853 |||||||||| |||||||||| ||||||| | |||||||| ||||| |||| |||||||| ATGGTTTGTT TGGGATGTCC ATTTGTACAA GTGCAGCCTT GTCGGTTGTG AACATCATTG 396 TGCTTTGAAT AGTGGCGGCC TTGTCGGCTC GCGTATGCTG TTATGGTTGA ATGGTTATGA 4793 || ||| | |||||| ||| | ||||||| ||||||||| ||||| TGTATTGTGT AGTGGCAGCC TCGTCGGCT- GCGTATGCTA TTATG..... .......... 440 CTCCTTATGA GACAGGTCCT CTTATATATA TATATGACGT TGGGGTTGGC TTGATTTGAT 4733 .......... .......... .......... .......... .......... .......... 440 TAAATTCCAT ATTGTCTTAG TTTCAGTTGG TCATACTTAG CAGGTTTGTA T 4682 |||| | | .......... .......... .......... .......... ...TTTTGGA T 448 hqPGS_C06HBa0057J04.1-3-_SGN-E374135+ (6530 6472,5910 5864,5140 4808,4689 4682) ******************************************************************************** EST sequence 2 -strand 432 n (File: SGN-E225616-) 1 TATTCGGTGT AGACGCTCAG TTCGGTGATC CTCCCGCCTA GGATATCTAC TCTGCTTTTT 61 GGGAGAGCTC CACTGTTCCG GAGCCCAGTC GTTTTGGTAC ATAACTTCTT ATGTAGTCTT 121 TTGCTTGTCT ATGGGTATGG CGGGGCCCTG TCCCGTCAAG TTTCACTACT ATACTCTTAG 181 AGGTCTGTAG ACATCGTGTG GGTTGTATAA TTATGTTTTG GATAATGGTC TGGACATGGT 241 TTGTTTGGGA TGTCCATTTG TACAAGTGCA GCCTTGTCGG TTGTGAACAT CATTGTGTAT 301 TGTGTAGTGG CAGCCTCGTC GGCTGCGTAT GCTATTATGT TTTGGATAGT GGCGGCCTTG 361 TCGGCTCGCA TATGTTGTTA CGATTTAATG GTTATGACTC TTTATGAAAA AACCAAAAAA 421 AAAAAAAAAA AA Predicted gene structure (within gDNA segment 5909 to 2523): Exon 1 5141 4808 ( 334 n); cDNA 6 339 ( 334 n); score: 0.900 Intron 1 4807 4690 ( 118 n); Pd: 0.000 (s: 0.82), Pa: 0.973 (s: 0) Exon 2 4689 4682 ( 8 n); cDNA 340 347 ( 8 n); score: 0.750 PPA cDNA 415 432 MATCH C06HBa0057J04.1-3- SGN-E225616- 0.900 342 0.792 C PGS_C06HBa0057J04.1-3-_SGN-E225616- (5141 4808,4689 4682) Alignment (genomic DNA sequence = upper lines): GGTGTAGACG CGCAGTTCGG TGATCCTCCC GCCTAGGATA TCTACTCTGC TGATTGGGAG 5082 |||||||||| | |||||||| |||||||||| |||||||||| |||||||||| | ||||||| GGTGTAGACG CTCAGTTCGG TGATCCTCCC GCCTAGGATA TCTACTCTGC TTTTTGGGAG 65 AGCTCCACTG TTCCGGAGCC CATTCGTTTT GGTACATAAC TT-TTGTGTA GTCTTTTGCT 5023 |||||||||| |||||||||| || ||||||| |||||||||| || || |||| |||||||||| AGCTCCACTG TTCCGGAGCC CAGTCGTTTT GGTACATAAC TTCTTATGTA GTCTTTTGCT 125 CGTCTATGGG TATGGCGGGG CCCTGTCCCG TCGAGTTTCA CTAATGTACC CTTAGAGGTC 4963 ||||||||| |||||||||| |||||||||| || ||||||| ||| | ||| |||||||||| TGTCTATGGG TATGGCGGGG CCCTGTCCCG TCAAGTTTCA CTACTATACT CTTAGAGGTC 185 TGTGGGCATT ATGTGGGTTG TATATATATG TTTTGGATAA TGGTCTGGAC ATGGTTTGTT 4903 ||| | ||| ||||||||| |||| |||| |||||||||| |||||||||| |||||||||| TGTAGACATC GTGTGGGTTG TATAATTATG TTTTGGATAA TGGTCTGGAC ATGGTTTGTT 245 TGGGATGTCC GCTTGTACAG GGGCAGCCTT GTCGGCTGTG TACATCATTA TGCTTTGAAT 4843 |||||||||| ||||||| | |||||||| ||||| |||| |||||||| || ||| | TGGGATGTCC ATTTGTACAA GTGCAGCCTT GTCGGTTGTG AACATCATTG TGTATTGTGT 305 AGTGGCGGCC TTGTCGGCTC GCGTATGCTG TTATGGTTGA ATGGTTATGA CTCCTTATGA 4783 |||||| ||| | ||||||| ||||||||| ||||| AGTGGCAGCC TCGTCGGCT- GCGTATGCTA TTATG..... .......... .......... 339 GACAGGTCCT CTTATATATA TATATGACGT TGGGGTTGGC TTGATTTGAT TAAATTCCAT 4723 .......... .......... .......... .......... .......... .......... 339 ATTGTCTTAG TTTCAGTTGG TCATACTTAG CAGGTTTGTA T 4682 |||| | | .......... .......... .......... ...TTTTGGA T 347 hqPGS_C06HBa0057J04.1-3-_SGN-E225616- (5141 4808,4689 4682) ******************************************************************************** EST sequence 10 +strand 495 n (File: SGN-E306317+) 1 TAAACTCTTG GACAGCAGCT GCAAAGAAAT TGGTGTAGAC GCTCAGTTCG GTGATCCTCC 61 CGCCTAGGAT ATCTACTCTG CTGTTTGGGA GAGCTCCACT GTTCCGGAGC CCAGTCGTTT 121 TGGTACATAA CTTCTTATGT AGTCTTTTGC TTGTCTATGG GTATGGCGGG GCCCTGTCCC 181 GTCAAGTTTC ACTACTATAC TCTTAGAGGT CTGTAGACAT CGTGTGGGTT GTATAATTAT 241 GTTTTGGATA ATGGTCTGGA CATGGTTTGT TTGGGATGTC CATTTGTACA AGTGCAGCCT 301 TGTCGGTTGT GAACATCATT GTGTATTGTG TAGTGGCAGC CTCGTCGGCT GCGTATGCTA 361 TTATGTTTTG GATAGTGGCG GCCTTGTCGG CTCGCATATG TTGTTACGAT TTAATGGTTA 421 TGACTCTTTA TGAGATAGAT CCACTTTATA TATATATATA TATATATGGC GTTGGGTTTN 481 AAAAAAAAAA AAAAA Predicted gene structure (within gDNA segment 6159 to 2143): Exon 1 5140 4808 ( 333 n); cDNA 33 365 ( 333 n); score: 0.902 Intron 1 4807 4690 ( 118 n); Pd: 0.000 (s: 0.82), Pa: 0.973 (s: 0) Exon 2 4689 4682 ( 8 n); cDNA 366 373 ( 8 n); score: 0.750 PPA cDNA 481 495 MATCH C06HBa0057J04.1-3- SGN-E306317+ 0.902 341 0.689 C PGS_C06HBa0057J04.1-3-_SGN-E306317+ (5140 4808,4689 4682) Alignment (genomic DNA sequence = upper lines): GTGTAGACGC GCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT GATTGGGAGA 5081 |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| | |||||||| GTGTAGACGC TCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT GTTTGGGAGA 92 GCTCCACTGT TCCGGAGCCC ATTCGTTTTG GTACATAACT T-TTGTGTAG TCTTTTGCTC 5022 |||||||||| |||||||||| | |||||||| |||||||||| | || ||||| ||||||||| GCTCCACTGT TCCGGAGCCC AGTCGTTTTG GTACATAACT TCTTATGTAG TCTTTTGCTT 152 GTCTATGGGT ATGGCGGGGC CCTGTCCCGT CGAGTTTCAC TAATGTACCC TTAGAGGTCT 4962 |||||||||| |||||||||| |||||||||| | |||||||| || | ||| | |||||||||| GTCTATGGGT ATGGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC TTAGAGGTCT 212 GTGGGCATTA TGTGGGTTGT ATATATATGT TTTGGATAAT GGTCTGGACA TGGTTTGTTT 4902 || | ||| |||||||||| ||| ||||| |||||||||| |||||||||| |||||||||| GTAGACATCG TGTGGGTTGT ATAATTATGT TTTGGATAAT GGTCTGGACA TGGTTTGTTT 272 GGGATGTCCG CTTGTACAGG GGCAGCCTTG TCGGCTGTGT ACATCATTAT GCTTTGAATA 4842 ||||||||| ||||||| | ||||||||| |||| |||| |||||||| | | ||| || GGGATGTCCA TTTGTACAAG TGCAGCCTTG TCGGTTGTGA ACATCATTGT GTATTGTGTA 332 GTGGCGGCCT TGTCGGCTCG CGTATGCTGT TATGGTTGAA TGGTTATGAC TCCTTATGAG 4782 ||||| |||| ||||||| | |||||||| | |||| GTGGCAGCCT CGTCGGCT-G CGTATGCTAT TATG...... .......... .......... 365 ACAGGTCCTC TTATATATAT ATATGACGTT GGGGTTGGCT TGATTTGATT AAATTCCATA 4722 .......... .......... .......... .......... .......... .......... 365 TTGTCTTAGT TTCAGTTGGT CATACTTAGC AGGTTTGTAT 4682 |||| || .......... .......... .......... ..TTTTGGAT 373 hqPGS_C06HBa0057J04.1-3-_SGN-E306317+ (5140 4808,4689 4682) ******************************************************************************** EST sequence 19 +strand 523 n (File: SGN-E303695+) 1 AAATGGAGAA AACCAGCCAT TAAACTCTTG GACAGCAGCT GCAAAGAAAT TGGTGTAGAC 61 GCTCAGTTCG GTGATCCTCC CGCCTAGGAT ATCTACTCTG CTTTTTGGGA GAGCTCCACT 121 GTTCCGGAGC CCAGTCGTTT TGGTACATAA CTTCTTATGT AGTCTTTTGC TTGTCTATGG 181 GTATGGCGGG GCCCTGTCCC GTCAAGTTTC ACTACTATAC TCTTAGAGGT CTGTAGACAT 241 CGTGTGGGTT GTATAATTAT GTTTTGGATA ATGGTCTGGA CATGGTTTGT TTGGGATGTC 301 CATTTGTACA AGTGCAGCCT TGTCGGTTGT GAACATCATT GTGTATTGTG TAGTGGCAGC 361 CTCGTCGGCT GCGTATGCTA TTATGTTTTG GATAGTGGCG GCCTTGTCGG CTCGCATATG 421 TTGTTACGAT TTAATGGTTA TGACTCTTTA TGAGATAGAT CCACTTTATA TATATATATA 481 TATATATGGC GTTGGGTTTA GCTTGATTTG ATTAAAAAAA AAA Predicted gene structure (within gDNA segment 6359 to 2063): Exon 1 5140 4808 ( 333 n); cDNA 53 385 ( 333 n); score: 0.899 Intron 1 4807 4690 ( 118 n); Pd: 0.000 (s: 0.82), Pa: 0.973 (s: 0) Exon 2 4689 4682 ( 8 n); cDNA 386 393 ( 8 n); score: 0.750 PPA cDNA 514 523 MATCH C06HBa0057J04.1-3- SGN-E303695+ 0.899 341 0.652 C PGS_C06HBa0057J04.1-3-_SGN-E303695+ (5140 4808,4689 4682) Alignment (genomic DNA sequence = upper lines): GTGTAGACGC GCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT GATTGGGAGA 5081 |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| |||||||| GTGTAGACGC TCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT TTTTGGGAGA 112 GCTCCACTGT TCCGGAGCCC ATTCGTTTTG GTACATAACT T-TTGTGTAG TCTTTTGCTC 5022 |||||||||| |||||||||| | |||||||| |||||||||| | || ||||| ||||||||| GCTCCACTGT TCCGGAGCCC AGTCGTTTTG GTACATAACT TCTTATGTAG TCTTTTGCTT 172 GTCTATGGGT ATGGCGGGGC CCTGTCCCGT CGAGTTTCAC TAATGTACCC TTAGAGGTCT 4962 |||||||||| |||||||||| |||||||||| | |||||||| || | ||| | |||||||||| GTCTATGGGT ATGGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC TTAGAGGTCT 232 GTGGGCATTA TGTGGGTTGT ATATATATGT TTTGGATAAT GGTCTGGACA TGGTTTGTTT 4902 || | ||| |||||||||| ||| ||||| |||||||||| |||||||||| |||||||||| GTAGACATCG TGTGGGTTGT ATAATTATGT TTTGGATAAT GGTCTGGACA TGGTTTGTTT 292 GGGATGTCCG CTTGTACAGG GGCAGCCTTG TCGGCTGTGT ACATCATTAT GCTTTGAATA 4842 ||||||||| ||||||| | ||||||||| |||| |||| |||||||| | | ||| || GGGATGTCCA TTTGTACAAG TGCAGCCTTG TCGGTTGTGA ACATCATTGT GTATTGTGTA 352 GTGGCGGCCT TGTCGGCTCG CGTATGCTGT TATGGTTGAA TGGTTATGAC TCCTTATGAG 4782 ||||| |||| ||||||| | |||||||| | |||| GTGGCAGCCT CGTCGGCT-G CGTATGCTAT TATG...... .......... .......... 385 ACAGGTCCTC TTATATATAT ATATGACGTT GGGGTTGGCT TGATTTGATT AAATTCCATA 4722 .......... .......... .......... .......... .......... .......... 385 TTGTCTTAGT TTCAGTTGGT CATACTTAGC AGGTTTGTAT 4682 |||| || .......... .......... .......... ..TTTTGGAT 393 hqPGS_C06HBa0057J04.1-3-_SGN-E303695+ (5140 4808,4689 4682) ******************************************************************************** EST sequence 20 +strand 519 n (File: SGN-E310669+) 1 AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 61 ATTTATACCT TGAGCAGTAA ATATTGGACG TACGGCTCGA CTATTCGGTG TAGACGCTCA 121 GTTCGGTGAT CCTCCCGCCT AGGATATCTA CTCTGCTGTT TGGGAGAGCT CCACTGTTCC 181 GGAGCCCAGT CGTTTTGGTA CATAACTTCT TATGTAGTCT TTTGCTTGTC TATGGGTATG 241 GCGGGGCCCT GTCCCGTCAA GTTTCACTAC TATACTCTTA GAGGTCTGTA GACATCGTGT 301 GGGTTGTATA ATTATGTTTT GGATAATGGT CTGGACATGG TTTGTTTGGG ATGTCCATTT 361 GTACAAGTGC AGCCTTGTCG GTTGTGAACA TCATTGTGTA TTGTGTAGTG GCAGCCTCGT 421 CGGCTGCGTA TGCTATTATG TTTTGGATAG TGGCGGCCTT GTCGGCTCGC ATATGTTGTT 481 ACGATTTAAT GGTTATGACT CTTTATGAAA AAAAAAAAA Predicted gene structure (within gDNA segment 6814 to 2653): Exon 1 6530 6472 ( 59 n); cDNA 1 60 ( 60 n); score: 0.839 Intron 1 6471 5911 ( 561 n); Pd: 0.997 (s: 0.81), Pa: 0.868 (s: 0.89) Exon 2 5910 5864 ( 47 n); cDNA 61 107 ( 47 n); score: 0.894 Intron 2 5863 5141 ( 723 n); Pd: 0.994 (s: 0.89), Pa: 0.000 (s: 0.98) Exon 3 5140 4808 ( 333 n); cDNA 108 440 ( 333 n); score: 0.902 PPA cDNA 508 519 MATCH C06HBa0057J04.1-3- SGN-E310669+ 0.893 439 0.846 C PGS_C06HBa0057J04.1-3-_SGN-E310669+ (6530 6472,5910 5864,5140 4808) Alignment (genomic DNA sequence = upper lines): AGCCATGGAA ATGGAGA-AA CCAACCCTGC AACTCTTGGC CAGCAGCTGC AAATAATTTG 6472 |||||||||| ||||||| || ||| || | ||||||||| |||||||||| ||| || ||| AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 60 GTTAGTAATC TCCTTGTTTG GTGTGTTAAT TCTTTAGAAT ACCCTTGTTA ATTATCCATT 6412 .......... .......... .......... .......... .......... .......... 60 AATTTTAAGA AGGGGGCGTG ACCAGTAGCT TAGGAAGTTT GTTTTAGTTA TTGAATGTAC 6352 .......... .......... .......... .......... .......... .......... 60 TAAGTATGAA TGGAAACCAT AATCGGATTA TTAGTGGTGT CGTGTTGGTG CTTGGGCTGT 6292 .......... .......... .......... .......... .......... .......... 60 TTTGATTAAA GCAAACTGCA GGAAAATTAT GTTTTGGCAT TATGTATATG TTGAATGTGA 6232 .......... .......... .......... .......... .......... .......... 60 TTATGAGTAT ATACTCCAAA GGATGAATAC GATAAGGTAG ATGTGTTACG AATTATAAAA 6172 .......... .......... .......... .......... .......... .......... 60 CGAGTTATCA CTCGGTGTGT CGTTGCTTCG CTGATATAGT TGCCGAGATG GAACTGTTTT 6112 .......... .......... .......... .......... .......... .......... 60 GGGGAGGGGG CTGTTTAATA TGATTCTTTG GGTTATATGT GTTATTGGTA TTGTTGTGGA 6052 .......... .......... .......... .......... .......... .......... 60 TAATTTGGAT TGTTGTCGGA TTGGGACGAA GTAAGGAAAA TAGGGGAGGT GCTGCCGAAT 5992 .......... .......... .......... .......... .......... .......... 60 TTTCGTTAGA TTATTAGCTA GCTTACAAGA AAGTAAAGCA CGATGTTTAT CTAATTGCGG 5932 .......... .......... .......... .......... .......... .......... 60 CACGATTGTT GCTTGTTATA GATTAATAGC TTGAGCAGTA AATATTGGAC GTGCGGCTCG 5872 ||| ||| | |||||||||| |||||||||| || ||||||| .......... .......... .ATTTATACC TTGAGCAGTA AATATTGGAC GTACGGCTCG 99 ATTATACGGT ATGTAACGCT GTCCCTTCTT TCTTTGTTTG GCATGACTTT TAAAAATAAG 5812 | ||| || ACTATTCG.. .......... .......... .......... .......... .......... 107 CGAATAACGG ACAGATTTGA TACTTACCTC TAAAGCGTCT AGGTGATGTA TATTCTTGCT 5752 .......... .......... .......... .......... .......... .......... 107 TCCACAATTA TTCCTCTATA TATCGGTTAT GTCTAAGGAT ATGATGATCT CTAATATCTA 5692 .......... .......... .......... .......... .......... .......... 107 TGGTAATGCT TCTTAGAGTC ATTGAAATTT TACGTTTTCA TATCGTATTA AAGGTTCATA 5632 .......... .......... .......... .......... .......... .......... 107 ATCTTGATAA AACATTAATC TTTGGTAATA CTCCTTGCTG GTTCACGTTG ATTGTTCTAT 5572 .......... .......... .......... .......... .......... .......... 107 TGAGTTATAA GAAATGATTT TAATTGCATA TGGTTGCTCA TAATATTCTG CTCGTGCATA 5512 .......... .......... .......... .......... .......... .......... 107 GAGTCATTTA TCATTTCACC AAGTCCCGGG CCGGGTAATG TTCGTGCGGA GTTTCTTGCA 5452 .......... .......... .......... .......... .......... .......... 107 TATGTCACCG AGTTCCTCAC TAGAGGGCCG GGTATGTATA TTATATATAT GATTGGTGAT 5392 .......... .......... .......... .......... .......... .......... 107 GAGGATGGTT ATGATGATGA TGATGACGGA GATGACGTGA TGATTATTTT GCCGAGCCCC 5332 .......... .......... .......... .......... .......... .......... 107 TTACTAGGGA AGCTGGGCAC CTTAAATGTT AAATATATGC ATGATTTTCA CTTAAAAAGT 5272 .......... .......... .......... .......... .......... .......... 107 ATATGTGTAG CGATATTTTT TTTCGAGTTG CCACATTGGT ATCCTGTCAT CTTTACCTTA 5212 .......... .......... .......... .......... .......... .......... 107 TGCTTTACAT ACTCAGTACA TTGTTCGTAC TGACCCCCCT TTCCTCGGGG GGCTGCGTTT 5152 .......... .......... .......... .......... .......... .......... 107 CATGCCCGCA GGTGTAGACG CGCAGTTCGG TGATCCTCCC GCCTAGGATA TCTACTCTGC 5092 ||||||||| | |||||||| |||||||||| |||||||||| |||||||||| .......... .GTGTAGACG CTCAGTTCGG TGATCCTCCC GCCTAGGATA TCTACTCTGC 156 TGATTGGGAG AGCTCCACTG TTCCGGAGCC CATTCGTTTT GGTACATAAC TT-TTGTGTA 5033 || ||||||| |||||||||| |||||||||| || ||||||| |||||||||| || || |||| TGTTTGGGAG AGCTCCACTG TTCCGGAGCC CAGTCGTTTT GGTACATAAC TTCTTATGTA 216 GTCTTTTGCT CGTCTATGGG TATGGCGGGG CCCTGTCCCG TCGAGTTTCA CTAATGTACC 4973 |||||||||| ||||||||| |||||||||| |||||||||| || ||||||| ||| | ||| GTCTTTTGCT TGTCTATGGG TATGGCGGGG CCCTGTCCCG TCAAGTTTCA CTACTATACT 276 CTTAGAGGTC TGTGGGCATT ATGTGGGTTG TATATATATG TTTTGGATAA TGGTCTGGAC 4913 |||||||||| ||| | ||| ||||||||| |||| |||| |||||||||| |||||||||| CTTAGAGGTC TGTAGACATC GTGTGGGTTG TATAATTATG TTTTGGATAA TGGTCTGGAC 336 ATGGTTTGTT TGGGATGTCC GCTTGTACAG GGGCAGCCTT GTCGGCTGTG TACATCATTA 4853 |||||||||| |||||||||| ||||||| | |||||||| ||||| |||| |||||||| ATGGTTTGTT TGGGATGTCC ATTTGTACAA GTGCAGCCTT GTCGGTTGTG AACATCATTG 396 TGCTTTGAAT AGTGGCGGCC TTGTCGGCTC GCGTATGCTG TTATG 4808 || ||| | |||||| ||| | ||||||| ||||||||| ||||| TGTATTGTGT AGTGGCAGCC TCGTCGGCT- GCGTATGCTA TTATG 440 hqPGS_C06HBa0057J04.1-3-_SGN-E310669+ (6530 6472,5910 5864,5140 4808) ******************************************************************************** EST sequence 11 +strand 606 n (File: SGN-E538151+) 1 GAGAAAACCA GCCATTAAAC TCTTGGACAG CAGCTGCAAA GAAATTGAGT CATTTATCAT 61 TTCACCGAGT CCCGGGCCGG GTAATGTTCG TGCGGAGTTT CTTGCATATG TCACCGAGTC 121 CCTCACTAGA GGGCCGGGAA TGTATATTAT ATATATGATT GGTGATGAGG ATGGTTATGA 181 TGATGATGAT GACGGAGATG ATGTGATGAC TATTTCACTG AGTCCCTCAC TAGAGGGCCG 241 GGTGTAGACG CTCAGTTTGG TGATCCTCCC GCCTAGGATA TCTACTCTGC TGTTTGGGAG 301 AGCTCCACTG TTCCGGAGCC CAGTCGTTTT GGTACATAAC TTCTTATGTA GTCTTTTGCT 361 TGTCTATGGG TATGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC TTAGAGGTCT 421 GTAGACATCG TGTGGGTTGT ATAATTATGT TTTTGATAAT GGTCTGGACA TGGTTTGTTT 481 GGGATGTCCA CTTGTACAAG TGCAACCTTG TCGGTTGTGT ACATCTTTGT GTATTGTGTA 541 GTGGCAGCCT TGTCGGCTGC GTATGCTATT ATGCTTTGAA TAGTGGCAGC CTTGTCGGCT 601 CGCGTA Predicted gene structure (within gDNA segment 6571 to 4208): Exon 1 6514 6472 ( 43 n); cDNA 5 47 ( 43 n); score: 0.837 Intron 1 6471 5511 ( 961 n); Pd: 0.997 (s: 0.84), Pa: 0.966 (s: 0.98) Exon 2 5510 5316 ( 195 n); cDNA 48 241 ( 194 n); score: 0.928 Intron 2 5315 5141 ( 175 n); Pd: 0.000 (s: 0.78), Pa: 0.000 (s: 0.96) Exon 3 5140 4808 ( 333 n); cDNA 242 573 ( 332 n); score: 0.896 MATCH C06HBa0057J04.1-3- SGN-E538151+ 0.908 571 0.942 C PGS_C06HBa0057J04.1-3-_SGN-E538151+ (6514 6472,5510 5316,5140 4808) Alignment (genomic DNA sequence = upper lines): AAACCAACCC TGCAACTCTT GGCCAGCAGC TGCAAATAAT TTGGTTAGTA ATCTCCTTGT 6455 |||||| || | ||||||| || ||||||| |||||| || ||| AAACCAGCCA TTAAACTCTT GGACAGCAGC TGCAAAGAAA TTG....... .......... 47 TTGGTGTGTT AATTCTTTAG AATACCCTTG TTAATTATCC ATTAATTTTA AGAAGGGGGC 6395 .......... .......... .......... .......... .......... .......... 47 GTGACCAGTA GCTTAGGAAG TTTGTTTTAG TTATTGAATG TACTAAGTAT GAATGGAAAC 6335 .......... .......... .......... .......... .......... .......... 47 CATAATCGGA TTATTAGTGG TGTCGTGTTG GTGCTTGGGC TGTTTTGATT AAAGCAAACT 6275 .......... .......... .......... .......... .......... .......... 47 GCAGGAAAAT TATGTTTTGG CATTATGTAT ATGTTGAATG TGATTATGAG TATATACTCC 6215 .......... .......... .......... .......... .......... .......... 47 AAAGGATGAA TACGATAAGG TAGATGTGTT ACGAATTATA AAACGAGTTA TCACTCGGTG 6155 .......... .......... .......... .......... .......... .......... 47 TGTCGTTGCT TCGCTGATAT AGTTGCCGAG ATGGAACTGT TTTGGGGAGG GGGCTGTTTA 6095 .......... .......... .......... .......... .......... .......... 47 ATATGATTCT TTGGGTTATA TGTGTTATTG GTATTGTTGT GGATAATTTG GATTGTTGTC 6035 .......... .......... .......... .......... .......... .......... 47 GGATTGGGAC GAAGTAAGGA AAATAGGGGA GGTGCTGCCG AATTTTCGTT AGATTATTAG 5975 .......... .......... .......... .......... .......... .......... 47 CTAGCTTACA AGAAAGTAAA GCACGATGTT TATCTAATTG CGGCACGATT GTTGCTTGTT 5915 .......... .......... .......... .......... .......... .......... 47 ATAGATTAAT AGCTTGAGCA GTAAATATTG GACGTGCGGC TCGATTATAC GGTATGTAAC 5855 .......... .......... .......... .......... .......... .......... 47 GCTGTCCCTT CTTTCTTTGT TTGGCATGAC TTTTAAAAAT AAGCGAATAA CGGACAGATT 5795 .......... .......... .......... .......... .......... .......... 47 TGATACTTAC CTCTAAAGCG TCTAGGTGAT GTATATTCTT GCTTCCACAA TTATTCCTCT 5735 .......... .......... .......... .......... .......... .......... 47 ATATATCGGT TATGTCTAAG GATATGATGA TCTCTAATAT CTATGGTAAT GCTTCTTAGA 5675 .......... .......... .......... .......... .......... .......... 47 GTCATTGAAA TTTTACGTTT TCATATCGTA TTAAAGGTTC ATAATCTTGA TAAAACATTA 5615 .......... .......... .......... .......... .......... .......... 47 ATCTTTGGTA ATACTCCTTG CTGGTTCACG TTGATTGTTC TATTGAGTTA TAAGAAATGA 5555 .......... .......... .......... .......... .......... .......... 47 TTTTAATTGC ATATGGTTGC TCATAATATT CTGCTCGTGC ATAGAGTCAT TTATCATTTC 5495 |||||| |||||||||| .......... .......... .......... .......... ....AGTCAT TTATCATTTC 63 ACCAAGTCCC GGGCCGGGTA ATGTTCGTGC GGAGTTTCTT GCATATGTCA CCGAGTTCCT 5435 ||| |||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||| ||| ACCGAGTCCC GGGCCGGGTA ATGTTCGTGC GGAGTTTCTT GCATATGTCA CCGAGTCCCT 123 CACTAGAGGG CCGGGTATGT ATATTATATA TATGATTGGT GATGAGGATG GTTATGATGA 5375 |||||||||| ||||| |||| |||||||||| |||||||||| |||||||||| |||||||||| CACTAGAGGG CCGGGAATGT ATATTATATA TATGATTGGT GATGAGGATG GTTATGATGA 183 TGATGATGAC GGAGATGACG TGATGATTAT TTTGCCGAGC CCCTTACTAG GGAAGCTGGG 5315 |||||||||| |||||||| | |||||| ||| || | ||| |||| ||||| | || || TGATGATGAC GGAGATGATG TGATGACTAT TTCACTGAGT CCCTCACTAG AG-GGCCGG. 241 CACCTTAAAT GTTAAATATA TGCATGATTT TCACTTAAAA AGTATATGTG TAGCGATATT 5255 .......... .......... .......... .......... .......... .......... 241 TTTTTTCGAG TTGCCACATT GGTATCCTGT CATCTTTACC TTATGCTTTA CATACTCAGT 5195 .......... .......... .......... .......... .......... .......... 241 ACATTGTTCG TACTGACCCC CCTTTCCTCG GGGGGCTGCG TTTCATGCCC GCAGGTGTAG 5135 |||||| .......... .......... .......... .......... .......... ....GTGTAG 247 ACGCGCAGTT CGGTGATCCT CCCGCCTAGG ATATCTACTC TGCTGATTGG GAGAGCTCCA 5075 |||| ||||| ||||||||| |||||||||| |||||||||| ||||| |||| |||||||||| ACGCTCAGTT TGGTGATCCT CCCGCCTAGG ATATCTACTC TGCTGTTTGG GAGAGCTCCA 307 CTGTTCCGGA GCCCATTCGT TTTGGTACAT AACTT-TTGT GTAGTCTTTT GCTCGTCTAT 5016 |||||||||| ||||| |||| |||||||||| ||||| || | |||||||||| ||| |||||| CTGTTCCGGA GCCCAGTCGT TTTGGTACAT AACTTCTTAT GTAGTCTTTT GCTTGTCTAT 367 GGGTATGGCG GGGCCCTGTC CCGTCGAGTT TCACTAATGT ACCCTTAGAG GTCTGTGGGC 4956 |||||| ||| |||||||||| ||||| |||| |||||| | | || ||||||| |||||| | | GGGTAT-GCG GGGCCCTGTC CCGTCAAGTT TCACTACTAT ACTCTTAGAG GTCTGTAGAC 426 ATTATGTGGG TTGTATATAT ATGTTTTGGA TAATGGTCTG GACATGGTTT GTTTGGGATG 4896 || |||||| ||||||| | ||||||| || |||||||||| |||||||||| |||||||||| ATCGTGTGGG TTGTATAATT ATGTTTTTGA TAATGGTCTG GACATGGTTT GTTTGGGATG 486 TCCGCTTGTA CAGGGGCAGC CTTGTCGGCT GTGTACATCA TTATGCTTTG AATAGTGGCG 4836 ||| |||||| || | ||| | |||||||| | ||||||||| || || ||| ||||||| TCCACTTGTA CAAGTGCAAC CTTGTCGGTT GTGTACATCT TTGTGTATTG TGTAGTGGCA 546 GCCTTGTCGG CTCGCGTATG CTGTTATG 4808 |||||||||| || ||||||| || ||||| GCCTTGTCGG CT-GCGTATG CTATTATG 573 hqPGS_C06HBa0057J04.1-3-_SGN-E538151+ (6514 6472,5510 5316,5140 4808) ******************************************************************************** EST sequence 12 +strand 644 n (File: SGN-E538156+) 1 GAGAAAACCA GCCATTAAAC TCTTGGACAG CAGCTGCAAA GAAATTGAGT CATTTATCAT 61 TTCACCGAGT CCCGGGCCGG GTAATGTTCG TGCGGAGTTT CTTGCATATG TCACCGAGTC 121 CCTCACTAGA GGGCCGGGAA TGTATATTAT ATATATGATT GGTGATGAGG ATGGTTATGA 181 TGATGATGAT GACGGAGATG ATGTGATGAC TATTTCACTG AGTCCCTCAC TAGAGGGCCG 241 GGTGTAGACG CTCAGTTTGG TGATCCTCCC GCCTAGGATA TCTACTCTGC TGTTTGGGAG 301 AGCTCCACTG TTCCGGAGCC CAGTCGTTGT GGTACATAAC TTCTTATGTA GTCTTTTGCT 361 TGTCTATGGG TATGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC TTAGAGGTCT 421 GTAGACATCG TGTGGGTTGT ATAATTATGT TTTTGATAAT GGTCTGGACA TGGTTTGTTT 481 GGGATGTCCA CTTGTACAAG TGCAACCTTG TCGGTTGTGT ACATCTTTGT GTATTGTGTA 541 GTGGCAGCCT TGACGGCTGC GTATGCTATT ATGCTTTGAA TAGTGGCAGC CTTGTCGGCT 601 CGCGTATGTT GTTACGGTTG AATGGGTATG ACTCTTTATG AGAT Predicted gene structure (within gDNA segment 6571 to 3846): Exon 1 6514 6472 ( 43 n); cDNA 5 47 ( 43 n); score: 0.837 Intron 1 6471 5511 ( 961 n); Pd: 0.997 (s: 0.84), Pa: 0.966 (s: 0.98) Exon 2 5510 5316 ( 195 n); cDNA 48 241 ( 194 n); score: 0.928 Intron 2 5315 5141 ( 175 n); Pd: 0.000 (s: 0.78), Pa: 0.000 (s: 0.96) Exon 3 5140 4808 ( 333 n); cDNA 242 573 ( 332 n); score: 0.890 MATCH C06HBa0057J04.1-3- SGN-E538156+ 0.904 571 0.887 C PGS_C06HBa0057J04.1-3-_SGN-E538156+ (6514 6472,5510 5316,5140 4808) Alignment (genomic DNA sequence = upper lines): AAACCAACCC TGCAACTCTT GGCCAGCAGC TGCAAATAAT TTGGTTAGTA ATCTCCTTGT 6455 |||||| || | ||||||| || ||||||| |||||| || ||| AAACCAGCCA TTAAACTCTT GGACAGCAGC TGCAAAGAAA TTG....... .......... 47 TTGGTGTGTT AATTCTTTAG AATACCCTTG TTAATTATCC ATTAATTTTA AGAAGGGGGC 6395 .......... .......... .......... .......... .......... .......... 47 GTGACCAGTA GCTTAGGAAG TTTGTTTTAG TTATTGAATG TACTAAGTAT GAATGGAAAC 6335 .......... .......... .......... .......... .......... .......... 47 CATAATCGGA TTATTAGTGG TGTCGTGTTG GTGCTTGGGC TGTTTTGATT AAAGCAAACT 6275 .......... .......... .......... .......... .......... .......... 47 GCAGGAAAAT TATGTTTTGG CATTATGTAT ATGTTGAATG TGATTATGAG TATATACTCC 6215 .......... .......... .......... .......... .......... .......... 47 AAAGGATGAA TACGATAAGG TAGATGTGTT ACGAATTATA AAACGAGTTA TCACTCGGTG 6155 .......... .......... .......... .......... .......... .......... 47 TGTCGTTGCT TCGCTGATAT AGTTGCCGAG ATGGAACTGT TTTGGGGAGG GGGCTGTTTA 6095 .......... .......... .......... .......... .......... .......... 47 ATATGATTCT TTGGGTTATA TGTGTTATTG GTATTGTTGT GGATAATTTG GATTGTTGTC 6035 .......... .......... .......... .......... .......... .......... 47 GGATTGGGAC GAAGTAAGGA AAATAGGGGA GGTGCTGCCG AATTTTCGTT AGATTATTAG 5975 .......... .......... .......... .......... .......... .......... 47 CTAGCTTACA AGAAAGTAAA GCACGATGTT TATCTAATTG CGGCACGATT GTTGCTTGTT 5915 .......... .......... .......... .......... .......... .......... 47 ATAGATTAAT AGCTTGAGCA GTAAATATTG GACGTGCGGC TCGATTATAC GGTATGTAAC 5855 .......... .......... .......... .......... .......... .......... 47 GCTGTCCCTT CTTTCTTTGT TTGGCATGAC TTTTAAAAAT AAGCGAATAA CGGACAGATT 5795 .......... .......... .......... .......... .......... .......... 47 TGATACTTAC CTCTAAAGCG TCTAGGTGAT GTATATTCTT GCTTCCACAA TTATTCCTCT 5735 .......... .......... .......... .......... .......... .......... 47 ATATATCGGT TATGTCTAAG GATATGATGA TCTCTAATAT CTATGGTAAT GCTTCTTAGA 5675 .......... .......... .......... .......... .......... .......... 47 GTCATTGAAA TTTTACGTTT TCATATCGTA TTAAAGGTTC ATAATCTTGA TAAAACATTA 5615 .......... .......... .......... .......... .......... .......... 47 ATCTTTGGTA ATACTCCTTG CTGGTTCACG TTGATTGTTC TATTGAGTTA TAAGAAATGA 5555 .......... .......... .......... .......... .......... .......... 47 TTTTAATTGC ATATGGTTGC TCATAATATT CTGCTCGTGC ATAGAGTCAT TTATCATTTC 5495 |||||| |||||||||| .......... .......... .......... .......... ....AGTCAT TTATCATTTC 63 ACCAAGTCCC GGGCCGGGTA ATGTTCGTGC GGAGTTTCTT GCATATGTCA CCGAGTTCCT 5435 ||| |||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||| ||| ACCGAGTCCC GGGCCGGGTA ATGTTCGTGC GGAGTTTCTT GCATATGTCA CCGAGTCCCT 123 CACTAGAGGG CCGGGTATGT ATATTATATA TATGATTGGT GATGAGGATG GTTATGATGA 5375 |||||||||| ||||| |||| |||||||||| |||||||||| |||||||||| |||||||||| CACTAGAGGG CCGGGAATGT ATATTATATA TATGATTGGT GATGAGGATG GTTATGATGA 183 TGATGATGAC GGAGATGACG TGATGATTAT TTTGCCGAGC CCCTTACTAG GGAAGCTGGG 5315 |||||||||| |||||||| | |||||| ||| || | ||| |||| ||||| | || || TGATGATGAC GGAGATGATG TGATGACTAT TTCACTGAGT CCCTCACTAG AG-GGCCGG. 241 CACCTTAAAT GTTAAATATA TGCATGATTT TCACTTAAAA AGTATATGTG TAGCGATATT 5255 .......... .......... .......... .......... .......... .......... 241 TTTTTTCGAG TTGCCACATT GGTATCCTGT CATCTTTACC TTATGCTTTA CATACTCAGT 5195 .......... .......... .......... .......... .......... .......... 241 ACATTGTTCG TACTGACCCC CCTTTCCTCG GGGGGCTGCG TTTCATGCCC GCAGGTGTAG 5135 |||||| .......... .......... .......... .......... .......... ....GTGTAG 247 ACGCGCAGTT CGGTGATCCT CCCGCCTAGG ATATCTACTC TGCTGATTGG GAGAGCTCCA 5075 |||| ||||| ||||||||| |||||||||| |||||||||| ||||| |||| |||||||||| ACGCTCAGTT TGGTGATCCT CCCGCCTAGG ATATCTACTC TGCTGTTTGG GAGAGCTCCA 307 CTGTTCCGGA GCCCATTCGT TTTGGTACAT AACTT-TTGT GTAGTCTTTT GCTCGTCTAT 5016 |||||||||| ||||| |||| | |||||||| ||||| || | |||||||||| ||| |||||| CTGTTCCGGA GCCCAGTCGT TGTGGTACAT AACTTCTTAT GTAGTCTTTT GCTTGTCTAT 367 GGGTATGGCG GGGCCCTGTC CCGTCGAGTT TCACTAATGT ACCCTTAGAG GTCTGTGGGC 4956 |||||| ||| |||||||||| ||||| |||| |||||| | | || ||||||| |||||| | | GGGTAT-GCG GGGCCCTGTC CCGTCAAGTT TCACTACTAT ACTCTTAGAG GTCTGTAGAC 426 ATTATGTGGG TTGTATATAT ATGTTTTGGA TAATGGTCTG GACATGGTTT GTTTGGGATG 4896 || |||||| ||||||| | ||||||| || |||||||||| |||||||||| |||||||||| ATCGTGTGGG TTGTATAATT ATGTTTTTGA TAATGGTCTG GACATGGTTT GTTTGGGATG 486 TCCGCTTGTA CAGGGGCAGC CTTGTCGGCT GTGTACATCA TTATGCTTTG AATAGTGGCG 4836 ||| |||||| || | ||| | |||||||| | ||||||||| || || ||| ||||||| TCCACTTGTA CAAGTGCAAC CTTGTCGGTT GTGTACATCT TTGTGTATTG TGTAGTGGCA 546 GCCTTGTCGG CTCGCGTATG CTGTTATG 4808 |||||| ||| || ||||||| || ||||| GCCTTGACGG CT-GCGTATG CTATTATG 573 hqPGS_C06HBa0057J04.1-3-_SGN-E538156+ (6514 6472,5510 5316,5140 4808) ******************************************************************************** EST sequence 7 -strand 843 n (File: SGN-E544254-) 1 GAGTCATTTA TCATTTCACC GAGTCCCGGG CCGGGTAATG TTCGTGCGGA GTTTCTTGCA 61 TATGTCACCG AGTCCCTCAC TAGAGGGCCG GGAATGTATA TTATATATAT GATTGGTGAT 121 GAGGATGGTT ATGATGATGA TGATGACGGA GATGATGTGA TGACTATTTC ACCGAGTCCC 181 TCACTAGAGG GCCGGGTACT ATGATGTATA TATAATGATG ATTATTTTGC CGAGTCCCTT 241 ACTAGGGAAG TTAGGCATCT TATATGTTAA AGATATGCAT GATTTTCACT TAAAAAGTAC 301 ATGTGTAGAG ATATCTTGTT TCGACTTATC ATGTTGGTAT CCTGTCATCT TTACCTTATG 361 CTTTACATAC TCAGTACATT GTCCGTACTG ACCCCCTTTT CTCGGGGGGC TGCGTTTCAT 421 GCCCGCAGGT GTAGACGCTC AGTTCGGTGA TCCTCCCGCC TAGGATATCT ACTCTGCTTT 481 TTGGGAGAGC TCCACTGTTC CGGAGCCCAG TCGTTTTGGT ACATAACTTC TTATGTAGTC 541 TTTTGCTTGT CTATGGGTAT GGCGGGGCCC TGTCCCGTCA AGTTTCACTA CTATACTCTT 601 AGAGGTCTGT AGACATCGTG TGGGTTGTAT AATTATGTTT TGGATAATGG TCTGGACATG 661 GTTTGTTTGG GATGTCCATT TGTACAAGTG CAGCCTTGTC GGTTGTGAAC ATCATTGTGT 721 ATTGTGTAGT GGCAGCCTCG TCGGCTGCGT ATGCTATTAT GTTTTGGATA GTGGCGGCCT 781 TGTCGGCTCG CATATGTTGT TACGATTTAA TGGTTATGAC TCTTTATGAG AAAAAAAAAG 841 AAA Predicted gene structure (within gDNA segment 6121 to 2633): Exon 1 5455 5401 ( 55 n); cDNA 161 214 ( 54 n); score: 0.818 Intron 1 5400 5356 ( 45 n); Pd: 0.000 (s: 0.84), Pa: 0.000 (s: 0.88) Exon 2 5355 4808 ( 548 n); cDNA 215 761 ( 547 n); score: 0.904 PPA cDNA 829 839 MATCH C06HBa0057J04.1-3- SGN-E544254- 0.896 603 0.715 C PGS_C06HBa0057J04.1-3-_SGN-E544254- (5455 5401,5355 4808) Alignment (genomic DNA sequence = upper lines): TGCATATGTC ACCGAGTTCC TCACTAGAGG GCCGGGTATG TATATTATAT ATATGATTGG 5396 || ||| || ||||||| || |||||||||| |||||||| ||| | ||| |||| TGACTATTTC ACCGAGTCCC TCACTAGAGG GCCGGGTA-C TATGATGTAT ATATA..... 214 TGATGAGGAT GGTTATGATG ATGATGATGA CGGAGATGAC GTGATGATTA TTTTGCCGAG 5336 ||||||||| |||||||||| .......... .......... .......... .......... ATGATGATTA TTTTGCCGAG 234 CCCCTTACTA GGGAAGCTGG GCACCTTAAA TGTTAAATAT ATGCATGATT TTCACTTAAA 5276 ||||||||| |||||| | | ||| |||| | ||||||| || |||||||||| |||||||||| TCCCTTACTA GGGAAGTTAG GCATCTTATA TGTTAAAGAT ATGCATGATT TTCACTTAAA 294 AAGTATATGT GTAGCGATAT TTTTTTTCGA GTTGCCACAT TGGTATCCTG TCATCTTTAC 5216 ||||| |||| |||| ||||| || |||||| || || | |||||||||| |||||||||| AAGTACATGT GTAGAGATAT CTTGTTTCGA CTTATCATGT TGGTATCCTG TCATCTTTAC 354 CTTATGCTTT ACATACTCAG TACATTGTTC GTACTGACCC CCCTTTCCTC GGGGGGCTGC 5156 |||||||||| |||||||||| |||||||| | ||||||| || |||||| ||| |||||||||| CTTATGCTTT ACATACTCAG TACATTGTCC GTACTGA-CC CCCTTTTCTC GGGGGGCTGC 413 GTTTCATGCC CGCAGGTGTA GACGCGCAGT TCGGTGATCC TCCCGCCTAG GATATCTACT 5096 |||||||||| |||||||||| ||||| |||| |||||||||| |||||||||| |||||||||| GTTTCATGCC CGCAGGTGTA GACGCTCAGT TCGGTGATCC TCCCGCCTAG GATATCTACT 473 CTGCTGATTG GGAGAGCTCC ACTGTTCCGG AGCCCATTCG TTTTGGTACA TAACTT-TTG 5037 ||||| ||| |||||||||| |||||||||| |||||| ||| |||||||||| |||||| || CTGCTTTTTG GGAGAGCTCC ACTGTTCCGG AGCCCAGTCG TTTTGGTACA TAACTTCTTA 533 TGTAGTCTTT TGCTCGTCTA TGGGTATGGC GGGGCCCTGT CCCGTCGAGT TTCACTAATG 4977 |||||||||| |||| ||||| |||||||||| |||||||||| |||||| ||| ||||||| | TGTAGTCTTT TGCTTGTCTA TGGGTATGGC GGGGCCCTGT CCCGTCAAGT TTCACTACTA 593 TACCCTTAGA GGTCTGTGGG CATTATGTGG GTTGTATATA TATGTTTTGG ATAATGGTCT 4917 ||| |||||| ||||||| | ||| ||||| |||||||| |||||||||| |||||||||| TACTCTTAGA GGTCTGTAGA CATCGTGTGG GTTGTATAAT TATGTTTTGG ATAATGGTCT 653 GGACATGGTT TGTTTGGGAT GTCCGCTTGT ACAGGGGCAG CCTTGTCGGC TGTGTACATC 4857 |||||||||| |||||||||| |||| |||| ||| | |||| ||||||||| |||| ||||| GGACATGGTT TGTTTGGGAT GTCCATTTGT ACAAGTGCAG CCTTGTCGGT TGTGAACATC 713 ATTATGCTTT GAATAGTGGC GGCCTTGTCG GCTCGCGTAT GCTGTTATG 4808 ||| || || | ||||||| |||| |||| ||| |||||| ||| ||||| ATTGTGTATT GTGTAGTGGC AGCCTCGTCG GCT-GCGTAT GCTATTATG 761 hqPGS_C06HBa0057J04.1-3-_SGN-E544254- (5455 5401,5355 4808) ******************************************************************************** EST sequence 17 +strand 470 n (File: SGN-E268096+) 1 GAAAACCAGC CATTAAACTC TTGGACAGCA GCTGCAAAGA AATTGAGTCA TTTATCATTG 61 CACCGAGTCC CGGGCCGGGT AATGTTCGTG CGGAGTTTCT TGCATATGTC ACCGAGTCCC 121 TCACTAGAGG GCCGGGAATG TATATTATAT ATATGATTGG TGATGAGGAT GGTTATGATG 181 ATGATGATGA CGGAGATGAT GTGATGACTA TTTCACTGAG TCCCTCACTA GAGGGCCGGG 241 TGTAGACGCT CAGTTTGGTG ATCCTCCCGC CTAGGATATC TACTCTGCTG TTTGGGAGAG 301 CTCCACTGTT CCGGAGCCCA GTCGTTTTGG TACATAACTT CTTATGTAGT CTTTTGCTTG 361 TCTATGGGTA TGCGGGGCCC TGTCCCGTCA AGTTTCACTA CTATACTCTT AGAGGTCTGT 421 AGACATCGTG TGGGTAGTAT AATTATGTTT TTGATAATGG GCTGGACATG Predicted gene structure (within gDNA segment 6740 to 3130): Exon 1 6514 6472 ( 43 n); cDNA 3 45 ( 43 n); score: 0.837 Intron 1 6471 5511 ( 961 n); Pd: 0.997 (s: 0.84), Pa: 0.966 (s: 0.96) Exon 2 5510 5316 ( 195 n); cDNA 46 239 ( 194 n); score: 0.923 Intron 2 5315 5141 ( 175 n); Pd: 0.000 (s: 0.78), Pa: 0.000 (s: 0.96) Exon 3 5140 4910 ( 231 n); cDNA 240 470 ( 231 n); score: 0.903 MATCH C06HBa0057J04.1-3- SGN-E268096+ 0.912 469 0.998 C PGS_C06HBa0057J04.1-3-_SGN-E268096+ (6514 6472,5510 5316,5140 4910) Alignment (genomic DNA sequence = upper lines): AAACCAACCC TGCAACTCTT GGCCAGCAGC TGCAAATAAT TTGGTTAGTA ATCTCCTTGT 6455 |||||| || | ||||||| || ||||||| |||||| || ||| AAACCAGCCA TTAAACTCTT GGACAGCAGC TGCAAAGAAA TTG....... .......... 45 TTGGTGTGTT AATTCTTTAG AATACCCTTG TTAATTATCC ATTAATTTTA AGAAGGGGGC 6395 .......... .......... .......... .......... .......... .......... 45 GTGACCAGTA GCTTAGGAAG TTTGTTTTAG TTATTGAATG TACTAAGTAT GAATGGAAAC 6335 .......... .......... .......... .......... .......... .......... 45 CATAATCGGA TTATTAGTGG TGTCGTGTTG GTGCTTGGGC TGTTTTGATT AAAGCAAACT 6275 .......... .......... .......... .......... .......... .......... 45 GCAGGAAAAT TATGTTTTGG CATTATGTAT ATGTTGAATG TGATTATGAG TATATACTCC 6215 .......... .......... .......... .......... .......... .......... 45 AAAGGATGAA TACGATAAGG TAGATGTGTT ACGAATTATA AAACGAGTTA TCACTCGGTG 6155 .......... .......... .......... .......... .......... .......... 45 TGTCGTTGCT TCGCTGATAT AGTTGCCGAG ATGGAACTGT TTTGGGGAGG GGGCTGTTTA 6095 .......... .......... .......... .......... .......... .......... 45 ATATGATTCT TTGGGTTATA TGTGTTATTG GTATTGTTGT GGATAATTTG GATTGTTGTC 6035 .......... .......... .......... .......... .......... .......... 45 GGATTGGGAC GAAGTAAGGA AAATAGGGGA GGTGCTGCCG AATTTTCGTT AGATTATTAG 5975 .......... .......... .......... .......... .......... .......... 45 CTAGCTTACA AGAAAGTAAA GCACGATGTT TATCTAATTG CGGCACGATT GTTGCTTGTT 5915 .......... .......... .......... .......... .......... .......... 45 ATAGATTAAT AGCTTGAGCA GTAAATATTG GACGTGCGGC TCGATTATAC GGTATGTAAC 5855 .......... .......... .......... .......... .......... .......... 45 GCTGTCCCTT CTTTCTTTGT TTGGCATGAC TTTTAAAAAT AAGCGAATAA CGGACAGATT 5795 .......... .......... .......... .......... .......... .......... 45 TGATACTTAC CTCTAAAGCG TCTAGGTGAT GTATATTCTT GCTTCCACAA TTATTCCTCT 5735 .......... .......... .......... .......... .......... .......... 45 ATATATCGGT TATGTCTAAG GATATGATGA TCTCTAATAT CTATGGTAAT GCTTCTTAGA 5675 .......... .......... .......... .......... .......... .......... 45 GTCATTGAAA TTTTACGTTT TCATATCGTA TTAAAGGTTC ATAATCTTGA TAAAACATTA 5615 .......... .......... .......... .......... .......... .......... 45 ATCTTTGGTA ATACTCCTTG CTGGTTCACG TTGATTGTTC TATTGAGTTA TAAGAAATGA 5555 .......... .......... .......... .......... .......... .......... 45 TTTTAATTGC ATATGGTTGC TCATAATATT CTGCTCGTGC ATAGAGTCAT TTATCATTTC 5495 |||||| |||||||| | .......... .......... .......... .......... ....AGTCAT TTATCATTGC 61 ACCAAGTCCC GGGCCGGGTA ATGTTCGTGC GGAGTTTCTT GCATATGTCA CCGAGTTCCT 5435 ||| |||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||| ||| ACCGAGTCCC GGGCCGGGTA ATGTTCGTGC GGAGTTTCTT GCATATGTCA CCGAGTCCCT 121 CACTAGAGGG CCGGGTATGT ATATTATATA TATGATTGGT GATGAGGATG GTTATGATGA 5375 |||||||||| ||||| |||| |||||||||| |||||||||| |||||||||| |||||||||| CACTAGAGGG CCGGGAATGT ATATTATATA TATGATTGGT GATGAGGATG GTTATGATGA 181 TGATGATGAC GGAGATGACG TGATGATTAT TTTGCCGAGC CCCTTACTAG GGAAGCTGGG 5315 |||||||||| |||||||| | |||||| ||| || | ||| |||| ||||| | || || TGATGATGAC GGAGATGATG TGATGACTAT TTCACTGAGT CCCTCACTAG AG-GGCCGG. 239 CACCTTAAAT GTTAAATATA TGCATGATTT TCACTTAAAA AGTATATGTG TAGCGATATT 5255 .......... .......... .......... .......... .......... .......... 239 TTTTTTCGAG TTGCCACATT GGTATCCTGT CATCTTTACC TTATGCTTTA CATACTCAGT 5195 .......... .......... .......... .......... .......... .......... 239 ACATTGTTCG TACTGACCCC CCTTTCCTCG GGGGGCTGCG TTTCATGCCC GCAGGTGTAG 5135 |||||| .......... .......... .......... .......... .......... ....GTGTAG 245 ACGCGCAGTT CGGTGATCCT CCCGCCTAGG ATATCTACTC TGCTGATTGG GAGAGCTCCA 5075 |||| ||||| ||||||||| |||||||||| |||||||||| ||||| |||| |||||||||| ACGCTCAGTT TGGTGATCCT CCCGCCTAGG ATATCTACTC TGCTGTTTGG GAGAGCTCCA 305 CTGTTCCGGA GCCCATTCGT TTTGGTACAT AACTT-TTGT GTAGTCTTTT GCTCGTCTAT 5016 |||||||||| ||||| |||| |||||||||| ||||| || | |||||||||| ||| |||||| CTGTTCCGGA GCCCAGTCGT TTTGGTACAT AACTTCTTAT GTAGTCTTTT GCTTGTCTAT 365 GGGTATGGCG GGGCCCTGTC CCGTCGAGTT TCACTAATGT ACCCTTAGAG GTCTGTGGGC 4956 |||||| ||| |||||||||| ||||| |||| |||||| | | || ||||||| |||||| | | GGGTAT-GCG GGGCCCTGTC CCGTCAAGTT TCACTACTAT ACTCTTAGAG GTCTGTAGAC 424 ATTATGTGGG TTGTATATAT ATGTTTTGGA TAATGGTCTG GACATG 4910 || |||||| | ||||| | ||||||| || |||||| ||| |||||| ATCGTGTGGG TAGTATAATT ATGTTTTTGA TAATGGGCTG GACATG 470 hqPGS_C06HBa0057J04.1-3-_SGN-E268096+ (6514 6472,5510 5316,5140 4910) ******************************************************************************** EST sequence 4 -strand 573 n (File: SGN-E538150-) 1 CTGCGTATGC TATTATGCTT TGAATAGTGG CAGCCTTGTC GGCTCGCGTA TGTTGTTACG 61 GTTGAATGGT TATGACTCTT TATGAGATAG ATCCACTTTA TATATATATA TATGGTGTTG 121 GGTTTGGCTT GAAAAAAAAA AAAAAAAAAA AACTCTTGAT ACAGTATTGG TTGGAAATTC 181 CCAAAGAGTT GCAGGTGCAG ATTATGCATT AGAAAGTATC CACAACATCA GGGAAGCAAT 241 ACCACAACTT TGGGAAGTAG ACAGGCTGGC TGAAGTTAAC TACTCTGGTG TAGCTGTTGA 301 GACATCTGTC ACAGCTTAGA ATCAGTAGTA CTACTATATC TCATCATCAT GCTGATGGCA 361 GAAGGAAAAA AAAATTAATC AAGAATCATG AGAAGATCCA AAATTTTCTG TCAAATTTGA 421 TTTTAAATGA TGTTGATGTT TTGTTGTCAT CAATTAATAA CTAGCTTTTA GTATTTCCTT 481 TCCATCCACA AATCTTGTAA ATAAATTCTA TATTTATCAG TCTACCTTTC TATGATTATA 541 TAATAATGAA GTTCAATTAT TAAAAAAAAA AAA Predicted gene structure (within gDNA segment 5576 to 1): Exon 1 4867 4739 ( 129 n); cDNA 1 132 ( 132 n); score: 0.826 Intron 1 4738 1308 (3431 n); Pd: 0.000 (s: 0.71), Pa: 0.863 (s: 0.48) Exon 2 1307 1253 ( 55 n); cDNA 133 188 ( 56 n); score: 0.518 PPA cDNA 562 573 MATCH C06HBa0057J04.1-3- SGN-E538150- 0.734 184 0.321 C PGS_C06HBa0057J04.1-3-_SGN-E538150- (4867 4739,1307 1253) Alignment (genomic DNA sequence = upper lines): CTGTGTACAT CATTATGCTT TGAATAGTGG CGGCCTTGTC GGCTCGCGTA TGCTGTTATG 4808 ||| ||| ||||||||| |||||||||| | |||||||| |||||||||| || ||||| | CTGCGTATGC TATTATGCTT TGAATAGTGG CAGCCTTGTC GGCTCGCGTA TGTTGTTACG 60 GTTGAATGGT TATGACTCCT TATGAGACAG GTCCTC-TTA TATAT--ATA TATGACGTTG 4751 |||||||||| |||||||| | ||||||| || ||| | ||| ||||| ||| |||| |||| GTTGAATGGT TATGACTCTT TATGAGATAG ATCCACTTTA TATATATATA TATGGTGTTG 120 GGGTTGGCTT GATTTGATTA AATTCCATAT TGTCTTAGTT TCAGTTGGTC ATACTTAGCA 4691 || ||||||| || GGTTTGGCTT GA........ .......... .......... .......... .......... 132 GGTTTGTATG TGGGTGTCCA AAACGGGCAC TAATCACGGC CTATCGGGTT GGGTCGTGAC 4631 .......... .......... .......... .......... .......... .......... 132 AAAGAGTGGT ATCAGAGCGG TTCTTCCTCA AAAGTGTCTA CAGACCGTGT CTAGTAGAGT 4571 .......... .......... .......... .......... .......... .......... 132 CTTGTTTATC GGTGTGTTGT GCACCACATC TATAAACAGG AAGCTACAGG ACATTTATGA 4511 .......... .......... .......... .......... .......... .......... 132 TGTCATTCTT TCTTCTTATT CTAGATCGTG CGATAGAGCT ATATTAACTG GATAATCCCT 4451 .......... .......... .......... .......... .......... .......... 132 CTCTAACGAA TCCATGTGTT TTCACCTATG CCTCCAAAGA AAGCGACAGC CGCCTAGAAG 4391 .......... .......... .......... .......... .......... .......... 132 GGAAAATCGG TAGCAGAAGG TACTAGTCAG ACCCGAAGAG TTACTAGGGC CCGTGCCTAG 4331 .......... .......... .......... .......... .......... .......... 132 TCTATGCCTG GTATTATGCT CCAGTCGGAG AGCTCTGCTA CACCCCCACC GCCAGAAGAG 4271 .......... .......... .......... .......... .......... .......... 132 CTTAGAGCAG CAGCAGCTCC AGTTCGGGGG ACACCACCAG CCCCCGAGGC CACAACATCT 4211 .......... .......... .......... .......... .......... .......... 132 GAACCTCCAG CTCCTCAGTC AGGGGCGGAG GATAGGGCCA TGAGAGATGC GGTTTAATTG 4151 .......... .......... .......... .......... .......... .......... 132 CTGACTAGAT TAGTGGCAGA TCAGGCTCGC AGGCATGGAC TAGGAGTTGA TCATGCGGAC 4091 .......... .......... .......... .......... .......... .......... 132 AGATCTGATA GCTTAAGGGC TCGTGACTTC TTAAGTTGTA ATCCTCCAGA GTTCTTTGGG 4031 .......... .......... .......... .......... .......... .......... 132 TCAAGGCCAC AGGATGATCC GCAAGAGTTT ATTCGTCAGA TGCAGCGTAC ATTGAGGATA 3971 .......... .......... .......... .......... .......... .......... 132 ATCAAGGCTT CGGAGACCGA GTCTGTTGAG TTGGCTACGT ATCGTTTGCG GGATGTAGCT 3911 .......... .......... .......... .......... .......... .......... 132 ATTAATTGGT ATGAGTCTTG GGAGTTATCT AGGGGTGAGG GTGCCCCTCC AGCGGTATGG 3851 .......... .......... .......... .......... .......... .......... 132 GATGAATTTG TGGAGGCTTT CCAGGGCCAC TTCCTGCCTC CAGAGATGAA GCGAGCTAGA 3791 .......... .......... .......... .......... .......... .......... 132 GTCGATAAAT TCTTGCGTTT GAAGCAAAAT GGCAGGAGCG TTCGAGAGTA TAGCCTCGAG 3731 .......... .......... .......... .......... .......... .......... 132 TTTGATTCAT TTGCTAGGCA TGCGCCTACT ATTGTGGCTG ATATGGCAGA TACGGTACAT 3671 .......... .......... .......... .......... .......... .......... 132 CGTTATGTGA TGGGATTGGA TCGTTATATG ATTGACGGTT GTATGGCAGT GACTCTTCAG 3611 .......... .......... .......... .......... .......... .......... 132 CCAGGTATGG ACATCGCTCG GGTGCAGGCA TTTGCACAGG GGGTAGAGGA TCGGCACCGG 3551 .......... .......... .......... .......... .......... .......... 132 GGACGTCAGC CAGATAGAGA TTATAATAGA GGCCAGCATA AGAGGGCTAG ATCAGCACGT 3491 .......... .......... .......... .......... .......... .......... 132 TATCCTGACG AGTTTCAAAG CAGGCAGTCT CAGCAGCATG TTAGATTTTC TTCCCAGCCA 3431 .......... .......... .......... .......... .......... .......... 132 GCACAGAGTG CACCCCCACG TTTCATGGGT AGGGGGTTTG ATCGTATGGG ATATTCGGAA 3371 .......... .......... .......... .......... .......... .......... 132 CCTGGTCAGA GCTCTAGGGC GTCAAGGTCA CAGATGGGCA GGGGTTTGAG CCAGTCGAGG 3311 .......... .......... .......... .......... .......... .......... 132 CCACCTTTGC CTCGGTGTTC TCGTTGTGGT AAGTCCCATC CTGGGGAATG TCGTTGGGCT 3251 .......... .......... .......... .......... .......... .......... 132 ACAGGTGCGT GTTTTTCTTG CGGCCGTCAG GGCCATACTA TGAGGGAGTG TCACCTTAGA 3191 .......... .......... .......... .......... .......... .......... 132 GGTAGTGCAG GTGGTATGGC ACAGCCTACA GGGTCCGTTG CTGGTTCATC TTCTTCTGTG 3131 .......... .......... .......... .......... .......... .......... 132 GCTATGCGCC CTACGGGGCA GGGTATTCAG GCACCAGCCG GCCGTGGTAG AGGACGTGGT 3071 .......... .......... .......... .......... .......... .......... 132 GGAGCTTCCA GTTCTAGCGG TCCCTCAAAC CGTATATATG CTTTGACTAA TAGGCAAGAT 3011 .......... .......... .......... .......... .......... .......... 132 CAAGAGGCGT CACCTAATGT GATCACAGGT ATATTATCAC TATTCTCCCG AAGTGTGTAT 2951 .......... .......... .......... .......... .......... .......... 132 GCATTGATAG ACCCAGGTTC CACCTTATCA TATATATCTC CCTTTGTTGC TAGTAGGATC 2891 .......... .......... .......... .......... .......... .......... 132 GGAATAGAGT CTGAGTTGAT AGAACCATTT GAGGTAGCTA CACCTGTAGG AGATTTTGTC 2831 .......... .......... .......... .......... .......... .......... 132 ATAGCTACGC GAGTATATAG GAATTGTTCA GTAGCTATAT ATAGTCGTCA TACCGTAGAT 2771 .......... .......... .......... .......... .......... .......... 132 GATCTAATAG AGTTAAATAT GATTGAGTTT GATATTATCA TGGGCATGGA TTGGTTGGCT 2711 .......... .......... .......... .......... .......... .......... 132 GCTTGTTATG CTAATATTGA TTGCAGAGGA AAGATAGTTC GATTTCAATT TCCAGGGGAA 2651 .......... .......... .......... .......... .......... .......... 132 CCGATTATAG AGTGGAAGGG AAGTACAGTA TCGCCGAAAG GTAAGTTCAT TTAATACCTC 2591 .......... .......... .......... .......... .......... .......... 132 AAGGCCGGTA AGATGGTTAG AAAAGGCTAT ATTTACCATC TGATTCGAGT GCATGACATA 2531 .......... .......... .......... .......... .......... .......... 132 AAGGCAGAGG CACCGACTCT TCAATCAGTC CCGGTAGTTA ATGAATTTCC TGATGTATTC 2471 .......... .......... .......... .......... .......... .......... 132 CCCGAGGAAC TTCCAGGCCT TCCTCCAGAA CGGGAGATAG AGTTTACTAT AGATGTACTG 2411 .......... .......... .......... .......... .......... .......... 132 CCAGATACCC AGCCTATATC TATACCTCCT TATAGAATGG CACCTGCTGA GTTGAAAGAA 2351 .......... .......... .......... .......... .......... .......... 132 TTGAAAGAGC AATTGAGGGA TTTGCTAGAA AAGGGCTTCA TCAGGCCTAG TACGTCACCT 2291 .......... .......... .......... .......... .......... .......... 132 TGGGGATCAC CAGTACTGTT TGTGAGGAAG AAGGATGGGT CGCTGCGGAT GTGCATTGAT 2231 .......... .......... .......... .......... .......... .......... 132 TATAGGCAGT TGAACAAAGT AACAATAAAG AACAGGTATC CCCTCCCAAG GATTGACGAT 2171 .......... .......... .......... .......... .......... .......... 132 CTACTTGACC GGTTGCAGGG TGCAAAGTGT TTTTCAAAGA TAGACTTGCG GTCAGGTTAT 2111 .......... .......... .......... .......... .......... .......... 132 CATTAGGTGC GGGTAAGGGA GGCAGATATT CCAAAGACAG CATTCCGGAC CCGATATGGG 2051 .......... .......... .......... .......... .......... .......... 132 CATTATGAGT TTAGAGTGCT GTCTTTTGGG CTGACTAATG CTCCAGCGGT ATTCATGGAT 1991 .......... .......... .......... .......... .......... .......... 132 TTAATGAATC GAGTATTTAA ACCATTCCTT GATATGTTTG TTATTGTATT TATAGACGAT 1931 .......... .......... .......... .......... .......... .......... 132 ATTCTAGTCT ATTCACGTTC AGAAGAGGAG CATGCAGATC ATTTAAGGAC GGTACTTAGG 1871 .......... .......... .......... .......... .......... .......... 132 GTGCTTCAGC ACCAGAAGTT GTATGCTAAA TTTTCTAAGT GCGAGTTCTG GTTGACTTCA 1811 .......... .......... .......... .......... .......... .......... 132 GTGGCATTCT TGGGGCATAT TATTGGAGCT GATGGGATTC GGGTAGAGAC GCAGAAGATT 1751 .......... .......... .......... .......... .......... .......... 132 GAGGCAGTAA AGACTTGGCC CAGACCTACG ACACCTACTG AGTGCGCGCA GCTTTTTGGG 1691 .......... .......... .......... .......... .......... .......... 132 GTTAGCAGGA TATTACAGGA GATTCGTAGA AAAGTTTGCC TCAATCTCAG TGCATTTGAC 1631 .......... .......... .......... .......... .......... .......... 132 AAGGCTAACT CAAAAGGCAG CCAAGTTCCA GTGGACAGAT GCTTATGAGC GAAGCTTCCA 1571 .......... .......... .......... .......... .......... .......... 132 GCTATTAAAA GACAAATTGA CTACAGCTCC AGTCCTAACT CTTCCAGAGG GACCAGACGG 1511 .......... .......... .......... .......... .......... .......... 132 CTATGTTATT TATTGTGATG CTTCGGGTGT TGGGCTAGGA TGTGTATTGA TGCAGCATGG 1451 .......... .......... .......... .......... .......... .......... 132 CAAAGTTATA GCCTATGCCT CCCGACAACT TAGGAAGCAT GAAAAGAACT ATCCTACTCA 1391 .......... .......... .......... .......... .......... .......... 132 CGATCTGGAG TTAGCGGTCG TGGTTCATGC CTTGAAGATA TGGAGACATT ATTTATATGG 1331 .......... .......... .......... .......... .......... .......... 132 TGTCCATGTG GACATCTATA CAGATCATAA GAGTCTCCAA TATATCTTTA AACAGAAGGA 1271 | | || | || | |||| | |||| | .......... .......... ...AAAAAAA AAAAAAAAAA AAACTCTTGA TACAGTATTG 169 GCT-GAACTT ACGACAGAG 1253 | | ||| || | | |||| GTTGGAAATT CCCAAAGAG 188 hqPGS_C06HBa0057J04.1-3-_SGN-E538150- (4867 4739) ******************************************************************************** EST sequence 15 +strand 453 n (File: SGN-E303256+) 1 AACCAGCCAT TAAACTCTTG GACAGCAGCT GCAAAGAAAT TGGTGTAGAC GCTCAGTTCG 61 GTGATCCTCC CGCCTAGGAT ATCTACTCTG CTTTTTGGGA GAGCTCCACT GTTCCGGAGC 121 CCAGTCGTTT TGGTACATAA CTTCTTATGT AGTCTTTTGC TTGTCTATGG GTATGGCGGG 181 GCCCTGTCCC GTCAAGTTTC ACTACTATAC TCTTAGAGGT CTGTAGACAT CGTGTGGGTT 241 GTATAATTAT GTTTTGGATA ATGGTCTGGA CATGGTTTGT TTGGGATGTC CATTTGTACA 301 AGTGCAGCCT TGTCGGTTGT GAACATCATT GTGTATTGTG TAGTGGCAGC CTCGTCGGCT 361 GCGTATGCTA TTATGTTTTG GATAGTGGCG GCCTTGTCGG CTCGCATATG TTGTTACGAT 421 TTAATGGTTA TGACTCTTTA TGAAAAAAAA AAA Predicted gene structure (within gDNA segment 6259 to 2663): Exon 1 5140 4808 ( 333 n); cDNA 43 375 ( 333 n); score: 0.899 PPA cDNA 443 453 MATCH C06HBa0057J04.1-3- SGN-E303256+ 0.899 333 0.735 C PGS_C06HBa0057J04.1-3-_SGN-E303256+ (5140 4808) Alignment (genomic DNA sequence = upper lines): GTGTAGACGC GCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT GATTGGGAGA 5081 |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| |||||||| GTGTAGACGC TCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT TTTTGGGAGA 102 GCTCCACTGT TCCGGAGCCC ATTCGTTTTG GTACATAACT T-TTGTGTAG TCTTTTGCTC 5022 |||||||||| |||||||||| | |||||||| |||||||||| | || ||||| ||||||||| GCTCCACTGT TCCGGAGCCC AGTCGTTTTG GTACATAACT TCTTATGTAG TCTTTTGCTT 162 GTCTATGGGT ATGGCGGGGC CCTGTCCCGT CGAGTTTCAC TAATGTACCC TTAGAGGTCT 4962 |||||||||| |||||||||| |||||||||| | |||||||| || | ||| | |||||||||| GTCTATGGGT ATGGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC TTAGAGGTCT 222 GTGGGCATTA TGTGGGTTGT ATATATATGT TTTGGATAAT GGTCTGGACA TGGTTTGTTT 4902 || | ||| |||||||||| ||| ||||| |||||||||| |||||||||| |||||||||| GTAGACATCG TGTGGGTTGT ATAATTATGT TTTGGATAAT GGTCTGGACA TGGTTTGTTT 282 GGGATGTCCG CTTGTACAGG GGCAGCCTTG TCGGCTGTGT ACATCATTAT GCTTTGAATA 4842 ||||||||| ||||||| | ||||||||| |||| |||| |||||||| | | ||| || GGGATGTCCA TTTGTACAAG TGCAGCCTTG TCGGTTGTGA ACATCATTGT GTATTGTGTA 342 GTGGCGGCCT TGTCGGCTCG CGTATGCTGT TATG 4808 ||||| |||| ||||||| | |||||||| | |||| GTGGCAGCCT CGTCGGCT-G CGTATGCTAT TATG 375 hqPGS_C06HBa0057J04.1-3-_SGN-E303256+ (5140 4808) ******************************************************************************** EST sequence 14 +strand 691 n (File: SGN-E328093+) 1 GAAAACCAGC CATTAAACTC TTGGACAGCA GCTGCAAAGA AATTGGTTAG TAATCTCTTT 61 GCTTGGTTTG TTAATTCCTT AGAATACCTT TGTTAATTAG ACATTTATGT TAAGAAGGGG 121 GACGTGAACA GTATCTTAGG AATTTGTTTT AGTTATTGAA TGTGCTAAGG ATGAGCAGAA 181 ACCATGATCG GATTGCTAGC GGTGTTATAT TTGTGTTGGG CTGTTTTGAT TAAAGTAAGC 241 TGCTGGAAAT TCTGTTTTGG TGTTATGCAT ATGTTAATAT GATTATGGGT ATATACTCCA 301 AAGGATGAAT ACAATAAGGT AGATGTGTTG CGAATTATAA AACGAATTAT CGGTCGGTGT 361 GTCGTTGTTT TGTTACTATG GTTGCTAAAA ACGGAACTGT TTTGGGGGAG GCTGTTTAAT 421 ATGATTTGTT GGATTATATG TGTTGTTGGT ATTGTTGTGG ATAATTTGGG TTGTTGTTGG 481 ATTGGGATGA AGTAAAGAAA ATAGGGGAAG TGCTGCCGGA TTTTCGTTAG ATTATTAGCT 541 AGCTTACATA AGTAGTAAGC GCGACATTTA TCTAATTGCG GCACGATTGG TGCTTGTTAT 601 AGATTTATAC CTTGAGCAGT AAATATTGGA CGTACGGCTC GACTATTCGG TATGTAACGC 661 TATCCTTTCC TTCTTTGTTT GGCATGACCT T Predicted gene structure (within gDNA segment 6814 to 4294): Exon 1 6514 5822 ( 693 n); cDNA 3 691 ( 689 n); score: 0.861 MATCH C06HBa0057J04.1-3- SGN-E328093+ 0.861 693 1.003 C PGS_C06HBa0057J04.1-3-_SGN-E328093+ (6514 5822) Alignment (genomic DNA sequence = upper lines): AAACCAACCC TGCAACTCTT GGCCAGCAGC TGCAAATAAT TTGGTTAGTA ATCTCCTTGT 6455 |||||| || | ||||||| || ||||||| |||||| || |||||||||| ||||| ||| AAACCAGCCA TTAAACTCTT GGACAGCAGC TGCAAAGAAA TTGGTTAGTA ATCTCTTTGC 62 TTGGTGTGTT AATTCTTTAG AATACCCTTG TTAATTATCC ATTAATTTTA AGAAGGGGG- 6396 ||||| |||| ||||| |||| |||||| ||| ||||||| | ||| || ||| ||||||||| TTGGTTTGTT AATTCCTTAG AATACCTTTG TTAATTAGAC ATTTATGTTA AGAAGGGGGA 122 CGTGACCAGT AGCTTAGGAA GTTTGTTTTA GTTATTGAAT GTACTAAGTA TGAATGGAAA 6336 ||||| |||| | |||||||| ||||||||| |||||||||| || ||||| | ||| |||| CGTGAACAGT ATCTTAGGAA -TTTGTTTTA GTTATTGAAT GTGCTAAGGA TGAGCAGAAA 181 CCATAATCGG ATTATTAGTG GTGTCGTGTT GGTGCTTGGG CTGTTTTGAT TAAAGCAAAC 6276 |||| ||||| ||| ||| | |||| | || ||| ||||| |||||||||| ||||| || | CCATGATCGG ATTGCTAGCG GTGTTATATT TGTG-TTGGG CTGTTTTGAT TAAAGTAAGC 240 TGCAGGAAAA TTATGTTTTG GCATTATGTA TATGTTGAAT GTGATTATGA GTATATACTC 6216 ||| || ||| || ||||||| | ||||| | |||||| ||| |||||||| |||||||||| TGCTGG-AAA TTCTGTTTTG GTGTTATGCA TATGTT-AAT ATGATTATGG GTATATACTC 298 CAAAGGATGA ATACGATAAG GTAGATGTGT TACGAATTAT AAAACGAGTT ATCACTCGGT 6156 |||||||||| |||| ||||| |||||||||| | |||||||| ||||||| || ||| ||||| CAAAGGATGA ATACAATAAG GTAGATGTGT TGCGAATTAT AAAACGAATT ATCGGTCGGT 358 GTGTCGTTGC TTCGCTGATA TAGTTGC-CG AGATGGAACT GTTTTGGGGA GGGGGCTGTT 6097 ||||||||| || | | || | ||||| | | |||||| ||||| ||| || ||||||| GTGTCGTTGT TTTGTTACTA TGGTTGCTAA AAACGGAACT GTTTT-GGG- GGAGGCTGTT 416 TAATATGATT CTTTGGGTTA TATGTGTTAT TGGTATTGTT GTGGATAATT TGGATTGTTG 6037 |||||||||| |||| ||| |||||||| | |||||||||| |||||||||| ||| |||||| TAATATGATT TGTTGGATTA TATGTGTTGT TGGTATTGTT GTGGATAATT TGGGTTGTTG 476 TCGGATTGGG ACGAAGTAAG GAAAATAGGG GAGGTGCTGC CGAATTTTCG TTAGATTATT 5977 | |||||||| | ||||||| |||||||||| || ||||||| || ||||||| |||||||||| TTGGATTGGG ATGAAGTAAA GAAAATAGGG GAAGTGCTGC CGGATTTTCG TTAGATTATT 536 AGCTAGCTTA CA-AGAAAGT AAAGCACGAT GTTTATCTAA TTGCGGCACG ATTGTTGCTT 5918 |||||||||| || | ||| |||| ||| ||||||||| |||||||||| |||| ||||| AGCTAGCTTA CATAAGTAGT -AAGCGCGAC ATTTATCTAA TTGCGGCACG ATTGGTGCTT 595 GTTATAGATT AATAGCTTGA GCAGTAAATA TTGGACGTGC GGCTCGATTA TACGGTATGT 5858 |||||||||| ||| ||||| |||||||||| |||||||| | ||||||| || | |||||||| GTTATAGATT TATACCTTGA GCAGTAAATA TTGGACGTAC GGCTCGACTA TTCGGTATGT 655 AACGCTGTCC CTTCTTTCTT TGTTTGGCAT GACTTT 5822 |||||| ||| ||| ||||| |||||||||| ||| || AACGCTATCC TTTCCTTCTT TGTTTGGCAT GACCTT 691 hqPGS_C06HBa0057J04.1-3-_SGN-E328093+ (6514 5822) ******************************************************************************** EST sequence 21 +strand 455 n (File: SGN-E298250+) 1 AAATGGAGAA ACCAACCCTG CAACTCTTGG CCAGTAGCTG CAAATAATTT GGGGAGTAAT 61 CTCCTTGTTT GGTGTGTTAA TTCTTTAGAA CACCCTTGTT AATTATCCAT TAATTTTAAG 121 AAGGGGGCGT GACCAGTTGC TTACGAAGTT TGTTTTAGTT ATTGAATGTG CTAAGTATGA 181 ATGGAAACCA TAATCGGATT ATTAGTGGTG TCGTGTTGGT GCTTGGGCTG TTTTGATTAA 241 AGCAAACTGC AGGAAAATTC TATTTTGGCA TTATGTATAT GCTGAATGTG ATTATGAGTA 301 TATACTCCAA CGGATGAATA CGATTAGGTA AATGTGTTGC GAATTATAAA ACGAGTTATC 361 GCTCGGTGTG TCGGTGCTTC GCTGCTATAG TTGCCCAGAC GGAACTGTTT TGGGGAGGGG 421 GCTGCCTAAT GTGATACTTC GGGTTATATG TGTTA Predicted gene structure (within gDNA segment 6814 to 4153): Exon 1 6522 6068 ( 455 n); cDNA 1 455 ( 455 n); score: 0.947 MATCH C06HBa0057J04.1-3- SGN-E298250+ 0.947 455 1.000 C PGS_C06HBa0057J04.1-3-_SGN-E298250+ (6522 6068) Alignment (genomic DNA sequence = upper lines): AAATGGAGAA ACCAACCCTG CAACTCTTGG CCAGCAGCTG CAAATAATTT GGTTAGTAAT 6463 |||||||||| |||||||||| |||||||||| |||| ||||| |||||||||| || |||||| AAATGGAGAA ACCAACCCTG CAACTCTTGG CCAGTAGCTG CAAATAATTT GGGGAGTAAT 60 CTCCTTGTTT GGTGTGTTAA TTCTTTAGAA TACCCTTGTT AATTATCCAT TAATTTTAAG 6403 |||||||||| |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| CTCCTTGTTT GGTGTGTTAA TTCTTTAGAA CACCCTTGTT AATTATCCAT TAATTTTAAG 120 AAGGGGGCGT GACCAGTAGC TTAGGAAGTT TGTTTTAGTT ATTGAATGTA CTAAGTATGA 6343 |||||||||| ||||||| || ||| |||||| |||||||||| ||||||||| |||||||||| AAGGGGGCGT GACCAGTTGC TTACGAAGTT TGTTTTAGTT ATTGAATGTG CTAAGTATGA 180 ATGGAAACCA TAATCGGATT ATTAGTGGTG TCGTGTTGGT GCTTGGGCTG TTTTGATTAA 6283 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATGGAAACCA TAATCGGATT ATTAGTGGTG TCGTGTTGGT GCTTGGGCTG TTTTGATTAA 240 AGCAAACTGC AGGAAAATTA TGTTTTGGCA TTATGTATAT GTTGAATGTG ATTATGAGTA 6223 |||||||||| ||||||||| | |||||||| |||||||||| | |||||||| |||||||||| AGCAAACTGC AGGAAAATTC TATTTTGGCA TTATGTATAT GCTGAATGTG ATTATGAGTA 300 TATACTCCAA AGGATGAATA CGATAAGGTA GATGTGTTAC GAATTATAAA ACGAGTTATC 6163 |||||||||| ||||||||| |||| ||||| ||||||| | |||||||||| |||||||||| TATACTCCAA CGGATGAATA CGATTAGGTA AATGTGTTGC GAATTATAAA ACGAGTTATC 360 ACTCGGTGTG TCGTTGCTTC GCTGATATAG TTGCCGAGAT GGAACTGTTT TGGGGAGGGG 6103 ||||||||| ||| |||||| |||| ||||| ||||| ||| |||||||||| |||||||||| GCTCGGTGTG TCGGTGCTTC GCTGCTATAG TTGCCCAGAC GGAACTGTTT TGGGGAGGGG 420 GCTGTTTAAT ATGATTCTTT GGGTTATATG TGTTA 6068 |||| |||| |||| ||| |||||||||| ||||| GCTGCCTAAT GTGATACTTC GGGTTATATG TGTTA 455 hqPGS_C06HBa0057J04.1-3-_SGN-E298250+ (6522 6068) Total number of EST alignments reported: 21 ________________________________________________________________________________ Predicted gene locations (2) in segment 1 to 6814: PGL 1 (- strand): 3673 1885 AGS-1 (2614 1885) SCR (e 0.953) Exon 1 2614 1885 ( 730 n); score: 0.953 PGS (2614 1885) SGN-E379982- PGS (2405 1885) SGN-E201553- 3-phase translation of AGS-1 (-strand): . . . . . . 2614 AAAGGTAAGTTCATTTAATACCTCAAGGCCGGTAAGATGGTTAGAAAAGGCTATATTTAC K G K F I - Y L K A G K M V R K G Y I Y K V S S F N T S R P V R W L E K A I F T R - V H L I P Q G R - D G - K R L Y L . . . . . . 2554 CATCTGATTCGAGTGCATGACATAAAGGCAGAGGCACCGACTCTTCAATCAGTCCCGGTA H L I R V H D I K A E A P T L Q S V P V I - F E C M T - R Q R H R L F N Q S R - P S D S S A - H K G R G T D S S I S P G . . . . . . 2494 GTTAATGAATTTCCTGATGTATTCCCCGAGGAACTTCCAGGCCTTCCTCCAGAACGGGAG V N E F P D V F P E E L P G L P P E R E L M N F L M Y S P R N F Q A F L Q N G R S - - I S - C I P R G T S R P S S R T G . . . . . . 2434 ATAGAGTTTACTATAGATGTACTGCCAGATACCCAGCCTATATCTATACCTCCTTATAGA I E F T I D V L P D T Q P I S I P P Y R - S L L - M Y C Q I P S L Y L Y L L I E D R V Y Y R C T A R Y P A Y I Y T S L - . . . . . . 2374 ATGGCACCTGCTGAGTTGAAAGAATTGAAAGAGCAATTGAGGGATTTGCTAGAAAAGGGC M A P A E L K E L K E Q L R D L L E K G W H L L S - K N - K S N - G I C - K R A N G T C - V E R I E R A I E G F A R K G . . . . . . 2314 TTCATCAGGCCTAGTACGTCACCTTGGGGATCACCAGTACTGTTTGTGAGGAAGAAGGAT F I R P S T S P W G S P V L F V R K K D S S G L V R H L G D H Q Y C L - G R R M L H Q A - Y V T L G I T S T V C E E E G . . . . . . 2254 GGGTCGCTGCGGATGTGCATTGATTATAGGCAGTTGAACAAAGTAACAATAAAGAACAGG G S L R M C I D Y R Q L N K V T I K N R G R C G C A L I I G S - T K - Q - R T G W V A A D V H - L - A V E Q S N N K E Q . . . . . . 2194 TATCCCCTCCCAAGGATTGACGATCTACTTGACCGGTTGCAGGGTGCAAAGTGTTTTTCA Y P L P R I D D L L D R L Q G A K C F S I P S Q G L T I Y L T G C R V Q S V F Q V S P P K D - R S T - P V A G C K V F F . . . . . . 2134 AAGATAGACTTGCGGTCAGGTTATCATTAGGTGCGGGTAAGGGAGGCAGATATTCCAAAG K I D L R S G Y H - V R V R E A D I P K R - T C G Q V I I R C G - G R Q I F Q R K D R L A V R L S L G A G K G G R Y S K . . . . . . 2074 ACAGCATTCCGGACCCGATATGGGCATTATGAGTTTAGAGTGCTGTCTTTTGGGCTGACT T A F R T R Y G H Y E F R V L S F G L T Q H S G P D M G I M S L E C C L L G - L D S I P D P I W A L - V - S A V F W A D . . . . . . 2014 AATGCTCCAGCGGTATTCATGGATTTAATGAATCGAGTATTTAAACCATTCCTTGATATG N A P A V F M D L M N R V F K P F L D M M L Q R Y S W I - - I E Y L N H S L I C - C S S G I H G F N E S S I - T I P - Y . . . . . . 1954 TTTGTTATTGTATTTATAGACGATATTCTAGTCTATTCACGTTCAGAAGAGGAGCATGCA F V I V F I D D I L V Y S R S E E E H A L L L Y L - T I F - S I H V Q K R S M Q V C Y C I Y R R Y S S L F T F R R G A C . 1894 GATCATTTAA D H L I I - R S F Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-3-_PGL-1_AGS-1_PPS_1 (2596 2105) (frame '1'; 489 bp, 163 residues) 1 YLKAGKMVRK GYIYHLIRVH DIKAEAPTLQ SVPVVNEFPD VFPEELPGLP PEREIEFTID 61 VLPDTQPISI PPYRMAPAEL KELKEQLRDL LEKGFIRPST SPWGSPVLFV RKKDGSLRMC 121 IDYRQLNKVT IKNRYPLPRI DDLLDRLQGA KCFSKIDLRS GYH- >C06HBa0057J04.1-3-_PGL-1_AGS-1_PPS_2 (2104 1886) (frame '1'; 219 bp, 73 residues) 1 VRVREADIPK TAFRTRYGHY EFRVLSFGLT NAPAVFMDLM NRVFKPFLDM FVIVFIDDIL 61 VYSRSEEEHA DHL 3-phase translation of AGS-1 (+strand): . . . . . . 1885 TTAAATGATCTGCATGCTCCTCTTCTGAACGTGAATAGACTAGAATATCGTCTATAAATA L N D L H A P L L N V N R L E Y R L - I - M I C M L L F - T - I D - N I V Y K Y K - S A C S S S E R E - T R I S S I N . . . . . . 1945 CAATAACAAACATATCAAGGAATGGTTTAAATACTCGATTCATTAAATCCATGAATACCG Q - Q T Y Q G M V - I L D S L N P - I P N N K H I K E W F K Y S I H - I H E Y R T I T N I S R N G L N T R F I K S M N T . . . . . . 2005 CTGGAGCATTAGTCAGCCCAAAAGACAGCACTCTAAACTCATAATGCCCATATCGGGTCC L E H - S A Q K T A L - T H N A H I G S W S I S Q P K R Q H S K L I M P I S G P A G A L V S P K D S T L N S - C P Y R V . . . . . . 2065 GGAATGCTGTCTTTGGAATATCTGCCTCCCTTACCCGCACCTAATGATAACCTGACCGCA G M L S L E Y L P P L P A P N D N L T A E C C L W N I C L P Y P H L M I T - P Q R N A V F G I S A S L T R T - - - P D R . . . . . . 2125 AGTCTATCTTTGAAAAACACTTTGCACCCTGCAACCGGTCAAGTAGATCGTCAATCCTTG S L S L K N T L H P A T G Q V D R Q S L V Y L - K T L C T L Q P V K - I V N P W K S I F E K H F A P C N R S S R S S I L . . . . . . 2185 GGAGGGGATACCTGTTCTTTATTGTTACTTTGTTCAACTGCCTATAATCAATGCACATCC G G D T C S L L L L C S T A Y N Q C T S E G I P V L Y C Y F V Q L P I I N A H P G R G Y L F F I V T L F N C L - S M H I . . . . . . 2245 GCAGCGACCCATCCTTCTTCCTCACAAACAGTACTGGTGATCCCCAAGGTGACGTACTAG A A T H P S S S Q T V L V I P K V T Y - Q R P I L L P H K Q Y W - S P R - R T R R S D P S F F L T N S T G D P Q G D V L . . . . . . 2305 GCCTGATGAAGCCCTTTTCTAGCAAATCCCTCAATTGCTCTTTCAATTCTTTCAACTCAG A - - S P F L A N P S I A L S I L S T Q P D E A L F - Q I P Q L L F Q F F Q L S G L M K P F S S K S L N C S F N S F N S . . . . . . 2365 CAGGTGCCATTCTATAAGGAGGTATAGATATAGGCTGGGTATCTGGCAGTACATCTATAG Q V P F Y K E V - I - A G Y L A V H L - R C H S I R R Y R Y R L G I W Q Y I Y S A G A I L - G G I D I G W V S G S T S I . . . . . . 2425 TAAACTCTATCTCCCGTTCTGGAGGAAGGCCTGGAAGTTCCTCGGGGAATACATCAGGAA - T L S P V L E E G L E V P R G I H Q E K L Y L P F W R K A W K F L G E Y I R K V N S I S R S G G R P G S S S G N T S G . . . . . . 2485 ATTCATTAACTACCGGGACTGATTGAAGAGTCGGTGCCTCTGCCTTTATGTCATGCACTC I H - L P G L I E E S V P L P L C H A L F I N Y R D - L K S R C L C L Y V M H S N S L T T G T D - R V G A S A F M S C T . . . . . . 2545 GAATCAGATGGTAAATATAGCCTTTTCTAACCATCTTACCGGCCTTGAGGTATTAAATGA E S D G K Y S L F - P S Y R P - G I K - N Q M V N I A F S N H L T G L E V L N E R I R W - I - P F L T I L P A L R Y - M . 2605 ACTTACCTTT T Y L L T F N L P Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-3+_PGL-1_AGS-1_PPS_1 (2041 2304) (frame '1'; 261 bp, 87 residues) 1 THNAHIGSGM LSLEYLPPLP APNDNLTASL SLKNTLHPAT GQVDRQSLGG DTCSLLLLCS 61 TAYNQCTSAA THPSSSQTVL VIPKVTY- AGS-2 (3456 2859) SCR (e 0.965) Exon 1 3456 2859 ( 598 n); score: 0.965 PGS (3456 2859) SGN-E350824- 3-phase translation of AGS-2 (-strand): . . . . . . 3456 AGCATGTTAGATTTTCTTCCCAGCCAGCACAGAGTGCACCCCCACGTTTCATGGGTAGGG S M L D F L P S Q H R V H P H V S W V G A C - I F F P A S T E C T P T F H G - G H V R F S S Q P A Q S A P P R F M G R . . . . . . 3396 GGTTTGATCGTATGGGATATTCGGAACCTGGTCAGAGCTCTAGGGCGTCAAGGTCACAGA G L I V W D I R N L V R A L G R Q G H R V - S Y G I F G T W S E L - G V K V T D G F D R M G Y S E P G Q S S R A S R S Q . . . . . . 3336 TGGGCAGGGGTTTGAGCCAGTCGAGGCCACCTTTGCCTCGGTGTTCTCGTTGTGGTAAGT W A G V - A S R G H L C L G V L V V V S G Q G F E P V E A T F A S V F S L W - V M G R G L S Q S R P P L P R C S R C G K . . . . . . 3276 CCCATCCTGGGGAATGTCGTTGGGCTACAGGTGCGTGTTTTTCTTGCGGCCGTCAGGGCC P I L G N V V G L Q V R V F L A A V R A P S W G M S L G Y R C V F F L R P S G P S H P G E C R W A T G A C F S C G R Q G . . . . . . 3216 ATACTATGAGGGAGTGTCACCTTAGAGGTAGTGCAGGTGGTATGGCACAGCCTACAGGGT I L - G S V T L E V V Q V V W H S L Q G Y Y E G V S P - R - C R W Y G T A Y R V H T M R E C H L R G S A G G M A Q P T G . . . . . . 3156 CCGTTGCTGGTTCATCTTCTTCTGTGGCTATGCGCCCTACGGGGCAGGGTATTCAGGCAC P L L V H L L L W L C A L R G R V F R H R C W F I F F C G Y A P Y G A G Y S G T S V A G S S S S V A M R P T G Q G I Q A . . . . . . 3096 CAGCCGGCCGTGGTAGAGGACGTGGTGGAGCTTCCAGTTCTAGCGGTCCCTCAAACCGTA Q P A V V E D V V E L P V L A V P Q T V S R P W - R T W W S F Q F - R S L K P Y P A G R G R G R G G A S S S S G P S N R . . . . . . 3036 TATATGCTTTGACTAATAGGCAAGATCAAGAGGCGTCACCTAATGTGATCACAGGTATAT Y M L - L I G K I K R R H L M - S Q V Y I C F D - - A R S R G V T - C D H R Y I I Y A L T N R Q D Q E A S P N V I T G I . . . . . . 2976 TATCACTATTCTCCCGAAGTGTGTATGCATTGATAGACCCAGGTTCCACCTTATCATATA Y H Y S P E V C M H - - T Q V P P Y H I I T I L P K C V C I D R P R F H L I I Y L S L F S R S V Y A L I D P G S T L S Y . . . . . . 2916 TATCTCCCTTTGTTGCTAGTAGGATCGGAATAGAGTCTGAGTTGATAGAACCATTTGA Y L P L L L V G S E - S L S - - N H L I S L C C - - D R N R V - V D R T I - I S P F V A S R I G I E S E L I E P F Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-3-_PGL-1_AGS-2_PPS_1 (3454 2861) (frame '0'; 594 bp, 198 residues) 1 HVRFSSQPAQ SAPPRFMGRG FDRMGYSEPG QSSRASRSQM GRGLSQSRPP LPRCSRCGKS 61 HPGECRWATG ACFSCGRQGH TMRECHLRGS AGGMAQPTGS VAGSSSSVAM RPTGQGIQAP 121 AGRGRGRGGA SSSSGPSNRI YALTNRQDQE ASPNVITGIL SLFSRSVYAL IDPGSTLSYI 181 SPFVASRIGI ESELIEPF 3-phase translation of AGS-2 (+strand): . . . . . . 2859 TCAAATGGTTCTATCAACTCAGACTCTATTCCGATCCTACTAGCAACAAAGGGAGATATA S N G S I N S D S I P I L L A T K G D I Q M V L S T Q T L F R S Y - Q Q R E I Y K W F Y Q L R L Y S D P T S N K G R Y . . . . . . 2919 TATGATAAGGTGGAACCTGGGTCTATCAATGCATACACACTTCGGGAGAATAGTGATAAT Y D K V E P G S I N A Y T L R E N S D N M I R W N L G L S M H T H F G R I V I I I - - G G T W V Y Q C I H T S G E - - - . . . . . . 2979 ATACCTGTGATCACATTAGGTGACGCCTCTTGATCTTGCCTATTAGTCAAAGCATATATA I P V I T L G D A S - S C L L V K A Y I Y L - S H - V T P L D L A Y - S K H I Y Y T C D H I R - R L L I L P I S Q S I Y . . . . . . 3039 CGGTTTGAGGGACCGCTAGAACTGGAAGCTCCACCACGTCCTCTACCACGGCCGGCTGGT R F E G P L E L E A P P R P L P R P A G G L R D R - N W K L H H V L Y H G R L V T V - G T A R T G S S T T S S T T A G W . . . . . . 3099 GCCTGAATACCCTGCCCCGTAGGGCGCATAGCCACAGAAGAAGATGAACCAGCAACGGAC A - I P C P V G R I A T E E D E P A T D P E Y P A P - G A - P Q K K M N Q Q R T C L N T L P R R A H S H R R R - T S N G . . . . . . 3159 CCTGTAGGCTGTGCCATACCACCTGCACTACCTCTAAGGTGACACTCCCTCATAGTATGG P V G C A I P P A L P L R - H S L I V W L - A V P Y H L H Y L - G D T P S - Y G P C R L C H T T C T T S K V T L P H S M . . . . . . 3219 CCCTGACGGCCGCAAGAAAAACACGCACCTGTAGCCCAACGACATTCCCCAGGATGGGAC P - R P Q E K H A P V A Q R H S P G W D P D G R K K N T H L - P N D I P Q D G T A L T A A R K T R T C S P T T F P R M G . . . . . . 3279 TTACCACAACGAGAACACCGAGGCAAAGGTGGCCTCGACTGGCTCAAACCCCTGCCCATC L P Q R E H R G K G G L D W L K P L P I Y H N E N T E A K V A S T G S N P C P S L T T T R T P R Q R W P R L A Q T P A H . . . . . . 3339 TGTGACCTTGACGCCCTAGAGCTCTGACCAGGTTCCGAATATCCCATACGATCAAACCCC C D L D A L E L - P G S E Y P I R S N P V T L T P - S S D Q V P N I P Y D Q T P L - P - R P R A L T R F R I S H T I K P . . . . . . 3399 CTACCCATGAAACGTGGGGGTGCACTCTGTGCTGGCTGGGAAGAAAATCTAACATGCT L P M K R G G A L C A G W E E N L T C Y P - N V G V H S V L A G K K I - H A P T H E T W G C T L C W L G R K S N M Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-3+_PGL-1_AGS-2_PPS_1 (3146 3343) (frame '0'; 195 bp, 65 residues) 1 TSNGPCRLCH TTCTTSKVTL PHSMALTAAR KTRTCSPTTF PRMGLTTTRT PRQRWPRLAQ 61 TPAHL- AGS-3 (3673 3478) SCR (e 0.918) Exon 1 3673 3478 ( 196 n); score: 0.918 PGS (3673 3478) SGN-E379248+ 3-phase translation of AGS-3 (-strand): . . . . . . 3673 CATCGTTATGTGATGGGATTGGATCGTTATATGATTGACGGTTGTATGGCAGTGACTCTT H R Y V M G L D R Y M I D G C M A V T L I V M - W D W I V I - L T V V W Q - L F S L C D G I G S L Y D - R L Y G S D S . . . . . . 3613 CAGCCAGGTATGGACATCGCTCGGGTGCAGGCATTTGCACAGGGGGTAGAGGATCGGCAC Q P G M D I A R V Q A F A Q G V E D R H S Q V W T S L G C R H L H R G - R I G T S A R Y G H R S G A G I C T G G R G S A . . . . . . 3553 CGGGGACGTCAGCCAGATAGAGATTATAATAGAGGCCAGCATAAGAGGGCTAGATCAGCA R G R Q P D R D Y N R G Q H K R A R S A G D V S Q I E I I I E A S I R G L D Q H P G T S A R - R L - - R P A - E G - I S . . 3493 CGTTATCCTGACGAGT R Y P D E V I L T S T L S - R Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-3-_PGL-1_AGS-3_PPS_1 (3673 3479) (frame '1'; 195 bp, 65 residues) 1 HRYVMGLDRY MIDGCMAVTL QPGMDIARVQ AFAQGVEDRH RGRQPDRDYN RGQHKRARSA 61 RYPDE 3-phase translation of AGS-3 (+strand): . . . . . . 3478 ACTCGTCAGGATAACGTGCTGATCTAGCCCTCTTATGCTGGCCTCTATTATAATCTCTAT T R Q D N V L I - P S Y A G L Y Y N L Y L V R I T C - S S P L M L A S I I I S I S S G - R A D L A L L C W P L L - S L . . . . . . 3538 CTGGCTGACGTCCCCGGTGCCGATCCTCTACCCCCTGTGCAAATGCCTGCACCCGAGCGA L A D V P G A D P L P P V Q M P A P E R W L T S P V P I L Y P L C K C L H P S D S G - R P R C R S S T P C A N A C T R A . . . . . . 3598 TGTCCATACCTGGCTGAAGAGTCACTGCCATACAACCGTCAATCATATAACGATCCAATC C P Y L A E E S L P Y N R Q S Y N D P I V H T W L K S H C H T T V N H I T I Q S M S I P G - R V T A I Q P S I I - R S N . . 3658 CCATCACATAACGATG P S H N D H H I T M P I T - R Maximal non-overlapping open reading frames (>= 64 codons): none PGL 2 (- strand): 6533 4682 AGS-1 (6533 6465,5910 5864,5140 4808,4689 4682) SCR (e 0.862 d 0.900 a 0.868,e 0.979 d 0.994 a 0.000,e 0.902 d 0.000 a 0.973,e 0.750) Exon 1 6533 6465 ( 69 n); score: 0.862 Intron 1 6464 5911 ( 554 n); Pd: 0.900 Pa: 0.868 Exon 2 5910 5864 ( 47 n); score: 0.979 Intron 2 5863 5141 ( 723 n); Pd: 0.994 Pa: 0.000 Exon 3 5140 4808 ( 333 n); score: 0.902 Intron 3 4807 4690 ( 118 n); Pd: 0.000 Pa: 0.973 Exon 4 4689 4682 ( 8 n); score: 0.750 PGS (6533 6465,5910 5864,5140 4808,4689 4682) SGN-E543103- PGS (6533 6465,5910 5864,5140 4808,4689 4682) SGN-E543104+ PGS (5141 4808,4689 4682) SGN-E225616- PGS (5140 4808,4689 4682) SGN-E306317+ PGS (5140 4808,4689 4682) SGN-E303695+ PGS (5140 4808) SGN-E303256+ 3-phase translation of AGS-1 (-strand): . . . . . . 6533 GGCAGCCATGGAAATGGAGAAACCAACCCTGCAACTCTTGGCCAGCAGCTGCAAATAATT G S H G N G E T N P A T L G Q Q L Q I I A A M E M E K P T L Q L L A S S C K - F Q P W K W R N Q P C N S W P A A A N N . : . . . . . : 6473 TGGTTAGTA : ATTAATAGCTTGAGCAGTAAATATTGGACGTGCGGCTCGATTATACG : GTGT W L V : I N S L S S K Y W T C G S I I R : C G - - : L I A - A V N I G R A A R L Y : G V L V S : N - - L E Q - I L D V R L D Y T : V . . . . . . 5136 AGACGCGCAGTTCGGTGATCCTCCCGCCTAGGATATCTACTCTGCTGATTGGGAGAGCTC R R A V R - S S R L G Y L L C - L G E L D A Q F G D P P A - D I Y S A D W E S S - T R S S V I L P P R I S T L L I G R A . . . . . . 5076 CACTGTTCCGGAGCCCATTCGTTTTGGTACATAACTTTTGTGTAGTCTTTTGCTCGTCTA H C S G A H S F W Y I T F V - S F A R L T V P E P I R F G T - L L C S L L L V Y P L F R S P F V L V H N F C V V F C S S . . . . . . 5016 TGGGTATGGCGGGGCCCTGTCCCGTCGAGTTTCACTAATGTACCCTTAGAGGTCTGTGGG W V W R G P V P S S F T N V P L E V C G G Y G G A L S R R V S L M Y P - R S V G M G M A G P C P V E F H - C T L R G L W . . . . . . 4956 CATTATGTGGGTTGTATATATATGTTTTGGATAATGGTCTGGACATGGTTTGTTTGGGAT H Y V G C I Y M F W I M V W T W F V W D I M W V V Y I C F G - W S G H G L F G M A L C G L Y I Y V L D N G L D M V C L G . . . . . . 4896 GTCCGCTTGTACAGGGGCAGCCTTGTCGGCTGTGTACATCATTATGCTTTGAATAGTGGC V R L Y R G S L V G C V H H Y A L N S G S A C T G A A L S A V Y I I M L - I V A C P L V Q G Q P C R L C T S L C F E - W . . . : . 4836 GGCCTTGTCGGCTCGCGTATGCTGTTATG : GTTTGTAT G L V G S R M L L W : F V A L S A R V C C Y : G L Y R P C R L A Y A V M : V C Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-3-_PGL-2_AGS-1_PPS_1 (5031 4808,4689 4683) (frame '1'; 231 bp, 77 residues) 1 SFARLWVWRG PVPSSFTNVP LEVCGHYVGC IYMFWIMVWT WFVWDVRLYR GSLVGCVHHY 61 ALNSGGLVGS RMLLWFV AGS-2 (6531 6472,5910 5864,5140 4808,4689 4682) SCR (e 0.842 d 0.997 a 0.868,e 0.894 d 0.994 a 0.000,e 0.902 d 0.000 a 0.973,e 0.750) Exon 1 6531 6472 ( 60 n); score: 0.842 Intron 1 6471 5911 ( 561 n); Pd: 0.997 Pa: 0.868 Exon 2 5910 5864 ( 47 n); score: 0.894 Intron 2 5863 5141 ( 723 n); Pd: 0.994 Pa: 0.000 Exon 3 5140 4808 ( 333 n); score: 0.902 Intron 3 4807 4690 ( 118 n); Pd: 0.000 Pa: 0.973 Exon 4 4689 4682 ( 8 n); score: 0.750 PGS (6531 6472,5910 5864,5140 4808,4689 4682) SGN-E374134- PGS (6530 6472,5910 5864,5140 4808,4689 4682) SGN-E305738+ PGS (6530 6472,5910 5864,5140 4808,4689 4682) SGN-E374135+ PGS (6530 6472,5910 5864,5140 4808) SGN-E310669+ 3-phase translation of AGS-2 (-strand): . . . . . . : 6531 CAGCCATGGAAATGGAGAAACCAACCCTGCAACTCTTGGCCAGCAGCTGCAAATAATTTG : Q P W K W R N Q P C N S W P A A A N N L : S H G N G E T N P A T L G Q Q L Q I I - : A M E M E K P T L Q L L A S S C K - F : . . . . . : . 5910 ATTAATAGCTTGAGCAGTAAATATTGGACGTGCGGCTCGATTATACG : GTGTAGACGCGCA I N S L S S K Y W T C G S I I R : C R R A L I A - A V N I G R A A R L Y : G V D A Q D - - L E Q - I L D V R L D Y T : V - T R . . . . . . 5127 GTTCGGTGATCCTCCCGCCTAGGATATCTACTCTGCTGATTGGGAGAGCTCCACTGTTCC V R - S S R L G Y L L C - L G E L H C S F G D P P A - D I Y S A D W E S S T V P S S V I L P P R I S T L L I G R A P L F . . . . . . 5067 GGAGCCCATTCGTTTTGGTACATAACTTTTGTGTAGTCTTTTGCTCGTCTATGGGTATGG G A H S F W Y I T F V - S F A R L W V W E P I R F G T - L L C S L L L V Y G Y G R S P F V L V H N F C V V F C S S M G M . . . . . . 5007 CGGGGCCCTGTCCCGTCGAGTTTCACTAATGTACCCTTAGAGGTCTGTGGGCATTATGTG R G P V P S S F T N V P L E V C G H Y V G A L S R R V S L M Y P - R S V G I M W A G P C P V E F H - C T L R G L W A L C . . . . . . 4947 GGTTGTATATATATGTTTTGGATAATGGTCTGGACATGGTTTGTTTGGGATGTCCGCTTG G C I Y M F W I M V W T W F V W D V R L V V Y I C F G - W S G H G L F G M S A C G L Y I Y V L D N G L D M V C L G C P L . . . . . . 4887 TACAGGGGCAGCCTTGTCGGCTGTGTACATCATTATGCTTTGAATAGTGGCGGCCTTGTC Y R G S L V G C V H H Y A L N S G G L V T G A A L S A V Y I I M L - I V A A L S V Q G Q P C R L C T S L C F E - W R P C . . : . 4827 GGCTCGCGTATGCTGTTATG : GTTTGTAT G S R M L L W : F V A R V C C Y : G L Y R L A Y A V M : V C Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-3-_PGL-2_AGS-2_PPS_1 (5031 4808,4689 4683) (frame '1'; 231 bp, 77 residues) 1 SFARLWVWRG PVPSSFTNVP LEVCGHYVGC IYMFWIMVWT WFVWDVRLYR GSLVGCVHHY 61 ALNSGGLVGS RMLLWFV AGS-3 (6514 6472,5510 5316,5140 4739) SCR (e 0.837 d 0.997 a 0.966,e 0.928 d 0.000 a 0.000,e 0.896) Exon 1 6514 6472 ( 43 n); score: 0.837 Intron 1 6471 5511 ( 961 n); Pd: 0.997 Pa: 0.966 Exon 2 5510 5316 ( 195 n); score: 0.928 Intron 2 5315 5141 ( 175 n); Pd: 0.000 Pa: 0.000 Exon 3 5140 4739 ( 402 n); score: 0.896 PGS (4867 4739) SGN-E538150- PGS (6514 6472,5510 5316,5140 4808) SGN-E538151+ PGS (6514 6472,5510 5316,5140 4808) SGN-E538156+ PGS (6514 6472,5510 5316,5140 4910) SGN-E268096+ 3-phase translation of AGS-3 (-strand): . . . . . : . 6514 AAACCAACCCTGCAACTCTTGGCCAGCAGCTGCAAATAATTTG : AGTCATTTATCATTTCA K P T L Q L L A S S C K - F : E S F I I S N Q P C N S W P A A A N N L : S H L S F H T N P A T L G Q Q L Q I I - : V I Y H F . . . . . . 5493 CCAAGTCCCGGGCCGGGTAATGTTCGTGCGGAGTTTCTTGCATATGTCACCGAGTTCCTC P S P G P G N V R A E F L A Y V T E F L Q V P G R V M F V R S F L H M S P S S S T K S R A G - C S C G V S C I C H R V P . . . . . . 5433 ACTAGAGGGCCGGGTATGTATATTATATATATGATTGGTGATGAGGATGGTTATGATGAT T R G P G M Y I I Y M I G D E D G Y D D L E G R V C I L Y I - L V M R M V M M M H - R A G Y V Y Y I Y D W - - G W L - - . . . . . . : 5373 GATGATGACGGAGATGACGTGATGATTATTTTGCCGAGCCCCTTACTAGGGAAGCTGG : GT D D D G D D V M I I L P S P L L G K L : G M M T E M T - - L F C R A P Y - G S W : V - - - R R - R D D Y F A E P L T R E A G : . . . . . . 5138 GTAGACGCGCAGTTCGGTGATCCTCCCGCCTAGGATATCTACTCTGCTGATTGGGAGAGC V D A Q F G D P P A - D I Y S A D W E S - T R S S V I L P P R I S T L L I G R A C R R A V R - S S R L G Y L L C - L G E . . . . . . 5078 TCCACTGTTCCGGAGCCCATTCGTTTTGGTACATAACTTTTGTGTAGTCTTTTGCTCGTC S T V P E P I R F G T - L L C S L L L V P L F R S P F V L V H N F C V V F C S S L H C S G A H S F W Y I T F V - S F A R . . . . . . 5018 TATGGGTATGGCGGGGCCCTGTCCCGTCGAGTTTCACTAATGTACCCTTAGAGGTCTGTG Y G Y G G A L S R R V S L M Y P - R S V M G M A G P C P V E F H - C T L R G L W L W V W R G P V P S S F T N V P L E V C . . . . . . 4958 GGCATTATGTGGGTTGTATATATATGTTTTGGATAATGGTCTGGACATGGTTTGTTTGGG G I M W V V Y I C F G - W S G H G L F G A L C G L Y I Y V L D N G L D M V C L G G H Y V G C I Y M F W I M V W T W F V W . . . . . . 4898 ATGTCCGCTTGTACAGGGGCAGCCTTGTCGGCTGTGTACATCATTATGCTTTGAATAGTG M S A C T G A A L S A V Y I I M L - I V C P L V Q G Q P C R L C T S L C F E - W D V R L Y R G S L V G C V H H Y A L N S . . . . . . 4838 GCGGCCTTGTCGGCTCGCGTATGCTGTTATGGTTGAATGGTTATGACTCCTTATGAGACA A A L S A R V C C Y G - M V M T P Y E T R P C R L A Y A V M V E W L - L L M R Q G G L V G S R M L L W L N G Y D S L - D . . . . 4778 GGTCCTCTTATATATATATATGACGTTGGGGTTGGCTTGA G P L I Y I Y D V G V G L V L L Y I Y M T L G L A - R S S Y I Y I - R W G W L Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-3-_PGL-2_AGS-3_PPS_1 (5031 4783) (frame '0'; 246 bp, 82 residues) 1 SFARLWVWRG PVPSSFTNVP LEVCGHYVGC IYMFWIMVWT WFVWDVRLYR GSLVGCVHHY 61 ALNSGGLVGS RMLLWLNGYD SL- >C06HBa0057J04.1-3-_PGL-2_AGS-3_PPS_2 (6475 6472,5510 5316,5140 5106) (frame '1'; 231 bp, 77 residues) 1 FESFIISPSP GPGNVRAEFL AYVTEFLTRG PGMYIIYMIG DEDGYDDDDD GDDVMIILPS 61 PLLGKLGVDA QFGDPPA- AGS-4 (5455 5401,5355 4808) SCR (e 0.818 d 0.000 a 0.000,e 0.904) Exon 1 5455 5401 ( 55 n); score: 0.818 Intron 1 5400 5356 ( 45 n); Pd: 0.000 Pa: 0.000 Exon 2 5355 4808 ( 548 n); score: 0.904 PGS (5455 5401,5355 4808) SGN-E544254- 3-phase translation of AGS-4 (-strand): . . . . . . : 5455 TGCATATGTCACCGAGTTCCTCACTAGAGGGCCGGGTATGTATATTATATATATG : GTGAT C I C H R V P H - R A G Y V Y Y I Y : G D A Y V T E F L T R G P G M Y I I Y M : V M H M S P S S S L E G R V C I L Y I W : - . . . . . . 5350 GATTATTTTGCCGAGCCCCTTACTAGGGAAGCTGGGCACCTTAAATGTTAAATATATGCA D Y F A E P L T R E A G H L K C - I Y A I I L P S P L L G K L G T L N V K Y M H - L F C R A P Y - G S W A P - M L N I C . . . . . . 5290 TGATTTTCACTTAAAAAGTATATGTGTAGCGATATTTTTTTTCGAGTTGCCACATTGGTA - F S L K K Y M C S D I F F R V A T L V D F H L K S I C V A I F F F E L P H W Y M I F T - K V Y V - R Y F F S S C H I G . . . . . . 5230 TCCTGTCATCTTTACCTTATGCTTTACATACTCAGTACATTGTTCGTACTGACCCCCCTT S C H L Y L M L Y I L S T L F V L T P L P V I F T L C F T Y S V H C S Y - P P F I L S S L P Y A L H T Q Y I V R T D P P . . . . . . 5170 TCCTCGGGGGGCTGCGTTTCATGCCCGCAGGTGTAGACGCGCAGTTCGGTGATCCTCCCG S S G G C V S C P Q V - T R S S V I L P P R G A A F H A R R C R R A V R - S S R F L G G L R F M P A G V D A Q F G D P P . . . . . . 5110 CCTAGGATATCTACTCTGCTGATTGGGAGAGCTCCACTGTTCCGGAGCCCATTCGTTTTG P R I S T L L I G R A P L F R S P F V L L G Y L L C - L G E L H C S G A H S F W A - D I Y S A D W E S S T V P E P I R F . . . . . . 5050 GTACATAACTTTTGTGTAGTCTTTTGCTCGTCTATGGGTATGGCGGGGCCCTGTCCCGTC V H N F C V V F C S S M G M A G P C P V Y I T F V - S F A R L W V W R G P V P S G T - L L C S L L L V Y G Y G G A L S R . . . . . . 4990 GAGTTTCACTAATGTACCCTTAGAGGTCTGTGGGCATTATGTGGGTTGTATATATATGTT E F H - C T L R G L W A L C G L Y I Y V S F T N V P L E V C G H Y V G C I Y M F R V S L M Y P - R S V G I M W V V Y I C . . . . . . 4930 TTGGATAATGGTCTGGACATGGTTTGTTTGGGATGTCCGCTTGTACAGGGGCAGCCTTGT L D N G L D M V C L G C P L V Q G Q P C W I M V W T W F V W D V R L Y R G S L V F G - W S G H G L F G M S A C T G A A L . . . . . . 4870 CGGCTGTGTACATCATTATGCTTTGAATAGTGGCGGCCTTGTCGGCTCGCGTATGCTGTT R L C T S L C F E - W R P C R L A Y A V G C V H H Y A L N S G G L V G S R M L L S A V Y I I M L - I V A A L S A R V C C . 4810 ATG M Y Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-3-_PGL-2_AGS-4_PPS_1 (5454 5401,5355 5179) (frame '2'; 228 bp, 76 residues) 1 AYVTEFLTRG PGMYIIYMVM IILPSPLLGK LGTLNVKYMH DFHLKSICVA IFFFELPHWY 61 PVIFTLCFTY SVHCSY- >C06HBa0057J04.1-3-_PGL-2_AGS-4_PPS_2 (5031 4810) (frame '2'; 222 bp, 74 residues) 1 SFARLWVWRG PVPSSFTNVP LEVCGHYVGC IYMFWIMVWT WFVWDVRLYR GSLVGCVHHY 61 ALNSGGLVGS RMLL AGS-5 (6522 5822) SCR (e 0.861) Exon 1 6522 5822 ( 701 n); score: 0.861 PGS (6514 5822) SGN-E328093+ PGS (6522 6068) SGN-E298250+ 3-phase translation of AGS-5 (-strand): . . . . . . 6522 AAATGGAGAAACCAACCCTGCAACTCTTGGCCAGCAGCTGCAAATAATTTGGTTAGTAAT K W R N Q P C N S W P A A A N N L V S N N G E T N P A T L G Q Q L Q I I W L V I M E K P T L Q L L A S S C K - F G - - . . . . . . 6462 CTCCTTGTTTGGTGTGTTAATTCTTTAGAATACCCTTGTTAATTATCCATTAATTTTAAG L L V W C V N S L E Y P C - L S I N F K S L F G V L I L - N T L V N Y P L I L R S P C L V C - F F R I P L L I I H - F - . . . . . . 6402 AAGGGGGCGTGACCAGTAGCTTAGGAAGTTTGTTTTAGTTATTGAATGTACTAAGTATGA K G A - P V A - E V C F S Y - M Y - V - R G R D Q - L R K F V L V I E C T K Y E E G G V T S S L G S L F - L L N V L S M . . . . . . 6342 ATGGAAACCATAATCGGATTATTAGTGGTGTCGTGTTGGTGCTTGGGCTGTTTTGATTAA M E T I I G L L V V S C W C L G C F D - W K P - S D Y - W C R V G A W A V L I K N G N H N R I I S G V V L V L G L F - L . . . . . . 6282 AGCAAACTGCAGGAAAATTATGTTTTGGCATTATGTATATGTTGAATGTGATTATGAGTA S K L Q E N Y V L A L C I C - M - L - V A N C R K I M F W H Y V Y V E C D Y E Y K Q T A G K L C F G I M Y M L N V I M S . . . . . . 6222 TATACTCCAAAGGATGAATACGATAAGGTAGATGTGTTACGAATTATAAAACGAGTTATC Y T P K D E Y D K V D V L R I I K R V I I L Q R M N T I R - M C Y E L - N E L S I Y S K G - I R - G R C V T N Y K T S Y . . . . . . 6162 ACTCGGTGTGTCGTTGCTTCGCTGATATAGTTGCCGAGATGGAACTGTTTTGGGGAGGGG T R C V V A S L I - L P R W N C F G E G L G V S L L R - Y S C R D G T V L G R G H S V C R C F A D I V A E M E L F W G G . . . . . . 6102 GCTGTTTAATATGATTCTTTGGGTTATATGTGTTATTGGTATTGTTGTGGATAATTTGGA A V - Y D S L G Y M C Y W Y C C G - F G L F N M I L W V I C V I G I V V D N L D G C L I - F F G L Y V L L V L L W I I W . . . . . . 6042 TTGTTGTCGGATTGGGACGAAGTAAGGAAAATAGGGGAGGTGCTGCCGAATTTTCGTTAG L L S D W D E V R K I G E V L P N F R - C C R I G T K - G K - G R C C R I F V R I V V G L G R S K E N R G G A A E F S L . . . . . . 5982 ATTATTAGCTAGCTTACAAGAAAGTAAAGCACGATGTTTATCTAATTGCGGCACGATTGT I I S - L T R K - S T M F I - L R H D C L L A S L Q E S K A R C L S N C G T I V D Y - L A Y K K V K H D V Y L I A A R L . . . . . . 5922 TGCTTGTTATAGATTAATAGCTTGAGCAGTAAATATTGGACGTGCGGCTCGATTATACGG C L L - I N S L S S K Y W T C G S I I R A C Y R L I A - A V N I G R A A R L Y G L L V I D - - L E Q - I L D V R L D Y T . . . . . 5862 TATGTAACGCTGTCCCTTCTTTCTTTGTTTGGCATGACTTT Y V T L S L L S L F G M T M - R C P F F L C L A - L V C N A V P S F F V W H D F Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-5 (+strand): . . . . . . 5822 AAAGTCATGCCAAACAAAGAAAGAAGGGACAGCGTTACATACCGTATAATCGAGCCGCAC K V M P N K E R R D S V T Y R I I E P H K S C Q T K K E G T A L H T V - S S R T S H A K Q R K K G Q R Y I P Y N R A A . . . . . . 5882 GTCCAATATTTACTGCTCAAGCTATTAATCTATAACAAGCAACAATCGTGCCGCAATTAG V Q Y L L L K L L I Y N K Q Q S C R N - S N I Y C S S Y - S I T S N N R A A I R R P I F T A Q A I N L - Q A T I V P Q L . . . . . . 5942 ATAAACATCGTGCTTTACTTTCTTGTAAGCTAGCTAATAATCTAACGAAAATTCGGCAGC I N I V L Y F L V S - L I I - R K F G S - T S C F T F L - A S - - S N E N S A A D K H R A L L S C K L A N N L T K I R Q . . . . . . 6002 ACCTCCCCTATTTTCCTTACTTCGTCCCAATCCGACAACAATCCAAATTATCCACAACAA T S P I F L T S S Q S D N N P N Y P Q Q P P L F S L L R P N P T T I Q I I H N N H L P Y F P Y F V P I R Q Q S K L S T T . . . . . . 6062 TACCAATAACACATATAACCCAAAGAATCATATTAAACAGCCCCCTCCCCAAAACAGTTC Y Q - H I - P K E S Y - T A P S P K Q F T N N T Y N P K N H I K Q P P P Q N S S I P I T H I T Q R I I L N S P L P K T V . . . . . . 6122 CATCTCGGCAACTATATCAGCGAAGCAACGACACACCGAGTGATAACTCGTTTTATAATT H L G N Y I S E A T T H R V I T R F I I I S A T I S A K Q R H T E - - L V L - F P S R Q L Y Q R S N D T P S D N S F Y N . . . . . . 6182 CGTAACACATCTACCTTATCGTATTCATCCTTTGGAGTATATACTCATAATCACATTCAA R N T S T L S Y S S F G V Y T H N H I Q V T H L P Y R I H P L E Y I L I I T F N S - H I Y L I V F I L W S I Y S - S H S . . . . . . 6242 CATATACATAATGCCAAAACATAATTTTCCTGCAGTTTGCTTTAATCAAAACAGCCCAAG H I H N A K T - F S C S L L - S K Q P K I Y I M P K H N F P A V C F N Q N S P S T Y T - C Q N I I F L Q F A L I K T A Q . . . . . . 6302 CACCAACACGACACCACTAATAATCCGATTATGGTTTCCATTCATACTTAGTACATTCAA H Q H D T T N N P I M V S I H T - Y I Q T N T T P L I I R L W F P F I L S T F N A P T R H H - - S D Y G F H S Y L V H S . . . . . . 6362 TAACTAAAACAAACTTCCTAAGCTACTGGTCACGCCCCCTTCTTAAAATTAATGGATAAT - L K Q T S - A T G H A P F L K L M D N N - N K L P K L L V T P P S - N - W I I I T K T N F L S Y W S R P L L K I N G - . . . . . . 6422 TAACAAGGGTATTCTAAAGAATTAACACACCAAACAAGGAGATTACTAACCAAATTATTT - Q G Y S K E L T H Q T R R L L T K L F N K G I L K N - H T K Q G D Y - P N Y L L T R V F - R I N T P N K E I T N Q I I . . . . . 6482 GCAGCTGCTGGCCAAGAGTTGCAGGGTTGGTTTCTCCATTT A A A G Q E L Q G W F L H Q L L A K S C R V G F S I C S C W P R V A G L V S P F Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-3+_PGL-2_AGS-5_PPS_1 (5917 6186) (frame '0'; 267 bp, 89 residues) 1 QATIVPQLDK HRALLSCKLA NNLTKIRQHL PYFPYFVPIR QQSKLSTTIP ITHITQRIIL 61 NSPLPKTVPS RQLYQRSNDT PSDNSFYNS- ... finished at: Mon Jul 24 23:13:30 2006 ________________________________________________________________________________ Sequence 4: C06HBa0057J04.1-4, from 1 to 3703, both strands analyzed. ... started at: Mon Jul 24 23:13:30 2006 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 4 HitsTableSize = 0 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 4 HitsTableSize = 4 ******************************************************************************** EST sequence 2 +strand 599 n (File: SGN-E307342+) 1 GACGAGGGAC GAATGTTCCT AAGGGGGGAA GGATGTTACG CCTCGTATTT TTATACGTCG 61 TGCGCGTCAT GAACTAGTAT ATGTAAGTTC GGGAAATGAG ATTTTATTTT AAGTTCCAAG 121 TGTTTAAAGA AATATTATGC ATGGATGTTA ATTCCATATG TGATATTAAT TAGTGTGGGA 181 TTAATTAGGG GCTGATTTGG ATTTAATTTA TCGAGTGGGC CCCACCACTC AAGGCAAAGT 241 AAGAATTCAG ATTTTGGGAG ATAGCCTTAG TGGAGAATGT GTAGTGGGGG GCTCCTCCAC 301 TTCATATTGC TAAATATTAA AAAGAAAATC TGATTTGGGG ATAGAATTAG TGGAGCATTT 361 GTAGTGGTGG GCTGCCTCCA CTTCATATTG TCAAATGGTG TAGTGAAATA CTTTGCAACA 421 TATCATATCT TTCACCACAT GACTTGGGCA GCCATGGAAA TGGAGAAAAC CAGCCATTAA 481 ACTCTTGGAC AGCAGCTGCA AAGAAATTGG TTAGTAATCT CTTTGCTTGG TTTGTTAATT 541 CCTTAGAATA CCTTTGTTAA TTAGACATTT ATGTTAAGAA AGGGGACGTG AACAGTATC Predicted gene structure (within gDNA segment 2311 to 1): Exon 1 1351 1192 ( 160 n); cDNA 10 169 ( 160 n); score: 0.856 Intron 1 1191 1159 ( 33 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0.84) Exon 2 1158 1010 ( 149 n); cDNA 170 317 ( 148 n); score: 0.762 Intron 2 1009 913 ( 97 n); Pd: 0.095 (s: 0.72), Pa: 0.000 (s: 0) Exon 3 912 904 ( 9 n); cDNA 318 326 ( 9 n); score: 1.000 MATCH C06HBa0057J04.1-4- SGN-E307342+ 0.811 318 0.531 C PGS_C06HBa0057J04.1-4-_SGN-E307342+ (1351 1192,1158 1010,912 904) Alignment (genomic DNA sequence = upper lines): CGAATATTAC TAACGGGGGA AGGATGGTAT GCCTCGTATT TTTATATGTA ATGCGCGTCA 1292 ||||| || | ||| |||||| |||||| || |||||||||| |||||| || ||||||||| CGAATGTTCC TAAGGGGGGA AGGATGTTAC GCCTCGTATT TTTATACGTC GTGCGCGTCA 69 TGAACTATTA GATGTAAGTC CGGAGAATAA GAATTTATTT TATGTTCCAA GTGATTAAAG 1232 ||||||| || |||||||| ||| ||| | || ||||||| || ||||||| ||| |||||| TGAACTAGTA TATGTAAGTT CGGGAAATGA GATTTTATTT TAAGTTCCAA GTGTTTAAAG 129 AAATATTAGG CAAGGATGTT TATTCCAAAT GAGTTATTAA GTATGATTTG GCTAATATTA 1172 |||||||| | || ||||||| |||||| || | | |||||| AAATATTATG CATGGATGTT AATTCCATAT GTGATATTAA .......... .......... 169 GTGTCATTAA CTTTAAGTGA GGGATTAATT AGTGGTTGAT TAGGATTAAA TTAATCTAGT 1112 | |||| |||||||||| || || |||| | ||||| || || ||| ||| .......... ...TTAGTGT GGGATTAATT AGGGGCTGAT TTGGATTTAA TTTATCGAGT 216 GGGCCCCACC ACTCAAGTCA AATTAAAAGG GATCAAATTG GGAGCTGTCC TTAGTGGGAC 1052 |||||||||| ||||||| || || ||| | | ||| |||| | || ||||||| GGGCCCCACC ACTCAAGGCA AAGTAAGAAT TCAGATTTTG GGAGATAGCC TTAGTGGAGA 276 ACGTGTAGAG TAGGGCTGCC TCCACTTCAT ATT-ATAAAT GAGGTGGTAA AATGCAATGC 993 | |||||| | ||||| || |||||||||| ||| ||||| | ATGTGTAGTG GGGGGCT-CC TCCACTTCAT ATTGCTAAAT -AT....... .......... 317 AACATATCAT CTTCTTCACC TCTTGGCTTA GGCAGCCATG GAAATGGAGA AAACCAACCA 933 .......... .......... .......... .......... .......... .......... 317 TGCAGCTCTT GGCCAGCAAC TAAAAAGAA 904 ||||||||| .......... .......... TAAAAAGAA 326 hqPGS_C06HBa0057J04.1-4-_SGN-E307342+ (1351 1192,1158 1010) ******************************************************************************** EST sequence 1 +strand 691 n (File: SGN-E328093+) 1 GAAAACCAGC CATTAAACTC TTGGACAGCA GCTGCAAAGA AATTGGTTAG TAATCTCTTT 61 GCTTGGTTTG TTAATTCCTT AGAATACCTT TGTTAATTAG ACATTTATGT TAAGAAGGGG 121 GACGTGAACA GTATCTTAGG AATTTGTTTT AGTTATTGAA TGTGCTAAGG ATGAGCAGAA 181 ACCATGATCG GATTGCTAGC GGTGTTATAT TTGTGTTGGG CTGTTTTGAT TAAAGTAAGC 241 TGCTGGAAAT TCTGTTTTGG TGTTATGCAT ATGTTAATAT GATTATGGGT ATATACTCCA 301 AAGGATGAAT ACAATAAGGT AGATGTGTTG CGAATTATAA AACGAATTAT CGGTCGGTGT 361 GTCGTTGTTT TGTTACTATG GTTGCTAAAA ACGGAACTGT TTTGGGGGAG GCTGTTTAAT 421 ATGATTTGTT GGATTATATG TGTTGTTGGT ATTGTTGTGG ATAATTTGGG TTGTTGTTGG 481 ATTGGGATGA AGTAAAGAAA ATAGGGGAAG TGCTGCCGGA TTTTCGTTAG ATTATTAGCT 541 AGCTTACATA AGTAGTAAGC GCGACATTTA TCTAATTGCG GCACGATTGG TGCTTGTTAT 601 AGATTTATAC CTTGAGCAGT AAATATTGGA CGTACGGCTC GACTATTCGG TATGTAACGC 661 TATCCTTTCC TTCTTTGTTT GGCATGACCT T Predicted gene structure (within gDNA segment 1922 to 1): Exon 1 944 248 ( 697 n); cDNA 1 691 ( 691 n); score: 0.868 MATCH C06HBa0057J04.1-4- SGN-E328093+ 0.868 697 1.009 C PGS_C06HBa0057J04.1-4-_SGN-E328093+ (944 248) Alignment (genomic DNA sequence = upper lines): GAAAACCAAC CATGCAGCTC TTGGCCAGCA ACTAAAAAGA ATTTGGTTAG TAATCTCTTT 885 |||||||| | ||| | ||| |||| ||||| || ||||| | |||||||| |||||||||| GAAAACCAGC CATTAAACTC TTGGACAGCA GCTGCAAAGA AATTGGTTAG TAATCTCTTT 60 GCTTGGTGTG TTAATTCCTT AGAATGCCTT TGTTAATTAG CCATTAATGT TAAGAATGGG 825 ||||||| || |||||||||| ||||| |||| |||||||||| |||| |||| |||||| ||| GCTTGGTTTG TTAATTCCTT AGAATACCTT TGTTAATTAG ACATTTATGT TAAGAAGGGG 120 G-CCTGACCA GTATCTTAGG ATGTTTGTTT TAGTTATTGA ATATGATAAG TATGAACGGA 766 | | ||| || |||||||||| | ||||||| |||||||||| || || |||| |||| | || GACGTGAACA GTATCTTAGG A-ATTTGTTT TAGTTATTGA ATGTGCTAAG GATGAGCAGA 179 AACCATAACC GGATTATTAG TGGTGTCGTG TTAGTGCTTG GGCTGTTTTG ATTAAAGCAA 706 |||||| | | ||||| ||| ||||| | || ||| ||| |||||||||| ||||||| || AACCATGATC GGATTGCTAG CGGTGTTATA TTTGTG-TTG GGCTGTTTTG ATTAAAGTAA 238 ACTGCGAGAA ATTCTGTTTT GGCGTTATGT ATATGTTAAA TGTTATTATG GGTATATACT 646 |||| ||| |||||||||| || |||||| ||||||| || | | |||||| |||||||||| GCTGCTGGAA ATTCTGTTTT GGTGTTATGC ATATGTT-AA TATGATTATG GGTATATACT 297 CTAAAGGATG AATACGATAA GGTAGATGTG TTGGGAATTA TAAAAGGAGT TATCGCTCAG 586 | |||||||| ||||| |||| |||||||||| ||| |||||| ||||| || | ||||| || | CCAAAGGATG AATACAATAA GGTAGATGTG TTGCGAATTA TAAAACGAAT TATCGGTCGG 357 TGTGTTGTTG CTTCATTACT ATGGTTGC-C GAGACGGAAC TGTTTTGGGG GAGGGGGCTG 527 ||||| |||| || ||||| |||||||| | ||||||| |||||||||| || ||||| TGTGTCGTTG TTTTGTTACT ATGGTTGCTA AAAACGGAAC TGTTTTGGGG GA---GGCTG 414 TTTAATATGA TTCATTGGGT TATATGTGTT GTTGGTATTA CTGTGGATAA TTTGGGTTGT 467 |||||||||| || |||| | |||||||||| ||||||||| ||||||||| |||||||||| TTTAATATGA TTTGTTGGAT TATATGTGTT GTTGGTATTG TTGTGGATAA TTTGGGTTGT 474 TGTCGGATTG GGACGAAGTA AGGAAATTAG GGGAAGTGCT ACCGGATTTT TGTTAGATTA 407 ||| |||||| ||| |||||| | |||| ||| |||||||||| ||||||||| ||||||||| TGTTGGATTG GGATGAAGTA AAGAAAATAG GGGAAGTGCT GCCGGATTTT CGTTAGATTA 534 TTAACTAGCT TACAATAGGT AGTAAAGCGC GACATTTATC TAATTGCGGC ACGATTTATG 347 ||| |||||| ||| ||| || ||| |||||| |||||||||| |||||||||| |||||| || TTAGCTAGCT TAC-ATAAGT AGT-AAGCGC GACATTTATC TAATTGCGGC ACGATTGGTG 592 CTTGTTATAG ATTAATAGCT TGAGAAGTAA ATATTGGACG TGCGACTCGA CTATACGGTA 287 |||||||||| ||| ||| || |||| ||||| |||||||||| | || ||||| |||| ||||| CTTGTTATAG ATTTATACCT TGAGCAGTAA ATATTGGACG TACGGCTCGA CTATTCGGTA 652 TGTAACGCTA CCTCTTCTTT CTTTGTTTGG CATGACTTT 248 |||||||||| | ||| || |||||||||| |||||| || TGTAACGCTA TCCTTTCCTT CTTTGTTTGG CATGACCTT 691 hqPGS_C06HBa0057J04.1-4-_SGN-E328093+ (944 248) ******************************************************************************** EST sequence 4 +strand 455 n (File: SGN-E298250+) 1 AAATGGAGAA ACCAACCCTG CAACTCTTGG CCAGTAGCTG CAAATAATTT GGGGAGTAAT 61 CTCCTTGTTT GGTGTGTTAA TTCTTTAGAA CACCCTTGTT AATTATCCAT TAATTTTAAG 121 AAGGGGGCGT GACCAGTTGC TTACGAAGTT TGTTTTAGTT ATTGAATGTG CTAAGTATGA 181 ATGGAAACCA TAATCGGATT ATTAGTGGTG TCGTGTTGGT GCTTGGGCTG TTTTGATTAA 241 AGCAAACTGC AGGAAAATTC TATTTTGGCA TTATGTATAT GCTGAATGTG ATTATGAGTA 301 TATACTCCAA CGGATGAATA CGATTAGGTA AATGTGTTGC GAATTATAAA ACGAGTTATC 361 GCTCGGTGTG TCGGTGCTTC GCTGCTATAG TTGCCCAGAC GGAACTGTTT TGGGGAGGGG 421 GCTGCCTAAT GTGATACTTC GGGTTATATG TGTTA Predicted gene structure (within gDNA segment 3296 to 1): Exon 1 951 497 ( 455 n); cDNA 1 454 ( 454 n); score: 0.867 MATCH C06HBa0057J04.1-4- SGN-E298250+ 0.867 455 1.000 C PGS_C06HBa0057J04.1-4-_SGN-E298250+ (951 497) Alignment (genomic DNA sequence = upper lines): AAATGGAGAA AACCAACCAT GCAGCTCTTG GCCAGCAACT AAAAAGAATT TGGTTAGTAA 892 |||||||| | |||||||| | ||| |||||| ||||| | || ||| |||| ||| ||||| AAATGGAG-A AACCAACCCT GCAACTCTTG GCCAGTAGCT GCAAATAATT TGGGGAGTAA 59 TCTCTTTGCT TGGTGTGTTA ATTCCTTAGA ATGCCTTTGT TAATTAGCCA TTAATGTTAA 832 |||| ||| | |||||||||| |||| ||||| | || |||| |||||| ||| ||||| |||| TCTCCTTGTT TGGTGTGTTA ATTCTTTAGA ACACCCTTGT TAATTATCCA TTAATTTTAA 119 GAATGGGGCC TGACCAGTAT CTTAGGATGT TTGTTTTAGT TATTGAATAT GATAAGTATG 772 ||| ||||| |||||||| |||| || || |||||||||| |||||||| | | |||||||| GAAGGGGGCG TGACCAGTTG CTTACGAAGT TTGTTTTAGT TATTGAATGT GCTAAGTATG 179 AACGGAAACC ATAACCGGAT TATTAGTGGT GTCGTGTTAG TGCTTGGGCT GTTTTGATTA 712 || ||||||| |||| ||||| |||||||||| |||||||| | |||||||||| |||||||||| AATGGAAACC ATAATCGGAT TATTAGTGGT GTCGTGTTGG TGCTTGGGCT GTTTTGATTA 239 AAGCAAACTG C-GAGAAATT CTGTTTTGGC GTTATGTATA TGTTAAATGT TATTATGGGT 653 |||||||||| | | ||||| || ||||||| ||||||||| || | ||||| |||||| || AAGCAAACTG CAGGAAAATT CTATTTTGGC ATTATGTATA TGCTGAATGT GATTATGAGT 299 ATATACTCTA AAGGATGAAT ACGATAAGGT AGATGTGTTG GGAATTATAA AAGGAGTTAT 593 |||||||| | | |||||||| ||||| |||| | |||||||| ||||||||| || ||||||| ATATACTCCA ACGGATGAAT ACGATTAGGT AAATGTGTTG CGAATTATAA AACGAGTTAT 359 CGCTCAGTGT GTTGTTGCTT CATTACTATG GTTGCCGAGA CGGAACTGTT TTGGGGGAGG 533 ||||| |||| || | ||||| | | |||| |||||| ||| |||||||||| || ||||||| CGCTCGGTGT GTCGGTGCTT CGCTGCTATA GTTGCCCAGA CGGAACTGTT TT-GGGGAGG 418 GGGCTGTTTA ATATGATTCA TTGGGTTATA TGTGTT 497 |||||| || || |||| | | |||||||| |||||| GGGCTGCCTA ATGTGATACT TCGGGTTATA TGTGTT 454 hqPGS_C06HBa0057J04.1-4-_SGN-E298250+ (951 497) ******************************************************************************** EST sequence 3 +strand 713 n (File: SGN-E544255+) 1 GGACGAGGGA CGAATGTTCC TAAGGGGGGA AGGATGTTAC GCCTCGTATT TTTATACGTC 61 GTGCGCGTCA TGAACTAGTA TATGTAAGTT CGGGAAATGA GATTTTATTT TAAGTTCCAA 121 GTGTTTAAAG AAATATTATG CAAGGATGTT AATTCCATAT GTGATATTAA TTAGTGTGGG 181 ATTAATTAGG GGCTGATTTG GATTTAATTT ATCGAGTGGG CCCCACCACT CAAGGCAAAG 241 TAAGAATTCA GATTTTGGGA GATAGCCTTA GTGGAGAATG TGTAGTGGGG GGCTCCTCCA 301 CTTCATATTG CTAAATATTA AAAAGAAAAT CTGATTTGGG GATAGAATTA GTGGAGCATT 361 TGTAGTGGTG GGCTGCCTCC ACTTCATATT GTCAAATGGT GTAGTGAAAT ACTTTGCAAC 421 ATATCATATC TTTCACCACA TGACTTGGGC AGCCATGGAA ATGGAGAAAA CCAGCCATTA 481 AACTCTTGGA CAGCAGCTGC AAAGAAATTG GTTAGTAATC TCTTTGCTTG GTTTGTTAAT 541 TCCTTAGAAT ACCTTTGTTA ATTAGACATT TATGTTAAGA AGGGGGACGT GAACAGTATC 601 TTAGGAATTT GTTTTAGTTA TTGAATGTGC TAAGGATGAG CAGAAACCAT GATCGGATTG 661 CTAGCGGTGT TATATTTGTG TTGGGCTGTT TTGATTAAAG TAAGCTGCTG GAA Predicted gene structure (within gDNA segment 2321 to 1): Exon 1 1080 696 ( 385 n); cDNA 329 713 ( 385 n); score: 0.810 MATCH C06HBa0057J04.1-4- SGN-E544255+ 0.810 385 0.540 C PGS_C06HBa0057J04.1-4-_SGN-E544255+ (1080 696) Alignment (genomic DNA sequence = upper lines): ATCAAATTGG GAGCTGTCCT TAGTGGGACA CGTGTAGAGT AGGGCTGCCT CCACTTCATA 1021 ||| ||| | | | | | |||||| || ||||| | ||||||||| |||||||||| ATCTGATTTG GGGATAGAAT TAGTGGAGCA TTTGTAGTGG TGGGCTGCCT CCACTTCATA 388 TTAT-AAATG AGGTGGTAAA ATGCAATGCA ACATATCATC TTCTTCACCT CTTGGCTTAG 962 || | ||||| || || || || | |||| ||||||||| | |||||| | || ||| | TTGTCAAATG GTGTAGTGAA ATACTTTGCA ACATATCATA TCTTTCACCA CATGACTTGG 448 GCAGCCATGG AAATGGAGAA AACCAACCAT GCAGCTCTTG GCCAGCAACT AAAAAGAATT 902 |||||||||| |||||||||| ||||| |||| | |||||| | ||||| || |||||| | GCAGCCATGG AAATGGAGAA AACCAGCCAT TAAACTCTTG GACAGCAGCT GCAAAGAAAT 508 TGGTTAGTAA TCTCTTTGCT TGGTGTGTTA ATTCCTTAGA ATGCCTTTGT TAATTAGCCA 842 |||||||||| |||||||||| |||| ||||| |||||||||| || ||||||| ||||||| || TGGTTAGTAA TCTCTTTGCT TGGTTTGTTA ATTCCTTAGA ATACCTTTGT TAATTAGACA 568 TTAATGTTAA GAATGGGG-C CTGACCAGTA TCTTAGGATG TTTGTTTTAG TTATTGAATA 783 || ||||||| ||| |||| | ||| ||||| |||||||| |||||||||| ||||||||| TTTATGTTAA GAAGGGGGAC GTGAACAGTA TCTTAGGA-A TTTGTTTTAG TTATTGAATG 627 TGATAAGTAT GAACGGAAAC CATAACCGGA TTATTAGTGG TGTCGTGTTA GTGCTTGGGC 723 || |||| || || | ||||| ||| | |||| || ||| || ||| | || ||| |||||| TGCTAAGGAT GAGCAGAAAC CATGATCGGA TTGCTAGCGG TGTTATATTT GTG-TTGGGC 686 TGTTTTGATT AAAGCAAACT GCGAGAA 696 |||||||||| |||| || || || ||| TGTTTTGATT AAAGTAAGCT GCTGGAA 713 hqPGS_C06HBa0057J04.1-4-_SGN-E544255+ (1080 696) Total number of EST alignments reported: 4 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 3703: PGL 1 (- strand): 1351 248 AGS-1 (1351 1192,1158 248) SCR (e 0.856 d 0.000 a 0.000,e 0.868) Exon 1 1351 1192 ( 160 n); score: 0.856 Intron 1 1191 1159 ( 33 n); Pd: 0.000 Pa: 0.000 Exon 2 1158 248 ( 911 n); score: 0.868 PGS (944 248) SGN-E328093+ PGS (951 497) SGN-E298250+ PGS (1080 696) SGN-E544255+ PGS (1351 1192,1158 1010) SGN-E307342+ 3-phase translation of AGS-1 (-strand): . . . . . . 1351 CGAATATTACTAACGGGGGAAGGATGGTATGCCTCGTATTTTTATATGTAATGCGCGTCA R I L L T G E G W Y A S Y F Y M - C A S E Y Y - R G K D G M P R I F I C N A R H N I T N G G R M V C L V F L Y V M R V . . . . . . 1291 TGAACTATTAGATGTAAGTCCGGAGAATAAGAATTTATTTTATGTTCCAAGTGATTAAAG - T I R C K S G E - E F I L C S K - L K E L L D V S P E N K N L F Y V P S D - R M N Y - M - V R R I R I Y F M F Q V I K . . . . : . . 1231 AAATATTAGGCAAGGATGTTTATTCCAAATGAGTTATTAA : TAAGTGAGGGATTAATTAGT K Y - A R M F I P N E L L : I S E G L I S N I R Q G C L F Q M S Y - : - V R D - L V E I L G K D V Y S K - V I N : K - G I N - . . . . . . 1138 GGTTGATTAGGATTAAATTAATCTAGTGGGCCCCACCACTCAAGTCAAATTAAAAGGGAT G - L G L N - S S G P H H S S Q I K R D V D - D - I N L V G P T T Q V K L K G I W L I R I K L I - W A P P L K S N - K G . . . . . . 1078 CAAATTGGGAGCTGTCCTTAGTGGGACACGTGTAGAGTAGGGCTGCCTCCACTTCATATT Q I G S C P - W D T C R V G L P P L H I K L G A V L S G T R V E - G C L H F I L S N W E L S L V G H V - S R A A S T S Y . . . . . . 1018 ATAAATGAGGTGGTAAAATGCAATGCAACATATCATCTTCTTCACCTCTTGGCTTAGGCA I N E V V K C N A T Y H L L H L L A - A - M R W - N A M Q H I I F F T S W L R Q Y K - G G K M Q C N I S S S S P L G L G . . . . . . 958 GCCATGGAAATGGAGAAAACCAACCATGCAGCTCTTGGCCAGCAACTAAAAAGAATTTGG A M E M E K T N H A A L G Q Q L K R I W P W K W R K P T M Q L L A S N - K E F G S H G N G E N Q P C S S W P A T K K N L . . . . . . 898 TTAGTAATCTCTTTGCTTGGTGTGTTAATTCCTTAGAATGCCTTTGTTAATTAGCCATTA L V I S L L G V L I P - N A F V N - P L - - S L C L V C - F L R M P L L I S H - V S N L F A W C V N S L E C L C - L A I . . . . . . 838 ATGTTAAGAATGGGGCCTGACCAGTATCTTAGGATGTTTGTTTTAGTTATTGAATATGAT M L R M G P D Q Y L R M F V L V I E Y D C - E W G L T S I L G C L F - L L N M I N V K N G A - P V S - D V C F S Y - I - . . . . . . 778 AAGTATGAACGGAAACCATAACCGGATTATTAGTGGTGTCGTGTTAGTGCTTGGGCTGTT K Y E R K P - P D Y - W C R V S A W A V S M N G N H N R I I S G V V L V L G L F - V - T E T I T G L L V V S C - C L G C . . . . . . 718 TTGATTAAAGCAAACTGCGAGAAATTCTGTTTTGGCGTTATGTATATGTTAAATGTTATT L I K A N C E K F C F G V M Y M L N V I - L K Q T A R N S V L A L C I C - M L L F D - S K L R E I L F W R Y V Y V K C Y . . . . . . 658 ATGGGTATATACTCTAAAGGATGAATACGATAAGGTAGATGTGTTGGGAATTATAAAAGG M G I Y S K G - I R - G R C V G N Y K R W V Y T L K D E Y D K V D V L G I I K G Y G Y I L - R M N T I R - M C W E L - K . . . . . . 598 AGTTATCGCTCAGTGTGTTGTTGCTTCATTACTATGGTTGCCGAGACGGAACTGTTTTGG S Y R S V C C C F I T M V A E T E L F W V I A Q C V V A S L L W L P R R N C F G E L S L S V L L L H Y Y G C R D G T V L . . . . . . 538 GGGAGGGGGCTGTTTAATATGATTCATTGGGTTATATGTGTTGTTGGTATTACTGTGGAT G R G L F N M I H W V I C V V G I T V D G G G C L I - F I G L Y V L L V L L W I G E G A V - Y D S L G Y M C C W Y Y C G . . . . . . 478 AATTTGGGTTGTTGTCGGATTGGGACGAAGTAAGGAAATTAGGGGAAGTGCTACCGGATT N L G C C R I G T K - G N - G K C Y R I I W V V V G L G R S K E I R G S A T G F - F G L L S D W D E V R K L G E V L P D . . . . . . 418 TTTGTTAGATTATTAACTAGCTTACAATAGGTAGTAAAGCGCGACATTTATCTAATTGCG F V R L L T S L Q - V V K R D I Y L I A L L D Y - L A Y N R - - S A T F I - L R F C - I I N - L T I G S K A R H L S N C . . . . . . 358 GCACGATTTATGCTTGTTATAGATTAATAGCTTGAGAAGTAAATATTGGACGTGCGACTC A R F M L V I D - - L E K - I L D V R L H D L C L L - I N S L R S K Y W T C D S G T I Y A C Y R L I A - E V N I G R A T . . . . . . 298 GACTATACGGTATGTAACGCTACCTCTTCTTTCTTTGTTTGGCATGACTTT D Y T V C N A T S S F F V W H D F T I R Y V T L P L L S L F G M T R L Y G M - R Y L F F L C L A - L Maximal non-overlapping open reading frames (>= 64 codons): none ... finished at: Mon Jul 24 23:13:40 2006 ________________________________________________________________________________ Sequence 5: C06HBa0057J04.1-5, from 1 to 1024, both strands analyzed. ... started at: Mon Jul 24 23:13:40 2006 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 4 HitsTableSize = 0 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 0 No significant EST matches were found. ... finished at: Mon Jul 24 23:13:49 2006 ________________________________________________________________________________ Sequence 6: C06HBa0057J04.1-6, from 1 to 1358, both strands analyzed. ... started at: Mon Jul 24 23:13:49 2006 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 0 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 6 HitsTableSize = 0 No significant EST matches were found. ... finished at: Mon Jul 24 23:13:59 2006 ________________________________________________________________________________ Sequence 7: C06HBa0057J04.1-7, from 1 to 2064, both strands analyzed. ... started at: Mon Jul 24 23:13:59 2006 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 0 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 0 No significant EST matches were found. ... finished at: Mon Jul 24 23:14:09 2006 ________________________________________________________________________________ Sequence 8: C06HBa0057J04.1-8, from 1 to 2793, both strands analyzed. ... started at: Mon Jul 24 23:14:09 2006 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 2 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 8 ******************************************************************************** EST sequence 2 -strand 843 n (File: SGN-E544254-) 1 GAGTCATTTA TCATTTCACC GAGTCCCGGG CCGGGTAATG TTCGTGCGGA GTTTCTTGCA 61 TATGTCACCG AGTCCCTCAC TAGAGGGCCG GGAATGTATA TTATATATAT GATTGGTGAT 121 GAGGATGGTT ATGATGATGA TGATGACGGA GATGATGTGA TGACTATTTC ACCGAGTCCC 181 TCACTAGAGG GCCGGGTACT ATGATGTATA TATAATGATG ATTATTTTGC CGAGTCCCTT 241 ACTAGGGAAG TTAGGCATCT TATATGTTAA AGATATGCAT GATTTTCACT TAAAAAGTAC 301 ATGTGTAGAG ATATCTTGTT TCGACTTATC ATGTTGGTAT CCTGTCATCT TTACCTTATG 361 CTTTACATAC TCAGTACATT GTCCGTACTG ACCCCCTTTT CTCGGGGGGC TGCGTTTCAT 421 GCCCGCAGGT GTAGACGCTC AGTTCGGTGA TCCTCCCGCC TAGGATATCT ACTCTGCTTT 481 TTGGGAGAGC TCCACTGTTC CGGAGCCCAG TCGTTTTGGT ACATAACTTC TTATGTAGTC 541 TTTTGCTTGT CTATGGGTAT GGCGGGGCCC TGTCCCGTCA AGTTTCACTA CTATACTCTT 601 AGAGGTCTGT AGACATCGTG TGGGTTGTAT AATTATGTTT TGGATAATGG TCTGGACATG 661 GTTTGTTTGG GATGTCCATT TGTACAAGTG CAGCCTTGTC GGTTGTGAAC ATCATTGTGT 721 ATTGTGTAGT GGCAGCCTCG TCGGCTGCGT ATGCTATTAT GTTTTGGATA GTGGCGGCCT 781 TGTCGGCTCG CATATGTTGT TACGATTTAA TGGTTATGAC TCTTTATGAG AAAAAAAAAG 841 AAA Predicted gene structure (within gDNA segment 2185 to 1): Exon 1 1037 1015 ( 23 n); cDNA 133 155 ( 23 n); score: 0.652 Intron 1 1014 280 ( 735 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.74) Exon 2 279 219 ( 61 n); cDNA 156 215 ( 60 n); score: 0.738 Intron 2 218 174 ( 45 n); Pd: 0.000 (s: 0.80), Pa: 0.000 (s: 0.94) Exon 3 173 1 ( 173 n); cDNA 216 387 ( 172 n); score: 0.896 PPA cDNA 829 839 MATCH C06HBa0057J04.1-8- SGN-E544254- 0.855 257 0.305 C PGS_C06HBa0057J04.1-8-_SGN-E544254- (1037 1015,279 219,173 1) Alignment (genomic DNA sequence = upper lines): GATAACTAAG ATAAGGTAGA TGTGTTGCGA ATTATAAAGT GAGTTATCGC TCGGTGTGTC 978 ||| | | | || | | ||| || GATGATGATG ATGACGGAGA TGA....... .......... .......... .......... 155 GTTGCTTCGT TACTATGGTT GCCGAGACGA AACTGTTTTG GGGGAGGGGG AGGGGGCTGT 918 .......... .......... .......... .......... .......... .......... 155 TTAATATGAT TCGTTGGGTT ATATGTGTTA TTGGTATTGC TATGGATAAT TTGGGTTGTT 858 .......... .......... .......... .......... .......... .......... 155 GTCGGATTTG GACAAAGTAA GGAAAATAGG GGAAATGCTG CCGGATTTTT GTTAGATTAT 798 .......... .......... .......... .......... .......... .......... 155 TAGCTAGATT ATAATAAGTA GTAAAGCGCG ACGTTTATCT AATTGCGGCA CGATTGTTGC 738 .......... .......... .......... .......... .......... .......... 155 TTGTCATAGA ATAATAGCTT GAGCAGTAAA TATTTGACGT GCGAATCAAC TATACGGTAT 678 .......... .......... .......... .......... .......... .......... 155 GTAAGGCTAC CCCTTCTTTC TTTGTTTGGC ATGACTTTTA AAAATGAGTG AATAACGGAC 618 .......... .......... .......... .......... .......... .......... 155 AGGTTTGATA CTTACTTCTA GAGCGTCTAG GTGACATATA TTCTTACTTC CACAACTATT 558 .......... .......... .......... .......... .......... .......... 155 CCTCTATATA TCGGCTATGT CTAAGTCTTA ATGATTTCTC ATATCTATGG TAGTACTTCT 498 .......... .......... .......... .......... .......... .......... 155 AAGAGTCATT GAGATTTTAC GTTTCCATAT CGTATTAAAG GATCGTAATC TTGATAAAAC 438 .......... .......... .......... .......... .......... .......... 155 GTTAATCTTT TGTAATACTC CTTGCTGGTT CATGTTGATT GTTCTATTGA GTTATAAGAA 378 .......... .......... .......... .......... .......... .......... 155 ATGATTTTAA TTGCATATTG TTGCTCATAA TATTTTGCTC GTGCATAGAC TAATTTATCA 318 .......... .......... .......... .......... .......... .......... 155 TTTCACCGAG TTTCGGGTCG GGTAATGTTC GTGCGGAGTT TCTTGCATTT GTCACCGAGT 258 | | || | | ||||||||| .......... .......... .......... ........TG TGATGACTAT TTCACCGAGT 177 CACTCACTAG AGGGTCGGGT ATGTATATTA TACATATTAT TGGTGATGAG GATGGTTATG 198 | |||||||| |||| ||||| | ||| | || |||| | CCCTCACTAG AGGGCCGGGT A-CTATGATG TATATATAA. .......... .......... 215 ATGATGATGA TGACGGAGAT GATGTGATGA TTATTTTGCC GAGCCCCTTA CTAGGGAAGT 138 |||||| |||||||||| ||| |||||| |||||||||| .......... .......... ....TGATGA TTATTTTGCC GAGTCCCTTA CTAGGGAAGT 251 TGGGCACCTT ATATGTTAAA GATATGCACG ATTTTCACTT AAAAGGGTAT ATGTGTAGCG 78 | |||| ||| |||||||||| |||||||| | |||||||||| |||| ||| |||||||| | TAGGCATCTT ATATGTTAAA GATATGCATG ATTTTCACTT AAAA-AGTAC ATGTGTAGAG 310 ATATTTTGTT TCAACTTACC ATATATGTAT CCTATCATGT TGACCTTATG CTTTACATAC 18 |||| ||||| || ||||| | || | |||| ||| |||| | | |||||||| |||||||||| ATATCTTGTT TCGACTTATC ATGTTGGTAT CCTGTCATCT TTACCTTATG CTTTACATAC 370 TCAGTACATT GTTCGTA 1 |||||||||| || |||| TCAGTACATT GTCCGTA 387 hqPGS_C06HBa0057J04.1-8-_SGN-E544254- (279 219,173 1) ******************************************************************************** EST sequence 3 +strand 606 n (File: SGN-E538151+) 1 GAGAAAACCA GCCATTAAAC TCTTGGACAG CAGCTGCAAA GAAATTGAGT CATTTATCAT 61 TTCACCGAGT CCCGGGCCGG GTAATGTTCG TGCGGAGTTT CTTGCATATG TCACCGAGTC 121 CCTCACTAGA GGGCCGGGAA TGTATATTAT ATATATGATT GGTGATGAGG ATGGTTATGA 181 TGATGATGAT GACGGAGATG ATGTGATGAC TATTTCACTG AGTCCCTCAC TAGAGGGCCG 241 GGTGTAGACG CTCAGTTTGG TGATCCTCCC GCCTAGGATA TCTACTCTGC TGTTTGGGAG 301 AGCTCCACTG TTCCGGAGCC CAGTCGTTTT GGTACATAAC TTCTTATGTA GTCTTTTGCT 361 TGTCTATGGG TATGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC TTAGAGGTCT 421 GTAGACATCG TGTGGGTTGT ATAATTATGT TTTTGATAAT GGTCTGGACA TGGTTTGTTT 481 GGGATGTCCA CTTGTACAAG TGCAACCTTG TCGGTTGTGT ACATCTTTGT GTATTGTGTA 541 GTGGCAGCCT TGTCGGCTGC GTATGCTATT ATGCTTTGAA TAGTGGCAGC CTTGTCGGCT 601 CGCGTA Predicted gene structure (within gDNA segment 1435 to 1): Exon 1 1344 1298 ( 47 n); cDNA 1 47 ( 47 n); score: 0.872 Intron 1 1297 330 ( 968 n); Pd: 0.992 (s: 0.87), Pa: 0.881 (s: 0.90) Exon 2 329 144 ( 186 n); cDNA 48 233 ( 186 n); score: 0.909 MATCH C06HBa0057J04.1-8- SGN-E538151+ 0.909 233 0.384 C PGS_C06HBa0057J04.1-8-_SGN-E538151+ (1344 1298,329 144) Alignment (genomic DNA sequence = upper lines): GAGAAAACCA ACCATGCAAC TCTTGGCCAC CAGCTGCAAA GAATTTGGTT AGTAATCTCT 1285 |||||||||| |||| ||| |||||| || |||||||||| ||| ||| GAGAAAACCA GCCATTAAAC TCTTGGACAG CAGCTGCAAA GAAATTG... .......... 47 TTGCTTGGTG TGTTAATTCC TTAGAATGCC TTTGTTAATT AGACATTAAT GTTAAGAAGG 1225 .......... .......... .......... .......... .......... .......... 47 AGGCATGAAC AGTATCTTAG GAAGTTTGTT TTAGTTATTG AATGTGCTAA GTATGAACGG 1165 .......... .......... .......... .......... .......... .......... 47 AAACCATTAT CAAATTATTA GTGGTGTCGT GTTAGTGCTT GGGTTGTTTT GATTAAAGCA 1105 .......... .......... .......... .......... .......... .......... 47 AATTGCGGGA AATTCTATTT TGGCATTATG TATATGTTAA ATGTGATTAT AGGTATATTC 1045 .......... .......... .......... .......... .......... .......... 47 TCCAAAGGAT AACTAAGATA AGGTAGATGT GTTGCGAATT ATAAAGTGAG TTATCGCTCG 985 .......... .......... .......... .......... .......... .......... 47 GTGTGTCGTT GCTTCGTTAC TATGGTTGCC GAGACGAAAC TGTTTTGGGG GAGGGGGAGG 925 .......... .......... .......... .......... .......... .......... 47 GGGCTGTTTA ATATGATTCG TTGGGTTATA TGTGTTATTG GTATTGCTAT GGATAATTTG 865 .......... .......... .......... .......... .......... .......... 47 GGTTGTTGTC GGATTTGGAC AAAGTAAGGA AAATAGGGGA AATGCTGCCG GATTTTTGTT 805 .......... .......... .......... .......... .......... .......... 47 AGATTATTAG CTAGATTATA ATAAGTAGTA AAGCGCGACG TTTATCTAAT TGCGGCACGA 745 .......... .......... .......... .......... .......... .......... 47 TTGTTGCTTG TCATAGAATA ATAGCTTGAG CAGTAAATAT TTGACGTGCG AATCAACTAT 685 .......... .......... .......... .......... .......... .......... 47 ACGGTATGTA AGGCTACCCC TTCTTTCTTT GTTTGGCATG ACTTTTAAAA ATGAGTGAAT 625 .......... .......... .......... .......... .......... .......... 47 AACGGACAGG TTTGATACTT ACTTCTAGAG CGTCTAGGTG ACATATATTC TTACTTCCAC 565 .......... .......... .......... .......... .......... .......... 47 AACTATTCCT CTATATATCG GCTATGTCTA AGTCTTAATG ATTTCTCATA TCTATGGTAG 505 .......... .......... .......... .......... .......... .......... 47 TACTTCTAAG AGTCATTGAG ATTTTACGTT TCCATATCGT ATTAAAGGAT CGTAATCTTG 445 .......... .......... .......... .......... .......... .......... 47 ATAAAACGTT AATCTTTTGT AATACTCCTT GCTGGTTCAT GTTGATTGTT CTATTGAGTT 385 .......... .......... .......... .......... .......... .......... 47 ATAAGAAATG ATTTTAATTG CATATTGTTG CTCATAATAT TTTGCTCGTG CATAGACTAA 325 | | | .......... .......... .......... .......... .......... .....AGTCA 52 TTTATCATTT CACCGAGTTT CGGGTCGGGT AATGTTCGTG CGGAGTTTCT TGCATTTGTC 265 |||||||||| |||||||| |||| ||||| |||||||||| |||||||||| ||||| |||| TTTATCATTT CACCGAGTCC CGGGCCGGGT AATGTTCGTG CGGAGTTTCT TGCATATGTC 112 ACCGAGTCAC TCACTAGAGG GTCGGGTATG TATATTATAC ATATTATTGG TGATGAGGAT 205 |||||||| | |||||||||| | |||| ||| ||||||||| |||| ||||| |||||||||| ACCGAGTCCC TCACTAGAGG GCCGGGAATG TATATTATAT ATATGATTGG TGATGAGGAT 172 GGTTATGATG ATGATGATGA CGGAGATGAT GTGATGATTA TTTTGCCGAG CCCCTTACTA 145 |||||||||| |||||||||| |||||||||| ||||||| || ||| | ||| |||| |||| GGTTATGATG ATGATGATGA CGGAGATGAT GTGATGACTA TTTCACTGAG TCCCTCACTA 232 G 144 | G 233 hqPGS_C06HBa0057J04.1-8-_SGN-E538151+ (1344 1298,329 144) ******************************************************************************** EST sequence 4 +strand 644 n (File: SGN-E538156+) 1 GAGAAAACCA GCCATTAAAC TCTTGGACAG CAGCTGCAAA GAAATTGAGT CATTTATCAT 61 TTCACCGAGT CCCGGGCCGG GTAATGTTCG TGCGGAGTTT CTTGCATATG TCACCGAGTC 121 CCTCACTAGA GGGCCGGGAA TGTATATTAT ATATATGATT GGTGATGAGG ATGGTTATGA 181 TGATGATGAT GACGGAGATG ATGTGATGAC TATTTCACTG AGTCCCTCAC TAGAGGGCCG 241 GGTGTAGACG CTCAGTTTGG TGATCCTCCC GCCTAGGATA TCTACTCTGC TGTTTGGGAG 301 AGCTCCACTG TTCCGGAGCC CAGTCGTTGT GGTACATAAC TTCTTATGTA GTCTTTTGCT 361 TGTCTATGGG TATGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC TTAGAGGTCT 421 GTAGACATCG TGTGGGTTGT ATAATTATGT TTTTGATAAT GGTCTGGACA TGGTTTGTTT 481 GGGATGTCCA CTTGTACAAG TGCAACCTTG TCGGTTGTGT ACATCTTTGT GTATTGTGTA 541 GTGGCAGCCT TGACGGCTGC GTATGCTATT ATGCTTTGAA TAGTGGCAGC CTTGTCGGCT 601 CGCGTATGTT GTTACGGTTG AATGGGTATG ACTCTTTATG AGAT Predicted gene structure (within gDNA segment 1435 to 1): Exon 1 1344 1298 ( 47 n); cDNA 1 47 ( 47 n); score: 0.872 Intron 1 1297 330 ( 968 n); Pd: 0.992 (s: 0.87), Pa: 0.881 (s: 0.90) Exon 2 329 144 ( 186 n); cDNA 48 233 ( 186 n); score: 0.909 MATCH C06HBa0057J04.1-8- SGN-E538156+ 0.909 233 0.362 C PGS_C06HBa0057J04.1-8-_SGN-E538156+ (1344 1298,329 144) Alignment (genomic DNA sequence = upper lines): GAGAAAACCA ACCATGCAAC TCTTGGCCAC CAGCTGCAAA GAATTTGGTT AGTAATCTCT 1285 |||||||||| |||| ||| |||||| || |||||||||| ||| ||| GAGAAAACCA GCCATTAAAC TCTTGGACAG CAGCTGCAAA GAAATTG... .......... 47 TTGCTTGGTG TGTTAATTCC TTAGAATGCC TTTGTTAATT AGACATTAAT GTTAAGAAGG 1225 .......... .......... .......... .......... .......... .......... 47 AGGCATGAAC AGTATCTTAG GAAGTTTGTT TTAGTTATTG AATGTGCTAA GTATGAACGG 1165 .......... .......... .......... .......... .......... .......... 47 AAACCATTAT CAAATTATTA GTGGTGTCGT GTTAGTGCTT GGGTTGTTTT GATTAAAGCA 1105 .......... .......... .......... .......... .......... .......... 47 AATTGCGGGA AATTCTATTT TGGCATTATG TATATGTTAA ATGTGATTAT AGGTATATTC 1045 .......... .......... .......... .......... .......... .......... 47 TCCAAAGGAT AACTAAGATA AGGTAGATGT GTTGCGAATT ATAAAGTGAG TTATCGCTCG 985 .......... .......... .......... .......... .......... .......... 47 GTGTGTCGTT GCTTCGTTAC TATGGTTGCC GAGACGAAAC TGTTTTGGGG GAGGGGGAGG 925 .......... .......... .......... .......... .......... .......... 47 GGGCTGTTTA ATATGATTCG TTGGGTTATA TGTGTTATTG GTATTGCTAT GGATAATTTG 865 .......... .......... .......... .......... .......... .......... 47 GGTTGTTGTC GGATTTGGAC AAAGTAAGGA AAATAGGGGA AATGCTGCCG GATTTTTGTT 805 .......... .......... .......... .......... .......... .......... 47 AGATTATTAG CTAGATTATA ATAAGTAGTA AAGCGCGACG TTTATCTAAT TGCGGCACGA 745 .......... .......... .......... .......... .......... .......... 47 TTGTTGCTTG TCATAGAATA ATAGCTTGAG CAGTAAATAT TTGACGTGCG AATCAACTAT 685 .......... .......... .......... .......... .......... .......... 47 ACGGTATGTA AGGCTACCCC TTCTTTCTTT GTTTGGCATG ACTTTTAAAA ATGAGTGAAT 625 .......... .......... .......... .......... .......... .......... 47 AACGGACAGG TTTGATACTT ACTTCTAGAG CGTCTAGGTG ACATATATTC TTACTTCCAC 565 .......... .......... .......... .......... .......... .......... 47 AACTATTCCT CTATATATCG GCTATGTCTA AGTCTTAATG ATTTCTCATA TCTATGGTAG 505 .......... .......... .......... .......... .......... .......... 47 TACTTCTAAG AGTCATTGAG ATTTTACGTT TCCATATCGT ATTAAAGGAT CGTAATCTTG 445 .......... .......... .......... .......... .......... .......... 47 ATAAAACGTT AATCTTTTGT AATACTCCTT GCTGGTTCAT GTTGATTGTT CTATTGAGTT 385 .......... .......... .......... .......... .......... .......... 47 ATAAGAAATG ATTTTAATTG CATATTGTTG CTCATAATAT TTTGCTCGTG CATAGACTAA 325 | | | .......... .......... .......... .......... .......... .....AGTCA 52 TTTATCATTT CACCGAGTTT CGGGTCGGGT AATGTTCGTG CGGAGTTTCT TGCATTTGTC 265 |||||||||| |||||||| |||| ||||| |||||||||| |||||||||| ||||| |||| TTTATCATTT CACCGAGTCC CGGGCCGGGT AATGTTCGTG CGGAGTTTCT TGCATATGTC 112 ACCGAGTCAC TCACTAGAGG GTCGGGTATG TATATTATAC ATATTATTGG TGATGAGGAT 205 |||||||| | |||||||||| | |||| ||| ||||||||| |||| ||||| |||||||||| ACCGAGTCCC TCACTAGAGG GCCGGGAATG TATATTATAT ATATGATTGG TGATGAGGAT 172 GGTTATGATG ATGATGATGA CGGAGATGAT GTGATGATTA TTTTGCCGAG CCCCTTACTA 145 |||||||||| |||||||||| |||||||||| ||||||| || ||| | ||| |||| |||| GGTTATGATG ATGATGATGA CGGAGATGAT GTGATGACTA TTTCACTGAG TCCCTCACTA 232 G 144 | G 233 hqPGS_C06HBa0057J04.1-8-_SGN-E538156+ (1344 1298,329 144) ******************************************************************************** EST sequence 7 +strand 470 n (File: SGN-E268096+) 1 GAAAACCAGC CATTAAACTC TTGGACAGCA GCTGCAAAGA AATTGAGTCA TTTATCATTG 61 CACCGAGTCC CGGGCCGGGT AATGTTCGTG CGGAGTTTCT TGCATATGTC ACCGAGTCCC 121 TCACTAGAGG GCCGGGAATG TATATTATAT ATATGATTGG TGATGAGGAT GGTTATGATG 181 ATGATGATGA CGGAGATGAT GTGATGACTA TTTCACTGAG TCCCTCACTA GAGGGCCGGG 241 TGTAGACGCT CAGTTTGGTG ATCCTCCCGC CTAGGATATC TACTCTGCTG TTTGGGAGAG 301 CTCCACTGTT CCGGAGCCCA GTCGTTTTGG TACATAACTT CTTATGTAGT CTTTTGCTTG 361 TCTATGGGTA TGCGGGGCCC TGTCCCGTCA AGTTTCACTA CTATACTCTT AGAGGTCTGT 421 AGACATCGTG TGGGTAGTAT AATTATGTTT TTGATAATGG GCTGGACATG Predicted gene structure (within gDNA segment 1649 to 1): Exon 1 1342 1298 ( 45 n); cDNA 1 45 ( 45 n); score: 0.867 Intron 1 1297 330 ( 968 n); Pd: 0.992 (s: 0.87), Pa: 0.881 (s: 0.88) Exon 2 329 144 ( 186 n); cDNA 46 231 ( 186 n); score: 0.903 MATCH C06HBa0057J04.1-8- SGN-E268096+ 0.903 231 0.491 C PGS_C06HBa0057J04.1-8-_SGN-E268096+ (1342 1298,329 144) Alignment (genomic DNA sequence = upper lines): GAAAACCAAC CATGCAACTC TTGGCCACCA GCTGCAAAGA ATTTGGTTAG TAATCTCTTT 1283 |||||||| | ||| ||||| |||| || || |||||||||| | ||| GAAAACCAGC CATTAAACTC TTGGACAGCA GCTGCAAAGA AATTG..... .......... 45 GCTTGGTGTG TTAATTCCTT AGAATGCCTT TGTTAATTAG ACATTAATGT TAAGAAGGAG 1223 .......... .......... .......... .......... .......... .......... 45 GCATGAACAG TATCTTAGGA AGTTTGTTTT AGTTATTGAA TGTGCTAAGT ATGAACGGAA 1163 .......... .......... .......... .......... .......... .......... 45 ACCATTATCA AATTATTAGT GGTGTCGTGT TAGTGCTTGG GTTGTTTTGA TTAAAGCAAA 1103 .......... .......... .......... .......... .......... .......... 45 TTGCGGGAAA TTCTATTTTG GCATTATGTA TATGTTAAAT GTGATTATAG GTATATTCTC 1043 .......... .......... .......... .......... .......... .......... 45 CAAAGGATAA CTAAGATAAG GTAGATGTGT TGCGAATTAT AAAGTGAGTT ATCGCTCGGT 983 .......... .......... .......... .......... .......... .......... 45 GTGTCGTTGC TTCGTTACTA TGGTTGCCGA GACGAAACTG TTTTGGGGGA GGGGGAGGGG 923 .......... .......... .......... .......... .......... .......... 45 GCTGTTTAAT ATGATTCGTT GGGTTATATG TGTTATTGGT ATTGCTATGG ATAATTTGGG 863 .......... .......... .......... .......... .......... .......... 45 TTGTTGTCGG ATTTGGACAA AGTAAGGAAA ATAGGGGAAA TGCTGCCGGA TTTTTGTTAG 803 .......... .......... .......... .......... .......... .......... 45 ATTATTAGCT AGATTATAAT AAGTAGTAAA GCGCGACGTT TATCTAATTG CGGCACGATT 743 .......... .......... .......... .......... .......... .......... 45 GTTGCTTGTC ATAGAATAAT AGCTTGAGCA GTAAATATTT GACGTGCGAA TCAACTATAC 683 .......... .......... .......... .......... .......... .......... 45 GGTATGTAAG GCTACCCCTT CTTTCTTTGT TTGGCATGAC TTTTAAAAAT GAGTGAATAA 623 .......... .......... .......... .......... .......... .......... 45 CGGACAGGTT TGATACTTAC TTCTAGAGCG TCTAGGTGAC ATATATTCTT ACTTCCACAA 563 .......... .......... .......... .......... .......... .......... 45 CTATTCCTCT ATATATCGGC TATGTCTAAG TCTTAATGAT TTCTCATATC TATGGTAGTA 503 .......... .......... .......... .......... .......... .......... 45 CTTCTAAGAG TCATTGAGAT TTTACGTTTC CATATCGTAT TAAAGGATCG TAATCTTGAT 443 .......... .......... .......... .......... .......... .......... 45 AAAACGTTAA TCTTTTGTAA TACTCCTTGC TGGTTCATGT TGATTGTTCT ATTGAGTTAT 383 .......... .......... .......... .......... .......... .......... 45 AAGAAATGAT TTTAATTGCA TATTGTTGCT CATAATATTT TGCTCGTGCA TAGACTAATT 323 | | ||| .......... .......... .......... .......... .......... ...AGTCATT 52 TATCATTTCA CCGAGTTTCG GGTCGGGTAA TGTTCGTGCG GAGTTTCTTG CATTTGTCAC 263 ||||||| || |||||| || || ||||||| |||||||||| |||||||||| ||| |||||| TATCATTGCA CCGAGTCCCG GGCCGGGTAA TGTTCGTGCG GAGTTTCTTG CATATGTCAC 112 CGAGTCACTC ACTAGAGGGT CGGGTATGTA TATTATACAT ATTATTGGTG ATGAGGATGG 203 |||||| ||| ||||||||| |||| ||||| ||||||| || || ||||||| |||||||||| CGAGTCCCTC ACTAGAGGGC CGGGAATGTA TATTATATAT ATGATTGGTG ATGAGGATGG 172 TTATGATGAT GATGATGACG GAGATGATGT GATGATTATT TTGCCGAGCC CCTTACTAG 144 |||||||||| |||||||||| |||||||||| ||||| |||| | | ||| | ||| ||||| TTATGATGAT GATGATGACG GAGATGATGT GATGACTATT TCACTGAGTC CCTCACTAG 231 hqPGS_C06HBa0057J04.1-8-_SGN-E268096+ (1342 1298,329 144) ******************************************************************************** EST sequence 1 -strand 586 n (File: SGN-E543103-) 1 GGCAGCCATG GAAATGGAGA AAATCAACCA TGCAACTCTT GACCAGCAGC TGCAAAGAAT 61 TTGGTTTGAT TAATAGCTTG AGCAGTAAAT ATTGGACGTG CGGCTCGACT ATACGGTGTA 121 GACGCTCAGT TCGGTGATCC TCCCGCCTAG GATATCTACT TTGCTGAGTG GGAGAGCTCC 181 ACTGTTTCGT AGCCCAGTCA TTTTGGTACA TAACTTTTTG TGTAGTCTTT TGCTTGTCTA 241 TGGGTATGGT GGGGCCCTGT CCCGTCGAGT TTCACTACTA TACTCTTAGA GGTCCATAGA 301 CATCGCGTGG GTTGTATATA TATGTTTTGG ATAATGGTCT GGACATGGTT TGTTTGGGAT 361 GTCCACTTGT ACAAGGGCAG CCTTGTCAGC TGCGTACATC TTTGTGTATT GTGTAGTGGC 421 AGCCTTGTCG GCTGCGTATG CTATTATGCT TTGGATAGTG GCGGCCTTGT CGGCTCGCGT 481 ATGTTGTTAC GGTTGAATGG TTATGACTCC TTATGAGACA GATCCACTTT ATATATATAT 541 ATATGGCGTT GGGGTTGGCG TGATTGATAA AAAAAAGGGG GGCCCG Predicted gene structure (within gDNA segment 1970 to 1): Exon 1 1360 1298 ( 63 n); cDNA 1 63 ( 63 n); score: 0.952 Intron 1 1297 734 ( 564 n); Pd: 0.992 (s: 0.94), Pa: 0.000 (s: 0.84) Exon 2 733 682 ( 52 n); cDNA 64 115 ( 52 n); score: 0.846 MATCH C06HBa0057J04.1-8- SGN-E543103- 0.904 115 0.196 C PGS_C06HBa0057J04.1-8-_SGN-E543103- (1360 1298,733 682) Alignment (genomic DNA sequence = upper lines): GGCAGCCATG GAAATGGAGA AAACCAACCA TGCAACTCTT GGCCACCAGC TGCAAAGAAT 1301 |||||||||| |||||||||| ||| |||||| |||||||||| | ||| |||| |||||||||| GGCAGCCATG GAAATGGAGA AAATCAACCA TGCAACTCTT GACCAGCAGC TGCAAAGAAT 60 TTGGTTAGTA ATCTCTTTGC TTGGTGTGTT AATTCCTTAG AATGCCTTTG TTAATTAGAC 1241 ||| TTG....... .......... .......... .......... .......... .......... 63 ATTAATGTTA AGAAGGAGGC ATGAACAGTA TCTTAGGAAG TTTGTTTTAG TTATTGAATG 1181 .......... .......... .......... .......... .......... .......... 63 TGCTAAGTAT GAACGGAAAC CATTATCAAA TTATTAGTGG TGTCGTGTTA GTGCTTGGGT 1121 .......... .......... .......... .......... .......... .......... 63 TGTTTTGATT AAAGCAAATT GCGGGAAATT CTATTTTGGC ATTATGTATA TGTTAAATGT 1061 .......... .......... .......... .......... .......... .......... 63 GATTATAGGT ATATTCTCCA AAGGATAACT AAGATAAGGT AGATGTGTTG CGAATTATAA 1001 .......... .......... .......... .......... .......... .......... 63 AGTGAGTTAT CGCTCGGTGT GTCGTTGCTT CGTTACTATG GTTGCCGAGA CGAAACTGTT 941 .......... .......... .......... .......... .......... .......... 63 TTGGGGGAGG GGGAGGGGGC TGTTTAATAT GATTCGTTGG GTTATATGTG TTATTGGTAT 881 .......... .......... .......... .......... .......... .......... 63 TGCTATGGAT AATTTGGGTT GTTGTCGGAT TTGGACAAAG TAAGGAAAAT AGGGGAAATG 821 .......... .......... .......... .......... .......... .......... 63 CTGCCGGATT TTTGTTAGAT TATTAGCTAG ATTATAATAA GTAGTAAAGC GCGACGTTTA 761 .......... .......... .......... .......... .......... .......... 63 TCTAATTGCG GCACGATTGT TGCTTGTCAT AGAATAATAG CTTGAGCAGT AAATATTTGA 701 | || |||||| |||||||||| ||||||| || .......... .......... .......GTT TGATTAATAG CTTGAGCAGT AAATATTGGA 96 CGTGCGAATC AACTATACG 682 |||||| || |||||||| CGTGCGGCTC GACTATACG 115 hqPGS_C06HBa0057J04.1-8-_SGN-E543103- (1360 1298,733 682) ******************************************************************************** EST sequence 8 +strand 577 n (File: SGN-E543104+) 1 GGCAGCCATG GAAATGGAGA AAATCAACCA TGCAACTCTT GACCAGCAGC TGCAAAGAAT 61 TTGGTTTGAT TAATAGCTTG AGCAGTAAAT ATTGGACGTG CGGCTCGACT ATACGGTGTA 121 GACGCTCAGT TCGGTGATCC TCCCGCCTAG GATATCTACT TTGCTGAGTG GGAGAGCTCC 181 ACTGTTTCGT AGCCCAGTCA TTTTGGTACA TAACTTTTTG TGTAGTCTTT TGCTTGTCTA 241 TGGGTATGGT GGGGCCCTGT CCCGTCGAGT TTCACTACTA TACTCTTAGA GGTCCATAGA 301 CATCGCGTGG GTTGTATATA TATGTTTTGG ATAATGGTCT GGACATGGTT TGTTTGGGAT 361 GTCCACTTGT ACAAGGGCAG CCTTGTCAGC TGCGTACATC TTTGTGTATT GTGTAGTGGC 421 AGCCTTGTCG GCTGCGTATG CTATTATGCT TTGGATAGTG GCGGCCTTGT CGGCTCGCGT 481 ATGTTGTTAC GGTTGAATGG TTATGACTCC TTATGAGACA GATCCACTTT ATATATATAT 541 ATATGGCGAT GGGGTTGGCT TGATTTGATT AAAAAAA Predicted gene structure (within gDNA segment 1960 to 1): Exon 1 1360 1298 ( 63 n); cDNA 1 63 ( 63 n); score: 0.952 Intron 1 1297 734 ( 564 n); Pd: 0.992 (s: 0.94), Pa: 0.000 (s: 0.84) Exon 2 733 682 ( 52 n); cDNA 64 115 ( 52 n); score: 0.846 MATCH C06HBa0057J04.1-8- SGN-E543104+ 0.904 115 0.199 C PGS_C06HBa0057J04.1-8-_SGN-E543104+ (1360 1298,733 682) Alignment (genomic DNA sequence = upper lines): GGCAGCCATG GAAATGGAGA AAACCAACCA TGCAACTCTT GGCCACCAGC TGCAAAGAAT 1301 |||||||||| |||||||||| ||| |||||| |||||||||| | ||| |||| |||||||||| GGCAGCCATG GAAATGGAGA AAATCAACCA TGCAACTCTT GACCAGCAGC TGCAAAGAAT 60 TTGGTTAGTA ATCTCTTTGC TTGGTGTGTT AATTCCTTAG AATGCCTTTG TTAATTAGAC 1241 ||| TTG....... .......... .......... .......... .......... .......... 63 ATTAATGTTA AGAAGGAGGC ATGAACAGTA TCTTAGGAAG TTTGTTTTAG TTATTGAATG 1181 .......... .......... .......... .......... .......... .......... 63 TGCTAAGTAT GAACGGAAAC CATTATCAAA TTATTAGTGG TGTCGTGTTA GTGCTTGGGT 1121 .......... .......... .......... .......... .......... .......... 63 TGTTTTGATT AAAGCAAATT GCGGGAAATT CTATTTTGGC ATTATGTATA TGTTAAATGT 1061 .......... .......... .......... .......... .......... .......... 63 GATTATAGGT ATATTCTCCA AAGGATAACT AAGATAAGGT AGATGTGTTG CGAATTATAA 1001 .......... .......... .......... .......... .......... .......... 63 AGTGAGTTAT CGCTCGGTGT GTCGTTGCTT CGTTACTATG GTTGCCGAGA CGAAACTGTT 941 .......... .......... .......... .......... .......... .......... 63 TTGGGGGAGG GGGAGGGGGC TGTTTAATAT GATTCGTTGG GTTATATGTG TTATTGGTAT 881 .......... .......... .......... .......... .......... .......... 63 TGCTATGGAT AATTTGGGTT GTTGTCGGAT TTGGACAAAG TAAGGAAAAT AGGGGAAATG 821 .......... .......... .......... .......... .......... .......... 63 CTGCCGGATT TTTGTTAGAT TATTAGCTAG ATTATAATAA GTAGTAAAGC GCGACGTTTA 761 .......... .......... .......... .......... .......... .......... 63 TCTAATTGCG GCACGATTGT TGCTTGTCAT AGAATAATAG CTTGAGCAGT AAATATTTGA 701 | || |||||| |||||||||| ||||||| || .......... .......... .......GTT TGATTAATAG CTTGAGCAGT AAATATTGGA 96 CGTGCGAATC AACTATACG 682 |||||| || |||||||| CGTGCGGCTC GACTATACG 115 hqPGS_C06HBa0057J04.1-8-_SGN-E543104+ (1360 1298,733 682) ******************************************************************************** EST sequence 6 +strand 599 n (File: SGN-E307342+) 1 GACGAGGGAC GAATGTTCCT AAGGGGGGAA GGATGTTACG CCTCGTATTT TTATACGTCG 61 TGCGCGTCAT GAACTAGTAT ATGTAAGTTC GGGAAATGAG ATTTTATTTT AAGTTCCAAG 121 TGTTTAAAGA AATATTATGC ATGGATGTTA ATTCCATATG TGATATTAAT TAGTGTGGGA 181 TTAATTAGGG GCTGATTTGG ATTTAATTTA TCGAGTGGGC CCCACCACTC AAGGCAAAGT 241 AAGAATTCAG ATTTTGGGAG ATAGCCTTAG TGGAGAATGT GTAGTGGGGG GCTCCTCCAC 301 TTCATATTGC TAAATATTAA AAAGAAAATC TGATTTGGGG ATAGAATTAG TGGAGCATTT 361 GTAGTGGTGG GCTGCCTCCA CTTCATATTG TCAAATGGTG TAGTGAAATA CTTTGCAACA 421 TATCATATCT TTCACCACAT GACTTGGGCA GCCATGGAAA TGGAGAAAAC CAGCCATTAA 481 ACTCTTGGAC AGCAGCTGCA AAGAAATTGG TTAGTAATCT CTTTGCTTGG TTTGTTAATT 541 CCTTAGAATA CCTTTGTTAA TTAGACATTT ATGTTAAGAA AGGGGACGTG AACAGTATC Predicted gene structure (within gDNA segment 2793 to 328): Exon 1 1725 1590 ( 136 n); cDNA 34 169 ( 136 n); score: 0.919 Intron 1 1589 1557 ( 33 n); Pd: 0.000 (s: 0.90), Pa: 0.000 (s: 0.86) Exon 2 1556 1411 ( 146 n); cDNA 170 315 ( 146 n); score: 0.791 MATCH C06HBa0057J04.1-8- SGN-E307342+ 0.853 282 0.471 C PGS_C06HBa0057J04.1-8-_SGN-E307342+ (1725 1590,1556 1411) Alignment (genomic DNA sequence = upper lines): TGTTATGCCT CGTATTTTTA TACGTAGTGC GCATCATGAA CTAGTAGATG TAAGTCCGGG 1666 ||||| |||| |||||||||| ||||| |||| || ||||||| |||||| ||| ||||| |||| TGTTACGCCT CGTATTTTTA TACGTCGTGC GCGTCATGAA CTAGTATATG TAAGTTCGGG 93 GAATGAGATT TTATTTTAAG TTCCAAGTGA TTAAAGAAAT ATTAGGCAAG GATGTTAATT 1606 ||||||||| |||||||||| ||||||||| |||||||||| |||| ||| | |||||||||| AAATGAGATT TTATTTTAAG TTCCAAGTGT TTAAAGAAAT ATTATGCATG GATGTTAATT 153 CCAAATGTGT TATTAAGTAT GATTTGGATA ATGTAAGTGC CATTAACTTT AAGTGAGGGA 1546 ||| ||||| |||||| | |||| |||| CCATATGTGA TATTAA.... .......... .......... .........T TAGTGTGGGA 180 TTAATTAGTG GCTGATTAGG ATTAAATTAA TCTAGTGGGC CCCACCACTC AAGGAAAATT 1486 |||||||| | ||||||| || ||| |||| | || ||||||| |||||||||| |||| ||| | TTAATTAGGG GCTGATTTGG ATTTAATTTA TCGAGTGGGC CCCACCACTC AAGGCAAAGT 240 AAAAGGGATC AGATTGGGAG CTGGCCTTAG TGGGACACGT GTAGAGGGAG GCTGCCTCCA 1426 || | | ||||||| | ||||||| ||| | || |||| ||| | ||| |||||| AAGAATTCAG ATTTTGGGAG ATAGCCTTAG TGGAGAATGT GTAGTGGGGG GCT-CCTCCA 299 CTTCATATT- ATAAAT 1411 ||||||||| ||||| CTTCATATTG CTAAAT 315 hqPGS_C06HBa0057J04.1-8-_SGN-E307342+ (1725 1590,1556 1411) ******************************************************************************** EST sequence 5 +strand 691 n (File: SGN-E328093+) 1 GAAAACCAGC CATTAAACTC TTGGACAGCA GCTGCAAAGA AATTGGTTAG TAATCTCTTT 61 GCTTGGTTTG TTAATTCCTT AGAATACCTT TGTTAATTAG ACATTTATGT TAAGAAGGGG 121 GACGTGAACA GTATCTTAGG AATTTGTTTT AGTTATTGAA TGTGCTAAGG ATGAGCAGAA 181 ACCATGATCG GATTGCTAGC GGTGTTATAT TTGTGTTGGG CTGTTTTGAT TAAAGTAAGC 241 TGCTGGAAAT TCTGTTTTGG TGTTATGCAT ATGTTAATAT GATTATGGGT ATATACTCCA 301 AAGGATGAAT ACAATAAGGT AGATGTGTTG CGAATTATAA AACGAATTAT CGGTCGGTGT 361 GTCGTTGTTT TGTTACTATG GTTGCTAAAA ACGGAACTGT TTTGGGGGAG GCTGTTTAAT 421 ATGATTTGTT GGATTATATG TGTTGTTGGT ATTGTTGTGG ATAATTTGGG TTGTTGTTGG 481 ATTGGGATGA AGTAAAGAAA ATAGGGGAAG TGCTGCCGGA TTTTCGTTAG ATTATTAGCT 541 AGCTTACATA AGTAGTAAGC GCGACATTTA TCTAATTGCG GCACGATTGG TGCTTGTTAT 601 AGATTTATAC CTTGAGCAGT AAATATTGGA CGTACGGCTC GACTATTCGG TATGTAACGC 661 TATCCTTTCC TTCTTTGTTT GGCATGACCT T Predicted gene structure (within gDNA segment 2320 to 1): Exon 1 1342 640 ( 703 n); cDNA 1 691 ( 691 n); score: 0.858 MATCH C06HBa0057J04.1-8- SGN-E328093+ 0.858 703 1.017 C PGS_C06HBa0057J04.1-8-_SGN-E328093+ (1342 640) Alignment (genomic DNA sequence = upper lines): GAAAACCAAC CATGCAACTC TTGGCCACCA GCTGCAAAGA ATTTGGTTAG TAATCTCTTT 1283 |||||||| | ||| ||||| |||| || || |||||||||| | |||||||| |||||||||| GAAAACCAGC CATTAAACTC TTGGACAGCA GCTGCAAAGA AATTGGTTAG TAATCTCTTT 60 GCTTGGTGTG TTAATTCCTT AGAATGCCTT TGTTAATTAG ACATTAATGT TAAGAAGGAG 1223 ||||||| || |||||||||| ||||| |||| |||||||||| ||||| |||| |||||||| | GCTTGGTTTG TTAATTCCTT AGAATACCTT TGTTAATTAG ACATTTATGT TAAGAAGGGG 120 G-CATGAACA GTATCTTAGG AAGTTTGTTT TAGTTATTGA ATGTGCTAAG TATGAACGGA 1164 | | |||||| |||||||||| || ||||||| |||||||||| |||||||||| |||| | || GACGTGAACA GTATCTTAGG AA-TTTGTTT TAGTTATTGA ATGTGCTAAG GATGAGCAGA 179 AACCATTATC AAATTATTAG TGGTGTCGTG TTAGTGCTTG GGTTGTTTTG ATTAAAGCAA 1104 |||||| ||| ||| ||| ||||| | || ||| ||| || ||||||| ||||||| || AACCATGATC GGATTGCTAG CGGTGTTATA TTTGTG-TTG GGCTGTTTTG ATTAAAGTAA 238 ATTGCGGGAA ATTCTATTTT GGCATTATGT ATATGTTAAA TGTGATTATA GGTATATTCT 1044 ||| |||| ||||| |||| || ||||| ||||||| || | ||||||| ||||||| || GCTGCTGGAA ATTCTGTTTT GGTGTTATGC ATATGTT-AA TATGATTATG GGTATATACT 297 CCAAAGGATA ACTAAGATAA GGTAGATGTG TTGCGAATTA TAAAGTGAGT TATCGCTCGG 984 ||||||||| | || |||| |||||||||| |||||||||| |||| || | ||||| |||| CCAAAGGATG AATACAATAA GGTAGATGTG TTGCGAATTA TAAAACGAAT TATCGGTCGG 357 TGTGTCGTTG CTTCGTTACT ATGGTTGC-C GAGACGAAAC TGTTTTGGGG GAGGGGGAGG 925 |||||||||| || |||||| |||||||| | ||| ||| |||||| |||||| TGTGTCGTTG TTTTGTTACT ATGGTTGCTA AAAACGGAAC TGTTTT---- --GGGGGA-- 409 GGGCTGTTTA ATATGATTCG TTGGGTTATA TGTGTTATTG GTATTGCTAT GGATAATTTG 865 ||||||||| |||||||| | |||| ||||| |||||| ||| |||||| | | |||||||||| -GGCTGTTTA ATATGATTTG TTGGATTATA TGTGTTGTTG GTATTGTTGT GGATAATTTG 468 GGTTGTTGTC GGATTTGGAC AAAGTAAGGA AAATAGGGGA AATGCTGCCG GATTTTTGTT 805 ||||||||| ||||| ||| |||||| || |||||||||| | |||||||| |||||| ||| GGTTGTTGTT GGATTGGGAT GAAGTAAAGA AAATAGGGGA AGTGCTGCCG GATTTTCGTT 528 AGATTATTAG CTAGATTATA ATAAGTAGTA AAGCGCGACG TTTATCTAAT TGCGGCACGA 745 |||||||||| |||| ||| ||||||||| ||||||||| |||||||||| |||||||||| AGATTATTAG CTAGCTTA-C ATAAGTAGT- AAGCGCGACA TTTATCTAAT TGCGGCACGA 586 TTGTTGCTTG TCATAGAATA ATAGCTTGAG CAGTAAATAT TTGACGTGCG AATCAACTAT 685 ||| |||||| | ||||| | ||| |||||| |||||||||| | ||||| || || ||||| TTGGTGCTTG TTATAGATTT ATACCTTGAG CAGTAAATAT TGGACGTACG GCTCGACTAT 646 ACGGTATGTA AGGCTACCCC TTCTTTCTTT GTTTGGCATG ACTTT 640 ||||||||| | |||| || ||| |||||| |||||||||| || || TCGGTATGTA ACGCTATCCT TTCCTTCTTT GTTTGGCATG ACCTT 691 hqPGS_C06HBa0057J04.1-8-_SGN-E328093+ (1342 640) ******************************************************************************** EST sequence 10 +strand 455 n (File: SGN-E298250+) 1 AAATGGAGAA ACCAACCCTG CAACTCTTGG CCAGTAGCTG CAAATAATTT GGGGAGTAAT 61 CTCCTTGTTT GGTGTGTTAA TTCTTTAGAA CACCCTTGTT AATTATCCAT TAATTTTAAG 121 AAGGGGGCGT GACCAGTTGC TTACGAAGTT TGTTTTAGTT ATTGAATGTG CTAAGTATGA 181 ATGGAAACCA TAATCGGATT ATTAGTGGTG TCGTGTTGGT GCTTGGGCTG TTTTGATTAA 241 AGCAAACTGC AGGAAAATTC TATTTTGGCA TTATGTATAT GCTGAATGTG ATTATGAGTA 301 TATACTCCAA CGGATGAATA CGATTAGGTA AATGTGTTGC GAATTATAAA ACGAGTTATC 361 GCTCGGTGTG TCGGTGCTTC GCTGCTATAG TTGCCCAGAC GGAACTGTTT TGGGGAGGGG 421 GCTGCCTAAT GTGATACTTC GGGTTATATG TGTTA Predicted gene structure (within gDNA segment 2793 to 1): Exon 1 1349 888 ( 462 n); cDNA 1 455 ( 455 n); score: 0.860 MATCH C06HBa0057J04.1-8- SGN-E298250+ 0.860 462 1.015 C PGS_C06HBa0057J04.1-8-_SGN-E298250+ (1349 888) Alignment (genomic DNA sequence = upper lines): AAATGGAGAA AACCAACCAT GCAACTCTTG GCCACCAGCT GCAAAGAATT TGGTTAGTAA 1290 |||||||| | |||||||| | |||||||||| |||| |||| ||||| |||| ||| ||||| AAATGGAG-A AACCAACCCT GCAACTCTTG GCCAGTAGCT GCAAATAATT TGGGGAGTAA 59 TCTCTTTGCT TGGTGTGTTA ATTCCTTAGA ATGCCTTTGT TAATTAGACA TTAATGTTAA 1230 |||| ||| | |||||||||| |||| ||||| | || |||| |||||| || ||||| |||| TCTCCTTGTT TGGTGTGTTA ATTCTTTAGA ACACCCTTGT TAATTATCCA TTAATTTTAA 119 GAAGGAGGCA TGAACAGTAT CTTAGGAAGT TTGTTTTAGT TATTGAATGT GCTAAGTATG 1170 ||||| ||| ||| |||| |||| ||||| |||||||||| |||||||||| |||||||||| GAAGGGGGCG TGACCAGTTG CTTACGAAGT TTGTTTTAGT TATTGAATGT GCTAAGTATG 179 AACGGAAACC ATTATCAAAT TATTAGTGGT GTCGTGTTAG TGCTTGGGTT GTTTTGATTA 1110 || ||||||| || ||| || |||||||||| |||||||| | |||||||| | |||||||||| AATGGAAACC ATAATCGGAT TATTAGTGGT GTCGTGTTGG TGCTTGGGCT GTTTTGATTA 239 AAGCAAATTG C-GGGAAATT CTATTTTGGC ATTATGTATA TGTTAAATGT GATTATAGGT 1051 ||||||| || | || ||||| |||||||||| |||||||||| || | ||||| |||||| || AAGCAAACTG CAGGAAAATT CTATTTTGGC ATTATGTATA TGCTGAATGT GATTATGAGT 299 ATATTCTCCA AAGGATAACT AAGATAAGGT AGATGTGTTG CGAATTATAA AGTGAGTTAT 991 |||| ||||| | |||| | | | ||| |||| | |||||||| |||||||||| | ||||||| ATATACTCCA ACGGATGAAT ACGATTAGGT AAATGTGTTG CGAATTATAA AACGAGTTAT 359 CGCTCGGTGT GTCGTTGCTT CGTTACTATG GTTGCCGAGA CGAAACTGTT TTGGGGGAGG 931 |||||||||| |||| ||||| || | |||| |||||| ||| || ||||||| || | CGCTCGGTGT GTCGGTGCTT CGCTGCTATA GTTGCCCAGA CGGAACTGTT TT-------G 412 GGGAGGGGGC TGTTTAATAT GATTCGTTGG GTTATATGTG TTA 888 |||||||||| || |||| | ||| | | || |||||||||| ||| GGGAGGGGGC TGCCTAATGT GATACTTCGG GTTATATGTG TTA 455 hqPGS_C06HBa0057J04.1-8-_SGN-E298250+ (1349 888) ******************************************************************************** EST sequence 9 +strand 713 n (File: SGN-E544255+) 1 GGACGAGGGA CGAATGTTCC TAAGGGGGGA AGGATGTTAC GCCTCGTATT TTTATACGTC 61 GTGCGCGTCA TGAACTAGTA TATGTAAGTT CGGGAAATGA GATTTTATTT TAAGTTCCAA 121 GTGTTTAAAG AAATATTATG CAAGGATGTT AATTCCATAT GTGATATTAA TTAGTGTGGG 181 ATTAATTAGG GGCTGATTTG GATTTAATTT ATCGAGTGGG CCCCACCACT CAAGGCAAAG 241 TAAGAATTCA GATTTTGGGA GATAGCCTTA GTGGAGAATG TGTAGTGGGG GGCTCCTCCA 301 CTTCATATTG CTAAATATTA AAAAGAAAAT CTGATTTGGG GATAGAATTA GTGGAGCATT 361 TGTAGTGGTG GGCTGCCTCC ACTTCATATT GTCAAATGGT GTAGTGAAAT ACTTTGCAAC 421 ATATCATATC TTTCACCACA TGACTTGGGC AGCCATGGAA ATGGAGAAAA CCAGCCATTA 481 AACTCTTGGA CAGCAGCTGC AAAGAAATTG GTTAGTAATC TCTTTGCTTG GTTTGTTAAT 541 TCCTTAGAAT ACCTTTGTTA ATTAGACATT TATGTTAAGA AGGGGGACGT GAACAGTATC 601 TTAGGAATTT GTTTTAGTTA TTGAATGTGC TAAGGATGAG CAGAAACCAT GATCGGATTG 661 CTAGCGGTGT TATATTTGTG TTGGGCTGTT TTGATTAAAG TAAGCTGCTG GAA Predicted gene structure (within gDNA segment 2793 to 1): Exon 1 1698 1677 ( 22 n); cDNA 309 330 ( 22 n); score: 0.591 Intron 1 1676 1477 ( 200 n); Pd: 0.207 (s: 0), Pa: 0.000 (s: 0.72) Exon 2 1476 1094 ( 383 n); cDNA 331 713 ( 383 n); score: 0.830 MATCH C06HBa0057J04.1-8- SGN-E544255+ 0.830 405 0.568 C PGS_C06HBa0057J04.1-8-_SGN-E544255+ (1698 1677,1476 1094) Alignment (genomic DNA sequence = upper lines): TGCGCATCAT GAACTAGTAG ATGTAAGTCC GGGGAATGAG ATTTTATTTT AAGTTCCAAG 1639 ||| | || || || | || TGCTAAATAT TAAAAAGAAA AT........ .......... .......... .......... 330 TGATTAAAGA AATATTAGGC AAGGATGTTA ATTCCAAATG TGTTATTAAG TATGATTTGG 1579 .......... .......... .......... .......... .......... .......... 330 ATAATGTAAG TGCCATTAAC TTTAAGTGAG GGATTAATTA GTGGCTGATT AGGATTAAAT 1519 .......... .......... .......... .......... .......... .......... 330 TAATCTAGTG GGCCCCACCA CTCAAGGAAA ATTAAAAGGG ATCAGATTGG GAGCTGGCCT 1459 | |||| | | | | | | .......... .......... .......... .......... ..CTGATTTG GGGATAGAAT 348 TAGTGGGACA CGTGTAGAGG GAGGCTGCCT CCACTTCATA TTAT-AAATG AGGTGGTGAA 1400 |||||| || ||||| || |||||||| |||||||||| || | ||||| || ||||| TAGTGGAGCA TTTGTAGTGG TGGGCTGCCT CCACTTCATA TTGTCAAATG GTGTAGTGAA 408 ATGCATTGAA ACATATCATC TTATTTACCT CTTGGCTTAG GCAGCCATGG AAATGGAGAA 1340 || | ||| | ||||||||| | || ||| | || ||| | |||||||||| |||||||||| ATACTTTGCA ACATATCATA TCTTTCACCA CATGACTTGG GCAGCCATGG AAATGGAGAA 468 AACCAACCAT GCAACTCTTG GCCACCAGCT GCAAAGAATT TGGTTAGTAA TCTCTTTGCT 1280 ||||| |||| |||||||| | || ||||| |||||||| | |||||||||| |||||||||| AACCAGCCAT TAAACTCTTG GACAGCAGCT GCAAAGAAAT TGGTTAGTAA TCTCTTTGCT 528 TGGTGTGTTA ATTCCTTAGA ATGCCTTTGT TAATTAGACA TTAATGTTAA GAAGGAGG-C 1221 |||| ||||| |||||||||| || ||||||| |||||||||| || ||||||| ||||| || | TGGTTTGTTA ATTCCTTAGA ATACCTTTGT TAATTAGACA TTTATGTTAA GAAGGGGGAC 588 ATGAACAGTA TCTTAGGAAG TTTGTTTTAG TTATTGAATG TGCTAAGTAT GAACGGAAAC 1161 ||||||||| ||||||||| |||||||||| |||||||||| ||||||| || || | ||||| GTGAACAGTA TCTTAGGAA- TTTGTTTTAG TTATTGAATG TGCTAAGGAT GAGCAGAAAC 647 CATTATCAAA TTATTAGTGG TGTCGTGTTA GTGCTTGGGT TGTTTTGATT AAAGCAAATT 1101 ||| ||| | || ||| || ||| | || ||| ||||| |||||||||| |||| || | CATGATCGGA TTGCTAGCGG TGTTATATTT GTG-TTGGGC TGTTTTGATT AAAGTAAGCT 706 GCGGGAA 1094 || |||| GCTGGAA 713 hqPGS_C06HBa0057J04.1-8-_SGN-E544255+ (1476 1094) Total number of EST alignments reported: 10 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 2793: PGL 1 (- strand): 1725 1 AGS-1 (279 219,173 1) SCR (e 0.738 d 0.000 a 0.000,e 0.896) Exon 1 279 219 ( 61 n); score: 0.738 Intron 1 218 174 ( 45 n); Pd: 0.000 Pa: 0.000 Exon 2 173 1 ( 173 n); score: 0.896 PGS (279 219,173 1) SGN-E544254- 3-phase translation of AGS-1 (-strand): . . . . . . 279 TTTCTTGCATTTGTCACCGAGTCACTCACTAGAGGGTCGGGTATGTATATTATACATATT F L A F V T E S L T R G S G M Y I I H I F L H L S P S H S L E G R V C I L Y I L S C I C H R V T H - R V G Y V Y Y T Y . : . . . . . 219 A : TGATGATTATTTTGCCGAGCCCCTTACTAGGGAAGTTGGGCACCTTATATGTTAAAGAT : M M I I L P S P L L G K L G T L Y V K D : - - L F C R A P Y - G S W A P Y M L K I Y : D D Y F A E P L T R E V G H L I C - R . . . . . . 114 ATGCACGATTTTCACTTAAAAGGGTATATGTGTAGCGATATTTTGTTTCAACTTACCATA M H D F H L K G Y M C S D I L F Q L T I C T I F T - K G I C V A I F C F N L P Y Y A R F S L K R V Y V - R Y F V S T Y H . . . . . . 54 TATGTATCCTATCATGTTGACCTTATGCTTTACATACTCAGTACATTGTTCGTA Y V S Y H V D L M L Y I L S T L F V M Y P I M L T L C F T Y S V H C S I C I L S C - P Y A L H T Q Y I V R Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-8-_PGL-1_AGS-1_PPS_1 (279 219,173 1) (frame '1'; 234 bp, 78 residues) 1 FLAFVTESLT RGSGMYIIHI MMIILPSPLL GKLGTLYVKD MHDFHLKGYM CSDILFQLTI 61 YVSYHVDLML YILSTLFV AGS-2 (1344 1298,329 144) SCR (e 0.872 d 0.992 a 0.881,e 0.909) Exon 1 1344 1298 ( 47 n); score: 0.872 Intron 1 1297 330 ( 968 n); Pd: 0.992 Pa: 0.881 Exon 2 329 144 ( 186 n); score: 0.909 PGS (1344 1298,329 144) SGN-E538151+ PGS (1344 1298,329 144) SGN-E538156+ PGS (1342 1298,329 144) SGN-E268096+ 3-phase translation of AGS-2 (-strand): . . . . . : . 1344 GAGAAAACCAACCATGCAACTCTTGGCCACCAGCTGCAAAGAATTTG : ACTAATTTATCAT E K T N H A T L G H Q L Q R I - : L I Y H R K P T M Q L L A T S C K E F : D - F I I E N Q P C N S W P P A A K N L : T N L S . . . . . . 316 TTCACCGAGTTTCGGGTCGGGTAATGTTCGTGCGGAGTTTCTTGCATTTGTCACCGAGTC F T E F R V G - C S C G V S C I C H R V S P S F G S G N V R A E F L A F V T E S F H R V S G R V M F V R S F L H L S P S . . . . . . 256 ACTCACTAGAGGGTCGGGTATGTATATTATACATATTATTGGTGATGAGGATGGTTATGA T H - R V G Y V Y Y T Y Y W - - G W L - L T R G S G M Y I I H I I G D E D G Y D H S L E G R V C I L Y I L L V M R M V M . . . . . . 196 TGATGATGATGACGGAGATGATGTGATGATTATTTTGCCGAGCCCCTTACTAG - - - - R R - C D D Y F A E P L T D D D D G D D V M I I L P S P L L M M M M T E M M - - L F C R A P Y - Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-8-_PGL-1_AGS-2_PPS_1 (1342 1298,329 171) (frame '0'; 201 bp, 67 residues) 1 ENQPCNSWPP AAKNLTNLSF HRVSGRVMFV RSFLHLSPSH SLEGRVCILY ILLVMRMVMM 61 MMMTEMM- AGS-3 (1725 1590,1556 640) SCR (e 0.919 d 0.000 a 0.000,e 0.858) Exon 1 1725 1590 ( 136 n); score: 0.919 Intron 1 1589 1557 ( 33 n); Pd: 0.000 Pa: 0.000 Exon 2 1556 640 ( 917 n); score: 0.858 PGS (1342 640) SGN-E328093+ PGS (1349 888) SGN-E298250+ PGS (1476 1094) SGN-E544255+ PGS (1725 1590,1556 1411) SGN-E307342+ 3-phase translation of AGS-3 (-strand): . . . . . . 1725 TGTTATGCCTCGTATTTTTATACGTAGTGCGCATCATGAACTAGTAGATGTAAGTCCGGG C Y A S Y F Y T - C A S - T S R C K S G V M P R I F I R S A H H E L V D V S P G L C L V F L Y V V R I M N - - M - V R . . . . . . 1665 GAATGAGATTTTATTTTAAGTTCCAAGTGATTAAAGAAATATTAGGCAAGGATGTTAATT E - D F I L S S K - L K K Y - A R M L I N E I L F - V P S D - R N I R Q G C - F G M R F Y F K F Q V I K E I L G K D V N . . : . . . . 1605 CCAAATGTGTTATTAA : TAAGTGAGGGATTAATTAGTGGCTGATTAGGATTAAATTAATCT P N V L L : I S E G L I S G - L G L N - S Q M C Y - : - V R D - L V A D - D - I N L S K C V I N : K - G I N - W L I R I K L I . . . . . . 1512 AGTGGGCCCCACCACTCAAGGAAAATTAAAAGGGATCAGATTGGGAGCTGGCCTTAGTGG S G P H H S R K I K R D Q I G S W P - W V G P T T Q G K L K G I R L G A G L S G - W A P P L K E N - K G S D W E L A L V . . . . . . 1452 GACACGTGTAGAGGGAGGCTGCCTCCACTTCATATTATAAATGAGGTGGTGAAATGCATT D T C R G R L P P L H I I N E V V K C I T R V E G G C L H F I L - M R W - N A L G H V - R E A A S T S Y Y K - G G E M H . . . . . . 1392 GAAACATATCATCTTATTTACCTCTTGGCTTAGGCAGCCATGGAAATGGAGAAAACCAAC E T Y H L I Y L L A - A A M E M E K T N K H I I L F T S W L R Q P W K W R K P T - N I S S Y L P L G L G S H G N G E N Q . . . . . . 1332 CATGCAACTCTTGGCCACCAGCTGCAAAGAATTTGGTTAGTAATCTCTTTGCTTGGTGTG H A T L G H Q L Q R I W L V I S L L G V M Q L L A T S C K E F G - - S L C L V C P C N S W P P A A K N L V S N L F A W C . . . . . . 1272 TTAATTCCTTAGAATGCCTTTGTTAATTAGACATTAATGTTAAGAAGGAGGCATGAACAG L I P - N A F V N - T L M L R R R H E Q - F L R M P L L I R H - C - E G G M N S V N S L E C L C - L D I N V K K E A - T . . . . . . 1212 TATCTTAGGAAGTTTGTTTTAGTTATTGAATGTGCTAAGTATGAACGGAAACCATTATCA Y L R K F V L V I E C A K Y E R K P L S I L G S L F - L L N V L S M N G N H Y Q V S - E V C F S Y - M C - V - T E T I I . . . . . . 1152 AATTATTAGTGGTGTCGTGTTAGTGCTTGGGTTGTTTTGATTAAAGCAAATTGCGGGAAA N Y - W C R V S A W V V L I K A N C G K I I S G V V L V L G L F - L K Q I A G N K L L V V S C - C L G C F D - S K L R E . . . . . . 1092 TTCTATTTTGGCATTATGTATATGTTAAATGTGATTATAGGTATATTCTCCAAAGGATAA F Y F G I M Y M L N V I I G I F S K G - S I L A L C I C - M - L - V Y S P K D N I L F W H Y V Y V K C D Y R Y I L Q R I . . . . . . 1032 CTAAGATAAGGTAGATGTGTTGCGAATTATAAAGTGAGTTATCGCTCGGTGTGTCGTTGC L R - G R C V A N Y K V S Y R S V C R C - D K V D V L R I I K - V I A R C V V A T K I R - M C C E L - S E L S L G V S L . . . . . . 972 TTCGTTACTATGGTTGCCGAGACGAAACTGTTTTGGGGGAGGGGGAGGGGGCTGTTTAAT F V T M V A E T K L F W G R G R G L F N S L L W L P R R N C F G G G G G G C L I L R Y Y G C R D E T V L G E G E G A V - . . . . . . 912 ATGATTCGTTGGGTTATATGTGTTATTGGTATTGCTATGGATAATTTGGGTTGTTGTCGG M I R W V I C V I G I A M D N L G C C R - F V G L Y V L L V L L W I I W V V V G Y D S L G Y M C Y W Y C Y G - F G L L S . . . . . . 852 ATTTGGACAAAGTAAGGAAAATAGGGGAAATGCTGCCGGATTTTTGTTAGATTATTAGCT I W T K - G K - G K C C R I F V R L L A F G Q S K E N R G N A A G F L L D Y - L D L D K V R K I G E M L P D F C - I I S . . . . . . 792 AGATTATAATAAGTAGTAAAGCGCGACGTTTATCTAATTGCGGCACGATTGTTGCTTGTC R L - - V V K R D V Y L I A A R L L L V D Y N K - - S A T F I - L R H D C C L S - I I I S S K A R R L S N C G T I V A C . . . . . . 732 ATAGAATAATAGCTTGAGCAGTAAATATTTGACGTGCGAATCAACTATACGGTATGTAAG I E - - L E Q - I F D V R I N Y T V C K - N N S L S S K Y L T C E S T I R Y V R H R I I A - A V N I - R A N Q L Y G M - . . . . 672 GCTACCCCTTCTTTCTTTGTTTGGCATGACTTT A T P S F F V W H D F L P L L S L F G M T G Y P F F L C L A - L Maximal non-overlapping open reading frames (>= 64 codons): none AGS-4 (1360 1298,733 682) SCR (e 0.952 d 0.992 a 0.000,e 0.846) Exon 1 1360 1298 ( 63 n); score: 0.952 Intron 1 1297 734 ( 564 n); Pd: 0.992 Pa: 0.000 Exon 2 733 682 ( 52 n); score: 0.846 PGS (1360 1298,733 682) SGN-E543103- PGS (1360 1298,733 682) SGN-E543104+ 3-phase translation of AGS-4 (-strand): . . . . . . 1360 GGCAGCCATGGAAATGGAGAAAACCAACCATGCAACTCTTGGCCACCAGCTGCAAAGAAT G S H G N G E N Q P C N S W P P A A K N A A M E M E K T N H A T L G H Q L Q R I Q P W K W R K P T M Q L L A T S C K E . : . . . . . 1300 TTG : CATAGAATAATAGCTTGAGCAGTAAATATTTGACGTGCGAATCAACTATACG L : H R I I A - A V N I - R A N Q L Y C : I E - - L E Q - I F D V R I N Y T F : A - N N S L S S K Y L T C E S T I Maximal non-overlapping open reading frames (>= 64 codons): none ... finished at: Mon Jul 24 23:14:26 2006 ________________________________________________________________________________ Sequence 9: C06HBa0057J04.1-9, from 1 to 9921, both strands analyzed. ... started at: Mon Jul 24 23:14:26 2006 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 10 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 3 ... matches indexed, elapsed seconds = 3 HitsTableSize = 25 ******************************************************************************** EST sequence 7 -strand 586 n (File: SGN-E543103-) 1 GGCAGCCATG GAAATGGAGA AAATCAACCA TGCAACTCTT GACCAGCAGC TGCAAAGAAT 61 TTGGTTTGAT TAATAGCTTG AGCAGTAAAT ATTGGACGTG CGGCTCGACT ATACGGTGTA 121 GACGCTCAGT TCGGTGATCC TCCCGCCTAG GATATCTACT TTGCTGAGTG GGAGAGCTCC 181 ACTGTTTCGT AGCCCAGTCA TTTTGGTACA TAACTTTTTG TGTAGTCTTT TGCTTGTCTA 241 TGGGTATGGT GGGGCCCTGT CCCGTCGAGT TTCACTACTA TACTCTTAGA GGTCCATAGA 301 CATCGCGTGG GTTGTATATA TATGTTTTGG ATAATGGTCT GGACATGGTT TGTTTGGGAT 361 GTCCACTTGT ACAAGGGCAG CCTTGTCAGC TGCGTACATC TTTGTGTATT GTGTAGTGGC 421 AGCCTTGTCG GCTGCGTATG CTATTATGCT TTGGATAGTG GCGGCCTTGT CGGCTCGCGT 481 ATGTTGTTAC GGTTGAATGG TTATGACTCC TTATGAGACA GATCCACTTT ATATATATAT 541 ATATGGCGTT GGGGTTGGCG TGATTGATAA AAAAAAGGGG GGCCCG Predicted gene structure (within gDNA segment 4602 to 1131): Exon 1 3941 3873 ( 69 n); cDNA 1 68 ( 68 n); score: 0.848 Intron 1 3872 3514 ( 359 n); Pd: 0.900 (s: 0.84), Pa: 0.889 (s: 0.94) Exon 2 3513 3467 ( 47 n); cDNA 69 115 ( 47 n); score: 0.936 Intron 2 3466 2744 ( 723 n); Pd: 0.990 (s: 0.94), Pa: 0.000 (s: 0.96) Exon 3 2743 2411 ( 333 n); cDNA 116 448 ( 333 n); score: 0.905 Intron 3 2410 2293 ( 118 n); Pd: 0.000 (s: 0.82), Pa: 0.973 (s: 0) Exon 4 2292 2285 ( 8 n); cDNA 449 456 ( 8 n); score: 0.750 MATCH C06HBa0057J04.1-9- SGN-E543103- 0.896 457 0.780 C PGS_C06HBa0057J04.1-9-_SGN-E543103- (3941 3873,3513 3467,2743 2411,2292 2285) Alignment (genomic DNA sequence = upper lines): GGCAGCCATG GAAATGGAG- AAACAAACCC TGCAACTCTT GGCCAGCAGC TGCAAATAAT 3883 |||||||||| ||||||||| ||| |||| |||||||||| | |||||||| |||||| ||| GGCAGCCATG GAAATGGAGA AAATCAACCA TGCAACTCTT GACCAGCAGC TGCAAAGAAT 60 TTGGTTAGTA ATCTCCTTGT TTGGTGTGTT AATTCTTTAG AATACCCTTG TTAATTATCC 3823 |||||| | TTGGTT--TG .......... .......... .......... .......... .......... 68 ATTAATTTTA AGAAGGGGGC GTGACCAGTA GCTTAGGAAG TTTGTTTTAG TTATTGAATG 3763 .......... .......... .......... .......... .......... .......... 68 TGCTAAGTAT GAATGGAAAC CATAATCGGA TTATTAGTGG TGTCATGTTG GTGCTTGGGC 3703 .......... .......... .......... .......... .......... .......... 68 TGTTTATATG ATTCTTTGGG TTATATGTGT TATTGGTATT GCTGTGGATA ATTTGGATTG 3643 .......... .......... .......... .......... .......... .......... 68 TTGTCGGATT GGGACGAAGT AAGGAAAATA GGGGAGGTGC TGCCGAATTT TCGTTAGATT 3583 .......... .......... .......... .......... .......... .......... 68 ATTAGCTAGC TTACAAGAAA GTAAAGCACG ATGTTTATCT AATTGCGGCA CGATTGTTGC 3523 .......... .......... .......... .......... .......... .......... 68 TTGTTATAGA TTAATAGCTT GAGCAGTAAA TAATGGACGT GCGGCTCAAT TATACGGTAT 3463 | |||||||||| |||||||||| || ||||||| ||||||| | |||||| .........A TTAATAGCTT GAGCAGTAAA TATTGGACGT GCGGCTCGAC TATACG.... 115 GTAACGCTGT CCCTTCTTTC TTTGCTTGGC ATGACTTTTA AAAATAAGCG AATAACGGAC 3403 .......... .......... .......... .......... .......... .......... 115 AGATTTGATA CTTACCTCTA AAGCGTCTAG GTGATGTATA TTCTTGCTTC CACAATTATT 3343 .......... .......... .......... .......... .......... .......... 115 CCTCTATATA TCGGTTATGT CTAAGGCTAT GATGATCTCT AATATCTATG GTAATGCTTC 3283 .......... .......... .......... .......... .......... .......... 115 TTAGAGTCAT TGAAATTTTA CGTTTTCATA TCGTATTAAA GGTTCATAAT CTTGATAAAA 3223 .......... .......... .......... .......... .......... .......... 115 CATTAATCTT TGGTAATACT CCTTGCTGGT TCACGTTGAT TGTTCTATTG AGTTATAAGA 3163 .......... .......... .......... .......... .......... .......... 115 AATGATTTTA ATTGCATATG GTTGCTCATA ATATTCTGCT CGTGCATAGA GTCATTTATC 3103 .......... .......... .......... .......... .......... .......... 115 ATTTCACCGA GTCCCGGGCC GGGTAATGTT CGTGCGGAGT TTCTTGCATA TGTCACCGAG 3043 .......... .......... .......... .......... .......... .......... 115 TTCCTCACTA GAGGGCCGGG TATGTATATT ATATATATGA TTGGTGATGA GGATGGTTAT 2983 .......... .......... .......... .......... .......... .......... 115 GATGATGATG ATGACGGAGA TGACGTGATG ATTATTTTGC CGAGCCCTTT ACTAGGGAAG 2923 .......... .......... .......... .......... .......... .......... 115 CTGGGCACCT TAAATGTTAA ATATATGCAT GATTTTCACT TAAAAAGTAT ATGTGTAGCG 2863 .......... .......... .......... .......... .......... .......... 115 ATATTTTGTT TCGACTTGCC ACATTGGTAT CCTGTCATCT TTACCTTATG CTTTACATAC 2803 .......... .......... .......... .......... .......... .......... 115 TCAGTACATT GTCCGTACTG ACCCCCCTTT CCTCGGGGGG CTGCGTTTCA TGCCTGCAGG 2743 | .......... .......... .......... .......... .......... .........G 116 TGTAGACGCG CAGTTCGGTG ATCCTCCCGC CTAGGATATC TACTCTGCTG ATTGGGAGAG 2683 ||||||||| |||||||||| |||||||||| |||||||||| |||| ||||| | |||||||| TGTAGACGCT CAGTTCGGTG ATCCTCCCGC CTAGGATATC TACTTTGCTG AGTGGGAGAG 176 CTCCACTGTT CCGGAGCCCA GTCGTTTTGG TACATAAC-T TTTGTGTAGG CTTTTGCTCG 2624 |||||||||| || |||||| ||| |||||| |||||||| | ||||||||| |||||||| | CTCCACTGTT TCGTAGCCCA GTCATTTTGG TACATAACTT TTTGTGTAGT CTTTTGCTTG 236 TCTATGGGTA TGGCGGGGCC CTGTCCCGTC GAGTTTCACT AATGTACTCT TAGAGGTCTG 2564 |||||||||| ||| |||||| |||||||||| |||||||||| | | |||||| |||||||| TCTATGGGTA TGGTGGGGCC CTGTCCCGTC GAGTTTCACT ACTATACTCT TAGAGGTCCA 296 TGGACATTAT GTGGGTTGTA TATATATGTT TTGGATAATG GTCTGGACAT GGTTTGTTTG 2504 | ||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TAGACATCGC GTGGGTTGTA TATATATGTT TTGGATAATG GTCTGGACAT GGTTTGTTTG 356 GGATGTCCGC TTGTACAGGG GCAGCCTTGT CGGCTGCGTA CATCATTATG CTTTGAATAG 2444 |||||||| | ||||||| || |||||||||| | |||||||| |||| || || ||| ||| GGATGTCCAC TTGTACAAGG GCAGCCTTGT CAGCTGCGTA CATCTTTGTG TATTGTGTAG 416 TGGCGGCCTT GTCGGCTCGC GTATGCTGTT ATGGTTGAAT GGTTATGACT CCTTATGAGA 2384 |||| ||||| ||||||| || ||||||| || ||| TGGCAGCCTT GTCGGCT-GC GTATGCTATT ATG....... .......... .......... 448 CAGGTCCTCT TATATATATA TATGACGTTG GGGTTGGCTT GATTTGATTA AATTCCATAT 2324 .......... .......... .......... .......... .......... .......... 448 TGTCTTAGTT TCAGTTGGTC ATACTTAGCA GGTTTGTAT 2285 |||| || .......... .......... .......... .CTTTGGAT 456 hqPGS_C06HBa0057J04.1-9-_SGN-E543103- (3941 3873,3513 3467,2743 2411,2292 2285) ******************************************************************************** EST sequence 24 +strand 577 n (File: SGN-E543104+) 1 GGCAGCCATG GAAATGGAGA AAATCAACCA TGCAACTCTT GACCAGCAGC TGCAAAGAAT 61 TTGGTTTGAT TAATAGCTTG AGCAGTAAAT ATTGGACGTG CGGCTCGACT ATACGGTGTA 121 GACGCTCAGT TCGGTGATCC TCCCGCCTAG GATATCTACT TTGCTGAGTG GGAGAGCTCC 181 ACTGTTTCGT AGCCCAGTCA TTTTGGTACA TAACTTTTTG TGTAGTCTTT TGCTTGTCTA 241 TGGGTATGGT GGGGCCCTGT CCCGTCGAGT TTCACTACTA TACTCTTAGA GGTCCATAGA 301 CATCGCGTGG GTTGTATATA TATGTTTTGG ATAATGGTCT GGACATGGTT TGTTTGGGAT 361 GTCCACTTGT ACAAGGGCAG CCTTGTCAGC TGCGTACATC TTTGTGTATT GTGTAGTGGC 421 AGCCTTGTCG GCTGCGTATG CTATTATGCT TTGGATAGTG GCGGCCTTGT CGGCTCGCGT 481 ATGTTGTTAC GGTTGAATGG TTATGACTCC TTATGAGACA GATCCACTTT ATATATATAT 541 ATATGGCGAT GGGGTTGGCT TGATTTGATT AAAAAAA Predicted gene structure (within gDNA segment 7254 to 1211): Exon 1 3941 3873 ( 69 n); cDNA 1 68 ( 68 n); score: 0.848 Intron 1 3872 3514 ( 359 n); Pd: 0.900 (s: 0.84), Pa: 0.889 (s: 0.94) Exon 2 3513 3467 ( 47 n); cDNA 69 115 ( 47 n); score: 0.936 Intron 2 3466 2744 ( 723 n); Pd: 0.990 (s: 0.94), Pa: 0.000 (s: 0.96) Exon 3 2743 2411 ( 333 n); cDNA 116 448 ( 333 n); score: 0.905 Intron 3 2410 2293 ( 118 n); Pd: 0.000 (s: 0.82), Pa: 0.973 (s: 0) Exon 4 2292 2285 ( 8 n); cDNA 449 456 ( 8 n); score: 0.750 MATCH C06HBa0057J04.1-9- SGN-E543104+ 0.896 457 0.792 C PGS_C06HBa0057J04.1-9-_SGN-E543104+ (3941 3873,3513 3467,2743 2411,2292 2285) Alignment (genomic DNA sequence = upper lines): GGCAGCCATG GAAATGGAG- AAACAAACCC TGCAACTCTT GGCCAGCAGC TGCAAATAAT 3883 |||||||||| ||||||||| ||| |||| |||||||||| | |||||||| |||||| ||| GGCAGCCATG GAAATGGAGA AAATCAACCA TGCAACTCTT GACCAGCAGC TGCAAAGAAT 60 TTGGTTAGTA ATCTCCTTGT TTGGTGTGTT AATTCTTTAG AATACCCTTG TTAATTATCC 3823 |||||| | TTGGTT--TG .......... .......... .......... .......... .......... 68 ATTAATTTTA AGAAGGGGGC GTGACCAGTA GCTTAGGAAG TTTGTTTTAG TTATTGAATG 3763 .......... .......... .......... .......... .......... .......... 68 TGCTAAGTAT GAATGGAAAC CATAATCGGA TTATTAGTGG TGTCATGTTG GTGCTTGGGC 3703 .......... .......... .......... .......... .......... .......... 68 TGTTTATATG ATTCTTTGGG TTATATGTGT TATTGGTATT GCTGTGGATA ATTTGGATTG 3643 .......... .......... .......... .......... .......... .......... 68 TTGTCGGATT GGGACGAAGT AAGGAAAATA GGGGAGGTGC TGCCGAATTT TCGTTAGATT 3583 .......... .......... .......... .......... .......... .......... 68 ATTAGCTAGC TTACAAGAAA GTAAAGCACG ATGTTTATCT AATTGCGGCA CGATTGTTGC 3523 .......... .......... .......... .......... .......... .......... 68 TTGTTATAGA TTAATAGCTT GAGCAGTAAA TAATGGACGT GCGGCTCAAT TATACGGTAT 3463 | |||||||||| |||||||||| || ||||||| ||||||| | |||||| .........A TTAATAGCTT GAGCAGTAAA TATTGGACGT GCGGCTCGAC TATACG.... 115 GTAACGCTGT CCCTTCTTTC TTTGCTTGGC ATGACTTTTA AAAATAAGCG AATAACGGAC 3403 .......... .......... .......... .......... .......... .......... 115 AGATTTGATA CTTACCTCTA AAGCGTCTAG GTGATGTATA TTCTTGCTTC CACAATTATT 3343 .......... .......... .......... .......... .......... .......... 115 CCTCTATATA TCGGTTATGT CTAAGGCTAT GATGATCTCT AATATCTATG GTAATGCTTC 3283 .......... .......... .......... .......... .......... .......... 115 TTAGAGTCAT TGAAATTTTA CGTTTTCATA TCGTATTAAA GGTTCATAAT CTTGATAAAA 3223 .......... .......... .......... .......... .......... .......... 115 CATTAATCTT TGGTAATACT CCTTGCTGGT TCACGTTGAT TGTTCTATTG AGTTATAAGA 3163 .......... .......... .......... .......... .......... .......... 115 AATGATTTTA ATTGCATATG GTTGCTCATA ATATTCTGCT CGTGCATAGA GTCATTTATC 3103 .......... .......... .......... .......... .......... .......... 115 ATTTCACCGA GTCCCGGGCC GGGTAATGTT CGTGCGGAGT TTCTTGCATA TGTCACCGAG 3043 .......... .......... .......... .......... .......... .......... 115 TTCCTCACTA GAGGGCCGGG TATGTATATT ATATATATGA TTGGTGATGA GGATGGTTAT 2983 .......... .......... .......... .......... .......... .......... 115 GATGATGATG ATGACGGAGA TGACGTGATG ATTATTTTGC CGAGCCCTTT ACTAGGGAAG 2923 .......... .......... .......... .......... .......... .......... 115 CTGGGCACCT TAAATGTTAA ATATATGCAT GATTTTCACT TAAAAAGTAT ATGTGTAGCG 2863 .......... .......... .......... .......... .......... .......... 115 ATATTTTGTT TCGACTTGCC ACATTGGTAT CCTGTCATCT TTACCTTATG CTTTACATAC 2803 .......... .......... .......... .......... .......... .......... 115 TCAGTACATT GTCCGTACTG ACCCCCCTTT CCTCGGGGGG CTGCGTTTCA TGCCTGCAGG 2743 | .......... .......... .......... .......... .......... .........G 116 TGTAGACGCG CAGTTCGGTG ATCCTCCCGC CTAGGATATC TACTCTGCTG ATTGGGAGAG 2683 ||||||||| |||||||||| |||||||||| |||||||||| |||| ||||| | |||||||| TGTAGACGCT CAGTTCGGTG ATCCTCCCGC CTAGGATATC TACTTTGCTG AGTGGGAGAG 176 CTCCACTGTT CCGGAGCCCA GTCGTTTTGG TACATAAC-T TTTGTGTAGG CTTTTGCTCG 2624 |||||||||| || |||||| ||| |||||| |||||||| | ||||||||| |||||||| | CTCCACTGTT TCGTAGCCCA GTCATTTTGG TACATAACTT TTTGTGTAGT CTTTTGCTTG 236 TCTATGGGTA TGGCGGGGCC CTGTCCCGTC GAGTTTCACT AATGTACTCT TAGAGGTCTG 2564 |||||||||| ||| |||||| |||||||||| |||||||||| | | |||||| |||||||| TCTATGGGTA TGGTGGGGCC CTGTCCCGTC GAGTTTCACT ACTATACTCT TAGAGGTCCA 296 TGGACATTAT GTGGGTTGTA TATATATGTT TTGGATAATG GTCTGGACAT GGTTTGTTTG 2504 | ||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TAGACATCGC GTGGGTTGTA TATATATGTT TTGGATAATG GTCTGGACAT GGTTTGTTTG 356 GGATGTCCGC TTGTACAGGG GCAGCCTTGT CGGCTGCGTA CATCATTATG CTTTGAATAG 2444 |||||||| | ||||||| || |||||||||| | |||||||| |||| || || ||| ||| GGATGTCCAC TTGTACAAGG GCAGCCTTGT CAGCTGCGTA CATCTTTGTG TATTGTGTAG 416 TGGCGGCCTT GTCGGCTCGC GTATGCTGTT ATGGTTGAAT GGTTATGACT CCTTATGAGA 2384 |||| ||||| ||||||| || ||||||| || ||| TGGCAGCCTT GTCGGCT-GC GTATGCTATT ATG....... .......... .......... 448 CAGGTCCTCT TATATATATA TATGACGTTG GGGTTGGCTT GATTTGATTA AATTCCATAT 2324 .......... .......... .......... .......... .......... .......... 448 TGTCTTAGTT TCAGTTGGTC ATACTTAGCA GGTTTGTAT 2285 |||| || .......... .......... .......... .CTTTGGAT 456 hqPGS_C06HBa0057J04.1-9-_SGN-E543104+ (3941 3873,3513 3467,2743 2411,2292 2285) ******************************************************************************** EST sequence 5 -strand 542 n (File: SGN-E374134-) 1 CAGCCATGGA AATGGAGAAA ACCAGCCATT AAACTCTTGG ACAGCAGCTG CAAAGAAATT 61 GATTTATACC TTGAGCAGTA AATATTGGAC GTACGGCTCG ACTATTCGGT GTAGACGCTC 121 AGTTCGGTGA TCCTCCCGCC TAGGATATCT ACTCTGCTTT TTGGGAGAGC TCCACTGTTC 181 CGGAGCCCAG TCGTTTTGGT ACATAACTTC TTATGTAGTC TTTTGCTTGT CTATGGGTAT 241 GGCGGGGCCC TGTCCCGTCA AGTTTCACTA CTATACTCTT AGAGGTCTGT AGACATCGTG 301 TGGGTTGTAT AATTATGTTT TGGATAATGG TCTGGACATG GTTTGTTTGG GATGTCCATT 361 TGTACAAGTG CAGCCTTGTC GGTTGTGAAC ATCATTGTGT ATTGTGTAGT GGCAGCCTCG 421 TCGGCTGCGT ATGCTATTAT GTTTTGGATA GTGGCGGCCT TGTCGGCTCG CATATGTTGT 481 TACGATTTAA TGGTTATGAC TCTTTATGAG ATAGATCCAC TTTATATATA AAAAAAAAAA 541 AA Predicted gene structure (within gDNA segment 4532 to 46): Exon 1 3939 3880 ( 60 n); cDNA 1 61 ( 61 n); score: 0.825 Intron 1 3879 3514 ( 366 n); Pd: 0.995 (s: 0.79), Pa: 0.889 (s: 0.85) Exon 2 3513 3467 ( 47 n); cDNA 62 108 ( 47 n); score: 0.851 Intron 2 3466 2744 ( 723 n); Pd: 0.990 (s: 0.85), Pa: 0.000 (s: 0.98) Exon 3 2743 2411 ( 333 n); cDNA 109 441 ( 333 n); score: 0.902 Intron 3 2410 2293 ( 118 n); Pd: 0.000 (s: 0.82), Pa: 0.973 (s: 0) Exon 4 2292 2285 ( 8 n); cDNA 442 449 ( 8 n); score: 0.750 PPA cDNA 528 542 MATCH C06HBa0057J04.1-9- SGN-E374134- 0.891 448 0.827 C PGS_C06HBa0057J04.1-9-_SGN-E374134- (3939 3880,3513 3467,2743 2411,2292 2285) Alignment (genomic DNA sequence = upper lines): CAGCCATGGA AATGGAG-AA ACAAACCCTG CAACTCTTGG CCAGCAGCTG CAAATAATTT 3881 |||||||||| ||||||| || || | || | ||||||||| ||||||||| |||| || || CAGCCATGGA AATGGAGAAA ACCAGCCATT AAACTCTTGG ACAGCAGCTG CAAAGAAATT 60 GGTTAGTAAT CTCCTTGTTT GGTGTGTTAA TTCTTTAGAA TACCCTTGTT AATTATCCAT 3821 | G......... .......... .......... .......... .......... .......... 61 TAATTTTAAG AAGGGGGCGT GACCAGTAGC TTAGGAAGTT TGTTTTAGTT ATTGAATGTG 3761 .......... .......... .......... .......... .......... .......... 61 CTAAGTATGA ATGGAAACCA TAATCGGATT ATTAGTGGTG TCATGTTGGT GCTTGGGCTG 3701 .......... .......... .......... .......... .......... .......... 61 TTTATATGAT TCTTTGGGTT ATATGTGTTA TTGGTATTGC TGTGGATAAT TTGGATTGTT 3641 .......... .......... .......... .......... .......... .......... 61 GTCGGATTGG GACGAAGTAA GGAAAATAGG GGAGGTGCTG CCGAATTTTC GTTAGATTAT 3581 .......... .......... .......... .......... .......... .......... 61 TAGCTAGCTT ACAAGAAAGT AAAGCACGAT GTTTATCTAA TTGCGGCACG ATTGTTGCTT 3521 .......... .......... .......... .......... .......... .......... 61 GTTATAGATT AATAGCTTGA GCAGTAAATA ATGGACGTGC GGCTCAATTA TACGGTATGT 3461 ||| ||| ||||| |||||||||| ||||||| | ||||| | || | || .......ATT TATACCTTGA GCAGTAAATA TTGGACGTAC GGCTCGACTA TTCG...... 108 AACGCTGTCC CTTCTTTCTT TGCTTGGCAT GACTTTTAAA AATAAGCGAA TAACGGACAG 3401 .......... .......... .......... .......... .......... .......... 108 ATTTGATACT TACCTCTAAA GCGTCTAGGT GATGTATATT CTTGCTTCCA CAATTATTCC 3341 .......... .......... .......... .......... .......... .......... 108 TCTATATATC GGTTATGTCT AAGGCTATGA TGATCTCTAA TATCTATGGT AATGCTTCTT 3281 .......... .......... .......... .......... .......... .......... 108 AGAGTCATTG AAATTTTACG TTTTCATATC GTATTAAAGG TTCATAATCT TGATAAAACA 3221 .......... .......... .......... .......... .......... .......... 108 TTAATCTTTG GTAATACTCC TTGCTGGTTC ACGTTGATTG TTCTATTGAG TTATAAGAAA 3161 .......... .......... .......... .......... .......... .......... 108 TGATTTTAAT TGCATATGGT TGCTCATAAT ATTCTGCTCG TGCATAGAGT CATTTATCAT 3101 .......... .......... .......... .......... .......... .......... 108 TTCACCGAGT CCCGGGCCGG GTAATGTTCG TGCGGAGTTT CTTGCATATG TCACCGAGTT 3041 .......... .......... .......... .......... .......... .......... 108 CCTCACTAGA GGGCCGGGTA TGTATATTAT ATATATGATT GGTGATGAGG ATGGTTATGA 2981 .......... .......... .......... .......... .......... .......... 108 TGATGATGAT GACGGAGATG ACGTGATGAT TATTTTGCCG AGCCCTTTAC TAGGGAAGCT 2921 .......... .......... .......... .......... .......... .......... 108 GGGCACCTTA AATGTTAAAT ATATGCATGA TTTTCACTTA AAAAGTATAT GTGTAGCGAT 2861 .......... .......... .......... .......... .......... .......... 108 ATTTTGTTTC GACTTGCCAC ATTGGTATCC TGTCATCTTT ACCTTATGCT TTACATACTC 2801 .......... .......... .......... .......... .......... .......... 108 AGTACATTGT CCGTACTGAC CCCCCTTTCC TCGGGGGGCT GCGTTTCATG CCTGCAGGTG 2741 ||| .......... .......... .......... .......... .......... .......GTG 111 TAGACGCGCA GTTCGGTGAT CCTCCCGCCT AGGATATCTA CTCTGCTGAT TGGGAGAGCT 2681 ||||||| || |||||||||| |||||||||| |||||||||| ||||||| | |||||||||| TAGACGCTCA GTTCGGTGAT CCTCCCGCCT AGGATATCTA CTCTGCTTTT TGGGAGAGCT 171 CCACTGTTCC GGAGCCCAGT CGTTTTGGTA CATAACTT-T TGTGTAGGCT TTTGCTCGTC 2622 |||||||||| |||||||||| |||||||||| |||||||| | | ||||| || |||||| ||| CCACTGTTCC GGAGCCCAGT CGTTTTGGTA CATAACTTCT TATGTAGTCT TTTGCTTGTC 231 TATGGGTATG GCGGGGCCCT GTCCCGTCGA GTTTCACTAA TGTACTCTTA GAGGTCTGTG 2562 |||||||||| |||||||||| |||||||| | ||||||||| | |||||||| ||||||||| TATGGGTATG GCGGGGCCCT GTCCCGTCAA GTTTCACTAC TATACTCTTA GAGGTCTGTA 291 GACATTATGT GGGTTGTATA TATATGTTTT GGATAATGGT CTGGACATGG TTTGTTTGGG 2502 ||||| ||| |||||||||| |||||||| |||||||||| |||||||||| |||||||||| GACATCGTGT GGGTTGTATA ATTATGTTTT GGATAATGGT CTGGACATGG TTTGTTTGGG 351 ATGTCCGCTT GTACAGGGGC AGCCTTGTCG GCTGCGTACA TCATTATGCT TTGAATAGTG 2442 |||||| || ||||| | || |||||||||| | || | ||| ||||| || ||| ||||| ATGTCCATTT GTACAAGTGC AGCCTTGTCG GTTGTGAACA TCATTGTGTA TTGTGTAGTG 411 GCGGCCTTGT CGGCTCGCGT ATGCTGTTAT GGTTGAATGG TTATGACTCC TTATGAGACA 2382 || |||| || ||||| |||| ||||| |||| | GCAGCCTCGT CGGCT-GCGT ATGCTATTAT G......... .......... .......... 441 GGTCCTCTTA TATATATATA TGACGTTGGG GTTGGCTTGA TTTGATTAAA TTCCATATTG 2322 .......... .......... .......... .......... .......... .......... 441 TCTTAGTTTC AGTTGGTCAT ACTTAGCAGG TTTGTAT 2285 |||| || .......... .......... .........T TTTGGAT 449 hqPGS_C06HBa0057J04.1-9-_SGN-E374134- (3939 3880,3513 3467,2743 2411,2292 2285) ******************************************************************************** EST sequence 15 +strand 547 n (File: SGN-E305738+) 1 AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 61 ATTTATACCT TGAGCAGTAA ATATTGGACG TACGGCTCGA CTATTCGGTG TAGACGCTCA 121 GTTCGGTGAT CCTCCCGCCT AGGATATCTA CTCTGCTTTT TGGGAGAGCT CCACTGTTCC 181 GGAGCCCAGT CGTTTTGGTA CATAACTTCT TATGTAGTCT TTTGCTTGTC TATGGGTATG 241 GCGGGGCCCT GTCCCGTCAA GTTTCACTAC TATACTCTTA GAGGTCTGTA GACATCGTGT 301 GGGTTGTATA ATTATGTTTT GGATAATGGT CTGGACATGG TTTGTTTGGG ATGTCCATTT 361 GTACAAGTGC AGCCTTGTCG GTTGTGAACA TCATTGTGTA TTGTGTAGTG GCAGCCTCGT 421 CGGCTGCGTA TGCTATTATG TTTTGGATAG TGGCGGCCTT GTCGGCTCGC ATATGTTGTT 481 ACGATTTAAT GGTTATGACT CTTTATGAGA TAGATCCACT TTATATATGA AATGAATGGA 541 CTAACTA Predicted gene structure (within gDNA segment 4512 to 1): Exon 1 3938 3880 ( 59 n); cDNA 1 60 ( 60 n); score: 0.822 Intron 1 3879 3514 ( 366 n); Pd: 0.995 (s: 0.79), Pa: 0.889 (s: 0.85) Exon 2 3513 3467 ( 47 n); cDNA 61 107 ( 47 n); score: 0.851 Intron 2 3466 2744 ( 723 n); Pd: 0.990 (s: 0.85), Pa: 0.000 (s: 0.98) Exon 3 2743 2411 ( 333 n); cDNA 108 440 ( 333 n); score: 0.902 Intron 3 2410 2293 ( 118 n); Pd: 0.000 (s: 0.82), Pa: 0.973 (s: 0) Exon 4 2292 2285 ( 8 n); cDNA 441 448 ( 8 n); score: 0.750 MATCH C06HBa0057J04.1-9- SGN-E305738+ 0.890 447 0.817 C PGS_C06HBa0057J04.1-9-_SGN-E305738+ (3938 3880,3513 3467,2743 2411,2292 2285) Alignment (genomic DNA sequence = upper lines): AGCCATGGAA ATGGAGA-AA CAAACCCTGC AACTCTTGGC CAGCAGCTGC AAATAATTTG 3880 |||||||||| ||||||| || | | || | ||||||||| |||||||||| ||| || ||| AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 60 GTTAGTAATC TCCTTGTTTG GTGTGTTAAT TCTTTAGAAT ACCCTTGTTA ATTATCCATT 3820 .......... .......... .......... .......... .......... .......... 60 AATTTTAAGA AGGGGGCGTG ACCAGTAGCT TAGGAAGTTT GTTTTAGTTA TTGAATGTGC 3760 .......... .......... .......... .......... .......... .......... 60 TAAGTATGAA TGGAAACCAT AATCGGATTA TTAGTGGTGT CATGTTGGTG CTTGGGCTGT 3700 .......... .......... .......... .......... .......... .......... 60 TTATATGATT CTTTGGGTTA TATGTGTTAT TGGTATTGCT GTGGATAATT TGGATTGTTG 3640 .......... .......... .......... .......... .......... .......... 60 TCGGATTGGG ACGAAGTAAG GAAAATAGGG GAGGTGCTGC CGAATTTTCG TTAGATTATT 3580 .......... .......... .......... .......... .......... .......... 60 AGCTAGCTTA CAAGAAAGTA AAGCACGATG TTTATCTAAT TGCGGCACGA TTGTTGCTTG 3520 .......... .......... .......... .......... .......... .......... 60 TTATAGATTA ATAGCTTGAG CAGTAAATAA TGGACGTGCG GCTCAATTAT ACGGTATGTA 3460 ||| ||| |||||| ||||||||| ||||||| || |||| | ||| || ......ATTT ATACCTTGAG CAGTAAATAT TGGACGTACG GCTCGACTAT TCG....... 107 ACGCTGTCCC TTCTTTCTTT GCTTGGCATG ACTTTTAAAA ATAAGCGAAT AACGGACAGA 3400 .......... .......... .......... .......... .......... .......... 107 TTTGATACTT ACCTCTAAAG CGTCTAGGTG ATGTATATTC TTGCTTCCAC AATTATTCCT 3340 .......... .......... .......... .......... .......... .......... 107 CTATATATCG GTTATGTCTA AGGCTATGAT GATCTCTAAT ATCTATGGTA ATGCTTCTTA 3280 .......... .......... .......... .......... .......... .......... 107 GAGTCATTGA AATTTTACGT TTTCATATCG TATTAAAGGT TCATAATCTT GATAAAACAT 3220 .......... .......... .......... .......... .......... .......... 107 TAATCTTTGG TAATACTCCT TGCTGGTTCA CGTTGATTGT TCTATTGAGT TATAAGAAAT 3160 .......... .......... .......... .......... .......... .......... 107 GATTTTAATT GCATATGGTT GCTCATAATA TTCTGCTCGT GCATAGAGTC ATTTATCATT 3100 .......... .......... .......... .......... .......... .......... 107 TCACCGAGTC CCGGGCCGGG TAATGTTCGT GCGGAGTTTC TTGCATATGT CACCGAGTTC 3040 .......... .......... .......... .......... .......... .......... 107 CTCACTAGAG GGCCGGGTAT GTATATTATA TATATGATTG GTGATGAGGA TGGTTATGAT 2980 .......... .......... .......... .......... .......... .......... 107 GATGATGATG ACGGAGATGA CGTGATGATT ATTTTGCCGA GCCCTTTACT AGGGAAGCTG 2920 .......... .......... .......... .......... .......... .......... 107 GGCACCTTAA ATGTTAAATA TATGCATGAT TTTCACTTAA AAAGTATATG TGTAGCGATA 2860 .......... .......... .......... .......... .......... .......... 107 TTTTGTTTCG ACTTGCCACA TTGGTATCCT GTCATCTTTA CCTTATGCTT TACATACTCA 2800 .......... .......... .......... .......... .......... .......... 107 GTACATTGTC CGTACTGACC CCCCTTTCCT CGGGGGGCTG CGTTTCATGC CTGCAGGTGT 2740 |||| .......... .......... .......... .......... .......... ......GTGT 111 AGACGCGCAG TTCGGTGATC CTCCCGCCTA GGATATCTAC TCTGCTGATT GGGAGAGCTC 2680 |||||| ||| |||||||||| |||||||||| |||||||||| |||||| || |||||||||| AGACGCTCAG TTCGGTGATC CTCCCGCCTA GGATATCTAC TCTGCTTTTT GGGAGAGCTC 171 CACTGTTCCG GAGCCCAGTC GTTTTGGTAC ATAACTT-TT GTGTAGGCTT TTGCTCGTCT 2621 |||||||||| |||||||||| |||||||||| ||||||| || ||||| ||| ||||| |||| CACTGTTCCG GAGCCCAGTC GTTTTGGTAC ATAACTTCTT ATGTAGTCTT TTGCTTGTCT 231 ATGGGTATGG CGGGGCCCTG TCCCGTCGAG TTTCACTAAT GTACTCTTAG AGGTCTGTGG 2561 |||||||||| |||||||||| ||||||| || |||||||| | ||||||||| |||||||| | ATGGGTATGG CGGGGCCCTG TCCCGTCAAG TTTCACTACT ATACTCTTAG AGGTCTGTAG 291 ACATTATGTG GGTTGTATAT ATATGTTTTG GATAATGGTC TGGACATGGT TTGTTTGGGA 2501 |||| |||| ||||||||| ||||||||| |||||||||| |||||||||| |||||||||| ACATCGTGTG GGTTGTATAA TTATGTTTTG GATAATGGTC TGGACATGGT TTGTTTGGGA 351 TGTCCGCTTG TACAGGGGCA GCCTTGTCGG CTGCGTACAT CATTATGCTT TGAATAGTGG 2441 ||||| ||| |||| | ||| |||||||||| || | |||| |||| || | || |||||| TGTCCATTTG TACAAGTGCA GCCTTGTCGG TTGTGAACAT CATTGTGTAT TGTGTAGTGG 411 CGGCCTTGTC GGCTCGCGTA TGCTGTTATG GTTGAATGGT TATGACTCCT TATGAGACAG 2381 | |||| ||| |||| ||||| |||| ||||| CAGCCTCGTC GGCT-GCGTA TGCTATTATG .......... .......... .......... 440 GTCCTCTTAT ATATATATAT GACGTTGGGG TTGGCTTGAT TTGATTAAAT TCCATATTGT 2321 .......... .......... .......... .......... .......... .......... 440 CTTAGTTTCA GTTGGTCATA CTTAGCAGGT TTGTAT 2285 | ||| || .......... .......... ........TT TTGGAT 448 hqPGS_C06HBa0057J04.1-9-_SGN-E305738+ (3938 3880,3513 3467,2743 2411,2292 2285) ******************************************************************************** EST sequence 20 +strand 542 n (File: SGN-E374135+) 1 AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 61 ATTTATACCT TGAGCAGTAA ATATTGGACG TACGGCTCGA CTATTCGGTG TAGACGCTCA 121 GTTCGGTGAT CCTCCCGCCT AGGATATCTA CTCTGCTTTT TGGGAGAGCT CCACTGTTCC 181 GGAGCCCAGT CGTTTTGGTA CATAACTTCT TATGTAGTCT TTTGCTTGTC TATGGGTATG 241 GCGGGGCCCT GTCCCGTCAA GTTTCACTAC TATACTCTTA GAGGTCTGTA GACATCGTGT 301 GGGTTGTATA ATTATGTTTT GGATAATGGT CTGGACATGG TTTGTTTGGG ATGTCCATTT 361 GTACAAGTGC AGCCTTGTCG GTTGTGAACA TCATTGTGTA TTGTGTAGTG GCAGCCTCGT 421 CGGCTGCGTA TGCTATTATG TTTTGGATAG TGGCGGCCTT GTCGGCTCGC ATATGTTGTT 481 ACGATTTAAT GGTTATGACT CTTTATGAGA TAGATCCACT TTATATATAA AAAAAAAAAA 541 AA Predicted gene structure (within gDNA segment 4512 to 26): Exon 1 3938 3880 ( 59 n); cDNA 1 60 ( 60 n); score: 0.822 Intron 1 3879 3514 ( 366 n); Pd: 0.995 (s: 0.79), Pa: 0.889 (s: 0.85) Exon 2 3513 3467 ( 47 n); cDNA 61 107 ( 47 n); score: 0.851 Intron 2 3466 2744 ( 723 n); Pd: 0.990 (s: 0.85), Pa: 0.000 (s: 0.98) Exon 3 2743 2411 ( 333 n); cDNA 108 440 ( 333 n); score: 0.902 Intron 3 2410 2293 ( 118 n); Pd: 0.000 (s: 0.82), Pa: 0.973 (s: 0) Exon 4 2292 2285 ( 8 n); cDNA 441 448 ( 8 n); score: 0.750 PPA cDNA 527 542 MATCH C06HBa0057J04.1-9- SGN-E374135+ 0.890 447 0.825 C PGS_C06HBa0057J04.1-9-_SGN-E374135+ (3938 3880,3513 3467,2743 2411,2292 2285) Alignment (genomic DNA sequence = upper lines): AGCCATGGAA ATGGAGA-AA CAAACCCTGC AACTCTTGGC CAGCAGCTGC AAATAATTTG 3880 |||||||||| ||||||| || | | || | ||||||||| |||||||||| ||| || ||| AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 60 GTTAGTAATC TCCTTGTTTG GTGTGTTAAT TCTTTAGAAT ACCCTTGTTA ATTATCCATT 3820 .......... .......... .......... .......... .......... .......... 60 AATTTTAAGA AGGGGGCGTG ACCAGTAGCT TAGGAAGTTT GTTTTAGTTA TTGAATGTGC 3760 .......... .......... .......... .......... .......... .......... 60 TAAGTATGAA TGGAAACCAT AATCGGATTA TTAGTGGTGT CATGTTGGTG CTTGGGCTGT 3700 .......... .......... .......... .......... .......... .......... 60 TTATATGATT CTTTGGGTTA TATGTGTTAT TGGTATTGCT GTGGATAATT TGGATTGTTG 3640 .......... .......... .......... .......... .......... .......... 60 TCGGATTGGG ACGAAGTAAG GAAAATAGGG GAGGTGCTGC CGAATTTTCG TTAGATTATT 3580 .......... .......... .......... .......... .......... .......... 60 AGCTAGCTTA CAAGAAAGTA AAGCACGATG TTTATCTAAT TGCGGCACGA TTGTTGCTTG 3520 .......... .......... .......... .......... .......... .......... 60 TTATAGATTA ATAGCTTGAG CAGTAAATAA TGGACGTGCG GCTCAATTAT ACGGTATGTA 3460 ||| ||| |||||| ||||||||| ||||||| || |||| | ||| || ......ATTT ATACCTTGAG CAGTAAATAT TGGACGTACG GCTCGACTAT TCG....... 107 ACGCTGTCCC TTCTTTCTTT GCTTGGCATG ACTTTTAAAA ATAAGCGAAT AACGGACAGA 3400 .......... .......... .......... .......... .......... .......... 107 TTTGATACTT ACCTCTAAAG CGTCTAGGTG ATGTATATTC TTGCTTCCAC AATTATTCCT 3340 .......... .......... .......... .......... .......... .......... 107 CTATATATCG GTTATGTCTA AGGCTATGAT GATCTCTAAT ATCTATGGTA ATGCTTCTTA 3280 .......... .......... .......... .......... .......... .......... 107 GAGTCATTGA AATTTTACGT TTTCATATCG TATTAAAGGT TCATAATCTT GATAAAACAT 3220 .......... .......... .......... .......... .......... .......... 107 TAATCTTTGG TAATACTCCT TGCTGGTTCA CGTTGATTGT TCTATTGAGT TATAAGAAAT 3160 .......... .......... .......... .......... .......... .......... 107 GATTTTAATT GCATATGGTT GCTCATAATA TTCTGCTCGT GCATAGAGTC ATTTATCATT 3100 .......... .......... .......... .......... .......... .......... 107 TCACCGAGTC CCGGGCCGGG TAATGTTCGT GCGGAGTTTC TTGCATATGT CACCGAGTTC 3040 .......... .......... .......... .......... .......... .......... 107 CTCACTAGAG GGCCGGGTAT GTATATTATA TATATGATTG GTGATGAGGA TGGTTATGAT 2980 .......... .......... .......... .......... .......... .......... 107 GATGATGATG ACGGAGATGA CGTGATGATT ATTTTGCCGA GCCCTTTACT AGGGAAGCTG 2920 .......... .......... .......... .......... .......... .......... 107 GGCACCTTAA ATGTTAAATA TATGCATGAT TTTCACTTAA AAAGTATATG TGTAGCGATA 2860 .......... .......... .......... .......... .......... .......... 107 TTTTGTTTCG ACTTGCCACA TTGGTATCCT GTCATCTTTA CCTTATGCTT TACATACTCA 2800 .......... .......... .......... .......... .......... .......... 107 GTACATTGTC CGTACTGACC CCCCTTTCCT CGGGGGGCTG CGTTTCATGC CTGCAGGTGT 2740 |||| .......... .......... .......... .......... .......... ......GTGT 111 AGACGCGCAG TTCGGTGATC CTCCCGCCTA GGATATCTAC TCTGCTGATT GGGAGAGCTC 2680 |||||| ||| |||||||||| |||||||||| |||||||||| |||||| || |||||||||| AGACGCTCAG TTCGGTGATC CTCCCGCCTA GGATATCTAC TCTGCTTTTT GGGAGAGCTC 171 CACTGTTCCG GAGCCCAGTC GTTTTGGTAC ATAACTT-TT GTGTAGGCTT TTGCTCGTCT 2621 |||||||||| |||||||||| |||||||||| ||||||| || ||||| ||| ||||| |||| CACTGTTCCG GAGCCCAGTC GTTTTGGTAC ATAACTTCTT ATGTAGTCTT TTGCTTGTCT 231 ATGGGTATGG CGGGGCCCTG TCCCGTCGAG TTTCACTAAT GTACTCTTAG AGGTCTGTGG 2561 |||||||||| |||||||||| ||||||| || |||||||| | ||||||||| |||||||| | ATGGGTATGG CGGGGCCCTG TCCCGTCAAG TTTCACTACT ATACTCTTAG AGGTCTGTAG 291 ACATTATGTG GGTTGTATAT ATATGTTTTG GATAATGGTC TGGACATGGT TTGTTTGGGA 2501 |||| |||| ||||||||| ||||||||| |||||||||| |||||||||| |||||||||| ACATCGTGTG GGTTGTATAA TTATGTTTTG GATAATGGTC TGGACATGGT TTGTTTGGGA 351 TGTCCGCTTG TACAGGGGCA GCCTTGTCGG CTGCGTACAT CATTATGCTT TGAATAGTGG 2441 ||||| ||| |||| | ||| |||||||||| || | |||| |||| || | || |||||| TGTCCATTTG TACAAGTGCA GCCTTGTCGG TTGTGAACAT CATTGTGTAT TGTGTAGTGG 411 CGGCCTTGTC GGCTCGCGTA TGCTGTTATG GTTGAATGGT TATGACTCCT TATGAGACAG 2381 | |||| ||| |||| ||||| |||| ||||| CAGCCTCGTC GGCT-GCGTA TGCTATTATG .......... .......... .......... 440 GTCCTCTTAT ATATATATAT GACGTTGGGG TTGGCTTGAT TTGATTAAAT TCCATATTGT 2321 .......... .......... .......... .......... .......... .......... 440 CTTAGTTTCA GTTGGTCATA CTTAGCAGGT TTGTAT 2285 | ||| || .......... .......... ........TT TTGGAT 448 hqPGS_C06HBa0057J04.1-9-_SGN-E374135+ (3938 3880,3513 3467,2743 2411,2292 2285) ******************************************************************************** EST sequence 26 +strand 523 n (File: SGN-E303695+) 1 AAATGGAGAA AACCAGCCAT TAAACTCTTG GACAGCAGCT GCAAAGAAAT TGGTGTAGAC 61 GCTCAGTTCG GTGATCCTCC CGCCTAGGAT ATCTACTCTG CTTTTTGGGA GAGCTCCACT 121 GTTCCGGAGC CCAGTCGTTT TGGTACATAA CTTCTTATGT AGTCTTTTGC TTGTCTATGG 181 GTATGGCGGG GCCCTGTCCC GTCAAGTTTC ACTACTATAC TCTTAGAGGT CTGTAGACAT 241 CGTGTGGGTT GTATAATTAT GTTTTGGATA ATGGTCTGGA CATGGTTTGT TTGGGATGTC 301 CATTTGTACA AGTGCAGCCT TGTCGGTTGT GAACATCATT GTGTATTGTG TAGTGGCAGC 361 CTCGTCGGCT GCGTATGCTA TTATGTTTTG GATAGTGGCG GCCTTGTCGG CTCGCATATG 421 TTGTTACGAT TTAATGGTTA TGACTCTTTA TGAGATAGAT CCACTTTATA TATATATATA 481 TATATATGGC GTTGGGTTTA GCTTGATTTG ATTAAAAAAA AAA Predicted gene structure (within gDNA segment 3962 to 1): Exon 1 3930 3880 ( 51 n); cDNA 1 52 ( 52 n); score: 0.794 Intron 1 3879 2744 (1136 n); Pd: 0.995 (s: 0.79), Pa: 0.000 (s: 0.98) Exon 2 2743 2411 ( 333 n); cDNA 53 385 ( 333 n); score: 0.902 Intron 2 2410 2293 ( 118 n); Pd: 0.000 (s: 0.82), Pa: 0.973 (s: 0) Exon 3 2292 2285 ( 8 n); cDNA 386 393 ( 8 n); score: 0.750 PPA cDNA 514 523 MATCH C06HBa0057J04.1-9- SGN-E303695+ 0.888 392 0.750 C PGS_C06HBa0057J04.1-9-_SGN-E303695+ (3930 3880,2743 2411,2292 2285) Alignment (genomic DNA sequence = upper lines): AAATGGAG-A AACAAACCCT GCAACTCTTG GCCAGCAGCT GCAAATAATT TGGTTAGTAA 3872 |||||||| | ||| | || | |||||||| | |||||||| ||||| || | || AAATGGAGAA AACCAGCCAT TAAACTCTTG GACAGCAGCT GCAAAGAAAT TG........ 52 TCTCCTTGTT TGGTGTGTTA ATTCTTTAGA ATACCCTTGT TAATTATCCA TTAATTTTAA 3812 .......... .......... .......... .......... .......... .......... 52 GAAGGGGGCG TGACCAGTAG CTTAGGAAGT TTGTTTTAGT TATTGAATGT GCTAAGTATG 3752 .......... .......... .......... .......... .......... .......... 52 AATGGAAACC ATAATCGGAT TATTAGTGGT GTCATGTTGG TGCTTGGGCT GTTTATATGA 3692 .......... .......... .......... .......... .......... .......... 52 TTCTTTGGGT TATATGTGTT ATTGGTATTG CTGTGGATAA TTTGGATTGT TGTCGGATTG 3632 .......... .......... .......... .......... .......... .......... 52 GGACGAAGTA AGGAAAATAG GGGAGGTGCT GCCGAATTTT CGTTAGATTA TTAGCTAGCT 3572 .......... .......... .......... .......... .......... .......... 52 TACAAGAAAG TAAAGCACGA TGTTTATCTA ATTGCGGCAC GATTGTTGCT TGTTATAGAT 3512 .......... .......... .......... .......... .......... .......... 52 TAATAGCTTG AGCAGTAAAT AATGGACGTG CGGCTCAATT ATACGGTATG TAACGCTGTC 3452 .......... .......... .......... .......... .......... .......... 52 CCTTCTTTCT TTGCTTGGCA TGACTTTTAA AAATAAGCGA ATAACGGACA GATTTGATAC 3392 .......... .......... .......... .......... .......... .......... 52 TTACCTCTAA AGCGTCTAGG TGATGTATAT TCTTGCTTCC ACAATTATTC CTCTATATAT 3332 .......... .......... .......... .......... .......... .......... 52 CGGTTATGTC TAAGGCTATG ATGATCTCTA ATATCTATGG TAATGCTTCT TAGAGTCATT 3272 .......... .......... .......... .......... .......... .......... 52 GAAATTTTAC GTTTTCATAT CGTATTAAAG GTTCATAATC TTGATAAAAC ATTAATCTTT 3212 .......... .......... .......... .......... .......... .......... 52 GGTAATACTC CTTGCTGGTT CACGTTGATT GTTCTATTGA GTTATAAGAA ATGATTTTAA 3152 .......... .......... .......... .......... .......... .......... 52 TTGCATATGG TTGCTCATAA TATTCTGCTC GTGCATAGAG TCATTTATCA TTTCACCGAG 3092 .......... .......... .......... .......... .......... .......... 52 TCCCGGGCCG GGTAATGTTC GTGCGGAGTT TCTTGCATAT GTCACCGAGT TCCTCACTAG 3032 .......... .......... .......... .......... .......... .......... 52 AGGGCCGGGT ATGTATATTA TATATATGAT TGGTGATGAG GATGGTTATG ATGATGATGA 2972 .......... .......... .......... .......... .......... .......... 52 TGACGGAGAT GACGTGATGA TTATTTTGCC GAGCCCTTTA CTAGGGAAGC TGGGCACCTT 2912 .......... .......... .......... .......... .......... .......... 52 AAATGTTAAA TATATGCATG ATTTTCACTT AAAAAGTATA TGTGTAGCGA TATTTTGTTT 2852 .......... .......... .......... .......... .......... .......... 52 CGACTTGCCA CATTGGTATC CTGTCATCTT TACCTTATGC TTTACATACT CAGTACATTG 2792 .......... .......... .......... .......... .......... .......... 52 TCCGTACTGA CCCCCCTTTC CTCGGGGGGC TGCGTTTCAT GCCTGCAGGT GTAGACGCGC 2732 || |||||||| | .......... .......... .......... .......... ........GT GTAGACGCTC 64 AGTTCGGTGA TCCTCCCGCC TAGGATATCT ACTCTGCTGA TTGGGAGAGC TCCACTGTTC 2672 |||||||||| |||||||||| |||||||||| |||||||| |||||||||| |||||||||| AGTTCGGTGA TCCTCCCGCC TAGGATATCT ACTCTGCTTT TTGGGAGAGC TCCACTGTTC 124 CGGAGCCCAG TCGTTTTGGT ACATAACTT- TTGTGTAGGC TTTTGCTCGT CTATGGGTAT 2613 |||||||||| |||||||||| ||||||||| || ||||| | ||||||| || |||||||||| CGGAGCCCAG TCGTTTTGGT ACATAACTTC TTATGTAGTC TTTTGCTTGT CTATGGGTAT 184 GGCGGGGCCC TGTCCCGTCG AGTTTCACTA ATGTACTCTT AGAGGTCTGT GGACATTATG 2553 |||||||||| ||||||||| |||||||||| | ||||||| |||||||||| ||||| || GGCGGGGCCC TGTCCCGTCA AGTTTCACTA CTATACTCTT AGAGGTCTGT AGACATCGTG 244 TGGGTTGTAT ATATATGTTT TGGATAATGG TCTGGACATG GTTTGTTTGG GATGTCCGCT 2493 |||||||||| | ||||||| |||||||||| |||||||||| |||||||||| ||||||| | TGGGTTGTAT AATTATGTTT TGGATAATGG TCTGGACATG GTTTGTTTGG GATGTCCATT 304 TGTACAGGGG CAGCCTTGTC GGCTGCGTAC ATCATTATGC TTTGAATAGT GGCGGCCTTG 2433 |||||| | | |||||||||| || || | || |||||| || ||| |||| ||| |||| | TGTACAAGTG CAGCCTTGTC GGTTGTGAAC ATCATTGTGT ATTGTGTAGT GGCAGCCTCG 364 TCGGCTCGCG TATGCTGTTA TGGTTGAATG GTTATGACTC CTTATGAGAC AGGTCCTCTT 2373 |||||| ||| |||||| ||| || TCGGCT-GCG TATGCTATTA TG........ .......... .......... .......... 385 ATATATATAT ATGACGTTGG GGTTGGCTTG ATTTGATTAA ATTCCATATT GTCTTAGTTT 2313 .......... .......... .......... .......... .......... .......... 385 CAGTTGGTCA TACTTAGCAG GTTTGTAT 2285 |||| || .......... .......... TTTTGGAT 393 hqPGS_C06HBa0057J04.1-9-_SGN-E303695+ (3930 3880,2743 2411,2292 2285) ******************************************************************************** EST sequence 2 -strand 432 n (File: SGN-E225616-) 1 TATTCGGTGT AGACGCTCAG TTCGGTGATC CTCCCGCCTA GGATATCTAC TCTGCTTTTT 61 GGGAGAGCTC CACTGTTCCG GAGCCCAGTC GTTTTGGTAC ATAACTTCTT ATGTAGTCTT 121 TTGCTTGTCT ATGGGTATGG CGGGGCCCTG TCCCGTCAAG TTTCACTACT ATACTCTTAG 181 AGGTCTGTAG ACATCGTGTG GGTTGTATAA TTATGTTTTG GATAATGGTC TGGACATGGT 241 TTGTTTGGGA TGTCCATTTG TACAAGTGCA GCCTTGTCGG TTGTGAACAT CATTGTGTAT 301 TGTGTAGTGG CAGCCTCGTC GGCTGCGTAT GCTATTATGT TTTGGATAGT GGCGGCCTTG 361 TCGGCTCGCA TATGTTGTTA CGATTTAATG GTTATGACTC TTTATGAAAA AACCAAAAAA 421 AAAAAAAAAA AA Predicted gene structure (within gDNA segment 3512 to 126): Exon 1 2744 2411 ( 334 n); cDNA 6 339 ( 334 n); score: 0.903 Intron 1 2410 2293 ( 118 n); Pd: 0.000 (s: 0.82), Pa: 0.973 (s: 0) Exon 2 2292 2285 ( 8 n); cDNA 340 347 ( 8 n); score: 0.750 PPA cDNA 415 432 MATCH C06HBa0057J04.1-9- SGN-E225616- 0.903 342 0.792 C PGS_C06HBa0057J04.1-9-_SGN-E225616- (2744 2411,2292 2285) Alignment (genomic DNA sequence = upper lines): GGTGTAGACG CGCAGTTCGG TGATCCTCCC GCCTAGGATA TCTACTCTGC TGATTGGGAG 2685 |||||||||| | |||||||| |||||||||| |||||||||| |||||||||| | ||||||| GGTGTAGACG CTCAGTTCGG TGATCCTCCC GCCTAGGATA TCTACTCTGC TTTTTGGGAG 65 AGCTCCACTG TTCCGGAGCC CAGTCGTTTT GGTACATAAC TT-TTGTGTA GGCTTTTGCT 2626 |||||||||| |||||||||| |||||||||| |||||||||| || || |||| | |||||||| AGCTCCACTG TTCCGGAGCC CAGTCGTTTT GGTACATAAC TTCTTATGTA GTCTTTTGCT 125 CGTCTATGGG TATGGCGGGG CCCTGTCCCG TCGAGTTTCA CTAATGTACT CTTAGAGGTC 2566 ||||||||| |||||||||| |||||||||| || ||||||| ||| | |||| |||||||||| TGTCTATGGG TATGGCGGGG CCCTGTCCCG TCAAGTTTCA CTACTATACT CTTAGAGGTC 185 TGTGGACATT ATGTGGGTTG TATATATATG TTTTGGATAA TGGTCTGGAC ATGGTTTGTT 2506 ||| ||||| ||||||||| |||| |||| |||||||||| |||||||||| |||||||||| TGTAGACATC GTGTGGGTTG TATAATTATG TTTTGGATAA TGGTCTGGAC ATGGTTTGTT 245 TGGGATGTCC GCTTGTACAG GGGCAGCCTT GTCGGCTGCG TACATCATTA TGCTTTGAAT 2446 |||||||||| ||||||| | |||||||| ||||| || | |||||||| || ||| | TGGGATGTCC ATTTGTACAA GTGCAGCCTT GTCGGTTGTG AACATCATTG TGTATTGTGT 305 AGTGGCGGCC TTGTCGGCTC GCGTATGCTG TTATGGTTGA ATGGTTATGA CTCCTTATGA 2386 |||||| ||| | ||||||| ||||||||| ||||| AGTGGCAGCC TCGTCGGCT- GCGTATGCTA TTATG..... .......... .......... 339 GACAGGTCCT CTTATATATA TATATGACGT TGGGGTTGGC TTGATTTGAT TAAATTCCAT 2326 .......... .......... .......... .......... .......... .......... 339 ATTGTCTTAG TTTCAGTTGG TCATACTTAG CAGGTTTGTA T 2285 |||| | | .......... .......... .......... ...TTTTGGA T 347 hqPGS_C06HBa0057J04.1-9-_SGN-E225616- (2744 2411,2292 2285) ******************************************************************************** EST sequence 9 +strand 495 n (File: SGN-E306317+) 1 TAAACTCTTG GACAGCAGCT GCAAAGAAAT TGGTGTAGAC GCTCAGTTCG GTGATCCTCC 61 CGCCTAGGAT ATCTACTCTG CTGTTTGGGA GAGCTCCACT GTTCCGGAGC CCAGTCGTTT 121 TGGTACATAA CTTCTTATGT AGTCTTTTGC TTGTCTATGG GTATGGCGGG GCCCTGTCCC 181 GTCAAGTTTC ACTACTATAC TCTTAGAGGT CTGTAGACAT CGTGTGGGTT GTATAATTAT 241 GTTTTGGATA ATGGTCTGGA CATGGTTTGT TTGGGATGTC CATTTGTACA AGTGCAGCCT 301 TGTCGGTTGT GAACATCATT GTGTATTGTG TAGTGGCAGC CTCGTCGGCT GCGTATGCTA 361 TTATGTTTTG GATAGTGGCG GCCTTGTCGG CTCGCATATG TTGTTACGAT TTAATGGTTA 421 TGACTCTTTA TGAGATAGAT CCACTTTATA TATATATATA TATATATGGC GTTGGGTTTN 481 AAAAAAAAAA AAAAA Predicted gene structure (within gDNA segment 3762 to 1): Exon 1 2743 2411 ( 333 n); cDNA 33 365 ( 333 n); score: 0.905 Intron 1 2410 2293 ( 118 n); Pd: 0.000 (s: 0.82), Pa: 0.973 (s: 0) Exon 2 2292 2285 ( 8 n); cDNA 366 373 ( 8 n); score: 0.750 PPA cDNA 481 495 MATCH C06HBa0057J04.1-9- SGN-E306317+ 0.905 341 0.689 C PGS_C06HBa0057J04.1-9-_SGN-E306317+ (2743 2411,2292 2285) Alignment (genomic DNA sequence = upper lines): GTGTAGACGC GCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT GATTGGGAGA 2684 |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| | |||||||| GTGTAGACGC TCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT GTTTGGGAGA 92 GCTCCACTGT TCCGGAGCCC AGTCGTTTTG GTACATAACT T-TTGTGTAG GCTTTTGCTC 2625 |||||||||| |||||||||| |||||||||| |||||||||| | || ||||| |||||||| GCTCCACTGT TCCGGAGCCC AGTCGTTTTG GTACATAACT TCTTATGTAG TCTTTTGCTT 152 GTCTATGGGT ATGGCGGGGC CCTGTCCCGT CGAGTTTCAC TAATGTACTC TTAGAGGTCT 2565 |||||||||| |||||||||| |||||||||| | |||||||| || | ||||| |||||||||| GTCTATGGGT ATGGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC TTAGAGGTCT 212 GTGGACATTA TGTGGGTTGT ATATATATGT TTTGGATAAT GGTCTGGACA TGGTTTGTTT 2505 || ||||| |||||||||| ||| ||||| |||||||||| |||||||||| |||||||||| GTAGACATCG TGTGGGTTGT ATAATTATGT TTTGGATAAT GGTCTGGACA TGGTTTGTTT 272 GGGATGTCCG CTTGTACAGG GGCAGCCTTG TCGGCTGCGT ACATCATTAT GCTTTGAATA 2445 ||||||||| ||||||| | ||||||||| |||| || | |||||||| | | ||| || GGGATGTCCA TTTGTACAAG TGCAGCCTTG TCGGTTGTGA ACATCATTGT GTATTGTGTA 332 GTGGCGGCCT TGTCGGCTCG CGTATGCTGT TATGGTTGAA TGGTTATGAC TCCTTATGAG 2385 ||||| |||| ||||||| | |||||||| | |||| GTGGCAGCCT CGTCGGCT-G CGTATGCTAT TATG...... .......... .......... 365 ACAGGTCCTC TTATATATAT ATATGACGTT GGGGTTGGCT TGATTTGATT AAATTCCATA 2325 .......... .......... .......... .......... .......... .......... 365 TTGTCTTAGT TTCAGTTGGT CATACTTAGC AGGTTTGTAT 2285 |||| || .......... .......... .......... ..TTTTGGAT 373 hqPGS_C06HBa0057J04.1-9-_SGN-E306317+ (2743 2411,2292 2285) ******************************************************************************** EST sequence 8 -strand 843 n (File: SGN-E544254-) 1 GAGTCATTTA TCATTTCACC GAGTCCCGGG CCGGGTAATG TTCGTGCGGA GTTTCTTGCA 61 TATGTCACCG AGTCCCTCAC TAGAGGGCCG GGAATGTATA TTATATATAT GATTGGTGAT 121 GAGGATGGTT ATGATGATGA TGATGACGGA GATGATGTGA TGACTATTTC ACCGAGTCCC 181 TCACTAGAGG GCCGGGTACT ATGATGTATA TATAATGATG ATTATTTTGC CGAGTCCCTT 241 ACTAGGGAAG TTAGGCATCT TATATGTTAA AGATATGCAT GATTTTCACT TAAAAAGTAC 301 ATGTGTAGAG ATATCTTGTT TCGACTTATC ATGTTGGTAT CCTGTCATCT TTACCTTATG 361 CTTTACATAC TCAGTACATT GTCCGTACTG ACCCCCTTTT CTCGGGGGGC TGCGTTTCAT 421 GCCCGCAGGT GTAGACGCTC AGTTCGGTGA TCCTCCCGCC TAGGATATCT ACTCTGCTTT 481 TTGGGAGAGC TCCACTGTTC CGGAGCCCAG TCGTTTTGGT ACATAACTTC TTATGTAGTC 541 TTTTGCTTGT CTATGGGTAT GGCGGGGCCC TGTCCCGTCA AGTTTCACTA CTATACTCTT 601 AGAGGTCTGT AGACATCGTG TGGGTTGTAT AATTATGTTT TGGATAATGG TCTGGACATG 661 GTTTGTTTGG GATGTCCATT TGTACAAGTG CAGCCTTGTC GGTTGTGAAC ATCATTGTGT 721 ATTGTGTAGT GGCAGCCTCG TCGGCTGCGT ATGCTATTAT GTTTTGGATA GTGGCGGCCT 781 TGTCGGCTCG CATATGTTGT TACGATTTAA TGGTTATGAC TCTTTATGAG AAAAAAAAAG 841 AAA Predicted gene structure (within gDNA segment 8865 to 621): Exon 1 8255 8088 ( 168 n); cDNA 1 168 ( 168 n); score: 0.958 Intron 1 8087 3051 (5037 n); Pd: 0.000 (s: 0.96), Pa: 0.000 (s: 0.85) Exon 2 3050 3004 ( 47 n); cDNA 169 214 ( 46 n); score: 0.851 Intron 2 3003 2959 ( 45 n); Pd: 0.000 (s: 0.85), Pa: 0.000 (s: 0.86) Exon 3 2958 2411 ( 548 n); cDNA 215 761 ( 547 n); score: 0.908 PPA cDNA 829 839 MATCH C06HBa0057J04.1-9- SGN-E544254- 0.920 763 0.905 C PGS_C06HBa0057J04.1-9-_SGN-E544254- (8255 8088,3050 3004,2958 2411) Alignment (genomic DNA sequence = upper lines): GAGTCATTTA TCATTTCACC GAGTCCCGGG CAGGGTAATG TTCATGCGGA GTTTCTTGCA 8196 |||||||||| |||||||||| |||||||||| | |||||||| ||| |||||| |||||||||| GAGTCATTTA TCATTTCACC GAGTCCCGGG CCGGGTAATG TTCGTGCGGA GTTTCTTGCA 60 TATGTCACCG AGTTCCTCAC TAGAGGGCCG GGTATGTATA TTATATGTAT GATTGGTGAT 8136 |||||||||| ||| |||||| |||||||||| || ||||||| |||||| ||| |||||||||| TATGTCACCG AGTCCCTCAC TAGAGGGCCG GGAATGTATA TTATATATAT GATTGGTGAT 120 GAGGATGGTT ATGATGATGA TGATGACGGA GATGACGTGA TGATTATTTT GCCGAGCCCC 8076 |||||||||| |||||||||| |||||||||| ||||| |||| ||| |||| GAGGATGGTT ATGATGATGA TGATGACGGA GATGATGTGA TGACTATT.. .......... 168 TTATTAGGGA AGTTGGGCAC CTTAAATGTT AAATATATGC ATGATTTTCA CTTAAAAGGG 8016 .......... .......... .......... .......... .......... .......... 168 TATATGTGTA GCGATATTTT GTTTTGACTT GCTATATTGG TATGCTGTCA TCTTTACCTT 7956 .......... .......... .......... .......... .......... .......... 168 ATGCTTTACA TACTCATTAC ATTGTCTGTA CTGACCCCCC TTTCCTCGGG GGGCTGGTTT 7896 .......... .......... .......... .......... .......... .......... 168 TCATGCCCGC AGGTGTAGAC GCACAGTTTG GTGATCCTCC CGCCTAGGAT ATCTACTCTG 7836 .......... .......... .......... .......... .......... .......... 168 ATGATTGGGA GAGCTCCACT GTTCCGGAGC CTAGTCGTTT TGGTACATAA CTTTTGTGTA 7776 .......... .......... .......... .......... .......... .......... 168 GTCTTTTGCT CGTCTATGGG TATGGCGGGT CCCTGTCCCG TCGAGTTTCA CTAATGTACT 7716 .......... .......... .......... .......... .......... .......... 168 CTTAGAGGTC TGTGGACATT ATGTGGGTTG TATATATATT TTTTGGATAA TGGTCTGGAC 7656 .......... .......... .......... .......... .......... .......... 168 ATGGTTTGTT TGGGATGTCC GCTTGTACAG GGGCAGCCTT GTCGGCTGCG TACATCATTG 7596 .......... .......... .......... .......... .......... .......... 168 TGTATTGTGT AGTGGCAGCC TTGTCGGCAT ACGTATGTTA TTATGCTTTG AATAGTGGCG 7536 .......... .......... .......... .......... .......... .......... 168 GCCTTGTCGG CTCGCGTATG TTGTTATGGT TGAATGATTA TGACTCCTTA TGAGACAGGT 7476 .......... .......... .......... .......... .......... .......... 168 CCTCTTATAT ATATATGACG TTGGGGTTGG CTTGATTTGA TTAAATTCCA TATTGTCTTA 7416 .......... .......... .......... .......... .......... .......... 168 GTTTCAGTTG GTTATACTTA GCAGGTTTGT ATGTGGGTGT CCAAAACGGG CACTAGTCAC 7356 .......... .......... .......... .......... .......... .......... 168 GGCCCATCGG GTTGGGTCGT GACAAATTTC CCTTGAAATA GAATATAAAT TCTCTAAGCA 7296 .......... .......... .......... .......... .......... .......... 168 TATACTCTAG GGTTAGAATA GTACATTTCG CTTGGCCATC CGTTTGGGGA TAAAAAATGG 7236 .......... .......... .......... .......... .......... .......... 168 TGCTTTACTT CACATTGGTA CCCAACCCTT ATTGGAATGG ACTCCAAAAC ATAGATGTGA 7176 .......... .......... .......... .......... .......... .......... 168 ATTTTTAACC CCTATATTAT ATGATGGATA ATAAACTACC ATGGTGACAT ACAATATAAT 7116 .......... .......... .......... .......... .......... .......... 168 CTATGAAGAT CCTTGCATAA TCCTCCGAGT AAGTGGACTT GACTAGAATA AAATGAGTAG 7056 .......... .......... .......... .......... .......... .......... 168 ATTTGGTCAA CTTATCCACA ACCACCTATA TGGAGACATA TTGATTTTGT GTCTGAGGCA 6996 .......... .......... .......... .......... .......... .......... 168 AACCTACCAA AAAATTCATA TTTATGTCTT CCCACATCCA CGTATGGATA TGGACTTCTT 6936 .......... .......... .......... .......... .......... .......... 168 CATGTTGACC AACCGACTTT AACTAGTTTG TAATTTGGAC ATTGTGCAAC AAATTCCACT 6876 .......... .......... .......... .......... .......... .......... 168 ATATCCATTT TAAAGTCTTC CAACCAAATC ACTTCTCTAA GGTCCTGATA CATCTTTGTA 6816 .......... .......... .......... .......... .......... .......... 168 GAAACCGGAT GAATGGATTA ACGGGACCCA TGAGCTTCCT CTAGAATCCA GTTTCTCAAA 6756 .......... .......... .......... .......... .......... .......... 168 CCATCGACAT TTCAAACACA CAACCTTCCT TGATACCTCA GGACACCATC CCCTCCTAAG 6696 .......... .......... .......... .......... .......... .......... 168 GAGAATGCCT CATTCAGATT TCTTAGAACC AATCCTTTCA ACTACATCAA TAATGAATCA 6636 .......... .......... .......... .......... .......... .......... 168 AGGTATTTCT TAGATTTTAC CTAGACCACC AACGCGGATT CAGAGTTATG ATTGACCATA 6576 .......... .......... .......... .......... .......... .......... 168 AAACAACCAT TCAAAGAATC TTCCAATCTA TCACCAAATA TAGCCAACCT ATGAATATCT 6516 .......... .......... .......... .......... .......... .......... 168 TGCACTAGGT ATTTCTTTGA TTAATCTACA TGAGACACAC TACTCATGGT CATATGACTT 6456 .......... .......... .......... .......... .......... .......... 168 AGACCATCTG CAACCACGTT GGACATGTTT GGATGGTAGA GAACACTCAT GTCATAATCT 6396 .......... .......... .......... .......... .......... .......... 168 TTCAACAATT GAAACCACCT TCTTTGTTGG AGAGTTAATT CCTTTTGAGT AAACACATAT 6336 .......... .......... .......... .......... .......... .......... 168 TGAAGACTCT TGTGGTCGGT ATACACATAA CCATGAATAC CATACAAATG AAGTCTCCAT 6276 .......... .......... .......... .......... .......... .......... 168 ATTTTTAAGG CAAACATAAC GATGCTAGTT AAAGACCATG AGTTTGATAA TTTCTATGAT 6216 .......... .......... .......... .......... .......... .......... 168 GCACATTATG TTTTTTAGAG GCATTGGCTA CTAATTTACC ATGTTTCATA AACACACACA 6156 .......... .......... .......... .......... .......... .......... 168 CCAAGTCCAC TCGGCATGCA TCACAATACA CACCGAATAC CTTGGTATCC TCTGGTAAAG 6096 .......... .......... .......... .......... .......... .......... 168 TCAACACCGG AGTGGAAGTA AGCATATCCT TCAATATTTG AGAGCTTCTT TCACATGCCT 6036 .......... .......... .......... .......... .......... .......... 168 CCGACCACTC GTGTTTCACA TTTTTTTGGG CCGGATTAGT AAAGGGAGAT GCAATGGGTG 5976 .......... .......... .......... .......... .......... .......... 168 CAAAACCATA AAAAAACATC CTATAATAAC TTGTTACGCC CAAGAAACTC CTAATATTAG 5916 .......... .......... .......... .......... .......... .......... 168 TTGGAGTCAA TGGTCTAGGC ATATTTTTCA CCGCCTTGGA TTTCTGTGGA TCAATATGAA 5856 .......... .......... .......... .......... .......... .......... 168 CCCCCTCACT TGATATGATG TGACCATTAT ATGAAACTTA CCTTAACCGA AAAACTCATA 5796 .......... .......... .......... .......... .......... .......... 168 TTTGTTATAC TTAGCAAACA TTTGTTTCTC CTTATGCATT TGTAACACCG CCCTCTAGTG 5736 .......... .......... .......... .......... .......... .......... 168 GTCCATGTGT TCACCCTTAT TTTTCAAATA TACCAATATG TCGTTAATGA AAACAATGAC 5676 .......... .......... .......... .......... .......... .......... 168 AAATAAATCC AGGTTTCTTA AAACACGCTA TTCATGAGAT CCATAATTTT TGCCGAGGCA 5616 .......... .......... .......... .......... .......... .......... 168 TTAGTGAAAC CAAAGGACAT TACTAAGAAC TCATAATGAC TATATCTAGT AAGAATTATC 5556 .......... .......... .......... .......... .......... .......... 168 GTTTTTGTAT ATCCTTAGCT CTCACCCTAT GTTGGTGATA CCCCGATCTC AAGTTAATCT 5496 .......... .......... .......... .......... .......... .......... 168 TAGAAAAGTA GCTTGCCCAT TGGAGTTGAT CAAACAAGTC CTCAATCCGA GGGAGAGGAT 5436 .......... .......... .......... .......... .......... .......... 168 ACTTGTTCCT TATAGTGACT TTATTGAGTT GGCGATAATA TCTGCACATT ATAAGGGACC 5376 .......... .......... .......... .......... .......... .......... 168 CATCCTTCTT CTTGAAAAAC AATACCACAG AACCCCTTGG AGAAATACTA GGTCGAATGA 5316 .......... .......... .......... .......... .......... .......... 168 AGACTTTTTC TAGTATGTCT TTGAGTTGAG ACTTCAACTC TTTCAATTTA GTCGGAGCCA 5256 .......... .......... .......... .......... .......... .......... 168 TCCAATAAGG AGGAACTGAT ATGGTATTGG TTTCCGGAGT AAGTCAATAG CACAATCAAT 5196 .......... .......... .......... .......... .......... .......... 168 TTTTTGTTCG GGAGGGATTA TGTAAAGATT TTGAGGAAAG GCCTCTAGAA ATTCCATTAT 5136 .......... .......... .......... .......... .......... .......... 168 AACCAAGACT ATTTCAATTG GAGGATTCTC GTAGTCTAAA TCATTGGCTC TTATTATATT 5076 .......... .......... .......... .......... .......... .......... 168 ATATCGACAC CTTTTTTTAG ATCTTTTACA AACTTTCAAA TAAGAAATAA TACGACCTCT 5016 .......... .......... .......... .......... .......... .......... 168 AAAAATTTCC CCCCTTCCAA TCTATAATGG ACTGATTTGG AAAGTTAAAC TTCTCCACCC 4956 .......... .......... .......... .......... .......... .......... 168 CTTTTCTACA ATCTATGTAG GCAAAGCAAG CATCAACCAA TCCATCCGCA AAATGACAAC 4896 .......... .......... .......... .......... .......... .......... 168 AAAATCTACC ATATCAAATT CTACTAGTTC AACATGTGTT AATTTATTGG CCAACATTAT 4836 .......... .......... .......... .......... .......... .......... 168 AGGACAATTC CTATATACTT TCTTTGTAAC CACCTACTCA CCTATCGGGG TAGTTAATAT 4776 .......... .......... .......... .......... .......... .......... 168 GAAATGTTCA TTAGAATATC GAGCAAAATG TCAAACATTC TAATTATCAA GGGTGTATAC 4716 .......... .......... .......... .......... .......... .......... 168 ATATATAATG TATCACCCGG ATGGAGTAAA TAAAAAAAAA TCAATAGAGA AGACTTTCAA 4656 .......... .......... .......... .......... .......... .......... 168 CATACAAGTT ACCACATCGG GAGAAGTCTC TTTCTCACCT ATAAAGCAGA CAGAATAAAA 4596 .......... .......... .......... .......... .......... .......... 168 GTGATTCTTC TTCAGAGCAT CAACATTAGA ACAACTAGCT TTCCCACTAC CTTTGTCTTT 4536 .......... .......... .......... .......... .......... .......... 168 CCCCTTCACA TTTGGGAAAT CCCTAGCCTT TATACAACCC TCCCCACATA CAAACTAAAA 4476 .......... .......... .......... .......... .......... .......... 168 CGTCCGTCCC AATAAGGAAA TCACCATAAT GCTTCTTGGC ACACTTTCCA CAAGTTGGAT 4416 .......... .......... .......... .......... .......... .......... 168 TCTTGCTTGG TGCGATAATA CCTCTACCCT TTTTAGACTT AGGGTTAGAT ACCATATCGT 4356 .......... .......... .......... .......... .......... .......... 168 CACTAGACTT GGGATATTTT TTAGGAAATT TATTAGAAGA CCTCTTCTTA ATTGTTATGC 4296 .......... .......... .......... .......... .......... .......... 168 CTCGTATTTT TTTTATACGT AGTGCGCATC GTGATCTAGA AGACGTAAAG AAATATTAGG 4236 .......... .......... .......... .......... .......... .......... 168 CAAGGATGTT ATTTCCAAAT GTGATATTAA GTATGAGTTG GTTAATGTAA GTGCCATTAA 4176 .......... .......... .......... .......... .......... .......... 168 CTTTAAGTGA GGGATTAATT AGTGGCTAAT TTGGATTGAT TTAATCCAAT GGGCCCCACC 4116 .......... .......... .......... .......... .......... .......... 168 ACTCAAGGCA AAAATTAAAA AGGGATCAGA TTGTGGGCTG GCCTAAGTGG ACAGGTGTAG 4056 .......... .......... .......... .......... .......... .......... 168 AGGGGGGCTG CCTCCACTTC ATACTATTAA ATGAGGTGGA GAAATCCACT CCATGCTATA 3996 .......... .......... .......... .......... .......... .......... 168 TAAAGTGGTG AAATGCATTG CTGCATATCA TCTTCTTCTT CACCACTTGT CTTAGGCAGC 3936 .......... .......... .......... .......... .......... .......... 168 CATGGAAATG GAGAAACAAA CCCTGCAACT CTTGGCCAGC AGCTGCAAAT AATTTGGTTA 3876 .......... .......... .......... .......... .......... .......... 168 GTAATCTCCT TGTTTGGTGT GTTAATTCTT TAGAATACCC TTGTTAATTA TCCATTAATT 3816 .......... .......... .......... .......... .......... .......... 168 TTAAGAAGGG GGCGTGACCA GTAGCTTAGG AAGTTTGTTT TAGTTATTGA ATGTGCTAAG 3756 .......... .......... .......... .......... .......... .......... 168 TATGAATGGA AACCATAATC GGATTATTAG TGGTGTCATG TTGGTGCTTG GGCTGTTTAT 3696 .......... .......... .......... .......... .......... .......... 168 ATGATTCTTT GGGTTATATG TGTTATTGGT ATTGCTGTGG ATAATTTGGA TTGTTGTCGG 3636 .......... .......... .......... .......... .......... .......... 168 ATTGGGACGA AGTAAGGAAA ATAGGGGAGG TGCTGCCGAA TTTTCGTTAG ATTATTAGCT 3576 .......... .......... .......... .......... .......... .......... 168 AGCTTACAAG AAAGTAAAGC ACGATGTTTA TCTAATTGCG GCACGATTGT TGCTTGTTAT 3516 .......... .......... .......... .......... .......... .......... 168 AGATTAATAG CTTGAGCAGT AAATAATGGA CGTGCGGCTC AATTATACGG TATGTAACGC 3456 .......... .......... .......... .......... .......... .......... 168 TGTCCCTTCT TTCTTTGCTT GGCATGACTT TTAAAAATAA GCGAATAACG GACAGATTTG 3396 .......... .......... .......... .......... .......... .......... 168 ATACTTACCT CTAAAGCGTC TAGGTGATGT ATATTCTTGC TTCCACAATT ATTCCTCTAT 3336 .......... .......... .......... .......... .......... .......... 168 ATATCGGTTA TGTCTAAGGC TATGATGATC TCTAATATCT ATGGTAATGC TTCTTAGAGT 3276 .......... .......... .......... .......... .......... .......... 168 CATTGAAATT TTACGTTTTC ATATCGTATT AAAGGTTCAT AATCTTGATA AAACATTAAT 3216 .......... .......... .......... .......... .......... .......... 168 CTTTGGTAAT ACTCCTTGCT GGTTCACGTT GATTGTTCTA TTGAGTTATA AGAAATGATT 3156 .......... .......... .......... .......... .......... .......... 168 TTAATTGCAT ATGGTTGCTC ATAATATTCT GCTCGTGCAT AGAGTCATTT ATCATTTCAC 3096 .......... .......... .......... .......... .......... .......... 168 CGAGTCCCGG GCCGGGTAAT GTTCGTGCGG AGTTTCTTGC ATATGTCACC GAGTTCCTCA 3036 ||||| |||| ||||| .......... .......... .......... .......... .....TCACC GAGTCCCTCA 183 CTAGAGGGCC GGGTATGTAT ATTATATATA TGATTGGTGA TGAGGATGGT TATGATGATG 2976 |||||||||| ||||| ||| | |||||| | CTAGAGGGCC GGGTA-CTAT GATGTATATA TA........ .......... .......... 214 ATGATGACGG AGATGACGTG ATGATTATTT TGCCGAGCCC TTTACTAGGG AAGCTGGGCA 2916 || |||||||||| ||||||| || ||||||||| ||| | |||| .......... .......ATG ATGATTATTT TGCCGAGTCC CTTACTAGGG AAGTTAGGCA 257 CCTTAAATGT TAAATATATG CATGATTTTC ACTTAAAAAG TATATGTGTA GCGATATTTT 2856 |||| |||| |||| ||||| |||||||||| |||||||||| || ||||||| | ||||| || TCTTATATGT TAAAGATATG CATGATTTTC ACTTAAAAAG TACATGTGTA GAGATATCTT 317 GTTTCGACTT GCCACATTGG TATCCTGTCA TCTTTACCTT ATGCTTTACA TACTCAGTAC 2796 |||||||||| || |||| |||||||||| |||||||||| |||||||||| |||||||||| GTTTCGACTT ATCATGTTGG TATCCTGTCA TCTTTACCTT ATGCTTTACA TACTCAGTAC 377 ATTGTCCGTA CTGACCCCCC TTTCCTCGGG GGGCTGCGTT TCATGCCTGC AGGTGTAGAC 2736 |||||||||| |||| ||||| ||| |||||| |||||||||| ||||||| || |||||||||| ATTGTCCGTA CTGA-CCCCC TTTTCTCGGG GGGCTGCGTT TCATGCCCGC AGGTGTAGAC 436 GCGCAGTTCG GTGATCCTCC CGCCTAGGAT ATCTACTCTG CTGATTGGGA GAGCTCCACT 2676 || ||||||| |||||||||| |||||||||| |||||||||| || |||||| |||||||||| GCTCAGTTCG GTGATCCTCC CGCCTAGGAT ATCTACTCTG CTTTTTGGGA GAGCTCCACT 496 GTTCCGGAGC CCAGTCGTTT TGGTACATAA CTT-TTGTGT AGGCTTTTGC TCGTCTATGG 2617 |||||||||| |||||||||| |||||||||| ||| || ||| || ||||||| | |||||||| GTTCCGGAGC CCAGTCGTTT TGGTACATAA CTTCTTATGT AGTCTTTTGC TTGTCTATGG 556 GTATGGCGGG GCCCTGTCCC GTCGAGTTTC ACTAATGTAC TCTTAGAGGT CTGTGGACAT 2557 |||||||||| |||||||||| ||| |||||| |||| | ||| |||||||||| |||| ||||| GTATGGCGGG GCCCTGTCCC GTCAAGTTTC ACTACTATAC TCTTAGAGGT CTGTAGACAT 616 TATGTGGGTT GTATATATAT GTTTTGGATA ATGGTCTGGA CATGGTTTGT TTGGGATGTC 2497 |||||||| ||||| ||| |||||||||| |||||||||| |||||||||| |||||||||| CGTGTGGGTT GTATAATTAT GTTTTGGATA ATGGTCTGGA CATGGTTTGT TTGGGATGTC 676 CGCTTGTACA GGGGCAGCCT TGTCGGCTGC GTACATCATT ATGCTTTGAA TAGTGGCGGC 2437 | ||||||| | ||||||| |||||| || | |||||||| || ||| ||||||| || CATTTGTACA AGTGCAGCCT TGTCGGTTGT GAACATCATT GTGTATTGTG TAGTGGCAGC 736 CTTGTCGGCT CGCGTATGCT GTTATG 2411 || ||||||| ||||||||| ||||| CTCGTCGGCT -GCGTATGCT ATTATG 761 hqPGS_C06HBa0057J04.1-9-_SGN-E544254- (8255 8088,3050 3004,2958 2411) ******************************************************************************** EST sequence 28 +strand 519 n (File: SGN-E310669+) 1 AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 61 ATTTATACCT TGAGCAGTAA ATATTGGACG TACGGCTCGA CTATTCGGTG TAGACGCTCA 121 GTTCGGTGAT CCTCCCGCCT AGGATATCTA CTCTGCTGTT TGGGAGAGCT CCACTGTTCC 181 GGAGCCCAGT CGTTTTGGTA CATAACTTCT TATGTAGTCT TTTGCTTGTC TATGGGTATG 241 GCGGGGCCCT GTCCCGTCAA GTTTCACTAC TATACTCTTA GAGGTCTGTA GACATCGTGT 301 GGGTTGTATA ATTATGTTTT GGATAATGGT CTGGACATGG TTTGTTTGGG ATGTCCATTT 361 GTACAAGTGC AGCCTTGTCG GTTGTGAACA TCATTGTGTA TTGTGTAGTG GCAGCCTCGT 421 CGGCTGCGTA TGCTATTATG TTTTGGATAG TGGCGGCCTT GTCGGCTCGC ATATGTTGTT 481 ACGATTTAAT GGTTATGACT CTTTATGAAA AAAAAAAAA Predicted gene structure (within gDNA segment 4512 to 256): Exon 1 3938 3880 ( 59 n); cDNA 1 60 ( 60 n); score: 0.822 Intron 1 3879 3514 ( 366 n); Pd: 0.995 (s: 0.79), Pa: 0.889 (s: 0.85) Exon 2 3513 3467 ( 47 n); cDNA 61 107 ( 47 n); score: 0.851 Intron 2 3466 2744 ( 723 n); Pd: 0.990 (s: 0.85), Pa: 0.000 (s: 0.98) Exon 3 2743 2411 ( 333 n); cDNA 108 440 ( 333 n); score: 0.905 PPA cDNA 508 519 MATCH C06HBa0057J04.1-9- SGN-E310669+ 0.893 439 0.846 C PGS_C06HBa0057J04.1-9-_SGN-E310669+ (3938 3880,3513 3467,2743 2411) Alignment (genomic DNA sequence = upper lines): AGCCATGGAA ATGGAGA-AA CAAACCCTGC AACTCTTGGC CAGCAGCTGC AAATAATTTG 3880 |||||||||| ||||||| || | | || | ||||||||| |||||||||| ||| || ||| AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 60 GTTAGTAATC TCCTTGTTTG GTGTGTTAAT TCTTTAGAAT ACCCTTGTTA ATTATCCATT 3820 .......... .......... .......... .......... .......... .......... 60 AATTTTAAGA AGGGGGCGTG ACCAGTAGCT TAGGAAGTTT GTTTTAGTTA TTGAATGTGC 3760 .......... .......... .......... .......... .......... .......... 60 TAAGTATGAA TGGAAACCAT AATCGGATTA TTAGTGGTGT CATGTTGGTG CTTGGGCTGT 3700 .......... .......... .......... .......... .......... .......... 60 TTATATGATT CTTTGGGTTA TATGTGTTAT TGGTATTGCT GTGGATAATT TGGATTGTTG 3640 .......... .......... .......... .......... .......... .......... 60 TCGGATTGGG ACGAAGTAAG GAAAATAGGG GAGGTGCTGC CGAATTTTCG TTAGATTATT 3580 .......... .......... .......... .......... .......... .......... 60 AGCTAGCTTA CAAGAAAGTA AAGCACGATG TTTATCTAAT TGCGGCACGA TTGTTGCTTG 3520 .......... .......... .......... .......... .......... .......... 60 TTATAGATTA ATAGCTTGAG CAGTAAATAA TGGACGTGCG GCTCAATTAT ACGGTATGTA 3460 ||| ||| |||||| ||||||||| ||||||| || |||| | ||| || ......ATTT ATACCTTGAG CAGTAAATAT TGGACGTACG GCTCGACTAT TCG....... 107 ACGCTGTCCC TTCTTTCTTT GCTTGGCATG ACTTTTAAAA ATAAGCGAAT AACGGACAGA 3400 .......... .......... .......... .......... .......... .......... 107 TTTGATACTT ACCTCTAAAG CGTCTAGGTG ATGTATATTC TTGCTTCCAC AATTATTCCT 3340 .......... .......... .......... .......... .......... .......... 107 CTATATATCG GTTATGTCTA AGGCTATGAT GATCTCTAAT ATCTATGGTA ATGCTTCTTA 3280 .......... .......... .......... .......... .......... .......... 107 GAGTCATTGA AATTTTACGT TTTCATATCG TATTAAAGGT TCATAATCTT GATAAAACAT 3220 .......... .......... .......... .......... .......... .......... 107 TAATCTTTGG TAATACTCCT TGCTGGTTCA CGTTGATTGT TCTATTGAGT TATAAGAAAT 3160 .......... .......... .......... .......... .......... .......... 107 GATTTTAATT GCATATGGTT GCTCATAATA TTCTGCTCGT GCATAGAGTC ATTTATCATT 3100 .......... .......... .......... .......... .......... .......... 107 TCACCGAGTC CCGGGCCGGG TAATGTTCGT GCGGAGTTTC TTGCATATGT CACCGAGTTC 3040 .......... .......... .......... .......... .......... .......... 107 CTCACTAGAG GGCCGGGTAT GTATATTATA TATATGATTG GTGATGAGGA TGGTTATGAT 2980 .......... .......... .......... .......... .......... .......... 107 GATGATGATG ACGGAGATGA CGTGATGATT ATTTTGCCGA GCCCTTTACT AGGGAAGCTG 2920 .......... .......... .......... .......... .......... .......... 107 GGCACCTTAA ATGTTAAATA TATGCATGAT TTTCACTTAA AAAGTATATG TGTAGCGATA 2860 .......... .......... .......... .......... .......... .......... 107 TTTTGTTTCG ACTTGCCACA TTGGTATCCT GTCATCTTTA CCTTATGCTT TACATACTCA 2800 .......... .......... .......... .......... .......... .......... 107 GTACATTGTC CGTACTGACC CCCCTTTCCT CGGGGGGCTG CGTTTCATGC CTGCAGGTGT 2740 |||| .......... .......... .......... .......... .......... ......GTGT 111 AGACGCGCAG TTCGGTGATC CTCCCGCCTA GGATATCTAC TCTGCTGATT GGGAGAGCTC 2680 |||||| ||| |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| AGACGCTCAG TTCGGTGATC CTCCCGCCTA GGATATCTAC TCTGCTGTTT GGGAGAGCTC 171 CACTGTTCCG GAGCCCAGTC GTTTTGGTAC ATAACTT-TT GTGTAGGCTT TTGCTCGTCT 2621 |||||||||| |||||||||| |||||||||| ||||||| || ||||| ||| ||||| |||| CACTGTTCCG GAGCCCAGTC GTTTTGGTAC ATAACTTCTT ATGTAGTCTT TTGCTTGTCT 231 ATGGGTATGG CGGGGCCCTG TCCCGTCGAG TTTCACTAAT GTACTCTTAG AGGTCTGTGG 2561 |||||||||| |||||||||| ||||||| || |||||||| | ||||||||| |||||||| | ATGGGTATGG CGGGGCCCTG TCCCGTCAAG TTTCACTACT ATACTCTTAG AGGTCTGTAG 291 ACATTATGTG GGTTGTATAT ATATGTTTTG GATAATGGTC TGGACATGGT TTGTTTGGGA 2501 |||| |||| ||||||||| ||||||||| |||||||||| |||||||||| |||||||||| ACATCGTGTG GGTTGTATAA TTATGTTTTG GATAATGGTC TGGACATGGT TTGTTTGGGA 351 TGTCCGCTTG TACAGGGGCA GCCTTGTCGG CTGCGTACAT CATTATGCTT TGAATAGTGG 2441 ||||| ||| |||| | ||| |||||||||| || | |||| |||| || | || |||||| TGTCCATTTG TACAAGTGCA GCCTTGTCGG TTGTGAACAT CATTGTGTAT TGTGTAGTGG 411 CGGCCTTGTC GGCTCGCGTA TGCTGTTATG 2411 | |||| ||| |||| ||||| |||| ||||| CAGCCTCGTC GGCT-GCGTA TGCTATTATG 440 hqPGS_C06HBa0057J04.1-9-_SGN-E310669+ (3938 3880,3513 3467,2743 2411) ******************************************************************************** EST sequence 11 +strand 606 n (File: SGN-E538151+) 1 GAGAAAACCA GCCATTAAAC TCTTGGACAG CAGCTGCAAA GAAATTGAGT CATTTATCAT 61 TTCACCGAGT CCCGGGCCGG GTAATGTTCG TGCGGAGTTT CTTGCATATG TCACCGAGTC 121 CCTCACTAGA GGGCCGGGAA TGTATATTAT ATATATGATT GGTGATGAGG ATGGTTATGA 181 TGATGATGAT GACGGAGATG ATGTGATGAC TATTTCACTG AGTCCCTCAC TAGAGGGCCG 241 GGTGTAGACG CTCAGTTTGG TGATCCTCCC GCCTAGGATA TCTACTCTGC TGTTTGGGAG 301 AGCTCCACTG TTCCGGAGCC CAGTCGTTTT GGTACATAAC TTCTTATGTA GTCTTTTGCT 361 TGTCTATGGG TATGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC TTAGAGGTCT 421 GTAGACATCG TGTGGGTTGT ATAATTATGT TTTTGATAAT GGTCTGGACA TGGTTTGTTT 481 GGGATGTCCA CTTGTACAAG TGCAACCTTG TCGGTTGTGT ACATCTTTGT GTATTGTGTA 541 GTGGCAGCCT TGTCGGCTGC GTATGCTATT ATGCTTTGAA TAGTGGCAGC CTTGTCGGCT 601 CGCGTA Predicted gene structure (within gDNA segment 4174 to 1811): Exon 1 3922 3880 ( 43 n); cDNA 5 47 ( 43 n); score: 0.814 Intron 1 3879 3114 ( 766 n); Pd: 0.995 (s: 0.81), Pa: 0.975 (s: 1.00) Exon 2 3113 2919 ( 195 n); cDNA 48 241 ( 194 n); score: 0.928 Intron 2 2918 2744 ( 175 n); Pd: 0.000 (s: 0.76), Pa: 0.000 (s: 0.96) Exon 3 2743 2411 ( 333 n); cDNA 242 573 ( 332 n); score: 0.899 MATCH C06HBa0057J04.1-9- SGN-E538151+ 0.910 571 0.942 C PGS_C06HBa0057J04.1-9-_SGN-E538151+ (3922 3880,3113 2919,2743 2411) Alignment (genomic DNA sequence = upper lines): AAACAAACCC TGCAACTCTT GGCCAGCAGC TGCAAATAAT TTGGTTAGTA ATCTCCTTGT 3863 |||| | || | ||||||| || ||||||| |||||| || ||| AAACCAGCCA TTAAACTCTT GGACAGCAGC TGCAAAGAAA TTG....... .......... 47 TTGGTGTGTT AATTCTTTAG AATACCCTTG TTAATTATCC ATTAATTTTA AGAAGGGGGC 3803 .......... .......... .......... .......... .......... .......... 47 GTGACCAGTA GCTTAGGAAG TTTGTTTTAG TTATTGAATG TGCTAAGTAT GAATGGAAAC 3743 .......... .......... .......... .......... .......... .......... 47 CATAATCGGA TTATTAGTGG TGTCATGTTG GTGCTTGGGC TGTTTATATG ATTCTTTGGG 3683 .......... .......... .......... .......... .......... .......... 47 TTATATGTGT TATTGGTATT GCTGTGGATA ATTTGGATTG TTGTCGGATT GGGACGAAGT 3623 .......... .......... .......... .......... .......... .......... 47 AAGGAAAATA GGGGAGGTGC TGCCGAATTT TCGTTAGATT ATTAGCTAGC TTACAAGAAA 3563 .......... .......... .......... .......... .......... .......... 47 GTAAAGCACG ATGTTTATCT AATTGCGGCA CGATTGTTGC TTGTTATAGA TTAATAGCTT 3503 .......... .......... .......... .......... .......... .......... 47 GAGCAGTAAA TAATGGACGT GCGGCTCAAT TATACGGTAT GTAACGCTGT CCCTTCTTTC 3443 .......... .......... .......... .......... .......... .......... 47 TTTGCTTGGC ATGACTTTTA AAAATAAGCG AATAACGGAC AGATTTGATA CTTACCTCTA 3383 .......... .......... .......... .......... .......... .......... 47 AAGCGTCTAG GTGATGTATA TTCTTGCTTC CACAATTATT CCTCTATATA TCGGTTATGT 3323 .......... .......... .......... .......... .......... .......... 47 CTAAGGCTAT GATGATCTCT AATATCTATG GTAATGCTTC TTAGAGTCAT TGAAATTTTA 3263 .......... .......... .......... .......... .......... .......... 47 CGTTTTCATA TCGTATTAAA GGTTCATAAT CTTGATAAAA CATTAATCTT TGGTAATACT 3203 .......... .......... .......... .......... .......... .......... 47 CCTTGCTGGT TCACGTTGAT TGTTCTATTG AGTTATAAGA AATGATTTTA ATTGCATATG 3143 .......... .......... .......... .......... .......... .......... 47 GTTGCTCATA ATATTCTGCT CGTGCATAGA GTCATTTATC ATTTCACCGA GTCCCGGGCC 3083 | |||||||||| |||||||||| |||||||||| .......... .......... .........A GTCATTTATC ATTTCACCGA GTCCCGGGCC 78 GGGTAATGTT CGTGCGGAGT TTCTTGCATA TGTCACCGAG TTCCTCACTA GAGGGCCGGG 3023 |||||||||| |||||||||| |||||||||| |||||||||| | |||||||| |||||||||| GGGTAATGTT CGTGCGGAGT TTCTTGCATA TGTCACCGAG TCCCTCACTA GAGGGCCGGG 138 TATGTATATT ATATATATGA TTGGTGATGA GGATGGTTAT GATGATGATG ATGACGGAGA 2963 ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AATGTATATT ATATATATGA TTGGTGATGA GGATGGTTAT GATGATGATG ATGACGGAGA 198 TGACGTGATG ATTATTTTGC CGAGCCCTTT ACTAGGGAAG CTGGGCACCT TAAATGTTAA 2903 ||| |||||| | ||||| | ||| || | ||||| | | | || TGATGTGATG ACTATTTCAC TGAGTCCCTC ACTAGAG-GG CCGG...... .......... 241 ATATATGCAT GATTTTCACT TAAAAAGTAT ATGTGTAGCG ATATTTTGTT TCGACTTGCC 2843 .......... .......... .......... .......... .......... .......... 241 ACATTGGTAT CCTGTCATCT TTACCTTATG CTTTACATAC TCAGTACATT GTCCGTACTG 2783 .......... .......... .......... .......... .......... .......... 241 ACCCCCCTTT CCTCGGGGGG CTGCGTTTCA TGCCTGCAGG TGTAGACGCG CAGTTCGGTG 2723 | ||||||||| ||||| |||| .......... .......... .......... .........G TGTAGACGCT CAGTTTGGTG 262 ATCCTCCCGC CTAGGATATC TACTCTGCTG ATTGGGAGAG CTCCACTGTT CCGGAGCCCA 2663 |||||||||| |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| ATCCTCCCGC CTAGGATATC TACTCTGCTG TTTGGGAGAG CTCCACTGTT CCGGAGCCCA 322 GTCGTTTTGG TACATAACTT -TTGTGTAGG CTTTTGCTCG TCTATGGGTA TGGCGGGGCC 2604 |||||||||| |||||||||| || ||||| |||||||| | |||||||||| | |||||||| GTCGTTTTGG TACATAACTT CTTATGTAGT CTTTTGCTTG TCTATGGGTA T-GCGGGGCC 381 CTGTCCCGTC GAGTTTCACT AATGTACTCT TAGAGGTCTG TGGACATTAT GTGGGTTGTA 2544 |||||||||| ||||||||| | | |||||| |||||||||| | ||||| | |||||||||| CTGTCCCGTC AAGTTTCACT ACTATACTCT TAGAGGTCTG TAGACATCGT GTGGGTTGTA 441 TATATATGTT TTGGATAATG GTCTGGACAT GGTTTGTTTG GGATGTCCGC TTGTACAGGG 2484 || |||||| || ||||||| |||||||||| |||||||||| |||||||| | ||||||| | TAATTATGTT TTTGATAATG GTCTGGACAT GGTTTGTTTG GGATGTCCAC TTGTACAAGT 501 GCAGCCTTGT CGGCTGCGTA CATCATTATG CTTTGAATAG TGGCGGCCTT GTCGGCTCGC 2424 ||| |||||| ||| || ||| |||| || || ||| ||| |||| ||||| ||||||| || GCAACCTTGT CGGTTGTGTA CATCTTTGTG TATTGTGTAG TGGCAGCCTT GTCGGCT-GC 560 GTATGCTGTT ATG 2411 ||||||| || ||| GTATGCTATT ATG 573 hqPGS_C06HBa0057J04.1-9-_SGN-E538151+ (3922 3880,3113 2919,2743 2411) ******************************************************************************** EST sequence 13 +strand 644 n (File: SGN-E538156+) 1 GAGAAAACCA GCCATTAAAC TCTTGGACAG CAGCTGCAAA GAAATTGAGT CATTTATCAT 61 TTCACCGAGT CCCGGGCCGG GTAATGTTCG TGCGGAGTTT CTTGCATATG TCACCGAGTC 121 CCTCACTAGA GGGCCGGGAA TGTATATTAT ATATATGATT GGTGATGAGG ATGGTTATGA 181 TGATGATGAT GACGGAGATG ATGTGATGAC TATTTCACTG AGTCCCTCAC TAGAGGGCCG 241 GGTGTAGACG CTCAGTTTGG TGATCCTCCC GCCTAGGATA TCTACTCTGC TGTTTGGGAG 301 AGCTCCACTG TTCCGGAGCC CAGTCGTTGT GGTACATAAC TTCTTATGTA GTCTTTTGCT 361 TGTCTATGGG TATGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC TTAGAGGTCT 421 GTAGACATCG TGTGGGTTGT ATAATTATGT TTTTGATAAT GGTCTGGACA TGGTTTGTTT 481 GGGATGTCCA CTTGTACAAG TGCAACCTTG TCGGTTGTGT ACATCTTTGT GTATTGTGTA 541 GTGGCAGCCT TGACGGCTGC GTATGCTATT ATGCTTTGAA TAGTGGCAGC CTTGTCGGCT 601 CGCGTATGTT GTTACGGTTG AATGGGTATG ACTCTTTATG AGAT Predicted gene structure (within gDNA segment 4174 to 1449): Exon 1 3922 3880 ( 43 n); cDNA 5 47 ( 43 n); score: 0.814 Intron 1 3879 3114 ( 766 n); Pd: 0.995 (s: 0.81), Pa: 0.975 (s: 1.00) Exon 2 3113 2919 ( 195 n); cDNA 48 241 ( 194 n); score: 0.928 Intron 2 2918 2744 ( 175 n); Pd: 0.000 (s: 0.76), Pa: 0.000 (s: 0.96) Exon 3 2743 2411 ( 333 n); cDNA 242 573 ( 332 n); score: 0.893 MATCH C06HBa0057J04.1-9- SGN-E538156+ 0.906 571 0.887 C PGS_C06HBa0057J04.1-9-_SGN-E538156+ (3922 3880,3113 2919,2743 2411) Alignment (genomic DNA sequence = upper lines): AAACAAACCC TGCAACTCTT GGCCAGCAGC TGCAAATAAT TTGGTTAGTA ATCTCCTTGT 3863 |||| | || | ||||||| || ||||||| |||||| || ||| AAACCAGCCA TTAAACTCTT GGACAGCAGC TGCAAAGAAA TTG....... .......... 47 TTGGTGTGTT AATTCTTTAG AATACCCTTG TTAATTATCC ATTAATTTTA AGAAGGGGGC 3803 .......... .......... .......... .......... .......... .......... 47 GTGACCAGTA GCTTAGGAAG TTTGTTTTAG TTATTGAATG TGCTAAGTAT GAATGGAAAC 3743 .......... .......... .......... .......... .......... .......... 47 CATAATCGGA TTATTAGTGG TGTCATGTTG GTGCTTGGGC TGTTTATATG ATTCTTTGGG 3683 .......... .......... .......... .......... .......... .......... 47 TTATATGTGT TATTGGTATT GCTGTGGATA ATTTGGATTG TTGTCGGATT GGGACGAAGT 3623 .......... .......... .......... .......... .......... .......... 47 AAGGAAAATA GGGGAGGTGC TGCCGAATTT TCGTTAGATT ATTAGCTAGC TTACAAGAAA 3563 .......... .......... .......... .......... .......... .......... 47 GTAAAGCACG ATGTTTATCT AATTGCGGCA CGATTGTTGC TTGTTATAGA TTAATAGCTT 3503 .......... .......... .......... .......... .......... .......... 47 GAGCAGTAAA TAATGGACGT GCGGCTCAAT TATACGGTAT GTAACGCTGT CCCTTCTTTC 3443 .......... .......... .......... .......... .......... .......... 47 TTTGCTTGGC ATGACTTTTA AAAATAAGCG AATAACGGAC AGATTTGATA CTTACCTCTA 3383 .......... .......... .......... .......... .......... .......... 47 AAGCGTCTAG GTGATGTATA TTCTTGCTTC CACAATTATT CCTCTATATA TCGGTTATGT 3323 .......... .......... .......... .......... .......... .......... 47 CTAAGGCTAT GATGATCTCT AATATCTATG GTAATGCTTC TTAGAGTCAT TGAAATTTTA 3263 .......... .......... .......... .......... .......... .......... 47 CGTTTTCATA TCGTATTAAA GGTTCATAAT CTTGATAAAA CATTAATCTT TGGTAATACT 3203 .......... .......... .......... .......... .......... .......... 47 CCTTGCTGGT TCACGTTGAT TGTTCTATTG AGTTATAAGA AATGATTTTA ATTGCATATG 3143 .......... .......... .......... .......... .......... .......... 47 GTTGCTCATA ATATTCTGCT CGTGCATAGA GTCATTTATC ATTTCACCGA GTCCCGGGCC 3083 | |||||||||| |||||||||| |||||||||| .......... .......... .........A GTCATTTATC ATTTCACCGA GTCCCGGGCC 78 GGGTAATGTT CGTGCGGAGT TTCTTGCATA TGTCACCGAG TTCCTCACTA GAGGGCCGGG 3023 |||||||||| |||||||||| |||||||||| |||||||||| | |||||||| |||||||||| GGGTAATGTT CGTGCGGAGT TTCTTGCATA TGTCACCGAG TCCCTCACTA GAGGGCCGGG 138 TATGTATATT ATATATATGA TTGGTGATGA GGATGGTTAT GATGATGATG ATGACGGAGA 2963 ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AATGTATATT ATATATATGA TTGGTGATGA GGATGGTTAT GATGATGATG ATGACGGAGA 198 TGACGTGATG ATTATTTTGC CGAGCCCTTT ACTAGGGAAG CTGGGCACCT TAAATGTTAA 2903 ||| |||||| | ||||| | ||| || | ||||| | | | || TGATGTGATG ACTATTTCAC TGAGTCCCTC ACTAGAG-GG CCGG...... .......... 241 ATATATGCAT GATTTTCACT TAAAAAGTAT ATGTGTAGCG ATATTTTGTT TCGACTTGCC 2843 .......... .......... .......... .......... .......... .......... 241 ACATTGGTAT CCTGTCATCT TTACCTTATG CTTTACATAC TCAGTACATT GTCCGTACTG 2783 .......... .......... .......... .......... .......... .......... 241 ACCCCCCTTT CCTCGGGGGG CTGCGTTTCA TGCCTGCAGG TGTAGACGCG CAGTTCGGTG 2723 | ||||||||| ||||| |||| .......... .......... .......... .........G TGTAGACGCT CAGTTTGGTG 262 ATCCTCCCGC CTAGGATATC TACTCTGCTG ATTGGGAGAG CTCCACTGTT CCGGAGCCCA 2663 |||||||||| |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| ATCCTCCCGC CTAGGATATC TACTCTGCTG TTTGGGAGAG CTCCACTGTT CCGGAGCCCA 322 GTCGTTTTGG TACATAACTT -TTGTGTAGG CTTTTGCTCG TCTATGGGTA TGGCGGGGCC 2604 |||||| ||| |||||||||| || ||||| |||||||| | |||||||||| | |||||||| GTCGTTGTGG TACATAACTT CTTATGTAGT CTTTTGCTTG TCTATGGGTA T-GCGGGGCC 381 CTGTCCCGTC GAGTTTCACT AATGTACTCT TAGAGGTCTG TGGACATTAT GTGGGTTGTA 2544 |||||||||| ||||||||| | | |||||| |||||||||| | ||||| | |||||||||| CTGTCCCGTC AAGTTTCACT ACTATACTCT TAGAGGTCTG TAGACATCGT GTGGGTTGTA 441 TATATATGTT TTGGATAATG GTCTGGACAT GGTTTGTTTG GGATGTCCGC TTGTACAGGG 2484 || |||||| || ||||||| |||||||||| |||||||||| |||||||| | ||||||| | TAATTATGTT TTTGATAATG GTCTGGACAT GGTTTGTTTG GGATGTCCAC TTGTACAAGT 501 GCAGCCTTGT CGGCTGCGTA CATCATTATG CTTTGAATAG TGGCGGCCTT GTCGGCTCGC 2424 ||| |||||| ||| || ||| |||| || || ||| ||| |||| ||||| | ||||| || GCAACCTTGT CGGTTGTGTA CATCTTTGTG TATTGTGTAG TGGCAGCCTT GACGGCT-GC 560 GTATGCTGTT ATG 2411 ||||||| || ||| GTATGCTATT ATG 573 hqPGS_C06HBa0057J04.1-9-_SGN-E538156+ (3922 3880,3113 2919,2743 2411) ******************************************************************************** EST sequence 22 +strand 470 n (File: SGN-E268096+) 1 GAAAACCAGC CATTAAACTC TTGGACAGCA GCTGCAAAGA AATTGAGTCA TTTATCATTG 61 CACCGAGTCC CGGGCCGGGT AATGTTCGTG CGGAGTTTCT TGCATATGTC ACCGAGTCCC 121 TCACTAGAGG GCCGGGAATG TATATTATAT ATATGATTGG TGATGAGGAT GGTTATGATG 181 ATGATGATGA CGGAGATGAT GTGATGACTA TTTCACTGAG TCCCTCACTA GAGGGCCGGG 241 TGTAGACGCT CAGTTTGGTG ATCCTCCCGC CTAGGATATC TACTCTGCTG TTTGGGAGAG 301 CTCCACTGTT CCGGAGCCCA GTCGTTTTGG TACATAACTT CTTATGTAGT CTTTTGCTTG 361 TCTATGGGTA TGCGGGGCCC TGTCCCGTCA AGTTTCACTA CTATACTCTT AGAGGTCTGT 421 AGACATCGTG TGGGTAGTAT AATTATGTTT TTGATAATGG GCTGGACATG Predicted gene structure (within gDNA segment 4298 to 1453): Exon 1 3922 3880 ( 43 n); cDNA 3 45 ( 43 n); score: 0.814 Intron 1 3879 3114 ( 766 n); Pd: 0.995 (s: 0.81), Pa: 0.975 (s: 0.98) Exon 2 3113 2919 ( 195 n); cDNA 46 239 ( 194 n); score: 0.923 Intron 2 2918 2744 ( 175 n); Pd: 0.000 (s: 0.76), Pa: 0.000 (s: 0.96) Exon 3 2743 2513 ( 231 n); cDNA 240 470 ( 231 n); score: 0.911 MATCH C06HBa0057J04.1-9- SGN-E268096+ 0.917 469 0.998 C PGS_C06HBa0057J04.1-9-_SGN-E268096+ (3922 3880,3113 2919,2743 2513) Alignment (genomic DNA sequence = upper lines): AAACAAACCC TGCAACTCTT GGCCAGCAGC TGCAAATAAT TTGGTTAGTA ATCTCCTTGT 3863 |||| | || | ||||||| || ||||||| |||||| || ||| AAACCAGCCA TTAAACTCTT GGACAGCAGC TGCAAAGAAA TTG....... .......... 45 TTGGTGTGTT AATTCTTTAG AATACCCTTG TTAATTATCC ATTAATTTTA AGAAGGGGGC 3803 .......... .......... .......... .......... .......... .......... 45 GTGACCAGTA GCTTAGGAAG TTTGTTTTAG TTATTGAATG TGCTAAGTAT GAATGGAAAC 3743 .......... .......... .......... .......... .......... .......... 45 CATAATCGGA TTATTAGTGG TGTCATGTTG GTGCTTGGGC TGTTTATATG ATTCTTTGGG 3683 .......... .......... .......... .......... .......... .......... 45 TTATATGTGT TATTGGTATT GCTGTGGATA ATTTGGATTG TTGTCGGATT GGGACGAAGT 3623 .......... .......... .......... .......... .......... .......... 45 AAGGAAAATA GGGGAGGTGC TGCCGAATTT TCGTTAGATT ATTAGCTAGC TTACAAGAAA 3563 .......... .......... .......... .......... .......... .......... 45 GTAAAGCACG ATGTTTATCT AATTGCGGCA CGATTGTTGC TTGTTATAGA TTAATAGCTT 3503 .......... .......... .......... .......... .......... .......... 45 GAGCAGTAAA TAATGGACGT GCGGCTCAAT TATACGGTAT GTAACGCTGT CCCTTCTTTC 3443 .......... .......... .......... .......... .......... .......... 45 TTTGCTTGGC ATGACTTTTA AAAATAAGCG AATAACGGAC AGATTTGATA CTTACCTCTA 3383 .......... .......... .......... .......... .......... .......... 45 AAGCGTCTAG GTGATGTATA TTCTTGCTTC CACAATTATT CCTCTATATA TCGGTTATGT 3323 .......... .......... .......... .......... .......... .......... 45 CTAAGGCTAT GATGATCTCT AATATCTATG GTAATGCTTC TTAGAGTCAT TGAAATTTTA 3263 .......... .......... .......... .......... .......... .......... 45 CGTTTTCATA TCGTATTAAA GGTTCATAAT CTTGATAAAA CATTAATCTT TGGTAATACT 3203 .......... .......... .......... .......... .......... .......... 45 CCTTGCTGGT TCACGTTGAT TGTTCTATTG AGTTATAAGA AATGATTTTA ATTGCATATG 3143 .......... .......... .......... .......... .......... .......... 45 GTTGCTCATA ATATTCTGCT CGTGCATAGA GTCATTTATC ATTTCACCGA GTCCCGGGCC 3083 | |||||||||| ||| |||||| |||||||||| .......... .......... .........A GTCATTTATC ATTGCACCGA GTCCCGGGCC 76 GGGTAATGTT CGTGCGGAGT TTCTTGCATA TGTCACCGAG TTCCTCACTA GAGGGCCGGG 3023 |||||||||| |||||||||| |||||||||| |||||||||| | |||||||| |||||||||| GGGTAATGTT CGTGCGGAGT TTCTTGCATA TGTCACCGAG TCCCTCACTA GAGGGCCGGG 136 TATGTATATT ATATATATGA TTGGTGATGA GGATGGTTAT GATGATGATG ATGACGGAGA 2963 ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AATGTATATT ATATATATGA TTGGTGATGA GGATGGTTAT GATGATGATG ATGACGGAGA 196 TGACGTGATG ATTATTTTGC CGAGCCCTTT ACTAGGGAAG CTGGGCACCT TAAATGTTAA 2903 ||| |||||| | ||||| | ||| || | ||||| | | | || TGATGTGATG ACTATTTCAC TGAGTCCCTC ACTAGAG-GG CCGG...... .......... 239 ATATATGCAT GATTTTCACT TAAAAAGTAT ATGTGTAGCG ATATTTTGTT TCGACTTGCC 2843 .......... .......... .......... .......... .......... .......... 239 ACATTGGTAT CCTGTCATCT TTACCTTATG CTTTACATAC TCAGTACATT GTCCGTACTG 2783 .......... .......... .......... .......... .......... .......... 239 ACCCCCCTTT CCTCGGGGGG CTGCGTTTCA TGCCTGCAGG TGTAGACGCG CAGTTCGGTG 2723 | ||||||||| ||||| |||| .......... .......... .......... .........G TGTAGACGCT CAGTTTGGTG 260 ATCCTCCCGC CTAGGATATC TACTCTGCTG ATTGGGAGAG CTCCACTGTT CCGGAGCCCA 2663 |||||||||| |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| ATCCTCCCGC CTAGGATATC TACTCTGCTG TTTGGGAGAG CTCCACTGTT CCGGAGCCCA 320 GTCGTTTTGG TACATAACTT -TTGTGTAGG CTTTTGCTCG TCTATGGGTA TGGCGGGGCC 2604 |||||||||| |||||||||| || ||||| |||||||| | |||||||||| | |||||||| GTCGTTTTGG TACATAACTT CTTATGTAGT CTTTTGCTTG TCTATGGGTA T-GCGGGGCC 379 CTGTCCCGTC GAGTTTCACT AATGTACTCT TAGAGGTCTG TGGACATTAT GTGGGTTGTA 2544 |||||||||| ||||||||| | | |||||| |||||||||| | ||||| | |||||| ||| CTGTCCCGTC AAGTTTCACT ACTATACTCT TAGAGGTCTG TAGACATCGT GTGGGTAGTA 439 TATATATGTT TTGGATAATG GTCTGGACAT G 2513 || |||||| || ||||||| | |||||||| | TAATTATGTT TTTGATAATG GGCTGGACAT G 470 hqPGS_C06HBa0057J04.1-9-_SGN-E268096+ (3922 3880,3113 2919,2743 2513) ******************************************************************************** EST sequence 3 -strand 573 n (File: SGN-E538150-) 1 CTGCGTATGC TATTATGCTT TGAATAGTGG CAGCCTTGTC GGCTCGCGTA TGTTGTTACG 61 GTTGAATGGT TATGACTCTT TATGAGATAG ATCCACTTTA TATATATATA TATGGTGTTG 121 GGTTTGGCTT GAAAAAAAAA AAAAAAAAAA AACTCTTGAT ACAGTATTGG TTGGAAATTC 181 CCAAAGAGTT GCAGGTGCAG ATTATGCATT AGAAAGTATC CACAACATCA GGGAAGCAAT 241 ACCACAACTT TGGGAAGTAG ACAGGCTGGC TGAAGTTAAC TACTCTGGTG TAGCTGTTGA 301 GACATCTGTC ACAGCTTAGA ATCAGTAGTA CTACTATATC TCATCATCAT GCTGATGGCA 361 GAAGGAAAAA AAAATTAATC AAGAATCATG AGAAGATCCA AAATTTTCTG TCAAATTTGA 421 TTTTAAATGA TGTTGATGTT TTGTTGTCAT CAATTAATAA CTAGCTTTTA GTATTTCCTT 481 TCCATCCACA AATCTTGTAA ATAAATTCTA TATTTATCAG TCTACCTTTC TATGATTATA 541 TAATAATGAA GTTCAATTAT TAAAAAAAAA AAA Predicted gene structure (within gDNA segment 3179 to 1): Exon 1 2470 2343 ( 128 n); cDNA 1 131 ( 131 n); score: 0.832 PPA cDNA 562 573 MATCH C06HBa0057J04.1-9- SGN-E538150- 0.832 128 0.223 C PGS_C06HBa0057J04.1-9-_SGN-E538150- (2470 2343) Alignment (genomic DNA sequence = upper lines): CTGCGTACAT CATTATGCTT TGAATAGTGG CGGCCTTGTC GGCTCGCGTA TGCTGTTATG 2411 ||||||| ||||||||| |||||||||| | |||||||| |||||||||| || ||||| | CTGCGTATGC TATTATGCTT TGAATAGTGG CAGCCTTGTC GGCTCGCGTA TGTTGTTACG 60 GTTGAATGGT TATGACTCCT TATGAGACAG GTCCTC-TTA TAT--ATATA TATGACGTTG 2354 |||||||||| |||||||| | ||||||| || ||| | ||| ||| ||||| |||| |||| GTTGAATGGT TATGACTCTT TATGAGATAG ATCCACTTTA TATATATATA TATGGTGTTG 120 GGGTTGGCTT G 2343 || ||||||| | GGTTTGGCTT G 131 hqPGS_C06HBa0057J04.1-9-_SGN-E538150- (2470 2343) ******************************************************************************** EST sequence 18 +strand 453 n (File: SGN-E303256+) 1 AACCAGCCAT TAAACTCTTG GACAGCAGCT GCAAAGAAAT TGGTGTAGAC GCTCAGTTCG 61 GTGATCCTCC CGCCTAGGAT ATCTACTCTG CTTTTTGGGA GAGCTCCACT GTTCCGGAGC 121 CCAGTCGTTT TGGTACATAA CTTCTTATGT AGTCTTTTGC TTGTCTATGG GTATGGCGGG 181 GCCCTGTCCC GTCAAGTTTC ACTACTATAC TCTTAGAGGT CTGTAGACAT CGTGTGGGTT 241 GTATAATTAT GTTTTGGATA ATGGTCTGGA CATGGTTTGT TTGGGATGTC CATTTGTACA 301 AGTGCAGCCT TGTCGGTTGT GAACATCATT GTGTATTGTG TAGTGGCAGC CTCGTCGGCT 361 GCGTATGCTA TTATGTTTTG GATAGTGGCG GCCTTGTCGG CTCGCATATG TTGTTACGAT 421 TTAATGGTTA TGACTCTTTA TGAAAAAAAA AAA Predicted gene structure (within gDNA segment 3862 to 266): Exon 1 2743 2411 ( 333 n); cDNA 43 375 ( 333 n); score: 0.902 PPA cDNA 443 453 MATCH C06HBa0057J04.1-9- SGN-E303256+ 0.902 333 0.735 C PGS_C06HBa0057J04.1-9-_SGN-E303256+ (2743 2411) Alignment (genomic DNA sequence = upper lines): GTGTAGACGC GCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT GATTGGGAGA 2684 |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| |||||||| GTGTAGACGC TCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT TTTTGGGAGA 102 GCTCCACTGT TCCGGAGCCC AGTCGTTTTG GTACATAACT T-TTGTGTAG GCTTTTGCTC 2625 |||||||||| |||||||||| |||||||||| |||||||||| | || ||||| |||||||| GCTCCACTGT TCCGGAGCCC AGTCGTTTTG GTACATAACT TCTTATGTAG TCTTTTGCTT 162 GTCTATGGGT ATGGCGGGGC CCTGTCCCGT CGAGTTTCAC TAATGTACTC TTAGAGGTCT 2565 |||||||||| |||||||||| |||||||||| | |||||||| || | ||||| |||||||||| GTCTATGGGT ATGGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC TTAGAGGTCT 222 GTGGACATTA TGTGGGTTGT ATATATATGT TTTGGATAAT GGTCTGGACA TGGTTTGTTT 2505 || ||||| |||||||||| ||| ||||| |||||||||| |||||||||| |||||||||| GTAGACATCG TGTGGGTTGT ATAATTATGT TTTGGATAAT GGTCTGGACA TGGTTTGTTT 282 GGGATGTCCG CTTGTACAGG GGCAGCCTTG TCGGCTGCGT ACATCATTAT GCTTTGAATA 2445 ||||||||| ||||||| | ||||||||| |||| || | |||||||| | | ||| || GGGATGTCCA TTTGTACAAG TGCAGCCTTG TCGGTTGTGA ACATCATTGT GTATTGTGTA 342 GTGGCGGCCT TGTCGGCTCG CGTATGCTGT TATG 2411 ||||| |||| ||||||| | |||||||| | |||| GTGGCAGCCT CGTCGGCT-G CGTATGCTAT TATG 375 hqPGS_C06HBa0057J04.1-9-_SGN-E303256+ (2743 2411) ******************************************************************************** EST sequence 30 +strand 455 n (File: SGN-E298250+) 1 AAATGGAGAA ACCAACCCTG CAACTCTTGG CCAGTAGCTG CAAATAATTT GGGGAGTAAT 61 CTCCTTGTTT GGTGTGTTAA TTCTTTAGAA CACCCTTGTT AATTATCCAT TAATTTTAAG 121 AAGGGGGCGT GACCAGTTGC TTACGAAGTT TGTTTTAGTT ATTGAATGTG CTAAGTATGA 181 ATGGAAACCA TAATCGGATT ATTAGTGGTG TCGTGTTGGT GCTTGGGCTG TTTTGATTAA 241 AGCAAACTGC AGGAAAATTC TATTTTGGCA TTATGTATAT GCTGAATGTG ATTATGAGTA 301 TATACTCCAA CGGATGAATA CGATTAGGTA AATGTGTTGC GAATTATAAA ACGAGTTATC 361 GCTCGGTGTG TCGGTGCTTC GCTGCTATAG TTGCCCAGAC GGAACTGTTT TGGGGAGGGG 421 GCTGCCTAAT GTGATACTTC GGGTTATATG TGTTA Predicted gene structure (within gDNA segment 4647 to 868): Exon 1 3930 3698 ( 233 n); cDNA 1 233 ( 233 n); score: 0.966 MATCH C06HBa0057J04.1-9- SGN-E298250+ 0.966 233 0.512 C PGS_C06HBa0057J04.1-9-_SGN-E298250+ (3930 3698) Alignment (genomic DNA sequence = upper lines): AAATGGAGAA ACAAACCCTG CAACTCTTGG CCAGCAGCTG CAAATAATTT GGTTAGTAAT 3871 |||||||||| || ||||||| |||||||||| |||| ||||| |||||||||| || |||||| AAATGGAGAA ACCAACCCTG CAACTCTTGG CCAGTAGCTG CAAATAATTT GGGGAGTAAT 60 CTCCTTGTTT GGTGTGTTAA TTCTTTAGAA TACCCTTGTT AATTATCCAT TAATTTTAAG 3811 |||||||||| |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| CTCCTTGTTT GGTGTGTTAA TTCTTTAGAA CACCCTTGTT AATTATCCAT TAATTTTAAG 120 AAGGGGGCGT GACCAGTAGC TTAGGAAGTT TGTTTTAGTT ATTGAATGTG CTAAGTATGA 3751 |||||||||| ||||||| || ||| |||||| |||||||||| |||||||||| |||||||||| AAGGGGGCGT GACCAGTTGC TTACGAAGTT TGTTTTAGTT ATTGAATGTG CTAAGTATGA 180 ATGGAAACCA TAATCGGATT ATTAGTGGTG TCATGTTGGT GCTTGGGCTG TTT 3698 |||||||||| |||||||||| |||||||||| || ||||||| |||||||||| ||| ATGGAAACCA TAATCGGATT ATTAGTGGTG TCGTGTTGGT GCTTGGGCTG TTT 233 hqPGS_C06HBa0057J04.1-9-_SGN-E298250+ (3930 3698) ******************************************************************************** EST sequence 25 +strand 577 n (File: SGN-E543104+) 1 GGCAGCCATG GAAATGGAGA AAATCAACCA TGCAACTCTT GACCAGCAGC TGCAAAGAAT 61 TTGGTTTGAT TAATAGCTTG AGCAGTAAAT ATTGGACGTG CGGCTCGACT ATACGGTGTA 121 GACGCTCAGT TCGGTGATCC TCCCGCCTAG GATATCTACT TTGCTGAGTG GGAGAGCTCC 181 ACTGTTTCGT AGCCCAGTCA TTTTGGTACA TAACTTTTTG TGTAGTCTTT TGCTTGTCTA 241 TGGGTATGGT GGGGCCCTGT CCCGTCGAGT TTCACTACTA TACTCTTAGA GGTCCATAGA 301 CATCGCGTGG GTTGTATATA TATGTTTTGG ATAATGGTCT GGACATGGTT TGTTTGGGAT 361 GTCCACTTGT ACAAGGGCAG CCTTGTCAGC TGCGTACATC TTTGTGTATT GTGTAGTGGC 421 AGCCTTGTCG GCTGCGTATG CTATTATGCT TTGGATAGTG GCGGCCTTGT CGGCTCGCGT 481 ATGTTGTTAC GGTTGAATGG TTATGACTCC TTATGAGACA GATCCACTTT ATATATATAT 541 ATATGGCGAT GGGGTTGGCT TGATTTGATT AAAAAAA Predicted gene structure (within gDNA segment 9921 to 6308): Exon 1 9278 9217 ( 62 n); cDNA 1 63 ( 63 n); score: 0.879 Intron 1 9216 8956 ( 261 n); Pd: 0.991 (s: 0.87), Pa: 0.933 (s: 0) Exon 2 8955 8951 ( 5 n); cDNA 64 68 ( 5 n); score: 0.400 Intron 2 8950 8656 ( 295 n); Pd: 0.000 (s: 0), Pa: 0.880 (s: 0.98) Exon 3 8655 8609 ( 47 n); cDNA 69 115 ( 47 n); score: 0.979 Intron 3 8608 7884 ( 725 n); Pd: 0.991 (s: 0.98), Pa: 0.000 (s: 0.92) Exon 4 7883 7429 ( 455 n); cDNA 116 570 ( 455 n); score: 0.890 MATCH C06HBa0057J04.1-9- SGN-E543104+ 0.889 569 0.986 C PGS_C06HBa0057J04.1-9-_SGN-E543104+ (9278 9217,8955 8951,8655 8609,7883 7429) Alignment (genomic DNA sequence = upper lines): GGCAGCAATG GAAATGGAGA AACT-AACCC TGCAACTCTT GGCCAGCAGC TGCAAATAAT 9220 |||||| ||| |||||||||| || | |||| |||||||||| | |||||||| |||||| ||| GGCAGCCATG GAAATGGAGA AAATCAACCA TGCAACTCTT GACCAGCAGC TGCAAAGAAT 60 TTGGTTAGTA ATATCCTTGT TTGGTGTGTT AATTCTTTAG AATACCCTTG TTAATTATCC 9160 ||| TTG....... .......... .......... .......... .......... .......... 63 ATTAATTTTA AGAAGGGGGC GTGACCAGTA GCTTAGGAAG TTTGTTTTAG TTATTGAATG 9100 .......... .......... .......... .......... .......... .......... 63 TGCTAAGTAT GAATGGAAAC CATAATCGGA TTATTAGTGG TGTCGTGTTG GTGCTTGGGT 9040 .......... .......... .......... .......... .......... .......... 63 TGTTTTGATT AAAGCAAACT GCAGGAAAAT TCTTTTTTGG CATTATGTAT ATGTTGAATG 8980 .......... .......... .......... .......... .......... .......... 63 TGATTATGAG TATATACTCC AAAGGATGAA TACGATAAGG TAGATGCGTT GCGAATTATA 8920 | | .......... .......... ....GTTTG. .......... .......... .......... 68 AAACGAGTTA TCACTCGGTG TGTCGTTGCT TCGCTGCTAT GGTTGCCGAG ACGGAACTGT 8860 .......... .......... .......... .......... .......... .......... 68 TTTGGGGAGG GGGCTGTTTA ATATGATTCT TTGGGTTATA TGTGTTATTG TTATTACTGT 8800 .......... .......... .......... .......... .......... .......... 68 GGATAATTTG GATTGTTGTC GGATTGGGAC GAAGTAAGGA AAATAGGGGA GGTGCTGCCG 8740 .......... .......... .......... .......... .......... .......... 68 AATTTTCGTT AGATTATTAG CTAGCTTACA AGAAAGTGAA GCACGATGTT TATCTAAATG 8680 .......... .......... .......... .......... .......... .......... 68 CGGCACGATT GTTGCTTGTT ATAGATTAAT AGCTTGAGCA GTAAATATTG GACGTGCGGC 8620 |||||| |||||||||| |||||||||| |||||||||| .......... .......... ....ATTAAT AGCTTGAGCA GTAAATATTG GACGTGCGGC 104 TCGATTATAC GGTATGTAAC GCTGTCCCTT CTTTCTTTGC TTGGCATGAC TTTTAAAAAT 8560 |||| ||||| | TCGACTATAC G......... .......... .......... .......... .......... 115 AAGCGAATAA CGGACAGATT TGATACTTAC CTCTAGAGCG TCTAGGTGAC GTATATTCTT 8500 .......... .......... .......... .......... .......... .......... 115 GCTTCCACAA TTATTCCTCT ATATATCGGC TATGTCTAAG GCTATGATGA TCCCTAATAT 8440 .......... .......... .......... .......... .......... .......... 115 CTATGGTAAT GCTTCTTAGA GTCATTGAGA TTTTTACGTT TCCATATCGT ATTAAAGGTT 8380 .......... .......... .......... .......... .......... .......... 115 CATAATCTTG ATAAAATATT AATCTTTGGT AATACTCCTT GCTGGTTCAC GTTGATTGTT 8320 .......... .......... .......... .......... .......... .......... 115 CTATTGAGTT ATAAGAAATG ATTTTAATTG CATATGGTTG CTCATAATAT TCTGCTCGTG 8260 .......... .......... .......... .......... .......... .......... 115 CATAGAGTCA TTTATCATTT CACCGAGTCC CGGGCAGGGT AATGTTCATG CGGAGTTTCT 8200 .......... .......... .......... .......... .......... .......... 115 TGCATATGTC ACCGAGTTCC TCACTAGAGG GCCGGGTATG TATATTATAT GTATGATTGG 8140 .......... .......... .......... .......... .......... .......... 115 TGATGAGGAT GGTTATGATG ATGATGATGA CGGAGATGAC GTGATGATTA TTTTGCCGAG 8080 .......... .......... .......... .......... .......... .......... 115 CCCCTTATTA GGGAAGTTGG GCACCTTAAA TGTTAAATAT ATGCATGATT TTCACTTAAA 8020 .......... .......... .......... .......... .......... .......... 115 AGGGTATATG TGTAGCGATA TTTTGTTTTG ACTTGCTATA TTGGTATGCT GTCATCTTTA 7960 .......... .......... .......... .......... .......... .......... 115 CCTTATGCTT TACATACTCA TTACATTGTC TGTACTGACC CCCCTTTCCT CGGGGGGCTG 7900 .......... .......... .......... .......... .......... .......... 115 GTTTTCATGC CCGCAGGTGT AGACGCACAG TTTGGTGATC CTCCCGCCTA GGATATCTAC 7840 |||| |||||| ||| || ||||||| |||||||||| |||||||||| .......... ......GTGT AGACGCTCAG TTCGGTGATC CTCCCGCCTA GGATATCTAC 159 TCTGATGATT GGGAGAGCTC CACTGTTCCG GAGCCTAGTC GTTTTGGTAC ATAAC-TTTT 7781 | || ||| | |||||||||| ||||||| || |||| |||| ||||||||| ||||| |||| TTTGCTGAGT GGGAGAGCTC CACTGTTTCG TAGCCCAGTC ATTTTGGTAC ATAACTTTTT 219 GTGTAGTCTT TTGCTCGTCT ATGGGTATGG CGGGTCCCTG TCCCGTCGAG TTTCACTAAT 7721 |||||||||| ||||| |||| |||||||||| ||| ||||| |||||||||| |||||||| | GTGTAGTCTT TTGCTTGTCT ATGGGTATGG TGGGGCCCTG TCCCGTCGAG TTTCACTACT 279 GTACTCTTAG AGGTCTGTGG ACATTATGTG GGTTGTATAT ATATTTTTTG GATAATGGTC 7661 ||||||||| ||||| | | |||| ||| |||||||||| |||| ||||| |||||||||| ATACTCTTAG AGGTCCATAG ACATCGCGTG GGTTGTATAT ATATGTTTTG GATAATGGTC 339 TGGACATGGT TTGTTTGGGA TGTCCGCTTG TACAGGGGCA GCCTTGTCGG CTGCGTACAT 7601 |||||||||| |||||||||| ||||| |||| |||| ||||| |||||||| | |||||||||| TGGACATGGT TTGTTTGGGA TGTCCACTTG TACAAGGGCA GCCTTGTCAG CTGCGTACAT 399 CATTGTGTAT TGTGTAGTGG CAGCCTTGTC GGCATACGTA TGTTATTATG CTTTGAATAG 7541 | |||||||| |||||||||| |||||||||| ||| | |||| || ||||||| ||||| |||| CTTTGTGTAT TGTGTAGTGG CAGCCTTGTC GGC-TGCGTA TGCTATTATG CTTTGGATAG 458 TGGCGGCCTT GTCGGCTCGC GTATGTTGTT ATGGTTGAAT GATTATGACT CCTTATGAGA 7481 |||||||||| |||||||||| |||||||||| | |||||||| | |||||||| |||||||||| TGGCGGCCTT GTCGGCTCGC GTATGTTGTT ACGGTTGAAT GGTTATGACT CCTTATGAGA 518 CAGGTCCTC- TTATATATAT ATGACGTTGG GGTTGGCTTG ATTTGATTAA ATT 7429 ||| ||| | |||||||||| || | | | | | || ||| |||||| ||| CAGATCCACT TTATATATAT AT-ATATGGC GATGGGGTTG GCTTGATTTG ATT 570 hqPGS_C06HBa0057J04.1-9-_SGN-E543104+ (9278 9217,8955 8951,8655 8609,7883 7429) ******************************************************************************** EST sequence 16 +strand 547 n (File: SGN-E305738+) 1 AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 61 ATTTATACCT TGAGCAGTAA ATATTGGACG TACGGCTCGA CTATTCGGTG TAGACGCTCA 121 GTTCGGTGAT CCTCCCGCCT AGGATATCTA CTCTGCTTTT TGGGAGAGCT CCACTGTTCC 181 GGAGCCCAGT CGTTTTGGTA CATAACTTCT TATGTAGTCT TTTGCTTGTC TATGGGTATG 241 GCGGGGCCCT GTCCCGTCAA GTTTCACTAC TATACTCTTA GAGGTCTGTA GACATCGTGT 301 GGGTTGTATA ATTATGTTTT GGATAATGGT CTGGACATGG TTTGTTTGGG ATGTCCATTT 361 GTACAAGTGC AGCCTTGTCG GTTGTGAACA TCATTGTGTA TTGTGTAGTG GCAGCCTCGT 421 CGGCTGCGTA TGCTATTATG TTTTGGATAG TGGCGGCCTT GTCGGCTCGC ATATGTTGTT 481 ACGATTTAAT GGTTATGACT CTTTATGAGA TAGATCCACT TTATATATGA AATGAATGGA 541 CTAACTA Predicted gene structure (within gDNA segment 9706 to 5665): Exon 1 9275 9217 ( 59 n); cDNA 1 60 ( 60 n); score: 0.805 Intron 1 9216 8656 ( 561 n); Pd: 0.991 (s: 0.79), Pa: 0.880 (s: 0.89) Exon 2 8655 8609 ( 47 n); cDNA 61 107 ( 47 n); score: 0.894 Intron 2 8608 7884 ( 725 n); Pd: 0.991 (s: 0.89), Pa: 0.000 (s: 0.94) Exon 3 7883 7444 ( 440 n); cDNA 108 546 ( 439 n); score: 0.876 MATCH C06HBa0057J04.1-9- SGN-E305738+ 0.868 546 0.998 C PGS_C06HBa0057J04.1-9-_SGN-E305738+ (9275 9217,8655 8609,7883 7444) Alignment (genomic DNA sequence = upper lines): AGCAATGGAA ATGGAG-AAA CTAACCCTGC AACTCTTGGC CAGCAGCTGC AAATAATTTG 9217 ||| |||||| |||||| ||| | | || | ||||||||| |||||||||| ||| || ||| AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 60 GTTAGTAATA TCCTTGTTTG GTGTGTTAAT TCTTTAGAAT ACCCTTGTTA ATTATCCATT 9157 .......... .......... .......... .......... .......... .......... 60 AATTTTAAGA AGGGGGCGTG ACCAGTAGCT TAGGAAGTTT GTTTTAGTTA TTGAATGTGC 9097 .......... .......... .......... .......... .......... .......... 60 TAAGTATGAA TGGAAACCAT AATCGGATTA TTAGTGGTGT CGTGTTGGTG CTTGGGTTGT 9037 .......... .......... .......... .......... .......... .......... 60 TTTGATTAAA GCAAACTGCA GGAAAATTCT TTTTTGGCAT TATGTATATG TTGAATGTGA 8977 .......... .......... .......... .......... .......... .......... 60 TTATGAGTAT ATACTCCAAA GGATGAATAC GATAAGGTAG ATGCGTTGCG AATTATAAAA 8917 .......... .......... .......... .......... .......... .......... 60 CGAGTTATCA CTCGGTGTGT CGTTGCTTCG CTGCTATGGT TGCCGAGACG GAACTGTTTT 8857 .......... .......... .......... .......... .......... .......... 60 GGGGAGGGGG CTGTTTAATA TGATTCTTTG GGTTATATGT GTTATTGTTA TTACTGTGGA 8797 .......... .......... .......... .......... .......... .......... 60 TAATTTGGAT TGTTGTCGGA TTGGGACGAA GTAAGGAAAA TAGGGGAGGT GCTGCCGAAT 8737 .......... .......... .......... .......... .......... .......... 60 TTTCGTTAGA TTATTAGCTA GCTTACAAGA AAGTGAAGCA CGATGTTTAT CTAAATGCGG 8677 .......... .......... .......... .......... .......... .......... 60 CACGATTGTT GCTTGTTATA GATTAATAGC TTGAGCAGTA AATATTGGAC GTGCGGCTCG 8617 ||| ||| | |||||||||| |||||||||| || ||||||| .......... .......... .ATTTATACC TTGAGCAGTA AATATTGGAC GTACGGCTCG 99 ATTATACGGT ATGTAACGCT GTCCCTTCTT TCTTTGCTTG GCATGACTTT TAAAAATAAG 8557 | ||| || ACTATTCG.. .......... .......... .......... .......... .......... 107 CGAATAACGG ACAGATTTGA TACTTACCTC TAGAGCGTCT AGGTGACGTA TATTCTTGCT 8497 .......... .......... .......... .......... .......... .......... 107 TCCACAATTA TTCCTCTATA TATCGGCTAT GTCTAAGGCT ATGATGATCC CTAATATCTA 8437 .......... .......... .......... .......... .......... .......... 107 TGGTAATGCT TCTTAGAGTC ATTGAGATTT TTACGTTTCC ATATCGTATT AAAGGTTCAT 8377 .......... .......... .......... .......... .......... .......... 107 AATCTTGATA AAATATTAAT CTTTGGTAAT ACTCCTTGCT GGTTCACGTT GATTGTTCTA 8317 .......... .......... .......... .......... .......... .......... 107 TTGAGTTATA AGAAATGATT TTAATTGCAT ATGGTTGCTC ATAATATTCT GCTCGTGCAT 8257 .......... .......... .......... .......... .......... .......... 107 AGAGTCATTT ATCATTTCAC CGAGTCCCGG GCAGGGTAAT GTTCATGCGG AGTTTCTTGC 8197 .......... .......... .......... .......... .......... .......... 107 ATATGTCACC GAGTTCCTCA CTAGAGGGCC GGGTATGTAT ATTATATGTA TGATTGGTGA 8137 .......... .......... .......... .......... .......... .......... 107 TGAGGATGGT TATGATGATG ATGATGACGG AGATGACGTG ATGATTATTT TGCCGAGCCC 8077 .......... .......... .......... .......... .......... .......... 107 CTTATTAGGG AAGTTGGGCA CCTTAAATGT TAAATATATG CATGATTTTC ACTTAAAAGG 8017 .......... .......... .......... .......... .......... .......... 107 GTATATGTGT AGCGATATTT TGTTTTGACT TGCTATATTG GTATGCTGTC ATCTTTACCT 7957 .......... .......... .......... .......... .......... .......... 107 TATGCTTTAC ATACTCATTA CATTGTCTGT ACTGACCCCC CTTTCCTCGG GGGGCTGGTT 7897 .......... .......... .......... .......... .......... .......... 107 TTCATGCCCG CAGGTGTAGA CGCACAGTTT GGTGATCCTC CCGCCTAGGA TATCTACTCT 7837 ||||||| ||| ||||| |||||||||| |||||||||| |||||||||| .......... ...GTGTAGA CGCTCAGTTC GGTGATCCTC CCGCCTAGGA TATCTACTCT 154 GATGATTGGG AGAGCTCCAC TGTTCCGGAG CCTAGTCGTT TTGGTACATA ACTT-TTGTG 7778 | | ||||| |||||||||| |||||||||| || ||||||| |||||||||| |||| || || GCTTTTTGGG AGAGCTCCAC TGTTCCGGAG CCCAGTCGTT TTGGTACATA ACTTCTTATG 214 TAGTCTTTTG CTCGTCTATG GGTATGGCGG GTCCCTGTCC CGTCGAGTTT CACTAATGTA 7718 |||||||||| || ||||||| |||||||||| | |||||||| |||| ||||| ||||| | || TAGTCTTTTG CTTGTCTATG GGTATGGCGG GGCCCTGTCC CGTCAAGTTT CACTACTATA 274 CTCTTAGAGG TCTGTGGACA TTATGTGGGT TGTATATATA TTTTTTGGAT AATGGTCTGG 7658 |||||||||| ||||| |||| | ||||||| |||||| || | |||||||| |||||||||| CTCTTAGAGG TCTGTAGACA TCGTGTGGGT TGTATAATTA TGTTTTGGAT AATGGTCTGG 334 ACATGGTTTG TTTGGGATGT CCGCTTGTAC AGGGGCAGCC TTGTCGGCTG CGTACATCAT 7598 |||||||||| |||||||||| || |||||| | | |||||| ||||||| || | ||||||| ACATGGTTTG TTTGGGATGT CCATTTGTAC AAGTGCAGCC TTGTCGGTTG TGAACATCAT 394 TGTGTATTGT GTAGTGGCAG CCTTGTCGGC ATACGTATGT TATTATGCTT TGAATAGTGG 7538 |||||||||| |||||||||| ||| |||||| | |||||| ||||||| || || ||||||| TGTGTATTGT GTAGTGGCAG CCTCGTCGGC -TGCGTATGC TATTATGTTT TGGATAGTGG 453 CGGCCTTGTC GGCTCGCGTA TGTTGTTATG GTTGAATGAT TATGACTCCT TATGAGACAG 7478 |||||||||| ||||||| || |||||||| | || |||| | |||||||| | ||||||| || CGGCCTTGTC GGCTCGCATA TGTTGTTACG ATTTAATGGT TATGACTCTT TATGAGATAG 513 GTCCTCTTAT ATATATATGA CGTTGGGGTT GGCT 7444 ||| ||| | |||||| | | || | || ATCCACTT-T ATATATGAAA TGAATGGACT AACT 546 hqPGS_C06HBa0057J04.1-9-_SGN-E305738+ (9275 9217,8655 8609,7883 7444) ******************************************************************************** EST sequence 4 -strand 542 n (File: SGN-E374134-) 1 CAGCCATGGA AATGGAGAAA ACCAGCCATT AAACTCTTGG ACAGCAGCTG CAAAGAAATT 61 GATTTATACC TTGAGCAGTA AATATTGGAC GTACGGCTCG ACTATTCGGT GTAGACGCTC 121 AGTTCGGTGA TCCTCCCGCC TAGGATATCT ACTCTGCTTT TTGGGAGAGC TCCACTGTTC 181 CGGAGCCCAG TCGTTTTGGT ACATAACTTC TTATGTAGTC TTTTGCTTGT CTATGGGTAT 241 GGCGGGGCCC TGTCCCGTCA AGTTTCACTA CTATACTCTT AGAGGTCTGT AGACATCGTG 301 TGGGTTGTAT AATTATGTTT TGGATAATGG TCTGGACATG GTTTGTTTGG GATGTCCATT 361 TGTACAAGTG CAGCCTTGTC GGTTGTGAAC ATCATTGTGT ATTGTGTAGT GGCAGCCTCG 421 TCGGCTGCGT ATGCTATTAT GTTTTGGATA GTGGCGGCCT TGTCGGCTCG CATATGTTGT 481 TACGATTTAA TGGTTATGAC TCTTTATGAG ATAGATCCAC TTTATATATA AAAAAAAAAA 541 AA Predicted gene structure (within gDNA segment 9726 to 5735): Exon 1 9276 9217 ( 60 n); cDNA 1 61 ( 61 n); score: 0.808 Intron 1 9216 8656 ( 561 n); Pd: 0.991 (s: 0.79), Pa: 0.880 (s: 0.89) Exon 2 8655 8609 ( 47 n); cDNA 62 108 ( 47 n); score: 0.894 Intron 2 8608 7884 ( 725 n); Pd: 0.991 (s: 0.89), Pa: 0.000 (s: 0.94) Exon 3 7883 7461 ( 423 n); cDNA 109 530 ( 422 n); score: 0.897 PPA cDNA 531 542 MATCH C06HBa0057J04.1-9- SGN-E374134- 0.886 530 0.978 C PGS_C06HBa0057J04.1-9-_SGN-E374134- (9276 9217,8655 8609,7883 7461) Alignment (genomic DNA sequence = upper lines): CAGCAATGGA AATGGAG-AA ACTAACCCTG CAACTCTTGG CCAGCAGCTG CAAATAATTT 9218 |||| ||||| ||||||| || || | || | ||||||||| ||||||||| |||| || || CAGCCATGGA AATGGAGAAA ACCAGCCATT AAACTCTTGG ACAGCAGCTG CAAAGAAATT 60 GGTTAGTAAT ATCCTTGTTT GGTGTGTTAA TTCTTTAGAA TACCCTTGTT AATTATCCAT 9158 | G......... .......... .......... .......... .......... .......... 61 TAATTTTAAG AAGGGGGCGT GACCAGTAGC TTAGGAAGTT TGTTTTAGTT ATTGAATGTG 9098 .......... .......... .......... .......... .......... .......... 61 CTAAGTATGA ATGGAAACCA TAATCGGATT ATTAGTGGTG TCGTGTTGGT GCTTGGGTTG 9038 .......... .......... .......... .......... .......... .......... 61 TTTTGATTAA AGCAAACTGC AGGAAAATTC TTTTTTGGCA TTATGTATAT GTTGAATGTG 8978 .......... .......... .......... .......... .......... .......... 61 ATTATGAGTA TATACTCCAA AGGATGAATA CGATAAGGTA GATGCGTTGC GAATTATAAA 8918 .......... .......... .......... .......... .......... .......... 61 ACGAGTTATC ACTCGGTGTG TCGTTGCTTC GCTGCTATGG TTGCCGAGAC GGAACTGTTT 8858 .......... .......... .......... .......... .......... .......... 61 TGGGGAGGGG GCTGTTTAAT ATGATTCTTT GGGTTATATG TGTTATTGTT ATTACTGTGG 8798 .......... .......... .......... .......... .......... .......... 61 ATAATTTGGA TTGTTGTCGG ATTGGGACGA AGTAAGGAAA ATAGGGGAGG TGCTGCCGAA 8738 .......... .......... .......... .......... .......... .......... 61 TTTTCGTTAG ATTATTAGCT AGCTTACAAG AAAGTGAAGC ACGATGTTTA TCTAAATGCG 8678 .......... .......... .......... .......... .......... .......... 61 GCACGATTGT TGCTTGTTAT AGATTAATAG CTTGAGCAGT AAATATTGGA CGTGCGGCTC 8618 ||| ||| |||||||||| |||||||||| ||| |||||| .......... .......... ..ATTTATAC CTTGAGCAGT AAATATTGGA CGTACGGCTC 99 GATTATACGG TATGTAACGC TGTCCCTTCT TTCTTTGCTT GGCATGACTT TTAAAAATAA 8558 || ||| || GACTATTCG. .......... .......... .......... .......... .......... 108 GCGAATAACG GACAGATTTG ATACTTACCT CTAGAGCGTC TAGGTGACGT ATATTCTTGC 8498 .......... .......... .......... .......... .......... .......... 108 TTCCACAATT ATTCCTCTAT ATATCGGCTA TGTCTAAGGC TATGATGATC CCTAATATCT 8438 .......... .......... .......... .......... .......... .......... 108 ATGGTAATGC TTCTTAGAGT CATTGAGATT TTTACGTTTC CATATCGTAT TAAAGGTTCA 8378 .......... .......... .......... .......... .......... .......... 108 TAATCTTGAT AAAATATTAA TCTTTGGTAA TACTCCTTGC TGGTTCACGT TGATTGTTCT 8318 .......... .......... .......... .......... .......... .......... 108 ATTGAGTTAT AAGAAATGAT TTTAATTGCA TATGGTTGCT CATAATATTC TGCTCGTGCA 8258 .......... .......... .......... .......... .......... .......... 108 TAGAGTCATT TATCATTTCA CCGAGTCCCG GGCAGGGTAA TGTTCATGCG GAGTTTCTTG 8198 .......... .......... .......... .......... .......... .......... 108 CATATGTCAC CGAGTTCCTC ACTAGAGGGC CGGGTATGTA TATTATATGT ATGATTGGTG 8138 .......... .......... .......... .......... .......... .......... 108 ATGAGGATGG TTATGATGAT GATGATGACG GAGATGACGT GATGATTATT TTGCCGAGCC 8078 .......... .......... .......... .......... .......... .......... 108 CCTTATTAGG GAAGTTGGGC ACCTTAAATG TTAAATATAT GCATGATTTT CACTTAAAAG 8018 .......... .......... .......... .......... .......... .......... 108 GGTATATGTG TAGCGATATT TTGTTTTGAC TTGCTATATT GGTATGCTGT CATCTTTACC 7958 .......... .......... .......... .......... .......... .......... 108 TTATGCTTTA CATACTCATT ACATTGTCTG TACTGACCCC CCTTTCCTCG GGGGGCTGGT 7898 .......... .......... .......... .......... .......... .......... 108 TTTCATGCCC GCAGGTGTAG ACGCACAGTT TGGTGATCCT CCCGCCTAGG ATATCTACTC 7838 |||||| |||| ||||| ||||||||| |||||||||| |||||||||| .......... ....GTGTAG ACGCTCAGTT CGGTGATCCT CCCGCCTAGG ATATCTACTC 154 TGATGATTGG GAGAGCTCCA CTGTTCCGGA GCCTAGTCGT TTTGGTACAT AACTT-TTGT 7779 || | |||| |||||||||| |||||||||| ||| |||||| |||||||||| ||||| || | TGCTTTTTGG GAGAGCTCCA CTGTTCCGGA GCCCAGTCGT TTTGGTACAT AACTTCTTAT 214 GTAGTCTTTT GCTCGTCTAT GGGTATGGCG GGTCCCTGTC CCGTCGAGTT TCACTAATGT 7719 |||||||||| ||| |||||| |||||||||| || ||||||| ||||| |||| |||||| | | GTAGTCTTTT GCTTGTCTAT GGGTATGGCG GGGCCCTGTC CCGTCAAGTT TCACTACTAT 274 ACTCTTAGAG GTCTGTGGAC ATTATGTGGG TTGTATATAT ATTTTTTGGA TAATGGTCTG 7659 |||||||||| |||||| ||| || |||||| ||||||| | || ||||||| |||||||||| ACTCTTAGAG GTCTGTAGAC ATCGTGTGGG TTGTATAATT ATGTTTTGGA TAATGGTCTG 334 GACATGGTTT GTTTGGGATG TCCGCTTGTA CAGGGGCAGC CTTGTCGGCT GCGTACATCA 7599 |||||||||| |||||||||| ||| ||||| || | ||||| |||||||| | | | |||||| GACATGGTTT GTTTGGGATG TCCATTTGTA CAAGTGCAGC CTTGTCGGTT GTGAACATCA 394 TTGTGTATTG TGTAGTGGCA GCCTTGTCGG CATACGTATG TTATTATGCT TTGAATAGTG 7539 |||||||||| |||||||||| |||| ||||| | | |||||| ||||||| | ||| |||||| TTGTGTATTG TGTAGTGGCA GCCTCGTCGG C-TGCGTATG CTATTATGTT TTGGATAGTG 453 GCGGCCTTGT CGGCTCGCGT ATGTTGTTAT GGTTGAATGA TTATGACTCC TTATGAGACA 7479 |||||||||| |||||||| | ||||||||| | || |||| ||||||||| |||||||| | GCGGCCTTGT CGGCTCGCAT ATGTTGTTAC GATTTAATGG TTATGACTCT TTATGAGATA 513 GGTCCTCTTA TATATATA 7461 | ||| ||| |||||||| GATCCACTT- TATATATA 530 hqPGS_C06HBa0057J04.1-9-_SGN-E374134- (9276 9217,8655 8609,7883 7461) ******************************************************************************** EST sequence 21 +strand 542 n (File: SGN-E374135+) 1 AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 61 ATTTATACCT TGAGCAGTAA ATATTGGACG TACGGCTCGA CTATTCGGTG TAGACGCTCA 121 GTTCGGTGAT CCTCCCGCCT AGGATATCTA CTCTGCTTTT TGGGAGAGCT CCACTGTTCC 181 GGAGCCCAGT CGTTTTGGTA CATAACTTCT TATGTAGTCT TTTGCTTGTC TATGGGTATG 241 GCGGGGCCCT GTCCCGTCAA GTTTCACTAC TATACTCTTA GAGGTCTGTA GACATCGTGT 301 GGGTTGTATA ATTATGTTTT GGATAATGGT CTGGACATGG TTTGTTTGGG ATGTCCATTT 361 GTACAAGTGC AGCCTTGTCG GTTGTGAACA TCATTGTGTA TTGTGTAGTG GCAGCCTCGT 421 CGGCTGCGTA TGCTATTATG TTTTGGATAG TGGCGGCCTT GTCGGCTCGC ATATGTTGTT 481 ACGATTTAAT GGTTATGACT CTTTATGAGA TAGATCCACT TTATATATAA AAAAAAAAAA 541 AA Predicted gene structure (within gDNA segment 9706 to 5715): Exon 1 9275 9217 ( 59 n); cDNA 1 60 ( 60 n); score: 0.805 Intron 1 9216 8656 ( 561 n); Pd: 0.991 (s: 0.79), Pa: 0.880 (s: 0.89) Exon 2 8655 8609 ( 47 n); cDNA 61 107 ( 47 n); score: 0.894 Intron 2 8608 7884 ( 725 n); Pd: 0.991 (s: 0.89), Pa: 0.000 (s: 0.94) Exon 3 7883 7461 ( 423 n); cDNA 108 529 ( 422 n); score: 0.897 PPA cDNA 530 542 MATCH C06HBa0057J04.1-9- SGN-E374135+ 0.886 529 0.976 C PGS_C06HBa0057J04.1-9-_SGN-E374135+ (9275 9217,8655 8609,7883 7461) Alignment (genomic DNA sequence = upper lines): AGCAATGGAA ATGGAG-AAA CTAACCCTGC AACTCTTGGC CAGCAGCTGC AAATAATTTG 9217 ||| |||||| |||||| ||| | | || | ||||||||| |||||||||| ||| || ||| AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 60 GTTAGTAATA TCCTTGTTTG GTGTGTTAAT TCTTTAGAAT ACCCTTGTTA ATTATCCATT 9157 .......... .......... .......... .......... .......... .......... 60 AATTTTAAGA AGGGGGCGTG ACCAGTAGCT TAGGAAGTTT GTTTTAGTTA TTGAATGTGC 9097 .......... .......... .......... .......... .......... .......... 60 TAAGTATGAA TGGAAACCAT AATCGGATTA TTAGTGGTGT CGTGTTGGTG CTTGGGTTGT 9037 .......... .......... .......... .......... .......... .......... 60 TTTGATTAAA GCAAACTGCA GGAAAATTCT TTTTTGGCAT TATGTATATG TTGAATGTGA 8977 .......... .......... .......... .......... .......... .......... 60 TTATGAGTAT ATACTCCAAA GGATGAATAC GATAAGGTAG ATGCGTTGCG AATTATAAAA 8917 .......... .......... .......... .......... .......... .......... 60 CGAGTTATCA CTCGGTGTGT CGTTGCTTCG CTGCTATGGT TGCCGAGACG GAACTGTTTT 8857 .......... .......... .......... .......... .......... .......... 60 GGGGAGGGGG CTGTTTAATA TGATTCTTTG GGTTATATGT GTTATTGTTA TTACTGTGGA 8797 .......... .......... .......... .......... .......... .......... 60 TAATTTGGAT TGTTGTCGGA TTGGGACGAA GTAAGGAAAA TAGGGGAGGT GCTGCCGAAT 8737 .......... .......... .......... .......... .......... .......... 60 TTTCGTTAGA TTATTAGCTA GCTTACAAGA AAGTGAAGCA CGATGTTTAT CTAAATGCGG 8677 .......... .......... .......... .......... .......... .......... 60 CACGATTGTT GCTTGTTATA GATTAATAGC TTGAGCAGTA AATATTGGAC GTGCGGCTCG 8617 ||| ||| | |||||||||| |||||||||| || ||||||| .......... .......... .ATTTATACC TTGAGCAGTA AATATTGGAC GTACGGCTCG 99 ATTATACGGT ATGTAACGCT GTCCCTTCTT TCTTTGCTTG GCATGACTTT TAAAAATAAG 8557 | ||| || ACTATTCG.. .......... .......... .......... .......... .......... 107 CGAATAACGG ACAGATTTGA TACTTACCTC TAGAGCGTCT AGGTGACGTA TATTCTTGCT 8497 .......... .......... .......... .......... .......... .......... 107 TCCACAATTA TTCCTCTATA TATCGGCTAT GTCTAAGGCT ATGATGATCC CTAATATCTA 8437 .......... .......... .......... .......... .......... .......... 107 TGGTAATGCT TCTTAGAGTC ATTGAGATTT TTACGTTTCC ATATCGTATT AAAGGTTCAT 8377 .......... .......... .......... .......... .......... .......... 107 AATCTTGATA AAATATTAAT CTTTGGTAAT ACTCCTTGCT GGTTCACGTT GATTGTTCTA 8317 .......... .......... .......... .......... .......... .......... 107 TTGAGTTATA AGAAATGATT TTAATTGCAT ATGGTTGCTC ATAATATTCT GCTCGTGCAT 8257 .......... .......... .......... .......... .......... .......... 107 AGAGTCATTT ATCATTTCAC CGAGTCCCGG GCAGGGTAAT GTTCATGCGG AGTTTCTTGC 8197 .......... .......... .......... .......... .......... .......... 107 ATATGTCACC GAGTTCCTCA CTAGAGGGCC GGGTATGTAT ATTATATGTA TGATTGGTGA 8137 .......... .......... .......... .......... .......... .......... 107 TGAGGATGGT TATGATGATG ATGATGACGG AGATGACGTG ATGATTATTT TGCCGAGCCC 8077 .......... .......... .......... .......... .......... .......... 107 CTTATTAGGG AAGTTGGGCA CCTTAAATGT TAAATATATG CATGATTTTC ACTTAAAAGG 8017 .......... .......... .......... .......... .......... .......... 107 GTATATGTGT AGCGATATTT TGTTTTGACT TGCTATATTG GTATGCTGTC ATCTTTACCT 7957 .......... .......... .......... .......... .......... .......... 107 TATGCTTTAC ATACTCATTA CATTGTCTGT ACTGACCCCC CTTTCCTCGG GGGGCTGGTT 7897 .......... .......... .......... .......... .......... .......... 107 TTCATGCCCG CAGGTGTAGA CGCACAGTTT GGTGATCCTC CCGCCTAGGA TATCTACTCT 7837 ||||||| ||| ||||| |||||||||| |||||||||| |||||||||| .......... ...GTGTAGA CGCTCAGTTC GGTGATCCTC CCGCCTAGGA TATCTACTCT 154 GATGATTGGG AGAGCTCCAC TGTTCCGGAG CCTAGTCGTT TTGGTACATA ACTT-TTGTG 7778 | | ||||| |||||||||| |||||||||| || ||||||| |||||||||| |||| || || GCTTTTTGGG AGAGCTCCAC TGTTCCGGAG CCCAGTCGTT TTGGTACATA ACTTCTTATG 214 TAGTCTTTTG CTCGTCTATG GGTATGGCGG GTCCCTGTCC CGTCGAGTTT CACTAATGTA 7718 |||||||||| || ||||||| |||||||||| | |||||||| |||| ||||| ||||| | || TAGTCTTTTG CTTGTCTATG GGTATGGCGG GGCCCTGTCC CGTCAAGTTT CACTACTATA 274 CTCTTAGAGG TCTGTGGACA TTATGTGGGT TGTATATATA TTTTTTGGAT AATGGTCTGG 7658 |||||||||| ||||| |||| | ||||||| |||||| || | |||||||| |||||||||| CTCTTAGAGG TCTGTAGACA TCGTGTGGGT TGTATAATTA TGTTTTGGAT AATGGTCTGG 334 ACATGGTTTG TTTGGGATGT CCGCTTGTAC AGGGGCAGCC TTGTCGGCTG CGTACATCAT 7598 |||||||||| |||||||||| || |||||| | | |||||| ||||||| || | ||||||| ACATGGTTTG TTTGGGATGT CCATTTGTAC AAGTGCAGCC TTGTCGGTTG TGAACATCAT 394 TGTGTATTGT GTAGTGGCAG CCTTGTCGGC ATACGTATGT TATTATGCTT TGAATAGTGG 7538 |||||||||| |||||||||| ||| |||||| | |||||| ||||||| || || ||||||| TGTGTATTGT GTAGTGGCAG CCTCGTCGGC -TGCGTATGC TATTATGTTT TGGATAGTGG 453 CGGCCTTGTC GGCTCGCGTA TGTTGTTATG GTTGAATGAT TATGACTCCT TATGAGACAG 7478 |||||||||| ||||||| || |||||||| | || |||| | |||||||| | ||||||| || CGGCCTTGTC GGCTCGCATA TGTTGTTACG ATTTAATGGT TATGACTCTT TATGAGATAG 513 GTCCTCTTAT ATATATA 7461 ||| ||| | ||||||| ATCCACTT-T ATATATA 529 hqPGS_C06HBa0057J04.1-9-_SGN-E374135+ (9275 9217,8655 8609,7883 7461) ******************************************************************************** EST sequence 6 -strand 586 n (File: SGN-E543103-) 1 GGCAGCCATG GAAATGGAGA AAATCAACCA TGCAACTCTT GACCAGCAGC TGCAAAGAAT 61 TTGGTTTGAT TAATAGCTTG AGCAGTAAAT ATTGGACGTG CGGCTCGACT ATACGGTGTA 121 GACGCTCAGT TCGGTGATCC TCCCGCCTAG GATATCTACT TTGCTGAGTG GGAGAGCTCC 181 ACTGTTTCGT AGCCCAGTCA TTTTGGTACA TAACTTTTTG TGTAGTCTTT TGCTTGTCTA 241 TGGGTATGGT GGGGCCCTGT CCCGTCGAGT TTCACTACTA TACTCTTAGA GGTCCATAGA 301 CATCGCGTGG GTTGTATATA TATGTTTTGG ATAATGGTCT GGACATGGTT TGTTTGGGAT 361 GTCCACTTGT ACAAGGGCAG CCTTGTCAGC TGCGTACATC TTTGTGTATT GTGTAGTGGC 421 AGCCTTGTCG GCTGCGTATG CTATTATGCT TTGGATAGTG GCGGCCTTGT CGGCTCGCGT 481 ATGTTGTTAC GGTTGAATGG TTATGACTCC TTATGAGACA GATCCACTTT ATATATATAT 541 ATATGGCGTT GGGGTTGGCG TGATTGATAA AAAAAAGGGG GGCCCG Predicted gene structure (within gDNA segment 9921 to 6228): Exon 1 9278 9217 ( 62 n); cDNA 1 63 ( 63 n); score: 0.879 Intron 1 9216 8956 ( 261 n); Pd: 0.991 (s: 0.87), Pa: 0.933 (s: 0) Exon 2 8955 8951 ( 5 n); cDNA 64 68 ( 5 n); score: 0.400 Intron 2 8950 8656 ( 295 n); Pd: 0.000 (s: 0), Pa: 0.880 (s: 0.98) Exon 3 8655 8609 ( 47 n); cDNA 69 115 ( 47 n); score: 0.979 Intron 3 8608 7884 ( 725 n); Pd: 0.991 (s: 0.98), Pa: 0.000 (s: 0.92) Exon 4 7883 7462 ( 422 n); cDNA 116 536 ( 421 n); score: 0.914 MATCH C06HBa0057J04.1-9- SGN-E543103- 0.909 536 0.915 C PGS_C06HBa0057J04.1-9-_SGN-E543103- (9278 9217,8955 8951,8655 8609,7883 7462) Alignment (genomic DNA sequence = upper lines): GGCAGCAATG GAAATGGAGA AACT-AACCC TGCAACTCTT GGCCAGCAGC TGCAAATAAT 9220 |||||| ||| |||||||||| || | |||| |||||||||| | |||||||| |||||| ||| GGCAGCCATG GAAATGGAGA AAATCAACCA TGCAACTCTT GACCAGCAGC TGCAAAGAAT 60 TTGGTTAGTA ATATCCTTGT TTGGTGTGTT AATTCTTTAG AATACCCTTG TTAATTATCC 9160 ||| TTG....... .......... .......... .......... .......... .......... 63 ATTAATTTTA AGAAGGGGGC GTGACCAGTA GCTTAGGAAG TTTGTTTTAG TTATTGAATG 9100 .......... .......... .......... .......... .......... .......... 63 TGCTAAGTAT GAATGGAAAC CATAATCGGA TTATTAGTGG TGTCGTGTTG GTGCTTGGGT 9040 .......... .......... .......... .......... .......... .......... 63 TGTTTTGATT AAAGCAAACT GCAGGAAAAT TCTTTTTTGG CATTATGTAT ATGTTGAATG 8980 .......... .......... .......... .......... .......... .......... 63 TGATTATGAG TATATACTCC AAAGGATGAA TACGATAAGG TAGATGCGTT GCGAATTATA 8920 | | .......... .......... ....GTTTG. .......... .......... .......... 68 AAACGAGTTA TCACTCGGTG TGTCGTTGCT TCGCTGCTAT GGTTGCCGAG ACGGAACTGT 8860 .......... .......... .......... .......... .......... .......... 68 TTTGGGGAGG GGGCTGTTTA ATATGATTCT TTGGGTTATA TGTGTTATTG TTATTACTGT 8800 .......... .......... .......... .......... .......... .......... 68 GGATAATTTG GATTGTTGTC GGATTGGGAC GAAGTAAGGA AAATAGGGGA GGTGCTGCCG 8740 .......... .......... .......... .......... .......... .......... 68 AATTTTCGTT AGATTATTAG CTAGCTTACA AGAAAGTGAA GCACGATGTT TATCTAAATG 8680 .......... .......... .......... .......... .......... .......... 68 CGGCACGATT GTTGCTTGTT ATAGATTAAT AGCTTGAGCA GTAAATATTG GACGTGCGGC 8620 |||||| |||||||||| |||||||||| |||||||||| .......... .......... ....ATTAAT AGCTTGAGCA GTAAATATTG GACGTGCGGC 104 TCGATTATAC GGTATGTAAC GCTGTCCCTT CTTTCTTTGC TTGGCATGAC TTTTAAAAAT 8560 |||| ||||| | TCGACTATAC G......... .......... .......... .......... .......... 115 AAGCGAATAA CGGACAGATT TGATACTTAC CTCTAGAGCG TCTAGGTGAC GTATATTCTT 8500 .......... .......... .......... .......... .......... .......... 115 GCTTCCACAA TTATTCCTCT ATATATCGGC TATGTCTAAG GCTATGATGA TCCCTAATAT 8440 .......... .......... .......... .......... .......... .......... 115 CTATGGTAAT GCTTCTTAGA GTCATTGAGA TTTTTACGTT TCCATATCGT ATTAAAGGTT 8380 .......... .......... .......... .......... .......... .......... 115 CATAATCTTG ATAAAATATT AATCTTTGGT AATACTCCTT GCTGGTTCAC GTTGATTGTT 8320 .......... .......... .......... .......... .......... .......... 115 CTATTGAGTT ATAAGAAATG ATTTTAATTG CATATGGTTG CTCATAATAT TCTGCTCGTG 8260 .......... .......... .......... .......... .......... .......... 115 CATAGAGTCA TTTATCATTT CACCGAGTCC CGGGCAGGGT AATGTTCATG CGGAGTTTCT 8200 .......... .......... .......... .......... .......... .......... 115 TGCATATGTC ACCGAGTTCC TCACTAGAGG GCCGGGTATG TATATTATAT GTATGATTGG 8140 .......... .......... .......... .......... .......... .......... 115 TGATGAGGAT GGTTATGATG ATGATGATGA CGGAGATGAC GTGATGATTA TTTTGCCGAG 8080 .......... .......... .......... .......... .......... .......... 115 CCCCTTATTA GGGAAGTTGG GCACCTTAAA TGTTAAATAT ATGCATGATT TTCACTTAAA 8020 .......... .......... .......... .......... .......... .......... 115 AGGGTATATG TGTAGCGATA TTTTGTTTTG ACTTGCTATA TTGGTATGCT GTCATCTTTA 7960 .......... .......... .......... .......... .......... .......... 115 CCTTATGCTT TACATACTCA TTACATTGTC TGTACTGACC CCCCTTTCCT CGGGGGGCTG 7900 .......... .......... .......... .......... .......... .......... 115 GTTTTCATGC CCGCAGGTGT AGACGCACAG TTTGGTGATC CTCCCGCCTA GGATATCTAC 7840 |||| |||||| ||| || ||||||| |||||||||| |||||||||| .......... ......GTGT AGACGCTCAG TTCGGTGATC CTCCCGCCTA GGATATCTAC 159 TCTGATGATT GGGAGAGCTC CACTGTTCCG GAGCCTAGTC GTTTTGGTAC ATAAC-TTTT 7781 | || ||| | |||||||||| ||||||| || |||| |||| ||||||||| ||||| |||| TTTGCTGAGT GGGAGAGCTC CACTGTTTCG TAGCCCAGTC ATTTTGGTAC ATAACTTTTT 219 GTGTAGTCTT TTGCTCGTCT ATGGGTATGG CGGGTCCCTG TCCCGTCGAG TTTCACTAAT 7721 |||||||||| ||||| |||| |||||||||| ||| ||||| |||||||||| |||||||| | GTGTAGTCTT TTGCTTGTCT ATGGGTATGG TGGGGCCCTG TCCCGTCGAG TTTCACTACT 279 GTACTCTTAG AGGTCTGTGG ACATTATGTG GGTTGTATAT ATATTTTTTG GATAATGGTC 7661 ||||||||| ||||| | | |||| ||| |||||||||| |||| ||||| |||||||||| ATACTCTTAG AGGTCCATAG ACATCGCGTG GGTTGTATAT ATATGTTTTG GATAATGGTC 339 TGGACATGGT TTGTTTGGGA TGTCCGCTTG TACAGGGGCA GCCTTGTCGG CTGCGTACAT 7601 |||||||||| |||||||||| ||||| |||| |||| ||||| |||||||| | |||||||||| TGGACATGGT TTGTTTGGGA TGTCCACTTG TACAAGGGCA GCCTTGTCAG CTGCGTACAT 399 CATTGTGTAT TGTGTAGTGG CAGCCTTGTC GGCATACGTA TGTTATTATG CTTTGAATAG 7541 | |||||||| |||||||||| |||||||||| ||| | |||| || ||||||| ||||| |||| CTTTGTGTAT TGTGTAGTGG CAGCCTTGTC GGC-TGCGTA TGCTATTATG CTTTGGATAG 458 TGGCGGCCTT GTCGGCTCGC GTATGTTGTT ATGGTTGAAT GATTATGACT CCTTATGAGA 7481 |||||||||| |||||||||| |||||||||| | |||||||| | |||||||| |||||||||| TGGCGGCCTT GTCGGCTCGC GTATGTTGTT ACGGTTGAAT GGTTATGACT CCTTATGAGA 518 CAGGTCCTCT TATATATAT 7462 ||| ||| || | ||||||| CAGATCCACT T-TATATAT 536 hqPGS_C06HBa0057J04.1-9-_SGN-E543103- (9278 9217,8955 8951,8655 8609,7883 7462) ******************************************************************************** EST sequence 14 +strand 644 n (File: SGN-E538156+) 1 GAGAAAACCA GCCATTAAAC TCTTGGACAG CAGCTGCAAA GAAATTGAGT CATTTATCAT 61 TTCACCGAGT CCCGGGCCGG GTAATGTTCG TGCGGAGTTT CTTGCATATG TCACCGAGTC 121 CCTCACTAGA GGGCCGGGAA TGTATATTAT ATATATGATT GGTGATGAGG ATGGTTATGA 181 TGATGATGAT GACGGAGATG ATGTGATGAC TATTTCACTG AGTCCCTCAC TAGAGGGCCG 241 GGTGTAGACG CTCAGTTTGG TGATCCTCCC GCCTAGGATA TCTACTCTGC TGTTTGGGAG 301 AGCTCCACTG TTCCGGAGCC CAGTCGTTGT GGTACATAAC TTCTTATGTA GTCTTTTGCT 361 TGTCTATGGG TATGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC TTAGAGGTCT 421 GTAGACATCG TGTGGGTTGT ATAATTATGT TTTTGATAAT GGTCTGGACA TGGTTTGTTT 481 GGGATGTCCA CTTGTACAAG TGCAACCTTG TCGGTTGTGT ACATCTTTGT GTATTGTGTA 541 GTGGCAGCCT TGACGGCTGC GTATGCTATT ATGCTTTGAA TAGTGGCAGC CTTGTCGGCT 601 CGCGTATGTT GTTACGGTTG AATGGGTATG ACTCTTTATG AGAT Predicted gene structure (within gDNA segment 9315 to 6600): Exon 1 9259 9217 ( 43 n); cDNA 5 47 ( 43 n); score: 0.814 Intron 1 9216 8255 ( 962 n); Pd: 0.991 (s: 0.81), Pa: 0.959 (s: 0.96) Exon 2 8254 8060 ( 195 n); cDNA 48 241 ( 194 n); score: 0.908 Intron 2 8059 7884 ( 176 n); Pd: 0.000 (s: 0.74), Pa: 0.000 (s: 0.96) Exon 3 7883 7481 ( 403 n); cDNA 242 643 ( 402 n); score: 0.907 MATCH C06HBa0057J04.1-9- SGN-E538156+ 0.907 641 0.995 C PGS_C06HBa0057J04.1-9-_SGN-E538156+ (9259 9217,8254 8060,7883 7481) Alignment (genomic DNA sequence = upper lines): AAACTAACCC TGCAACTCTT GGCCAGCAGC TGCAAATAAT TTGGTTAGTA ATATCCTTGT 9200 |||| | || | ||||||| || ||||||| |||||| || ||| AAACCAGCCA TTAAACTCTT GGACAGCAGC TGCAAAGAAA TTG....... .......... 47 TTGGTGTGTT AATTCTTTAG AATACCCTTG TTAATTATCC ATTAATTTTA AGAAGGGGGC 9140 .......... .......... .......... .......... .......... .......... 47 GTGACCAGTA GCTTAGGAAG TTTGTTTTAG TTATTGAATG TGCTAAGTAT GAATGGAAAC 9080 .......... .......... .......... .......... .......... .......... 47 CATAATCGGA TTATTAGTGG TGTCGTGTTG GTGCTTGGGT TGTTTTGATT AAAGCAAACT 9020 .......... .......... .......... .......... .......... .......... 47 GCAGGAAAAT TCTTTTTTGG CATTATGTAT ATGTTGAATG TGATTATGAG TATATACTCC 8960 .......... .......... .......... .......... .......... .......... 47 AAAGGATGAA TACGATAAGG TAGATGCGTT GCGAATTATA AAACGAGTTA TCACTCGGTG 8900 .......... .......... .......... .......... .......... .......... 47 TGTCGTTGCT TCGCTGCTAT GGTTGCCGAG ACGGAACTGT TTTGGGGAGG GGGCTGTTTA 8840 .......... .......... .......... .......... .......... .......... 47 ATATGATTCT TTGGGTTATA TGTGTTATTG TTATTACTGT GGATAATTTG GATTGTTGTC 8780 .......... .......... .......... .......... .......... .......... 47 GGATTGGGAC GAAGTAAGGA AAATAGGGGA GGTGCTGCCG AATTTTCGTT AGATTATTAG 8720 .......... .......... .......... .......... .......... .......... 47 CTAGCTTACA AGAAAGTGAA GCACGATGTT TATCTAAATG CGGCACGATT GTTGCTTGTT 8660 .......... .......... .......... .......... .......... .......... 47 ATAGATTAAT AGCTTGAGCA GTAAATATTG GACGTGCGGC TCGATTATAC GGTATGTAAC 8600 .......... .......... .......... .......... .......... .......... 47 GCTGTCCCTT CTTTCTTTGC TTGGCATGAC TTTTAAAAAT AAGCGAATAA CGGACAGATT 8540 .......... .......... .......... .......... .......... .......... 47 TGATACTTAC CTCTAGAGCG TCTAGGTGAC GTATATTCTT GCTTCCACAA TTATTCCTCT 8480 .......... .......... .......... .......... .......... .......... 47 ATATATCGGC TATGTCTAAG GCTATGATGA TCCCTAATAT CTATGGTAAT GCTTCTTAGA 8420 .......... .......... .......... .......... .......... .......... 47 GTCATTGAGA TTTTTACGTT TCCATATCGT ATTAAAGGTT CATAATCTTG ATAAAATATT 8360 .......... .......... .......... .......... .......... .......... 47 AATCTTTGGT AATACTCCTT GCTGGTTCAC GTTGATTGTT CTATTGAGTT ATAAGAAATG 8300 .......... .......... .......... .......... .......... .......... 47 ATTTTAATTG CATATGGTTG CTCATAATAT TCTGCTCGTG CATAGAGTCA TTTATCATTT 8240 ||||| |||||||||| .......... .......... .......... .......... .....AGTCA TTTATCATTT 62 CACCGAGTCC CGGGCAGGGT AATGTTCATG CGGAGTTTCT TGCATATGTC ACCGAGTTCC 8180 |||||||||| ||||| |||| ||||||| || |||||||||| |||||||||| ||||||| || CACCGAGTCC CGGGCCGGGT AATGTTCGTG CGGAGTTTCT TGCATATGTC ACCGAGTCCC 122 TCACTAGAGG GCCGGGTATG TATATTATAT GTATGATTGG TGATGAGGAT GGTTATGATG 8120 |||||||||| |||||| ||| |||||||||| ||||||||| |||||||||| |||||||||| TCACTAGAGG GCCGGGAATG TATATTATAT ATATGATTGG TGATGAGGAT GGTTATGATG 182 ATGATGATGA CGGAGATGAC GTGATGATTA TTTTGCCGAG CCCCTTATTA GGGAAGTTGG 8060 |||||||||| ||||||||| ||||||| || ||| | ||| |||| | || | | | || ATGATGATGA CGGAGATGAT GTGATGACTA TTTCACTGAG TCCCTCACTA GAG-GGCCGG 241 GCACCTTAAA TGTTAAATAT ATGCATGATT TTCACTTAAA AGGGTATATG TGTAGCGATA 8000 .......... .......... .......... .......... .......... .......... 241 TTTTGTTTTG ACTTGCTATA TTGGTATGCT GTCATCTTTA CCTTATGCTT TACATACTCA 7940 .......... .......... .......... .......... .......... .......... 241 TTACATTGTC TGTACTGACC CCCCTTTCCT CGGGGGGCTG GTTTTCATGC CCGCAGGTGT 7880 |||| .......... .......... .......... .......... .......... ......GTGT 245 AGACGCACAG TTTGGTGATC CTCCCGCCTA GGATATCTAC TCTGATGATT GGGAGAGCTC 7820 |||||| ||| |||||||||| |||||||||| |||||||||| |||| || || |||||||||| AGACGCTCAG TTTGGTGATC CTCCCGCCTA GGATATCTAC TCTGCTGTTT GGGAGAGCTC 305 CACTGTTCCG GAGCCTAGTC GTTTTGGTAC ATAACTT-TT GTGTAGTCTT TTGCTCGTCT 7761 |||||||||| ||||| |||| ||| |||||| ||||||| || ||||||||| ||||| |||| CACTGTTCCG GAGCCCAGTC GTTGTGGTAC ATAACTTCTT ATGTAGTCTT TTGCTTGTCT 365 ATGGGTATGG CGGGTCCCTG TCCCGTCGAG TTTCACTAAT GTACTCTTAG AGGTCTGTGG 7701 |||||||| | |||| ||||| ||||||| || |||||||| | ||||||||| |||||||| | ATGGGTAT-G CGGGGCCCTG TCCCGTCAAG TTTCACTACT ATACTCTTAG AGGTCTGTAG 424 ACATTATGTG GGTTGTATAT ATATTTTTTG GATAATGGTC TGGACATGGT TTGTTTGGGA 7641 |||| |||| ||||||||| ||| |||| |||||||||| |||||||||| |||||||||| ACATCGTGTG GGTTGTATAA TTATGTTTTT GATAATGGTC TGGACATGGT TTGTTTGGGA 484 TGTCCGCTTG TACAGGGGCA GCCTTGTCGG CTGCGTACAT CATTGTGTAT TGTGTAGTGG 7581 ||||| |||| |||| | ||| ||||||||| || |||||| | |||||||| |||||||||| TGTCCACTTG TACAAGTGCA ACCTTGTCGG TTGTGTACAT CTTTGTGTAT TGTGTAGTGG 544 CAGCCTTGTC GGCATACGTA TGTTATTATG CTTTGAATAG TGGCGGCCTT GTCGGCTCGC 7521 |||||||| | ||| | |||| || ||||||| |||||||||| |||| ||||| |||||||||| CAGCCTTGAC GGC-TGCGTA TGCTATTATG CTTTGAATAG TGGCAGCCTT GTCGGCTCGC 603 GTATGTTGTT ATGGTTGAAT GATTATGACT CCTTATGAGA 7481 |||||||||| | |||||||| | ||||||| | |||||||| GTATGTTGTT ACGGTTGAAT GGGTATGACT CTTTATGAGA 643 hqPGS_C06HBa0057J04.1-9-_SGN-E538156+ (9259 9217,8254 8060,7883 7481) ******************************************************************************** EST sequence 29 +strand 519 n (File: SGN-E310669+) 1 AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 61 ATTTATACCT TGAGCAGTAA ATATTGGACG TACGGCTCGA CTATTCGGTG TAGACGCTCA 121 GTTCGGTGAT CCTCCCGCCT AGGATATCTA CTCTGCTGTT TGGGAGAGCT CCACTGTTCC 181 GGAGCCCAGT CGTTTTGGTA CATAACTTCT TATGTAGTCT TTTGCTTGTC TATGGGTATG 241 GCGGGGCCCT GTCCCGTCAA GTTTCACTAC TATACTCTTA GAGGTCTGTA GACATCGTGT 301 GGGTTGTATA ATTATGTTTT GGATAATGGT CTGGACATGG TTTGTTTGGG ATGTCCATTT 361 GTACAAGTGC AGCCTTGTCG GTTGTGAACA TCATTGTGTA TTGTGTAGTG GCAGCCTCGT 421 CGGCTGCGTA TGCTATTATG TTTTGGATAG TGGCGGCCTT GTCGGCTCGC ATATGTTGTT 481 ACGATTTAAT GGTTATGACT CTTTATGAAA AAAAAAAAA Predicted gene structure (within gDNA segment 9706 to 5945): Exon 1 9275 9217 ( 59 n); cDNA 1 60 ( 60 n); score: 0.805 Intron 1 9216 8656 ( 561 n); Pd: 0.991 (s: 0.79), Pa: 0.880 (s: 0.89) Exon 2 8655 8609 ( 47 n); cDNA 61 107 ( 47 n); score: 0.894 Intron 2 8608 7884 ( 725 n); Pd: 0.991 (s: 0.89), Pa: 0.000 (s: 0.94) Exon 3 7883 7483 ( 401 n); cDNA 108 508 ( 401 n); score: 0.904 PPA cDNA 509 519 MATCH C06HBa0057J04.1-9- SGN-E310669+ 0.891 507 0.977 C PGS_C06HBa0057J04.1-9-_SGN-E310669+ (9275 9217,8655 8609,7883 7483) Alignment (genomic DNA sequence = upper lines): AGCAATGGAA ATGGAG-AAA CTAACCCTGC AACTCTTGGC CAGCAGCTGC AAATAATTTG 9217 ||| |||||| |||||| ||| | | || | ||||||||| |||||||||| ||| || ||| AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 60 GTTAGTAATA TCCTTGTTTG GTGTGTTAAT TCTTTAGAAT ACCCTTGTTA ATTATCCATT 9157 .......... .......... .......... .......... .......... .......... 60 AATTTTAAGA AGGGGGCGTG ACCAGTAGCT TAGGAAGTTT GTTTTAGTTA TTGAATGTGC 9097 .......... .......... .......... .......... .......... .......... 60 TAAGTATGAA TGGAAACCAT AATCGGATTA TTAGTGGTGT CGTGTTGGTG CTTGGGTTGT 9037 .......... .......... .......... .......... .......... .......... 60 TTTGATTAAA GCAAACTGCA GGAAAATTCT TTTTTGGCAT TATGTATATG TTGAATGTGA 8977 .......... .......... .......... .......... .......... .......... 60 TTATGAGTAT ATACTCCAAA GGATGAATAC GATAAGGTAG ATGCGTTGCG AATTATAAAA 8917 .......... .......... .......... .......... .......... .......... 60 CGAGTTATCA CTCGGTGTGT CGTTGCTTCG CTGCTATGGT TGCCGAGACG GAACTGTTTT 8857 .......... .......... .......... .......... .......... .......... 60 GGGGAGGGGG CTGTTTAATA TGATTCTTTG GGTTATATGT GTTATTGTTA TTACTGTGGA 8797 .......... .......... .......... .......... .......... .......... 60 TAATTTGGAT TGTTGTCGGA TTGGGACGAA GTAAGGAAAA TAGGGGAGGT GCTGCCGAAT 8737 .......... .......... .......... .......... .......... .......... 60 TTTCGTTAGA TTATTAGCTA GCTTACAAGA AAGTGAAGCA CGATGTTTAT CTAAATGCGG 8677 .......... .......... .......... .......... .......... .......... 60 CACGATTGTT GCTTGTTATA GATTAATAGC TTGAGCAGTA AATATTGGAC GTGCGGCTCG 8617 ||| ||| | |||||||||| |||||||||| || ||||||| .......... .......... .ATTTATACC TTGAGCAGTA AATATTGGAC GTACGGCTCG 99 ATTATACGGT ATGTAACGCT GTCCCTTCTT TCTTTGCTTG GCATGACTTT TAAAAATAAG 8557 | ||| || ACTATTCG.. .......... .......... .......... .......... .......... 107 CGAATAACGG ACAGATTTGA TACTTACCTC TAGAGCGTCT AGGTGACGTA TATTCTTGCT 8497 .......... .......... .......... .......... .......... .......... 107 TCCACAATTA TTCCTCTATA TATCGGCTAT GTCTAAGGCT ATGATGATCC CTAATATCTA 8437 .......... .......... .......... .......... .......... .......... 107 TGGTAATGCT TCTTAGAGTC ATTGAGATTT TTACGTTTCC ATATCGTATT AAAGGTTCAT 8377 .......... .......... .......... .......... .......... .......... 107 AATCTTGATA AAATATTAAT CTTTGGTAAT ACTCCTTGCT GGTTCACGTT GATTGTTCTA 8317 .......... .......... .......... .......... .......... .......... 107 TTGAGTTATA AGAAATGATT TTAATTGCAT ATGGTTGCTC ATAATATTCT GCTCGTGCAT 8257 .......... .......... .......... .......... .......... .......... 107 AGAGTCATTT ATCATTTCAC CGAGTCCCGG GCAGGGTAAT GTTCATGCGG AGTTTCTTGC 8197 .......... .......... .......... .......... .......... .......... 107 ATATGTCACC GAGTTCCTCA CTAGAGGGCC GGGTATGTAT ATTATATGTA TGATTGGTGA 8137 .......... .......... .......... .......... .......... .......... 107 TGAGGATGGT TATGATGATG ATGATGACGG AGATGACGTG ATGATTATTT TGCCGAGCCC 8077 .......... .......... .......... .......... .......... .......... 107 CTTATTAGGG AAGTTGGGCA CCTTAAATGT TAAATATATG CATGATTTTC ACTTAAAAGG 8017 .......... .......... .......... .......... .......... .......... 107 GTATATGTGT AGCGATATTT TGTTTTGACT TGCTATATTG GTATGCTGTC ATCTTTACCT 7957 .......... .......... .......... .......... .......... .......... 107 TATGCTTTAC ATACTCATTA CATTGTCTGT ACTGACCCCC CTTTCCTCGG GGGGCTGGTT 7897 .......... .......... .......... .......... .......... .......... 107 TTCATGCCCG CAGGTGTAGA CGCACAGTTT GGTGATCCTC CCGCCTAGGA TATCTACTCT 7837 ||||||| ||| ||||| |||||||||| |||||||||| |||||||||| .......... ...GTGTAGA CGCTCAGTTC GGTGATCCTC CCGCCTAGGA TATCTACTCT 154 GATGATTGGG AGAGCTCCAC TGTTCCGGAG CCTAGTCGTT TTGGTACATA ACTT-TTGTG 7778 | || ||||| |||||||||| |||||||||| || ||||||| |||||||||| |||| || || GCTGTTTGGG AGAGCTCCAC TGTTCCGGAG CCCAGTCGTT TTGGTACATA ACTTCTTATG 214 TAGTCTTTTG CTCGTCTATG GGTATGGCGG GTCCCTGTCC CGTCGAGTTT CACTAATGTA 7718 |||||||||| || ||||||| |||||||||| | |||||||| |||| ||||| ||||| | || TAGTCTTTTG CTTGTCTATG GGTATGGCGG GGCCCTGTCC CGTCAAGTTT CACTACTATA 274 CTCTTAGAGG TCTGTGGACA TTATGTGGGT TGTATATATA TTTTTTGGAT AATGGTCTGG 7658 |||||||||| ||||| |||| | ||||||| |||||| || | |||||||| |||||||||| CTCTTAGAGG TCTGTAGACA TCGTGTGGGT TGTATAATTA TGTTTTGGAT AATGGTCTGG 334 ACATGGTTTG TTTGGGATGT CCGCTTGTAC AGGGGCAGCC TTGTCGGCTG CGTACATCAT 7598 |||||||||| |||||||||| || |||||| | | |||||| ||||||| || | ||||||| ACATGGTTTG TTTGGGATGT CCATTTGTAC AAGTGCAGCC TTGTCGGTTG TGAACATCAT 394 TGTGTATTGT GTAGTGGCAG CCTTGTCGGC ATACGTATGT TATTATGCTT TGAATAGTGG 7538 |||||||||| |||||||||| ||| |||||| | |||||| ||||||| || || ||||||| TGTGTATTGT GTAGTGGCAG CCTCGTCGGC -TGCGTATGC TATTATGTTT TGGATAGTGG 453 CGGCCTTGTC GGCTCGCGTA TGTTGTTATG GTTGAATGAT TATGACTCCT TATGA 7483 |||||||||| ||||||| || |||||||| | || |||| | |||||||| | ||||| CGGCCTTGTC GGCTCGCATA TGTTGTTACG ATTTAATGGT TATGACTCTT TATGA 508 hqPGS_C06HBa0057J04.1-9-_SGN-E310669+ (9275 9217,8655 8609,7883 7483) ******************************************************************************** EST sequence 12 +strand 606 n (File: SGN-E538151+) 1 GAGAAAACCA GCCATTAAAC TCTTGGACAG CAGCTGCAAA GAAATTGAGT CATTTATCAT 61 TTCACCGAGT CCCGGGCCGG GTAATGTTCG TGCGGAGTTT CTTGCATATG TCACCGAGTC 121 CCTCACTAGA GGGCCGGGAA TGTATATTAT ATATATGATT GGTGATGAGG ATGGTTATGA 181 TGATGATGAT GACGGAGATG ATGTGATGAC TATTTCACTG AGTCCCTCAC TAGAGGGCCG 241 GGTGTAGACG CTCAGTTTGG TGATCCTCCC GCCTAGGATA TCTACTCTGC TGTTTGGGAG 301 AGCTCCACTG TTCCGGAGCC CAGTCGTTTT GGTACATAAC TTCTTATGTA GTCTTTTGCT 361 TGTCTATGGG TATGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC TTAGAGGTCT 421 GTAGACATCG TGTGGGTTGT ATAATTATGT TTTTGATAAT GGTCTGGACA TGGTTTGTTT 481 GGGATGTCCA CTTGTACAAG TGCAACCTTG TCGGTTGTGT ACATCTTTGT GTATTGTGTA 541 GTGGCAGCCT TGTCGGCTGC GTATGCTATT ATGCTTTGAA TAGTGGCAGC CTTGTCGGCT 601 CGCGTA Predicted gene structure (within gDNA segment 9315 to 6908): Exon 1 9259 9217 ( 43 n); cDNA 5 47 ( 43 n); score: 0.814 Intron 1 9216 8255 ( 962 n); Pd: 0.991 (s: 0.81), Pa: 0.959 (s: 0.96) Exon 2 8254 8060 ( 195 n); cDNA 48 241 ( 194 n); score: 0.908 Intron 2 8059 7884 ( 176 n); Pd: 0.000 (s: 0.74), Pa: 0.000 (s: 0.96) Exon 3 7883 7518 ( 366 n); cDNA 242 606 ( 365 n); score: 0.914 MATCH C06HBa0057J04.1-9- SGN-E538151+ 0.912 604 0.997 C PGS_C06HBa0057J04.1-9-_SGN-E538151+ (9259 9217,8254 8060,7883 7518) Alignment (genomic DNA sequence = upper lines): AAACTAACCC TGCAACTCTT GGCCAGCAGC TGCAAATAAT TTGGTTAGTA ATATCCTTGT 9200 |||| | || | ||||||| || ||||||| |||||| || ||| AAACCAGCCA TTAAACTCTT GGACAGCAGC TGCAAAGAAA TTG....... .......... 47 TTGGTGTGTT AATTCTTTAG AATACCCTTG TTAATTATCC ATTAATTTTA AGAAGGGGGC 9140 .......... .......... .......... .......... .......... .......... 47 GTGACCAGTA GCTTAGGAAG TTTGTTTTAG TTATTGAATG TGCTAAGTAT GAATGGAAAC 9080 .......... .......... .......... .......... .......... .......... 47 CATAATCGGA TTATTAGTGG TGTCGTGTTG GTGCTTGGGT TGTTTTGATT AAAGCAAACT 9020 .......... .......... .......... .......... .......... .......... 47 GCAGGAAAAT TCTTTTTTGG CATTATGTAT ATGTTGAATG TGATTATGAG TATATACTCC 8960 .......... .......... .......... .......... .......... .......... 47 AAAGGATGAA TACGATAAGG TAGATGCGTT GCGAATTATA AAACGAGTTA TCACTCGGTG 8900 .......... .......... .......... .......... .......... .......... 47 TGTCGTTGCT TCGCTGCTAT GGTTGCCGAG ACGGAACTGT TTTGGGGAGG GGGCTGTTTA 8840 .......... .......... .......... .......... .......... .......... 47 ATATGATTCT TTGGGTTATA TGTGTTATTG TTATTACTGT GGATAATTTG GATTGTTGTC 8780 .......... .......... .......... .......... .......... .......... 47 GGATTGGGAC GAAGTAAGGA AAATAGGGGA GGTGCTGCCG AATTTTCGTT AGATTATTAG 8720 .......... .......... .......... .......... .......... .......... 47 CTAGCTTACA AGAAAGTGAA GCACGATGTT TATCTAAATG CGGCACGATT GTTGCTTGTT 8660 .......... .......... .......... .......... .......... .......... 47 ATAGATTAAT AGCTTGAGCA GTAAATATTG GACGTGCGGC TCGATTATAC GGTATGTAAC 8600 .......... .......... .......... .......... .......... .......... 47 GCTGTCCCTT CTTTCTTTGC TTGGCATGAC TTTTAAAAAT AAGCGAATAA CGGACAGATT 8540 .......... .......... .......... .......... .......... .......... 47 TGATACTTAC CTCTAGAGCG TCTAGGTGAC GTATATTCTT GCTTCCACAA TTATTCCTCT 8480 .......... .......... .......... .......... .......... .......... 47 ATATATCGGC TATGTCTAAG GCTATGATGA TCCCTAATAT CTATGGTAAT GCTTCTTAGA 8420 .......... .......... .......... .......... .......... .......... 47 GTCATTGAGA TTTTTACGTT TCCATATCGT ATTAAAGGTT CATAATCTTG ATAAAATATT 8360 .......... .......... .......... .......... .......... .......... 47 AATCTTTGGT AATACTCCTT GCTGGTTCAC GTTGATTGTT CTATTGAGTT ATAAGAAATG 8300 .......... .......... .......... .......... .......... .......... 47 ATTTTAATTG CATATGGTTG CTCATAATAT TCTGCTCGTG CATAGAGTCA TTTATCATTT 8240 ||||| |||||||||| .......... .......... .......... .......... .....AGTCA TTTATCATTT 62 CACCGAGTCC CGGGCAGGGT AATGTTCATG CGGAGTTTCT TGCATATGTC ACCGAGTTCC 8180 |||||||||| ||||| |||| ||||||| || |||||||||| |||||||||| ||||||| || CACCGAGTCC CGGGCCGGGT AATGTTCGTG CGGAGTTTCT TGCATATGTC ACCGAGTCCC 122 TCACTAGAGG GCCGGGTATG TATATTATAT GTATGATTGG TGATGAGGAT GGTTATGATG 8120 |||||||||| |||||| ||| |||||||||| ||||||||| |||||||||| |||||||||| TCACTAGAGG GCCGGGAATG TATATTATAT ATATGATTGG TGATGAGGAT GGTTATGATG 182 ATGATGATGA CGGAGATGAC GTGATGATTA TTTTGCCGAG CCCCTTATTA GGGAAGTTGG 8060 |||||||||| ||||||||| ||||||| || ||| | ||| |||| | || | | | || ATGATGATGA CGGAGATGAT GTGATGACTA TTTCACTGAG TCCCTCACTA GAG-GGCCGG 241 GCACCTTAAA TGTTAAATAT ATGCATGATT TTCACTTAAA AGGGTATATG TGTAGCGATA 8000 .......... .......... .......... .......... .......... .......... 241 TTTTGTTTTG ACTTGCTATA TTGGTATGCT GTCATCTTTA CCTTATGCTT TACATACTCA 7940 .......... .......... .......... .......... .......... .......... 241 TTACATTGTC TGTACTGACC CCCCTTTCCT CGGGGGGCTG GTTTTCATGC CCGCAGGTGT 7880 |||| .......... .......... .......... .......... .......... ......GTGT 245 AGACGCACAG TTTGGTGATC CTCCCGCCTA GGATATCTAC TCTGATGATT GGGAGAGCTC 7820 |||||| ||| |||||||||| |||||||||| |||||||||| |||| || || |||||||||| AGACGCTCAG TTTGGTGATC CTCCCGCCTA GGATATCTAC TCTGCTGTTT GGGAGAGCTC 305 CACTGTTCCG GAGCCTAGTC GTTTTGGTAC ATAACTT-TT GTGTAGTCTT TTGCTCGTCT 7761 |||||||||| ||||| |||| |||||||||| ||||||| || ||||||||| ||||| |||| CACTGTTCCG GAGCCCAGTC GTTTTGGTAC ATAACTTCTT ATGTAGTCTT TTGCTTGTCT 365 ATGGGTATGG CGGGTCCCTG TCCCGTCGAG TTTCACTAAT GTACTCTTAG AGGTCTGTGG 7701 |||||||| | |||| ||||| ||||||| || |||||||| | ||||||||| |||||||| | ATGGGTAT-G CGGGGCCCTG TCCCGTCAAG TTTCACTACT ATACTCTTAG AGGTCTGTAG 424 ACATTATGTG GGTTGTATAT ATATTTTTTG GATAATGGTC TGGACATGGT TTGTTTGGGA 7641 |||| |||| ||||||||| ||| |||| |||||||||| |||||||||| |||||||||| ACATCGTGTG GGTTGTATAA TTATGTTTTT GATAATGGTC TGGACATGGT TTGTTTGGGA 484 TGTCCGCTTG TACAGGGGCA GCCTTGTCGG CTGCGTACAT CATTGTGTAT TGTGTAGTGG 7581 ||||| |||| |||| | ||| ||||||||| || |||||| | |||||||| |||||||||| TGTCCACTTG TACAAGTGCA ACCTTGTCGG TTGTGTACAT CTTTGTGTAT TGTGTAGTGG 544 CAGCCTTGTC GGCATACGTA TGTTATTATG CTTTGAATAG TGGCGGCCTT GTCGGCTCGC 7521 |||||||||| ||| | |||| || ||||||| |||||||||| |||| ||||| |||||||||| CAGCCTTGTC GGC-TGCGTA TGCTATTATG CTTTGAATAG TGGCAGCCTT GTCGGCTCGC 603 GTA 7518 ||| GTA 606 hqPGS_C06HBa0057J04.1-9-_SGN-E538151+ (9259 9217,8254 8060,7883 7518) ******************************************************************************** EST sequence 23 +strand 470 n (File: SGN-E268096+) 1 GAAAACCAGC CATTAAACTC TTGGACAGCA GCTGCAAAGA AATTGAGTCA TTTATCATTG 61 CACCGAGTCC CGGGCCGGGT AATGTTCGTG CGGAGTTTCT TGCATATGTC ACCGAGTCCC 121 TCACTAGAGG GCCGGGAATG TATATTATAT ATATGATTGG TGATGAGGAT GGTTATGATG 181 ATGATGATGA CGGAGATGAT GTGATGACTA TTTCACTGAG TCCCTCACTA GAGGGCCGGG 241 TGTAGACGCT CAGTTTGGTG ATCCTCCCGC CTAGGATATC TACTCTGCTG TTTGGGAGAG 301 CTCCACTGTT CCGGAGCCCA GTCGTTTTGG TACATAACTT CTTATGTAGT CTTTTGCTTG 361 TCTATGGGTA TGCGGGGCCC TGTCCCGTCA AGTTTCACTA CTATACTCTT AGAGGTCTGT 421 AGACATCGTG TGGGTAGTAT AATTATGTTT TTGATAATGG GCTGGACATG Predicted gene structure (within gDNA segment 9921 to 5873): Exon 1 9259 9217 ( 43 n); cDNA 3 45 ( 43 n); score: 0.814 Intron 1 9216 8255 ( 962 n); Pd: 0.991 (s: 0.81), Pa: 0.959 (s: 0.94) Exon 2 8254 8060 ( 195 n); cDNA 46 239 ( 194 n); score: 0.903 Intron 2 8059 7884 ( 176 n); Pd: 0.000 (s: 0.74), Pa: 0.000 (s: 0.96) Exon 3 7883 7653 ( 231 n); cDNA 240 470 ( 231 n); score: 0.903 MATCH C06HBa0057J04.1-9- SGN-E268096+ 0.903 469 0.998 C PGS_C06HBa0057J04.1-9-_SGN-E268096+ (9259 9217,8254 8060,7883 7653) Alignment (genomic DNA sequence = upper lines): AAACTAACCC TGCAACTCTT GGCCAGCAGC TGCAAATAAT TTGGTTAGTA ATATCCTTGT 9200 |||| | || | ||||||| || ||||||| |||||| || ||| AAACCAGCCA TTAAACTCTT GGACAGCAGC TGCAAAGAAA TTG....... .......... 45 TTGGTGTGTT AATTCTTTAG AATACCCTTG TTAATTATCC ATTAATTTTA AGAAGGGGGC 9140 .......... .......... .......... .......... .......... .......... 45 GTGACCAGTA GCTTAGGAAG TTTGTTTTAG TTATTGAATG TGCTAAGTAT GAATGGAAAC 9080 .......... .......... .......... .......... .......... .......... 45 CATAATCGGA TTATTAGTGG TGTCGTGTTG GTGCTTGGGT TGTTTTGATT AAAGCAAACT 9020 .......... .......... .......... .......... .......... .......... 45 GCAGGAAAAT TCTTTTTTGG CATTATGTAT ATGTTGAATG TGATTATGAG TATATACTCC 8960 .......... .......... .......... .......... .......... .......... 45 AAAGGATGAA TACGATAAGG TAGATGCGTT GCGAATTATA AAACGAGTTA TCACTCGGTG 8900 .......... .......... .......... .......... .......... .......... 45 TGTCGTTGCT TCGCTGCTAT GGTTGCCGAG ACGGAACTGT TTTGGGGAGG GGGCTGTTTA 8840 .......... .......... .......... .......... .......... .......... 45 ATATGATTCT TTGGGTTATA TGTGTTATTG TTATTACTGT GGATAATTTG GATTGTTGTC 8780 .......... .......... .......... .......... .......... .......... 45 GGATTGGGAC GAAGTAAGGA AAATAGGGGA GGTGCTGCCG AATTTTCGTT AGATTATTAG 8720 .......... .......... .......... .......... .......... .......... 45 CTAGCTTACA AGAAAGTGAA GCACGATGTT TATCTAAATG CGGCACGATT GTTGCTTGTT 8660 .......... .......... .......... .......... .......... .......... 45 ATAGATTAAT AGCTTGAGCA GTAAATATTG GACGTGCGGC TCGATTATAC GGTATGTAAC 8600 .......... .......... .......... .......... .......... .......... 45 GCTGTCCCTT CTTTCTTTGC TTGGCATGAC TTTTAAAAAT AAGCGAATAA CGGACAGATT 8540 .......... .......... .......... .......... .......... .......... 45 TGATACTTAC CTCTAGAGCG TCTAGGTGAC GTATATTCTT GCTTCCACAA TTATTCCTCT 8480 .......... .......... .......... .......... .......... .......... 45 ATATATCGGC TATGTCTAAG GCTATGATGA TCCCTAATAT CTATGGTAAT GCTTCTTAGA 8420 .......... .......... .......... .......... .......... .......... 45 GTCATTGAGA TTTTTACGTT TCCATATCGT ATTAAAGGTT CATAATCTTG ATAAAATATT 8360 .......... .......... .......... .......... .......... .......... 45 AATCTTTGGT AATACTCCTT GCTGGTTCAC GTTGATTGTT CTATTGAGTT ATAAGAAATG 8300 .......... .......... .......... .......... .......... .......... 45 ATTTTAATTG CATATGGTTG CTCATAATAT TCTGCTCGTG CATAGAGTCA TTTATCATTT 8240 ||||| ||||||||| .......... .......... .......... .......... .....AGTCA TTTATCATTG 60 CACCGAGTCC CGGGCAGGGT AATGTTCATG CGGAGTTTCT TGCATATGTC ACCGAGTTCC 8180 |||||||||| ||||| |||| ||||||| || |||||||||| |||||||||| ||||||| || CACCGAGTCC CGGGCCGGGT AATGTTCGTG CGGAGTTTCT TGCATATGTC ACCGAGTCCC 120 TCACTAGAGG GCCGGGTATG TATATTATAT GTATGATTGG TGATGAGGAT GGTTATGATG 8120 |||||||||| |||||| ||| |||||||||| ||||||||| |||||||||| |||||||||| TCACTAGAGG GCCGGGAATG TATATTATAT ATATGATTGG TGATGAGGAT GGTTATGATG 180 ATGATGATGA CGGAGATGAC GTGATGATTA TTTTGCCGAG CCCCTTATTA GGGAAGTTGG 8060 |||||||||| ||||||||| ||||||| || ||| | ||| |||| | || | | | || ATGATGATGA CGGAGATGAT GTGATGACTA TTTCACTGAG TCCCTCACTA GAG-GGCCGG 239 GCACCTTAAA TGTTAAATAT ATGCATGATT TTCACTTAAA AGGGTATATG TGTAGCGATA 8000 .......... .......... .......... .......... .......... .......... 239 TTTTGTTTTG ACTTGCTATA TTGGTATGCT GTCATCTTTA CCTTATGCTT TACATACTCA 7940 .......... .......... .......... .......... .......... .......... 239 TTACATTGTC TGTACTGACC CCCCTTTCCT CGGGGGGCTG GTTTTCATGC CCGCAGGTGT 7880 |||| .......... .......... .......... .......... .......... ......GTGT 243 AGACGCACAG TTTGGTGATC CTCCCGCCTA GGATATCTAC TCTGATGATT GGGAGAGCTC 7820 |||||| ||| |||||||||| |||||||||| |||||||||| |||| || || |||||||||| AGACGCTCAG TTTGGTGATC CTCCCGCCTA GGATATCTAC TCTGCTGTTT GGGAGAGCTC 303 CACTGTTCCG GAGCCTAGTC GTTTTGGTAC ATAACTT-TT GTGTAGTCTT TTGCTCGTCT 7761 |||||||||| ||||| |||| |||||||||| ||||||| || ||||||||| ||||| |||| CACTGTTCCG GAGCCCAGTC GTTTTGGTAC ATAACTTCTT ATGTAGTCTT TTGCTTGTCT 363 ATGGGTATGG CGGGTCCCTG TCCCGTCGAG TTTCACTAAT GTACTCTTAG AGGTCTGTGG 7701 |||||||| | |||| ||||| ||||||| || |||||||| | ||||||||| |||||||| | ATGGGTAT-G CGGGGCCCTG TCCCGTCAAG TTTCACTACT ATACTCTTAG AGGTCTGTAG 422 ACATTATGTG GGTTGTATAT ATATTTTTTG GATAATGGTC TGGACATG 7653 |||| |||| ||| ||||| ||| |||| |||||||| | |||||||| ACATCGTGTG GGTAGTATAA TTATGTTTTT GATAATGGGC TGGACATG 470 hqPGS_C06HBa0057J04.1-9-_SGN-E268096+ (9259 9217,8254 8060,7883 7653) ******************************************************************************** EST sequence 10 +strand 495 n (File: SGN-E306317+) 1 TAAACTCTTG GACAGCAGCT GCAAAGAAAT TGGTGTAGAC GCTCAGTTCG GTGATCCTCC 61 CGCCTAGGAT ATCTACTCTG CTGTTTGGGA GAGCTCCACT GTTCCGGAGC CCAGTCGTTT 121 TGGTACATAA CTTCTTATGT AGTCTTTTGC TTGTCTATGG GTATGGCGGG GCCCTGTCCC 181 GTCAAGTTTC ACTACTATAC TCTTAGAGGT CTGTAGACAT CGTGTGGGTT GTATAATTAT 241 GTTTTGGATA ATGGTCTGGA CATGGTTTGT TTGGGATGTC CATTTGTACA AGTGCAGCCT 301 TGTCGGTTGT GAACATCATT GTGTATTGTG TAGTGGCAGC CTCGTCGGCT GCGTATGCTA 361 TTATGTTTTG GATAGTGGCG GCCTTGTCGG CTCGCATATG TTGTTACGAT TTAATGGTTA 421 TGACTCTTTA TGAGATAGAT CCACTTTATA TATATATATA TATATATGGC GTTGGGTTTN 481 AAAAAAAAAA AAAAA Predicted gene structure (within gDNA segment 8956 to 5435): Exon 1 7883 7462 ( 422 n); cDNA 33 453 ( 421 n); score: 0.899 PPA cDNA 481 495 MATCH C06HBa0057J04.1-9- SGN-E306317+ 0.899 422 0.853 C PGS_C06HBa0057J04.1-9-_SGN-E306317+ (7883 7462) Alignment (genomic DNA sequence = upper lines): GTGTAGACGC ACAGTTTGGT GATCCTCCCG CCTAGGATAT CTACTCTGAT GATTGGGAGA 7824 |||||||||| ||||| ||| |||||||||| |||||||||| |||||||| | | |||||||| GTGTAGACGC TCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT GTTTGGGAGA 92 GCTCCACTGT TCCGGAGCCT AGTCGTTTTG GTACATAACT T-TTGTGTAG TCTTTTGCTC 7765 |||||||||| ||||||||| |||||||||| |||||||||| | || ||||| ||||||||| GCTCCACTGT TCCGGAGCCC AGTCGTTTTG GTACATAACT TCTTATGTAG TCTTTTGCTT 152 GTCTATGGGT ATGGCGGGTC CCTGTCCCGT CGAGTTTCAC TAATGTACTC TTAGAGGTCT 7705 |||||||||| |||||||| | |||||||||| | |||||||| || | ||||| |||||||||| GTCTATGGGT ATGGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC TTAGAGGTCT 212 GTGGACATTA TGTGGGTTGT ATATATATTT TTTGGATAAT GGTCTGGACA TGGTTTGTTT 7645 || ||||| |||||||||| ||| ||| | |||||||||| |||||||||| |||||||||| GTAGACATCG TGTGGGTTGT ATAATTATGT TTTGGATAAT GGTCTGGACA TGGTTTGTTT 272 GGGATGTCCG CTTGTACAGG GGCAGCCTTG TCGGCTGCGT ACATCATTGT GTATTGTGTA 7585 ||||||||| ||||||| | ||||||||| |||| || | |||||||||| |||||||||| GGGATGTCCA TTTGTACAAG TGCAGCCTTG TCGGTTGTGA ACATCATTGT GTATTGTGTA 332 GTGGCAGCCT TGTCGGCATA CGTATGTTAT TATGCTTTGA ATAGTGGCGG CCTTGTCGGC 7525 |||||||||| |||||| | |||||| ||| |||| |||| |||||||||| |||||||||| GTGGCAGCCT CGTCGGC-TG CGTATGCTAT TATGTTTTGG ATAGTGGCGG CCTTGTCGGC 391 TCGCGTATGT TGTTATGGTT GAATGATTAT GACTCCTTAT GAGACAGGTC CTCTTATATA 7465 |||| ||||| ||||| | || |||| |||| ||||| |||| |||| || || | ||| |||| TCGCATATGT TGTTACGATT TAATGGTTAT GACTCTTTAT GAGATAGATC CACTT-TATA 450 TAT 7462 ||| TAT 453 hqPGS_C06HBa0057J04.1-9-_SGN-E306317+ (7883 7462) ******************************************************************************** EST sequence 27 +strand 523 n (File: SGN-E303695+) 1 AAATGGAGAA AACCAGCCAT TAAACTCTTG GACAGCAGCT GCAAAGAAAT TGGTGTAGAC 61 GCTCAGTTCG GTGATCCTCC CGCCTAGGAT ATCTACTCTG CTTTTTGGGA GAGCTCCACT 121 GTTCCGGAGC CCAGTCGTTT TGGTACATAA CTTCTTATGT AGTCTTTTGC TTGTCTATGG 181 GTATGGCGGG GCCCTGTCCC GTCAAGTTTC ACTACTATAC TCTTAGAGGT CTGTAGACAT 241 CGTGTGGGTT GTATAATTAT GTTTTGGATA ATGGTCTGGA CATGGTTTGT TTGGGATGTC 301 CATTTGTACA AGTGCAGCCT TGTCGGTTGT GAACATCATT GTGTATTGTG TAGTGGCAGC 361 CTCGTCGGCT GCGTATGCTA TTATGTTTTG GATAGTGGCG GCCTTGTCGG CTCGCATATG 421 TTGTTACGAT TTAATGGTTA TGACTCTTTA TGAGATAGAT CCACTTTATA TATATATATA 481 TATATATGGC GTTGGGTTTA GCTTGATTTG ATTAAAAAAA AAA Predicted gene structure (within gDNA segment 9156 to 5355): Exon 1 7883 7462 ( 422 n); cDNA 53 473 ( 421 n); score: 0.897 PPA cDNA 514 523 MATCH C06HBa0057J04.1-9- SGN-E303695+ 0.897 422 0.807 C PGS_C06HBa0057J04.1-9-_SGN-E303695+ (7883 7462) Alignment (genomic DNA sequence = upper lines): GTGTAGACGC ACAGTTTGGT GATCCTCCCG CCTAGGATAT CTACTCTGAT GATTGGGAGA 7824 |||||||||| ||||| ||| |||||||||| |||||||||| |||||||| | |||||||| GTGTAGACGC TCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT TTTTGGGAGA 112 GCTCCACTGT TCCGGAGCCT AGTCGTTTTG GTACATAACT T-TTGTGTAG TCTTTTGCTC 7765 |||||||||| ||||||||| |||||||||| |||||||||| | || ||||| ||||||||| GCTCCACTGT TCCGGAGCCC AGTCGTTTTG GTACATAACT TCTTATGTAG TCTTTTGCTT 172 GTCTATGGGT ATGGCGGGTC CCTGTCCCGT CGAGTTTCAC TAATGTACTC TTAGAGGTCT 7705 |||||||||| |||||||| | |||||||||| | |||||||| || | ||||| |||||||||| GTCTATGGGT ATGGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC TTAGAGGTCT 232 GTGGACATTA TGTGGGTTGT ATATATATTT TTTGGATAAT GGTCTGGACA TGGTTTGTTT 7645 || ||||| |||||||||| ||| ||| | |||||||||| |||||||||| |||||||||| GTAGACATCG TGTGGGTTGT ATAATTATGT TTTGGATAAT GGTCTGGACA TGGTTTGTTT 292 GGGATGTCCG CTTGTACAGG GGCAGCCTTG TCGGCTGCGT ACATCATTGT GTATTGTGTA 7585 ||||||||| ||||||| | ||||||||| |||| || | |||||||||| |||||||||| GGGATGTCCA TTTGTACAAG TGCAGCCTTG TCGGTTGTGA ACATCATTGT GTATTGTGTA 352 GTGGCAGCCT TGTCGGCATA CGTATGTTAT TATGCTTTGA ATAGTGGCGG CCTTGTCGGC 7525 |||||||||| |||||| | |||||| ||| |||| |||| |||||||||| |||||||||| GTGGCAGCCT CGTCGGC-TG CGTATGCTAT TATGTTTTGG ATAGTGGCGG CCTTGTCGGC 411 TCGCGTATGT TGTTATGGTT GAATGATTAT GACTCCTTAT GAGACAGGTC CTCTTATATA 7465 |||| ||||| ||||| | || |||| |||| ||||| |||| |||| || || | ||| |||| TCGCATATGT TGTTACGATT TAATGGTTAT GACTCTTTAT GAGATAGATC CACTT-TATA 470 TAT 7462 ||| TAT 473 hqPGS_C06HBa0057J04.1-9-_SGN-E303695+ (7883 7462) ******************************************************************************** EST sequence 1 -strand 432 n (File: SGN-E225616-) 1 TATTCGGTGT AGACGCTCAG TTCGGTGATC CTCCCGCCTA GGATATCTAC TCTGCTTTTT 61 GGGAGAGCTC CACTGTTCCG GAGCCCAGTC GTTTTGGTAC ATAACTTCTT ATGTAGTCTT 121 TTGCTTGTCT ATGGGTATGG CGGGGCCCTG TCCCGTCAAG TTTCACTACT ATACTCTTAG 181 AGGTCTGTAG ACATCGTGTG GGTTGTATAA TTATGTTTTG GATAATGGTC TGGACATGGT 241 TTGTTTGGGA TGTCCATTTG TACAAGTGCA GCCTTGTCGG TTGTGAACAT CATTGTGTAT 301 TGTGTAGTGG CAGCCTCGTC GGCTGCGTAT GCTATTATGT TTTGGATAGT GGCGGCCTTG 361 TCGGCTCGCA TATGTTGTTA CGATTTAATG GTTATGACTC TTTATGAAAA AACCAAAAAA 421 AAAAAAAAAA AA Predicted gene structure (within gDNA segment 8706 to 5815): Exon 1 7884 7483 ( 402 n); cDNA 6 407 ( 402 n); score: 0.902 PPA cDNA 415 432 MATCH C06HBa0057J04.1-9- SGN-E225616- 0.902 402 0.931 C PGS_C06HBa0057J04.1-9-_SGN-E225616- (7884 7483) Alignment (genomic DNA sequence = upper lines): GGTGTAGACG CACAGTTTGG TGATCCTCCC GCCTAGGATA TCTACTCTGA TGATTGGGAG 7825 |||||||||| | ||||| || |||||||||| |||||||||| ||||||||| | ||||||| GGTGTAGACG CTCAGTTCGG TGATCCTCCC GCCTAGGATA TCTACTCTGC TTTTTGGGAG 65 AGCTCCACTG TTCCGGAGCC TAGTCGTTTT GGTACATAAC TT-TTGTGTA GTCTTTTGCT 7766 |||||||||| |||||||||| ||||||||| |||||||||| || || |||| |||||||||| AGCTCCACTG TTCCGGAGCC CAGTCGTTTT GGTACATAAC TTCTTATGTA GTCTTTTGCT 125 CGTCTATGGG TATGGCGGGT CCCTGTCCCG TCGAGTTTCA CTAATGTACT CTTAGAGGTC 7706 ||||||||| ||||||||| |||||||||| || ||||||| ||| | |||| |||||||||| TGTCTATGGG TATGGCGGGG CCCTGTCCCG TCAAGTTTCA CTACTATACT CTTAGAGGTC 185 TGTGGACATT ATGTGGGTTG TATATATATT TTTTGGATAA TGGTCTGGAC ATGGTTTGTT 7646 ||| ||||| ||||||||| |||| ||| |||||||||| |||||||||| |||||||||| TGTAGACATC GTGTGGGTTG TATAATTATG TTTTGGATAA TGGTCTGGAC ATGGTTTGTT 245 TGGGATGTCC GCTTGTACAG GGGCAGCCTT GTCGGCTGCG TACATCATTG TGTATTGTGT 7586 |||||||||| ||||||| | |||||||| ||||| || | ||||||||| |||||||||| TGGGATGTCC ATTTGTACAA GTGCAGCCTT GTCGGTTGTG AACATCATTG TGTATTGTGT 305 AGTGGCAGCC TTGTCGGCAT ACGTATGTTA TTATGCTTTG AATAGTGGCG GCCTTGTCGG 7526 |||||||||| | |||||| | |||||| || ||||| |||| ||||||||| |||||||||| AGTGGCAGCC TCGTCGGC-T GCGTATGCTA TTATGTTTTG GATAGTGGCG GCCTTGTCGG 364 CTCGCGTATG TTGTTATGGT TGAATGATTA TGACTCCTTA TGA 7483 ||||| |||| |||||| | | | |||| ||| |||||| ||| ||| CTCGCATATG TTGTTACGAT TTAATGGTTA TGACTCTTTA TGA 407 hqPGS_C06HBa0057J04.1-9-_SGN-E225616- (7884 7483) ******************************************************************************** EST sequence 19 +strand 453 n (File: SGN-E303256+) 1 AACCAGCCAT TAAACTCTTG GACAGCAGCT GCAAAGAAAT TGGTGTAGAC GCTCAGTTCG 61 GTGATCCTCC CGCCTAGGAT ATCTACTCTG CTTTTTGGGA GAGCTCCACT GTTCCGGAGC 121 CCAGTCGTTT TGGTACATAA CTTCTTATGT AGTCTTTTGC TTGTCTATGG GTATGGCGGG 181 GCCCTGTCCC GTCAAGTTTC ACTACTATAC TCTTAGAGGT CTGTAGACAT CGTGTGGGTT 241 GTATAATTAT GTTTTGGATA ATGGTCTGGA CATGGTTTGT TTGGGATGTC CATTTGTACA 301 AGTGCAGCCT TGTCGGTTGT GAACATCATT GTGTATTGTG TAGTGGCAGC CTCGTCGGCT 361 GCGTATGCTA TTATGTTTTG GATAGTGGCG GCCTTGTCGG CTCGCATATG TTGTTACGAT 421 TTAATGGTTA TGACTCTTTA TGAAAAAAAA AAA Predicted gene structure (within gDNA segment 9056 to 5955): Exon 1 7883 7483 ( 401 n); cDNA 43 443 ( 401 n); score: 0.901 PPA cDNA 444 453 MATCH C06HBa0057J04.1-9- SGN-E303256+ 0.901 401 0.885 C PGS_C06HBa0057J04.1-9-_SGN-E303256+ (7883 7483) Alignment (genomic DNA sequence = upper lines): GTGTAGACGC ACAGTTTGGT GATCCTCCCG CCTAGGATAT CTACTCTGAT GATTGGGAGA 7824 |||||||||| ||||| ||| |||||||||| |||||||||| |||||||| | |||||||| GTGTAGACGC TCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT TTTTGGGAGA 102 GCTCCACTGT TCCGGAGCCT AGTCGTTTTG GTACATAACT T-TTGTGTAG TCTTTTGCTC 7765 |||||||||| ||||||||| |||||||||| |||||||||| | || ||||| ||||||||| GCTCCACTGT TCCGGAGCCC AGTCGTTTTG GTACATAACT TCTTATGTAG TCTTTTGCTT 162 GTCTATGGGT ATGGCGGGTC CCTGTCCCGT CGAGTTTCAC TAATGTACTC TTAGAGGTCT 7705 |||||||||| |||||||| | |||||||||| | |||||||| || | ||||| |||||||||| GTCTATGGGT ATGGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC TTAGAGGTCT 222 GTGGACATTA TGTGGGTTGT ATATATATTT TTTGGATAAT GGTCTGGACA TGGTTTGTTT 7645 || ||||| |||||||||| ||| ||| | |||||||||| |||||||||| |||||||||| GTAGACATCG TGTGGGTTGT ATAATTATGT TTTGGATAAT GGTCTGGACA TGGTTTGTTT 282 GGGATGTCCG CTTGTACAGG GGCAGCCTTG TCGGCTGCGT ACATCATTGT GTATTGTGTA 7585 ||||||||| ||||||| | ||||||||| |||| || | |||||||||| |||||||||| GGGATGTCCA TTTGTACAAG TGCAGCCTTG TCGGTTGTGA ACATCATTGT GTATTGTGTA 342 GTGGCAGCCT TGTCGGCATA CGTATGTTAT TATGCTTTGA ATAGTGGCGG CCTTGTCGGC 7525 |||||||||| |||||| | |||||| ||| |||| |||| |||||||||| |||||||||| GTGGCAGCCT CGTCGGC-TG CGTATGCTAT TATGTTTTGG ATAGTGGCGG CCTTGTCGGC 401 TCGCGTATGT TGTTATGGTT GAATGATTAT GACTCCTTAT GA 7483 |||| ||||| ||||| | || |||| |||| ||||| |||| || TCGCATATGT TGTTACGATT TAATGGTTAT GACTCTTTAT GA 443 hqPGS_C06HBa0057J04.1-9-_SGN-E303256+ (7883 7483) ******************************************************************************** EST sequence 17 +strand 691 n (File: SGN-E328093+) 1 GAAAACCAGC CATTAAACTC TTGGACAGCA GCTGCAAAGA AATTGGTTAG TAATCTCTTT 61 GCTTGGTTTG TTAATTCCTT AGAATACCTT TGTTAATTAG ACATTTATGT TAAGAAGGGG 121 GACGTGAACA GTATCTTAGG AATTTGTTTT AGTTATTGAA TGTGCTAAGG ATGAGCAGAA 181 ACCATGATCG GATTGCTAGC GGTGTTATAT TTGTGTTGGG CTGTTTTGAT TAAAGTAAGC 241 TGCTGGAAAT TCTGTTTTGG TGTTATGCAT ATGTTAATAT GATTATGGGT ATATACTCCA 301 AAGGATGAAT ACAATAAGGT AGATGTGTTG CGAATTATAA AACGAATTAT CGGTCGGTGT 361 GTCGTTGTTT TGTTACTATG GTTGCTAAAA ACGGAACTGT TTTGGGGGAG GCTGTTTAAT 421 ATGATTTGTT GGATTATATG TGTTGTTGGT ATTGTTGTGG ATAATTTGGG TTGTTGTTGG 481 ATTGGGATGA AGTAAAGAAA ATAGGGGAAG TGCTGCCGGA TTTTCGTTAG ATTATTAGCT 541 AGCTTACATA AGTAGTAAGC GCGACATTTA TCTAATTGCG GCACGATTGG TGCTTGTTAT 601 AGATTTATAC CTTGAGCAGT AAATATTGGA CGTACGGCTC GACTATTCGG TATGTAACGC 661 TATCCTTTCC TTCTTTGTTT GGCATGACCT T Predicted gene structure (within gDNA segment 9921 to 1897): Exon 1 9259 8567 ( 693 n); cDNA 3 691 ( 689 n); score: 0.855 MATCH C06HBa0057J04.1-9- SGN-E328093+ 0.855 693 1.003 C PGS_C06HBa0057J04.1-9-_SGN-E328093+ (9259 8567) Alignment (genomic DNA sequence = upper lines): AAACTAACCC TGCAACTCTT GGCCAGCAGC TGCAAATAAT TTGGTTAGTA ATATCCTTGT 9200 |||| | || | ||||||| || ||||||| |||||| || |||||||||| || || ||| AAACCAGCCA TTAAACTCTT GGACAGCAGC TGCAAAGAAA TTGGTTAGTA ATCTCTTTGC 62 TTGGTGTGTT AATTCTTTAG AATACCCTTG TTAATTATCC ATTAATTTTA AGAAGGGGG- 9141 ||||| |||| ||||| |||| |||||| ||| ||||||| | ||| || ||| ||||||||| TTGGTTTGTT AATTCCTTAG AATACCTTTG TTAATTAGAC ATTTATGTTA AGAAGGGGGA 122 CGTGACCAGT AGCTTAGGAA GTTTGTTTTA GTTATTGAAT GTGCTAAGTA TGAATGGAAA 9081 ||||| |||| | |||||||| ||||||||| |||||||||| |||||||| | ||| |||| CGTGAACAGT ATCTTAGGAA -TTTGTTTTA GTTATTGAAT GTGCTAAGGA TGAGCAGAAA 181 CCATAATCGG ATTATTAGTG GTGTCGTGTT GGTGCTTGGG TTGTTTTGAT TAAAGCAAAC 9021 |||| ||||| ||| ||| | |||| | || ||| ||||| ||||||||| ||||| || | CCATGATCGG ATTGCTAGCG GTGTTATATT TGTG-TTGGG CTGTTTTGAT TAAAGTAAGC 240 TGCAGGAAAA TTCTTTTTTG GCATTATGTA TATGTTGAAT GTGATTATGA GTATATACTC 8961 ||| || ||| |||| ||||| | ||||| | |||||| ||| |||||||| |||||||||| TGCTGG-AAA TTCTGTTTTG GTGTTATGCA TATGTT-AAT ATGATTATGG GTATATACTC 298 CAAAGGATGA ATACGATAAG GTAGATGCGT TGCGAATTAT AAAACGAGTT ATCACTCGGT 8901 |||||||||| |||| ||||| ||||||| || |||||||||| ||||||| || ||| ||||| CAAAGGATGA ATACAATAAG GTAGATGTGT TGCGAATTAT AAAACGAATT ATCGGTCGGT 358 GTGTCGTTGC TTCGCTGCTA TGGTTGC-CG AGACGGAACT GTTTTGGGGA GGGGGCTGTT 8842 ||||||||| || | | ||| ||||||| | |||||||| ||||| ||| || ||||||| GTGTCGTTGT TTTGTTACTA TGGTTGCTAA AAACGGAACT GTTTT-GGG- GGAGGCTGTT 416 TAATATGATT CTTTGGGTTA TATGTGTTAT TGTTATTACT GTGGATAATT TGGATTGTTG 8782 |||||||||| |||| ||| |||||||| | || |||| | |||||||||| ||| |||||| TAATATGATT TGTTGGATTA TATGTGTTGT TGGTATTGTT GTGGATAATT TGGGTTGTTG 476 TCGGATTGGG ACGAAGTAAG GAAAATAGGG GAGGTGCTGC CGAATTTTCG TTAGATTATT 8722 | |||||||| | ||||||| |||||||||| || ||||||| || ||||||| |||||||||| TTGGATTGGG ATGAAGTAAA GAAAATAGGG GAAGTGCTGC CGGATTTTCG TTAGATTATT 536 AGCTAGCTTA CA-AGAAAGT GAAGCACGAT GTTTATCTAA ATGCGGCACG ATTGTTGCTT 8663 |||||||||| || | ||| |||| ||| ||||||||| ||||||||| |||| ||||| AGCTAGCTTA CATAAGTAGT -AAGCGCGAC ATTTATCTAA TTGCGGCACG ATTGGTGCTT 595 GTTATAGATT AATAGCTTGA GCAGTAAATA TTGGACGTGC GGCTCGATTA TACGGTATGT 8603 |||||||||| ||| ||||| |||||||||| |||||||| | ||||||| || | |||||||| GTTATAGATT TATACCTTGA GCAGTAAATA TTGGACGTAC GGCTCGACTA TTCGGTATGT 655 AACGCTGTCC CTTCTTTCTT TGCTTGGCAT GACTTT 8567 |||||| ||| ||| ||||| || ||||||| ||| || AACGCTATCC TTTCCTTCTT TGTTTGGCAT GACCTT 691 hqPGS_C06HBa0057J04.1-9-_SGN-E328093+ (9259 8567) ******************************************************************************** EST sequence 31 +strand 455 n (File: SGN-E298250+) 1 AAATGGAGAA ACCAACCCTG CAACTCTTGG CCAGTAGCTG CAAATAATTT GGGGAGTAAT 61 CTCCTTGTTT GGTGTGTTAA TTCTTTAGAA CACCCTTGTT AATTATCCAT TAATTTTAAG 121 AAGGGGGCGT GACCAGTTGC TTACGAAGTT TGTTTTAGTT ATTGAATGTG CTAAGTATGA 181 ATGGAAACCA TAATCGGATT ATTAGTGGTG TCGTGTTGGT GCTTGGGCTG TTTTGATTAA 241 AGCAAACTGC AGGAAAATTC TATTTTGGCA TTATGTATAT GCTGAATGTG ATTATGAGTA 301 TATACTCCAA CGGATGAATA CGATTAGGTA AATGTGTTGC GAATTATAAA ACGAGTTATC 361 GCTCGGTGTG TCGGTGCTTC GCTGCTATAG TTGCCCAGAC GGAACTGTTT TGGGGAGGGG 421 GCTGCCTAAT GTGATACTTC GGGTTATATG TGTTA Predicted gene structure (within gDNA segment 9921 to 6898): Exon 1 9267 8813 ( 455 n); cDNA 1 455 ( 455 n); score: 0.947 MATCH C06HBa0057J04.1-9- SGN-E298250+ 0.947 455 1.000 C PGS_C06HBa0057J04.1-9-_SGN-E298250+ (9267 8813) Alignment (genomic DNA sequence = upper lines): AAATGGAGAA ACTAACCCTG CAACTCTTGG CCAGCAGCTG CAAATAATTT GGTTAGTAAT 9208 |||||||||| || ||||||| |||||||||| |||| ||||| |||||||||| || |||||| AAATGGAGAA ACCAACCCTG CAACTCTTGG CCAGTAGCTG CAAATAATTT GGGGAGTAAT 60 ATCCTTGTTT GGTGTGTTAA TTCTTTAGAA TACCCTTGTT AATTATCCAT TAATTTTAAG 9148 ||||||||| |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| CTCCTTGTTT GGTGTGTTAA TTCTTTAGAA CACCCTTGTT AATTATCCAT TAATTTTAAG 120 AAGGGGGCGT GACCAGTAGC TTAGGAAGTT TGTTTTAGTT ATTGAATGTG CTAAGTATGA 9088 |||||||||| ||||||| || ||| |||||| |||||||||| |||||||||| |||||||||| AAGGGGGCGT GACCAGTTGC TTACGAAGTT TGTTTTAGTT ATTGAATGTG CTAAGTATGA 180 ATGGAAACCA TAATCGGATT ATTAGTGGTG TCGTGTTGGT GCTTGGGTTG TTTTGATTAA 9028 |||||||||| |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| ATGGAAACCA TAATCGGATT ATTAGTGGTG TCGTGTTGGT GCTTGGGCTG TTTTGATTAA 240 AGCAAACTGC AGGAAAATTC TTTTTTGGCA TTATGTATAT GTTGAATGTG ATTATGAGTA 8968 |||||||||| |||||||||| | |||||||| |||||||||| | |||||||| |||||||||| AGCAAACTGC AGGAAAATTC TATTTTGGCA TTATGTATAT GCTGAATGTG ATTATGAGTA 300 TATACTCCAA AGGATGAATA CGATAAGGTA GATGCGTTGC GAATTATAAA ACGAGTTATC 8908 |||||||||| ||||||||| |||| ||||| ||| ||||| |||||||||| |||||||||| TATACTCCAA CGGATGAATA CGATTAGGTA AATGTGTTGC GAATTATAAA ACGAGTTATC 360 ACTCGGTGTG TCGTTGCTTC GCTGCTATGG TTGCCGAGAC GGAACTGTTT TGGGGAGGGG 8848 ||||||||| ||| |||||| |||||||| | ||||| |||| |||||||||| |||||||||| GCTCGGTGTG TCGGTGCTTC GCTGCTATAG TTGCCCAGAC GGAACTGTTT TGGGGAGGGG 420 GCTGTTTAAT ATGATTCTTT GGGTTATATG TGTTA 8813 |||| |||| |||| ||| |||||||||| ||||| GCTGCCTAAT GTGATACTTC GGGTTATATG TGTTA 455 hqPGS_C06HBa0057J04.1-9-_SGN-E298250+ (9267 8813) Total number of EST alignments reported: 31 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 9921: PGL 1 (- strand): 9278 2285 AGS-1 (3941 3873,3513 3467,2743 2411,2292 2285) SCR (e 0.848 d 0.900 a 0.889,e 0.936 d 0.990 a 0.000,e 0.905 d 0.000 a 0.973,e 0.750) Exon 1 3941 3873 ( 69 n); score: 0.848 Intron 1 3872 3514 ( 359 n); Pd: 0.900 Pa: 0.889 Exon 2 3513 3467 ( 47 n); score: 0.936 Intron 2 3466 2744 ( 723 n); Pd: 0.990 Pa: 0.000 Exon 3 2743 2411 ( 333 n); score: 0.905 Intron 3 2410 2293 ( 118 n); Pd: 0.000 Pa: 0.973 Exon 4 2292 2285 ( 8 n); score: 0.750 PGS (3941 3873,3513 3467,2743 2411,2292 2285) SGN-E543103- PGS (3941 3873,3513 3467,2743 2411,2292 2285) SGN-E543104+ PGS (2744 2411,2292 2285) SGN-E225616- PGS (2743 2411,2292 2285) SGN-E306317+ PGS (2743 2411) SGN-E303256+ 3-phase translation of AGS-1 (-strand): . . . . . . 3941 GGCAGCCATGGAAATGGAGAAACAAACCCTGCAACTCTTGGCCAGCAGCTGCAAATAATT G S H G N G E T N P A T L G Q Q L Q I I A A M E M E K Q T L Q L L A S S C K - F Q P W K W R N K P C N S W P A A A N N . : . . . . . : 3881 TGGTTAGTA : ATTAATAGCTTGAGCAGTAAATAATGGACGTGCGGCTCAATTATACG : GTGT W L V : I N S L S S K - W T C G S I I R : C G - - : L I A - A V N N G R A A Q L Y : G V L V S : N - - L E Q - I M D V R L N Y T : V . . . . . . 2739 AGACGCGCAGTTCGGTGATCCTCCCGCCTAGGATATCTACTCTGCTGATTGGGAGAGCTC R R A V R - S S R L G Y L L C - L G E L D A Q F G D P P A - D I Y S A D W E S S - T R S S V I L P P R I S T L L I G R A . . . . . . 2679 CACTGTTCCGGAGCCCAGTCGTTTTGGTACATAACTTTTGTGTAGGCTTTTGCTCGTCTA H C S G A Q S F W Y I T F V - A F A R L T V P E P S R F G T - L L C R L L L V Y P L F R S P V V L V H N F C V G F C S S . . . . . . 2619 TGGGTATGGCGGGGCCCTGTCCCGTCGAGTTTCACTAATGTACTCTTAGAGGTCTGTGGA W V W R G P V P S S F T N V L L E V C G G Y G G A L S R R V S L M Y S - R S V D M G M A G P C P V E F H - C T L R G L W . . . . . . 2559 CATTATGTGGGTTGTATATATATGTTTTGGATAATGGTCTGGACATGGTTTGTTTGGGAT H Y V G C I Y M F W I M V W T W F V W D I M W V V Y I C F G - W S G H G L F G M T L C G L Y I Y V L D N G L D M V C L G . . . . . . 2499 GTCCGCTTGTACAGGGGCAGCCTTGTCGGCTGCGTACATCATTATGCTTTGAATAGTGGC V R L Y R G S L V G C V H H Y A L N S G S A C T G A A L S A A Y I I M L - I V A C P L V Q G Q P C R L R T S L C F E - W . . . : . 2439 GGCCTTGTCGGCTCGCGTATGCTGTTATG : GTTTGTAT G L V G S R M L L W : F V A L S A R V C C Y : G L Y R P C R L A Y A V M : V C Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-9-_PGL-1_AGS-1_PPS_1 (2634 2411,2292 2286) (frame '1'; 231 bp, 77 residues) 1 AFARLWVWRG PVPSSFTNVL LEVCGHYVGC IYMFWIMVWT WFVWDVRLYR GSLVGCVHHY 61 ALNSGGLVGS RMLLWFV AGS-2 (3939 3880,3513 3467,2743 2411,2292 2285) SCR (e 0.825 d 0.995 a 0.889,e 0.851 d 0.990 a 0.000,e 0.905 d 0.000 a 0.973,e 0.750) Exon 1 3939 3880 ( 60 n); score: 0.825 Intron 1 3879 3514 ( 366 n); Pd: 0.995 Pa: 0.889 Exon 2 3513 3467 ( 47 n); score: 0.851 Intron 2 3466 2744 ( 723 n); Pd: 0.990 Pa: 0.000 Exon 3 2743 2411 ( 333 n); score: 0.905 Intron 3 2410 2293 ( 118 n); Pd: 0.000 Pa: 0.973 Exon 4 2292 2285 ( 8 n); score: 0.750 PGS (3939 3880,3513 3467,2743 2411,2292 2285) SGN-E374134- PGS (3938 3880,3513 3467,2743 2411,2292 2285) SGN-E305738+ PGS (3938 3880,3513 3467,2743 2411,2292 2285) SGN-E374135+ PGS (3938 3880,3513 3467,2743 2411) SGN-E310669+ 3-phase translation of AGS-2 (-strand): . . . . . . : 3939 CAGCCATGGAAATGGAGAAACAAACCCTGCAACTCTTGGCCAGCAGCTGCAAATAATTTG : Q P W K W R N K P C N S W P A A A N N L : S H G N G E T N P A T L G Q Q L Q I I - : A M E M E K Q T L Q L L A S S C K - F : . . . . . : . 3513 ATTAATAGCTTGAGCAGTAAATAATGGACGTGCGGCTCAATTATACG : GTGTAGACGCGCA I N S L S S K - W T C G S I I R : C R R A L I A - A V N N G R A A Q L Y : G V D A Q D - - L E Q - I M D V R L N Y T : V - T R . . . . . . 2730 GTTCGGTGATCCTCCCGCCTAGGATATCTACTCTGCTGATTGGGAGAGCTCCACTGTTCC V R - S S R L G Y L L C - L G E L H C S F G D P P A - D I Y S A D W E S S T V P S S V I L P P R I S T L L I G R A P L F . . . . . . 2670 GGAGCCCAGTCGTTTTGGTACATAACTTTTGTGTAGGCTTTTGCTCGTCTATGGGTATGG G A Q S F W Y I T F V - A F A R L W V W E P S R F G T - L L C R L L L V Y G Y G R S P V V L V H N F C V G F C S S M G M . . . . . . 2610 CGGGGCCCTGTCCCGTCGAGTTTCACTAATGTACTCTTAGAGGTCTGTGGACATTATGTG R G P V P S S F T N V L L E V C G H Y V G A L S R R V S L M Y S - R S V D I M W A G P C P V E F H - C T L R G L W T L C . . . . . . 2550 GGTTGTATATATATGTTTTGGATAATGGTCTGGACATGGTTTGTTTGGGATGTCCGCTTG G C I Y M F W I M V W T W F V W D V R L V V Y I C F G - W S G H G L F G M S A C G L Y I Y V L D N G L D M V C L G C P L . . . . . . 2490 TACAGGGGCAGCCTTGTCGGCTGCGTACATCATTATGCTTTGAATAGTGGCGGCCTTGTC Y R G S L V G C V H H Y A L N S G G L V T G A A L S A A Y I I M L - I V A A L S V Q G Q P C R L R T S L C F E - W R P C . . : . 2430 GGCTCGCGTATGCTGTTATG : GTTTGTAT G S R M L L W : F V A R V C C Y : G L Y R L A Y A V M : V C Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-9-_PGL-1_AGS-2_PPS_1 (2634 2411,2292 2286) (frame '1'; 231 bp, 77 residues) 1 AFARLWVWRG PVPSSFTNVL LEVCGHYVGC IYMFWIMVWT WFVWDVRLYR GSLVGCVHHY 61 ALNSGGLVGS RMLLWFV AGS-3 (3930 3880,2743 2411,2292 2285) SCR (e 0.794 d 0.995 a 0.000,e 0.902 d 0.000 a 0.973,e 0.750) Exon 1 3930 3880 ( 51 n); score: 0.794 Intron 1 3879 2744 (1136 n); Pd: 0.995 Pa: 0.000 Exon 2 2743 2411 ( 333 n); score: 0.902 Intron 2 2410 2293 ( 118 n); Pd: 0.000 Pa: 0.973 Exon 3 2292 2285 ( 8 n); score: 0.750 PGS (3930 3880,2743 2411,2292 2285) SGN-E303695+ 3-phase translation of AGS-3 (-strand): . . . . . . : 3930 AAATGGAGAAACAAACCCTGCAACTCTTGGCCAGCAGCTGCAAATAATTTG : GTGTAGACG K W R N K P C N S W P A A A N N L : V - T N G E T N P A T L G Q Q L Q I I W : C R R M E K Q T L Q L L A S S C K - F : G V D . . . . . . 2734 CGCAGTTCGGTGATCCTCCCGCCTAGGATATCTACTCTGCTGATTGGGAGAGCTCCACTG R S S V I L P P R I S T L L I G R A P L A V R - S S R L G Y L L C - L G E L H C A Q F G D P P A - D I Y S A D W E S S T . . . . . . 2674 TTCCGGAGCCCAGTCGTTTTGGTACATAACTTTTGTGTAGGCTTTTGCTCGTCTATGGGT F R S P V V L V H N F C V G F C S S M G S G A Q S F W Y I T F V - A F A R L W V V P E P S R F G T - L L C R L L L V Y G . . . . . . 2614 ATGGCGGGGCCCTGTCCCGTCGAGTTTCACTAATGTACTCTTAGAGGTCTGTGGACATTA M A G P C P V E F H - C T L R G L W T L W R G P V P S S F T N V L L E V C G H Y Y G G A L S R R V S L M Y S - R S V D I . . . . . . 2554 TGTGGGTTGTATATATATGTTTTGGATAATGGTCTGGACATGGTTTGTTTGGGATGTCCG C G L Y I Y V L D N G L D M V C L G C P V G C I Y M F W I M V W T W F V W D V R M W V V Y I C F G - W S G H G L F G M S . . . . . . 2494 CTTGTACAGGGGCAGCCTTGTCGGCTGCGTACATCATTATGCTTTGAATAGTGGCGGCCT L V Q G Q P C R L R T S L C F E - W R P L Y R G S L V G C V H H Y A L N S G G L A C T G A A L S A A Y I I M L - I V A A . . . : . 2434 TGTCGGCTCGCGTATGCTGTTATG : GTTTGTAT C R L A Y A V M : V C V G S R M L L W : F V L S A R V C C Y : G L Y Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-9-_PGL-1_AGS-3_PPS_1 (2634 2411,2292 2286) (frame '2'; 231 bp, 77 residues) 1 AFARLWVWRG PVPSSFTNVL LEVCGHYVGC IYMFWIMVWT WFVWDVRLYR GSLVGCVHHY 61 ALNSGGLVGS RMLLWFV AGS-4 (8255 8088,3050 3004,2958 2343) SCR (e 0.958 d 0.000 a 0.000,e 0.851 d 0.000 a 0.000,e 0.908) Exon 1 8255 8088 ( 168 n); score: 0.958 Intron 1 8087 3051 (5037 n); Pd: 0.000 Pa: 0.000 Exon 2 3050 3004 ( 47 n); score: 0.851 Intron 2 3003 2959 ( 45 n); Pd: 0.000 Pa: 0.000 Exon 3 2958 2343 ( 616 n); score: 0.908 PGS (2470 2343) SGN-E538150- PGS (8255 8088,3050 3004,2958 2411) SGN-E544254- 3-phase translation of AGS-4 (-strand): . . . . . . 8255 GAGTCATTTATCATTTCACCGAGTCCCGGGCAGGGTAATGTTCATGCGGAGTTTCTTGCA E S F I I S P S P G Q G N V H A E F L A S H L S F H R V P G R V M F M R S F L H V I Y H F T E S R A G - C S C G V S C . . . . . . 8195 TATGTCACCGAGTTCCTCACTAGAGGGCCGGGTATGTATATTATATGTATGATTGGTGAT Y V T E F L T R G P G M Y I I C M I G D M S P S S S L E G R V C I L Y V - L V M I C H R V P H - R A G Y V Y Y M Y D W - . . . . . : . 8135 GAGGATGGTTATGATGATGATGATGACGGAGATGACGTGATGATTATT : TCACCGAGTTCC E D G Y D D D D D G D D V M I I : S P S S R M V M M M M M T E M T - - L F : H R V P - G W L - - - - - R R - R D D Y : F T E F . . . . : . . 3038 TCACTAGAGGGCCGGGTATGTATATTATATATATG : GTGATGATTATTTTGCCGAGCCCTT S L E G R V C I L Y I W : - - L F C R A L H - R A G Y V Y Y I Y : G D D Y F A E P F L T R G P G M Y I I Y M : V M I I L P S P . . . . . . 2933 TACTAGGGAAGCTGGGCACCTTAAATGTTAAATATATGCATGATTTTCACTTAAAAAGTA Y - G S W A P - M L N I C M I F T - K V T R E A G H L K C - I Y A - F S L K K Y L L G K L G T L N V K Y M H D F H L K S . . . . . . 2873 TATGTGTAGCGATATTTTGTTTCGACTTGCCACATTGGTATCCTGTCATCTTTACCTTAT Y V - R Y F V S T C H I G I L S S L P Y M C S D I L F R L A T L V S C H L Y L M I C V A I F C F D L P H W Y P V I F T L . . . . . . 2813 GCTTTACATACTCAGTACATTGTCCGTACTGACCCCCCTTTCCTCGGGGGGCTGCGTTTC A L H T Q Y I V R T D P P F L G G L R F L Y I L S T L S V L T P L S S G G C V S C F T Y S V H C P Y - P P F P R G A A F . . . . . . 2753 ATGCCTGCAGGTGTAGACGCGCAGTTCGGTGATCCTCCCGCCTAGGATATCTACTCTGCT M P A G V D A Q F G D P P A - D I Y S A C L Q V - T R S S V I L P P R I S T L L H A C R C R R A V R - S S R L G Y L L C . . . . . . 2693 GATTGGGAGAGCTCCACTGTTCCGGAGCCCAGTCGTTTTGGTACATAACTTTTGTGTAGG D W E S S T V P E P S R F G T - L L C R I G R A P L F R S P V V L V H N F C V G - L G E L H C S G A Q S F W Y I T F V - . . . . . . 2633 CTTTTGCTCGTCTATGGGTATGGCGGGGCCCTGTCCCGTCGAGTTTCACTAATGTACTCT L L L V Y G Y G G A L S R R V S L M Y S F C S S M G M A G P C P V E F H - C T L A F A R L W V W R G P V P S S F T N V L . . . . . . 2573 TAGAGGTCTGTGGACATTATGTGGGTTGTATATATATGTTTTGGATAATGGTCTGGACAT - R S V D I M W V V Y I C F G - W S G H R G L W T L C G L Y I Y V L D N G L D M L E V C G H Y V G C I Y M F W I M V W T . . . . . . 2513 GGTTTGTTTGGGATGTCCGCTTGTACAGGGGCAGCCTTGTCGGCTGCGTACATCATTATG G L F G M S A C T G A A L S A A Y I I M V C L G C P L V Q G Q P C R L R T S L C W F V W D V R L Y R G S L V G C V H H Y . . . . . . 2453 CTTTGAATAGTGGCGGCCTTGTCGGCTCGCGTATGCTGTTATGGTTGAATGGTTATGACT L - I V A A L S A R V C C Y G - M V M T F E - W R P C R L A Y A V M V E W L - L A L N S G G L V G S R M L L W L N G Y D . . . . . . 2393 CCTTATGAGACAGGTCCTCTTATATATATATATGACGTTGGGGTTGGCTTG P Y E T G P L I Y I Y D V G V G L L M R Q V L L Y I Y M T L G L A S L - D R S S Y I Y I - R W G W L Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-9-_PGL-1_AGS-4_PPS_1 (2634 2386) (frame '0'; 246 bp, 82 residues) 1 AFARLWVWRG PVPSSFTNVL LEVCGHYVGC IYMFWIMVWT WFVWDVRLYR GSLVGCVHHY 61 ALNSGGLVGS RMLLWLNGYD SL- >C06HBa0057J04.1-9-_PGL-1_AGS-4_PPS_2 (8100 8088,3050 3004,2958 2782) (frame '0'; 234 bp, 78 residues) 1 RDDYFTEFLT RGPGMYIIYM VMIILPSPLL GKLGTLNVKY MHDFHLKSIC VAIFCFDLPH 61 WYPVIFTLCF TYSVHCPY- AGS-5 (3922 3880,3113 2919,2743 2411) SCR (e 0.814 d 0.995 a 0.975,e 0.928 d 0.000 a 0.000,e 0.899) Exon 1 3922 3880 ( 43 n); score: 0.814 Intron 1 3879 3114 ( 766 n); Pd: 0.995 Pa: 0.975 Exon 2 3113 2919 ( 195 n); score: 0.928 Intron 2 2918 2744 ( 175 n); Pd: 0.000 Pa: 0.000 Exon 3 2743 2411 ( 333 n); score: 0.899 PGS (3922 3880,3113 2919,2743 2411) SGN-E538151+ PGS (3922 3880,3113 2919,2743 2411) SGN-E538156+ PGS (3922 3880,3113 2919,2743 2513) SGN-E268096+ 3-phase translation of AGS-5 (-strand): . . . . . : . 3922 AAACAAACCCTGCAACTCTTGGCCAGCAGCTGCAAATAATTTG : AGTCATTTATCATTTCA K Q T L Q L L A S S C K - F : E S F I I S N K P C N S W P A A A N N L : S H L S F H T N P A T L G Q Q L Q I I - : V I Y H F . . . . . . 3096 CCGAGTCCCGGGCCGGGTAATGTTCGTGCGGAGTTTCTTGCATATGTCACCGAGTTCCTC P S P G P G N V R A E F L A Y V T E F L R V P G R V M F V R S F L H M S P S S S T E S R A G - C S C G V S C I C H R V P . . . . . . 3036 ACTAGAGGGCCGGGTATGTATATTATATATATGATTGGTGATGAGGATGGTTATGATGAT T R G P G M Y I I Y M I G D E D G Y D D L E G R V C I L Y I - L V M R M V M M M H - R A G Y V Y Y I Y D W - - G W L - - . . . . . . : 2976 GATGATGACGGAGATGACGTGATGATTATTTTGCCGAGCCCTTTACTAGGGAAGCTGG : GT D D D G D D V M I I L P S P L L G K L : G M M T E M T - - L F C R A L Y - G S W : V - - - R R - R D D Y F A E P F T R E A G : . . . . . . 2741 GTAGACGCGCAGTTCGGTGATCCTCCCGCCTAGGATATCTACTCTGCTGATTGGGAGAGC V D A Q F G D P P A - D I Y S A D W E S - T R S S V I L P P R I S T L L I G R A C R R A V R - S S R L G Y L L C - L G E . . . . . . 2681 TCCACTGTTCCGGAGCCCAGTCGTTTTGGTACATAACTTTTGTGTAGGCTTTTGCTCGTC S T V P E P S R F G T - L L C R L L L V P L F R S P V V L V H N F C V G F C S S L H C S G A Q S F W Y I T F V - A F A R . . . . . . 2621 TATGGGTATGGCGGGGCCCTGTCCCGTCGAGTTTCACTAATGTACTCTTAGAGGTCTGTG Y G Y G G A L S R R V S L M Y S - R S V M G M A G P C P V E F H - C T L R G L W L W V W R G P V P S S F T N V L L E V C . . . . . . 2561 GACATTATGTGGGTTGTATATATATGTTTTGGATAATGGTCTGGACATGGTTTGTTTGGG D I M W V V Y I C F G - W S G H G L F G T L C G L Y I Y V L D N G L D M V C L G G H Y V G C I Y M F W I M V W T W F V W . . . . . . 2501 ATGTCCGCTTGTACAGGGGCAGCCTTGTCGGCTGCGTACATCATTATGCTTTGAATAGTG M S A C T G A A L S A A Y I I M L - I V C P L V Q G Q P C R L R T S L C F E - W D V R L Y R G S L V G C V H H Y A L N S . . . . 2441 GCGGCCTTGTCGGCTCGCGTATGCTGTTATG A A L S A R V C C Y R P C R L A Y A V M G G L V G S R M L L Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-9-_PGL-1_AGS-5_PPS_1 (3883 3880,3113 2919,2743 2709) (frame '1'; 231 bp, 77 residues) 1 FESFIISPSP GPGNVRAEFL AYVTEFLTRG PGMYIIYMIG DEDGYDDDDD GDDVMIILPS 61 PLLGKLGVDA QFGDPPA- >C06HBa0057J04.1-9-_PGL-1_AGS-5_PPS_2 (2634 2413) (frame '0'; 222 bp, 74 residues) 1 AFARLWVWRG PVPSSFTNVL LEVCGHYVGC IYMFWIMVWT WFVWDVRLYR GSLVGCVHHY 61 ALNSGGLVGS RMLL AGS-6 (3930 3698) SCR (e 0.966) Exon 1 3930 3698 ( 233 n); score: 0.966 PGS (3930 3698) SGN-E298250+ 3-phase translation of AGS-6 (-strand): . . . . . . 3930 AAATGGAGAAACAAACCCTGCAACTCTTGGCCAGCAGCTGCAAATAATTTGGTTAGTAAT K W R N K P C N S W P A A A N N L V S N N G E T N P A T L G Q Q L Q I I W L V I M E K Q T L Q L L A S S C K - F G - - . . . . . . 3870 CTCCTTGTTTGGTGTGTTAATTCTTTAGAATACCCTTGTTAATTATCCATTAATTTTAAG L L V W C V N S L E Y P C - L S I N F K S L F G V L I L - N T L V N Y P L I L R S P C L V C - F F R I P L L I I H - F - . . . . . . 3810 AAGGGGGCGTGACCAGTAGCTTAGGAAGTTTGTTTTAGTTATTGAATGTGCTAAGTATGA K G A - P V A - E V C F S Y - M C - V - R G R D Q - L R K F V L V I E C A K Y E E G G V T S S L G S L F - L L N V L S M . . . . . . 3750 ATGGAAACCATAATCGGATTATTAGTGGTGTCATGTTGGTGCTTGGGCTGTTT M E T I I G L L V V S C W C L G C W K P - S D Y - W C H V G A W A V N G N H N R I I S G V M L V L G L F Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-6 (+strand): . . . . . . 3698 AAACAGCCCAAGCACCAACATGACACCACTAATAATCCGATTATGGTTTCCATTCATACT K Q P K H Q H D T T N N P I M V S I H T N S P S T N M T P L I I R L W F P F I L T A Q A P T - H H - - S D Y G F H S Y . . . . . . 3758 TAGCACATTCAATAACTAAAACAAACTTCCTAAGCTACTGGTCACGCCCCCTTCTTAAAA - H I Q - L K Q T S - A T G H A P F L K S T F N N - N K L P K L L V T P P S - N L A H S I T K T N F L S Y W S R P L L K . . . . . . 3818 TTAATGGATAATTAACAAGGGTATTCTAAAGAATTAACACACCAAACAAGGAGATTACTA L M D N - Q G Y S K E L T H Q T R R L L - W I I N K G I L K N - H T K Q G D Y - I N G - L T R V F - R I N T P N K E I T . . . . . . 3878 ACCAAATTATTTGCAGCTGCTGGCCAAGAGTTGCAGGGTTTGTTTCTCCATTT T K L F A A A G Q E L Q G L F L H P N Y L Q L L A K S C R V C F S I N Q I I C S C W P R V A G F V S P F Maximal non-overlapping open reading frames (>= 64 codons): none AGS-7 (9278 9217,8955 8951,8655 8609,7883 7429) SCR (e 0.879 d 0.991 a 0.933,e 0.400 d 0.000 a 0.880,e 0.979 d 0.991 a 0.000,e 0.890) Exon 1 9278 9217 ( 62 n); score: 0.879 Intron 1 9216 8956 ( 261 n); Pd: 0.991 Pa: 0.933 Exon 2 8955 8951 ( 5 n); score: 0.400 Intron 2 8950 8656 ( 295 n); Pd: 0.000 Pa: 0.880 Exon 3 8655 8609 ( 47 n); score: 0.979 Intron 3 8608 7884 ( 725 n); Pd: 0.991 Pa: 0.000 Exon 4 7883 7429 ( 455 n); score: 0.890 PGS (9278 9217,8955 8951,8655 8609,7883 7429) SGN-E543104+ PGS (9278 9217,8955 8951,8655 8609,7883 7462) SGN-E543103- PGS (7883 7462) SGN-E306317+ PGS (7883 7462) SGN-E303695+ PGS (7884 7483) SGN-E225616- PGS (7883 7483) SGN-E303256+ 3-phase translation of AGS-7 (-strand): . . . . . . 9278 GGCAGCAATGGAAATGGAGAAACTAACCCTGCAACTCTTGGCCAGCAGCTGCAAATAATT G S N G N G E T N P A T L G Q Q L Q I I A A M E M E K L T L Q L L A S S C K - F Q Q W K W R N - P C N S W P A A A N N . : : . . . . . : 9218 TG : GATGA : ATTAATAGCTTGAGCAGTAAATATTGGACGTGCGGCTCGATTATACG : GTGTAG W : M : N - - L E Q - I L D V R L D Y T : V - : G - : I N S L S S K Y W T C G S I I R : C R L : D E : L I A - A V N I G R A A R L Y : G V . . . . . . 7877 ACGCACAGTTTGGTGATCCTCCCGCCTAGGATATCTACTCTGATGATTGGGAGAGCTCCA T H S L V I L P P R I S T L M I G R A P R T V W - S S R L G Y L L - - L G E L H D A Q F G D P P A - D I Y S D D W E S S . . . . . . 7817 CTGTTCCGGAGCCTAGTCGTTTTGGTACATAACTTTTGTGTAGTCTTTTGCTCGTCTATG L F R S L V V L V H N F C V V F C S S M C S G A - S F W Y I T F V - S F A R L W T V P E P S R F G T - L L C S L L L V Y . . . . . . 7757 GGTATGGCGGGTCCCTGTCCCGTCGAGTTTCACTAATGTACTCTTAGAGGTCTGTGGACA G M A G P C P V E F H - C T L R G L W T V W R V P V P S S F T N V L L E V C G H G Y G G S L S R R V S L M Y S - R S V D . . . . . . 7697 TTATGTGGGTTGTATATATATTTTTTGGATAATGGTCTGGACATGGTTTGTTTGGGATGT L C G L Y I Y F L D N G L D M V C L G C Y V G C I Y I F W I M V W T W F V W D V I M W V V Y I F F G - W S G H G L F G M . . . . . . 7637 CCGCTTGTACAGGGGCAGCCTTGTCGGCTGCGTACATCATTGTGTATTGTGTAGTGGCAG P L V Q G Q P C R L R T S L C I V - W Q R L Y R G S L V G C V H H C V L C S G S S A C T G A A L S A A Y I I V Y C V V A . . . . . . 7577 CCTTGTCGGCATACGTATGTTATTATGCTTTGAATAGTGGCGGCCTTGTCGGCTCGCGTA P C R H T Y V I M L - I V A A L S A R V L V G I R M L L C F E - W R P C R L A Y A L S A Y V C Y Y A L N S G G L V G S R . . . . . . 7517 TGTTGTTATGGTTGAATGATTATGACTCCTTATGAGACAGGTCCTCTTATATATATATGA C C Y G - M I M T P Y E T G P L I Y I - V V M V E - L - L L M R Q V L L Y I Y D M L L W L N D Y D S L - D R S S Y I Y M . . . 7457 CGTTGGGGTTGGCTTGATTTGATTAAATT R W G W L D L I K V G V G L I - L N T L G L A - F D - I Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-9-_PGL-1_AGS-7_PPS_1 (7774 7541) (frame '2'; 231 bp, 77 residues) 1 SFARLWVWRV PVPSSFTNVL LEVCGHYVGC IYIFWIMVWT WFVWDVRLYR GSLVGCVHHC 61 VLCSGSLVGI RMLLCFE- AGS-8 (9276 9217,8655 8609,7883 7444) SCR (e 0.808 d 0.991 a 0.880,e 0.894 d 0.991 a 0.000,e 0.897) Exon 1 9276 9217 ( 60 n); score: 0.808 Intron 1 9216 8656 ( 561 n); Pd: 0.991 Pa: 0.880 Exon 2 8655 8609 ( 47 n); score: 0.894 Intron 2 8608 7884 ( 725 n); Pd: 0.991 Pa: 0.000 Exon 3 7883 7444 ( 440 n); score: 0.897 PGS (9275 9217,8655 8609,7883 7444) SGN-E305738+ PGS (9276 9217,8655 8609,7883 7461) SGN-E374134- PGS (9275 9217,8655 8609,7883 7461) SGN-E374135+ PGS (9275 9217,8655 8609,7883 7483) SGN-E310669+ 3-phase translation of AGS-8 (-strand): . . . . . . : 9276 CAGCAATGGAAATGGAGAAACTAACCCTGCAACTCTTGGCCAGCAGCTGCAAATAATTTG : Q Q W K W R N - P C N S W P A A A N N L : S N G N G E T N P A T L G Q Q L Q I I - : A M E M E K L T L Q L L A S S C K - F : . . . . . : . 8655 ATTAATAGCTTGAGCAGTAAATATTGGACGTGCGGCTCGATTATACG : GTGTAGACGCACA I N S L S S K Y W T C G S I I R : C R R T L I A - A V N I G R A A R L Y : G V D A Q D - - L E Q - I L D V R L D Y T : V - T H . . . . . . 7870 GTTTGGTGATCCTCCCGCCTAGGATATCTACTCTGATGATTGGGAGAGCTCCACTGTTCC V W - S S R L G Y L L - - L G E L H C S F G D P P A - D I Y S D D W E S S T V P S L V I L P P R I S T L M I G R A P L F . . . . . . 7810 GGAGCCTAGTCGTTTTGGTACATAACTTTTGTGTAGTCTTTTGCTCGTCTATGGGTATGG G A - S F W Y I T F V - S F A R L W V W E P S R F G T - L L C S L L L V Y G Y G R S L V V L V H N F C V V F C S S M G M . . . . . . 7750 CGGGTCCCTGTCCCGTCGAGTTTCACTAATGTACTCTTAGAGGTCTGTGGACATTATGTG R V P V P S S F T N V L L E V C G H Y V G S L S R R V S L M Y S - R S V D I M W A G P C P V E F H - C T L R G L W T L C . . . . . . 7690 GGTTGTATATATATTTTTTGGATAATGGTCTGGACATGGTTTGTTTGGGATGTCCGCTTG G C I Y I F W I M V W T W F V W D V R L V V Y I F F G - W S G H G L F G M S A C G L Y I Y F L D N G L D M V C L G C P L . . . . . . 7630 TACAGGGGCAGCCTTGTCGGCTGCGTACATCATTGTGTATTGTGTAGTGGCAGCCTTGTC Y R G S L V G C V H H C V L C S G S L V T G A A L S A A Y I I V Y C V V A A L S V Q G Q P C R L R T S L C I V - W Q P C . . . . . . 7570 GGCATACGTATGTTATTATGCTTTGAATAGTGGCGGCCTTGTCGGCTCGCGTATGTTGTT G I R M L L C F E - W R P C R L A Y V V A Y V C Y Y A L N S G G L V G S R M L L R H T Y V I M L - I V A A L S A R V C C . . . . . . 7510 ATGGTTGAATGATTATGACTCCTTATGAGACAGGTCCTCTTATATATATATGACGTTGGG M V E - L - L L M R Q V L L Y I Y D V G W L N D Y D S L - D R S S Y I Y M T L G Y G - M I M T P Y E T G P L I Y I - R W . 7450 GTTGGCT V G L A G W Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-9-_PGL-1_AGS-8_PPS_1 (7774 7541) (frame '1'; 231 bp, 77 residues) 1 SFARLWVWRV PVPSSFTNVL LEVCGHYVGC IYIFWIMVWT WFVWDVRLYR GSLVGCVHHC 61 VLCSGSLVGI RMLLCFE- AGS-9 (9259 9217,8254 8060,7883 7481) SCR (e 0.814 d 0.991 a 0.959,e 0.908 d 0.000 a 0.000,e 0.907) Exon 1 9259 9217 ( 43 n); score: 0.814 Intron 1 9216 8255 ( 962 n); Pd: 0.991 Pa: 0.959 Exon 2 8254 8060 ( 195 n); score: 0.908 Intron 2 8059 7884 ( 176 n); Pd: 0.000 Pa: 0.000 Exon 3 7883 7481 ( 403 n); score: 0.907 PGS (9259 9217,8254 8060,7883 7481) SGN-E538156+ PGS (9259 9217,8254 8060,7883 7518) SGN-E538151+ PGS (9259 9217,8254 8060,7883 7653) SGN-E268096+ 3-phase translation of AGS-9 (-strand): . . . . . : . 9259 AAACTAACCCTGCAACTCTTGGCCAGCAGCTGCAAATAATTTG : AGTCATTTATCATTTCA K L T L Q L L A S S C K - F : E S F I I S N - P C N S W P A A A N N L : S H L S F H T N P A T L G Q Q L Q I I - : V I Y H F . . . . . . 8237 CCGAGTCCCGGGCAGGGTAATGTTCATGCGGAGTTTCTTGCATATGTCACCGAGTTCCTC P S P G Q G N V H A E F L A Y V T E F L R V P G R V M F M R S F L H M S P S S S T E S R A G - C S C G V S C I C H R V P . . . . . . 8177 ACTAGAGGGCCGGGTATGTATATTATATGTATGATTGGTGATGAGGATGGTTATGATGAT T R G P G M Y I I C M I G D E D G Y D D L E G R V C I L Y V - L V M R M V M M M H - R A G Y V Y Y M Y D W - - G W L - - . . . . . . : 8117 GATGATGACGGAGATGACGTGATGATTATTTTGCCGAGCCCCTTATTAGGGAAGTTGG : GT D D D G D D V M I I L P S P L L G K L : G M M T E M T - - L F C R A P Y - G S W : V - - - R R - R D D Y F A E P L I R E V G : . . . . . . 7881 GTAGACGCACAGTTTGGTGATCCTCCCGCCTAGGATATCTACTCTGATGATTGGGAGAGC V D A Q F G D P P A - D I Y S D D W E S - T H S L V I L P P R I S T L M I G R A C R R T V W - S S R L G Y L L - - L G E . . . . . . 7821 TCCACTGTTCCGGAGCCTAGTCGTTTTGGTACATAACTTTTGTGTAGTCTTTTGCTCGTC S T V P E P S R F G T - L L C S L L L V P L F R S L V V L V H N F C V V F C S S L H C S G A - S F W Y I T F V - S F A R . . . . . . 7761 TATGGGTATGGCGGGTCCCTGTCCCGTCGAGTTTCACTAATGTACTCTTAGAGGTCTGTG Y G Y G G S L S R R V S L M Y S - R S V M G M A G P C P V E F H - C T L R G L W L W V W R V P V P S S F T N V L L E V C . . . . . . 7701 GACATTATGTGGGTTGTATATATATTTTTTGGATAATGGTCTGGACATGGTTTGTTTGGG D I M W V V Y I F F G - W S G H G L F G T L C G L Y I Y F L D N G L D M V C L G G H Y V G C I Y I F W I M V W T W F V W . . . . . . 7641 ATGTCCGCTTGTACAGGGGCAGCCTTGTCGGCTGCGTACATCATTGTGTATTGTGTAGTG M S A C T G A A L S A A Y I I V Y C V V C P L V Q G Q P C R L R T S L C I V - W D V R L Y R G S L V G C V H H C V L C S . . . . . . 7581 GCAGCCTTGTCGGCATACGTATGTTATTATGCTTTGAATAGTGGCGGCCTTGTCGGCTCG A A L S A Y V C Y Y A L N S G G L V G S Q P C R H T Y V I M L - I V A A L S A R G S L V G I R M L L C F E - W R P C R L . . . . . 7521 CGTATGTTGTTATGGTTGAATGATTATGACTCCTTATGAGA R M L L W L N D Y D S L - V C C Y G - M I M T P Y E A Y V V M V E - L - L L M R Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-9-_PGL-1_AGS-9_PPS_2 (9220 9217,8254 8060,7883 7849) (frame '1'; 231 bp, 77 residues) 1 FESFIISPSP GQGNVHAEFL AYVTEFLTRG PGMYIICMIG DEDGYDDDDD GDDVMIILPS 61 PLLGKLGVDA QFGDPPA- >C06HBa0057J04.1-9-_PGL-1_AGS-9_PPS_1 (7774 7541) (frame '0'; 231 bp, 77 residues) 1 SFARLWVWRV PVPSSFTNVL LEVCGHYVGC IYIFWIMVWT WFVWDVRLYR GSLVGCVHHC 61 VLCSGSLVGI RMLLCFE- AGS-10 (9267 8567) SCR (e 0.855) Exon 1 9267 8567 ( 701 n); score: 0.855 PGS (9259 8567) SGN-E328093+ PGS (9267 8813) SGN-E298250+ 3-phase translation of AGS-10 (-strand): . . . . . . 9267 AAATGGAGAAACTAACCCTGCAACTCTTGGCCAGCAGCTGCAAATAATTTGGTTAGTAAT K W R N - P C N S W P A A A N N L V S N N G E T N P A T L G Q Q L Q I I W L V I M E K L T L Q L L A S S C K - F G - - . . . . . . 9207 ATCCTTGTTTGGTGTGTTAATTCTTTAGAATACCCTTGTTAATTATCCATTAATTTTAAG I L V W C V N S L E Y P C - L S I N F K S L F G V L I L - N T L V N Y P L I L R Y P C L V C - F F R I P L L I I H - F - . . . . . . 9147 AAGGGGGCGTGACCAGTAGCTTAGGAAGTTTGTTTTAGTTATTGAATGTGCTAAGTATGA K G A - P V A - E V C F S Y - M C - V - R G R D Q - L R K F V L V I E C A K Y E E G G V T S S L G S L F - L L N V L S M . . . . . . 9087 ATGGAAACCATAATCGGATTATTAGTGGTGTCGTGTTGGTGCTTGGGTTGTTTTGATTAA M E T I I G L L V V S C W C L G C F D - W K P - S D Y - W C R V G A W V V L I K N G N H N R I I S G V V L V L G L F - L . . . . . . 9027 AGCAAACTGCAGGAAAATTCTTTTTTGGCATTATGTATATGTTGAATGTGATTATGAGTA S K L Q E N S F L A L C I C - M - L - V A N C R K I L F W H Y V Y V E C D Y E Y K Q T A G K F F F G I M Y M L N V I M S . . . . . . 8967 TATACTCCAAAGGATGAATACGATAAGGTAGATGCGTTGCGAATTATAAAACGAGTTATC Y T P K D E Y D K V D A L R I I K R V I I L Q R M N T I R - M R C E L - N E L S I Y S K G - I R - G R C V A N Y K T S Y . . . . . . 8907 ACTCGGTGTGTCGTTGCTTCGCTGCTATGGTTGCCGAGACGGAACTGTTTTGGGGAGGGG T R C V V A S L L W L P R R N C F G E G L G V S L L R C Y G C R D G T V L G R G H S V C R C F A A M V A E T E L F W G G . . . . . . 8847 GCTGTTTAATATGATTCTTTGGGTTATATGTGTTATTGTTATTACTGTGGATAATTTGGA A V - Y D S L G Y M C Y C Y Y C G - F G L F N M I L W V I C V I V I T V D N L D G C L I - F F G L Y V L L L L L W I I W . . . . . . 8787 TTGTTGTCGGATTGGGACGAAGTAAGGAAAATAGGGGAGGTGCTGCCGAATTTTCGTTAG L L S D W D E V R K I G E V L P N F R - C C R I G T K - G K - G R C C R I F V R I V V G L G R S K E N R G G A A E F S L . . . . . . 8727 ATTATTAGCTAGCTTACAAGAAAGTGAAGCACGATGTTTATCTAAATGCGGCACGATTGT I I S - L T R K - S T M F I - M R H D C L L A S L Q E S E A R C L S K C G T I V D Y - L A Y K K V K H D V Y L N A A R L . . . . . . 8667 TGCTTGTTATAGATTAATAGCTTGAGCAGTAAATATTGGACGTGCGGCTCGATTATACGG C L L - I N S L S S K Y W T C G S I I R A C Y R L I A - A V N I G R A A R L Y G L L V I D - - L E Q - I L D V R L D Y T . . . . . 8607 TATGTAACGCTGTCCCTTCTTTCTTTGCTTGGCATGACTTT Y V T L S L L S L L G M T M - R C P F F L C L A - L V C N A V P S F F A W H D F Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-10 (+strand): . . . . . . 8567 AAAGTCATGCCAAGCAAAGAAAGAAGGGACAGCGTTACATACCGTATAATCGAGCCGCAC K V M P S K E R R D S V T Y R I I E P H K S C Q A K K E G T A L H T V - S S R T S H A K Q R K K G Q R Y I P Y N R A A . . . . . . 8627 GTCCAATATTTACTGCTCAAGCTATTAATCTATAACAAGCAACAATCGTGCCGCATTTAG V Q Y L L L K L L I Y N K Q Q S C R I - S N I Y C S S Y - S I T S N N R A A F R R P I F T A Q A I N L - Q A T I V P H L . . . . . . 8687 ATAAACATCGTGCTTCACTTTCTTGTAAGCTAGCTAATAATCTAACGAAAATTCGGCAGC I N I V L H F L V S - L I I - R K F G S - T S C F T F L - A S - - S N E N S A A D K H R A S L S C K L A N N L T K I R Q . . . . . . 8747 ACCTCCCCTATTTTCCTTACTTCGTCCCAATCCGACAACAATCCAAATTATCCACAGTAA T S P I F L T S S Q S D N N P N Y P Q - P P L F S L L R P N P T T I Q I I H S N H L P Y F P Y F V P I R Q Q S K L S T V . . . . . . 8807 TAACAATAACACATATAACCCAAAGAATCATATTAAACAGCCCCCTCCCCAAAACAGTTC - Q - H I - P K E S Y - T A P S P K Q F N N N T Y N P K N H I K Q P P P Q N S S I T I T H I T Q R I I L N S P L P K T V . . . . . . 8867 CGTCTCGGCAACCATAGCAGCGAAGCAACGACACACCGAGTGATAACTCGTTTTATAATT R L G N H S S E A T T H R V I T R F I I V S A T I A A K Q R H T E - - L V L - F P S R Q P - Q R S N D T P S D N S F Y N . . . . . . 8927 CGCAACGCATCTACCTTATCGTATTCATCCTTTGGAGTATATACTCATAATCACATTCAA R N A S T L S Y S S F G V Y T H N H I Q A T H L P Y R I H P L E Y I L I I T F N S Q R I Y L I V F I L W S I Y S - S H S . . . . . . 8987 CATATACATAATGCCAAAAAAGAATTTTCCTGCAGTTTGCTTTAATCAAAACAACCCAAG H I H N A K K E F S C S L L - S K Q P K I Y I M P K K N F P A V C F N Q N N P S T Y T - C Q K R I F L Q F A L I K T T Q . . . . . . 9047 CACCAACACGACACCACTAATAATCCGATTATGGTTTCCATTCATACTTAGCACATTCAA H Q H D T T N N P I M V S I H T - H I Q T N T T P L I I R L W F P F I L S T F N A P T R H H - - S D Y G F H S Y L A H S . . . . . . 9107 TAACTAAAACAAACTTCCTAAGCTACTGGTCACGCCCCCTTCTTAAAATTAATGGATAAT - L K Q T S - A T G H A P F L K L M D N N - N K L P K L L V T P P S - N - W I I I T K T N F L S Y W S R P L L K I N G - . . . . . . 9167 TAACAAGGGTATTCTAAAGAATTAACACACCAAACAAGGATATTACTAACCAAATTATTT - Q G Y S K E L T H Q T R I L L T K L F N K G I L K N - H T K Q G Y Y - P N Y L L T R V F - R I N T P N K D I T N Q I I . . . . . 9227 GCAGCTGCTGGCCAAGAGTTGCAGGGTTAGTTTCTCCATTT A A A G Q E L Q G - F L H Q L L A K S C R V S F S I C S C W P R V A G L V S P F Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-9+_PGL-1_AGS-10_PPS_1 (8662 8883) (frame '0'; 219 bp, 73 residues) 1 QATIVPHLDK HRASLSCKLA NNLTKIRQHL PYFPYFVPIR QQSKLSTVIT ITHITQRIIL 61 NSPLPKTVPS RQP- ... finished at: Mon Jul 24 23:15:09 2006 ________________________________________________________________________________ Sequence 10: C06HBa0057J04.1-10, from 1 to 8203, both strands analyzed. ... started at: Mon Jul 24 23:15:09 2006 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 3 ... matches indexed, elapsed seconds = 3 HitsTableSize = 0 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 4 HitsTableSize = 0 No significant EST matches were found. ... finished at: Mon Jul 24 23:15:17 2006 ________________________________________________________________________________ Sequence 11: C06HBa0057J04.1-11, from 1 to 2963, both strands analyzed. ... started at: Mon Jul 24 23:15:17 2006 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 4 HitsTableSize = 3 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 5 HitsTableSize = 1 ******************************************************************************** EST sequence 2 -strand 730 n (File: SGN-E379982-) 1 AAAGGTAAGT TCATTTCATA CTTCAAGGCC GGGAAGATGT TTAGAAAAGG CTATATTTAC 61 CATCTGATTC GGGTGCATGA CATAAAGGCA GAGGCACTGA CTCTTCAATC AGTCTCGGTA 121 GTTAATGAAT TTCCTGATGT ATTCCCCGAG GAACTTCCAG GCCTTCCTCC AGAACGGGAG 181 ATAGAGTTTA CTATAGATGT ACTGCCAGAT ACCCAACCTA TATCTATACC TCCTTATAGA 241 ATGGCACCTG CTGAGTTGAA AGAATTGAAA GAGCAATTGA GGGATTTACT AGAAAAGGGC 301 TTCATCAGGC CTAGTACGTC ACCTTGGGGA GCACCGGTAC TATTTGTGAG GAAGAAGGAT 361 GGGTCGCTCC GGATGTGCAT TGATTATAGG CAGCTGAACA AAGTAACAAT AATGAACAGG 421 TATCCCCTCC CAAGGATTGA CGATCTATTT GACCAGTTGC AGGGTGCAAA GGGTTTTTCA 481 AAGATAGACT TGCGGTCAGG TTATCATCAG GTACGGGTTA GGGAGGCAGA TATCCCAATG 541 ACGGCATTCC GGACCCGATA TGGGCATTAT GTGTTTAGAG TGTTGTCTTT TGGGCTGACT 601 ATTGCTCCAG CGGTATTCAT GGATTTAATG AATTGAGTAT TTAATCCATT CCTTGATATG 661 TTTGTTATTG GATTTATAGA CGATATTCTG GTCTATTCAC GTTCAGAAGA GGAGCATGAA 721 GACTATTTAA Predicted gene structure (within gDNA segment 1594 to 1): Exon 1 624 1 ( 624 n); cDNA 1 624 ( 624 n); score: 0.912 MATCH C06HBa0057J04.1-11- SGN-E379982- 0.912 624 0.855 C PGS_C06HBa0057J04.1-11-_SGN-E379982- (624 1) Alignment (genomic DNA sequence = upper lines): AAAGGTAAGT TCATTTCATA CCTTAAGGCC GGGAAGATGG TTAGAAAAGG CTATATTTAT 565 |||||||||| |||||||||| | | |||||| ||||||||| |||||||||| ||||||||| AAAGGTAAGT TCATTTCATA CTTCAAGGCC GGGAAGATGT TTAGAAAAGG CTATATTTAC 60 CATCTTGTTC GGGTGCATGA CATAAAGGAA GAGGCACCGA CTCTTCAATC AGTCTGGGTA 505 ||||| ||| |||||||||| |||||||| | ||||||| || |||||||||| ||||| |||| CATCTGATTC GGGTGCATGA CATAAAGGCA GAGGCACTGA CTCTTCAATC AGTCTCGGTA 120 GTTAATGAAT TTCCTGATGT ATTCACAGAG GAACTTCCAG GCCTTTCTCC AGAATGAGAG 445 |||||||||| |||||||||| |||| | ||| |||||||||| ||||| |||| |||| | ||| GTTAATGAAT TTCCTGATGT ATTCCCCGAG GAACTTCCAG GCCTTCCTCC AGAACGGGAG 180 ATAGAGTTTA CTCTAGATGT ACTGCCAGAT ACCCAGCCTA TATCTATACC TCCTTATAGA 385 |||||||||| || ||||||| |||||||||| ||||| |||| |||||||||| |||||||||| ATAGAGTTTA CTATAGATGT ACTGCCAGAT ACCCAACCTA TATCTATACC TCCTTATAGA 240 ATGGCACCTG CTTAGTTGAA AGAATTGAAA GAGCAATTGA GGGATCTTCT AGAAAACGAC 325 |||||||||| || ||||||| |||||||||| |||||||||| ||||| | || |||||| | | ATGGCACCTG CTGAGTTGAA AGAATTGAAA GAGCAATTGA GGGATTTACT AGAAAAGGGC 300 TTCATTAGGC CTAGCACGTC ACCTTGGAGA GCACCGGTAC TATTTGTGAA GAAGATGGAT 265 ||||| |||| |||| ||||| ||||||| || |||||||||| ||||||||| ||||| |||| TTCATCAGGC CTAGTACGTC ACCTTGGGGA GCACCGGTAC TATTTGTGAG GAAGAAGGAT 360 GGGTCACTGC GGATGTGTAT TGATTATAGG CAGCTGAACA AAGTAACAGT AAAGAAAAGG 205 ||||| || | ||||||| || |||||||||| |||||||||| |||||||| | || ||| ||| GGGTCGCTCC GGATGTGCAT TGATTATAGG CAGCTGAACA AAGTAACAAT AATGAACAGG 420 TATCCCCTGC CAAGGATTGA TGATCTATTT GACCAGTTGC AGAGTGCTAT GTGTTTTTCA 145 |||||||| | |||||||||| ||||||||| |||||||||| || |||| | | |||||||| TATCCCCTCC CAAGGATTGA CGATCTATTT GACCAGTTGC AGGGTGCAAA GGGTTTTTCA 480 AAGATAGACT TGCGGTCAGG TTATCATCAG GTGCGGGTAA GGAAGGCAGA CATTCCAAAG 85 |||||||||| |||||||||| |||||||||| || ||||| | || ||||||| || |||| | AAGATAGACT TGCGGTCAGG TTATCATCAG GTACGGGTTA GGGAGGCAGA TATCCCAATG 540 ACGGCATTCC GGACTCGATA CGGGCATTAT GAGTTTAGAG TGCTGGCTTT TGAGAAGACT 25 |||||||||| |||| ||||| ||||||||| | |||||||| || || |||| || | |||| ACGGCATTCC GGACCCGATA TGGGCATTAT GTGTTTAGAG TGTTGTCTTT TGGGCTGACT 600 AATGCTCCAG CGGTGTTTAT GGAT 1 | |||||||| |||| || || |||| ATTGCTCCAG CGGTATTCAT GGAT 624 hqPGS_C06HBa0057J04.1-11-_SGN-E379982- (624 1) ******************************************************************************** EST sequence 3 -strand 521 n (File: SGN-E201553-) 1 TACCCAACCT ATATCTATAC CTCCTTATAG AATGGCACCT GCTGAGTTGA AAGAATTGAA 61 AGAGCAATTG AGGGATTTAC TAGAAAAGGG CTTCATCAGG CCTAGTACGT CACCTTGGGG 121 AGCACCGGTA CTATTTGTGA GGAAGAAGGA TGGGTCGCTC CGGATGTGCA TTGATTATAG 181 GCAGCTGAAC AAAGTAACAA TAATGAACAG GTATCCCCTC CCAAGGATTG ACGATCTATT 241 TGACCAGTTG CAGGGTGCAA AGGGTTTTTC AAAGATAGAC TTGCGGTCAG GTTATCATCA 301 GGTACGGGTT AGGGAGGCAG ATATCCCAAT GACGGCATTC CGGACCCGAT ATGGGCATTA 361 TGTGTTTAGA GTGTTGTCTT TTGGGCTGAC TATTGCTCCA GCGGTATTCA TGGATTTAAT 421 GAATTGAGTA TTTAATCCAT TCCTTGATAT GTTTGTTATT GGATTTATAG CCCCTATTAT 481 GGTCTATTCA CGTTCAGAAA AGGAGCATGA AGACTATTTA A Predicted gene structure (within gDNA segment 1088 to 1): Exon 1 415 1 ( 415 n); cDNA 1 415 ( 415 n); score: 0.904 MATCH C06HBa0057J04.1-11- SGN-E201553- 0.904 415 0.797 C PGS_C06HBa0057J04.1-11-_SGN-E201553- (415 1) Alignment (genomic DNA sequence = upper lines): TACCCAGCCT ATATCTATAC CTCCTTATAG AATGGCACCT GCTTAGTTGA AAGAATTGAA 356 |||||| ||| |||||||||| |||||||||| |||||||||| ||| |||||| |||||||||| TACCCAACCT ATATCTATAC CTCCTTATAG AATGGCACCT GCTGAGTTGA AAGAATTGAA 60 AGAGCAATTG AGGGATCTTC TAGAAAACGA CTTCATTAGG CCTAGCACGT CACCTTGGAG 296 |||||||||| |||||| | | ||||||| | |||||| ||| ||||| |||| |||||||| | AGAGCAATTG AGGGATTTAC TAGAAAAGGG CTTCATCAGG CCTAGTACGT CACCTTGGGG 120 AGCACCGGTA CTATTTGTGA AGAAGATGGA TGGGTCACTG CGGATGTGTA TTGATTATAG 236 |||||||||| |||||||||| ||||| ||| |||||| || |||||||| | |||||||||| AGCACCGGTA CTATTTGTGA GGAAGAAGGA TGGGTCGCTC CGGATGTGCA TTGATTATAG 180 GCAGCTGAAC AAAGTAACAG TAAAGAAAAG GTATCCCCTG CCAAGGATTG ATGATCTATT 176 |||||||||| ||||||||| ||| ||| || ||||||||| |||||||||| | |||||||| GCAGCTGAAC AAAGTAACAA TAATGAACAG GTATCCCCTC CCAAGGATTG ACGATCTATT 240 TGACCAGTTG CAGAGTGCTA TGTGTTTTTC AAAGATAGAC TTGCGGTCAG GTTATCATCA 116 |||||||||| ||| |||| | | ||||||| |||||||||| |||||||||| |||||||||| TGACCAGTTG CAGGGTGCAA AGGGTTTTTC AAAGATAGAC TTGCGGTCAG GTTATCATCA 300 GGTGCGGGTA AGGAAGGCAG ACATTCCAAA GACGGCATTC CGGACTCGAT ACGGGCATTA 56 ||| ||||| ||| |||||| | || |||| |||||||||| ||||| |||| | |||||||| GGTACGGGTT AGGGAGGCAG ATATCCCAAT GACGGCATTC CGGACCCGAT ATGGGCATTA 360 TGAGTTTAGA GTGCTGGCTT TTGAGAAGAC TAATGCTCCA GCGGTGTTTA TGGAT 1 || ||||||| ||| || ||| ||| | ||| || ||||||| ||||| || | ||||| TGTGTTTAGA GTGTTGTCTT TTGGGCTGAC TATTGCTCCA GCGGTATTCA TGGAT 415 hqPGS_C06HBa0057J04.1-11-_SGN-E201553- (415 1) ******************************************************************************** EST sequence 1 -strand 598 n (File: SGN-E350824-) 1 AGCATGTTAG ATTTTCTTCC CAGCCAGCAC AGAGTGCACC CCCACGTTTC ATGGGTAGGG 61 GGTTCGATCG TATGGGATAT TCAGAAGCTG GTCAGAGCTC TAGGGCGTTA GGGTCACAGA 121 TGGGCAGGAG TTTGAGCCAG TCGAGGCCAC CTTTGCCTCA GTGTTCTCAT TGTGGTAAGT 181 CCCATCCTGG GGAATGTCGT TGGGCTACAG GTGCGTGTTT TTCTTGCGGC CGTCAGGGCC 241 ATACTATGAG GGAGTGTCAC CTTAGAGGTA GTGCAGGTGG TATGGCACAG CCTACAGGGT 301 CCGTTGCTGG TTCATTTTCT TCTGTGGCTA TGCGCCCTAC GGGGCAGGGT ATTCAGGCGC 361 CAGCAGGCCA TGGTAGAGGA CGTGGTGGAG CTTCCAGTTC TAGCAGTGCC TCGAACCGTA 421 TATATGCTTT GACTAATAGG CAGGATCAGG GGGTGTCACC TAATGTGATC ACAGGTATAT 481 TATCACTATT CTCCCGAAGT GTGTATACAT TGATAGACCC AGGTTCCACC TTATCATATA 541 TATCTCCCTT TGTTGCTAGT AGGATCGGAA TAGAGTATGA GTTGATAGAA CCATTTGA Predicted gene structure (within gDNA segment 2093 to 1): Exon 1 1463 868 ( 596 n); cDNA 3 598 ( 596 n); score: 0.908 MATCH C06HBa0057J04.1-11- SGN-E350824- 0.908 596 0.997 C PGS_C06HBa0057J04.1-11-_SGN-E350824- (1463 868) Alignment (genomic DNA sequence = upper lines): CATGTTAGAT TTTCTTCCCA TCCAGCATAG ACTGCACCCC CACGTTTCAT GGGTATGGGG 1404 |||||||||| |||||||||| |||||| || | |||||||| |||||||||| ||||| |||| CATGTTAGAT TTTCTTCCCA GCCAGCACAG AGTGCACCCC CACGTTTCAT GGGTAGGGGG 62 TTCTATCGTA CAGGATATTT GGAAGCTGGT CAGAGCTCTA GGGAGTCAGG GTCACAGATG 1344 ||| |||||| ||||||| ||||||||| |||||||||| ||| || ||| |||||||||| TTCGATCGTA TGGGATATTC AGAAGCTGGT CAGAGCTCTA GGGCGTTAGG GTCACAGATG 122 GGTAGGGGTT TGAGCAAGTT GAGGCCACCT TTGCCTCGGT GTTCTCGCTG TGGTAGGTCC 1284 || ||| ||| ||||| ||| |||||||||| ||||||| || |||||| || ||||| |||| GGCAGGAGTT TGAGCCAGTC GAGGCCACCT TTGCCTCAGT GTTCTCATTG TGGTAAGTCC 182 CAACCTGGGA AATGTCATTG GGATACATGT GCGTGTTTTT CTTGCGGCCT TCATGGTCAT 1224 || |||||| |||||| ||| || |||| || |||||||||| ||||||||| ||| || ||| CATCCTGGGG AATGTCGTTG GGCTACAGGT GCGTGTTTTT CTTGCGGCCG TCAGGGCCAT 242 ACTATGAGGA AGTGTCACCT TAGAGGTAGT GCAGGTGGTA TGGCACAGCC TACTGGGTCC 1164 ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||| |||||| ACTATGAGGG AGTGTCACCT TAGAGGTAGT GCAGGTGGTA TGGCACAGCC TACAGGGTCC 302 GTTGATGGTT CATCTTCTTT TGTGGCTATG CGCCCTACGA GGCAGGGTAT TCAGGAGCCA 1104 |||| ||||| ||| ||||| |||||||||| ||||||||| |||||||||| ||||| |||| GTTGCTGGTT CATTTTCTTC TGTGGCTATG CGCCCTACGG GGCAGGGTAT TCAGGCGCCA 362 GCAAGTCGTG GTAGAGAATG TGGTGGAGCT TCCAGTTCTA GCGGTCCCTC GAACCGTATA 1044 ||| | | || |||||| | | |||||||||| |||||||||| || || |||| |||||||||| GCAGGCCATG GTAGAGGACG TGGTGGAGCT TCCAGTTCTA GCAGTGCCTC GAACCGTATA 422 TATGCTTTGA CTAGTAAGCA GGATCAGGAG GCATCACCTA ATGTGATCAC AAGTATATTA 984 |||||||||| ||| || ||| |||||||| | | ||||||| |||||||||| | |||||||| TATGCTTTGA CTAATAGGCA GGATCAGGGG GTGTCACCTA ATGTGATCAC AGGTATATTA 482 TCACTATTCT CCCGAAGTGT GTATGCATTG ATTGACCCAG GTTCCACCTT ATCATTTATA 924 |||||||||| |||||||||| |||| ||||| || ||||||| |||||||||| ||||| |||| TCACTATTCT CCCGAAGTGT GTATACATTG ATAGACCCAG GTTCCACCTT ATCATATATA 542 TCTCCCTTCG TTGCTAGTAG GATCGCAGTA GAGTCTGAGT TGATAGAACC GTTTGA 868 |||||||| | |||||||||| ||||| | || |||| ||||| |||||||||| ||||| TCTCCCTTTG TTGCTAGTAG GATCGGAATA GAGTATGAGT TGATAGAACC ATTTGA 598 hqPGS_C06HBa0057J04.1-11-_SGN-E350824- (1463 868) ******************************************************************************** EST sequence 4 +strand 196 n (File: SGN-E379248+) 1 CATCGTTATG TGATGGGATT GGATGGTTAT CTGATTGACA TTCCTATGGC AGTGACTCTT 61 CATCCAGGTA TGGACATTGC TCGGGTGCAG GCATATGCAC AGGGGGTAGA GGATCGGCAC 121 CGGGGACGTT AGCCAGATAG AGATTATAAT AGAGGGCCCC ATAAGAGGGC TAGATCACCA 181 GGTTATCTTG ACGAGT Predicted gene structure (within gDNA segment 2678 to 49): Exon 1 1682 1487 ( 196 n); cDNA 1 196 ( 196 n); score: 0.923 MATCH C06HBa0057J04.1-11- SGN-E379248+ 0.923 196 1.000 C PGS_C06HBa0057J04.1-11-_SGN-E379248+ (1682 1487) Alignment (genomic DNA sequence = upper lines): CATCGTTATG TGATGGGATT GGATCGTTAT CTAATTGACA GTTGTATGGC AGTGACTCTT 1623 |||||||||| |||||||||| |||| ||||| || ||||||| | |||||| |||||||||| CATCGTTATG TGATGGGATT GGATGGTTAT CTGATTGACA TTCCTATGGC AGTGACTCTT 60 CAGCCAGGTA TGGACATTGC TCGGGTGCAG GCATATGCAC AGGGAGTAGA GGATCGTCAC 1563 || ||||||| |||||||||| |||||||||| |||||||||| |||| ||||| |||||| ||| CATCCAGGTA TGGACATTGC TCGGGTGCAG GCATATGCAC AGGGGGTAGA GGATCGGCAC 120 CGGGGACGTC AGTCAGATAG AGATTATAAT AGAGGCCTGC ATAAGAGGGC TAGATCAGCA 1503 ||||||||| || ||||||| |||||||||| ||||| | | |||||||||| ||||||| || CGGGGACGTT AGCCAGATAG AGATTATAAT AGAGGGCCCC ATAAGAGGGC TAGATCACCA 180 GGTTATCCTG ACGAGT 1487 ||||||| || |||||| GGTTATCTTG ACGAGT 196 hqPGS_C06HBa0057J04.1-11-_SGN-E379248+ (1682 1487) Total number of EST alignments reported: 4 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 2963: PGL 1 (- strand): 1682 1 AGS-1 (624 1) SCR (e 0.912) Exon 1 624 1 ( 624 n); score: 0.912 PGS (624 1) SGN-E379982- PGS (415 1) SGN-E201553- 3-phase translation of AGS-1 (-strand): . . . . . . 624 AAAGGTAAGTTCATTTCATACCTTAAGGCCGGGAAGATGGTTAGAAAAGGCTATATTTAT K G K F I S Y L K A G K M V R K G Y I Y K V S S F H T L R P G R W L E K A I F I R - V H F I P - G R E D G - K R L Y L . . . . . . 564 CATCTTGTTCGGGTGCATGACATAAAGGAAGAGGCACCGACTCTTCAATCAGTCTGGGTA H L V R V H D I K E E A P T L Q S V W V I L F G C M T - R K R H R L F N Q S G - S S C S G A - H K G R G T D S S I S L G . . . . . . 504 GTTAATGAATTTCCTGATGTATTCACAGAGGAACTTCCAGGCCTTTCTCCAGAATGAGAG V N E F P D V F T E E L P G L S P E - E L M N F L M Y S Q R N F Q A F L Q N E R S - - I S - C I H R G T S R P F S R M R . . . . . . 444 ATAGAGTTTACTCTAGATGTACTGCCAGATACCCAGCCTATATCTATACCTCCTTATAGA I E F T L D V L P D T Q P I S I P P Y R - S L L - M Y C Q I P S L Y L Y L L I E D R V Y S R C T A R Y P A Y I Y T S L - . . . . . . 384 ATGGCACCTGCTTAGTTGAAAGAATTGAAAGAGCAATTGAGGGATCTTCTAGAAAACGAC M A P A - L K E L K E Q L R D L L E N D W H L L S - K N - K S N - G I F - K T T N G T C L V E R I E R A I E G S S R K R . . . . . . 324 TTCATTAGGCCTAGCACGTCACCTTGGAGAGCACCGGTACTATTTGTGAAGAAGATGGAT F I R P S T S P W R A P V L F V K K M D S L G L A R H L G E H R Y Y L - R R W M L H - A - H V T L E S T G T I C E E D G . . . . . . 264 GGGTCACTGCGGATGTGTATTGATTATAGGCAGCTGAACAAAGTAACAGTAAAGAAAAGG G S L R M C I D Y R Q L N K V T V K K R G H C G C V L I I G S - T K - Q - R K G W V T A D V Y - L - A A E Q S N S K E K . . . . . . 204 TATCCCCTGCCAAGGATTGATGATCTATTTGACCAGTTGCAGAGTGCTATGTGTTTTTCA Y P L P R I D D L F D Q L Q S A M C F S I P C Q G L M I Y L T S C R V L C V F Q V S P A K D - - S I - P V A E C Y V F F . . . . . . 144 AAGATAGACTTGCGGTCAGGTTATCATCAGGTGCGGGTAAGGAAGGCAGACATTCCAAAG K I D L R S G Y H Q V R V R K A D I P K R - T C G Q V I I R C G - G R Q T F Q R K D R L A V R L S S G A G K E G R H S K . . . . . . 84 ACGGCATTCCGGACTCGATACGGGCATTATGAGTTTAGAGTGCTGGCTTTTGAGAAGACT T A F R T R Y G H Y E F R V L A F E K T R H S G L D T G I M S L E C W L L R R L D G I P D S I R A L - V - S A G F - E D . . . 24 AATGCTCCAGCGGTGTTTATGGAT N A P A V F M D M L Q R C L W - C S S G V Y G Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-11-_PGL-1_AGS-1_PPS_1 (369 1) (frame '1'; 369 bp, 123 residues) 1 LKELKEQLRD LLENDFIRPS TSPWRAPVLF VKKMDGSLRM CIDYRQLNKV TVKKRYPLPR 61 IDDLFDQLQS AMCFSKIDLR SGYHQVRVRK ADIPKTAFRT RYGHYEFRVL AFEKTNAPAV 121 FMD 3-phase translation of AGS-1 (+strand): . . . . . . 1 ATCCATAAACACCGCTGGAGCATTAGTCTTCTCAAAAGCCAGCACTCTAAACTCATAATG I H K H R W S I S L L K S Q H S K L I M S I N T A G A L V F S K A S T L N S - C P - T P L E H - S S Q K P A L - T H N . . . . . . 61 CCCGTATCGAGTCCGGAATGCCGTCTTTGGAATGTCTGCCTTCCTTACCCGCACCTGATG P V S S P E C R L W N V C L P Y P H L M P Y R V R N A V F G M S A F L T R T - - A R I E S G M P S L E C L P S L P A P D . . . . . . 121 ATAACCTGACCGCAAGTCTATCTTTGAAAAACACATAGCACTCTGCAACTGGTCAAATAG I T - P Q V Y L - K T H S T L Q L V K - - P D R K S I F E K H I A L C N W S N R D N L T A S L S L K N T - H S A T G Q I . . . . . . 181 ATCATCAATCCTTGGCAGGGGATACCTTTTCTTTACTGTTACTTTGTTCAGCTGCCTATA I I N P W Q G I P F L Y C Y F V Q L P I S S I L G R G Y L F F T V T L F S C L - D H Q S L A G D T F S L L L L C S A A Y . . . . . . 241 ATCAATACACATCCGCAGTGACCCATCCATCTTCTTCACAAATAGTACCGGTGCTCTCCA I N T H P Q - P I H L L H K - Y R C S P S I H I R S D P S I F F T N S T G A L Q N Q Y T S A V T H P S S S Q I V P V L S . . . . . . 301 AGGTGACGTGCTAGGCCTAATGAAGTCGTTTTCTAGAAGATCCCTCAATTGCTCTTTCAA R - R A R P N E V V F - K I P Q L L F Q G D V L G L M K S F S R R S L N C S F N K V T C - A - - S R F L E D P S I A L S . . . . . . 361 TTCTTTCAACTAAGCAGGTGCCATTCTATAAGGAGGTATAGATATAGGCTGGGTATCTGG F F Q L S R C H S I R R Y R Y R L G I W S F N - A G A I L - G G I D I G W V S G I L S T K Q V P F Y K E V - I - A G Y L . . . . . . 421 CAGTACATCTAGAGTAAACTCTATCTCTCATTCTGGAGAAAGGCCTGGAAGTTCCTCTGT Q Y I - S K L Y L S F W R K A W K F L C S T S R V N S I S H S G E R P G S S S V A V H L E - T L S L I L E K G L E V P L . . . . . . 481 GAATACATCAGGAAATTCATTAACTACCCAGACTGATTGAAGAGTCGGTGCCTCTTCCTT E Y I R K F I N Y P D - L K S R C L F L N T S G N S L T T Q T D - R V G A S S F - I H Q E I H - L P R L I E E S V P L P . . . . . . 541 TATGTCATGCACCCGAACAAGATGATAAATATAGCCTTTTCTAACCATCTTCCCGGCCTT Y V M H P N K M I N I A F S N H L P G L M S C T R T R - - I - P F L T I F P A L L C H A P E Q D D K Y S L F - P S S R P . . . 601 AAGGTATGAAATGAACTTACCTTT K V - N E L T F R Y E M N L P - G M K - T Y L Maximal non-overlapping open reading frames (>= 64 codons): none AGS-2 (1463 868) SCR (e 0.908) Exon 1 1463 868 ( 596 n); score: 0.908 PGS (1463 868) SGN-E350824- 3-phase translation of AGS-2 (-strand): . . . . . . 1463 CATGTTAGATTTTCTTCCCATCCAGCATAGACTGCACCCCCACGTTTCATGGGTATGGGG H V R F S S H P A - T A P P R F M G M G M L D F L P I Q H R L H P H V S W V W G C - I F F P S S I D C T P T F H G Y G . . . . . . 1403 TTCTATCGTACAGGATATTTGGAAGCTGGTCAGAGCTCTAGGGAGTCAGGGTCACAGATG F Y R T G Y L E A G Q S S R E S G S Q M S I V Q D I W K L V R A L G S Q G H R W V L S Y R I F G S W S E L - G V R V T D . . . . . . 1343 GGTAGGGGTTTGAGCAAGTTGAGGCCACCTTTGCCTCGGTGTTCTCGCTGTGGTAGGTCC G R G L S K L R P P L P R C S R C G R S V G V - A S - G H L C L G V L A V V G P G - G F E Q V E A T F A S V F S L W - V . . . . . . 1283 CAACCTGGGAAATGTCATTGGGATACATGTGCGTGTTTTTCTTGCGGCCTTCATGGTCAT Q P G K C H W D T C A C F S C G L H G H N L G N V I G I H V R V F L A A F M V I P T W E M S L G Y M C V F F L R P S W S . . . . . . 1223 ACTATGAGGAAGTGTCACCTTAGAGGTAGTGCAGGTGGTATGGCACAGCCTACTGGGTCC T M R K C H L R G S A G G M A Q P T G S L - G S V T L E V V Q V V W H S L L G P Y Y E E V S P - R - C R W Y G T A Y W V . . . . . . 1163 GTTGATGGTTCATCTTCTTTTGTGGCTATGCGCCCTACGAGGCAGGGTATTCAGGAGCCA V D G S S S F V A M R P T R Q G I Q E P L M V H L L L W L C A L R G R V F R S Q R - W F I F F C G Y A P Y E A G Y S G A . . . . . . 1103 GCAAGTCGTGGTAGAGAATGTGGTGGAGCTTCCAGTTCTAGCGGTCCCTCGAACCGTATA A S R G R E C G G A S S S S G P S N R I Q V V V E N V V E L P V L A V P R T V Y S K S W - R M W W S F Q F - R S L E P Y . . . . . . 1043 TATGCTTTGACTAGTAAGCAGGATCAGGAGGCATCACCTAATGTGATCACAAGTATATTA Y A L T S K Q D Q E A S P N V I T S I L M L - L V S R I R R H H L M - S Q V Y Y I C F D - - A G S G G I T - C D H K Y I . . . . . . 983 TCACTATTCTCCCGAAGTGTGTATGCATTGATTGACCCAGGTTCCACCTTATCATTTATA S L F S R S V Y A L I D P G S T L S F I H Y S P E V C M H - L T Q V P P Y H L Y I T I L P K C V C I D - P R F H L I I Y . . . . . . 923 TCTCCCTTCGTTGCTAGTAGGATCGCAGTAGAGTCTGAGTTGATAGAACCGTTTGA S P F V A S R I A V E S E L I E P F L P S L L V G S Q - S L S - - N R L I S L R C - - D R S R V - V D R T V - Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-11-_PGL-1_AGS-2_PPS_1 (1433 870) (frame '1'; 564 bp, 188 residues) 1 TAPPRFMGMG FYRTGYLEAG QSSRESGSQM GRGLSKLRPP LPRCSRCGRS QPGKCHWDTC 61 ACFSCGLHGH TMRKCHLRGS AGGMAQPTGS VDGSSSFVAM RPTRQGIQEP ASRGRECGGA 121 SSSSGPSNRI YALTSKQDQE ASPNVITSIL SLFSRSVYAL IDPGSTLSFI SPFVASRIAV 181 ESELIEPF 3-phase translation of AGS-2 (+strand): . . . . . . 868 TCAAACGGTTCTATCAACTCAGACTCTACTGCGATCCTACTAGCAACGAAGGGAGATATA S N G S I N S D S T A I L L A T K G D I Q T V L S T Q T L L R S Y - Q R R E I - K R F Y Q L R L Y C D P T S N E G R Y . . . . . . 928 AATGATAAGGTGGAACCTGGGTCAATCAATGCATACACACTTCGGGAGAATAGTGATAAT N D K V E P G S I N A Y T L R E N S D N M I R W N L G Q S M H T H F G R I V I I K - - G G T W V N Q C I H T S G E - - - . . . . . . 988 ATACTTGTGATCACATTAGGTGATGCCTCCTGATCCTGCTTACTAGTCAAAGCATATATA I L V I T L G D A S - S C L L V K A Y I Y L - S H - V M P P D P A Y - S K H I Y Y T C D H I R - C L L I L L T S Q S I Y . . . . . . 1048 CGGTTCGAGGGACCGCTAGAACTGGAAGCTCCACCACATTCTCTACCACGACTTGCTGGC R F E G P L E L E A P P H S L P R L A G G S R D R - N W K L H H I L Y H D L L A T V R G T A R T G S S T T F S T T T C W . . . . . . 1108 TCCTGAATACCCTGCCTCGTAGGGCGCATAGCCACAAAAGAAGATGAACCATCAACGGAC S - I P C L V G R I A T K E D E P S T D P E Y P A S - G A - P Q K K M N H Q R T L L N T L P R R A H S H K R R - T I N G . . . . . . 1168 CCAGTAGGCTGTGCCATACCACCTGCACTACCTCTAAGGTGACACTTCCTCATAGTATGA P V G C A I P P A L P L R - H F L I V - Q - A V P Y H L H Y L - G D T S S - Y D P S R L C H T T C T T S K V T L P H S M . . . . . . 1228 CCATGAAGGCCGCAAGAAAAACACGCACATGTATCCCAATGACATTTCCCAGGTTGGGAC P - R P Q E K H A H V S Q - H F P G W D H E G R K K N T H M Y P N D I S Q V G T T M K A A R K T R T C I P M T F P R L G . . . . . . 1288 CTACCACAGCGAGAACACCGAGGCAAAGGTGGCCTCAACTTGCTCAAACCCCTACCCATC L P Q R E H R G K G G L N L L K P L P I Y H S E N T E A K V A S T C S N P Y P S P T T A R T P R Q R W P Q L A Q T P T H . . . . . . 1348 TGTGACCCTGACTCCCTAGAGCTCTGACCAGCTTCCAAATATCCTGTACGATAGAACCCC C D P D S L E L - P A S K Y P V R - N P V T L T P - S S D Q L P N I L Y D R T P L - P - L P R A L T S F Q I S C T I E P . . . . . . 1408 ATACCCATGAAACGTGGGGGTGCAGTCTATGCTGGATGGGAAGAAAATCTAACATG I P M K R G G A V Y A G W E E N L T Y P - N V G V Q S M L D G K K I - H H T H E T W G C S L C W M G R K S N M Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-11+_PGL-1_AGS-2_PPS_1 (1155 1352) (frame '0'; 195 bp, 65 residues) 1 TINGPSRLCH TTCTTSKVTL PHSMTMKAAR KTRTCIPMTF PRLGPTTART PRQRWPQLAQ 61 TPTHL- AGS-3 (1682 1487) SCR (e 0.923) Exon 1 1682 1487 ( 196 n); score: 0.923 PGS (1682 1487) SGN-E379248+ 3-phase translation of AGS-3 (-strand): . . . . . . 1682 CATCGTTATGTGATGGGATTGGATCGTTATCTAATTGACAGTTGTATGGCAGTGACTCTT H R Y V M G L D R Y L I D S C M A V T L I V M - W D W I V I - L T V V W Q - L F S L C D G I G S L S N - Q L Y G S D S . . . . . . 1622 CAGCCAGGTATGGACATTGCTCGGGTGCAGGCATATGCACAGGGAGTAGAGGATCGTCAC Q P G M D I A R V Q A Y A Q G V E D R H S Q V W T L L G C R H M H R E - R I V T S A R Y G H C S G A G I C T G S R G S S . . . . . . 1562 CGGGGACGTCAGTCAGATAGAGATTATAATAGAGGCCTGCATAAGAGGGCTAGATCAGCA R G R Q S D R D Y N R G L H K R A R S A G D V S Q I E I I I E A C I R G L D Q Q P G T S V R - R L - - R P A - E G - I S . . 1502 GGTTATCCTGACGAGT G Y P D E V I L T S R L S - R Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-11-_PGL-1_AGS-3_PPS_1 (1682 1488) (frame '1'; 195 bp, 65 residues) 1 HRYVMGLDRY LIDSCMAVTL QPGMDIARVQ AYAQGVEDRH RGRQSDRDYN RGLHKRARSA 61 GYPDE 3-phase translation of AGS-3 (+strand): . . . . . . 1487 ACTCGTCAGGATAACCTGCTGATCTAGCCCTCTTATGCAGGCCTCTATTATAATCTCTAT T R Q D N L L I - P S Y A G L Y Y N L Y L V R I T C - S S P L M Q A S I I I S I S S G - P A D L A L L C R P L L - S L . . . . . . 1547 CTGACTGACGTCCCCGGTGACGATCCTCTACTCCCTGTGCATATGCCTGCACCCGAGCAA L T D V P G D D P L L P V H M P A P E Q - L T S P V T I L Y S L C I C L H P S N S D - R P R - R S S T P C A Y A C T R A . . . . . . 1607 TGTCCATACCTGGCTGAAGAGTCACTGCCATACAACTGTCAATTAGATAACGATCCAATC C P Y L A E E S L P Y N C Q L D N D P I V H T W L K S H C H T T V N - I T I Q S M S I P G - R V T A I Q L S I R - R S N . . 1667 CCATCACATAACGATG P S H N D H H I T M P I T - R Maximal non-overlapping open reading frames (>= 64 codons): none ... finished at: Mon Jul 24 23:15:27 2006 ________________________________________________________________________________ Sequence 12: C06HBa0057J04.1-12, from 1 to 12360, both strands analyzed. ... started at: Mon Jul 24 23:15:27 2006 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 3 ... matches indexed, elapsed seconds = 3 HitsTableSize = 1 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 4 HitsTableSize = 0 ******************************************************************************** EST sequence 1 -strand 735 n (File: SGN-E327901-) 1 TTTGAGATTG CTCTCATCTT TTACATTATC CTGTTATCAT TTNGTCACCA CATTCATTAA 61 CATTAGCATA TTTGCTCTTC ATAATTACTC ACACATTTCT ACGTGAATGT TCATTTCATC 121 ATATGCCAAT GCACTTGGTC ATTTGTGTGT TTCACACTCT TGCTTAACCC TTCTAGGGTC 181 TTATCATTTC ATTACATAAT AATCATTTCA TCGTATTCCA ATACACATGT CAATTTACTG 241 TGTTTTACAC TCGTACTTAA CCCTTTTAGG GTTACACCAT TTCATTACAC AATTCTTAGG 301 TTCACTTATT TTTAAATGTA TGCTAGAGTT ATGGGTTTAT ACATACAAGC CATGAGGCTT 361 TATTCATAGT TCGTTAAGTG ACTCGTATCT TCCTAAGATT ATGTAGTATA AGACTTACGT 421 ATCTTGTATG TATGCCAAGA TCTTGATTTC GATCTAAGTA GGTGGCATTA TTTTTGAACA 481 TTTAATTCAC GTATGACTTA TGTATCAATA TGGAAATTTG GGCAAGGGAA CCCAAGTTGG 541 AATCCCTTAG ACAAAACTTA CATTTTTCCT TGATTTTATT TTTATTTATC ATGACATTAG 601 GACCCTAGAT CGAGCCCCAA CACTAACATT TCTTTTTAGT TCCTTTTCTT TAGAAATTCT 661 CTCCCTTGGC CAAGGGGTTG TGGGTTCAAA CCCCCCACAA CCCTCTTTTT TTTCCTATTT 721 ATTTTTCTTG TTCAT Predicted gene structure (within gDNA segment 9322 to 1266): Exon 1 7763 7590 ( 174 n); cDNA 1 178 ( 178 n); score: 0.836 Intron 1 7589 7560 ( 30 n); Pd: 0.498 (s: 0.87), Pa: 0.000 (s: 0.43) ?? Exon 2 7559 7017 ( 543 n); cDNA 179 726 ( 548 n); score: 0.745 MATCH C06HBa0057J04.1-12- SGN-E327901- 0.767 717 0.976 C PGS_C06HBa0057J04.1-12-_SGN-E327901- (7763 7590,7559 7017) Alignment (genomic DNA sequence = upper lines): TTTCAGATTG CTCTCATCTT TTACATTAGC CTCATATCAT GTTGTCACCA CATTCCTTCA 7704 ||| |||||| |||||||||| |||||||| | || |||||| | ||||||| ||||| || | TTTGAGATTG CTCTCATCTT TTACATTATC CTGTTATCAT TTNGTCACCA CATTCATTAA 60 CATTAG-AT- TTT-CTCTTA ATAATTACTC ACCCATTT-T ACTTCAATGT TCATTTCATC 7648 |||||| || ||| ||||| |||||||||| || ||||| | || | ||||| |||||||||| CATTAGCATA TTTGCTCTTC ATAATTACTC ACACATTTCT ACGTGAATGT TCATTTCATC 120 ATATGCCAAT GCACTTGGCC ATTTGGTGTG TTTCACACTC TTACTTAACC CTTCTA-CGG 7589 |||||||||| |||||||| | |||| ||||| |||||||||| || ||||||| |||||| | ATATGCCAAT GCACTTGGTC ATTT-GTGTG TTTCACACTC TTGCTTAACC CTTCTAGGG. 178 TTTTTCATCA CTTCATTGTT GATTTAATCA -TT-T-A-TT CATTTCAT-- TTTTCATTTC 7535 || | | || |||| ||| | ||||||| .......... .......... .........T CTTATCATTT CATTACATAA TAATCATTTC 209 ATCATGTGCT AATTCACTTT ACCATTTAGT GTGTTTTACA CTCCTACTTA ACCCTTTTAG 7475 ||| | | | ||| ||| | | ||||| | |||||||||| ||| |||||| |||||||||| ATCGTATTCC AATACACATG TCAATTTACT GTGTTTTACA CTCGTACTTA ACCCTTTTAG 269 GTTTAAATCA TCTCATTACA TACATCATAG GTTCACTTAT TTTTAAACGT ATGCTAGACA 7415 | ||| | || | |||||||| | || ||| |||||||||| ||||||| || |||||||| GGTTACACCA TTTCATTACA CAATTCTTAG GTTCACTTAT TTTTAAATGT ATGCTAGAGT 329 TATGGG-TTA TACACACAAG CCATGAGGCT TTGTTCATAG TTCGTTTAAG TGATTCATAT 7356 |||||| ||| |||| ||||| |||||||||| || ||||||| |||| ||||| ||| || ||| TATGGGTTTA TACATACAAG CCATGAGGCT TTATTCATAG TTCG-TTAAG TGACTCGTAT 388 CTTTTTAAGA TTATGTCGTA CAAGACTTAT GCATCTTGTA TGTATGCAAA CATCTCAATT 7296 ||| ||||| |||||| ||| |||||||| | |||||||| ||||||| || |||| ||| CTTCCTAAGA TTATGTAGTA TAAGACTTAC GTATCTTGTA TGTATGCCAA GATCTTGATT 448 TTGATCTAAG TAGGCAACAT TTTTCTTGGC TAATTAATTC ACGCATGATT TATGTATCAA 7236 | |||||||| |||| ||| | || ||| | ||||||| ||| |||| | |||||||||| TCGATCTAAG TAGGTGGCAT TATTTTTGAA CATTTAATTC ACGTATGACT TATGTATCAA 508 TA-AGAAATC TGGTAAAGGG AACACGATTT TCAAATCCCT TGGACAACAC TTTACATCTT 7177 || ||||| ||| ||||| ||| | | | | ||||||| | ||||| || |||||| || TATGGAAATT TGGGCAAGGG AAC-CCAAGT TGGAATCCCT TAGACAAAAC -TTACATTTT 566 AAGTTTTATT TAATTTCGCT CCACATAGCT TTGGGACCCT AGATTGATCC CCAACATCAA 7117 || || || ||| | ||| | || ||||||| |||| || || |||||| || TCCTTGATTT TATTTTTATT TATCATGACA TTAGGACCCT AGATCGAGCC CCAACACTAA 626 CGTTTCCTTT TTATTTCAAT CCGTTGAAAA TTCTCTTCCT TGG-CGAGGT GTTGTGGGTT 7058 | |||| ||| | || | | | || ||| |||||| ||| ||| | ||| |||||||||| CATTTCTTTT TAGTTCCTTT TCTTTAGAAA TTCTCTCCCT TGGCCAAGGG GTTGTGGGTT 686 TAAACCCCCA CATCCCCTTT TAGTTTTTAT TTTTTGTATT T 7017 |||||||| || || | | ||||| | ||| | || | CAAACCCCC- CACAACCCTC TTTTTTTTCC TATTTATTTT T 726 hqPGS_C06HBa0057J04.1-12-_SGN-E327901- (7763 7590,7559 7017) Total number of EST alignments reported: 1 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 12360: PGL 1 (- strand): 7763 7017 AGS-1 (7763 7590,7559 7017) SCR (e 0.836 d 0.498 a 0.000,e 0.745) Exon 1 7763 7590 ( 174 n); score: 0.836 Intron 1 7589 7560 ( 30 n); Pd: 0.498 Pa: 0.000 Exon 2 7559 7017 ( 543 n); score: 0.745 PGS (7763 7590,7559 7017) SGN-E327901- 3-phase translation of AGS-1 (-strand): . . . . . . 7763 TTTCAGATTGCTCTCATCTTTTACATTAGCCTCATATCATGTTGTCACCACATTCCTTCA F Q I A L I F Y I S L I S C C H H I P S F R L L S S F T L A S Y H V V T T F L H S D C S H L L H - P H I M L S P H S F . . . . . . 7703 CATTAGATTTTCTCTTAATAATTACTCACCCATTTTACTTCAATGTTCATTTCATCATAT H - I F S - - L L T H F T S M F I S S Y I R F S L N N Y S P I L L Q C S F H H M T L D F L L I I T H P F Y F N V H F I I . . . . . . : 7643 GCCAATGCACTTGGCCATTTGGTGTGTTTCACACTCTTACTTAACCCTTCTACG : ATTTAT A N A L G H L V C F T L L L N P S T : I Y P M H L A I W C V S H S Y L T L L R : F I C Q C T W P F G V F H T L T - P F Y : D L . . . . . . 7553 TCATTTCATTTTTCATTTCATCATGTGCTAATTCACTTTACCATTTAGTGTGTTTTACAC S F H F S F H H V L I H F T I - C V L H H F I F H F I M C - F T L P F S V F Y T F I S F F I S S C A N S L Y H L V C F T . . . . . . 7493 TCCTACTTAACCCTTTTAGGTTTAAATCATCTCATTACATACATCATAGGTTCACTTATT S Y L T L L G L N H L I T Y I I G S L I P T - P F - V - I I S L H T S - V H L F L L L N P F R F K S S H Y I H H R F T Y . . . . . . 7433 TTTAAACGTATGCTAGACATATGGGTTATACACACAAGCCATGAGGCTTTGTTCATAGTT F K R M L D I W V I H T S H E A L F I V L N V C - T Y G L Y T Q A M R L C S - F F - T Y A R H M G Y T H K P - G F V H S . . . . . . 7373 CGTTTAAGTGATTCATATCTTTTTAAGATTATGTCGTACAAGACTTATGCATCTTGTATG R L S D S Y L F K I M S Y K T Y A S C M V - V I H I F L R L C R T R L M H L V C S F K - F I S F - D Y V V Q D L C I L Y . . . . . . 7313 TATGCAAACATCTCAATTTTGATCTAAGTAGGCAACATTTTTCTTGGCTAATTAATTCAC Y A N I S I L I - V G N I F L G - L I H M Q T S Q F - S K - A T F F L A N - F T V C K H L N F D L S R Q H F S W L I N S . . . . . . 7253 GCATGATTTATGTATCAATAAGAAATCTGGTAAAGGGAACACGATTTTCAAATCCCTTGG A - F M Y Q - E I W - R E H D F Q I P W H D L C I N K K S G K G N T I F K S L G R M I Y V S I R N L V K G T R F S N P L . . . . . . 7193 ACAACACTTTACATCTTAAGTTTTATTTAATTTCGCTCCACATAGCTTTGGGACCCTAGA T T L Y I L S F I - F R S T - L W D P R Q H F T S - V L F N F A P H S F G T L D D N T L H L K F Y L I S L H I A L G P - . . . . . . 7133 TTGATCCCCAACATCAACGTTTCCTTTTTATTTCAATCCGTTGAAAATTCTCTTCCTTGG L I P N I N V S F L F Q S V E N S L P W - S P T S T F P F Y F N P L K I L F L G I D P Q H Q R F L F I S I R - K F S S L . . . . . . 7073 CGAGGTGTTGTGGGTTTAAACCCCCACATCCCCTTTTAGTTTTTATTTTTTGTATTT R G V V G L N P H I P F - F L F F V F E V L W V - T P T S P F S F Y F L Y A R C C G F K P P H P L L V F I F C I Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-12-_PGL-1_AGS-1_PPS_1 (7505 7287) (frame '1'; 216 bp, 72 residues) 1 CVLHSYLTLL GLNHLITYII GSLIFKRMLD IWVIHTSHEA LFIVRLSDSY LFKIMSYKTY 61 ASCMYANISI LI- >C06HBa0057J04.1-12-_PGL-1_AGS-1_PPS_2 (7347 7135) (frame '0'; 210 bp, 70 residues) 1 DYVVQDLCIL YVCKHLNFDL SRQHFSWLIN SRMIYVSIRN LVKGTRFSNP LDNTLHLKFY 61 LISLHIALGP - >C06HBa0057J04.1-12-_PGL-1_AGS-1_PPS_3 (7762 7590,7559 7523) (frame '2'; 207 bp, 69 residues) 1 FRLLSSFTLA SYHVVTTFLH IRFSLNNYSP ILLQCSFHHM PMHLAIWCVS HSYLTLLRFI 61 HFIFHFIMC- ... finished at: Mon Jul 24 23:15:37 2006 ________________________________________________________________________________ Sequence 13: C06HBa0057J04.1-13, from 1 to 1857, both strands analyzed. ... started at: Mon Jul 24 23:15:37 2006 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 4 HitsTableSize = 0 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 3 ... matches indexed, elapsed seconds = 3 HitsTableSize = 0 No significant EST matches were found. ... finished at: Mon Jul 24 23:15:45 2006 ________________________________________________________________________________ Sequence 14: C06HBa0057J04.1-14, from 1 to 2120, both strands analyzed. ... started at: Mon Jul 24 23:15:45 2006 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 4 HitsTableSize = 0 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 3 ... matches indexed, elapsed seconds = 4 HitsTableSize = 0 No significant EST matches were found. ... finished at: Mon Jul 24 23:15:52 2006 ________________________________________________________________________________ Sequence 15: C06HBa0057J04.1-15, from 1 to 2834, both strands analyzed. ... started at: Mon Jul 24 23:15:52 2006 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 3 ... matches indexed, elapsed seconds = 3 HitsTableSize = 0 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 4 HitsTableSize = 0 No significant EST matches were found. ... finished at: Mon Jul 24 23:15:59 2006 ________________________________________________________________________________ Sequence 16: C06HBa0057J04.1-16, from 1 to 3024, both strands analyzed. ... started at: Mon Jul 24 23:15:59 2006 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 3 ... matches indexed, elapsed seconds = 3 HitsTableSize = 0 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 4 HitsTableSize = 0 No significant EST matches were found. ... finished at: Mon Jul 24 23:16:07 2006 ________________________________________________________________________________ Sequence 17: C06HBa0057J04.1-17, from 1 to 2687, both strands analyzed. ... started at: Mon Jul 24 23:16:07 2006 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 4 HitsTableSize = 0 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 3 ... matches indexed, elapsed seconds = 3 HitsTableSize = 0 No significant EST matches were found. ... finished at: Mon Jul 24 23:16:14 2006 ________________________________________________________________________________ Sequence 18: C06HBa0057J04.1-18, from 1 to 5133, both strands analyzed. ... started at: Mon Jul 24 23:16:14 2006 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 4 HitsTableSize = 1 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 4 HitsTableSize = 0 ******************************************************************************** EST sequence 1 -strand 734 n (File: SGN-E577717-) 1 GAAGAATGCC ATTTAGGTTG TGCAATGCAC CCGCAACCTT TCAGAGATAT ATGATGTCAA 61 TATTTTCTGA CATGGTGGAG GATACTATAG AGGTTTTTAT GGATGATTTT TCTGTGGTTG 121 GTGATTCAAT TGAGCGATGC TTGACCAATT AATCTGAGGT TCTTAAGAGA TGTGAAGACT 181 GCAATCTGGT ACTAAACTGG GAAAAGTGTC ATTTCATGGT AAAAAAAGGT ATTGTGTTGG 241 GTCATCGCAT TTCAGAAAAG GGAATAGAGG TTGATCGAGC TAAAGTTGAG GTAATAGAGA 301 GACTTCCCCC ACCGATCTCT ATGAAAGGTG TGAGAAGCTT TCTTGGGCAT GCAGGTTTTT 361 ATCGGAGATT CATCAAAGAC TTTTCAAAGA TTGCACACCC ATTGTGCAAA TTACTGAAGA 421 AAGAATGTAA ATTTTGTTTT GATGAGTCCT GTCTTAAAGC ATTCAGTGAG CTAAAAGAGA 481 AGTTAGTGTC TGCACCTATC ATTATTTCTC CGGATTGGAA AAGTCCATTT GAGGTAATGT 541 GCGATGCTAG TGGGGTGGCT CTTGGTGTAG TATTGGGACA GAGAAGAAAC AAAATCCTTC 601 ACCCAATTTA CTATGCTAGT AAAGCCCTAA ATGAAGCCCA GAAGAACTAC ACAGTGACTG 661 AGCATGAACT CCTCGCAGTA GTCTTTGCTT TTGAGATATT TTGCTCCTAT TTGCTAGGTA 721 CTAGAGTCAT AGAG Predicted gene structure (within gDNA segment 3681 to 1): Exon 1 2630 2305 ( 326 n); cDNA 1 326 ( 326 n); score: 0.942 MATCH C06HBa0057J04.1-18- SGN-E577717- 0.942 326 0.444 C PGS_C06HBa0057J04.1-18-_SGN-E577717- (2630 2305) Alignment (genomic DNA sequence = upper lines): GAAGAATGCC GTTTGGATTG TGCAATGCAC CCGCAACCTT TCAGAAATGT ATGATGTCAA 2571 |||||||||| ||| | ||| |||||||||| |||||||||| ||||| || | |||||||||| GAAGAATGCC ATTTAGGTTG TGCAATGCAC CCGCAACCTT TCAGAGATAT ATGATGTCAA 60 TATTTTCTGA CATGGTGGAG GATACTATAG AGGTTTTTAT GGATGATTTT TCTGTGGTTG 2511 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TATTTTCTGA CATGGTGGAG GATACTATAG AGGTTTTTAT GGATGATTTT TCTGTGGTTG 120 GTGATTCATT TGAGCGATGC TTGACCAATT TATCTGAGGT TCTTAAGAGA TGTGAAGACT 2451 |||||||| | |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| GTGATTCAAT TGAGCGATGC TTGACCAATT AATCTGAGGT TCTTAAGAGA TGTGAAGACT 180 GCAATTTGGT ACTAAACTGG GAAAAGTGTC ATTTCATGGT GAAAAAGGGT ATTGTGTTGG 2391 ||||| |||| |||||||||| |||||||||| |||||||||| ||||| ||| |||||||||| GCAATCTGGT ACTAAACTGG GAAAAGTGTC ATTTCATGGT AAAAAAAGGT ATTGTGTTGG 240 ATCATCGCAT TTCAAAAAAG GGCATAGAGG TTGATTCAGC TAAAGTTGAG GTAATAGAGA 2331 ||||||||| |||| ||||| || ||||||| ||||| ||| |||||||||| |||||||||| GTCATCGCAT TTCAGAAAAG GGAATAGAGG TTGATCGAGC TAAAGTTGAG GTAATAGAGA 300 GACTTTCCCC GCCGATCTCT TTAAAA 2305 ||||| |||| ||||||||| | ||| GACTTCCCCC ACCGATCTCT ATGAAA 326 hqPGS_C06HBa0057J04.1-18-_SGN-E577717- (2630 2305) Total number of EST alignments reported: 1 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 5133: PGL 1 (- strand): 2630 2305 AGS-1 (2630 2305) SCR (e 0.942) Exon 1 2630 2305 ( 326 n); score: 0.942 PGS (2630 2305) SGN-E577717- 3-phase translation of AGS-1 (-strand): . . . . . . 2630 GAAGAATGCCGTTTGGATTGTGCAATGCACCCGCAACCTTTCAGAAATGTATGATGTCAA E E C R L D C A M H P Q P F R N V - C Q K N A V W I V Q C T R N L S E M Y D V N R M P F G L C N A P A T F Q K C M M S . . . . . . 2570 TATTTTCTGACATGGTGGAGGATACTATAGAGGTTTTTATGGATGATTTTTCTGTGGTTG Y F L T W W R I L - R F L W M I F L W L I F - H G G G Y Y R G F Y G - F F C G W I F S D M V E D T I E V F M D D F S V V . . . . . . 2510 GTGATTCATTTGAGCGATGCTTGACCAATTTATCTGAGGTTCTTAAGAGATGTGAAGACT V I H L S D A - P I Y L R F L R D V K T - F I - A M L D Q F I - G S - E M - R L G D S F E R C L T N L S E V L K R C E D . . . . . . 2450 GCAATTTGGTACTAAACTGGGAAAAGTGTCATTTCATGGTGAAAAAGGGTATTGTGTTGG A I W Y - T G K S V I S W - K R V L C W Q F G T K L G K V S F H G E K G Y C V G C N L V L N W E K C H F M V K K G I V L . . . . . . 2390 ATCATCGCATTTCAAAAAAGGGCATAGAGGTTGATTCAGCTAAAGTTGAGGTAATAGAGA I I A F Q K R A - R L I Q L K L R - - R S S H F K K G H R G - F S - S - G N R E D H R I S K K G I E V D S A K V E V I E . . . 2330 GACTTTCCCCGCCGATCTCTTTAAAA D F P R R S L - T F P A D L F K R L S P P I S L K Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-18-_PGL-1_AGS-1_PPS_1 (2628 2305) (frame '0'; 324 bp, 108 residues) 1 RMPFGLCNAP ATFQKCMMSI FSDMVEDTIE VFMDDFSVVG DSFERCLTNL SEVLKRCEDC 61 NLVLNWEKCH FMVKKGIVLD HRISKKGIEV DSAKVEVIER LSPPISLK 3-phase translation of AGS-1 (+strand): . . . . . . 2305 TTTTAAAGAGATCGGCGGGGAAAGTCTCTCTATTACCTCAACTTTAGCTGAATCAACCTC F - R D R R G K S L Y Y L N F S - I N L F K E I G G E S L S I T S T L A E S T S L K R S A G K V S L L P Q L - L N Q P . . . . . . 2365 TATGCCCTTTTTTGAAATGCGATGATCCAACACAATACCCTTTTTCACCATGAAATGACA Y A L F - N A M I Q H N T L F H H E M T M P F F E M R - S N T I P F F T M K - H L C P F L K C D D P T Q Y P F S P - N D . . . . . . 2425 CTTTTCCCAGTTTAGTACCAAATTGCAGTCTTCACATCTCTTAAGAACCTCAGATAAATT L F P V - Y Q I A V F T S L K N L R - I F S Q F S T K L Q S S H L L R T S D K L T F P S L V P N C S L H I S - E P Q I N . . . . . . 2485 GGTCAAGCATCGCTCAAATGAATCACCAACCACAGAAAAATCATCCATAAAAACCTCTAT G Q A S L K - I T N H R K I I H K N L Y V K H R S N E S P T T E K S S I K T S I W S S I A Q M N H Q P Q K N H P - K P L . . . . . . 2545 AGTATCCTCCACCATGTCAGAAAATATTGACATCATACATTTCTGAAAGGTTGCGGGTGC S I L H H V R K Y - H H T F L K G C G C V S S T M S E N I D I I H F - K V A G A - Y P P P C Q K I L T S Y I S E R L R V . . . 2605 ATTGCACAATCCAAACGGCATTCTTC I A Q S K R H S L H N P N G I L H C T I Q T A F F Maximal non-overlapping open reading frames (>= 64 codons): none ... finished at: Mon Jul 24 23:16:23 2006 ________________________________________________________________________________ Sequence 19: C06HBa0057J04.1-19, from 1 to 2343, both strands analyzed. ... started at: Mon Jul 24 23:16:23 2006 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 3 ... matches indexed, elapsed seconds = 3 HitsTableSize = 0 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 4 HitsTableSize = 0 No significant EST matches were found. ... finished at: Mon Jul 24 23:16:30 2006 ________________________________________________________________________________ Sequence 20: C06HBa0057J04.1-20, from 1 to 1135, both strands analyzed. ... started at: Mon Jul 24 23:16:30 2006 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 3 ... matches indexed, elapsed seconds = 4 HitsTableSize = 0 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 3 ... matches indexed, elapsed seconds = 3 HitsTableSize = 2 ******************************************************************************** EST sequence 1 -strand 691 n (File: SGN-E328093-) 1 AAGGTCATGC CAAACAAAGA AGGAAAGGAT AGCGTTACAT ACCGAATAGT CGAGCCGTAC 61 GTCCAATATT TACTGCTCAA GGTATAAATC TATAACAAGC ACCAATCGTG CCGCAATTAG 121 ATAAATGTCG CGCTTACTAC TTATGTAAGC TAGCTAATAA TCTAACGAAA ATCCGGCAGC 181 ACTTCCCCTA TTTTCTTTAC TTCATCCCAA TCCAACAACA ACCCAAATTA TCCACAACAA 241 TACCAACAAC ACATATAATC CAACAAATCA TATTAAACAG CCTCCCCCAA AACAGTTCCG 301 TTTTTAGCAA CCATAGTAAC AAAACAACGA CACACCGACC GATAATTCGT TTTATAATTC 361 GCAACACATC TACCTTATTG TATTCATCCT TTGGAGTATA TACCCATAAT CATATTAACA 421 TATGCATAAC ACCAAAACAG AATTTCCAGC AGCTTACTTT AATCAAAACA GCCCAACACA 481 AATATAACAC CGCTAGCAAT CCGATCATGG TTTCTGCTCA TCCTTAGCAC ATTCAATAAC 541 TAAAACAAAT TCCTAAGATA CTGTTCACGT CCCCCTTCTT AACATAAATG TCTAATTAAC 601 AAAGGTATTC TAAGGAATTA ACAAACCAAG CAAAGAGATT ACTAACCAAT TTCTTTGCAG 661 CTGCTGTCCA AGAGTTTAAT GGCTGGTTTT C Predicted gene structure (within gDNA segment 1 to 1135): Exon 1 1 655 ( 655 n); cDNA 38 689 ( 652 n); score: 0.860 MATCH C06HBa0057J04.1-20+ SGN-E328093- 0.860 655 0.948 C PGS_C06HBa0057J04.1-20+_SGN-E328093- (1 655) Alignment (genomic DNA sequence = upper lines): CATACCGTAT AATCGAGCCG CACGTCCAAT ATTTACTGCT CAAGCTATTA ATCTATAACA 60 ||||||| || | |||||||| ||||||||| |||||||||| |||| ||| | |||||||||| CATACCGAAT AGTCGAGCCG TACGTCCAAT ATTTACTGCT CAAGGTATAA ATCTATAACA 97 AGCAACAATC GTGCCGCAAT TAGATAAACA TCGTGCTTTA CT-TTCTTGT AAGCTAGCTA 119 |||| ||||| |||||||||| |||||||| ||| || ||| || | ||| |||||||||| AGCACCAATC GTGCCGCAAT TAGATAAATG TCGCGC-TTA CTACTTATGT AAGCTAGCTA 156 ATAATCTAAC GAAAATTCGG CAGCACCTCC CCTATTTTCC TTACTTCGTC CCAATCCGAC 179 |||||||||| |||||| ||| |||||| ||| ||||||||| ||||||| || ||||||| || ATAATCTAAC GAAAATCCGG CAGCACTTCC CCTATTTTCT TTACTTCATC CCAATCCAAC 216 AACAATCCAA ATTATCCACA ACAATACCAA TAACACATAT AACCCAAAGA ATCATATTAA 239 ||||| |||| |||||||||| |||||||||| ||||||||| || |||| | |||||||||| AACAACCCAA ATTATCCACA ACAATACCAA CAACACATAT AATCCAACAA ATCATATTAA 276 ACAGCCCCCT CCCCAAAACA GTTCC-ATCT CGGCAACTAT ATCAGCGAAG CAACGACACA 298 |||| || | |||||||||| ||||| | | ||||| || | | | || |||||||||| ACAG-CCTC- CCCCAAAACA GTTCCGTTTT TAGCAACCAT AGTAACAAAA CAACGACACA 334 CCGAGTGATA ACTCGTTTTA TAATTCGTAA CACATCTACC TTATCGTATT CATCCTTTGG 358 |||| |||| | |||||||| ||||||| || |||||||||| |||| ||||| |||||||||| CCGACCGATA ATTCGTTTTA TAATTCGCAA CACATCTACC TTATTGTATT CATCCTTTGG 394 AGTATATACT CATAATCACA TTCAACATAT ACATAATGCC AAAACATAAT TTCCTGCAGT 418 ||||||||| |||||||| | || ||||||| ||||| || |||||| ||| |||| |||| AGTATATACC CATAATCATA TT-AACATAT GCATAACACC AAAACAGAAT TTCCAGCAGC 453 TTGCTTTAAT CAAAACAGCC CAAGCACCAA CACGACACCA CTAATAATCC GATTATGGTT 478 || ||||||| |||||||||| ||| ||| || | ||||| ||| ||||| ||| |||||| TTACTTTAAT CAAAACAGCC CAA-CACAAA TATAACACCG CTAGCAATCC GATCATGGTT 512 TCCATTCATA CTTAGTACAT TCAATAACTA AAACAAACTT CCTAAGCTAC TGGTCACG-C 537 || |||| ||||| |||| |||||||||| ||||||| || |||||| ||| || ||||| | TCTGCTCATC CTTAGCACAT TCAATAACTA AAACAAA-TT CCTAAGATAC TGTTCACGTC 571 CCCCTTCTTA AAATTAATGG ATAATTAACA AGGGTATTCT AAAGAATTAA CACACCAAAC 597 |||||||||| | || |||| ||||||||| | |||||||| || ||||||| || ||||| | CCCCTTCTTA ACATAAATGT CTAATTAACA AAGGTATTCT AAGGAATTAA CAAACCAAGC 631 AAGGAGATTA CTAACCAAAT TATTTGCAGC TGCTGGCCAA GAGTTGCAGG GTTGGTTT 655 || ||||||| |||||||| | | |||||||| ||||| |||| ||||| | | | |||||| AAAGAGATTA CTAACCAATT TCTTTGCAGC TGCTGTCCAA GAGTTTAATG GCTGGTTT 689 hqPGS_C06HBa0057J04.1-20+_SGN-E328093- (1 655) ******************************************************************************** EST sequence 2 -strand 455 n (File: SGN-E298250-) 1 TAACACATAT AACCCGAAGT ATCACATTAG GCAGCCCCCT CCCCAAAACA GTTCCGTCTG 61 GGCAACTATA GCAGCGAAGC ACCGACACAC CGAGCGATAA CTCGTTTTAT AATTCGCAAC 121 ACATTTACCT AATCGTATTC ATCCGTTGGA GTATATACTC ATAATCACAT TCAGCATATA 181 CATAATGCCA AAATAGAATT TTCCTGCAGT TTGCTTTAAT CAAAACAGCC CAAGCACCAA 241 CACGACACCA CTAATAATCC GATTATGGTT TCCATTCATA CTTAGCACAT TCAATAACTA 301 AAACAAACTT CGTAAGCAAC TGGTCACGCC CCCTTCTTAA AATTAATGGA TAATTAACAA 361 GGGTGTTCTA AAGAATTAAC ACACCAAACA AGGAGATTAC TCCCCAAATT ATTTGCAGCT 421 ACTGGCCAAG AGTTGCAGGG TTGGTTTCTC CATTT Predicted gene structure (within gDNA segment 1 to 1135): Exon 1 210 663 ( 454 n); cDNA 1 455 ( 455 n); score: 0.942 MATCH C06HBa0057J04.1-20+ SGN-E298250- 0.942 454 0.998 C PGS_C06HBa0057J04.1-20+_SGN-E298250- (210 663) Alignment (genomic DNA sequence = upper lines): TAACACATAT AACCCAAAGA ATCATATTAA ACAGCCCCCT CCCCAAAACA GTTCCATCTC 269 |||||||||| ||||| ||| |||| |||| ||||||||| |||||||||| ||||| ||| TAACACATAT AACCCGAAGT ATCACATTAG GCAGCCCCCT CCCCAAAACA GTTCCGTCTG 60 GGCAACTATA TCAGCGAAGC AACGACACAC CGAGTGATAA CTCGTTTTAT AATTCGTAAC 329 |||||||||| ||||||||| | |||||||| |||| ||||| |||||||||| |||||| ||| GGCAACTATA GCAGCGAAGC ACCGACACAC CGAGCGATAA CTCGTTTTAT AATTCGCAAC 120 ACATCTACCT TATCGTATTC ATCCTTTGGA GTATATACTC ATAATCACAT TCAACATATA 389 |||| ||||| ||||||||| |||| ||||| |||||||||| |||||||||| ||| |||||| ACATTTACCT AATCGTATTC ATCCGTTGGA GTATATACTC ATAATCACAT TCAGCATATA 180 CATAATGCCA AAACATAA-T TTCCTGCAGT TTGCTTTAAT CAAAACAGCC CAAGCACCAA 448 |||||||||| ||| | || | |||||||||| |||||||||| |||||||||| |||||||||| CATAATGCCA AAATAGAATT TTCCTGCAGT TTGCTTTAAT CAAAACAGCC CAAGCACCAA 240 CACGACACCA CTAATAATCC GATTATGGTT TCCATTCATA CTTAGTACAT TCAATAACTA 508 |||||||||| |||||||||| |||||||||| |||||||||| ||||| |||| |||||||||| CACGACACCA CTAATAATCC GATTATGGTT TCCATTCATA CTTAGCACAT TCAATAACTA 300 AAACAAACTT CCTAAGCTAC TGGTCACGCC CCCTTCTTAA AATTAATGGA TAATTAACAA 568 |||||||||| | ||||| || |||||||||| |||||||||| |||||||||| |||||||||| AAACAAACTT CGTAAGCAAC TGGTCACGCC CCCTTCTTAA AATTAATGGA TAATTAACAA 360 GGGTATTCTA AAGAATTAAC ACACCAAACA AGGAGATTAC TAACCAAATT ATTTGCAGCT 628 |||| ||||| |||||||||| |||||||||| |||||||||| | ||||||| |||||||||| GGGTGTTCTA AAGAATTAAC ACACCAAACA AGGAGATTAC TCCCCAAATT ATTTGCAGCT 420 GCTGGCCAAG AGTTGCAGGG TTGGTTTCTC CATTT 663 ||||||||| |||||||||| |||||||||| ||||| ACTGGCCAAG AGTTGCAGGG TTGGTTTCTC CATTT 455 hqPGS_C06HBa0057J04.1-20+_SGN-E298250- (210 663) Total number of EST alignments reported: 2 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 1135: PGL 1 (+ strand): 1 663 AGS-1 (1 663) SCR (e 0.860) Exon 1 1 663 ( 663 n); score: 0.860 PGS (1 655) SGN-E328093- PGS (210 663) SGN-E298250- 3-phase translation of AGS-1 (+strand): . . . . . . 1 CATACCGTATAATCGAGCCGCACGTCCAATATTTACTGCTCAAGCTATTAATCTATAACA H T V - S S R T S N I Y C S S Y - S I T I P Y N R A A R P I F T A Q A I N L - Q Y R I I E P H V Q Y L L L K L L I Y N . . . . . . 61 AGCAACAATCGTGCCGCAATTAGATAAACATCGTGCTTTACTTTCTTGTAAGCTAGCTAA S N N R A A I R - T S C F T F L - A S - A T I V P Q L D K H R A L L S C K L A N K Q Q S C R N - I N I V L Y F L V S - L . . . . . . 121 TAATCTAACGAAAATTCGGCAGCACCTCCCCTATTTTCCTTACTTCGTCCCAATCCGACA - S N E N S A A P P L F S L L R P N P T N L T K I R Q H L P Y F P Y F V P I R Q I I - R K F G S T S P I F L T S S Q S D . . . . . . 181 ACAATCCAAATTATCCACAACAATACCAATAACACATATAACCCAAAGAATCATATTAAA T I Q I I H N N T N N T Y N P K N H I K Q S K L S T T I P I T H I T Q R I I L N N N P N Y P Q Q Y Q - H I - P K E S Y - . . . . . . 241 CAGCCCCCTCCCCAAAACAGTTCCATCTCGGCAACTATATCAGCGAAGCAACGACACACC Q P P P Q N S S I S A T I S A K Q R H T S P L P K T V P S R Q L Y Q R S N D T P T A P S P K Q F H L G N Y I S E A T T H . . . . . . 301 GAGTGATAACTCGTTTTATAATTCGTAACACATCTACCTTATCGTATTCATCCTTTGGAG E - - L V L - F V T H L P Y R I H P L E S D N S F Y N S - H I Y L I V F I L W S R V I T R F I I R N T S T L S Y S S F G . . . . . . 361 TATATACTCATAATCACATTCAACATATACATAATGCCAAAACATAATTTCCTGCAGTTT Y I L I I T F N I Y I M P K H N F L Q F I Y S - S H S T Y T - C Q N I I S C S L V Y T H N H I Q H I H N A K T - F P A V . . . . . . 421 GCTTTAATCAAAACAGCCCAAGCACCAACACGACACCACTAATAATCCGATTATGGTTTC A L I K T A Q A P T R H H - - S D Y G F L - S K Q P K H Q H D T T N N P I M V S C F N Q N S P S T N T T P L I I R L W F . . . . . . 481 CATTCATACTTAGTACATTCAATAACTAAAACAAACTTCCTAAGCTACTGGTCACGCCCC H S Y L V H S I T K T N F L S Y W S R P I H T - Y I Q - L K Q T S - A T G H A P P F I L S T F N N - N K L P K L L V T P . . . . . . 541 CTTCTTAAAATTAATGGATAATTAACAAGGGTATTCTAAAGAATTAACACACCAAACAAG L L K I N G - L T R V F - R I N T P N K F L K L M D N - Q G Y S K E L T H Q T R P S - N - W I I N K G I L K N - H T K Q . . . . . . 601 GAGATTACTAACCAAATTATTTGCAGCTGCTGGCCAAGAGTTGCAGGGTTGGTTTCTCCA E I T N Q I I C S C W P R V A G L V S P R L L T K L F A A A G Q E L Q G W F L H G D Y - P N Y L Q L L A K S C R V G F S . 661 TTT F I Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-20+_PGL-1_AGS-1_PPS_1 (59 328) (frame '2'; 267 bp, 89 residues) 1 QATIVPQLDK HRALLSCKLA NNLTKIRQHL PYFPYFVPIR QQSKLSTTIP ITHITQRIIL 61 NSPLPKTVPS RQLYQRSNDT PSDNSFYNS- 3-phase translation of AGS-1 (-strand): . . . . . . 663 AAATGGAGAAACCAACCCTGCAACTCTTGGCCAGCAGCTGCAAATAATTTGGTTAGTAAT K W R N Q P C N S W P A A A N N L V S N N G E T N P A T L G Q Q L Q I I W L V I M E K P T L Q L L A S S C K - F G - - . . . . . . 603 CTCCTTGTTTGGTGTGTTAATTCTTTAGAATACCCTTGTTAATTATCCATTAATTTTAAG L L V W C V N S L E Y P C - L S I N F K S L F G V L I L - N T L V N Y P L I L R S P C L V C - F F R I P L L I I H - F - . . . . . . 543 AAGGGGGCGTGACCAGTAGCTTAGGAAGTTTGTTTTAGTTATTGAATGTACTAAGTATGA K G A - P V A - E V C F S Y - M Y - V - R G R D Q - L R K F V L V I E C T K Y E E G G V T S S L G S L F - L L N V L S M . . . . . . 483 ATGGAAACCATAATCGGATTATTAGTGGTGTCGTGTTGGTGCTTGGGCTGTTTTGATTAA M E T I I G L L V V S C W C L G C F D - W K P - S D Y - W C R V G A W A V L I K N G N H N R I I S G V V L V L G L F - L . . . . . . 423 AGCAAACTGCAGGAAATTATGTTTTGGCATTATGTATATGTTGAATGTGATTATGAGTAT S K L Q E I M F W H Y V Y V E C D Y E Y A N C R K L C F G I M Y M L N V I M S I K Q T A G N Y V L A L C I C - M - L - V . . . . . . 363 ATACTCCAAAGGATGAATACGATAAGGTAGATGTGTTACGAATTATAAAACGAGTTATCA I L Q R M N T I R - M C Y E L - N E L S Y S K G - I R - G R C V T N Y K T S Y H Y T P K D E Y D K V D V L R I I K R V I . . . . . . 303 CTCGGTGTGTCGTTGCTTCGCTGATATAGTTGCCGAGATGGAACTGTTTTGGGGAGGGGG L G V S L L R - Y S C R D G T V L G R G S V C R C F A D I V A E M E L F W G G G T R C V V A S L I - L P R W N C F G E G . . . . . . 243 CTGTTTAATATGATTCTTTGGGTTATATGTGTTATTGGTATTGTTGTGGATAATTTGGAT L F N M I L W V I C V I G I V V D N L D C L I - F F G L Y V L L V L L W I I W I A V - Y D S L G Y M C Y W Y C C G - F G . . . . . . 183 TGTTGTCGGATTGGGACGAAGTAAGGAAAATAGGGGAGGTGCTGCCGAATTTTCGTTAGA C C R I G T K - G K - G R C C R I F V R V V G L G R S K E N R G G A A E F S L D L L S D W D E V R K I G E V L P N F R - . . . . . . 123 TTATTAGCTAGCTTACAAGAAAGTAAAGCACGATGTTTATCTAATTGCGGCACGATTGTT L L A S L Q E S K A R C L S N C G T I V Y - L A Y K K V K H D V Y L I A A R L L I I S - L T R K - S T M F I - L R H D C . . . . . . 63 GCTTGTTATAGATTAATAGCTTGAGCAGTAAATATTGGACGTGCGGCTCGATTATACGGT A C Y R L I A - A V N I G R A A R L Y G L V I D - - L E Q - I L D V R L D Y T V C L L - I N S L S S K Y W T C G S I I R . 3 ATG M Y Maximal non-overlapping open reading frames (>= 64 codons): none ... finished at: Mon Jul 24 23:16:38 2006 ________________________________________________________________________________ Sequence 21: C06HBa0057J04.1-21, from 1 to 2563, both strands analyzed. ... started at: Mon Jul 24 23:16:38 2006 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 3 ... matches indexed, elapsed seconds = 3 HitsTableSize = 0 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 4 HitsTableSize = 0 No significant EST matches were found. ... finished at: Mon Jul 24 23:16:45 2006 ________________________________________________________________________________ Sequence 22: C06HBa0057J04.1-22, from 1 to 3711, both strands analyzed. ... started at: Mon Jul 24 23:16:45 2006 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 3 ... matches indexed, elapsed seconds = 4 HitsTableSize = 5 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 4 HitsTableSize = 12 ******************************************************************************** EST sequence 4 -strand 586 n (File: SGN-E543103-) 1 GGCAGCCATG GAAATGGAGA AAATCAACCA TGCAACTCTT GACCAGCAGC TGCAAAGAAT 61 TTGGTTTGAT TAATAGCTTG AGCAGTAAAT ATTGGACGTG CGGCTCGACT ATACGGTGTA 121 GACGCTCAGT TCGGTGATCC TCCCGCCTAG GATATCTACT TTGCTGAGTG GGAGAGCTCC 181 ACTGTTTCGT AGCCCAGTCA TTTTGGTACA TAACTTTTTG TGTAGTCTTT TGCTTGTCTA 241 TGGGTATGGT GGGGCCCTGT CCCGTCGAGT TTCACTACTA TACTCTTAGA GGTCCATAGA 301 CATCGCGTGG GTTGTATATA TATGTTTTGG ATAATGGTCT GGACATGGTT TGTTTGGGAT 361 GTCCACTTGT ACAAGGGCAG CCTTGTCAGC TGCGTACATC TTTGTGTATT GTGTAGTGGC 421 AGCCTTGTCG GCTGCGTATG CTATTATGCT TTGGATAGTG GCGGCCTTGT CGGCTCGCGT 481 ATGTTGTTAC GGTTGAATGG TTATGACTCC TTATGAGACA GATCCACTTT ATATATATAT 541 ATATGGCGTT GGGGTTGGCG TGATTGATAA AAAAAAGGGG GGCCCG Predicted gene structure (within gDNA segment 3711 to 106): Exon 1 3119 3051 ( 69 n); cDNA 1 68 ( 68 n); score: 0.862 Intron 1 3050 2498 ( 553 n); Pd: 0.900 (s: 0.86), Pa: 0.875 (s: 0.98) Exon 2 2497 2451 ( 47 n); cDNA 69 115 ( 47 n); score: 0.979 Intron 2 2450 1728 ( 723 n); Pd: 0.991 (s: 0.98), Pa: 0.000 (s: 0.96) Exon 3 1727 1395 ( 333 n); cDNA 116 448 ( 333 n); score: 0.902 Intron 3 1394 1277 ( 118 n); Pd: 0.305 (s: 0.82), Pa: 0.973 (s: 0) Exon 4 1276 1269 ( 8 n); cDNA 449 456 ( 8 n); score: 0.750 MATCH C06HBa0057J04.1-22- SGN-E543103- 0.896 457 0.780 C PGS_C06HBa0057J04.1-22-_SGN-E543103- (3119 3051,2497 2451,1727 1395,1276 1269) Alignment (genomic DNA sequence = upper lines): GGCAGCCATG GAAATGGAG- AAACCAACCC TGCAACTCTT GGCCAGCAGC TGCAAATAAT 3061 |||||||||| ||||||||| ||| ||||| |||||||||| | |||||||| |||||| ||| GGCAGCCATG GAAATGGAGA AAATCAACCA TGCAACTCTT GACCAGCAGC TGCAAAGAAT 60 TTGGTTAGTA ATCTCCTTGT TTGGTGTGTT AATTCTTTAG AATACCCTTG TTAATTATCC 3001 |||||| | TTGGTT--TG .......... .......... .......... .......... .......... 68 ATTAATTTTA AGAAGGGGGC GTGACCAGTA GCTTAGGAAG TTTGTTTTAG TTATTGAATG 2941 .......... .......... .......... .......... .......... .......... 68 TACTAAGTAT GAATGGAAAC CATAATCGGA TTATTAGTGG TGTCGTGTTG GTGCTTGGGC 2881 .......... .......... .......... .......... .......... .......... 68 TGTTTTGATT AAAGCAAACT GCAGGAAAAT TCTTTGTTGG CATTATGTAT ATGTTGAATG 2821 .......... .......... .......... .......... .......... .......... 68 TGATTATGAG TATATACTCC AAAGGATGAA TACGATAAGG TAGATGTGTT ACGAATTATA 2761 .......... .......... .......... .......... .......... .......... 68 AAACGAGTTA TCACTCGGTG TGTCGTTGCT TCGCTGATAT AGTTGCCGAG ATGGAACTGT 2701 .......... .......... .......... .......... .......... .......... 68 TTTGGGGAGG GGGCTGTTTA ATATGATTCT TTGGGTTATA TGTGTTATTG GTATTGTTGT 2641 .......... .......... .......... .......... .......... .......... 68 GGATAATTTG GATTGTTGTG GATTGGGACG AAGTAAGGAA AATAGGGGAG GTGCTGCCGA 2581 .......... .......... .......... .......... .......... .......... 68 ATTTTCGTTA GATTATTAGC TAGCTTACAA GAAAGTAAAG CACGATATTT ATCTAATTGC 2521 .......... .......... .......... .......... .......... .......... 68 GGCACGATTG TTGCTTGTTA TAGATTAATA GCTTGAGCAG TAAATATTGG ACGTGCGGCT 2461 ||||||| |||||||||| |||||||||| |||||||||| .......... .......... ...ATTAATA GCTTGAGCAG TAAATATTGG ACGTGCGGCT 105 CGATTATACG GTATGTAACG CTGTCCCTTC TTTCTTTGCT TGGCATGACT TTTAAAAATA 2401 ||| |||||| CGACTATACG .......... .......... .......... .......... .......... 115 AGCGAATAAC GGACAGATTT GATACTTACC TCTAAAGCGT CTAGGTGATG TATATTCTTG 2341 .......... .......... .......... .......... .......... .......... 115 CTTCCACAAT TATTCCTCTA TATATCGGTT ATGTCTAAGG CTATGATGAT CTCTAATATC 2281 .......... .......... .......... .......... .......... .......... 115 TATGGTAATG CTTCTTAGAG TCATTGAAAT TTTACGTTTT CATATCGTAT TAAAGGTTCA 2221 .......... .......... .......... .......... .......... .......... 115 TAATCTTGAT AAAACATTAA TCTTTGGTAA TACTCCTTGC TGGTTCACGT TGATTGTTCT 2161 .......... .......... .......... .......... .......... .......... 115 ATTGAGTTAT AAGAAATGAT TTTAATTGCA TATGGTTGCT CATAATATTC TGCTCGTGCA 2101 .......... .......... .......... .......... .......... .......... 115 TAGAGTTATT TATCATTTCA CCGAGTCCCG GGCCGGGTAA TGTTCGTGCG GAGTTTCTTG 2041 .......... .......... .......... .......... .......... .......... 115 CATATGTCAC CGAGTTCCTC ACTAGAGGGC CGGGTATGTA TATTATATAT ATGATTGGTG 1981 .......... .......... .......... .......... .......... .......... 115 ATGAGGATGG TTATGATGAT GATGATGACG GAGATGACGT GATGATTATT TTGCCGAGCC 1921 .......... .......... .......... .......... .......... .......... 115 CCTTACTAGG GAAGCTGGGC ACCTTAAATG TTAAATATAT GCATGATTTT CATTTAAAAA 1861 .......... .......... .......... .......... .......... .......... 115 GTATATGTGT AGCGATATTT TGTTTCGAGT TGCCACATTG GTATCCTGTC ATCTTTACCT 1801 .......... .......... .......... .......... .......... .......... 115 TATGCTTTAC ATACTCAGTA CATTGTTCGT ACTGACCCCC CTTTCCTCGG GGGGCTGCGT 1741 .......... .......... .......... .......... .......... .......... 115 TTCATGCCCG CAGGTGTAGA CGCGCAGTTC GGTGATCCTC CCGCCTAGGA TATCTACTCT 1681 ||||||| ||| |||||| |||||||||| |||||||||| |||||||| | .......... ...GTGTAGA CGCTCAGTTC GGTGATCCTC CCGCCTAGGA TATCTACTTT 162 GCTGATTGGG AGAGCTCCAC TGTTCCGGAG CCCATTCGTT TTGGTACATA AC-TTTTGTG 1622 ||||| |||| |||||||||| |||| || || |||| || || |||||||||| || ||||||| GCTGAGTGGG AGAGCTCCAC TGTTTCGTAG CCCAGTCATT TTGGTACATA ACTTTTTGTG 222 TAGTCTTTTG CTCGTCTATG GGTATGGCGG GGCCCTGTCC CGTCGAGTTT CACTAATGTA 1562 |||||||||| || ||||||| ||||||| || |||||||||| |||||||||| ||||| | || TAGTCTTTTG CTTGTCTATG GGTATGGTGG GGCCCTGTCC CGTCGAGTTT CACTACTATA 282 CCCTTAGAGG TCTGTGGACA TTATGTGGGT TGTATATATA TGTTTTGGAT AATGGTCTGG 1502 | |||||||| || | |||| | |||||| |||||||||| |||||||||| |||||||||| CTCTTAGAGG TCCATAGACA TCGCGTGGGT TGTATATATA TGTTTTGGAT AATGGTCTGG 342 ACATGGTTTG TTTGGGATGT CCACTTGTAC AGGGGCAGCC TTGTCGGCTG TGTACATCAT 1442 |||||||||| |||||||||| |||||||||| | |||||||| ||||| |||| ||||||| | ACATGGTTTG TTTGGGATGT CCACTTGTAC AAGGGCAGCC TTGTCAGCTG CGTACATCTT 402 TATGCTTTGA ATAGTGGCGG CCTTGTCGGC TCGCGTATGC TGTTATGGTT GAATGGTTAT 1382 | || ||| ||||||| | |||||||||| | |||||||| | ||||| TGTGTATTGT GTAGTGGCAG CCTTGTCGGC T-GCGTATGC TATTATG... .......... 448 GACTCCTTAT GAGACATGTC CTCTTATATA TATATATGAC GTTGGGGTTG GCTTGATTTG 1322 .......... .......... .......... .......... .......... .......... 448 ATTAAATTCC ATATTGTCTT AGTTTCAGTT GGTCATACTT AGCAGGTTTG TAT 1269 |||| || .......... .......... .......... .......... .....CTTTG GAT 456 hqPGS_C06HBa0057J04.1-22-_SGN-E543103- (3119 3051,2497 2451,1727 1395,1276 1269) ******************************************************************************** EST sequence 14 +strand 577 n (File: SGN-E543104+) 1 GGCAGCCATG GAAATGGAGA AAATCAACCA TGCAACTCTT GACCAGCAGC TGCAAAGAAT 61 TTGGTTTGAT TAATAGCTTG AGCAGTAAAT ATTGGACGTG CGGCTCGACT ATACGGTGTA 121 GACGCTCAGT TCGGTGATCC TCCCGCCTAG GATATCTACT TTGCTGAGTG GGAGAGCTCC 181 ACTGTTTCGT AGCCCAGTCA TTTTGGTACA TAACTTTTTG TGTAGTCTTT TGCTTGTCTA 241 TGGGTATGGT GGGGCCCTGT CCCGTCGAGT TTCACTACTA TACTCTTAGA GGTCCATAGA 301 CATCGCGTGG GTTGTATATA TATGTTTTGG ATAATGGTCT GGACATGGTT TGTTTGGGAT 361 GTCCACTTGT ACAAGGGCAG CCTTGTCAGC TGCGTACATC TTTGTGTATT GTGTAGTGGC 421 AGCCTTGTCG GCTGCGTATG CTATTATGCT TTGGATAGTG GCGGCCTTGT CGGCTCGCGT 481 ATGTTGTTAC GGTTGAATGG TTATGACTCC TTATGAGACA GATCCACTTT ATATATATAT 541 ATATGGCGAT GGGGTTGGCT TGATTTGATT AAAAAAA Predicted gene structure (within gDNA segment 3711 to 186): Exon 1 3119 3051 ( 69 n); cDNA 1 68 ( 68 n); score: 0.862 Intron 1 3050 2498 ( 553 n); Pd: 0.900 (s: 0.86), Pa: 0.875 (s: 0.98) Exon 2 2497 2451 ( 47 n); cDNA 69 115 ( 47 n); score: 0.979 Intron 2 2450 1728 ( 723 n); Pd: 0.991 (s: 0.98), Pa: 0.000 (s: 0.96) Exon 3 1727 1395 ( 333 n); cDNA 116 448 ( 333 n); score: 0.902 Intron 3 1394 1277 ( 118 n); Pd: 0.305 (s: 0.82), Pa: 0.973 (s: 0) Exon 4 1276 1269 ( 8 n); cDNA 449 456 ( 8 n); score: 0.750 MATCH C06HBa0057J04.1-22- SGN-E543104+ 0.896 457 0.792 C PGS_C06HBa0057J04.1-22-_SGN-E543104+ (3119 3051,2497 2451,1727 1395,1276 1269) Alignment (genomic DNA sequence = upper lines): GGCAGCCATG GAAATGGAG- AAACCAACCC TGCAACTCTT GGCCAGCAGC TGCAAATAAT 3061 |||||||||| ||||||||| ||| ||||| |||||||||| | |||||||| |||||| ||| GGCAGCCATG GAAATGGAGA AAATCAACCA TGCAACTCTT GACCAGCAGC TGCAAAGAAT 60 TTGGTTAGTA ATCTCCTTGT TTGGTGTGTT AATTCTTTAG AATACCCTTG TTAATTATCC 3001 |||||| | TTGGTT--TG .......... .......... .......... .......... .......... 68 ATTAATTTTA AGAAGGGGGC GTGACCAGTA GCTTAGGAAG TTTGTTTTAG TTATTGAATG 2941 .......... .......... .......... .......... .......... .......... 68 TACTAAGTAT GAATGGAAAC CATAATCGGA TTATTAGTGG TGTCGTGTTG GTGCTTGGGC 2881 .......... .......... .......... .......... .......... .......... 68 TGTTTTGATT AAAGCAAACT GCAGGAAAAT TCTTTGTTGG CATTATGTAT ATGTTGAATG 2821 .......... .......... .......... .......... .......... .......... 68 TGATTATGAG TATATACTCC AAAGGATGAA TACGATAAGG TAGATGTGTT ACGAATTATA 2761 .......... .......... .......... .......... .......... .......... 68 AAACGAGTTA TCACTCGGTG TGTCGTTGCT TCGCTGATAT AGTTGCCGAG ATGGAACTGT 2701 .......... .......... .......... .......... .......... .......... 68 TTTGGGGAGG GGGCTGTTTA ATATGATTCT TTGGGTTATA TGTGTTATTG GTATTGTTGT 2641 .......... .......... .......... .......... .......... .......... 68 GGATAATTTG GATTGTTGTG GATTGGGACG AAGTAAGGAA AATAGGGGAG GTGCTGCCGA 2581 .......... .......... .......... .......... .......... .......... 68 ATTTTCGTTA GATTATTAGC TAGCTTACAA GAAAGTAAAG CACGATATTT ATCTAATTGC 2521 .......... .......... .......... .......... .......... .......... 68 GGCACGATTG TTGCTTGTTA TAGATTAATA GCTTGAGCAG TAAATATTGG ACGTGCGGCT 2461 ||||||| |||||||||| |||||||||| |||||||||| .......... .......... ...ATTAATA GCTTGAGCAG TAAATATTGG ACGTGCGGCT 105 CGATTATACG GTATGTAACG CTGTCCCTTC TTTCTTTGCT TGGCATGACT TTTAAAAATA 2401 ||| |||||| CGACTATACG .......... .......... .......... .......... .......... 115 AGCGAATAAC GGACAGATTT GATACTTACC TCTAAAGCGT CTAGGTGATG TATATTCTTG 2341 .......... .......... .......... .......... .......... .......... 115 CTTCCACAAT TATTCCTCTA TATATCGGTT ATGTCTAAGG CTATGATGAT CTCTAATATC 2281 .......... .......... .......... .......... .......... .......... 115 TATGGTAATG CTTCTTAGAG TCATTGAAAT TTTACGTTTT CATATCGTAT TAAAGGTTCA 2221 .......... .......... .......... .......... .......... .......... 115 TAATCTTGAT AAAACATTAA TCTTTGGTAA TACTCCTTGC TGGTTCACGT TGATTGTTCT 2161 .......... .......... .......... .......... .......... .......... 115 ATTGAGTTAT AAGAAATGAT TTTAATTGCA TATGGTTGCT CATAATATTC TGCTCGTGCA 2101 .......... .......... .......... .......... .......... .......... 115 TAGAGTTATT TATCATTTCA CCGAGTCCCG GGCCGGGTAA TGTTCGTGCG GAGTTTCTTG 2041 .......... .......... .......... .......... .......... .......... 115 CATATGTCAC CGAGTTCCTC ACTAGAGGGC CGGGTATGTA TATTATATAT ATGATTGGTG 1981 .......... .......... .......... .......... .......... .......... 115 ATGAGGATGG TTATGATGAT GATGATGACG GAGATGACGT GATGATTATT TTGCCGAGCC 1921 .......... .......... .......... .......... .......... .......... 115 CCTTACTAGG GAAGCTGGGC ACCTTAAATG TTAAATATAT GCATGATTTT CATTTAAAAA 1861 .......... .......... .......... .......... .......... .......... 115 GTATATGTGT AGCGATATTT TGTTTCGAGT TGCCACATTG GTATCCTGTC ATCTTTACCT 1801 .......... .......... .......... .......... .......... .......... 115 TATGCTTTAC ATACTCAGTA CATTGTTCGT ACTGACCCCC CTTTCCTCGG GGGGCTGCGT 1741 .......... .......... .......... .......... .......... .......... 115 TTCATGCCCG CAGGTGTAGA CGCGCAGTTC GGTGATCCTC CCGCCTAGGA TATCTACTCT 1681 ||||||| ||| |||||| |||||||||| |||||||||| |||||||| | .......... ...GTGTAGA CGCTCAGTTC GGTGATCCTC CCGCCTAGGA TATCTACTTT 162 GCTGATTGGG AGAGCTCCAC TGTTCCGGAG CCCATTCGTT TTGGTACATA AC-TTTTGTG 1622 ||||| |||| |||||||||| |||| || || |||| || || |||||||||| || ||||||| GCTGAGTGGG AGAGCTCCAC TGTTTCGTAG CCCAGTCATT TTGGTACATA ACTTTTTGTG 222 TAGTCTTTTG CTCGTCTATG GGTATGGCGG GGCCCTGTCC CGTCGAGTTT CACTAATGTA 1562 |||||||||| || ||||||| ||||||| || |||||||||| |||||||||| ||||| | || TAGTCTTTTG CTTGTCTATG GGTATGGTGG GGCCCTGTCC CGTCGAGTTT CACTACTATA 282 CCCTTAGAGG TCTGTGGACA TTATGTGGGT TGTATATATA TGTTTTGGAT AATGGTCTGG 1502 | |||||||| || | |||| | |||||| |||||||||| |||||||||| |||||||||| CTCTTAGAGG TCCATAGACA TCGCGTGGGT TGTATATATA TGTTTTGGAT AATGGTCTGG 342 ACATGGTTTG TTTGGGATGT CCACTTGTAC AGGGGCAGCC TTGTCGGCTG TGTACATCAT 1442 |||||||||| |||||||||| |||||||||| | |||||||| ||||| |||| ||||||| | ACATGGTTTG TTTGGGATGT CCACTTGTAC AAGGGCAGCC TTGTCAGCTG CGTACATCTT 402 TATGCTTTGA ATAGTGGCGG CCTTGTCGGC TCGCGTATGC TGTTATGGTT GAATGGTTAT 1382 | || ||| ||||||| | |||||||||| | |||||||| | ||||| TGTGTATTGT GTAGTGGCAG CCTTGTCGGC T-GCGTATGC TATTATG... .......... 448 GACTCCTTAT GAGACATGTC CTCTTATATA TATATATGAC GTTGGGGTTG GCTTGATTTG 1322 .......... .......... .......... .......... .......... .......... 448 ATTAAATTCC ATATTGTCTT AGTTTCAGTT GGTCATACTT AGCAGGTTTG TAT 1269 |||| || .......... .......... .......... .......... .....CTTTG GAT 456 hqPGS_C06HBa0057J04.1-22-_SGN-E543104+ (3119 3051,2497 2451,1727 1395,1276 1269) ******************************************************************************** EST sequence 3 -strand 542 n (File: SGN-E374134-) 1 CAGCCATGGA AATGGAGAAA ACCAGCCATT AAACTCTTGG ACAGCAGCTG CAAAGAAATT 61 GATTTATACC TTGAGCAGTA AATATTGGAC GTACGGCTCG ACTATTCGGT GTAGACGCTC 121 AGTTCGGTGA TCCTCCCGCC TAGGATATCT ACTCTGCTTT TTGGGAGAGC TCCACTGTTC 181 CGGAGCCCAG TCGTTTTGGT ACATAACTTC TTATGTAGTC TTTTGCTTGT CTATGGGTAT 241 GGCGGGGCCC TGTCCCGTCA AGTTTCACTA CTATACTCTT AGAGGTCTGT AGACATCGTG 301 TGGGTTGTAT AATTATGTTT TGGATAATGG TCTGGACATG GTTTGTTTGG GATGTCCATT 361 TGTACAAGTG CAGCCTTGTC GGTTGTGAAC ATCATTGTGT ATTGTGTAGT GGCAGCCTCG 421 TCGGCTGCGT ATGCTATTAT GTTTTGGATA GTGGCGGCCT TGTCGGCTCG CATATGTTGT 481 TACGATTTAA TGGTTATGAC TCTTTATGAG ATAGATCCAC TTTATATATA AAAAAAAAAA 541 AA Predicted gene structure (within gDNA segment 3516 to 1): Exon 1 3117 3058 ( 60 n); cDNA 1 61 ( 61 n); score: 0.842 Intron 1 3057 2498 ( 560 n); Pd: 0.997 (s: 0.81), Pa: 0.875 (s: 0.89) Exon 2 2497 2451 ( 47 n); cDNA 62 108 ( 47 n); score: 0.894 Intron 2 2450 1728 ( 723 n); Pd: 0.991 (s: 0.89), Pa: 0.000 (s: 0.98) Exon 3 1727 1395 ( 333 n); cDNA 109 441 ( 333 n); score: 0.905 Intron 3 1394 1277 ( 118 n); Pd: 0.305 (s: 0.82), Pa: 0.973 (s: 0) Exon 4 1276 1269 ( 8 n); cDNA 442 449 ( 8 n); score: 0.750 PPA cDNA 528 542 MATCH C06HBa0057J04.1-22- SGN-E374134- 0.896 448 0.827 C PGS_C06HBa0057J04.1-22-_SGN-E374134- (3117 3058,2497 2451,1727 1395,1276 1269) Alignment (genomic DNA sequence = upper lines): CAGCCATGGA AATGGAG-AA ACCAACCCTG CAACTCTTGG CCAGCAGCTG CAAATAATTT 3059 |||||||||| ||||||| || |||| || | ||||||||| ||||||||| |||| || || CAGCCATGGA AATGGAGAAA ACCAGCCATT AAACTCTTGG ACAGCAGCTG CAAAGAAATT 60 GGTTAGTAAT CTCCTTGTTT GGTGTGTTAA TTCTTTAGAA TACCCTTGTT AATTATCCAT 2999 | G......... .......... .......... .......... .......... .......... 61 TAATTTTAAG AAGGGGGCGT GACCAGTAGC TTAGGAAGTT TGTTTTAGTT ATTGAATGTA 2939 .......... .......... .......... .......... .......... .......... 61 CTAAGTATGA ATGGAAACCA TAATCGGATT ATTAGTGGTG TCGTGTTGGT GCTTGGGCTG 2879 .......... .......... .......... .......... .......... .......... 61 TTTTGATTAA AGCAAACTGC AGGAAAATTC TTTGTTGGCA TTATGTATAT GTTGAATGTG 2819 .......... .......... .......... .......... .......... .......... 61 ATTATGAGTA TATACTCCAA AGGATGAATA CGATAAGGTA GATGTGTTAC GAATTATAAA 2759 .......... .......... .......... .......... .......... .......... 61 ACGAGTTATC ACTCGGTGTG TCGTTGCTTC GCTGATATAG TTGCCGAGAT GGAACTGTTT 2699 .......... .......... .......... .......... .......... .......... 61 TGGGGAGGGG GCTGTTTAAT ATGATTCTTT GGGTTATATG TGTTATTGGT ATTGTTGTGG 2639 .......... .......... .......... .......... .......... .......... 61 ATAATTTGGA TTGTTGTGGA TTGGGACGAA GTAAGGAAAA TAGGGGAGGT GCTGCCGAAT 2579 .......... .......... .......... .......... .......... .......... 61 TTTCGTTAGA TTATTAGCTA GCTTACAAGA AAGTAAAGCA CGATATTTAT CTAATTGCGG 2519 .......... .......... .......... .......... .......... .......... 61 CACGATTGTT GCTTGTTATA GATTAATAGC TTGAGCAGTA AATATTGGAC GTGCGGCTCG 2459 ||| ||| | |||||||||| |||||||||| || ||||||| .......... .......... .ATTTATACC TTGAGCAGTA AATATTGGAC GTACGGCTCG 100 ATTATACGGT ATGTAACGCT GTCCCTTCTT TCTTTGCTTG GCATGACTTT TAAAAATAAG 2399 | ||| || ACTATTCG.. .......... .......... .......... .......... .......... 108 CGAATAACGG ACAGATTTGA TACTTACCTC TAAAGCGTCT AGGTGATGTA TATTCTTGCT 2339 .......... .......... .......... .......... .......... .......... 108 TCCACAATTA TTCCTCTATA TATCGGTTAT GTCTAAGGCT ATGATGATCT CTAATATCTA 2279 .......... .......... .......... .......... .......... .......... 108 TGGTAATGCT TCTTAGAGTC ATTGAAATTT TACGTTTTCA TATCGTATTA AAGGTTCATA 2219 .......... .......... .......... .......... .......... .......... 108 ATCTTGATAA AACATTAATC TTTGGTAATA CTCCTTGCTG GTTCACGTTG ATTGTTCTAT 2159 .......... .......... .......... .......... .......... .......... 108 TGAGTTATAA GAAATGATTT TAATTGCATA TGGTTGCTCA TAATATTCTG CTCGTGCATA 2099 .......... .......... .......... .......... .......... .......... 108 GAGTTATTTA TCATTTCACC GAGTCCCGGG CCGGGTAATG TTCGTGCGGA GTTTCTTGCA 2039 .......... .......... .......... .......... .......... .......... 108 TATGTCACCG AGTTCCTCAC TAGAGGGCCG GGTATGTATA TTATATATAT GATTGGTGAT 1979 .......... .......... .......... .......... .......... .......... 108 GAGGATGGTT ATGATGATGA TGATGACGGA GATGACGTGA TGATTATTTT GCCGAGCCCC 1919 .......... .......... .......... .......... .......... .......... 108 TTACTAGGGA AGCTGGGCAC CTTAAATGTT AAATATATGC ATGATTTTCA TTTAAAAAGT 1859 .......... .......... .......... .......... .......... .......... 108 ATATGTGTAG CGATATTTTG TTTCGAGTTG CCACATTGGT ATCCTGTCAT CTTTACCTTA 1799 .......... .......... .......... .......... .......... .......... 108 TGCTTTACAT ACTCAGTACA TTGTTCGTAC TGACCCCCCT TTCCTCGGGG GGCTGCGTTT 1739 .......... .......... .......... .......... .......... .......... 108 CATGCCCGCA GGTGTAGACG CGCAGTTCGG TGATCCTCCC GCCTAGGATA TCTACTCTGC 1679 ||||||||| | |||||||| |||||||||| |||||||||| |||||||||| .......... .GTGTAGACG CTCAGTTCGG TGATCCTCCC GCCTAGGATA TCTACTCTGC 157 TGATTGGGAG AGCTCCACTG TTCCGGAGCC CATTCGTTTT GGTACATAAC TT-TTGTGTA 1620 | ||||||| |||||||||| |||||||||| || ||||||| |||||||||| || || |||| TTTTTGGGAG AGCTCCACTG TTCCGGAGCC CAGTCGTTTT GGTACATAAC TTCTTATGTA 217 GTCTTTTGCT CGTCTATGGG TATGGCGGGG CCCTGTCCCG TCGAGTTTCA CTAATGTACC 1560 |||||||||| ||||||||| |||||||||| |||||||||| || ||||||| ||| | ||| GTCTTTTGCT TGTCTATGGG TATGGCGGGG CCCTGTCCCG TCAAGTTTCA CTACTATACT 277 CTTAGAGGTC TGTGGACATT ATGTGGGTTG TATATATATG TTTTGGATAA TGGTCTGGAC 1500 |||||||||| ||| ||||| ||||||||| |||| |||| |||||||||| |||||||||| CTTAGAGGTC TGTAGACATC GTGTGGGTTG TATAATTATG TTTTGGATAA TGGTCTGGAC 337 ATGGTTTGTT TGGGATGTCC ACTTGTACAG GGGCAGCCTT GTCGGCTGTG TACATCATTA 1440 |||||||||| |||||||||| | ||||||| | |||||||| ||||| |||| |||||||| ATGGTTTGTT TGGGATGTCC ATTTGTACAA GTGCAGCCTT GTCGGTTGTG AACATCATTG 397 TGCTTTGAAT AGTGGCGGCC TTGTCGGCTC GCGTATGCTG TTATGGTTGA ATGGTTATGA 1380 || ||| | |||||| ||| | ||||||| ||||||||| ||||| TGTATTGTGT AGTGGCAGCC TCGTCGGCT- GCGTATGCTA TTATG..... .......... 441 CTCCTTATGA GACATGTCCT CTTATATATA TATATGACGT TGGGGTTGGC TTGATTTGAT 1320 .......... .......... .......... .......... .......... .......... 441 TAAATTCCAT ATTGTCTTAG TTTCAGTTGG TCATACTTAG CAGGTTTGTA T 1269 |||| | | .......... .......... .......... .......... ...TTTTGGA T 449 hqPGS_C06HBa0057J04.1-22-_SGN-E374134- (3117 3058,2497 2451,1727 1395,1276 1269) ******************************************************************************** EST sequence 9 +strand 547 n (File: SGN-E305738+) 1 AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 61 ATTTATACCT TGAGCAGTAA ATATTGGACG TACGGCTCGA CTATTCGGTG TAGACGCTCA 121 GTTCGGTGAT CCTCCCGCCT AGGATATCTA CTCTGCTTTT TGGGAGAGCT CCACTGTTCC 181 GGAGCCCAGT CGTTTTGGTA CATAACTTCT TATGTAGTCT TTTGCTTGTC TATGGGTATG 241 GCGGGGCCCT GTCCCGTCAA GTTTCACTAC TATACTCTTA GAGGTCTGTA GACATCGTGT 301 GGGTTGTATA ATTATGTTTT GGATAATGGT CTGGACATGG TTTGTTTGGG ATGTCCATTT 361 GTACAAGTGC AGCCTTGTCG GTTGTGAACA TCATTGTGTA TTGTGTAGTG GCAGCCTCGT 421 CGGCTGCGTA TGCTATTATG TTTTGGATAG TGGCGGCCTT GTCGGCTCGC ATATGTTGTT 481 ACGATTTAAT GGTTATGACT CTTTATGAGA TAGATCCACT TTATATATGA AATGAATGGA 541 CTAACTA Predicted gene structure (within gDNA segment 3496 to 1): Exon 1 3116 3058 ( 59 n); cDNA 1 60 ( 60 n); score: 0.839 Intron 1 3057 2498 ( 560 n); Pd: 0.997 (s: 0.81), Pa: 0.875 (s: 0.89) Exon 2 2497 2451 ( 47 n); cDNA 61 107 ( 47 n); score: 0.894 Intron 2 2450 1728 ( 723 n); Pd: 0.991 (s: 0.89), Pa: 0.000 (s: 0.98) Exon 3 1727 1395 ( 333 n); cDNA 108 440 ( 333 n); score: 0.905 Intron 3 1394 1277 ( 118 n); Pd: 0.305 (s: 0.82), Pa: 0.973 (s: 0) Exon 4 1276 1269 ( 8 n); cDNA 441 448 ( 8 n); score: 0.750 MATCH C06HBa0057J04.1-22- SGN-E305738+ 0.895 447 0.817 C PGS_C06HBa0057J04.1-22-_SGN-E305738+ (3116 3058,2497 2451,1727 1395,1276 1269) Alignment (genomic DNA sequence = upper lines): AGCCATGGAA ATGGAGA-AA CCAACCCTGC AACTCTTGGC CAGCAGCTGC AAATAATTTG 3058 |||||||||| ||||||| || ||| || | ||||||||| |||||||||| ||| || ||| AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 60 GTTAGTAATC TCCTTGTTTG GTGTGTTAAT TCTTTAGAAT ACCCTTGTTA ATTATCCATT 2998 .......... .......... .......... .......... .......... .......... 60 AATTTTAAGA AGGGGGCGTG ACCAGTAGCT TAGGAAGTTT GTTTTAGTTA TTGAATGTAC 2938 .......... .......... .......... .......... .......... .......... 60 TAAGTATGAA TGGAAACCAT AATCGGATTA TTAGTGGTGT CGTGTTGGTG CTTGGGCTGT 2878 .......... .......... .......... .......... .......... .......... 60 TTTGATTAAA GCAAACTGCA GGAAAATTCT TTGTTGGCAT TATGTATATG TTGAATGTGA 2818 .......... .......... .......... .......... .......... .......... 60 TTATGAGTAT ATACTCCAAA GGATGAATAC GATAAGGTAG ATGTGTTACG AATTATAAAA 2758 .......... .......... .......... .......... .......... .......... 60 CGAGTTATCA CTCGGTGTGT CGTTGCTTCG CTGATATAGT TGCCGAGATG GAACTGTTTT 2698 .......... .......... .......... .......... .......... .......... 60 GGGGAGGGGG CTGTTTAATA TGATTCTTTG GGTTATATGT GTTATTGGTA TTGTTGTGGA 2638 .......... .......... .......... .......... .......... .......... 60 TAATTTGGAT TGTTGTGGAT TGGGACGAAG TAAGGAAAAT AGGGGAGGTG CTGCCGAATT 2578 .......... .......... .......... .......... .......... .......... 60 TTCGTTAGAT TATTAGCTAG CTTACAAGAA AGTAAAGCAC GATATTTATC TAATTGCGGC 2518 .......... .......... .......... .......... .......... .......... 60 ACGATTGTTG CTTGTTATAG ATTAATAGCT TGAGCAGTAA ATATTGGACG TGCGGCTCGA 2458 ||| ||| || |||||||||| |||||||||| | |||||||| .......... .......... ATTTATACCT TGAGCAGTAA ATATTGGACG TACGGCTCGA 100 TTATACGGTA TGTAACGCTG TCCCTTCTTT CTTTGCTTGG CATGACTTTT AAAAATAAGC 2398 ||| || CTATTCG... .......... .......... .......... .......... .......... 107 GAATAACGGA CAGATTTGAT ACTTACCTCT AAAGCGTCTA GGTGATGTAT ATTCTTGCTT 2338 .......... .......... .......... .......... .......... .......... 107 CCACAATTAT TCCTCTATAT ATCGGTTATG TCTAAGGCTA TGATGATCTC TAATATCTAT 2278 .......... .......... .......... .......... .......... .......... 107 GGTAATGCTT CTTAGAGTCA TTGAAATTTT ACGTTTTCAT ATCGTATTAA AGGTTCATAA 2218 .......... .......... .......... .......... .......... .......... 107 TCTTGATAAA ACATTAATCT TTGGTAATAC TCCTTGCTGG TTCACGTTGA TTGTTCTATT 2158 .......... .......... .......... .......... .......... .......... 107 GAGTTATAAG AAATGATTTT AATTGCATAT GGTTGCTCAT AATATTCTGC TCGTGCATAG 2098 .......... .......... .......... .......... .......... .......... 107 AGTTATTTAT CATTTCACCG AGTCCCGGGC CGGGTAATGT TCGTGCGGAG TTTCTTGCAT 2038 .......... .......... .......... .......... .......... .......... 107 ATGTCACCGA GTTCCTCACT AGAGGGCCGG GTATGTATAT TATATATATG ATTGGTGATG 1978 .......... .......... .......... .......... .......... .......... 107 AGGATGGTTA TGATGATGAT GATGACGGAG ATGACGTGAT GATTATTTTG CCGAGCCCCT 1918 .......... .......... .......... .......... .......... .......... 107 TACTAGGGAA GCTGGGCACC TTAAATGTTA AATATATGCA TGATTTTCAT TTAAAAAGTA 1858 .......... .......... .......... .......... .......... .......... 107 TATGTGTAGC GATATTTTGT TTCGAGTTGC CACATTGGTA TCCTGTCATC TTTACCTTAT 1798 .......... .......... .......... .......... .......... .......... 107 GCTTTACATA CTCAGTACAT TGTTCGTACT GACCCCCCTT TCCTCGGGGG GCTGCGTTTC 1738 .......... .......... .......... .......... .......... .......... 107 ATGCCCGCAG GTGTAGACGC GCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT 1678 |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| .......... GTGTAGACGC TCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT 157 GATTGGGAGA GCTCCACTGT TCCGGAGCCC ATTCGTTTTG GTACATAACT T-TTGTGTAG 1619 |||||||| |||||||||| |||||||||| | |||||||| |||||||||| | || ||||| TTTTGGGAGA GCTCCACTGT TCCGGAGCCC AGTCGTTTTG GTACATAACT TCTTATGTAG 217 TCTTTTGCTC GTCTATGGGT ATGGCGGGGC CCTGTCCCGT CGAGTTTCAC TAATGTACCC 1559 ||||||||| |||||||||| |||||||||| |||||||||| | |||||||| || | ||| | TCTTTTGCTT GTCTATGGGT ATGGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC 277 TTAGAGGTCT GTGGACATTA TGTGGGTTGT ATATATATGT TTTGGATAAT GGTCTGGACA 1499 |||||||||| || ||||| |||||||||| ||| ||||| |||||||||| |||||||||| TTAGAGGTCT GTAGACATCG TGTGGGTTGT ATAATTATGT TTTGGATAAT GGTCTGGACA 337 TGGTTTGTTT GGGATGTCCA CTTGTACAGG GGCAGCCTTG TCGGCTGTGT ACATCATTAT 1439 |||||||||| |||||||||| ||||||| | ||||||||| |||| |||| |||||||| | TGGTTTGTTT GGGATGTCCA TTTGTACAAG TGCAGCCTTG TCGGTTGTGA ACATCATTGT 397 GCTTTGAATA GTGGCGGCCT TGTCGGCTCG CGTATGCTGT TATGGTTGAA TGGTTATGAC 1379 | ||| || ||||| |||| ||||||| | |||||||| | |||| GTATTGTGTA GTGGCAGCCT CGTCGGCT-G CGTATGCTAT TATG...... .......... 440 TCCTTATGAG ACATGTCCTC TTATATATAT ATATGACGTT GGGGTTGGCT TGATTTGATT 1319 .......... .......... .......... .......... .......... .......... 440 AAATTCCATA TTGTCTTAGT TTCAGTTGGT CATACTTAGC AGGTTTGTAT 1269 |||| || .......... .......... .......... .......... ..TTTTGGAT 448 hqPGS_C06HBa0057J04.1-22-_SGN-E305738+ (3116 3058,2497 2451,1727 1395,1276 1269) ******************************************************************************** EST sequence 12 +strand 542 n (File: SGN-E374135+) 1 AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 61 ATTTATACCT TGAGCAGTAA ATATTGGACG TACGGCTCGA CTATTCGGTG TAGACGCTCA 121 GTTCGGTGAT CCTCCCGCCT AGGATATCTA CTCTGCTTTT TGGGAGAGCT CCACTGTTCC 181 GGAGCCCAGT CGTTTTGGTA CATAACTTCT TATGTAGTCT TTTGCTTGTC TATGGGTATG 241 GCGGGGCCCT GTCCCGTCAA GTTTCACTAC TATACTCTTA GAGGTCTGTA GACATCGTGT 301 GGGTTGTATA ATTATGTTTT GGATAATGGT CTGGACATGG TTTGTTTGGG ATGTCCATTT 361 GTACAAGTGC AGCCTTGTCG GTTGTGAACA TCATTGTGTA TTGTGTAGTG GCAGCCTCGT 421 CGGCTGCGTA TGCTATTATG TTTTGGATAG TGGCGGCCTT GTCGGCTCGC ATATGTTGTT 481 ACGATTTAAT GGTTATGACT CTTTATGAGA TAGATCCACT TTATATATAA AAAAAAAAAA 541 AA Predicted gene structure (within gDNA segment 3496 to 1): Exon 1 3116 3058 ( 59 n); cDNA 1 60 ( 60 n); score: 0.839 Intron 1 3057 2498 ( 560 n); Pd: 0.997 (s: 0.81), Pa: 0.875 (s: 0.89) Exon 2 2497 2451 ( 47 n); cDNA 61 107 ( 47 n); score: 0.894 Intron 2 2450 1728 ( 723 n); Pd: 0.991 (s: 0.89), Pa: 0.000 (s: 0.98) Exon 3 1727 1395 ( 333 n); cDNA 108 440 ( 333 n); score: 0.905 Intron 3 1394 1277 ( 118 n); Pd: 0.305 (s: 0.82), Pa: 0.973 (s: 0) Exon 4 1276 1269 ( 8 n); cDNA 441 448 ( 8 n); score: 0.750 PPA cDNA 527 542 MATCH C06HBa0057J04.1-22- SGN-E374135+ 0.895 447 0.825 C PGS_C06HBa0057J04.1-22-_SGN-E374135+ (3116 3058,2497 2451,1727 1395,1276 1269) Alignment (genomic DNA sequence = upper lines): AGCCATGGAA ATGGAGA-AA CCAACCCTGC AACTCTTGGC CAGCAGCTGC AAATAATTTG 3058 |||||||||| ||||||| || ||| || | ||||||||| |||||||||| ||| || ||| AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 60 GTTAGTAATC TCCTTGTTTG GTGTGTTAAT TCTTTAGAAT ACCCTTGTTA ATTATCCATT 2998 .......... .......... .......... .......... .......... .......... 60 AATTTTAAGA AGGGGGCGTG ACCAGTAGCT TAGGAAGTTT GTTTTAGTTA TTGAATGTAC 2938 .......... .......... .......... .......... .......... .......... 60 TAAGTATGAA TGGAAACCAT AATCGGATTA TTAGTGGTGT CGTGTTGGTG CTTGGGCTGT 2878 .......... .......... .......... .......... .......... .......... 60 TTTGATTAAA GCAAACTGCA GGAAAATTCT TTGTTGGCAT TATGTATATG TTGAATGTGA 2818 .......... .......... .......... .......... .......... .......... 60 TTATGAGTAT ATACTCCAAA GGATGAATAC GATAAGGTAG ATGTGTTACG AATTATAAAA 2758 .......... .......... .......... .......... .......... .......... 60 CGAGTTATCA CTCGGTGTGT CGTTGCTTCG CTGATATAGT TGCCGAGATG GAACTGTTTT 2698 .......... .......... .......... .......... .......... .......... 60 GGGGAGGGGG CTGTTTAATA TGATTCTTTG GGTTATATGT GTTATTGGTA TTGTTGTGGA 2638 .......... .......... .......... .......... .......... .......... 60 TAATTTGGAT TGTTGTGGAT TGGGACGAAG TAAGGAAAAT AGGGGAGGTG CTGCCGAATT 2578 .......... .......... .......... .......... .......... .......... 60 TTCGTTAGAT TATTAGCTAG CTTACAAGAA AGTAAAGCAC GATATTTATC TAATTGCGGC 2518 .......... .......... .......... .......... .......... .......... 60 ACGATTGTTG CTTGTTATAG ATTAATAGCT TGAGCAGTAA ATATTGGACG TGCGGCTCGA 2458 ||| ||| || |||||||||| |||||||||| | |||||||| .......... .......... ATTTATACCT TGAGCAGTAA ATATTGGACG TACGGCTCGA 100 TTATACGGTA TGTAACGCTG TCCCTTCTTT CTTTGCTTGG CATGACTTTT AAAAATAAGC 2398 ||| || CTATTCG... .......... .......... .......... .......... .......... 107 GAATAACGGA CAGATTTGAT ACTTACCTCT AAAGCGTCTA GGTGATGTAT ATTCTTGCTT 2338 .......... .......... .......... .......... .......... .......... 107 CCACAATTAT TCCTCTATAT ATCGGTTATG TCTAAGGCTA TGATGATCTC TAATATCTAT 2278 .......... .......... .......... .......... .......... .......... 107 GGTAATGCTT CTTAGAGTCA TTGAAATTTT ACGTTTTCAT ATCGTATTAA AGGTTCATAA 2218 .......... .......... .......... .......... .......... .......... 107 TCTTGATAAA ACATTAATCT TTGGTAATAC TCCTTGCTGG TTCACGTTGA TTGTTCTATT 2158 .......... .......... .......... .......... .......... .......... 107 GAGTTATAAG AAATGATTTT AATTGCATAT GGTTGCTCAT AATATTCTGC TCGTGCATAG 2098 .......... .......... .......... .......... .......... .......... 107 AGTTATTTAT CATTTCACCG AGTCCCGGGC CGGGTAATGT TCGTGCGGAG TTTCTTGCAT 2038 .......... .......... .......... .......... .......... .......... 107 ATGTCACCGA GTTCCTCACT AGAGGGCCGG GTATGTATAT TATATATATG ATTGGTGATG 1978 .......... .......... .......... .......... .......... .......... 107 AGGATGGTTA TGATGATGAT GATGACGGAG ATGACGTGAT GATTATTTTG CCGAGCCCCT 1918 .......... .......... .......... .......... .......... .......... 107 TACTAGGGAA GCTGGGCACC TTAAATGTTA AATATATGCA TGATTTTCAT TTAAAAAGTA 1858 .......... .......... .......... .......... .......... .......... 107 TATGTGTAGC GATATTTTGT TTCGAGTTGC CACATTGGTA TCCTGTCATC TTTACCTTAT 1798 .......... .......... .......... .......... .......... .......... 107 GCTTTACATA CTCAGTACAT TGTTCGTACT GACCCCCCTT TCCTCGGGGG GCTGCGTTTC 1738 .......... .......... .......... .......... .......... .......... 107 ATGCCCGCAG GTGTAGACGC GCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT 1678 |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| .......... GTGTAGACGC TCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT 157 GATTGGGAGA GCTCCACTGT TCCGGAGCCC ATTCGTTTTG GTACATAACT T-TTGTGTAG 1619 |||||||| |||||||||| |||||||||| | |||||||| |||||||||| | || ||||| TTTTGGGAGA GCTCCACTGT TCCGGAGCCC AGTCGTTTTG GTACATAACT TCTTATGTAG 217 TCTTTTGCTC GTCTATGGGT ATGGCGGGGC CCTGTCCCGT CGAGTTTCAC TAATGTACCC 1559 ||||||||| |||||||||| |||||||||| |||||||||| | |||||||| || | ||| | TCTTTTGCTT GTCTATGGGT ATGGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC 277 TTAGAGGTCT GTGGACATTA TGTGGGTTGT ATATATATGT TTTGGATAAT GGTCTGGACA 1499 |||||||||| || ||||| |||||||||| ||| ||||| |||||||||| |||||||||| TTAGAGGTCT GTAGACATCG TGTGGGTTGT ATAATTATGT TTTGGATAAT GGTCTGGACA 337 TGGTTTGTTT GGGATGTCCA CTTGTACAGG GGCAGCCTTG TCGGCTGTGT ACATCATTAT 1439 |||||||||| |||||||||| ||||||| | ||||||||| |||| |||| |||||||| | TGGTTTGTTT GGGATGTCCA TTTGTACAAG TGCAGCCTTG TCGGTTGTGA ACATCATTGT 397 GCTTTGAATA GTGGCGGCCT TGTCGGCTCG CGTATGCTGT TATGGTTGAA TGGTTATGAC 1379 | ||| || ||||| |||| ||||||| | |||||||| | |||| GTATTGTGTA GTGGCAGCCT CGTCGGCT-G CGTATGCTAT TATG...... .......... 440 TCCTTATGAG ACATGTCCTC TTATATATAT ATATGACGTT GGGGTTGGCT TGATTTGATT 1319 .......... .......... .......... .......... .......... .......... 440 AAATTCCATA TTGTCTTAGT TTCAGTTGGT CATACTTAGC AGGTTTGTAT 1269 |||| || .......... .......... .......... .......... ..TTTTGGAT 448 hqPGS_C06HBa0057J04.1-22-_SGN-E374135+ (3116 3058,2497 2451,1727 1395,1276 1269) ******************************************************************************** EST sequence 1 -strand 432 n (File: SGN-E225616-) 1 TATTCGGTGT AGACGCTCAG TTCGGTGATC CTCCCGCCTA GGATATCTAC TCTGCTTTTT 61 GGGAGAGCTC CACTGTTCCG GAGCCCAGTC GTTTTGGTAC ATAACTTCTT ATGTAGTCTT 121 TTGCTTGTCT ATGGGTATGG CGGGGCCCTG TCCCGTCAAG TTTCACTACT ATACTCTTAG 181 AGGTCTGTAG ACATCGTGTG GGTTGTATAA TTATGTTTTG GATAATGGTC TGGACATGGT 241 TTGTTTGGGA TGTCCATTTG TACAAGTGCA GCCTTGTCGG TTGTGAACAT CATTGTGTAT 301 TGTGTAGTGG CAGCCTCGTC GGCTGCGTAT GCTATTATGT TTTGGATAGT GGCGGCCTTG 361 TCGGCTCGCA TATGTTGTTA CGATTTAATG GTTATGACTC TTTATGAAAA AACCAAAAAA 421 AAAAAAAAAA AA Predicted gene structure (within gDNA segment 2496 to 1): Exon 1 1728 1395 ( 334 n); cDNA 6 339 ( 334 n); score: 0.906 Intron 1 1394 1277 ( 118 n); Pd: 0.305 (s: 0.82), Pa: 0.973 (s: 0) Exon 2 1276 1269 ( 8 n); cDNA 340 347 ( 8 n); score: 0.750 PPA cDNA 415 432 MATCH C06HBa0057J04.1-22- SGN-E225616- 0.906 342 0.792 C PGS_C06HBa0057J04.1-22-_SGN-E225616- (1728 1395,1276 1269) Alignment (genomic DNA sequence = upper lines): GGTGTAGACG CGCAGTTCGG TGATCCTCCC GCCTAGGATA TCTACTCTGC TGATTGGGAG 1669 |||||||||| | |||||||| |||||||||| |||||||||| |||||||||| | ||||||| GGTGTAGACG CTCAGTTCGG TGATCCTCCC GCCTAGGATA TCTACTCTGC TTTTTGGGAG 65 AGCTCCACTG TTCCGGAGCC CATTCGTTTT GGTACATAAC TT-TTGTGTA GTCTTTTGCT 1610 |||||||||| |||||||||| || ||||||| |||||||||| || || |||| |||||||||| AGCTCCACTG TTCCGGAGCC CAGTCGTTTT GGTACATAAC TTCTTATGTA GTCTTTTGCT 125 CGTCTATGGG TATGGCGGGG CCCTGTCCCG TCGAGTTTCA CTAATGTACC CTTAGAGGTC 1550 ||||||||| |||||||||| |||||||||| || ||||||| ||| | ||| |||||||||| TGTCTATGGG TATGGCGGGG CCCTGTCCCG TCAAGTTTCA CTACTATACT CTTAGAGGTC 185 TGTGGACATT ATGTGGGTTG TATATATATG TTTTGGATAA TGGTCTGGAC ATGGTTTGTT 1490 ||| ||||| ||||||||| |||| |||| |||||||||| |||||||||| |||||||||| TGTAGACATC GTGTGGGTTG TATAATTATG TTTTGGATAA TGGTCTGGAC ATGGTTTGTT 245 TGGGATGTCC ACTTGTACAG GGGCAGCCTT GTCGGCTGTG TACATCATTA TGCTTTGAAT 1430 |||||||||| | ||||||| | |||||||| ||||| |||| |||||||| || ||| | TGGGATGTCC ATTTGTACAA GTGCAGCCTT GTCGGTTGTG AACATCATTG TGTATTGTGT 305 AGTGGCGGCC TTGTCGGCTC GCGTATGCTG TTATGGTTGA ATGGTTATGA CTCCTTATGA 1370 |||||| ||| | ||||||| ||||||||| ||||| AGTGGCAGCC TCGTCGGCT- GCGTATGCTA TTATG..... .......... .......... 339 GACATGTCCT CTTATATATA TATATGACGT TGGGGTTGGC TTGATTTGAT TAAATTCCAT 1310 .......... .......... .......... .......... .......... .......... 339 ATTGTCTTAG TTTCAGTTGG TCATACTTAG CAGGTTTGTA T 1269 |||| | | .......... .......... .......... ...TTTTGGA T 347 hqPGS_C06HBa0057J04.1-22-_SGN-E225616- (1728 1395,1276 1269) ******************************************************************************** EST sequence 6 +strand 495 n (File: SGN-E306317+) 1 TAAACTCTTG GACAGCAGCT GCAAAGAAAT TGGTGTAGAC GCTCAGTTCG GTGATCCTCC 61 CGCCTAGGAT ATCTACTCTG CTGTTTGGGA GAGCTCCACT GTTCCGGAGC CCAGTCGTTT 121 TGGTACATAA CTTCTTATGT AGTCTTTTGC TTGTCTATGG GTATGGCGGG GCCCTGTCCC 181 GTCAAGTTTC ACTACTATAC TCTTAGAGGT CTGTAGACAT CGTGTGGGTT GTATAATTAT 241 GTTTTGGATA ATGGTCTGGA CATGGTTTGT TTGGGATGTC CATTTGTACA AGTGCAGCCT 301 TGTCGGTTGT GAACATCATT GTGTATTGTG TAGTGGCAGC CTCGTCGGCT GCGTATGCTA 361 TTATGTTTTG GATAGTGGCG GCCTTGTCGG CTCGCATATG TTGTTACGAT TTAATGGTTA 421 TGACTCTTTA TGAGATAGAT CCACTTTATA TATATATATA TATATATGGC GTTGGGTTTN 481 AAAAAAAAAA AAAAA Predicted gene structure (within gDNA segment 2746 to 1): Exon 1 1727 1395 ( 333 n); cDNA 33 365 ( 333 n); score: 0.908 Intron 1 1394 1277 ( 118 n); Pd: 0.305 (s: 0.82), Pa: 0.973 (s: 0) Exon 2 1276 1269 ( 8 n); cDNA 366 373 ( 8 n); score: 0.750 PPA cDNA 481 495 MATCH C06HBa0057J04.1-22- SGN-E306317+ 0.908 341 0.689 C PGS_C06HBa0057J04.1-22-_SGN-E306317+ (1727 1395,1276 1269) Alignment (genomic DNA sequence = upper lines): GTGTAGACGC GCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT GATTGGGAGA 1668 |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| | |||||||| GTGTAGACGC TCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT GTTTGGGAGA 92 GCTCCACTGT TCCGGAGCCC ATTCGTTTTG GTACATAACT T-TTGTGTAG TCTTTTGCTC 1609 |||||||||| |||||||||| | |||||||| |||||||||| | || ||||| ||||||||| GCTCCACTGT TCCGGAGCCC AGTCGTTTTG GTACATAACT TCTTATGTAG TCTTTTGCTT 152 GTCTATGGGT ATGGCGGGGC CCTGTCCCGT CGAGTTTCAC TAATGTACCC TTAGAGGTCT 1549 |||||||||| |||||||||| |||||||||| | |||||||| || | ||| | |||||||||| GTCTATGGGT ATGGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC TTAGAGGTCT 212 GTGGACATTA TGTGGGTTGT ATATATATGT TTTGGATAAT GGTCTGGACA TGGTTTGTTT 1489 || ||||| |||||||||| ||| ||||| |||||||||| |||||||||| |||||||||| GTAGACATCG TGTGGGTTGT ATAATTATGT TTTGGATAAT GGTCTGGACA TGGTTTGTTT 272 GGGATGTCCA CTTGTACAGG GGCAGCCTTG TCGGCTGTGT ACATCATTAT GCTTTGAATA 1429 |||||||||| ||||||| | ||||||||| |||| |||| |||||||| | | ||| || GGGATGTCCA TTTGTACAAG TGCAGCCTTG TCGGTTGTGA ACATCATTGT GTATTGTGTA 332 GTGGCGGCCT TGTCGGCTCG CGTATGCTGT TATGGTTGAA TGGTTATGAC TCCTTATGAG 1369 ||||| |||| ||||||| | |||||||| | |||| GTGGCAGCCT CGTCGGCT-G CGTATGCTAT TATG...... .......... .......... 365 ACATGTCCTC TTATATATAT ATATGACGTT GGGGTTGGCT TGATTTGATT AAATTCCATA 1309 .......... .......... .......... .......... .......... .......... 365 TTGTCTTAGT TTCAGTTGGT CATACTTAGC AGGTTTGTAT 1269 |||| || .......... .......... .......... ..TTTTGGAT 373 hqPGS_C06HBa0057J04.1-22-_SGN-E306317+ (1727 1395,1276 1269) ******************************************************************************** EST sequence 15 +strand 523 n (File: SGN-E303695+) 1 AAATGGAGAA AACCAGCCAT TAAACTCTTG GACAGCAGCT GCAAAGAAAT TGGTGTAGAC 61 GCTCAGTTCG GTGATCCTCC CGCCTAGGAT ATCTACTCTG CTTTTTGGGA GAGCTCCACT 121 GTTCCGGAGC CCAGTCGTTT TGGTACATAA CTTCTTATGT AGTCTTTTGC TTGTCTATGG 181 GTATGGCGGG GCCCTGTCCC GTCAAGTTTC ACTACTATAC TCTTAGAGGT CTGTAGACAT 241 CGTGTGGGTT GTATAATTAT GTTTTGGATA ATGGTCTGGA CATGGTTTGT TTGGGATGTC 301 CATTTGTACA AGTGCAGCCT TGTCGGTTGT GAACATCATT GTGTATTGTG TAGTGGCAGC 361 CTCGTCGGCT GCGTATGCTA TTATGTTTTG GATAGTGGCG GCCTTGTCGG CTCGCATATG 421 TTGTTACGAT TTAATGGTTA TGACTCTTTA TGAGATAGAT CCACTTTATA TATATATATA 481 TATATATGGC GTTGGGTTTA GCTTGATTTG ATTAAAAAAA AAA Predicted gene structure (within gDNA segment 2946 to 1): Exon 1 1727 1395 ( 333 n); cDNA 53 385 ( 333 n); score: 0.905 Intron 1 1394 1277 ( 118 n); Pd: 0.305 (s: 0.82), Pa: 0.973 (s: 0) Exon 2 1276 1269 ( 8 n); cDNA 386 393 ( 8 n); score: 0.750 PPA cDNA 514 523 MATCH C06HBa0057J04.1-22- SGN-E303695+ 0.905 341 0.652 C PGS_C06HBa0057J04.1-22-_SGN-E303695+ (1727 1395,1276 1269) Alignment (genomic DNA sequence = upper lines): GTGTAGACGC GCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT GATTGGGAGA 1668 |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| |||||||| GTGTAGACGC TCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT TTTTGGGAGA 112 GCTCCACTGT TCCGGAGCCC ATTCGTTTTG GTACATAACT T-TTGTGTAG TCTTTTGCTC 1609 |||||||||| |||||||||| | |||||||| |||||||||| | || ||||| ||||||||| GCTCCACTGT TCCGGAGCCC AGTCGTTTTG GTACATAACT TCTTATGTAG TCTTTTGCTT 172 GTCTATGGGT ATGGCGGGGC CCTGTCCCGT CGAGTTTCAC TAATGTACCC TTAGAGGTCT 1549 |||||||||| |||||||||| |||||||||| | |||||||| || | ||| | |||||||||| GTCTATGGGT ATGGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC TTAGAGGTCT 232 GTGGACATTA TGTGGGTTGT ATATATATGT TTTGGATAAT GGTCTGGACA TGGTTTGTTT 1489 || ||||| |||||||||| ||| ||||| |||||||||| |||||||||| |||||||||| GTAGACATCG TGTGGGTTGT ATAATTATGT TTTGGATAAT GGTCTGGACA TGGTTTGTTT 292 GGGATGTCCA CTTGTACAGG GGCAGCCTTG TCGGCTGTGT ACATCATTAT GCTTTGAATA 1429 |||||||||| ||||||| | ||||||||| |||| |||| |||||||| | | ||| || GGGATGTCCA TTTGTACAAG TGCAGCCTTG TCGGTTGTGA ACATCATTGT GTATTGTGTA 352 GTGGCGGCCT TGTCGGCTCG CGTATGCTGT TATGGTTGAA TGGTTATGAC TCCTTATGAG 1369 ||||| |||| ||||||| | |||||||| | |||| GTGGCAGCCT CGTCGGCT-G CGTATGCTAT TATG...... .......... .......... 385 ACATGTCCTC TTATATATAT ATATGACGTT GGGGTTGGCT TGATTTGATT AAATTCCATA 1309 .......... .......... .......... .......... .......... .......... 385 TTGTCTTAGT TTCAGTTGGT CATACTTAGC AGGTTTGTAT 1269 |||| || .......... .......... .......... ..TTTTGGAT 393 hqPGS_C06HBa0057J04.1-22-_SGN-E303695+ (1727 1395,1276 1269) ******************************************************************************** EST sequence 16 +strand 519 n (File: SGN-E310669+) 1 AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 61 ATTTATACCT TGAGCAGTAA ATATTGGACG TACGGCTCGA CTATTCGGTG TAGACGCTCA 121 GTTCGGTGAT CCTCCCGCCT AGGATATCTA CTCTGCTGTT TGGGAGAGCT CCACTGTTCC 181 GGAGCCCAGT CGTTTTGGTA CATAACTTCT TATGTAGTCT TTTGCTTGTC TATGGGTATG 241 GCGGGGCCCT GTCCCGTCAA GTTTCACTAC TATACTCTTA GAGGTCTGTA GACATCGTGT 301 GGGTTGTATA ATTATGTTTT GGATAATGGT CTGGACATGG TTTGTTTGGG ATGTCCATTT 361 GTACAAGTGC AGCCTTGTCG GTTGTGAACA TCATTGTGTA TTGTGTAGTG GCAGCCTCGT 421 CGGCTGCGTA TGCTATTATG TTTTGGATAG TGGCGGCCTT GTCGGCTCGC ATATGTTGTT 481 ACGATTTAAT GGTTATGACT CTTTATGAAA AAAAAAAAA Predicted gene structure (within gDNA segment 3496 to 1): Exon 1 3116 3058 ( 59 n); cDNA 1 60 ( 60 n); score: 0.839 Intron 1 3057 2498 ( 560 n); Pd: 0.997 (s: 0.81), Pa: 0.875 (s: 0.89) Exon 2 2497 2451 ( 47 n); cDNA 61 107 ( 47 n); score: 0.894 Intron 2 2450 1728 ( 723 n); Pd: 0.991 (s: 0.89), Pa: 0.000 (s: 0.98) Exon 3 1727 1395 ( 333 n); cDNA 108 440 ( 333 n); score: 0.908 PPA cDNA 508 519 MATCH C06HBa0057J04.1-22- SGN-E310669+ 0.898 439 0.846 C PGS_C06HBa0057J04.1-22-_SGN-E310669+ (3116 3058,2497 2451,1727 1395) Alignment (genomic DNA sequence = upper lines): AGCCATGGAA ATGGAGA-AA CCAACCCTGC AACTCTTGGC CAGCAGCTGC AAATAATTTG 3058 |||||||||| ||||||| || ||| || | ||||||||| |||||||||| ||| || ||| AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 60 GTTAGTAATC TCCTTGTTTG GTGTGTTAAT TCTTTAGAAT ACCCTTGTTA ATTATCCATT 2998 .......... .......... .......... .......... .......... .......... 60 AATTTTAAGA AGGGGGCGTG ACCAGTAGCT TAGGAAGTTT GTTTTAGTTA TTGAATGTAC 2938 .......... .......... .......... .......... .......... .......... 60 TAAGTATGAA TGGAAACCAT AATCGGATTA TTAGTGGTGT CGTGTTGGTG CTTGGGCTGT 2878 .......... .......... .......... .......... .......... .......... 60 TTTGATTAAA GCAAACTGCA GGAAAATTCT TTGTTGGCAT TATGTATATG TTGAATGTGA 2818 .......... .......... .......... .......... .......... .......... 60 TTATGAGTAT ATACTCCAAA GGATGAATAC GATAAGGTAG ATGTGTTACG AATTATAAAA 2758 .......... .......... .......... .......... .......... .......... 60 CGAGTTATCA CTCGGTGTGT CGTTGCTTCG CTGATATAGT TGCCGAGATG GAACTGTTTT 2698 .......... .......... .......... .......... .......... .......... 60 GGGGAGGGGG CTGTTTAATA TGATTCTTTG GGTTATATGT GTTATTGGTA TTGTTGTGGA 2638 .......... .......... .......... .......... .......... .......... 60 TAATTTGGAT TGTTGTGGAT TGGGACGAAG TAAGGAAAAT AGGGGAGGTG CTGCCGAATT 2578 .......... .......... .......... .......... .......... .......... 60 TTCGTTAGAT TATTAGCTAG CTTACAAGAA AGTAAAGCAC GATATTTATC TAATTGCGGC 2518 .......... .......... .......... .......... .......... .......... 60 ACGATTGTTG CTTGTTATAG ATTAATAGCT TGAGCAGTAA ATATTGGACG TGCGGCTCGA 2458 ||| ||| || |||||||||| |||||||||| | |||||||| .......... .......... ATTTATACCT TGAGCAGTAA ATATTGGACG TACGGCTCGA 100 TTATACGGTA TGTAACGCTG TCCCTTCTTT CTTTGCTTGG CATGACTTTT AAAAATAAGC 2398 ||| || CTATTCG... .......... .......... .......... .......... .......... 107 GAATAACGGA CAGATTTGAT ACTTACCTCT AAAGCGTCTA GGTGATGTAT ATTCTTGCTT 2338 .......... .......... .......... .......... .......... .......... 107 CCACAATTAT TCCTCTATAT ATCGGTTATG TCTAAGGCTA TGATGATCTC TAATATCTAT 2278 .......... .......... .......... .......... .......... .......... 107 GGTAATGCTT CTTAGAGTCA TTGAAATTTT ACGTTTTCAT ATCGTATTAA AGGTTCATAA 2218 .......... .......... .......... .......... .......... .......... 107 TCTTGATAAA ACATTAATCT TTGGTAATAC TCCTTGCTGG TTCACGTTGA TTGTTCTATT 2158 .......... .......... .......... .......... .......... .......... 107 GAGTTATAAG AAATGATTTT AATTGCATAT GGTTGCTCAT AATATTCTGC TCGTGCATAG 2098 .......... .......... .......... .......... .......... .......... 107 AGTTATTTAT CATTTCACCG AGTCCCGGGC CGGGTAATGT TCGTGCGGAG TTTCTTGCAT 2038 .......... .......... .......... .......... .......... .......... 107 ATGTCACCGA GTTCCTCACT AGAGGGCCGG GTATGTATAT TATATATATG ATTGGTGATG 1978 .......... .......... .......... .......... .......... .......... 107 AGGATGGTTA TGATGATGAT GATGACGGAG ATGACGTGAT GATTATTTTG CCGAGCCCCT 1918 .......... .......... .......... .......... .......... .......... 107 TACTAGGGAA GCTGGGCACC TTAAATGTTA AATATATGCA TGATTTTCAT TTAAAAAGTA 1858 .......... .......... .......... .......... .......... .......... 107 TATGTGTAGC GATATTTTGT TTCGAGTTGC CACATTGGTA TCCTGTCATC TTTACCTTAT 1798 .......... .......... .......... .......... .......... .......... 107 GCTTTACATA CTCAGTACAT TGTTCGTACT GACCCCCCTT TCCTCGGGGG GCTGCGTTTC 1738 .......... .......... .......... .......... .......... .......... 107 ATGCCCGCAG GTGTAGACGC GCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT 1678 |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| .......... GTGTAGACGC TCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT 157 GATTGGGAGA GCTCCACTGT TCCGGAGCCC ATTCGTTTTG GTACATAACT T-TTGTGTAG 1619 | |||||||| |||||||||| |||||||||| | |||||||| |||||||||| | || ||||| GTTTGGGAGA GCTCCACTGT TCCGGAGCCC AGTCGTTTTG GTACATAACT TCTTATGTAG 217 TCTTTTGCTC GTCTATGGGT ATGGCGGGGC CCTGTCCCGT CGAGTTTCAC TAATGTACCC 1559 ||||||||| |||||||||| |||||||||| |||||||||| | |||||||| || | ||| | TCTTTTGCTT GTCTATGGGT ATGGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC 277 TTAGAGGTCT GTGGACATTA TGTGGGTTGT ATATATATGT TTTGGATAAT GGTCTGGACA 1499 |||||||||| || ||||| |||||||||| ||| ||||| |||||||||| |||||||||| TTAGAGGTCT GTAGACATCG TGTGGGTTGT ATAATTATGT TTTGGATAAT GGTCTGGACA 337 TGGTTTGTTT GGGATGTCCA CTTGTACAGG GGCAGCCTTG TCGGCTGTGT ACATCATTAT 1439 |||||||||| |||||||||| ||||||| | ||||||||| |||| |||| |||||||| | TGGTTTGTTT GGGATGTCCA TTTGTACAAG TGCAGCCTTG TCGGTTGTGA ACATCATTGT 397 GCTTTGAATA GTGGCGGCCT TGTCGGCTCG CGTATGCTGT TATG 1395 | ||| || ||||| |||| ||||||| | |||||||| | |||| GTATTGTGTA GTGGCAGCCT CGTCGGCT-G CGTATGCTAT TATG 440 hqPGS_C06HBa0057J04.1-22-_SGN-E310669+ (3116 3058,2497 2451,1727 1395) ******************************************************************************** EST sequence 7 +strand 606 n (File: SGN-E538151+) 1 GAGAAAACCA GCCATTAAAC TCTTGGACAG CAGCTGCAAA GAAATTGAGT CATTTATCAT 61 TTCACCGAGT CCCGGGCCGG GTAATGTTCG TGCGGAGTTT CTTGCATATG TCACCGAGTC 121 CCTCACTAGA GGGCCGGGAA TGTATATTAT ATATATGATT GGTGATGAGG ATGGTTATGA 181 TGATGATGAT GACGGAGATG ATGTGATGAC TATTTCACTG AGTCCCTCAC TAGAGGGCCG 241 GGTGTAGACG CTCAGTTTGG TGATCCTCCC GCCTAGGATA TCTACTCTGC TGTTTGGGAG 301 AGCTCCACTG TTCCGGAGCC CAGTCGTTTT GGTACATAAC TTCTTATGTA GTCTTTTGCT 361 TGTCTATGGG TATGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC TTAGAGGTCT 421 GTAGACATCG TGTGGGTTGT ATAATTATGT TTTTGATAAT GGTCTGGACA TGGTTTGTTT 481 GGGATGTCCA CTTGTACAAG TGCAACCTTG TCGGTTGTGT ACATCTTTGT GTATTGTGTA 541 GTGGCAGCCT TGTCGGCTGC GTATGCTATT ATGCTTTGAA TAGTGGCAGC CTTGTCGGCT 601 CGCGTA Predicted gene structure (within gDNA segment 3203 to 795): Exon 1 3100 3058 ( 43 n); cDNA 5 47 ( 43 n); score: 0.837 Intron 1 3057 2098 ( 960 n); Pd: 0.997 (s: 0.84), Pa: 0.967 (s: 0.98) Exon 2 2097 1903 ( 195 n); cDNA 48 241 ( 194 n); score: 0.928 Intron 2 1902 1728 ( 175 n); Pd: 0.000 (s: 0.78), Pa: 0.000 (s: 0.96) Exon 3 1727 1395 ( 333 n); cDNA 242 573 ( 332 n); score: 0.902 MATCH C06HBa0057J04.1-22- SGN-E538151+ 0.912 571 0.942 C PGS_C06HBa0057J04.1-22-_SGN-E538151+ (3100 3058,2097 1903,1727 1395) Alignment (genomic DNA sequence = upper lines): AAACCAACCC TGCAACTCTT GGCCAGCAGC TGCAAATAAT TTGGTTAGTA ATCTCCTTGT 3041 |||||| || | ||||||| || ||||||| |||||| || ||| AAACCAGCCA TTAAACTCTT GGACAGCAGC TGCAAAGAAA TTG....... .......... 47 TTGGTGTGTT AATTCTTTAG AATACCCTTG TTAATTATCC ATTAATTTTA AGAAGGGGGC 2981 .......... .......... .......... .......... .......... .......... 47 GTGACCAGTA GCTTAGGAAG TTTGTTTTAG TTATTGAATG TACTAAGTAT GAATGGAAAC 2921 .......... .......... .......... .......... .......... .......... 47 CATAATCGGA TTATTAGTGG TGTCGTGTTG GTGCTTGGGC TGTTTTGATT AAAGCAAACT 2861 .......... .......... .......... .......... .......... .......... 47 GCAGGAAAAT TCTTTGTTGG CATTATGTAT ATGTTGAATG TGATTATGAG TATATACTCC 2801 .......... .......... .......... .......... .......... .......... 47 AAAGGATGAA TACGATAAGG TAGATGTGTT ACGAATTATA AAACGAGTTA TCACTCGGTG 2741 .......... .......... .......... .......... .......... .......... 47 TGTCGTTGCT TCGCTGATAT AGTTGCCGAG ATGGAACTGT TTTGGGGAGG GGGCTGTTTA 2681 .......... .......... .......... .......... .......... .......... 47 ATATGATTCT TTGGGTTATA TGTGTTATTG GTATTGTTGT GGATAATTTG GATTGTTGTG 2621 .......... .......... .......... .......... .......... .......... 47 GATTGGGACG AAGTAAGGAA AATAGGGGAG GTGCTGCCGA ATTTTCGTTA GATTATTAGC 2561 .......... .......... .......... .......... .......... .......... 47 TAGCTTACAA GAAAGTAAAG CACGATATTT ATCTAATTGC GGCACGATTG TTGCTTGTTA 2501 .......... .......... .......... .......... .......... .......... 47 TAGATTAATA GCTTGAGCAG TAAATATTGG ACGTGCGGCT CGATTATACG GTATGTAACG 2441 .......... .......... .......... .......... .......... .......... 47 CTGTCCCTTC TTTCTTTGCT TGGCATGACT TTTAAAAATA AGCGAATAAC GGACAGATTT 2381 .......... .......... .......... .......... .......... .......... 47 GATACTTACC TCTAAAGCGT CTAGGTGATG TATATTCTTG CTTCCACAAT TATTCCTCTA 2321 .......... .......... .......... .......... .......... .......... 47 TATATCGGTT ATGTCTAAGG CTATGATGAT CTCTAATATC TATGGTAATG CTTCTTAGAG 2261 .......... .......... .......... .......... .......... .......... 47 TCATTGAAAT TTTACGTTTT CATATCGTAT TAAAGGTTCA TAATCTTGAT AAAACATTAA 2201 .......... .......... .......... .......... .......... .......... 47 TCTTTGGTAA TACTCCTTGC TGGTTCACGT TGATTGTTCT ATTGAGTTAT AAGAAATGAT 2141 .......... .......... .......... .......... .......... .......... 47 TTTAATTGCA TATGGTTGCT CATAATATTC TGCTCGTGCA TAGAGTTATT TATCATTTCA 2081 ||| ||| |||||||||| .......... .......... .......... .......... ...AGTCATT TATCATTTCA 64 CCGAGTCCCG GGCCGGGTAA TGTTCGTGCG GAGTTTCTTG CATATGTCAC CGAGTTCCTC 2021 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||| |||| CCGAGTCCCG GGCCGGGTAA TGTTCGTGCG GAGTTTCTTG CATATGTCAC CGAGTCCCTC 124 ACTAGAGGGC CGGGTATGTA TATTATATAT ATGATTGGTG ATGAGGATGG TTATGATGAT 1961 |||||||||| |||| ||||| |||||||||| |||||||||| |||||||||| |||||||||| ACTAGAGGGC CGGGAATGTA TATTATATAT ATGATTGGTG ATGAGGATGG TTATGATGAT 184 GATGATGACG GAGATGACGT GATGATTATT TTGCCGAGCC CCTTACTAGG GAAGCTGGGC 1901 |||||||||| ||||||| || ||||| |||| | | ||| | ||| ||||| | || || GATGATGACG GAGATGATGT GATGACTATT TCACTGAGTC CCTCACTAGA G-GGCCGG.. 241 ACCTTAAATG TTAAATATAT GCATGATTTT CATTTAAAAA GTATATGTGT AGCGATATTT 1841 .......... .......... .......... .......... .......... .......... 241 TGTTTCGAGT TGCCACATTG GTATCCTGTC ATCTTTACCT TATGCTTTAC ATACTCAGTA 1781 .......... .......... .......... .......... .......... .......... 241 CATTGTTCGT ACTGACCCCC CTTTCCTCGG GGGGCTGCGT TTCATGCCCG CAGGTGTAGA 1721 ||||||| .......... .......... .......... .......... .......... ...GTGTAGA 248 CGCGCAGTTC GGTGATCCTC CCGCCTAGGA TATCTACTCT GCTGATTGGG AGAGCTCCAC 1661 ||| ||||| |||||||||| |||||||||| |||||||||| |||| ||||| |||||||||| CGCTCAGTTT GGTGATCCTC CCGCCTAGGA TATCTACTCT GCTGTTTGGG AGAGCTCCAC 308 TGTTCCGGAG CCCATTCGTT TTGGTACATA ACTT-TTGTG TAGTCTTTTG CTCGTCTATG 1602 |||||||||| |||| ||||| |||||||||| |||| || || |||||||||| || ||||||| TGTTCCGGAG CCCAGTCGTT TTGGTACATA ACTTCTTATG TAGTCTTTTG CTTGTCTATG 368 GGTATGGCGG GGCCCTGTCC CGTCGAGTTT CACTAATGTA CCCTTAGAGG TCTGTGGACA 1542 ||||| |||| |||||||||| |||| ||||| ||||| | || | |||||||| ||||| |||| GGTAT-GCGG GGCCCTGTCC CGTCAAGTTT CACTACTATA CTCTTAGAGG TCTGTAGACA 427 TTATGTGGGT TGTATATATA TGTTTTGGAT AATGGTCTGG ACATGGTTTG TTTGGGATGT 1482 | ||||||| |||||| || |||||| ||| |||||||||| |||||||||| |||||||||| TCGTGTGGGT TGTATAATTA TGTTTTTGAT AATGGTCTGG ACATGGTTTG TTTGGGATGT 487 CCACTTGTAC AGGGGCAGCC TTGTCGGCTG TGTACATCAT TATGCTTTGA ATAGTGGCGG 1422 |||||||||| | | ||| || ||||||| || |||||||| | | || ||| ||||||| | CCACTTGTAC AAGTGCAACC TTGTCGGTTG TGTACATCTT TGTGTATTGT GTAGTGGCAG 547 CCTTGTCGGC TCGCGTATGC TGTTATG 1395 |||||||||| | |||||||| | ||||| CCTTGTCGGC T-GCGTATGC TATTATG 573 hqPGS_C06HBa0057J04.1-22-_SGN-E538151+ (3100 3058,2097 1903,1727 1395) ******************************************************************************** EST sequence 8 +strand 644 n (File: SGN-E538156+) 1 GAGAAAACCA GCCATTAAAC TCTTGGACAG CAGCTGCAAA GAAATTGAGT CATTTATCAT 61 TTCACCGAGT CCCGGGCCGG GTAATGTTCG TGCGGAGTTT CTTGCATATG TCACCGAGTC 121 CCTCACTAGA GGGCCGGGAA TGTATATTAT ATATATGATT GGTGATGAGG ATGGTTATGA 181 TGATGATGAT GACGGAGATG ATGTGATGAC TATTTCACTG AGTCCCTCAC TAGAGGGCCG 241 GGTGTAGACG CTCAGTTTGG TGATCCTCCC GCCTAGGATA TCTACTCTGC TGTTTGGGAG 301 AGCTCCACTG TTCCGGAGCC CAGTCGTTGT GGTACATAAC TTCTTATGTA GTCTTTTGCT 361 TGTCTATGGG TATGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC TTAGAGGTCT 421 GTAGACATCG TGTGGGTTGT ATAATTATGT TTTTGATAAT GGTCTGGACA TGGTTTGTTT 481 GGGATGTCCA CTTGTACAAG TGCAACCTTG TCGGTTGTGT ACATCTTTGT GTATTGTGTA 541 GTGGCAGCCT TGACGGCTGC GTATGCTATT ATGCTTTGAA TAGTGGCAGC CTTGTCGGCT 601 CGCGTATGTT GTTACGGTTG AATGGGTATG ACTCTTTATG AGAT Predicted gene structure (within gDNA segment 3203 to 433): Exon 1 3100 3058 ( 43 n); cDNA 5 47 ( 43 n); score: 0.837 Intron 1 3057 2098 ( 960 n); Pd: 0.997 (s: 0.84), Pa: 0.967 (s: 0.98) Exon 2 2097 1903 ( 195 n); cDNA 48 241 ( 194 n); score: 0.928 Intron 2 1902 1728 ( 175 n); Pd: 0.000 (s: 0.78), Pa: 0.000 (s: 0.96) Exon 3 1727 1395 ( 333 n); cDNA 242 573 ( 332 n); score: 0.896 MATCH C06HBa0057J04.1-22- SGN-E538156+ 0.908 571 0.887 C PGS_C06HBa0057J04.1-22-_SGN-E538156+ (3100 3058,2097 1903,1727 1395) Alignment (genomic DNA sequence = upper lines): AAACCAACCC TGCAACTCTT GGCCAGCAGC TGCAAATAAT TTGGTTAGTA ATCTCCTTGT 3041 |||||| || | ||||||| || ||||||| |||||| || ||| AAACCAGCCA TTAAACTCTT GGACAGCAGC TGCAAAGAAA TTG....... .......... 47 TTGGTGTGTT AATTCTTTAG AATACCCTTG TTAATTATCC ATTAATTTTA AGAAGGGGGC 2981 .......... .......... .......... .......... .......... .......... 47 GTGACCAGTA GCTTAGGAAG TTTGTTTTAG TTATTGAATG TACTAAGTAT GAATGGAAAC 2921 .......... .......... .......... .......... .......... .......... 47 CATAATCGGA TTATTAGTGG TGTCGTGTTG GTGCTTGGGC TGTTTTGATT AAAGCAAACT 2861 .......... .......... .......... .......... .......... .......... 47 GCAGGAAAAT TCTTTGTTGG CATTATGTAT ATGTTGAATG TGATTATGAG TATATACTCC 2801 .......... .......... .......... .......... .......... .......... 47 AAAGGATGAA TACGATAAGG TAGATGTGTT ACGAATTATA AAACGAGTTA TCACTCGGTG 2741 .......... .......... .......... .......... .......... .......... 47 TGTCGTTGCT TCGCTGATAT AGTTGCCGAG ATGGAACTGT TTTGGGGAGG GGGCTGTTTA 2681 .......... .......... .......... .......... .......... .......... 47 ATATGATTCT TTGGGTTATA TGTGTTATTG GTATTGTTGT GGATAATTTG GATTGTTGTG 2621 .......... .......... .......... .......... .......... .......... 47 GATTGGGACG AAGTAAGGAA AATAGGGGAG GTGCTGCCGA ATTTTCGTTA GATTATTAGC 2561 .......... .......... .......... .......... .......... .......... 47 TAGCTTACAA GAAAGTAAAG CACGATATTT ATCTAATTGC GGCACGATTG TTGCTTGTTA 2501 .......... .......... .......... .......... .......... .......... 47 TAGATTAATA GCTTGAGCAG TAAATATTGG ACGTGCGGCT CGATTATACG GTATGTAACG 2441 .......... .......... .......... .......... .......... .......... 47 CTGTCCCTTC TTTCTTTGCT TGGCATGACT TTTAAAAATA AGCGAATAAC GGACAGATTT 2381 .......... .......... .......... .......... .......... .......... 47 GATACTTACC TCTAAAGCGT CTAGGTGATG TATATTCTTG CTTCCACAAT TATTCCTCTA 2321 .......... .......... .......... .......... .......... .......... 47 TATATCGGTT ATGTCTAAGG CTATGATGAT CTCTAATATC TATGGTAATG CTTCTTAGAG 2261 .......... .......... .......... .......... .......... .......... 47 TCATTGAAAT TTTACGTTTT CATATCGTAT TAAAGGTTCA TAATCTTGAT AAAACATTAA 2201 .......... .......... .......... .......... .......... .......... 47 TCTTTGGTAA TACTCCTTGC TGGTTCACGT TGATTGTTCT ATTGAGTTAT AAGAAATGAT 2141 .......... .......... .......... .......... .......... .......... 47 TTTAATTGCA TATGGTTGCT CATAATATTC TGCTCGTGCA TAGAGTTATT TATCATTTCA 2081 ||| ||| |||||||||| .......... .......... .......... .......... ...AGTCATT TATCATTTCA 64 CCGAGTCCCG GGCCGGGTAA TGTTCGTGCG GAGTTTCTTG CATATGTCAC CGAGTTCCTC 2021 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||| |||| CCGAGTCCCG GGCCGGGTAA TGTTCGTGCG GAGTTTCTTG CATATGTCAC CGAGTCCCTC 124 ACTAGAGGGC CGGGTATGTA TATTATATAT ATGATTGGTG ATGAGGATGG TTATGATGAT 1961 |||||||||| |||| ||||| |||||||||| |||||||||| |||||||||| |||||||||| ACTAGAGGGC CGGGAATGTA TATTATATAT ATGATTGGTG ATGAGGATGG TTATGATGAT 184 GATGATGACG GAGATGACGT GATGATTATT TTGCCGAGCC CCTTACTAGG GAAGCTGGGC 1901 |||||||||| ||||||| || ||||| |||| | | ||| | ||| ||||| | || || GATGATGACG GAGATGATGT GATGACTATT TCACTGAGTC CCTCACTAGA G-GGCCGG.. 241 ACCTTAAATG TTAAATATAT GCATGATTTT CATTTAAAAA GTATATGTGT AGCGATATTT 1841 .......... .......... .......... .......... .......... .......... 241 TGTTTCGAGT TGCCACATTG GTATCCTGTC ATCTTTACCT TATGCTTTAC ATACTCAGTA 1781 .......... .......... .......... .......... .......... .......... 241 CATTGTTCGT ACTGACCCCC CTTTCCTCGG GGGGCTGCGT TTCATGCCCG CAGGTGTAGA 1721 ||||||| .......... .......... .......... .......... .......... ...GTGTAGA 248 CGCGCAGTTC GGTGATCCTC CCGCCTAGGA TATCTACTCT GCTGATTGGG AGAGCTCCAC 1661 ||| ||||| |||||||||| |||||||||| |||||||||| |||| ||||| |||||||||| CGCTCAGTTT GGTGATCCTC CCGCCTAGGA TATCTACTCT GCTGTTTGGG AGAGCTCCAC 308 TGTTCCGGAG CCCATTCGTT TTGGTACATA ACTT-TTGTG TAGTCTTTTG CTCGTCTATG 1602 |||||||||| |||| ||||| ||||||||| |||| || || |||||||||| || ||||||| TGTTCCGGAG CCCAGTCGTT GTGGTACATA ACTTCTTATG TAGTCTTTTG CTTGTCTATG 368 GGTATGGCGG GGCCCTGTCC CGTCGAGTTT CACTAATGTA CCCTTAGAGG TCTGTGGACA 1542 ||||| |||| |||||||||| |||| ||||| ||||| | || | |||||||| ||||| |||| GGTAT-GCGG GGCCCTGTCC CGTCAAGTTT CACTACTATA CTCTTAGAGG TCTGTAGACA 427 TTATGTGGGT TGTATATATA TGTTTTGGAT AATGGTCTGG ACATGGTTTG TTTGGGATGT 1482 | ||||||| |||||| || |||||| ||| |||||||||| |||||||||| |||||||||| TCGTGTGGGT TGTATAATTA TGTTTTTGAT AATGGTCTGG ACATGGTTTG TTTGGGATGT 487 CCACTTGTAC AGGGGCAGCC TTGTCGGCTG TGTACATCAT TATGCTTTGA ATAGTGGCGG 1422 |||||||||| | | ||| || ||||||| || |||||||| | | || ||| ||||||| | CCACTTGTAC AAGTGCAACC TTGTCGGTTG TGTACATCTT TGTGTATTGT GTAGTGGCAG 547 CCTTGTCGGC TCGCGTATGC TGTTATG 1395 ||||| |||| | |||||||| | ||||| CCTTGACGGC T-GCGTATGC TATTATG 573 hqPGS_C06HBa0057J04.1-22-_SGN-E538156+ (3100 3058,2097 1903,1727 1395) ******************************************************************************** EST sequence 5 -strand 843 n (File: SGN-E544254-) 1 GAGTCATTTA TCATTTCACC GAGTCCCGGG CCGGGTAATG TTCGTGCGGA GTTTCTTGCA 61 TATGTCACCG AGTCCCTCAC TAGAGGGCCG GGAATGTATA TTATATATAT GATTGGTGAT 121 GAGGATGGTT ATGATGATGA TGATGACGGA GATGATGTGA TGACTATTTC ACCGAGTCCC 181 TCACTAGAGG GCCGGGTACT ATGATGTATA TATAATGATG ATTATTTTGC CGAGTCCCTT 241 ACTAGGGAAG TTAGGCATCT TATATGTTAA AGATATGCAT GATTTTCACT TAAAAAGTAC 301 ATGTGTAGAG ATATCTTGTT TCGACTTATC ATGTTGGTAT CCTGTCATCT TTACCTTATG 361 CTTTACATAC TCAGTACATT GTCCGTACTG ACCCCCTTTT CTCGGGGGGC TGCGTTTCAT 421 GCCCGCAGGT GTAGACGCTC AGTTCGGTGA TCCTCCCGCC TAGGATATCT ACTCTGCTTT 481 TTGGGAGAGC TCCACTGTTC CGGAGCCCAG TCGTTTTGGT ACATAACTTC TTATGTAGTC 541 TTTTGCTTGT CTATGGGTAT GGCGGGGCCC TGTCCCGTCA AGTTTCACTA CTATACTCTT 601 AGAGGTCTGT AGACATCGTG TGGGTTGTAT AATTATGTTT TGGATAATGG TCTGGACATG 661 GTTTGTTTGG GATGTCCATT TGTACAAGTG CAGCCTTGTC GGTTGTGAAC ATCATTGTGT 721 ATTGTGTAGT GGCAGCCTCG TCGGCTGCGT ATGCTATTAT GTTTTGGATA GTGGCGGCCT 781 TGTCGGCTCG CATATGTTGT TACGATTTAA TGGTTATGAC TCTTTATGAG AAAAAAAAAG 841 AAA Predicted gene structure (within gDNA segment 2753 to 1): Exon 1 2042 1988 ( 55 n); cDNA 161 214 ( 54 n); score: 0.818 Intron 1 1987 1943 ( 45 n); Pd: 0.000 (s: 0.84), Pa: 0.000 (s: 0.88) Exon 2 1942 1395 ( 548 n); cDNA 215 761 ( 547 n); score: 0.908 PPA cDNA 829 839 MATCH C06HBa0057J04.1-22- SGN-E544254- 0.900 603 0.715 C PGS_C06HBa0057J04.1-22-_SGN-E544254- (2042 1988,1942 1395) Alignment (genomic DNA sequence = upper lines): TGCATATGTC ACCGAGTTCC TCACTAGAGG GCCGGGTATG TATATTATAT ATATGATTGG 1983 || ||| || ||||||| || |||||||||| |||||||| ||| | ||| |||| TGACTATTTC ACCGAGTCCC TCACTAGAGG GCCGGGTA-C TATGATGTAT ATATA..... 214 TGATGAGGAT GGTTATGATG ATGATGATGA CGGAGATGAC GTGATGATTA TTTTGCCGAG 1923 ||||||||| |||||||||| .......... .......... .......... .......... ATGATGATTA TTTTGCCGAG 234 CCCCTTACTA GGGAAGCTGG GCACCTTAAA TGTTAAATAT ATGCATGATT TTCATTTAAA 1863 ||||||||| |||||| | | ||| |||| | ||||||| || |||||||||| |||| ||||| TCCCTTACTA GGGAAGTTAG GCATCTTATA TGTTAAAGAT ATGCATGATT TTCACTTAAA 294 AAGTATATGT GTAGCGATAT TTTGTTTCGA GTTGCCACAT TGGTATCCTG TCATCTTTAC 1803 ||||| |||| |||| ||||| ||||||||| || || | |||||||||| |||||||||| AAGTACATGT GTAGAGATAT CTTGTTTCGA CTTATCATGT TGGTATCCTG TCATCTTTAC 354 CTTATGCTTT ACATACTCAG TACATTGTTC GTACTGACCC CCCTTTCCTC GGGGGGCTGC 1743 |||||||||| |||||||||| |||||||| | ||||||| || |||||| ||| |||||||||| CTTATGCTTT ACATACTCAG TACATTGTCC GTACTGA-CC CCCTTTTCTC GGGGGGCTGC 413 GTTTCATGCC CGCAGGTGTA GACGCGCAGT TCGGTGATCC TCCCGCCTAG GATATCTACT 1683 |||||||||| |||||||||| ||||| |||| |||||||||| |||||||||| |||||||||| GTTTCATGCC CGCAGGTGTA GACGCTCAGT TCGGTGATCC TCCCGCCTAG GATATCTACT 473 CTGCTGATTG GGAGAGCTCC ACTGTTCCGG AGCCCATTCG TTTTGGTACA TAACTT-TTG 1624 ||||| ||| |||||||||| |||||||||| |||||| ||| |||||||||| |||||| || CTGCTTTTTG GGAGAGCTCC ACTGTTCCGG AGCCCAGTCG TTTTGGTACA TAACTTCTTA 533 TGTAGTCTTT TGCTCGTCTA TGGGTATGGC GGGGCCCTGT CCCGTCGAGT TTCACTAATG 1564 |||||||||| |||| ||||| |||||||||| |||||||||| |||||| ||| ||||||| | TGTAGTCTTT TGCTTGTCTA TGGGTATGGC GGGGCCCTGT CCCGTCAAGT TTCACTACTA 593 TACCCTTAGA GGTCTGTGGA CATTATGTGG GTTGTATATA TATGTTTTGG ATAATGGTCT 1504 ||| |||||| ||||||| || ||| ||||| |||||||| |||||||||| |||||||||| TACTCTTAGA GGTCTGTAGA CATCGTGTGG GTTGTATAAT TATGTTTTGG ATAATGGTCT 653 GGACATGGTT TGTTTGGGAT GTCCACTTGT ACAGGGGCAG CCTTGTCGGC TGTGTACATC 1444 |||||||||| |||||||||| ||||| |||| ||| | |||| ||||||||| |||| ||||| GGACATGGTT TGTTTGGGAT GTCCATTTGT ACAAGTGCAG CCTTGTCGGT TGTGAACATC 713 ATTATGCTTT GAATAGTGGC GGCCTTGTCG GCTCGCGTAT GCTGTTATG 1395 ||| || || | ||||||| |||| |||| ||| |||||| ||| ||||| ATTGTGTATT GTGTAGTGGC AGCCTCGTCG GCT-GCGTAT GCTATTATG 761 hqPGS_C06HBa0057J04.1-22-_SGN-E544254- (2042 1988,1942 1395) ******************************************************************************** EST sequence 13 +strand 470 n (File: SGN-E268096+) 1 GAAAACCAGC CATTAAACTC TTGGACAGCA GCTGCAAAGA AATTGAGTCA TTTATCATTG 61 CACCGAGTCC CGGGCCGGGT AATGTTCGTG CGGAGTTTCT TGCATATGTC ACCGAGTCCC 121 TCACTAGAGG GCCGGGAATG TATATTATAT ATATGATTGG TGATGAGGAT GGTTATGATG 181 ATGATGATGA CGGAGATGAT GTGATGACTA TTTCACTGAG TCCCTCACTA GAGGGCCGGG 241 TGTAGACGCT CAGTTTGGTG ATCCTCCCGC CTAGGATATC TACTCTGCTG TTTGGGAGAG 301 CTCCACTGTT CCGGAGCCCA GTCGTTTTGG TACATAACTT CTTATGTAGT CTTTTGCTTG 361 TCTATGGGTA TGCGGGGCCC TGTCCCGTCA AGTTTCACTA CTATACTCTT AGAGGTCTGT 421 AGACATCGTG TGGGTAGTAT AATTATGTTT TTGATAATGG GCTGGACATG Predicted gene structure (within gDNA segment 3282 to 1): Exon 1 3100 3058 ( 43 n); cDNA 3 45 ( 43 n); score: 0.837 Intron 1 3057 2098 ( 960 n); Pd: 0.997 (s: 0.84), Pa: 0.967 (s: 0.96) Exon 2 2097 1903 ( 195 n); cDNA 46 239 ( 194 n); score: 0.923 Intron 2 1902 1728 ( 175 n); Pd: 0.000 (s: 0.78), Pa: 0.000 (s: 0.96) Exon 3 1727 1497 ( 231 n); cDNA 240 470 ( 231 n); score: 0.907 MATCH C06HBa0057J04.1-22- SGN-E268096+ 0.914 469 0.998 C PGS_C06HBa0057J04.1-22-_SGN-E268096+ (3100 3058,2097 1903,1727 1497) Alignment (genomic DNA sequence = upper lines): AAACCAACCC TGCAACTCTT GGCCAGCAGC TGCAAATAAT TTGGTTAGTA ATCTCCTTGT 3041 |||||| || | ||||||| || ||||||| |||||| || ||| AAACCAGCCA TTAAACTCTT GGACAGCAGC TGCAAAGAAA TTG....... .......... 45 TTGGTGTGTT AATTCTTTAG AATACCCTTG TTAATTATCC ATTAATTTTA AGAAGGGGGC 2981 .......... .......... .......... .......... .......... .......... 45 GTGACCAGTA GCTTAGGAAG TTTGTTTTAG TTATTGAATG TACTAAGTAT GAATGGAAAC 2921 .......... .......... .......... .......... .......... .......... 45 CATAATCGGA TTATTAGTGG TGTCGTGTTG GTGCTTGGGC TGTTTTGATT AAAGCAAACT 2861 .......... .......... .......... .......... .......... .......... 45 GCAGGAAAAT TCTTTGTTGG CATTATGTAT ATGTTGAATG TGATTATGAG TATATACTCC 2801 .......... .......... .......... .......... .......... .......... 45 AAAGGATGAA TACGATAAGG TAGATGTGTT ACGAATTATA AAACGAGTTA TCACTCGGTG 2741 .......... .......... .......... .......... .......... .......... 45 TGTCGTTGCT TCGCTGATAT AGTTGCCGAG ATGGAACTGT TTTGGGGAGG GGGCTGTTTA 2681 .......... .......... .......... .......... .......... .......... 45 ATATGATTCT TTGGGTTATA TGTGTTATTG GTATTGTTGT GGATAATTTG GATTGTTGTG 2621 .......... .......... .......... .......... .......... .......... 45 GATTGGGACG AAGTAAGGAA AATAGGGGAG GTGCTGCCGA ATTTTCGTTA GATTATTAGC 2561 .......... .......... .......... .......... .......... .......... 45 TAGCTTACAA GAAAGTAAAG CACGATATTT ATCTAATTGC GGCACGATTG TTGCTTGTTA 2501 .......... .......... .......... .......... .......... .......... 45 TAGATTAATA GCTTGAGCAG TAAATATTGG ACGTGCGGCT CGATTATACG GTATGTAACG 2441 .......... .......... .......... .......... .......... .......... 45 CTGTCCCTTC TTTCTTTGCT TGGCATGACT TTTAAAAATA AGCGAATAAC GGACAGATTT 2381 .......... .......... .......... .......... .......... .......... 45 GATACTTACC TCTAAAGCGT CTAGGTGATG TATATTCTTG CTTCCACAAT TATTCCTCTA 2321 .......... .......... .......... .......... .......... .......... 45 TATATCGGTT ATGTCTAAGG CTATGATGAT CTCTAATATC TATGGTAATG CTTCTTAGAG 2261 .......... .......... .......... .......... .......... .......... 45 TCATTGAAAT TTTACGTTTT CATATCGTAT TAAAGGTTCA TAATCTTGAT AAAACATTAA 2201 .......... .......... .......... .......... .......... .......... 45 TCTTTGGTAA TACTCCTTGC TGGTTCACGT TGATTGTTCT ATTGAGTTAT AAGAAATGAT 2141 .......... .......... .......... .......... .......... .......... 45 TTTAATTGCA TATGGTTGCT CATAATATTC TGCTCGTGCA TAGAGTTATT TATCATTTCA 2081 ||| ||| ||||||| || .......... .......... .......... .......... ...AGTCATT TATCATTGCA 62 CCGAGTCCCG GGCCGGGTAA TGTTCGTGCG GAGTTTCTTG CATATGTCAC CGAGTTCCTC 2021 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||| |||| CCGAGTCCCG GGCCGGGTAA TGTTCGTGCG GAGTTTCTTG CATATGTCAC CGAGTCCCTC 122 ACTAGAGGGC CGGGTATGTA TATTATATAT ATGATTGGTG ATGAGGATGG TTATGATGAT 1961 |||||||||| |||| ||||| |||||||||| |||||||||| |||||||||| |||||||||| ACTAGAGGGC CGGGAATGTA TATTATATAT ATGATTGGTG ATGAGGATGG TTATGATGAT 182 GATGATGACG GAGATGACGT GATGATTATT TTGCCGAGCC CCTTACTAGG GAAGCTGGGC 1901 |||||||||| ||||||| || ||||| |||| | | ||| | ||| ||||| | || || GATGATGACG GAGATGATGT GATGACTATT TCACTGAGTC CCTCACTAGA G-GGCCGG.. 239 ACCTTAAATG TTAAATATAT GCATGATTTT CATTTAAAAA GTATATGTGT AGCGATATTT 1841 .......... .......... .......... .......... .......... .......... 239 TGTTTCGAGT TGCCACATTG GTATCCTGTC ATCTTTACCT TATGCTTTAC ATACTCAGTA 1781 .......... .......... .......... .......... .......... .......... 239 CATTGTTCGT ACTGACCCCC CTTTCCTCGG GGGGCTGCGT TTCATGCCCG CAGGTGTAGA 1721 ||||||| .......... .......... .......... .......... .......... ...GTGTAGA 246 CGCGCAGTTC GGTGATCCTC CCGCCTAGGA TATCTACTCT GCTGATTGGG AGAGCTCCAC 1661 ||| ||||| |||||||||| |||||||||| |||||||||| |||| ||||| |||||||||| CGCTCAGTTT GGTGATCCTC CCGCCTAGGA TATCTACTCT GCTGTTTGGG AGAGCTCCAC 306 TGTTCCGGAG CCCATTCGTT TTGGTACATA ACTT-TTGTG TAGTCTTTTG CTCGTCTATG 1602 |||||||||| |||| ||||| |||||||||| |||| || || |||||||||| || ||||||| TGTTCCGGAG CCCAGTCGTT TTGGTACATA ACTTCTTATG TAGTCTTTTG CTTGTCTATG 366 GGTATGGCGG GGCCCTGTCC CGTCGAGTTT CACTAATGTA CCCTTAGAGG TCTGTGGACA 1542 ||||| |||| |||||||||| |||| ||||| ||||| | || | |||||||| ||||| |||| GGTAT-GCGG GGCCCTGTCC CGTCAAGTTT CACTACTATA CTCTTAGAGG TCTGTAGACA 425 TTATGTGGGT TGTATATATA TGTTTTGGAT AATGGTCTGG ACATG 1497 | ||||||| ||||| || |||||| ||| ||||| |||| ||||| TCGTGTGGGT AGTATAATTA TGTTTTTGAT AATGGGCTGG ACATG 470 hqPGS_C06HBa0057J04.1-22-_SGN-E268096+ (3100 3058,2097 1903,1727 1497) ******************************************************************************** EST sequence 2 -strand 573 n (File: SGN-E538150-) 1 CTGCGTATGC TATTATGCTT TGAATAGTGG CAGCCTTGTC GGCTCGCGTA TGTTGTTACG 61 GTTGAATGGT TATGACTCTT TATGAGATAG ATCCACTTTA TATATATATA TATGGTGTTG 121 GGTTTGGCTT GAAAAAAAAA AAAAAAAAAA AACTCTTGAT ACAGTATTGG TTGGAAATTC 181 CCAAAGAGTT GCAGGTGCAG ATTATGCATT AGAAAGTATC CACAACATCA GGGAAGCAAT 241 ACCACAACTT TGGGAAGTAG ACAGGCTGGC TGAAGTTAAC TACTCTGGTG TAGCTGTTGA 301 GACATCTGTC ACAGCTTAGA ATCAGTAGTA CTACTATATC TCATCATCAT GCTGATGGCA 361 GAAGGAAAAA AAAATTAATC AAGAATCATG AGAAGATCCA AAATTTTCTG TCAAATTTGA 421 TTTTAAATGA TGTTGATGTT TTGTTGTCAT CAATTAATAA CTAGCTTTTA GTATTTCCTT 481 TCCATCCACA AATCTTGTAA ATAAATTCTA TATTTATCAG TCTACCTTTC TATGATTATA 541 TAATAATGAA GTTCAATTAT TAAAAAAAAA AAA Predicted gene structure (within gDNA segment 2163 to 1): Exon 1 1454 1327 ( 128 n); cDNA 1 131 ( 131 n); score: 0.816 PPA cDNA 562 573 MATCH C06HBa0057J04.1-22- SGN-E538150- 0.816 128 0.223 C PGS_C06HBa0057J04.1-22-_SGN-E538150- (1454 1327) Alignment (genomic DNA sequence = upper lines): CTGTGTACAT CATTATGCTT TGAATAGTGG CGGCCTTGTC GGCTCGCGTA TGCTGTTATG 1395 ||| ||| ||||||||| |||||||||| | |||||||| |||||||||| || ||||| | CTGCGTATGC TATTATGCTT TGAATAGTGG CAGCCTTGTC GGCTCGCGTA TGTTGTTACG 60 GTTGAATGGT TATGACTCCT TATGAGACAT GTCCTCTT-A TATATAT--A TATGACGTTG 1338 |||||||||| |||||||| | ||||||| | ||| ||| | ||||||| | |||| |||| GTTGAATGGT TATGACTCTT TATGAGATAG ATCCACTTTA TATATATATA TATGGTGTTG 120 GGGTTGGCTT G 1327 || ||||||| | GGTTTGGCTT G 131 hqPGS_C06HBa0057J04.1-22-_SGN-E538150- (1454 1327) ******************************************************************************** EST sequence 11 +strand 453 n (File: SGN-E303256+) 1 AACCAGCCAT TAAACTCTTG GACAGCAGCT GCAAAGAAAT TGGTGTAGAC GCTCAGTTCG 61 GTGATCCTCC CGCCTAGGAT ATCTACTCTG CTTTTTGGGA GAGCTCCACT GTTCCGGAGC 121 CCAGTCGTTT TGGTACATAA CTTCTTATGT AGTCTTTTGC TTGTCTATGG GTATGGCGGG 181 GCCCTGTCCC GTCAAGTTTC ACTACTATAC TCTTAGAGGT CTGTAGACAT CGTGTGGGTT 241 GTATAATTAT GTTTTGGATA ATGGTCTGGA CATGGTTTGT TTGGGATGTC CATTTGTACA 301 AGTGCAGCCT TGTCGGTTGT GAACATCATT GTGTATTGTG TAGTGGCAGC CTCGTCGGCT 361 GCGTATGCTA TTATGTTTTG GATAGTGGCG GCCTTGTCGG CTCGCATATG TTGTTACGAT 421 TTAATGGTTA TGACTCTTTA TGAAAAAAAA AAA Predicted gene structure (within gDNA segment 2846 to 1): Exon 1 1727 1395 ( 333 n); cDNA 43 375 ( 333 n); score: 0.905 PPA cDNA 443 453 MATCH C06HBa0057J04.1-22- SGN-E303256+ 0.905 333 0.735 C PGS_C06HBa0057J04.1-22-_SGN-E303256+ (1727 1395) Alignment (genomic DNA sequence = upper lines): GTGTAGACGC GCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT GATTGGGAGA 1668 |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| |||||||| GTGTAGACGC TCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT TTTTGGGAGA 102 GCTCCACTGT TCCGGAGCCC ATTCGTTTTG GTACATAACT T-TTGTGTAG TCTTTTGCTC 1609 |||||||||| |||||||||| | |||||||| |||||||||| | || ||||| ||||||||| GCTCCACTGT TCCGGAGCCC AGTCGTTTTG GTACATAACT TCTTATGTAG TCTTTTGCTT 162 GTCTATGGGT ATGGCGGGGC CCTGTCCCGT CGAGTTTCAC TAATGTACCC TTAGAGGTCT 1549 |||||||||| |||||||||| |||||||||| | |||||||| || | ||| | |||||||||| GTCTATGGGT ATGGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC TTAGAGGTCT 222 GTGGACATTA TGTGGGTTGT ATATATATGT TTTGGATAAT GGTCTGGACA TGGTTTGTTT 1489 || ||||| |||||||||| ||| ||||| |||||||||| |||||||||| |||||||||| GTAGACATCG TGTGGGTTGT ATAATTATGT TTTGGATAAT GGTCTGGACA TGGTTTGTTT 282 GGGATGTCCA CTTGTACAGG GGCAGCCTTG TCGGCTGTGT ACATCATTAT GCTTTGAATA 1429 |||||||||| ||||||| | ||||||||| |||| |||| |||||||| | | ||| || GGGATGTCCA TTTGTACAAG TGCAGCCTTG TCGGTTGTGA ACATCATTGT GTATTGTGTA 342 GTGGCGGCCT TGTCGGCTCG CGTATGCTGT TATG 1395 ||||| |||| ||||||| | |||||||| | |||| GTGGCAGCCT CGTCGGCT-G CGTATGCTAT TATG 375 hqPGS_C06HBa0057J04.1-22-_SGN-E303256+ (1727 1395) ******************************************************************************** EST sequence 10 +strand 691 n (File: SGN-E328093+) 1 GAAAACCAGC CATTAAACTC TTGGACAGCA GCTGCAAAGA AATTGGTTAG TAATCTCTTT 61 GCTTGGTTTG TTAATTCCTT AGAATACCTT TGTTAATTAG ACATTTATGT TAAGAAGGGG 121 GACGTGAACA GTATCTTAGG AATTTGTTTT AGTTATTGAA TGTGCTAAGG ATGAGCAGAA 181 ACCATGATCG GATTGCTAGC GGTGTTATAT TTGTGTTGGG CTGTTTTGAT TAAAGTAAGC 241 TGCTGGAAAT TCTGTTTTGG TGTTATGCAT ATGTTAATAT GATTATGGGT ATATACTCCA 301 AAGGATGAAT ACAATAAGGT AGATGTGTTG CGAATTATAA AACGAATTAT CGGTCGGTGT 361 GTCGTTGTTT TGTTACTATG GTTGCTAAAA ACGGAACTGT TTTGGGGGAG GCTGTTTAAT 421 ATGATTTGTT GGATTATATG TGTTGTTGGT ATTGTTGTGG ATAATTTGGG TTGTTGTTGG 481 ATTGGGATGA AGTAAAGAAA ATAGGGGAAG TGCTGCCGGA TTTTCGTTAG ATTATTAGCT 541 AGCTTACATA AGTAGTAAGC GCGACATTTA TCTAATTGCG GCACGATTGG TGCTTGTTAT 601 AGATTTATAC CTTGAGCAGT AAATATTGGA CGTACGGCTC GACTATTCGG TATGTAACGC 661 TATCCTTTCC TTCTTTGTTT GGCATGACCT T Predicted gene structure (within gDNA segment 3711 to 881): Exon 1 3100 2409 ( 692 n); cDNA 3 691 ( 689 n); score: 0.857 MATCH C06HBa0057J04.1-22- SGN-E328093+ 0.857 692 1.001 C PGS_C06HBa0057J04.1-22-_SGN-E328093+ (3100 2409) Alignment (genomic DNA sequence = upper lines): AAACCAACCC TGCAACTCTT GGCCAGCAGC TGCAAATAAT TTGGTTAGTA ATCTCCTTGT 3041 |||||| || | ||||||| || ||||||| |||||| || |||||||||| ||||| ||| AAACCAGCCA TTAAACTCTT GGACAGCAGC TGCAAAGAAA TTGGTTAGTA ATCTCTTTGC 62 TTGGTGTGTT AATTCTTTAG AATACCCTTG TTAATTATCC ATTAATTTTA AGAAGGGGG- 2982 ||||| |||| ||||| |||| |||||| ||| ||||||| | ||| || ||| ||||||||| TTGGTTTGTT AATTCCTTAG AATACCTTTG TTAATTAGAC ATTTATGTTA AGAAGGGGGA 122 CGTGACCAGT AGCTTAGGAA GTTTGTTTTA GTTATTGAAT GTACTAAGTA TGAATGGAAA 2922 ||||| |||| | |||||||| ||||||||| |||||||||| || ||||| | ||| |||| CGTGAACAGT ATCTTAGGAA -TTTGTTTTA GTTATTGAAT GTGCTAAGGA TGAGCAGAAA 181 CCATAATCGG ATTATTAGTG GTGTCGTGTT GGTGCTTGGG CTGTTTTGAT TAAAGCAAAC 2862 |||| ||||| ||| ||| | |||| | || ||| ||||| |||||||||| ||||| || | CCATGATCGG ATTGCTAGCG GTGTTATATT TGTG-TTGGG CTGTTTTGAT TAAAGTAAGC 240 TGCAGGAAAA TTCTTTGTTG GCATTATGTA TATGTTGAAT GTGATTATGA GTATATACTC 2802 ||| || ||| |||| | ||| | ||||| | |||||| ||| |||||||| |||||||||| TGCTGG-AAA TTCTGTTTTG GTGTTATGCA TATGTT-AAT ATGATTATGG GTATATACTC 298 CAAAGGATGA ATACGATAAG GTAGATGTGT TACGAATTAT AAAACGAGTT ATCACTCGGT 2742 |||||||||| |||| ||||| |||||||||| | |||||||| ||||||| || ||| ||||| CAAAGGATGA ATACAATAAG GTAGATGTGT TGCGAATTAT AAAACGAATT ATCGGTCGGT 358 GTGTCGTTGC TTCGCTGATA TAGTTGC-CG AGATGGAACT GTTTTGGGGA GGGGGCTGTT 2683 ||||||||| || | | || | ||||| | | |||||| ||||| ||| || ||||||| GTGTCGTTGT TTTGTTACTA TGGTTGCTAA AAACGGAACT GTTTT-GGG- GGAGGCTGTT 416 TAATATGATT CTTTGGGTTA TATGTGTTAT TGGTATTGTT GTGGATAATT TGGATTGTTG 2623 |||||||||| |||| ||| |||||||| | |||||||||| |||||||||| ||| |||||| TAATATGATT TGTTGGATTA TATGTGTTGT TGGTATTGTT GTGGATAATT TGGGTTGTTG 476 -TGGATTGGG ACGAAGTAAG GAAAATAGGG GAGGTGCTGC CGAATTTTCG TTAGATTATT 2564 ||||||||| | ||||||| |||||||||| || ||||||| || ||||||| |||||||||| TTGGATTGGG ATGAAGTAAA GAAAATAGGG GAAGTGCTGC CGGATTTTCG TTAGATTATT 536 AGCTAGCTTA CA-AGAAAGT AAAGCACGAT ATTTATCTAA TTGCGGCACG ATTGTTGCTT 2505 |||||||||| || | ||| |||| ||| |||||||||| |||||||||| |||| ||||| AGCTAGCTTA CATAAGTAGT -AAGCGCGAC ATTTATCTAA TTGCGGCACG ATTGGTGCTT 595 GTTATAGATT AATAGCTTGA GCAGTAAATA TTGGACGTGC GGCTCGATTA TACGGTATGT 2445 |||||||||| ||| ||||| |||||||||| |||||||| | ||||||| || | |||||||| GTTATAGATT TATACCTTGA GCAGTAAATA TTGGACGTAC GGCTCGACTA TTCGGTATGT 655 AACGCTGTCC CTTCTTTCTT TGCTTGGCAT GACTTT 2409 |||||| ||| ||| ||||| || ||||||| ||| || AACGCTATCC TTTCCTTCTT TGTTTGGCAT GACCTT 691 hqPGS_C06HBa0057J04.1-22-_SGN-E328093+ (3100 2409) ******************************************************************************** EST sequence 17 +strand 455 n (File: SGN-E298250+) 1 AAATGGAGAA ACCAACCCTG CAACTCTTGG CCAGTAGCTG CAAATAATTT GGGGAGTAAT 61 CTCCTTGTTT GGTGTGTTAA TTCTTTAGAA CACCCTTGTT AATTATCCAT TAATTTTAAG 121 AAGGGGGCGT GACCAGTTGC TTACGAAGTT TGTTTTAGTT ATTGAATGTG CTAAGTATGA 181 ATGGAAACCA TAATCGGATT ATTAGTGGTG TCGTGTTGGT GCTTGGGCTG TTTTGATTAA 241 AGCAAACTGC AGGAAAATTC TATTTTGGCA TTATGTATAT GCTGAATGTG ATTATGAGTA 301 TATACTCCAA CGGATGAATA CGATTAGGTA AATGTGTTGC GAATTATAAA ACGAGTTATC 361 GCTCGGTGTG TCGGTGCTTC GCTGCTATAG TTGCCCAGAC GGAACTGTTT TGGGGAGGGG 421 GCTGCCTAAT GTGATACTTC GGGTTATATG TGTTA Predicted gene structure (within gDNA segment 3708 to 739): Exon 1 3108 2654 ( 455 n); cDNA 1 455 ( 455 n); score: 0.947 MATCH C06HBa0057J04.1-22- SGN-E298250+ 0.947 455 1.000 C PGS_C06HBa0057J04.1-22-_SGN-E298250+ (3108 2654) Alignment (genomic DNA sequence = upper lines): AAATGGAGAA ACCAACCCTG CAACTCTTGG CCAGCAGCTG CAAATAATTT GGTTAGTAAT 3049 |||||||||| |||||||||| |||||||||| |||| ||||| |||||||||| || |||||| AAATGGAGAA ACCAACCCTG CAACTCTTGG CCAGTAGCTG CAAATAATTT GGGGAGTAAT 60 CTCCTTGTTT GGTGTGTTAA TTCTTTAGAA TACCCTTGTT AATTATCCAT TAATTTTAAG 2989 |||||||||| |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| CTCCTTGTTT GGTGTGTTAA TTCTTTAGAA CACCCTTGTT AATTATCCAT TAATTTTAAG 120 AAGGGGGCGT GACCAGTAGC TTAGGAAGTT TGTTTTAGTT ATTGAATGTA CTAAGTATGA 2929 |||||||||| ||||||| || ||| |||||| |||||||||| ||||||||| |||||||||| AAGGGGGCGT GACCAGTTGC TTACGAAGTT TGTTTTAGTT ATTGAATGTG CTAAGTATGA 180 ATGGAAACCA TAATCGGATT ATTAGTGGTG TCGTGTTGGT GCTTGGGCTG TTTTGATTAA 2869 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATGGAAACCA TAATCGGATT ATTAGTGGTG TCGTGTTGGT GCTTGGGCTG TTTTGATTAA 240 AGCAAACTGC AGGAAAATTC TTTGTTGGCA TTATGTATAT GTTGAATGTG ATTATGAGTA 2809 |||||||||| |||||||||| | | |||||| |||||||||| | |||||||| |||||||||| AGCAAACTGC AGGAAAATTC TATTTTGGCA TTATGTATAT GCTGAATGTG ATTATGAGTA 300 TATACTCCAA AGGATGAATA CGATAAGGTA GATGTGTTAC GAATTATAAA ACGAGTTATC 2749 |||||||||| ||||||||| |||| ||||| ||||||| | |||||||||| |||||||||| TATACTCCAA CGGATGAATA CGATTAGGTA AATGTGTTGC GAATTATAAA ACGAGTTATC 360 ACTCGGTGTG TCGTTGCTTC GCTGATATAG TTGCCGAGAT GGAACTGTTT TGGGGAGGGG 2689 ||||||||| ||| |||||| |||| ||||| ||||| ||| |||||||||| |||||||||| GCTCGGTGTG TCGGTGCTTC GCTGCTATAG TTGCCCAGAC GGAACTGTTT TGGGGAGGGG 420 GCTGTTTAAT ATGATTCTTT GGGTTATATG TGTTA 2654 |||| |||| |||| ||| |||||||||| ||||| GCTGCCTAAT GTGATACTTC GGGTTATATG TGTTA 455 hqPGS_C06HBa0057J04.1-22-_SGN-E298250+ (3108 2654) Total number of EST alignments reported: 17 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 3711: PGL 1 (- strand): 3119 1269 AGS-1 (3119 3051,2497 2451,1727 1395,1276 1269) SCR (e 0.862 d 0.900 a 0.875,e 0.979 d 0.991 a 0.000,e 0.908 d 0.305 a 0.973,e 0.750) Exon 1 3119 3051 ( 69 n); score: 0.862 Intron 1 3050 2498 ( 553 n); Pd: 0.900 Pa: 0.875 Exon 2 2497 2451 ( 47 n); score: 0.979 Intron 2 2450 1728 ( 723 n); Pd: 0.991 Pa: 0.000 Exon 3 1727 1395 ( 333 n); score: 0.908 Intron 3 1394 1277 ( 118 n); Pd: 0.305 Pa: 0.973 Exon 4 1276 1269 ( 8 n); score: 0.750 PGS (3119 3051,2497 2451,1727 1395,1276 1269) SGN-E543103- PGS (3119 3051,2497 2451,1727 1395,1276 1269) SGN-E543104+ PGS (1728 1395,1276 1269) SGN-E225616- PGS (1727 1395,1276 1269) SGN-E306317+ PGS (1727 1395,1276 1269) SGN-E303695+ PGS (1727 1395) SGN-E303256+ 3-phase translation of AGS-1 (-strand): . . . . . . 3119 GGCAGCCATGGAAATGGAGAAACCAACCCTGCAACTCTTGGCCAGCAGCTGCAAATAATT G S H G N G E T N P A T L G Q Q L Q I I A A M E M E K P T L Q L L A S S C K - F Q P W K W R N Q P C N S W P A A A N N . : . . . . . : 3059 TGGTTAGTA : ATTAATAGCTTGAGCAGTAAATATTGGACGTGCGGCTCGATTATACG : GTGT W L V : I N S L S S K Y W T C G S I I R : C G - - : L I A - A V N I G R A A R L Y : G V L V S : N - - L E Q - I L D V R L D Y T : V . . . . . . 1723 AGACGCGCAGTTCGGTGATCCTCCCGCCTAGGATATCTACTCTGCTGATTGGGAGAGCTC R R A V R - S S R L G Y L L C - L G E L D A Q F G D P P A - D I Y S A D W E S S - T R S S V I L P P R I S T L L I G R A . . . . . . 1663 CACTGTTCCGGAGCCCATTCGTTTTGGTACATAACTTTTGTGTAGTCTTTTGCTCGTCTA H C S G A H S F W Y I T F V - S F A R L T V P E P I R F G T - L L C S L L L V Y P L F R S P F V L V H N F C V V F C S S . . . . . . 1603 TGGGTATGGCGGGGCCCTGTCCCGTCGAGTTTCACTAATGTACCCTTAGAGGTCTGTGGA W V W R G P V P S S F T N V P L E V C G G Y G G A L S R R V S L M Y P - R S V D M G M A G P C P V E F H - C T L R G L W . . . . . . 1543 CATTATGTGGGTTGTATATATATGTTTTGGATAATGGTCTGGACATGGTTTGTTTGGGAT H Y V G C I Y M F W I M V W T W F V W D I M W V V Y I C F G - W S G H G L F G M T L C G L Y I Y V L D N G L D M V C L G . . . . . . 1483 GTCCACTTGTACAGGGGCAGCCTTGTCGGCTGTGTACATCATTATGCTTTGAATAGTGGC V H L Y R G S L V G C V H H Y A L N S G S T C T G A A L S A V Y I I M L - I V A C P L V Q G Q P C R L C T S L C F E - W . . . : . 1423 GGCCTTGTCGGCTCGCGTATGCTGTTATG : GTTTGTAT G L V G S R M L L W : F V A L S A R V C C Y : G L Y R P C R L A Y A V M : V C Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-22-_PGL-1_AGS-1_PPS_1 (1618 1395,1276 1270) (frame '1'; 231 bp, 77 residues) 1 SFARLWVWRG PVPSSFTNVP LEVCGHYVGC IYMFWIMVWT WFVWDVHLYR GSLVGCVHHY 61 ALNSGGLVGS RMLLWFV AGS-2 (3117 3058,2497 2451,1727 1395,1276 1269) SCR (e 0.842 d 0.997 a 0.875,e 0.894 d 0.991 a 0.000,e 0.908 d 0.305 a 0.973,e 0.750) Exon 1 3117 3058 ( 60 n); score: 0.842 Intron 1 3057 2498 ( 560 n); Pd: 0.997 Pa: 0.875 Exon 2 2497 2451 ( 47 n); score: 0.894 Intron 2 2450 1728 ( 723 n); Pd: 0.991 Pa: 0.000 Exon 3 1727 1395 ( 333 n); score: 0.908 Intron 3 1394 1277 ( 118 n); Pd: 0.305 Pa: 0.973 Exon 4 1276 1269 ( 8 n); score: 0.750 PGS (3117 3058,2497 2451,1727 1395,1276 1269) SGN-E374134- PGS (3116 3058,2497 2451,1727 1395,1276 1269) SGN-E305738+ PGS (3116 3058,2497 2451,1727 1395,1276 1269) SGN-E374135+ PGS (3116 3058,2497 2451,1727 1395) SGN-E310669+ 3-phase translation of AGS-2 (-strand): . . . . . . : 3117 CAGCCATGGAAATGGAGAAACCAACCCTGCAACTCTTGGCCAGCAGCTGCAAATAATTTG : Q P W K W R N Q P C N S W P A A A N N L : S H G N G E T N P A T L G Q Q L Q I I - : A M E M E K P T L Q L L A S S C K - F : . . . . . : . 2497 ATTAATAGCTTGAGCAGTAAATATTGGACGTGCGGCTCGATTATACG : GTGTAGACGCGCA I N S L S S K Y W T C G S I I R : C R R A L I A - A V N I G R A A R L Y : G V D A Q D - - L E Q - I L D V R L D Y T : V - T R . . . . . . 1714 GTTCGGTGATCCTCCCGCCTAGGATATCTACTCTGCTGATTGGGAGAGCTCCACTGTTCC V R - S S R L G Y L L C - L G E L H C S F G D P P A - D I Y S A D W E S S T V P S S V I L P P R I S T L L I G R A P L F . . . . . . 1654 GGAGCCCATTCGTTTTGGTACATAACTTTTGTGTAGTCTTTTGCTCGTCTATGGGTATGG G A H S F W Y I T F V - S F A R L W V W E P I R F G T - L L C S L L L V Y G Y G R S P F V L V H N F C V V F C S S M G M . . . . . . 1594 CGGGGCCCTGTCCCGTCGAGTTTCACTAATGTACCCTTAGAGGTCTGTGGACATTATGTG R G P V P S S F T N V P L E V C G H Y V G A L S R R V S L M Y P - R S V D I M W A G P C P V E F H - C T L R G L W T L C . . . . . . 1534 GGTTGTATATATATGTTTTGGATAATGGTCTGGACATGGTTTGTTTGGGATGTCCACTTG G C I Y M F W I M V W T W F V W D V H L V V Y I C F G - W S G H G L F G M S T C G L Y I Y V L D N G L D M V C L G C P L . . . . . . 1474 TACAGGGGCAGCCTTGTCGGCTGTGTACATCATTATGCTTTGAATAGTGGCGGCCTTGTC Y R G S L V G C V H H Y A L N S G G L V T G A A L S A V Y I I M L - I V A A L S V Q G Q P C R L C T S L C F E - W R P C . . : . 1414 GGCTCGCGTATGCTGTTATG : GTTTGTAT G S R M L L W : F V A R V C C Y : G L Y R L A Y A V M : V C Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-22-_PGL-1_AGS-2_PPS_1 (1618 1395,1276 1270) (frame '1'; 231 bp, 77 residues) 1 SFARLWVWRG PVPSSFTNVP LEVCGHYVGC IYMFWIMVWT WFVWDVHLYR GSLVGCVHHY 61 ALNSGGLVGS RMLLWFV AGS-3 (3100 3058,2097 1903,1727 1327) SCR (e 0.837 d 0.997 a 0.967,e 0.928 d 0.000 a 0.000,e 0.902) Exon 1 3100 3058 ( 43 n); score: 0.837 Intron 1 3057 2098 ( 960 n); Pd: 0.997 Pa: 0.967 Exon 2 2097 1903 ( 195 n); score: 0.928 Intron 2 1902 1728 ( 175 n); Pd: 0.000 Pa: 0.000 Exon 3 1727 1327 ( 401 n); score: 0.902 PGS (1454 1327) SGN-E538150- PGS (3100 3058,2097 1903,1727 1395) SGN-E538151+ PGS (3100 3058,2097 1903,1727 1395) SGN-E538156+ PGS (3100 3058,2097 1903,1727 1497) SGN-E268096+ 3-phase translation of AGS-3 (-strand): . . . . . : . 3100 AAACCAACCCTGCAACTCTTGGCCAGCAGCTGCAAATAATTTG : AGTTATTTATCATTTCA K P T L Q L L A S S C K - F : E L F I I S N Q P C N S W P A A A N N L : S Y L S F H T N P A T L G Q Q L Q I I - : V I Y H F . . . . . . 2080 CCGAGTCCCGGGCCGGGTAATGTTCGTGCGGAGTTTCTTGCATATGTCACCGAGTTCCTC P S P G P G N V R A E F L A Y V T E F L R V P G R V M F V R S F L H M S P S S S T E S R A G - C S C G V S C I C H R V P . . . . . . 2020 ACTAGAGGGCCGGGTATGTATATTATATATATGATTGGTGATGAGGATGGTTATGATGAT T R G P G M Y I I Y M I G D E D G Y D D L E G R V C I L Y I - L V M R M V M M M H - R A G Y V Y Y I Y D W - - G W L - - . . . . . . : 1960 GATGATGACGGAGATGACGTGATGATTATTTTGCCGAGCCCCTTACTAGGGAAGCTGG : GT D D D G D D V M I I L P S P L L G K L : G M M T E M T - - L F C R A P Y - G S W : V - - - R R - R D D Y F A E P L T R E A G : . . . . . . 1725 GTAGACGCGCAGTTCGGTGATCCTCCCGCCTAGGATATCTACTCTGCTGATTGGGAGAGC V D A Q F G D P P A - D I Y S A D W E S - T R S S V I L P P R I S T L L I G R A C R R A V R - S S R L G Y L L C - L G E . . . . . . 1665 TCCACTGTTCCGGAGCCCATTCGTTTTGGTACATAACTTTTGTGTAGTCTTTTGCTCGTC S T V P E P I R F G T - L L C S L L L V P L F R S P F V L V H N F C V V F C S S L H C S G A H S F W Y I T F V - S F A R . . . . . . 1605 TATGGGTATGGCGGGGCCCTGTCCCGTCGAGTTTCACTAATGTACCCTTAGAGGTCTGTG Y G Y G G A L S R R V S L M Y P - R S V M G M A G P C P V E F H - C T L R G L W L W V W R G P V P S S F T N V P L E V C . . . . . . 1545 GACATTATGTGGGTTGTATATATATGTTTTGGATAATGGTCTGGACATGGTTTGTTTGGG D I M W V V Y I C F G - W S G H G L F G T L C G L Y I Y V L D N G L D M V C L G G H Y V G C I Y M F W I M V W T W F V W . . . . . . 1485 ATGTCCACTTGTACAGGGGCAGCCTTGTCGGCTGTGTACATCATTATGCTTTGAATAGTG M S T C T G A A L S A V Y I I M L - I V C P L V Q G Q P C R L C T S L C F E - W D V H L Y R G S L V G C V H H Y A L N S . . . . . . 1425 GCGGCCTTGTCGGCTCGCGTATGCTGTTATGGTTGAATGGTTATGACTCCTTATGAGACA A A L S A R V C C Y G - M V M T P Y E T R P C R L A Y A V M V E W L - L L M R H G G L V G S R M L L W L N G Y D S L - D . . . . 1365 TGTCCTCTTATATATATATATGACGTTGGGGTTGGCTTG C P L I Y I Y D V G V G L V L L Y I Y M T L G L A M S S Y I Y I - R W G W L Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-22-_PGL-1_AGS-3_PPS_1 (1618 1370) (frame '0'; 246 bp, 82 residues) 1 SFARLWVWRG PVPSSFTNVP LEVCGHYVGC IYMFWIMVWT WFVWDVHLYR GSLVGCVHHY 61 ALNSGGLVGS RMLLWLNGYD SL- >C06HBa0057J04.1-22-_PGL-1_AGS-3_PPS_2 (3061 3058,2097 1903,1727 1693) (frame '1'; 231 bp, 77 residues) 1 FELFIISPSP GPGNVRAEFL AYVTEFLTRG PGMYIIYMIG DEDGYDDDDD GDDVMIILPS 61 PLLGKLGVDA QFGDPPA- AGS-4 (2042 1988,1942 1395) SCR (e 0.818 d 0.000 a 0.000,e 0.908) Exon 1 2042 1988 ( 55 n); score: 0.818 Intron 1 1987 1943 ( 45 n); Pd: 0.000 Pa: 0.000 Exon 2 1942 1395 ( 548 n); score: 0.908 PGS (2042 1988,1942 1395) SGN-E544254- 3-phase translation of AGS-4 (-strand): . . . . . . : 2042 TGCATATGTCACCGAGTTCCTCACTAGAGGGCCGGGTATGTATATTATATATATG : GTGAT C I C H R V P H - R A G Y V Y Y I Y : G D A Y V T E F L T R G P G M Y I I Y M : V M H M S P S S S L E G R V C I L Y I W : - . . . . . . 1937 GATTATTTTGCCGAGCCCCTTACTAGGGAAGCTGGGCACCTTAAATGTTAAATATATGCA D Y F A E P L T R E A G H L K C - I Y A I I L P S P L L G K L G T L N V K Y M H - L F C R A P Y - G S W A P - M L N I C . . . . . . 1877 TGATTTTCATTTAAAAAGTATATGTGTAGCGATATTTTGTTTCGAGTTGCCACATTGGTA - F S F K K Y M C S D I L F R V A T L V D F H L K S I C V A I F C F E L P H W Y M I F I - K V Y V - R Y F V S S C H I G . . . . . . 1817 TCCTGTCATCTTTACCTTATGCTTTACATACTCAGTACATTGTTCGTACTGACCCCCCTT S C H L Y L M L Y I L S T L F V L T P L P V I F T L C F T Y S V H C S Y - P P F I L S S L P Y A L H T Q Y I V R T D P P . . . . . . 1757 TCCTCGGGGGGCTGCGTTTCATGCCCGCAGGTGTAGACGCGCAGTTCGGTGATCCTCCCG S S G G C V S C P Q V - T R S S V I L P P R G A A F H A R R C R R A V R - S S R F L G G L R F M P A G V D A Q F G D P P . . . . . . 1697 CCTAGGATATCTACTCTGCTGATTGGGAGAGCTCCACTGTTCCGGAGCCCATTCGTTTTG P R I S T L L I G R A P L F R S P F V L L G Y L L C - L G E L H C S G A H S F W A - D I Y S A D W E S S T V P E P I R F . . . . . . 1637 GTACATAACTTTTGTGTAGTCTTTTGCTCGTCTATGGGTATGGCGGGGCCCTGTCCCGTC V H N F C V V F C S S M G M A G P C P V Y I T F V - S F A R L W V W R G P V P S G T - L L C S L L L V Y G Y G G A L S R . . . . . . 1577 GAGTTTCACTAATGTACCCTTAGAGGTCTGTGGACATTATGTGGGTTGTATATATATGTT E F H - C T L R G L W T L C G L Y I Y V S F T N V P L E V C G H Y V G C I Y M F R V S L M Y P - R S V D I M W V V Y I C . . . . . . 1517 TTGGATAATGGTCTGGACATGGTTTGTTTGGGATGTCCACTTGTACAGGGGCAGCCTTGT L D N G L D M V C L G C P L V Q G Q P C W I M V W T W F V W D V H L Y R G S L V F G - W S G H G L F G M S T C T G A A L . . . . . . 1457 CGGCTGTGTACATCATTATGCTTTGAATAGTGGCGGCCTTGTCGGCTCGCGTATGCTGTT R L C T S L C F E - W R P C R L A Y A V G C V H H Y A L N S G G L V G S R M L L S A V Y I I M L - I V A A L S A R V C C . 1397 ATG M Y Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-22-_PGL-1_AGS-4_PPS_1 (2041 1988,1942 1766) (frame '2'; 228 bp, 76 residues) 1 AYVTEFLTRG PGMYIIYMVM IILPSPLLGK LGTLNVKYMH DFHLKSICVA IFCFELPHWY 61 PVIFTLCFTY SVHCSY- >C06HBa0057J04.1-22-_PGL-1_AGS-4_PPS_2 (1618 1397) (frame '2'; 222 bp, 74 residues) 1 SFARLWVWRG PVPSSFTNVP LEVCGHYVGC IYMFWIMVWT WFVWDVHLYR GSLVGCVHHY 61 ALNSGGLVGS RMLL AGS-5 (3108 2409) SCR (e 0.857) Exon 1 3108 2409 ( 700 n); score: 0.857 PGS (3100 2409) SGN-E328093+ PGS (3108 2654) SGN-E298250+ 3-phase translation of AGS-5 (-strand): . . . . . . 3108 AAATGGAGAAACCAACCCTGCAACTCTTGGCCAGCAGCTGCAAATAATTTGGTTAGTAAT K W R N Q P C N S W P A A A N N L V S N N G E T N P A T L G Q Q L Q I I W L V I M E K P T L Q L L A S S C K - F G - - . . . . . . 3048 CTCCTTGTTTGGTGTGTTAATTCTTTAGAATACCCTTGTTAATTATCCATTAATTTTAAG L L V W C V N S L E Y P C - L S I N F K S L F G V L I L - N T L V N Y P L I L R S P C L V C - F F R I P L L I I H - F - . . . . . . 2988 AAGGGGGCGTGACCAGTAGCTTAGGAAGTTTGTTTTAGTTATTGAATGTACTAAGTATGA K G A - P V A - E V C F S Y - M Y - V - R G R D Q - L R K F V L V I E C T K Y E E G G V T S S L G S L F - L L N V L S M . . . . . . 2928 ATGGAAACCATAATCGGATTATTAGTGGTGTCGTGTTGGTGCTTGGGCTGTTTTGATTAA M E T I I G L L V V S C W C L G C F D - W K P - S D Y - W C R V G A W A V L I K N G N H N R I I S G V V L V L G L F - L . . . . . . 2868 AGCAAACTGCAGGAAAATTCTTTGTTGGCATTATGTATATGTTGAATGTGATTATGAGTA S K L Q E N S L L A L C I C - M - L - V A N C R K I L C W H Y V Y V E C D Y E Y K Q T A G K F F V G I M Y M L N V I M S . . . . . . 2808 TATACTCCAAAGGATGAATACGATAAGGTAGATGTGTTACGAATTATAAAACGAGTTATC Y T P K D E Y D K V D V L R I I K R V I I L Q R M N T I R - M C Y E L - N E L S I Y S K G - I R - G R C V T N Y K T S Y . . . . . . 2748 ACTCGGTGTGTCGTTGCTTCGCTGATATAGTTGCCGAGATGGAACTGTTTTGGGGAGGGG T R C V V A S L I - L P R W N C F G E G L G V S L L R - Y S C R D G T V L G R G H S V C R C F A D I V A E M E L F W G G . . . . . . 2688 GCTGTTTAATATGATTCTTTGGGTTATATGTGTTATTGGTATTGTTGTGGATAATTTGGA A V - Y D S L G Y M C Y W Y C C G - F G L F N M I L W V I C V I G I V V D N L D G C L I - F F G L Y V L L V L L W I I W . . . . . . 2628 TTGTTGTGGATTGGGACGAAGTAAGGAAAATAGGGGAGGTGCTGCCGAATTTTCGTTAGA L L W I G T K - G K - G R C C R I F V R C C G L G R S K E N R G G A A E F S L D I V V D W D E V R K I G E V L P N F R - . . . . . . 2568 TTATTAGCTAGCTTACAAGAAAGTAAAGCACGATATTTATCTAATTGCGGCACGATTGTT L L A S L Q E S K A R Y L S N C G T I V Y - L A Y K K V K H D I Y L I A A R L L I I S - L T R K - S T I F I - L R H D C . . . . . . 2508 GCTTGTTATAGATTAATAGCTTGAGCAGTAAATATTGGACGTGCGGCTCGATTATACGGT A C Y R L I A - A V N I G R A A R L Y G L V I D - - L E Q - I L D V R L D Y T V C L L - I N S L S S K Y W T C G S I I R . . . . 2448 ATGTAACGCTGTCCCTTCTTTCTTTGCTTGGCATGACTTT M - R C P F F L C L A - L C N A V P S F F A W H D F Y V T L S L L S L L G M T Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-5 (+strand): . . . . . . 2409 AAAGTCATGCCAAGCAAAGAAAGAAGGGACAGCGTTACATACCGTATAATCGAGCCGCAC K V M P S K E R R D S V T Y R I I E P H K S C Q A K K E G T A L H T V - S S R T S H A K Q R K K G Q R Y I P Y N R A A . . . . . . 2469 GTCCAATATTTACTGCTCAAGCTATTAATCTATAACAAGCAACAATCGTGCCGCAATTAG V Q Y L L L K L L I Y N K Q Q S C R N - S N I Y C S S Y - S I T S N N R A A I R R P I F T A Q A I N L - Q A T I V P Q L . . . . . . 2529 ATAAATATCGTGCTTTACTTTCTTGTAAGCTAGCTAATAATCTAACGAAAATTCGGCAGC I N I V L Y F L V S - L I I - R K F G S - I S C F T F L - A S - - S N E N S A A D K Y R A L L S C K L A N N L T K I R Q . . . . . . 2589 ACCTCCCCTATTTTCCTTACTTCGTCCCAATCCACAACAATCCAAATTATCCACAACAAT T S P I F L T S S Q S T T I Q I I H N N P P L F S L L R P N P Q Q S K L S T T I H L P Y F P Y F V P I H N N P N Y P Q Q . . . . . . 2649 ACCAATAACACATATAACCCAAAGAATCATATTAAACAGCCCCCTCCCCAAAACAGTTCC T N N T Y N P K N H I K Q P P P Q N S S P I T H I T Q R I I L N S P L P K T V P Y Q - H I - P K E S Y - T A P S P K Q F . . . . . . 2709 ATCTCGGCAACTATATCAGCGAAGCAACGACACACCGAGTGATAACTCGTTTTATAATTC I S A T I S A K Q R H T E - - L V L - F S R Q L Y Q R S N D T P S D N S F Y N S H L G N Y I S E A T T H R V I T R F I I . . . . . . 2769 GTAACACATCTACCTTATCGTATTCATCCTTTGGAGTATATACTCATAATCACATTCAAC V T H L P Y R I H P L E Y I L I I T F N - H I Y L I V F I L W S I Y S - S H S T R N T S T L S Y S S F G V Y T H N H I Q . . . . . . 2829 ATATACATAATGCCAACAAAGAATTTTCCTGCAGTTTGCTTTAATCAAAACAGCCCAAGC I Y I M P T K N F P A V C F N Q N S P S Y T - C Q Q R I F L Q F A L I K T A Q A H I H N A N K E F S C S L L - S K Q P K . . . . . . 2889 ACCAACACGACACCACTAATAATCCGATTATGGTTTCCATTCATACTTAGTACATTCAAT T N T T P L I I R L W F P F I L S T F N P T R H H - - S D Y G F H S Y L V H S I H Q H D T T N N P I M V S I H T - Y I Q . . . . . . 2949 AACTAAAACAAACTTCCTAAGCTACTGGTCACGCCCCCTTCTTAAAATTAATGGATAATT N - N K L P K L L V T P P S - N - W I I T K T N F L S Y W S R P L L K I N G - L - L K Q T S - A T G H A P F L K L M D N . . . . . . 3009 AACAAGGGTATTCTAAAGAATTAACACACCAAACAAGGAGATTACTAACCAAATTATTTG N K G I L K N - H T K Q G D Y - P N Y L T R V F - R I N T P N K E I T N Q I I C - Q G Y S K E L T H Q T R R L L T K L F . . . . 3069 CAGCTGCTGGCCAAGAGTTGCAGGGTTGGTTTCTCCATTT Q L L A K S C R V G F S I S C W P R V A G L V S P F A A A G Q E L Q G W F L H Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-22+_PGL-1_AGS-5_PPS_1 (2569 2772) (frame '2'; 201 bp, 67 residues) 1 SNENSAAPPL FSLLRPNPQQ SKLSTTIPIT HITQRIILNS PLPKTVPSRQ LYQRSNDTPS 61 DNSFYNS- ... finished at: Mon Jul 24 23:17:04 2006 ________________________________________________________________________________ Sequence 23: C06HBa0057J04.1-23, from 1 to 2267, both strands analyzed. ... started at: Mon Jul 24 23:17:04 2006 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 4 HitsTableSize = 0 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 1 ******************************************************************************** EST sequence 1 +strand 832 n (File: SGN-E548992+) 1 TTTCTTTTTT AATTTATTTC TTAACTTTCT CACCTATCCA AGAGGCTGTG GGTTCAATCC 61 CCACTCTCCA CATTTATTTT TCTCCTTTAT TTTTCTCCTA ACTCCCCCAT CCATTCAAGA 121 GGTTGTGGGT TCAATCCACT CTCCACATTT TATTTTGCTC TTTTATTTTC ATCCTAACTC 181 TCTCATCCAT TCAAGAGGTT GTGGGTTCAA TCCGAACTCT CCACATTTTA TTTTCTCCTT 241 TATGTTTCCC CTAACTATCT CATCCATTCA AGAGGTTGTG GGTTCAATCC CCACTCTCCA 301 CGTTTTATTT TCTCCTTTAT GTTTCTCCTA ACTCTCTCAT CCATTCAAGA AGTTGTGGGT 361 TCAACCCCCA CTCTCCACAT TTCATTTTTT TACTTTTTTA TTTTCATGGT TTTTAAACAT 421 GGTGAGGTCA CGGGTTCAAT CCTCCATGAC CTCACTTGTT TTAATTTATT TTTTTTCAAC 481 ATTTTGAACC CTCTTTGCTG ACCGAAATTC ATCAATAGAA GCTAAAAATT TTCTTTTAAT 541 TTTTCAGCAT CCTTTCATCA AGTTACACCT TAGTTCTTAG GGGAATTTCG TAGTGCATTT 601 ATAACCATAT AGGTGCGTCT TTATGTATTC GTTTAGCTAA GACATTATCC CTGAAATCAG 661 CCACATTTCA ACTCATATGA ACATCACATT TATATTAACA TGTGTACATG AAATATAAAG 721 AACTATCATG CCATTTGAAG TCCTAAACCA TGAACAATAG CCATGGCAAC AGACACATAC 781 ATACGTAGTT ACATTTTCGG AAACACAACA TAAGTCACAT AATTTCAACA TA Predicted gene structure (within gDNA segment 2267 to 1): Exon 1 2215 1907 ( 309 n); cDNA 159 460 ( 302 n); score: 0.725 MATCH C06HBa0057J04.1-23- SGN-E548992+ 0.725 309 0.371 C PGS_C06HBa0057J04.1-23-_SGN-E548992+ (2215 1907) Alignment (genomic DNA sequence = upper lines): TCATTCTTTT CCCTCTTTTC TTTCTCATTT AAGACAAGGA GATTGTGGGA TCAATCCCCT 2156 || || ||| | || | | | |||||| | ||| || | ||||||| ||||||| TCTTTTATTT TCATCCTAAC TCTCTCATC- CATTCAA-GA GGTTGTGGGT TCAATCCGAA 216 AAAAGATCAT TAT-TTTTCT C-TTAAT-TT T-TTTTAACT TTCTCACCTA TCCAAGAGGT 2100 ||| | | |||||| | || || || | ||||| ||||| | | | |||||||| CTCTCCACAT TTTATTTTCT CCTTTATGTT TCCCCTAACT ATCTCATCCA TTCAAGAGGT 276 TGTGGGTTCA ATCCCCACTC ACCTCATTTT ATTTTCTCCT TTATTTTTCT CCCAACTCTC 2040 |||||||||| |||||||||| || | |||| |||||||||| |||| ||||| || ||||||| TGTGGGTTCA ATCCCCACTC TCCACGTTTT ATTTTCTCCT TTATGTTTCT CCTAACTCTC 336 TCATCTATTA AAGAGGTTGT TGGTTCAACC CCCACTCACC ACATTTAATT TTTTTTACAT 1980 ||||| ||| |||| ||||| ||||||||| ||||||| || |||||| | | |||||||| | TCATCCATTC AAGAAGTTGT GGGTTCAACC CCCACTCTCC ACATTTCA-T TTTTTTAC-T 394 TTATCTCATG AAATTTTTAC GTAATTTGCA ATTGGTGAGG CCTTGGTTTC AATCCCCAAT 1920 || | | | ||||| | | ||| | |||||||| | || ||| ||||| | || TT-T-T--T- --ATTTTCAT GGTTTTTAAA CATGGTGAGG TCACGGGTTC AATCCTCCAT 447 GACCTCCTAT TTT 1907 |||||| | || GACCTCACTT GTT 460 hqPGS_C06HBa0057J04.1-23-_SGN-E548992+ (2215 1907) Total number of EST alignments reported: 1 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 2267: PGL 1 (- strand): 2215 1907 AGS-1 (2215 1907) SCR (e 0.725) Exon 1 2215 1907 ( 309 n); score: 0.725 PGS (2215 1907) SGN-E548992+ 3-phase translation of AGS-1 (-strand): . . . . . . 2215 TCATTCTTTTCCCTCTTTTCTTTCTCATTTAAGACAAGGAGATTGTGGGATCAATCCCCT S F F S L F S F S F K T R R L W D Q S P H S F P S F L S H L R Q G D C G I N P L I L F P L F F L I - D K E I V G S I P . . . . . . 2155 AAAAGATCATTATTTTTCTCTTAATTTTTTTTAACTTTCTCACCTATCCAAGAGGTTGTG K R S L F F S - F F L T F S P I Q E V V K D H Y F S L N F F - L S H L S K R L W - K I I I F L L I F F N F L T Y P R G C . . . . . . 2095 GGTTCAATCCCCACTCACCTCATTTTATTTTCTCCTTTATTTTTCTCCCAACTCTCTCAT G S I P T H L I L F S P L F F S Q L S H V Q S P L T S F Y F L L Y F S P N S L I G F N P H S P H F I F S F I F L P T L S . . . . . . 2035 CTATTAAAGAGGTTGTTGGTTCAACCCCCACTCACCACATTTAATTTTTTTTACATTTAT L L K R L L V Q P P L T T F N F F Y I Y Y - R G C W F N P H S P H L I F F T F I S I K E V V G S T P T H H I - F F L H L . . . . . . 1975 CTCATGAAATTTTTACGTAATTTGCAATTGGTGAGGCCTTGGTTTCAATCCCCAATGACC L M K F L R N L Q L V R P W F Q S P M T S - N F Y V I C N W - G L G F N P Q - P S H E I F T - F A I G E A L V S I P N D . 1915 TCCTATTTT S Y F P I L L F Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-23-_PGL-1_AGS-1_PPS_1 (2131 1907) (frame '1'; 225 bp, 75 residues) 1 FFLTFSPIQE VVGSIPTHLI LFSPLFFSQL SHLLKRLLVQ PPLTTFNFFY IYLMKFLRNL 61 QLVRPWFQSP MTSYF 3-phase translation of AGS-1 (+strand): . . . . . . 1907 AAAATAGGAGGTCATTGGGGATTGAAACCAAGGCCTCACCAATTGCAAATTACGTAAAAA K I G G H W G L K P R P H Q L Q I T - K K - E V I G D - N Q G L T N C K L R K N N R R S L G I E T K A S P I A N Y V K . . . . . . 1967 TTTCATGAGATAAATGTAAAAAAAATTAAATGTGGTGAGTGGGGGTTGAACCAACAACCT F H E I N V K K I K C G E W G L N Q Q P F M R - M - K K L N V V S G G - T N N L I S - D K C K K N - M W - V G V E P T T . . . . . . 2027 CTTTAATAGATGAGAGAGTTGGGAGAAAAATAAAGGAGAAAATAAAATGAGGTGAGTGGG L - - M R E L G E K - R R K - N E V S G F N R - E S W E K N K G E N K M R - V G S L I D E R V G R K I K E K I K - G E W . . . . . . 2087 GATTGAACCCACAACCTCTTGGATAGGTGAGAAAGTTAAAAAAAATTAAGAGAAAAATAA D - T H N L L D R - E S - K K L R E K - I E P T T S W I G E K V K K N - E K N N G L N P Q P L G - V R K L K K I K R K I . . . . . . 2147 TGATCTTTTAGGGGATTGATCCCACAATCTCCTTGTCTTAAATGAGAAAGAAAAGAGGGA - S F R G L I P Q S P C L K - E R K E G D L L G D - S H N L L V L N E K E K R E M I F - G I D P T I S L S - M R K K R G . 2207 AAAGAATGA K E - K N K R M Maximal non-overlapping open reading frames (>= 64 codons): none ... finished at: Mon Jul 24 23:17:14 2006 ________________________________________________________________________________ Sequence 24: C06HBa0057J04.1-24, from 1 to 2887, both strands analyzed. ... started at: Mon Jul 24 23:17:14 2006 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 4 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 12 ******************************************************************************** EST sequence 9 -strand 691 n (File: SGN-E328093-) 1 AAGGTCATGC CAAACAAAGA AGGAAAGGAT AGCGTTACAT ACCGAATAGT CGAGCCGTAC 61 GTCCAATATT TACTGCTCAA GGTATAAATC TATAACAAGC ACCAATCGTG CCGCAATTAG 121 ATAAATGTCG CGCTTACTAC TTATGTAAGC TAGCTAATAA TCTAACGAAA ATCCGGCAGC 181 ACTTCCCCTA TTTTCTTTAC TTCATCCCAA TCCAACAACA ACCCAAATTA TCCACAACAA 241 TACCAACAAC ACATATAATC CAACAAATCA TATTAAACAG CCTCCCCCAA AACAGTTCCG 301 TTTTTAGCAA CCATAGTAAC AAAACAACGA CACACCGACC GATAATTCGT TTTATAATTC 361 GCAACACATC TACCTTATTG TATTCATCCT TTGGAGTATA TACCCATAAT CATATTAACA 421 TATGCATAAC ACCAAAACAG AATTTCCAGC AGCTTACTTT AATCAAAACA GCCCAACACA 481 AATATAACAC CGCTAGCAAT CCGATCATGG TTTCTGCTCA TCCTTAGCAC ATTCAATAAC 541 TAAAACAAAT TCCTAAGATA CTGTTCACGT CCCCCTTCTT AACATAAATG TCTAATTAAC 601 AAAGGTATTC TAAGGAATTA ACAAACCAAG CAAAGAGATT ACTAACCAAT TTCTTTGCAG 661 CTGCTGTCCA AGAGTTTAAT GGCTGGTTTT C Predicted gene structure (within gDNA segment 1 to 2887): Exon 1 3 650 ( 648 n); cDNA 46 689 ( 644 n); score: 0.859 MATCH C06HBa0057J04.1-24+ SGN-E328093- 0.859 648 0.938 C PGS_C06HBa0057J04.1-24+_SGN-E328093- (3 650) Alignment (genomic DNA sequence = upper lines): ATAATCGAGC CGCACGTCCA ATATTTACTG CTCAAGCTAT TAATCTATAA CAAGCAACAA 62 ||| |||||| || ||||||| |||||||||| |||||| ||| ||||||||| |||||| ||| ATAGTCGAGC CGTACGTCCA ATATTTACTG CTCAAGGTAT AAATCTATAA CAAGCACCAA 105 TCGTGCCGCA ATTAGATAAA CATCGTGCTT TACT-TTCTT GTAAGCTAGC TAATAATCTA 121 |||||||||| |||||||||| ||| || | |||| | | |||||||||| |||||||||| TCGTGCCGCA ATTAGATAAA TGTCGCGC-T TACTACTTAT GTAAGCTAGC TAATAATCTA 164 ACGAAAATTC GGCAGCACCT CCCCTATTTT CCTTACTTCG TCCCAATCCG ACAACAATCC 181 |||||||| | |||||||| | |||||||||| | ||||||| ||||||||| ||||||| || ACGAAAATCC GGCAGCACTT CCCCTATTTT CTTTACTTCA TCCCAATCCA ACAACAACCC 224 AAATTATCCA CAACAATACC AATAACACAT ATAACCCAAA GAATCATATT AAACAGCCCC 241 |||||||||| |||||||||| || ||||||| |||| |||| ||||||||| |||||| || AAATTATCCA CAACAATACC AACAACACAT ATAATCCAAC AAATCATATT AAACAG-CCT 283 CTCCCCAAAA CAGTTCC-AT CTCGGCAACT ATATCAGCGA AGCAACGACA CACCGAGTGA 300 | |||||||| ||||||| | | ||||| ||| | | | | |||||||| |||||| || C-CCCCAAAA CAGTTCCGTT TTTAGCAACC ATAGTAACAA AACAACGACA CACCGACCGA 342 TAACTCGTTT TATAATTCGT AACACATCTA CCTTATCGTA TTCATCCTTT GGAGTATATA 360 ||| |||||| ||||||||| |||||||||| |||||| ||| |||||||||| |||||||||| TAATTCGTTT TATAATTCGC AACACATCTA CCTTATTGTA TTCATCCTTT GGAGTATATA 402 CTCATAATCA CATTCAACAT ATACATAATG CCAAAACATA ATTTTCCTGC AGTTTGCTTT 420 | |||||||| ||| ||||| || ||||| |||||||| | | ||||| || || || |||| CCCATAATCA TATT-AACAT ATGCATAACA CCAAAACAGA A-TTTCCAGC AGCTTACTTT 460 AATCAAAACA GCCCAAGCAC CAACACGACA CCACTAATAA TCCGATTATG GTTTCCATTC 480 |||||||||| |||||| ||| || | ||| || ||| || |||||| ||| ||||| || AATCAAAACA GCCCAA-CAC AAATATAACA CCGCTAGCAA TCCGATCATG GTTTCTGCTC 519 ATACTTAGTA CATTCAATAA CTAAAACAAA CTTCCTAAGC TACTGGTCAC G-CCCCCTTC 539 || ||||| | |||||||||| |||||||||| |||||||| ||||| |||| | |||||||| ATCCTTAGCA CATTCAATAA CTAAAACAAA -TTCCTAAGA TACTGTTCAC GTCCCCCTTC 578 TTAAAATTAA TGGATAATTA ACAAGGGTAT TCTAAAGAAT TAACACACCA AACAAGGAGA 599 |||| || || || |||||| |||| ||||| ||||| |||| ||||| |||| | ||| |||| TTAACATAAA TGTCTAATTA ACAAAGGTAT TCTAAGGAAT TAACAAACCA AGCAAAGAGA 638 TTACTAACCA AATTATTTGC AGCTGCTGGC CAAGAGTTGC AGGGTTGGTT T 650 |||||||||| | || ||||| |||||||| | |||||||| | || ||||| | TTACTAACCA ATTTCTTTGC AGCTGCTGTC CAAGAGTTTA ATGGCTGGTT T 689 hqPGS_C06HBa0057J04.1-24+_SGN-E328093- (3 650) ******************************************************************************** EST sequence 16 -strand 455 n (File: SGN-E298250-) 1 TAACACATAT AACCCGAAGT ATCACATTAG GCAGCCCCCT CCCCAAAACA GTTCCGTCTG 61 GGCAACTATA GCAGCGAAGC ACCGACACAC CGAGCGATAA CTCGTTTTAT AATTCGCAAC 121 ACATTTACCT AATCGTATTC ATCCGTTGGA GTATATACTC ATAATCACAT TCAGCATATA 181 CATAATGCCA AAATAGAATT TTCCTGCAGT TTGCTTTAAT CAAAACAGCC CAAGCACCAA 241 CACGACACCA CTAATAATCC GATTATGGTT TCCATTCATA CTTAGCACAT TCAATAACTA 301 AAACAAACTT CGTAAGCAAC TGGTCACGCC CCCTTCTTAA AATTAATGGA TAATTAACAA 361 GGGTGTTCTA AAGAATTAAC ACACCAAACA AGGAGATTAC TCCCCAAATT ATTTGCAGCT 421 ACTGGCCAAG AGTTGCAGGG TTGGTTTCTC CATTT Predicted gene structure (within gDNA segment 1 to 1258): Exon 1 204 658 ( 455 n); cDNA 1 455 ( 455 n); score: 0.947 MATCH C06HBa0057J04.1-24+ SGN-E298250- 0.947 455 1.000 C PGS_C06HBa0057J04.1-24+_SGN-E298250- (204 658) Alignment (genomic DNA sequence = upper lines): TAACACATAT AACCCAAAGA ATCATATTAA ACAGCCCCCT CCCCAAAACA GTTCCATCTC 263 |||||||||| ||||| ||| |||| |||| ||||||||| |||||||||| ||||| ||| TAACACATAT AACCCGAAGT ATCACATTAG GCAGCCCCCT CCCCAAAACA GTTCCGTCTG 60 GGCAACTATA TCAGCGAAGC AACGACACAC CGAGTGATAA CTCGTTTTAT AATTCGTAAC 323 |||||||||| ||||||||| | |||||||| |||| ||||| |||||||||| |||||| ||| GGCAACTATA GCAGCGAAGC ACCGACACAC CGAGCGATAA CTCGTTTTAT AATTCGCAAC 120 ACATCTACCT TATCGTATTC ATCCTTTGGA GTATATACTC ATAATCACAT TCAACATATA 383 |||| ||||| ||||||||| |||| ||||| |||||||||| |||||||||| ||| |||||| ACATTTACCT AATCGTATTC ATCCGTTGGA GTATATACTC ATAATCACAT TCAGCATATA 180 CATAATGCCA AAACATAATT TTCCTGCAGT TTGCTTTAAT CAAAACAGCC CAAGCACCAA 443 |||||||||| ||| | |||| |||||||||| |||||||||| |||||||||| |||||||||| CATAATGCCA AAATAGAATT TTCCTGCAGT TTGCTTTAAT CAAAACAGCC CAAGCACCAA 240 CACGACACCA CTAATAATCC GATTATGGTT TCCATTCATA CTTAGTACAT TCAATAACTA 503 |||||||||| |||||||||| |||||||||| |||||||||| ||||| |||| |||||||||| CACGACACCA CTAATAATCC GATTATGGTT TCCATTCATA CTTAGCACAT TCAATAACTA 300 AAACAAACTT CCTAAGCTAC TGGTCACGCC CCCTTCTTAA AATTAATGGA TAATTAACAA 563 |||||||||| | ||||| || |||||||||| |||||||||| |||||||||| |||||||||| AAACAAACTT CGTAAGCAAC TGGTCACGCC CCCTTCTTAA AATTAATGGA TAATTAACAA 360 GGGTATTCTA AAGAATTAAC ACACCAAACA AGGAGATTAC TAACCAAATT ATTTGCAGCT 623 |||| ||||| |||||||||| |||||||||| |||||||||| | ||||||| |||||||||| GGGTGTTCTA AAGAATTAAC ACACCAAACA AGGAGATTAC TCCCCAAATT ATTTGCAGCT 420 GCTGGCCAAG AGTTGCAGGG TTGGTTTCTC CATTT 658 ||||||||| |||||||||| |||||||||| ||||| ACTGGCCAAG AGTTGCAGGG TTGGTTTCTC CATTT 455 hqPGS_C06HBa0057J04.1-24+_SGN-E298250- (204 658) ******************************************************************************** EST sequence 7 +strand 644 n (File: SGN-E538156+) 1 GAGAAAACCA GCCATTAAAC TCTTGGACAG CAGCTGCAAA GAAATTGAGT CATTTATCAT 61 TTCACCGAGT CCCGGGCCGG GTAATGTTCG TGCGGAGTTT CTTGCATATG TCACCGAGTC 121 CCTCACTAGA GGGCCGGGAA TGTATATTAT ATATATGATT GGTGATGAGG ATGGTTATGA 181 TGATGATGAT GACGGAGATG ATGTGATGAC TATTTCACTG AGTCCCTCAC TAGAGGGCCG 241 GGTGTAGACG CTCAGTTTGG TGATCCTCCC GCCTAGGATA TCTACTCTGC TGTTTGGGAG 301 AGCTCCACTG TTCCGGAGCC CAGTCGTTGT GGTACATAAC TTCTTATGTA GTCTTTTGCT 361 TGTCTATGGG TATGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC TTAGAGGTCT 421 GTAGACATCG TGTGGGTTGT ATAATTATGT TTTTGATAAT GGTCTGGACA TGGTTTGTTT 481 GGGATGTCCA CTTGTACAAG TGCAACCTTG TCGGTTGTGT ACATCTTTGT GTATTGTGTA 541 GTGGCAGCCT TGACGGCTGC GTATGCTATT ATGCTTTGAA TAGTGGCAGC CTTGTCGGCT 601 CGCGTATGTT GTTACGGTTG AATGGGTATG ACTCTTTATG AGAT Predicted gene structure (within gDNA segment 2887 to 1): Exon 1 2855 2664 ( 192 n); cDNA 50 241 ( 192 n); score: 0.878 Intron 1 2663 2487 ( 177 n); Pd: 0.000 (s: 0.68), Pa: 0.000 (s: 0.88) Exon 2 2486 2082 ( 405 n); cDNA 242 643 ( 402 n); score: 0.879 MATCH C06HBa0057J04.1-24- SGN-E538156+ 0.879 597 0.927 C PGS_C06HBa0057J04.1-24-_SGN-E538156+ (2855 2664,2486 2082) Alignment (genomic DNA sequence = upper lines): TCATTTATCA TTTCACCGAA TCCCGGGATG GGTAATGTTC ATGCGGAGTT TCTTGCATAT 2796 |||||||||| ||||||||| ||||||| | |||||||||| ||||||||| |||||||||| TCATTTATCA TTTCACCGAG TCCCGGGCCG GGTAATGTTC GTGCGGAGTT TCTTGCATAT 109 GTCACTGAGT CCCTCAATAG AGGGCCGGGT ATGTATATTA TATATATGAT TGATGATGAG 2736 ||||| |||| |||||| ||| ||||||||| |||||||||| |||||||||| || ||||||| GTCACCGAGT CCCTCACTAG AGGGCCGGGA ATGTATATTA TATATATGAT TGGTGATGAG 169 GATGGTTATG ATGATGATGA TGACAGAGAT G-TGTGATGA TTATTTTGTC GAGCCCCTTA 2677 |||||||||| |||||||||| |||| ||||| | |||||||| ||||| ||| |||| | GATGGTTATG ATGATGATGA TGACGGAGAT GATGTGATGA CTATTTCACT GAGTCCCTCA 229 CTAGGGAAGC TGTGCACCTT ATATGTTAAA GATATGCATG ATTTTCACTT AAAAGGGTAC 2617 |||| | || | CTAGAG-GGC CGG....... .......... .......... .......... .......... 241 ATGTGTAGCG GTATTTTGTT TCAACTTACC ATATTGGTAT CCTATCATCT TTACATTCTG 2557 .......... .......... .......... .......... .......... .......... 241 CTTTACATAC TTAGTACATT GTCCGTACTG ACTCCCGTTT CCTCAAGGGG GCTGCGTTTC 2497 .......... .......... .......... .......... .......... .......... 241 ATGCCTCTAG GTGTAGACGC ACAATTCGGT GATCCTCCTA CCTAGGATAT CTGCTCTGCT 2437 |||||||||| || || ||| |||||||| |||||||||| || ||||||| .......... GTGTAGACGC TCAGTTTGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT 291 GATTGGGAGA GCTTTCCTGT TCTGGAGACT AGTCGTTTTG GTACGTAACT TTTTGTGTAG 2377 | |||||||| ||| |||| || |||| | ||||||| || |||| ||||| | || ||||| GTTTGGGAGA GCTCCACTGT TCCGGAGCCC AGTCGTTGTG GTACATAACT TCTTATGTAG 351 TCTTTTGCTT GTCCATGGGT ATGGCGGGGG CCCTGTCCCG TCGAGTTTCA CTACTATGCT 2317 |||||||||| ||| |||||| || || |||| |||||||||| || ||||||| ||||||| || TCTTTTGCTT GTCTATGGGT AT-GC-GGGG CCCTGTCCCG TCAAGTTTCA CTACTATACT 409 CTTAGAGGTC TGTAGACATT ATGTGGGTTG TATATATATG TTTTGGATAA TGGTATAGAC 2257 |||||||||| ||||||||| ||||||||| |||| |||| |||| ||||| |||| | ||| CTTAGAGGTC TGTAGACATC GTGTGGGTTG TATAATTATG TTTTTGATAA TGGTCTGGAC 469 ATGGTTTGTT TGGGATGTCC GCTTGTACAT GGGCAGCCTT GTCGGGTGCG TACATCATTA 2197 |||||||||| |||||||||| |||||||| | ||| |||| ||||| || | |||||| || ATGGTTTGTT TGGGATGTCC ACTTGTACAA GTGCAACCTT GTCGGTTGTG TACATCTTTG 529 TGTATTGTGT AGGGGCAGCC TTGTCGGCTT GCGTATGCTA TTATGCTTTG GATAGTGGTG 2137 |||||||||| || ||||||| ||| |||| | |||||||||| |||||||||| ||||||| TGTATTGTGT AGTGGCAGCC TTGACGGC-T GCGTATGCTA TTATGCTTTG AATAGTGGCA 588 GCCTTTTTGG CTCGTGTATG TTGTTACAGT TGAATGGTTA TGACTCCTTA TGAGA 2082 ||||| | || |||| ||||| ||||||| || ||||||| || |||||| ||| ||||| GCCTTGTCGG CTCGCGTATG TTGTTACGGT TGAATGGGTA TGACTCTTTA TGAGA 643 hqPGS_C06HBa0057J04.1-24-_SGN-E538156+ (2855 2664,2486 2082) ******************************************************************************** EST sequence 4 -strand 843 n (File: SGN-E544254-) 1 GAGTCATTTA TCATTTCACC GAGTCCCGGG CCGGGTAATG TTCGTGCGGA GTTTCTTGCA 61 TATGTCACCG AGTCCCTCAC TAGAGGGCCG GGAATGTATA TTATATATAT GATTGGTGAT 121 GAGGATGGTT ATGATGATGA TGATGACGGA GATGATGTGA TGACTATTTC ACCGAGTCCC 181 TCACTAGAGG GCCGGGTACT ATGATGTATA TATAATGATG ATTATTTTGC CGAGTCCCTT 241 ACTAGGGAAG TTAGGCATCT TATATGTTAA AGATATGCAT GATTTTCACT TAAAAAGTAC 301 ATGTGTAGAG ATATCTTGTT TCGACTTATC ATGTTGGTAT CCTGTCATCT TTACCTTATG 361 CTTTACATAC TCAGTACATT GTCCGTACTG ACCCCCTTTT CTCGGGGGGC TGCGTTTCAT 421 GCCCGCAGGT GTAGACGCTC AGTTCGGTGA TCCTCCCGCC TAGGATATCT ACTCTGCTTT 481 TTGGGAGAGC TCCACTGTTC CGGAGCCCAG TCGTTTTGGT ACATAACTTC TTATGTAGTC 541 TTTTGCTTGT CTATGGGTAT GGCGGGGCCC TGTCCCGTCA AGTTTCACTA CTATACTCTT 601 AGAGGTCTGT AGACATCGTG TGGGTTGTAT AATTATGTTT TGGATAATGG TCTGGACATG 661 GTTTGTTTGG GATGTCCATT TGTACAAGTG CAGCCTTGTC GGTTGTGAAC ATCATTGTGT 721 ATTGTGTAGT GGCAGCCTCG TCGGCTGCGT ATGCTATTAT GTTTTGGATA GTGGCGGCCT 781 TGTCGGCTCG CATATGTTGT TACGATTTAA TGGTTATGAC TCTTTATGAG AAAAAAAAAG 841 AAA Predicted gene structure (within gDNA segment 2887 to 1): Exon 1 2802 2753 ( 50 n); cDNA 161 209 ( 49 n); score: 0.800 Intron 1 2752 2710 ( 43 n); Pd: 0.000 (s: 0.80), Pa: 0.842 (s: 0.80) Exon 2 2709 2082 ( 628 n); cDNA 210 831 ( 622 n); score: 0.877 PPA cDNA 832 843 MATCH C06HBa0057J04.1-24- SGN-E544254- 0.872 678 0.804 C PGS_C06HBa0057J04.1-24-_SGN-E544254- (2802 2753,2709 2082) Alignment (genomic DNA sequence = upper lines): TGCATATGTC ACTGAGTCCC TCAATAGAGG GCCGGGTATG TATATTATAT ATATGATTGA 2743 || ||| || || ||||||| ||| |||||| |||||||| ||| | ||| TGACTATTTC ACCGAGTCCC TCACTAGAGG GCCGGGTA-C TATGATGTAT .......... 209 TGATGAGGAT GGTTATGATG ATGATGATGA CAGAGATGTG TGATGATTAT TTTGTCGAGC 2683 | || |||||||||| |||| |||| .......... .......... .......... ...ATAT-AA TGATGATTAT TTTGCCGAGT 235 CCCTTACTAG GGAAGCTGTG CACCTTATAT GTTAAAGATA TGCATGATTT TCACTTAAAA 2623 |||||||||| ||||| | | || ||||||| |||||||||| |||||||||| |||||||||| CCCTTACTAG GGAAGTTAGG CATCTTATAT GTTAAAGATA TGCATGATTT TCACTTAAAA 295 GGGTACATGT GTAGCGGTAT TTTGTTTCAA CTTACCATAT TGGTATCCTA TCATCTTTAC 2563 |||||||| |||| | ||| ||||||| | |||| ||| | ||||||||| |||||||||| -AGTACATGT GTAGAGATAT CTTGTTTCGA CTTATCATGT TGGTATCCTG TCATCTTTAC 354 ATTCTGCTTT ACATACTTAG TACATTGTCC GTACTGACTC CCGTTTCCTC AAGGGGGCTG 2503 || |||||| ||||||| || |||||||||| |||||||| | || ||| ||| |||||||| CTTATGCTTT ACATACTCAG TACATTGTCC GTACTGAC-C CCCTTTTCTC -GGGGGGCTG 412 CGTTTCATGC CTCTAGGTGT AGACGCACAA TTCGGTGATC CTCCTACCTA GGATATCTGC 2443 |||||||||| | |||||| |||||| || |||||||||| |||| |||| |||||||| | CGTTTCATGC CCGCAGGTGT AGACGCTCAG TTCGGTGATC CTCCCGCCTA GGATATCTAC 472 TCTGCTGATT GGGAGAGCTT TCCTGTTCTG GAGACTAGTC GTTTTGGTAC GTAACTTTTT 2383 |||||| || ||||||||| |||||| | ||| | |||| |||||||||| |||||| || TCTGCTTTTT GGGAGAGCTC CACTGTTCCG GAGCCCAGTC GTTTTGGTAC ATAACTTCTT 532 GTGTAGTCTT TTGCTTGTCC ATGGGTATGG CGGGGGCCCT GTCCCGTCGA GTTTCACTAC 2323 ||||||||| ||||||||| |||||||||| | |||||||| |||||||| | |||||||||| ATGTAGTCTT TTGCTTGTCT ATGGGTATGG C-GGGGCCCT GTCCCGTCAA GTTTCACTAC 591 TATGCTCTTA GAGGTCTGTA GACATTATGT GGGTTGTATA TATATGTTTT GGATAATGGT 2263 ||| |||||| |||||||||| ||||| ||| |||||||||| |||||||| |||||||||| TATACTCTTA GAGGTCTGTA GACATCGTGT GGGTTGTATA ATTATGTTTT GGATAATGGT 651 ATAGACATGG TTTGTTTGGG ATGTCCGCTT GTACATGGGC AGCCTTGTCG GGTGCGTACA 2203 | ||||||| |||||||||| |||||| || ||||| | || |||||||||| | || | ||| CTGGACATGG TTTGTTTGGG ATGTCCATTT GTACAAGTGC AGCCTTGTCG GTTGTGAACA 711 TCATTATGTA TTGTGTAGGG GCAGCCTTGT CGGCTTGCGT ATGCTATTAT GCTTTGGATA 2143 ||||| |||| |||||||| | ||||||| || |||| ||||| |||||||||| | |||||||| TCATTGTGTA TTGTGTAGTG GCAGCCTCGT CGGC-TGCGT ATGCTATTAT GTTTTGGATA 770 GTGGTGGCCT TTTTGGCTCG TGTATGTTGT TACAGTTGAA TGGTTATGAC TCCTTATGAG 2083 |||| ||||| | | |||||| |||||||| ||| || || |||||||||| || ||||||| GTGGCGGCCT TGTCGGCTCG CATATGTTGT TACGATTTAA TGGTTATGAC TCTTTATGAG 830 A 2082 | A 831 hqPGS_C06HBa0057J04.1-24-_SGN-E544254- (2802 2753,2709 2082) ******************************************************************************** EST sequence 6 +strand 606 n (File: SGN-E538151+) 1 GAGAAAACCA GCCATTAAAC TCTTGGACAG CAGCTGCAAA GAAATTGAGT CATTTATCAT 61 TTCACCGAGT CCCGGGCCGG GTAATGTTCG TGCGGAGTTT CTTGCATATG TCACCGAGTC 121 CCTCACTAGA GGGCCGGGAA TGTATATTAT ATATATGATT GGTGATGAGG ATGGTTATGA 181 TGATGATGAT GACGGAGATG ATGTGATGAC TATTTCACTG AGTCCCTCAC TAGAGGGCCG 241 GGTGTAGACG CTCAGTTTGG TGATCCTCCC GCCTAGGATA TCTACTCTGC TGTTTGGGAG 301 AGCTCCACTG TTCCGGAGCC CAGTCGTTTT GGTACATAAC TTCTTATGTA GTCTTTTGCT 361 TGTCTATGGG TATGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC TTAGAGGTCT 421 GTAGACATCG TGTGGGTTGT ATAATTATGT TTTTGATAAT GGTCTGGACA TGGTTTGTTT 481 GGGATGTCCA CTTGTACAAG TGCAACCTTG TCGGTTGTGT ACATCTTTGT GTATTGTGTA 541 GTGGCAGCCT TGTCGGCTGC GTATGCTATT ATGCTTTGAA TAGTGGCAGC CTTGTCGGCT 601 CGCGTA Predicted gene structure (within gDNA segment 2887 to 1): Exon 1 2855 2664 ( 192 n); cDNA 50 241 ( 192 n); score: 0.878 Intron 1 2663 2487 ( 177 n); Pd: 0.000 (s: 0.68), Pa: 0.000 (s: 0.88) Exon 2 2486 2119 ( 368 n); cDNA 242 606 ( 365 n); score: 0.880 MATCH C06HBa0057J04.1-24- SGN-E538151+ 0.879 560 0.924 C PGS_C06HBa0057J04.1-24-_SGN-E538151+ (2855 2664,2486 2119) Alignment (genomic DNA sequence = upper lines): TCATTTATCA TTTCACCGAA TCCCGGGATG GGTAATGTTC ATGCGGAGTT TCTTGCATAT 2796 |||||||||| ||||||||| ||||||| | |||||||||| ||||||||| |||||||||| TCATTTATCA TTTCACCGAG TCCCGGGCCG GGTAATGTTC GTGCGGAGTT TCTTGCATAT 109 GTCACTGAGT CCCTCAATAG AGGGCCGGGT ATGTATATTA TATATATGAT TGATGATGAG 2736 ||||| |||| |||||| ||| ||||||||| |||||||||| |||||||||| || ||||||| GTCACCGAGT CCCTCACTAG AGGGCCGGGA ATGTATATTA TATATATGAT TGGTGATGAG 169 GATGGTTATG ATGATGATGA TGACAGAGAT G-TGTGATGA TTATTTTGTC GAGCCCCTTA 2677 |||||||||| |||||||||| |||| ||||| | |||||||| ||||| ||| |||| | GATGGTTATG ATGATGATGA TGACGGAGAT GATGTGATGA CTATTTCACT GAGTCCCTCA 229 CTAGGGAAGC TGTGCACCTT ATATGTTAAA GATATGCATG ATTTTCACTT AAAAGGGTAC 2617 |||| | || | CTAGAG-GGC CGG....... .......... .......... .......... .......... 241 ATGTGTAGCG GTATTTTGTT TCAACTTACC ATATTGGTAT CCTATCATCT TTACATTCTG 2557 .......... .......... .......... .......... .......... .......... 241 CTTTACATAC TTAGTACATT GTCCGTACTG ACTCCCGTTT CCTCAAGGGG GCTGCGTTTC 2497 .......... .......... .......... .......... .......... .......... 241 ATGCCTCTAG GTGTAGACGC ACAATTCGGT GATCCTCCTA CCTAGGATAT CTGCTCTGCT 2437 |||||||||| || || ||| |||||||| |||||||||| || ||||||| .......... GTGTAGACGC TCAGTTTGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT 291 GATTGGGAGA GCTTTCCTGT TCTGGAGACT AGTCGTTTTG GTACGTAACT TTTTGTGTAG 2377 | |||||||| ||| |||| || |||| | |||||||||| |||| ||||| | || ||||| GTTTGGGAGA GCTCCACTGT TCCGGAGCCC AGTCGTTTTG GTACATAACT TCTTATGTAG 351 TCTTTTGCTT GTCCATGGGT ATGGCGGGGG CCCTGTCCCG TCGAGTTTCA CTACTATGCT 2317 |||||||||| ||| |||||| || || |||| |||||||||| || ||||||| ||||||| || TCTTTTGCTT GTCTATGGGT AT-GC-GGGG CCCTGTCCCG TCAAGTTTCA CTACTATACT 409 CTTAGAGGTC TGTAGACATT ATGTGGGTTG TATATATATG TTTTGGATAA TGGTATAGAC 2257 |||||||||| ||||||||| ||||||||| |||| |||| |||| ||||| |||| | ||| CTTAGAGGTC TGTAGACATC GTGTGGGTTG TATAATTATG TTTTTGATAA TGGTCTGGAC 469 ATGGTTTGTT TGGGATGTCC GCTTGTACAT GGGCAGCCTT GTCGGGTGCG TACATCATTA 2197 |||||||||| |||||||||| |||||||| | ||| |||| ||||| || | |||||| || ATGGTTTGTT TGGGATGTCC ACTTGTACAA GTGCAACCTT GTCGGTTGTG TACATCTTTG 529 TGTATTGTGT AGGGGCAGCC TTGTCGGCTT GCGTATGCTA TTATGCTTTG GATAGTGGTG 2137 |||||||||| || ||||||| |||||||| | |||||||||| |||||||||| ||||||| TGTATTGTGT AGTGGCAGCC TTGTCGGC-T GCGTATGCTA TTATGCTTTG AATAGTGGCA 588 GCCTTTTTGG CTCGTGTA 2119 ||||| | || |||| ||| GCCTTGTCGG CTCGCGTA 606 hqPGS_C06HBa0057J04.1-24-_SGN-E538151+ (2855 2664,2486 2119) ******************************************************************************** EST sequence 12 +strand 470 n (File: SGN-E268096+) 1 GAAAACCAGC CATTAAACTC TTGGACAGCA GCTGCAAAGA AATTGAGTCA TTTATCATTG 61 CACCGAGTCC CGGGCCGGGT AATGTTCGTG CGGAGTTTCT TGCATATGTC ACCGAGTCCC 121 TCACTAGAGG GCCGGGAATG TATATTATAT ATATGATTGG TGATGAGGAT GGTTATGATG 181 ATGATGATGA CGGAGATGAT GTGATGACTA TTTCACTGAG TCCCTCACTA GAGGGCCGGG 241 TGTAGACGCT CAGTTTGGTG ATCCTCCCGC CTAGGATATC TACTCTGCTG TTTGGGAGAG 301 CTCCACTGTT CCGGAGCCCA GTCGTTTTGG TACATAACTT CTTATGTAGT CTTTTGCTTG 361 TCTATGGGTA TGCGGGGCCC TGTCCCGTCA AGTTTCACTA CTATACTCTT AGAGGTCTGT 421 AGACATCGTG TGGGTAGTAT AATTATGTTT TTGATAATGG GCTGGACATG Predicted gene structure (within gDNA segment 2887 to 1248): Exon 1 2855 2664 ( 192 n); cDNA 48 239 ( 192 n); score: 0.872 Intron 1 2663 2487 ( 177 n); Pd: 0.000 (s: 0.68), Pa: 0.000 (s: 0.88) Exon 2 2486 2254 ( 233 n); cDNA 240 470 ( 231 n); score: 0.871 MATCH C06HBa0057J04.1-24- SGN-E268096+ 0.872 425 0.904 C PGS_C06HBa0057J04.1-24-_SGN-E268096+ (2855 2664,2486 2254) Alignment (genomic DNA sequence = upper lines): TCATTTATCA TTTCACCGAA TCCCGGGATG GGTAATGTTC ATGCGGAGTT TCTTGCATAT 2796 |||||||||| || |||||| ||||||| | |||||||||| ||||||||| |||||||||| TCATTTATCA TTGCACCGAG TCCCGGGCCG GGTAATGTTC GTGCGGAGTT TCTTGCATAT 107 GTCACTGAGT CCCTCAATAG AGGGCCGGGT ATGTATATTA TATATATGAT TGATGATGAG 2736 ||||| |||| |||||| ||| ||||||||| |||||||||| |||||||||| || ||||||| GTCACCGAGT CCCTCACTAG AGGGCCGGGA ATGTATATTA TATATATGAT TGGTGATGAG 167 GATGGTTATG ATGATGATGA TGACAGAGAT G-TGTGATGA TTATTTTGTC GAGCCCCTTA 2677 |||||||||| |||||||||| |||| ||||| | |||||||| ||||| ||| |||| | GATGGTTATG ATGATGATGA TGACGGAGAT GATGTGATGA CTATTTCACT GAGTCCCTCA 227 CTAGGGAAGC TGTGCACCTT ATATGTTAAA GATATGCATG ATTTTCACTT AAAAGGGTAC 2617 |||| | || | CTAGAG-GGC CGG....... .......... .......... .......... .......... 239 ATGTGTAGCG GTATTTTGTT TCAACTTACC ATATTGGTAT CCTATCATCT TTACATTCTG 2557 .......... .......... .......... .......... .......... .......... 239 CTTTACATAC TTAGTACATT GTCCGTACTG ACTCCCGTTT CCTCAAGGGG GCTGCGTTTC 2497 .......... .......... .......... .......... .......... .......... 239 ATGCCTCTAG GTGTAGACGC ACAATTCGGT GATCCTCCTA CCTAGGATAT CTGCTCTGCT 2437 |||||||||| || || ||| |||||||| |||||||||| || ||||||| .......... GTGTAGACGC TCAGTTTGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT 289 GATTGGGAGA GCTTTCCTGT TCTGGAGACT AGTCGTTTTG GTACGTAACT TTTTGTGTAG 2377 | |||||||| ||| |||| || |||| | |||||||||| |||| ||||| | || ||||| GTTTGGGAGA GCTCCACTGT TCCGGAGCCC AGTCGTTTTG GTACATAACT TCTTATGTAG 349 TCTTTTGCTT GTCCATGGGT ATGGCGGGGG CCCTGTCCCG TCGAGTTTCA CTACTATGCT 2317 |||||||||| ||| |||||| || || |||| |||||||||| || ||||||| ||||||| || TCTTTTGCTT GTCTATGGGT AT-GC-GGGG CCCTGTCCCG TCAAGTTTCA CTACTATACT 407 CTTAGAGGTC TGTAGACATT ATGTGGGTTG TATATATATG TTTTGGATAA TGGTATAGAC 2257 |||||||||| ||||||||| ||||||| | |||| |||| |||| ||||| ||| | ||| CTTAGAGGTC TGTAGACATC GTGTGGGTAG TATAATTATG TTTTTGATAA TGGGCTGGAC 467 ATG 2254 ||| ATG 470 hqPGS_C06HBa0057J04.1-24-_SGN-E268096+ (2855 2664,2486 2254) ******************************************************************************** EST sequence 14 +strand 523 n (File: SGN-E303695+) 1 AAATGGAGAA AACCAGCCAT TAAACTCTTG GACAGCAGCT GCAAAGAAAT TGGTGTAGAC 61 GCTCAGTTCG GTGATCCTCC CGCCTAGGAT ATCTACTCTG CTTTTTGGGA GAGCTCCACT 121 GTTCCGGAGC CCAGTCGTTT TGGTACATAA CTTCTTATGT AGTCTTTTGC TTGTCTATGG 181 GTATGGCGGG GCCCTGTCCC GTCAAGTTTC ACTACTATAC TCTTAGAGGT CTGTAGACAT 241 CGTGTGGGTT GTATAATTAT GTTTTGGATA ATGGTCTGGA CATGGTTTGT TTGGGATGTC 301 CATTTGTACA AGTGCAGCCT TGTCGGTTGT GAACATCATT GTGTATTGTG TAGTGGCAGC 361 CTCGTCGGCT GCGTATGCTA TTATGTTTTG GATAGTGGCG GCCTTGTCGG CTCGCATATG 421 TTGTTACGAT TTAATGGTTA TGACTCTTTA TGAGATAGAT CCACTTTATA TATATATATA 481 TATATATGGC GTTGGGTTTA GCTTGATTTG ATTAAAAAAA AAA Predicted gene structure (within gDNA segment 2887 to 1): Exon 1 2486 2019 ( 468 n); cDNA 53 516 ( 464 n); score: 0.878 MATCH C06HBa0057J04.1-24- SGN-E303695+ 0.878 468 0.895 C PGS_C06HBa0057J04.1-24-_SGN-E303695+ (2486 2019) Alignment (genomic DNA sequence = upper lines): GTGTAGACGC ACAATTCGGT GATCCTCCTA CCTAGGATAT CTGCTCTGCT GATTGGGAGA 2427 |||||||||| || |||||| |||||||| |||||||||| || ||||||| |||||||| GTGTAGACGC TCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT TTTTGGGAGA 112 GCTTTCCTGT TCTGGAGACT AGTCGTTTTG GTACGTAACT TTTTGTGTAG TCTTTTGCTT 2367 ||| |||| || |||| | |||||||||| |||| ||||| | || ||||| |||||||||| GCTCCACTGT TCCGGAGCCC AGTCGTTTTG GTACATAACT TCTTATGTAG TCTTTTGCTT 172 GTCCATGGGT ATGGCGGGGG CCCTGTCCCG TCGAGTTTCA CTACTATGCT CTTAGAGGTC 2307 ||| |||||| ||||| |||| |||||||||| || ||||||| ||||||| || |||||||||| GTCTATGGGT ATGGC-GGGG CCCTGTCCCG TCAAGTTTCA CTACTATACT CTTAGAGGTC 231 TGTAGACATT ATGTGGGTTG TATATATATG TTTTGGATAA TGGTATAGAC ATGGTTTGTT 2247 ||||||||| ||||||||| |||| |||| |||||||||| |||| | ||| |||||||||| TGTAGACATC GTGTGGGTTG TATAATTATG TTTTGGATAA TGGTCTGGAC ATGGTTTGTT 291 TGGGATGTCC GCTTGTACAT GGGCAGCCTT GTCGGGTGCG TACATCATTA TGTATTGTGT 2187 |||||||||| ||||||| | |||||||| ||||| || | |||||||| |||||||||| TGGGATGTCC ATTTGTACAA GTGCAGCCTT GTCGGTTGTG AACATCATTG TGTATTGTGT 351 AGGGGCAGCC TTGTCGGCTT GCGTATGCTA TTATGCTTTG GATAGTGGTG GCCTTTTTGG 2127 || ||||||| | |||||| | |||||||||| ||||| |||| |||||||| | ||||| | || AGTGGCAGCC TCGTCGGC-T GCGTATGCTA TTATGTTTTG GATAGTGGCG GCCTTGTCGG 410 CTCGTGTATG TTGTTACAGT TGAATGGTTA TGACTCCTTA TGAGACATGC CCATTATATA 2067 |||| |||| ||||||| | | |||||||| |||||| ||| ||||| | ||| | | || CTCGCATATG TTGTTACGAT TTAATGGTTA TGACTCTTTA TGAGATAGAT CCACT-T-TA 468 TATATATATA TATATATATG GCGTTGGGGT TGTCTTGATT TGATTAAA 2019 |||||||||| |||||||||| |||||||| | | ||||||| |||||||| TATATATATA TATATATATG GCGTTGGGTT TAGCTTGATT TGATTAAA 516 hqPGS_C06HBa0057J04.1-24-_SGN-E303695+ (2486 2019) ******************************************************************************** EST sequence 13 +strand 577 n (File: SGN-E543104+) 1 GGCAGCCATG GAAATGGAGA AAATCAACCA TGCAACTCTT GACCAGCAGC TGCAAAGAAT 61 TTGGTTTGAT TAATAGCTTG AGCAGTAAAT ATTGGACGTG CGGCTCGACT ATACGGTGTA 121 GACGCTCAGT TCGGTGATCC TCCCGCCTAG GATATCTACT TTGCTGAGTG GGAGAGCTCC 181 ACTGTTTCGT AGCCCAGTCA TTTTGGTACA TAACTTTTTG TGTAGTCTTT TGCTTGTCTA 241 TGGGTATGGT GGGGCCCTGT CCCGTCGAGT TTCACTACTA TACTCTTAGA GGTCCATAGA 301 CATCGCGTGG GTTGTATATA TATGTTTTGG ATAATGGTCT GGACATGGTT TGTTTGGGAT 361 GTCCACTTGT ACAAGGGCAG CCTTGTCAGC TGCGTACATC TTTGTGTATT GTGTAGTGGC 421 AGCCTTGTCG GCTGCGTATG CTATTATGCT TTGGATAGTG GCGGCCTTGT CGGCTCGCGT 481 ATGTTGTTAC GGTTGAATGG TTATGACTCC TTATGAGACA GATCCACTTT ATATATATAT 541 ATATGGCGAT GGGGTTGGCT TGATTTGATT AAAAAAA Predicted gene structure (within gDNA segment 2887 to 1144): Exon 1 2509 2020 ( 490 n); cDNA 94 577 ( 484 n); score: 0.854 MATCH C06HBa0057J04.1-24- SGN-E543104+ 0.854 490 0.849 C PGS_C06HBa0057J04.1-24-_SGN-E543104+ (2509 2020) Alignment (genomic DNA sequence = upper lines): GGGGCTGCGT TTCATGCCTC TA-GGTGTAG ACGCACAATT CGGTGATCCT CCTACCTAGG 2451 || |||| || | || || ||||||| |||| || || |||||||||| || |||||| GGACGTGCGG CTC--GACTA TACGGTGTAG ACGCTCAGTT CGGTGATCCT CCCGCCTAGG 151 ATATCTGCTC TGCTGATTGG GAGAGCTTTC CTGTTCTGGA GACTAGTCGT TTTGGTACGT 2391 |||||| || |||||| ||| ||||||| ||||| | | | | |||| | |||||||| | ATATCTACTT TGCTGAGTGG GAGAGCTCCA CTGTTTCGTA GCCCAGTCAT TTTGGTACAT 211 AACTTTTTGT GTAGTCTTTT GCTTGTCCAT GGGTATGGCG GGGGCCCTGT CCCGTCGAGT 2331 |||||||||| |||||||||| ||||||| || |||||||| |||||||||| |||||||||| AACTTTTTGT GTAGTCTTTT GCTTGTCTAT GGGTATGG-T GGGGCCCTGT CCCGTCGAGT 270 TTCACTACTA TGCTCTTAGA GGTCTGTAGA CATTATGTGG GTTGTATATA TATGTTTTGG 2271 |||||||||| | |||||||| |||| |||| ||| |||| |||||||||| |||||||||| TTCACTACTA TACTCTTAGA GGTCCATAGA CATCGCGTGG GTTGTATATA TATGTTTTGG 330 ATAATGGTAT AGACATGGTT TGTTTGGGAT GTCCGCTTGT ACATGGGCAG CCTTGTCGGG 2211 |||||||| | ||||||||| |||||||||| |||| ||||| ||| |||||| ||||||| | ATAATGGTCT GGACATGGTT TGTTTGGGAT GTCCACTTGT ACAAGGGCAG CCTTGTCAGC 390 TGCGTACATC ATTATGTATT GTGTAGGGGC AGCCTTGTCG GCTTGCGTAT GCTATTATGC 2151 |||||||||| || |||||| |||||| ||| |||||||||| || ||||||| |||||||||| TGCGTACATC TTTGTGTATT GTGTAGTGGC AGCCTTGTCG GC-TGCGTAT GCTATTATGC 449 TTTGGATAGT GGTGGCCTTT TTGGCTCGTG TATGTTGTTA CAGTTGAATG GTTATGACTC 2091 |||||||||| || |||||| | |||||| | |||||||||| | |||||||| |||||||||| TTTGGATAGT GGCGGCCTTG TCGGCTCGCG TATGTTGTTA CGGTTGAATG GTTATGACTC 509 CTTATGAGAC ATGCCCATTA TATATATATA TATATATATA TATGGCGTTG GGGTTGTCTT 2031 |||||||||| | ||| | | |||||||| ||||||| |||| ||| || ||| || CTTATGAGAC AGATCCACT- T-TATATATA TATATATGGC GATGGGGTT- GGCTTGATTT 566 GATTTGATTA A 2020 |||| | | | GATTAAAAAA A 577 hqPGS_C06HBa0057J04.1-24-_SGN-E543104+ (2509 2020) ******************************************************************************** EST sequence 5 +strand 495 n (File: SGN-E306317+) 1 TAAACTCTTG GACAGCAGCT GCAAAGAAAT TGGTGTAGAC GCTCAGTTCG GTGATCCTCC 61 CGCCTAGGAT ATCTACTCTG CTGTTTGGGA GAGCTCCACT GTTCCGGAGC CCAGTCGTTT 121 TGGTACATAA CTTCTTATGT AGTCTTTTGC TTGTCTATGG GTATGGCGGG GCCCTGTCCC 181 GTCAAGTTTC ACTACTATAC TCTTAGAGGT CTGTAGACAT CGTGTGGGTT GTATAATTAT 241 GTTTTGGATA ATGGTCTGGA CATGGTTTGT TTGGGATGTC CATTTGTACA AGTGCAGCCT 301 TGTCGGTTGT GAACATCATT GTGTATTGTG TAGTGGCAGC CTCGTCGGCT GCGTATGCTA 361 TTATGTTTTG GATAGTGGCG GCCTTGTCGG CTCGCATATG TTGTTACGAT TTAATGGTTA 421 TGACTCTTTA TGAGATAGAT CCACTTTATA TATATATATA TATATATGGC GTTGGGTTTN 481 AAAAAAAAAA AAAAA Predicted gene structure (within gDNA segment 2887 to 1): Exon 1 2486 2020 ( 467 n); cDNA 33 495 ( 463 n); score: 0.859 MATCH C06HBa0057J04.1-24- SGN-E306317+ 0.859 467 0.943 C PGS_C06HBa0057J04.1-24-_SGN-E306317+ (2486 2020) Alignment (genomic DNA sequence = upper lines): GTGTAGACGC ACAATTCGGT GATCCTCCTA CCTAGGATAT CTGCTCTGCT GATTGGGAGA 2427 |||||||||| || |||||| |||||||| |||||||||| || ||||||| | |||||||| GTGTAGACGC TCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT GTTTGGGAGA 92 GCTTTCCTGT TCTGGAGACT AGTCGTTTTG GTACGTAACT TTTTGTGTAG TCTTTTGCTT 2367 ||| |||| || |||| | |||||||||| |||| ||||| | || ||||| |||||||||| GCTCCACTGT TCCGGAGCCC AGTCGTTTTG GTACATAACT TCTTATGTAG TCTTTTGCTT 152 GTCCATGGGT ATGGCGGGGG CCCTGTCCCG TCGAGTTTCA CTACTATGCT CTTAGAGGTC 2307 ||| |||||| ||||| |||| |||||||||| || ||||||| ||||||| || |||||||||| GTCTATGGGT ATGGC-GGGG CCCTGTCCCG TCAAGTTTCA CTACTATACT CTTAGAGGTC 211 TGTAGACATT ATGTGGGTTG TATATATATG TTTTGGATAA TGGTATAGAC ATGGTTTGTT 2247 ||||||||| ||||||||| |||| |||| |||||||||| |||| | ||| |||||||||| TGTAGACATC GTGTGGGTTG TATAATTATG TTTTGGATAA TGGTCTGGAC ATGGTTTGTT 271 TGGGATGTCC GCTTGTACAT GGGCAGCCTT GTCGGGTGCG TACATCATTA TGTATTGTGT 2187 |||||||||| ||||||| | |||||||| ||||| || | |||||||| |||||||||| TGGGATGTCC ATTTGTACAA GTGCAGCCTT GTCGGTTGTG AACATCATTG TGTATTGTGT 331 AGGGGCAGCC TTGTCGGCTT GCGTATGCTA TTATGCTTTG GATAGTGGTG GCCTTTTTGG 2127 || ||||||| | |||||| | |||||||||| ||||| |||| |||||||| | ||||| | || AGTGGCAGCC TCGTCGGC-T GCGTATGCTA TTATGTTTTG GATAGTGGCG GCCTTGTCGG 390 CTCGTGTATG TTGTTACAGT TGAATGGTTA TGACTCCTTA TGAGACATGC CCATTATATA 2067 |||| |||| ||||||| | | |||||||| |||||| ||| ||||| | ||| | | || CTCGCATATG TTGTTACGAT TTAATGGTTA TGACTCTTTA TGAGATAGAT CCACT-T-TA 448 TATATATATA TATATATATG GCGTTGGGGT TGTCTTGATT TGATTAA 2020 |||||||||| |||||||||| |||||||| | | | | || TATATATATA TATATATATG GCGTTGGGTT TNAAAAAAAA AAAAAAA 495 hqPGS_C06HBa0057J04.1-24-_SGN-E306317+ (2486 2020) ******************************************************************************** EST sequence 3 -strand 586 n (File: SGN-E543103-) 1 GGCAGCCATG GAAATGGAGA AAATCAACCA TGCAACTCTT GACCAGCAGC TGCAAAGAAT 61 TTGGTTTGAT TAATAGCTTG AGCAGTAAAT ATTGGACGTG CGGCTCGACT ATACGGTGTA 121 GACGCTCAGT TCGGTGATCC TCCCGCCTAG GATATCTACT TTGCTGAGTG GGAGAGCTCC 181 ACTGTTTCGT AGCCCAGTCA TTTTGGTACA TAACTTTTTG TGTAGTCTTT TGCTTGTCTA 241 TGGGTATGGT GGGGCCCTGT CCCGTCGAGT TTCACTACTA TACTCTTAGA GGTCCATAGA 301 CATCGCGTGG GTTGTATATA TATGTTTTGG ATAATGGTCT GGACATGGTT TGTTTGGGAT 361 GTCCACTTGT ACAAGGGCAG CCTTGTCAGC TGCGTACATC TTTGTGTATT GTGTAGTGGC 421 AGCCTTGTCG GCTGCGTATG CTATTATGCT TTGGATAGTG GCGGCCTTGT CGGCTCGCGT 481 ATGTTGTTAC GGTTGAATGG TTATGACTCC TTATGAGACA GATCCACTTT ATATATATAT 541 ATATGGCGTT GGGGTTGGCG TGATTGATAA AAAAAAGGGG GGCCCG Predicted gene structure (within gDNA segment 2887 to 1145): Exon 1 2509 2056 ( 454 n); cDNA 94 544 ( 451 n); score: 0.873 MATCH C06HBa0057J04.1-24- SGN-E543103- 0.873 454 0.775 C PGS_C06HBa0057J04.1-24-_SGN-E543103- (2509 2056) Alignment (genomic DNA sequence = upper lines): GGGGCTGCGT TTCATGCCTC TA-GGTGTAG ACGCACAATT CGGTGATCCT CCTACCTAGG 2451 || |||| || | || || ||||||| |||| || || |||||||||| || |||||| GGACGTGCGG CTC--GACTA TACGGTGTAG ACGCTCAGTT CGGTGATCCT CCCGCCTAGG 151 ATATCTGCTC TGCTGATTGG GAGAGCTTTC CTGTTCTGGA GACTAGTCGT TTTGGTACGT 2391 |||||| || |||||| ||| ||||||| ||||| | | | | |||| | |||||||| | ATATCTACTT TGCTGAGTGG GAGAGCTCCA CTGTTTCGTA GCCCAGTCAT TTTGGTACAT 211 AACTTTTTGT GTAGTCTTTT GCTTGTCCAT GGGTATGGCG GGGGCCCTGT CCCGTCGAGT 2331 |||||||||| |||||||||| ||||||| || |||||||| |||||||||| |||||||||| AACTTTTTGT GTAGTCTTTT GCTTGTCTAT GGGTATGG-T GGGGCCCTGT CCCGTCGAGT 270 TTCACTACTA TGCTCTTAGA GGTCTGTAGA CATTATGTGG GTTGTATATA TATGTTTTGG 2271 |||||||||| | |||||||| |||| |||| ||| |||| |||||||||| |||||||||| TTCACTACTA TACTCTTAGA GGTCCATAGA CATCGCGTGG GTTGTATATA TATGTTTTGG 330 ATAATGGTAT AGACATGGTT TGTTTGGGAT GTCCGCTTGT ACATGGGCAG CCTTGTCGGG 2211 |||||||| | ||||||||| |||||||||| |||| ||||| ||| |||||| ||||||| | ATAATGGTCT GGACATGGTT TGTTTGGGAT GTCCACTTGT ACAAGGGCAG CCTTGTCAGC 390 TGCGTACATC ATTATGTATT GTGTAGGGGC AGCCTTGTCG GCTTGCGTAT GCTATTATGC 2151 |||||||||| || |||||| |||||| ||| |||||||||| || ||||||| |||||||||| TGCGTACATC TTTGTGTATT GTGTAGTGGC AGCCTTGTCG GC-TGCGTAT GCTATTATGC 449 TTTGGATAGT GGTGGCCTTT TTGGCTCGTG TATGTTGTTA CAGTTGAATG GTTATGACTC 2091 |||||||||| || |||||| | |||||| | |||||||||| | |||||||| |||||||||| TTTGGATAGT GGCGGCCTTG TCGGCTCGCG TATGTTGTTA CGGTTGAATG GTTATGACTC 509 CTTATGAGAC ATGCCCATTA TATATATATA TATAT 2056 |||||||||| | ||| | |||||||||| ||||| CTTATGAGAC AGATCCACTT TATATATATA TATAT 544 hqPGS_C06HBa0057J04.1-24-_SGN-E543103- (2509 2056) ******************************************************************************** EST sequence 8 +strand 547 n (File: SGN-E305738+) 1 AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 61 ATTTATACCT TGAGCAGTAA ATATTGGACG TACGGCTCGA CTATTCGGTG TAGACGCTCA 121 GTTCGGTGAT CCTCCCGCCT AGGATATCTA CTCTGCTTTT TGGGAGAGCT CCACTGTTCC 181 GGAGCCCAGT CGTTTTGGTA CATAACTTCT TATGTAGTCT TTTGCTTGTC TATGGGTATG 241 GCGGGGCCCT GTCCCGTCAA GTTTCACTAC TATACTCTTA GAGGTCTGTA GACATCGTGT 301 GGGTTGTATA ATTATGTTTT GGATAATGGT CTGGACATGG TTTGTTTGGG ATGTCCATTT 361 GTACAAGTGC AGCCTTGTCG GTTGTGAACA TCATTGTGTA TTGTGTAGTG GCAGCCTCGT 421 CGGCTGCGTA TGCTATTATG TTTTGGATAG TGGCGGCCTT GTCGGCTCGC ATATGTTGTT 481 ACGATTTAAT GGTTATGACT CTTTATGAGA TAGATCCACT TTATATATGA AATGAATGGA 541 CTAACTA Predicted gene structure (within gDNA segment 2887 to 1): Exon 1 2486 2056 ( 431 n); cDNA 108 537 ( 430 n); score: 0.864 MATCH C06HBa0057J04.1-24- SGN-E305738+ 0.864 431 0.788 C PGS_C06HBa0057J04.1-24-_SGN-E305738+ (2486 2056) Alignment (genomic DNA sequence = upper lines): GTGTAGACGC ACAATTCGGT GATCCTCCTA CCTAGGATAT CTGCTCTGCT GATTGGGAGA 2427 |||||||||| || |||||| |||||||| |||||||||| || ||||||| |||||||| GTGTAGACGC TCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT TTTTGGGAGA 167 GCTTTCCTGT TCTGGAGACT AGTCGTTTTG GTACGTAACT TTTTGTGTAG TCTTTTGCTT 2367 ||| |||| || |||| | |||||||||| |||| ||||| | || ||||| |||||||||| GCTCCACTGT TCCGGAGCCC AGTCGTTTTG GTACATAACT TCTTATGTAG TCTTTTGCTT 227 GTCCATGGGT ATGGCGGGGG CCCTGTCCCG TCGAGTTTCA CTACTATGCT CTTAGAGGTC 2307 ||| |||||| ||||| |||| |||||||||| || ||||||| ||||||| || |||||||||| GTCTATGGGT ATGGC-GGGG CCCTGTCCCG TCAAGTTTCA CTACTATACT CTTAGAGGTC 286 TGTAGACATT ATGTGGGTTG TATATATATG TTTTGGATAA TGGTATAGAC ATGGTTTGTT 2247 ||||||||| ||||||||| |||| |||| |||||||||| |||| | ||| |||||||||| TGTAGACATC GTGTGGGTTG TATAATTATG TTTTGGATAA TGGTCTGGAC ATGGTTTGTT 346 TGGGATGTCC GCTTGTACAT GGGCAGCCTT GTCGGGTGCG TACATCATTA TGTATTGTGT 2187 |||||||||| ||||||| | |||||||| ||||| || | |||||||| |||||||||| TGGGATGTCC ATTTGTACAA GTGCAGCCTT GTCGGTTGTG AACATCATTG TGTATTGTGT 406 AGGGGCAGCC TTGTCGGCTT GCGTATGCTA TTATGCTTTG GATAGTGGTG GCCTTTTTGG 2127 || ||||||| | |||||| | |||||||||| ||||| |||| |||||||| | ||||| | || AGTGGCAGCC TCGTCGGC-T GCGTATGCTA TTATGTTTTG GATAGTGGCG GCCTTGTCGG 465 CTCGTGTATG TTGTTACAGT TGAATGGTTA TGACTCCTTA TGAGACATGC CCATTATATA 2067 |||| |||| ||||||| | | |||||||| |||||| ||| ||||| | ||| | |||| CTCGCATATG TTGTTACGAT TTAATGGTTA TGACTCTTTA TGAGATAGAT CCACTTTATA 525 TAT-ATATAT AT 2056 ||| | || || TATGAAATGA AT 537 hqPGS_C06HBa0057J04.1-24-_SGN-E305738+ (2486 2056) ******************************************************************************** EST sequence 2 -strand 542 n (File: SGN-E374134-) 1 CAGCCATGGA AATGGAGAAA ACCAGCCATT AAACTCTTGG ACAGCAGCTG CAAAGAAATT 61 GATTTATACC TTGAGCAGTA AATATTGGAC GTACGGCTCG ACTATTCGGT GTAGACGCTC 121 AGTTCGGTGA TCCTCCCGCC TAGGATATCT ACTCTGCTTT TTGGGAGAGC TCCACTGTTC 181 CGGAGCCCAG TCGTTTTGGT ACATAACTTC TTATGTAGTC TTTTGCTTGT CTATGGGTAT 241 GGCGGGGCCC TGTCCCGTCA AGTTTCACTA CTATACTCTT AGAGGTCTGT AGACATCGTG 301 TGGGTTGTAT AATTATGTTT TGGATAATGG TCTGGACATG GTTTGTTTGG GATGTCCATT 361 TGTACAAGTG CAGCCTTGTC GGTTGTGAAC ATCATTGTGT ATTGTGTAGT GGCAGCCTCG 421 TCGGCTGCGT ATGCTATTAT GTTTTGGATA GTGGCGGCCT TGTCGGCTCG CATATGTTGT 481 TACGATTTAA TGGTTATGAC TCTTTATGAG ATAGATCCAC TTTATATATA AAAAAAAAAA 541 AA Predicted gene structure (within gDNA segment 2887 to 1): Exon 1 2486 2063 ( 424 n); cDNA 109 530 ( 422 n); score: 0.875 PPA cDNA 531 542 MATCH C06HBa0057J04.1-24- SGN-E374134- 0.875 424 0.782 C PGS_C06HBa0057J04.1-24-_SGN-E374134- (2486 2063) Alignment (genomic DNA sequence = upper lines): GTGTAGACGC ACAATTCGGT GATCCTCCTA CCTAGGATAT CTGCTCTGCT GATTGGGAGA 2427 |||||||||| || |||||| |||||||| |||||||||| || ||||||| |||||||| GTGTAGACGC TCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT TTTTGGGAGA 168 GCTTTCCTGT TCTGGAGACT AGTCGTTTTG GTACGTAACT TTTTGTGTAG TCTTTTGCTT 2367 ||| |||| || |||| | |||||||||| |||| ||||| | || ||||| |||||||||| GCTCCACTGT TCCGGAGCCC AGTCGTTTTG GTACATAACT TCTTATGTAG TCTTTTGCTT 228 GTCCATGGGT ATGGCGGGGG CCCTGTCCCG TCGAGTTTCA CTACTATGCT CTTAGAGGTC 2307 ||| |||||| ||||| |||| |||||||||| || ||||||| ||||||| || |||||||||| GTCTATGGGT ATGGC-GGGG CCCTGTCCCG TCAAGTTTCA CTACTATACT CTTAGAGGTC 287 TGTAGACATT ATGTGGGTTG TATATATATG TTTTGGATAA TGGTATAGAC ATGGTTTGTT 2247 ||||||||| ||||||||| |||| |||| |||||||||| |||| | ||| |||||||||| TGTAGACATC GTGTGGGTTG TATAATTATG TTTTGGATAA TGGTCTGGAC ATGGTTTGTT 347 TGGGATGTCC GCTTGTACAT GGGCAGCCTT GTCGGGTGCG TACATCATTA TGTATTGTGT 2187 |||||||||| ||||||| | |||||||| ||||| || | |||||||| |||||||||| TGGGATGTCC ATTTGTACAA GTGCAGCCTT GTCGGTTGTG AACATCATTG TGTATTGTGT 407 AGGGGCAGCC TTGTCGGCTT GCGTATGCTA TTATGCTTTG GATAGTGGTG GCCTTTTTGG 2127 || ||||||| | |||||| | |||||||||| ||||| |||| |||||||| | ||||| | || AGTGGCAGCC TCGTCGGC-T GCGTATGCTA TTATGTTTTG GATAGTGGCG GCCTTGTCGG 466 CTCGTGTATG TTGTTACAGT TGAATGGTTA TGACTCCTTA TGAGACATGC CCATTATATA 2067 |||| |||| ||||||| | | |||||||| |||||| ||| ||||| | ||| | |||| CTCGCATATG TTGTTACGAT TTAATGGTTA TGACTCTTTA TGAGATAGAT CCACTTTATA 526 TATA 2063 |||| TATA 530 hqPGS_C06HBa0057J04.1-24-_SGN-E374134- (2486 2063) ******************************************************************************** EST sequence 11 +strand 542 n (File: SGN-E374135+) 1 AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 61 ATTTATACCT TGAGCAGTAA ATATTGGACG TACGGCTCGA CTATTCGGTG TAGACGCTCA 121 GTTCGGTGAT CCTCCCGCCT AGGATATCTA CTCTGCTTTT TGGGAGAGCT CCACTGTTCC 181 GGAGCCCAGT CGTTTTGGTA CATAACTTCT TATGTAGTCT TTTGCTTGTC TATGGGTATG 241 GCGGGGCCCT GTCCCGTCAA GTTTCACTAC TATACTCTTA GAGGTCTGTA GACATCGTGT 301 GGGTTGTATA ATTATGTTTT GGATAATGGT CTGGACATGG TTTGTTTGGG ATGTCCATTT 361 GTACAAGTGC AGCCTTGTCG GTTGTGAACA TCATTGTGTA TTGTGTAGTG GCAGCCTCGT 421 CGGCTGCGTA TGCTATTATG TTTTGGATAG TGGCGGCCTT GTCGGCTCGC ATATGTTGTT 481 ACGATTTAAT GGTTATGACT CTTTATGAGA TAGATCCACT TTATATATAA AAAAAAAAAA 541 AA Predicted gene structure (within gDNA segment 2887 to 1): Exon 1 2486 2063 ( 424 n); cDNA 108 529 ( 422 n); score: 0.875 PPA cDNA 530 542 MATCH C06HBa0057J04.1-24- SGN-E374135+ 0.875 424 0.782 C PGS_C06HBa0057J04.1-24-_SGN-E374135+ (2486 2063) Alignment (genomic DNA sequence = upper lines): GTGTAGACGC ACAATTCGGT GATCCTCCTA CCTAGGATAT CTGCTCTGCT GATTGGGAGA 2427 |||||||||| || |||||| |||||||| |||||||||| || ||||||| |||||||| GTGTAGACGC TCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT TTTTGGGAGA 167 GCTTTCCTGT TCTGGAGACT AGTCGTTTTG GTACGTAACT TTTTGTGTAG TCTTTTGCTT 2367 ||| |||| || |||| | |||||||||| |||| ||||| | || ||||| |||||||||| GCTCCACTGT TCCGGAGCCC AGTCGTTTTG GTACATAACT TCTTATGTAG TCTTTTGCTT 227 GTCCATGGGT ATGGCGGGGG CCCTGTCCCG TCGAGTTTCA CTACTATGCT CTTAGAGGTC 2307 ||| |||||| ||||| |||| |||||||||| || ||||||| ||||||| || |||||||||| GTCTATGGGT ATGGC-GGGG CCCTGTCCCG TCAAGTTTCA CTACTATACT CTTAGAGGTC 286 TGTAGACATT ATGTGGGTTG TATATATATG TTTTGGATAA TGGTATAGAC ATGGTTTGTT 2247 ||||||||| ||||||||| |||| |||| |||||||||| |||| | ||| |||||||||| TGTAGACATC GTGTGGGTTG TATAATTATG TTTTGGATAA TGGTCTGGAC ATGGTTTGTT 346 TGGGATGTCC GCTTGTACAT GGGCAGCCTT GTCGGGTGCG TACATCATTA TGTATTGTGT 2187 |||||||||| ||||||| | |||||||| ||||| || | |||||||| |||||||||| TGGGATGTCC ATTTGTACAA GTGCAGCCTT GTCGGTTGTG AACATCATTG TGTATTGTGT 406 AGGGGCAGCC TTGTCGGCTT GCGTATGCTA TTATGCTTTG GATAGTGGTG GCCTTTTTGG 2127 || ||||||| | |||||| | |||||||||| ||||| |||| |||||||| | ||||| | || AGTGGCAGCC TCGTCGGC-T GCGTATGCTA TTATGTTTTG GATAGTGGCG GCCTTGTCGG 465 CTCGTGTATG TTGTTACAGT TGAATGGTTA TGACTCCTTA TGAGACATGC CCATTATATA 2067 |||| |||| ||||||| | | |||||||| |||||| ||| ||||| | ||| | |||| CTCGCATATG TTGTTACGAT TTAATGGTTA TGACTCTTTA TGAGATAGAT CCACTTTATA 525 TATA 2063 |||| TATA 529 hqPGS_C06HBa0057J04.1-24-_SGN-E374135+ (2486 2063) ******************************************************************************** EST sequence 1 -strand 432 n (File: SGN-E225616-) 1 TATTCGGTGT AGACGCTCAG TTCGGTGATC CTCCCGCCTA GGATATCTAC TCTGCTTTTT 61 GGGAGAGCTC CACTGTTCCG GAGCCCAGTC GTTTTGGTAC ATAACTTCTT ATGTAGTCTT 121 TTGCTTGTCT ATGGGTATGG CGGGGCCCTG TCCCGTCAAG TTTCACTACT ATACTCTTAG 181 AGGTCTGTAG ACATCGTGTG GGTTGTATAA TTATGTTTTG GATAATGGTC TGGACATGGT 241 TTGTTTGGGA TGTCCATTTG TACAAGTGCA GCCTTGTCGG TTGTGAACAT CATTGTGTAT 301 TGTGTAGTGG CAGCCTCGTC GGCTGCGTAT GCTATTATGT TTTGGATAGT GGCGGCCTTG 361 TCGGCTCGCA TATGTTGTTA CGATTTAATG GTTATGACTC TTTATGAAAA AACCAAAAAA 421 AAAAAAAAAA AA Predicted gene structure (within gDNA segment 2887 to 1): Exon 1 2487 2084 ( 404 n); cDNA 6 407 ( 402 n); score: 0.884 PPA cDNA 415 432 MATCH C06HBa0057J04.1-24- SGN-E225616- 0.884 404 0.935 C PGS_C06HBa0057J04.1-24-_SGN-E225616- (2487 2084) Alignment (genomic DNA sequence = upper lines): GGTGTAGACG CACAATTCGG TGATCCTCCT ACCTAGGATA TCTGCTCTGC TGATTGGGAG 2428 |||||||||| | || ||||| ||||||||| ||||||||| ||| |||||| | ||||||| GGTGTAGACG CTCAGTTCGG TGATCCTCCC GCCTAGGATA TCTACTCTGC TTTTTGGGAG 65 AGCTTTCCTG TTCTGGAGAC TAGTCGTTTT GGTACGTAAC TTTTTGTGTA GTCTTTTGCT 2368 |||| ||| ||| |||| | ||||||||| ||||| |||| || || |||| |||||||||| AGCTCCACTG TTCCGGAGCC CAGTCGTTTT GGTACATAAC TTCTTATGTA GTCTTTTGCT 125 TGTCCATGGG TATGGCGGGG GCCCTGTCCC GTCGAGTTTC ACTACTATGC TCTTAGAGGT 2308 |||| ||||| |||||| ||| |||||||||| ||| |||||| |||||||| | |||||||||| TGTCTATGGG TATGGC-GGG GCCCTGTCCC GTCAAGTTTC ACTACTATAC TCTTAGAGGT 184 CTGTAGACAT TATGTGGGTT GTATATATAT GTTTTGGATA ATGGTATAGA CATGGTTTGT 2248 |||||||||| |||||||| ||||| ||| |||||||||| ||||| | || |||||||||| CTGTAGACAT CGTGTGGGTT GTATAATTAT GTTTTGGATA ATGGTCTGGA CATGGTTTGT 244 TTGGGATGTC CGCTTGTACA TGGGCAGCCT TGTCGGGTGC GTACATCATT ATGTATTGTG 2188 |||||||||| | ||||||| | ||||||| |||||| || | |||||||| ||||||||| TTGGGATGTC CATTTGTACA AGTGCAGCCT TGTCGGTTGT GAACATCATT GTGTATTGTG 304 TAGGGGCAGC CTTGTCGGCT TGCGTATGCT ATTATGCTTT GGATAGTGGT GGCCTTTTTG 2128 ||| |||||| || |||||| |||||||||| |||||| ||| ||||||||| |||||| | | TAGTGGCAGC CTCGTCGGC- TGCGTATGCT ATTATGTTTT GGATAGTGGC GGCCTTGTCG 363 GCTCGTGTAT GTTGTTACAG TTGAATGGTT ATGACTCCTT ATGA 2084 ||||| ||| |||||||| || ||||||| ||||||| || |||| GCTCGCATAT GTTGTTACGA TTTAATGGTT ATGACTCTTT ATGA 407 hqPGS_C06HBa0057J04.1-24-_SGN-E225616- (2487 2084) ******************************************************************************** EST sequence 10 +strand 453 n (File: SGN-E303256+) 1 AACCAGCCAT TAAACTCTTG GACAGCAGCT GCAAAGAAAT TGGTGTAGAC GCTCAGTTCG 61 GTGATCCTCC CGCCTAGGAT ATCTACTCTG CTTTTTGGGA GAGCTCCACT GTTCCGGAGC 121 CCAGTCGTTT TGGTACATAA CTTCTTATGT AGTCTTTTGC TTGTCTATGG GTATGGCGGG 181 GCCCTGTCCC GTCAAGTTTC ACTACTATAC TCTTAGAGGT CTGTAGACAT CGTGTGGGTT 241 GTATAATTAT GTTTTGGATA ATGGTCTGGA CATGGTTTGT TTGGGATGTC CATTTGTACA 301 AGTGCAGCCT TGTCGGTTGT GAACATCATT GTGTATTGTG TAGTGGCAGC CTCGTCGGCT 361 GCGTATGCTA TTATGTTTTG GATAGTGGCG GCCTTGTCGG CTCGCATATG TTGTTACGAT 421 TTAATGGTTA TGACTCTTTA TGAAAAAAAA AAA Predicted gene structure (within gDNA segment 2887 to 7): Exon 1 2486 2084 ( 403 n); cDNA 43 443 ( 401 n); score: 0.883 PPA cDNA 444 453 MATCH C06HBa0057J04.1-24- SGN-E303256+ 0.883 403 0.890 C PGS_C06HBa0057J04.1-24-_SGN-E303256+ (2486 2084) Alignment (genomic DNA sequence = upper lines): GTGTAGACGC ACAATTCGGT GATCCTCCTA CCTAGGATAT CTGCTCTGCT GATTGGGAGA 2427 |||||||||| || |||||| |||||||| |||||||||| || ||||||| |||||||| GTGTAGACGC TCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT TTTTGGGAGA 102 GCTTTCCTGT TCTGGAGACT AGTCGTTTTG GTACGTAACT TTTTGTGTAG TCTTTTGCTT 2367 ||| |||| || |||| | |||||||||| |||| ||||| | || ||||| |||||||||| GCTCCACTGT TCCGGAGCCC AGTCGTTTTG GTACATAACT TCTTATGTAG TCTTTTGCTT 162 GTCCATGGGT ATGGCGGGGG CCCTGTCCCG TCGAGTTTCA CTACTATGCT CTTAGAGGTC 2307 ||| |||||| ||||| |||| |||||||||| || ||||||| ||||||| || |||||||||| GTCTATGGGT ATGGC-GGGG CCCTGTCCCG TCAAGTTTCA CTACTATACT CTTAGAGGTC 221 TGTAGACATT ATGTGGGTTG TATATATATG TTTTGGATAA TGGTATAGAC ATGGTTTGTT 2247 ||||||||| ||||||||| |||| |||| |||||||||| |||| | ||| |||||||||| TGTAGACATC GTGTGGGTTG TATAATTATG TTTTGGATAA TGGTCTGGAC ATGGTTTGTT 281 TGGGATGTCC GCTTGTACAT GGGCAGCCTT GTCGGGTGCG TACATCATTA TGTATTGTGT 2187 |||||||||| ||||||| | |||||||| ||||| || | |||||||| |||||||||| TGGGATGTCC ATTTGTACAA GTGCAGCCTT GTCGGTTGTG AACATCATTG TGTATTGTGT 341 AGGGGCAGCC TTGTCGGCTT GCGTATGCTA TTATGCTTTG GATAGTGGTG GCCTTTTTGG 2127 || ||||||| | |||||| | |||||||||| ||||| |||| |||||||| | ||||| | || AGTGGCAGCC TCGTCGGC-T GCGTATGCTA TTATGTTTTG GATAGTGGCG GCCTTGTCGG 400 CTCGTGTATG TTGTTACAGT TGAATGGTTA TGACTCCTTA TGA 2084 |||| |||| ||||||| | | |||||||| |||||| ||| ||| CTCGCATATG TTGTTACGAT TTAATGGTTA TGACTCTTTA TGA 443 hqPGS_C06HBa0057J04.1-24-_SGN-E303256+ (2486 2084) ******************************************************************************** EST sequence 15 +strand 519 n (File: SGN-E310669+) 1 AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 61 ATTTATACCT TGAGCAGTAA ATATTGGACG TACGGCTCGA CTATTCGGTG TAGACGCTCA 121 GTTCGGTGAT CCTCCCGCCT AGGATATCTA CTCTGCTGTT TGGGAGAGCT CCACTGTTCC 181 GGAGCCCAGT CGTTTTGGTA CATAACTTCT TATGTAGTCT TTTGCTTGTC TATGGGTATG 241 GCGGGGCCCT GTCCCGTCAA GTTTCACTAC TATACTCTTA GAGGTCTGTA GACATCGTGT 301 GGGTTGTATA ATTATGTTTT GGATAATGGT CTGGACATGG TTTGTTTGGG ATGTCCATTT 361 GTACAAGTGC AGCCTTGTCG GTTGTGAACA TCATTGTGTA TTGTGTAGTG GCAGCCTCGT 421 CGGCTGCGTA TGCTATTATG TTTTGGATAG TGGCGGCCTT GTCGGCTCGC ATATGTTGTT 481 ACGATTTAAT GGTTATGACT CTTTATGAAA AAAAAAAAA Predicted gene structure (within gDNA segment 2887 to 1): Exon 1 2486 2084 ( 403 n); cDNA 108 508 ( 401 n); score: 0.886 PPA cDNA 509 519 MATCH C06HBa0057J04.1-24- SGN-E310669+ 0.886 403 0.776 C PGS_C06HBa0057J04.1-24-_SGN-E310669+ (2486 2084) Alignment (genomic DNA sequence = upper lines): GTGTAGACGC ACAATTCGGT GATCCTCCTA CCTAGGATAT CTGCTCTGCT GATTGGGAGA 2427 |||||||||| || |||||| |||||||| |||||||||| || ||||||| | |||||||| GTGTAGACGC TCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT GTTTGGGAGA 167 GCTTTCCTGT TCTGGAGACT AGTCGTTTTG GTACGTAACT TTTTGTGTAG TCTTTTGCTT 2367 ||| |||| || |||| | |||||||||| |||| ||||| | || ||||| |||||||||| GCTCCACTGT TCCGGAGCCC AGTCGTTTTG GTACATAACT TCTTATGTAG TCTTTTGCTT 227 GTCCATGGGT ATGGCGGGGG CCCTGTCCCG TCGAGTTTCA CTACTATGCT CTTAGAGGTC 2307 ||| |||||| ||||| |||| |||||||||| || ||||||| ||||||| || |||||||||| GTCTATGGGT ATGGC-GGGG CCCTGTCCCG TCAAGTTTCA CTACTATACT CTTAGAGGTC 286 TGTAGACATT ATGTGGGTTG TATATATATG TTTTGGATAA TGGTATAGAC ATGGTTTGTT 2247 ||||||||| ||||||||| |||| |||| |||||||||| |||| | ||| |||||||||| TGTAGACATC GTGTGGGTTG TATAATTATG TTTTGGATAA TGGTCTGGAC ATGGTTTGTT 346 TGGGATGTCC GCTTGTACAT GGGCAGCCTT GTCGGGTGCG TACATCATTA TGTATTGTGT 2187 |||||||||| ||||||| | |||||||| ||||| || | |||||||| |||||||||| TGGGATGTCC ATTTGTACAA GTGCAGCCTT GTCGGTTGTG AACATCATTG TGTATTGTGT 406 AGGGGCAGCC TTGTCGGCTT GCGTATGCTA TTATGCTTTG GATAGTGGTG GCCTTTTTGG 2127 || ||||||| | |||||| | |||||||||| ||||| |||| |||||||| | ||||| | || AGTGGCAGCC TCGTCGGC-T GCGTATGCTA TTATGTTTTG GATAGTGGCG GCCTTGTCGG 465 CTCGTGTATG TTGTTACAGT TGAATGGTTA TGACTCCTTA TGA 2084 |||| |||| ||||||| | | |||||||| |||||| ||| ||| CTCGCATATG TTGTTACGAT TTAATGGTTA TGACTCTTTA TGA 508 hqPGS_C06HBa0057J04.1-24-_SGN-E310669+ (2486 2084) Total number of EST alignments reported: 16 ________________________________________________________________________________ Predicted gene locations (2) in segment 1 to 2887: PGL 1 (+ strand): 3 658 AGS-1 (3 658) SCR (e 0.859) Exon 1 3 658 ( 656 n); score: 0.859 PGS (3 650) SGN-E328093- PGS (204 658) SGN-E298250- 3-phase translation of AGS-1 (+strand): . . . . . . 3 ATAATCGAGCCGCACGTCCAATATTTACTGCTCAAGCTATTAATCTATAACAAGCAACAA I I E P H V Q Y L L L K L L I Y N K Q Q - S S R T S N I Y C S S Y - S I T S N N N R A A R P I F T A Q A I N L - Q A T . . . . . . 63 TCGTGCCGCAATTAGATAAACATCGTGCTTTACTTTCTTGTAAGCTAGCTAATAATCTAA S C R N - I N I V L Y F L V S - L I I - R A A I R - T S C F T F L - A S - - S N I V P Q L D K H R A L L S C K L A N N L . . . . . . 123 CGAAAATTCGGCAGCACCTCCCCTATTTTCCTTACTTCGTCCCAATCCGACAACAATCCA R K F G S T S P I F L T S S Q S D N N P E N S A A P P L F S L L R P N P T T I Q T K I R Q H L P Y F P Y F V P I R Q Q S . . . . . . 183 AATTATCCACAACAATACCAATAACACATATAACCCAAAGAATCATATTAAACAGCCCCC N Y P Q Q Y Q - H I - P K E S Y - T A P I I H N N T N N T Y N P K N H I K Q P P K L S T T I P I T H I T Q R I I L N S P . . . . . . 243 TCCCCAAAACAGTTCCATCTCGGCAACTATATCAGCGAAGCAACGACACACCGAGTGATA S P K Q F H L G N Y I S E A T T H R V I P Q N S S I S A T I S A K Q R H T E - - L P K T V P S R Q L Y Q R S N D T P S D . . . . . . 303 ACTCGTTTTATAATTCGTAACACATCTACCTTATCGTATTCATCCTTTGGAGTATATACT T R F I I R N T S T L S Y S S F G V Y T L V L - F V T H L P Y R I H P L E Y I L N S F Y N S - H I Y L I V F I L W S I Y . . . . . . 363 CATAATCACATTCAACATATACATAATGCCAAAACATAATTTTCCTGCAGTTTGCTTTAA H N H I Q H I H N A K T - F S C S L L - I I T F N I Y I M P K H N F P A V C F N S - S H S T Y T - C Q N I I F L Q F A L . . . . . . 423 TCAAAACAGCCCAAGCACCAACACGACACCACTAATAATCCGATTATGGTTTCCATTCAT S K Q P K H Q H D T T N N P I M V S I H Q N S P S T N T T P L I I R L W F P F I I K T A Q A P T R H H - - S D Y G F H S . . . . . . 483 ACTTAGTACATTCAATAACTAAAACAAACTTCCTAAGCTACTGGTCACGCCCCCTTCTTA T - Y I Q - L K Q T S - A T G H A P F L L S T F N N - N K L P K L L V T P P S - Y L V H S I T K T N F L S Y W S R P L L . . . . . . 543 AAATTAATGGATAATTAACAAGGGTATTCTAAAGAATTAACACACCAAACAAGGAGATTA K L M D N - Q G Y S K E L T H Q T R R L N - W I I N K G I L K N - H T K Q G D Y K I N G - L T R V F - R I N T P N K E I . . . . . . 603 CTAACCAAATTATTTGCAGCTGCTGGCCAAGAGTTGCAGGGTTGGTTTCTCCATTT L T K L F A A A G Q E L Q G W F L H - P N Y L Q L L A K S C R V G F S I T N Q I I C S C W P R V A G L V S P F Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-24+_PGL-1_AGS-1_PPS_1 (53 322) (frame '0'; 267 bp, 89 residues) 1 QATIVPQLDK HRALLSCKLA NNLTKIRQHL PYFPYFVPIR QQSKLSTTIP ITHITQRIIL 61 NSPLPKTVPS RQLYQRSNDT PSDNSFYNS- 3-phase translation of AGS-1 (-strand): . . . . . . 658 AAATGGAGAAACCAACCCTGCAACTCTTGGCCAGCAGCTGCAAATAATTTGGTTAGTAAT K W R N Q P C N S W P A A A N N L V S N N G E T N P A T L G Q Q L Q I I W L V I M E K P T L Q L L A S S C K - F G - - . . . . . . 598 CTCCTTGTTTGGTGTGTTAATTCTTTAGAATACCCTTGTTAATTATCCATTAATTTTAAG L L V W C V N S L E Y P C - L S I N F K S L F G V L I L - N T L V N Y P L I L R S P C L V C - F F R I P L L I I H - F - . . . . . . 538 AAGGGGGCGTGACCAGTAGCTTAGGAAGTTTGTTTTAGTTATTGAATGTACTAAGTATGA K G A - P V A - E V C F S Y - M Y - V - R G R D Q - L R K F V L V I E C T K Y E E G G V T S S L G S L F - L L N V L S M . . . . . . 478 ATGGAAACCATAATCGGATTATTAGTGGTGTCGTGTTGGTGCTTGGGCTGTTTTGATTAA M E T I I G L L V V S C W C L G C F D - W K P - S D Y - W C R V G A W A V L I K N G N H N R I I S G V V L V L G L F - L . . . . . . 418 AGCAAACTGCAGGAAAATTATGTTTTGGCATTATGTATATGTTGAATGTGATTATGAGTA S K L Q E N Y V L A L C I C - M - L - V A N C R K I M F W H Y V Y V E C D Y E Y K Q T A G K L C F G I M Y M L N V I M S . . . . . . 358 TATACTCCAAAGGATGAATACGATAAGGTAGATGTGTTACGAATTATAAAACGAGTTATC Y T P K D E Y D K V D V L R I I K R V I I L Q R M N T I R - M C Y E L - N E L S I Y S K G - I R - G R C V T N Y K T S Y . . . . . . 298 ACTCGGTGTGTCGTTGCTTCGCTGATATAGTTGCCGAGATGGAACTGTTTTGGGGAGGGG T R C V V A S L I - L P R W N C F G E G L G V S L L R - Y S C R D G T V L G R G H S V C R C F A D I V A E M E L F W G G . . . . . . 238 GCTGTTTAATATGATTCTTTGGGTTATATGTGTTATTGGTATTGTTGTGGATAATTTGGA A V - Y D S L G Y M C Y W Y C C G - F G L F N M I L W V I C V I G I V V D N L D G C L I - F F G L Y V L L V L L W I I W . . . . . . 178 TTGTTGTCGGATTGGGACGAAGTAAGGAAAATAGGGGAGGTGCTGCCGAATTTTCGTTAG L L S D W D E V R K I G E V L P N F R - C C R I G T K - G K - G R C C R I F V R I V V G L G R S K E N R G G A A E F S L . . . . . . 118 ATTATTAGCTAGCTTACAAGAAAGTAAAGCACGATGTTTATCTAATTGCGGCACGATTGT I I S - L T R K - S T M F I - L R H D C L L A S L Q E S K A R C L S N C G T I V D Y - L A Y K K V K H D V Y L I A A R L . . . . . . 58 TGCTTGTTATAGATTAATAGCTTGAGCAGTAAATATTGGACGTGCGGCTCGATTAT C L L - I N S L S S K Y W T C G S I A C Y R L I A - A V N I G R A A R L L L V I D - - L E Q - I L D V R L D Y Maximal non-overlapping open reading frames (>= 64 codons): none PGL 2 (- strand): 2855 2019 AGS-1 (2855 2664,2486 2019) SCR (e 0.878 d 0.000 a 0.000,e 0.879) Exon 1 2855 2664 ( 192 n); score: 0.878 Intron 1 2663 2487 ( 177 n); Pd: 0.000 Pa: 0.000 Exon 2 2486 2019 ( 468 n); score: 0.879 PGS (2486 2019) SGN-E303695+ PGS (2509 2020) SGN-E543104+ PGS (2486 2020) SGN-E306317+ PGS (2509 2056) SGN-E543103- PGS (2486 2056) SGN-E305738+ PGS (2486 2063) SGN-E374134- PGS (2486 2063) SGN-E374135+ PGS (2855 2664,2486 2082) SGN-E538156+ PGS (2487 2084) SGN-E225616- PGS (2486 2084) SGN-E303256+ PGS (2486 2084) SGN-E310669+ PGS (2855 2664,2486 2119) SGN-E538151+ PGS (2855 2664,2486 2254) SGN-E268096+ 3-phase translation of AGS-1 (-strand): . . . . . . 2855 TCATTTATCATTTCACCGAATCCCGGGATGGGTAATGTTCATGCGGAGTTTCTTGCATAT S F I I S P N P G M G N V H A E F L A Y H L S F H R I P G W V M F M R S F L H M I Y H F T E S R D G - C S C G V S C I . . . . . . 2795 GTCACTGAGTCCCTCAATAGAGGGCCGGGTATGTATATTATATATATGATTGATGATGAG V T E S L N R G P G M Y I I Y M I D D E S L S P S I E G R V C I L Y I - L M M R C H - V P Q - R A G Y V Y Y I Y D - - - . . . . . . 2735 GATGGTTATGATGATGATGATGACAGAGATGTGTGATGATTATTTTGTCGAGCCCCTTAC D G Y D D D D D R D V - - L F C R A P Y M V M M M M M T E M C D D Y F V E P L T G W L - - - - - Q R C V M I I L S S P L . . : . . . . 2675 TAGGGAAGCTGT : GTGTAGACGCACAATTCGGTGATCCTCCTACCTAGGATATCTGCTCTG - G S C : V - T H N S V I L L P R I S A L R E A V : C R R T I R - S S Y L G Y L L C L G K L : C V D A Q F G D P P T - D I C S . . . . . . 2438 CTGATTGGGAGAGCTTTCCTGTTCTGGAGACTAGTCGTTTTGGTACGTAACTTTTTGTGT L I G R A F L F W R L V V L V R N F L C - L G E L S C S G D - S F W Y V T F C V A D W E S F P V L E T S R F G T - L F V . . . . . . 2378 AGTCTTTTGCTTGTCCATGGGTATGGCGGGGGCCCTGTCCCGTCGAGTTTCACTACTATG S L L L V H G Y G G G P V P S S F T T M V F C L S M G M A G A L S R R V S L L C - S F A C P W V W R G P C P V E F H Y Y . . . . . . 2318 CTCTTAGAGGTCTGTAGACATTATGTGGGTTGTATATATATGTTTTGGATAATGGTATAG L L E V C R H Y V G C I Y M F W I M V - S - R S V D I M W V V Y I C F G - W Y R A L R G L - T L C G L Y I Y V L D N G I . . . . . . 2258 ACATGGTTTGTTTGGGATGTCCGCTTGTACATGGGCAGCCTTGTCGGGTGCGTACATCAT T W F V W D V R L Y M G S L V G C V H H H G L F G M S A C T W A A L S G A Y I I D M V C L G C P L V H G Q P C R V R T S . . . . . . 2198 TATGTATTGTGTAGGGGCAGCCTTGTCGGCTTGCGTATGCTATTATGCTTTGGATAGTGG Y V L C R G S L V G L R M L L C F G - W M Y C V G A A L S A C V C Y Y A L D S G L C I V - G Q P C R L A Y A I M L W I V . . . . . . 2138 TGGCCTTTTTGGCTCGTGTATGTTGTTACAGTTGAATGGTTATGACTCCTTATGAGACAT W P F W L V Y V V T V E W L - L L M R H G L F G S C M L L Q L N G Y D S L - D M V A F L A R V C C Y S - M V M T P Y E T . . . . . . 2078 GCCCATTATATATATATATATATATATATATGGCGTTGGGGTTGTCTTGATTTGATTAAA A H Y I Y I Y I Y I W R W G C L D L I K P I I Y I Y I Y I Y G V G V V L I - L C P L Y I Y I Y I Y M A L G L S - F D - Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-24-_PGL-2_AGS-1_PPS_1 (2480 2259) (frame '1'; 219 bp, 73 residues) 1 THNSVILLPR ISALLIGRAF LFWRLVVLVR NFLCSLLLVH GYGGGPVPSS FTTMLLEVCR 61 HYVGCIYMFW IMV- AGS-2 (2802 2753,2709 2082) SCR (e 0.800 d 0.000 a 0.842,e 0.877) Exon 1 2802 2753 ( 50 n); score: 0.800 Intron 1 2752 2710 ( 43 n); Pd: 0.000 Pa: 0.842 Exon 2 2709 2082 ( 628 n); score: 0.877 PGS (2802 2753,2709 2082) SGN-E544254- 3-phase translation of AGS-2 (-strand): . . . . . : . 2802 TGCATATGTCACTGAGTCCCTCAATAGAGGGCCGGGTATGTATATTATAT : AGATGTGTGA C I C H - V P Q - R A G Y V Y Y I : D V - A Y V T E S L N R G P G M Y I I : - M C D H M S L S P S I E G R V C I L Y : R C V . . . . . . 2699 TGATTATTTTGTCGAGCCCCTTACTAGGGAAGCTGTGCACCTTATATGTTAAAGATATGC - L F C R A P Y - G S C A P Y M L K I C D Y F V E P L T R E A V H L I C - R Y A M I I L S S P L L G K L C T L Y V K D M . . . . . . 2639 ATGATTTTCACTTAAAAGGGTACATGTGTAGCGGTATTTTGTTTCAACTTACCATATTGG M I F T - K G T C V A V F C F N L P Y W - F S L K R V H V - R Y F V S T Y H I G H D F H L K G Y M C S G I L F Q L T I L . . . . . . 2579 TATCCTATCATCTTTACATTCTGCTTTACATACTTAGTACATTGTCCGTACTGACTCCCG Y P I I F T F C F T Y L V H C P Y - L P I L S S L H S A L H T - Y I V R T D S R V S Y H L Y I L L Y I L S T L S V L T P . . . . . . 2519 TTTCCTCAAGGGGGCTGCGTTTCATGCCTCTAGGTGTAGACGCACAATTCGGTGATCCTC F P Q G G C V S C L - V - T H N S V I L F L K G A A F H A S R C R R T I R - S S V S S R G L R F M P L G V D A Q F G D P . . . . . . 2459 CTACCTAGGATATCTGCTCTGCTGATTGGGAGAGCTTTCCTGTTCTGGAGACTAGTCGTT L P R I S A L L I G R A F L F W R L V V Y L G Y L L C - L G E L S C S G D - S F P T - D I C S A D W E S F P V L E T S R . . . . . . 2399 TTGGTACGTAACTTTTTGTGTAGTCTTTTGCTTGTCCATGGGTATGGCGGGGGCCCTGTC L V R N F L C S L L L V H G Y G G G P V W Y V T F C V V F C L S M G M A G A L S F G T - L F V - S F A C P W V W R G P C . . . . . . 2339 CCGTCGAGTTTCACTACTATGCTCTTAGAGGTCTGTAGACATTATGTGGGTTGTATATAT P S S F T T M L L E V C R H Y V G C I Y R R V S L L C S - R S V D I M W V V Y I P V E F H Y Y A L R G L - T L C G L Y I . . . . . . 2279 ATGTTTTGGATAATGGTATAGACATGGTTTGTTTGGGATGTCCGCTTGTACATGGGCAGC M F W I M V - T W F V W D V R L Y M G S C F G - W Y R H G L F G M S A C T W A A Y V L D N G I D M V C L G C P L V H G Q . . . . . . 2219 CTTGTCGGGTGCGTACATCATTATGTATTGTGTAGGGGCAGCCTTGTCGGCTTGCGTATG L V G C V H H Y V L C R G S L V G L R M L S G A Y I I M Y C V G A A L S A C V C P C R V R T S L C I V - G Q P C R L A Y . . . . . . 2159 CTATTATGCTTTGGATAGTGGTGGCCTTTTTGGCTCGTGTATGTTGTTACAGTTGAATGG L L C F G - W W P F W L V Y V V T V E W Y Y A L D S G G L F G S C M L L Q L N G A I M L W I V V A F L A R V C C Y S - M . . 2099 TTATGACTCCTTATGAGA L - L L M R Y D S L - V M T P Y E Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-24-_PGL-2_AGS-2_PPS_1 (2800 2753,2709 2452) (frame '0'; 303 bp, 101 residues) 1 HMSLSPSIEG RVCILYRCVM IILSSPLLGK LCTLYVKDMH DFHLKGYMCS GILFQLTILV 61 SYHLYILLYI LSTLSVLTPV SSRGLRFMPL GVDAQFGDPP T- >C06HBa0057J04.1-24-_PGL-2_AGS-2_PPS_2 (2480 2259) (frame '1'; 219 bp, 73 residues) 1 THNSVILLPR ISALLIGRAF LFWRLVVLVR NFLCSLLLVH GYGGGPVPSS FTTMLLEVCR 61 HYVGCIYMFW IMV- ... finished at: Mon Jul 24 23:17:33 2006 ________________________________________________________________________________ Sequence 25: C06HBa0057J04.1-25, from 1 to 4679, both strands analyzed. ... started at: Mon Jul 24 23:17:33 2006 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 0 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 0 No significant EST matches were found. ... finished at: Mon Jul 24 23:17:43 2006 ________________________________________________________________________________ Sequence 26: C06HBa0057J04.1-26, from 1 to 8386, both strands analyzed. ... started at: Mon Jul 24 23:17:43 2006 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 6 HitsTableSize = 8 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 4 HitsTableSize = 13 ******************************************************************************** EST sequence 3 -strand 730 n (File: SGN-E379982-) 1 AAAGGTAAGT TCATTTCATA CTTCAAGGCC GGGAAGATGT TTAGAAAAGG CTATATTTAC 61 CATCTGATTC GGGTGCATGA CATAAAGGCA GAGGCACTGA CTCTTCAATC AGTCTCGGTA 121 GTTAATGAAT TTCCTGATGT ATTCCCCGAG GAACTTCCAG GCCTTCCTCC AGAACGGGAG 181 ATAGAGTTTA CTATAGATGT ACTGCCAGAT ACCCAACCTA TATCTATACC TCCTTATAGA 241 ATGGCACCTG CTGAGTTGAA AGAATTGAAA GAGCAATTGA GGGATTTACT AGAAAAGGGC 301 TTCATCAGGC CTAGTACGTC ACCTTGGGGA GCACCGGTAC TATTTGTGAG GAAGAAGGAT 361 GGGTCGCTCC GGATGTGCAT TGATTATAGG CAGCTGAACA AAGTAACAAT AATGAACAGG 421 TATCCCCTCC CAAGGATTGA CGATCTATTT GACCAGTTGC AGGGTGCAAA GGGTTTTTCA 481 AAGATAGACT TGCGGTCAGG TTATCATCAG GTACGGGTTA GGGAGGCAGA TATCCCAATG 541 ACGGCATTCC GGACCCGATA TGGGCATTAT GTGTTTAGAG TGTTGTCTTT TGGGCTGACT 601 ATTGCTCCAG CGGTATTCAT GGATTTAATG AATTGAGTAT TTAATCCATT CCTTGATATG 661 TTTGTTATTG GATTTATAGA CGATATTCTG GTCTATTCAC GTTCAGAAGA GGAGCATGAA 721 GACTATTTAA Predicted gene structure (within gDNA segment 2503 to 456): Exon 1 1893 1164 ( 730 n); cDNA 1 730 ( 730 n); score: 0.953 MATCH C06HBa0057J04.1-26- SGN-E379982- 0.953 730 1.000 C PGS_C06HBa0057J04.1-26-_SGN-E379982- (1893 1164) Alignment (genomic DNA sequence = upper lines): AAAGGTAAGT TCATTTCATA CCTCAAGGCC GGGAAGATGG TTAGAAAAGG CTATATTTAC 1834 |||||||||| |||||||||| | |||||||| ||||||||| |||||||||| |||||||||| AAAGGTAAGT TCATTTCATA CTTCAAGGCC GGGAAGATGT TTAGAAAAGG CTATATTTAC 60 CATCTGATTC GGGTGCATGA CATAAAGGCA GAGACACCGA CTCTTCAATC AGTCCAGGTA 1774 |||||||||| |||||||||| |||||||||| ||| ||| || |||||||||| |||| |||| CATCTGATTC GGGTGCATGA CATAAAGGCA GAGGCACTGA CTCTTCAATC AGTCTCGGTA 120 GTTAATGAAT TTCCCGATAT ATTCCCCGAG GAACTTCCAG GCCTTCCTCC AGAACGGGAG 1714 |||||||||| |||| ||| | |||||||||| |||||||||| |||||||||| |||||||||| GTTAATGAAT TTCCTGATGT ATTCCCCGAG GAACTTCCAG GCCTTCCTCC AGAACGGGAG 180 ATAGAGTTTA CTATAGATGT ACTACCAGAT GCCCACCCTA TATCTATACC TCCTTATAGA 1654 |||||||||| |||||||||| ||| |||||| |||| |||| |||||||||| |||||||||| ATAGAGTTTA CTATAGATGT ACTGCCAGAT ACCCAACCTA TATCTATACC TCCTTATAGA 240 ATGGCACCTG CTGAGTTGGA AGAATTGAAA GAGCAATTGA GGGATTTGCT AGAAAAGGGC 1594 |||||||||| |||||||| | |||||||||| |||||||||| ||||||| || |||||||||| ATGGCACCTG CTGAGTTGAA AGAATTGAAA GAGCAATTGA GGGATTTACT AGAAAAGGGC 300 TTCATCAGGC CTAGTACGTC ACCTTGGGGA GCACCGGTAC TGTTTGTGAG GAAGAAGGAT 1534 |||||||||| |||||||||| |||||||||| |||||||||| | |||||||| |||||||||| TTCATCAGGC CTAGTACGTC ACCTTGGGGA GCACCGGTAC TATTTGTGAG GAAGAAGGAT 360 GGGTCGCTGC GGATGTGCAT TGATTATAGG CAGTTGAACA AAGTAACAAT AAAGAACAGG 1474 |||||||| | |||||||||| |||||||||| ||| |||||| |||||||||| || ||||||| GGGTCGCTCC GGATGTGCAT TGATTATAGG CAGCTGAACA AAGTAACAAT AATGAACAGG 420 TATCCCCTCC CAAGGATTGA CGATCTACTT GACCAGTTGC AGGGTGCAAA GTGTTTTTCA 1414 |||||||||| |||||||||| ||||||| || |||||||||| |||||||||| | |||||||| TATCCCCTCC CAAGGATTGA CGATCTATTT GACCAGTTGC AGGGTGCAAA GGGTTTTTCA 480 AAGATAGACT TGCGGTCAGG TTATCATCAG GTGCGGGTAA GGGAGGCAGA TATTCCAAAG 1354 |||||||||| |||||||||| |||||||||| || ||||| | |||||||||| ||| |||| | AAGATAGACT TGCGGTCAGG TTATCATCAG GTACGGGTTA GGGAGGCAGA TATCCCAATG 540 ACAGCATTCC GGACCCGATA TGGGCATTAT GAGTTTAGAG TGCTGTCTTT TGGGCTGACT 1294 || ||||||| |||||||||| |||||||||| | |||||||| || ||||||| |||||||||| ACGGCATTCC GGACCCGATA TGGGCATTAT GTGTTTAGAG TGTTGTCTTT TGGGCTGACT 600 AATGCTCCAG CGGTGTTCAT GGATTTAATA AATTGAGTAT TTAAACCATT CCTTGATATG 1234 | |||||||| |||| ||||| ||||||||| |||||||||| |||| ||||| |||||||||| ATTGCTCCAG CGGTATTCAT GGATTTAATG AATTGAGTAT TTAATCCATT CCTTGATATG 660 TTTGTTATTG TATTTATAGA CGATATTCTG GTCTATTCAC GTTCAGAAGA GGAGCATGCA 1174 |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| |||||||| | TTTGTTATTG GATTTATAGA CGATATTCTG GTCTATTCAC GTTCAGAAGA GGAGCATGAA 720 GATCATTTAA 1164 || |||||| GACTATTTAA 730 hqPGS_C06HBa0057J04.1-26-_SGN-E379982- (1893 1164) ******************************************************************************** EST sequence 8 -strand 521 n (File: SGN-E201553-) 1 TACCCAACCT ATATCTATAC CTCCTTATAG AATGGCACCT GCTGAGTTGA AAGAATTGAA 61 AGAGCAATTG AGGGATTTAC TAGAAAAGGG CTTCATCAGG CCTAGTACGT CACCTTGGGG 121 AGCACCGGTA CTATTTGTGA GGAAGAAGGA TGGGTCGCTC CGGATGTGCA TTGATTATAG 181 GCAGCTGAAC AAAGTAACAA TAATGAACAG GTATCCCCTC CCAAGGATTG ACGATCTATT 241 TGACCAGTTG CAGGGTGCAA AGGGTTTTTC AAAGATAGAC TTGCGGTCAG GTTATCATCA 301 GGTACGGGTT AGGGAGGCAG ATATCCCAAT GACGGCATTC CGGACCCGAT ATGGGCATTA 361 TGTGTTTAGA GTGTTGTCTT TTGGGCTGAC TATTGCTCCA GCGGTATTCA TGGATTTAAT 421 GAATTGAGTA TTTAATCCAT TCCTTGATAT GTTTGTTATT GGATTTATAG CCCCTATTAT 481 GGTCTATTCA CGTTCAGAAA AGGAGCATGA AGACTATTTA A Predicted gene structure (within gDNA segment 2357 to 366): Exon 1 1682 1164 ( 519 n); cDNA 3 521 ( 519 n); score: 0.944 MATCH C06HBa0057J04.1-26- SGN-E201553- 0.944 519 0.996 C PGS_C06HBa0057J04.1-26-_SGN-E201553- (1682 1164) Alignment (genomic DNA sequence = upper lines): CCCACCCTAT ATCTATACCT CCTTATAGAA TGGCACCTGC TGAGTTGGAA GAATTGAAAG 1623 |||| ||||| |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| CCCAACCTAT ATCTATACCT CCTTATAGAA TGGCACCTGC TGAGTTGAAA GAATTGAAAG 62 AGCAATTGAG GGATTTGCTA GAAAAGGGCT TCATCAGGCC TAGTACGTCA CCTTGGGGAG 1563 |||||||||| |||||| ||| |||||||||| |||||||||| |||||||||| |||||||||| AGCAATTGAG GGATTTACTA GAAAAGGGCT TCATCAGGCC TAGTACGTCA CCTTGGGGAG 122 CACCGGTACT GTTTGTGAGG AAGAAGGATG GGTCGCTGCG GATGTGCATT GATTATAGGC 1503 |||||||||| ||||||||| |||||||||| ||||||| || |||||||||| |||||||||| CACCGGTACT ATTTGTGAGG AAGAAGGATG GGTCGCTCCG GATGTGCATT GATTATAGGC 182 AGTTGAACAA AGTAACAATA AAGAACAGGT ATCCCCTCCC AAGGATTGAC GATCTACTTG 1443 || ||||||| |||||||||| | |||||||| |||||||||| |||||||||| |||||| ||| AGCTGAACAA AGTAACAATA ATGAACAGGT ATCCCCTCCC AAGGATTGAC GATCTATTTG 242 ACCAGTTGCA GGGTGCAAAG TGTTTTTCAA AGATAGACTT GCGGTCAGGT TATCATCAGG 1383 |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| ACCAGTTGCA GGGTGCAAAG GGTTTTTCAA AGATAGACTT GCGGTCAGGT TATCATCAGG 302 TGCGGGTAAG GGAGGCAGAT ATTCCAAAGA CAGCATTCCG GACCCGATAT GGGCATTATG 1323 | ||||| || |||||||||| || |||| || | |||||||| |||||||||| |||||||||| TACGGGTTAG GGAGGCAGAT ATCCCAATGA CGGCATTCCG GACCCGATAT GGGCATTATG 362 AGTTTAGAGT GCTGTCTTTT GGGCTGACTA ATGCTCCAGC GGTGTTCATG GATTTAATAA 1263 ||||||||| | |||||||| |||||||||| ||||||||| ||| |||||| |||||||| | TGTTTAGAGT GTTGTCTTTT GGGCTGACTA TTGCTCCAGC GGTATTCATG GATTTAATGA 422 ATTGAGTATT TAAACCATTC CTTGATATGT TTGTTATTGT ATTTATAGAC GATATTCTGG 1203 |||||||||| ||| |||||| |||||||||| ||||||||| |||||||| | |||| ||| ATTGAGTATT TAATCCATTC CTTGATATGT TTGTTATTGG ATTTATAGCC CCTATTATGG 482 TCTATTCACG TTCAGAAGAG GAGCATGCAG ATCATTTAA 1164 |||||||||| ||||||| || ||||||| || | |||||| TCTATTCACG TTCAGAAAAG GAGCATGAAG ACTATTTAA 521 hqPGS_C06HBa0057J04.1-26-_SGN-E201553- (1682 1164) ******************************************************************************** EST sequence 1 -strand 598 n (File: SGN-E350824-) 1 AGCATGTTAG ATTTTCTTCC CAGCCAGCAC AGAGTGCACC CCCACGTTTC ATGGGTAGGG 61 GGTTCGATCG TATGGGATAT TCAGAAGCTG GTCAGAGCTC TAGGGCGTTA GGGTCACAGA 121 TGGGCAGGAG TTTGAGCCAG TCGAGGCCAC CTTTGCCTCA GTGTTCTCAT TGTGGTAAGT 181 CCCATCCTGG GGAATGTCGT TGGGCTACAG GTGCGTGTTT TTCTTGCGGC CGTCAGGGCC 241 ATACTATGAG GGAGTGTCAC CTTAGAGGTA GTGCAGGTGG TATGGCACAG CCTACAGGGT 301 CCGTTGCTGG TTCATTTTCT TCTGTGGCTA TGCGCCCTAC GGGGCAGGGT ATTCAGGCGC 361 CAGCAGGCCA TGGTAGAGGA CGTGGTGGAG CTTCCAGTTC TAGCAGTGCC TCGAACCGTA 421 TATATGCTTT GACTAATAGG CAGGATCAGG GGGTGTCACC TAATGTGATC ACAGGTATAT 481 TATCACTATT CTCCCGAAGT GTGTATACAT TGATAGACCC AGGTTCCACC TTATCATATA 541 TATCTCCCTT TGTTGCTAGT AGGATCGGAA TAGAGTATGA GTTGATAGAA CCATTTGA Predicted gene structure (within gDNA segment 3345 to 1538): Exon 1 2735 2138 ( 598 n); cDNA 1 598 ( 598 n); score: 0.958 MATCH C06HBa0057J04.1-26- SGN-E350824- 0.958 598 1.000 C PGS_C06HBa0057J04.1-26-_SGN-E350824- (2735 2138) Alignment (genomic DNA sequence = upper lines): AGCATGTTAG ATTTTCTTCC CAGCCAGCAC AGAGGGCACC CCCACGTTTC ATGGGTAGGG 2676 |||||||||| |||||||||| |||||||||| |||| ||||| |||||||||| |||||||||| AGCATGTTAG ATTTTCTTCC CAGCCAGCAC AGAGTGCACC CCCACGTTTC ATGGGTAGGG 60 GGTTCGATCG TATGGGATAC TCGGAAGCTA GTTAGAGCTC TAGGGCGTCA AGGTCACAGA 2616 |||||||||| ||||||||| || |||||| || ||||||| |||||||| | ||||||||| GGTTCGATCG TATGGGATAT TCAGAAGCTG GTCAGAGCTC TAGGGCGTTA GGGTCACAGA 120 TGGGCAGGGG TTTGAGCCAG TCAAGGCCAC CTTTGCCTCG GTGTTCTCAT TGTGGTAAGT 2556 |||||||| | |||||||||| || ||||||| ||||||||| |||||||||| |||||||||| TGGGCAGGAG TTTGAGCCAG TCGAGGCCAC CTTTGCCTCA GTGTTCTCAT TGTGGTAAGT 180 CCCATCCTGG GGAATGTCGT TGGGCTACAG GTGCGTGTTT TTCTTGCGGC CGTCAGGGCC 2496 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCCATCCTGG GGAATGTCGT TGGGCTACAG GTGCGTGTTT TTCTTGCGGC CGTCAGGGCC 240 ATACTATGAG GGAGTGTCAC CTTAGAGGTA GTGCAGGTGG TATGGCACAT CCTACAGGGT 2436 |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| |||||||||| ATACTATGAG GGAGTGTCAC CTTAGAGGTA GTGCAGGTGG TATGGCACAG CCTACAGGGT 300 CCGTTGCTGG TTCATCTTCT TCTGTGGCTG TGCGCCCTAC GGGGCAGGGT ATTCAGGCGC 2376 |||||||||| ||||| |||| ||||||||| |||||||||| |||||||||| |||||||||| CCGTTGCTGG TTCATTTTCT TCTGTGGCTA TGCGCCCTAC GGGGCAGGGT ATTCAGGCGC 360 CAACAGGCCG TGGTAGAGGA CGTGATGGAG CTTCCAGTTC TAGCGGTCCC TCAAACCGTA 2316 || |||||| |||||||||| |||| ||||| |||||||||| |||| || || || ||||||| CAGCAGGCCA TGGTAGAGGA CGTGGTGGAG CTTCCAGTTC TAGCAGTGCC TCGAACCGTA 420 TATATGCTTT GACTAATAGG CAAGATTAGG AGGCGTCACC TAATGTGATC ACAGGTATAT 2256 |||||||||| |||||||||| || ||| ||| || |||||| |||||||||| |||||||||| TATATGCTTT GACTAATAGG CAGGATCAGG GGGTGTCACC TAATGTGATC ACAGGTATAT 480 TATCACTATT CTCCCGAAGT GTGTATGCAT TGATAGACCC AGGTTCCACC TTATCATATA 2196 |||||||||| |||||||||| |||||| ||| |||||||||| |||||||||| |||||||||| TATCACTATT CTCCCGAAGT GTGTATACAT TGATAGACCC AGGTTCCACC TTATCATATA 540 TATCTCCCTT TGTTGCTAGT AGGATCGGAA TAGAGTCTGA GTTGATAGAA CCATTTGA 2138 |||||||||| |||||||||| |||||||||| |||||| ||| |||||||||| |||||||| TATCTCCCTT TGTTGCTAGT AGGATCGGAA TAGAGTATGA GTTGATAGAA CCATTTGA 598 hqPGS_C06HBa0057J04.1-26-_SGN-E350824- (2735 2138) ******************************************************************************** EST sequence 9 +strand 196 n (File: SGN-E379248+) 1 CATCGTTATG TGATGGGATT GGATGGTTAT CTGATTGACA TTCCTATGGC AGTGACTCTT 61 CATCCAGGTA TGGACATTGC TCGGGTGCAG GCATATGCAC AGGGGGTAGA GGATCGGCAC 121 CGGGGACGTT AGCCAGATAG AGATTATAAT AGAGGGCCCC ATAAGAGGGC TAGATCACCA 181 GGTTATCTTG ACGAGT Predicted gene structure (within gDNA segment 3948 to 1274): Exon 1 2952 2757 ( 196 n); cDNA 1 196 ( 196 n); score: 0.929 MATCH C06HBa0057J04.1-26- SGN-E379248+ 0.929 196 1.000 C PGS_C06HBa0057J04.1-26-_SGN-E379248+ (2952 2757) Alignment (genomic DNA sequence = upper lines): CATCGTTATG TGATGGGATT GGATCGTTAT CTGATTGACG GTTGTATGGC AGTGACTCTT 2893 |||||||||| |||||||||| |||| ||||| ||||||||| | |||||| |||||||||| CATCGTTATG TGATGGGATT GGATGGTTAT CTGATTGACA TTCCTATGGC AGTGACTCTT 60 CAGCCAGGTA TGGACATTGC TCGGGTGCAG GCATATGCAT AGGGGGTAGA GGATTGGCAC 2833 || ||||||| |||||||||| |||||||||| ||||||||| |||||||||| |||| ||||| CATCCAGGTA TGGACATTGC TCGGGTGCAG GCATATGCAC AGGGGGTAGA GGATCGGCAC 120 CGGGGACGTC AGCCAGATAG AGATTATAAT AGAGGCCAGC ATAAGAGGGC TAGATCAGCA 2773 ||||||||| |||||||||| |||||||||| ||||| | | |||||||||| ||||||| || CGGGGACGTT AGCCAGATAG AGATTATAAT AGAGGGCCCC ATAAGAGGGC TAGATCACCA 180 GGTTATCCTG ACGAGT 2757 ||||||| || |||||| GGTTATCTTG ACGAGT 196 hqPGS_C06HBa0057J04.1-26-_SGN-E379248+ (2952 2757) ******************************************************************************** EST sequence 18 +strand 577 n (File: SGN-E543104+) 1 GGCAGCCATG GAAATGGAGA AAATCAACCA TGCAACTCTT GACCAGCAGC TGCAAAGAAT 61 TTGGTTTGAT TAATAGCTTG AGCAGTAAAT ATTGGACGTG CGGCTCGACT ATACGGTGTA 121 GACGCTCAGT TCGGTGATCC TCCCGCCTAG GATATCTACT TTGCTGAGTG GGAGAGCTCC 181 ACTGTTTCGT AGCCCAGTCA TTTTGGTACA TAACTTTTTG TGTAGTCTTT TGCTTGTCTA 241 TGGGTATGGT GGGGCCCTGT CCCGTCGAGT TTCACTACTA TACTCTTAGA GGTCCATAGA 301 CATCGCGTGG GTTGTATATA TATGTTTTGG ATAATGGTCT GGACATGGTT TGTTTGGGAT 361 GTCCACTTGT ACAAGGGCAG CCTTGTCAGC TGCGTACATC TTTGTGTATT GTGTAGTGGC 421 AGCCTTGTCG GCTGCGTATG CTATTATGCT TTGGATAGTG GCGGCCTTGT CGGCTCGCGT 481 ATGTTGTTAC GGTTGAATGG TTATGACTCC TTATGAGACA GATCCACTTT ATATATATAT 541 ATATGGCGAT GGGGTTGGCT TGATTTGATT AAAAAAA Predicted gene structure (within gDNA segment 6503 to 2884): Exon 1 5843 5775 ( 69 n); cDNA 1 68 ( 68 n); score: 0.848 Intron 1 5774 5233 ( 542 n); Pd: 0.900 (s: 0.84), Pa: 0.868 (s: 0.98) Exon 2 5232 5186 ( 47 n); cDNA 69 115 ( 47 n); score: 0.979 Intron 2 5185 4460 ( 726 n); Pd: 0.976 (s: 0.98), Pa: 0.000 (s: 0.94) Exon 3 4459 4005 ( 455 n); cDNA 116 570 ( 455 n); score: 0.895 MATCH C06HBa0057J04.1-26- SGN-E543104+ 0.888 571 0.990 C PGS_C06HBa0057J04.1-26-_SGN-E543104+ (5843 5775,5232 5186,4459 4005) Alignment (genomic DNA sequence = upper lines): GGCAGCCATG GAAATGGAG- AAACCAACCC TGCAACTTTT GGCCAGCAGC TGCAAATAAT 5785 |||||||||| ||||||||| ||| ||||| ||||||| || | |||||||| |||||| ||| GGCAGCCATG GAAATGGAGA AAATCAACCA TGCAACTCTT GACCAGCAGC TGCAAAGAAT 60 TTGGTTAGTA ATCTCCTTGT TTGGTGTGTT AATTCTTTAG AATACCCTTG TTAATTATCC 5725 |||||| | TTGGTT--TG .......... .......... .......... .......... .......... 68 ATTAATTTTA AGTAGGGGGC GTGACCAGTA GCTTAGGAAG TTTGTTTTAG TTATTGAATG 5665 .......... .......... .......... .......... .......... .......... 68 TGCTAAGCAT GAATGGAAAC CATAATTGGA TTATTAGTGG TGTCGTGTTG GTGCTTGGGC 5605 .......... .......... .......... .......... .......... .......... 68 TGTTTTGATT AAAGCAAACT GCAGGAAAAT TCTATTTTGG CATTATGTAT ATGTTGAATG 5545 .......... .......... .......... .......... .......... .......... 68 TGATTATGAG TATATACTCC AAAGGATGAA TACGATAAGG TAGATGTGTT GGGAATTATA 5485 .......... .......... .......... .......... .......... .......... 68 AAACGAGTTA TCACTCGGTG TGTCGTTGCT ATGGTTGCCG AGACGGAACT ATTTTGGAGA 5425 .......... .......... .......... .......... .......... .......... 68 GGGGGCTGTT TAATATGATT CTTTGGGTTA TATGTGTTAT TGGTATTGCT GTGGATAATT 5365 .......... .......... .......... .......... .......... .......... 68 TGGATTGTTG TCGGATTGGG TCGAAGTAAG GATGGGGAGG TGCTGCCGAA TTTTTGTTAG 5305 .......... .......... .......... .......... .......... .......... 68 ATTATTAGCT AGCTTACAAG AAAGTAAAGC ACGATGTTTA TCTAATTGCG GCACGATTGT 5245 .......... .......... .......... .......... .......... .......... 68 TGCTTGTTAT AGATTAATAG CTTGAGCAGT AAATATTGGA CGTGCGGCTC GATTATACGG 5185 |||||||| |||||||||| |||||||||| |||||||||| || |||||| .......... ..ATTAATAG CTTGAGCAGT AAATATTGGA CGTGCGGCTC GACTATACG. 115 TATGTAACGC TGTCCCTTCT TTCATTGGTT GGCGTGACTT TTAAAAATAA GCGAATAACG 5125 .......... .......... .......... .......... .......... .......... 115 GATAGATTTG ATACTTACCT CTAGAGCGTC TAGGTGACGT ATATTCTTGC TTCCACAATT 5065 .......... .......... .......... .......... .......... .......... 115 ATTCCTCTAT ATATCGGCTA TGTCTAAGGC TATGATGATC TCTAATATCT ATGGTAATGC 5005 .......... .......... .......... .......... .......... .......... 115 TTCTTAGAGT CATTGAGATT TTACGTTTCC ATATCGTATT AAAGGTTCAT AATCTTGATA 4945 .......... .......... .......... .......... .......... .......... 115 AAACATTAAT CATTGGTAAT ACTCCTTGCT GGTTCACGTT GATTGTTCTA TTGAGTTATA 4885 .......... .......... .......... .......... .......... .......... 115 AGAAATGATT TTAATTGCAT ATGGTTGCTC ATAATATTCT GCTCGTGCAT AGAGTCATTT 4825 .......... .......... .......... .......... .......... .......... 115 ATCATTTCAC CGAGTCCCAG GTCGGGTAAT GTTCGTGCGG AGTTTCTTGC ATATGTCACC 4765 .......... .......... .......... .......... .......... .......... 115 GAGTTCCTCA CTAGAGGGCC TGGTATGTAT ATTATATATA TATGATTGGT GATGAGGATG 4705 .......... .......... .......... .......... .......... .......... 115 GTTATGATGA TGATGATGAC GGAGATGACG TGATGATTAT TTTGCCGAGC CCCATACTAG 4645 .......... .......... .......... .......... .......... .......... 115 GGAAGCTGGG CACCTTAAAT GTTAAATATA TGCATGATTT TCACTTAAAA GGGTATATGT 4585 .......... .......... .......... .......... .......... .......... 115 GTAGCGATAT TTTGTTTCGA CTTGCCATAT TGGTATCCTG GCATCTTTAC CTTATGCTTT 4525 .......... .......... .......... .......... .......... .......... 115 ACATACTCAG TACATTGTCC GTACTGACCC CGCTTTCCTC GGGGGGCTGC GTTTCATGCC 4465 .......... .......... .......... .......... .......... .......... 115 TGCAGGTGTA GACACACAGT TCGGTGATCC TCCCGCCTAG GATATCTACT CTGCTGATTG 4405 ||||| ||| | |||| |||||||||| |||||||||| |||||||||| |||||| || .....GTGTA GACGCTCAGT TCGGTGATCC TCCCGCCTAG GATATCTACT TTGCTGAGTG 170 GGAGAGCTCC ACTGTTCCGG AGCCCTGTCG TTTTGGTACA TAAC-TTTTG TGTAGTCTTT 4346 |||||||||| |||||| || ||||| ||| |||||||||| |||| ||||| |||||||||| GGAGAGCTCC ACTGTTTCGT AGCCCAGTCA TTTTGGTACA TAACTTTTTG TGTAGTCTTT 230 TGCTCGTCTA TGGGTATGGC GGGGCCCTGT CCCGTCGAGT TTCACTAATG TACTCTTAGA 4286 |||| ||||| ||||||||| |||||||||| |||||||||| ||||||| | |||||||||| TGCTTGTCTA TGGGTATGGT GGGGCCCTGT CCCGTCGAGT TTCACTACTA TACTCTTAGA 290 GGTCTTTGGA CATTATGTGG GTTATATATA TATGTTTTGG ATAATGGTCT GGACATGGTT 4226 |||| | || ||| |||| ||| |||||| |||||||||| |||||||||| |||||||||| GGTCCATAGA CATCGCGTGG GTTGTATATA TATGTTTTGG ATAATGGTCT GGACATGGTT 350 TGTTTGGGAT GTTCGCTTGT ACAGGGGCAG CCTTGTCAGC TGCGTACATC ATTGTGTATT 4166 |||||||||| || | ||||| ||| |||||| |||||||||| |||||||||| ||||||||| TGTTTGGGAT GTCCACTTGT ACAAGGGCAG CCTTGTCAGC TGCGTACATC TTTGTGTATT 410 GTGTAGTGGC AGCCTTATCG GCATACGTAT GTTATTATGC TTTGAATAGT GGCGGCCTTG 4106 |||||||||| |||||| ||| || | ||||| | |||||||| |||| ||||| |||||||||| GTGTAGTGGC AGCCTTGTCG GC-TGCGTAT GCTATTATGC TTTGGATAGT GGCGGCCTTG 469 TCGGCTCGCG TATGTTGTTA TGGTTGAATG GTTATGACTC CTTATGAGAC AGGTCCTC-T 4047 |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| || ||| | | TCGGCTCGCG TATGTTGTTA CGGTTGAATG GTTATGACTC CTTATGAGAC AGATCCACTT 529 TATATATATA TGACGTTGGG GTTGGCTTGA TTTGATTAAA TT 4005 |||||||||| | | | | | | || ||| |||||| | || TATATATATA T-ATATGGCG ATGGGGTTGG CTTGATTTGA TT 570 hqPGS_C06HBa0057J04.1-26-_SGN-E543104+ (5843 5775,5232 5186,4459 4005) ******************************************************************************** EST sequence 13 +strand 547 n (File: SGN-E305738+) 1 AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 61 ATTTATACCT TGAGCAGTAA ATATTGGACG TACGGCTCGA CTATTCGGTG TAGACGCTCA 121 GTTCGGTGAT CCTCCCGCCT AGGATATCTA CTCTGCTTTT TGGGAGAGCT CCACTGTTCC 181 GGAGCCCAGT CGTTTTGGTA CATAACTTCT TATGTAGTCT TTTGCTTGTC TATGGGTATG 241 GCGGGGCCCT GTCCCGTCAA GTTTCACTAC TATACTCTTA GAGGTCTGTA GACATCGTGT 301 GGGTTGTATA ATTATGTTTT GGATAATGGT CTGGACATGG TTTGTTTGGG ATGTCCATTT 361 GTACAAGTGC AGCCTTGTCG GTTGTGAACA TCATTGTGTA TTGTGTAGTG GCAGCCTCGT 421 CGGCTGCGTA TGCTATTATG TTTTGGATAG TGGCGGCCTT GTCGGCTCGC ATATGTTGTT 481 ACGATTTAAT GGTTATGACT CTTTATGAGA TAGATCCACT TTATATATGA AATGAATGGA 541 CTAACTA Predicted gene structure (within gDNA segment 6228 to 2241): Exon 1 5840 5782 ( 59 n); cDNA 1 60 ( 60 n); score: 0.822 Intron 1 5781 5233 ( 549 n); Pd: 0.993 (s: 0.79), Pa: 0.868 (s: 0.89) Exon 2 5232 5186 ( 47 n); cDNA 61 107 ( 47 n); score: 0.894 Intron 2 5185 4460 ( 726 n); Pd: 0.976 (s: 0.89), Pa: 0.000 (s: 0.96) Exon 3 4459 4020 ( 440 n); cDNA 108 546 ( 439 n); score: 0.874 MATCH C06HBa0057J04.1-26- SGN-E305738+ 0.868 546 0.998 C PGS_C06HBa0057J04.1-26-_SGN-E305738+ (5840 5782,5232 5186,4459 4020) Alignment (genomic DNA sequence = upper lines): AGCCATGGAA ATGGAGA-AA CCAACCCTGC AACTTTTGGC CAGCAGCTGC AAATAATTTG 5782 |||||||||| ||||||| || ||| || | |||| |||| |||||||||| ||| || ||| AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 60 GTTAGTAATC TCCTTGTTTG GTGTGTTAAT TCTTTAGAAT ACCCTTGTTA ATTATCCATT 5722 .......... .......... .......... .......... .......... .......... 60 AATTTTAAGT AGGGGGCGTG ACCAGTAGCT TAGGAAGTTT GTTTTAGTTA TTGAATGTGC 5662 .......... .......... .......... .......... .......... .......... 60 TAAGCATGAA TGGAAACCAT AATTGGATTA TTAGTGGTGT CGTGTTGGTG CTTGGGCTGT 5602 .......... .......... .......... .......... .......... .......... 60 TTTGATTAAA GCAAACTGCA GGAAAATTCT ATTTTGGCAT TATGTATATG TTGAATGTGA 5542 .......... .......... .......... .......... .......... .......... 60 TTATGAGTAT ATACTCCAAA GGATGAATAC GATAAGGTAG ATGTGTTGGG AATTATAAAA 5482 .......... .......... .......... .......... .......... .......... 60 CGAGTTATCA CTCGGTGTGT CGTTGCTATG GTTGCCGAGA CGGAACTATT TTGGAGAGGG 5422 .......... .......... .......... .......... .......... .......... 60 GGCTGTTTAA TATGATTCTT TGGGTTATAT GTGTTATTGG TATTGCTGTG GATAATTTGG 5362 .......... .......... .......... .......... .......... .......... 60 ATTGTTGTCG GATTGGGTCG AAGTAAGGAT GGGGAGGTGC TGCCGAATTT TTGTTAGATT 5302 .......... .......... .......... .......... .......... .......... 60 ATTAGCTAGC TTACAAGAAA GTAAAGCACG ATGTTTATCT AATTGCGGCA CGATTGTTGC 5242 .......... .......... .......... .......... .......... .......... 60 TTGTTATAGA TTAATAGCTT GAGCAGTAAA TATTGGACGT GCGGCTCGAT TATACGGTAT 5182 | || ||| ||| |||||||||| |||||||||| |||||||| ||| || .........A TTTATACCTT GAGCAGTAAA TATTGGACGT ACGGCTCGAC TATTCG.... 107 GTAACGCTGT CCCTTCTTTC ATTGGTTGGC GTGACTTTTA AAAATAAGCG AATAACGGAT 5122 .......... .......... .......... .......... .......... .......... 107 AGATTTGATA CTTACCTCTA GAGCGTCTAG GTGACGTATA TTCTTGCTTC CACAATTATT 5062 .......... .......... .......... .......... .......... .......... 107 CCTCTATATA TCGGCTATGT CTAAGGCTAT GATGATCTCT AATATCTATG GTAATGCTTC 5002 .......... .......... .......... .......... .......... .......... 107 TTAGAGTCAT TGAGATTTTA CGTTTCCATA TCGTATTAAA GGTTCATAAT CTTGATAAAA 4942 .......... .......... .......... .......... .......... .......... 107 CATTAATCAT TGGTAATACT CCTTGCTGGT TCACGTTGAT TGTTCTATTG AGTTATAAGA 4882 .......... .......... .......... .......... .......... .......... 107 AATGATTTTA ATTGCATATG GTTGCTCATA ATATTCTGCT CGTGCATAGA GTCATTTATC 4822 .......... .......... .......... .......... .......... .......... 107 ATTTCACCGA GTCCCAGGTC GGGTAATGTT CGTGCGGAGT TTCTTGCATA TGTCACCGAG 4762 .......... .......... .......... .......... .......... .......... 107 TTCCTCACTA GAGGGCCTGG TATGTATATT ATATATATAT GATTGGTGAT GAGGATGGTT 4702 .......... .......... .......... .......... .......... .......... 107 ATGATGATGA TGATGACGGA GATGACGTGA TGATTATTTT GCCGAGCCCC ATACTAGGGA 4642 .......... .......... .......... .......... .......... .......... 107 AGCTGGGCAC CTTAAATGTT AAATATATGC ATGATTTTCA CTTAAAAGGG TATATGTGTA 4582 .......... .......... .......... .......... .......... .......... 107 GCGATATTTT GTTTCGACTT GCCATATTGG TATCCTGGCA TCTTTACCTT ATGCTTTACA 4522 .......... .......... .......... .......... .......... .......... 107 TACTCAGTAC ATTGTCCGTA CTGACCCCGC TTTCCTCGGG GGGCTGCGTT TCATGCCTGC 4462 .......... .......... .......... .......... .......... .......... 107 AGGTGTAGAC ACACAGTTCG GTGATCCTCC CGCCTAGGAT ATCTACTCTG CTGATTGGGA 4402 |||||||| | ||||||| |||||||||| |||||||||| |||||||||| || |||||| ..GTGTAGAC GCTCAGTTCG GTGATCCTCC CGCCTAGGAT ATCTACTCTG CTTTTTGGGA 165 GAGCTCCACT GTTCCGGAGC CCTGTCGTTT TGGTACATAA CTT-TTGTGT AGTCTTTTGC 4343 |||||||||| |||||||||| || ||||||| |||||||||| ||| || ||| |||||||||| GAGCTCCACT GTTCCGGAGC CCAGTCGTTT TGGTACATAA CTTCTTATGT AGTCTTTTGC 225 TCGTCTATGG GTATGGCGGG GCCCTGTCCC GTCGAGTTTC ACTAATGTAC TCTTAGAGGT 4283 | |||||||| |||||||||| |||||||||| ||| |||||| |||| | ||| |||||||||| TTGTCTATGG GTATGGCGGG GCCCTGTCCC GTCAAGTTTC ACTACTATAC TCTTAGAGGT 285 CTTTGGACAT TATGTGGGTT ATATATATAT GTTTTGGATA ATGGTCTGGA CATGGTTTGT 4223 || | ||||| |||||||| |||| ||| |||||||||| |||||||||| |||||||||| CTGTAGACAT CGTGTGGGTT GTATAATTAT GTTTTGGATA ATGGTCTGGA CATGGTTTGT 345 TTGGGATGTT CGCTTGTACA GGGGCAGCCT TGTCAGCTGC GTACATCATT GTGTATTGTG 4163 ||||||||| | ||||||| | ||||||| |||| | || | |||||||| |||||||||| TTGGGATGTC CATTTGTACA AGTGCAGCCT TGTCGGTTGT GAACATCATT GTGTATTGTG 405 TAGTGGCAGC CTTATCGGCA TACGTATGTT ATTATGCTTT GAATAGTGGC GGCCTTGTCG 4103 |||||||||| || ||||| | |||||| | |||||| ||| | |||||||| |||||||||| TAGTGGCAGC CTCGTCGGC- TGCGTATGCT ATTATGTTTT GGATAGTGGC GGCCTTGTCG 464 GCTCGCGTAT GTTGTTATGG TTGAATGGTT ATGACTCCTT ATGAGACAGG TCCTCTTATA 4043 |||||| ||| ||||||| | || ||||||| ||||||| || |||||| || ||| ||| || GCTCGCATAT GTTGTTACGA TTTAATGGTT ATGACTCTTT ATGAGATAGA TCCACTT-TA 523 TATATATGAC GTTGGGGTTG GCT 4020 ||||| | | || | || TATATGAAAT GAATGGACTA ACT 546 hqPGS_C06HBa0057J04.1-26-_SGN-E305738+ (5840 5782,5232 5186,4459 4020) ******************************************************************************** EST sequence 5 -strand 542 n (File: SGN-E374134-) 1 CAGCCATGGA AATGGAGAAA ACCAGCCATT AAACTCTTGG ACAGCAGCTG CAAAGAAATT 61 GATTTATACC TTGAGCAGTA AATATTGGAC GTACGGCTCG ACTATTCGGT GTAGACGCTC 121 AGTTCGGTGA TCCTCCCGCC TAGGATATCT ACTCTGCTTT TTGGGAGAGC TCCACTGTTC 181 CGGAGCCCAG TCGTTTTGGT ACATAACTTC TTATGTAGTC TTTTGCTTGT CTATGGGTAT 241 GGCGGGGCCC TGTCCCGTCA AGTTTCACTA CTATACTCTT AGAGGTCTGT AGACATCGTG 301 TGGGTTGTAT AATTATGTTT TGGATAATGG TCTGGACATG GTTTGTTTGG GATGTCCATT 361 TGTACAAGTG CAGCCTTGTC GGTTGTGAAC ATCATTGTGT ATTGTGTAGT GGCAGCCTCG 421 TCGGCTGCGT ATGCTATTAT GTTTTGGATA GTGGCGGCCT TGTCGGCTCG CATATGTTGT 481 TACGATTTAA TGGTTATGAC TCTTTATGAG ATAGATCCAC TTTATATATA AAAAAAAAAA 541 AA Predicted gene structure (within gDNA segment 6248 to 2311): Exon 1 5841 5782 ( 60 n); cDNA 1 61 ( 61 n); score: 0.825 Intron 1 5781 5233 ( 549 n); Pd: 0.993 (s: 0.79), Pa: 0.868 (s: 0.89) Exon 2 5232 5186 ( 47 n); cDNA 62 108 ( 47 n); score: 0.894 Intron 2 5185 4460 ( 726 n); Pd: 0.976 (s: 0.89), Pa: 0.000 (s: 0.96) Exon 3 4459 4037 ( 423 n); cDNA 109 530 ( 422 n); score: 0.895 PPA cDNA 531 542 MATCH C06HBa0057J04.1-26- SGN-E374134- 0.886 530 0.978 C PGS_C06HBa0057J04.1-26-_SGN-E374134- (5841 5782,5232 5186,4459 4037) Alignment (genomic DNA sequence = upper lines): CAGCCATGGA AATGGAG-AA ACCAACCCTG CAACTTTTGG CCAGCAGCTG CAAATAATTT 5783 |||||||||| ||||||| || |||| || | |||| |||| ||||||||| |||| || || CAGCCATGGA AATGGAGAAA ACCAGCCATT AAACTCTTGG ACAGCAGCTG CAAAGAAATT 60 GGTTAGTAAT CTCCTTGTTT GGTGTGTTAA TTCTTTAGAA TACCCTTGTT AATTATCCAT 5723 | G......... .......... .......... .......... .......... .......... 61 TAATTTTAAG TAGGGGGCGT GACCAGTAGC TTAGGAAGTT TGTTTTAGTT ATTGAATGTG 5663 .......... .......... .......... .......... .......... .......... 61 CTAAGCATGA ATGGAAACCA TAATTGGATT ATTAGTGGTG TCGTGTTGGT GCTTGGGCTG 5603 .......... .......... .......... .......... .......... .......... 61 TTTTGATTAA AGCAAACTGC AGGAAAATTC TATTTTGGCA TTATGTATAT GTTGAATGTG 5543 .......... .......... .......... .......... .......... .......... 61 ATTATGAGTA TATACTCCAA AGGATGAATA CGATAAGGTA GATGTGTTGG GAATTATAAA 5483 .......... .......... .......... .......... .......... .......... 61 ACGAGTTATC ACTCGGTGTG TCGTTGCTAT GGTTGCCGAG ACGGAACTAT TTTGGAGAGG 5423 .......... .......... .......... .......... .......... .......... 61 GGGCTGTTTA ATATGATTCT TTGGGTTATA TGTGTTATTG GTATTGCTGT GGATAATTTG 5363 .......... .......... .......... .......... .......... .......... 61 GATTGTTGTC GGATTGGGTC GAAGTAAGGA TGGGGAGGTG CTGCCGAATT TTTGTTAGAT 5303 .......... .......... .......... .......... .......... .......... 61 TATTAGCTAG CTTACAAGAA AGTAAAGCAC GATGTTTATC TAATTGCGGC ACGATTGTTG 5243 .......... .......... .......... .......... .......... .......... 61 CTTGTTATAG ATTAATAGCT TGAGCAGTAA ATATTGGACG TGCGGCTCGA TTATACGGTA 5183 ||| ||| || |||||||||| |||||||||| | |||||||| ||| || .......... ATTTATACCT TGAGCAGTAA ATATTGGACG TACGGCTCGA CTATTCG... 108 TGTAACGCTG TCCCTTCTTT CATTGGTTGG CGTGACTTTT AAAAATAAGC GAATAACGGA 5123 .......... .......... .......... .......... .......... .......... 108 TAGATTTGAT ACTTACCTCT AGAGCGTCTA GGTGACGTAT ATTCTTGCTT CCACAATTAT 5063 .......... .......... .......... .......... .......... .......... 108 TCCTCTATAT ATCGGCTATG TCTAAGGCTA TGATGATCTC TAATATCTAT GGTAATGCTT 5003 .......... .......... .......... .......... .......... .......... 108 CTTAGAGTCA TTGAGATTTT ACGTTTCCAT ATCGTATTAA AGGTTCATAA TCTTGATAAA 4943 .......... .......... .......... .......... .......... .......... 108 ACATTAATCA TTGGTAATAC TCCTTGCTGG TTCACGTTGA TTGTTCTATT GAGTTATAAG 4883 .......... .......... .......... .......... .......... .......... 108 AAATGATTTT AATTGCATAT GGTTGCTCAT AATATTCTGC TCGTGCATAG AGTCATTTAT 4823 .......... .......... .......... .......... .......... .......... 108 CATTTCACCG AGTCCCAGGT CGGGTAATGT TCGTGCGGAG TTTCTTGCAT ATGTCACCGA 4763 .......... .......... .......... .......... .......... .......... 108 GTTCCTCACT AGAGGGCCTG GTATGTATAT TATATATATA TGATTGGTGA TGAGGATGGT 4703 .......... .......... .......... .......... .......... .......... 108 TATGATGATG ATGATGACGG AGATGACGTG ATGATTATTT TGCCGAGCCC CATACTAGGG 4643 .......... .......... .......... .......... .......... .......... 108 AAGCTGGGCA CCTTAAATGT TAAATATATG CATGATTTTC ACTTAAAAGG GTATATGTGT 4583 .......... .......... .......... .......... .......... .......... 108 AGCGATATTT TGTTTCGACT TGCCATATTG GTATCCTGGC ATCTTTACCT TATGCTTTAC 4523 .......... .......... .......... .......... .......... .......... 108 ATACTCAGTA CATTGTCCGT ACTGACCCCG CTTTCCTCGG GGGGCTGCGT TTCATGCCTG 4463 .......... .......... .......... .......... .......... .......... 108 CAGGTGTAGA CACACAGTTC GGTGATCCTC CCGCCTAGGA TATCTACTCT GCTGATTGGG 4403 ||||||| | | |||||| |||||||||| |||||||||| |||||||||| ||| ||||| ...GTGTAGA CGCTCAGTTC GGTGATCCTC CCGCCTAGGA TATCTACTCT GCTTTTTGGG 165 AGAGCTCCAC TGTTCCGGAG CCCTGTCGTT TTGGTACATA ACTT-TTGTG TAGTCTTTTG 4344 |||||||||| |||||||||| ||| |||||| |||||||||| |||| || || |||||||||| AGAGCTCCAC TGTTCCGGAG CCCAGTCGTT TTGGTACATA ACTTCTTATG TAGTCTTTTG 225 CTCGTCTATG GGTATGGCGG GGCCCTGTCC CGTCGAGTTT CACTAATGTA CTCTTAGAGG 4284 || ||||||| |||||||||| |||||||||| |||| ||||| ||||| | || |||||||||| CTTGTCTATG GGTATGGCGG GGCCCTGTCC CGTCAAGTTT CACTACTATA CTCTTAGAGG 285 TCTTTGGACA TTATGTGGGT TATATATATA TGTTTTGGAT AATGGTCTGG ACATGGTTTG 4224 ||| | |||| | ||||||| | |||| || |||||||||| |||||||||| |||||||||| TCTGTAGACA TCGTGTGGGT TGTATAATTA TGTTTTGGAT AATGGTCTGG ACATGGTTTG 345 TTTGGGATGT TCGCTTGTAC AGGGGCAGCC TTGTCAGCTG CGTACATCAT TGTGTATTGT 4164 |||||||||| | |||||| | | |||||| ||||| | || | ||||||| |||||||||| TTTGGGATGT CCATTTGTAC AAGTGCAGCC TTGTCGGTTG TGAACATCAT TGTGTATTGT 405 GTAGTGGCAG CCTTATCGGC ATACGTATGT TATTATGCTT TGAATAGTGG CGGCCTTGTC 4104 |||||||||| ||| ||||| | |||||| ||||||| || || ||||||| |||||||||| GTAGTGGCAG CCTCGTCGGC -TGCGTATGC TATTATGTTT TGGATAGTGG CGGCCTTGTC 464 GGCTCGCGTA TGTTGTTATG GTTGAATGGT TATGACTCCT TATGAGACAG GTCCTCTTAT 4044 ||||||| || |||||||| | || |||||| |||||||| | ||||||| || ||| ||| | GGCTCGCATA TGTTGTTACG ATTTAATGGT TATGACTCTT TATGAGATAG ATCCACTT-T 523 ATATATA 4037 ||||||| ATATATA 530 hqPGS_C06HBa0057J04.1-26-_SGN-E374134- (5841 5782,5232 5186,4459 4037) ******************************************************************************** EST sequence 16 +strand 542 n (File: SGN-E374135+) 1 AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 61 ATTTATACCT TGAGCAGTAA ATATTGGACG TACGGCTCGA CTATTCGGTG TAGACGCTCA 121 GTTCGGTGAT CCTCCCGCCT AGGATATCTA CTCTGCTTTT TGGGAGAGCT CCACTGTTCC 181 GGAGCCCAGT CGTTTTGGTA CATAACTTCT TATGTAGTCT TTTGCTTGTC TATGGGTATG 241 GCGGGGCCCT GTCCCGTCAA GTTTCACTAC TATACTCTTA GAGGTCTGTA GACATCGTGT 301 GGGTTGTATA ATTATGTTTT GGATAATGGT CTGGACATGG TTTGTTTGGG ATGTCCATTT 361 GTACAAGTGC AGCCTTGTCG GTTGTGAACA TCATTGTGTA TTGTGTAGTG GCAGCCTCGT 421 CGGCTGCGTA TGCTATTATG TTTTGGATAG TGGCGGCCTT GTCGGCTCGC ATATGTTGTT 481 ACGATTTAAT GGTTATGACT CTTTATGAGA TAGATCCACT TTATATATAA AAAAAAAAAA 541 AA Predicted gene structure (within gDNA segment 6228 to 2291): Exon 1 5840 5782 ( 59 n); cDNA 1 60 ( 60 n); score: 0.822 Intron 1 5781 5233 ( 549 n); Pd: 0.993 (s: 0.79), Pa: 0.868 (s: 0.89) Exon 2 5232 5186 ( 47 n); cDNA 61 107 ( 47 n); score: 0.894 Intron 2 5185 4460 ( 726 n); Pd: 0.976 (s: 0.89), Pa: 0.000 (s: 0.96) Exon 3 4459 4037 ( 423 n); cDNA 108 529 ( 422 n); score: 0.895 PPA cDNA 530 542 MATCH C06HBa0057J04.1-26- SGN-E374135+ 0.886 529 0.976 C PGS_C06HBa0057J04.1-26-_SGN-E374135+ (5840 5782,5232 5186,4459 4037) Alignment (genomic DNA sequence = upper lines): AGCCATGGAA ATGGAGA-AA CCAACCCTGC AACTTTTGGC CAGCAGCTGC AAATAATTTG 5782 |||||||||| ||||||| || ||| || | |||| |||| |||||||||| ||| || ||| AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 60 GTTAGTAATC TCCTTGTTTG GTGTGTTAAT TCTTTAGAAT ACCCTTGTTA ATTATCCATT 5722 .......... .......... .......... .......... .......... .......... 60 AATTTTAAGT AGGGGGCGTG ACCAGTAGCT TAGGAAGTTT GTTTTAGTTA TTGAATGTGC 5662 .......... .......... .......... .......... .......... .......... 60 TAAGCATGAA TGGAAACCAT AATTGGATTA TTAGTGGTGT CGTGTTGGTG CTTGGGCTGT 5602 .......... .......... .......... .......... .......... .......... 60 TTTGATTAAA GCAAACTGCA GGAAAATTCT ATTTTGGCAT TATGTATATG TTGAATGTGA 5542 .......... .......... .......... .......... .......... .......... 60 TTATGAGTAT ATACTCCAAA GGATGAATAC GATAAGGTAG ATGTGTTGGG AATTATAAAA 5482 .......... .......... .......... .......... .......... .......... 60 CGAGTTATCA CTCGGTGTGT CGTTGCTATG GTTGCCGAGA CGGAACTATT TTGGAGAGGG 5422 .......... .......... .......... .......... .......... .......... 60 GGCTGTTTAA TATGATTCTT TGGGTTATAT GTGTTATTGG TATTGCTGTG GATAATTTGG 5362 .......... .......... .......... .......... .......... .......... 60 ATTGTTGTCG GATTGGGTCG AAGTAAGGAT GGGGAGGTGC TGCCGAATTT TTGTTAGATT 5302 .......... .......... .......... .......... .......... .......... 60 ATTAGCTAGC TTACAAGAAA GTAAAGCACG ATGTTTATCT AATTGCGGCA CGATTGTTGC 5242 .......... .......... .......... .......... .......... .......... 60 TTGTTATAGA TTAATAGCTT GAGCAGTAAA TATTGGACGT GCGGCTCGAT TATACGGTAT 5182 | || ||| ||| |||||||||| |||||||||| |||||||| ||| || .........A TTTATACCTT GAGCAGTAAA TATTGGACGT ACGGCTCGAC TATTCG.... 107 GTAACGCTGT CCCTTCTTTC ATTGGTTGGC GTGACTTTTA AAAATAAGCG AATAACGGAT 5122 .......... .......... .......... .......... .......... .......... 107 AGATTTGATA CTTACCTCTA GAGCGTCTAG GTGACGTATA TTCTTGCTTC CACAATTATT 5062 .......... .......... .......... .......... .......... .......... 107 CCTCTATATA TCGGCTATGT CTAAGGCTAT GATGATCTCT AATATCTATG GTAATGCTTC 5002 .......... .......... .......... .......... .......... .......... 107 TTAGAGTCAT TGAGATTTTA CGTTTCCATA TCGTATTAAA GGTTCATAAT CTTGATAAAA 4942 .......... .......... .......... .......... .......... .......... 107 CATTAATCAT TGGTAATACT CCTTGCTGGT TCACGTTGAT TGTTCTATTG AGTTATAAGA 4882 .......... .......... .......... .......... .......... .......... 107 AATGATTTTA ATTGCATATG GTTGCTCATA ATATTCTGCT CGTGCATAGA GTCATTTATC 4822 .......... .......... .......... .......... .......... .......... 107 ATTTCACCGA GTCCCAGGTC GGGTAATGTT CGTGCGGAGT TTCTTGCATA TGTCACCGAG 4762 .......... .......... .......... .......... .......... .......... 107 TTCCTCACTA GAGGGCCTGG TATGTATATT ATATATATAT GATTGGTGAT GAGGATGGTT 4702 .......... .......... .......... .......... .......... .......... 107 ATGATGATGA TGATGACGGA GATGACGTGA TGATTATTTT GCCGAGCCCC ATACTAGGGA 4642 .......... .......... .......... .......... .......... .......... 107 AGCTGGGCAC CTTAAATGTT AAATATATGC ATGATTTTCA CTTAAAAGGG TATATGTGTA 4582 .......... .......... .......... .......... .......... .......... 107 GCGATATTTT GTTTCGACTT GCCATATTGG TATCCTGGCA TCTTTACCTT ATGCTTTACA 4522 .......... .......... .......... .......... .......... .......... 107 TACTCAGTAC ATTGTCCGTA CTGACCCCGC TTTCCTCGGG GGGCTGCGTT TCATGCCTGC 4462 .......... .......... .......... .......... .......... .......... 107 AGGTGTAGAC ACACAGTTCG GTGATCCTCC CGCCTAGGAT ATCTACTCTG CTGATTGGGA 4402 |||||||| | ||||||| |||||||||| |||||||||| |||||||||| || |||||| ..GTGTAGAC GCTCAGTTCG GTGATCCTCC CGCCTAGGAT ATCTACTCTG CTTTTTGGGA 165 GAGCTCCACT GTTCCGGAGC CCTGTCGTTT TGGTACATAA CTT-TTGTGT AGTCTTTTGC 4343 |||||||||| |||||||||| || ||||||| |||||||||| ||| || ||| |||||||||| GAGCTCCACT GTTCCGGAGC CCAGTCGTTT TGGTACATAA CTTCTTATGT AGTCTTTTGC 225 TCGTCTATGG GTATGGCGGG GCCCTGTCCC GTCGAGTTTC ACTAATGTAC TCTTAGAGGT 4283 | |||||||| |||||||||| |||||||||| ||| |||||| |||| | ||| |||||||||| TTGTCTATGG GTATGGCGGG GCCCTGTCCC GTCAAGTTTC ACTACTATAC TCTTAGAGGT 285 CTTTGGACAT TATGTGGGTT ATATATATAT GTTTTGGATA ATGGTCTGGA CATGGTTTGT 4223 || | ||||| |||||||| |||| ||| |||||||||| |||||||||| |||||||||| CTGTAGACAT CGTGTGGGTT GTATAATTAT GTTTTGGATA ATGGTCTGGA CATGGTTTGT 345 TTGGGATGTT CGCTTGTACA GGGGCAGCCT TGTCAGCTGC GTACATCATT GTGTATTGTG 4163 ||||||||| | ||||||| | ||||||| |||| | || | |||||||| |||||||||| TTGGGATGTC CATTTGTACA AGTGCAGCCT TGTCGGTTGT GAACATCATT GTGTATTGTG 405 TAGTGGCAGC CTTATCGGCA TACGTATGTT ATTATGCTTT GAATAGTGGC GGCCTTGTCG 4103 |||||||||| || ||||| | |||||| | |||||| ||| | |||||||| |||||||||| TAGTGGCAGC CTCGTCGGC- TGCGTATGCT ATTATGTTTT GGATAGTGGC GGCCTTGTCG 464 GCTCGCGTAT GTTGTTATGG TTGAATGGTT ATGACTCCTT ATGAGACAGG TCCTCTTATA 4043 |||||| ||| ||||||| | || ||||||| ||||||| || |||||| || ||| ||| || GCTCGCATAT GTTGTTACGA TTTAATGGTT ATGACTCTTT ATGAGATAGA TCCACTT-TA 523 TATATA 4037 |||||| TATATA 529 hqPGS_C06HBa0057J04.1-26-_SGN-E374135+ (5840 5782,5232 5186,4459 4037) ******************************************************************************** EST sequence 6 -strand 586 n (File: SGN-E543103-) 1 GGCAGCCATG GAAATGGAGA AAATCAACCA TGCAACTCTT GACCAGCAGC TGCAAAGAAT 61 TTGGTTTGAT TAATAGCTTG AGCAGTAAAT ATTGGACGTG CGGCTCGACT ATACGGTGTA 121 GACGCTCAGT TCGGTGATCC TCCCGCCTAG GATATCTACT TTGCTGAGTG GGAGAGCTCC 181 ACTGTTTCGT AGCCCAGTCA TTTTGGTACA TAACTTTTTG TGTAGTCTTT TGCTTGTCTA 241 TGGGTATGGT GGGGCCCTGT CCCGTCGAGT TTCACTACTA TACTCTTAGA GGTCCATAGA 301 CATCGCGTGG GTTGTATATA TATGTTTTGG ATAATGGTCT GGACATGGTT TGTTTGGGAT 361 GTCCACTTGT ACAAGGGCAG CCTTGTCAGC TGCGTACATC TTTGTGTATT GTGTAGTGGC 421 AGCCTTGTCG GCTGCGTATG CTATTATGCT TTGGATAGTG GCGGCCTTGT CGGCTCGCGT 481 ATGTTGTTAC GGTTGAATGG TTATGACTCC TTATGAGACA GATCCACTTT ATATATATAT 541 ATATGGCGTT GGGGTTGGCG TGATTGATAA AAAAAAGGGG GGCCCG Predicted gene structure (within gDNA segment 6513 to 2804): Exon 1 5843 5775 ( 69 n); cDNA 1 68 ( 68 n); score: 0.848 Intron 1 5774 5233 ( 542 n); Pd: 0.900 (s: 0.84), Pa: 0.868 (s: 0.98) Exon 2 5232 5186 ( 47 n); cDNA 69 115 ( 47 n); score: 0.979 Intron 2 5185 4460 ( 726 n); Pd: 0.976 (s: 0.98), Pa: 0.000 (s: 0.94) Exon 3 4459 4038 ( 422 n); cDNA 116 536 ( 421 n); score: 0.918 MATCH C06HBa0057J04.1-26- SGN-E543103- 0.908 538 0.918 C PGS_C06HBa0057J04.1-26-_SGN-E543103- (5843 5775,5232 5186,4459 4038) Alignment (genomic DNA sequence = upper lines): GGCAGCCATG GAAATGGAG- AAACCAACCC TGCAACTTTT GGCCAGCAGC TGCAAATAAT 5785 |||||||||| ||||||||| ||| ||||| ||||||| || | |||||||| |||||| ||| GGCAGCCATG GAAATGGAGA AAATCAACCA TGCAACTCTT GACCAGCAGC TGCAAAGAAT 60 TTGGTTAGTA ATCTCCTTGT TTGGTGTGTT AATTCTTTAG AATACCCTTG TTAATTATCC 5725 |||||| | TTGGTT--TG .......... .......... .......... .......... .......... 68 ATTAATTTTA AGTAGGGGGC GTGACCAGTA GCTTAGGAAG TTTGTTTTAG TTATTGAATG 5665 .......... .......... .......... .......... .......... .......... 68 TGCTAAGCAT GAATGGAAAC CATAATTGGA TTATTAGTGG TGTCGTGTTG GTGCTTGGGC 5605 .......... .......... .......... .......... .......... .......... 68 TGTTTTGATT AAAGCAAACT GCAGGAAAAT TCTATTTTGG CATTATGTAT ATGTTGAATG 5545 .......... .......... .......... .......... .......... .......... 68 TGATTATGAG TATATACTCC AAAGGATGAA TACGATAAGG TAGATGTGTT GGGAATTATA 5485 .......... .......... .......... .......... .......... .......... 68 AAACGAGTTA TCACTCGGTG TGTCGTTGCT ATGGTTGCCG AGACGGAACT ATTTTGGAGA 5425 .......... .......... .......... .......... .......... .......... 68 GGGGGCTGTT TAATATGATT CTTTGGGTTA TATGTGTTAT TGGTATTGCT GTGGATAATT 5365 .......... .......... .......... .......... .......... .......... 68 TGGATTGTTG TCGGATTGGG TCGAAGTAAG GATGGGGAGG TGCTGCCGAA TTTTTGTTAG 5305 .......... .......... .......... .......... .......... .......... 68 ATTATTAGCT AGCTTACAAG AAAGTAAAGC ACGATGTTTA TCTAATTGCG GCACGATTGT 5245 .......... .......... .......... .......... .......... .......... 68 TGCTTGTTAT AGATTAATAG CTTGAGCAGT AAATATTGGA CGTGCGGCTC GATTATACGG 5185 |||||||| |||||||||| |||||||||| |||||||||| || |||||| .......... ..ATTAATAG CTTGAGCAGT AAATATTGGA CGTGCGGCTC GACTATACG. 115 TATGTAACGC TGTCCCTTCT TTCATTGGTT GGCGTGACTT TTAAAAATAA GCGAATAACG 5125 .......... .......... .......... .......... .......... .......... 115 GATAGATTTG ATACTTACCT CTAGAGCGTC TAGGTGACGT ATATTCTTGC TTCCACAATT 5065 .......... .......... .......... .......... .......... .......... 115 ATTCCTCTAT ATATCGGCTA TGTCTAAGGC TATGATGATC TCTAATATCT ATGGTAATGC 5005 .......... .......... .......... .......... .......... .......... 115 TTCTTAGAGT CATTGAGATT TTACGTTTCC ATATCGTATT AAAGGTTCAT AATCTTGATA 4945 .......... .......... .......... .......... .......... .......... 115 AAACATTAAT CATTGGTAAT ACTCCTTGCT GGTTCACGTT GATTGTTCTA TTGAGTTATA 4885 .......... .......... .......... .......... .......... .......... 115 AGAAATGATT TTAATTGCAT ATGGTTGCTC ATAATATTCT GCTCGTGCAT AGAGTCATTT 4825 .......... .......... .......... .......... .......... .......... 115 ATCATTTCAC CGAGTCCCAG GTCGGGTAAT GTTCGTGCGG AGTTTCTTGC ATATGTCACC 4765 .......... .......... .......... .......... .......... .......... 115 GAGTTCCTCA CTAGAGGGCC TGGTATGTAT ATTATATATA TATGATTGGT GATGAGGATG 4705 .......... .......... .......... .......... .......... .......... 115 GTTATGATGA TGATGATGAC GGAGATGACG TGATGATTAT TTTGCCGAGC CCCATACTAG 4645 .......... .......... .......... .......... .......... .......... 115 GGAAGCTGGG CACCTTAAAT GTTAAATATA TGCATGATTT TCACTTAAAA GGGTATATGT 4585 .......... .......... .......... .......... .......... .......... 115 GTAGCGATAT TTTGTTTCGA CTTGCCATAT TGGTATCCTG GCATCTTTAC CTTATGCTTT 4525 .......... .......... .......... .......... .......... .......... 115 ACATACTCAG TACATTGTCC GTACTGACCC CGCTTTCCTC GGGGGGCTGC GTTTCATGCC 4465 .......... .......... .......... .......... .......... .......... 115 TGCAGGTGTA GACACACAGT TCGGTGATCC TCCCGCCTAG GATATCTACT CTGCTGATTG 4405 ||||| ||| | |||| |||||||||| |||||||||| |||||||||| |||||| || .....GTGTA GACGCTCAGT TCGGTGATCC TCCCGCCTAG GATATCTACT TTGCTGAGTG 170 GGAGAGCTCC ACTGTTCCGG AGCCCTGTCG TTTTGGTACA TAAC-TTTTG TGTAGTCTTT 4346 |||||||||| |||||| || ||||| ||| |||||||||| |||| ||||| |||||||||| GGAGAGCTCC ACTGTTTCGT AGCCCAGTCA TTTTGGTACA TAACTTTTTG TGTAGTCTTT 230 TGCTCGTCTA TGGGTATGGC GGGGCCCTGT CCCGTCGAGT TTCACTAATG TACTCTTAGA 4286 |||| ||||| ||||||||| |||||||||| |||||||||| ||||||| | |||||||||| TGCTTGTCTA TGGGTATGGT GGGGCCCTGT CCCGTCGAGT TTCACTACTA TACTCTTAGA 290 GGTCTTTGGA CATTATGTGG GTTATATATA TATGTTTTGG ATAATGGTCT GGACATGGTT 4226 |||| | || ||| |||| ||| |||||| |||||||||| |||||||||| |||||||||| GGTCCATAGA CATCGCGTGG GTTGTATATA TATGTTTTGG ATAATGGTCT GGACATGGTT 350 TGTTTGGGAT GTTCGCTTGT ACAGGGGCAG CCTTGTCAGC TGCGTACATC ATTGTGTATT 4166 |||||||||| || | ||||| ||| |||||| |||||||||| |||||||||| ||||||||| TGTTTGGGAT GTCCACTTGT ACAAGGGCAG CCTTGTCAGC TGCGTACATC TTTGTGTATT 410 GTGTAGTGGC AGCCTTATCG GCATACGTAT GTTATTATGC TTTGAATAGT GGCGGCCTTG 4106 |||||||||| |||||| ||| || | ||||| | |||||||| |||| ||||| |||||||||| GTGTAGTGGC AGCCTTGTCG GC-TGCGTAT GCTATTATGC TTTGGATAGT GGCGGCCTTG 469 TCGGCTCGCG TATGTTGTTA TGGTTGAATG GTTATGACTC CTTATGAGAC AGGTCCTCTT 4046 |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| || ||| ||| TCGGCTCGCG TATGTTGTTA CGGTTGAATG GTTATGACTC CTTATGAGAC AGATCCACTT 529 ATATATAT 4038 ||||||| -TATATAT 536 hqPGS_C06HBa0057J04.1-26-_SGN-E543103- (5843 5775,5232 5186,4459 4038) ******************************************************************************** EST sequence 12 +strand 644 n (File: SGN-E538156+) 1 GAGAAAACCA GCCATTAAAC TCTTGGACAG CAGCTGCAAA GAAATTGAGT CATTTATCAT 61 TTCACCGAGT CCCGGGCCGG GTAATGTTCG TGCGGAGTTT CTTGCATATG TCACCGAGTC 121 CCTCACTAGA GGGCCGGGAA TGTATATTAT ATATATGATT GGTGATGAGG ATGGTTATGA 181 TGATGATGAT GACGGAGATG ATGTGATGAC TATTTCACTG AGTCCCTCAC TAGAGGGCCG 241 GGTGTAGACG CTCAGTTTGG TGATCCTCCC GCCTAGGATA TCTACTCTGC TGTTTGGGAG 301 AGCTCCACTG TTCCGGAGCC CAGTCGTTGT GGTACATAAC TTCTTATGTA GTCTTTTGCT 361 TGTCTATGGG TATGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC TTAGAGGTCT 421 GTAGACATCG TGTGGGTTGT ATAATTATGT TTTTGATAAT GGTCTGGACA TGGTTTGTTT 481 GGGATGTCCA CTTGTACAAG TGCAACCTTG TCGGTTGTGT ACATCTTTGT GTATTGTGTA 541 GTGGCAGCCT TGACGGCTGC GTATGCTATT ATGCTTTGAA TAGTGGCAGC CTTGTCGGCT 601 CGCGTATGTT GTTACGGTTG AATGGGTATG ACTCTTTATG AGAT Predicted gene structure (within gDNA segment 6172 to 3176): Exon 1 5824 5782 ( 43 n); cDNA 5 47 ( 43 n); score: 0.814 Intron 1 5781 4833 ( 949 n); Pd: 0.993 (s: 0.81), Pa: 0.932 (s: 0.96) Exon 2 4832 4636 ( 197 n); cDNA 48 241 ( 194 n); score: 0.904 Intron 2 4635 4460 ( 176 n); Pd: 0.000 (s: 0.76), Pa: 0.000 (s: 0.94) Exon 3 4459 4057 ( 403 n); cDNA 242 643 ( 402 n); score: 0.900 MATCH C06HBa0057J04.1-26- SGN-E538156+ 0.901 643 0.998 C PGS_C06HBa0057J04.1-26-_SGN-E538156+ (5824 5782,4832 4636,4459 4057) Alignment (genomic DNA sequence = upper lines): AAACCAACCC TGCAACTTTT GGCCAGCAGC TGCAAATAAT TTGGTTAGTA ATCTCCTTGT 5765 |||||| || | |||| || || ||||||| |||||| || ||| AAACCAGCCA TTAAACTCTT GGACAGCAGC TGCAAAGAAA TTG....... .......... 47 TTGGTGTGTT AATTCTTTAG AATACCCTTG TTAATTATCC ATTAATTTTA AGTAGGGGGC 5705 .......... .......... .......... .......... .......... .......... 47 GTGACCAGTA GCTTAGGAAG TTTGTTTTAG TTATTGAATG TGCTAAGCAT GAATGGAAAC 5645 .......... .......... .......... .......... .......... .......... 47 CATAATTGGA TTATTAGTGG TGTCGTGTTG GTGCTTGGGC TGTTTTGATT AAAGCAAACT 5585 .......... .......... .......... .......... .......... .......... 47 GCAGGAAAAT TCTATTTTGG CATTATGTAT ATGTTGAATG TGATTATGAG TATATACTCC 5525 .......... .......... .......... .......... .......... .......... 47 AAAGGATGAA TACGATAAGG TAGATGTGTT GGGAATTATA AAACGAGTTA TCACTCGGTG 5465 .......... .......... .......... .......... .......... .......... 47 TGTCGTTGCT ATGGTTGCCG AGACGGAACT ATTTTGGAGA GGGGGCTGTT TAATATGATT 5405 .......... .......... .......... .......... .......... .......... 47 CTTTGGGTTA TATGTGTTAT TGGTATTGCT GTGGATAATT TGGATTGTTG TCGGATTGGG 5345 .......... .......... .......... .......... .......... .......... 47 TCGAAGTAAG GATGGGGAGG TGCTGCCGAA TTTTTGTTAG ATTATTAGCT AGCTTACAAG 5285 .......... .......... .......... .......... .......... .......... 47 AAAGTAAAGC ACGATGTTTA TCTAATTGCG GCACGATTGT TGCTTGTTAT AGATTAATAG 5225 .......... .......... .......... .......... .......... .......... 47 CTTGAGCAGT AAATATTGGA CGTGCGGCTC GATTATACGG TATGTAACGC TGTCCCTTCT 5165 .......... .......... .......... .......... .......... .......... 47 TTCATTGGTT GGCGTGACTT TTAAAAATAA GCGAATAACG GATAGATTTG ATACTTACCT 5105 .......... .......... .......... .......... .......... .......... 47 CTAGAGCGTC TAGGTGACGT ATATTCTTGC TTCCACAATT ATTCCTCTAT ATATCGGCTA 5045 .......... .......... .......... .......... .......... .......... 47 TGTCTAAGGC TATGATGATC TCTAATATCT ATGGTAATGC TTCTTAGAGT CATTGAGATT 4985 .......... .......... .......... .......... .......... .......... 47 TTACGTTTCC ATATCGTATT AAAGGTTCAT AATCTTGATA AAACATTAAT CATTGGTAAT 4925 .......... .......... .......... .......... .......... .......... 47 ACTCCTTGCT GGTTCACGTT GATTGTTCTA TTGAGTTATA AGAAATGATT TTAATTGCAT 4865 .......... .......... .......... .......... .......... .......... 47 ATGGTTGCTC ATAATATTCT GCTCGTGCAT AGAGTCATTT ATCATTTCAC CGAGTCCCAG 4805 |||||||| |||||||||| |||||||| | .......... .......... .......... ..AGTCATTT ATCATTTCAC CGAGTCCCGG 75 GTCGGGTAAT GTTCGTGCGG AGTTTCTTGC ATATGTCACC GAGTTCCTCA CTAGAGGGCC 4745 | |||||||| |||||||||| |||||||||| |||||||||| |||| ||||| |||||||||| GCCGGGTAAT GTTCGTGCGG AGTTTCTTGC ATATGTCACC GAGTCCCTCA CTAGAGGGCC 135 TGGTATGTAT ATTATATATA TATGATTGGT GATGAGGATG GTTATGATGA TGATGATGAC 4685 || |||||| | | |||||| |||||||||| |||||||||| |||||||||| |||||||||| GGGAATGTAT A-T-TATATA TATGATTGGT GATGAGGATG GTTATGATGA TGATGATGAC 193 GGAGATGACG TGATGATTAT TTTGCCGAGC CCCATACTAG GGAAGCTGGG CACCTTAAAT 4625 |||||||| | |||||| ||| || | ||| ||| ||||| | || || GGAGATGATG TGATGACTAT TTCACTGAGT CCCTCACTAG AG-GGCCGG. .......... 241 GTTAAATATA TGCATGATTT TCACTTAAAA GGGTATATGT GTAGCGATAT TTTGTTTCGA 4565 .......... .......... .......... .......... .......... .......... 241 CTTGCCATAT TGGTATCCTG GCATCTTTAC CTTATGCTTT ACATACTCAG TACATTGTCC 4505 .......... .......... .......... .......... .......... .......... 241 GTACTGACCC CGCTTTCCTC GGGGGGCTGC GTTTCATGCC TGCAGGTGTA GACACACAGT 4445 ||||| ||| | |||| .......... .......... .......... .......... .....GTGTA GACGCTCAGT 256 TCGGTGATCC TCCCGCCTAG GATATCTACT CTGCTGATTG GGAGAGCTCC ACTGTTCCGG 4385 | |||||||| |||||||||| |||||||||| |||||| ||| |||||||||| |||||||||| TTGGTGATCC TCCCGCCTAG GATATCTACT CTGCTGTTTG GGAGAGCTCC ACTGTTCCGG 316 AGCCCTGTCG TTTTGGTACA TAACTT-TTG TGTAGTCTTT TGCTCGTCTA TGGGTATGGC 4326 ||||| |||| || ||||||| |||||| || |||||||||| |||| ||||| ||||||| || AGCCCAGTCG TTGTGGTACA TAACTTCTTA TGTAGTCTTT TGCTTGTCTA TGGGTAT-GC 375 GGGGCCCTGT CCCGTCGAGT TTCACTAATG TACTCTTAGA GGTCTTTGGA CATTATGTGG 4266 |||||||||| |||||| ||| ||||||| | |||||||||| ||||| | || ||| ||||| GGGGCCCTGT CCCGTCAAGT TTCACTACTA TACTCTTAGA GGTCTGTAGA CATCGTGTGG 435 GTTATATATA TATGTTTTGG ATAATGGTCT GGACATGGTT TGTTTGGGAT GTTCGCTTGT 4206 ||| |||| |||||||| | |||||||||| |||||||||| |||||||||| || | ||||| GTTGTATAAT TATGTTTTTG ATAATGGTCT GGACATGGTT TGTTTGGGAT GTCCACTTGT 495 ACAGGGGCAG CCTTGTCAGC TGCGTACATC ATTGTGTATT GTGTAGTGGC AGCCTTATCG 4146 ||| | ||| ||||||| | || ||||||| ||||||||| |||||||||| |||||| || ACAAGTGCAA CCTTGTCGGT TGTGTACATC TTTGTGTATT GTGTAGTGGC AGCCTTGACG 555 GCATACGTAT GTTATTATGC TTTGAATAGT GGCGGCCTTG TCGGCTCGCG TATGTTGTTA 4086 || | ||||| | |||||||| |||||||||| ||| |||||| |||||||||| |||||||||| GC-TGCGTAT GCTATTATGC TTTGAATAGT GGCAGCCTTG TCGGCTCGCG TATGTTGTTA 614 TGGTTGAATG GTTATGACTC CTTATGAGA 4057 ||||||||| | |||||||| |||||||| CGGTTGAATG GGTATGACTC TTTATGAGA 643 hqPGS_C06HBa0057J04.1-26-_SGN-E538156+ (5824 5782,4832 4636,4459 4057) ******************************************************************************** EST sequence 7 -strand 843 n (File: SGN-E544254-) 1 GAGTCATTTA TCATTTCACC GAGTCCCGGG CCGGGTAATG TTCGTGCGGA GTTTCTTGCA 61 TATGTCACCG AGTCCCTCAC TAGAGGGCCG GGAATGTATA TTATATATAT GATTGGTGAT 121 GAGGATGGTT ATGATGATGA TGATGACGGA GATGATGTGA TGACTATTTC ACCGAGTCCC 181 TCACTAGAGG GCCGGGTACT ATGATGTATA TATAATGATG ATTATTTTGC CGAGTCCCTT 241 ACTAGGGAAG TTAGGCATCT TATATGTTAA AGATATGCAT GATTTTCACT TAAAAAGTAC 301 ATGTGTAGAG ATATCTTGTT TCGACTTATC ATGTTGGTAT CCTGTCATCT TTACCTTATG 361 CTTTACATAC TCAGTACATT GTCCGTACTG ACCCCCTTTT CTCGGGGGGC TGCGTTTCAT 421 GCCCGCAGGT GTAGACGCTC AGTTCGGTGA TCCTCCCGCC TAGGATATCT ACTCTGCTTT 481 TTGGGAGAGC TCCACTGTTC CGGAGCCCAG TCGTTTTGGT ACATAACTTC TTATGTAGTC 541 TTTTGCTTGT CTATGGGTAT GGCGGGGCCC TGTCCCGTCA AGTTTCACTA CTATACTCTT 601 AGAGGTCTGT AGACATCGTG TGGGTTGTAT AATTATGTTT TGGATAATGG TCTGGACATG 661 GTTTGTTTGG GATGTCCATT TGTACAAGTG CAGCCTTGTC GGTTGTGAAC ATCATTGTGT 721 ATTGTGTAGT GGCAGCCTCG TCGGCTGCGT ATGCTATTAT GTTTTGGATA GTGGCGGCCT 781 TGTCGGCTCG CATATGTTGT TACGATTTAA TGGTTATGAC TCTTTATGAG AAAAAAAAAG 841 AAA Predicted gene structure (within gDNA segment 6322 to 2501): Exon 1 5520 5498 ( 23 n); cDNA 133 155 ( 23 n); score: 0.696 Intron 1 5497 4783 ( 715 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.76) Exon 2 4782 4723 ( 60 n); cDNA 156 214 ( 59 n); score: 0.783 Intron 2 4722 4676 ( 47 n); Pd: 0.000 (s: 0.84), Pa: 0.000 (s: 0.86) Exon 3 4675 4057 ( 619 n); cDNA 215 831 ( 617 n); score: 0.902 PPA cDNA 832 843 MATCH C06HBa0057J04.1-26- SGN-E544254- 0.892 702 0.833 C PGS_C06HBa0057J04.1-26-_SGN-E544254- (5520 5498,4782 4723,4675 4057) Alignment (genomic DNA sequence = upper lines): GATGAATACG ATAAGGTAGA TGTGTTGGGA ATTATAAAAC GAGTTATCAC TCGGTGTGTC 5461 ||||| | | || | | ||| || GATGATGATG ATGACGGAGA TGA....... .......... .......... .......... 155 GTTGCTATGG TTGCCGAGAC GGAACTATTT TGGAGAGGGG GCTGTTTAAT ATGATTCTTT 5401 .......... .......... .......... .......... .......... .......... 155 GGGTTATATG TGTTATTGGT ATTGCTGTGG ATAATTTGGA TTGTTGTCGG ATTGGGTCGA 5341 .......... .......... .......... .......... .......... .......... 155 AGTAAGGATG GGGAGGTGCT GCCGAATTTT TGTTAGATTA TTAGCTAGCT TACAAGAAAG 5281 .......... .......... .......... .......... .......... .......... 155 TAAAGCACGA TGTTTATCTA ATTGCGGCAC GATTGTTGCT TGTTATAGAT TAATAGCTTG 5221 .......... .......... .......... .......... .......... .......... 155 AGCAGTAAAT ATTGGACGTG CGGCTCGATT ATACGGTATG TAACGCTGTC CCTTCTTTCA 5161 .......... .......... .......... .......... .......... .......... 155 TTGGTTGGCG TGACTTTTAA AAATAAGCGA ATAACGGATA GATTTGATAC TTACCTCTAG 5101 .......... .......... .......... .......... .......... .......... 155 AGCGTCTAGG TGACGTATAT TCTTGCTTCC ACAATTATTC CTCTATATAT CGGCTATGTC 5041 .......... .......... .......... .......... .......... .......... 155 TAAGGCTATG ATGATCTCTA ATATCTATGG TAATGCTTCT TAGAGTCATT GAGATTTTAC 4981 .......... .......... .......... .......... .......... .......... 155 GTTTCCATAT CGTATTAAAG GTTCATAATC TTGATAAAAC ATTAATCATT GGTAATACTC 4921 .......... .......... .......... .......... .......... .......... 155 CTTGCTGGTT CACGTTGATT GTTCTATTGA GTTATAAGAA ATGATTTTAA TTGCATATGG 4861 .......... .......... .......... .......... .......... .......... 155 TTGCTCATAA TATTCTGCTC GTGCATAGAG TCATTTATCA TTTCACCGAG TCCCAGGTCG 4801 .......... .......... .......... .......... .......... .......... 155 GGTAATGTTC GTGCGGAGTT TCTTGCATAT GTCACCGAGT TCCTCACTAG AGGGCCTGGT 4741 | | || ||| ||||||||| ||||||||| |||||| ||| .......... ........TG TGATGACTAT TTCACCGAGT CCCTCACTAG AGGGCCGGGT 197 ATGTATATTA TATATATATG ATTGGTGATG AGGATGGTTA TGATGATGAT GATGACGGAG 4681 | ||| | |||||||| A-CTATGATG TATATATA.. .......... .......... .......... .......... 214 ATGACGTGAT GATTATTTTG CCGAGCCCCA TACTAGGGAA GCTGGGCACC TTAAATGTTA 4621 |||| |||||||||| ||||| ||| |||||||||| | | |||| | ||| |||||| .....ATGAT GATTATTTTG CCGAGTCCCT TACTAGGGAA GTTAGGCATC TTATATGTTA 269 AATATATGCA TGATTTTCAC TTAAAAGGGT ATATGTGTAG CGATATTTTG TTTCGACTTG 4561 || ||||||| |||||||||| |||||| || | |||||||| ||||| ||| ||||||||| AAGATATGCA TGATTTTCAC TTAAAA-AGT ACATGTGTAG AGATATCTTG TTTCGACTTA 328 CCATATTGGT ATCCTGGCAT CTTTACCTTA TGCTTTACAT ACTCAGTACA TTGTCCGTAC 4501 ||| ||||| |||||| ||| |||||||||| |||||||||| |||||||||| |||||||||| TCATGTTGGT ATCCTGTCAT CTTTACCTTA TGCTTTACAT ACTCAGTACA TTGTCCGTAC 388 TGACCCCGCT TTCCTCGGGG GGCTGCGTTT CATGCCTGCA GGTGTAGACA CACAGTTCGG 4441 ||||||| || || ||||||| |||||||||| |||||| ||| ||||||||| | |||||||| TGACCCC-CT TTTCTCGGGG GGCTGCGTTT CATGCCCGCA GGTGTAGACG CTCAGTTCGG 447 TGATCCTCCC GCCTAGGATA TCTACTCTGC TGATTGGGAG AGCTCCACTG TTCCGGAGCC 4381 |||||||||| |||||||||| |||||||||| | ||||||| |||||||||| |||||||||| TGATCCTCCC GCCTAGGATA TCTACTCTGC TTTTTGGGAG AGCTCCACTG TTCCGGAGCC 507 CTGTCGTTTT GGTACATAAC TT-TTGTGTA GTCTTTTGCT CGTCTATGGG TATGGCGGGG 4322 | |||||||| |||||||||| || || |||| |||||||||| ||||||||| |||||||||| CAGTCGTTTT GGTACATAAC TTCTTATGTA GTCTTTTGCT TGTCTATGGG TATGGCGGGG 567 CCCTGTCCCG TCGAGTTTCA CTAATGTACT CTTAGAGGTC TTTGGACATT ATGTGGGTTA 4262 |||||||||| || ||||||| ||| | |||| |||||||||| | | ||||| |||||||| CCCTGTCCCG TCAAGTTTCA CTACTATACT CTTAGAGGTC TGTAGACATC GTGTGGGTTG 627 TATATATATG TTTTGGATAA TGGTCTGGAC ATGGTTTGTT TGGGATGTTC GCTTGTACAG 4202 |||| |||| |||||||||| |||||||||| |||||||||| |||||||| | ||||||| TATAATTATG TTTTGGATAA TGGTCTGGAC ATGGTTTGTT TGGGATGTCC ATTTGTACAA 687 GGGCAGCCTT GTCAGCTGCG TACATCATTG TGTATTGTGT AGTGGCAGCC TTATCGGCAT 4142 | |||||||| ||| | || | ||||||||| |||||||||| |||||||||| | ||||| | GTGCAGCCTT GTCGGTTGTG AACATCATTG TGTATTGTGT AGTGGCAGCC TCGTCGGC-T 746 ACGTATGTTA TTATGCTTTG AATAGTGGCG GCCTTGTCGG CTCGCGTATG TTGTTATGGT 4082 |||||| || ||||| |||| ||||||||| |||||||||| ||||| |||| |||||| | | GCGTATGCTA TTATGTTTTG GATAGTGGCG GCCTTGTCGG CTCGCATATG TTGTTACGAT 806 TGAATGGTTA TGACTCCTTA TGAGA 4057 | |||||||| |||||| ||| ||||| TTAATGGTTA TGACTCTTTA TGAGA 831 hqPGS_C06HBa0057J04.1-26-_SGN-E544254- (4782 4723,4675 4057) ******************************************************************************** EST sequence 20 +strand 519 n (File: SGN-E310669+) 1 AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 61 ATTTATACCT TGAGCAGTAA ATATTGGACG TACGGCTCGA CTATTCGGTG TAGACGCTCA 121 GTTCGGTGAT CCTCCCGCCT AGGATATCTA CTCTGCTGTT TGGGAGAGCT CCACTGTTCC 181 GGAGCCCAGT CGTTTTGGTA CATAACTTCT TATGTAGTCT TTTGCTTGTC TATGGGTATG 241 GCGGGGCCCT GTCCCGTCAA GTTTCACTAC TATACTCTTA GAGGTCTGTA GACATCGTGT 301 GGGTTGTATA ATTATGTTTT GGATAATGGT CTGGACATGG TTTGTTTGGG ATGTCCATTT 361 GTACAAGTGC AGCCTTGTCG GTTGTGAACA TCATTGTGTA TTGTGTAGTG GCAGCCTCGT 421 CGGCTGCGTA TGCTATTATG TTTTGGATAG TGGCGGCCTT GTCGGCTCGC ATATGTTGTT 481 ACGATTTAAT GGTTATGACT CTTTATGAAA AAAAAAAAA Predicted gene structure (within gDNA segment 6228 to 2521): Exon 1 5840 5782 ( 59 n); cDNA 1 60 ( 60 n); score: 0.822 Intron 1 5781 5233 ( 549 n); Pd: 0.993 (s: 0.79), Pa: 0.868 (s: 0.89) Exon 2 5232 5186 ( 47 n); cDNA 61 107 ( 47 n); score: 0.894 Intron 2 5185 4460 ( 726 n); Pd: 0.976 (s: 0.89), Pa: 0.000 (s: 0.96) Exon 3 4459 4059 ( 401 n); cDNA 108 508 ( 401 n); score: 0.901 PPA cDNA 509 519 MATCH C06HBa0057J04.1-26- SGN-E310669+ 0.891 507 0.977 C PGS_C06HBa0057J04.1-26-_SGN-E310669+ (5840 5782,5232 5186,4459 4059) Alignment (genomic DNA sequence = upper lines): AGCCATGGAA ATGGAGA-AA CCAACCCTGC AACTTTTGGC CAGCAGCTGC AAATAATTTG 5782 |||||||||| ||||||| || ||| || | |||| |||| |||||||||| ||| || ||| AGCCATGGAA ATGGAGAAAA CCAGCCATTA AACTCTTGGA CAGCAGCTGC AAAGAAATTG 60 GTTAGTAATC TCCTTGTTTG GTGTGTTAAT TCTTTAGAAT ACCCTTGTTA ATTATCCATT 5722 .......... .......... .......... .......... .......... .......... 60 AATTTTAAGT AGGGGGCGTG ACCAGTAGCT TAGGAAGTTT GTTTTAGTTA TTGAATGTGC 5662 .......... .......... .......... .......... .......... .......... 60 TAAGCATGAA TGGAAACCAT AATTGGATTA TTAGTGGTGT CGTGTTGGTG CTTGGGCTGT 5602 .......... .......... .......... .......... .......... .......... 60 TTTGATTAAA GCAAACTGCA GGAAAATTCT ATTTTGGCAT TATGTATATG TTGAATGTGA 5542 .......... .......... .......... .......... .......... .......... 60 TTATGAGTAT ATACTCCAAA GGATGAATAC GATAAGGTAG ATGTGTTGGG AATTATAAAA 5482 .......... .......... .......... .......... .......... .......... 60 CGAGTTATCA CTCGGTGTGT CGTTGCTATG GTTGCCGAGA CGGAACTATT TTGGAGAGGG 5422 .......... .......... .......... .......... .......... .......... 60 GGCTGTTTAA TATGATTCTT TGGGTTATAT GTGTTATTGG TATTGCTGTG GATAATTTGG 5362 .......... .......... .......... .......... .......... .......... 60 ATTGTTGTCG GATTGGGTCG AAGTAAGGAT GGGGAGGTGC TGCCGAATTT TTGTTAGATT 5302 .......... .......... .......... .......... .......... .......... 60 ATTAGCTAGC TTACAAGAAA GTAAAGCACG ATGTTTATCT AATTGCGGCA CGATTGTTGC 5242 .......... .......... .......... .......... .......... .......... 60 TTGTTATAGA TTAATAGCTT GAGCAGTAAA TATTGGACGT GCGGCTCGAT TATACGGTAT 5182 | || ||| ||| |||||||||| |||||||||| |||||||| ||| || .........A TTTATACCTT GAGCAGTAAA TATTGGACGT ACGGCTCGAC TATTCG.... 107 GTAACGCTGT CCCTTCTTTC ATTGGTTGGC GTGACTTTTA AAAATAAGCG AATAACGGAT 5122 .......... .......... .......... .......... .......... .......... 107 AGATTTGATA CTTACCTCTA GAGCGTCTAG GTGACGTATA TTCTTGCTTC CACAATTATT 5062 .......... .......... .......... .......... .......... .......... 107 CCTCTATATA TCGGCTATGT CTAAGGCTAT GATGATCTCT AATATCTATG GTAATGCTTC 5002 .......... .......... .......... .......... .......... .......... 107 TTAGAGTCAT TGAGATTTTA CGTTTCCATA TCGTATTAAA GGTTCATAAT CTTGATAAAA 4942 .......... .......... .......... .......... .......... .......... 107 CATTAATCAT TGGTAATACT CCTTGCTGGT TCACGTTGAT TGTTCTATTG AGTTATAAGA 4882 .......... .......... .......... .......... .......... .......... 107 AATGATTTTA ATTGCATATG GTTGCTCATA ATATTCTGCT CGTGCATAGA GTCATTTATC 4822 .......... .......... .......... .......... .......... .......... 107 ATTTCACCGA GTCCCAGGTC GGGTAATGTT CGTGCGGAGT TTCTTGCATA TGTCACCGAG 4762 .......... .......... .......... .......... .......... .......... 107 TTCCTCACTA GAGGGCCTGG TATGTATATT ATATATATAT GATTGGTGAT GAGGATGGTT 4702 .......... .......... .......... .......... .......... .......... 107 ATGATGATGA TGATGACGGA GATGACGTGA TGATTATTTT GCCGAGCCCC ATACTAGGGA 4642 .......... .......... .......... .......... .......... .......... 107 AGCTGGGCAC CTTAAATGTT AAATATATGC ATGATTTTCA CTTAAAAGGG TATATGTGTA 4582 .......... .......... .......... .......... .......... .......... 107 GCGATATTTT GTTTCGACTT GCCATATTGG TATCCTGGCA TCTTTACCTT ATGCTTTACA 4522 .......... .......... .......... .......... .......... .......... 107 TACTCAGTAC ATTGTCCGTA CTGACCCCGC TTTCCTCGGG GGGCTGCGTT TCATGCCTGC 4462 .......... .......... .......... .......... .......... .......... 107 AGGTGTAGAC ACACAGTTCG GTGATCCTCC CGCCTAGGAT ATCTACTCTG CTGATTGGGA 4402 |||||||| | ||||||| |||||||||| |||||||||| |||||||||| ||| |||||| ..GTGTAGAC GCTCAGTTCG GTGATCCTCC CGCCTAGGAT ATCTACTCTG CTGTTTGGGA 165 GAGCTCCACT GTTCCGGAGC CCTGTCGTTT TGGTACATAA CTT-TTGTGT AGTCTTTTGC 4343 |||||||||| |||||||||| || ||||||| |||||||||| ||| || ||| |||||||||| GAGCTCCACT GTTCCGGAGC CCAGTCGTTT TGGTACATAA CTTCTTATGT AGTCTTTTGC 225 TCGTCTATGG GTATGGCGGG GCCCTGTCCC GTCGAGTTTC ACTAATGTAC TCTTAGAGGT 4283 | |||||||| |||||||||| |||||||||| ||| |||||| |||| | ||| |||||||||| TTGTCTATGG GTATGGCGGG GCCCTGTCCC GTCAAGTTTC ACTACTATAC TCTTAGAGGT 285 CTTTGGACAT TATGTGGGTT ATATATATAT GTTTTGGATA ATGGTCTGGA CATGGTTTGT 4223 || | ||||| |||||||| |||| ||| |||||||||| |||||||||| |||||||||| CTGTAGACAT CGTGTGGGTT GTATAATTAT GTTTTGGATA ATGGTCTGGA CATGGTTTGT 345 TTGGGATGTT CGCTTGTACA GGGGCAGCCT TGTCAGCTGC GTACATCATT GTGTATTGTG 4163 ||||||||| | ||||||| | ||||||| |||| | || | |||||||| |||||||||| TTGGGATGTC CATTTGTACA AGTGCAGCCT TGTCGGTTGT GAACATCATT GTGTATTGTG 405 TAGTGGCAGC CTTATCGGCA TACGTATGTT ATTATGCTTT GAATAGTGGC GGCCTTGTCG 4103 |||||||||| || ||||| | |||||| | |||||| ||| | |||||||| |||||||||| TAGTGGCAGC CTCGTCGGC- TGCGTATGCT ATTATGTTTT GGATAGTGGC GGCCTTGTCG 464 GCTCGCGTAT GTTGTTATGG TTGAATGGTT ATGACTCCTT ATGA 4059 |||||| ||| ||||||| | || ||||||| ||||||| || |||| GCTCGCATAT GTTGTTACGA TTTAATGGTT ATGACTCTTT ATGA 508 hqPGS_C06HBa0057J04.1-26-_SGN-E310669+ (5840 5782,5232 5186,4459 4059) ******************************************************************************** EST sequence 11 +strand 606 n (File: SGN-E538151+) 1 GAGAAAACCA GCCATTAAAC TCTTGGACAG CAGCTGCAAA GAAATTGAGT CATTTATCAT 61 TTCACCGAGT CCCGGGCCGG GTAATGTTCG TGCGGAGTTT CTTGCATATG TCACCGAGTC 121 CCTCACTAGA GGGCCGGGAA TGTATATTAT ATATATGATT GGTGATGAGG ATGGTTATGA 181 TGATGATGAT GACGGAGATG ATGTGATGAC TATTTCACTG AGTCCCTCAC TAGAGGGCCG 241 GGTGTAGACG CTCAGTTTGG TGATCCTCCC GCCTAGGATA TCTACTCTGC TGTTTGGGAG 301 AGCTCCACTG TTCCGGAGCC CAGTCGTTTT GGTACATAAC TTCTTATGTA GTCTTTTGCT 361 TGTCTATGGG TATGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC TTAGAGGTCT 421 GTAGACATCG TGTGGGTTGT ATAATTATGT TTTTGATAAT GGTCTGGACA TGGTTTGTTT 481 GGGATGTCCA CTTGTACAAG TGCAACCTTG TCGGTTGTGT ACATCTTTGT GTATTGTGTA 541 GTGGCAGCCT TGTCGGCTGC GTATGCTATT ATGCTTTGAA TAGTGGCAGC CTTGTCGGCT 601 CGCGTA Predicted gene structure (within gDNA segment 6172 to 3484): Exon 1 5824 5782 ( 43 n); cDNA 5 47 ( 43 n); score: 0.814 Intron 1 5781 4833 ( 949 n); Pd: 0.993 (s: 0.81), Pa: 0.932 (s: 0.96) Exon 2 4832 4636 ( 197 n); cDNA 48 241 ( 194 n); score: 0.904 Intron 2 4635 4460 ( 176 n); Pd: 0.000 (s: 0.76), Pa: 0.000 (s: 0.94) Exon 3 4459 4094 ( 366 n); cDNA 242 606 ( 365 n); score: 0.903 MATCH C06HBa0057J04.1-26- SGN-E538151+ 0.903 606 1.000 C PGS_C06HBa0057J04.1-26-_SGN-E538151+ (5824 5782,4832 4636,4459 4094) Alignment (genomic DNA sequence = upper lines): AAACCAACCC TGCAACTTTT GGCCAGCAGC TGCAAATAAT TTGGTTAGTA ATCTCCTTGT 5765 |||||| || | |||| || || ||||||| |||||| || ||| AAACCAGCCA TTAAACTCTT GGACAGCAGC TGCAAAGAAA TTG....... .......... 47 TTGGTGTGTT AATTCTTTAG AATACCCTTG TTAATTATCC ATTAATTTTA AGTAGGGGGC 5705 .......... .......... .......... .......... .......... .......... 47 GTGACCAGTA GCTTAGGAAG TTTGTTTTAG TTATTGAATG TGCTAAGCAT GAATGGAAAC 5645 .......... .......... .......... .......... .......... .......... 47 CATAATTGGA TTATTAGTGG TGTCGTGTTG GTGCTTGGGC TGTTTTGATT AAAGCAAACT 5585 .......... .......... .......... .......... .......... .......... 47 GCAGGAAAAT TCTATTTTGG CATTATGTAT ATGTTGAATG TGATTATGAG TATATACTCC 5525 .......... .......... .......... .......... .......... .......... 47 AAAGGATGAA TACGATAAGG TAGATGTGTT GGGAATTATA AAACGAGTTA TCACTCGGTG 5465 .......... .......... .......... .......... .......... .......... 47 TGTCGTTGCT ATGGTTGCCG AGACGGAACT ATTTTGGAGA GGGGGCTGTT TAATATGATT 5405 .......... .......... .......... .......... .......... .......... 47 CTTTGGGTTA TATGTGTTAT TGGTATTGCT GTGGATAATT TGGATTGTTG TCGGATTGGG 5345 .......... .......... .......... .......... .......... .......... 47 TCGAAGTAAG GATGGGGAGG TGCTGCCGAA TTTTTGTTAG ATTATTAGCT AGCTTACAAG 5285 .......... .......... .......... .......... .......... .......... 47 AAAGTAAAGC ACGATGTTTA TCTAATTGCG GCACGATTGT TGCTTGTTAT AGATTAATAG 5225 .......... .......... .......... .......... .......... .......... 47 CTTGAGCAGT AAATATTGGA CGTGCGGCTC GATTATACGG TATGTAACGC TGTCCCTTCT 5165 .......... .......... .......... .......... .......... .......... 47 TTCATTGGTT GGCGTGACTT TTAAAAATAA GCGAATAACG GATAGATTTG ATACTTACCT 5105 .......... .......... .......... .......... .......... .......... 47 CTAGAGCGTC TAGGTGACGT ATATTCTTGC TTCCACAATT ATTCCTCTAT ATATCGGCTA 5045 .......... .......... .......... .......... .......... .......... 47 TGTCTAAGGC TATGATGATC TCTAATATCT ATGGTAATGC TTCTTAGAGT CATTGAGATT 4985 .......... .......... .......... .......... .......... .......... 47 TTACGTTTCC ATATCGTATT AAAGGTTCAT AATCTTGATA AAACATTAAT CATTGGTAAT 4925 .......... .......... .......... .......... .......... .......... 47 ACTCCTTGCT GGTTCACGTT GATTGTTCTA TTGAGTTATA AGAAATGATT TTAATTGCAT 4865 .......... .......... .......... .......... .......... .......... 47 ATGGTTGCTC ATAATATTCT GCTCGTGCAT AGAGTCATTT ATCATTTCAC CGAGTCCCAG 4805 |||||||| |||||||||| |||||||| | .......... .......... .......... ..AGTCATTT ATCATTTCAC CGAGTCCCGG 75 GTCGGGTAAT GTTCGTGCGG AGTTTCTTGC ATATGTCACC GAGTTCCTCA CTAGAGGGCC 4745 | |||||||| |||||||||| |||||||||| |||||||||| |||| ||||| |||||||||| GCCGGGTAAT GTTCGTGCGG AGTTTCTTGC ATATGTCACC GAGTCCCTCA CTAGAGGGCC 135 TGGTATGTAT ATTATATATA TATGATTGGT GATGAGGATG GTTATGATGA TGATGATGAC 4685 || |||||| | | |||||| |||||||||| |||||||||| |||||||||| |||||||||| GGGAATGTAT A-T-TATATA TATGATTGGT GATGAGGATG GTTATGATGA TGATGATGAC 193 GGAGATGACG TGATGATTAT TTTGCCGAGC CCCATACTAG GGAAGCTGGG CACCTTAAAT 4625 |||||||| | |||||| ||| || | ||| ||| ||||| | || || GGAGATGATG TGATGACTAT TTCACTGAGT CCCTCACTAG AG-GGCCGG. .......... 241 GTTAAATATA TGCATGATTT TCACTTAAAA GGGTATATGT GTAGCGATAT TTTGTTTCGA 4565 .......... .......... .......... .......... .......... .......... 241 CTTGCCATAT TGGTATCCTG GCATCTTTAC CTTATGCTTT ACATACTCAG TACATTGTCC 4505 .......... .......... .......... .......... .......... .......... 241 GTACTGACCC CGCTTTCCTC GGGGGGCTGC GTTTCATGCC TGCAGGTGTA GACACACAGT 4445 ||||| ||| | |||| .......... .......... .......... .......... .....GTGTA GACGCTCAGT 256 TCGGTGATCC TCCCGCCTAG GATATCTACT CTGCTGATTG GGAGAGCTCC ACTGTTCCGG 4385 | |||||||| |||||||||| |||||||||| |||||| ||| |||||||||| |||||||||| TTGGTGATCC TCCCGCCTAG GATATCTACT CTGCTGTTTG GGAGAGCTCC ACTGTTCCGG 316 AGCCCTGTCG TTTTGGTACA TAACTT-TTG TGTAGTCTTT TGCTCGTCTA TGGGTATGGC 4326 ||||| |||| |||||||||| |||||| || |||||||||| |||| ||||| ||||||| || AGCCCAGTCG TTTTGGTACA TAACTTCTTA TGTAGTCTTT TGCTTGTCTA TGGGTAT-GC 375 GGGGCCCTGT CCCGTCGAGT TTCACTAATG TACTCTTAGA GGTCTTTGGA CATTATGTGG 4266 |||||||||| |||||| ||| ||||||| | |||||||||| ||||| | || ||| ||||| GGGGCCCTGT CCCGTCAAGT TTCACTACTA TACTCTTAGA GGTCTGTAGA CATCGTGTGG 435 GTTATATATA TATGTTTTGG ATAATGGTCT GGACATGGTT TGTTTGGGAT GTTCGCTTGT 4206 ||| |||| |||||||| | |||||||||| |||||||||| |||||||||| || | ||||| GTTGTATAAT TATGTTTTTG ATAATGGTCT GGACATGGTT TGTTTGGGAT GTCCACTTGT 495 ACAGGGGCAG CCTTGTCAGC TGCGTACATC ATTGTGTATT GTGTAGTGGC AGCCTTATCG 4146 ||| | ||| ||||||| | || ||||||| ||||||||| |||||||||| |||||| ||| ACAAGTGCAA CCTTGTCGGT TGTGTACATC TTTGTGTATT GTGTAGTGGC AGCCTTGTCG 555 GCATACGTAT GTTATTATGC TTTGAATAGT GGCGGCCTTG TCGGCTCGCG TA 4094 || | ||||| | |||||||| |||||||||| ||| |||||| |||||||||| || GC-TGCGTAT GCTATTATGC TTTGAATAGT GGCAGCCTTG TCGGCTCGCG TA 606 hqPGS_C06HBa0057J04.1-26-_SGN-E538151+ (5824 5782,4832 4636,4459 4094) ******************************************************************************** EST sequence 17 +strand 470 n (File: SGN-E268096+) 1 GAAAACCAGC CATTAAACTC TTGGACAGCA GCTGCAAAGA AATTGAGTCA TTTATCATTG 61 CACCGAGTCC CGGGCCGGGT AATGTTCGTG CGGAGTTTCT TGCATATGTC ACCGAGTCCC 121 TCACTAGAGG GCCGGGAATG TATATTATAT ATATGATTGG TGATGAGGAT GGTTATGATG 181 ATGATGATGA CGGAGATGAT GTGATGACTA TTTCACTGAG TCCCTCACTA GAGGGCCGGG 241 TGTAGACGCT CAGTTTGGTG ATCCTCCCGC CTAGGATATC TACTCTGCTG TTTGGGAGAG 301 CTCCACTGTT CCGGAGCCCA GTCGTTTTGG TACATAACTT CTTATGTAGT CTTTTGCTTG 361 TCTATGGGTA TGCGGGGCCC TGTCCCGTCA AGTTTCACTA CTATACTCTT AGAGGTCTGT 421 AGACATCGTG TGGGTAGTAT AATTATGTTT TTGATAATGG GCTGGACATG Predicted gene structure (within gDNA segment 6152 to 2449): Exon 1 5824 5782 ( 43 n); cDNA 3 45 ( 43 n); score: 0.814 Intron 1 5781 4833 ( 949 n); Pd: 0.993 (s: 0.81), Pa: 0.932 (s: 0.94) Exon 2 4832 4636 ( 197 n); cDNA 46 239 ( 194 n); score: 0.898 Intron 2 4635 4460 ( 176 n); Pd: 0.000 (s: 0.76), Pa: 0.000 (s: 0.94) Exon 3 4459 4229 ( 231 n); cDNA 240 470 ( 231 n); score: 0.898 MATCH C06HBa0057J04.1-26- SGN-E268096+ 0.898 471 1.002 C PGS_C06HBa0057J04.1-26-_SGN-E268096+ (5824 5782,4832 4636,4459 4229) Alignment (genomic DNA sequence = upper lines): AAACCAACCC TGCAACTTTT GGCCAGCAGC TGCAAATAAT TTGGTTAGTA ATCTCCTTGT 5765 |||||| || | |||| || || ||||||| |||||| || ||| AAACCAGCCA TTAAACTCTT GGACAGCAGC TGCAAAGAAA TTG....... .......... 45 TTGGTGTGTT AATTCTTTAG AATACCCTTG TTAATTATCC ATTAATTTTA AGTAGGGGGC 5705 .......... .......... .......... .......... .......... .......... 45 GTGACCAGTA GCTTAGGAAG TTTGTTTTAG TTATTGAATG TGCTAAGCAT GAATGGAAAC 5645 .......... .......... .......... .......... .......... .......... 45 CATAATTGGA TTATTAGTGG TGTCGTGTTG GTGCTTGGGC TGTTTTGATT AAAGCAAACT 5585 .......... .......... .......... .......... .......... .......... 45 GCAGGAAAAT TCTATTTTGG CATTATGTAT ATGTTGAATG TGATTATGAG TATATACTCC 5525 .......... .......... .......... .......... .......... .......... 45 AAAGGATGAA TACGATAAGG TAGATGTGTT GGGAATTATA AAACGAGTTA TCACTCGGTG 5465 .......... .......... .......... .......... .......... .......... 45 TGTCGTTGCT ATGGTTGCCG AGACGGAACT ATTTTGGAGA GGGGGCTGTT TAATATGATT 5405 .......... .......... .......... .......... .......... .......... 45 CTTTGGGTTA TATGTGTTAT TGGTATTGCT GTGGATAATT TGGATTGTTG TCGGATTGGG 5345 .......... .......... .......... .......... .......... .......... 45 TCGAAGTAAG GATGGGGAGG TGCTGCCGAA TTTTTGTTAG ATTATTAGCT AGCTTACAAG 5285 .......... .......... .......... .......... .......... .......... 45 AAAGTAAAGC ACGATGTTTA TCTAATTGCG GCACGATTGT TGCTTGTTAT AGATTAATAG 5225 .......... .......... .......... .......... .......... .......... 45 CTTGAGCAGT AAATATTGGA CGTGCGGCTC GATTATACGG TATGTAACGC TGTCCCTTCT 5165 .......... .......... .......... .......... .......... .......... 45 TTCATTGGTT GGCGTGACTT TTAAAAATAA GCGAATAACG GATAGATTTG ATACTTACCT 5105 .......... .......... .......... .......... .......... .......... 45 CTAGAGCGTC TAGGTGACGT ATATTCTTGC TTCCACAATT ATTCCTCTAT ATATCGGCTA 5045 .......... .......... .......... .......... .......... .......... 45 TGTCTAAGGC TATGATGATC TCTAATATCT ATGGTAATGC TTCTTAGAGT CATTGAGATT 4985 .......... .......... .......... .......... .......... .......... 45 TTACGTTTCC ATATCGTATT AAAGGTTCAT AATCTTGATA AAACATTAAT CATTGGTAAT 4925 .......... .......... .......... .......... .......... .......... 45 ACTCCTTGCT GGTTCACGTT GATTGTTCTA TTGAGTTATA AGAAATGATT TTAATTGCAT 4865 .......... .......... .......... .......... .......... .......... 45 ATGGTTGCTC ATAATATTCT GCTCGTGCAT AGAGTCATTT ATCATTTCAC CGAGTCCCAG 4805 |||||||| |||||| ||| |||||||| | .......... .......... .......... ..AGTCATTT ATCATTGCAC CGAGTCCCGG 73 GTCGGGTAAT GTTCGTGCGG AGTTTCTTGC ATATGTCACC GAGTTCCTCA CTAGAGGGCC 4745 | |||||||| |||||||||| |||||||||| |||||||||| |||| ||||| |||||||||| GCCGGGTAAT GTTCGTGCGG AGTTTCTTGC ATATGTCACC GAGTCCCTCA CTAGAGGGCC 133 TGGTATGTAT ATTATATATA TATGATTGGT GATGAGGATG GTTATGATGA TGATGATGAC 4685 || |||||| | | |||||| |||||||||| |||||||||| |||||||||| |||||||||| GGGAATGTAT A-T-TATATA TATGATTGGT GATGAGGATG GTTATGATGA TGATGATGAC 191 GGAGATGACG TGATGATTAT TTTGCCGAGC CCCATACTAG GGAAGCTGGG CACCTTAAAT 4625 |||||||| | |||||| ||| || | ||| ||| ||||| | || || GGAGATGATG TGATGACTAT TTCACTGAGT CCCTCACTAG AG-GGCCGG. .......... 239 GTTAAATATA TGCATGATTT TCACTTAAAA GGGTATATGT GTAGCGATAT TTTGTTTCGA 4565 .......... .......... .......... .......... .......... .......... 239 CTTGCCATAT TGGTATCCTG GCATCTTTAC CTTATGCTTT ACATACTCAG TACATTGTCC 4505 .......... .......... .......... .......... .......... .......... 239 GTACTGACCC CGCTTTCCTC GGGGGGCTGC GTTTCATGCC TGCAGGTGTA GACACACAGT 4445 ||||| ||| | |||| .......... .......... .......... .......... .....GTGTA GACGCTCAGT 254 TCGGTGATCC TCCCGCCTAG GATATCTACT CTGCTGATTG GGAGAGCTCC ACTGTTCCGG 4385 | |||||||| |||||||||| |||||||||| |||||| ||| |||||||||| |||||||||| TTGGTGATCC TCCCGCCTAG GATATCTACT CTGCTGTTTG GGAGAGCTCC ACTGTTCCGG 314 AGCCCTGTCG TTTTGGTACA TAACTT-TTG TGTAGTCTTT TGCTCGTCTA TGGGTATGGC 4326 ||||| |||| |||||||||| |||||| || |||||||||| |||| ||||| ||||||| || AGCCCAGTCG TTTTGGTACA TAACTTCTTA TGTAGTCTTT TGCTTGTCTA TGGGTAT-GC 373 GGGGCCCTGT CCCGTCGAGT TTCACTAATG TACTCTTAGA GGTCTTTGGA CATTATGTGG 4266 |||||||||| |||||| ||| ||||||| | |||||||||| ||||| | || ||| ||||| GGGGCCCTGT CCCGTCAAGT TTCACTACTA TACTCTTAGA GGTCTGTAGA CATCGTGTGG 433 GTTATATATA TATGTTTTGG ATAATGGTCT GGACATG 4229 || |||| |||||||| | ||||||| || ||||||| GTAGTATAAT TATGTTTTTG ATAATGGGCT GGACATG 470 hqPGS_C06HBa0057J04.1-26-_SGN-E268096+ (5824 5782,4832 4636,4459 4229) ******************************************************************************** EST sequence 4 -strand 573 n (File: SGN-E538150-) 1 CTGCGTATGC TATTATGCTT TGAATAGTGG CAGCCTTGTC GGCTCGCGTA TGTTGTTACG 61 GTTGAATGGT TATGACTCTT TATGAGATAG ATCCACTTTA TATATATATA TATGGTGTTG 121 GGTTTGGCTT GAAAAAAAAA AAAAAAAAAA AACTCTTGAT ACAGTATTGG TTGGAAATTC 181 CCAAAGAGTT GCAGGTGCAG ATTATGCATT AGAAAGTATC CACAACATCA GGGAAGCAAT 241 ACCACAACTT TGGGAAGTAG ACAGGCTGGC TGAAGTTAAC TACTCTGGTG TAGCTGTTGA 301 GACATCTGTC ACAGCTTAGA ATCAGTAGTA CTACTATATC TCATCATCAT GCTGATGGCA 361 GAAGGAAAAA AAAATTAATC AAGAATCATG AGAAGATCCA AAATTTTCTG TCAAATTTGA 421 TTTTAAATGA TGTTGATGTT TTGTTGTCAT CAATTAATAA CTAGCTTTTA GTATTTCCTT 481 TCCATCCACA AATCTTGTAA ATAAATTCTA TATTTATCAG TCTACCTTTC TATGATTATA 541 TAATAATGAA GTTCAATTAT TAAAAAAAAA AAA Predicted gene structure (within gDNA segment 4843 to 1): Exon 1 4140 4022 ( 119 n); cDNA 4 121 ( 118 n); score: 0.882 Intron 1 4021 3704 ( 318 n); Pd: 0.000 (s: 0.78), Pa: 0.950 (s: 0) Exon 2 3703 3684 ( 20 n); cDNA 122 141 ( 20 n); score: 0.600 Intron 2 3683 557 (3127 n); Pd: 0.000 (s: 0), Pa: 0.812 (s: 0) Exon 3 556 534 ( 23 n); cDNA 142 164 ( 23 n); score: 0.565 PPA cDNA 562 573 MATCH C06HBa0057J04.1-26- SGN-E538150- 0.882 162 0.283 C PGS_C06HBa0057J04.1-26-_SGN-E538150- (4140 4022,3703 3684,556 534) Alignment (genomic DNA sequence = upper lines): CGTATGTTAT TATGCTTTGA ATAGTGGCGG CCTTGTCGGC TCGCGTATGT TGTTATGGTT 4081 |||||| ||| |||||||||| |||||||| | |||||||||| |||||||||| ||||| |||| CGTATGCTAT TATGCTTTGA ATAGTGGCAG CCTTGTCGGC TCGCGTATGT TGTTACGGTT 63 GAATGGTTAT GACTCCTTAT GAGACAGGTC CTCTTATATA TATATGACGT TGGGGTTGGC 4021 |||||||||| ||||| |||| |||| || || | ||| |||| ||||| ||| ||||| GAATGGTTAT GACTCTTTAT GAGATAGATC CACTT-TATA TATATATATA TGGTGTTGG. 121 TTGATTTGAT TAAATTCCAT ATTGTCTTAG TTTCAGTTGG TTATACTTAG CAGGTTTGTA 3961 .......... .......... .......... .......... .......... .......... 121 TGTGGTTGTC CAAAAAGGGT ACTAGTAACG GCCCATCGGG TTGGGTCGTG ACAAAGAGTG 3901 .......... .......... .......... .......... .......... .......... 121 GTATCAGAGC GGTTCTTCCT TGGAAGTGTC TACAGACCAT GTCTAGTAGA GTCTTGTTTA 3841 .......... .......... .......... .......... .......... .......... 121 TCGGTGTGTT GTGCACCACA TCTATAAACA GGAAGCTACA GGACATTTAG GATGTCATTC 3781 .......... .......... .......... .......... .......... .......... 121 TTTCTTCTTA TTCTAGATCG TGCGATAGAG CTATATTAAC AGGATAATCC CTCTCTAACG 3721 .......... .......... .......... .......... .......... .......... 121 AATCCGTGTG TTTTCAGCTA TGCCTCCAAA GAAAGCAATA GCCGCCCAGA AGGGAAAATC 3661 | || || ||| ||| | .......... .......GTT TGGCTTGAAA AAAAAAA... .......... .......... 141 GGTAGCAGAA GGTACTAGTC AGACCCGAAG AGTTACTAGG GCCCGTGCCC AGTCTATGCC 3601 .......... .......... .......... .......... .......... .......... 141 TGGTATTATG CTCCAGTCGG AGAGCTCTGC TACACTCCCA CCGCCAGAAG AGCTTAGAGC 3541 .......... .......... .......... .......... .......... .......... 141 AGCAGCAGCT CCAGTTCGGG GGATACCACC AGCCCCCGAG GCCCCAACAT CTGAACCTCC 3481 .......... .......... .......... .......... .......... .......... 141 AGCTTCTCAG TCAGGGGCGG AGGATAGGGC CATGAGAGAT GCGGTTCAAT TGGTGACTAG 3421 .......... .......... .......... .......... .......... .......... 141 ATTAGTGGCA GATCAGGCTC GCAGGCATGG ACTAGGAGTT GATCATGCGG ACAAATCTGA 3361 .......... .......... .......... .......... .......... .......... 141 TAGCTTAAGG GCTCGTGACT TCTTAAGTTG TAATCTTCCA GAGTTCTTTG GGTCAAGGCC 3301 .......... .......... .......... .......... .......... .......... 141 CCAGGATGAT CCGCAAGAGT TTATTCATCA GATGCAGCGT ACATTGAGGA TAATCAAGGC 3241 .......... .......... .......... .......... .......... .......... 141 TTCGGAGACC GAGTCTGTTG AGTTGGCTAC ATATCGTTTG CGGGATGTAG CTATTAATTG 3181 .......... .......... .......... .......... .......... .......... 141 GTATGAGCCT TAGGAGTTAT CTAGGGTGAG GGTGCTCCTC CAGCGGTGTG GGATGAATTT 3121 .......... .......... .......... .......... .......... .......... 141 GTGGAGGCTT TCCAGGGCCA CTTCCTGCCT CCAGAGATGA AGCGAGCTAG AGTCGATAGA 3061 .......... .......... .......... .......... .......... .......... 141 TTCTTGCGTT TGAAGCAAAA TGGCAGGAGC GTTCGAGAGT ATAGCCTCGA GTTTGATTCA 3001 .......... .......... .......... .......... .......... .......... 141 TTGGCTAGGC ATGCGCCTAC TATTGTGGCT GATATGGCAG ACAGGGTACA TCGTTATGTG 2941 .......... .......... .......... .......... .......... .......... 141 ATGGGATTGG ATCGTTATCT GATTGACGGT TGTATGGCAG TGACTCTTCA GCCAGGTATG 2881 .......... .......... .......... .......... .......... .......... 141 GACATTGCTC GGGTGCAGGC ATATGCATAG GGGGTAGAGG ATTGGCACCG GGGACGTCAG 2821 .......... .......... .......... .......... .......... .......... 141 CCAGATAGAG ATTATAATAG AGGCCAGCAT AAGAGGGCTA GATCAGCAGG TTATCCTGAC 2761 .......... .......... .......... .......... .......... .......... 141 GAGTTTCAAA GCGGGCAGTC TCAGCAGCAT GTTAGATTTT CTTCCCAGCC AGCACAGAGG 2701 .......... .......... .......... .......... .......... .......... 141 GCACCCCCAC GTTTCATGGG TAGGGGGTTC GATCGTATGG GATACTCGGA AGCTAGTTAG 2641 .......... .......... .......... .......... .......... .......... 141 AGCTCTAGGG CGTCAAGGTC ACAGATGGGC AGGGGTTTGA GCCAGTCAAG GCCACCTTTG 2581 .......... .......... .......... .......... .......... .......... 141 CCTCGGTGTT CTCATTGTGG TAAGTCCCAT CCTGGGGAAT GTCGTTGGGC TACAGGTGCG 2521 .......... .......... .......... .......... .......... .......... 141 TGTTTTTCTT GCGGCCGTCA GGGCCATACT ATGAGGGAGT GTCACCTTAG AGGTAGTGCA 2461 .......... .......... .......... .......... .......... .......... 141 GGTGGTATGG CACATCCTAC AGGGTCCGTT GCTGGTTCAT CTTCTTCTGT GGCTGTGCGC 2401 .......... .......... .......... .......... .......... .......... 141 CCTACGGGGC AGGGTATTCA GGCGCCAACA GGCCGTGGTA GAGGACGTGA TGGAGCTTCC 2341 .......... .......... .......... .......... .......... .......... 141 AGTTCTAGCG GTCCCTCAAA CCGTATATAT GCTTTGACTA ATAGGCAAGA TTAGGAGGCG 2281 .......... .......... .......... .......... .......... .......... 141 TCACCTAATG TGATCACAGG TATATTATCA CTATTCTCCC GAAGTGTGTA TGCATTGATA 2221 .......... .......... .......... .......... .......... .......... 141 GACCCAGGTT CCACCTTATC ATATATATCT CCCTTTGTTG CTAGTAGGAT CGGAATAGAG 2161 .......... .......... .......... .......... .......... .......... 141 TCTGAGTTGA TAGAACCATT TGAGGTAGCT ACACCTGTAG GAGATTTTGT CATAGCTACG 2101 .......... .......... .......... .......... .......... .......... 141 CGAGTATATA GGAATTGTTC AGTAGTTATA TATAGTCGTC GTACCGTAGA AGATCTAATA 2041 .......... .......... .......... .......... .......... .......... 141 GAGTTAAATA TGATTGAGTT TGATATTATC ATGGGCATGG ATTGGTTGGT TGCTTGTTAT 1981 .......... .......... .......... .......... .......... .......... 141 GCTAATATTG ATTGCAGAGG AAAGATAGTT CGATTTCAAT TTCCAGGGGA ACAGATTATA 1921 .......... .......... .......... .......... .......... .......... 141 GAGTGGAAGG GAAGTACAGT ATCGCCGAAA GGTAAGTTCA TTTCATACCT CAAGGCCGGG 1861 .......... .......... .......... .......... .......... .......... 141 AAGATGGTTA GAAAAGGCTA TATTTACCAT CTGATTCGGG TGCATGACAT AAAGGCAGAG 1801 .......... .......... .......... .......... .......... .......... 141 ACACCGACTC TTCAATCAGT CCAGGTAGTT AATGAATTTC CCGATATATT CCCCGAGGAA 1741 .......... .......... .......... .......... .......... .......... 141 CTTCCAGGCC TTCCTCCAGA ACGGGAGATA GAGTTTACTA TAGATGTACT ACCAGATGCC 1681 .......... .......... .......... .......... .......... .......... 141 CACCCTATAT CTATACCTCC TTATAGAATG GCACCTGCTG AGTTGGAAGA ATTGAAAGAG 1621 .......... .......... .......... .......... .......... .......... 141 CAATTGAGGG ATTTGCTAGA AAAGGGCTTC ATCAGGCCTA GTACGTCACC TTGGGGAGCA 1561 .......... .......... .......... .......... .......... .......... 141 CCGGTACTGT TTGTGAGGAA GAAGGATGGG TCGCTGCGGA TGTGCATTGA TTATAGGCAG 1501 .......... .......... .......... .......... .......... .......... 141 TTGAACAAAG TAACAATAAA GAACAGGTAT CCCCTCCCAA GGATTGACGA TCTACTTGAC 1441 .......... .......... .......... .......... .......... .......... 141 CAGTTGCAGG GTGCAAAGTG TTTTTCAAAG ATAGACTTGC GGTCAGGTTA TCATCAGGTG 1381 .......... .......... .......... .......... .......... .......... 141 CGGGTAAGGG AGGCAGATAT TCCAAAGACA GCATTCCGGA CCCGATATGG GCATTATGAG 1321 .......... .......... .......... .......... .......... .......... 141 TTTAGAGTGC TGTCTTTTGG GCTGACTAAT GCTCCAGCGG TGTTCATGGA TTTAATAAAT 1261 .......... .......... .......... .......... .......... .......... 141 TGAGTATTTA AACCATTCCT TGATATGTTT GTTATTGTAT TTATAGACGA TATTCTGGTC 1201 .......... .......... .......... .......... .......... .......... 141 TATTCACGTT CAGAAGAGGA GCATGCAGAT CATTTAAGGA CGGTACTTAG GGTGCTTCAG 1141 .......... .......... .......... .......... .......... .......... 141 CACCAGAAGT TGTATGCTAA ATTTTCTAAG TGCGAGTTCT GGTTGACTTC AGTGGCATTC 1081 .......... .......... .......... .......... .......... .......... 141 TTGGGCATAT TATTGGAGCT GATGGTATTC GGGTAGACAC GCAGAAGATT GAGGCAGTAA 1021 .......... .......... .......... .......... .......... .......... 141 AGACTTGGCC CAGACCTGCG ACACCTACTG AGGTACGCAG CTTTTTGGGG TTAGCAGGAT 961 .......... .......... .......... .......... .......... .......... 141 ATTACAGGAG ATTCGTAGAC AAGTTTGCTT CAATTTCAGC GCCTTTGACA AGGCTAACTC 901 .......... .......... .......... .......... .......... .......... 141 AAAAGGAAGC CAAGTTCCAG TGGACATATG TTTGTGAGCG AAGCTTCCAG CTATTGAAAG 841 .......... .......... .......... .......... .......... .......... 141 ACAAATTGAC TACAGCTCCA GTCCTAACTC TTCCGGAGGG ACCAGACGGC TATGTTATTT 781 .......... .......... .......... .......... .......... .......... 141 ATTGTGATGC TTCGGGTGTT GGGCTAGGAT GTGTATTGAT GCAGCATGGC AAAGTTATAG 721 .......... .......... .......... .......... .......... .......... 141 CCTATGCCTC CCGACAACTT AGGAAGCATG AAAAGAACTA TCCTATTCAC GATCTGGAGT 661 .......... .......... .......... .......... .......... .......... 141 TAGCGGTCGT GGTTCATGCC TTGAAGATAT GGAGACATTA TTTATATGGT GTCCATGTGG 601 .......... .......... .......... .......... .......... .......... 141 ACATCTATAC AGATCATAAG AGTCTCCAAT ATATCTTTAA ACAGAAGGAG CTGAACTTAC 541 || | |||| .......... .......... .......... .......... ....AAAAAA AAAAACTCTT 157 GATAGAG 534 |||| || GATACAG 164 hqPGS_C06HBa0057J04.1-26-_SGN-E538150- (4140 4022) ******************************************************************************** EST sequence 10 +strand 495 n (File: SGN-E306317+) 1 TAAACTCTTG GACAGCAGCT GCAAAGAAAT TGGTGTAGAC GCTCAGTTCG GTGATCCTCC 61 CGCCTAGGAT ATCTACTCTG CTGTTTGGGA GAGCTCCACT GTTCCGGAGC CCAGTCGTTT 121 TGGTACATAA CTTCTTATGT AGTCTTTTGC TTGTCTATGG GTATGGCGGG GCCCTGTCCC 181 GTCAAGTTTC ACTACTATAC TCTTAGAGGT CTGTAGACAT CGTGTGGGTT GTATAATTAT 241 GTTTTGGATA ATGGTCTGGA CATGGTTTGT TTGGGATGTC CATTTGTACA AGTGCAGCCT 301 TGTCGGTTGT GAACATCATT GTGTATTGTG TAGTGGCAGC CTCGTCGGCT GCGTATGCTA 361 TTATGTTTTG GATAGTGGCG GCCTTGTCGG CTCGCATATG TTGTTACGAT TTAATGGTTA 421 TGACTCTTTA TGAGATAGAT CCACTTTATA TATATATATA TATATATGGC GTTGGGTTTN 481 AAAAAAAAAA AAAAA Predicted gene structure (within gDNA segment 5478 to 2011): Exon 1 4459 4038 ( 422 n); cDNA 33 453 ( 421 n); score: 0.897 PPA cDNA 481 495 MATCH C06HBa0057J04.1-26- SGN-E306317+ 0.897 422 0.853 C PGS_C06HBa0057J04.1-26-_SGN-E306317+ (4459 4038) Alignment (genomic DNA sequence = upper lines): GTGTAGACAC ACAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT GATTGGGAGA 4400 |||||||| | ||||||||| |||||||||| |||||||||| |||||||||| | |||||||| GTGTAGACGC TCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT GTTTGGGAGA 92 GCTCCACTGT TCCGGAGCCC TGTCGTTTTG GTACATAACT T-TTGTGTAG TCTTTTGCTC 4341 |||||||||| |||||||||| ||||||||| |||||||||| | || ||||| ||||||||| GCTCCACTGT TCCGGAGCCC AGTCGTTTTG GTACATAACT TCTTATGTAG TCTTTTGCTT 152 GTCTATGGGT ATGGCGGGGC CCTGTCCCGT CGAGTTTCAC TAATGTACTC TTAGAGGTCT 4281 |||||||||| |||||||||| |||||||||| | |||||||| || | ||||| |||||||||| GTCTATGGGT ATGGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC TTAGAGGTCT 212 TTGGACATTA TGTGGGTTAT ATATATATGT TTTGGATAAT GGTCTGGACA TGGTTTGTTT 4221 | ||||| |||||||| | ||| ||||| |||||||||| |||||||||| |||||||||| GTAGACATCG TGTGGGTTGT ATAATTATGT TTTGGATAAT GGTCTGGACA TGGTTTGTTT 272 GGGATGTTCG CTTGTACAGG GGCAGCCTTG TCAGCTGCGT ACATCATTGT GTATTGTGTA 4161 ||||||| | ||||||| | ||||||||| || | || | |||||||||| |||||||||| GGGATGTCCA TTTGTACAAG TGCAGCCTTG TCGGTTGTGA ACATCATTGT GTATTGTGTA 332 GTGGCAGCCT TATCGGCATA CGTATGTTAT TATGCTTTGA ATAGTGGCGG CCTTGTCGGC 4101 |||||||||| ||||| | |||||| ||| |||| |||| |||||||||| |||||||||| GTGGCAGCCT CGTCGGC-TG CGTATGCTAT TATGTTTTGG ATAGTGGCGG CCTTGTCGGC 391 TCGCGTATGT TGTTATGGTT GAATGGTTAT GACTCCTTAT GAGACAGGTC CTCTTATATA 4041 |||| ||||| ||||| | || ||||||||| ||||| |||| |||| || || | ||| |||| TCGCATATGT TGTTACGATT TAATGGTTAT GACTCTTTAT GAGATAGATC CACTT-TATA 450 TAT 4038 ||| TAT 453 hqPGS_C06HBa0057J04.1-26-_SGN-E306317+ (4459 4038) ******************************************************************************** EST sequence 19 +strand 523 n (File: SGN-E303695+) 1 AAATGGAGAA AACCAGCCAT TAAACTCTTG GACAGCAGCT GCAAAGAAAT TGGTGTAGAC 61 GCTCAGTTCG GTGATCCTCC CGCCTAGGAT ATCTACTCTG CTTTTTGGGA GAGCTCCACT 121 GTTCCGGAGC CCAGTCGTTT TGGTACATAA CTTCTTATGT AGTCTTTTGC TTGTCTATGG 181 GTATGGCGGG GCCCTGTCCC GTCAAGTTTC ACTACTATAC TCTTAGAGGT CTGTAGACAT 241 CGTGTGGGTT GTATAATTAT GTTTTGGATA ATGGTCTGGA CATGGTTTGT TTGGGATGTC 301 CATTTGTACA AGTGCAGCCT TGTCGGTTGT GAACATCATT GTGTATTGTG TAGTGGCAGC 361 CTCGTCGGCT GCGTATGCTA TTATGTTTTG GATAGTGGCG GCCTTGTCGG CTCGCATATG 421 TTGTTACGAT TTAATGGTTA TGACTCTTTA TGAGATAGAT CCACTTTATA TATATATATA 481 TATATATGGC GTTGGGTTTA GCTTGATTTG ATTAAAAAAA AAA Predicted gene structure (within gDNA segment 5678 to 1931): Exon 1 4459 4038 ( 422 n); cDNA 53 473 ( 421 n); score: 0.895 PPA cDNA 514 523 MATCH C06HBa0057J04.1-26- SGN-E303695+ 0.895 422 0.807 C PGS_C06HBa0057J04.1-26-_SGN-E303695+ (4459 4038) Alignment (genomic DNA sequence = upper lines): GTGTAGACAC ACAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT GATTGGGAGA 4400 |||||||| | ||||||||| |||||||||| |||||||||| |||||||||| |||||||| GTGTAGACGC TCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT TTTTGGGAGA 112 GCTCCACTGT TCCGGAGCCC TGTCGTTTTG GTACATAACT T-TTGTGTAG TCTTTTGCTC 4341 |||||||||| |||||||||| ||||||||| |||||||||| | || ||||| ||||||||| GCTCCACTGT TCCGGAGCCC AGTCGTTTTG GTACATAACT TCTTATGTAG TCTTTTGCTT 172 GTCTATGGGT ATGGCGGGGC CCTGTCCCGT CGAGTTTCAC TAATGTACTC TTAGAGGTCT 4281 |||||||||| |||||||||| |||||||||| | |||||||| || | ||||| |||||||||| GTCTATGGGT ATGGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC TTAGAGGTCT 232 TTGGACATTA TGTGGGTTAT ATATATATGT TTTGGATAAT GGTCTGGACA TGGTTTGTTT 4221 | ||||| |||||||| | ||| ||||| |||||||||| |||||||||| |||||||||| GTAGACATCG TGTGGGTTGT ATAATTATGT TTTGGATAAT GGTCTGGACA TGGTTTGTTT 292 GGGATGTTCG CTTGTACAGG GGCAGCCTTG TCAGCTGCGT ACATCATTGT GTATTGTGTA 4161 ||||||| | ||||||| | ||||||||| || | || | |||||||||| |||||||||| GGGATGTCCA TTTGTACAAG TGCAGCCTTG TCGGTTGTGA ACATCATTGT GTATTGTGTA 352 GTGGCAGCCT TATCGGCATA CGTATGTTAT TATGCTTTGA ATAGTGGCGG CCTTGTCGGC 4101 |||||||||| ||||| | |||||| ||| |||| |||| |||||||||| |||||||||| GTGGCAGCCT CGTCGGC-TG CGTATGCTAT TATGTTTTGG ATAGTGGCGG CCTTGTCGGC 411 TCGCGTATGT TGTTATGGTT GAATGGTTAT GACTCCTTAT GAGACAGGTC CTCTTATATA 4041 |||| ||||| ||||| | || ||||||||| ||||| |||| |||| || || | ||| |||| TCGCATATGT TGTTACGATT TAATGGTTAT GACTCTTTAT GAGATAGATC CACTT-TATA 470 TAT 4038 ||| TAT 473 hqPGS_C06HBa0057J04.1-26-_SGN-E303695+ (4459 4038) ******************************************************************************** EST sequence 2 -strand 432 n (File: SGN-E225616-) 1 TATTCGGTGT AGACGCTCAG TTCGGTGATC CTCCCGCCTA GGATATCTAC TCTGCTTTTT 61 GGGAGAGCTC CACTGTTCCG GAGCCCAGTC GTTTTGGTAC ATAACTTCTT ATGTAGTCTT 121 TTGCTTGTCT ATGGGTATGG CGGGGCCCTG TCCCGTCAAG TTTCACTACT ATACTCTTAG 181 AGGTCTGTAG ACATCGTGTG GGTTGTATAA TTATGTTTTG GATAATGGTC TGGACATGGT 241 TTGTTTGGGA TGTCCATTTG TACAAGTGCA GCCTTGTCGG TTGTGAACAT CATTGTGTAT 301 TGTGTAGTGG CAGCCTCGTC GGCTGCGTAT GCTATTATGT TTTGGATAGT GGCGGCCTTG 361 TCGGCTCGCA TATGTTGTTA CGATTTAATG GTTATGACTC TTTATGAAAA AACCAAAAAA 421 AAAAAAAAAA AA Predicted gene structure (within gDNA segment 5228 to 2391): Exon 1 4460 4059 ( 402 n); cDNA 6 407 ( 402 n); score: 0.899 PPA cDNA 415 432 MATCH C06HBa0057J04.1-26- SGN-E225616- 0.899 402 0.931 C PGS_C06HBa0057J04.1-26-_SGN-E225616- (4460 4059) Alignment (genomic DNA sequence = upper lines): GGTGTAGACA CACAGTTCGG TGATCCTCCC GCCTAGGATA TCTACTCTGC TGATTGGGAG 4401 ||||||||| | |||||||| |||||||||| |||||||||| |||||||||| | ||||||| GGTGTAGACG CTCAGTTCGG TGATCCTCCC GCCTAGGATA TCTACTCTGC TTTTTGGGAG 65 AGCTCCACTG TTCCGGAGCC CTGTCGTTTT GGTACATAAC TT-TTGTGTA GTCTTTTGCT 4342 |||||||||| |||||||||| | |||||||| |||||||||| || || |||| |||||||||| AGCTCCACTG TTCCGGAGCC CAGTCGTTTT GGTACATAAC TTCTTATGTA GTCTTTTGCT 125 CGTCTATGGG TATGGCGGGG CCCTGTCCCG TCGAGTTTCA CTAATGTACT CTTAGAGGTC 4282 ||||||||| |||||||||| |||||||||| || ||||||| ||| | |||| |||||||||| TGTCTATGGG TATGGCGGGG CCCTGTCCCG TCAAGTTTCA CTACTATACT CTTAGAGGTC 185 TTTGGACATT ATGTGGGTTA TATATATATG TTTTGGATAA TGGTCTGGAC ATGGTTTGTT 4222 | | ||||| |||||||| |||| |||| |||||||||| |||||||||| |||||||||| TGTAGACATC GTGTGGGTTG TATAATTATG TTTTGGATAA TGGTCTGGAC ATGGTTTGTT 245 TGGGATGTTC GCTTGTACAG GGGCAGCCTT GTCAGCTGCG TACATCATTG TGTATTGTGT 4162 |||||||| | ||||||| | |||||||| ||| | || | ||||||||| |||||||||| TGGGATGTCC ATTTGTACAA GTGCAGCCTT GTCGGTTGTG AACATCATTG TGTATTGTGT 305 AGTGGCAGCC TTATCGGCAT ACGTATGTTA TTATGCTTTG AATAGTGGCG GCCTTGTCGG 4102 |||||||||| | ||||| | |||||| || ||||| |||| ||||||||| |||||||||| AGTGGCAGCC TCGTCGGC-T GCGTATGCTA TTATGTTTTG GATAGTGGCG GCCTTGTCGG 364 CTCGCGTATG TTGTTATGGT TGAATGGTTA TGACTCCTTA TGA 4059 ||||| |||| |||||| | | | |||||||| |||||| ||| ||| CTCGCATATG TTGTTACGAT TTAATGGTTA TGACTCTTTA TGA 407 hqPGS_C06HBa0057J04.1-26-_SGN-E225616- (4460 4059) ******************************************************************************** EST sequence 15 +strand 453 n (File: SGN-E303256+) 1 AACCAGCCAT TAAACTCTTG GACAGCAGCT GCAAAGAAAT TGGTGTAGAC GCTCAGTTCG 61 GTGATCCTCC CGCCTAGGAT ATCTACTCTG CTTTTTGGGA GAGCTCCACT GTTCCGGAGC 121 CCAGTCGTTT TGGTACATAA CTTCTTATGT AGTCTTTTGC TTGTCTATGG GTATGGCGGG 181 GCCCTGTCCC GTCAAGTTTC ACTACTATAC TCTTAGAGGT CTGTAGACAT CGTGTGGGTT 241 GTATAATTAT GTTTTGGATA ATGGTCTGGA CATGGTTTGT TTGGGATGTC CATTTGTACA 301 AGTGCAGCCT TGTCGGTTGT GAACATCATT GTGTATTGTG TAGTGGCAGC CTCGTCGGCT 361 GCGTATGCTA TTATGTTTTG GATAGTGGCG GCCTTGTCGG CTCGCATATG TTGTTACGAT 421 TTAATGGTTA TGACTCTTTA TGAAAAAAAA AAA Predicted gene structure (within gDNA segment 5578 to 2531): Exon 1 4459 4059 ( 401 n); cDNA 43 443 ( 401 n); score: 0.899 PPA cDNA 444 453 MATCH C06HBa0057J04.1-26- SGN-E303256+ 0.899 401 0.885 C PGS_C06HBa0057J04.1-26-_SGN-E303256+ (4459 4059) Alignment (genomic DNA sequence = upper lines): GTGTAGACAC ACAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT GATTGGGAGA 4400 |||||||| | ||||||||| |||||||||| |||||||||| |||||||||| |||||||| GTGTAGACGC TCAGTTCGGT GATCCTCCCG CCTAGGATAT CTACTCTGCT TTTTGGGAGA 102 GCTCCACTGT TCCGGAGCCC TGTCGTTTTG GTACATAACT T-TTGTGTAG TCTTTTGCTC 4341 |||||||||| |||||||||| ||||||||| |||||||||| | || ||||| ||||||||| GCTCCACTGT TCCGGAGCCC AGTCGTTTTG GTACATAACT TCTTATGTAG TCTTTTGCTT 162 GTCTATGGGT ATGGCGGGGC CCTGTCCCGT CGAGTTTCAC TAATGTACTC TTAGAGGTCT 4281 |||||||||| |||||||||| |||||||||| | |||||||| || | ||||| |||||||||| GTCTATGGGT ATGGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC TTAGAGGTCT 222 TTGGACATTA TGTGGGTTAT ATATATATGT TTTGGATAAT GGTCTGGACA TGGTTTGTTT 4221 | ||||| |||||||| | ||| ||||| |||||||||| |||||||||| |||||||||| GTAGACATCG TGTGGGTTGT ATAATTATGT TTTGGATAAT GGTCTGGACA TGGTTTGTTT 282 GGGATGTTCG CTTGTACAGG GGCAGCCTTG TCAGCTGCGT ACATCATTGT GTATTGTGTA 4161 ||||||| | ||||||| | ||||||||| || | || | |||||||||| |||||||||| GGGATGTCCA TTTGTACAAG TGCAGCCTTG TCGGTTGTGA ACATCATTGT GTATTGTGTA 342 GTGGCAGCCT TATCGGCATA CGTATGTTAT TATGCTTTGA ATAGTGGCGG CCTTGTCGGC 4101 |||||||||| ||||| | |||||| ||| |||| |||| |||||||||| |||||||||| GTGGCAGCCT CGTCGGC-TG CGTATGCTAT TATGTTTTGG ATAGTGGCGG CCTTGTCGGC 401 TCGCGTATGT TGTTATGGTT GAATGGTTAT GACTCCTTAT GA 4059 |||| ||||| ||||| | || ||||||||| ||||| |||| || TCGCATATGT TGTTACGATT TAATGGTTAT GACTCTTTAT GA 443 hqPGS_C06HBa0057J04.1-26-_SGN-E303256+ (4459 4059) ******************************************************************************** EST sequence 14 +strand 691 n (File: SGN-E328093+) 1 GAAAACCAGC CATTAAACTC TTGGACAGCA GCTGCAAAGA AATTGGTTAG TAATCTCTTT 61 GCTTGGTTTG TTAATTCCTT AGAATACCTT TGTTAATTAG ACATTTATGT TAAGAAGGGG 121 GACGTGAACA GTATCTTAGG AATTTGTTTT AGTTATTGAA TGTGCTAAGG ATGAGCAGAA 181 ACCATGATCG GATTGCTAGC GGTGTTATAT TTGTGTTGGG CTGTTTTGAT TAAAGTAAGC 241 TGCTGGAAAT TCTGTTTTGG TGTTATGCAT ATGTTAATAT GATTATGGGT ATATACTCCA 301 AAGGATGAAT ACAATAAGGT AGATGTGTTG CGAATTATAA AACGAATTAT CGGTCGGTGT 361 GTCGTTGTTT TGTTACTATG GTTGCTAAAA ACGGAACTGT TTTGGGGGAG GCTGTTTAAT 421 ATGATTTGTT GGATTATATG TGTTGTTGGT ATTGTTGTGG ATAATTTGGG TTGTTGTTGG 481 ATTGGGATGA AGTAAAGAAA ATAGGGGAAG TGCTGCCGGA TTTTCGTTAG ATTATTAGCT 541 AGCTTACATA AGTAGTAAGC GCGACATTTA TCTAATTGCG GCACGATTGG TGCTTGTTAT 601 AGATTTATAC CTTGAGCAGT AAATATTGGA CGTACGGCTC GACTATTCGG TATGTAACGC 661 TATCCTTTCC TTCTTTGTTT GGCATGACCT T Predicted gene structure (within gDNA segment 8386 to 3616): Exon 1 5824 5144 ( 681 n); cDNA 3 691 ( 689 n); score: 0.811 MATCH C06HBa0057J04.1-26- SGN-E328093+ 0.811 681 0.986 C PGS_C06HBa0057J04.1-26-_SGN-E328093+ (5824 5144) Alignment (genomic DNA sequence = upper lines): AAACCAACCC TGCAACTTTT GGCCAGCAGC TGCAAATAAT TTGGTTAGTA ATCTCCTTGT 5765 |||||| || | |||| || || ||||||| |||||| || |||||||||| ||||| ||| AAACCAGCCA TTAAACTCTT GGACAGCAGC TGCAAAGAAA TTGGTTAGTA ATCTCTTTGC 62 TTGGTGTGTT AATTCTTTAG AATACCCTTG TTAATTATCC ATTAATTTTA AGTAGGGGG- 5706 ||||| |||| ||||| |||| |||||| ||| ||||||| | ||| || ||| || |||||| TTGGTTTGTT AATTCCTTAG AATACCTTTG TTAATTAGAC ATTTATGTTA AGAAGGGGGA 122 CGTGACCAGT AGCTTAGGAA GTTTGTTTTA GTTATTGAAT GTGCTAAGCA TGAATGGAAA 5646 ||||| |||| | |||||||| ||||||||| |||||||||| |||||||| | ||| |||| CGTGAACAGT ATCTTAGGAA -TTTGTTTTA GTTATTGAAT GTGCTAAGGA TGAGCAGAAA 181 CCATAATTGG ATTATTAGTG GTGTCGTGTT GGTGCTTGGG CTGTTTTGAT TAAAGCAAAC 5586 |||| || || ||| ||| | |||| | || ||| ||||| |||||||||| ||||| || | CCATGATCGG ATTGCTAGCG GTGTTATATT TGTG-TTGGG CTGTTTTGAT TAAAGTAAGC 240 TGCAGGAAAA TTCTATTTTG GCATTATGTA TATGTTGAAT GTGATTATGA GTATATACTC 5526 ||| || ||| |||| ||||| | ||||| | |||||| ||| |||||||| |||||||||| TGCTGG-AAA TTCTGTTTTG GTGTTATGCA TATGTT-AAT ATGATTATGG GTATATACTC 298 CAAAGGATGA ATACGATAAG GTAGATGTGT TGGGAATTAT AAAACGAGTT ATCACTCGGT 5466 |||||||||| |||| ||||| |||||||||| || ||||||| ||||||| || ||| ||||| CAAAGGATGA ATACAATAAG GTAGATGTGT TGCGAATTAT AAAACGAATT ATCGGTCGGT 358 GTGTC---G- -TTG---CTA TGGTTGC-CG AGACGGAACT ATTTTGGAGA GGGGGCTGTT 5415 ||||| | ||| ||| ||||||| | |||||||| |||||| | || ||||||| GTGTCGTTGT TTTGTTACTA TGGTTGCTAA AAACGGAACT GTTTTGG-G- GGAGGCTGTT 416 TAATATGATT CTTTGGGTTA TATGTGTTAT TGGTATTGCT GTGGATAATT TGGATTGTTG 5355 |||||||||| |||| ||| |||||||| | |||||||| | |||||||||| ||| |||||| TAATATGATT TGTTGGATTA TATGTGTTGT TGGTATTGTT GTGGATAATT TGGGTTGTTG 476 TCGGATTGGG TCGAAGTAAG GA---T-GGG GAGGTGCTGC CGAATTTTTG TTAGATTATT 5299 | |||||||| ||||||| || | ||| || ||||||| || ||||| | |||||||||| TTGGATTGGG ATGAAGTAAA GAAAATAGGG GAAGTGCTGC CGGATTTTCG TTAGATTATT 536 AGCTAGCTTA CA-AGAAAGT AAAGCACGAT GTTTATCTAA TTGCGGCACG ATTGTTGCTT 5240 |||||||||| || | ||| |||| ||| ||||||||| |||||||||| |||| ||||| AGCTAGCTTA CATAAGTAGT -AAGCGCGAC ATTTATCTAA TTGCGGCACG ATTGGTGCTT 595 GTTATAGATT AATAGCTTGA GCAGTAAATA TTGGACGTGC GGCTCGATTA TACGGTATGT 5180 |||||||||| ||| ||||| |||||||||| |||||||| | ||||||| || | |||||||| GTTATAGATT TATACCTTGA GCAGTAAATA TTGGACGTAC GGCTCGACTA TTCGGTATGT 655 AACGCTGTCC CTTCTTTCAT TGGTTGGCGT GACTTT 5144 |||||| ||| ||| ||| | || ||||| | ||| || AACGCTATCC TTTCCTTCTT TGTTTGGCAT GACCTT 691 hqPGS_C06HBa0057J04.1-26-_SGN-E328093+ (5824 5144) ******************************************************************************** EST sequence 21 +strand 455 n (File: SGN-E298250+) 1 AAATGGAGAA ACCAACCCTG CAACTCTTGG CCAGTAGCTG CAAATAATTT GGGGAGTAAT 61 CTCCTTGTTT GGTGTGTTAA TTCTTTAGAA CACCCTTGTT AATTATCCAT TAATTTTAAG 121 AAGGGGGCGT GACCAGTTGC TTACGAAGTT TGTTTTAGTT ATTGAATGTG CTAAGTATGA 181 ATGGAAACCA TAATCGGATT ATTAGTGGTG TCGTGTTGGT GCTTGGGCTG TTTTGATTAA 241 AGCAAACTGC AGGAAAATTC TATTTTGGCA TTATGTATAT GCTGAATGTG ATTATGAGTA 301 TATACTCCAA CGGATGAATA CGATTAGGTA AATGTGTTGC GAATTATAAA ACGAGTTATC 361 GCTCGGTGTG TCGGTGCTTC GCTGCTATAG TTGCCCAGAC GGAACTGTTT TGGGGAGGGG 421 GCTGCCTAAT GTGATACTTC GGGTTATATG TGTTA Predicted gene structure (within gDNA segment 6432 to 3463): Exon 1 5832 5386 ( 447 n); cDNA 1 455 ( 455 n); score: 0.899 MATCH C06HBa0057J04.1-26- SGN-E298250+ 0.899 447 0.982 C PGS_C06HBa0057J04.1-26-_SGN-E298250+ (5832 5386) Alignment (genomic DNA sequence = upper lines): AAATGGAGAA ACCAACCCTG CAACTTTTGG CCAGCAGCTG CAAATAATTT GGTTAGTAAT 5773 |||||||||| |||||||||| ||||| |||| |||| ||||| |||||||||| || |||||| AAATGGAGAA ACCAACCCTG CAACTCTTGG CCAGTAGCTG CAAATAATTT GGGGAGTAAT 60 CTCCTTGTTT GGTGTGTTAA TTCTTTAGAA TACCCTTGTT AATTATCCAT TAATTTTAAG 5713 |||||||||| |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| CTCCTTGTTT GGTGTGTTAA TTCTTTAGAA CACCCTTGTT AATTATCCAT TAATTTTAAG 120 TAGGGGGCGT GACCAGTAGC TTAGGAAGTT TGTTTTAGTT ATTGAATGTG CTAAGCATGA 5653 ||||||||| ||||||| || ||| |||||| |||||||||| |||||||||| ||||| |||| AAGGGGGCGT GACCAGTTGC TTACGAAGTT TGTTTTAGTT ATTGAATGTG CTAAGTATGA 180 ATGGAAACCA TAATTGGATT ATTAGTGGTG TCGTGTTGGT GCTTGGGCTG TTTTGATTAA 5593 |||||||||| |||| ||||| |||||||||| |||||||||| |||||||||| |||||||||| ATGGAAACCA TAATCGGATT ATTAGTGGTG TCGTGTTGGT GCTTGGGCTG TTTTGATTAA 240 AGCAAACTGC AGGAAAATTC TATTTTGGCA TTATGTATAT GTTGAATGTG ATTATGAGTA 5533 |||||||||| |||||||||| |||||||||| |||||||||| | |||||||| |||||||||| AGCAAACTGC AGGAAAATTC TATTTTGGCA TTATGTATAT GCTGAATGTG ATTATGAGTA 300 TATACTCCAA AGGATGAATA CGATAAGGTA GATGTGTTGG GAATTATAAA ACGAGTTATC 5473 |||||||||| ||||||||| |||| ||||| |||||||| |||||||||| |||||||||| TATACTCCAA CGGATGAATA CGATTAGGTA AATGTGTTGC GAATTATAAA ACGAGTTATC 360 ACTCGGTGTG TC-GT----- --TGCTATGG TTGCCGAGAC GGAACTATTT TGGAGAGGGG 5421 ||||||||| || || |||||| | ||||| |||| |||||| ||| ||| |||||| GCTCGGTGTG TCGGTGCTTC GCTGCTATAG TTGCCCAGAC GGAACTGTTT TGGGGAGGGG 420 GCTGTTTAAT ATGATTCTTT GGGTTATATG TGTTA 5386 |||| |||| |||| ||| |||||||||| ||||| GCTGCCTAAT GTGATACTTC GGGTTATATG TGTTA 455 hqPGS_C06HBa0057J04.1-26-_SGN-E298250+ (5832 5386) Total number of EST alignments reported: 21 ________________________________________________________________________________ Predicted gene locations (2) in segment 1 to 8386: PGL 1 (- strand): 2952 1164 AGS-1 (1893 1164) SCR (e 0.953) Exon 1 1893 1164 ( 730 n); score: 0.953 PGS (1893 1164) SGN-E379982- PGS (1682 1164) SGN-E201553- 3-phase translation of AGS-1 (-strand): . . . . . . 1893 AAAGGTAAGTTCATTTCATACCTCAAGGCCGGGAAGATGGTTAGAAAAGGCTATATTTAC K G K F I S Y L K A G K M V R K G Y I Y K V S S F H T S R P G R W L E K A I F T R - V H F I P Q G R E D G - K R L Y L . . . . . . 1833 CATCTGATTCGGGTGCATGACATAAAGGCAGAGACACCGACTCTTCAATCAGTCCAGGTA H L I R V H D I K A E T P T L Q S V Q V I - F G C M T - R Q R H R L F N Q S R - P S D S G A - H K G R D T D S S I S P G . . . . . . 1773 GTTAATGAATTTCCCGATATATTCCCCGAGGAACTTCCAGGCCTTCCTCCAGAACGGGAG V N E F P D I F P E E L P G L P P E R E L M N F P I Y S P R N F Q A F L Q N G R S - - I S R Y I P R G T S R P S S R T G . . . . . . 1713 ATAGAGTTTACTATAGATGTACTACCAGATGCCCACCCTATATCTATACCTCCTTATAGA I E F T I D V L P D A H P I S I P P Y R - S L L - M Y Y Q M P T L Y L Y L L I E D R V Y Y R C T T R C P P Y I Y T S L - . . . . . . 1653 ATGGCACCTGCTGAGTTGGAAGAATTGAAAGAGCAATTGAGGGATTTGCTAGAAAAGGGC M A P A E L E E L K E Q L R D L L E K G W H L L S W K N - K S N - G I C - K R A N G T C - V G R I E R A I E G F A R K G . . . . . . 1593 TTCATCAGGCCTAGTACGTCACCTTGGGGAGCACCGGTACTGTTTGTGAGGAAGAAGGAT F I R P S T S P W G A P V L F V R K K D S S G L V R H L G E H R Y C L - G R R M L H Q A - Y V T L G S T G T V C E E E G . . . . . . 1533 GGGTCGCTGCGGATGTGCATTGATTATAGGCAGTTGAACAAAGTAACAATAAAGAACAGG G S L R M C I D Y R Q L N K V T I K N R G R C G C A L I I G S - T K - Q - R T G W V A A D V H - L - A V E Q S N N K E Q . . . . . . 1473 TATCCCCTCCCAAGGATTGACGATCTACTTGACCAGTTGCAGGGTGCAAAGTGTTTTTCA Y P L P R I D D L L D Q L Q G A K C F S I P S Q G L T I Y L T S C R V Q S V F Q V S P P K D - R S T - P V A G C K V F F . . . . . . 1413 AAGATAGACTTGCGGTCAGGTTATCATCAGGTGCGGGTAAGGGAGGCAGATATTCCAAAG K I D L R S G Y H Q V R V R E A D I P K R - T C G Q V I I R C G - G R Q I F Q R K D R L A V R L S S G A G K G G R Y S K . . . . . . 1353 ACAGCATTCCGGACCCGATATGGGCATTATGAGTTTAGAGTGCTGTCTTTTGGGCTGACT T A F R T R Y G H Y E F R V L S F G L T Q H S G P D M G I M S L E C C L L G - L D S I P D P I W A L - V - S A V F W A D . . . . . . 1293 AATGCTCCAGCGGTGTTCATGGATTTAATAAATTGAGTATTTAAACCATTCCTTGATATG N A P A V F M D L I N - V F K P F L D M M L Q R C S W I - - I E Y L N H S L I C - C S S G V H G F N K L S I - T I P - Y . . . . . . 1233 TTTGTTATTGTATTTATAGACGATATTCTGGTCTATTCACGTTCAGAAGAGGAGCATGCA F V I V F I D D I L V Y S R S E E E H A L L L Y L - T I F W S I H V Q K R S M Q V C Y C I Y R R Y S G L F T F R R G A C . 1173 GATCATTTAA D H L I I - R S F Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-26-_PGL-1_AGS-1_PPS_1 (1893 1258) (frame '1'; 633 bp, 211 residues) 1 KGKFISYLKA GKMVRKGYIY HLIRVHDIKA ETPTLQSVQV VNEFPDIFPE ELPGLPPERE 61 IEFTIDVLPD AHPISIPPYR MAPAELEELK EQLRDLLEKG FIRPSTSPWG APVLFVRKKD 121 GSLRMCIDYR QLNKVTIKNR YPLPRIDDLL DQLQGAKCFS KIDLRSGYHQ VRVREADIPK 181 TAFRTRYGHY EFRVLSFGLT NAPAVFMDLI N- 3-phase translation of AGS-1 (+strand): . . . . . . 1164 TTAAATGATCTGCATGCTCCTCTTCTGAACGTGAATAGACCAGAATATCGTCTATAAATA L N D L H A P L L N V N R P E Y R L - I - M I C M L L F - T - I D Q N I V Y K Y K - S A C S S S E R E - T R I S S I N . . . . . . 1224 CAATAACAAACATATCAAGGAATGGTTTAAATACTCAATTTATTAAATCCATGAACACCG Q - Q T Y Q G M V - I L N L L N P - T P N N K H I K E W F K Y S I Y - I H E H R T I T N I S R N G L N T Q F I K S M N T . . . . . . 1284 CTGGAGCATTAGTCAGCCCAAAAGACAGCACTCTAAACTCATAATGCCCATATCGGGTCC L E H - S A Q K T A L - T H N A H I G S W S I S Q P K R Q H S K L I M P I S G P A G A L V S P K D S T L N S - C P Y R V . . . . . . 1344 GGAATGCTGTCTTTGGAATATCTGCCTCCCTTACCCGCACCTGATGATAACCTGACCGCA G M L S L E Y L P P L P A P D D N L T A E C C L W N I C L P Y P H L M I T - P Q R N A V F G I S A S L T R T - - - P D R . . . . . . 1404 AGTCTATCTTTGAAAAACACTTTGCACCCTGCAACTGGTCAAGTAGATCGTCAATCCTTG S L S L K N T L H P A T G Q V D R Q S L V Y L - K T L C T L Q L V K - I V N P W K S I F E K H F A P C N W S S R S S I L . . . . . . 1464 GGAGGGGATACCTGTTCTTTATTGTTACTTTGTTCAACTGCCTATAATCAATGCACATCC G G D T C S L L L L C S T A Y N Q C T S E G I P V L Y C Y F V Q L P I I N A H P G R G Y L F F I V T L F N C L - S M H I . . . . . . 1524 GCAGCGACCCATCCTTCTTCCTCACAAACAGTACCGGTGCTCCCCAAGGTGACGTACTAG A A T H P S S S Q T V P V L P K V T Y - Q R P I L L P H K Q Y R C S P R - R T R R S D P S F F L T N S T G A P Q G D V L . . . . . . 1584 GCCTGATGAAGCCCTTTTCTAGCAAATCCCTCAATTGCTCTTTCAATTCTTCCAACTCAG A - - S P F L A N P S I A L S I L P T Q P D E A L F - Q I P Q L L F Q F F Q L S G L M K P F S S K S L N C S F N S S N S . . . . . . 1644 CAGGTGCCATTCTATAAGGAGGTATAGATATAGGGTGGGCATCTGGTAGTACATCTATAG Q V P F Y K E V - I - G G H L V V H L - R C H S I R R Y R Y R V G I W - Y I Y S A G A I L - G G I D I G W A S G S T S I . . . . . . 1704 TAAACTCTATCTCCCGTTCTGGAGGAAGGCCTGGAAGTTCCTCGGGGAATATATCGGGAA - T L S P V L E E G L E V P R G I Y R E K L Y L P F W R K A W K F L G E Y I G K V N S I S R S G G R P G S S S G N I S G . . . . . . 1764 ATTCATTAACTACCTGGACTGATTGAAGAGTCGGTGTCTCTGCCTTTATGTCATGCACCC I H - L P G L I E E S V S L P L C H A P F I N Y L D - L K S R C L C L Y V M H P N S L T T W T D - R V G V S A F M S C T . . . . . . 1824 GAATCAGATGGTAAATATAGCCTTTTCTAACCATCTTCCCGGCCTTGAGGTATGAAATGA E S D G K Y S L F - P S S R P - G M K - N Q M V N I A F S N H L P G L E V - N E R I R W - I - P F L T I F P A L R Y E M . 1884 ACTTACCTTT T Y L L T F N L P Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-26+_PGL-1_AGS-1_PPS_1 (1320 1583) (frame '1'; 261 bp, 87 residues) 1 THNAHIGSGM LSLEYLPPLP APDDNLTASL SLKNTLHPAT GQVDRQSLGG DTCSLLLLCS 61 TAYNQCTSAA THPSSSQTVP VLPKVTY- AGS-2 (2735 2138) SCR (e 0.958) Exon 1 2735 2138 ( 598 n); score: 0.958 PGS (2735 2138) SGN-E350824- 3-phase translation of AGS-2 (-strand): . . . . . . 2735 AGCATGTTAGATTTTCTTCCCAGCCAGCACAGAGGGCACCCCCACGTTTCATGGGTAGGG S M L D F L P S Q H R G H P H V S W V G A C - I F F P A S T E G T P T F H G - G H V R F S S Q P A Q R A P P R F M G R . . . . . . 2675 GGTTCGATCGTATGGGATACTCGGAAGCTAGTTAGAGCTCTAGGGCGTCAAGGTCACAGA G S I V W D T R K L V R A L G R Q G H R V R S Y G I L G S - L E L - G V K V T D G F D R M G Y S E A S - S S R A S R S Q . . . . . . 2615 TGGGCAGGGGTTTGAGCCAGTCAAGGCCACCTTTGCCTCGGTGTTCTCATTGTGGTAAGT W A G V - A S Q G H L C L G V L I V V S G Q G F E P V K A T F A S V F S L W - V M G R G L S Q S R P P L P R C S H C G K . . . . . . 2555 CCCATCCTGGGGAATGTCGTTGGGCTACAGGTGCGTGTTTTTCTTGCGGCCGTCAGGGCC P I L G N V V G L Q V R V F L A A V R A P S W G M S L G Y R C V F F L R P S G P S H P G E C R W A T G A C F S C G R Q G . . . . . . 2495 ATACTATGAGGGAGTGTCACCTTAGAGGTAGTGCAGGTGGTATGGCACATCCTACAGGGT I L - G S V T L E V V Q V V W H I L Q G Y Y E G V S P - R - C R W Y G T S Y R V H T M R E C H L R G S A G G M A H P T G . . . . . . 2435 CCGTTGCTGGTTCATCTTCTTCTGTGGCTGTGCGCCCTACGGGGCAGGGTATTCAGGCGC P L L V H L L L W L C A L R G R V F R R R C W F I F F C G C A P Y G A G Y S G A S V A G S S S S V A V R P T G Q G I Q A . . . . . . 2375 CAACAGGCCGTGGTAGAGGACGTGATGGAGCTTCCAGTTCTAGCGGTCCCTCAAACCGTA Q Q A V V E D V M E L P V L A V P Q T V N R P W - R T - W S F Q F - R S L K P Y P T G R G R G R D G A S S S S G P S N R . . . . . . 2315 TATATGCTTTGACTAATAGGCAAGATTAGGAGGCGTCACCTAATGTGATCACAGGTATAT Y M L - L I G K I R R R H L M - S Q V Y I C F D - - A R L G G V T - C D H R Y I I Y A L T N R Q D - E A S P N V I T G I . . . . . . 2255 TATCACTATTCTCCCGAAGTGTGTATGCATTGATAGACCCAGGTTCCACCTTATCATATA Y H Y S P E V C M H - - T Q V P P Y H I I T I L P K C V C I D R P R F H L I I Y L S L F S R S V Y A L I D P G S T L S Y . . . . . . 2195 TATCTCCCTTTGTTGCTAGTAGGATCGGAATAGAGTCTGAGTTGATAGAACCATTTGA Y L P L L L V G S E - S L S - - N H L I S L C C - - D R N R V - V D R T I - I S P F V A S R I G I E S E L I E P F Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-26-_PGL-1_AGS-2_PPS_1 (2640 2287) (frame '0'; 351 bp, 117 residues) 1 SSRASRSQMG RGLSQSRPPL PRCSHCGKSH PGECRWATGA CFSCGRQGHT MRECHLRGSA 61 GGMAHPTGSV AGSSSSVAVR PTGQGIQAPT GRGRGRDGAS SSSGPSNRIY ALTNRQD- 3-phase translation of AGS-2 (+strand): . . . . . . 2138 TCAAATGGTTCTATCAACTCAGACTCTATTCCGATCCTACTAGCAACAAAGGGAGATATA S N G S I N S D S I P I L L A T K G D I Q M V L S T Q T L F R S Y - Q Q R E I Y K W F Y Q L R L Y S D P T S N K G R Y . . . . . . 2198 TATGATAAGGTGGAACCTGGGTCTATCAATGCATACACACTTCGGGAGAATAGTGATAAT Y D K V E P G S I N A Y T L R E N S D N M I R W N L G L S M H T H F G R I V I I I - - G G T W V Y Q C I H T S G E - - - . . . . . . 2258 ATACCTGTGATCACATTAGGTGACGCCTCCTAATCTTGCCTATTAGTCAAAGCATATATA I P V I T L G D A S - S C L L V K A Y I Y L - S H - V T P P N L A Y - S K H I Y Y T C D H I R - R L L I L P I S Q S I Y . . . . . . 2318 CGGTTTGAGGGACCGCTAGAACTGGAAGCTCCATCACGTCCTCTACCACGGCCTGTTGGC R F E G P L E L E A P S R P L P R P V G G L R D R - N W K L H H V L Y H G L L A T V - G T A R T G S S I T S S T T A C W . . . . . . 2378 GCCTGAATACCCTGCCCCGTAGGGCGCACAGCCACAGAAGAAGATGAACCAGCAACGGAC A - I P C P V G R T A T E E D E P A T D P E Y P A P - G A Q P Q K K M N Q Q R T R L N T L P R R A H S H R R R - T S N G . . . . . . 2438 CCTGTAGGATGTGCCATACCACCTGCACTACCTCTAAGGTGACACTCCCTCATAGTATGG P V G C A I P P A L P L R - H S L I V W L - D V P Y H L H Y L - G D T P S - Y G P C R M C H T T C T T S K V T L P H S M . . . . . . 2498 CCCTGACGGCCGCAAGAAAAACACGCACCTGTAGCCCAACGACATTCCCCAGGATGGGAC P - R P Q E K H A P V A Q R H S P G W D P D G R K K N T H L - P N D I P Q D G T A L T A A R K T R T C S P T T F P R M G . . . . . . 2558 TTACCACAATGAGAACACCGAGGCAAAGGTGGCCTTGACTGGCTCAAACCCCTGCCCATC L P Q - E H R G K G G L D W L K P L P I Y H N E N T E A K V A L T G S N P C P S L T T M R T P R Q R W P - L A Q T P A H . . . . . . 2618 TGTGACCTTGACGCCCTAGAGCTCTAACTAGCTTCCGAGTATCCCATACGATCGAACCCC C D L D A L E L - L A S E Y P I R S N P V T L T P - S S N - L P S I P Y D R T P L - P - R P R A L T S F R V S H T I E P . . . . . . 2678 CTACCCATGAAACGTGGGGGTGCCCTCTGTGCTGGCTGGGAAGAAAATCTAACATGCT L P M K R G G A L C A G W E E N L T C Y P - N V G V P S V L A G K K I - H A P T H E T W G C P L C W L G R K S N M Maximal non-overlapping open reading frames (>= 64 codons): none AGS-3 (2952 2757) SCR (e 0.929) Exon 1 2952 2757 ( 196 n); score: 0.929 PGS (2952 2757) SGN-E379248+ 3-phase translation of AGS-3 (-strand): . . . . . . 2952 CATCGTTATGTGATGGGATTGGATCGTTATCTGATTGACGGTTGTATGGCAGTGACTCTT H R Y V M G L D R Y L I D G C M A V T L I V M - W D W I V I - L T V V W Q - L F S L C D G I G S L S D - R L Y G S D S . . . . . . 2892 CAGCCAGGTATGGACATTGCTCGGGTGCAGGCATATGCATAGGGGGTAGAGGATTGGCAC Q P G M D I A R V Q A Y A - G V E D W H S Q V W T L L G C R H M H R G - R I G T S A R Y G H C S G A G I C I G G R G L A . . . . . . 2832 CGGGGACGTCAGCCAGATAGAGATTATAATAGAGGCCAGCATAAGAGGGCTAGATCAGCA R G R Q P D R D Y N R G Q H K R A R S A G D V S Q I E I I I E A S I R G L D Q Q P G T S A R - R L - - R P A - E G - I S . . 2772 GGTTATCCTGACGAGT G Y P D E V I L T S R L S - R Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-3 (+strand): . . . . . . 2757 ACTCGTCAGGATAACCTGCTGATCTAGCCCTCTTATGCTGGCCTCTATTATAATCTCTAT T R Q D N L L I - P S Y A G L Y Y N L Y L V R I T C - S S P L M L A S I I I S I S S G - P A D L A L L C W P L L - S L . . . . . . 2817 CTGGCTGACGTCCCCGGTGCCAATCCTCTACCCCCTATGCATATGCCTGCACCCGAGCAA L A D V P G A N P L P P M H M P A P E Q W L T S P V P I L Y P L C I C L H P S N S G - R P R C Q S S T P Y A Y A C T R A . . . . . . 2877 TGTCCATACCTGGCTGAAGAGTCACTGCCATACAACCGTCAATCAGATAACGATCCAATC C P Y L A E E S L P Y N R Q S D N D P I V H T W L K S H C H T T V N Q I T I Q S M S I P G - R V T A I Q P S I R - R S N . . 2937 CCATCACATAACGATG P S H N D H H I T M P I T - R Maximal non-overlapping open reading frames (>= 64 codons): none PGL 2 (- strand): 5843 4005 AGS-1 (5843 5775,5232 5186,4459 4005) SCR (e 0.848 d 0.900 a 0.868,e 0.979 d 0.976 a 0.000,e 0.895) Exon 1 5843 5775 ( 69 n); score: 0.848 Intron 1 5774 5233 ( 542 n); Pd: 0.900 Pa: 0.868 Exon 2 5232 5186 ( 47 n); score: 0.979 Intron 2 5185 4460 ( 726 n); Pd: 0.976 Pa: 0.000 Exon 3 4459 4005 ( 455 n); score: 0.895 PGS (5843 5775,5232 5186,4459 4005) SGN-E543104+ PGS (4140 4022) SGN-E538150- PGS (5843 5775,5232 5186,4459 4038) SGN-E543103- PGS (4459 4038) SGN-E306317+ PGS (4459 4038) SGN-E303695+ PGS (4460 4059) SGN-E225616- PGS (4459 4059) SGN-E303256+ 3-phase translation of AGS-1 (-strand): . . . . . . 5843 GGCAGCCATGGAAATGGAGAAACCAACCCTGCAACTTTTGGCCAGCAGCTGCAAATAATT G S H G N G E T N P A T F G Q Q L Q I I A A M E M E K P T L Q L L A S S C K - F Q P W K W R N Q P C N F W P A A A N N . : . . . . . : 5783 TGGTTAGTA : ATTAATAGCTTGAGCAGTAAATATTGGACGTGCGGCTCGATTATACG : GTGT W L V : I N S L S S K Y W T C G S I I R : C G - - : L I A - A V N I G R A A R L Y : G V L V S : N - - L E Q - I L D V R L D Y T : V . . . . . . 4455 AGACACACAGTTCGGTGATCCTCCCGCCTAGGATATCTACTCTGCTGATTGGGAGAGCTC R H T V R - S S R L G Y L L C - L G E L D T Q F G D P P A - D I Y S A D W E S S - T H S S V I L P P R I S T L L I G R A . . . . . . 4395 CACTGTTCCGGAGCCCTGTCGTTTTGGTACATAACTTTTGTGTAGTCTTTTGCTCGTCTA H C S G A L S F W Y I T F V - S F A R L T V P E P C R F G T - L L C S L L L V Y P L F R S P V V L V H N F C V V F C S S . . . . . . 4335 TGGGTATGGCGGGGCCCTGTCCCGTCGAGTTTCACTAATGTACTCTTAGAGGTCTTTGGA W V W R G P V P S S F T N V L L E V F G G Y G G A L S R R V S L M Y S - R S L D M G M A G P C P V E F H - C T L R G L W . . . . . . 4275 CATTATGTGGGTTATATATATATGTTTTGGATAATGGTCTGGACATGGTTTGTTTGGGAT H Y V G Y I Y M F W I M V W T W F V W D I M W V I Y I C F G - W S G H G L F G M T L C G L Y I Y V L D N G L D M V C L G . . . . . . 4215 GTTCGCTTGTACAGGGGCAGCCTTGTCAGCTGCGTACATCATTGTGTATTGTGTAGTGGC V R L Y R G S L V S C V H H C V L C S G F A C T G A A L S A A Y I I V Y C V V A C S L V Q G Q P C Q L R T S L C I V - W . . . . . . 4155 AGCCTTATCGGCATACGTATGTTATTATGCTTTGAATAGTGGCGGCCTTGTCGGCTCGCG S L I G I R M L L C F E - W R P C R L A A L S A Y V C Y Y A L N S G G L V G S R Q P Y R H T Y V I M L - I V A A L S A R . . . . . . 4095 TATGTTGTTATGGTTGAATGGTTATGACTCCTTATGAGACAGGTCCTCTTATATATATAT Y V V M V E W L - L L M R Q V L L Y I Y M L L W L N G Y D S L - D R S S Y I Y M V C C Y G - M V M T P Y E T G P L I Y I . . . . 4035 GACGTTGGGGTTGGCTTGATTTGATTAAATT D V G V G L I - L N T L G L A - F D - I - R W G W L D L I K Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-26-_PGL-2_AGS-1_PPS_1 (4350 4117) (frame '1'; 231 bp, 77 residues) 1 SFARLWVWRG PVPSSFTNVL LEVFGHYVGY IYMFWIMVWT WFVWDVRLYR GSLVSCVHHC 61 VLCSGSLIGI RMLLCFE- AGS-2 (5841 5782,5232 5186,4459 4020) SCR (e 0.825 d 0.993 a 0.868,e 0.894 d 0.976 a 0.000,e 0.895) Exon 1 5841 5782 ( 60 n); score: 0.825 Intron 1 5781 5233 ( 549 n); Pd: 0.993 Pa: 0.868 Exon 2 5232 5186 ( 47 n); score: 0.894 Intron 2 5185 4460 ( 726 n); Pd: 0.976 Pa: 0.000 Exon 3 4459 4020 ( 440 n); score: 0.895 PGS (5840 5782,5232 5186,4459 4020) SGN-E305738+ PGS (5841 5782,5232 5186,4459 4037) SGN-E374134- PGS (5840 5782,5232 5186,4459 4037) SGN-E374135+ PGS (5840 5782,5232 5186,4459 4059) SGN-E310669+ 3-phase translation of AGS-2 (-strand): . . . . . . : 5841 CAGCCATGGAAATGGAGAAACCAACCCTGCAACTTTTGGCCAGCAGCTGCAAATAATTTG : Q P W K W R N Q P C N F W P A A A N N L : S H G N G E T N P A T F G Q Q L Q I I - : A M E M E K P T L Q L L A S S C K - F : . . . . . : . 5232 ATTAATAGCTTGAGCAGTAAATATTGGACGTGCGGCTCGATTATACG : GTGTAGACACACA I N S L S S K Y W T C G S I I R : C R H T L I A - A V N I G R A A R L Y : G V D T Q D - - L E Q - I L D V R L D Y T : V - T H . . . . . . 4446 GTTCGGTGATCCTCCCGCCTAGGATATCTACTCTGCTGATTGGGAGAGCTCCACTGTTCC V R - S S R L G Y L L C - L G E L H C S F G D P P A - D I Y S A D W E S S T V P S S V I L P P R I S T L L I G R A P L F . . . . . . 4386 GGAGCCCTGTCGTTTTGGTACATAACTTTTGTGTAGTCTTTTGCTCGTCTATGGGTATGG G A L S F W Y I T F V - S F A R L W V W E P C R F G T - L L C S L L L V Y G Y G R S P V V L V H N F C V V F C S S M G M . . . . . . 4326 CGGGGCCCTGTCCCGTCGAGTTTCACTAATGTACTCTTAGAGGTCTTTGGACATTATGTG R G P V P S S F T N V L L E V F G H Y V G A L S R R V S L M Y S - R S L D I M W A G P C P V E F H - C T L R G L W T L C . . . . . . 4266 GGTTATATATATATGTTTTGGATAATGGTCTGGACATGGTTTGTTTGGGATGTTCGCTTG G Y I Y M F W I M V W T W F V W D V R L V I Y I C F G - W S G H G L F G M F A C G L Y I Y V L D N G L D M V C L G C S L . . . . . . 4206 TACAGGGGCAGCCTTGTCAGCTGCGTACATCATTGTGTATTGTGTAGTGGCAGCCTTATC Y R G S L V S C V H H C V L C S G S L I T G A A L S A A Y I I V Y C V V A A L S V Q G Q P C Q L R T S L C I V - W Q P Y . . . . . . 4146 GGCATACGTATGTTATTATGCTTTGAATAGTGGCGGCCTTGTCGGCTCGCGTATGTTGTT G I R M L L C F E - W R P C R L A Y V V A Y V C Y Y A L N S G G L V G S R M L L R H T Y V I M L - I V A A L S A R V C C . . . . . . 4086 ATGGTTGAATGGTTATGACTCCTTATGAGACAGGTCCTCTTATATATATATGACGTTGGG M V E W L - L L M R Q V L L Y I Y D V G W L N G Y D S L - D R S S Y I Y M T L G Y G - M V M T P Y E T G P L I Y I - R W . 4026 GTTGGCT V G L A G W Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-26-_PGL-2_AGS-2_PPS_1 (4350 4117) (frame '1'; 231 bp, 77 residues) 1 SFARLWVWRG PVPSSFTNVL LEVFGHYVGY IYMFWIMVWT WFVWDVRLYR GSLVSCVHHC 61 VLCSGSLIGI RMLLCFE- AGS-3 (5824 5782,4832 4636,4459 4057) SCR (e 0.814 d 0.993 a 0.932,e 0.904 d 0.000 a 0.000,e 0.900) Exon 1 5824 5782 ( 43 n); score: 0.814 Intron 1 5781 4833 ( 949 n); Pd: 0.993 Pa: 0.932 Exon 2 4832 4636 ( 197 n); score: 0.904 Intron 2 4635 4460 ( 176 n); Pd: 0.000 Pa: 0.000 Exon 3 4459 4057 ( 403 n); score: 0.900 PGS (5824 5782,4832 4636,4459 4057) SGN-E538156+ PGS (5824 5782,4832 4636,4459 4094) SGN-E538151+ PGS (5824 5782,4832 4636,4459 4229) SGN-E268096+ 3-phase translation of AGS-3 (-strand): . . . . . : . 5824 AAACCAACCCTGCAACTTTTGGCCAGCAGCTGCAAATAATTTG : AGTCATTTATCATTTCA K P T L Q L L A S S C K - F : E S F I I S N Q P C N F W P A A A N N L : S H L S F H T N P A T F G Q Q L Q I I - : V I Y H F . . . . . . 4815 CCGAGTCCCAGGTCGGGTAATGTTCGTGCGGAGTTTCTTGCATATGTCACCGAGTTCCTC P S P R S G N V R A E F L A Y V T E F L R V P G R V M F V R S F L H M S P S S S T E S Q V G - C S C G V S C I C H R V P . . . . . . 4755 ACTAGAGGGCCTGGTATGTATATTATATATATATGATTGGTGATGAGGATGGTTATGATG T R G P G M Y I I Y I - L V M R M V M M L E G L V C I L Y I Y D W - - G W L - - H - R A W Y V Y Y I Y M I G D E D G Y D . . . . . . : 4695 ATGATGATGACGGAGATGACGTGATGATTATTTTGCCGAGCCCCATACTAGGGAAGCTGG : M M M T E M T - - L F C R A P Y - G S W : - - - R R - R D D Y F A E P H T R E A G : D D D D G D D V M I I L P S P I L G K L : . . . . . . 4459 GTGTAGACACACAGTTCGGTGATCCTCCCGCCTAGGATATCTACTCTGCTGATTGGGAGA V - T H S S V I L P P R I S T L L I G R C R H T V R - S S R L G Y L L C - L G E G V D T Q F G D P P A - D I Y S A D W E . . . . . . 4399 GCTCCACTGTTCCGGAGCCCTGTCGTTTTGGTACATAACTTTTGTGTAGTCTTTTGCTCG A P L F R S P V V L V H N F C V V F C S L H C S G A L S F W Y I T F V - S F A R S S T V P E P C R F G T - L L C S L L L . . . . . . 4339 TCTATGGGTATGGCGGGGCCCTGTCCCGTCGAGTTTCACTAATGTACTCTTAGAGGTCTT S M G M A G P C P V E F H - C T L R G L L W V W R G P V P S S F T N V L L E V F V Y G Y G G A L S R R V S L M Y S - R S . . . . . . 4279 TGGACATTATGTGGGTTATATATATATGTTTTGGATAATGGTCTGGACATGGTTTGTTTG W T L C G L Y I Y V L D N G L D M V C L G H Y V G Y I Y M F W I M V W T W F V W L D I M W V I Y I C F G - W S G H G L F . . . . . . 4219 GGATGTTCGCTTGTACAGGGGCAGCCTTGTCAGCTGCGTACATCATTGTGTATTGTGTAG G C S L V Q G Q P C Q L R T S L C I V - D V R L Y R G S L V S C V H H C V L C S G M F A C T G A A L S A A Y I I V Y C V . . . . . . 4159 TGGCAGCCTTATCGGCATACGTATGTTATTATGCTTTGAATAGTGGCGGCCTTGTCGGCT W Q P Y R H T Y V I M L - I V A A L S A G S L I G I R M L L C F E - W R P C R L V A A L S A Y V C Y Y A L N S G G L V G . . . . . 4099 CGCGTATGTTGTTATGGTTGAATGGTTATGACTCCTTATGAGA R V C C Y G - M V M T P Y E A Y V V M V E W L - L L M R S R M L L W L N G Y D S L - Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-26-_PGL-2_AGS-3_PPS_1 (4350 4117) (frame '2'; 231 bp, 77 residues) 1 SFARLWVWRG PVPSSFTNVL LEVFGHYVGY IYMFWIMVWT WFVWDVRLYR GSLVSCVHHC 61 VLCSGSLIGI RMLLCFE- AGS-4 (4782 4723,4675 4057) SCR (e 0.783 d 0.000 a 0.000,e 0.902) Exon 1 4782 4723 ( 60 n); score: 0.783 Intron 1 4722 4676 ( 47 n); Pd: 0.000 Pa: 0.000 Exon 2 4675 4057 ( 619 n); score: 0.902 PGS (4782 4723,4675 4057) SGN-E544254- 3-phase translation of AGS-4 (-strand): . . . . . . : 4782 TTTCTTGCATATGTCACCGAGTTCCTCACTAGAGGGCCTGGTATGTATATTATATATATA : F L A Y V T E F L T R G P G M Y I I Y I : F L H M S P S S S L E G L V C I L Y I - : S C I C H R V P H - R A W Y V Y Y I Y : . . . . . . 4675 GTGATGATTATTTTGCCGAGCCCCATACTAGGGAAGCTGGGCACCTTAAATGTTAAATAT V M I I L P S P I L G K L G T L N V K Y - - L F C R A P Y - G S W A P - M L N I S D D Y F A E P H T R E A G H L K C - I . . . . . . 4615 ATGCATGATTTTCACTTAAAAGGGTATATGTGTAGCGATATTTTGTTTCGACTTGCCATA M H D F H L K G Y M C S D I L F R L A I C M I F T - K G I C V A I F C F D L P Y Y A - F S L K R V Y V - R Y F V S T C H . . . . . . 4555 TTGGTATCCTGGCATCTTTACCTTATGCTTTACATACTCAGTACATTGTCCGTACTGACC L V S W H L Y L M L Y I L S T L S V L T W Y P G I F T L C F T Y S V H C P Y - P I G I L A S L P Y A L H T Q Y I V R T D . . . . . . 4495 CCGCTTTCCTCGGGGGGCTGCGTTTCATGCCTGCAGGTGTAGACACACAGTTCGGTGATC P L S S G G C V S C L Q V - T H S S V I R F P R G A A F H A C R C R H T V R - S P A F L G G L R F M P A G V D T Q F G D . . . . . . 4435 CTCCCGCCTAGGATATCTACTCTGCTGATTGGGAGAGCTCCACTGTTCCGGAGCCCTGTC L P P R I S T L L I G R A P L F R S P V S R L G Y L L C - L G E L H C S G A L S P P A - D I Y S A D W E S S T V P E P C . . . . . . 4375 GTTTTGGTACATAACTTTTGTGTAGTCTTTTGCTCGTCTATGGGTATGGCGGGGCCCTGT V L V H N F C V V F C S S M G M A G P C F W Y I T F V - S F A R L W V W R G P V R F G T - L L C S L L L V Y G Y G G A L . . . . . . 4315 CCCGTCGAGTTTCACTAATGTACTCTTAGAGGTCTTTGGACATTATGTGGGTTATATATA P V E F H - C T L R G L W T L C G L Y I P S S F T N V L L E V F G H Y V G Y I Y S R R V S L M Y S - R S L D I M W V I Y . . . . . . 4255 TATGTTTTGGATAATGGTCTGGACATGGTTTGTTTGGGATGTTCGCTTGTACAGGGGCAG Y V L D N G L D M V C L G C S L V Q G Q M F W I M V W T W F V W D V R L Y R G S I C F G - W S G H G L F G M F A C T G A . . . . . . 4195 CCTTGTCAGCTGCGTACATCATTGTGTATTGTGTAGTGGCAGCCTTATCGGCATACGTAT P C Q L R T S L C I V - W Q P Y R H T Y L V S C V H H C V L C S G S L I G I R M A L S A A Y I I V Y C V V A A L S A Y V . . . . . . 4135 GTTATTATGCTTTGAATAGTGGCGGCCTTGTCGGCTCGCGTATGTTGTTATGGTTGAATG V I M L - I V A A L S A R V C C Y G - M L L C F E - W R P C R L A Y V V M V E W C Y Y A L N S G G L V G S R M L L W L N . . 4075 GTTATGACTCCTTATGAGA V M T P Y E L - L L M R G Y D S L - Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-26-_PGL-2_AGS-4_PPS_1 (4782 4723,4675 4454) (frame '1'; 279 bp, 93 residues) 1 FLAYVTEFLT RGPGMYIIYI VMIILPSPIL GKLGTLNVKY MHDFHLKGYM CSDILFRLAI 61 LVSWHLYLML YILSTLSVLT PLSSGGCVSC LQV- >C06HBa0057J04.1-26-_PGL-2_AGS-4_PPS_2 (4350 4117) (frame '2'; 231 bp, 77 residues) 1 SFARLWVWRG PVPSSFTNVL LEVFGHYVGY IYMFWIMVWT WFVWDVRLYR GSLVSCVHHC 61 VLCSGSLIGI RMLLCFE- AGS-5 (5832 5144) SCR (e 0.811) Exon 1 5832 5144 ( 689 n); score: 0.811 PGS (5824 5144) SGN-E328093+ PGS (5832 5386) SGN-E298250+ 3-phase translation of AGS-5 (-strand): . . . . . . 5832 AAATGGAGAAACCAACCCTGCAACTTTTGGCCAGCAGCTGCAAATAATTTGGTTAGTAAT K W R N Q P C N F W P A A A N N L V S N N G E T N P A T F G Q Q L Q I I W L V I M E K P T L Q L L A S S C K - F G - - . . . . . . 5772 CTCCTTGTTTGGTGTGTTAATTCTTTAGAATACCCTTGTTAATTATCCATTAATTTTAAG L L V W C V N S L E Y P C - L S I N F K S L F G V L I L - N T L V N Y P L I L S S P C L V C - F F R I P L L I I H - F - . . . . . . 5712 TAGGGGGCGTGACCAGTAGCTTAGGAAGTTTGTTTTAGTTATTGAATGTGCTAAGCATGA - G A - P V A - E V C F S Y - M C - A - R G R D Q - L R K F V L V I E C A K H E V G G V T S S L G S L F - L L N V L S M . . . . . . 5652 ATGGAAACCATAATTGGATTATTAGTGGTGTCGTGTTGGTGCTTGGGCTGTTTTGATTAA M E T I I G L L V V S C W C L G C F D - W K P - L D Y - W C R V G A W A V L I K N G N H N W I I S G V V L V L G L F - L . . . . . . 5592 AGCAAACTGCAGGAAAATTCTATTTTGGCATTATGTATATGTTGAATGTGATTATGAGTA S K L Q E N S I L A L C I C - M - L - V A N C R K I L F W H Y V Y V E C D Y E Y K Q T A G K F Y F G I M Y M L N V I M S . . . . . . 5532 TATACTCCAAAGGATGAATACGATAAGGTAGATGTGTTGGGAATTATAAAACGAGTTATC Y T P K D E Y D K V D V L G I I K R V I I L Q R M N T I R - M C W E L - N E L S I Y S K G - I R - G R C V G N Y K T S Y . . . . . . 5472 ACTCGGTGTGTCGTTGCTATGGTTGCCGAGACGGAACTATTTTGGAGAGGGGGCTGTTTA T R C V V A M V A E T E L F W R G G C L L G V S L L W L P R R N Y F G E G A V - H S V C R C Y G C R D G T I L E R G L F . . . . . . 5412 ATATGATTCTTTGGGTTATATGTGTTATTGGTATTGCTGTGGATAATTTGGATTGTTGTC I - F F G L Y V L L V L L W I I W I V V Y D S L G Y M C Y W Y C C G - F G L L S N M I L W V I C V I G I A V D N L D C C . . . . . . 5352 GGATTGGGTCGAAGTAAGGATGGGGAGGTGCTGCCGAATTTTTGTTAGATTATTAGCTAG G L G R S K D G E V L P N F C - I I S - D W V E V R M G R C C R I F V R L L A S R I G S K - G W G G A A E F L L D Y - L . . . . . . 5292 CTTACAAGAAAGTAAAGCACGATGTTTATCTAATTGCGGCACGATTGTTGCTTGTTATAG L T R K - S T M F I - L R H D C C L L - L Q E S K A R C L S N C G T I V A C Y R A Y K K V K H D V Y L I A A R L L L V I . . . . . . 5232 ATTAATAGCTTGAGCAGTAAATATTGGACGTGCGGCTCGATTATACGGTATGTAACGCTG I N S L S S K Y W T C G S I I R Y V T L L I A - A V N I G R A A R L Y G M - R C D - - L E Q - I L D V R L D Y T V C N A . . . 5172 TCCCTTCTTTCATTGGTTGGCGTGACTTT S L L S L V G V T P F F H W L A - L V P S F I G W R D F Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-5 (+strand): . . . . . . 5144 AAAGTCACGCCAACCAATGAAAGAAGGGACAGCGTTACATACCGTATAATCGAGCCGCAC K V T P T N E R R D S V T Y R I I E P H K S R Q P M K E G T A L H T V - S S R T S H A N Q - K K G Q R Y I P Y N R A A . . . . . . 5204 GTCCAATATTTACTGCTCAAGCTATTAATCTATAACAAGCAACAATCGTGCCGCAATTAG V Q Y L L L K L L I Y N K Q Q S C R N - S N I Y C S S Y - S I T S N N R A A I R R P I F T A Q A I N L - Q A T I V P Q L . . . . . . 5264 ATAAACATCGTGCTTTACTTTCTTGTAAGCTAGCTAATAATCTAACAAAAATTCGGCAGC I N I V L Y F L V S - L I I - Q K F G S - T S C F T F L - A S - - S N K N S A A D K H R A L L S C K L A N N L T K I R Q . . . . . . 5324 ACCTCCCCATCCTTACTTCGACCCAATCCGACAACAATCCAAATTATCCACAGCAATACC T S P S L L R P N P T T I Q I I H S N T P P H P Y F D P I R Q Q S K L S T A I P H L P I L T S T Q S D N N P N Y P Q Q Y . . . . . . 5384 AATAACACATATAACCCAAAGAATCATATTAAACAGCCCCCTCTCCAAAATAGTTCCGTC N N T Y N P K N H I K Q P P L Q N S S V I T H I T Q R I I L N S P L S K I V P S Q - H I - P K E S Y - T A P S P K - F R . . . . . . 5444 TCGGCAACCATAGCAACGACACACCGAGTGATAACTCGTTTTATAATTCCCAACACATCT S A T I A T T H R V I T R F I I P N T S R Q P - Q R H T E - - L V L - F P T H L L G N H S N D T P S D N S F Y N S Q H I . . . . . . 5504 ACCTTATCGTATTCATCCTTTGGAGTATATACTCATAATCACATTCAACATATACATAAT T L S Y S S F G V Y T H N H I Q H I H N P Y R I H P L E Y I L I I T F N I Y I M Y L I V F I L W S I Y S - S H S T Y T - . . . . . . 5564 GCCAAAATAGAATTTTCCTGCAGTTTGCTTTAATCAAAACAGCCCAAGCACCAACACGAC A K I E F S C S L L - S K Q P K H Q H D P K - N F P A V C F N Q N S P S T N T T C Q N R I F L Q F A L I K T A Q A P T R . . . . . . 5624 ACCACTAATAATCCAATTATGGTTTCCATTCATGCTTAGCACATTCAATAACTAAAACAA T T N N P I M V S I H A - H I Q - L K Q P L I I Q L W F P F M L S T F N N - N K H H - - S N Y G F H S C L A H S I T K T . . . . . . 5684 ACTTCCTAAGCTACTGGTCACGCCCCCTACTTAAAATTAATGGATAATTAACAAGGGTAT T S - A T G H A P Y L K L M D N - Q G Y L P K L L V T P P T - N - W I I N K G I N F L S Y W S R P L L K I N G - L T R V . . . . . . 5744 TCTAAAGAATTAACACACCAAACAAGGAGATTACTAACCAAATTATTTGCAGCTGCTGGC S K E L T H Q T R R L L T K L F A A A G L K N - H T K Q G D Y - P N Y L Q L L A F - R I N T P N K E I T N Q I I C S C W . . . 5804 CAAAAGTTGCAGGGTTGGTTTCTCCATTT Q K L Q G W F L H K S C R V G F S I P K V A G L V S P F Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0057J04.1-26+_PGL-2_AGS-5_PPS_1 (5309 5596) (frame '1'; 285 bp, 95 residues) 1 QKFGSTSPSL LRPNPTTIQI IHSNTNNTYN PKNHIKQPPL QNSSVSATIA TTHRVITRFI 61 IPNTSTLSYS SFGVYTHNHI QHIHNAKIEF SCSLL- ... finished at: Mon Jul 24 23:18:07 2006 ________________________________________________________________________________ Sequence 27: C06HBa0057J04.1-27, from 1 to 790, both strands analyzed. ... started at: Mon Jul 24 23:18:07 2006 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 4 HitsTableSize = 1 EST library file: /tmp/cxgn-bacpublish-resources-hRSnbf/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 3 ******************************************************************************** EST sequence 1 -strand 843 n (File: SGN-E544254-) 1 GAGTCATTTA TCATTTCACC GAGTCCCGGG CCGGGTAATG TTCGTGCGGA GTTTCTTGCA 61 TATGTCACCG AGTCCCTCAC TAGAGGGCCG GGAATGTATA TTATATATAT GATTGGTGAT 121 GAGGATGGTT ATGATGATGA TGATGACGGA GATGATGTGA TGACTATTTC ACCGAGTCCC 181 TCACTAGAGG GCCGGGTACT ATGATGTATA TATAATGATG ATTATTTTGC CGAGTCCCTT 241 ACTAGGGAAG TTAGGCATCT TATATGTTAA AGATATGCAT GATTTTCACT TAAAAAGTAC 301 ATGTGTAGAG ATATCTTGTT TCGACTTATC ATGTTGGTAT CCTGTCATCT TTACCTTATG 361 CTTTACATAC TCAGTACATT GTCCGTACTG ACCCCCTTTT CTCGGGGGGC TGCGTTTCAT 421 GCCCGCAGGT GTAGACGCTC AGTTCGGTGA TCCTCCCGCC TAGGATATCT ACTCTGCTTT 481 TTGGGAGAGC TCCACTGTTC CGGAGCCCAG TCGTTTTGGT ACATAACTTC TTATGTAGTC 541 TTTTGCTTGT CTATGGGTAT GGCGGGGCCC TGTCCCGTCA AGTTTCACTA CTATACTCTT 601 AGAGGTCTGT AGACATCGTG TGGGTTGTAT AATTATGTTT TGGATAATGG TCTGGACATG 661 GTTTGTTTGG GATGTCCATT TGTACAAGTG CAGCCTTGTC GGTTGTGAAC ATCATTGTGT 721 ATTGTGTAGT GGCAGCCTCG TCGGCTGCGT ATGCTATTAT GTTTTGGATA GTGGCGGCCT 781 TGTCGGCTCG CATATGTTGT TACGATTTAA TGGTTATGAC TCTTTATGAG AAAAAAAAAG 841 AAA Predicted gene structure (within gDNA segment 790 to 1): Exon 1 183 3 ( 181 n); cDNA 1 181 ( 181 n); score: 0.917 PPA cDNA 829 839 MATCH C06HBa0057J04.1-27- SGN-E544254- 0.917 181 0.229 G PGS_C06HBa0057J04.1-27-_SGN-E544254- (183 3) Alignment (genomic DNA sequence = upper lines): GACTAATTTA TCATTTCACC GAGTTTCGGG TCGGGTAATG TTCGTGCGGA GTTTCTTGCA 124 || | ||||| |||||||||| |||| |||| ||||||||| |||||||||| |||||||||| GAGTCATTTA TCATTTCACC GAGTCCCGGG CCGGGTAATG TTCGTGCGGA GTTTCTTGCA 60 TTTGTCACCG AGTCACTCAC TAGAGGGTCG GGTATGTATA TTATACATAT TATTGGTGAT 64 | |||||||| |||| ||||| ||||||| || || ||||||| ||||| |||| ||||||||| TATGTCACCG AGTCCCTCAC TAGAGGGCCG GGAATGTATA TTATATATAT GATTGGTGAT 120 GAGGATGGTT ATGATGATGA TGATGACGGA GATGATGTGA TGATTATTTT GCCGAGCCCC 4 |||||||||| |||||||||| |||||||||| |||||||||| ||| ||||| ||||| ||| GAGGATGGTT ATGATGATGA TGATGACGGA GATGATGTGA TGACTATTTC ACCGAGTCCC 180 T 3 | T 181 hqPGS_C06HBa0057J04.1-27-_SGN-E544254- (183 3) ******************************************************************************** EST sequence 2 +strand 606 n (File: SGN-E538151+) 1 GAGAAAACCA GCCATTAAAC TCTTGGACAG CAGCTGCAAA GAAATTGAGT CATTTATCAT 61 TTCACCGAGT CCCGGGCCGG GTAATGTTCG TGCGGAGTTT CTTGCATATG TCACCGAGTC 121 CCTCACTAGA GGGCCGGGAA TGTATATTAT ATATATGATT GGTGATGAGG ATGGTTATGA 181 TGATGATGAT GACGGAGATG ATGTGATGAC TATTTCACTG AGTCCCTCAC TAGAGGGCCG 241 GGTGTAGACG CTCAGTTTGG TGATCCTCCC GCCTAGGATA TCTACTCTGC TGTTTGGGAG 301 AGCTCCACTG TTCCGGAGCC CAGTCGTTTT GGTACATAAC TTCTTATGTA GTCTTTTGCT 361 TGTCTATGGG TATGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC TTAGAGGTCT 421 GTAGACATCG TGTGGGTTGT ATAATTATGT TTTTGATAAT GGTCTGGACA TGGTTTGTTT 481 GGGATGTCCA CTTGTACAAG TGCAACCTTG TCGGTTGTGT ACATCTTTGT GTATTGTGTA 541 GTGGCAGCCT TGTCGGCTGC GTATGCTATT ATGCTTTGAA TAGTGGCAGC CTTGTCGGCT 601 CGCGTA Predicted gene structure (within gDNA segment 790 to 1): Exon 1 178 3 ( 176 n); cDNA 52 227 ( 176 n); score: 0.920 MATCH C06HBa0057J04.1-27- SGN-E538151+ 0.920 176 0.290 C PGS_C06HBa0057J04.1-27-_SGN-E538151+ (178 3) Alignment (genomic DNA sequence = upper lines): ATTTATCATT TCACCGAGTT TCGGGTCGGG TAATGTTCGT GCGGAGTTTC TTGCATTTGT 119 |||||||||| ||||||||| |||| |||| |||||||||| |||||||||| |||||| ||| ATTTATCATT TCACCGAGTC CCGGGCCGGG TAATGTTCGT GCGGAGTTTC TTGCATATGT 111 CACCGAGTCA CTCACTAGAG GGTCGGGTAT GTATATTATA CATATTATTG GTGATGAGGA 59 ||||||||| |||||||||| || |||| || |||||||||| |||| |||| |||||||||| CACCGAGTCC CTCACTAGAG GGCCGGGAAT GTATATTATA TATATGATTG GTGATGAGGA 171 TGGTTATGAT GATGATGATG ACGGAGATGA TGTGATGATT ATTTTGCCGA GCCCCT 3 |||||||||| |||||||||| |||||||||| |||||||| | |||| | || | |||| TGGTTATGAT GATGATGATG ACGGAGATGA TGTGATGACT ATTTCACTGA GTCCCT 227 hqPGS_C06HBa0057J04.1-27-_SGN-E538151+ (178 3) ******************************************************************************** EST sequence 3 +strand 644 n (File: SGN-E538156+) 1 GAGAAAACCA GCCATTAAAC TCTTGGACAG CAGCTGCAAA GAAATTGAGT CATTTATCAT 61 TTCACCGAGT CCCGGGCCGG GTAATGTTCG TGCGGAGTTT CTTGCATATG TCACCGAGTC 121 CCTCACTAGA GGGCCGGGAA TGTATATTAT ATATATGATT GGTGATGAGG ATGGTTATGA 181 TGATGATGAT GACGGAGATG ATGTGATGAC TATTTCACTG AGTCCCTCAC TAGAGGGCCG 241 GGTGTAGACG CTCAGTTTGG TGATCCTCCC GCCTAGGATA TCTACTCTGC TGTTTGGGAG 301 AGCTCCACTG TTCCGGAGCC CAGTCGTTGT GGTACATAAC TTCTTATGTA GTCTTTTGCT 361 TGTCTATGGG TATGCGGGGC CCTGTCCCGT CAAGTTTCAC TACTATACTC TTAGAGGTCT 421 GTAGACATCG TGTGGGTTGT ATAATTATGT TTTTGATAAT GGTCTGGACA TGGTTTGTTT 481 GGGATGTCCA CTTGTACAAG TGCAACCTTG TCGGTTGTGT ACATCTTTGT GTATTGTGTA 541 GTGGCAGCCT TGACGGCTGC GTATGCTATT ATGCTTTGAA TAGTGGCAGC CTTGTCGGCT 601 CGCGTATGTT GTTACGGTTG AATGGGTATG ACTCTTTATG AGAT Predicted gene structure (within gDNA segment 790 to 1): Exon 1 178 3 ( 176 n); cDNA 52 227 ( 176 n); score: 0.920 MATCH C06HBa0057J04.1-27- SGN-E538156+ 0.920 176 0.273 C PGS_C06HBa0057J04.1-27-_SGN-E538156+ (178 3) Alignment (genomic DNA sequence = upper lines): ATTTATCATT TCACCGAGTT TCGGGTCGGG TAATGTTCGT GCGGAGTTTC TTGCATTTGT 119 |||||||||| ||||||||| |||| |||| |||||||||| |||||||||| |||||| ||| ATTTATCATT TCACCGAGTC CCGGGCCGGG TAATGTTCGT GCGGAGTTTC TTGCATATGT 111 CACCGAGTCA CTCACTAGAG GGTCGGGTAT GTATATTATA CATATTATTG GTGATGAGGA 59 ||||||||| |||||||||| || |||| || |||||||||| |||| |||| |||||||||| CACCGAGTCC CTCACTAGAG GGCCGGGAAT GTATATTATA TATATGATTG GTGATGAGGA 171 TGGTTATGAT GATGATGATG ACGGAGATGA TGTGATGATT ATTTTGCCGA GCCCCT 3 |||||||||| |||||||||| |||||||||| |||||||| | |||| | || | |||| TGGTTATGAT GATGATGATG ACGGAGATGA TGTGATGACT ATTTCACTGA GTCCCT 227 hqPGS_C06HBa0057J04.1-27-_SGN-E538156+ (178 3) ******************************************************************************** EST sequence 4 +strand 470 n (File: SGN-E268096+) 1 GAAAACCAGC CATTAAACTC TTGGACAGCA GCTGCAAAGA AATTGAGTCA TTTATCATTG 61 CACCGAGTCC CGGGCCGGGT AATGTTCGTG CGGAGTTTCT TGCATATGTC ACCGAGTCCC 121 TCACTAGAGG GCCGGGAATG TATATTATAT ATATGATTGG TGATGAGGAT GGTTATGATG 181 ATGATGATGA CGGAGATGAT GTGATGACTA TTTCACTGAG TCCCTCACTA GAGGGCCGGG 241 TGTAGACGCT CAGTTTGGTG ATCCTCCCGC CTAGGATATC TACTCTGCTG TTTGGGAGAG 301 CTCCACTGTT CCGGAGCCCA GTCGTTTTGG TACATAACTT CTTATGTAGT CTTTTGCTTG 361 TCTATGGGTA TGCGGGGCCC TGTCCCGTCA AGTTTCACTA CTATACTCTT AGAGGTCTGT 421 AGACATCGTG TGGGTAGTAT AATTATGTTT TTGATAATGG GCTGGACATG Predicted gene structure (within gDNA segment 790 to 1): Exon 1 178 3 ( 176 n); cDNA 50 225 ( 176 n); score: 0.915 MATCH C06HBa0057J04.1-27- SGN-E268096+ 0.915 176 0.374 C PGS_C06HBa0057J04.1-27-_SGN-E268096+ (178 3) Alignment (genomic DNA sequence = upper lines): ATTTATCATT TCACCGAGTT TCGGGTCGGG TAATGTTCGT GCGGAGTTTC TTGCATTTGT 119 |||||||||| |||||||| |||| |||| |||||||||| |||||||||| |||||| ||| ATTTATCATT GCACCGAGTC CCGGGCCGGG TAATGTTCGT GCGGAGTTTC TTGCATATGT 109 CACCGAGTCA CTCACTAGAG GGTCGGGTAT GTATATTATA CATATTATTG GTGATGAGGA 59 ||||||||| |||||||||| || |||| || |||||||||| |||| |||| |||||||||| CACCGAGTCC CTCACTAGAG GGCCGGGAAT GTATATTATA TATATGATTG GTGATGAGGA 169 TGGTTATGAT GATGATGATG ACGGAGATGA TGTGATGATT ATTTTGCCGA GCCCCT 3 |||||||||| |||||||||| |||||||||| |||||||| | |||| | || | |||| TGGTTATGAT GATGATGATG ACGGAGATGA TGTGATGACT ATTTCACTGA GTCCCT 225 hqPGS_C06HBa0057J04.1-27-_SGN-E268096+ (178 3) Total number of EST alignments reported: 4 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 790: PGL 1 (- strand): 183 3 AGS-1 (183 3) SCR (e 0.920) Exon 1 183 3 ( 181 n); score: 0.920 PGS (183 3) SGN-E544254- PGS (178 3) SGN-E538151+ PGS (178 3) SGN-E538156+ PGS (178 3) SGN-E268096+ 3-phase translation of AGS-1 (-strand): . . . . . . 183 GACTAATTTATCATTTCACCGAGTTTCGGGTCGGGTAATGTTCGTGCGGAGTTTCTTGCA D - F I I S P S F G S G N V R A E F L A T N L S F H R V S G R V M F V R S F L H L I Y H F T E F R V G - C S C G V S C . . . . . . 123 TTTGTCACCGAGTCACTCACTAGAGGGTCGGGTATGTATATTATACATATTATTGGTGAT F V T E S L T R G S G M Y I I H I I G D L S P S H S L E G R V C I L Y I L L V M I C H R V T H - R V G Y V Y Y T Y Y W - . . . . . . 63 GAGGATGGTTATGATGATGATGATGACGGAGATGATGTGATGATTATTTTGCCGAGCCCC E D G Y D D D D D G D D V M I I L P S P R M V M M M M M T E M M - - L F C R A P - G W L - - - - - R R - C D D Y F A E P . 3 T Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-1 (+strand): . . . . . . 3 AGGGGCTCGGCAAAATAATCATCACATCATCTCCGTCATCATCATCATCATAACCATCCT R G S A K - S S H H L R H H H H H N H P G A R Q N N H H I I S V I I I I I T I L G L G K I I I T S S P S S S S S - P S . . . . . . 63 CATCACCAATAATATGTATAATATACATACCCGACCCTCTAGTGAGTGACTCGGTGACAA H H Q - Y V - Y T Y P T L - - V T R - Q I T N N M Y N I H T R P S S E - L G D K S S P I I C I I Y I P D P L V S D S V T . . . . . . 123 ATGCAAGAAACTCCGCACGAACATTACCCGACCCGAAACTCGGTGAAATGATAAATTAGT M Q E T P H E H Y P T R N S V K - - I S C K K L R T N I T R P E T R - N D K L V N A R N S A R T L P D P K L G E M I N - . 183 C Maximal non-overlapping open reading frames (>= 64 codons): none ... finished at: Mon Jul 24 23:18:17 2006