GeneSeqer. Version of March 12, 2006. Date run: Mon Aug 28 21:46:41 2006 (Bayesian) Splice site model (species): Arabidopsis thaliana Fast search parameters: MinMatchLen 16, MinQualityHSP 30, MinQualityCHAIN 50. Total number of ESTs: 175665 Total sequence length: 93213537 Minimum sequence length: 89 Maximum sequence length: 1082 Length distribution (number of sequences of specified length): < 100: 1 < 200: 2188 < 300: 8544 < 400: 20465 < 500: 39499 < 600: 49432 < 700: 32872 < 800: 19308 < 900: 3155 < 1000: 193 >=1000: 8 Input file : /tmp/bac-submission-temp-lmqMg/C06HBa0153O03/C06HBa0153O03.seq.screen ________________________________________________________________________________ Sequence 1: C06HBa0153O03.1-1, from 1 to 28541, both strands analyzed. ... started at: Mon Aug 28 22:11:11 2006 EST library file: /tmp/cxgn-bacpublish-resources-T4dD4R/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 4 HitsTableSize = 103 EST library file: /tmp/cxgn-bacpublish-resources-T4dD4R/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 7 ... matches indexed, elapsed seconds = 7 HitsTableSize = 107 ******************************************************************************** EST sequence 177 +strand 577 n (File: SGN-E353447+) 1 AGAACTAGTC TTGAGTTTTT TTTTAAAATT TGCGAGTATT AACATGAAAC TTTTCGAGAT 61 AATTGAGATG GGGGACTGTG AAGTCAATAA GGTTGTCACA ATTTATCAAA GTTAACTCAA 121 CAAGGTAAGG ATATTTGGGT ATAAATTGGA GAAGGTACGC GAAAAGATGC AAACAAAATG 181 AAAATATAAC TCTTTCAATG AAGGCATTAC ATTAAATTCT CGAATAACAT TCGTGTTGCA 241 ATTTCCTTTT ATATGGAGAA CGGTTGAAAA TCTGGAATCT AAAATTTTAG GAGATGCACA 301 CTACAATAAA AATTACCATT AGCGGCATTT AATTCTTAAT TGTCGCTAAA GATGTATTTT 361 TAGCGGCAAT TGGCACTATT TGTATATGTC ATTAAAGCCT TTAGCGACAT TGGTTCTAAT 421 GTCACTTAAC TAATGCCGTT AAAGACTTTA GCACTCTTTA TTAGTGTCTA TATTTAATAC 481 CGCTAAAAGT TGTTTTTGTT ATAGTGGCAC AATACCTAAA ATTAGGTGGT ATGAACAAAT 541 AGAGTTCTTT GAAACTGCTG TTGTCCGCAG CAAGTTT Predicted gene structure (within gDNA segment 5710 to 1): Exon 1 2044 2037 ( 8 n); cDNA 294 301 ( 8 n); score: 0.750 Intron 1 2036 1453 ( 584 n); Pd: 0.478 (s: 0), Pa: 0.000 (s: 0.84) Exon 2 1452 1249 ( 204 n); cDNA 302 505 ( 204 n); score: 0.882 MATCH C06HBa0153O03.1-1- SGN-E353447+ 0.882 212 0.367 C PGS_C06HBa0153O03.1-1-_SGN-E353447+ (2044 2037,1452 1249) Alignment (genomic DNA sequence = upper lines): ATGATCACGT AAGTTTAGCC AGTGGATCAC TAGGTTGATG ATATCCTATG CGACGACAAA 1985 ||| ||| ATGCACAC.. .......... .......... .......... .......... .......... 301 TTATAGGACA GTTTTGGCAG CGTGTACACG ACACTGTATT ATCACTTAGG CTCATAGTGA 1925 .......... .......... .......... .......... .......... .......... 301 TGGCTGTCAG TTAGAGAAAC TCCAGCAGAA GCTATATTAC TTTCATATAT AAGTAAAGTT 1865 .......... .......... .......... .......... .......... .......... 301 GAGTTTATTA CATGTGTCCT TATTGCTTTA TATTGAGTTG TTATCTTATG AGTTGAGTAG 1805 .......... .......... .......... .......... .......... .......... 301 AGCCAAGATA AGTTCACCCT TACTCCATTT CAAGCGTTAT AGTTGTGCTT AGCATTCCAA 1745 .......... .......... .......... .......... .......... .......... 301 CTCGTATACT TGTACATTCA ATGTACTGAA GACAGTTGGC CTGCATCATC TTGAGATGCA 1685 .......... .......... .......... .......... .......... .......... 301 GACACAGGTA ACCAGGATCA GCACGCAGCA CACCGTTGAT CCATTTGAAC ATTCTGTAGT 1625 .......... .......... .......... .......... .......... .......... 301 CATTTGGTGA GCCTCTTTGC ATTCCGGAGG ACATCCCTTT ATTTACTTTC CTAGTTTAGT 1565 .......... .......... .......... .......... .......... .......... 301 TATTAGGATG TTGTGGGGTC TGTTCCAACA TCCATCTTAG TCAGTTTAGA GGCTTAATAG 1505 .......... .......... .......... .......... .......... .......... 301 ACAATGTAGC AGTTCAGTTT TGGAGTCTCC TTTATCTTAT ACTTCGTATC ACTACAACAA 1445 ||||| || .......... .......... .......... .......... .......... ..TACAATAA 309 AAATGTCCAT TTGCGACATT TAATTCTTAA TTGCCGCTAA GTATGTATTT TTAGAGGCAA 1385 |||| |||| | ||| |||| |||||||||| ||| |||||| |||||||| |||| ||||| AAATTACCAT TAGCGGCATT TAATTCTTAA TTGTCGCTAA AGATGTATTT TTAGCGGCAA 369 TTGTCACTAT TTGTATATGT CCCTATTGCC TTTAGAGACA TTGGTTCTAA TGACACTTAA 1325 ||| |||||| |||||||||| | || ||| ||||| |||| |||||||||| || ||||||| TTGGCACTAT TTGTATATGT CATTAAAGCC TTTAGCGACA TTGGTTCTAA TGTCACTTAA 429 CTAATGCCGG TAAATACTTT AGAACTCTTT ATTAGTGTCA ATATTTAATG CCACTAAAAG 1265 ||||||||| |||| ||||| || ||||||| ||||||||| ||||||||| || ||||||| CTAATGCCGT TAAAGACTTT AGCACTCTTT ATTAGTGTCT ATATTTAATA CCGCTAAAAG 489 TTATTTTTGT TGTAGT 1249 || ||||||| | |||| TTGTTTTTGT TATAGT 505 hqPGS_C06HBa0153O03.1-1-_SGN-E353447+ (1452 1249) ******************************************************************************** EST sequence 4 +strand 718 n (File: SGN-E578113+) 1 TTTACATGGA GACTTGAGTT AATATGAAAT CTCATCCCCA CATAGGTGCT CAACACTACT 61 CTCAAAAAAT ACTTTGGCTC ATGCTTTAAC AAAACTTCCT TCCTTTGTGT TGGGATAATT 121 TACTGAACCC TTTAGGTTTG AAAAGCTCCT TTTAGGGAAT AATTAGTTCC CTTATAGATT 181 TTGAGAAATG AACTCAACTC TTACTCTTTA CTTAACTTAA AACTTCTGAT CTTGATCAAC 241 TTTTTTTTTT TTTTTTCTGC AATTGAAGTG ATCAAGTCAT ATATTTATAA ACATGTTATC 301 TTTAGAAGCC ATTGTACAAT GAGGATAACA GAAGAGGTAG GGATAATAAT AAAAGAATCA 361 CTTCGAGCAA ATTCTAGTAT ATATATAGAG TGAGACCGAT TTAGACAGTG ATAACTGTAG 421 CTTCCCCTGT CCTCATCGAG AAGATCAACC ATCATATGGT TTGGTTGTTT TGTTAGATTT 481 CCTCTTTAAC ACAACAATCC AGGTAACCAC TTCCAATAAT AGGGCGATTA CTCCTAACAC 541 AACTAGTATG CCAATATATG CAGACCTCCA CTTTGTCGCT GGATCTAATA TGTCGAGACC 601 TTTGAATACA TTAACCAAGC TGAGGATAAG CATTGTATAG CCAACTCCGT GATGGTAGAT 661 ATTCCAGTAA AATCTGTACT TATGATCCTT CTTCGGCCTC AACAACAAGG CAAAAACC Predicted gene structure (within gDNA segment 730 to 8001): Exon 1 2086 2240 ( 155 n); cDNA 1 155 ( 155 n); score: 0.861 Intron 1 2241 2327 ( 87 n); Pd: 0.000 (s: 0.86), Pa: 0.717 (s: 0.86) Exon 2 2328 2437 ( 110 n); cDNA 156 265 ( 110 n); score: 0.750 Intron 2 2438 4068 (1631 n); Pd: 0.000 (s: 0.58), Pa: 0.673 (s: 0) Exon 3 4069 4100 ( 32 n); cDNA 266 297 ( 32 n); score: 0.594 Intron 3 4101 5322 (1222 n); Pd: 0.000 (s: 0), Pa: 0.995 (s: 0) Exon 4 5323 5332 ( 10 n); cDNA 298 307 ( 10 n); score: 0.800 MATCH C06HBa0153O03.1-1+ SGN-E578113+ 0.815 307 0.428 C PGS_C06HBa0153O03.1-1+_SGN-E578113+ (2086 2240,2328 2437,4069 4100,5323 5332) Alignment (genomic DNA sequence = upper lines): TTTACATGGG GACTTGAGTT -ATCTGAACT CTCATTCCTA CATCGGTGCT CAATACTACT 2144 ||||||||| |||||||||| || |||| | ||||| || | ||| |||||| ||| |||||| TTTACATGGA GACTTGAGTT AATATGAAAT CTCATCCCCA CATAGGTGCT CAACACTACT 60 CCCAAAACAT ACTTTAGCTC ATACTTTTAA CAAAACTTCC TTCCTTTGGG TTGAGATAAT 2204 | ||||| || ||||| |||| || | ||||| |||||||||| |||||||| | ||| |||||| CTCAAAAAAT ACTTTGGCTC ATGC-TTTAA CAAAACTTCC TTCCTTTGTG TTGGGATAAT 119 TTACTGAACC CTTTAGCTTT ACAAATCTCC TTTTGGAATC AATGTTCCCC TTTTAGTTCA 2264 |||||||||| |||||| ||| ||| |||| |||| | TTACTGAACC CTTTAGGTTT GAAAAGCTCC TTTTAG.... .......... .......... 155 AAACTCTTTT GAAACTTCTT AGTTTTCTTT TAACTTAAAT GTGAAACATT TATCAACCTT 2324 .......... .......... .......... .......... .......... .......... 155 TAGGGAGTAC TTAGTCCCCC TTATATCTTT AGAGAAATGA ACTCAACTCT TACTCTTTAC 2384 ||| || ||||| ||| ||||| ||| ||||||||| |||||||||| |||||||||| ...GGAATAA TTAGT-TCCC TTATAGATTT TGAGAAATGA ACTCAACTCT TACTCTTTAC 211 TTAACTTTAA ACTTTAACTC TT-AGGAAAT ACTTAGTTTT CTTATATACC ATTTTAAGAA 2443 ||||||| || |||| || || | || | || |||| || | | | ||| TTAACTTAAA ACTTCTGATC TTGATCAACT TTTTTTTTTT TTTTTCTGCA ATTG...... 265 AAGAATTCAA CCTTTACTCT TTTCCTTAAC TCGAACATGA GCCTTAAAAC GAAATTATAA 2503 .......... .......... .......... .......... .......... .......... 265 CATTTAAGTA AGATTCTAAG AACTTTGAAA TAAAAGCTTT ACTTTTACTC CCTTCATTGC 2563 .......... .......... .......... .......... .......... .......... 265 TTACTTTCTT GACTTTGATC TTAGTTTCTC TTGAATTGGA TTATGAATTC AAGGATCATA 2623 .......... .......... .......... .......... .......... .......... 265 ATCTCATGTT TATGGATGAT TTCATGATGT TTATATTTAT TCTAGAGTGT TGGTCAACTA 2683 .......... .......... .......... .......... .......... .......... 265 GGAATGAGAG GTACATCACT TAGGAACTAG TACGAAAATA TAGGGAAAGA AAAGGGTCTC 2743 .......... .......... .......... .......... .......... .......... 265 CGAGGGTATT GGCGCTCTAA GAGGCGCGGC GCCCCAAGGG TTGTCTGACT GGACCTCCGA 2803 .......... .......... .......... .......... .......... .......... 265 CTTTTCTTCT CCTCTTTCAT ATCTAAACCT ACCAAACTCC CATGGATCCC ACCTCAAAAA 2863 .......... .......... .......... .......... .......... .......... 265 CTTAGGATCC CTAAAAAACT CATTACCCAA TATTTAGACC CGAAAACATG AATCATAACT 2923 .......... .......... .......... .......... .......... .......... 265 ACTAGAGATC AACTATAATT CAACCAACAA GAACTTCATA ACTTTCATCA AGAACATCAA 2983 .......... .......... .......... .......... .......... .......... 265 GATTTTTATT AAATATAAAC TCTAATTTAT TGTATTGAAT AAATTTGGAG TGTGGATGAA 3043 .......... .......... .......... .......... .......... .......... 265 AGGATCCAAC ACCGAATGAT CTCACATACC TCGATAGGGA TAACCCTTGA TAAAATCTAC 3103 .......... .......... .......... .......... .......... .......... 265 GCAACAATGT TGGAGTTCTT GGACAATTCT TGATCCCTTT TATTTTCTCT TTTCTCCCAA 3163 .......... .......... .......... .......... .......... .......... 265 GACCTAAGCG TTCTAAATTA TTGTAAAAAT GATTTGAGAC TTGTTCTTTA CCCCTAAATA 3223 .......... .......... .......... .......... .......... .......... 265 ACCCTTAAAA GGAATTAGGC AATACTTGTA ACGACCTGTT TAGTCGTTTC GAGCAGCAGA 3283 .......... .......... .......... .......... .......... .......... 265 TTTTATTTCT TGAAAAACAG GCAAAGACGA CGGAACTCAC GACCGACCGT CATGGGCACG 3343 .......... .......... .......... .......... .......... .......... 265 ACGGACCGTC GAGGATATCT CATTCCAAAA TACTTAGAAT TCTCAAAATT GGGTACTGAA 3403 .......... .......... .......... .......... .......... .......... 265 ATCGACTTTC TGAACTTCGT AACGGAATGG CAGGACGAAC CGTTACAGAC TCTTCAGAGA 3463 .......... .......... .......... .......... .......... .......... 265 AATTGAGTCT CGGAACTCTG TGACGGAGCA GCAGAATGGA CCGTCGCAGG CGCGACGGGC 3523 .......... .......... .......... .......... .......... .......... 265 CGTCACAGGC TGCGTAATCC CAGGCTGGGT CAGATTTCTG TAATTATTTT AAGGGGCGTT 3583 .......... .......... .......... .......... .......... .......... 265 TTGGACTATT CCTGCTTTAA TTATAAAGTT AGTGGGTTAA TTTTAATAAG TCTAATTACT 3643 .......... .......... .......... .......... .......... .......... 265 TGGGGGTTAA AAGAGGTAAC CTTAAGTTAA TTAGTGAGTT ATTATTGCCA TCTTTTGTTC 3703 .......... .......... .......... .......... .......... .......... 265 TTAATTATAT GCTAATTAGG GTAAAAGAAA GAGGGTTTGA ATAAGAAAAG TAGAAAGAAC 3763 .......... .......... .......... .......... .......... .......... 265 AAAGAGAGAG AGAGGATCGA ACGAGGAAGA GAGAAAACGA AGAACAAAGC TTGGGGAAAT 3823 .......... .......... .......... .......... .......... .......... 265 TTCTTGCTTG ATCACTAATC TTCGGTGGAG GTAGGTTATG GTTTCTCTTA CGATATTCGT 3883 .......... .......... .......... .......... .......... .......... 265 AGTAAACTCT TAATAGCGAA TGATATATAT TGATAATATT GTAAACCCTG CTATGTGCTT 3943 .......... .......... .......... .......... .......... .......... 265 AATTGTATGC TTGCATGAAT GTAATTTTAT AATTGTGGTT ATATAAGCAT GATGAAGTTA 4003 .......... .......... .......... .......... .......... .......... 265 TTGAATCCCA AATCTTGCAA AACCCTAATC TCTTTGTTAA TGATGGGGCC TTGGTATAAA 4063 .......... .......... .......... .......... .......... .......... 265 AGAAGGCGTG ATGAACTAAA ATAATGAGAT TGATGATGCC TTGGTAAGAA AGAAGGCTTG 4123 ||| || || | | ||| | | | ||| | .....AAGTG ATCAAGTCAT ATATTTATAA ACATGTT... .......... .......... 297 ATGAATTGAT AGAAGGAGAT TAGGGGATAG GGTGTCACGA ACCGACACGT AGATTTAGGG 4183 .......... .......... .......... .......... .......... .......... 297 ATCGGGTGTC ACGAACCGAC ACGTAGATTT AGGGGATCGG GTGTCACGAA CTGACACGTA 4243 .......... .......... .......... .......... .......... .......... 297 GATTTAGGGG ATCTGGTGTC ACGAACCGAC ACTTAGATTT AGGAGATCGA GTGTCACGAA 4303 .......... .......... .......... .......... .......... .......... 297 CCGACACGTA GATTTAGAGG ATCGAAGTGT CACGTTCCGA CACATAGTAG TGGGGGATCG 4363 .......... .......... .......... .......... .......... .......... 297 AAGTGTCACG TACCGACACA AGAGGATTAA TGAATATGAG GGAGCGGAGT GTCACGTACC 4423 .......... .......... .......... .......... .......... .......... 297 GACACAAGAG AAATAAAGAT AATGAATCTT GAAAGATGAT AATATACTCA ATCCAATGAA 4483 .......... .......... .......... .......... .......... .......... 297 CATAATTCCC AAATGAGTAC GGTATTGAGG CTTGAGTCCT CGTGTGTGAA CTTGACGGTA 4543 .......... .......... .......... .......... .......... .......... 297 ATTGTTAATG ATATAGTATT TGATGTTGCT ACATGTTGAG TATCATAGTT GATTTTATGA 4603 .......... .......... .......... .......... .......... .......... 297 TATTACTTGG TATATATTGA TCTCTATTTT GAGTTGGCCG ATGATATCTA CTCAGTACCC 4663 .......... .......... .......... .......... .......... .......... 297 GTGTTTGTAC TGACCCCTAC TTTTATGTTT TCTTCTTGTT TATTTGTGGA GTGCAGCAAA 4723 .......... .......... .......... .......... .......... .......... 297 CGTGCCGTCA TCTTCGACTC AACAGTAACT CTAGCCAGTC TTCATTACTC CGGATATCAG 4783 .......... .......... .......... .......... .......... .......... 297 GGTGAGCTAA TGCTTCTAGC TTGGACTGGA TCTTCCTCTT CATGTCTTGA TGCTTTGAAG 4843 .......... .......... .......... .......... .......... .......... 297 TTCCGGCATG GACTAGCTTT TATGTATTTT TAGCTTCTTA GAAACTCTTA GATTTAGTAG 4903 .......... .......... .......... .......... .......... .......... 297 TTTGAAGTAG ATGTTCTTGT GATGATGACT TCCAGATTTT GGGGATAATA ATATTTATTG 4963 .......... .......... .......... .......... .......... .......... 297 ATTTTATTAA TGAGTTTAAG TCTTCCGCAT TACTTTCTGT TGATATTCTA TTGAAATATT 5023 .......... .......... .......... .......... .......... .......... 297 AAGGTTTAGA TTGGTTGGTT CGCTCACATA GGAGGGTAAG TGTGGGTGCC AGTCGCGATC 5083 .......... .......... .......... .......... .......... .......... 297 CGGTTTTGGG TCGTGACAAA CTTGGTATCA GAGCATTAGG TTCGTTGGTC TCATCACACA 5143 .......... .......... .......... .......... .......... .......... 297 AGAACGAGTC TGGTAGAGTC TTAAGGAACG GTAGGGGGAC ACCTTTACTT TTCTTTGAGG 5203 .......... .......... .......... .......... .......... .......... 297 GGCTATAAGA CTTTAGGAAA ATTCCATTCT TTCATTCTTT CTTTCGTGCT ATTACTTGGG 5263 .......... .......... .......... .......... .......... .......... 297 TCCAATTGGT ATCTAGGTGA TACAAATTGG TATCTGACCA TCTTCACTCT GTTTCGCAGA 5323 | .......... .......... .......... .......... .......... .........A 298 TGGTTAGAA 5332 | |||||| TCTTTAGAA 307 hqPGS_C06HBa0153O03.1-1+_SGN-E578113+ (2086 2240,2328 2437) ******************************************************************************** EST sequence 174 +strand 454 n (File: SGN-E396070+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCGGGGGGGG 421 GAGTTTCTAA TTGTTTTGAA ACTAGACTCC TCGA Predicted gene structure (within gDNA segment 4702 to 500): Exon 1 3787 3448 ( 340 n); cDNA 2 339 ( 338 n); score: 0.863 Intron 1 3447 3354 ( 94 n); Pd: 0.000 (s: 0.74), Pa: 0.000 (s: 0) Exon 2 3353 3321 ( 33 n); cDNA 340 372 ( 33 n); score: 0.879 MATCH C06HBa0153O03.1-1- SGN-E396070+ 0.863 373 0.822 C PGS_C06HBa0153O03.1-1-_SGN-E396070+ (3787 3448,3353 3321) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT CTTTGTTCTT TCTACTTTTC TTATTCAAAC CCTCTTTCTT 3728 ||| | | ||||||||| | ||||||| | | ||||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAACAA AAGATGGCAA TAATAACTCA CTAATTAACT 3668 |||||||||| |||||||||| ||||||| || ||||||| || ||||||| || |||||| ||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 TAAGGTTACC TCTTTTAACC CCCAAGTAAT TAGACTTATT AAAATTAACC CACTAACTTT 3608 ||||||||| |||||||||| |||| ||||| |||||||||| || || |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT CCAAAACGCC CCTTAAAATA ATTACAGAAA TCTGACCCAG 3548 |||||||||| ||||||||| |||||||| | |||||||| || ||||| || ||||||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA CGCAGCCTGT GACGGCCCGT CGCGCCTGCG ACGGTCCATT CTGCTGCTCC 3488 ||||||||| |||| ||||| || ||||||| || ||||||| ||||||| | |||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCCGAGACTC AATTT-CTCT GAAGAGTCTG TAACGGTTCG TCCTGCCATT 3429 ||| || || || ||||||| ||||| | | ||||||||| | GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG T......... .......... 339 CCGTTACGAA GTTCAGAAAG TCGATTTCAG TACCCAATTT TGAGAATTCT AAGTATTTTG 3369 .......... .......... .......... .......... .......... .......... 339 GAATGAGATA TCCTCGACGG TCCGTCGTGC CCATGACGGT CGGTCGTG 3321 ||||| |||||| || || ||||||| | |||||| .......... .....GACGG TCCGTCACGC CCGTGACGGT CCGTCGTG 372 hqPGS_C06HBa0153O03.1-1-_SGN-E396070+ (3787 3448,3353 3321) ******************************************************************************** EST sequence 176 +strand 472 n (File: SGN-E236652+) 1 TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GA Predicted gene structure (within gDNA segment 4692 to 310): Exon 1 3787 3448 ( 340 n); cDNA 1 338 ( 338 n); score: 0.863 Intron 1 3447 3354 ( 94 n); Pd: 0.000 (s: 0.74), Pa: 0.000 (s: 0) Exon 2 3353 3321 ( 33 n); cDNA 339 371 ( 33 n); score: 0.879 MATCH C06HBa0153O03.1-1- SGN-E236652+ 0.863 373 0.790 C PGS_C06HBa0153O03.1-1-_SGN-E236652+ (3787 3448,3353 3321) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT CTTTGTTCTT TCTACTTTTC TTATTCAAAC CCTCTTTCTT 3728 ||| | | ||||||||| | ||||||| | | ||||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 59 TTACCCTAAT TAGCATATAA TTAAGAACAA AAGATGGCAA TAATAACTCA CTAATTAACT 3668 |||||||||| |||||||||| ||||||| || ||||||| || ||||||| || |||||| ||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 118 TAAGGTTACC TCTTTTAACC CCCAAGTAAT TAGACTTATT AAAATTAACC CACTAACTTT 3608 ||||||||| |||||||||| |||| ||||| |||||||||| || || |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 178 ATAATTAAAG CAGGAATAGT CCAAAACGCC CCTTAAAATA ATTACAGAAA TCTGACCCAG 3548 |||||||||| ||||||||| |||||||| | |||||||| || ||||| || ||||||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 238 CCTGGGATTA CGCAGCCTGT GACGGCCCGT CGCGCCTGCG ACGGTCCATT CTGCTGCTCC 3488 ||||||||| |||| ||||| || ||||||| || ||||||| ||||||| | |||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 297 GTCACAGAGT TCCGAGACTC AATTT-CTCT GAAGAGTCTG TAACGGTTCG TCCTGCCATT 3429 ||| || || || ||||||| ||||| | | ||||||||| | GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG T......... .......... 338 CCGTTACGAA GTTCAGAAAG TCGATTTCAG TACCCAATTT TGAGAATTCT AAGTATTTTG 3369 .......... .......... .......... .......... .......... .......... 338 GAATGAGATA TCCTCGACGG TCCGTCGTGC CCATGACGGT CGGTCGTG 3321 ||||| |||||| || || ||||||| | |||||| .......... .....GACGG TCCGTCACGC CCGTGACGGT CCGTCGTG 371 hqPGS_C06HBa0153O03.1-1-_SGN-E236652+ (3787 3448,3353 3321) ******************************************************************************** EST sequence 99 -strand 725 n (File: SGN-E546548-) 1 GGTACCGGAA CGTGGCACCC GATCCATATT CTATCCTGGT GTCGGAACGT GACACTCCGA 61 TCCTCATATT CATTCTATCC TGGTACCGGA ACGTGGCACC CGATCCCCTA ATCCATCAAG 121 CCTTCTTTTA CACTAAGGCA TCATCATTCT CATTATATAA TTTATCAAGC CTTCTTTCAT 181 ACTAAGGCAT CATCATTCTC ATTATATAAT ATATCAAGCG AATTAGGGTT CTTTCAAGAT 241 TTGGGATTCA ATTGCTTCAT CATGCTTTGT TAATTCATCG CAATTTCATA ATCATAATCA 301 TGCAAGCATA CAACTTAAGC ACATAGCAGG GTTTACAATA CTATCAACAC ATAATATTCA 361 CTATTAAGAG TTCACTACGA ATATCGTAAC ATAAACCATA ACCTACCTCC ACCGAAGAAT 421 TGAATCAACA AGCTATCTTC TCAAAATCCT TGCTATCCTC TTCGTTTCTC TCTCTCTACT 481 CGTTCGTTTC TCCTCTCTTT CTGTTCTTTT CTTTTGTTTT GTTTTATTCA AACCCTCCTT 541 CTTTTTACCC TAATTAAAAG TATAATTAAG TGTAAAGGAG GACAATAAAA CCCACTAATT 601 AACTTAAGGT TACCTCTTTT AACCCCCAAG TAATTAGACC TATTAATATT AACCCTCAAT 661 CTTTATAATT AAGGAAAGAA TAGTCCAAAA CGACCCCTAA AACGTGTAGA GGAATCCTAT 721 TTTGC Predicted gene structure (within gDNA segment 7541 to 2173): Exon 1 4142 4069 ( 74 n); cDNA 148 219 ( 72 n); score: 0.824 Intron 1 4068 4035 ( 34 n); Pd: 0.000 (s: 0.82), Pa: 0.000 (s: 0.73) Exon 2 4034 3547 ( 488 n); cDNA 220 725 ( 506 n); score: 0.733 MATCH C06HBa0153O03.1-1- SGN-E546548- 0.745 562 0.775 C PGS_C06HBa0153O03.1-1-_SGN-E546548- (4142 4069,4034 3547) Alignment (genomic DNA sequence = upper lines): TCTCCTTCTA TCAATTCATC AAGCCTTCTT TCTTACCAAG GCATCATCAA TCTCATTATT 4083 |||| || || | |||| ||| |||||||||| || ||| ||| ||||||||| |||||||| | TCTCATTATA T-AATTTATC AAGCCTTCTT TCATACTAAG GCATCATCAT TCTCATTA-T 205 TTAGTTCATC ACGCCTTCTT TTATACCAAG GCCCCATCAT TAACAAAGAG ATTAGGGTT- 4024 || | ||| | || ||||||||| ATAATATATC AAGC...... .......... .......... ........GA ATTAGGGTTC 231 TTGCAAGATT TGGGATTCAA TAACTTCATC ATGC-TT-AT -A-TAACCAC AATTATAAAA 3968 || ||||||| |||||||||| | ||||||| |||| || | | | | | | |||| | || TTTCAAGATT TGGGATTCAA TTGCTTCATC ATGCTTTGTT AATTCATCGC AATTTCATAA 291 TTACATTCAT GCAAGCATAC AA-TTAAGCA CATAGCAGGG TTTACAATAT TATCAATATA 3909 | | | |||| |||||||||| || ||||||| |||||||||| ||||||||| |||||| | | TCATAATCAT GCAAGCATAC AACTTAAGCA CATAGCAGGG TTTACAATAC TATCAACACA 351 TATCATTCGC TATTAAGAGT TTACTACGAA TATCGTAAGA GAAACCATAA CCTACCTCCA 3849 || |||| | |||||||||| | |||||||| |||||||| | ||||||||| |||||||||| TAATATTCAC TATTAAGAGT TCACTACGAA TATCGTAACA TAAACCATAA CCTACCTCCA 411 CCGAAGATTA GTGATCAAGC AAGAAAT-TT C-CCCAA-GC TT--TGT--T CTTCGTTT-T 3797 ||||||| | | ||||| | ||| || || | | || | || | | | |||||||| | CCGAAGAATT G-AATCAA-C AAGCTATCTT CTCAAAATCC TTGCTATCCT CTTCGTTTCT 469 CTCTCT-TCC TCGTTCGATC CT-CTCTCTC TCT-TTGTTC T-TTCTACTT T-TCTTATTC 3742 |||||| | | ||||||| | || |||||| ||| || || | || | || | | |||||| CTCTCTCTAC TCGTTCGTTT CTCCTCTCTT TCTGTTCTTT TCTTTTGTTT TGTTTTATTC 529 AAACCCTCTT TC-TTTTACC CTAATTAGCA -TATAATTAA GAACAAAAGA TGGCAATAAT 3684 |||||||| | || ||||||| ||||||| | ||||||||| | ||| || | |||||| AAACCCTCCT TCTTTTTACC CTAATTAAAA GTATAATTAA GTGTAAAGGA GGACAATAA- 588 AACTCACTAA TTAACTTAAG GTTACCTCTT TTAACCCCCA AGTAATTAGA CTTATTAAAA 3624 ||| |||||| |||||||||| |||||||||| |||||||||| |||||||||| | |||||| | AACCCACTAA TTAACTTAAG GTTACCTCTT TTAACCCCCA AGTAATTAGA CCTATTAATA 648 TTAACCCACT AACTTTATAA TTAAAGCAGG AATAGTCCAA AACGCCCCTT AAAATAATTA 3564 ||||||| | | |||||||| |||| | | | |||||||||| |||| ||| | |||| || TTAACCCTCA ATCTTTATAA TTAAGGAAAG AATAGTCCAA AACGACCCCT AAAACGTGTA 708 CAGAAATCTG ACCCAGC 3547 || |||| | || GAGGAATCCT ATTTTGC 725 hqPGS_C06HBa0153O03.1-1-_SGN-E546548- (4142 4069,4034 3547) ******************************************************************************** EST sequence 10 -strand 679 n (File: SGN-E550127-) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTCCT TTTCTTTTTC TTATCAAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTGCACAA CCATGAATTA ATGAAAAAAT TATGACATAA 661 AATATAAAAA ATTACTCAT Predicted gene structure (within gDNA segment 4892 to 1): Exon 1 3787 3249 ( 539 n); cDNA 2 557 ( 556 n); score: 0.770 Intron 1 3248 2526 ( 723 n); Pd: 0.252 (s: 0.90), Pa: 0.000 (s: 0) Exon 2 2525 2496 ( 30 n); cDNA 558 587 ( 30 n); score: 0.700 Intron 2 2495 11 (2485 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0) Exon 3 10 1 ( 10 n); cDNA 588 597 ( 10 n); score: 0.800 MATCH C06HBa0153O03.1-1- SGN-E550127- 0.770 579 0.853 C PGS_C06HBa0153O03.1-1-_SGN-E550127- (3787 3249,2525 2496,10 1) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT CTTTGTTCTT TCTACTTTTC TTATTCAAAC CCTCTTTCTT 3728 ||| | | ||||||||| | ||| ||| | | ||||| |||| |||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTCCTT T-TCTTTTTC TTATCAAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAACAA AAGATGGCAA TAATAACTCA CTAATTAACT 3668 |||||||||| |||||||||| ||||||| || ||||||| || ||||||| || |||||| ||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 TAAGGTTACC TCTTTTAACC CCCAAGTAAT TAGACTTATT AAAATTAACC CACTAACTTT 3608 ||||||||| |||||||||| |||| ||||| |||||||||| || || |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT CCAAAACGCC CCTTAAAATA ATTACAGAAA TCTGACCCAG 3548 |||||||||| ||||||||| |||||||| | |||||||| || ||||| || ||||||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA CGCAGCCTGT GACGGCCCGT CGCGCCTGCG ACGGTCCATT CTGCTGCTCC 3488 ||||||||| |||| ||||| || ||||||| || ||||||| ||||||| | |||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCCGAGACTC AATTT-CTCT GAAGAGTCTG T-A------- --ACG---GT 3442 ||| || || || ||||||| ||||| | | ||||||||| | | ||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 -----T-CGT CCTGCCATTC CGTTACGAAG TTCAGAAAGT CGA-TTTCAG TACCCAATTT 3389 | ||| | |||||||| |||||||||| |||||| ||| ||| ||| || ||||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCAA-TT 417 TGAGAA-TTC TAAGTATTTT GGAATGAGAT ATCCTCGACG GTCCGTCGTG CCCATGACGG 3330 | |||| ||| ||||| |||| | || |||| ||||||||| |||| ||||| | |||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGA- CTCCTCGACG GTCCATCGTG CTCATGACGG 476 TCGGTCGTGA GTTCCGTCGT CTTTGCCTGT TTTTCAAGAA ATAAAATCTG CTGCTCGAAA 3270 || |||||| |||||||||| || ||||| ||||| | || |||||||||| || ||| ||| TCCGTCGTGG GTTCCGTCGT CTCAACCTGT TTTTCCAAAA ATAAAATCTG CTACTCAAAA 536 CGACTAAACA GGTCGTTACA AGTATTGCCT AATTCCTTTT AAGGGTTATT TAGGGGTAAA 3210 |||||||||| |||||||||| CGACTAAACA GGTCGTTACA T......... .......... .......... .......... 557 GAACAAGTCT CAAATCATTT TTACAATAAT TTAGAACGCT TAGGTCTTGG GAGAAAAGAG 3150 .......... .......... .......... .......... .......... .......... 557 AAAATAAAAG GGATCAAGAA TTGTCCAAGA ACTCCAACAT TGTTGCGTAG ATTTTATCAA 3090 .......... .......... .......... .......... .......... .......... 557 GGGTTATCCC TATCGAGGTA TGTGAGATCA TTCGGTGTTG GATCCTTTCA TCCACACTCC 3030 .......... .......... .......... .......... .......... .......... 557 AAATTTATTC AATACAATAA ATTAGAGTTT ATATTTAATA AAAATCTTGA TGTTCTTGAT 2970 .......... .......... .......... .......... .......... .......... 557 GAAAGTTATG AAGTTCTTGT TGGTTGAATT ATAGTTGATC TCTAGTAGTT ATGATTCATG 2910 .......... .......... .......... .......... .......... .......... 557 TTTTCGGGTC TAAATATTGG GTAATGAGTT TTTTAGGGAT CCTAAGTTTT TGAGGTGGGA 2850 .......... .......... .......... .......... .......... .......... 557 TCCATGGGAG TTTGGTAGGT TTAGATATGA AAGAGGAGAA GAAAAGTCGG AGGTCCAGTC 2790 .......... .......... .......... .......... .......... .......... 557 AGACAACCCT TGGGGCGCCG CGCCTCTTAG AGCGCCAATA CCCTCGGAGA CCCTTTTCTT 2730 .......... .......... .......... .......... .......... .......... 557 TCCCTATATT TTCGTACTAG TTCCTAAGTG ATGTACCTCT CATTCCTAGT TGACCAACAC 2670 .......... .......... .......... .......... .......... .......... 557 TCTAGAATAA ATATAAACAT CATGAAATCA TCCATAAACA TGAGATTATG ATCCTTGAAT 2610 .......... .......... .......... .......... .......... .......... 557 TCATAATCCA ATTCAAGAGA AACTAAGATC AAAGTCAAGA AAGTAAGCAA TGAAGGGAGT 2550 .......... .......... .......... .......... .......... .......... 557 AAAAGTAAAG CTTTTATTTC AAAGTTCTTA GAATCTTACT TAAATGTTAT AATTTCGTTT 2490 || | || |||| ||||| |||| ||| .......... .......... ....TTATGT TCTTCCTACT TAAATATTAT TATT...... 587 TAAGGCTCAT GTTCGAGTTA AGGAAAAGAG TAAAGGTTGA ATTCTTTTCT TAAAATGGTA 2430 .......... .......... .......... .......... .......... .......... 587 TATAAGAAAA CTAAGTATTT CCTAAGAGTT AAAGTTTAAA GTTAAGTAAA GAGTAAGAGT 2370 .......... .......... .......... .......... .......... .......... 587 TGAGTTCATT TCTCTAAAGA TATAAGGGGG ACTAAGTACT CCCTAAAGGT TGATAAATGT 2310 .......... .......... .......... .......... .......... .......... 587 TTCACATTTA AGTTAAAAGA AAACTAAGAA GTTTCAAAAG AGTTTTGAAC TAAAAGGGGA 2250 .......... .......... .......... .......... .......... .......... 587 ACATTGATTC CAAAAGGAGA TTTGTAAAGC TAAAGGGTTC AGTAAATTAT CTCAACCCAA 2190 .......... .......... .......... .......... .......... .......... 587 AGGAAGGAAG TTTTGTTAAA AGTATGAGCT AAAGTATGTT TTGGGAGTAG TATTGAGCAC 2130 .......... .......... .......... .......... .......... .......... 587 CGATGTAGGA ATGAGAGTTC AGATAACTCA AGTCCCCATG TAAATCATGT AGCTATCATG 2070 .......... .......... .......... .......... .......... .......... 587 GGTGTTAACA TGTCATACTT TTTAGATGAT CACGTAAGTT TAGCCAGTGG ATCACTAGGT 2010 .......... .......... .......... .......... .......... .......... 587 TGATGATATC CTATGCGACG ACAAATTATA GGACAGTTTT GGCAGCGTGT ACACGACACT 1950 .......... .......... .......... .......... .......... .......... 587 GTATTATCAC TTAGGCTCAT AGTGATGGCT GTCAGTTAGA GAAACTCCAG CAGAAGCTAT 1890 .......... .......... .......... .......... .......... .......... 587 ATTACTTTCA TATATAAGTA AAGTTGAGTT TATTACATGT GTCCTTATTG CTTTATATTG 1830 .......... .......... .......... .......... .......... .......... 587 AGTTGTTATC TTATGAGTTG AGTAGAGCCA AGATAAGTTC ACCCTTACTC CATTTCAAGC 1770 .......... .......... .......... .......... .......... .......... 587 GTTATAGTTG TGCTTAGCAT TCCAACTCGT ATACTTGTAC ATTCAATGTA CTGAAGACAG 1710 .......... .......... .......... .......... .......... .......... 587 TTGGCCTGCA TCATCTTGAG ATGCAGACAC AGGTAACCAG GATCAGCACG CAGCACACCG 1650 .......... .......... .......... .......... .......... .......... 587 TTGATCCATT TGAACATTCT GTAGTCATTT GGTGAGCCTC TTTGCATTCC GGAGGACATC 1590 .......... .......... .......... .......... .......... .......... 587 CCTTTATTTA CTTTCCTAGT TTAGTTATTA GGATGTTGTG GGGTCTGTTC CAACATCCAT 1530 .......... .......... .......... .......... .......... .......... 587 CTTAGTCAGT TTAGAGGCTT AATAGACAAT GTAGCAGTTC AGTTTTGGAG TCTCCTTTAT 1470 .......... .......... .......... .......... .......... .......... 587 CTTATACTTC GTATCACTAC AACAAAAATG TCCATTTGCG ACATTTAATT CTTAATTGCC 1410 .......... .......... .......... .......... .......... .......... 587 GCTAAGTATG TATTTTTAGA GGCAATTGTC ACTATTTGTA TATGTCCCTA TTGCCTTTAG 1350 .......... .......... .......... .......... .......... .......... 587 AGACATTGGT TCTAATGACA CTTAACTAAT GCCGGTAAAT ACTTTAGAAC TCTTTATTAG 1290 .......... .......... .......... .......... .......... .......... 587 TGTCAATATT TAATGCCACT AAAAGTTATT TTTGTTGTAG TGTATATTAC AGTATTAAGA 1230 .......... .......... .......... .......... .......... .......... 587 CTTAATTGCC ATTTTGGCTA AGACAAGTTA CTATCTTATG AGAATGACTT TAATTATTTA 1170 .......... .......... .......... .......... .......... .......... 587 TCTAAGTTTA GCATAGTCAT CCATTAAGTT AAGTAAGCCA GACCAAGGGT TCACTTAAGA 1110 .......... .......... .......... .......... .......... .......... 587 CCAGAAATGA TCGTTAAGTG TCGGCCACGT CCGTGGTGTA GGCTTCGGCA TGACATTTTT 1050 .......... .......... .......... .......... .......... .......... 587 TGGTAGCAAG TAAGAGGCGT GAATGACTGA CAAAAATTAT GTAGTTATTA TTGACGATGT 990 .......... .......... .......... .......... .......... .......... 587 CGAATCTAGG ATTTTAAGGT TGTGGACACT TCAAGAAGAG TTGGTCTTTA AATTTTTTAT 930 .......... .......... .......... .......... .......... .......... 587 TAAAAAGGAT TCCATAATGT GAATATATTT AAAATGACCA CAACAAATGT AATATACAAC 870 .......... .......... .......... .......... .......... .......... 587 TTTGAGCACC CACAAGCGAC CAATATATCA AGTAAATAGT TGCCTAGGTG CTCAATGTTT 810 .......... .......... .......... .......... .......... .......... 587 AAGTTTATAC CATTCATTCA ACTTTATATA AAAATATATA TAGTTCGATA CGAGATTAAT 750 .......... .......... .......... .......... .......... .......... 587 GGGTGCTCAA TGTTAGGGTT TTGTCATGAT TTTCTTACCT TATTTGAAGT TCATTTTTTT 690 .......... .......... .......... .......... .......... .......... 587 CTTAAAAGAA AAGACAAAAC ATTATTATAA ATGGTTTTAC TTTTCCTGAA GGAAAATATA 630 .......... .......... .......... .......... .......... .......... 587 ATATTCTTTC ATATTTGGTG TTTATTCCTT TATTAGAAGA GAAACTTGGA ACTCTATAAA 570 .......... .......... .......... .......... .......... .......... 587 TTGAAGATCC TTCTTCTCAT ATCGACAACA ATAAAATTCA CAATGTAGTT GTTTAGAGAC 510 .......... .......... .......... .......... .......... .......... 587 TTTTATTTAT GGGGAGATCT ATTCTCACTA CAATTTTAAT GTCTTTTTAT TAACTTTTAA 450 .......... .......... .......... .......... .......... .......... 587 TTATTTAGGC CAATTGGTCA TCATATAATA ATATTTTTTC TATTAGTATT AATTTTTTTT 390 .......... .......... .......... .......... .......... .......... 587 ATCTAATTTA TTAACTATAT GATTTACAAA TTATAAGTTT CCACATGAAG CCGTATTCAC 330 .......... .......... .......... .......... .......... .......... 587 CACTTCATAT CCCCGACAAG TGATATCAGA GCACATGGTC CAACAATCAT ATGACTCAAC 270 .......... .......... .......... .......... .......... .......... 587 AAGTGGTATC AGAGAGCACG GTTCAAAGAT CTGATGGTTC AGTGATCTAA TGCTTGAAGG 210 .......... .......... .......... .......... .......... .......... 587 AGGTTGAAGA CAAACTCAAG TGGTTTCAAA ACATTTTGTA ACAAATTTGA TGATAATGAA 150 .......... .......... .......... .......... .......... .......... 587 GATTTTGTCA AGTTACATGT GGAGAAAAAG TTTCAACCAT ATTTTTAACA TTTTGCAATT 90 .......... .......... .......... .......... .......... .......... 587 TTTAAAAATA AAAATAAAAT ATTTTTAAAT CATATATTTT AAATTTTAAT TTTAACCATG 30 .......... .......... .......... .......... .......... .......... 587 GCAAGCAACA ATAATGAAGA TTTTGTGAT 1 | |||| ||| .......... .........A TTTTACGAT 597 hqPGS_C06HBa0153O03.1-1-_SGN-E550127- (3787 3249) ******************************************************************************** EST sequence 184 +strand 690 n (File: SGN-E377133+) 1 TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT 481 CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT CAAAACGACT 541 AAACAGGTCG TTACATTTAT GTTCTTCCTA CTTAAATATT ATTATTATTT TACGATTTAT 601 AACACTATTA GAAACAAAGA TTTTCTCAAC CATGAATTAA TGAAAAAATT ATGCCATAAA 661 ATATAAAAAA TTTACTCATT TTTCATTGAG Predicted gene structure (within gDNA segment 4692 to 1): Exon 1 3787 3249 ( 539 n); cDNA 1 556 ( 556 n); score: 0.776 Intron 1 3248 2526 ( 723 n); Pd: 0.252 (s: 0.90), Pa: 0.000 (s: 0.65) Exon 2 2525 2480 ( 46 n); cDNA 557 600 ( 44 n); score: 0.652 MATCH C06HBa0153O03.1-1- SGN-E377133+ 0.776 585 0.848 C PGS_C06HBa0153O03.1-1-_SGN-E377133+ (3787 3249,2525 2480) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT CTTTGTTCTT TCTACTTTTC TTATTCAAAC CCTCTTTCTT 3728 ||| | | ||||||||| | ||||||| | | ||||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 59 TTACCCTAAT TAGCATATAA TTAAGAACAA AAGATGGCAA TAATAACTCA CTAATTAACT 3668 |||||||||| |||||||||| ||||||| || ||||||| || ||||||| || |||||| ||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 118 TAAGGTTACC TCTTTTAACC CCCAAGTAAT TAGACTTATT AAAATTAACC CACTAACTTT 3608 ||||||||| |||||||||| |||| ||||| |||||||||| || || |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 178 ATAATTAAAG CAGGAATAGT CCAAAACGCC CCTTAAAATA ATTACAGAAA TCTGACCCAG 3548 |||||||||| ||||||||| |||||||| | |||||||| || ||||| || ||||||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 238 CCTGGGATTA CGCAGCCTGT GACGGCCCGT CGCGCCTGCG ACGGTCCATT CTGCTGCTCC 3488 ||||||||| |||| ||||| || ||||||| || ||||||| ||||||| | |||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 297 GTCACAGAGT TCCGAGACTC AATTT-CTCT GAAGAGTCTG T-A------- --ACG---GT 3442 ||| || || || ||||||| ||||| | | ||||||||| | | ||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 357 -----T-CGT CCTGCCATTC CGTTACGAAG TTCAGAAAGT CGA-TTTCAG TACCCAATTT 3389 | ||| | |||||||| |||||||||| |||||| ||| ||| ||| || ||||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCAA-TT 416 TGAGAA-TTC TAAGTATTTT GGAATGAGAT ATCCTCGACG GTCCGTCGTG CCCATGACGG 3330 | |||| ||| ||||| |||| | || |||| ||||||||| |||| ||||| | |||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGA- CTCCTCGACG GTCCATCGTG CTCATGACGG 475 TCGGTCGTGA GTTCCGTCGT CTTTGCCTGT TTTTCAAGAA ATAAAATCTG CTGCTCGAAA 3270 || |||||| |||||||||| || ||||| ||||| | || |||||||||| || ||| ||| TCCGTCGTGG GTTCCGTCGT CTCAACCTGT TTTTCCAAAA ATAAAATCTG CTACTCAAAA 535 CGACTAAACA GGTCGTTACA AGTATTGCCT AATTCCTTTT AAGGGTTATT TAGGGGTAAA 3210 |||||||||| |||||||||| CGACTAAACA GGTCGTTACA T......... .......... .......... .......... 556 GAACAAGTCT CAAATCATTT TTACAATAAT TTAGAACGCT TAGGTCTTGG GAGAAAAGAG 3150 .......... .......... .......... .......... .......... .......... 556 AAAATAAAAG GGATCAAGAA TTGTCCAAGA ACTCCAACAT TGTTGCGTAG ATTTTATCAA 3090 .......... .......... .......... .......... .......... .......... 556 GGGTTATCCC TATCGAGGTA TGTGAGATCA TTCGGTGTTG GATCCTTTCA TCCACACTCC 3030 .......... .......... .......... .......... .......... .......... 556 AAATTTATTC AATACAATAA ATTAGAGTTT ATATTTAATA AAAATCTTGA TGTTCTTGAT 2970 .......... .......... .......... .......... .......... .......... 556 GAAAGTTATG AAGTTCTTGT TGGTTGAATT ATAGTTGATC TCTAGTAGTT ATGATTCATG 2910 .......... .......... .......... .......... .......... .......... 556 TTTTCGGGTC TAAATATTGG GTAATGAGTT TTTTAGGGAT CCTAAGTTTT TGAGGTGGGA 2850 .......... .......... .......... .......... .......... .......... 556 TCCATGGGAG TTTGGTAGGT TTAGATATGA AAGAGGAGAA GAAAAGTCGG AGGTCCAGTC 2790 .......... .......... .......... .......... .......... .......... 556 AGACAACCCT TGGGGCGCCG CGCCTCTTAG AGCGCCAATA CCCTCGGAGA CCCTTTTCTT 2730 .......... .......... .......... .......... .......... .......... 556 TCCCTATATT TTCGTACTAG TTCCTAAGTG ATGTACCTCT CATTCCTAGT TGACCAACAC 2670 .......... .......... .......... .......... .......... .......... 556 TCTAGAATAA ATATAAACAT CATGAAATCA TCCATAAACA TGAGATTATG ATCCTTGAAT 2610 .......... .......... .......... .......... .......... .......... 556 TCATAATCCA ATTCAAGAGA AACTAAGATC AAAGTCAAGA AAGTAAGCAA TGAAGGGAGT 2550 .......... .......... .......... .......... .......... .......... 556 AAAAGTAAAG CTTTTATTTC AAAGTTCTTA GAATCTTACT TAAATGTTAT AATTTCGTTT 2490 || | || |||| ||||| |||| | || ||| .......... .......... ....TTATGT TCTTCCTACT TAAATATTAT TA-TT-ATTT 590 TAAGGCTCAT 2480 || | | || TACGATTTAT 600 hqPGS_C06HBa0153O03.1-1-_SGN-E377133+ (3787 3249) ******************************************************************************** EST sequence 112 +strand 729 n (File: SGN-E550212+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCAA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA AAAAAACTCG 721 GGGGGGGGC Predicted gene structure (within gDNA segment 4702 to 1): Exon 1 3787 3249 ( 539 n); cDNA 2 557 ( 556 n); score: 0.776 Intron 1 3248 2526 ( 723 n); Pd: 0.252 (s: 0.90), Pa: 0.000 (s: 0.68) Exon 2 2525 2486 ( 40 n); cDNA 558 595 ( 38 n); score: 0.675 Intron 2 2485 178 (2308 n); Pd: 0.000 (s: 0.68), Pa: 0.000 (s: 0.60) Exon 3 177 121 ( 57 n); cDNA 596 648 ( 53 n); score: 0.632 PPA cDNA 699 716 MATCH C06HBa0153O03.1-1- SGN-E550212+ 0.762 636 0.872 C PGS_C06HBa0153O03.1-1-_SGN-E550212+ (3787 3249,2525 2486,177 121) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT CTTTGTTCTT TCTACTTTTC TTATTCAAAC CCTCTTTCTT 3728 ||| | | ||||||||| | ||||||| | | ||||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAACAA AAGATGGCAA TAATAACTCA CTAATTAACT 3668 |||||||||| |||||||||| ||||||| || ||||||| || ||||||| || |||||| ||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 TAAGGTTACC TCTTTTAACC CCCAAGTAAT TAGACTTATT AAAATTAACC CACTAACTTT 3608 ||||||||| |||||||||| |||| ||||| |||||||||| || || |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT CCAAAACGCC CCTTAAAATA ATTACAGAAA TCTGACCCAG 3548 |||||||||| ||||||||| |||||||| | |||||||| || ||||| || ||||||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA CGCAGCCTGT GACGGCCCGT CGCGCCTGCG ACGGTCCATT CTGCTGCTCC 3488 ||||||||| |||| ||||| || ||||||| || ||||||| ||||||| | |||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCCGAGACTC AATTT-CTCT GAAGAGTCTG T-A------- --ACG---GT 3442 ||| || || || ||||||| ||||| | | ||||||||| | | ||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 -----T-CGT CCTGCCATTC CGTTACGAAG TTCAGAAAGT CGA-TTTCAG TACCCAATTT 3389 | ||| | |||||||| |||||||||| |||||| ||| ||| ||| || ||||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCAA-TT 417 TGAGAA-TTC TAAGTATTTT GGAATGAGAT ATCCTCGACG GTCCGTCGTG CCCATGACGG 3330 | |||| ||| ||||| |||| | || |||| ||||||||| |||| ||||| | |||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGA- CTCCTCGACG GTCCATCGTG CTCATGACGG 476 TCGGTCGTGA GTTCCGTCGT CTTTGCCTGT TTTTCAAGAA ATAAAATCTG CTGCTCGAAA 3270 || |||||| |||||||||| || ||||| ||||| | || |||||||||| || ||| ||| TCCGTCGTGG GTTCCGTCGT CTCAACCTGT TTTTCCAAAA ATAAAATCTG CTACTCAAAA 536 CGACTAAACA GGTCGTTACA AGTATTGCCT AATTCCTTTT AAGGGTTATT TAGGGGTAAA 3210 |||||||||| |||||||||| CGACTAAACA GGTCGTTACA T......... .......... .......... .......... 557 GAACAAGTCT CAAATCATTT TTACAATAAT TTAGAACGCT TAGGTCTTGG GAGAAAAGAG 3150 .......... .......... .......... .......... .......... .......... 557 AAAATAAAAG GGATCAAGAA TTGTCCAAGA ACTCCAACAT TGTTGCGTAG ATTTTATCAA 3090 .......... .......... .......... .......... .......... .......... 557 GGGTTATCCC TATCGAGGTA TGTGAGATCA TTCGGTGTTG GATCCTTTCA TCCACACTCC 3030 .......... .......... .......... .......... .......... .......... 557 AAATTTATTC AATACAATAA ATTAGAGTTT ATATTTAATA AAAATCTTGA TGTTCTTGAT 2970 .......... .......... .......... .......... .......... .......... 557 GAAAGTTATG AAGTTCTTGT TGGTTGAATT ATAGTTGATC TCTAGTAGTT ATGATTCATG 2910 .......... .......... .......... .......... .......... .......... 557 TTTTCGGGTC TAAATATTGG GTAATGAGTT TTTTAGGGAT CCTAAGTTTT TGAGGTGGGA 2850 .......... .......... .......... .......... .......... .......... 557 TCCATGGGAG TTTGGTAGGT TTAGATATGA AAGAGGAGAA GAAAAGTCGG AGGTCCAGTC 2790 .......... .......... .......... .......... .......... .......... 557 AGACAACCCT TGGGGCGCCG CGCCTCTTAG AGCGCCAATA CCCTCGGAGA CCCTTTTCTT 2730 .......... .......... .......... .......... .......... .......... 557 TCCCTATATT TTCGTACTAG TTCCTAAGTG ATGTACCTCT CATTCCTAGT TGACCAACAC 2670 .......... .......... .......... .......... .......... .......... 557 TCTAGAATAA ATATAAACAT CATGAAATCA TCCATAAACA TGAGATTATG ATCCTTGAAT 2610 .......... .......... .......... .......... .......... .......... 557 TCATAATCCA ATTCAAGAGA AACTAAGATC AAAGTCAAGA AAGTAAGCAA TGAAGGGAGT 2550 .......... .......... .......... .......... .......... .......... 557 AAAAGTAAAG CTTTTATTTC AAAGTTCTTA GAATCTTACT TAAATGTTAT AATTTCGTTT 2490 || | || |||| ||||| |||| | || ||| .......... .......... ....TTATGT TCTTCCTACT TAAATATTAT TA-TT-ATTT 591 TAAGGCTCAT GTTCGAGTTA AGGAAAAGAG TAAAGGTTGA ATTCTTTTCT TAAAATGGTA 2430 || | TACG...... .......... .......... .......... .......... .......... 595 TATAAGAAAA CTAAGTATTT CCTAAGAGTT AAAGTTTAAA GTTAAGTAAA GAGTAAGAGT 2370 .......... .......... .......... .......... .......... .......... 595 TGAGTTCATT TCTCTAAAGA TATAAGGGGG ACTAAGTACT CCCTAAAGGT TGATAAATGT 2310 .......... .......... .......... .......... .......... .......... 595 TTCACATTTA AGTTAAAAGA AAACTAAGAA GTTTCAAAAG AGTTTTGAAC TAAAAGGGGA 2250 .......... .......... .......... .......... .......... .......... 595 ACATTGATTC CAAAAGGAGA TTTGTAAAGC TAAAGGGTTC AGTAAATTAT CTCAACCCAA 2190 .......... .......... .......... .......... .......... .......... 595 AGGAAGGAAG TTTTGTTAAA AGTATGAGCT AAAGTATGTT TTGGGAGTAG TATTGAGCAC 2130 .......... .......... .......... .......... .......... .......... 595 CGATGTAGGA ATGAGAGTTC AGATAACTCA AGTCCCCATG TAAATCATGT AGCTATCATG 2070 .......... .......... .......... .......... .......... .......... 595 GGTGTTAACA TGTCATACTT TTTAGATGAT CACGTAAGTT TAGCCAGTGG ATCACTAGGT 2010 .......... .......... .......... .......... .......... .......... 595 TGATGATATC CTATGCGACG ACAAATTATA GGACAGTTTT GGCAGCGTGT ACACGACACT 1950 .......... .......... .......... .......... .......... .......... 595 GTATTATCAC TTAGGCTCAT AGTGATGGCT GTCAGTTAGA GAAACTCCAG CAGAAGCTAT 1890 .......... .......... .......... .......... .......... .......... 595 ATTACTTTCA TATATAAGTA AAGTTGAGTT TATTACATGT GTCCTTATTG CTTTATATTG 1830 .......... .......... .......... .......... .......... .......... 595 AGTTGTTATC TTATGAGTTG AGTAGAGCCA AGATAAGTTC ACCCTTACTC CATTTCAAGC 1770 .......... .......... .......... .......... .......... .......... 595 GTTATAGTTG TGCTTAGCAT TCCAACTCGT ATACTTGTAC ATTCAATGTA CTGAAGACAG 1710 .......... .......... .......... .......... .......... .......... 595 TTGGCCTGCA TCATCTTGAG ATGCAGACAC AGGTAACCAG GATCAGCACG CAGCACACCG 1650 .......... .......... .......... .......... .......... .......... 595 TTGATCCATT TGAACATTCT GTAGTCATTT GGTGAGCCTC TTTGCATTCC GGAGGACATC 1590 .......... .......... .......... .......... .......... .......... 595 CCTTTATTTA CTTTCCTAGT TTAGTTATTA GGATGTTGTG GGGTCTGTTC CAACATCCAT 1530 .......... .......... .......... .......... .......... .......... 595 CTTAGTCAGT TTAGAGGCTT AATAGACAAT GTAGCAGTTC AGTTTTGGAG TCTCCTTTAT 1470 .......... .......... .......... .......... .......... .......... 595 CTTATACTTC GTATCACTAC AACAAAAATG TCCATTTGCG ACATTTAATT CTTAATTGCC 1410 .......... .......... .......... .......... .......... .......... 595 GCTAAGTATG TATTTTTAGA GGCAATTGTC ACTATTTGTA TATGTCCCTA TTGCCTTTAG 1350 .......... .......... .......... .......... .......... .......... 595 AGACATTGGT TCTAATGACA CTTAACTAAT GCCGGTAAAT ACTTTAGAAC TCTTTATTAG 1290 .......... .......... .......... .......... .......... .......... 595 TGTCAATATT TAATGCCACT AAAAGTTATT TTTGTTGTAG TGTATATTAC AGTATTAAGA 1230 .......... .......... .......... .......... .......... .......... 595 CTTAATTGCC ATTTTGGCTA AGACAAGTTA CTATCTTATG AGAATGACTT TAATTATTTA 1170 .......... .......... .......... .......... .......... .......... 595 TCTAAGTTTA GCATAGTCAT CCATTAAGTT AAGTAAGCCA GACCAAGGGT TCACTTAAGA 1110 .......... .......... .......... .......... .......... .......... 595 CCAGAAATGA TCGTTAAGTG TCGGCCACGT CCGTGGTGTA GGCTTCGGCA TGACATTTTT 1050 .......... .......... .......... .......... .......... .......... 595 TGGTAGCAAG TAAGAGGCGT GAATGACTGA CAAAAATTAT GTAGTTATTA TTGACGATGT 990 .......... .......... .......... .......... .......... .......... 595 CGAATCTAGG ATTTTAAGGT TGTGGACACT TCAAGAAGAG TTGGTCTTTA AATTTTTTAT 930 .......... .......... .......... .......... .......... .......... 595 TAAAAAGGAT TCCATAATGT GAATATATTT AAAATGACCA CAACAAATGT AATATACAAC 870 .......... .......... .......... .......... .......... .......... 595 TTTGAGCACC CACAAGCGAC CAATATATCA AGTAAATAGT TGCCTAGGTG CTCAATGTTT 810 .......... .......... .......... .......... .......... .......... 595 AAGTTTATAC CATTCATTCA ACTTTATATA AAAATATATA TAGTTCGATA CGAGATTAAT 750 .......... .......... .......... .......... .......... .......... 595 GGGTGCTCAA TGTTAGGGTT TTGTCATGAT TTTCTTACCT TATTTGAAGT TCATTTTTTT 690 .......... .......... .......... .......... .......... .......... 595 CTTAAAAGAA AAGACAAAAC ATTATTATAA ATGGTTTTAC TTTTCCTGAA GGAAAATATA 630 .......... .......... .......... .......... .......... .......... 595 ATATTCTTTC ATATTTGGTG TTTATTCCTT TATTAGAAGA GAAACTTGGA ACTCTATAAA 570 .......... .......... .......... .......... .......... .......... 595 TTGAAGATCC TTCTTCTCAT ATCGACAACA ATAAAATTCA CAATGTAGTT GTTTAGAGAC 510 .......... .......... .......... .......... .......... .......... 595 TTTTATTTAT GGGGAGATCT ATTCTCACTA CAATTTTAAT GTCTTTTTAT TAACTTTTAA 450 .......... .......... .......... .......... .......... .......... 595 TTATTTAGGC CAATTGGTCA TCATATAATA ATATTTTTTC TATTAGTATT AATTTTTTTT 390 .......... .......... .......... .......... .......... .......... 595 ATCTAATTTA TTAACTATAT GATTTACAAA TTATAAGTTT CCACATGAAG CCGTATTCAC 330 .......... .......... .......... .......... .......... .......... 595 CACTTCATAT CCCCGACAAG TGATATCAGA GCACATGGTC CAACAATCAT ATGACTCAAC 270 .......... .......... .......... .......... .......... .......... 595 AAGTGGTATC AGAGAGCACG GTTCAAAGAT CTGATGGTTC AGTGATCTAA TGCTTGAAGG 210 .......... .......... .......... .......... .......... .......... 595 AGGTTGAAGA CAAACTCAAG TGGTTTCAAA ACATTTTGTA ACAAATTTGA TGATAATGAA 150 | ||| || || | | | | || || || .......... .......... .......... ..A-TTTATA AC-ACTATTA -GA-AACAAA 619 GATTTTGTCA AGTTACATGT GGAGAAAAA 121 |||||| ||| | | | |||||| GATTTTCTCA ACCATGAATT AATGAAAAA 648 hqPGS_C06HBa0153O03.1-1-_SGN-E550212+ (3787 3249) ******************************************************************************** EST sequence 115 +strand 732 n (File: SGN-E550201+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCNA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA AAAAAAAACT 721 CGAGGGGGGG CC Predicted gene structure (within gDNA segment 4702 to 1): Exon 1 3787 3249 ( 539 n); cDNA 2 557 ( 556 n); score: 0.776 Intron 1 3248 2526 ( 723 n); Pd: 0.252 (s: 0.90), Pa: 0.000 (s: 0.68) Exon 2 2525 2486 ( 40 n); cDNA 558 595 ( 38 n); score: 0.675 Intron 2 2485 178 (2308 n); Pd: 0.000 (s: 0.68), Pa: 0.000 (s: 0.58) Exon 3 177 121 ( 57 n); cDNA 596 648 ( 53 n); score: 0.614 PPA cDNA 699 718 MATCH C06HBa0153O03.1-1- SGN-E550201+ 0.760 636 0.869 C PGS_C06HBa0153O03.1-1-_SGN-E550201+ (3787 3249,2525 2486,177 121) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT CTTTGTTCTT TCTACTTTTC TTATTCAAAC CCTCTTTCTT 3728 ||| | | ||||||||| | ||||||| | | ||||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAACAA AAGATGGCAA TAATAACTCA CTAATTAACT 3668 |||||||||| |||||||||| ||||||| || ||||||| || ||||||| || |||||| ||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 TAAGGTTACC TCTTTTAACC CCCAAGTAAT TAGACTTATT AAAATTAACC CACTAACTTT 3608 ||||||||| |||||||||| |||| ||||| |||||||||| || || |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT CCAAAACGCC CCTTAAAATA ATTACAGAAA TCTGACCCAG 3548 |||||||||| ||||||||| |||||||| | |||||||| || ||||| || ||||||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA CGCAGCCTGT GACGGCCCGT CGCGCCTGCG ACGGTCCATT CTGCTGCTCC 3488 ||||||||| |||| ||||| || ||||||| || ||||||| ||||||| | |||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCCGAGACTC AATTT-CTCT GAAGAGTCTG T-A------- --ACG---GT 3442 ||| || || || ||||||| ||||| | | ||||||||| | | ||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 -----T-CGT CCTGCCATTC CGTTACGAAG TTCAGAAAGT CGA-TTTCAG TACCCAATTT 3389 | ||| | |||||||| |||||||||| |||||| ||| ||| ||| || ||||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCAA-TT 417 TGAGAA-TTC TAAGTATTTT GGAATGAGAT ATCCTCGACG GTCCGTCGTG CCCATGACGG 3330 | |||| ||| ||||| |||| | || |||| ||||||||| |||| ||||| | |||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGA- CTCCTCGACG GTCCATCGTG CTCATGACGG 476 TCGGTCGTGA GTTCCGTCGT CTTTGCCTGT TTTTCAAGAA ATAAAATCTG CTGCTCGAAA 3270 || |||||| |||||||||| || ||||| ||||| | || |||||||||| || ||| ||| TCCGTCGTGG GTTCCGTCGT CTCAACCTGT TTTTCCAAAA ATAAAATCTG CTACTCAAAA 536 CGACTAAACA GGTCGTTACA AGTATTGCCT AATTCCTTTT AAGGGTTATT TAGGGGTAAA 3210 |||||||||| |||||||||| CGACTAAACA GGTCGTTACA T......... .......... .......... .......... 557 GAACAAGTCT CAAATCATTT TTACAATAAT TTAGAACGCT TAGGTCTTGG GAGAAAAGAG 3150 .......... .......... .......... .......... .......... .......... 557 AAAATAAAAG GGATCAAGAA TTGTCCAAGA ACTCCAACAT TGTTGCGTAG ATTTTATCAA 3090 .......... .......... .......... .......... .......... .......... 557 GGGTTATCCC TATCGAGGTA TGTGAGATCA TTCGGTGTTG GATCCTTTCA TCCACACTCC 3030 .......... .......... .......... .......... .......... .......... 557 AAATTTATTC AATACAATAA ATTAGAGTTT ATATTTAATA AAAATCTTGA TGTTCTTGAT 2970 .......... .......... .......... .......... .......... .......... 557 GAAAGTTATG AAGTTCTTGT TGGTTGAATT ATAGTTGATC TCTAGTAGTT ATGATTCATG 2910 .......... .......... .......... .......... .......... .......... 557 TTTTCGGGTC TAAATATTGG GTAATGAGTT TTTTAGGGAT CCTAAGTTTT TGAGGTGGGA 2850 .......... .......... .......... .......... .......... .......... 557 TCCATGGGAG TTTGGTAGGT TTAGATATGA AAGAGGAGAA GAAAAGTCGG AGGTCCAGTC 2790 .......... .......... .......... .......... .......... .......... 557 AGACAACCCT TGGGGCGCCG CGCCTCTTAG AGCGCCAATA CCCTCGGAGA CCCTTTTCTT 2730 .......... .......... .......... .......... .......... .......... 557 TCCCTATATT TTCGTACTAG TTCCTAAGTG ATGTACCTCT CATTCCTAGT TGACCAACAC 2670 .......... .......... .......... .......... .......... .......... 557 TCTAGAATAA ATATAAACAT CATGAAATCA TCCATAAACA TGAGATTATG ATCCTTGAAT 2610 .......... .......... .......... .......... .......... .......... 557 TCATAATCCA ATTCAAGAGA AACTAAGATC AAAGTCAAGA AAGTAAGCAA TGAAGGGAGT 2550 .......... .......... .......... .......... .......... .......... 557 AAAAGTAAAG CTTTTATTTC AAAGTTCTTA GAATCTTACT TAAATGTTAT AATTTCGTTT 2490 || | || |||| ||||| |||| | || ||| .......... .......... ....TTATGT TCTTCCTACT TAAATATTAT TA-TT-ATTT 591 TAAGGCTCAT GTTCGAGTTA AGGAAAAGAG TAAAGGTTGA ATTCTTTTCT TAAAATGGTA 2430 || | TACG...... .......... .......... .......... .......... .......... 595 TATAAGAAAA CTAAGTATTT CCTAAGAGTT AAAGTTTAAA GTTAAGTAAA GAGTAAGAGT 2370 .......... .......... .......... .......... .......... .......... 595 TGAGTTCATT TCTCTAAAGA TATAAGGGGG ACTAAGTACT CCCTAAAGGT TGATAAATGT 2310 .......... .......... .......... .......... .......... .......... 595 TTCACATTTA AGTTAAAAGA AAACTAAGAA GTTTCAAAAG AGTTTTGAAC TAAAAGGGGA 2250 .......... .......... .......... .......... .......... .......... 595 ACATTGATTC CAAAAGGAGA TTTGTAAAGC TAAAGGGTTC AGTAAATTAT CTCAACCCAA 2190 .......... .......... .......... .......... .......... .......... 595 AGGAAGGAAG TTTTGTTAAA AGTATGAGCT AAAGTATGTT TTGGGAGTAG TATTGAGCAC 2130 .......... .......... .......... .......... .......... .......... 595 CGATGTAGGA ATGAGAGTTC AGATAACTCA AGTCCCCATG TAAATCATGT AGCTATCATG 2070 .......... .......... .......... .......... .......... .......... 595 GGTGTTAACA TGTCATACTT TTTAGATGAT CACGTAAGTT TAGCCAGTGG ATCACTAGGT 2010 .......... .......... .......... .......... .......... .......... 595 TGATGATATC CTATGCGACG ACAAATTATA GGACAGTTTT GGCAGCGTGT ACACGACACT 1950 .......... .......... .......... .......... .......... .......... 595 GTATTATCAC TTAGGCTCAT AGTGATGGCT GTCAGTTAGA GAAACTCCAG CAGAAGCTAT 1890 .......... .......... .......... .......... .......... .......... 595 ATTACTTTCA TATATAAGTA AAGTTGAGTT TATTACATGT GTCCTTATTG CTTTATATTG 1830 .......... .......... .......... .......... .......... .......... 595 AGTTGTTATC TTATGAGTTG AGTAGAGCCA AGATAAGTTC ACCCTTACTC CATTTCAAGC 1770 .......... .......... .......... .......... .......... .......... 595 GTTATAGTTG TGCTTAGCAT TCCAACTCGT ATACTTGTAC ATTCAATGTA CTGAAGACAG 1710 .......... .......... .......... .......... .......... .......... 595 TTGGCCTGCA TCATCTTGAG ATGCAGACAC AGGTAACCAG GATCAGCACG CAGCACACCG 1650 .......... .......... .......... .......... .......... .......... 595 TTGATCCATT TGAACATTCT GTAGTCATTT GGTGAGCCTC TTTGCATTCC GGAGGACATC 1590 .......... .......... .......... .......... .......... .......... 595 CCTTTATTTA CTTTCCTAGT TTAGTTATTA GGATGTTGTG GGGTCTGTTC CAACATCCAT 1530 .......... .......... .......... .......... .......... .......... 595 CTTAGTCAGT TTAGAGGCTT AATAGACAAT GTAGCAGTTC AGTTTTGGAG TCTCCTTTAT 1470 .......... .......... .......... .......... .......... .......... 595 CTTATACTTC GTATCACTAC AACAAAAATG TCCATTTGCG ACATTTAATT CTTAATTGCC 1410 .......... .......... .......... .......... .......... .......... 595 GCTAAGTATG TATTTTTAGA GGCAATTGTC ACTATTTGTA TATGTCCCTA TTGCCTTTAG 1350 .......... .......... .......... .......... .......... .......... 595 AGACATTGGT TCTAATGACA CTTAACTAAT GCCGGTAAAT ACTTTAGAAC TCTTTATTAG 1290 .......... .......... .......... .......... .......... .......... 595 TGTCAATATT TAATGCCACT AAAAGTTATT TTTGTTGTAG TGTATATTAC AGTATTAAGA 1230 .......... .......... .......... .......... .......... .......... 595 CTTAATTGCC ATTTTGGCTA AGACAAGTTA CTATCTTATG AGAATGACTT TAATTATTTA 1170 .......... .......... .......... .......... .......... .......... 595 TCTAAGTTTA GCATAGTCAT CCATTAAGTT AAGTAAGCCA GACCAAGGGT TCACTTAAGA 1110 .......... .......... .......... .......... .......... .......... 595 CCAGAAATGA TCGTTAAGTG TCGGCCACGT CCGTGGTGTA GGCTTCGGCA TGACATTTTT 1050 .......... .......... .......... .......... .......... .......... 595 TGGTAGCAAG TAAGAGGCGT GAATGACTGA CAAAAATTAT GTAGTTATTA TTGACGATGT 990 .......... .......... .......... .......... .......... .......... 595 CGAATCTAGG ATTTTAAGGT TGTGGACACT TCAAGAAGAG TTGGTCTTTA AATTTTTTAT 930 .......... .......... .......... .......... .......... .......... 595 TAAAAAGGAT TCCATAATGT GAATATATTT AAAATGACCA CAACAAATGT AATATACAAC 870 .......... .......... .......... .......... .......... .......... 595 TTTGAGCACC CACAAGCGAC CAATATATCA AGTAAATAGT TGCCTAGGTG CTCAATGTTT 810 .......... .......... .......... .......... .......... .......... 595 AAGTTTATAC CATTCATTCA ACTTTATATA AAAATATATA TAGTTCGATA CGAGATTAAT 750 .......... .......... .......... .......... .......... .......... 595 GGGTGCTCAA TGTTAGGGTT TTGTCATGAT TTTCTTACCT TATTTGAAGT TCATTTTTTT 690 .......... .......... .......... .......... .......... .......... 595 CTTAAAAGAA AAGACAAAAC ATTATTATAA ATGGTTTTAC TTTTCCTGAA GGAAAATATA 630 .......... .......... .......... .......... .......... .......... 595 ATATTCTTTC ATATTTGGTG TTTATTCCTT TATTAGAAGA GAAACTTGGA ACTCTATAAA 570 .......... .......... .......... .......... .......... .......... 595 TTGAAGATCC TTCTTCTCAT ATCGACAACA ATAAAATTCA CAATGTAGTT GTTTAGAGAC 510 .......... .......... .......... .......... .......... .......... 595 TTTTATTTAT GGGGAGATCT ATTCTCACTA CAATTTTAAT GTCTTTTTAT TAACTTTTAA 450 .......... .......... .......... .......... .......... .......... 595 TTATTTAGGC CAATTGGTCA TCATATAATA ATATTTTTTC TATTAGTATT AATTTTTTTT 390 .......... .......... .......... .......... .......... .......... 595 ATCTAATTTA TTAACTATAT GATTTACAAA TTATAAGTTT CCACATGAAG CCGTATTCAC 330 .......... .......... .......... .......... .......... .......... 595 CACTTCATAT CCCCGACAAG TGATATCAGA GCACATGGTC CAACAATCAT ATGACTCAAC 270 .......... .......... .......... .......... .......... .......... 595 AAGTGGTATC AGAGAGCACG GTTCAAAGAT CTGATGGTTC AGTGATCTAA TGCTTGAAGG 210 .......... .......... .......... .......... .......... .......... 595 AGGTTGAAGA CAAACTCAAG TGGTTTCAAA ACATTTTGTA ACAAATTTGA TGATAATGAA 150 | ||| || || | | | | || || || .......... .......... .......... ..A-TTTATA AC-ACTATTA -GA-AACAAA 619 GATTTTGTCA AGTTACATGT GGAGAAAAA 121 |||||| || | | | |||||| GATTTTCTCN ACCATGAATT AATGAAAAA 648 hqPGS_C06HBa0153O03.1-1-_SGN-E550201+ (3787 3249) ******************************************************************************** EST sequence 127 +strand 720 n (File: SGN-E389834+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTA CGTCGACTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAGAACGAC 541 TAAACAGGAC GTTACATTTA TGATCGTCCT ACTTAAATAT CATTATTATT TTACGATTTA 601 TAACACTATT AGAAACGAAG ATTTTCTCGA CCATGAATTA ATGAAAAAAT ATGCCATGAA 661 ATATAAAAAT TTACTCGTTC TTCATTGAGC TATTCGTGAA AAAAAAAAAA AAATCGAGGG Predicted gene structure (within gDNA segment 4702 to 1): Exon 1 3787 3249 ( 539 n); cDNA 2 557 ( 556 n); score: 0.768 Intron 1 3248 2526 ( 723 n); Pd: 0.252 (s: 0.86), Pa: 0.000 (s: 0.65) Exon 2 2525 2480 ( 46 n); cDNA 558 601 ( 44 n); score: 0.652 Intron 2 2479 1979 ( 501 n); Pd: 0.000 (s: 0.65), Pa: 0.295 (s: 0) Exon 3 1978 1963 ( 16 n); cDNA 602 617 ( 16 n); score: 0.625 PPA cDNA 699 714 MATCH C06HBa0153O03.1-1- SGN-E389834+ 0.768 601 0.835 C PGS_C06HBa0153O03.1-1-_SGN-E389834+ (3787 3249,2525 2480,1978 1963) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT CTTTGTTCTT TCTACTTTTC TTATTCAAAC CCTCTTTCTT 3728 ||| | | ||||||||| | ||||||| | | ||||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAACAA AAGATGGCAA TAATAACTCA CTAATTAACT 3668 |||||||||| |||||||||| ||||||| || ||||||| || ||||||| || |||||| ||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 TAAGGTTACC TCTTTTAACC CCCAAGTAAT TAGACTTATT AAAATTAACC CACTAACTTT 3608 ||||||||| |||||||||| |||| ||||| |||||||||| || || |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT CCAAAACGCC CCTTAAAATA ATTACAGAAA TCTGACCCAG 3548 |||||||||| ||||||||| |||||||| | |||||||| || ||||| || ||||||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA CGCAGCCTGT GACGGCCCGT CGCGCCTGCG ACGGTCCATT CTGCTGCTCC 3488 ||||||||| |||| ||||| || ||||||| || ||||||| ||||||| | |||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCCGAGACTC AATTT-CTCT GAAGAGTCTG T-A------- --ACG---GT 3442 ||| || || || ||||||| ||||| | | ||||||||| | | ||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 -----T-CGT CCTGCCATTC CGTTACGAAG TTCAGAAAGT CGA-TTTCAG TACCCAATTT 3389 | ||| | |||||||| |||||||||| |||||| ||| ||| ||| || ||||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCAA-TT 417 TGAGAA-TTC TAAGTATTTT GGAATGAGAT ATCCTCGACG GTCCGTCGTG CCCATGACGG 3330 | |||| ||| ||||| |||| | || |||| ||||||||| |||| ||||| | |||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGA- CTCCTCGACG GTCCATCGTG CTCATGACGG 476 TCGGTCGTGA GTTCCGTCGT CTTTGCCTGT TTTTCAAGAA ATAAAATCTG CTGCTCGAAA 3270 || |||||| ||| ||||| || ||||| ||||| | || |||||||||| || ||| || TCCGTCGTGG GTTACGTCGA CTCAACCTGT TTTTCCAAAA ATAAAATCTG CTACTCAGAA 536 CGACTAAACA GGTCGTTACA AGTATTGCCT AATTCCTTTT AAGGGTTATT TAGGGGTAAA 3210 |||||||||| || ||||||| CGACTAAACA GGACGTTACA T......... .......... .......... .......... 557 GAACAAGTCT CAAATCATTT TTACAATAAT TTAGAACGCT TAGGTCTTGG GAGAAAAGAG 3150 .......... .......... .......... .......... .......... .......... 557 AAAATAAAAG GGATCAAGAA TTGTCCAAGA ACTCCAACAT TGTTGCGTAG ATTTTATCAA 3090 .......... .......... .......... .......... .......... .......... 557 GGGTTATCCC TATCGAGGTA TGTGAGATCA TTCGGTGTTG GATCCTTTCA TCCACACTCC 3030 .......... .......... .......... .......... .......... .......... 557 AAATTTATTC AATACAATAA ATTAGAGTTT ATATTTAATA AAAATCTTGA TGTTCTTGAT 2970 .......... .......... .......... .......... .......... .......... 557 GAAAGTTATG AAGTTCTTGT TGGTTGAATT ATAGTTGATC TCTAGTAGTT ATGATTCATG 2910 .......... .......... .......... .......... .......... .......... 557 TTTTCGGGTC TAAATATTGG GTAATGAGTT TTTTAGGGAT CCTAAGTTTT TGAGGTGGGA 2850 .......... .......... .......... .......... .......... .......... 557 TCCATGGGAG TTTGGTAGGT TTAGATATGA AAGAGGAGAA GAAAAGTCGG AGGTCCAGTC 2790 .......... .......... .......... .......... .......... .......... 557 AGACAACCCT TGGGGCGCCG CGCCTCTTAG AGCGCCAATA CCCTCGGAGA CCCTTTTCTT 2730 .......... .......... .......... .......... .......... .......... 557 TCCCTATATT TTCGTACTAG TTCCTAAGTG ATGTACCTCT CATTCCTAGT TGACCAACAC 2670 .......... .......... .......... .......... .......... .......... 557 TCTAGAATAA ATATAAACAT CATGAAATCA TCCATAAACA TGAGATTATG ATCCTTGAAT 2610 .......... .......... .......... .......... .......... .......... 557 TCATAATCCA ATTCAAGAGA AACTAAGATC AAAGTCAAGA AAGTAAGCAA TGAAGGGAGT 2550 .......... .......... .......... .......... .......... .......... 557 AAAAGTAAAG CTTTTATTTC AAAGTTCTTA GAATCTTACT TAAATGTTAT AATTTCGTTT 2490 || | | || |||| ||||| | || | || ||| .......... .......... ....TTATGA TCGTCCTACT TAAATATCAT TA-TT-ATTT 591 TAAGGCTCAT GTTCGAGTTA AGGAAAAGAG TAAAGGTTGA ATTCTTTTCT TAAAATGGTA 2430 || | | || TACGATTTAT .......... .......... .......... .......... .......... 601 TATAAGAAAA CTAAGTATTT CCTAAGAGTT AAAGTTTAAA GTTAAGTAAA GAGTAAGAGT 2370 .......... .......... .......... .......... .......... .......... 601 TGAGTTCATT TCTCTAAAGA TATAAGGGGG ACTAAGTACT CCCTAAAGGT TGATAAATGT 2310 .......... .......... .......... .......... .......... .......... 601 TTCACATTTA AGTTAAAAGA AAACTAAGAA GTTTCAAAAG AGTTTTGAAC TAAAAGGGGA 2250 .......... .......... .......... .......... .......... .......... 601 ACATTGATTC CAAAAGGAGA TTTGTAAAGC TAAAGGGTTC AGTAAATTAT CTCAACCCAA 2190 .......... .......... .......... .......... .......... .......... 601 AGGAAGGAAG TTTTGTTAAA AGTATGAGCT AAAGTATGTT TTGGGAGTAG TATTGAGCAC 2130 .......... .......... .......... .......... .......... .......... 601 CGATGTAGGA ATGAGAGTTC AGATAACTCA AGTCCCCATG TAAATCATGT AGCTATCATG 2070 .......... .......... .......... .......... .......... .......... 601 GGTGTTAACA TGTCATACTT TTTAGATGAT CACGTAAGTT TAGCCAGTGG ATCACTAGGT 2010 .......... .......... .......... .......... .......... .......... 601 TGATGATATC CTATGCGACG ACAAATTATA GGACAGTTTT GGCAGCG 1963 ||| | || | | || .......... .......... .......... .AACACTATT AGAAACG 617 hqPGS_C06HBa0153O03.1-1-_SGN-E389834+ (3787 3249) ******************************************************************************** EST sequence 126 +strand 714 n (File: SGN-E390013+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACNAAG ATTTTCTCAA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA AAAA Predicted gene structure (within gDNA segment 4702 to 1): Exon 1 3787 3249 ( 539 n); cDNA 2 557 ( 556 n); score: 0.776 Intron 1 3248 2526 ( 723 n); Pd: 0.252 (s: 0.90), Pa: 0.000 (s: 0.65) Exon 2 2525 2480 ( 46 n); cDNA 558 601 ( 44 n); score: 0.652 Intron 2 2479 1979 ( 501 n); Pd: 0.000 (s: 0.65), Pa: 0.295 (s: 0) Exon 3 1978 1970 ( 9 n); cDNA 602 610 ( 9 n); score: 0.667 PPA cDNA 699 714 MATCH C06HBa0153O03.1-1- SGN-E390013+ 0.776 594 0.832 C PGS_C06HBa0153O03.1-1-_SGN-E390013+ (3787 3249,2525 2480,1978 1970) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT CTTTGTTCTT TCTACTTTTC TTATTCAAAC CCTCTTTCTT 3728 ||| | | ||||||||| | ||||||| | | ||||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAACAA AAGATGGCAA TAATAACTCA CTAATTAACT 3668 |||||||||| |||||||||| ||||||| || ||||||| || ||||||| || |||||| ||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 TAAGGTTACC TCTTTTAACC CCCAAGTAAT TAGACTTATT AAAATTAACC CACTAACTTT 3608 ||||||||| |||||||||| |||| ||||| |||||||||| || || |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT CCAAAACGCC CCTTAAAATA ATTACAGAAA TCTGACCCAG 3548 |||||||||| ||||||||| |||||||| | |||||||| || ||||| || ||||||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA CGCAGCCTGT GACGGCCCGT CGCGCCTGCG ACGGTCCATT CTGCTGCTCC 3488 ||||||||| |||| ||||| || ||||||| || ||||||| ||||||| | |||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCCGAGACTC AATTT-CTCT GAAGAGTCTG T-A------- --ACG---GT 3442 ||| || || || ||||||| ||||| | | ||||||||| | | ||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 -----T-CGT CCTGCCATTC CGTTACGAAG TTCAGAAAGT CGA-TTTCAG TACCCAATTT 3389 | ||| | |||||||| |||||||||| |||||| ||| ||| ||| || ||||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCAA-TT 417 TGAGAA-TTC TAAGTATTTT GGAATGAGAT ATCCTCGACG GTCCGTCGTG CCCATGACGG 3330 | |||| ||| ||||| |||| | || |||| ||||||||| |||| ||||| | |||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGA- CTCCTCGACG GTCCATCGTG CTCATGACGG 476 TCGGTCGTGA GTTCCGTCGT CTTTGCCTGT TTTTCAAGAA ATAAAATCTG CTGCTCGAAA 3270 || |||||| |||||||||| || ||||| ||||| | || |||||||||| || ||| ||| TCCGTCGTGG GTTCCGTCGT CTCAACCTGT TTTTCCAAAA ATAAAATCTG CTACTCAAAA 536 CGACTAAACA GGTCGTTACA AGTATTGCCT AATTCCTTTT AAGGGTTATT TAGGGGTAAA 3210 |||||||||| |||||||||| CGACTAAACA GGTCGTTACA T......... .......... .......... .......... 557 GAACAAGTCT CAAATCATTT TTACAATAAT TTAGAACGCT TAGGTCTTGG GAGAAAAGAG 3150 .......... .......... .......... .......... .......... .......... 557 AAAATAAAAG GGATCAAGAA TTGTCCAAGA ACTCCAACAT TGTTGCGTAG ATTTTATCAA 3090 .......... .......... .......... .......... .......... .......... 557 GGGTTATCCC TATCGAGGTA TGTGAGATCA TTCGGTGTTG GATCCTTTCA TCCACACTCC 3030 .......... .......... .......... .......... .......... .......... 557 AAATTTATTC AATACAATAA ATTAGAGTTT ATATTTAATA AAAATCTTGA TGTTCTTGAT 2970 .......... .......... .......... .......... .......... .......... 557 GAAAGTTATG AAGTTCTTGT TGGTTGAATT ATAGTTGATC TCTAGTAGTT ATGATTCATG 2910 .......... .......... .......... .......... .......... .......... 557 TTTTCGGGTC TAAATATTGG GTAATGAGTT TTTTAGGGAT CCTAAGTTTT TGAGGTGGGA 2850 .......... .......... .......... .......... .......... .......... 557 TCCATGGGAG TTTGGTAGGT TTAGATATGA AAGAGGAGAA GAAAAGTCGG AGGTCCAGTC 2790 .......... .......... .......... .......... .......... .......... 557 AGACAACCCT TGGGGCGCCG CGCCTCTTAG AGCGCCAATA CCCTCGGAGA CCCTTTTCTT 2730 .......... .......... .......... .......... .......... .......... 557 TCCCTATATT TTCGTACTAG TTCCTAAGTG ATGTACCTCT CATTCCTAGT TGACCAACAC 2670 .......... .......... .......... .......... .......... .......... 557 TCTAGAATAA ATATAAACAT CATGAAATCA TCCATAAACA TGAGATTATG ATCCTTGAAT 2610 .......... .......... .......... .......... .......... .......... 557 TCATAATCCA ATTCAAGAGA AACTAAGATC AAAGTCAAGA AAGTAAGCAA TGAAGGGAGT 2550 .......... .......... .......... .......... .......... .......... 557 AAAAGTAAAG CTTTTATTTC AAAGTTCTTA GAATCTTACT TAAATGTTAT AATTTCGTTT 2490 || | || |||| ||||| |||| | || ||| .......... .......... ....TTATGT TCTTCCTACT TAAATATTAT TA-TT-ATTT 591 TAAGGCTCAT GTTCGAGTTA AGGAAAAGAG TAAAGGTTGA ATTCTTTTCT TAAAATGGTA 2430 || | | || TACGATTTAT .......... .......... .......... .......... .......... 601 TATAAGAAAA CTAAGTATTT CCTAAGAGTT AAAGTTTAAA GTTAAGTAAA GAGTAAGAGT 2370 .......... .......... .......... .......... .......... .......... 601 TGAGTTCATT TCTCTAAAGA TATAAGGGGG ACTAAGTACT CCCTAAAGGT TGATAAATGT 2310 .......... .......... .......... .......... .......... .......... 601 TTCACATTTA AGTTAAAAGA AAACTAAGAA GTTTCAAAAG AGTTTTGAAC TAAAAGGGGA 2250 .......... .......... .......... .......... .......... .......... 601 ACATTGATTC CAAAAGGAGA TTTGTAAAGC TAAAGGGTTC AGTAAATTAT CTCAACCCAA 2190 .......... .......... .......... .......... .......... .......... 601 AGGAAGGAAG TTTTGTTAAA AGTATGAGCT AAAGTATGTT TTGGGAGTAG TATTGAGCAC 2130 .......... .......... .......... .......... .......... .......... 601 CGATGTAGGA ATGAGAGTTC AGATAACTCA AGTCCCCATG TAAATCATGT AGCTATCATG 2070 .......... .......... .......... .......... .......... .......... 601 GGTGTTAACA TGTCATACTT TTTAGATGAT CACGTAAGTT TAGCCAGTGG ATCACTAGGT 2010 .......... .......... .......... .......... .......... .......... 601 TGATGATATC CTATGCGACG ACAAATTATA GGACAGTTTT 1970 ||| | || .......... .......... .......... .AACACTATT 610 hqPGS_C06HBa0153O03.1-1-_SGN-E390013+ (3787 3249) ******************************************************************************** EST sequence 113 +strand 710 n (File: SGN-E550065+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCAA CCATGAATTA ATGAAAAAAT TATGCCATGA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTGATTCAT AAGAAAAAAA Predicted gene structure (within gDNA segment 4702 to 1): Exon 1 3787 3249 ( 539 n); cDNA 2 557 ( 556 n); score: 0.776 Intron 1 3248 2526 ( 723 n); Pd: 0.252 (s: 0.90), Pa: 0.000 (s: 0.65) Exon 2 2525 2480 ( 46 n); cDNA 558 601 ( 44 n); score: 0.652 MATCH C06HBa0153O03.1-1- SGN-E550065+ 0.776 585 0.824 C PGS_C06HBa0153O03.1-1-_SGN-E550065+ (3787 3249,2525 2480) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT CTTTGTTCTT TCTACTTTTC TTATTCAAAC CCTCTTTCTT 3728 ||| | | ||||||||| | ||||||| | | ||||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAACAA AAGATGGCAA TAATAACTCA CTAATTAACT 3668 |||||||||| |||||||||| ||||||| || ||||||| || ||||||| || |||||| ||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 TAAGGTTACC TCTTTTAACC CCCAAGTAAT TAGACTTATT AAAATTAACC CACTAACTTT 3608 ||||||||| |||||||||| |||| ||||| |||||||||| || || |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT CCAAAACGCC CCTTAAAATA ATTACAGAAA TCTGACCCAG 3548 |||||||||| ||||||||| |||||||| | |||||||| || ||||| || ||||||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA CGCAGCCTGT GACGGCCCGT CGCGCCTGCG ACGGTCCATT CTGCTGCTCC 3488 ||||||||| |||| ||||| || ||||||| || ||||||| ||||||| | |||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCCGAGACTC AATTT-CTCT GAAGAGTCTG T-A------- --ACG---GT 3442 ||| || || || ||||||| ||||| | | ||||||||| | | ||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 -----T-CGT CCTGCCATTC CGTTACGAAG TTCAGAAAGT CGA-TTTCAG TACCCAATTT 3389 | ||| | |||||||| |||||||||| |||||| ||| ||| ||| || ||||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCAA-TT 417 TGAGAA-TTC TAAGTATTTT GGAATGAGAT ATCCTCGACG GTCCGTCGTG CCCATGACGG 3330 | |||| ||| ||||| |||| | || |||| ||||||||| |||| ||||| | |||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGA- CTCCTCGACG GTCCATCGTG CTCATGACGG 476 TCGGTCGTGA GTTCCGTCGT CTTTGCCTGT TTTTCAAGAA ATAAAATCTG CTGCTCGAAA 3270 || |||||| |||||||||| || ||||| ||||| | || |||||||||| || ||| ||| TCCGTCGTGG GTTCCGTCGT CTCAACCTGT TTTTCCAAAA ATAAAATCTG CTACTCAAAA 536 CGACTAAACA GGTCGTTACA AGTATTGCCT AATTCCTTTT AAGGGTTATT TAGGGGTAAA 3210 |||||||||| |||||||||| CGACTAAACA GGTCGTTACA T......... .......... .......... .......... 557 GAACAAGTCT CAAATCATTT TTACAATAAT TTAGAACGCT TAGGTCTTGG GAGAAAAGAG 3150 .......... .......... .......... .......... .......... .......... 557 AAAATAAAAG GGATCAAGAA TTGTCCAAGA ACTCCAACAT TGTTGCGTAG ATTTTATCAA 3090 .......... .......... .......... .......... .......... .......... 557 GGGTTATCCC TATCGAGGTA TGTGAGATCA TTCGGTGTTG GATCCTTTCA TCCACACTCC 3030 .......... .......... .......... .......... .......... .......... 557 AAATTTATTC AATACAATAA ATTAGAGTTT ATATTTAATA AAAATCTTGA TGTTCTTGAT 2970 .......... .......... .......... .......... .......... .......... 557 GAAAGTTATG AAGTTCTTGT TGGTTGAATT ATAGTTGATC TCTAGTAGTT ATGATTCATG 2910 .......... .......... .......... .......... .......... .......... 557 TTTTCGGGTC TAAATATTGG GTAATGAGTT TTTTAGGGAT CCTAAGTTTT TGAGGTGGGA 2850 .......... .......... .......... .......... .......... .......... 557 TCCATGGGAG TTTGGTAGGT TTAGATATGA AAGAGGAGAA GAAAAGTCGG AGGTCCAGTC 2790 .......... .......... .......... .......... .......... .......... 557 AGACAACCCT TGGGGCGCCG CGCCTCTTAG AGCGCCAATA CCCTCGGAGA CCCTTTTCTT 2730 .......... .......... .......... .......... .......... .......... 557 TCCCTATATT TTCGTACTAG TTCCTAAGTG ATGTACCTCT CATTCCTAGT TGACCAACAC 2670 .......... .......... .......... .......... .......... .......... 557 TCTAGAATAA ATATAAACAT CATGAAATCA TCCATAAACA TGAGATTATG ATCCTTGAAT 2610 .......... .......... .......... .......... .......... .......... 557 TCATAATCCA ATTCAAGAGA AACTAAGATC AAAGTCAAGA AAGTAAGCAA TGAAGGGAGT 2550 .......... .......... .......... .......... .......... .......... 557 AAAAGTAAAG CTTTTATTTC AAAGTTCTTA GAATCTTACT TAAATGTTAT AATTTCGTTT 2490 || | || |||| ||||| |||| | || ||| .......... .......... ....TTATGT TCTTCCTACT TAAATATTAT TA-TT-ATTT 591 TAAGGCTCAT 2480 || | | || TACGATTTAT 601 hqPGS_C06HBa0153O03.1-1-_SGN-E550065+ (3787 3249) ******************************************************************************** EST sequence 116 +strand 709 n (File: SGN-E550207+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTNCA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAA Predicted gene structure (within gDNA segment 4702 to 1): Exon 1 3787 3249 ( 539 n); cDNA 2 557 ( 556 n); score: 0.776 Intron 1 3248 2526 ( 723 n); Pd: 0.252 (s: 0.90), Pa: 0.000 (s: 0.65) Exon 2 2525 2480 ( 46 n); cDNA 558 601 ( 44 n); score: 0.652 PPA cDNA 699 709 MATCH C06HBa0153O03.1-1- SGN-E550207+ 0.776 585 0.825 C PGS_C06HBa0153O03.1-1-_SGN-E550207+ (3787 3249,2525 2480) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT CTTTGTTCTT TCTACTTTTC TTATTCAAAC CCTCTTTCTT 3728 ||| | | ||||||||| | ||||||| | | ||||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAACAA AAGATGGCAA TAATAACTCA CTAATTAACT 3668 |||||||||| |||||||||| ||||||| || ||||||| || ||||||| || |||||| ||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 TAAGGTTACC TCTTTTAACC CCCAAGTAAT TAGACTTATT AAAATTAACC CACTAACTTT 3608 ||||||||| |||||||||| |||| ||||| |||||||||| || || |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT CCAAAACGCC CCTTAAAATA ATTACAGAAA TCTGACCCAG 3548 |||||||||| ||||||||| |||||||| | |||||||| || ||||| || ||||||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA CGCAGCCTGT GACGGCCCGT CGCGCCTGCG ACGGTCCATT CTGCTGCTCC 3488 ||||||||| |||| ||||| || ||||||| || ||||||| ||||||| | |||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCCGAGACTC AATTT-CTCT GAAGAGTCTG T-A------- --ACG---GT 3442 ||| || || || ||||||| ||||| | | ||||||||| | | ||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 -----T-CGT CCTGCCATTC CGTTACGAAG TTCAGAAAGT CGA-TTTCAG TACCCAATTT 3389 | ||| | |||||||| |||||||||| |||||| ||| ||| ||| || ||||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCAA-TT 417 TGAGAA-TTC TAAGTATTTT GGAATGAGAT ATCCTCGACG GTCCGTCGTG CCCATGACGG 3330 | |||| ||| ||||| |||| | || |||| ||||||||| |||| ||||| | |||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGA- CTCCTCGACG GTCCATCGTG CTCATGACGG 476 TCGGTCGTGA GTTCCGTCGT CTTTGCCTGT TTTTCAAGAA ATAAAATCTG CTGCTCGAAA 3270 || |||||| |||||||||| || ||||| ||||| | || |||||||||| || ||| ||| TCCGTCGTGG GTTCCGTCGT CTCAACCTGT TTTTCCAAAA ATAAAATCTG CTACTCAAAA 536 CGACTAAACA GGTCGTTACA AGTATTGCCT AATTCCTTTT AAGGGTTATT TAGGGGTAAA 3210 |||||||||| |||||||||| CGACTAAACA GGTCGTTACA T......... .......... .......... .......... 557 GAACAAGTCT CAAATCATTT TTACAATAAT TTAGAACGCT TAGGTCTTGG GAGAAAAGAG 3150 .......... .......... .......... .......... .......... .......... 557 AAAATAAAAG GGATCAAGAA TTGTCCAAGA ACTCCAACAT TGTTGCGTAG ATTTTATCAA 3090 .......... .......... .......... .......... .......... .......... 557 GGGTTATCCC TATCGAGGTA TGTGAGATCA TTCGGTGTTG GATCCTTTCA TCCACACTCC 3030 .......... .......... .......... .......... .......... .......... 557 AAATTTATTC AATACAATAA ATTAGAGTTT ATATTTAATA AAAATCTTGA TGTTCTTGAT 2970 .......... .......... .......... .......... .......... .......... 557 GAAAGTTATG AAGTTCTTGT TGGTTGAATT ATAGTTGATC TCTAGTAGTT ATGATTCATG 2910 .......... .......... .......... .......... .......... .......... 557 TTTTCGGGTC TAAATATTGG GTAATGAGTT TTTTAGGGAT CCTAAGTTTT TGAGGTGGGA 2850 .......... .......... .......... .......... .......... .......... 557 TCCATGGGAG TTTGGTAGGT TTAGATATGA AAGAGGAGAA GAAAAGTCGG AGGTCCAGTC 2790 .......... .......... .......... .......... .......... .......... 557 AGACAACCCT TGGGGCGCCG CGCCTCTTAG AGCGCCAATA CCCTCGGAGA CCCTTTTCTT 2730 .......... .......... .......... .......... .......... .......... 557 TCCCTATATT TTCGTACTAG TTCCTAAGTG ATGTACCTCT CATTCCTAGT TGACCAACAC 2670 .......... .......... .......... .......... .......... .......... 557 TCTAGAATAA ATATAAACAT CATGAAATCA TCCATAAACA TGAGATTATG ATCCTTGAAT 2610 .......... .......... .......... .......... .......... .......... 557 TCATAATCCA ATTCAAGAGA AACTAAGATC AAAGTCAAGA AAGTAAGCAA TGAAGGGAGT 2550 .......... .......... .......... .......... .......... .......... 557 AAAAGTAAAG CTTTTATTTC AAAGTTCTTA GAATCTTACT TAAATGTTAT AATTTCGTTT 2490 || | || |||| ||||| |||| | || ||| .......... .......... ....TTATGT TCTTCCTACT TAAATATTAT TA-TT-ATTT 591 TAAGGCTCAT 2480 || | | || TACGATTTAT 601 hqPGS_C06HBa0153O03.1-1-_SGN-E550207+ (3787 3249) ******************************************************************************** EST sequence 118 +strand 715 n (File: SGN-E550335+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAATCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT 481 CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT CAAAACGACT 541 AAACAGGTCG TTACATTTAT GTTCTTCCTA CTTAAATATT ATTATTATTT TACGATTTAT 601 AACACTATTA GAAACAAAGA TTTTCTCNAC CATGAATTAA TGAAAAAATT ATGCCATAAA 661 ATATAAAAAA TTTACTCATT TTTCATTGAG CTAATTCATA AAAAAAAAAA AAAAA Predicted gene structure (within gDNA segment 4872 to 1): Exon 1 3787 3249 ( 539 n); cDNA 2 556 ( 555 n); score: 0.772 Intron 1 3248 2526 ( 723 n); Pd: 0.252 (s: 0.90), Pa: 0.000 (s: 0.65) Exon 2 2525 2480 ( 46 n); cDNA 557 600 ( 44 n); score: 0.652 PPA cDNA 698 715 MATCH C06HBa0153O03.1-1- SGN-E550335+ 0.772 585 0.818 C PGS_C06HBa0153O03.1-1-_SGN-E550335+ (3787 3249,2525 2480) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT CTTTGTTCTT TCTACTTTTC TTATTCAAAC CCTCTTTCTT 3728 ||| | | ||||||||| | ||||||| | | ||||| |||||| ||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTC-AAC CCTCCTTCTT 59 TTACCCTAAT TAGCATATAA TTAAGAACAA AAGATGGCAA TAATAACTCA CTAATTAACT 3668 |||||||||| |||||||||| ||||||| || ||||||| || ||||||| || |||||| ||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 118 TAAGGTTACC TCTTTTAACC CCCAAGTAAT TAGACTTATT AAAATTAACC CACTAACTTT 3608 ||||||||| |||||||||| |||| ||||| |||||||||| || || |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 178 ATAATTAAAG CAGGAATAGT CCAAAACGCC CCTTAAAATA ATTACAGAAA TCTGACCCAG 3548 |||||||||| ||||||||| |||||||| | |||||||| || ||||| || ||||||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 238 CCTGGGATTA CGCAGCCTGT GACGGCCCGT CGCGCCTGCG ACGGTCCATT CTGCTGCTCC 3488 ||||||||| |||| ||||| || ||||||| || ||||||| ||||||| | |||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 297 GTCACAGAGT TCCGAGACTC AATTT-C-TC --TGAA---- -GA--GT-C- T-------GT 3448 ||| || || || ||||||| ||||| | | ||| || || | | || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAATCTG TGACGGTCCG TCACGCCCGT 357 AACGGTTCGT CCTGCCATTC CGTTACGAAG TTCAGAAAGT CGA-TTTCAG TACCCAATTT 3389 ||||| ||| | |||||||| |||||||||| |||||| ||| ||| ||| || ||||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCAA-TT 416 TGAGAA-TTC TAAGTATTTT GGAATGAGAT ATCCTCGACG GTCCGTCGTG CCCATGACGG 3330 | |||| ||| ||||| |||| | || |||| ||||||||| |||| ||||| | |||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGA- CTCCTCGACG GTCCATCGTG CTCATGACGG 475 TCGGTCGTGA GTTCCGTCGT CTTTGCCTGT TTTTCAAGAA ATAAAATCTG CTGCTCGAAA 3270 || |||||| |||||||||| || ||||| ||||| | || |||||||||| || ||| ||| TCCGTCGTGG GTTCCGTCGT CTCAACCTGT TTTTCCAAAA ATAAAATCTG CTACTCAAAA 535 CGACTAAACA GGTCGTTACA AGTATTGCCT AATTCCTTTT AAGGGTTATT TAGGGGTAAA 3210 |||||||||| |||||||||| CGACTAAACA GGTCGTTACA T......... .......... .......... .......... 556 GAACAAGTCT CAAATCATTT TTACAATAAT TTAGAACGCT TAGGTCTTGG GAGAAAAGAG 3150 .......... .......... .......... .......... .......... .......... 556 AAAATAAAAG GGATCAAGAA TTGTCCAAGA ACTCCAACAT TGTTGCGTAG ATTTTATCAA 3090 .......... .......... .......... .......... .......... .......... 556 GGGTTATCCC TATCGAGGTA TGTGAGATCA TTCGGTGTTG GATCCTTTCA TCCACACTCC 3030 .......... .......... .......... .......... .......... .......... 556 AAATTTATTC AATACAATAA ATTAGAGTTT ATATTTAATA AAAATCTTGA TGTTCTTGAT 2970 .......... .......... .......... .......... .......... .......... 556 GAAAGTTATG AAGTTCTTGT TGGTTGAATT ATAGTTGATC TCTAGTAGTT ATGATTCATG 2910 .......... .......... .......... .......... .......... .......... 556 TTTTCGGGTC TAAATATTGG GTAATGAGTT TTTTAGGGAT CCTAAGTTTT TGAGGTGGGA 2850 .......... .......... .......... .......... .......... .......... 556 TCCATGGGAG TTTGGTAGGT TTAGATATGA AAGAGGAGAA GAAAAGTCGG AGGTCCAGTC 2790 .......... .......... .......... .......... .......... .......... 556 AGACAACCCT TGGGGCGCCG CGCCTCTTAG AGCGCCAATA CCCTCGGAGA CCCTTTTCTT 2730 .......... .......... .......... .......... .......... .......... 556 TCCCTATATT TTCGTACTAG TTCCTAAGTG ATGTACCTCT CATTCCTAGT TGACCAACAC 2670 .......... .......... .......... .......... .......... .......... 556 TCTAGAATAA ATATAAACAT CATGAAATCA TCCATAAACA TGAGATTATG ATCCTTGAAT 2610 .......... .......... .......... .......... .......... .......... 556 TCATAATCCA ATTCAAGAGA AACTAAGATC AAAGTCAAGA AAGTAAGCAA TGAAGGGAGT 2550 .......... .......... .......... .......... .......... .......... 556 AAAAGTAAAG CTTTTATTTC AAAGTTCTTA GAATCTTACT TAAATGTTAT AATTTCGTTT 2490 || | || |||| ||||| |||| | || ||| .......... .......... ....TTATGT TCTTCCTACT TAAATATTAT TA-TT-ATTT 590 TAAGGCTCAT 2480 || | | || TACGATTTAT 600 hqPGS_C06HBa0153O03.1-1-_SGN-E550335+ (3787 3249) ******************************************************************************** EST sequence 130 +strand 717 n (File: SGN-E550484+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCAA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA AAAAAAA Predicted gene structure (within gDNA segment 4702 to 1): Exon 1 3787 3249 ( 539 n); cDNA 2 557 ( 556 n); score: 0.776 Intron 1 3248 2526 ( 723 n); Pd: 0.252 (s: 0.90), Pa: 0.000 (s: 0.65) Exon 2 2525 2480 ( 46 n); cDNA 558 601 ( 44 n); score: 0.652 PPA cDNA 699 717 MATCH C06HBa0153O03.1-1- SGN-E550484+ 0.776 585 0.816 C PGS_C06HBa0153O03.1-1-_SGN-E550484+ (3787 3249,2525 2480) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT CTTTGTTCTT TCTACTTTTC TTATTCAAAC CCTCTTTCTT 3728 ||| | | ||||||||| | ||||||| | | ||||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAACAA AAGATGGCAA TAATAACTCA CTAATTAACT 3668 |||||||||| |||||||||| ||||||| || ||||||| || ||||||| || |||||| ||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 TAAGGTTACC TCTTTTAACC CCCAAGTAAT TAGACTTATT AAAATTAACC CACTAACTTT 3608 ||||||||| |||||||||| |||| ||||| |||||||||| || || |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT CCAAAACGCC CCTTAAAATA ATTACAGAAA TCTGACCCAG 3548 |||||||||| ||||||||| |||||||| | |||||||| || ||||| || ||||||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA CGCAGCCTGT GACGGCCCGT CGCGCCTGCG ACGGTCCATT CTGCTGCTCC 3488 ||||||||| |||| ||||| || ||||||| || ||||||| ||||||| | |||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCCGAGACTC AATTT-CTCT GAAGAGTCTG T-A------- --ACG---GT 3442 ||| || || || ||||||| ||||| | | ||||||||| | | ||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 -----T-CGT CCTGCCATTC CGTTACGAAG TTCAGAAAGT CGA-TTTCAG TACCCAATTT 3389 | ||| | |||||||| |||||||||| |||||| ||| ||| ||| || ||||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCAA-TT 417 TGAGAA-TTC TAAGTATTTT GGAATGAGAT ATCCTCGACG GTCCGTCGTG CCCATGACGG 3330 | |||| ||| ||||| |||| | || |||| ||||||||| |||| ||||| | |||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGA- CTCCTCGACG GTCCATCGTG CTCATGACGG 476 TCGGTCGTGA GTTCCGTCGT CTTTGCCTGT TTTTCAAGAA ATAAAATCTG CTGCTCGAAA 3270 || |||||| |||||||||| || ||||| ||||| | || |||||||||| || ||| ||| TCCGTCGTGG GTTCCGTCGT CTCAACCTGT TTTTCCAAAA ATAAAATCTG CTACTCAAAA 536 CGACTAAACA GGTCGTTACA AGTATTGCCT AATTCCTTTT AAGGGTTATT TAGGGGTAAA 3210 |||||||||| |||||||||| CGACTAAACA GGTCGTTACA T......... .......... .......... .......... 557 GAACAAGTCT CAAATCATTT TTACAATAAT TTAGAACGCT TAGGTCTTGG GAGAAAAGAG 3150 .......... .......... .......... .......... .......... .......... 557 AAAATAAAAG GGATCAAGAA TTGTCCAAGA ACTCCAACAT TGTTGCGTAG ATTTTATCAA 3090 .......... .......... .......... .......... .......... .......... 557 GGGTTATCCC TATCGAGGTA TGTGAGATCA TTCGGTGTTG GATCCTTTCA TCCACACTCC 3030 .......... .......... .......... .......... .......... .......... 557 AAATTTATTC AATACAATAA ATTAGAGTTT ATATTTAATA AAAATCTTGA TGTTCTTGAT 2970 .......... .......... .......... .......... .......... .......... 557 GAAAGTTATG AAGTTCTTGT TGGTTGAATT ATAGTTGATC TCTAGTAGTT ATGATTCATG 2910 .......... .......... .......... .......... .......... .......... 557 TTTTCGGGTC TAAATATTGG GTAATGAGTT TTTTAGGGAT CCTAAGTTTT TGAGGTGGGA 2850 .......... .......... .......... .......... .......... .......... 557 TCCATGGGAG TTTGGTAGGT TTAGATATGA AAGAGGAGAA GAAAAGTCGG AGGTCCAGTC 2790 .......... .......... .......... .......... .......... .......... 557 AGACAACCCT TGGGGCGCCG CGCCTCTTAG AGCGCCAATA CCCTCGGAGA CCCTTTTCTT 2730 .......... .......... .......... .......... .......... .......... 557 TCCCTATATT TTCGTACTAG TTCCTAAGTG ATGTACCTCT CATTCCTAGT TGACCAACAC 2670 .......... .......... .......... .......... .......... .......... 557 TCTAGAATAA ATATAAACAT CATGAAATCA TCCATAAACA TGAGATTATG ATCCTTGAAT 2610 .......... .......... .......... .......... .......... .......... 557 TCATAATCCA ATTCAAGAGA AACTAAGATC AAAGTCAAGA AAGTAAGCAA TGAAGGGAGT 2550 .......... .......... .......... .......... .......... .......... 557 AAAAGTAAAG CTTTTATTTC AAAGTTCTTA GAATCTTACT TAAATGTTAT AATTTCGTTT 2490 || | || |||| ||||| |||| | || ||| .......... .......... ....TTATGT TCTTCCTACT TAAATATTAT TA-TT-ATTT 591 TAAGGCTCAT 2480 || | | || TACGATTTAT 601 hqPGS_C06HBa0153O03.1-1-_SGN-E550484+ (3787 3249) ******************************************************************************** EST sequence 131 +strand 713 n (File: SGN-E550211+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT 481 CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT CAAAACGACT 541 AAACAGGTCG TTACATTTAT GTTCTTCCTA CTTAAATATT ATTATTATTT TACGATTTAT 601 AACACTATTA GAAACAAAGA TTTTCTCAAC CATGAATTAA TGAAAAAATT ATGCCATAAA 661 ATATAAAAAA TTTACTCATT TTTCATTGAG CTAATTCATA AAAAAAAAAA AAA Predicted gene structure (within gDNA segment 4872 to 1): Exon 1 3787 3249 ( 539 n); cDNA 2 556 ( 555 n); score: 0.774 Intron 1 3248 2526 ( 723 n); Pd: 0.252 (s: 0.90), Pa: 0.000 (s: 0.65) Exon 2 2525 2480 ( 46 n); cDNA 557 600 ( 44 n); score: 0.652 PPA cDNA 698 713 MATCH C06HBa0153O03.1-1- SGN-E550211+ 0.774 585 0.820 C PGS_C06HBa0153O03.1-1-_SGN-E550211+ (3787 3249,2525 2480) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT CTTTGTTCTT TCTACTTTTC TTATTCAAAC CCTCTTTCTT 3728 ||| | | ||||||||| | ||||||| | | ||||| |||||| ||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTC-AAC CCTCCTTCTT 59 TTACCCTAAT TAGCATATAA TTAAGAACAA AAGATGGCAA TAATAACTCA CTAATTAACT 3668 |||||||||| |||||||||| ||||||| || ||||||| || ||||||| || |||||| ||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 118 TAAGGTTACC TCTTTTAACC CCCAAGTAAT TAGACTTATT AAAATTAACC CACTAACTTT 3608 ||||||||| |||||||||| |||| ||||| |||||||||| || || |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 178 ATAATTAAAG CAGGAATAGT CCAAAACGCC CCTTAAAATA ATTACAGAAA TCTGACCCAG 3548 |||||||||| ||||||||| |||||||| | |||||||| || ||||| || ||||||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 238 CCTGGGATTA CGCAGCCTGT GACGGCCCGT CGCGCCTGCG ACGGTCCATT CTGCTGCTCC 3488 ||||||||| |||| ||||| || ||||||| || ||||||| ||||||| | |||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 297 GTCACAGAGT TCCGAGACTC AATTT-CTCT GAAGAGTCTG T-A------- --ACG---GT 3442 ||| || || || ||||||| ||||| | | ||||||||| | | ||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 357 -----T-CGT CCTGCCATTC CGTTACGAAG TTCAGAAAGT CGA-TTTCAG TACCCAATTT 3389 | ||| | |||||||| |||||||||| |||||| ||| ||| ||| || ||||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCAA-TT 416 TGAGAA-TTC TAAGTATTTT GGAATGAGAT ATCCTCGACG GTCCGTCGTG CCCATGACGG 3330 | |||| ||| ||||| |||| | || |||| ||||||||| |||| ||||| | |||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGA- CTCCTCGACG GTCCATCGTG CTCATGACGG 475 TCGGTCGTGA GTTCCGTCGT CTTTGCCTGT TTTTCAAGAA ATAAAATCTG CTGCTCGAAA 3270 || |||||| |||||||||| || ||||| ||||| | || |||||||||| || ||| ||| TCCGTCGTGG GTTCCGTCGT CTCAACCTGT TTTTCCAAAA ATAAAATCTG CTACTCAAAA 535 CGACTAAACA GGTCGTTACA AGTATTGCCT AATTCCTTTT AAGGGTTATT TAGGGGTAAA 3210 |||||||||| |||||||||| CGACTAAACA GGTCGTTACA T......... .......... .......... .......... 556 GAACAAGTCT CAAATCATTT TTACAATAAT TTAGAACGCT TAGGTCTTGG GAGAAAAGAG 3150 .......... .......... .......... .......... .......... .......... 556 AAAATAAAAG GGATCAAGAA TTGTCCAAGA ACTCCAACAT TGTTGCGTAG ATTTTATCAA 3090 .......... .......... .......... .......... .......... .......... 556 GGGTTATCCC TATCGAGGTA TGTGAGATCA TTCGGTGTTG GATCCTTTCA TCCACACTCC 3030 .......... .......... .......... .......... .......... .......... 556 AAATTTATTC AATACAATAA ATTAGAGTTT ATATTTAATA AAAATCTTGA TGTTCTTGAT 2970 .......... .......... .......... .......... .......... .......... 556 GAAAGTTATG AAGTTCTTGT TGGTTGAATT ATAGTTGATC TCTAGTAGTT ATGATTCATG 2910 .......... .......... .......... .......... .......... .......... 556 TTTTCGGGTC TAAATATTGG GTAATGAGTT TTTTAGGGAT CCTAAGTTTT TGAGGTGGGA 2850 .......... .......... .......... .......... .......... .......... 556 TCCATGGGAG TTTGGTAGGT TTAGATATGA AAGAGGAGAA GAAAAGTCGG AGGTCCAGTC 2790 .......... .......... .......... .......... .......... .......... 556 AGACAACCCT TGGGGCGCCG CGCCTCTTAG AGCGCCAATA CCCTCGGAGA CCCTTTTCTT 2730 .......... .......... .......... .......... .......... .......... 556 TCCCTATATT TTCGTACTAG TTCCTAAGTG ATGTACCTCT CATTCCTAGT TGACCAACAC 2670 .......... .......... .......... .......... .......... .......... 556 TCTAGAATAA ATATAAACAT CATGAAATCA TCCATAAACA TGAGATTATG ATCCTTGAAT 2610 .......... .......... .......... .......... .......... .......... 556 TCATAATCCA ATTCAAGAGA AACTAAGATC AAAGTCAAGA AAGTAAGCAA TGAAGGGAGT 2550 .......... .......... .......... .......... .......... .......... 556 AAAAGTAAAG CTTTTATTTC AAAGTTCTTA GAATCTTACT TAAATGTTAT AATTTCGTTT 2490 || | || |||| ||||| |||| | || ||| .......... .......... ....TTATGT TCTTCCTACT TAAATATTAT TA-TT-ATTT 590 TAAGGCTCAT 2480 || | | || TACGATTTAT 600 hqPGS_C06HBa0153O03.1-1-_SGN-E550211+ (3787 3249) ******************************************************************************** EST sequence 132 +strand 713 n (File: SGN-E550464+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GNTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCTA CCATGAATTA ATGAAAAATT ATGCCATAAG 661 ATATAAAAAA TTTACTCATT TTTCATTGAG CTAATTCATA AAAAAAAAAA AAA Predicted gene structure (within gDNA segment 4702 to 1): Exon 1 3787 3249 ( 539 n); cDNA 2 557 ( 556 n); score: 0.774 Intron 1 3248 2526 ( 723 n); Pd: 0.252 (s: 0.88), Pa: 0.000 (s: 0.65) Exon 2 2525 2480 ( 46 n); cDNA 558 601 ( 44 n); score: 0.652 PPA cDNA 698 713 MATCH C06HBa0153O03.1-1- SGN-E550464+ 0.774 585 0.820 C PGS_C06HBa0153O03.1-1-_SGN-E550464+ (3787 3249,2525 2480) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT CTTTGTTCTT TCTACTTTTC TTATTCAAAC CCTCTTTCTT 3728 ||| | | ||||||||| | ||||||| | | ||||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAACAA AAGATGGCAA TAATAACTCA CTAATTAACT 3668 |||||||||| |||||||||| ||||||| || ||||||| || ||||||| || |||||| ||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 TAAGGTTACC TCTTTTAACC CCCAAGTAAT TAGACTTATT AAAATTAACC CACTAACTTT 3608 ||||||||| |||||||||| |||| ||||| |||||||||| || || |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT CCAAAACGCC CCTTAAAATA ATTACAGAAA TCTGACCCAG 3548 |||||||||| ||||||||| |||||||| | |||||||| || ||||| || ||||||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA CGCAGCCTGT GACGGCCCGT CGCGCCTGCG ACGGTCCATT CTGCTGCTCC 3488 ||||||||| |||| ||||| || ||||||| || ||||||| ||||||| | |||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCCGAGACTC AATTT-CTCT GAAGAGTCTG T-A------- --ACG---GT 3442 ||| || || || ||||||| ||||| | | ||||||||| | | ||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 -----T-CGT CCTGCCATTC CGTTACGAAG TTCAGAAAGT CGA-TTTCAG TACCCAATTT 3389 | ||| | |||||||| |||||||||| |||||| ||| ||| ||| || ||||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCAA-TT 417 TGAGAA-TTC TAAGTATTTT GGAATGAGAT ATCCTCGACG GTCCGTCGTG CCCATGACGG 3330 | |||| ||| ||||| |||| | || |||| ||||||||| |||| ||||| | |||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGA- CTCCTCGACG GTCCATCGTG CTCATGACGG 476 TCGGTCGTGA GTTCCGTCGT CTTTGCCTGT TTTTCAAGAA ATAAAATCTG CTGCTCGAAA 3270 || |||||| |||||||||| || ||||| ||||| | || |||||||||| || ||| ||| TCCGTCGTGG GTTCCGTCGT CTCAACCTGT TTTTCCAAAA ATAAAATCTG CTACTCAAAA 536 CGACTAAACA GGTCGTTACA AGTATTGCCT AATTCCTTTT AAGGGTTATT TAGGGGTAAA 3210 |||||||||| ||||| |||| CGACTAAACA GGTCGNTACA T......... .......... .......... .......... 557 GAACAAGTCT CAAATCATTT TTACAATAAT TTAGAACGCT TAGGTCTTGG GAGAAAAGAG 3150 .......... .......... .......... .......... .......... .......... 557 AAAATAAAAG GGATCAAGAA TTGTCCAAGA ACTCCAACAT TGTTGCGTAG ATTTTATCAA 3090 .......... .......... .......... .......... .......... .......... 557 GGGTTATCCC TATCGAGGTA TGTGAGATCA TTCGGTGTTG GATCCTTTCA TCCACACTCC 3030 .......... .......... .......... .......... .......... .......... 557 AAATTTATTC AATACAATAA ATTAGAGTTT ATATTTAATA AAAATCTTGA TGTTCTTGAT 2970 .......... .......... .......... .......... .......... .......... 557 GAAAGTTATG AAGTTCTTGT TGGTTGAATT ATAGTTGATC TCTAGTAGTT ATGATTCATG 2910 .......... .......... .......... .......... .......... .......... 557 TTTTCGGGTC TAAATATTGG GTAATGAGTT TTTTAGGGAT CCTAAGTTTT TGAGGTGGGA 2850 .......... .......... .......... .......... .......... .......... 557 TCCATGGGAG TTTGGTAGGT TTAGATATGA AAGAGGAGAA GAAAAGTCGG AGGTCCAGTC 2790 .......... .......... .......... .......... .......... .......... 557 AGACAACCCT TGGGGCGCCG CGCCTCTTAG AGCGCCAATA CCCTCGGAGA CCCTTTTCTT 2730 .......... .......... .......... .......... .......... .......... 557 TCCCTATATT TTCGTACTAG TTCCTAAGTG ATGTACCTCT CATTCCTAGT TGACCAACAC 2670 .......... .......... .......... .......... .......... .......... 557 TCTAGAATAA ATATAAACAT CATGAAATCA TCCATAAACA TGAGATTATG ATCCTTGAAT 2610 .......... .......... .......... .......... .......... .......... 557 TCATAATCCA ATTCAAGAGA AACTAAGATC AAAGTCAAGA AAGTAAGCAA TGAAGGGAGT 2550 .......... .......... .......... .......... .......... .......... 557 AAAAGTAAAG CTTTTATTTC AAAGTTCTTA GAATCTTACT TAAATGTTAT AATTTCGTTT 2490 || | || |||| ||||| |||| | || ||| .......... .......... ....TTATGT TCTTCCTACT TAAATATTAT TA-TT-ATTT 591 TAAGGCTCAT 2480 || | | || TACGATTTAT 601 hqPGS_C06HBa0153O03.1-1-_SGN-E550464+ (3787 3249) ******************************************************************************** EST sequence 133 +strand 713 n (File: SGN-E549941+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA TATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCGA CCNATGATTA ATGAAAAATT ATGCCATCAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA AAA Predicted gene structure (within gDNA segment 4702 to 1): Exon 1 3787 3249 ( 539 n); cDNA 2 557 ( 556 n); score: 0.774 Intron 1 3248 2526 ( 723 n); Pd: 0.252 (s: 0.88), Pa: 0.000 (s: 0.65) Exon 2 2525 2480 ( 46 n); cDNA 558 601 ( 44 n); score: 0.652 PPA cDNA 699 713 MATCH C06HBa0153O03.1-1- SGN-E549941+ 0.774 585 0.820 C PGS_C06HBa0153O03.1-1-_SGN-E549941+ (3787 3249,2525 2480) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT CTTTGTTCTT TCTACTTTTC TTATTCAAAC CCTCTTTCTT 3728 ||| | | ||||||||| | ||||||| | | ||||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAACAA AAGATGGCAA TAATAACTCA CTAATTAACT 3668 |||||||||| |||||||||| ||||||| || ||||||| || ||||||| || |||||| ||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 TAAGGTTACC TCTTTTAACC CCCAAGTAAT TAGACTTATT AAAATTAACC CACTAACTTT 3608 ||||||||| |||||||||| |||| ||||| |||||||||| || || |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT CCAAAACGCC CCTTAAAATA ATTACAGAAA TCTGACCCAG 3548 |||||||||| ||||||||| |||||||| | |||||||| || ||||| || ||||||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA CGCAGCCTGT GACGGCCCGT CGCGCCTGCG ACGGTCCATT CTGCTGCTCC 3488 ||||||||| |||| ||||| || ||||||| || ||||||| ||||||| | |||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCCGAGACTC AATTT-CTCT GAAGAGTCTG T-A------- --ACG---GT 3442 ||| || || || ||||||| ||||| | | ||||||||| | | ||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 -----T-CGT CCTGCCATTC CGTTACGAAG TTCAGAAAGT CGA-TTTCAG TACCCAATTT 3389 | ||| | |||||||| |||||||||| |||||| ||| ||| ||| || ||||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCAA-TT 417 TGAGAA-TTC TAAGTATTTT GGAATGAGAT ATCCTCGACG GTCCGTCGTG CCCATGACGG 3330 | |||| ||| ||||| |||| | || |||| ||||||||| |||| ||||| | |||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGA- CTCCTCGACG GTCCATCGTG CTCATGACGG 476 TCGGTCGTGA GTTCCGTCGT CTTTGCCTGT TTTTCAAGAA ATAAAATCTG CTGCTCGAAA 3270 || |||||| |||||||||| || ||||| ||||| | || |||| ||||| || ||| ||| TCCGTCGTGG GTTCCGTCGT CTCAACCTGT TTTTCCAAAA ATAATATCTG CTACTCAAAA 536 CGACTAAACA GGTCGTTACA AGTATTGCCT AATTCCTTTT AAGGGTTATT TAGGGGTAAA 3210 |||||||||| |||||||||| CGACTAAACA GGTCGTTACA T......... .......... .......... .......... 557 GAACAAGTCT CAAATCATTT TTACAATAAT TTAGAACGCT TAGGTCTTGG GAGAAAAGAG 3150 .......... .......... .......... .......... .......... .......... 557 AAAATAAAAG GGATCAAGAA TTGTCCAAGA ACTCCAACAT TGTTGCGTAG ATTTTATCAA 3090 .......... .......... .......... .......... .......... .......... 557 GGGTTATCCC TATCGAGGTA TGTGAGATCA TTCGGTGTTG GATCCTTTCA TCCACACTCC 3030 .......... .......... .......... .......... .......... .......... 557 AAATTTATTC AATACAATAA ATTAGAGTTT ATATTTAATA AAAATCTTGA TGTTCTTGAT 2970 .......... .......... .......... .......... .......... .......... 557 GAAAGTTATG AAGTTCTTGT TGGTTGAATT ATAGTTGATC TCTAGTAGTT ATGATTCATG 2910 .......... .......... .......... .......... .......... .......... 557 TTTTCGGGTC TAAATATTGG GTAATGAGTT TTTTAGGGAT CCTAAGTTTT TGAGGTGGGA 2850 .......... .......... .......... .......... .......... .......... 557 TCCATGGGAG TTTGGTAGGT TTAGATATGA AAGAGGAGAA GAAAAGTCGG AGGTCCAGTC 2790 .......... .......... .......... .......... .......... .......... 557 AGACAACCCT TGGGGCGCCG CGCCTCTTAG AGCGCCAATA CCCTCGGAGA CCCTTTTCTT 2730 .......... .......... .......... .......... .......... .......... 557 TCCCTATATT TTCGTACTAG TTCCTAAGTG ATGTACCTCT CATTCCTAGT TGACCAACAC 2670 .......... .......... .......... .......... .......... .......... 557 TCTAGAATAA ATATAAACAT CATGAAATCA TCCATAAACA TGAGATTATG ATCCTTGAAT 2610 .......... .......... .......... .......... .......... .......... 557 TCATAATCCA ATTCAAGAGA AACTAAGATC AAAGTCAAGA AAGTAAGCAA TGAAGGGAGT 2550 .......... .......... .......... .......... .......... .......... 557 AAAAGTAAAG CTTTTATTTC AAAGTTCTTA GAATCTTACT TAAATGTTAT AATTTCGTTT 2490 || | || |||| ||||| |||| | || ||| .......... .......... ....TTATGT TCTTCCTACT TAAATATTAT TA-TT-ATTT 591 TAAGGCTCAT 2480 || | | || TACGATTTAT 601 hqPGS_C06HBa0153O03.1-1-_SGN-E549941+ (3787 3249) ******************************************************************************** EST sequence 134 +strand 714 n (File: SGN-E550025+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCAA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA AAAA Predicted gene structure (within gDNA segment 4702 to 1): Exon 1 3787 3249 ( 539 n); cDNA 2 557 ( 556 n); score: 0.776 Intron 1 3248 2526 ( 723 n); Pd: 0.252 (s: 0.90), Pa: 0.000 (s: 0.65) Exon 2 2525 2480 ( 46 n); cDNA 558 601 ( 44 n); score: 0.652 PPA cDNA 699 714 MATCH C06HBa0153O03.1-1- SGN-E550025+ 0.776 585 0.819 C PGS_C06HBa0153O03.1-1-_SGN-E550025+ (3787 3249,2525 2480) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT CTTTGTTCTT TCTACTTTTC TTATTCAAAC CCTCTTTCTT 3728 ||| | | ||||||||| | ||||||| | | ||||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAACAA AAGATGGCAA TAATAACTCA CTAATTAACT 3668 |||||||||| |||||||||| ||||||| || ||||||| || ||||||| || |||||| ||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 TAAGGTTACC TCTTTTAACC CCCAAGTAAT TAGACTTATT AAAATTAACC CACTAACTTT 3608 ||||||||| |||||||||| |||| ||||| |||||||||| || || |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT CCAAAACGCC CCTTAAAATA ATTACAGAAA TCTGACCCAG 3548 |||||||||| ||||||||| |||||||| | |||||||| || ||||| || ||||||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA CGCAGCCTGT GACGGCCCGT CGCGCCTGCG ACGGTCCATT CTGCTGCTCC 3488 ||||||||| |||| ||||| || ||||||| || ||||||| ||||||| | |||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCCGAGACTC AATTT-CTCT GAAGAGTCTG T-A------- --ACG---GT 3442 ||| || || || ||||||| ||||| | | ||||||||| | | ||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 -----T-CGT CCTGCCATTC CGTTACGAAG TTCAGAAAGT CGA-TTTCAG TACCCAATTT 3389 | ||| | |||||||| |||||||||| |||||| ||| ||| ||| || ||||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCAA-TT 417 TGAGAA-TTC TAAGTATTTT GGAATGAGAT ATCCTCGACG GTCCGTCGTG CCCATGACGG 3330 | |||| ||| ||||| |||| | || |||| ||||||||| |||| ||||| | |||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGA- CTCCTCGACG GTCCATCGTG CTCATGACGG 476 TCGGTCGTGA GTTCCGTCGT CTTTGCCTGT TTTTCAAGAA ATAAAATCTG CTGCTCGAAA 3270 || |||||| |||||||||| || ||||| ||||| | || |||||||||| || ||| ||| TCCGTCGTGG GTTCCGTCGT CTCAACCTGT TTTTCCAAAA ATAAAATCTG CTACTCAAAA 536 CGACTAAACA GGTCGTTACA AGTATTGCCT AATTCCTTTT AAGGGTTATT TAGGGGTAAA 3210 |||||||||| |||||||||| CGACTAAACA GGTCGTTACA T......... .......... .......... .......... 557 GAACAAGTCT CAAATCATTT TTACAATAAT TTAGAACGCT TAGGTCTTGG GAGAAAAGAG 3150 .......... .......... .......... .......... .......... .......... 557 AAAATAAAAG GGATCAAGAA TTGTCCAAGA ACTCCAACAT TGTTGCGTAG ATTTTATCAA 3090 .......... .......... .......... .......... .......... .......... 557 GGGTTATCCC TATCGAGGTA TGTGAGATCA TTCGGTGTTG GATCCTTTCA TCCACACTCC 3030 .......... .......... .......... .......... .......... .......... 557 AAATTTATTC AATACAATAA ATTAGAGTTT ATATTTAATA AAAATCTTGA TGTTCTTGAT 2970 .......... .......... .......... .......... .......... .......... 557 GAAAGTTATG AAGTTCTTGT TGGTTGAATT ATAGTTGATC TCTAGTAGTT ATGATTCATG 2910 .......... .......... .......... .......... .......... .......... 557 TTTTCGGGTC TAAATATTGG GTAATGAGTT TTTTAGGGAT CCTAAGTTTT TGAGGTGGGA 2850 .......... .......... .......... .......... .......... .......... 557 TCCATGGGAG TTTGGTAGGT TTAGATATGA AAGAGGAGAA GAAAAGTCGG AGGTCCAGTC 2790 .......... .......... .......... .......... .......... .......... 557 AGACAACCCT TGGGGCGCCG CGCCTCTTAG AGCGCCAATA CCCTCGGAGA CCCTTTTCTT 2730 .......... .......... .......... .......... .......... .......... 557 TCCCTATATT TTCGTACTAG TTCCTAAGTG ATGTACCTCT CATTCCTAGT TGACCAACAC 2670 .......... .......... .......... .......... .......... .......... 557 TCTAGAATAA ATATAAACAT CATGAAATCA TCCATAAACA TGAGATTATG ATCCTTGAAT 2610 .......... .......... .......... .......... .......... .......... 557 TCATAATCCA ATTCAAGAGA AACTAAGATC AAAGTCAAGA AAGTAAGCAA TGAAGGGAGT 2550 .......... .......... .......... .......... .......... .......... 557 AAAAGTAAAG CTTTTATTTC AAAGTTCTTA GAATCTTACT TAAATGTTAT AATTTCGTTT 2490 || | || |||| ||||| |||| | || ||| .......... .......... ....TTATGT TCTTCCTACT TAAATATTAT TA-TT-ATTT 591 TAAGGCTCAT 2480 || | | || TACGATTTAT 601 hqPGS_C06HBa0153O03.1-1-_SGN-E550025+ (3787 3249) ******************************************************************************** EST sequence 170 +strand 711 n (File: SGN-E396039+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AAAAACAAAG ATTTTCTCCA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AAATAAAAAA AATTTACTCA TTTTTTCTTG GAGCTAATTC AAAAAAAAAA A Predicted gene structure (within gDNA segment 4702 to 1): Exon 1 3787 3249 ( 539 n); cDNA 2 557 ( 556 n); score: 0.776 Intron 1 3248 2526 ( 723 n); Pd: 0.252 (s: 0.90), Pa: 0.000 (s: 0.65) Exon 2 2525 2480 ( 46 n); cDNA 558 601 ( 44 n); score: 0.652 PPA cDNA 661 672 MATCH C06HBa0153O03.1-1- SGN-E396039+ 0.776 585 0.823 C PGS_C06HBa0153O03.1-1-_SGN-E396039+ (3787 3249,2525 2480) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT CTTTGTTCTT TCTACTTTTC TTATTCAAAC CCTCTTTCTT 3728 ||| | | ||||||||| | ||||||| | | ||||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAACAA AAGATGGCAA TAATAACTCA CTAATTAACT 3668 |||||||||| |||||||||| ||||||| || ||||||| || ||||||| || |||||| ||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 TAAGGTTACC TCTTTTAACC CCCAAGTAAT TAGACTTATT AAAATTAACC CACTAACTTT 3608 ||||||||| |||||||||| |||| ||||| |||||||||| || || |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT CCAAAACGCC CCTTAAAATA ATTACAGAAA TCTGACCCAG 3548 |||||||||| ||||||||| |||||||| | |||||||| || ||||| || ||||||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA CGCAGCCTGT GACGGCCCGT CGCGCCTGCG ACGGTCCATT CTGCTGCTCC 3488 ||||||||| |||| ||||| || ||||||| || ||||||| ||||||| | |||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCCGAGACTC AATTT-CTCT GAAGAGTCTG T-A------- --ACG---GT 3442 ||| || || || ||||||| ||||| | | ||||||||| | | ||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 -----T-CGT CCTGCCATTC CGTTACGAAG TTCAGAAAGT CGA-TTTCAG TACCCAATTT 3389 | ||| | |||||||| |||||||||| |||||| ||| ||| ||| || ||||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCAA-TT 417 TGAGAA-TTC TAAGTATTTT GGAATGAGAT ATCCTCGACG GTCCGTCGTG CCCATGACGG 3330 | |||| ||| ||||| |||| | || |||| ||||||||| |||| ||||| | |||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGA- CTCCTCGACG GTCCATCGTG CTCATGACGG 476 TCGGTCGTGA GTTCCGTCGT CTTTGCCTGT TTTTCAAGAA ATAAAATCTG CTGCTCGAAA 3270 || |||||| |||||||||| || ||||| ||||| | || |||||||||| || ||| ||| TCCGTCGTGG GTTCCGTCGT CTCAACCTGT TTTTCCAAAA ATAAAATCTG CTACTCAAAA 536 CGACTAAACA GGTCGTTACA AGTATTGCCT AATTCCTTTT AAGGGTTATT TAGGGGTAAA 3210 |||||||||| |||||||||| CGACTAAACA GGTCGTTACA T......... .......... .......... .......... 557 GAACAAGTCT CAAATCATTT TTACAATAAT TTAGAACGCT TAGGTCTTGG GAGAAAAGAG 3150 .......... .......... .......... .......... .......... .......... 557 AAAATAAAAG GGATCAAGAA TTGTCCAAGA ACTCCAACAT TGTTGCGTAG ATTTTATCAA 3090 .......... .......... .......... .......... .......... .......... 557 GGGTTATCCC TATCGAGGTA TGTGAGATCA TTCGGTGTTG GATCCTTTCA TCCACACTCC 3030 .......... .......... .......... .......... .......... .......... 557 AAATTTATTC AATACAATAA ATTAGAGTTT ATATTTAATA AAAATCTTGA TGTTCTTGAT 2970 .......... .......... .......... .......... .......... .......... 557 GAAAGTTATG AAGTTCTTGT TGGTTGAATT ATAGTTGATC TCTAGTAGTT ATGATTCATG 2910 .......... .......... .......... .......... .......... .......... 557 TTTTCGGGTC TAAATATTGG GTAATGAGTT TTTTAGGGAT CCTAAGTTTT TGAGGTGGGA 2850 .......... .......... .......... .......... .......... .......... 557 TCCATGGGAG TTTGGTAGGT TTAGATATGA AAGAGGAGAA GAAAAGTCGG AGGTCCAGTC 2790 .......... .......... .......... .......... .......... .......... 557 AGACAACCCT TGGGGCGCCG CGCCTCTTAG AGCGCCAATA CCCTCGGAGA CCCTTTTCTT 2730 .......... .......... .......... .......... .......... .......... 557 TCCCTATATT TTCGTACTAG TTCCTAAGTG ATGTACCTCT CATTCCTAGT TGACCAACAC 2670 .......... .......... .......... .......... .......... .......... 557 TCTAGAATAA ATATAAACAT CATGAAATCA TCCATAAACA TGAGATTATG ATCCTTGAAT 2610 .......... .......... .......... .......... .......... .......... 557 TCATAATCCA ATTCAAGAGA AACTAAGATC AAAGTCAAGA AAGTAAGCAA TGAAGGGAGT 2550 .......... .......... .......... .......... .......... .......... 557 AAAAGTAAAG CTTTTATTTC AAAGTTCTTA GAATCTTACT TAAATGTTAT AATTTCGTTT 2490 || | || |||| ||||| |||| | || ||| .......... .......... ....TTATGT TCTTCCTACT TAAATATTAT TA-TT-ATTT 591 TAAGGCTCAT 2480 || | | || TACGATTTAT 601 hqPGS_C06HBa0153O03.1-1-_SGN-E396039+ (3787 3249) ******************************************************************************** EST sequence 172 +strand 711 n (File: SGN-E396056+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAA ATTTTCTCAC CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATAATAAAA ATTTACTCAT TTTTTCTTTG AGCTAATTCA TAAAAAAAAA A Predicted gene structure (within gDNA segment 4702 to 1): Exon 1 3787 3249 ( 539 n); cDNA 2 557 ( 556 n); score: 0.776 Intron 1 3248 2526 ( 723 n); Pd: 0.252 (s: 0.90), Pa: 0.000 (s: 0.65) Exon 2 2525 2480 ( 46 n); cDNA 558 601 ( 44 n); score: 0.652 PPA cDNA 700 711 MATCH C06HBa0153O03.1-1- SGN-E396056+ 0.776 585 0.823 C PGS_C06HBa0153O03.1-1-_SGN-E396056+ (3787 3249,2525 2480) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT CTTTGTTCTT TCTACTTTTC TTATTCAAAC CCTCTTTCTT 3728 ||| | | ||||||||| | ||||||| | | ||||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAACAA AAGATGGCAA TAATAACTCA CTAATTAACT 3668 |||||||||| |||||||||| ||||||| || ||||||| || ||||||| || |||||| ||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 TAAGGTTACC TCTTTTAACC CCCAAGTAAT TAGACTTATT AAAATTAACC CACTAACTTT 3608 ||||||||| |||||||||| |||| ||||| |||||||||| || || |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT CCAAAACGCC CCTTAAAATA ATTACAGAAA TCTGACCCAG 3548 |||||||||| ||||||||| |||||||| | |||||||| || ||||| || ||||||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA CGCAGCCTGT GACGGCCCGT CGCGCCTGCG ACGGTCCATT CTGCTGCTCC 3488 ||||||||| |||| ||||| || ||||||| || ||||||| ||||||| | |||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCCGAGACTC AATTT-CTCT GAAGAGTCTG T-A------- --ACG---GT 3442 ||| || || || ||||||| ||||| | | ||||||||| | | ||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 -----T-CGT CCTGCCATTC CGTTACGAAG TTCAGAAAGT CGA-TTTCAG TACCCAATTT 3389 | ||| | |||||||| |||||||||| |||||| ||| ||| ||| || ||||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCAA-TT 417 TGAGAA-TTC TAAGTATTTT GGAATGAGAT ATCCTCGACG GTCCGTCGTG CCCATGACGG 3330 | |||| ||| ||||| |||| | || |||| ||||||||| |||| ||||| | |||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGA- CTCCTCGACG GTCCATCGTG CTCATGACGG 476 TCGGTCGTGA GTTCCGTCGT CTTTGCCTGT TTTTCAAGAA ATAAAATCTG CTGCTCGAAA 3270 || |||||| |||||||||| || ||||| ||||| | || |||||||||| || ||| ||| TCCGTCGTGG GTTCCGTCGT CTCAACCTGT TTTTCCAAAA ATAAAATCTG CTACTCAAAA 536 CGACTAAACA GGTCGTTACA AGTATTGCCT AATTCCTTTT AAGGGTTATT TAGGGGTAAA 3210 |||||||||| |||||||||| CGACTAAACA GGTCGTTACA T......... .......... .......... .......... 557 GAACAAGTCT CAAATCATTT TTACAATAAT TTAGAACGCT TAGGTCTTGG GAGAAAAGAG 3150 .......... .......... .......... .......... .......... .......... 557 AAAATAAAAG GGATCAAGAA TTGTCCAAGA ACTCCAACAT TGTTGCGTAG ATTTTATCAA 3090 .......... .......... .......... .......... .......... .......... 557 GGGTTATCCC TATCGAGGTA TGTGAGATCA TTCGGTGTTG GATCCTTTCA TCCACACTCC 3030 .......... .......... .......... .......... .......... .......... 557 AAATTTATTC AATACAATAA ATTAGAGTTT ATATTTAATA AAAATCTTGA TGTTCTTGAT 2970 .......... .......... .......... .......... .......... .......... 557 GAAAGTTATG AAGTTCTTGT TGGTTGAATT ATAGTTGATC TCTAGTAGTT ATGATTCATG 2910 .......... .......... .......... .......... .......... .......... 557 TTTTCGGGTC TAAATATTGG GTAATGAGTT TTTTAGGGAT CCTAAGTTTT TGAGGTGGGA 2850 .......... .......... .......... .......... .......... .......... 557 TCCATGGGAG TTTGGTAGGT TTAGATATGA AAGAGGAGAA GAAAAGTCGG AGGTCCAGTC 2790 .......... .......... .......... .......... .......... .......... 557 AGACAACCCT TGGGGCGCCG CGCCTCTTAG AGCGCCAATA CCCTCGGAGA CCCTTTTCTT 2730 .......... .......... .......... .......... .......... .......... 557 TCCCTATATT TTCGTACTAG TTCCTAAGTG ATGTACCTCT CATTCCTAGT TGACCAACAC 2670 .......... .......... .......... .......... .......... .......... 557 TCTAGAATAA ATATAAACAT CATGAAATCA TCCATAAACA TGAGATTATG ATCCTTGAAT 2610 .......... .......... .......... .......... .......... .......... 557 TCATAATCCA ATTCAAGAGA AACTAAGATC AAAGTCAAGA AAGTAAGCAA TGAAGGGAGT 2550 .......... .......... .......... .......... .......... .......... 557 AAAAGTAAAG CTTTTATTTC AAAGTTCTTA GAATCTTACT TAAATGTTAT AATTTCGTTT 2490 || | || |||| ||||| |||| | || ||| .......... .......... ....TTATGT TCTTCCTACT TAAATATTAT TA-TT-ATTT 591 TAAGGCTCAT 2480 || | | || TACGATTTAT 601 hqPGS_C06HBa0153O03.1-1-_SGN-E396056+ (3787 3249) ******************************************************************************** EST sequence 114 +strand 726 n (File: SGN-E550322+) 1 TCGCACCAGA TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC 61 CTCCTTCTTT TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT 121 AATTTACTCA AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA 181 CTAACTTTAT AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC 241 CGACCCAGAC TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT 301 GCAGGTCGTC GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA 361 CGCCCGTGAC GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC 421 CCAATTTCAG AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT 481 GACGGTCCGT CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT 541 CAAAACGACT AAACAGGTCG TTACATTTAT GTTCTTCCTA CTTAAATATT ATTATTATTT 601 TACGATTTAT AACACTATTA GAAACAAAGA TTTTCTCAAC CATGAATTAA TGAAAAAATT 661 ATGCCATAAA ATATAAAAAA TTTACTCATT TTTCATTGAG CTAATTCATA AAAAAAAAAA 721 AAAAAC Predicted gene structure (within gDNA segment 4792 to 1): Exon 1 3776 3249 ( 528 n); cDNA 22 566 ( 545 n); score: 0.782 Intron 1 3248 2526 ( 723 n); Pd: 0.252 (s: 0.90), Pa: 0.000 (s: 0.65) Exon 2 2525 2480 ( 46 n); cDNA 567 610 ( 44 n); score: 0.652 PPA cDNA 708 725 MATCH C06HBa0153O03.1-1- SGN-E550322+ 0.782 574 0.791 C PGS_C06HBa0153O03.1-1-_SGN-E550322+ (3776 3249,2525 2480) Alignment (genomic DNA sequence = upper lines): TCTCTCTCTC TTTGTTCTTT CTACTTTTCT TATTCAAACC CTCTTTCTTT TACCCTAATT 3717 ||||||||| | |||||||| | |||||| |||||||||| ||| |||||| |||||||||| TCTCTCTCTT TCTGTTCTTT -TCTTTTTCT TATTCAAACC CTCCTTCTTT TACCCTAATT 80 AGCATATAAT TAAGAACAAA AGATGGCAAT AATAACTCAC TAATTAACTT AAGGTTACCT 3657 |||||||||| |||||| ||| |||||| ||| |||||| ||| ||||| ||| |||||||||| AGCATATAAT TAAGAATAAA AGATGG-AAT AATAACCCAC TAATTTACTC AAGGTTACCT 139 CTTTTAACCC CCAAGTAATT AGACTTATTA AAATTAACCC ACTAACTTTA TAATTAAAGC 3597 |||||||||| ||| |||||| |||||||||| | || ||||| |||||||||| ||||||||| CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA TAATTAAAGT 199 AGGAATAGTC CAAAACGCCC CTTAAAATAA TTACAGAAAT CTGACCCAGC CTGGGATTAC 3537 |||||||||| ||||||| || ||||||| || |||||| | ||||||| |||||||||| AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA CTGGGATTAC 259 GCAGCCTGTG ACGGCCCGTC GCGCCTGCGA CGGTCCATTC TGCTGCTCCG TCACAGAGTT 3477 ||| |||||| | |||||||| | |||||||| |||||| | | ||| | | || || || ||| GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGT-CG TCGCAAGGTT 318 CCGAGACTCA ATTT-CTCTG AAGAGTCTGT -A-------- -ACG---GT- ----T-CGTC 3437 | |||||||| |||| | | |||||||||| | ||| || | |||| CAGAGACTCA ATTTCCACCA AAGAGTCTGT GACGGTCCGT CACGCCCGTG ACGGTCCGTC 378 CTGCCATTCC GTTACGAAGT TCAGAAAGTC GA-TTTCAGT ACCCAATTTT GAGAA-TTCT 3379 ||||||||| |||||||||| ||||| |||| || ||| ||| |||||| ||| |||| |||| GTGCCATTCC GTTACGAAGT TCAGAGAGTC GATTTTTAGT ACCCAA-TTT CAGAATTTCT 437 AAGTATTTTG GAATGAGATA TCCTCGACGG TCCGTCGTGC CCATGACGGT CGGTCGTGAG 3319 |||| ||||| || |||| |||||||||| ||| |||||| ||||||||| | |||||| | AAGTGTTTTG AAACGAGA-C TCCTCGACGG TCCATCGTGC TCATGACGGT CCGTCGTGGG 496 TTCCGTCGTC TTTGCCTGTT TTTCAAGAAA TAAAATCTGC TGCTCGAAAC GACTAAACAG 3259 |||||||||| | |||||| |||| | ||| |||||||||| | ||| |||| |||||||||| TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC GACTAAACAG 556 GTCGTTACAA GTATTGCCTA ATTCCTTTTA AGGGTTATTT AGGGGTAAAG AACAAGTCTC 3199 ||||||||| GTCGTTACAT .......... .......... .......... .......... .......... 566 AAATCATTTT TACAATAATT TAGAACGCTT AGGTCTTGGG AGAAAAGAGA AAATAAAAGG 3139 .......... .......... .......... .......... .......... .......... 566 GATCAAGAAT TGTCCAAGAA CTCCAACATT GTTGCGTAGA TTTTATCAAG GGTTATCCCT 3079 .......... .......... .......... .......... .......... .......... 566 ATCGAGGTAT GTGAGATCAT TCGGTGTTGG ATCCTTTCAT CCACACTCCA AATTTATTCA 3019 .......... .......... .......... .......... .......... .......... 566 ATACAATAAA TTAGAGTTTA TATTTAATAA AAATCTTGAT GTTCTTGATG AAAGTTATGA 2959 .......... .......... .......... .......... .......... .......... 566 AGTTCTTGTT GGTTGAATTA TAGTTGATCT CTAGTAGTTA TGATTCATGT TTTCGGGTCT 2899 .......... .......... .......... .......... .......... .......... 566 AAATATTGGG TAATGAGTTT TTTAGGGATC CTAAGTTTTT GAGGTGGGAT CCATGGGAGT 2839 .......... .......... .......... .......... .......... .......... 566 TTGGTAGGTT TAGATATGAA AGAGGAGAAG AAAAGTCGGA GGTCCAGTCA GACAACCCTT 2779 .......... .......... .......... .......... .......... .......... 566 GGGGCGCCGC GCCTCTTAGA GCGCCAATAC CCTCGGAGAC CCTTTTCTTT CCCTATATTT 2719 .......... .......... .......... .......... .......... .......... 566 TCGTACTAGT TCCTAAGTGA TGTACCTCTC ATTCCTAGTT GACCAACACT CTAGAATAAA 2659 .......... .......... .......... .......... .......... .......... 566 TATAAACATC ATGAAATCAT CCATAAACAT GAGATTATGA TCCTTGAATT CATAATCCAA 2599 .......... .......... .......... .......... .......... .......... 566 TTCAAGAGAA ACTAAGATCA AAGTCAAGAA AGTAAGCAAT GAAGGGAGTA AAAGTAAAGC 2539 .......... .......... .......... .......... .......... .......... 566 TTTTATTTCA AAGTTCTTAG AATCTTACTT AAATGTTATA ATTTCGTTTT AAGGCTCAT 2480 || | || ||||| |||| |||| | || |||| | | | || .......... ...TTATGTT CTTCCTACTT AAATATTATT A-TT-ATTTT ACGATTTAT 610 hqPGS_C06HBa0153O03.1-1-_SGN-E550322+ (3776 3249) ******************************************************************************** EST sequence 62 -strand 674 n (File: SGN-E396057-) 1 TTTTTTATTC AAACCTTCTT TTTTTTACCC TAATTAGCAT ATAATTAAGA ATAAAAGATG 61 GAATAATAAC CCACTAATTT ACTCAAGGTT ACCTCTTTTA ACCCCCAGGT AATTAGACTT 121 ATTAACATAA ACCCACTAAC TTTATAATTA AAGTAGGAAT AGTCCAAAAC GTCCCTTAAA 181 ACGTGTAAAG AAATCCGACC CAGACTGGGA TTACGCAACC TGTGATGGCC CGTCGTGCCT 241 GCGACGGTCC GTCCTGCAGG TCGTCGCAAG GTTCAGAGAC TCAATTTCCA CCAAAGAGTC 301 TGTGACGGTC CGTCACGCCC GTGACGGTCC GTCGTGCCAT TCCGTTACGA AGTTCAGAGA 361 GTCGATTTTT AGTACCCAAT TTCAGAATTT TTAAGTGTTT TGAAACGAGA CTCCTCGACG 421 GTCCATCGTG CTCATGACGG TCCGTCGTGG GTTCCGTCGT CTCAACCTGT TTTTCCAAAA 481 ATAAAATCTG CTACTCAAAA CGACTAAACA GGTCGTTACA TTTATGTTCT TCCTACTTAA 541 ATATTATTAT TATTTTACGA TTTATAACAC TATTAGAAAC AAAGATTTTC TCAACCATGA 601 ATTAATGAAA AAATTATGCC ATAAAATATA AAAAATTTAC TCATTTTTCA TTGAGCTAAT 661 TCATAAAAAA AAAA Predicted gene structure (within gDNA segment 5116 to 1): Exon 1 3751 3249 ( 503 n); cDNA 1 521 ( 521 n); score: 0.775 Intron 1 3248 2526 ( 723 n); Pd: 0.252 (s: 0.90), Pa: 0.000 (s: 0.65) Exon 2 2525 2480 ( 46 n); cDNA 522 565 ( 44 n); score: 0.652 PPA cDNA 663 674 MATCH C06HBa0153O03.1-1- SGN-E396057- 0.775 549 0.815 C PGS_C06HBa0153O03.1-1-_SGN-E396057- (3751 3249,2525 2480) Alignment (genomic DNA sequence = upper lines): TTTCTTATTC AAACCCTCTT TCTTTTACCC TAATTAGCAT ATAATTAAGA ACAAAAGATG 3692 ||| |||||| ||||| |||| | |||||||| |||||||||| |||||||||| | |||||||| TTTTTTATTC AAACCTTCTT TTTTTTACCC TAATTAGCAT ATAATTAAGA ATAAAAGATG 60 GCAATAATAA CTCACTAATT AACTTAAGGT TACCTCTTTT AACCCCCAAG TAATTAGACT 3632 | |||||||| | |||||||| ||| ||||| |||||||||| |||||||| | |||||||||| G-AATAATAA CCCACTAATT TACTCAAGGT TACCTCTTTT AACCCCCAGG TAATTAGACT 119 TATTAAAATT AACCCACTAA CTTTATAATT AAAGCAGGAA TAGTCCAAAA CGCCCCTTAA 3572 |||||| || |||||||||| |||||||||| |||| ||||| |||||||||| || ||||||| TATTAACATA AACCCACTAA CTTTATAATT AAAGTAGGAA TAGTCCAAAA CGTCCCTTAA 179 AATAATTACA GAAATCTGAC CCAGCCTGGG ATTACGCAGC CTGTGACGGC CCGTCGCGCC 3512 || || | |||||| ||| |||| ||||| |||||||| | |||||| ||| |||||| ||| AACGTGTAAA GAAATCCGAC CCAGACTGGG ATTACGCAAC CTGTGATGGC CCGTCGTGCC 239 TGCGACGGTC CATTCTGCTG CTCCGTCACA GAGTTCCGAG ACTCAATTT- CTCTGAAGAG 3453 |||||||||| | | |||| | | |||| || |||| ||| ||||||||| | | ||||| TGCGACGGTC CGTCCTGCAG GT-CGTCGCA AGGTTCAGAG ACTCAATTTC CACCAAAGAG 298 TCTGT-A--- ------ACG- --GT-----T -CGTCCTGCC ATTCCGTTAC GAAGTTCAGA 3412 ||||| | ||| || | |||| |||| |||||||||| |||||||||| TCTGTGACGG TCCGTCACGC CCGTGACGGT CCGTCGTGCC ATTCCGTTAC GAAGTTCAGA 358 AAGTCGA-TT TCAGTACCCA ATTTTGAGAA -TTCTAAGTA TTTTGGAATG AGATATCCTC 3354 |||||| || | |||||||| | ||| |||| || ||||| ||||| || | ||| ||||| GAGTCGATTT TTAGTACCCA A-TTTCAGAA TTTTTAAGTG TTTTGAAACG AGA-CTCCTC 416 GACGGTCCGT CGTGCCCATG ACGGTCGGTC GTGAGTTCCG TCGTCTTTGC CTGTTTTTCA 3294 |||||||| | ||||| |||| |||||| ||| ||| |||||| |||||| | ||||||||| GACGGTCCAT CGTGCTCATG ACGGTCCGTC GTGGGTTCCG TCGTCTCAAC CTGTTTTTCC 476 AGAAATAAAA TCTGCTGCTC GAAACGACTA AACAGGTCGT TACAAGTATT GCCTAATTCC 3234 | |||||||| |||||| ||| ||||||||| |||||||||| |||| AAAAATAAAA TCTGCTACTC AAAACGACTA AACAGGTCGT TACAT..... .......... 521 TTTTAAGGGT TATTTAGGGG TAAAGAACAA GTCTCAAATC ATTTTTACAA TAATTTAGAA 3174 .......... .......... .......... .......... .......... .......... 521 CGCTTAGGTC TTGGGAGAAA AGAGAAAATA AAAGGGATCA AGAATTGTCC AAGAACTCCA 3114 .......... .......... .......... .......... .......... .......... 521 ACATTGTTGC GTAGATTTTA TCAAGGGTTA TCCCTATCGA GGTATGTGAG ATCATTCGGT 3054 .......... .......... .......... .......... .......... .......... 521 GTTGGATCCT TTCATCCACA CTCCAAATTT ATTCAATACA ATAAATTAGA GTTTATATTT 2994 .......... .......... .......... .......... .......... .......... 521 AATAAAAATC TTGATGTTCT TGATGAAAGT TATGAAGTTC TTGTTGGTTG AATTATAGTT 2934 .......... .......... .......... .......... .......... .......... 521 GATCTCTAGT AGTTATGATT CATGTTTTCG GGTCTAAATA TTGGGTAATG AGTTTTTTAG 2874 .......... .......... .......... .......... .......... .......... 521 GGATCCTAAG TTTTTGAGGT GGGATCCATG GGAGTTTGGT AGGTTTAGAT ATGAAAGAGG 2814 .......... .......... .......... .......... .......... .......... 521 AGAAGAAAAG TCGGAGGTCC AGTCAGACAA CCCTTGGGGC GCCGCGCCTC TTAGAGCGCC 2754 .......... .......... .......... .......... .......... .......... 521 AATACCCTCG GAGACCCTTT TCTTTCCCTA TATTTTCGTA CTAGTTCCTA AGTGATGTAC 2694 .......... .......... .......... .......... .......... .......... 521 CTCTCATTCC TAGTTGACCA ACACTCTAGA ATAAATATAA ACATCATGAA ATCATCCATA 2634 .......... .......... .......... .......... .......... .......... 521 AACATGAGAT TATGATCCTT GAATTCATAA TCCAATTCAA GAGAAACTAA GATCAAAGTC 2574 .......... .......... .......... .......... .......... .......... 521 AAGAAAGTAA GCAATGAAGG GAGTAAAAGT AAAGCTTTTA TTTCAAAGTT CTTAGAATCT 2514 || | || .......... .......... .......... .......... ........TT ATGTTCTTCC 533 TACTTAAATG TTATAATTTC GTTTTAAGGC TCAT 2480 ||||||||| |||| | || ||||| | | || TACTTAAATA TTATTA-TT- ATTTTACGAT TTAT 565 hqPGS_C06HBa0153O03.1-1-_SGN-E396057- (3751 3249) ******************************************************************************** EST sequence 73 -strand 658 n (File: SGN-E377132-) 1 TTCCTTCTTT TACCCTAATT AGCATATATT TAAGAATAAA AGATGGAATA ATAACCCACT 61 AATTTACTCA AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA 121 CTAACTTTAT AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC 181 CGACCCAGAC TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT 241 GCAGGTCGTC GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA 301 CGCCCGTGAC GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC 361 CCAATTTCAG AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT 421 GACGGTCCGT CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT 481 CAAAACGACT AAACAGGTCG TTACATTTAT GTTCTTCCTA CTTAAATATT ATTATTATTT 541 TACGATTTAT AACACTATTA GAAACAAAGA TTTTCTCAAC CATGAATTAA TGAAAAAATT 601 ATGCCATAAA ATATAAAAAA TTTACTCATT TTTCATTGAG CTAATTCATA AAAAAAAA Predicted gene structure (within gDNA segment 4966 to 1): Exon 1 3735 3249 ( 487 n); cDNA 2 506 ( 505 n); score: 0.772 Intron 1 3248 2526 ( 723 n); Pd: 0.252 (s: 0.90), Pa: 0.000 (s: 0.65) Exon 2 2525 2480 ( 46 n); cDNA 507 550 ( 44 n); score: 0.652 PPA cDNA 648 658 MATCH C06HBa0153O03.1-1- SGN-E377132- 0.772 533 0.810 C PGS_C06HBa0153O03.1-1-_SGN-E377132- (3735 3249,2525 2480) Alignment (genomic DNA sequence = upper lines): TCTTTCTTTT ACCCTAATTA GCATATAATT AAGAACAAAA GATGGCAATA ATAACTCACT 3676 || ||||||| |||||||||| ||||||| || ||||| |||| ||||| |||| ||||| |||| TCCTTCTTTT ACCCTAATTA GCATATATTT AAGAATAAAA GATGG-AATA ATAACCCACT 60 AATTAACTTA AGGTTACCTC TTTTAACCCC CAAGTAATTA GACTTATTAA AATTAACCCA 3616 |||| ||| | |||||||||| |||||||||| || ||||||| |||||||||| || |||||| AATTTACTCA AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA 120 CTAACTTTAT AATTAAAGCA GGAATAGTCC AAAACGCCCC TTAAAATAAT TACAGAAATC 3556 |||||||||| |||||||| | |||||||||| |||||| ||| |||||| || ||||||| CTAACTTTAT AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC 180 TGACCCAGCC TGGGATTACG CAGCCTGTGA CGGCCCGTCG CGCCTGCGAC GGTCCATTCT 3496 ||||||| | |||||||||| || ||||||| ||||||||| ||||||||| ||||| | || CGACCCAGAC TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT 240 GCTGCTCCGT CACAGAGTTC CGAGACTCAA TTT-CTCTGA AGAGTCTGT- A--------- 3447 || | | ||| | || |||| ||||||||| ||| | | | ||||||||| | GCAGGT-CGT CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC 299 ACG---GT-- ---T-CGTCC TGCCATTCCG TTACGAAGTT CAGAAAGTCG A-TTTCAGTA 3397 ||| || | |||| |||||||||| |||||||||| |||| ||||| | ||| |||| ACGCCCGTGA CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA 359 CCCAATTTTG AGAA-TTCTA AGTATTTTGG AATGAGATAT CCTCGACGGT CCGTCGTGCC 3338 ||||| ||| |||| ||||| ||| ||||| || |||| | |||||||||| || |||||| CCCAA-TTTC AGAATTTCTA AGTGTTTTGA AACGAGA-CT CCTCGACGGT CCATCGTGCT 417 CATGACGGTC GGTCGTGAGT TCCGTCGTCT TTGCCTGTTT TTCAAGAAAT AAAATCTGCT 3278 |||||||||| |||||| || |||||||||| ||||||| ||| | |||| |||||||||| CATGACGGTC CGTCGTGGGT TCCGTCGTCT CAACCTGTTT TTCCAAAAAT AAAATCTGCT 477 GCTCGAAACG ACTAAACAGG TCGTTACAAG TATTGCCTAA TTCCTTTTAA GGGTTATTTA 3218 ||| ||||| |||||||||| |||||||| ACTCAAAACG ACTAAACAGG TCGTTACAT. .......... .......... .......... 506 GGGGTAAAGA ACAAGTCTCA AATCATTTTT ACAATAATTT AGAACGCTTA GGTCTTGGGA 3158 .......... .......... .......... .......... .......... .......... 506 GAAAAGAGAA AATAAAAGGG ATCAAGAATT GTCCAAGAAC TCCAACATTG TTGCGTAGAT 3098 .......... .......... .......... .......... .......... .......... 506 TTTATCAAGG GTTATCCCTA TCGAGGTATG TGAGATCATT CGGTGTTGGA TCCTTTCATC 3038 .......... .......... .......... .......... .......... .......... 506 CACACTCCAA ATTTATTCAA TACAATAAAT TAGAGTTTAT ATTTAATAAA AATCTTGATG 2978 .......... .......... .......... .......... .......... .......... 506 TTCTTGATGA AAGTTATGAA GTTCTTGTTG GTTGAATTAT AGTTGATCTC TAGTAGTTAT 2918 .......... .......... .......... .......... .......... .......... 506 GATTCATGTT TTCGGGTCTA AATATTGGGT AATGAGTTTT TTAGGGATCC TAAGTTTTTG 2858 .......... .......... .......... .......... .......... .......... 506 AGGTGGGATC CATGGGAGTT TGGTAGGTTT AGATATGAAA GAGGAGAAGA AAAGTCGGAG 2798 .......... .......... .......... .......... .......... .......... 506 GTCCAGTCAG ACAACCCTTG GGGCGCCGCG CCTCTTAGAG CGCCAATACC CTCGGAGACC 2738 .......... .......... .......... .......... .......... .......... 506 CTTTTCTTTC CCTATATTTT CGTACTAGTT CCTAAGTGAT GTACCTCTCA TTCCTAGTTG 2678 .......... .......... .......... .......... .......... .......... 506 ACCAACACTC TAGAATAAAT ATAAACATCA TGAAATCATC CATAAACATG AGATTATGAT 2618 .......... .......... .......... .......... .......... .......... 506 CCTTGAATTC ATAATCCAAT TCAAGAGAAA CTAAGATCAA AGTCAAGAAA GTAAGCAATG 2558 .......... .......... .......... .......... .......... .......... 506 AAGGGAGTAA AAGTAAAGCT TTTATTTCAA AGTTCTTAGA ATCTTACTTA AATGTTATAA 2498 || | || |||||| ||| |||| | .......... .......... .......... ..TTATGTTC TTCCTACTTA AATATTATTA 534 TTTCGTTTTA AGGCTCAT 2480 || ||||| | | || -TT-ATTTTA CGATTTAT 550 hqPGS_C06HBa0153O03.1-1-_SGN-E377132- (3735 3249) ******************************************************************************** EST sequence 60 -strand 647 n (File: SGN-E396055-) 1 ATTAGCATAT AATTAAGAAT AAAAAGATGG AAATAATAAC CCACTAATTT ACTCAAGGGT 61 TCCTTTTTTT AACCCCCAGG GTAATTAGAC TTATTAACAT AAACCCCACT AACTTTATAA 121 TTAAAGTAGG AATAGTCCAA AACGTCCCTT AAAACGTGTA AAGAAATCCG ACCCAGACTG 181 GGATTACGCA ACCTGTGATG GCCCGTCGTG CCTGCGACGG TCCGTCCTGC AGGTCGTCGC 241 AAGGTTCAGA GACTCAATTT CCACCAAAGA GTCTGTGACG GTCCGTCACG CCCGTGACGG 301 TCCGTCGTGC CATTCCGTTA CGAAGTTCAG AGAGTCGATT TTTAGTACCC AATTTCAGAA 361 TTTCTAAGTG TTTTGAAACG AGACTCCTCG ACGGTCCATC GTGCTCATGA CGGTCCGTCG 421 TGGGTTCCGT CGTCTCAACC TGTTTTTCCA AAAATAAAAT CTGCTACTCA AAACGACTAA 481 ACAGGTCGTT ACATTTATGT TCTTCCTACT TAAATATTAT TATTATTTTA CGATTTATAA 541 CACTATTAGA AACAAAGATT TTCTCAACCA TGAATTAATG AAAAAATTAT GCCATAAAAT 601 ATAAAAAATT TACTCATTTT TCATTGAGCT AATTCATAAA AAAAAAA Predicted gene structure (within gDNA segment 5052 to 1): Exon 1 3719 3249 ( 471 n); cDNA 1 494 ( 494 n); score: 0.741 Intron 1 3248 2526 ( 723 n); Pd: 0.252 (s: 0.90), Pa: 0.000 (s: 0.65) Exon 2 2525 2480 ( 46 n); cDNA 495 538 ( 44 n); score: 0.652 PPA cDNA 636 647 MATCH C06HBa0153O03.1-1- SGN-E396055- 0.741 517 0.799 C PGS_C06HBa0153O03.1-1-_SGN-E396055- (3719 3249,2525 2480) Alignment (genomic DNA sequence = upper lines): ATTAGCATAT AATTAAGAA- CAAAAGATGG CAATAATAAC TCACTAATTA ACTTAAGGTT 3661 |||||||||| ||||||||| ||||||||| ||||||||| |||||||| ||| |||| | ATTAGCATAT AATTAAGAAT AAAAAGATGG AAATAATAAC CCACTAATTT ACTCAAGGGT 60 ACCTCTTT-T AACCCCCA-A GTAATTAGAC TTATTAAAAT TAACCC-ACT AACTTTATAA 3604 ||| ||| | |||||||| |||||||||| ||||||| || ||||| ||| |||||||||| TCCTTTTTTT AACCCCCAGG GTAATTAGAC TTATTAACAT AAACCCCACT AACTTTATAA 120 TTAAAGCAGG AATAGTCCAA AACGCCCCTT AAAATAATTA CAGAAATCTG ACCCAGCCTG 3544 |||||| ||| |||||||||| |||| ||||| |||| || ||||||| | |||||| ||| TTAAAGTAGG AATAGTCCAA AACGTCCCTT AAAACGTGTA AAGAAATCCG ACCCAGACTG 180 GGATTACGCA GCCTGTGACG GCCCGTCGCG CCTGCGACGG TCCATTCTGC TGCTCCGTCA 3484 |||||||||| ||||||| | |||||||| | |||||||||| ||| | |||| | | |||| GGATTACGCA ACCTGTGATG GCCCGTCGTG CCTGCGACGG TCCGTCCTGC AGGT-CGTCG 239 CAGAGTTCCG AGACTCAATT T-CTCTGAAG AGTCTGT-A- --------AC G---GT---- 3442 || |||| | |||||||||| | | | ||| ||||||| | || | || CAAGGTTCAG AGACTCAATT TCCACCAAAG AGTCTGTGAC GGTCCGTCAC GCCCGTGACG 299 -T-CGTCCTG CCATTCCGTT ACGAAGTTCA GAAAGTCGA- TTTCAGTACC CAATTTTGAG 3385 | |||| || |||||||||| |||||||||| || |||||| ||| |||||| ||| ||| || GTCCGTCGTG CCATTCCGTT ACGAAGTTCA GAGAGTCGAT TTTTAGTACC CAA-TTTCAG 358 AA-TTCTAAG TATTTTGGAA TGAGATATCC TCGACGGTCC GTCGTGCCCA TGACGGTCGG 3326 || ||||||| | ||||| || |||| ||| |||||||||| |||||| || |||||||| | AATTTCTAAG TGTTTTGAAA CGAGA-CTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 417 TCGTGAGTTC CGTCGTCTTT GCCTGTTTTT CAAGAAATAA AATCTGCTGC TCGAAACGAC 3266 ||||| |||| |||||||| ||||||||| | | |||||| |||||||| | || ||||||| TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 477 TAAACAGGTC GTTACAAGTA TTGCCTAATT CCTTTTAAGG GTTATTTAGG GGTAAAGAAC 3206 |||||||||| |||||| TAAACAGGTC GTTACAT... .......... .......... .......... .......... 494 AAGTCTCAAA TCATTTTTAC AATAATTTAG AACGCTTAGG TCTTGGGAGA AAAGAGAAAA 3146 .......... .......... .......... .......... .......... .......... 494 TAAAAGGGAT CAAGAATTGT CCAAGAACTC CAACATTGTT GCGTAGATTT TATCAAGGGT 3086 .......... .......... .......... .......... .......... .......... 494 TATCCCTATC GAGGTATGTG AGATCATTCG GTGTTGGATC CTTTCATCCA CACTCCAAAT 3026 .......... .......... .......... .......... .......... .......... 494 TTATTCAATA CAATAAATTA GAGTTTATAT TTAATAAAAA TCTTGATGTT CTTGATGAAA 2966 .......... .......... .......... .......... .......... .......... 494 GTTATGAAGT TCTTGTTGGT TGAATTATAG TTGATCTCTA GTAGTTATGA TTCATGTTTT 2906 .......... .......... .......... .......... .......... .......... 494 CGGGTCTAAA TATTGGGTAA TGAGTTTTTT AGGGATCCTA AGTTTTTGAG GTGGGATCCA 2846 .......... .......... .......... .......... .......... .......... 494 TGGGAGTTTG GTAGGTTTAG ATATGAAAGA GGAGAAGAAA AGTCGGAGGT CCAGTCAGAC 2786 .......... .......... .......... .......... .......... .......... 494 AACCCTTGGG GCGCCGCGCC TCTTAGAGCG CCAATACCCT CGGAGACCCT TTTCTTTCCC 2726 .......... .......... .......... .......... .......... .......... 494 TATATTTTCG TACTAGTTCC TAAGTGATGT ACCTCTCATT CCTAGTTGAC CAACACTCTA 2666 .......... .......... .......... .......... .......... .......... 494 GAATAAATAT AAACATCATG AAATCATCCA TAAACATGAG ATTATGATCC TTGAATTCAT 2606 .......... .......... .......... .......... .......... .......... 494 AATCCAATTC AAGAGAAACT AAGATCAAAG TCAAGAAAGT AAGCAATGAA GGGAGTAAAA 2546 .......... .......... .......... .......... .......... .......... 494 GTAAAGCTTT TATTTCAAAG TTCTTAGAAT CTTACTTAAA TGTTATAATT TCGTTTTAAG 2486 || | | | |||||||| | |||| | | | ||||| | .......... .......... TTATGTTCTT CCTACTTAAA TATTATTA-T T-ATTTTACG 532 GCTCAT 2480 | || ATTTAT 538 hqPGS_C06HBa0153O03.1-1-_SGN-E396055- (3719 3249) ******************************************************************************** EST sequence 59 -strand 640 n (File: SGN-E398551-) 1 TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAA TAATTTACTC AAGGTTACCT 61 CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA TAATTAAAGT 121 AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA CTGGGATTAC 181 GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT CGCAAGGTTC 241 AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA CGGTCCGTCG 301 TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA GAATTTCTAA 361 GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG TCGTGGGTTC 421 CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC TAAACAGGTC 481 GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA TAACACTATT 541 AGAAACAAAG ATTTTCTCAA CCATGAATTA ATGAAAAAAT TATGCCATAA AATATAAAAA 601 ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA Predicted gene structure (within gDNA segment 4776 to 1): Exon 1 3717 3249 ( 469 n); cDNA 1 487 ( 487 n); score: 0.765 Intron 1 3248 2526 ( 723 n); Pd: 0.252 (s: 0.90), Pa: 0.000 (s: 0.65) Exon 2 2525 2480 ( 46 n); cDNA 488 531 ( 44 n); score: 0.652 PPA cDNA 629 640 MATCH C06HBa0153O03.1-1- SGN-E398551- 0.765 515 0.805 C PGS_C06HBa0153O03.1-1-_SGN-E398551- (3717 3249,2525 2480) Alignment (genomic DNA sequence = upper lines): TAGCATATAA TTAAGAACAA AAGATGGCAA TAATAACTCA CTAATTAACT TAAGGTTACC 3658 |||||||||| ||||||| || ||||||| || ||||||| || ||||| ||| ||||||||| TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA ATAATTTACT CAAGGTTACC 59 TCTTTTAACC CCCAAGTAAT TAGACTTATT AAAATTAACC CACTAACTTT ATAATTAAAG 3598 |||||||||| |||| ||||| |||||||||| || || |||| |||||||||| |||||||||| TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT ATAATTAAAG 119 CAGGAATAGT CCAAAACGCC CCTTAAAATA ATTACAGAAA TCTGACCCAG CCTGGGATTA 3538 ||||||||| |||||||| | |||||||| || ||||| || ||||||| ||||||||| TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG ACTGGGATTA 179 CGCAGCCTGT GACGGCCCGT CGCGCCTGCG ACGGTCCATT CTGCTGCTCC GTCACAGAGT 3478 |||| ||||| || ||||||| || ||||||| ||||||| | |||| | | | ||| || || CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C GTCGCAAGGT 238 TCCGAGACTC AATTT-CTCT GAAGAGTCTG T-A------- --ACG---GT -----T-CGT 3438 || ||||||| ||||| | | ||||||||| | | ||| || | ||| TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT GACGGTCCGT 298 CCTGCCATTC CGTTACGAAG TTCAGAAAGT CGA-TTTCAG TACCCAATTT TGAGAA-TTC 3380 | |||||||| |||||||||| |||||| ||| ||| ||| || ||||||| || | |||| ||| CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCAA-TT TCAGAATTTC 357 TAAGTATTTT GGAATGAGAT ATCCTCGACG GTCCGTCGTG CCCATGACGG TCGGTCGTGA 3320 ||||| |||| | || |||| ||||||||| |||| ||||| | |||||||| || |||||| TAAGTGTTTT GAAACGAGA- CTCCTCGACG GTCCATCGTG CTCATGACGG TCCGTCGTGG 416 GTTCCGTCGT CTTTGCCTGT TTTTCAAGAA ATAAAATCTG CTGCTCGAAA CGACTAAACA 3260 |||||||||| || ||||| ||||| | || |||||||||| || ||| ||| |||||||||| GTTCCGTCGT CTCAACCTGT TTTTCCAAAA ATAAAATCTG CTACTCAAAA CGACTAAACA 476 GGTCGTTACA AGTATTGCCT AATTCCTTTT AAGGGTTATT TAGGGGTAAA GAACAAGTCT 3200 |||||||||| GGTCGTTACA T......... .......... .......... .......... .......... 487 CAAATCATTT TTACAATAAT TTAGAACGCT TAGGTCTTGG GAGAAAAGAG AAAATAAAAG 3140 .......... .......... .......... .......... .......... .......... 487 GGATCAAGAA TTGTCCAAGA ACTCCAACAT TGTTGCGTAG ATTTTATCAA GGGTTATCCC 3080 .......... .......... .......... .......... .......... .......... 487 TATCGAGGTA TGTGAGATCA TTCGGTGTTG GATCCTTTCA TCCACACTCC AAATTTATTC 3020 .......... .......... .......... .......... .......... .......... 487 AATACAATAA ATTAGAGTTT ATATTTAATA AAAATCTTGA TGTTCTTGAT GAAAGTTATG 2960 .......... .......... .......... .......... .......... .......... 487 AAGTTCTTGT TGGTTGAATT ATAGTTGATC TCTAGTAGTT ATGATTCATG TTTTCGGGTC 2900 .......... .......... .......... .......... .......... .......... 487 TAAATATTGG GTAATGAGTT TTTTAGGGAT CCTAAGTTTT TGAGGTGGGA TCCATGGGAG 2840 .......... .......... .......... .......... .......... .......... 487 TTTGGTAGGT TTAGATATGA AAGAGGAGAA GAAAAGTCGG AGGTCCAGTC AGACAACCCT 2780 .......... .......... .......... .......... .......... .......... 487 TGGGGCGCCG CGCCTCTTAG AGCGCCAATA CCCTCGGAGA CCCTTTTCTT TCCCTATATT 2720 .......... .......... .......... .......... .......... .......... 487 TTCGTACTAG TTCCTAAGTG ATGTACCTCT CATTCCTAGT TGACCAACAC TCTAGAATAA 2660 .......... .......... .......... .......... .......... .......... 487 ATATAAACAT CATGAAATCA TCCATAAACA TGAGATTATG ATCCTTGAAT TCATAATCCA 2600 .......... .......... .......... .......... .......... .......... 487 ATTCAAGAGA AACTAAGATC AAAGTCAAGA AAGTAAGCAA TGAAGGGAGT AAAAGTAAAG 2540 .......... .......... .......... .......... .......... .......... 487 CTTTTATTTC AAAGTTCTTA GAATCTTACT TAAATGTTAT AATTTCGTTT TAAGGCTCAT 2480 || | || |||| ||||| |||| | || ||| || | | || .......... ....TTATGT TCTTCCTACT TAAATATTAT TA-TT-ATTT TACGATTTAT 531 hqPGS_C06HBa0153O03.1-1-_SGN-E398551- (3717 3249) ******************************************************************************** EST sequence 56 -strand 630 n (File: SGN-E396038-) 1 TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC AAGGTTACCT CTTTTAACCC 61 CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA TAATTAAAGT AGGAATAGTC 121 CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA CTGGGATTAC GCAACCTGTG 181 ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT CGCAAGGTTC AGAGACTCAA 241 TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA CGGTCCGTCG TGCCATTCCG 301 TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA GAATTTCTAA GTGTTTTGAA 361 ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG TCGTGGGTTC CGTCGTCTCA 421 ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC TAAACAGGTC GTTACATTTA 481 TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA TAACACTATT AGAAACAAAG 541 ATTTTCTCAA CCATGAATTA ATGAAAAAAT TATGCCATAA AATATAAAAA ATTTACTCAT 601 TTTTCATAGA GTTAAATAAT AAAAAAAAAA Predicted gene structure (within gDNA segment 4676 to 1): Exon 1 3707 3249 ( 459 n); cDNA 1 477 ( 477 n); score: 0.763 Intron 1 3248 2526 ( 723 n); Pd: 0.252 (s: 0.90), Pa: 0.000 (s: 0.65) Exon 2 2525 2480 ( 46 n); cDNA 478 521 ( 44 n); score: 0.652 PPA cDNA 621 630 MATCH C06HBa0153O03.1-1- SGN-E396038- 0.763 505 0.802 C PGS_C06HBa0153O03.1-1-_SGN-E396038- (3707 3249,2525 2480) Alignment (genomic DNA sequence = upper lines): TTAAGAACAA AAGATGGCAA TAATAACTCA CTAATTAACT TAAGGTTACC TCTTTTAACC 3648 ||||||| || ||||||| || ||||||| || |||||| ||| ||||||||| |||||||||| TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT CAAGGTTACC TCTTTTAACC 59 CCCAAGTAAT TAGACTTATT AAAATTAACC CACTAACTTT ATAATTAAAG CAGGAATAGT 3588 |||| ||||| |||||||||| || || |||| |||||||||| |||||||||| ||||||||| CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT ATAATTAAAG TAGGAATAGT 119 CCAAAACGCC CCTTAAAATA ATTACAGAAA TCTGACCCAG CCTGGGATTA CGCAGCCTGT 3528 |||||||| | |||||||| || ||||| || ||||||| ||||||||| |||| ||||| CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG ACTGGGATTA CGCAACCTGT 179 GACGGCCCGT CGCGCCTGCG ACGGTCCATT CTGCTGCTCC GTCACAGAGT TCCGAGACTC 3468 || ||||||| || ||||||| ||||||| | |||| | | | ||| || || || ||||||| GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C GTCGCAAGGT TCAGAGACTC 238 AATTT-CTCT GAAGAGTCTG T-A------- --ACG---GT -----T-CGT CCTGCCATTC 3428 ||||| | | ||||||||| | | ||| || | ||| | |||||||| AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT GACGGTCCGT CGTGCCATTC 298 CGTTACGAAG TTCAGAAAGT CGA-TTTCAG TACCCAATTT TGAGAA-TTC TAAGTATTTT 3370 |||||||||| |||||| ||| ||| ||| || ||||||| || | |||| ||| ||||| |||| CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCAA-TT TCAGAATTTC TAAGTGTTTT 357 GGAATGAGAT ATCCTCGACG GTCCGTCGTG CCCATGACGG TCGGTCGTGA GTTCCGTCGT 3310 | || |||| ||||||||| |||| ||||| | |||||||| || |||||| |||||||||| GAAACGAGA- CTCCTCGACG GTCCATCGTG CTCATGACGG TCCGTCGTGG GTTCCGTCGT 416 CTTTGCCTGT TTTTCAAGAA ATAAAATCTG CTGCTCGAAA CGACTAAACA GGTCGTTACA 3250 || ||||| ||||| | || |||||||||| || ||| ||| |||||||||| |||||||||| CTCAACCTGT TTTTCCAAAA ATAAAATCTG CTACTCAAAA CGACTAAACA GGTCGTTACA 476 AGTATTGCCT AATTCCTTTT AAGGGTTATT TAGGGGTAAA GAACAAGTCT CAAATCATTT 3190 T......... .......... .......... .......... .......... .......... 477 TTACAATAAT TTAGAACGCT TAGGTCTTGG GAGAAAAGAG AAAATAAAAG GGATCAAGAA 3130 .......... .......... .......... .......... .......... .......... 477 TTGTCCAAGA ACTCCAACAT TGTTGCGTAG ATTTTATCAA GGGTTATCCC TATCGAGGTA 3070 .......... .......... .......... .......... .......... .......... 477 TGTGAGATCA TTCGGTGTTG GATCCTTTCA TCCACACTCC AAATTTATTC AATACAATAA 3010 .......... .......... .......... .......... .......... .......... 477 ATTAGAGTTT ATATTTAATA AAAATCTTGA TGTTCTTGAT GAAAGTTATG AAGTTCTTGT 2950 .......... .......... .......... .......... .......... .......... 477 TGGTTGAATT ATAGTTGATC TCTAGTAGTT ATGATTCATG TTTTCGGGTC TAAATATTGG 2890 .......... .......... .......... .......... .......... .......... 477 GTAATGAGTT TTTTAGGGAT CCTAAGTTTT TGAGGTGGGA TCCATGGGAG TTTGGTAGGT 2830 .......... .......... .......... .......... .......... .......... 477 TTAGATATGA AAGAGGAGAA GAAAAGTCGG AGGTCCAGTC AGACAACCCT TGGGGCGCCG 2770 .......... .......... .......... .......... .......... .......... 477 CGCCTCTTAG AGCGCCAATA CCCTCGGAGA CCCTTTTCTT TCCCTATATT TTCGTACTAG 2710 .......... .......... .......... .......... .......... .......... 477 TTCCTAAGTG ATGTACCTCT CATTCCTAGT TGACCAACAC TCTAGAATAA ATATAAACAT 2650 .......... .......... .......... .......... .......... .......... 477 CATGAAATCA TCCATAAACA TGAGATTATG ATCCTTGAAT TCATAATCCA ATTCAAGAGA 2590 .......... .......... .......... .......... .......... .......... 477 AACTAAGATC AAAGTCAAGA AAGTAAGCAA TGAAGGGAGT AAAAGTAAAG CTTTTATTTC 2530 .......... .......... .......... .......... .......... .......... 477 AAAGTTCTTA GAATCTTACT TAAATGTTAT AATTTCGTTT TAAGGCTCAT 2480 || | || |||| ||||| |||| | || ||| || | | || ....TTATGT TCTTCCTACT TAAATATTAT TA-TT-ATTT TACGATTTAT 521 hqPGS_C06HBa0153O03.1-1-_SGN-E396038- (3707 3249) ******************************************************************************** EST sequence 14 -strand 673 n (File: SGN-E550140-) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTACTCN 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATGTTATCAA CCATGAATTA ACAAAAAATT AGACCAAAAA 661 TATAAAAAAT TAC Predicted gene structure (within gDNA segment 4712 to 1): Exon 1 3787 3250 ( 538 n); cDNA 2 556 ( 555 n); score: 0.773 MATCH C06HBa0153O03.1-1- SGN-E550140- 0.773 538 0.799 C PGS_C06HBa0153O03.1-1-_SGN-E550140- (3787 3250) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT CTTTGTTCTT TCTACTTTTC TTATTCAAAC CCTCTTTCTT 3728 ||| | | ||||||||| | ||||||| | | ||||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAACAA AAGATGGCAA TAATAACTCA CTAATTAACT 3668 |||||||||| |||||||||| ||||||| || ||||||| || ||||||| || ||||||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTACTC 119 TAAGGTTACC TCTTTTAACC CCCAAGTAAT TAGACTTATT AAAATTAACC CACTAACTTT 3608 ||||||||| |||||||||| |||| ||||| |||||||||| || || |||| |||||||||| NAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT CCAAAACGCC CCTTAAAATA ATTACAGAAA TCTGACCCAG 3548 |||||||||| ||||||||| |||||||| | |||||||| || ||||| || ||||||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA CGCAGCCTGT GACGGCCCGT CGCGCCTGCG ACGGTCCATT CTGCTGCTCC 3488 ||||||||| |||| ||||| || ||||||| || ||||||| ||||||| | |||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCCGAGACTC AATTT-CTCT GAAGAGTCTG T-A------- --ACG---GT 3442 ||| || || || ||||||| ||||| | | ||||||||| | | ||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 -----T-CGT CCTGCCATTC CGTTACGAAG TTCAGAAAGT CGA-TTTCAG TACCCAATTT 3389 | ||| | |||||||| |||||||||| |||||| ||| ||| ||| || ||||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCAA-TT 417 TGAGAA-TTC TAAGTATTTT GGAATGAGAT ATCCTCGACG GTCCGTCGTG CCCATGACGG 3330 | |||| ||| ||||| |||| | || |||| ||||||||| |||| ||||| | |||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGA- CTCCTCGACG GTCCATCGTG CTCATGACGG 476 TCGGTCGTGA GTTCCGTCGT CTTTGCCTGT TTTTCAAGAA ATAAAATCTG CTGCTCGAAA 3270 || |||||| |||||||||| || ||||| ||||| | || |||||||||| || ||| ||| TCCGTCGTGG GTTCCGTCGT CTCAACCTGT TTTTCCAAAA ATAAAATCTG CTACTCAAAA 536 CGACTAAACA GGTCGTTACA 3250 |||||||||| |||||||||| CGACTAAACA GGTCGTTACA 556 hqPGS_C06HBa0153O03.1-1-_SGN-E550140- (3787 3250) ******************************************************************************** EST sequence 20 -strand 681 n (File: SGN-E389553-) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCGTTCC TACTTAAATA TTATTATTAT TTTACGATTT 601 ATAACACTAT TAGAAACAAA GATTTTCTCA ACCATGAATT AATGAAAAAA TTATGGAATA 661 AAATATAAAA AATTACTCAT T Predicted gene structure (within gDNA segment 4712 to 1): Exon 1 3787 3250 ( 538 n); cDNA 2 556 ( 555 n); score: 0.777 MATCH C06HBa0153O03.1-1- SGN-E389553- 0.777 538 0.790 C PGS_C06HBa0153O03.1-1-_SGN-E389553- (3787 3250) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT CTTTGTTCTT TCTACTTTTC TTATTCAAAC CCTCTTTCTT 3728 ||| | | ||||||||| | ||||||| | | ||||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAACAA AAGATGGCAA TAATAACTCA CTAATTAACT 3668 |||||||||| |||||||||| ||||||| || ||||||| || ||||||| || |||||| ||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 TAAGGTTACC TCTTTTAACC CCCAAGTAAT TAGACTTATT AAAATTAACC CACTAACTTT 3608 ||||||||| |||||||||| |||| ||||| |||||||||| || || |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT CCAAAACGCC CCTTAAAATA ATTACAGAAA TCTGACCCAG 3548 |||||||||| ||||||||| |||||||| | |||||||| || ||||| || ||||||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA CGCAGCCTGT GACGGCCCGT CGCGCCTGCG ACGGTCCATT CTGCTGCTCC 3488 ||||||||| |||| ||||| || ||||||| || ||||||| ||||||| | |||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCCGAGACTC AATTT-CTCT GAAGAGTCTG T-A------- --ACG---GT 3442 ||| || || || ||||||| ||||| | | ||||||||| | | ||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 -----T-CGT CCTGCCATTC CGTTACGAAG TTCAGAAAGT CGA-TTTCAG TACCCAATTT 3389 | ||| | |||||||| |||||||||| |||||| ||| ||| ||| || ||||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCAA-TT 417 TGAGAA-TTC TAAGTATTTT GGAATGAGAT ATCCTCGACG GTCCGTCGTG CCCATGACGG 3330 | |||| ||| ||||| |||| | || |||| ||||||||| |||| ||||| | |||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGA- CTCCTCGACG GTCCATCGTG CTCATGACGG 476 TCGGTCGTGA GTTCCGTCGT CTTTGCCTGT TTTTCAAGAA ATAAAATCTG CTGCTCGAAA 3270 || |||||| |||||||||| || ||||| ||||| | || |||||||||| || ||| ||| TCCGTCGTGG GTTCCGTCGT CTCAACCTGT TTTTCCAAAA ATAAAATCTG CTACTCAAAA 536 CGACTAAACA GGTCGTTACA 3250 |||||||||| |||||||||| CGACTAAACA GGTCGTTACA 556 hqPGS_C06HBa0153O03.1-1-_SGN-E389553- (3787 3250) ******************************************************************************** EST sequence 171 +strand 618 n (File: SGN-E396054+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAA Predicted gene structure (within gDNA segment 4702 to 1): Exon 1 3787 3250 ( 538 n); cDNA 2 556 ( 555 n); score: 0.777 MATCH C06HBa0153O03.1-1- SGN-E396054+ 0.777 538 0.871 C PGS_C06HBa0153O03.1-1-_SGN-E396054+ (3787 3250) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT CTTTGTTCTT TCTACTTTTC TTATTCAAAC CCTCTTTCTT 3728 ||| | | ||||||||| | ||||||| | | ||||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAACAA AAGATGGCAA TAATAACTCA CTAATTAACT 3668 |||||||||| |||||||||| ||||||| || ||||||| || ||||||| || |||||| ||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 TAAGGTTACC TCTTTTAACC CCCAAGTAAT TAGACTTATT AAAATTAACC CACTAACTTT 3608 ||||||||| |||||||||| |||| ||||| |||||||||| || || |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT CCAAAACGCC CCTTAAAATA ATTACAGAAA TCTGACCCAG 3548 |||||||||| ||||||||| |||||||| | |||||||| || ||||| || ||||||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA CGCAGCCTGT GACGGCCCGT CGCGCCTGCG ACGGTCCATT CTGCTGCTCC 3488 ||||||||| |||| ||||| || ||||||| || ||||||| ||||||| | |||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCCGAGACTC AATTT-CTCT GAAGAGTCTG T-A------- --ACG---GT 3442 ||| || || || ||||||| ||||| | | ||||||||| | | ||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 -----T-CGT CCTGCCATTC CGTTACGAAG TTCAGAAAGT CGA-TTTCAG TACCCAATTT 3389 | ||| | |||||||| |||||||||| |||||| ||| ||| ||| || ||||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCAA-TT 417 TGAGAA-TTC TAAGTATTTT GGAATGAGAT ATCCTCGACG GTCCGTCGTG CCCATGACGG 3330 | |||| ||| ||||| |||| | || |||| ||||||||| |||| ||||| | |||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGA- CTCCTCGACG GTCCATCGTG CTCATGACGG 476 TCGGTCGTGA GTTCCGTCGT CTTTGCCTGT TTTTCAAGAA ATAAAATCTG CTGCTCGAAA 3270 || |||||| |||||||||| || ||||| ||||| | || |||||||||| || ||| ||| TCCGTCGTGG GTTCCGTCGT CTCAACCTGT TTTTCCAAAA ATAAAATCTG CTACTCAAAA 536 CGACTAAACA GGTCGTTACA 3250 |||||||||| |||||||||| CGACTAAACA GGTCGTTACA 556 hqPGS_C06HBa0153O03.1-1-_SGN-E396054+ (3787 3250) ******************************************************************************** EST sequence 173 +strand 610 n (File: SGN-E396058+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTGTTT CCAAAAATAA AATCTGCTAC TCACAACGAC 541 TAAACAGGTC GTTACATTTA GGTTCTTCAT AGTTAACTAT TATTATTATT TTACGATTTA 601 TAACACTATT Predicted gene structure (within gDNA segment 4702 to 1): Exon 1 3787 3250 ( 538 n); cDNA 2 556 ( 555 n); score: 0.773 MATCH C06HBa0153O03.1-1- SGN-E396058+ 0.773 538 0.882 C PGS_C06HBa0153O03.1-1-_SGN-E396058+ (3787 3250) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT CTTTGTTCTT TCTACTTTTC TTATTCAAAC CCTCTTTCTT 3728 ||| | | ||||||||| | ||||||| | | ||||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAACAA AAGATGGCAA TAATAACTCA CTAATTAACT 3668 |||||||||| |||||||||| ||||||| || ||||||| || ||||||| || |||||| ||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 TAAGGTTACC TCTTTTAACC CCCAAGTAAT TAGACTTATT AAAATTAACC CACTAACTTT 3608 ||||||||| |||||||||| |||| ||||| |||||||||| || || |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT CCAAAACGCC CCTTAAAATA ATTACAGAAA TCTGACCCAG 3548 |||||||||| ||||||||| |||||||| | |||||||| || ||||| || ||||||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA CGCAGCCTGT GACGGCCCGT CGCGCCTGCG ACGGTCCATT CTGCTGCTCC 3488 ||||||||| |||| ||||| || ||||||| || ||||||| ||||||| | |||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCCGAGACTC AATTT-CTCT GAAGAGTCTG T-A------- --ACG---GT 3442 ||| || || || ||||||| ||||| | | ||||||||| | | ||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 -----T-CGT CCTGCCATTC CGTTACGAAG TTCAGAAAGT CGA-TTTCAG TACCCAATTT 3389 | ||| | |||||||| |||||||||| |||||| ||| ||| ||| || ||||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCAA-TT 417 TGAGAA-TTC TAAGTATTTT GGAATGAGAT ATCCTCGACG GTCCGTCGTG CCCATGACGG 3330 | |||| ||| ||||| |||| | || |||| ||||||||| |||| ||||| | |||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGA- CTCCTCGACG GTCCATCGTG CTCATGACGG 476 TCGGTCGTGA GTTCCGTCGT CTTTGCCTGT TTTTCAAGAA ATAAAATCTG CTGCTCGAAA 3270 || |||||| |||||||||| || ||||| |||| | || |||||||||| || ||| || TCCGTCGTGG GTTCCGTCGT CTCAACCTGT GTTTCCAAAA ATAAAATCTG CTACTCACAA 536 CGACTAAACA GGTCGTTACA 3250 |||||||||| |||||||||| CGACTAAACA GGTCGTTACA 556 hqPGS_C06HBa0153O03.1-1-_SGN-E396058+ (3787 3250) ******************************************************************************** EST sequence 135 +strand 558 n (File: SGN-E231589+) 1 TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA TAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT 481 CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT CAAAACGACT 541 AAACAGGTCG TTACATTT Predicted gene structure (within gDNA segment 4692 to 1): Exon 1 3787 3250 ( 538 n); cDNA 1 555 ( 555 n); score: 0.775 MATCH C06HBa0153O03.1-1- SGN-E231589+ 0.775 538 0.964 C PGS_C06HBa0153O03.1-1-_SGN-E231589+ (3787 3250) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT CTTTGTTCTT TCTACTTTTC TTATTCAAAC CCTCTTTCTT 3728 ||| | | ||||||||| | ||||||| | | ||||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 59 TTACCCTAAT TAGCATATAA TTAAGAACAA AAGATGGCAA TAATAACTCA CTAATTAACT 3668 |||||||||| |||||||||| ||||||| || ||||||| || ||||||| || |||||| ||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 118 TAAGGTTACC TCTTTTAACC CCCAAGTAAT TAGACTTATT AAAATTAACC CACTAACTTT 3608 ||||||||| |||||||||| |||| ||||| |||||||||| || || |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 178 ATAATTAAAG CAGGAATAGT CCAAAACGCC CCTTAAAATA ATTACAGAAA TCTGACCCAG 3548 |||||||||| ||||||||| |||||||| | |||||||| || ||||| || ||||||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 238 CCTGGGATTA CGCAGCCTGT GACGGCCCGT CGCGCCTGCG ACGGTCCATT CTGCTGCTCC 3488 ||||||||| |||| ||||| || ||||||| || ||||||| ||||||| | |||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 297 GTCACAGAGT TCCGAGACTC AATTT-CTCT GAAGAGTCTG T-A------- --ACG---GT 3442 ||| || || || |||||| ||||| | | ||||||||| | | ||| || GTCGCAAGGT TCATAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 357 -----T-CGT CCTGCCATTC CGTTACGAAG TTCAGAAAGT CGA-TTTCAG TACCCAATTT 3389 | ||| | |||||||| |||||||||| |||||| ||| ||| ||| || ||||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCAA-TT 416 TGAGAA-TTC TAAGTATTTT GGAATGAGAT ATCCTCGACG GTCCGTCGTG CCCATGACGG 3330 | |||| ||| ||||| |||| | || |||| ||||||||| |||| ||||| | |||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGA- CTCCTCGACG GTCCATCGTG CTCATGACGG 475 TCGGTCGTGA GTTCCGTCGT CTTTGCCTGT TTTTCAAGAA ATAAAATCTG CTGCTCGAAA 3270 || |||||| |||||||||| || ||||| ||||| | || |||||||||| || ||| ||| TCCGTCGTGG GTTCCGTCGT CTCAACCTGT TTTTCCAAAA ATAAAATCTG CTACTCAAAA 535 CGACTAAACA GGTCGTTACA 3250 |||||||||| |||||||||| CGACTAAACA GGTCGTTACA 555 hqPGS_C06HBa0153O03.1-1-_SGN-E231589+ (3787 3250) ******************************************************************************** EST sequence 162 +strand 649 n (File: SGN-E374999+) 1 TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT 481 CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT CCAAACGACT 541 AAACAGGTCG TTACATTTAT GTTCTTCCTA CTTAAATATT ATTATTATTT TACGATTTAT 601 AACACTATTA GAAACAAAGA TTTTCTTCAC CCTGAATTAA TGAAAAAAT Predicted gene structure (within gDNA segment 4692 to 1): Exon 1 3787 3250 ( 538 n); cDNA 1 555 ( 555 n); score: 0.777 MATCH C06HBa0153O03.1-1- SGN-E374999+ 0.777 538 0.829 C PGS_C06HBa0153O03.1-1-_SGN-E374999+ (3787 3250) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT CTTTGTTCTT TCTACTTTTC TTATTCAAAC CCTCTTTCTT 3728 ||| | | ||||||||| | ||||||| | | ||||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 59 TTACCCTAAT TAGCATATAA TTAAGAACAA AAGATGGCAA TAATAACTCA CTAATTAACT 3668 |||||||||| |||||||||| ||||||| || ||||||| || ||||||| || |||||| ||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 118 TAAGGTTACC TCTTTTAACC CCCAAGTAAT TAGACTTATT AAAATTAACC CACTAACTTT 3608 ||||||||| |||||||||| |||| ||||| |||||||||| || || |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 178 ATAATTAAAG CAGGAATAGT CCAAAACGCC CCTTAAAATA ATTACAGAAA TCTGACCCAG 3548 |||||||||| ||||||||| |||||||| | |||||||| || ||||| || ||||||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 238 CCTGGGATTA CGCAGCCTGT GACGGCCCGT CGCGCCTGCG ACGGTCCATT CTGCTGCTCC 3488 ||||||||| |||| ||||| || ||||||| || ||||||| ||||||| | |||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 297 GTCACAGAGT TCCGAGACTC AATTT-CTCT GAAGAGTCTG T-A------- --ACG---GT 3442 ||| || || || ||||||| ||||| | | ||||||||| | | ||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 357 -----T-CGT CCTGCCATTC CGTTACGAAG TTCAGAAAGT CGA-TTTCAG TACCCAATTT 3389 | ||| | |||||||| |||||||||| |||||| ||| ||| ||| || ||||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCAA-TT 416 TGAGAA-TTC TAAGTATTTT GGAATGAGAT ATCCTCGACG GTCCGTCGTG CCCATGACGG 3330 | |||| ||| ||||| |||| | || |||| ||||||||| |||| ||||| | |||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGA- CTCCTCGACG GTCCATCGTG CTCATGACGG 475 TCGGTCGTGA GTTCCGTCGT CTTTGCCTGT TTTTCAAGAA ATAAAATCTG CTGCTCGAAA 3270 || |||||| |||||||||| || ||||| ||||| | || |||||||||| || ||| ||| TCCGTCGTGG GTTCCGTCGT CTCAACCTGT TTTTCCAAAA ATAAAATCTG CTACTCCAAA 535 CGACTAAACA GGTCGTTACA 3250 |||||||||| |||||||||| CGACTAAACA GGTCGTTACA 555 hqPGS_C06HBa0153O03.1-1-_SGN-E374999+ (3787 3250) ******************************************************************************** EST sequence 210 +strand 545 n (File: SGN-E241959+) 1 TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT 481 CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT CAAAACGACT 541 AAACA Predicted gene structure (within gDNA segment 4692 to 1): Exon 1 3787 3260 ( 528 n); cDNA 1 545 ( 545 n); score: 0.773 MATCH C06HBa0153O03.1-1- SGN-E241959+ 0.773 528 0.969 C PGS_C06HBa0153O03.1-1-_SGN-E241959+ (3787 3260) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT CTTTGTTCTT TCTACTTTTC TTATTCAAAC CCTCTTTCTT 3728 ||| | | ||||||||| | ||||||| | | ||||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 59 TTACCCTAAT TAGCATATAA TTAAGAACAA AAGATGGCAA TAATAACTCA CTAATTAACT 3668 |||||||||| |||||||||| ||||||| || ||||||| || ||||||| || |||||| ||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 118 TAAGGTTACC TCTTTTAACC CCCAAGTAAT TAGACTTATT AAAATTAACC CACTAACTTT 3608 ||||||||| |||||||||| |||| ||||| |||||||||| || || |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 178 ATAATTAAAG CAGGAATAGT CCAAAACGCC CCTTAAAATA ATTACAGAAA TCTGACCCAG 3548 |||||||||| ||||||||| |||||||| | |||||||| || ||||| || ||||||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 238 CCTGGGATTA CGCAGCCTGT GACGGCCCGT CGCGCCTGCG ACGGTCCATT CTGCTGCTCC 3488 ||||||||| |||| ||||| || ||||||| || ||||||| ||||||| | |||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 297 GTCACAGAGT TCCGAGACTC AATTT-CTCT GAAGAGTCTG T-A------- --ACG---GT 3442 ||| || || || ||||||| ||||| | | ||||||||| | | ||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 357 -----T-CGT CCTGCCATTC CGTTACGAAG TTCAGAAAGT CGA-TTTCAG TACCCAATTT 3389 | ||| | |||||||| |||||||||| |||||| ||| ||| ||| || ||||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCAA-TT 416 TGAGAA-TTC TAAGTATTTT GGAATGAGAT ATCCTCGACG GTCCGTCGTG CCCATGACGG 3330 | |||| ||| ||||| |||| | || |||| ||||||||| |||| ||||| | |||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGA- CTCCTCGACG GTCCATCGTG CTCATGACGG 475 TCGGTCGTGA GTTCCGTCGT CTTTGCCTGT TTTTCAAGAA ATAAAATCTG CTGCTCGAAA 3270 || |||||| |||||||||| || ||||| ||||| | || |||||||||| || ||| ||| TCCGTCGTGG GTTCCGTCGT CTCAACCTGT TTTTCCAAAA ATAAAATCTG CTACTCAAAA 535 CGACTAAACA 3260 |||||||||| CGACTAAACA 545 hqPGS_C06HBa0153O03.1-1-_SGN-E241959+ (3787 3260) ******************************************************************************** EST sequence 100 -strand 660 n (File: SGN-E349296-) 1 AATATTATCA ATACATATTA TTCGCTATTA AGAGCTTACT ACGAATATCG TAAGAGAAAC 61 CATAACCTAC CTCCACCGAA GATTCGTGAT CAAGCAAGTG ATTTCCCAAG CTTTGTGTTT 121 TTTCCTCTCG TTCGATCCTC TTTCTCGTTC GACTTTCTCT CTCTTTCTCT TGTTCTTTCT 181 ATTTTCTTTA TTCAAACCCT CTTTCTTTTA CCCTAATTAG TATATAATTA AGAATAAAAT 241 ATGGCAATAA TAACCCACTA ATTAACTTAA GGTTACCTCT TTTAACCCCC AAGTAATTAG 301 ACTTATTAAC ATTAACCCAC TAACTTTATA ATTAAAGCAG GAATAGTCAA AAACGTCCCT 361 TAAAACAATT GAGGAATTCC GACTCAGACT GGGATTTACG CAGCCTGTGA CAGCCCGTTG 421 TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC GCAAGGTTCA GAGACTGGAT TTTCACTGAA 481 GACTCTGTGA TGGTCCATCA CGCCTGTGAC GGTCCGTCTT GCCATTCCGT TACGAAGTTC 541 AGAGAGTCGA TTTTCAGTAC CCAATTTCAG ATTTCCTAAG TGTTTTGAAA TGAGACCCTG 601 CGACGGTCCG TCGTGCCCAT GATGGTCCGT CGTGGGGTCC GTCATTTCTG CCAGTTTTTC Predicted gene structure (within gDNA segment 4848 to 1859): Exon 1 3923 3432 ( 492 n); cDNA 1 504 ( 504 n); score: 0.823 MATCH C06HBa0153O03.1-1- SGN-E349296- 0.823 492 0.745 C PGS_C06HBa0153O03.1-1-_SGN-E349296- (3923 3432) Alignment (genomic DNA sequence = upper lines): AATATTATCA ATATATATCA TTCGCTATTA AGAGTTTACT ACGAATATCG TAAGAGAAAC 3864 |||||||||| ||| |||| | |||||||||| |||| ||||| |||||||||| |||||||||| AATATTATCA ATACATATTA TTCGCTATTA AGAGCTTACT ACGAATATCG TAAGAGAAAC 60 CATAACCTAC CTCCACCGAA GATTAGTGAT CAAGCAAGAA ATTTCCCCAA GC-TT-TG-T 3807 |||||||||| |||||||||| |||| ||||| |||||||| |||| ||||| || || || | CATAACCTAC CTCCACCGAA GATTCGTGAT CAAGCAAGTG ATTT-CCCAA GCTTTGTGTT 119 TCTTCGT-T- -TTC--TC-T CTTCCTCGTT CGA--TCCTC TCTCTCTCT- TTGTTCTTTC 3756 | ||| | | ||| || | ||| |||||| ||| | ||| ||||| ||| |||||||||| TTTTCCTCTC GTTCGATCCT CTTTCTCGTT CGACTTTCTC TCTCTTTCTC TTGTTCTTTC 179 TACTTTTCTT ATTCAAACCC TCTTTCTTTT ACCCTAATTA GCATATAATT AAGAACAAAA 3696 || ||| || |||||||||| |||||||||| |||||||||| | |||||||| ||||| |||| TATTTTCTTT ATTCAAACCC TCTTTCTTTT ACCCTAATTA GTATATAATT AAGAATAAAA 239 GATGGCAATA ATAACTCACT AATTAACTTA AGGTTACCTC TTTTAACCCC CAAGTAATTA 3636 ||||||||| ||||| |||| |||||||||| |||||||||| |||||||||| |||||||||| TATGGCAATA ATAACCCACT AATTAACTTA AGGTTACCTC TTTTAACCCC CAAGTAATTA 299 GACTTATTAA AATTAACCCA CTAACTTTAT AATTAAAGCA GGAATAGTCC AAAACGCCCC 3576 |||||||||| ||||||||| |||||||||| |||||||||| ||||||||| |||||| ||| GACTTATTAA CATTAACCCA CTAACTTTAT AATTAAAGCA GGAATAGTCA AAAACGTCCC 359 TTAAAATAAT TACAGAAATC TGACCCAGCC TGGGA-TTAC GCAGCCTGTG ACGGCCCGTC 3517 |||||| ||| | ||| || ||| ||| | ||||| |||| |||||||||| || |||||| TTAAAACAAT TGAGGAATTC CGACTCAGAC TGGGATTTAC GCAGCCTGTG ACAGCCCGTT 419 GCGCCTGCGA CGGTCCATTC TGCTGCTCCG TCACAGAGTT CCGAGACT-C AATTTCTCTG 3458 | |||||||| |||||| | | ||| | | || || || ||| | |||||| | |||| ||| GTGCCTGCGA CGGTCCGTCC TGCAGGT-CG TCGCAAGGTT CAGAGACTGG ATTTTCACTG 478 AAGAGTCTGT AACGGTTCGT CCTGCC 3432 |||| ||||| | ||| | | | ||| AAGACTCTGT GATGGTCCAT CACGCC 504 hqPGS_C06HBa0153O03.1-1-_SGN-E349296- (3923 3432) ******************************************************************************** EST sequence 169 +strand 356 n (File: SGN-E396037+) 1 GGGCAGCGGA GCCTCATGTT TTGTTTACCA CTATGCCGCA TCTATATGAT TAACATGATG 61 ATGATGATGA TGACTACCAC GATTCACGAG AAGAAGATGA GGATGAATGG GGTATTGAGA 121 TGGATGTTTT ACTCGAGGTT ACCTCTTTTA ACCCCCAGGT AATTAGACTT ATTAACATAA 181 ACCCACTAAC TTTATAATTA AAGTAGGAAT AGTCCAAAAC GTCCCTTAAA ACGTGTAAAG 241 AAATCCGACC CAGACTGGGA TTACGCAACC TGTGATGGCC CGTCGTGCCT GCGACGGTCC 301 GTCCTGCAGG TCTTCTCTAG GTTCAGAGAC TCTCTTTCCA CCAAAGAGTC TGTGAC Predicted gene structure (within gDNA segment 5615 to 1620): Exon 1 3673 3445 ( 229 n); cDNA 128 356 ( 229 n); score: 0.836 MATCH C06HBa0153O03.1-1- SGN-E396037+ 0.836 229 0.643 C PGS_C06HBa0153O03.1-1-_SGN-E396037+ (3673 3445) Alignment (genomic DNA sequence = upper lines): TTAACTTAAG GTTACCTCTT TTAACCCCCA AGTAATTAGA CTTATTAAAA TTAACCCACT 3614 || ||| || |||||||||| |||||||||| ||||||||| |||||||| | | |||||||| TTTACTCGAG GTTACCTCTT TTAACCCCCA GGTAATTAGA CTTATTAACA TAAACCCACT 187 AACTTTATAA TTAAAGCAGG AATAGTCCAA AACGCCCCTT AAAATAATTA CAGAAATCTG 3554 |||||||||| |||||| ||| |||||||||| |||| ||||| |||| || ||||||| | AACTTTATAA TTAAAGTAGG AATAGTCCAA AACGTCCCTT AAAACGTGTA AAGAAATCCG 247 ACCCAGCCTG GGATTACGCA GCCTGTGACG GCCCGTCGCG CCTGCGACGG TCCATTCTGC 3494 |||||| ||| |||||||||| ||||||| | |||||||| | |||||||||| ||| | |||| ACCCAGACTG GGATTACGCA ACCTGTGATG GCCCGTCGTG CCTGCGACGG TCCGTCCTGC 307 TGCTCCGTCA CAGAGTTCCG AGACTCAATT T-CTCTGAAG AGTCTGTAAC 3445 | | | || | |||| | |||||| || | | | ||| ||||||| || AGGT-CTTCT CTAGGTTCAG AGACTCTCTT TCCACCAAAG AGTCTGTGAC 356 hqPGS_C06HBa0153O03.1-1-_SGN-E396037+ (3673 3445) ******************************************************************************** EST sequence 148 +strand 686 n (File: SGN-E241789+) 1 ATGTCTGATC TTCAAGGACA TCAAGACATG AATGAGAGAG AATCCAGTCC GAGCTAGGAA 61 CAATAGCTCA CCCTGAAATC TGACGTGATG AAGACTGGTT AGAGTTGCGG TTGAGTTGAA 121 GACGACGGTA CGTTTGCTGC ACTCCACAAT TAACAAAAAG AAAACATAAA AGTAGGGGTC 181 AGTACAAACA CGAGTACTGA GTAGATATCA TCGGCCAACT CAGAATAGAG AACAATATAT 241 ATCAAATAAT AAAATAAAAT CAACCATAAC ACTTAACAGG TGACAACAAC AAGTACCATA 301 ACCATTGGGC ACAACCCAAG AACATCTATG AGGACTCAAG CCTCCACACC ATACTCATTT 361 GGGAAACAGG TTCATTAAAT TGAGTACATT AACATAATTC AAGATTCATT CTTTTTACTA 421 TCGTGGTGTC GGAACGTGAT ACTCCGATCC CCTAATGCTA CGTGTCGGTT CGTGACACCC 481 GATCCCCTAA TACTACGTGT CGGTTCGTTA CACCCGATCT CCTAATACTA CGTGCCGATT 541 CGTGACACCC GATCCATTAA TACTATGTGT CGGTTCGTGA CACCCGATCC ATTAATACTA 601 CGTGTCGGTT CGTGACACCC GATCCCCTAA CCTCATTCTT TTAGTTCATC AAGCCTTCTT 661 TTATACCAAG ACATCATCAT TAACAA Predicted gene structure (within gDNA segment 6616 to 1791): Exon 1 4852 4423 ( 430 n); cDNA 1 431 ( 431 n); score: 0.800 Intron 1 4422 4341 ( 82 n); Pd: 0.794 (s: 0.76), Pa: 0.000 (s: 0.88) Exon 2 4340 4144 ( 197 n); cDNA 432 629 ( 198 n); score: 0.855 Intron 2 4143 4094 ( 50 n); Pd: 0.900 (s: 0.89), Pa: 0.000 (s: 0.88) Exon 3 4093 4037 ( 57 n); cDNA 630 686 ( 57 n); score: 0.895 MATCH C06HBa0153O03.1-1- SGN-E241789+ 0.824 684 0.997 C PGS_C06HBa0153O03.1-1-_SGN-E241789+ (4852 4423,4340 4144,4093 4037) Alignment (genomic DNA sequence = upper lines): ATGCCGGAAC TTCAAAG-CA TCAAGACATG AA-GAG-GAA GATCCAGTCC AAGCTAGAAG 4796 ||| | || | ||||| | || |||||||||| || ||| || ||||||||| |||||| | ATGTCTGATC TTCAAGGACA TCAAGACATG AATGAGAGAG AATCCAGTCC GAGCTAGGAA 60 CATTAGCTCA CCCTGATATC CGGAGTAATG AAGACTGGCT AGAGTTACTG TTGAGTCGAA 4736 || ||||||| |||||| ||| | || ||| |||||||| | |||||| | | |||||| ||| CAATAGCTCA CCCTGAAATC TGACGTGATG AAGACTGGTT AGAGTTGCGG TTGAGTTGAA 120 GATGACGGCA CGTTTGCTGC ACTCCACAAA TAAACAAGAA GAAAACATAA AAGTAGGGGT 4676 || ||||| | |||||||||| ||||||| || | ||||| || |||||||||| |||||||||| GACGACGGTA CGTTTGCTGC ACTCCAC-AA TTAACAAAAA GAAAACATAA AAGTAGGGGT 179 CAGTACAAAC ACGGGTACTG AGTAGATATC ATCGGCCAAC TCAAAATAGA GATCAATATA 4616 |||||||||| ||| |||||| |||||||||| |||||||||| ||| |||||| || ||||||| CAGTACAAAC ACGAGTACTG AGTAGATATC ATCGGCCAAC TCAGAATAGA GAACAATATA 239 TACCAAGTAA TATCATAAAA TCAACTATGA TACTCAACAT GTAGCAACAT CAAATACTAT 4556 || ||| ||| || |||||| ||||| || | ||| |||| || ||||| ||| ||| || TATCAAATAA TAAAATAAAA TCAACCATAA CACTTAACAG GTGACAACAA CAAGTACCAT 299 -ATCATTAAC AATTACCGTC AAGTTCACAC ACGAGGACTC AAGCCTCAAT ACCGTACTCA 4497 | |||| | ||| | ||| || | |||||||| ||||||| | ||| |||||| AACCATTGGG CACAACC--C AAGAACATCT ATGAGGACTC AAGCCTCCAC ACCATACTCA 357 TTTGGGAATT ATGTTCATTG GATTGAGTAT ATTATCATCT TTCAAGATTC ATTATCTTTA 4437 |||||||| | ||||||| |||||||| |||| ||| |||||||||| ||| | |||| TTTGGGAAAC AGGTTCATTA AATTGAGTAC ATTAACATAA TTCAAGATTC ATTCTTTTTA 417 TTTCTCTTGT GTCGGTACGT GACACTCCGC TCCCTCATAT TCATTAATCC TCTTGTGTCG 4377 | | || |||| CTATCGTGGT GTCG...... .......... .......... .......... .......... 431 GTACGTGACA CTTCGATCCC CCACTACTAT GTGTCGGAAC GTGACACTTC GATCCTCTAA 4317 |||| |||| ||| | ||||| |||| .......... .......... .......... ......GAAC GTGATACTCC GATCCCCTAA 455 ATCTACGTGT CGGTTCGTGA CACTCGATCT CCTAAATCTA AGTGTCGGTT CGTGACACCA 4257 |||||||| |||||||||| ||| ||||| ||||| ||| ||||||||| ||| ||||| TGCTACGTGT CGGTTCGTGA CACCCGATCC CCTAATACTA CGTGTCGGTT CGTTACACCC 515 GATCCCCTAA ATCTACGTGT CAGTTCGTGA CACCCGATCC CCTAAATCTA CGTGTCGGTT 4197 |||| ||||| ||||||| | ||||||| |||||||||| ||| ||| ||||||||| GATCTCCTAA TACTACGTGC CGATTCGTGA CACCCGATCC ATTAATACTA TGTGTCGGTT 575 CGTGACACCC GATCCCTAAA T-CTACGTGT CGGTTCGTGA CACCCTATCC CCTAATCTCC 4138 |||||||||| ||||| | || | |||||||| |||||||||| ||||| |||| |||| CGTGACACCC GATCCATTAA TACTACGTGT CGGTTCGTGA CACCCGATCC CCTA...... 629 TTCTATCAAT TCATCAAGCC TTCTTTCTTA CCAAGGCATC ATCAATCTCA TTATTTTAGT 4078 | |||| || ||||||| .......... .......... .......... .......... ....ACCTCA TTCTTTTAGT 645 TCATCACGCC TTCTTTTATA CCAAGGCCCC ATCATTAACA A 4037 |||||| ||| |||||||||| ||||| | | |||||||||| | TCATCAAGCC TTCTTTTATA CCAAGACATC ATCATTAACA A 686 hqPGS_C06HBa0153O03.1-1-_SGN-E241789+ (4852 4423,4340 4144,4093 4037) ******************************************************************************** EST sequence 3 -strand 731 n (File: SGN-E578076-) 1 GCATCATCAA TCCCATTATT TTAGTTCATC ACGCCTTCTT TTATACCAAG GCCTCATTAT 61 GAACAAAGAG ATTAAGATTT TGCAAGATTT GGGATTCAAT AACTTCATCA TGCTTATATA 121 ATCACAATTA TATAGTTACA TTCATGCAAG CATACAATTA AGCACATAGC AGGGTTTACA 181 ATATTATCAA TACATATCAT TCTCTATTAA GAGTTTACTA CGAATATCGT AAGAGAAACC 241 ATAACCTACC TCCACCGAAG AATTGCGATC AACAAGTTAT CTTCTCAAAA TCCTTGCTAT 301 CCTCTTCGTT TCTCTTTCTT TTTCTGTTTT CTCTTTGTTC TTTCTATTTT TCTTATTCAA 361 ACGTCCTAAC GAGCAAAACA GAGAACAAAA CAACCCTAAA AATTTCAACT TTTTTCGGTT 421 TCCCGACTTC CAATTTACCA GAGATATAGA TAATTCACTG AAATTGAACA AGGGTTAAGA 481 GCAGAAGAAA TTTACGTTGT GATTAATTGG GGCAAAGCGT CGAACAGTTG AACTGCAAAT 541 TTGTTCTTCA GTTATAGATA CAAAAGATAG AGTCTTATAT GAGTTAAAGA AGACGTAGAG 601 TATACCCTAG TAAGCGAGCT GACCACGGCG GAATGAGGTG GTGAGTTGGT GGGTTTCGTC 661 GGTCAACCAA GAAATGAAAA GGAAATTGAA GTATGAAAAA CTACAGAAAA ATGACGCGTT 721 TGGCCGAGAA A Predicted gene structure (within gDNA segment 4829 to 1): Exon 1 4102 3738 ( 365 n); cDNA 1 362 ( 362 n); score: 0.877 MATCH C06HBa0153O03.1-1- SGN-E578076- 0.877 365 0.499 C PGS_C06HBa0153O03.1-1-_SGN-E578076- (4102 3738) Alignment (genomic DNA sequence = upper lines): GCATCATCAA TCTCATTATT TTAGTTCATC ACGCCTTCTT TTATACCAAG GCCCCATCAT 4043 |||||||||| || ||||||| |||||||||| |||||||||| |||||||||| ||| ||| || GCATCATCAA TCCCATTATT TTAGTTCATC ACGCCTTCTT TTATACCAAG GCCTCATTAT 60 TAACAAAGAG ATTAGGGTTT TGCAAGATTT GGGATTCAAT AACTTCATCA TGCTTATATA 3983 ||||||||| |||| | ||| |||||||||| |||||||||| |||||||||| |||||||||| GAACAAAGAG ATTAAGATTT TGCAAGATTT GGGATTCAAT AACTTCATCA TGCTTATATA 120 ACCACAATTA TAAAATTACA TTCATGCAAG CATACAATTA AGCACATAGC AGGGTTTACA 3923 | |||||||| || | ||||| |||||||||| |||||||||| |||||||||| |||||||||| ATCACAATTA TATAGTTACA TTCATGCAAG CATACAATTA AGCACATAGC AGGGTTTACA 180 ATATTATCAA TATATATCAT TCGCTATTAA GAGTTTACTA CGAATATCGT AAGAGAAACC 3863 |||||||||| || ||||||| || ||||||| |||||||||| |||||||||| |||||||||| ATATTATCAA TACATATCAT TCTCTATTAA GAGTTTACTA CGAATATCGT AAGAGAAACC 240 ATAACCTACC TCCACCGAAG ATTAGTGATC AAGCAAGAAA TTTCCCCAAG CTTTGTTCT- 3804 |||||||||| |||||||||| | | | |||| || |||| | | | | ||| | | || ATAACCTACC TCCACCGAAG AATTGCGATC AA-CAAGTTA TCTTCTCAAA ATCCTTGCTA 299 TCGTTTTCTC TCTTCCTCGT TCGATCCTCT -CTCTCTCTT TGTTCTTTCT ACTTTTCTTA 3745 || | ||| | || ||| | || | ||| | |||||| |||||||||| | |||||||| TCCTCTTC-G T-TT-CTCTT TC-TTTTTCT GTTTTCTCTT TGTTCTTTCT ATTTTTCTTA 355 TTCAAAC 3738 ||||||| TTCAAAC 362 hqPGS_C06HBa0153O03.1-1-_SGN-E578076- (4102 3738) ******************************************************************************** EST sequence 1 -strand 605 n (File: SGN-E347579-) 1 ATCCCCTAAT TCTACGTGTC GGTTCGTGAC ACCCGATCCC CTAATTCTAC GTGTCGGTTC 61 GTGACACCTG ATCCCCTAAT CTACGTGCCG GTTCGTGACA CCCGATCCCC TAATTCTACG 121 TGCCAGTTCG TGACACCCGA TCCCCTAATT CTACGTGTCG GTTCGTGACA CCCGATCCCC 181 TGCATGTGTC GGTACGTGAC ACTCCGATCC ACTAATATCA TTCTGTAAAT CATCAGGCCT 241 TCTCTATACC AAGGCATCAT CAATCCCATT ACTTTTATTC ATCAAGCCTT CTTCTATACC 301 AAGGCATCAT CATTAATAAG AGATTAGATT TTTATCAAGA TTTGGGATTC AATAACTTCA 361 TCATGCTTAA TATAATCACA ATTATATAAT CACGTTCATG CATGCATACA ATTAAGCATA 421 TAGCAGGGTT TACAATACTA CCAATACATA TCATTCTCTA TTAAGAGTTT ACTATGAAAG 481 CATGAAAACC ATAACCTACC TCCACCGAAG ATTAGTGATC AAGCAAGCAA ATTTTTCTCC 541 AAGCTTTGTT TCTCCCTTCT CGTTCGATTC TTCCTCTCTC TCTTGTTCTT TCTATTTTCT 601 TTATT Predicted gene structure (within gDNA segment 7499 to 2446): Exon 1 4361 3831 ( 531 n); cDNA 1 522 ( 522 n); score: 0.856 MATCH C06HBa0153O03.1-1- SGN-E347579- 0.856 531 0.878 C PGS_C06HBa0153O03.1-1-_SGN-E347579- (4361 3831) Alignment (genomic DNA sequence = upper lines): ATCCCCCACT ACTATGTGTC GGAACGTGAC ACTTCGATCC TCTAAATCTA CGTGTCGGTT 4302 |||||| | | ||| ||||| || |||||| || |||||| |||| |||| |||||||||| ATCCCCTAAT TCTACGTGTC GGTTCGTGAC AC-CCGATCC CCTAATTCTA CGTGTCGGTT 59 CGTGACACTC GATCTCCTAA ATCTAAGTGT CGGTTCGTGA CACCAGATCC CCTAAATCTA 4242 |||||||| |||| ||| | ||||| ||| |||||||||| |||| ||||| ||||| |||| CGTGACACCT GATCCCCT-A ATCTACGTGC CGGTTCGTGA CACCCGATCC CCTAATTCTA 118 CGTGTCAGTT CGTGACACCC GATCCCCTAA ATCTACGTGT CGGTTCGTGA CACCCGATCC 4182 |||| ||||| |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| CGTGCCAGTT CGTGACACCC GATCCCCTAA TTCTACGTGT CGGTTCGTGA CACCCGATCC 178 CTAAATCTAC GTGTCGGTTC GTGACAC-CC TATCCCCTAA TCTCCTTCTA TCAATTCATC 4123 | | | |||||||| | ||||||| || |||| |||| | || |||| | || ||||| C---CTGCAT GTGTCGGTAC GTGACACTCC GATCCACTAA TATCATTCTG T-AAATCATC 234 AAGCCTTCTT TCTTACCAAG GCATCATCAA TCTCATTATT TTAGTTCATC ACGCCTTCTT 4063 | ||||||| | ||||||| |||||||||| || ||||| | || |||||| | |||||||| AGGCCTTCTC T-ATACCAAG GCATCATCAA TCCCATTACT TTTATTCATC AAGCCTTCTT 293 TTATACCAAG GCCCCATCAT TAACAAAGAG ATTAGGGTTT T-GCAAGATT TGGGATTCAA 4004 ||||||||| || |||||| ||| ||||| ||||| ||| | ||||||| |||||||||| CTATACCAAG GCATCATCAT TAA-TAAGAG ATTAGATTTT TATCAAGATT TGGGATTCAA 352 TAACTTCATC ATGCTT-ATA TAACCACAAT TATAAAATTA CATTCATGCA AGCATACAAT 3945 |||||||||| |||||| ||| ||| |||||| |||| ||| | | |||||||| ||||||||| TAACTTCATC ATGCTTAATA TAATCACAAT TATATAATCA CGTTCATGCA TGCATACAAT 412 TAAGCACATA GCAGGGTTTA CAATATTATC AATATATATC ATTCGCTATT AAGAGTTTAC 3885 |||||| ||| |||||||||| ||||| || | |||| ||||| |||| ||||| |||||||||| TAAGCATATA GCAGGGTTTA CAATACTACC AATACATATC ATTCTCTATT AAGAGTTTAC 472 TACGAATATC GTAAGAGAAA CCATAACCTA CCTCCACCGA AGATTAGTGA TCAA 3831 || ||| | | | || ||| |||||||||| |||||||||| |||||||||| |||| TATGAA-A-- GCATGA-AAA CCATAACCTA CCTCCACCGA AGATTAGTGA TCAA 522 hqPGS_C06HBa0153O03.1-1-_SGN-E347579- (4361 3831) ******************************************************************************** EST sequence 101 -strand 717 n (File: SGN-E349726-) 1 TTATGTTCAT TAGATTGAGT ATATAACATC TTTCAAGATT CATTGTCTTT ATTTCTCTTG 61 TGTCGGTACG TGACATTCCG CTCCNTCATA TTCATTAATC TTCTTGTGTC GGTACGTGAT 121 ACTCTGATCC CCTAAATCTA CGTGTCGGAA CGTGACACTC CGATCCCCTA AATCTACGTG 181 TCGGTTCGTG ACACCTGATC CCCTAAATCT ACGTGTCGGT TCGTGACACC CGATCCCCTA 241 AATCTACGTG TCGGTTCGTG ACACCCGATC CCCTAAATCT ACGTGTCGGT TCGTGACACC 301 CGATCCCCTA ATCTATGTGT TGGTTCGTGA CACCTGATCC CTTAATCTAC GTGTCGGTTT 361 GTGACACCCG ATCCCCTAAT TCTACGTGTC AGTTCGTGAC ACCCGATCCC CTAATCTCAT 421 TCTATCAATT CATCAAGCCT TCTCCCTTAC CAAGGCATCA TCAATCTCAT TACTTTAGTT 481 CATCAAGCCT TCTCCCTTAC CAAGGCATCA TCATTAAAAA GAGATTAGGT TTTTACAAGA 541 TTTGGGATTC AATAACTTCA TCATGCTTAT ATAATAACAA TTATATAGTT ACATTCATGC 601 AAGCATACAA TTAAGCACAT AGCAGGGTTT ACAATATTAT CAATACATAT CATTCTCTAT 661 TAAGAGTTTA CTACGAATAT CGTAAGAGAA ACAATAACCT ACCTCCACCG AAGACTA Predicted gene structure (within gDNA segment 5331 to 3212): Exon 1 4428 3839 ( 590 n); cDNA 142 717 ( 576 n); score: 0.887 MATCH C06HBa0153O03.1-1- SGN-E349726- 0.887 590 0.823 C PGS_C06HBa0153O03.1-1-_SGN-E349726- (4428 3839) Alignment (genomic DNA sequence = upper lines): GTGTCGGTAC GTGACACTCC GCTCCCTCAT ATTCATTAAT CCTCTTGTGT CGGTACGTGA 4369 ||||||| || |||||||||| | |||| | | | ||| || |||| |||| ||||| GTGTCGGAAC GTGACACTCC GATCCC-C-- --T-A--AAT -CTA-CGTGT CGGTTCGTGA 191 CACTTCGATC CCCCACTACT ATGTGTCGGA ACGTGACACT TCGATCCTCT AAATCTACGT 4309 ||| | |||| ||| | || | ||||||| |||||||| |||||| || |||||||||| CACCT-GATC CCCTAAATCT ACGTGTCGGT TCGTGACAC- CCGATCCCCT AAATCTACGT 249 GTCGGTTCGT GACACTCGAT CTCCTAAATC TAAGTGTCGG TTCGTGACAC CAGATCCCCT 4249 |||||||||| ||||| |||| | |||||||| || ||||||| |||||||||| | |||||||| GTCGGTTCGT GACACCCGAT CCCCTAAATC TACGTGTCGG TTCGTGACAC CCGATCCCCT 309 AAATCTACGT GTCAGTTCGT GACACCCGAT CCCCTAAATC TACGTGTCGG TTCGTGACAC 4189 |||||| || || |||||| |||||| ||| |||| |||| |||||||||| || ||||||| -AATCTATGT GTTGGTTCGT GACACCTGAT -CCCTTAATC TACGTGTCGG TTTGTGACAC 367 CCGAT-CCCT AAATCTACGT GTCGGTTCGT GACACCCTAT CCCCTAATCT CCTTCTATCA 4130 ||||| |||| || ||||||| ||| |||||| ||||||| || |||||||||| | |||||||| CCGATCCCCT AATTCTACGT GTCAGTTCGT GACACCCGAT CCCCTAATCT CATTCTATCA 427 ATTCATCAAG CCTTCTTTCT TACCAAGGCA TCATCAATCT CATTATTTTA GTTCATCACG 4070 |||||||||| |||||| || |||||||||| |||||||||| ||||| |||| |||||||| | ATTCATCAAG CCTTCTCCCT TACCAAGGCA TCATCAATCT CATTACTTTA GTTCATCAAG 487 CCTTCTTTTA TACCAAGGCC CCATCATTAA CAAAGAGATT AGGGTTTTGC AAGATTTGGG 4010 |||||| ||||||||| ||||||||| ||||||||| ||| |||| | |||||||||| CCTTCTCCCT TACCAAGGCA TCATCATTAA -AAAGAGATT AGGTTTTTAC AAGATTTGGG 546 ATTCAATAAC TTCATCATGC TTATATAACC ACAATTATAA AATTACATTC ATGCAAGCAT 3950 |||||||||| |||||||||| |||||||| ||||||||| | |||||||| |||||||||| ATTCAATAAC TTCATCATGC TTATATAATA ACAATTATAT AGTTACATTC ATGCAAGCAT 606 ACAATTAAGC ACATAGCAGG GTTTACAATA TTATCAATAT ATATCATTCG CTATTAAGAG 3890 |||||||||| |||||||||| |||||||||| ||||||||| ||||||||| |||||||||| ACAATTAAGC ACATAGCAGG GTTTACAATA TTATCAATAC ATATCATTCT CTATTAAGAG 666 TTTACTACGA ATATCGTAAG AGAAACCATA ACCTACCTCC ACCGAAGATT A 3839 |||||||||| |||||||||| |||||| ||| |||||||||| |||||||| | | TTTACTACGA ATATCGTAAG AGAAACAATA ACCTACCTCC ACCGAAGACT A 717 hqPGS_C06HBa0153O03.1-1-_SGN-E349726- (4428 3839) ******************************************************************************** EST sequence 94 -strand 402 n (File: SGN-E357559-) 1 TGTGTTGGTT CGTGACACCT GATCCCTTAA TCTACGTGTC GGTTTGTGAC ACCCGATCCC 61 CTAATTCTAC GTGTCAGTTC GTGACACCCG ATCCCCTAAT CTCATTCTAT CAATTCATCA 121 AGCCTTCTCC CTTACCAAGG CATCATCAAT CTCATTACTT TAGTTCATCA AGCCTTCTCC 181 CTTACCAAGG CATCATCATT AAAAAGAGAT TAGGTTTTTA CAAGATTTGG GATTCAATAA 241 CTTCATCATG CTTATATAAT AACAATTATA TAGTTACATT CATGCAAGCA TACAATTAAG 301 CACATAGCAG GGTTTACAAT ATTATCAATA CATATCATTC TCTATTAAGA GTTTACTACG 361 AATATCGTAA GAGAAACAAT AACCTACCTC CACCGAAGAC TA Predicted gene structure (within gDNA segment 5305 to 3212): Exon 1 4240 3839 ( 402 n); cDNA 2 402 ( 401 n); score: 0.917 MATCH C06HBa0153O03.1-1- SGN-E357559- 0.917 402 1.000 C PGS_C06HBa0153O03.1-1-_SGN-E357559- (4240 3839) Alignment (genomic DNA sequence = upper lines): GTGTCAGTTC GTGACACCCG ATCCCCTAAA TCTACGTGTC GGTTCGTGAC ACCCGAT-CC 4182 |||| |||| |||||||| | || |||| || |||||||||| |||| ||||| ||||||| || GTGTTGGTTC GTGACACCTG AT-CCCTTAA TCTACGTGTC GGTTTGTGAC ACCCGATCCC 60 CTAAATCTAC GTGTCGGTTC GTGACACCCT ATCCCCTAAT CTCCTTCTAT CAATTCATCA 4122 |||| ||||| ||||| |||| ||||||||| |||||||||| ||| |||||| |||||||||| CTAATTCTAC GTGTCAGTTC GTGACACCCG ATCCCCTAAT CTCATTCTAT CAATTCATCA 120 AGCCTTCTTT CTTACCAAGG CATCATCAAT CTCATTATTT TAGTTCATCA CGCCTTCTTT 4062 |||||||| |||||||||| |||||||||| ||||||| || |||||||||| ||||||| AGCCTTCTCC CTTACCAAGG CATCATCAAT CTCATTACTT TAGTTCATCA AGCCTTCTCC 180 TATACCAAGG CCCCATCATT AACAAAGAGA TTAGGGTTTT GCAAGATTTG GGATTCAATA 4002 |||||||| | ||||||| || ||||||| ||||| |||| ||||||||| |||||||||| CTTACCAAGG CATCATCATT AA-AAAGAGA TTAGGTTTTT ACAAGATTTG GGATTCAATA 239 ACTTCATCAT GCTTATATAA CCACAATTAT AAAATTACAT TCATGCAAGC ATACAATTAA 3942 |||||||||| |||||||||| |||||||| | | |||||| |||||||||| |||||||||| ACTTCATCAT GCTTATATAA TAACAATTAT ATAGTTACAT TCATGCAAGC ATACAATTAA 299 GCACATAGCA GGGTTTACAA TATTATCAAT ATATATCATT CGCTATTAAG AGTTTACTAC 3882 |||||||||| |||||||||| |||||||||| | |||||||| | |||||||| |||||||||| GCACATAGCA GGGTTTACAA TATTATCAAT ACATATCATT CTCTATTAAG AGTTTACTAC 359 GAATATCGTA AGAGAAACCA TAACCTACCT CCACCGAAGA TTA 3839 |||||||||| |||||||| | |||||||||| |||||||||| || GAATATCGTA AGAGAAACAA TAACCTACCT CCACCGAAGA CTA 402 hqPGS_C06HBa0153O03.1-1-_SGN-E357559- (4240 3839) ******************************************************************************** EST sequence 95 -strand 239 n (File: SGN-E391780-) 1 ATCAACAAAT ACTATATCAT TAACAATTAC CGTCAAGTTC ACACATGAGG ACTCAAGCCT 61 CAATACCATA CTCATTTGGG AATCATGTTC ATTAGATTGA GTATATTAAC ATCTTTCAAG 121 ATTCATTATC TTTATTTCTC TTGTGTCGGT ACGTGACACT CCGCTCCCTC AATATTCATT 181 AATCCTCTTG TGTCGGTACG TGACACTCCG ATCCCCTAAA TCTATATGTC GGTTTGTGA Predicted gene structure (within gDNA segment 5225 to 3373): Exon 1 4568 4333 ( 236 n); cDNA 3 239 ( 237 n); score: 0.926 MATCH C06HBa0153O03.1-1- SGN-E391780- 0.926 236 0.987 C PGS_C06HBa0153O03.1-1-_SGN-E391780- (4568 4333) Alignment (genomic DNA sequence = upper lines): CATCAAATAC TATATCATTA ACAATTACCG TCAAGTTCAC ACACGAGGAC TCAAGCCTCA 4509 || ||||||| |||||||||| |||||||||| |||||||||| ||| |||||| |||||||||| CAACAAATAC TATATCATTA ACAATTACCG TCAAGTTCAC ACATGAGGAC TCAAGCCTCA 62 ATACCGTACT CATTTGGGAA TTATGTTCAT TGGATTGAGT ATATTATCAT CTTTCAAGAT 4449 ||||| |||| |||||||||| | |||||||| | |||||||| |||||| ||| |||||||||| ATACCATACT CATTTGGGAA TCATGTTCAT TAGATTGAGT ATATTAACAT CTTTCAAGAT 122 TCATTATCTT TATTTCTCTT GTGTCGGTAC GTGACACTCC GCTCCCTC-A TATTCATTAA 4390 |||||||||| |||||||||| |||||||||| |||||||||| |||||||| | |||||||||| TCATTATCTT TATTTCTCTT GTGTCGGTAC GTGACACTCC GCTCCCTCAA TATTCATTAA 182 TCCTCTTGTG TCGGTACGTG ACACTTCGAT CCCCCACTAC TATGTGTCGG AACGTGA 4333 |||||||||| |||||||||| ||||| |||| |||| | | ||| |||||| |||| TCCTCTTGTG TCGGTACGTG ACACTCCGAT CCCCTAAATC TATATGTCGG TTTGTGA 239 hqPGS_C06HBa0153O03.1-1-_SGN-E391780- (4568 4333) ******************************************************************************** EST sequence 79 -strand 481 n (File: SGN-E246710-) 1 AGTCCAAGCT AGAAGCATTA GCTCACCCTG AATTTCCGAT GTAGTAAGAC TGGCTTGAAT 61 TACTGTTGAG TTGAACACGA TGGCACGTTT GCTGCACTCC ACAAATAAAC AAGAAGAGAA 121 CATAAAAGTA GGGGTCAGTA CAAAACACGG GTACTGAGTA GATATCATCG GCCAACTCAA 181 AATAGAAATC AATATATATA CCAAGTAATA TCATAAAATC AACTATGATA CTCAACATGT 241 AGCAACAACA AATACTATAT CATTAACAAT TACCGTCAAG TTCACACATG AGGACTCAAG 301 CCTCAATACC ATACTCATTT GGGAATCATG TTCATTAGAT TGAGTATATT AACATCTTTC 361 AAGATTCATT ATCTTTATTT CTCTTGTGTC GGTACGTGAC ACTCCGCTCC CTCAATATTC 421 ATTAATCCTC TTGTGTCGGT ACGTGACACT CCGATCCCCT AAATCTATAT GTCGGTTTGT 481 G Predicted gene structure (within gDNA segment 5420 to 3383): Exon 1 4810 4334 ( 477 n); cDNA 1 481 ( 481 n); score: 0.915 MATCH C06HBa0153O03.1-1- SGN-E246710- 0.915 477 0.992 C PGS_C06HBa0153O03.1-1-_SGN-E246710- (4810 4334) Alignment (genomic DNA sequence = upper lines): AGTCCAAGCT AGAAGCATTA GCTCACCCTG -ATATCCGGA GTAATGAAGA CTGGCTAGAG 4752 |||||||||| |||||||||| |||||||||| || |||| ||| | |||| |||||| || AGTCCAAGCT AGAAGCATTA GCTCACCCTG AATTTCCGAT GTAGT-AAGA CTGGCTTGAA 59 TTACTGTTGA GTCGAAGATG ACGGCACGTT TGCTGCACTC CACAAATAAA CAAGAAGAAA 4692 |||||||||| || ||| | | | |||||||| |||||||||| |||||||||| |||||||| | TTACTGTTGA GTTGAACACG ATGGCACGTT TGCTGCACTC CACAAATAAA CAAGAAGAGA 119 ACATAAAAGT AGGGGTCAGT AC-AAACACG GGTACTGAGT AGATATCATC GGCCAACTCA 4633 |||||||||| |||||||||| || ||||||| |||||||||| |||||||||| |||||||||| ACATAAAAGT AGGGGTCAGT ACAAAACACG GGTACTGAGT AGATATCATC GGCCAACTCA 179 AAATAGAGAT C-A-ATATAT ACCAAGTAAT ATCATAAAAT CAACTATGAT ACTCAACATG 4575 ||||||| || | | |||||| |||||||||| |||||||||| |||||||||| |||||||||| AAATAGAAAT CAATATATAT ACCAAGTAAT ATCATAAAAT CAACTATGAT ACTCAACATG 239 TAGCAACATC AAATACTATA TCATTAACAA TTACCGTCAA GTTCACACAC GAGGACTCAA 4515 |||||||| | |||||||||| |||||||||| |||||||||| ||||||||| |||||||||| TAGCAACAAC AAATACTATA TCATTAACAA TTACCGTCAA GTTCACACAT GAGGACTCAA 299 GCCTCAATAC CGTACTCATT TGGGAATTAT GTTCATTGGA TTGAGTATAT TATCATCTTT 4455 |||||||||| | |||||||| ||||||| || ||||||| || |||||||||| || ||||||| GCCTCAATAC CATACTCATT TGGGAATCAT GTTCATTAGA TTGAGTATAT TAACATCTTT 359 CAAGATTCAT TATCTTTATT TCTCTTGTGT CGGTACGTGA CACTCCGCTC CCTC-ATATT 4396 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||| ||||| CAAGATTCAT TATCTTTATT TCTCTTGTGT CGGTACGTGA CACTCCGCTC CCTCAATATT 419 CATTAATCCT CTTGTGTCGG TACGTGACAC TTCGATCCCC CACTACTATG TGTCGGAACG 4336 |||||||||| |||||||||| |||||||||| | |||||||| | |||| |||||| | CATTAATCCT CTTGTGTCGG TACGTGACAC TCCGATCCCC TAAATCTATA TGTCGGTTTG 479 TG 4334 || TG 481 hqPGS_C06HBa0153O03.1-1-_SGN-E246710- (4810 4334) ******************************************************************************** EST sequence 208 +strand 730 n (File: SGN-E546506+) 1 TTTTTTTTTT TTTTTTTTAA TAAAAACAAT TCAATACTAT TATTATTATC CCCAAAATCT 61 GGAAGTCATC ATCACAAGAA CATCTATCTC AAATTACTTA ACTAGGAATG TCTAAGAACA 121 AAATAACTAA AAAGCTAGTC CATGCCGGAA ATTCAAGGCA TCAAGACTTG AAGAAGAAGA 181 CCCAGTCCAA GCTAGACGCA TTAGCTCACC CTGAATTTTC CGATGAAGTG AAGACTGGCT 241 AGATCTACTG TTGAGTTGAA GTTGACGGAA CGTTTGCTGC ATTACACAAA TAACAAAGAG 301 GAAAACATGA AAGTAGGGGT CAGTACAACC ACACGTACTG AGTAGATATC ATCGGCCAAC 361 TCAAAATAGG GAACAGTATA TATCAATAAT AATGTAAATC AACTACAATA CTCAACATGT 421 AGCAATAACA CCATGAATTC ATCAATAACT ACAACCGAGT TCACACATGA GGACTCAAGC 481 CTCAATACCA TACTCATTTG GGAATTAAGT TCATTAGATT GAGTATATTC ATTATCTTTC 541 AAGATTCATT ATCTTTCTTC CTCTTGTGTC GGTACGTGAC ACTCCGATCC TCTATTTCTA 601 TCCTGGTGCC GGAACGTGGC ACTCCGATCC TCATTCTATC CTGGTACCGG AACGTGGCAC 661 CCGATCCATT TTCTATCCTG GTGTCGGAAC GTGACACTCC GATCCTCATA TTCTATCCTG 721 GTACCGGAAC Predicted gene structure (within gDNA segment 5956 to 1646): Exon 1 4956 4347 ( 610 n); cDNA 41 639 ( 599 n); score: 0.839 PPA cDNA 18 1 MATCH C06HBa0153O03.1-1- SGN-E546506+ 0.839 610 0.836 C PGS_C06HBa0153O03.1-1-_SGN-E546506+ (4956 4347) Alignment (genomic DNA sequence = upper lines): TATTATTATC CCCAAAATCT GGAAGTCATC ATCACAAGAA CATCTACTTC AAACTACTAA 4897 |||||||||| |||||||||| |||||||||| |||||||||| |||||| || ||| |||| | TATTATTATC CCCAAAATCT GGAAGTCATC ATCACAAGAA CATCTATCTC AAATTACTTA 100 ATCTAAGAGT TTCTAAGAAG CTAAAAATAC ATAAAAGCTA GTCCATGCCG GAACTTCAAA 4837 | ||| || | |||||||| | |||| || |||||||| |||||||||| ||| ||||| A-CTAGGAAT GTCTAAGAA- C-AAAATAAC TAAAAAGCTA GTCCATGCCG GAAATTCAAG 157 GCATCAAGAC ATGAAGAGGA AGATCCAGTC CAAGCTAGAA GCATTAGCTC ACCCTG-A-T 4779 |||||||||| |||||| || ||| |||||| ||||||||| |||||||||| |||||| | | GCATCAAGAC TTGAAGAAGA AGACCCAGTC CAAGCTAGAC GCATTAGCTC ACCCTGAATT 217 ATCCGGAGTA ATGAAGACTG GCTAGAGTTA CTGTTGAGTC GAAGATGACG GCACGTTTGC 4719 |||| | | ||||||||| |||||| || ||||||||| |||| ||||| | |||||||| TTCCGATGAA GTGAAGACTG GCTAGATCTA CTGTTGAGTT GAAGTTGACG GAACGTTTGC 277 TGCACTCCAC AAATAAACAA GAAGAAAACA TAAAAGTAGG GGTCAGTACA AACACGGGTA 4659 |||| | ||| |||||| || || ||||||| | |||||||| |||||||||| | ||| ||| TGCATTACAC AAATAACAAA GAGGAAAACA TGAAAGTAGG GGTCAGTACA ACCACACGTA 337 CTGAGTAGAT ATCATCGGCC AACTCAAAAT AGAGATCAAT ATATACCAAG TAATATCATA 4599 |||||||||| |||||||||| |||||||||| || || || | ||||| ||| ||||| | CTGAGTAGAT ATCATCGGCC AACTCAAAAT AGGGAACAGT ATATATCAA- TAATAATGT- 395 AAATCAACTA TGATACTCAA CATGTAGCAA CATCAAATAC TATATCATTA ACAATTACCG 4539 |||||||||| |||||||| |||||||||| | | | | | |||| | | || ||| AAATCAACTA CAATACTCAA CATGTAGCAA TAAC-ACCAT GAATTCATCA ATAACTACAA 454 TCAAGTTCAC ACACGAGGAC TCAAGCCTCA ATACCGTACT CATTTGGGAA TTATGTTCAT 4479 | ||||||| ||| |||||| |||||||||| ||||| |||| |||||||||| ||| |||||| CCGAGTTCAC ACATGAGGAC TCAAGCCTCA ATACCATACT CATTTGGGAA TTAAGTTCAT 514 TGGATTGAGT ATATT-ATCA TCTTTCAAGA TTCATTATCT TTATTTCTCT TGTGTCGGTA 4420 | |||||||| ||||| || | |||||||||| |||||||||| || || |||| |||||||||| TAGATTGAGT ATATTCATTA TCTTTCAAGA TTCATTATCT TTCTTCCTCT TGTGTCGGTA 574 CGTGACACTC CGCTCCCTCA TATTCATTAA TCCTCTTGTG TCGGTACGTG ACACTTCGAT 4360 |||||||||| || | |||| |||| | | | | || ||| ||| ||||| |||| |||| CGTGACACTC CGAT-CCTC- TATT--TCTA T-C-CTGGTG CCGGAACGTG GCACTCCGAT 628 CCCCCACTAC TAT 4347 || || | | ||| -CCTCA-TTC TAT 639 hqPGS_C06HBa0153O03.1-1-_SGN-E546506+ (4956 4347) ******************************************************************************** EST sequence 196 +strand 337 n (File: SGN-E357033+) 1 ACTGAATAGA TATCATCGCC CAACTCAAAA TAGAAATCAA TATATATCAA GNATTATCAT 61 AAAATCAACT ATGATACTCA ACATGTAGCA ACAACAAGCA CTATATCATT AACAATTACC 121 GTCACGTTCA CACATGAGGA CTCAAGCCTC AATACCATAC TCATTTGGGA ATCATGTTCA 181 TTAGATTTAG TATATTAACA TCTTTCAAGA TTCATTATCT TTATTTCTCT TGTGACGGAA 241 CATGACATTC CGCTCCCTCA TATTCATTAA TCCTCTTGTG TGAACACATG ACACTCCGAT 301 CCCCTAAATC TACATGACAG TTTCATGAAC GCTATCC Predicted gene structure (within gDNA segment 5745 to 3209): Exon 1 4659 4379 ( 281 n); cDNA 1 281 ( 281 n); score: 0.929 MATCH C06HBa0153O03.1-1- SGN-E357033+ 0.929 281 0.834 C PGS_C06HBa0153O03.1-1-_SGN-E357033+ (4659 4379) Alignment (genomic DNA sequence = upper lines): ACTGAGTAGA TATCATCGGC CAACTCAAAA TAGAGATCAA TATATACCAA GTAATATCAT 4600 ||||| |||| |||||||| | |||||||||| |||| ||||| |||||| ||| | | |||||| ACTGAATAGA TATCATCGCC CAACTCAAAA TAGAAATCAA TATATATCAA GNATTATCAT 60 AAAATCAACT ATGATACTCA ACATGTAGCA ACATCAAATA CTATATCATT AACAATTACC 4540 |||||||||| |||||||||| |||||||||| ||| ||| | |||||||||| |||||||||| AAAATCAACT ATGATACTCA ACATGTAGCA ACAACAAGCA CTATATCATT AACAATTACC 120 GTCAAGTTCA CACACGAGGA CTCAAGCCTC AATACCGTAC TCATTTGGGA ATTATGTTCA 4480 |||| ||||| |||| ||||| |||||||||| |||||| ||| |||||||||| || ||||||| GTCACGTTCA CACATGAGGA CTCAAGCCTC AATACCATAC TCATTTGGGA ATCATGTTCA 180 TTGGATTGAG TATATTATCA TCTTTCAAGA TTCATTATCT TTATTTCTCT TGTGTCGGTA 4420 || |||| || ||||||| || |||||||||| |||||||||| |||||||||| |||| ||| | TTAGATTTAG TATATTAACA TCTTTCAAGA TTCATTATCT TTATTTCTCT TGTGACGGAA 240 CGTGACACTC CGCTCCCTCA TATTCATTAA TCCTCTTGTG T 4379 | ||||| || |||||||||| |||||||||| |||||||||| | CATGACATTC CGCTCCCTCA TATTCATTAA TCCTCTTGTG T 281 hqPGS_C06HBa0153O03.1-1-_SGN-E357033+ (4659 4379) ******************************************************************************** EST sequence 102 -strand 236 n (File: SGN-E209683-) 1 CACAAATAAC AAGAAGATAA ACATAAAAGT AGGGGTCAGT ACAAACCACG GGTACTGAGT 61 AGATATCATC GGCCAACTCA AAATAGGGAA CAGTATGTAT TAAGCAATAT CATAAAATCA 121 ACTAATATCC TTAACATGCA GCATTTATAG TTACCATAAC CCTTGGTTAC AACACCAAGC 181 ACATCAATGA GGACTCACAC CTCCTCATCA TACTCATTTG GGAATTTAGT TCATTA Predicted gene structure (within gDNA segment 5483 to 2527): Exon 1 4711 4559 ( 153 n); cDNA 1 154 ( 154 n); score: 0.837 MATCH C06HBa0153O03.1-1- SGN-E209683- 0.837 153 0.648 C PGS_C06HBa0153O03.1-1-_SGN-E209683- (4711 4559) Alignment (genomic DNA sequence = upper lines): CACAAATAAA CAAGAAGA-A AACATAAAAG TAGGGGTCAG TACAAA-CAC GGGTACTGAG 4654 ||||||| || |||||||| | |||||||||| |||||||||| |||||| ||| |||||||||| CACAAAT-AA CAAGAAGATA AACATAAAAG TAGGGGTCAG TACAAACCAC GGGTACTGAG 59 TAGATATCAT CGGCCAACTC AAAATAGAGA TCAATATATA CCAAGTAATA TCATAAAATC 4594 |||||||||| |||||||||| ||||||| || || ||| || ||| |||| |||||||||| TAGATATCAT CGGCCAACTC AAAATAGGGA ACAGTATGTA TTAAGCAATA TCATAAAATC 119 AACTATGATA CTCAACATGT AGCAACATCA AATAC 4559 ||||| || || |||||| |||| | ||| AACTAATATC CTTAACATGC AGCATTTATA GTTAC 154 hqPGS_C06HBa0153O03.1-1-_SGN-E209683- (4711 4559) ******************************************************************************** EST sequence 46 -strand 729 n (File: SGN-E351546-) 1 AGTCGTTGCT CTAGTTCTAC CCATCTGGCA AGAGAGTGAG NATGGTCAGA TACCAATTCG 61 TATCGCTTAG ATACCAATTG ACTCGAAGTA GTAGCACGAA AGAAAGAATG AAAGAGTGAA 121 GTTTTCCTAA AGTCTTATAG CCTCTCAAGG AAAAGTAAAA GCGTCCCCCT ACCGTTCCTT 181 AAGACTCTAC TAGACTTGTT CTTGTGTGAT GAGACCAACG AACCTAATGC TCTGATACCA 241 AGTTTTGTCA CGACCCAAAT CCGGGCCGCC ACTGGCACCC ACACTTACCC TCCTATGTGA 301 GCGAACCAAC CAATCTAAAC CTTAACATTT CAATGTAATA GCAACAGAAA GTAATGCGGA 361 AGACTTAAAC TCATTAATAA AATCAATAAC TACTATTATT AAACATCTAT TATTCCCAAA 421 ACCTGGAAGT CATCATCACA AGAACATCTA CTTTAAACTA CTAATTCTAA GAGTTTCTAA 481 GAAGCTAAAA AATTACATAA GAAGCTAGTC CATGCCGGAA GTTCAAGGCA TCAAGACATG 541 AAGGAGAAGA TCCAGTCCAA GCTAGAAGCG TTAGCTCACC CTGAAGATCC GGTGTGACGA 601 AGACTGGCTT GAGTTACTGT TGAGTCGAAG ATGACGGCAC GTTTGCTGCA CTCCACAAAT 661 ACCAAGAAGA AAAACATAAA AGTAGGGGTC AGTACAAAAC ACGGCTACTG AGTAGATATC 721 ATCGGCCAA Predicted gene structure (within gDNA segment 6774 to 4037): Exon 1 5346 4637 ( 710 n); cDNA 1 729 ( 729 n); score: 0.858 MATCH C06HBa0153O03.1-1- SGN-E351546- 0.858 710 0.974 C PGS_C06HBa0153O03.1-1-_SGN-E351546- (5346 4637) Alignment (genomic DNA sequence = upper lines): AGTTGTTGCT CTAGTTCTAA CCATCTGCGA AACAGAGTGA AGATGGTCAG ATACCAATTT 5287 ||| |||||| ||||||||| ||||||| | || ||||||| |||||||| ||||||||| AGTCGTTGCT CTAGTTCTAC CCATCTG-GC AAGAGAGTGA GNATGGTCAG ATACCAATTC 59 GTATCACCTA GATACCAATT GGACCCAAGT AATAGCACGA AAGAAAGAAT GAAAGAATGG 5227 ||||| | || |||||||||| | | |||| | |||||||| |||||||||| |||||| || GTATCGCTTA GATACCAATT GACTCGAAGT AGTAGCACGA AAGAAAGAAT GAAAGAGTGA 119 AATTTTCCTA AAGTCTTATA GCCCCTCAAA GAAAAGTAAA GGTGTCCCCC TACCGTTCCT 5167 | |||||||| |||||||||| ||| ||||| |||||||||| | ||||||| |||||||||| AGTTTTCCTA AAGTCTTATA GCCTCTCAAG GAAAAGTAAA AGCGTCCCCC TACCGTTCCT 179 TAAGACTCTA CCAGACTCGT TCTTGTGTGA TGAGACCAAC GAACCTAATG CTCTGATACC 5107 |||||||||| | ||||| || |||||||||| |||||||||| |||||||||| |||||||||| TAAGACTCTA CTAGACTTGT TCTTGTGTGA TGAGACCAAC GAACCTAATG CTCTGATACC 239 AAG-TTTGTC ACGACCCAAA ACCGGATCGC GACTGGCACC CACACTTACC CTCCTATGTG 5048 ||| |||||| |||||||||| |||| ||| ||||||||| |||||||||| |||||||||| AAGTTTTGTC ACGACCCAAA TCCGGGCCGC CACTGGCACC CACACTTACC CTCCTATGTG 299 AGCGAACCAA CCAATCTAAA CCTTAATATT TCAATAGAAT ATCAACAGAA AGTAATGCGG 4988 |||||||||| |||||||||| |||||| ||| ||||| ||| | |||||||| |||||||||| AGCGAACCAA CCAATCTAAA CCTTAACATT TCAATGTAAT AGCAACAGAA AGTAATGCGG 359 AAGACTTAAA CTCATTAATA AAATCAATAA --A-TATTAT T--A--TC-- -----CCCAA 4942 |||||||||| |||||||||| |||||||||| | |||||| | | || ||||| AAGACTTAAA CTCATTAATA AAATCAATAA CTACTATTAT TAAACATCTA TTATTCCCAA 419 AATCTGGAAG TCATCATCAC AAGAACATCT ACTTCAAACT ACTAAATCTA AGAGTTTCTA 4882 || ||||||| |||||||||| |||||||||| |||| ||||| ||||| |||| |||||||||| AACCTGGAAG TCATCATCAC AAGAACATCT ACTTTAAACT ACTAATTCTA AGAGTTTCTA 479 AGAAGCT-AA AAA-TACATA A-AAGCTAGT CCATGCCGGA ACTTCAAAGC ATCAAGACAT 4825 ||||||| || ||| |||||| | |||||||| |||||||||| | ||||| || |||||||||| AGAAGCTAAA AAATTACATA AGAAGCTAGT CCATGCCGGA AGTTCAAGGC ATCAAGACAT 539 GAAGAGGAAG ATCCAGTCCA AGCTAGAAGC ATTAGCTCAC CCTG-ATATC CGGAGTAATG 4766 |||| |||| |||||||||| |||||||||| ||||||||| |||| | ||| ||| || | | GAAGGAGAAG ATCCAGTCCA AGCTAGAAGC GTTAGCTCAC CCTGAAGATC CGGTGTGACG 599 AAGACTGGCT AGAGTTACTG TTGAGTCGAA GATGACGGCA CGTTTGCTGC ACTCCACAAA 4706 |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAGACTGGCT TGAGTTACTG TTGAGTCGAA GATGACGGCA CGTTTGCTGC ACTCCACAAA 659 TAAACAAGAA G-AAAACATA AAAGTAGGGG TCAGTAC-AA ACACGGGTAC TGAGTAGATA 4648 | | |||||| | |||||||| |||||||||| ||||||| || |||||| ||| |||||||||| T-ACCAAGAA GAAAAACATA AAAGTAGGGG TCAGTACAAA ACACGGCTAC TGAGTAGATA 718 TCATCGGCCA A 4637 |||||||||| | TCATCGGCCA A 729 hqPGS_C06HBa0153O03.1-1-_SGN-E351546- (5346 4637) ******************************************************************************** EST sequence 87 -strand 655 n (File: SGN-E356696-) 1 CAATTGGACT CAAGTAGTAG CACGAAAGAA AGAATGAAAG AGTGAAGTTT TCCTAAAGTC 61 TTATAGCCTC TCAAGGAAAA GTAAAAGCGT CCCCCTACCG TTCCTTAAGA CTCTACTAGA 121 CTTGTTCTTG TGTGATGAGA CCAACGAACC TAATGCTCTG ATACCAAGTT TTGTCACGAC 181 CCAAATCCGG GCCGCCACTG GCACCCACAC TTACCCTCNT ATGTGAGCGA ACCAACCAAT 241 CTAAACCTTA ACATTTCAAT GTAATAGCAA CAGAAAGTAA TGCGGAAGAC TTAAACTCAT 301 TAATAAAATC AATAACTACT ATTATTAAAC ATCTATTATT CCCAAAACCT GGAAGTCATC 361 ATCACAAGAA CATCTACTTT AAACTACTAA TTCTAAGAGT TTCTAAGAAG CTAAAAAATT 421 ACATAAGAAG CTAGTCCATG CCGGAAGTTC AAGGCATCAA GACATGAAGG AGAAGATCCA 481 GTCCAAGCTA GAAGCGTTAG CTCACCCTGA AGATCCGGTG TGACGAAGAC TGGCTTGAGT 541 TACTGTTGAG TCGAAGATGA CGGCACGTTT GCTGCACTCC ACAAATAACA AGAAGAAAAA 601 CATAAAAGTA GGGGTCAGTA CAAAACACGG CTACTGAGTA GATATCATCG GCCAA Predicted gene structure (within gDNA segment 6034 to 4037): Exon 1 5271 4637 ( 635 n); cDNA 1 655 ( 655 n); score: 0.862 MATCH C06HBa0153O03.1-1- SGN-E356696- 0.862 635 0.969 C PGS_C06HBa0153O03.1-1-_SGN-E356696- (5271 4637) Alignment (genomic DNA sequence = upper lines): CAATTGGACC CAAGTAATAG CACGAAAGAA AGAATGAAAG AATGGAATTT TCCTAAAGTC 5212 ||||||||| |||||| ||| |||||||||| |||||||||| | || | ||| |||||||||| CAATTGGACT CAAGTAGTAG CACGAAAGAA AGAATGAAAG AGTGAAGTTT TCCTAAAGTC 60 TTATAGCCCC TCAAAGAAAA GTAAAGGTGT CCCCCTACCG TTCCTTAAGA CTCTACCAGA 5152 |||||||| | |||| ||||| ||||| | || |||||||||| |||||||||| |||||| ||| TTATAGCCTC TCAAGGAAAA GTAAAAGCGT CCCCCTACCG TTCCTTAAGA CTCTACTAGA 120 CTCGTTCTTG TGTGATGAGA CCAACGAACC TAATGCTCTG ATACCAAG-T TTGTCACGAC 5093 || ||||||| |||||||||| |||||||||| |||||||||| |||||||| | |||||||||| CTTGTTCTTG TGTGATGAGA CCAACGAACC TAATGCTCTG ATACCAAGTT TTGTCACGAC 180 CCAAAACCGG ATCGCGACTG GCACCCACAC TTACCCTCCT ATGTGAGCGA ACCAACCAAT 5033 ||||| |||| ||| |||| |||||||||| |||||||| | |||||||||| |||||||||| CCAAATCCGG GCCGCCACTG GCACCCACAC TTACCCTCNT ATGTGAGCGA ACCAACCAAT 240 CTAAACCTTA ATATTTCAAT AGAATATCAA CAGAAAGTAA TGCGGAAGAC TTAAACTCAT 4973 |||||||||| | |||||||| |||| ||| |||||||||| |||||||||| |||||||||| CTAAACCTTA ACATTTCAAT GTAATAGCAA CAGAAAGTAA TGCGGAAGAC TTAAACTCAT 300 TAATAAAATC AATAA--A-T ATTATT---- ATC------- CCCAAAATCT GGAAGTCATC 4927 |||||||||| ||||| | | |||||| ||| ||||||| || |||||||||| TAATAAAATC AATAACTACT ATTATTAAAC ATCTATTATT CCCAAAACCT GGAAGTCATC 360 ATCACAAGAA CATCTACTTC AAACTACTAA ATCTAAGAGT TTCTAAGAAG CT-AAAAA-T 4869 |||||||||| ||||||||| |||||||||| ||||||||| |||||||||| || ||||| | ATCACAAGAA CATCTACTTT AAACTACTAA TTCTAAGAGT TTCTAAGAAG CTAAAAAATT 420 ACATAA-AAG CTAGTCCATG CCGGAACTTC AAAGCATCAA GACATGAAGA GGAAGATCCA 4810 |||||| ||| |||||||||| |||||| ||| || ||||||| ||||||||| ||||||||| ACATAAGAAG CTAGTCCATG CCGGAAGTTC AAGGCATCAA GACATGAAGG AGAAGATCCA 480 GTCCAAGCTA GAAGCATTAG CTCACCCTG- ATATCCGGAG TAATGAAGAC TGGCTAGAGT 4751 |||||||||| ||||| |||| ||||||||| | |||||| | | | |||||| ||||| |||| GTCCAAGCTA GAAGCGTTAG CTCACCCTGA AGATCCGGTG TGACGAAGAC TGGCTTGAGT 540 TACTGTTGAG TCGAAGATGA CGGCACGTTT GCTGCACTCC ACAAATAAAC AAGAAG-AAA 4692 |||||||||| |||||||||| |||||||||| |||||||||| |||||| ||| |||||| ||| TACTGTTGAG TCGAAGATGA CGGCACGTTT GCTGCACTCC ACAAAT-AAC AAGAAGAAAA 599 ACATAAAAGT AGGGGTCAGT AC-AAACACG GGTACTGAGT AGATATCATC GGCCAA 4637 |||||||||| |||||||||| || ||||||| | |||||||| |||||||||| |||||| ACATAAAAGT AGGGGTCAGT ACAAAACACG GCTACTGAGT AGATATCATC GGCCAA 655 hqPGS_C06HBa0153O03.1-1-_SGN-E356696- (5271 4637) ******************************************************************************** EST sequence 83 -strand 580 n (File: SGN-E356206-) 1 GAAAAGTAAA AGCGTCCCCN TACCGTCCCT TAAGACTCTA CTAGACTTGT TCTTGTGTGA 61 TGAGACCAAC GACCCTAATG CTCTGATACC AAGTTTTGTC ACGACCCAAA TCCGGGCCGC 121 CACTGGCACC CACACTTACC CTCCTATGTG AGCGAACCAA CCAATCTAAA CCTTAACATT 181 TCAATGTAAT AGCAACAGAA AGTAATGCGG AAGACTTAAA CTCATTAATA AAATCAATAA 241 CTACTATTAT TAAACATCTA TTATTCCCAA AACCTGGAAG TCATCATCAC AAGAACATCT 301 ACTTTAAACT ACTAATTCTA AGAGTTTCTA AGAAGCTAAA AAATTACATA AGAAGCTAGT 361 CCATGCCGGA AGTTCAAGGC ATCAAGACAT GAAGGAGAAG ATCCAGTCCA AGCTAGAAGC 421 GTTAGCTCAC CCTGAAGATC CGGTGTGACG AAGACTGGCT TGAGTTACTG TTGAGTCGAA 481 GATGACGGCA CGTTTGCTGC ACTCCACAAA TAACAAGAAG AAAAACATAA AAGTAGGGGT 541 CAGTACAAAA CACGGCTACT GAGTAGATAT CATCGGCCAA Predicted gene structure (within gDNA segment 6238 to 4037): Exon 1 5196 4637 ( 560 n); cDNA 1 580 ( 580 n); score: 0.853 MATCH C06HBa0153O03.1-1- SGN-E356206- 0.853 560 0.966 C PGS_C06HBa0153O03.1-1-_SGN-E356206- (5196 4637) Alignment (genomic DNA sequence = upper lines): GAAAAGTAAA GGTGTCCCCC TACCGTTCCT TAAGACTCTA CCAGACTCGT TCTTGTGTGA 5137 |||||||||| | |||||| |||||| ||| |||||||||| | ||||| || |||||||||| GAAAAGTAAA AGCGTCCCCN TACCGTCCCT TAAGACTCTA CTAGACTTGT TCTTGTGTGA 60 TGAGACCAAC GAACCTAATG CTCTGATACC AAG-TTTGTC ACGACCCAAA ACCGGATCGC 5078 |||||||||| || ||||||| |||||||||| ||| |||||| |||||||||| |||| ||| TGAGACCAAC GACCCTAATG CTCTGATACC AAGTTTTGTC ACGACCCAAA TCCGGGCCGC 120 GACTGGCACC CACACTTACC CTCCTATGTG AGCGAACCAA CCAATCTAAA CCTTAATATT 5018 ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||| ||| CACTGGCACC CACACTTACC CTCCTATGTG AGCGAACCAA CCAATCTAAA CCTTAACATT 180 TCAATAGAAT ATCAACAGAA AGTAATGCGG AAGACTTAAA CTCATTAATA AAATCAATAA 4958 ||||| ||| | |||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCAATGTAAT AGCAACAGAA AGTAATGCGG AAGACTTAAA CTCATTAATA AAATCAATAA 240 --A-TATTAT T----ATC-- -----CCCAA AATCTGGAAG TCATCATCAC AAGAACATCT 4912 | |||||| | ||| ||||| || ||||||| |||||||||| |||||||||| CTACTATTAT TAAACATCTA TTATTCCCAA AACCTGGAAG TCATCATCAC AAGAACATCT 300 ACTTCAAACT ACTAAATCTA AGAGTTTCTA AGAAGCT-AA AAA-TACATA A-AAGCTAGT 4855 |||| ||||| ||||| |||| |||||||||| ||||||| || ||| |||||| | |||||||| ACTTTAAACT ACTAATTCTA AGAGTTTCTA AGAAGCTAAA AAATTACATA AGAAGCTAGT 360 CCATGCCGGA ACTTCAAAGC ATCAAGACAT GAAGAGGAAG ATCCAGTCCA AGCTAGAAGC 4795 |||||||||| | ||||| || |||||||||| |||| |||| |||||||||| |||||||||| CCATGCCGGA AGTTCAAGGC ATCAAGACAT GAAGGAGAAG ATCCAGTCCA AGCTAGAAGC 420 ATTAGCTCAC CCTG-ATATC CGGAGTAATG AAGACTGGCT AGAGTTACTG TTGAGTCGAA 4736 ||||||||| |||| | ||| ||| || | | |||||||||| ||||||||| |||||||||| GTTAGCTCAC CCTGAAGATC CGGTGTGACG AAGACTGGCT TGAGTTACTG TTGAGTCGAA 480 GATGACGGCA CGTTTGCTGC ACTCCACAAA TAAACAAGAA G-AAAACATA AAAGTAGGGG 4677 |||||||||| |||||||||| |||||||||| | |||||||| | |||||||| |||||||||| GATGACGGCA CGTTTGCTGC ACTCCACAAA T-AACAAGAA GAAAAACATA AAAGTAGGGG 539 TCAGTAC-AA ACACGGGTAC TGAGTAGATA TCATCGGCCA A 4637 ||||||| || |||||| ||| |||||||||| |||||||||| | TCAGTACAAA ACACGGCTAC TGAGTAGATA TCATCGGCCA A 580 hqPGS_C06HBa0153O03.1-1-_SGN-E356206- (5196 4637) ******************************************************************************** EST sequence 110 +strand 434 n (File: SGN-E222578+) 1 TTTTTTTTTT TTTTTTTTTA ATAAAAACCA ATTCAATAAC TATCAATATT CAACATCTAT 61 TATTCCCAAA ACCTGGAAGT CATCATCACA AGAACATCTA CTTTAAACTA CTAATTCTAA 121 GAGTTTCTAA AAGCTAAAAA TACATAAGAA GCTAGTCCAT GCCGGAGGTT CAAGGCATCA 181 AGACATGAAG GAGAAGATCC AGTCCAAGCT AGACGCGTTA GCTCACCCTG AAGATCCGGT 241 GTGACGAAGA CTGGCTTGAG TTACTGTTGA GTCGAAGATG ACGGCACGTT TGCTGCACTC 301 CACAACTTTC TAGATGGGGA CTTTCTTCAA GGCTTCGAGA TGGAAACTTG CTTGCAGAGC 361 TTCGAGTGTT ACCAGCTTCA AGATGGAGTT TCAGTGATGA GGCTTGCTAG TCTCGAGTTT 421 TTTTTTTTTT TTTT Predicted gene structure (within gDNA segment 6258 to 2807): Exon 1 4953 4707 ( 247 n); cDNA 58 305 ( 248 n); score: 0.911 PPA cDNA 19 1 MATCH C06HBa0153O03.1-1- SGN-E222578+ 0.911 247 0.569 C PGS_C06HBa0153O03.1-1-_SGN-E222578+ (4953 4707) Alignment (genomic DNA sequence = upper lines): TATTATCCCC AAAATCTGGA AGTCATCATC ACAAGAACAT CTACTTCAAA CTACTAAATC 4894 |||||| ||| |||| ||||| |||||||||| |||||||||| |||||| ||| ||||||| || TATTATTCCC AAAACCTGGA AGTCATCATC ACAAGAACAT CTACTTTAAA CTACTAATTC 117 TAAGAGTTTC TAAGAAGCTA AAAATACATA A-AAGCTAGT CCATGCCGGA ACTTCAAAGC 4835 |||||||||| ||| |||||| |||||||||| | |||||||| |||||||||| ||||| || TAAGAGTTTC TAA-AAGCTA AAAATACATA AGAAGCTAGT CCATGCCGGA GGTTCAAGGC 176 ATCAAGACAT GAAGAGGAAG ATCCAGTCCA AGCTAGAAGC ATTAGCTCAC CCTG-ATATC 4776 |||||||||| |||| |||| |||||||||| ||||||| || ||||||||| |||| | ||| ATCAAGACAT GAAGGAGAAG ATCCAGTCCA AGCTAGACGC GTTAGCTCAC CCTGAAGATC 236 CGGAGTAATG AAGACTGGCT AGAGTTACTG TTGAGTCGAA GATGACGGCA CGTTTGCTGC 4716 ||| || | | |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| CGGTGTGACG AAGACTGGCT TGAGTTACTG TTGAGTCGAA GATGACGGCA CGTTTGCTGC 296 ACTCCACAA 4707 ||||||||| ACTCCACAA 305 hqPGS_C06HBa0153O03.1-1-_SGN-E222578+ (4953 4707) ******************************************************************************** EST sequence 204 +strand 710 n (File: SGN-E392027+) 1 CCACAGCCCC AGTGGCTGGC TCAGTCGCAC CCTGTCCCGC CGGTGCTGGT GTTGATGCTG 61 GCGTAGTCGT TGCTCTAGTT CTAACCATCT GCGAAATAGA GTGAAGATGG TCAGATACCA 121 ATTTGTATCA CCTAGATACC AATTGGACCC AAGTAATAGC ACGAAAGAAG AAAGAATGGA 181 ATTTTCCAAA AGTCTTATAG CCTCTCAAGG AAAAGTAAAG GCATCCCCCT ACCGTTCCTT 241 AAGACTCTAC TAGACTCGTT CTTGTGTGAT GAGACCAACG AACCTAATGC TCTGATACCA 301 AGTTTGTCAC GACCAAAACC GGGTTGCGAC TGGCACCCAC ACTTACCCTC CTATGTGAGC 361 GAACCAACCA ATCTAACCTT AACATTTCAA TATAATATCA ACAGAAAGTA ATGTGGAAGA 421 CTTAAACTCA TTAAATACAG ACCAATTCAT TAACTTCTAA AATTCAACAT CTATTATTCC 481 CCAAAATCTG GAAGTCATCA CCACAAGAAC ATCTACGATC AAATGACTAA ACTAAGAGTA 541 GTCTAAAAGC TAAAAATACA TAAGAAGCTA GTCCATGCCG GAAGTTCAAG GCATCAAGAC 601 TTGAAGAAGA AGATCCAGTC CAAGCTAGAA GCATTAGCTC ACCCTGAATT TCCGATGTAG 661 TAAGACTGGC TTGAATTACT GTTGAGTTGA ACACGATGGC ACGTTTGCTG Predicted gene structure (within gDNA segment 6622 to 3540): Exon 1 5342 4717 ( 626 n); cDNA 69 710 ( 642 n); score: 0.831 MATCH C06HBa0153O03.1-1- SGN-E392027+ 0.831 626 0.882 C PGS_C06HBa0153O03.1-1-_SGN-E392027+ (5342 4717) Alignment (genomic DNA sequence = upper lines): GTTGCTCTAG TTCTAACCAT CTGCGAAACA GAGTGAAGAT GGTCAGATAC CAATTTGTAT 5283 |||||||||| |||||||||| |||||||| | |||||||||| |||||||||| |||||||||| GTTGCTCTAG TTCTAACCAT CTGCGAAATA GAGTGAAGAT GGTCAGATAC CAATTTGTAT 128 CACCTAGATA CCAATTGGAC CCAAGTAATA GCACGAAAGA AAGAATGAAA GAATGGAATT 5223 |||||||||| |||||||||| |||||||||| |||| || ||||| |||| |||||||||| CACCTAGATA CCAATTGGAC CCAAGTAATA GCAC----GA AAGAA-GAAA GAATGGAATT 183 TTCCTAAAGT CTTATAGCCC CTCAAAGAAA AGTAAAGGTG TCCCCCTACC GTTCCTTAAG 5163 |||| ||||| ||||||||| ||||| |||| |||||||| |||||||||| |||||||||| TTCCAAAAGT CTTATAGCCT CTCAAGGAAA AGTAAAGGCA TCCCCCTACC GTTCCTTAAG 243 ACTCTACCAG ACTCGTTCTT GTGTGATGAG ACCAACGAAC CTAATGCTCT GATACCAAGT 5103 ||||||| || |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACTCTACTAG ACTCGTTCTT GTGTGATGAG ACCAACGAAC CTAATGCTCT GATACCAAGT 303 TTGTCACGAC CCAAAACCGG ATCGCGACTG GCACCCACAC TTACCCTCCT ATGTGAGCGA 5043 ||||||||| |||||||||| | ||||||| |||||||||| |||||||||| |||||||||| TTGTCACGA- CCAAAACCGG GTTGCGACTG GCACCCACAC TTACCCTCCT ATGTGAGCGA 362 ACCAACCAAT CTAAACCTTA ATATTTCAAT AGAATATCAA CAGAAAGTAA TGCGGAAGAC 4983 |||||||||| || ||||||| | |||||||| | |||||||| |||||||||| || ||||||| ACCAACCAAT CT-AACCTTA ACATTTCAAT ATAATATCAA CAGAAAGTAA TGTGGAAGAC 421 TTAAACTCAT T-AAT--A-A --AA-TC--- AA----T-AA A-T----AT- TATTA-TCCC 4945 |||||||||| | ||| | | || || || | || | | || ||||| |||| TTAAACTCAT TAAATACAGA CCAATTCATT AACTTCTAAA ATTCAACATC TATTATTCCC 481 CAAAATCTGG AAGTCATCAT CACAAGAACA TCTAC-TTCA AACTACTAAA TCTAAGAGT- 4887 |||||||||| ||||||||| |||||||||| ||||| ||| || |||||| |||||||| CAAAATCTGG AAGTCATCAC CACAAGAACA TCTACGATCA AATGACTAAA -CTAAGAGTA 540 TTCTAAGAAG CTAAAAATAC ATAA-AAGCT AGTCCATGCC GGAACTTCAA AGCATCAAGA 4828 ||||| ||| |||||||||| |||| ||||| |||||||||| |||| ||||| ||||||||| GTCTAA-AAG CTAAAAATAC ATAAGAAGCT AGTCCATGCC GGAAGTTCAA GGCATCAAGA 599 CATGAAGAGG AAGATCCAGT CCAAGCTAGA AGCATTAGCT CACCCTG-AT ATCCGGAGTA 4769 | |||||| | |||||||||| |||||||||| |||||||||| ||||||| || |||| ||| CTTGAAGAAG AAGATCCAGT CCAAGCTAGA AGCATTAGCT CACCCTGAAT TTCCGATGTA 659 ATGAAGACTG GCTAGAGTTA CTGTTGAGTC GAAGATGACG GCACGTTTGC TG 4717 | ||||||| ||| || ||| ||||||||| ||| | || | |||||||||| || GT-AAGACTG GCTTGAATTA CTGTTGAGTT GAACACGATG GCACGTTTGC TG 710 hqPGS_C06HBa0153O03.1-1-_SGN-E392027+ (5342 4717) ******************************************************************************** EST sequence 158 +strand 840 n (File: SGN-E542084+) 1 TTTTTTTTTT TAGGGGAAAA TTTCTTACTT CTATAAATGT CACGACCCAA ATCGGATCGC 61 GACTGGCACC CACACTTACC CTGCTATGTG AGCGAACCAA CCAATCCAAA CCTTAACATT 121 TCAATGTAAT ATCAACATAA AGTAATGCGG AAGACTTAAA CTTATTAATG AAAACCAATT 181 CAATAACTAT TATTTCCCAA AATCTGGAAG TCATCATCAT AAGAACATCT ACTTCAAATT 241 ACTAAATCTA AGAGTTTCTA AGAAGCTAAA AAATACATAA AAGCTAGTCC ATGCCGGAAC 301 TTCAAGACAT CAAGACATGA AGAGGAAGAT CCAGTCCAAT CTAGAAAGCA TTAGCTCACC 361 CTGATATCCG AAGTAATGAA GACTGGCTAG AGTTACTGTT GAGTCGAAGA TGACGGCACG 421 TTTGCTAAAA TCAGTGGACG GAGGAGAAGG GAAAGCACAC CGGGAATGAG AAGAAGCTGA 481 AGGAGGAACC AAAGAGGAAT CCCATTGCAA AGTAAATGAG AGTGTAAGCT AGCAGACGCG 541 ATGGAAGAGC TTACGCAGAA ATAACACTCT CATTTGGTGA TTTAGTTTGG AGATCATCTG 601 AGACCTTCGT GTTGGACAAC ATCATCCATG AAGATGTCAT TAGAAAAGTT AGATGCTTTA 661 TATACATGTT GATAGTTCCT GACTACTCTA TTTCTTTTTC AGAAAGCCCC GAAATTTCTC 721 AGATGATAAA TGCTGTCTGT TTTGGAAAAC CATCTCTATG CAAAGATGAT GTTTGCTGCA 781 TTGAGGTGTC AATATTGGGA ATTTCAAGAA AATTATGCCT TGTAGAATAT GTACAGCAAC Predicted gene structure (within gDNA segment 6205 to 1): Exon 1 5101 4718 ( 384 n); cDNA 38 426 ( 389 n); score: 0.911 Intron 1 4717 2045 (2673 n); Pd: 0.000 (s: 1.00), Pa: 0.177 (s: 0) Exon 2 2044 2038 ( 7 n); cDNA 427 433 ( 7 n); score: 0.714 PPA cDNA 11 1 MATCH C06HBa0153O03.1-1- SGN-E542084+ 0.911 391 0.465 C PGS_C06HBa0153O03.1-1-_SGN-E542084+ (5101 4718,2044 2038) Alignment (genomic DNA sequence = upper lines): TGTCACGACC CAAAACCGGA TCGCGACTGG CACCCACACT TACCCTCCTA TGTGAGCGAA 5042 |||||||||| | ||| |||| |||||||||| |||||||||| |||||| ||| |||||||||| TGTCACGACC C-AAATCGGA TCGCGACTGG CACCCACACT TACCCTGCTA TGTGAGCGAA 96 CCAACCAATC TAAACCTTAA TATTTCAATA GAATATCAAC AGAAAGTAAT GCGGAAGACT 4982 |||||||||| ||||||||| |||||||| ||||||||| | |||||||| |||||||||| CCAACCAATC CAAACCTTAA CATTTCAATG TAATATCAAC ATAAAGTAAT GCGGAAGACT 156 TAAACTCATT AAT-AAAATC AA-TAAAT-A TTATTA-TCC CCAAAATCTG GAAGTCATCA 4926 |||||| ||| ||| |||| | || | ||| | ||||| | | |||||||||| |||||||||| TAAACTTATT AATGAAAACC AATTCAATAA CTATTATTTC CCAAAATCTG GAAGTCATCA 216 TCACAAGAAC ATCTACTTCA AACTACTAAA TCTAAGAGTT TCTAAGAAGC T-AAAAATAC 4867 ||| |||||| |||||||||| || ||||||| |||||||||| |||||||||| | |||||||| TCATAAGAAC ATCTACTTCA AATTACTAAA TCTAAGAGTT TCTAAGAAGC TAAAAAATAC 276 ATAAAAGCTA GTCCATGCCG GAACTTCAAA GCATCAAGAC ATGAAGAGGA AGATCCAGTC 4807 |||||||||| |||||||||| ||||||||| ||||||||| |||||||||| |||||||||| ATAAAAGCTA GTCCATGCCG GAACTTCAAG ACATCAAGAC ATGAAGAGGA AGATCCAGTC 336 CAAGCTAG-A AGCATTAGCT CACCCTGATA TCCGGAGTAA TGAAGACTGG CTAGAGTTAC 4748 ||| |||| | |||||||||| |||||||||| |||| ||||| |||||||||| |||||||||| CAATCTAGAA AGCATTAGCT CACCCTGATA TCCGAAGTAA TGAAGACTGG CTAGAGTTAC 396 TGTTGAGTCG AAGATGACGG CACGTTTGCT GCACTCCACA AATAAACAAG AAGAAAACAT 4688 |||||||||| |||||||||| |||||||||| TGTTGAGTCG AAGATGACGG CACGTTTGCT .......... .......... .......... 426 AAAAGTAGGG GTCAGTACAA ACACGGGTAC TGAGTAGATA TCATCGGCCA ACTCAAAATA 4628 .......... .......... .......... .......... .......... .......... 426 GAGATCAATA TATACCAAGT AATATCATAA AATCAACTAT GATACTCAAC ATGTAGCAAC 4568 .......... .......... .......... .......... .......... .......... 426 ATCAAATACT ATATCATTAA CAATTACCGT CAAGTTCACA CACGAGGACT CAAGCCTCAA 4508 .......... .......... .......... .......... .......... .......... 426 TACCGTACTC ATTTGGGAAT TATGTTCATT GGATTGAGTA TATTATCATC TTTCAAGATT 4448 .......... .......... .......... .......... .......... .......... 426 CATTATCTTT ATTTCTCTTG TGTCGGTACG TGACACTCCG CTCCCTCATA TTCATTAATC 4388 .......... .......... .......... .......... .......... .......... 426 CTCTTGTGTC GGTACGTGAC ACTTCGATCC CCCACTACTA TGTGTCGGAA CGTGACACTT 4328 .......... .......... .......... .......... .......... .......... 426 CGATCCTCTA AATCTACGTG TCGGTTCGTG ACACTCGATC TCCTAAATCT AAGTGTCGGT 4268 .......... .......... .......... .......... .......... .......... 426 TCGTGACACC AGATCCCCTA AATCTACGTG TCAGTTCGTG ACACCCGATC CCCTAAATCT 4208 .......... .......... .......... .......... .......... .......... 426 ACGTGTCGGT TCGTGACACC CGATCCCTAA ATCTACGTGT CGGTTCGTGA CACCCTATCC 4148 .......... .......... .......... .......... .......... .......... 426 CCTAATCTCC TTCTATCAAT TCATCAAGCC TTCTTTCTTA CCAAGGCATC ATCAATCTCA 4088 .......... .......... .......... .......... .......... .......... 426 TTATTTTAGT TCATCACGCC TTCTTTTATA CCAAGGCCCC ATCATTAACA AAGAGATTAG 4028 .......... .......... .......... .......... .......... .......... 426 GGTTTTGCAA GATTTGGGAT TCAATAACTT CATCATGCTT ATATAACCAC AATTATAAAA 3968 .......... .......... .......... .......... .......... .......... 426 TTACATTCAT GCAAGCATAC AATTAAGCAC ATAGCAGGGT TTACAATATT ATCAATATAT 3908 .......... .......... .......... .......... .......... .......... 426 ATCATTCGCT ATTAAGAGTT TACTACGAAT ATCGTAAGAG AAACCATAAC CTACCTCCAC 3848 .......... .......... .......... .......... .......... .......... 426 CGAAGATTAG TGATCAAGCA AGAAATTTCC CCAAGCTTTG TTCTTCGTTT TCTCTCTTCC 3788 .......... .......... .......... .......... .......... .......... 426 TCGTTCGATC CTCTCTCTCT CTTTGTTCTT TCTACTTTTC TTATTCAAAC CCTCTTTCTT 3728 .......... .......... .......... .......... .......... .......... 426 TTACCCTAAT TAGCATATAA TTAAGAACAA AAGATGGCAA TAATAACTCA CTAATTAACT 3668 .......... .......... .......... .......... .......... .......... 426 TAAGGTTACC TCTTTTAACC CCCAAGTAAT TAGACTTATT AAAATTAACC CACTAACTTT 3608 .......... .......... .......... .......... .......... .......... 426 ATAATTAAAG CAGGAATAGT CCAAAACGCC CCTTAAAATA ATTACAGAAA TCTGACCCAG 3548 .......... .......... .......... .......... .......... .......... 426 CCTGGGATTA CGCAGCCTGT GACGGCCCGT CGCGCCTGCG ACGGTCCATT CTGCTGCTCC 3488 .......... .......... .......... .......... .......... .......... 426 GTCACAGAGT TCCGAGACTC AATTTCTCTG AAGAGTCTGT AACGGTTCGT CCTGCCATTC 3428 .......... .......... .......... .......... .......... .......... 426 CGTTACGAAG TTCAGAAAGT CGATTTCAGT ACCCAATTTT GAGAATTCTA AGTATTTTGG 3368 .......... .......... .......... .......... .......... .......... 426 AATGAGATAT CCTCGACGGT CCGTCGTGCC CATGACGGTC GGTCGTGAGT TCCGTCGTCT 3308 .......... .......... .......... .......... .......... .......... 426 TTGCCTGTTT TTCAAGAAAT AAAATCTGCT GCTCGAAACG ACTAAACAGG TCGTTACAAG 3248 .......... .......... .......... .......... .......... .......... 426 TATTGCCTAA TTCCTTTTAA GGGTTATTTA GGGGTAAAGA ACAAGTCTCA AATCATTTTT 3188 .......... .......... .......... .......... .......... .......... 426 ACAATAATTT AGAACGCTTA GGTCTTGGGA GAAAAGAGAA AATAAAAGGG ATCAAGAATT 3128 .......... .......... .......... .......... .......... .......... 426 GTCCAAGAAC TCCAACATTG TTGCGTAGAT TTTATCAAGG GTTATCCCTA TCGAGGTATG 3068 .......... .......... .......... .......... .......... .......... 426 TGAGATCATT CGGTGTTGGA TCCTTTCATC CACACTCCAA ATTTATTCAA TACAATAAAT 3008 .......... .......... .......... .......... .......... .......... 426 TAGAGTTTAT ATTTAATAAA AATCTTGATG TTCTTGATGA AAGTTATGAA GTTCTTGTTG 2948 .......... .......... .......... .......... .......... .......... 426 GTTGAATTAT AGTTGATCTC TAGTAGTTAT GATTCATGTT TTCGGGTCTA AATATTGGGT 2888 .......... .......... .......... .......... .......... .......... 426 AATGAGTTTT TTAGGGATCC TAAGTTTTTG AGGTGGGATC CATGGGAGTT TGGTAGGTTT 2828 .......... .......... .......... .......... .......... .......... 426 AGATATGAAA GAGGAGAAGA AAAGTCGGAG GTCCAGTCAG ACAACCCTTG GGGCGCCGCG 2768 .......... .......... .......... .......... .......... .......... 426 CCTCTTAGAG CGCCAATACC CTCGGAGACC CTTTTCTTTC CCTATATTTT CGTACTAGTT 2708 .......... .......... .......... .......... .......... .......... 426 CCTAAGTGAT GTACCTCTCA TTCCTAGTTG ACCAACACTC TAGAATAAAT ATAAACATCA 2648 .......... .......... .......... .......... .......... .......... 426 TGAAATCATC CATAAACATG AGATTATGAT CCTTGAATTC ATAATCCAAT TCAAGAGAAA 2588 .......... .......... .......... .......... .......... .......... 426 CTAAGATCAA AGTCAAGAAA GTAAGCAATG AAGGGAGTAA AAGTAAAGCT TTTATTTCAA 2528 .......... .......... .......... .......... .......... .......... 426 AGTTCTTAGA ATCTTACTTA AATGTTATAA TTTCGTTTTA AGGCTCATGT TCGAGTTAAG 2468 .......... .......... .......... .......... .......... .......... 426 GAAAAGAGTA AAGGTTGAAT TCTTTTCTTA AAATGGTATA TAAGAAAACT AAGTATTTCC 2408 .......... .......... .......... .......... .......... .......... 426 TAAGAGTTAA AGTTTAAAGT TAAGTAAAGA GTAAGAGTTG AGTTCATTTC TCTAAAGATA 2348 .......... .......... .......... .......... .......... .......... 426 TAAGGGGGAC TAAGTACTCC CTAAAGGTTG ATAAATGTTT CACATTTAAG TTAAAAGAAA 2288 .......... .......... .......... .......... .......... .......... 426 ACTAAGAAGT TTCAAAAGAG TTTTGAACTA AAAGGGGAAC ATTGATTCCA AAAGGAGATT 2228 .......... .......... .......... .......... .......... .......... 426 TGTAAAGCTA AAGGGTTCAG TAAATTATCT CAACCCAAAG GAAGGAAGTT TTGTTAAAAG 2168 .......... .......... .......... .......... .......... .......... 426 TATGAGCTAA AGTATGTTTT GGGAGTAGTA TTGAGCACCG ATGTAGGAAT GAGAGTTCAG 2108 .......... .......... .......... .......... .......... .......... 426 ATAACTCAAG TCCCCATGTA AATCATGTAG CTATCATGGG TGTTAACATG TCATACTTTT 2048 .......... .......... .......... .......... .......... .......... 426 TAGATGATCA 2038 | |||| ...AAAATCA 433 hqPGS_C06HBa0153O03.1-1-_SGN-E542084+ (5101 4718) ******************************************************************************** EST sequence 37 -strand 405 n (File: SGN-E336814-) 1 TTTTTTTTTT TAATAAAATC AATAACTATT ATTATCCCCA AAATCTGGAA GTCATCACCA 61 CAAGAACATC TATGATCAAA GTACTAAACT AAGAGTGTTC TAAAAAGCTA AAATACAAGA 121 AAGCTAGTCC ATGCCGGAAC TTCAAGACAT CAAGACATGA AGAGGAAGAT CCAGTCCAAG 181 CTAGAAACAT TAGCTCACCT TAATATCCGG AATAATGAAG ACTGGCTAGA GTTACTGTTG 241 AGTCGAAGAT GACGGCACGT TTGCTCTCTT GTGTGATTTG CAAGCTATTG CAAAGGAGTA 301 GTTGCGGGTT TCCTTGTCAT AAAAAATAAT GCAAGGGAAA GCAGAAGATA ATGAAAATTA 361 GCAATATGTC AATCCATGAT CATGAAGTGC TTTGTTTGCT TTGGG Predicted gene structure (within gDNA segment 5673 to 2718): Exon 1 4982 4718 ( 265 n); cDNA 1 265 ( 265 n); score: 0.898 Intron 1 4717 4434 ( 284 n); Pd: 0.000 (s: 1.00), Pa: 0.000 (s: 0) Exon 2 4433 4425 ( 9 n); cDNA 266 274 ( 9 n); score: 1.000 MATCH C06HBa0153O03.1-1- SGN-E336814- 0.898 274 0.677 C PGS_C06HBa0153O03.1-1-_SGN-E336814- (4982 4718,4433 4425) Alignment (genomic DNA sequence = upper lines): TTAAACTCAT TAATAAAATC AATAAATATT ATTATCCCCA AAATCTGGAA GTCATCATCA 4923 || | | |||||||||| ||||| |||| |||||||||| |||||||||| ||||||| || TTTTTTTTTT TAATAAAATC AATAACTATT ATTATCCCCA AAATCTGGAA GTCATCACCA 60 CAAGAACATC TA-CTTCAAA CTACTAAATC TAAGAGT-TT CTAAGAAGCT AAAAATACAT 4865 |||||||||| || ||||| ||||||| | ||||||| || |||| ||||| |||||||| CAAGAACATC TATGATCAAA GTACTAAA-C TAAGAGTGTT CTAAAAAGCT -AAAATACAA 118 AAAAGCTAGT CCATGCCGGA ACTTCAAAGC ATCAAGACAT GAAGAGGAAG ATCCAGTCCA 4805 ||||||||| |||||||||| ||||||| | |||||||||| |||||||||| |||||||||| GAAAGCTAGT CCATGCCGGA ACTTCAAGAC ATCAAGACAT GAAGAGGAAG ATCCAGTCCA 178 AGCTAGAAGC ATTAGCTCAC CCTGATATCC GGAGTAATGA AGACTGGCTA GAGTTACTGT 4745 |||||||| | |||||||||| | | |||||| ||| |||||| |||||||||| |||||||||| AGCTAGAAAC ATTAGCTCAC CTTAATATCC GGAATAATGA AGACTGGCTA GAGTTACTGT 238 TGAGTCGAAG ATGACGGCAC GTTTGCTGCA CTCCACAAAT AAACAAGAAG AAAACATAAA 4685 |||||||||| |||||||||| ||||||| TGAGTCGAAG ATGACGGCAC GTTTGCT... .......... .......... .......... 265 AGTAGGGGTC AGTACAAACA CGGGTACTGA GTAGATATCA TCGGCCAACT CAAAATAGAG 4625 .......... .......... .......... .......... .......... .......... 265 ATCAATATAT ACCAAGTAAT ATCATAAAAT CAACTATGAT ACTCAACATG TAGCAACATC 4565 .......... .......... .......... .......... .......... .......... 265 AAATACTATA TCATTAACAA TTACCGTCAA GTTCACACAC GAGGACTCAA GCCTCAATAC 4505 .......... .......... .......... .......... .......... .......... 265 CGTACTCATT TGGGAATTAT GTTCATTGGA TTGAGTATAT TATCATCTTT CAAGATTCAT 4445 .......... .......... .......... .......... .......... .......... 265 TATCTTTATT TCTCTTGTGT 4425 ||||||||| .......... .CTCTTGTGT 274 hqPGS_C06HBa0153O03.1-1-_SGN-E336814- (4982 4718) ******************************************************************************** EST sequence 32 -strand 299 n (File: SGN-E373117-) 1 TTTTTTTTTT TCTAAAAACT CAACAACTAT TATTATCCCC AAAATCTGGA AGTCATCATC 61 ACAAGAACAT CTACTTCAAA TTACTAAATC TAAGAGTATC TAAGAAGCTA AAATACATAA 121 ACAGCTAGTC CATGCCGGAA CTTCAAGGCA TCAAGACATG AAGAAGAAGA TCCAGTCCAA 181 GCTAGAAGCG TTAGCTCACC CTGAAATCCG ATGTAATGAA GACTGGCTAG AGTTGCGGTT 241 GAGTTGAAGA CGACGGTACG TTTGCCAAAA TTACGACAGT ATTTGGACAA GCTAGAAGA Predicted gene structure (within gDNA segment 5836 to 3095): Exon 1 4982 4729 ( 254 n); cDNA 2 255 ( 254 n); score: 0.892 MATCH C06HBa0153O03.1-1- SGN-E373117- 0.892 254 0.849 C PGS_C06HBa0153O03.1-1-_SGN-E373117- (4982 4729) Alignment (genomic DNA sequence = upper lines): TTAAACTCAT TAATAAAATC AATAAATATT ATTATCCCCA AAATCTGGAA GTCATCATCA 4923 || | | | ||| || || || |||| |||||||||| |||||||||| |||||||||| TTTTTTTTTT CTAAAAACTC AACAACTATT ATTATCCCCA AAATCTGGAA GTCATCATCA 61 CAAGAACATC TACTTCAAAC TACTAAATCT AAGAGTTTCT AAGAAGCTAA AAATACATAA 4863 |||||||||| ||||||||| |||||||||| |||||| ||| |||||||| | |||||||||| CAAGAACATC TACTTCAAAT TACTAAATCT AAGAGTATCT AAGAAGCT-A AAATACATAA 120 A-AGCTAGTC CATGCCGGAA CTTCAAAGCA TCAAGACATG AAGAGGAAGA TCCAGTCCAA 4804 | |||||||| |||||||||| |||||| ||| |||||||||| |||| ||||| |||||||||| ACAGCTAGTC CATGCCGGAA CTTCAAGGCA TCAAGACATG AAGAAGAAGA TCCAGTCCAA 180 GCTAGAAGCA TTAGCTCACC CTGATATCCG GAGTAATGAA GACTGGCTAG AGTTACTGTT 4744 ||||||||| |||||||||| |||| ||||| |||||||| |||||||||| |||| | ||| GCTAGAAGCG TTAGCTCACC CTGAAATCCG ATGTAATGAA GACTGGCTAG AGTTGCGGTT 240 GAGTCGAAGA TGACG 4729 |||| ||||| |||| GAGTTGAAGA CGACG 255 hqPGS_C06HBa0153O03.1-1-_SGN-E373117- (4982 4729) ******************************************************************************** EST sequence 136 +strand 299 n (File: SGN-E373116+) 1 TTTTTTTTTT CTAAAAACTC AACAACTATT ATTATCCCCA AAATCTGGAA GTCATCATCA 61 CAAGAACATC TACTTCAAAT TACTAAATCT AAGAGTATCT AAGAAGCTAA AATACATAAA 121 CAGCTAGTCC ATGCCGGAAC TTCAAGGCAT CAAGACATGA AGAAGAAGAT CCAGTCCAAG 181 CTAGAAGCGT TAGCTCACCC TGAAATCCGA TGTAATGAAG ACTGGCTAGA GTTGCGGTTG 241 AGTTGAAGAC GACGGTACGT TTGCCAAAAT TACGACAGTA TTTGGACAAG CTAGAAGAG Predicted gene structure (within gDNA segment 5816 to 3075): Exon 1 4982 4729 ( 254 n); cDNA 1 254 ( 254 n); score: 0.892 MATCH C06HBa0153O03.1-1- SGN-E373116+ 0.892 254 0.849 C PGS_C06HBa0153O03.1-1-_SGN-E373116+ (4982 4729) Alignment (genomic DNA sequence = upper lines): TTAAACTCAT TAATAAAATC AATAAATATT ATTATCCCCA AAATCTGGAA GTCATCATCA 4923 || | | | ||| || || || |||| |||||||||| |||||||||| |||||||||| TTTTTTTTTT CTAAAAACTC AACAACTATT ATTATCCCCA AAATCTGGAA GTCATCATCA 60 CAAGAACATC TACTTCAAAC TACTAAATCT AAGAGTTTCT AAGAAGCTAA AAATACATAA 4863 |||||||||| ||||||||| |||||||||| |||||| ||| |||||||| | |||||||||| CAAGAACATC TACTTCAAAT TACTAAATCT AAGAGTATCT AAGAAGCT-A AAATACATAA 119 A-AGCTAGTC CATGCCGGAA CTTCAAAGCA TCAAGACATG AAGAGGAAGA TCCAGTCCAA 4804 | |||||||| |||||||||| |||||| ||| |||||||||| |||| ||||| |||||||||| ACAGCTAGTC CATGCCGGAA CTTCAAGGCA TCAAGACATG AAGAAGAAGA TCCAGTCCAA 179 GCTAGAAGCA TTAGCTCACC CTGATATCCG GAGTAATGAA GACTGGCTAG AGTTACTGTT 4744 ||||||||| |||||||||| |||| ||||| |||||||| |||||||||| |||| | ||| GCTAGAAGCG TTAGCTCACC CTGAAATCCG ATGTAATGAA GACTGGCTAG AGTTGCGGTT 239 GAGTCGAAGA TGACG 4729 |||| ||||| |||| GAGTTGAAGA CGACG 254 hqPGS_C06HBa0153O03.1-1-_SGN-E373116+ (4982 4729) ******************************************************************************** EST sequence 109 +strand 679 n (File: SGN-E370357+) 1 TTTTTTTTTT CTTACAATTA TATTATGAAT TCGATAATCT TTAATGTCAC GACCCAAATC 61 GAGCCGCAAG TGGCACCCAC ACTTACCCTC CTATGTGAGC GAACCAACCA ATACAAAATC 121 CAACATTTCA ATATAATGAC GGAATATAAT GCGGAAGACT TAAACTCATT AATGAAAATC 181 AATTAAATAA CTTCTAAAAA CTCAACAACT ATTATTATCC CCAAAATCTG GAAGTCATCA 241 TCACAAGAAC ATCTATCCTC AAATTACTAA TTCTAAGAGT ATCTAGAAAG CTAGAATAAC 301 TAAAAAGCTA GTCCATGCCG GAACTTCAAG GCATCAAGAC ATGAAGAAGA AGATCCAGTC 361 CAAGCTAGAA GCGTTAGCTC ACACTGAAAT CCGGTATAAT GAAGACTGGC TAGAGTTGCG 421 GTTGAGTTGA AGACGACGGT ACGTTTGCTT TATTCGAGTG TCAATTAATC ATTCGGCTGT 481 CACCCAAATA TTATTGATTG ATTACACCTC TGCCATTTGT AAAATTTTTC AAATTTGCCT 541 ACGGATGCAG AATTTTCCTC GAATTTCTGA TGTGTTTTCT TGTAAATAGT GGCCATTTGT 601 GTAAGTAAAT GCCCATTTCT CCTCCTACAA AGTCCAATTC CATTTTTCCC CCAATCCACC 661 ATGGCAACAC CACCTCCAA Predicted gene structure (within gDNA segment 6974 to 515): Exon 1 4968 4729 ( 240 n); cDNA 198 438 ( 241 n); score: 0.885 Intron 1 4728 2012 (2717 n); Pd: 0.993 (s: 0.88), Pa: 0.099 (s: 0) Exon 2 2011 2005 ( 7 n); cDNA 439 445 ( 7 n); score: 0.429 Intron 2 2004 1350 ( 655 n); Pd: 0.900 (s: 0), Pa: 0.779 (s: 0) Exon 3 1349 1325 ( 25 n); cDNA 446 468 ( 23 n); score: 0.640 PPA cDNA 13 1 MATCH C06HBa0153O03.1-1- SGN-E370357+ 0.885 272 0.401 C PGS_C06HBa0153O03.1-1-_SGN-E370357+ (4968 4729,2011 2005,1349 1325) Alignment (genomic DNA sequence = upper lines): AAAATCAATA AATATTATTA TCCCCAAAAT CTGGAAGTCA TCATCACAAG AACATCTA-C 4910 ||| |||| | | |||||||| |||||||||| |||||||||| |||||||||| |||||||| | AAACTCAACA ACTATTATTA TCCCCAAAAT CTGGAAGTCA TCATCACAAG AACATCTATC 257 TTCAAACTAC TAAATCTAAG AGTTTCTAAG AAGCTAAAAA TACATAAAAG CTAGTCCATG 4850 ||||| ||| ||| |||||| ||| |||| |||||| || || ||||| |||||||||| CTCAAATTAC TAATTCTAAG AGTATCTAGA AAGCTAGAAT AACTAAAAAG CTAGTCCATG 317 CCGGAACTTC AAAGCATCAA GACATGAAGA GGAAGATCCA GTCCAAGCTA GAAGCATTAG 4790 |||||||||| || ||||||| |||||||||| ||||||||| |||||||||| ||||| |||| CCGGAACTTC AAGGCATCAA GACATGAAGA AGAAGATCCA GTCCAAGCTA GAAGCGTTAG 377 CTCACCCTGA TATCCGGAGT AATGAAGACT GGCTAGAGTT ACTGTTGAGT CGAAGATGAC 4730 ||||| |||| |||||| | |||||||||| |||||||||| | ||||||| ||||| ||| CTCACACTGA AATCCGGTAT AATGAAGACT GGCTAGAGTT GCGGTTGAGT TGAAGACGAC 437 GGCACGTTTG CTGCACTCCA CAAATAAACA AGAAGAAAAC ATAAAAGTAG GGGTCAGTAC 4670 | G......... .......... .......... .......... .......... .......... 438 AAACACGGGT ACTGAGTAGA TATCATCGGC CAACTCAAAA TAGAGATCAA TATATACCAA 4610 .......... .......... .......... .......... .......... .......... 438 GTAATATCAT AAAATCAACT ATGATACTCA ACATGTAGCA ACATCAAATA CTATATCATT 4550 .......... .......... .......... .......... .......... .......... 438 AACAATTACC GTCAAGTTCA CACACGAGGA CTCAAGCCTC AATACCGTAC TCATTTGGGA 4490 .......... .......... .......... .......... .......... .......... 438 ATTATGTTCA TTGGATTGAG TATATTATCA TCTTTCAAGA TTCATTATCT TTATTTCTCT 4430 .......... .......... .......... .......... .......... .......... 438 TGTGTCGGTA CGTGACACTC CGCTCCCTCA TATTCATTAA TCCTCTTGTG TCGGTACGTG 4370 .......... .......... .......... .......... .......... .......... 438 ACACTTCGAT CCCCCACTAC TATGTGTCGG AACGTGACAC TTCGATCCTC TAAATCTACG 4310 .......... .......... .......... .......... .......... .......... 438 TGTCGGTTCG TGACACTCGA TCTCCTAAAT CTAAGTGTCG GTTCGTGACA CCAGATCCCC 4250 .......... .......... .......... .......... .......... .......... 438 TAAATCTACG TGTCAGTTCG TGACACCCGA TCCCCTAAAT CTACGTGTCG GTTCGTGACA 4190 .......... .......... .......... .......... .......... .......... 438 CCCGATCCCT AAATCTACGT GTCGGTTCGT GACACCCTAT CCCCTAATCT CCTTCTATCA 4130 .......... .......... .......... .......... .......... .......... 438 ATTCATCAAG CCTTCTTTCT TACCAAGGCA TCATCAATCT CATTATTTTA GTTCATCACG 4070 .......... .......... .......... .......... .......... .......... 438 CCTTCTTTTA TACCAAGGCC CCATCATTAA CAAAGAGATT AGGGTTTTGC AAGATTTGGG 4010 .......... .......... .......... .......... .......... .......... 438 ATTCAATAAC TTCATCATGC TTATATAACC ACAATTATAA AATTACATTC ATGCAAGCAT 3950 .......... .......... .......... .......... .......... .......... 438 ACAATTAAGC ACATAGCAGG GTTTACAATA TTATCAATAT ATATCATTCG CTATTAAGAG 3890 .......... .......... .......... .......... .......... .......... 438 TTTACTACGA ATATCGTAAG AGAAACCATA ACCTACCTCC ACCGAAGATT AGTGATCAAG 3830 .......... .......... .......... .......... .......... .......... 438 CAAGAAATTT CCCCAAGCTT TGTTCTTCGT TTTCTCTCTT CCTCGTTCGA TCCTCTCTCT 3770 .......... .......... .......... .......... .......... .......... 438 CTCTTTGTTC TTTCTACTTT TCTTATTCAA ACCCTCTTTC TTTTACCCTA ATTAGCATAT 3710 .......... .......... .......... .......... .......... .......... 438 AATTAAGAAC AAAAGATGGC AATAATAACT CACTAATTAA CTTAAGGTTA CCTCTTTTAA 3650 .......... .......... .......... .......... .......... .......... 438 CCCCCAAGTA ATTAGACTTA TTAAAATTAA CCCACTAACT TTATAATTAA AGCAGGAATA 3590 .......... .......... .......... .......... .......... .......... 438 GTCCAAAACG CCCCTTAAAA TAATTACAGA AATCTGACCC AGCCTGGGAT TACGCAGCCT 3530 .......... .......... .......... .......... .......... .......... 438 GTGACGGCCC GTCGCGCCTG CGACGGTCCA TTCTGCTGCT CCGTCACAGA GTTCCGAGAC 3470 .......... .......... .......... .......... .......... .......... 438 TCAATTTCTC TGAAGAGTCT GTAACGGTTC GTCCTGCCAT TCCGTTACGA AGTTCAGAAA 3410 .......... .......... .......... .......... .......... .......... 438 GTCGATTTCA GTACCCAATT TTGAGAATTC TAAGTATTTT GGAATGAGAT ATCCTCGACG 3350 .......... .......... .......... .......... .......... .......... 438 GTCCGTCGTG CCCATGACGG TCGGTCGTGA GTTCCGTCGT CTTTGCCTGT TTTTCAAGAA 3290 .......... .......... .......... .......... .......... .......... 438 ATAAAATCTG CTGCTCGAAA CGACTAAACA GGTCGTTACA AGTATTGCCT AATTCCTTTT 3230 .......... .......... .......... .......... .......... .......... 438 AAGGGTTATT TAGGGGTAAA GAACAAGTCT CAAATCATTT TTACAATAAT TTAGAACGCT 3170 .......... .......... .......... .......... .......... .......... 438 TAGGTCTTGG GAGAAAAGAG AAAATAAAAG GGATCAAGAA TTGTCCAAGA ACTCCAACAT 3110 .......... .......... .......... .......... .......... .......... 438 TGTTGCGTAG ATTTTATCAA GGGTTATCCC TATCGAGGTA TGTGAGATCA TTCGGTGTTG 3050 .......... .......... .......... .......... .......... .......... 438 GATCCTTTCA TCCACACTCC AAATTTATTC AATACAATAA ATTAGAGTTT ATATTTAATA 2990 .......... .......... .......... .......... .......... .......... 438 AAAATCTTGA TGTTCTTGAT GAAAGTTATG AAGTTCTTGT TGGTTGAATT ATAGTTGATC 2930 .......... .......... .......... .......... .......... .......... 438 TCTAGTAGTT ATGATTCATG TTTTCGGGTC TAAATATTGG GTAATGAGTT TTTTAGGGAT 2870 .......... .......... .......... .......... .......... .......... 438 CCTAAGTTTT TGAGGTGGGA TCCATGGGAG TTTGGTAGGT TTAGATATGA AAGAGGAGAA 2810 .......... .......... .......... .......... .......... .......... 438 GAAAAGTCGG AGGTCCAGTC AGACAACCCT TGGGGCGCCG CGCCTCTTAG AGCGCCAATA 2750 .......... .......... .......... .......... .......... .......... 438 CCCTCGGAGA CCCTTTTCTT TCCCTATATT TTCGTACTAG TTCCTAAGTG ATGTACCTCT 2690 .......... .......... .......... .......... .......... .......... 438 CATTCCTAGT TGACCAACAC TCTAGAATAA ATATAAACAT CATGAAATCA TCCATAAACA 2630 .......... .......... .......... .......... .......... .......... 438 TGAGATTATG ATCCTTGAAT TCATAATCCA ATTCAAGAGA AACTAAGATC AAAGTCAAGA 2570 .......... .......... .......... .......... .......... .......... 438 AAGTAAGCAA TGAAGGGAGT AAAAGTAAAG CTTTTATTTC AAAGTTCTTA GAATCTTACT 2510 .......... .......... .......... .......... .......... .......... 438 TAAATGTTAT AATTTCGTTT TAAGGCTCAT GTTCGAGTTA AGGAAAAGAG TAAAGGTTGA 2450 .......... .......... .......... .......... .......... .......... 438 ATTCTTTTCT TAAAATGGTA TATAAGAAAA CTAAGTATTT CCTAAGAGTT AAAGTTTAAA 2390 .......... .......... .......... .......... .......... .......... 438 GTTAAGTAAA GAGTAAGAGT TGAGTTCATT TCTCTAAAGA TATAAGGGGG ACTAAGTACT 2330 .......... .......... .......... .......... .......... .......... 438 CCCTAAAGGT TGATAAATGT TTCACATTTA AGTTAAAAGA AAACTAAGAA GTTTCAAAAG 2270 .......... .......... .......... .......... .......... .......... 438 AGTTTTGAAC TAAAAGGGGA ACATTGATTC CAAAAGGAGA TTTGTAAAGC TAAAGGGTTC 2210 .......... .......... .......... .......... .......... .......... 438 AGTAAATTAT CTCAACCCAA AGGAAGGAAG TTTTGTTAAA AGTATGAGCT AAAGTATGTT 2150 .......... .......... .......... .......... .......... .......... 438 TTGGGAGTAG TATTGAGCAC CGATGTAGGA ATGAGAGTTC AGATAACTCA AGTCCCCATG 2090 .......... .......... .......... .......... .......... .......... 438 TAAATCATGT AGCTATCATG GGTGTTAACA TGTCATACTT TTTAGATGAT CACGTAAGTT 2030 .......... .......... .......... .......... .......... .......... 438 TAGCCAGTGG ATCACTAGGT TGATGATATC CTATGCGACG ACAAATTATA GGACAGTTTT 1970 || | .......... ........GT ACGTT..... .......... .......... .......... 445 GGCAGCGTGT ACACGACACT GTATTATCAC TTAGGCTCAT AGTGATGGCT GTCAGTTAGA 1910 .......... .......... .......... .......... .......... .......... 445 GAAACTCCAG CAGAAGCTAT ATTACTTTCA TATATAAGTA AAGTTGAGTT TATTACATGT 1850 .......... .......... .......... .......... .......... .......... 445 GTCCTTATTG CTTTATATTG AGTTGTTATC TTATGAGTTG AGTAGAGCCA AGATAAGTTC 1790 .......... .......... .......... .......... .......... .......... 445 ACCCTTACTC CATTTCAAGC GTTATAGTTG TGCTTAGCAT TCCAACTCGT ATACTTGTAC 1730 .......... .......... .......... .......... .......... .......... 445 ATTCAATGTA CTGAAGACAG TTGGCCTGCA TCATCTTGAG ATGCAGACAC AGGTAACCAG 1670 .......... .......... .......... .......... .......... .......... 445 GATCAGCACG CAGCACACCG TTGATCCATT TGAACATTCT GTAGTCATTT GGTGAGCCTC 1610 .......... .......... .......... .......... .......... .......... 445 TTTGCATTCC GGAGGACATC CCTTTATTTA CTTTCCTAGT TTAGTTATTA GGATGTTGTG 1550 .......... .......... .......... .......... .......... .......... 445 GGGTCTGTTC CAACATCCAT CTTAGTCAGT TTAGAGGCTT AATAGACAAT GTAGCAGTTC 1490 .......... .......... .......... .......... .......... .......... 445 AGTTTTGGAG TCTCCTTTAT CTTATACTTC GTATCACTAC AACAAAAATG TCCATTTGCG 1430 .......... .......... .......... .......... .......... .......... 445 ACATTTAATT CTTAATTGCC GCTAAGTATG TATTTTTAGA GGCAATTGTC ACTATTTGTA 1370 .......... .......... .......... .......... .......... .......... 445 TATGTCCCTA TTGCCTTTAG AGACATTGGT TCTAATGACA CTTAA 1325 | | || | || | || || |||| .......... .......... TG-C-TTTAT TCGAGTGTCA ATTAA 468 hqPGS_C06HBa0153O03.1-1-_SGN-E370357+ (4968 4729) ******************************************************************************** EST sequence 55 -strand 219 n (File: SGN-E298638-) 1 TTTTTTTTTT TCTAAAAACT CAACAACTAT TATTATCCCC AAAATCTGGC AGTCATCATC 61 ACAAGAACAT GTACTTCAAA TTACTAAATC TAAGAGTATC TAAGAAGCTA AAATACATAA 121 ACAGCTAGTC CATGCCGGAA CTTCAAGGCA TCAAGACATG AAGAAGAAGA TCCAGTCCAA 181 GCTAGAAGCG TTAGCTCACC CTGAAATCCG ATGTAATGA Predicted gene structure (within gDNA segment 5836 to 3895): Exon 1 4982 4765 ( 218 n); cDNA 2 219 ( 218 n); score: 0.883 MATCH C06HBa0153O03.1-1- SGN-E298638- 0.883 218 0.995 C PGS_C06HBa0153O03.1-1-_SGN-E298638- (4982 4765) Alignment (genomic DNA sequence = upper lines): TTAAACTCAT TAATAAAATC AATAAATATT ATTATCCCCA AAATCTGGAA GTCATCATCA 4923 || | | | ||| || || || |||| |||||||||| |||||||| | |||||||||| TTTTTTTTTT CTAAAAACTC AACAACTATT ATTATCCCCA AAATCTGGCA GTCATCATCA 61 CAAGAACATC TACTTCAAAC TACTAAATCT AAGAGTTTCT AAGAAGCTAA AAATACATAA 4863 ||||||||| ||||||||| |||||||||| |||||| ||| |||||||| | |||||||||| CAAGAACATG TACTTCAAAT TACTAAATCT AAGAGTATCT AAGAAGCT-A AAATACATAA 120 A-AGCTAGTC CATGCCGGAA CTTCAAAGCA TCAAGACATG AAGAGGAAGA TCCAGTCCAA 4804 | |||||||| |||||||||| |||||| ||| |||||||||| |||| ||||| |||||||||| ACAGCTAGTC CATGCCGGAA CTTCAAGGCA TCAAGACATG AAGAAGAAGA TCCAGTCCAA 180 GCTAGAAGCA TTAGCTCACC CTGATATCCG GAGTAATGA 4765 ||||||||| |||||||||| |||| ||||| ||||||| GCTAGAAGCG TTAGCTCACC CTGAAATCCG ATGTAATGA 219 hqPGS_C06HBa0153O03.1-1-_SGN-E298638- (4982 4765) ******************************************************************************** EST sequence 39 -strand 402 n (File: SGN-E352844-) 1 TTTTTTTTAT AAAAACCAAT TCAATAACTA TTATTTCCCA AAATCTGGAA GTTATCATCA 61 CAAGAACATC TACTTCGAAT TACTAAATCT AAGAGTATCT AAGAAGCTAA AATACATAAA 121 CAGCTAGTCC ATGCCGGAAC TTCAAGGCAT CAAGACATGA AGAAGAAGAT CCAGTCCAAG 181 CTAGAAGCTT TGTTTTATCG AAAAAAGGTG ATTTTTCGAA AAGAGTTTGT TTTATTTTAA 241 AGTATTTTTC GACTTTAGGA GTCGCCACTT AATTTTTAAG AAAAATCAAG AAAACTCATT 301 CTCAAAACAA TTTAAACAGA AAAGTCGTTT TGAAAATATT TTTTAGGATT CGGGATTCTT 361 ATTAGCGTCT TAGGAAGGTG TTTAAGGCAC CTAAGACACT CC Predicted gene structure (within gDNA segment 5916 to 2055): Exon 1 4964 4795 ( 170 n); cDNA 21 188 ( 168 n); score: 0.926 MATCH C06HBa0153O03.1-1- SGN-E352844- 0.926 170 0.423 C PGS_C06HBa0153O03.1-1-_SGN-E352844- (4964 4795) Alignment (genomic DNA sequence = upper lines): TCAATAAATA TTATTATCCC CAAAATCTGG AAGTCATCAT CACAAGAACA TCTACTTCAA 4905 ||||||| || ||||| | || |||||||||| |||| ||||| |||||||||| |||||||| | TCAATAACTA TTATT-T-CC CAAAATCTGG AAGTTATCAT CACAAGAACA TCTACTTCGA 78 ACTACTAAAT CTAAGAGTTT CTAAGAAGCT AAAAATACAT AAA-AGCTAG TCCATGCCGG 4846 | |||||||| |||||||| | |||||||||| | |||||||| ||| |||||| |||||||||| ATTACTAAAT CTAAGAGTAT CTAAGAAGCT A-AAATACAT AAACAGCTAG TCCATGCCGG 137 AACTTCAAAG CATCAAGACA TGAAGAGGAA GATCCAGTCC AAGCTAGAAG C 4795 |||||||| | |||||||||| |||||| ||| |||||||||| |||||||||| | AACTTCAAGG CATCAAGACA TGAAGAAGAA GATCCAGTCC AAGCTAGAAG C 188 hqPGS_C06HBa0153O03.1-1-_SGN-E352844- (4964 4795) ******************************************************************************** EST sequence 71 -strand 620 n (File: SGN-E238551-) 1 CTATTATTTC CCAAAATCTG GAAGTTATCA TCACAAGAAC ATCTACTTCG AATTACTAAA 61 TCTAAGAGTA TCTAAGAAGC TAAAATACAT AAACAGCTAG TCCATGCCGG AACTTCAAGG 121 CATCAAGACA TGAAGAAGAA GATCCAGTCC AAGCTAGAAG CTTTGTTTTA TCGAAAAAAG 181 GTGATTTTTC GAAAAGAGTT TGTTTTATTT TAAAGTATTT TTCGACTTTA GGAGTCGCCA 241 CTTAATTTTT AAGAAAAATC AAGAAAACTC ATTCTCAAAA CAATTTAAAC AGAAAAGTCG 301 TTTTGAAAAT ATTTTTTAGG ATTCGGGATT CTTATTAGCG TCTTAGGAAG GTGTTTAAGG 361 CACCTAAGAC ACTCCGTTAA ATACGGTTTT CCAACGACTA ACTTATTTGA TTATTTTTAT 421 TTTTACCCTT TGCAAATTTA TTTGAACTTT TATCACGATT TACTTAGCCA AACTTTGCAA 481 ATTTGAGATA TTAATCTTTT AAGATTCCGT CTTAGTTAAA CTTTCTAAGC CTTAACTCTC 541 TAAGCAGACT TTCAAATTTT AAACCTCTAT CGTTTCAAAA CTTCAATTTT TATTTTTTAG 601 TTTCATAAAG CAAAAGGCGT Predicted gene structure (within gDNA segment 5646 to 1): Exon 1 4956 4795 ( 162 n); cDNA 2 161 ( 160 n); score: 0.929 MATCH C06HBa0153O03.1-1- SGN-E238551- 0.929 162 0.261 C PGS_C06HBa0153O03.1-1-_SGN-E238551- (4956 4795) Alignment (genomic DNA sequence = upper lines): TATTATTATC CCCAAAATCT GGAAGTCATC ATCACAAGAA CATCTACTTC AAACTACTAA 4897 ||||||| | |||||||||| |||||| ||| |||||||||| |||||||||| || |||||| TATTATT-T- CCCAAAATCT GGAAGTTATC ATCACAAGAA CATCTACTTC GAATTACTAA 59 ATCTAAGAGT TTCTAAGAAG CTAAAAATAC ATAAA-AGCT AGTCCATGCC GGAACTTCAA 4838 |||||||||| ||||||||| || ||||||| ||||| |||| |||||||||| |||||||||| ATCTAAGAGT ATCTAAGAAG CT-AAAATAC ATAAACAGCT AGTCCATGCC GGAACTTCAA 118 AGCATCAAGA CATGAAGAGG AAGATCCAGT CCAAGCTAGA AGC 4795 ||||||||| |||||||| | |||||||||| |||||||||| ||| GGCATCAAGA CATGAAGAAG AAGATCCAGT CCAAGCTAGA AGC 161 hqPGS_C06HBa0153O03.1-1-_SGN-E238551- (4956 4795) ******************************************************************************** EST sequence 70 -strand 286 n (File: SGN-E355114-) 1 CCACAGCCCC AGTGGCTGGC TCAGTTGTTT CTTGTCTGGC CGGTGTTGGT GTTGACGTGG 61 TCGTTGCTCT AGTTCTAACC ATCTGCAAAA GAGAGTGAAG ATGGTCAGAT ACCAATTTGT 121 ATCGCCTAGA TACCAATTGG ACTCAAGTAG TAGCACGAAA GAAAGAATGA AAGGGTGAAA 181 TTTTCCTAAA GTCTTATAGC CTCTCAAAGA AAAGTAAAGG CGTCCCCCTA CCGTTCCTAA 241 AGACTCTACT AGACCTGTTC TTGTGTGATG AGACCAACGA ACCTAA Predicted gene structure (within gDNA segment 6572 to 4519): Exon 1 5402 5119 ( 284 n); cDNA 3 286 ( 284 n); score: 0.905 MATCH C06HBa0153O03.1-1- SGN-E355114- 0.905 284 0.993 C PGS_C06HBa0153O03.1-1-_SGN-E355114- (5402 5119) Alignment (genomic DNA sequence = upper lines): ACAGCCCCAA TGGCTGGCTC GGACGCTTCT TGTCTTGCCG ATGTTGGTAT TGGTGCAGTT 5343 ||||||||| |||||||||| | | |||| ||||| |||| ||||||| | || | || ACAGCCCCAG TGGCTGGCTC AGTTGTTTCT TGTCTGGCCG GTGTTGGTGT TGACGTGGTC 62 GTTGCTCTAG TTCTAACCAT CTGCGAAACA GAGTGAAGAT GGTCAGATAC CAATTTGTAT 5283 |||||||||| |||||||||| |||| ||| | |||||||||| |||||||||| |||||||||| GTTGCTCTAG TTCTAACCAT CTGCAAAAGA GAGTGAAGAT GGTCAGATAC CAATTTGTAT 122 CACCTAGATA CCAATTGGAC CCAAGTAATA GCACGAAAGA AAGAATGAAA GAATGGAATT 5223 | |||||||| |||||||||| |||||| || |||||||||| |||||||||| | || |||| CGCCTAGATA CCAATTGGAC TCAAGTAGTA GCACGAAAGA AAGAATGAAA GGGTGAAATT 182 TTCCTAAAGT CTTATAGCCC CTCAAAGAAA AGTAAAGGTG TCCCCCTACC GTTCCTTAAG 5163 |||||||||| ||||||||| |||||||||| |||||||| | |||||||||| |||||| ||| TTCCTAAAGT CTTATAGCCT CTCAAAGAAA AGTAAAGGCG TCCCCCTACC GTTCCTAAAG 242 ACTCTACCAG ACTCGTTCTT GTGTGATGAG ACCAACGAAC CTAA 5119 ||||||| || || |||||| |||||||||| |||||||||| |||| ACTCTACTAG ACCTGTTCTT GTGTGATGAG ACCAACGAAC CTAA 286 hqPGS_C06HBa0153O03.1-1-_SGN-E355114- (5402 5119) ******************************************************************************** EST sequence 175 +strand 694 n (File: SGN-E353359+) 1 TGTAGTCTAT GCACATTCAA AAACTGCCGA TCTTTACAAC AAAACCGGAG CACCCCAAGG 61 AGATGCACTT GGTCTAATGA AGCCTTTGTT CAATAACTCT TGAAGTTGTG CCTTTAACTC 121 TCTTAACTCT GCGGGAGCCA TTCTATAAGG GGGTATAAAA ATGGGGCGTG TGCCCGGTTC 181 GAGATCAATA CAGAAGTCAA TATCCCTATC CGGTGGCATA CCAGGATGAT CTGCAGGGAA 241 CACATCCATA AACTCACGAA CTACTGAAAC TGAGTCAATC GAAGGTACTT GGGTAGTGTT 301 ATCCTTGAGA TGTGCCAAGA AAGCTAAACA CCCTTTACTA ACCATTTTCT TAGCACGAAG 361 AAAGGAGATG ATACGCACCG GATTGGAAGT GTAGTCACCC TCCTACACTA ACAGATCTGT 421 CCCAGGCTTG GCTAACGTCA CGGTTTTAGC ATTACAATCC AAGATTGCAA AATTTGGAGA 481 AAGCCAAGTC ATACCCAGAA TTACATCGAA GTCAACCATT TCTTGCAAGG ATTTTGCCGT 541 AGCCGCTACC TGTAACGCTG AAATCCGCAA CTCTGACCTC AACCCTTTCA CAAAACGACG 601 AATCCTCTCT TGTGGACTGA AACAAAGTTG AGTGGCATAT CTGGATAGTG CACGAAACTT 661 AGCCTCATAT GCATTGACCT ACATCCTACC TTGC Predicted gene structure (within gDNA segment 8556 to 5306): Exon 1 7495 7036 ( 460 n); cDNA 48 507 ( 460 n); score: 0.933 Intron 1 7035 6292 ( 744 n); Pd: 0.000 (s: 0.96), Pa: 0.000 (s: 0.86) Exon 2 6291 6105 ( 187 n); cDNA 508 694 ( 187 n); score: 0.930 MATCH C06HBa0153O03.1-1- SGN-E353359+ 0.932 647 0.932 C PGS_C06HBa0153O03.1-1-_SGN-E353359+ (7495 7036,6291 6105) Alignment (genomic DNA sequence = upper lines): GAGCACCCCA AGGAGATGCA CTTGGTCTAA TGAAACCCTT GCTCAACAAC TCTTGAAGTT 7436 |||||||||| |||||||||| |||||||||| |||| || || | |||| ||| |||||||||| GAGCACCCCA AGGAGATGCA CTTGGTCTAA TGAAGCCTTT GTTCAATAAC TCTTGAAGTT 107 GGGCTTTTAA CTCTCTTAAC TCCGCGGGAG CAATTCTATA AGGGGGTATA GAAATGGGGC 7376 | || ||||| |||||||||| || ||||||| | |||||||| |||||||||| ||||||||| GTGCCTTTAA CTCTCTTAAC TCTGCGGGAG CCATTCTATA AGGGGGTATA AAAATGGGGC 167 GAGTACCCGG TTCAAGATCA ATACAGAAGT CAATATCCCT ATTTGGTGTC ATACCAGGAA 7316 | || ||||| ||| |||||| |||||||||| |||||||||| || |||| | ||||||||| GTGTGCCCGG TTCGAGATCA ATACAGAAGT CAATATCCCT ATCCGGTGGC ATACCAGGAT 227 GATCTGCAGG GAACACATCC AGAAACTCAC GGACCACCGA AACCGACTCA ATCGAAGGTA 7256 |||||||||| |||||||||| | |||||||| | || || || ||| || ||| |||||||||| GATCTGCAGG GAACACATCC ATAAACTCAC GAACTACTGA AACTGAGTCA ATCGAAGGTA 287 CTTGGGTGGT GTCATCCTTG AGATGTGCCA AGAAAGCTAA ACACCCTTTA CTAACCATTT 7196 ||||||| || || ||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTGGGTAGT GTTATCCTTG AGATGTGCCA AGAAAGCTAA ACACCCTTTA CTAACCATTT 347 TCTTAGCACG AAGAAAGGAG ATGATACGCA CCGGATTGGA AGTGTAGTCA CCCTCCCACA 7136 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||| ||| TCTTAGCACG AAGAAAGGAG ATGATACGCA CCGGATTGGA AGTGTAGTCA CCCTCCTACA 407 CTAACGGATC TGTCTCAGGC TTGGCTAACG TCAGAGTTTT AGCATTACAA TCCAAGATCG 7076 ||||| |||| |||| ||||| |||||||||| ||| ||||| |||||||||| |||||||| | CTAACAGATC TGTCCCAGGC TTGGCTAACG TCACGGTTTT AGCATTACAA TCCAAGATTG 467 CAAAATTCGG AGAAAGCCAA GTCATACCCA GAATTACATC AAAATCATCC ATTTCTAAGA 7016 ||||||| || |||||||||| |||||||||| |||||||||| CAAAATTTGG AGAAAGCCAA GTCATACCCA GAATTACATC .......... .......... 507 TAACCAAATC TACATAAGTG TTGCTCCCCA AAAAGTTCAC CAAACAAGAC CTATATACCT 6956 .......... .......... .......... .......... .......... .......... 507 TTTCAACTAC CACAGACTCA CCCACCGGAG TAGAAACACG AATAGGCATA TCAAGTAATT 6896 .......... .......... .......... .......... .......... .......... 507 CACAATATAA ATTTAGACCG TTAGCAAATG AGGAAGATAC ATAAGAAAAT GTGGATCCAG 6836 .......... .......... .......... .......... .......... .......... 507 GATCAAACAA TACAGAAGCC ATGCAATCAC AAACTAGAAG ATTACCTGTG ATGACAGCAT 6776 .......... .......... .......... .......... .......... .......... 507 CAGATGCCTC CGCTTCAGAC CGCCCAGGGA AAGCGTAACA ATGGGCCCTA TCATTTGTCT 6716 .......... .......... .......... .......... .......... .......... 507 GTCCATTGCC CCTACCATGT TGTGATGTAG TGGCTCCGTT TTTCCCATCA CCTCGGCCGT 6656 .......... .......... .......... .......... .......... .......... 507 TTTGGTGACC ACCATTACCC CGACCACCAC GTCCTCAAGA ATAACGGCCT CTACCACGAC 6596 .......... .......... .......... .......... .......... .......... 507 CACCTCTACC TCTAGCCATT GGGGGTCTAT AACTCTGTTT TGGACAATTC CTCCTAATAT 6536 .......... .......... .......... .......... .......... .......... 507 GTCCAGTCTC CCCACATCCA TAACACTCTC TGGAGTCAAG CATAGGTCTC TCAGAGTAGT 6476 .......... .......... .......... .......... .......... .......... 507 GTTGACCGGT CTGAGGTGGA CCCCCAACTA CAGTCTGTAG TGAAGACTGA ATGGGTCGAA 6416 .......... .......... .......... .......... .......... .......... 507 CTGAGTAACC TACGGAACCC TGTCCTCTAG AGTAAGAACC ATTAAACTCA CCTCCCTTTC 6356 .......... .......... .......... .......... .......... .......... 507 GAAACCTCTT TGATGTCGAT GACATGGTGA ATTCATCTGG CTTCACTCCT TCCACCTCTA 6296 .......... .......... .......... .......... .......... .......... 507 CCACGAAATC TACCACTTCT TGGAAGGATT TTACCGTAGC TGCTACCTGT AAGGCTGAAA 6236 ||| || |||| |||| || ||||||| || ||||||| ||||||||| || ||||||| ....GAAGTC AACCATTTCT TGCAAGGATT TTGCCGTAGC CGCTACCTGT AACGCTGAAA 563 TCCGCAACTC TGACCTCAAC CCCTTCACAA AACGACGAAT CCACTCTTGT GGACTGAAAC 6176 |||||||||| |||||||||| || ||||||| |||||||||| || ||||||| |||||||||| TCCGCAACTC TGACCTCAAC CCTTTCACAA AACGACGAAT CCTCTCTTGT GGACTGAAAC 623 AAAGTTGGGT GGCATATCTG GATAGTGCAC GAAACTTAGC CTCATATGCA GTAACCGACA 6116 ||||||| || |||||||||| |||||||||| |||||||||| |||||||||| | ||| ||| AAAGTTGAGT GGCATATCTG GATAGTGCAC GAAACTTAGC CTCATATGCA TTGACCTACA 683 TCCTACCTTG C 6105 |||||||||| | TCCTACCTTG C 694 hqPGS_C06HBa0153O03.1-1-_SGN-E353359+ (7495 7036,6291 6105) ******************************************************************************** EST sequence 163 +strand 433 n (File: SGN-E352180+) 1 CCCTTGAAGA CCGGAGGTTT CAATTTCAAG AACTTACTGA AAAGTTCATG CTGATCATTT 61 GTCATTATAG GCCCAGTAGT CAAACGTGGA AATGTACCTA TGTGCAATGG AACATCCATG 121 CGGGGAGCCA TAGTAGCCGC ATGTTGTACT TCTGAAACCG GAGGTGTTGG CGCAGAAAAC 181 ACTGGAGGTG CTTGACCTTG ATCAGATAAA CCGCTAAGAT AAGCCAGAAC CTGATTGATC 241 ATCTCTGGGG TAGGTTGGGG TGGCATTTCC TCATTTTGCA CTTGTTCAGT TTCCCCATCC 301 TCCCCTTCTC TTATTACTTC CTCAGTCGGT GGAGGAGTCA CCGCCCTAGT ACCAGATGGG 361 CTCGGTTCTC GTTCTCTTCC TCTAGATGAC GTCCTCCCAC GACCTCTACC ACGGCCCCTT 421 GCCGCTGTTC TCC Predicted gene structure (within gDNA segment 6551 to 4045): Exon 1 5843 5413 ( 431 n); cDNA 1 431 ( 431 n); score: 0.896 MATCH C06HBa0153O03.1-1- SGN-E352180+ 0.896 431 0.995 C PGS_C06HBa0153O03.1-1-_SGN-E352180+ (5843 5413) Alignment (genomic DNA sequence = upper lines): CCCTTGAAGA CTGGAGGTTT CAATTTCAAG AACTTACTGA AAAGTTCATG CTGATCATTT 5784 |||||||||| | |||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCCTTGAAGA CCGGAGGTTT CAATTTCAAG AACTTACTGA AAAGTTCATG CTGATCATTT 60 GTCATTATAG GCCCAGTAGT CAGACGTGGA AACGTGCCTA TGTCCAATGA GGCATCCATA 5724 |||||||||| |||||||||| || ||||||| || || |||| ||| ||||| ||||||| GTCATTATAG GCCCAGTAGT CAAACGTGGA AATGTACCTA TGTGCAATGG AACATCCATG 120 CGAGGAGCCA TAGTGGCTGC ATGTTTTACC TCTGAAACTG GAGGTGTTGG TGCAGAAAAC 5664 || ||||||| |||| || || ||||| ||| |||||||| | |||||||||| ||||||||| CGGGGAGCCA TAGTAGCCGC ATGTTGTACT TCTGAAACCG GAGGTGTTGG CGCAGAAAAC 180 ACTGGAGGGG CCTGACCCTG ATCAGACAAC CCACTAAGAT AAGCGAGAAC CTGATTGATC 5604 |||||||| | | ||||| || |||||| || || ||||||| |||| ||||| |||||||||| ACTGGAGGTG CTTGACCTTG ATCAGATAAA CCGCTAAGAT AAGCCAGAAC CTGATTGATC 240 ATCTCTGGGG TAGGTTGGGG TGACAATTCC TTATTTTGCA CTTGTTCATT CTCCCCTTCC 5544 |||||||||| |||||||||| || || |||| | |||||||| |||||||| | ||||| ||| ATCTCTGGGG TAGGTTGGGG TGGCATTTCC TCATTTTGCA CTTGTTCAGT TTCCCCATCC 300 TCACCCTCTC TTACCACTTC CTCAGTCGGT GGAGGAGTCA CCGCCCTAGG ATCAGACAGG 5484 || || |||| ||| ||||| |||||||||| |||||||||| ||||||||| | |||| || TCCCCTTCTC TTATTACTTC CTCAGTCGGT GGAGGAGTCA CCGCCCTAGT ACCAGATGGG 360 CTAGGTGCTC GTCCTCTTCC TCTAGAGGAC GTCCTCCCTC GACCTCTACC ACGGCCTCTT 5424 || ||| ||| || ||||||| |||||| ||| |||||||| | |||||||||| |||||| ||| CTCGGTTCTC GTTCTCTTCC TCTAGATGAC GTCCTCCCAC GACCTCTACC ACGGCCCCTT 420 GCCGCTACTC T 5413 |||||| || | GCCGCTGTTC T 431 hqPGS_C06HBa0153O03.1-1-_SGN-E352180+ (5843 5413) ******************************************************************************** EST sequence 80 -strand 282 n (File: SGN-E368761-) 1 CCTCCAAGCT CGGACACTCC TTGGGAACTC TGGGGTCAGA ATTTGGCCGA TAGGTTTAAG 61 ATTGATGTAC CATCTATTAT GTACTGTAAT AGTTTCTATG AATGCCATGT AGACTATTTA 121 ATACCCTGTT TCTTGTTGTT CTCCCCATCC TCCCCCTCTC TTACTAGCTC CTCAGTCGGT 181 GGAGGAGTCA CCGCCCTAGC ATCAGATGGA CTAGGTGCTC GTCCTCTTCC TCTAGAGGAC 241 GTTCTCTCAC GACCTCTACC ACGGCCTCTT GCCGCTGTTC TC Predicted gene structure (within gDNA segment 7815 to 4758): Exon 1 5562 5413 ( 150 n); cDNA 133 281 ( 149 n); score: 0.893 MATCH C06HBa0153O03.1-1- SGN-E368761- 0.893 150 0.532 C PGS_C06HBa0153O03.1-1-_SGN-E368761- (5562 5413) Alignment (genomic DNA sequence = upper lines): TTGTTCATTC TCCCCTTCCT CACCCTCTCT TACCACTTCC TCAGTCGGTG GAGGAGTCAC 5503 ||||| ||| ||||| |||| | |||||||| ||| | ||| |||||||||| |||||||||| TTGTT-GTTC TCCCCATCCT CCCCCTCTCT TACTAGCTCC TCAGTCGGTG GAGGAGTCAC 191 CGCCCTAGGA TCAGACAGGC TAGGTGCTCG TCCTCTTCCT CTAGAGGACG TCCTCCCTCG 5443 |||||||| | ||||| | | |||||||||| |||||||||| |||||||||| | ||| | || CGCCCTAGCA TCAGATGGAC TAGGTGCTCG TCCTCTTCCT CTAGAGGACG TTCTCTCACG 251 ACCTCTACCA CGGCCTCTTG CCGCTACTCT 5413 |||||||||| |||||||||| ||||| ||| ACCTCTACCA CGGCCTCTTG CCGCTGTTCT 281 hqPGS_C06HBa0153O03.1-1-_SGN-E368761- (5562 5413) ******************************************************************************** EST sequence 93 -strand 554 n (File: SGN-E329287-) 1 AGAATGATGC CCAAGTCATA CGTGGTGCCT CTGTTGGTTG ACACTCAACA TGTGACCGCC 61 ACCACATTTT GGCGTTCCCT TGAAACTGAT AAGTCACAAA CTCAACACCA AACCGTTCTA 121 CTATACCCAT CTTGTGTAGT AGCTCATGAC AGTCAACCAG AAAATCGTAG GCATCCTCAG 181 ATTCAACACC CTTGTAGACT TGAGGTTTCA ATTTCAAGAA CTTACTAAAA AGTTCATGCT 241 GATCATTTGT CATTATAGGC CCTGTAGTCA GACGGGGAAA CGTTCCTATT TCCAATGAGG 301 TATCGATGCG GGGAGTCACA GTAGGCGCAT GTTGTACCTC CGGAGCCTGA GGTGCTAGTG 361 CAGAAAACAC TGGAGGTGCT TGGCCTTGAT CAGATAACCC GCTAAGGTAA GCAAGAACCT 421 GATTGATCAT CTCTTGGGTA GGTTGGGGTG GCAATTCCTC ATTCTGCACT TGTTCATTCT 481 CCCCATCCTC ACCCTCTCTT ACCACTTCCT CAGTTGGTGG AGGTGTCACC GCCTTAGTAC 541 CAGATGGGCT AGGT Predicted gene structure (within gDNA segment 6641 to 4518): Exon 1 6031 5478 ( 554 n); cDNA 1 554 ( 554 n); score: 0.904 MATCH C06HBa0153O03.1-1- SGN-E329287- 0.904 554 1.000 C PGS_C06HBa0153O03.1-1-_SGN-E329287- (6031 5478) Alignment (genomic DNA sequence = upper lines): AGAATGATGC CCAAGTCATA GGTGGTGCCT CTGTTGGTTG ACACTCAACA TGTGACCACC 5972 |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| ||||||| || AGAATGATGC CCAAGTCATA CGTGGTGCCT CTGTTGGTTG ACACTCAACA TGTGACCGCC 60 ACCACATTTT GGCGTTCCCT TGAAACTGAT AACTAACGAA CTCAACACCA AACCGTTCTA 5912 |||||||||| |||||||||| |||||||||| || | || || |||||||||| |||||||||| ACCACATTTT GGCGTTCCCT TGAAACTGAT AAGTCACAAA CTCAACACCA AACCGTTCTA 120 CTATACCCAT CTTGTGTAGT AGCTCATGAC AGTCAACCAG AAAATCGTAA GCATCCTCAA 5852 |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| ||||||||| CTATACCCAT CTTGTGTAGT AGCTCATGAC AGTCAACCAG AAAATCGTAG GCATCCTCAG 180 ATTTAGCACC CTTGAAGACT GGAGGTTTCA ATTTCAAGAA CTTACTGAAA AGTTCATGCT 5792 ||| | |||| |||| ||||| ||||||||| |||||||||| |||||| ||| |||||||||| ATTCAACACC CTTGTAGACT TGAGGTTTCA ATTTCAAGAA CTTACTAAAA AGTTCATGCT 240 GATCATTTGT CATTATAGGC CCAGTAGTCA GACGTGGAAA CGTGCCTATG TCCAATGAGG 5732 |||||||||| |||||||||| || ||||||| |||| ||||| ||| ||||| |||||||||| GATCATTTGT CATTATAGGC CCTGTAGTCA GACGGGGAAA CGTTCCTATT TCCAATGAGG 300 CATCCATACG AGGAGCCATA GTGGCTGCAT GTTTTACCTC TGAAACTGGA GGTGTTGGTG 5672 ||| || || |||| || | || | |||| ||| |||||| | | | || |||| | ||| TATCGATGCG GGGAGTCACA GTAGGCGCAT GTTGTACCTC CGGAGCCTGA GGTGCTAGTG 360 CAGAAAACAC TGGAGGGGCC TGACCCTGAT CAGACAACCC ACTAAGATAA GCGAGAACCT 5612 |||||||||| |||||| || || || |||| |||| ||||| ||||| ||| || ||||||| CAGAAAACAC TGGAGGTGCT TGGCCTTGAT CAGATAACCC GCTAAGGTAA GCAAGAACCT 420 GATTGATCAT CTCTGGGGTA GGTTGGGGTG ACAATTCCTT ATTTTGCACT TGTTCATTCT 5552 |||||||||| |||| ||||| |||||||||| |||||||| ||| |||||| |||||||||| GATTGATCAT CTCTTGGGTA GGTTGGGGTG GCAATTCCTC ATTCTGCACT TGTTCATTCT 480 CCCCTTCCTC ACCCTCTCTT ACCACTTCCT CAGTCGGTGG AGGAGTCACC GCCCTAGGAT 5492 |||| ||||| |||||||||| |||||||||| |||| ||||| ||| |||||| ||| ||| | CCCCATCCTC ACCCTCTCTT ACCACTTCCT CAGTTGGTGG AGGTGTCACC GCCTTAGTAC 540 CAGACAGGCT AGGT 5478 |||| |||| |||| CAGATGGGCT AGGT 554 hqPGS_C06HBa0153O03.1-1-_SGN-E329287- (6031 5478) ******************************************************************************** EST sequence 111 +strand 716 n (File: SGN-E577713+) 1 CGGTGGATAC CTAGGCACCC AGAGACGAGG AAGGGCGTAG TAATCGACGA AATGCTTCGG 61 GGAGTTGAAA ATAAGCATAG ATCCGGAGAT TCCCGAATAG GGCAACCTTT CGAACTGCTG 121 CTGAATCCAT GGGCAGGCAA GAGACAACCT GGCGAACTGA AACATCTTAG TAGCCAGAGG 181 AAAAGAAAGC AAATAAGGAA GATACATAAG AAAACGTGGA TCCAGGATCA AACAATACAG 241 AAGCCATGCA ATCACAAACC AGAAGATTAC CTGTGATGAC AACATCAGAT GCCTCCGCTT 301 CACACCGCCC AGGGAAAGCG TAACAATGGG CCCTATCGTT CGTCTGTCCG TTGCCCCTAC 361 CATGTTGTGA TGTAGTGGCC CAAGTTTGCC CATTACCTCT GCCGTTTTGG TGACCACCAT 421 TACCTCGACC ACCACGTCCT CCAGAATAAC GGCCTCTACC ATGACCACCT CTACCTCTAG 481 CTATTGGGGG TCTATAACTT TGTCTGGGAC AATTTTTCCT AATATGTCCA ATCTCCCCAC 541 ATCCATACCA TTCTCTGGAG TCAATCCTAG GCCCCTCGGA GAAGTGTTGA CCGGTCTGAG 601 GTGGTCCCCC AACTACAGTC TGTAGTGAAG ACTCAATTGG TCGGACTGAG TAACTTCCCG 661 AACCCTGTCC TCTAGTGTAA GAACCATTAA ACTCACCTCC CTTTCGAAGC CTTTTT Predicted gene structure (within gDNA segment 9415 to 5663): Exon 1 6940 6911 ( 30 n); cDNA 156 185 ( 30 n); score: 0.633 Intron 1 6910 6876 ( 35 n); Pd: 0.631 (s: 0), Pa: 0.000 (s: 0.92) Exon 2 6875 6345 ( 531 n); cDNA 186 716 ( 531 n); score: 0.915 MATCH C06HBa0153O03.1-1- SGN-E577713+ 0.915 561 0.784 C PGS_C06HBa0153O03.1-1-_SGN-E577713+ (6940 6911,6875 6345) Alignment (genomic DNA sequence = upper lines): ACTCACCCAC CGGAGTAGAA ACACGAATAG GCATATCAAG TAATTCACAA TATAAATTTA 6881 ||| | || | ||||| | | ||| || ACTGAAACAT CTTAGTAGCC AGAGGAAAAG .......... .......... .......... 185 GACCGTTAGC AAATGAGGAA GATACATAAG AAAATGTGGA TCCAGGATCA AACAATACAG 6821 ||| |||| ||||| |||||||||| |||| ||||| |||||||||| |||||||||| .....AAAGC AAATAAGGAA GATACATAAG AAAACGTGGA TCCAGGATCA AACAATACAG 240 AAGCCATGCA ATCACAAACT AGAAGATTAC CTGTGATGAC AGCATCAGAT GCCTCCGCTT 6761 |||||||||| ||||||||| |||||||||| |||||||||| | |||||||| |||||||||| AAGCCATGCA ATCACAAACC AGAAGATTAC CTGTGATGAC AACATCAGAT GCCTCCGCTT 300 CAGACCGCCC AGGGAAAGCG TAACAATGGG CCCTATCATT TGTCTGTCCA TTGCCCCTAC 6701 || ||||||| |||||||||| |||||||||| ||||||| || |||||||| |||||||||| CACACCGCCC AGGGAAAGCG TAACAATGGG CCCTATCGTT CGTCTGTCCG TTGCCCCTAC 360 CATGTTGTGA TGTAGTGGCT CCGTTTTTCC CATCACCTCG GCCGTTTTGG TGACCACCAT 6641 |||||||||| ||||||||| | ||| || ||| ||||| |||||||||| |||||||||| CATGTTGTGA TGTAGTGGCC CAAGTTTGCC CATTACCTCT GCCGTTTTGG TGACCACCAT 420 TACCCCGACC ACCACGTCCT CAAGAATAAC GGCCTCTACC ACGACCACCT CTACCTCTAG 6581 |||| ||||| |||||||||| | |||||||| |||||||||| | |||||||| |||||||||| TACCTCGACC ACCACGTCCT CCAGAATAAC GGCCTCTACC ATGACCACCT CTACCTCTAG 480 CCATTGGGGG TCTATAACTC TGTTTTGGAC AATTCCTCCT AATATGTCCA GTCTCCCCAC 6521 | |||||||| ||||||||| ||| | |||| |||| |||| |||||||||| ||||||||| CTATTGGGGG TCTATAACTT TGTCTGGGAC AATTTTTCCT AATATGTCCA ATCTCCCCAC 540 ATCCATAACA CTCTCTGGAG TCAAGCATAG GTCTCTCAGA GTAGTGTTGA CCGGTCTGAG 6461 ||||||| || ||||||||| |||| | ||| | | ||| || | |||||||| |||||||||| ATCCATACCA TTCTCTGGAG TCAATCCTAG GCCCCTCGGA GAAGTGTTGA CCGGTCTGAG 600 GTGGACCCCC AACTACAGTC TGTAGTGAAG ACTGAATGGG TCGAACTGAG TAACCTACGG 6401 |||| ||||| |||||||||| |||||||||| ||| ||| || ||| |||||| |||| | | | GTGGTCCCCC AACTACAGTC TGTAGTGAAG ACTCAATTGG TCGGACTGAG TAACTTCCCG 660 AACCCTGTCC TCTAGAGTAA GAACCATTAA ACTCACCTCC CTTTCGAAAC CTCTTT 6345 |||||||||| ||||| |||| |||||||||| |||||||||| |||||||| | || ||| AACCCTGTCC TCTAGTGTAA GAACCATTAA ACTCACCTCC CTTTCGAAGC CTTTTT 716 hqPGS_C06HBa0153O03.1-1-_SGN-E577713+ (6940 6911,6875 6345) ******************************************************************************** EST sequence 194 +strand 720 n (File: SGN-E356614+) 1 GCGAATCCGC TCTTGAGGAC TGAAACAGAG TTGAGTGGCA TATCTGGATA GTGCACGAAA 61 CTTAGCCTCA TAAGCATTAA CTGACACCCT ACCTTGCTCT AGGCTCAAGA ACTCATCCCT 121 TTTCTTATCC CTCAAAGTCC GGGGGATATA CTTCTCCATA AACAAACTAT AGAATGAGGC 181 CCAAGTCATA GGTGATGCCT CTATTGGTTG ACACTCGATA TGTGACCGCC ACCACATTTG 241 GGCGTTCCCT TGAAACTGAT AAGTCACAAA CTCCACACCA AACCGTTCTA CTATACCCAT 301 CTTGTGTAGT AGCTCGTGAC AGTCAACCAG AAAATCATAA GCATCCTCCG ATTCAGCACC 361 CTTGAAGACC GGAGGTTTCA ATTTCAAGAA CTTACTGAAA AGTTCATGCT GATCATTTGT 421 CATTATAGGG ACAGTAGTCA AACGTGGAAA TGTGCCTATG TCCAATGGAA CATCCATGCG 481 GGGAGCCATA GTAGCCGCAT GTTGTACTTC TGAAACCGGA GGTGTTGGCG CAGAAAACAC 541 TGGAGGTGCT TGACCTTGAT CAGATAACCC GCTAAGATAA GCCAGAACCT GATTGATCAT 601 CTCTGGGGTA GGTTGGGGTG GCATTTCCTC ATTTTGCACT TGNTCAGTTT CCCCATCCCT 661 CCCTTTCTCT ATTACTTCCT CAGTCAGTGG AAGAGTCACT GCCCTAGTAT CAGATGGGCT Predicted gene structure (within gDNA segment 7107 to 3972): Exon 1 6200 5482 ( 719 n); cDNA 2 720 ( 719 n); score: 0.901 MATCH C06HBa0153O03.1-1- SGN-E356614+ 0.901 719 0.999 C PGS_C06HBa0153O03.1-1-_SGN-E356614+ (6200 5482) Alignment (genomic DNA sequence = upper lines): CGAATCCACT CTTGTGGACT GAAACAAAGT TGGGTGGCAT ATCTGGATAG TGCACGAAAC 6141 ||||||| || |||| ||||| |||||| ||| || ||||||| |||||||||| |||||||||| CGAATCCGCT CTTGAGGACT GAAACAGAGT TGAGTGGCAT ATCTGGATAG TGCACGAAAC 61 TTAGCCTCAT ATGCAGTAAC CGACATCCTA CCTTGCTCTA GGCTCAAGAA CTCATCTCTT 6081 |||||||||| | ||| |||| |||| |||| |||||||||| |||||||||| |||||| ||| TTAGCCTCAT AAGCATTAAC TGACACCCTA CCTTGCTCTA GGCTCAAGAA CTCATCCCTT 121 TTCCTATCCC TCAAAGTGCG GGGTATATAC TTCTCCATAA ACAAACTAGA GAATGATGCC 6021 ||| |||||| ||||||| || ||| |||||| |||||||||| |||||||| | |||||| ||| TTCTTATCCC TCAAAGTCCG GGGGATATAC TTCTCCATAA ACAAACTATA GAATGAGGCC 181 CAAGTCATAG GTGGTGCCTC TGTTGGTTGA CACTCAACAT GTGACCACCA CCACATTTTG 5961 |||||||||| ||| |||||| | |||||||| ||||| | || |||||| ||| |||||||| | CAAGTCATAG GTGATGCCTC TATTGGTTGA CACTCGATAT GTGACCGCCA CCACATTTGG 241 GCGTTCCCTT GAAACTGATA ACTAACGAAC TCAACACCAA ACCGTTCTAC TATACCCATC 5901 |||||||||| |||||||||| | | || ||| || ||||||| |||||||||| |||||||||| GCGTTCCCTT GAAACTGATA AGTCACAAAC TCCACACCAA ACCGTTCTAC TATACCCATC 301 TTGTGTAGTA GCTCATGACA GTCAACCAGA AAATCGTAAG CATCCTCAAA TTTAGCACCC 5841 |||||||||| |||| ||||| |||||||||| ||||| |||| ||||||| | || ||||||| TTGTGTAGTA GCTCGTGACA GTCAACCAGA AAATCATAAG CATCCTCCGA TTCAGCACCC 361 TTGAAGACTG GAGGTTTCAA TTTCAAGAAC TTACTGAAAA GTTCATGCTG ATCATTTGTC 5781 |||||||| | |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGAAGACCG GAGGTTTCAA TTTCAAGAAC TTACTGAAAA GTTCATGCTG ATCATTTGTC 421 ATTATAGGCC CAGTAGTCAG ACGTGGAAAC GTGCCTATGT CCAATGAGGC ATCCATACGA 5721 |||||||| ||||||||| ||||||||| |||||||||| |||||| | |||||| || ATTATAGGGA CAGTAGTCAA ACGTGGAAAT GTGCCTATGT CCAATGGAAC ATCCATGCGG 481 GGAGCCATAG TGGCTGCATG TTTTACCTCT GAAACTGGAG GTGTTGGTGC AGAAAACACT 5661 |||||||||| | || ||||| || ||| ||| ||||| |||| ||||||| || |||||||||| GGAGCCATAG TAGCCGCATG TTGTACTTCT GAAACCGGAG GTGTTGGCGC AGAAAACACT 541 GGAGGGGCCT GACCCTGATC AGACAACCCA CTAAGATAAG CGAGAACCTG ATTGATCATC 5601 ||||| || | |||| ||||| ||| ||||| |||||||||| | |||||||| |||||||||| GGAGGTGCTT GACCTTGATC AGATAACCCG CTAAGATAAG CCAGAACCTG ATTGATCATC 601 TCTGGGGTAG GTTGGGGTGA CAATTCCTTA TTTTGCACTT GTTCATTCTC CCCTTCCTCA 5541 |||||||||| ||||||||| || ||||| | |||||||||| | ||| | || ||| ||| | TCTGGGGTAG GTTGGGGTGG CATTTCCTCA TTTTGCACTT GNTCAGTTTC CCCATCC-CT 660 CCCTCTCT-T ACCACTTCCT CAGTCGGTGG AGGAGTCACC GCCCTAGGAT CAGACAGGCT 5482 |||| ||| | | ||||||| ||||| |||| | ||||||| ||||||| || |||| |||| CCCTTTCTCT ATTACTTCCT CAGTCAGTGG AAGAGTCACT GCCCTAGTAT CAGATGGGCT 720 hqPGS_C06HBa0153O03.1-1-_SGN-E356614+ (6200 5482) ******************************************************************************** EST sequence 122 +strand 664 n (File: SGN-E352401+) 1 AATCCGCTCT TGAGGACTGA AACAGAGTTG AGTGGCATAT CTGGATAGTG CACGAAACTT 61 AGCCTCATAA GCATTAACTG ACACCCTACC TTGCTCTAGG CTCAAGAACT CATCCCTTTT 121 CTTATCCCTC AAAGTCCGGG GGATATACTT CTCCATAAAC AAACTATAGA ATGAGGCCCA 181 AGTCATAGGT GATGCCTCTA TTGGTTGACA CTCGATATGT GACCGCCACC ACATTTGGGC 241 GTTCCCTTGA AACTGATAAG TCACAAACTC CACACCAAAC CGTTCTACTA TACCCATCTT 301 GTGTAGTAGC TCGTGACAGT CAACCAGAAA ATCATAAGCA TCCTCCGATT CAGCACCCTT 361 GAAGACCGGA GGTTTCAATT TCAAGAACTT ACTGAAAAGT TCATGCTGAT CATTTGTCAT 421 TATAGGGACA GTAGTCAAAC GTGGAAATGT GCCTATGTCC AATGGAACAT CCATGCGGGG 481 AGCCATAGTA GCCGCATGTT GTACTTCTGA AACCGGAGGT GTTGGCGCAG AAAACACTGG 541 AGGTGCTTGA CCTTGATCAG ATAACCCGCT AAGATAAGCC AGAACCTGAT TGATCATCTC 601 TGGGGTAGGT TGGGGTGCCA TTTCCCTCAT TTGCACTTGT TCAGTTTCCC CATCCTCCCC 661 TTCT Predicted gene structure (within gDNA segment 7077 to 4502): Exon 1 6198 5535 ( 664 n); cDNA 1 664 ( 664 n); score: 0.908 MATCH C06HBa0153O03.1-1- SGN-E352401+ 0.908 664 1.000 C PGS_C06HBa0153O03.1-1-_SGN-E352401+ (6198 5535) Alignment (genomic DNA sequence = upper lines): AATCCACTCT TGTGGACTGA AACAAAGTTG GGTGGCATAT CTGGATAGTG CACGAAACTT 6139 ||||| |||| || ||||||| |||| ||||| ||||||||| |||||||||| |||||||||| AATCCGCTCT TGAGGACTGA AACAGAGTTG AGTGGCATAT CTGGATAGTG CACGAAACTT 60 AGCCTCATAT GCAGTAACCG ACATCCTACC TTGCTCTAGG CTCAAGAACT CATCTCTTTT 6079 ||||||||| ||| |||| | ||| |||||| |||||||||| |||||||||| |||| ||||| AGCCTCATAA GCATTAACTG ACACCCTACC TTGCTCTAGG CTCAAGAACT CATCCCTTTT 120 CCTATCCCTC AAAGTGCGGG GTATATACTT CTCCATAAAC AAACTAGAGA ATGATGCCCA 6019 | |||||||| ||||| |||| | |||||||| |||||||||| |||||| ||| |||| ||||| CTTATCCCTC AAAGTCCGGG GGATATACTT CTCCATAAAC AAACTATAGA ATGAGGCCCA 180 AGTCATAGGT GGTGCCTCTG TTGGTTGACA CTCAACATGT GACCACCACC ACATTTTGGC 5959 |||||||||| | ||||||| |||||||||| ||| | |||| |||| ||||| |||||| ||| AGTCATAGGT GATGCCTCTA TTGGTTGACA CTCGATATGT GACCGCCACC ACATTTGGGC 240 GTTCCCTTGA AACTGATAAC TAACGAACTC AACACCAAAC CGTTCTACTA TACCCATCTT 5899 |||||||||| ||||||||| | || ||||| ||||||||| |||||||||| |||||||||| GTTCCCTTGA AACTGATAAG TCACAAACTC CACACCAAAC CGTTCTACTA TACCCATCTT 300 GTGTAGTAGC TCATGACAGT CAACCAGAAA ATCGTAAGCA TCCTCAAATT TAGCACCCTT 5839 |||||||||| || ||||||| |||||||||| ||| |||||| ||||| ||| ||||||||| GTGTAGTAGC TCGTGACAGT CAACCAGAAA ATCATAAGCA TCCTCCGATT CAGCACCCTT 360 GAAGACTGGA GGTTTCAATT TCAAGAACTT ACTGAAAAGT TCATGCTGAT CATTTGTCAT 5779 |||||| ||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAAGACCGGA GGTTTCAATT TCAAGAACTT ACTGAAAAGT TCATGCTGAT CATTTGTCAT 420 TATAGGCCCA GTAGTCAGAC GTGGAAACGT GCCTATGTCC AATGAGGCAT CCATACGAGG 5719 |||||| || ||||||| || ||||||| || |||||||||| |||| ||| |||| || || TATAGGGACA GTAGTCAAAC GTGGAAATGT GCCTATGTCC AATGGAACAT CCATGCGGGG 480 AGCCATAGTG GCTGCATGTT TTACCTCTGA AACTGGAGGT GTTGGTGCAG AAAACACTGG 5659 ||||||||| || ||||||| ||| ||||| ||| |||||| ||||| |||| |||||||||| AGCCATAGTA GCCGCATGTT GTACTTCTGA AACCGGAGGT GTTGGCGCAG AAAACACTGG 540 AGGGGCCTGA CCCTGATCAG ACAACCCACT AAGATAAGCG AGAACCTGAT TGATCATCTC 5599 ||| || ||| || ||||||| | ||||| || ||||||||| |||||||||| |||||||||| AGGTGCTTGA CCTTGATCAG ATAACCCGCT AAGATAAGCC AGAACCTGAT TGATCATCTC 600 TGGGGTAGGT TGGGGTGACA ATTCCTTATT TTGCACTTGT TCATTCTCCC CTTCCTCACC 5539 |||||||||| ||||||| || |||| | | |||||||||| ||| | |||| | ||||| || TGGGGTAGGT TGGGGTGCCA TTTCCCTCAT TTGCACTTGT TCAGTTTCCC CATCCTCCCC 660 CTCT 5535 ||| TTCT 664 hqPGS_C06HBa0153O03.1-1-_SGN-E352401+ (6198 5535) ******************************************************************************** EST sequence 187 +strand 713 n (File: SGN-E349404+) 1 GGGGTTCTGT CCTCTAGAAT AAGAACCATT ATACTCTCCT CCCCTTCTAA ACCTCTTCGA 61 TGTCGATGTC GTGGTGAAGT CATCCGGTTT CACTCCTTCC ACCTCAATCA CAAAGTCTAC 121 CACCTCTTGA AAGGATGTTG CTGTTGCCGC TATCTGTAAG GCTGAAATCC GCAATTCTGA 181 TCTCAACCCC TTCACAAAAC GGCGGATCCG TTCTTGTGGA CTAAAACAGA GTTGGGTGGC 241 GTATCTGGAT AGTGCGCGAA ATTTAGCCTC ATACGCGTTT ACTGTCATCC TTCCCTGTTC 301 AAGACTCAAG AACTCATCCC TTTTCCTATC TCTCAAAGTC CTTGGGATAT ATTTCTCCAT 361 GAACAAACTA GAGAATGATT CCCAAGTCAT AGGTGGTGCC TCTGTTGGTT GACACTCAAC 421 ATGTGAACGC CACCACATCT TGGCGTTCCC TTGGAACTGA TAGCTCACGA ACTCGACGCC 481 AAACCTCTCT ACTATCCCCA TTTTGTGTAG TAGCTCATGA CAATCAACCA GAAAATCATA 541 AGCATCTTCA GATTCTGCAC CCTTGAAGAC TGGAGGTTTC AGTTTCAAGA ACTTACTGAA 601 AAGTTCATGT TGATCGTTTG TCATAATAGG CCCAGTAGTC AGACGTGGAA ATGTGCCTAT 661 TCCCAATGGT GCATCTATGC GGGGAGCCAT AGTAGCCGCA TGTCGTATCT CTG Predicted gene structure (within gDNA segment 8379 to 4522): Exon 1 6402 5690 ( 713 n); cDNA 1 713 ( 713 n); score: 0.870 MATCH C06HBa0153O03.1-1- SGN-E349404+ 0.870 713 1.000 C PGS_C06HBa0153O03.1-1-_SGN-E349404+ (6402 5690) Alignment (genomic DNA sequence = upper lines): GGAACCCTGT CCTCTAGAGT AAGAACCATT AAACTCACCT CCCTTTCGAA ACCTCTTTGA 6343 || |||| |||||||| | |||||||||| | |||| ||| ||| ||| || ||||||| || GGGGTTCTGT CCTCTAGAAT AAGAACCATT ATACTCTCCT CCCCTTCTAA ACCTCTTCGA 60 TGTCGATGAC ATGGTGAATT CATCTGGCTT CACTCCTTCC ACCTCTACCA CGAAATCTAC 6283 |||||||| | ||||||| | |||| || || |||||||||| ||||| | || | || ||||| TGTCGATGTC GTGGTGAAGT CATCCGGTTT CACTCCTTCC ACCTCAATCA CAAAGTCTAC 120 CACTTCTTGG AAGGATTTTA CCGTAGCTGC TACCTGTAAG GCTGAAATCC GCAACTCTGA 6223 ||| ||||| |||||| || | || || || || ||||||| |||||||||| |||| ||||| CACCTCTTGA AAGGATGTTG CTGTTGCCGC TATCTGTAAG GCTGAAATCC GCAATTCTGA 180 CCTCAACCCC TTCACAAAAC GACGAATCCA CTCTTGTGGA CTGAAACAAA GTTGGGTGGC 6163 ||||||||| |||||||||| | || |||| ||||||||| || ||||| | |||||||||| TCTCAACCCC TTCACAAAAC GGCGGATCCG TTCTTGTGGA CTAAAACAGA GTTGGGTGGC 240 ATATCTGGAT AGTGCACGAA ACTTAGCCTC ATATGCAGTA ACCGACATCC TACCTTGCTC 6103 ||||||||| ||||| |||| | |||||||| ||| || | || | ||||| | || || || GTATCTGGAT AGTGCGCGAA ATTTAGCCTC ATACGCGTTT ACTGTCATCC TTCCCTGTTC 300 TAGGCTCAAG AACTCATCTC TTTTCCTATC CCTCAAAGTG CGGGGTATAT ACTTCTCCAT 6043 || |||||| |||||||| | |||||||||| |||||||| | || |||| | |||||||| AAGACTCAAG AACTCATCCC TTTTCCTATC TCTCAAAGTC CTTGGGATAT ATTTCTCCAT 360 AAACAAACTA GAGAATGATG CCCAAGTCAT AGGTGGTGCC TCTGTTGGTT GACACTCAAC 5983 ||||||||| ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAACAAACTA GAGAATGATT CCCAAGTCAT AGGTGGTGCC TCTGTTGGTT GACACTCAAC 420 ATGTGACCAC CACCACATTT TGGCGTTCCC TTGAAACTGA TAACTAACGA ACTCAACACC 5923 |||||| | | |||||||| | |||||||||| ||| |||||| || || |||| |||| || || ATGTGAACGC CACCACATCT TGGCGTTCCC TTGGAACTGA TAGCTCACGA ACTCGACGCC 480 AAACCGTTCT ACTATACCCA TCTTGTGTAG TAGCTCATGA CAGTCAACCA GAAAATCGTA 5863 ||||| ||| ||||| |||| | |||||||| |||||||||| || ||||||| ||||||| || AAACCTCTCT ACTATCCCCA TTTTGTGTAG TAGCTCATGA CAATCAACCA GAAAATCATA 540 AGCATCCTCA AATTTAGCAC CCTTGAAGAC TGGAGGTTTC AATTTCAAGA ACTTACTGAA 5803 |||||| ||| ||| |||| |||||||||| |||||||||| | |||||||| |||||||||| AGCATCTTCA GATTCTGCAC CCTTGAAGAC TGGAGGTTTC AGTTTCAAGA ACTTACTGAA 600 AAGTTCATGC TGATCATTTG TCATTATAGG CCCAGTAGTC AGACGTGGAA ACGTGCCTAT 5743 ||||||||| ||||| |||| |||| ||||| |||||||||| |||||||||| | |||||||| AAGTTCATGT TGATCGTTTG TCATAATAGG CCCAGTAGTC AGACGTGGAA ATGTGCCTAT 660 GTCCAATGAG GCATCCATAC GAGGAGCCAT AGTGGCTGCA TGTTTTACCT CTG 5690 |||||| ||||| || | | |||||||| ||| || ||| ||| || || ||| TCCCAATGGT GCATCTATGC GGGGAGCCAT AGTAGCCGCA TGTCGTATCT CTG 713 hqPGS_C06HBa0153O03.1-1-_SGN-E349404+ (6402 5690) ******************************************************************************** EST sequence 159 +strand 679 n (File: SGN-E351625+) 1 GGGGTTCTGT CCTCTAGAAT AAGAACCATT ATACTCTCCT CCCCTTCTAA ACCTCTTCGA 61 TGTCGATGTC GTGGTGAAGT CATCCGGTTT CACTCCTTCC ACCTCAATCA CAAAGTCTAC 121 CACCTCTTGA AAGGATGTTG CTGTTGCCGC TATCTGTAAG GCTGAAATCC GCAATTCTGA 181 TCTCAACCCC TTCACAAAAC GGCGGATCCG TTCTTGTGGA CTAAAACAGA GTTGGGTGGC 241 GTATCTGGAT AGTGCGCGAA ATTTAGCCTC ATACGCGTTT ACTGTCATCC TTCCCTGTTC 301 AAGACTCAAG AACTCATCCC TTTTCCTATC TCTCAAAGTC CTTGGGATAT ATTTCTCCAT 361 GAACAAACTA GAGAATGATT CCCAAGTCAT AGGTGGTGCC TCTGTTGGTT GACACTCAAC 421 ATGTGAACGC CACCACATCT TGGCGTTCCC TTGGAACTGA TAGCTCACGA ACTCGACGCC 481 AAACCTCTCT ACTATCCCCA TTTTGTGTAG TAGCTCATGA CAATCAACCA GAAAATCATA 541 AGCATCTTCA GATTCTGCAC CCTTGAAGAC TGGAGGTTTC AGTTTCAAGA ACTTACTGAA 601 AAGTTCATGT TGATCGTTTG TCATAATAAG CCCAGTAGTC AGACGTGGAA ATGTGCCTAT 661 TCCCAATGGT GCATCTATG Predicted gene structure (within gDNA segment 8379 to 4484): Exon 1 6402 5725 ( 678 n); cDNA 1 678 ( 678 n); score: 0.872 MATCH C06HBa0153O03.1-1- SGN-E351625+ 0.872 678 0.999 C PGS_C06HBa0153O03.1-1-_SGN-E351625+ (6402 5725) Alignment (genomic DNA sequence = upper lines): GGAACCCTGT CCTCTAGAGT AAGAACCATT AAACTCACCT CCCTTTCGAA ACCTCTTTGA 6343 || |||| |||||||| | |||||||||| | |||| ||| ||| ||| || ||||||| || GGGGTTCTGT CCTCTAGAAT AAGAACCATT ATACTCTCCT CCCCTTCTAA ACCTCTTCGA 60 TGTCGATGAC ATGGTGAATT CATCTGGCTT CACTCCTTCC ACCTCTACCA CGAAATCTAC 6283 |||||||| | ||||||| | |||| || || |||||||||| ||||| | || | || ||||| TGTCGATGTC GTGGTGAAGT CATCCGGTTT CACTCCTTCC ACCTCAATCA CAAAGTCTAC 120 CACTTCTTGG AAGGATTTTA CCGTAGCTGC TACCTGTAAG GCTGAAATCC GCAACTCTGA 6223 ||| ||||| |||||| || | || || || || ||||||| |||||||||| |||| ||||| CACCTCTTGA AAGGATGTTG CTGTTGCCGC TATCTGTAAG GCTGAAATCC GCAATTCTGA 180 CCTCAACCCC TTCACAAAAC GACGAATCCA CTCTTGTGGA CTGAAACAAA GTTGGGTGGC 6163 ||||||||| |||||||||| | || |||| ||||||||| || ||||| | |||||||||| TCTCAACCCC TTCACAAAAC GGCGGATCCG TTCTTGTGGA CTAAAACAGA GTTGGGTGGC 240 ATATCTGGAT AGTGCACGAA ACTTAGCCTC ATATGCAGTA ACCGACATCC TACCTTGCTC 6103 ||||||||| ||||| |||| | |||||||| ||| || | || | ||||| | || || || GTATCTGGAT AGTGCGCGAA ATTTAGCCTC ATACGCGTTT ACTGTCATCC TTCCCTGTTC 300 TAGGCTCAAG AACTCATCTC TTTTCCTATC CCTCAAAGTG CGGGGTATAT ACTTCTCCAT 6043 || |||||| |||||||| | |||||||||| |||||||| | || |||| | |||||||| AAGACTCAAG AACTCATCCC TTTTCCTATC TCTCAAAGTC CTTGGGATAT ATTTCTCCAT 360 AAACAAACTA GAGAATGATG CCCAAGTCAT AGGTGGTGCC TCTGTTGGTT GACACTCAAC 5983 ||||||||| ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAACAAACTA GAGAATGATT CCCAAGTCAT AGGTGGTGCC TCTGTTGGTT GACACTCAAC 420 ATGTGACCAC CACCACATTT TGGCGTTCCC TTGAAACTGA TAACTAACGA ACTCAACACC 5923 |||||| | | |||||||| | |||||||||| ||| |||||| || || |||| |||| || || ATGTGAACGC CACCACATCT TGGCGTTCCC TTGGAACTGA TAGCTCACGA ACTCGACGCC 480 AAACCGTTCT ACTATACCCA TCTTGTGTAG TAGCTCATGA CAGTCAACCA GAAAATCGTA 5863 ||||| ||| ||||| |||| | |||||||| |||||||||| || ||||||| ||||||| || AAACCTCTCT ACTATCCCCA TTTTGTGTAG TAGCTCATGA CAATCAACCA GAAAATCATA 540 AGCATCCTCA AATTTAGCAC CCTTGAAGAC TGGAGGTTTC AATTTCAAGA ACTTACTGAA 5803 |||||| ||| ||| |||| |||||||||| |||||||||| | |||||||| |||||||||| AGCATCTTCA GATTCTGCAC CCTTGAAGAC TGGAGGTTTC AGTTTCAAGA ACTTACTGAA 600 AAGTTCATGC TGATCATTTG TCATTATAGG CCCAGTAGTC AGACGTGGAA ACGTGCCTAT 5743 ||||||||| ||||| |||| |||| ||| | |||||||||| |||||||||| | |||||||| AAGTTCATGT TGATCGTTTG TCATAATAAG CCCAGTAGTC AGACGTGGAA ATGTGCCTAT 660 GTCCAATGAG GCATCCAT 5725 |||||| ||||| || TCCCAATGGT GCATCTAT 678 hqPGS_C06HBa0153O03.1-1-_SGN-E351625+ (6402 5725) ******************************************************************************** EST sequence 200 +strand 612 n (File: SGN-E357065+) 1 GGGTTCTGTC CTCTAGAATA AGAACCATTA TACTCTCCTC CCCTTCTAAA CCTCTTCGAT 61 GTCGATGTCG TGGTGAAGTC ATCCGGTTTC ACTCCTTCCA CCTCAATCAC AAAGTCTACC 121 ACCTCTTGAA AGGATGTTGC TGTTGCCGCT ATCTGTAAGG CTGAAATCCG CAATTCTGAT 181 CTCAACCCCT TCACAAAACG GCGGATCCGT TCTTGTGGAC TAAAACAGAG TTGGGTGGCG 241 TATCTGGATA GTGCGCGAAA TTTAGCCTCA TACGCGTTTA CTGTCATCCT TCCCTGTTCA 301 AGACTCAAGA ACTCATCCCT TTTCCTATCT CTCAAAGTCC TTGGGATATA TTTCTCCATG 361 AACAAACTAG AGAATGATTC CCAAGTCATA GGTGGTGCCT CTGTTGGTTG ACACTCAACA 421 TGTGAACGCC ACCACATCTT GGCGTTCCCT TGGAACTGAT AGCTCACGAA CTCGACGCCA 481 AACCTCTCTA CTATCCCCAT TTTGTGTAGT AGCTCATGAC AATCAACCAG AAAATCATAA 541 GCATCTTCAG ATTCTGCACC CTTGAAGACT GGAGGTTTCA GTTTCAAGAA CTTACTGAAA 601 AGTCATGTTG AT Predicted gene structure (within gDNA segment 8369 to 5099): Exon 1 6396 5789 ( 608 n); cDNA 6 612 ( 607 n); score: 0.877 MATCH C06HBa0153O03.1-1- SGN-E357065+ 0.877 608 0.993 C PGS_C06HBa0153O03.1-1-_SGN-E357065+ (6396 5789) Alignment (genomic DNA sequence = upper lines): CTGTCCTCTA GAGTAAGAAC CATTAAACTC ACCTCCCTTT CGAAACCTCT TTGATGTCGA 6337 |||||||||| || ||||||| ||||| |||| |||||| || | |||||||| | |||||||| CTGTCCTCTA GAATAAGAAC CATTATACTC TCCTCCCCTT CTAAACCTCT TCGATGTCGA 65 TGACATGGTG AATTCATCTG GCTTCACTCC TTCCACCTCT ACCACGAAAT CTACCACTTC 6277 || | ||||| || ||||| | | |||||||| ||||||||| | ||| || | ||||||| || TGTCGTGGTG AAGTCATCCG GTTTCACTCC TTCCACCTCA ATCACAAAGT CTACCACCTC 125 TTGGAAGGAT TTTACCGTAG CTGCTACCTG TAAGGCTGAA ATCCGCAACT CTGACCTCAA 6217 ||| |||||| || | || | | |||| ||| |||||||||| |||||||| | |||| ||||| TTGAAAGGAT GTTGCTGTTG CCGCTATCTG TAAGGCTGAA ATCCGCAATT CTGATCTCAA 185 CCCCTTCACA AAACGACGAA TCCACTCTTG TGGACTGAAA CAAAGTTGGG TGGCATATCT 6157 |||||||||| ||||| || | ||| ||||| |||||| ||| || ||||||| |||| ||||| CCCCTTCACA AAACGGCGGA TCCGTTCTTG TGGACTAAAA CAGAGTTGGG TGGCGTATCT 245 GGATAGTGCA CGAAACTTAG CCTCATATGC AGTAACCGAC ATCCTACCTT GCTCTAGGCT 6097 ||||||||| ||||| |||| ||||||| || | || | | ||||| || | | || || || GGATAGTGCG CGAAATTTAG CCTCATACGC GTTTACTGTC ATCCTTCCCT GTTCAAGACT 305 CAAGAACTCA TCTCTTTTCC TATCCCTCAA AGTGCGGGGT ATATACTTCT CCATAAACAA 6037 |||||||||| || ||||||| |||| ||||| ||| | || ||||| |||| |||| ||||| CAAGAACTCA TCCCTTTTCC TATCTCTCAA AGTCCTTGGG ATATATTTCT CCATGAACAA 365 ACTAGAGAAT GATGCCCAAG TCATAGGTGG TGCCTCTGTT GGTTGACACT CAACATGTGA 5977 |||||||||| ||| |||||| |||||||||| |||||||||| |||||||||| |||||||||| ACTAGAGAAT GATTCCCAAG TCATAGGTGG TGCCTCTGTT GGTTGACACT CAACATGTGA 425 CCACCACCAC ATTTTGGCGT TCCCTTGAAA CTGATAACTA ACGAACTCAA CACCAAACCG 5917 | ||||||| || ||||||| ||||||| || |||||| || |||||||| | | ||||||| ACGCCACCAC ATCTTGGCGT TCCCTTGGAA CTGATAGCTC ACGAACTCGA CGCCAAACCT 485 TTCTACTATA CCCATCTTGT GTAGTAGCTC ATGACAGTCA ACCAGAAAAT CGTAAGCATC 5857 |||||||| ||||| |||| |||||||||| |||||| ||| |||||||||| | |||||||| CTCTACTATC CCCATTTTGT GTAGTAGCTC ATGACAATCA ACCAGAAAAT CATAAGCATC 545 CTCAAATTTA GCACCCTTGA AGACTGGAGG TTTCAATTTC AAGAACTTAC TGAAAAGTTC 5797 ||| ||| |||||||||| |||||||||| ||||| |||| |||||||||| ||||||| || TTCAGATTCT GCACCCTTGA AGACTGGAGG TTTCAGTTTC AAGAACTTAC TGAAAAG-TC 604 ATGCTGAT 5789 ||| |||| ATGTTGAT 612 hqPGS_C06HBa0153O03.1-1-_SGN-E357065+ (6396 5789) ******************************************************************************** EST sequence 124 +strand 524 n (File: SGN-E352365+) 1 GGGGTTCTGT CCTCTAGAAT AAGAACCATT ATACTCTCCT CCCCTTCTAA ACCTCTTCGA 61 TGTCGATGTC GTGGTGAAGT CATCCGGTTT CACTCCTTCC ACCTCAATCA CAAAGTCTAC 121 CACCTCTTGA AAGGATGTTG CTGTTGCCGC TATCTGTAAG GCTGAAATCC GCAATTCTGA 181 TCTCAACCCC TTCACAAAAC GGCGGATCCG TTCTTGTGGA CTAAAACAGA GTTGGGTGGC 241 GTATCTGGAT AGTGCGCGAA ATTTAGCCTC ATACGCGTTT ACTGTCATCC TTCCCTGTTC 301 AAGACTCAAG AACTCATCCC TTTTCCTATC TCTCAAAGTC CTTGGGATAT ATTTCTCCAT 361 GAACAAACTA GAGAATGATT CCCAAGTCAT AGGTGGTGCC TCTGTTGGTT GACACTCAAC 421 ATGTGAACGC CACCACATCT TGGCGTTCCC TTGGAACTGA TAGCTCACGA ACTCGACGCC 481 AAACCTCTCT ACTATCCCCA TTTTGTGTAG TAGCTCATGA CAAT Predicted gene structure (within gDNA segment 8379 to 4387): Exon 1 6402 5881 ( 522 n); cDNA 1 522 ( 522 n); score: 0.866 MATCH C06HBa0153O03.1-1- SGN-E352365+ 0.866 522 0.996 C PGS_C06HBa0153O03.1-1-_SGN-E352365+ (6402 5881) Alignment (genomic DNA sequence = upper lines): GGAACCCTGT CCTCTAGAGT AAGAACCATT AAACTCACCT CCCTTTCGAA ACCTCTTTGA 6343 || |||| |||||||| | |||||||||| | |||| ||| ||| ||| || ||||||| || GGGGTTCTGT CCTCTAGAAT AAGAACCATT ATACTCTCCT CCCCTTCTAA ACCTCTTCGA 60 TGTCGATGAC ATGGTGAATT CATCTGGCTT CACTCCTTCC ACCTCTACCA CGAAATCTAC 6283 |||||||| | ||||||| | |||| || || |||||||||| ||||| | || | || ||||| TGTCGATGTC GTGGTGAAGT CATCCGGTTT CACTCCTTCC ACCTCAATCA CAAAGTCTAC 120 CACTTCTTGG AAGGATTTTA CCGTAGCTGC TACCTGTAAG GCTGAAATCC GCAACTCTGA 6223 ||| ||||| |||||| || | || || || || ||||||| |||||||||| |||| ||||| CACCTCTTGA AAGGATGTTG CTGTTGCCGC TATCTGTAAG GCTGAAATCC GCAATTCTGA 180 CCTCAACCCC TTCACAAAAC GACGAATCCA CTCTTGTGGA CTGAAACAAA GTTGGGTGGC 6163 ||||||||| |||||||||| | || |||| ||||||||| || ||||| | |||||||||| TCTCAACCCC TTCACAAAAC GGCGGATCCG TTCTTGTGGA CTAAAACAGA GTTGGGTGGC 240 ATATCTGGAT AGTGCACGAA ACTTAGCCTC ATATGCAGTA ACCGACATCC TACCTTGCTC 6103 ||||||||| ||||| |||| | |||||||| ||| || | || | ||||| | || || || GTATCTGGAT AGTGCGCGAA ATTTAGCCTC ATACGCGTTT ACTGTCATCC TTCCCTGTTC 300 TAGGCTCAAG AACTCATCTC TTTTCCTATC CCTCAAAGTG CGGGGTATAT ACTTCTCCAT 6043 || |||||| |||||||| | |||||||||| |||||||| | || |||| | |||||||| AAGACTCAAG AACTCATCCC TTTTCCTATC TCTCAAAGTC CTTGGGATAT ATTTCTCCAT 360 AAACAAACTA GAGAATGATG CCCAAGTCAT AGGTGGTGCC TCTGTTGGTT GACACTCAAC 5983 ||||||||| ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAACAAACTA GAGAATGATT CCCAAGTCAT AGGTGGTGCC TCTGTTGGTT GACACTCAAC 420 ATGTGACCAC CACCACATTT TGGCGTTCCC TTGAAACTGA TAACTAACGA ACTCAACACC 5923 |||||| | | |||||||| | |||||||||| ||| |||||| || || |||| |||| || || ATGTGAACGC CACCACATCT TGGCGTTCCC TTGGAACTGA TAGCTCACGA ACTCGACGCC 480 AAACCGTTCT ACTATACCCA TCTTGTGTAG TAGCTCATGA CA 5881 ||||| ||| ||||| |||| | |||||||| |||||||||| || AAACCTCTCT ACTATCCCCA TTTTGTGTAG TAGCTCATGA CA 522 hqPGS_C06HBa0153O03.1-1-_SGN-E352365+ (6402 5881) ******************************************************************************** EST sequence 197 +strand 790 n (File: SGN-E356912+) 1 CAGGATCAAA CAATACGGAA GCCATGCAAT CACAAACTAG AAGATTACCT GTGATGACAG 61 CATCAGATGC CTCCGCTTCA GACCGCCCAG GGAAAGCGTA ACAATGGGCC CTATCGTTTG 121 TCTGCCCATT GCCCCTACCA TGTTGTGATG TAGTGGCCCC AGTTTGCCCA TTACCTCTGC 181 CGTTTTGGTG ACCACCATTA CCTCGACCAC CACGTCCTCC AGAATAACGG CCTCTACCAT 241 GACCACCTCT ACCTCTAGCT ATTGGGGGTC TATAACTTGG TCCAGGACAA TTTATCCTAA 301 TATGTCCAAT CTCCCCACAT CCATAACATT CTCTGGAGTC ACTCATAGGC CCCTTGGAGA 361 AGTGTTGACC GGTCTGAGGT GGACCCCCAA CTACAGTCTG TAGTGAAGAC TGAATGGGTC 421 GAGCCGAGTA ACCTCCGGAA CCTTGTCCTC TAGAGTAAGA ACCCTTAAAC TCACCTCCCT 481 TTCGAAACCT CTTTGATGTT GATGTCGTGG TGAAGTCGTC TGGCTTCACT CCTTCCACTT 541 CTATCACAAA GTCTACCACT TCTTGGAAGG ATTTTGCCGT TGCCGCTACC TGTAAGGCCG 601 AAATCCGCAA TTCTGACCTC AACCCCTTCA CAAACCGGAG AATCCGCTCT TGAGGACTGA 661 AACAGAGTTG AGTGGCATAT CCGGATAGTG CACGGAACTT AGCCTCATAA GCATTAACCG 721 ACATCCTACC TTGCTCTAGG CTCAGGAACT CATCTCTTTT CCTATCCCTC AAAGTCCGGG 781 GGATATACTT Predicted gene structure (within gDNA segment 7438 to 5304): Exon 1 6838 6049 ( 790 n); cDNA 1 790 ( 790 n); score: 0.922 MATCH C06HBa0153O03.1-1- SGN-E356912+ 0.922 790 1.000 C PGS_C06HBa0153O03.1-1-_SGN-E356912+ (6838 6049) Alignment (genomic DNA sequence = upper lines): CAGGATCAAA CAATACAGAA GCCATGCAAT CACAAACTAG AAGATTACCT GTGATGACAG 6779 |||||||||| |||||| ||| |||||||||| |||||||||| |||||||||| |||||||||| CAGGATCAAA CAATACGGAA GCCATGCAAT CACAAACTAG AAGATTACCT GTGATGACAG 60 CATCAGATGC CTCCGCTTCA GACCGCCCAG GGAAAGCGTA ACAATGGGCC CTATCATTTG 6719 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||| |||| CATCAGATGC CTCCGCTTCA GACCGCCCAG GGAAAGCGTA ACAATGGGCC CTATCGTTTG 120 TCTGTCCATT GCCCCTACCA TGTTGTGATG TAGTGGCTCC GTTTTTCCCA TCACCTCGGC 6659 |||| ||||| |||||||||| |||||||||| ||||||| || ||| |||| | ||||| || TCTGCCCATT GCCCCTACCA TGTTGTGATG TAGTGGCCCC AGTTTGCCCA TTACCTCTGC 180 CGTTTTGGTG ACCACCATTA CCCCGACCAC CACGTCCTCA AGAATAACGG CCTCTACCAC 6599 |||||||||| |||||||||| || ||||||| ||||||||| |||||||||| ||||||||| CGTTTTGGTG ACCACCATTA CCTCGACCAC CACGTCCTCC AGAATAACGG CCTCTACCAT 240 GACCACCTCT ACCTCTAGCC ATTGGGGGTC TATAACTCTG TTTTGGACAA TTCCTCCTAA 6539 |||||||||| ||||||||| |||||||||| ||||||| | | |||||| || |||||| GACCACCTCT ACCTCTAGCT ATTGGGGGTC TATAACTTGG TCCAGGACAA TTTATCCTAA 300 TATGTCCAGT CTCCCCACAT CCATAACACT CTCTGGAGTC AAGCATAGGT CTCTCAGAGT 6479 |||||||| | |||||||||| |||||||| | |||||||||| | |||||| | || ||| TATGTCCAAT CTCCCCACAT CCATAACATT CTCTGGAGTC ACTCATAGGC CCCTTGGAGA 360 AGTGTTGACC GGTCTGAGGT GGACCCCCAA CTACAGTCTG TAGTGAAGAC TGAATGGGTC 6419 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGTGTTGACC GGTCTGAGGT GGACCCCCAA CTACAGTCTG TAGTGAAGAC TGAATGGGTC 420 GAACTGAGTA ACCTACGGAA CCCTGTCCTC TAGAGTAAGA ACCATTAAAC TCACCTCCCT 6359 || | ||||| |||| ||||| || ||||||| |||||||||| ||| |||||| |||||||||| GAGCCGAGTA ACCTCCGGAA CCTTGTCCTC TAGAGTAAGA ACCCTTAAAC TCACCTCCCT 480 TTCGAAACCT CTTTGATGTC GATGACATGG TGAATTCATC TGGCTTCACT CCTTCCACCT 6299 |||||||||| ||||||||| |||| | ||| |||| || || |||||||||| |||||||| | TTCGAAACCT CTTTGATGTT GATGTCGTGG TGAAGTCGTC TGGCTTCACT CCTTCCACTT 540 CTACCACGAA ATCTACCACT TCTTGGAAGG ATTTTACCGT AGCTGCTACC TGTAAGGCTG 6239 ||| ||| || ||||||||| |||||||||| ||||| |||| || |||||| |||||||| | CTATCACAAA GTCTACCACT TCTTGGAAGG ATTTTGCCGT TGCCGCTACC TGTAAGGCCG 600 AAATCCGCAA CTCTGACCTC AACCCCTTCA CAAAACGACG AATCCACTCT TGTGGACTGA 6179 |||||||||| ||||||||| |||||||||| |||| || | ||||| |||| || ||||||| AAATCCGCAA TTCTGACCTC AACCCCTTCA CAAACCGGAG AATCCGCTCT TGAGGACTGA 660 AACAAAGTTG GGTGGCATAT CTGGATAGTG CACGAAACTT AGCCTCATAT GCAGTAACCG 6119 |||| ||||| ||||||||| | |||||||| |||| ||||| ||||||||| ||| |||||| AACAGAGTTG AGTGGCATAT CCGGATAGTG CACGGAACTT AGCCTCATAA GCATTAACCG 720 ACATCCTACC TTGCTCTAGG CTCAAGAACT CATCTCTTTT CCTATCCCTC AAAGTGCGGG 6059 |||||||||| |||||||||| |||| ||||| |||||||||| |||||||||| ||||| |||| ACATCCTACC TTGCTCTAGG CTCAGGAACT CATCTCTTTT CCTATCCCTC AAAGTCCGGG 780 GTATATACTT 6049 | |||||||| GGATATACTT 790 hqPGS_C06HBa0153O03.1-1-_SGN-E356912+ (6838 6049) ******************************************************************************** EST sequence 192 +strand 698 n (File: SGN-E356209+) 1 TTCAGACCGC CCAGGGAAAG CGTAACAATG GGCCCTATCG TTCGTCTGTC CGTTGCCCCT 61 ACCATGTTGT GATGTAGTGG CCCCAGTTTG CCCATTACCT CTGCCATTTT GGTGACCACC 121 ATTACCTCGA CCACCACGTC CTCCAGAATA ACGGCCTCTA CCATGACCAC CTCTACCTCT 181 AGCTATTGGG GGTCTATAAC TTTGTCTGGG ACAATTTTTC CTAATATGTC CAATCTCCCC 241 ACATCCATAA CATTCTCTGG AGTCAAGCAT AGGCCCCTCG GAGAAGTGTT AACCGGTCTG 301 AGGTGGTCTC CCAACTACAG TCTGTAGTGA AGACTGAATT GGTCGGACTG AGTAACTTCC 361 CGAACCCTGT CCTCTAGTGT AAGAACCATT AAACTCACCT CCCTTCCGAA ACCTTTTTGA 421 TGTCGATGTT GTGGTGAAGT CGTCTGGCTT CACTCCTTCC ACTTCTATCA CAAAGTCTAC 481 CACTTCTTGG AAGGATTTTG CCGTTGCCGC TATCTGTAAG GACGAAATCC GCAATTCTGA 541 CCTCAACCCC TTCACAAACC GGCGAATCCG CTCTTGAGGA CTGAAACAGA GTTGAGTGGC 601 ATATCTGGAT AGTGCACGAA ACTTAGCCTC ATAAGCATTA ACCGACATCC TACCTTGCTC 661 TAGGCTCAAG AACTCATCCC TTTTCCTATC CCTCAAAG Predicted gene structure (within gDNA segment 7362 to 5455): Exon 1 6762 6065 ( 698 n); cDNA 1 698 ( 698 n); score: 0.913 MATCH C06HBa0153O03.1-1- SGN-E356209+ 0.913 698 1.000 C PGS_C06HBa0153O03.1-1-_SGN-E356209+ (6762 6065) Alignment (genomic DNA sequence = upper lines): TTCAGACCGC CCAGGGAAAG CGTAACAATG GGCCCTATCA TTTGTCTGTC CATTGCCCCT 6703 |||||||||| |||||||||| |||||||||| ||||||||| || ||||||| | |||||||| TTCAGACCGC CCAGGGAAAG CGTAACAATG GGCCCTATCG TTCGTCTGTC CGTTGCCCCT 60 ACCATGTTGT GATGTAGTGG CTCCGTTTTT CCCATCACCT CGGCCGTTTT GGTGACCACC 6643 |||||||||| |||||||||| | || ||| ||||| |||| | ||| |||| |||||||||| ACCATGTTGT GATGTAGTGG CCCCAGTTTG CCCATTACCT CTGCCATTTT GGTGACCACC 120 ATTACCCCGA CCACCACGTC CTCAAGAATA ACGGCCTCTA CCACGACCAC CTCTACCTCT 6583 |||||| ||| |||||||||| ||| |||||| |||||||||| ||| |||||| |||||||||| ATTACCTCGA CCACCACGTC CTCCAGAATA ACGGCCTCTA CCATGACCAC CTCTACCTCT 180 AGCCATTGGG GGTCTATAAC TCTGTTTTGG ACAATTCCTC CTAATATGTC CAGTCTCCCC 6523 ||| |||||| |||||||||| | ||| | || |||||| || |||||||||| || ||||||| AGCTATTGGG GGTCTATAAC TTTGTCTGGG ACAATTTTTC CTAATATGTC CAATCTCCCC 240 ACATCCATAA CACTCTCTGG AGTCAAGCAT AGGTCTCTCA GAGTAGTGTT GACCGGTCTG 6463 |||||||||| || ||||||| |||||||||| ||| | ||| ||| |||||| ||||||||| ACATCCATAA CATTCTCTGG AGTCAAGCAT AGGCCCCTCG GAGAAGTGTT AACCGGTCTG 300 AGGTGGACCC CCAACTACAG TCTGTAGTGA AGACTGAATG GGTCGAACTG AGTAACCTAC 6403 |||||| | | |||||||||| |||||||||| ||||||||| ||||| |||| |||||| | | AGGTGGTCTC CCAACTACAG TCTGTAGTGA AGACTGAATT GGTCGGACTG AGTAACTTCC 360 GGAACCCTGT CCTCTAGAGT AAGAACCATT AAACTCACCT CCCTTTCGAA ACCTCTTTGA 6343 ||||||||| ||||||| || |||||||||| |||||||||| ||||| |||| |||| ||||| CGAACCCTGT CCTCTAGTGT AAGAACCATT AAACTCACCT CCCTTCCGAA ACCTTTTTGA 420 TGTCGATGAC ATGGTGAATT CATCTGGCTT CACTCCTTCC ACCTCTACCA CGAAATCTAC 6283 |||||||| ||||||| | | |||||||| |||||||||| || |||| || | || ||||| TGTCGATGTT GTGGTGAAGT CGTCTGGCTT CACTCCTTCC ACTTCTATCA CAAAGTCTAC 480 CACTTCTTGG AAGGATTTTA CCGTAGCTGC TACCTGTAAG GCTGAAATCC GCAACTCTGA 6223 |||||||||| ||||||||| |||| || || || ||||||| | ||||||| |||| ||||| CACTTCTTGG AAGGATTTTG CCGTTGCCGC TATCTGTAAG GACGAAATCC GCAATTCTGA 540 CCTCAACCCC TTCACAAAAC GACGAATCCA CTCTTGTGGA CTGAAACAAA GTTGGGTGGC 6163 |||||||||| |||||||| | | ||||||| |||||| ||| |||||||| | |||| ||||| CCTCAACCCC TTCACAAACC GGCGAATCCG CTCTTGAGGA CTGAAACAGA GTTGAGTGGC 600 ATATCTGGAT AGTGCACGAA ACTTAGCCTC ATATGCAGTA ACCGACATCC TACCTTGCTC 6103 |||||||||| |||||||||| |||||||||| ||| ||| || |||||||||| |||||||||| ATATCTGGAT AGTGCACGAA ACTTAGCCTC ATAAGCATTA ACCGACATCC TACCTTGCTC 660 TAGGCTCAAG AACTCATCTC TTTTCCTATC CCTCAAAG 6065 |||||||||| |||||||| | |||||||||| |||||||| TAGGCTCAAG AACTCATCCC TTTTCCTATC CCTCAAAG 698 hqPGS_C06HBa0153O03.1-1-_SGN-E356209+ (6762 6065) ******************************************************************************** EST sequence 151 +strand 763 n (File: SGN-E214046+) 1 ACGAATAGGC ATATCAAGAA ATTCACAATG TAAATTTAGA CCATTAGCAA ATGAGGAAGA 61 TACATAAGAA AACGTGGATC CAGGATCAAA CAATACGGAA GCCATGCAAT CACAAACAAG 121 AAGATTACCT GTGATGACAG CATCAGATGC CTCCGCTTTA GACCGCCCAG GGAAAGCGTA 181 ACAATGGGCC CTATAGTTTG TTTGTCCATT GCCCCTACCA TGTCGTGATG TAGTGGCCCC 241 AGTTTGTCCA TTACCTCTGC CGTTTTGGTG ACCACCATTG CCTCGACCAC CACGTCCTCC 301 AAAATAACGG CCTCTACCAT GACAACCTCT ACCTCTAACT ATTGGGGGTC TATAACTTGG 361 TCCGGGAAAA TTTCTCCTAA TATGTCCAAT CTCCCCACAT CCATAACATT CTCTAGAGTC 421 ACTCATAGGC CCCTCGGAGA AGTGTTGACC GGTCTGAGGT GGACCCCCAA CTACAGTTTG 481 TAGTGAAGAC TGAATGGGTC GAACTGAGTA ACCTCCAGAA CCTTGTCCTC TAGAGTAAGA 541 ACCCTTAAAC TCATCTCCCC TTTGAAACCT CATCGATGTC GATGTCGTGG TGAAGTCGTC 601 TGGTTTCACT CCTTCCACTT CTATCAAAAA GTCTACCACT TCTTGGAAGG ATTTTGCCGT 661 TGCCGCTACC TGTAAGGGCG AAATCCGCAA TTCTGACCTC AACCCCTTCA CAACCGGCGA 721 ATCCGCTCTT GAGGGACTGA ACAGAGTTGA GTGGCATATC TGG Predicted gene structure (within gDNA segment 7905 to 3566): Exon 1 6918 6155 ( 764 n); cDNA 1 763 ( 763 n); score: 0.897 MATCH C06HBa0153O03.1-1- SGN-E214046+ 0.897 764 1.001 C PGS_C06HBa0153O03.1-1-_SGN-E214046+ (6918 6155) Alignment (genomic DNA sequence = upper lines): ACGAATAGGC ATATCAAGTA ATTCACAATA TAAATTTAGA CCGTTAGCAA ATGAGGAAGA 6859 |||||||||| |||||||| | ||||||||| |||||||||| || ||||||| |||||||||| ACGAATAGGC ATATCAAGAA ATTCACAATG TAAATTTAGA CCATTAGCAA ATGAGGAAGA 60 TACATAAGAA AATGTGGATC CAGGATCAAA CAATACAGAA GCCATGCAAT CACAAACTAG 6799 |||||||||| || ||||||| |||||||||| |||||| ||| |||||||||| ||||||| || TACATAAGAA AACGTGGATC CAGGATCAAA CAATACGGAA GCCATGCAAT CACAAACAAG 120 AAGATTACCT GTGATGACAG CATCAGATGC CTCCGCTTCA GACCGCCCAG GGAAAGCGTA 6739 |||||||||| |||||||||| |||||||||| |||||||| | |||||||||| |||||||||| AAGATTACCT GTGATGACAG CATCAGATGC CTCCGCTTTA GACCGCCCAG GGAAAGCGTA 180 ACAATGGGCC CTATCATTTG TCTGTCCATT GCCCCTACCA TGTTGTGATG TAGTGGCTCC 6679 |||||||||| |||| |||| | |||||||| |||||||||| ||| |||||| ||||||| || ACAATGGGCC CTATAGTTTG TTTGTCCATT GCCCCTACCA TGTCGTGATG TAGTGGCCCC 240 GTTTTTCCCA TCACCTCGGC CGTTTTGGTG ACCACCATTA CCCCGACCAC CACGTCCTCA 6619 ||| ||| | ||||| || |||||||||| ||||||||| || ||||||| ||||||||| AGTTTGTCCA TTACCTCTGC CGTTTTGGTG ACCACCATTG CCTCGACCAC CACGTCCTCC 300 AGAATAACGG CCTCTACCAC GACCACCTCT ACCTCTAGCC ATTGGGGGTC TATAACTCTG 6559 | |||||||| ||||||||| ||| |||||| ||||||| | |||||||||| ||||||| | AAAATAACGG CCTCTACCAT GACAACCTCT ACCTCTAACT ATTGGGGGTC TATAACTTGG 360 TTTTGGACAA TTCCTCCTAA TATGTCCAGT CTCCCCACAT CCATAACACT CTCTGGAGTC 6499 | ||| || || ||||||| |||||||| | |||||||||| |||||||| | |||| ||||| TCCGGGAAAA TTTCTCCTAA TATGTCCAAT CTCCCCACAT CCATAACATT CTCTAGAGTC 420 AAGCATAGGT CTCTCAGAGT AGTGTTGACC GGTCTGAGGT GGACCCCCAA CTACAGTCTG 6439 | |||||| | ||| ||| |||||||||| |||||||||| |||||||||| ||||||| || ACTCATAGGC CCCTCGGAGA AGTGTTGACC GGTCTGAGGT GGACCCCCAA CTACAGTTTG 480 TAGTGAAGAC TGAATGGGTC GAACTGAGTA ACCTACGGAA CCCTGTCCTC TAGAGTAAGA 6379 |||||||||| |||||||||| |||||||||| |||| | ||| || ||||||| |||||||||| TAGTGAAGAC TGAATGGGTC GAACTGAGTA ACCTCCAGAA CCTTGTCCTC TAGAGTAAGA 540 ACCATTAAAC TCACCTCCCT TTCGAAACCT CTTTGATGTC GATGACATGG TGAATTCATC 6319 ||| |||||| ||| ||||| || ||||||| | | |||||| |||| | ||| |||| || || ACCCTTAAAC TCATCTCCCC TTTGAAACCT CATCGATGTC GATGTCGTGG TGAAGTCGTC 600 TGGCTTCACT CCTTCCACCT CTACCACGAA ATCTACCACT TCTTGGAAGG ATTTTACCGT 6259 ||| |||||| |||||||| | ||| || || ||||||||| |||||||||| ||||| |||| TGGTTTCACT CCTTCCACTT CTATCAAAAA GTCTACCACT TCTTGGAAGG ATTTTGCCGT 660 AGCTGCTACC TGTAAGGCTG AAATCCGCAA CTCTGACCTC AACCCCTTCA CAAAACGACG 6199 || |||||| ||||||| | |||||||||| ||||||||| |||||||||| | || || || TGCCGCTACC TGTAAGGGCG AAATCCGCAA TTCTGACCTC AACCCCTTCA C-AACCGGCG 719 AATCCACTCT TG-TGGACTG AAACAAAGTT GGGTGGCATA TCTGG 6155 ||||| |||| || |||||| |||| |||| | |||||||| ||||| AATCCGCTCT TGAGGGACTG -AACAGAGTT GAGTGGCATA TCTGG 763 hqPGS_C06HBa0153O03.1-1-_SGN-E214046+ (6918 6155) ******************************************************************************** EST sequence 143 +strand 533 n (File: SGN-E353805+) 1 GGATCAACAA TACGGAAGCC ATGCAATCAC AAACTAGAAG ATTACCTGTG ATGACAGCAT 61 CAGATGCCTC CGCTTCAGAC CGCCCAGGGA AAGCGTAACA ATGGGCCCTA TCGTTTGTCT 121 GCCCATTGCC CCTACCATGT TGTGATGTAG TGGCCCCAGT TTGCCCATTA CCTCTGCCGT 181 TTTGGTGACC ACCATTACCT CGACCACCAC GTCCTCCAGA ATAACGGCCT CTACCATGAC 241 CACCTCTACC TCTAGCTATT GGGGGTCTAT AACTTGGTCC AGGACAATTT ATCCTAATAT 301 GTCCAATCTC CCCACATCCA TAACATTCTC TGGAGTCACT CATAGGCCCC TTGGAGAAGT 361 GTTGACCGGT CTGAGGTGGA CCCCCAACTA CAGTCTGTAG TGAAGACTGA ATGGGTCGAG 421 CCGAGTAACC TCCGGAACCT TGTCCTCTAA AGTAAGAACC CTTAAACTCA CCTCCCTTTC 481 GAAACCTCTT TGATGTTGAT GTCGTGGTGA AGTCGTCTGG CTTCACTCCT TCC Predicted gene structure (within gDNA segment 7561 to 5360): Exon 1 6836 6303 ( 534 n); cDNA 1 533 ( 533 n); score: 0.923 MATCH C06HBa0153O03.1-1- SGN-E353805+ 0.923 534 1.002 C PGS_C06HBa0153O03.1-1-_SGN-E353805+ (6836 6303) Alignment (genomic DNA sequence = upper lines): GGATCAAACA ATACAGAAGC CATGCAATCA CAAACTAGAA GATTACCTGT GATGACAGCA 6777 ||||| |||| |||| ||||| |||||||||| |||||||||| |||||||||| |||||||||| GGATC-AACA ATACGGAAGC CATGCAATCA CAAACTAGAA GATTACCTGT GATGACAGCA 59 TCAGATGCCT CCGCTTCAGA CCGCCCAGGG AAAGCGTAAC AATGGGCCCT ATCATTTGTC 6717 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||| |||||| TCAGATGCCT CCGCTTCAGA CCGCCCAGGG AAAGCGTAAC AATGGGCCCT ATCGTTTGTC 119 TGTCCATTGC CCCTACCATG TTGTGATGTA GTGGCTCCGT TTTTCCCATC ACCTCGGCCG 6657 || ||||||| |||||||||| |||||||||| ||||| || ||| ||||| ||||| |||| TGCCCATTGC CCCTACCATG TTGTGATGTA GTGGCCCCAG TTTGCCCATT ACCTCTGCCG 179 TTTTGGTGAC CACCATTACC CCGACCACCA CGTCCTCAAG AATAACGGCC TCTACCACGA 6597 |||||||||| |||||||||| ||||||||| ||||||| || |||||||||| ||||||| || TTTTGGTGAC CACCATTACC TCGACCACCA CGTCCTCCAG AATAACGGCC TCTACCATGA 239 CCACCTCTAC CTCTAGCCAT TGGGGGTCTA TAACTCTGTT TTGGACAATT CCTCCTAATA 6537 |||||||||| ||||||| || |||||||||| ||||| || |||||||| |||||||| CCACCTCTAC CTCTAGCTAT TGGGGGTCTA TAACTTGGTC CAGGACAATT TATCCTAATA 299 TGTCCAGTCT CCCCACATCC ATAACACTCT CTGGAGTCAA GCATAGGTCT CTCAGAGTAG 6477 |||||| ||| |||||||||| |||||| ||| ||||||||| |||||| | || ||| || TGTCCAATCT CCCCACATCC ATAACATTCT CTGGAGTCAC TCATAGGCCC CTTGGAGAAG 359 TGTTGACCGG TCTGAGGTGG ACCCCCAACT ACAGTCTGTA GTGAAGACTG AATGGGTCGA 6417 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGTTGACCGG TCTGAGGTGG ACCCCCAACT ACAGTCTGTA GTGAAGACTG AATGGGTCGA 419 ACTGAGTAAC CTACGGAACC CTGTCCTCTA GAGTAAGAAC CATTAAACTC ACCTCCCTTT 6357 | ||||||| || ||||||| ||||||||| ||||||||| | |||||||| |||||||||| GCCGAGTAAC CTCCGGAACC TTGTCCTCTA AAGTAAGAAC CCTTAAACTC ACCTCCCTTT 479 CGAAACCTCT TTGATGTCGA TGACATGGTG AATTCATCTG GCTTCACTCC TTCC 6303 |||||||||| ||||||| || || | ||||| || || |||| |||||||||| |||| CGAAACCTCT TTGATGTTGA TGTCGTGGTG AAGTCGTCTG GCTTCACTCC TTCC 533 hqPGS_C06HBa0153O03.1-1-_SGN-E353805+ (6836 6303) ******************************************************************************** EST sequence 155 +strand 559 n (File: SGN-E244046+) 1 AAACACGAAT AGGCATATCA AGTAATTCAC AATGTAAAGT TAGACCATTA GCAAATGAGG 61 AAGATACATA AGAAAATGTG GATCCAGGAT CAAACAATAC AGAAGCCATG CAATCACAAA 121 CCAAAAGATT ACCTGTGATG ACAACATCAG ATTCCTCCGC TTCAGACCGC CCAGGGAAAG 181 CGTAACAATG GGCCCTATCA TTCTTATGTC TGTTGCCCCT ACCATGTTGT GATGTAGTGG 241 CTCCAGTTTG CCCATCACCT CGGCCGTTTT GGTGACCACC ATTTCCTCGA CCGCCACGTC 301 CTCCAGAATA ACGGACTCTG CCATGACCAC CTCTACCTCT AACCTTTGGA GGTCTGTAAC 361 TCTGTTTTGG ACAATATCTC TTAATATGTC CAATCTCCCC ACACCCATAG CACTCTCTGG 421 GTTCATGCAT AAGTCTCTCA GAGAAGTGTT GACCGGTCGA AGGTGGACCC CCAACTACAG 481 TCTGTAGTGA AGACTGAATT GGTCGGACTG AGTAACTTCC CGAACCCTGT CCTCTAGTGT 541 AAGCACCATT AAACTCACC Predicted gene structure (within gDNA segment 7522 to 5214): Exon 1 6922 6364 ( 559 n); cDNA 1 559 ( 559 n); score: 0.918 MATCH C06HBa0153O03.1-1- SGN-E244046+ 0.918 559 1.000 C PGS_C06HBa0153O03.1-1-_SGN-E244046+ (6922 6364) Alignment (genomic DNA sequence = upper lines): AAACACGAAT AGGCATATCA AGTAATTCAC AATATAAATT TAGACCGTTA GCAAATGAGG 6863 |||||||||| |||||||||| |||||||||| ||| |||| | |||||| ||| |||||||||| AAACACGAAT AGGCATATCA AGTAATTCAC AATGTAAAGT TAGACCATTA GCAAATGAGG 60 AAGATACATA AGAAAATGTG GATCCAGGAT CAAACAATAC AGAAGCCATG CAATCACAAA 6803 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAGATACATA AGAAAATGTG GATCCAGGAT CAAACAATAC AGAAGCCATG CAATCACAAA 120 CTAGAAGATT ACCTGTGATG ACAGCATCAG ATGCCTCCGC TTCAGACCGC CCAGGGAAAG 6743 | | |||||| |||||||||| ||| |||||| || ||||||| |||||||||| |||||||||| CCAAAAGATT ACCTGTGATG ACAACATCAG ATTCCTCCGC TTCAGACCGC CCAGGGAAAG 180 CGTAACAATG GGCCCTATCA TTTGTCTGTC CATTGCCCCT ACCATGTTGT GATGTAGTGG 6683 |||||||||| |||||||||| || | |||| |||||||| |||||||||| |||||||||| CGTAACAATG GGCCCTATCA TTCTTATGTC TGTTGCCCCT ACCATGTTGT GATGTAGTGG 240 CTCCGTTTTT CCCATCACCT CGGCCGTTTT GGTGACCACC ATTACCCCGA CCACCACGTC 6623 |||| ||| |||||||||| |||||||||| |||||||||| ||| || ||| || ||||||| CTCCAGTTTG CCCATCACCT CGGCCGTTTT GGTGACCACC ATTTCCTCGA CCGCCACGTC 300 CTCAAGAATA ACGGCCTCTA CCACGACCAC CTCTACCTCT AGCCATTGGG GGTCTATAAC 6563 ||| |||||| |||| |||| ||| |||||| |||||||||| | || |||| ||||| |||| CTCCAGAATA ACGGACTCTG CCATGACCAC CTCTACCTCT AACCTTTGGA GGTCTGTAAC 360 TCTGTTTTGG ACAATTCCTC CTAATATGTC CAGTCTCCCC ACATCCATAA CACTCTCTGG 6503 |||||||||| ||||| ||| ||||||||| || ||||||| ||| ||||| |||||||||| TCTGTTTTGG ACAATATCTC TTAATATGTC CAATCTCCCC ACACCCATAG CACTCTCTGG 420 AGTCAAGCAT AGGTCTCTCA GAGTAGTGTT GACCGGTCTG AGGTGGACCC CCAACTACAG 6443 ||| |||| | |||||||| ||| |||||| |||||||| |||||||||| |||||||||| GTTCATGCAT AAGTCTCTCA GAGAAGTGTT GACCGGTCGA AGGTGGACCC CCAACTACAG 480 TCTGTAGTGA AGACTGAATG GGTCGAACTG AGTAACCTAC GGAACCCTGT CCTCTAGAGT 6383 |||||||||| ||||||||| ||||| |||| |||||| | | ||||||||| ||||||| || TCTGTAGTGA AGACTGAATT GGTCGGACTG AGTAACTTCC CGAACCCTGT CCTCTAGTGT 540 AAGAACCATT AAACTCACC 6364 ||| |||||| ||||||||| AAGCACCATT AAACTCACC 559 hqPGS_C06HBa0153O03.1-1-_SGN-E244046+ (6922 6364) ******************************************************************************** EST sequence 185 +strand 543 n (File: SGN-E355026+) 1 GAACACATCC AAAAATTCTC AAACTACCAA AACTAATTCA ATTGAGGGTA CTTGGATAGT 61 GTCATCCTTG AGATGTGCCA AGAAGGATAG ACAACCCTTA CTAACCATCT TCTGAGCACA 121 AAGAAAGGAG ATAATACGCA CCGGATTGGA AGTGTAGTCA CCCTCCCACA CTAACGGATC 181 TGTCCCAGGC TTGGCTAACG TCACCGTTTT AGCATTACAA TCCAAGATCG CAAAATTGGG 241 AGAAAGCCAA GTCATACCCA GAATTACATC AAAGTCACCC ATTTCTAAAG TAACCAAATC 301 TAGATAAGTG TTGCTCCCCA CGAAATTCAC CAAACAAGAC CTATGTACCT TTTCAACTAT 361 CACAGACTCA CCCACCGGAG TAGAAACACG AATAGGCATG TCAAGTAATT CACAATGCAA 421 TTTTAGACCA TTAGCAAATG ATGAAGATAC ATATGAAAAC GTGGAGCCAG GGTCAAATAA 481 TACAGATGCC ATGCAATCAC AAACCAAGAG ATTACCTGTG ATAACAGCAT CAGACGCCTC 541 TGC Predicted gene structure (within gDNA segment 9102 to 5010): Exon 1 7305 6763 ( 543 n); cDNA 1 543 ( 543 n); score: 0.899 MATCH C06HBa0153O03.1-1- SGN-E355026+ 0.899 543 1.000 C PGS_C06HBa0153O03.1-1-_SGN-E355026+ (7305 6763) Alignment (genomic DNA sequence = upper lines): GAACACATCC AGAAACTCAC GGACCACCGA AACCGACTCA ATCGAAGGTA CTTGGGTGGT 7246 |||||||||| | ||| || | || ||| | ||| | ||| || || |||| ||||| | || GAACACATCC AAAAATTCTC AAACTACCAA AACTAATTCA ATTGAGGGTA CTTGGATAGT 60 GTCATCCTTG AGATGTGCCA AGAAAGCTAA ACACCCTTTA CTAACCATTT TCTTAGCACG 7186 |||||||||| |||||||||| |||| | || ||| || ||| |||||||| | ||| ||||| GTCATCCTTG AGATGTGCCA AGAAGGATAG ACAACCCTTA CTAACCATCT TCTGAGCACA 120 AAGAAAGGAG ATGATACGCA CCGGATTGGA AGTGTAGTCA CCCTCCCACA CTAACGGATC 7126 |||||||||| || ||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAGAAAGGAG ATAATACGCA CCGGATTGGA AGTGTAGTCA CCCTCCCACA CTAACGGATC 180 TGTCTCAGGC TTGGCTAACG TCAGAGTTTT AGCATTACAA TCCAAGATCG CAAAATTCGG 7066 |||| ||||| |||||||||| ||| ||||| |||||||||| |||||||||| ||||||| || TGTCCCAGGC TTGGCTAACG TCACCGTTTT AGCATTACAA TCCAAGATCG CAAAATTGGG 240 AGAAAGCCAA GTCATACCCA GAATTACATC AAAATCATCC ATTTCTAAGA TAACCAAATC 7006 |||||||||| |||||||||| |||||||||| ||| ||| || |||||||| |||||||||| AGAAAGCCAA GTCATACCCA GAATTACATC AAAGTCACCC ATTTCTAAAG TAACCAAATC 300 TACATAAGTG TTGCTCCCCA AAAAGTTCAC CAAACAAGAC CTATATACCT TTTCAACTAC 6946 || ||||||| |||||||||| || ||||| |||||||||| |||| ||||| ||||||||| TAGATAAGTG TTGCTCCCCA CGAAATTCAC CAAACAAGAC CTATGTACCT TTTCAACTAT 360 CACAGACTCA CCCACCGGAG TAGAAACACG AATAGGCATA TCAAGTAATT CACAATATAA 6886 |||||||||| |||||||||| |||||||||| ||||||||| |||||||||| |||||| || CACAGACTCA CCCACCGGAG TAGAAACACG AATAGGCATG TCAAGTAATT CACAATGCAA 420 ATTTAGACCG TTAGCAAATG AGGAAGATAC ATAAGAAAAT GTGGATCCAG GATCAAACAA 6826 |||||||| |||||||||| | |||||||| ||| ||||| ||||| |||| | ||||| || TTTTAGACCA TTAGCAAATG ATGAAGATAC ATATGAAAAC GTGGAGCCAG GGTCAAATAA 480 TACAGAAGCC ATGCAATCAC AAACTAGAAG ATTACCTGTG ATGACAGCAT CAGATGCCTC 6766 |||||| ||| |||||||||| |||| | || |||||||||| || ||||||| |||| ||||| TACAGATGCC ATGCAATCAC AAACCAAGAG ATTACCTGTG ATAACAGCAT CAGACGCCTC 540 CGC 6763 || TGC 543 hqPGS_C06HBa0153O03.1-1-_SGN-E355026+ (7305 6763) ******************************************************************************** EST sequence 182 +strand 761 n (File: SGN-E355244+) 1 CGGAAACTCC CATCCTTCTT CTTCACAAAC AAAATCGGAG CACCCCAAGG AGATGCACTT 61 GGTCTAATGA AACCTTTGCT CAACAACTCT TGAAGTTGGG CCTTTAACTC TCTTAACTCT 121 ACGGGAGCCA TTCTATAAGG GGGTATAGAA ATGGGGCGTG TGCCCGGTTC AAGATCGATA 181 CAGAAGTCAA TATCCCTATC TGGTGGCATA CCAGGAAGAT CTGCAGGGAA CACATCCAAA 241 AACTCACAGA CTATAAAAAC CGACTCAATC GAAGGTACTT GGGTAGTGTC ATCCTTGAGA 301 TGTGCCAAGA AAGCTAAACA CCCTTTACTA ACCATTTTCT TAGCACGAAG AAAGGAGATG 361 ATGCACACCG GATTGGAAGT GTAGTCACCC TCCCACACTA ACGGATCTGT CCCAGGCTTG 421 GCTAATGTCA CCGTTTTAGC ATTACAATCC AAGATCGCAA ATTGCGGAGA AAGCCAAGTC 481 ATACCCAGAA TTACATCAAA ATCAACCATT TCTAAGATAA CCAAATCTAC ATAAGTGTTG 541 CTCCCCACAA AGTTCACCAA ACAAGACCTA TATACCTTTT CAACTACCAC AGACTCACCC 601 ACCGGAGTAG AAACACGAAT AGGCATATCA AGTAATTCAC AATGTAAATT TAGACCGTTA 661 GCAAATGAGG AAGATACANT AGAAAACGTG GATCCCAGGA TCAACAATAC AGAAGCCATG 721 CAATCACAAT CGAAAGATTA CCTGTGATGA CAGCATCTAA T Predicted gene structure (within gDNA segment 8159 to 5415): Exon 1 7532 6771 ( 762 n); cDNA 1 761 ( 761 n); score: 0.943 MATCH C06HBa0153O03.1-1- SGN-E355244+ 0.943 762 1.001 C PGS_C06HBa0153O03.1-1-_SGN-E355244+ (7532 6771) Alignment (genomic DNA sequence = upper lines): CGAAAACTCC CATCCTTCTT CTTTACAAAC AAAACAGGAG CACCCCAAGG AGATGCACTT 7473 || ||||||| |||||||||| ||| |||||| |||| |||| |||||||||| |||||||||| CGGAAACTCC CATCCTTCTT CTTCACAAAC AAAATCGGAG CACCCCAAGG AGATGCACTT 60 GGTCTAATGA AACCCTTGCT CAACAACTCT TGAAGTTGGG CTTTTAACTC TCTTAACTCC 7413 |||||||||| |||| ||||| |||||||||| |||||||||| | |||||||| ||||||||| GGTCTAATGA AACCTTTGCT CAACAACTCT TGAAGTTGGG CCTTTAACTC TCTTAACTCT 120 GCGGGAGCAA TTCTATAAGG GGGTATAGAA ATGGGGCGAG TACCCGGTTC AAGATCAATA 7353 ||||||| | |||||||||| |||||||||| |||||||| | | |||||||| |||||| ||| ACGGGAGCCA TTCTATAAGG GGGTATAGAA ATGGGGCGTG TGCCCGGTTC AAGATCGATA 180 CAGAAGTCAA TATCCCTATT TGGTGTCATA CCAGGAAGAT CTGCAGGGAA CACATCCAGA 7293 |||||||||| ||||||||| ||||| |||| |||||||||| |||||||||| |||||||| | CAGAAGTCAA TATCCCTATC TGGTGGCATA CCAGGAAGAT CTGCAGGGAA CACATCCAAA 240 AACTCACGGA CCACCGAAAC CGACTCAATC GAAGGTACTT GGGTGGTGTC ATCCTTGAGA 7233 ||||||| || | | |||| |||||||||| |||||||||| |||| ||||| |||||||||| AACTCACAGA CTATAAAAAC CGACTCAATC GAAGGTACTT GGGTAGTGTC ATCCTTGAGA 300 TGTGCCAAGA AAGCTAAACA CCCTTTACTA ACCATTTTCT TAGCACGAAG AAAGGAGATG 7173 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGTGCCAAGA AAGCTAAACA CCCTTTACTA ACCATTTTCT TAGCACGAAG AAAGGAGATG 360 ATACGCACCG GATTGGAAGT GTAGTCACCC TCCCACACTA ACGGATCTGT CTCAGGCTTG 7113 || | ||||| |||||||||| |||||||||| |||||||||| |||||||||| | |||||||| ATGCACACCG GATTGGAAGT GTAGTCACCC TCCCACACTA ACGGATCTGT CCCAGGCTTG 420 GCTAACGTCA GAGTTTTAGC ATTACAATCC AAGATCGCAA AATTCGGAGA AAGCCAAGTC 7053 ||||| |||| |||||||| |||||||||| |||||||||| | | |||||| |||||||||| GCTAATGTCA CCGTTTTAGC ATTACAATCC AAGATCGCAA ATTGCGGAGA AAGCCAAGTC 480 ATACCCAGAA TTACATCAAA ATCATCCATT TCTAAGATAA CCAAATCTAC ATAAGTGTTG 6993 |||||||||| |||||||||| |||| ||||| |||||||||| |||||||||| |||||||||| ATACCCAGAA TTACATCAAA ATCAACCATT TCTAAGATAA CCAAATCTAC ATAAGTGTTG 540 CTCCCCAAAA AGTTCACCAA ACAAGACCTA TATACCTTTT CAACTACCAC AGACTCACCC 6933 ||||||| || |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTCCCCACAA AGTTCACCAA ACAAGACCTA TATACCTTTT CAACTACCAC AGACTCACCC 600 ACCGGAGTAG AAACACGAAT AGGCATATCA AGTAATTCAC AATATAAATT TAGACCGTTA 6873 |||||||||| |||||||||| |||||||||| |||||||||| ||| |||||| |||||||||| ACCGGAGTAG AAACACGAAT AGGCATATCA AGTAATTCAC AATGTAAATT TAGACCGTTA 660 GCAAATGAGG AAGATACATA AGAAAATGTG GAT-CCAGGA TCAAACAATA CAGAAGCCAT 6814 |||||||||| |||||||| |||||| ||| ||| |||||| || ||||||| |||||||||| GCAAATGAGG AAGATACANT AGAAAACGTG GATCCCAGGA TC-AACAATA CAGAAGCCAT 719 GCAATCACAA ACTAGAAGAT TACCTGTGAT GACAGCATCA GAT 6771 |||||||||| | | ||||| |||||||||| ||||||||| || GCAATCACAA TCGA-AAGAT TACCTGTGAT GACAGCATCT AAT 761 hqPGS_C06HBa0153O03.1-1-_SGN-E355244+ (7532 6771) ******************************************************************************** EST sequence 144 +strand 331 n (File: SGN-E352716+) 1 GATCTGTCCC AGGCTTGGCT AACGTCACAG TTTTAGCATT ACAATCCAAG ATTGCAAAAT 61 TTGGAGAAAG CCAAGTCATA CCCAGAATTA CATCAAAATC AACCATTTCT AAGATAACCA 121 AGTCTACATA AGTATTGCTC CCCACAAAAG TCACCAGACC AGACCTATAT ACTTTTTCAA 181 CTATCACAGA CTCACCCACC GGAGTAGAAA CACGAATAGG CATGTCAAGC AATTCATAAT 241 TTAATTTAAG ACCAGTAGCA AATGAGGAAG ATACATATGA AAATGTGGAT CCAGGGTCAA 301 ATAATACAAA AGCCATGCAA TCACAAAAAA A Predicted gene structure (within gDNA segment 7810 to 6153): Exon 1 7129 6803 ( 327 n); cDNA 1 327 ( 327 n); score: 0.920 MATCH C06HBa0153O03.1-1- SGN-E352716+ 0.920 327 0.988 C PGS_C06HBa0153O03.1-1-_SGN-E352716+ (7129 6803) Alignment (genomic DNA sequence = upper lines): GATCTGTCTC AGGCTTGGCT AACGTCAGAG TTTTAGCATT ACAATCCAAG ATCGCAAAAT 7070 |||||||| | |||||||||| ||||||| || |||||||||| |||||||||| || ||||||| GATCTGTCCC AGGCTTGGCT AACGTCACAG TTTTAGCATT ACAATCCAAG ATTGCAAAAT 60 TCGGAGAAAG CCAAGTCATA CCCAGAATTA CATCAAAATC ATCCATTTCT AAGATAACCA 7010 | |||||||| |||||||||| |||||||||| |||||||||| | |||||||| |||||||||| TTGGAGAAAG CCAAGTCATA CCCAGAATTA CATCAAAATC AACCATTTCT AAGATAACCA 120 AATCTACATA AGTGTTGCTC CCCAAAAAGT TCACCAAACA AGACCTATAT ACCTTTTCAA 6950 | |||||||| ||| |||||| |||| ||| |||||| || |||||||||| || ||||||| AGTCTACATA AGTATTGCTC CCCACAAAAG TCACCAGACC AGACCTATAT ACTTTTTCAA 180 CTACCACAGA CTCACCCACC GGAGTAGAAA CACGAATAGG CATATCAAGT AATTCACAAT 6890 ||| |||||| |||||||||| |||||||||| |||||||||| ||| ||||| |||||| ||| CTATCACAGA CTCACCCACC GGAGTAGAAA CACGAATAGG CATGTCAAGC AATTCATAAT 240 ATAAATTTAG ACCGTTAGCA AATGAGGAAG ATACATAAGA AAATGTGGAT CCAGGATCAA 6830 ||| || || ||| ||||| |||||||||| ||||||| || |||||||||| ||||| |||| TTAATTTAAG ACCAGTAGCA AATGAGGAAG ATACATATGA AAATGTGGAT CCAGGGTCAA 300 ACAATACAGA AGCCATGCAA TCACAAA 6803 | |||||| | |||||||||| ||||||| ATAATACAAA AGCCATGCAA TCACAAA 327 hqPGS_C06HBa0153O03.1-1-_SGN-E352716+ (7129 6803) ******************************************************************************** EST sequence 51 -strand 659 n (File: SGN-E352117-) 1 CCCAAGGGGA TGCACTTGGC NTAATGAAGC CCTTACNTAA CAACTCTTGA AGTTGGGCCT 61 TTAACTCCCC CAACTCAGCG GGAGCCATTC TATAAGGGGG TATGGAAATG GGGTGATCAC 121 CCGGCTCCAG ATCAATGCAA AAGTCAATAT CCCTATCCGG TGGCATACCA GGAAGGTCTG 181 CAGGAAACAC ATCCAGAAAC TCACGGACTA TCGAAACTGA CTCAATTGAA GGTACTTGGG 241 TAGTATCATC CCTGAGGTGT GCCAAGAAAG CTAAACACCC TTTACTAACC ATTCTCTTAG 301 CACAAAGAAA GGAGATGATA CGAACTGGAG TGGAAGTGTA GTCACCCTCC CACACTAACG 361 CATCTATCTC AGGCTTGGCC AACGTTACAG TTTTAGCATT ACAATCCAAG ATTGCAAAAT 421 TTGGAGACAG CCAAGTCATA CCCAGAATTA CATCGAAGTC AACCATTTAT AGGATAACCA 481 AGTCTACATG AGTATTGCTC CCACAAATGT CACAAGACAA GACCTATACA CCTTTTCAAC 541 AATCACAGAC TCACCCACCG GAGTAGAACA CGAATAGGTA TGTCAAGCGA TTCACTATGT 601 AAATTAAGAA CAGTAGCAAA TGAGGAAGAT ACATATGAAA ATGTGGAGGC AGGATCAAA Predicted gene structure (within gDNA segment 10412 to 4386): Exon 1 7489 6829 ( 661 n); cDNA 1 659 ( 659 n); score: 0.882 MATCH C06HBa0153O03.1-1- SGN-E352117- 0.882 661 1.003 C PGS_C06HBa0153O03.1-1-_SGN-E352117- (7489 6829) Alignment (genomic DNA sequence = upper lines): CCCAAGGAGA TGCACTTGGT CTAATGAAAC CCTTGCTCAA CAACTCTTGA AGTTGGGCTT 7430 ||||||| || ||||||||| ||||||| | |||| | || |||||||||| |||||||| | CCCAAGGGGA TGCACTTGGC NTAATGAAGC CCTTACNTAA CAACTCTTGA AGTTGGGCCT 60 TTAACTCTCT TAACTCCGCG GGAGCAATTC TATAAGGGGG TATAGAAATG GGGCGAGTAC 7370 ||||||| | ||||| ||| ||||| |||| |||||||||| ||| |||||| ||| || || TTAACTCCCC CAACTCAGCG GGAGCCATTC TATAAGGGGG TATGGAAATG GGGTGATCAC 120 CCGGTTCAAG ATCAATACAG AAGTCAATAT CCCTATTTGG TGTCATACCA GGAAGATCTG 7310 |||| || || |||||| || |||||||||| |||||| || || ||||||| ||||| |||| CCGGCTCCAG ATCAATGCAA AAGTCAATAT CCCTATCCGG TGGCATACCA GGAAGGTCTG 180 CAGGGAACAC ATCCAGAAAC TCACGGACCA CCGAAACCGA CTCAATCGAA GGTACTTGGG 7250 |||| ||||| |||||||||| |||||||| | |||||| || |||||| ||| |||||||||| CAGGAAACAC ATCCAGAAAC TCACGGACTA TCGAAACTGA CTCAATTGAA GGTACTTGGG 240 TGGTGTCATC CTTGAGATGT GCCAAGAAAG CTAAACACCC TTTACTAACC ATTTTCTTAG 7190 | || ||||| | |||| ||| |||||||||| |||||||||| |||||||||| ||| |||||| TAGTATCATC CCTGAGGTGT GCCAAGAAAG CTAAACACCC TTTACTAACC ATTCTCTTAG 300 CACGAAGAAA GGAGATGATA CGCACCGGAT TGGAAGTGTA GTCACCCTCC CACACTAACG 7130 ||| |||||| |||||||||| || || ||| |||||||||| |||||||||| |||||||||| CACAAAGAAA GGAGATGATA CGAACTGGAG TGGAAGTGTA GTCACCCTCC CACACTAACG 360 GATCTGTCTC AGGCTTGGCT AACGTCAGAG TTTTAGCATT ACAATCCAAG ATCGCAAAAT 7070 |||| |||| ||||||||| ||||| | || |||||||||| |||||||||| || ||||||| CATCTATCTC AGGCTTGGCC AACGTTACAG TTTTAGCATT ACAATCCAAG ATTGCAAAAT 420 TCGGAGAAAG CCAAGTCATA CCCAGAATTA CATCAAAATC ATCCATTTCT AAGATAACCA 7010 | ||||| || |||||||||| |||||||||| |||| || || | |||||| | | |||||||| TTGGAGACAG CCAAGTCATA CCCAGAATTA CATCGAAGTC AACCATTTAT AGGATAACCA 480 AATCTACATA AGTGTTGCTC CCCAAAAAGT TCACCAAACA AGACCTATAT ACCTTTTCAA 6950 | ||||||| ||| ||||| |||| ||| |||| | ||| ||||||||| |||||||||| AGTCTACATG AGTATTGCT- CCCACAAATG TCACAAGACA AGACCTATAC ACCTTTTCAA 539 CTACCACAGA CTCACCCACC GGAGTAGAAA CACGAATAGG CATATCAAGT AATTCACAAT 6890 | | |||||| |||||||||| ||||||| || |||||||||| || ||||| |||||| || CAATCACAGA CTCACCCACC GGAGTAG-AA CACGAATAGG TATGTCAAGC GATTCACTAT 598 ATAAATTTAG ACCGTTAGCA AATGAGGAAG ATACATAAGA AAATGTGGAT CCAGGATCAA 6830 |||||| || | | ||||| |||||||||| ||||||| || ||||||||| ||||||||| GTAAATTAAG AACAGTAGCA AATGAGGAAG ATACATATGA AAATGTGGAG GCAGGATCAA 658 A 6829 | A 659 hqPGS_C06HBa0153O03.1-1-_SGN-E352117- (7489 6829) ******************************************************************************** EST sequence 153 +strand 661 n (File: SGN-E351414+) 1 CGGAAACTCC CATCCTTCTT CTTCACAAAC AAAATCGGAG CACCCCAAGG AGATGCACTT 61 GGTCTAATGA AACCTTTGCT CAACAACTCT TGAAGTTGGG CCTTTAACTC TCTTAACTCT 121 ACGGGAGCCA TTCTATAAGG GGGTATAGAA ATGGGGCGTG TGCCCGGTTC AAGATCGATA 181 CAGAAGTCAA TATCCCTATC TGGTGGCATA CCAGGAAGAT CTGCAGGGAA CACATCCAAA 241 AACTCACAGA CTATAAAAAC CGACTCAATC GAAGGTACTT GGGTAGTGTC ATCCTTGAGA 301 TGTGCCAAGA AAGCTAAACA CCCTTTACTA ACCATTTTCT TAGCACGAAG AAAGGAGATG 361 ATGCACACCG GATTGGAAGT GTAGTCACCC TCCCACACTA ACGGATCTGT CCCAGGCTTG 421 GCTAATGTCA CCGTTTTAGC ATTACAATCC AAGATCGCAA ATTGCGGAGA AAGCCAAGTC 481 ATACCCAGAA TTACATCAAA ATCAACCATT TCTAAGATAA CCAAATCTAC ATAAGTGTTG 541 CTCCCCACAA AGTTCACCAA ACAAGACCTA TATACCTTTT CAACTACCAC AGACTCACCC 601 ACCGGAGTAG AAACACGAAT AGGCATATCA AGTAATTCAC AATGTAAATT TAGACCGTTA 661 G Predicted gene structure (within gDNA segment 8159 to 6262): Exon 1 7532 6872 ( 661 n); cDNA 1 661 ( 661 n); score: 0.952 MATCH C06HBa0153O03.1-1- SGN-E351414+ 0.952 661 1.000 C PGS_C06HBa0153O03.1-1-_SGN-E351414+ (7532 6872) Alignment (genomic DNA sequence = upper lines): CGAAAACTCC CATCCTTCTT CTTTACAAAC AAAACAGGAG CACCCCAAGG AGATGCACTT 7473 || ||||||| |||||||||| ||| |||||| |||| |||| |||||||||| |||||||||| CGGAAACTCC CATCCTTCTT CTTCACAAAC AAAATCGGAG CACCCCAAGG AGATGCACTT 60 GGTCTAATGA AACCCTTGCT CAACAACTCT TGAAGTTGGG CTTTTAACTC TCTTAACTCC 7413 |||||||||| |||| ||||| |||||||||| |||||||||| | |||||||| ||||||||| GGTCTAATGA AACCTTTGCT CAACAACTCT TGAAGTTGGG CCTTTAACTC TCTTAACTCT 120 GCGGGAGCAA TTCTATAAGG GGGTATAGAA ATGGGGCGAG TACCCGGTTC AAGATCAATA 7353 ||||||| | |||||||||| |||||||||| |||||||| | | |||||||| |||||| ||| ACGGGAGCCA TTCTATAAGG GGGTATAGAA ATGGGGCGTG TGCCCGGTTC AAGATCGATA 180 CAGAAGTCAA TATCCCTATT TGGTGTCATA CCAGGAAGAT CTGCAGGGAA CACATCCAGA 7293 |||||||||| ||||||||| ||||| |||| |||||||||| |||||||||| |||||||| | CAGAAGTCAA TATCCCTATC TGGTGGCATA CCAGGAAGAT CTGCAGGGAA CACATCCAAA 240 AACTCACGGA CCACCGAAAC CGACTCAATC GAAGGTACTT GGGTGGTGTC ATCCTTGAGA 7233 ||||||| || | | |||| |||||||||| |||||||||| |||| ||||| |||||||||| AACTCACAGA CTATAAAAAC CGACTCAATC GAAGGTACTT GGGTAGTGTC ATCCTTGAGA 300 TGTGCCAAGA AAGCTAAACA CCCTTTACTA ACCATTTTCT TAGCACGAAG AAAGGAGATG 7173 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGTGCCAAGA AAGCTAAACA CCCTTTACTA ACCATTTTCT TAGCACGAAG AAAGGAGATG 360 ATACGCACCG GATTGGAAGT GTAGTCACCC TCCCACACTA ACGGATCTGT CTCAGGCTTG 7113 || | ||||| |||||||||| |||||||||| |||||||||| |||||||||| | |||||||| ATGCACACCG GATTGGAAGT GTAGTCACCC TCCCACACTA ACGGATCTGT CCCAGGCTTG 420 GCTAACGTCA GAGTTTTAGC ATTACAATCC AAGATCGCAA AATTCGGAGA AAGCCAAGTC 7053 ||||| |||| |||||||| |||||||||| |||||||||| | | |||||| |||||||||| GCTAATGTCA CCGTTTTAGC ATTACAATCC AAGATCGCAA ATTGCGGAGA AAGCCAAGTC 480 ATACCCAGAA TTACATCAAA ATCATCCATT TCTAAGATAA CCAAATCTAC ATAAGTGTTG 6993 |||||||||| |||||||||| |||| ||||| |||||||||| |||||||||| |||||||||| ATACCCAGAA TTACATCAAA ATCAACCATT TCTAAGATAA CCAAATCTAC ATAAGTGTTG 540 CTCCCCAAAA AGTTCACCAA ACAAGACCTA TATACCTTTT CAACTACCAC AGACTCACCC 6933 ||||||| || |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTCCCCACAA AGTTCACCAA ACAAGACCTA TATACCTTTT CAACTACCAC AGACTCACCC 600 ACCGGAGTAG AAACACGAAT AGGCATATCA AGTAATTCAC AATATAAATT TAGACCGTTA 6873 |||||||||| |||||||||| |||||||||| |||||||||| ||| |||||| |||||||||| ACCGGAGTAG AAACACGAAT AGGCATATCA AGTAATTCAC AATGTAAATT TAGACCGTTA 660 G 6872 | G 661 hqPGS_C06HBa0153O03.1-1-_SGN-E351414+ (7532 6872) ******************************************************************************** EST sequence 119 +strand 560 n (File: SGN-E242765+) 1 TGGTTCTAGC ATTAGGACAT CATAGCTCAT TTGATTATTT CTCATCTCAT AATTAGTATT 61 TAGTATTCCC TCAATTTAAT AATTTCATTA AAGTGTTCAT AGAGACTTAT CTCTTCATTA 121 GCTTTACACT ATAAAAGGTG AGTAAGTGTT GGTAATATTT ACTTAGGCTT ATTTGCTATT 181 GAAACCGACT CAATCGAAGG TACTTGGGTA GTGTCATCAT TGAGATGTGC TAAGAAAGAT 241 AAACACCTTT TACTAATCAT TTTCTTAGCA AGAAGAAAGG AGACGATGCG GACCGGATTG 301 GAAGTGTAGT CACCCTCTCA CACTAACGGG TCTATCCCAG GCTTGGCTAA CGTCACCGTT 361 TTAGCATTAC AATCCAAGAT TGCAAATTGC GGAGAAAGCA AAGTCATACC CAGAATCACA 421 TCAAAATCAT CCATTGCCAA GATAACCAAA TCTACATAAG TGTTGCTCCC CACAAAGTTT 481 ACGACACAAG ACCTATATAC TTTTTCAACT ACCTCAGACT CACCCACCGG AGTAGAAACA 541 CGAATAGGCA TATCAAGAAA Predicted gene structure (within gDNA segment 11090 to 6261): Exon 1 7318 6898 ( 421 n); cDNA 140 560 ( 421 n); score: 0.874 MATCH C06HBa0153O03.1-1- SGN-E242765+ 0.874 421 0.752 C PGS_C06HBa0153O03.1-1-_SGN-E242765+ (7318 6898) Alignment (genomic DNA sequence = upper lines): GAAGATCTGC AGGGAACACA TCCAGAAACT CACGGACCAC CGAAACCGAC TCAATCGAAG 7259 || | || || || | | | | || | | | ||||||||| |||||||||| GAGTAAGTGT TGGTAATATT TACTTAGGCT TATTTGCTAT TGAAACCGAC TCAATCGAAG 199 GTACTTGGGT GGTGTCATCC TTGAGATGTG CCAAGAAAGC TAAACACCCT TTACTAACCA 7199 |||||||||| |||||||| |||||||||| | ||||||| |||||||| | ||||||| || GTACTTGGGT AGTGTCATCA TTGAGATGTG CTAAGAAAGA TAAACACCTT TTACTAATCA 259 TTTTCTTAGC ACGAAGAAAG GAGATGATAC GCACCGGATT GGAAGTGTAG TCACCCTCCC 7139 |||||||||| | |||||||| |||| ||| | | |||||||| |||||||||| |||||||| | TTTTCTTAGC AAGAAGAAAG GAGACGATGC GGACCGGATT GGAAGTGTAG TCACCCTCTC 319 ACACTAACGG ATCTGTCTCA GGCTTGGCTA ACGTCAGAGT TTTAGCATTA CAATCCAAGA 7079 |||||||||| ||| || || |||||||||| |||||| || |||||||||| |||||||||| ACACTAACGG GTCTATCCCA GGCTTGGCTA ACGTCACCGT TTTAGCATTA CAATCCAAGA 379 TCGCAAAATT CGGAGAAAGC CAAGTCATAC CCAGAATTAC ATCAAAATCA TCCATTTCTA 7019 | ||||| | |||||||||| ||||||||| ||||||| || |||||||||| |||||| | | TTGCAAATTG CGGAGAAAGC AAAGTCATAC CCAGAATCAC ATCAAAATCA TCCATTGCCA 439 AGATAACCAA ATCTACATAA GTGTTGCTCC CCAAAAAGTT CACCAAACAA GACCTATATA 6959 |||||||||| |||||||||| |||||||||| ||| |||||| || | |||| |||||||||| AGATAACCAA ATCTACATAA GTGTTGCTCC CCACAAAGTT TACGACACAA GACCTATATA 499 CCTTTTCAAC TACCACAGAC TCACCCACCG GAGTAGAAAC ACGAATAGGC ATATCAAGTA 6899 | |||||||| |||| ||||| |||||||||| |||||||||| |||||||||| |||||||| | CTTTTTCAAC TACCTCAGAC TCACCCACCG GAGTAGAAAC ACGAATAGGC ATATCAAGAA 559 A 6898 | A 560 hqPGS_C06HBa0153O03.1-1-_SGN-E242765+ (7318 6898) ******************************************************************************** EST sequence 179 +strand 658 n (File: SGN-E355232+) 1 TCAATGCGAG GAAGAGGATA CTTGTTCTTA ATAGTTACCT TGTTCAGTTG TCTGTAGTCT 61 ATGCACATTC TAAAACTTCC ATCCTTCTTC TTCACAAATA AAACTGGAGC ACCCCAAGGA 121 GATGCACTTG GTCTAATGAA GACCTTGCTC AACAACTCTT GAAGTTGAGC CTTTAACTCT 181 CTTAATTCGG CGGGAGCCAT TCTATAAGGG GGTATAGAAA TGGGGCGTGT GCCTGGTTCA 241 AGATCGATAC AGAAGTCAAT ATCCCTATCT GGTGGCATAC CAGGAAGATC TGCAGGAAAC 301 ACGTCCAAAA ACTCGCGGAC TACCGAAACC GACTCAATTG AGGGTACTTG AGTAGTGTCA 361 TCCTTGAGAT GTGCCAAGAA AGCTGAACAC CCTTTACTAA CCATTTTCTT AGCACGAAGA 421 AAGGATATGA TACGCACCAG ATTGGAAGTG TAGTCACCCT CCCACACTAA CGGATCTGTC 481 CCAGGCTTGG CTAACGTCAC CGTTTTAGCA TTACAATCCA AGATCGCAAA ATTTGGAGAA 541 AGCCAAGTCA TACCCAGAAT TACATCAATG TCACCCATTT CTAAAAAACC AAATCTACAT 601 AAGTGTTGCT CCCCACGAAA TTCNACAAAA CAGAACTATG TACCTTTTCA ACTACCAC Predicted gene structure (within gDNA segment 9146 to 5524): Exon 1 7601 6943 ( 659 n); cDNA 1 658 ( 658 n); score: 0.917 MATCH C06HBa0153O03.1-1- SGN-E355232+ 0.917 659 1.002 C PGS_C06HBa0153O03.1-1-_SGN-E355232+ (7601 6943) Alignment (genomic DNA sequence = upper lines): TCAATGCGAG GAAGAGGATA CTTGTTCTTT ATGGTTACCT TGTTCAACTG CCGGTAGTCT 7542 |||||||||| |||||||||| ||||||||| || ||||||| |||||| || | ||||||| TCAATGCGAG GAAGAGGATA CTTGTTCTTA ATAGTTACCT TGTTCAGTTG TCTGTAGTCT 60 ATGCACATCC GAAAACTCCC ATCCTTCTTC TTTACAAACA AAACAGGAGC ACCCCAAGGA 7482 |||||||| | |||||| || |||||||||| || ||||| | |||| ||||| |||||||||| ATGCACATTC TAAAACTTCC ATCCTTCTTC TTCACAAATA AAACTGGAGC ACCCCAAGGA 120 GATGCACTTG GTCTAATGAA ACCCTTGCTC AACAACTCTT GAAGTTGGGC TTTTAACTCT 7422 |||||||||| |||||||||| |||||||| |||||||||| ||||||| || ||||||||| GATGCACTTG GTCTAATGAA GACCTTGCTC AACAACTCTT GAAGTTGAGC CTTTAACTCT 180 CTTAACTCCG CGGGAGCAAT TCTATAAGGG GGTATAGAAA TGGGGCGAGT ACCCGGTTCA 7362 ||||| || | ||||||| || |||||||||| |||||||||| ||||||| || || |||||| CTTAATTCGG CGGGAGCCAT TCTATAAGGG GGTATAGAAA TGGGGCGTGT GCCTGGTTCA 240 AGATCAATAC AGAAGTCAAT ATCCCTATTT GGTGTCATAC CAGGAAGATC TGCAGGGAAC 7302 ||||| |||| |||||||||| |||||||| | |||| ||||| |||||||||| |||||| ||| AGATCGATAC AGAAGTCAAT ATCCCTATCT GGTGGCATAC CAGGAAGATC TGCAGGAAAC 300 ACATCCAGAA ACTCACGGAC CACCGAAACC GACTCAATCG AAGGTACTTG GGTGGTGTCA 7242 || |||| || |||| ||||| ||||||||| |||||||| | | |||||||| || |||||| ACGTCCAAAA ACTCGCGGAC TACCGAAACC GACTCAATTG AGGGTACTTG AGTAGTGTCA 360 TCCTTGAGAT GTGCCAAGAA AGCTAAACAC CCTTTACTAA CCATTTTCTT AGCACGAAGA 7182 |||||||||| |||||||||| |||| ||||| |||||||||| |||||||||| |||||||||| TCCTTGAGAT GTGCCAAGAA AGCTGAACAC CCTTTACTAA CCATTTTCTT AGCACGAAGA 420 AAGGAGATGA TACGCACCGG ATTGGAAGTG TAGTCACCCT CCCACACTAA CGGATCTGTC 7122 ||||| |||| |||||||| | |||||||||| |||||||||| |||||||||| |||||||||| AAGGATATGA TACGCACCAG ATTGGAAGTG TAGTCACCCT CCCACACTAA CGGATCTGTC 480 TCAGGCTTGG CTAACGTCAG AGTTTTAGCA TTACAATCCA AGATCGCAAA ATTCGGAGAA 7062 ||||||||| ||||||||| ||||||||| |||||||||| |||||||||| ||| |||||| CCAGGCTTGG CTAACGTCAC CGTTTTAGCA TTACAATCCA AGATCGCAAA ATTTGGAGAA 540 AGCCAAGTCA TACCCAGAAT TACATCAAAA TCATCCATTT CTAAGATAAC CAAATCTACA 7002 |||||||||| |||||||||| |||||||| ||| |||||| |||| | ||| |||||||||| AGCCAAGTCA TACCCAGAAT TACATCAATG TCACCCATTT CTAA-AAAAC CAAATCTACA 599 TAAGTGTTGC TCCCCAAAAA GTTCACCAAA CAAGACCTAT ATACCTTTTC AACTACCAC 6943 |||||||||| |||||| || ||| |||| ||| |||| ||||||||| ||||||||| TAAGTGTTGC TCCCCACGAA ATTCNACAAA ACAGAACTAT GTACCTTTTC AACTACCAC 658 hqPGS_C06HBa0153O03.1-1-_SGN-E355232+ (7601 6943) ******************************************************************************** EST sequence 189 +strand 679 n (File: SGN-E368762+) 1 GTCTCACCCA ATTCTTCACT GTCTCAATCT TACAAGGATC CACCATCACT CCATCCTTAG 61 AAACCACGTG CCCCAAGAAG GACACTGCAT CTATCCAAAA CTCACACTTA GAGAACTTGG 121 CATAAAGCTT TTTCTCCCTC AACATTTCTA ATACAATTCT CAAATGCTCC TCATGTTCCT 181 TCTTACTATT AGAGTATATC AGTATATCAT CAATAAACAT GATGACAAAG AGATCCAAAT 241 ATGGCTTAAA ATCCCGTTCA TCAAGCTCAT GAACGCAGCA GGGGCATTCG TAAGACCAAA 301 AGACATCACT ACAAATTCGT AATGCCCATA CCTGGTTCTA AAAGCAGTCT TTGGCACATC 361 CGTTGCCCGT ATTTTCAATT GATGATAACC GGATCTCAAG TCAATCTTAG AGAAGACACA 421 AGCACCTTGT AACTGATCGA ACAAGTCATC AATGCGAGGA AGTGGATACT TGTTCTTAAT 481 AGTTACCTTG TTCAACTGCC GGTAGTCTAT GCACATCCGA AAACTCCCAT CCTTCTTCTT 541 CACAAACCAA ACCGGAGCAC CCCAAGGAGA TGCACTTGGT CTAATGAAGA CTTTGCTCAA 601 AAACTCTTGA AGTTGGGCCT TTAACTCTCT TAACTCCGCG GGAGCCATTC TATAAGGGGG 661 TATAGAAATG GGGCGAGTG Predicted gene structure (within gDNA segment 8695 to 6752): Exon 1 8050 7372 ( 679 n); cDNA 1 678 ( 678 n); score: 0.953 MATCH C06HBa0153O03.1-1- SGN-E368762+ 0.953 679 1.000 C PGS_C06HBa0153O03.1-1-_SGN-E368762+ (8050 7372) Alignment (genomic DNA sequence = upper lines): GTCTTACCCA ATTCTTCACT GTCTCAATCT TAGAAGGATC CACCATCACT CTATCCTTAG 7991 |||| ||||| |||||||||| |||||||||| || ||||||| |||||||||| | |||||||| GTCTCACCCA ATTCTTCACT GTCTCAATCT TACAAGGATC CACCATCACT CCATCCTTAG 60 AAACCACGTG CCCCAAGAAG GACACTGCAT CTAGCCAAAA CTCACACTTA GAGAACTTGG 7931 |||||||||| |||||||||| |||||||||| ||| |||||| |||||||||| |||||||||| AAACCACGTG CCCCAAGAAG GACACTGCAT CTATCCAAAA CTCACACTTA GAGAACTTGG 120 CATAAAGCTT TTTCTCCCTC AACATTTCCA ATACCATTCT CAAATGCTCC TCATTTTCCT 7871 |||||||||| |||||||||| |||||||| | |||| ||||| |||||||||| |||| ||||| CATAAAGCTT TTTCTCCCTC AACATTTCTA ATACAATTCT CAAATGCTCC TCATGTTCCT 180 CCCTACTTTT AGAGTATATC AGTATGTCAT CAATAAATAC GATCACAAAG AGATCCAAAT 7811 | |||| || |||||||||| ||||| |||| ||||||| | ||| |||||| |||||||||| TCTTACTATT AGAGTATATC AGTATATCAT CAATAAACAT GATGACAAAG AGATCCAAAT 240 ATGGCTTAAA AACCCCGTTC ATCAAACTCA TGAACGCAGC AGGGGCATTC GTAAGACCAA 7751 ||||||| || || ||||||| ||||| |||| |||||||||| |||||||||| |||||||||| ATGGCTT-AA AATCCCGTTC ATCAAGCTCA TGAACGCAGC AGGGGCATTC GTAAGACCAA 299 AAGATATCAC TACAAATTTG TAATGCCCAT ACCTGGTTCT AAAAGCAGTC TTTGGCACAT 7691 |||| ||||| |||||||| | |||||||||| |||||||||| |||||||||| |||||||||| AAGACATCAC TACAAATTCG TAATGCCCAT ACCTGGTTCT AAAAGCAGTC TTTGGCACAT 359 CCGTTGCCCG TATTTTCAAT TGATGATAGC CGGATCTCAA GTCAATCTTA GAGAAGACAC 7631 |||||||||| |||||||||| |||||||| | |||||||||| |||||||||| |||||||||| CCGTTGCCCG TATTTTCAAT TGATGATAAC CGGATCTCAA GTCAATCTTA GAGAAGACAC 419 AAGCACCTTG TAACTGATCG AACAAGTCAT CAATGCGAGG AAGAGGATAC TTGTTCTTTA 7571 |||||||||| |||||||||| |||||||||| |||||||||| ||| |||||| |||||||| | AAGCACCTTG TAACTGATCG AACAAGTCAT CAATGCGAGG AAGTGGATAC TTGTTCTTAA 479 TGGTTACCTT GTTCAACTGC CGGTAGTCTA TGCACATCCG AAAACTCCCA TCCTTCTTCT 7511 | |||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TAGTTACCTT GTTCAACTGC CGGTAGTCTA TGCACATCCG AAAACTCCCA TCCTTCTTCT 539 TTACAAACAA AACAGGAGCA CCCCAAGGAG ATGCACTTGG TCTAATGAAA CCCTTGCTCA 7451 | |||||| | ||| |||||| |||||||||| |||||||||| ||||||||| | ||||||| TCACAAACCA AACCGGAGCA CCCCAAGGAG ATGCACTTGG TCTAATGAAG ACTTTGCTCA 599 ACAACTCTTG AAGTTGGGCT TTTAACTCTC TTAACTCCGC GGGAGCAATT CTATAAGGGG 7391 | |||||||| ||||||||| |||||||||| |||||||||| |||||| ||| |||||||||| AAAACTCTTG AAGTTGGGCC TTTAACTCTC TTAACTCCGC GGGAGCCATT CTATAAGGGG 659 GTATAGAAAT GGGGCGAGT 7372 |||||||||| ||||||||| GTATAGAAAT GGGGCGAGT 678 hqPGS_C06HBa0153O03.1-1-_SGN-E368762+ (8050 7372) ******************************************************************************** EST sequence 120 +strand 712 n (File: SGN-E379315+) 1 GAATCCCTTG ACAAATCGGC GGTAGTAGCT AGCTAACCCA ACAAAGCTCC TTATTTCTGA 61 CACATTAGTA GGTCTTACCC AATTCTTCAC TGTCTCAATC TTAGAAGGAT CCACCATCAC 121 TCCATCCTTA GAAACCACGT TCCCCAAGAA GGACACTACA TCTAGCCAAA ACTCACACTT 181 AGAGAACTTG GCATAAAACT TTTTCTCCCT CAACATTTCC AATACCATTC TCAAATGCTC 241 TTCATGTTCC TCCTTACTTT TAGAGTATAT CAGTATATCA TCAATAAATA CGATCACAAA 301 GAGATCCAAA TATGGCTTAA AAATCCCTTT TATTAAACTC ATAAACGCAG CAGGGGCATT 361 CATAAGACCA AAAGACATCA CTACAAATTT GTAATGCCCA TACCTGGTTC TAAAAGCAGT 421 CTTTGGCACA TCCGTTGCCC GTATTTTCAA TTGATGATAA CCGGATCTCA AGTCAATCTT 481 AGAAAAGACA CAAGCACCTT GTAACTGATC GAACAAGTCA TTAATGCGAG GAAGAGGATA 541 CTTGTTCTTT ATGGTTACCT TGTTNCACTG CCGGTAGCCT ATGCACATCC GAAAACTCCA 601 ATCCTTCTTC TTTACAAACA AAACCGGAGC ACCCCAAGGA GATGCAGTTG GTCTAATGAA 661 GCCTTTGCTC ACAACTCTTT GAAGTGGTCT TTTAACTCTC TTAACTCTGC GG Predicted gene structure (within gDNA segment 8721 to 6206): Exon 1 8121 7409 ( 713 n); cDNA 1 712 ( 712 n); score: 0.954 MATCH C06HBa0153O03.1-1- SGN-E379315+ 0.954 713 1.001 C PGS_C06HBa0153O03.1-1-_SGN-E379315+ (8121 7409) Alignment (genomic DNA sequence = upper lines): GAATCCCTTG ACAAATCGGC GGTAGTAGCT AGCTAACCCA ACAAAGCTCC TTATTTCTGA 8062 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAATCCCTTG ACAAATCGGC GGTAGTAGCT AGCTAACCCA ACAAAGCTCC TTATTTCTGA 60 CACATTAGTA GGTCTTACCC AATTCTTCAC TGTCTCAATC TTAGAAGGAT CCACCATCAC 8002 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CACATTAGTA GGTCTTACCC AATTCTTCAC TGTCTCAATC TTAGAAGGAT CCACCATCAC 120 TCTATCCTTA GAAACCACGT GCCCCAAGAA GGACACTGCA TCTAGCCAAA ACTCACACTT 7942 || ||||||| |||||||||| ||||||||| ||||||| || |||||||||| |||||||||| TCCATCCTTA GAAACCACGT TCCCCAAGAA GGACACTACA TCTAGCCAAA ACTCACACTT 180 AGAGAACTTG GCATAAAGCT TTTTCTCCCT CAACATTTCC AATACCATTC TCAAATGCTC 7882 |||||||||| ||||||| || |||||||||| |||||||||| |||||||||| |||||||||| AGAGAACTTG GCATAAAACT TTTTCTCCCT CAACATTTCC AATACCATTC TCAAATGCTC 240 CTCATTTTCC TCCCTACTTT TAGAGTATAT CAGTATGTCA TCAATAAATA CGATCACAAA 7822 |||| |||| ||| |||||| |||||||||| |||||| ||| |||||||||| |||||||||| TTCATGTTCC TCCTTACTTT TAGAGTATAT CAGTATATCA TCAATAAATA CGATCACAAA 300 GAGATCCAAA TATGGCTTAA AAACCCCGTT CATCAAACTC ATGAACGCAG CAGGGGCATT 7762 |||||||||| |||||||||| ||| ||| || || |||||| || ||||||| |||||||||| GAGATCCAAA TATGGCTTAA AAATCCCTTT TATTAAACTC ATAAACGCAG CAGGGGCATT 360 CGTAAGACCA AAAGATATCA CTACAAATTT GTAATGCCCA TACCTGGTTC TAAAAGCAGT 7702 | |||||||| ||||| |||| |||||||||| |||||||||| |||||||||| |||||||||| CATAAGACCA AAAGACATCA CTACAAATTT GTAATGCCCA TACCTGGTTC TAAAAGCAGT 420 CTTTGGCACA TCCGTTGCCC GTATTTTCAA TTGATGATAG CCGGATCTCA AGTCAATCTT 7642 |||||||||| |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| CTTTGGCACA TCCGTTGCCC GTATTTTCAA TTGATGATAA CCGGATCTCA AGTCAATCTT 480 AGAGAAGACA CAAGCACCTT GTAACTGATC GAACAAGTCA TCAATGCGAG GAAGAGGATA 7582 ||| |||||| |||||||||| |||||||||| |||||||||| | |||||||| |||||||||| AGAAAAGACA CAAGCACCTT GTAACTGATC GAACAAGTCA TTAATGCGAG GAAGAGGATA 540 CTTGTTCTTT ATGGTTACCT TGTTCAACTG CCGGTAGTCT ATGCACATCC GAAAACTCCC 7522 |||||||||| |||||||||| |||| |||| ||||||| || |||||||||| ||||||||| CTTGTTCTTT ATGGTTACCT TGTTNCACTG CCGGTAGCCT ATGCACATCC GAAAACTCCA 600 ATCCTTCTTC TTTACAAACA AAACAGGAGC ACCCCAAGGA GATGCACTTG GTCTAATGAA 7462 |||||||||| |||||||||| |||| ||||| |||||||||| |||||| ||| |||||||||| ATCCTTCTTC TTTACAAACA AAACCGGAGC ACCCCAAGGA GATGCAGTTG GTCTAATGAA 660 ACCCTTGCTC AACAACTCTT GAAGTTGGGC TTTTAACTCT CTTAACTCCG CGG 7409 || |||||| ||||||||| | ||| | |||||||||| |||||||| | ||| GCCTTTGCTC -ACAACTCTT TGAAGTGGTC TTTTAACTCT CTTAACTCTG CGG 712 hqPGS_C06HBa0153O03.1-1-_SGN-E379315+ (8121 7409) ******************************************************************************** EST sequence 108 +strand 709 n (File: SGN-E578271+) 1 GTTCTTTATG GTTACCTTGT TTAGTTGTCT GTAGTCTATA TACATTCGAA AACTCCCATC 61 CTTCTTCTTT ACAAACAAAA CCGGAGCACC CAAGGGGATG CACTTGGTCT AATGAAGCCT 121 TTGTTGCTAC AAAGATATGA CCTATATATC ATATCTTGAC TGGTTCTTTA GATCCAGATA 181 ATGCGAAGTG ATGGGTTGGT TATTAGTTCT ATAGTTTTTA GTTCATACTA TGTGGGCTGG 241 GTTTTTTTAA TCCTAACCCT AACAAAACCC ACGAGTCACA CACTAAGCAT AGCAATTATA 301 TCAAATGGTC AATCGAATTT TTATTCAACC TTATAGAATT AAGAATTAGA AAGAATTAAG 361 AATTAGAAAT GTTCCCCTTG ATTAGAAAAA GAATGAATTG GTCTTTTTTT TTGTTCAATC 421 ATTGGATAGA AGGGAAAGAC AAGTAGTAAA ATTATTCCTC GTCTAGAAAT ATCCAAATTT 481 TGATGCCCAA TATTCCATAG ATAGTTCGAA CTGTATAAGA GCAATAATCA ATTTTAGCTC 541 GAATCGTTTG TAGGGGAACC CTGCCTTCTC TGATCCATTC GACACGTGCA ATTTCTTTTC 601 CGTCGATACG CCCCGCAATT TGTATTTGAA TTCCTTGTGT ATCCGCTTGT TCTGTTAATT 661 CAATAGCCTT TTTCATTGCT TTTCGAAATG AAACTCTATT CTTTAATTG Predicted gene structure (within gDNA segment 8592 to 922): Exon 1 7578 7456 ( 123 n); cDNA 1 122 ( 122 n); score: 0.894 MATCH C06HBa0153O03.1-1- SGN-E578271+ 0.894 123 0.173 C PGS_C06HBa0153O03.1-1-_SGN-E578271+ (7578 7456) Alignment (genomic DNA sequence = upper lines): GTTCTTTATG GTTACCTTGT TCAACTGCCG GTAGTCTATG CACATCCGAA AACTCCCATC 7519 |||||||||| |||||||||| | | || | ||||||||| |||| |||| |||||||||| GTTCTTTATG GTTACCTTGT TTAGTTGTCT GTAGTCTATA TACATTCGAA AACTCCCATC 60 CTTCTTCTTT ACAAACAAAA CAGGAGCACC CCAAGGAGAT GCACTTGGTC TAATGAAACC 7459 |||||||||| |||||||||| | |||||||| ||||| ||| |||||||||| ||||||| || CTTCTTCTTT ACAAACAAAA CCGGAGCACC -CAAGGGGAT GCACTTGGTC TAATGAAGCC 119 CTT 7456 || TTT 122 hqPGS_C06HBa0153O03.1-1-_SGN-E578271+ (7578 7456) ******************************************************************************** EST sequence 141 +strand 596 n (File: SGN-E375319+) 1 GAATCCCTTG ACAAATCGGC GGTAGTAGCT AGCTAACCCA ACAAAGCTCC TTATTTCTGA 61 CACATTAGTA GGTCTTACCC AATTCTTCAC TGTCTCAATC TTAGAAGGAT CCACCATCAC 121 TCCATCCTTA GAAACCACGT TCCCCAAGAA GGACACTACA TCTAGCCAAA ACTCACACTT 181 AGAGAACTTG GCATAAAACT TTTTCTCCCT CAACATTTCC AATACCATTC TCAAATGCTC 241 TTCATGTTCC TCCTTACTTT TAGAGTATAT CAGTATATCA TCAATAAATA CGATCACAAA 301 GAGATCCAAA TATGGCTTAA AAATCCCTTT TATTAAACTC ATAAACGCAG CAGGGGCATT 361 CATAAGACCA AAAGACATCA CTACAAATTT GTAATGCCCA TACCTGGTTC TAAAAGCAGT 421 CTTTGGCACA TCCGTTGCCC GTATTTTCAA TTGATGATAA CCGGATCTCA AGTCAATCTT 481 AGAAAAGACA CAAGCACCTT GTAACTGATC GAACAAGTCA TTAATGCGAG GAAGAGGATA 541 CTTGTTCTTT ATGGTTACCT TGTTCAACTG CCGGTAGCCT ATGCACATCC GAAAAC Predicted gene structure (within gDNA segment 8721 to 6916): Exon 1 8121 7526 ( 596 n); cDNA 1 596 ( 596 n); score: 0.968 MATCH C06HBa0153O03.1-1- SGN-E375319+ 0.968 596 1.000 C PGS_C06HBa0153O03.1-1-_SGN-E375319+ (8121 7526) Alignment (genomic DNA sequence = upper lines): GAATCCCTTG ACAAATCGGC GGTAGTAGCT AGCTAACCCA ACAAAGCTCC TTATTTCTGA 8062 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAATCCCTTG ACAAATCGGC GGTAGTAGCT AGCTAACCCA ACAAAGCTCC TTATTTCTGA 60 CACATTAGTA GGTCTTACCC AATTCTTCAC TGTCTCAATC TTAGAAGGAT CCACCATCAC 8002 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CACATTAGTA GGTCTTACCC AATTCTTCAC TGTCTCAATC TTAGAAGGAT CCACCATCAC 120 TCTATCCTTA GAAACCACGT GCCCCAAGAA GGACACTGCA TCTAGCCAAA ACTCACACTT 7942 || ||||||| |||||||||| ||||||||| ||||||| || |||||||||| |||||||||| TCCATCCTTA GAAACCACGT TCCCCAAGAA GGACACTACA TCTAGCCAAA ACTCACACTT 180 AGAGAACTTG GCATAAAGCT TTTTCTCCCT CAACATTTCC AATACCATTC TCAAATGCTC 7882 |||||||||| ||||||| || |||||||||| |||||||||| |||||||||| |||||||||| AGAGAACTTG GCATAAAACT TTTTCTCCCT CAACATTTCC AATACCATTC TCAAATGCTC 240 CTCATTTTCC TCCCTACTTT TAGAGTATAT CAGTATGTCA TCAATAAATA CGATCACAAA 7822 |||| |||| ||| |||||| |||||||||| |||||| ||| |||||||||| |||||||||| TTCATGTTCC TCCTTACTTT TAGAGTATAT CAGTATATCA TCAATAAATA CGATCACAAA 300 GAGATCCAAA TATGGCTTAA AAACCCCGTT CATCAAACTC ATGAACGCAG CAGGGGCATT 7762 |||||||||| |||||||||| ||| ||| || || |||||| || ||||||| |||||||||| GAGATCCAAA TATGGCTTAA AAATCCCTTT TATTAAACTC ATAAACGCAG CAGGGGCATT 360 CGTAAGACCA AAAGATATCA CTACAAATTT GTAATGCCCA TACCTGGTTC TAAAAGCAGT 7702 | |||||||| ||||| |||| |||||||||| |||||||||| |||||||||| |||||||||| CATAAGACCA AAAGACATCA CTACAAATTT GTAATGCCCA TACCTGGTTC TAAAAGCAGT 420 CTTTGGCACA TCCGTTGCCC GTATTTTCAA TTGATGATAG CCGGATCTCA AGTCAATCTT 7642 |||||||||| |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| CTTTGGCACA TCCGTTGCCC GTATTTTCAA TTGATGATAA CCGGATCTCA AGTCAATCTT 480 AGAGAAGACA CAAGCACCTT GTAACTGATC GAACAAGTCA TCAATGCGAG GAAGAGGATA 7582 ||| |||||| |||||||||| |||||||||| |||||||||| | |||||||| |||||||||| AGAAAAGACA CAAGCACCTT GTAACTGATC GAACAAGTCA TTAATGCGAG GAAGAGGATA 540 CTTGTTCTTT ATGGTTACCT TGTTCAACTG CCGGTAGTCT ATGCACATCC GAAAAC 7526 |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| |||||| CTTGTTCTTT ATGGTTACCT TGTTCAACTG CCGGTAGCCT ATGCACATCC GAAAAC 596 hqPGS_C06HBa0153O03.1-1-_SGN-E375319+ (8121 7526) ******************************************************************************** EST sequence 128 +strand 526 n (File: SGN-E204434+) 1 GAATCCCTTG ACAAATCGGC GGTAGTAGCT AGCTAACCCA ACAAAGCTCC TTATTTCTGA 61 CACATTAGTA GGTCTTACCC AATTCTTCAC TGTCTCAATC TTAGAAGGAT CCACCATCAC 121 TCCATCCTTA GAAACCACGT TCCCCAAGAA GGACACTACA TCTAGCCAAA ACTCACACTT 181 AGAGAACTTG GCATAAAACT TTTTCTCCCT CAACATTTCC AATACCATTC TCAAATGCTC 241 TTCATGTTCC TCCTTACTTT TAGAGTATAT CAGTATATCA TCAATAAATA CGATCACAAA 301 GAGATCCAAA TATGGCTTAA AAATCCCTTT TATTAAACTC ATAAACGCAG CAGGGGCATT 361 CATAAGACCA AAAGACATCA CTACAAATTT GTAATGCCCA TACCTGGTTC TAAAAGCAGT 421 CTTTGGCACA TCCGTTGCCC GTATTTTCAA TTGATGATAA CCGGATCTCA AGTCAATCTT 481 AGAAAAGACA CAAGCACCTT GTAACTGATC GAACAAGTCA TTAATG Predicted gene structure (within gDNA segment 8721 to 6941): Exon 1 8121 7596 ( 526 n); cDNA 1 526 ( 526 n); score: 0.966 MATCH C06HBa0153O03.1-1- SGN-E204434+ 0.966 526 1.000 C PGS_C06HBa0153O03.1-1-_SGN-E204434+ (8121 7596) Alignment (genomic DNA sequence = upper lines): GAATCCCTTG ACAAATCGGC GGTAGTAGCT AGCTAACCCA ACAAAGCTCC TTATTTCTGA 8062 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAATCCCTTG ACAAATCGGC GGTAGTAGCT AGCTAACCCA ACAAAGCTCC TTATTTCTGA 60 CACATTAGTA GGTCTTACCC AATTCTTCAC TGTCTCAATC TTAGAAGGAT CCACCATCAC 8002 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CACATTAGTA GGTCTTACCC AATTCTTCAC TGTCTCAATC TTAGAAGGAT CCACCATCAC 120 TCTATCCTTA GAAACCACGT GCCCCAAGAA GGACACTGCA TCTAGCCAAA ACTCACACTT 7942 || ||||||| |||||||||| ||||||||| ||||||| || |||||||||| |||||||||| TCCATCCTTA GAAACCACGT TCCCCAAGAA GGACACTACA TCTAGCCAAA ACTCACACTT 180 AGAGAACTTG GCATAAAGCT TTTTCTCCCT CAACATTTCC AATACCATTC TCAAATGCTC 7882 |||||||||| ||||||| || |||||||||| |||||||||| |||||||||| |||||||||| AGAGAACTTG GCATAAAACT TTTTCTCCCT CAACATTTCC AATACCATTC TCAAATGCTC 240 CTCATTTTCC TCCCTACTTT TAGAGTATAT CAGTATGTCA TCAATAAATA CGATCACAAA 7822 |||| |||| ||| |||||| |||||||||| |||||| ||| |||||||||| |||||||||| TTCATGTTCC TCCTTACTTT TAGAGTATAT CAGTATATCA TCAATAAATA CGATCACAAA 300 GAGATCCAAA TATGGCTTAA AAACCCCGTT CATCAAACTC ATGAACGCAG CAGGGGCATT 7762 |||||||||| |||||||||| ||| ||| || || |||||| || ||||||| |||||||||| GAGATCCAAA TATGGCTTAA AAATCCCTTT TATTAAACTC ATAAACGCAG CAGGGGCATT 360 CGTAAGACCA AAAGATATCA CTACAAATTT GTAATGCCCA TACCTGGTTC TAAAAGCAGT 7702 | |||||||| ||||| |||| |||||||||| |||||||||| |||||||||| |||||||||| CATAAGACCA AAAGACATCA CTACAAATTT GTAATGCCCA TACCTGGTTC TAAAAGCAGT 420 CTTTGGCACA TCCGTTGCCC GTATTTTCAA TTGATGATAG CCGGATCTCA AGTCAATCTT 7642 |||||||||| |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| CTTTGGCACA TCCGTTGCCC GTATTTTCAA TTGATGATAA CCGGATCTCA AGTCAATCTT 480 AGAGAAGACA CAAGCACCTT GTAACTGATC GAACAAGTCA TCAATG 7596 ||| |||||| |||||||||| |||||||||| |||||||||| | |||| AGAAAAGACA CAAGCACCTT GTAACTGATC GAACAAGTCA TTAATG 526 hqPGS_C06HBa0153O03.1-1-_SGN-E204434+ (8121 7596) ******************************************************************************** EST sequence 206 +strand 358 n (File: SGN-E240817+) 1 GTCTCACCCA TTTCTTCACT GTCTCAATCT TACCAGGATC CACCATCACT CCATCCTTAG 61 AAACCACGTG CCCCAAGAAG GACACTGCAT CTATCCAAAA CTCACACTTA GAGAACTTGG 121 CATAAAGCTT TTTCTCCCTC AACATTTCTA ATACAATTCT CAAATGCTCC TCATGTTCCT 181 TCTTACTATT AGAGGATATC AGTATATCAT CAATGAACAT GATGACGCAG AGATCCAAAT 241 ATGGCTTAAA ATCCCGTTCA TCAAGCTCAT GAACGCAGCA GGGGCATTCG TAAGACCAAA 301 AGACATCACT ACAAATTCGT AATGCCCATA CCTGGTTCTA AAAGCAGTCT TTGGCACA Predicted gene structure (within gDNA segment 8749 to 7082): Exon 1 8050 7692 ( 359 n); cDNA 1 358 ( 358 n); score: 0.930 MATCH C06HBa0153O03.1-1- SGN-E240817+ 0.930 359 1.003 C PGS_C06HBa0153O03.1-1-_SGN-E240817+ (8050 7692) Alignment (genomic DNA sequence = upper lines): GTCTTACCCA ATTCTTCACT GTCTCAATCT TAGAAGGATC CACCATCACT CTATCCTTAG 7991 |||| ||||| ||||||||| |||||||||| || |||||| |||||||||| | |||||||| GTCTCACCCA TTTCTTCACT GTCTCAATCT TACCAGGATC CACCATCACT CCATCCTTAG 60 AAACCACGTG CCCCAAGAAG GACACTGCAT CTAGCCAAAA CTCACACTTA GAGAACTTGG 7931 |||||||||| |||||||||| |||||||||| ||| |||||| |||||||||| |||||||||| AAACCACGTG CCCCAAGAAG GACACTGCAT CTATCCAAAA CTCACACTTA GAGAACTTGG 120 CATAAAGCTT TTTCTCCCTC AACATTTCCA ATACCATTCT CAAATGCTCC TCATTTTCCT 7871 |||||||||| |||||||||| |||||||| | |||| ||||| |||||||||| |||| ||||| CATAAAGCTT TTTCTCCCTC AACATTTCTA ATACAATTCT CAAATGCTCC TCATGTTCCT 180 CCCTACTTTT AGAGTATATC AGTATGTCAT CAATAAATAC GATCACAAAG AGATCCAAAT 7811 | |||| || |||| ||||| ||||| |||| |||| || | ||| || || |||||||||| TCTTACTATT AGAGGATATC AGTATATCAT CAATGAACAT GATGACGCAG AGATCCAAAT 240 ATGGCTTAAA AACCCCGTTC ATCAAACTCA TGAACGCAGC AGGGGCATTC GTAAGACCAA 7751 ||||||| || || ||||||| ||||| |||| |||||||||| |||||||||| |||||||||| ATGGCTT-AA AATCCCGTTC ATCAAGCTCA TGAACGCAGC AGGGGCATTC GTAAGACCAA 299 AAGATATCAC TACAAATTTG TAATGCCCAT ACCTGGTTCT AAAAGCAGTC TTTGGCACA 7692 |||| ||||| |||||||| | |||||||||| |||||||||| |||||||||| ||||||||| AAGACATCAC TACAAATTCG TAATGCCCAT ACCTGGTTCT AAAAGCAGTC TTTGGCACA 358 hqPGS_C06HBa0153O03.1-1-_SGN-E240817+ (8050 7692) ******************************************************************************** EST sequence 167 +strand 587 n (File: SGN-E352950+) 1 CCTCTGTAAG AATAGTGTGA ATCAAATCGT CGGCACGGGG TACACATACC TTTCCCTTAA 61 TCCTCAAAAC ACCTTCCTTA TCGATTTTTG CTTCTTTAGC CTCTCCTCGC ATTACCTTAT 121 CTCGAATCCA GATCAGTTTC TCATCGGTAA ACTGCTTTCC TTTAATCTTA TCAAGAAAGG 181 AAGATCTTGC CTCCACACAA GCCAAAAATT CTCCCTTCTC ATTTAATTCT AGCCTCATAA 241 GGTCATTAGC CAGAGTCTGA ACCTCTCTAG CCAATGGGCA TCTAGAAACC TGCAAGTGAG 301 CTAGACTTCC CATGCTCCCT GCCTTTCTAC TTAAAGCATC TGCCACAACA TTAGCCTTTC 361 CCGGATGATA CAAGATAGTG ATATCATAGT CCTTCAGTAG CTCCATCGAT CTCCTCTGTC 421 TCAAATTCAA ATCTTTCTGA GTAAAGACAT ACTGTAAACT ACGATGATCC ATATAGACTT 481 CACACTTAAC CCCATAGAGA TAATGTCTCT ATTACTTTAA TGCAATCACT ACTGCCGCTA 541 ACTCCAAATC ATGGGTCGGA TAATTACGTT CATGCACCTT TAATTGC Predicted gene structure (within gDNA segment 9994 to 6560): Exon 1 8935 8350 ( 586 n); cDNA 1 586 ( 586 n); score: 0.881 MATCH C06HBa0153O03.1-1- SGN-E352950+ 0.881 586 0.998 C PGS_C06HBa0153O03.1-1-_SGN-E352950+ (8935 8350) Alignment (genomic DNA sequence = upper lines): CCTCTGTCAA AATAGTGTTG ATTAAATCAT CGACGCGGGG TACACATACC CTTCCCTTAA 8876 ||||||| | |||||||| || ||||| | || | ||||| |||||||||| ||||||||| CCTCTGTAAG AATAGTGTGA ATCAAATCGT CGGCACGGGG TACACATACC TTTCCCTTAA 60 TCCTCAAAAC ACCTTCCTCA TCGATTGTCG CTTCTTTAGC CTCTCCTTGC AACACTTTAT 8816 |||||||||| |||||||| | |||||| | | |||||||||| ||||||| || | || |||| TCCTCAAAAC ACCTTCCTTA TCGATTTTTG CTTCTTTAGC CTCTCCTCGC ATTACCTTAT 120 CTCGGATCCG GATCAATTTC TCATCATTAA ACTGCTTTCC CTTAATCTTG TCAAGGAAGG 8756 |||| |||| ||||| |||| ||||| ||| |||||||||| |||||||| ||||| |||| CTCGAATCCA GATCAGTTTC TCATCGGTAA ACTGCTTTCC TTTAATCTTA TCAAGAAAGG 180 GAGATCTTGC CTCCACACAA GCTAAAAATC CTCCCTTCTC ATTTACTTCT AATCTTATAA 8696 ||||||||| |||||||||| || |||||| |||||||||| ||||| |||| | || |||| AAGATCTTGC CTCCACACAA GCCAAAAATT CTCCCTTCTC ATTTAATTCT AGCCTCATAA 240 GGTCGTTAGC CAGAATCTAA ACTTCTCTAG CCAATAGGCG TCTAGAAGCT TGCAAGTGAG 8636 |||| ||||| |||| ||| | || ||||||| ||||| ||| ||||||| | |||||||||| GGTCATTAGC CAGAGTCTGA ACCTCTCTAG CCAATGGGCA TCTAGAAACC TGCAAGTGAG 300 CTAGACATCC CATGCTTCCC GCCTTTCTAC TTAAAGCATC CGCTACAACA TTCGCCTTCC 8576 |||||| ||| |||||| || |||||||||| |||||||||| || |||||| || ||||| | CTAGACTTCC CATGCTCCCT GCCTTTCTAC TTAAAGCATC TGCCACAACA TTAGCCTTTC 360 CCGAATGATA CAAAATAGTG ATATCGTAGT CCTTCAGTAG TTCCATCCAT CTCCTCTGTC 8516 ||| |||||| ||| |||||| ||||| |||| |||||||||| |||||| || |||||||||| CCGGATGATA CAAGATAGTG ATATCATAGT CCTTCAGTAG CTCCATCGAT CTCCTCTGTC 420 TCAAGTTCAA ATCTTTCAGA GTAAAGACAT ACTGTAGGCT ACGATGATCC GTATAGACCT 8456 |||| ||||| ||||||| || |||||||||| |||||| || |||||||||| ||||||| | TCAAATTCAA ATCTTTCTGA GTAAAGACAT ACTGTAAACT ACGATGATCC ATATAGACTT 480 CACACTTAAC CCCATATAAA TAGTGTCTCC ATTGCTTTAA TGCAAACACT ACTGCGGCCA 8396 |||||||||| |||||| | | || |||||| ||| |||||| ||||| |||| ||||| || | CACACTTAAC CCCATAGAGA TAATGTCTCT ATTACTTTAA TGCAATCACT ACTGCCGCTA 540 ATTCCAAATC ATGAGTTGGA TAGTTACGTT CATGCACTTT TAATTG 8350 | |||||||| ||| || ||| || ||||||| ||||||| || |||||| ACTCCAAATC ATGGGTCGGA TAATTACGTT CATGCACCTT TAATTG 586 hqPGS_C06HBa0153O03.1-1-_SGN-E352950+ (8935 8350) ******************************************************************************** EST sequence 198 +strand 587 n (File: SGN-E357100+) 1 CCTCTGTAAG AATAGTGTGA ATCAAATCGT CGGCACGGGG TACACATACC TTTCCCTTAA 61 TCCTCAAAAC ACCTTCCTTA TCGATTTTTG CTTCTTTAGC CTCTCCTCGC ATTACCTTAT 121 CTCGAATCCA GATCAGTTTC TCATCGGTAA ACTGCTTTCC TTTAATCTTA TCAAGAAAGG 181 AAGATCTTGC CTCCACACAA GCCAAAAATT CTCCCTTCTC ATTTAATTCT AGCCTCATAA 241 GGTCATTAGC CAGAGTCTGA ACCTCTCTAG CCAATGGGCA TCTAGAAACC TGCAAGTGAG 301 CTAGACTTCC CATGCTCCCT GCCTTTCTAC TTAAAGCATC TGCCACAACA TTAGCCTTTC 361 CCGGATGATA CAAGATAGTG ATATCATAGT CCTTCAGTAG CTCCATCGAT CTCCTCTGTC 421 TCAAATTCAA ATCTTTCTGA GTAAAGACAT ACTGTAAACT ACGATGATCC ATATAGACTT 481 CACACTTAAC CCCATAGAGA TAATGTCTCT ATTACTTTAA TGCAATCACT ACTGCCGCTA 541 ACTCCAAATC ATGGGTCGGA TAATTACGTT CATGCACCTT TAATTGC Predicted gene structure (within gDNA segment 9994 to 6560): Exon 1 8935 8350 ( 586 n); cDNA 1 586 ( 586 n); score: 0.881 MATCH C06HBa0153O03.1-1- SGN-E357100+ 0.881 586 0.998 C PGS_C06HBa0153O03.1-1-_SGN-E357100+ (8935 8350) Alignment (genomic DNA sequence = upper lines): CCTCTGTCAA AATAGTGTTG ATTAAATCAT CGACGCGGGG TACACATACC CTTCCCTTAA 8876 ||||||| | |||||||| || ||||| | || | ||||| |||||||||| ||||||||| CCTCTGTAAG AATAGTGTGA ATCAAATCGT CGGCACGGGG TACACATACC TTTCCCTTAA 60 TCCTCAAAAC ACCTTCCTCA TCGATTGTCG CTTCTTTAGC CTCTCCTTGC AACACTTTAT 8816 |||||||||| |||||||| | |||||| | | |||||||||| ||||||| || | || |||| TCCTCAAAAC ACCTTCCTTA TCGATTTTTG CTTCTTTAGC CTCTCCTCGC ATTACCTTAT 120 CTCGGATCCG GATCAATTTC TCATCATTAA ACTGCTTTCC CTTAATCTTG TCAAGGAAGG 8756 |||| |||| ||||| |||| ||||| ||| |||||||||| |||||||| ||||| |||| CTCGAATCCA GATCAGTTTC TCATCGGTAA ACTGCTTTCC TTTAATCTTA TCAAGAAAGG 180 GAGATCTTGC CTCCACACAA GCTAAAAATC CTCCCTTCTC ATTTACTTCT AATCTTATAA 8696 ||||||||| |||||||||| || |||||| |||||||||| ||||| |||| | || |||| AAGATCTTGC CTCCACACAA GCCAAAAATT CTCCCTTCTC ATTTAATTCT AGCCTCATAA 240 GGTCGTTAGC CAGAATCTAA ACTTCTCTAG CCAATAGGCG TCTAGAAGCT TGCAAGTGAG 8636 |||| ||||| |||| ||| | || ||||||| ||||| ||| ||||||| | |||||||||| GGTCATTAGC CAGAGTCTGA ACCTCTCTAG CCAATGGGCA TCTAGAAACC TGCAAGTGAG 300 CTAGACATCC CATGCTTCCC GCCTTTCTAC TTAAAGCATC CGCTACAACA TTCGCCTTCC 8576 |||||| ||| |||||| || |||||||||| |||||||||| || |||||| || ||||| | CTAGACTTCC CATGCTCCCT GCCTTTCTAC TTAAAGCATC TGCCACAACA TTAGCCTTTC 360 CCGAATGATA CAAAATAGTG ATATCGTAGT CCTTCAGTAG TTCCATCCAT CTCCTCTGTC 8516 ||| |||||| ||| |||||| ||||| |||| |||||||||| |||||| || |||||||||| CCGGATGATA CAAGATAGTG ATATCATAGT CCTTCAGTAG CTCCATCGAT CTCCTCTGTC 420 TCAAGTTCAA ATCTTTCAGA GTAAAGACAT ACTGTAGGCT ACGATGATCC GTATAGACCT 8456 |||| ||||| ||||||| || |||||||||| |||||| || |||||||||| ||||||| | TCAAATTCAA ATCTTTCTGA GTAAAGACAT ACTGTAAACT ACGATGATCC ATATAGACTT 480 CACACTTAAC CCCATATAAA TAGTGTCTCC ATTGCTTTAA TGCAAACACT ACTGCGGCCA 8396 |||||||||| |||||| | | || |||||| ||| |||||| ||||| |||| ||||| || | CACACTTAAC CCCATAGAGA TAATGTCTCT ATTACTTTAA TGCAATCACT ACTGCCGCTA 540 ATTCCAAATC ATGAGTTGGA TAGTTACGTT CATGCACTTT TAATTG 8350 | |||||||| ||| || ||| || ||||||| ||||||| || |||||| ACTCCAAATC ATGGGTCGGA TAATTACGTT CATGCACCTT TAATTG 586 hqPGS_C06HBa0153O03.1-1-_SGN-E357100+ (8935 8350) ******************************************************************************** EST sequence 146 +strand 554 n (File: SGN-E352647+) 1 CCTCTGTAAG AATAGTGTGA ATCAAATCGT CGGCACGGGG TACACATACC TTTCCCTTAA 61 TCCTCAAAAC ACCTTCCTTA TCGATTTTTG CTTCTTTAGC CTCTCCTCGC ATTACCTTAT 121 CTCGAATCCA GATCAGTTTC TCATCGGTAA ACTGCTTTCC TTTAATCTTA TCAAGAAAGG 181 AAGATCTTGC CTCCACACAA GCCAAAAATT CTCCCTTCTC ATTTAATTCT AGCCTCATAA 241 GGTCATTAGC CAGAGTCTGA ACCTCTCTAG CCAATGGGCA TCTAGAAACC TGCAAGTGAG 301 CTAGACTTCC CATGCTCCCT GCCTTTCTAC TTAAAGCATC TGCCACAACA TTAGCCTTTC 361 CCGGATGATA CAAGATAGTG ATATCATAGT CCTTCAGTAG CTCCATCGAT CTCCTCTGTC 421 TCAAATTCAA ATCTTTCTGA GTAAAGACAT ACTGTAAACT ACGATGATCC ATATAGACTT 481 CACACTTAAA CCCATAGAGA TAATGGCTCT ATTACTTTAA TGCAATCACT ACTGCCGCTA 541 ACTCCAAATC ATGG Predicted gene structure (within gDNA segment 9994 to 6890): Exon 1 8935 8383 ( 553 n); cDNA 1 553 ( 553 n); score: 0.877 MATCH C06HBa0153O03.1-1- SGN-E352647+ 0.877 553 0.998 C PGS_C06HBa0153O03.1-1-_SGN-E352647+ (8935 8383) Alignment (genomic DNA sequence = upper lines): CCTCTGTCAA AATAGTGTTG ATTAAATCAT CGACGCGGGG TACACATACC CTTCCCTTAA 8876 ||||||| | |||||||| || ||||| | || | ||||| |||||||||| ||||||||| CCTCTGTAAG AATAGTGTGA ATCAAATCGT CGGCACGGGG TACACATACC TTTCCCTTAA 60 TCCTCAAAAC ACCTTCCTCA TCGATTGTCG CTTCTTTAGC CTCTCCTTGC AACACTTTAT 8816 |||||||||| |||||||| | |||||| | | |||||||||| ||||||| || | || |||| TCCTCAAAAC ACCTTCCTTA TCGATTTTTG CTTCTTTAGC CTCTCCTCGC ATTACCTTAT 120 CTCGGATCCG GATCAATTTC TCATCATTAA ACTGCTTTCC CTTAATCTTG TCAAGGAAGG 8756 |||| |||| ||||| |||| ||||| ||| |||||||||| |||||||| ||||| |||| CTCGAATCCA GATCAGTTTC TCATCGGTAA ACTGCTTTCC TTTAATCTTA TCAAGAAAGG 180 GAGATCTTGC CTCCACACAA GCTAAAAATC CTCCCTTCTC ATTTACTTCT AATCTTATAA 8696 ||||||||| |||||||||| || |||||| |||||||||| ||||| |||| | || |||| AAGATCTTGC CTCCACACAA GCCAAAAATT CTCCCTTCTC ATTTAATTCT AGCCTCATAA 240 GGTCGTTAGC CAGAATCTAA ACTTCTCTAG CCAATAGGCG TCTAGAAGCT TGCAAGTGAG 8636 |||| ||||| |||| ||| | || ||||||| ||||| ||| ||||||| | |||||||||| GGTCATTAGC CAGAGTCTGA ACCTCTCTAG CCAATGGGCA TCTAGAAACC TGCAAGTGAG 300 CTAGACATCC CATGCTTCCC GCCTTTCTAC TTAAAGCATC CGCTACAACA TTCGCCTTCC 8576 |||||| ||| |||||| || |||||||||| |||||||||| || |||||| || ||||| | CTAGACTTCC CATGCTCCCT GCCTTTCTAC TTAAAGCATC TGCCACAACA TTAGCCTTTC 360 CCGAATGATA CAAAATAGTG ATATCGTAGT CCTTCAGTAG TTCCATCCAT CTCCTCTGTC 8516 ||| |||||| ||| |||||| ||||| |||| |||||||||| |||||| || |||||||||| CCGGATGATA CAAGATAGTG ATATCATAGT CCTTCAGTAG CTCCATCGAT CTCCTCTGTC 420 TCAAGTTCAA ATCTTTCAGA GTAAAGACAT ACTGTAGGCT ACGATGATCC GTATAGACCT 8456 |||| ||||| ||||||| || |||||||||| |||||| || |||||||||| ||||||| | TCAAATTCAA ATCTTTCTGA GTAAAGACAT ACTGTAAACT ACGATGATCC ATATAGACTT 480 CACACTTAAC CCCATATAAA TAGTGTCTCC ATTGCTTTAA TGCAAACACT ACTGCGGCCA 8396 ||||||||| |||||| | | || || ||| ||| |||||| ||||| |||| ||||| || | CACACTTAAA CCCATAGAGA TAATGGCTCT ATTACTTTAA TGCAATCACT ACTGCCGCTA 540 ATTCCAAATC ATG 8383 | |||||||| ||| ACTCCAAATC ATG 553 hqPGS_C06HBa0153O03.1-1-_SGN-E352647+ (8935 8383) ******************************************************************************** EST sequence 165 +strand 542 n (File: SGN-E353207+) 1 TCGGCATGTA CTGTTTTCCA AAACTTAGAA GTAAATTGCG TACCTCTATC TGATATGATG 61 GAGAGTGGAA CTCCGTGCAA TCGAACAATT TCTAAGATGT AAAGTTTGGC TAACTTCTCT 121 GCATTGTAAG TCACCTTAAC CGAAATAGAG TGAGCAGACT TAGTTAATCT GTCAACAATC 181 ACCCAAATAG AGTCATACCT ACCCATCGTC TTTGGATGAC TAACCACGAA GTCCATTGCA 241 ATTCTCTCCC ACTTCCATTC AGGAATGGGC ATTCTCTGAA GTGTCCCTCC AGGCCTCTGG 301 TGTTCATACT TTACTTGCTG ACAATTCGGG CATTGAGCAA CAAAATTAAC AATGTCACGC 361 TTCATCCTAC TCCACCAAAA ATGTTGCTTT AGGTCACGAT ACATCTTGTT TGCACCAGGA 421 TGTATAGAGT ACTTCGAACT ATGAGCCTAT ATAAGAATAG TGTGAATCAA ATCATCGACG 481 CGGGGTACAC ATACTCCTTC CCTTATTCTC AAAACACCTT CCTCATCGAT TTTTGCTTCT 541 TT Predicted gene structure (within gDNA segment 10078 to 8130): Exon 1 9379 8839 ( 541 n); cDNA 1 542 ( 542 n); score: 0.858 MATCH C06HBa0153O03.1-1- SGN-E353207+ 0.858 541 0.998 C PGS_C06HBa0153O03.1-1-_SGN-E353207+ (9379 8839) Alignment (genomic DNA sequence = upper lines): TCCGCATGCA ATGTTTTCCA AAACTTAAAA GTAAACTGCG TACCTCTATC TGATATAATG 9320 || ||||| | ||||||||| ||||||| || ||||| |||| |||||||||| |||||| ||| TCGGCATGTA CTGTTTTCCA AAACTTAGAA GTAAATTGCG TACCTCTATC TGATATGATG 60 GAGAGTGGAA CCCCATGCAA TCGCACCACT TCCGAGATGT AAAGTTTGGA TAACTTCTCT 9260 |||||||||| | || ||||| ||| || | | || |||||| ||||||||| |||||||||| GAGAGTGGAA CTCCGTGCAA TCGAACAATT TCTAAGATGT AAAGTTTGGC TAACTTCTCT 120 GCATTGTAAG GCACCTTGAC CGGAATGAAG TGAGCAGACT TAGTTAACCT ATCAACAATT 9200 |||||||||| |||||| || || ||| || |||||||||| ||||||| || |||||||| GCATTGTAAG TCACCTTAAC CGAAATAGAG TGAGCAGACT TAGTTAATCT GTCAACAATC 180 ACCCAAATGG AATCAAACTT TCCCAATGTC TTTGGAAGAC CAACCACGAA ATCCATTGCA 9140 |||||||| | | ||| || | |||| ||| |||||| ||| ||||||||| ||||||||| ACCCAAATAG AGTCATACCT ACCCATCGTC TTTGGATGAC TAACCACGAA GTCCATTGCA 240 ATCCTTTCCC ACTTCCATTC GGGAATGGGC ATTCTCTGAA GTGTTCCTCC GGGCCTTTGG 9080 || || |||| |||||||||| ||||||||| |||||||||| |||| ||||| ||||| ||| ATTCTCTCCC ACTTCCATTC AGGAATGGGC ATTCTCTGAA GTGTCCCTCC AGGCCTCTGG 300 TGTTCATACT TTACCTGTTG ACAGTTTGGA CATTTGGCAA TAAAATCCAC AATATCACGC 9020 |||||||||| |||| || || ||| || || |||| |||| ||||| || ||| |||||| TGTTCATACT TTACTTGCTG ACAATTCGGG CATTGAGCAA CAAAATTAAC AATGTCACGC 360 TTCATTCTAC TCTACC-AAA GTGTTGTTTT AGGTCACGGT ACATTTTGGT TGCACCCGGA 8961 ||||| |||| || ||| ||| ||||| ||| |||||||| | |||| ||| | |||||| ||| TTCATCCTAC TCCACCAAAA ATGTTGCTTT AGGTCACGAT ACATCTTGTT TGCACCAGGA 420 TGTATCGAAT ACCTTGAACT ATGAGCCTCT GTCAAAATAG TGTTGATTAA ATCATCGACG 8901 ||||| || | || | ||||| |||||||| | | | ||||| ||| || || |||||||||| TGTATAGAGT ACTTCGAACT ATGAGCCTAT ATAAGAATAG TGTGAATCAA ATCATCGACG 480 CGGGGTACAC ATAC-CCTTC CCTTAATCCT CAAAACACCT TCCTCATCGA TTGTCGCTTC 8842 |||||||||| |||| ||||| |||| || || |||||||||| |||||||||| || | ||||| CGGGGTACAC ATACTCCTTC CCTT-ATTCT CAAAACACCT TCCTCATCGA TTTTTGCTTC 539 TTT 8839 ||| TTT 542 hqPGS_C06HBa0153O03.1-1-_SGN-E353207+ (9379 8839) ******************************************************************************** EST sequence 106 +strand 188 n (File: SGN-E577888+) 1 CTACATCTCC TACCATACAA TGCTTCAAAT GGAGCCATAT CAATGCTTGA TTGATAGCTA 61 TTATTGTATG AAAACTCCGC TAAGGGTAGG AAGCTATCCC AATGACCACC AAACTCTATC 121 ACACACGCAC GAAGCATATT CTCCAACACT TGAATCGTTC GCTCAGACTG ACCATCGGGA 181 GAACTAGT Predicted gene structure (within gDNA segment 10494 to 8714): Exon 1 9597 9425 ( 173 n); cDNA 1 177 ( 177 n); score: 0.861 MATCH C06HBa0153O03.1-1- SGN-E577888+ 0.861 173 0.920 C PGS_C06HBa0153O03.1-1-_SGN-E577888+ (9597 9425) Alignment (genomic DNA sequence = upper lines): CTACATCTTC TCCCATATAG TGCCTCAAAT GGGGCCATAT CAATGCTTGA GTGATAGCTA 9538 |||||||| | | ||||| | ||| |||||| || ||||||| |||||||||| ||||||||| CTACATCTCC TACCATACAA TGCTTCAAAT GGAGCCATAT CAATGCTTGA TTGATAGCTA 60 TTATTGTAGG AGAACTCTGC TAAGGGT--- -AGCTATCCC ACTGACCACC AAATTCTATC 9482 |||||||| | | ||||| || ||||||| ||||||||| | |||||||| ||| |||||| TTATTGTATG AAAACTCCGC TAAGGGTAGG AAGCTATCCC AATGACCACC AAACTCTATC 120 ACACATGCAC GAAGCATATC CTCCAACACT TGAATCGTTC GCTCAGACTG ACCATCG 9425 ||||| |||| ||||||||| |||||||||| |||||||||| |||||||||| ||||||| ACACACGCAC GAAGCATATT CTCCAACACT TGAATCGTTC GCTCAGACTG ACCATCG 177 hqPGS_C06HBa0153O03.1-1-_SGN-E577888+ (9597 9425) ******************************************************************************** EST sequence 67 -strand 763 n (File: SGN-E354383-) 1 AGAGAGTCGA TTTTCATATC CAAATTCAGA AATTCTAAGT ATGCTGAAAC GATGCACCTT 61 CGACGGGCCG TCGTGCCTGT GACGGTCCGT CGCAGTGCCC GTGGTCTTGG CCAGTTTTTC 121 CAGAATTAAA ATCTGCTGCT CAAAACGACT AAACAGGTCG TTACAATAGA TACCAATTTA 181 CCCATCGTTC GTCCCCGAAC GATCAAATGA AGAAAAACAA TGGCGAAAAG GAGTACCTGA 241 ATCTGTAAAC AGATGTGGGT ATTTTTCTTG CATATCTGCC TCCTTCTCCC AAGTAGCTTC 301 TTCAACCGGT CGATTCTTCC ATTGAACTTT GATGGATGCA ATTTCTCTCG ACCTCAACTT 361 GCGAACCTCT CTATCTAAAA TAGCAACTGG TTCCTCCTCA TAAGTCCAAT TCTCACCAAG 421 CGAAACTGAA TCCCAACGGA TGATGTAGTT TCCATCTCCA TGGTATCTTT TCAACATGGA 481 TACATGAAAT ACCGGATGTA CTCCGGACAG CCCTGGAGGT AAGGCTAATT CATAAGCCAC 541 CTCCCCTACT CGCTTTAGTA CCTCAAATGG TCCAATATAC CTTGGACTTA ACTTACCTCT 601 TTTACGAAAC CGCATCACCC CTTTCATTGG CGAGACTTTC AACAAGACTT GTTCGCCCTC 661 CATGAACTCT AAGTCTCTAA CCTTTCGATC TGCATATTCT TTTTGTCTAC TTTGCGCCGC 721 TAGAAGCTTT TCTTGAATAG ACTTCACTTT CTCCATCGAA TCT Predicted gene structure (within gDNA segment 12308 to 7186): Exon 1 10420 9659 ( 762 n); cDNA 1 762 ( 762 n); score: 0.867 MATCH C06HBa0153O03.1-1- SGN-E354383- 0.867 762 0.999 C PGS_C06HBa0153O03.1-1-_SGN-E354383- (10420 9659) Alignment (genomic DNA sequence = upper lines): AGAAAGTCGA TTTCAGTACC CAATTTTCAG AATTCTAAGT ATTTTGGAAT GAGATACCCT 10361 ||| |||||| ||| || | ||| || |||||||||| || || || || ||| | AGAGAGTCGA TTTTCATATC CAAATTCAGA AATTCTAAGT ATGCTGAAAC GATGCACCTT 60 CAACGGTCTG TCGTGCCCAT GACGGTCCGT CGTGGGTTCC GTCATCTCAG CCTGTTTTTC 10301 | |||| | | ||||||| | |||||||||| || | || || ||| | || ||||||| CGACGGGCCG TCGTGCCTGT GACGGTCCGT CGCAGTGCCC GTGGTCTTGG CCAGTTTTTC 120 AAGAAATAAA ATCTGCTGCT CGAAACGACT AAACAGGTCG TTACAATAGA TACCAATTTA 10241 |||| |||| |||||||||| | |||||||| |||||||||| |||||||||| |||||||||| CAGAATTAAA ATCTGCTGCT CAAAACGACT AAACAGGTCG TTACAATAGA TACCAATTTA 180 CCCATCGTTC GTCCCCGAAC GATCACAAGA AGGAAAACAA GGGCGAAAAG GAGTACCTGA 10181 |||||||||| |||||||||| ||||| | || || ||||||| ||||||||| |||||||||| CCCATCGTTC GTCCCCGAAC GATCAAATGA AGAAAAACAA TGGCGAAAAG GAGTACCTGA 240 ATCTGTAAAC AGGTGTGGGT ATCTTTCTCG CATATCAGCC TTGTTCTCCC AAGTGGCTTC 10121 |||||||||| || ||||||| || ||||| | |||||| ||| | ||||||| |||| ||||| ATCTGTAAAC AGATGTGGGT ATTTTTCTTG CATATCTGCC TCCTTCTCCC AAGTAGCTTC 300 TTCGACTGGT CGATTCTTCC TTTGAACTTT GATGGATGCA ATCTCCCTTG ATCTCAACTT 10061 ||| || ||| |||||||||| ||||||||| |||||||||| || || || | | |||||||| TTCAACCGGT CGATTCTTCC ATTGAACTTT GATGGATGCA ATTTCTCTCG ACCTCAACTT 360 GCGAATTTCT CTATCTAGAA TGGCAACAGG CTCCTCCTCA TAAGTCAAAT TTTCATCAAG 10001 ||||| ||| ||||||| || | ||||| || ||||||||| |||||| ||| | ||| |||| GCGAACCTCT CTATCTAAAA TAGCAACTGG TTCCTCCTCA TAAGTCCAAT TCTCACCAAG 420 CAAAACTGAA TCCCAACGGA TAATGTAGTT TCCATCCCCA TGGTATCTTT TCAACATAGA 9941 | |||||||| |||||||||| | |||||||| |||||| ||| |||||||||| ||||||| || CGAAACTGAA TCCCAACGGA TGATGTAGTT TCCATCTCCA TGGTATCTTT TCAACATGGA 480 CACATGAAAT ACCGGATGCA CTCCGGACAG CCCTGGAGGC AAGGCTAACT CATAAGCCAC 9881 ||||||||| |||||||| | |||||||||| ||||||||| |||||||| | |||||||||| TACATGAAAT ACCGGATGTA CTCCGGACAG CCCTGGAGGT AAGGCTAATT CATAAGCCAC 540 CTCCCCTACT CGCTTGAGTA CTTCAAATGG ACCAATATAC CTTGGGCTAA GTTTACCTCG 9821 |||||||||| ||||| |||| | |||||||| ||||||||| ||||| || | ||||||| CTCCCCTACT CGCTTTAGTA CCTCAAATGG TCCAATATAC CTTGGACTTA ACTTACCTCT 600 CTTACCAAAC CGCATCACCC CTTTCATGGG CGAAACCTTC AGCAAGACTT GTTCACCCTC 9761 |||| |||| |||||||||| ||||||| || ||| || ||| | |||||||| |||| ||||| TTTACGAAAC CGCATCACCC CTTTCATTGG CGAGACTTTC AACAAGACTT GTTCGCCCTC 660 CATGAACACC AAGTCTCTAA CTTTTCTATC TGTATATTCC TTTTGCCTAC TTTGCGCCGC 9701 ||||||| | |||||||||| | |||| ||| || |||||| ||||| |||| |||||||||| CATGAACTCT AAGTCTCTAA CCTTTCGATC TGCATATTCT TTTTGTCTAC TTTGCGCCGC 720 TAACAGTTTT TCTTGAATAG ATTTCACTTT ATCTAACGAT TC 9659 || || ||| |||||||||| | |||||||| || | ||| || TAGAAGCTTT TCTTGAATAG ACTTCACTTT CTCCATCGAA TC 762 hqPGS_C06HBa0153O03.1-1-_SGN-E354383- (10420 9659) ******************************************************************************** EST sequence 34 -strand 542 n (File: SGN-E252199-) 1 CGACCCAGCC TGGGATTACG CAGTCTGTGA CGGTCCGTCC TGCACGTCCG TCACAGAGTT 61 CAGAGACTAG ATTTTTACCA AGGGTCTGTG ACGGCCCATC ACGCCTGTGA CGGTCCGTCC 121 TGCCATTCCG TCACGAAGTT CAGAGAGTCG ATTTCAGTAC CCAAATTTCA GAATTCTAAG 181 TGTTTTGGAA CGAGACCCCC TCGACGGTCC GTCGTGGGAT CCGTCGTCTC AGTCAGTTTT 241 TCCAGAAATA AAATCTGTTA CTCAAAACGA CTAAACAGGT CGTTACAATA GATACCAATT 301 TACCCATCGT TCGTCCCCGA ACGATCACAA GAAGAAAAAC AAGGGCGAAA AGGAGTACCT 361 GAATCTGTAA ACAGGTATGG GTATCTTTCT CGCATATCAA CTTCCTTCTC CCAAGTGGAT 421 TCTTCAACTG GTCGATTCTT CCATTGAACT TTGATAGATG CAATCTCCCT TGACCTCAAT 481 TTGCGGACTT CTCTATCTAA AATGGCAACA GGCTCCTCCT CATAAGACAA ATTCTCATCA 541 AG Predicted gene structure (within gDNA segment 12098 to 8780): Exon 1 10476 10001 ( 476 n); cDNA 86 542 ( 457 n); score: 0.889 MATCH C06HBa0153O03.1-1- SGN-E252199- 0.889 476 0.878 C PGS_C06HBa0153O03.1-1-_SGN-E252199- (10476 10001) Alignment (genomic DNA sequence = upper lines): CTGTGACGGT CCGTCACACC TGTGACGGTC CGTCCTGCCA TTTCGTTACG AAGTTCAGAA 10417 ||||||||| || |||| || |||||||||| |||||||||| || ||| ||| ||||||||| CTGTGACGGC CCATCACGCC TGTGACGGTC CGTCCTGCCA TTCCGTCACG AAGTTCAGAG 145 AGTCGATTTC AGTACCCAAT TTTCAGAATT CTAAGTATTT TGGAATGAGA TACCCTCAAC 10357 |||||||||| ||||||||| |||||||||| |||||| ||| ||||| ||| |||| | | AGTCGATTTC AGTACCCAAA TTTCAGAATT CTAAGTGTTT TGGAACGAG- -ACCC-C--C 200 GGTCTGTCGT GCCCATGACG GTCCGTCGTG GGTTCCGTCA TCTCAGCCTG TTTTTCAAGA 10297 || |||| |||||||||| || |||||| |||||| | | |||||| ||| --TC------ ------GACG GTCCGTCGTG GGATCCGTCG TCTCAGTCAG TTTTTCCAGA 246 AATAAAATCT GCTGCTCGAA ACGACTAAAC AGGTCGTTAC AATAGATACC AATTTACCCA 10237 |||||||||| | | ||| || |||||||||| |||||||||| |||||||||| |||||||||| AATAAAATCT GTTACTCAAA ACGACTAAAC AGGTCGTTAC AATAGATACC AATTTACCCA 306 TCGTTCGTCC CCGAACGATC ACAAGAAGGA AAACAAGGGC GAAAAGGAGT ACCTGAATCT 10177 |||||||||| |||||||||| |||||||| | |||||||||| |||||||||| |||||||||| TCGTTCGTCC CCGAACGATC ACAAGAAGAA AAACAAGGGC GAAAAGGAGT ACCTGAATCT 366 GTAAACAGGT GTGGGTATCT TTCTCGCATA TCAGCCTTGT TCTCCCAAGT GGCTTCTTCG 10117 |||||||||| ||||||||| |||||||||| ||| | | | |||||||||| || |||||| GTAAACAGGT ATGGGTATCT TTCTCGCATA TCAACTTCCT TCTCCCAAGT GGATTCTTCA 426 ACTGGTCGAT TCTTCCTTTG AACTTTGATG GATGCAATCT CCCTTGATCT CAACTTGCGA 10057 |||||||||| |||||| ||| ||||||||| |||||||||| ||||||| || ||| ||||| ACTGGTCGAT TCTTCCATTG AACTTTGATA GATGCAATCT CCCTTGACCT CAATTTGCGG 486 ATTTCTCTAT CTAGAATGGC AACAGGCTCC TCCTCATAAG TCAAATTTTC ATCAAG 10001 | |||||||| ||| |||||| |||||||||| |||||||||| |||||| || |||||| ACTTCTCTAT CTAAAATGGC AACAGGCTCC TCCTCATAAG ACAAATTCTC ATCAAG 542 hqPGS_C06HBa0153O03.1-1-_SGN-E252199- (10476 10001) ******************************************************************************** EST sequence 160 +strand 390 n (File: SGN-E242274+) 1 TGAGTCGACG GATCCCACGA CGGACCGTCA TGAGCACGAC GGACCGTGGA GGGTGTCTCG 61 TTCCAAAACA CTTAGAATTC TGAAATTTGG GTACTGAGAT CGACTCTCTG AACTTCGCGA 121 CGAAATGGCA CGACGGACCG TCACAGGCAT GACGGGCTGT CACAAACCCT TAGTGAAATT 181 TAATCTCTGA ACTTTGTGAC GGAAGCAGCA GGACGGACCG TCGCAGGCAC GACTGGTCGT 241 CACAGACTGC GTAACCCTGA CTGGGTCGGA TTTTTGTTAA ATGTTTTAAG GGGCGTTTTG 301 GACTATTCCT GCTTATAATT ATGAAATTAG TGGTTTAATG TTAATAATTC AATTACTTGG 361 GGGGTTGAAG GAGATAACCT TGAATTAATT Predicted gene structure (within gDNA segment 12100 to 8700): Exon 1 10924 10618 ( 307 n); cDNA 1 305 ( 305 n); score: 0.881 MATCH C06HBa0153O03.1-1- SGN-E242274+ 0.881 307 0.787 C PGS_C06HBa0153O03.1-1-_SGN-E242274+ (10924 10618) Alignment (genomic DNA sequence = upper lines): TGAGACGACG GATCCCACGA CGGACCGTCA TGGGCACGAT GGACCGTCGA GGGGGTCTCG 10865 |||| ||||| |||||||||| |||||||||| || |||||| ||||||| || ||| |||||| TGAGTCGACG GATCCCACGA CGGACCGTCA TGAGCACGAC GGACCGTGGA GGGTGTCTCG 60 TTCAAAAACA CTTAGAATTC TGAAATTTGG ATACTGAAAT TGACTCTCTG AACTTCGTGA 10805 ||| |||||| |||||||||| |||||||||| |||||| || ||||||||| ||||||| || TTCCAAAACA CTTAGAATTC TGAAATTTGG GTACTGAGAT CGACTCTCTG AACTTCGCGA 120 CGAAGTGACA GGACGGACCG TCACAGGCAT GACGGGCCGT CACAGACTCT TCAGTAAATT 10745 |||| || || ||||||||| |||||||||| ||||||| || |||| || || | ||| || | CGAAATGGCA CGACGGACCG TCACAGGCAT GACGGGCTGT CACAAACCCT T-AGTGAAAT 179 TCAGTCTCTG AACTCTGTGA TGGAAGCAGC AGGACGGACC GTCGCAGGCA CGATGGCCCG 10685 | | |||||| |||| ||||| ||||||||| |||||||||| |||||||||| ||| | || TTAATCTCTG AACTTTGTGA CGGAAGCAGC AGGACGGACC GTCGCAGGCA CGACTGGTCG 239 TCACAGACTG CGTAATCCCA GGCTGAGTCG GATTTCT-TT AAATGTTTTA AGGGGGCGTT 10626 |||||||||| ||||| ||| | ||| |||| ||||| | || |||||||||| | |||||||| TCACAGACTG CGTAA-CCCT GACTGGGTCG GATTTTTGTT AAATGTTTTA A-GGGGCGTT 297 TTGGATTA 10618 ||||| || TTGGACTA 305 hqPGS_C06HBa0153O03.1-1-_SGN-E242274+ (10924 10618) ******************************************************************************** EST sequence 139 +strand 542 n (File: SGN-E252199+) 1 CTTGATGAGA ATTTGTCTTA TGAGGAGGAG CCTGTTGCCA TTTTAGATAG AGAAGTCCGC 61 AAATTGAGGT CAAGGGAGAT TGCATCTATC AAAGTTCAAT GGAAGAATCG ACCAGTTGAA 121 GAATCCACTT GGGAGAAGGA AGTTGATATG CGAGAAAGAT ACCCATACCT GTTTACAGAT 181 TCAGGTACTC CTTTTCGCCC TTGTTTTTCT TCTTGTGATC GTTCGGGGAC GAACGATGGG 241 TAAATTGGTA TCTATTGTAA CGACCTGTTT AGTCGTTTTG AGTAACAGAT TTTATTTCTG 301 GAAAAACTGA CTGAGACGAC GGATCCCACG ACGGACCGTC GAGGGGGTCT CGTTCCAAAA 361 CACTTAGAAT TCTGAAATTT GGGTACTGAA ATCGACTCTC TGAACTTCGT GACGGAATGG 421 CAGGACGGAC CGTCACAGGC GTGATGGGCC GTCACAGACC CTTGGTAAAA ATCTAGTCTC 481 TGAACTCTGT GACGGACGTG CAGGACGGAC CGTCACAGAC TGCGTAATCC CAGGCTGGGT 541 CG Predicted gene structure (within gDNA segment 12456 to 9147): Exon 1 11233 10698 ( 536 n); cDNA 1 518 ( 518 n); score: 0.879 MATCH C06HBa0153O03.1-1- SGN-E252199+ 0.879 536 0.989 C PGS_C06HBa0153O03.1-1-_SGN-E252199+ (11233 10698) Alignment (genomic DNA sequence = upper lines): CTTGATGAGA ATTTGACTTA TGAGGAGGAG CCTGTTGCCA TCATAGATAG AG-A-TTCGC 11176 |||||||||| ||||| |||| |||||||||| |||||||||| | ||||||| || | | ||| CTTGATGAGA ATTTGTCTTA TGAGGAGGAG CCTGTTGCCA TTTTAGATAG AGAAGTCCGC 60 AAGTTGAGAT CAAGGGAGAT TGCATCCATC AAAGTTCAAT GGAAGAATCG ACTAGTTGAA 11116 || ||||| | |||||||||| |||||| ||| |||||||||| |||||||||| || ||||||| AAATTGAGGT CAAGGGAGAT TGCATCTATC AAAGTTCAAT GGAAGAATCG ACCAGTTGAA 120 GAGTCCACGT GGGAGAAGGA GGCTGATATG CGAGAAAGAT ACCCACACCT GTTTACAGAT 11056 || ||||| | |||||||||| | ||||||| |||||||||| ||||| |||| |||||||||| GAATCCACTT GGGAGAAGGA AGTTGATATG CGAGAAAGAT ACCCATACCT GTTTACAGAT 180 TCAAGTACTC CTTTTCGCCC TTGTTTTTCT TCTTGTGATC ATTCGGGGAT GAACGATGGG 10996 ||| |||||| |||||||||| |||||||||| |||||||||| |||||||| |||||||||| TCAGGTACTC CTTTTCGCCC TTGTTTTTCT TCTTGTGATC GTTCGGGGAC GAACGATGGG 240 TAAATTGGTA TCTATTGTAA CGACCTATTT AGTCGTTTTG AGCAGCAGAT TTTATTTCTG 10936 |||||||||| |||||||||| |||||| ||| |||||||||| || | ||||| |||||||||| TAAATTGGTA TCTATTGTAA CGACCTGTTT AGTCGTTTTG AGTAACAGAT TTTATTTCTG 300 GAAAAACTGG CTGAGACGAC GGATCCCACG ACGGACCGTC ATGGGCACGA TGGACCGTCG 10876 ||||||||| |||||||||| |||| | | | | | | || ||||||||| GAAAAACTGA CTGAGACGAC GGAT--C-C- -C--A----C ----G-AC-- -GGACCGTCG 341 AGGGGGTCTC GTTCAAAAAC ACTTAGAATT CTGAAATTTG GATACTGAAA TTGACTCTCT 10816 |||||||||| |||| ||||| |||||||||| |||||||||| | |||||||| | |||||||| AGGGGGTCTC GTTCCAAAAC ACTTAGAATT CTGAAATTTG GGTACTGAAA TCGACTCTCT 401 GAACTTCGTG ACGAAGTGAC AGGACGGACC GTCACAGGCA TGACGGGCCG TCACAGACTC 10756 |||||||||| ||| | || | |||||||||| ||||||||| ||| |||||| |||||||| | GAACTTCGTG ACGGAATGGC AGGACGGACC GTCACAGGCG TGATGGGCCG TCACAGACCC 461 TTCAGTAAAT TTCAGTCTCT GAACTCTGTG ATGGAAGCAG CAGGACGGAC CGTCGCAG 10698 || ||| | ||||||| |||||||||| | ||| | | |||||||||| |||| ||| TTGGTAAAAA TCTAGTCTCT GAACTCTGTG ACGGACG-TG CAGGACGGAC CGTCACAG 518 hqPGS_C06HBa0153O03.1-1-_SGN-E252199+ (11233 10698) ******************************************************************************** EST sequence 178 +strand 763 n (File: SGN-E354383+) 1 AGATTCGATG GAGAAAGTGA AGTCTATTCA AGAAAAGCTT CTAGCGGCGC AAAGTAGACA 61 AAAAGAATAT GCAGATCGAA AGGTTAGAGA CTTAGAGTTC ATGGAGGGCG AACAAGTCTT 121 GTTGAAAGTC TCGCCAATGA AAGGGGTGAT GCGGTTTCGT AAAAGAGGTA AGTTAAGTCC 181 AAGGTATATT GGACCATTTG AGGTACTAAA GCGAGTAGGG GAGGTGGCTT ATGAATTAGC 241 CTTACCTCCA GGGCTGTCCG GAGTACATCC GGTATTTCAT GTATCCATGT TGAAAAGATA 301 CCATGGAGAT GGAAACTACA TCATCCGTTG GGATTCAGTT TCGCTTGGTG AGAATTGGAC 361 TTATGAGGAG GAACCAGTTG CTATTTTAGA TAGAGAGGTT CGCAAGTTGA GGTCGAGAGA 421 AATTGCATCC ATCAAAGTTC AATGGAAGAA TCGACCGGTT GAAGAAGCTA CTTGGGAGAA 481 GGAGGCAGAT ATGCAAGAAA AATACCCACA TCTGTTTACA GATTCAGGTA CTCCTTTTCG 541 CCATTGTTTT TCTTCATTTG ATCGTTCGGG GACGAACGAT GGGTAAATTG GTATCTATTG 601 TAACGACCTG TTTAGTCGTT TTGAGCAGCA GATTTTAATT CTGGAAAAAC TGGCCAAGAC 661 CACGGGCACT GCGACGGACC GTCACAGGCA CGACGGCCCG TCGAAGGTGC ATCGTTTCAG 721 CATACTTAGA ATTTCTGAAT TTGGATATGA AAATCGACTC TCT Predicted gene structure (within gDNA segment 12869 to 9225): Exon 1 11575 10816 ( 760 n); cDNA 2 763 ( 762 n); score: 0.862 MATCH C06HBa0153O03.1-1- SGN-E354383+ 0.862 760 0.996 C PGS_C06HBa0153O03.1-1-_SGN-E354383+ (11575 10816) Alignment (genomic DNA sequence = upper lines): GATTCATTAG AGAAGGTGAA ATCTATTCAA GAAAAGCTCT TAGCGGCTCA AAGCAGGCAA 11516 ||||| | | |||| ||||| ||||||||| |||||||| ||||||| || ||| || ||| GATTCGATGG AGAAAGTGAA GTCTATTCAA GAAAAGCTTC TAGCGGCGCA AAGTAGACAA 61 AAGGAATATG CCGATTGAAA GGTTAGAGAC TTAGAGTTCA TGGAGGGTGA GCAAGTCTTG 11456 || ||||||| | ||| |||| |||||||||| |||||||||| ||||||| || ||||||||| AAAGAATATG CAGATCGAAA GGTTAGAGAC TTAGAGTTCA TGGAGGGCGA ACAAGTCTTG 121 CTGAAGGTTT CACCCATGAA AGGGGTGATG CGGTTTGGAA AAAGAGGTAA GCTAAGCCCA 11396 |||| || | | || ||||| |||||||||| |||||| | | |||||||||| | |||| ||| TTGAAAGTCT CGCCAATGAA AGGGGTGATG CGGTTTCGTA AAAGAGGTAA GTTAAGTCCA 181 AGGTATATTG GACCATTTGA AGTACTTAAG CGAGTAGGGG AGGTGGCTTA TGAATTAGCC 11336 |||||||||| |||||||||| ||||| ||| |||||||||| |||||||||| |||||||||| AGGTATATTG GACCATTTGA GGTACTAAAG CGAGTAGGGG AGGTGGCTTA TGAATTAGCC 241 TTGCTCCCAG GACTGTCAGG AGTGCATCCG GTATTTCATG TGTCTATGTT GAAGAGATAC 11276 || | |||| | ||||| || ||| |||||| |||||||||| | || ||||| ||| |||||| TTACCTCCAG GGCTGTCCGG AGTACATCCG GTATTTCATG TATCCATGTT GAAAAGATAC 301 CATGGGGATG GAAACTACAT CATTCATTGG GATTCGGTTC TTCTTGATGA GAATTTGACT 11216 ||||| |||| |||||||||| ||| | |||| ||||| ||| |||| ||| ||||| |||| CATGGAGATG GAAACTACAT CATCCGTTGG GATTCAGTTT CGCTTGGTGA GAATTGGACT 361 TATGAGGAGG AGCCTGTTGC CATCATAGAT AGAGA--TTC GCAAGTTGAG ATCAAGGGAG 11158 |||||||||| | || ||||| || ||||| ||||| ||| |||||||||| || || || TATGAGGAGG AACCAGTTGC TATTTTAGAT AGAGAGGTTC GCAAGTTGAG GTCGAGAGAA 421 ATTGCATCCA TCAAAGTTCA ATGGAAGAAT CGACTAGTTG AAGAGTCCAC GTGGGAGAAG 11098 |||||||||| |||||||||| |||||||||| |||| |||| |||| | || ||||||||| ATTGCATCCA TCAAAGTTCA ATGGAAGAAT CGACCGGTTG AAGAAGCTAC TTGGGAGAAG 481 GAGGCTGATA TGCGAGAAAG ATACCCACAC CTGTTTACAG ATTCAAGTAC TCCTTTTCGC 11038 ||||| |||| ||| ||||| ||||||||| |||||||||| ||||| |||| |||||||||| GAGGCAGATA TGCAAGAAAA ATACCCACAT CTGTTTACAG ATTCAGGTAC TCCTTTTCGC 541 CCTTGTTTTT CTTCTTGTGA TCATTCGGGG ATGAACGATG GGTAAATTGG TATCTATTGT 10978 | |||||||| |||| | ||| || ||||||| | |||||||| |||||||||| |||||||||| CATTGTTTTT CTTCATTTGA TCGTTCGGGG ACGAACGATG GGTAAATTGG TATCTATTGT 601 AACGACCTAT TTAGTCGTTT TGAGCAGCAG ATTTTATTTC TGGAAAAACT GGCTGAGACG 10918 |||||||| | |||||||||| |||||||||| |||||| ||| |||||||||| ||| |||| AACGACCTGT TTAGTCGTTT TGAGCAGCAG ATTTTAATTC TGGAAAAACT GGCCAAGACC 661 ACGGATCCCA CGACGGACCG TCATGGGCAC GATGGACCGT CGAGGGGGTC TCGTTCAAAA 10858 |||| | |||||||||| ||| ||||| || || |||| ||| || | ||||| | ACGGGCACTG CGACGGACCG TCACAGGCAC GACGGCCCGT CGAAGGTGCA TCGTTTCAGC 721 ACACTTAGAA TTCTGAAATT TGGATACTGA AATTGACTCT CT 10816 | |||||||| || |||| |||||| | ||| |||||| || ATACTTAGAA TTTCTGAATT TGGATATGAA AATCGACTCT CT 763 hqPGS_C06HBa0153O03.1-1-_SGN-E354383+ (11575 10816) ******************************************************************************** EST sequence 85 -strand 548 n (File: SGN-E356257-) 1 GTTAACTAGA AAATTAAAGT GATAGAGTCA AATAATGTAA CGACCCGTTT AGTCGTTTTG 61 AGCAGCAGAC TTTATTTCTG GAAAAACTGG CAGAAGCGAC GGACCCCACG ACGGACCGTC 121 ATGGGCACGA CGGACCATCG CAGGGTCTCG TTTCAAAACC CTCTTTCTTT TACCCCAAAT 181 TAACATATAA TTAAGAATAA AAGATGGCAA TAATACCCCA CTAATTAACT TAGGGTTACC 241 TCTTTTAACC CCAAGAATTT GAGTTATTAA TATAAACCCA CGAAATCTAT AATTAAGGAA 301 AGAATAGTCC AAAAACGTCC CTTAAAACGT GTAAGGAAAT CCGATTCTGC CTGGGATTTG 361 CGCAACCTGT GACGGGCCGT CGTGACTGTG ACGGTCCGTC CTGCAGGTCG TCGCAAGGGT 421 CAGAGAGTCA ATTTCCACTG AACAATCTAT GACGGTCCGT CACGCCTGTG ATGGTCCGTC 481 CTGTCATTCC GTCACGAAGT TCAGAGAGTC GATTTTCAGT ACCCAATTTC AGATTTTCTA 541 AGTGTTTT Predicted gene structure (within gDNA segment 12048 to 6106): Exon 1 10980 10853 ( 128 n); cDNA 36 162 ( 127 n); score: 0.883 MATCH C06HBa0153O03.1-1- SGN-E356257- 0.883 128 0.234 C PGS_C06HBa0153O03.1-1-_SGN-E356257- (10980 10853) Alignment (genomic DNA sequence = upper lines): TGTAACGACC TATTTAGTCG TTTTGAGCAG CAGATTTTAT TTCTGGAAAA ACTGGCTGAG 10921 |||||||||| |||||||| |||||||||| |||| ||||| |||||||||| |||||| || TGTAACGACC CGTTTAGTCG TTTTGAGCAG CAGACTTTAT TTCTGGAAAA ACTGGCAGAA 95 ACGACGGATC CCACGACGGA CCGTCATGGG CACGATGGAC CGTCGAGGGG GTCTCGTTCA 10861 ||||||| | |||||||||| |||||||||| ||||| |||| | ||| || |||||||| GCGACGGACC CCACGACGGA CCGTCATGGG CACGACGGAC CATCG-CAGG GTCTCGTTTC 154 AAAACACT 10853 ||||| || AAAACCCT 162 hqPGS_C06HBa0153O03.1-1-_SGN-E356257- (10980 10853) ******************************************************************************** EST sequence 53 -strand 542 n (File: SGN-E353207-) 1 AAAGAAGCAA AAATCGATGA GGAAGGTGTT TTGAGAATAA GGGAAGGAGT ATGTGTACCC 61 CGCGTCGATG ATTTGATTCA CACTATTCTT ATATAGGCTC ATAGTTCGAA GTACTCTATA 121 CATCCTGGTG CAAACAAGAT GTATCGTGAC CTAAAGCAAC ATTTTTGGTG GAGTAGGATG 181 AAGCGTGACA TTGTTAATTT TGTTGCTCAA TGCCCGAATT GTCAGCAAGT AAAGTATGAA 241 CACCAGAGGC CTGGAGGGAC ACTTCAGAGA ATGCCCATTC CTGAATGGAA GTGGGAGAGA 301 ATTGCAATGG ACTTCGTGGT TAGTCATCCA AAGACGATGG GTAGGTATGA CTCTATTTGG 361 GTGATTGTTG ACAGATTAAC TAAGTCTGCT CACTCTATTT CGGTTAAGGT GACTTACAAT 421 GCAGAGAAGT TAGCCAAACT TTACATCTTA GAAATTGTTC GATTGCACGG AGTTCCACTC 481 TCCATCATAT CAGATAGAGG TACGCAATTT ACTTCTAAGT TTTGGAAAAC AGTACATGCC 541 GA Predicted gene structure (within gDNA segment 14036 to 11160): Exon 1 12400 11859 ( 542 n); cDNA 1 542 ( 542 n); score: 0.900 MATCH C06HBa0153O03.1-1- SGN-E353207- 0.900 542 1.000 C PGS_C06HBa0153O03.1-1-_SGN-E353207- (12400 11859) Alignment (genomic DNA sequence = upper lines): AAAGAAGCAA TAATGCATGA GGAAGGTGTT TTGAGAATTA AGGGATGAGT ATGTGTGCCC 12341 |||||||||| ||| |||| |||||||||| |||||||| | || | |||| |||||| ||| AAAGAAGCAA AAATCGATGA GGAAGGTGTT TTGAGAATAA GGGAAGGAGT ATGTGTACCC 60 CGTGTTGATG ATTTGATCCA TACTATTCTT ACAGAGGCTC ATAGTTCCAG ATATTCTATA 12281 || || |||| ||||||| || ||||||||| | | |||||| ||||||| | || |||||| CGCGTCGATG ATTTGATTCA CACTATTCTT ATATAGGCTC ATAGTTCGAA GTACTCTATA 120 CATCCTGGTG CAACCAAGAT GTACCGTGAC CTAAAGCAAC ACTTTTGGTG GAGTAGGATG 12221 |||||||||| ||| |||||| ||| |||||| |||||||||| | |||||||| |||||||||| CATCCTGGTG CAAACAAGAT GTATCGTGAC CTAAAGCAAC ATTTTTGGTG GAGTAGGATG 180 AAGCGCGACA TTGTGGATTT TGTTGCCAAA TGTCCAAATT GTCAGCAAGT AAAGTATGAC 12161 ||||| |||| |||| |||| |||||| || || || |||| |||||||||| ||||||||| AAGCGTGACA TTGTTAATTT TGTTGCTCAA TGCCCGAATT GTCAGCAAGT AAAGTATGAA 240 CACCAGAGGC CCGGAGGAAC ACTTCAGAGA ATGCCCATTC CTGAATGGAA GTGGGAGAGA 12101 |||||||||| | ||||| || |||||||||| |||||||||| |||||||||| |||||||||| CACCAGAGGC CTGGAGGGAC ACTTCAGAGA ATGCCCATTC CTGAATGGAA GTGGGAGAGA 300 ATTGCAATGG ACTTCGTGGT TGGTCTTCCA AAGACATTGG GGAAGTTTGA CTCTATTTGG 12041 |||||||||| |||||||||| | ||| |||| ||||| ||| | | || ||| |||||||||| ATTGCAATGG ACTTCGTGGT TAGTCATCCA AAGACGATGG GTAGGTATGA CTCTATTTGG 360 GTAATTGTGG ACAGATTAAC TAAGTCTGCT CATTTCATTC CGGTCAAGGT GACTTATAAT 11981 || ||||| | |||||||||| |||||||||| || | ||| |||| ||||| |||||| ||| GTGATTGTTG ACAGATTAAC TAAGTCTGCT CACTCTATTT CGGTTAAGGT GACTTACAAT 420 GCAGAGAAGT TAGCCAAAAT TTACATCTCA GAAATTGTTC GATTGCATGG AGTTCCACTT 11921 |||||||||| |||||||| | |||||||| | |||||||||| ||||||| || ||||||||| GCAGAGAAGT TAGCCAAACT TTACATCTTA GAAATTGTTC GATTGCACGG AGTTCCACTC 480 TCCATCATAT CAGATAGAGG TACGCAGTTT ACTTCTAAGT TTTGGAAAAC ATTGCATGCG 11861 |||||||||| |||||||||| |||||| ||| |||||||||| |||||||||| | | ||||| TCCATCATAT CAGATAGAGG TACGCAATTT ACTTCTAAGT TTTGGAAAAC AGTACATGCC 540 GA 11859 || GA 542 hqPGS_C06HBa0153O03.1-1-_SGN-E353207- (12400 11859) ******************************************************************************** EST sequence 5 -strand 654 n (File: SGN-E578131-) 1 CTTAGCAAGT CCGGCTTCCC ACAGGAGAAC CTTCACGTTT GCGGGCTTCC TCCTCTTCCA 61 GTAACCTCTT TTGACGCTCC TCCTCAGTTC GCAGAAAGAA TAACATTTTC CTCTTAGCTT 121 CCCTCTCCTG CTTCCTTGAT TGAATTATCT GGCTGATCCT TTCTCGTCTC TCCTGCTTCA 181 ATCTATTGAG TTCAGCTTCC CGGCTACTAA CAACTCTTTC TTGCAAAATC CTCTGCATAA 241 TTAGAAACAT ATCAGCGGAT CTGTTTTGTT AAGAGACCAA GACCAATCTT CAAGCAAGCC 301 CGTGGCTTCA CAAATTTTGA ACTAGTGGGA AAGTAGCGTC CATTAGCATT TCTAACCGAA 361 TGGTCAGCAT GTAAAGTATG AACAGCAAAG GCCTGGACGG ACACTTCAGA GAATGCCCAT 421 TCCTGAATGG AAGTGGGAGA GAATTGCAAT GGACTTCGTG GTTGGCCTTC CAAAGACAAT 481 GGGTAAGTAT GACTCCATTT GTGTAATTGT TGACAGATTG ACTAAGTCTG CTCATTGCAT 541 TCCGGTCAAG GTGACCTACA ATGTAGAGAA GTTAGTCAGA ATCTATATCT CAGAAATCGT 601 TCGATTGCAT GGAGTTCCAC TCTCCATCAT ATCAGATAGA GGTATGCAGT TTAC Predicted gene structure (within gDNA segment 16752 to 11199): Exon 1 12187 11889 ( 299 n); cDNA 356 654 ( 299 n); score: 0.906 MATCH C06HBa0153O03.1-1- SGN-E578131- 0.906 299 0.457 C PGS_C06HBa0153O03.1-1-_SGN-E578131- (12187 11889) Alignment (genomic DNA sequence = upper lines): CCAAATTGTC AGCAAGTAAA GTATGACCAC CAGAGGCCCG GAGGAACACT TCAGAGAATG 12128 || ||| ||| |||| ||||| |||||| || || ||||| | || | ||||| |||||||||| CCGAATGGTC AGCATGTAAA GTATGAACAG CAAAGGCCTG GACGGACACT TCAGAGAATG 415 CCCATTCCTG AATGGAAGTG GGAGAGAATT GCAATGGACT TCGTGGTTGG TCTTCCAAAG 12068 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| CCCATTCCTG AATGGAAGTG GGAGAGAATT GCAATGGACT TCGTGGTTGG CCTTCCAAAG 475 ACATTGGGGA AGTTTGACTC TATTTGGGTA ATTGTGGACA GATTAACTAA GTCTGCTCAT 12008 ||| |||| | ||| |||||| ||||| ||| ||||| |||| |||| ||||| |||||||||| ACAATGGGTA AGTATGACTC CATTTGTGTA ATTGTTGACA GATTGACTAA GTCTGCTCAT 535 TTCATTCCGG TCAAGGTGAC TTATAATGCA GAGAAGTTAG CCAAAATTTA CATCTCAGAA 11948 | |||||||| |||||||||| || |||| | |||||||||| || ||| || ||||||||| TGCATTCCGG TCAAGGTGAC CTACAATGTA GAGAAGTTAG TCAGAATCTA TATCTCAGAA 595 ATTGTTCGAT TGCATGGAGT TCCACTTTCC ATCATATCAG ATAGAGGTAC GCAGTTTAC 11889 || ||||||| |||||||||| |||||| ||| |||||||||| ||||||||| ||||||||| ATCGTTCGAT TGCATGGAGT TCCACTCTCC ATCATATCAG ATAGAGGTAT GCAGTTTAC 654 hqPGS_C06HBa0153O03.1-1-_SGN-E578131- (12187 11889) ******************************************************************************** EST sequence 54 -strand 587 n (File: SGN-E352950-) 1 GCAATTAAAG GTGCATGAAC GTAATTATCC GACCCATGAT TTGGAGTTAG CGGCAGTAGT 61 GATTGCATTA AAGTAATAGA GACATTATCT CTATGGGGTT AAGTGTGAAG TCTATATGGA 121 TCATCGTAGT TTACAGTATG TCTTTACTCA GAAAGATTTG AATTTGAGAC AGAGGAGATC 181 GATGGAGCTA CTGAAGGACT ATGATATCAC TATCTTGTAT CATCCGGGAA AGGCTAATGT 241 TGTGGCAGAT GCTTTAAGTA GAAAGGCAGG GAGCATGGGA AGTCTAGCTC ACTTGCAGGT 301 TTCTAGATGC CCATTGGCTA GAGAGGTTCA GACTCTGGCT AATGACCTTA TGAGGCTAGA 361 ATTAAATGAG AAGGGAGAAT TTTTGGCTTG TGTGGAGGCA AGATCTTCCT TTCTTGATAA 421 GATTAAAGGA AAGCAGTTTA CCGATGAGAA ACTGATCTGG ATTCGAGATA AGGTAATGCG 481 AGGAGAGGCT AAAGAAGCAA AAATCGATAA GGAAGGTGTT TTGAGGATTA AGGGAAAGGT 541 ATGTGTACCC CGTGCCGACG ATTTGATTCA CACTATTCTT ACAGAGG Predicted gene structure (within gDNA segment 14400 to 11146): Exon 1 12890 12304 ( 587 n); cDNA 1 587 ( 587 n); score: 0.862 MATCH C06HBa0153O03.1-1- SGN-E352950- 0.862 587 1.000 C PGS_C06HBa0153O03.1-1-_SGN-E352950- (12890 12304) Alignment (genomic DNA sequence = upper lines): GCAATTAAAG GTGCATGAAC GTAACTATCC GACCCACGAT TTAGAATTGA CCGCAGTTGT 12831 |||||||||| |||||||||| |||| ||||| |||||| ||| || || || | ||||| || GCAATTAAAG GTGCATGAAC GTAATTATCC GACCCATGAT TTGGAGTTAG CGGCAGTAGT 60 GTTTGCATTA AAGCAATGGA GACATTATCT ATATGGGGTC AAGTGTGAAG TCTATACAGA 12771 | |||||||| ||| ||| || |||||||||| |||||||| |||||||||| |||||| || GATTGCATTA AAGTAATAGA GACATTATCT CTATGGGGTT AAGTGTGAAG TCTATATGGA 120 TCATCGTATA CTACAGTATG TCTTTACTTA GAAAGAATTG AACTTGAGAC AGAGGAGATG 12711 |||||||| ||||||||| |||||||| | |||||| ||| || ||||||| ||||||||| TCATCGTAGT TTACAGTATG TCTTTACTCA GAAAGATTTG AATTTGAGAC AGAGGAGATC 180 GATTGAACTA CTGAAGGATT ATGATGTTAC CATCTTGTAT CACCCAGGAA AGGCTAATGT 12651 ||| || ||| |||||||| | ||||| | || ||||||||| || || |||| |||||||||| GATGGAGCTA CTGAAGGACT ATGATATCAC TATCTTGTAT CATCCGGGAA AGGCTAATGT 240 TGTGGCAGAC GCCTTAAGTA GAAAAGCAGG GAGCATGGGT AGTTTAACCC ACTTACAAGT 12591 ||||||||| || ||||||| |||| ||||| ||||||||| ||| || | | |||| || || TGTGGCAGAT GCTTTAAGTA GAAAGGCAGG GAGCATGGGA AGTCTAGCTC ACTTGCAGGT 300 TTCTAAACGC CCATTGGCTA GAGAGGTTCA GACTCTGACT AACGAGTTTA TGAGGTTAGA 12531 ||||| | || |||||||||| |||||||||| ||||||| || || || ||| ||||| |||| TTCTAGATGC CCATTGGCTA GAGAGGTTCA GACTCTGGCT AATGACCTTA TGAGGCTAGA 360 AGTAAATGAG AAGGGAGGAT TTTTGGCCAG TGTGGAGGCG AGATCTTCTT TTCTTGACAA 12471 | |||||||| ||||||| || ||||||| | ||||||||| |||||||| | ||||||| || ATTAAATGAG AAGGGAGAAT TTTTGGCTTG TGTGGAGGCA AGATCTTCCT TTCTTGATAA 420 GATCAAGGGA AAACAGTTTG ATGATGAGAA ACTAAGCCGA ATTCGGGATA TGGTGTTGCG 12411 ||| || ||| || |||||| |||||||| ||| | | | ||||| |||| ||| |||| GATTAAAGGA AAGCAGTTTA CCGATGAGAA ACTGATCTGG ATTCGAGATA AGGTAATGCG 480 AGGAGAGGCT AAAGAAGCAA TAATGCATGA GGAAGGTGTT TTGAGAATTA AGGGATGAGT 12351 |||||||||| |||||||||| ||| || | |||||||||| ||||| |||| ||||| || AGGAGAGGCT AAAGAAGCAA AAATCGATAA GGAAGGTGTT TTGAGGATTA AGGGAAAGGT 540 ATGTGTGCCC CGTGTTGATG ATTTGATCCA TACTATTCTT ACAGAGG 12304 |||||| ||| |||| || | ||||||| || ||||||||| ||||||| ATGTGTACCC CGTGCCGACG ATTTGATTCA CACTATTCTT ACAGAGG 587 hqPGS_C06HBa0153O03.1-1-_SGN-E352950- (12890 12304) ******************************************************************************** EST sequence 91 -strand 587 n (File: SGN-E357100-) 1 GCAATTAAAG GTGCATGAAC GTAATTATCC GACCCATGAT TTGGAGTTAG CGGCAGTAGT 61 GATTGCATTA AAGTAATAGA GACATTATCT CTATGGGGTT AAGTGTGAAG TCTATATGGA 121 TCATCGTAGT TTACAGTATG TCTTTACTCA GAAAGATTTG AATTTGAGAC AGAGGAGATC 181 GATGGAGCTA CTGAAGGACT ATGATATCAC TATCTTGTAT CATCCGGGAA AGGCTAATGT 241 TGTGGCAGAT GCTTTAAGTA GAAAGGCAGG GAGCATGGGA AGTCTAGCTC ACTTGCAGGT 301 TTCTAGATGC CCATTGGCTA GAGAGGTTCA GACTCTGGCT AATGACCTTA TGAGGCTAGA 361 ATTAAATGAG AAGGGAGAAT TTTTGGCTTG TGTGGAGGCA AGATCTTCCT TTCTTGATAA 421 GATTAAAGGA AAGCAGTTTA CCGATGAGAA ACTGATCTGG ATTCGAGATA AGGTAATGCG 481 AGGAGAGGCT AAAGAAGCAA AAATCGATAA GGAAGGTGTT TTGAGGATTA AGGGAAAGGT 541 ATGTGTACCC CGTGCCGACG ATTTGATTCA CACTATTCTT ACAGAGG Predicted gene structure (within gDNA segment 14400 to 11146): Exon 1 12890 12304 ( 587 n); cDNA 1 587 ( 587 n); score: 0.862 MATCH C06HBa0153O03.1-1- SGN-E357100- 0.862 587 1.000 C PGS_C06HBa0153O03.1-1-_SGN-E357100- (12890 12304) Alignment (genomic DNA sequence = upper lines): GCAATTAAAG GTGCATGAAC GTAACTATCC GACCCACGAT TTAGAATTGA CCGCAGTTGT 12831 |||||||||| |||||||||| |||| ||||| |||||| ||| || || || | ||||| || GCAATTAAAG GTGCATGAAC GTAATTATCC GACCCATGAT TTGGAGTTAG CGGCAGTAGT 60 GTTTGCATTA AAGCAATGGA GACATTATCT ATATGGGGTC AAGTGTGAAG TCTATACAGA 12771 | |||||||| ||| ||| || |||||||||| |||||||| |||||||||| |||||| || GATTGCATTA AAGTAATAGA GACATTATCT CTATGGGGTT AAGTGTGAAG TCTATATGGA 120 TCATCGTATA CTACAGTATG TCTTTACTTA GAAAGAATTG AACTTGAGAC AGAGGAGATG 12711 |||||||| ||||||||| |||||||| | |||||| ||| || ||||||| ||||||||| TCATCGTAGT TTACAGTATG TCTTTACTCA GAAAGATTTG AATTTGAGAC AGAGGAGATC 180 GATTGAACTA CTGAAGGATT ATGATGTTAC CATCTTGTAT CACCCAGGAA AGGCTAATGT 12651 ||| || ||| |||||||| | ||||| | || ||||||||| || || |||| |||||||||| GATGGAGCTA CTGAAGGACT ATGATATCAC TATCTTGTAT CATCCGGGAA AGGCTAATGT 240 TGTGGCAGAC GCCTTAAGTA GAAAAGCAGG GAGCATGGGT AGTTTAACCC ACTTACAAGT 12591 ||||||||| || ||||||| |||| ||||| ||||||||| ||| || | | |||| || || TGTGGCAGAT GCTTTAAGTA GAAAGGCAGG GAGCATGGGA AGTCTAGCTC ACTTGCAGGT 300 TTCTAAACGC CCATTGGCTA GAGAGGTTCA GACTCTGACT AACGAGTTTA TGAGGTTAGA 12531 ||||| | || |||||||||| |||||||||| ||||||| || || || ||| ||||| |||| TTCTAGATGC CCATTGGCTA GAGAGGTTCA GACTCTGGCT AATGACCTTA TGAGGCTAGA 360 AGTAAATGAG AAGGGAGGAT TTTTGGCCAG TGTGGAGGCG AGATCTTCTT TTCTTGACAA 12471 | |||||||| ||||||| || ||||||| | ||||||||| |||||||| | ||||||| || ATTAAATGAG AAGGGAGAAT TTTTGGCTTG TGTGGAGGCA AGATCTTCCT TTCTTGATAA 420 GATCAAGGGA AAACAGTTTG ATGATGAGAA ACTAAGCCGA ATTCGGGATA TGGTGTTGCG 12411 ||| || ||| || |||||| |||||||| ||| | | | ||||| |||| ||| |||| GATTAAAGGA AAGCAGTTTA CCGATGAGAA ACTGATCTGG ATTCGAGATA AGGTAATGCG 480 AGGAGAGGCT AAAGAAGCAA TAATGCATGA GGAAGGTGTT TTGAGAATTA AGGGATGAGT 12351 |||||||||| |||||||||| ||| || | |||||||||| ||||| |||| ||||| || AGGAGAGGCT AAAGAAGCAA AAATCGATAA GGAAGGTGTT TTGAGGATTA AGGGAAAGGT 540 ATGTGTGCCC CGTGTTGATG ATTTGATCCA TACTATTCTT ACAGAGG 12304 |||||| ||| |||| || | ||||||| || ||||||||| ||||||| ATGTGTACCC CGTGCCGACG ATTTGATTCA CACTATTCTT ACAGAGG 587 hqPGS_C06HBa0153O03.1-1-_SGN-E357100- (12890 12304) ******************************************************************************** EST sequence 40 -strand 554 n (File: SGN-E352647-) 1 CCATGATTTG GAGTTAGCGG CAGTAGTGAT TGCATTAAAG TAATAGAGCC ATTATCTCTA 61 TGGGTTTAAG TGTGAAGTCT ATATGGATCA TCGTAGTTTA CAGTATGTCT TTACTCAGAA 121 AGATTTGAAT TTGAGACAGA GGAGATCGAT GGAGCTACTG AAGGACTATG ATATCACTAT 181 CTTGTATCAT CCGGGAAAGG CTAATGTTGT GGCAGATGCT TTAAGTAGAA AGGCAGGGAG 241 CATGGGAAGT CTAGCTCACT TGCAGGTTTC TAGATGCCCA TTGGCTAGAG AGGTTCAGAC 301 TCTGGCTAAT GACCTTATGA GGCTAGAATT AAATGAGAAG GGAGAATTTT TGGCTTGTGT 361 GGAGGCAAGA TCTTCCTTTC TTGATAAGAT TAAAGGAAAG CAGTTTACCG ATGAGAAACT 421 GATCTGGATT CGAGATAAGG TAATGCGAGG AGAGGCTAAA GAAGCAAAAA TCGATAAGGA 481 AGGTGTTTTG AGGATTAAGG GAAAGGTATG TGTACCCCGT GCCGACGATT TGATTCACAC 541 TATTCTTACA GAGG Predicted gene structure (within gDNA segment 14070 to 11146): Exon 1 12857 12304 ( 554 n); cDNA 1 554 ( 554 n); score: 0.852 MATCH C06HBa0153O03.1-1- SGN-E352647- 0.852 554 1.000 C PGS_C06HBa0153O03.1-1-_SGN-E352647- (12857 12304) Alignment (genomic DNA sequence = upper lines): CCACGATTTA GAATTGACCG CAGTTGTGTT TGCATTAAAG CAATGGAGAC ATTATCTATA 12798 ||| ||||| || || | | |||| ||| | |||||||||| ||| ||| | ||||||| || CCATGATTTG GAGTTAGCGG CAGTAGTGAT TGCATTAAAG TAATAGAGCC ATTATCTCTA 60 TGGGGTCAAG TGTGAAGTCT ATACAGATCA TCGTATACTA CAGTATGTCT TTACTTAGAA 12738 |||| | ||| |||||||||| ||| ||||| ||||| || |||||||||| ||||| |||| TGGGTTTAAG TGTGAAGTCT ATATGGATCA TCGTAGTTTA CAGTATGTCT TTACTCAGAA 120 AGAATTGAAC TTGAGACAGA GGAGATGGAT TGAACTACTG AAGGATTATG ATGTTACCAT 12678 ||| ||||| |||||||||| |||||| ||| || |||||| ||||| |||| || | || || AGATTTGAAT TTGAGACAGA GGAGATCGAT GGAGCTACTG AAGGACTATG ATATCACTAT 180 CTTGTATCAC CCAGGAAAGG CTAATGTTGT GGCAGACGCC TTAAGTAGAA AAGCAGGGAG 12618 ||||||||| || ||||||| |||||||||| |||||| || |||||||||| | |||||||| CTTGTATCAT CCGGGAAAGG CTAATGTTGT GGCAGATGCT TTAAGTAGAA AGGCAGGGAG 240 CATGGGTAGT TTAACCCACT TACAAGTTTC TAAACGCCCA TTGGCTAGAG AGGTTCAGAC 12558 |||||| ||| || | |||| | || ||||| || | ||||| |||||||||| |||||||||| CATGGGAAGT CTAGCTCACT TGCAGGTTTC TAGATGCCCA TTGGCTAGAG AGGTTCAGAC 300 TCTGACTAAC GAGTTTATGA GGTTAGAAGT AAATGAGAAG GGAGGATTTT TGGCCAGTGT 12498 |||| |||| || |||||| || ||||| | |||||||||| |||| ||||| |||| |||| TCTGGCTAAT GACCTTATGA GGCTAGAATT AAATGAGAAG GGAGAATTTT TGGCTTGTGT 360 GGAGGCGAGA TCTTCTTTTC TTGACAAGAT CAAGGGAAAA CAGTTTGATG ATGAGAAACT 12438 |||||| ||| ||||| |||| |||| ||||| || ||||| |||||| | |||||||||| GGAGGCAAGA TCTTCCTTTC TTGATAAGAT TAAAGGAAAG CAGTTTACCG ATGAGAAACT 420 AAGCCGAATT CGGGATATGG TGTTGCGAGG AGAGGCTAAA GAAGCAATAA TGCATGAGGA 12378 | | | ||| || |||| || | ||||||| |||||||||| ||||||| || | || |||| GATCTGGATT CGAGATAAGG TAATGCGAGG AGAGGCTAAA GAAGCAAAAA TCGATAAGGA 480 AGGTGTTTTG AGAATTAAGG GATGAGTATG TGTGCCCCGT GTTGATGATT TGATCCATAC 12318 |||||||||| || ||||||| || ||||| ||| |||||| | || |||| |||| || || AGGTGTTTTG AGGATTAAGG GAAAGGTATG TGTACCCCGT GCCGACGATT TGATTCACAC 540 TATTCTTACA GAGG 12304 |||||||||| |||| TATTCTTACA GAGG 554 hqPGS_C06HBa0153O03.1-1-_SGN-E352647- (12857 12304) ******************************************************************************** EST sequence 65 +strand 694 n (File: SGN-E353359+) 1 TGTAGTCTAT GCACATTCAA AAACTGCCGA TCTTTACAAC AAAACCGGAG CACCCCAAGG 61 AGATGCACTT GGTCTAATGA AGCCTTTGTT CAATAACTCT TGAAGTTGTG CCTTTAACTC 121 TCTTAACTCT GCGGGAGCCA TTCTATAAGG GGGTATAAAA ATGGGGCGTG TGCCCGGTTC 181 GAGATCAATA CAGAAGTCAA TATCCCTATC CGGTGGCATA CCAGGATGAT CTGCAGGGAA 241 CACATCCATA AACTCACGAA CTACTGAAAC TGAGTCAATC GAAGGTACTT GGGTAGTGTT 301 ATCCTTGAGA TGTGCCAAGA AAGCTAAACA CCCTTTACTA ACCATTTTCT TAGCACGAAG 361 AAAGGAGATG ATACGCACCG GATTGGAAGT GTAGTCACCC TCCTACACTA ACAGATCTGT 421 CCCAGGCTTG GCTAACGTCA CGGTTTTAGC ATTACAATCC AAGATTGCAA AATTTGGAGA 481 AAGCCAAGTC ATACCCAGAA TTACATCGAA GTCAACCATT TCTTGCAAGG ATTTTGCCGT 541 AGCCGCTACC TGTAACGCTG AAATCCGCAA CTCTGACCTC AACCCTTTCA CAAAACGACG 601 AATCCTCTCT TGTGGACTGA AACAAAGTTG AGTGGCATAT CTGGATAGTG CACGAAACTT 661 AGCCTCATAT GCATTGACCT ACATCCTACC TTGC Predicted gene structure (within gDNA segment 12710 to 16679): Exon 1 13740 14211 ( 472 n); cDNA 44 519 ( 476 n); score: 0.870 Intron 1 14212 14955 ( 744 n); Pd: 0.000 (s: 0.94), Pa: 0.000 (s: 0.94) Exon 2 14956 15130 ( 175 n); cDNA 520 694 ( 175 n); score: 0.891 MATCH C06HBa0153O03.1-1+ SGN-E353359+ 0.876 647 0.932 C PGS_C06HBa0153O03.1-1+_SGN-E353359+ (13740 14211,14956 15130) Alignment (genomic DNA sequence = upper lines): ACCGGAGCAC CCCAAGGAGA AGCACTTGGT CTAATGAATC CTTCGCTCAA CAACTCTTGA 13799 |||||||||| |||||||||| ||||||||| |||||||| | ||| | |||| ||||||||| ACCGGAGCAC CCCAAGGAGA TGCACTTGGT CTAATGAAGC CTTTGTTCAA TAACTCTTGA 103 AGTTGGGCAT TTAACTCTCT CAACTCCGTG GGAGCCATTC TATAAGAGGG ATATAGAAAT 13859 ||||| || | |||||||||| ||||| | | |||||||||| |||||| ||| |||| |||| AGTTGTGCCT TTAACTCTCT TAACTCTGCG GGAGCCATTC TATAAG-GGG GTATAAAAAT 162 GGGGCGAGTA CCTGGTTTAA GATCAATGCA GAAGTCAATA TCTCTATCCG GTGGCATACC 13919 |||||| || || |||| | ||||||| || |||||||||| || ||||||| |||||||||| GGGGCGTGTG CCCGGTTCGA GATCAATACA GAAGTCAATA TCCCTATCCG GTGGCATACC 222 CGGAAGGTCT GC-GGGAA-A C-T--AAAAA CTCACGAACT ATCAAAACCG ACTCAATTGG 13974 ||| | ||| || ||||| | | | | ||| |||||||||| | |||| | | ||||| | AGGATGATCT GCAGGGAACA CATCCATAAA CTCACGAACT ACTGAAACTG AGTCAATCGA 282 AGGTACTTGG GTAGTATCAT CCCTGAGATG TGCGAAGAAC GCTAAACAAA CCTTACTAAC 14034 |||||||||| ||||| | || || ||||||| ||| ||||| |||||||| | |||||||| AGGTACTTGG GTAGTGTTAT CCTTGAGATG TGCCAAGAAA GCTAAACACC CTTTACTAAC 342 CATTTTCTTA GCACGAAGAA AGGAGATGAT ACGAACCGGA TCGGAAGTGT AGTCACCCTC 14094 |||||||||| |||||||||| |||||||||| ||| |||||| | |||||||| |||||||||| CATTTTCTTA GCACGAAGAA AGGAGATGAT ACGCACCGGA TTGGAAGTGT AGTCACCCTC 402 CCACACTAAC GGATCTGTCC CAGGCTTGGC TAACATCACA GTTTTAGCAT TACAATCCAA 14154 | |||||||| ||||||||| |||||||||| |||| |||| |||||||||| |||||||||| CTACACTAAC AGATCTGTCC CAGGCTTGGC TAACGTCACG GTTTTAGCAT TACAATCCAA 462 GATCGCAAAA TTCGGAGAAA GCCAAGTCAT ACCCAGAATT ACATCAAAAT CAACCATTTC 14214 ||| |||||| || ||||||| |||||||||| |||||||||| ||||| || | ||||||| GATTGCAAAA TTTGGAGAAA GCCAAGTCAT ACCCAGAATT ACATCGAAGT CAACCAT... 519 TAAGATAACC AAGTCTACAT AAGTATTGCT CCCCACAAAA GTCACCAGAC CAGACCTATA 14274 .......... .......... .......... .......... .......... .......... 519 TACTTTTTCA ACTATCACAG ACTCACCCAC CGGAGTAGAA ACACGAATAG GCATGTCAAG 14334 .......... .......... .......... .......... .......... .......... 519 CAATTCACAA TGTAAATTAA GACCATTAGC AAATGAGGAA GATACATAAG AAAATGTGGA 14394 .......... .......... .......... .......... .......... .......... 519 TCCAGGATAA AACAATACAT AAGCAATGCA ATCACAAACC AAAAGATTAC CTGTGATGAC 14454 .......... .......... .......... .......... .......... .......... 519 AGCATCCGAT GTCTCCGCTT CAGATCTCCC AGGGAAAGCA TAACAACGGG CCCTATCACC 14514 .......... .......... .......... .......... .......... .......... 519 TGTCTGCCCG TTGCCCCTAC CATGTTGCGC TGCAGTAGTT CCCATTTGTC CGTCACCCCC 14574 .......... .......... .......... .......... .......... .......... 519 GCCGTTTTGG TGACCACCAT TACCTCGGCC ACCATGTCCT CCAGAATAGC GGCCTCTACC 14634 .......... .......... .......... .......... .......... .......... 519 ATGACCACCT CTACCTCTAG CTATTGAGGG TCTATAACTC TATTTTGGAC AATTCCTCCT 14694 .......... .......... .......... .......... .......... .......... 519 AATATGTCCA GTCTCCCCAC ATCCATAACA CTCCCTGGAG TCAAGCATAG GTCTCTCAGA 14754 .......... .......... .......... .......... .......... .......... 519 GAAGTGTTGA CCGGTCTGAG GTGGACCCCC AACTACAGTT TGTAGTGAAG ACTGAATTGG 14814 .......... .......... .......... .......... .......... .......... 519 TTGGACTGAG TAACCTCCTG AACCCTGTCC TCTAGAGTAA GAACCATTAA ACTCACCTCC 14874 .......... .......... .......... .......... .......... .......... 519 CTTTCGAAGA ATTTTTGATG ACATTGCCAT GGTGAAGTCA TCTGGCTTCA CTCCTTCTAC 14934 .......... .......... .......... .......... .......... .......... 519 CTCTATCACG AAGTCTACCT CTTCTTGAAA AGATTTTGCC GTAGCCGCTA CCTGTAAGGC 14994 |||||| || ||||||||| |||||||||| ||||||| || .......... .......... .TTCTTGCAA GGATTTTGCC GTAGCCGCTA CCTGTAACGC 558 TGAAATCCGC AATTCTGACC TTAACCCCTT CATAAAATGG TGAATCCGCT CTTGTGGACT 15054 |||||||||| || ||||||| | ||||| || || |||| | |||||| || |||||||||| TGAAATCCGC AACTCTGACC TCAACCCTTT CACAAAACGA CGAATCCTCT CTTGTGGACT 618 GAAATAAAGT TGGGTGGCAT ACCTGGATAA TGCACGAAAC TTAGCCTCAT ATGCGGTAAC 15114 |||| ||||| || ||||||| | ||||||| |||||||||| |||||||||| |||| | || GAAACAAAGT TGAGTGGCAT ATCTGGATAG TGCACGAAAC TTAGCCTCAT ATGCATTGAC 678 CGACATCCTA CCTTGC 15130 | |||||||| |||||| CTACATCCTA CCTTGC 694 hqPGS_C06HBa0153O03.1-1+_SGN-E353359+ (13740 14211,14956 15130) ******************************************************************************** EST sequence 17 +strand 712 n (File: SGN-E379315+) 1 GAATCCCTTG ACAAATCGGC GGTAGTAGCT AGCTAACCCA ACAAAGCTCC TTATTTCTGA 61 CACATTAGTA GGTCTTACCC AATTCTTCAC TGTCTCAATC TTAGAAGGAT CCACCATCAC 121 TCCATCCTTA GAAACCACGT TCCCCAAGAA GGACACTACA TCTAGCCAAA ACTCACACTT 181 AGAGAACTTG GCATAAAACT TTTTCTCCCT CAACATTTCC AATACCATTC TCAAATGCTC 241 TTCATGTTCC TCCTTACTTT TAGAGTATAT CAGTATATCA TCAATAAATA CGATCACAAA 301 GAGATCCAAA TATGGCTTAA AAATCCCTTT TATTAAACTC ATAAACGCAG CAGGGGCATT 361 CATAAGACCA AAAGACATCA CTACAAATTT GTAATGCCCA TACCTGGTTC TAAAAGCAGT 421 CTTTGGCACA TCCGTTGCCC GTATTTTCAA TTGATGATAA CCGGATCTCA AGTCAATCTT 481 AGAAAAGACA CAAGCACCTT GTAACTGATC GAACAAGTCA TTAATGCGAG GAAGAGGATA 541 CTTGTTCTTT ATGGTTACCT TGTTNCACTG CCGGTAGCCT ATGCACATCC GAAAACTCCA 601 ATCCTTCTTC TTTACAAACA AAACCGGAGC ACCCCAAGGA GATGCAGTTG GTCTAATGAA 661 GCCTTTGCTC ACAACTCTTT GAAGTGGTCT TTTAACTCTC TTAACTCTGC GG Predicted gene structure (within gDNA segment 12185 to 16158): Exon 1 13118 13830 ( 713 n); cDNA 1 712 ( 712 n); score: 0.901 MATCH C06HBa0153O03.1-1+ SGN-E379315+ 0.901 713 1.001 C PGS_C06HBa0153O03.1-1+_SGN-E379315+ (13118 13830) Alignment (genomic DNA sequence = upper lines): GAATCCCTTA ACAAATCGAC GGTAATAGCT AGCTAAACCA ACAAAGCTCC TTATTTTTGA 13177 ||||||||| |||||||| | |||| ||||| |||||| ||| |||||||||| |||||| ||| GAATCCCTTG ACAAATCGGC GGTAGTAGCT AGCTAACCCA ACAAAGCTCC TTATTTCTGA 60 CACATTAGTA GGTATTACCC AATTCTTCAC TGTCTCAATC TTAGAAGGAT CCACCATCAC 13237 |||||||||| ||| |||||| |||||||||| |||||||||| |||||||||| |||||||||| CACATTAGTA GGTCTTACCC AATTCTTCAC TGTCTCAATC TTAGAAGGAT CCACCATCAC 120 CCCATCCTTA GAAACCACAT GCCCCAAGAA GGACATTGCA TCTAGCCAAA ACTCACACTT 13297 ||||||||| |||||||| | ||||||||| ||||| | || |||||||||| |||||||||| TCCATCCTTA GAAACCACGT TCCCCAAGAA GGACACTACA TCTAGCCAAA ACTCACACTT 180 GGAGAATTTT GCATAAAGC- TTTTCTCCCT CAACATTTCC AATACAATTC TCAAGTGCTC 13356 ||||| || ||||||| | |||||||||| |||||||||| ||||| |||| |||| ||||| AGAGAACTTG GCATAAAACT TTTTCTCCCT CAACATTTCC AATACCATTC TCAAATGCTC 240 CTCATATTCC TTCTTGCTCT TTAAGTATAC CAATATATCA TCAATAAATA CGATCACAAA 13416 |||| |||| | ||| || | | |||||| || ||||||| |||||||||| |||||||||| TTCATGTTCC TCCTTACTTT TAGAGTATAT CAGTATATCA TCAATAAATA CGATCACAAA 300 GAGATCTAGA TATGGCTTAA AAATCCCATT CATCAAGCTC ATGAAAGCAG CAGGGGCATT 13476 |||||| | | |||||||||| ||||||| || || || ||| || || |||| |||||||||| GAGATCCAAA TATGGCTTAA AAATCCCTTT TATTAAACTC ATAAACGCAG CAGGGGCATT 360 CGTAAGACCA AAAGACATCA CTACAAATTC GTAATGCCCA TACCTGGTCC GAAAAAGCAG 13536 | |||||||| |||||||||| ||||||||| |||||||||| |||||||| | |||||||| CATAAGACCA AAAGACATCA CTACAAATTT GTAATGCCCA TACCTGGTTC -TAAAAGCAG 419 TCTTTGGCAC ATCCGTTGCC CGCATTTTCA ATTGATGATA ACCGGACCTC AAGTCAATCT 13596 |||||||||| |||||||||| || ||||||| |||||||||| |||||| ||| |||||||||| TCTTTGGCAC ATCCGTTGCC CGTATTTTCA ATTGATGATA ACCGGATCTC AAGTCAATCT 479 TAGAGAAGAC ACAAGCACCT TGTAACTGAT CGAACAAGTC ATCAATGCGA GGAATGGGAT 13656 |||| ||||| |||||||||| |||||||||| |||||||||| || ||||||| |||| |||| TAGAAAAGAC ACAAGCACCT TGTAACTGAT CGAACAAGTC ATTAATGCGA GGAAGAGGAT 539 ACTTGTTCTT AATTGTTACC TTGTTCAACT GCCGGTAGTC TATGCACATC CGAAAACTCC 13716 |||||||||| || |||||| ||||| ||| |||||||| | |||||||||| |||||||||| ACTTGTTCTT TATGGTTACC TTGTTNCACT GCCGGTAGCC TATGCACATC CGAAAACTCC 599 CATCCTTCTT CTTCACAAAT AACACCGGAG CACCCCAAGG AGAAGCACTT GGTCTAATGA 13776 ||||||||| ||| ||||| || ||||||| |||||||||| ||| ||| || |||||||||| AATCCTTCTT CTTTACAAAC AAAACCGGAG CACCCCAAGG AGATGCAGTT GGTCTAATGA 659 ATCCTTCGCT CAACAACTCT TGAAGTTGGG CATTTAACTC TCTCAACTCC GTGG 13830 | |||| ||| | |||||||| | | ||| | |||||||| ||| ||||| | || AGCCTTTGCT C-ACAACTCT TTGAAGTGGT CTTTTAACTC TCTTAACTCT GCGG 712 hqPGS_C06HBa0153O03.1-1+_SGN-E379315+ (13118 13830) ******************************************************************************** EST sequence 35 +strand 596 n (File: SGN-E375319+) 1 GAATCCCTTG ACAAATCGGC GGTAGTAGCT AGCTAACCCA ACAAAGCTCC TTATTTCTGA 61 CACATTAGTA GGTCTTACCC AATTCTTCAC TGTCTCAATC TTAGAAGGAT CCACCATCAC 121 TCCATCCTTA GAAACCACGT TCCCCAAGAA GGACACTACA TCTAGCCAAA ACTCACACTT 181 AGAGAACTTG GCATAAAACT TTTTCTCCCT CAACATTTCC AATACCATTC TCAAATGCTC 241 TTCATGTTCC TCCTTACTTT TAGAGTATAT CAGTATATCA TCAATAAATA CGATCACAAA 301 GAGATCCAAA TATGGCTTAA AAATCCCTTT TATTAAACTC ATAAACGCAG CAGGGGCATT 361 CATAAGACCA AAAGACATCA CTACAAATTT GTAATGCCCA TACCTGGTTC TAAAAGCAGT 421 CTTTGGCACA TCCGTTGCCC GTATTTTCAA TTGATGATAA CCGGATCTCA AGTCAATCTT 481 AGAAAAGACA CAAGCACCTT GTAACTGATC GAACAAGTCA TTAATGCGAG GAAGAGGATA 541 CTTGTTCTTT ATGGTTACCT TGTTCAACTG CCGGTAGCCT ATGCACATCC GAAAAC Predicted gene structure (within gDNA segment 12185 to 14323): Exon 1 13118 13713 ( 596 n); cDNA 1 596 ( 596 n); score: 0.915 MATCH C06HBa0153O03.1-1+ SGN-E375319+ 0.915 596 1.000 C PGS_C06HBa0153O03.1-1+_SGN-E375319+ (13118 13713) Alignment (genomic DNA sequence = upper lines): GAATCCCTTA ACAAATCGAC GGTAATAGCT AGCTAAACCA ACAAAGCTCC TTATTTTTGA 13177 ||||||||| |||||||| | |||| ||||| |||||| ||| |||||||||| |||||| ||| GAATCCCTTG ACAAATCGGC GGTAGTAGCT AGCTAACCCA ACAAAGCTCC TTATTTCTGA 60 CACATTAGTA GGTATTACCC AATTCTTCAC TGTCTCAATC TTAGAAGGAT CCACCATCAC 13237 |||||||||| ||| |||||| |||||||||| |||||||||| |||||||||| |||||||||| CACATTAGTA GGTCTTACCC AATTCTTCAC TGTCTCAATC TTAGAAGGAT CCACCATCAC 120 CCCATCCTTA GAAACCACAT GCCCCAAGAA GGACATTGCA TCTAGCCAAA ACTCACACTT 13297 ||||||||| |||||||| | ||||||||| ||||| | || |||||||||| |||||||||| TCCATCCTTA GAAACCACGT TCCCCAAGAA GGACACTACA TCTAGCCAAA ACTCACACTT 180 GGAGAATTTT GCATAAAGC- TTTTCTCCCT CAACATTTCC AATACAATTC TCAAGTGCTC 13356 ||||| || ||||||| | |||||||||| |||||||||| ||||| |||| |||| ||||| AGAGAACTTG GCATAAAACT TTTTCTCCCT CAACATTTCC AATACCATTC TCAAATGCTC 240 CTCATATTCC TTCTTGCTCT TTAAGTATAC CAATATATCA TCAATAAATA CGATCACAAA 13416 |||| |||| | ||| || | | |||||| || ||||||| |||||||||| |||||||||| TTCATGTTCC TCCTTACTTT TAGAGTATAT CAGTATATCA TCAATAAATA CGATCACAAA 300 GAGATCTAGA TATGGCTTAA AAATCCCATT CATCAAGCTC ATGAAAGCAG CAGGGGCATT 13476 |||||| | | |||||||||| ||||||| || || || ||| || || |||| |||||||||| GAGATCCAAA TATGGCTTAA AAATCCCTTT TATTAAACTC ATAAACGCAG CAGGGGCATT 360 CGTAAGACCA AAAGACATCA CTACAAATTC GTAATGCCCA TACCTGGTCC GAAAAAGCAG 13536 | |||||||| |||||||||| ||||||||| |||||||||| |||||||| | |||||||| CATAAGACCA AAAGACATCA CTACAAATTT GTAATGCCCA TACCTGGTTC -TAAAAGCAG 419 TCTTTGGCAC ATCCGTTGCC CGCATTTTCA ATTGATGATA ACCGGACCTC AAGTCAATCT 13596 |||||||||| |||||||||| || ||||||| |||||||||| |||||| ||| |||||||||| TCTTTGGCAC ATCCGTTGCC CGTATTTTCA ATTGATGATA ACCGGATCTC AAGTCAATCT 479 TAGAGAAGAC ACAAGCACCT TGTAACTGAT CGAACAAGTC ATCAATGCGA GGAATGGGAT 13656 |||| ||||| |||||||||| |||||||||| |||||||||| || ||||||| |||| |||| TAGAAAAGAC ACAAGCACCT TGTAACTGAT CGAACAAGTC ATTAATGCGA GGAAGAGGAT 539 ACTTGTTCTT AATTGTTACC TTGTTCAACT GCCGGTAGTC TATGCACATC CGAAAAC 13713 |||||||||| || |||||| |||||||||| |||||||| | |||||||||| ||||||| ACTTGTTCTT TATGGTTACC TTGTTCAACT GCCGGTAGCC TATGCACATC CGAAAAC 596 hqPGS_C06HBa0153O03.1-1+_SGN-E375319+ (13118 13713) ******************************************************************************** EST sequence 23 +strand 526 n (File: SGN-E204434+) 1 GAATCCCTTG ACAAATCGGC GGTAGTAGCT AGCTAACCCA ACAAAGCTCC TTATTTCTGA 61 CACATTAGTA GGTCTTACCC AATTCTTCAC TGTCTCAATC TTAGAAGGAT CCACCATCAC 121 TCCATCCTTA GAAACCACGT TCCCCAAGAA GGACACTACA TCTAGCCAAA ACTCACACTT 181 AGAGAACTTG GCATAAAACT TTTTCTCCCT CAACATTTCC AATACCATTC TCAAATGCTC 241 TTCATGTTCC TCCTTACTTT TAGAGTATAT CAGTATATCA TCAATAAATA CGATCACAAA 301 GAGATCCAAA TATGGCTTAA AAATCCCTTT TATTAAACTC ATAAACGCAG CAGGGGCATT 361 CATAAGACCA AAAGACATCA CTACAAATTT GTAATGCCCA TACCTGGTTC TAAAAGCAGT 421 CTTTGGCACA TCCGTTGCCC GTATTTTCAA TTGATGATAA CCGGATCTCA AGTCAATCTT 481 AGAAAAGACA CAAGCACCTT GTAACTGATC GAACAAGTCA TTAATG Predicted gene structure (within gDNA segment 12185 to 14298): Exon 1 13118 13643 ( 526 n); cDNA 1 526 ( 526 n); score: 0.913 MATCH C06HBa0153O03.1-1+ SGN-E204434+ 0.913 526 1.000 C PGS_C06HBa0153O03.1-1+_SGN-E204434+ (13118 13643) Alignment (genomic DNA sequence = upper lines): GAATCCCTTA ACAAATCGAC GGTAATAGCT AGCTAAACCA ACAAAGCTCC TTATTTTTGA 13177 ||||||||| |||||||| | |||| ||||| |||||| ||| |||||||||| |||||| ||| GAATCCCTTG ACAAATCGGC GGTAGTAGCT AGCTAACCCA ACAAAGCTCC TTATTTCTGA 60 CACATTAGTA GGTATTACCC AATTCTTCAC TGTCTCAATC TTAGAAGGAT CCACCATCAC 13237 |||||||||| ||| |||||| |||||||||| |||||||||| |||||||||| |||||||||| CACATTAGTA GGTCTTACCC AATTCTTCAC TGTCTCAATC TTAGAAGGAT CCACCATCAC 120 CCCATCCTTA GAAACCACAT GCCCCAAGAA GGACATTGCA TCTAGCCAAA ACTCACACTT 13297 ||||||||| |||||||| | ||||||||| ||||| | || |||||||||| |||||||||| TCCATCCTTA GAAACCACGT TCCCCAAGAA GGACACTACA TCTAGCCAAA ACTCACACTT 180 GGAGAATTTT GCATAAAGC- TTTTCTCCCT CAACATTTCC AATACAATTC TCAAGTGCTC 13356 ||||| || ||||||| | |||||||||| |||||||||| ||||| |||| |||| ||||| AGAGAACTTG GCATAAAACT TTTTCTCCCT CAACATTTCC AATACCATTC TCAAATGCTC 240 CTCATATTCC TTCTTGCTCT TTAAGTATAC CAATATATCA TCAATAAATA CGATCACAAA 13416 |||| |||| | ||| || | | |||||| || ||||||| |||||||||| |||||||||| TTCATGTTCC TCCTTACTTT TAGAGTATAT CAGTATATCA TCAATAAATA CGATCACAAA 300 GAGATCTAGA TATGGCTTAA AAATCCCATT CATCAAGCTC ATGAAAGCAG CAGGGGCATT 13476 |||||| | | |||||||||| ||||||| || || || ||| || || |||| |||||||||| GAGATCCAAA TATGGCTTAA AAATCCCTTT TATTAAACTC ATAAACGCAG CAGGGGCATT 360 CGTAAGACCA AAAGACATCA CTACAAATTC GTAATGCCCA TACCTGGTCC GAAAAAGCAG 13536 | |||||||| |||||||||| ||||||||| |||||||||| |||||||| | |||||||| CATAAGACCA AAAGACATCA CTACAAATTT GTAATGCCCA TACCTGGTTC -TAAAAGCAG 419 TCTTTGGCAC ATCCGTTGCC CGCATTTTCA ATTGATGATA ACCGGACCTC AAGTCAATCT 13596 |||||||||| |||||||||| || ||||||| |||||||||| |||||| ||| |||||||||| TCTTTGGCAC ATCCGTTGCC CGTATTTTCA ATTGATGATA ACCGGATCTC AAGTCAATCT 479 TAGAGAAGAC ACAAGCACCT TGTAACTGAT CGAACAAGTC ATCAATG 13643 |||| ||||| |||||||||| |||||||||| |||||||||| || |||| TAGAAAAGAC ACAAGCACCT TGTAACTGAT CGAACAAGTC ATTAATG 526 hqPGS_C06HBa0153O03.1-1+_SGN-E204434+ (13118 13643) ******************************************************************************** EST sequence 81 +strand 679 n (File: SGN-E368762+) 1 GTCTCACCCA ATTCTTCACT GTCTCAATCT TACAAGGATC CACCATCACT CCATCCTTAG 61 AAACCACGTG CCCCAAGAAG GACACTGCAT CTATCCAAAA CTCACACTTA GAGAACTTGG 121 CATAAAGCTT TTTCTCCCTC AACATTTCTA ATACAATTCT CAAATGCTCC TCATGTTCCT 181 TCTTACTATT AGAGTATATC AGTATATCAT CAATAAACAT GATGACAAAG AGATCCAAAT 241 ATGGCTTAAA ATCCCGTTCA TCAAGCTCAT GAACGCAGCA GGGGCATTCG TAAGACCAAA 301 AGACATCACT ACAAATTCGT AATGCCCATA CCTGGTTCTA AAAGCAGTCT TTGGCACATC 361 CGTTGCCCGT ATTTTCAATT GATGATAACC GGATCTCAAG TCAATCTTAG AGAAGACACA 421 AGCACCTTGT AACTGATCGA ACAAGTCATC AATGCGAGGA AGTGGATACT TGTTCTTAAT 481 AGTTACCTTG TTCAACTGCC GGTAGTCTAT GCACATCCGA AAACTCCCAT CCTTCTTCTT 541 CACAAACCAA ACCGGAGCAC CCCAAGGAGA TGCACTTGGT CTAATGAAGA CTTTGCTCAA 601 AAACTCTTGA AGTTGGGCCT TTAACTCTCT TAACTCCGCG GGAGCCATTC TATAAGGGGG 661 TATAGAAATG GGGCGAGTG Predicted gene structure (within gDNA segment 12544 to 15027): Exon 1 13189 13868 ( 680 n); cDNA 1 678 ( 678 n); score: 0.926 MATCH C06HBa0153O03.1-1+ SGN-E368762+ 0.926 680 1.001 C PGS_C06HBa0153O03.1-1+_SGN-E368762+ (13189 13868) Alignment (genomic DNA sequence = upper lines): GTATTACCCA ATTCTTCACT GTCTCAATCT TAGAAGGATC CACCATCACC CCATCCTTAG 13248 || | ||||| |||||||||| |||||||||| || ||||||| ||||||||| |||||||||| GTCTCACCCA ATTCTTCACT GTCTCAATCT TACAAGGATC CACCATCACT CCATCCTTAG 60 AAACCACATG CCCCAAGAAG GACATTGCAT CTAGCCAAAA CTCACACTTG GAGAATTTTG 13308 ||||||| || |||||||||| |||| ||||| ||| |||||| ||||||||| ||||| || | AAACCACGTG CCCCAAGAAG GACACTGCAT CTATCCAAAA CTCACACTTA GAGAACTTGG 120 CATAAAGC-T TTTCTCCCTC AACATTTCCA ATACAATTCT CAAGTGCTCC TCATATTCCT 13367 |||||||| | |||||||||| |||||||| | |||||||||| ||| |||||| |||| ||||| CATAAAGCTT TTTCTCCCTC AACATTTCTA ATACAATTCT CAAATGCTCC TCATGTTCCT 180 TCTTGCTCTT TAAGTATACC AATATATCAT CAATAAATAC GATCACAAAG AGATCTAGAT 13427 |||| || || |||||| | | |||||||| ||||||| | ||| |||||| ||||| | || TCTTACTATT AGAGTATATC AGTATATCAT CAATAAACAT GATGACAAAG AGATCCAAAT 240 ATGGCTTAAA AATCCCATTC ATCAAGCTCA TGAAAGCAGC AGGGGCATTC GTAAGACCAA 13487 ||||||| || |||||| ||| |||||||||| |||| ||||| |||||||||| |||||||||| ATGGCTT-AA AATCCCGTTC ATCAAGCTCA TGAACGCAGC AGGGGCATTC GTAAGACCAA 299 AAGACATCAC TACAAATTCG TAATGCCCAT ACCTGGTCCG AAAAAGCAGT CTTTGGCACA 13547 |||||||||| |||||||||| |||||||||| ||||||| | ||||||||| |||||||||| AAGACATCAC TACAAATTCG TAATGCCCAT ACCTGGTTC- TAAAAGCAGT CTTTGGCACA 358 TCCGTTGCCC GCATTTTCAA TTGATGATAA CCGGACCTCA AGTCAATCTT AGAGAAGACA 13607 |||||||||| | |||||||| |||||||||| ||||| |||| |||||||||| |||||||||| TCCGTTGCCC GTATTTTCAA TTGATGATAA CCGGATCTCA AGTCAATCTT AGAGAAGACA 418 CAAGCACCTT GTAACTGATC GAACAAGTCA TCAATGCGAG GAATGGGATA CTTGTTCTTA 13667 |||||||||| |||||||||| |||||||||| |||||||||| ||| ||||| |||||||||| CAAGCACCTT GTAACTGATC GAACAAGTCA TCAATGCGAG GAAGTGGATA CTTGTTCTTA 478 ATTGTTACCT TGTTCAACTG CCGGTAGTCT ATGCACATCC GAAAACTCCC ATCCTTCTTC 13727 || ||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATAGTTACCT TGTTCAACTG CCGGTAGTCT ATGCACATCC GAAAACTCCC ATCCTTCTTC 538 TTCACAAATA ACACCGGAGC ACCCCAAGGA GAAGCACTTG GTCTAATGAA TCCTTCGCTC 13787 |||||||| | |||||||| |||||||||| || ||||||| |||||||||| ||| |||| TTCACAAACC AAACCGGAGC ACCCCAAGGA GATGCACTTG GTCTAATGAA GACTTTGCTC 598 AACAACTCTT GAAGTTGGGC ATTTAACTCT CTCAACTCCG TGGGAGCCAT TCTATAAGAG 13847 || ||||||| |||||||||| ||||||||| || ||||||| ||||||||| |||||||| | AAAAACTCTT GAAGTTGGGC CTTTAACTCT CTTAACTCCG CGGGAGCCAT TCTATAAG-G 657 GGATATAGAA ATGGGGCGAG T 13868 || ||||||| |||||||||| | GGGTATAGAA ATGGGGCGAG T 678 hqPGS_C06HBa0153O03.1-1+_SGN-E368762+ (13189 13868) ******************************************************************************** EST sequence 97 +strand 358 n (File: SGN-E240817+) 1 GTCTCACCCA TTTCTTCACT GTCTCAATCT TACCAGGATC CACCATCACT CCATCCTTAG 61 AAACCACGTG CCCCAAGAAG GACACTGCAT CTATCCAAAA CTCACACTTA GAGAACTTGG 121 CATAAAGCTT TTTCTCCCTC AACATTTCTA ATACAATTCT CAAATGCTCC TCATGTTCCT 181 TCTTACTATT AGAGGATATC AGTATATCAT CAATGAACAT GATGACGCAG AGATCCAAAT 241 ATGGCTTAAA ATCCCGTTCA TCAAGCTCAT GAACGCAGCA GGGGCATTCG TAAGACCAAA 301 AGACATCACT ACAAATTCGT AATGCCCATA CCTGGTTCTA AAAGCAGTCT TTGGCACA Predicted gene structure (within gDNA segment 12490 to 14354): Exon 1 13189 13547 ( 359 n); cDNA 1 358 ( 358 n); score: 0.893 MATCH C06HBa0153O03.1-1+ SGN-E240817+ 0.893 359 1.003 C PGS_C06HBa0153O03.1-1+_SGN-E240817+ (13189 13547) Alignment (genomic DNA sequence = upper lines): GTATTACCCA ATTCTTCACT GTCTCAATCT TAGAAGGATC CACCATCACC CCATCCTTAG 13248 || | ||||| ||||||||| |||||||||| || |||||| ||||||||| |||||||||| GTCTCACCCA TTTCTTCACT GTCTCAATCT TACCAGGATC CACCATCACT CCATCCTTAG 60 AAACCACATG CCCCAAGAAG GACATTGCAT CTAGCCAAAA CTCACACTTG GAGAATTTTG 13308 ||||||| || |||||||||| |||| ||||| ||| |||||| ||||||||| ||||| || | AAACCACGTG CCCCAAGAAG GACACTGCAT CTATCCAAAA CTCACACTTA GAGAACTTGG 120 CATAAAGC-T TTTCTCCCTC AACATTTCCA ATACAATTCT CAAGTGCTCC TCATATTCCT 13367 |||||||| | |||||||||| |||||||| | |||||||||| ||| |||||| |||| ||||| CATAAAGCTT TTTCTCCCTC AACATTTCTA ATACAATTCT CAAATGCTCC TCATGTTCCT 180 TCTTGCTCTT TAAGTATACC AATATATCAT CAATAAATAC GATCACAAAG AGATCTAGAT 13427 |||| || || || ||| | | |||||||| |||| || | ||| || || ||||| | || TCTTACTATT AGAGGATATC AGTATATCAT CAATGAACAT GATGACGCAG AGATCCAAAT 240 ATGGCTTAAA AATCCCATTC ATCAAGCTCA TGAAAGCAGC AGGGGCATTC GTAAGACCAA 13487 ||||||| || |||||| ||| |||||||||| |||| ||||| |||||||||| |||||||||| ATGGCTT-AA AATCCCGTTC ATCAAGCTCA TGAACGCAGC AGGGGCATTC GTAAGACCAA 299 AAGACATCAC TACAAATTCG TAATGCCCAT ACCTGGTCCG AAAAAGCAGT CTTTGGCACA 13547 |||||||||| |||||||||| |||||||||| ||||||| | ||||||||| |||||||||| AAGACATCAC TACAAATTCG TAATGCCCAT ACCTGGTTC- TAAAAGCAGT CTTTGGCACA 358 hqPGS_C06HBa0153O03.1-1+_SGN-E240817+ (13189 13547) ******************************************************************************** EST sequence 7 +strand 716 n (File: SGN-E577713+) 1 CGGTGGATAC CTAGGCACCC AGAGACGAGG AAGGGCGTAG TAATCGACGA AATGCTTCGG 61 GGAGTTGAAA ATAAGCATAG ATCCGGAGAT TCCCGAATAG GGCAACCTTT CGAACTGCTG 121 CTGAATCCAT GGGCAGGCAA GAGACAACCT GGCGAACTGA AACATCTTAG TAGCCAGAGG 181 AAAAGAAAGC AAATAAGGAA GATACATAAG AAAACGTGGA TCCAGGATCA AACAATACAG 241 AAGCCATGCA ATCACAAACC AGAAGATTAC CTGTGATGAC AACATCAGAT GCCTCCGCTT 301 CACACCGCCC AGGGAAAGCG TAACAATGGG CCCTATCGTT CGTCTGTCCG TTGCCCCTAC 361 CATGTTGTGA TGTAGTGGCC CAAGTTTGCC CATTACCTCT GCCGTTTTGG TGACCACCAT 421 TACCTCGACC ACCACGTCCT CCAGAATAAC GGCCTCTACC ATGACCACCT CTACCTCTAG 481 CTATTGGGGG TCTATAACTT TGTCTGGGAC AATTTTTCCT AATATGTCCA ATCTCCCCAC 541 ATCCATACCA TTCTCTGGAG TCAATCCTAG GCCCCTCGGA GAAGTGTTGA CCGGTCTGAG 601 GTGGTCCCCC AACTACAGTC TGTAGTGAAG ACTCAATTGG TCGGACTGAG TAACTTCCCG 661 AACCCTGTCC TCTAGTGTAA GAACCATTAA ACTCACCTCC CTTTCGAAGC CTTTTT Predicted gene structure (within gDNA segment 11370 to 15563): Exon 1 14295 14324 ( 30 n); cDNA 156 185 ( 30 n); score: 0.633 Intron 1 14325 14359 ( 35 n); Pd: 0.799 (s: 0), Pa: 0.000 (s: 0.90) Exon 2 14360 14890 ( 531 n); cDNA 186 716 ( 531 n); score: 0.881 MATCH C06HBa0153O03.1-1+ SGN-E577713+ 0.881 561 0.784 C PGS_C06HBa0153O03.1-1+_SGN-E577713+ (14295 14324,14360 14890) Alignment (genomic DNA sequence = upper lines): ACTCACCCAC CGGAGTAGAA ACACGAATAG GCATGTCAAG CAATTCACAA TGTAAATTAA 14354 ||| | || | ||||| | | ||| || ACTGAAACAT CTTAGTAGCC AGAGGAAAAG .......... .......... .......... 185 GACCATTAGC AAATGAGGAA GATACATAAG AAAATGTGGA TCCAGGATAA AACAATACAT 14414 ||| |||| ||||| |||||||||| |||| ||||| |||||||| | ||||||||| .....AAAGC AAATAAGGAA GATACATAAG AAAACGTGGA TCCAGGATCA AACAATACAG 240 AAGCAATGCA ATCACAAACC AAAAGATTAC CTGTGATGAC AGCATCCGAT GTCTCCGCTT 14474 |||| ||||| |||||||||| | |||||||| |||||||||| | |||| ||| | |||||||| AAGCCATGCA ATCACAAACC AGAAGATTAC CTGTGATGAC AACATCAGAT GCCTCCGCTT 300 CAGATCTCCC AGGGAAAGCA TAACAACGGG CCCTATCACC TGTCTGCCCG TTGCCCCTAC 14534 || | | ||| ||||||||| |||||| ||| ||||||| ||||| ||| |||||||||| CACACCGCCC AGGGAAAGCG TAACAATGGG CCCTATCGTT CGTCTGTCCG TTGCCCCTAC 360 CATGTTGCGC TGCAGTAGTT CCCATTTGTC CGTCACCCCC GCCGTTTTGG TGACCACCAT 14594 ||||||| | || ||| | | |||| | | | ||| | |||||||||| |||||||||| CATGTTGTGA TGTAGTGGCC CAAGTTTGCC CATTACCTCT GCCGTTTTGG TGACCACCAT 420 TACCTCGGCC ACCATGTCCT CCAGAATAGC GGCCTCTACC ATGACCACCT CTACCTCTAG 14654 ||||||| || |||| ||||| |||||||| | |||||||||| |||||||||| |||||||||| TACCTCGACC ACCACGTCCT CCAGAATAAC GGCCTCTACC ATGACCACCT CTACCTCTAG 480 CTATTGAGGG TCTATAACTC TATTTTGGAC AATTCCTCCT AATATGTCCA GTCTCCCCAC 14714 |||||| ||| ||||||||| | | | |||| |||| |||| |||||||||| ||||||||| CTATTGGGGG TCTATAACTT TGTCTGGGAC AATTTTTCCT AATATGTCCA ATCTCCCCAC 540 ATCCATAACA CTCCCTGGAG TCAAGCATAG GTCTCTCAGA GAAGTGTTGA CCGGTCTGAG 14774 ||||||| || || |||||| |||| | ||| | | ||| || |||||||||| |||||||||| ATCCATACCA TTCTCTGGAG TCAATCCTAG GCCCCTCGGA GAAGTGTTGA CCGGTCTGAG 600 GTGGACCCCC AACTACAGTT TGTAGTGAAG ACTGAATTGG TTGGACTGAG TAACCTCCTG 14834 |||| ||||| ||||||||| |||||||||| ||| |||||| | |||||||| |||| ||| | GTGGTCCCCC AACTACAGTC TGTAGTGAAG ACTCAATTGG TCGGACTGAG TAACTTCCCG 660 AACCCTGTCC TCTAGAGTAA GAACCATTAA ACTCACCTCC CTTTCGAAGA ATTTTT 14890 |||||||||| ||||| |||| |||||||||| |||||||||| ||||||||| ||||| AACCCTGTCC TCTAGTGTAA GAACCATTAA ACTCACCTCC CTTTCGAAGC CTTTTT 716 hqPGS_C06HBa0153O03.1-1+_SGN-E577713+ (14295 14324,14360 14890) ******************************************************************************** EST sequence 68 +strand 658 n (File: SGN-E355232+) 1 TCAATGCGAG GAAGAGGATA CTTGTTCTTA ATAGTTACCT TGTTCAGTTG TCTGTAGTCT 61 ATGCACATTC TAAAACTTCC ATCCTTCTTC TTCACAAATA AAACTGGAGC ACCCCAAGGA 121 GATGCACTTG GTCTAATGAA GACCTTGCTC AACAACTCTT GAAGTTGAGC CTTTAACTCT 181 CTTAATTCGG CGGGAGCCAT TCTATAAGGG GGTATAGAAA TGGGGCGTGT GCCTGGTTCA 241 AGATCGATAC AGAAGTCAAT ATCCCTATCT GGTGGCATAC CAGGAAGATC TGCAGGAAAC 301 ACGTCCAAAA ACTCGCGGAC TACCGAAACC GACTCAATTG AGGGTACTTG AGTAGTGTCA 361 TCCTTGAGAT GTGCCAAGAA AGCTGAACAC CCTTTACTAA CCATTTTCTT AGCACGAAGA 421 AAGGATATGA TACGCACCAG ATTGGAAGTG TAGTCACCCT CCCACACTAA CGGATCTGTC 481 CCAGGCTTGG CTAACGTCAC CGTTTTAGCA TTACAATCCA AGATCGCAAA ATTTGGAGAA 541 AGCCAAGTCA TACCCAGAAT TACATCAATG TCACCCATTT CTAAAAAACC AAATCTACAT 601 AAGTGTTGCT CCCCACGAAA TTCNACAAAA CAGAACTATG TACCTTTTCA ACTACCAC Predicted gene structure (within gDNA segment 12093 to 15711): Exon 1 13638 14292 ( 655 n); cDNA 1 658 ( 658 n); score: 0.868 MATCH C06HBa0153O03.1-1+ SGN-E355232+ 0.868 655 0.995 C PGS_C06HBa0153O03.1-1+_SGN-E355232+ (13638 14292) Alignment (genomic DNA sequence = upper lines): TCAATGCGAG GAATGGGATA CTTGTTCTTA ATTGTTACCT TGTTCAACTG CCGGTAGTCT 13697 |||||||||| ||| ||||| |||||||||| || ||||||| |||||| || | ||||||| TCAATGCGAG GAAGAGGATA CTTGTTCTTA ATAGTTACCT TGTTCAGTTG TCTGTAGTCT 60 ATGCACATCC GAAAACTCCC ATCCTTCTTC TTCACAAATA ACACCGGAGC ACCCCAAGGA 13757 |||||||| | |||||| || |||||||||| |||||||||| | || ||||| |||||||||| ATGCACATTC TAAAACTTCC ATCCTTCTTC TTCACAAATA AAACTGGAGC ACCCCAAGGA 120 GAAGCACTTG GTCTAATGAA TCCTTCGCTC AACAACTCTT GAAGTTGGGC ATTTAACTCT 13817 || ||||||| |||||||||| | | |||| |||||||||| ||||||| || ||||||||| GATGCACTTG GTCTAATGAA GACCTTGCTC AACAACTCTT GAAGTTGAGC CTTTAACTCT 180 CTCAACTCCG TGGGAGCCAT TCTATAAGAG GGATATAGAA ATGGGGCGAG TACCTGGTTT 13877 || || || | ||||||||| |||||||| | || ||||||| |||||||| | | ||||||| CTTAATTCGG CGGGAGCCAT TCTATAAG-G GGGTATAGAA ATGGGGCGTG TGCCTGGTTC 239 AAGATCAATG CAGAAGTCAA TATCTCTATC CGGTGGCATA CCCGGAAGGT CTGC-GGGAA 13936 |||||| || |||||||||| |||| ||||| ||||||||| || ||||| | |||| || || AAGATCGATA CAGAAGTCAA TATCCCTATC TGGTGGCATA CCAGGAAGAT CTGCAGGAAA 299 -AC-T--AAA AACTCACGAA CTATCAAAAC CGACTCAATT GGAGGTACTT GGGTAGTATC 13992 || | ||| ||||| || | ||| | |||| |||||||||| | ||||||| | ||||| || CACGTCCAAA AACTCGCGGA CTACCGAAAC CGACTCAATT GAGGGTACTT GAGTAGTGTC 359 ATCCCTGAGA TGTGCGAAGA ACGCTAAACA AACCTTACTA ACCATTTTCT TAGCACGAAG 14052 |||| ||||| ||||| |||| | ||| |||| | |||||| |||||||||| |||||||||| ATCCTTGAGA TGTGCCAAGA AAGCTGAACA CCCTTTACTA ACCATTTTCT TAGCACGAAG 419 AAAGGAGATG ATACGAACCG GATCGGAAGT GTAGTCACCC TCCCACACTA ACGGATCTGT 14112 |||||| ||| ||||| ||| ||| |||||| |||||||||| |||||||||| |||||||||| AAAGGATATG ATACGCACCA GATTGGAAGT GTAGTCACCC TCCCACACTA ACGGATCTGT 479 CCCAGGCTTG GCTAACATCA CAGTTTTAGC ATTACAATCC AAGATCGCAA AATTCGGAGA 14172 |||||||||| |||||| ||| | |||||||| |||||||||| |||||||||| |||| ||||| CCCAGGCTTG GCTAACGTCA CCGTTTTAGC ATTACAATCC AAGATCGCAA AATTTGGAGA 539 AAGCCAAGTC ATACCCAGAA TTACATCAAA ATCAACCATT TCTAAGATAA CCAAGTCTAC 14232 |||||||||| |||||||||| ||||||||| ||| ||||| ||||| | || |||| ||||| AAGCCAAGTC ATACCCAGAA TTACATCAAT GTCACCCATT TCTAA-AAAA CCAAATCTAC 598 ATAAGTATTG CTCCCCACAA AAGTCACCAG ACCAGACCTA TATACTTTTT CAACTATCAC 14292 |||||| ||| |||||||| | || || || | |||| ||| | ||| |||| |||||| ||| ATAAGTGTTG CTCCCCACGA AATTCNACAA AACAGAACTA TGTACCTTTT CAACTACCAC 658 hqPGS_C06HBa0153O03.1-1+_SGN-E355232+ (13638 14292) ******************************************************************************** EST sequence 72 +strand 761 n (File: SGN-E355244+) 1 CGGAAACTCC CATCCTTCTT CTTCACAAAC AAAATCGGAG CACCCCAAGG AGATGCACTT 61 GGTCTAATGA AACCTTTGCT CAACAACTCT TGAAGTTGGG CCTTTAACTC TCTTAACTCT 121 ACGGGAGCCA TTCTATAAGG GGGTATAGAA ATGGGGCGTG TGCCCGGTTC AAGATCGATA 181 CAGAAGTCAA TATCCCTATC TGGTGGCATA CCAGGAAGAT CTGCAGGGAA CACATCCAAA 241 AACTCACAGA CTATAAAAAC CGACTCAATC GAAGGTACTT GGGTAGTGTC ATCCTTGAGA 301 TGTGCCAAGA AAGCTAAACA CCCTTTACTA ACCATTTTCT TAGCACGAAG AAAGGAGATG 361 ATGCACACCG GATTGGAAGT GTAGTCACCC TCCCACACTA ACGGATCTGT CCCAGGCTTG 421 GCTAATGTCA CCGTTTTAGC ATTACAATCC AAGATCGCAA ATTGCGGAGA AAGCCAAGTC 481 ATACCCAGAA TTACATCAAA ATCAACCATT TCTAAGATAA CCAAATCTAC ATAAGTGTTG 541 CTCCCCACAA AGTTCACCAA ACAAGACCTA TATACCTTTT CAACTACCAC AGACTCACCC 601 ACCGGAGTAG AAACACGAAT AGGCATATCA AGTAATTCAC AATGTAAATT TAGACCGTTA 661 GCAAATGAGG AAGATACANT AGAAAACGTG GATCCCAGGA TCAACAATAC AGAAGCCATG 721 CAATCACAAT CGAAAGATTA CCTGTGATGA CAGCATCTAA T Predicted gene structure (within gDNA segment 13080 to 15820): Exon 1 13707 14464 ( 758 n); cDNA 1 761 ( 761 n); score: 0.889 MATCH C06HBa0153O03.1-1+ SGN-E355244+ 0.889 758 0.996 C PGS_C06HBa0153O03.1-1+_SGN-E355244+ (13707 14464) Alignment (genomic DNA sequence = upper lines): CGAAAACTCC CATCCTTCTT CTTCACAAAT AACACCGGAG CACCCCAAGG AGAAGCACTT 13766 || ||||||| |||||||||| ||||||||| || | ||||| |||||||||| ||| |||||| CGGAAACTCC CATCCTTCTT CTTCACAAAC AAAATCGGAG CACCCCAAGG AGATGCACTT 60 GGTCTAATGA ATCCTTCGCT CAACAACTCT TGAAGTTGGG CATTTAACTC TCTCAACTCC 13826 |||||||||| | |||| ||| |||||||||| |||||||||| | |||||||| ||| ||||| GGTCTAATGA AACCTTTGCT CAACAACTCT TGAAGTTGGG CCTTTAACTC TCTTAACTCT 120 GTGGGAGCCA TTCTATAAGA GGGATATAGA AATGGGGCGA GTACCTGGTT TAAGATCAAT 13886 |||||||| ||||||||| ||| |||||| ||||||||| || || |||| |||||| || ACGGGAGCCA TTCTATAAG- GGGGTATAGA AATGGGGCGT GTGCCCGGTT CAAGATCGAT 179 GCAGAAGTCA ATATCTCTAT CCGGTGGCAT ACCCGGAAGG TCTGC-GGGA A-AC-T--AA 13941 ||||||||| ||||| |||| | |||||||| ||| ||||| ||||| |||| | || | || ACAGAAGTCA ATATCCCTAT CTGGTGGCAT ACCAGGAAGA TCTGCAGGGA ACACATCCAA 239 AAACTCACGA ACTATCAAAA CCGACTCAAT TGGAGGTACT TGGGTAGTAT CATCCCTGAG 14001 |||||||| ||||| |||| |||||||||| | ||||||| |||||||| | ||||| |||| AAACTCACAG ACTATAAAAA CCGACTCAAT CGAAGGTACT TGGGTAGTGT CATCCTTGAG 299 ATGTGCGAAG AACGCTAAAC AAACCTTACT AACCATTTTC TTAGCACGAA GAAAGGAGAT 14061 |||||| ||| || ||||||| | | ||||| |||||||||| |||||||||| |||||||||| ATGTGCCAAG AAAGCTAAAC ACCCTTTACT AACCATTTTC TTAGCACGAA GAAAGGAGAT 359 GATACGAACC GGATCGGAAG TGTAGTCACC CTCCCACACT AACGGATCTG TCCCAGGCTT 14121 ||| | ||| |||| ||||| |||||||||| |||||||||| |||||||||| |||||||||| GATGCACACC GGATTGGAAG TGTAGTCACC CTCCCACACT AACGGATCTG TCCCAGGCTT 419 GGCTAACATC ACAGTTTTAG CATTACAATC CAAGATCGCA AAATTCGGAG AAAGCCAAGT 14181 |||||| || || ||||||| |||||||||| |||||||||| || | ||||| |||||||||| GGCTAATGTC ACCGTTTTAG CATTACAATC CAAGATCGCA AATTGCGGAG AAAGCCAAGT 479 CATACCCAGA ATTACATCAA AATCAACCAT TTCTAAGATA ACCAAGTCTA CATAAGTATT 14241 |||||||||| |||||||||| |||||||||| |||||||||| ||||| |||| ||||||| || CATACCCAGA ATTACATCAA AATCAACCAT TTCTAAGATA ACCAAATCTA CATAAGTGTT 539 GCTCCCCACA AAAGTCACCA GACCAGACCT ATATACTTTT TCAACTATCA CAGACTCACC 14301 |||||||||| || |||||| || |||||| |||||| ||| ||||||| || |||||||||| GCTCCCCACA AAGTTCACCA AACAAGACCT ATATACCTTT TCAACTACCA CAGACTCACC 599 CACCGGAGTA GAAACACGAA TAGGCATGTC AAGCAATTCA CAATGTAAAT TAAGACCATT 14361 |||||||||| |||||||||| ||||||| || ||| |||||| |||||||||| | ||||| || CACCGGAGTA GAAACACGAA TAGGCATATC AAGTAATTCA CAATGTAAAT TTAGACCGTT 659 AGCAAATGAG GAAGATACAT AAGAAAATGT GGAT-CCAGG ATAAAACAAT ACATAAGCAA 14420 |||||||||| ||||||||| |||||| || |||| ||||| || |||||| ||| |||| | AGCAAATGAG GAAGATACAN TAGAAAACGT GGATCCCAGG AT-CAACAAT ACAGAAGCCA 718 TGCAATCACA AACCAAAAGA TTACCTGTGA TGACAGCATC CGAT 14464 ||||||||| || | ||||| |||||||||| |||||||||| || TGCAATCAC- AATCGAAAGA TTACCTGTGA TGACAGCATC TAAT 761 hqPGS_C06HBa0153O03.1-1+_SGN-E355244+ (13707 14464) ******************************************************************************** EST sequence 43 +strand 661 n (File: SGN-E351414+) 1 CGGAAACTCC CATCCTTCTT CTTCACAAAC AAAATCGGAG CACCCCAAGG AGATGCACTT 61 GGTCTAATGA AACCTTTGCT CAACAACTCT TGAAGTTGGG CCTTTAACTC TCTTAACTCT 121 ACGGGAGCCA TTCTATAAGG GGGTATAGAA ATGGGGCGTG TGCCCGGTTC AAGATCGATA 181 CAGAAGTCAA TATCCCTATC TGGTGGCATA CCAGGAAGAT CTGCAGGGAA CACATCCAAA 241 AACTCACAGA CTATAAAAAC CGACTCAATC GAAGGTACTT GGGTAGTGTC ATCCTTGAGA 301 TGTGCCAAGA AAGCTAAACA CCCTTTACTA ACCATTTTCT TAGCACGAAG AAAGGAGATG 361 ATGCACACCG GATTGGAAGT GTAGTCACCC TCCCACACTA ACGGATCTGT CCCAGGCTTG 421 GCTAATGTCA CCGTTTTAGC ATTACAATCC AAGATCGCAA ATTGCGGAGA AAGCCAAGTC 481 ATACCCAGAA TTACATCAAA ATCAACCATT TCTAAGATAA CCAAATCTAC ATAAGTGTTG 541 CTCCCCACAA AGTTCACCAA ACAAGACCTA TATACCTTTT CAACTACCAC AGACTCACCC 601 ACCGGAGTAG AAACACGAAT AGGCATATCA AGTAATTCAC AATGTAAATT TAGACCGTTA 661 G Predicted gene structure (within gDNA segment 13080 to 15072): Exon 1 13707 14363 ( 657 n); cDNA 1 661 ( 661 n); score: 0.894 MATCH C06HBa0153O03.1-1+ SGN-E351414+ 0.894 657 0.994 C PGS_C06HBa0153O03.1-1+_SGN-E351414+ (13707 14363) Alignment (genomic DNA sequence = upper lines): CGAAAACTCC CATCCTTCTT CTTCACAAAT AACACCGGAG CACCCCAAGG AGAAGCACTT 13766 || ||||||| |||||||||| ||||||||| || | ||||| |||||||||| ||| |||||| CGGAAACTCC CATCCTTCTT CTTCACAAAC AAAATCGGAG CACCCCAAGG AGATGCACTT 60 GGTCTAATGA ATCCTTCGCT CAACAACTCT TGAAGTTGGG CATTTAACTC TCTCAACTCC 13826 |||||||||| | |||| ||| |||||||||| |||||||||| | |||||||| ||| ||||| GGTCTAATGA AACCTTTGCT CAACAACTCT TGAAGTTGGG CCTTTAACTC TCTTAACTCT 120 GTGGGAGCCA TTCTATAAGA GGGATATAGA AATGGGGCGA GTACCTGGTT TAAGATCAAT 13886 |||||||| ||||||||| ||| |||||| ||||||||| || || |||| |||||| || ACGGGAGCCA TTCTATAAG- GGGGTATAGA AATGGGGCGT GTGCCCGGTT CAAGATCGAT 179 GCAGAAGTCA ATATCTCTAT CCGGTGGCAT ACCCGGAAGG TCTGC-GGGA A-AC-T--AA 13941 ||||||||| ||||| |||| | |||||||| ||| ||||| ||||| |||| | || | || ACAGAAGTCA ATATCCCTAT CTGGTGGCAT ACCAGGAAGA TCTGCAGGGA ACACATCCAA 239 AAACTCACGA ACTATCAAAA CCGACTCAAT TGGAGGTACT TGGGTAGTAT CATCCCTGAG 14001 |||||||| ||||| |||| |||||||||| | ||||||| |||||||| | ||||| |||| AAACTCACAG ACTATAAAAA CCGACTCAAT CGAAGGTACT TGGGTAGTGT CATCCTTGAG 299 ATGTGCGAAG AACGCTAAAC AAACCTTACT AACCATTTTC TTAGCACGAA GAAAGGAGAT 14061 |||||| ||| || ||||||| | | ||||| |||||||||| |||||||||| |||||||||| ATGTGCCAAG AAAGCTAAAC ACCCTTTACT AACCATTTTC TTAGCACGAA GAAAGGAGAT 359 GATACGAACC GGATCGGAAG TGTAGTCACC CTCCCACACT AACGGATCTG TCCCAGGCTT 14121 ||| | ||| |||| ||||| |||||||||| |||||||||| |||||||||| |||||||||| GATGCACACC GGATTGGAAG TGTAGTCACC CTCCCACACT AACGGATCTG TCCCAGGCTT 419 GGCTAACATC ACAGTTTTAG CATTACAATC CAAGATCGCA AAATTCGGAG AAAGCCAAGT 14181 |||||| || || ||||||| |||||||||| |||||||||| || | ||||| |||||||||| GGCTAATGTC ACCGTTTTAG CATTACAATC CAAGATCGCA AATTGCGGAG AAAGCCAAGT 479 CATACCCAGA ATTACATCAA AATCAACCAT TTCTAAGATA ACCAAGTCTA CATAAGTATT 14241 |||||||||| |||||||||| |||||||||| |||||||||| ||||| |||| ||||||| || CATACCCAGA ATTACATCAA AATCAACCAT TTCTAAGATA ACCAAATCTA CATAAGTGTT 539 GCTCCCCACA AAAGTCACCA GACCAGACCT ATATACTTTT TCAACTATCA CAGACTCACC 14301 |||||||||| || |||||| || |||||| |||||| ||| ||||||| || |||||||||| GCTCCCCACA AAGTTCACCA AACAAGACCT ATATACCTTT TCAACTACCA CAGACTCACC 599 CACCGGAGTA GAAACACGAA TAGGCATGTC AAGCAATTCA CAATGTAAAT TAAGACCATT 14361 |||||||||| |||||||||| ||||||| || ||| |||||| |||||||||| | ||||| || CACCGGAGTA GAAACACGAA TAGGCATATC AAGTAATTCA CAATGTAAAT TTAGACCGTT 659 AG 14363 || AG 661 hqPGS_C06HBa0153O03.1-1+_SGN-E351414+ (13707 14363) ******************************************************************************** EST sequence 164 -strand 659 n (File: SGN-E352117-) 1 CCCAAGGGGA TGCACTTGGC NTAATGAAGC CCTTACNTAA CAACTCTTGA AGTTGGGCCT 61 TTAACTCCCC CAACTCAGCG GGAGCCATTC TATAAGGGGG TATGGAAATG GGGTGATCAC 121 CCGGCTCCAG ATCAATGCAA AAGTCAATAT CCCTATCCGG TGGCATACCA GGAAGGTCTG 181 CAGGAAACAC ATCCAGAAAC TCACGGACTA TCGAAACTGA CTCAATTGAA GGTACTTGGG 241 TAGTATCATC CCTGAGGTGT GCCAAGAAAG CTAAACACCC TTTACTAACC ATTCTCTTAG 301 CACAAAGAAA GGAGATGATA CGAACTGGAG TGGAAGTGTA GTCACCCTCC CACACTAACG 361 CATCTATCTC AGGCTTGGCC AACGTTACAG TTTTAGCATT ACAATCCAAG ATTGCAAAAT 421 TTGGAGACAG CCAAGTCATA CCCAGAATTA CATCGAAGTC AACCATTTAT AGGATAACCA 481 AGTCTACATG AGTATTGCTC CCACAAATGT CACAAGACAA GACCTATACA CCTTTTCAAC 541 AATCACAGAC TCACCCACCG GAGTAGAACA CGAATAGGTA TGTCAAGCGA TTCACTATGT 601 AAATTAAGAA CAGTAGCAAA TGAGGAAGAT ACATATGAAA ATGTGGAGGC AGGATCAAA Predicted gene structure (within gDNA segment 10400 to 16534): Exon 1 13750 14406 ( 657 n); cDNA 1 659 ( 659 n); score: 0.865 MATCH C06HBa0153O03.1-1+ SGN-E352117- 0.865 657 0.997 C PGS_C06HBa0153O03.1-1+_SGN-E352117- (13750 14406) Alignment (genomic DNA sequence = upper lines): CCCAAGGAGA AGCACTTGGT CTAATGAATC CTTCGCTCAA CAACTCTTGA AGTTGGGCAT 13809 ||||||| || |||||||| ||||||| | | | | || |||||||||| |||||||| | CCCAAGGGGA TGCACTTGGC NTAATGAAGC CCTTACNTAA CAACTCTTGA AGTTGGGCCT 60 TTAACTCTCT CAACTCCGTG GGAGCCATTC TATAAGAGGG ATATAGAAAT GGGGCGAGTA 13869 ||||||| | |||||| | | |||||||||| |||||| ||| ||| ||||| |||| || | TTAACTCCCC CAACTCAGCG GGAGCCATTC TATAAG-GGG GTATGGAAAT GGGGTGATCA 119 CCTGGTTTAA GATCAATGCA GAAGTCAATA TCTCTATCCG GTGGCATACC CGGAAGGTCT 13929 || || | | |||||||||| ||||||||| || ||||||| |||||||||| ||||||||| CCCGGCTCCA GATCAATGCA AAAGTCAATA TCCCTATCCG GTGGCATACC AGGAAGGTCT 179 GC-GGGAA-A C-T--AAAAA CTCACGAACT ATCAAAACCG ACTCAATTGG AGGTACTTGG 13984 || || || | | | | ||| |||||| ||| ||| |||| | ||||||||| |||||||||| GCAGGAAACA CATCCAGAAA CTCACGGACT ATCGAAACTG ACTCAATTGA AGGTACTTGG 239 GTAGTATCAT CCCTGAGATG TGCGAAGAAC GCTAAACAAA CCTTACTAAC CATTTTCTTA 14044 |||||||||| ||||||| || ||| ||||| |||||||| | |||||||| |||| ||||| GTAGTATCAT CCCTGAGGTG TGCCAAGAAA GCTAAACACC CTTTACTAAC CATTCTCTTA 299 GCACGAAGAA AGGAGATGAT ACGAACCGGA TCGGAAGTGT AGTCACCCTC CCACACTAAC 14104 |||| ||||| |||||||||| |||||| ||| |||||||| |||||||||| |||||||||| GCACAAAGAA AGGAGATGAT ACGAACTGGA GTGGAAGTGT AGTCACCCTC CCACACTAAC 359 GGATCTGTCC CAGGCTTGGC TAACATCACA GTTTTAGCAT TACAATCCAA GATCGCAAAA 14164 | |||| || |||||||||| ||| | ||| |||||||||| |||||||||| ||| |||||| GCATCTATCT CAGGCTTGGC CAACGTTACA GTTTTAGCAT TACAATCCAA GATTGCAAAA 419 TTCGGAGAAA GCCAAGTCAT ACCCAGAATT ACATCAAAAT CAACCATTTC TAAGATAACC 14224 || ||||| | |||||||||| |||||||||| ||||| || | ||||||||| || ||||||| TTTGGAGACA GCCAAGTCAT ACCCAGAATT ACATCGAAGT CAACCATTTA TAGGATAACC 479 AAGTCTACAT AAGTATTGCT CCCCACAAAA GTCACCAGAC CAGACCTATA TACTTTTTCA 14284 |||||||||| ||||||||| |||||||| ||||| |||| ||||||||| || |||||| AAGTCTACAT GAGTATTGCT -CCCACAAAT GTCACAAGAC AAGACCTATA CACCTTTTCA 538 ACTATCACAG ACTCACCCAC CGGAGTAGAA ACACGAATAG GCATGTCAAG CAATTCACAA 14344 || ||||||| |||||||||| |||||||| | |||||||||| | |||||||| | |||||| | ACAATCACAG ACTCACCCAC CGGAGTAG-A ACACGAATAG GTATGTCAAG CGATTCACTA 597 TGTAAATTAA GACCATTAGC AAATGAGGAA GATACATAAG AAAATGTGGA TCCAGGATAA 14404 |||||||||| || || |||| |||||||||| |||||||| | |||||||||| |||||| | TGTAAATTAA GAACAGTAGC AAATGAGGAA GATACATATG AAAATGTGGA GGCAGGATCA 657 AA 14406 || AA 659 hqPGS_C06HBa0153O03.1-1+_SGN-E352117- (13750 14406) ******************************************************************************** EST sequence 75 +strand 543 n (File: SGN-E355026+) 1 GAACACATCC AAAAATTCTC AAACTACCAA AACTAATTCA ATTGAGGGTA CTTGGATAGT 61 GTCATCCTTG AGATGTGCCA AGAAGGATAG ACAACCCTTA CTAACCATCT TCTGAGCACA 121 AAGAAAGGAG ATAATACGCA CCGGATTGGA AGTGTAGTCA CCCTCCCACA CTAACGGATC 181 TGTCCCAGGC TTGGCTAACG TCACCGTTTT AGCATTACAA TCCAAGATCG CAAAATTGGG 241 AGAAAGCCAA GTCATACCCA GAATTACATC AAAGTCACCC ATTTCTAAAG TAACCAAATC 301 TAGATAAGTG TTGCTCCCCA CGAAATTCAC CAAACAAGAC CTATGTACCT TTTCAACTAT 361 CACAGACTCA CCCACCGGAG TAGAAACACG AATAGGCATG TCAAGTAATT CACAATGCAA 421 TTTTAGACCA TTAGCAAATG ATGAAGATAC ATATGAAAAC GTGGAGCCAG GGTCAAATAA 481 TACAGATGCC ATGCAATCAC AAACCAAGAG ATTACCTGTG ATAACAGCAT CAGACGCCTC 541 TGC Predicted gene structure (within gDNA segment 12007 to 16324): Exon 1 13940 14472 ( 533 n); cDNA 11 543 ( 533 n); score: 0.889 MATCH C06HBa0153O03.1-1+ SGN-E355026+ 0.889 533 0.982 C PGS_C06HBa0153O03.1-1+_SGN-E355026+ (13940 14472) Alignment (genomic DNA sequence = upper lines): AAAAACTCAC GAACTATCAA AACCGACTCA ATTGGAGGTA CTTGGGTAGT ATCATCCCTG 13999 ||||| || | ||||| ||| ||| | ||| |||| |||| ||||| |||| |||||| || AAAAATTCTC AAACTACCAA AACTAATTCA ATTGAGGGTA CTTGGATAGT GTCATCCTTG 70 AGATGTGCGA AGAACGCTAA ACAAACCTTA CTAACCATTT TCTTAGCACG AAGAAAGGAG 14059 |||||||| | |||| | || |||| ||||| |||||||| | ||| ||||| |||||||||| AGATGTGCCA AGAAGGATAG ACAACCCTTA CTAACCATCT TCTGAGCACA AAGAAAGGAG 130 ATGATACGAA CCGGATCGGA AGTGTAGTCA CCCTCCCACA CTAACGGATC TGTCCCAGGC 14119 || ||||| | |||||| ||| |||||||||| |||||||||| |||||||||| |||||||||| ATAATACGCA CCGGATTGGA AGTGTAGTCA CCCTCCCACA CTAACGGATC TGTCCCAGGC 190 TTGGCTAACA TCACAGTTTT AGCATTACAA TCCAAGATCG CAAAATTCGG AGAAAGCCAA 14179 ||||||||| |||| ||||| |||||||||| |||||||||| ||||||| || |||||||||| TTGGCTAACG TCACCGTTTT AGCATTACAA TCCAAGATCG CAAAATTGGG AGAAAGCCAA 250 GTCATACCCA GAATTACATC AAAATCAACC ATTTCTAAGA TAACCAAGTC TACATAAGTA 14239 |||||||||| |||||||||| ||| ||| || |||||||| ||||||| || || |||||| GTCATACCCA GAATTACATC AAAGTCACCC ATTTCTAAAG TAACCAAATC TAGATAAGTG 310 TTGCTCCCCA CAAAAGTCAC CAGACCAGAC CTATATACTT TTTCAACTAT CACAGACTCA 14299 |||||||||| | ||| |||| || || |||| |||| ||| | |||||||||| |||||||||| TTGCTCCCCA CGAAATTCAC CAAACAAGAC CTATGTACCT TTTCAACTAT CACAGACTCA 370 CCCACCGGAG TAGAAACACG AATAGGCATG TCAAGCAATT CACAATGTAA ATTAAGACCA 14359 |||||||||| |||||||||| |||||||||| ||||| |||| ||||||| || || |||||| CCCACCGGAG TAGAAACACG AATAGGCATG TCAAGTAATT CACAATGCAA TTTTAGACCA 430 TTAGCAAATG AGGAAGATAC ATAAGAAAAT GTGGATCCAG GATAAAACAA TACATAAGCA 14419 |||||||||| | |||||||| ||| ||||| ||||| |||| | | ||| || |||| | || TTAGCAAATG ATGAAGATAC ATATGAAAAC GTGGAGCCAG GGTCAAATAA TACAGATGCC 490 ATGCAATCAC AAACCAAAAG ATTACCTGTG ATGACAGCAT CCGATGTCTC CGC 14472 |||||||||| ||||||| || |||||||||| || ||||||| | || | ||| || ATGCAATCAC AAACCAAGAG ATTACCTGTG ATAACAGCAT CAGACGCCTC TGC 543 hqPGS_C06HBa0153O03.1-1+_SGN-E355026+ (13940 14472) ******************************************************************************** EST sequence 16 +strand 560 n (File: SGN-E242765+) 1 TGGTTCTAGC ATTAGGACAT CATAGCTCAT TTGATTATTT CTCATCTCAT AATTAGTATT 61 TAGTATTCCC TCAATTTAAT AATTTCATTA AAGTGTTCAT AGAGACTTAT CTCTTCATTA 121 GCTTTACACT ATAAAAGGTG AGTAAGTGTT GGTAATATTT ACTTAGGCTT ATTTGCTATT 181 GAAACCGACT CAATCGAAGG TACTTGGGTA GTGTCATCAT TGAGATGTGC TAAGAAAGAT 241 AAACACCTTT TACTAATCAT TTTCTTAGCA AGAAGAAAGG AGACGATGCG GACCGGATTG 301 GAAGTGTAGT CACCCTCTCA CACTAACGGG TCTATCCCAG GCTTGGCTAA CGTCACCGTT 361 TTAGCATTAC AATCCAAGAT TGCAAATTGC GGAGAAAGCA AAGTCATACC CAGAATCACA 421 TCAAAATCAT CCATTGCCAA GATAACCAAA TCTACATAAG TGTTGCTCCC CACAAAGTTT 481 ACGACACAAG ACCTATATAC TTTTTCAACT ACCTCAGACT CACCCACCGG AGTAGAAACA 541 CGAATAGGCA TATCAAGAAA Predicted gene structure (within gDNA segment 10487 to 15028): Exon 1 13954 14337 ( 384 n); cDNA 177 560 ( 384 n); score: 0.883 MATCH C06HBa0153O03.1-1+ SGN-E242765+ 0.883 384 0.686 C PGS_C06HBa0153O03.1-1+_SGN-E242765+ (13954 14337) Alignment (genomic DNA sequence = upper lines): TATCAAAACC GACTCAATTG GAGGTACTTG GGTAGTATCA TCCCTGAGAT GTGCGAAGAA 14013 ||| ||||| |||||||| | ||||||||| |||||| ||| || |||||| |||| ||||| TATTGAAACC GACTCAATCG AAGGTACTTG GGTAGTGTCA TCATTGAGAT GTGCTAAGAA 236 CGCTAAACAA ACCTTACTAA CCATTTTCTT AGCACGAAGA AAGGAGATGA TACGAACCGG 14073 | |||||| ||||||| ||||||||| |||| ||||| ||||||| || | || ||||| AGATAAACAC CTTTTACTAA TCATTTTCTT AGCAAGAAGA AAGGAGACGA TGCGGACCGG 296 ATCGGAAGTG TAGTCACCCT CCCACACTAA CGGATCTGTC CCAGGCTTGG CTAACATCAC 14133 || ||||||| |||||||||| | |||||||| ||| ||| || |||||||||| ||||| |||| ATTGGAAGTG TAGTCACCCT CTCACACTAA CGGGTCTATC CCAGGCTTGG CTAACGTCAC 356 AGTTTTAGCA TTACAATCCA AGATCGCAAA ATTCGGAGAA AGCCAAGTCA TACCCAGAAT 14193 ||||||||| |||||||||| |||| ||||| | ||||||| ||| |||||| |||||||||| CGTTTTAGCA TTACAATCCA AGATTGCAAA TTGCGGAGAA AGCAAAGTCA TACCCAGAAT 416 TACATCAAAA TCAACCATTT CTAAGATAAC CAAGTCTACA TAAGTATTGC TCCCCACAAA 14253 ||||||||| ||| ||||| | |||||||| ||| |||||| ||||| |||| |||||||||| CACATCAAAA TCATCCATTG CCAAGATAAC CAAATCTACA TAAGTGTTGC TCCCCACAAA 476 AGTCACCAGA CCAGACCTAT ATACTTTTTC AACTATCACA GACTCACCCA CCGGAGTAGA 14313 | || | | | |||||||| |||||||||| ||||| | || |||||||||| |||||||||| GTTTACGACA CAAGACCTAT ATACTTTTTC AACTACCTCA GACTCACCCA CCGGAGTAGA 536 AACACGAATA GGCATGTCAA GCAA 14337 |||||||||| ||||| |||| | || AACACGAATA GGCATATCAA GAAA 560 hqPGS_C06HBa0153O03.1-1+_SGN-E242765+ (13954 14337) ******************************************************************************** EST sequence 38 +strand 331 n (File: SGN-E352716+) 1 GATCTGTCCC AGGCTTGGCT AACGTCACAG TTTTAGCATT ACAATCCAAG ATTGCAAAAT 61 TTGGAGAAAG CCAAGTCATA CCCAGAATTA CATCAAAATC AACCATTTCT AAGATAACCA 121 AGTCTACATA AGTATTGCTC CCCACAAAAG TCACCAGACC AGACCTATAT ACTTTTTCAA 181 CTATCACAGA CTCACCCACC GGAGTAGAAA CACGAATAGG CATGTCAAGC AATTCATAAT 241 TTAATTTAAG ACCAGTAGCA AATGAGGAAG ATACATATGA AAATGTGGAT CCAGGGTCAA 301 ATAATACAAA AGCCATGCAA TCACAAAAAA A Predicted gene structure (within gDNA segment 13506 to 15370): Exon 1 14106 14436 ( 331 n); cDNA 1 331 ( 331 n); score: 0.955 MATCH C06HBa0153O03.1-1+ SGN-E352716+ 0.955 331 1.000 C PGS_C06HBa0153O03.1-1+_SGN-E352716+ (14106 14436) Alignment (genomic DNA sequence = upper lines): GATCTGTCCC AGGCTTGGCT AACATCACAG TTTTAGCATT ACAATCCAAG ATCGCAAAAT 14165 |||||||||| |||||||||| ||| |||||| |||||||||| |||||||||| || ||||||| GATCTGTCCC AGGCTTGGCT AACGTCACAG TTTTAGCATT ACAATCCAAG ATTGCAAAAT 60 TCGGAGAAAG CCAAGTCATA CCCAGAATTA CATCAAAATC AACCATTTCT AAGATAACCA 14225 | |||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGGAGAAAG CCAAGTCATA CCCAGAATTA CATCAAAATC AACCATTTCT AAGATAACCA 120 AGTCTACATA AGTATTGCTC CCCACAAAAG TCACCAGACC AGACCTATAT ACTTTTTCAA 14285 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGTCTACATA AGTATTGCTC CCCACAAAAG TCACCAGACC AGACCTATAT ACTTTTTCAA 180 CTATCACAGA CTCACCCACC GGAGTAGAAA CACGAATAGG CATGTCAAGC AATTCACAAT 14345 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||| ||| CTATCACAGA CTCACCCACC GGAGTAGAAA CACGAATAGG CATGTCAAGC AATTCATAAT 240 GTAAATTAAG ACCATTAGCA AATGAGGAAG ATACATAAGA AAATGTGGAT CCAGGATAAA 14405 ||| ||||| |||| ||||| |||||||||| ||||||| || |||||||||| ||||| | || TTAATTTAAG ACCAGTAGCA AATGAGGAAG ATACATATGA AAATGTGGAT CCAGGGTCAA 300 ACAATACATA AGCAATGCAA TCACAAACCA A 14436 | |||||| | ||| |||||| ||||||| | | ATAATACAAA AGCCATGCAA TCACAAAAAA A 331 hqPGS_C06HBa0153O03.1-1+_SGN-E352716+ (14106 14436) ******************************************************************************** EST sequence 44 +strand 559 n (File: SGN-E244046+) 1 AAACACGAAT AGGCATATCA AGTAATTCAC AATGTAAAGT TAGACCATTA GCAAATGAGG 61 AAGATACATA AGAAAATGTG GATCCAGGAT CAAACAATAC AGAAGCCATG CAATCACAAA 121 CCAAAAGATT ACCTGTGATG ACAACATCAG ATTCCTCCGC TTCAGACCGC CCAGGGAAAG 181 CGTAACAATG GGCCCTATCA TTCTTATGTC TGTTGCCCCT ACCATGTTGT GATGTAGTGG 241 CTCCAGTTTG CCCATCACCT CGGCCGTTTT GGTGACCACC ATTTCCTCGA CCGCCACGTC 301 CTCCAGAATA ACGGACTCTG CCATGACCAC CTCTACCTCT AACCTTTGGA GGTCTGTAAC 361 TCTGTTTTGG ACAATATCTC TTAATATGTC CAATCTCCCC ACACCCATAG CACTCTCTGG 421 GTTCATGCAT AAGTCTCTCA GAGAAGTGTT GACCGGTCGA AGGTGGACCC CCAACTACAG 481 TCTGTAGTGA AGACTGAATT GGTCGGACTG AGTAACTTCC CGAACCCTGT CCTCTAGTGT 541 AAGCACCATT AAACTCACC Predicted gene structure (within gDNA segment 13344 to 15679): Exon 1 14313 14871 ( 559 n); cDNA 1 559 ( 559 n); score: 0.882 MATCH C06HBa0153O03.1-1+ SGN-E244046+ 0.882 559 1.000 C PGS_C06HBa0153O03.1-1+_SGN-E244046+ (14313 14871) Alignment (genomic DNA sequence = upper lines): AAACACGAAT AGGCATGTCA AGCAATTCAC AATGTAAATT AAGACCATTA GCAAATGAGG 14372 |||||||||| |||||| ||| || ||||||| |||||||| | ||||||||| |||||||||| AAACACGAAT AGGCATATCA AGTAATTCAC AATGTAAAGT TAGACCATTA GCAAATGAGG 60 AAGATACATA AGAAAATGTG GATCCAGGAT AAAACAATAC ATAAGCAATG CAATCACAAA 14432 |||||||||| |||||||||| |||||||||| ||||||||| | |||| ||| |||||||||| AAGATACATA AGAAAATGTG GATCCAGGAT CAAACAATAC AGAAGCCATG CAATCACAAA 120 CCAAAAGATT ACCTGTGATG ACAGCATCCG ATGTCTCCGC TTCAGATCTC CCAGGGAAAG 14492 |||||||||| |||||||||| ||| |||| | || |||||| |||||| | | |||||||||| CCAAAAGATT ACCTGTGATG ACAACATCAG ATTCCTCCGC TTCAGACCGC CCAGGGAAAG 180 CATAACAACG GGCCCTATCA CCTGTCTGCC CGTTGCCCCT ACCATGTTGC GCTGCAGTAG 14552 | |||||| | |||||||||| | || | ||||||||| ||||||||| | || ||| | CGTAACAATG GGCCCTATCA TTCTTATGTC TGTTGCCCCT ACCATGTTGT GATGTAGTGG 240 TTCCCATTTG TCCGTCACCC CCGCCGTTTT GGTGACCACC ATTACCTCGG CCACCATGTC 14612 ||| |||| || ||||| | |||||||| |||||||||| ||| ||||| || ||| ||| CTCCAGTTTG CCCATCACCT CGGCCGTTTT GGTGACCACC ATTTCCTCGA CCGCCACGTC 300 CTCCAGAATA GCGGCCTCTA CCATGACCAC CTCTACCTCT AGCTATTGAG GGTCTATAAC 14672 |||||||||| ||| |||| |||||||||| |||||||||| | | ||| ||||| |||| CTCCAGAATA ACGGACTCTG CCATGACCAC CTCTACCTCT AACCTTTGGA GGTCTGTAAC 360 TCTATTTTGG ACAATTCCTC CTAATATGTC CAGTCTCCCC ACATCCATAA CACTCCCTGG 14732 ||| |||||| ||||| ||| ||||||||| || ||||||| ||| ||||| ||||| |||| TCTGTTTTGG ACAATATCTC TTAATATGTC CAATCTCCCC ACACCCATAG CACTCTCTGG 420 AGTCAAGCAT AGGTCTCTCA GAGAAGTGTT GACCGGTCTG AGGTGGACCC CCAACTACAG 14792 ||| |||| | |||||||| |||||||||| |||||||| |||||||||| |||||||||| GTTCATGCAT AAGTCTCTCA GAGAAGTGTT GACCGGTCGA AGGTGGACCC CCAACTACAG 480 TTTGTAGTGA AGACTGAATT GGTTGGACTG AGTAACCTCC TGAACCCTGT CCTCTAGAGT 14852 | |||||||| |||||||||| ||| |||||| |||||| ||| ||||||||| ||||||| || TCTGTAGTGA AGACTGAATT GGTCGGACTG AGTAACTTCC CGAACCCTGT CCTCTAGTGT 540 AAGAACCATT AAACTCACC 14871 ||| |||||| ||||||||| AAGCACCATT AAACTCACC 559 hqPGS_C06HBa0153O03.1-1+_SGN-E244046+ (14313 14871) ******************************************************************************** EST sequence 42 +strand 763 n (File: SGN-E214046+) 1 ACGAATAGGC ATATCAAGAA ATTCACAATG TAAATTTAGA CCATTAGCAA ATGAGGAAGA 61 TACATAAGAA AACGTGGATC CAGGATCAAA CAATACGGAA GCCATGCAAT CACAAACAAG 121 AAGATTACCT GTGATGACAG CATCAGATGC CTCCGCTTTA GACCGCCCAG GGAAAGCGTA 181 ACAATGGGCC CTATAGTTTG TTTGTCCATT GCCCCTACCA TGTCGTGATG TAGTGGCCCC 241 AGTTTGTCCA TTACCTCTGC CGTTTTGGTG ACCACCATTG CCTCGACCAC CACGTCCTCC 301 AAAATAACGG CCTCTACCAT GACAACCTCT ACCTCTAACT ATTGGGGGTC TATAACTTGG 361 TCCGGGAAAA TTTCTCCTAA TATGTCCAAT CTCCCCACAT CCATAACATT CTCTAGAGTC 421 ACTCATAGGC CCCTCGGAGA AGTGTTGACC GGTCTGAGGT GGACCCCCAA CTACAGTTTG 481 TAGTGAAGAC TGAATGGGTC GAACTGAGTA ACCTCCAGAA CCTTGTCCTC TAGAGTAAGA 541 ACCCTTAAAC TCATCTCCCC TTTGAAACCT CATCGATGTC GATGTCGTGG TGAAGTCGTC 601 TGGTTTCACT CCTTCCACTT CTATCAAAAA GTCTACCACT TCTTGGAAGG ATTTTGCCGT 661 TGCCGCTACC TGTAAGGGCG AAATCCGCAA TTCTGACCTC AACCCCTTCA CAACCGGCGA 721 ATCCGCTCTT GAGGGACTGA ACAGAGTTGA GTGGCATATC TGG Predicted gene structure (within gDNA segment 13546 to 16265): Exon 1 14317 15080 ( 764 n); cDNA 1 763 ( 763 n); score: 0.858 MATCH C06HBa0153O03.1-1+ SGN-E214046+ 0.858 764 1.001 C PGS_C06HBa0153O03.1-1+_SGN-E214046+ (14317 15080) Alignment (genomic DNA sequence = upper lines): ACGAATAGGC ATGTCAAGCA ATTCACAATG TAAATTAAGA CCATTAGCAA ATGAGGAAGA 14376 |||||||||| || ||||| | |||||||||| |||||| ||| |||||||||| |||||||||| ACGAATAGGC ATATCAAGAA ATTCACAATG TAAATTTAGA CCATTAGCAA ATGAGGAAGA 60 TACATAAGAA AATGTGGATC CAGGATAAAA CAATACATAA GCAATGCAAT CACAAACCAA 14436 |||||||||| || ||||||| |||||| ||| |||||| || || ||||||| ||||||| | TACATAAGAA AACGTGGATC CAGGATCAAA CAATACGGAA GCCATGCAAT CACAAACAAG 120 AAGATTACCT GTGATGACAG CATCCGATGT CTCCGCTTCA GATCTCCCAG GGAAAGCATA 14496 |||||||||| |||||||||| |||| |||| |||||||| | || | ||||| ||||||| || AAGATTACCT GTGATGACAG CATCAGATGC CTCCGCTTTA GACCGCCCAG GGAAAGCGTA 180 ACAACGGGCC CTATCACCTG TCTGCCCGTT GCCCCTACCA TGTTGCGCTG CAGTAGTTCC 14556 |||| ||||| |||| || | || || || |||||||||| ||| | | || ||| | || ACAATGGGCC CTATAGTTTG TTTGTCCATT GCCCCTACCA TGTCGTGATG TAGTGGCCCC 240 CATTTGTCCG TCACCCCCGC CGTTTTGGTG ACCACCATTA CCTCGGCCAC CATGTCCTCC 14616 ||||||| | ||| | || |||||||||| ||||||||| ||||| |||| || ||||||| AGTTTGTCCA TTACCTCTGC CGTTTTGGTG ACCACCATTG CCTCGACCAC CACGTCCTCC 300 AGAATAGCGG CCTCTACCAT GACCACCTCT ACCTCTAGCT ATTGAGGGTC TATAACTCTA 14676 | |||| ||| |||||||||| ||| |||||| ||||||| || |||| ||||| ||||||| AAAATAACGG CCTCTACCAT GACAACCTCT ACCTCTAACT ATTGGGGGTC TATAACTTGG 360 TTTTGGACAA TTCCTCCTAA TATGTCCAGT CTCCCCACAT CCATAACACT CCCTGGAGTC 14736 | ||| || || ||||||| |||||||| | |||||||||| |||||||| | | || ||||| TCCGGGAAAA TTTCTCCTAA TATGTCCAAT CTCCCCACAT CCATAACATT CTCTAGAGTC 420 AAGCATAGGT CTCTCAGAGA AGTGTTGACC GGTCTGAGGT GGACCCCCAA CTACAGTTTG 14796 | |||||| | ||| |||| |||||||||| |||||||||| |||||||||| |||||||||| ACTCATAGGC CCCTCGGAGA AGTGTTGACC GGTCTGAGGT GGACCCCCAA CTACAGTTTG 480 TAGTGAAGAC TGAATTGGTT GGACTGAGTA ACCTCCTGAA CCCTGTCCTC TAGAGTAAGA 14856 |||||||||| ||||| ||| | |||||||| |||||| ||| || ||||||| |||||||||| TAGTGAAGAC TGAATGGGTC GAACTGAGTA ACCTCCAGAA CCTTGTCCTC TAGAGTAAGA 540 ACCATTAAAC TCACCTCCCT TTCGAAGAAT TTTTGATGAC ATTGCCATGG TGAAGTCATC 14916 ||| |||||| ||| ||||| || ||| | | |||| | || | ||| ||||||| || ACCCTTAAAC TCATCTCCCC TTTGAAACCT CATCGATGTC GATGTCGTGG TGAAGTCGTC 600 TGGCTTCACT CCTTCTACCT CTATCACGAA GTCTACCTCT TCTTGAAAAG ATTTTGCCGT 14976 ||| |||||| ||||| || | |||||| || ||||||| || ||||| || | |||||||||| TGGTTTCACT CCTTCCACTT CTATCAAAAA GTCTACCACT TCTTGGAAGG ATTTTGCCGT 660 AGCCGCTACC TGTAAGGCTG AAATCCGCAA TTCTGACCTT AACCCCTTCA TAAAATGGTG 15036 ||||||||| ||||||| | |||||||||| ||||||||| |||||||||| || || | TGCCGCTACC TGTAAGGGCG AAATCCGCAA TTCTGACCTC AACCCCTTCA -CAACCGGCG 719 AATCCGCTCT TG-TGGACTG AAATAAAGTT GGGTGGCATA CCTGG 15080 |||||||||| || |||||| || | |||| | |||||||| |||| AATCCGCTCT TGAGGGACTG -AACAGAGTT GAGTGGCATA TCTGG 763 hqPGS_C06HBa0153O03.1-1+_SGN-E214046+ (14317 15080) ******************************************************************************** EST sequence 90 +strand 790 n (File: SGN-E356912+) 1 CAGGATCAAA CAATACGGAA GCCATGCAAT CACAAACTAG AAGATTACCT GTGATGACAG 61 CATCAGATGC CTCCGCTTCA GACCGCCCAG GGAAAGCGTA ACAATGGGCC CTATCGTTTG 121 TCTGCCCATT GCCCCTACCA TGTTGTGATG TAGTGGCCCC AGTTTGCCCA TTACCTCTGC 181 CGTTTTGGTG ACCACCATTA CCTCGACCAC CACGTCCTCC AGAATAACGG CCTCTACCAT 241 GACCACCTCT ACCTCTAGCT ATTGGGGGTC TATAACTTGG TCCAGGACAA TTTATCCTAA 301 TATGTCCAAT CTCCCCACAT CCATAACATT CTCTGGAGTC ACTCATAGGC CCCTTGGAGA 361 AGTGTTGACC GGTCTGAGGT GGACCCCCAA CTACAGTCTG TAGTGAAGAC TGAATGGGTC 421 GAGCCGAGTA ACCTCCGGAA CCTTGTCCTC TAGAGTAAGA ACCCTTAAAC TCACCTCCCT 481 TTCGAAACCT CTTTGATGTT GATGTCGTGG TGAAGTCGTC TGGCTTCACT CCTTCCACTT 541 CTATCACAAA GTCTACCACT TCTTGGAAGG ATTTTGCCGT TGCCGCTACC TGTAAGGCCG 601 AAATCCGCAA TTCTGACCTC AACCCCTTCA CAAACCGGAG AATCCGCTCT TGAGGACTGA 661 AACAGAGTTG AGTGGCATAT CCGGATAGTG CACGGAACTT AGCCTCATAA GCATTAACCG 721 ACATCCTACC TTGCTCTAGG CTCAGGAACT CATCTCTTTT CCTATCCCTC AAAGTCCGGG 781 GGATATACTT Predicted gene structure (within gDNA segment 11754 to 15931): Exon 1 14397 15186 ( 790 n); cDNA 1 790 ( 790 n); score: 0.878 MATCH C06HBa0153O03.1-1+ SGN-E356912+ 0.878 790 1.000 C PGS_C06HBa0153O03.1-1+_SGN-E356912+ (14397 15186) Alignment (genomic DNA sequence = upper lines): CAGGATAAAA CAATACATAA GCAATGCAAT CACAAACCAA AAGATTACCT GTGATGACAG 14456 |||||| ||| |||||| || || ||||||| ||||||| | |||||||||| |||||||||| CAGGATCAAA CAATACGGAA GCCATGCAAT CACAAACTAG AAGATTACCT GTGATGACAG 60 CATCCGATGT CTCCGCTTCA GATCTCCCAG GGAAAGCATA ACAACGGGCC CTATCACCTG 14516 |||| |||| |||||||||| || | ||||| ||||||| || |||| ||||| ||||| || CATCAGATGC CTCCGCTTCA GACCGCCCAG GGAAAGCGTA ACAATGGGCC CTATCGTTTG 120 TCTGCCCGTT GCCCCTACCA TGTTGCGCTG CAGTAGTTCC CATTTGTCCG TCACCCCCGC 14576 ||||||| || |||||||||| ||||| | || ||| | || |||| || | ||| | || TCTGCCCATT GCCCCTACCA TGTTGTGATG TAGTGGCCCC AGTTTGCCCA TTACCTCTGC 180 CGTTTTGGTG ACCACCATTA CCTCGGCCAC CATGTCCTCC AGAATAGCGG CCTCTACCAT 14636 |||||||||| |||||||||| ||||| |||| || ||||||| |||||| ||| |||||||||| CGTTTTGGTG ACCACCATTA CCTCGACCAC CACGTCCTCC AGAATAACGG CCTCTACCAT 240 GACCACCTCT ACCTCTAGCT ATTGAGGGTC TATAACTCTA TTTTGGACAA TTCCTCCTAA 14696 |||||||||| |||||||||| |||| ||||| ||||||| | |||||| || |||||| GACCACCTCT ACCTCTAGCT ATTGGGGGTC TATAACTTGG TCCAGGACAA TTTATCCTAA 300 TATGTCCAGT CTCCCCACAT CCATAACACT CCCTGGAGTC AAGCATAGGT CTCTCAGAGA 14756 |||||||| | |||||||||| |||||||| | | |||||||| | |||||| | || |||| TATGTCCAAT CTCCCCACAT CCATAACATT CTCTGGAGTC ACTCATAGGC CCCTTGGAGA 360 AGTGTTGACC GGTCTGAGGT GGACCCCCAA CTACAGTTTG TAGTGAAGAC TGAATTGGTT 14816 |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| ||||| ||| AGTGTTGACC GGTCTGAGGT GGACCCCCAA CTACAGTCTG TAGTGAAGAC TGAATGGGTC 420 GGACTGAGTA ACCTCCTGAA CCCTGTCCTC TAGAGTAAGA ACCATTAAAC TCACCTCCCT 14876 | | ||||| |||||| ||| || ||||||| |||||||||| ||| |||||| |||||||||| GAGCCGAGTA ACCTCCGGAA CCTTGTCCTC TAGAGTAAGA ACCCTTAAAC TCACCTCCCT 480 TTCGAAGAAT TTTTGATGAC ATTGCCATGG TGAAGTCATC TGGCTTCACT CCTTCTACCT 14936 |||||| | ||||||| || | ||| ||||||| || |||||||||| ||||| || | TTCGAAACCT CTTTGATGTT GATGTCGTGG TGAAGTCGTC TGGCTTCACT CCTTCCACTT 540 CTATCACGAA GTCTACCTCT TCTTGAAAAG ATTTTGCCGT AGCCGCTACC TGTAAGGCTG 14996 ||||||| || ||||||| || ||||| || | |||||||||| ||||||||| |||||||| | CTATCACAAA GTCTACCACT TCTTGGAAGG ATTTTGCCGT TGCCGCTACC TGTAAGGCCG 600 AAATCCGCAA TTCTGACCTT AACCCCTTCA TAAAATGGTG AATCCGCTCT TGTGGACTGA 15056 |||||||||| ||||||||| |||||||||| ||| || | |||||||||| || ||||||| AAATCCGCAA TTCTGACCTC AACCCCTTCA CAAACCGGAG AATCCGCTCT TGAGGACTGA 660 AATAAAGTTG GGTGGCATAC CTGGATAATG CACGAAACTT AGCCTCATAT GCGGTAACCG 15116 || | ||||| |||||||| | ||||| || |||| ||||| ||||||||| || |||||| AACAGAGTTG AGTGGCATAT CCGGATAGTG CACGGAACTT AGCCTCATAA GCATTAACCG 720 ACATCCTACC TTGCTCTAGG CTCAAGAACT CATCTCTTTT CCTATCCCTC AAAGTACGGG 15176 |||||||||| |||||||||| |||| ||||| |||||||||| |||||||||| ||||| |||| ACATCCTACC TTGCTCTAGG CTCAGGAACT CATCTCTTTT CCTATCCCTC AAAGTCCGGG 780 GGATATACTT 15186 |||||||||| GGATATACTT 790 hqPGS_C06HBa0153O03.1-1+_SGN-E356912+ (14397 15186) ******************************************************************************** EST sequence 36 +strand 533 n (File: SGN-E353805+) 1 GGATCAACAA TACGGAAGCC ATGCAATCAC AAACTAGAAG ATTACCTGTG ATGACAGCAT 61 CAGATGCCTC CGCTTCAGAC CGCCCAGGGA AAGCGTAACA ATGGGCCCTA TCGTTTGTCT 121 GCCCATTGCC CCTACCATGT TGTGATGTAG TGGCCCCAGT TTGCCCATTA CCTCTGCCGT 181 TTTGGTGACC ACCATTACCT CGACCACCAC GTCCTCCAGA ATAACGGCCT CTACCATGAC 241 CACCTCTACC TCTAGCTATT GGGGGTCTAT AACTTGGTCC AGGACAATTT ATCCTAATAT 301 GTCCAATCTC CCCACATCCA TAACATTCTC TGGAGTCACT CATAGGCCCC TTGGAGAAGT 361 GTTGACCGGT CTGAGGTGGA CCCCCAACTA CAGTCTGTAG TGAAGACTGA ATGGGTCGAG 421 CCGAGTAACC TCCGGAACCT TGTCCTCTAA AGTAAGAACC CTTAAACTCA CCTCCCTTTC 481 GAAACCTCTT TGATGTTGAT GTCGTGGTGA AGTCGTCTGG CTTCACTCCT TCC Predicted gene structure (within gDNA segment 11784 to 16631): Exon 1 14405 14931 ( 527 n); cDNA 6 532 ( 527 n); score: 0.867 MATCH C06HBa0153O03.1-1+ SGN-E353805+ 0.867 527 0.989 C PGS_C06HBa0153O03.1-1+_SGN-E353805+ (14405 14931) Alignment (genomic DNA sequence = upper lines): AACAATACAT AAGCAATGCA ATCACAAACC AAAAGATTAC CTGTGATGAC AGCATCCGAT 14464 |||||||| |||| ||||| ||||||||| | |||||||| |||||||||| |||||| ||| AACAATACGG AAGCCATGCA ATCACAAACT AGAAGATTAC CTGTGATGAC AGCATCAGAT 65 GTCTCCGCTT CAGATCTCCC AGGGAAAGCA TAACAACGGG CCCTATCACC TGTCTGCCCG 14524 | |||||||| |||| | ||| ||||||||| |||||| ||| ||||||| ||||||||| GCCTCCGCTT CAGACCGCCC AGGGAAAGCG TAACAATGGG CCCTATCGTT TGTCTGCCCA 125 TTGCCCCTAC CATGTTGCGC TGCAGTAGTT CCCATTTGTC CGTCACCCCC GCCGTTTTGG 14584 |||||||||| ||||||| | || ||| | || |||| | | | ||| | |||||||||| TTGCCCCTAC CATGTTGTGA TGTAGTGGCC CCAGTTTGCC CATTACCTCT GCCGTTTTGG 185 TGACCACCAT TACCTCGGCC ACCATGTCCT CCAGAATAGC GGCCTCTACC ATGACCACCT 14644 |||||||||| ||||||| || |||| ||||| |||||||| | |||||||||| |||||||||| TGACCACCAT TACCTCGACC ACCACGTCCT CCAGAATAAC GGCCTCTACC ATGACCACCT 245 CTACCTCTAG CTATTGAGGG TCTATAACTC TATTTTGGAC AATTCCTCCT AATATGTCCA 14704 |||||||||| |||||| ||| ||||||||| | |||| |||| |||| |||||||||| CTACCTCTAG CTATTGGGGG TCTATAACTT GGTCCAGGAC AATTTATCCT AATATGTCCA 305 GTCTCCCCAC ATCCATAACA CTCCCTGGAG TCAAGCATAG GTCTCTCAGA GAAGTGTTGA 14764 ||||||||| |||||||||| || |||||| ||| ||||| | | || || |||||||||| ATCTCCCCAC ATCCATAACA TTCTCTGGAG TCACTCATAG GCCCCTTGGA GAAGTGTTGA 365 CCGGTCTGAG GTGGACCCCC AACTACAGTT TGTAGTGAAG ACTGAATTGG TTGGACTGAG 14824 |||||||||| |||||||||| ||||||||| |||||||||| ||||||| || | | | ||| CCGGTCTGAG GTGGACCCCC AACTACAGTC TGTAGTGAAG ACTGAATGGG TCGAGCCGAG 425 TAACCTCCTG AACCCTGTCC TCTAGAGTAA GAACCATTAA ACTCACCTCC CTTTCGAAGA 14884 |||||||| | |||| ||||| |||| ||||| ||||| |||| |||||||||| |||||||| TAACCTCCGG AACCTTGTCC TCTAAAGTAA GAACCCTTAA ACTCACCTCC CTTTCGAAAC 485 ATTTTTGATG ACATTGCCAT GGTGAAGTCA TCTGGCTTCA CTCCTTC 14931 | ||||||| || | | ||||||||| |||||||||| ||||||| CTCTTTGATG TTGATGTCGT GGTGAAGTCG TCTGGCTTCA CTCCTTC 532 hqPGS_C06HBa0153O03.1-1+_SGN-E353805+ (14405 14931) ******************************************************************************** EST sequence 84 +strand 698 n (File: SGN-E356209+) 1 TTCAGACCGC CCAGGGAAAG CGTAACAATG GGCCCTATCG TTCGTCTGTC CGTTGCCCCT 61 ACCATGTTGT GATGTAGTGG CCCCAGTTTG CCCATTACCT CTGCCATTTT GGTGACCACC 121 ATTACCTCGA CCACCACGTC CTCCAGAATA ACGGCCTCTA CCATGACCAC CTCTACCTCT 181 AGCTATTGGG GGTCTATAAC TTTGTCTGGG ACAATTTTTC CTAATATGTC CAATCTCCCC 241 ACATCCATAA CATTCTCTGG AGTCAAGCAT AGGCCCCTCG GAGAAGTGTT AACCGGTCTG 301 AGGTGGTCTC CCAACTACAG TCTGTAGTGA AGACTGAATT GGTCGGACTG AGTAACTTCC 361 CGAACCCTGT CCTCTAGTGT AAGAACCATT AAACTCACCT CCCTTCCGAA ACCTTTTTGA 421 TGTCGATGTT GTGGTGAAGT CGTCTGGCTT CACTCCTTCC ACTTCTATCA CAAAGTCTAC 481 CACTTCTTGG AAGGATTTTG CCGTTGCCGC TATCTGTAAG GACGAAATCC GCAATTCTGA 541 CCTCAACCCC TTCACAAACC GGCGAATCCG CTCTTGAGGA CTGAAACAGA GTTGAGTGGC 601 ATATCTGGAT AGTGCACGAA ACTTAGCCTC ATAAGCATTA ACCGACATCC TACCTTGCTC 661 TAGGCTCAAG AACTCATCCC TTTTCCTATC CCTCAAAG Predicted gene structure (within gDNA segment 12514 to 15780): Exon 1 14473 15170 ( 698 n); cDNA 1 698 ( 698 n); score: 0.881 MATCH C06HBa0153O03.1-1+ SGN-E356209+ 0.881 698 1.000 C PGS_C06HBa0153O03.1-1+_SGN-E356209+ (14473 15170) Alignment (genomic DNA sequence = upper lines): TTCAGATCTC CCAGGGAAAG CATAACAACG GGCCCTATCA CCTGTCTGCC CGTTGCCCCT 14532 |||||| | | |||||||||| | |||||| | ||||||||| ||||| | |||||||||| TTCAGACCGC CCAGGGAAAG CGTAACAATG GGCCCTATCG TTCGTCTGTC CGTTGCCCCT 60 ACCATGTTGC GCTGCAGTAG TTCCCATTTG TCCGTCACCC CCGCCGTTTT GGTGACCACC 14592 ||||||||| | || ||| | || |||| || | ||| | ||| |||| |||||||||| ACCATGTTGT GATGTAGTGG CCCCAGTTTG CCCATTACCT CTGCCATTTT GGTGACCACC 120 ATTACCTCGG CCACCATGTC CTCCAGAATA GCGGCCTCTA CCATGACCAC CTCTACCTCT 14652 ||||||||| |||||| ||| |||||||||| ||||||||| |||||||||| |||||||||| ATTACCTCGA CCACCACGTC CTCCAGAATA ACGGCCTCTA CCATGACCAC CTCTACCTCT 180 AGCTATTGAG GGTCTATAAC TCTATTTTGG ACAATTCCTC CTAATATGTC CAGTCTCCCC 14712 |||||||| | |||||||||| | | | | || |||||| || |||||||||| || ||||||| AGCTATTGGG GGTCTATAAC TTTGTCTGGG ACAATTTTTC CTAATATGTC CAATCTCCCC 240 ACATCCATAA CACTCCCTGG AGTCAAGCAT AGGTCTCTCA GAGAAGTGTT GACCGGTCTG 14772 |||||||||| || || |||| |||||||||| ||| | ||| |||||||||| ||||||||| ACATCCATAA CATTCTCTGG AGTCAAGCAT AGGCCCCTCG GAGAAGTGTT AACCGGTCTG 300 AGGTGGACCC CCAACTACAG TTTGTAGTGA AGACTGAATT GGTTGGACTG AGTAACCTCC 14832 |||||| | | |||||||||| | |||||||| |||||||||| ||| |||||| |||||| ||| AGGTGGTCTC CCAACTACAG TCTGTAGTGA AGACTGAATT GGTCGGACTG AGTAACTTCC 360 TGAACCCTGT CCTCTAGAGT AAGAACCATT AAACTCACCT CCCTTTCGAA GAATTTTTGA 14892 ||||||||| ||||||| || |||||||||| |||||||||| ||||| |||| ||||||| CGAACCCTGT CCTCTAGTGT AAGAACCATT AAACTCACCT CCCTTCCGAA ACCTTTTTGA 420 TGACATTGCC ATGGTGAAGT CATCTGGCTT CACTCCTTCT ACCTCTATCA CGAAGTCTAC 14952 || | || ||||||||| | |||||||| ||||||||| || ||||||| | |||||||| TGTCGATGTT GTGGTGAAGT CGTCTGGCTT CACTCCTTCC ACTTCTATCA CAAAGTCTAC 480 CTCTTCTTGA AAAGATTTTG CCGTAGCCGC TACCTGTAAG GCTGAAATCC GCAATTCTGA 15012 | ||||||| || ||||||| |||| ||||| || ||||||| | ||||||| |||||||||| CACTTCTTGG AAGGATTTTG CCGTTGCCGC TATCTGTAAG GACGAAATCC GCAATTCTGA 540 CCTTAACCCC TTCATAAAAT GGTGAATCCG CTCTTGTGGA CTGAAATAAA GTTGGGTGGC 15072 ||| |||||| |||| ||| || ||||||| |||||| ||| |||||| | | |||| ||||| CCTCAACCCC TTCACAAACC GGCGAATCCG CTCTTGAGGA CTGAAACAGA GTTGAGTGGC 600 ATACCTGGAT AATGCACGAA ACTTAGCCTC ATATGCGGTA ACCGACATCC TACCTTGCTC 15132 ||| |||||| | |||||||| |||||||||| ||| || || |||||||||| |||||||||| ATATCTGGAT AGTGCACGAA ACTTAGCCTC ATAAGCATTA ACCGACATCC TACCTTGCTC 660 TAGGCTCAAG AACTCATCTC TTTTCCTATC CCTCAAAG 15170 |||||||||| |||||||| | |||||||||| |||||||| TAGGCTCAAG AACTCATCCC TTTTCCTATC CCTCAAAG 698 hqPGS_C06HBa0153O03.1-1+_SGN-E356209+ (14473 15170) ******************************************************************************** EST sequence 76 +strand 713 n (File: SGN-E349404+) 1 GGGGTTCTGT CCTCTAGAAT AAGAACCATT ATACTCTCCT CCCCTTCTAA ACCTCTTCGA 61 TGTCGATGTC GTGGTGAAGT CATCCGGTTT CACTCCTTCC ACCTCAATCA CAAAGTCTAC 121 CACCTCTTGA AAGGATGTTG CTGTTGCCGC TATCTGTAAG GCTGAAATCC GCAATTCTGA 181 TCTCAACCCC TTCACAAAAC GGCGGATCCG TTCTTGTGGA CTAAAACAGA GTTGGGTGGC 241 GTATCTGGAT AGTGCGCGAA ATTTAGCCTC ATACGCGTTT ACTGTCATCC TTCCCTGTTC 301 AAGACTCAAG AACTCATCCC TTTTCCTATC TCTCAAAGTC CTTGGGATAT ATTTCTCCAT 361 GAACAAACTA GAGAATGATT CCCAAGTCAT AGGTGGTGCC TCTGTTGGTT GACACTCAAC 421 ATGTGAACGC CACCACATCT TGGCGTTCCC TTGGAACTGA TAGCTCACGA ACTCGACGCC 481 AAACCTCTCT ACTATCCCCA TTTTGTGTAG TAGCTCATGA CAATCAACCA GAAAATCATA 541 AGCATCTTCA GATTCTGCAC CCTTGAAGAC TGGAGGTTTC AGTTTCAAGA ACTTACTGAA 601 AAGTTCATGT TGATCGTTTG TCATAATAGG CCCAGTAGTC AGACGTGGAA ATGTGCCTAT 661 TCCCAATGGT GCATCTATGC GGGGAGCCAT AGTAGCCGCA TGTCGTATCT CTG Predicted gene structure (within gDNA segment 10813 to 17091): Exon 1 14839 15543 ( 705 n); cDNA 7 711 ( 705 n); score: 0.867 MATCH C06HBa0153O03.1-1+ SGN-E349404+ 0.867 705 0.989 C PGS_C06HBa0153O03.1-1+_SGN-E349404+ (14839 15543) Alignment (genomic DNA sequence = upper lines): CTGTCCTCTA GAGTAAGAAC CATTAAACTC ACCTCCCTTT CGAAGAATTT TTGATGACAT 14898 |||||||||| || ||||||| ||||| |||| |||||| || | || | | | |||| | CTGTCCTCTA GAATAAGAAC CATTATACTC TCCTCCCCTT CTAAACCTCT TCGATGTCGA 66 TGCCATGGTG AAGTCATCTG GCTTCACTCC TTCTACCTCT ATCACGAAGT CTACCTCTTC 14958 || | ||||| |||||||| | | |||||||| ||| ||||| ||||| |||| ||||| | || TGTCGTGGTG AAGTCATCCG GTTTCACTCC TTCCACCTCA ATCACAAAGT CTACCACCTC 126 TTGAAAAGAT TTTGCCGTAG CCGCTACCTG TAAGGCTGAA ATCCGCAATT CTGACCTTAA 15018 |||||| ||| |||| || | |||||| ||| |||||||||| |||||||||| |||| || || TTGAAAGGAT GTTGCTGTTG CCGCTATCTG TAAGGCTGAA ATCCGCAATT CTGATCTCAA 186 CCCCTTCATA AAATGGTGAA TCCGCTCTTG TGGACTGAAA TAAAGTTGGG TGGCATACCT 15078 |||||||| | ||| || | | |||| ||||| |||||| ||| | ||||||| |||| || || CCCCTTCACA AAACGGCGGA TCCGTTCTTG TGGACTAAAA CAGAGTTGGG TGGCGTATCT 246 GGATAATGCA CGAAACTTAG CCTCATATGC GGTAACCGAC ATCCTACCTT GCTCTAGGCT 15138 ||||| ||| ||||| |||| ||||||| || | | || | | ||||| || | | || || || GGATAGTGCG CGAAATTTAG CCTCATACGC GTTTACTGTC ATCCTTCCCT GTTCAAGACT 306 CAAGAACTCA TCTCTTTTCC TATCCCTCAA AGTACGGGGG ATATACTTCT CCATAAACAA 15198 |||||||||| || ||||||| |||| ||||| ||| | ||| ||||| |||| |||| ||||| CAAGAACTCA TCCCTTTTCC TATCTCTCAA AGTCCTTGGG ATATATTTCT CCATGAACAA 366 GCTAGAGAAT GATGCCCAAG TCATAGGTGG TGCCTCTGTT GGTTGACACT CAACATGTGA 15258 ||||||||| ||| |||||| |||||||||| |||||||||| |||||||||| |||||||||| ACTAGAGAAT GATTCCCAAG TCATAGGTGG TGCCTCTGTT GGTTGACACT CAACATGTGA 426 CCGCCACCAC ATTTTGGCGT TCCCTTGGAA CTGATAAGTC ACAAACTCAA CACCAAACCG 15318 ||||||||| || ||||||| |||||||||| |||||| || || ||||| | | ||||||| ACGCCACCAC ATCTTGGCGT TCCCTTGGAA CTGATAGCTC ACGAACTCGA CGCCAAACCT 486 TTCTACTATA CCCATCTTGT GTAGTAGCTC ATGACAGTCA ACCAGAAAAT CGTAGGCATC 15378 |||||||| ||||| |||| |||||||||| |||||| ||| |||||||||| | || ||||| CTCTACTATC CCCATTTTGT GTAGTAGCTC ATGACAATCA ACCAGAAAAT CATAAGCATC 546 CTCAGATTCA GCACCCTTGA AGACTGGAGG TTTCAATTTC AAGAACTTAC TGAAAAGTTC 15438 |||||||| |||||||||| |||||||||| ||||| |||| |||||||||| |||||||||| TTCAGATTCT GCACCCTTGA AGACTGGAGG TTTCAGTTTC AAGAACTTAC TGAAAAGTTC 606 ATGCTGATCA TTTGTCATTA TAGGCCCTGT AGTTAGACGA GGAAACATGT CTATTTCCAA 15498 ||| ||||| |||||||| | ||||||| || ||| ||||| ||||| || ||||| |||| ATGTTGATCG TTTGTCATAA TAGGCCCAGT AGTCAGACGT GGAAATGTGC CTATTCCCAA 666 TGAGGCATCC ATGCGGGGAG CCACAGTAGC CGCATGTTGT ACCTC 15543 || ||||| |||||||||| ||| |||||| ||||||| || | ||| TGGTGCATCT ATGCGGGGAG CCATAGTAGC CGCATGTCGT ATCTC 711 hqPGS_C06HBa0153O03.1-1+_SGN-E349404+ (14839 15543) ******************************************************************************** EST sequence 47 +strand 679 n (File: SGN-E351625+) 1 GGGGTTCTGT CCTCTAGAAT AAGAACCATT ATACTCTCCT CCCCTTCTAA ACCTCTTCGA 61 TGTCGATGTC GTGGTGAAGT CATCCGGTTT CACTCCTTCC ACCTCAATCA CAAAGTCTAC 121 CACCTCTTGA AAGGATGTTG CTGTTGCCGC TATCTGTAAG GCTGAAATCC GCAATTCTGA 181 TCTCAACCCC TTCACAAAAC GGCGGATCCG TTCTTGTGGA CTAAAACAGA GTTGGGTGGC 241 GTATCTGGAT AGTGCGCGAA ATTTAGCCTC ATACGCGTTT ACTGTCATCC TTCCCTGTTC 301 AAGACTCAAG AACTCATCCC TTTTCCTATC TCTCAAAGTC CTTGGGATAT ATTTCTCCAT 361 GAACAAACTA GAGAATGATT CCCAAGTCAT AGGTGGTGCC TCTGTTGGTT GACACTCAAC 421 ATGTGAACGC CACCACATCT TGGCGTTCCC TTGGAACTGA TAGCTCACGA ACTCGACGCC 481 AAACCTCTCT ACTATCCCCA TTTTGTGTAG TAGCTCATGA CAATCAACCA GAAAATCATA 541 AGCATCTTCA GATTCTGCAC CCTTGAAGAC TGGAGGTTTC AGTTTCAAGA ACTTACTGAA 601 AAGTTCATGT TGATCGTTTG TCATAATAAG CCCAGTAGTC AGACGTGGAA ATGTGCCTAT 661 TCCCAATGGT GCATCTATG Predicted gene structure (within gDNA segment 10813 to 16751): Exon 1 14839 15511 ( 673 n); cDNA 7 679 ( 673 n); score: 0.863 MATCH C06HBa0153O03.1-1+ SGN-E351625+ 0.863 673 0.991 C PGS_C06HBa0153O03.1-1+_SGN-E351625+ (14839 15511) Alignment (genomic DNA sequence = upper lines): CTGTCCTCTA GAGTAAGAAC CATTAAACTC ACCTCCCTTT CGAAGAATTT TTGATGACAT 14898 |||||||||| || ||||||| ||||| |||| |||||| || | || | | | |||| | CTGTCCTCTA GAATAAGAAC CATTATACTC TCCTCCCCTT CTAAACCTCT TCGATGTCGA 66 TGCCATGGTG AAGTCATCTG GCTTCACTCC TTCTACCTCT ATCACGAAGT CTACCTCTTC 14958 || | ||||| |||||||| | | |||||||| ||| ||||| ||||| |||| ||||| | || TGTCGTGGTG AAGTCATCCG GTTTCACTCC TTCCACCTCA ATCACAAAGT CTACCACCTC 126 TTGAAAAGAT TTTGCCGTAG CCGCTACCTG TAAGGCTGAA ATCCGCAATT CTGACCTTAA 15018 |||||| ||| |||| || | |||||| ||| |||||||||| |||||||||| |||| || || TTGAAAGGAT GTTGCTGTTG CCGCTATCTG TAAGGCTGAA ATCCGCAATT CTGATCTCAA 186 CCCCTTCATA AAATGGTGAA TCCGCTCTTG TGGACTGAAA TAAAGTTGGG TGGCATACCT 15078 |||||||| | ||| || | | |||| ||||| |||||| ||| | ||||||| |||| || || CCCCTTCACA AAACGGCGGA TCCGTTCTTG TGGACTAAAA CAGAGTTGGG TGGCGTATCT 246 GGATAATGCA CGAAACTTAG CCTCATATGC GGTAACCGAC ATCCTACCTT GCTCTAGGCT 15138 ||||| ||| ||||| |||| ||||||| || | | || | | ||||| || | | || || || GGATAGTGCG CGAAATTTAG CCTCATACGC GTTTACTGTC ATCCTTCCCT GTTCAAGACT 306 CAAGAACTCA TCTCTTTTCC TATCCCTCAA AGTACGGGGG ATATACTTCT CCATAAACAA 15198 |||||||||| || ||||||| |||| ||||| ||| | ||| ||||| |||| |||| ||||| CAAGAACTCA TCCCTTTTCC TATCTCTCAA AGTCCTTGGG ATATATTTCT CCATGAACAA 366 GCTAGAGAAT GATGCCCAAG TCATAGGTGG TGCCTCTGTT GGTTGACACT CAACATGTGA 15258 ||||||||| ||| |||||| |||||||||| |||||||||| |||||||||| |||||||||| ACTAGAGAAT GATTCCCAAG TCATAGGTGG TGCCTCTGTT GGTTGACACT CAACATGTGA 426 CCGCCACCAC ATTTTGGCGT TCCCTTGGAA CTGATAAGTC ACAAACTCAA CACCAAACCG 15318 ||||||||| || ||||||| |||||||||| |||||| || || ||||| | | ||||||| ACGCCACCAC ATCTTGGCGT TCCCTTGGAA CTGATAGCTC ACGAACTCGA CGCCAAACCT 486 TTCTACTATA CCCATCTTGT GTAGTAGCTC ATGACAGTCA ACCAGAAAAT CGTAGGCATC 15378 |||||||| ||||| |||| |||||||||| |||||| ||| |||||||||| | || ||||| CTCTACTATC CCCATTTTGT GTAGTAGCTC ATGACAATCA ACCAGAAAAT CATAAGCATC 546 CTCAGATTCA GCACCCTTGA AGACTGGAGG TTTCAATTTC AAGAACTTAC TGAAAAGTTC 15438 |||||||| |||||||||| |||||||||| ||||| |||| |||||||||| |||||||||| TTCAGATTCT GCACCCTTGA AGACTGGAGG TTTCAGTTTC AAGAACTTAC TGAAAAGTTC 606 ATGCTGATCA TTTGTCATTA TAGGCCCTGT AGTTAGACGA GGAAACATGT CTATTTCCAA 15498 ||| ||||| |||||||| | || |||| || ||| ||||| ||||| || ||||| |||| ATGTTGATCG TTTGTCATAA TAAGCCCAGT AGTCAGACGT GGAAATGTGC CTATTCCCAA 666 TGAGGCATCC ATG 15511 || ||||| ||| TGGTGCATCT ATG 679 hqPGS_C06HBa0153O03.1-1+_SGN-E351625+ (14839 15511) ******************************************************************************** EST sequence 92 +strand 612 n (File: SGN-E357065+) 1 GGGTTCTGTC CTCTAGAATA AGAACCATTA TACTCTCCTC CCCTTCTAAA CCTCTTCGAT 61 GTCGATGTCG TGGTGAAGTC ATCCGGTTTC ACTCCTTCCA CCTCAATCAC AAAGTCTACC 121 ACCTCTTGAA AGGATGTTGC TGTTGCCGCT ATCTGTAAGG CTGAAATCCG CAATTCTGAT 181 CTCAACCCCT TCACAAAACG GCGGATCCGT TCTTGTGGAC TAAAACAGAG TTGGGTGGCG 241 TATCTGGATA GTGCGCGAAA TTTAGCCTCA TACGCGTTTA CTGTCATCCT TCCCTGTTCA 301 AGACTCAAGA ACTCATCCCT TTTCCTATCT CTCAAAGTCC TTGGGATATA TTTCTCCATG 361 AACAAACTAG AGAATGATTC CCAAGTCATA GGTGGTGCCT CTGTTGGTTG ACACTCAACA 421 TGTGAACGCC ACCACATCTT GGCGTTCCCT TGGAACTGAT AGCTCACGAA CTCGACGCCA 481 AACCTCTCTA CTATCCCCAT TTTGTGTAGT AGCTCATGAC AATCAACCAG AAAATCATAA 541 GCATCTTCAG ATTCTGCACC CTTGAAGACT GGAGGTTTCA GTTTCAAGAA CTTACTGAAA 601 AGTCATGTTG AT Predicted gene structure (within gDNA segment 10823 to 16136): Exon 1 14839 15446 ( 608 n); cDNA 6 612 ( 607 n); score: 0.868 MATCH C06HBa0153O03.1-1+ SGN-E357065+ 0.868 608 0.993 C PGS_C06HBa0153O03.1-1+_SGN-E357065+ (14839 15446) Alignment (genomic DNA sequence = upper lines): CTGTCCTCTA GAGTAAGAAC CATTAAACTC ACCTCCCTTT CGAAGAATTT TTGATGACAT 14898 |||||||||| || ||||||| ||||| |||| |||||| || | || | | | |||| | CTGTCCTCTA GAATAAGAAC CATTATACTC TCCTCCCCTT CTAAACCTCT TCGATGTCGA 65 TGCCATGGTG AAGTCATCTG GCTTCACTCC TTCTACCTCT ATCACGAAGT CTACCTCTTC 14958 || | ||||| |||||||| | | |||||||| ||| ||||| ||||| |||| ||||| | || TGTCGTGGTG AAGTCATCCG GTTTCACTCC TTCCACCTCA ATCACAAAGT CTACCACCTC 125 TTGAAAAGAT TTTGCCGTAG CCGCTACCTG TAAGGCTGAA ATCCGCAATT CTGACCTTAA 15018 |||||| ||| |||| || | |||||| ||| |||||||||| |||||||||| |||| || || TTGAAAGGAT GTTGCTGTTG CCGCTATCTG TAAGGCTGAA ATCCGCAATT CTGATCTCAA 185 CCCCTTCATA AAATGGTGAA TCCGCTCTTG TGGACTGAAA TAAAGTTGGG TGGCATACCT 15078 |||||||| | ||| || | | |||| ||||| |||||| ||| | ||||||| |||| || || CCCCTTCACA AAACGGCGGA TCCGTTCTTG TGGACTAAAA CAGAGTTGGG TGGCGTATCT 245 GGATAATGCA CGAAACTTAG CCTCATATGC GGTAACCGAC ATCCTACCTT GCTCTAGGCT 15138 ||||| ||| ||||| |||| ||||||| || | | || | | ||||| || | | || || || GGATAGTGCG CGAAATTTAG CCTCATACGC GTTTACTGTC ATCCTTCCCT GTTCAAGACT 305 CAAGAACTCA TCTCTTTTCC TATCCCTCAA AGTACGGGGG ATATACTTCT CCATAAACAA 15198 |||||||||| || ||||||| |||| ||||| ||| | ||| ||||| |||| |||| ||||| CAAGAACTCA TCCCTTTTCC TATCTCTCAA AGTCCTTGGG ATATATTTCT CCATGAACAA 365 GCTAGAGAAT GATGCCCAAG TCATAGGTGG TGCCTCTGTT GGTTGACACT CAACATGTGA 15258 ||||||||| ||| |||||| |||||||||| |||||||||| |||||||||| |||||||||| ACTAGAGAAT GATTCCCAAG TCATAGGTGG TGCCTCTGTT GGTTGACACT CAACATGTGA 425 CCGCCACCAC ATTTTGGCGT TCCCTTGGAA CTGATAAGTC ACAAACTCAA CACCAAACCG 15318 ||||||||| || ||||||| |||||||||| |||||| || || ||||| | | ||||||| ACGCCACCAC ATCTTGGCGT TCCCTTGGAA CTGATAGCTC ACGAACTCGA CGCCAAACCT 485 TTCTACTATA CCCATCTTGT GTAGTAGCTC ATGACAGTCA ACCAGAAAAT CGTAGGCATC 15378 |||||||| ||||| |||| |||||||||| |||||| ||| |||||||||| | || ||||| CTCTACTATC CCCATTTTGT GTAGTAGCTC ATGACAATCA ACCAGAAAAT CATAAGCATC 545 CTCAGATTCA GCACCCTTGA AGACTGGAGG TTTCAATTTC AAGAACTTAC TGAAAAGTTC 15438 |||||||| |||||||||| |||||||||| ||||| |||| |||||||||| ||||||| || TTCAGATTCT GCACCCTTGA AGACTGGAGG TTTCAGTTTC AAGAACTTAC TGAAAAG-TC 604 ATGCTGAT 15446 ||| |||| ATGTTGAT 612 hqPGS_C06HBa0153O03.1-1+_SGN-E357065+ (14839 15446) ******************************************************************************** EST sequence 19 +strand 524 n (File: SGN-E352365+) 1 GGGGTTCTGT CCTCTAGAAT AAGAACCATT ATACTCTCCT CCCCTTCTAA ACCTCTTCGA 61 TGTCGATGTC GTGGTGAAGT CATCCGGTTT CACTCCTTCC ACCTCAATCA CAAAGTCTAC 121 CACCTCTTGA AAGGATGTTG CTGTTGCCGC TATCTGTAAG GCTGAAATCC GCAATTCTGA 181 TCTCAACCCC TTCACAAAAC GGCGGATCCG TTCTTGTGGA CTAAAACAGA GTTGGGTGGC 241 GTATCTGGAT AGTGCGCGAA ATTTAGCCTC ATACGCGTTT ACTGTCATCC TTCCCTGTTC 301 AAGACTCAAG AACTCATCCC TTTTCCTATC TCTCAAAGTC CTTGGGATAT ATTTCTCCAT 361 GAACAAACTA GAGAATGATT CCCAAGTCAT AGGTGGTGCC TCTGTTGGTT GACACTCAAC 421 ATGTGAACGC CACCACATCT TGGCGTTCCC TTGGAACTGA TAGCTCACGA ACTCGACGCC 481 AAACCTCTCT ACTATCCCCA TTTTGTGTAG TAGCTCATGA CAAT Predicted gene structure (within gDNA segment 10813 to 16524): Exon 1 14839 15354 ( 516 n); cDNA 7 522 ( 516 n); score: 0.860 MATCH C06HBa0153O03.1-1+ SGN-E352365+ 0.860 516 0.985 C PGS_C06HBa0153O03.1-1+_SGN-E352365+ (14839 15354) Alignment (genomic DNA sequence = upper lines): CTGTCCTCTA GAGTAAGAAC CATTAAACTC ACCTCCCTTT CGAAGAATTT TTGATGACAT 14898 |||||||||| || ||||||| ||||| |||| |||||| || | || | | | |||| | CTGTCCTCTA GAATAAGAAC CATTATACTC TCCTCCCCTT CTAAACCTCT TCGATGTCGA 66 TGCCATGGTG AAGTCATCTG GCTTCACTCC TTCTACCTCT ATCACGAAGT CTACCTCTTC 14958 || | ||||| |||||||| | | |||||||| ||| ||||| ||||| |||| ||||| | || TGTCGTGGTG AAGTCATCCG GTTTCACTCC TTCCACCTCA ATCACAAAGT CTACCACCTC 126 TTGAAAAGAT TTTGCCGTAG CCGCTACCTG TAAGGCTGAA ATCCGCAATT CTGACCTTAA 15018 |||||| ||| |||| || | |||||| ||| |||||||||| |||||||||| |||| || || TTGAAAGGAT GTTGCTGTTG CCGCTATCTG TAAGGCTGAA ATCCGCAATT CTGATCTCAA 186 CCCCTTCATA AAATGGTGAA TCCGCTCTTG TGGACTGAAA TAAAGTTGGG TGGCATACCT 15078 |||||||| | ||| || | | |||| ||||| |||||| ||| | ||||||| |||| || || CCCCTTCACA AAACGGCGGA TCCGTTCTTG TGGACTAAAA CAGAGTTGGG TGGCGTATCT 246 GGATAATGCA CGAAACTTAG CCTCATATGC GGTAACCGAC ATCCTACCTT GCTCTAGGCT 15138 ||||| ||| ||||| |||| ||||||| || | | || | | ||||| || | | || || || GGATAGTGCG CGAAATTTAG CCTCATACGC GTTTACTGTC ATCCTTCCCT GTTCAAGACT 306 CAAGAACTCA TCTCTTTTCC TATCCCTCAA AGTACGGGGG ATATACTTCT CCATAAACAA 15198 |||||||||| || ||||||| |||| ||||| ||| | ||| ||||| |||| |||| ||||| CAAGAACTCA TCCCTTTTCC TATCTCTCAA AGTCCTTGGG ATATATTTCT CCATGAACAA 366 GCTAGAGAAT GATGCCCAAG TCATAGGTGG TGCCTCTGTT GGTTGACACT CAACATGTGA 15258 ||||||||| ||| |||||| |||||||||| |||||||||| |||||||||| |||||||||| ACTAGAGAAT GATTCCCAAG TCATAGGTGG TGCCTCTGTT GGTTGACACT CAACATGTGA 426 CCGCCACCAC ATTTTGGCGT TCCCTTGGAA CTGATAAGTC ACAAACTCAA CACCAAACCG 15318 ||||||||| || ||||||| |||||||||| |||||| || || ||||| | | ||||||| ACGCCACCAC ATCTTGGCGT TCCCTTGGAA CTGATAGCTC ACGAACTCGA CGCCAAACCT 486 TTCTACTATA CCCATCTTGT GTAGTAGCTC ATGACA 15354 |||||||| ||||| |||| |||||||||| |||||| CTCTACTATC CCCATTTTGT GTAGTAGCTC ATGACA 522 hqPGS_C06HBa0153O03.1-1+_SGN-E352365+ (14839 15354) ******************************************************************************** EST sequence 88 +strand 720 n (File: SGN-E356614+) 1 GCGAATCCGC TCTTGAGGAC TGAAACAGAG TTGAGTGGCA TATCTGGATA GTGCACGAAA 61 CTTAGCCTCA TAAGCATTAA CTGACACCCT ACCTTGCTCT AGGCTCAAGA ACTCATCCCT 121 TTTCTTATCC CTCAAAGTCC GGGGGATATA CTTCTCCATA AACAAACTAT AGAATGAGGC 181 CCAAGTCATA GGTGATGCCT CTATTGGTTG ACACTCGATA TGTGACCGCC ACCACATTTG 241 GGCGTTCCCT TGAAACTGAT AAGTCACAAA CTCCACACCA AACCGTTCTA CTATACCCAT 301 CTTGTGTAGT AGCTCGTGAC AGTCAACCAG AAAATCATAA GCATCCTCCG ATTCAGCACC 361 CTTGAAGACC GGAGGTTTCA ATTTCAAGAA CTTACTGAAA AGTTCATGCT GATCATTTGT 421 CATTATAGGG ACAGTAGTCA AACGTGGAAA TGTGCCTATG TCCAATGGAA CATCCATGCG 481 GGGAGCCATA GTAGCCGCAT GTTGTACTTC TGAAACCGGA GGTGTTGGCG CAGAAAACAC 541 TGGAGGTGCT TGACCTTGAT CAGATAACCC GCTAAGATAA GCCAGAACCT GATTGATCAT 601 CTCTGGGGTA GGTTGGGGTG GCATTTCCTC ATTTTGCACT TGNTCAGTTT CCCCATCCCT 661 CCCTTTCTCT ATTACTTCCT CAGTCAGTGG AAGAGTCACT GCCCTAGTAT CAGATGGGCT Predicted gene structure (within gDNA segment 13975 to 17254): Exon 1 15036 15752 ( 717 n); cDNA 3 719 ( 717 n); score: 0.899 MATCH C06HBa0153O03.1-1+ SGN-E356614+ 0.899 717 0.996 C PGS_C06HBa0153O03.1-1+_SGN-E356614+ (15036 15752) Alignment (genomic DNA sequence = upper lines): GAATCCGCTC TTGTGGACTG AAATAAAGTT GGGTGGCATA CCTGGATAAT GCACGAAACT 15095 |||||||||| ||| |||||| ||| | |||| | |||||||| ||||||| | |||||||||| GAATCCGCTC TTGAGGACTG AAACAGAGTT GAGTGGCATA TCTGGATAGT GCACGAAACT 62 TAGCCTCATA TGCGGTAACC GACATCCTAC CTTGCTCTAG GCTCAAGAAC TCATCTCTTT 15155 |||||||||| || |||| |||| ||||| |||||||||| |||||||||| ||||| |||| TAGCCTCATA AGCATTAACT GACACCCTAC CTTGCTCTAG GCTCAAGAAC TCATCCCTTT 122 TCCTATCCCT CAAAGTACGG GGGATATACT TCTCCATAAA CAAGCTAGAG AATGATGCCC 15215 || ||||||| |||||| ||| |||||||||| |||||||||| ||| ||| || ||||| |||| TCTTATCCCT CAAAGTCCGG GGGATATACT TCTCCATAAA CAAACTATAG AATGAGGCCC 182 AAGTCATAGG TGGTGCCTCT GTTGGTTGAC ACTCAACATG TGACCGCCAC CACATTTTGG 15275 |||||||||| || ||||||| ||||||||| |||| | ||| |||||||||| ||||||| || AAGTCATAGG TGATGCCTCT ATTGGTTGAC ACTCGATATG TGACCGCCAC CACATTTGGG 242 CGTTCCCTTG GAACTGATAA GTCACAAACT CAACACCAAA CCGTTCTACT ATACCCATCT 15335 |||||||||| ||||||||| |||||||||| | |||||||| |||||||||| |||||||||| CGTTCCCTTG AAACTGATAA GTCACAAACT CCACACCAAA CCGTTCTACT ATACCCATCT 302 TGTGTAGTAG CTCATGACAG TCAACCAGAA AATCGTAGGC ATCCTCAGAT TCAGCACCCT 15395 |||||||||| ||| |||||| |||||||||| |||| || || |||||| ||| |||||||||| TGTGTAGTAG CTCGTGACAG TCAACCAGAA AATCATAAGC ATCCTCCGAT TCAGCACCCT 362 TGAAGACTGG AGGTTTCAAT TTCAAGAACT TACTGAAAAG TTCATGCTGA TCATTTGTCA 15455 ||||||| || |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGAAGACCGG AGGTTTCAAT TTCAAGAACT TACTGAAAAG TTCATGCTGA TCATTTGTCA 422 TTATAGGCCC TGTAGTTAGA CGAGGAAACA TGTCTATTTC CAATGAGGCA TCCATGCGGG 15515 ||||||| | ||||| | | || ||||| || |||| || ||||| || |||||||||| TTATAGGGAC AGTAGTCAAA CGTGGAAATG TGCCTATGTC CAATGGAACA TCCATGCGGG 482 GAGCCACAGT AGCCGCATGT TGTACCTCCG GAGCCTGAGG TGCTGGTGTA GAAAACACTG 15575 |||||| ||| |||||||||| ||||| || | | || |||| || ||| | | |||||||||| GAGCCATAGT AGCCGCATGT TGTACTTCTG AAACCGGAGG TGTTGGCGCA GAAAACACTG 542 GAGGCGCTTG GCCTTGATCA CATAACCCGC TAAGATAAGC AAGAACCTGA TTGATCATCT 15635 |||| ||||| ||||||||| ||||||||| |||||||||| ||||||||| |||||||||| GAGGTGCTTG ACCTTGATCA GATAACCCGC TAAGATAAGC CAGAACCTGA TTGATCATCT 602 CTAGGGTAGG TTGGGGTGGT AATTCCTCAT TCTGTACTTG TTCATTTTCC CCAT-CCTCC 15694 || ||||||| ||||||||| | |||||||| | || ||||| ||| ||||| |||| ||||| CTGGGGTAGG TTGGGGTGGC ATTTCCTCAT TTTGCACTTG NTCAGTTTCC CCATCCCTCC 662 CCTTCTCTTA CTACTTCCTC AGTCGGTGGA GGAGTCACCG CCCTAGTACC AGATAGGC 15752 | ||||| || ||||||||| |||| ||||| ||||||| | |||||||| | |||| ||| CTTTCTC-TA TTACTTCCTC AGTCAGTGGA AGAGTCACTG CCCTAGTATC AGATGGGC 719 hqPGS_C06HBa0153O03.1-1+_SGN-E356614+ (15036 15752) ******************************************************************************** EST sequence 18 +strand 664 n (File: SGN-E352401+) 1 AATCCGCTCT TGAGGACTGA AACAGAGTTG AGTGGCATAT CTGGATAGTG CACGAAACTT 61 AGCCTCATAA GCATTAACTG ACACCCTACC TTGCTCTAGG CTCAAGAACT CATCCCTTTT 121 CTTATCCCTC AAAGTCCGGG GGATATACTT CTCCATAAAC AAACTATAGA ATGAGGCCCA 181 AGTCATAGGT GATGCCTCTA TTGGTTGACA CTCGATATGT GACCGCCACC ACATTTGGGC 241 GTTCCCTTGA AACTGATAAG TCACAAACTC CACACCAAAC CGTTCTACTA TACCCATCTT 301 GTGTAGTAGC TCGTGACAGT CAACCAGAAA ATCATAAGCA TCCTCCGATT CAGCACCCTT 361 GAAGACCGGA GGTTTCAATT TCAAGAACTT ACTGAAAAGT TCATGCTGAT CATTTGTCAT 421 TATAGGGACA GTAGTCAAAC GTGGAAATGT GCCTATGTCC AATGGAACAT CCATGCGGGG 481 AGCCATAGTA GCCGCATGTT GTACTTCTGA AACCGGAGGT GTTGGCGCAG AAAACACTGG 541 AGGTGCTTGA CCTTGATCAG ATAACCCGCT AAGATAAGCC AGAACCTGAT TGATCATCTC 601 TGGGGTAGGT TGGGGTGCCA TTTCCCTCAT TTGCACTTGT TCAGTTTCCC CATCCTCCCC 661 TTCT Predicted gene structure (within gDNA segment 14005 to 16877): Exon 1 15037 15700 ( 664 n); cDNA 1 664 ( 664 n); score: 0.903 MATCH C06HBa0153O03.1-1+ SGN-E352401+ 0.903 664 1.000 C PGS_C06HBa0153O03.1-1+_SGN-E352401+ (15037 15700) Alignment (genomic DNA sequence = upper lines): AATCCGCTCT TGTGGACTGA AATAAAGTTG GGTGGCATAC CTGGATAATG CACGAAACTT 15096 |||||||||| || ||||||| || | ||||| |||||||| ||||||| || |||||||||| AATCCGCTCT TGAGGACTGA AACAGAGTTG AGTGGCATAT CTGGATAGTG CACGAAACTT 60 AGCCTCATAT GCGGTAACCG ACATCCTACC TTGCTCTAGG CTCAAGAACT CATCTCTTTT 15156 ||||||||| || |||| | ||| |||||| |||||||||| |||||||||| |||| ||||| AGCCTCATAA GCATTAACTG ACACCCTACC TTGCTCTAGG CTCAAGAACT CATCCCTTTT 120 CCTATCCCTC AAAGTACGGG GGATATACTT CTCCATAAAC AAGCTAGAGA ATGATGCCCA 15216 | |||||||| ||||| |||| |||||||||| |||||||||| || ||| ||| |||| ||||| CTTATCCCTC AAAGTCCGGG GGATATACTT CTCCATAAAC AAACTATAGA ATGAGGCCCA 180 AGTCATAGGT GGTGCCTCTG TTGGTTGACA CTCAACATGT GACCGCCACC ACATTTTGGC 15276 |||||||||| | ||||||| |||||||||| ||| | |||| |||||||||| |||||| ||| AGTCATAGGT GATGCCTCTA TTGGTTGACA CTCGATATGT GACCGCCACC ACATTTGGGC 240 GTTCCCTTGG AACTGATAAG TCACAAACTC AACACCAAAC CGTTCTACTA TACCCATCTT 15336 ||||||||| |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| GTTCCCTTGA AACTGATAAG TCACAAACTC CACACCAAAC CGTTCTACTA TACCCATCTT 300 GTGTAGTAGC TCATGACAGT CAACCAGAAA ATCGTAGGCA TCCTCAGATT CAGCACCCTT 15396 |||||||||| || ||||||| |||||||||| ||| || ||| ||||| |||| |||||||||| GTGTAGTAGC TCGTGACAGT CAACCAGAAA ATCATAAGCA TCCTCCGATT CAGCACCCTT 360 GAAGACTGGA GGTTTCAATT TCAAGAACTT ACTGAAAAGT TCATGCTGAT CATTTGTCAT 15456 |||||| ||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAAGACCGGA GGTTTCAATT TCAAGAACTT ACTGAAAAGT TCATGCTGAT CATTTGTCAT 420 TATAGGCCCT GTAGTTAGAC GAGGAAACAT GTCTATTTCC AATGAGGCAT CCATGCGGGG 15516 |||||| | ||||| | || | ||||| | | |||| ||| |||| ||| |||||||||| TATAGGGACA GTAGTCAAAC GTGGAAATGT GCCTATGTCC AATGGAACAT CCATGCGGGG 480 AGCCACAGTA GCCGCATGTT GTACCTCCGG AGCCTGAGGT GCTGGTGTAG AAAACACTGG 15576 ||||| |||| |||||||||| |||| || | | || ||||| | ||| | || |||||||||| AGCCATAGTA GCCGCATGTT GTACTTCTGA AACCGGAGGT GTTGGCGCAG AAAACACTGG 540 AGGCGCTTGG CCTTGATCAC ATAACCCGCT AAGATAAGCA AGAACCTGAT TGATCATCTC 15636 ||| ||||| ||||||||| |||||||||| ||||||||| |||||||||| |||||||||| AGGTGCTTGA CCTTGATCAG ATAACCCGCT AAGATAAGCC AGAACCTGAT TGATCATCTC 600 TAGGGTAGGT TGGGGTGGTA ATT-CCTCAT TCTGTACTTG TTCATTTTCC CCATCCTCCC 15695 | |||||||| ||||||| | || |||||| | || ||||| |||| ||||| |||||||||| TGGGGTAGGT TGGGGTGCCA TTTCCCTCAT T-TGCACTTG TTCAGTTTCC CCATCCTCCC 659 CTTCT 15700 ||||| CTTCT 664 hqPGS_C06HBa0153O03.1-1+_SGN-E352401+ (15037 15700) ******************************************************************************** EST sequence 201 -strand 554 n (File: SGN-E329287-) 1 AGAATGATGC CCAAGTCATA CGTGGTGCCT CTGTTGGTTG ACACTCAACA TGTGACCGCC 61 ACCACATTTT GGCGTTCCCT TGAAACTGAT AAGTCACAAA CTCAACACCA AACCGTTCTA 121 CTATACCCAT CTTGTGTAGT AGCTCATGAC AGTCAACCAG AAAATCGTAG GCATCCTCAG 181 ATTCAACACC CTTGTAGACT TGAGGTTTCA ATTTCAAGAA CTTACTAAAA AGTTCATGCT 241 GATCATTTGT CATTATAGGC CCTGTAGTCA GACGGGGAAA CGTTCCTATT TCCAATGAGG 301 TATCGATGCG GGGAGTCACA GTAGGCGCAT GTTGTACCTC CGGAGCCTGA GGTGCTAGTG 361 CAGAAAACAC TGGAGGTGCT TGGCCTTGAT CAGATAACCC GCTAAGGTAA GCAAGAACCT 421 GATTGATCAT CTCTTGGGTA GGTTGGGGTG GCAATTCCTC ATTCTGCACT TGTTCATTCT 481 CCCCATCCTC ACCCTCTCTT ACCACTTCCT CAGTTGGTGG AGGTGTCACC GCCTTAGTAC 541 CAGATGGGCT AGGT Predicted gene structure (within gDNA segment 14594 to 17284): Exon 1 15204 15756 ( 553 n); cDNA 1 553 ( 553 n); score: 0.942 MATCH C06HBa0153O03.1-1+ SGN-E329287- 0.942 553 0.998 C PGS_C06HBa0153O03.1-1+_SGN-E329287- (15204 15756) Alignment (genomic DNA sequence = upper lines): AGAATGATGC CCAAGTCATA GGTGGTGCCT CTGTTGGTTG ACACTCAACA TGTGACCGCC 15263 |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| AGAATGATGC CCAAGTCATA CGTGGTGCCT CTGTTGGTTG ACACTCAACA TGTGACCGCC 60 ACCACATTTT GGCGTTCCCT TGGAACTGAT AAGTCACAAA CTCAACACCA AACCGTTCTA 15323 |||||||||| |||||||||| || ||||||| |||||||||| |||||||||| |||||||||| ACCACATTTT GGCGTTCCCT TGAAACTGAT AAGTCACAAA CTCAACACCA AACCGTTCTA 120 CTATACCCAT CTTGTGTAGT AGCTCATGAC AGTCAACCAG AAAATCGTAG GCATCCTCAG 15383 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTATACCCAT CTTGTGTAGT AGCTCATGAC AGTCAACCAG AAAATCGTAG GCATCCTCAG 180 ATTCAGCACC CTTGAAGACT GGAGGTTTCA ATTTCAAGAA CTTACTGAAA AGTTCATGCT 15443 ||||| |||| |||| ||||| ||||||||| |||||||||| |||||| ||| |||||||||| ATTCAACACC CTTGTAGACT TGAGGTTTCA ATTTCAAGAA CTTACTAAAA AGTTCATGCT 240 GATCATTTGT CATTATAGGC CCTGTAGTTA GACGAGGAAA CATGTCTATT TCCAATGAGG 15503 |||||||||| |||||||||| |||||||| | |||| ||||| | | ||||| |||||||||| GATCATTTGT CATTATAGGC CCTGTAGTCA GACGGGGAAA CGTTCCTATT TCCAATGAGG 300 CATCCATGCG GGGAGCCACA GTAGCCGCAT GTTGTACCTC CGGAGCCTGA GGTGCTGGTG 15563 ||| ||||| ||||| |||| |||| ||||| |||||||||| |||||||||| |||||| ||| TATCGATGCG GGGAGTCACA GTAGGCGCAT GTTGTACCTC CGGAGCCTGA GGTGCTAGTG 360 TAGAAAACAC TGGAGGCGCT TGGCCTTGAT CACATAACCC GCTAAGATAA GCAAGAACCT 15623 ||||||||| |||||| ||| |||||||||| || ||||||| |||||| ||| |||||||||| CAGAAAACAC TGGAGGTGCT TGGCCTTGAT CAGATAACCC GCTAAGGTAA GCAAGAACCT 420 GATTGATCAT CTCTAGGGTA GGTTGGGGTG GTAATTCCTC ATTCTGTACT TGTTCATTTT 15683 |||||||||| |||| ||||| |||||||||| | |||||||| |||||| ||| |||||||| | GATTGATCAT CTCTTGGGTA GGTTGGGGTG GCAATTCCTC ATTCTGCACT TGTTCATTCT 480 CCCCATCCTC CCCTTCTCTT ACTACTTCCT CAGTCGGTGG AGGAGTCACC GCCCTAGTAC 15743 |||||||||| || |||||| || ||||||| |||| ||||| ||| |||||| ||| |||||| CCCCATCCTC ACCCTCTCTT ACCACTTCCT CAGTTGGTGG AGGTGTCACC GCCTTAGTAC 540 CAGATAGGCC AGG 15756 ||||| ||| ||| CAGATGGGCT AGG 553 hqPGS_C06HBa0153O03.1-1+_SGN-E329287- (15204 15756) ******************************************************************************** EST sequence 49 +strand 433 n (File: SGN-E352180+) 1 CCCTTGAAGA CCGGAGGTTT CAATTTCAAG AACTTACTGA AAAGTTCATG CTGATCATTT 61 GTCATTATAG GCCCAGTAGT CAAACGTGGA AATGTACCTA TGTGCAATGG AACATCCATG 121 CGGGGAGCCA TAGTAGCCGC ATGTTGTACT TCTGAAACCG GAGGTGTTGG CGCAGAAAAC 181 ACTGGAGGTG CTTGACCTTG ATCAGATAAA CCGCTAAGAT AAGCCAGAAC CTGATTGATC 241 ATCTCTGGGG TAGGTTGGGG TGGCATTTCC TCATTTTGCA CTTGTTCAGT TTCCCCATCC 301 TCCCCTTCTC TTATTACTTC CTCAGTCGGT GGAGGAGTCA CCGCCCTAGT ACCAGATGGG 361 CTCGGTTCTC GTTCTCTTCC TCTAGATGAC GTCCTCCCAC GACCTCTACC ACGGCCCCTT 421 GCCGCTGTTC TCC Predicted gene structure (within gDNA segment 14684 to 17118): Exon 1 15392 15821 ( 430 n); cDNA 1 430 ( 430 n); score: 0.884 MATCH C06HBa0153O03.1-1+ SGN-E352180+ 0.884 430 0.993 C PGS_C06HBa0153O03.1-1+_SGN-E352180+ (15392 15821) Alignment (genomic DNA sequence = upper lines): CCCTTGAAGA CTGGAGGTTT CAATTTCAAG AACTTACTGA AAAGTTCATG CTGATCATTT 15451 |||||||||| | |||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCCTTGAAGA CCGGAGGTTT CAATTTCAAG AACTTACTGA AAAGTTCATG CTGATCATTT 60 GTCATTATAG GCCCTGTAGT TAGACGAGGA AACATGTCTA TTTCCAATGA GGCATCCATG 15511 |||||||||| |||| ||||| | ||| ||| || | ||| | | ||||| |||||||| GTCATTATAG GCCCAGTAGT CAAACGTGGA AATGTACCTA TGTGCAATGG AACATCCATG 120 CGGGGAGCCA CAGTAGCCGC ATGTTGTACC TCCGGAGCCT GAGGTGCTGG TGTAGAAAAC 15571 |||||||||| ||||||||| ||||||||| || | | || |||||| ||| | ||||||| CGGGGAGCCA TAGTAGCCGC ATGTTGTACT TCTGAAACCG GAGGTGTTGG CGCAGAAAAC 180 ACTGGAGGCG CTTGGCCTTG ATCACATAAC CCGCTAAGAT AAGCAAGAAC CTGATTGATC 15631 |||||||| | |||| ||||| |||| |||| |||||||||| |||| ||||| |||||||||| ACTGGAGGTG CTTGACCTTG ATCAGATAAA CCGCTAAGAT AAGCCAGAAC CTGATTGATC 240 ATCTCTAGGG TAGGTTGGGG TGGTAATTCC TCATTCTGTA CTTGTTCATT TTCCCCATCC 15691 |||||| ||| |||||||||| ||| | |||| ||||| || | |||||||| | |||||||||| ATCTCTGGGG TAGGTTGGGG TGGCATTTCC TCATTTTGCA CTTGTTCAGT TTCCCCATCC 300 TCCCCTTCTC TTACTACTTC CTCAGTCGGT GGAGGAGTCA CCGCCCTAGT ACCAGATAGG 15751 |||||||||| ||| |||||| |||||||||| |||||||||| |||||||||| ||||||| || TCCCCTTCTC TTATTACTTC CTCAGTCGGT GGAGGAGTCA CCGCCCTAGT ACCAGATGGG 360 CCAGGCGTTC ATCCTCTTCC TCTAGAAGAC GTCCTCCCGC GACCTCTACC GCGGCCTCTT 15811 | || || | ||||||| |||||| ||| |||||||| | |||||||||| ||||| ||| CTCGGTTCTC GTTCTCTTCC TCTAGATGAC GTCCTCCCAC GACCTCTACC ACGGCCCCTT 420 GCTACTGCTC 15821 || ||| || GCCGCTGTTC 430 hqPGS_C06HBa0153O03.1-1+_SGN-E352180+ (15392 15821) ******************************************************************************** EST sequence 6 -strand 679 n (File: SGN-E370357-) 1 TTGGAGGTGG TGTTGCCATG GTGGATTGGG GGAAAAATGG AATTGGACTT TGTAGGAGGA 61 GAAATGGGCA TTTACTTACA CAAATGGCCA CTATTTACAA GAAAACACAT CAGAAATTCG 121 AGGAAAATTC TGCATCCGTA GGCAAATTTG AAAAATTTTA CAAATGGCAG AGGTGTAATC 181 AATCAATAAT ATTTGGGTGA CAGCCGAATG ATTAATTGAC ACTCGAATAA AGCAAACGTA 241 CCGTCGTCTT CAACTCAACC GCAACTCTAG CCAGTCTTCA TTATACCGGA TTTCAGTGTG 301 AGCTAACGCT TCTAGCTTGG ACTGGATCTT CTTCTTCATG TCTTGATGCC TTGAAGTTCC 361 GGCATGGACT AGCTTTTTAG TTATTCTAGC TTTCTAGATA CTCTTAGAAT TAGTAATTTG 421 AGGATAGATG TTCTTGTGAT GATGACTTCC AGATTTTGGG GATAATAATA GTTGTTGAGT 481 TTTTAGAAGT TATTTAATTG ATTTTCATTA ATGAGTTTAA GTCTTCCGCA TTATATTCCG 541 TCATTATATT GAAATGTTGG ATTTTGTATT GGTTGGTTCG CTCACATAGG AGGGTAAGTG 601 TGGGTGCCAC TTGCGGCTCG ATTTGGGTCG TGACATTAAA GATTATCGAA TTCATAATAT 661 AATTGTAAGA AAAAAAAAA Predicted gene structure (within gDNA segment 19537 to 14879): Exon 1 18806 18790 ( 17 n); cDNA 174 189 ( 16 n); score: 0.647 Intron 1 18789 17936 ( 854 n); Pd: 0.900 (s: 0), Pa: 0.965 (s: 0) Exon 2 17935 17919 ( 17 n); cDNA 190 207 ( 18 n); score: 0.500 Intron 2 17918 17240 ( 679 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0) Exon 3 17239 17230 ( 10 n); cDNA 208 217 ( 10 n); score: 0.900 Intron 3 17229 16828 ( 402 n); Pd: 0.900 (s: 0), Pa: 0.000 (s: 0) Exon 4 16827 16813 ( 15 n); cDNA 218 232 ( 15 n); score: 0.800 Intron 4 16812 16536 ( 277 n); Pd: 0.890 (s: 0), Pa: 0.968 (s: 0.92) Exon 5 16535 16135 ( 401 n); cDNA 233 635 ( 403 n); score: 0.883 PPA cDNA 667 679 MATCH C06HBa0153O03.1-1- SGN-E370357- 0.883 460 0.677 C PGS_C06HBa0153O03.1-1-_SGN-E370357- (18806 18790,17935 17919,17239 17230,16827 16813,16535 16135) Alignment (genomic DNA sequence = upper lines): TGTAAGTTAT TACTTAAGTA TCTTTTTTTG TAAATGGAAA AGGGCTAAAA ATGCCCTTAA 18747 ||||| || | ||| TGTAATCAAT CAA-TAA... .......... .......... .......... .......... 189 CTTAGTGGAA ATGGTTCAAA ATACCATCCT TCTACCTTTT GAGTTAAAAA TACCCTCCAC 18687 .......... .......... .......... .......... .......... .......... 189 CTTTATTTTG GTTCAAAGAT GCCTTTCCTT CCACCTTTTG ATTTAATAAT ACTCTTAACC 18627 .......... .......... .......... .......... .......... .......... 189 CCCCATTTAA TTAAATTTAT AAAATAAAAA ATTCTTAATA TTAGCTCATT CCAAAATCTT 18567 .......... .......... .......... .......... .......... .......... 189 TATGATAAAT ATATCTAAAA AATAAAATAA AAAATTTATT ATATGTATAA AAAGCAAAAA 18507 .......... .......... .......... .......... .......... .......... 189 TAAAAATAAA ATTTCTCAAA GTTCTTATTC TTTGTATTAA AATAATAAGA CAATAAAAAT 18447 .......... .......... .......... .......... .......... .......... 189 CTTAAGATTC TTATTCTTCA TTTTTGCGCA AAAAAATCTT TATTTTATTT TATGTTTTAT 18387 .......... .......... .......... .......... .......... .......... 189 ACATATTATT TAATATTTTA ATTTGTGAGA AATTTTTTTA AGTTATTTGG ATTAAATTTT 18327 .......... .......... .......... .......... .......... .......... 189 TAAATTATAT TGAGAAAATG CACAAGTATT CCCTCAAACT ATGTCTGAAA TCCCAGAGAC 18267 .......... .......... .......... .......... .......... .......... 189 ACACTTATAC TATATTAAGG TCATATTACC CCCTGAACTT ATTTTATAAG TAATTTTCTA 18207 .......... .......... .......... .......... .......... .......... 189 CCCCTTTTGA CCTACGTGGC TCTAGCTTGA AAAAAAAGTC AATCAGCGTT GGACCCACAA 18147 .......... .......... .......... .......... .......... .......... 189 GATAGTGCCA CATAGACCGA AAAGGGCTAG AAAATTATTA ATAAAATAAG TTCAGGGATA 18087 .......... .......... .......... .......... .......... .......... 189 ATAGGACCTT AGTATAGTGT AAGTATGACT TTAAAATTTC AGGCATAAAT TGAGAGGGTA 18027 .......... .......... .......... .......... .......... .......... 189 CTTGTGCATT ATCTCAATAA TATTCAAATC TTTACATTAA TATCTAATTT GATGTAATAT 17967 .......... .......... .......... .......... .......... .......... 189 TTTAATAATA ATAATGTAAC GACCTATTTA GTCGTTTTGA G-CAGCAGAT TTTATTTTTG 17908 | || | | |||| || .......... .......... .......... .TATTTGGGT GACAGCCGA. .......... 207 GAAAAACTGG CTGAGACGAC GGATCCCACG ATGGACCGTC ATGGGCACGA TGGACCGTCG 17848 .......... .......... .......... .......... .......... .......... 207 AGGGGGTCTC GTTCCAAAAT ACATAGAATT CTGAAATTTG GGTTTTGAAA TCGACTCTCT 17788 .......... .......... .......... .......... .......... .......... 207 GAACTTCGTG ATGAAGTGGC AGGACGGACC GTCACAGGCA TGACGGGCCG TCACAGTCTC 17728 .......... .......... .......... .......... .......... .......... 207 TTCAGAAAAT TTCAGTCTCT GAACTCTGTG ACGGAAGCAG CAGGACGGAC CGTCGCAGGC 17668 .......... .......... .......... .......... .......... .......... 207 ACGACGACCC GTCACAGACT GCGTAATCCC AGGCTGAGTC GGATTTCTTT AAATGTTTTA 17608 .......... .......... .......... .......... .......... .......... 207 AGGGGGCGTT TTGGACTATT CCTGCTATAA TTATAAATTT AGTGGGTTAA TGTTAATAAT 17548 .......... .......... .......... .......... .......... .......... 207 TTAACTACTT GAGGGTTAAA AGAGATAACC TTGAATTAGT TAGTGGGTTA AACTCATCAT 17488 .......... .......... .......... .......... .......... .......... 207 CTTTCATACT TAATTATATG CTAATTAGGG TAAAAGAAAG AAGGTTTGAA TAAGAAAAAG 17428 .......... .......... .......... .......... .......... .......... 207 AAAAGAACAG AAAGAGAGGG AGAAACGATC GAGAGAGAGA GAGGAATGAA GAGGAAAGCA 17368 .......... .......... .......... .......... .......... .......... 207 AAGATCTTGA GGAAATTGCT TGCTTGATCA CGAATCTTCG GTGGAAGTAG GTTATGGTTT 17308 .......... .......... .......... .......... .......... .......... 207 TTATACTATT CGTAGTTAAC TCTTAATAGC GAATGATATG TGTTGGGTTG TATTGTAAAG 17248 .......... .......... .......... .......... .......... .......... 207 TCTTCTATAT GCTTAATTGT ATGCTTGCAT GAATATGATT ATATAATTGT GATGAAATAA 17188 || | |||||| ........AT GATTAATT.. .......... .......... .......... .......... 217 GCATGATGAA GCTATTGAAT CCCAAATCTT GAAAACTCCA ATCTTGAAAA CCCCTTGTTA 17128 .......... .......... .......... .......... .......... .......... 217 TTGATGATGC CTTGGTATAA AAGAAGGCTT GATGAACTAA AGTAATGAGA TTGATGATGC 17068 .......... .......... .......... .......... .......... .......... 217 CTTGGTATAA AAGAAGGCTT GATGAATTAA TAGAATGAGA TTAGTGGAGT AGGTGTCACG 17008 .......... .......... .......... .......... .......... .......... 217 AACCGACACA TAGAATTAGG GGATCGGGTG CCACGAACCG ACACGTAGAA TTAAGGGATC 16948 .......... .......... .......... .......... .......... .......... 217 GGGTGTCACG AACCGACACG TAGAATTAGG GAATTGGGTG TCACAAACTG ACACGTAGAA 16888 .......... .......... .......... .......... .......... .......... 217 TTAGGGGATC GGGTGTCACG AATCGACACG TAGAACTAGG GAATCGGAGT GTCACGTACC 16828 .......... .......... .......... .......... .......... .......... 217 GACACAAGAG TAAAGGTGAT GAATCTTGAA AGATGTTAAT ATACTCAATC TAATGAACCT 16768 ||||| || ||||| GACACTCGAA TAAAG..... .......... .......... .......... .......... 232 AAGTCCCAAA TGAGTATGGT ATTGAGGCTT GAGTCCTCAT GAGTGTACTT GACGTTATTT 16708 .......... .......... .......... .......... .......... .......... 232 ATCAAAGATT CTTGTACTTG TTGCTACATG TTGAGTAATG TAGTTGATTT TATATTATTA 16648 .......... .......... .......... .......... .......... .......... 232 CTTGATATAT ATTGTTTTCT ATTTTGAGTT GGCCGATGAT ATCTACTCAG TACCCATGTT 16588 .......... .......... .......... .......... .......... .......... 232 TTGTACTGAC CCCTACTTGT ATGTTTCTTT CCTTGTTATT TGTGGAGTGC AGCAAACGTG 16528 ||||||| .......... .......... .......... .......... .......... ..CAAACGTA 240 CCGTCGTCTT CAACTCAACC GCAACTCTAG CAAGACTTCA TAACACCGGA TTTCAGGGTG 16468 |||||||||| |||||||||| |||||||||| | || ||||| | | |||||| |||||| ||| CCGTCGTCTT CAACTCAACC GCAACTCTAG CCAGTCTTCA TTATACCGGA TTTCAGTGTG 300 AGCT-ACACT TCTAGCTTGA ACTGGATCTT CTTGTTCATG TCTTGATGCC TTGAAGTTCC 16409 |||| || || ||||||||| |||||||||| ||| |||||| |||||||||| |||||||||| AGCTAACGCT TCTAGCTTGG ACTGGATCTT CTTCTTCATG TCTTGATGCC TTGAAGTTCC 360 AGCATGGACT AGCTTTTTAT TTATTCTAGC TTTCTAGATA CTCTTAGCTT TAGTAATTTG 16349 ||||||||| ||||||||| |||||||||| |||||||||| ||||||| | |||||||||| GGCATGGACT AGCTTTTTAG TTATTCTAGC TTTCTAGATA CTCTTAGAAT TAGTAATTTG 420 AGGATAGATG TTCTTGTGAT GATGACTTCC AGATTTTGGG GATAATGATA AGT-TTGAG- 16291 |||||||||| |||||||||| |||||||||| |||||||||| |||||| ||| | ||||| AGGATAGATG TTCTTGTGAT GATGACTTCC AGATTTTGGG GATAATAATA GTTGTTGAGT 480 TTTTAGAAAG TGATT-ATTG ATTTTCATTA ATGAGTTTAA GTCTTCCGCA TTATATTATG 16232 |||||||| | || |||| |||||||||| |||||||||| |||||||||| ||||||| | TTTTAGAAGT TATTTAATTG ATTTTCATTA ATGAGTTTAA GTCTTCCGCA TTATATTCCG 540 TTAATTATGT TTGAAATGTT GGGGTTCAGA TTGGTTGGTT CGCTCACATA GTAGGATAAG 16172 | ||||| |||||||||| || || | |||||||||| |||||||||| | ||| |||| -TCATTAT-A TTGAAATGTT GGATTTTGTA TTGGTTGGTT CGCTCACATA GGAGGGTAAG 598 TGTGGGTGCC ACTCGCGACC CGTTTTGGGT CGTGACA 16135 |||||||||| ||| ||| | || ||||||| ||||||| TGTGGGTGCC ACTTGCGGCT CGATTTGGGT CGTGACA 635 hqPGS_C06HBa0153O03.1-1-_SGN-E370357- (16827 16813,16535 16135) ******************************************************************************** EST sequence 98 -strand 730 n (File: SGN-E546506-) 1 GTTCCGGTAC CAGGATAGAA TATGAGGATC GGAGTGTCAC GTTCCGACAC CAGGATAGAA 61 AATGGATCGG GTGCCACGTT CCGGTACCAG GATAGAATGA GGATCGGAGT GCCACGTTCC 121 GGCACCAGGA TAGAAATAGA GGATCGGAGT GTCACGTACC GACACAAGAG GAAGAAAGAT 181 AATGAATCTT GAAAGATAAT GAATATACTC AATCTAATGA ACTTAATTCC CAAATGAGTA 241 TGGTATTGAG GCTTGAGTCC TCATGTGTGA ACTCGGTTGT AGTTATTGAT GAATTCATGG 301 TGTTATTGCT ACATGTTGAG TATTGTAGTT GATTTACATT ATTATTGATA TATACTGTTC 361 CCTATTTTGA GTTGGCCGAT GATATCTACT CAGTACGTGT GGTTGTACTG ACCCCTACTT 421 TCATGTTTTC CTCTTTGTTA TTTGTGTAAT GCAGCAAACG TTCCGTCAAC TTCAACTCAA 481 CAGTAGATCT AGCCAGTCTT CACTTCATCG GAAAATTCAG GGTGAGCTAA TGCGTCTAGC 541 TTGGACTGGG TCTTCTTCTT CAAGTCTTGA TGCCTTGAAT TTCCGGCATG GACTAGCTTT 601 TTAGTTATTT TGTTCTTAGA CATTCCTAGT TAAGTAATTT GAGATAGATG TTCTTGTGAT 661 GATGACTTCC AGATTTTGGG GATAATAATA ATAGTATTGA ATTGTTTTTA TTAAAAAAAA 721 AAAAAAAAAA Predicted gene structure (within gDNA segment 18875 to 15263): Exon 1 16998 16959 ( 40 n); cDNA 92 130 ( 39 n); score: 0.712 Intron 1 16958 16858 ( 101 n); Pd: 0.000 (s: 0.71), Pa: 0.000 (s: 0.73) Exon 2 16857 16298 ( 560 n); cDNA 131 691 ( 561 n); score: 0.818 PPA cDNA 713 730 MATCH C06HBa0153O03.1-1- SGN-E546506- 0.818 600 0.822 C PGS_C06HBa0153O03.1-1-_SGN-E546506- (16998 16959,16857 16298) Alignment (genomic DNA sequence = upper lines): ATAGAATTAG GGGATCGG-G TGCCACGAAC CGACACGTAG AATTAAGGGA TCGGGTGTCA 16940 |||||| | | ||||||| | ||||||| | || ||| | | ATAGAA-T-G AGGATCGGAG TGCCACGTTC CGGCACCAGG A......... .......... 130 CGAACCGACA CGTAGAATTA GGGAATTGGG TGTCACAAAC TGACACGTAG AATTAGGGGA 16880 .......... .......... .......... .......... .......... .......... 130 TCGGGTGTCA CGAATCGACA CGTAGAACTA GGGAATCGGA GTGTCACGTA CCGACACAAG 16820 ||||| || | | |||||| |||||||||| |||||||||| .......... .......... ..TAGAAATA GAGGATCGGA GTGTCACGTA CCGACACAAG 168 A-GTA-AAGG -TGATGAATC TTGAAAGAT- GTTAATATAC TCAATCTAAT GAACCTAAGT 16764 | | | || | | ||||||| ||||||||| | ||||||| |||||||||| |||| ||| | AGGAAGAAAG ATAATGAATC TTGAAAGATA ATGAATATAC TCAATCTAAT GAACTTAATT 228 CCCAAATGAG TATGGTATTG AGGCTTGAGT CCTCATGAGT GTACTTGACG TTATTTATCA 16704 |||||||||| |||||||||| |||||||||| ||||||| || | ||| | || |||| CCCAAATGAG TATGGTATTG AGGCTTGAGT CCTCATGTGT GAACTCGGTT GTAGTTATTG 288 AAG-ATTCTT GTACTTGTTG CTACATGTTG AGTAATGTAG TTGATTTTAT ATTATTACTT 16645 | | |||| | | || ||| |||||||||| |||| ||||| |||| |||| ||||||| || ATGAATTCAT GGTGTTATTG CTACATGTTG AGTATTGTAG TTGA-TTTAC ATTATTA-TT 346 GATATATATT GTTTTCTATT TTGAGTTGGC CGATGATATC TACTCAGTAC CCATGTTTTG 16585 |||||||| | ||| ||||| |||||||||| |||||||||| |||||||||| || ||| GATATATACT GTTCCCTATT TTGAGTTGGC CGATGATATC TACTCAGTAC GTGTG-GTTG 405 TACTGACCCC TACTTGTATG TTTCTTTCCT TGTTATTTGT GGAGTGCAGC AAACGTGCCG 16525 |||||||||| ||||| ||| ||| || | |||||||||| | | |||||| |||||| ||| TACTGACCCC TACTTTCATG TTTTCCTCTT TGTTATTTGT GTAATGCAGC AAACGTTCCG 465 TCGTCTTCAA CTCAACCGCA ACTCTAGCAA GACTTCATAA CACCGG--AT TTCAGGGTGA 16467 || |||||| |||||| | | |||||| | | ||||| || ||| | |||||||||| TCAACTTCAA CTCAACAGTA GATCTAGCCA GTCTTCACTT CATCGGAAAA TTCAGGGTGA 525 GCT-ACACTT CTAGCTTGAA CTGGATCTTC TTGTTCATGT CTTGATGCCT TGAAGTTCCA 16408 ||| | | | |||||||| | |||| ||||| || |||| || |||||||||| |||| |||| GCTAATGCGT CTAGCTTGGA CTGGGTCTTC TTCTTCAAGT CTTGATGCCT TGAATTTCCG 585 GCATGGACTA GCTTTTTATT TATTCTAGCT TTCTAGATAC TCTTAGCTTT AGTAATTTGA 16348 |||||||||| |||||||| | |||| | | | | |||| | || ||| || |||||||||| GCATGGACTA GCTTTTTAGT TATT-TTG-T TCTTAGACAT TCCTAG-TTA AGTAATTTGA 642 GGATAGATGT TCTTGTGATG ATGACTTCCA GATTTTGGGG ATAATGATAA 16298 ||||||||| |||||||||| |||||||||| |||||||||| ||||| |||| -GATAGATGT TCTTGTGATG ATGACTTCCA GATTTTGGGG ATAATAATAA 691 hqPGS_C06HBa0153O03.1-1-_SGN-E546506- (16998 16959,16857 16298) ******************************************************************************** EST sequence 181 +strand 286 n (File: SGN-E355114+) 1 TTAGGTTCGT TGGTCTCATC ACACAAGAAC AGGTCTAGTA GAGTCTTTAG GAACGGTAGG 61 GGGACGCCTT TACTTTTCTT TGAGAGGCTA TAAGACTTTA GGAAAATTTC ACCCTTTCAT 121 TCTTTCTTTC GTGCTACTAC TTGAGTCCAA TTGGTATCTA GGCGATACAA ATTGGTATCT 181 GACCATCTTC ACTCTCTTTT GCAGATGGTT AGAACTAGAG CAACGACCAC GTCAACACCA 241 ACACCGGCCA GACAAGAAAC AACTGAGCCA GCCACTGGGG CTGTGG Predicted gene structure (within gDNA segment 16717 to 14663): Exon 1 16117 15833 ( 285 n); cDNA 1 284 ( 284 n); score: 0.902 MATCH C06HBa0153O03.1-1- SGN-E355114+ 0.902 285 0.997 C PGS_C06HBa0153O03.1-1-_SGN-E355114+ (16117 15833) Alignment (genomic DNA sequence = upper lines): TTAGGTTCGT TGGTCTCATC ACACAAGAAC GAGTCTAGTA GAGTCTGAAG GAACGGTAGG 16058 |||||||||| |||||||||| |||||||||| |||||||| |||||| || |||||||||| TTAGGTTCGT TGGTCTCATC ACACAAGAAC AGGTCTAGTA GAGTCTTTAG GAACGGTAGG 60 GGGACGCCTT TACTTTTCTT TGAGAGGCTA TAAGACTTTA GGAAAAATTC CATTCTTTCT 15998 |||||||||| |||||||||| |||||||||| |||||||||| || |||||| || ||||| GGGACGCCTT TACTTTTCTT TGAGAGGCTA TAAGACTTTA GG-AAAATTT CACCCTTTCA 119 TTCTTTCCTT TGTGCTATTA CTTGGATCCA ATTGGTATCT AGGTGATACA AATTGGTATC 15938 ||||||| || |||||| || |||| |||| |||||||||| ||| |||||| |||||||||| TTCTTTCTTT CGTGCTACTA CTTGAGTCCA ATTGGTATCT AGGCGATACA AATTGGTATC 179 TGACCATCTT CACTCTATTT CGCAGATGGT TAGAACTAGA GCAACAACCA CGCCAACATC 15878 |||||||||| |||||| ||| ||||||||| |||||||||| ||||| |||| || ||||| | TGACCATCTT CACTCTCTTT TGCAGATGGT TAGAACTAGA GCAACGACCA CGTCAACACC 239 AACATCGGCA AGACAAGATG CATCTGAGCC AGCCATTGTG ACTGT 15833 |||| |||| |||||||| || ||||||| ||||| || | |||| AACACCGGCC AGACAAGAAA CAACTGAGCC AGCCACTGGG GCTGT 284 hqPGS_C06HBa0153O03.1-1-_SGN-E355114+ (16117 15833) ******************************************************************************** EST sequence 188 +strand 481 n (File: SGN-E246710+) 1 CACAAACCGA CATATAGATT TAGGGGATCG GAGTGTCACG TACCGACACA AGAGGATTAA 61 TGAATATTGA GGGAGCGGAG TGTCACGTAC CGACACAAGA GAAATAAAGA TAATGAATCT 121 TGAAAGATGT TAATATACTC AATCTAATGA ACATGATTCC CAAATGAGTA TGGTATTGAG 181 GCTTGAGTCC TCATGTGTGA ACTTGACGGT AATTGTTAAT GATATAGTAT TTGTTGTTGC 241 TACATGTTGA GTATCATAGT TGATTTTATG ATATTACTTG GTATATATAT TGATTTCTAT 301 TTTGAGTTGG CCGATGATAT CTACTCAGTA CCCGTGTTTT GTACTGACCC CTACTTTTAT 361 GTTCTCTTCT TGTTTATTTG TGGAGTGCAG CAAACGTGCC ATCGTGTTCA ACTCAACAGT 421 AATTCAAGCC AGTCTTACTA CATCGGAAAT TCAGGGTGAG CTAATGCTTC TAGCTTGGAC 481 T Predicted gene structure (within gDNA segment 18193 to 15106): Exon 1 17011 16944 ( 68 n); cDNA 1 70 ( 70 n); score: 0.721 Intron 1 16943 16849 ( 95 n); Pd: 0.000 (s: 0.67), Pa: 0.000 (s: 0.76) Exon 2 16848 16446 ( 403 n); cDNA 71 481 ( 411 n); score: 0.823 MATCH C06HBa0153O03.1-1- SGN-E246710+ 0.808 471 0.979 C PGS_C06HBa0153O03.1-1-_SGN-E246710+ (17011 16944,16848 16446) Alignment (genomic DNA sequence = upper lines): CACGAACCGA CACATAGAAT TAGGGGATCG G-GTGCCACG AACCGACAC- GTAGAATTAA 16954 ||| |||||| || ||||| | |||||||||| | ||| |||| |||||||| || ||||| CACAAACCGA CATATAGATT TAGGGGATCG GAGTGTCACG TACCGACACA AGAGGATTAA 60 GGGATCGGGT GTCACGAACC GACACGTAGA ATTAGGGAAT TGGGTGTCAC AAACTGACAC 16894 | || | TGAATATTGA .......... .......... .......... .......... .......... 70 GTAGAATTAG GGGATCGGGT GTCACGAATC GACACGTAGA ACTAGGGAAT CGGAGTGTCA 16834 || | |||||||||| .......... .......... .......... .......... .....GGGAG CGGAGTGTCA 85 CGTACCGACA CAAGAG---T AAAGGTGATG AATCTTGAAA GATGTTAATA TACTCAATCT 16777 |||||||||| |||||| | |||| | ||| |||||||||| |||||||||| |||||||||| CGTACCGACA CAAGAGAAAT AAAGATAATG AATCTTGAAA GATGTTAATA TACTCAATCT 145 AATGAACCTA AGTCCCAAAT GAGTATGGTA TTGAGGCTTG AGTCCTCATG AGTGTACTTG 16717 ||||||| | | |||||||| |||||||||| |||||||||| |||||||||| ||| ||||| AATGAACATG ATTCCCAAAT GAGTATGGTA TTGAGGCTTG AGTCCTCATG TGTGAACTTG 205 ACGTTATTTA TCAAAGAT-T -CTTGTACTT GTTGCTACAT GTTGAGTAAT GTAGTTGATT 16659 ||| || || | || ||| | | | || |||||||||| |||||||| ||||||||| ACGGTAATTG TTAATGATAT AGTATTTGTT GTTGCTACAT GTTGAGTATC ATAGTTGATT 265 TTATATTATT ACTT-G-ATA TATATTGTTT TCTATTTTGA GTTGGCCGAT GATATCTACT 16601 |||| |||| |||| | ||| ||||||| || |||||||||| |||||||||| |||||||||| TTATGATATT ACTTGGTATA TATATTGATT TCTATTTTGA GTTGGCCGAT GATATCTACT 325 CAGTACCCAT GTTTTGTACT GACCCCTACT TGTATGTTTC TTTCCTTG-T TATTTGTGGA 16542 |||||||| | |||||||||| |||||||||| | |||| ||| | | |||| | |||||||||| CAGTACCCGT GTTTTGTACT GACCCCTACT TTTATG-TTC TCTTCTTGTT TATTTGTGGA 384 GTGCAGCAAA CGTGCCGTCG TCTTCAACTC AACCGCAACT CTAGCAAGAC TTCATAACAC 16482 |||||||||| |||||| ||| | |||||||| ||| | || | | ||| || | || || | GTGCAGCAAA CGTGCCATCG TGTTCAACTC AACAGTAATT CAAGCCAGTC TTACTACATC 444 CGGATTTCAG GGTGAGCT-A CACTTCTAGC TTGAACT 16446 | | ||||| |||||||| | |||||||| ||| ||| GGAAATTCAG GGTGAGCTAA TGCTTCTAGC TTGGACT 481 hqPGS_C06HBa0153O03.1-1-_SGN-E246710+ (17011 16944,16848 16446) ******************************************************************************** EST sequence 203 +strand 239 n (File: SGN-E391780+) 1 TCACAAACCG ACATATAGAT TTAGGGGATC GGAGTGTCAC GTACCGACAC AAGAGGATTA 61 ATGAATATTG AGGGAGCGGA GTGTCACGTA CCGACACAAG AGAAATAAAG ATAATGAATC 121 TTGAAAGATG TTAATATACT CAATCTAATG AACATGATTC CCAAATGAGT ATGGTATTGA 181 GGCTTGAGTC CTCATGTGTG AACTTGACGG TAATTGTTAA TGATATAGTA TTTGTTGAT Predicted gene structure (within gDNA segment 18203 to 15687): Exon 1 17012 16944 ( 69 n); cDNA 1 71 ( 71 n); score: 0.725 Intron 1 16943 16849 ( 95 n); Pd: 0.000 (s: 0.67), Pa: 0.000 (s: 0.76) Exon 2 16848 16685 ( 164 n); cDNA 72 237 ( 166 n); score: 0.845 MATCH C06HBa0153O03.1-1- SGN-E391780+ 0.809 233 0.975 C PGS_C06HBa0153O03.1-1-_SGN-E391780+ (17012 16944,16848 16685) Alignment (genomic DNA sequence = upper lines): TCACGAACCG ACACATAGAA TTAGGGGATC GG-GTGCCAC GAACCGACAC -GTAGAATTA 16955 |||| ||||| ||| ||||| |||||||||| || ||| ||| | |||||||| || |||| TCACAAACCG ACATATAGAT TTAGGGGATC GGAGTGTCAC GTACCGACAC AAGAGGATTA 60 AGGGATCGGG TGTCACGAAC CGACACGTAG AATTAGGGAA TTGGGTGTCA CAAACTGACA 16895 | | || | ATGAATATTG A......... .......... .......... .......... .......... 71 CGTAGAATTA GGGGATCGGG TGTCACGAAT CGACACGTAG AACTAGGGAA TCGGAGTGTC 16835 || | ||||||||| .......... .......... .......... .......... ......GGGA GCGGAGTGTC 85 ACGTACCGAC ACAAGAG--- TAAAGGTGAT GAATCTTGAA AGATGTTAAT ATACTCAATC 16778 |||||||||| ||||||| ||||| | || |||||||||| |||||||||| |||||||||| ACGTACCGAC ACAAGAGAAA TAAAGATAAT GAATCTTGAA AGATGTTAAT ATACTCAATC 145 TAATGAACCT AAGTCCCAAA TGAGTATGGT ATTGAGGCTT GAGTCCTCAT GAGTGTACTT 16718 |||||||| | | ||||||| |||||||||| |||||||||| |||||||||| | ||| |||| TAATGAACAT GATTCCCAAA TGAGTATGGT ATTGAGGCTT GAGTCCTCAT GTGTGAACTT 205 GACGTTATTT ATCAAAGATT CTTGTACTTG TTG 16685 |||| || || | || || | | ||| ||| ||| GACGGTAATT GTTAATGA-T ATAGTATTTG TTG 237 hqPGS_C06HBa0153O03.1-1-_SGN-E391780+ (17012 16944,16848 16685) ******************************************************************************** EST sequence 96 -strand 710 n (File: SGN-E392027-) 1 CAGCAAACGT GCCATCGTGT TCAACTCAAC AGTAATTCAA GCCAGTCTTA CTACATCGGA 61 AATTCAGGGT GAGCTAATGC TTCTAGCTTG GACTGGATCT TCTTCTTCAA GTCTTGATGC 121 CTTGAACTTC CGGCATGGAC TAGCTTCTTA TGTATTTTTA GCTTTTAGAC TACTCTTAGT 181 TTAGTCATTT GATCGTAGAT GTTCTTGTGG TGATGACTTC CAGATTTTGG GGAATAATAG 241 ATGTTGAATT TTAGAAGTTA ATGAATTGGT CTGTATTTAA TGAGTTTAAG TCTTCCACAT 301 TACTTTCTGT TGATATTATA TTGAAATGTT AAGGTTAGAT TGGTTGGTTC GCTCACATAG 361 GAGGGTAAGT GTGGGTGCCA GTCGCAACCC GGTTTTGGTC GTGACAAACT TGGTATCAGA 421 GCATTAGGTT CGTTGGTCTC ATCACACAAG AACGAGTCTA GTAGAGTCTT AAGGAACGGT 481 AGGGGGATGC CTTTACTTTT CCTTGAGAGG CTATAAGACT TTTGGAAAAT TCCATTCTTT 541 CTTCTTTCGT GCTATTACTT GGGTCCAATT GGTATCTAGG TGATACAAAT TGGTATCTGA 601 CCATCTTCAC TCTATTTCGC AGATGGTTAG AACTAGAGCA ACGACTACGC CAGCATCAAC 661 ACCAGCACCG GCGGGACAGG GTGCGACTGA GCCAGCCACT GGGGCTGTGG Predicted gene structure (within gDNA segment 20714 to 14613): Exon 1 16538 15875 ( 664 n); cDNA 1 660 ( 660 n); score: 0.856 MATCH C06HBa0153O03.1-1- SGN-E392027- 0.856 664 0.935 C PGS_C06HBa0153O03.1-1-_SGN-E392027- (16538 15875) Alignment (genomic DNA sequence = upper lines): CAGCAAACGT GCCGTCGTCT TCAACTCAAC CGCAACTCTA GCAAGACTTC ATAACACCGG 16479 |||||||||| ||| |||| | |||||||||| | || || | || || ||| || | | CAGCAAACGT GCCATCGTGT TCAACTCAAC AGTAATTCAA GCCAGTCTTA CTACATCGGA 60 ATTTCAGGGT GAGCT-ACAC TTCTAGCTTG AACTGGATCT TCTTGTTCAT GTCTTGATGC 16420 | |||||||| ||||| | | |||||||||| ||||||||| |||| |||| |||||||||| AATTCAGGGT GAGCTAATGC TTCTAGCTTG GACTGGATCT TCTTCTTCAA GTCTTGATGC 120 CTTGAAGTTC CAGCATGGAC TAGCTTTTTA TTTA-TTCTA GCTTTCTAGA -TACTCTTAG 16362 |||||| ||| | |||||||| |||||| ||| | || || || ||||| |||| ||||||||| CTTGAACTTC CGGCATGGAC TAGCTTCTTA TGTATTTTTA GCTTT-TAGA CTACTCTTAG 179 CTTTAGTAAT TTGAGGATAG ATGTTCTTGT GATGATGACT TCCAGATTTT GGGG-ATAAT 16303 |||||| || |||| ||| |||||||||| | |||||||| |||||||||| |||| ||||| -TTTAGTCAT TTGATCGTAG ATGTTCTTGT GGTGATGACT TCCAGATTTT GGGGAATAAT 238 GATAAGTTTG AGTTTTAGAA AGTGAT-TAT TGAT-TTTCA TTAATGAGTT TAAGTCTTCC 16245 | | | ||| | |||||||| | || || || | | | |||||||||| |||||||||| -AGATG-TTG AATTTTAGAA GTTAATGAAT TGGTCTGTAT TTAATGAGTT TAAGTCTTCC 296 GCATTATATT ATGTTAATTA TGT-TTGAAA TGTTGGGGTT CAGATTGGTT GGTTCGCTCA 16186 ||||| || |||| || | | |||||| |||| |||| ||||||||| |||||||||| ACATTACTTT CTGTTGATAT TATATTGAAA TGTTAAGGTT -AGATTGGTT GGTTCGCTCA 355 CATAGTAGGA TAAGTGTGGG TGCCACTCGC GACCCGTTTT GGGTCGTGAC AAACTTGGTA 16126 ||||| ||| |||||||||| ||||| |||| ||||| ||| ||||||||| |||||||||| CATAGGAGGG TAAGTGTGGG TGCCAGTCGC AACCCGGTTT TGGTCGTGAC AAACTTGGTA 415 TTAGAGCATT AGGTTCGTTG GTCTCATCAC ACAAGAACGA GTCTAGTAGA GTCTGAAGGA 16066 | |||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||| ||||| TCAGAGCATT AGGTTCGTTG GTCTCATCAC ACAAGAACGA GTCTAGTAGA GTCTTAAGGA 475 ACGGTAGGGG GACGCCTTTA CTTTTCTTTG AGAGGCTATA AGACTTTAGG AAAAATTCCA 16006 |||||||||| || ||||||| |||||| ||| |||||||||| ||||||| || ||||||||| ACGGTAGGGG GATGCCTTTA CTTTTCCTTG AGAGGCTATA AGACTTTTGG -AAAATTCCA 534 TTCTTTCTTT CTTTCCTTTG TGCTATTACT TGGATCCAAT TGGTATCTAG GTGATACAAA 15946 ||||||| || |||| | | |||||||||| ||| |||||| |||||||||| |||||||||| TTCTTTC-TT CTTT-C---G TGCTATTACT TGGGTCCAAT TGGTATCTAG GTGATACAAA 589 TTGGTATCTG ACCATCTTCA CTCTATTTCG CAGATGGTTA GAACTAGAGC AACAACCACG 15886 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||| || ||| TTGGTATCTG ACCATCTTCA CTCTATTTCG CAGATGGTTA GAACTAGAGC AACGACTACG 649 CCAACATCAA C 15875 ||| |||||| | CCAGCATCAA C 660 hqPGS_C06HBa0153O03.1-1-_SGN-E392027- (16538 15875) ******************************************************************************** EST sequence 157 +strand 729 n (File: SGN-E351546+) 1 TTGGCCGATG ATATCTACTC AGTAGCCGTG TTTTGTACTG ACCCCTACTT TTATGTTTTT 61 CTTCTTGGTA TTTGTGGAGT GCAGCAAACG TGCCGTCATC TTCGACTCAA CAGTAACTCA 121 AGCCAGTCTT CGTCACACCG GATCTTCAGG GTGAGCTAAC GCTTCTAGCT TGGACTGGAT 181 CTTCTCCTTC ATGTCTTGAT GCCTTGAACT TCCGGCATGG ACTAGCTTCT TATGTAATTT 241 TTTAGCTTCT TAGAAACTCT TAGAATTAGT AGTTTAAAGT AGATGTTCTT GTGATGATGA 301 CTTCCAGGTT TTGGGAATAA TAGATGTTTA ATAATAGTAG TTATTGATTT TATTAATGAG 361 TTTAAGTCTT CCGCATTACT TTCTGTTGCT ATTACATTGA AATGTTAAGG TTTAGATTGG 421 TTGGTTCGCT CACATAGGAG GGTAAGTGTG GGTGCCAGTG GCGGCCCGGA TTTGGGTCGT 481 GACAAAACTT GGTATCAGAG CATTAGGTTC GTTGGTCTCA TCACACAAGA ACAAGTCTAG 541 TAGAGTCTTA AGGAACGGTA GGGGGACGCT TTTACTTTTC CTTGAGAGGC TATAAGACTT 601 TAGGAAAACT TCACTCTTTC ATTCTTTCTT TCGTGCTACT ACTTCGAGTC AATTGGTATC 661 TAAGCGATAC GAATTGGTAT CTGACCATNC TCACTCTCTT GCCAGATGGG TAGAACTAGA 721 GCAACGACT Predicted gene structure (within gDNA segment 17471 to 14192): Exon 1 16619 15890 ( 730 n); cDNA 1 728 ( 728 n); score: 0.847 MATCH C06HBa0153O03.1-1- SGN-E351546+ 0.847 730 1.001 C PGS_C06HBa0153O03.1-1-_SGN-E351546+ (16619 15890) Alignment (genomic DNA sequence = upper lines): TTGGCCGATG ATATCTACTC AGTACCCATG TTTTGTACTG ACCCCTACTT GTATGTTTCT 16560 |||||||||| |||||||||| |||| || || |||||||||| |||||||||| ||||||| | TTGGCCGATG ATATCTACTC AGTAGCCGTG TTTTGTACTG ACCCCTACTT TTATGTTTTT 60 TTCCTTGTTA TTTGTGGAGT GCAGCAAACG TGCCGTCGTC TTCAACTCAA CCGCAACTCT 16500 | |||| || |||||||||| |||||||||| ||||||| || ||| |||||| | | ||||| CTTCTTGGTA TTTGTGGAGT GCAGCAAACG TGCCGTCATC TTCGACTCAA CAGTAACTCA 120 AGCAAGACTT CATAACACCG GAT-TTCAGG GTGAGCT-AC ACTTCTAGCT TGAACTGGAT 16442 ||| || ||| | | |||||| ||| |||||| ||||||| || ||||||||| || ||||||| AGCCAGTCTT CGTCACACCG GATCTTCAGG GTGAGCTAAC GCTTCTAGCT TGGACTGGAT 180 CTTCTTGTTC ATGTCTTGAT GCCTTGAAGT TCCAGCATGG ACTAGCTTTT TAT-T--TAT 16385 ||||| ||| |||||||||| |||||||| | ||| |||||| |||||||| | ||| | | | CTTCTCCTTC ATGTCTTGAT GCCTTGAACT TCCGGCATGG ACTAGCTTCT TATGTAATTT 240 TCTAGCTTTC TAGATACTCT TAGCTTTAGT AATTTGAGGA TAGATGTTCT TGTGATGATG 16325 | |||||| |||| ||||| ||| ||||| | ||| | | |||||||||| |||||||||| TTTAGCTTCT TAGAAACTCT TAGAATTAGT AGTTTAAAG- TAGATGTTCT TGTGATGATG 299 ACTTCCAGAT TTTGGGGATA ATGATAAGTT TGAGTTTTAG AAAGTGATTA TTGATTTTCA 16265 |||||||| | |||||| ||| || | | ||| | | || | ||| ||| |||||||| | ACTTCCAGGT TTTGGGAATA AT-AGATGTT T-A---ATA- ATAGTAGTTA TTGATTTT-A 352 TTAATGAGTT TAAGTCTTCC GCATTATATT ATGTTAAT-T ATGTTTGAAA TGTTGGGGTT 16206 |||||||||| |||||||||| |||||| || |||| | | |||||| |||| |||| TTAATGAGTT TAAGTCTTCC GCATTACTTT CTGTTGCTAT TACATTGAAA TGTTAAGGTT 412 CAGATTGGTT GGTTCGCTCA CATAGTAGGA TAAGTGTGGG TGCCACTCGC GACCC-GTTT 16147 ||||||||| |||||||||| ||||| ||| |||||||||| ||||| | || | ||| | || TAGATTGGTT GGTTCGCTCA CATAGGAGGG TAAGTGTGGG TGCCAGTGGC GGCCCGGATT 472 TGGGTCGTGA C-AAACTTGG TATTAGAGCA TTAGGTTCGT TGGTCTCATC ACACAAGAAC 16088 |||||||||| | |||||||| ||| |||||| |||||||||| |||||||||| |||||||||| TGGGTCGTGA CAAAACTTGG TATCAGAGCA TTAGGTTCGT TGGTCTCATC ACACAAGAAC 532 GAGTCTAGTA GAGTCTGAAG GAACGGTAGG GGGACGCCTT TACTTTTCTT TGAGAGGCTA 16028 ||||||||| |||||| ||| |||||||||| ||||||| || |||||||| | |||||||||| AAGTCTAGTA GAGTCTTAAG GAACGGTAGG GGGACGCTTT TACTTTTCCT TGAGAGGCTA 592 TAAGACTTTA GGAAAAATTC CATTCTTTCT TTCTTTCCTT TGTGCTATTA CTTGGATCCA 15968 |||||||||| |||||| || || |||||| ||||||| || |||||| || ||| || || TAAGACTTTA GGAAAACTT- CACTCTTTCA TTCTTTCTTT CGTGCTACTA CTTCGAGTCA 651 ATTGGTATCT AGGTGATACA AATTGGTATC TGACCATCTT CACTCTATTT CGCAGATGGT 15908 |||||||||| | | ||||| |||||||||| ||||||| | |||||| || | ||||||| ATTGGTATCT AAGCGATACG AATTGGTATC TGACCATNCT CACTCTCTTG C-CAGATGGG 710 TAGAACTAGA GCAACAAC 15890 |||||||||| ||||| || TAGAACTAGA GCAACGAC 728 hqPGS_C06HBa0153O03.1-1-_SGN-E351546+ (16619 15890) ******************************************************************************** EST sequence 193 +strand 655 n (File: SGN-E356696+) 1 TTGGCCGATG ATATCTACTC AGTAGCCGTG TTTTGTACTG ACCCCTACTT TTATGTTTTT 61 CTTCTTGTTA TTTGTGGAGT GCAGCAAACG TGCCGTCATC TTCGACTCAA CAGTAACTCA 121 AGCCAGTCTT CGTCACACCG GATCTTCAGG GTGAGCTAAC GCTTCTAGCT TGGACTGGAT 181 CTTCTCCTTC ATGTCTTGAT GCCTTGAACT TCCGGCATGG ACTAGCTTCT TATGTAATTT 241 TTTAGCTTCT TAGAAACTCT TAGAATTAGT AGTTTAAAGT AGATGTTCTT GTGATGATGA 301 CTTCCAGGTT TTGGGAATAA TAGATGTTTA ATAATAGTAG TTATTGATTT TATTAATGAG 361 TTTAAGTCTT CCGCATTACT TTCTGTTGCT ATTACATTGA AATGTTAAGG TTTAGATTGG 421 TTGGTTCGCT CACATANGAG GGTAAGTGTG GGTGCCAGTG GCGGCCCGGA TTTGGGTCGT 481 GACAAAACTT GGTATCAGAG CATTAGGTTC GTTGGTCTCA TCACACAAGA ACAAGTCTAG 541 TAGAGTCTTA AGGAACGGTA GGGGGACGCT TTTACTTTTC CTTGAGAGGC TATAAGACTT 601 TAGGAAAACT TCACTCTTTC ATTCTTTCTT TCGTGCTACT ACTTGAGTCC AATTG Predicted gene structure (within gDNA segment 17471 to 14932): Exon 1 16619 15964 ( 656 n); cDNA 1 655 ( 655 n); score: 0.846 MATCH C06HBa0153O03.1-1- SGN-E356696+ 0.846 656 1.002 C PGS_C06HBa0153O03.1-1-_SGN-E356696+ (16619 15964) Alignment (genomic DNA sequence = upper lines): TTGGCCGATG ATATCTACTC AGTACCCATG TTTTGTACTG ACCCCTACTT GTATGTTTCT 16560 |||||||||| |||||||||| |||| || || |||||||||| |||||||||| ||||||| | TTGGCCGATG ATATCTACTC AGTAGCCGTG TTTTGTACTG ACCCCTACTT TTATGTTTTT 60 TTCCTTGTTA TTTGTGGAGT GCAGCAAACG TGCCGTCGTC TTCAACTCAA CCGCAACTCT 16500 | ||||||| |||||||||| |||||||||| ||||||| || ||| |||||| | | ||||| CTTCTTGTTA TTTGTGGAGT GCAGCAAACG TGCCGTCATC TTCGACTCAA CAGTAACTCA 120 AGCAAGACTT CATAACACCG GAT-TTCAGG GTGAGCT-AC ACTTCTAGCT TGAACTGGAT 16442 ||| || ||| | | |||||| ||| |||||| ||||||| || ||||||||| || ||||||| AGCCAGTCTT CGTCACACCG GATCTTCAGG GTGAGCTAAC GCTTCTAGCT TGGACTGGAT 180 CTTCTTGTTC ATGTCTTGAT GCCTTGAAGT TCCAGCATGG ACTAGCTTTT TAT-T--TAT 16385 ||||| ||| |||||||||| |||||||| | ||| |||||| |||||||| | ||| | | | CTTCTCCTTC ATGTCTTGAT GCCTTGAACT TCCGGCATGG ACTAGCTTCT TATGTAATTT 240 TCTAGCTTTC TAGATACTCT TAGCTTTAGT AATTTGAGGA TAGATGTTCT TGTGATGATG 16325 | |||||| |||| ||||| ||| ||||| | ||| | | |||||||||| |||||||||| TTTAGCTTCT TAGAAACTCT TAGAATTAGT AGTTTAAAG- TAGATGTTCT TGTGATGATG 299 ACTTCCAGAT TTTGGGGATA ATGATAAGTT TGAGTTTTAG AAAGTGATTA TTGATTTTCA 16265 |||||||| | |||||| ||| || | | ||| | | || | ||| ||| |||||||| | ACTTCCAGGT TTTGGGAATA AT-AGATGTT T-A---ATA- ATAGTAGTTA TTGATTTT-A 352 TTAATGAGTT TAAGTCTTCC GCATTATATT ATGTTAAT-T ATGTTTGAAA TGTTGGGGTT 16206 |||||||||| |||||||||| |||||| || |||| | | |||||| |||| |||| TTAATGAGTT TAAGTCTTCC GCATTACTTT CTGTTGCTAT TACATTGAAA TGTTAAGGTT 412 CAGATTGGTT GGTTCGCTCA CATAGTAGGA TAAGTGTGGG TGCCACTCGC GACCC-GTTT 16147 ||||||||| |||||||||| |||| ||| |||||||||| ||||| | || | ||| | || TAGATTGGTT GGTTCGCTCA CATANGAGGG TAAGTGTGGG TGCCAGTGGC GGCCCGGATT 472 TGGGTCGTGA C-AAACTTGG TATTAGAGCA TTAGGTTCGT TGGTCTCATC ACACAAGAAC 16088 |||||||||| | |||||||| ||| |||||| |||||||||| |||||||||| |||||||||| TGGGTCGTGA CAAAACTTGG TATCAGAGCA TTAGGTTCGT TGGTCTCATC ACACAAGAAC 532 GAGTCTAGTA GAGTCTGAAG GAACGGTAGG GGGACGCCTT TACTTTTCTT TGAGAGGCTA 16028 ||||||||| |||||| ||| |||||||||| ||||||| || |||||||| | |||||||||| AAGTCTAGTA GAGTCTTAAG GAACGGTAGG GGGACGCTTT TACTTTTCCT TGAGAGGCTA 592 TAAGACTTTA GGAAAAATTC CATTCTTTCT TTCTTTCCTT TGTGCTATTA CTTGGATCCA 15968 |||||||||| |||||| || || |||||| ||||||| || |||||| || |||| |||| TAAGACTTTA GGAAAACTT- CACTCTTTCA TTCTTTCTTT CGTGCTACTA CTTGAGTCCA 651 ATTG 15964 |||| ATTG 655 hqPGS_C06HBa0153O03.1-1-_SGN-E356696+ (16619 15964) ******************************************************************************** EST sequence 191 +strand 580 n (File: SGN-E356206+) 1 TTGGCCGATG ATATCTACTC AGTAGCCGTG TTTTGTACTG ACCCCTACTT TTATGTTTTT 61 CTTCTTGTTA TTTGTGGAGT GCAGCAAACG TGCCGTCATC TTCGACTCAA CAGTAACTCA 121 AGCCAGTCTT CGTCACACCG GATCTTCAGG GTGAGCTAAC GCTTCTAGCT TGGACTGGAT 181 CTTCTCCTTC ATGTCTTGAT GCCTTGAACT TCCGGCATGG ACTAGCTTCT TATGTAATTT 241 TTTAGCTTCT TAGAAACTCT TAGAATTAGT AGTTTAAAGT AGATGTTCTT GTGATGATGA 301 CTTCCAGGTT TTGGGAATAA TAGATGTTTA ATAATAGTAG TTATTGATTT TATTAATGAG 361 TTTAAGTCTT CCGCATTACT TTCTGTTGCT ATTACATTGA AATGTTAAGG TTTAGATTGG 421 TTGGTTCGCT CACATAGGAG GGTAAGTGTG GGTGCCAGTG GCGGCCCGGA TTTGGGTCGT 481 GACAAAACTT GGTATCAGAG CATTAGGGTC GTTGGTCTCA TCACACAAGA ACAAGTCTAG 541 TAGAGTCTTA AGGGACGGTA NGGGGACGCT TTTACTTTTC Predicted gene structure (within gDNA segment 17471 to 11083): Exon 1 16619 16040 ( 580 n); cDNA 1 580 ( 580 n); score: 0.840 MATCH C06HBa0153O03.1-1- SGN-E356206+ 0.840 580 1.000 C PGS_C06HBa0153O03.1-1-_SGN-E356206+ (16619 16040) Alignment (genomic DNA sequence = upper lines): TTGGCCGATG ATATCTACTC AGTACCCATG TTTTGTACTG ACCCCTACTT GTATGTTTCT 16560 |||||||||| |||||||||| |||| || || |||||||||| |||||||||| ||||||| | TTGGCCGATG ATATCTACTC AGTAGCCGTG TTTTGTACTG ACCCCTACTT TTATGTTTTT 60 TTCCTTGTTA TTTGTGGAGT GCAGCAAACG TGCCGTCGTC TTCAACTCAA CCGCAACTCT 16500 | ||||||| |||||||||| |||||||||| ||||||| || ||| |||||| | | ||||| CTTCTTGTTA TTTGTGGAGT GCAGCAAACG TGCCGTCATC TTCGACTCAA CAGTAACTCA 120 AGCAAGACTT CATAACACCG GAT-TTCAGG GTGAGCT-AC ACTTCTAGCT TGAACTGGAT 16442 ||| || ||| | | |||||| ||| |||||| ||||||| || ||||||||| || ||||||| AGCCAGTCTT CGTCACACCG GATCTTCAGG GTGAGCTAAC GCTTCTAGCT TGGACTGGAT 180 CTTCTTGTTC ATGTCTTGAT GCCTTGAAGT TCCAGCATGG ACTAGCTTTT TAT-T--TAT 16385 ||||| ||| |||||||||| |||||||| | ||| |||||| |||||||| | ||| | | | CTTCTCCTTC ATGTCTTGAT GCCTTGAACT TCCGGCATGG ACTAGCTTCT TATGTAATTT 240 TCTAGCTTTC TAGATACTCT TAGCTTTAGT AATTTGAGGA TAGATGTTCT TGTGATGATG 16325 | |||||| |||| ||||| ||| ||||| | ||| | | |||||||||| |||||||||| TTTAGCTTCT TAGAAACTCT TAGAATTAGT AGTTTAAAG- TAGATGTTCT TGTGATGATG 299 ACTTCCAGAT TTTGGGGATA ATGATAAGTT TGAGTTTTAG AAAGTGATTA TTGATTTTCA 16265 |||||||| | |||||| ||| || | | ||| | | || | ||| ||| |||||||| | ACTTCCAGGT TTTGGGAATA AT-AGATGTT T-A---ATA- ATAGTAGTTA TTGATTTT-A 352 TTAATGAGTT TAAGTCTTCC GCATTATATT ATGTTAAT-T ATGTTTGAAA TGTTGGGGTT 16206 |||||||||| |||||||||| |||||| || |||| | | |||||| |||| |||| TTAATGAGTT TAAGTCTTCC GCATTACTTT CTGTTGCTAT TACATTGAAA TGTTAAGGTT 412 CAGATTGGTT GGTTCGCTCA CATAGTAGGA TAAGTGTGGG TGCCACTCGC GACCC-GTTT 16147 ||||||||| |||||||||| ||||| ||| |||||||||| ||||| | || | ||| | || TAGATTGGTT GGTTCGCTCA CATAGGAGGG TAAGTGTGGG TGCCAGTGGC GGCCCGGATT 472 TGGGTCGTGA C-AAACTTGG TATTAGAGCA TTAGGTTCGT TGGTCTCATC ACACAAGAAC 16088 |||||||||| | |||||||| ||| |||||| ||||| |||| |||||||||| |||||||||| TGGGTCGTGA CAAAACTTGG TATCAGAGCA TTAGGGTCGT TGGTCTCATC ACACAAGAAC 532 GAGTCTAGTA GAGTCTGAAG GAACGGTAGG GGGACGCCTT TACTTTTC 16040 ||||||||| |||||| ||| | |||||| | ||||||| || |||||||| AAGTCTAGTA GAGTCTTAAG GGACGGTANG GGGACGCTTT TACTTTTC 580 hqPGS_C06HBa0153O03.1-1-_SGN-E356206+ (16619 16040) ******************************************************************************** EST sequence 137 +strand 299 n (File: SGN-E373117+) 1 TCTTCTAGCT TGTCCAAATA CTGTCGTAAT TTTGGCAAAC GTACCGTCGT CTTCAACTCA 61 ACCGCAACTC TAGCCAGTCT TCATTACATC GGATTTCAGG GTGAGCTAAC GCTTCTAGCT 121 TGGACTGGAT CTTCTTCTTC ATGTCTTGAT GCCTTGAAGT TCCGGCATGG ACTAGCTGTT 181 TATGTATTTT AGCTTCTTAG ATACTCTTAG ATTTAGTAAT TTGAAGTAGA TGTTCTTGTG 241 ATGATGACTT CCAGATTTTG GGGATAATAA TAGTTGTTGA GTTTTTAGAA AAAAAAAAA Predicted gene structure (within gDNA segment 17557 to 15383): Exon 1 16535 16282 ( 254 n); cDNA 36 291 ( 256 n); score: 0.892 MATCH C06HBa0153O03.1-1- SGN-E373117+ 0.892 254 0.849 C PGS_C06HBa0153O03.1-1-_SGN-E373117+ (16535 16282) Alignment (genomic DNA sequence = upper lines): CAAACGTGCC GTCGTCTTCA ACTCAACCGC AACTCTAGCA AGACTTCATA ACACCGGATT 16476 ||||||| || |||||||||| |||||||||| ||||||||| || |||||| ||| |||||| CAAACGTACC GTCGTCTTCA ACTCAACCGC AACTCTAGCC AGTCTTCATT ACATCGGATT 95 TCAGGGTGAG CTA-CACTTC TAGCTTGAAC TGGATCTTCT TGTTCATGTC TTGATGCCTT 16417 |||||||||| ||| | |||| ||||||| || |||||||||| | |||||||| |||||||||| TCAGGGTGAG CTAACGCTTC TAGCTTGGAC TGGATCTTCT TCTTCATGTC TTGATGCCTT 155 GAAGTTCCAG CATGGACTAG CTTTTTATTT ATTCTAGCTT TCTAGATACT CTTAGCTTTA 16357 |||||||| | |||||||||| || ||||| | ||| |||||| |||||||| ||||| |||| GAAGTTCCGG CATGGACTAG CTGTTTATGT ATTTTAGCTT CTTAGATACT CTTAGATTTA 215 GTAATTTGAG GATAGATGTT CTTGTGATGA TGACTTCCAG ATTTTGGGGA TAATGATAAG 16297 ||||||||| | |||||||| |||||||||| |||||||||| |||||||||| |||| ||| GTAATTTGAA G-TAGATGTT CTTGTGATGA TGACTTCCAG ATTTTGGGGA TAATAATAGT 274 T-TTGAG-TT TTAGAAA 16282 | ||||| || ||||||| TGTTGAGTTT TTAGAAA 291 hqPGS_C06HBa0153O03.1-1-_SGN-E373117+ (16535 16282) ******************************************************************************** EST sequence 31 -strand 299 n (File: SGN-E373116-) 1 CTCTTCTAGC TTGTCCAAAT ACTGTCGTAA TTTTGGCAAA CGTACCGTCG TCTTCAACTC 61 AACCGCAACT CTAGCCAGTC TTCATTACAT CGGATTTCAG GGTGAGCTAA CGCTTCTAGC 121 TTGGACTGGA TCTTCTTCTT CATGTCTTGA TGCCTTGAAG TTCCGGCATG GACTAGCTGT 181 TTATGTATTT TAGCTTCTTA GATACTCTTA GATTTAGTAA TTTGAAGTAG ATGTTCTTGT 241 GATGATGACT TCCAGATTTT GGGGATAATA ATAGTTGTTG AGTTTTTAGA AAAAAAAAA Predicted gene structure (within gDNA segment 17577 to 15403): Exon 1 16535 16282 ( 254 n); cDNA 37 292 ( 256 n); score: 0.892 MATCH C06HBa0153O03.1-1- SGN-E373116- 0.892 254 0.849 C PGS_C06HBa0153O03.1-1-_SGN-E373116- (16535 16282) Alignment (genomic DNA sequence = upper lines): CAAACGTGCC GTCGTCTTCA ACTCAACCGC AACTCTAGCA AGACTTCATA ACACCGGATT 16476 ||||||| || |||||||||| |||||||||| ||||||||| || |||||| ||| |||||| CAAACGTACC GTCGTCTTCA ACTCAACCGC AACTCTAGCC AGTCTTCATT ACATCGGATT 96 TCAGGGTGAG CTA-CACTTC TAGCTTGAAC TGGATCTTCT TGTTCATGTC TTGATGCCTT 16417 |||||||||| ||| | |||| ||||||| || |||||||||| | |||||||| |||||||||| TCAGGGTGAG CTAACGCTTC TAGCTTGGAC TGGATCTTCT TCTTCATGTC TTGATGCCTT 156 GAAGTTCCAG CATGGACTAG CTTTTTATTT ATTCTAGCTT TCTAGATACT CTTAGCTTTA 16357 |||||||| | |||||||||| || ||||| | ||| |||||| |||||||| ||||| |||| GAAGTTCCGG CATGGACTAG CTGTTTATGT ATTTTAGCTT CTTAGATACT CTTAGATTTA 216 GTAATTTGAG GATAGATGTT CTTGTGATGA TGACTTCCAG ATTTTGGGGA TAATGATAAG 16297 ||||||||| | |||||||| |||||||||| |||||||||| |||||||||| |||| ||| GTAATTTGAA G-TAGATGTT CTTGTGATGA TGACTTCCAG ATTTTGGGGA TAATAATAGT 275 T-TTGAG-TT TTAGAAA 16282 | ||||| || ||||||| TGTTGAGTTT TTAGAAA 292 hqPGS_C06HBa0153O03.1-1-_SGN-E373116- (16535 16282) ******************************************************************************** EST sequence 41 -strand 686 n (File: SGN-E241789-) 1 TTGTTAATGA TGATGTCTTG GTATAAAAGA AGGCTTGATG AACTAAAAGA ATGAGGTTAG 61 GGGATCGGGT GTCACGAACC GACACGTAGT ATTAATGGAT CGGGTGTCAC GAACCGACAC 121 ATAGTATTAA TGGATCGGGT GTCACGAATC GGCACGTAGT ATTAGGAGAT CGGGTGTAAC 181 GAACCGACAC GTAGTATTAG GGGATCGGGT GTCACGAACC GACACGTAGC ATTAGGGGAT 241 CGGAGTATCA CGTTCCGACA CCACGATAGT AAAAAGAATG AATCTTGAAT TATGTTAATG 301 TACTCAATTT AATGAACCTG TTTCCCAAAT GAGTATGGTG TGGAGGCTTG AGTCCTCATA 361 GATGTTCTTG GGTTGTGCCC AATGGTTATG GTACTTGTTG TTGTCACCTG TTAAGTGTTA 421 TGGTTGATTT TATTTTATTA TTTGATATAT ATTGTTCTCT ATTCTGAGTT GGCCGATGAT 481 ATCTACTCAG TACTCGTGTT TGTACTGACC CCTACTTTTA TGTTTTCTTT TTGTTAATTG 541 TGGAGTGCAG CAAACGTACC GTCGTCTTCA ACTCAACCGC AACTCTAACC AGTCTTCATC 601 ACGTCAGATT TCAGGGTGAG CTATTGTTCC TAGCTCGGAC TGGATTCTCT CTCATTCATG 661 TCTTGATGTC CTTGAAGATC AGACAT Predicted gene structure (within gDNA segment 17887 to 14909): Exon 1 17082 16404 ( 679 n); cDNA 2 686 ( 685 n); score: 0.811 MATCH C06HBa0153O03.1-1- SGN-E241789- 0.811 679 0.990 C PGS_C06HBa0153O03.1-1-_SGN-E241789- (17082 16404) Alignment (genomic DNA sequence = upper lines): TGAGATTGAT GATGCCTTGG TATAAAAGAA GGCTTGATGA ATTAATAGAA TGAGATTAGT 17023 || | |||| |||| ||||| |||||||||| |||||||||| | ||| |||| |||| |||| TGTTAATGAT GATGTCTTGG TATAAAAGAA GGCTTGATGA ACTAAAAGAA TGAGGTTAGG 61 GGAGTAGGTG TCACGAACCG ACACATAGAA TTAGGGGATC GGGTGCCACG AACCGACACG 16963 ||| |||| |||||||||| |||| ||| | ||| ||||| ||||| |||| ||||||||| GGATCGGGTG TCACGAACCG ACACGTAGTA TTAATGGATC GGGTGTCACG AACCGACACA 121 TAGAATTAAG GGATCGGGTG TCACGAACCG ACACGTAGAA TTAGGGAATT GGGTGTCACA 16903 ||| ||||| |||||||||| ||||||| || ||||||| | ||||| || |||||| || TAGTATTAAT GGATCGGGTG TCACGAATCG GCACGTAGTA TTAGGAGATC GGGTGTAACG 181 AACTGACACG TAGAATTAGG GGATCGGGTG TCACGAATCG ACACGTAGAA CTAGGGAATC 16843 ||| |||||| ||| |||||| |||||||||| ||||||| || |||||||| | ||||| ||| AACCGACACG TAGTATTAGG GGATCGGGTG TCACGAACCG ACACGTAGCA TTAGGGGATC 241 GGAGTGTCAC GTACCGACA- CA--AGAGTA AAGGTGATGA ATCTTGAAAG ATGTTAATAT 16786 ||||| |||| || |||||| || | |||| || |||| |||||||| |||||||| | GGAGTATCAC GTTCCGACAC CACGATAGTA AAAAGAATGA ATCTTGAATT ATGTTAATGT 301 ACTCAATCTA ATGAACCTAA GTCCCAAATG AGTATGGTAT TGAGGCTTGA GTCCTCATGA 16726 ||||||| || |||||||| ||||||||| |||||||| | ||||||||| |||||||| ACTCAATTTA ATGAACCTGT TTCCCAAATG AGTATGGTGT GGAGGCTTGA GTCCTCATAG 361 GTGTACTTGA CGTTATTTAT CAAAGATTCT TGTACTTGTT G--CT-ACAT GTTGAGTAAT 16669 ||| |||| ||| | ||| | || | ||||||||| | | || | ||| ||| | ATGTTCTTG- GGTT-GTGCC CAATGGTTAT GGTACTTGTT GTTGTCACCT GTTAAGTGTT 419 GTAGTTGATT TTATATTATT ACTTGATATA TATTGTTTTC TATTTTGAGT TGGCCGATGA 16609 | ||||||| |||| ||||| | |||||||| ||||||| || |||| ||||| |||||||||| ATGGTTGATT TTATTTTATT ATTTGATATA TATTGTTCTC TATTCTGAGT TGGCCGATGA 479 TATCTACTCA GTACCCATGT TTTGTACTGA CCCCTACTTG TATGTTTCTT TCCTTGTTAT 16549 |||||||||| |||| | || |||||||||| ||||||||| ||||||| | | |||||| TATCTACTCA GTACTCGTG- TTTGTACTGA CCCCTACTTT TATGTTT-TC TTTTTGTTAA 537 TTGTGGAGTG CAGCAAACGT GCCGTCGTCT TCAACTCAAC CGCAACTCTA GCAAGACTTC 16489 |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| | || |||| TTGTGGAGTG CAGCAAACGT ACCGTCGTCT TCAACTCAAC CGCAACTCTA ACCAGTCTTC 597 ATAACACCGG ATTTCAGGGT GAGCTACACT T-CTAGCTTG AACTGGA-TC T-TCTTGTTC 16432 || || | | |||||||||| |||||| | | |||||| | |||||| || | ||| ||| ATCACGTCAG ATTTCAGGGT GAGCTATTGT TCCTAGCTCG GACTGGATTC TCTCTCATTC 657 ATGTCTTGAT G-CCTTGAAG TTCCAGCAT 16404 |||||||||| | |||||||| || ||| ATGTCTTGAT GTCCTTGAAG ATCAGACAT 686 hqPGS_C06HBa0153O03.1-1-_SGN-E241789- (17082 16404) ******************************************************************************** EST sequence 89 -strand 337 n (File: SGN-E357033-) 1 GGATAGCGTT CATGAAACTG TCATGTAGAT TTAGGGGATC GGAGTGTCAT GTGTTCACAC 61 AAGAGGATTA ATGAATATGA GGGAGCGGAA TGTCATGTTC CGTCACAAGA GAAATAAAGA 121 TAATGAATCT TGAAAGATGT TAATATACTA AATCTAATGA ACATGATTCC CAAATGAGTA 181 TGGTATTGAG GCTTGAGTCC TCATGTGTGA ACGTGACGGT AATTGTTAAT GATATAGTGC 241 TTGTTGTTGC TACATGTTGA GTATCATAGT TGATTTTATG ATAATNCTTG ATATATATTG 301 ATTTCTATTT TGAGTTGGGC GATGATATCT ATTCAGT Predicted gene structure (within gDNA segment 18639 to 14807): Exon 1 16848 16597 ( 252 n); cDNA 81 337 ( 257 n); score: 0.815 MATCH C06HBa0153O03.1-1- SGN-E357033- 0.815 252 0.748 C PGS_C06HBa0153O03.1-1-_SGN-E357033- (16848 16597) Alignment (genomic DNA sequence = upper lines): GGAATCGGAG TGTCACGTAC CGACACAAGA G---TAAAGG TGATGAATCT TGAAAGATGT 16792 || | |||| ||||| || | || ||||||| | ||||| | |||||||| |||||||||| GGGAGCGGAA TGTCATGTTC CGTCACAAGA GAAATAAAGA TAATGAATCT TGAAAGATGT 140 TAATATACTC AATCTAATGA ACCTAAGTCC CAAATGAGTA TGGTATTGAG GCTTGAGTCC 16732 ||||||||| |||||||||| || | | ||| |||||||||| |||||||||| |||||||||| TAATATACTA AATCTAATGA ACATGATTCC CAAATGAGTA TGGTATTGAG GCTTGAGTCC 200 TCATGAGTGT ACTTGACGTT ATTTATCAAA GAT-TCTTG- TACTTGTTGC TACATGTTGA 16674 ||||| ||| || ||||| | | || | || ||| | || | ||||||| |||||||||| TCATGTGTGA ACGTGACGGT AATTGTTAAT GATATAGTGC TTGTTGTTGC TACATGTTGA 260 GTAATGTAGT TGATTTTATA TTATTACTTG ATATATATTG TTTTCTATTT TGAGTTGGCC 16614 ||| |||| ||||||||| || | |||| |||||||||| ||||||||| |||||||| | GTATCATAGT TGATTTTATG ATAATNCTTG ATATATATTG ATTTCTATTT TGAGTTGGGC 320 GATGATATCT ACTCAGT 16597 |||||||||| | ||||| GATGATATCT ATTCAGT 337 hqPGS_C06HBa0153O03.1-1-_SGN-E357033- (16848 16597) ******************************************************************************** EST sequence 105 +strand 774 n (File: SGN-E349977+) 1 GATGTTGTAA ACCTGTCTAT ATGCTTATTG GTATGCTTGC ATGATTATGA TTATGAAATT 61 GTTATGAATT AACAAAGCAT GATGAAGCTA TTGAATCCCA AATCTTGAAA GAACCCTAAT 121 TCACTTGATA TATTATATAA TAAGATTGAT GAGGCCTTAG TATGAGAGAA GGCTTGATAA 181 ATTATATAAT GAGAATGATG ATGCCTTAGT GTAGAAGAAG GCTTGATGAA TTAGGGGATC 241 GGGTGCCACG TTTCGGTACC AGGATAGAAT GAATATGAGG ATCGGAGTGT CACGTTCCGA 301 CACCAGGATA GAATATGAGG ATCGGAGTGT CACGTTCCGA CACCAGGATA GAATATGAGG 361 ATCGGGTGCC ACGTTCCGGT ACCAGGATAG AATGAATATG AGGATCGGAG TGTTACGTTC 421 CGACACCAGG ATAGAATATG AGGATCGGGT GCCACGTTCC GGTACCAGGA TAGAATGAAT 481 ATGAGGATCG GAGTGCCACG TTCCGGCACC AGGATAGAAA TAGAGGAGCG GAGTGTCACG 541 TACCGACACA AGAGGAAGAA AGATAATGAA TCTTGAAAGA TAATGAATAT ACTCAATCTA 601 ATGAACTTAA TTCCCAAATG AGTATGGTAT TGAGGCTTGA GTCCTCATGT GTGAACTTGA 661 CGGTAATTGT TAATGATNAA GAAATTGCTA TTGCTACATG ATGAGTATTG TAGTTGATTT 721 ACATTATTAT TGATATATAT TGTTCCCTAT TTTGAGTTAG CGATGATATC TACT Predicted gene structure (within gDNA segment 19129 to 14867): Exon 1 17934 17920 ( 15 n); cDNA 416 429 ( 14 n); score: 0.733 Intron 1 17919 17000 ( 920 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.69) Exon 2 16999 16951 ( 49 n); cDNA 430 477 ( 48 n); score: 0.694 Intron 2 16950 16890 ( 61 n); Pd: 0.000 (s: 0.69), Pa: 0.000 (s: 0.48) Exon 3 16889 16601 ( 289 n); cDNA 478 774 ( 297 n); score: 0.756 MATCH C06HBa0153O03.1-1- SGN-E349977+ 0.756 353 0.456 C PGS_C06HBa0153O03.1-1-_SGN-E349977+ (17934 17920,16999 16951,16889 16601) Alignment (genomic DNA sequence = upper lines): CGTTTTGAGC AGCAGATTTT ATTTTTGGAA AAACTGGCTG AGACGACGGA TCCCACGATG 17875 |||| || | | ||| CGTTCCGA-C ACCAG..... .......... .......... .......... .......... 429 GACCGTCATG GGCACGATGG ACCGTCGAGG GGGTCTCGTT CCAAAATACA TAGAATTCTG 17815 .......... .......... .......... .......... .......... .......... 429 AAATTTGGGT TTTGAAATCG ACTCTCTGAA CTTCGTGATG AAGTGGCAGG ACGGACCGTC 17755 .......... .......... .......... .......... .......... .......... 429 ACAGGCATGA CGGGCCGTCA CAGTCTCTTC AGAAAATTTC AGTCTCTGAA CTCTGTGACG 17695 .......... .......... .......... .......... .......... .......... 429 GAAGCAGCAG GACGGACCGT CGCAGGCACG ACGACCCGTC ACAGACTGCG TAATCCCAGG 17635 .......... .......... .......... .......... .......... .......... 429 CTGAGTCGGA TTTCTTTAAA TGTTTTAAGG GGGCGTTTTG GACTATTCCT GCTATAATTA 17575 .......... .......... .......... .......... .......... .......... 429 TAAATTTAGT GGGTTAATGT TAATAATTTA ACTACTTGAG GGTTAAAAGA GATAACCTTG 17515 .......... .......... .......... .......... .......... .......... 429 AATTAGTTAG TGGGTTAAAC TCATCATCTT TCATACTTAA TTATATGCTA ATTAGGGTAA 17455 .......... .......... .......... .......... .......... .......... 429 AAGAAAGAAG GTTTGAATAA GAAAAAGAAA AGAACAGAAA GAGAGGGAGA AACGATCGAG 17395 .......... .......... .......... .......... .......... .......... 429 AGAGAGAGAG GAATGAAGAG GAAAGCAAAG ATCTTGAGGA AATTGCTTGC TTGATCACGA 17335 .......... .......... .......... .......... .......... .......... 429 ATCTTCGGTG GAAGTAGGTT ATGGTTTTTA TACTATTCGT AGTTAACTCT TAATAGCGAA 17275 .......... .......... .......... .......... .......... .......... 429 TGATATGTGT TGGGTTGTAT TGTAAAGTCT TCTATATGCT TAATTGTATG CTTGCATGAA 17215 .......... .......... .......... .......... .......... .......... 429 TATGATTATA TAATTGTGAT GAAATAAGCA TGATGAAGCT ATTGAATCCC AAATCTTGAA 17155 .......... .......... .......... .......... .......... .......... 429 AACTCCAATC TTGAAAACCC CTTGTTATTG ATGATGCCTT GGTATAAAAG AAGGCTTGAT 17095 .......... .......... .......... .......... .......... .......... 429 GAACTAAAGT AATGAGATTG ATGATGCCTT GGTATAAAAG AAGGCTTGAT GAATTAATAG 17035 .......... .......... .......... .......... .......... .......... 429 AATGAGATTA GTGGAGTAGG TGTCACGAAC CGACACATAG AATTAGGGGA TCGGGTGCCA 16975 |||| ||| | ||| |||||||||| .......... .......... .......... .....GATAG AATATGAGGA TCGGGTGCCA 454 CGAACCGACA CGTAGAATTA AGGGATCGGG TGTCACGAAC CGACACGTAG AATTAGGGAA 16915 || ||| | | || || | | CGTTCCGGTA CC-AGGATAG AATG...... .......... .......... .......... 477 TTGGGTGTCA CAAACTGACA CGTAGAATTA GGGGATCGG- GTGTCACGAA TCGACA-C-G 16858 ||| | ||||||| ||| |||| || || | | .......... .......... .....AATAT GAGGATCGGA GTGCCACGTT CCGGCACCAG 512 --TAGAACTA GGGAATCGGA GTGTCACGTA CCGACACAAG AG-TA-AAGG -TGATGAATC 16803 ||||| || | | | |||| |||||||||| |||||||||| || | || | | ||||||| GATAGAAATA GAGGAGCGGA GTGTCACGTA CCGACACAAG AGGAAGAAAG ATAATGAATC 572 TTGAAAGATG -TTAATATAC TCAATCTAAT GAACCTAAGT CCCAAATGAG TATGGTATTG 16744 ||||||||| | ||||||| |||||||||| |||| ||| | |||||||||| |||||||||| TTGAAAGATA ATGAATATAC TCAATCTAAT GAACTTAATT CCCAAATGAG TATGGTATTG 632 AGGCTTGAGT CCTCATGAGT GTACTTGACG TTATTTATCA AAGATTCTTG TACT--TGTT 16686 |||||||||| ||||||| || | |||||||| || || | | | ||| | | | || AGGCTTGAGT CCTCATGTGT GAACTTGACG GTAATTGTTA ATGATNAAGA AATTGCTATT 692 GCTACATGTT GAGTAATGTA GTTGATTTTA TATTATTACT TGATATATAT TGTTTTCTAT 16626 |||||||| | ||||| |||| ||||| |||| ||||||| | |||||||||| |||| |||| GCTACATGAT GAGTATTGTA GTTGA-TTTA CATTATTA-T TGATATATAT TGTTCCCTAT 750 TTTGAGTTGG CCGATGATAT CTACT 16601 |||||||| | ||||||||| ||||| TTTGAGTTAG -CGATGATAT CTACT 774 hqPGS_C06HBa0153O03.1-1-_SGN-E349977+ (16889 16601) ******************************************************************************** EST sequence 117 +strand 673 n (File: SGN-E550140+) 1 GTAATTTTTT ATATTTTTGG TCTAATTTTT TGTTAATTCA TGGTTGATAA CATCTTTGTT 61 TCTAATAGTG TTATAAATCG TAAAATAATA ATAATATTTA AGTAGGAAGA ACATAAATGT 121 AACGACCTGT TTAGTCGTTT TGAGTAGCAG ATTTTATTTT TGGAAAAACA GGTTGAGACG 181 ACGGAACCCA CGACGGACCG TCATGAGCAC GATGGACCGT CGAGGAGTCT CGTTTCAAAA 241 CACTTAGAAA TTCTGAAATT GGGTACTAAA AATCGACTCT CTGAACTTCG TAACGGAATG 301 GCACGACGGA CCGTCACGGG CGTGACGGAC CGTCACAGAC TCTTTGGTGG AAATTGAGTC 361 TCTGAACCTT GCGACGACCT GCAGGACGGA CCGTCGCAGG CACGACGGGC CATCACAGGT 421 TGCGTAATCC CAGTCTGGGT CGGATTTCTT TACACGTTTT AAGGGACGTT TTGGACTATT 481 CCTACTTTAA TTATAAAGTT AGTGGGTTTA TGTTAATAAG TCTAATTACC TGGGGGTTAA 541 AAGAGGTAAC CTTNGAGTAA TTAGTGGGTT ATTATTCCAT CTTTTATTCT TAATTATATG 601 CTAATTAGGG TAAAAGAAGG AGGGTTTGAA TAAGAAAAAG AAAAGAACAG AAAGAGAGAG 661 AAGGAGAATC GAT Predicted gene structure (within gDNA segment 23959 to 16650): Exon 1 18544 18514 ( 31 n); cDNA 81 110 ( 30 n); score: 0.710 Intron 1 18513 17960 ( 554 n); Pd: 0.845 (s: 0), Pa: 0.000 (s: 0.90) Exon 2 17959 17396 ( 564 n); cDNA 111 672 ( 562 n); score: 0.840 MATCH C06HBa0153O03.1-1- SGN-E550140+ 0.840 595 0.884 C PGS_C06HBa0153O03.1-1-_SGN-E550140+ (18544 18514,17959 17396) Alignment (genomic DNA sequence = upper lines): TAAAATAAAA AATTTATTAT ATGTATAAAA AGCAAAAATA AAAATAAAAT TTCTCAAAGT 18485 |||||||| | | |||| | | ||| || | TAAAATAATA ATAATATT-T AAGTAGGAAG A......... .......... .......... 110 TCTTATTCTT TGTATTAAAA TAATAAGACA ATAAAAATCT TAAGATTCTT ATTCTTCATT 18425 .......... .......... .......... .......... .......... .......... 110 TTTGCGCAAA AAAATCTTTA TTTTATTTTA TGTTTTATAC ATATTATTTA ATATTTTAAT 18365 .......... .......... .......... .......... .......... .......... 110 TTGTGAGAAA TTTTTTTAAG TTATTTGGAT TAAATTTTTA AATTATATTG AGAAAATGCA 18305 .......... .......... .......... .......... .......... .......... 110 CAAGTATTCC CTCAAACTAT GTCTGAAATC CCAGAGACAC ACTTATACTA TATTAAGGTC 18245 .......... .......... .......... .......... .......... .......... 110 ATATTACCCC CTGAACTTAT TTTATAAGTA ATTTTCTACC CCTTTTGACC TACGTGGCTC 18185 .......... .......... .......... .......... .......... .......... 110 TAGCTTGAAA AAAAAGTCAA TCAGCGTTGG ACCCACAAGA TAGTGCCACA TAGACCGAAA 18125 .......... .......... .......... .......... .......... .......... 110 AGGGCTAGAA AATTATTAAT AAAATAAGTT CAGGGATAAT AGGACCTTAG TATAGTGTAA 18065 .......... .......... .......... .......... .......... .......... 110 GTATGACTTT AAAATTTCAG GCATAAATTG AGAGGGTACT TGTGCATTAT CTCAATAATA 18005 .......... .......... .......... .......... .......... .......... 110 TTCAAATCTT TACATTAATA TCTAATTTGA TGTAATATTT TAATAATAAT AATGTAACGA 17945 | | |||||||||| .......... .......... .......... .......... .....ACATA AATGTAACGA 125 CCTATTTAGT CGTTTTGAGC AGCAGATTTT ATTTTTGGAA AAACTGGCTG AGACGACGGA 17885 ||| |||||| ||||||||| |||||||||| |||||||||| |||| || || |||||||||| CCTGTTTAGT CGTTTTGAGT AGCAGATTTT ATTTTTGGAA AAACAGGTTG AGACGACGGA 185 TCCCACGATG GACCGTCATG GGCACGATGG ACCGTCGAGG GGGTCTCGTT CCAAAATACA 17825 ||||||| | |||||||||| ||||||||| |||||||| | | |||||||| ||||| || ACCCACGACG GACCGTCATG AGCACGATGG ACCGTCGA-G GAGTCTCGTT TCAAAACACT 244 TAG-AATTCT GAAATTTGGG TTTTGAAATC GACTCTCTGA ACTTCGTGAT GAAGTGGCAG 17766 ||| |||||| |||||| || | ||||| |||||||||| ||||||| | | | ||||| TAGAAATTCT GAAATTGGGT ACTAAAAATC GACTCTCTGA ACTTCGTAAC GGAATGGCAC 304 GACGGACCGT CACAGGCATG ACGGGCCGTC ACAGTCTCTT CAG-AAAATT TCAGTCTCTG 17707 |||||||||| ||| ||| || |||| ||||| |||| ||||| | || | | |||||||| GACGGACCGT CACGGGCGTG ACGGACCGTC ACAGACTCTT TGGTGGAAAT TGAGTCTCTG 364 AACTCTGTGA CGGAAGCAGC AGGACGGACC GTCGCAGGCA CGACGACCCG TCACAGACTG 17647 ||| || || | | | | || |||||||||| |||||||||| ||||| || |||||| || AACCTTGCGA C-G-ACCTGC AGGACGGACC GTCGCAGGCA CGACGGGCCA TCACAGGTTG 422 CGTAATCCCA GGCTGAGTCG GATTTCTTTA AATGTTTTAA GGGGGCGTTT TGGACTATTC 17587 |||||||||| | ||| |||| |||||||||| | ||||||| ||| ||||| |||||||||| CGTAATCCCA GTCTGGGTCG GATTTCTTTA CACGTTTTAA -GGGACGTTT TGGACTATTC 481 CTGCTATAAT TATAAATTTA GTGGGTTAAT GTTAATAA-T TTAACTACTT GAGGGTTAAA 17528 || || |||| |||||| ||| ||||||| || |||||||| | ||| ||| | | |||||||| CTACTTTAAT TATAAAGTTA GTGGGTTTAT GTTAATAAGT CTAATTACCT GGGGGTTAAA 541 AGAGATAACC TTGAATTAGT TAGTGGGTTA AACTCATCAT CTTTCATACT TAATTATATG 17468 |||| ||||| || | || | |||||||||| | ||| |||| || || |||||||||| AGAGGTAACC TTNGAGTAAT TAGTGGGTTA TTAT-TCCAT CTTTTATTCT TAATTATATG 600 CTAATTAGGG TAAAAGAAAG AAGGTTTGAA TAAGAAAAAG AAAAGAACAG AAAGAGAGGG 17408 |||||||||| |||||||| | | |||||||| |||||||||| |||||||||| |||||||| | CTAATTAGGG TAAAAGAAGG AGGGTTTGAA TAAGAAAAAG AAAAGAACAG AAAGAGAGAG 660 AGAAACGATC GA 17396 | | ||| || AAGGAGAATC GA 672 hqPGS_C06HBa0153O03.1-1-_SGN-E550140+ (18544 18514,17959 17396) ******************************************************************************** EST sequence 104 +strand 605 n (File: SGN-E347579+) 1 AATAAAGAAA ATAGAAAGAA CAAGAGAGAG AGGAAGAATC GAACGAGAAG GGAGAAACAA 61 AGCTTGGAGA AAAATTTGCT TGCTTGATCA CTAATCTTCG GTGGAGGTAG GTTATGGTTT 121 TCATGCTTTC ATAGTAAACT CTTAATAGAG AATGATATGT ATTGGTAGTA TTGTAAACCC 181 TGCTATATGC TTAATTGTAT GCATGCATGA ACGTGATTAT ATAATTGTGA TTATATTAAG 241 CATGATGAAG TTATTGAATC CCAAATCTTG ATAAAAATCT AATCTCTTAT TAATGATGAT 301 GCCTTGGTAT AGAAGAAGGC TTGATGAATA AAAGTAATGG GATTGATGAT GCCTTGGTAT 361 AGAGAAGGCC TGATGATTTA CAGAATGATA TTAGTGGATC GGAGTGTCAC GTACCGACAC 421 ATGCAGGGGA TCGGGTGTCA CGAACCGACA CGTAGAATTA GGGGATCGGG TGTCACGAAC 481 TGGCACGTAG AATTAGGGGA TCGGGTGTCA CGAACCGGCA CGTAGATTAG GGGATCAGGT 541 GTCACGAACC GACACGTAGA ATTAGGGGAT CGGGTGTCAC GAACCGACAC GTAGAATTAG 601 GGGAT Predicted gene structure (within gDNA segment 19663 to 16277): Exon 1 17355 16808 ( 548 n); cDNA 73 605 ( 533 n); score: 0.849 MATCH C06HBa0153O03.1-1- SGN-E347579+ 0.849 548 0.906 C PGS_C06HBa0153O03.1-1-_SGN-E347579+ (17355 16808) Alignment (genomic DNA sequence = upper lines): AAATTGCTTG CTTGATCACG AATCTTCGGT GGAAGTAGGT TATGGTTTTT ATACTATTCG 17296 || ||||||| ||||||||| |||||||||| ||| |||||| ||||||||| || || ||| AATTTGCTTG CTTGATCACT AATCTTCGGT GGAGGTAGGT TATGGTTTTC ATGCT-TTCA 131 TAGTTAACTC TTAATAGCGA ATGATATGTG TTGGGTTGTA TTGTAAAGTC TTCTATATGC 17236 |||| ||||| ||||||| || ||||||||| || ||| ||| ||||||| | | |||||||| TAGTAAACTC TTAATAGAGA ATGATATGTA TT-GGTAGTA TTGTAAACCC TGCTATATGC 190 TTAATTGTAT GCTTGCATGA ATATGATTAT ATAATTGTGA TGA-AATAAG CATGATGAAG 17177 |||||||||| || ||||||| | ||||||| |||||||||| | | | |||| |||||||||| TTAATTGTAT GCATGCATGA ACGTGATTAT ATAATTGTGA TTATATTAAG CATGATGAAG 250 CTATTGAATC CCAAATCTTG AAAACTCCAA TCTTGAAAAC CCCTTGTTAT TGATGATGCC 17117 ||||||||| |||||||||| | || || || | || | ||| ||| |||||||||| TTATTGAATC CCAAATCTTG ATAA---AAA TC-T---AAT CTCTTATTAA TGATGATGCC 303 TTGGTATAAA AGAAGGCTTG ATGAACTAAA GTAATGAGAT TGATGATGCC TTGGTATAAA 17057 |||||||| | |||||||||| ||||| ||| |||||| ||| |||||||||| ||||||| | TTGGTATAGA AGAAGGCTTG ATGAATAAAA GTAATGGGAT TGATGATGCC TTGGTAT-AG 362 AGAAGGCTTG ATGAATTAAT AGAATGAGAT TAGTGGAGTA G-GTGTCACG AACCGACACA 16998 ||||||| || ||| ||| | ||||||| || ||||||| | |||||||| ||||||||| AGAAGGCCTG ATG-ATTTAC AGAATGATAT TAGTGGATCG GAGTGTCACG TACCGACACA 421 TAGAATTAGG GGATCGGGTG CCACGAACCG ACACGTAGAA TTAAGGGATC GGGTGTCACG 16938 | | ||| |||||||||| ||||||||| |||||||||| ||| |||||| |||||||||| T-G---CAGG GGATCGGGTG TCACGAACCG ACACGTAGAA TTAGGGGATC GGGTGTCACG 477 AACCGACACG TAGAATTAGG GAATTGGGTG TCACAAACTG ACACGTAGAA TTAGGGGATC 16878 ||| | |||| |||||||||| | || ||||| |||| ||| | ||||||| | |||||||||| AACTGGCACG TAGAATTAGG GGATCGGGTG TCACGAACCG GCACGTAG-A TTAGGGGATC 536 GGGTGTCACG AATCGACACG TAGAACTAGG GAATCGGAGT GTCACGTACC GACAC-AAGA 16819 ||||||||| || ||||||| ||||| |||| | ||||| || |||||| ||| ||||| ||| AGGTGTCACG AACCGACACG TAGAATTAGG GGATCGG-GT GTCACGAACC GACACGTAGA 595 GTAAAGGTGA T 16808 | ||| || | AT-TAGGGGA T 605 hqPGS_C06HBa0153O03.1-1-_SGN-E347579+ (17355 16808) ******************************************************************************** EST sequence 209 +strand 717 n (File: SGN-E349726+) 1 TAGTCTTCGG TGGAGGTAGG TTATTGTTTC TCTTACGATA TTCGTAGTAA ACTCTTAATA 61 GAGAATGATA TGTATTGATA ATATTGTAAA CCCTGCTATG TGCTTAATTG TATGCTTGCA 121 TGAATGTAAC TATATAATTG TTATTATATA AGCATGATGA AGTTATTGAA TCCCAAATCT 181 TGTAAAAACC TAATCTCTTT TTAATGATGA TGCCTTGGTA AGGGAGAAGG CTTGATGAAC 241 TAAAGTAATG AGATTGATGA TGCCTTGGTA AGGGAGAAGG CTTGATGAAT TGATAGAATG 301 AGATTAGGGG ATCGGGTGTC ACGAACTGAC ACGTAGAATT AGGGGATCGG GTGTCACAAA 361 CCGACACGTA GATTAAGGGA TCAGGTGTCA CGAACCAACA CATAGATTAG GGGATCGGGT 421 GTCACGAACC GACACGTAGA TTTAGGGGAT CGGGTGTCAC GAACCGACAC GTAGATTTAG 481 GGGATCGGGT GTCACGAACC GACACGTAGA TTTAGGGGAT CAGGTGTCAC GAACCGACAC 541 GTAGATTTAG GGGATCGGAG TGTCACGTTC CGACACGTAG ATTTAGGGGA TCAGAGTATC 601 ACGTACCGAC ACAAGAAGAT TAATGAATAT GANGGAGCGG AATGTCACGT ACCGACACAA 661 GAGAAATAAA GACAATGAAT CTTGAAAGAT GTTATATACT CAATCTAATG AACATAA Predicted gene structure (within gDNA segment 19946 to 16120): Exon 1 17333 16823 ( 511 n); cDNA 4 505 ( 502 n); score: 0.850 MATCH C06HBa0153O03.1-1- SGN-E349726+ 0.850 511 0.713 C PGS_C06HBa0153O03.1-1-_SGN-E349726+ (17333 16823) Alignment (genomic DNA sequence = upper lines): TCTTCGGTGG AAGTAGGTTA TGGTTT-TTA TAC--TATTC GTAGTTAACT CTTAATAGCG 17277 |||||||||| | |||||||| | |||| | ||| ||||| ||||| |||| |||||||| | TCTTCGGTGG AGGTAGGTTA TTGTTTCTCT TACGATATTC GTAGTAAACT CTTAATAGAG 63 AATGATATGT GTTGGGTTGT ATTGTAAAGT CTTCTATATG CTTAATTGTA TGCTTGCATG 17217 |||||||||| || | | | |||||||| || |||| || |||||||||| |||||||||| AATGATATGT ATT-GATAAT ATTGTAAACC CTGCTATGTG CTTAATTGTA TGCTTGCATG 122 AATATGATTA TATAATTGTG ATGAAATAAG CATGATGAAG CTATTGAATC CCAAATCTTG 17157 ||| | | || ||||||||| || | ||||| |||||||||| ||||||||| |||||||||| AATGTAACTA TATAATTGTT ATTATATAAG CATGATGAAG TTATTGAATC CCAAATCTTG 182 AAAACTCCAA TCTTGAAAAC CCCTTGTTAT TGATGATGCC TTGGTATAAA AGAAGGCTTG 17097 ||| || | | || | ||| ||| |||||||||| |||||| |||||||||| TAAA----AA -CCT---AAT CTCTTTTTAA TGATGATGCC TTGGTAAGGG AGAAGGCTTG 234 ATGAACTAAA GTAATGAGAT TGATGATGCC TTGGTATAAA AGAAGGCTTG ATGAATTAAT 17037 |||||||||| |||||||||| |||||||||| |||||| |||||||||| ||||||| || ATGAACTAAA GTAATGAGAT TGATGATGCC TTGGTAAGGG AGAAGGCTTG ATGAATTGAT 294 AGAATGAGAT TAGTGGAGTA GGTGTCACGA ACCGACACAT AGAATTAGGG GATCGGGTGC 16977 |||||||||| ||| ||| |||||||||| || ||||| | |||||||||| ||||||||| AGAATGAGAT TAGGGGATCG GGTGTCACGA ACTGACACGT AGAATTAGGG GATCGGGTGT 354 CACGAACCGA CACGTAGAAT TAAGGGATCG GGTGTCACGA ACCGACACGT AGAATTAGGG 16917 ||| |||||| ||||||| || ||||||||| |||||||||| ||| |||| | || ||||||| CACAAACCGA CACGTAG-AT TAAGGGATCA GGTGTCACGA ACCAACACAT AG-ATTAGGG 412 AATTGGGTGT CACAAACTGA CACGTAGAAT TAGGGGATCG GGTGTCACGA ATCGACACGT 16857 || |||||| ||| ||| || |||||||| | |||||||||| |||||||||| | |||||||| GATCGGGTGT CACGAACCGA CACGTAGATT TAGGGGATCG GGTGTCACGA ACCGACACGT 472 AGAACTAGGG AATCGGAGTG TCACGTACCG ACAC 16823 ||| ||||| ||||| ||| ||||| |||| |||| AGATTTAGGG GATCGG-GTG TCACGAACCG ACAC 505 hqPGS_C06HBa0153O03.1-1-_SGN-E349726+ (17333 16823) ******************************************************************************** EST sequence 202 +strand 402 n (File: SGN-E357559+) 1 TAGTCTTCGG TGGAGGTAGG TTATTGTTTC TCTTACGATA TTCGTAGTAA ACTCTTAATA 61 GAGAATGATA TGTATTGATA ATATTGTAAA CCCTGCTATG TGCTTAATTG TATGCTTGCA 121 TGAATGTAAC TATATAATTG TTATTATATA AGCATGATGA AGTTATTGAA TCCCAAATCT 181 TGTAAAAACC TAATCTCTTT TTAATGATGA TGCCTTGGTA AGGGAGAAGG CTTGATGAAC 241 TAAAGTAATG AGATTGATGA TGCCTTGGTA AGGGAGAAGG CTTGATGAAT TGATAGAATG 301 AGATTAGGGG ATCGGGTGTC ACGAACTGAC ACGTAGAATT AGGGGATCGG GTGTCACAAA 361 CCGACACGTA GATTAAGGGA TCAGGTGTCA CGAACCAACA CA Predicted gene structure (within gDNA segment 19946 to 15727): Exon 1 17333 16929 ( 405 n); cDNA 4 401 ( 398 n); score: 0.843 MATCH C06HBa0153O03.1-1- SGN-E357559+ 0.843 405 1.007 C PGS_C06HBa0153O03.1-1-_SGN-E357559+ (17333 16929) Alignment (genomic DNA sequence = upper lines): TCTTCGGTGG AAGTAGGTTA TGGTTT-TTA TAC--TATTC GTAGTTAACT CTTAATAGCG 17277 |||||||||| | |||||||| | |||| | ||| ||||| ||||| |||| |||||||| | TCTTCGGTGG AGGTAGGTTA TTGTTTCTCT TACGATATTC GTAGTAAACT CTTAATAGAG 63 AATGATATGT GTTGGGTTGT ATTGTAAAGT CTTCTATATG CTTAATTGTA TGCTTGCATG 17217 |||||||||| || | | | |||||||| || |||| || |||||||||| |||||||||| AATGATATGT ATT-GATAAT ATTGTAAACC CTGCTATGTG CTTAATTGTA TGCTTGCATG 122 AATATGATTA TATAATTGTG ATGAAATAAG CATGATGAAG CTATTGAATC CCAAATCTTG 17157 ||| | | || ||||||||| || | ||||| |||||||||| ||||||||| |||||||||| AATGTAACTA TATAATTGTT ATTATATAAG CATGATGAAG TTATTGAATC CCAAATCTTG 182 AAAACTCCAA TCTTGAAAAC CCCTTGTTAT TGATGATGCC TTGGTATAAA AGAAGGCTTG 17097 ||| || | | || | ||| ||| |||||||||| |||||| |||||||||| TAAA----AA -CCT---AAT CTCTTTTTAA TGATGATGCC TTGGTAAGGG AGAAGGCTTG 234 ATGAACTAAA GTAATGAGAT TGATGATGCC TTGGTATAAA AGAAGGCTTG ATGAATTAAT 17037 |||||||||| |||||||||| |||||||||| |||||| |||||||||| ||||||| || ATGAACTAAA GTAATGAGAT TGATGATGCC TTGGTAAGGG AGAAGGCTTG ATGAATTGAT 294 AGAATGAGAT TAGTGGAGTA GGTGTCACGA ACCGACACAT AGAATTAGGG GATCGGGTGC 16977 |||||||||| ||| ||| |||||||||| || ||||| | |||||||||| ||||||||| AGAATGAGAT TAGGGGATCG GGTGTCACGA ACTGACACGT AGAATTAGGG GATCGGGTGT 354 CACGAACCGA CACGTAGAAT TAAGGGATCG GGTGTCACGA ACCGACAC 16929 ||| |||||| ||||||| || ||||||||| |||||||||| ||| |||| CACAAACCGA CACGTAG-AT TAAGGGATCA GGTGTCACGA ACCAACAC 401 hqPGS_C06HBa0153O03.1-1-_SGN-E357559+ (17333 16929) ******************************************************************************** EST sequence 30 -strand 792 n (File: SGN-E540167-) 1 GAACCCATTT TTAGGGTTTT TTTTTTTTTT TTTGAATGGA GGGAAAAGTA GTACTTTGTG 61 CTTTAGGGAT GAATTTTGAT TTTTAATAGA CCCCACAAAA ATGGAAAAGA GTTTTTATAG 121 GAAAAAAAAA TTGATGTGAG AGTTTCTGTA TCTTCTTGGG AAGTGTGTTT GGACTGAAAG 181 GGCCCATTAA GCTTCATGGG CCCAATACTA TGTTGTGGAT CCCACTTCAG ACAATTATTC 241 TTTTTTTTTT TAAATAAAAA TAACACTTTA GTTGTTTAAA AAATAGAAAA CAAATTTGTT 301 GAAATAAATA GGGAAAATGC ACTGCCTCTC AAACTATGCT CGAAATTCCA GAGACACACT 361 TATACTATAC TAAAGTCCTA TTACCCTCTG AACTTATTTT ATAAATAATT TTCTACTCTT 421 TTTCGACTTA CGTGACACTA ATTTGAAAAA AAAGTCAATC AGTGTTGGAC CCACAAGATA 481 GTGCCACGTA GGCTGAAAAG AAGTAGAAAA ATTGTTAATA AAATAAATTC AGGGGATAAT 541 AAGACATTAG TATAGTATAA GTGTGTCTTT GAGATTTCGT GCATAGGTTG AATGGTTACC 601 TGTGCATTAT CCCAAACAAA AACATAGTAT TTTTTTTTTT GGTTTTGCAT TTGTATGAAT 661 TTAAGGGGAT GAGAGAAAAT AATGCATATA TGGAGATTTT GTATTTGTGA GTGTATGAAA 721 AGATATTTAA GGTTATAGTT ATGAAATTAA AGCACAAGCT ATTATAATAA AAAAAAAAAA 781 AAAAAAAAGA AA Predicted gene structure (within gDNA segment 22354 to 14486): Exon 1 19206 19143 ( 64 n); cDNA 237 294 ( 58 n); score: 0.719 Intron 1 19142 18332 ( 811 n); Pd: 0.000 (s: 0.70), Pa: 0.000 (s: 0.68) Exon 2 18331 18001 ( 331 n); cDNA 295 624 ( 330 n); score: 0.796 Intron 2 18000 17923 ( 78 n); Pd: 0.000 (s: 0.72), Pa: 0.984 (s: 0) Exon 3 17922 17910 ( 13 n); cDNA 625 637 ( 13 n); score: 0.692 Intron 3 17909 16203 (1707 n); Pd: 0.000 (s: 0), Pa: 0.995 (s: 0) Exon 4 16202 16196 ( 7 n); cDNA 638 644 ( 7 n); score: 0.857 PPA cDNA 769 792 MATCH C06HBa0153O03.1-1- SGN-E540167- 0.784 415 0.524 C PGS_C06HBa0153O03.1-1-_SGN-E540167- (19206 19143,18331 18001,17922 17910,16202 16196) Alignment (genomic DNA sequence = upper lines): ATTTTTTATT TTATCTAATA AGAAAATGAC AAATAATATA TTTTTAAAAA ATAAATAAAA 19147 ||| ||| || || | |||| | |||| || | | | | | |||||||| || | |||| ATTCTTTTTT TTTTTAAATA A--AAATAAC ACTTTA-GT- TGTTTAAAAA AT--AGAAAA 290 CAAAATAAAC TTTGGTTGTT AGTCATAAAA ATATAAGTTA TTCAAAAAGG TGAATGAAAG 19087 |||| CAAA...... .......... .......... .......... .......... .......... 294 AGTATAAGTG AGTCAAAAAG ATGAGTGAAG AGGCATAGCT AAGCCAAAAA AGTGAATGTA 19027 .......... .......... .......... .......... .......... .......... 294 AGGGTATTTC TAGACCAAAA GATTGATGAA GGATATTTTT AGACATAGTT CAAGGATAGT 18967 .......... .......... .......... .......... .......... .......... 294 TTTGGTCCTT TTTCGTTTAA ATAATCTCAT ATTTAGATAT TGAGTTATTT GCAGGGACGA 18907 .......... .......... .......... .......... .......... .......... 294 TTCAATATAA TTCGAGGCCT AAATTTTAAA TAAACTCTAT CTGTATTTAT TTATTTTTCT 18847 .......... .......... .......... .......... .......... .......... 294 TTTTAGATGT AAATTGTATT TATTTATTTT TCTTTTTAGA TGTAAGTTAT TACTTAAGTA 18787 .......... .......... .......... .......... .......... .......... 294 TCTTTTTTTG TAAATGGAAA AGGGCTAAAA ATGCCCTTAA CTTAGTGGAA ATGGTTCAAA 18727 .......... .......... .......... .......... .......... .......... 294 ATACCATCCT TCTACCTTTT GAGTTAAAAA TACCCTCCAC CTTTATTTTG GTTCAAAGAT 18667 .......... .......... .......... .......... .......... .......... 294 GCCTTTCCTT CCACCTTTTG ATTTAATAAT ACTCTTAACC CCCCATTTAA TTAAATTTAT 18607 .......... .......... .......... .......... .......... .......... 294 AAAATAAAAA ATTCTTAATA TTAGCTCATT CCAAAATCTT TATGATAAAT ATATCTAAAA 18547 .......... .......... .......... .......... .......... .......... 294 AATAAAATAA AAAATTTATT ATATGTATAA AAAGCAAAAA TAAAAATAAA ATTTCTCAAA 18487 .......... .......... .......... .......... .......... .......... 294 GTTCTTATTC TTTGTATTAA AATAATAAGA CAATAAAAAT CTTAAGATTC TTATTCTTCA 18427 .......... .......... .......... .......... .......... .......... 294 TTTTTGCGCA AAAAAATCTT TATTTTATTT TATGTTTTAT ACATATTATT TAATATTTTA 18367 .......... .......... .......... .......... .......... .......... 294 ATTTGTGAGA AATTTTTTTA AGTTATTTGG ATTAAATTTT TAAATTATAT TGAGAAAATG 18307 || | | || || || | ||||||| .......... .......... .......... .....TTTGT TGAAATAAAT AGGGAAAATG 319 CACAAGTATT CCCTCAAACT ATGTCTGAAA TCCCAGAGAC ACACTTATAC TATATTAAGG 18247 ||| | | | ||||||| ||| |||| | |||||||| |||||||||| |||| ||| | CAC-TGCCT- C--TCAAACT ATGCTCGAAA TTCCAGAGAC ACACTTATAC TATACTAAAG 375 TCATATTACC CCCTGAACTT ATTTTATAAG TAATTTTCTA CCCCTTTT-G ACCTACGTGG 18188 || ||||||| | |||||||| ||||||||| |||||||||| | | |||| | || |||||| TCCTATTACC CTCTGAACTT ATTTTATAAA TAATTTTCTA CTCTTTTTCG ACTTACGTGA 435 CTCTAGCTTG AAAAAAAAGT CAATCAGCGT TGGACCCACA AGATAGTGCC ACATAGACCG 18128 | ||| ||| |||||||||| ||||||| || |||||||||| |||||||||| || ||| | | CACTAATTTG AAAAAAAAGT CAATCAGTGT TGGACCCACA AGATAGTGCC ACGTAGGCTG 495 AAAAGGGCTA G-AAAATTAT TAATAAAATA AGTTCA-GGG ATAATAGGAC CTTAGTATAG 18070 ||||| || | |||||| | |||||||||| | |||| ||| |||||| ||| ||||||||| AAAAGAAGTA GAAAAATTGT TAATAAAATA AATTCAGGGG ATAATAAGAC ATTAGTATAG 555 TGTAAGTATG ACTTTAAAAT TTCAGGCATA AATTGAGAGG GTACTTGTGC ATTATCTCAA 18010 | ||||| || |||| | || ||| ||||| |||| || ||| ||||| |||||| ||| TATAAGTGTG TCTTTGAGAT TTCGTGCATA GGTTGAATGG TTACCTGTGC ATTATCCCAA 615 TAATATTCAA ATCTTTACAT TAATATCTAA TTTGATGTAA TATTTTAATA ATAATAATGT 17950 | | || ACAAAAACA. .......... .......... .......... .......... .......... 624 AACGACCTAT TTAGTCGTTT TGAGCAGCAG ATTTTATTTT TGGAAAAACT GGCTGAGACG 17890 || ||| |||| .......... .......... .......TAG TATTTTTTTT .......... .......... 637 ACGGATCCCA CGATGGACCG TCATGGGCAC GATGGACCGT CGAGGGGGTC TCGTTCCAAA 17830 .......... .......... .......... .......... .......... .......... 637 ATACATAGAA TTCTGAAATT TGGGTTTTGA AATCGACTCT CTGAACTTCG TGATGAAGTG 17770 .......... .......... .......... .......... .......... .......... 637 GCAGGACGGA CCGTCACAGG CATGACGGGC CGTCACAGTC TCTTCAGAAA ATTTCAGTCT 17710 .......... .......... .......... .......... .......... .......... 637 CTGAACTCTG TGACGGAAGC AGCAGGACGG ACCGTCGCAG GCACGACGAC CCGTCACAGA 17650 .......... .......... .......... .......... .......... .......... 637 CTGCGTAATC CCAGGCTGAG TCGGATTTCT TTAAATGTTT TAAGGGGGCG TTTTGGACTA 17590 .......... .......... .......... .......... .......... .......... 637 TTCCTGCTAT AATTATAAAT TTAGTGGGTT AATGTTAATA ATTTAACTAC TTGAGGGTTA 17530 .......... .......... .......... .......... .......... .......... 637 AAAGAGATAA CCTTGAATTA GTTAGTGGGT TAAACTCATC ATCTTTCATA CTTAATTATA 17470 .......... .......... .......... .......... .......... .......... 637 TGCTAATTAG GGTAAAAGAA AGAAGGTTTG AATAAGAAAA AGAAAAGAAC AGAAAGAGAG 17410 .......... .......... .......... .......... .......... .......... 637 GGAGAAACGA TCGAGAGAGA GAGAGGAATG AAGAGGAAAG CAAAGATCTT GAGGAAATTG 17350 .......... .......... .......... .......... .......... .......... 637 CTTGCTTGAT CACGAATCTT CGGTGGAAGT AGGTTATGGT TTTTATACTA TTCGTAGTTA 17290 .......... .......... .......... .......... .......... .......... 637 ACTCTTAATA GCGAATGATA TGTGTTGGGT TGTATTGTAA AGTCTTCTAT ATGCTTAATT 17230 .......... .......... .......... .......... .......... .......... 637 GTATGCTTGC ATGAATATGA TTATATAATT GTGATGAAAT AAGCATGATG AAGCTATTGA 17170 .......... .......... .......... .......... .......... .......... 637 ATCCCAAATC TTGAAAACTC CAATCTTGAA AACCCCTTGT TATTGATGAT GCCTTGGTAT 17110 .......... .......... .......... .......... .......... .......... 637 AAAAGAAGGC TTGATGAACT AAAGTAATGA GATTGATGAT GCCTTGGTAT AAAAGAAGGC 17050 .......... .......... .......... .......... .......... .......... 637 TTGATGAATT AATAGAATGA GATTAGTGGA GTAGGTGTCA CGAACCGACA CATAGAATTA 16990 .......... .......... .......... .......... .......... .......... 637 GGGGATCGGG TGCCACGAAC CGACACGTAG AATTAAGGGA TCGGGTGTCA CGAACCGACA 16930 .......... .......... .......... .......... .......... .......... 637 CGTAGAATTA GGGAATTGGG TGTCACAAAC TGACACGTAG AATTAGGGGA TCGGGTGTCA 16870 .......... .......... .......... .......... .......... .......... 637 CGAATCGACA CGTAGAACTA GGGAATCGGA GTGTCACGTA CCGACACAAG AGTAAAGGTG 16810 .......... .......... .......... .......... .......... .......... 637 ATGAATCTTG AAAGATGTTA ATATACTCAA TCTAATGAAC CTAAGTCCCA AATGAGTATG 16750 .......... .......... .......... .......... .......... .......... 637 GTATTGAGGC TTGAGTCCTC ATGAGTGTAC TTGACGTTAT TTATCAAAGA TTCTTGTACT 16690 .......... .......... .......... .......... .......... .......... 637 TGTTGCTACA TGTTGAGTAA TGTAGTTGAT TTTATATTAT TACTTGATAT ATATTGTTTT 16630 .......... .......... .......... .......... .......... .......... 637 CTATTTTGAG TTGGCCGATG ATATCTACTC AGTACCCATG TTTTGTACTG ACCCCTACTT 16570 .......... .......... .......... .......... .......... .......... 637 GTATGTTTCT TTCCTTGTTA TTTGTGGAGT GCAGCAAACG TGCCGTCGTC TTCAACTCAA 16510 .......... .......... .......... .......... .......... .......... 637 CCGCAACTCT AGCAAGACTT CATAACACCG GATTTCAGGG TGAGCTACAC TTCTAGCTTG 16450 .......... .......... .......... .......... .......... .......... 637 AACTGGATCT TCTTGTTCAT GTCTTGATGC CTTGAAGTTC CAGCATGGAC TAGCTTTTTA 16390 .......... .......... .......... .......... .......... .......... 637 TTTATTCTAG CTTTCTAGAT ACTCTTAGCT TTAGTAATTT GAGGATAGAT GTTCTTGTGA 16330 .......... .......... .......... .......... .......... .......... 637 TGATGACTTC CAGATTTTGG GGATAATGAT AAGTTTGAGT TTTAGAAAGT GATTATTGAT 16270 .......... .......... .......... .......... .......... .......... 637 TTTCATTAAT GAGTTTAAGT CTTCCGCATT ATATTATGTT AATTATGTTT GAAATGTTGG 16210 .......... .......... .......... .......... .......... .......... 637 GGTTCAGATT GGTT 16196 || |||| .......TTT GGTT 644 hqPGS_C06HBa0153O03.1-1-_SGN-E540167- (19206 19143,18331 18001) ******************************************************************************** EST sequence 11 -strand 726 n (File: SGN-E550322-) 1 GTTTTTTTTT TTTTTTTATG AATTAGCTCA ATGAAAAATG AGTAAATTTT TTATATTTTA 61 TGGCATAATT TTTTCATTAA TTCATGGTTG AGAAAATCTT TGTTTCTAAT AGTGTTATAA 121 ATCGTAAAAT AATAATAATA TTTAAGTAGG AAGAACATAA ATGTAACGAC CTGTTTAGTC 181 GTTTTGAGTA GCAGATTTTA TTTTTGGAAA AACAGGTTGA GACGACGGAA CCCACGACGG 241 ACCGTCATGA GCACGATGGA CCGTCGAGGA GTCTCGTTTC AAAACACTTA GAAATTCTGA 301 AATTGGGTAC TAAAAATCGA CTCTCTGAAC TTCGTAACGG AATGGCACGA CGGACCGTCA 361 CGGGCGTGAC GGACCGTCAC AGACTCTTTG GTGGAAATTG AGTCTCTGAA CCTTGCGACG 421 ACCTGCAGGA CGGACCGTCG CAGGCACGAC GGGCCATCAC AGGTTGCGTA ATCCCAGTCT 481 GGGTCGGATT TCTTTACACG TTTTAAGGGA CGTTTTGGAC TATTCCTACT TTAATTATAA 541 AGTTAGTGGG TTTATGTTAA TAAGTCTAAT TACCTGGGGG TTAAAAGAGG TAACCTTGAG 601 TAAATTAGTG GGTTATTATT CCATCTTTTA TTCTTAATTA TATGCTAATT AGGGTAAAAG 661 AAGGAGGGTT TGAATAAGAA AAAGAAAAGA ACAGAAAGAG AGAGAAGGAG AATCGATCTG 721 GTGCGA Predicted gene structure (within gDNA segment 24409 to 16570): Exon 1 21799 21794 ( 6 n); cDNA 87 92 ( 6 n); score: 0.833 Intron 1 21793 18575 (3219 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.60) Exon 2 18574 18514 ( 61 n); cDNA 93 154 ( 62 n); score: 0.607 Intron 2 18513 17960 ( 554 n); Pd: 0.845 (s: 0.56), Pa: 0.000 (s: 0.90) Exon 3 17959 17386 ( 574 n); cDNA 155 726 ( 572 n); score: 0.834 PPA cDNA 19 2 MATCH C06HBa0153O03.1-1- SGN-E550322- 0.812 641 0.883 C PGS_C06HBa0153O03.1-1-_SGN-E550322- (21799 21794,18574 18514,17959 17386) Alignment (genomic DNA sequence = upper lines): GTTGGGGTGG TAATTCCTCA TCTTGCACCT GTTTATTTTC CCCTTCCTCA CCCTCTCTTA 21740 |||| | GTTGAG.... .......... .......... .......... .......... .......... 92 CTACCTCATC AGTCGGTGGA GGAGTCACCA CCCTATTACT AGCTTGACCA GGTGTTTGTC 21680 .......... .......... .......... .......... .......... .......... 92 CTCCACCTCT AGAGATCGTC CTCTTGCGAC CTCTACCACG ACCTCTTGCC ACTGCTCCTC 21620 .......... .......... .......... .......... .......... .......... 92 CTCGAGCTAC AACCCCAATG TTTGGCTCAG ACGCACGCTA TCTTGCCGGT GTTGGTGTTG 21560 .......... .......... .......... .......... .......... .......... 92 GCACAGTTGT TTCTCTAGTT CTAACCATAT GCGAAATAGA GTGAGGATGT CAGATACCAA 21500 .......... .......... .......... .......... .......... .......... 92 TTTGTATCAC CTAGATACCA CTTGGATCCA AGTAATAGCA CGAAAGAAGG AAAGAATGGA 21440 .......... .......... .......... .......... .......... .......... 92 ATTTTGCTAA AGTCCTATAG CCTCTCGAAG AAAAGTAAGG GCGTCCCCCT ACCGTTCCTC 21380 .......... .......... .......... .......... .......... .......... 92 AAGACTCTAC TAGACTTGTT CTTGTGTGAT GAGACCAACG AACCTAACGC TCTGATACCA 21320 .......... .......... .......... .......... .......... .......... 92 AGTTTGTCAC GACCCAAAAC GATCCGTAAG TGGCACCCAC CCTTACTCTC CTAGGTGAGC 21260 .......... .......... .......... .......... .......... .......... 92 GAACCAACAA ATCTAAACCC CAACATTTAC CAGTATATCA ACTATAAATA ATATAAATAA 21200 .......... .......... .......... .......... .......... .......... 92 TGCGGAAGCT CCAAAACTCA TTACGAAATT AATTAAATCA ACATCTAAAG TTAAATACTT 21140 .......... .......... .......... .......... .......... .......... 92 ATTATTCCCA AAATCTGTAA GTCATCACAC CAAGAACATC TATCCTCGAA TTTCTAAATC 21080 .......... .......... .......... .......... .......... .......... 92 TAAGAGTATT CAAGAAGCTA AAAATAGTAA AAAGATGGTC CATGTCCGAA CTTCAAGACA 21020 .......... .......... .......... .......... .......... .......... 92 TCAAGACGTG AAGGAGAGAA TCCAGCACGA GCTAGGAATA ATAGCTCACC CTGAATTCTG 20960 .......... .......... .......... .......... .......... .......... 92 ATATGCTAAA GACCGGCTAG ATCTGATGAC GAGTCGAAGT CGATGGCACG CTTGCTGCAC 20900 .......... .......... .......... .......... .......... .......... 92 TCCACAAATA ACAAAGAAGA AAATTACAAG TAGGGGTCAG TACAAGGAAC ACGTACTGAG 20840 .......... .......... .......... .......... .......... .......... 92 TAGGTATCAT CGGCCAACTC AAAATAGAAA ACAATATATA CTGAATAATA ATATAAAATC 20780 .......... .......... .......... .......... .......... .......... 92 AACCATAATA CTTAACAGGT GACAATCAAC AAGTATAAGA ACCATTGACA ACAACAGCAA 20720 .......... .......... .......... .......... .......... .......... 92 GCACATCTAT GAGGACTCAA GCCTCCACAC CATACTCATT TGGGAAATAG GTTCTTTGAA 20660 .......... .......... .......... .......... .......... .......... 92 TTTGAGTACA TTAACATAAT TCAAGATTCA TTCTCTTTAT CATTATCGTG TCGGAACGTT 20600 .......... .......... .......... .......... .......... .......... 92 ACACCCGATC CCCTACTACT ACCGTGTCGG AACGTGACAC TCTGATCCCC TAATACTACC 20540 .......... .......... .......... .......... .......... .......... 92 GTGTCAGAAC GTGACACCCG ATCCCCTAAT ACTACCGTGT CAGAATGTGA CACTCCGATC 20480 .......... .......... .......... .......... .......... .......... 92 CCCTAATACT ACCGTGTCGA AACATGACAC CCAATCCATT TATCTCATTA TTTTAGTTCA 20420 .......... .......... .......... .......... .......... .......... 92 TCAAGCCTTC TTTATGTCAA GGCGCCATCT TAATAGAGAG GATTTAAGAT TGAAGATTCA 20360 .......... .......... .......... .......... .......... .......... 92 ACAGTTTCAT CATTCTGACC ACCACAATTA CACAATCACA ACATACAAAC ACACAATCAA 20300 .......... .......... .......... .......... .......... .......... 92 GCATATAGAA GACTTTACAA TACCACCCAA TACATATCGA TCACTATTTA GAGTTTATCT 20240 .......... .......... .......... .......... .......... .......... 92 ATCATATATA AATAAATCAT AACCTACCTC CACTGAAGAA TCGTGATCAA GCAAGCTACC 20180 .......... .......... .......... .......... .......... .......... 92 TTCCCAATGC CTTTGCTTTC CTCTTCGTTC TCTCTTTCTC GCTCGTTCTC CCTCTGTGTT 20120 .......... .......... .......... .......... .......... .......... 92 TCTTTTTATT TTTCTTACTC AAAATCTTGT TCTTTTACCC TAAATGTCAT ATAATCAATT 20060 .......... .......... .......... .......... .......... .......... 92 ATAAAAGATG ATAAAAGTAC CTCACTATTT ATTCCCTTAT TAACTTCTTT AACCCCCAAG 20000 .......... .......... .......... .......... .......... .......... 92 TAAATAAATT ATTAAACTTA CCCCACTAAT TCCATAATTA TAATCATGAA TAGTCCAAAA 19940 .......... .......... .......... .......... .......... .......... 92 CACCCCTTTA AAACTTTTAG CAGAAATCCG ACCCAGTCGA GGTTACGCAG CTTGTGACGG 19880 .......... .......... .......... .......... .......... .......... 92 TCCGTTGTGT CTACGACGGT CCGTGCTGTA GTTCCGTCGC GGAGTTCAGA GAGTCGCTCC 19820 .......... .......... .......... .......... .......... .......... 92 CAGTACCCAG ATTTTCAGAG TTGAAGTGTT TTGGAACGGA GACGCTCGAC GGACCGTCGT 19760 .......... .......... .......... .......... .......... .......... 92 GCCTGTGACG GTTTGTCCTA CCTGCCGTCG AGGGTAATGA GGAGAGCAAC AGAAGAAATT 19700 .......... .......... .......... .......... .......... .......... 92 ACACAAGTAT GGGACGACGG AGTCCATCAC GGTCCATCGT GACCATGACG GTCCGTCGTG 19640 .......... .......... .......... .......... .......... .......... 92 ACCATGACGG TCCGTCGCGT GATCCGTCGA CCCAGTCAGT TTTTTATCAA AAATAGTTCT 19580 .......... .......... .......... .......... .......... .......... 92 ACTGCTCGAA CCGACTAAAC AGGTCATTAC AATTTTCCTA CTTTAGTTTT CCCTATGGCT 19520 .......... .......... .......... .......... .......... .......... 92 ACCACTGTCC ATCTACTATT TTTTTTCATG ATTTGATCTT TTAAGTAAAA TTATTTGTGG 19460 .......... .......... .......... .......... .......... .......... 92 AACTTCTCTT TGAAGATATC TCTCACAAAT TAGCGAAAAA GTTAGTTAAT TTATTTTTAT 19400 .......... .......... .......... .......... .......... .......... 92 TTTAAAAAGT AAGATAAATG TTTTTGCGCT ATCAATATTT TATAATATGT AAATTATTCG 19340 .......... .......... .......... .......... .......... .......... 92 AAACAAACTT TTAATAAATA AAAATTAGAG CAAATATAAA AATCACTAGA TTTTTTTTTA 19280 .......... .......... .......... .......... .......... .......... 92 AAAAAATGGG GCCTTGAAAC GGTATATATT TTTTTATTTG AATAGATTAT GGGGGAGAAT 19220 .......... .......... .......... .......... .......... .......... 92 TAATAGAGGT AAGATTTTTT ATTTTATCTA ATAAGAAAAT GACAAATAAT ATATTTTTAA 19160 .......... .......... .......... .......... .......... .......... 92 AAAATAAATA AAACAAAATA AACTTTGGTT GTTAGTCATA AAAATATAAG TTATTCAAAA 19100 .......... .......... .......... .......... .......... .......... 92 AGGTGAATGA AAGAGTATAA GTGAGTCAAA AAGATGAGTG AAGAGGCATA GCTAAGCCAA 19040 .......... .......... .......... .......... .......... .......... 92 AAAAGTGAAT GTAAGGGTAT TTCTAGACCA AAAGATTGAT GAAGGATATT TTTAGACATA 18980 .......... .......... .......... .......... .......... .......... 92 GTTCAAGGAT AGTTTTGGTC CTTTTTCGTT TAAATAATCT CATATTTAGA TATTGAGTTA 18920 .......... .......... .......... .......... .......... .......... 92 TTTGCAGGGA CGATTCAATA TAATTCGAGG CCTAAATTTT AAATAAACTC TATCTGTATT 18860 .......... .......... .......... .......... .......... .......... 92 TATTTATTTT TCTTTTTAGA TGTAAATTGT ATTTATTTAT TTTTCTTTTT AGATGTAAGT 18800 .......... .......... .......... .......... .......... .......... 92 TATTACTTAA GTATCTTTTT TTGTAAATGG AAAAGGGCTA AAAATGCCCT TAACTTAGTG 18740 .......... .......... .......... .......... .......... .......... 92 GAAATGGTTC AAAATACCAT CCTTCTACCT TTTGAGTTAA AAATACCCTC CACCTTTATT 18680 .......... .......... .......... .......... .......... .......... 92 TTGGTTCAAA GATGCCTTTC CTTCCACCTT TTGATTTAAT AATACTCTTA ACCCCCCATT 18620 .......... .......... .......... .......... .......... .......... 92 TAATTAAATT TATAAAATAA AAAATTCTTA ATATTAGCTC ATTCCAAAAT CTTTATGATA 18560 ||||| |||| | .......... .......... .......... .......... .....AAAAT CTTTGTTTCT 107 AATA-T-ATC TAAAAAATAA AATAAAAAAT TTATTATATG TATAAAAAGC AAAAATAAAA 18502 |||| | | |||| ||| ||||| || |||| || | || || | AATAGTGTTA TAAATCGTAA AATAATAATA ATATT-TAAG TAGGAAGA.. .......... 154 ATAAAATTTC TCAAAGTTCT TATTCTTTGT ATTAAAATAA TAAGACAATA AAAATCTTAA 18442 .......... .......... .......... .......... .......... .......... 154 GATTCTTATT CTTCATTTTT GCGCAAAAAA ATCTTTATTT TATTTTATGT TTTATACATA 18382 .......... .......... .......... .......... .......... .......... 154 TTATTTAATA TTTTAATTTG TGAGAAATTT TTTTAAGTTA TTTGGATTAA ATTTTTAAAT 18322 .......... .......... .......... .......... .......... .......... 154 TATATTGAGA AAATGCACAA GTATTCCCTC AAACTATGTC TGAAATCCCA GAGACACACT 18262 .......... .......... .......... .......... .......... .......... 154 TATACTATAT TAAGGTCATA TTACCCCCTG AACTTATTTT ATAAGTAATT TTCTACCCCT 18202 .......... .......... .......... .......... .......... .......... 154 TTTGACCTAC GTGGCTCTAG CTTGAAAAAA AAGTCAATCA GCGTTGGACC CACAAGATAG 18142 .......... .......... .......... .......... .......... .......... 154 TGCCACATAG ACCGAAAAGG GCTAGAAAAT TATTAATAAA ATAAGTTCAG GGATAATAGG 18082 .......... .......... .......... .......... .......... .......... 154 ACCTTAGTAT AGTGTAAGTA TGACTTTAAA ATTTCAGGCA TAAATTGAGA GGGTACTTGT 18022 .......... .......... .......... .......... .......... .......... 154 GCATTATCTC AATAATATTC AAATCTTTAC ATTAATATCT AATTTGATGT AATATTTTAA 17962 .......... .......... .......... .......... .......... .......... 154 TAATAATAAT GTAACGACCT ATTTAGTCGT TTTGAGCAGC AGATTTTATT TTTGGAAAAA 17902 | | ||| |||||||||| ||||||||| |||||| ||| |||||||||| |||||||||| ..ACATAAAT GTAACGACCT GTTTAGTCGT TTTGAGTAGC AGATTTTATT TTTGGAAAAA 212 CTGGCTGAGA CGACGGATCC CACGATGGAC CGTCATGGGC ACGATGGACC GTCGAGGGGG 17842 | || ||||| ||||||| || ||||| |||| ||||||| || |||||||||| ||||| || | CAGGTTGAGA CGACGGAACC CACGACGGAC CGTCATGAGC ACGATGGACC GTCGA-GGAG 271 TCTCGTTCCA AAATACATAG -AATTCTGAA ATTTGGGTTT TGAAATCGAC TCTCTGAACT 17783 ||||||| || ||| || ||| ||||||||| ||| || | |||||||| |||||||||| TCTCGTTTCA AAACACTTAG AAATTCTGAA ATTGGGTACT AAAAATCGAC TCTCTGAACT 331 TCGTGATGAA GTGGCAGGAC GGACCGTCAC AGGCATGACG GGCCGTCACA GTCTCTTCAG 17723 |||| | | | ||||| ||| |||||||||| ||| ||||| | |||||||| | ||||| | TCGTAACGGA ATGGCACGAC GGACCGTCAC GGGCGTGACG GACCGTCACA GACTCTTTGG 391 -AAAATTTCA GTCTCTGAAC TCTGTGACGG AAGCAGCAGG ACGGACCGTC GCAGGCACGA 17664 || || | |||||||||| || ||| | | | ||||| |||||||||| |||||||||| TGGAAATTGA GTCTCTGAAC CTTGCGAC-G -ACCTGCAGG ACGGACCGTC GCAGGCACGA 449 CGACCCGTCA CAGACTGCGT AATCCCAGGC TGAGTCGGAT TTCTTTAAAT GTTTTAAGGG 17604 || || ||| ||| ||||| |||||||| | || ||||||| ||||||| | ||||||| || CGGGCCATCA CAGGTTGCGT AATCCCAGTC TGGGTCGGAT TTCTTTACAC GTTTTAA-GG 508 GGCGTTTTGG ACTATTCCTG CTATAATTAT AAATTTAGTG GGTTAATGTT AATAA-TTTA 17545 | |||||||| ||||||||| || ||||||| ||| |||||| |||| ||||| ||||| | || GACGTTTTGG ACTATTCCTA CTTTAATTAT AAAGTTAGTG GGTTTATGTT AATAAGTCTA 568 ACTACTTGAG GGTTAAAAGA GATAACCTTG AATTAGTTAG TGGGTTAAAC TCATCATCTT 17485 | ||| || | |||||||||| | |||||||| | | | |||| ||||||| | |||||| ATTACCTGGG GGTTAAAAGA GGTAACCTTG AGTAAATTAG TGGGTTATTA T-TCCATCTT 627 TCATACTTAA TTATATGCTA ATTAGGGTAA AAGAAAGAAG GTTTGAATAA GAAAAAGAAA 17425 | || ||||| |||||||||| |||||||||| ||||| || | |||||||||| |||||||||| TTATTCTTAA TTATATGCTA ATTAGGGTAA AAGAAGGAGG GTTTGAATAA GAAAAAGAAA 687 AGAACAGAAA GAGAGGGAGA AACGATCGAG AGAGAGAGA 17386 |||||||||| ||||| || | ||||| | | || AGAACAGAAA GAGAGAGAAG GAGAATCGAT CTGGTGCGA 726 hqPGS_C06HBa0153O03.1-1-_SGN-E550322- (17959 17386) ******************************************************************************** EST sequence 48 -strand 649 n (File: SGN-E374999-) 1 ATTTTTTCAT TAATTCAGGG TGAAGAAAAT CTTTGTTTCT AATAGTGTTA TAAATCGTAA 61 AATAATAATA ATATTTAAGT AGGAAGAACA TAAATGTAAC GACCTGTTTA GTCGTTTGGA 121 GTAGCAGATT TTATTTTTGG AAAAACAGGT TGAGACGACG GAACCCACGA CGGACCGTCA 181 TGAGCACGAT GGACCGTCGA GGAGTCTCGT TTCAAAACAC TTAGAAATTC TGAAATTGGG 241 TACTAAAAAT CGACTCTCTG AACTTCGTAA CGGAATGGCA CGACGGACCG TCACGGGCGT 301 GACGGACCGT CACAGACTCT TTGGTGGAAA TTGAGTCTCT GAACCTTGCG ACGACCTGCA 361 GGACGGACCG TCGCAGGCAC GACGGGCCAT CACAGGTTGC GTAATCCCAG TCTGGGTCGG 421 ATTTCTTTAC ACGTTTTAAG GGACGTTTTG GACTATTCCT ACTTTAATTA TAAAGTTAGT 481 GGGTTTATGT TAATAAGTCT AATTACCTGG GGGTTAAAAG AGGTAACCTT GAGTAAATTA 541 GTGGGTTATT ATTCCATCTT TTATTCTTAA TTATATGCTA ATTAGGGTAA AAGAAGGAGG 601 GTTTGAATAA GAAAAAGAAA AGAACAGAAA GAGAGAGAAG GAGAATCGA Predicted gene structure (within gDNA segment 23739 to 16670): Exon 1 17963 17396 ( 568 n); cDNA 84 649 ( 566 n); score: 0.839 MATCH C06HBa0153O03.1-1- SGN-E374999- 0.839 568 0.875 C PGS_C06HBa0153O03.1-1-_SGN-E374999- (17963 17396) Alignment (genomic DNA sequence = upper lines): AATAATAATA ATGTAACGAC CTATTTAGTC GTTTTGAGCA GCAGATTTTA TTTTTGGAAA 17904 || || | | |||||||||| || ||||||| |||| ||| | |||||||||| |||||||||| AAGAACATAA ATGTAACGAC CTGTTTAGTC GTTTGGAGTA GCAGATTTTA TTTTTGGAAA 143 AACTGGCTGA GACGACGGAT CCCACGATGG ACCGTCATGG GCACGATGGA CCGTCGAGGG 17844 ||| || ||| ||||||||| ||||||| || ||||||||| |||||||||| ||||||| || AACAGGTTGA GACGACGGAA CCCACGACGG ACCGTCATGA GCACGATGGA CCGTCGA-GG 202 GGTCTCGTTC CAAAATACAT AG-AATTCTG AAATTTGGGT TTTGAAATCG ACTCTCTGAA 17785 |||||||| ||||| || | || ||||||| ||||| || | |||||| |||||||||| AGTCTCGTTT CAAAACACTT AGAAATTCTG AAATTGGGTA CTAAAAATCG ACTCTCTGAA 262 CTTCGTGATG AAGTGGCAGG ACGGACCGTC ACAGGCATGA CGGGCCGTCA CAGTCTCTTC 17725 |||||| | | | ||||| | |||||||||| || ||| ||| ||| |||||| ||| ||||| CTTCGTAACG GAATGGCACG ACGGACCGTC ACGGGCGTGA CGGACCGTCA CAGACTCTTT 322 AG-AAAATTT CAGTCTCTGA ACTCTGTGAC GGAAGCAGCA GGACGGACCG TCGCAGGCAC 17666 | || || ||||||||| || || ||| | | | ||| |||||||||| |||||||||| GGTGGAAATT GAGTCTCTGA ACCTTGCGAC -G-ACCTGCA GGACGGACCG TCGCAGGCAC 380 GACGACCCGT CACAGACTGC GTAATCCCAG GCTGAGTCGG ATTTCTTTAA ATGTTTTAAG 17606 |||| || | ||||| ||| |||||||||| ||| ||||| ||||||||| | ||||||| GACGGGCCAT CACAGGTTGC GTAATCCCAG TCTGGGTCGG ATTTCTTTAC ACGTTTTAA- 439 GGGGCGTTTT GGACTATTCC TGCTATAATT ATAAATTTAG TGGGTTAATG TTAATAA-TT 17547 ||| |||||| |||||||||| | || ||||| ||||| |||| |||||| ||| ||||||| | GGGACGTTTT GGACTATTCC TACTTTAATT ATAAAGTTAG TGGGTTTATG TTAATAAGTC 499 TAACTACTTG AGGGTTAAAA GAGATAACCT TGAATTAGTT AGTGGGTTAA ACTCATCATC 17487 ||| ||| || ||||||||| ||| |||||| ||| | | || ||||||||| | |||| TAATTACCTG GGGGTTAAAA GAGGTAACCT TGAGTAAATT AGTGGGTTAT TAT-TCCATC 558 TTTCATACTT AATTATATGC TAATTAGGGT AAAAGAAAGA AGGTTTGAAT AAGAAAAAGA 17427 ||| || ||| |||||||||| |||||||||| ||||||| || ||||||||| |||||||||| TTTTATTCTT AATTATATGC TAATTAGGGT AAAAGAAGGA GGGTTTGAAT AAGAAAAAGA 618 AAAGAACAGA AAGAGAGGGA GAAACGATCG A 17396 |||||||||| ||||||| || | |||| | AAAGAACAGA AAGAGAGAGA AGGAGAATCG A 649 hqPGS_C06HBa0153O03.1-1-_SGN-E374999- (17963 17396) ******************************************************************************** EST sequence 22 -strand 720 n (File: SGN-E389834-) 1 CCCTCGATTT TTTTTTTTTT TTCACGAATA GCTCAATGAA GAACGAGTAA ATTTTTATAT 61 TTCATGGCAT ATTTTTTCAT TAATTCATGG TCGAGAAAAT CTTCGTTTCT AATAGTGTTA 121 TAAATCGTAA AATAATAATG ATATTTAAGT AGGACGATCA TAAATGTAAC GTCCTGTTTA 181 GTCGTTCTGA GTAGCAGATT TTATTTTTGG AAAAACAGGT TGAGTCGACG TAACCCACGA 241 CGGACCGTCA TGAGCACGAT GGACCGTCGA GGAGTCTCGT TTCAAAACAC TTAGAAATTC 301 TGAAATTGGG TACTAAAAAT CGACTCTCTG AACTTCGTAA CGGAATGGCA CGACGGACCG 361 TCACGGGCGT GACGGACCGT CACAGACTCT TTGGTGGAAA TTGAGTCTCT GAACCTTGCG 421 ACGACCTGCA GGACGGACCG TCGCAGGCAC GACGGGCCAT CACAGGTTGC GTAATCCCAG 481 TCTGGGTCGG ATTTCTTTAC ACGTTTTAAG GGACGTTTTG GACTATTCCT ACTTTAATTA 541 TAAAGTTAGT GGGTTTATGT TAATAAGTCT AATTACCTGG GGGTTAAAAG AGGTAACCTT 601 GAGTAAATTA GTGGGTTATT ATTCCATCTT TTATTCTTAA TTATATGCTA ATTAGGGTAA 661 AAGAAGGAGG GTTTGAATAA GAAAAAGAAA AGAACAGAAA GAGAGAGAAG GAGAATCGAT Predicted gene structure (within gDNA segment 25639 to 16660): Exon 1 24892 24864 ( 29 n); cDNA 90 118 ( 29 n); score: 0.569 Intron 1 24863 18554 (6310 n); Pd: 0.313 (s: 0), Pa: 0.000 (s: 0.65) Exon 2 18553 18514 ( 40 n); cDNA 119 157 ( 39 n); score: 0.650 Intron 2 18513 17962 ( 552 n); Pd: 0.845 (s: 0.65), Pa: 0.000 (s: 0.86) Exon 3 17961 17396 ( 566 n); cDNA 158 719 ( 562 n); score: 0.835 PPA cDNA 22 7 MATCH C06HBa0153O03.1-1- SGN-E389834- 0.835 635 0.882 C PGS_C06HBa0153O03.1-1-_SGN-E389834- (24892 24864,18553 18514,17961 17396) Alignment (genomic DNA sequence = upper lines): GTCACGATAC ATCTTGGTTG C-ACCAGGAT GTATATAATA CCTTGAACTA TGAGCCTCTG 24834 ||| || | ||||| ||| | | || | GTCGAGA-AA ATCTTCGTTT CTAATAGTGT .......... .......... .......... 118 TAAGAATAGT GTGAATCAAA TTATCAACAC GGGGCACACA TATCCTTCCC TTAATTCTCA 24774 .......... .......... .......... .......... .......... .......... 118 AAACACCTTC CTCGTCAATT ATTGCCTCTT TAGCCTCTCC TCGTAATACC ATATCTCGAA 24714 .......... .......... .......... .......... .......... .......... 118 TTCGGCTCAG CTTCTCGTCA GTAAACTGTT TTCCCTTAAT CTTGTCAAGA AAAGAAGATC 24654 .......... .......... .......... .......... .......... .......... 118 TTGCCTCCAC ACAAGCCAAA AATCCTCCCT TCTCTAGTAC TTCCAGCCTC ATAAAGTCAT 24594 .......... .......... .......... .......... .......... .......... 118 TAACCAGGGT CTAAACCTCT CTAGCCAATG GGCGCCTAGA AACCTGTAAG TGGGCTAGGC 24534 .......... .......... .......... .......... .......... .......... 118 TTCCCCTGCT CCCTGCTTTT CTACTTAAAG CGTCTGCCAC AACATTAGCT TTTCCTGGGT 24474 .......... .......... .......... .......... .......... .......... 118 GATACAAAAT AGTAATATCA TAGTCCTTCA GTAGTTCCAT CTACCTTCAC TGCCTCAAAT 24414 .......... .......... .......... .......... .......... .......... 118 TCAAATCTTT CTTAGTCAAA TTTGTCAATT GGGAAGCGAC AGAAGAAAAT CCCTTGACAA 24354 .......... .......... .......... .......... .......... .......... 118 ATCGACGGTA GGAGCTAGCT AAACCAAAAA ACGCTCCTTA CCTCTGTAAC ATTAGTAGGT 24294 .......... .......... .......... .......... .......... .......... 118 CTTACCCAAT TCTTCACTAC TTTAATATTA GATGGATCCA CCATCACTCC ATCCTTGGAA 24234 .......... .......... .......... .......... .......... .......... 118 ACCAACATGC CCCAAGAAGG ACACTGAATC TAGCCAAAAC TCACACTTGG AGAATTTGGC 24174 .......... .......... .......... .......... .......... .......... 118 ATAAAGCCTT TTTCTCTCTC AACAATTCCA TAACAATTTT CAAATACTCC CCATGTTCTT 24114 .......... .......... .......... .......... .......... .......... 118 TTCTGCTCTT TGAGTATATC AGTATATCAT CAATAAATAC AATAACAAAC AGATCCAGAT 24054 .......... .......... .......... .......... .......... .......... 118 ATGGCTTAAA AATCCTGTTC ATCAGGCTCA TGAACGCAGA AGAGGCATTC GTAAGCCCAA 23994 .......... .......... .......... .......... .......... .......... 118 AAGACATTAC TAAGAATTCA TAATGCCCAT ACCTGGTTCG AAACACAGCC TTTGGCACAT 23934 .......... .......... .......... .......... .......... .......... 118 CTGCTGCCCG TATTTTCAAT TCATGATAAC TAGATCTCAA ATCGATTTTT GAAAAGATAC 23874 .......... .......... .......... .......... .......... .......... 118 AAGCACCTTG TAACTGATCG AACAAATCAT CGATGCGAGG AAGAGGATAC TTGTTCTTAA 23814 .......... .......... .......... .......... .......... .......... 118 TAGTTACCTT ATTCAGTTGC CTGTAGTCTA TGCACATCCG AAAACTTCCA TCCTTCTTCT 23754 .......... .......... .......... .......... .......... .......... 118 TCACAAATAA AACAGGAGCA CCCCAAGGGG ATGAACTTGG TCTAGTAAAG TCTTTACCTA 23694 .......... .......... .......... .......... .......... .......... 118 ACAACTCCTG AAGTTGGGCC TTTAACTCCC TTAACTCAGA TAGGGTCATT CTATAAGGGG 23634 .......... .......... .......... .......... .......... .......... 118 GTATGGAAAT GGGGCGAGTA CCCGGCTCGA GATCAATACA AAAATCAATA TCCCTATCTG 23574 .......... .......... .......... .......... .......... .......... 118 GTGGCATACC AGGAAGGTCT GCAGGAAACA CATCCAGAAA CTCACAGACT ATCGAAACAG 23514 .......... .......... .......... .......... .......... .......... 118 ACTCAATCGA AGGTACCTTG GAAGTATCAT CCCTGAGATG GGCCAAGAAA GCTAAACAAC 23454 .......... .......... .......... .......... .......... .......... 118 CCCTACTAAC CATCCTCTTA GCACGAAGAA AAGAGATAAT ATGAACTAGG GTGGAAATAT 23394 .......... .......... .......... .......... .......... .......... 118 AGTCACCCTC CCATACTAGC GGATCTGTCC CAGGCTTGGT CAATGTCACA GTTTTAGCGT 23334 .......... .......... .......... .......... .......... .......... 118 TACAATCTAA GATTGCAAAG TTTGGAGAAA GCCAAGTCAT ACCCAAAATT ACATCGAAAT 23274 .......... .......... .......... .......... .......... .......... 118 CAACCATCTC TAGAATAATC AAATCTAAAT GAGTATTGCT CCCCATAAAA ACCACAAGAC 23214 .......... .......... .......... .......... .......... .......... 118 AAGACCTATA CACCTTATCA ACTATCACAG ACTCACCCAC AGGAGTAAAG ACACGAATAG 23154 .......... .......... .......... .......... .......... .......... 118 GCATGTCAAG CAAGTCACAA TGTAAATCAA GACCAGTAAC AAATGAGGAA GATACATATG 23094 .......... .......... .......... .......... .......... .......... 118 AAAATGTGGA TCCAGGATCA AATAATACAG AAGCCATGCA ATCACAAACC AAAAGATTAC 23034 .......... .......... .......... .......... .......... .......... 118 CTGTGATAAC AGCATCAGAT GTCTCTGCTT CAAACCTCTC GGGGAAATCA TAACAATGGG 22974 .......... .......... .......... .......... .......... .......... 118 CCCTATCACC TGTCTGCCCG TTGCCCTTAC CATGTTGTGC TGCAGTAGTT CCAACTTGCC 22914 .......... .......... .......... .......... .......... .......... 118 CGCCACCCCG GCTGATTTGG TGACCACCAT TACCTTGGCC ACCACGTCCT CCATAATGGC 22854 .......... .......... .......... .......... .......... .......... 118 GGCCTCTCCC ATGATTTCCT CTACCTCTAA TTATTGGGGG TCTGTAACTC TATTTTGGAC 22794 .......... .......... .......... .......... .......... .......... 118 AATACCTCCT AATATGTCCA GTCTCTCCAC ATCCACTACA ATCTCTGGAG TCAAGCATAG 22734 .......... .......... .......... .......... .......... .......... 118 GTCTCTAAGA GAATGACGAA GTCTGGGGAT AACCTCCAAA CTCAGAGAAA TGTTGACTGG 22674 .......... .......... .......... .......... .......... .......... 118 TCTGCGATGG ACCCCCAGCT ACAGCCTGTA GTGACGACTG AATAGGTCAG GCTGGGTAAC 22614 .......... .......... .......... .......... .......... .......... 118 CTCCTGAACT CTGCCCTCTG GAGTAAGAAC CACTAAACTC ACCTCCCGTA CGGAACTTCT 22554 .......... .......... .......... .......... .......... .......... 118 TAGATGTCGA CACCATGGTG AAGTTGTCTG GCTTCACCCC CTCCACCTCA ATCACAAAGT 22494 .......... .......... .......... .......... .......... .......... 118 CAACCACTTC CTGAAAGGAT TTTGCTGCGG CAGCTACCTG TAACTGGGAT CTGCAAATCT 22434 .......... .......... .......... .......... .......... .......... 118 AACCTCAATC CTTTCACAAA GCGGCGAATC CGCTCTTATG GACTGAAGCA AAGCTGGGTG 22374 .......... .......... .......... .......... .......... .......... 118 GCATACCTGG ATAGCGCACG AAATTTGGCC TCATAAGCGG CAACAGACAT CCTTCCTTGC 22314 .......... .......... .......... .......... .......... .......... 118 TATAGGCTCA AGAACTCATC TCTCCTCCTA TCCCTCAAAG TCCGGGGTAT ATACTTCTCC 22254 .......... .......... .......... .......... .......... .......... 118 ATAAATAAGC TAGAGAATGA TTCCCAAGTC ATAGGTGGTG CCTGTGCTGG TTGACACTCA 22194 .......... .......... .......... .......... .......... .......... 118 ACATACGACC GCCACCACAT TTTGGCATTC CCCTGAAACT GGTAGGTCAC AAAATCAACA 22134 .......... .......... .......... .......... .......... .......... 118 CCGAATCGTT CTACTATGTC CATCTTATGT AGCAGCTCAT GACAATCAAC CAGAAAATCA 22074 .......... .......... .......... .......... .......... .......... 118 TAGGCATCCT CAGATTTAGC ACCCTTGAAG ACAGGAGGTT TCAACTTTAA GAATTTAGTG 22014 .......... .......... .......... .......... .......... .......... 118 AAAAGTTCAT GTTGATCACT TGTCATTATA GACCCTGTAG TCAATCGAGG AAACGTGCCT 21954 .......... .......... .......... .......... .......... .......... 118 ACTTCCAATG AGGCATCCAT GCGGGGAGCC ACAACAGTTG CATGTTGTAC TCCTGGAACC 21894 .......... .......... .......... .......... .......... .......... 118 TGAGGTGCTG GTACAAGAAA CACTGGAGGT GTCTGGCCTC GATCAGATAA CCCGCTAAGA 21834 .......... .......... .......... .......... .......... .......... 118 TAAGTAAGAA CCTGGTTGAT CATCTCTGGG GTAGGTTGGG GTGGTAATTC CTCATCTTGC 21774 .......... .......... .......... .......... .......... .......... 118 ACCTGTTTAT TTTCCCCTTC CTCACCCTCT CTTACTACCT CATCAGTCGG TGGAGGAGTC 21714 .......... .......... .......... .......... .......... .......... 118 ACCACCCTAT TACTAGCTTG ACCAGGTGTT TGTCCTCCAC CTCTAGAGAT CGTCCTCTTG 21654 .......... .......... .......... .......... .......... .......... 118 CGACCTCTAC CACGACCTCT TGCCACTGCT CCTCCTCGAG CTACAACCCC AATGTTTGGC 21594 .......... .......... .......... .......... .......... .......... 118 TCAGACGCAC GCTATCTTGC CGGTGTTGGT GTTGGCACAG TTGTTTCTCT AGTTCTAACC 21534 .......... .......... .......... .......... .......... .......... 118 ATATGCGAAA TAGAGTGAGG ATGTCAGATA CCAATTTGTA TCACCTAGAT ACCACTTGGA 21474 .......... .......... .......... .......... .......... .......... 118 TCCAAGTAAT AGCACGAAAG AAGGAAAGAA TGGAATTTTG CTAAAGTCCT ATAGCCTCTC 21414 .......... .......... .......... .......... .......... .......... 118 GAAGAAAAGT AAGGGCGTCC CCCTACCGTT CCTCAAGACT CTACTAGACT TGTTCTTGTG 21354 .......... .......... .......... .......... .......... .......... 118 TGATGAGACC AACGAACCTA ACGCTCTGAT ACCAAGTTTG TCACGACCCA AAACGATCCG 21294 .......... .......... .......... .......... .......... .......... 118 TAAGTGGCAC CCACCCTTAC TCTCCTAGGT GAGCGAACCA ACAAATCTAA ACCCCAACAT 21234 .......... .......... .......... .......... .......... .......... 118 TTACCAGTAT ATCAACTATA AATAATATAA ATAATGCGGA AGCTCCAAAA CTCATTACGA 21174 .......... .......... .......... .......... .......... .......... 118 AATTAATTAA ATCAACATCT AAAGTTAAAT ACTTATTATT CCCAAAATCT GTAAGTCATC 21114 .......... .......... .......... .......... .......... .......... 118 ACACCAAGAA CATCTATCCT CGAATTTCTA AATCTAAGAG TATTCAAGAA GCTAAAAATA 21054 .......... .......... .......... .......... .......... .......... 118 GTAAAAAGAT GGTCCATGTC CGAACTTCAA GACATCAAGA CGTGAAGGAG AGAATCCAGC 20994 .......... .......... .......... .......... .......... .......... 118 ACGAGCTAGG AATAATAGCT CACCCTGAAT TCTGATATGC TAAAGACCGG CTAGATCTGA 20934 .......... .......... .......... .......... .......... .......... 118 TGACGAGTCG AAGTCGATGG CACGCTTGCT GCACTCCACA AATAACAAAG AAGAAAATTA 20874 .......... .......... .......... .......... .......... .......... 118 CAAGTAGGGG TCAGTACAAG GAACACGTAC TGAGTAGGTA TCATCGGCCA ACTCAAAATA 20814 .......... .......... .......... .......... .......... .......... 118 GAAAACAATA TATACTGAAT AATAATATAA AATCAACCAT AATACTTAAC AGGTGACAAT 20754 .......... .......... .......... .......... .......... .......... 118 CAACAAGTAT AAGAACCATT GACAACAACA GCAAGCACAT CTATGAGGAC TCAAGCCTCC 20694 .......... .......... .......... .......... .......... .......... 118 ACACCATACT CATTTGGGAA ATAGGTTCTT TGAATTTGAG TACATTAACA TAATTCAAGA 20634 .......... .......... .......... .......... .......... .......... 118 TTCATTCTCT TTATCATTAT CGTGTCGGAA CGTTACACCC GATCCCCTAC TACTACCGTG 20574 .......... .......... .......... .......... .......... .......... 118 TCGGAACGTG ACACTCTGAT CCCCTAATAC TACCGTGTCA GAACGTGACA CCCGATCCCC 20514 .......... .......... .......... .......... .......... .......... 118 TAATACTACC GTGTCAGAAT GTGACACTCC GATCCCCTAA TACTACCGTG TCGAAACATG 20454 .......... .......... .......... .......... .......... .......... 118 ACACCCAATC CATTTATCTC ATTATTTTAG TTCATCAAGC CTTCTTTATG TCAAGGCGCC 20394 .......... .......... .......... .......... .......... .......... 118 ATCTTAATAG AGAGGATTTA AGATTGAAGA TTCAACAGTT TCATCATTCT GACCACCACA 20334 .......... .......... .......... .......... .......... .......... 118 ATTACACAAT CACAACATAC AAACACACAA TCAAGCATAT AGAAGACTTT ACAATACCAC 20274 .......... .......... .......... .......... .......... .......... 118 CCAATACATA TCGATCACTA TTTAGAGTTT ATCTATCATA TATAAATAAA TCATAACCTA 20214 .......... .......... .......... .......... .......... .......... 118 CCTCCACTGA AGAATCGTGA TCAAGCAAGC TACCTTCCCA ATGCCTTTGC TTTCCTCTTC 20154 .......... .......... .......... .......... .......... .......... 118 GTTCTCTCTT TCTCGCTCGT TCTCCCTCTG TGTTTCTTTT TATTTTTCTT ACTCAAAATC 20094 .......... .......... .......... .......... .......... .......... 118 TTGTTCTTTT ACCCTAAATG TCATATAATC AATTATAAAA GATGATAAAA GTACCTCACT 20034 .......... .......... .......... .......... .......... .......... 118 ATTTATTCCC TTATTAACTT CTTTAACCCC CAAGTAAATA AATTATTAAA CTTACCCCAC 19974 .......... .......... .......... .......... .......... .......... 118 TAATTCCATA ATTATAATCA TGAATAGTCC AAAACACCCC TTTAAAACTT TTAGCAGAAA 19914 .......... .......... .......... .......... .......... .......... 118 TCCGACCCAG TCGAGGTTAC GCAGCTTGTG ACGGTCCGTT GTGTCTACGA CGGTCCGTGC 19854 .......... .......... .......... .......... .......... .......... 118 TGTAGTTCCG TCGCGGAGTT CAGAGAGTCG CTCCCAGTAC CCAGATTTTC AGAGTTGAAG 19794 .......... .......... .......... .......... .......... .......... 118 TGTTTTGGAA CGGAGACGCT CGACGGACCG TCGTGCCTGT GACGGTTTGT CCTACCTGCC 19734 .......... .......... .......... .......... .......... .......... 118 GTCGAGGGTA ATGAGGAGAG CAACAGAAGA AATTACACAA GTATGGGACG ACGGAGTCCA 19674 .......... .......... .......... .......... .......... .......... 118 TCACGGTCCA TCGTGACCAT GACGGTCCGT CGTGACCATG ACGGTCCGTC GCGTGATCCG 19614 .......... .......... .......... .......... .......... .......... 118 TCGACCCAGT CAGTTTTTTA TCAAAAATAG TTCTACTGCT CGAACCGACT AAACAGGTCA 19554 .......... .......... .......... .......... .......... .......... 118 TTACAATTTT CCTACTTTAG TTTTCCCTAT GGCTACCACT GTCCATCTAC TATTTTTTTT 19494 .......... .......... .......... .......... .......... .......... 118 CATGATTTGA TCTTTTAAGT AAAATTATTT GTGGAACTTC TCTTTGAAGA TATCTCTCAC 19434 .......... .......... .......... .......... .......... .......... 118 AAATTAGCGA AAAAGTTAGT TAATTTATTT TTATTTTAAA AAGTAAGATA AATGTTTTTG 19374 .......... .......... .......... .......... .......... .......... 118 CGCTATCAAT ATTTTATAAT ATGTAAATTA TTCGAAACAA ACTTTTAATA AATAAAAATT 19314 .......... .......... .......... .......... .......... .......... 118 AGAGCAAATA TAAAAATCAC TAGATTTTTT TTTAAAAAAA TGGGGCCTTG AAACGGTATA 19254 .......... .......... .......... .......... .......... .......... 118 TATTTTTTTA TTTGAATAGA TTATGGGGGA GAATTAATAG AGGTAAGATT TTTTATTTTA 19194 .......... .......... .......... .......... .......... .......... 118 TCTAATAAGA AAATGACAAA TAATATATTT TTAAAAAATA AATAAAACAA AATAAACTTT 19134 .......... .......... .......... .......... .......... .......... 118 GGTTGTTAGT CATAAAAATA TAAGTTATTC AAAAAGGTGA ATGAAAGAGT ATAAGTGAGT 19074 .......... .......... .......... .......... .......... .......... 118 CAAAAAGATG AGTGAAGAGG CATAGCTAAG CCAAAAAAGT GAATGTAAGG GTATTTCTAG 19014 .......... .......... .......... .......... .......... .......... 118 ACCAAAAGAT TGATGAAGGA TATTTTTAGA CATAGTTCAA GGATAGTTTT GGTCCTTTTT 18954 .......... .......... .......... .......... .......... .......... 118 CGTTTAAATA ATCTCATATT TAGATATTGA GTTATTTGCA GGGACGATTC AATATAATTC 18894 .......... .......... .......... .......... .......... .......... 118 GAGGCCTAAA TTTTAAATAA ACTCTATCTG TATTTATTTA TTTTTCTTTT TAGATGTAAA 18834 .......... .......... .......... .......... .......... .......... 118 TTGTATTTAT TTATTTTTCT TTTTAGATGT AAGTTATTAC TTAAGTATCT TTTTTTGTAA 18774 .......... .......... .......... .......... .......... .......... 118 ATGGAAAAGG GCTAAAAATG CCCTTAACTT AGTGGAAATG GTTCAAAATA CCATCCTTCT 18714 .......... .......... .......... .......... .......... .......... 118 ACCTTTTGAG TTAAAAATAC CCTCCACCTT TATTTTGGTT CAAAGATGCC TTTCCTTCCA 18654 .......... .......... .......... .......... .......... .......... 118 CCTTTTGATT TAATAATACT CTTAACCCCC CATTTAATTA AATTTATAAA ATAAAAAATT 18594 .......... .......... .......... .......... .......... .......... 118 CTTAATATTA GCTCATTCCA AAATCTTTAT GATAAATATA TCTAAAAAAT AAAATAAAAA 18534 | |||| | ||||||| || .......... .......... .......... .......... TATAAATCGT AAAATAATAA 138 ATTTATTATA TGTATAAAAA GCAAAAATAA AAATAAAATT TCTCAAAGTT CTTATTCTTT 18474 |||| || ||| | | TGATATT-TA AGTAGGACGA .......... .......... .......... .......... 157 GTATTAAAAT AATAAGACAA TAAAAATCTT AAGATTCTTA TTCTTCATTT TTGCGCAAAA 18414 .......... .......... .......... .......... .......... .......... 157 AAATCTTTAT TTTATTTTAT GTTTTATACA TATTATTTAA TATTTTAATT TGTGAGAAAT 18354 .......... .......... .......... .......... .......... .......... 157 TTTTTTAAGT TATTTGGATT AAATTTTTAA ATTATATTGA GAAAATGCAC AAGTATTCCC 18294 .......... .......... .......... .......... .......... .......... 157 TCAAACTATG TCTGAAATCC CAGAGACACA CTTATACTAT ATTAAGGTCA TATTACCCCC 18234 .......... .......... .......... .......... .......... .......... 157 TGAACTTATT TTATAAGTAA TTTTCTACCC CTTTTGACCT ACGTGGCTCT AGCTTGAAAA 18174 .......... .......... .......... .......... .......... .......... 157 AAAAGTCAAT CAGCGTTGGA CCCACAAGAT AGTGCCACAT AGACCGAAAA GGGCTAGAAA 18114 .......... .......... .......... .......... .......... .......... 157 ATTATTAATA AAATAAGTTC AGGGATAATA GGACCTTAGT ATAGTGTAAG TATGACTTTA 18054 .......... .......... .......... .......... .......... .......... 157 AAATTTCAGG CATAAATTGA GAGGGTACTT GTGCATTATC TCAATAATAT TCAAATCTTT 17994 .......... .......... .......... .......... .......... .......... 157 ACATTAATAT CTAATTTGAT GTAATATTTT AATAATAATA ATGTAACGAC CTATTTAGTC 17934 | || | | |||||||| | || ||||||| .......... .......... .......... ..TCAT-A-A ATGTAACGTC CTGTTTAGTC 183 GTTTTGAGCA GCAGATTTTA TTTTTGGAAA AACTGGCTGA GACGACGGAT CCCACGATGG 17874 ||| |||| | |||||||||| |||||||||| ||| || ||| | ||||| | ||||||| || GTTCTGAGTA GCAGATTTTA TTTTTGGAAA AACAGGTTGA GTCGACGTAA CCCACGACGG 243 ACCGTCATGG GCACGATGGA CCGTCGAGGG GGTCTCGTTC CAAAATACAT AG-AATTCTG 17815 ||||||||| |||||||||| ||||||| || |||||||| ||||| || | || ||||||| ACCGTCATGA GCACGATGGA CCGTCGA-GG AGTCTCGTTT CAAAACACTT AGAAATTCTG 302 AAATTTGGGT TTTGAAATCG ACTCTCTGAA CTTCGTGATG AAGTGGCAGG ACGGACCGTC 17755 ||||| || | |||||| |||||||||| |||||| | | | ||||| | |||||||||| AAATTGGGTA CTAAAAATCG ACTCTCTGAA CTTCGTAACG GAATGGCACG ACGGACCGTC 362 ACAGGCATGA CGGGCCGTCA CAGTCTCTTC AG-AAAATTT CAGTCTCTGA ACTCTGTGAC 17696 || ||| ||| ||| |||||| ||| ||||| | || || ||||||||| || || ||| ACGGGCGTGA CGGACCGTCA CAGACTCTTT GGTGGAAATT GAGTCTCTGA ACCTTGCGAC 422 GGAAGCAGCA GGACGGACCG TCGCAGGCAC GACGACCCGT CACAGACTGC GTAATCCCAG 17636 | | | ||| |||||||||| |||||||||| |||| || | ||||| ||| |||||||||| -G-ACCTGCA GGACGGACCG TCGCAGGCAC GACGGGCCAT CACAGGTTGC GTAATCCCAG 480 GCTGAGTCGG ATTTCTTTAA ATGTTTTAAG GGGGCGTTTT GGACTATTCC TGCTATAATT 17576 ||| ||||| ||||||||| | ||||||| ||| |||||| |||||||||| | || ||||| TCTGGGTCGG ATTTCTTTAC ACGTTTTAA- GGGACGTTTT GGACTATTCC TACTTTAATT 539 ATAAATTTAG TGGGTTAATG TTAATAA-TT TAACTACTTG AGGGTTAAAA GAGATAACCT 17517 ||||| |||| |||||| ||| ||||||| | ||| ||| || ||||||||| ||| |||||| ATAAAGTTAG TGGGTTTATG TTAATAAGTC TAATTACCTG GGGGTTAAAA GAGGTAACCT 599 TGAATTAGTT AGTGGGTTAA ACTCATCATC TTTCATACTT AATTATATGC TAATTAGGGT 17457 ||| | | || ||||||||| | |||| ||| || ||| |||||||||| |||||||||| TGAGTAAATT AGTGGGTTAT TAT-TCCATC TTTTATTCTT AATTATATGC TAATTAGGGT 658 AAAAGAAAGA AGGTTTGAAT AAGAAAAAGA AAAGAACAGA AAGAGAGGGA GAAACGATCG 17397 ||||||| || ||||||||| |||||||||| |||||||||| ||||||| || | |||| AAAAGAAGGA GGGTTTGAAT AAGAAAAAGA AAAGAACAGA AAGAGAGAGA AGGAGAATCG 718 A 17396 | A 719 hqPGS_C06HBa0153O03.1-1-_SGN-E389834- (17961 17396) ******************************************************************************** EST sequence 125 +strand 681 n (File: SGN-E389553+) 1 AATGAGTAAT TTTTTATATT TTATTCCATA ATTTTTTCAT TAATTCATGG TTGAGAAAAT 61 CTTTGTTTCT AATAGTGTTA TAAATCGTAA AATAATAATA ATATTTAAGT AGGAACGAAC 121 ATAAATGTAA CGACCTGTTT AGTCGTTTTG AGTAGCAGAT TTTATTTTTG GAAAAACAGG 181 TTGAGACGAC GGAACCCACG ACGGACCGTC ATGAGCACGA TGGACCGTCG AGGAGTCTCG 241 TTTCAAAACA CTTAGAAATT CTGAAATTGG GTACTAAAAA TCGACTCTCT GAACTTCGTA 301 ACGGAATGGC ACGACGGACC GTCACGGGCG TGACGGACCG TCACAGACTC TTTGGTGGAA 361 ATTGAGTCTC TGAACCTTGC GACGACCTGC AGGACGGACC GTCGCAGGCA CGACGGGCCA 421 TCACAGGTTG CGTAATCCCA GTCTGGGTCG GATTTCTTTA CACGTTTTAA GGGACGTTTT 481 GGACTATTCC TACTTTAATT ATAAAGTTAG TGGGTTTATG TTAATAAGTC TAATTACCTG 541 GGGGTTAAAA GAGGTAACCT TGAGTAAATT AGTGGGTTAT TATTCCATCT TTTATTCTTA 601 ATTATATGCT AATTAGGGTA AAAGAAGGAG GGTTTGAATA AGAAAAAGAA AAGAACAGAA 661 AGAGAGAGAA GGAGAATCGA T Predicted gene structure (within gDNA segment 24039 to 16650): Exon 1 18551 18514 ( 38 n); cDNA 81 117 ( 37 n); score: 0.658 Intron 1 18513 17961 ( 553 n); Pd: 0.845 (s: 0), Pa: 0.000 (s: 0.90) Exon 2 17960 17396 ( 565 n); cDNA 118 680 ( 563 n); score: 0.842 MATCH C06HBa0153O03.1-1- SGN-E389553+ 0.842 603 0.885 C PGS_C06HBa0153O03.1-1-_SGN-E389553+ (18551 18514,17960 17396) Alignment (genomic DNA sequence = upper lines): TAAAAAATAA AATAAAAAAT TTATTATATG TATAAAAAGC AAAAATAAAA ATAAAATTTC 18492 |||| ||| ||||| || |||| || | || || TAAATCGTAA AATAATAATA ATATT-TAAG TAGGAACG.. .......... .......... 117 TCAAAGTTCT TATTCTTTGT ATTAAAATAA TAAGACAATA AAAATCTTAA GATTCTTATT 18432 .......... .......... .......... .......... .......... .......... 117 CTTCATTTTT GCGCAAAAAA ATCTTTATTT TATTTTATGT TTTATACATA TTATTTAATA 18372 .......... .......... .......... .......... .......... .......... 117 TTTTAATTTG TGAGAAATTT TTTTAAGTTA TTTGGATTAA ATTTTTAAAT TATATTGAGA 18312 .......... .......... .......... .......... .......... .......... 117 AAATGCACAA GTATTCCCTC AAACTATGTC TGAAATCCCA GAGACACACT TATACTATAT 18252 .......... .......... .......... .......... .......... .......... 117 TAAGGTCATA TTACCCCCTG AACTTATTTT ATAAGTAATT TTCTACCCCT TTTGACCTAC 18192 .......... .......... .......... .......... .......... .......... 117 GTGGCTCTAG CTTGAAAAAA AAGTCAATCA GCGTTGGACC CACAAGATAG TGCCACATAG 18132 .......... .......... .......... .......... .......... .......... 117 ACCGAAAAGG GCTAGAAAAT TATTAATAAA ATAAGTTCAG GGATAATAGG ACCTTAGTAT 18072 .......... .......... .......... .......... .......... .......... 117 AGTGTAAGTA TGACTTTAAA ATTTCAGGCA TAAATTGAGA GGGTACTTGT GCATTATCTC 18012 .......... .......... .......... .......... .......... .......... 117 AATAATATTC AAATCTTTAC ATTAATATCT AATTTGATGT AATATTTTAA TAATAATAAT 17952 || | ||| .......... .......... .......... .......... .......... .AACATAAAT 126 GTAACGACCT ATTTAGTCGT TTTGAGCAGC AGATTTTATT TTTGGAAAAA CTGGCTGAGA 17892 |||||||||| ||||||||| |||||| ||| |||||||||| |||||||||| | || ||||| GTAACGACCT GTTTAGTCGT TTTGAGTAGC AGATTTTATT TTTGGAAAAA CAGGTTGAGA 186 CGACGGATCC CACGATGGAC CGTCATGGGC ACGATGGACC GTCGAGGGGG TCTCGTTCCA 17832 ||||||| || ||||| |||| ||||||| || |||||||||| ||||| || | ||||||| || CGACGGAACC CACGACGGAC CGTCATGAGC ACGATGGACC GTCGA-GGAG TCTCGTTTCA 245 AAATACATAG -AATTCTGAA ATTTGGGTTT TGAAATCGAC TCTCTGAACT TCGTGATGAA 17773 ||| || ||| ||||||||| ||| || | |||||||| |||||||||| |||| | | | AAACACTTAG AAATTCTGAA ATTGGGTACT AAAAATCGAC TCTCTGAACT TCGTAACGGA 305 GTGGCAGGAC GGACCGTCAC AGGCATGACG GGCCGTCACA GTCTCTTCAG -AAAATTTCA 17714 ||||| ||| |||||||||| ||| ||||| | |||||||| | ||||| | || || | ATGGCACGAC GGACCGTCAC GGGCGTGACG GACCGTCACA GACTCTTTGG TGGAAATTGA 365 GTCTCTGAAC TCTGTGACGG AAGCAGCAGG ACGGACCGTC GCAGGCACGA CGACCCGTCA 17654 |||||||||| || ||| | | | ||||| |||||||||| |||||||||| || || ||| GTCTCTGAAC CTTGCGAC-G -ACCTGCAGG ACGGACCGTC GCAGGCACGA CGGGCCATCA 423 CAGACTGCGT AATCCCAGGC TGAGTCGGAT TTCTTTAAAT GTTTTAAGGG GGCGTTTTGG 17594 ||| ||||| |||||||| | || ||||||| ||||||| | ||||||| || | |||||||| CAGGTTGCGT AATCCCAGTC TGGGTCGGAT TTCTTTACAC GTTTTAA-GG GACGTTTTGG 482 ACTATTCCTG CTATAATTAT AAATTTAGTG GGTTAATGTT AATAA-TTTA ACTACTTGAG 17535 ||||||||| || ||||||| ||| |||||| |||| ||||| ||||| | || | ||| || | ACTATTCCTA CTTTAATTAT AAAGTTAGTG GGTTTATGTT AATAAGTCTA ATTACCTGGG 542 GGTTAAAAGA GATAACCTTG AATTAGTTAG TGGGTTAAAC TCATCATCTT TCATACTTAA 17475 |||||||||| | |||||||| | | | |||| ||||||| | |||||| | || ||||| GGTTAAAAGA GGTAACCTTG AGTAAATTAG TGGGTTATTA T-TCCATCTT TTATTCTTAA 601 TTATATGCTA ATTAGGGTAA AAGAAAGAAG GTTTGAATAA GAAAAAGAAA AGAACAGAAA 17415 |||||||||| |||||||||| ||||| || | |||||||||| |||||||||| |||||||||| TTATATGCTA ATTAGGGTAA AAGAAGGAGG GTTTGAATAA GAAAAAGAAA AGAACAGAAA 661 GAGAGGGAGA AACGATCGA 17396 ||||| || | ||||| GAGAGAGAAG GAGAATCGA 680 hqPGS_C06HBa0153O03.1-1-_SGN-E389553+ (17960 17396) ******************************************************************************** EST sequence 12 -strand 732 n (File: SGN-E550201-) 1 GGCCCCCCCT CGAGTTTTTT TTTTTTTTTT TTATGAATTA GCTCAATGAA AAATGAGTAA 61 ATTTTTTATA TTTTATGGCA TAATTTTTTC ATTAATTCAT GGTNGAGAAA ATCTTTGTTT 121 CTAATAGTGT TATAAATCGT AAAATAATAA TAATATTTAA GTAGGAAGAA CATAAATGTA 181 ACGACCTGTT TAGTCGTTTT GAGTAGCAGA TTTTATTTTT GGAAAAACAG GTTGAGACGA 241 CGGAACCCAC GACGGACCGT CATGAGCACG ATGGACCGTC GAGGAGTCTC GTTTCAAAAC 301 ACTTAGAAAT TCTGAAATTG GGTACTAAAA ATCGACTCTC TGAACTTCGT AACGGAATGG 361 CACGACGGAC CGTCACGGGC GTGACGGACC GTCACAGACT CTTTGGTGGA AATTGAGTCT 421 CTGAACCTTG CGACGACCTG CAGGACGGAC CGTCGCAGGC ACGACGGGCC ATCACAGGTT 481 GCGTAATCCC AGTCTGGGTC GGATTTCTTT ACACGTTTTA AGGGACGTTT TGGACTATTC 541 CTACTTTAAT TATAAAGTTA GTGGGTTTAT GTTAATAAGT CTAATTACCT GGGGGTTAAA 601 AGAGGTAACC TTGAGTAAAT TAGTGGGTTA TTATTCCATC TTTTATTCTT AATTATATGC 661 TAATTAGGGT AAAAGAAGGA GGGTTTGAAT AAGAAAAAGA AAAGAACAGA AAGAGAGAGA 721 AGGAGAATCG AT Predicted gene structure (within gDNA segment 25759 to 16660): Exon 1 24892 24864 ( 29 n); cDNA 102 130 ( 29 n); score: 0.534 Intron 1 24863 18554 (6310 n); Pd: 0.313 (s: 0), Pa: 0.000 (s: 0.68) Exon 2 18553 18514 ( 40 n); cDNA 131 169 ( 39 n); score: 0.675 Intron 2 18513 17960 ( 554 n); Pd: 0.845 (s: 0.68), Pa: 0.000 (s: 0.90) Exon 3 17959 17396 ( 564 n); cDNA 170 731 ( 562 n); score: 0.841 PPA cDNA 34 15 MATCH C06HBa0153O03.1-1- SGN-E550201- 0.841 633 0.865 C PGS_C06HBa0153O03.1-1-_SGN-E550201- (24892 24864,18553 18514,17959 17396) Alignment (genomic DNA sequence = upper lines): GTCACGATAC ATCTTGGTTG C-ACCAGGAT GTATATAATA CCTTGAACTA TGAGCCTCTG 24834 || || | ||||| ||| | | || | GTNGAGAAA- ATCTTTGTTT CTAATAGTGT .......... .......... .......... 130 TAAGAATAGT GTGAATCAAA TTATCAACAC GGGGCACACA TATCCTTCCC TTAATTCTCA 24774 .......... .......... .......... .......... .......... .......... 130 AAACACCTTC CTCGTCAATT ATTGCCTCTT TAGCCTCTCC TCGTAATACC ATATCTCGAA 24714 .......... .......... .......... .......... .......... .......... 130 TTCGGCTCAG CTTCTCGTCA GTAAACTGTT TTCCCTTAAT CTTGTCAAGA AAAGAAGATC 24654 .......... .......... .......... .......... .......... .......... 130 TTGCCTCCAC ACAAGCCAAA AATCCTCCCT TCTCTAGTAC TTCCAGCCTC ATAAAGTCAT 24594 .......... .......... .......... .......... .......... .......... 130 TAACCAGGGT CTAAACCTCT CTAGCCAATG GGCGCCTAGA AACCTGTAAG TGGGCTAGGC 24534 .......... .......... .......... .......... .......... .......... 130 TTCCCCTGCT CCCTGCTTTT CTACTTAAAG CGTCTGCCAC AACATTAGCT TTTCCTGGGT 24474 .......... .......... .......... .......... .......... .......... 130 GATACAAAAT AGTAATATCA TAGTCCTTCA GTAGTTCCAT CTACCTTCAC TGCCTCAAAT 24414 .......... .......... .......... .......... .......... .......... 130 TCAAATCTTT CTTAGTCAAA TTTGTCAATT GGGAAGCGAC AGAAGAAAAT CCCTTGACAA 24354 .......... .......... .......... .......... .......... .......... 130 ATCGACGGTA GGAGCTAGCT AAACCAAAAA ACGCTCCTTA CCTCTGTAAC ATTAGTAGGT 24294 .......... .......... .......... .......... .......... .......... 130 CTTACCCAAT TCTTCACTAC TTTAATATTA GATGGATCCA CCATCACTCC ATCCTTGGAA 24234 .......... .......... .......... .......... .......... .......... 130 ACCAACATGC CCCAAGAAGG ACACTGAATC TAGCCAAAAC TCACACTTGG AGAATTTGGC 24174 .......... .......... .......... .......... .......... .......... 130 ATAAAGCCTT TTTCTCTCTC AACAATTCCA TAACAATTTT CAAATACTCC CCATGTTCTT 24114 .......... .......... .......... .......... .......... .......... 130 TTCTGCTCTT TGAGTATATC AGTATATCAT CAATAAATAC AATAACAAAC AGATCCAGAT 24054 .......... .......... .......... .......... .......... .......... 130 ATGGCTTAAA AATCCTGTTC ATCAGGCTCA TGAACGCAGA AGAGGCATTC GTAAGCCCAA 23994 .......... .......... .......... .......... .......... .......... 130 AAGACATTAC TAAGAATTCA TAATGCCCAT ACCTGGTTCG AAACACAGCC TTTGGCACAT 23934 .......... .......... .......... .......... .......... .......... 130 CTGCTGCCCG TATTTTCAAT TCATGATAAC TAGATCTCAA ATCGATTTTT GAAAAGATAC 23874 .......... .......... .......... .......... .......... .......... 130 AAGCACCTTG TAACTGATCG AACAAATCAT CGATGCGAGG AAGAGGATAC TTGTTCTTAA 23814 .......... .......... .......... .......... .......... .......... 130 TAGTTACCTT ATTCAGTTGC CTGTAGTCTA TGCACATCCG AAAACTTCCA TCCTTCTTCT 23754 .......... .......... .......... .......... .......... .......... 130 TCACAAATAA AACAGGAGCA CCCCAAGGGG ATGAACTTGG TCTAGTAAAG TCTTTACCTA 23694 .......... .......... .......... .......... .......... .......... 130 ACAACTCCTG AAGTTGGGCC TTTAACTCCC TTAACTCAGA TAGGGTCATT CTATAAGGGG 23634 .......... .......... .......... .......... .......... .......... 130 GTATGGAAAT GGGGCGAGTA CCCGGCTCGA GATCAATACA AAAATCAATA TCCCTATCTG 23574 .......... .......... .......... .......... .......... .......... 130 GTGGCATACC AGGAAGGTCT GCAGGAAACA CATCCAGAAA CTCACAGACT ATCGAAACAG 23514 .......... .......... .......... .......... .......... .......... 130 ACTCAATCGA AGGTACCTTG GAAGTATCAT CCCTGAGATG GGCCAAGAAA GCTAAACAAC 23454 .......... .......... .......... .......... .......... .......... 130 CCCTACTAAC CATCCTCTTA GCACGAAGAA AAGAGATAAT ATGAACTAGG GTGGAAATAT 23394 .......... .......... .......... .......... .......... .......... 130 AGTCACCCTC CCATACTAGC GGATCTGTCC CAGGCTTGGT CAATGTCACA GTTTTAGCGT 23334 .......... .......... .......... .......... .......... .......... 130 TACAATCTAA GATTGCAAAG TTTGGAGAAA GCCAAGTCAT ACCCAAAATT ACATCGAAAT 23274 .......... .......... .......... .......... .......... .......... 130 CAACCATCTC TAGAATAATC AAATCTAAAT GAGTATTGCT CCCCATAAAA ACCACAAGAC 23214 .......... .......... .......... .......... .......... .......... 130 AAGACCTATA CACCTTATCA ACTATCACAG ACTCACCCAC AGGAGTAAAG ACACGAATAG 23154 .......... .......... .......... .......... .......... .......... 130 GCATGTCAAG CAAGTCACAA TGTAAATCAA GACCAGTAAC AAATGAGGAA GATACATATG 23094 .......... .......... .......... .......... .......... .......... 130 AAAATGTGGA TCCAGGATCA AATAATACAG AAGCCATGCA ATCACAAACC AAAAGATTAC 23034 .......... .......... .......... .......... .......... .......... 130 CTGTGATAAC AGCATCAGAT GTCTCTGCTT CAAACCTCTC GGGGAAATCA TAACAATGGG 22974 .......... .......... .......... .......... .......... .......... 130 CCCTATCACC TGTCTGCCCG TTGCCCTTAC CATGTTGTGC TGCAGTAGTT CCAACTTGCC 22914 .......... .......... .......... .......... .......... .......... 130 CGCCACCCCG GCTGATTTGG TGACCACCAT TACCTTGGCC ACCACGTCCT CCATAATGGC 22854 .......... .......... .......... .......... .......... .......... 130 GGCCTCTCCC ATGATTTCCT CTACCTCTAA TTATTGGGGG TCTGTAACTC TATTTTGGAC 22794 .......... .......... .......... .......... .......... .......... 130 AATACCTCCT AATATGTCCA GTCTCTCCAC ATCCACTACA ATCTCTGGAG TCAAGCATAG 22734 .......... .......... .......... .......... .......... .......... 130 GTCTCTAAGA GAATGACGAA GTCTGGGGAT AACCTCCAAA CTCAGAGAAA TGTTGACTGG 22674 .......... .......... .......... .......... .......... .......... 130 TCTGCGATGG ACCCCCAGCT ACAGCCTGTA GTGACGACTG AATAGGTCAG GCTGGGTAAC 22614 .......... .......... .......... .......... .......... .......... 130 CTCCTGAACT CTGCCCTCTG GAGTAAGAAC CACTAAACTC ACCTCCCGTA CGGAACTTCT 22554 .......... .......... .......... .......... .......... .......... 130 TAGATGTCGA CACCATGGTG AAGTTGTCTG GCTTCACCCC CTCCACCTCA ATCACAAAGT 22494 .......... .......... .......... .......... .......... .......... 130 CAACCACTTC CTGAAAGGAT TTTGCTGCGG CAGCTACCTG TAACTGGGAT CTGCAAATCT 22434 .......... .......... .......... .......... .......... .......... 130 AACCTCAATC CTTTCACAAA GCGGCGAATC CGCTCTTATG GACTGAAGCA AAGCTGGGTG 22374 .......... .......... .......... .......... .......... .......... 130 GCATACCTGG ATAGCGCACG AAATTTGGCC TCATAAGCGG CAACAGACAT CCTTCCTTGC 22314 .......... .......... .......... .......... .......... .......... 130 TATAGGCTCA AGAACTCATC TCTCCTCCTA TCCCTCAAAG TCCGGGGTAT ATACTTCTCC 22254 .......... .......... .......... .......... .......... .......... 130 ATAAATAAGC TAGAGAATGA TTCCCAAGTC ATAGGTGGTG CCTGTGCTGG TTGACACTCA 22194 .......... .......... .......... .......... .......... .......... 130 ACATACGACC GCCACCACAT TTTGGCATTC CCCTGAAACT GGTAGGTCAC AAAATCAACA 22134 .......... .......... .......... .......... .......... .......... 130 CCGAATCGTT CTACTATGTC CATCTTATGT AGCAGCTCAT GACAATCAAC CAGAAAATCA 22074 .......... .......... .......... .......... .......... .......... 130 TAGGCATCCT CAGATTTAGC ACCCTTGAAG ACAGGAGGTT TCAACTTTAA GAATTTAGTG 22014 .......... .......... .......... .......... .......... .......... 130 AAAAGTTCAT GTTGATCACT TGTCATTATA GACCCTGTAG TCAATCGAGG AAACGTGCCT 21954 .......... .......... .......... .......... .......... .......... 130 ACTTCCAATG AGGCATCCAT GCGGGGAGCC ACAACAGTTG CATGTTGTAC TCCTGGAACC 21894 .......... .......... .......... .......... .......... .......... 130 TGAGGTGCTG GTACAAGAAA CACTGGAGGT GTCTGGCCTC GATCAGATAA CCCGCTAAGA 21834 .......... .......... .......... .......... .......... .......... 130 TAAGTAAGAA CCTGGTTGAT CATCTCTGGG GTAGGTTGGG GTGGTAATTC CTCATCTTGC 21774 .......... .......... .......... .......... .......... .......... 130 ACCTGTTTAT TTTCCCCTTC CTCACCCTCT CTTACTACCT CATCAGTCGG TGGAGGAGTC 21714 .......... .......... .......... .......... .......... .......... 130 ACCACCCTAT TACTAGCTTG ACCAGGTGTT TGTCCTCCAC CTCTAGAGAT CGTCCTCTTG 21654 .......... .......... .......... .......... .......... .......... 130 CGACCTCTAC CACGACCTCT TGCCACTGCT CCTCCTCGAG CTACAACCCC AATGTTTGGC 21594 .......... .......... .......... .......... .......... .......... 130 TCAGACGCAC GCTATCTTGC CGGTGTTGGT GTTGGCACAG TTGTTTCTCT AGTTCTAACC 21534 .......... .......... .......... .......... .......... .......... 130 ATATGCGAAA TAGAGTGAGG ATGTCAGATA CCAATTTGTA TCACCTAGAT ACCACTTGGA 21474 .......... .......... .......... .......... .......... .......... 130 TCCAAGTAAT AGCACGAAAG AAGGAAAGAA TGGAATTTTG CTAAAGTCCT ATAGCCTCTC 21414 .......... .......... .......... .......... .......... .......... 130 GAAGAAAAGT AAGGGCGTCC CCCTACCGTT CCTCAAGACT CTACTAGACT TGTTCTTGTG 21354 .......... .......... .......... .......... .......... .......... 130 TGATGAGACC AACGAACCTA ACGCTCTGAT ACCAAGTTTG TCACGACCCA AAACGATCCG 21294 .......... .......... .......... .......... .......... .......... 130 TAAGTGGCAC CCACCCTTAC TCTCCTAGGT GAGCGAACCA ACAAATCTAA ACCCCAACAT 21234 .......... .......... .......... .......... .......... .......... 130 TTACCAGTAT ATCAACTATA AATAATATAA ATAATGCGGA AGCTCCAAAA CTCATTACGA 21174 .......... .......... .......... .......... .......... .......... 130 AATTAATTAA ATCAACATCT AAAGTTAAAT ACTTATTATT CCCAAAATCT GTAAGTCATC 21114 .......... .......... .......... .......... .......... .......... 130 ACACCAAGAA CATCTATCCT CGAATTTCTA AATCTAAGAG TATTCAAGAA GCTAAAAATA 21054 .......... .......... .......... .......... .......... .......... 130 GTAAAAAGAT GGTCCATGTC CGAACTTCAA GACATCAAGA CGTGAAGGAG AGAATCCAGC 20994 .......... .......... .......... .......... .......... .......... 130 ACGAGCTAGG AATAATAGCT CACCCTGAAT TCTGATATGC TAAAGACCGG CTAGATCTGA 20934 .......... .......... .......... .......... .......... .......... 130 TGACGAGTCG AAGTCGATGG CACGCTTGCT GCACTCCACA AATAACAAAG AAGAAAATTA 20874 .......... .......... .......... .......... .......... .......... 130 CAAGTAGGGG TCAGTACAAG GAACACGTAC TGAGTAGGTA TCATCGGCCA ACTCAAAATA 20814 .......... .......... .......... .......... .......... .......... 130 GAAAACAATA TATACTGAAT AATAATATAA AATCAACCAT AATACTTAAC AGGTGACAAT 20754 .......... .......... .......... .......... .......... .......... 130 CAACAAGTAT AAGAACCATT GACAACAACA GCAAGCACAT CTATGAGGAC TCAAGCCTCC 20694 .......... .......... .......... .......... .......... .......... 130 ACACCATACT CATTTGGGAA ATAGGTTCTT TGAATTTGAG TACATTAACA TAATTCAAGA 20634 .......... .......... .......... .......... .......... .......... 130 TTCATTCTCT TTATCATTAT CGTGTCGGAA CGTTACACCC GATCCCCTAC TACTACCGTG 20574 .......... .......... .......... .......... .......... .......... 130 TCGGAACGTG ACACTCTGAT CCCCTAATAC TACCGTGTCA GAACGTGACA CCCGATCCCC 20514 .......... .......... .......... .......... .......... .......... 130 TAATACTACC GTGTCAGAAT GTGACACTCC GATCCCCTAA TACTACCGTG TCGAAACATG 20454 .......... .......... .......... .......... .......... .......... 130 ACACCCAATC CATTTATCTC ATTATTTTAG TTCATCAAGC CTTCTTTATG TCAAGGCGCC 20394 .......... .......... .......... .......... .......... .......... 130 ATCTTAATAG AGAGGATTTA AGATTGAAGA TTCAACAGTT TCATCATTCT GACCACCACA 20334 .......... .......... .......... .......... .......... .......... 130 ATTACACAAT CACAACATAC AAACACACAA TCAAGCATAT AGAAGACTTT ACAATACCAC 20274 .......... .......... .......... .......... .......... .......... 130 CCAATACATA TCGATCACTA TTTAGAGTTT ATCTATCATA TATAAATAAA TCATAACCTA 20214 .......... .......... .......... .......... .......... .......... 130 CCTCCACTGA AGAATCGTGA TCAAGCAAGC TACCTTCCCA ATGCCTTTGC TTTCCTCTTC 20154 .......... .......... .......... .......... .......... .......... 130 GTTCTCTCTT TCTCGCTCGT TCTCCCTCTG TGTTTCTTTT TATTTTTCTT ACTCAAAATC 20094 .......... .......... .......... .......... .......... .......... 130 TTGTTCTTTT ACCCTAAATG TCATATAATC AATTATAAAA GATGATAAAA GTACCTCACT 20034 .......... .......... .......... .......... .......... .......... 130 ATTTATTCCC TTATTAACTT CTTTAACCCC CAAGTAAATA AATTATTAAA CTTACCCCAC 19974 .......... .......... .......... .......... .......... .......... 130 TAATTCCATA ATTATAATCA TGAATAGTCC AAAACACCCC TTTAAAACTT TTAGCAGAAA 19914 .......... .......... .......... .......... .......... .......... 130 TCCGACCCAG TCGAGGTTAC GCAGCTTGTG ACGGTCCGTT GTGTCTACGA CGGTCCGTGC 19854 .......... .......... .......... .......... .......... .......... 130 TGTAGTTCCG TCGCGGAGTT CAGAGAGTCG CTCCCAGTAC CCAGATTTTC AGAGTTGAAG 19794 .......... .......... .......... .......... .......... .......... 130 TGTTTTGGAA CGGAGACGCT CGACGGACCG TCGTGCCTGT GACGGTTTGT CCTACCTGCC 19734 .......... .......... .......... .......... .......... .......... 130 GTCGAGGGTA ATGAGGAGAG CAACAGAAGA AATTACACAA GTATGGGACG ACGGAGTCCA 19674 .......... .......... .......... .......... .......... .......... 130 TCACGGTCCA TCGTGACCAT GACGGTCCGT CGTGACCATG ACGGTCCGTC GCGTGATCCG 19614 .......... .......... .......... .......... .......... .......... 130 TCGACCCAGT CAGTTTTTTA TCAAAAATAG TTCTACTGCT CGAACCGACT AAACAGGTCA 19554 .......... .......... .......... .......... .......... .......... 130 TTACAATTTT CCTACTTTAG TTTTCCCTAT GGCTACCACT GTCCATCTAC TATTTTTTTT 19494 .......... .......... .......... .......... .......... .......... 130 CATGATTTGA TCTTTTAAGT AAAATTATTT GTGGAACTTC TCTTTGAAGA TATCTCTCAC 19434 .......... .......... .......... .......... .......... .......... 130 AAATTAGCGA AAAAGTTAGT TAATTTATTT TTATTTTAAA AAGTAAGATA AATGTTTTTG 19374 .......... .......... .......... .......... .......... .......... 130 CGCTATCAAT ATTTTATAAT ATGTAAATTA TTCGAAACAA ACTTTTAATA AATAAAAATT 19314 .......... .......... .......... .......... .......... .......... 130 AGAGCAAATA TAAAAATCAC TAGATTTTTT TTTAAAAAAA TGGGGCCTTG AAACGGTATA 19254 .......... .......... .......... .......... .......... .......... 130 TATTTTTTTA TTTGAATAGA TTATGGGGGA GAATTAATAG AGGTAAGATT TTTTATTTTA 19194 .......... .......... .......... .......... .......... .......... 130 TCTAATAAGA AAATGACAAA TAATATATTT TTAAAAAATA AATAAAACAA AATAAACTTT 19134 .......... .......... .......... .......... .......... .......... 130 GGTTGTTAGT CATAAAAATA TAAGTTATTC AAAAAGGTGA ATGAAAGAGT ATAAGTGAGT 19074 .......... .......... .......... .......... .......... .......... 130 CAAAAAGATG AGTGAAGAGG CATAGCTAAG CCAAAAAAGT GAATGTAAGG GTATTTCTAG 19014 .......... .......... .......... .......... .......... .......... 130 ACCAAAAGAT TGATGAAGGA TATTTTTAGA CATAGTTCAA GGATAGTTTT GGTCCTTTTT 18954 .......... .......... .......... .......... .......... .......... 130 CGTTTAAATA ATCTCATATT TAGATATTGA GTTATTTGCA GGGACGATTC AATATAATTC 18894 .......... .......... .......... .......... .......... .......... 130 GAGGCCTAAA TTTTAAATAA ACTCTATCTG TATTTATTTA TTTTTCTTTT TAGATGTAAA 18834 .......... .......... .......... .......... .......... .......... 130 TTGTATTTAT TTATTTTTCT TTTTAGATGT AAGTTATTAC TTAAGTATCT TTTTTTGTAA 18774 .......... .......... .......... .......... .......... .......... 130 ATGGAAAAGG GCTAAAAATG CCCTTAACTT AGTGGAAATG GTTCAAAATA CCATCCTTCT 18714 .......... .......... .......... .......... .......... .......... 130 ACCTTTTGAG TTAAAAATAC CCTCCACCTT TATTTTGGTT CAAAGATGCC TTTCCTTCCA 18654 .......... .......... .......... .......... .......... .......... 130 CCTTTTGATT TAATAATACT CTTAACCCCC CATTTAATTA AATTTATAAA ATAAAAAATT 18594 .......... .......... .......... .......... .......... .......... 130 CTTAATATTA GCTCATTCCA AAATCTTTAT GATAAATATA TCTAAAAAAT AAAATAAAAA 18534 | |||| | ||||||| || .......... .......... .......... .......... TATAAATCGT AAAATAATAA 150 ATTTATTATA TGTATAAAAA GCAAAAATAA AAATAAAATT TCTCAAAGTT CTTATTCTTT 18474 |||| || ||| || | TAATATT-TA AGTAGGAAGA .......... .......... .......... .......... 169 GTATTAAAAT AATAAGACAA TAAAAATCTT AAGATTCTTA TTCTTCATTT TTGCGCAAAA 18414 .......... .......... .......... .......... .......... .......... 169 AAATCTTTAT TTTATTTTAT GTTTTATACA TATTATTTAA TATTTTAATT TGTGAGAAAT 18354 .......... .......... .......... .......... .......... .......... 169 TTTTTTAAGT TATTTGGATT AAATTTTTAA ATTATATTGA GAAAATGCAC AAGTATTCCC 18294 .......... .......... .......... .......... .......... .......... 169 TCAAACTATG TCTGAAATCC CAGAGACACA CTTATACTAT ATTAAGGTCA TATTACCCCC 18234 .......... .......... .......... .......... .......... .......... 169 TGAACTTATT TTATAAGTAA TTTTCTACCC CTTTTGACCT ACGTGGCTCT AGCTTGAAAA 18174 .......... .......... .......... .......... .......... .......... 169 AAAAGTCAAT CAGCGTTGGA CCCACAAGAT AGTGCCACAT AGACCGAAAA GGGCTAGAAA 18114 .......... .......... .......... .......... .......... .......... 169 ATTATTAATA AAATAAGTTC AGGGATAATA GGACCTTAGT ATAGTGTAAG TATGACTTTA 18054 .......... .......... .......... .......... .......... .......... 169 AAATTTCAGG CATAAATTGA GAGGGTACTT GTGCATTATC TCAATAATAT TCAAATCTTT 17994 .......... .......... .......... .......... .......... .......... 169 ACATTAATAT CTAATTTGAT GTAATATTTT AATAATAATA ATGTAACGAC CTATTTAGTC 17934 | | | |||||||||| || ||||||| .......... .......... .......... ....ACATAA ATGTAACGAC CTGTTTAGTC 195 GTTTTGAGCA GCAGATTTTA TTTTTGGAAA AACTGGCTGA GACGACGGAT CCCACGATGG 17874 |||||||| | |||||||||| |||||||||| ||| || ||| ||||||||| ||||||| || GTTTTGAGTA GCAGATTTTA TTTTTGGAAA AACAGGTTGA GACGACGGAA CCCACGACGG 255 ACCGTCATGG GCACGATGGA CCGTCGAGGG GGTCTCGTTC CAAAATACAT AG-AATTCTG 17815 ||||||||| |||||||||| ||||||| || |||||||| ||||| || | || ||||||| ACCGTCATGA GCACGATGGA CCGTCGA-GG AGTCTCGTTT CAAAACACTT AGAAATTCTG 314 AAATTTGGGT TTTGAAATCG ACTCTCTGAA CTTCGTGATG AAGTGGCAGG ACGGACCGTC 17755 ||||| || | |||||| |||||||||| |||||| | | | ||||| | |||||||||| AAATTGGGTA CTAAAAATCG ACTCTCTGAA CTTCGTAACG GAATGGCACG ACGGACCGTC 374 ACAGGCATGA CGGGCCGTCA CAGTCTCTTC AG-AAAATTT CAGTCTCTGA ACTCTGTGAC 17696 || ||| ||| ||| |||||| ||| ||||| | || || ||||||||| || || ||| ACGGGCGTGA CGGACCGTCA CAGACTCTTT GGTGGAAATT GAGTCTCTGA ACCTTGCGAC 434 GGAAGCAGCA GGACGGACCG TCGCAGGCAC GACGACCCGT CACAGACTGC GTAATCCCAG 17636 | | | ||| |||||||||| |||||||||| |||| || | ||||| ||| |||||||||| -G-ACCTGCA GGACGGACCG TCGCAGGCAC GACGGGCCAT CACAGGTTGC GTAATCCCAG 492 GCTGAGTCGG ATTTCTTTAA ATGTTTTAAG GGGGCGTTTT GGACTATTCC TGCTATAATT 17576 ||| ||||| ||||||||| | ||||||| ||| |||||| |||||||||| | || ||||| TCTGGGTCGG ATTTCTTTAC ACGTTTTAA- GGGACGTTTT GGACTATTCC TACTTTAATT 551 ATAAATTTAG TGGGTTAATG TTAATAA-TT TAACTACTTG AGGGTTAAAA GAGATAACCT 17517 ||||| |||| |||||| ||| ||||||| | ||| ||| || ||||||||| ||| |||||| ATAAAGTTAG TGGGTTTATG TTAATAAGTC TAATTACCTG GGGGTTAAAA GAGGTAACCT 611 TGAATTAGTT AGTGGGTTAA ACTCATCATC TTTCATACTT AATTATATGC TAATTAGGGT 17457 ||| | | || ||||||||| | |||| ||| || ||| |||||||||| |||||||||| TGAGTAAATT AGTGGGTTAT TAT-TCCATC TTTTATTCTT AATTATATGC TAATTAGGGT 670 AAAAGAAAGA AGGTTTGAAT AAGAAAAAGA AAAGAACAGA AAGAGAGGGA GAAACGATCG 17397 ||||||| || ||||||||| |||||||||| |||||||||| ||||||| || | |||| AAAAGAAGGA GGGTTTGAAT AAGAAAAAGA AAAGAACAGA AAGAGAGAGA AGGAGAATCG 730 A 17396 | A 731 hqPGS_C06HBa0153O03.1-1-_SGN-E550201- (17959 17396) ******************************************************************************** EST sequence 15 -strand 715 n (File: SGN-E550335-) 1 TTTTTTTTTT TTTTTTATGA ATTAGCTCAA TGAAAAATGA GTAAATTTTT TATATTTTAT 61 GGCATAATTT TTTCATTAAT TCATGGTNGA GAAAATCTTT GTTTCTAATA GTGTTATAAA 121 TCGTAAAATA ATAATAATAT TTAAGTAGGA AGAACATAAA TGTAACGACC TGTTTAGTCG 181 TTTTGAGTAG CAGATTTTAT TTTTGGAAAA ACAGGTTGAG ACGACGGAAC CCACGACGGA 241 CCGTCATGAG CACGATGGAC CGTCGAGGAG TCTCGTTTCA AAACACTTAG AAATTCTGAA 301 ATTGGGTACT AAAAATCGAC TCTCTGAACT TCGTAACGGA ATGGCACGAC GGACCGTCAC 361 GGGCGTGACG GACCGTCACA GATTCTTTGG TGGAAATTGA GTCTCTGAAC CTTGCGACGA 421 CCTGCAGGAC GGACCGTCGC AGGCACGACG GGCCATCACA GGTTGCGTAA TCCCAGTCTG 481 GGTCGGATTT CTTTACACGT TTTAAGGGAC GTTTTGGACT ATTCCTACTT TAATTATAAA 541 GTTAGTGGGT TTATGTTAAT AAGTCTAATT ACCTGGGGGT TAAAAGAGGT AACCTTGAGT 601 AAATTAGTGG GTTATTATTC CATCTTTTAT TCTTAATTAT ATGCTAATTA GGGTAAAAGA 661 AGGAGGGTTG AATAAGAAAA AGAAAAGAAC AGAAAGAGAG AGAAGGAGAA TCGAT Predicted gene structure (within gDNA segment 25599 to 16660): Exon 1 24892 24864 ( 29 n); cDNA 86 114 ( 29 n); score: 0.534 Intron 1 24863 18554 (6310 n); Pd: 0.313 (s: 0), Pa: 0.000 (s: 0.68) Exon 2 18553 18514 ( 40 n); cDNA 115 153 ( 39 n); score: 0.675 Intron 2 18513 17960 ( 554 n); Pd: 0.845 (s: 0.68), Pa: 0.000 (s: 0.90) Exon 3 17959 17396 ( 564 n); cDNA 154 714 ( 561 n); score: 0.838 PPA cDNA 18 1 MATCH C06HBa0153O03.1-1- SGN-E550335- 0.838 633 0.885 C PGS_C06HBa0153O03.1-1-_SGN-E550335- (24892 24864,18553 18514,17959 17396) Alignment (genomic DNA sequence = upper lines): GTCACGATAC ATCTTGGTTG C-ACCAGGAT GTATATAATA CCTTGAACTA TGAGCCTCTG 24834 || || | ||||| ||| | | || | GTNGAGAAA- ATCTTTGTTT CTAATAGTGT .......... .......... .......... 114 TAAGAATAGT GTGAATCAAA TTATCAACAC GGGGCACACA TATCCTTCCC TTAATTCTCA 24774 .......... .......... .......... .......... .......... .......... 114 AAACACCTTC CTCGTCAATT ATTGCCTCTT TAGCCTCTCC TCGTAATACC ATATCTCGAA 24714 .......... .......... .......... .......... .......... .......... 114 TTCGGCTCAG CTTCTCGTCA GTAAACTGTT TTCCCTTAAT CTTGTCAAGA AAAGAAGATC 24654 .......... .......... .......... .......... .......... .......... 114 TTGCCTCCAC ACAAGCCAAA AATCCTCCCT TCTCTAGTAC TTCCAGCCTC ATAAAGTCAT 24594 .......... .......... .......... .......... .......... .......... 114 TAACCAGGGT CTAAACCTCT CTAGCCAATG GGCGCCTAGA AACCTGTAAG TGGGCTAGGC 24534 .......... .......... .......... .......... .......... .......... 114 TTCCCCTGCT CCCTGCTTTT CTACTTAAAG CGTCTGCCAC AACATTAGCT TTTCCTGGGT 24474 .......... .......... .......... .......... .......... .......... 114 GATACAAAAT AGTAATATCA TAGTCCTTCA GTAGTTCCAT CTACCTTCAC TGCCTCAAAT 24414 .......... .......... .......... .......... .......... .......... 114 TCAAATCTTT CTTAGTCAAA TTTGTCAATT GGGAAGCGAC AGAAGAAAAT CCCTTGACAA 24354 .......... .......... .......... .......... .......... .......... 114 ATCGACGGTA GGAGCTAGCT AAACCAAAAA ACGCTCCTTA CCTCTGTAAC ATTAGTAGGT 24294 .......... .......... .......... .......... .......... .......... 114 CTTACCCAAT TCTTCACTAC TTTAATATTA GATGGATCCA CCATCACTCC ATCCTTGGAA 24234 .......... .......... .......... .......... .......... .......... 114 ACCAACATGC CCCAAGAAGG ACACTGAATC TAGCCAAAAC TCACACTTGG AGAATTTGGC 24174 .......... .......... .......... .......... .......... .......... 114 ATAAAGCCTT TTTCTCTCTC AACAATTCCA TAACAATTTT CAAATACTCC CCATGTTCTT 24114 .......... .......... .......... .......... .......... .......... 114 TTCTGCTCTT TGAGTATATC AGTATATCAT CAATAAATAC AATAACAAAC AGATCCAGAT 24054 .......... .......... .......... .......... .......... .......... 114 ATGGCTTAAA AATCCTGTTC ATCAGGCTCA TGAACGCAGA AGAGGCATTC GTAAGCCCAA 23994 .......... .......... .......... .......... .......... .......... 114 AAGACATTAC TAAGAATTCA TAATGCCCAT ACCTGGTTCG AAACACAGCC TTTGGCACAT 23934 .......... .......... .......... .......... .......... .......... 114 CTGCTGCCCG TATTTTCAAT TCATGATAAC TAGATCTCAA ATCGATTTTT GAAAAGATAC 23874 .......... .......... .......... .......... .......... .......... 114 AAGCACCTTG TAACTGATCG AACAAATCAT CGATGCGAGG AAGAGGATAC TTGTTCTTAA 23814 .......... .......... .......... .......... .......... .......... 114 TAGTTACCTT ATTCAGTTGC CTGTAGTCTA TGCACATCCG AAAACTTCCA TCCTTCTTCT 23754 .......... .......... .......... .......... .......... .......... 114 TCACAAATAA AACAGGAGCA CCCCAAGGGG ATGAACTTGG TCTAGTAAAG TCTTTACCTA 23694 .......... .......... .......... .......... .......... .......... 114 ACAACTCCTG AAGTTGGGCC TTTAACTCCC TTAACTCAGA TAGGGTCATT CTATAAGGGG 23634 .......... .......... .......... .......... .......... .......... 114 GTATGGAAAT GGGGCGAGTA CCCGGCTCGA GATCAATACA AAAATCAATA TCCCTATCTG 23574 .......... .......... .......... .......... .......... .......... 114 GTGGCATACC AGGAAGGTCT GCAGGAAACA CATCCAGAAA CTCACAGACT ATCGAAACAG 23514 .......... .......... .......... .......... .......... .......... 114 ACTCAATCGA AGGTACCTTG GAAGTATCAT CCCTGAGATG GGCCAAGAAA GCTAAACAAC 23454 .......... .......... .......... .......... .......... .......... 114 CCCTACTAAC CATCCTCTTA GCACGAAGAA AAGAGATAAT ATGAACTAGG GTGGAAATAT 23394 .......... .......... .......... .......... .......... .......... 114 AGTCACCCTC CCATACTAGC GGATCTGTCC CAGGCTTGGT CAATGTCACA GTTTTAGCGT 23334 .......... .......... .......... .......... .......... .......... 114 TACAATCTAA GATTGCAAAG TTTGGAGAAA GCCAAGTCAT ACCCAAAATT ACATCGAAAT 23274 .......... .......... .......... .......... .......... .......... 114 CAACCATCTC TAGAATAATC AAATCTAAAT GAGTATTGCT CCCCATAAAA ACCACAAGAC 23214 .......... .......... .......... .......... .......... .......... 114 AAGACCTATA CACCTTATCA ACTATCACAG ACTCACCCAC AGGAGTAAAG ACACGAATAG 23154 .......... .......... .......... .......... .......... .......... 114 GCATGTCAAG CAAGTCACAA TGTAAATCAA GACCAGTAAC AAATGAGGAA GATACATATG 23094 .......... .......... .......... .......... .......... .......... 114 AAAATGTGGA TCCAGGATCA AATAATACAG AAGCCATGCA ATCACAAACC AAAAGATTAC 23034 .......... .......... .......... .......... .......... .......... 114 CTGTGATAAC AGCATCAGAT GTCTCTGCTT CAAACCTCTC GGGGAAATCA TAACAATGGG 22974 .......... .......... .......... .......... .......... .......... 114 CCCTATCACC TGTCTGCCCG TTGCCCTTAC CATGTTGTGC TGCAGTAGTT CCAACTTGCC 22914 .......... .......... .......... .......... .......... .......... 114 CGCCACCCCG GCTGATTTGG TGACCACCAT TACCTTGGCC ACCACGTCCT CCATAATGGC 22854 .......... .......... .......... .......... .......... .......... 114 GGCCTCTCCC ATGATTTCCT CTACCTCTAA TTATTGGGGG TCTGTAACTC TATTTTGGAC 22794 .......... .......... .......... .......... .......... .......... 114 AATACCTCCT AATATGTCCA GTCTCTCCAC ATCCACTACA ATCTCTGGAG TCAAGCATAG 22734 .......... .......... .......... .......... .......... .......... 114 GTCTCTAAGA GAATGACGAA GTCTGGGGAT AACCTCCAAA CTCAGAGAAA TGTTGACTGG 22674 .......... .......... .......... .......... .......... .......... 114 TCTGCGATGG ACCCCCAGCT ACAGCCTGTA GTGACGACTG AATAGGTCAG GCTGGGTAAC 22614 .......... .......... .......... .......... .......... .......... 114 CTCCTGAACT CTGCCCTCTG GAGTAAGAAC CACTAAACTC ACCTCCCGTA CGGAACTTCT 22554 .......... .......... .......... .......... .......... .......... 114 TAGATGTCGA CACCATGGTG AAGTTGTCTG GCTTCACCCC CTCCACCTCA ATCACAAAGT 22494 .......... .......... .......... .......... .......... .......... 114 CAACCACTTC CTGAAAGGAT TTTGCTGCGG CAGCTACCTG TAACTGGGAT CTGCAAATCT 22434 .......... .......... .......... .......... .......... .......... 114 AACCTCAATC CTTTCACAAA GCGGCGAATC CGCTCTTATG GACTGAAGCA AAGCTGGGTG 22374 .......... .......... .......... .......... .......... .......... 114 GCATACCTGG ATAGCGCACG AAATTTGGCC TCATAAGCGG CAACAGACAT CCTTCCTTGC 22314 .......... .......... .......... .......... .......... .......... 114 TATAGGCTCA AGAACTCATC TCTCCTCCTA TCCCTCAAAG TCCGGGGTAT ATACTTCTCC 22254 .......... .......... .......... .......... .......... .......... 114 ATAAATAAGC TAGAGAATGA TTCCCAAGTC ATAGGTGGTG CCTGTGCTGG TTGACACTCA 22194 .......... .......... .......... .......... .......... .......... 114 ACATACGACC GCCACCACAT TTTGGCATTC CCCTGAAACT GGTAGGTCAC AAAATCAACA 22134 .......... .......... .......... .......... .......... .......... 114 CCGAATCGTT CTACTATGTC CATCTTATGT AGCAGCTCAT GACAATCAAC CAGAAAATCA 22074 .......... .......... .......... .......... .......... .......... 114 TAGGCATCCT CAGATTTAGC ACCCTTGAAG ACAGGAGGTT TCAACTTTAA GAATTTAGTG 22014 .......... .......... .......... .......... .......... .......... 114 AAAAGTTCAT GTTGATCACT TGTCATTATA GACCCTGTAG TCAATCGAGG AAACGTGCCT 21954 .......... .......... .......... .......... .......... .......... 114 ACTTCCAATG AGGCATCCAT GCGGGGAGCC ACAACAGTTG CATGTTGTAC TCCTGGAACC 21894 .......... .......... .......... .......... .......... .......... 114 TGAGGTGCTG GTACAAGAAA CACTGGAGGT GTCTGGCCTC GATCAGATAA CCCGCTAAGA 21834 .......... .......... .......... .......... .......... .......... 114 TAAGTAAGAA CCTGGTTGAT CATCTCTGGG GTAGGTTGGG GTGGTAATTC CTCATCTTGC 21774 .......... .......... .......... .......... .......... .......... 114 ACCTGTTTAT TTTCCCCTTC CTCACCCTCT CTTACTACCT CATCAGTCGG TGGAGGAGTC 21714 .......... .......... .......... .......... .......... .......... 114 ACCACCCTAT TACTAGCTTG ACCAGGTGTT TGTCCTCCAC CTCTAGAGAT CGTCCTCTTG 21654 .......... .......... .......... .......... .......... .......... 114 CGACCTCTAC CACGACCTCT TGCCACTGCT CCTCCTCGAG CTACAACCCC AATGTTTGGC 21594 .......... .......... .......... .......... .......... .......... 114 TCAGACGCAC GCTATCTTGC CGGTGTTGGT GTTGGCACAG TTGTTTCTCT AGTTCTAACC 21534 .......... .......... .......... .......... .......... .......... 114 ATATGCGAAA TAGAGTGAGG ATGTCAGATA CCAATTTGTA TCACCTAGAT ACCACTTGGA 21474 .......... .......... .......... .......... .......... .......... 114 TCCAAGTAAT AGCACGAAAG AAGGAAAGAA TGGAATTTTG CTAAAGTCCT ATAGCCTCTC 21414 .......... .......... .......... .......... .......... .......... 114 GAAGAAAAGT AAGGGCGTCC CCCTACCGTT CCTCAAGACT CTACTAGACT TGTTCTTGTG 21354 .......... .......... .......... .......... .......... .......... 114 TGATGAGACC AACGAACCTA ACGCTCTGAT ACCAAGTTTG TCACGACCCA AAACGATCCG 21294 .......... .......... .......... .......... .......... .......... 114 TAAGTGGCAC CCACCCTTAC TCTCCTAGGT GAGCGAACCA ACAAATCTAA ACCCCAACAT 21234 .......... .......... .......... .......... .......... .......... 114 TTACCAGTAT ATCAACTATA AATAATATAA ATAATGCGGA AGCTCCAAAA CTCATTACGA 21174 .......... .......... .......... .......... .......... .......... 114 AATTAATTAA ATCAACATCT AAAGTTAAAT ACTTATTATT CCCAAAATCT GTAAGTCATC 21114 .......... .......... .......... .......... .......... .......... 114 ACACCAAGAA CATCTATCCT CGAATTTCTA AATCTAAGAG TATTCAAGAA GCTAAAAATA 21054 .......... .......... .......... .......... .......... .......... 114 GTAAAAAGAT GGTCCATGTC CGAACTTCAA GACATCAAGA CGTGAAGGAG AGAATCCAGC 20994 .......... .......... .......... .......... .......... .......... 114 ACGAGCTAGG AATAATAGCT CACCCTGAAT TCTGATATGC TAAAGACCGG CTAGATCTGA 20934 .......... .......... .......... .......... .......... .......... 114 TGACGAGTCG AAGTCGATGG CACGCTTGCT GCACTCCACA AATAACAAAG AAGAAAATTA 20874 .......... .......... .......... .......... .......... .......... 114 CAAGTAGGGG TCAGTACAAG GAACACGTAC TGAGTAGGTA TCATCGGCCA ACTCAAAATA 20814 .......... .......... .......... .......... .......... .......... 114 GAAAACAATA TATACTGAAT AATAATATAA AATCAACCAT AATACTTAAC AGGTGACAAT 20754 .......... .......... .......... .......... .......... .......... 114 CAACAAGTAT AAGAACCATT GACAACAACA GCAAGCACAT CTATGAGGAC TCAAGCCTCC 20694 .......... .......... .......... .......... .......... .......... 114 ACACCATACT CATTTGGGAA ATAGGTTCTT TGAATTTGAG TACATTAACA TAATTCAAGA 20634 .......... .......... .......... .......... .......... .......... 114 TTCATTCTCT TTATCATTAT CGTGTCGGAA CGTTACACCC GATCCCCTAC TACTACCGTG 20574 .......... .......... .......... .......... .......... .......... 114 TCGGAACGTG ACACTCTGAT CCCCTAATAC TACCGTGTCA GAACGTGACA CCCGATCCCC 20514 .......... .......... .......... .......... .......... .......... 114 TAATACTACC GTGTCAGAAT GTGACACTCC GATCCCCTAA TACTACCGTG TCGAAACATG 20454 .......... .......... .......... .......... .......... .......... 114 ACACCCAATC CATTTATCTC ATTATTTTAG TTCATCAAGC CTTCTTTATG TCAAGGCGCC 20394 .......... .......... .......... .......... .......... .......... 114 ATCTTAATAG AGAGGATTTA AGATTGAAGA TTCAACAGTT TCATCATTCT GACCACCACA 20334 .......... .......... .......... .......... .......... .......... 114 ATTACACAAT CACAACATAC AAACACACAA TCAAGCATAT AGAAGACTTT ACAATACCAC 20274 .......... .......... .......... .......... .......... .......... 114 CCAATACATA TCGATCACTA TTTAGAGTTT ATCTATCATA TATAAATAAA TCATAACCTA 20214 .......... .......... .......... .......... .......... .......... 114 CCTCCACTGA AGAATCGTGA TCAAGCAAGC TACCTTCCCA ATGCCTTTGC TTTCCTCTTC 20154 .......... .......... .......... .......... .......... .......... 114 GTTCTCTCTT TCTCGCTCGT TCTCCCTCTG TGTTTCTTTT TATTTTTCTT ACTCAAAATC 20094 .......... .......... .......... .......... .......... .......... 114 TTGTTCTTTT ACCCTAAATG TCATATAATC AATTATAAAA GATGATAAAA GTACCTCACT 20034 .......... .......... .......... .......... .......... .......... 114 ATTTATTCCC TTATTAACTT CTTTAACCCC CAAGTAAATA AATTATTAAA CTTACCCCAC 19974 .......... .......... .......... .......... .......... .......... 114 TAATTCCATA ATTATAATCA TGAATAGTCC AAAACACCCC TTTAAAACTT TTAGCAGAAA 19914 .......... .......... .......... .......... .......... .......... 114 TCCGACCCAG TCGAGGTTAC GCAGCTTGTG ACGGTCCGTT GTGTCTACGA CGGTCCGTGC 19854 .......... .......... .......... .......... .......... .......... 114 TGTAGTTCCG TCGCGGAGTT CAGAGAGTCG CTCCCAGTAC CCAGATTTTC AGAGTTGAAG 19794 .......... .......... .......... .......... .......... .......... 114 TGTTTTGGAA CGGAGACGCT CGACGGACCG TCGTGCCTGT GACGGTTTGT CCTACCTGCC 19734 .......... .......... .......... .......... .......... .......... 114 GTCGAGGGTA ATGAGGAGAG CAACAGAAGA AATTACACAA GTATGGGACG ACGGAGTCCA 19674 .......... .......... .......... .......... .......... .......... 114 TCACGGTCCA TCGTGACCAT GACGGTCCGT CGTGACCATG ACGGTCCGTC GCGTGATCCG 19614 .......... .......... .......... .......... .......... .......... 114 TCGACCCAGT CAGTTTTTTA TCAAAAATAG TTCTACTGCT CGAACCGACT AAACAGGTCA 19554 .......... .......... .......... .......... .......... .......... 114 TTACAATTTT CCTACTTTAG TTTTCCCTAT GGCTACCACT GTCCATCTAC TATTTTTTTT 19494 .......... .......... .......... .......... .......... .......... 114 CATGATTTGA TCTTTTAAGT AAAATTATTT GTGGAACTTC TCTTTGAAGA TATCTCTCAC 19434 .......... .......... .......... .......... .......... .......... 114 AAATTAGCGA AAAAGTTAGT TAATTTATTT TTATTTTAAA AAGTAAGATA AATGTTTTTG 19374 .......... .......... .......... .......... .......... .......... 114 CGCTATCAAT ATTTTATAAT ATGTAAATTA TTCGAAACAA ACTTTTAATA AATAAAAATT 19314 .......... .......... .......... .......... .......... .......... 114 AGAGCAAATA TAAAAATCAC TAGATTTTTT TTTAAAAAAA TGGGGCCTTG AAACGGTATA 19254 .......... .......... .......... .......... .......... .......... 114 TATTTTTTTA TTTGAATAGA TTATGGGGGA GAATTAATAG AGGTAAGATT TTTTATTTTA 19194 .......... .......... .......... .......... .......... .......... 114 TCTAATAAGA AAATGACAAA TAATATATTT TTAAAAAATA AATAAAACAA AATAAACTTT 19134 .......... .......... .......... .......... .......... .......... 114 GGTTGTTAGT CATAAAAATA TAAGTTATTC AAAAAGGTGA ATGAAAGAGT ATAAGTGAGT 19074 .......... .......... .......... .......... .......... .......... 114 CAAAAAGATG AGTGAAGAGG CATAGCTAAG CCAAAAAAGT GAATGTAAGG GTATTTCTAG 19014 .......... .......... .......... .......... .......... .......... 114 ACCAAAAGAT TGATGAAGGA TATTTTTAGA CATAGTTCAA GGATAGTTTT GGTCCTTTTT 18954 .......... .......... .......... .......... .......... .......... 114 CGTTTAAATA ATCTCATATT TAGATATTGA GTTATTTGCA GGGACGATTC AATATAATTC 18894 .......... .......... .......... .......... .......... .......... 114 GAGGCCTAAA TTTTAAATAA ACTCTATCTG TATTTATTTA TTTTTCTTTT TAGATGTAAA 18834 .......... .......... .......... .......... .......... .......... 114 TTGTATTTAT TTATTTTTCT TTTTAGATGT AAGTTATTAC TTAAGTATCT TTTTTTGTAA 18774 .......... .......... .......... .......... .......... .......... 114 ATGGAAAAGG GCTAAAAATG CCCTTAACTT AGTGGAAATG GTTCAAAATA CCATCCTTCT 18714 .......... .......... .......... .......... .......... .......... 114 ACCTTTTGAG TTAAAAATAC CCTCCACCTT TATTTTGGTT CAAAGATGCC TTTCCTTCCA 18654 .......... .......... .......... .......... .......... .......... 114 CCTTTTGATT TAATAATACT CTTAACCCCC CATTTAATTA AATTTATAAA ATAAAAAATT 18594 .......... .......... .......... .......... .......... .......... 114 CTTAATATTA GCTCATTCCA AAATCTTTAT GATAAATATA TCTAAAAAAT AAAATAAAAA 18534 | |||| | ||||||| || .......... .......... .......... .......... TATAAATCGT AAAATAATAA 134 ATTTATTATA TGTATAAAAA GCAAAAATAA AAATAAAATT TCTCAAAGTT CTTATTCTTT 18474 |||| || ||| || | TAATATT-TA AGTAGGAAGA .......... .......... .......... .......... 153 GTATTAAAAT AATAAGACAA TAAAAATCTT AAGATTCTTA TTCTTCATTT TTGCGCAAAA 18414 .......... .......... .......... .......... .......... .......... 153 AAATCTTTAT TTTATTTTAT GTTTTATACA TATTATTTAA TATTTTAATT TGTGAGAAAT 18354 .......... .......... .......... .......... .......... .......... 153 TTTTTTAAGT TATTTGGATT AAATTTTTAA ATTATATTGA GAAAATGCAC AAGTATTCCC 18294 .......... .......... .......... .......... .......... .......... 153 TCAAACTATG TCTGAAATCC CAGAGACACA CTTATACTAT ATTAAGGTCA TATTACCCCC 18234 .......... .......... .......... .......... .......... .......... 153 TGAACTTATT TTATAAGTAA TTTTCTACCC CTTTTGACCT ACGTGGCTCT AGCTTGAAAA 18174 .......... .......... .......... .......... .......... .......... 153 AAAAGTCAAT CAGCGTTGGA CCCACAAGAT AGTGCCACAT AGACCGAAAA GGGCTAGAAA 18114 .......... .......... .......... .......... .......... .......... 153 ATTATTAATA AAATAAGTTC AGGGATAATA GGACCTTAGT ATAGTGTAAG TATGACTTTA 18054 .......... .......... .......... .......... .......... .......... 153 AAATTTCAGG CATAAATTGA GAGGGTACTT GTGCATTATC TCAATAATAT TCAAATCTTT 17994 .......... .......... .......... .......... .......... .......... 153 ACATTAATAT CTAATTTGAT GTAATATTTT AATAATAATA ATGTAACGAC CTATTTAGTC 17934 | | | |||||||||| || ||||||| .......... .......... .......... ....ACATAA ATGTAACGAC CTGTTTAGTC 179 GTTTTGAGCA GCAGATTTTA TTTTTGGAAA AACTGGCTGA GACGACGGAT CCCACGATGG 17874 |||||||| | |||||||||| |||||||||| ||| || ||| ||||||||| ||||||| || GTTTTGAGTA GCAGATTTTA TTTTTGGAAA AACAGGTTGA GACGACGGAA CCCACGACGG 239 ACCGTCATGG GCACGATGGA CCGTCGAGGG GGTCTCGTTC CAAAATACAT AG-AATTCTG 17815 ||||||||| |||||||||| ||||||| || |||||||| ||||| || | || ||||||| ACCGTCATGA GCACGATGGA CCGTCGA-GG AGTCTCGTTT CAAAACACTT AGAAATTCTG 298 AAATTTGGGT TTTGAAATCG ACTCTCTGAA CTTCGTGATG AAGTGGCAGG ACGGACCGTC 17755 ||||| || | |||||| |||||||||| |||||| | | | ||||| | |||||||||| AAATTGGGTA CTAAAAATCG ACTCTCTGAA CTTCGTAACG GAATGGCACG ACGGACCGTC 358 ACAGGCATGA CGGGCCGTCA CAGTCTCTTC AG-AAAATTT CAGTCTCTGA ACTCTGTGAC 17696 || ||| ||| ||| |||||| ||| |||| | || || ||||||||| || || ||| ACGGGCGTGA CGGACCGTCA CAGATTCTTT GGTGGAAATT GAGTCTCTGA ACCTTGCGAC 418 GGAAGCAGCA GGACGGACCG TCGCAGGCAC GACGACCCGT CACAGACTGC GTAATCCCAG 17636 | | | ||| |||||||||| |||||||||| |||| || | ||||| ||| |||||||||| -G-ACCTGCA GGACGGACCG TCGCAGGCAC GACGGGCCAT CACAGGTTGC GTAATCCCAG 476 GCTGAGTCGG ATTTCTTTAA ATGTTTTAAG GGGGCGTTTT GGACTATTCC TGCTATAATT 17576 ||| ||||| ||||||||| | ||||||| ||| |||||| |||||||||| | || ||||| TCTGGGTCGG ATTTCTTTAC ACGTTTTAA- GGGACGTTTT GGACTATTCC TACTTTAATT 535 ATAAATTTAG TGGGTTAATG TTAATAA-TT TAACTACTTG AGGGTTAAAA GAGATAACCT 17517 ||||| |||| |||||| ||| ||||||| | ||| ||| || ||||||||| ||| |||||| ATAAAGTTAG TGGGTTTATG TTAATAAGTC TAATTACCTG GGGGTTAAAA GAGGTAACCT 595 TGAATTAGTT AGTGGGTTAA ACTCATCATC TTTCATACTT AATTATATGC TAATTAGGGT 17457 ||| | | || ||||||||| | |||| ||| || ||| |||||||||| |||||||||| TGAGTAAATT AGTGGGTTAT TAT-TCCATC TTTTATTCTT AATTATATGC TAATTAGGGT 654 AAAAGAAAGA AGGTTTGAAT AAGAAAAAGA AAAGAACAGA AAGAGAGGGA GAAACGATCG 17397 ||||| ||| ||| |||||| |||||||||| |||||||||| ||||||| || | |||| AAAAG-AAGG AGGGTTGAAT AAGAAAAAGA AAAGAACAGA AAGAGAGAGA AGGAGAATCG 713 A 17396 | A 714 hqPGS_C06HBa0153O03.1-1-_SGN-E550335- (17959 17396) ******************************************************************************** EST sequence 8 -strand 729 n (File: SGN-E550212-) 1 GCCCCCCCCC GAGTTTTTTT TTTTTTTTTA TGAATTAGCT CAATGAAAAA TGAGTAAATT 61 TTTTATATTT TATGGCATAA TTTTTTCATT AATTCATGGT TGAGAAAATC TTTGTTTCTA 121 ATAGTGTTAT AAATCGTAAA ATAATAATAA TATTTAAGTA GGAAGAACAT AAATGTAACG 181 ACCTGTTTAG TCGTTTTGAG TAGCAGATTT TATTTTTGGA AAAACAGGTT GAGACGACGG 241 AACCCACGAC GGACCGTCAT GAGCACGATG GACCGTCGAG GAGTCTCGTT TCAAAACACT 301 TAGAAATTCT GAAATTGGGT ACTAAAAATC GACTCTCTGA ACTTCGTAAC GGAATGGCAC 361 GACGGACCGT CACGGGCGTG ACGGACCGTC ACAGACTCTT TGGTGGAAAT TGAGTCTCTG 421 AACCTTGCGA CGACCTGCAG GACGGACCGT CGCAGGCACG ACGGGCCATC ACAGGTTGCG 481 TAATCCCAGT CTGGGTCGGA TTTCTTTACA CGTTTTAAGG GACGTTTTGG ACTATTCCTA 541 CTTTAATTAT AAAGTTAGTG GGTTTATGTT AATAAGTCTA ATTACCTGGG GGTTAAAAGA 601 GGTAACCTTG AGTAAATTAG TGGGTTATTA TTCCATCTTT TATTCTTAAT TATATGCTAA 661 TTAGGGTAAA AGAAGGAGGG TTTGAATAAG AAAAAGAAAA GAACAGAAAG AGAGAGAAGG 721 AGAATCGAT Predicted gene structure (within gDNA segment 24529 to 16660): Exon 1 21799 21794 ( 6 n); cDNA 99 104 ( 6 n); score: 0.833 Intron 1 21793 18575 (3219 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.60) Exon 2 18574 18514 ( 61 n); cDNA 105 166 ( 62 n); score: 0.607 Intron 2 18513 17960 ( 554 n); Pd: 0.845 (s: 0.56), Pa: 0.000 (s: 0.90) Exon 3 17959 17396 ( 564 n); cDNA 167 728 ( 562 n); score: 0.841 PPA cDNA 31 14 MATCH C06HBa0153O03.1-1- SGN-E550212- 0.818 631 0.866 C PGS_C06HBa0153O03.1-1-_SGN-E550212- (21799 21794,18574 18514,17959 17396) Alignment (genomic DNA sequence = upper lines): GTTGGGGTGG TAATTCCTCA TCTTGCACCT GTTTATTTTC CCCTTCCTCA CCCTCTCTTA 21740 |||| | GTTGAG.... .......... .......... .......... .......... .......... 104 CTACCTCATC AGTCGGTGGA GGAGTCACCA CCCTATTACT AGCTTGACCA GGTGTTTGTC 21680 .......... .......... .......... .......... .......... .......... 104 CTCCACCTCT AGAGATCGTC CTCTTGCGAC CTCTACCACG ACCTCTTGCC ACTGCTCCTC 21620 .......... .......... .......... .......... .......... .......... 104 CTCGAGCTAC AACCCCAATG TTTGGCTCAG ACGCACGCTA TCTTGCCGGT GTTGGTGTTG 21560 .......... .......... .......... .......... .......... .......... 104 GCACAGTTGT TTCTCTAGTT CTAACCATAT GCGAAATAGA GTGAGGATGT CAGATACCAA 21500 .......... .......... .......... .......... .......... .......... 104 TTTGTATCAC CTAGATACCA CTTGGATCCA AGTAATAGCA CGAAAGAAGG AAAGAATGGA 21440 .......... .......... .......... .......... .......... .......... 104 ATTTTGCTAA AGTCCTATAG CCTCTCGAAG AAAAGTAAGG GCGTCCCCCT ACCGTTCCTC 21380 .......... .......... .......... .......... .......... .......... 104 AAGACTCTAC TAGACTTGTT CTTGTGTGAT GAGACCAACG AACCTAACGC TCTGATACCA 21320 .......... .......... .......... .......... .......... .......... 104 AGTTTGTCAC GACCCAAAAC GATCCGTAAG TGGCACCCAC CCTTACTCTC CTAGGTGAGC 21260 .......... .......... .......... .......... .......... .......... 104 GAACCAACAA ATCTAAACCC CAACATTTAC CAGTATATCA ACTATAAATA ATATAAATAA 21200 .......... .......... .......... .......... .......... .......... 104 TGCGGAAGCT CCAAAACTCA TTACGAAATT AATTAAATCA ACATCTAAAG TTAAATACTT 21140 .......... .......... .......... .......... .......... .......... 104 ATTATTCCCA AAATCTGTAA GTCATCACAC CAAGAACATC TATCCTCGAA TTTCTAAATC 21080 .......... .......... .......... .......... .......... .......... 104 TAAGAGTATT CAAGAAGCTA AAAATAGTAA AAAGATGGTC CATGTCCGAA CTTCAAGACA 21020 .......... .......... .......... .......... .......... .......... 104 TCAAGACGTG AAGGAGAGAA TCCAGCACGA GCTAGGAATA ATAGCTCACC CTGAATTCTG 20960 .......... .......... .......... .......... .......... .......... 104 ATATGCTAAA GACCGGCTAG ATCTGATGAC GAGTCGAAGT CGATGGCACG CTTGCTGCAC 20900 .......... .......... .......... .......... .......... .......... 104 TCCACAAATA ACAAAGAAGA AAATTACAAG TAGGGGTCAG TACAAGGAAC ACGTACTGAG 20840 .......... .......... .......... .......... .......... .......... 104 TAGGTATCAT CGGCCAACTC AAAATAGAAA ACAATATATA CTGAATAATA ATATAAAATC 20780 .......... .......... .......... .......... .......... .......... 104 AACCATAATA CTTAACAGGT GACAATCAAC AAGTATAAGA ACCATTGACA ACAACAGCAA 20720 .......... .......... .......... .......... .......... .......... 104 GCACATCTAT GAGGACTCAA GCCTCCACAC CATACTCATT TGGGAAATAG GTTCTTTGAA 20660 .......... .......... .......... .......... .......... .......... 104 TTTGAGTACA TTAACATAAT TCAAGATTCA TTCTCTTTAT CATTATCGTG TCGGAACGTT 20600 .......... .......... .......... .......... .......... .......... 104 ACACCCGATC CCCTACTACT ACCGTGTCGG AACGTGACAC TCTGATCCCC TAATACTACC 20540 .......... .......... .......... .......... .......... .......... 104 GTGTCAGAAC GTGACACCCG ATCCCCTAAT ACTACCGTGT CAGAATGTGA CACTCCGATC 20480 .......... .......... .......... .......... .......... .......... 104 CCCTAATACT ACCGTGTCGA AACATGACAC CCAATCCATT TATCTCATTA TTTTAGTTCA 20420 .......... .......... .......... .......... .......... .......... 104 TCAAGCCTTC TTTATGTCAA GGCGCCATCT TAATAGAGAG GATTTAAGAT TGAAGATTCA 20360 .......... .......... .......... .......... .......... .......... 104 ACAGTTTCAT CATTCTGACC ACCACAATTA CACAATCACA ACATACAAAC ACACAATCAA 20300 .......... .......... .......... .......... .......... .......... 104 GCATATAGAA GACTTTACAA TACCACCCAA TACATATCGA TCACTATTTA GAGTTTATCT 20240 .......... .......... .......... .......... .......... .......... 104 ATCATATATA AATAAATCAT AACCTACCTC CACTGAAGAA TCGTGATCAA GCAAGCTACC 20180 .......... .......... .......... .......... .......... .......... 104 TTCCCAATGC CTTTGCTTTC CTCTTCGTTC TCTCTTTCTC GCTCGTTCTC CCTCTGTGTT 20120 .......... .......... .......... .......... .......... .......... 104 TCTTTTTATT TTTCTTACTC AAAATCTTGT TCTTTTACCC TAAATGTCAT ATAATCAATT 20060 .......... .......... .......... .......... .......... .......... 104 ATAAAAGATG ATAAAAGTAC CTCACTATTT ATTCCCTTAT TAACTTCTTT AACCCCCAAG 20000 .......... .......... .......... .......... .......... .......... 104 TAAATAAATT ATTAAACTTA CCCCACTAAT TCCATAATTA TAATCATGAA TAGTCCAAAA 19940 .......... .......... .......... .......... .......... .......... 104 CACCCCTTTA AAACTTTTAG CAGAAATCCG ACCCAGTCGA GGTTACGCAG CTTGTGACGG 19880 .......... .......... .......... .......... .......... .......... 104 TCCGTTGTGT CTACGACGGT CCGTGCTGTA GTTCCGTCGC GGAGTTCAGA GAGTCGCTCC 19820 .......... .......... .......... .......... .......... .......... 104 CAGTACCCAG ATTTTCAGAG TTGAAGTGTT TTGGAACGGA GACGCTCGAC GGACCGTCGT 19760 .......... .......... .......... .......... .......... .......... 104 GCCTGTGACG GTTTGTCCTA CCTGCCGTCG AGGGTAATGA GGAGAGCAAC AGAAGAAATT 19700 .......... .......... .......... .......... .......... .......... 104 ACACAAGTAT GGGACGACGG AGTCCATCAC GGTCCATCGT GACCATGACG GTCCGTCGTG 19640 .......... .......... .......... .......... .......... .......... 104 ACCATGACGG TCCGTCGCGT GATCCGTCGA CCCAGTCAGT TTTTTATCAA AAATAGTTCT 19580 .......... .......... .......... .......... .......... .......... 104 ACTGCTCGAA CCGACTAAAC AGGTCATTAC AATTTTCCTA CTTTAGTTTT CCCTATGGCT 19520 .......... .......... .......... .......... .......... .......... 104 ACCACTGTCC ATCTACTATT TTTTTTCATG ATTTGATCTT TTAAGTAAAA TTATTTGTGG 19460 .......... .......... .......... .......... .......... .......... 104 AACTTCTCTT TGAAGATATC TCTCACAAAT TAGCGAAAAA GTTAGTTAAT TTATTTTTAT 19400 .......... .......... .......... .......... .......... .......... 104 TTTAAAAAGT AAGATAAATG TTTTTGCGCT ATCAATATTT TATAATATGT AAATTATTCG 19340 .......... .......... .......... .......... .......... .......... 104 AAACAAACTT TTAATAAATA AAAATTAGAG CAAATATAAA AATCACTAGA TTTTTTTTTA 19280 .......... .......... .......... .......... .......... .......... 104 AAAAAATGGG GCCTTGAAAC GGTATATATT TTTTTATTTG AATAGATTAT GGGGGAGAAT 19220 .......... .......... .......... .......... .......... .......... 104 TAATAGAGGT AAGATTTTTT ATTTTATCTA ATAAGAAAAT GACAAATAAT ATATTTTTAA 19160 .......... .......... .......... .......... .......... .......... 104 AAAATAAATA AAACAAAATA AACTTTGGTT GTTAGTCATA AAAATATAAG TTATTCAAAA 19100 .......... .......... .......... .......... .......... .......... 104 AGGTGAATGA AAGAGTATAA GTGAGTCAAA AAGATGAGTG AAGAGGCATA GCTAAGCCAA 19040 .......... .......... .......... .......... .......... .......... 104 AAAAGTGAAT GTAAGGGTAT TTCTAGACCA AAAGATTGAT GAAGGATATT TTTAGACATA 18980 .......... .......... .......... .......... .......... .......... 104 GTTCAAGGAT AGTTTTGGTC CTTTTTCGTT TAAATAATCT CATATTTAGA TATTGAGTTA 18920 .......... .......... .......... .......... .......... .......... 104 TTTGCAGGGA CGATTCAATA TAATTCGAGG CCTAAATTTT AAATAAACTC TATCTGTATT 18860 .......... .......... .......... .......... .......... .......... 104 TATTTATTTT TCTTTTTAGA TGTAAATTGT ATTTATTTAT TTTTCTTTTT AGATGTAAGT 18800 .......... .......... .......... .......... .......... .......... 104 TATTACTTAA GTATCTTTTT TTGTAAATGG AAAAGGGCTA AAAATGCCCT TAACTTAGTG 18740 .......... .......... .......... .......... .......... .......... 104 GAAATGGTTC AAAATACCAT CCTTCTACCT TTTGAGTTAA AAATACCCTC CACCTTTATT 18680 .......... .......... .......... .......... .......... .......... 104 TTGGTTCAAA GATGCCTTTC CTTCCACCTT TTGATTTAAT AATACTCTTA ACCCCCCATT 18620 .......... .......... .......... .......... .......... .......... 104 TAATTAAATT TATAAAATAA AAAATTCTTA ATATTAGCTC ATTCCAAAAT CTTTATGATA 18560 ||||| |||| | .......... .......... .......... .......... .....AAAAT CTTTGTTTCT 119 AATA-T-ATC TAAAAAATAA AATAAAAAAT TTATTATATG TATAAAAAGC AAAAATAAAA 18502 |||| | | |||| ||| ||||| || |||| || | || || | AATAGTGTTA TAAATCGTAA AATAATAATA ATATT-TAAG TAGGAAGA.. .......... 166 ATAAAATTTC TCAAAGTTCT TATTCTTTGT ATTAAAATAA TAAGACAATA AAAATCTTAA 18442 .......... .......... .......... .......... .......... .......... 166 GATTCTTATT CTTCATTTTT GCGCAAAAAA ATCTTTATTT TATTTTATGT TTTATACATA 18382 .......... .......... .......... .......... .......... .......... 166 TTATTTAATA TTTTAATTTG TGAGAAATTT TTTTAAGTTA TTTGGATTAA ATTTTTAAAT 18322 .......... .......... .......... .......... .......... .......... 166 TATATTGAGA AAATGCACAA GTATTCCCTC AAACTATGTC TGAAATCCCA GAGACACACT 18262 .......... .......... .......... .......... .......... .......... 166 TATACTATAT TAAGGTCATA TTACCCCCTG AACTTATTTT ATAAGTAATT TTCTACCCCT 18202 .......... .......... .......... .......... .......... .......... 166 TTTGACCTAC GTGGCTCTAG CTTGAAAAAA AAGTCAATCA GCGTTGGACC CACAAGATAG 18142 .......... .......... .......... .......... .......... .......... 166 TGCCACATAG ACCGAAAAGG GCTAGAAAAT TATTAATAAA ATAAGTTCAG GGATAATAGG 18082 .......... .......... .......... .......... .......... .......... 166 ACCTTAGTAT AGTGTAAGTA TGACTTTAAA ATTTCAGGCA TAAATTGAGA GGGTACTTGT 18022 .......... .......... .......... .......... .......... .......... 166 GCATTATCTC AATAATATTC AAATCTTTAC ATTAATATCT AATTTGATGT AATATTTTAA 17962 .......... .......... .......... .......... .......... .......... 166 TAATAATAAT GTAACGACCT ATTTAGTCGT TTTGAGCAGC AGATTTTATT TTTGGAAAAA 17902 | | ||| |||||||||| ||||||||| |||||| ||| |||||||||| |||||||||| ..ACATAAAT GTAACGACCT GTTTAGTCGT TTTGAGTAGC AGATTTTATT TTTGGAAAAA 224 CTGGCTGAGA CGACGGATCC CACGATGGAC CGTCATGGGC ACGATGGACC GTCGAGGGGG 17842 | || ||||| ||||||| || ||||| |||| ||||||| || |||||||||| ||||| || | CAGGTTGAGA CGACGGAACC CACGACGGAC CGTCATGAGC ACGATGGACC GTCGA-GGAG 283 TCTCGTTCCA AAATACATAG -AATTCTGAA ATTTGGGTTT TGAAATCGAC TCTCTGAACT 17783 ||||||| || ||| || ||| ||||||||| ||| || | |||||||| |||||||||| TCTCGTTTCA AAACACTTAG AAATTCTGAA ATTGGGTACT AAAAATCGAC TCTCTGAACT 343 TCGTGATGAA GTGGCAGGAC GGACCGTCAC AGGCATGACG GGCCGTCACA GTCTCTTCAG 17723 |||| | | | ||||| ||| |||||||||| ||| ||||| | |||||||| | ||||| | TCGTAACGGA ATGGCACGAC GGACCGTCAC GGGCGTGACG GACCGTCACA GACTCTTTGG 403 -AAAATTTCA GTCTCTGAAC TCTGTGACGG AAGCAGCAGG ACGGACCGTC GCAGGCACGA 17664 || || | |||||||||| || ||| | | | ||||| |||||||||| |||||||||| TGGAAATTGA GTCTCTGAAC CTTGCGAC-G -ACCTGCAGG ACGGACCGTC GCAGGCACGA 461 CGACCCGTCA CAGACTGCGT AATCCCAGGC TGAGTCGGAT TTCTTTAAAT GTTTTAAGGG 17604 || || ||| ||| ||||| |||||||| | || ||||||| ||||||| | ||||||| || CGGGCCATCA CAGGTTGCGT AATCCCAGTC TGGGTCGGAT TTCTTTACAC GTTTTAA-GG 520 GGCGTTTTGG ACTATTCCTG CTATAATTAT AAATTTAGTG GGTTAATGTT AATAA-TTTA 17545 | |||||||| ||||||||| || ||||||| ||| |||||| |||| ||||| ||||| | || GACGTTTTGG ACTATTCCTA CTTTAATTAT AAAGTTAGTG GGTTTATGTT AATAAGTCTA 580 ACTACTTGAG GGTTAAAAGA GATAACCTTG AATTAGTTAG TGGGTTAAAC TCATCATCTT 17485 | ||| || | |||||||||| | |||||||| | | | |||| ||||||| | |||||| ATTACCTGGG GGTTAAAAGA GGTAACCTTG AGTAAATTAG TGGGTTATTA T-TCCATCTT 639 TCATACTTAA TTATATGCTA ATTAGGGTAA AAGAAAGAAG GTTTGAATAA GAAAAAGAAA 17425 | || ||||| |||||||||| |||||||||| ||||| || | |||||||||| |||||||||| TTATTCTTAA TTATATGCTA ATTAGGGTAA AAGAAGGAGG GTTTGAATAA GAAAAAGAAA 699 AGAACAGAAA GAGAGGGAGA AACGATCGA 17396 |||||||||| ||||| || | ||||| AGAACAGAAA GAGAGAGAAG GAGAATCGA 728 hqPGS_C06HBa0153O03.1-1-_SGN-E550212- (17959 17396) ******************************************************************************** EST sequence 9 -strand 710 n (File: SGN-E550065-) 1 TTTTTTTCTT ATGAATCAGC TCAATGAAAA ATGAGTAAAT TTTTTATATT TCATGGCATA 61 ATTTTTTCAT TAATTCATGG TTGAGAAAAT CTTTGTTTCT AATAGTGTTA TAAATCGTAA 121 AATAATAATA ATATTTAAGT AGGAAGAACA TAAATGTAAC GACCTGTTTA GTCGTTTTGA 181 GTAGCAGATT TTATTTTTGG AAAAACAGGT TGAGACGACG GAACCCACGA CGGACCGTCA 241 TGAGCACGAT GGACCGTCGA GGAGTCTCGT TTCAAAACAC TTAGAAATTC TGAAATTGGG 301 TACTAAAAAT CGACTCTCTG AACTTCGTAA CGGAATGGCA CGACGGACCG TCACGGGCGT 361 GACGGACCGT CACAGACTCT TTGGTGGAAA TTGAGTCTCT GAACCTTGCG ACGACCTGCA 421 GGACGGACCG TCGCAGGCAC GACGGGCCAT CACAGGTTGC GTAATCCCAG TCTGGGTCGG 481 ATTTCTTTAC ACGTTTTAAG GGACGTTTTG GACTATTCCT ACTTTAATTA TAAAGTTAGT 541 GGGTTTATGT TAATAAGTCT AATTACCTGG GGGTTAAAAG AGGTAACCTT GAGTAAATTA 601 GTGGGTTATT ATTCCATCTT TTATTCTTAA TTATATGCTA ATTAGGGTAA AAGAAGGAGG 661 GTTTGAATAA GAAAAAGAAA AGAACAGAAA GAGAGAGAAG GAGAATCGAT Predicted gene structure (within gDNA segment 24339 to 16660): Exon 1 21799 21794 ( 6 n); cDNA 80 85 ( 6 n); score: 0.833 Intron 1 21793 18575 (3219 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.60) Exon 2 18574 18514 ( 61 n); cDNA 86 147 ( 62 n); score: 0.607 Intron 2 18513 17960 ( 554 n); Pd: 0.845 (s: 0.56), Pa: 0.000 (s: 0.90) Exon 3 17959 17396 ( 564 n); cDNA 148 709 ( 562 n); score: 0.841 MATCH C06HBa0153O03.1-1- SGN-E550065- 0.818 631 0.889 C PGS_C06HBa0153O03.1-1-_SGN-E550065- (21799 21794,18574 18514,17959 17396) Alignment (genomic DNA sequence = upper lines): GTTGGGGTGG TAATTCCTCA TCTTGCACCT GTTTATTTTC CCCTTCCTCA CCCTCTCTTA 21740 |||| | GTTGAG.... .......... .......... .......... .......... .......... 85 CTACCTCATC AGTCGGTGGA GGAGTCACCA CCCTATTACT AGCTTGACCA GGTGTTTGTC 21680 .......... .......... .......... .......... .......... .......... 85 CTCCACCTCT AGAGATCGTC CTCTTGCGAC CTCTACCACG ACCTCTTGCC ACTGCTCCTC 21620 .......... .......... .......... .......... .......... .......... 85 CTCGAGCTAC AACCCCAATG TTTGGCTCAG ACGCACGCTA TCTTGCCGGT GTTGGTGTTG 21560 .......... .......... .......... .......... .......... .......... 85 GCACAGTTGT TTCTCTAGTT CTAACCATAT GCGAAATAGA GTGAGGATGT CAGATACCAA 21500 .......... .......... .......... .......... .......... .......... 85 TTTGTATCAC CTAGATACCA CTTGGATCCA AGTAATAGCA CGAAAGAAGG AAAGAATGGA 21440 .......... .......... .......... .......... .......... .......... 85 ATTTTGCTAA AGTCCTATAG CCTCTCGAAG AAAAGTAAGG GCGTCCCCCT ACCGTTCCTC 21380 .......... .......... .......... .......... .......... .......... 85 AAGACTCTAC TAGACTTGTT CTTGTGTGAT GAGACCAACG AACCTAACGC TCTGATACCA 21320 .......... .......... .......... .......... .......... .......... 85 AGTTTGTCAC GACCCAAAAC GATCCGTAAG TGGCACCCAC CCTTACTCTC CTAGGTGAGC 21260 .......... .......... .......... .......... .......... .......... 85 GAACCAACAA ATCTAAACCC CAACATTTAC CAGTATATCA ACTATAAATA ATATAAATAA 21200 .......... .......... .......... .......... .......... .......... 85 TGCGGAAGCT CCAAAACTCA TTACGAAATT AATTAAATCA ACATCTAAAG TTAAATACTT 21140 .......... .......... .......... .......... .......... .......... 85 ATTATTCCCA AAATCTGTAA GTCATCACAC CAAGAACATC TATCCTCGAA TTTCTAAATC 21080 .......... .......... .......... .......... .......... .......... 85 TAAGAGTATT CAAGAAGCTA AAAATAGTAA AAAGATGGTC CATGTCCGAA CTTCAAGACA 21020 .......... .......... .......... .......... .......... .......... 85 TCAAGACGTG AAGGAGAGAA TCCAGCACGA GCTAGGAATA ATAGCTCACC CTGAATTCTG 20960 .......... .......... .......... .......... .......... .......... 85 ATATGCTAAA GACCGGCTAG ATCTGATGAC GAGTCGAAGT CGATGGCACG CTTGCTGCAC 20900 .......... .......... .......... .......... .......... .......... 85 TCCACAAATA ACAAAGAAGA AAATTACAAG TAGGGGTCAG TACAAGGAAC ACGTACTGAG 20840 .......... .......... .......... .......... .......... .......... 85 TAGGTATCAT CGGCCAACTC AAAATAGAAA ACAATATATA CTGAATAATA ATATAAAATC 20780 .......... .......... .......... .......... .......... .......... 85 AACCATAATA CTTAACAGGT GACAATCAAC AAGTATAAGA ACCATTGACA ACAACAGCAA 20720 .......... .......... .......... .......... .......... .......... 85 GCACATCTAT GAGGACTCAA GCCTCCACAC CATACTCATT TGGGAAATAG GTTCTTTGAA 20660 .......... .......... .......... .......... .......... .......... 85 TTTGAGTACA TTAACATAAT TCAAGATTCA TTCTCTTTAT CATTATCGTG TCGGAACGTT 20600 .......... .......... .......... .......... .......... .......... 85 ACACCCGATC CCCTACTACT ACCGTGTCGG AACGTGACAC TCTGATCCCC TAATACTACC 20540 .......... .......... .......... .......... .......... .......... 85 GTGTCAGAAC GTGACACCCG ATCCCCTAAT ACTACCGTGT CAGAATGTGA CACTCCGATC 20480 .......... .......... .......... .......... .......... .......... 85 CCCTAATACT ACCGTGTCGA AACATGACAC CCAATCCATT TATCTCATTA TTTTAGTTCA 20420 .......... .......... .......... .......... .......... .......... 85 TCAAGCCTTC TTTATGTCAA GGCGCCATCT TAATAGAGAG GATTTAAGAT TGAAGATTCA 20360 .......... .......... .......... .......... .......... .......... 85 ACAGTTTCAT CATTCTGACC ACCACAATTA CACAATCACA ACATACAAAC ACACAATCAA 20300 .......... .......... .......... .......... .......... .......... 85 GCATATAGAA GACTTTACAA TACCACCCAA TACATATCGA TCACTATTTA GAGTTTATCT 20240 .......... .......... .......... .......... .......... .......... 85 ATCATATATA AATAAATCAT AACCTACCTC CACTGAAGAA TCGTGATCAA GCAAGCTACC 20180 .......... .......... .......... .......... .......... .......... 85 TTCCCAATGC CTTTGCTTTC CTCTTCGTTC TCTCTTTCTC GCTCGTTCTC CCTCTGTGTT 20120 .......... .......... .......... .......... .......... .......... 85 TCTTTTTATT TTTCTTACTC AAAATCTTGT TCTTTTACCC TAAATGTCAT ATAATCAATT 20060 .......... .......... .......... .......... .......... .......... 85 ATAAAAGATG ATAAAAGTAC CTCACTATTT ATTCCCTTAT TAACTTCTTT AACCCCCAAG 20000 .......... .......... .......... .......... .......... .......... 85 TAAATAAATT ATTAAACTTA CCCCACTAAT TCCATAATTA TAATCATGAA TAGTCCAAAA 19940 .......... .......... .......... .......... .......... .......... 85 CACCCCTTTA AAACTTTTAG CAGAAATCCG ACCCAGTCGA GGTTACGCAG CTTGTGACGG 19880 .......... .......... .......... .......... .......... .......... 85 TCCGTTGTGT CTACGACGGT CCGTGCTGTA GTTCCGTCGC GGAGTTCAGA GAGTCGCTCC 19820 .......... .......... .......... .......... .......... .......... 85 CAGTACCCAG ATTTTCAGAG TTGAAGTGTT TTGGAACGGA GACGCTCGAC GGACCGTCGT 19760 .......... .......... .......... .......... .......... .......... 85 GCCTGTGACG GTTTGTCCTA CCTGCCGTCG AGGGTAATGA GGAGAGCAAC AGAAGAAATT 19700 .......... .......... .......... .......... .......... .......... 85 ACACAAGTAT GGGACGACGG AGTCCATCAC GGTCCATCGT GACCATGACG GTCCGTCGTG 19640 .......... .......... .......... .......... .......... .......... 85 ACCATGACGG TCCGTCGCGT GATCCGTCGA CCCAGTCAGT TTTTTATCAA AAATAGTTCT 19580 .......... .......... .......... .......... .......... .......... 85 ACTGCTCGAA CCGACTAAAC AGGTCATTAC AATTTTCCTA CTTTAGTTTT CCCTATGGCT 19520 .......... .......... .......... .......... .......... .......... 85 ACCACTGTCC ATCTACTATT TTTTTTCATG ATTTGATCTT TTAAGTAAAA TTATTTGTGG 19460 .......... .......... .......... .......... .......... .......... 85 AACTTCTCTT TGAAGATATC TCTCACAAAT TAGCGAAAAA GTTAGTTAAT TTATTTTTAT 19400 .......... .......... .......... .......... .......... .......... 85 TTTAAAAAGT AAGATAAATG TTTTTGCGCT ATCAATATTT TATAATATGT AAATTATTCG 19340 .......... .......... .......... .......... .......... .......... 85 AAACAAACTT TTAATAAATA AAAATTAGAG CAAATATAAA AATCACTAGA TTTTTTTTTA 19280 .......... .......... .......... .......... .......... .......... 85 AAAAAATGGG GCCTTGAAAC GGTATATATT TTTTTATTTG AATAGATTAT GGGGGAGAAT 19220 .......... .......... .......... .......... .......... .......... 85 TAATAGAGGT AAGATTTTTT ATTTTATCTA ATAAGAAAAT GACAAATAAT ATATTTTTAA 19160 .......... .......... .......... .......... .......... .......... 85 AAAATAAATA AAACAAAATA AACTTTGGTT GTTAGTCATA AAAATATAAG TTATTCAAAA 19100 .......... .......... .......... .......... .......... .......... 85 AGGTGAATGA AAGAGTATAA GTGAGTCAAA AAGATGAGTG AAGAGGCATA GCTAAGCCAA 19040 .......... .......... .......... .......... .......... .......... 85 AAAAGTGAAT GTAAGGGTAT TTCTAGACCA AAAGATTGAT GAAGGATATT TTTAGACATA 18980 .......... .......... .......... .......... .......... .......... 85 GTTCAAGGAT AGTTTTGGTC CTTTTTCGTT TAAATAATCT CATATTTAGA TATTGAGTTA 18920 .......... .......... .......... .......... .......... .......... 85 TTTGCAGGGA CGATTCAATA TAATTCGAGG CCTAAATTTT AAATAAACTC TATCTGTATT 18860 .......... .......... .......... .......... .......... .......... 85 TATTTATTTT TCTTTTTAGA TGTAAATTGT ATTTATTTAT TTTTCTTTTT AGATGTAAGT 18800 .......... .......... .......... .......... .......... .......... 85 TATTACTTAA GTATCTTTTT TTGTAAATGG AAAAGGGCTA AAAATGCCCT TAACTTAGTG 18740 .......... .......... .......... .......... .......... .......... 85 GAAATGGTTC AAAATACCAT CCTTCTACCT TTTGAGTTAA AAATACCCTC CACCTTTATT 18680 .......... .......... .......... .......... .......... .......... 85 TTGGTTCAAA GATGCCTTTC CTTCCACCTT TTGATTTAAT AATACTCTTA ACCCCCCATT 18620 .......... .......... .......... .......... .......... .......... 85 TAATTAAATT TATAAAATAA AAAATTCTTA ATATTAGCTC ATTCCAAAAT CTTTATGATA 18560 ||||| |||| | .......... .......... .......... .......... .....AAAAT CTTTGTTTCT 100 AATA-T-ATC TAAAAAATAA AATAAAAAAT TTATTATATG TATAAAAAGC AAAAATAAAA 18502 |||| | | |||| ||| ||||| || |||| || | || || | AATAGTGTTA TAAATCGTAA AATAATAATA ATATT-TAAG TAGGAAGA.. .......... 147 ATAAAATTTC TCAAAGTTCT TATTCTTTGT ATTAAAATAA TAAGACAATA AAAATCTTAA 18442 .......... .......... .......... .......... .......... .......... 147 GATTCTTATT CTTCATTTTT GCGCAAAAAA ATCTTTATTT TATTTTATGT TTTATACATA 18382 .......... .......... .......... .......... .......... .......... 147 TTATTTAATA TTTTAATTTG TGAGAAATTT TTTTAAGTTA TTTGGATTAA ATTTTTAAAT 18322 .......... .......... .......... .......... .......... .......... 147 TATATTGAGA AAATGCACAA GTATTCCCTC AAACTATGTC TGAAATCCCA GAGACACACT 18262 .......... .......... .......... .......... .......... .......... 147 TATACTATAT TAAGGTCATA TTACCCCCTG AACTTATTTT ATAAGTAATT TTCTACCCCT 18202 .......... .......... .......... .......... .......... .......... 147 TTTGACCTAC GTGGCTCTAG CTTGAAAAAA AAGTCAATCA GCGTTGGACC CACAAGATAG 18142 .......... .......... .......... .......... .......... .......... 147 TGCCACATAG ACCGAAAAGG GCTAGAAAAT TATTAATAAA ATAAGTTCAG GGATAATAGG 18082 .......... .......... .......... .......... .......... .......... 147 ACCTTAGTAT AGTGTAAGTA TGACTTTAAA ATTTCAGGCA TAAATTGAGA GGGTACTTGT 18022 .......... .......... .......... .......... .......... .......... 147 GCATTATCTC AATAATATTC AAATCTTTAC ATTAATATCT AATTTGATGT AATATTTTAA 17962 .......... .......... .......... .......... .......... .......... 147 TAATAATAAT GTAACGACCT ATTTAGTCGT TTTGAGCAGC AGATTTTATT TTTGGAAAAA 17902 | | ||| |||||||||| ||||||||| |||||| ||| |||||||||| |||||||||| ..ACATAAAT GTAACGACCT GTTTAGTCGT TTTGAGTAGC AGATTTTATT TTTGGAAAAA 205 CTGGCTGAGA CGACGGATCC CACGATGGAC CGTCATGGGC ACGATGGACC GTCGAGGGGG 17842 | || ||||| ||||||| || ||||| |||| ||||||| || |||||||||| ||||| || | CAGGTTGAGA CGACGGAACC CACGACGGAC CGTCATGAGC ACGATGGACC GTCGA-GGAG 264 TCTCGTTCCA AAATACATAG -AATTCTGAA ATTTGGGTTT TGAAATCGAC TCTCTGAACT 17783 ||||||| || ||| || ||| ||||||||| ||| || | |||||||| |||||||||| TCTCGTTTCA AAACACTTAG AAATTCTGAA ATTGGGTACT AAAAATCGAC TCTCTGAACT 324 TCGTGATGAA GTGGCAGGAC GGACCGTCAC AGGCATGACG GGCCGTCACA GTCTCTTCAG 17723 |||| | | | ||||| ||| |||||||||| ||| ||||| | |||||||| | ||||| | TCGTAACGGA ATGGCACGAC GGACCGTCAC GGGCGTGACG GACCGTCACA GACTCTTTGG 384 -AAAATTTCA GTCTCTGAAC TCTGTGACGG AAGCAGCAGG ACGGACCGTC GCAGGCACGA 17664 || || | |||||||||| || ||| | | | ||||| |||||||||| |||||||||| TGGAAATTGA GTCTCTGAAC CTTGCGAC-G -ACCTGCAGG ACGGACCGTC GCAGGCACGA 442 CGACCCGTCA CAGACTGCGT AATCCCAGGC TGAGTCGGAT TTCTTTAAAT GTTTTAAGGG 17604 || || ||| ||| ||||| |||||||| | || ||||||| ||||||| | ||||||| || CGGGCCATCA CAGGTTGCGT AATCCCAGTC TGGGTCGGAT TTCTTTACAC GTTTTAA-GG 501 GGCGTTTTGG ACTATTCCTG CTATAATTAT AAATTTAGTG GGTTAATGTT AATAA-TTTA 17545 | |||||||| ||||||||| || ||||||| ||| |||||| |||| ||||| ||||| | || GACGTTTTGG ACTATTCCTA CTTTAATTAT AAAGTTAGTG GGTTTATGTT AATAAGTCTA 561 ACTACTTGAG GGTTAAAAGA GATAACCTTG AATTAGTTAG TGGGTTAAAC TCATCATCTT 17485 | ||| || | |||||||||| | |||||||| | | | |||| ||||||| | |||||| ATTACCTGGG GGTTAAAAGA GGTAACCTTG AGTAAATTAG TGGGTTATTA T-TCCATCTT 620 TCATACTTAA TTATATGCTA ATTAGGGTAA AAGAAAGAAG GTTTGAATAA GAAAAAGAAA 17425 | || ||||| |||||||||| |||||||||| ||||| || | |||||||||| |||||||||| TTATTCTTAA TTATATGCTA ATTAGGGTAA AAGAAGGAGG GTTTGAATAA GAAAAAGAAA 680 AGAACAGAAA GAGAGGGAGA AACGATCGA 17396 |||||||||| ||||| || | ||||| AGAACAGAAA GAGAGAGAAG GAGAATCGA 709 hqPGS_C06HBa0153O03.1-1-_SGN-E550065- (17959 17396) ******************************************************************************** EST sequence 21 -strand 714 n (File: SGN-E390013-) 1 TTTTTTTTTT TTTTATGAAT TAGCTCAATG AAAAATGAGT AAATTTTTTA TATTTTATGG 61 CATAATTTTT TCATTAATTC ATGGTTGAGA AAATCTTNGT TTCTAATAGT GTTATAAATC 121 GTAAAATAAT AATAATATTT AAGTAGGAAG AACATAAATG TAACGACCTG TTTAGTCGTT 181 TTGAGTAGCA GATTTTATTT TTGGAAAAAC AGGTTGAGAC GACGGAACCC ACGACGGACC 241 GTCATGAGCA CGATGGACCG TCGAGGAGTC TCGTTTCAAA ACACTTAGAA ATTCTGAAAT 301 TGGGTACTAA AAATCGACTC TCTGAACTTC GTAACGGAAT GGCACGACGG ACCGTCACGG 361 GCGTGACGGA CCGTCACAGA CTCTTTGGTG GAAATTGAGT CTCTGAACCT TGCGACGACC 421 TGCAGGACGG ACCGTCGCAG GCACGACGGG CCATCACAGG TTGCGTAATC CCAGTCTGGG 481 TCGGATTTCT TTACACGTTT TAAGGGACGT TTTGGACTAT TCCTACTTTA ATTATAAAGT 541 TAGTGGGTTT ATGTTAATAA GTCTAATTAC CTGGGGGTTA AAAGAGGTAA CCTTGAGTAA 601 ATTAGTGGGT TATTATTCCA TCTTTTATTC TTAATTATAT GCTAATTAGG GTAAAAGAAG 661 GAGGGTTTGA ATAAGAAAAA GAAAAGAACA GAAAGAGAGA GAAGGAGAAT CGAT Predicted gene structure (within gDNA segment 24379 to 16660): Exon 1 21799 21791 ( 9 n); cDNA 84 92 ( 9 n); score: 0.556 Intron 1 21790 18808 (2983 n); Pd: 0.520 (s: 0), Pa: 0.295 (s: 0) Exon 2 18807 18790 ( 18 n); cDNA 93 109 ( 17 n); score: 0.556 Intron 2 18789 18558 ( 232 n); Pd: 0.900 (s: 0), Pa: 0.000 (s: 0.66) Exon 3 18557 18514 ( 44 n); cDNA 110 151 ( 42 n); score: 0.659 Intron 3 18513 17960 ( 554 n); Pd: 0.845 (s: 0.66), Pa: 0.000 (s: 0.90) Exon 4 17959 17396 ( 564 n); cDNA 152 713 ( 562 n); score: 0.841 PPA cDNA 16 1 MATCH C06HBa0153O03.1-1- SGN-E390013- 0.841 635 0.889 C PGS_C06HBa0153O03.1-1-_SGN-E390013- (21799 21791,18807 18790,18557 18514,17959 17396) Alignment (genomic DNA sequence = upper lines): GTTGGGGTGG TAATTCCTCA TCTTGCACCT GTTTATTTTC CCCTTCCTCA CCCTCTCTTA 21740 |||| | GTTGAGAAA. .......... .......... .......... .......... .......... 92 CTACCTCATC AGTCGGTGGA GGAGTCACCA CCCTATTACT AGCTTGACCA GGTGTTTGTC 21680 .......... .......... .......... .......... .......... .......... 92 CTCCACCTCT AGAGATCGTC CTCTTGCGAC CTCTACCACG ACCTCTTGCC ACTGCTCCTC 21620 .......... .......... .......... .......... .......... .......... 92 CTCGAGCTAC AACCCCAATG TTTGGCTCAG ACGCACGCTA TCTTGCCGGT GTTGGTGTTG 21560 .......... .......... .......... .......... .......... .......... 92 GCACAGTTGT TTCTCTAGTT CTAACCATAT GCGAAATAGA GTGAGGATGT CAGATACCAA 21500 .......... .......... .......... .......... .......... .......... 92 TTTGTATCAC CTAGATACCA CTTGGATCCA AGTAATAGCA CGAAAGAAGG AAAGAATGGA 21440 .......... .......... .......... .......... .......... .......... 92 ATTTTGCTAA AGTCCTATAG CCTCTCGAAG AAAAGTAAGG GCGTCCCCCT ACCGTTCCTC 21380 .......... .......... .......... .......... .......... .......... 92 AAGACTCTAC TAGACTTGTT CTTGTGTGAT GAGACCAACG AACCTAACGC TCTGATACCA 21320 .......... .......... .......... .......... .......... .......... 92 AGTTTGTCAC GACCCAAAAC GATCCGTAAG TGGCACCCAC CCTTACTCTC CTAGGTGAGC 21260 .......... .......... .......... .......... .......... .......... 92 GAACCAACAA ATCTAAACCC CAACATTTAC CAGTATATCA ACTATAAATA ATATAAATAA 21200 .......... .......... .......... .......... .......... .......... 92 TGCGGAAGCT CCAAAACTCA TTACGAAATT AATTAAATCA ACATCTAAAG TTAAATACTT 21140 .......... .......... .......... .......... .......... .......... 92 ATTATTCCCA AAATCTGTAA GTCATCACAC CAAGAACATC TATCCTCGAA TTTCTAAATC 21080 .......... .......... .......... .......... .......... .......... 92 TAAGAGTATT CAAGAAGCTA AAAATAGTAA AAAGATGGTC CATGTCCGAA CTTCAAGACA 21020 .......... .......... .......... .......... .......... .......... 92 TCAAGACGTG AAGGAGAGAA TCCAGCACGA GCTAGGAATA ATAGCTCACC CTGAATTCTG 20960 .......... .......... .......... .......... .......... .......... 92 ATATGCTAAA GACCGGCTAG ATCTGATGAC GAGTCGAAGT CGATGGCACG CTTGCTGCAC 20900 .......... .......... .......... .......... .......... .......... 92 TCCACAAATA ACAAAGAAGA AAATTACAAG TAGGGGTCAG TACAAGGAAC ACGTACTGAG 20840 .......... .......... .......... .......... .......... .......... 92 TAGGTATCAT CGGCCAACTC AAAATAGAAA ACAATATATA CTGAATAATA ATATAAAATC 20780 .......... .......... .......... .......... .......... .......... 92 AACCATAATA CTTAACAGGT GACAATCAAC AAGTATAAGA ACCATTGACA ACAACAGCAA 20720 .......... .......... .......... .......... .......... .......... 92 GCACATCTAT GAGGACTCAA GCCTCCACAC CATACTCATT TGGGAAATAG GTTCTTTGAA 20660 .......... .......... .......... .......... .......... .......... 92 TTTGAGTACA TTAACATAAT TCAAGATTCA TTCTCTTTAT CATTATCGTG TCGGAACGTT 20600 .......... .......... .......... .......... .......... .......... 92 ACACCCGATC CCCTACTACT ACCGTGTCGG AACGTGACAC TCTGATCCCC TAATACTACC 20540 .......... .......... .......... .......... .......... .......... 92 GTGTCAGAAC GTGACACCCG ATCCCCTAAT ACTACCGTGT CAGAATGTGA CACTCCGATC 20480 .......... .......... .......... .......... .......... .......... 92 CCCTAATACT ACCGTGTCGA AACATGACAC CCAATCCATT TATCTCATTA TTTTAGTTCA 20420 .......... .......... .......... .......... .......... .......... 92 TCAAGCCTTC TTTATGTCAA GGCGCCATCT TAATAGAGAG GATTTAAGAT TGAAGATTCA 20360 .......... .......... .......... .......... .......... .......... 92 ACAGTTTCAT CATTCTGACC ACCACAATTA CACAATCACA ACATACAAAC ACACAATCAA 20300 .......... .......... .......... .......... .......... .......... 92 GCATATAGAA GACTTTACAA TACCACCCAA TACATATCGA TCACTATTTA GAGTTTATCT 20240 .......... .......... .......... .......... .......... .......... 92 ATCATATATA AATAAATCAT AACCTACCTC CACTGAAGAA TCGTGATCAA GCAAGCTACC 20180 .......... .......... .......... .......... .......... .......... 92 TTCCCAATGC CTTTGCTTTC CTCTTCGTTC TCTCTTTCTC GCTCGTTCTC CCTCTGTGTT 20120 .......... .......... .......... .......... .......... .......... 92 TCTTTTTATT TTTCTTACTC AAAATCTTGT TCTTTTACCC TAAATGTCAT ATAATCAATT 20060 .......... .......... .......... .......... .......... .......... 92 ATAAAAGATG ATAAAAGTAC CTCACTATTT ATTCCCTTAT TAACTTCTTT AACCCCCAAG 20000 .......... .......... .......... .......... .......... .......... 92 TAAATAAATT ATTAAACTTA CCCCACTAAT TCCATAATTA TAATCATGAA TAGTCCAAAA 19940 .......... .......... .......... .......... .......... .......... 92 CACCCCTTTA AAACTTTTAG CAGAAATCCG ACCCAGTCGA GGTTACGCAG CTTGTGACGG 19880 .......... .......... .......... .......... .......... .......... 92 TCCGTTGTGT CTACGACGGT CCGTGCTGTA GTTCCGTCGC GGAGTTCAGA GAGTCGCTCC 19820 .......... .......... .......... .......... .......... .......... 92 CAGTACCCAG ATTTTCAGAG TTGAAGTGTT TTGGAACGGA GACGCTCGAC GGACCGTCGT 19760 .......... .......... .......... .......... .......... .......... 92 GCCTGTGACG GTTTGTCCTA CCTGCCGTCG AGGGTAATGA GGAGAGCAAC AGAAGAAATT 19700 .......... .......... .......... .......... .......... .......... 92 ACACAAGTAT GGGACGACGG AGTCCATCAC GGTCCATCGT GACCATGACG GTCCGTCGTG 19640 .......... .......... .......... .......... .......... .......... 92 ACCATGACGG TCCGTCGCGT GATCCGTCGA CCCAGTCAGT TTTTTATCAA AAATAGTTCT 19580 .......... .......... .......... .......... .......... .......... 92 ACTGCTCGAA CCGACTAAAC AGGTCATTAC AATTTTCCTA CTTTAGTTTT CCCTATGGCT 19520 .......... .......... .......... .......... .......... .......... 92 ACCACTGTCC ATCTACTATT TTTTTTCATG ATTTGATCTT TTAAGTAAAA TTATTTGTGG 19460 .......... .......... .......... .......... .......... .......... 92 AACTTCTCTT TGAAGATATC TCTCACAAAT TAGCGAAAAA GTTAGTTAAT TTATTTTTAT 19400 .......... .......... .......... .......... .......... .......... 92 TTTAAAAAGT AAGATAAATG TTTTTGCGCT ATCAATATTT TATAATATGT AAATTATTCG 19340 .......... .......... .......... .......... .......... .......... 92 AAACAAACTT TTAATAAATA AAAATTAGAG CAAATATAAA AATCACTAGA TTTTTTTTTA 19280 .......... .......... .......... .......... .......... .......... 92 AAAAAATGGG GCCTTGAAAC GGTATATATT TTTTTATTTG AATAGATTAT GGGGGAGAAT 19220 .......... .......... .......... .......... .......... .......... 92 TAATAGAGGT AAGATTTTTT ATTTTATCTA ATAAGAAAAT GACAAATAAT ATATTTTTAA 19160 .......... .......... .......... .......... .......... .......... 92 AAAATAAATA AAACAAAATA AACTTTGGTT GTTAGTCATA AAAATATAAG TTATTCAAAA 19100 .......... .......... .......... .......... .......... .......... 92 AGGTGAATGA AAGAGTATAA GTGAGTCAAA AAGATGAGTG AAGAGGCATA GCTAAGCCAA 19040 .......... .......... .......... .......... .......... .......... 92 AAAAGTGAAT GTAAGGGTAT TTCTAGACCA AAAGATTGAT GAAGGATATT TTTAGACATA 18980 .......... .......... .......... .......... .......... .......... 92 GTTCAAGGAT AGTTTTGGTC CTTTTTCGTT TAAATAATCT CATATTTAGA TATTGAGTTA 18920 .......... .......... .......... .......... .......... .......... 92 TTTGCAGGGA CGATTCAATA TAATTCGAGG CCTAAATTTT AAATAAACTC TATCTGTATT 18860 .......... .......... .......... .......... .......... .......... 92 TATTTATTTT TCTTTTTAGA TGTAAATTGT ATTTATTTAT TTTTCTTTTT AGATGTAAGT 18800 || | || .......... .......... .......... .......... .......... ..ATCTTNGT 100 TATTACTTAA GTATCTTTTT TTGTAAATGG AAAAGGGCTA AAAATGCCCT TAACTTAGTG 18740 | || || TTCTA-ATAG .......... .......... .......... .......... .......... 109 GAAATGGTTC AAAATACCAT CCTTCTACCT TTTGAGTTAA AAATACCCTC CACCTTTATT 18680 .......... .......... .......... .......... .......... .......... 109 TTGGTTCAAA GATGCCTTTC CTTCCACCTT TTGATTTAAT AATACTCTTA ACCCCCCATT 18620 .......... .......... .......... .......... .......... .......... 109 TAATTAAATT TATAAAATAA AAAATTCTTA ATATTAGCTC ATTCCAAAAT CTTTATGATA 18560 .......... .......... .......... .......... .......... .......... 109 AATATATCTA AAAAATAAAA TAAAAAATTT ATTATATGTA TAAAAAGCAA AAATAAAAAT 18500 | | | || || ||||| ||| || | ||| || ||| || | ..TGT-TATA AATCGTAAAA TAATAATAAT ATT-TAAGTA GGAAGA.... .......... 151 AAAATTTCTC AAAGTTCTTA TTCTTTGTAT TAAAATAATA AGACAATAAA AATCTTAAGA 18440 .......... .......... .......... .......... .......... .......... 151 TTCTTATTCT TCATTTTTGC GCAAAAAAAT CTTTATTTTA TTTTATGTTT TATACATATT 18380 .......... .......... .......... .......... .......... .......... 151 ATTTAATATT TTAATTTGTG AGAAATTTTT TTAAGTTATT TGGATTAAAT TTTTAAATTA 18320 .......... .......... .......... .......... .......... .......... 151 TATTGAGAAA ATGCACAAGT ATTCCCTCAA ACTATGTCTG AAATCCCAGA GACACACTTA 18260 .......... .......... .......... .......... .......... .......... 151 TACTATATTA AGGTCATATT ACCCCCTGAA CTTATTTTAT AAGTAATTTT CTACCCCTTT 18200 .......... .......... .......... .......... .......... .......... 151 TGACCTACGT GGCTCTAGCT TGAAAAAAAA GTCAATCAGC GTTGGACCCA CAAGATAGTG 18140 .......... .......... .......... .......... .......... .......... 151 CCACATAGAC CGAAAAGGGC TAGAAAATTA TTAATAAAAT AAGTTCAGGG ATAATAGGAC 18080 .......... .......... .......... .......... .......... .......... 151 CTTAGTATAG TGTAAGTATG ACTTTAAAAT TTCAGGCATA AATTGAGAGG GTACTTGTGC 18020 .......... .......... .......... .......... .......... .......... 151 ATTATCTCAA TAATATTCAA ATCTTTACAT TAATATCTAA TTTGATGTAA TATTTTAATA 17960 .......... .......... .......... .......... .......... .......... 151 ATAATAATGT AACGACCTAT TTAGTCGTTT TGAGCAGCAG ATTTTATTTT TGGAAAAACT 17900 | | ||||| |||||||| | |||||||||| |||| ||||| |||||||||| ||||||||| ACATAAATGT AACGACCTGT TTAGTCGTTT TGAGTAGCAG ATTTTATTTT TGGAAAAACA 211 GGCTGAGACG ACGGATCCCA CGATGGACCG TCATGGGCAC GATGGACCGT CGAGGGGGTC 17840 || ||||||| ||||| |||| ||| |||||| ||||| |||| |||||||||| ||| || ||| GGTTGAGACG ACGGAACCCA CGACGGACCG TCATGAGCAC GATGGACCGT CGA-GGAGTC 270 TCGTTCCAAA ATACATAG-A ATTCTGAAAT TTGGGTTTTG AAATCGACTC TCTGAACTTC 17781 ||||| |||| | || ||| | |||||||||| | || | |||||||||| |||||||||| TCGTTTCAAA ACACTTAGAA ATTCTGAAAT TGGGTACTAA AAATCGACTC TCTGAACTTC 330 GTGATGAAGT GGCAGGACGG ACCGTCACAG GCATGACGGG CCGTCACAGT CTCTTCAG-A 17722 || | | | | |||| ||||| |||||||| | || |||||| ||||||||| ||||| | GTAACGGAAT GGCACGACGG ACCGTCACGG GCGTGACGGA CCGTCACAGA CTCTTTGGTG 390 AAATTTCAGT CTCTGAACTC TGTGACGGAA GCAGCAGGAC GGACCGTCGC AGGCACGACG 17662 || || ||| |||||||| || ||| | | | ||||||| |||||||||| |||||||||| GAAATTGAGT CTCTGAACCT TGCGAC-G-A CCTGCAGGAC GGACCGTCGC AGGCACGACG 448 ACCCGTCACA GACTGCGTAA TCCCAGGCTG AGTCGGATTT CTTTAAATGT TTTAAGGGGG 17602 || ||||| | ||||||| |||||| ||| ||||||||| ||||| | || ||||| ||| GGCCATCACA GGTTGCGTAA TCCCAGTCTG GGTCGGATTT CTTTACACGT TTTAA-GGGA 507 CGTTTTGGAC TATTCCTGCT ATAATTATAA ATTTAGTGGG TTAATGTTAA TAA-TTTAAC 17543 |||||||||| ||||||| || ||||||||| | |||||||| || ||||||| ||| | ||| CGTTTTGGAC TATTCCTACT TTAATTATAA AGTTAGTGGG TTTATGTTAA TAAGTCTAAT 567 TACTTGAGGG TTAAAAGAGA TAACCTTGAA TTAGTTAGTG GGTTAAACTC ATCATCTTTC 17483 ||| || ||| ||||||||| ||||||||| | | |||||| ||||| | ||||||| TACCTGGGGG TTAAAAGAGG TAACCTTGAG TAAATTAGTG GGTTATTAT- TCCATCTTTT 626 ATACTTAATT ATATGCTAAT TAGGGTAAAA GAAAGAAGGT TTGAATAAGA AAAAGAAAAG 17423 || ||||||| |||||||||| |||||||||| ||| || ||| |||||||||| |||||||||| ATTCTTAATT ATATGCTAAT TAGGGTAAAA GAAGGAGGGT TTGAATAAGA AAAAGAAAAG 686 AACAGAAAGA GAGGGAGAAA CGATCGA 17396 |||||||||| ||| || | ||||| AACAGAAAGA GAGAGAAGGA GAATCGA 713 hqPGS_C06HBa0153O03.1-1-_SGN-E390013- (17959 17396) ******************************************************************************** EST sequence 24 -strand 717 n (File: SGN-E550484-) 1 TTTTTTTTTT TTTTTTTATG AATTAGCTCA ATGAAAAATG AGTAAATTTT TTATATTTTA 61 TGGCATAATT TTTTCATTAA TTCATGGTTG AGAAAATCTT TGTTTCTAAT AGTGTTATAA 121 ATCGTAAAAT AATAATAATA TTTAAGTAGG AAGAACATAA ATGTAACGAC CTGTTTAGTC 181 GTTTTGAGTA GCAGATTTTA TTTTTGGAAA AACAGGTTGA GACGACGGAA CCCACGACGG 241 ACCGTCATGA GCACGATGGA CCGTCGAGGA GTCTCGTTTC AAAACACTTA GAAATTCTGA 301 AATTGGGTAC TAAAAATCGA CTCTCTGAAC TTCGTAACGG AATGGCACGA CGGACCGTCA 361 CGGGCGTGAC GGACCGTCAC AGACTCTTTG GTGGAAATTG AGTCTCTGAA CCTTGCGACG 421 ACCTGCAGGA CGGACCGTCG CAGGCACGAC GGGCCATCAC AGGTTGCGTA ATCCCAGTCT 481 GGGTCGGATT TCTTTACACG TTTTAAGGGA CGTTTTGGAC TATTCCTACT TTAATTATAA 541 AGTTAGTGGG TTTATGTTAA TAAGTCTAAT TACCTGGGGG TTAAAAGAGG TAACCTTGAG 601 TAAATTAGTG GGTTATTATT CCATCTTTTA TTCTTAATTA TATGCTAATT AGGGTAAAAG 661 AAGGAGGGTT TGAATAAGAA AAAGAAAAGA ACAGAAAGAG AGAGAAGGAG AATCGAT Predicted gene structure (within gDNA segment 24409 to 16660): Exon 1 21799 21794 ( 6 n); cDNA 87 92 ( 6 n); score: 0.833 Intron 1 21793 18575 (3219 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.60) Exon 2 18574 18514 ( 61 n); cDNA 93 154 ( 62 n); score: 0.607 Intron 2 18513 17960 ( 554 n); Pd: 0.845 (s: 0.56), Pa: 0.000 (s: 0.90) Exon 3 17959 17396 ( 564 n); cDNA 155 716 ( 562 n); score: 0.841 PPA cDNA 19 1 MATCH C06HBa0153O03.1-1- SGN-E550484- 0.818 631 0.880 C PGS_C06HBa0153O03.1-1-_SGN-E550484- (21799 21794,18574 18514,17959 17396) Alignment (genomic DNA sequence = upper lines): GTTGGGGTGG TAATTCCTCA TCTTGCACCT GTTTATTTTC CCCTTCCTCA CCCTCTCTTA 21740 |||| | GTTGAG.... .......... .......... .......... .......... .......... 92 CTACCTCATC AGTCGGTGGA GGAGTCACCA CCCTATTACT AGCTTGACCA GGTGTTTGTC 21680 .......... .......... .......... .......... .......... .......... 92 CTCCACCTCT AGAGATCGTC CTCTTGCGAC CTCTACCACG ACCTCTTGCC ACTGCTCCTC 21620 .......... .......... .......... .......... .......... .......... 92 CTCGAGCTAC AACCCCAATG TTTGGCTCAG ACGCACGCTA TCTTGCCGGT GTTGGTGTTG 21560 .......... .......... .......... .......... .......... .......... 92 GCACAGTTGT TTCTCTAGTT CTAACCATAT GCGAAATAGA GTGAGGATGT CAGATACCAA 21500 .......... .......... .......... .......... .......... .......... 92 TTTGTATCAC CTAGATACCA CTTGGATCCA AGTAATAGCA CGAAAGAAGG AAAGAATGGA 21440 .......... .......... .......... .......... .......... .......... 92 ATTTTGCTAA AGTCCTATAG CCTCTCGAAG AAAAGTAAGG GCGTCCCCCT ACCGTTCCTC 21380 .......... .......... .......... .......... .......... .......... 92 AAGACTCTAC TAGACTTGTT CTTGTGTGAT GAGACCAACG AACCTAACGC TCTGATACCA 21320 .......... .......... .......... .......... .......... .......... 92 AGTTTGTCAC GACCCAAAAC GATCCGTAAG TGGCACCCAC CCTTACTCTC CTAGGTGAGC 21260 .......... .......... .......... .......... .......... .......... 92 GAACCAACAA ATCTAAACCC CAACATTTAC CAGTATATCA ACTATAAATA ATATAAATAA 21200 .......... .......... .......... .......... .......... .......... 92 TGCGGAAGCT CCAAAACTCA TTACGAAATT AATTAAATCA ACATCTAAAG TTAAATACTT 21140 .......... .......... .......... .......... .......... .......... 92 ATTATTCCCA AAATCTGTAA GTCATCACAC CAAGAACATC TATCCTCGAA TTTCTAAATC 21080 .......... .......... .......... .......... .......... .......... 92 TAAGAGTATT CAAGAAGCTA AAAATAGTAA AAAGATGGTC CATGTCCGAA CTTCAAGACA 21020 .......... .......... .......... .......... .......... .......... 92 TCAAGACGTG AAGGAGAGAA TCCAGCACGA GCTAGGAATA ATAGCTCACC CTGAATTCTG 20960 .......... .......... .......... .......... .......... .......... 92 ATATGCTAAA GACCGGCTAG ATCTGATGAC GAGTCGAAGT CGATGGCACG CTTGCTGCAC 20900 .......... .......... .......... .......... .......... .......... 92 TCCACAAATA ACAAAGAAGA AAATTACAAG TAGGGGTCAG TACAAGGAAC ACGTACTGAG 20840 .......... .......... .......... .......... .......... .......... 92 TAGGTATCAT CGGCCAACTC AAAATAGAAA ACAATATATA CTGAATAATA ATATAAAATC 20780 .......... .......... .......... .......... .......... .......... 92 AACCATAATA CTTAACAGGT GACAATCAAC AAGTATAAGA ACCATTGACA ACAACAGCAA 20720 .......... .......... .......... .......... .......... .......... 92 GCACATCTAT GAGGACTCAA GCCTCCACAC CATACTCATT TGGGAAATAG GTTCTTTGAA 20660 .......... .......... .......... .......... .......... .......... 92 TTTGAGTACA TTAACATAAT TCAAGATTCA TTCTCTTTAT CATTATCGTG TCGGAACGTT 20600 .......... .......... .......... .......... .......... .......... 92 ACACCCGATC CCCTACTACT ACCGTGTCGG AACGTGACAC TCTGATCCCC TAATACTACC 20540 .......... .......... .......... .......... .......... .......... 92 GTGTCAGAAC GTGACACCCG ATCCCCTAAT ACTACCGTGT CAGAATGTGA CACTCCGATC 20480 .......... .......... .......... .......... .......... .......... 92 CCCTAATACT ACCGTGTCGA AACATGACAC CCAATCCATT TATCTCATTA TTTTAGTTCA 20420 .......... .......... .......... .......... .......... .......... 92 TCAAGCCTTC TTTATGTCAA GGCGCCATCT TAATAGAGAG GATTTAAGAT TGAAGATTCA 20360 .......... .......... .......... .......... .......... .......... 92 ACAGTTTCAT CATTCTGACC ACCACAATTA CACAATCACA ACATACAAAC ACACAATCAA 20300 .......... .......... .......... .......... .......... .......... 92 GCATATAGAA GACTTTACAA TACCACCCAA TACATATCGA TCACTATTTA GAGTTTATCT 20240 .......... .......... .......... .......... .......... .......... 92 ATCATATATA AATAAATCAT AACCTACCTC CACTGAAGAA TCGTGATCAA GCAAGCTACC 20180 .......... .......... .......... .......... .......... .......... 92 TTCCCAATGC CTTTGCTTTC CTCTTCGTTC TCTCTTTCTC GCTCGTTCTC CCTCTGTGTT 20120 .......... .......... .......... .......... .......... .......... 92 TCTTTTTATT TTTCTTACTC AAAATCTTGT TCTTTTACCC TAAATGTCAT ATAATCAATT 20060 .......... .......... .......... .......... .......... .......... 92 ATAAAAGATG ATAAAAGTAC CTCACTATTT ATTCCCTTAT TAACTTCTTT AACCCCCAAG 20000 .......... .......... .......... .......... .......... .......... 92 TAAATAAATT ATTAAACTTA CCCCACTAAT TCCATAATTA TAATCATGAA TAGTCCAAAA 19940 .......... .......... .......... .......... .......... .......... 92 CACCCCTTTA AAACTTTTAG CAGAAATCCG ACCCAGTCGA GGTTACGCAG CTTGTGACGG 19880 .......... .......... .......... .......... .......... .......... 92 TCCGTTGTGT CTACGACGGT CCGTGCTGTA GTTCCGTCGC GGAGTTCAGA GAGTCGCTCC 19820 .......... .......... .......... .......... .......... .......... 92 CAGTACCCAG ATTTTCAGAG TTGAAGTGTT TTGGAACGGA GACGCTCGAC GGACCGTCGT 19760 .......... .......... .......... .......... .......... .......... 92 GCCTGTGACG GTTTGTCCTA CCTGCCGTCG AGGGTAATGA GGAGAGCAAC AGAAGAAATT 19700 .......... .......... .......... .......... .......... .......... 92 ACACAAGTAT GGGACGACGG AGTCCATCAC GGTCCATCGT GACCATGACG GTCCGTCGTG 19640 .......... .......... .......... .......... .......... .......... 92 ACCATGACGG TCCGTCGCGT GATCCGTCGA CCCAGTCAGT TTTTTATCAA AAATAGTTCT 19580 .......... .......... .......... .......... .......... .......... 92 ACTGCTCGAA CCGACTAAAC AGGTCATTAC AATTTTCCTA CTTTAGTTTT CCCTATGGCT 19520 .......... .......... .......... .......... .......... .......... 92 ACCACTGTCC ATCTACTATT TTTTTTCATG ATTTGATCTT TTAAGTAAAA TTATTTGTGG 19460 .......... .......... .......... .......... .......... .......... 92 AACTTCTCTT TGAAGATATC TCTCACAAAT TAGCGAAAAA GTTAGTTAAT TTATTTTTAT 19400 .......... .......... .......... .......... .......... .......... 92 TTTAAAAAGT AAGATAAATG TTTTTGCGCT ATCAATATTT TATAATATGT AAATTATTCG 19340 .......... .......... .......... .......... .......... .......... 92 AAACAAACTT TTAATAAATA AAAATTAGAG CAAATATAAA AATCACTAGA TTTTTTTTTA 19280 .......... .......... .......... .......... .......... .......... 92 AAAAAATGGG GCCTTGAAAC GGTATATATT TTTTTATTTG AATAGATTAT GGGGGAGAAT 19220 .......... .......... .......... .......... .......... .......... 92 TAATAGAGGT AAGATTTTTT ATTTTATCTA ATAAGAAAAT GACAAATAAT ATATTTTTAA 19160 .......... .......... .......... .......... .......... .......... 92 AAAATAAATA AAACAAAATA AACTTTGGTT GTTAGTCATA AAAATATAAG TTATTCAAAA 19100 .......... .......... .......... .......... .......... .......... 92 AGGTGAATGA AAGAGTATAA GTGAGTCAAA AAGATGAGTG AAGAGGCATA GCTAAGCCAA 19040 .......... .......... .......... .......... .......... .......... 92 AAAAGTGAAT GTAAGGGTAT TTCTAGACCA AAAGATTGAT GAAGGATATT TTTAGACATA 18980 .......... .......... .......... .......... .......... .......... 92 GTTCAAGGAT AGTTTTGGTC CTTTTTCGTT TAAATAATCT CATATTTAGA TATTGAGTTA 18920 .......... .......... .......... .......... .......... .......... 92 TTTGCAGGGA CGATTCAATA TAATTCGAGG CCTAAATTTT AAATAAACTC TATCTGTATT 18860 .......... .......... .......... .......... .......... .......... 92 TATTTATTTT TCTTTTTAGA TGTAAATTGT ATTTATTTAT TTTTCTTTTT AGATGTAAGT 18800 .......... .......... .......... .......... .......... .......... 92 TATTACTTAA GTATCTTTTT TTGTAAATGG AAAAGGGCTA AAAATGCCCT TAACTTAGTG 18740 .......... .......... .......... .......... .......... .......... 92 GAAATGGTTC AAAATACCAT CCTTCTACCT TTTGAGTTAA AAATACCCTC CACCTTTATT 18680 .......... .......... .......... .......... .......... .......... 92 TTGGTTCAAA GATGCCTTTC CTTCCACCTT TTGATTTAAT AATACTCTTA ACCCCCCATT 18620 .......... .......... .......... .......... .......... .......... 92 TAATTAAATT TATAAAATAA AAAATTCTTA ATATTAGCTC ATTCCAAAAT CTTTATGATA 18560 ||||| |||| | .......... .......... .......... .......... .....AAAAT CTTTGTTTCT 107 AATA-T-ATC TAAAAAATAA AATAAAAAAT TTATTATATG TATAAAAAGC AAAAATAAAA 18502 |||| | | |||| ||| ||||| || |||| || | || || | AATAGTGTTA TAAATCGTAA AATAATAATA ATATT-TAAG TAGGAAGA.. .......... 154 ATAAAATTTC TCAAAGTTCT TATTCTTTGT ATTAAAATAA TAAGACAATA AAAATCTTAA 18442 .......... .......... .......... .......... .......... .......... 154 GATTCTTATT CTTCATTTTT GCGCAAAAAA ATCTTTATTT TATTTTATGT TTTATACATA 18382 .......... .......... .......... .......... .......... .......... 154 TTATTTAATA TTTTAATTTG TGAGAAATTT TTTTAAGTTA TTTGGATTAA ATTTTTAAAT 18322 .......... .......... .......... .......... .......... .......... 154 TATATTGAGA AAATGCACAA GTATTCCCTC AAACTATGTC TGAAATCCCA GAGACACACT 18262 .......... .......... .......... .......... .......... .......... 154 TATACTATAT TAAGGTCATA TTACCCCCTG AACTTATTTT ATAAGTAATT TTCTACCCCT 18202 .......... .......... .......... .......... .......... .......... 154 TTTGACCTAC GTGGCTCTAG CTTGAAAAAA AAGTCAATCA GCGTTGGACC CACAAGATAG 18142 .......... .......... .......... .......... .......... .......... 154 TGCCACATAG ACCGAAAAGG GCTAGAAAAT TATTAATAAA ATAAGTTCAG GGATAATAGG 18082 .......... .......... .......... .......... .......... .......... 154 ACCTTAGTAT AGTGTAAGTA TGACTTTAAA ATTTCAGGCA TAAATTGAGA GGGTACTTGT 18022 .......... .......... .......... .......... .......... .......... 154 GCATTATCTC AATAATATTC AAATCTTTAC ATTAATATCT AATTTGATGT AATATTTTAA 17962 .......... .......... .......... .......... .......... .......... 154 TAATAATAAT GTAACGACCT ATTTAGTCGT TTTGAGCAGC AGATTTTATT TTTGGAAAAA 17902 | | ||| |||||||||| ||||||||| |||||| ||| |||||||||| |||||||||| ..ACATAAAT GTAACGACCT GTTTAGTCGT TTTGAGTAGC AGATTTTATT TTTGGAAAAA 212 CTGGCTGAGA CGACGGATCC CACGATGGAC CGTCATGGGC ACGATGGACC GTCGAGGGGG 17842 | || ||||| ||||||| || ||||| |||| ||||||| || |||||||||| ||||| || | CAGGTTGAGA CGACGGAACC CACGACGGAC CGTCATGAGC ACGATGGACC GTCGA-GGAG 271 TCTCGTTCCA AAATACATAG -AATTCTGAA ATTTGGGTTT TGAAATCGAC TCTCTGAACT 17783 ||||||| || ||| || ||| ||||||||| ||| || | |||||||| |||||||||| TCTCGTTTCA AAACACTTAG AAATTCTGAA ATTGGGTACT AAAAATCGAC TCTCTGAACT 331 TCGTGATGAA GTGGCAGGAC GGACCGTCAC AGGCATGACG GGCCGTCACA GTCTCTTCAG 17723 |||| | | | ||||| ||| |||||||||| ||| ||||| | |||||||| | ||||| | TCGTAACGGA ATGGCACGAC GGACCGTCAC GGGCGTGACG GACCGTCACA GACTCTTTGG 391 -AAAATTTCA GTCTCTGAAC TCTGTGACGG AAGCAGCAGG ACGGACCGTC GCAGGCACGA 17664 || || | |||||||||| || ||| | | | ||||| |||||||||| |||||||||| TGGAAATTGA GTCTCTGAAC CTTGCGAC-G -ACCTGCAGG ACGGACCGTC GCAGGCACGA 449 CGACCCGTCA CAGACTGCGT AATCCCAGGC TGAGTCGGAT TTCTTTAAAT GTTTTAAGGG 17604 || || ||| ||| ||||| |||||||| | || ||||||| ||||||| | ||||||| || CGGGCCATCA CAGGTTGCGT AATCCCAGTC TGGGTCGGAT TTCTTTACAC GTTTTAA-GG 508 GGCGTTTTGG ACTATTCCTG CTATAATTAT AAATTTAGTG GGTTAATGTT AATAA-TTTA 17545 | |||||||| ||||||||| || ||||||| ||| |||||| |||| ||||| ||||| | || GACGTTTTGG ACTATTCCTA CTTTAATTAT AAAGTTAGTG GGTTTATGTT AATAAGTCTA 568 ACTACTTGAG GGTTAAAAGA GATAACCTTG AATTAGTTAG TGGGTTAAAC TCATCATCTT 17485 | ||| || | |||||||||| | |||||||| | | | |||| ||||||| | |||||| ATTACCTGGG GGTTAAAAGA GGTAACCTTG AGTAAATTAG TGGGTTATTA T-TCCATCTT 627 TCATACTTAA TTATATGCTA ATTAGGGTAA AAGAAAGAAG GTTTGAATAA GAAAAAGAAA 17425 | || ||||| |||||||||| |||||||||| ||||| || | |||||||||| |||||||||| TTATTCTTAA TTATATGCTA ATTAGGGTAA AAGAAGGAGG GTTTGAATAA GAAAAAGAAA 687 AGAACAGAAA GAGAGGGAGA AACGATCGA 17396 |||||||||| ||||| || | ||||| AGAACAGAAA GAGAGAGAAG GAGAATCGA 716 hqPGS_C06HBa0153O03.1-1-_SGN-E550484- (17959 17396) ******************************************************************************** EST sequence 25 -strand 713 n (File: SGN-E550211-) 1 TTTTTTTTTT TTTTATGAAT TAGCTCAATG AAAAATGAGT AAATTTTTTA TATTTTATGG 61 CATAATTTTT TCATTAATTC ATGGTTGAGA AAATCTTTGT TTCTAATAGT GTTATAAATC 121 GTAAAATAAT AATAATATTT AAGTAGGAAG AACATAAATG TAACGACCTG TTTAGTCGTT 181 TTGAGTAGCA GATTTTATTT TTGGAAAAAC AGGTTGAGAC GACGGAACCC ACGACGGACC 241 GTCATGAGCA CGATGGACCG TCGAGGAGTC TCGTTTCAAA ACACTTAGAA ATTCTGAAAT 301 TGGGTACTAA AAATCGACTC TCTGAACTTC GTAACGGAAT GGCACGACGG ACCGTCACGG 361 GCGTGACGGA CCGTCACAGA CTCTTTGGTG GAAATTGAGT CTCTGAACCT TGCGACGACC 421 TGCAGGACGG ACCGTCGCAG GCACGACGGG CCATCACAGG TTGCGTAATC CCAGTCTGGG 481 TCGGATTTCT TTACACGTTT TAAGGGACGT TTTGGACTAT TCCTACTTTA ATTATAAAGT 541 TAGTGGGTTT ATGTTAATAA GTCTAATTAC CTGGGGGTTA AAAGAGGTAA CCTTGAGTAA 601 ATTAGTGGGT TATTATTCCA TCTTTTATTC TTAATTATAT GCTAATTAGG GTAAAAGAAG 661 GAGGGTTGAA TAAGAAAAAG AAAAGAACAG AAAGAGAGAG AAGGAGAATC GAT Predicted gene structure (within gDNA segment 24379 to 16660): Exon 1 21799 21794 ( 6 n); cDNA 84 89 ( 6 n); score: 0.833 Intron 1 21793 18575 (3219 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.60) Exon 2 18574 18514 ( 61 n); cDNA 90 151 ( 62 n); score: 0.607 Intron 2 18513 17960 ( 554 n); Pd: 0.845 (s: 0.56), Pa: 0.000 (s: 0.90) Exon 3 17959 17396 ( 564 n); cDNA 152 712 ( 561 n); score: 0.840 PPA cDNA 16 1 MATCH C06HBa0153O03.1-1- SGN-E550211- 0.817 631 0.885 C PGS_C06HBa0153O03.1-1-_SGN-E550211- (21799 21794,18574 18514,17959 17396) Alignment (genomic DNA sequence = upper lines): GTTGGGGTGG TAATTCCTCA TCTTGCACCT GTTTATTTTC CCCTTCCTCA CCCTCTCTTA 21740 |||| | GTTGAG.... .......... .......... .......... .......... .......... 89 CTACCTCATC AGTCGGTGGA GGAGTCACCA CCCTATTACT AGCTTGACCA GGTGTTTGTC 21680 .......... .......... .......... .......... .......... .......... 89 CTCCACCTCT AGAGATCGTC CTCTTGCGAC CTCTACCACG ACCTCTTGCC ACTGCTCCTC 21620 .......... .......... .......... .......... .......... .......... 89 CTCGAGCTAC AACCCCAATG TTTGGCTCAG ACGCACGCTA TCTTGCCGGT GTTGGTGTTG 21560 .......... .......... .......... .......... .......... .......... 89 GCACAGTTGT TTCTCTAGTT CTAACCATAT GCGAAATAGA GTGAGGATGT CAGATACCAA 21500 .......... .......... .......... .......... .......... .......... 89 TTTGTATCAC CTAGATACCA CTTGGATCCA AGTAATAGCA CGAAAGAAGG AAAGAATGGA 21440 .......... .......... .......... .......... .......... .......... 89 ATTTTGCTAA AGTCCTATAG CCTCTCGAAG AAAAGTAAGG GCGTCCCCCT ACCGTTCCTC 21380 .......... .......... .......... .......... .......... .......... 89 AAGACTCTAC TAGACTTGTT CTTGTGTGAT GAGACCAACG AACCTAACGC TCTGATACCA 21320 .......... .......... .......... .......... .......... .......... 89 AGTTTGTCAC GACCCAAAAC GATCCGTAAG TGGCACCCAC CCTTACTCTC CTAGGTGAGC 21260 .......... .......... .......... .......... .......... .......... 89 GAACCAACAA ATCTAAACCC CAACATTTAC CAGTATATCA ACTATAAATA ATATAAATAA 21200 .......... .......... .......... .......... .......... .......... 89 TGCGGAAGCT CCAAAACTCA TTACGAAATT AATTAAATCA ACATCTAAAG TTAAATACTT 21140 .......... .......... .......... .......... .......... .......... 89 ATTATTCCCA AAATCTGTAA GTCATCACAC CAAGAACATC TATCCTCGAA TTTCTAAATC 21080 .......... .......... .......... .......... .......... .......... 89 TAAGAGTATT CAAGAAGCTA AAAATAGTAA AAAGATGGTC CATGTCCGAA CTTCAAGACA 21020 .......... .......... .......... .......... .......... .......... 89 TCAAGACGTG AAGGAGAGAA TCCAGCACGA GCTAGGAATA ATAGCTCACC CTGAATTCTG 20960 .......... .......... .......... .......... .......... .......... 89 ATATGCTAAA GACCGGCTAG ATCTGATGAC GAGTCGAAGT CGATGGCACG CTTGCTGCAC 20900 .......... .......... .......... .......... .......... .......... 89 TCCACAAATA ACAAAGAAGA AAATTACAAG TAGGGGTCAG TACAAGGAAC ACGTACTGAG 20840 .......... .......... .......... .......... .......... .......... 89 TAGGTATCAT CGGCCAACTC AAAATAGAAA ACAATATATA CTGAATAATA ATATAAAATC 20780 .......... .......... .......... .......... .......... .......... 89 AACCATAATA CTTAACAGGT GACAATCAAC AAGTATAAGA ACCATTGACA ACAACAGCAA 20720 .......... .......... .......... .......... .......... .......... 89 GCACATCTAT GAGGACTCAA GCCTCCACAC CATACTCATT TGGGAAATAG GTTCTTTGAA 20660 .......... .......... .......... .......... .......... .......... 89 TTTGAGTACA TTAACATAAT TCAAGATTCA TTCTCTTTAT CATTATCGTG TCGGAACGTT 20600 .......... .......... .......... .......... .......... .......... 89 ACACCCGATC CCCTACTACT ACCGTGTCGG AACGTGACAC TCTGATCCCC TAATACTACC 20540 .......... .......... .......... .......... .......... .......... 89 GTGTCAGAAC GTGACACCCG ATCCCCTAAT ACTACCGTGT CAGAATGTGA CACTCCGATC 20480 .......... .......... .......... .......... .......... .......... 89 CCCTAATACT ACCGTGTCGA AACATGACAC CCAATCCATT TATCTCATTA TTTTAGTTCA 20420 .......... .......... .......... .......... .......... .......... 89 TCAAGCCTTC TTTATGTCAA GGCGCCATCT TAATAGAGAG GATTTAAGAT TGAAGATTCA 20360 .......... .......... .......... .......... .......... .......... 89 ACAGTTTCAT CATTCTGACC ACCACAATTA CACAATCACA ACATACAAAC ACACAATCAA 20300 .......... .......... .......... .......... .......... .......... 89 GCATATAGAA GACTTTACAA TACCACCCAA TACATATCGA TCACTATTTA GAGTTTATCT 20240 .......... .......... .......... .......... .......... .......... 89 ATCATATATA AATAAATCAT AACCTACCTC CACTGAAGAA TCGTGATCAA GCAAGCTACC 20180 .......... .......... .......... .......... .......... .......... 89 TTCCCAATGC CTTTGCTTTC CTCTTCGTTC TCTCTTTCTC GCTCGTTCTC CCTCTGTGTT 20120 .......... .......... .......... .......... .......... .......... 89 TCTTTTTATT TTTCTTACTC AAAATCTTGT TCTTTTACCC TAAATGTCAT ATAATCAATT 20060 .......... .......... .......... .......... .......... .......... 89 ATAAAAGATG ATAAAAGTAC CTCACTATTT ATTCCCTTAT TAACTTCTTT AACCCCCAAG 20000 .......... .......... .......... .......... .......... .......... 89 TAAATAAATT ATTAAACTTA CCCCACTAAT TCCATAATTA TAATCATGAA TAGTCCAAAA 19940 .......... .......... .......... .......... .......... .......... 89 CACCCCTTTA AAACTTTTAG CAGAAATCCG ACCCAGTCGA GGTTACGCAG CTTGTGACGG 19880 .......... .......... .......... .......... .......... .......... 89 TCCGTTGTGT CTACGACGGT CCGTGCTGTA GTTCCGTCGC GGAGTTCAGA GAGTCGCTCC 19820 .......... .......... .......... .......... .......... .......... 89 CAGTACCCAG ATTTTCAGAG TTGAAGTGTT TTGGAACGGA GACGCTCGAC GGACCGTCGT 19760 .......... .......... .......... .......... .......... .......... 89 GCCTGTGACG GTTTGTCCTA CCTGCCGTCG AGGGTAATGA GGAGAGCAAC AGAAGAAATT 19700 .......... .......... .......... .......... .......... .......... 89 ACACAAGTAT GGGACGACGG AGTCCATCAC GGTCCATCGT GACCATGACG GTCCGTCGTG 19640 .......... .......... .......... .......... .......... .......... 89 ACCATGACGG TCCGTCGCGT GATCCGTCGA CCCAGTCAGT TTTTTATCAA AAATAGTTCT 19580 .......... .......... .......... .......... .......... .......... 89 ACTGCTCGAA CCGACTAAAC AGGTCATTAC AATTTTCCTA CTTTAGTTTT CCCTATGGCT 19520 .......... .......... .......... .......... .......... .......... 89 ACCACTGTCC ATCTACTATT TTTTTTCATG ATTTGATCTT TTAAGTAAAA TTATTTGTGG 19460 .......... .......... .......... .......... .......... .......... 89 AACTTCTCTT TGAAGATATC TCTCACAAAT TAGCGAAAAA GTTAGTTAAT TTATTTTTAT 19400 .......... .......... .......... .......... .......... .......... 89 TTTAAAAAGT AAGATAAATG TTTTTGCGCT ATCAATATTT TATAATATGT AAATTATTCG 19340 .......... .......... .......... .......... .......... .......... 89 AAACAAACTT TTAATAAATA AAAATTAGAG CAAATATAAA AATCACTAGA TTTTTTTTTA 19280 .......... .......... .......... .......... .......... .......... 89 AAAAAATGGG GCCTTGAAAC GGTATATATT TTTTTATTTG AATAGATTAT GGGGGAGAAT 19220 .......... .......... .......... .......... .......... .......... 89 TAATAGAGGT AAGATTTTTT ATTTTATCTA ATAAGAAAAT GACAAATAAT ATATTTTTAA 19160 .......... .......... .......... .......... .......... .......... 89 AAAATAAATA AAACAAAATA AACTTTGGTT GTTAGTCATA AAAATATAAG TTATTCAAAA 19100 .......... .......... .......... .......... .......... .......... 89 AGGTGAATGA AAGAGTATAA GTGAGTCAAA AAGATGAGTG AAGAGGCATA GCTAAGCCAA 19040 .......... .......... .......... .......... .......... .......... 89 AAAAGTGAAT GTAAGGGTAT TTCTAGACCA AAAGATTGAT GAAGGATATT TTTAGACATA 18980 .......... .......... .......... .......... .......... .......... 89 GTTCAAGGAT AGTTTTGGTC CTTTTTCGTT TAAATAATCT CATATTTAGA TATTGAGTTA 18920 .......... .......... .......... .......... .......... .......... 89 TTTGCAGGGA CGATTCAATA TAATTCGAGG CCTAAATTTT AAATAAACTC TATCTGTATT 18860 .......... .......... .......... .......... .......... .......... 89 TATTTATTTT TCTTTTTAGA TGTAAATTGT ATTTATTTAT TTTTCTTTTT AGATGTAAGT 18800 .......... .......... .......... .......... .......... .......... 89 TATTACTTAA GTATCTTTTT TTGTAAATGG AAAAGGGCTA AAAATGCCCT TAACTTAGTG 18740 .......... .......... .......... .......... .......... .......... 89 GAAATGGTTC AAAATACCAT CCTTCTACCT TTTGAGTTAA AAATACCCTC CACCTTTATT 18680 .......... .......... .......... .......... .......... .......... 89 TTGGTTCAAA GATGCCTTTC CTTCCACCTT TTGATTTAAT AATACTCTTA ACCCCCCATT 18620 .......... .......... .......... .......... .......... .......... 89 TAATTAAATT TATAAAATAA AAAATTCTTA ATATTAGCTC ATTCCAAAAT CTTTATGATA 18560 ||||| |||| | .......... .......... .......... .......... .....AAAAT CTTTGTTTCT 104 AATA-T-ATC TAAAAAATAA AATAAAAAAT TTATTATATG TATAAAAAGC AAAAATAAAA 18502 |||| | | |||| ||| ||||| || |||| || | || || | AATAGTGTTA TAAATCGTAA AATAATAATA ATATT-TAAG TAGGAAGA.. .......... 151 ATAAAATTTC TCAAAGTTCT TATTCTTTGT ATTAAAATAA TAAGACAATA AAAATCTTAA 18442 .......... .......... .......... .......... .......... .......... 151 GATTCTTATT CTTCATTTTT GCGCAAAAAA ATCTTTATTT TATTTTATGT TTTATACATA 18382 .......... .......... .......... .......... .......... .......... 151 TTATTTAATA TTTTAATTTG TGAGAAATTT TTTTAAGTTA TTTGGATTAA ATTTTTAAAT 18322 .......... .......... .......... .......... .......... .......... 151 TATATTGAGA AAATGCACAA GTATTCCCTC AAACTATGTC TGAAATCCCA GAGACACACT 18262 .......... .......... .......... .......... .......... .......... 151 TATACTATAT TAAGGTCATA TTACCCCCTG AACTTATTTT ATAAGTAATT TTCTACCCCT 18202 .......... .......... .......... .......... .......... .......... 151 TTTGACCTAC GTGGCTCTAG CTTGAAAAAA AAGTCAATCA GCGTTGGACC CACAAGATAG 18142 .......... .......... .......... .......... .......... .......... 151 TGCCACATAG ACCGAAAAGG GCTAGAAAAT TATTAATAAA ATAAGTTCAG GGATAATAGG 18082 .......... .......... .......... .......... .......... .......... 151 ACCTTAGTAT AGTGTAAGTA TGACTTTAAA ATTTCAGGCA TAAATTGAGA GGGTACTTGT 18022 .......... .......... .......... .......... .......... .......... 151 GCATTATCTC AATAATATTC AAATCTTTAC ATTAATATCT AATTTGATGT AATATTTTAA 17962 .......... .......... .......... .......... .......... .......... 151 TAATAATAAT GTAACGACCT ATTTAGTCGT TTTGAGCAGC AGATTTTATT TTTGGAAAAA 17902 | | ||| |||||||||| ||||||||| |||||| ||| |||||||||| |||||||||| ..ACATAAAT GTAACGACCT GTTTAGTCGT TTTGAGTAGC AGATTTTATT TTTGGAAAAA 209 CTGGCTGAGA CGACGGATCC CACGATGGAC CGTCATGGGC ACGATGGACC GTCGAGGGGG 17842 | || ||||| ||||||| || ||||| |||| ||||||| || |||||||||| ||||| || | CAGGTTGAGA CGACGGAACC CACGACGGAC CGTCATGAGC ACGATGGACC GTCGA-GGAG 268 TCTCGTTCCA AAATACATAG -AATTCTGAA ATTTGGGTTT TGAAATCGAC TCTCTGAACT 17783 ||||||| || ||| || ||| ||||||||| ||| || | |||||||| |||||||||| TCTCGTTTCA AAACACTTAG AAATTCTGAA ATTGGGTACT AAAAATCGAC TCTCTGAACT 328 TCGTGATGAA GTGGCAGGAC GGACCGTCAC AGGCATGACG GGCCGTCACA GTCTCTTCAG 17723 |||| | | | ||||| ||| |||||||||| ||| ||||| | |||||||| | ||||| | TCGTAACGGA ATGGCACGAC GGACCGTCAC GGGCGTGACG GACCGTCACA GACTCTTTGG 388 -AAAATTTCA GTCTCTGAAC TCTGTGACGG AAGCAGCAGG ACGGACCGTC GCAGGCACGA 17664 || || | |||||||||| || ||| | | | ||||| |||||||||| |||||||||| TGGAAATTGA GTCTCTGAAC CTTGCGAC-G -ACCTGCAGG ACGGACCGTC GCAGGCACGA 446 CGACCCGTCA CAGACTGCGT AATCCCAGGC TGAGTCGGAT TTCTTTAAAT GTTTTAAGGG 17604 || || ||| ||| ||||| |||||||| | || ||||||| ||||||| | ||||||| || CGGGCCATCA CAGGTTGCGT AATCCCAGTC TGGGTCGGAT TTCTTTACAC GTTTTAA-GG 505 GGCGTTTTGG ACTATTCCTG CTATAATTAT AAATTTAGTG GGTTAATGTT AATAA-TTTA 17545 | |||||||| ||||||||| || ||||||| ||| |||||| |||| ||||| ||||| | || GACGTTTTGG ACTATTCCTA CTTTAATTAT AAAGTTAGTG GGTTTATGTT AATAAGTCTA 565 ACTACTTGAG GGTTAAAAGA GATAACCTTG AATTAGTTAG TGGGTTAAAC TCATCATCTT 17485 | ||| || | |||||||||| | |||||||| | | | |||| ||||||| | |||||| ATTACCTGGG GGTTAAAAGA GGTAACCTTG AGTAAATTAG TGGGTTATTA T-TCCATCTT 624 TCATACTTAA TTATATGCTA ATTAGGGTAA AAGAAAGAAG GTTTGAATAA GAAAAAGAAA 17425 | || ||||| |||||||||| |||||||||| ||| ||| || | |||||||| |||||||||| TTATTCTTAA TTATATGCTA ATTAGGGTAA AAG-AAGGAG GGTTGAATAA GAAAAAGAAA 683 AGAACAGAAA GAGAGGGAGA AACGATCGA 17396 |||||||||| ||||| || | ||||| AGAACAGAAA GAGAGAGAAG GAGAATCGA 712 hqPGS_C06HBa0153O03.1-1-_SGN-E550211- (17959 17396) ******************************************************************************** EST sequence 28 -strand 714 n (File: SGN-E550025-) 1 TTTTTTTTTT TTTTATGAAT TAGCTCAATG AAAAATGAGT AAATTTTTTA TATTTTATGG 61 CATAATTTTT TCATTAATTC ATGGTTGAGA AAATCTTTGT TTCTAATAGT GTTATAAATC 121 GTAAAATAAT AATAATATTT AAGTAGGAAG AACATAAATG TAACGACCTG TTTAGTCGTT 181 TTGAGTAGCA GATTTTATTT TTGGAAAAAC AGGTTGAGAC GACGGAACCC ACGACGGACC 241 GTCATGAGCA CGATGGACCG TCGAGGAGTC TCGTTTCAAA ACACTTAGAA ATTCTGAAAT 301 TGGGTACTAA AAATCGACTC TCTGAACTTC GTAACGGAAT GGCACGACGG ACCGTCACGG 361 GCGTGACGGA CCGTCACAGA CTCTTTGGTG GAAATTGAGT CTCTGAACCT TGCGACGACC 421 TGCAGGACGG ACCGTCGCAG GCACGACGGG CCATCACAGG TTGCGTAATC CCAGTCTGGG 481 TCGGATTTCT TTACACGTTT TAAGGGACGT TTTGGACTAT TCCTACTTTA ATTATAAAGT 541 TAGTGGGTTT ATGTTAATAA GTCTAATTAC CTGGGGGTTA AAAGAGGTAA CCTTGAGTAA 601 ATTAGTGGGT TATTATTCCA TCTTTTATTC TTAATTATAT GCTAATTAGG GTAAAAGAAG 661 GAGGGTTTGA ATAAGAAAAA GAAAAGAACA GAAAGAGAGA GAAGGAGAAT CGAT Predicted gene structure (within gDNA segment 24379 to 16660): Exon 1 21799 21794 ( 6 n); cDNA 84 89 ( 6 n); score: 0.833 Intron 1 21793 18575 (3219 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.60) Exon 2 18574 18514 ( 61 n); cDNA 90 151 ( 62 n); score: 0.607 Intron 2 18513 17960 ( 554 n); Pd: 0.845 (s: 0.56), Pa: 0.000 (s: 0.90) Exon 3 17959 17396 ( 564 n); cDNA 152 713 ( 562 n); score: 0.841 PPA cDNA 16 1 MATCH C06HBa0153O03.1-1- SGN-E550025- 0.818 631 0.884 C PGS_C06HBa0153O03.1-1-_SGN-E550025- (21799 21794,18574 18514,17959 17396) Alignment (genomic DNA sequence = upper lines): GTTGGGGTGG TAATTCCTCA TCTTGCACCT GTTTATTTTC CCCTTCCTCA CCCTCTCTTA 21740 |||| | GTTGAG.... .......... .......... .......... .......... .......... 89 CTACCTCATC AGTCGGTGGA GGAGTCACCA CCCTATTACT AGCTTGACCA GGTGTTTGTC 21680 .......... .......... .......... .......... .......... .......... 89 CTCCACCTCT AGAGATCGTC CTCTTGCGAC CTCTACCACG ACCTCTTGCC ACTGCTCCTC 21620 .......... .......... .......... .......... .......... .......... 89 CTCGAGCTAC AACCCCAATG TTTGGCTCAG ACGCACGCTA TCTTGCCGGT GTTGGTGTTG 21560 .......... .......... .......... .......... .......... .......... 89 GCACAGTTGT TTCTCTAGTT CTAACCATAT GCGAAATAGA GTGAGGATGT CAGATACCAA 21500 .......... .......... .......... .......... .......... .......... 89 TTTGTATCAC CTAGATACCA CTTGGATCCA AGTAATAGCA CGAAAGAAGG AAAGAATGGA 21440 .......... .......... .......... .......... .......... .......... 89 ATTTTGCTAA AGTCCTATAG CCTCTCGAAG AAAAGTAAGG GCGTCCCCCT ACCGTTCCTC 21380 .......... .......... .......... .......... .......... .......... 89 AAGACTCTAC TAGACTTGTT CTTGTGTGAT GAGACCAACG AACCTAACGC TCTGATACCA 21320 .......... .......... .......... .......... .......... .......... 89 AGTTTGTCAC GACCCAAAAC GATCCGTAAG TGGCACCCAC CCTTACTCTC CTAGGTGAGC 21260 .......... .......... .......... .......... .......... .......... 89 GAACCAACAA ATCTAAACCC CAACATTTAC CAGTATATCA ACTATAAATA ATATAAATAA 21200 .......... .......... .......... .......... .......... .......... 89 TGCGGAAGCT CCAAAACTCA TTACGAAATT AATTAAATCA ACATCTAAAG TTAAATACTT 21140 .......... .......... .......... .......... .......... .......... 89 ATTATTCCCA AAATCTGTAA GTCATCACAC CAAGAACATC TATCCTCGAA TTTCTAAATC 21080 .......... .......... .......... .......... .......... .......... 89 TAAGAGTATT CAAGAAGCTA AAAATAGTAA AAAGATGGTC CATGTCCGAA CTTCAAGACA 21020 .......... .......... .......... .......... .......... .......... 89 TCAAGACGTG AAGGAGAGAA TCCAGCACGA GCTAGGAATA ATAGCTCACC CTGAATTCTG 20960 .......... .......... .......... .......... .......... .......... 89 ATATGCTAAA GACCGGCTAG ATCTGATGAC GAGTCGAAGT CGATGGCACG CTTGCTGCAC 20900 .......... .......... .......... .......... .......... .......... 89 TCCACAAATA ACAAAGAAGA AAATTACAAG TAGGGGTCAG TACAAGGAAC ACGTACTGAG 20840 .......... .......... .......... .......... .......... .......... 89 TAGGTATCAT CGGCCAACTC AAAATAGAAA ACAATATATA CTGAATAATA ATATAAAATC 20780 .......... .......... .......... .......... .......... .......... 89 AACCATAATA CTTAACAGGT GACAATCAAC AAGTATAAGA ACCATTGACA ACAACAGCAA 20720 .......... .......... .......... .......... .......... .......... 89 GCACATCTAT GAGGACTCAA GCCTCCACAC CATACTCATT TGGGAAATAG GTTCTTTGAA 20660 .......... .......... .......... .......... .......... .......... 89 TTTGAGTACA TTAACATAAT TCAAGATTCA TTCTCTTTAT CATTATCGTG TCGGAACGTT 20600 .......... .......... .......... .......... .......... .......... 89 ACACCCGATC CCCTACTACT ACCGTGTCGG AACGTGACAC TCTGATCCCC TAATACTACC 20540 .......... .......... .......... .......... .......... .......... 89 GTGTCAGAAC GTGACACCCG ATCCCCTAAT ACTACCGTGT CAGAATGTGA CACTCCGATC 20480 .......... .......... .......... .......... .......... .......... 89 CCCTAATACT ACCGTGTCGA AACATGACAC CCAATCCATT TATCTCATTA TTTTAGTTCA 20420 .......... .......... .......... .......... .......... .......... 89 TCAAGCCTTC TTTATGTCAA GGCGCCATCT TAATAGAGAG GATTTAAGAT TGAAGATTCA 20360 .......... .......... .......... .......... .......... .......... 89 ACAGTTTCAT CATTCTGACC ACCACAATTA CACAATCACA ACATACAAAC ACACAATCAA 20300 .......... .......... .......... .......... .......... .......... 89 GCATATAGAA GACTTTACAA TACCACCCAA TACATATCGA TCACTATTTA GAGTTTATCT 20240 .......... .......... .......... .......... .......... .......... 89 ATCATATATA AATAAATCAT AACCTACCTC CACTGAAGAA TCGTGATCAA GCAAGCTACC 20180 .......... .......... .......... .......... .......... .......... 89 TTCCCAATGC CTTTGCTTTC CTCTTCGTTC TCTCTTTCTC GCTCGTTCTC CCTCTGTGTT 20120 .......... .......... .......... .......... .......... .......... 89 TCTTTTTATT TTTCTTACTC AAAATCTTGT TCTTTTACCC TAAATGTCAT ATAATCAATT 20060 .......... .......... .......... .......... .......... .......... 89 ATAAAAGATG ATAAAAGTAC CTCACTATTT ATTCCCTTAT TAACTTCTTT AACCCCCAAG 20000 .......... .......... .......... .......... .......... .......... 89 TAAATAAATT ATTAAACTTA CCCCACTAAT TCCATAATTA TAATCATGAA TAGTCCAAAA 19940 .......... .......... .......... .......... .......... .......... 89 CACCCCTTTA AAACTTTTAG CAGAAATCCG ACCCAGTCGA GGTTACGCAG CTTGTGACGG 19880 .......... .......... .......... .......... .......... .......... 89 TCCGTTGTGT CTACGACGGT CCGTGCTGTA GTTCCGTCGC GGAGTTCAGA GAGTCGCTCC 19820 .......... .......... .......... .......... .......... .......... 89 CAGTACCCAG ATTTTCAGAG TTGAAGTGTT TTGGAACGGA GACGCTCGAC GGACCGTCGT 19760 .......... .......... .......... .......... .......... .......... 89 GCCTGTGACG GTTTGTCCTA CCTGCCGTCG AGGGTAATGA GGAGAGCAAC AGAAGAAATT 19700 .......... .......... .......... .......... .......... .......... 89 ACACAAGTAT GGGACGACGG AGTCCATCAC GGTCCATCGT GACCATGACG GTCCGTCGTG 19640 .......... .......... .......... .......... .......... .......... 89 ACCATGACGG TCCGTCGCGT GATCCGTCGA CCCAGTCAGT TTTTTATCAA AAATAGTTCT 19580 .......... .......... .......... .......... .......... .......... 89 ACTGCTCGAA CCGACTAAAC AGGTCATTAC AATTTTCCTA CTTTAGTTTT CCCTATGGCT 19520 .......... .......... .......... .......... .......... .......... 89 ACCACTGTCC ATCTACTATT TTTTTTCATG ATTTGATCTT TTAAGTAAAA TTATTTGTGG 19460 .......... .......... .......... .......... .......... .......... 89 AACTTCTCTT TGAAGATATC TCTCACAAAT TAGCGAAAAA GTTAGTTAAT TTATTTTTAT 19400 .......... .......... .......... .......... .......... .......... 89 TTTAAAAAGT AAGATAAATG TTTTTGCGCT ATCAATATTT TATAATATGT AAATTATTCG 19340 .......... .......... .......... .......... .......... .......... 89 AAACAAACTT TTAATAAATA AAAATTAGAG CAAATATAAA AATCACTAGA TTTTTTTTTA 19280 .......... .......... .......... .......... .......... .......... 89 AAAAAATGGG GCCTTGAAAC GGTATATATT TTTTTATTTG AATAGATTAT GGGGGAGAAT 19220 .......... .......... .......... .......... .......... .......... 89 TAATAGAGGT AAGATTTTTT ATTTTATCTA ATAAGAAAAT GACAAATAAT ATATTTTTAA 19160 .......... .......... .......... .......... .......... .......... 89 AAAATAAATA AAACAAAATA AACTTTGGTT GTTAGTCATA AAAATATAAG TTATTCAAAA 19100 .......... .......... .......... .......... .......... .......... 89 AGGTGAATGA AAGAGTATAA GTGAGTCAAA AAGATGAGTG AAGAGGCATA GCTAAGCCAA 19040 .......... .......... .......... .......... .......... .......... 89 AAAAGTGAAT GTAAGGGTAT TTCTAGACCA AAAGATTGAT GAAGGATATT TTTAGACATA 18980 .......... .......... .......... .......... .......... .......... 89 GTTCAAGGAT AGTTTTGGTC CTTTTTCGTT TAAATAATCT CATATTTAGA TATTGAGTTA 18920 .......... .......... .......... .......... .......... .......... 89 TTTGCAGGGA CGATTCAATA TAATTCGAGG CCTAAATTTT AAATAAACTC TATCTGTATT 18860 .......... .......... .......... .......... .......... .......... 89 TATTTATTTT TCTTTTTAGA TGTAAATTGT ATTTATTTAT TTTTCTTTTT AGATGTAAGT 18800 .......... .......... .......... .......... .......... .......... 89 TATTACTTAA GTATCTTTTT TTGTAAATGG AAAAGGGCTA AAAATGCCCT TAACTTAGTG 18740 .......... .......... .......... .......... .......... .......... 89 GAAATGGTTC AAAATACCAT CCTTCTACCT TTTGAGTTAA AAATACCCTC CACCTTTATT 18680 .......... .......... .......... .......... .......... .......... 89 TTGGTTCAAA GATGCCTTTC CTTCCACCTT TTGATTTAAT AATACTCTTA ACCCCCCATT 18620 .......... .......... .......... .......... .......... .......... 89 TAATTAAATT TATAAAATAA AAAATTCTTA ATATTAGCTC ATTCCAAAAT CTTTATGATA 18560 ||||| |||| | .......... .......... .......... .......... .....AAAAT CTTTGTTTCT 104 AATA-T-ATC TAAAAAATAA AATAAAAAAT TTATTATATG TATAAAAAGC AAAAATAAAA 18502 |||| | | |||| ||| ||||| || |||| || | || || | AATAGTGTTA TAAATCGTAA AATAATAATA ATATT-TAAG TAGGAAGA.. .......... 151 ATAAAATTTC TCAAAGTTCT TATTCTTTGT ATTAAAATAA TAAGACAATA AAAATCTTAA 18442 .......... .......... .......... .......... .......... .......... 151 GATTCTTATT CTTCATTTTT GCGCAAAAAA ATCTTTATTT TATTTTATGT TTTATACATA 18382 .......... .......... .......... .......... .......... .......... 151 TTATTTAATA TTTTAATTTG TGAGAAATTT TTTTAAGTTA TTTGGATTAA ATTTTTAAAT 18322 .......... .......... .......... .......... .......... .......... 151 TATATTGAGA AAATGCACAA GTATTCCCTC AAACTATGTC TGAAATCCCA GAGACACACT 18262 .......... .......... .......... .......... .......... .......... 151 TATACTATAT TAAGGTCATA TTACCCCCTG AACTTATTTT ATAAGTAATT TTCTACCCCT 18202 .......... .......... .......... .......... .......... .......... 151 TTTGACCTAC GTGGCTCTAG CTTGAAAAAA AAGTCAATCA GCGTTGGACC CACAAGATAG 18142 .......... .......... .......... .......... .......... .......... 151 TGCCACATAG ACCGAAAAGG GCTAGAAAAT TATTAATAAA ATAAGTTCAG GGATAATAGG 18082 .......... .......... .......... .......... .......... .......... 151 ACCTTAGTAT AGTGTAAGTA TGACTTTAAA ATTTCAGGCA TAAATTGAGA GGGTACTTGT 18022 .......... .......... .......... .......... .......... .......... 151 GCATTATCTC AATAATATTC AAATCTTTAC ATTAATATCT AATTTGATGT AATATTTTAA 17962 .......... .......... .......... .......... .......... .......... 151 TAATAATAAT GTAACGACCT ATTTAGTCGT TTTGAGCAGC AGATTTTATT TTTGGAAAAA 17902 | | ||| |||||||||| ||||||||| |||||| ||| |||||||||| |||||||||| ..ACATAAAT GTAACGACCT GTTTAGTCGT TTTGAGTAGC AGATTTTATT TTTGGAAAAA 209 CTGGCTGAGA CGACGGATCC CACGATGGAC CGTCATGGGC ACGATGGACC GTCGAGGGGG 17842 | || ||||| ||||||| || ||||| |||| ||||||| || |||||||||| ||||| || | CAGGTTGAGA CGACGGAACC CACGACGGAC CGTCATGAGC ACGATGGACC GTCGA-GGAG 268 TCTCGTTCCA AAATACATAG -AATTCTGAA ATTTGGGTTT TGAAATCGAC TCTCTGAACT 17783 ||||||| || ||| || ||| ||||||||| ||| || | |||||||| |||||||||| TCTCGTTTCA AAACACTTAG AAATTCTGAA ATTGGGTACT AAAAATCGAC TCTCTGAACT 328 TCGTGATGAA GTGGCAGGAC GGACCGTCAC AGGCATGACG GGCCGTCACA GTCTCTTCAG 17723 |||| | | | ||||| ||| |||||||||| ||| ||||| | |||||||| | ||||| | TCGTAACGGA ATGGCACGAC GGACCGTCAC GGGCGTGACG GACCGTCACA GACTCTTTGG 388 -AAAATTTCA GTCTCTGAAC TCTGTGACGG AAGCAGCAGG ACGGACCGTC GCAGGCACGA 17664 || || | |||||||||| || ||| | | | ||||| |||||||||| |||||||||| TGGAAATTGA GTCTCTGAAC CTTGCGAC-G -ACCTGCAGG ACGGACCGTC GCAGGCACGA 446 CGACCCGTCA CAGACTGCGT AATCCCAGGC TGAGTCGGAT TTCTTTAAAT GTTTTAAGGG 17604 || || ||| ||| ||||| |||||||| | || ||||||| ||||||| | ||||||| || CGGGCCATCA CAGGTTGCGT AATCCCAGTC TGGGTCGGAT TTCTTTACAC GTTTTAA-GG 505 GGCGTTTTGG ACTATTCCTG CTATAATTAT AAATTTAGTG GGTTAATGTT AATAA-TTTA 17545 | |||||||| ||||||||| || ||||||| ||| |||||| |||| ||||| ||||| | || GACGTTTTGG ACTATTCCTA CTTTAATTAT AAAGTTAGTG GGTTTATGTT AATAAGTCTA 565 ACTACTTGAG GGTTAAAAGA GATAACCTTG AATTAGTTAG TGGGTTAAAC TCATCATCTT 17485 | ||| || | |||||||||| | |||||||| | | | |||| ||||||| | |||||| ATTACCTGGG GGTTAAAAGA GGTAACCTTG AGTAAATTAG TGGGTTATTA T-TCCATCTT 624 TCATACTTAA TTATATGCTA ATTAGGGTAA AAGAAAGAAG GTTTGAATAA GAAAAAGAAA 17425 | || ||||| |||||||||| |||||||||| ||||| || | |||||||||| |||||||||| TTATTCTTAA TTATATGCTA ATTAGGGTAA AAGAAGGAGG GTTTGAATAA GAAAAAGAAA 684 AGAACAGAAA GAGAGGGAGA AACGATCGA 17396 |||||||||| ||||| || | ||||| AGAACAGAAA GAGAGAGAAG GAGAATCGA 713 hqPGS_C06HBa0153O03.1-1-_SGN-E550025- (17959 17396) ******************************************************************************** EST sequence 61 -strand 711 n (File: SGN-E396056-) 1 TTTTTTTTTT ATGAATTAGC TCAAAGAAAA AATGAGTAAA TTTTTATTAT TTTATGGCAT 61 AATTTTTTCA TTAATTCATG GGTGAGAAAA TTTTTGTTTC TAATAGTGTT ATAAATCGTA 121 AAATAATAAT AATATTTAAG TAGGAAGAAC ATAAATGTAA CGACCTGTTT AGTCGTTTTG 181 AGTAGCAGAT TTTATTTTTG GAAAAACAGG TTGAGACGAC GGAACCCACG ACGGACCGTC 241 ATGAGCACGA TGGACCGTCG AGGAGTCTCG TTTCAAAACA CTTAGAAATT CTGAAATTGG 301 GTACTAAAAA TCGACTCTCT GAACTTCGTA ACGGAATGGC ACGACGGACC GTCACGGGCG 361 TGACGGACCG TCACAGACTC TTTGGTGGAA ATTGAGTCTC TGAACCTTGC GACGACCTGC 421 AGGACGGACC GTCGCAGGCA CGACGGGCCA TCACAGGTTG CGTAATCCCA GTCTGGGTCG 481 GATTTCTTTA CACGTTTTAA GGGACGTTTT GGACTATTCC TACTTTAATT ATAAAGTTAG 541 TGGGTTTATG TTAATAAGTC TAATTACCTG GGGGTTAAAA GAGGTAACCT TGAGTAAATT 601 AGTGGGTTAT TATTCCATCT TTTATTCTTA ATTATATGCT AATTAGGGTA AAAGAAGGAG 661 GGTTTGAATA AGAAAAAGAA AAGAACAGAA AGAGAGAGAA GGAGAATCGA T Predicted gene structure (within gDNA segment 24349 to 16660): Exon 1 19231 19212 ( 20 n); cDNA 78 96 ( 19 n); score: 0.650 Intron 1 19211 18569 ( 643 n); Pd: 0.964 (s: 0), Pa: 0.000 (s: 0.66) Exon 2 18568 18514 ( 55 n); cDNA 97 148 ( 52 n); score: 0.655 Intron 2 18513 17960 ( 554 n); Pd: 0.845 (s: 0.64), Pa: 0.000 (s: 0.90) Exon 3 17959 17396 ( 564 n); cDNA 149 710 ( 562 n); score: 0.841 PPA cDNA 12 1 MATCH C06HBa0153O03.1-1- SGN-E396056- 0.825 639 0.899 C PGS_C06HBa0153O03.1-1-_SGN-E396056- (19231 19212,18568 18514,17959 17396) Alignment (genomic DNA sequence = upper lines): ATGGGGGAGA ATTAATAGAG GTAAGATTTT TTATTTTATC TAATAAGAAA ATGACAAATA 19172 ||||| |||| | | | | ATGGGTGAGA A-AATTTTTG .......... .......... .......... .......... 96 ATATATTTTT AAAAAATAAA TAAAACAAAA TAAACTTTGG TTGTTAGTCA TAAAAATATA 19112 .......... .......... .......... .......... .......... .......... 96 AGTTATTCAA AAAGGTGAAT GAAAGAGTAT AAGTGAGTCA AAAAGATGAG TGAAGAGGCA 19052 .......... .......... .......... .......... .......... .......... 96 TAGCTAAGCC AAAAAAGTGA ATGTAAGGGT ATTTCTAGAC CAAAAGATTG ATGAAGGATA 18992 .......... .......... .......... .......... .......... .......... 96 TTTTTAGACA TAGTTCAAGG ATAGTTTTGG TCCTTTTTCG TTTAAATAAT CTCATATTTA 18932 .......... .......... .......... .......... .......... .......... 96 GATATTGAGT TATTTGCAGG GACGATTCAA TATAATTCGA GGCCTAAATT TTAAATAAAC 18872 .......... .......... .......... .......... .......... .......... 96 TCTATCTGTA TTTATTTATT TTTCTTTTTA GATGTAAATT GTATTTATTT ATTTTTCTTT 18812 .......... .......... .......... .......... .......... .......... 96 TTAGATGTAA GTTATTACTT AAGTATCTTT TTTTGTAAAT GGAAAAGGGC TAAAAATGCC 18752 .......... .......... .......... .......... .......... .......... 96 CTTAACTTAG TGGAAATGGT TCAAAATACC ATCCTTCTAC CTTTTGAGTT AAAAATACCC 18692 .......... .......... .......... .......... .......... .......... 96 TCCACCTTTA TTTTGGTTCA AAGATGCCTT TCCTTCCACC TTTTGATTTA ATAATACTCT 18632 .......... .......... .......... .......... .......... .......... 96 TAACCCCCCA TTTAATTAAA TTTATAAAAT AAAAAATTCT TAATATTAGC TCATTCCAAA 18572 .......... .......... .......... .......... .......... .......... 96 ATCTTTATGA TAAATATATC TAAAAAATAA AATAAAAAAT TTATTATATG TATAAAAAGC 18512 ||| | | | | | | | |||| ||| ||||| || |||| || | || || | ...TTTCTAA T-AGTGT-TA TAAATCGTAA AATAATAATA ATATT-TAAG TAGGAAGA.. 148 AAAAATAAAA ATAAAATTTC TCAAAGTTCT TATTCTTTGT ATTAAAATAA TAAGACAATA 18452 .......... .......... .......... .......... .......... .......... 148 AAAATCTTAA GATTCTTATT CTTCATTTTT GCGCAAAAAA ATCTTTATTT TATTTTATGT 18392 .......... .......... .......... .......... .......... .......... 148 TTTATACATA TTATTTAATA TTTTAATTTG TGAGAAATTT TTTTAAGTTA TTTGGATTAA 18332 .......... .......... .......... .......... .......... .......... 148 ATTTTTAAAT TATATTGAGA AAATGCACAA GTATTCCCTC AAACTATGTC TGAAATCCCA 18272 .......... .......... .......... .......... .......... .......... 148 GAGACACACT TATACTATAT TAAGGTCATA TTACCCCCTG AACTTATTTT ATAAGTAATT 18212 .......... .......... .......... .......... .......... .......... 148 TTCTACCCCT TTTGACCTAC GTGGCTCTAG CTTGAAAAAA AAGTCAATCA GCGTTGGACC 18152 .......... .......... .......... .......... .......... .......... 148 CACAAGATAG TGCCACATAG ACCGAAAAGG GCTAGAAAAT TATTAATAAA ATAAGTTCAG 18092 .......... .......... .......... .......... .......... .......... 148 GGATAATAGG ACCTTAGTAT AGTGTAAGTA TGACTTTAAA ATTTCAGGCA TAAATTGAGA 18032 .......... .......... .......... .......... .......... .......... 148 GGGTACTTGT GCATTATCTC AATAATATTC AAATCTTTAC ATTAATATCT AATTTGATGT 17972 .......... .......... .......... .......... .......... .......... 148 AATATTTTAA TAATAATAAT GTAACGACCT ATTTAGTCGT TTTGAGCAGC AGATTTTATT 17912 | | ||| |||||||||| ||||||||| |||||| ||| |||||||||| .......... ..ACATAAAT GTAACGACCT GTTTAGTCGT TTTGAGTAGC AGATTTTATT 196 TTTGGAAAAA CTGGCTGAGA CGACGGATCC CACGATGGAC CGTCATGGGC ACGATGGACC 17852 |||||||||| | || ||||| ||||||| || ||||| |||| ||||||| || |||||||||| TTTGGAAAAA CAGGTTGAGA CGACGGAACC CACGACGGAC CGTCATGAGC ACGATGGACC 256 GTCGAGGGGG TCTCGTTCCA AAATACATAG -AATTCTGAA ATTTGGGTTT TGAAATCGAC 17793 ||||| || | ||||||| || ||| || ||| ||||||||| ||| || | |||||||| GTCGA-GGAG TCTCGTTTCA AAACACTTAG AAATTCTGAA ATTGGGTACT AAAAATCGAC 315 TCTCTGAACT TCGTGATGAA GTGGCAGGAC GGACCGTCAC AGGCATGACG GGCCGTCACA 17733 |||||||||| |||| | | | ||||| ||| |||||||||| ||| ||||| | |||||||| TCTCTGAACT TCGTAACGGA ATGGCACGAC GGACCGTCAC GGGCGTGACG GACCGTCACA 375 GTCTCTTCAG -AAAATTTCA GTCTCTGAAC TCTGTGACGG AAGCAGCAGG ACGGACCGTC 17674 | ||||| | || || | |||||||||| || ||| | | | ||||| |||||||||| GACTCTTTGG TGGAAATTGA GTCTCTGAAC CTTGCGAC-G -ACCTGCAGG ACGGACCGTC 433 GCAGGCACGA CGACCCGTCA CAGACTGCGT AATCCCAGGC TGAGTCGGAT TTCTTTAAAT 17614 |||||||||| || || ||| ||| ||||| |||||||| | || ||||||| ||||||| | GCAGGCACGA CGGGCCATCA CAGGTTGCGT AATCCCAGTC TGGGTCGGAT TTCTTTACAC 493 GTTTTAAGGG GGCGTTTTGG ACTATTCCTG CTATAATTAT AAATTTAGTG GGTTAATGTT 17554 ||||||| || | |||||||| ||||||||| || ||||||| ||| |||||| |||| ||||| GTTTTAA-GG GACGTTTTGG ACTATTCCTA CTTTAATTAT AAAGTTAGTG GGTTTATGTT 552 AATAA-TTTA ACTACTTGAG GGTTAAAAGA GATAACCTTG AATTAGTTAG TGGGTTAAAC 17495 ||||| | || | ||| || | |||||||||| | |||||||| | | | |||| ||||||| AATAAGTCTA ATTACCTGGG GGTTAAAAGA GGTAACCTTG AGTAAATTAG TGGGTTATTA 612 TCATCATCTT TCATACTTAA TTATATGCTA ATTAGGGTAA AAGAAAGAAG GTTTGAATAA 17435 | |||||| | || ||||| |||||||||| |||||||||| ||||| || | |||||||||| T-TCCATCTT TTATTCTTAA TTATATGCTA ATTAGGGTAA AAGAAGGAGG GTTTGAATAA 671 GAAAAAGAAA AGAACAGAAA GAGAGGGAGA AACGATCGA 17396 |||||||||| |||||||||| ||||| || | ||||| GAAAAAGAAA AGAACAGAAA GAGAGAGAAG GAGAATCGA 710 hqPGS_C06HBa0153O03.1-1-_SGN-E396056- (17959 17396) ******************************************************************************** EST sequence 13 -strand 709 n (File: SGN-E550207-) 1 TTTTTTTTTA TGAATTAGCT CAATGAAAAA TGAGTAAATT TTTTATATTT TATGGCATAA 61 TTTTTTCATT AATTCATGGT GNAGAAAATC TTTGTTTCTA ATAGTGTTAT AAATCGTAAA 121 ATAATAATAA TATTTAAGTA GGAAGAACAT AAATGTAACG ACCTGTTTAG TCGTTTTGAG 181 TAGCAGATTT TATTTTTGGA AAAACAGGTT GAGACGACGG AACCCACGAC GGACCGTCAT 241 GAGCACGATG GACCGTCGAG GAGTCTCGTT TCAAAACACT TAGAAATTCT GAAATTGGGT 301 ACTAAAAATC GACTCTCTGA ACTTCGTAAC GGAATGGCAC GACGGACCGT CACGGGCGTG 361 ACGGACCGTC ACAGACTCTT TGGTGGAAAT TGAGTCTCTG AACCTTGCGA CGACCTGCAG 421 GACGGACCGT CGCAGGCACG ACGGGCCATC ACAGGTTGCG TAATCCCAGT CTGGGTCGGA 481 TTTCTTTACA CGTTTTAAGG GACGTTTTGG ACTATTCCTA CTTTAATTAT AAAGTTAGTG 541 GGTTTATGTT AATAAGTCTA ATTACCTGGG GGTTAAAAGA GGTAACCTTG AGTAAATTAG 601 TGGGTTATTA TTCCATCTTT TATTCTTAAT TATATGCTAA TTAGGGTAAA AGAAGGAGGG 661 TTTGAATAAG AAAAAGAAAA GAACAGAAAG AGAGAGAAGG AGAATCGAT Predicted gene structure (within gDNA segment 24329 to 16660): Exon 1 18574 18514 ( 61 n); cDNA 85 146 ( 62 n); score: 0.607 Intron 1 18513 17960 ( 554 n); Pd: 0.845 (s: 0.56), Pa: 0.000 (s: 0.90) Exon 2 17959 17396 ( 564 n); cDNA 147 708 ( 562 n); score: 0.841 PPA cDNA 11 1 MATCH C06HBa0153O03.1-1- SGN-E550207- 0.818 625 0.882 C PGS_C06HBa0153O03.1-1-_SGN-E550207- (18574 18514,17959 17396) Alignment (genomic DNA sequence = upper lines): AAAATCTTTA TGATAAATA- T-ATCTAAAA AATAAAATAA AAAATTTATT ATATGTATAA 18517 ||||||||| | |||| | | |||| |||||||| || |||| || ||| | AAAATCTTTG TTTCTAATAG TGTTATAAAT CGTAAAATAA TAATAATATT -TAAGTAGGA 143 AAAGCAAAAA TAAAAATAAA ATTTCTCAAA GTTCTTATTC TTTGTATTAA AATAATAAGA 18457 | | AGA....... .......... .......... .......... .......... .......... 146 CAATAAAAAT CTTAAGATTC TTATTCTTCA TTTTTGCGCA AAAAAATCTT TATTTTATTT 18397 .......... .......... .......... .......... .......... .......... 146 TATGTTTTAT ACATATTATT TAATATTTTA ATTTGTGAGA AATTTTTTTA AGTTATTTGG 18337 .......... .......... .......... .......... .......... .......... 146 ATTAAATTTT TAAATTATAT TGAGAAAATG CACAAGTATT CCCTCAAACT ATGTCTGAAA 18277 .......... .......... .......... .......... .......... .......... 146 TCCCAGAGAC ACACTTATAC TATATTAAGG TCATATTACC CCCTGAACTT ATTTTATAAG 18217 .......... .......... .......... .......... .......... .......... 146 TAATTTTCTA CCCCTTTTGA CCTACGTGGC TCTAGCTTGA AAAAAAAGTC AATCAGCGTT 18157 .......... .......... .......... .......... .......... .......... 146 GGACCCACAA GATAGTGCCA CATAGACCGA AAAGGGCTAG AAAATTATTA ATAAAATAAG 18097 .......... .......... .......... .......... .......... .......... 146 TTCAGGGATA ATAGGACCTT AGTATAGTGT AAGTATGACT TTAAAATTTC AGGCATAAAT 18037 .......... .......... .......... .......... .......... .......... 146 TGAGAGGGTA CTTGTGCATT ATCTCAATAA TATTCAAATC TTTACATTAA TATCTAATTT 17977 .......... .......... .......... .......... .......... .......... 146 GATGTAATAT TTTAATAATA ATAATGTAAC GACCTATTTA GTCGTTTTGA GCAGCAGATT 17917 | | |||||||| ||||| |||| |||||||||| | |||||||| .......... .......ACA TAAATGTAAC GACCTGTTTA GTCGTTTTGA GTAGCAGATT 189 TTATTTTTGG AAAAACTGGC TGAGACGACG GATCCCACGA TGGACCGTCA TGGGCACGAT 17857 |||||||||| |||||| || |||||||||| || ||||||| ||||||||| || ||||||| TTATTTTTGG AAAAACAGGT TGAGACGACG GAACCCACGA CGGACCGTCA TGAGCACGAT 249 GGACCGTCGA GGGGGTCTCG TTCCAAAATA CATAG-AATT CTGAAATTTG GGTTTTGAAA 17798 |||||||||| || |||||| || ||||| | | ||| |||| |||||||| | | | ||| GGACCGTCGA -GGAGTCTCG TTTCAAAACA CTTAGAAATT CTGAAATTGG GTACTAAAAA 308 TCGACTCTCT GAACTTCGTG ATGAAGTGGC AGGACGGACC GTCACAGGCA TGACGGGCCG 17738 |||||||||| ||||||||| | | | |||| | |||||||| ||||| ||| |||||| ||| TCGACTCTCT GAACTTCGTA ACGGAATGGC ACGACGGACC GTCACGGGCG TGACGGACCG 368 TCACAGTCTC TTCAG-AAAA TTTCAGTCTC TGAACTCTGT GACGGAAGCA GCAGGACGGA 17679 |||||| ||| || | || || |||||| ||||| || ||| | | | |||||||||| TCACAGACTC TTTGGTGGAA ATTGAGTCTC TGAACCTTGC GAC-G-ACCT GCAGGACGGA 426 CCGTCGCAGG CACGACGACC CGTCACAGAC TGCGTAATCC CAGGCTGAGT CGGATTTCTT 17619 |||||||||| ||||||| | | |||||| |||||||||| ||| ||| || |||||||||| CCGTCGCAGG CACGACGGGC CATCACAGGT TGCGTAATCC CAGTCTGGGT CGGATTTCTT 486 TAAATGTTTT AAGGGGGCGT TTTGGACTAT TCCTGCTATA ATTATAAATT TAGTGGGTTA 17559 || | ||||| || ||| ||| |||||||||| |||| || || |||||||| | ||||||||| TACACGTTTT AA-GGGACGT TTTGGACTAT TCCTACTTTA ATTATAAAGT TAGTGGGTTT 545 ATGTTAATAA -TTTAACTAC TTGAGGGTTA AAAGAGATAA CCTTGAATTA GTTAGTGGGT 17500 |||||||||| | ||| ||| || |||||| |||||| ||| |||||| | | ||||||||| ATGTTAATAA GTCTAATTAC CTGGGGGTTA AAAGAGGTAA CCTTGAGTAA ATTAGTGGGT 605 TAAACTCATC ATCTTTCATA CTTAATTATA TGCTAATTAG GGTAAAAGAA AGAAGGTTTG 17440 || | | |||||| || |||||||||| |||||||||| |||||||||| || |||||| TATTAT-TCC ATCTTTTATT CTTAATTATA TGCTAATTAG GGTAAAAGAA GGAGGGTTTG 664 AATAAGAAAA AGAAAAGAAC AGAAAGAGAG GGAGAAACGA TCGA 17396 |||||||||| |||||||||| |||||||||| || | | |||| AATAAGAAAA AGAAAAGAAC AGAAAGAGAG AGAAGGAGAA TCGA 708 hqPGS_C06HBa0153O03.1-1-_SGN-E550207- (17959 17396) ******************************************************************************** EST sequence 26 -strand 713 n (File: SGN-E550464-) 1 TTTTTTTTTT TTTTATGAAT TAGCTCAATG AAAAATGAGT AAATTTTTTA TATCTTATGG 61 CATAATTTTT CATTAATTCA TGGTAGAGAA AATCTTTGTT TCTAATAGTG TTATAAATCG 121 TAAAATAATA ATAATATTTA AGTAGGAAGA ACATAAATGT ANCGACCTGT TTAGTCGTTT 181 TGAGTAGCAG ATTTTATTTT TGGAAAAACA GGTTGAGACG ACGGAACCCA CGACGGACCG 241 TCATGAGCAC GATGGACCGT CGAGGAGTCT CGTTTCAAAA CACTTAGAAA TTCTGAAATT 301 GGGTACTAAA AATCGACTCT CTGAACTTCG TAACGGAATG GCACGACGGA CCGTCACGGG 361 CGTGACGGAC CGTCACAGAC TCTTTGGTGG AAATTGAGTC TCTGAACCTT GCGACGACCT 421 GCAGGACGGA CCGTCGCAGG CACGACGGGC CATCACAGGT TGCGTAATCC CAGTCTGGGT 481 CGGATTTCTT TACACGTTTT AAGGGACGTT TTGGACTATT CCTACTTTAA TTATAAAGTT 541 AGTGGGTTTA TGTTAATAAG TCTAATTACC TGGGGGTTAA AAGAGGTAAC CTTGAGTAAA 601 TTAGTGGGTT ATTATTCCAT CTTTTATTCT TAATTATATG CTAATTAGGG TAAAAGAAGG 661 AGGGTTTGAA TAAGAAAAAG AAAAGAACAG AAAGAGAGAG AAGGAGAATC GAT Predicted gene structure (within gDNA segment 24369 to 16660): Exon 1 18574 18514 ( 61 n); cDNA 89 150 ( 62 n); score: 0.607 Intron 1 18513 17960 ( 554 n); Pd: 0.845 (s: 0.56), Pa: 0.000 (s: 0.88) Exon 2 17959 17396 ( 564 n); cDNA 151 712 ( 562 n); score: 0.840 PPA cDNA 16 1 MATCH C06HBa0153O03.1-1- SGN-E550464- 0.817 625 0.877 C PGS_C06HBa0153O03.1-1-_SGN-E550464- (18574 18514,17959 17396) Alignment (genomic DNA sequence = upper lines): AAAATCTTTA TGATAAATA- T-ATCTAAAA AATAAAATAA AAAATTTATT ATATGTATAA 18517 ||||||||| | |||| | | |||| |||||||| || |||| || ||| | AAAATCTTTG TTTCTAATAG TGTTATAAAT CGTAAAATAA TAATAATATT -TAAGTAGGA 147 AAAGCAAAAA TAAAAATAAA ATTTCTCAAA GTTCTTATTC TTTGTATTAA AATAATAAGA 18457 | | AGA....... .......... .......... .......... .......... .......... 150 CAATAAAAAT CTTAAGATTC TTATTCTTCA TTTTTGCGCA AAAAAATCTT TATTTTATTT 18397 .......... .......... .......... .......... .......... .......... 150 TATGTTTTAT ACATATTATT TAATATTTTA ATTTGTGAGA AATTTTTTTA AGTTATTTGG 18337 .......... .......... .......... .......... .......... .......... 150 ATTAAATTTT TAAATTATAT TGAGAAAATG CACAAGTATT CCCTCAAACT ATGTCTGAAA 18277 .......... .......... .......... .......... .......... .......... 150 TCCCAGAGAC ACACTTATAC TATATTAAGG TCATATTACC CCCTGAACTT ATTTTATAAG 18217 .......... .......... .......... .......... .......... .......... 150 TAATTTTCTA CCCCTTTTGA CCTACGTGGC TCTAGCTTGA AAAAAAAGTC AATCAGCGTT 18157 .......... .......... .......... .......... .......... .......... 150 GGACCCACAA GATAGTGCCA CATAGACCGA AAAGGGCTAG AAAATTATTA ATAAAATAAG 18097 .......... .......... .......... .......... .......... .......... 150 TTCAGGGATA ATAGGACCTT AGTATAGTGT AAGTATGACT TTAAAATTTC AGGCATAAAT 18037 .......... .......... .......... .......... .......... .......... 150 TGAGAGGGTA CTTGTGCATT ATCTCAATAA TATTCAAATC TTTACATTAA TATCTAATTT 17977 .......... .......... .......... .......... .......... .......... 150 GATGTAATAT TTTAATAATA ATAATGTAAC GACCTATTTA GTCGTTTTGA GCAGCAGATT 17917 | | |||||| | ||||| |||| |||||||||| | |||||||| .......... .......ACA TAAATGTANC GACCTGTTTA GTCGTTTTGA GTAGCAGATT 193 TTATTTTTGG AAAAACTGGC TGAGACGACG GATCCCACGA TGGACCGTCA TGGGCACGAT 17857 |||||||||| |||||| || |||||||||| || ||||||| ||||||||| || ||||||| TTATTTTTGG AAAAACAGGT TGAGACGACG GAACCCACGA CGGACCGTCA TGAGCACGAT 253 GGACCGTCGA GGGGGTCTCG TTCCAAAATA CATAG-AATT CTGAAATTTG GGTTTTGAAA 17798 |||||||||| || |||||| || ||||| | | ||| |||| |||||||| | | | ||| GGACCGTCGA -GGAGTCTCG TTTCAAAACA CTTAGAAATT CTGAAATTGG GTACTAAAAA 312 TCGACTCTCT GAACTTCGTG ATGAAGTGGC AGGACGGACC GTCACAGGCA TGACGGGCCG 17738 |||||||||| ||||||||| | | | |||| | |||||||| ||||| ||| |||||| ||| TCGACTCTCT GAACTTCGTA ACGGAATGGC ACGACGGACC GTCACGGGCG TGACGGACCG 372 TCACAGTCTC TTCAG-AAAA TTTCAGTCTC TGAACTCTGT GACGGAAGCA GCAGGACGGA 17679 |||||| ||| || | || || |||||| ||||| || ||| | | | |||||||||| TCACAGACTC TTTGGTGGAA ATTGAGTCTC TGAACCTTGC GAC-G-ACCT GCAGGACGGA 430 CCGTCGCAGG CACGACGACC CGTCACAGAC TGCGTAATCC CAGGCTGAGT CGGATTTCTT 17619 |||||||||| ||||||| | | |||||| |||||||||| ||| ||| || |||||||||| CCGTCGCAGG CACGACGGGC CATCACAGGT TGCGTAATCC CAGTCTGGGT CGGATTTCTT 490 TAAATGTTTT AAGGGGGCGT TTTGGACTAT TCCTGCTATA ATTATAAATT TAGTGGGTTA 17559 || | ||||| || ||| ||| |||||||||| |||| || || |||||||| | ||||||||| TACACGTTTT AA-GGGACGT TTTGGACTAT TCCTACTTTA ATTATAAAGT TAGTGGGTTT 549 ATGTTAATAA -TTTAACTAC TTGAGGGTTA AAAGAGATAA CCTTGAATTA GTTAGTGGGT 17500 |||||||||| | ||| ||| || |||||| |||||| ||| |||||| | | ||||||||| ATGTTAATAA GTCTAATTAC CTGGGGGTTA AAAGAGGTAA CCTTGAGTAA ATTAGTGGGT 609 TAAACTCATC ATCTTTCATA CTTAATTATA TGCTAATTAG GGTAAAAGAA AGAAGGTTTG 17440 || | | |||||| || |||||||||| |||||||||| |||||||||| || |||||| TATTAT-TCC ATCTTTTATT CTTAATTATA TGCTAATTAG GGTAAAAGAA GGAGGGTTTG 668 AATAAGAAAA AGAAAAGAAC AGAAAGAGAG GGAGAAACGA TCGA 17396 |||||||||| |||||||||| |||||||||| || | | |||| AATAAGAAAA AGAAAAGAAC AGAAAGAGAG AGAAGGAGAA TCGA 712 hqPGS_C06HBa0153O03.1-1-_SGN-E550464- (17959 17396) ******************************************************************************** EST sequence 27 -strand 713 n (File: SGN-E549941-) 1 TTTTTTTTTT TTTATGAATT AGCTCAATGA AAAATGAGTA AATTTTTTAT ATTTTGATGG 61 CATAATTTTT CATTAATCAT NGGTCGAGAA AATCTTTGTT TCTAATAGTG TTATAAATCG 121 TAAAATAATA ATAATATTTA AGTAGGAAGA ACATAAATGT AACGACCTGT TTAGTCGTTT 181 TGAGTAGCAG ATATTATTTT TGGAAAAACA GGTTGAGACG ACGGAACCCA CGACGGACCG 241 TCATGAGCAC GATGGACCGT CGAGGAGTCT CGTTTCAAAA CACTTAGAAA TTCTGAAATT 301 GGGTACTAAA AATCGACTCT CTGAACTTCG TAACGGAATG GCACGACGGA CCGTCACGGG 361 CGTGACGGAC CGTCACAGAC TCTTTGGTGG AAATTGAGTC TCTGAACCTT GCGACGACCT 421 GCAGGACGGA CCGTCGCAGG CACGACGGGC CATCACAGGT TGCGTAATCC CAGTCTGGGT 481 CGGATTTCTT TACACGTTTT AAGGGACGTT TTGGACTATT CCTACTTTAA TTATAAAGTT 541 AGTGGGTTTA TGTTAATAAG TCTAATTACC TGGGGGTTAA AAGAGGTAAC CTTGAGTAAA 601 TTAGTGGGTT ATTATTCCAT CTTTTATTCT TAATTATATG CTAATTAGGG TAAAAGAAGG 661 AGGGTTTGAA TAAGAAAAAG AAAAGAACAG AAAGAGAGAG AAGGAGAATC GAT Predicted gene structure (within gDNA segment 24369 to 16660): Exon 1 18574 18514 ( 61 n); cDNA 89 150 ( 62 n); score: 0.607 Intron 1 18513 17960 ( 554 n); Pd: 0.845 (s: 0.56), Pa: 0.000 (s: 0.88) Exon 2 17959 17396 ( 564 n); cDNA 151 712 ( 562 n); score: 0.840 PPA cDNA 15 1 MATCH C06HBa0153O03.1-1- SGN-E549941- 0.817 625 0.877 C PGS_C06HBa0153O03.1-1-_SGN-E549941- (18574 18514,17959 17396) Alignment (genomic DNA sequence = upper lines): AAAATCTTTA TGATAAATA- T-ATCTAAAA AATAAAATAA AAAATTTATT ATATGTATAA 18517 ||||||||| | |||| | | |||| |||||||| || |||| || ||| | AAAATCTTTG TTTCTAATAG TGTTATAAAT CGTAAAATAA TAATAATATT -TAAGTAGGA 147 AAAGCAAAAA TAAAAATAAA ATTTCTCAAA GTTCTTATTC TTTGTATTAA AATAATAAGA 18457 | | AGA....... .......... .......... .......... .......... .......... 150 CAATAAAAAT CTTAAGATTC TTATTCTTCA TTTTTGCGCA AAAAAATCTT TATTTTATTT 18397 .......... .......... .......... .......... .......... .......... 150 TATGTTTTAT ACATATTATT TAATATTTTA ATTTGTGAGA AATTTTTTTA AGTTATTTGG 18337 .......... .......... .......... .......... .......... .......... 150 ATTAAATTTT TAAATTATAT TGAGAAAATG CACAAGTATT CCCTCAAACT ATGTCTGAAA 18277 .......... .......... .......... .......... .......... .......... 150 TCCCAGAGAC ACACTTATAC TATATTAAGG TCATATTACC CCCTGAACTT ATTTTATAAG 18217 .......... .......... .......... .......... .......... .......... 150 TAATTTTCTA CCCCTTTTGA CCTACGTGGC TCTAGCTTGA AAAAAAAGTC AATCAGCGTT 18157 .......... .......... .......... .......... .......... .......... 150 GGACCCACAA GATAGTGCCA CATAGACCGA AAAGGGCTAG AAAATTATTA ATAAAATAAG 18097 .......... .......... .......... .......... .......... .......... 150 TTCAGGGATA ATAGGACCTT AGTATAGTGT AAGTATGACT TTAAAATTTC AGGCATAAAT 18037 .......... .......... .......... .......... .......... .......... 150 TGAGAGGGTA CTTGTGCATT ATCTCAATAA TATTCAAATC TTTACATTAA TATCTAATTT 17977 .......... .......... .......... .......... .......... .......... 150 GATGTAATAT TTTAATAATA ATAATGTAAC GACCTATTTA GTCGTTTTGA GCAGCAGATT 17917 | | |||||||| ||||| |||| |||||||||| | ||||||| .......... .......ACA TAAATGTAAC GACCTGTTTA GTCGTTTTGA GTAGCAGATA 193 TTATTTTTGG AAAAACTGGC TGAGACGACG GATCCCACGA TGGACCGTCA TGGGCACGAT 17857 |||||||||| |||||| || |||||||||| || ||||||| ||||||||| || ||||||| TTATTTTTGG AAAAACAGGT TGAGACGACG GAACCCACGA CGGACCGTCA TGAGCACGAT 253 GGACCGTCGA GGGGGTCTCG TTCCAAAATA CATAG-AATT CTGAAATTTG GGTTTTGAAA 17798 |||||||||| || |||||| || ||||| | | ||| |||| |||||||| | | | ||| GGACCGTCGA -GGAGTCTCG TTTCAAAACA CTTAGAAATT CTGAAATTGG GTACTAAAAA 312 TCGACTCTCT GAACTTCGTG ATGAAGTGGC AGGACGGACC GTCACAGGCA TGACGGGCCG 17738 |||||||||| ||||||||| | | | |||| | |||||||| ||||| ||| |||||| ||| TCGACTCTCT GAACTTCGTA ACGGAATGGC ACGACGGACC GTCACGGGCG TGACGGACCG 372 TCACAGTCTC TTCAG-AAAA TTTCAGTCTC TGAACTCTGT GACGGAAGCA GCAGGACGGA 17679 |||||| ||| || | || || |||||| ||||| || ||| | | | |||||||||| TCACAGACTC TTTGGTGGAA ATTGAGTCTC TGAACCTTGC GAC-G-ACCT GCAGGACGGA 430 CCGTCGCAGG CACGACGACC CGTCACAGAC TGCGTAATCC CAGGCTGAGT CGGATTTCTT 17619 |||||||||| ||||||| | | |||||| |||||||||| ||| ||| || |||||||||| CCGTCGCAGG CACGACGGGC CATCACAGGT TGCGTAATCC CAGTCTGGGT CGGATTTCTT 490 TAAATGTTTT AAGGGGGCGT TTTGGACTAT TCCTGCTATA ATTATAAATT TAGTGGGTTA 17559 || | ||||| || ||| ||| |||||||||| |||| || || |||||||| | ||||||||| TACACGTTTT AA-GGGACGT TTTGGACTAT TCCTACTTTA ATTATAAAGT TAGTGGGTTT 549 ATGTTAATAA -TTTAACTAC TTGAGGGTTA AAAGAGATAA CCTTGAATTA GTTAGTGGGT 17500 |||||||||| | ||| ||| || |||||| |||||| ||| |||||| | | ||||||||| ATGTTAATAA GTCTAATTAC CTGGGGGTTA AAAGAGGTAA CCTTGAGTAA ATTAGTGGGT 609 TAAACTCATC ATCTTTCATA CTTAATTATA TGCTAATTAG GGTAAAAGAA AGAAGGTTTG 17440 || | | |||||| || |||||||||| |||||||||| |||||||||| || |||||| TATTAT-TCC ATCTTTTATT CTTAATTATA TGCTAATTAG GGTAAAAGAA GGAGGGTTTG 668 AATAAGAAAA AGAAAAGAAC AGAAAGAGAG GGAGAAACGA TCGA 17396 |||||||||| |||||||||| |||||||||| || | | |||| AATAAGAAAA AGAAAAGAAC AGAAAGAGAG AGAAGGAGAA TCGA 712 hqPGS_C06HBa0153O03.1-1-_SGN-E549941- (17959 17396) ******************************************************************************** EST sequence 57 -strand 711 n (File: SGN-E396039-) 1 TTTTTTTTTT TGAATTAGCT CCAAGAAAAA ATGAGTAAAT TTTTTTTATT TTTATGGCAT 61 AATTTTTTCA TTAATTCATG GTGGAGAAAA TCTTTGTTTT TAATAGTGTT ATAAATCGTA 121 AAATAATAAT AATATTTAAG TAGGAAGAAC ATAAATGTAA CGACCTGTTT AGTCGTTTTG 181 AGTAGCAGAT TTTATTTTTG GAAAAACAGG TTGAGACGAC GGAACCCACG ACGGACCGTC 241 ATGAGCACGA TGGACCGTCG AGGAGTCTCG TTTCAAAACA CTTAGAAATT CTGAAATTGG 301 GTACTAAAAA TCGACTCTCT GAACTTCGTA ACGGAATGGC ACGACGGACC GTCACGGGCG 361 TGACGGACCG TCACAGACTC TTTGGTGGAA ATTGAGTCTC TGAACCTTGC GACGACCTGC 421 AGGACGGACC GTCGCAGGCA CGACGGGCCA TCACAGGTTG CGTAATCCCA GTCTGGGTCG 481 GATTTCTTTA CACGTTTTAA GGGACGTTTT GGACTATTCC TACTTTAATT ATAAAGTTAG 541 TGGGTTTATG TTAATAAGTC TAATTACCTG GGGGTTAAAA GAGGTAACCT TGAGTAAATT 601 AGTGGGTTAT TATTCCATCT TTTATTCTTA ATTATATGCT AATTAGGGTA AAAGAAGGAG 661 GGTTTGAATA AGAAAAAGAA AAGAACAGAA AGAGAGAGAA GGAGAATCGA T Predicted gene structure (within gDNA segment 24349 to 16660): Exon 1 18574 18514 ( 61 n); cDNA 87 148 ( 62 n); score: 0.623 Intron 1 18513 17960 ( 554 n); Pd: 0.845 (s: 0.58), Pa: 0.000 (s: 0.90) Exon 2 17959 17396 ( 564 n); cDNA 149 710 ( 562 n); score: 0.841 PPA cDNA 50 40 MATCH C06HBa0153O03.1-1- SGN-E396039- 0.820 625 0.879 C PGS_C06HBa0153O03.1-1-_SGN-E396039- (18574 18514,17959 17396) Alignment (genomic DNA sequence = upper lines): AAAATCTTTA TGATAAATA- T-ATCTAAAA AATAAAATAA AAAATTTATT ATATGTATAA 18517 ||||||||| | | |||| | | |||| |||||||| || |||| || ||| | AAAATCTTTG TTTTTAATAG TGTTATAAAT CGTAAAATAA TAATAATATT -TAAGTAGGA 145 AAAGCAAAAA TAAAAATAAA ATTTCTCAAA GTTCTTATTC TTTGTATTAA AATAATAAGA 18457 | | AGA....... .......... .......... .......... .......... .......... 148 CAATAAAAAT CTTAAGATTC TTATTCTTCA TTTTTGCGCA AAAAAATCTT TATTTTATTT 18397 .......... .......... .......... .......... .......... .......... 148 TATGTTTTAT ACATATTATT TAATATTTTA ATTTGTGAGA AATTTTTTTA AGTTATTTGG 18337 .......... .......... .......... .......... .......... .......... 148 ATTAAATTTT TAAATTATAT TGAGAAAATG CACAAGTATT CCCTCAAACT ATGTCTGAAA 18277 .......... .......... .......... .......... .......... .......... 148 TCCCAGAGAC ACACTTATAC TATATTAAGG TCATATTACC CCCTGAACTT ATTTTATAAG 18217 .......... .......... .......... .......... .......... .......... 148 TAATTTTCTA CCCCTTTTGA CCTACGTGGC TCTAGCTTGA AAAAAAAGTC AATCAGCGTT 18157 .......... .......... .......... .......... .......... .......... 148 GGACCCACAA GATAGTGCCA CATAGACCGA AAAGGGCTAG AAAATTATTA ATAAAATAAG 18097 .......... .......... .......... .......... .......... .......... 148 TTCAGGGATA ATAGGACCTT AGTATAGTGT AAGTATGACT TTAAAATTTC AGGCATAAAT 18037 .......... .......... .......... .......... .......... .......... 148 TGAGAGGGTA CTTGTGCATT ATCTCAATAA TATTCAAATC TTTACATTAA TATCTAATTT 17977 .......... .......... .......... .......... .......... .......... 148 GATGTAATAT TTTAATAATA ATAATGTAAC GACCTATTTA GTCGTTTTGA GCAGCAGATT 17917 | | |||||||| ||||| |||| |||||||||| | |||||||| .......... .......ACA TAAATGTAAC GACCTGTTTA GTCGTTTTGA GTAGCAGATT 191 TTATTTTTGG AAAAACTGGC TGAGACGACG GATCCCACGA TGGACCGTCA TGGGCACGAT 17857 |||||||||| |||||| || |||||||||| || ||||||| ||||||||| || ||||||| TTATTTTTGG AAAAACAGGT TGAGACGACG GAACCCACGA CGGACCGTCA TGAGCACGAT 251 GGACCGTCGA GGGGGTCTCG TTCCAAAATA CATAG-AATT CTGAAATTTG GGTTTTGAAA 17798 |||||||||| || |||||| || ||||| | | ||| |||| |||||||| | | | ||| GGACCGTCGA -GGAGTCTCG TTTCAAAACA CTTAGAAATT CTGAAATTGG GTACTAAAAA 310 TCGACTCTCT GAACTTCGTG ATGAAGTGGC AGGACGGACC GTCACAGGCA TGACGGGCCG 17738 |||||||||| ||||||||| | | | |||| | |||||||| ||||| ||| |||||| ||| TCGACTCTCT GAACTTCGTA ACGGAATGGC ACGACGGACC GTCACGGGCG TGACGGACCG 370 TCACAGTCTC TTCAG-AAAA TTTCAGTCTC TGAACTCTGT GACGGAAGCA GCAGGACGGA 17679 |||||| ||| || | || || |||||| ||||| || ||| | | | |||||||||| TCACAGACTC TTTGGTGGAA ATTGAGTCTC TGAACCTTGC GAC-G-ACCT GCAGGACGGA 428 CCGTCGCAGG CACGACGACC CGTCACAGAC TGCGTAATCC CAGGCTGAGT CGGATTTCTT 17619 |||||||||| ||||||| | | |||||| |||||||||| ||| ||| || |||||||||| CCGTCGCAGG CACGACGGGC CATCACAGGT TGCGTAATCC CAGTCTGGGT CGGATTTCTT 488 TAAATGTTTT AAGGGGGCGT TTTGGACTAT TCCTGCTATA ATTATAAATT TAGTGGGTTA 17559 || | ||||| || ||| ||| |||||||||| |||| || || |||||||| | ||||||||| TACACGTTTT AA-GGGACGT TTTGGACTAT TCCTACTTTA ATTATAAAGT TAGTGGGTTT 547 ATGTTAATAA -TTTAACTAC TTGAGGGTTA AAAGAGATAA CCTTGAATTA GTTAGTGGGT 17500 |||||||||| | ||| ||| || |||||| |||||| ||| |||||| | | ||||||||| ATGTTAATAA GTCTAATTAC CTGGGGGTTA AAAGAGGTAA CCTTGAGTAA ATTAGTGGGT 607 TAAACTCATC ATCTTTCATA CTTAATTATA TGCTAATTAG GGTAAAAGAA AGAAGGTTTG 17440 || | | |||||| || |||||||||| |||||||||| |||||||||| || |||||| TATTAT-TCC ATCTTTTATT CTTAATTATA TGCTAATTAG GGTAAAAGAA GGAGGGTTTG 666 AATAAGAAAA AGAAAAGAAC AGAAAGAGAG GGAGAAACGA TCGA 17396 |||||||||| |||||||||| |||||||||| || | | |||| AATAAGAAAA AGAAAAGAAC AGAAAGAGAG AGAAGGAGAA TCGA 710 hqPGS_C06HBa0153O03.1-1-_SGN-E396039- (17959 17396) ******************************************************************************** EST sequence 74 -strand 690 n (File: SGN-E377133-) 1 CTCAATGAAA AATGAGTAAA TTTTTTATAT TTTATGGCAT AATTTTTTCA TTAATTCATG 61 GTTGAGAAAA TCTTTGTTTC TAATAGTGTT ATAAATCGTA AAATAATAAT AATATTTAAG 121 TAGGAAGAAC ATAAATGTAA CGACCTGTTT AGTCGTTTTG AGTAGCAGAT TTTATTTTTG 181 GAAAAACAGG TTGAGACGAC GGAACCCACG ACGGACCGTC ATGAGCACGA TGGACCGTCG 241 AGGAGTCTCG TTTCAAAACA CTTAGAAATT CTGAAATTGG GTACTAAAAA TCGACTCTCT 301 GAACTTCGTA ACGGAATGGC ACGACGGACC GTCACGGGCG TGACGGACCG TCACAGACTC 361 TTTGGTGGAA ATTGAGTCTC TGAACCTTGC GACGACCTGC AGGACGGACC GTCGCAGGCA 421 CGACGGGCCA TCACAGGTTG CGTAATCCCA GTCTGGGTCG GATTTCTTTA CACGTTTTAA 481 GGGACGTTTT GGACTATTCC TACTTTAATT ATAAAGTTAG TGGGTTTATG TTAATAAGTC 541 TAATTACCTG GGGGTTAAAA GAGGTAACCT TGAGTAAATT AGTGGGTTAT TATTCCATCT 601 TTTATTCTTA ATTATATGCT AATTAGGGTA AAAGAAGGAG GGTTTGAATA AGAAAAAGAA 661 AAGAACAGAA AGAGAGAGAA GGAGAATCGA Predicted gene structure (within gDNA segment 24149 to 16670): Exon 1 18551 18514 ( 38 n); cDNA 92 128 ( 37 n); score: 0.684 Intron 1 18513 17960 ( 554 n); Pd: 0.845 (s: 0), Pa: 0.000 (s: 0.90) Exon 2 17959 17396 ( 564 n); cDNA 129 690 ( 562 n); score: 0.841 MATCH C06HBa0153O03.1-1- SGN-E377133- 0.841 602 0.872 C PGS_C06HBa0153O03.1-1-_SGN-E377133- (18551 18514,17959 17396) Alignment (genomic DNA sequence = upper lines): TAAAAAATAA AATAAAAAAT TTATTATATG TATAAAAAGC AAAAATAAAA ATAAAATTTC 18492 |||| ||| ||||| || |||| || | || || | TAAATCGTAA AATAATAATA ATATT-TAAG TAGGAAGA.. .......... .......... 128 TCAAAGTTCT TATTCTTTGT ATTAAAATAA TAAGACAATA AAAATCTTAA GATTCTTATT 18432 .......... .......... .......... .......... .......... .......... 128 CTTCATTTTT GCGCAAAAAA ATCTTTATTT TATTTTATGT TTTATACATA TTATTTAATA 18372 .......... .......... .......... .......... .......... .......... 128 TTTTAATTTG TGAGAAATTT TTTTAAGTTA TTTGGATTAA ATTTTTAAAT TATATTGAGA 18312 .......... .......... .......... .......... .......... .......... 128 AAATGCACAA GTATTCCCTC AAACTATGTC TGAAATCCCA GAGACACACT TATACTATAT 18252 .......... .......... .......... .......... .......... .......... 128 TAAGGTCATA TTACCCCCTG AACTTATTTT ATAAGTAATT TTCTACCCCT TTTGACCTAC 18192 .......... .......... .......... .......... .......... .......... 128 GTGGCTCTAG CTTGAAAAAA AAGTCAATCA GCGTTGGACC CACAAGATAG TGCCACATAG 18132 .......... .......... .......... .......... .......... .......... 128 ACCGAAAAGG GCTAGAAAAT TATTAATAAA ATAAGTTCAG GGATAATAGG ACCTTAGTAT 18072 .......... .......... .......... .......... .......... .......... 128 AGTGTAAGTA TGACTTTAAA ATTTCAGGCA TAAATTGAGA GGGTACTTGT GCATTATCTC 18012 .......... .......... .......... .......... .......... .......... 128 AATAATATTC AAATCTTTAC ATTAATATCT AATTTGATGT AATATTTTAA TAATAATAAT 17952 | | ||| .......... .......... .......... .......... .......... ..ACATAAAT 136 GTAACGACCT ATTTAGTCGT TTTGAGCAGC AGATTTTATT TTTGGAAAAA CTGGCTGAGA 17892 |||||||||| ||||||||| |||||| ||| |||||||||| |||||||||| | || ||||| GTAACGACCT GTTTAGTCGT TTTGAGTAGC AGATTTTATT TTTGGAAAAA CAGGTTGAGA 196 CGACGGATCC CACGATGGAC CGTCATGGGC ACGATGGACC GTCGAGGGGG TCTCGTTCCA 17832 ||||||| || ||||| |||| ||||||| || |||||||||| ||||| || | ||||||| || CGACGGAACC CACGACGGAC CGTCATGAGC ACGATGGACC GTCGA-GGAG TCTCGTTTCA 255 AAATACATAG -AATTCTGAA ATTTGGGTTT TGAAATCGAC TCTCTGAACT TCGTGATGAA 17773 ||| || ||| ||||||||| ||| || | |||||||| |||||||||| |||| | | | AAACACTTAG AAATTCTGAA ATTGGGTACT AAAAATCGAC TCTCTGAACT TCGTAACGGA 315 GTGGCAGGAC GGACCGTCAC AGGCATGACG GGCCGTCACA GTCTCTTCAG -AAAATTTCA 17714 ||||| ||| |||||||||| ||| ||||| | |||||||| | ||||| | || || | ATGGCACGAC GGACCGTCAC GGGCGTGACG GACCGTCACA GACTCTTTGG TGGAAATTGA 375 GTCTCTGAAC TCTGTGACGG AAGCAGCAGG ACGGACCGTC GCAGGCACGA CGACCCGTCA 17654 |||||||||| || ||| | | | ||||| |||||||||| |||||||||| || || ||| GTCTCTGAAC CTTGCGAC-G -ACCTGCAGG ACGGACCGTC GCAGGCACGA CGGGCCATCA 433 CAGACTGCGT AATCCCAGGC TGAGTCGGAT TTCTTTAAAT GTTTTAAGGG GGCGTTTTGG 17594 ||| ||||| |||||||| | || ||||||| ||||||| | ||||||| || | |||||||| CAGGTTGCGT AATCCCAGTC TGGGTCGGAT TTCTTTACAC GTTTTAA-GG GACGTTTTGG 492 ACTATTCCTG CTATAATTAT AAATTTAGTG GGTTAATGTT AATAA-TTTA ACTACTTGAG 17535 ||||||||| || ||||||| ||| |||||| |||| ||||| ||||| | || | ||| || | ACTATTCCTA CTTTAATTAT AAAGTTAGTG GGTTTATGTT AATAAGTCTA ATTACCTGGG 552 GGTTAAAAGA GATAACCTTG AATTAGTTAG TGGGTTAAAC TCATCATCTT TCATACTTAA 17475 |||||||||| | |||||||| | | | |||| ||||||| | |||||| | || ||||| GGTTAAAAGA GGTAACCTTG AGTAAATTAG TGGGTTATTA T-TCCATCTT TTATTCTTAA 611 TTATATGCTA ATTAGGGTAA AAGAAAGAAG GTTTGAATAA GAAAAAGAAA AGAACAGAAA 17415 |||||||||| |||||||||| ||||| || | |||||||||| |||||||||| |||||||||| TTATATGCTA ATTAGGGTAA AAGAAGGAGG GTTTGAATAA GAAAAAGAAA AGAACAGAAA 671 GAGAGGGAGA AACGATCGA 17396 ||||| || | ||||| GAGAGAGAAG GAGAATCGA 690 hqPGS_C06HBa0153O03.1-1-_SGN-E377133- (17959 17396) ******************************************************************************** EST sequence 29 -strand 558 n (File: SGN-E231589-) 1 AAATGTAACG ACCTGTTTAG TCGTTTTGAG TAGCAGATTT TATTTTTGGA AAAACAGGTT 61 GAGACGACGG AACCCACGAC GGACCGTCAT GAGCACGATG GACCGTCGAG GAGTCTCGTT 121 TCAAAACACT TAGAAATTCT GAAATTGGGT ACTAAAAATC GACTCTCTGA ACTTCGTAAC 181 GGAATGGCAC GACGGACCGT CACGGGCGTG ACGGACCGTC ACAGACTCTT TGGTGGAAAT 241 TGAGTCTATG AACCTTGCGA CGACCTGCAG GACGGACCGT CGCAGGCACG ACGGGCCATC 301 ACAGGTTGCG TAATCCCAGT CTGGGTCGGA TTTCTTTACA CGTTTTAAGG GACGTTTTGG 361 ACTATTCCTA CTTTAATTAT AAAGTTAGTG GGTTTATGTT AATAAGTCTA ATTACCTGGG 421 GGTTAAAAGA GGTAACCTTG AGTAAATTAG TGGGTTATTA TTCCATCTTT TATTCTTAAT 481 TATATGCTAA TTAGGGTAAA AGAAGGAGGG TTTGAATAAG AAAAAGAAAA GAACAGAAAG 541 AGAGAGAAGG AGAATCGA Predicted gene structure (within gDNA segment 22829 to 16670): Exon 1 17954 17396 ( 559 n); cDNA 2 558 ( 557 n); score: 0.843 MATCH C06HBa0153O03.1-1- SGN-E231589- 0.843 559 1.002 C PGS_C06HBa0153O03.1-1-_SGN-E231589- (17954 17396) Alignment (genomic DNA sequence = upper lines): AATGTAACGA CCTATTTAGT CGTTTTGAGC AGCAGATTTT ATTTTTGGAA AAACTGGCTG 17895 |||||||||| ||| |||||| ||||||||| |||||||||| |||||||||| |||| || || AATGTAACGA CCTGTTTAGT CGTTTTGAGT AGCAGATTTT ATTTTTGGAA AAACAGGTTG 61 AGACGACGGA TCCCACGATG GACCGTCATG GGCACGATGG ACCGTCGAGG GGGTCTCGTT 17835 |||||||||| ||||||| | |||||||||| ||||||||| |||||||| | | |||||||| AGACGACGGA ACCCACGACG GACCGTCATG AGCACGATGG ACCGTCGA-G GAGTCTCGTT 120 CCAAAATACA TAG-AATTCT GAAATTTGGG TTTTGAAATC GACTCTCTGA ACTTCGTGAT 17776 ||||| || ||| |||||| |||||| || | ||||| |||||||||| ||||||| | TCAAAACACT TAGAAATTCT GAAATTGGGT ACTAAAAATC GACTCTCTGA ACTTCGTAAC 180 GAAGTGGCAG GACGGACCGT CACAGGCATG ACGGGCCGTC ACAGTCTCTT CAG-AAAATT 17717 | | ||||| |||||||||| ||| ||| || |||| ||||| |||| ||||| | || | GGAATGGCAC GACGGACCGT CACGGGCGTG ACGGACCGTC ACAGACTCTT TGGTGGAAAT 240 TCAGTCTCTG AACTCTGTGA CGGAAGCAGC AGGACGGACC GTCGCAGGCA CGACGACCCG 17657 | ||||| || ||| || || | | | | || |||||||||| |||||||||| ||||| || TGAGTCTATG AACCTTGCGA C-G-ACCTGC AGGACGGACC GTCGCAGGCA CGACGGGCCA 298 TCACAGACTG CGTAATCCCA GGCTGAGTCG GATTTCTTTA AATGTTTTAA GGGGGCGTTT 17597 |||||| || |||||||||| | ||| |||| |||||||||| | ||||||| ||| ||||| TCACAGGTTG CGTAATCCCA GTCTGGGTCG GATTTCTTTA CACGTTTTAA -GGGACGTTT 357 TGGACTATTC CTGCTATAAT TATAAATTTA GTGGGTTAAT GTTAATAA-T TTAACTACTT 17538 |||||||||| || || |||| |||||| ||| ||||||| || |||||||| | ||| ||| | TGGACTATTC CTACTTTAAT TATAAAGTTA GTGGGTTTAT GTTAATAAGT CTAATTACCT 417 GAGGGTTAAA AGAGATAACC TTGAATTAGT TAGTGGGTTA AACTCATCAT CTTTCATACT 17478 | |||||||| |||| ||||| |||| | | | |||||||||| | ||| |||| || || GGGGGTTAAA AGAGGTAACC TTGAGTAAAT TAGTGGGTTA TTAT-TCCAT CTTTTATTCT 476 TAATTATATG CTAATTAGGG TAAAAGAAAG AAGGTTTGAA TAAGAAAAAG AAAAGAACAG 17418 |||||||||| |||||||||| |||||||| | | |||||||| |||||||||| |||||||||| TAATTATATG CTAATTAGGG TAAAAGAAGG AGGGTTTGAA TAAGAAAAAG AAAAGAACAG 536 AAAGAGAGGG AGAAACGATC GA 17396 |||||||| | | | ||| || AAAGAGAGAG AAGGAGAATC GA 558 hqPGS_C06HBa0153O03.1-1-_SGN-E231589- (17954 17396) ******************************************************************************** EST sequence 58 -strand 618 n (File: SGN-E396054-) 1 TTGTTTCTAA TAGTGTTATA AATCGTAAAA TAATAATAAT ATTTAAGTAG GAAGAACATA 61 AATGTAACGA CCTGTTTAGT CGTTTTGAGT AGCAGATTTT ATTTTTGGAA AAACAGGTTG 121 AGACGACGGA ACCCACGACG GACCGTCATG AGCACGATGG ACCGTCGAGG AGTCTCGTTT 181 CAAAACACTT AGAAATTCTG AAATTGGGTA CTAAAAATCG ACTCTCTGAA CTTCGTAACG 241 GAATGGCACG ACGGACCGTC ACGGGCGTGA CGGACCGTCA CAGACTCTTT GGTGGAAATT 301 GAGTCTCTGA ACCTTGCGAC GACCTGCAGG ACGGACCGTC GCAGGCACGA CGGGCCATCA 361 CAGGTTGCGT AATCCCAGTC TGGGTCGGAT TTCTTTACAC GTTTTAAGGG ACGTTTTGGA 421 CTATTCCTAC TTTAATTATA AAGTTAGTGG GTTTATGTTA ATAAGTCTAA TTACCTGGGG 481 GTTAAAAGAG GTAACCTTGA GTAAATTAGT GGGTTATTAT TCCATCTTTT ATTCTTAATT 541 ATATGCTAAT TAGGGTAAAA GAAGGAGGGT TTGAATAAGA AAAAGAAAAG AACAGAAAGA 601 GAGAGAAGGA GAATCGAT Predicted gene structure (within gDNA segment 23419 to 16660): Exon 1 17954 17396 ( 559 n); cDNA 61 617 ( 557 n); score: 0.845 MATCH C06HBa0153O03.1-1- SGN-E396054- 0.845 559 0.905 C PGS_C06HBa0153O03.1-1-_SGN-E396054- (17954 17396) Alignment (genomic DNA sequence = upper lines): AATGTAACGA CCTATTTAGT CGTTTTGAGC AGCAGATTTT ATTTTTGGAA AAACTGGCTG 17895 |||||||||| ||| |||||| ||||||||| |||||||||| |||||||||| |||| || || AATGTAACGA CCTGTTTAGT CGTTTTGAGT AGCAGATTTT ATTTTTGGAA AAACAGGTTG 120 AGACGACGGA TCCCACGATG GACCGTCATG GGCACGATGG ACCGTCGAGG GGGTCTCGTT 17835 |||||||||| ||||||| | |||||||||| ||||||||| |||||||| | | |||||||| AGACGACGGA ACCCACGACG GACCGTCATG AGCACGATGG ACCGTCGA-G GAGTCTCGTT 179 CCAAAATACA TAG-AATTCT GAAATTTGGG TTTTGAAATC GACTCTCTGA ACTTCGTGAT 17776 ||||| || ||| |||||| |||||| || | ||||| |||||||||| ||||||| | TCAAAACACT TAGAAATTCT GAAATTGGGT ACTAAAAATC GACTCTCTGA ACTTCGTAAC 239 GAAGTGGCAG GACGGACCGT CACAGGCATG ACGGGCCGTC ACAGTCTCTT CAG-AAAATT 17717 | | ||||| |||||||||| ||| ||| || |||| ||||| |||| ||||| | || | GGAATGGCAC GACGGACCGT CACGGGCGTG ACGGACCGTC ACAGACTCTT TGGTGGAAAT 299 TCAGTCTCTG AACTCTGTGA CGGAAGCAGC AGGACGGACC GTCGCAGGCA CGACGACCCG 17657 | |||||||| ||| || || | | | | || |||||||||| |||||||||| ||||| || TGAGTCTCTG AACCTTGCGA C-G-ACCTGC AGGACGGACC GTCGCAGGCA CGACGGGCCA 357 TCACAGACTG CGTAATCCCA GGCTGAGTCG GATTTCTTTA AATGTTTTAA GGGGGCGTTT 17597 |||||| || |||||||||| | ||| |||| |||||||||| | ||||||| ||| ||||| TCACAGGTTG CGTAATCCCA GTCTGGGTCG GATTTCTTTA CACGTTTTAA -GGGACGTTT 416 TGGACTATTC CTGCTATAAT TATAAATTTA GTGGGTTAAT GTTAATAA-T TTAACTACTT 17538 |||||||||| || || |||| |||||| ||| ||||||| || |||||||| | ||| ||| | TGGACTATTC CTACTTTAAT TATAAAGTTA GTGGGTTTAT GTTAATAAGT CTAATTACCT 476 GAGGGTTAAA AGAGATAACC TTGAATTAGT TAGTGGGTTA AACTCATCAT CTTTCATACT 17478 | |||||||| |||| ||||| |||| | | | |||||||||| | ||| |||| || || GGGGGTTAAA AGAGGTAACC TTGAGTAAAT TAGTGGGTTA TTAT-TCCAT CTTTTATTCT 535 TAATTATATG CTAATTAGGG TAAAAGAAAG AAGGTTTGAA TAAGAAAAAG AAAAGAACAG 17418 |||||||||| |||||||||| |||||||| | | |||||||| |||||||||| |||||||||| TAATTATATG CTAATTAGGG TAAAAGAAGG AGGGTTTGAA TAAGAAAAAG AAAAGAACAG 595 AAAGAGAGGG AGAAACGATC GA 17396 |||||||| | | | ||| || AAAGAGAGAG AAGGAGAATC GA 617 hqPGS_C06HBa0153O03.1-1-_SGN-E396054- (17954 17396) ******************************************************************************** EST sequence 63 -strand 610 n (File: SGN-E396058-) 1 AATAGTGTTA TAAATCGTAA AATAATAATA ATAGTTAACT ATGAAGAACC TAAATGTAAC 61 GACCTGTTTA GTCGTTGTGA GTAGCAGATT TTATTTTTGG AAACACAGGT TGAGACGACG 121 GAACCCACGA CGGACCGTCA TGAGCACGAT GGACCGTCGA GGAGTCTCGT TTCAAAACAC 181 TTAGAAATTC TGAAATTGGG TACTAAAAAT CGACTCTCTG AACTTCGTAA CGGAATGGCA 241 CGACGGACCG TCACGGGCGT GACGGACCGT CACAGACTCT TTGGTGGAAA TTGAGTCTCT 301 GAACCTTGCG ACGACCTGCA GGACGGACCG TCGCAGGCAC GACGGGCCAT CACAGGTTGC 361 GTAATCCCAG TCTGGGTCGG ATTTCTTTAC ACGTTTTAAG GGACGTTTTG GACTATTCCT 421 ACTTTAATTA TAAAGTTAGT GGGTTTATGT TAATAAGTCT AATTACCTGG GGGTTAAAAG 481 AGGTAACCTT GAGTAAATTA GTGGGTTATT ATTCCATCTT TTATTCTTAA TTATATGCTA 541 ATTAGGGTAA AAGAAGGAGG GTTTGAATAA GAAAAAGAAA AGAACAGAAA GAGAGAGAAG 601 GAGAATCGAT Predicted gene structure (within gDNA segment 23339 to 16660): Exon 1 17954 17396 ( 559 n); cDNA 53 609 ( 557 n); score: 0.842 MATCH C06HBa0153O03.1-1- SGN-E396058- 0.842 559 0.916 C PGS_C06HBa0153O03.1-1-_SGN-E396058- (17954 17396) Alignment (genomic DNA sequence = upper lines): AATGTAACGA CCTATTTAGT CGTTTTGAGC AGCAGATTTT ATTTTTGGAA AAACTGGCTG 17895 |||||||||| ||| |||||| |||| |||| |||||||||| |||||||||| | || || || AATGTAACGA CCTGTTTAGT CGTTGTGAGT AGCAGATTTT ATTTTTGGAA ACACAGGTTG 112 AGACGACGGA TCCCACGATG GACCGTCATG GGCACGATGG ACCGTCGAGG GGGTCTCGTT 17835 |||||||||| ||||||| | |||||||||| ||||||||| |||||||| | | |||||||| AGACGACGGA ACCCACGACG GACCGTCATG AGCACGATGG ACCGTCGA-G GAGTCTCGTT 171 CCAAAATACA TAG-AATTCT GAAATTTGGG TTTTGAAATC GACTCTCTGA ACTTCGTGAT 17776 ||||| || ||| |||||| |||||| || | ||||| |||||||||| ||||||| | TCAAAACACT TAGAAATTCT GAAATTGGGT ACTAAAAATC GACTCTCTGA ACTTCGTAAC 231 GAAGTGGCAG GACGGACCGT CACAGGCATG ACGGGCCGTC ACAGTCTCTT CAG-AAAATT 17717 | | ||||| |||||||||| ||| ||| || |||| ||||| |||| ||||| | || | GGAATGGCAC GACGGACCGT CACGGGCGTG ACGGACCGTC ACAGACTCTT TGGTGGAAAT 291 TCAGTCTCTG AACTCTGTGA CGGAAGCAGC AGGACGGACC GTCGCAGGCA CGACGACCCG 17657 | |||||||| ||| || || | | | | || |||||||||| |||||||||| ||||| || TGAGTCTCTG AACCTTGCGA C-G-ACCTGC AGGACGGACC GTCGCAGGCA CGACGGGCCA 349 TCACAGACTG CGTAATCCCA GGCTGAGTCG GATTTCTTTA AATGTTTTAA GGGGGCGTTT 17597 |||||| || |||||||||| | ||| |||| |||||||||| | ||||||| ||| ||||| TCACAGGTTG CGTAATCCCA GTCTGGGTCG GATTTCTTTA CACGTTTTAA -GGGACGTTT 408 TGGACTATTC CTGCTATAAT TATAAATTTA GTGGGTTAAT GTTAATAA-T TTAACTACTT 17538 |||||||||| || || |||| |||||| ||| ||||||| || |||||||| | ||| ||| | TGGACTATTC CTACTTTAAT TATAAAGTTA GTGGGTTTAT GTTAATAAGT CTAATTACCT 468 GAGGGTTAAA AGAGATAACC TTGAATTAGT TAGTGGGTTA AACTCATCAT CTTTCATACT 17478 | |||||||| |||| ||||| |||| | | | |||||||||| | ||| |||| || || GGGGGTTAAA AGAGGTAACC TTGAGTAAAT TAGTGGGTTA TTAT-TCCAT CTTTTATTCT 527 TAATTATATG CTAATTAGGG TAAAAGAAAG AAGGTTTGAA TAAGAAAAAG AAAAGAACAG 17418 |||||||||| |||||||||| |||||||| | | |||||||| |||||||||| |||||||||| TAATTATATG CTAATTAGGG TAAAAGAAGG AGGGTTTGAA TAAGAAAAAG AAAAGAACAG 587 AAAGAGAGGG AGAAACGATC GA 17396 |||||||| | | | ||| || AAAGAGAGAG AAGGAGAATC GA 609 hqPGS_C06HBa0153O03.1-1-_SGN-E396058- (17954 17396) ******************************************************************************** EST sequence 103 -strand 545 n (File: SGN-E241959-) 1 TGTTTAGTCG TTTTGAGTAG CAGATTTTAT TTTTGGAAAA ACAGGTTGAG ACGACGGAAC 61 CCACGACGGA CCGTCATGAG CACGATGGAC CGTCGAGGAG TCTCGTTTCA AAACACTTAG 121 AAATTCTGAA ATTGGGTACT AAAAATCGAC TCTCTGAACT TCGTAACGGA ATGGCACGAC 181 GGACCGTCAC GGGCGTGACG GACCGTCACA GACTCTTTGG TGGAAATTGA GTCTCTGAAC 241 CTTGCGACGA CCTGCAGGAC GGACCGTCGC AGGCACGACG GGCCATCACA GGTTGCGTAA 301 TCCCAGTCTG GGTCGGATTT CTTTACACGT TTTAAGGGAC GTTTTGGACT ATTCCTACTT 361 TAATTATAAA GTTAGTGGGT TTATGTTAAT AAGTCTAATT ACCTGGGGGT TAAAAGAGGT 421 AACCTTGAGT AAATTAGTGG GTTATTATTC CATCTTTTAT TCTTAATTAT ATGCTAATTA 481 GGGTAAAAGA AGGAGGGTTT GAATAAGAAA AAGAAAAGAA CAGAAAGAGA GAGAAGGAGA 541 ATCGA Predicted gene structure (within gDNA segment 22699 to 16670): Exon 1 17940 17396 ( 545 n); cDNA 3 545 ( 543 n); score: 0.843 MATCH C06HBa0153O03.1-1- SGN-E241959- 0.843 545 1.000 C PGS_C06HBa0153O03.1-1-_SGN-E241959- (17940 17396) Alignment (genomic DNA sequence = upper lines): TTTAGTCGTT TTGAGCAGCA GATTTTATTT TTGGAAAAAC TGGCTGAGAC GACGGATCCC 17881 |||||||||| ||||| |||| |||||||||| |||||||||| || |||||| |||||| ||| TTTAGTCGTT TTGAGTAGCA GATTTTATTT TTGGAAAAAC AGGTTGAGAC GACGGAACCC 62 ACGATGGACC GTCATGGGCA CGATGGACCG TCGAGGGGGT CTCGTTCCAA AATACATAG- 17822 |||| ||||| |||||| ||| |||||||||| |||| || || |||||| ||| || || ||| ACGACGGACC GTCATGAGCA CGATGGACCG TCGA-GGAGT CTCGTTTCAA AACACTTAGA 121 AATTCTGAAA TTTGGGTTTT GAAATCGACT CTCTGAACTT CGTGATGAAG TGGCAGGACG 17762 |||||||||| || || | ||||||||| |||||||||| ||| | | | ||||| |||| AATTCTGAAA TTGGGTACTA AAAATCGACT CTCTGAACTT CGTAACGGAA TGGCACGACG 181 GACCGTCACA GGCATGACGG GCCGTCACAG TCTCTTCAG- AAAATTTCAG TCTCTGAACT 17703 ||||||||| ||| |||||| ||||||||| ||||| | || || || ||||||||| GACCGTCACG GGCGTGACGG ACCGTCACAG ACTCTTTGGT GGAAATTGAG TCTCTGAACC 241 CTGTGACGGA AGCAGCAGGA CGGACCGTCG CAGGCACGAC GACCCGTCAC AGACTGCGTA 17643 || ||| | | | |||||| |||||||||| |||||||||| | || |||| || |||||| TTGCGAC-G- ACCTGCAGGA CGGACCGTCG CAGGCACGAC GGGCCATCAC AGGTTGCGTA 299 ATCCCAGGCT GAGTCGGATT TCTTTAAATG TTTTAAGGGG GCGTTTTGGA CTATTCCTGC 17583 ||||||| || | |||||||| |||||| | | |||||| ||| ||||||||| |||||||| | ATCCCAGTCT GGGTCGGATT TCTTTACACG TTTTAA-GGG ACGTTTTGGA CTATTCCTAC 358 TATAATTATA AATTTAGTGG GTTAATGTTA ATAA-TTTAA CTACTTGAGG GTTAAAAGAG 17524 | |||||||| || ||||||| ||| |||||| |||| | ||| ||| || || |||||||||| TTTAATTATA AAGTTAGTGG GTTTATGTTA ATAAGTCTAA TTACCTGGGG GTTAAAAGAG 418 ATAACCTTGA ATTAGTTAGT GGGTTAAACT CATCATCTTT CATACTTAAT TATATGCTAA 17464 ||||||||| | | ||||| |||||| | ||||||| || |||||| |||||||||| GTAACCTTGA GTAAATTAGT GGGTTATTAT -TCCATCTTT TATTCTTAAT TATATGCTAA 477 TTAGGGTAAA AGAAAGAAGG TTTGAATAAG AAAAAGAAAA GAACAGAAAG AGAGGGAGAA 17404 |||||||||| |||| || || |||||||||| |||||||||| |||||||||| |||| || TTAGGGTAAA AGAAGGAGGG TTTGAATAAG AAAAAGAAAA GAACAGAAAG AGAGAGAAGG 537 ACGATCGA 17396 | ||||| AGAATCGA 545 hqPGS_C06HBa0153O03.1-1-_SGN-E241959- (17940 17396) ******************************************************************************** EST sequence 66 -strand 472 n (File: SGN-E236652-) 1 TCATGAGCAC GATGGACCGT CGAGGAGTCT CGTTTCAAAA CACTTAGAAA TTCTGAAATT 61 GGGTACTAAA AATCGACTCT CTGAACTTCG TAACGGAATG GCACGACGGA CCGTCACGGG 121 CGTGACGGAC CGTCACAGAC TCTTTGGTGG AAATTGAGTC TCTGAACCTT GCGACGACCT 181 GCAGGACGGA CCGTCGCAGG CACGACGGGC CATCACAGGT TGCGTAATCC CAGTCTGGGT 241 CGGATTTCTT TACACGTTTT AAGGGACGTT TTGGACTATT CCTACTTTAA TTATAAAGTT 301 AGTGGGTTTA TGTTAATAAG TCTAATTACC TGGGGGTTAA AAGAGGTAAC CTTGAGTAAA 361 TTAGTGGGTT ATTATTCCAT CTTTTATTCT TAATTATATG CTAATTAGGG TAAAAGAAGG 421 AGGGTTTGAA TAAGAAAAAG AAAAGAACAG AAAGAGAGAG AAGGAGAATC GA Predicted gene structure (within gDNA segment 21969 to 16670): Exon 1 17869 17396 ( 474 n); cDNA 1 472 ( 472 n); score: 0.830 MATCH C06HBa0153O03.1-1- SGN-E236652- 0.830 474 1.004 C PGS_C06HBa0153O03.1-1-_SGN-E236652- (17869 17396) Alignment (genomic DNA sequence = upper lines): TCATGGGCAC GATGGACCGT CGAGGGGGTC TCGTTCCAAA ATACATAG-A ATTCTGAAAT 17811 ||||| |||| |||||||||| ||| || ||| ||||| |||| | || ||| | |||||||||| TCATGAGCAC GATGGACCGT CGA-GGAGTC TCGTTTCAAA ACACTTAGAA ATTCTGAAAT 59 TTGGGTTTTG AAATCGACTC TCTGAACTTC GTGATGAAGT GGCAGGACGG ACCGTCACAG 17751 | || | |||||||||| |||||||||| || | | | | |||| ||||| |||||||| | TGGGTACTAA AAATCGACTC TCTGAACTTC GTAACGGAAT GGCACGACGG ACCGTCACGG 119 GCATGACGGG CCGTCACAGT CTCTTCAG-A AAATTTCAGT CTCTGAACTC TGTGACGGAA 17692 || |||||| ||||||||| ||||| | || || ||| |||||||| || ||| | | GCGTGACGGA CCGTCACAGA CTCTTTGGTG GAAATTGAGT CTCTGAACCT TGCGAC-G-A 177 GCAGCAGGAC GGACCGTCGC AGGCACGACG ACCCGTCACA GACTGCGTAA TCCCAGGCTG 17632 | ||||||| |||||||||| |||||||||| || ||||| | ||||||| |||||| ||| CCTGCAGGAC GGACCGTCGC AGGCACGACG GGCCATCACA GGTTGCGTAA TCCCAGTCTG 237 AGTCGGATTT CTTTAAATGT TTTAAGGGGG CGTTTTGGAC TATTCCTGCT ATAATTATAA 17572 ||||||||| ||||| | || ||||| ||| |||||||||| ||||||| || ||||||||| GGTCGGATTT CTTTACACGT TTTAA-GGGA CGTTTTGGAC TATTCCTACT TTAATTATAA 296 ATTTAGTGGG TTAATGTTAA TAA-TTTAAC TACTTGAGGG TTAAAAGAGA TAACCTTGAA 17513 | |||||||| || ||||||| ||| | ||| ||| || ||| ||||||||| ||||||||| AGTTAGTGGG TTTATGTTAA TAAGTCTAAT TACCTGGGGG TTAAAAGAGG TAACCTTGAG 356 TTAGTTAGTG GGTTAAACTC ATCATCTTTC ATACTTAATT ATATGCTAAT TAGGGTAAAA 17453 | | |||||| ||||| | ||||||| || ||||||| |||||||||| |||||||||| TAAATTAGTG GGTTATTAT- TCCATCTTTT ATTCTTAATT ATATGCTAAT TAGGGTAAAA 415 GAAAGAAGGT TTGAATAAGA AAAAGAAAAG AACAGAAAGA GAGGGAGAAA CGATCGA 17396 ||| || ||| |||||||||| |||||||||| |||||||||| ||| || | ||||| GAAGGAGGGT TTGAATAAGA AAAAGAAAAG AACAGAAAGA GAGAGAAGGA GAATCGA 472 hqPGS_C06HBa0153O03.1-1-_SGN-E236652- (17869 17396) ******************************************************************************** EST sequence 64 -strand 454 n (File: SGN-E396070-) 1 TCGAGGAGTC TAGTTTCAAA ACAATTAGAA ACTCCCCCCC CCGGTACTAA AAATCGACTC 61 TCTGAACTTC GTAACGGAAT GGCACGACGG ACCGTCACGG GCGTGACGGA CCGTCACAGA 121 CTCTTTGGTG GAAATTGAGT CTCTGAACCT TGCGACGACC TGCAGGACGG ACCGTCGCAG 181 GCACGACGGG CCATCACAGG TTGCGTAATC CCAGTCTGGG TCGGATTTCT TTACACGTTT 241 TAAGGGACGT TTTGGACTAT TCCTACTTTA ATTATAAAGT TAGTGGGTTT ATGTTAATAA 301 GTCTAATTAC CTGGGGGTTA AAAGAGGTAA CCTTGAGTAA ATTAGTGGGT TATTATTCCA 361 TCTTTTATTC TTAATTATAT GCTAATTAGG GTAAAAGAAG GAGGGTTTGA ATAAGAAAAA 421 GAAAAGAACA GAAAGAGAGA GAAGGAGAAT CGAT Predicted gene structure (within gDNA segment 21779 to 16660): Exon 1 17850 17396 ( 455 n); cDNA 1 453 ( 453 n); score: 0.803 MATCH C06HBa0153O03.1-1- SGN-E396070- 0.803 455 1.002 C PGS_C06HBa0153O03.1-1-_SGN-E396070- (17850 17396) Alignment (genomic DNA sequence = upper lines): TCGAGGGGGT CTCGTTCCAA AATACATAG- AATTCTGAAA TTTGGGTTTT GAAATCGACT 17792 ||||| | || || ||| ||| || | ||| || || || | ||||||||| TCGAG-GAGT CTAGTTTCAA AACAATTAGA AACTCCCCCC CCCGGTACTA AAAATCGACT 59 CTCTGAACTT CGTGATGAAG TGGCAGGACG GACCGTCACA GGCATGACGG GCCGTCACAG 17732 |||||||||| ||| | | | ||||| |||| ||||||||| ||| |||||| ||||||||| CTCTGAACTT CGTAACGGAA TGGCACGACG GACCGTCACG GGCGTGACGG ACCGTCACAG 119 TCTCTTCAG- AAAATTTCAG TCTCTGAACT CTGTGACGGA AGCAGCAGGA CGGACCGTCG 17673 ||||| | || || || ||||||||| || ||| | | | |||||| |||||||||| ACTCTTTGGT GGAAATTGAG TCTCTGAACC TTGCGAC-G- ACCTGCAGGA CGGACCGTCG 177 CAGGCACGAC GACCCGTCAC AGACTGCGTA ATCCCAGGCT GAGTCGGATT TCTTTAAATG 17613 |||||||||| | || |||| || |||||| ||||||| || | |||||||| |||||| | | CAGGCACGAC GGGCCATCAC AGGTTGCGTA ATCCCAGTCT GGGTCGGATT TCTTTACACG 237 TTTTAAGGGG GCGTTTTGGA CTATTCCTGC TATAATTATA AATTTAGTGG GTTAATGTTA 17553 |||||| ||| ||||||||| |||||||| | | |||||||| || ||||||| ||| |||||| TTTTAA-GGG ACGTTTTGGA CTATTCCTAC TTTAATTATA AAGTTAGTGG GTTTATGTTA 296 ATAA-TTTAA CTACTTGAGG GTTAAAAGAG ATAACCTTGA ATTAGTTAGT GGGTTAAACT 17494 |||| | ||| ||| || || |||||||||| ||||||||| | | ||||| |||||| | ATAAGTCTAA TTACCTGGGG GTTAAAAGAG GTAACCTTGA GTAAATTAGT GGGTTATTAT 356 CATCATCTTT CATACTTAAT TATATGCTAA TTAGGGTAAA AGAAAGAAGG TTTGAATAAG 17434 ||||||| || |||||| |||||||||| |||||||||| |||| || || |||||||||| -TCCATCTTT TATTCTTAAT TATATGCTAA TTAGGGTAAA AGAAGGAGGG TTTGAATAAG 415 AAAAAGAAAA GAACAGAAAG AGAGGGAGAA ACGATCGA 17396 |||||||||| |||||||||| |||| || | ||||| AAAAAGAAAA GAACAGAAAG AGAGAGAAGG AGAATCGA 453 hqPGS_C06HBa0153O03.1-1-_SGN-E396070- (17850 17396) ******************************************************************************** EST sequence 161 +strand 390 n (File: SGN-E242274+) 1 TGAGTCGACG GATCCCACGA CGGACCGTCA TGAGCACGAC GGACCGTGGA GGGTGTCTCG 61 TTCCAAAACA CTTAGAATTC TGAAATTTGG GTACTGAGAT CGACTCTCTG AACTTCGCGA 121 CGAAATGGCA CGACGGACCG TCACAGGCAT GACGGGCTGT CACAAACCCT TAGTGAAATT 181 TAATCTCTGA ACTTTGTGAC GGAAGCAGCA GGACGGACCG TCGCAGGCAC GACTGGTCGT 241 CACAGACTGC GTAACCCTGA CTGGGTCGGA TTTTTGTTAA ATGTTTTAAG GGGCGTTTTG 301 GACTATTCCT GCTTATAATT ATGAAATTAG TGGTTTAATG TTAATAATTC AATTACTTGG 361 GGGGTTGAAG GAGATAACCT TGAATTAATT Predicted gene structure (within gDNA segment 19144 to 15483): Exon 1 17896 17507 ( 390 n); cDNA 1 390 ( 390 n); score: 0.865 MATCH C06HBa0153O03.1-1- SGN-E242274+ 0.865 390 1.000 C PGS_C06HBa0153O03.1-1-_SGN-E242274+ (17896 17507) Alignment (genomic DNA sequence = upper lines): TGAGACGACG GATCCCACGA TGGACCGTCA TGGGCACGAT GGACCGTCGA GGGGGTCTCG 17837 |||| ||||| |||||||||| ||||||||| || |||||| ||||||| || ||| |||||| TGAGTCGACG GATCCCACGA CGGACCGTCA TGAGCACGAC GGACCGTGGA GGGTGTCTCG 60 TTCCAAAATA CATAGAATTC TGAAATTTGG GTTTTGAAAT CGACTCTCTG AACTTCGTGA 17777 |||||||| | | |||||||| |||||||||| || ||| || |||||||||| ||||||| || TTCCAAAACA CTTAGAATTC TGAAATTTGG GTACTGAGAT CGACTCTCTG AACTTCGCGA 120 TGAAGTGGCA GGACGGACCG TCACAGGCAT GACGGGCCGT CACAGTCTCT TCAGAAAATT 17717 ||| ||||| ||||||||| |||||||||| ||||||| || |||| | || | || || | CGAAATGGCA CGACGGACCG TCACAGGCAT GACGGGCTGT CACAAACCCT T-AGTGAAAT 179 TCAGTCTCTG AACTCTGTGA CGGAAGCAGC AGGACGGACC GTCGCAGGCA CGACGACCCG 17657 | | |||||| |||| ||||| |||||||||| |||||||||| |||||||||| |||| || TTAATCTCTG AACTTTGTGA CGGAAGCAGC AGGACGGACC GTCGCAGGCA CGACTGGTCG 239 TCACAGACTG CGTAATCCCA GGCTGAGTCG GATTTCT-TT AAATGTTTTA AGGGGGCGTT 17598 |||||||||| ||||| ||| | ||| |||| ||||| | || |||||||||| | |||||||| TCACAGACTG CGTAA-CCCT GACTGGGTCG GATTTTTGTT AAATGTTTTA A-GGGGCGTT 297 TTGGACTATT CCTGC-TATA ATTATAAATT TAGTGGGTTA ATGTTAATAA TTTAACTACT 17539 |||||||||| ||||| |||| ||||| || | |||||| ||| |||||||||| || || |||| TTGGACTATT CCTGCTTATA ATTATGAAAT TAGTGGTTTA ATGTTAATAA TTCAATTACT 357 T-GAGGGTTA AAAGAGATAA CCTTGAATTA GTT 17507 | | ||||| || ||||||| |||||||||| || TGGGGGGTTG AAGGAGATAA CCTTGAATTA ATT 390 hqPGS_C06HBa0153O03.1-1-_SGN-E242274+ (17896 17507) ******************************************************************************** EST sequence 140 +strand 542 n (File: SGN-E252199+) 1 CTTGATGAGA ATTTGTCTTA TGAGGAGGAG CCTGTTGCCA TTTTAGATAG AGAAGTCCGC 61 AAATTGAGGT CAAGGGAGAT TGCATCTATC AAAGTTCAAT GGAAGAATCG ACCAGTTGAA 121 GAATCCACTT GGGAGAAGGA AGTTGATATG CGAGAAAGAT ACCCATACCT GTTTACAGAT 181 TCAGGTACTC CTTTTCGCCC TTGTTTTTCT TCTTGTGATC GTTCGGGGAC GAACGATGGG 241 TAAATTGGTA TCTATTGTAA CGACCTGTTT AGTCGTTTTG AGTAACAGAT TTTATTTCTG 301 GAAAAACTGA CTGAGACGAC GGATCCCACG ACGGACCGTC GAGGGGGTCT CGTTCCAAAA 361 CACTTAGAAT TCTGAAATTT GGGTACTGAA ATCGACTCTC TGAACTTCGT GACGGAATGG 421 CAGGACGGAC CGTCACAGGC GTGATGGGCC GTCACAGACC CTTGGTAAAA ATCTAGTCTC 481 TGAACTCTGT GACGGACGTG CAGGACGGAC CGTCACAGAC TGCGTAATCC CAGGCTGGGT 541 CG Predicted gene structure (within gDNA segment 22664 to 16119): Exon 1 21798 21791 ( 8 n); cDNA 245 252 ( 8 n); score: 0.625 Intron 1 21790 17956 (3835 n); Pd: 0.520 (s: 0), Pa: 0.000 (s: 0.90) Exon 2 17955 17658 ( 298 n); cDNA 253 530 ( 278 n); score: 0.819 MATCH C06HBa0153O03.1-1- SGN-E252199+ 0.819 306 0.565 C PGS_C06HBa0153O03.1-1-_SGN-E252199+ (21798 21791,17955 17658) Alignment (genomic DNA sequence = upper lines): TTGGGGTGGT AATTCCTCAT CTTGCACCTG TTTATTTTCC CCTTCCTCAC CCTCTCTTAC 21739 |||| | TTGGTATC.. .......... .......... .......... .......... .......... 252 TACCTCATCA GTCGGTGGAG GAGTCACCAC CCTATTACTA GCTTGACCAG GTGTTTGTCC 21679 .......... .......... .......... .......... .......... .......... 252 TCCACCTCTA GAGATCGTCC TCTTGCGACC TCTACCACGA CCTCTTGCCA CTGCTCCTCC 21619 .......... .......... .......... .......... .......... .......... 252 TCGAGCTACA ACCCCAATGT TTGGCTCAGA CGCACGCTAT CTTGCCGGTG TTGGTGTTGG 21559 .......... .......... .......... .......... .......... .......... 252 CACAGTTGTT TCTCTAGTTC TAACCATATG CGAAATAGAG TGAGGATGTC AGATACCAAT 21499 .......... .......... .......... .......... .......... .......... 252 TTGTATCACC TAGATACCAC TTGGATCCAA GTAATAGCAC GAAAGAAGGA AAGAATGGAA 21439 .......... .......... .......... .......... .......... .......... 252 TTTTGCTAAA GTCCTATAGC CTCTCGAAGA AAAGTAAGGG CGTCCCCCTA CCGTTCCTCA 21379 .......... .......... .......... .......... .......... .......... 252 AGACTCTACT AGACTTGTTC TTGTGTGATG AGACCAACGA ACCTAACGCT CTGATACCAA 21319 .......... .......... .......... .......... .......... .......... 252 GTTTGTCACG ACCCAAAACG ATCCGTAAGT GGCACCCACC CTTACTCTCC TAGGTGAGCG 21259 .......... .......... .......... .......... .......... .......... 252 AACCAACAAA TCTAAACCCC AACATTTACC AGTATATCAA CTATAAATAA TATAAATAAT 21199 .......... .......... .......... .......... .......... .......... 252 GCGGAAGCTC CAAAACTCAT TACGAAATTA ATTAAATCAA CATCTAAAGT TAAATACTTA 21139 .......... .......... .......... .......... .......... .......... 252 TTATTCCCAA AATCTGTAAG TCATCACACC AAGAACATCT ATCCTCGAAT TTCTAAATCT 21079 .......... .......... .......... .......... .......... .......... 252 AAGAGTATTC AAGAAGCTAA AAATAGTAAA AAGATGGTCC ATGTCCGAAC TTCAAGACAT 21019 .......... .......... .......... .......... .......... .......... 252 CAAGACGTGA AGGAGAGAAT CCAGCACGAG CTAGGAATAA TAGCTCACCC TGAATTCTGA 20959 .......... .......... .......... .......... .......... .......... 252 TATGCTAAAG ACCGGCTAGA TCTGATGACG AGTCGAAGTC GATGGCACGC TTGCTGCACT 20899 .......... .......... .......... .......... .......... .......... 252 CCACAAATAA CAAAGAAGAA AATTACAAGT AGGGGTCAGT ACAAGGAACA CGTACTGAGT 20839 .......... .......... .......... .......... .......... .......... 252 AGGTATCATC GGCCAACTCA AAATAGAAAA CAATATATAC TGAATAATAA TATAAAATCA 20779 .......... .......... .......... .......... .......... .......... 252 ACCATAATAC TTAACAGGTG ACAATCAACA AGTATAAGAA CCATTGACAA CAACAGCAAG 20719 .......... .......... .......... .......... .......... .......... 252 CACATCTATG AGGACTCAAG CCTCCACACC ATACTCATTT GGGAAATAGG TTCTTTGAAT 20659 .......... .......... .......... .......... .......... .......... 252 TTGAGTACAT TAACATAATT CAAGATTCAT TCTCTTTATC ATTATCGTGT CGGAACGTTA 20599 .......... .......... .......... .......... .......... .......... 252 CACCCGATCC CCTACTACTA CCGTGTCGGA ACGTGACACT CTGATCCCCT AATACTACCG 20539 .......... .......... .......... .......... .......... .......... 252 TGTCAGAACG TGACACCCGA TCCCCTAATA CTACCGTGTC AGAATGTGAC ACTCCGATCC 20479 .......... .......... .......... .......... .......... .......... 252 CCTAATACTA CCGTGTCGAA ACATGACACC CAATCCATTT ATCTCATTAT TTTAGTTCAT 20419 .......... .......... .......... .......... .......... .......... 252 CAAGCCTTCT TTATGTCAAG GCGCCATCTT AATAGAGAGG ATTTAAGATT GAAGATTCAA 20359 .......... .......... .......... .......... .......... .......... 252 CAGTTTCATC ATTCTGACCA CCACAATTAC ACAATCACAA CATACAAACA CACAATCAAG 20299 .......... .......... .......... .......... .......... .......... 252 CATATAGAAG ACTTTACAAT ACCACCCAAT ACATATCGAT CACTATTTAG AGTTTATCTA 20239 .......... .......... .......... .......... .......... .......... 252 TCATATATAA ATAAATCATA ACCTACCTCC ACTGAAGAAT CGTGATCAAG CAAGCTACCT 20179 .......... .......... .......... .......... .......... .......... 252 TCCCAATGCC TTTGCTTTCC TCTTCGTTCT CTCTTTCTCG CTCGTTCTCC CTCTGTGTTT 20119 .......... .......... .......... .......... .......... .......... 252 CTTTTTATTT TTCTTACTCA AAATCTTGTT CTTTTACCCT AAATGTCATA TAATCAATTA 20059 .......... .......... .......... .......... .......... .......... 252 TAAAAGATGA TAAAAGTACC TCACTATTTA TTCCCTTATT AACTTCTTTA ACCCCCAAGT 19999 .......... .......... .......... .......... .......... .......... 252 AAATAAATTA TTAAACTTAC CCCACTAATT CCATAATTAT AATCATGAAT AGTCCAAAAC 19939 .......... .......... .......... .......... .......... .......... 252 ACCCCTTTAA AACTTTTAGC AGAAATCCGA CCCAGTCGAG GTTACGCAGC TTGTGACGGT 19879 .......... .......... .......... .......... .......... .......... 252 CCGTTGTGTC TACGACGGTC CGTGCTGTAG TTCCGTCGCG GAGTTCAGAG AGTCGCTCCC 19819 .......... .......... .......... .......... .......... .......... 252 AGTACCCAGA TTTTCAGAGT TGAAGTGTTT TGGAACGGAG ACGCTCGACG GACCGTCGTG 19759 .......... .......... .......... .......... .......... .......... 252 CCTGTGACGG TTTGTCCTAC CTGCCGTCGA GGGTAATGAG GAGAGCAACA GAAGAAATTA 19699 .......... .......... .......... .......... .......... .......... 252 CACAAGTATG GGACGACGGA GTCCATCACG GTCCATCGTG ACCATGACGG TCCGTCGTGA 19639 .......... .......... .......... .......... .......... .......... 252 CCATGACGGT CCGTCGCGTG ATCCGTCGAC CCAGTCAGTT TTTTATCAAA AATAGTTCTA 19579 .......... .......... .......... .......... .......... .......... 252 CTGCTCGAAC CGACTAAACA GGTCATTACA ATTTTCCTAC TTTAGTTTTC CCTATGGCTA 19519 .......... .......... .......... .......... .......... .......... 252 CCACTGTCCA TCTACTATTT TTTTTCATGA TTTGATCTTT TAAGTAAAAT TATTTGTGGA 19459 .......... .......... .......... .......... .......... .......... 252 ACTTCTCTTT GAAGATATCT CTCACAAATT AGCGAAAAAG TTAGTTAATT TATTTTTATT 19399 .......... .......... .......... .......... .......... .......... 252 TTAAAAAGTA AGATAAATGT TTTTGCGCTA TCAATATTTT ATAATATGTA AATTATTCGA 19339 .......... .......... .......... .......... .......... .......... 252 AACAAACTTT TAATAAATAA AAATTAGAGC AAATATAAAA ATCACTAGAT TTTTTTTTAA 19279 .......... .......... .......... .......... .......... .......... 252 AAAAATGGGG CCTTGAAACG GTATATATTT TTTTATTTGA ATAGATTATG GGGGAGAATT 19219 .......... .......... .......... .......... .......... .......... 252 AATAGAGGTA AGATTTTTTA TTTTATCTAA TAAGAAAATG ACAAATAATA TATTTTTAAA 19159 .......... .......... .......... .......... .......... .......... 252 AAATAAATAA AACAAAATAA ACTTTGGTTG TTAGTCATAA AAATATAAGT TATTCAAAAA 19099 .......... .......... .......... .......... .......... .......... 252 GGTGAATGAA AGAGTATAAG TGAGTCAAAA AGATGAGTGA AGAGGCATAG CTAAGCCAAA 19039 .......... .......... .......... .......... .......... .......... 252 AAAGTGAATG TAAGGGTATT TCTAGACCAA AAGATTGATG AAGGATATTT TTAGACATAG 18979 .......... .......... .......... .......... .......... .......... 252 TTCAAGGATA GTTTTGGTCC TTTTTCGTTT AAATAATCTC ATATTTAGAT ATTGAGTTAT 18919 .......... .......... .......... .......... .......... .......... 252 TTGCAGGGAC GATTCAATAT AATTCGAGGC CTAAATTTTA AATAAACTCT ATCTGTATTT 18859 .......... .......... .......... .......... .......... .......... 252 ATTTATTTTT CTTTTTAGAT GTAAATTGTA TTTATTTATT TTTCTTTTTA GATGTAAGTT 18799 .......... .......... .......... .......... .......... .......... 252 ATTACTTAAG TATCTTTTTT TGTAAATGGA AAAGGGCTAA AAATGCCCTT AACTTAGTGG 18739 .......... .......... .......... .......... .......... .......... 252 AAATGGTTCA AAATACCATC CTTCTACCTT TTGAGTTAAA AATACCCTCC ACCTTTATTT 18679 .......... .......... .......... .......... .......... .......... 252 TGGTTCAAAG ATGCCTTTCC TTCCACCTTT TGATTTAATA ATACTCTTAA CCCCCCATTT 18619 .......... .......... .......... .......... .......... .......... 252 AATTAAATTT ATAAAATAAA AAATTCTTAA TATTAGCTCA TTCCAAAATC TTTATGATAA 18559 .......... .......... .......... .......... .......... .......... 252 ATATATCTAA AAAATAAAAT AAAAAATTTA TTATATGTAT AAAAAGCAAA AATAAAAATA 18499 .......... .......... .......... .......... .......... .......... 252 AAATTTCTCA AAGTTCTTAT TCTTTGTATT AAAATAATAA GACAATAAAA ATCTTAAGAT 18439 .......... .......... .......... .......... .......... .......... 252 TCTTATTCTT CATTTTTGCG CAAAAAAATC TTTATTTTAT TTTATGTTTT ATACATATTA 18379 .......... .......... .......... .......... .......... .......... 252 TTTAATATTT TAATTTGTGA GAAATTTTTT TAAGTTATTT GGATTAAATT TTTAAATTAT 18319 .......... .......... .......... .......... .......... .......... 252 ATTGAGAAAA TGCACAAGTA TTCCCTCAAA CTATGTCTGA AATCCCAGAG ACACACTTAT 18259 .......... .......... .......... .......... .......... .......... 252 ACTATATTAA GGTCATATTA CCCCCTGAAC TTATTTTATA AGTAATTTTC TACCCCTTTT 18199 .......... .......... .......... .......... .......... .......... 252 GACCTACGTG GCTCTAGCTT GAAAAAAAAG TCAATCAGCG TTGGACCCAC AAGATAGTGC 18139 .......... .......... .......... .......... .......... .......... 252 CACATAGACC GAAAAGGGCT AGAAAATTAT TAATAAAATA AGTTCAGGGA TAATAGGACC 18079 .......... .......... .......... .......... .......... .......... 252 TTAGTATAGT GTAAGTATGA CTTTAAAATT TCAGGCATAA ATTGAGAGGG TACTTGTGCA 18019 .......... .......... .......... .......... .......... .......... 252 TTATCTCAAT AATATTCAAA TCTTTACATT AATATCTAAT TTGATGTAAT ATTTTAATAA 17959 .......... .......... .......... .......... .......... .......... 252 TAATAATGTA ACGACCTATT TAGTCGTTTT GAGCAGCAGA TTTTATTTTT GGAAAAACTG 17899 || |||| ||||||| || |||||||||| ||| | |||| |||||||| | |||||||||| ...TATTGTA ACGACCTGTT TAGTCGTTTT GAGTAACAGA TTTTATTTCT GGAAAAACTG 309 GCTGAGACGA CGGATCCCAC GATGGACCGT CATGGGCACG ATGGACCGTC GAGGGGGTCT 17839 ||||||||| |||||||||| || || ||||||| |||||||||| ACTGAGACGA CGGATCCCAC ----GA-CG- ---------- ---GACCGTC GAGGGGGTCT 350 CGTTCCAAAA TACATAGAAT TCTGAAATTT GGGTTTTGAA ATCGACTCTC TGAACTTCGT 17779 |||||||||| || |||||| |||||||||| |||| |||| |||||||||| |||||||||| CGTTCCAAAA CACTTAGAAT TCTGAAATTT GGGTACTGAA ATCGACTCTC TGAACTTCGT 410 GATGAAGTGG CAGGACGGAC CGTCACAGGC ATGACGGGCC GTCACAGTCT CTTCAGAAAA 17719 || | | ||| |||||||||| |||||||||| ||| ||||| ||||||| | ||| |||| GACGGAATGG CAGGACGGAC CGTCACAGGC GTGATGGGCC GTCACAGACC CTTGGTAAAA 470 TTTCAGTCTC TGAACTCTGT GACGGAAGCA GCAGGACGGA CCGTCGCAGG CACGACGACC 17659 | |||||| |||||||||| |||||| | |||||||||| ||||| ||| | | | ATCTAGTCTC TGAACTCTGT GACGGACG-T GCAGGACGGA CCGTCACAGA CTGCGTAATC 529 C 17658 | C 530 hqPGS_C06HBa0153O03.1-1-_SGN-E252199+ (17955 17658) ******************************************************************************** EST sequence 138 +strand 858 n (File: SGN-E540411+) 1 TTTTTTTTTT CTCATAAACA TAAAGTACAC TAATATTATT ATTATAAAAT TCTCCAGTCT 61 TATACAAAAC AACATATGAC TTCACTTGAA CTATAATTAA AGAACAATAA AGGGATAATG 121 CACAAGTACC CCCTCAACCT ATGCCCGAAA TTCCAGAGAC ACACTTATAC TATACTAAGG 181 TCCTATTACC CCCCTGAACT TATTTTATAA GTAATTTTCT ACCCCTTTTT AGCCTACGTG 241 GCACTAGTTT AAAAAAAAAG TCAACAACCA TTGGGCCCAC AAGATAGTGC CACGTAGGTC 301 TAAAAGGGGT AAAAAATTAT TAATAAATAA GTTCAGGGGG TAATAAGATC TTAGTATGGT 361 ATAAGTGTAT CTCTGAGATT TTGGACATAG GCTAAGGGGG TACTTGGACA TTATCCCAAC 421 AATAAATAAA GGATATAACA TGATTCAAAA GACAACACGT GATAACCACT TCTACAACTT 481 GTGCATGATC AATTGAGCAC CATATTGAGG TTGAAGCAAC AATGGGTGAG GAGCATGAGC 541 ATAAGATGGA GAGAGTTCAA ATGCATAATG TTTTAGAATC ATGACCATTG CCATTTTTGC 601 CTCTAACATA GCAAAATTTT GCCCAATACA TATTCTTGGA CCCCAACTAA ATGGAAAAAA 661 TACAACTTGT CCTTTTGTTG CTTTTGATAT TCCTTCACTA AATCTCTCTG GCATAAACTC 721 CATTGCATCA TCTCCCCATA TTTCAGTATC ATGATGCACT AACATTGTTG CCAATATGAG 781 TTGGACCCCA GAGGGTAACA CAAATCCCCT AACTTTGTTT CTGTATTCAC CATGCGATTA 841 ATCGCGTATA CTGATGTA Predicted gene structure (within gDNA segment 20394 to 11299): Exon 1 18340 18006 ( 335 n); cDNA 86 423 ( 338 n); score: 0.794 PPA cDNA 12 1 MATCH C06HBa0153O03.1-1- SGN-E540411+ 0.794 335 0.390 C PGS_C06HBa0153O03.1-1-_SGN-E540411+ (18340 18006) Alignment (genomic DNA sequence = upper lines): TTGGATTA-A ATTTTTAAAT TATATTGAGA AAATGCACAA GTATTCCCTC AAACTATGTC 18282 ||| | || | ||| || ||| | || ||||||||| ||| ||||| || ||||| | TTGAACTATA ATTAAAGAAC AATAAAGGGA TAATGCACAA GTACCCCCTC AACCTATGCC 145 TGAAATCCCA GAGACACACT TATACTATAT TAAGGTCATA TTA-CCCCCT GAACTTATTT 18223 ||||| ||| |||||||||| ||||||||| ||||||| || ||| |||||| |||||||||| CGAAATTCCA GAGACACACT TATACTATAC TAAGGTCCTA TTACCCCCCT GAACTTATTT 205 TATAAGTAAT TTTCTACCCC TTTTGA-CCT ACGTGGCTCT AGCTTGAAAA AAAAGTCAAT 18164 |||||||||| |||||||||| |||| | ||| ||||||| || || || |||| ||||||||| TATAAGTAAT TTTCTACCCC TTTTTAGCCT ACGTGGCACT AGTTTAAAAA AAAAGTCAAC 265 CAGCGTTGGA CCCACAAGAT AGTGCCACAT AGACCGAAAA GGGCTAGAAA ATTATTAATA 18104 | | |||| |||||||||| |||||||| | || | |||| ||| || ||| ||||||||| AACCATTGGG CCCACAAGAT AGTGCCACGT AGGTCTAAAA GGGGTAAAAA ATTATTAAT- 324 AAATAAGTTC A-GGGATAAT AGGACCTTAG TATAGTGTAA GTATGACTTT AAAATTTCAG 18045 |||||||||| | ||| |||| | || ||||| ||| || ||| || | || | | |||| | AAATAAGTTC AGGGGGTAAT AAGATCTTAG TATGGTATAA GTGTATCTCT GAGATTTTGG 384 GCATAAATTG AGAGGGTACT TGTGCATTAT CTCAATAAT 18006 |||| | || ||||||| || |||||| | ||| ||| ACATAGGCTA AGGGGGTACT TGGACATTAT CCCAACAAT 423 hqPGS_C06HBa0153O03.1-1-_SGN-E540411+ (18340 18006) ******************************************************************************** EST sequence 149 -strand 686 n (File: SGN-E241789-) 1 TTGTTAATGA TGATGTCTTG GTATAAAAGA AGGCTTGATG AACTAAAAGA ATGAGGTTAG 61 GGGATCGGGT GTCACGAACC GACACGTAGT ATTAATGGAT CGGGTGTCAC GAACCGACAC 121 ATAGTATTAA TGGATCGGGT GTCACGAATC GGCACGTAGT ATTAGGAGAT CGGGTGTAAC 181 GAACCGACAC GTAGTATTAG GGGATCGGGT GTCACGAACC GACACGTAGC ATTAGGGGAT 241 CGGAGTATCA CGTTCCGACA CCACGATAGT AAAAAGAATG AATCTTGAAT TATGTTAATG 301 TACTCAATTT AATGAACCTG TTTCCCAAAT GAGTATGGTG TGGAGGCTTG AGTCCTCATA 361 GATGTTCTTG GGTTGTGCCC AATGGTTATG GTACTTGTTG TTGTCACCTG TTAAGTGTTA 421 TGGTTGATTT TATTTTATTA TTTGATATAT ATTGTTCTCT ATTCTGAGTT GGCCGATGAT 481 ATCTACTCAG TACTCGTGTT TGTACTGACC CCTACTTTTA TGTTTTCTTT TTGTTAATTG 541 TGGAGTGCAG CAAACGTACC GTCGTCTTCA ACTCAACCGC AACTCTAACC AGTCTTCATC 601 ACGTCAGATT TCAGGGTGAG CTATTGTTCC TAGCTCGGAC TGGATTCTCT CTCATTCATG 661 TCTTGATGTC CTTGAAGATC AGACAT Predicted gene structure (within gDNA segment 17276 to 23917): Exon 1 20437 21038 ( 602 n); cDNA 91 686 ( 596 n); score: 0.824 MATCH C06HBa0153O03.1-1+ SGN-E241789- 0.824 602 0.878 C PGS_C06HBa0153O03.1-1+_SGN-E241789- (20437 21038) Alignment (genomic DNA sequence = upper lines): ATAAATGGAT TGGGTGTCAT GTTTCGACAC GGTAGTATTA GGGGATCGGA GTGTCAC-AT 20495 || ||||||| |||||||| | |||||| |||||||| ||||||| ||||||| | ATTAATGGAT CGGGTGTCAC GAACCGACAC -ATAGTATTA ATGGATCGG- GTGTCACGAA 148 TCTGACACGG TAGTATTAGG GGATCGGGTG TCACGTTCTG ACACGGTAGT ATTAGGGGAT 20555 || | ||| | |||||||||| ||||||||| | ||| | | |||| ||||| |||||||||| TC-GGCAC-G TAGTATTAGG AGATCGGGTG TAACGAACCG ACAC-GTAGT ATTAGGGGAT 205 CAGAGTGTCA CGTTCCGACA CGGTAGTAGT AGGGGATCGG -GTGTAACGT TCCGACACGA 20614 | | |||||| || |||||| | |||| | | |||||||||| || | |||| |||||||| | C-GGGTGTCA CGAACCGACA C-GTAGCATT AGGGGATCGG AGTATCACGT TCCGACACCA 263 TAATGATAAA GAGAATGAAT CTTGAATTAT GTTAATGTAC TCAAATTCAA AGAACCTATT 20674 || |||| ||||||||| |||||||||| |||||||||| || |||| || |||||| || CGATAGTAAA AAGAATGAAT CTTGAATTAT GTTAATGTAC TC-AATTTAA TGAACCTGTT 322 TCCCAAATGA GTATGGTGTG GAGGCTTGAG TCCTCATAGA TGTGCTTGCT GTTGTTGTCA 20734 |||||||||| |||||||||| |||||||||| |||||||||| ||| |||| ||||| || TCCCAAATGA GTATGGTGTG GAGGCTTGAG TCCTCATAGA TGTTCTTG-G GTTGTGCCCA 381 ATGGTTCTTA TACTTGTTGA TTGTCACCTG TTAAGTATTA TGGTTGATTT TATATTATTA 20794 |||||| | ||||||||| |||||||||| |||||| ||| |||||||||| ||| |||||| ATGGTTATGG TACTTGTTG- TTGTCACCTG TTAAGTGTTA TGGTTGATTT TATTTTATTA 440 TTCAGTATAT ATTGTTTTCT ATTTTGAGTT GGCCGATGAT ACCTACTCAG TACGTGTTCC 20854 || ||||| |||||| ||| ||| |||||| |||||||||| | |||||||| ||| | | TTTGATATAT ATTGTTCTCT ATTCTGAGTT GGCCGATGAT ATCTACTCAG TAC-TCGTGT 499 TTGTACTGAC CCCTACTTGT A-ATTTTCTT CTTTGTTATT TGTGGAGTGC AGCAAGCGTG 20913 |||||||||| |||||||| | | ||||||| ||||||| | |||||||||| ||||| ||| TTGTACTGAC CCCTACTTTT ATGTTTTCTT -TTTGTTAAT TGTGGAGTGC AGCAAACGTA 558 CCATCGACTT CGACTCGTCA TCAGATCTAG CCGGTCTTTA GCATATCAGA ATTCAGGGTG 20973 || ||| ||| | |||| | || |||| || ||||| | || ||||| ||||||||| CCGTCGTCTT CAACTCAACC GCAACTCTAA CCAGTCTTCA TCACGTCAGA TTTCAGGGTG 618 AGCTATTATT CCTAGCTCGT GCTGGATTCT CTC-C-TTCA CGTCTTGATG T-CTTGAAGT 21030 ||||||| || ||||||||| ||||||||| ||| | |||| ||||||||| | ||||||| AGCTATTGTT CCTAGCTCGG ACTGGATTCT CTCTCATTCA TGTCTTGATG TCCTTGAAGA 678 TCGGACAT 21038 || ||||| TCAGACAT 686 hqPGS_C06HBa0153O03.1-1+_SGN-E241789- (20437 21038) ******************************************************************************** EST sequence 78 +strand 481 n (File: SGN-E246710+) 1 CACAAACCGA CATATAGATT TAGGGGATCG GAGTGTCACG TACCGACACA AGAGGATTAA 61 TGAATATTGA GGGAGCGGAG TGTCACGTAC CGACACAAGA GAAATAAAGA TAATGAATCT 121 TGAAAGATGT TAATATACTC AATCTAATGA ACATGATTCC CAAATGAGTA TGGTATTGAG 181 GCTTGAGTCC TCATGTGTGA ACTTGACGGT AATTGTTAAT GATATAGTAT TTGTTGTTGC 241 TACATGTTGA GTATCATAGT TGATTTTATG ATATTACTTG GTATATATAT TGATTTCTAT 301 TTTGAGTTGG CCGATGATAT CTACTCAGTA CCCGTGTTTT GTACTGACCC CTACTTTTAT 361 GTTCTCTTCT TGTTTATTTG TGGAGTGCAG CAAACGTGCC ATCGTGTTCA ACTCAACAGT 421 AATTCAAGCC AGTCTTACTA CATCGGAAAT TCAGGGTGAG CTAATGCTTC TAGCTTGGAC 481 T Predicted gene structure (within gDNA segment 18514 to 22398): Exon 1 20588 20996 ( 409 n); cDNA 71 481 ( 411 n); score: 0.758 MATCH C06HBa0153O03.1-1+ SGN-E246710+ 0.758 409 0.850 C PGS_C06HBa0153O03.1-1+_SGN-E246710+ (20588 20996) Alignment (genomic DNA sequence = upper lines): GGGATCGG-G TGTAACGTTC CGACACGATA ATGATAAAGA GAATGAATCT TGAATTATGT 20646 |||| ||| | ||| |||| | |||||| | | ||||||| ||||||||| |||| |||| GGGAGCGGAG TGTCACGTAC CGACACAAGA GAAATAAAGA TAATGAATCT TGAAAGATGT 130 TAATGTACTC AAATTCAAAG AACCTATTTC CCAAATGAGT ATGGTGTGGA GGCTTGAGTC 20706 |||| ||||| ||| || | ||| | ||| |||||||||| ||||| | || |||||||||| TAATATACTC -AATCTAATG AACATGATTC CCAAATGAGT ATGGTATTGA GGCTTGAGTC 189 CTCATAGATG TGCTTG-CTG TTGTTGTCAA TGGTTCTTAT ACTTGTTGAT TGTCACCTGT 20765 ||||| || |||| | | | |||| || | | | | | | |||||| | || || ||| CTCATGTGTG AACTTGACGG TAATTGTTAA T-GATATAGT ATTTGTTG-T TGCTACATGT 247 TAAGTATTAT GGTTGATTTT ATATTATTA- TT-CAGTATA TATTGTTTTC TATTTTGAGT 20823 | ||||| || ||||||||| || ||||| || |||| ||||| |||| |||||||||| TGAGTATCAT AGTTGATTTT ATGATATTAC TTGGTATATA TATTGATTTC TATTTTGAGT 307 TGGCCGATGA TACCTACTCA GTACGTGTTC CTTGTACTGA CCCCTACTTG TA-ATTTTCT 20882 |||||||||| || ||||||| |||| || ||||||||| ||||||||| || || ||| TGGCCGATGA TATCTACTCA GTACCCGTGT TTTGTACTGA CCCCTACTTT TATGTTCTCT 367 TCTTTGTTAT TTGTGGAGTG CAGCAAGCGT GCCATCGACT TCGACTCGTC ATCAGATCTA 20942 |||| |||| |||||||||| |||||| ||| ||||||| | || |||| | | | || | TCTTGTTTAT TTGTGGAGTG CAGCAAACGT GCCATCGTGT TCAACTCAAC AGTAATTCAA 427 GCCGGTCTTT AGCATATCAG -AATTCAGGG TGAGCTATTA TTCCTAGCTC GTGCT 20996 ||| ||| || | | ||| | ||||||||| ||||||| | | |||||| | || GCCAGTC-TT ACTACATCGG AAATTCAGGG TGAGCTAATG CTTCTAGCTT GGACT 481 hqPGS_C06HBa0153O03.1-1+_SGN-E246710+ (20588 20996) ******************************************************************************** EST sequence 45 +strand 729 n (File: SGN-E351546+) 1 TTGGCCGATG ATATCTACTC AGTAGCCGTG TTTTGTACTG ACCCCTACTT TTATGTTTTT 61 CTTCTTGGTA TTTGTGGAGT GCAGCAAACG TGCCGTCATC TTCGACTCAA CAGTAACTCA 121 AGCCAGTCTT CGTCACACCG GATCTTCAGG GTGAGCTAAC GCTTCTAGCT TGGACTGGAT 181 CTTCTCCTTC ATGTCTTGAT GCCTTGAACT TCCGGCATGG ACTAGCTTCT TATGTAATTT 241 TTTAGCTTCT TAGAAACTCT TAGAATTAGT AGTTTAAAGT AGATGTTCTT GTGATGATGA 301 CTTCCAGGTT TTGGGAATAA TAGATGTTTA ATAATAGTAG TTATTGATTT TATTAATGAG 361 TTTAAGTCTT CCGCATTACT TTCTGTTGCT ATTACATTGA AATGTTAAGG TTTAGATTGG 421 TTGGTTCGCT CACATAGGAG GGTAAGTGTG GGTGCCAGTG GCGGCCCGGA TTTGGGTCGT 481 GACAAAACTT GGTATCAGAG CATTAGGTTC GTTGGTCTCA TCACACAAGA ACAAGTCTAG 541 TAGAGTCTTA AGGAACGGTA GGGGGACGCT TTTACTTTTC CTTGAGAGGC TATAAGACTT 601 TAGGAAAACT TCACTCTTTC ATTCTTTCTT TCGTGCTACT ACTTCGAGTC AATTGGTATC 661 TAAGCGATAC GAATTGGTAT CTGACCATNC TCACTCTCTT GCCAGATGGG TAGAACTAGA 721 GCAACGACT Predicted gene structure (within gDNA segment 19935 to 23609): Exon 1 20823 21555 ( 733 n); cDNA 1 729 ( 729 n); score: 0.768 MATCH C06HBa0153O03.1-1+ SGN-E351546+ 0.768 733 1.005 C PGS_C06HBa0153O03.1-1+_SGN-E351546+ (20823 21555) Alignment (genomic DNA sequence = upper lines): TTGGCCGATG ATACCTACTC AGTACGTGTT CCTTGTACTG ACCCCTACTT GTA-ATTTTC 20881 |||||||||| ||| |||||| |||| || |||||||| |||||||||| || |||| TTGGCCGATG ATATCTACTC AGTAGCCGTG TTTTGTACTG ACCCCTACTT TTATGTTTTT 60 TTCTTTGTTA TTTGTGGAGT GCAGCAAGCG TGCCATCGAC TTCGACTCGT CATCAGATCT 20941 | ||| || |||||||||| ||||||| || |||| || | |||||||| || | || CTTCTTGGTA TTTGTGGAGT GCAGCAAACG TGCCGTCATC TTCGACTCAA CAGTAACTCA 120 AGCCGGTCTT TAGCATATCA GA-ATTCAGG GTGAGCTATT ATTCCTAGCT CGTGCTGGAT 21000 |||| ||||| || | | || |||||| |||||||| | |||||| | |||||| AGCCAGTCTT CGTCACACCG GATCTTCAGG GTGAGCTAAC GCTTCTAGCT TGGACTGGAT 180 TCTCTCCTTC ACGTCTTGAT GTCTTGAAGT TCGGACATGG ACCATCTTTT TA-CT-A-TT 21057 |||||||| | |||||||| | |||||| | || | ||||| || | ||| | || | | || CTTCTCCTTC ATGTCTTGAT GCCTTGAACT TCCGGCATGG ACTAGCTTCT TATGTAATTT 240 TTTAGCTTCT TGAATACTCT TAGATTTAGA AATTCGAGGA TAGATGTTCT TGGTGTGATG 21117 |||||||||| | | ||||| |||| |||| | || | | |||||||||| || ||||| TTTAGCTTCT TAGAAACTCT TAGAATTAGT AGTTTAAAG- TAGATGTTCT TGTGATGATG 299 ACTTACAGAT TTTGGGAATA ATAAGTATTT AACTTTAGAT GTTGATTTAA TTAATTTCGT 21177 |||| ||| | |||||||||| ||| | ||| || || || | | ||| | | ||| | ACTTCCAGGT TTTGGGAATA ATAGATGTTT AA---TA-AT AGT-AGTTAT TGATTTTATT 354 AATGAGTTTT GGAGCTTCCG CATTATTTAT ATTATTTATA GTTGATATAC TGGTAAATGT 21237 ||||||||| | |||||| ||||| || | | || || || | || | | |||||| AATGAGTTTA AG-TCTTCCG CATTACTT-T -CTGTTGCTA -TT-ACAT-- T-G-AAATGT 405 TGGGGTTTAG ATTTGTTGGT TCGCTCACCT AGGAGAGTAA GGGTGGGTGC CACTTACGGA 21297 | ||||||| ||| |||||| |||||||| | ||||| |||| | |||||||| || | ||| TAAGGTTTAG ATTGGTTGGT TCGCTCACAT AGGAGGGTAA GTGTGGGTGC CAGTGGCGGC 465 TC-GTTTTGG GTCGTGAC-A AACTTGGTAT CAGAGCGTTA GGTTCGTTGG TCTCATCACA 21355 | | ||||| |||||||| | |||||||||| |||||| ||| |||||||||| |||||||||| CCGGATTTGG GTCGTGACAA AACTTGGTAT CAGAGCATTA GGTTCGTTGG TCTCATCACA 525 CAAGAACAAG TCTAGTAGAG TCTTGAGGAA CGGTAGGGGG ACGCCCTTAC TTTTCTTCGA 21415 |||||||||| |||||||||| |||| ||||| |||||||||| |||| |||| ||||| | || CAAGAACAAG TCTAGTAGAG TCTTAAGGAA CGGTAGGGGG ACGCTTTTAC TTTTCCTTGA 585 GAGGCTATAG GACTTTAGCA AAA-TT--C- CATTCTTTCC TTCTTTCGTG CTATTACTTG 21471 ||||||||| |||||||| | ||| || | | ||| ||| |||||||||| ||| ||||| GAGGCTATAA GACTTTAGGA AAACTTCACT CTTTCATTCT TTCTTTCGTG CTACTACTTC 645 GATCCAAGTG GTATCTAGGT GATACAAATT GGTATCTGA- CATCCTCACT CTATTTCGCA 21530 || ||| || ||||||| | ||||| |||| ||||||||| ||| |||||| || || | || GAGTCAATTG GTATCTAAGC GATACGAATT GGTATCTGAC CATNCTCACT CTCTTGC-CA 704 TATGGTTAGA ACTAGAGAAA CAACT 21555 |||| |||| ||||||| || | ||| GATGGGTAGA ACTAGAGCAA CGACT 729 hqPGS_C06HBa0153O03.1-1+_SGN-E351546+ (20823 21555) ******************************************************************************** EST sequence 86 +strand 655 n (File: SGN-E356696+) 1 TTGGCCGATG ATATCTACTC AGTAGCCGTG TTTTGTACTG ACCCCTACTT TTATGTTTTT 61 CTTCTTGTTA TTTGTGGAGT GCAGCAAACG TGCCGTCATC TTCGACTCAA CAGTAACTCA 121 AGCCAGTCTT CGTCACACCG GATCTTCAGG GTGAGCTAAC GCTTCTAGCT TGGACTGGAT 181 CTTCTCCTTC ATGTCTTGAT GCCTTGAACT TCCGGCATGG ACTAGCTTCT TATGTAATTT 241 TTTAGCTTCT TAGAAACTCT TAGAATTAGT AGTTTAAAGT AGATGTTCTT GTGATGATGA 301 CTTCCAGGTT TTGGGAATAA TAGATGTTTA ATAATAGTAG TTATTGATTT TATTAATGAG 361 TTTAAGTCTT CCGCATTACT TTCTGTTGCT ATTACATTGA AATGTTAAGG TTTAGATTGG 421 TTGGTTCGCT CACATANGAG GGTAAGTGTG GGTGCCAGTG GCGGCCCGGA TTTGGGTCGT 481 GACAAAACTT GGTATCAGAG CATTAGGTTC GTTGGTCTCA TCACACAAGA ACAAGTCTAG 541 TAGAGTCTTA AGGAACGGTA GGGGGACGCT TTTACTTTTC CTTGAGAGGC TATAAGACTT 601 TAGGAAAACT TCACTCTTTC ATTCTTTCTT TCGTGCTACT ACTTGAGTCC AATTG Predicted gene structure (within gDNA segment 19935 to 22869): Exon 1 20823 21458 ( 636 n); cDNA 1 628 ( 628 n); score: 0.776 MATCH C06HBa0153O03.1-1+ SGN-E356696+ 0.776 636 0.971 C PGS_C06HBa0153O03.1-1+_SGN-E356696+ (20823 21458) Alignment (genomic DNA sequence = upper lines): TTGGCCGATG ATACCTACTC AGTACGTGTT CCTTGTACTG ACCCCTACTT GTA-ATTTTC 20881 |||||||||| ||| |||||| |||| || |||||||| |||||||||| || |||| TTGGCCGATG ATATCTACTC AGTAGCCGTG TTTTGTACTG ACCCCTACTT TTATGTTTTT 60 TTCTTTGTTA TTTGTGGAGT GCAGCAAGCG TGCCATCGAC TTCGACTCGT CATCAGATCT 20941 | |||||| |||||||||| ||||||| || |||| || | |||||||| || | || CTTCTTGTTA TTTGTGGAGT GCAGCAAACG TGCCGTCATC TTCGACTCAA CAGTAACTCA 120 AGCCGGTCTT TAGCATATCA GA-ATTCAGG GTGAGCTATT ATTCCTAGCT CGTGCTGGAT 21000 |||| ||||| || | | || |||||| |||||||| | |||||| | |||||| AGCCAGTCTT CGTCACACCG GATCTTCAGG GTGAGCTAAC GCTTCTAGCT TGGACTGGAT 180 TCTCTCCTTC ACGTCTTGAT GTCTTGAAGT TCGGACATGG ACCATCTTTT TA-CT-A-TT 21057 |||||||| | |||||||| | |||||| | || | ||||| || | ||| | || | | || CTTCTCCTTC ATGTCTTGAT GCCTTGAACT TCCGGCATGG ACTAGCTTCT TATGTAATTT 240 TTTAGCTTCT TGAATACTCT TAGATTTAGA AATTCGAGGA TAGATGTTCT TGGTGTGATG 21117 |||||||||| | | ||||| |||| |||| | || | | |||||||||| || ||||| TTTAGCTTCT TAGAAACTCT TAGAATTAGT AGTTTAAAG- TAGATGTTCT TGTGATGATG 299 ACTTACAGAT TTTGGGAATA ATAAGTATTT AACTTTAGAT GTTGATTTAA TTAATTTCGT 21177 |||| ||| | |||||||||| ||| | ||| || || || | | ||| | | ||| | ACTTCCAGGT TTTGGGAATA ATAGATGTTT AA---TA-AT AGT-AGTTAT TGATTTTATT 354 AATGAGTTTT GGAGCTTCCG CATTATTTAT ATTATTTATA GTTGATATAC TGGTAAATGT 21237 ||||||||| | |||||| ||||| || | | || || || | || | | |||||| AATGAGTTTA AG-TCTTCCG CATTACTT-T -CTGTTGCTA -TT-ACAT-- T-G-AAATGT 405 TGGGGTTTAG ATTTGTTGGT TCGCTCACCT AGGAGAGTAA GGGTGGGTGC CACTTACGGA 21297 | ||||||| ||| |||||| |||||||| | | ||| |||| | |||||||| || | ||| TAAGGTTTAG ATTGGTTGGT TCGCTCACAT ANGAGGGTAA GTGTGGGTGC CAGTGGCGGC 465 TC-GTTTTGG GTCGTGAC-A AACTTGGTAT CAGAGCGTTA GGTTCGTTGG TCTCATCACA 21355 | | ||||| |||||||| | |||||||||| |||||| ||| |||||||||| |||||||||| CCGGATTTGG GTCGTGACAA AACTTGGTAT CAGAGCATTA GGTTCGTTGG TCTCATCACA 525 CAAGAACAAG TCTAGTAGAG TCTTGAGGAA CGGTAGGGGG ACGCCCTTAC TTTTCTTCGA 21415 |||||||||| |||||||||| |||| ||||| |||||||||| |||| |||| ||||| | || CAAGAACAAG TCTAGTAGAG TCTTAAGGAA CGGTAGGGGG ACGCTTTTAC TTTTCCTTGA 585 GAGGCTATAG GACTTTAGCA AAATTCCATT CTTTCCTTCT TTC 21458 ||||||||| |||||||| | ||| | || | ||||| |||| ||| GAGGCTATAA GACTTTAGGA AAACTTCACT CTTTCATTCT TTC 628 hqPGS_C06HBa0153O03.1-1+_SGN-E356696+ (20823 21458) ******************************************************************************** EST sequence 82 +strand 580 n (File: SGN-E356206+) 1 TTGGCCGATG ATATCTACTC AGTAGCCGTG TTTTGTACTG ACCCCTACTT TTATGTTTTT 61 CTTCTTGTTA TTTGTGGAGT GCAGCAAACG TGCCGTCATC TTCGACTCAA CAGTAACTCA 121 AGCCAGTCTT CGTCACACCG GATCTTCAGG GTGAGCTAAC GCTTCTAGCT TGGACTGGAT 181 CTTCTCCTTC ATGTCTTGAT GCCTTGAACT TCCGGCATGG ACTAGCTTCT TATGTAATTT 241 TTTAGCTTCT TAGAAACTCT TAGAATTAGT AGTTTAAAGT AGATGTTCTT GTGATGATGA 301 CTTCCAGGTT TTGGGAATAA TAGATGTTTA ATAATAGTAG TTATTGATTT TATTAATGAG 361 TTTAAGTCTT CCGCATTACT TTCTGTTGCT ATTACATTGA AATGTTAAGG TTTAGATTGG 421 TTGGTTCGCT CACATAGGAG GGTAAGTGTG GGTGCCAGTG GCGGCCCGGA TTTGGGTCGT 481 GACAAAACTT GGTATCAGAG CATTAGGGTC GTTGGTCTCA TCACACAAGA ACAAGTCTAG 541 TAGAGTCTTA AGGGACGGTA NGGGGACGCT TTTACTTTTC Predicted gene structure (within gDNA segment 19935 to 22299): Exon 1 20823 21410 ( 588 n); cDNA 1 580 ( 580 n); score: 0.768 MATCH C06HBa0153O03.1-1+ SGN-E356206+ 0.768 588 1.014 C PGS_C06HBa0153O03.1-1+_SGN-E356206+ (20823 21410) Alignment (genomic DNA sequence = upper lines): TTGGCCGATG ATACCTACTC AGTACGTGTT CCTTGTACTG ACCCCTACTT GTA-ATTTTC 20881 |||||||||| ||| |||||| |||| || |||||||| |||||||||| || |||| TTGGCCGATG ATATCTACTC AGTAGCCGTG TTTTGTACTG ACCCCTACTT TTATGTTTTT 60 TTCTTTGTTA TTTGTGGAGT GCAGCAAGCG TGCCATCGAC TTCGACTCGT CATCAGATCT 20941 | |||||| |||||||||| ||||||| || |||| || | |||||||| || | || CTTCTTGTTA TTTGTGGAGT GCAGCAAACG TGCCGTCATC TTCGACTCAA CAGTAACTCA 120 AGCCGGTCTT TAGCATATCA GA-ATTCAGG GTGAGCTATT ATTCCTAGCT CGTGCTGGAT 21000 |||| ||||| || | | || |||||| |||||||| | |||||| | |||||| AGCCAGTCTT CGTCACACCG GATCTTCAGG GTGAGCTAAC GCTTCTAGCT TGGACTGGAT 180 TCTCTCCTTC ACGTCTTGAT GTCTTGAAGT TCGGACATGG ACCATCTTTT TA-CT-A-TT 21057 |||||||| | |||||||| | |||||| | || | ||||| || | ||| | || | | || CTTCTCCTTC ATGTCTTGAT GCCTTGAACT TCCGGCATGG ACTAGCTTCT TATGTAATTT 240 TTTAGCTTCT TGAATACTCT TAGATTTAGA AATTCGAGGA TAGATGTTCT TGGTGTGATG 21117 |||||||||| | | ||||| |||| |||| | || | | |||||||||| || ||||| TTTAGCTTCT TAGAAACTCT TAGAATTAGT AGTTTAAAG- TAGATGTTCT TGTGATGATG 299 ACTTACAGAT TTTGGGAATA ATAAGTATTT AACTTTAGAT GTTGATTTAA TTAATTTCGT 21177 |||| ||| | |||||||||| ||| | ||| || || || | | ||| | | ||| | ACTTCCAGGT TTTGGGAATA ATAGATGTTT AA---TA-AT AGT-AGTTAT TGATTTTATT 354 AATGAGTTTT GGAGCTTCCG CATTATTTAT ATTATTTATA GTTGATATAC TGGTAAATGT 21237 ||||||||| | |||||| ||||| || | | || || || | || | | |||||| AATGAGTTTA AG-TCTTCCG CATTACTT-T -CTGTTGCTA -TT-ACAT-- T-G-AAATGT 405 TGGGGTTTAG ATTTGTTGGT TCGCTCACCT AGGAGAGTAA GGGTGGGTGC CACTTACGGA 21297 | ||||||| ||| |||||| |||||||| | ||||| |||| | |||||||| || | ||| TAAGGTTTAG ATTGGTTGGT TCGCTCACAT AGGAGGGTAA GTGTGGGTGC CAGTGGCGGC 465 TC-GTTTTGG GTCGTGAC-A AACTTGGTAT CAGAGCGTTA GGTTCGTTGG TCTCATCACA 21355 | | ||||| |||||||| | |||||||||| |||||| ||| || ||||||| |||||||||| CCGGATTTGG GTCGTGACAA AACTTGGTAT CAGAGCATTA GGGTCGTTGG TCTCATCACA 525 CAAGAACAAG TCTAGTAGAG TCTTGAGGAA CGGTAGGGGG ACGCCCTTAC TTTTC 21410 |||||||||| |||||||||| |||| ||| | ||||| |||| |||| |||| ||||| CAAGAACAAG TCTAGTAGAG TCTTAAGGGA CGGTANGGGG ACGCTTTTAC TTTTC 580 hqPGS_C06HBa0153O03.1-1+_SGN-E356206+ (20823 21410) ******************************************************************************** EST sequence 205 -strand 710 n (File: SGN-E392027-) 1 CAGCAAACGT GCCATCGTGT TCAACTCAAC AGTAATTCAA GCCAGTCTTA CTACATCGGA 61 AATTCAGGGT GAGCTAATGC TTCTAGCTTG GACTGGATCT TCTTCTTCAA GTCTTGATGC 121 CTTGAACTTC CGGCATGGAC TAGCTTCTTA TGTATTTTTA GCTTTTAGAC TACTCTTAGT 181 TTAGTCATTT GATCGTAGAT GTTCTTGTGG TGATGACTTC CAGATTTTGG GGAATAATAG 241 ATGTTGAATT TTAGAAGTTA ATGAATTGGT CTGTATTTAA TGAGTTTAAG TCTTCCACAT 301 TACTTTCTGT TGATATTATA TTGAAATGTT AAGGTTAGAT TGGTTGGTTC GCTCACATAG 361 GAGGGTAAGT GTGGGTGCCA GTCGCAACCC GGTTTTGGTC GTGACAAACT TGGTATCAGA 421 GCATTAGGTT CGTTGGTCTC ATCACACAAG AACGAGTCTA GTAGAGTCTT AAGGAACGGT 481 AGGGGGATGC CTTTACTTTT CCTTGAGAGG CTATAAGACT TTTGGAAAAT TCCATTCTTT 541 CTTCTTTCGT GCTATTACTT GGGTCCAATT GGTATCTAGG TGATACAAAT TGGTATCTGA 601 CCATCTTCAC TCTATTTCGC AGATGGTTAG AACTAGAGCA ACGACTACGC CAGCATCAAC 661 ACCAGCACCG GCGGGACAGG GTGCGACTGA GCCAGCCACT GGGGCTGTGG Predicted gene structure (within gDNA segment 16736 to 23201): Exon 1 20903 21572 ( 670 n); cDNA 1 663 ( 663 n); score: 0.801 MATCH C06HBa0153O03.1-1+ SGN-E392027- 0.801 670 0.944 C PGS_C06HBa0153O03.1-1+_SGN-E392027- (20903 21572) Alignment (genomic DNA sequence = upper lines): CAGCAAGCGT GCCATCGACT TCGACTCGTC ATCAGATCTA GCCGGTCTTT AGCATATCAG 20962 |||||| ||| ||||||| | || |||| | | | || | ||| ||| || | | ||| | CAGCAAACGT GCCATCGTGT TCAACTCAAC AGTAATTCAA GCCAGTC-TT ACTACATCGG 59 -AATTCAGGG TGAGCTATTA TTCCTAGCTC GTGCTGGATT CTCTCCTTCA CGTCTTGATG 21021 ||||||||| ||||||| | | |||||| | |||||| ||| ||||| ||||||||| AAATTCAGGG TGAGCTAATG CTTCTAGCTT GGACTGGATC TTCTTCTTCA AGTCTTGATG 119 TCTTGAAGTT CGGACATGGA CCATCTTTTT A-CTATTTTT AGCTTCTTGA ATACTCTTAG 21080 |||||| || | | |||||| | | ||| || | ||||||| ||||| | || ||||||||| CCTTGAACTT CCGGCATGGA CTAGCTTCTT ATGTATTTTT AGCTTTTAGA CTACTCTTAG 179 ATTTAGAAAT TCGAGGATAG ATGTTCTTGG TGTGATGACT TACAGATTTT -GGGAATAAT 21139 ||||| || | || ||| ||||||||| ||||||||| | |||||||| ||||||||| -TTTAGTCAT TTGATCGTAG ATGTTCTTGT GGTGATGACT TCCAGATTTT GGGGAATAAT 238 AAGTATTTAA CTTTAGATGT TGATTTAATT AAT-T-T-CG TAATGAGTTT TGGAGCTTCC 21196 | | || || |||||| || | | | |||| | | | |||||||||| | ||||| AGATGTTGAA TTTTAGAAGT T-AATGAATT GGTCTGTATT TAATGAGTTT AAG-TCTTCC 296 GCATTATTTA TATTATTTAT AGTTGATATA CTGGTAAATG TTGGGGTTTA GATTTGTTGG 21256 ||||| || | | || || | || |||| | | ||||| || || ||| |||| ||||| ACATTACTT- T-CTGTTGAT A-TT-ATAT- -T-G-AAATG TTAAGG-TTA GATTGGTTGG 347 TTCGCTCACC TAGGAGAGTA AGGGTGGGTG CCACTTACGG ATCGTTTTGG GTCGTGACAA 21316 ||||||||| |||||| ||| || ||||||| ||| | | || ||| | |||||||||| TTCGCTCACA TAGGAGGGTA AGTGTGGGTG CCAGTCGCAA CCCGGTTTTG GTCGTGACAA 407 ACTTGGTATC AGAGCGTTAG GTTCGTTGGT CTCATCACAC AAGAACAAGT CTAGTAGAGT 21376 |||||||||| ||||| |||| |||||||||| |||||||||| |||||| ||| |||||||||| ACTTGGTATC AGAGCATTAG GTTCGTTGGT CTCATCACAC AAGAACGAGT CTAGTAGAGT 467 CTTGAGGAAC GGTAGGGGGA CGCCCTTACT TTTCTTCGAG AGGCTATAGG ACTTTAGCAA 21436 ||| |||||| |||||||||| ||| ||||| |||| | ||| |||||||| | ||||| | || CTTAAGGAAC GGTAGGGGGA TGCCTTTACT TTTCCTTGAG AGGCTATAAG ACTTTTGGAA 527 AATTCCATTC TTTCCTTCTT TCGTGCTATT ACTTGGATCC AAGTGGTATC TAGGTGATAC 21496 |||||||||| ||| |||||| |||||||||| |||||| ||| || ||||||| |||||||||| AATTCCATTC TTT-CTTCTT TCGTGCTATT ACTTGGGTCC AATTGGTATC TAGGTGATAC 586 AAATTGGTAT CTGA-CATCC TCACTCTATT TCGCATATGG TTAGAACTAG AGAAACAACT 21555 |||||||||| |||| |||| |||||||||| ||||| |||| |||||||||| || ||| ||| AAATTGGTAT CTGACCATCT TCACTCTATT TCGCAGATGG TTAGAACTAG AGCAACGACT 646 GTGCCAACAC CAACACC 21572 |||| || ||||||| ACGCCAGCAT CAACACC 663 hqPGS_C06HBa0153O03.1-1+_SGN-E392027- (20903 21572) ******************************************************************************** EST sequence 207 -strand 835 n (File: SGN-E546219-) 1 TGATCAGACA CCTTTATTCT TCGTACTTAC CATGGGTAGC ACAAAATACG TTCAGCGACA 61 TGTACAACCC GATAACCTTG CATCTTGGTT GAAGGATTGT AAGGACAGAA AGTTGAAGCC 121 ATACTTGAAG TCACAGCCAA TTCCAGAATT TAACAATGAA ACTGTGAAGG TGGTGGTTGC 181 AGAGACTCTT GACGATATGG TGTTTAATTC GGGAAAAGAT GTACTGTTGG AGTTCTATCG 241 ACTTGGGTGT AGATATTGTG AGGAGTTTGC TCCCGTCTTG GATGAAATAG CTATCTCATT 301 TGAAAAAGAT CCTCATGTCG TGATTGCTAA AATTGATGGA ACGGAAAATG ATATACCTCG 361 TGACGTATTT GAAGTTGAAG GATTCCCAAC TTTATACTTG AGATCTTCAA CGGGTAGTTT 421 GTCACGGTTT GAGGGTAATA GAACAAAAGA GGCCATTATT GAATTTATCC AGACAAACAG 481 AGGTAGCCCT GCCTTTGATT TTAGCATATC ACAAACTCAA ACAGATCAAG TGAAGGATGA 541 GTTGTAAGAA GCAAGTGCAC CATCAACTTC GACCCGCCTT CAACTCTAGC CAGTCTCCAG 601 CACATCAGAT TTCAGGGTGA GCTATTATTC CTAGCTCGGA CTGGATTCTC TCCCTCACGT 661 CTTGATGTCT TGAAGTTCGG ACATGGACCG TTTTCTTTTT ACTTATTTTA GCTTCTTAAA 721 TACTCTTAGA TTTACTGAGG ATAGATGTTA TTGATGTGAT GACTTCCAGA TTTTGGGATT 781 AATAAATATT AAACTTTAGA AGTTTATTTA ATTGATTACG TTAATAAAAA AAAAA Predicted gene structure (within gDNA segment 14255 to 23103): Exon 1 20904 21173 ( 270 n); cDNA 550 817 ( 268 n); score: 0.839 PPA cDNA 823 835 MATCH C06HBa0153O03.1-1+ SGN-E546219- 0.839 270 0.323 C PGS_C06HBa0153O03.1-1+_SGN-E546219- (20904 21173) Alignment (genomic DNA sequence = upper lines): AGCAAGCGTG CCATCGACTT CGACTCGTCA TCAGATCTAG CCGGTCTTTA GCATATCAGA 20963 |||||| | ||||| |||| |||| || | ||| ||||| || |||| | ||| |||||| AGCAAGTGCA CCATCAACTT CGACCCGCCT TCAACTCTAG CCAGTCTCCA GCACATCAGA 609 ATTCAGGGTG AGCTATTATT CCTAGCTCGT GCTGGATTCT CTCCTTCACG TCTTGATGTC 21023 ||||||||| |||||||||| ||||||||| ||||||||| |||| ||||| |||||||||| TTTCAGGGTG AGCTATTATT CCTAGCTCGG ACTGGATTCT CTCCCTCACG TCTTGATGTC 669 TTGAAGTTCG GACATGGACC ---ATCTTTT TACTATTTTT AGCTTCTTGA ATACTCTTAG 21080 |||||||||| |||||||||| |||||| |||| |||| |||||||| | |||||||||| TTGAAGTTCG GACATGGACC GTTTTCTTTT TACTTATTTT AGCTTCTTAA ATACTCTTAG 729 ATTTAGAAAT TCGAGGATAG ATGTTCTTGG TGTGATGACT TACAGATTTT GGGAATAATA 21140 |||| | | |||||||| ||||| ||| |||||||||| | |||||||| |||| ||||| ATTT----AC T-GAGGATAG ATGTTATTGA TGTGATGACT TCCAGATTTT GGGATTAATA 784 AGTATTTAAC TTTAGATGTT GATTTAATTA ATT 21173 | |||| ||| |||||| ||| |||||||| ||| AATATTAAAC TTTAGAAGTT TATTTAATTG ATT 817 hqPGS_C06HBa0153O03.1-1+_SGN-E546219- (20904 21173) ******************************************************************************** EST sequence 69 +strand 286 n (File: SGN-E355114+) 1 TTAGGTTCGT TGGTCTCATC ACACAAGAAC AGGTCTAGTA GAGTCTTTAG GAACGGTAGG 61 GGGACGCCTT TACTTTTCTT TGAGAGGCTA TAAGACTTTA GGAAAATTTC ACCCTTTCAT 121 TCTTTCTTTC GTGCTACTAC TTGAGTCCAA TTGGTATCTA GGCGATACAA ATTGGTATCT 181 GACCATCTTC ACTCTCTTTT GCAGATGGTT AGAACTAGAG CAACGACCAC GTCAACACCA 241 ACACCGGCCA GACAAGAAAC AACTGAGCCA GCCACTGGGG CTGTGG Predicted gene structure (within gDNA segment 20733 to 22565): Exon 1 21333 21611 ( 279 n); cDNA 1 284 ( 284 n); score: 0.823 MATCH C06HBa0153O03.1-1+ SGN-E355114+ 0.823 279 0.976 C PGS_C06HBa0153O03.1-1+_SGN-E355114+ (21333 21611) Alignment (genomic DNA sequence = upper lines): TTAGGTTCGT TGGTCTCATC ACACAAGAAC AAGTCTAGTA GAGTCTTGAG GAACGGTAGG 21392 |||||||||| |||||||||| |||||||||| | |||||||| ||||||| || |||||||||| TTAGGTTCGT TGGTCTCATC ACACAAGAAC AGGTCTAGTA GAGTCTTTAG GAACGGTAGG 60 GGGACGCCCT TACTTTTCTT CGAGAGGCTA TAGGACTTTA GCAAAA-TT- --CCATTCTT 21448 |||||||| | |||||||||| ||||||||| || ||||||| | |||| || || ||| | GGGACGCCTT TACTTTTCTT TGAGAGGCTA TAAGACTTTA GGAAAATTTC ACCCTTTCAT 120 TCCTTCTTTC GTGCTATTAC TTGGATCCAA GTGGTATCTA GGTGATACAA ATTGGTATCT 21508 || ||||||| |||||| ||| ||| ||||| ||||||||| || ||||||| |||||||||| TCTTTCTTTC GTGCTACTAC TTGAGTCCAA TTGGTATCTA GGCGATACAA ATTGGTATCT 180 GA-CATCCTC ACTCTATTTC GCATATGGTT AGAACTAGAG AAACAACTGT GCCAACACCA 21567 || |||| || ||||| ||| ||| |||||| |||||||||| ||| || | |||||||| GACCATCTTC ACTCTCTTTT GCAGATGGTT AGAACTAGAG CAACGACCAC GTCAACACCA 240 ACACCGGCAA GATAGCGTGC GTCTGAGCCA AACATTGGGG TTGT 21611 |||||||| | || | | |||||||| || ||||| ||| ACACCGGCCA GACAAGAAAC AACTGAGCCA GCCACTGGGG CTGT 284 hqPGS_C06HBa0153O03.1-1+_SGN-E355114+ (21333 21611) ******************************************************************************** EST sequence 152 -strand 763 n (File: SGN-E214046-) 1 CCAGATATGC CACTCAACTC TGTTCAGTCC CTCAAGAGCG GATTCGCCGG TTGTGAAGGG 61 GTTGAGGTCA GAATTGCGGA TTTCGCCCTT ACAGGTAGCG GCAACGGCAA AATCCTTCCA 121 AGAAGTGGTA GACTTTTTGA TAGAAGTGGA AGGAGTGAAA CCAGACGACT TCACCACGAC 181 ATCGACATCG ATGAGGTTTC AAAGGGGAGA TGAGTTTAAG GGTTCTTACT CTAGAGGACA 241 AGGTTCTGGA GGTTACTCAG TTCGACCCAT TCAGTCTTCA CTACAAACTG TAGTTGGGGG 301 TCCACCTCAG ACCGGTCAAC ACTTCTCCGA GGGGCCTATG AGTGACTCTA GAGAATGTTA 361 TGGATGTGGG GAGATTGGAC ATATTAGGAG AAATTTTCCC GGACCAAGTT ATAGACCCCC 421 AATAGTTAGA GGTAGAGGTT GTCATGGTAG AGGCCGTTAT TTTGGAGGAC GTGGTGGTCG 481 AGGCAATGGT GGTCACCAAA ACGGCAGAGG TAATGGACAA ACTGGGGCCA CTACATCACG 541 ACATGGTAGG GGCAATGGAC AAACAAACTA TAGGGCCCAT TGTTACGCTT TCCCTGGGCG 601 GTCTAAAGCG GAGGCATCTG ATGCTGTCAT CACAGGTAAT CTTCTTGTTT GTGATTGCAT 661 GGCTTCCGTA TTGTTTGATC CTGGATCCAC GTTTTCTTAT GTATCTTCCT CATTTGCTAA 721 TGGTCTAAAT TTACATTGTG AATTTCTTGA TATGCCTATT CGT Predicted gene structure (within gDNA segment 15975 to 24184): Exon 1 22364 22692 ( 329 n); cDNA 1 330 ( 330 n); score: 0.786 Intron 1 22693 22728 ( 36 n); Pd: 0.000 (s: 0.84), Pa: 0.000 (s: 0.80) Exon 2 22729 23161 ( 433 n); cDNA 331 763 ( 433 n); score: 0.820 MATCH C06HBa0153O03.1-1+ SGN-E214046- 0.805 762 0.999 C PGS_C06HBa0153O03.1-1+_SGN-E214046- (22364 22692,22729 23161) Alignment (genomic DNA sequence = upper lines): CCAGGTATGC CACCCAGCTT TGCTTCAGTC CAT-AAGAGC GGATTCGCCG CTTTGTGAAA 22422 |||| ||||| ||| || || || ||||||| | | |||||| |||||||||| ||||||| CCAGATATGC CACTCAACTC TG-TTCAGTC CCTCAAGAGC GGATTCGCCG -GTTGTGAAG 58 GGATTGAGGT TAGATTTGCA GA--TCCCAG TTACAGGTAG CTGCCGCAGC AAAATCCTTT 22480 || ||||||| ||| |||| || || | |||||||||| | || | || ||||||||| GGGTTGAGGT CAGAATTGCG GATTTCGCCC TTACAGGTAG CGGCAACGGC AAAATCCTTC 118 CAGGAAGTGG TTGACTTTGT GATTGAGGTG GAGGGGGTGA AGCCAGACAA CTTCACCATG 22540 || ||||||| | |||||| | ||| || ||| || || |||| | |||||| | |||||||| | CAAGAAGTGG TAGACTTTTT GATAGAAGTG GAAGGAGTGA AACCAGACGA CTTCACCACG 178 GTGTCGACAT CTAAGAAGTT CCGTACGGGA GGTGAGTTTA GTGGTTCTTA CTCCAGAGGG 22600 ||||||| | | || ||| | | |||| | |||||||| |||||||| ||| ||||| ACATCGACAT CGATGAGGTT TCAAAGGGGA GATGAGTTTA AGGGTTCTTA CTCTAGAGGA 238 CAGAGTTCAG GAGGTTACCC AGCCTGACCT ATTCAGTCGT CACTACAGGC TGTAGCTGGG 22660 || |||| | |||||||| | || |||| |||||||| | ||||||| | ||||| |||| CAAGGTTCTG GAGGTTACTC AGTTCGACCC ATTCAGTCTT CACTACAAAC TGTAGTTGGG 298 GGTCCATCGC AGACCAGTCA ACATTTCTCT GAGTTTGGAG GTTATCCCCA GACTTCGTCA 22720 |||||| | | ||||| |||| ||| ||||| || GGTCCACCTC AGACCGGTCA ACACTTCTCC GA........ .......... .......... 330 TTCTCTTAGA GACCTATGCT TGACTCCAGA GATTGTAGTG GATGTGGAGA GACTGGACAT 22780 | | |||||| |||||| ||| || ||| || ||||||| || || ||||||| ........GG GGCCTATGAG TGACTCTAGA GAATGTTATG GATGTGGGGA GATTGGACAT 382 ATTAGGAGGT ATTGTCCAAA ATAGAGTTAC AGACCCCCAA TAATTAGAGG TAGAGGAAAT 22840 |||||||| ||| ||| | ||||| |||||||||| || ||||||| |||||| | ATTAGGAGAA ATTTTCCCGG ACCAAGTTAT AGACCCCCAA TAGTTAGAGG TAGAGGTTGT 442 CATGGGAGAG GCCGCCATTA TGGAGGACGT GGTGGCCAAG GTAATGGTGG TCACCAAATC 22900 ||||| |||| |||| ||| |||||||||| ||||| | || | |||||||| |||||||| | CATGGTAGAG GCCGTTATTT TGGAGGACGT GGTGGTCGAG GCAATGGTGG TCACCAAAAC 502 AGCCGGGGTG GCGGGCAAGT TGGAACTACT GCAGCACAAC ATGGTAAGGG CAACGGGCAG 22960 || | ||| || ||| ||| | ||| || ||| || |||||| ||| ||| || || GGCAGAGGTA ATGGACAAAC TGGGGCCACT ACATCACGAC ATGGTAGGGG CAATGGACAA 562 ACAGGTGATA GGGCCCATTG TTATGATTTC CCCGAGAGGT TTGAAGCAGA GACATCTGAT 23020 ||| ||| |||||||||| ||| | |||| || | | ||| | |||| || | |||||||| ACAAACTATA GGGCCCATTG TTACGCTTTC CCTGGGCGGT CTAAAGCGGA GGCATCTGAT 622 GCTGTTATCA CAGGTAATCT TTTGGTTTGT GATTGCATGG CTTCTGTATT ATTTGATCCT 23080 ||||| |||| |||||||||| | | |||||| |||||||||| |||| ||||| ||||||||| GCTGTCATCA CAGGTAATCT TCTTGTTTGT GATTGCATGG CTTCCGTATT GTTTGATCCT 682 GGATCCACAT TTTCATATGT ATCTTCCTCA TTTGTTACTG GTCTTGATTT ACATTGTGAC 23140 |||||||| | |||| ||||| |||||||||| |||| || || |||| |||| ||||||||| GGATCCACGT TTTCTTATGT ATCTTCCTCA TTTGCTAATG GTCTAAATTT ACATTGTGAA 742 TTGCTTGACA TGCCTATTCG T 23161 || ||||| | |||||||||| | TTTCTTGATA TGCCTATTCG T 763 hqPGS_C06HBa0153O03.1-1+_SGN-E214046- (22364 22692,22729 23161) ******************************************************************************** EST sequence 156 -strand 559 n (File: SGN-E244046-) 1 GGTGAGTTTA ATGGTGCTTA CACTAGAGGA CAGGGTTCGG GAAGTTACTC AGTCCGACCA 61 ATTCAGTCTT CACTACAGAC TGTAGTTGGG GGTCCACCTT CGACCGGTCA ACACTTCTCT 121 GAGAGACTTA TGCATGAACC CAGAGAGTGC TATGGGTGTG GGGAGATTGG ACATATTAAG 181 AGATATTGTC CAAAACAGAG TTACAGACCT CCAAAGGTTA GAGGTAGAGG TGGTCATGGC 241 AGAGTCCGTT ATTCTGGAGG ACGTGGCGGT CGAGGAAATG GTGGTCACCA AAACGGCCGA 301 GGTGATGGGC AAACTGGAGC CACTACATCA CAACATGGTA GGGGCAACAG ACATAAGAAT 361 GATAGGGCCC ATTGTTACGC TTTCCCTGGG CGGTCTGAAG CGGAGGAATC TGATGTTGTC 421 ATCACAGGTA ATCTTTTGGT TTGTGATTGC ATGGCTTCTG TATTGTTTGA TCCTGGATCC 481 ACATTTTCTT ATGTATCTTC CTCATTTGCT AATGGTCTAA CTTTACATTG TGAATTACTT 541 GATATGCCTA TTCGTGTTT Predicted gene structure (within gDNA segment 18217 to 24224): Exon 1 22571 22692 ( 122 n); cDNA 1 122 ( 122 n); score: 0.828 Intron 1 22693 22728 ( 36 n); Pd: 0.000 (s: 0.84), Pa: 0.000 (s: 0.78) Exon 2 22729 23163 ( 435 n); cDNA 123 557 ( 435 n); score: 0.839 MATCH C06HBa0153O03.1-1+ SGN-E244046- 0.837 557 0.996 C PGS_C06HBa0153O03.1-1+_SGN-E244046- (22571 22692,22729 23163) Alignment (genomic DNA sequence = upper lines): GGTGAGTTTA GTGGTTCTTA CTCCAGAGGG CAGAGTTCAG GAGGTTACCC AGCCTGACCT 22630 |||||||||| |||| |||| | | ||||| ||| |||| | || ||||| | || | |||| GGTGAGTTTA ATGGTGCTTA CACTAGAGGA CAGGGTTCGG GAAGTTACTC AGTCCGACCA 60 ATTCAGTCGT CACTACAGGC TGTAGCTGGG GGTCCATCGC AGACCAGTCA ACATTTCTCT 22690 |||||||| | |||||||| | ||||| |||| |||||| | |||| |||| ||| |||||| ATTCAGTCTT CACTACAGAC TGTAGTTGGG GGTCCACCTT CGACCGGTCA ACACTTCTCT 120 GAGTTTGGAG GTTATCCCCA GACTTCGTCA TTCTCTTAGA GACCTATGCT TGACTCCAGA 22750 || || ||| ||||| ||| ||||| GA........ .......... .......... ........GA GACTTATGCA TGAACCCAGA 144 GATTGTAGTG GATGTGGAGA GACTGGACAT ATTAGGAGGT ATTGTCCAAA ATAGAGTTAC 22810 || || || | ||||| || || ||||||| |||| ||| | |||||||||| | |||||||| GAGTGCTATG GGTGTGGGGA GATTGGACAT ATTAAGAGAT ATTGTCCAAA ACAGAGTTAC 204 AGACCCCCAA TAATTAGAGG TAGAGGAAAT CATGGGAGAG GCCGCCATTA TGGAGGACGT 22870 ||||| |||| ||||||| |||||| | ||||| |||| ||| ||| |||||||||| AGACCTCCAA AGGTTAGAGG TAGAGGTGGT CATGGCAGAG TCCGTTATTC TGGAGGACGT 264 GGTGGCCAAG GTAATGGTGG TCACCAAATC AGCCGGGGTG GCGGGCAAGT TGGAACTACT 22930 || || | || | |||||||| |||||||| | |||| |||| |||||| |||| | ||| GGCGGTCGAG GAAATGGTGG TCACCAAAAC GGCCGAGGTG ATGGGCAAAC TGGAGCCACT 324 GCAGCACAAC ATGGTAAGGG CAACGGGCAG ACAGGTGATA GGGCCCATTG TTATGATTTC 22990 || |||||| |||||| ||| |||| | || | ||||| |||||||||| ||| | |||| ACATCACAAC ATGGTAGGGG CAACAGACAT AAGAATGATA GGGCCCATTG TTACGCTTTC 384 CCCGAGAGGT TTGAAGCAGA GACATCTGAT GCTGTTATCA CAGGTAATCT TTTGGTTTGT 23050 || | | ||| |||||| || | ||||||| | ||| |||| |||||||||| |||||||||| CCTGGGCGGT CTGAAGCGGA GGAATCTGAT GTTGTCATCA CAGGTAATCT TTTGGTTTGT 444 GATTGCATGG CTTCTGTATT ATTTGATCCT GGATCCACAT TTTCATATGT ATCTTCCTCA 23110 |||||||||| |||||||||| ||||||||| |||||||||| |||| ||||| |||||||||| GATTGCATGG CTTCTGTATT GTTTGATCCT GGATCCACAT TTTCTTATGT ATCTTCCTCA 504 TTTGTTACTG GTCTTGATTT ACATTGTGAC TTGCTTGACA TGCCTATTCG TGT 23163 |||| || || |||| ||| ||||||||| || ||||| | |||||||||| ||| TTTGCTAATG GTCTAACTTT ACATTGTGAA TTACTTGATA TGCCTATTCG TGT 557 hqPGS_C06HBa0153O03.1-1+_SGN-E244046- (22571 22692,22729 23163) ******************************************************************************** EST sequence 195 -strand 720 n (File: SGN-E356614-) 1 AGCCCATCTG ATACTAGGGC AGTGACTCTT CCACTGACTG AGGAAGTAAT AGAGAAAGGG 61 AGGGATGGGG AAACTGANCA AGTGCAAAAT GAGGAAATGC CACCCCAACC TACCCCAGAG 121 ATGATCAATC AGGTTCTGGC TTATCTTAGC GGGTTATCTG ATCAAGGTCA AGCACCTCCA 181 GTGTTTTCTG CGCCAACACC TCCGGTTTCA GAAGTACAAC ATGCGGCTAC TATGGCTCCC 241 CGCATGGATG TTCCATTGGA CATAGGCACA TTTCCACGTT TGACTACTGT CCCTATAATG 301 ACAAATGATC AGCATGAACT TTTCAGTAAG TTCTTGAAAT TGAAACCTCC GGTCTTCAAG 361 GGTGCTGAAT CGGAGGATGC TTATGATTTT CTGGTTGACT GTCACGAGCT ACTACACAAG 421 ATGGGTATAG TAGAACGGTT TGGTGTGGAG TTTGTGACTT ATCAGTTTCA AGGGAACGCC 481 CAAATGTGGT GGCGGTCACA TATCGAGTGT CAACCAATAG AGGCATCACC TATGACTTGG 541 GCCTCATTCT ATAGTTTGTT TATGGAGAAG TATATCCCCC GGACTTTGAG GGATAAGAAA 601 AGGGATGAGT TCTTGAGCCT AGAGCAAGGT AGGGTGTCAG TTAATGCTTA TGAGGCTAAG 661 TTTCGTGCAC TATCCAGATA TGCCACTCAA CTCTGTTTCA GTCCTCAAGA GCGGATTCGC Predicted gene structure (within gDNA segment 20190 to 23901): Exon 1 21698 22410 ( 713 n); cDNA 8 720 ( 713 n); score: 0.838 MATCH C06HBa0153O03.1-1+ SGN-E356614- 0.838 713 0.990 C PGS_C06HBa0153O03.1-1+_SGN-E356614- (21698 22410) Alignment (genomic DNA sequence = upper lines): CTAGTAATAG GGTGGTGACT CCTCCACCGA CTGATGAGGT AGTA-AGAGA GGGTGAGGAA 21756 || || ||| || |||||| | ||||| || |||| || || | || ||| | ||| | ||| CTGATACTAG GGCAGTGACT CTTCCACTGA CTGAGGAAGT AATAGAGAAA GGGAG-GGAT 66 GGGGAAAATA AACAGGTGCA AGATGAGGAA TTACCACCCC AACCTACCCC AGAGATGATC 21816 ||||||| | | || ||||| | |||||||| | ||||||| |||||||||| |||||||||| GGGGAAACTG ANCAAGTGCA AAATGAGGAA ATGCCACCCC AACCTACCCC AGAGATGATC 126 AACCAGGTTC TTACTTATCT TAGCGGGTTA TCTGATCGAG GCCAGACACC TCCAGTGTTT 21876 || ||||||| | ||||||| |||||||||| ||||||| || | || |||| |||||||||| AATCAGGTTC TGGCTTATCT TAGCGGGTTA TCTGATCAAG GTCAAGCACC TCCAGTGTTT 186 CTTGTACCAG CACCTCAGGT TCCAGGAGTA CAACATGCAA CTGTTGTGGC TCCCCGCATG 21936 || ||| |||||| ||| | ||| |||| |||||||| || | |||| |||||||||| TCTGCGCCAA CACCTCCGGT TTCAGAAGTA CAACATGCGG CTACTATGGC TCCCCGCATG 246 GATGCCTCAT TGGAAGTAGG CACGTTTCCT CGATTGACTA CAGGGTCTAT AATGACAAGT 21996 |||| ||| |||| |||| ||| ||||| || ||||||| | | |||| |||||||| | GATGTTCCAT TGGACATAGG CACATTTCCA CGTTTGACTA CTGTCCCTAT AATGACAAAT 306 GATCAACATG AACTTTTCAC TAAATTCTTA AAGTTGAAAC CTCCTGTCTT CAAGGGTGCT 22056 ||||| |||| ||||||||| ||| ||||| || ||||||| |||| ||||| |||||||||| GATCAGCATG AACTTTTCAG TAAGTTCTTG AAATTGAAAC CTCCGGTCTT CAAGGGTGCT 366 AAATCTGAGG ATGCCTATGA TTTTCTGGTT GATTGTCATG AGCTGCTACA TAAGATGGAC 22116 |||| |||| |||| ||||| |||||||||| || ||||| | |||| ||||| ||||||| GAATCGGAGG ATGCTTATGA TTTTCTGGTT GACTGTCACG AGCTACTACA CAAGATGGGT 426 ATAGTAGAAC GATTCGGTGT TGATTTTGTG ACCTACCAGT TTCAGGGGAA TGCCAAAATG 22176 |||||||||| | || ||||| || |||||| || || |||| |||| ||||| ||| ||||| ATAGTAGAAC GGTTTGGTGT GGAGTTTGTG ACTTATCAGT TTCAAGGGAA CGCCCAAATG 486 TGGTGGCGGT CGTATGTTGA GTGTCAACCA GCACAGGCAC CACCTATGAC TTGGGAATCA 22236 |||||||||| | || | || |||||||||| | ||||| |||||||||| ||||| ||| TGGTGGCGGT CACATATCGA GTGTCAACCA ATAGAGGCAT CACCTATGAC TTGGGCCTCA 546 TTCTCTAGCT TATTTATGGA GAAGTATATA CCCCGGACTT TGAGGGATAG GAGGAGAGAT 22296 |||| ||| | | |||||||| ||||||||| |||||||||| ||||||||| || || ||| TTCTATAGTT TGTTTATGGA GAAGTATATC CCCCGGACTT TGAGGGATAA GAAAAGGGAT 606 GAGTTCTTGA GCCTATAGCA AGGAAGGATG TCTGTTGCCG CTTATGAGGC CAAATTTCGT 22356 |||||||||| ||||| |||| ||| ||| || || ||| | |||||||||| || |||||| GAGTTCTTGA GCCTAGAGCA AGGTAGGGTG TCAGTTAATG CTTATGAGGC TAAGTTTCGT 666 GCGCTATCCA GGTATGCCAC CCAGCTTTGC TTCAGTCCAT AAGAGCGGAT TCGC 22410 || ||||||| | |||||||| || || || |||||||| |||||||||| |||| GCACTATCCA GATATGCCAC TCAACTCTGT TTCAGTCCTC AAGAGCGGAT TCGC 720 hqPGS_C06HBa0153O03.1-1+_SGN-E356614- (21698 22410) ******************************************************************************** EST sequence 123 -strand 664 n (File: SGN-E352401-) 1 AGAAGGGGAG GATGGGGAAA CTGAACAAGT GCAAATGAGG GAAATGGCAC CCCAACCTAC 61 CCCAGAGATG ATCAATCAGG TTCTGGCTTA TCTTAGCGGG TTATCTGATC AAGGTCAAGC 121 ACCTCCAGTG TTTTCTGCGC CAACACCTCC GGTTTCAGAA GTACAACATG CGGCTACTAT 181 GGCTCCCCGC ATGGATGTTC CATTGGACAT AGGCACATTT CCACGTTTGA CTACTGTCCC 241 TATAATGACA AATGATCAGC ATGAACTTTT CAGTAAGTTC TTGAAATTGA AACCTCCGGT 301 CTTCAAGGGT GCTGAATCGG AGGATGCTTA TGATTTTCTG GTTGACTGTC ACGAGCTACT 361 ACACAAGATG GGTATAGTAG AACGGTTTGG TGTGGAGTTT GTGACTTATC AGTTTCAAGG 421 GAACGCCCAA ATGTGGTGGC GGTCACATAT CGAGTGTCAA CCAATAGAGG CATCACCTAT 481 GACTTGGGCC TCATTCTATA GTTTGTTTAT GGAGAAGTAT ATCCCCCGGA CTTTGAGGGA 541 TAAGAAAAGG GATGAGTTCT TGAGCCTAGA GCAAGGTAGG GTGTCAGTTA ATGCTTATGA 601 GGCTAAGTTT CGTGCACTAT CCAGATATGC CACTCAACTC TGTTTCAGTC CTCAAGAGCG 661 GATT Predicted gene structure (within gDNA segment 20711 to 23871): Exon 1 21744 22407 ( 664 n); cDNA 1 664 ( 664 n); score: 0.843 MATCH C06HBa0153O03.1-1+ SGN-E352401- 0.843 664 1.000 C PGS_C06HBa0153O03.1-1+_SGN-E352401- (21744 22407) Alignment (genomic DNA sequence = upper lines): AGAGGGTGAG GAAGGGGAAA ATAAACAGGT GCAAGATGA- GGAATTACCA CCCCAACCTA 21802 ||| || ||| || ||||||| | |||| || |||| |||| |||| | || |||||||||| AGAAGGGGAG GATGGGGAAA CTGAACAAGT GCAA-ATGAG GGAAATGGCA CCCCAACCTA 59 CCCCAGAGAT GATCAACCAG GTTCTTACTT ATCTTAGCGG GTTATCTGAT CGAGGCCAGA 21862 |||||||||| |||||| ||| ||||| ||| |||||||||| |||||||||| | ||| || CCCCAGAGAT GATCAATCAG GTTCTGGCTT ATCTTAGCGG GTTATCTGAT CAAGGTCAAG 119 CACCTCCAGT GTTTCTTGTA CCAGCACCTC AGGTTCCAGG AGTACAACAT GCAACTGTTG 21922 |||||||||| |||| || ||| |||||| |||| ||| |||||||||| || || | CACCTCCAGT GTTTTCTGCG CCAACACCTC CGGTTTCAGA AGTACAACAT GCGGCTACTA 179 TGGCTCCCCG CATGGATGCC TCATTGGAAG TAGGCACGTT TCCTCGATTG ACTACAGGGT 21982 |||||||||| |||||||| ||||||| ||||||| || ||| || ||| ||||| | TGGCTCCCCG CATGGATGTT CCATTGGACA TAGGCACATT TCCACGTTTG ACTACTGTCC 239 CTATAATGAC AAGTGATCAA CATGAACTTT TCACTAAATT CTTAAAGTTG AAACCTCCTG 22042 |||||||||| || |||||| |||||||||| ||| ||| || ||| || ||| |||||||| | CTATAATGAC AAATGATCAG CATGAACTTT TCAGTAAGTT CTTGAAATTG AAACCTCCGG 299 TCTTCAAGGG TGCTAAATCT GAGGATGCCT ATGATTTTCT GGTTGATTGT CATGAGCTGC 22102 |||||||||| |||| |||| |||||||| | |||||||||| |||||| ||| || ||||| | TCTTCAAGGG TGCTGAATCG GAGGATGCTT ATGATTTTCT GGTTGACTGT CACGAGCTAC 359 TACATAAGAT GGACATAGTA GAACGATTCG GTGTTGATTT TGTGACCTAC CAGTTTCAGG 22162 |||| ||||| || |||||| ||||| || | |||| || || |||||| || |||||||| | TACACAAGAT GGGTATAGTA GAACGGTTTG GTGTGGAGTT TGTGACTTAT CAGTTTCAAG 419 GGAATGCCAA AATGTGGTGG CGGTCGTATG TTGAGTGTCA ACCAGCACAG GCACCACCTA 22222 |||| ||| | |||||||||| ||||| || | |||||||| |||| | || ||| |||||| GGAACGCCCA AATGTGGTGG CGGTCACATA TCGAGTGTCA ACCAATAGAG GCATCACCTA 479 TGACTTGGGA ATCATTCTCT AGCTTATTTA TGGAGAAGTA TATACCCCGG ACTTTGAGGG 22282 ||||||||| ||||||| | || || |||| |||||||||| ||| |||||| |||||||||| TGACTTGGGC CTCATTCTAT AGTTTGTTTA TGGAGAAGTA TATCCCCCGG ACTTTGAGGG 539 ATAGGAGGAG AGATGAGTTC TTGAGCCTAT AGCAAGGAAG GATGTCTGTT GCCGCTTATG 22342 ||| || || ||||||||| ||||||||| ||||||| || | |||| ||| ||||||| ATAAGAAAAG GGATGAGTTC TTGAGCCTAG AGCAAGGTAG GGTGTCAGTT AATGCTTATG 599 AGGCCAAATT TCGTGCGCTA TCCAGGTATG CCACCCAGCT TTGCTTCAGT CCATAAGAGC 22402 |||| || || |||||| ||| ||||| |||| |||| || || || |||||| || |||||| AGGCTAAGTT TCGTGCACTA TCCAGATATG CCACTCAACT CTGTTTCAGT CCTCAAGAGC 659 GGATT 22407 ||||| GGATT 664 hqPGS_C06HBa0153O03.1-1+_SGN-E352401- (21744 22407) ******************************************************************************** EST sequence 186 -strand 543 n (File: SGN-E355026-) 1 GCAGAGGCGT CTGATGCTGT TATCACAGGT AATCTCTTGG TTTGTGATTG CATGGCATCT 61 GTATTATTTG ACCCTGGCTC CACGTTTTCA TATGTATCTT CATCATTTGC TAATGGTCTA 121 AAATTGCATT GTGAATTACT TGACATGCCT ATTCGTGTTT CTACTCCGGT GGGTGAGTCT 181 GTGATAGTTG AAAAGGTACA TAGGTCTTGT TTGGTGAATT TCGTGGGGAG CAACACTTAT 241 CTAGATTTGG TTACTTTAGA AATGGGTGAC TTTGATGTAA TTCTGGGTAT GACTTGGCTT 301 TCTCCCAATT TTGCGATCTT GGATTGTAAT GCTAAAACGG TGACGTTAGC CAAGCCTGGG 361 ACAGATCCGT TAGTGTGGGA GGGTGACTAC ACTTCCAATC CGGTGCGTAT TATCTCCTTT 421 CTTTGTGCTC AGAAGATGGT TAGTAAGGGT TGTCTATCCT TCTTGGCACA TCTCAAGGAT 481 GACACTATCC AAGTACCCTC AATTGAATTA GTTTTGGTAG TTTGAGAATT TTTGGATGTG 541 TTC Predicted gene structure (within gDNA segment 22315 to 27316): Exon 1 23006 23547 ( 542 n); cDNA 1 542 ( 542 n); score: 0.845 MATCH C06HBa0153O03.1-1+ SGN-E355026- 0.845 542 0.998 C PGS_C06HBa0153O03.1-1+_SGN-E355026- (23006 23547) Alignment (genomic DNA sequence = upper lines): GCAGAGACAT CTGATGCTGT TATCACAGGT AATCTTTTGG TTTGTGATTG CATGGCTTCT 23065 |||||| | | |||||||||| |||||||||| ||||| |||| |||||||||| |||||| ||| GCAGAGGCGT CTGATGCTGT TATCACAGGT AATCTCTTGG TTTGTGATTG CATGGCATCT 60 GTATTATTTG ATCCTGGATC CACATTTTCA TATGTATCTT CCTCATTTGT TACTGGTCTT 23125 |||||||||| | ||||| || ||| |||||| |||||||||| | ||||||| || |||||| GTATTATTTG ACCCTGGCTC CACGTTTTCA TATGTATCTT CATCATTTGC TAATGGTCTA 120 GATTTACATT GTGACTTGCT TGACATGCCT ATTCGTGTCT TTACTCCTGT GGGTGAGTCT 23185 | || |||| |||| || || |||||||||| |||||||| | |||||| || |||||||||| AAATTGCATT GTGAATTACT TGACATGCCT ATTCGTGTTT CTACTCCGGT GGGTGAGTCT 180 GTGATAGTTG ATAAGGTGTA TAGGTCTTGT CTTGTGGTTT TTATGGGGAG CAATACTCAT 23245 |||||||||| | ||||| | |||||||||| | ||| || | ||||||| ||| ||| || GTGATAGTTG AAAAGGTACA TAGGTCTTGT TTGGTGAATT TCGTGGGGAG CAACACTTAT 240 TTAGATTTGA TTATTCTAGA GATGGTTGAT TTCGATGTAA TTTTGGGTAT GACTTGGCTT 23305 |||||||| ||| | |||| |||| ||| || ||||||| || ||||||| |||||||||| CTAGATTTGG TTACTTTAGA AATGGGTGAC TTTGATGTAA TTCTGGGTAT GACTTGGCTT 300 TCTCCAAACT TTGCAATCTT AGATTGTAAC GCTAAAACTG TGACATTGAC CAAGCCTGGG 23365 ||||| || | |||| ||||| |||||||| |||||||| | |||| || | |||||||||| TCTCCCAATT TTGCGATCTT GGATTGTAAT GCTAAAACGG TGACGTTAGC CAAGCCTGGG 360 ACAGATCCGC TAGTATGGGA GGGTGACTAT ATTTCCACCC TAGTTCATAT TATCTCTTTT 23425 ||||||||| |||| ||||| ||||||||| | ||||| | || | ||| |||||| ||| ACAGATCCGT TAGTGTGGGA GGGTGACTAC ACTTCCAATC CGGTGCGTAT TATCTCCTTT 420 CTTCGTGCTA AGAGGATGGT TAGTAGGGGT TGTTTAGCTT TCTTGGCCCA TCTCAGGGAT 23485 ||| ||||| ||| |||||| ||||| |||| ||| || | | ||||||| || ||||| |||| CTTTGTGCTC AGAAGATGGT TAGTAAGGGT TGTCTATCCT TCTTGGCACA TCTCAAGGAT 480 GATACTTCCA AGGTACCTTC GATTGAGTCT GTTTCGATAG TCTGTGAGTT TCTGGATGTG 23545 || ||| | | ||||| || ||||| | |||| | ||| | || || || | |||||||| GACACTATCC AAGTACCCTC AATTGAATTA GTTTTGGTAG TTTGAGAATT TTTGGATGTG 540 TT 23547 || TT 542 hqPGS_C06HBa0153O03.1-1+_SGN-E355026- (23006 23547) ******************************************************************************** EST sequence 183 -strand 761 n (File: SGN-E355244-) 1 ATTAGATGCT GTCATCACAG GTAATCTTTC GATTGTGATT GCATGGCTTC TGTATTGTTG 61 ATCCTGGGAT CCACGTTTTC TANTGTATCT TCCTCATTTG CTAACGGTCT AAATTTACAT 121 TGTGAATTAC TTGATATGCC TATTCGTGTT TCTACTCCGG TGGGTGAGTC TGTGGTAGTT 181 GAAAAGGTAT ATAGGTCTTG TTTGGTGAAC TTTGTGGGGA GCAACACTTA TGTAGATTTG 241 GTTATCTTAG AAATGGTTGA TTTTGATGTA ATTCTGGGTA TGACTTGGCT TTCTCCGCAA 301 TTTGCGATCT TGGATTGTAA TGCTAAAACG GTGACATTAG CCAAGCCTGG GACAGATCCG 361 TTAGTGTGGG AGGGTGACTA CACTTCCAAT CCGGTGTGCA TCATCTCCTT TCTTCGTGCT 421 AAGAAAATGG TTAGTAAAGG GTGTTTAGCT TTCTTGGCAC ATCTCAAGGA TGACACTACC 481 CAAGTACCTT CGATTGAGTC GGTTTTTATA GTCTGTGAGT TTTTGGATGT GTTCCCTGCA 541 GATCTTCCTG GTATGCCACC AGATAGGGAT ATTGACTTCT GTATCGATCT TGAACCGGGC 601 ACACGCCCCA TTTCTATACC CCCTTATAGA ATGGCTCCCG TAGAGTTAAG AGAGTTAAAG 661 GCCCAACTTC AAGAGTTGTT GAGCAAAGGT TTCATTAGAC CAAGTGCATC TCCTTGGGGT 721 GCTCCGATTT TGTTTGTGAA GAAGAAGGAT GGGAGTTTCC G Predicted gene structure (within gDNA segment 18733 to 25185): Exon 1 23014 23775 ( 762 n); cDNA 1 761 ( 761 n); score: 0.850 MATCH C06HBa0153O03.1-1+ SGN-E355244- 0.850 762 1.001 C PGS_C06HBa0153O03.1-1+_SGN-E355244- (23014 23775) Alignment (genomic DNA sequence = upper lines): ATCTGATGCT GTTATCACAG GTAATCTTTT GGTTTGTGAT TGCATGGCTT CTGTATTATT 23073 || |||||| || ||||||| |||||| ||| | ||||||| |||||||||| ||||||| | ATTAGATGCT GTCATCACAG GTAATC-TTT CGATTGTGAT TGCATGGCTT CTGTATT-GT 58 TGATCCT-GG ATCCACATTT TCATATGTAT CTTCCTCATT TGTTACTGGT CTTGATTTAC 23132 ||||||| || |||||| ||| || ||||| |||||||||| || || ||| || |||||| TGATCCTGGG ATCCACGTTT TCTANTGTAT CTTCCTCATT TGCTAACGGT CTAAATTTAC 118 ATTGTGACTT GCTTGACATG CCTATTCGTG TCTTTACTCC TGTGGGTGAG TCTGTGATAG 23192 ||||||| || ||||| ||| |||||||||| | | |||||| ||||||||| |||||| ||| ATTGTGAATT ACTTGATATG CCTATTCGTG TTTCTACTCC GGTGGGTGAG TCTGTGGTAG 178 TTGATAAGGT GTATAGGTCT TGTCTTGTGG TTTTTATGGG GAGCAATACT CATTTAGATT 23252 |||| ||||| ||||||||| ||| | ||| ||| |||| |||||| ||| || |||||| TTGAAAAGGT ATATAGGTCT TGTTTGGTGA ACTTTGTGGG GAGCAACACT TATGTAGATT 238 TGATTATTCT AGAGATGGTT GATTTCGATG TAATTTTGGG TATGACTTGG CTTTCTCCAA 23312 || |||| | ||| |||||| ||||| |||| ||||| |||| |||||||||| |||||||| TGGTTATCTT AGAAATGGTT GATTTTGATG TAATTCTGGG TATGACTTGG CTTTCTCCGC 298 ACTTTGCAAT CTTAGATTGT AACGCTAAAA CTGTGACATT GACCAAGCCT GGGACAGATC 23372 | ||||| || ||| |||||| || ||||||| | |||||||| |||||||| |||||||||| AATTTGCGAT CTTGGATTGT AATGCTAAAA CGGTGACATT AGCCAAGCCT GGGACAGATC 358 CGCTAGTATG GGAGGGTGAC TATATTTCCA CCCTAGTTCA TATTATCTCT TTTCTTCGTG 23432 || |||| || |||||||||| || | ||||| | || || ||||| |||||||||| CGTTAGTGTG GGAGGGTGAC TACACTTCCA ATCCGGTGTG CATCATCTCC TTTCTTCGTG 418 CTAAGAGGAT GGTTAGTAGG GGTTGTTTAG CTTTCTTGGC CCATCTCAGG GATGATACTT 23492 |||||| || |||||||| || ||||||| |||||||||| ||||||| | ||||| ||| CTAAGAAAAT GGTTAGTAAA GGGTGTTTAG CTTTCTTGGC ACATCTCAAG GATGACACTA 478 CCAAGGTACC TTCGATTGAG TCTGTTTCGA TAGTCTGTGA GTTTCTGGAT GTGTTTCCTG 23552 || | ||||| |||||||||| || |||| | |||||||||| |||| ||||| ||||| |||| CCCAAGTACC TTCGATTGAG TCGGTTTTTA TAGTCTGTGA GTTTTTGGAT GTGTTCCCTG 538 CAGACCTTCC TGGTATGCCA CCAGATAGGG ATATTGATTT TTGTATTGAT CTCGAGCCGG 23612 |||| ||||| |||||||||| |||||||||| ||||||| || ||||| ||| || || |||| CAGATCTTCC TGGTATGCCA CCAGATAGGG ATATTGACTT CTGTATCGAT CTTGAACCGG 598 GTACTCGCCC CATTTCCATA CCCCCTTATA GAATGACCCT ATCTGAGTTA AGGGAGTTAA 23672 | || ||||| |||||| ||| |||||||||| ||||| | | |||||| || ||||||| GCACACGCCC CATTTCTATA CCCCCTTATA GAATGGCTCC CGTAGAGTTA AGAGAGTTAA 658 AGGCCCAACT TCAGGAGTTG TTAGGTAAAG ACTTTACTAG ACCAAGTTCA TCCCCTTGGG 23732 |||||||||| ||| |||||| || | |||| || | ||| ||||||| || || ||||||| AGGCCCAACT TCAAGAGTTG TTGAGCAAAG GTTTCATTAG ACCAAGTGCA TCTCCTTGGG 718 GTGCTCCTGT TTTATTTGTG AAGAAGAAGG ATGGAAGTTT TCG 23775 ||||||| | ||| |||||| |||||||||| |||| ||||| || GTGCTCCGAT TTTGTTTGTG AAGAAGAAGG ATGGGAGTTT CCG 761 hqPGS_C06HBa0153O03.1-1+_SGN-E355244- (23014 23775) ******************************************************************************** EST sequence 145 -strand 331 n (File: SGN-E352716-) 1 TTTTTTTGTG ATTGCATGGC TTTTGTATTA TTTGACCCTG GATCCACATT TTCATATGTA 61 TCTTCCTCAT TTGCTACTGG TCTTAAATTA AATTATGAAT TGCTTGACAT GCCTATTCGT 121 GTTTCTACTC CGGTGGGTGA GTCTGTGATA GTTGAAAAAG TATATAGGTC TGGTCTGGTG 181 ACTTTTGTGG GGAGCAATAC TTATGTAGAC TTGGTTATCT TAGAAATGGT TGATTTTGAT 241 GTAATTCTGG GTATGACTTG GCTTTCTCCA AATTTTGCAA TCTTGGATTG TAATGCTAAA 301 ACTGTGACGT TAGCCAAGCC TGGGACAGAT C Predicted gene structure (within gDNA segment 22396 to 25556): Exon 1 23042 23372 ( 331 n); cDNA 1 331 ( 331 n); score: 0.891 MATCH C06HBa0153O03.1-1+ SGN-E352716- 0.891 331 1.000 C PGS_C06HBa0153O03.1-1+_SGN-E352716- (23042 23372) Alignment (genomic DNA sequence = upper lines): TTGGTTTGTG ATTGCATGGC TTCTGTATTA TTTGATCCTG GATCCACATT TTCATATGTA 23101 || |||||| |||||||||| || ||||||| ||||| |||| |||||||||| |||||||||| TTTTTTTGTG ATTGCATGGC TTTTGTATTA TTTGACCCTG GATCCACATT TTCATATGTA 60 TCTTCCTCAT TTGTTACTGG TCTTGATTTA CATTGTGACT TGCTTGACAT GCCTATTCGT 23161 |||||||||| ||| |||||| |||| | ||| ||| ||| | |||||||||| |||||||||| TCTTCCTCAT TTGCTACTGG TCTTAAATTA AATTATGAAT TGCTTGACAT GCCTATTCGT 120 GTCTTTACTC CTGTGGGTGA GTCTGTGATA GTTGATAAGG TGTATAGGTC TTGTCTTGTG 23221 || | ||||| | |||||||| |||||||||| ||||| || | | |||||||| | |||| ||| GTTTCTACTC CGGTGGGTGA GTCTGTGATA GTTGAAAAAG TATATAGGTC TGGTCTGGTG 180 GTTTTTATGG GGAGCAATAC TCATTTAGAT TTGATTATTC TAGAGATGGT TGATTTCGAT 23281 |||| ||| |||||||||| | || |||| ||| |||| |||| ||||| |||||| ||| ACTTTTGTGG GGAGCAATAC TTATGTAGAC TTGGTTATCT TAGAAATGGT TGATTTTGAT 240 GTAATTTTGG GTATGACTTG GCTTTCTCCA AACTTTGCAA TCTTAGATTG TAACGCTAAA 23341 |||||| ||| |||||||||| |||||||||| || ||||||| |||| ||||| ||| |||||| GTAATTCTGG GTATGACTTG GCTTTCTCCA AATTTTGCAA TCTTGGATTG TAATGCTAAA 300 ACTGTGACAT TGACCAAGCC TGGGACAGAT C 23372 |||||||| | | ||||||| |||||||||| | ACTGTGACGT TAGCCAAGCC TGGGACAGAT C 331 hqPGS_C06HBa0153O03.1-1+_SGN-E352716- (23042 23372) ******************************************************************************** EST sequence 50 +strand 659 n (File: SGN-E352117+) 1 TTTGATCCTG CCTCCACATT TTCATATGTA TCTTCCTCAT TTGCTACTGT TCTTAATTTA 61 CATAGTGAAT CGCTTGACAT ACCTATTCGT GTTCTACTCC GGTGGGTGAG TCTGTGATTG 121 TTGAAAAGGT GTATAGGTCT TGTCTTGTGA CATTTGTGGG AGCAATACTC ATGTAGACTT 181 GGTTATCCTA TAAATGGTTG ACTTCGATGT AATTCTGGGT ATGACTTGGC TGTCTCCAAA 241 TTTTGCAATC TTGGATTGTA ATGCTAAAAC TGTAACGTTG GCCAAGCCTG AGATAGATGC 301 GTTAGTGTGG GAGGGTGACT ACACTTCCAC TCCAGTTCGT ATCATCTCCT TTCTTTGTGC 361 TAAGAGAATG GTTAGTAAAG GGTGTTTAGC TTTCTTGGCA CACCTCAGGG ATGATACTAC 421 CCAAGTACCT TCAATTGAGT CAGTTTCGAT AGTCCGTGAG TTTCTGGATG TGTTTCCTGC 481 AGACCTTCCT GGTATGCCAC CGGATAGGGA TATTGACTTT TGCATTGATC TGGAGCCGGG 541 TGATCACCCC ATTTCCATAC CCCCTTATAG AATGGCTCCC GCTGAGTTGG GGGAGTTAAA 601 GGCCCAACTT CAAGAGTTGT TANGTAAGGG CTTCATTANG CCAAGTGCAT CCCCTTGGG Predicted gene structure (within gDNA segment 22364 to 24765): Exon 1 23072 23732 ( 661 n); cDNA 1 659 ( 659 n); score: 0.876 MATCH C06HBa0153O03.1-1+ SGN-E352117+ 0.876 661 1.003 C PGS_C06HBa0153O03.1-1+_SGN-E352117+ (23072 23732) Alignment (genomic DNA sequence = upper lines): TTTGATCCTG GATCCACATT TTCATATGTA TCTTCCTCAT TTGTTACTGG TCTTGATTTA 23131 |||||||||| |||||||| |||||||||| |||||||||| ||| ||||| |||| ||||| TTTGATCCTG CCTCCACATT TTCATATGTA TCTTCCTCAT TTGCTACTGT TCTTAATTTA 60 CATTGTGACT TGCTTGACAT GCCTATTCGT GTCTTTACTC CTGTGGGTGA GTCTGTGATA 23191 ||| |||| | ||||||||| ||||||||| || | ||||| | |||||||| ||||||||| CATAGTGAAT CGCTTGACAT ACCTATTCGT GT-TCTACTC CGGTGGGTGA GTCTGTGATT 119 GTTGATAAGG TGTATAGGTC TTGTCTTGTG GTTTTTATGG GGAGCAATAC TCATTTAGAT 23251 ||||| |||| |||||||||| |||||||||| ||| | | |||||||||| |||| |||| GTTGAAAAGG TGTATAGGTC TTGTCTTGTG ACATTTGT-G GGAGCAATAC TCATGTAGAC 178 TTGATTATTC TAGAGATGGT TGATTTCGAT GTAATTTTGG GTATGACTTG GCTTTCTCCA 23311 ||| |||| | || | ||||| ||| |||||| |||||| ||| |||||||||| ||| |||||| TTGGTTATCC TATAAATGGT TGACTTCGAT GTAATTCTGG GTATGACTTG GCTGTCTCCA 238 AACTTTGCAA TCTTAGATTG TAACGCTAAA ACTGTGACAT TGACCAAGCC TGGGACAGAT 23371 || ||||||| |||| ||||| ||| |||||| ||||| || | || ||||||| || || |||| AATTTTGCAA TCTTGGATTG TAATGCTAAA ACTGTAACGT TGGCCAAGCC TGAGATAGAT 298 CCGCTAGTAT GGGAGGGTGA CTATATTTCC ACCCTAGTTC ATATTATCTC TTTTCTTCGT 23431 || |||| | |||||||||| ||| | |||| || | ||||| ||| ||||| |||||| || GCGTTAGTGT GGGAGGGTGA CTACACTTCC ACTCCAGTTC GTATCATCTC CTTTCTTTGT 358 GCTAAGAGGA TGGTTAGTAG GGGTTGTTTA GCTTTCTTGG CCCATCTCAG GGATGATACT 23491 |||||||| | ||||||||| || |||||| |||||||||| | || ||||| |||||||||| GCTAAGAGAA TGGTTAGTAA AGGGTGTTTA GCTTTCTTGG CACACCTCAG GGATGATACT 418 TCCAAGGTAC CTTCGATTGA GTCTGTTTCG ATAGTCTGTG AGTTTCTGGA TGTGTTTCCT 23551 || | |||| |||| ||||| ||| |||||| |||||| ||| |||||||||| |||||||||| ACCCAAGTAC CTTCAATTGA GTCAGTTTCG ATAGTCCGTG AGTTTCTGGA TGTGTTTCCT 478 GCAGACCTTC CTGGTATGCC ACCAGATAGG GATATTGATT TTTGTATTGA TCTCGAGCCG 23611 |||||||||| |||||||||| ||| |||||| |||||||| | |||| ||||| ||| |||||| GCAGACCTTC CTGGTATGCC ACCGGATAGG GATATTGACT TTTGCATTGA TCTGGAGCCG 538 GGTACTCGCC CCATTTCCAT ACCCCCTTAT AGAATGACCC TATCTGAGTT AAGGGAGTTA 23671 ||| || || |||||||||| |||||||||| |||||| | | ||||||| |||||||| GGTGATCACC CCATTTCCAT ACCCCCTTAT AGAATGGCTC CCGCTGAGTT GGGGGAGTTA 598 AAGGCCCAAC TTCAGGAGTT GTTAGGTAAA GACTTTACTA GACCAAGTTC ATCCCCTTGG 23731 |||||||||| |||| ||||| |||| |||| | ||| | || |||||| | |||||||||| AAGGCCCAAC TTCAAGAGTT GTTANGTAAG GGCTTCATTA NGCCAAGTGC ATCCCCTTGG 658 G 23732 | G 659 hqPGS_C06HBa0153O03.1-1+_SGN-E352117+ (23072 23732) ******************************************************************************** EST sequence 154 -strand 661 n (File: SGN-E351414-) 1 CTAACGGTCT AAATTTACAT TGTGAATTAC TTGATATGCC TATTCGTGTT TCTACTCCGG 61 TGGGTGAGTC TGTGGTAGTT GAAAAGGTAT ATAGGTCTTG TTTGGTGAAC TTTGTGGGGA 121 GCAACACTTA TGTAGATTTG GTTATCTTAG AAATGGTTGA TTTTGATGTA ATTCTGGGTA 181 TGACTTGGCT TTCTCCGCAA TTTGCGATCT TGGATTGTAA TGCTAAAACG GTGACATTAG 241 CCAAGCCTGG GACAGATCCG TTAGTGTGGG AGGGTGACTA CACTTCCAAT CCGGTGTGCA 301 TCATCTCCTT TCTTCGTGCT AAGAAAATGG TTAGTAAAGG GTGTTTAGCT TTCTTGGCAC 361 ATCTCAAGGA TGACACTACC CAAGTACCTT CGATTGAGTC GGTTTTTATA GTCTGTGAGT 421 TTTTGGATGT GTTCCCTGCA GATCTTCCTG GTATGCCACC AGATAGGGAT ATTGACTTCT 481 GTATCGATCT TGAACCGGGC ACACGCCCCA TTTCTATACC CCCTTATAGA ATGGCTCCCG 541 TAGAGTTAAG AGAGTTAAAG GCCCAACTTC AAGAGTTGTT GAGCAAAGGT TTCATTAGAC 601 CAAGTGCATC TCCTTGGGGT GCTCCGATTT TGTTTGTGAA GAAGAAGGAT GGGAGTTTCC 661 G Predicted gene structure (within gDNA segment 19733 to 25185): Exon 1 23116 23775 ( 660 n); cDNA 2 661 ( 660 n); score: 0.850 MATCH C06HBa0153O03.1-1+ SGN-E351414- 0.850 660 0.998 C PGS_C06HBa0153O03.1-1+_SGN-E351414- (23116 23775) Alignment (genomic DNA sequence = upper lines): TACTGGTCTT GATTTACATT GTGACTTGCT TGACATGCCT ATTCGTGTCT TTACTCCTGT 23175 || ||||| ||||||||| |||| || || ||| |||||| |||||||| | |||||| || TAACGGTCTA AATTTACATT GTGAATTACT TGATATGCCT ATTCGTGTTT CTACTCCGGT 61 GGGTGAGTCT GTGATAGTTG ATAAGGTGTA TAGGTCTTGT CTTGTGGTTT TTATGGGGAG 23235 |||||||||| ||| |||||| | ||||| || |||||||||| | ||| | || ||||||| GGGTGAGTCT GTGGTAGTTG AAAAGGTATA TAGGTCTTGT TTGGTGAACT TTGTGGGGAG 121 CAATACTCAT TTAGATTTGA TTATTCTAGA GATGGTTGAT TTCGATGTAA TTTTGGGTAT 23295 ||| ||| || |||||||| |||| |||| ||||||||| || ||||||| || ||||||| CAACACTTAT GTAGATTTGG TTATCTTAGA AATGGTTGAT TTTGATGTAA TTCTGGGTAT 181 GACTTGGCTT TCTCCAAACT TTGCAATCTT AGATTGTAAC GCTAAAACTG TGACATTGAC 23355 |||||||||| ||||| | | |||| ||||| |||||||| |||||||| | ||||||| | GACTTGGCTT TCTCCGCAAT TTGCGATCTT GGATTGTAAT GCTAAAACGG TGACATTAGC 241 CAAGCCTGGG ACAGATCCGC TAGTATGGGA GGGTGACTAT ATTTCCACCC TAGTTCATAT 23415 |||||||||| ||||||||| |||| ||||| ||||||||| | ||||| | || || CAAGCCTGGG ACAGATCCGT TAGTGTGGGA GGGTGACTAC ACTTCCAATC CGGTGTGCAT 301 TATCTCTTTT CTTCGTGCTA AGAGGATGGT TAGTAGGGGT TGTTTAGCTT TCTTGGCCCA 23475 ||||| ||| |||||||||| ||| ||||| ||||| || |||||||||| ||||||| || CATCTCCTTT CTTCGTGCTA AGAAAATGGT TAGTAAAGGG TGTTTAGCTT TCTTGGCACA 361 TCTCAGGGAT GATACTTCCA AGGTACCTTC GATTGAGTCT GTTTCGATAG TCTGTGAGTT 23535 ||||| |||| || ||| || | |||||||| ||||||||| |||| |||| |||||||||| TCTCAAGGAT GACACTACCC AAGTACCTTC GATTGAGTCG GTTTTTATAG TCTGTGAGTT 421 TCTGGATGTG TTTCCTGCAG ACCTTCCTGG TATGCCACCA GATAGGGATA TTGATTTTTG 23595 | |||||||| || ||||||| | |||||||| |||||||||| |||||||||| |||| || || TTTGGATGTG TTCCCTGCAG ATCTTCCTGG TATGCCACCA GATAGGGATA TTGACTTCTG 481 TATTGATCTC GAGCCGGGTA CTCGCCCCAT TTCCATACCC CCTTATAGAA TGACCCTATC 23655 ||| ||||| || ||||| | | |||||||| ||| |||||| |||||||||| || | | TATCGATCTT GAACCGGGCA CACGCCCCAT TTCTATACCC CCTTATAGAA TGGCTCCCGT 541 TGAGTTAAGG GAGTTAAAGG CCCAACTTCA GGAGTTGTTA GGTAAAGACT TTACTAGACC 23715 |||||||| |||||||||| |||||||||| |||||||| | |||| | | | |||||| AGAGTTAAGA GAGTTAAAGG CCCAACTTCA AGAGTTGTTG AGCAAAGGTT TCATTAGACC 601 AAGTTCATCC CCTTGGGGTG CTCCTGTTTT ATTTGTGAAG AAGAAGGATG GAAGTTTTCG 23775 |||| |||| |||||||||| |||| |||| ||||||||| |||||||||| | ||||| || AAGTGCATCT CCTTGGGGTG CTCCGATTTT GTTTGTGAAG AAGAAGGATG GGAGTTTCCG 661 hqPGS_C06HBa0153O03.1-1+_SGN-E351414- (23116 23775) ******************************************************************************** EST sequence 180 -strand 658 n (File: SGN-E355232-) 1 GTGGTAGTTG AAAAGGTACA TAGTTCTGTT TTGTNGAATT TCGTGGGGAG CAACACTTAT 61 GTAGATTTGG TTTTTTAGAA ATGGGTGACA TTGATGTAAT TCTGGGTATG ACTTGGCTTT 121 CTCCAAATTT TGCGATCTTG GATTGTAATG CTAAAACGGT GACGTTAGCC AAGCCTGGGA 181 CAGATCCGTT AGTGTGGGAG GGTGACTACA CTTCCAATCT GGTGCGTATC ATATCCTTTC 241 TTCGTGCTAA GAAAATGGTT AGTAAAGGGT GTTCAGCTTT CTTGGCACAT CTCAAGGATG 301 ACACTACTCA AGTACCCTCA ATTGAGTCGG TTTCGGTAGT CCGCGAGTTT TTGGACGTGT 361 TTCCTGCAGA TCTTCCTGGT ATGCCACCAG ATAGGGATAT TGACTTCTGT ATCGATCTTG 421 AACCAGGCAC ACGCCCCATT TCTATACCCC CTTATAGAAT GGCTCCCGCC GAATTAAGAG 481 AGTTAAAGGC TCAACTTCAA GAGTTGTTGA GCAAGGTCTT CATTAGACCA AGTGCATCTC 541 CTTGGGGTGC TCCAGTTTTA TTTGTGAAGA AGAAGGATGG AAGTTTTAGA ATGTGCATAG 601 ACTACAGACA ACTGAACAAG GTAACTATTA AGAACAAGTA TCCTCTTCCT CGCATTGA Predicted gene structure (within gDNA segment 19238 to 24471): Exon 1 23186 23844 ( 659 n); cDNA 1 658 ( 658 n); score: 0.846 MATCH C06HBa0153O03.1-1+ SGN-E355232- 0.846 659 1.002 C PGS_C06HBa0153O03.1-1+_SGN-E355232- (23186 23844) Alignment (genomic DNA sequence = upper lines): GTGATAGTTG ATAAGGTGTA TAGGTCTTGT CTTGTGG-TT TTTATGGGGA GCAATACTCA 23244 ||| |||||| | ||||| | ||| || ||| |||| | | || |||||| |||| ||| | GTGGTAGTTG AAAAGGTACA TAGTTC-TGT TTTGTNGAAT TTCGTGGGGA GCAACACTTA 59 TTTAGATTTG ATTATTCTAG AGATGGTTGA TTTCGATGTA ATTTTGGGTA TGACTTGGCT 23304 | |||||||| || || ||| | |||| ||| | |||||| ||| |||||| |||||||||| TGTAGATTTG GTT-TTTTAG AAATGGGTGA CATTGATGTA ATTCTGGGTA TGACTTGGCT 118 TTCTCCAAAC TTTGCAATCT TAGATTGTAA CGCTAAAACT GTGACATTGA CCAAGCCTGG 23364 ||||||||| ||||| |||| | |||||||| |||||||| ||||| || |||||||||| TTCTCCAAAT TTTGCGATCT TGGATTGTAA TGCTAAAACG GTGACGTTAG CCAAGCCTGG 178 GACAGATCCG CTAGTATGGG AGGGTGACTA TATTTCCACC CTAGTTCATA TTATCTCTTT 23424 |||||||||| |||| |||| |||||||||| | ||||| || || | || | || || || GACAGATCCG TTAGTGTGGG AGGGTGACTA CACTTCCAAT CTGGTGCGTA TCATATCCTT 238 TCTTCGTGCT AAGAGGATGG TTAGTAGGGG TTGTTTAGCT TTCTTGGCCC ATCTCAGGGA 23484 |||||||||| |||| |||| |||||| || |||| |||| |||||||| | |||||| ||| TCTTCGTGCT AAGAAAATGG TTAGTAAAGG GTGTTCAGCT TTCTTGGCAC ATCTCAAGGA 298 TGATACTTCC AAGGTACCTT CGATTGAGTC TGTTTCGATA GTCTGTGAGT TTCTGGATGT 23544 ||| ||| | | ||||| | | |||||||| |||||| || ||| | |||| || |||| || TGACACTACT CAAGTACCCT CAATTGAGTC GGTTTCGGTA GTCCGCGAGT TTTTGGACGT 358 GTTTCCTGCA GACCTTCCTG GTATGCCACC AGATAGGGAT ATTGATTTTT GTATTGATCT 23604 |||||||||| || ||||||| |||||||||| |||||||||| ||||| || | |||| ||||| GTTTCCTGCA GATCTTCCTG GTATGCCACC AGATAGGGAT ATTGACTTCT GTATCGATCT 418 CGAGCCGGGT ACTCGCCCCA TTTCCATACC CCCTTATAGA ATGACCCTAT CTGAGTTAAG 23664 || || || || ||||||| |||| ||||| |||||||||| ||| | | | || ||||| TGAACCAGGC ACACGCCCCA TTTCTATACC CCCTTATAGA ATGGCTCCCG CCGAATTAAG 478 GGAGTTAAAG GCCCAACTTC AGGAGTTGTT AGGTAAAGAC TTTACTAGAC CAAGTTCATC 23724 ||||||||| || ||||||| | |||||||| | || | | || | ||||| ||||| |||| AGAGTTAAAG GCTCAACTTC AAGAGTTGTT GAGCAAGGTC TTCATTAGAC CAAGTGCATC 538 CCCTTGGGGT GCTCCTGTTT TATTTGTGAA GAAGAAGGAT GGAAGTTTTC GGATGTGCAT 23784 ||||||||| ||||| |||| |||||||||| |||||||||| ||||||||| | |||||||| TCCTTGGGGT GCTCCAGTTT TATTTGTGAA GAAGAAGGAT GGAAGTTTTA GAATGTGCAT 598 AGACTACAGG CAACTGAATA AGGTAACTAT TAAGAACAAG TATCCTCTTC CTCGCATCGA 23844 ||||||||| |||||||| | |||||||||| |||||||||| |||||||||| ||||||| || AGACTACAGA CAACTGAACA AGGTAACTAT TAAGAACAAG TATCCTCTTC CTCGCATTGA 658 hqPGS_C06HBa0153O03.1-1+_SGN-E355232- (23186 23844) ******************************************************************************** EST sequence 190 -strand 679 n (File: SGN-E368762-) 1 CACTCGCCCC ATTTCTATAC CCCCTTATAG AATGGCTCCC GCGGAGTTAA GAGAGTTAAA 61 GGCCCAACTT CAAGAGTTTT TGAGCAAAGT CTTCATTAGA CCAAGTGCAT CTCCTTGGGG 121 TGCTCCGGTT TGGTTTGTGA AGAAGAAGGA TGGGAGTTTT CGGATGTGCA TAGACTACCG 181 GCAGTTGAAC AAGGTAACTA TTAAGAACAA GTATCCACTT CCTCGCATTG ATGACTTGTT 241 CGATCAGTTA CAAGGTGCTT GTGTCTTCTC TAAGATTGAC TTGAGATCCG GTTATCATCA 301 ATTGAAAATA CGGGCAACGG ATGTGCCAAA GACTGCTTTT AGAACCAGGT ATGGGCATTA 361 CGAATTTGTA GTGATGTCTT TTGGTCTTAC GAATGCCCCT GCTGCGTTCA TGAGCTTGAT 421 GAACGGGATT TTAAGCCATA TTTGGATCTC TTTGTCATCA TGTTTATTGA TGATATACTG 481 ATATACTCTA ATAGTAAGAA GGAACATGAG GAGCATTTGA GAATTGTATT AGAAATGTTG 541 AGGGAGAAAA AGCTTTATGC CAAGTTCTCT AAGTGTGAGT TTTGGATAGA TGCAGTGTCC 601 TTCTTGGGGC ACGTGGTTTC TAAGGATGGA GTGATGGTGG ATCCTTGTAA GATTGAGACA 661 GTGAAGAATT GGGTGAGAC Predicted gene structure (within gDNA segment 22860 to 25210): Exon 1 23615 24295 ( 681 n); cDNA 2 679 ( 678 n); score: 0.868 MATCH C06HBa0153O03.1-1+ SGN-E368762- 0.868 681 1.003 C PGS_C06HBa0153O03.1-1+_SGN-E368762- (23615 24295) Alignment (genomic DNA sequence = upper lines): ACTCGCCCCA TTTCCATACC CCCTTATAGA ATGACCCTAT CTGAGTTAAG GGAGTTAAAG 23674 |||||||||| |||| ||||| |||||||||| ||| | | | |||||||| ||||||||| ACTCGCCCCA TTTCTATACC CCCTTATAGA ATGGCTCCCG CGGAGTTAAG AGAGTTAAAG 61 GCCCAACTTC AGGAGTTGTT AGGTAAAGAC TTTACTAGAC CAAGTTCATC CCCTTGGGGT 23734 |||||||||| | ||||| || | |||| | || | ||||| ||||| |||| ||||||||| GCCCAACTTC AAGAGTTTTT GAGCAAAGTC TTCATTAGAC CAAGTGCATC TCCTTGGGGT 121 GCTCCTGTTT TATTTGTGAA GAAGAAGGAT GGAAGTTTTC GGATGTGCAT AGACTACAGG 23794 ||||| |||| |||||||| |||||||||| || ||||||| |||||||||| ||||||| || GCTCCGGTTT GGTTTGTGAA GAAGAAGGAT GGGAGTTTTC GGATGTGCAT AGACTACCGG 181 CAACTGAATA AGGTAACTAT TAAGAACAAG TATCCTCTTC CTCGCATCGA TGATTTGTTC 23854 || |||| | |||||||||| |||||||||| ||||| |||| ||||||| || ||| |||||| CAGTTGAACA AGGTAACTAT TAAGAACAAG TATCCACTTC CTCGCATTGA TGACTTGTTC 241 GATCAGTTAC AAGGTGCTTG TATCTTTTCA AAAATCGATT TGAGATCTAG TTATCATGAA 23914 |||||||||| |||||||||| | |||| || || || || | ||||||| | ||||||| || GATCAGTTAC AAGGTGCTTG TGTCTTCTCT AAGATTGACT TGAGATCCGG TTATCATCAA 301 TTGAAAATAC GGGCAGCAGA TGTGCCAAAG GCTGTGTTTC GAACCAGGTA TGGGCATTAT 23974 |||||||||| ||||| | || |||||||||| ||| ||| |||||||||| ||||||||| TTGAAAATAC GGGCAACGGA TGTGCCAAAG ACTGCTTTTA GAACCAGGTA TGGGCATTAC 361 GAATTCTTAG TAATGTCTTT TGGGCTTACG AATGCCTCTT CTGCGTTCAT GAGCCTGATG 24034 ||||| ||| | |||||||| ||| |||||| |||||| || |||||||||| |||| ||||| GAATTTGTAG TGATGTCTTT TGGTCTTACG AATGCCCCTG CTGCGTTCAT GAGCTTGATG 421 AACAGGATTT TTAAGCCATA TCTGGATCTG TTTGTTATTG TATTTATTGA TGATATACTG 24094 ||| ||| || |||||||||| | ||||||| ||||| || | |||||||| |||||||||| AACGGGA-TT TTAAGCCATA TTTGGATCTC TTTGTCATCA TGTTTATTGA TGATATACTG 480 ATATACTCAA AGAGCAGAAA AGAACATGGG GAGTATTTGA AAATTGTTAT GGAATTGTTG 24154 |||||||| | | || | || ||||||| | ||| |||||| |||||| | ||| ||||| ATATACTCTA ATAGTAAGAA GGAACATGAG GAGCATTTGA GAATTGTATT AGAAATGTTG 540 AGAGAGAAAA AGGCTTTATG CCAAATTCTC CAAGTGTGAG TTTTGGCTAG ATTCAGTGTC 24214 || ||||||| | |||||||| |||| ||||| ||||||||| |||||| ||| || ||||||| AGGGAGAAAA A-GCTTTATG CCAAGTTCTC TAAGTGTGAG TTTTGGATAG ATGCAGTGTC 599 CTTCTTGGGG CATGTTGGTT TCCAAGGATG GAGTGATGGT GGATCCATCT AATATTAAAG 24274 |||||||||| || | ||||| || ||||||| |||||||||| |||||| | | || ||| | CTTCTTGGGG CACG-TGGTT TCTAAGGATG GAGTGATGGT GGATCCTTGT AAGATTGAGA 658 TAGTGAAGAA TTGGGTAAGA C 24295 ||||||||| |||||| ||| | CAGTGAAGAA TTGGGTGAGA C 679 hqPGS_C06HBa0153O03.1-1+_SGN-E368762- (23615 24295) ******************************************************************************** EST sequence 121 -strand 712 n (File: SGN-E379315-) 1 CCGCAGAGTT AAGAGAGTTA AAAGACCACT TCAAAGAGTT GTGAGCAAAG GCTTCATTAG 61 ACCAACTGCA TCTCCTTGGG GTGCTCCGGT TTTGTTTGTA AAGAAGAAGG ATTGGAGTTT 121 TCGGATGTGC ATAGGCTACC GGCAGTGNAA CAAGGTAACC ATAAAGAACA AGTATCCTCT 181 TCCTCGCATT AATGACTTGT TCGATCAGTT ACAAGGTGCT TGTGTCTTTT CTAAGATTGA 241 CTTGAGATCC GGTTATCATC AATTGAAAAT ACGGGCAACG GATGTGCCAA AGACTGCTTT 301 TAGAACCAGG TATGGGCATT ACAAATTTGT AGTGATGTCT TTTGGTCTTA TGAATGCCCC 361 TGCTGCGTTT ATGAGTTTAA TAAAAGGGAT TTTTAAGCCA TATTTGGATC TCTTTGTGAT 421 CGTATTTATT GATGATATAC TGATATACTC TAAAAGTAAG GAGGAACATG AAGAGCATTT 481 GAGAATGGTA TTGGAAATGT TGAGGGAGAA AAAGTTTTAT GCCAAGTTCT CTAAGTGTGA 541 GTTTTGGCTA GATGTAGTGT CCTTCTTGGG GAACGTGGTT TCTAAGGATG GAGTGATGGT 601 GGATCCTTCT AAGATTGAGA CAGTGAAGAA TTGGGTAAGA CCTACTAATG TGTCAGAAAT 661 AAGGAGCTTT GTTGGGTTAG CTAGCTACTA CCGCCGATTT GTCAAGGGAT TC Predicted gene structure (within gDNA segment 21576 to 25515): Exon 1 23657 24366 ( 710 n); cDNA 6 711 ( 706 n); score: 0.846 MATCH C06HBa0153O03.1-1+ SGN-E379315- 0.846 710 0.997 C PGS_C06HBa0153O03.1-1+_SGN-E379315- (23657 24366) Alignment (genomic DNA sequence = upper lines): GAGTTAAGGG AGTTAAAGGC CCAACTTCAG GAGTTGTTAG GTAAAGACTT TACTAGACCA 23716 |||||||| | ||||||| | ||| | | ||||||| | | |||| ||| | ||||||| GAGTTAAGAG AGTTAAAAGA CCACTTCAAA GAGTTGTGA- GCAAAGGCTT CATTAGACCA 64 AGTTCATCCC CTTGGGGTGC TCCTGTTTTA TTTGTGAAGA AGAAGGATGG AAGTTTTCGG 23776 | | |||| | |||||||||| ||| ||||| ||||| |||| |||||||| | ||||||||| ACTGCATCTC CTTGGGGTGC TCCGGTTTTG TTTGTAAAGA AGAAGGATTG GAGTTTTCGG 124 ATGTGCATAG ACTACAGGCA ACTGAATAAG GTAACTATTA AGAACAAGTA TCCTCTTCCT 23836 |||||||||| |||| |||| || ||| ||||| || | |||||||||| |||||||||| ATGTGCATAG GCTACCGGCA GTGNAACAAG GTAACCATAA AGAACAAGTA TCCTCTTCCT 184 CGCATCGATG ATTTGTTCGA TCAGTTACAA GGTGCTTGTA TCTTTTCAAA AATCGATTTG 23896 ||||| ||| | |||||||| |||||||||| ||||||||| ||||||| || || || ||| CGCATTAATG ACTTGTTCGA TCAGTTACAA GGTGCTTGTG TCTTTTCTAA GATTGACTTG 244 AGATCTAGTT ATCATGAATT GAAAATACGG GCAGCAGATG TGCCAAAGGC TGTGTTTCGA 23956 ||||| ||| ||||| |||| |||||||||| ||| | |||| |||||||| | || ||| || AGATCCGGTT ATCATCAATT GAAAATACGG GCAACGGATG TGCCAAAGAC TGCTTTTAGA 304 ACCAGGTATG GGCATTATGA ATTCTTAGTA ATGTCTTTTG GGCTTACGAA TGCCTCTTCT 24016 |||||||||| ||||||| | ||| |||| |||||||||| | |||| ||| |||| || || ACCAGGTATG GGCATTACAA ATTTGTAGTG ATGTCTTTTG GTCTTATGAA TGCCCCTGCT 364 GCGTTCATGA GCCTGATGAA CAGGATTTTT AAGCCATATC TGGATCTGTT TGTTATTGTA 24076 ||||| |||| | | || || |||||||| ||||||||| ||||||| || ||| || ||| GCGTTTATGA GTTTAATAAA AGGGATTTTT AAGCCATATT TGGATCTCTT TGTGATCGTA 424 TTTATTGATG ATATACTGAT ATACTCAAAG AGCAGAAAAG AACATGGGGA GTATTTGAAA 24136 |||||||||| |||||||||| |||||| || || | | | |||||| || | |||||| | TTTATTGATG ATATACTGAT ATACTCTAAA AGTAAGGAGG AACATGAAGA GCATTTGAGA 484 ATTGTTATGG AATTGTTGAG AGAGAAAAAG GCTTTATGCC AAATTCTCCA AGTGTGAGTT 24196 || || ||| || ||||||| |||||||| | |||||||| || ||||| | |||||||||| ATGGTATTGG AAATGTTGAG GGAGAAAAA- GTTTTATGCC AAGTTCTCTA AGTGTGAGTT 543 TTGGCTAGAT TCAGTGTCCT TCTTGGGGCA TGTTGGTTTC CAAGGATGGA GTGATGGTGG 24256 |||||||||| |||||||| |||||||| | | ||||||| ||||||||| |||||||||| TTGGCTAGAT GTAGTGTCCT TCTTGGGGAA CG-TGGTTTC TAAGGATGGA GTGATGGTGG 602 ATCCATCTAA TATTAAAGTA GTGAAGAATT GGGTAAGACC TACTAATGTT ACAGAGGTAA 24316 |||| ||||| ||| | | |||||||||| |||||||||| ||||||||| |||| ||| ATCCTTCTAA GATTGAGACA GTGAAGAATT GGGTAAGACC TACTAATGTG TCAGAAATAA 662 GGAGCGTTTT TTGGTTTAGC TAGCTCCTAC CGTCGATTTG TCAAGGGATT 24366 ||||| ||| |||| ||||| ||||| |||| || ||||||| |||||||||| GGAGC-TTTG TTGGGTTAGC TAGCTACTAC CGCCGATTTG TCAAGGGATT 711 hqPGS_C06HBa0153O03.1-1+_SGN-E379315- (23657 24366) ******************************************************************************** EST sequence 142 -strand 596 n (File: SGN-E375319-) 1 GTTTTCGGAT GTGCATAGGC TACCGGCAGT TGAACAAGGT AACCATAAAG AACAAGTATC 61 CTCTTCCTCG CATTAATGAC TTGTTCGATC AGTTACAAGG TGCTTGTGTC TTTTCTAAGA 121 TTGACTTGAG ATCCGGTTAT CATCAATTGA AAATACGGGC AACGGATGTG CCAAAGACTG 181 CTTTTAGAAC CAGGTATGGG CATTACAAAT TTGTAGTGAT GTCTTTTGGT CTTATGAATG 241 CCCCTGCTGC GTTTATGAGT TTAATAAAAG GGATTTTTAA GCCATATTTG GATCTCTTTG 301 TGATCGTATT TATTGATGAT ATACTGATAT ACTCTAAAAG TAAGGAGGAA CATGAAGAGC 361 ATTTGAGAAT GGTATTGGAA ATGTTGAGGG AGAAAAAGTT TTATGCCAAG TTCTCTAAGT 421 GTGAGTTTTG GCTAGATGTA GTGTCCTTCT TGGGGAACGT GGTTTCTAAG GATGGAGTGA 481 TGGTGGATCC TTCTAAGATT GAGACAGTGA AGAATTGGGT AAGACCTACT AATGTGTCAG 541 AAATAAGGAG CTTTGTTGGG TTAGCTAGCT ACTACCGCCG ATTTGTCAAG GGATTC Predicted gene structure (within gDNA segment 22736 to 25515): Exon 1 23769 24366 ( 598 n); cDNA 1 595 ( 595 n); score: 0.858 MATCH C06HBa0153O03.1-1+ SGN-E375319- 0.858 598 1.003 C PGS_C06HBa0153O03.1-1+_SGN-E375319- (23769 24366) Alignment (genomic DNA sequence = upper lines): GTTTTCGGAT GTGCATAGAC TACAGGCAAC TGAATAAGGT AACTATTAAG AACAAGTATC 23828 |||||||||| |||||||| | ||| |||| |||| ||||| ||| || ||| |||||||||| GTTTTCGGAT GTGCATAGGC TACCGGCAGT TGAACAAGGT AACCATAAAG AACAAGTATC 60 CTCTTCCTCG CATCGATGAT TTGTTCGATC AGTTACAAGG TGCTTGTATC TTTTCAAAAA 23888 |||||||||| ||| |||| |||||||||| |||||||||| ||||||| || ||||| || | CTCTTCCTCG CATTAATGAC TTGTTCGATC AGTTACAAGG TGCTTGTGTC TTTTCTAAGA 120 TCGATTTGAG ATCTAGTTAT CATGAATTGA AAATACGGGC AGCAGATGTG CCAAAGGCTG 23948 | || ||||| ||| ||||| ||| |||||| |||||||||| | | |||||| |||||| ||| TTGACTTGAG ATCCGGTTAT CATCAATTGA AAATACGGGC AACGGATGTG CCAAAGACTG 180 TGTTTCGAAC CAGGTATGGG CATTATGAAT TCTTAGTAAT GTCTTTTGGG CTTACGAATG 24008 ||| |||| |||||||||| ||||| ||| | |||| || ||||||||| |||| ||||| CTTTTAGAAC CAGGTATGGG CATTACAAAT TTGTAGTGAT GTCTTTTGGT CTTATGAATG 240 CCTCTTCTGC GTTCATGAGC CTGATGAACA GGATTTTTAA GCCATATCTG GATCTGTTTG 24068 || || |||| ||| ||||| | || || |||||||||| ||||||| || ||||| |||| CCCCTGCTGC GTTTATGAGT TTAATAAAAG GGATTTTTAA GCCATATTTG GATCTCTTTG 300 TTATTGTATT TATTGATGAT ATACTGATAT ACTCAAAGAG CAGAAAAGAA CATGGGGAGT 24128 | || ||||| |||||||||| |||||||||| |||| || || | | ||| |||| ||| TGATCGTATT TATTGATGAT ATACTGATAT ACTCTAAAAG TAAGGAGGAA CATGAAGAGC 360 ATTTGAAAAT TGTTATGGAA TTGTTGAGAG AGAAAAAGGC TTTATGCCAA ATTCTCCAAG 24188 |||||| ||| || ||||| ||||||| | ||||||| | |||||||||| ||||| ||| ATTTGAGAAT GGTATTGGAA ATGTTGAGGG AGAAAAA-GT TTTATGCCAA GTTCTCTAAG 419 TGTGAGTTTT GGCTAGATTC AGTGTCCTTC TTGGGGCATG TTGGTTTCCA AGGATGGAGT 24248 |||||||||| |||||||| |||||||||| |||||| | | ||||||| | |||||||||| TGTGAGTTTT GGCTAGATGT AGTGTCCTTC TTGGGGAACG -TGGTTTCTA AGGATGGAGT 478 GATGGTGGAT CCATCTAATA TTAAAGTAGT GAAGAATTGG GTAAGACCTA CTAATGTTAC 24308 |||||||||| || ||||| | || | ||| |||||||||| |||||||||| ||||||| | GATGGTGGAT CCTTCTAAGA TTGAGACAGT GAAGAATTGG GTAAGACCTA CTAATGTGTC 538 AGAGGTAAGG AGCGTTTTTT GGTTTAGCTA GCTCCTACCG TCGATTTGTC AAGGGATT 24366 ||| ||||| ||| ||| || || ||||||| ||| |||||| ||||||||| |||||||| AGAAATAAGG AGC-TTTGTT GGGTTAGCTA GCTACTACCG CCGATTTGTC AAGGGATT 595 hqPGS_C06HBa0153O03.1-1+_SGN-E375319- (23769 24366) ******************************************************************************** EST sequence 129 -strand 526 n (File: SGN-E204434-) 1 CATTAATGAC TTGTTCGATC AGTTACAAGG TGCTTGTGTC TTTTCTAAGA TTGACTTGAG 61 ATCCGGTTAT CATCAATTGA AAATACGGGC AACGGATGTG CCAAAGACTG CTTTTAGAAC 121 CAGGTATGGG CATTACAAAT TTGTAGTGAT GTCTTTTGGT CTTATGAATG CCCCTGCTGC 181 GTTTATGAGT TTAATAAAAG GGATTTTTAA GCCATATTTG GATCTCTTTG TGATCGTATT 241 TATTGATGAT ATACTGATAT ACTCTAAAAG TAAGGAGGAA CATGAAGAGC ATTTGAGAAT 301 GGTATTGGAA ATGTTGAGGG AGAAAAAGTT TTATGCCAAG TTCTCTAAGT GTGAGTTTTG 361 GCTAGATGTA GTGTCCTTCT TGGGGAACGT GGTTTCTAAG GATGGAGTGA TGGTGGATCC 421 TTCTAAGATT GAGACAGTGA AGAATTGGGT AAGACCTACT AATGTGTCAG AAATAAGGAG 481 CTTTGTTGGG TTAGCTAGCT ACTACCGCCG ATTTGTCAAG GGATTC Predicted gene structure (within gDNA segment 21429 to 25515): Exon 1 23839 24366 ( 528 n); cDNA 1 525 ( 525 n); score: 0.852 MATCH C06HBa0153O03.1-1+ SGN-E204434- 0.852 528 1.004 C PGS_C06HBa0153O03.1-1+_SGN-E204434- (23839 24366) Alignment (genomic DNA sequence = upper lines): CATCGATGAT TTGTTCGATC AGTTACAAGG TGCTTGTATC TTTTCAAAAA TCGATTTGAG 23898 ||| |||| |||||||||| |||||||||| ||||||| || ||||| || | | || ||||| CATTAATGAC TTGTTCGATC AGTTACAAGG TGCTTGTGTC TTTTCTAAGA TTGACTTGAG 60 ATCTAGTTAT CATGAATTGA AAATACGGGC AGCAGATGTG CCAAAGGCTG TGTTTCGAAC 23958 ||| ||||| ||| |||||| |||||||||| | | |||||| |||||| ||| ||| |||| ATCCGGTTAT CATCAATTGA AAATACGGGC AACGGATGTG CCAAAGACTG CTTTTAGAAC 120 CAGGTATGGG CATTATGAAT TCTTAGTAAT GTCTTTTGGG CTTACGAATG CCTCTTCTGC 24018 |||||||||| ||||| ||| | |||| || ||||||||| |||| ||||| || || |||| CAGGTATGGG CATTACAAAT TTGTAGTGAT GTCTTTTGGT CTTATGAATG CCCCTGCTGC 180 GTTCATGAGC CTGATGAACA GGATTTTTAA GCCATATCTG GATCTGTTTG TTATTGTATT 24078 ||| ||||| | || || |||||||||| ||||||| || ||||| |||| | || ||||| GTTTATGAGT TTAATAAAAG GGATTTTTAA GCCATATTTG GATCTCTTTG TGATCGTATT 240 TATTGATGAT ATACTGATAT ACTCAAAGAG CAGAAAAGAA CATGGGGAGT ATTTGAAAAT 24138 |||||||||| |||||||||| |||| || || | | ||| |||| ||| |||||| ||| TATTGATGAT ATACTGATAT ACTCTAAAAG TAAGGAGGAA CATGAAGAGC ATTTGAGAAT 300 TGTTATGGAA TTGTTGAGAG AGAAAAAGGC TTTATGCCAA ATTCTCCAAG TGTGAGTTTT 24198 || ||||| ||||||| | ||||||| | |||||||||| ||||| ||| |||||||||| GGTATTGGAA ATGTTGAGGG AGAAAAA-GT TTTATGCCAA GTTCTCTAAG TGTGAGTTTT 359 GGCTAGATTC AGTGTCCTTC TTGGGGCATG TTGGTTTCCA AGGATGGAGT GATGGTGGAT 24258 |||||||| |||||||||| |||||| | | ||||||| | |||||||||| |||||||||| GGCTAGATGT AGTGTCCTTC TTGGGGAACG -TGGTTTCTA AGGATGGAGT GATGGTGGAT 418 CCATCTAATA TTAAAGTAGT GAAGAATTGG GTAAGACCTA CTAATGTTAC AGAGGTAAGG 24318 || ||||| | || | ||| |||||||||| |||||||||| ||||||| | ||| ||||| CCTTCTAAGA TTGAGACAGT GAAGAATTGG GTAAGACCTA CTAATGTGTC AGAAATAAGG 478 AGCGTTTTTT GGTTTAGCTA GCTCCTACCG TCGATTTGTC AAGGGATT 24366 ||| ||| || || ||||||| ||| |||||| ||||||||| |||||||| AGC-TTTGTT GGGTTAGCTA GCTACTACCG CCGATTTGTC AAGGGATT 525 hqPGS_C06HBa0153O03.1-1+_SGN-E204434- (23839 24366) ******************************************************************************** EST sequence 147 -strand 554 n (File: SGN-E352647-) 1 CCATGATTTG GAGTTAGCGG CAGTAGTGAT TGCATTAAAG TAATAGAGCC ATTATCTCTA 61 TGGGTTTAAG TGTGAAGTCT ATATGGATCA TCGTAGTTTA CAGTATGTCT TTACTCAGAA 121 AGATTTGAAT TTGAGACAGA GGAGATCGAT GGAGCTACTG AAGGACTATG ATATCACTAT 181 CTTGTATCAT CCGGGAAAGG CTAATGTTGT GGCAGATGCT TTAAGTAGAA AGGCAGGGAG 241 CATGGGAAGT CTAGCTCACT TGCAGGTTTC TAGATGCCCA TTGGCTAGAG AGGTTCAGAC 301 TCTGGCTAAT GACCTTATGA GGCTAGAATT AAATGAGAAG GGAGAATTTT TGGCTTGTGT 361 GGAGGCAAGA TCTTCCTTTC TTGATAAGAT TAAAGGAAAG CAGTTTACCG ATGAGAAACT 421 GATCTGGATT CGAGATAAGG TAATGCGAGG AGAGGCTAAA GAAGCAAAAA TCGATAAGGA 481 AGGTGTTTTG AGGATTAAGG GAAAGGTATG TGTACCCCGT GCCGACGATT TGATTCACAC 541 TATTCTTACA GAGG Predicted gene structure (within gDNA segment 20571 to 25997): Exon 1 24395 24839 ( 445 n); cDNA 110 554 ( 445 n); score: 0.872 MATCH C06HBa0153O03.1-1+ SGN-E352647- 0.872 445 0.803 C PGS_C06HBa0153O03.1-1+_SGN-E352647- (24395 24839) Alignment (genomic DNA sequence = upper lines): TTGACTAAGA AAGATTTGAA TTTGAGGCAG TGAAGGTAGA TGGAACTACT GAAGGACTAT 24454 || ||| ||| |||||||||| |||||| ||| | || | || |||| ||||| |||||||||| TTTACTCAGA AAGATTTGAA TTTGAGACAG AGGAGATCGA TGGAGCTACT GAAGGACTAT 169 GATATTACTA TTTTGTATCA CCCAGGAAAA GCTAATGTTG TGGCAGACGC TTTAAGTAGA 24514 ||||| |||| | |||||||| || ||||| |||||||||| ||||||| || |||||||||| GATATCACTA TCTTGTATCA TCCGGGAAAG GCTAATGTTG TGGCAGATGC TTTAAGTAGA 229 AAAGCAGGGA GCAGGGGAAG CCTAGCCCAC TTACAGGTTT CTAGGCGCCC ATTGGCTAGA 24574 || ||||||| ||| |||||| ||||| ||| || ||||||| |||| |||| |||||||||| AAGGCAGGGA GCATGGGAAG TCTAGCTCAC TTGCAGGTTT CTAGATGCCC ATTGGCTAGA 289 GAGGTTTAGA CCCTGGTTAA TGACTTTATG AGGCTGGAAG TACTAGAGAA GGGAGGATTT 24634 |||||| ||| | |||| ||| |||| ||||| ||||| ||| || ||||| ||||| |||| GAGGTTCAGA CTCTGGCTAA TGACCTTATG AGGCTAGAAT TAAATGAGAA GGGAGAATTT 349 TTGGCTTGTG TGGAGGCAAG ATCTTCTTTT CTTGACAAGA TTAAGGGAAA ACAGTTTACT 24694 |||||||||| |||||||||| |||||| ||| ||||| |||| |||| ||||| |||||||| TTGGCTTGTG TGGAGGCAAG ATCTTCCTTT CTTGATAAGA TTAAAGGAAA GCAGTTTACC 409 GACGAGAAGC TGAGCCGAAT TCGAGATATG GTATTACGAG GAGAGGCTAA AGAGGCAATA 24754 || ||||| | ||| | | || |||||||| | ||| | |||| |||||||||| ||| |||| | GATGAGAAAC TGATCTGGAT TCGAGATAAG GTAATGCGAG GAGAGGCTAA AGAAGCAAAA 469 ATTGACGAGG AAGGTGTTTT GAGAATTAAG GGAAGGATAT GTGTGCCCCG TGTTGATAAT 24814 || || ||| |||||||||| ||| |||||| |||| | ||| |||| ||||| || || || ATCGATAAGG AAGGTGTTTT GAGGATTAAG GGAAAGGTAT GTGTACCCCG TGCCGACGAT 529 TTGATTCACA CTATTCTTAC AGAGG 24839 |||||||||| |||||||||| ||||| TTGATTCACA CTATTCTTAC AGAGG 554 hqPGS_C06HBa0153O03.1-1+_SGN-E352647- (24395 24839) ******************************************************************************** EST sequence 168 -strand 587 n (File: SGN-E352950-) 1 GCAATTAAAG GTGCATGAAC GTAATTATCC GACCCATGAT TTGGAGTTAG CGGCAGTAGT 61 GATTGCATTA AAGTAATAGA GACATTATCT CTATGGGGTT AAGTGTGAAG TCTATATGGA 121 TCATCGTAGT TTACAGTATG TCTTTACTCA GAAAGATTTG AATTTGAGAC AGAGGAGATC 181 GATGGAGCTA CTGAAGGACT ATGATATCAC TATCTTGTAT CATCCGGGAA AGGCTAATGT 241 TGTGGCAGAT GCTTTAAGTA GAAAGGCAGG GAGCATGGGA AGTCTAGCTC ACTTGCAGGT 301 TTCTAGATGC CCATTGGCTA GAGAGGTTCA GACTCTGGCT AATGACCTTA TGAGGCTAGA 361 ATTAAATGAG AAGGGAGAAT TTTTGGCTTG TGTGGAGGCA AGATCTTCCT TTCTTGATAA 421 GATTAAAGGA AAGCAGTTTA CCGATGAGAA ACTGATCTGG ATTCGAGATA AGGTAATGCG 481 AGGAGAGGCT AAAGAAGCAA AAATCGATAA GGAAGGTGTT TTGAGGATTA AGGGAAAGGT 541 ATGTGTACCC CGTGCCGACG ATTTGATTCA CACTATTCTT ACAGAGG Predicted gene structure (within gDNA segment 20241 to 25997): Exon 1 24395 24839 ( 445 n); cDNA 143 587 ( 445 n); score: 0.872 MATCH C06HBa0153O03.1-1+ SGN-E352950- 0.872 445 0.758 C PGS_C06HBa0153O03.1-1+_SGN-E352950- (24395 24839) Alignment (genomic DNA sequence = upper lines): TTGACTAAGA AAGATTTGAA TTTGAGGCAG TGAAGGTAGA TGGAACTACT GAAGGACTAT 24454 || ||| ||| |||||||||| |||||| ||| | || | || |||| ||||| |||||||||| TTTACTCAGA AAGATTTGAA TTTGAGACAG AGGAGATCGA TGGAGCTACT GAAGGACTAT 202 GATATTACTA TTTTGTATCA CCCAGGAAAA GCTAATGTTG TGGCAGACGC TTTAAGTAGA 24514 ||||| |||| | |||||||| || ||||| |||||||||| ||||||| || |||||||||| GATATCACTA TCTTGTATCA TCCGGGAAAG GCTAATGTTG TGGCAGATGC TTTAAGTAGA 262 AAAGCAGGGA GCAGGGGAAG CCTAGCCCAC TTACAGGTTT CTAGGCGCCC ATTGGCTAGA 24574 || ||||||| ||| |||||| ||||| ||| || ||||||| |||| |||| |||||||||| AAGGCAGGGA GCATGGGAAG TCTAGCTCAC TTGCAGGTTT CTAGATGCCC ATTGGCTAGA 322 GAGGTTTAGA CCCTGGTTAA TGACTTTATG AGGCTGGAAG TACTAGAGAA GGGAGGATTT 24634 |||||| ||| | |||| ||| |||| ||||| ||||| ||| || ||||| ||||| |||| GAGGTTCAGA CTCTGGCTAA TGACCTTATG AGGCTAGAAT TAAATGAGAA GGGAGAATTT 382 TTGGCTTGTG TGGAGGCAAG ATCTTCTTTT CTTGACAAGA TTAAGGGAAA ACAGTTTACT 24694 |||||||||| |||||||||| |||||| ||| ||||| |||| |||| ||||| |||||||| TTGGCTTGTG TGGAGGCAAG ATCTTCCTTT CTTGATAAGA TTAAAGGAAA GCAGTTTACC 442 GACGAGAAGC TGAGCCGAAT TCGAGATATG GTATTACGAG GAGAGGCTAA AGAGGCAATA 24754 || ||||| | ||| | | || |||||||| | ||| | |||| |||||||||| ||| |||| | GATGAGAAAC TGATCTGGAT TCGAGATAAG GTAATGCGAG GAGAGGCTAA AGAAGCAAAA 502 ATTGACGAGG AAGGTGTTTT GAGAATTAAG GGAAGGATAT GTGTGCCCCG TGTTGATAAT 24814 || || ||| |||||||||| ||| |||||| |||| | ||| |||| ||||| || || || ATCGATAAGG AAGGTGTTTT GAGGATTAAG GGAAAGGTAT GTGTACCCCG TGCCGACGAT 562 TTGATTCACA CTATTCTTAC AGAGG 24839 |||||||||| |||||||||| ||||| TTGATTCACA CTATTCTTAC AGAGG 587 hqPGS_C06HBa0153O03.1-1+_SGN-E352950- (24395 24839) ******************************************************************************** EST sequence 199 -strand 587 n (File: SGN-E357100-) 1 GCAATTAAAG GTGCATGAAC GTAATTATCC GACCCATGAT TTGGAGTTAG CGGCAGTAGT 61 GATTGCATTA AAGTAATAGA GACATTATCT CTATGGGGTT AAGTGTGAAG TCTATATGGA 121 TCATCGTAGT TTACAGTATG TCTTTACTCA GAAAGATTTG AATTTGAGAC AGAGGAGATC 181 GATGGAGCTA CTGAAGGACT ATGATATCAC TATCTTGTAT CATCCGGGAA AGGCTAATGT 241 TGTGGCAGAT GCTTTAAGTA GAAAGGCAGG GAGCATGGGA AGTCTAGCTC ACTTGCAGGT 301 TTCTAGATGC CCATTGGCTA GAGAGGTTCA GACTCTGGCT AATGACCTTA TGAGGCTAGA 361 ATTAAATGAG AAGGGAGAAT TTTTGGCTTG TGTGGAGGCA AGATCTTCCT TTCTTGATAA 421 GATTAAAGGA AAGCAGTTTA CCGATGAGAA ACTGATCTGG ATTCGAGATA AGGTAATGCG 481 AGGAGAGGCT AAAGAAGCAA AAATCGATAA GGAAGGTGTT TTGAGGATTA AGGGAAAGGT 541 ATGTGTACCC CGTGCCGACG ATTTGATTCA CACTATTCTT ACAGAGG Predicted gene structure (within gDNA segment 20241 to 25997): Exon 1 24395 24839 ( 445 n); cDNA 143 587 ( 445 n); score: 0.872 MATCH C06HBa0153O03.1-1+ SGN-E357100- 0.872 445 0.758 C PGS_C06HBa0153O03.1-1+_SGN-E357100- (24395 24839) Alignment (genomic DNA sequence = upper lines): TTGACTAAGA AAGATTTGAA TTTGAGGCAG TGAAGGTAGA TGGAACTACT GAAGGACTAT 24454 || ||| ||| |||||||||| |||||| ||| | || | || |||| ||||| |||||||||| TTTACTCAGA AAGATTTGAA TTTGAGACAG AGGAGATCGA TGGAGCTACT GAAGGACTAT 202 GATATTACTA TTTTGTATCA CCCAGGAAAA GCTAATGTTG TGGCAGACGC TTTAAGTAGA 24514 ||||| |||| | |||||||| || ||||| |||||||||| ||||||| || |||||||||| GATATCACTA TCTTGTATCA TCCGGGAAAG GCTAATGTTG TGGCAGATGC TTTAAGTAGA 262 AAAGCAGGGA GCAGGGGAAG CCTAGCCCAC TTACAGGTTT CTAGGCGCCC ATTGGCTAGA 24574 || ||||||| ||| |||||| ||||| ||| || ||||||| |||| |||| |||||||||| AAGGCAGGGA GCATGGGAAG TCTAGCTCAC TTGCAGGTTT CTAGATGCCC ATTGGCTAGA 322 GAGGTTTAGA CCCTGGTTAA TGACTTTATG AGGCTGGAAG TACTAGAGAA GGGAGGATTT 24634 |||||| ||| | |||| ||| |||| ||||| ||||| ||| || ||||| ||||| |||| GAGGTTCAGA CTCTGGCTAA TGACCTTATG AGGCTAGAAT TAAATGAGAA GGGAGAATTT 382 TTGGCTTGTG TGGAGGCAAG ATCTTCTTTT CTTGACAAGA TTAAGGGAAA ACAGTTTACT 24694 |||||||||| |||||||||| |||||| ||| ||||| |||| |||| ||||| |||||||| TTGGCTTGTG TGGAGGCAAG ATCTTCCTTT CTTGATAAGA TTAAAGGAAA GCAGTTTACC 442 GACGAGAAGC TGAGCCGAAT TCGAGATATG GTATTACGAG GAGAGGCTAA AGAGGCAATA 24754 || ||||| | ||| | | || |||||||| | ||| | |||| |||||||||| ||| |||| | GATGAGAAAC TGATCTGGAT TCGAGATAAG GTAATGCGAG GAGAGGCTAA AGAAGCAAAA 502 ATTGACGAGG AAGGTGTTTT GAGAATTAAG GGAAGGATAT GTGTGCCCCG TGTTGATAAT 24814 || || ||| |||||||||| ||| |||||| |||| | ||| |||| ||||| || || || ATCGATAAGG AAGGTGTTTT GAGGATTAAG GGAAAGGTAT GTGTACCCCG TGCCGACGAT 562 TTGATTCACA CTATTCTTAC AGAGG 24839 |||||||||| |||||||||| ||||| TTGATTCACA CTATTCTTAC AGAGG 587 hqPGS_C06HBa0153O03.1-1+_SGN-E357100- (24395 24839) ******************************************************************************** EST sequence 166 -strand 542 n (File: SGN-E353207-) 1 AAAGAAGCAA AAATCGATGA GGAAGGTGTT TTGAGAATAA GGGAAGGAGT ATGTGTACCC 61 CGCGTCGATG ATTTGATTCA CACTATTCTT ATATAGGCTC ATAGTTCGAA GTACTCTATA 121 CATCCTGGTG CAAACAAGAT GTATCGTGAC CTAAAGCAAC ATTTTTGGTG GAGTAGGATG 181 AAGCGTGACA TTGTTAATTT TGTTGCTCAA TGCCCGAATT GTCAGCAAGT AAAGTATGAA 241 CACCAGAGGC CTGGAGGGAC ACTTCAGAGA ATGCCCATTC CTGAATGGAA GTGGGAGAGA 301 ATTGCAATGG ACTTCGTGGT TAGTCATCCA AAGACGATGG GTAGGTATGA CTCTATTTGG 361 GTGATTGTTG ACAGATTAAC TAAGTCTGCT CACTCTATTT CGGTTAAGGT GACTTACAAT 421 GCAGAGAAGT TAGCCAAACT TTACATCTTA GAAATTGTTC GATTGCACGG AGTTCCACTC 481 TCCATCATAT CAGATAGAGG TACGCAATTT ACTTCTAAGT TTTGGAAAAC AGTACATGCC 541 GA Predicted gene structure (within gDNA segment 23089 to 26009): Exon 1 24743 25265 ( 523 n); cDNA 1 542 ( 542 n); score: 0.799 MATCH C06HBa0153O03.1-1+ SGN-E353207- 0.799 523 0.965 C PGS_C06HBa0153O03.1-1+_SGN-E353207- (24743 25265) Alignment (genomic DNA sequence = upper lines): AAAGAGGCAA TAATTGACGA GGAAGGTGTT TTGAGAATTA AGGGAAGGA- TATGTGTGCC 24801 ||||| |||| ||| || || |||||||||| ||||||| || ||||||||| ||||||| || AAAGAAGCAA AAATCGATGA GGAAGGTGTT TTGAGAA-TA AGGGAAGGAG TATGTGTACC 59 CCGTGTTGAT AATTTGATTC ACACTATTCT TACAGAGGCT CATAGTTCAA GGTATTATAT 24861 ||| || ||| ||||||||| |||||||||| || | ||||| |||||||| | ||| | ||| CCGCGTCGAT GATTTGATTC ACACTATTCT TATATAGGCT CATAGTTCGA AGTACTCTAT 119 ACATCCTGGT GCAACCAAGA TGTATCGTGA CCTAAAGAAA CATTTCTGGT AGAGTAGAAT 24921 |||||||||| |||| ||||| |||||||||| ||||||| || ||||| |||| |||||| || ACATCCTGGT GCAAACAAGA TGTATCGTGA CCTAAAGCAA CATTTTTGGT GGAGTAGGAT 179 GAAGTGTGAC ATTGTTAATT TTGTTGCCCA ATGCCCGAAT TGTCAGCAGG TAAAGTATGA 24981 |||| ||||| |||||||||| ||||||| || |||||||||| |||||||| | |||||||||| GAAGCGTGAC ATTGTTAATT TTGTTGCTCA ATGCCCGAAT TGTCAGCAAG TAAAGTATGA 239 CCACCAGAGG CCCGGAGGAA CACTTCAGA- AA-----A-T -C-G---GG- A-----A-AG 25022 ||||||||| || ||||| | ||||||||| || | | | | || | | || ACACCAGAGG CCTGGAGGGA CACTTCAGAG AATGCCCATT CCTGAATGGA AGTGGGAGAG 299 AATTGCAATG GATTTTGTGG TTGGTCTTCC CAAGACATTG GTTAAGTTCG ATTCTATTTG 25082 |||||||||| || || |||| || ||| ||| ||||| || | || || | | |||||||| AATTGCAATG GACTTCGTGG TTAGTCATCC AAAGACGATG GGTAGGTATG ACTCTATTTG 359 GGTAATTGTT GACAGATTAA CTAAGTTTGC TCACTTCATT CCGATCAAGG TGACTTACAA 25142 ||| |||||| |||||||||| |||||| ||| ||||| ||| || | |||| |||||||||| GGTGATTGTT GACAGATTAA CTAAGTCTGC TCACTCTATT TCGGTTAAGG TGACTTACAA 419 TGCAGAGAAG TTAACCAAAC TCTATATCTC AGAAATTGCT CGATTGCATG GAGTTCCACT 25202 |||||||||| ||| |||||| | || |||| |||||||| | |||||||| | |||||||||| TGCAGAGAAG TTAGCCAAAC TTTACATCTT AGAAATTGTT CGATTGCACG GAGTTCCACT 479 CTCCATCATA TCAGATAGAG GTACGCAATT TACTTCTAAG TTTTGGAGAA CATTGCATGC 25262 |||||||||| |||||||||| |||||||||| |||||||||| ||||||| || || | ||||| CTCCATCATA TCAGATAGAG GTACGCAATT TACTTCTAAG TTTTGGAAAA CAGTACATGC 539 TGA 25265 || CGA 542 hqPGS_C06HBa0153O03.1-1+_SGN-E353207- (24743 25265) ******************************************************************************** EST sequence 107 -strand 654 n (File: SGN-E578131-) 1 CTTAGCAAGT CCGGCTTCCC ACAGGAGAAC CTTCACGTTT GCGGGCTTCC TCCTCTTCCA 61 GTAACCTCTT TTGACGCTCC TCCTCAGTTC GCAGAAAGAA TAACATTTTC CTCTTAGCTT 121 CCCTCTCCTG CTTCCTTGAT TGAATTATCT GGCTGATCCT TTCTCGTCTC TCCTGCTTCA 181 ATCTATTGAG TTCAGCTTCC CGGCTACTAA CAACTCTTTC TTGCAAAATC CTCTGCATAA 241 TTAGAAACAT ATCAGCGGAT CTGTTTTGTT AAGAGACCAA GACCAATCTT CAAGCAAGCC 301 CGTGGCTTCA CAAATTTTGA ACTAGTGGGA AAGTAGCGTC CATTAGCATT TCTAACCGAA 361 TGGTCAGCAT GTAAAGTATG AACAGCAAAG GCCTGGACGG ACACTTCAGA GAATGCCCAT 421 TCCTGAATGG AAGTGGGAGA GAATTGCAAT GGACTTCGTG GTTGGCCTTC CAAAGACAAT 481 GGGTAAGTAT GACTCCATTT GTGTAATTGT TGACAGATTG ACTAAGTCTG CTCATTGCAT 541 TCCGGTCAAG GTGACCTACA ATGTAGAGAA GTTAGTCAGA ATCTATATCT CAGAAATCGT 601 TCGATTGCAT GGAGTTCCAC TCTCCATCAT ATCAGATAGA GGTATGCAGT TTAC Predicted gene structure (within gDNA segment 18743 to 25925): Exon 1 24839 24852 ( 14 n); cDNA 415 428 ( 14 n); score: 0.571 Intron 1 24853 25009 ( 157 n); Pd: 0.617 (s: 0), Pa: 0.000 (s: 0.82) Exon 2 25010 25235 ( 226 n); cDNA 429 654 ( 226 n); score: 0.863 MATCH C06HBa0153O03.1-1+ SGN-E578131- 0.863 240 0.367 C PGS_C06HBa0153O03.1-1+_SGN-E578131- (24839 24852,25010 25235) Alignment (genomic DNA sequence = upper lines): GCTCATAGTT CAAGGTATTA TATACATCCT GGTGCAACCA AGATGTATCG TGACCTAAAG 24898 || ||| | || GCCCATTCCT GAAT...... .......... .......... .......... .......... 428 AAACATTTCT GGTAGAGTAG AATGAAGTGT GACATTGTTA ATTTTGTTGC CCAATGCCCG 24958 .......... .......... .......... .......... .......... .......... 428 AATTGTCAGC AGGTAAAGTA TGACCACCAG AGGCCCGGAG GAACACTTCA GAAAATCGGG 25018 || ||| .......... .......... .......... .......... .......... .GGAAGTGGG 437 AAAGAATTGC AATGGATTTT GTGGTTGGTC TTCCCAAGAC ATTGGTTAAG TTCGATTCTA 25078 | |||||||| |||||| || |||||||| | |||| ||||| | ||| |||| | || || | AGAGAATTGC AATGGACTTC GTGGTTGGCC TTCCAAAGAC AATGGGTAAG TATGACTCCA 497 TTTGGGTAAT TGTTGACAGA TTAACTAAGT TTGCTCACTT CATTCCGATC AAGGTGACTT 25138 |||| ||||| |||||||||| || ||||||| |||||| | ||||||| || |||||||| | TTTGTGTAAT TGTTGACAGA TTGACTAAGT CTGCTCATTG CATTCCGGTC AAGGTGACCT 557 ACAATGCAGA GAAGTTAACC AAACTCTATA TCTCAGAAAT TGCTCGATTG CATGGAGTTC 25198 |||||| ||| ||||||| | | | |||||| |||||||||| | ||||||| |||||||||| ACAATGTAGA GAAGTTAGTC AGAATCTATA TCTCAGAAAT CGTTCGATTG CATGGAGTTC 617 CACTCTCCAT CATATCAGAT AGAGGTACGC AATTTAC 25235 |||||||||| |||||||||| ||||||| || | ||||| CACTCTCCAT CATATCAGAT AGAGGTATGC AGTTTAC 654 hqPGS_C06HBa0153O03.1-1+_SGN-E578131- (25010 25235) ******************************************************************************** EST sequence 33 +strand 542 n (File: SGN-E252199+) 1 CTTGATGAGA ATTTGTCTTA TGAGGAGGAG CCTGTTGCCA TTTTAGATAG AGAAGTCCGC 61 AAATTGAGGT CAAGGGAGAT TGCATCTATC AAAGTTCAAT GGAAGAATCG ACCAGTTGAA 121 GAATCCACTT GGGAGAAGGA AGTTGATATG CGAGAAAGAT ACCCATACCT GTTTACAGAT 181 TCAGGTACTC CTTTTCGCCC TTGTTTTTCT TCTTGTGATC GTTCGGGGAC GAACGATGGG 241 TAAATTGGTA TCTATTGTAA CGACCTGTTT AGTCGTTTTG AGTAACAGAT TTTATTTCTG 301 GAAAAACTGA CTGAGACGAC GGATCCCACG ACGGACCGTC GAGGGGGTCT CGTTCCAAAA 361 CACTTAGAAT TCTGAAATTT GGGTACTGAA ATCGACTCTC TGAACTTCGT GACGGAATGG 421 CAGGACGGAC CGTCACAGGC GTGATGGGCC GTCACAGACC CTTGGTAAAA ATCTAGTCTC 481 TGAACTCTGT GACGGACGTG CAGGACGGAC CGTCACAGAC TGCGTAATCC CAGGCTGGGT 541 CG Predicted gene structure (within gDNA segment 25288 to 28541): Exon 1 25888 26232 ( 345 n); cDNA 1 344 ( 344 n); score: 0.857 Intron 1 26233 26349 ( 117 n); Pd: 0.000 (s: 0.80), Pa: 0.000 (s: 0.77) Exon 2 26350 26462 ( 113 n); cDNA 345 456 ( 112 n); score: 0.757 MATCH C06HBa0153O03.1-1+ SGN-E252199+ 0.832 458 0.845 C PGS_C06HBa0153O03.1-1+_SGN-E252199+ (25888 26232,26350 26462) Alignment (genomic DNA sequence = upper lines): CTTGATGAGA ATTTGTCTTA TGAGGAGGAG CCTATTGCTA TTTTAGGTTA GAGAGGTCTG 25947 |||||||||| |||||||||| |||||||||| ||| |||| | ||||| | || |||| ||| | CTTGATGAGA ATTTGTCTTA TGAGGAGGAG CCTGTTGCCA TTTTA-GATA GAGAAGTCCG 59 CAAGTTGAGA TCAAAGGAGA TTGCATCTAT CAAGGTTCGG TGGAAGAATC GGCCAATTGA 26007 ||| ||||| |||| ||||| |||||||||| ||| |||| |||||||||| | ||| |||| CAAATTGAGG TCAAGGGAGA TTGCATCTAT CAAAGTTCAA TGGAAGAATC GACCAGTTGA 119 AGAGTCCACT TGGGAGAATG AGGCCGATAT GTGAAAAAGA TATCCACATC TTTTTATAGA 26067 ||| |||||| |||||||| | | | ||||| | || ||||| || ||| | | | |||| ||| AGAATCCACT TGGGAGAAGG AAGTTGATAT GCGAGAAAGA TACCCATACC TGTTTACAGA 179 TTCAGGTACT CTTTCTCGCC CTTGCTTTTC TTCTTGTGAT CGTTCGGGGA CGAACGATGG 26127 |||||||||| | || ||||| |||| ||||| |||||||||| |||||||||| |||||||||| TTCAGGTACT CCTTTTCGCC CTTGTTTTTC TTCTTGTGAT CGTTCGGGGA CGAACGATGG 239 GTAAATTGGT ATCTATTGTA ACGACTTGTT TAGTCGGTTC GAGCAGTAGA -ACTATTTTT 26186 |||||||||| |||||||||| ||||| |||| |||||| || ||| | ||| ||||| | GTAAATTGGT ATCTATTGTA ACGACCTGTT TAGTCGTTTT GAGTAACAGA TTTTATTTCT 299 GATAAAAACT GACTGGGTCG ACGGATCACG CGACAGACCG TCATGGTCAC GACGGACCGT 26246 | ||||||| ||||| | || ||||||| | |||| ||||| || || G-GAAAAACT GACTGAGACG ACGGATCCCA CGACGGACCG TCGAGG.... .......... 344 GATGGACTCC GTCGTCCCAT ACTTATGTAA TTTCTTCTAT TGCTCTCCTC ATTACCCTCG 26306 .......... .......... .......... .......... .......... .......... 344 ACGGCAGGTA GGACGAACCG TCATAGGCAC GACAGTCCGT CGAGCGTCTC CGTTCCAAAA 26366 | |||| |||||||||| .......... .......... .......... .......... ...GGGTCT- CGTTCCAAAA 360 CACTT-CAAC TCTGAAAATC TGGGTACTGG GAGCGACTCT CTGAAATCCG CGATGGAACT 26425 ||||| || |||| |||| ||||||||| | ||||||| ||||| | || || |||| CACTTAGAAT TCTG-AAATT TGGGTACTGA AATCGACTCT CTGAACTTCG TGACGGAATG 419 GCAGCATGGA CCGTCGTAGA CACGACGGAC CGTCTCA 26462 |||| | ||| ||||| || | || || | |||| || GCAGGACGGA CCGTCACAGG CGTGATGGGC CGTCACA 456 hqPGS_C06HBa0153O03.1-1+_SGN-E252199+ (25888 26232,26350 26462) ******************************************************************************** EST sequence 150 +strand 686 n (File: SGN-E241789+) 1 ATGTCTGATC TTCAAGGACA TCAAGACATG AATGAGAGAG AATCCAGTCC GAGCTAGGAA 61 CAATAGCTCA CCCTGAAATC TGACGTGATG AAGACTGGTT AGAGTTGCGG TTGAGTTGAA 121 GACGACGGTA CGTTTGCTGC ACTCCACAAT TAACAAAAAG AAAACATAAA AGTAGGGGTC 181 AGTACAAACA CGAGTACTGA GTAGATATCA TCGGCCAACT CAGAATAGAG AACAATATAT 241 ATCAAATAAT AAAATAAAAT CAACCATAAC ACTTAACAGG TGACAACAAC AAGTACCATA 301 ACCATTGGGC ACAACCCAAG AACATCTATG AGGACTCAAG CCTCCACACC ATACTCATTT 361 GGGAAACAGG TTCATTAAAT TGAGTACATT AACATAATTC AAGATTCATT CTTTTTACTA 421 TCGTGGTGTC GGAACGTGAT ACTCCGATCC CCTAATGCTA CGTGTCGGTT CGTGACACCC 481 GATCCCCTAA TACTACGTGT CGGTTCGTTA CACCCGATCT CCTAATACTA CGTGCCGATT 541 CGTGACACCC GATCCATTAA TACTATGTGT CGGTTCGTGA CACCCGATCC ATTAATACTA 601 CGTGTCGGTT CGTGACACCC GATCCCCTAA CCTCATTCTT TTAGTTCATC AAGCCTTCTT 661 TTATACCAAG ACATCATCAT TAACAA Predicted gene structure (within gDNA segment 28541 to 23759): Exon 1 27539 27019 ( 521 n); cDNA 1 519 ( 519 n); score: 0.820 MATCH C06HBa0153O03.1-1- SGN-E241789+ 0.820 521 0.759 C PGS_C06HBa0153O03.1-1-_SGN-E241789+ (27539 27019) Alignment (genomic DNA sequence = upper lines): ATGTCCGAAC TT-AAAGACA TCAAGACCTG AA-G-GAGAG AATCCAGCAC GAGCTAGGAA 27483 ||||| || | || || |||| ||||||| || || | ||||| ||||||| | |||||||||| ATGTCTGATC TTCAAGGACA TCAAGACATG AATGAGAGAG AATCCAGTCC GAGCTAGGAA 60 TAATAGCTCA CCCTGAATTC TGATATGCTG AAAAATGGCT AGATCTGAGG ACGAGTCAAA 27423 ||||||||| ||||||| || ||| || || || | ||| | ||| || || |||| || CAATAGCTCA CCCTGAAATC TGACGTGATG AAGACTGGTT AGAGTTGCGG TTGAGTTGAA 120 GTCGATGGCA TGCTTGCTGC ACTCCACAAA TAACAAAGAA GAAAA-TTAC AAGTAGGGGT 27364 | ||| || | | ||||||| ||||||||| ||||||| || ||||| || |||||||||| GACGACGGTA CGTTTGCTGC ACTCCACAAT TAACAAA-AA GAAAACATAA AAGTAGGGGT 179 CAGTACAAGG AACACGTACT GAGTAGGTAT CATCAGCCAA CTCAAAATAG AAAACAATAT 27304 |||||||| || ||||| |||||| ||| |||| ||||| |||| ||||| | |||||||| CAGTACAA-A CACGAGTACT GAGTAGATAT CATCGGCCAA CTCAGAATAG AGAACAATAT 238 ATACTGAATA ATAATATAAA ATCAACCATA ATACTTAACA GGTGACAATC AACAAGTATA 27244 ||| |||| |||| ||||| |||||||||| | |||||||| |||||||| | |||||||| ATATCAAATA ATAAAATAAA ATCAACCATA ACACTTAACA GGTGACAA-C AACAAGTACC 297 AGAACCATTG ACAACAACAG CAAGCACACC TATGAGGACT CAAGCCTCCA CACCATACTC 27184 | |||||||| ||||| |||| ||| | |||||||||| |||||||||| |||||||||| ATAACCATTG GGCACAAC-C CAAGAACATC TATGAGGACT CAAGCCTCCA CACCATACTC 356 ATTTGGGAAA TAGGTTCTTT GAATTTGAGT ACATTAACAT AATTCAAGAT TCATTGTCTT 27124 |||||||||| |||||| || || |||||| |||||||||| |||||||||| ||||| | || ATTTGGGAAA CAGGTTCATT -AAATTGAGT ACATTAACAT AATTCAAGAT TCATTCTTTT 415 TATCATTATC GTGTCGGAAC GTGACAC-CC GATCCCCTAA TACTACCGTG TTGGAACGTG 27065 || || | |||||||||| |||| || || |||||||||| | ||| |||| | || |||| TACTATCGTG GTGTCGGAAC GTGATACTCC GATCCCCTAA TGCTA-CGTG TCGGTTCGTG 474 ACACTCCGA- CCCCTAATAC TACCGTGTCG GAATGTGACA CTCGATC 27019 |||| |||| |||||||||| || ||||||| | || ||| | ||||| ACAC-CCGAT CCCCTAATAC TA-CGTGTCG GTTCGTTACA CCCGATC 519 hqPGS_C06HBa0153O03.1-1-_SGN-E241789+ (27539 27019) ******************************************************************************** EST sequence 77 -strand 481 n (File: SGN-E246710-) 1 AGTCCAAGCT AGAAGCATTA GCTCACCCTG AATTTCCGAT GTAGTAAGAC TGGCTTGAAT 61 TACTGTTGAG TTGAACACGA TGGCACGTTT GCTGCACTCC ACAAATAAAC AAGAAGAGAA 121 CATAAAAGTA GGGGTCAGTA CAAAACACGG GTACTGAGTA GATATCATCG GCCAACTCAA 181 AATAGAAATC AATATATATA CCAAGTAATA TCATAAAATC AACTATGATA CTCAACATGT 241 AGCAACAACA AATACTATAT CATTAACAAT TACCGTCAAG TTCACACATG AGGACTCAAG 301 CCTCAATACC ATACTCATTT GGGAATCATG TTCATTAGAT TGAGTATATT AACATCTTTC 361 AAGATTCATT ATCTTTATTT CTCTTGTGTC GGTACGTGAC ACTCCGCTCC CTCAATATTC 421 ATTAATCCTC TTGTGTCGGT ACGTGACACT CCGATCCCCT AAATCTATAT GTCGGTTTGT 481 G Predicted gene structure (within gDNA segment 28541 to 25015): Exon 1 27420 27097 ( 324 n); cDNA 78 402 ( 325 n); score: 0.784 MATCH C06HBa0153O03.1-1- SGN-E246710- 0.784 324 0.674 C PGS_C06HBa0153O03.1-1-_SGN-E246710- (27420 27097) Alignment (genomic DNA sequence = upper lines): CGATGGCATG CTTGCTGCAC TCCACAAATA ACAAAGAAGA -AAATTACAA GTAGGGGTCA 27362 |||||||| | ||||||||| |||||||||| | ||||||| || || || |||||||||| CGATGGCACG TTTGCTGCAC TCCACAAATA AACAAGAAGA GAACATAAAA GTAGGGGTCA 137 GTACAAGGAA CACGTACTGA GTAGGTATCA TCAGCCAACT CAAAATAGAA AACAATATAT 27302 |||||| | | ||||||| |||| ||||| || ||||||| |||||||||| | |||||||| GTACAAAACA CGGGTACTGA GTAGATATCA TCGGCCAACT CAAAATAGAA ATCAATATAT 197 A-CTGAA-TA ATAATATAAA ATCAACCATA ATACTTAACA GGTGACAATC AACAAGTATA 27244 | || || ||| ||||| |||||| || ||||| |||| || ||| | ||||| || ATACCAAGTA ATATCATAAA ATCAACTATG ATACTCAACA TGTAGCAA-C AACAAATACT 256 AGAACCATTG ACAACAACAG -CAAGCACAC CTATGAGGAC TCAAGCCTCC ACACCATACT 27185 | | |||| |||| || | |||| ||| |||||||| ||||||||| | |||||||| A-TATCATTA ACAATTACCG TCAAGTTCAC ACATGAGGAC TCAAGCCTCA ATACCATACT 315 CATTTGGGAA ATAGGTTCTT TGAATTTGAG TACATTAACA TAATTCAAGA TTCATTGTCT 27125 |||||||||| | |||| | | | ||||| || ||||||| | ||||||| |||||| ||| CATTTGGGAA TCATGTTCAT T-AGATTGAG TATATTAACA TCTTTCAAGA TTCATTATCT 374 TTATCATTAT CGTGTCGGAA CGTGACAC 27097 |||| | | ||||||| | |||||||| TTATTTCTCT TGTGTCGGTA CGTGACAC 402 hqPGS_C06HBa0153O03.1-1-_SGN-E246710- (27420 27097) ******************************************************************************** EST sequence 52 -strand 505 n (File: SGN-E353206-) 1 CGCATTGATA CCGAAAAATC AAGGTATAGA GTGNAAGTCA ATCTCAATCA CATCAAACTT 61 GAAACTCTTA TAAATTTTCA GCTATATAAC TCCCATCATC ATAGATCCTG AACTCTGATG 121 TGCTGGAGAC TGGTTAGAGA TGAGGGCGAG TCGTAGTCAA TGGTACACTT GTTGCACTCC 181 ACAAAAAAAC AGAGAAGAAA ATACAAGTAG GGGTCAGTAC AAGGAACACG TACTGAGTAG 241 GTATCATCGG TCAACTCAAA ATAATAATCA ATATATATTG AATAATAATA TAAAATCAAC 301 TACAATACTT AACAGGTGGC AAGCAACAAA ACACATGAAC CATTAACAAC AACAACATAA 361 CATGTACACC ATCAAGCACA CCTATGAGGA CTCATGGCTC CACACCATAC TCATTTGGAA 421 AATAGGTTCT TTGAGATTAG ATATATTAAG TTAGTTCAAG ATTTATTTCC TTTAATGTTA 481 TTGTTTCGGA ACGTGACACT CCGAT Predicted gene structure (within gDNA segment 28541 to 25708): Exon 1 27471 27225 ( 247 n); cDNA 107 354 ( 248 n); score: 0.842 MATCH C06HBa0153O03.1-1- SGN-E353206- 0.842 247 0.489 C PGS_C06HBa0153O03.1-1-_SGN-E353206- (27471 27225) Alignment (genomic DNA sequence = upper lines): CCTGAATTCT GATATGCTGA AAAATGGCTA GATCTGAGGA CGAGTCAAAG TCGATGGCAT 27412 |||||| ||| ||| ||||| | | ||| || || ||||| |||||| || || |||| | CCTGAACTCT GATGTGCTGG AGACTGGTTA GAGATGAGGG CGAGTCGTAG TCAATGGTAC 166 GCTTGCTGCA CTCCACA-AA TAACAAAGAA GAAAATTACA AGTAGGGGTC AGTACAAGGA 27353 |||| |||| ||||||| || |||| |||| ||||| |||| |||||||||| |||||||||| ACTTGTTGCA CTCCACAAAA AAACAGAGAA GAAAA-TACA AGTAGGGGTC AGTACAAGGA 225 ACACGTACTG AGTAGGTATC ATCAGCCAAC TCAAAATAGA AAACAATATA TACTGAATAA 27293 |||||||||| |||||||||| ||| | |||| |||||||| || ||||||| || ||||||| ACACGTACTG AGTAGGTATC ATCGGTCAAC TCAAAATAAT AATCAATATA TATTGAATAA 285 TAATATAAAA TCAACCATAA TACTTAACAG GTGACAATCA AC-AAGTATA AGAACCATTG 27234 |||||||||| ||||| | || |||||||||| ||| ||| || || || | | |||||||| TAATATAAAA TCAACTACAA TACTTAACAG GTGGCAAGCA ACAAAACACA TGAACCATTA 345 ACAACAACA 27225 ||||||||| ACAACAACA 354 hqPGS_C06HBa0153O03.1-1-_SGN-E353206- (27471 27225) Total number of EST alignments reported: 209 ________________________________________________________________________________ Predicted gene locations (8) in segment 1 to 28541: PGL 1 (- strand): 1452 1249 AGS-1 (1452 1249) SCR (e 0.882) Exon 1 1452 1249 ( 204 n); score: 0.882 PGS (1452 1249) SGN-E353447+ 3-phase translation of AGS-1 (-strand): . . . . . . 1452 TACAACAAAAATGTCCATTTGCGACATTTAATTCTTAATTGCCGCTAAGTATGTATTTTT Y N K N V H L R H L I L N C R - V C I F T T K M S I C D I - F L I A A K Y V F L Q Q K C P F A T F N S - L P L S M Y F . . . . . . 1392 AGAGGCAATTGTCACTATTTGTATATGTCCCTATTGCCTTTAGAGACATTGGTTCTAATG R G N C H Y L Y M S L L P L E T L V L M E A I V T I C I C P Y C L - R H W F - - - R Q L S L F V Y V P I A F R D I G S N . . . . . . 1332 ACACTTAACTAATGCCGGTAAATACTTTAGAACTCTTTATTAGTGTCAATATTTAATGCC T L N - C R - I L - N S L L V S I F N A H L T N A G K Y F R T L Y - C Q Y L M P D T - L M P V N T L E L F I S V N I - C . . . 1272 ACTAAAAGTTATTTTTGTTGTAGT T K S Y F C C S L K V I F V V H - K L F L L - Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-1 (+strand): . . . . . . 1249 ACTACAACAAAAATAACTTTTAGTGGCATTAAATATTGACACTAATAAAGAGTTCTAAAG T T T K I T F S G I K Y - H - - R V L K L Q Q K - L L V A L N I D T N K E F - S Y N K N N F - W H - I L T L I K S S K . . . . . . 1309 TATTTACCGGCATTAGTTAAGTGTCATTAGAACCAATGTCTCTAAAGGCAATAGGGACAT Y L P A L V K C H - N Q C L - R Q - G H I Y R H - L S V I R T N V S K G N R D I V F T G I S - V S L E P M S L K A I G T . . . . . . 1369 ATACAAATAGTGACAATTGCCTCTAAAAATACATACTTAGCGGCAATTAAGAATTAAATG I Q I V T I A S K N T Y L A A I K N - M Y K - - Q L P L K I H T - R Q L R I K C Y T N S D N C L - K Y I L S G N - E L N . . . 1429 TCGCAAATGGACATTTTTGTTGTA S Q M D I F V V R K W T F L L V A N G H F C C Maximal non-overlapping open reading frames (>= 64 codons): none PGL 2 (+ strand): 2086 2437 AGS-1 (2086 2240,2328 2437) SCR (e 0.861 d 0.000 a 0.717,e 0.750) Exon 1 2086 2240 ( 155 n); score: 0.861 Intron 1 2241 2327 ( 87 n); Pd: 0.000 Pa: 0.717 Exon 2 2328 2437 ( 110 n); score: 0.750 PGS (2086 2240,2328 2437) SGN-E578113+ 3-phase translation of AGS-1 (+strand): . . . . . . 2086 TTTACATGGGGACTTGAGTTATCTGAACTCTCATTCCTACATCGGTGCTCAATACTACTC F T W G L E L S E L S F L H R C S I L L L H G D L S Y L N S H S Y I G A Q Y Y S Y M G T - V I - T L I P T S V L N T T . . . . . . 2146 CCAAAACATACTTTAGCTCATACTTTTAACAAAACTTCCTTCCTTTGGGTTGAGATAATT P K H T L A H T F N K T S F L W V E I I Q N I L - L I L L T K L P S F G L R - F P K T Y F S S Y F - Q N F L P L G - D N . . . . : . . 2206 TACTGAACCCTTTAGCTTTACAAATCTCCTTTTGG : GGAGTACTTAGTCCCCCTTATATCT Y - T L - L Y K S P F G : E Y L V P L I S T E P F S F T N L L L : G S T - S P L Y L L L N P L A L Q I S F W : G V L S P P Y I . . . . . . 2353 TTAGAGAAATGAACTCAACTCTTACTCTTTACTTAACTTTAAACTTTAACTCTTAGGAAA L E K - T Q L L L F T - L - T L T L R K - R N E L N S Y S L L N F K L - L L G N F R E M N S T L T L Y L T L N F N S - E . . . 2413 TACTTAGTTTTCTTATATACCATTT Y L V F L Y T I T - F S Y I P F I L S F L I Y H Maximal non-overlapping open reading frames (>= 64 codons): none PGL 3 (- strand): 12890 3249 AGS-1 (4142 4069,4034 3249) SCR (e 0.824 d 0.000 a 0.000,e 0.782) Exon 1 4142 4069 ( 74 n); score: 0.824 Intron 1 4068 4035 ( 34 n); Pd: 0.000 Pa: 0.000 Exon 2 4034 3249 ( 786 n); score: 0.782 PGS (3787 3249) SGN-E550127- PGS (3787 3249) SGN-E377133+ PGS (3787 3249) SGN-E550212+ PGS (3787 3249) SGN-E550201+ PGS (3787 3249) SGN-E389834+ PGS (3787 3249) SGN-E390013+ PGS (3787 3249) SGN-E550065+ PGS (3787 3249) SGN-E550207+ PGS (3787 3249) SGN-E550335+ PGS (3787 3249) SGN-E550484+ PGS (3787 3249) SGN-E550211+ PGS (3787 3249) SGN-E550464+ PGS (3787 3249) SGN-E549941+ PGS (3787 3249) SGN-E550025+ PGS (3787 3249) SGN-E396039+ PGS (3787 3249) SGN-E396056+ PGS (3776 3249) SGN-E550322+ PGS (3751 3249) SGN-E396057- PGS (3735 3249) SGN-E377132- PGS (3719 3249) SGN-E396055- PGS (3717 3249) SGN-E398551- PGS (3707 3249) SGN-E396038- PGS (3787 3250) SGN-E550140- PGS (3787 3250) SGN-E389553- PGS (3787 3250) SGN-E396054+ PGS (3787 3250) SGN-E396058+ PGS (3787 3250) SGN-E231589+ PGS (3787 3250) SGN-E374999+ PGS (3787 3260) SGN-E241959+ PGS (3923 3432) SGN-E349296- PGS (3673 3445) SGN-E396037+ PGS (4142 4069,4034 3547) SGN-E546548- 3-phase translation of AGS-1 (-strand): . . . . . . 4142 TCTCCTTCTATCAATTCATCAAGCCTTCTTTCTTACCAAGGCATCATCAATCTCATTATT S P S I N S S S L L S Y Q G I I N L I I L L L S I H Q A F F L T K A S S I S L F S F Y Q F I K P S F L P R H H Q S H Y . . : . . . . 4082 TTAGTTCATCACGC : AGATTAGGGTTTTGCAAGATTTGGGATTCAATAACTTCATCATGCT L V H H A : D - G F A R F G I Q - L H H A - F I T : Q I R V L Q D L G F N N F I M L F S S S R : R L G F C K I W D S I T S S C . . . . . . 3988 TATATAACCACAATTATAAAATTACATTCATGCAAGCATACAATTAAGCACATAGCAGGG Y I T T I I K L H S C K H T I K H I A G I - P Q L - N Y I H A S I Q L S T - Q G L Y N H N Y K I T F M Q A Y N - A H S R . . . . . . 3928 TTTACAATATTATCAATATATATCATTCGCTATTAAGAGTTTACTACGAATATCGTAAGA F T I L S I Y I I R Y - E F T T N I V R L Q Y Y Q Y I S F A I K S L L R I S - E V Y N I I N I Y H S L L R V Y Y E Y R K . . . . . . 3868 GAAACCATAACCTACCTCCACCGAAGATTAGTGATCAAGCAAGAAATTTCCCCAAGCTTT E T I T Y L H R R L V I K Q E I S P S F K P - P T S T E D - - S S K K F P Q A L R N H N L P P P K I S D Q A R N F P K L . . . . . . 3808 GTTCTTCGTTTTCTCTCTTCCTCGTTCGATCCTCTCTCTCTCTTTGTTCTTTCTACTTTT V L R F L S S S F D P L S L F V L S T F F F V F S L P R S I L S L S L F F L L F C S S F S L F L V R S S L S L C S F Y F . . . . . . 3748 CTTATTCAAACCCTCTTTCTTTTACCCTAATTAGCATATAATTAAGAACAAAAGATGGCA L I Q T L F L L P - L A Y N - E Q K M A L F K P S F F Y P N - H I I K N K R W Q S Y S N P L S F T L I S I - L R T K D G . . . . . . 3688 ATAATAACTCACTAATTAACTTAAGGTTACCTCTTTTAACCCCCAAGTAATTAGACTTAT I I T H - L T - G Y L F - P P S N - T Y - - L T N - L K V T S F N P Q V I R L I N N N S L I N L R L P L L T P K - L D L . . . . . . 3628 TAAAATTAACCCACTAACTTTATAATTAAAGCAGGAATAGTCCAAAACGCCCCTTAAAAT - N - P T N F I I K A G I V Q N A P - N K I N P L T L - L K Q E - S K T P L K I L K L T H - L Y N - S R N S P K R P L K . . . . . . 3568 AATTACAGAAATCTGACCCAGCCTGGGATTACGCAGCCTGTGACGGCCCGTCGCGCCTGC N Y R N L T Q P G I T Q P V T A R R A C I T E I - P S L G L R S L - R P V A P A - L Q K S D P A W D Y A A C D G P S R L . . . . . . 3508 GACGGTCCATTCTGCTGCTCCGTCACAGAGTTCCGAGACTCAATTTCTCTGAAGAGTCTG D G P F C C S V T E F R D S I S L K S L T V H S A A P S Q S S E T Q F L - R V C R R S I L L L R H R V P R L N F S E E S . . . . . . 3448 TAACGGTTCGTCCTGCCATTCCGTTACGAAGTTCAGAAAGTCGATTTCAGTACCCAATTT - R F V L P F R Y E V Q K V D F S T Q F N G S S C H S V T K F R K S I S V P N F V T V R P A I P L R S S E S R F Q Y P I . . . . . . 3388 TGAGAATTCTAAGTATTTTGGAATGAGATATCCTCGACGGTCCGTCGTGCCCATGACGGT - E F - V F W N E I S S T V R R A H D G E N S K Y F G M R Y P R R S V V P M T V L R I L S I L E - D I L D G P S C P - R . . . . . . 3328 CGGTCGTGAGTTCCGTCGTCTTTGCCTGTTTTTCAAGAAATAAAATCTGCTGCTCGAAAC R S - V P S S L P V F Q E I K S A A R N G R E F R R L C L F F K K - N L L L E T S V V S S V V F A C F S R N K I C C S K . . 3268 GACTAAACAGGTCGTTACAA D - T G R Y T K Q V V T R L N R S L Q Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-1-_PGL-3_AGS-1_PPS_1 (3941 3708) (frame '0'; 231 bp, 77 residues) 1 AHSRVYNIIN IYHSLLRVYY EYRKRNHNLP PPKISDQARN FPKLCSSFSL FLVRSSLSLC 61 SFYFSYSNPL SFTLISI- >C06HBa0153O03.1-1-_PGL-3_AGS-1_PPS_2 (3566 3363) (frame '0'; 201 bp, 67 residues) 1 LQKSDPAWDY AACDGPSRLR RSILLLRHRV PRLNFSEESV TVRPAIPLRS SESRFQYPIL 61 RILSILE- AGS-2 (5402 3448,3353 3321) SCR (e 0.877 d 0.000 a 0.000,e 0.879) Exon 1 5402 3448 (1955 n); score: 0.877 Intron 1 3447 3354 ( 94 n); Pd: 0.000 Pa: 0.000 Exon 2 3353 3321 ( 33 n); score: 0.879 PGS (3787 3448,3353 3321) SGN-E396070+ PGS (3787 3448,3353 3321) SGN-E236652+ PGS (4102 3738) SGN-E578076- PGS (4361 3831) SGN-E347579- PGS (4428 3839) SGN-E349726- PGS (4240 3839) SGN-E357559- PGS (4568 4333) SGN-E391780- PGS (4810 4334) SGN-E246710- PGS (4956 4347) SGN-E546506+ PGS (4659 4379) SGN-E357033+ PGS (4711 4559) SGN-E209683- PGS (5346 4637) SGN-E351546- PGS (5271 4637) SGN-E356696- PGS (5196 4637) SGN-E356206- PGS (4953 4707) SGN-E222578+ PGS (5342 4717) SGN-E392027+ PGS (5101 4718) SGN-E542084+ PGS (4982 4718) SGN-E336814- PGS (4982 4729) SGN-E373117- PGS (4982 4729) SGN-E373116+ PGS (4968 4729) SGN-E370357+ PGS (4982 4765) SGN-E298638- PGS (4964 4795) SGN-E352844- PGS (4956 4795) SGN-E238551- PGS (5402 5119) SGN-E355114- 3-phase translation of AGS-2 (-strand): . . . . . . 5402 ACAGCCCCAATGGCTGGCTCGGACGCTTCTTGTCTTGCCGATGTTGGTATTGGTGCAGTT T A P M A G S D A S C L A D V G I G A V Q P Q W L A R T L L V L P M L V L V Q L S P N G W L G R F L S C R C W Y W C S . . . . . . 5342 GTTGCTCTAGTTCTAACCATCTGCGAAACAGAGTGAAGATGGTCAGATACCAATTTGTAT V A L V L T I C E T E - R W S D T N L Y L L - F - P S A K Q S E D G Q I P I C I C C S S S N H L R N R V K M V R Y Q F V . . . . . . 5282 CACCTAGATACCAATTGGACCCAAGTAATAGCACGAAAGAAAGAATGAAAGAATGGAATT H L D T N W T Q V I A R K K E - K N G I T - I P I G P K - - H E R K N E R M E F S P R Y Q L D P S N S T K E R M K E W N . . . . . . 5222 TTCCTAAAGTCTTATAGCCCCTCAAAGAAAAGTAAAGGTGTCCCCCTACCGTTCCTTAAG F L K S Y S P S K K S K G V P L P F L K S - S L I A P Q R K V K V S P Y R S L R F P K V L - P L K E K - R C P P T V P - . . . . . . 5162 ACTCTACCAGACTCGTTCTTGTGTGATGAGACCAACGAACCTAATGCTCTGATACCAAGT T L P D S F L C D E T N E P N A L I P S L Y Q T R S C V M R P T N L M L - Y Q V D S T R L V L V - - D Q R T - C S D T K . . . . . . 5102 TTGTCACGACCCAAAACCGGATCGCGACTGGCACCCACACTTACCCTCCTATGTGAGCGA L S R P K T G S R L A P T L T L L C E R C H D P K P D R D W H P H L P S Y V S E F V T T Q N R I A T G T H T Y P P M - A . . . . . . 5042 ACCAACCAATCTAAACCTTAATATTTCAATAGAATATCAACAGAAAGTAATGCGGAAGAC T N Q S K P - Y F N R I S T E S N A E D P T N L N L N I S I E Y Q Q K V M R K T N Q P I - T L I F Q - N I N R K - C G R . . . . . . 4982 TTAAACTCATTAATAAAATCAATAAATATTATTATCCCCAAAATCTGGAAGTCATCATCA L N S L I K S I N I I I P K I W K S S S - T H - - N Q - I L L S P K S G S H H H L K L I N K I N K Y Y Y P Q N L E V I I . . . . . . 4922 CAAGAACATCTACTTCAAACTACTAAATCTAAGAGTTTCTAAGAAGCTAAAAATACATAA Q E H L L Q T T K S K S F - E A K N T - K N I Y F K L L N L R V S K K L K I H K T R T S T S N Y - I - E F L R S - K Y I . . . . . . 4862 AAGCTAGTCCATGCCGGAACTTCAAAGCATCAAGACATGAAGAGGAAGATCCAGTCCAAG K L V H A G T S K H Q D M K R K I Q S K S - S M P E L Q S I K T - R G R S S P S K A S P C R N F K A S R H E E E D P V Q . . . . . . 4802 CTAGAAGCATTAGCTCACCCTGATATCCGGAGTAATGAAGACTGGCTAGAGTTACTGTTG L E A L A H P D I R S N E D W L E L L L - K H - L T L I S G V M K T G - S Y C - A R S I S S P - Y P E - - R L A R V T V . . . . . . 4742 AGTCGAAGATGACGGCACGTTTGCTGCACTCCACAAATAAACAAGAAGAAAACATAAAAG S R R - R H V C C T P Q I N K K K T - K V E D D G T F A A L H K - T R R K H K S E S K M T A R L L H S T N K Q E E N I K . . . . . . 4682 TAGGGGTCAGTACAAACACGGGTACTGAGTAGATATCATCGGCCAACTCAAAATAGAGAT - G S V Q T R V L S R Y H R P T Q N R D R G Q Y K H G Y - V D I I G Q L K I E I V G V S T N T G T E - I S S A N S K - R . . . . . . 4622 CAATATATACCAAGTAATATCATAAAATCAACTATGATACTCAACATGTAGCAACATCAA Q Y I P S N I I K S T M I L N M - Q H Q N I Y Q V I S - N Q L - Y S T C S N I K S I Y T K - Y H K I N Y D T Q H V A T S . . . . . . 4562 ATACTATATCATTAACAATTACCGTCAAGTTCACACACGAGGACTCAAGCCTCAATACCG I L Y H - Q L P S S S H T R T Q A S I P Y Y I I N N Y R Q V H T R G L K P Q Y R N T I S L T I T V K F T H E D S S L N T . . . . . . 4502 TACTCATTTGGGAATTATGTTCATTGGATTGAGTATATTATCATCTTTCAAGATTCATTA Y S F G N Y V H W I E Y I I I F Q D S L T H L G I M F I G L S I L S S F K I H Y V L I W E L C S L D - V Y Y H L S R F I . . . . . . 4442 TCTTTATTTCTCTTGTGTCGGTACGTGACACTCCGCTCCCTCATATTCATTAATCCTCTT S L F L L C R Y V T L R S L I F I N P L L Y F S C V G T - H S A P S Y S L I L L I F I S L V S V R D T P L P H I H - S S . . . . . . 4382 GTGTCGGTACGTGACACTTCGATCCCCCACTACTATGTGTCGGAACGTGACACTTCGATC V S V R D T S I P H Y Y V S E R D T S I C R Y V T L R S P T T M C R N V T L R S C V G T - H F D P P L L C V G T - H F D . . . . . . 4322 CTCTAAATCTACGTGTCGGTTCGTGACACTCGATCTCCTAAATCTAAGTGTCGGTTCGTG L - I Y V S V R D T R S P K S K C R F V S K S T C R F V T L D L L N L S V G S - P L N L R V G S - H S I S - I - V S V R . . . . . . 4262 ACACCAGATCCCCTAAATCTACGTGTCAGTTCGTGACACCCGATCCCCTAAATCTACGTG T P D P L N L R V S S - H P I P - I Y V H Q I P - I Y V S V R D T R S P K S T C D T R S P K S T C Q F V T P D P L N L R . . . . . . 4202 TCGGTTCGTGACACCCGATCCCTAAATCTACGTGTCGGTTCGTGACACCCTATCCCCTAA S V R D T R S L N L R V G S - H P I P - R F V T P D P - I Y V S V R D T L S P N V G S - H P I P K S T C R F V T P Y P L . . . . . . 4142 TCTCCTTCTATCAATTCATCAAGCCTTCTTTCTTACCAAGGCATCATCAATCTCATTATT S P S I N S S S L L S Y Q G I I N L I I L L L S I H Q A F F L T K A S S I S L F I S F Y Q F I K P S F L P R H H Q S H Y . . . . . . 4082 TTAGTTCATCACGCCTTCTTTTATACCAAGGCCCCATCATTAACAAAGAGATTAGGGTTT L V H H A F F Y T K A P S L T K R L G F - F I T P S F I P R P H H - Q R D - G F F S S S R L L L Y Q G P I I N K E I R V . . . . . . 4022 TGCAAGATTTGGGATTCAATAACTTCATCATGCTTATATAACCACAATTATAAAATTACA C K I W D S I T S S C L Y N H N Y K I T A R F G I Q - L H H A Y I T T I I K L H L Q D L G F N N F I M L I - P Q L - N Y . . . . . . 3962 TTCATGCAAGCATACAATTAAGCACATAGCAGGGTTTACAATATTATCAATATATATCAT F M Q A Y N - A H S R V Y N I I N I Y H S C K H T I K H I A G F T I L S I Y I I I H A S I Q L S T - Q G L Q Y Y Q Y I S . . . . . . 3902 TCGCTATTAAGAGTTTACTACGAATATCGTAAGAGAAACCATAACCTACCTCCACCGAAG S L L R V Y Y E Y R K R N H N L P P P K R Y - E F T T N I V R E T I T Y L H R R F A I K S L L R I S - E K P - P T S T E . . . . . . 3842 ATTAGTGATCAAGCAAGAAATTTCCCCAAGCTTTGTTCTTCGTTTTCTCTCTTCCTCGTT I S D Q A R N F P K L C S S F S L F L V L V I K Q E I S P S F V L R F L S S S F D - - S S K K F P Q A L F F V F S L P R . . . . . . 3782 CGATCCTCTCTCTCTCTTTGTTCTTTCTACTTTTCTTATTCAAACCCTCTTTCTTTTACC R S S L S L C S F Y F S Y S N P L S F T D P L S L F V L S T F L I Q T L F L L P S I L S L S L F F L L F L F K P S F F Y . . . . . . 3722 CTAATTAGCATATAATTAAGAACAAAAGATGGCAATAATAACTCACTAATTAACTTAAGG L I S I - L R T K D G N N N S L I N L R - L A Y N - E Q K M A I I T H - L T - G P N - H I I K N K R W Q - - L T N - L K . . . . . . 3662 TTACCTCTTTTAACCCCCAAGTAATTAGACTTATTAAAATTAACCCACTAACTTTATAAT L P L L T P K - L D L L K L T H - L Y N Y L F - P P S N - T Y - N - P T N F I I V T S F N P Q V I R L I K I N P L T L - . . . . . . 3602 TAAAGCAGGAATAGTCCAAAACGCCCCTTAAAATAATTACAGAAATCTGACCCAGCCTGG - S R N S P K R P L K - L Q K S D P A W K A G I V Q N A P - N N Y R N L T Q P G L K Q E - S K T P L K I I T E I - P S L . . . . . . 3542 GATTACGCAGCCTGTGACGGCCCGTCGCGCCTGCGACGGTCCATTCTGCTGCTCCGTCAC D Y A A C D G P S R L R R S I L L L R H I T Q P V T A R R A C D G P F C C S V T G L R S L - R P V A P A T V H S A A P S . . . . : . . 3482 AGAGTTCCGAGACTCAATTTCTCTGAAGAGTCTGT : GACGGTCCGTCGTGCCCATGACGGT R V P R L N F S E E S V : T V R R A H D G E F R D S I S L K S L : - R S V V P M T V Q S S E T Q F L - R V C : D G P S C P - R . 3328 CGGTCGTG R S G R S V V Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-1-_PGL-3_AGS-2_PPS_1 (3941 3708) (frame '1'; 231 bp, 77 residues) 1 AHSRVYNIIN IYHSLLRVYY EYRKRNHNLP PPKISDQARN FPKLCSSFSL FLVRSSLSLC 61 SFYFSYSNPL SFTLISI- >C06HBa0153O03.1-1-_PGL-3_AGS-2_PPS_2 (4547 4317) (frame '1'; 228 bp, 76 residues) 1 QLPSSSHTRT QASIPYSFGN YVHWIEYIII FQDSLSLFLL CRYVTLRSLI FINPLVSVRD 61 TSIPHYYVSE RDTSIL- >C06HBa0153O03.1-1-_PGL-3_AGS-2_PPS_3 (5234 5022) (frame '1'; 210 bp, 70 residues) 1 KNGIFLKSYS PSKKSKGVPL PFLKTLPDSF LCDETNEPNA LIPSLSRPKT GSRLAPTLTL 61 LCERTNQSKP - >C06HBa0153O03.1-1-_PGL-3_AGS-2_PPS_4 (4191 3982) (frame '0'; 207 bp, 69 residues) 1 HPIPKSTCRF VTPYPLISFY QFIKPSFLPR HHQSHYFSSS RLLLYQGPII NKEIRVLQDL 61 GFNNFIMLI- >C06HBa0153O03.1-1-_PGL-3_AGS-2_PPS_5 (5400 5206) (frame '0'; 192 bp, 64 residues) 1 SPNGWLGRFL SCRCWYWCSC CSSSNHLRNR VKMVRYQFVS PRYQLDPSNS TKERMKEWNF 61 PKVL- AGS-3 (4852 4423,4340 4144,4093 4037) SCR (e 0.800 d 0.794 a 0.000,e 0.855 d 0.900 a 0.000,e 0.895) Exon 1 4852 4423 ( 430 n); score: 0.800 Intron 1 4422 4341 ( 82 n); Pd: 0.794 Pa: 0.000 Exon 2 4340 4144 ( 197 n); score: 0.855 Intron 2 4143 4094 ( 50 n); Pd: 0.900 Pa: 0.000 Exon 3 4093 4037 ( 57 n); score: 0.895 PGS (4852 4423,4340 4144,4093 4037) SGN-E241789+ 3-phase translation of AGS-3 (-strand): . . . . . . 4852 ATGCCGGAACTTCAAAGCATCAAGACATGAAGAGGAAGATCCAGTCCAAGCTAGAAGCAT M P E L Q S I K T - R G R S S P S - K H C R N F K A S R H E E E D P V Q A R S I A G T S K H Q D M K R K I Q S K L E A . . . . . . 4792 TAGCTCACCCTGATATCCGGAGTAATGAAGACTGGCTAGAGTTACTGTTGAGTCGAAGAT - L T L I S G V M K T G - S Y C - V E D S S P - Y P E - - R L A R V T V E S K M L A H P D I R S N E D W L E L L L S R R . . . . . . 4732 GACGGCACGTTTGCTGCACTCCACAAATAAACAAGAAGAAAACATAAAAGTAGGGGTCAG D G T F A A L H K - T R R K H K S R G Q T A R L L H S T N K Q E E N I K V G V S - R H V C C T P Q I N K K K T - K - G S . . . . . . 4672 TACAAACACGGGTACTGAGTAGATATCATCGGCCAACTCAAAATAGAGATCAATATATAC Y K H G Y - V D I I G Q L K I E I N I Y T N T G T E - I S S A N S K - R S I Y T V Q T R V L S R Y H R P T Q N R D Q Y I . . . . . . 4612 CAAGTAATATCATAAAATCAACTATGATACTCAACATGTAGCAACATCAAATACTATATC Q V I S - N Q L - Y S T C S N I K Y Y I K - Y H K I N Y D T Q H V A T S N T I S P S N I I K S T M I L N M - Q H Q I L Y . . . . . . 4552 ATTAACAATTACCGTCAAGTTCACACACGAGGACTCAAGCCTCAATACCGTACTCATTTG I N N Y R Q V H T R G L K P Q Y R T H L L T I T V K F T H E D S S L N T V L I W H - Q L P S S S H T R T Q A S I P Y S F . . . . . . 4492 GGAATTATGTTCATTGGATTGAGTATATTATCATCTTTCAAGATTCATTATCTTTATTTC G I M F I G L S I L S S F K I H Y L Y F E L C S L D - V Y Y H L S R F I I F I S G N Y V H W I E Y I I I F Q D S L S L F . : . . . . . 4432 TCTTGTGTCG : GAACGTGACACTTCGATCCTCTAAATCTACGTGTCGGTTCGTGACACTCG S C V : G T - H F D P L N L R V G S - H S L V S : E R D T S I L - I Y V S V R D T R L L C R : N V T L R S S K S T C R F V T L . . . . . . 4290 ATCTCCTAAATCTAAGTGTCGGTTCGTGACACCAGATCCCCTAAATCTACGTGTCAGTTC I S - I - V S V R D T R S P K S T C Q F S P K S K C R F V T P D P L N L R V S S D L L N L S V G S - H Q I P - I Y V S V . . . . . . 4230 GTGACACCCGATCCCCTAAATCTACGTGTCGGTTCGTGACACCCGATCCCTAAATCTACG V T P D P L N L R V G S - H P I P K S T - H P I P - I Y V S V R D T R S L N L R R D T R S P K S T C R F V T P D P - I Y . . . : . . . 4170 TGTCGGTTCGTGACACCCTATCCCCTA : ATCTCATTATTTTAGTTCATCACGCCTTCTTTT C R F V T P Y P L : I S L F - F I T P S F V G S - H P I P - : S H Y F S S S R L L L V S V R D T L S P : N L I I L V H H A F F . . . 4060 ATACCAAGGCCCCATCATTAACAA I P R P H H - Q Y Q G P I I N Y T K A P S L T Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-1-_PGL-3_AGS-3_PPS_1 (4547 4423,4340 4262) (frame '0'; 201 bp, 67 residues) 1 QLPSSSHTRT QASIPYSFGN YVHWIEYIII FQDSLSLFLL CRNVTLRSSK STCRFVTLDL 61 LNLSVGS- AGS-4 (8121 5413) SCR (e 0.901) Exon 1 8121 5413 (2709 n); score: 0.901 PGS (5843 5413) SGN-E352180+ PGS (5562 5413) SGN-E368761- PGS (6031 5478) SGN-E329287- PGS (6200 5482) SGN-E356614+ PGS (6198 5535) SGN-E352401+ PGS (6402 5690) SGN-E349404+ PGS (6402 5725) SGN-E351625+ PGS (6396 5789) SGN-E357065+ PGS (6402 5881) SGN-E352365+ PGS (6838 6049) SGN-E356912+ PGS (6762 6065) SGN-E356209+ PGS (6918 6155) SGN-E214046+ PGS (6836 6303) SGN-E353805+ PGS (6922 6364) SGN-E244046+ PGS (7305 6763) SGN-E355026+ PGS (7532 6771) SGN-E355244+ PGS (7129 6803) SGN-E352716+ PGS (7489 6829) SGN-E352117- PGS (7532 6872) SGN-E351414+ PGS (7318 6898) SGN-E242765+ PGS (7601 6943) SGN-E355232+ PGS (8050 7372) SGN-E368762+ PGS (8121 7409) SGN-E379315+ PGS (7578 7456) SGN-E578271+ PGS (8121 7526) SGN-E375319+ PGS (8121 7596) SGN-E204434+ PGS (8050 7692) SGN-E240817+ 3-phase translation of AGS-4 (-strand): . . . . . . 8121 GAATCCCTTGACAAATCGGCGGTAGTAGCTAGCTAACCCAACAAAGCTCCTTATTTCTGA E S L D K S A V V A S - P N K A P Y F - N P L T N R R - - L A N P T K L L I S D I P - Q I G G S S - L T Q Q S S L F L . . . . . . 8061 CACATTAGTAGGTCTTACCCAATTCTTCACTGTCTCAATCTTAGAAGGATCCACCATCAC H I S R S Y P I L H C L N L R R I H H H T L V G L T Q F F T V S I L E G S T I T T H - - V L P N S S L S Q S - K D P P S . . . . . . 8001 TCTATCCTTAGAAACCACGTGCCCCAAGAAGGACACTGCATCTAGCCAAAACTCACACTT S I L R N H V P Q E G H C I - P K L T L L S L E T T C P K K D T A S S Q N S H L L Y P - K P R A P R R T L H L A K T H T . . . . . . 7941 AGAGAACTTGGCATAAAGCTTTTTCTCCCTCAACATTTCCAATACCATTCTCAAATGCTC R E L G I K L F L P Q H F Q Y H S Q M L E N L A - S F F S L N I S N T I L K C S - R T W H K A F S P S T F P I P F S N A . . . . . . 7881 CTCATTTTCCTCCCTACTTTTAGAGTATATCAGTATGTCATCAATAAATACGATCACAAA L I F L P T F R V Y Q Y V I N K Y D H K S F S S L L L E Y I S M S S I N T I T K P H F P P Y F - S I S V C H Q - I R S Q . . . . . . 7821 GAGATCCAAATATGGCTTAAAAACCCCGTTCATCAAACTCATGAACGCAGCAGGGGCATT E I Q I W L K N P V H Q T H E R S R G I R S K Y G L K T P F I K L M N A A G A F R D P N M A - K P R S S N S - T Q Q G H . . . . . . 7761 CGTAAGACCAAAAGATATCACTACAAATTTGTAATGCCCATACCTGGTTCTAAAAGCAGT R K T K R Y H Y K F V M P I P G S K S S V R P K D I T T N L - C P Y L V L K A V S - D Q K I S L Q I C N A H T W F - K Q . . . . . . 7701 CTTTGGCACATCCGTTGCCCGTATTTTCAATTGATGATAGCCGGATCTCAAGTCAATCTT L W H I R C P Y F Q L M I A G S Q V N L F G T S V A R I F N - - - P D L K S I L S L A H P L P V F S I D D S R I S S Q S . . . . . . 7641 AGAGAAGACACAAGCACCTTGTAACTGATCGAACAAGTCATCAATGCGAGGAAGAGGATA R E D T S T L - L I E Q V I N A R K R I E K T Q A P C N - S N K S S M R G R G Y - R R H K H L V T D R T S H Q C E E E D . . . . . . 7581 CTTGTTCTTTATGGTTACCTTGTTCAACTGCCGGTAGTCTATGCACATCCGAAAACTCCC L V L Y G Y L V Q L P V V Y A H P K T P L F F M V T L F N C R - S M H I R K L P T C S L W L P C S T A G S L C T S E N S . . . . . . 7521 ATCCTTCTTCTTTACAAACAAAACAGGAGCACCCCAAGGAGATGCACTTGGTCTAATGAA I L L L Y K Q N R S T P R R C T W S N E S F F F T N K T G A P Q G D A L G L M K H P S S L Q T K Q E H P K E M H L V - - . . . . . . 7461 ACCCTTGCTCAACAACTCTTGAAGTTGGGCTTTTAACTCTCTTAACTCCGCGGGAGCAAT T L A Q Q L L K L G F - L S - L R G S N P L L N N S - S W A F N S L N S A G A I N P C S T T L E V G L L T L L T P R E Q . . . . . . 7401 TCTATAAGGGGGTATAGAAATGGGGCGAGTACCCGGTTCAAGATCAATACAGAAGTCAAT S I R G Y R N G A S T R F K I N T E V N L - G G I E M G R V P G S R S I Q K S I F Y K G V - K W G E Y P V Q D Q Y R S Q . . . . . . 7341 ATCCCTATTTGGTGTCATACCAGGAAGATCTGCAGGGAACACATCCAGAAACTCACGGAC I P I W C H T R K I C R E H I Q K L T D S L F G V I P G R S A G N T S R N S R T Y P Y L V S Y Q E D L Q G T H P E T H G . . . . . . 7281 CACCGAAACCGACTCAATCGAAGGTACTTGGGTGGTGTCATCCTTGAGATGTGCCAAGAA H R N R L N R R Y L G G V I L E M C Q E T E T D S I E G T W V V S S L R C A K K P P K P T Q S K V L G W C H P - D V P R . . . . . . 7221 AGCTAAACACCCTTTACTAACCATTTTCTTAGCACGAAGAAAGGAGATGATACGCACCGG S - T P F T N H F L S T K K G D D T H R A K H P L L T I F L A R R K E M I R T G K L N T L Y - P F S - H E E R R - Y A P . . . . . . 7161 ATTGGAAGTGTAGTCACCCTCCCACACTAACGGATCTGTCTCAGGCTTGGCTAACGTCAG I G S V V T L P H - R I C L R L G - R Q L E V - S P S H T N G S V S G L A N V R D W K C S H P P T L T D L S Q A W L T S . . . . . . 7101 AGTTTTAGCATTACAATCCAAGATCGCAAAATTCGGAGAAAGCCAAGTCATACCCAGAAT S F S I T I Q D R K I R R K P S H T Q N V L A L Q S K I A K F G E S Q V I P R I E F - H Y N P R S Q N S E K A K S Y P E . . . . . . 7041 TACATCAAAATCATCCATTTCTAAGATAACCAAATCTACATAAGTGTTGCTCCCCAAAAA Y I K I I H F - D N Q I Y I S V A P Q K T S K S S I S K I T K S T - V L L P K K L H Q N H P F L R - P N L H K C C S P K . . . . . . 6981 GTTCACCAAACAAGACCTATATACCTTTTCAACTACCACAGACTCACCCACCGGAGTAGA V H Q T R P I Y L F N Y H R L T H R S R F T K Q D L Y T F S T T T D S P T G V E S S P N K T Y I P F Q L P Q T H P P E - . . . . . . 6921 AACACGAATAGGCATATCAAGTAATTCACAATATAAATTTAGACCGTTAGCAAATGAGGA N T N R H I K - F T I - I - T V S K - G T R I G I S S N S Q Y K F R P L A N E E K H E - A Y Q V I H N I N L D R - Q M R . . . . . . 6861 AGATACATAAGAAAATGTGGATCCAGGATCAAACAATACAGAAGCCATGCAATCACAAAC R Y I R K C G S R I K Q Y R S H A I T N D T - E N V D P G S N N T E A M Q S Q T K I H K K M W I Q D Q T I Q K P C N H K . . . . . . 6801 TAGAAGATTACCTGTGATGACAGCATCAGATGCCTCCGCTTCAGACCGCCCAGGGAAAGC - K I T C D D S I R C L R F R P P R E S R R L P V M T A S D A S A S D R P G K A L E D Y L - - Q H Q M P P L Q T A Q G K . . . . . . 6741 GTAACAATGGGCCCTATCATTTGTCTGTCCATTGCCCCTACCATGTTGTGATGTAGTGGC V T M G P I I C L S I A P T M L - C S G - Q W A L S F V C P L P L P C C D V V A R N N G P Y H L S V H C P Y H V V M - W . . . . . . 6681 TCCGTTTTTCCCATCACCTCGGCCGTTTTGGTGACCACCATTACCCCGACCACCACGTCC S V F P I T S A V L V T T I T P T T T S P F F P S P R P F W - P P L P R P P R P L R F S H H L G R F G D H H Y P D H H V . . . . . . 6621 TCAAGAATAACGGCCTCTACCACGACCACCTCTACCTCTAGCCATTGGGGGTCTATAACT S R I T A S T T T T S T S S H W G S I T Q E - R P L P R P P L P L A I G G L - L L K N N G L Y H D H L Y L - P L G V Y N . . . . . . 6561 CTGTTTTGGACAATTCCTCCTAATATGTCCAGTCTCCCCACATCCATAACACTCTCTGGA L F W T I P P N M S S L P T S I T L S G C F G Q F L L I C P V S P H P - H S L E S V L D N S S - Y V Q S P H I H N T L W . . . . . . 6501 GTCAAGCATAGGTCTCTCAGAGTAGTGTTGACCGGTCTGAGGTGGACCCCCAACTACAGT V K H R S L R V V L T G L R W T P N Y S S S I G L S E - C - P V - G G P P T T V S Q A - V S Q S S V D R S E V D P Q L Q . . . . . . 6441 CTGTAGTGAAGACTGAATGGGTCGAACTGAGTAACCTACGGAACCCTGTCCTCTAGAGTA L - - R L N G S N - V T Y G T L S S R V C S E D - M G R T E - P T E P C P L E - S V V K T E W V E L S N L R N P V L - S . . . . . . 6381 AGAACCATTAAACTCACCTCCCTTTCGAAACCTCTTTGATGTCGATGACATGGTGAATTC R T I K L T S L S K P L - C R - H G E F E P L N S P P F R N L F D V D D M V N S K N H - T H L P F E T S L M S M T W - I . . . . . . 6321 ATCTGGCTTCACTCCTTCCACCTCTACCACGAAATCTACCACTTCTTGGAAGGATTTTAC I W L H S F H L Y H E I Y H F L E G F Y S G F T P S T S T T K S T T S W K D F T H L A S L L P P L P R N L P L L G R I L . . . . . . 6261 CGTAGCTGCTACCTGTAAGGCTGAAATCCGCAACTCTGACCTCAACCCCTTCACAAAACG R S C Y L - G - N P Q L - P Q P L H K T V A A T C K A E I R N S D L N P F T K R P - L L P V R L K S A T L T S T P S Q N . . . . . . 6201 ACGAATCCACTCTTGTGGACTGAAACAAAGTTGGGTGGCATATCTGGATAGTGCACGAAA T N P L L W T E T K L G G I S G - C T K R I H S C G L K Q S W V A Y L D S A R N D E S T L V D - N K V G W H I W I V H E . . . . . . 6141 CTTAGCCTCATATGCAGTAACCGACATCCTACCTTGCTCTAGGCTCAAGAACTCATCTCT L S L I C S N R H P T L L - A Q E L I S L A S Y A V T D I L P C S R L K N S S L T - P H M Q - P T S Y L A L G S R T H L . . . . . . 6081 TTTCCTATCCCTCAAAGTGCGGGGTATATACTTCTCCATAAACAAACTAGAGAATGATGC F P I P Q S A G Y I L L H K Q T R E - C F L S L K V R G I Y F S I N K L E N D A F S Y P S K C G V Y T S P - T N - R M M . . . . . . 6021 CCAAGTCATAGGTGGTGCCTCTGTTGGTTGACACTCAACATGTGACCACCACCACATTTT P S H R W C L C W L T L N M - P P P H F Q V I G G A S V G - H S T C D H H H I L P K S - V V P L L V D T Q H V T T T T F . . . . . . 5961 GGCGTTCCCTTGAAACTGATAACTAACGAACTCAACACCAAACCGTTCTACTATACCCAT G V P L K L I T N E L N T K P F Y Y T H A F P - N - - L T N S T P N R S T I P I W R S L E T D N - R T Q H Q T V L L Y P . . . . . . 5901 CTTGTGTAGTAGCTCATGACAGTCAACCAGAAAATCGTAAGCATCCTCAAATTTAGCACC L V - - L M T V N Q K I V S I L K F S T L C S S S - Q S T R K S - A S S N L A P S C V V A H D S Q P E N R K H P Q I - H . . . . . . 5841 CTTGAAGACTGGAGGTTTCAATTTCAAGAACTTACTGAAAAGTTCATGCTGATCATTTGT L E D W R F Q F Q E L T E K F M L I I C L K T G G F N F K N L L K S S C - S F V P - R L E V S I S R T Y - K V H A D H L . . . . . . 5781 CATTATAGGCCCAGTAGTCAGACGTGGAAACGTGCCTATGTCCAATGAGGCATCCATACG H Y R P S S Q T W K R A Y V Q - G I H T I I G P V V R R G N V P M S N E A S I R S L - A Q - S D V E T C L C P M R H P Y . . . . . . 5721 AGGAGCCATAGTGGCTGCATGTTTTACCTCTGAAACTGGAGGTGTTGGTGCAGAAAACAC R S H S G C M F Y L - N W R C W C R K H G A I V A A C F T S E T G G V G A E N T E E P - W L H V L P L K L E V L V Q K T . . . . . . 5661 TGGAGGGGCCTGACCCTGATCAGACAACCCACTAAGATAAGCGAGAACCTGATTGATCAT W R G L T L I R Q P T K I S E N L I D H G G A - P - S D N P L R - A R T - L I I L E G P D P D Q T T H - D K R E P D - S . . . . . . 5601 CTCTGGGGTAGGTTGGGGTGACAATTCCTTATTTTGCACTTGTTCATTCTCCCCTTCCTC L W G R L G - Q F L I L H L F I L P F L S G V G W G D N S L F C T C S F S P S S S L G - V G V T I P Y F A L V H S P L P . . . . . . 5541 ACCCTCTCTTACCACTTCCTCAGTCGGTGGAGGAGTCACCGCCCTAGGATCAGACAGGCT T L S Y H F L S R W R S H R P R I R Q A P S L T T S S V G G G V T A L G S D R L H P L L P L P Q S V E E S P P - D Q T G . . . . . . 5481 AGGTGCTCGTCCTCTTCCTCTAGAGGACGTCCTCCCTCGACCTCTACCACGGCCTCTTGC R C S S S S S R G R P P S T S T T A S C G A R P L P L E D V L P R P L P R P L A - V L V L F L - R T S S L D L Y H G L L . 5421 CGCTACTCT R Y S A T P L L Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-1-_PGL-3_AGS-4_PPS_1 (6380 5991) (frame '2'; 387 bp, 129 residues) 1 EPLNSPPFRN LFDVDDMVNS SGFTPSTSTT KSTTSWKDFT VAATCKAEIR NSDLNPFTKR 61 RIHSCGLKQS WVAYLDSARN LASYAVTDIL PCSRLKNSSL FLSLKVRGIY FSINKLENDA 121 QVIGGASVG- >C06HBa0153O03.1-1-_PGL-3_AGS-4_PPS_2 (7956 7618) (frame '1'; 336 bp, 112 residues) 1 PKLTLRELGI KLFLPQHFQY HSQMLLIFLP TFRVYQYVIN KYDHKEIQIW LKNPVHQTHE 61 RSRGIRKTKR YHYKFVMPIP GSKSSLWHIR CPYFQLMIAG SQVNLREDTS TL- >C06HBa0153O03.1-1-_PGL-3_AGS-4_PPS_3 (6690 6436) (frame '1'; 252 bp, 84 residues) 1 CSGSVFPITS AVLVTTITPT TTSSRITAST TTTSTSSHWG SITLFWTIPP NMSSLPTSIT 61 LSGVKHRSLR VVLTGLRWTP NYSL- >C06HBa0153O03.1-1-_PGL-3_AGS-4_PPS_4 (7394 7149) (frame '2'; 243 bp, 81 residues) 1 GGIEMGRVPG SRSIQKSISL FGVIPGRSAG NTSRNSRTTE TDSIEGTWVV SSLRCAKKAK 61 HPLLTIFLAR RKEMIRTGLE V- >C06HBa0153O03.1-1-_PGL-3_AGS-4_PPS_5 (5609 5415) (frame '2'; 195 bp, 65 residues) 1 LIISGVGWGD NSLFCTCSFS PSSPSLTTSS VGGGVTALGS DRLGARPLPL EDVLPRPLPR 61 PLAAT 3-phase translation of AGS-4 (+strand): . . . . . . 5413 AGAGTAGCGGCAAGAGGCCGTGGTAGAGGTCGAGGGAGGACGTCCTCTAGAGGAAGAGGA R V A A R G R G R G R G R T S S R G R G E - R Q E A V V E V E G G R P L E E E D S S G K R P W - R S R E D V L - R K R . . . . . . 5473 CGAGCACCTAGCCTGTCTGATCCTAGGGCGGTGACTCCTCCACCGACTGAGGAAGTGGTA R A P S L S D P R A V T P P P T E E V V E H L A C L I L G R - L L H R L R K W - T S T - P V - S - G G D S S T D - G S G . . . . . . 5533 AGAGAGGGTGAGGAAGGGGAGAATGAACAAGTGCAAAATAAGGAATTGTCACCCCAACCT R E G E E G E N E Q V Q N K E L S P Q P E R V R K G R M N K C K I R N C H P N L K R G - G R G E - T S A K - G I V T P T . . . . . . 5593 ACCCCAGAGATGATCAATCAGGTTCTCGCTTATCTTAGTGGGTTGTCTGATCAGGGTCAG T P E M I N Q V L A Y L S G L S D Q G Q P Q R - S I R F S L I L V G C L I R V R Y P R D D Q S G S R L S - W V V - S G S . . . . . . 5653 GCCCCTCCAGTGTTTTCTGCACCAACACCTCCAGTTTCAGAGGTAAAACATGCAGCCACT A P P V F S A P T P P V S E V K H A A T P L Q C F L H Q H L Q F Q R - N M Q P L G P S S V F C T N T S S F R G K T C S H . . . . . . 5713 ATGGCTCCTCGTATGGATGCCTCATTGGACATAGGCACGTTTCCACGTCTGACTACTGGG M A P R M D A S L D I G T F P R L T T G W L L V W M P H W T - A R F H V - L L G Y G S S Y G C L I G H R H V S T S D Y W . . . . . . 5773 CCTATAATGACAAATGATCAGCATGAACTTTTCAGTAAGTTCTTGAAATTGAAACCTCCA P I M T N D Q H E L F S K F L K L K P P L - - Q M I S M N F S V S S - N - N L Q A Y N D K - S A - T F Q - V L E I E T S . . . . . . 5833 GTCTTCAAGGGTGCTAAATTTGAGGATGCTTACGATTTTCTGGTTGACTGTCATGAGCTA V F K G A K F E D A Y D F L V D C H E L S S R V L N L R M L T I F W L T V M S Y S L Q G C - I - G C L R F S G - L S - A . . . . . . 5893 CTACACAAGATGGGTATAGTAGAACGGTTTGGTGTTGAGTTCGTTAGTTATCAGTTTCAA L H K M G I V E R F G V E F V S Y Q F Q Y T R W V - - N G L V L S S L V I S F K T T Q D G Y S R T V W C - V R - L S V S . . . . . . 5953 GGGAACGCCAAAATGTGGTGGTGGTCACATGTTGAGTGTCAACCAACAGAGGCACCACCT G N A K M W W W S H V E C Q P T E A P P G T P K C G G G H M L S V N Q Q R H H L R E R Q N V V V V T C - V S T N R G T T . . . . . . 6013 ATGACTTGGGCATCATTCTCTAGTTTGTTTATGGAGAAGTATATACCCCGCACTTTGAGG M T W A S F S S L F M E K Y I P R T L R - L G H H S L V C L W R S I Y P A L - G Y D L G I I L - F V Y G E V Y T P H F E . . . . . . 6073 GATAGGAAAAGAGATGAGTTCTTGAGCCTAGAGCAAGGTAGGATGTCGGTTACTGCATAT D R K R D E F L S L E Q G R M S V T A Y I G K E M S S - A - S K V G C R L L H M G - E K R - V L E P R A R - D V G Y C I . . . . . . 6133 GAGGCTAAGTTTCGTGCACTATCCAGATATGCCACCCAACTTTGTTTCAGTCCACAAGAG E A K F R A L S R Y A T Q L C F S P Q E R L S F V H Y P D M P P N F V S V H K S - G - V S C T I Q I C H P T L F Q S T R . . . . . . 6193 TGGATTCGTCGTTTTGTGAAGGGGTTGAGGTCAGAGTTGCGGATTTCAGCCTTACAGGTA W I R R F V K G L R S E L R I S A L Q V G F V V L - R G - G Q S C G F Q P Y R - V D S S F C E G V E V R V A D F S L T G . . . . . . 6253 GCAGCTACGGTAAAATCCTTCCAAGAAGTGGTAGATTTCGTGGTAGAGGTGGAAGGAGTG A A T V K S F Q E V V D F V V E V E G V Q L R - N P S K K W - I S W - R W K E - S S Y G K I L P R S G R F R G R G G R S . . . . . . 6313 AAGCCAGATGAATTCACCATGTCATCGACATCAAAGAGGTTTCGAAAGGGAGGTGAGTTT K P D E F T M S S T S K R F R K G G E F S Q M N S P C H R H Q R G F E R E V S L E A R - I H H V I D I K E V S K G R - V . . . . . . 6373 AATGGTTCTTACTCTAGAGGACAGGGTTCCGTAGGTTACTCAGTTCGACCCATTCAGTCT N G S Y S R G Q G S V G Y S V R P I Q S M V L T L E D R V P - V T Q F D P F S L - W F L L - R T G F R R L L S S T H S V . . . . . . 6433 TCACTACAGACTGTAGTTGGGGGTCCACCTCAGACCGGTCAACACTACTCTGAGAGACCT S L Q T V V G G P P Q T G Q H Y S E R P H Y R L - L G V H L R P V N T T L R D L F T T D C S W G S T S D R S T L L - E T . . . . . . 6493 ATGCTTGACTCCAGAGAGTGTTATGGATGTGGGGAGACTGGACATATTAGGAGGAATTGT M L D S R E C Y G C G E T G H I R R N C C L T P E S V M D V G R L D I L G G I V Y A - L Q R V L W M W G D W T Y - E E L . . . . . . 6553 CCAAAACAGAGTTATAGACCCCCAATGGCTAGAGGTAGAGGTGGTCGTGGTAGAGGCCGT P K Q S Y R P P M A R G R G G R G R G R Q N R V I D P Q W L E V E V V V V E A V S K T E L - T P N G - R - R W S W - R P . . . . . . 6613 TATTCTTGAGGACGTGGTGGTCGGGGTAATGGTGGTCACCAAAACGGCCGAGGTGATGGG Y S - G R G G R G N G G H Q N G R G D G I L E D V V V G V M V V T K T A E V M G L F L R T W W S G - W W S P K R P R - W . . . . . . 6673 AAAAACGGAGCCACTACATCACAACATGGTAGGGGCAATGGACAGACAAATGATAGGGCC K N G A T T S Q H G R G N G Q T N D R A K T E P L H H N M V G A M D R Q M I G P E K R S H Y I T T W - G Q W T D K - - G . . . . . . 6733 CATTGTTACGCTTTCCCTGGGCGGTCTGAAGCGGAGGCATCTGATGCTGTCATCACAGGT H C Y A F P G R S E A E A S D A V I T G I V T L S L G G L K R R H L M L S S Q V P L L R F P W A V - S G G I - C C H H R . . . . . . 6793 AATCTTCTAGTTTGTGATTGCATGGCTTCTGTATTGTTTGATCCTGGATCCACATTTTCT N L L V C D C M A S V L F D P G S T F S I F - F V I A W L L Y C L I L D P H F L - S S S L - L H G F C I V - S W I H I F . . . . . . 6853 TATGTATCTTCCTCATTTGCTAACGGTCTAAATTTATATTGTGAATTACTTGATATGCCT Y V S S S F A N G L N L Y C E L L D M P M Y L P H L L T V - I Y I V N Y L I C L L C I F L I C - R S K F I L - I T - Y A . . . . . . 6913 ATTCGTGTTTCTACTCCGGTGGGTGAGTCTGTGGTAGTTGAAAAGGTATATAGGTCTTGT I R V S T P V G E S V V V E K V Y R S C F V F L L R W V S L W - L K R Y I G L V Y S C F Y S G G - V C G S - K G I - V L . . . . . . 6973 TTGGTGAACTTTTTGGGGAGCAACACTTATGTAGATTTGGTTATCTTAGAAATGGATGAT L V N F L G S N T Y V D L V I L E M D D W - T F W G A T L M - I W L S - K W M I F G E L F G E Q H L C R F G Y L R N G - . . . . . . 7033 TTTGATGTAATTCTGGGTATGACTTGGCTTTCTCCGAATTTTGCGATCTTGGATTGTAAT F D V I L G M T W L S P N F A I L D C N L M - F W V - L G F L R I L R S W I V M F - C N S G Y D L A F S E F C D L G L - . . . . . . 7093 GCTAAAACTCTGACGTTAGCCAAGCCTGAGACAGATCCGTTAGTGTGGGAGGGTGACTAC A K T L T L A K P E T D P L V W E G D Y L K L - R - P S L R Q I R - C G R V T T C - N S D V S Q A - D R S V S V G G - L . . . . . . 7153 ACTTCCAATCCGGTGCGTATCATCTCCTTTCTTCGTGCTAAGAAAATGGTTAGTAAAGGG T S N P V R I I S F L R A K K M V S K G L P I R C V S S P F F V L R K W L V K G H F Q S G A Y H L L S S C - E N G - - R . . . . . . 7213 TGTTTAGCTTTCTTGGCACATCTCAAGGATGACACCACCCAAGTACCTTCGATTGAGTCG C L A F L A H L K D D T T Q V P S I E S V - L S W H I S R M T P P K Y L R L S R V F S F L G T S Q G - H H P S T F D - V . . . . . . 7273 GTTTCGGTGGTCCGTGAGTTTCTGGATGTGTTCCCTGCAGATCTTCCTGGTATGACACCA V S V V R E F L D V F P A D L P G M T P F R W S V S F W M C S L Q I F L V - H Q G F G G P - V S G C V P C R S S W Y D T . . . . . . 7333 AATAGGGATATTGACTTCTGTATTGATCTTGAACCGGGTACTCGCCCCATTTCTATACCC N R D I D F C I D L E P G T R P I S I P I G I L T S V L I L N R V L A P F L Y P K - G Y - L L Y - S - T G Y S P H F Y T . . . . . . 7393 CCTTATAGAATTGCTCCCGCGGAGTTAAGAGAGTTAAAAGCCCAACTTCAAGAGTTGTTG P Y R I A P A E L R E L K A Q L Q E L L L I E L L P R S - E S - K P N F K S C - P L - N C S R G V K R V K S P T S R V V . . . . . . 7453 AGCAAGGGTTTCATTAGACCAAGTGCATCTCCTTGGGGTGCTCCTGTTTTGTTTGTAAAG S K G F I R P S A S P W G A P V L F V K A R V S L D Q V H L L G V L L F C L - R E Q G F H - T K C I S L G C S C F V C K . . . . . . 7513 AAGAAGGATGGGAGTTTTCGGATGTGCATAGACTACCGGCAGTTGAACAAGGTAACCATA K K D G S F R M C I D Y R Q L N K V T I R R M G V F G C A - T T G S - T R - P - E E G W E F S D V H R L P A V E Q G N H . . . . . . 7573 AAGAACAAGTATCCTCTTCCTCGCATTGATGACTTGTTCGATCAGTTACAAGGTGCTTGT K N K Y P L P R I D D L F D Q L Q G A C R T S I L F L A L M T C S I S Y K V L V K E Q V S S S S H - - L V R S V T R C L . . . . . . 7633 GTCTTCTCTAAGATTGACTTGAGATCCGGCTATCATCAATTGAAAATACGGGCAACGGAT V F S K I D L R S G Y H Q L K I R A T D S S L R L T - D P A I I N - K Y G Q R M C L L - D - L E I R L S S I E N T G N G . . . . . . 7693 GTGCCAAAGACTGCTTTTAGAACCAGGTATGGGCATTACAAATTTGTAGTGATATCTTTT V P K T A F R T R Y G H Y K F V V I S F C Q R L L L E P G M G I T N L - - Y L L C A K D C F - N Q V W A L Q I C S D I F . . . . . . 7753 GGTCTTACGAATGCCCCTGCTGCGTTCATGAGTTTGATGAACGGGGTTTTTAAGCCATAT G L T N A P A A F M S L M N G V F K P Y V L R M P L L R S - V - - T G F L S H I W S Y E C P C C V H E F D E R G F - A I . . . . . . 7813 TTGGATCTCTTTGTGATCGTATTTATTGATGACATACTGATATACTCTAAAAGTAGGGAG L D L F V I V F I D D I L I Y S K S R E W I S L - S Y L L M T Y - Y T L K V G R F G S L C D R I Y - - H T D I L - K - G . . . . . . 7873 GAAAATGAGGAGCATTTGAGAATGGTATTGGAAATGTTGAGGGAGAAAAAGCTTTATGCC E N E E H L R M V L E M L R E K K L Y A K M R S I - E W Y W K C - G R K S F M P G K - G A F E N G I G N V E G E K A L C . . . . . . 7933 AAGTTCTCTAAGTGTGAGTTTTGGCTAGATGCAGTGTCCTTCTTGGGGCACGTGGTTTCT K F S K C E F W L D A V S F L G H V V S S S L S V S F G - M Q C P S W G T W F L Q V L - V - V L A R C S V L L G A R G F . . . . . . 7993 AAGGATAGAGTGATGGTGGATCCTTCTAAGATTGAGACAGTGAAGAATTGGGTAAGACCT K D R V M V D P S K I E T V K N W V R P R I E - W W I L L R L R Q - R I G - D L - G - S D G G S F - D - D S E E L G K T . . . . . . 8053 ACTAATGTGTCAGAAATAAGGAGCTTTGTTGGGTTAGCTAGCTACTACCGCCGATTTGTC T N V S E I R S F V G L A S Y Y R R F V L M C Q K - G A L L G - L A T T A D L S Y - C V R N K E L C W V S - L L P P I C . 8113 AAGGGATTC K G F R D Q G I Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-1+_PGL-3_AGS-4_PPS_1 (6622 8121) (frame '1'; 1500 bp, 500 residues) 1 GRGGRGNGGH QNGRGDGKNG ATTSQHGRGN GQTNDRAHCY AFPGRSEAEA SDAVITGNLL 61 VCDCMASVLF DPGSTFSYVS SSFANGLNLY CELLDMPIRV STPVGESVVV EKVYRSCLVN 121 FLGSNTYVDL VILEMDDFDV ILGMTWLSPN FAILDCNAKT LTLAKPETDP LVWEGDYTSN 181 PVRIISFLRA KKMVSKGCLA FLAHLKDDTT QVPSIESVSV VREFLDVFPA DLPGMTPNRD 241 IDFCIDLEPG TRPISIPPYR IAPAELRELK AQLQELLSKG FIRPSASPWG APVLFVKKKD 301 GSFRMCIDYR QLNKVTIKNK YPLPRIDDLF DQLQGACVFS KIDLRSGYHQ LKIRATDVPK 361 TAFRTRYGHY KFVVISFGLT NAPAAFMSLM NGVFKPYLDL FVIVFIDDIL IYSKSREENE 421 EHLRMVLEML REKKLYAKFS KCEFWLDAVS FLGHVVSKDR VMVDPSKIET VKNWVRPTNV 481 SEIRSFVGLA SYYRRFVKGF >C06HBa0153O03.1-1+_PGL-3_AGS-4_PPS_2 (5413 6621) (frame '1'; 1206 bp, 402 residues) 1 RVAARGRGRG RGRTSSRGRG RAPSLSDPRA VTPPPTEEVV REGEEGENEQ VQNKELSPQP 61 TPEMINQVLA YLSGLSDQGQ APPVFSAPTP PVSEVKHAAT MAPRMDASLD IGTFPRLTTG 121 PIMTNDQHEL FSKFLKLKPP VFKGAKFEDA YDFLVDCHEL LHKMGIVERF GVEFVSYQFQ 181 GNAKMWWWSH VECQPTEAPP MTWASFSSLF MEKYIPRTLR DRKRDEFLSL EQGRMSVTAY 241 EAKFRALSRY ATQLCFSPQE WIRRFVKGLR SELRISALQV AATVKSFQEV VDFVVEVEGV 301 KPDEFTMSST SKRFRKGGEF NGSYSRGQGS VGYSVRPIQS SLQTVVGGPP QTGQHYSERP 361 MLDSRECYGC GETGHIRRNC PKQSYRPPMA RGRGGRGRGR YS- AGS-5 (7495 7036,6291 6105) SCR (e 0.933 d 0.000 a 0.000,e 0.930) Exon 1 7495 7036 ( 460 n); score: 0.933 Intron 1 7035 6292 ( 744 n); Pd: 0.000 Pa: 0.000 Exon 2 6291 6105 ( 187 n); score: 0.930 PGS (7495 7036,6291 6105) SGN-E353359+ 3-phase translation of AGS-5 (-strand): . . . . . . 7495 GAGCACCCCAAGGAGATGCACTTGGTCTAATGAAACCCTTGCTCAACAACTCTTGAAGTT E H P K E M H L V - - N P C S T T L E V S T P R R C T W S N E T L A Q Q L L K L A P Q G D A L G L M K P L L N N S - S . . . . . . 7435 GGGCTTTTAACTCTCTTAACTCCGCGGGAGCAATTCTATAAGGGGGTATAGAAATGGGGC G L L T L L T P R E Q F Y K G V - K W G G F - L S - L R G S N S I R G Y R N G A W A F N S L N S A G A I L - G G I E M G . . . . . . 7375 GAGTACCCGGTTCAAGATCAATACAGAAGTCAATATCCCTATTTGGTGTCATACCAGGAA E Y P V Q D Q Y R S Q Y P Y L V S Y Q E S T R F K I N T E V N I P I W C H T R K R V P G S R S I Q K S I S L F G V I P G . . . . . . 7315 GATCTGCAGGGAACACATCCAGAAACTCACGGACCACCGAAACCGACTCAATCGAAGGTA D L Q G T H P E T H G P P K P T Q S K V I C R E H I Q K L T D H R N R L N R R Y R S A G N T S R N S R T T E T D S I E G . . . . . . 7255 CTTGGGTGGTGTCATCCTTGAGATGTGCCAAGAAAGCTAAACACCCTTTACTAACCATTT L G W C H P - D V P R K L N T L Y - P F L G G V I L E M C Q E S - T P F T N H F T W V V S S L R C A K K A K H P L L T I . . . . . . 7195 TCTTAGCACGAAGAAAGGAGATGATACGCACCGGATTGGAAGTGTAGTCACCCTCCCACA S - H E E R R - Y A P D W K C S H P P T L S T K K G D D T H R I G S V V T L P H F L A R R K E M I R T G L E V - S P S H . . . . . . 7135 CTAACGGATCTGTCTCAGGCTTGGCTAACGTCAGAGTTTTAGCATTACAATCCAAGATCG L T D L S Q A W L T S E F - H Y N P R S - R I C L R L G - R Q S F S I T I Q D R T N G S V S G L A N V R V L A L Q S K I . . . . : . . 7075 CAAAATTCGGAGAAAGCCAAGTCATACCCAGAATTACATC : GAAATCTACCACTTCTTGGA Q N S E K A K S Y P E L H : R N L P L L G K I R R K P S H T Q N Y I : E I Y H F L E A K F G E S Q V I P R I T S : K S T T S W . . . . . . 6271 AGGATTTTACCGTAGCTGCTACCTGTAAGGCTGAAATCCGCAACTCTGACCTCAACCCCT R I L P - L L P V R L K S A T L T S T P G F Y R S C Y L - G - N P Q L - P Q P L K D F T V A A T C K A E I R N S D L N P . . . . . . 6211 TCACAAAACGACGAATCCACTCTTGTGGACTGAAACAAAGTTGGGTGGCATATCTGGATA S Q N D E S T L V D - N K V G W H I W I H K T T N P L L W T E T K L G G I S G - F T K R R I H S C G L K Q S W V A Y L D . . . . . 6151 GTGCACGAAACTTAGCCTCATATGCAGTAACCGACATCCTACCTTGC V H E T - P H M Q - P T S Y L C T K L S L I C S N R H P T L S A R N L A S Y A V T D I L P C Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-1-_PGL-3_AGS-5_PPS_1 (7148 7036,6291 6105) (frame '0'; 300 bp, 100 residues) 1 SPSHTNGSVS GLANVRVLAL QSKIAKFGES QVIPRITSKS TTSWKDFTVA ATCKAEIRNS 61 DLNPFTKRRI HSCGLKQSWV AYLDSARNLA SYAVTDILPC >C06HBa0153O03.1-1-_PGL-3_AGS-5_PPS_2 (7394 7149) (frame '0'; 243 bp, 81 residues) 1 GGIEMGRVPG SRSIQKSISL FGVIPGRSAG NTSRNSRTTE TDSIEGTWVV SSLRCAKKAK 61 HPLLTIFLAR RKEMIRTGLE V- AGS-6 (6940 6911,6875 6345) SCR (e 0.633 d 0.631 a 0.000,e 0.915) Exon 1 6940 6911 ( 30 n); score: 0.633 Intron 1 6910 6876 ( 35 n); Pd: 0.631 Pa: 0.000 Exon 2 6875 6345 ( 531 n); score: 0.915 PGS (6940 6911,6875 6345) SGN-E577713+ 3-phase translation of AGS-6 (-strand): . . . : . . . 6940 ACTCACCCACCGGAGTAGAAACACGAATAG : TTAGCAAATGAGGAAGATACATAAGAAAAT T H P P E - K H E - : L A N E E D T - E N L T H R S R N T N S : - Q M R K I H K K M S P T G V E T R I : V S K - G R Y I R K . . . . . . 6845 GTGGATCCAGGATCAAACAATACAGAAGCCATGCAATCACAAACTAGAAGATTACCTGTG V D P G S N N T E A M Q S Q T R R L P V W I Q D Q T I Q K P C N H K L E D Y L - C G S R I K Q Y R S H A I T N - K I T C . . . . . . 6785 ATGACAGCATCAGATGCCTCCGCTTCAGACCGCCCAGGGAAAGCGTAACAATGGGCCCTA M T A S D A S A S D R P G K A - Q W A L - Q H Q M P P L Q T A Q G K R N N G P Y D D S I R C L R F R P P R E S V T M G P . . . . . . 6725 TCATTTGTCTGTCCATTGCCCCTACCATGTTGTGATGTAGTGGCTCCGTTTTTCCCATCA S F V C P L P L P C C D V V A P F F P S H L S V H C P Y H V V M - W L R F S H H I I C L S I A P T M L - C S G S V F P I . . . . . . 6665 CCTCGGCCGTTTTGGTGACCACCATTACCCCGACCACCACGTCCTCAAGAATAACGGCCT P R P F W - P P L P R P P R P Q E - R P L G R F G D H H Y P D H H V L K N N G L T S A V L V T T I T P T T T S S R I T A . . . . . . 6605 CTACCACGACCACCTCTACCTCTAGCCATTGGGGGTCTATAACTCTGTTTTGGACAATTC L P R P P L P L A I G G L - L C F G Q F Y H D H L Y L - P L G V Y N S V L D N S S T T T T S T S S H W G S I T L F W T I . . . . . . 6545 CTCCTAATATGTCCAGTCTCCCCACATCCATAACACTCTCTGGAGTCAAGCATAGGTCTC L L I C P V S P H P - H S L E S S I G L S - Y V Q S P H I H N T L W S Q A - V S P P N M S S L P T S I T L S G V K H R S . . . . . . 6485 TCAGAGTAGTGTTGACCGGTCTGAGGTGGACCCCCAACTACAGTCTGTAGTGAAGACTGA S E - C - P V - G G P P T T V C S E D - Q S S V D R S E V D P Q L Q S V V K T E L R V V L T G L R W T P N Y S L - - R L . . . . . . 6425 ATGGGTCGAACTGAGTAACCTACGGAACCCTGTCCTCTAGAGTAAGAACCATTAAACTCA M G R T E - P T E P C P L E - E P L N S W V E L S N L R N P V L - S K N H - T H N G S N - V T Y G T L S S R V R T I K L . . . 6365 CCTCCCTTTCGAAACCTCTTT P P F R N L F L P F E T S T S L S K P L Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-1-_PGL-3_AGS-6_PPS_1 (6690 6436) (frame '0'; 252 bp, 84 residues) 1 CSGSVFPITS AVLVTTITPT TTSSRITAST TTTSTSSHWG SITLFWTIPP NMSSLPTSIT 61 LSGVKHRSLR VVLTGLRWTP NYSL- AGS-7 (9379 8350) SCR (e 0.881) Exon 1 9379 8350 (1030 n); score: 0.881 PGS (8935 8350) SGN-E352950+ PGS (8935 8350) SGN-E357100+ PGS (8935 8383) SGN-E352647+ PGS (9379 8839) SGN-E353207+ 3-phase translation of AGS-7 (-strand): . . . . . . 9379 TCCGCATGCAATGTTTTCCAAAACTTAAAAGTAAACTGCGTACCTCTATCTGATATAATG S A C N V F Q N L K V N C V P L S D I M P H A M F S K T - K - T A Y L Y L I - W R M Q C F P K L K S K L R T S I - Y N . . . . . . 9319 GAGAGTGGAACCCCATGCAATCGCACCACTTCCGAGATGTAAAGTTTGGATAACTTCTCT E S G T P C N R T T S E M - S L D N F S R V E P H A I A P L P R C K V W I T S L G E W N P M Q S H H F R D V K F G - L L . . . . . . 9259 GCATTGTAAGGCACCTTGACCGGAATGAAGTGAGCAGACTTAGTTAACCTATCAACAATT A L - G T L T G M K - A D L V N L S T I H C K A P - P E - S E Q T - L T Y Q Q L C I V R H L D R N E V S R L S - P I N N . . . . . . 9199 ACCCAAATGGAATCAAACTTTCCCAATGTCTTTGGAAGACCAACCACGAAATCCATTGCA T Q M E S N F P N V F G R P T T K S I A P K W N Q T F P M S L E D Q P R N P L Q Y P N G I K L S Q C L W K T N H E I H C . . . . . . 9139 ATCCTTTCCCACTTCCATTCGGGAATGGGCATTCTCTGAAGTGTTCCTCCGGGCCTTTGG I L S H F H S G M G I L - S V P P G L W S F P T S I R E W A F S E V F L R A F G N P F P L P F G N G H S L K C S S G P L . . . . . . 9079 TGTTCATACTTTACCTGTTGACAGTTTGGACATTTGGCAATAAAATCCACAATATCACGC C S Y F T C - Q F G H L A I K S T I S R V H T L P V D S L D I W Q - N P Q Y H A V F I L Y L L T V W T F G N K I H N I T . . . . . . 9019 TTCATTCTACTCTACCAAAGTGTTGTTTTAGGTCACGGTACATTTTGGTTGCACCCGGAT F I L L Y Q S V V L G H G T F W L H P D S F Y S T K V L F - V T V H F G C T R M L H S T L P K C C F R S R Y I L V A P G . . . . . . 8959 GTATCGAATACCTTGAACTATGAGCCTCTGTCAAAATAGTGTTGATTAAATCATCGACGC V S N T L N Y E P L S K - C - L N H R R Y R I P - T M S L C Q N S V D - I I D A C I E Y L E L - A S V K I V L I K S S T . . . . . . 8899 GGGGTACACATACCCTTCCCTTAATCCTCAAAACACCTTCCTCATCGATTGTCGCTTCTT G V H I P F P - S S K H L P H R L S L L G Y T Y P S L N P Q N T F L I D C R F F R G T H T L P L I L K T P S S S I V A S . . . . . . 8839 TAGCCTCTCCTTGCAACACTTTATCTCGGATCCGGATCAATTTCTCATCATTAAACTGCT - P L L A T L Y L G S G S I S H H - T A S L S L Q H F I S D P D Q F L I I K L L L A S P C N T L S R I R I N F S S L N C . . . . . . 8779 TTCCCTTAATCTTGTCAAGGAAGGGAGATCTTGCCTCCACACAAGCTAAAAATCCTCCCT F P - S C Q G R E I L P P H K L K I L P S L N L V K E G R S C L H T S - K S S L F P L I L S R K G D L A S T Q A K N P P . . . . . . 8719 TCTCATTTACTTCTAATCTTATAAGGTCGTTAGCCAGAATCTAAACTTCTCTAGCCAATA S H L L L I L - G R - P E S K L L - P I L I Y F - S Y K V V S Q N L N F S S Q - F S F T S N L I R S L A R I - T S L A N . . . . . . 8659 GGCGTCTAGAAGCTTGCAAGTGAGCTAGACATCCCATGCTTCCCGCCTTTCTACTTAAAG G V - K L A S E L D I P C F P P F Y L K A S R S L Q V S - T S H A S R L S T - S R R L E A C K - A R H P M L P A F L L K . . . . . . 8599 CATCCGCTACAACATTCGCCTTCCCCGAATGATACAAAATAGTGATATCGTAGTCCTTCA H P L Q H S P S P N D T K - - Y R S P S I R Y N I R L P R M I Q N S D I V V L Q A S A T T F A F P E - Y K I V I S - S F . . . . . . 8539 GTAGTTCCATCCATCTCCTCTGTCTCAAGTTCAAATCTTTCAGAGTAAAGACATACTGTA V V P S I S S V S S S N L S E - R H T V - F H P S P L S Q V Q I F Q S K D I L - S S S I H L L C L K F K S F R V K T Y C . . . . . . 8479 GGCTACGATGATCCGTATAGACCTCACACTTAACCCCATATAAATAGTGTCTCCATTGCT G Y D D P Y R P H T - P H I N S V S I A A T M I R I D L T L N P I - I V S P L L R L R - S V - T S H L T P Y K - C L H C . . . . . . 8419 TTAATGCAAACACTACTGCGGCCAATTCCAAATCATGAGTTGGATAGTTACGTTCATGCA L M Q T L L R P I P N H E L D S Y V H A - C K H Y C G Q F Q I M S W I V T F M H F N A N T T A A N S K S - V G - L R S C . 8359 CTTTTAATTG L L I F - L T F N Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-1-_PGL-3_AGS-7_PPS_1 (9212 8937) (frame '0'; 273 bp, 91 residues) 1 PINNYPNGIK LSQCLWKTNH EIHCNPFPLP FGNGHSLKCS SGPLVFILYL LTVWTFGNKI 61 HNITLHSTLP KCCFRSRYIL VAPGCIEYLE L- >C06HBa0153O03.1-1-_PGL-3_AGS-7_PPS_2 (8936 8676) (frame '0'; 258 bp, 86 residues) 1 ASVKIVLIKS STRGTHTLPL ILKTPSSSIV ASLASPCNTL SRIRINFSSL NCFPLILSRK 61 GDLASTQAKN PPFSFTSNLI RSLARI- 3-phase translation of AGS-7 (+strand): . . . . . . 8350 CAATTAAAAGTGCATGAACGTAACTATCCAACTCATGATTTGGAATTGGCCGCAGTAGTG Q L K V H E R N Y P T H D L E L A A V V N - K C M N V T I Q L M I W N W P Q - C I K S A - T - L S N S - F G I G R S S . . . . . . 8410 TTTGCATTAAAGCAATGGAGACACTATTTATATGGGGTTAAGTGTGAGGTCTATACGGAT F A L K Q W R H Y L Y G V K C E V Y T D L H - S N G D T I Y M G L S V R S I R I V C I K A M E T L F I W G - V - G L Y G . . . . . . 8470 CATCGTAGCCTACAGTATGTCTTTACTCTGAAAGATTTGAACTTGAGACAGAGGAGATGG H R S L Q Y V F T L K D L N L R Q R R W I V A Y S M S L L - K I - T - D R G D G S S - P T V C L Y S E R F E L E T E E M . . . . . . 8530 ATGGAACTACTGAAGGACTACGATATCACTATTTTGTATCATTCGGGGAAGGCGAATGTT M E L L K D Y D I T I L Y H S G K A N V W N Y - R T T I S L F C I I R G R R M L D G T T E G L R Y H Y F V S F G E G E C . . . . . . 8590 GTAGCGGATGCTTTAAGTAGAAAGGCGGGAAGCATGGGATGTCTAGCTCACTTGCAAGCT V A D A L S R K A G S M G C L A H L Q A - R M L - V E R R E A W D V - L T C K L C S G C F K - K G G K H G M S S S L A S . . . . . . 8650 TCTAGACGCCTATTGGCTAGAGAAGTTTAGATTCTGGCTAACGACCTTATAAGATTAGAA S R R L L A R E V - I L A N D L I R L E L D A Y W L E K F R F W L T T L - D - K F - T P I G - R S L D S G - R P Y K I R . . . . . . 8710 GTAAATGAGAAGGGAGGATTTTTAGCTTGTGTGGAGGCAAGATCTCCCTTCCTTGACAAG V N E K G G F L A C V E A R S P F L D K - M R R E D F - L V W R Q D L P S L T R S K - E G R I F S L C G G K I S L P - Q . . . . . . 8770 ATTAAGGGAAAGCAGTTTAATGATGAGAAATTGATCCGGATCCGAGATAAAGTGTTGCAA I K G K Q F N D E K L I R I R D K V L Q L R E S S L M M R N - S G S E I K C C K D - G K A V - - - E I D P D P R - S V A . . . . . . 8830 GGAGAGGCTAAAGAAGCGACAATCGATGAGGAAGGTGTTTTGAGGATTAAGGGAAGGGTA G E A K E A T I D E E G V L R I K G R V E R L K K R Q S M R K V F - G L R E G Y R R G - R S D N R - G R C F E D - G K G . . . . . . 8890 TGTGTACCCCGCGTCGATGATTTAATCAACACTATTTTGACAGAGGCTCATAGTTCAAGG C V P R V D D L I N T I L T E A H S S R V Y P A S M I - S T L F - Q R L I V Q G M C T P R R - F N Q H Y F D R G S - F K . . . . . . 8950 TATTCGATACATCCGGGTGCAACCAAAATGTACCGTGACCTAAAACAACACTTTGGTAGA Y S I H P G A T K M Y R D L K Q H F G R I R Y I R V Q P K C T V T - N N T L V E V F D T S G C N Q N V P - P K T T L W - . . . . . . 9010 GTAGAATGAAGCGTGATATTGTGGATTTTATTGCCAAATGTCCAAACTGTCAACAGGTAA V E - S V I L W I L L P N V Q T V N R - - N E A - Y C G F Y C Q M S K L S T G K S R M K R D I V D F I A K C P N C Q Q V . . . . . . 9070 AGTATGAACACCAAAGGCCCGGAGGAACACTTCAGAGAATGCCCATTCCCGAATGGAAGT S M N T K G P E E H F R E C P F P N G S V - T P K A R R N T S E N A H S R M E V K Y E H Q R P G G T L Q R M P I P E W K . . . . . . 9130 GGGAAAGGATTGCAATGGATTTCGTGGTTGGTCTTCCAAAGACATTGGGAAAGTTTGATT G K G L Q W I S W L V F Q R H W E S L I G K D C N G F R G W S S K D I G K V - F W E R I A M D F V V G L P K T L G K F D . . . . . . 9190 CCATTTGGGTAATTGTTGATAGGTTAACTAAGTCTGCTCACTTCATTCCGGTCAAGGTGC P F G - L L I G - L S L L T S F R S R C H L G N C - - V N - V C S L H S G Q G A S I W V I V D R L T K S A H F I P V K V . . . . . . 9250 CTTACAATGCAGAGAAGTTATCCAAACTTTACATCTCGGAAGTGGTGCGATTGCATGGGG L T M Q R S Y P N F T S R K W C D C M G L Q C R E V I Q T L H L G S G A I A W G P Y N A E K L S K L Y I S E V V R L H G . . . . . . 9310 TTCCACTCTCCATTATATCAGATAGAGGTACGCAGTTTACTTTTAAGTTTTGGAAAACAT F H S P L Y Q I E V R S L L L S F G K H S T L H Y I R - R Y A V Y F - V L E N I V P L S I I S D R G T Q F T F K F W K T . 9370 TGCATGCGGA C M R A C G L H A Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-1+_PGL-3_AGS-7_PPS_1 (9009 9377) (frame '0'; 369 bp, 123 residues) 1 SRMKRDIVDF IAKCPNCQQV KYEHQRPGGT LQRMPIPEWK WERIAMDFVV GLPKTLGKFD 61 SIWVIVDRLT KSAHFIPVKV PYNAEKLSKL YISEVVRLHG VPLSIISDRG TQFTFKFWKT 121 LHA >C06HBa0153O03.1-1+_PGL-3_AGS-7_PPS_2 (8680 9018) (frame '1'; 336 bp, 112 residues) 1 ILANDLIRLE VNEKGGFLAC VEARSPFLDK IKGKQFNDEK LIRIRDKVLQ GEAKEATIDE 61 EGVLRIKGRV CVPRVDDLIN TILTEAHSSR YSIHPGATKM YRDLKQHFGR VE- >C06HBa0153O03.1-1+_PGL-3_AGS-7_PPS_3 (8350 8679) (frame '1'; 327 bp, 109 residues) 1 QLKVHERNYP THDLELAAVV FALKQWRHYL YGVKCEVYTD HRSLQYVFTL KDLNLRQRRW 61 MELLKDYDIT ILYHSGKANV VADALSRKAG SMGCLAHLQA SRRLLAREV- AGS-8 (9597 9425) SCR (e 0.861) Exon 1 9597 9425 ( 173 n); score: 0.861 PGS (9597 9425) SGN-E577888+ 3-phase translation of AGS-8 (-strand): . . . . . . 9597 CTACATCTTCTCCCATATAGTGCCTCAAATGGGGCCATATCAATGCTTGAGTGATAGCTA L H L L P Y S A S N G A I S M L E - - L Y I F S H I V P Q M G P Y Q C L S D S Y T S S P I - C L K W G H I N A - V I A . . . . . . 9537 TTATTGTAGGAGAACTCTGCTAAGGGTAGCTATCCCACTGACCACCAAATTCTATCACAC L L - E N S A K G S Y P T D H Q I L S H Y C R R T L L R V A I P L T T K F Y H T I I V G E L C - G - L S H - P P N S I T . . . . . . 9477 ATGCACGAAGCATATCCTCCAACACTTGAATCGTTCGCTCAGACTGACCATCG M H E A Y P P T L E S F A Q T D H C T K H I L Q H L N R S L R L T I H A R S I S S N T - I V R S D - P S Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-8 (+strand): . . . . . . 9425 CGATGGTCAGTCTGAGCGAACGATTCAAGTGTTGGAGGATATGCTTCGTGCATGTGTGAT R W S V - A N D S S V G G Y A S C M C D D G Q S E R T I Q V L E D M L R A C V I M V S L S E R F K C W R I C F V H V - . . . . . . 9485 AGAATTTGGTGGTCAGTGGGATAGCTACCCTTAGCAGAGTTCTCCTACAATAATAGCTAT R I W W S V G - L P L A E F S Y N N S Y E F G G Q W D S Y P - Q S S P T I I A I - N L V V S G I A T L S R V L L Q - - L . . . . . . 9545 CACTCAAGCATTGATATGGCCCCATTTGAGGCACTATATGGGAGAAGATGTAG H S S I D M A P F E A L Y G R R C T Q A L I W P H L R H Y M G E D V S L K H - Y G P I - G T I W E K M - Maximal non-overlapping open reading frames (>= 64 codons): none AGS-9 (10476 9659) SCR (e 0.867) Exon 1 10476 9659 ( 818 n); score: 0.867 PGS (10420 9659) SGN-E354383- PGS (10476 10001) SGN-E252199- 3-phase translation of AGS-9 (-strand): . . . . . . 10476 CTGTGACGGTCCGTCACACCTGTGACGGTCCGTCCTGCCATTTCGTTACGAAGTTCAGAA L - R S V T P V T V R P A I S L R S S E C D G P S H L - R S V L P F R Y E V Q K V T V R H T C D G P S C H F V T K F R . . . . . . 10416 AGTCGATTTCAGTACCCAATTTTCAGAATTCTAAGTATTTTGGAATGAGATACCCTCAAC S R F Q Y P I F R I L S I L E - D T L N V D F S T Q F S E F - V F W N E I P S T K S I S V P N F Q N S K Y F G M R Y P Q . . . . . . 10356 GGTCTGTCGTGCCCATGACGGTCCGTCGTGGGTTCCGTCATCTCAGCCTGTTTTTCAAGA G L S C P - R S V V G S V I S A C F S R V C R A H D G P S W V P S S Q P V F Q E R S V V P M T V R R G F R H L S L F F K . . . . . . 10296 AATAAAATCTGCTGCTCGAAACGACTAAACAGGTCGTTACAATAGATACCAATTTACCCA N K I C C S K R L N R S L Q - I P I Y P I K S A A R N D - T G R Y N R Y Q F T H K - N L L L E T T K Q V V T I D T N L P . . . . . . 10236 TCGTTCGTCCCCGAACGATCACAAGAAGGAAAACAAGGGCGAAAAGGAGTACCTGAATCT S F V P E R S Q E G K Q G R K G V P E S R S S P N D H K K E N K G E K E Y L N L I V R P R T I T R R K T R A K R S T - I . . . . . . 10176 GTAAACAGGTGTGGGTATCTTTCTCGCATATCAGCCTTGTTCTCCCAAGTGGCTTCTTCG V N R C G Y L S R I S A L F S Q V A S S - T G V G I F L A Y Q P C S P K W L L R C K Q V W V S F S H I S L V L P S G F F . . . . . . 10116 ACTGGTCGATTCTTCCTTTGAACTTTGATGGATGCAATCTCCCTTGATCTCAACTTGCGA T G R F F L - T L M D A I S L D L N L R L V D S S F E L - W M Q S P L I S T C E D W S I L P L N F D G C N L P - S Q L A . . . . . . 10056 ATTTCTCTATCTAGAATGGCAACAGGCTCCTCCTCATAAGTCAAATTTTCATCAAGCAAA I S L S R M A T G S S S - V K F S S S K F L Y L E W Q Q A P P H K S N F H Q A K N F S I - N G N R L L L I S Q I F I K Q . . . . . . 9996 ACTGAATCCCAACGGATAATGTAGTTTCCATCCCCATGGTATCTTTTCAACATAGACACA T E S Q R I M - F P S P W Y L F N I D T L N P N G - C S F H P H G I F S T - T H N - I P T D N V V S I P M V S F Q H R H . . . . . . 9936 TGAAATACCGGATGCACTCCGGACAGCCCTGGAGGCAAGGCTAACTCATAAGCCACCTCC - N T G C T P D S P G G K A N S - A T S E I P D A L R T A L E A R L T H K P P P M K Y R M H S G Q P W R Q G - L I S H L . . . . . . 9876 CCTACTCGCTTGAGTACTTCAAATGGACCAATATACCTTGGGCTAAGTTTACCTCGCTTA P T R L S T S N G P I Y L G L S L P R L L L A - V L Q M D Q Y T L G - V Y L A Y P Y S L E Y F K W T N I P W A K F T S L . . . . . . 9816 CCAAACCGCATCACCCCTTTCATGGGCGAAACCTTCAGCAAGACTTGTTCACCCTCCATG P N R I T P F M G E T F S K T C S P S M Q T A S P L S W A K P S A R L V H P P - T K P H H P F H G R N L Q Q D L F T L H . . . . . . 9756 AACACCAAGTCTCTAACTTTTCTATCTGTATATTCCTTTTGCCTACTTTGCGCCGCTAAC N T K S L T F L S V Y S F C L L C A A N T P S L - L F Y L Y I P F A Y F A P L T E H Q V S N F S I C I F L L P T L R R - . . . . 9696 AGTTTTTCTTGAATAGATTTCACTTTATCTAACGATTC S F S - I D F T L S N D V F L E - I S L Y L T I Q F F L N R F H F I - R F Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-1-_PGL-3_AGS-9_PPS_1 (9885 9685) (frame '1'; 198 bp, 66 residues) 1 ATSPTRLSTS NGPIYLGLSL PRLPNRITPF MGETFSKTCS PSMNTKSLTF LSVYSFCLLC 61 AANSFS- >C06HBa0153O03.1-1-_PGL-3_AGS-9_PPS_2 (9892 9698) (frame '0'; 192 bp, 64 residues) 1 LISHLPYSLE YFKWTNIPWA KFTSLTKPHH PFHGRNLQQD LFTLHEHQVS NFSICIFLLP 61 TLRR- 3-phase translation of AGS-9 (+strand): . . . . . . 9659 GAATCGTTAGATAAAGTGAAATCTATTCAAGAAAAACTGTTAGCGGCGCAAAGTAGGCAA E S L D K V K S I Q E K L L A A Q S R Q N R - I K - N L F K K N C - R R K V G K I V R - S E I Y S R K T V S G A K - A . . . . . . 9719 AAGGAATATACAGATAGAAAAGTTAGAGACTTGGTGTTCATGGAGGGTGAACAAGTCTTG K E Y T D R K V R D L V F M E G E Q V L R N I Q I E K L E T W C S W R V N K S C K G I Y R - K S - R L G V H G G - T S L . . . . . . 9779 CTGAAGGTTTCGCCCATGAAAGGGGTGATGCGGTTTGGTAAGCGAGGTAAACTTAGCCCA L K V S P M K G V M R F G K R G K L S P - R F R P - K G - C G L V S E V N L A Q A E G F A H E R G D A V W - A R - T - P . . . . . . 9839 AGGTATATTGGTCCATTTGAAGTACTCAAGCGAGTAGGGGAGGTGGCTTATGAGTTAGCC R Y I G P F E V L K R V G E V A Y E L A G I L V H L K Y S S E - G R W L M S - P K V Y W S I - S T Q A S R G G G L - V S . . . . . . 9899 TTGCCTCCAGGGCTGTCCGGAGTGCATCCGGTATTTCATGTGTCTATGTTGAAAAGATAC L P P G L S G V H P V F H V S M L K R Y C L Q G C P E C I R Y F M C L C - K D T L A S R A V R S A S G I S C V Y V E K I . . . . . . 9959 CATGGGGATGGAAACTACATTATCCGTTGGGATTCAGTTTTGCTTGATGAAAATTTGACT H G D G N Y I I R W D S V L L D E N L T M G M E T T L S V G I Q F C L M K I - L P W G W K L H Y P L G F S F A - - K F D . . . . . . 10019 TATGAGGAGGAGCCTGTTGCCATTCTAGATAGAGAAATTCGCAAGTTGAGATCAAGGGAG Y E E E P V A I L D R E I R K L R S R E M R R S L L P F - I E K F A S - D Q G R L - G G A C C H S R - R N S Q V E I K G . . . . . . 10079 ATTGCATCCATCAAAGTTCAAAGGAAGAATCGACCAGTCGAAGAAGCCACTTGGGAGAAC I A S I K V Q R K N R P V E E A T W E N L H P S K F K G R I D Q S K K P L G R T D C I H Q S S K E E S T S R R S H L G E . . . . . . 10139 AAGGCTGATATGCGAGAAAGATACCCACACCTGTTTACAGATTCAGGTACTCCTTTTCGC K A D M R E R Y P H L F T D S G T P F R R L I C E K D T H T C L Q I Q V L L F A Q G - Y A R K I P T P V Y R F R Y S F S . . . . . . 10199 CCTTGTTTTCCTTCTTGTGATCGTTCGGGGACGAACGATGGGTAAATTGGTATCTATTGT P C F P S C D R S G T N D G - I G I Y C L V F L L V I V R G R T M G K L V S I V P L F S F L - S F G D E R W V N W Y L L . . . . . . 10259 AACGACCTGTTTAGTCGTTTCGAGCAGCAGATTTTATTTCTTGAAAAACAGGCTGAGATG N D L F S R F E Q Q I L F L E K Q A E M T T C L V V S S S R F Y F L K N R L R - - R P V - S F R A A D F I S - K T G - D . . . . . . 10319 ACGGAACCCACGACGGACCGTCATGGGCACGACAGACCGTTGAGGGTATCTCATTCCAAA T E P T T D R H G H D R P L R V S H S K R N P R R T V M G T T D R - G Y L I P K D G T H D G P S W A R Q T V E G I S F Q . . . . . . 10379 ATACTTAGAATTCTGAAAATTGGGTACTGAAATCGACTTTCTGAACTTCGTAACGAAATG I L R I L K I G Y - N R L S E L R N E M Y L E F - K L G T E I D F L N F V T K W N T - N S E N W V L K S T F - T S - R N . . . . 10439 GCAGGACGGACCGTCACAGGTGTGACGGACCGTCACAG A G R T V T G V T D R H Q D G P S Q V - R T V T G R T D R H R C D G P S Q Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-1+_PGL-3_AGS-9_PPS_1 (9659 10243) (frame '1'; 582 bp, 194 residues) 1 ESLDKVKSIQ EKLLAAQSRQ KEYTDRKVRD LVFMEGEQVL LKVSPMKGVM RFGKRGKLSP 61 RYIGPFEVLK RVGEVAYELA LPPGLSGVHP VFHVSMLKRY HGDGNYIIRW DSVLLDENLT 121 YEEEPVAILD REIRKLRSRE IASIKVQRKN RPVEEATWEN KADMRERYPH LFTDSGTPFR 181 PCFPSCDRSG TNDG- AGS-10 (11575 10618) SCR (e 0.862) Exon 1 11575 10618 ( 958 n); score: 0.862 PGS (10924 10618) SGN-E242274+ PGS (11233 10698) SGN-E252199+ PGS (11575 10816) SGN-E354383+ PGS (10980 10853) SGN-E356257- 3-phase translation of AGS-10 (-strand): . . . . . . 11575 GATTCATTAGAGAAGGTGAAATCTATTCAAGAAAAGCTCTTAGCGGCTCAAAGCAGGCAA D S L E K V K S I Q E K L L A A Q S R Q I H - R R - N L F K K S S - R L K A G K F I R E G E I Y S R K A L S G S K Q A . . . . . . 11515 AAGGAATATGCCGATTGAAAGGTTAGAGACTTAGAGTTCATGGAGGGTGAGCAAGTCTTG K E Y A D - K V R D L E F M E G E Q V L R N M P I E R L E T - S S W R V S K S C K G I C R L K G - R L R V H G G - A S L . . . . . . 11455 CTGAAGGTTTCACCCATGAAAGGGGTGATGCGGTTTGGAAAAAGAGGTAAGCTAAGCCCA L K V S P M K G V M R F G K R G K L S P - R F H P - K G - C G L E K E V S - A Q A E G F T H E R G D A V W K K R - A K P . . . . . . 11395 AGGTATATTGGACCATTTGAAGTACTTAAGCGAGTAGGGGAGGTGGCTTATGAATTAGCC R Y I G P F E V L K R V G E V A Y E L A G I L D H L K Y L S E - G R W L M N - P K V Y W T I - S T - A S R G G G L - I S . . . . . . 11335 TTGCTCCCAGGACTGTCAGGAGTGCATCCGGTATTTCATGTGTCTATGTTGAAGAGATAC L L P G L S G V H P V F H V S M L K R Y C S Q D C Q E C I R Y F M C L C - R D T L A P R T V R S A S G I S C V Y V E E I . . . . . . 11275 CATGGGGATGGAAACTACATCATTCATTGGGATTCGGTTCTTCTTGATGAGAATTTGACT H G D G N Y I I H W D S V L L D E N L T M G M E T T S F I G I R F F L M R I - L P W G W K L H H S L G F G S S - - E F D . . . . . . 11215 TATGAGGAGGAGCCTGTTGCCATCATAGATAGAGATTCGCAAGTTGAGATCAAGGGAGAT Y E E E P V A I I D R D S Q V E I K G D M R R S L L P S - I E I R K L R S R E I L - G G A C C H H R - R F A S - D Q G R . . . . . . 11155 TGCATCCATCAAAGTTCAATGGAAGAATCGACTAGTTGAAGAGTCCACGTGGGAGAAGGA C I H Q S S M E E S T S - R V H V G E G A S I K V Q W K N R L V E E S T W E K E L H P S K F N G R I D - L K S P R G R R . . . . . . 11095 GGCTGATATGCGAGAAAGATACCCACACCTGTTTACAGATTCAAGTACTCCTTTTCGCCC G - Y A R K I P T P V Y R F K Y S F S P A D M R E R Y P H L F T D S S T P F R P R L I C E K D T H T C L Q I Q V L L F A . . . . . . 11035 TTGTTTTTCTTCTTGTGATCATTCGGGGATGAACGATGGGTAAATTGGTATCTATTGTAA L F F F L - S F G D E R W V N W Y L L - C F S S C D H S G M N D G - I G I Y C N L V F L L V I I R G - T M G K L V S I V . . . . . . 10975 CGACCTATTTAGTCGTTTTGAGCAGCAGATTTTATTTCTGGAAAAACTGGCTGAGACGAC R P I - S F - A A D F I S G K T G - D D D L F S R F E Q Q I L F L E K L A E T T T T Y L V V L S S R F Y F W K N W L R R . . . . . . 10915 GGATCCCACGACGGACCGTCATGGGCACGATGGACCGTCGAGGGGGTCTCGTTCAAAAAC G S H D G P S W A R W T V E G V S F K N D P T T D R H G H D G P S R G S R S K T R I P R R T V M G T M D R R G G L V Q K . . . . . . 10855 ACTTAGAATTCTGAAATTTGGATACTGAAATTGACTCTCTGAACTTCGTGACGAAGTGAC T - N S E I W I L K L T L - T S - R S D L R I L K F G Y - N - L S E L R D E V T H L E F - N L D T E I D S L N F V T K - . . . . . . 10795 AGGACGGACCGTCACAGGCATGACGGGCCGTCACAGACTCTTCAGTAAATTTCAGTCTCT R T D R H R H D G P S Q T L Q - I S V S G R T V T G M T G R H R L F S K F Q S L Q D G P S Q A - R A V T D S S V N F S L . . . . . . 10735 GAACTCTGTGATGGAAGCAGCAGGACGGACCGTCGCAGGCACGATGGCCCGTCACAGACT E L C D G S S R T D R R R H D G P S Q T N S V M E A A G R T V A G T M A R H R L - T L - W K Q Q D G P S Q A R W P V T D . . . . . . 10675 GCGTAATCCCAGGCTGAGTCGGATTTCTTTAAATGTTTTAAGGGGGCGTTTTGGATTA A - S Q A E S D F F K C F K G A F W I R N P R L S R I S L N V L R G R F G L C V I P G - V G F L - M F - G G V L D Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-1-_PGL-3_AGS-10_PPS_1 (11497 11117) (frame '1'; 378 bp, 126 residues) 1 KVRDLEFMEG EQVLLKVSPM KGVMRFGKRG KLSPRYIGPF EVLKRVGEVA YELALLPGLS 61 GVHPVFHVSM LKRYHGDGNY IIHWDSVLLD ENLTYEEEPV AIIDRDSQVE IKGDCIHQSS 121 MEESTS- >C06HBa0153O03.1-1-_PGL-3_AGS-10_PPS_2 (10821 10618) (frame '2'; 204 bp, 68 residues) 1 LSELRDEVTG RTVTGMTGRH RLFSKFQSLN SVMEAAGRTV AGTMARHRLR NPRLSRISLN 61 VLRGRFGL 3-phase translation of AGS-10 (+strand): . . . . . . 10618 TAATCCAAAACGCCCCCTTAAAACATTTAAAGAAATCCGACTCAGCCTGGGATTACGCAG - S K T P P - N I - R N P T Q P G I T Q N P K R P L K T F K E I R L S L G L R S I Q N A P L K H L K K S D S A W D Y A . . . . . . 10678 TCTGTGACGGGCCATCGTGCCTGCGACGGTCCGTCCTGCTGCTTCCATCACAGAGTTCAG S V T G H R A C D G P S C C F H H R V Q L - R A I V P A T V R P A A S I T E F R V C D G P S C L R R S V L L L P S Q S S . . . . . . 10738 AGACTGAAATTTACTGAAGAGTCTGTGACGGCCCGTCATGCCTGTGACGGTCCGTCCTGT R L K F T E E S V T A R H A C D G P S C D - N L L K S L - R P V M P V T V R P V E T E I Y - R V C D G P S C L - R S V L . . . . . . 10798 CACTTCGTCACGAAGTTCAGAGAGTCAATTTCAGTATCCAAATTTCAGAATTCTAAGTGT H F V T K F R E S I S V S K F Q N S K C T S S R S S E S Q F Q Y P N F R I L S V S L R H E V Q R V N F S I Q I S E F - V . . . . . . 10858 TTTTGAACGAGACCCCCTCGACGGTCCATCGTGCCCATGACGGTCCGTCGTGGGATCCGT F - T R P P R R S I V P M T V R R G I R F E R D P L D G P S C P - R S V V G S V F L N E T P S T V H R A H D G P S W D P . . . . . . 10918 CGTCTCAGCCAGTTTTTCCAGAAATAAAATCTGCTGCTCAAAACGACTAAATAGGTCGTT R L S Q F F Q K - N L L L K T T K - V V V S A S F S R N K I C C S K R L N R S L S S Q P V F P E I K S A A Q N D - I G R . . . . . . 10978 ACAATAGATACCAATTTACCCATCGTTCATCCCCGAATGATCACAAGAAGAAAAACAAGG T I D T N L P I V H P R M I T R R K T R Q - I P I Y P S F I P E - S Q E E K Q G Y N R Y Q F T H R S S P N D H K K K N K . . . . . . 11038 GCGAAAAGGAGTACTTGAATCTGTAAACAGGTGTGGGTATCTTTCTCGCATATCAGCCTC A K R S T - I C K Q V W V S F S H I S L R K G V L E S V N R C G Y L S R I S A S G E K E Y L N L - T G V G I F L A Y Q P . . . . . . 11098 CTTCTCCCACGTGGACTCTTCAACTAGTCGATTCTTCCATTGAACTTTGATGGATGCAAT L L P R G L F N - S I L P L N F D G C N F S H V D S S T S R F F H - T L M D A I P S P T W T L Q L V D S S I E L - W M Q . . . . . . 11158 CTCCCTTGATCTCAACTTGCGAATCTCTATCTATGATGGCAACAGGCTCCTCCTCATAAG L P - S Q L A N L Y L - W Q Q A P P H K S L D L N L R I S I Y D G N R L L L I S S P L I S T C E S L S M M A T G S S S - . . . . . . 11218 TCAAATTCTCATCAAGAAGAACCGAATCCCAATGAATGATGTAGTTTCCATCCCCATGGT S N S H Q E E P N P N E - C S F H P H G Q I L I K K N R I P M N D V V S I P M V V K F S S R R T E S Q - M M - F P S P W . . . . . . 11278 ATCTCTTCAACATAGACACATGAAATACCGGATGCACTCCTGACAGTCCTGGGAGCAAGG I S S T - T H E I P D A L L T V L G A R S L Q H R H M K Y R M H S - Q S W E Q G Y L F N I D T - N T G C T P D S P G S K . . . . . . 11338 CTAATTCATAAGCCACCTCCCCTACTCGCTTAAGTACTTCAAATGGTCCAATATACCTTG L I H K P P P L L A - V L Q M V Q Y T L - F I S H L P Y S L K Y F K W S N I P W A N S - A T S P T R L S T S N G P I Y L . . . . . . 11398 GGCTTAGCTTACCTCTTTTTCCAAACCGCATCACCCCTTTCATGGGTGAAACCTTCAGCA G L A Y L F F Q T A S P L S W V K P S A A - L T S F S K P H H P F H G - N L Q Q G L S L P L F P N R I T P F M G E T F S . . . . . . 11458 AGACTTGCTCACCCTCCATGAACTCTAAGTCTCTAACCTTTCAATCGGCATATTCCTTTT R L A H P P - T L S L - P F N R H I P F D L L T L H E L - V S N L S I G I F L L K T C S P S M N S K S L T F Q S A Y S F . . . . . . 11518 GCCTGCTTTGAGCCGCTAAGAGCTTTTCTTGAATAGATTTCACCTTCTCTAATGAATC A C F E P L R A F L E - I S P S L M N P A L S R - E L F L N R F H L L - - I C L L - A A K S F S - I D F T F S N E Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-1+_PGL-3_AGS-10_PPS_1 (10648 10863) (frame '1'; 213 bp, 71 residues) 1 RNPTQPGITQ SVTGHRACDG PSCCFHHRVQ RLKFTEESVT ARHACDGPSC HFVTKFRESI 61 SVSKFQNSKC F- AGS-11 (12890 11859) SCR (e 0.862) Exon 1 12890 11859 (1032 n); score: 0.862 PGS (12400 11859) SGN-E353207- PGS (12187 11889) SGN-E578131- PGS (12890 12304) SGN-E352950- PGS (12890 12304) SGN-E357100- PGS (12857 12304) SGN-E352647- 3-phase translation of AGS-11 (-strand): . . . . . . 12890 GCAATTAAAGGTGCATGAACGTAACTATCCGACCCACGATTTAGAATTGACCGCAGTTGT A I K G A - T - L S D P R F R I D R S C Q L K V H E R N Y P T H D L E L T A V V N - R C M N V T I R P T I - N - P Q L . . . . . . 12830 GTTTGCATTAAAGCAATGGAGACATTATCTATATGGGGTCAAGTGTGAAGTCTATACAGA V C I K A M E T L S I W G Q V - S L Y R F A L K Q W R H Y L Y G V K C E V Y T D C L H - S N G D I I Y M G S S V K S I Q . . . . . . 12770 TCATCGTATACTACAGTATGTCTTTACTTAGAAAGAATTGAACTTGAGACAGAGGAGATG S S Y T T V C L Y L E R I E L E T E E M H R I L Q Y V F T - K E L N L R Q R R W I I V Y Y S M S L L R K N - T - D R G D . . . . . . 12710 GATTGAACTACTGAAGGATTATGATGTTACCATCTTGTATCACCCAGGAAAGGCTAATGT D - T T E G L - C Y H L V S P R K G - C I E L L K D Y D V T I L Y H P G K A N V G L N Y - R I M M L P S C I T Q E R L M . . . . . . 12650 TGTGGCAGACGCCTTAAGTAGAAAAGCAGGGAGCATGGGTAGTTTAACCCACTTACAAGT C G R R L K - K S R E H G - F N P L T S V A D A L S R K A G S M G S L T H L Q V L W Q T P - V E K Q G A W V V - P T Y K . . . . . . 12590 TTCTAAACGCCCATTGGCTAGAGAGGTTCAGACTCTGACTAACGAGTTTATGAGGTTAGA F - T P I G - R G S D S D - R V Y E V R S K R P L A R E V Q T L T N E F M R L E F L N A H W L E R F R L - L T S L - G - . . . . . . 12530 AGTAAATGAGAAGGGAGGATTTTTGGCCAGTGTGGAGGCGAGATCTTCTTTTCTTGACAA S K - E G R I F G Q C G G E I F F S - Q V N E K G G F L A S V E A R S S F L D K K - M R R E D F W P V W R R D L L F L T . . . . . . 12470 GATCAAGGGAAAACAGTTTGATGATGAGAAACTAAGCCGAATTCGGGATATGGTGTTGCG D Q G K T V - - - E T K P N S G Y G V A I K G K Q F D D E K L S R I R D M V L R R S R E N S L M M R N - A E F G I W C C . . . . . . 12410 AGGAGAGGCTAAAGAAGCAATAATGCATGAGGAAGGTGTTTTGAGAATTAAGGGATGAGT R R G - R S N N A - G R C F E N - G M S G E A K E A I M H E E G V L R I K G - V E E R L K K Q - C M R K V F - E L R D E . . . . . . 12350 ATGTGTGCCCCGTGTTGATGATTTGATCCATACTATTCTTACAGAGGCTCATAGTTCCAG M C A P C - - F D P Y Y S Y R G S - F Q C V P R V D D L I H T I L T E A H S S R Y V C P V L M I - S I L F L Q R L I V P . . . . . . 12290 ATATTCTATACATCCTGGTGCAACCAAGATGTACCGTGACCTAAAGCAACACTTTTGGTG I F Y T S W C N Q D V P - P K A T L L V Y S I H P G A T K M Y R D L K Q H F W W D I L Y I L V Q P R C T V T - S N T F G . . . . . . 12230 GAGTAGGATGAAGCGCGACATTGTGGATTTTGTTGCCAAATGTCCAAATTGTCAGCAAGT E - D E A R H C G F C C Q M S K L S A S S R M K R D I V D F V A K C P N C Q Q V G V G - S A T L W I L L P N V Q I V S K . . . . . . 12170 AAAGTATGACCACCAGAGGCCCGGAGGAACACTTCAGAGAATGCCCATTCCTGAATGGAA K V - P P E A R R N T S E N A H S - M E K Y D H Q R P G G T L Q R M P I P E W K - S M T T R G P E E H F R E C P F L N G . . . . . . 12110 GTGGGAGAGAATTGCAATGGACTTCGTGGTTGGTCTTCCAAAGACATTGGGGAAGTTTGA V G E N C N G L R G W S S K D I G E V - W E R I A M D F V V G L P K T L G K F D S G R E L Q W T S W L V F Q R H W G S L . . . . . . 12050 CTCTATTTGGGTAATTGTGGACAGATTAACTAAGTCTGCTCATTTCATTCCGGTCAAGGT L Y L G N C G Q I N - V C S F H S G Q G S I W V I V D R L T K S A H F I P V K V T L F G - L W T D - L S L L I S F R S R . . . . . . 11990 GACTTATAATGCAGAGAAGTTAGCCAAAATTTACATCTCAGAAATTGTTCGATTGCATGG D L - C R E V S Q N L H L R N C S I A W T Y N A E K L A K I Y I S E I V R L H G - L I M Q R S - P K F T S Q K L F D C M . . . . . . 11930 AGTTCCACTTTCCATCATATCAGATAGAGGTACGCAGTTTACTTCTAAGTTTTGGAAAAC S S T F H H I R - R Y A V Y F - V L E N V P L S I I S D R G T Q F T S K F W K T E F H F P S Y Q I E V R S L L L S F G K . . 11870 ATTGCATGCGGA I A C G L H A H C M R Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-1-_PGL-3_AGS-11_PPS_1 (12352 11861) (frame '2'; 492 bp, 164 residues) 1 VCVPRVDDLI HTILTEAHSS RYSIHPGATK MYRDLKQHFW WSRMKRDIVD FVAKCPNCQQ 61 VKYDHQRPGG TLQRMPIPEW KWERIAMDFV VGLPKTLGKF DSIWVIVDRL TKSAHFIPVK 121 VTYNAEKLAK IYISEIVRLH GVPLSIISDR GTQFTSKFWK TLHA >C06HBa0153O03.1-1-_PGL-3_AGS-11_PPS_2 (12739 12353) (frame '2'; 384 bp, 128 residues) 1 KELNLRQRRW IELLKDYDVT ILYHPGKANV VADALSRKAG SMGSLTHLQV SKRPLAREVQ 61 TLTNEFMRLE VNEKGGFLAS VEARSSFLDK IKGKQFDDEK LSRIRDMVLR GEAKEAIMHE 121 EGVLRIKG- 3-phase translation of AGS-11 (+strand): . . . . . . 11859 TCCGCATGCAATGTTTTCCAAAACTTAGAAGTAAACTGCGTACCTCTATCTGATATGATG S A C N V F Q N L E V N C V P L S D M M P H A M F S K T - K - T A Y L Y L I - W R M Q C F P K L R S K L R T S I - Y D . . . . . . 11919 GAAAGTGGAACTCCATGCAATCGAACAATTTCTGAGATGTAAATTTTGGCTAACTTCTCT E S G T P C N R T I S E M - I L A N F S K V E L H A I E Q F L R C K F W L T S L G K W N S M Q S N N F - D V N F G - L L . . . . . . 11979 GCATTATAAGTCACCTTGACCGGAATGAAATGAGCAGACTTAGTTAATCTGTCCACAATT A L - V T L T G M K - A D L V N L S T I H Y K S P - P E - N E Q T - L I C P Q L C I I S H L D R N E M S R L S - S V H N . . . . . . 12039 ACCCAAATAGAGTCAAACTTCCCCAATGTCTTTGGAAGACCAACCACGAAGTCCATTGCA T Q I E S N F P N V F G R P T T K S I A P K - S Q T S P M S L E D Q P R S P L Q Y P N R V K L P Q C L W K T N H E V H C . . . . . . 12099 ATTCTCTCCCACTTCCATTCAGGAATGGGCATTCTCTGAAGTGTTCCTCCGGGCCTCTGG I L S H F H S G M G I L - S V P P G L W F S P T S I Q E W A F S E V F L R A S G N S L P L P F R N G H S L K C S S G P L . . . . . . 12159 TGGTCATACTTTACTTGCTGACAATTTGGACATTTGGCAACAAAATCCACAATGTCGCGC W S Y F T C - Q F G H L A T K S T M S R G H T L L A D N L D I W Q Q N P Q C R A V V I L Y L L T I W T F G N K I H N V A . . . . . . 12219 TTCATCCTACTCCACCAAAAGTGTTGCTTTAGGTCACGGTACATCTTGGTTGCACCAGGA F I L L H Q K C C F R S R Y I L V A P G S S Y S T K S V A L G H G T S W L H Q D L H P T P P K V L L - V T V H L G C T R . . . . . . 12279 TGTATAGAATATCTGGAACTATGAGCCTCTGTAAGAATAGTATGGATCAAATCATCAACA C I E Y L E L - A S V R I V W I K S S T V - N I W N Y E P L - E - Y G S N H Q H M Y R I S G T M S L C K N S M D Q I I N . . . . . . 12339 CGGGGCACACATACTCATCCCTTAATTCTCAAAACACCTTCCTCATGCATTATTGCTTCT R G T H T H P L I L K T P S S C I I A S G A H I L I P - F S K H L P H A L L L L T G H T Y S S L N S Q N T F L M H Y C F . . . . . . 12399 TTAGCCTCTCCTCGCAACACCATATCCCGAATTCGGCTTAGTTTCTCATCATCAAACTGT L A S P R N T I S R I R L S F S S S N C - P L L A T P Y P E F G L V S H H Q T V F S L S S Q H H I P N S A - F L I I K L . . . . . . 12459 TTTCCCTTGATCTTGTCAAGAAAAGAAGATCTCGCCTCCACACTGGCCAAAAATCCTCCC F P L I L S R K E D L A S T L A K N P P F P - S C Q E K K I S P P H W P K I L P F S L D L V K K R R S R L H T G Q K S S . . . . . . 12519 TTCTCATTTACTTCTAACCTCATAAACTCGTTAGTCAGAGTCTGAACCTCTCTAGCCAAT F S F T S N L I N S L V R V - T S L A N S H L L L T S - T R - S E S E P L - P M L L I Y F - P H K L V S Q S L N L S S Q . . . . . . 12579 GGGCGTTTAGAAACTTGTAAGTGGGTTAAACTACCCATGCTCCCTGCTTTTCTACTTAAG G R L E T C K W V K L P M L P A F L L K G V - K L V S G L N Y P C S L L F Y L R W A F R N L - V G - T T H A P C F S T - . . . . . . 12639 GCGTCTGCCACAACATTAGCCTTTCCTGGGTGATACAAGATGGTAACATCATAATCCTTC A S A T T L A F P G - Y K M V T S - S F R L P Q H - P F L G D T R W - H H N P S G V C H N I S L S W V I Q D G N I I I L . . . . . . 12699 AGTAGTTCAATCCATCTCCTCTGTCTCAAGTTCAATTCTTTCTAAGTAAAGACATACTGT S S S I H L L C L K F N S F - V K T Y C V V Q S I S S V S S S I L S K - R H T V Q - F N P S P L S Q V Q F F L S K D I L . . . . . . 12759 AGTATACGATGATCTGTATAGACTTCACACTTGACCCCATATAGATAATGTCTCCATTGC S I R - S V - T S H L T P Y R - C L H C V Y D D L Y R L H T - P H I D N V S I A - Y T M I C I D F T L D P I - I M S P L . . . . . . 12819 TTTAATGCAAACACAACTGCGGTCAATTCTAAATCGTGGGTCGGATAGTTACGTTCATGC F N A N T T A V N S K S W V G - L R S C L M Q T Q L R S I L N R G S D S Y V H A L - C K H N C G Q F - I V G R I V T F M . . 12879 ACCTTTAATTGC T F N C P L I H L - L Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-1+_PGL-3_AGS-11_PPS_1 (12303 12563) (frame '1'; 258 bp, 86 residues) 1 ASVRIVWIKS STRGTHTHPL ILKTPSSCII ASLASPRNTI SRIRLSFSSS NCFPLILSRK 61 EDLASTLAKN PPFSFTSNLI NSLVRV- >C06HBa0153O03.1-1+_PGL-3_AGS-11_PPS_2 (12049 12285) (frame '2'; 234 bp, 78 residues) 1 SQTSPMSLED QPRSPLQFSP TSIQEWAFSE VFLRASGGHT LLADNLDIWQ QNPQCRASSY 61 STKSVALGHG TSWLHQDV- PGL 4 (+ strand): 13118 15821 AGS-1 (13118 15821) SCR (e 0.901) Exon 1 13118 15821 (2704 n); score: 0.901 PGS (13118 13830) SGN-E379315+ PGS (13118 13713) SGN-E375319+ PGS (13118 13643) SGN-E204434+ PGS (13189 13868) SGN-E368762+ PGS (13189 13547) SGN-E240817+ PGS (13638 14292) SGN-E355232+ PGS (13707 14464) SGN-E355244+ PGS (13707 14363) SGN-E351414+ PGS (13750 14406) SGN-E352117- PGS (13940 14472) SGN-E355026+ PGS (13954 14337) SGN-E242765+ PGS (14106 14436) SGN-E352716+ PGS (14313 14871) SGN-E244046+ PGS (14317 15080) SGN-E214046+ PGS (14397 15186) SGN-E356912+ PGS (14405 14931) SGN-E353805+ PGS (14473 15170) SGN-E356209+ PGS (14839 15543) SGN-E349404+ PGS (14839 15511) SGN-E351625+ PGS (14839 15446) SGN-E357065+ PGS (14839 15354) SGN-E352365+ PGS (15036 15752) SGN-E356614+ PGS (15037 15700) SGN-E352401+ PGS (15204 15756) SGN-E329287- PGS (15392 15821) SGN-E352180+ 3-phase translation of AGS-1 (+strand): . . . . . . 13118 GAATCCCTTAACAAATCGACGGTAATAGCTAGCTAAACCAACAAAGCTCCTTATTTTTGA E S L N K S T V I A S - T N K A P Y F - N P L T N R R - - L A K P T K L L I F D I P - Q I D G N S - L N Q Q S S L F L . . . . . . 13178 CACATTAGTAGGTATTACCCAATTCTTCACTGTCTCAATCTTAGAAGGATCCACCATCAC H I S R Y Y P I L H C L N L R R I H H H T L V G I T Q F F T V S I L E G S T I T T H - - V L P N S S L S Q S - K D P P S . . . . . . 13238 CCCATCCTTAGAAACCACATGCCCCAAGAAGGACATTGCATCTAGCCAAAACTCACACTT P I L R N H M P Q E G H C I - P K L T L P S L E T T C P K K D I A S S Q N S H L P H P - K P H A P R R T L H L A K T H T . . . . . . 13298 GGAGAATTTTGCATAAAGCTTTTCTCCCTCAACATTTCCAATACAATTCTCAAGTGCTCC G E F C I K L F S L N I S N T I L K C S E N F A - S F S P S T F P I Q F S S A P W R I L H K A F L P Q H F Q Y N S Q V L . . . . . . 13358 TCATATTCCTTCTTGCTCTTTAAGTATACCAATATATCATCAATAAATACGATCACAAAG S Y S F L L F K Y T N I S S I N T I T K H I P S C S L S I P I Y H Q - I R S Q R L I F L L A L - V Y Q Y I I N K Y D H K . . . . . . 13418 AGATCTAGATATGGCTTAAAAATCCCATTCATCAAGCTCATGAAAGCAGCAGGGGCATTC R S R Y G L K I P F I K L M K A A G A F D L D M A - K S H S S S S - K Q Q G H S E I - I W L K N P I H Q A H E S S R G I . . . . . . 13478 GTAAGACCAAAAGACATCACTACAAATTCGTAATGCCCATACCTGGTCCGAAAAAGCAGT V R P K D I T T N S - C P Y L V R K S S - D Q K T S L Q I R N A H T W S E K A V R K T K R H H Y K F V M P I P G P K K Q . . . . . . 13538 CTTTGGCACATCCGTTGCCCGCATTTTCAATTGATGATAACCGGACCTCAAGTCAATCTT L W H I R C P H F Q L M I T G P Q V N L F G T S V A R I F N - - - P D L K S I L S L A H P L P A F S I D D N R T S S Q S . . . . . . 13598 AGAGAAGACACAAGCACCTTGTAACTGATCGAACAAGTCATCAATGCGAGGAATGGGATA R E D T S T L - L I E Q V I N A R N G I E K T Q A P C N - S N K S S M R G M G Y - R R H K H L V T D R T S H Q C E E W D . . . . . . 13658 CTTGTTCTTAATTGTTACCTTGTTCAACTGCCGGTAGTCTATGCACATCCGAAAACTCCC L V L N C Y L V Q L P V V Y A H P K T P L F L I V T L F N C R - S M H I R K L P T C S - L L P C S T A G S L C T S E N S . . . . . . 13718 ATCCTTCTTCTTCACAAATAACACCGGAGCACCCCAAGGAGAAGCACTTGGTCTAATGAA I L L L H K - H R S T P R R S T W S N E S F F F T N N T G A P Q G E A L G L M N H P S S S Q I T P E H P K E K H L V - - . . . . . . 13778 TCCTTCGCTCAACAACTCTTGAAGTTGGGCATTTAACTCTCTCAACTCCGTGGGAGCCAT S F A Q Q L L K L G I - L S Q L R G S H P S L N N S - S W A F N S L N S V G A I I L R S T T L E V G H L T L S T P W E P . . . . . . 13838 TCTATAAGAGGGATATAGAAATGGGGCGAGTACCTGGTTTAAGATCAATGCAGAAGTCAA S I R G I - K W G E Y L V - D Q C R S Q L - E G Y R N G A S T W F K I N A E V N F Y K R D I E M G R V P G L R S M Q K S . . . . . . 13898 TATCTCTATCCGGTGGCATACCCGGAAGGTCTGCGGGAAACTAAAAACTCACGAACTATC Y L Y P V A Y P E G L R E T K N S R T I I S I R W H T R K V C G K L K T H E L S I S L S G G I P G R S A G N - K L T N Y . . . . . . 13958 AAAACCGACTCAATTGGAGGTACTTGGGTAGTATCATCCCTGAGATGTGCGAAGAACGCT K T D S I G G T W V V S S L R C A K N A K P T Q L E V L G - Y H P - D V R R T L Q N R L N W R Y L G S I I P E M C E E R . . . . . . 14018 AAACAAACCTTACTAACCATTTTCTTAGCACGAAGAAAGGAGATGATACGAACCGGATCG K Q T L L T I F L A R R K E M I R T G S N K P Y - P F S - H E E R R - Y E P D R - T N L T N H F L S T K K G D D T N R I . . . . . . 14078 GAAGTGTAGTCACCCTCCCACACTAACGGATCTGTCCCAGGCTTGGCTAACATCACAGTT E V - S P S H T N G S V P G L A N I T V K C S H P P T L T D L S Q A W L T S Q F G S V V T L P H - R I C P R L G - H H S . . . . . . 14138 TTAGCATTACAATCCAAGATCGCAAAATTCGGAGAAAGCCAAGTCATACCCAGAATTACA L A L Q S K I A K F G E S Q V I P R I T - H Y N P R S Q N S E K A K S Y P E L H F S I T I Q D R K I R R K P S H T Q N Y . . . . . . 14198 TCAAAATCAACCATTTCTAAGATAACCAAGTCTACATAAGTATTGCTCCCCACAAAAGTC S K S T I S K I T K S T - V L L P T K V Q N Q P F L R - P S L H K Y C S P Q K S I K I N H F - D N Q V Y I S I A P H K S . . . . . . 14258 ACCAGACCAGACCTATATACTTTTTCAACTATCACAGACTCACCCACCGGAGTAGAAACA T R P D L Y T F S T I T D S P T G V E T P D Q T Y I L F Q L S Q T H P P E - K H H Q T R P I Y F F N Y H R L T H R S R N . . . . . . 14318 CGAATAGGCATGTCAAGCAATTCACAATGTAAATTAAGACCATTAGCAAATGAGGAAGAT R I G M S S N S Q C K L R P L A N E E D E - A C Q A I H N V N - D H - Q M R K I T N R H V K Q F T M - I K T I S K - G R . . . . . . 14378 ACATAAGAAAATGTGGATCCAGGATAAAACAATACATAAGCAATGCAATCACAAACCAAA T - E N V D P G - N N T - A M Q S Q T K H K K M W I Q D K T I H K Q C N H K P K Y I R K C G S R I K Q Y I S N A I T N Q . . . . . . 14438 AGATTACCTGTGATGACAGCATCCGATGTCTCCGCTTCAGATCTCCCAGGGAAAGCATAA R L P V M T A S D V S A S D L P G K A - D Y L - - Q H P M S P L Q I S Q G K H N K I T C D D S I R C L R F R S P R E S I . . . . . . 14498 CAACGGGCCCTATCACCTGTCTGCCCGTTGCCCCTACCATGTTGCGCTGCAGTAGTTCCC Q R A L S P V C P L P L P C C A A V V P N G P Y H L S A R C P Y H V A L Q - F P T T G P I T C L P V A P T M L R C S S S . . . . . . 14558 ATTTGTCCGTCACCCCCGCCGTTTTGGTGACCACCATTACCTCGGCCACCATGTCCTCCA I C P S P P P F W - P P L P R P P C P P F V R H P R R F G D H H Y L G H H V L Q H L S V T P A V L V T T I T S A T M S S . . . . . . 14618 GAATAGCGGCCTCTACCATGACCACCTCTACCTCTAGCTATTGAGGGTCTATAACTCTAT E - R P L P - P P L P L A I E G L - L Y N S G L Y H D H L Y L - L L R V Y N S I R I A A S T M T T S T S S Y - G S I T L . . . . . . 14678 TTTGGACAATTCCTCCTAATATGTCCAGTCTCCCCACATCCATAACACTCCCTGGAGTCA F G Q F L L I C P V S P H P - H S L E S L D N S S - Y V Q S P H I H N T P W S Q F W T I P P N M S S L P T S I T L P G V . . . . . . 14738 AGCATAGGTCTCTCAGAGAAGTGTTGACCGGTCTGAGGTGGACCCCCAACTACAGTTTGT S I G L S E K C - P V - G G P P T T V C A - V S Q R S V D R S E V D P Q L Q F V K H R S L R E V L T G L R W T P N Y S L . . . . . . 14798 AGTGAAGACTGAATTGGTTGGACTGAGTAACCTCCTGAACCCTGTCCTCTAGAGTAAGAA S E D - I G W T E - P P E P C P L E - E V K T E L V G L S N L L N P V L - S K N - - R L N W L D - V T S - T L S S R V R . . . . . . 14858 CCATTAAACTCACCTCCCTTTCGAAGAATTTTTGATGACATTGCCATGGTGAAGTCATCT P L N S P P F R R I F D D I A M V K S S H - T H L P F E E F L M T L P W - S H L T I K L T S L S K N F - - H C H G E V I . . . . . . 14918 GGCTTCACTCCTTCTACCTCTATCACGAAGTCTACCTCTTCTTGAAAAGATTTTGCCGTA G F T P S T S I T K S T S S - K D F A V A S L L L P L S R S L P L L E K I L P - W L H S F Y L Y H E V Y L F L K R F C R . . . . . . 14978 GCCGCTACCTGTAAGGCTGAAATCCGCAATTCTGACCTTAACCCCTTCATAAAATGGTGA A A T C K A E I R N S D L N P F I K W - P L P V R L K S A I L T L T P S - N G E S R Y L - G - N P Q F - P - P L H K M V . . . . . . 15038 ATCCGCTCTTGTGGACTGAAATAAAGTTGGGTGGCATACCTGGATAATGCACGAAACTTA I R S C G L K - S W V A Y L D N A R N L S A L V D - N K V G W H T W I M H E T - N P L L W T E I K L G G I P G - C T K L . . . . . . 15098 GCCTCATATGCGGTAACCGACATCCTACCTTGCTCTAGGCTCAAGAACTCATCTCTTTTC A S Y A V T D I L P C S R L K N S S L F P H M R - P T S Y L A L G S R T H L F S S L I C G N R H P T L L - A Q E L I S F . . . . . . 15158 CTATCCCTCAAAGTACGGGGGATATACTTCTCCATAAACAAGCTAGAGAATGATGCCCAA L S L K V R G I Y F S I N K L E N D A Q Y P S K Y G G Y T S P - T S - R M M P K P I P Q S T G D I L L H K Q A R E - C P . . . . . . 15218 GTCATAGGTGGTGCCTCTGTTGGTTGACACTCAACATGTGACCGCCACCACATTTTGGCG V I G G A S V G - H S T C D R H H I L A S - V V P L L V D T Q H V T A T T F W R S H R W C L C W L T L N M - P P P H F G . . . . . . 15278 TTCCCTTGGAACTGATAAGTCACAAACTCAACACCAAACCGTTCTACTATACCCATCTTG F P W N - - V T N S T P N R S T I P I L S L G T D K S Q T Q H Q T V L L Y P S C V P L E L I S H K L N T K P F Y Y T H L . . . . . . 15338 TGTAGTAGCTCATGACAGTCAACCAGAAAATCGTAGGCATCCTCAGATTCAGCACCCTTG C S S S - Q S T R K S - A S S D S A P L V V A H D S Q P E N R R H P Q I Q H P - V - - L M T V N Q K I V G I L R F S T L . . . . . . 15398 AAGACTGGAGGTTTCAATTTCAAGAACTTACTGAAAAGTTCATGCTGATCATTTGTCATT K T G G F N F K N L L K S S C - S F V I R L E V S I S R T Y - K V H A D H L S L E D W R F Q F Q E L T E K F M L I I C H . . . . . . 15458 ATAGGCCCTGTAGTTAGACGAGGAAACATGTCTATTTCCAATGAGGCATCCATGCGGGGA I G P V V R R G N M S I S N E A S M R G - A L - L D E E T C L F P M R H P C G E Y R P C S - T R K H V Y F Q - G I H A G . . . . . . 15518 GCCACAGTAGCCGCATGTTGTACCTCCGGAGCCTGAGGTGCTGGTGTAGAAAACACTGGA A T V A A C C T S G A - G A G V E N T G P Q - P H V V P P E P E V L V - K T L E S H S S R M L Y L R S L R C W C R K H W . . . . . . 15578 GGCGCTTGGCCTTGATCACATAACCCGCTAAGATAAGCAAGAACCTGATTGATCATCTCT G A W P - S H N P L R - A R T - L I I S A L G L D H I T R - D K Q E P D - S S L R R L A L I T - P A K I S K N L I D H L . . . . . . 15638 AGGGTAGGTTGGGGTGGTAATTCCTCATTCTGTACTTGTTCATTTTCCCCATCCTCCCCT R V G W G G N S S F C T C S F S P S S P G - V G V V I P H S V L V H F P H P P L - G R L G W - F L I L Y L F I F P I L P . . . . . . 15698 TCTCTTACTACTTCCTCAGTCGGTGGAGGAGTCACCGCCCTAGTACCAGATAGGCCAGGC S L T T S S V G G G V T A L V P D R P G L L L L P Q S V E E S P P - Y Q I G Q A F S Y Y F L S R W R S H R P S T R - A R . . . . . . 15758 GTTCATCCTCTTCCTCTAGAAGACGTCCTCCCGCGACCTCTACCGCGGCCTCTTGCTACT V H P L P L E D V L P R P L P R P L A T F I L F L - K T S S R D L Y R G L L L L R S S S S S R R R P P A T S T A A S C Y . 15818 GCTC A L C Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-1+_PGL-4_AGS-1_PPS_1 (14371 14661) (frame '0'; 288 bp, 96 residues) 1 GRYIRKCGSR IKQYISNAIT NQKITCDDSI RCLRFRSPRE SITTGPITCL PVAPTMLRCS 61 SSHLSVTPAV LVTTITSATM SSRIAASTMT TSTSSY- >C06HBa0153O03.1-1+_PGL-4_AGS-1_PPS_2 (13283 13510) (frame '1'; 225 bp, 75 residues) 1 PKLTLGEFCI KLFSLNISNT ILKCSSYSFL LFKYTNISSI NTITKRSRYG LKIPFIKLMK 61 AAGAFVRPKD ITTNS- >C06HBa0153O03.1-1+_PGL-4_AGS-1_PPS_3 (13880 14086) (frame '1'; 204 bp, 68 residues) 1 DQCRSQYLYP VAYPEGLRET KNSRTIKTDS IGGTWVVSSL RCAKNAKQTL LTIFLARRKE 61 MIRTGSEV- >C06HBa0153O03.1-1+_PGL-4_AGS-1_PPS_4 (15626 15820) (frame '1'; 195 bp, 65 residues) 1 LIISRVGWGG NSSFCTCSFS PSSPSLTTSS VGGGVTALVP DRPGVHPLPL EDVLPRPLPR 61 PLATA 3-phase translation of AGS-1 (-strand): . . . . . . 15821 GAGCAGTAGCAAGAGGCCGCGGTAGAGGTCGCGGGAGGACGTCTTCTAGAGGAAGAGGAT E Q - Q E A A V E V A G G R L L E E E D S S S K R P R - R S R E D V F - R K R M A V A R G R G R G R G R T S S R G R G . . . . . . 15761 GAACGCCTGGCCTATCTGGTACTAGGGCGGTGACTCCTCCACCGACTGAGGAAGTAGTAA E R L A Y L V L G R - L L H R L R K - - N A W P I W Y - G G D S S T D - G S S K - T P G L S G T R A V T P P P T E E V V . . . . . . 15701 GAGAAGGGGAGGATGGGGAAAATGAACAAGTACAGAATGAGGAATTACCACCCCAACCTA E K G R M G K M N K Y R M R N Y H P N L R R G G W G K - T S T E - G I T T P T Y R E G E D G E N E Q V Q N E E L P P Q P . . . . . . 15641 CCCTAGAGATGATCAATCAGGTTCTTGCTTATCTTAGCGGGTTATGTGATCAAGGCCAAG P - R - S I R F L L I L A G Y V I K A K P R D D Q S G S C L S - R V M - S R P S T L E M I N Q V L A Y L S G L C D Q G Q . . . . . . 15581 CGCCTCCAGTGTTTTCTACACCAGCACCTCAGGCTCCGGAGGTACAACATGCGGCTACTG R L Q C F L H Q H L R L R R Y N M R L L A S S V F Y T S T S G S G G T T C G Y C A P P V F S T P A P Q A P E V Q H A A T . . . . . . 15521 TGGCTCCCCGCATGGATGCCTCATTGGAAATAGACATGTTTCCTCGTCTAACTACAGGGC W L P A W M P H W K - T C F L V - L Q G G S P H G C L I G N R H V S S S N Y R A V A P R M D A S L E I D M F P R L T T G . . . . . . 15461 CTATAATGACAAATGATCAGCATGAACTTTTCAGTAAGTTCTTGAAATTGAAACCTCCAG L - - Q M I S M N F S V S S - N - N L Q Y N D K - S A - T F Q - V L E I E T S S P I M T N D Q H E L F S K F L K L K P P . . . . . . 15401 TCTTCAAGGGTGCTGAATCTGAGGATGCCTACGATTTTCTGGTTGACTGTCATGAGCTAC S S R V L N L R M P T I F W L T V M S Y L Q G C - I - G C L R F S G - L S - A T V F K G A E S E D A Y D F L V D C H E L . . . . . . 15341 TACACAAGATGGGTATAGTAGAACGGTTTGGTGTTGAGTTTGTGACTTATCAGTTCCAAG Y T R W V - - N G L V L S L - L I S S K T Q D G Y S R T V W C - V C D L S V P R L H K M G I V E R F G V E F V T Y Q F Q . . . . . . 15281 GGAACGCCAAAATGTGGTGGCGGTCACATGTTGAGTGTCAACCAACAGAGGCACCACCTA G T P K C G G G H M L S V N Q Q R H H L E R Q N V V A V T C - V S T N R G T T Y G N A K M W W R S H V E C Q P T E A P P . . . . . . 15221 TGACTTGGGCATCATTCTCTAGCTTGTTTATGGAGAAGTATATCCCCCGTACTTTGAGGG - L G H H S L A C L W R S I S P V L - G D L G I I L - L V Y G E V Y P P Y F E G M T W A S F S S L F M E K Y I P R T L R . . . . . . 15161 ATAGGAAAAGAGATGAGTTCTTGAGCCTAGAGCAAGGTAGGATGTCGGTTACCGCATATG I G K E M S S - A - S K V G C R L P H M - E K R - V L E P R A R - D V G Y R I - D R K R D E F L S L E Q G R M S V T A Y . . . . . . 15101 AGGCTAAGTTTCGTGCATTATCCAGGTATGCCACCCAACTTTATTTCAGTCCACAAGAGC R L S F V H Y P G M P P N F I S V H K S G - V S C I I Q V C H P T L F Q S T R A E A K F R A L S R Y A T Q L Y F S P Q E . . . . . . 15041 GGATTCACCATTTTATGAAGGGGTTAAGGTCAGAATTGCGGATTTCAGCCTTACAGGTAG G F T I L - R G - G Q N C G F Q P Y R - D S P F Y E G V K V R I A D F S L T G S R I H H F M K G L R S E L R I S A L Q V . . . . . . 14981 CGGCTACGGCAAAATCTTTTCAAGAAGAGGTAGACTTCGTGATAGAGGTAGAAGGAGTGA R L R Q N L F K K R - T S - - R - K E - G Y G K I F S R R G R L R D R G R R S E A A T A K S F Q E E V D F V I E V E G V . . . . . . 14921 AGCCAGATGACTTCACCATGGCAATGTCATCAAAAATTCTTCGAAAGGGAGGTGAGTTTA S Q M T S P W Q C H Q K F F E R E V S L A R - L H H G N V I K N S S K G R - V - K P D D F T M A M S S K I L R K G G E F . . . . . . 14861 ATGGTTCTTACTCTAGAGGACAGGGTTCAGGAGGTTACTCAGTCCAACCAATTCAGTCTT M V L T L E D R V Q E V T Q S N Q F S L W F L L - R T G F R R L L S P T N S V F N G S Y S R G Q G S G G Y S V Q P I Q S . . . . . . 14801 CACTACAAACTGTAGTTGGGGGTCCACCTCAGACCGGTCAACACTTCTCTGAGAGACCTA H Y K L - L G V H L R P V N T S L R D L T T N C S W G S T S D R S T L L - E T Y S L Q T V V G G P P Q T G Q H F S E R P . . . . . . 14741 TGCTTGACTCCAGGGAGTGTTATGGATGTGGGGAGACTGGACATATTAGGAGGAATTGTC C L T P G S V M D V G R L D I L G G I V A - L Q G V L W M W G D W T Y - E E L S M L D S R E C Y G C G E T G H I R R N C . . . . . . 14681 CAAAATAGAGTTATAGACCCTCAATAGCTAGAGGTAGAGGTGGTCATGGTAGAGGCCGCT Q N R V I D P Q - L E V E V V M V E A A K I E L - T L N S - R - R W S W - R P L P K - S Y R P S I A R G R G G H G R G R . . . . . . 14621 ATTCTGGAGGACATGGTGGCCGAGGTAATGGTGGTCACCAAAACGGCGGGGGTGACGGAC I L E D M V A E V M V V T K T A G V T D F W R T W W P R - W W S P K R R G - R T Y S G G H G G R G N G G H Q N G G G D G . . . . . . 14561 AAATGGGAACTACTGCAGCGCAACATGGTAGGGGCAACGGGCAGACAGGTGATAGGGCCC K W E L L Q R N M V G A T G R Q V I G P N G N Y C S A T W - G Q R A D R - - G P Q M G T T A A Q H G R G N G Q T G D R A . . . . . . 14501 GTTGTTATGCTTTCCCTGGGAGATCTGAAGCGGAGACATCGGATGCTGTCATCACAGGTA V V M L S L G D L K R R H R M L S S Q V L L C F P W E I - S G D I G C C H H R - R C Y A F P G R S E A E T S D A V I T G . . . . . . 14441 ATCTTTTGGTTTGTGATTGCATTGCTTATGTATTGTTTTATCCTGGATCCACATTTTCTT I F W F V I A L L M Y C F I L D P H F L S F G L - L H C L C I V L S W I H I F L N L L V C D C I A Y V L F Y P G S T F S . . . . . . 14381 ATGTATCTTCCTCATTTGCTAATGGTCTTAATTTACATTGTGAATTGCTTGACATGCCTA M Y L P H L L M V L I Y I V N C L T C L C I F L I C - W S - F T L - I A - H A Y Y V S S S F A N G L N L H C E L L D M P . . . . . . 14321 TTCGTGTTTCTACTCCGGTGGGTGAGTCTGTGATAGTTGAAAAAGTATATAGGTCTGGTC F V F L L R W V S L - - L K K Y I G L V S C F Y S G G - V C D S - K S I - V W S I R V S T P V G E S V I V E K V Y R S G . . . . . . 14261 TGGTGACTTTTGTGGGGAGCAATACTTATGTAGACTTGGTTATCTTAGAAATGGTTGATT W - L L W G A I L M - T W L S - K W L I G D F C G E Q Y L C R L G Y L R N G - F L V T F V G S N T Y V D L V I L E M V D . . . . . . 14201 TTGATGTAATTCTGGGTATGACTTGGCTTTCTCCGAATTTTGCGATCTTGGATTGTAATG L M - F W V - L G F L R I L R S W I V M - C N S G Y D L A F S E F C D L G L - C F D V I L G M T W L S P N F A I L D C N . . . . . . 14141 CTAAAACTGTGATGTTAGCCAAGCCTGGGACAGATCCGTTAGTGTGGGAGGGTGACTACA L K L - C - P S L G Q I R - C G R V T T - N C D V S Q A W D R S V S V G G - L H A K T V M L A K P G T D P L V W E G D Y . . . . . . 14081 CTTCCGATCCGGTTCGTATCATCTCCTTTCTTCGTGCTAAGAAAATGGTTAGTAAGGTTT L P I R F V S S P F F V L R K W L V R F F R S G S Y H L L S S C - E N G - - G L T S D P V R I I S F L R A K K M V S K V . . . . . . 14021 GTTTAGCGTTCTTCGCACATCTCAGGGATGATACTACCCAAGTACCTCCAATTGAGTCGG V - R S S H I S G M I L P K Y L Q L S R F S V L R T S Q G - Y Y P S T S N - V G C L A F F A H L R D D T T Q V P P I E S . . . . . . 13961 TTTTGATAGTTCGTGAGTTTTTAGTTTCCCGCAGACCTTCCGGGTATGCCACCGGATAGA F - - F V S F - F P A D L P G M P P D R F D S S - V F S F P Q T F R V C H R I E V L I V R E F L V S R R P S G Y A T G - . . . . . . 13901 GATATTGACTTCTGCATTGATCTTAAACCAGGTACTCGCCCCATTTCTATATCCCTCTTA D I D F C I D L K P G T R P I S I S L L I L T S A L I L N Q V L A P F L Y P S Y R Y - L L H - S - T R Y S P H F Y I P L . . . . . . 13841 TAGAATGGCTCCCACGGAGTTGAGAGAGTTAAATGCCCAACTTCAAGAGTTGTTGAGCGA - N G S H G V E R V K C P T S R V V E R R M A P T E L R E L N A Q L Q E L L S E I E W L P R S - E S - M P N F K S C - A . . . . . . 13781 AGGATTCATTAGACCAAGTGCTTCTCCTTGGGGTGCTCCGGTGTTATTTGTGAAGAAGAA R I H - T K C F S L G C S G V I C E E E G F I R P S A S P W G A P V L F V K K K K D S L D Q V L L L G V L R C Y L - R R . . . . . . 13721 GGATGGGAGTTTTCGGATGTGCATAGACTACCGGCAGTTGAACAAGGTAACAATTAAGAA G W E F S D V H R L P A V E Q G N N - E D G S F R M C I D Y R Q L N K V T I K N R M G V F G C A - T T G S - T R - Q L R . . . . . . 13661 CAAGTATCCCATTCCTCGCATTGATGACTTGTTCGATCAGTTACAAGGTGCTTGTGTCTT Q V S H S S H - - L V R S V T R C L C L K Y P I P R I D D L F D Q L Q G A C V F T S I P F L A L M T C S I S Y K V L V S . . . . . . 13601 CTCTAAGATTGACTTGAGGTCCGGTTATCATCAATTGAAAATGCGGGCAACGGATGTGCC L - D - L E V R L S S I E N A G N G C A S K I D L R S G Y H Q L K M R A T D V P S L R L T - G P V I I N - K C G Q R M C . . . . . . 13541 AAAGACTGCTTTTTCGGACCAGGTATGGGCATTACGAATTTGTAGTGATGTCTTTTGGTC K D C F F G P G M G I T N L - - C L L V K T A F S D Q V W A L R I C S D V F W S Q R L L F R T R Y G H Y E F V V M S F G . . . . . . 13481 TTACGAATGCCCCTGCTGCTTTCATGAGCTTGATGAATGGGATTTTTAAGCCATATCTAG L R M P L L L S - A - - M G F L S H I - Y E C P C C F H E L D E W D F - A I S R L T N A P A A F M S L M N G I F K P Y L . . . . . . 13421 ATCTCTTTGTGATCGTATTTATTGATGATATATTGGTATACTTAAAGAGCAAGAAGGAAT I S L - S Y L L M I Y W Y T - R A R R N S L C D R I Y - - Y I G I L K E Q E G I D L F V I V F I D D I L V Y L K S K K E . . . . . . 13361 ATGAGGAGCACTTGAGAATTGTATTGGAAATGTTGAGGGAGAAAAGCTTTATGCAAAATT M R S T - E L Y W K C - G R K A L C K I - G A L E N C I G N V E G E K L Y A K F Y E E H L R I V L E M L R E K S F M Q N . . . . . . 13301 CTCCAAGTGTGAGTTTTGGCTAGATGCAATGTCCTTCTTGGGGCATGTGGTTTCTAAGGA L Q V - V L A R C N V L L G A C G F - G S K C E F W L D A M S F L G H V V S K D S P S V S F G - M Q C P S W G M W F L R . . . . . . 13241 TGGGGTGATGGTGGATCCTTCTAAGATTGAGACAGTGAAGAATTGGGTAATACCTACTAA W G D G G S F - D - D S E E L G N T Y - G V M V D P S K I E T V K N W V I P T N M G - W W I L L R L R Q - R I G - Y L L . . . . . . 13181 TGTGTCAAAAATAAGGAGCTTTGTTGGTTTAGCTAGCTATTACCGTCGATTTGTTAAGGG C V K N K E L C W F S - L L P S I C - G V S K I R S F V G L A S Y Y R R F V K G M C Q K - G A L L V - L A I T V D L L R . 13121 ATTC I F D Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-1-_PGL-4_AGS-1_PPS_1 (15759 14674) (frame '0'; 1083 bp, 361 residues) 1 TPGLSGTRAV TPPPTEEVVR EGEDGENEQV QNEELPPQPT LEMINQVLAY LSGLCDQGQA 61 PPVFSTPAPQ APEVQHAATV APRMDASLEI DMFPRLTTGP IMTNDQHELF SKFLKLKPPV 121 FKGAESEDAY DFLVDCHELL HKMGIVERFG VEFVTYQFQG NAKMWWRSHV ECQPTEAPPM 181 TWASFSSLFM EKYIPRTLRD RKRDEFLSLE QGRMSVTAYE AKFRALSRYA TQLYFSPQER 241 IHHFMKGLRS ELRISALQVA ATAKSFQEEV DFVIEVEGVK PDDFTMAMSS KILRKGGEFN 301 GSYSRGQGSG GYSVQPIQSS LQTVVGGPPQ TGQHFSERPM LDSRECYGCG ETGHIRRNCP 361 K- >C06HBa0153O03.1-1-_PGL-4_AGS-1_PPS_2 (14673 13903) (frame '0'; 768 bp, 256 residues) 1 SYRPSIARGR GGHGRGRYSG GHGGRGNGGH QNGGGDGQMG TTAAQHGRGN GQTGDRARCY 61 AFPGRSEAET SDAVITGNLL VCDCIAYVLF YPGSTFSYVS SSFANGLNLH CELLDMPIRV 121 STPVGESVIV EKVYRSGLVT FVGSNTYVDL VILEMVDFDV ILGMTWLSPN FAILDCNAKT 181 VMLAKPGTDP LVWEGDYTSD PVRIISFLRA KKMVSKVCLA FFAHLRDDTT QVPPIESVLI 241 VREFLVSRRP SGYATG- >C06HBa0153O03.1-1-_PGL-4_AGS-1_PPS_3 (13563 13279) (frame '0'; 282 bp, 94 residues) 1 KCGQRMCQRL LFRTRYGHYE FVVMSFGLTN APAAFMSLMN GIFKPYLDLF VIVFIDDILV 61 YLKSKKEYEE HLRIVLEMLR EKSFMQNSPS VSFG- AGS-2 (13740 14211,14956 15130) SCR (e 0.870 d 0.000 a 0.000,e 0.891) Exon 1 13740 14211 ( 472 n); score: 0.870 Intron 1 14212 14955 ( 744 n); Pd: 0.000 Pa: 0.000 Exon 2 14956 15130 ( 175 n); score: 0.891 PGS (13740 14211,14956 15130) SGN-E353359+ 3-phase translation of AGS-2 (+strand): . . . . . . 13740 ACCGGAGCACCCCAAGGAGAAGCACTTGGTCTAATGAATCCTTCGCTCAACAACTCTTGA T G A P Q G E A L G L M N P S L N N S - P E H P K E K H L V - - I L R S T T L E R S T P R R S T W S N E S F A Q Q L L . . . . . . 13800 AGTTGGGCATTTAACTCTCTCAACTCCGTGGGAGCCATTCTATAAGAGGGATATAGAAAT S W A F N S L N S V G A I L - E G Y R N V G H L T L S T P W E P F Y K R D I E M K L G I - L S Q L R G S H S I R G I - K . . . . . . 13860 GGGGCGAGTACCTGGTTTAAGATCAATGCAGAAGTCAATATCTCTATCCGGTGGCATACC G A S T W F K I N A E V N I S I R W H T G R V P G L R S M Q K S I S L S G G I P W G E Y L V - D Q C R S Q Y L Y P V A Y . . . . . . 13920 CGGAAGGTCTGCGGGAAACTAAAAACTCACGAACTATCAAAACCGACTCAATTGGAGGTA R K V C G K L K T H E L S K P T Q L E V G R S A G N - K L T N Y Q N R L N W R Y P E G L R E T K N S R T I K T D S I G G . . . . . . 13980 CTTGGGTAGTATCATCCCTGAGATGTGCGAAGAACGCTAAACAAACCTTACTAACCATTT L G - Y H P - D V R R T L N K P Y - P F L G S I I P E M C E E R - T N L T N H F T W V V S S L R C A K N A K Q T L L T I . . . . . . 14040 TCTTAGCACGAAGAAAGGAGATGATACGAACCGGATCGGAAGTGTAGTCACCCTCCCACA S - H E E R R - Y E P D R K C S H P P T L S T K K G D D T N R I G S V V T L P H F L A R R K E M I R T G S E V - S P S H . . . . . . 14100 CTAACGGATCTGTCCCAGGCTTGGCTAACATCACAGTTTTAGCATTACAATCCAAGATCG L T D L S Q A W L T S Q F - H Y N P R S - R I C P R L G - H H S F S I T I Q D R T N G S V P G L A N I T V L A L Q S K I . . . . . . : 14160 CAAAATTCGGAGAAAGCCAAGTCATACCCAGAATTACATCAAAATCAACCAT : TTCTTGAA Q N S E K A K S Y P E L H Q N Q P : F L E K I R R K P S H T Q N Y I K I N H : F L K A K F G E S Q V I P R I T S K S T I : S - . . . . . . 14964 AAGATTTTGCCGTAGCCGCTACCTGTAAGGCTGAAATCCGCAATTCTGACCTTAACCCCT K I L P - P L P V R L K S A I L T L T P R F C R S R Y L - G - N P Q F - P - P L K D F A V A A T C K A E I R N S D L N P . . . . . . 15024 TCATAAAATGGTGAATCCGCTCTTGTGGACTGAAATAAAGTTGGGTGGCATACCTGGATA S - N G E S A L V D - N K V G W H T W I H K M V N P L L W T E I K L G G I P G - F I K W - I R S C G L K - S W V A Y L D . . . . . 15084 ATGCACGAAACTTAGCCTCATATGCGGTAACCGACATCCTACCTTGC M H E T - P H M R - P T S Y L C T K L S L I C G N R H P T L N A R N L A S Y A V T D I L P C Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-1+_PGL-4_AGS-2_PPS_1 (13880 14086) (frame '0'; 204 bp, 68 residues) 1 DQCRSQYLYP VAYPEGLRET KNSRTIKTDS IGGTWVVSSL RCAKNAKQTL LTIFLARRKE 61 MIRTGSEV- AGS-3 (14295 14324,14360 14890) SCR (e 0.633 d 0.799 a 0.000,e 0.881) Exon 1 14295 14324 ( 30 n); score: 0.633 Intron 1 14325 14359 ( 35 n); Pd: 0.799 Pa: 0.000 Exon 2 14360 14890 ( 531 n); score: 0.881 PGS (14295 14324,14360 14890) SGN-E577713+ 3-phase translation of AGS-3 (+strand): . . . : . . . 14295 ACTCACCCACCGGAGTAGAAACACGAATAG : TTAGCAAATGAGGAAGATACATAAGAAAAT T H P P E - K H E - : L A N E E D T - E N L T H R S R N T N S : - Q M R K I H K K M S P T G V E T R I : V S K - G R Y I R K . . . . . . 14390 GTGGATCCAGGATAAAACAATACATAAGCAATGCAATCACAAACCAAAAGATTACCTGTG V D P G - N N T - A M Q S Q T K R L P V W I Q D K T I H K Q C N H K P K D Y L - C G S R I K Q Y I S N A I T N Q K I T C . . . . . . 14450 ATGACAGCATCCGATGTCTCCGCTTCAGATCTCCCAGGGAAAGCATAACAACGGGCCCTA M T A S D V S A S D L P G K A - Q R A L - Q H P M S P L Q I S Q G K H N N G P Y D D S I R C L R F R S P R E S I T T G P . . . . . . 14510 TCACCTGTCTGCCCGTTGCCCCTACCATGTTGCGCTGCAGTAGTTCCCATTTGTCCGTCA S P V C P L P L P C C A A V V P I C P S H L S A R C P Y H V A L Q - F P F V R H I T C L P V A P T M L R C S S S H L S V . . . . . . 14570 CCCCCGCCGTTTTGGTGACCACCATTACCTCGGCCACCATGTCCTCCAGAATAGCGGCCT P P P F W - P P L P R P P C P P E - R P P R R F G D H H Y L G H H V L Q N S G L T P A V L V T T I T S A T M S S R I A A . . . . . . 14630 CTACCATGACCACCTCTACCTCTAGCTATTGAGGGTCTATAACTCTATTTTGGACAATTC L P - P P L P L A I E G L - L Y F G Q F Y H D H L Y L - L L R V Y N S I L D N S S T M T T S T S S Y - G S I T L F W T I . . . . . . 14690 CTCCTAATATGTCCAGTCTCCCCACATCCATAACACTCCCTGGAGTCAAGCATAGGTCTC L L I C P V S P H P - H S L E S S I G L S - Y V Q S P H I H N T P W S Q A - V S P P N M S S L P T S I T L P G V K H R S . . . . . . 14750 TCAGAGAAGTGTTGACCGGTCTGAGGTGGACCCCCAACTACAGTTTGTAGTGAAGACTGA S E K C - P V - G G P P T T V C S E D - Q R S V D R S E V D P Q L Q F V V K T E L R E V L T G L R W T P N Y S L - - R L . . . . . . 14810 ATTGGTTGGACTGAGTAACCTCCTGAACCCTGTCCTCTAGAGTAAGAACCATTAAACTCA I G W T E - P P E P C P L E - E P L N S L V G L S N L L N P V L - S K N H - T H N W L D - V T S - T L S S R V R T I K L . . . 14870 CCTCCCTTTCGAAGAATTTTT P P F R R I F L P F E E F T S L S K N F Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-1+_PGL-4_AGS-3_PPS_1 (14371 14661) (frame '0'; 288 bp, 96 residues) 1 GRYIRKCGSR IKQYISNAIT NQKITCDDSI RCLRFRSPRE SITTGPITCL PVAPTMLRCS 61 SSHLSVTPAV LVTTITSATM SSRIAASTMT TSTSSY- PGL 5 (- strand): 19206 15833 AGS-1 (16998 16959,16857 15833) SCR (e 0.712 d 0.000 a 0.000,e 0.856) Exon 1 16998 16959 ( 40 n); score: 0.712 Intron 1 16958 16858 ( 101 n); Pd: 0.000 Pa: 0.000 Exon 2 16857 15833 (1025 n); score: 0.856 PGS (16117 15833) SGN-E355114+ PGS (16538 15875) SGN-E392027- PGS (16619 15890) SGN-E351546+ PGS (16619 15964) SGN-E356696+ PGS (16619 16040) SGN-E356206+ PGS (16535 16282) SGN-E373117+ PGS (16535 16282) SGN-E373116- PGS (16998 16959,16857 16298) SGN-E546506- PGS (16848 16597) SGN-E357033- 3-phase translation of AGS-1 (-strand): . . . . : . . 16998 ATAGAATTAGGGGATCGGGTGCCACGAACCGACACGTAGA : TAGAACTAGGGAATCGGAGT I E L G D R V P R T D T - : I E L G N R S - N - G I G C H E P T R R : - N - G I G V R I R G S G A T N R H V D : R T R E S E . . . . . . 16837 GTCACGTACCGACACAAGAGTAAAGGTGATGAATCTTGAAAGATGTTAATATACTCAATC V T Y R H K S K G D E S - K M L I Y S I S R T D T R V K V M N L E R C - Y T Q S C H V P T Q E - R - - I L K D V N I L N . . . . . . 16777 TAATGAACCTAAGTCCCAAATGAGTATGGTATTGAGGCTTGAGTCCTCATGAGTGTACTT - - T - V P N E Y G I E A - V L M S V L N E P K S Q M S M V L R L E S S - V Y L L M N L S P K - V W Y - G L S P H E C T . . . . . . 16717 GACGTTATTTATCAAAGATTCTTGTACTTGTTGCTACATGTTGAGTAATGTAGTTGATTT D V I Y Q R F L Y L L L H V E - C S - F T L F I K D S C T C C Y M L S N V V D F - R Y L S K I L V L V A T C - V M - L I . . . . . . 16657 TATATTATTACTTGATATATATTGTTTTCTATTTTGAGTTGGCCGATGATATCTACTCAG Y I I T - Y I L F S I L S W P M I S T Q I L L L D I Y C F L F - V G R - Y L L S L Y Y Y L I Y I V F Y F E L A D D I Y S . . . . . . 16597 TACCCATGTTTTGTACTGACCCCTACTTGTATGTTTCTTTCCTTGTTATTTGTGGAGTGC Y P C F V L T P T C M F L S L L F V E C T H V L Y - P L L V C F F P C Y L W S A V P M F C T D P Y L Y V S F L V I C G V . . . . . . 16537 AGCAAACGTGCCGTCGTCTTCAACTCAACCGCAACTCTAGCAAGACTTCATAACACCGGA S K R A V V F N S T A T L A R L H N T G A N V P S S S T Q P Q L - Q D F I T P D Q Q T C R R L Q L N R N S S K T S - H R . . . . . . 16477 TTTCAGGGTGAGCTACACTTCTAGCTTGAACTGGATCTTCTTGTTCATGTCTTGATGCCT F Q G E L H F - L E L D L L V H V L M P F R V S Y T S S L N W I F L F M S - C L I S G - A T L L A - T G S S C S C L D A . . . . . . 16417 TGAAGTTCCAGCATGGACTAGCTTTTTATTTATTCTAGCTTTCTAGATACTCTTAGCTTT - S S S M D - L F I Y S S F L D T L S F E V P A W T S F L F I L A F - I L L A L L K F Q H G L A F Y L F - L S R Y S - L . . . . . . 16357 AGTAATTTGAGGATAGATGTTCTTGTGATGATGACTTCCAGATTTTGGGGATAATGATAA S N L R I D V L V M M T S R F W G - - - V I - G - M F L - - - L P D F G D N D K - - F E D R C S C D D D F Q I L G I M I . . . . . . 16297 GTTTGAGTTTTAGAAAGTGATTATTGATTTTCATTAATGAGTTTAAGTCTTCCGCATTAT V - V L E S D Y - F S L M S L S L P H Y F E F - K V I I D F H - - V - V F R I I S L S F R K - L L I F I N E F K S S A L . . . . . . 16237 ATTATGTTAATTATGTTTGAAATGTTGGGGTTCAGATTGGTTGGTTCGCTCACATAGTAG I M L I M F E M L G F R L V G S L T - - L C - L C L K C W G S D W L V R S H S R Y Y V N Y V - N V G V Q I G W F A H I V . . . . . . 16177 GATAAGTGTGGGTGCCACTCGCGACCCGTTTTGGGTCGTGACAAACTTGGTATTAGAGCA D K C G C H S R P V L G R D K L G I R A I S V G A T R D P F W V V T N L V L E H G - V W V P L A T R F G S - Q T W Y - S . . . . . . 16117 TTAGGTTCGTTGGTCTCATCACACAAGAACGAGTCTAGTAGAGTCTGAAGGAACGGTAGG L G S L V S S H K N E S S R V - R N G R - V R W S H H T R T S L V E S E G T V G I R F V G L I T Q E R V - - S L K E R - . . . . . . 16057 GGGACGCCTTTACTTTTCTTTGAGAGGCTATAAGACTTTAGGAAAAATTCCATTCTTTCT G T P L L F F E R L - D F R K N S I L S G R L Y F S L R G Y K T L G K I P F F L G D A F T F L - E A I R L - E K F H S F . . . . . . 15997 TTCTTTCCTTTGTGCTATTACTTGGATCCAATTGGTATCTAGGTGATACAAATTGGTATC F F P L C Y Y L D P I G I - V I Q I G I S F L C A I T W I Q L V S R - Y K L V S F L S F V L L L G S N W Y L G D T N W Y . . . . . . 15937 TGACCATCTTCACTCTATTTCGCAGATGGTTAGAACTAGAGCAACAACCACGCCAACATC - P S S L Y F A D G - N - S N N H A N I D H L H S I S Q M V R T R A T T T P T S L T I F T L F R R W L E L E Q Q P R Q H . . . . . 15877 AACATCGGCAAGACAAGATGCATCTGAGCCAGCCATTGTGACTGT N I G K T R C I - A S H C D C T S A R Q D A S E P A I V T Q H R Q D K M H L S Q P L - L Maximal non-overlapping open reading frames (>= 64 codons): none AGS-2 (17355 16813,16535 16135) SCR (e 0.849 d 0.890 a 0.968,e 0.883) Exon 1 17355 16813 ( 543 n); score: 0.849 Intron 1 16812 16536 ( 277 n); Pd: 0.890 Pa: 0.968 Exon 2 16535 16135 ( 401 n); score: 0.883 PGS (16827 16813,16535 16135) SGN-E370357- PGS (17355 16808) SGN-E347579+ PGS (17333 16823) SGN-E349726+ PGS (17333 16929) SGN-E357559+ 3-phase translation of AGS-2 (-strand): . . . . . . 17355 AAATTGCTTGCTTGATCACGAATCTTCGGTGGAAGTAGGTTATGGTTTTTATACTATTCG K L L A - S R I F G G S R L W F L Y Y S N C L L D H E S S V E V G Y G F Y T I R I A C L I T N L R W K - V M V F I L F . . . . . . 17295 TAGTTAACTCTTAATAGCGAATGATATGTGTTGGGTTGTATTGTAAAGTCTTCTATATGC - L T L N S E - Y V L G C I V K S S I C S - L L I A N D M C W V V L - S L L Y A V V N S - - R M I C V G L Y C K V F Y M . . . . . . 17235 TTAATTGTATGCTTGCATGAATATGATTATATAATTGTGATGAAATAAGCATGATGAAGC L I V C L H E Y D Y I I V M K - A - - S - L Y A C M N M I I - L - - N K H D E A L N C M L A - I - L Y N C D E I S M M K . . . . . . 17175 TATTGAATCCCAAATCTTGAAAACTCCAATCTTGAAAACCCCTTGTTATTGATGATGCCT Y - I P N L E N S N L E N P L L L M M P I E S Q I L K T P I L K T P C Y - - C L L L N P K S - K L Q S - K P L V I D D A . . . . . . 17115 TGGTATAAAAGAAGGCTTGATGAACTAAAGTAATGAGATTGATGATGCCTTGGTATAAAA W Y K R R L D E L K - - D - - C L G I K G I K E G L M N - S N E I D D A L V - K L V - K K A - - T K V M R L M M P W Y K . . . . . . 17055 GAAGGCTTGATGAATTAATAGAATGAGATTAGTGGAGTAGGTGTCACGAACCGACACATA E G L M N - - N E I S G V G V T N R H I K A - - I N R M R L V E - V S R T D T - R R L D E L I E - D - W S R C H E P T H . . . . . . 16995 GAATTAGGGGATCGGGTGCCACGAACCGACACGTAGAATTAAGGGATCGGGTGTCACGAA E L G D R V P R T D T - N - G I G C H E N - G I G C H E P T R R I K G S G V T N R I R G S G A T N R H V E L R D R V S R . . . . . . 16935 CCGACACGTAGAATTAGGGAATTGGGTGTCACAAACTGACACGTAGAATTAGGGGATCGG P T R R I R E L G V T N - H V E L G D R R H V E L G N W V S Q T D T - N - G I G T D T - N - G I G C H K L T R R I R G S . . . . . . 16875 GTGTCACGAATCGACACGTAGAACTAGGGAATCGGAGTGTCACGTACCGACACAAGAGTA V S R I D T - N - G I G V S R T D T R V C H E S T R R T R E S E C H V P T Q E - G V T N R H V E L G N R S V T Y R H K S . : . . . . . 16815 AAG : CAAACGTGCCGTCGTCTTCAACTCAACCGCAACTCTAGCAAGACTTCATAACACCGG K : Q T C R R L Q L N R N S S K T S - H R S : K R A V V F N S T A T L A R L H N T G K : A N V P S S S T Q P Q L - Q D F I T P . . . . . . 16478 ATTTCAGGGTGAGCTACACTTCTAGCTTGAACTGGATCTTCTTGTTCATGTCTTGATGCC I S G - A T L L A - T G S S C S C L D A F Q G E L H F - L E L D L L V H V L M P D F R V S Y T S S L N W I F L F M S - C . . . . . . 16418 TTGAAGTTCCAGCATGGACTAGCTTTTTATTTATTCTAGCTTTCTAGATACTCTTAGCTT L K F Q H G L A F Y L F - L S R Y S - L - S S S M D - L F I Y S S F L D T L S F L E V P A W T S F L F I L A F - I L L A . . . . . . 16358 TAGTAATTTGAGGATAGATGTTCTTGTGATGATGACTTCCAGATTTTGGGGATAATGATA - - F E D R C S C D D D F Q I L G I M I S N L R I D V L V M M T S R F W G - - - L V I - G - M F L - - - L P D F G D N D . . . . . . 16298 AGTTTGAGTTTTAGAAAGTGATTATTGATTTTCATTAATGAGTTTAAGTCTTCCGCATTA S L S F R K - L L I F I N E F K S S A L V - V L E S D Y - F S L M S L S L P H Y K F E F - K V I I D F H - - V - V F R I . . . . . . 16238 TATTATGTTAATTATGTTTGAAATGTTGGGGTTCAGATTGGTTGGTTCGCTCACATAGTA Y Y V N Y V - N V G V Q I G W F A H I V I M L I M F E M L G F R L V G S L T - - I L C - L C L K C W G S D W L V R S H S . . . . . 16178 GGATAAGTGTGGGTGCCACTCGCGACCCGTTTTGGGTCGTGACA G - V W V P L A T R F G S - D K C G C H S R P V L G R D R I S V G A T R D P F W V V T Maximal non-overlapping open reading frames (>= 64 codons): none AGS-3 (17082 16404) SCR (e 0.811) Exon 1 17082 16404 ( 679 n); score: 0.811 PGS (17082 16404) SGN-E241789- PGS (16889 16601) SGN-E349977+ 3-phase translation of AGS-3 (-strand): . . . . . . 17082 TGAGATTGATGATGCCTTGGTATAAAAGAAGGCTTGATGAATTAATAGAATGAGATTAGT - D - - C L G I K E G L M N - - N E I S E I D D A L V - K K A - - I N R M R L V R L M M P W Y K R R L D E L I E - D - . . . . . . 17022 GGAGTAGGTGTCACGAACCGACACATAGAATTAGGGGATCGGGTGCCACGAACCGACACG G V G V T N R H I E L G D R V P R T D T E - V S R T D T - N - G I G C H E P T R W S R C H E P T H R I R G S G A T N R H . . . . . . 16962 TAGAATTAAGGGATCGGGTGTCACGAACCGACACGTAGAATTAGGGAATTGGGTGTCACA - N - G I G C H E P T R R I R E L G V T R I K G S G V T N R H V E L G N W V S Q V E L R D R V S R T D T - N - G I G C H . . . . . . 16902 AACTGACACGTAGAATTAGGGGATCGGGTGTCACGAATCGACACGTAGAACTAGGGAATC N - H V E L G D R V S R I D T - N - G I T D T - N - G I G C H E S T R R T R E S K L T R R I R G S G V T N R H V E L G N . . . . . . 16842 GGAGTGTCACGTACCGACACAAGAGTAAAGGTGATGAATCTTGAAAGATGTTAATATACT G V S R T D T R V K V M N L E R C - Y T E C H V P T Q E - R - - I L K D V N I L R S V T Y R H K S K G D E S - K M L I Y . . . . . . 16782 CAATCTAATGAACCTAAGTCCCAAATGAGTATGGTATTGAGGCTTGAGTCCTCATGAGTG Q S N E P K S Q M S M V L R L E S S - V N L M N L S P K - V W Y - G L S P H E C S I - - T - V P N E Y G I E A - V L M S . . . . . . 16722 TACTTGACGTTATTTATCAAAGATTCTTGTACTTGTTGCTACATGTTGAGTAATGTAGTT Y L T L F I K D S C T C C Y M L S N V V T - R Y L S K I L V L V A T C - V M - L V L D V I Y Q R F L Y L L L H V E - C S . . . . . . 16662 GATTTTATATTATTACTTGATATATATTGTTTTCTATTTTGAGTTGGCCGATGATATCTA D F I L L L D I Y C F L F - V G R - Y L I L Y Y Y L I Y I V F Y F E L A D D I Y - F Y I I T - Y I L F S I L S W P M I S . . . . . . 16602 CTCAGTACCCATGTTTTGTACTGACCCCTACTTGTATGTTTCTTTCCTTGTTATTTGTGG L S T H V L Y - P L L V C F F P C Y L W S V P M F C T D P Y L Y V S F L V I C G T Q Y P C F V L T P T C M F L S L L F V . . . . . . 16542 AGTGCAGCAAACGTGCCGTCGTCTTCAACTCAACCGCAACTCTAGCAAGACTTCATAACA S A A N V P S S S T Q P Q L - Q D F I T V Q Q T C R R L Q L N R N S S K T S - H E C S K R A V V F N S T A T L A R L H N . . . . . . 16482 CCGGATTTCAGGGTGAGCTACACTTCTAGCTTGAACTGGATCTTCTTGTTCATGTCTTGA P D F R V S Y T S S L N W I F L F M S - R I S G - A T L L A - T G S S C S C L D T G F Q G E L H F - L E L D L L V H V L . . 16422 TGCCTTGAAGTTCCAGCAT C L E V P A A L K F Q H M P - S S S Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-3 (+strand): . . . . . . 16404 ATGCTGGAACTTCAAGGCATCAAGACATGAACAAGAAGATCCAGTTCAAGCTAGAAGTGT M L E L Q G I K T - T R R S S S S - K C C W N F K A S R H E Q E D P V Q A R S V A G T S R H Q D M N K K I Q F K L E V . . . . . . 16464 AGCTCACCCTGAAATCCGGTGTTATGAAGTCTTGCTAGAGTTGCGGTTGAGTTGAAGACG S S P - N P V L - S L A R V A V E L K T A H P E I R C Y E V L L E L R L S - R R - L T L K S G V M K S C - S C G - V E D . . . . . . 16524 ACGGCACGTTTGCTGCACTCCACAAATAACAAGGAAAGAAACATACAAGTAGGGGTCAGT T A R L L H S T N N K E R N I Q V G V S R H V C C T P Q I T R K E T Y K - G S V D G T F A A L H K - Q G K K H T S R G Q . . . . . . 16584 ACAAAACATGGGTACTGAGTAGATATCATCGGCCAACTCAAAATAGAAAACAATATATAT T K H G Y - V D I I G Q L K I E N N I Y Q N M G T E - I S S A N S K - K T I Y I Y K T W V L S R Y H R P T Q N R K Q Y I . . . . . . 16644 CAAGTAATAATATAAAATCAACTACATTACTCAACATGTAGCAACAAGTACAAGAATCTT Q V I I - N Q L H Y S T C S N K Y K N L K - - Y K I N Y I T Q H V A T S T R I F S S N N I K S T T L L N M - Q Q V Q E S . . . . . . 16704 TGATAAATAACGTCAAGTACACTCATGAGGACTCAAGCCTCAATACCATACTCATTTGGG - - I T S S T L M R T Q A S I P Y S F G D K - R Q V H S - G L K P Q Y H T H L G L I N N V K Y T H E D S S L N T I L I W . . . . . . 16764 ACTTAGGTTCATTAGATTGAGTATATTAACATCTTTCAAGATTCATCACCTTTACTCTTG T - V H - I E Y I N I F Q D S S P L L L L R F I R L S I L T S F K I H H L Y S C D L G S L D - V Y - H L S R F I T F T L . . . . . . 16824 TGTCGGTACGTGACACTCCGATTCCCTAGTTCTACGTGTCGATTCGTGACACCCGATCCC C R Y V T L R F P S S T C R F V T P D P V G T - H S D S L V L R V D S - H P I P V S V R D T P I P - F Y V S I R D T R S . . . . . . 16884 CTAATTCTACGTGTCAGTTTGTGACACCCAATTCCCTAATTCTACGTGTCGGTTCGTGAC L I L R V S L - H P I P - F Y V S V R D - F Y V S V C D T Q F P N S T C R F V T P N S T C Q F V T P N S L I L R V G S - . . . . . . 16944 ACCCGATCCCTTAATTCTACGTGTCGGTTCGTGGCACCCGATCCCCTAATTCTATGTGTC T R S L N S T C R F V A P D P L I L C V P D P L I L R V G S W H P I P - F Y V S H P I P - F Y V S V R G T R S P N S M C . . . . . . 17004 GGTTCGTGACACCTACTCCACTAATCTCATTCTATTAATTCATCAAGCCTTCTTTTATAC G S - H L L H - S H S I N S S S L L L Y V R D T Y S T N L I L L I H Q A F F Y T R F V T P T P L I S F Y - F I K P S F I . . 17064 CAAGGCATCATCAATCTCA Q G I I N L K A S S I S P R H H Q S Maximal non-overlapping open reading frames (>= 64 codons): none AGS-4 (17012 16944,16848 16446) SCR (e 0.725 d 0.000 a 0.000,e 0.823) Exon 1 17012 16944 ( 69 n); score: 0.725 Intron 1 16943 16849 ( 95 n); Pd: 0.000 Pa: 0.000 Exon 2 16848 16446 ( 403 n); score: 0.823 PGS (17011 16944,16848 16446) SGN-E246710+ PGS (17012 16944,16848 16685) SGN-E391780+ 3-phase translation of AGS-4 (-strand): . . . . . . 17012 TCACGAACCGACACATAGAATTAGGGGATCGGGTGCCACGAACCGACACGTAGAATTAAG S R T D T - N - G I G C H E P T R R I K H E P T H R I R G S G A T N R H V E L R T N R H I E L G D R V P R T D T - N - . : . . . . . 16952 GGATCGGGT : GGAATCGGAGTGTCACGTACCGACACAAGAGTAAAGGTGATGAATCTTGAA G S G : G I G V S R T D T R V K V M N L E D R V : E S E C H V P T Q E - R - - I L K G I G : W N R S V T Y R H K S K G D E S - . . . . . . 16797 AGATGTTAATATACTCAATCTAATGAACCTAAGTCCCAAATGAGTATGGTATTGAGGCTT R C - Y T Q S N E P K S Q M S M V L R L D V N I L N L M N L S P K - V W Y - G L K M L I Y S I - - T - V P N E Y G I E A . . . . . . 16737 GAGTCCTCATGAGTGTACTTGACGTTATTTATCAAAGATTCTTGTACTTGTTGCTACATG E S S - V Y L T L F I K D S C T C C Y M S P H E C T - R Y L S K I L V L V A T C - V L M S V L D V I Y Q R F L Y L L L H . . . . . . 16677 TTGAGTAATGTAGTTGATTTTATATTATTACTTGATATATATTGTTTTCTATTTTGAGTT L S N V V D F I L L L D I Y C F L F - V - V M - L I L Y Y Y L I Y I V F Y F E L V E - C S - F Y I I T - Y I L F S I L S . . . . . . 16617 GGCCGATGATATCTACTCAGTACCCATGTTTTGTACTGACCCCTACTTGTATGTTTCTTT G R - Y L L S T H V L Y - P L L V C F F A D D I Y S V P M F C T D P Y L Y V S F W P M I S T Q Y P C F V L T P T C M F L . . . . . . 16557 CCTTGTTATTTGTGGAGTGCAGCAAACGTGCCGTCGTCTTCAACTCAACCGCAACTCTAG P C Y L W S A A N V P S S S T Q P Q L - L V I C G V Q Q T C R R L Q L N R N S S S L L F V E C S K R A V V F N S T A T L . . . . . . 16497 CAAGACTTCATAACACCGGATTTCAGGGTGAGCTACACTTCTAGCTTGAACT Q D F I T P D F R V S Y T S S L N K T S - H R I S G - A T L L A - T A R L H N T G F Q G E L H F - L E Maximal non-overlapping open reading frames (>= 64 codons): none AGS-5 (18544 18514,17959 17386) SCR (e 0.710 d 0.845 a 0.000,e 0.841) Exon 1 18544 18514 ( 31 n); score: 0.710 Intron 1 18513 17960 ( 554 n); Pd: 0.845 Pa: 0.000 Exon 2 17959 17386 ( 574 n); score: 0.841 PGS (17959 17386) SGN-E550322- PGS (18544 18514,17959 17396) SGN-E550140+ PGS (17963 17396) SGN-E374999- PGS (17961 17396) SGN-E389834- PGS (17960 17396) SGN-E389553+ PGS (17959 17396) SGN-E550201- PGS (17959 17396) SGN-E550335- PGS (17959 17396) SGN-E550212- PGS (17959 17396) SGN-E550065- PGS (17959 17396) SGN-E390013- PGS (17959 17396) SGN-E550484- PGS (17959 17396) SGN-E550211- PGS (17959 17396) SGN-E550025- PGS (17959 17396) SGN-E396056- PGS (17959 17396) SGN-E550207- PGS (17959 17396) SGN-E550464- PGS (17959 17396) SGN-E549941- PGS (17959 17396) SGN-E396039- PGS (17959 17396) SGN-E377133- PGS (17954 17396) SGN-E231589- PGS (17954 17396) SGN-E396054- PGS (17954 17396) SGN-E396058- PGS (17940 17396) SGN-E241959- PGS (17869 17396) SGN-E236652- PGS (17850 17396) SGN-E396070- PGS (17896 17507) SGN-E242274+ PGS (17955 17658) SGN-E252199+ 3-phase translation of AGS-5 (-strand): . . . . : . . 18544 TAAAATAAAAAATTTATTATATGTATAAAAA : ATAATAATGTAACGACCTATTTAGTCGTT - N K K F I I C I K : N N N V T T Y L V V K I K N L L Y V - K : I I M - R P I - S F K - K I Y Y M Y K K : - - C N D L F S R . . . . . . 17930 TTGAGCAGCAGATTTTATTTTTGGAAAAACTGGCTGAGACGACGGATCCCACGATGGACC L S S R F Y F W K N W L R R R I P R W T - A A D F I F G K T G - D D G S H D G P F E Q Q I L F L E K L A E T T D P T M D . . . . . . 17870 GTCATGGGCACGATGGACCGTCGAGGGGGTCTCGTTCCAAAATACATAGAATTCTGAAAT V M G T M D R R G G L V P K Y I E F - N S W A R W T V E G V S F Q N T - N S E I R H G H D G P S R G S R S K I H R I L K . . . . . . 17810 TTGGGTTTTGAAATCGACTCTCTGAACTTCGTGATGAAGTGGCAGGACGGACCGTCACAG L G F E I D S L N F V M K W Q D G P S Q W V L K S T L - T S - - S G R T D R H R F G F - N R L S E L R D E V A G R T V T . . . . . . 17750 GCATGACGGGCCGTCACAGTCTCTTCAGAAAATTTCAGTCTCTGAACTCTGTGACGGAAG A - R A V T V S S E N F S L - T L - R K H D G P S Q S L Q K I S V S E L C D G S G M T G R H S L F R K F Q S L N S V T E . . . . . . 17690 CAGCAGGACGGACCGTCGCAGGCACGACGACCCGTCACAGACTGCGTAATCCCAGGCTGA Q Q D G P S Q A R R P V T D C V I P G - S R T D R R R H D D P S Q T A - S Q A E A A G R T V A G T T T R H R L R N P R L . . . . . . 17630 GTCGGATTTCTTTAAATGTTTTAAGGGGGCGTTTTGGACTATTCCTGCTATAATTATAAA V G F L - M F - G G V L D Y S C Y N Y K S D F F K C F K G A F W T I P A I I I N S R I S L N V L R G R F G L F L L - L - . . . . . . 17570 TTTAGTGGGTTAATGTTAATAATTTAACTACTTGAGGGTTAAAAGAGATAACCTTGAATT F S G L M L I I - L L E G - K R - P - I L V G - C - - F N Y L R V K R D N L E L I - W V N V N N L T T - G L K E I T L N . . . . . . 17510 AGTTAGTGGGTTAAACTCATCATCTTTCATACTTAATTATATGCTAATTAGGGTAAAAGA S - W V K L I I F H T - L Y A N - G K R V S G L N S S S F I L N Y M L I R V K E - L V G - T H H L S Y L I I C - L G - K . . . . . . 17450 AAGAAGGTTTGAATAAGAAAAAGAAAAGAACAGAAAGAGAGGGAGAAACGATCGAGAGAG K K V - I R K R K E Q K E R E K R S R E R R F E - E K E K N R K R G R N D R E R K E G L N K K K K R T E R E G E T I E R . 17390 AGAGA R E E R Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-1-_PGL-5_AGS-5_PPS_1 (17799 17578) (frame '0'; 219 bp, 73 residues) 1 NRLSELRDEV AGRTVTGMTG RHSLFRKFQS LNSVTEAAGR TVAGTTTRHR LRNPRLSRIS 61 LNVLRGRFGL FLL- AGS-6 (19206 19143,18331 18001) SCR (e 0.719 d 0.000 a 0.000,e 0.796) Exon 1 19206 19143 ( 64 n); score: 0.719 Intron 1 19142 18332 ( 811 n); Pd: 0.000 Pa: 0.000 Exon 2 18331 18001 ( 331 n); score: 0.796 PGS (19206 19143,18331 18001) SGN-E540167- PGS (18340 18006) SGN-E540411+ 3-phase translation of AGS-6 (-strand): . . . . . . 19206 ATTTTTTATTTTATCTAATAAGAAAATGACAAATAATATATTTTTAAAAAATAAATAAAA I F Y F I - - E N D K - Y I F K K - I K F F I L S N K K M T N N I F L K N K - N F L F Y L I R K - Q I I Y F - K I N K . : . . . . . 19146 CAAA : ATTTTTAAATTATATTGAGAAAATGCACAAGTATTCCCTCAAACTATGTCTGAAAT Q : N F - I I L R K C T S I P S N Y V - N K : I F K L Y - E N A Q V F P Q T M S E I T K : F L N Y I E K M H K Y S L K L C L K . . . . . . 18275 CCCAGAGACACACTTATACTATATTAAGGTCATATTACCCCCTGAACTTATTTTATAAGT P R D T L I L Y - G H I T P - T Y F I S P E T H L Y Y I K V I L P P E L I L - V S Q R H T Y T I L R S Y Y P L N L F Y K . . . . . . 18215 AATTTTCTACCCCTTTTGACCTACGTGGCTCTAGCTTGAAAAAAAAGTCAATCAGCGTTG N F L P L L T Y V A L A - K K S Q S A L I F Y P F - P T W L - L E K K V N Q R W - F S T P F D L R G S S L K K K S I S V . . . . . . 18155 GACCCACAAGATAGTGCCACATAGACCGAAAAGGGCTAGAAAATTATTAATAAAATAAGT D P Q D S A T - T E K G - K I I N K I S T H K I V P H R P K R A R K L L I K - V G P T R - C H I D R K G L E N Y - - N K . . . . . . 18095 TCAGGGATAATAGGACCTTAGTATAGTGTAAGTATGACTTTAAAATTTCAGGCATAAATT S G I I G P - Y S V S M T L K F Q A - I Q G - - D L S I V - V - L - N F R H K L F R D N R T L V - C K Y D F K I S G I N . . . . 18035 GAGAGGGTACTTGTGCATTATCTCAATAATATTCA E R V L V H Y L N N I R G Y L C I I S I I F - E G T C A L S Q - Y S Maximal non-overlapping open reading frames (>= 64 codons): none PGL 6 (+ strand): 20437 25265 AGS-1 (20437 21611) SCR (e 0.768) Exon 1 20437 21611 (1175 n); score: 0.768 PGS (20437 21038) SGN-E241789- PGS (20588 20996) SGN-E246710+ PGS (20823 21555) SGN-E351546+ PGS (20823 21458) SGN-E356696+ PGS (20823 21410) SGN-E356206+ PGS (20903 21572) SGN-E392027- PGS (20904 21173) SGN-E546219- PGS (21333 21611) SGN-E355114+ 3-phase translation of AGS-1 (+strand): . . . . . . 20437 ATAAATGGATTGGGTGTCATGTTTCGACACGGTAGTATTAGGGGATCGGAGTGTCACATT I N G L G V M F R H G S I R G S E C H I - M D W V S C F D T V V L G D R S V T F K W I G C H V S T R - Y - G I G V S H . . . . . . 20497 CTGACACGGTAGTATTAGGGGATCGGGTGTCACGTTCTGACACGGTAGTATTAGGGGATC L T R - Y - G I G C H V L T R - Y - G I - H G S I R G S G V T F - H G S I R G S S D T V V L G D R V S R S D T V V L G D . . . . . . 20557 AGAGTGTCACGTTCCGACACGGTAGTAGTAGGGGATCGGGTGTAACGTTCCGACACGATA R V S R S D T V V V G D R V - R S D T I E C H V P T R - - - G I G C N V P T R - Q S V T F R H G S S R G S G V T F R H D . . . . . . 20617 ATGATAAAGAGAATGAATCTTGAATTATGTTAATGTACTCAAATTCAAAGAACCTATTTC M I K R M N L E L C - C T Q I Q R T Y F - - R E - I L N Y V N V L K F K E P I S N D K E N E S - I M L M Y S N S K N L F . . . . . . 20677 CCAAATGAGTATGGTGTGGAGGCTTGAGTCCTCATAGATGTGCTTGCTGTTGTTGTCAAT P N E Y G V E A - V L I D V L A V V V N Q M S M V W R L E S S - M C L L L L S M P K - V W C G G L S P H R C A C C C C Q . . . . . . 20737 GGTTCTTATACTTGTTGATTGTCACCTGTTAAGTATTATGGTTGATTTTATATTATTATT G S Y T C - L S P V K Y Y G - F Y I I I V L I L V D C H L L S I M V D F I L L F W F L Y L L I V T C - V L W L I L Y Y Y . . . . . . 20797 CAGTATATATTGTTTTCTATTTTGAGTTGGCCGATGATACCTACTCAGTACGTGTTCCTT Q Y I L F S I L S W P M I P T Q Y V F L S I Y C F L F - V G R - Y L L S T C S L S V Y I V F Y F E L A D D T Y S V R V P . . . . . . 20857 GTACTGACCCCTACTTGTAATTTTCTTCTTTGTTATTTGTGGAGTGCAGCAAGCGTGCCA V L T P T C N F L L C Y L W S A A S V P Y - P L L V I F F F V I C G V Q Q A C H C T D P Y L - F S S L L F V E C S K R A . . . . . . 20917 TCGACTTCGACTCGTCATCAGATCTAGCCGGTCTTTAGCATATCAGAATTCAGGGTGAGC S T S T R H Q I - P V F S I S E F R V S R L R L V I R S S R S L A Y Q N S G - A I D F D S S S D L A G L - H I R I Q G E . . . . . . 20977 TATTATTCCTAGCTCGTGCTGGATTCTCTCCTTCACGTCTTGATGTCTTGAAGTTCGGAC Y Y S - L V L D S L L H V L M S - S S D I I P S S C W I L S F T S - C L E V R T L L F L A R A G F S P S R L D V L K F G . . . . . . 21037 ATGGACCATCTTTTTACTATTTTTAGCTTCTTGAATACTCTTAGATTTAGAAATTCGAGG M D H L F T I F S F L N T L R F R N S R W T I F L L F L A S - I L L D L E I R G H G P S F Y Y F - L L E Y S - I - K F E . . . . . . 21097 ATAGATGTTCTTGGTGTGATGACTTACAGATTTTGGGAATAATAAGTATTTAACTTTAGA I D V L G V M T Y R F W E - - V F N F R - M F L V - - L T D F G N N K Y L T L D D R C S W C D D L Q I L G I I S I - L - . . . . . . 21157 TGTTGATTTAATTAATTTCGTAATGAGTTTTGGAGCTTCCGCATTATTTATATTATTTAT C - F N - F R N E F W S F R I I Y I I Y V D L I N F V M S F G A S A L F I L F I M L I - L I S - - V L E L P H Y L Y Y L . . . . . . 21217 AGTTGATATACTGGTAAATGTTGGGGTTTAGATTTGTTGGTTCGCTCACCTAGGAGAGTA S - Y T G K C W G L D L L V R S P R R V V D I L V N V G V - I C W F A H L G E - - L I Y W - M L G F R F V G S L T - E S . . . . . . 21277 AGGGTGGGTGCCACTTACGGATCGTTTTGGGTCGTGACAAACTTGGTATCAGAGCGTTAG R V G A T Y G S F W V V T N L V S E R - G W V P L T D R F G S - Q T W Y Q S V R K G G C H L R I V L G R D K L G I R A L . . . . . . 21337 GTTCGTTGGTCTCATCACACAAGAACAAGTCTAGTAGAGTCTTGAGGAACGGTAGGGGGA V R W S H H T R T S L V E S - G T V G G F V G L I T Q E Q V - - S L E E R - G D G S L V S S H K N K S S R V L R N G R G . . . . . . 21397 CGCCCTTACTTTTCTTCGAGAGGCTATAGGACTTTAGCAAAATTCCATTCTTTCCTTCTT R P Y F S S R G Y R T L A K F H S F L L A L T F L R E A I G L - Q N S I L S F F T P L L F F E R L - D F S K I P F F P S . . . . . . 21457 TCGTGCTATTACTTGGATCCAAGTGGTATCTAGGTGATACAAATTGGTATCTGACATCCT S C Y Y L D P S G I - V I Q I G I - H P R A I T W I Q V V S R - Y K L V S D I L F V L L L G S K W Y L G D T N W Y L T S . . . . . . 21517 CACTCTATTTCGCATATGGTTAGAACTAGAGAAACAACTGTGCCAACACCAACACCGGCA H S I S H M V R T R E T T V P T P T P A T L F R I W L E L E K Q L C Q H Q H R Q S L Y F A Y G - N - R N N C A N T N T G . . . . 21577 AGATAGCGTGCGTCTGAGCCAAACATTGGGGTTGT R - R A S E P N I G V D S V R L S Q T L G L K I A C V - A K H W G C Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-1 (-strand): . . . . . . 21611 ACAACCCCAATGTTTGGCTCAGACGCACGCTATCTTGCCGGTGTTGGTGTTGGCACAGTT T T P M F G S D A R Y L A G V G V G T V Q P Q C L A Q T H A I L P V L V L A Q L N P N V W L R R T L S C R C W C W H S . . . . . . 21551 GTTTCTCTAGTTCTAACCATATGCGAAATAGAGTGAGGATGTCAGATACCAATTTGTATC V S L V L T I C E I E - G C Q I P I C I F L - F - P Y A K - S E D V R Y Q F V S C F S S S N H M R N R V R M S D T N L Y . . . . . . 21491 ACCTAGATACCACTTGGATCCAAGTAATAGCACGAAAGAAGGAAAGAATGGAATTTTGCT T - I P L G S K - - H E R R K E W N F A P R Y H L D P S N S T K E G K N G I L L H L D T T W I Q V I A R K K E R M E F C . . . . . . 21431 AAAGTCCTATAGCCTCTCGAAGAAAAGTAAGGGCGTCCCCCTACCGTTCCTCAAGACTCT K V L - P L E E K - G R P P T V P Q D S K S Y S L S K K S K G V P L P F L K T L - S P I A S R R K V R A S P Y R S S R L . . . . . . 21371 ACTAGACTTGTTCTTGTGTGATGAGACCAACGAACCTAACGCTCTGATACCAAGTTTGTC T R L V L V - - D Q R T - R S D T K F V L D L F L C D E T N E P N A L I P S L S Y - T C S C V M R P T N L T L - Y Q V C . . . . . . 21311 ACGACCCAAAACGATCCGTAAGTGGCACCCACCCTTACTCTCCTAGGTGAGCGAACCAAC T T Q N D P - V A P T L T L L G E R T N R P K T I R K W H P P L L S - V S E P T H D P K R S V S G T H P Y S P R - A N Q . . . . . . 21251 AAATCTAAACCCCAACATTTACCAGTATATCAACTATAAATAATATAAATAATGCGGAAG K S K P Q H L P V Y Q L - I I - I M R K N L N P N I Y Q Y I N Y K - Y K - C G S Q I - T P T F T S I S T I N N I N N A E . . . . . . 21191 CTCCAAAACTCATTACGAAATTAATTAAATCAACATCTAAAGTTAAATACTTATTATTCC L Q N S L R N - L N Q H L K L N T Y Y S S K T H Y E I N - I N I - S - I L I I P A P K L I T K L I K S T S K V K Y L L F . . . . . . 21131 CAAAATCTGTAAGTCATCACACCAAGAACATCTATCCTCGAATTTCTAAATCTAAGAGTA Q N L - V I T P R T S I L E F L N L R V K I C K S S H Q E H L S S N F - I - E Y P K S V S H H T K N I Y P R I S K S K S . . . . . . 21071 TTCAAGAAGCTAAAAATAGTAAAAAGATGGTCCATGTCCGAACTTCAAGACATCAAGACG F K K L K I V K R W S M S E L Q D I K T S R S - K - - K D G P C P N F K T S R R I Q E A K N S K K M V H V R T S R H Q D . . . . . . 21011 TGAAGGAGAGAATCCAGCACGAGCTAGGAATAATAGCTCACCCTGAATTCTGATATGCTA - R R E S S T S - E - - L T L N S D M L E G E N P A R A R N N S S P - I L I C - V K E R I Q H E L G I I A H P E F - Y A . . . . . . 20951 AAGACCGGCTAGATCTGATGACGAGTCGAAGTCGATGGCACGCTTGCTGCACTCCACAAA K T G - I - - R V E V D G T L A A L H K R P A R S D D E S K S M A R L L H S T N K D R L D L M T S R S R W H A C C T P Q . . . . . . 20891 TAACAAAGAAGAAAATTACAAGTAGGGGTCAGTACAAGGAACACGTACTGAGTAGGTATC - Q R R K L Q V G V S T R N T Y - V G I N K E E N Y K - G S V Q G T R T E - V S I T K K K I T S R G Q Y K E H V L S R Y . . . . . . 20831 ATCGGCCAACTCAAAATAGAAAACAATATATACTGAATAATAATATAAAATCAACCATAA I G Q L K I E N N I Y - I I I - N Q P - S A N S K - K T I Y T E - - Y K I N H N H R P T Q N R K Q Y I L N N N I K S T I . . . . . . 20771 TACTTAACAGGTGACAATCAACAAGTATAAGAACCATTGACAACAACAGCAAGCACATCT Y L T G D N Q Q V - E P L T T T A S T S T - Q V T I N K Y K N H - Q Q Q Q A H L I L N R - Q S T S I R T I D N N S K H I . . . . . . 20711 ATGAGGACTCAAGCCTCCACACCATACTCATTTGGGAAATAGGTTCTTTGAATTTGAGTA M R T Q A S T P Y S F G K - V L - I - V - G L K P P H H T H L G N R F F E F E Y Y E D S S L H T I L I W E I G S L N L S . . . . . . 20651 CATTAACATAATTCAAGATTCATTCTCTTTATCATTATCGTGTCGGAACGTTACACCCGA H - H N S R F I L F I I I V S E R Y T R I N I I Q D S F S L S L S C R N V T P D T L T - F K I H S L Y H Y R V G T L H P . . . . . . 20591 TCCCCTACTACTACCGTGTCGGAACGTGACACTCTGATCCCCTAATACTACCGTGTCAGA S P T T T V S E R D T L I P - Y Y R V R P L L L P C R N V T L - S P N T T V S E I P Y Y Y R V G T - H S D P L I L P C Q . . . . . . 20531 ACGTGACACCCGATCCCCTAATACTACCGTGTCAGAATGTGACACTCCGATCCCCTAATA T - H P I P - Y Y R V R M - H S D P L I R D T R S P N T T V S E C D T P I P - Y N V T P D P L I L P C Q N V T L R S P N . . . . 20471 CTACCGTGTCGAAACATGACACCCAATCCATTTAT L P C R N M T P N P F Y R V E T - H P I H L T T V S K H D T Q S I Y Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-1-_PGL-6_AGS-1_PPS_1 (21243 20959) (frame '0'; 282 bp, 94 residues) 1 TPTFTSISTI NNINNAEAPK LITKLIKSTS KVKYLLFPKS VSHHTKNIYP RISKSKSIQE 61 AKNSKKMVHV RTSRHQDVKE RIQHELGIIA HPEF- >C06HBa0153O03.1-1-_PGL-6_AGS-1_PPS_2 (21520 21266) (frame '2'; 252 bp, 84 residues) 1 SEDVRYQFVS PRYHLDPSNS TKEGKNGILL KSYSLSKKSK GVPLPFLKTL LDLFLCDETN 61 EPNALIPSLS RPKTIRKWHP PLLS- >C06HBa0153O03.1-1-_PGL-6_AGS-1_PPS_3 (20958 20758) (frame '0'; 198 bp, 66 residues) 1 YAKDRLDLMT SRSRWHACCT PQITKKKITS RGQYKEHVLS RYHRPTQNRK QYILNNNIKS 61 TIILNR- AGS-2 (21698 22692,22729 24366) SCR (e 0.838 d 0.000 a 0.000,e 0.845) Exon 1 21698 22692 ( 995 n); score: 0.838 Intron 1 22693 22728 ( 36 n); Pd: 0.000 Pa: 0.000 Exon 2 22729 24366 (1638 n); score: 0.845 PGS (21698 22410) SGN-E356614- PGS (21744 22407) SGN-E352401- PGS (22364 22692,22729 23161) SGN-E214046- PGS (22571 22692,22729 23163) SGN-E244046- PGS (23006 23547) SGN-E355026- PGS (23014 23775) SGN-E355244- PGS (23042 23372) SGN-E352716- PGS (23072 23732) SGN-E352117+ PGS (23116 23775) SGN-E351414- PGS (23186 23844) SGN-E355232- PGS (23615 24295) SGN-E368762- PGS (23657 24366) SGN-E379315- PGS (23769 24366) SGN-E375319- PGS (23839 24366) SGN-E204434- 3-phase translation of AGS-2 (+strand): . . . . . . 21698 CTAGTAATAGGGTGGTGACTCCTCCACCGACTGATGAGGTAGTAAGAGAGGGTGAGGAAG L V I G W - L L H R L M R - - E R V R K - - - G G D S S T D - - G S K R G - G R S N R V V T P P P T D E V V R E G E E . . . . . . 21758 GGGAAAATAAACAGGTGCAAGATGAGGAATTACCACCCCAACCTACCCCAGAGATGATCA G K I N R C K M R N Y H P N L P Q R - S G K - T G A R - G I T T P T Y P R D D Q G E N K Q V Q D E E L P P Q P T P E M I . . . . . . 21818 ACCAGGTTCTTACTTATCTTAGCGGGTTATCTGATCGAGGCCAGACACCTCCAGTGTTTC T R F L L I L A G Y L I E A R H L Q C F P G S Y L S - R V I - S R P D T S S V S N Q V L T Y L S G L S D R G Q T P P V F . . . . . . 21878 TTGTACCAGCACCTCAGGTTCCAGGAGTACAACATGCAACTGTTGTGGCTCCCCGCATGG L Y Q H L R F Q E Y N M Q L L W L P A W C T S T S G S R S T T C N C C G S P H G L V P A P Q V P G V Q H A T V V A P R M . . . . . . 21938 ATGCCTCATTGGAAGTAGGCACGTTTCCTCGATTGACTACAGGGTCTATAATGACAAGTG M P H W K - A R F L D - L Q G L - - Q V C L I G S R H V S S I D Y R V Y N D K - D A S L E V G T F P R L T T G S I M T S . . . . . . 21998 ATCAACATGAACTTTTCACTAAATTCTTAAAGTTGAAACCTCCTGTCTTCAAGGGTGCTA I N M N F S L N S - S - N L L S S R V L S T - T F H - I L K V E T S C L Q G C - D Q H E L F T K F L K L K P P V F K G A . . . . . . 22058 AATCTGAGGATGCCTATGATTTTCTGGTTGATTGTCATGAGCTGCTACATAAGATGGACA N L R M P M I F W L I V M S C Y I R W T I - G C L - F S G - L S - A A T - D G H K S E D A Y D F L V D C H E L L H K M D . . . . . . 22118 TAGTAGAACGATTCGGTGTTGATTTTGTGACCTACCAGTTTCAGGGGAATGCCAAAATGT - - N D S V L I L - P T S F R G M P K C S R T I R C - F C D L P V S G E C Q N V I V E R F G V D F V T Y Q F Q G N A K M . . . . . . 22178 GGTGGCGGTCGTATGTTGAGTGTCAACCAGCACAGGCACCACCTATGACTTGGGAATCAT G G G R M L S V N Q H R H H L - L G N H V A V V C - V S T S T G T T Y D L G I I W W R S Y V E C Q P A Q A P P M T W E S . . . . . . 22238 TCTCTAGCTTATTTATGGAGAAGTATATACCCCGGACTTTGAGGGATAGGAGGAGAGATG S L A Y L W R S I Y P G L - G I G G E M L - L I Y G E V Y T P D F E G - E E R - F S S L F M E K Y I P R T L R D R R R D . . . . . . 22298 AGTTCTTGAGCCTATAGCAAGGAAGGATGTCTGTTGCCGCTTATGAGGCCAAATTTCGTG S S - A Y S K E G C L L P L M R P N F V V L E P I A R K D V C C R L - G Q I S C E F L S L - Q G R M S V A A Y E A K F R . . . . . . 22358 CGCTATCCAGGTATGCCACCCAGCTTTGCTTCAGTCCATAAGAGCGGATTCGCCGCTTTG R Y P G M P P S F A S V H K S G F A A L A I Q V C H P A L L Q S I R A D S P L C A L S R Y A T Q L C F S P - E R I R R F . . . . . . 22418 TGAAAGGATTGAGGTTAGATTTGCAGATCCCAGTTACAGGTAGCTGCCGCAGCAAAATCC - K D - G - I C R S Q L Q V A A A A K S E R I E V R F A D P S Y R - L P Q Q N P V K G L R L D L Q I P V T G S C R S K I . . . . . . 22478 TTTCAGGAAGTGGTTGACTTTGTGATTGAGGTGGAGGGGGTGAAGCCAGACAACTTCACC F Q E V V D F V I E V E G V K P D N F T F R K W L T L - L R W R G - S Q T T S P L S G S G - L C D - G G G G E A R Q L H . . . . . . 22538 ATGGTGTCGACATCTAAGAAGTTCCGTACGGGAGGTGAGTTTAGTGGTTCTTACTCCAGA M V S T S K K F R T G G E F S G S Y S R W C R H L R S S V R E V S L V V L T P E H G V D I - E V P Y G R - V - W F L L Q . . . . . . 22598 GGGCAGAGTTCAGGAGGTTACCCAGCCTGACCTATTCAGTCGTCACTACAGGCTGTAGCT G Q S S G G Y P A - P I Q S S L Q A V A G R V Q E V T Q P D L F S R H Y R L - L R A E F R R L P S L T Y S V V T T G C S . . . . : . . 22658 GGGGGTCCATCGCAGACCAGTCAACATTTCTCTGA : GAGACCTATGCTTGACTCCAGAGAT G G P S Q T S Q H F S E : R P M L D S R D G V H R R P V N I S L : R D L C L T P E I W G S I A D Q S T F L - : E T Y A - L Q R . . . . . . 22754 TGTAGTGGATGTGGAGAGACTGGACATATTAGGAGGTATTGTCCAAAATAGAGTTACAGA C S G C G E T G H I R R Y C P K - S Y R V V D V E R L D I L G G I V Q N R V T D L - W M W R D W T Y - E V L S K I E L Q . . . . . . 22814 CCCCCAATAATTAGAGGTAGAGGAAATCATGGGAGAGGCCGCCATTATGGAGGACGTGGT P P I I R G R G N H G R G R H Y G G R G P Q - L E V E E I M G E A A I M E D V V T P N N - R - R K S W E R P P L W R T W . . . . . . 22874 GGCCAAGGTAATGGTGGTCACCAAATCAGCCGGGGTGGCGGGCAAGTTGGAACTACTGCA G Q G N G G H Q I S R G G G Q V G T T A A K V M V V T K S A G V A G K L E L L Q W P R - W W S P N Q P G W R A S W N Y C . . . . . . 22934 GCACAACATGGTAAGGGCAACGGGCAGACAGGTGATAGGGCCCATTGTTATGATTTCCCC A Q H G K G N G Q T G D R A H C Y D F P H N M V R A T G R Q V I G P I V M I S P S T T W - G Q R A D R - - G P L L - F P . . . . . . 22994 GAGAGGTTTGAAGCAGAGACATCTGATGCTGTTATCACAGGTAATCTTTTGGTTTGTGAT E R F E A E T S D A V I T G N L L V C D R G L K Q R H L M L L S Q V I F W F V I R E V - S R D I - C C Y H R - S F G L - . . . . . . 23054 TGCATGGCTTCTGTATTATTTGATCCTGGATCCACATTTTCATATGTATCTTCCTCATTT C M A S V L F D P G S T F S Y V S S S F A W L L Y Y L I L D P H F H M Y L P H L L H G F C I I - S W I H I F I C I F L I . . . . . . 23114 GTTACTGGTCTTGATTTACATTGTGACTTGCTTGACATGCCTATTCGTGTCTTTACTCCT V T G L D L H C D L L D M P I R V F T P L L V L I Y I V T C L T C L F V S L L L C Y W S - F T L - L A - H A Y S C L Y S . . . . . . 23174 GTGGGTGAGTCTGTGATAGTTGATAAGGTGTATAGGTCTTGTCTTGTGGTTTTTATGGGG V G E S V I V D K V Y R S C L V V F M G W V S L - - L I R C I G L V L W F L W G C G - V C D S - - G V - V L S C G F Y G . . . . . . 23234 AGCAATACTCATTTAGATTTGATTATTCTAGAGATGGTTGATTTCGATGTAATTTTGGGT S N T H L D L I I L E M V D F D V I L G A I L I - I - L F - R W L I S M - F W V E Q Y S F R F D Y S R D G - F R C N F G . . . . . . 23294 ATGACTTGGCTTTCTCCAAACTTTGCAATCTTAGATTGTAACGCTAAAACTGTGACATTG M T W L S P N F A I L D C N A K T V T L - L G F L Q T L Q S - I V T L K L - H - Y D L A F S K L C N L R L - R - N C D I . . . . . . 23354 ACCAAGCCTGGGACAGATCCGCTAGTATGGGAGGGTGACTATATTTCCACCCTAGTTCAT T K P G T D P L V W E G D Y I S T L V H P S L G Q I R - Y G R V T I F P P - F I D Q A W D R S A S M G G - L Y F H P S S . . . . . . 23414 ATTATCTCTTTTCTTCGTGCTAAGAGGATGGTTAGTAGGGGTTGTTTAGCTTTCTTGGCC I I S F L R A K R M V S R G C L A F L A L S L F F V L R G W L V G V V - L S W P Y Y L F S S C - E D G - - G L F S F L G . . . . . . 23474 CATCTCAGGGATGATACTTCCAAGGTACCTTCGATTGAGTCTGTTTCGATAGTCTGTGAG H L R D D T S K V P S I E S V S I V C E I S G M I L P R Y L R L S L F R - S V S P S Q G - Y F Q G T F D - V C F D S L - . . . . . . 23534 TTTCTGGATGTGTTTCCTGCAGACCTTCCTGGTATGCCACCAGATAGGGATATTGATTTT F L D V F P A D L P G M P P D R D I D F F W M C F L Q T F L V C H Q I G I L I F V S G C V S C R P S W Y A T R - G Y - F . . . . . . 23594 TGTATTGATCTCGAGCCGGGTACTCGCCCCATTTCCATACCCCCTTATAGAATGACCCTA C I D L E P G T R P I S I P P Y R M T L V L I S S R V L A P F P Y P L I E - P Y L Y - S R A G Y S P H F H T P L - N D P . . . . . . 23654 TCTGAGTTAAGGGAGTTAAAGGCCCAACTTCAGGAGTTGTTAGGTAAAGACTTTACTAGA S E L R E L K A Q L Q E L L G K D F T R L S - G S - R P N F R S C - V K T L L D I - V K G V K G P T S G V V R - R L Y - . . . . . . 23714 CCAAGTTCATCCCCTTGGGGTGCTCCTGTTTTATTTGTGAAGAAGAAGGATGGAAGTTTT P S S S P W G A P V L F V K K K D G S F Q V H P L G V L L F Y L - R R R M E V F T K F I P L G C S C F I C E E E G W K F . . . . . . 23774 CGGATGTGCATAGACTACAGGCAACTGAATAAGGTAACTATTAAGAACAAGTATCCTCTT R M C I D Y R Q L N K V T I K N K Y P L G C A - T T G N - I R - L L R T S I L F S D V H R L Q A T E - G N Y - E Q V S S . . . . . . 23834 CCTCGCATCGATGATTTGTTCGATCAGTTACAAGGTGCTTGTATCTTTTCAAAAATCGAT P R I D D L F D Q L Q G A C I F S K I D L A S M I C S I S Y K V L V S F Q K S I S S H R - F V R S V T R C L Y L F K N R . . . . . . 23894 TTGAGATCTAGTTATCATGAATTGAAAATACGGGCAGCAGATGTGCCAAAGGCTGTGTTT L R S S Y H E L K I R A A D V P K A V F - D L V I M N - K Y G Q Q M C Q R L C F F E I - L S - I E N T G S R C A K G C V . . . . . . 23954 CGAACCAGGTATGGGCATTATGAATTCTTAGTAATGTCTTTTGGGCTTACGAATGCCTCT R T R Y G H Y E F L V M S F G L T N A S E P G M G I M N S - - C L L G L R M P L S N Q V W A L - I L S N V F W A Y E C L . . . . . . 24014 TCTGCGTTCATGAGCCTGATGAACAGGATTTTTAAGCCATATCTGGATCTGTTTGTTATT S A F M S L M N R I F K P Y L D L F V I L R S - A - - T G F L S H I W I C L L L F C V H E P D E Q D F - A I S G S V C Y . . . . . . 24074 GTATTTATTGATGATATACTGATATACTCAAAGAGCAGAAAAGAACATGGGGAGTATTTG V F I D D I L I Y S K S R K E H G E Y L Y L L M I Y - Y T Q R A E K N M G S I - C I Y - - Y T D I L K E Q K R T W G V F . . . . . . 24134 AAAATTGTTATGGAATTGTTGAGAGAGAAAAAGGCTTTATGCCAAATTCTCCAAGTGTGA K I V M E L L R E K K A L C Q I L Q V - K L L W N C - E R K R L Y A K F S K C E E N C Y G I V E R E K G F M P N S P S V . . . . . . 24194 GTTTTGGCTAGATTCAGTGTCCTTCTTGGGGCATGTTGGTTTCCAAGGATGGAGTGATGG V L A R F S V L L G A C W F P R M E - W F W L D S V S F L G H V G F Q G W S D G S F G - I Q C P S W G M L V S K D G V M . . . . . . 24254 TGGATCCATCTAATATTAAAGTAGTGAAGAATTGGGTAAGACCTACTAATGTTACAGAGG W I H L I L K - - R I G - D L L M L Q R G S I - Y - S S E E L G K T Y - C Y R G V D P S N I K V V K N W V R P T N V T E . . . . . . 24314 TAAGGAGCGTTTTTTGGTTTAGCTAGCTCCTACCGTCGATTTGTCAAGGGATT - G A F F G L A S S Y R R F V K G K E R F L V - L A P T V D L S R D V R S V F W F S - L L P S I C Q G I Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-1+_PGL-6_AGS-2_PPS_1 (22805 24193) (frame '1'; 1386 bp, 462 residues) 1 SYRPPIIRGR GNHGRGRHYG GRGGQGNGGH QISRGGGQVG TTAAQHGKGN GQTGDRAHCY 61 DFPERFEAET SDAVITGNLL VCDCMASVLF DPGSTFSYVS SSFVTGLDLH CDLLDMPIRV 121 FTPVGESVIV DKVYRSCLVV FMGSNTHLDL IILEMVDFDV ILGMTWLSPN FAILDCNAKT 181 VTLTKPGTDP LVWEGDYIST LVHIISFLRA KRMVSRGCLA FLAHLRDDTS KVPSIESVSI 241 VCEFLDVFPA DLPGMPPDRD IDFCIDLEPG TRPISIPPYR MTLSELRELK AQLQELLGKD 301 FTRPSSSPWG APVLFVKKKD GSFRMCIDYR QLNKVTIKNK YPLPRIDDLF DQLQGACIFS 361 KIDLRSSYHE LKIRAADVPK AVFRTRYGHY EFLVMSFGLT NASSAFMSLM NRIFKPYLDL 421 FVIVFIDDIL IYSKSRKEHG EYLKIVMELL REKKALCQIL QV- >C06HBa0153O03.1-1+_PGL-6_AGS-2_PPS_2 (21700 22314) (frame '0'; 612 bp, 204 residues) 1 SNRVVTPPPT DEVVREGEEG ENKQVQDEEL PPQPTPEMIN QVLTYLSGLS DRGQTPPVFL 61 VPAPQVPGVQ HATVVAPRMD ASLEVGTFPR LTTGSIMTSD QHELFTKFLK LKPPVFKGAK 121 SEDAYDFLVD CHELLHKMDI VERFGVDFVT YQFQGNAKMW WRSYVECQPA QAPPMTWESF 181 SSLFMEKYIP RTLRDRRRDE FLSL- AGS-3 (24395 25265) SCR (e 0.799) Exon 1 24395 25265 ( 871 n); score: 0.799 PGS (24395 24839) SGN-E352647- PGS (24395 24839) SGN-E352950- PGS (24395 24839) SGN-E357100- PGS (24743 25265) SGN-E353207- PGS (25010 25235) SGN-E578131- 3-phase translation of AGS-3 (+strand): . . . . . . 24395 TTGACTAAGAAAGATTTGAATTTGAGGCAGTGAAGGTAGATGGAACTACTGAAGGACTAT L T K K D L N L R Q - R - M E L L K D Y - L R K I - I - G S E G R W N Y - R T M D - E R F E F E A V K V D G T T E G L . . . . . . 24455 GATATTACTATTTTGTATCACCCAGGAAAAGCTAATGTTGTGGCAGACGCTTTAAGTAGA D I T I L Y H P G K A N V V A D A L S R I L L F C I T Q E K L M L W Q T L - V E - Y Y Y F V S P R K S - C C G R R F K - . . . . . . 24515 AAAGCAGGGAGCAGGGGAAGCCTAGCCCACTTACAGGTTTCTAGGCGCCCATTGGCTAGA K A G S R G S L A H L Q V S R R P L A R K Q G A G E A - P T Y R F L G A H W L E K S R E Q G K P S P L T G F - A P I G - . . . . . . 24575 GAGGTTTAGACCCTGGTTAATGACTTTATGAGGCTGGAAGTACTAGAGAAGGGAGGATTT E V - T L V N D F M R L E V L E K G G F R F R P W L M T L - G W K Y - R R E D F R G L D P G - - L Y E A G S T R E G R I . . . . . . 24635 TTGGCTTGTGTGGAGGCAAGATCTTCTTTTCTTGACAAGATTAAGGGAAAACAGTTTACT L A C V E A R S S F L D K I K G K Q F T W L V W R Q D L L F L T R L R E N S L L F G L C G G K I F F S - Q D - G K T V Y . . . . . . 24695 GACGAGAAGCTGAGCCGAATTCGAGATATGGTATTACGAGGAGAGGCTAAAGAGGCAATA D E K L S R I R D M V L R G E A K E A I T R S - A E F E I W Y Y E E R L K R Q - - R E A E P N S R Y G I T R R G - R G N . . . . . . 24755 ATTGACGAGGAAGGTGTTTTGAGAATTAAGGGAAGGATATGTGTGCCCCGTGTTGATAAT I D E E G V L R I K G R I C V P R V D N L T R K V F - E L R E G Y V C P V L I I N - R G R C F E N - G K D M C A P C - - . . . . . . 24815 TTGATTCACACTATTCTTACAGAGGCTCATAGTTCAAGGTATTATATACATCCTGGTGCA L I H T I L T E A H S S R Y Y I H P G A - F T L F L Q R L I V Q G I I Y I L V Q F D S H Y S Y R G S - F K V L Y T S W C . . . . . . 24875 ACCAAGATGTATCGTGACCTAAAGAAACATTTCTGGTAGAGTAGAATGAAGTGTGACATT T K M Y R D L K K H F W - S R M K C D I P R C I V T - R N I S G R V E - S V T L N Q D V S - P K E T F L V E - N E V - H . . . . . . 24935 GTTAATTTTGTTGCCCAATGCCCGAATTGTCAGCAGGTAAAGTATGACCACCAGAGGCCC V N F V A Q C P N C Q Q V K Y D H Q R P L I L L P N A R I V S R - S M T T R G P C - F C C P M P E L S A G K V - P P E A . . . . . . 24995 GGAGGAACACTTCAGAAAATCGGGAAAGAATTGCAATGGATTTTGTGGTTGGTCTTCCCA G G T L Q K I G K E L Q W I L W L V F P E E H F R K S G K N C N G F C G W S S Q R R N T S E N R E R I A M D F V V G L P . . . . . . 25055 AGACATTGGTTAAGTTCGATTCTATTTGGGTAATTGTTGACAGATTAACTAAGTTTGCTC R H W L S S I L F G - L L T D - L S L L D I G - V R F Y L G N C - Q I N - V C S K T L V K F D S I W V I V D R L T K F A . . . . . . 25115 ACTTCATTCCGATCAAGGTGACTTACAATGCAGAGAAGTTAACCAAACTCTATATCTCAG T S F R S R - L T M Q R S - P N S I S Q L H S D Q G D L Q C R E V N Q T L Y L R H F I P I K V T Y N A E K L T K L Y I S . . . . . . 25175 AAATTGCTCGATTGCATGGAGTTCCACTCTCCATCATATCAGATAGAGGTACGCAATTTA K L L D C M E F H S P S Y Q I E V R N L N C S I A W S S T L H H I R - R Y A I Y E I A R L H G V P L S I I S D R G T Q F . . . . 25235 CTTCTAAGTTTTGGAGAACATTGCATGCTGA L L S F G E H C M L F - V L E N I A C - T S K F W R T L H A Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-1+_PGL-6_AGS-3_PPS_1 (24584 24913) (frame '1'; 327 bp, 109 residues) 1 TLVNDFMRLE VLEKGGFLAC VEARSSFLDK IKGKQFTDEK LSRIRDMVLR GEAKEAIIDE 61 EGVLRIKGRI CVPRVDNLIH TILTEAHSSR YYIHPGATKM YRDLKKHFW- >C06HBa0153O03.1-1+_PGL-6_AGS-3_PPS_2 (24982 25263) (frame '0'; 282 bp, 94 residues) 1 PPEARRNTSE NRERIAMDFV VGLPKTLVKF DSIWVIVDRL TKFAHFIPIK VTYNAEKLTK 61 LYISEIARLH GVPLSIISDR GTQFTSKFWR TLHA 3-phase translation of AGS-3 (-strand): . . . . . . 25265 TCAGCATGCAATGTTCTCCAAAACTTAGAAGTAAATTGCGTACCTCTATCTGATATGATG S A C N V L Q N L E V N C V P L S D M M Q H A M F S K T - K - I A Y L Y L I - W S M Q C S P K L R S K L R T S I - Y D . . . . . . 25205 GAGAGTGGAACTCCATGCAATCGAGCAATTTCTGAGATATAGAGTTTGGTTAACTTCTCT E S G T P C N R A I S E I - S L V N F S R V E L H A I E Q F L R Y R V W L T S L G E W N S M Q S S N F - D I E F G - L L . . . . . . 25145 GCATTGTAAGTCACCTTGATCGGAATGAAGTGAGCAAACTTAGTTAATCTGTCAACAATT A L - V T L I G M K - A N L V N L S T I H C K S P - S E - S E Q T - L I C Q Q L C I V S H L D R N E V S K L S - S V N N . . . . . . 25085 ACCCAAATAGAATCGAACTTAACCAATGTCTTGGGAAGACCAACCACAAAATCCATTGCA T Q I E S N L T N V L G R P T T K S I A P K - N R T - P M S W E D Q P Q N P L Q Y P N R I E L N Q C L G K T N H K I H C . . . . . . 25025 ATTCTTTCCCGATTTTCTGAAGTGTTCCTCCGGGCCTCTGGTGGTCATACTTTACCTGCT I L S R F S E V F L R A S G G H T L P A F F P D F L K C S S G P L V V I L Y L L N S F P I F - S V P P G L W W S Y F T C . . . . . . 24965 GACAATTCGGGCATTGGGCAACAAAATTAACAATGTCACACTTCATTCTACTCTACCAGA D N S G I G Q Q N - Q C H T S F Y S T R T I R A L G N K I N N V T L H S T L P E - Q F G H W A T K L T M S H F I L L Y Q . . . . . . 24905 AATGTTTCTTTAGGTCACGATACATCTTGGTTGCACCAGGATGTATATAATACCTTGAAC N V S L G H D T S W L H Q D V Y N T L N M F L - V T I H L G C T R M Y I I P - T K C F F R S R Y I L V A P G C I - Y L E . . . . . . 24845 TATGAGCCTCTGTAAGAATAGTGTGAATCAAATTATCAACACGGGGCACACATATCCTTC Y E P L - E - C E S N Y Q H G A H I S F M S L C K N S V N Q I I N T G H T Y P S L - A S V R I V - I K L S T R G T H I L . . . . . . 24785 CCTTAATTCTCAAAACACCTTCCTCGTCAATTATTGCCTCTTTAGCCTCTCCTCGTAATA P - F S K H L P R Q L L P L - P L L V I L N S Q N T F L V N Y C L F S L S S - Y P L I L K T P S S S I I A S L A S P R N . . . . . . 24725 CCATATCTCGAATTCGGCTCAGCTTCTCGTCAGTAAACTGTTTTCCCTTAATCTTGTCAA P Y L E F G S A S R Q - T V F P - S C Q H I S N S A Q L L V S K L F S L N L V K T I S R I R L S F S S V N C F P L I L S . . . . . . 24665 GAAAAGAAGATCTTGCCTCCACACAAGCCAAAAATCCTCCCTTCTCTAGTACTTCCAGCC E K K I L P P H K P K I L P S L V L P A K R R S C L H T S Q K S S L L - Y F Q P R K E D L A S T Q A K N P P F S S T S S . . . . . . 24605 TCATAAAGTCATTAACCAGGGTCTAAACCTCTCTAGCCAATGGGCGCCTAGAAACCTGTA S - S H - P G S K P L - P M G A - K P V H K V I N Q G L N L S S Q W A P R N L - L I K S L T R V - T S L A N G R L E T C . . . . . . 24545 AGTGGGCTAGGCTTCCCCTGCTCCCTGCTTTTCTACTTAAAGCGTCTGCCACAACATTAG S G L G F P C S L L F Y L K R L P Q H - V G - A S P A P C F S T - S V C H N I S K W A R L P L L P A F L L K A S A T T L . . . . . . 24485 CTTTTCCTGGGTGATACAAAATAGTAATATCATAGTCCTTCAGTAGTTCCATCTACCTTC L F L G D T K - - Y H S P S V V P S T F F S W V I Q N S N I I V L Q - F H L P S A F P G - Y K I V I S - S F S S S I Y L . . . . 24425 ACTGCCTCAAATTCAAATCTTTCTTAGTCAA T A S N S N L S - S L P Q I Q I F L S Q H C L K F K S F L V Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-1-_PGL-6_AGS-3_PPS_1 (24819 24580) (frame '0'; 237 bp, 79 residues) 1 IKLSTRGTHI LPLILKTPSS SIIASLASPR NTISRIRLSF SSVNCFPLIL SRKEDLASTQ 61 AKNPPFSSTS SLIKSLTRV- PGL 7 (+ strand): 25888 26462 AGS-1 (25888 26232,26350 26462) SCR (e 0.857 d 0.000 a 0.000,e 0.757) Exon 1 25888 26232 ( 345 n); score: 0.857 Intron 1 26233 26349 ( 117 n); Pd: 0.000 Pa: 0.000 Exon 2 26350 26462 ( 113 n); score: 0.757 PGS (25888 26232,26350 26462) SGN-E252199+ 3-phase translation of AGS-1 (+strand): . . . . . . 25888 CTTGATGAGAATTTGTCTTATGAGGAGGAGCCTATTGCTATTTTAGGTTAGAGAGGTCTG L D E N L S Y E E E P I A I L G - R G L L M R I C L M R R S L L L F - V R E V C - - E F V L - G G A Y C Y F R L E R S . . . . . . 25948 CAAGTTGAGATCAAAGGAGATTGCATCTATCAAGGTTCGGTGGAAGAATCGGCCAATTGA Q V E I K G D C I Y Q G S V E E S A N - K L R S K E I A S I K V R W K N R P I E A S - D Q R R L H L S R F G G R I G Q L . . . . . . 26008 AGAGTCCACTTGGGAGAATGAGGCCGATATGTGAAAAAGATATCCACATCTTTTTATAGA R V H L G E - G R Y V K K I S T S F Y R E S T W E N E A D M - K R Y P H L F I D K S P L G R M R P I C E K D I H I F L - . . . . . . 26068 TTCAGGTACTCTTTCTCGCCCTTGCTTTTCTTCTTGTGATCGTTCGGGGACGAACGATGG F R Y S F S P L L F F L - S F G D E R W S G T L S R P C F S S C D R S G T N D G I Q V L F L A L A F L L V I V R G R T M . . . . . . 26128 GTAAATTGGTATCTATTGTAACGACTTGTTTAGTCGGTTCGAGCAGTAGAACTATTTTTG V N W Y L L - R L V - S V R A V E L F L - I G I Y C N D L F S R F E Q - N Y F - G K L V S I V T T C L V G S S S R T I F . . . . . : . 26188 ATAAAAACTGACTGGGTCGACGGATCACGCGACAGACCGTCATGG : GCGTCTCCGTTCCAA I K T D W V D G S R D R P S W : A S P F Q - K L T G S T D H A T D R H G : R L R S K D K N - L G R R I T R Q T V M : G V S V P . . . . . . 26365 AACACTTCAACTCTGAAAATCTGGGTACTGGGAGCGACTCTCTGAAATCCGCGATGGAAC N T S T L K I W V L G A T L - N P R W N T L Q L - K S G Y W E R L S E I R D G T K H F N S E N L G T G S D S L K S A M E . . . . 26425 TGCAGCATGGACCGTCGTAGACACGACGGACCGTCTCA C S M D R R R H D G P S A A W T V V D T T D R L L Q H G P S - T R R T V S Maximal non-overlapping open reading frames (>= 64 codons): none PGL 8 (- strand): 27539 27019 AGS-1 (27539 27019) SCR (e 0.820) Exon 1 27539 27019 ( 521 n); score: 0.820 PGS (27539 27019) SGN-E241789+ PGS (27420 27097) SGN-E246710- PGS (27471 27225) SGN-E353206- 3-phase translation of AGS-1 (-strand): . . . . . . 27539 ATGTCCGAACTTAAAGACATCAAGACCTGAAGGAGAGAATCCAGCACGAGCTAGGAATAA M S E L K D I K T - R R E S S T S - E - C P N L K T S R P E G E N P A R A R N N V R T - R H Q D L K E R I Q H E L G I . . . . . . 27479 TAGCTCACCCTGAATTCTGATATGCTGAAAAATGGCTAGATCTGAGGACGAGTCAAAGTC - L T L N S D M L K N G - I - G R V K V S S P - I L I C - K M A R S E D E S K S I A H P E F - Y A E K W L D L R T S Q S . . . . . . 27419 GATGGCATGCTTGCTGCACTCCACAAATAACAAAGAAGAAAATTACAAGTAGGGGTCAGT D G M L A A L H K - Q R R K L Q V G V S M A C L L H S T N N K E E N Y K - G S V R W H A C C T P Q I T K K K I T S R G Q . . . . . . 27359 ACAAGGAACACGTACTGAGTAGGTATCATCAGCCAACTCAAAATAGAAAACAATATATAC T R N T Y - V G I I S Q L K I E N N I Y Q G T R T E - V S S A N S K - K T I Y T Y K E H V L S R Y H Q P T Q N R K Q Y I . . . . . . 27299 TGAATAATAATATAAAATCAACCATAATACTTAACAGGTGACAATCAACAAGTATAAGAA - I I I - N Q P - Y L T G D N Q Q V - E E - - Y K I N H N T - Q V T I N K Y K N L N N N I K S T I I L N R - Q S T S I R . . . . . . 27239 CCATTGACAACAACAGCAAGCACACCTATGAGGACTCAAGCCTCCACACCATACTCATTT P L T T T A S T P M R T Q A S T P Y S F H - Q Q Q Q A H L - G L K P P H H T H L T I D N N S K H T Y E D S S L H T I L I . . . . . . 27179 GGGAAATAGGTTCTTTGAATTTGAGTACATTAACATAATTCAAGATTCATTGTCTTTATC G K - V L - I - V H - H N S R F I V F I G N R F F E F E Y I N I I Q D S L S L S W E I G S L N L S T L T - F K I H C L Y . . . . . . 27119 ATTATCGTGTCGGAACGTGACACCCGATCCCCTAATACTACCGTGTTGGAACGTGACACT I I V S E R D T R S P N T T V L E R D T L S C R N V T P D P L I L P C W N V T L H Y R V G T - H P I P - Y Y R V G T - H . . . . . 27059 CCGACCCCTAATACTACCGTGTCGGAATGTGACACTCGATC P T P N T T V S E C D T R R P L I L P C R N V T L D S D P - Y Y R V G M - H S I Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-1-_PGL-8_AGS-1_PPS_1 (27459 27259) (frame '0'; 198 bp, 66 residues) 1 YAEKWLDLRT SQSRWHACCT PQITKKKITS RGQYKEHVLS RYHQPTQNRK QYILNNNIKS 61 TIILNR- 3-phase translation of AGS-1 (+strand): . . . . . . 27019 GATCGAGTGTCACATTCCGACACGGTAGTATTAGGGGTCGGAGTGTCACGTTCCAACACG D R V S H S D T V V L G V G V S R S N T I E C H I P T R - Y - G S E C H V P T R S S V T F R H G S I R G R S V T F Q H . . . . . . 27079 GTAGTATTAGGGGATCGGGTGTCACGTTCCGACACGATAATGATAAAGACAATGAATCTT V V L G D R V S R S D T I M I K T M N L - Y - G I G C H V P T R - - - R Q - I L G S I R G S G V T F R H D N D K D N E S . . . . . . 27139 GAATTATGTTAATGTACTCAAATTCAAAGAACCTATTTCCCAAATGAGTATGGTGTGGAG E L C - C T Q I Q R T Y F P N E Y G V E N Y V N V L K F K E P I S Q M S M V W R - I M L M Y S N S K N L F P K - V W C G . . . . . . 27199 GCTTGAGTCCTCATAGGTGTGCTTGCTGTTGTTGTCAATGGTTCTTATACTTGTTGATTG A - V L I G V L A V V V N G S Y T C - L L E S S - V C L L L L S M V L I L V D C G L S P H R C A C C C C Q W F L Y L L I . . . . . . 27259 TCACCTGTTAAGTATTATGGTTGATTTTATATTATTATTCAGTATATATTGTTTTCTATT S P V K Y Y G - F Y I I I Q Y I L F S I H L L S I M V D F I L L F S I Y C F L F V T C - V L W L I L Y Y Y S V Y I V F Y . . . . . . 27319 TTGAGTTGGCTGATGATACCTACTCAGTACGTGTTCCTTGTACTGACCCCTACTTGTAAT L S W L M I P T Q Y V F L V L T P T C N - V G - - Y L L S T C S L Y - P L L V I F E L A D D T Y S V R V P C T D P Y L - . . . . . . 27379 TTTCTTCTTTGTTATTTGTGGAGTGCAGCAAGCATGCCATCGACTTTGACTCGTCCTCAG F L L C Y L W S A A S M P S T L T R P Q F F F V I C G V Q Q A C H R L - L V L R F S S L L F V E C S K H A I D F D S S S . . . . . . 27439 ATCTAGCCATTTTTCAGCATATCAGAATTCAGGGTGAGCTATTATTCCTAGCTCGTGCTG I - P F F S I S E F R V S Y Y S - L V L S S H F S A Y Q N S G - A I I P S S C W D L A I F Q H I R I Q G E L L F L A R A . . . . . 27499 GATTCTCTCCTTCAGGTCTTGATGTCTTTAAGTTCGGACAT D S L L Q V L M S L S S D I L S F R S - C L - V R T G F S P S G L D V F K F G H Maximal non-overlapping open reading frames (>= 64 codons): none ... finished at: Mon Aug 28 22:16:33 2006 ________________________________________________________________________________ Sequence 2: C06HBa0153O03.1-2, from 1 to 11633, both strands analyzed. ... started at: Mon Aug 28 22:16:33 2006 EST library file: /tmp/cxgn-bacpublish-resources-T4dD4R/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 6 HitsTableSize = 49 EST library file: /tmp/cxgn-bacpublish-resources-T4dD4R/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 6 HitsTableSize = 94 ******************************************************************************** EST sequence 17 -strand 693 n (File: SGN-E556422-) 1 TTTTTTTTTT AAAAATCAAA ACACTTTCAT AATTTAACAA TATGTGTTGT TATGTCAATA 61 AATACTTCAA ACATCTTGTT TAACTCCAAA AAACAAAAAA AAATCAAACA TTGAAAACTA 121 ACTAGCCTAC ATTAACAATT TCATCTTCAG ATGATGGGTT TAATACATTC TTCATTCTTG 181 AAGGGTCATC ACTGTTGCTA ACATATCCAG CATTCATCTT GTCGAGACCA TACTTCAATA 241 AAAGAATCGC ATATCTTTTG CGAAGATAGC TACTCTTAAA ACTATTACAT GACATGTTGA 301 TTTTGTCACT CAGAATTCTG CATATACAAT TACGAATAGT CCGCAATCAA GGCTATCACT 361 TATTTGTTGC ATGTTGTCTT GAGCAAACTC CACATTAAAC GAATGTTGTG GACCCAAAAG 421 TTCTCCAGTT TTCTTGTCTT TATATGCATC CAATGCTGCC CAATCTTTCA AGAATATTGG 481 ACGAAAATGG CGGACAGAGT AGACGAAATT GGCGATTTGA TTTTTTAAGG CGGACAGAGT 541 AGACACTAAA TGGCGGACAG ATTAGACGAA GACGGACGGA CGGAGAGCAC AATTTTGATT 601 TTCAAAGGCT GACAAAGTAG ACAAAAATGG CGGACAGTTT AGACGAAGGC TGACGGATGG 661 AGAGCACAAA TTTTGATTTT CAAAATTGAG AAG Predicted gene structure (within gDNA segment 1998 to 1): Exon 1 1282 940 ( 343 n); cDNA 8 349 ( 342 n); score: 0.948 Intron 1 939 865 ( 75 n); Pd: 0.000 (s: 0.88), Pa: 0.000 (s: 0.98) Exon 2 864 748 ( 117 n); cDNA 350 466 ( 117 n); score: 0.957 MATCH C06HBa0153O03.1-2- SGN-E556422- 0.950 460 0.664 C PGS_C06HBa0153O03.1-2-_SGN-E556422- (1282 940,864 748) Alignment (genomic DNA sequence = upper lines): TTTTAAAATC AAAACACTTT CATAATTTAA CAATATGTGT TGTTATGTTA ATAAATACTT 1223 ||| |||||| |||||||||| |||||||||| |||||||||| |||||||| | |||||||||| TTTAAAAATC AAAACACTTT CATAATTTAA CAATATGTGT TGTTATGTCA ATAAATACTT 67 CAAACATCTT GTTTAACTCC AAAAAAAAAA ATAAAATCAA ACATTTAAAA CTAACTAGCC 1163 |||||||||| |||||||||| |||||| ||| | |||||||| ||||| |||| |||||||||| CAAACATCTT GTTTAACTCC AAAAAACAAA AAAAAATCAA ACATTGAAAA CTAACTAGCC 127 TACATTAACA ATTTCATCTT CAAATGATGG GTTTAATACA TTCTTCATTC TTGGAGGGTC 1103 |||||||||| |||||||||| || ||||||| |||||||||| |||||||||| ||| |||||| TACATTAACA ATTTCATCTT CAGATGATGG GTTTAATACA TTCTTCATTC TTGAAGGGTC 187 ATCACTGTTG CTAACATATC CAGCATTCAT CTTGTCGAGA CCATACTTCA ATAAAAGTAT 1043 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||| || ATCACTGTTG CTAACATATC CAGCATTCAT CTTGTCGAGA CCATACTTCA ATAAAAGAAT 247 CGCATATCTT TTGCCAAGAT AGCTACTCTC AAAACTATTA CATGACATAT TAATTTTTTC 983 |||||||||| |||| ||||| ||||||||| |||||||||| |||||||| | | ||||| || CGCATATCTT TTGCGAAGAT AGCTACTCTT AAAACTATTA CATGACATGT TGATTTTGTC 307 ACTCAGAAAT TCTGCATATA CAGCTACAAA CAGTCCGCAA TCACTACAAA AACAATAAGA 923 |||||| ||| |||||||||| || ||| || ||||||||| ||| ACTCAG-AAT TCTGCATATA CAATTACGAA TAGTCCGCAA TCA....... .......... 349 TTAATTATAA GGATGAATTA GATATAAAAA ATAAAATAAA ATAATAAACA ATACTCTCAG 863 || .......... .......... .......... .......... .......... ........AG 351 GCTATCACTT ATTTGTTGCA TGTTGTCTTG AGCAAATTCC ACATTAAACG AATGTTGTGG 803 |||||||||| |||||||||| |||||||||| |||||| ||| |||||||||| |||||||||| GCTATCACTT ATTTGTTGCA TGTTGTCTTG AGCAAACTCC ACATTAAACG AATGTTGTGG 411 ATTTAAAAGT TCTCCAGTTT TCATGTCTTT ATATGCATCC AATGCTGCCC AATCT 748 | |||||| |||||||||| || ||||||| |||||||||| |||||||||| ||||| ACCCAAAAGT TCTCCAGTTT TCTTGTCTTT ATATGCATCC AATGCTGCCC AATCT 466 hqPGS_C06HBa0153O03.1-2-_SGN-E556422- (1282 940,864 748) ******************************************************************************** EST sequence 28 -strand 658 n (File: SGN-E377132-) 1 TTCCTTCTTT TACCCTAATT AGCATATATT TAAGAATAAA AGATGGAATA ATAACCCACT 61 AATTTACTCA AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA 121 CTAACTTTAT AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC 181 CGACCCAGAC TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT 241 GCAGGTCGTC GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA 301 CGCCCGTGAC GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC 361 CCAATTTCAG AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT 421 GACGGTCCGT CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT 481 CAAAACGACT AAACAGGTCG TTACATTTAT GTTCTTCCTA CTTAAATATT ATTATTATTT 541 TACGATTTAT AACACTATTA GAAACAAAGA TTTTCTCAAC CATGAATTAA TGAAAAAATT 601 ATGCCATAAA ATATAAAAAA TTTACTCATT TTTCATTGAG CTAATTCATA AAAAAAAA Predicted gene structure (within gDNA segment 2548 to 1): Exon 1 1901 1401 ( 501 n); cDNA 2 499 ( 498 n); score: 0.869 PPA cDNA 648 658 MATCH C06HBa0153O03.1-2- SGN-E377132- 0.869 501 0.761 C PGS_C06HBa0153O03.1-2-_SGN-E377132- (1901 1401) Alignment (genomic DNA sequence = upper lines): TCTTTCTTTT ACCCTAATTA GTATATAATT AAGAATAAAA GATGGCAATA ATACCCCACT 1842 || ||||||| |||||||||| | ||||| || |||||||||| ||||| |||| ||| |||||| TCCTTCTTTT ACCCTAATTA GCATATATTT AAGAATAAAA GATGG-AATA ATAACCCACT 60 AATTAACTTA AGGTTACCTC TTTTAACCCC CAAGGATTTT GAGTTATTAA TATAAACCCA 1782 |||| ||| | |||||||||| |||||||||| || | | || || ||||||| ||||||||| AATTTACTCA AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA 120 TGAAATATAT AATCATAGCA GGAATAGTCC AAAACGCCCC TTTAAAACTT AACCAGAAAT 1722 || | ||| ||| | || | |||||||||| |||||| ||| ||||||| | |||||| CTAACTTTAT AATTAAAGTA GGAATAGTCC AAAACGTCCC -TTAAAACGT GTAAAGAAAT 179 CTGACTCCAA CTGGGATTGC ACAACCTGTG ACGGGCCGTC GTGCCTGCGA CGGTCCGTCC 1662 | ||| | | |||||||| | ||||||||| | || ||||| |||||||||| |||||||||| CCGACCCAGA CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC 239 TGCAGGTCGT CGCAAAGTTC AGAGACCCAA TATTTCCACC -AAGGGTCTG TGATGGTCCG 1603 |||||||||| ||||| |||| |||||| | | ||||||||| ||| ||||| ||| |||||| TGCAGGTCGT CGCAAGGTTC AGAGACTC-A -ATTTCCACC AAAGAGTCTG TGACGGTCCG 297 TCACACCTGT GACGGTCCGT CCTGCCATTC CGTCACGAAG TTCAGAGAGT TGATTTTCAG 1543 |||| || || |||||||||| | |||||||| ||| |||||| |||||||||| |||||| || TCACGCCCGT GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG 357 TACCCAATTT TAGATTTTCT AAGTGTTTTG AAACGAGACC CTGCGACGGT CTGTCGTGCC 1483 |||||||||| ||| ||||| |||||||||| ||||||||| | ||||||| | |||||| TACCCAATTT CAGAATTTCT AAGTGTTTTG AAACGAGACT CCTCGACGGT CCATCGTGCT 417 CATGACGGTC CGTCGTTGGG TTCGTCGCCT CAGCCTGTTT TTCCAGAAAT AAAATCCGCT 1423 |||||||||| |||||| || | ||||| || || ||||||| ||||| |||| |||||| ||| CATGACGGTC CGTCGTGGGT TCCGTCGTCT CAACCTGTTT TTCCAAAAAT AAAATCTGCT 477 GCTCAAAACG ACTAAACAGG TC 1401 ||||||||| |||||||||| || ACTCAAAACG ACTAAACAGG TC 499 hqPGS_C06HBa0153O03.1-2-_SGN-E377132- (1901 1401) ******************************************************************************** EST sequence 12 -strand 542 n (File: SGN-E252199-) 1 CGACCCAGCC TGGGATTACG CAGTCTGTGA CGGTCCGTCC TGCACGTCCG TCACAGAGTT 61 CAGAGACTAG ATTTTTACCA AGGGTCTGTG ACGGCCCATC ACGCCTGTGA CGGTCCGTCC 121 TGCCATTCCG TCACGAAGTT CAGAGAGTCG ATTTCAGTAC CCAAATTTCA GAATTCTAAG 181 TGTTTTGGAA CGAGACCCCC TCGACGGTCC GTCGTGGGAT CCGTCGTCTC AGTCAGTTTT 241 TCCAGAAATA AAATCTGTTA CTCAAAACGA CTAAACAGGT CGTTACAATA GATACCAATT 301 TACCCATCGT TCGTCCCCGA ACGATCACAA GAAGAAAAAC AAGGGCGAAA AGGAGTACCT 361 GAATCTGTAA ACAGGTATGG GTATCTTTCT CGCATATCAA CTTCCTTCTC CCAAGTGGAT 421 TCTTCAACTG GTCGATTCTT CCATTGAACT TTGATAGATG CAATCTCCCT TGACCTCAAT 481 TTGCGGACTT CTCTATCTAA AATGGCAACA GGCTCCTCCT CATAAGACAA ATTCTCATCA 541 AG Predicted gene structure (within gDNA segment 3237 to 1): Exon 1 1677 1401 ( 277 n); cDNA 25 281 ( 257 n); score: 0.812 MATCH C06HBa0153O03.1-2- SGN-E252199- 0.812 277 0.511 C PGS_C06HBa0153O03.1-2-_SGN-E252199- (1677 1401) Alignment (genomic DNA sequence = upper lines): CTGCGACGGT CCGTCCTGCA GGTC-GTCGC AAAGTTCAGA GACCCAATAT TTCCACCAAG 1619 ||| |||||| |||||||||| ||| ||| | | |||||||| || | | || || |||||| CTGTGACGGT CCGTCCTGCA CGTCCGTCAC AGAGTTCAGA GA-CTAG-AT TTTTACCAAG 82 GGTCTGTGAT GGTCCGTCAC ACCTGTGACG GTCCGTCCTG CCATTCCGTC ACGAAGTTCA 1559 ||||||||| || || |||| ||||||||| |||||||||| |||||||||| |||||||||| GGTCTGTGAC GGCCCATCAC GCCTGTGACG GTCCGTCCTG CCATTCCGTC ACGAAGTTCA 142 GAGAGTTGAT TTTCAGTACC CAA-TTTTAG ATTTTCTAAG TGTTTTGAAA CGAGACCCTG 1500 |||||| || |||||||||| ||| ||| || | ||||||| ||||||| || ||||| || GAGAGTCGA- TTTCAGTACC CAAATTTCAG A-ATTCTAAG TGTTTTGGAA CGAGA-CC-- 197 CGACGGTCTG TCGTGCCCAT GACGGTCCGT CGTTGGGTTC GTCGCCTCAG CCTGTTTTTC 1440 | | | ||| ||||||||| ||| || | | |||| ||||| | ||||||| C--C---C-- TCG------- -ACGGTCCGT CGTGGGATCC GTCGTCTCAG TCAGTTTTTC 242 CAGAAATAAA ATCCGCTGCT CAAAACGACT AAACAGGTC 1401 |||||||||| ||| | | || |||||||||| ||||||||| CAGAAATAAA ATCTGTTACT CAAAACGACT AAACAGGTC 281 hqPGS_C06HBa0153O03.1-2-_SGN-E252199- (1677 1401) ******************************************************************************** EST sequence 19 -strand 515 n (File: SGN-E242359-) 1 AGTATGTATT AAGCAATATC ATAAAATTAA CTAATATCCT TAGCATGCAG CATTTGCAAT 61 TACCATAACC CTTGGTTGCA TCACCAAGCA CATCAATGAG GACTCACACC TCCTCATCAT 121 ACTTATTTGG GAATTTAGTT CATTGGATTG CATATATTAA CATATTTCAA GATTCATCAT 181 ATTTATTCCC CTCGTGTCCT TACGTGACAC TCCACTCCTC AATATACTAT CCTGGCACCG 241 GAACGTGGCA CCCGATCCAT ATTCTATCCT GGTGTCAGAA CGTGACACCC GATCCATATT 301 CTATCCTGGT GTCGGAACGT GACACCCGAT CCATATTCTA TCCTGGTACC GGAACGTGGC 361 ACCCGATCCA TATTCTATCC TGGTGTCGGA ACGTGACACC CGATCCATAT TCTATCCTGG 421 TACCGGAACG TGGCACCCGA TCCCCTAATC TCACCACTTT CGTTCATCAA GCCTTCTTTT 481 ATACCAAGGC ATCATTATTA ACAAAGTAGA TTAGG Predicted gene structure (within gDNA segment 4098 to 1604): Exon 1 2759 2471 ( 289 n); cDNA 1 288 ( 288 n); score: 0.913 Intron 1 2470 2431 ( 40 n); Pd: 0.000 (s: 0.88), Pa: 0.000 (s: 0.96) Exon 2 2430 2204 ( 227 n); cDNA 289 515 ( 227 n); score: 0.978 MATCH C06HBa0153O03.1-2- SGN-E242359- 0.942 516 1.002 C PGS_C06HBa0153O03.1-2-_SGN-E242359- (2759 2471,2430 2204) Alignment (genomic DNA sequence = upper lines): AGTATGTATT AAGCAATATC ATAAAATCAA TTAATATCCT TAGCATGCAG CATTTACAGT 2700 |||||||||| |||||||||| ||||||| || ||||||||| |||||||||| ||||| || | AGTATGTATT AAGCAATATC ATAAAATTAA CTAATATCCT TAGCATGCAG CATTTGCAAT 60 TACCATAACC CTTGGTTACA ACACCAAGCA CATCAATGAG GACTCACACC TCCTCATCAC 2640 |||||||||| ||||||| || ||||||||| |||||||||| |||||||||| ||||||||| TACCATAACC CTTGGTTGCA TCACCAAGCA CATCAATGAG GACTCACACC TCCTCATCAT 120 ACTCATTTGG GAATTTAGTT CATTAGATTG GATATATTAA CATATTTCAA GATTCATTAT 2580 ||| |||||| |||||||||| |||| ||||| ||||||||| |||||||||| ||||||| || ACTTATTTGG GAATTTAGTT CATTGGATTG CATATATTAA CATATTTCAA GATTCATCAT 180 CTTTATTCTC CTCGTGTCGG TACGTGACAC TCCGCTCCTC AATATACTAT CCTGGTGTCG 2520 ||||||| | |||||||| |||||||||| ||| |||||| |||||||||| ||||| || ATTTATTCCC CTCGTGTCCT TACGTGACAC TCCACTCCTC AATATACTAT CCTGGCACCG 240 GAACGTGACA CTCTGATCCT CATTCTATCC TGGTGTCGGA ACGTGACACT CCGATCCTCA 2460 ||||||| || | | ||||| ||||||||| ||||||| || ||||||||| GAACGTGGCA C-CCGATCCA TATTCTATCC TGGTGTCAGA ACGTGACAC. .......... 288 TATACTATCC TGGTACCGGA ACGTGGCACC CGATCCATAT TCTATCCTGG TGTCAGAACG 2400 | |||||||||| |||||||||| |||| ||||| .......... .......... .........C CGATCCATAT TCTATCCTGG TGTCGGAACG 319 TGACACCCGA TCCATATCCT ATCCTGGTAC CGGAACGTGG CACCCGATCC ATATTCTATC 2340 |||||||||| ||||||| || |||||||||| |||||||||| |||||||||| |||||||||| TGACACCCGA TCCATATTCT ATCCTGGTAC CGGAACGTGG CACCCGATCC ATATTCTATC 379 TTGGTGTCGG AACGTGACAC CCGATCTATA TTCTATCCTG GTACCGGAAC GTGGCACCCG 2280 ||||||||| |||||||||| |||||| ||| |||||||||| |||||||||| |||||||||| CTGGTGTCGG AACGTGACAC CCGATCCATA TTCTATCCTG GTACCGGAAC GTGGCACCCG 439 ATCCCCTAAT CTCACCACTT TCGTTCATCA AGCCTTCTTT TATACCAAGG CATCATCATT 2220 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||| ||| ATCCCCTAAT CTCACCACTT TCGTTCATCA AGCCTTCTTT TATACCAAGG CATCATTATT 499 AACAAAGTAG ATTAGG 2204 |||||||||| |||||| AACAAAGTAG ATTAGG 515 hqPGS_C06HBa0153O03.1-2-_SGN-E242359- (2759 2471,2430 2204) ******************************************************************************** EST sequence 94 +strand 686 n (File: SGN-E241789+) 1 ATGTCTGATC TTCAAGGACA TCAAGACATG AATGAGAGAG AATCCAGTCC GAGCTAGGAA 61 CAATAGCTCA CCCTGAAATC TGACGTGATG AAGACTGGTT AGAGTTGCGG TTGAGTTGAA 121 GACGACGGTA CGTTTGCTGC ACTCCACAAT TAACAAAAAG AAAACATAAA AGTAGGGGTC 181 AGTACAAACA CGAGTACTGA GTAGATATCA TCGGCCAACT CAGAATAGAG AACAATATAT 241 ATCAAATAAT AAAATAAAAT CAACCATAAC ACTTAACAGG TGACAACAAC AAGTACCATA 301 ACCATTGGGC ACAACCCAAG AACATCTATG AGGACTCAAG CCTCCACACC ATACTCATTT 361 GGGAAACAGG TTCATTAAAT TGAGTACATT AACATAATTC AAGATTCATT CTTTTTACTA 421 TCGTGGTGTC GGAACGTGAT ACTCCGATCC CCTAATGCTA CGTGTCGGTT CGTGACACCC 481 GATCCCCTAA TACTACGTGT CGGTTCGTTA CACCCGATCT CCTAATACTA CGTGCCGATT 541 CGTGACACCC GATCCATTAA TACTATGTGT CGGTTCGTGA CACCCGATCC ATTAATACTA 601 CGTGTCGGTT CGTGACACCC GATCCCCTAA CCTCATTCTT TTAGTTCATC AAGCCTTCTT 661 TTATACCAAG ACATCATCAT TAACAA Predicted gene structure (within gDNA segment 4754 to 1): Exon 1 2991 2448 ( 544 n); cDNA 1 534 ( 534 n); score: 0.777 Intron 1 2447 2371 ( 77 n); Pd: 0.000 (s: 0.68), Pa: 0.000 (s: 0.76) Exon 2 2370 2215 ( 156 n); cDNA 535 686 ( 152 n); score: 0.795 MATCH C06HBa0153O03.1-2- SGN-E241789+ 0.781 700 1.020 C PGS_C06HBa0153O03.1-2-_SGN-E241789+ (2991 2448,2370 2215) Alignment (genomic DNA sequence = upper lines): ATGCCGGAAG TTCAAGG-CA TCAAGACTTG AAGAAGA-AG -ACCCAGTCC AAGCTAGAAG 2935 ||| | || ||||||| || ||||||| || || ||| || | ||||||| |||||| | ATGTCTGATC TTCAAGGACA TCAAGACATG AATGAGAGAG AATCCAGTCC GAGCTAGGAA 60 CATTAGCTCA CCCTGAATAT CCGGTATGAC GAAGACTGGC TAGAATCACT GCTGAGTTGA 2875 || ||||||| ||||||| || | | ||| ||||||||| |||| | | | |||||||| CAATAGCTCA CCCTGAA-AT CTGACGTGAT GAAGACTGGT TAGAGTTGCG GTTGAGTTGA 119 AGATGACGGA ACGTTTGCTG CACTCCACAA ATAACAAGAA GAAAACATAA AAGTAGGGGT 2815 ||| ||||| |||||||||| |||||||||| |||||| || |||||||||| |||||||||| AGACGACGGT ACGTTTGCTG CACTCCACAA TTAACAAAAA GAAAACATAA AAGTAGGGGT 179 CAGTACAAAA CACGGGTACT GAGTAGATAT CATCGGCCAA CTCAAAATAG AAAACAGTAT 2755 |||||| ||| |||| ||||| |||||||||| |||||||||| |||| ||||| | |||| ||| CAGTAC-AAA CACGAGTACT GAGTAGATAT CATCGGCCAA CTCAGAATAG AGAACAATAT 238 GTATTAAGCA ATATCATAAA ATCAATTAAT ATCCTTAGCA TGCAGCATTT ACAGTTACCA 2695 ||| || | ||| ||||| ||||| | | |||| || | || ||| ||||| ATATCAAATA ATAAAATAAA ATCAACCATA ACACTTAACA GGTGACAACA ACAAGTACCA 298 TAACCCTTGG TTACAACACC AAGCACATCA ATGAGGACTC ACACCTCCTC ATCACACTCA 2635 ||||| |||| ||||| || ||| ||||| |||||||||| | ||||| | | || ||||| TAACCATTGG GCACAAC-CC AAGAACATCT ATGAGGACTC AAGCCTCCAC ACCATACTCA 357 TTTGGGAATT TAGTTCATTA GATTGGATAT ATTAACATAT TTCAAGATTC ATTATCTTTA 2575 |||||||| |||||||| |||| || ||||||||| |||||||||| ||| | |||| TTTGGGAAAC AGGTTCATTA AATTGAGTAC ATTAACATAA TTCAAGATTC ATTCTTTTTA 417 TTCTCCTCGT GTCGGTACGT GACACTCCGC TCCTCAATAT ACTATCCTGG TGTCGGAACG 2515 | || | || ||||| |||| || |||||| ||| | | || ||| | | |||||| || CTATCGTGGT GTCGGAACGT GATACTCCGA TCCCCTA-AT GCTA--C--G TGTCGGTTCG 472 TGACACTCTG ATCCTCATTC TATCCTGGTG TCGGAACGTG ACACTCCGAT CCTCATATAC 2455 |||||| | | |||| | | || | ||| |||| ||| |||| ||||| | | |||| TGACAC-CCG ATCC-CCTAA TA-CTACGTG TCGGTTCGTT ACAC-CCGAT CTCCTAATAC 528 TATCCTGGTA CCGGAACGTG GCACCCGATC CATATTCTAT CCTGGTGTCA GAACGTGACA 2395 || | || TA-CGTG... .......... .......... .......... .......... .......... 534 CCCGATCCAT ATCCTATCCT GGTACCGGAA CGTGGCACCC GATCCATATT CTATCTTGGT 2335 ||| |||| ||||| ||||||| | || || || .......... .......... ....CCGATT CGTGACACCC GATCCAT-TA ATA-CTATGT 568 GTCGGAACGT GACACCCGAT CTATATTCTA TCCTGGTACC GGAACGTGGC ACCCGATCCC 2275 ||||| ||| |||||||||| | || | || | || | || |||| | |||||||||| GTCGGTTCGT GACACCCGAT CCAT-TAATA -CTACGTGTC GGTTCGTGAC ACCCGATCCC 626 CTAATCTCAC CACTTTCGTT CATCAAGCCT TCTTTTATAC CAAGGCATCA TCATTAACAA 2215 |||| |||| ||| ||| |||||||||| |||||||||| |||| ||||| |||||||||| CTAACCTCAT TCTTTTAGTT CATCAAGCCT TCTTTTATAC CAAGACATCA TCATTAACAA 686 hqPGS_C06HBa0153O03.1-2-_SGN-E241789+ (2991 2448,2370 2215) ******************************************************************************** EST sequence 86 +strand 558 n (File: SGN-E231589+) 1 TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA TAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT 481 CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT CAAAACGACT 541 AAACAGGTCG TTACATTT Predicted gene structure (within gDNA segment 3038 to 1): Exon 1 1950 1395 ( 556 n); cDNA 1 555 ( 555 n); score: 0.861 MATCH C06HBa0153O03.1-2- SGN-E231589+ 0.861 556 0.996 C PGS_C06HBa0153O03.1-2-_SGN-E231589+ (1950 1395) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 1893 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 60 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGGCAAT AATACCCCAC TAATTAACTT 1833 |||||||||| || ||||||| |||||||||| |||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATGG-AAT AATAACCCAC TAATTTACTC 119 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 1773 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 179 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 1713 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 238 ACTGGGATTG CACAACCTGT GACGGGCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 1653 ||||||||| | |||||||| || || |||| |||||||||| |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 298 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGATGGTCC GTCACACCTG 1594 |||||| ||| || |||| | | |||||||| | ||| |||| |||| ||||| ||||| || | TCGCAAGGTT CATAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 356 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TTGATTTTCA GTACCCAATT 1534 |||||||||| || ||||||| |||| ||||| |||||||||| | |||||| | |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 416 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCTGTCGTGC CCATGACGGT 1474 | ||| |||| |||||||||| |||||||||| | |||||| || |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 476 CCGTCGTTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCCGC TGCTCAAAAC 1414 ||||||| || | ||||| | ||| |||||| |||||| ||| ||||||| || | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 536 GACTAAACAG GTCGTTACA 1395 |||||||||| ||||||||| GACTAAACAG GTCGTTACA 555 hqPGS_C06HBa0153O03.1-2-_SGN-E231589+ (1950 1395) ******************************************************************************** EST sequence 8 -strand 681 n (File: SGN-E389553-) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCGTTCC TACTTAAATA TTATTATTAT TTTACGATTT 601 ATAACACTAT TAGAAACAAA GATTTTCTCA ACCATGAATT AATGAAAAAA TTATGGAATA 661 AAATATAAAA AATTACTCAT T Predicted gene structure (within gDNA segment 3058 to 1): Exon 1 1950 1401 ( 550 n); cDNA 2 550 ( 549 n); score: 0.861 MATCH C06HBa0153O03.1-2- SGN-E389553- 0.861 550 0.808 C PGS_C06HBa0153O03.1-2-_SGN-E389553- (1950 1401) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 1893 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGGCAAT AATACCCCAC TAATTAACTT 1833 |||||||||| || ||||||| |||||||||| |||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATGG-AAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 1773 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 1713 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CACAACCTGT GACGGGCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 1653 ||||||||| | |||||||| || || |||| |||||||||| |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGATGGTCC GTCACACCTG 1594 |||||| ||| ||||||| | | |||||||| | ||| |||| |||| ||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TTGATTTTCA GTACCCAATT 1534 |||||||||| || ||||||| |||| ||||| |||||||||| | |||||| | |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCTGTCGTGC CCATGACGGT 1474 | ||| |||| |||||||||| |||||||||| | |||||| || |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCCGC TGCTCAAAAC 1414 ||||||| || | ||||| | ||| |||||| |||||| ||| ||||||| || | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTC 1401 |||||||||| ||| GACTAAACAG GTC 550 hqPGS_C06HBa0153O03.1-2-_SGN-E389553- (1950 1401) ******************************************************************************** EST sequence 4 -strand 679 n (File: SGN-E550127-) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTCCT TTTCTTTTTC TTATCAAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTGCACAA CCATGAATTA ATGAAAAAAT TATGACATAA 661 AATATAAAAA ATTACTCAT Predicted gene structure (within gDNA segment 3058 to 1): Exon 1 1950 1401 ( 550 n); cDNA 2 550 ( 549 n); score: 0.855 MATCH C06HBa0153O03.1-2- SGN-E550127- 0.855 550 0.810 C PGS_C06HBa0153O03.1-2-_SGN-E550127- (1950 1401) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTCT TTGT-TCTTT CTATTTT-CT TATTCCAACC CTCTTTCTTT 1893 ||| ||||| ||| |||||| || | || || | |||| || ||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTCCTT TTCTTTTTCT TATCAAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGGCAAT AATACCCCAC TAATTAACTT 1833 |||||||||| || ||||||| |||||||||| |||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATGG-AAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 1773 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 1713 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CACAACCTGT GACGGGCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 1653 ||||||||| | |||||||| || || |||| |||||||||| |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGATGGTCC GTCACACCTG 1594 |||||| ||| ||||||| | | |||||||| | ||| |||| |||| ||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TTGATTTTCA GTACCCAATT 1534 |||||||||| || ||||||| |||| ||||| |||||||||| | |||||| | |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCTGTCGTGC CCATGACGGT 1474 | ||| |||| |||||||||| |||||||||| | |||||| || |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCCGC TGCTCAAAAC 1414 ||||||| || | ||||| | ||| |||||| |||||| ||| ||||||| || | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTC 1401 |||||||||| ||| GACTAAACAG GTC 550 hqPGS_C06HBa0153O03.1-2-_SGN-E550127- (1950 1401) ******************************************************************************** EST sequence 6 -strand 673 n (File: SGN-E550140-) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTACTCN 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATGTTATCAA CCATGAATTA ACAAAAAATT AGACCAAAAA 661 TATAAAAAAT TAC Predicted gene structure (within gDNA segment 3058 to 1): Exon 1 1950 1401 ( 550 n); cDNA 2 550 ( 549 n); score: 0.857 MATCH C06HBa0153O03.1-2- SGN-E550140- 0.857 550 0.817 C PGS_C06HBa0153O03.1-2-_SGN-E550140- (1950 1401) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 1893 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGGCAAT AATACCCCAC TAATTAACTT 1833 |||||||||| || ||||||| |||||||||| |||||| ||| |||| ||||| |||||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATGG-AAT AATAACCCAC TAATTACTCN 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 1773 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 1713 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CACAACCTGT GACGGGCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 1653 ||||||||| | |||||||| || || |||| |||||||||| |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGATGGTCC GTCACACCTG 1594 |||||| ||| ||||||| | | |||||||| | ||| |||| |||| ||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TTGATTTTCA GTACCCAATT 1534 |||||||||| || ||||||| |||| ||||| |||||||||| | |||||| | |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCTGTCGTGC CCATGACGGT 1474 | ||| |||| |||||||||| |||||||||| | |||||| || |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCCGC TGCTCAAAAC 1414 ||||||| || | ||||| | ||| |||||| |||||| ||| ||||||| || | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTC 1401 |||||||||| ||| GACTAAACAG GTC 550 hqPGS_C06HBa0153O03.1-2-_SGN-E550140- (1950 1401) ******************************************************************************** EST sequence 61 +strand 732 n (File: SGN-E550201+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCNA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA AAAAAAAACT 721 CGAGGGGGGG CC Predicted gene structure (within gDNA segment 3048 to 1): Exon 1 1950 1401 ( 550 n); cDNA 2 550 ( 549 n); score: 0.861 PPA cDNA 699 718 MATCH C06HBa0153O03.1-2- SGN-E550201+ 0.861 550 0.751 C PGS_C06HBa0153O03.1-2-_SGN-E550201+ (1950 1401) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 1893 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGGCAAT AATACCCCAC TAATTAACTT 1833 |||||||||| || ||||||| |||||||||| |||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATGG-AAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 1773 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 1713 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CACAACCTGT GACGGGCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 1653 ||||||||| | |||||||| || || |||| |||||||||| |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGATGGTCC GTCACACCTG 1594 |||||| ||| ||||||| | | |||||||| | ||| |||| |||| ||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TTGATTTTCA GTACCCAATT 1534 |||||||||| || ||||||| |||| ||||| |||||||||| | |||||| | |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCTGTCGTGC CCATGACGGT 1474 | ||| |||| |||||||||| |||||||||| | |||||| || |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCCGC TGCTCAAAAC 1414 ||||||| || | ||||| | ||| |||||| |||||| ||| ||||||| || | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTC 1401 |||||||||| ||| GACTAAACAG GTC 550 hqPGS_C06HBa0153O03.1-2-_SGN-E550201+ (1950 1401) ******************************************************************************** EST sequence 63 +strand 709 n (File: SGN-E550207+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTNCA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAA Predicted gene structure (within gDNA segment 3048 to 1): Exon 1 1950 1401 ( 550 n); cDNA 2 550 ( 549 n); score: 0.861 PPA cDNA 699 709 MATCH C06HBa0153O03.1-2- SGN-E550207+ 0.861 550 0.776 C PGS_C06HBa0153O03.1-2-_SGN-E550207+ (1950 1401) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 1893 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGGCAAT AATACCCCAC TAATTAACTT 1833 |||||||||| || ||||||| |||||||||| |||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATGG-AAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 1773 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 1713 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CACAACCTGT GACGGGCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 1653 ||||||||| | |||||||| || || |||| |||||||||| |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGATGGTCC GTCACACCTG 1594 |||||| ||| ||||||| | | |||||||| | ||| |||| |||| ||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TTGATTTTCA GTACCCAATT 1534 |||||||||| || ||||||| |||| ||||| |||||||||| | |||||| | |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCTGTCGTGC CCATGACGGT 1474 | ||| |||| |||||||||| |||||||||| | |||||| || |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCCGC TGCTCAAAAC 1414 ||||||| || | ||||| | ||| |||||| |||||| ||| ||||||| || | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTC 1401 |||||||||| ||| GACTAAACAG GTC 550 hqPGS_C06HBa0153O03.1-2-_SGN-E550207+ (1950 1401) ******************************************************************************** EST sequence 65 +strand 715 n (File: SGN-E550335+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAATCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT 481 CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT CAAAACGACT 541 AAACAGGTCG TTACATTTAT GTTCTTCCTA CTTAAATATT ATTATTATTT TACGATTTAT 601 AACACTATTA GAAACAAAGA TTTTCTCNAC CATGAATTAA TGAAAAAATT ATGCCATAAA 661 ATATAAAAAA TTTACTCATT TTTCATTGAG CTAATTCATA AAAAAAAAAA AAAAA Predicted gene structure (within gDNA segment 3038 to 1): Exon 1 1950 1401 ( 550 n); cDNA 2 549 ( 548 n); score: 0.859 PPA cDNA 698 715 MATCH C06HBa0153O03.1-2- SGN-E550335+ 0.859 550 0.769 C PGS_C06HBa0153O03.1-2-_SGN-E550335+ (1950 1401) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 1893 ||| ||||| ||| ||||| || ||||||| | |||||| |||| ||||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATT-CAACC CTCCTTCTTT 60 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGGCAAT AATACCCCAC TAATTAACTT 1833 |||||||||| || ||||||| |||||||||| |||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATGG-AAT AATAACCCAC TAATTTACTC 119 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 1773 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 179 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 1713 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 238 ACTGGGATTG CACAACCTGT GACGGGCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 1653 ||||||||| | |||||||| || || |||| |||||||||| |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 298 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGATGGTCC GTCACACCTG 1594 |||||| ||| ||||||| | | |||||||| | ||| ||| |||| ||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAATCT GTGACGGTCC GTCACGCCCG 356 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TTGATTTTCA GTACCCAATT 1534 |||||||||| || ||||||| |||| ||||| |||||||||| | |||||| | |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 416 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCTGTCGTGC CCATGACGGT 1474 | ||| |||| |||||||||| |||||||||| | |||||| || |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 476 CCGTCGTTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCCGC TGCTCAAAAC 1414 ||||||| || | ||||| | ||| |||||| |||||| ||| ||||||| || | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 536 GACTAAACAG GTC 1401 |||||||||| ||| GACTAAACAG GTC 549 hqPGS_C06HBa0153O03.1-2-_SGN-E550335+ (1950 1401) ******************************************************************************** EST sequence 71 +strand 714 n (File: SGN-E390013+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACNAAG ATTTTCTCAA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA AAAA Predicted gene structure (within gDNA segment 3048 to 1): Exon 1 1950 1401 ( 550 n); cDNA 2 550 ( 549 n); score: 0.861 PPA cDNA 699 714 MATCH C06HBa0153O03.1-2- SGN-E390013+ 0.861 550 0.770 C PGS_C06HBa0153O03.1-2-_SGN-E390013+ (1950 1401) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 1893 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGGCAAT AATACCCCAC TAATTAACTT 1833 |||||||||| || ||||||| |||||||||| |||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATGG-AAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 1773 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 1713 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CACAACCTGT GACGGGCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 1653 ||||||||| | |||||||| || || |||| |||||||||| |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGATGGTCC GTCACACCTG 1594 |||||| ||| ||||||| | | |||||||| | ||| |||| |||| ||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TTGATTTTCA GTACCCAATT 1534 |||||||||| || ||||||| |||| ||||| |||||||||| | |||||| | |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCTGTCGTGC CCATGACGGT 1474 | ||| |||| |||||||||| |||||||||| | |||||| || |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCCGC TGCTCAAAAC 1414 ||||||| || | ||||| | ||| |||||| |||||| ||| ||||||| || | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTC 1401 |||||||||| ||| GACTAAACAG GTC 550 hqPGS_C06HBa0153O03.1-2-_SGN-E390013+ (1950 1401) ******************************************************************************** EST sequence 76 +strand 717 n (File: SGN-E550484+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCAA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA AAAAAAA Predicted gene structure (within gDNA segment 3048 to 1): Exon 1 1950 1401 ( 550 n); cDNA 2 550 ( 549 n); score: 0.861 PPA cDNA 699 717 MATCH C06HBa0153O03.1-2- SGN-E550484+ 0.861 550 0.767 C PGS_C06HBa0153O03.1-2-_SGN-E550484+ (1950 1401) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 1893 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGGCAAT AATACCCCAC TAATTAACTT 1833 |||||||||| || ||||||| |||||||||| |||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATGG-AAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 1773 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 1713 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CACAACCTGT GACGGGCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 1653 ||||||||| | |||||||| || || |||| |||||||||| |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGATGGTCC GTCACACCTG 1594 |||||| ||| ||||||| | | |||||||| | ||| |||| |||| ||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TTGATTTTCA GTACCCAATT 1534 |||||||||| || ||||||| |||| ||||| |||||||||| | |||||| | |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCTGTCGTGC CCATGACGGT 1474 | ||| |||| |||||||||| |||||||||| | |||||| || |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCCGC TGCTCAAAAC 1414 ||||||| || | ||||| | ||| |||||| |||||| ||| ||||||| || | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTC 1401 |||||||||| ||| GACTAAACAG GTC 550 hqPGS_C06HBa0153O03.1-2-_SGN-E550484+ (1950 1401) ******************************************************************************** EST sequence 78 +strand 713 n (File: SGN-E550211+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT 481 CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT CAAAACGACT 541 AAACAGGTCG TTACATTTAT GTTCTTCCTA CTTAAATATT ATTATTATTT TACGATTTAT 601 AACACTATTA GAAACAAAGA TTTTCTCAAC CATGAATTAA TGAAAAAATT ATGCCATAAA 661 ATATAAAAAA TTTACTCATT TTTCATTGAG CTAATTCATA AAAAAAAAAA AAA Predicted gene structure (within gDNA segment 3038 to 1): Exon 1 1950 1401 ( 550 n); cDNA 2 549 ( 548 n); score: 0.861 PPA cDNA 698 713 MATCH C06HBa0153O03.1-2- SGN-E550211+ 0.861 550 0.771 C PGS_C06HBa0153O03.1-2-_SGN-E550211+ (1950 1401) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 1893 ||| ||||| ||| ||||| || ||||||| | |||||| |||| ||||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATT-CAACC CTCCTTCTTT 60 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGGCAAT AATACCCCAC TAATTAACTT 1833 |||||||||| || ||||||| |||||||||| |||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATGG-AAT AATAACCCAC TAATTTACTC 119 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 1773 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 179 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 1713 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 238 ACTGGGATTG CACAACCTGT GACGGGCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 1653 ||||||||| | |||||||| || || |||| |||||||||| |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 298 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGATGGTCC GTCACACCTG 1594 |||||| ||| ||||||| | | |||||||| | ||| |||| |||| ||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 356 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TTGATTTTCA GTACCCAATT 1534 |||||||||| || ||||||| |||| ||||| |||||||||| | |||||| | |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 416 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCTGTCGTGC CCATGACGGT 1474 | ||| |||| |||||||||| |||||||||| | |||||| || |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 476 CCGTCGTTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCCGC TGCTCAAAAC 1414 ||||||| || | ||||| | ||| |||||| |||||| ||| ||||||| || | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 536 GACTAAACAG GTC 1401 |||||||||| ||| GACTAAACAG GTC 549 hqPGS_C06HBa0153O03.1-2-_SGN-E550211+ (1950 1401) ******************************************************************************** EST sequence 80 +strand 713 n (File: SGN-E550464+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GNTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCTA CCATGAATTA ATGAAAAATT ATGCCATAAG 661 ATATAAAAAA TTTACTCATT TTTCATTGAG CTAATTCATA AAAAAAAAAA AAA Predicted gene structure (within gDNA segment 3048 to 1): Exon 1 1950 1401 ( 550 n); cDNA 2 550 ( 549 n); score: 0.861 PPA cDNA 698 713 MATCH C06HBa0153O03.1-2- SGN-E550464+ 0.861 550 0.771 C PGS_C06HBa0153O03.1-2-_SGN-E550464+ (1950 1401) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 1893 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGGCAAT AATACCCCAC TAATTAACTT 1833 |||||||||| || ||||||| |||||||||| |||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATGG-AAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 1773 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 1713 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CACAACCTGT GACGGGCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 1653 ||||||||| | |||||||| || || |||| |||||||||| |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGATGGTCC GTCACACCTG 1594 |||||| ||| ||||||| | | |||||||| | ||| |||| |||| ||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TTGATTTTCA GTACCCAATT 1534 |||||||||| || ||||||| |||| ||||| |||||||||| | |||||| | |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCTGTCGTGC CCATGACGGT 1474 | ||| |||| |||||||||| |||||||||| | |||||| || |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCCGC TGCTCAAAAC 1414 ||||||| || | ||||| | ||| |||||| |||||| ||| ||||||| || | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTC 1401 |||||||||| ||| GACTAAACAG GTC 550 hqPGS_C06HBa0153O03.1-2-_SGN-E550464+ (1950 1401) ******************************************************************************** EST sequence 82 +strand 713 n (File: SGN-E549941+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA TATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCGA CCNATGATTA ATGAAAAATT ATGCCATCAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA AAA Predicted gene structure (within gDNA segment 3048 to 1): Exon 1 1950 1401 ( 550 n); cDNA 2 550 ( 549 n); score: 0.859 PPA cDNA 699 713 MATCH C06HBa0153O03.1-2- SGN-E549941+ 0.859 550 0.771 C PGS_C06HBa0153O03.1-2-_SGN-E549941+ (1950 1401) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 1893 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGGCAAT AATACCCCAC TAATTAACTT 1833 |||||||||| || ||||||| |||||||||| |||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATGG-AAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 1773 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 1713 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CACAACCTGT GACGGGCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 1653 ||||||||| | |||||||| || || |||| |||||||||| |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGATGGTCC GTCACACCTG 1594 |||||| ||| ||||||| | | |||||||| | ||| |||| |||| ||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TTGATTTTCA GTACCCAATT 1534 |||||||||| || ||||||| |||| ||||| |||||||||| | |||||| | |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCTGTCGTGC CCATGACGGT 1474 | ||| |||| |||||||||| |||||||||| | |||||| || |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCCGC TGCTCAAAAC 1414 ||||||| || | ||||| | ||| |||||| |||||| ||| ||| ||| || | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAATATCTGC TACTCAAAAC 537 GACTAAACAG GTC 1401 |||||||||| ||| GACTAAACAG GTC 550 hqPGS_C06HBa0153O03.1-2-_SGN-E549941+ (1950 1401) ******************************************************************************** EST sequence 84 +strand 714 n (File: SGN-E550025+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCAA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA AAAA Predicted gene structure (within gDNA segment 3048 to 1): Exon 1 1950 1401 ( 550 n); cDNA 2 550 ( 549 n); score: 0.861 PPA cDNA 699 714 MATCH C06HBa0153O03.1-2- SGN-E550025+ 0.861 550 0.770 C PGS_C06HBa0153O03.1-2-_SGN-E550025+ (1950 1401) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 1893 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGGCAAT AATACCCCAC TAATTAACTT 1833 |||||||||| || ||||||| |||||||||| |||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATGG-AAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 1773 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 1713 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CACAACCTGT GACGGGCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 1653 ||||||||| | |||||||| || || |||| |||||||||| |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGATGGTCC GTCACACCTG 1594 |||||| ||| ||||||| | | |||||||| | ||| |||| |||| ||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TTGATTTTCA GTACCCAATT 1534 |||||||||| || ||||||| |||| ||||| |||||||||| | |||||| | |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCTGTCGTGC CCATGACGGT 1474 | ||| |||| |||||||||| |||||||||| | |||||| || |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCCGC TGCTCAAAAC 1414 ||||||| || | ||||| | ||| |||||| |||||| ||| ||||||| || | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTC 1401 |||||||||| ||| GACTAAACAG GTC 550 hqPGS_C06HBa0153O03.1-2-_SGN-E550025+ (1950 1401) ******************************************************************************** EST sequence 102 +strand 649 n (File: SGN-E374999+) 1 TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT 481 CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT CCAAACGACT 541 AAACAGGTCG TTACATTTAT GTTCTTCCTA CTTAAATATT ATTATTATTT TACGATTTAT 601 AACACTATTA GAAACAAAGA TTTTCTTCAC CCTGAATTAA TGAAAAAAT Predicted gene structure (within gDNA segment 3038 to 1): Exon 1 1950 1401 ( 550 n); cDNA 1 549 ( 549 n); score: 0.859 MATCH C06HBa0153O03.1-2- SGN-E374999+ 0.859 550 0.847 C PGS_C06HBa0153O03.1-2-_SGN-E374999+ (1950 1401) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 1893 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 60 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGGCAAT AATACCCCAC TAATTAACTT 1833 |||||||||| || ||||||| |||||||||| |||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATGG-AAT AATAACCCAC TAATTTACTC 119 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 1773 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 179 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 1713 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 238 ACTGGGATTG CACAACCTGT GACGGGCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 1653 ||||||||| | |||||||| || || |||| |||||||||| |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 298 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGATGGTCC GTCACACCTG 1594 |||||| ||| ||||||| | | |||||||| | ||| |||| |||| ||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 356 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TTGATTTTCA GTACCCAATT 1534 |||||||||| || ||||||| |||| ||||| |||||||||| | |||||| | |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 416 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCTGTCGTGC CCATGACGGT 1474 | ||| |||| |||||||||| |||||||||| | |||||| || |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 476 CCGTCGTTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCCGC TGCTCAAAAC 1414 ||||||| || | ||||| | ||| |||||| |||||| ||| ||||||| || | ||| |||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCCAAAC 536 GACTAAACAG GTC 1401 |||||||||| ||| GACTAAACAG GTC 549 hqPGS_C06HBa0153O03.1-2-_SGN-E374999+ (1950 1401) ******************************************************************************** EST sequence 107 +strand 711 n (File: SGN-E396039+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AAAAACAAAG ATTTTCTCCA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AAATAAAAAA AATTTACTCA TTTTTTCTTG GAGCTAATTC AAAAAAAAAA A Predicted gene structure (within gDNA segment 3048 to 1): Exon 1 1950 1401 ( 550 n); cDNA 2 550 ( 549 n); score: 0.861 PPA cDNA 661 672 MATCH C06HBa0153O03.1-2- SGN-E396039+ 0.861 550 0.774 C PGS_C06HBa0153O03.1-2-_SGN-E396039+ (1950 1401) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 1893 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGGCAAT AATACCCCAC TAATTAACTT 1833 |||||||||| || ||||||| |||||||||| |||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATGG-AAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 1773 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 1713 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CACAACCTGT GACGGGCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 1653 ||||||||| | |||||||| || || |||| |||||||||| |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGATGGTCC GTCACACCTG 1594 |||||| ||| ||||||| | | |||||||| | ||| |||| |||| ||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TTGATTTTCA GTACCCAATT 1534 |||||||||| || ||||||| |||| ||||| |||||||||| | |||||| | |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCTGTCGTGC CCATGACGGT 1474 | ||| |||| |||||||||| |||||||||| | |||||| || |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCCGC TGCTCAAAAC 1414 ||||||| || | ||||| | ||| |||||| |||||| ||| ||||||| || | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTC 1401 |||||||||| ||| GACTAAACAG GTC 550 hqPGS_C06HBa0153O03.1-2-_SGN-E396039+ (1950 1401) ******************************************************************************** EST sequence 111 +strand 711 n (File: SGN-E396056+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAA ATTTTCTCAC CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATAATAAAA ATTTACTCAT TTTTTCTTTG AGCTAATTCA TAAAAAAAAA A Predicted gene structure (within gDNA segment 3048 to 1): Exon 1 1950 1401 ( 550 n); cDNA 2 550 ( 549 n); score: 0.861 PPA cDNA 700 711 MATCH C06HBa0153O03.1-2- SGN-E396056+ 0.861 550 0.774 C PGS_C06HBa0153O03.1-2-_SGN-E396056+ (1950 1401) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 1893 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGGCAAT AATACCCCAC TAATTAACTT 1833 |||||||||| || ||||||| |||||||||| |||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATGG-AAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 1773 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 1713 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CACAACCTGT GACGGGCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 1653 ||||||||| | |||||||| || || |||| |||||||||| |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGATGGTCC GTCACACCTG 1594 |||||| ||| ||||||| | | |||||||| | ||| |||| |||| ||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TTGATTTTCA GTACCCAATT 1534 |||||||||| || ||||||| |||| ||||| |||||||||| | |||||| | |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCTGTCGTGC CCATGACGGT 1474 | ||| |||| |||||||||| |||||||||| | |||||| || |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCCGC TGCTCAAAAC 1414 ||||||| || | ||||| | ||| |||||| |||||| ||| ||||||| || | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTC 1401 |||||||||| ||| GACTAAACAG GTC 550 hqPGS_C06HBa0153O03.1-2-_SGN-E396056+ (1950 1401) ******************************************************************************** EST sequence 121 +strand 690 n (File: SGN-E377133+) 1 TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT 481 CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT CAAAACGACT 541 AAACAGGTCG TTACATTTAT GTTCTTCCTA CTTAAATATT ATTATTATTT TACGATTTAT 601 AACACTATTA GAAACAAAGA TTTTCTCAAC CATGAATTAA TGAAAAAATT ATGCCATAAA 661 ATATAAAAAA TTTACTCATT TTTCATTGAG Predicted gene structure (within gDNA segment 3038 to 1): Exon 1 1950 1401 ( 550 n); cDNA 1 549 ( 549 n); score: 0.861 MATCH C06HBa0153O03.1-2- SGN-E377133+ 0.861 550 0.797 C PGS_C06HBa0153O03.1-2-_SGN-E377133+ (1950 1401) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 1893 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 60 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGGCAAT AATACCCCAC TAATTAACTT 1833 |||||||||| || ||||||| |||||||||| |||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATGG-AAT AATAACCCAC TAATTTACTC 119 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 1773 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 179 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 1713 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 238 ACTGGGATTG CACAACCTGT GACGGGCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 1653 ||||||||| | |||||||| || || |||| |||||||||| |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 298 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGATGGTCC GTCACACCTG 1594 |||||| ||| ||||||| | | |||||||| | ||| |||| |||| ||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 356 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TTGATTTTCA GTACCCAATT 1534 |||||||||| || ||||||| |||| ||||| |||||||||| | |||||| | |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 416 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCTGTCGTGC CCATGACGGT 1474 | ||| |||| |||||||||| |||||||||| | |||||| || |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 476 CCGTCGTTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCCGC TGCTCAAAAC 1414 ||||||| || | ||||| | ||| |||||| |||||| ||| ||||||| || | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 536 GACTAAACAG GTC 1401 |||||||||| ||| GACTAAACAG GTC 549 hqPGS_C06HBa0153O03.1-2-_SGN-E377133+ (1950 1401) ******************************************************************************** EST sequence 55 +strand 729 n (File: SGN-E550212+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCAA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA AAAAAACTCG 721 GGGGGGGGC Predicted gene structure (within gDNA segment 3048 to 1): Exon 1 1950 1401 ( 550 n); cDNA 2 550 ( 549 n); score: 0.861 PPA cDNA 699 716 MATCH C06HBa0153O03.1-2- SGN-E550212+ 0.861 550 0.754 C PGS_C06HBa0153O03.1-2-_SGN-E550212+ (1950 1401) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 1893 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGGCAAT AATACCCCAC TAATTAACTT 1833 |||||||||| || ||||||| |||||||||| |||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATGG-AAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 1773 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 1713 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CACAACCTGT GACGGGCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 1653 ||||||||| | |||||||| || || |||| |||||||||| |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGATGGTCC GTCACACCTG 1594 |||||| ||| ||||||| | | |||||||| | ||| |||| |||| ||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TTGATTTTCA GTACCCAATT 1534 |||||||||| || ||||||| |||| ||||| |||||||||| | |||||| | |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCTGTCGTGC CCATGACGGT 1474 | ||| |||| |||||||||| |||||||||| | |||||| || |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCCGC TGCTCAAAAC 1414 ||||||| || | ||||| | ||| |||||| |||||| ||| ||||||| || | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTC 1401 |||||||||| ||| GACTAAACAG GTC 550 hqPGS_C06HBa0153O03.1-2-_SGN-E550212+ (1950 1401) ******************************************************************************** EST sequence 57 +strand 710 n (File: SGN-E550065+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCAA CCATGAATTA ATGAAAAAAT TATGCCATGA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTGATTCAT AAGAAAAAAA Predicted gene structure (within gDNA segment 3048 to 1): Exon 1 1950 1401 ( 550 n); cDNA 2 550 ( 549 n); score: 0.861 MATCH C06HBa0153O03.1-2- SGN-E550065+ 0.861 550 0.775 C PGS_C06HBa0153O03.1-2-_SGN-E550065+ (1950 1401) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 1893 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGGCAAT AATACCCCAC TAATTAACTT 1833 |||||||||| || ||||||| |||||||||| |||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATGG-AAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 1773 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 1713 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CACAACCTGT GACGGGCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 1653 ||||||||| | |||||||| || || |||| |||||||||| |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGATGGTCC GTCACACCTG 1594 |||||| ||| ||||||| | | |||||||| | ||| |||| |||| ||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TTGATTTTCA GTACCCAATT 1534 |||||||||| || ||||||| |||| ||||| |||||||||| | |||||| | |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCTGTCGTGC CCATGACGGT 1474 | ||| |||| |||||||||| |||||||||| | |||||| || |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCCGC TGCTCAAAAC 1414 ||||||| || | ||||| | ||| |||||| |||||| ||| ||||||| || | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTC 1401 |||||||||| ||| GACTAAACAG GTC 550 hqPGS_C06HBa0153O03.1-2-_SGN-E550065+ (1950 1401) ******************************************************************************** EST sequence 59 +strand 726 n (File: SGN-E550322+) 1 TCGCACCAGA TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC 61 CTCCTTCTTT TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT 121 AATTTACTCA AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA 181 CTAACTTTAT AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC 241 CGACCCAGAC TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT 301 GCAGGTCGTC GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA 361 CGCCCGTGAC GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC 421 CCAATTTCAG AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT 481 GACGGTCCGT CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT 541 CAAAACGACT AAACAGGTCG TTACATTTAT GTTCTTCCTA CTTAAATATT ATTATTATTT 601 TACGATTTAT AACACTATTA GAAACAAAGA TTTTCTCAAC CATGAATTAA TGAAAAAATT 661 ATGCCATAAA ATATAAAAAA TTTACTCATT TTTCATTGAG CTAATTCATA AAAAAAAAAA 721 AAAAAC Predicted gene structure (within gDNA segment 3138 to 1): Exon 1 1928 1401 ( 528 n); cDNA 35 559 ( 525 n); score: 0.870 PPA cDNA 708 725 MATCH C06HBa0153O03.1-2- SGN-E550322+ 0.870 528 0.727 C PGS_C06HBa0153O03.1-2-_SGN-E550322+ (1928 1401) Alignment (genomic DNA sequence = upper lines): GTTCTTTCTA TTTTCTTATT CCAACCCTCT TTCTTTTACC CTAATTAGTA TATAATTAAG 1869 ||||||| |||||||||| | ||||||| |||||||||| |||||||| | |||||||||| GTTCTTTTCT TTTTCTTATT CAAACCCTCC TTCTTTTACC CTAATTAGCA TATAATTAAG 94 AATAAAAGAT GGCAATAATA CCCCACTAAT TAACTTAAGG TTACCTCTTT TAACCCCCAA 1809 |||||||||| || ||||||| ||||||||| | ||| |||| |||||||||| ||||||||| AATAAAAGAT GG-AATAATA ACCCACTAAT TTACTCAAGG TTACCTCTTT TAACCCCCAG 153 GGATTTTGAG TTATTAATAT AAACCCATGA AATATATAAT CATAGCAGGA ATAGTCCAAA 1749 | | || || ||||||| || ||||||| | | | |||||| | || |||| |||||||||| GTAATTAGAC TTATTAACAT AAACCCACTA ACTTTATAAT TAAAGTAGGA ATAGTCCAAA 213 ACGCCCCTTT AAAACTTAAC CAGAAATCTG ACTCCAACTG GGATTGCACA ACCTGTGACG 1689 ||| ||| || ||||| | ||||||| | || | |||| ||||| | || |||||||| | ACGTCCC-TT AAAACGTGTA AAGAAATCCG ACCCAGACTG GGATTACGCA ACCTGTGATG 272 GGCCGTCGTG CCTGCGACGG TCCGTCCTGC AGGTCGTCGC AAAGTTCAGA GACCCAATAT 1629 | |||||||| |||||||||| |||||||||| |||||||||| || ||||||| ||| | | || GCCCGTCGTG CCTGCGACGG TCCGTCCTGC AGGTCGTCGC AAGGTTCAGA GACTC-A-AT 330 TTCCACC-AA GGGTCTGTGA TGGTCCGTCA CACCTGTGAC GGTCCGTCCT GCCATTCCGT 1570 ||||||| || | |||||||| ||||||||| | || ||||| |||||||| | |||||||||| TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC GGTCCGTCGT GCCATTCCGT 390 CACGAAGTTC AGAGAGTTGA TTTTCAGTAC CCAATTTTAG ATTTTCTAAG TGTTTTGAAA 1510 ||||||||| ||||||| || |||| ||||| ||||||| || | |||||||| |||||||||| TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG AATTTCTAAG TGTTTTGAAA 450 CGAGACCCTG CGACGGTCTG TCGTGCCCAT GACGGTCCGT CGTTGGGTTC GTCGCCTCAG 1450 |||||| | |||||||| |||||| ||| |||||||||| ||| || | | |||| |||| CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT CGTGGGTTCC GTCGTCTCAA 510 CCTGTTTTTC CAGAAATAAA ATCCGCTGCT CAAAACGACT AAACAGGTC 1401 |||||||||| || ||||||| ||| ||| || |||||||||| ||||||||| CCTGTTTTTC CAAAAATAAA ATCTGCTACT CAAAACGACT AAACAGGTC 559 hqPGS_C06HBa0153O03.1-2-_SGN-E550322+ (1928 1401) ******************************************************************************** EST sequence 73 +strand 720 n (File: SGN-E389834+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTA CGTCGACTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAGAACGAC 541 TAAACAGGAC GTTACATTTA TGATCGTCCT ACTTAAATAT CATTATTATT TTACGATTTA 601 TAACACTATT AGAAACGAAG ATTTTCTCGA CCATGAATTA ATGAAAAAAT ATGCCATGAA 661 ATATAAAAAT TTACTCGTTC TTCATTGAGC TATTCGTGAA AAAAAAAAAA AAATCGAGGG Predicted gene structure (within gDNA segment 3048 to 1): Exon 1 1950 1404 ( 547 n); cDNA 2 547 ( 546 n); score: 0.858 PPA cDNA 699 714 MATCH C06HBa0153O03.1-2- SGN-E389834+ 0.858 547 0.760 C PGS_C06HBa0153O03.1-2-_SGN-E389834+ (1950 1404) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 1893 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGGCAAT AATACCCCAC TAATTAACTT 1833 |||||||||| || ||||||| |||||||||| |||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATGG-AAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 1773 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 1713 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CACAACCTGT GACGGGCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 1653 ||||||||| | |||||||| || || |||| |||||||||| |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGATGGTCC GTCACACCTG 1594 |||||| ||| ||||||| | | |||||||| | ||| |||| |||| ||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TTGATTTTCA GTACCCAATT 1534 |||||||||| || ||||||| |||| ||||| |||||||||| | |||||| | |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCTGTCGTGC CCATGACGGT 1474 | ||| |||| |||||||||| |||||||||| | |||||| || |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCCGC TGCTCAAAAC 1414 ||||||| || | ||||| | ||| |||||| |||||| ||| ||||||| || | |||| ||| CCGTCGTGGG TTACGTCGAC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAGAAC 537 GACTAAACAG 1404 |||||||||| GACTAAACAG 547 hqPGS_C06HBa0153O03.1-2-_SGN-E389834+ (1950 1404) ******************************************************************************** EST sequence 109 +strand 618 n (File: SGN-E396054+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAA Predicted gene structure (within gDNA segment 3048 to 1): Exon 1 1950 1404 ( 547 n); cDNA 2 547 ( 546 n); score: 0.860 MATCH C06HBa0153O03.1-2- SGN-E396054+ 0.860 547 0.885 C PGS_C06HBa0153O03.1-2-_SGN-E396054+ (1950 1404) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 1893 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGGCAAT AATACCCCAC TAATTAACTT 1833 |||||||||| || ||||||| |||||||||| |||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATGG-AAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 1773 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 1713 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CACAACCTGT GACGGGCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 1653 ||||||||| | |||||||| || || |||| |||||||||| |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGATGGTCC GTCACACCTG 1594 |||||| ||| ||||||| | | |||||||| | ||| |||| |||| ||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TTGATTTTCA GTACCCAATT 1534 |||||||||| || ||||||| |||| ||||| |||||||||| | |||||| | |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCTGTCGTGC CCATGACGGT 1474 | ||| |||| |||||||||| |||||||||| | |||||| || |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCCGC TGCTCAAAAC 1414 ||||||| || | ||||| | ||| |||||| |||||| ||| ||||||| || | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG 1404 |||||||||| GACTAAACAG 547 hqPGS_C06HBa0153O03.1-2-_SGN-E396054+ (1950 1404) ******************************************************************************** EST sequence 113 +strand 610 n (File: SGN-E396058+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTGTTT CCAAAAATAA AATCTGCTAC TCACAACGAC 541 TAAACAGGTC GTTACATTTA GGTTCTTCAT AGTTAACTAT TATTATTATT TTACGATTTA 601 TAACACTATT Predicted gene structure (within gDNA segment 3048 to 1): Exon 1 1950 1404 ( 547 n); cDNA 2 547 ( 546 n); score: 0.856 MATCH C06HBa0153O03.1-2- SGN-E396058+ 0.856 547 0.897 C PGS_C06HBa0153O03.1-2-_SGN-E396058+ (1950 1404) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 1893 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGGCAAT AATACCCCAC TAATTAACTT 1833 |||||||||| || ||||||| |||||||||| |||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATGG-AAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 1773 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 1713 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CACAACCTGT GACGGGCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 1653 ||||||||| | |||||||| || || |||| |||||||||| |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGATGGTCC GTCACACCTG 1594 |||||| ||| ||||||| | | |||||||| | ||| |||| |||| ||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TTGATTTTCA GTACCCAATT 1534 |||||||||| || ||||||| |||| ||||| |||||||||| | |||||| | |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCTGTCGTGC CCATGACGGT 1474 | ||| |||| |||||||||| |||||||||| | |||||| || |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCCGC TGCTCAAAAC 1414 ||||||| || | ||||| | ||| ||||| |||||| ||| ||||||| || | |||| ||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTG TTTCCAAAAA TAAAATCTGC TACTCACAAC 537 GACTAAACAG 1404 |||||||||| GACTAAACAG 547 hqPGS_C06HBa0153O03.1-2-_SGN-E396058+ (1950 1404) ******************************************************************************** EST sequence 139 +strand 545 n (File: SGN-E241959+) 1 TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT 481 CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT CAAAACGACT 541 AAACA Predicted gene structure (within gDNA segment 3038 to 1): Exon 1 1950 1405 ( 546 n); cDNA 1 545 ( 545 n); score: 0.860 MATCH C06HBa0153O03.1-2- SGN-E241959+ 0.860 546 1.002 C PGS_C06HBa0153O03.1-2-_SGN-E241959+ (1950 1405) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 1893 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 60 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGGCAAT AATACCCCAC TAATTAACTT 1833 |||||||||| || ||||||| |||||||||| |||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATGG-AAT AATAACCCAC TAATTTACTC 119 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 1773 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 179 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 1713 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 238 ACTGGGATTG CACAACCTGT GACGGGCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 1653 ||||||||| | |||||||| || || |||| |||||||||| |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 298 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGATGGTCC GTCACACCTG 1594 |||||| ||| ||||||| | | |||||||| | ||| |||| |||| ||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 356 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TTGATTTTCA GTACCCAATT 1534 |||||||||| || ||||||| |||| ||||| |||||||||| | |||||| | |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 416 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCTGTCGTGC CCATGACGGT 1474 | ||| |||| |||||||||| |||||||||| | |||||| || |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 476 CCGTCGTTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCCGC TGCTCAAAAC 1414 ||||||| || | ||||| | ||| |||||| |||||| ||| ||||||| || | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 536 GACTAAACA 1405 ||||||||| GACTAAACA 545 hqPGS_C06HBa0153O03.1-2-_SGN-E241959+ (1950 1405) ******************************************************************************** EST sequence 2 -strand 774 n (File: SGN-E349977-) 1 AGTAGATATC ATCGCTAACT CAAAATAGGG AACAATATAT ATCAATAATA ATGTAAATCA 61 ACTACAATAC TCATCATGTA GCAATAGCAA TTTCTTNATC ATTAACAATT ACCGTCAAGT 121 TCACACATGA GGACTCAAGC CTCAATACCA TACTCATTTG GGAATTAAGT TCATTAGATT 181 GAGTATATTC ATTATCTTTC AAGATTCATT ATCTTTCTTC CTCTTGTGTC GGTACGTGAC 241 ACTCCGCTCC TCTATTTCTA TCCTGGTGCC GGAACGTGGC ACTCCGATCC TCATATTCAT 301 TCTATCCTGG TACCGGAACG TGGCACCCGA TCCTCATATT CTATCCTGGT GTCGGAACGT 361 AACACTCCGA TCCTCATATT CATTCTATCC TGGTACCGGA ACGTGGCACC CGATCCTCAT 421 ATTCTATCCT GGTGTCGGAA CGTGACACTC CGATCCTCAT ATTCTATCCT GGTGTCGGAA 481 CGTGACACTC CGATCCTCAT ATTCATTCTA TCCTGGTACC GAAACGTGGC ACCCGATCCC 541 CTAATTCATC AAGCCTTCTT CTACACTAAG GCATCATCAT TCTCATTATA TAATTTATCA 601 AGCCTTCTCT CATACTAAGG CCTCATCAAT CTTATTATAT AATATATCAA GTGAATTAGG 661 GTTCTTTCAA GATTTGGGAT TCAATAGCTT CATCATGCTT TGTTAATTCA TAACAATTTC 721 ATAATCATAA TCATGCAAGC ATACCAATAA GCATATAGAC AGGTTTACAA CATC Predicted gene structure (within gDNA segment 5165 to 1): Exon 1 2793 2494 ( 300 n); cDNA 1 297 ( 297 n); score: 0.757 Intron 1 2493 2460 ( 34 n); Pd: 0.000 (s: 0.84), Pa: 0.000 (s: 0.85) Exon 2 2459 2258 ( 202 n); cDNA 298 512 ( 215 n); score: 0.735 MATCH C06HBa0153O03.1-2- SGN-E349977- 0.748 502 0.649 C PGS_C06HBa0153O03.1-2-_SGN-E349977- (2793 2494,2459 2258) Alignment (genomic DNA sequence = upper lines): AGTAGATATC ATCGGCCAAC TCAAAATAGA AAACAGTATG TATTAAGCAA TATCATAAAA 2734 |||||||||| ||| || ||| ||||||||| |||| ||| ||| || || || | ||| AGTAGATATC ATC-GCTAAC TCAAAATAGG GAACAATATA TATCAA-TAA TAATGT-AAA 57 TCAATTAATA TCCTTAGCAT GCAGCATTTA CAGTTACCAT AACCCTTGGT TA-CAACACC 2675 |||| || | | || | ||| | |||| | || || | | | | || | | | | TCAACTACAA TACTCATCAT GTAGCAATAG CAATT-TCTT NATCATTAAC AATTACCGTC 116 AAGCACATCA ATGAGGACTC ACACCTCCTC ATCACACTCA TTTGGGAATT TAGTTCATTA 2615 ||| || |||||||||| | |||| | || ||||| |||||||||| ||||||||| AAGTTCACAC ATGAGGACTC AAGCCTCAAT ACCATACTCA TTTGGGAATT AAGTTCATTA 176 GATTGGATAT ATT-AACATA TTTCAAGATT CATTATCTTT ATTCTCCTCG TGTCGGTACG 2556 ||||| ||| ||| | || |||||||||| |||||||||| ||| || | |||||||||| GATTGAGTAT ATTCATTATC TTTCAAGATT CATTATCTTT CTTCCTCTTG TGTCGGTACG 236 TGACACTCCG CTCCTCAATA TACTATCCTG GTGTCGGAAC GTGACACTCT GATCCTCATT 2496 |||||||||| |||||| || | |||||||| ||| |||||| ||| ||||| ||||||||| TGACACTCCG CTCCTCTAT- TTCTATCCTG GTGCCGGAAC GTGGCACTCC GATCCTCATA 295 CTATCCTGGT GTCGGAACGT GACACTCCGA TCCTCATATA CTATCCTGGT ACCGGAACGT 2436 | || |||||||||| |||||||||| TT........ .......... .......... ......CATT CTATCCTGGT ACCGGAACGT 321 GGCACCCGAT -C-CATATTC TATCCTGGTG TCAGAACGTG ACAC-CCGAT -C-CATA-TC 2382 |||||||||| | ||||||| |||||||||| || |||||| |||| ||||| | |||| || GGCACCCGAT CCTCATATTC TATCCTGGTG TCGGAACGTA ACACTCCGAT CCTCATATTC 381 ---CTATCCT GGTACCGGAA CGTGGCACCC GAT-C-CATA TTCTATCTTG GTGTCGGAAC 2327 ||||||| |||||||||| |||||||||| ||| | |||| ||||||| || |||||||||| ATTCTATCCT GGTACCGGAA CGTGGCACCC GATCCTCATA TTCTATCCTG GTGTCGGAAC 441 GTGACAC-CC GAT-CT-ATA TTCTATCCTG GTACCGGAAC GTGGCAC-CC GATCCCCTAA 2271 ||||||| || ||| || ||| |||||||||| || |||||| ||| ||| || ||||| | | GTGACACTCC GATCCTCATA TTCTATCCTG GTGTCGGAAC GTGACACTCC GATCCTCATA 501 TCTCACCACT TTC 2258 | ||| || || T-TCA-TTCT ATC 512 hqPGS_C06HBa0153O03.1-2-_SGN-E349977- (2793 2494,2459 2258) ******************************************************************************** EST sequence 135 +strand 730 n (File: SGN-E546506+) 1 TTTTTTTTTT TTTTTTTTAA TAAAAACAAT TCAATACTAT TATTATTATC CCCAAAATCT 61 GGAAGTCATC ATCACAAGAA CATCTATCTC AAATTACTTA ACTAGGAATG TCTAAGAACA 121 AAATAACTAA AAAGCTAGTC CATGCCGGAA ATTCAAGGCA TCAAGACTTG AAGAAGAAGA 181 CCCAGTCCAA GCTAGACGCA TTAGCTCACC CTGAATTTTC CGATGAAGTG AAGACTGGCT 241 AGATCTACTG TTGAGTTGAA GTTGACGGAA CGTTTGCTGC ATTACACAAA TAACAAAGAG 301 GAAAACATGA AAGTAGGGGT CAGTACAACC ACACGTACTG AGTAGATATC ATCGGCCAAC 361 TCAAAATAGG GAACAGTATA TATCAATAAT AATGTAAATC AACTACAATA CTCAACATGT 421 AGCAATAACA CCATGAATTC ATCAATAACT ACAACCGAGT TCACACATGA GGACTCAAGC 481 CTCAATACCA TACTCATTTG GGAATTAAGT TCATTAGATT GAGTATATTC ATTATCTTTC 541 AAGATTCATT ATCTTTCTTC CTCTTGTGTC GGTACGTGAC ACTCCGATCC TCTATTTCTA 601 TCCTGGTGCC GGAACGTGGC ACTCCGATCC TCATTCTATC CTGGTACCGG AACGTGGCAC 661 CCGATCCATT TTCTATCCTG GTGTCGGAAC GTGACACTCC GATCCTCATA TTCTATCCTG 721 GTACCGGAAC Predicted gene structure (within gDNA segment 4177 to 1606): Exon 1 3087 2537 ( 551 n); cDNA 50 595 ( 546 n); score: 0.808 Intron 1 2536 2498 ( 39 n); Pd: 0.000 (s: 0.88), Pa: 0.000 (s: 0.92) Exon 2 2497 2387 ( 111 n); cDNA 596 705 ( 110 n); score: 0.923 Intron 2 2386 2316 ( 71 n); Pd: 0.900 (s: 0.91), Pa: 0.000 (s: 0) Exon 3 2315 2290 ( 26 n); cDNA 706 730 ( 25 n); score: 0.962 PPA cDNA 18 1 MATCH C06HBa0153O03.1-2- SGN-E546506+ 0.827 688 0.942 C PGS_C06HBa0153O03.1-2-_SGN-E546506+ (3087 2537,2497 2387,2315 2290) Alignment (genomic DNA sequence = upper lines): CCCCAAAATC TGGAAGTCAT CATCACAAGA ACATCTACGA TCAAATGACT AAACTAAGAG 3028 |||||||||| |||||||||| |||||||||| ||||||| |||||| ||| ||||| || CCCCAAAATC TGGAAGTCAT CATCACAAGA ACATCTA-TC TCAAATTACT TAACTAGGAA 108 TATTCTAAAA GCTAAAAATA CATAAGAAGC TAGTCCATGC CGGAAGTTCA AGGCATCAAG 2968 | ||||| | | |||||| ||| |||| |||||||||| ||||| |||| |||||||||| T-GTCTAAGA AC--AAAATA ACTAAAAAGC TAGTCCATGC CGGAAATTCA AGGCATCAAG 165 ACTTGAAGAA GAAGACCCAG TCCAAGCTAG AAGCATTAGC TCACCCTGAA -TATCCGGTA 2909 |||||||||| |||||||||| |||||||||| | |||||||| |||||||||| | |||| | ACTTGAAGAA GAAGACCCAG TCCAAGCTAG ACGCATTAGC TCACCCTGAA TTTTCCGATG 225 TGACGAAGAC TGGCTAGAAT CACTGCTGAG TTGAAGATGA CGGAACGTTT GCTGCACTCC 2849 |||||| |||||||| |||| |||| |||||| ||| |||||||||| |||||| | | AAGTGAAGAC TGGCTAGATC TACTGTTGAG TTGAAGTTGA CGGAACGTTT GCTGCATTAC 285 ACAAATAAC- AAGAAGAAAA CATAAAAGTA GGGGTCAGTA CAAAACACGG GTACTGAGTA 2790 ||||||||| |||| ||||| ||| |||||| |||||||||| | || ||| |||||||||| ACAAATAACA AAGAGGAAAA CATGAAAGTA GGGGTCAGTA C-AACCACAC GTACTGAGTA 344 GATATCATCG GCCAACTCAA AATAGAAAAC AGTATGTATT AAGCAATATC ATAAAATCAA 2730 |||||||||| |||||||||| ||||| ||| ||||| ||| || |||| | ||||||| GATATCATCG GCCAACTCAA AATAGGGAAC AGTATATATC AA-TAATAAT GT-AAATCAA 402 TTAATATCCT TAGCATGCAG CATTTACAGT TACCATAACC CTTGGTTACA AC-ACCAAGC 2671 || || || | |||| || || | ||| | | | | | | || || ||| || CTACAATACT CAACATGTAG CAATAACA-C CATGA-ATTC ATCAATAACT ACAACCGAGT 460 ACATCAATGA GGACTCACAC CTCCTCATCA CACTCATTTG GGAATTTAGT TCATTAGATT 2611 || |||| ||||||| | ||| | || ||||||||| |||||| ||| |||||||||| TCACACATGA GGACTCAAGC CTCAATACCA TACTCATTTG GGAATTAAGT TCATTAGATT 520 GGATATATT- AACATATTTC AAGATTCATT ATCTTTATTC TCCTCGTGTC GGTACGTGAC 2552 | |||||| | || |||| |||||||||| |||||| ||| || ||||| |||||||||| GAGTATATTC ATTATCTTTC AAGATTCATT ATCTTTCTTC CTCTTGTGTC GGTACGTGAC 580 ACTCCGCTCC TCAATATACT ATCCTGGTGT CGGAACGTGA CACTCTGATC CTCATTCTAT 2492 |||||| ||| || || |||||| ACTCCGATCC TCTAT..... .......... .......... .......... ....TTCTAT 601 CCTGGTGTCG GAACGTGACA CTCCGATCCT CATATACTAT CCTGGTACCG GAACGTGGCA 2432 ||||||| || ||||||| || |||||||||| ||| | |||| |||||||||| |||||||||| CCTGGTGCCG GAACGTGGCA CTCCGATCCT CAT-T-CTAT CCTGGTACCG GAACGTGGCA 659 CCCGATCCAT ATTCTATCCT GGTGTCAGAA CGTGACAC-C CGATCCATAT CCTATCCTGG 2373 |||||||||| ||||||||| |||||| ||| |||||||| | |||||| CCCGATCCAT TTTCTATCCT GGTGTCGGAA CGTGACACTC CGATCC.... .......... 705 TACCGGAACG TGGCACCCGA TCCATATTCT ATCTTGGTGT CGGAACGTGA CACCCGATCT 2313 || .......... .......... .......... .......... .......... .......TC- 707 ATATTCTATC CTGGTACCGG AAC 2290 |||||||||| |||||||||| ||| ATATTCTATC CTGGTACCGG AAC 730 hqPGS_C06HBa0153O03.1-2-_SGN-E546506+ (3087 2537,2497 2387,2315 2290) ******************************************************************************** EST sequence 44 -strand 660 n (File: SGN-E349296-) 1 AATATTATCA ATACATATTA TTCGCTATTA AGAGCTTACT ACGAATATCG TAAGAGAAAC 61 CATAACCTAC CTCCACCGAA GATTCGTGAT CAAGCAAGTG ATTTCCCAAG CTTTGTGTTT 121 TTTCCTCTCG TTCGATCCTC TTTCTCGTTC GACTTTCTCT CTCTTTCTCT TGTTCTTTCT 181 ATTTTCTTTA TTCAAACCCT CTTTCTTTTA CCCTAATTAG TATATAATTA AGAATAAAAT 241 ATGGCAATAA TAACCCACTA ATTAACTTAA GGTTACCTCT TTTAACCCCC AAGTAATTAG 301 ACTTATTAAC ATTAACCCAC TAACTTTATA ATTAAAGCAG GAATAGTCAA AAACGTCCCT 361 TAAAACAATT GAGGAATTCC GACTCAGACT GGGATTTACG CAGCCTGTGA CAGCCCGTTG 421 TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC GCAAGGTTCA GAGACTGGAT TTTCACTGAA 481 GACTCTGTGA TGGTCCATCA CGCCTGTGAC GGTCCGTCTT GCCATTCCGT TACGAAGTTC 541 AGAGAGTCGA TTTTCAGTAC CCAATTTCAG ATTTCCTAAG TGTTTTGAAA TGAGACCCTG 601 CGACGGTCCG TCGTGCCCAT GATGGTCCGT CGTGGGGTCC GTCATTTCTG CCAGTTTTTC Predicted gene structure (within gDNA segment 4230 to 3): Exon 1 2095 1440 ( 656 n); cDNA 1 660 ( 660 n); score: 0.829 MATCH C06HBa0153O03.1-2- SGN-E349296- 0.829 656 0.994 C PGS_C06HBa0153O03.1-2-_SGN-E349296- (2095 1440) Alignment (genomic DNA sequence = upper lines): AATACTACTA ACACATATCA TTCGCTATTA AGAGTTTGCT ACGAATAGTA TGA-AATAAC 2037 |||| || | | |||||| | |||||||||| |||| || || ||||||| | | | ||| AATATTATCA ATACATATTA TTCGCTATTA AGAGCTTACT ACGAATATCG TAAGAGAAAC 60 CATAACCTAC CTCCACTGAA GATTAGTGAT TAAGCAAG-A AATTCCCAAG GCTTTTGTTC 1978 |||||||||| |||||| ||| |||| ||||| ||||||| | |||||||| || |||| CATAACCTAC CTCCACCGAA GATTCGTGAT CAAGCAAGTG ATTTCCCAAG CTTTGTGTTT 120 CTTCTTCTCG TTCGATCCTC CCTC-AATTC GTTTCTCTTT CCCTCTCT-T TGTTCTTTCT 1920 ||| ||||| |||||||||| || ||| | | ||| | | || ||| | |||||||||| TTTCCTCTCG TTCGATCCTC TTTCTCGTTC GACTTTCTCT CTCTTTCTCT TGTTCTTTCT 180 ATTTTC-TTA TTCCAACCCT CTTTCTTTTA CCCTAATTAG TATATAATTA AGAATAAAAG 1861 |||||| ||| ||| |||||| |||||||||| |||||||||| |||||||||| ||||||||| ATTTTCTTTA TTCAAACCCT CTTTCTTTTA CCCTAATTAG TATATAATTA AGAATAAAAT 240 ATGGCAATAA TACCCCACTA ATTAACTTAA GGTTACCTCT TTTAACCCCC AAGGATTTTG 1801 |||||||||| || ||||||| |||||||||| |||||||||| |||||||||| ||| | || | ATGGCAATAA TAACCCACTA ATTAACTTAA GGTTACCTCT TTTAACCCCC AAGTAATTAG 300 AGTTATTAAT ATAAACCCAT GAAATATATA ATCATAGCAG GAATAGTCCA AAACGCCCCT 1741 | ||||||| || |||||| || | |||| || | ||||| |||||||| | ||||| ||| ACTTATTAAC ATTAACCCAC TAACTTTATA ATTAAAGCAG GAATAGTCAA AAACGTCCC- 359 TTAAAACTTA ACCAGAAATC TGACTCCAAC TGGGA-TTGC ACAACCTGTG ACGGGCCGTC 1682 ||||||| ||| || ||||| || ||||| || | || |||||| || | |||| TTAAAACAAT TGAGGAATTC CGACTCAGAC TGGGATTTAC GCAGCCTGTG ACAGCCCGTT 419 GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT CGCAAAGTTC AGAGACCCAA TATTTCCACC 1622 |||||||||| |||||||||| |||||||||| ||||| |||| |||||| | | |||| GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT CGCAAGGTTC AGAGACTGGA T-TTTCACTG 478 AAGGGTCTGT GATGGTCCGT CACACCTGTG ACGGTCCGTC CTGCCATTCC GTCACGAAGT 1562 ||| ||||| |||||||| | ||| |||||| |||||||||| ||||||||| || ||||||| AAGACTCTGT GATGGTCCAT CACGCCTGTG ACGGTCCGTC TTGCCATTCC GTTACGAAGT 538 TCAGAGAGTT GATTTTCAGT ACCCAATTTT AGATTTTCTA AGTGTTTTGA AACGAGACCC 1502 ||||||||| |||||||||| ||||||||| |||||| ||| |||||||||| || ||||||| TCAGAGAGTC GATTTTCAGT ACCCAATTTC AGATTTCCTA AGTGTTTTGA AATGAGACCC 598 TGCGACGGTC TGTCGTGCCC ATGACGGTCC GTCGTTGGGT TCGTCGCCTC AGCCTGTTTT 1442 |||||||||| ||||||||| |||| ||||| ||||| |||| |||| || ||| ||||| TGCGACGGTC CGTCGTGCCC ATGATGGTCC GTCGTGGGGT CCGTCATTTC TGCCAGTTTT 658 TC 1440 || TC 660 hqPGS_C06HBa0153O03.1-2-_SGN-E349296- (2095 1440) ******************************************************************************** EST sequence 117 +strand 472 n (File: SGN-E236652+) 1 TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GA Predicted gene structure (within gDNA segment 3038 to 1): Exon 1 1950 1478 ( 473 n); cDNA 1 472 ( 472 n); score: 0.855 MATCH C06HBa0153O03.1-2- SGN-E236652+ 0.855 473 1.002 C PGS_C06HBa0153O03.1-2-_SGN-E236652+ (1950 1478) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 1893 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 60 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGGCAAT AATACCCCAC TAATTAACTT 1833 |||||||||| || ||||||| |||||||||| |||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATGG-AAT AATAACCCAC TAATTTACTC 119 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 1773 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 179 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 1713 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 238 ACTGGGATTG CACAACCTGT GACGGGCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 1653 ||||||||| | |||||||| || || |||| |||||||||| |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 298 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGATGGTCC GTCACACCTG 1594 |||||| ||| ||||||| | | |||||||| | ||| |||| |||| ||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 356 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TTGATTTTCA GTACCCAATT 1534 |||||||||| || ||||||| |||| ||||| |||||||||| | |||||| | |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 416 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCTGTCGTGC CCATGA 1478 | ||| |||| |||||||||| |||||||||| | |||||| || |||||| ||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGA 472 hqPGS_C06HBa0153O03.1-2-_SGN-E236652+ (1950 1478) ******************************************************************************** EST sequence 115 +strand 454 n (File: SGN-E396070+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCGGGGGGGG 421 GAGTTTCTAA TTGTTTTGAA ACTAGACTCC TCGA Predicted gene structure (within gDNA segment 3048 to 1): Exon 1 1950 1497 ( 454 n); cDNA 2 454 ( 453 n); score: 0.836 MATCH C06HBa0153O03.1-2- SGN-E396070+ 0.836 454 1.000 C PGS_C06HBa0153O03.1-2-_SGN-E396070+ (1950 1497) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 1893 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGGCAAT AATACCCCAC TAATTAACTT 1833 |||||||||| || ||||||| |||||||||| |||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATGG-AAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 1773 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 1713 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CACAACCTGT GACGGGCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 1653 ||||||||| | |||||||| || || |||| |||||||||| |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGATGGTCC GTCACACCTG 1594 |||||| ||| ||||||| | | |||||||| | ||| |||| |||| ||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TTGATTTTCA GTACCCAATT 1534 |||||||||| || ||||||| |||| ||||| |||||||||| | |||||| | ||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCGGGGG 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGA 1497 || |||| ||| |||||| ||||| |||| | ||| GGGGAGTTTC TAATTGTTTT GAAACTAGAC TCCTCGA 454 hqPGS_C06HBa0153O03.1-2-_SGN-E396070+ (1950 1497) ******************************************************************************** EST sequence 37 -strand 548 n (File: SGN-E356257-) 1 GTTAACTAGA AAATTAAAGT GATAGAGTCA AATAATGTAA CGACCCGTTT AGTCGTTTTG 61 AGCAGCAGAC TTTATTTCTG GAAAAACTGG CAGAAGCGAC GGACCCCACG ACGGACCGTC 121 ATGGGCACGA CGGACCATCG CAGGGTCTCG TTTCAAAACC CTCTTTCTTT TACCCCAAAT 181 TAACATATAA TTAAGAATAA AAGATGGCAA TAATACCCCA CTAATTAACT TAGGGTTACC 241 TCTTTTAACC CCAAGAATTT GAGTTATTAA TATAAACCCA CGAAATCTAT AATTAAGGAA 301 AGAATAGTCC AAAAACGTCC CTTAAAACGT GTAAGGAAAT CCGATTCTGC CTGGGATTTG 361 CGCAACCTGT GACGGGCCGT CGTGACTGTG ACGGTCCGTC CTGCAGGTCG TCGCAAGGGT 421 CAGAGAGTCA ATTTCCACTG AACAATCTAT GACGGTCCGT CACGCCTGTG ATGGTCCGTC 481 CTGTCATTCC GTCACGAAGT TCAGAGAGTC GATTTTCAGT ACCCAATTTC AGATTTTCTA 541 AGTGTTTT Predicted gene structure (within gDNA segment 4076 to 914): Exon 1 1906 1514 ( 393 n); cDNA 157 548 ( 392 n); score: 0.855 MATCH C06HBa0153O03.1-2- SGN-E356257- 0.855 393 0.717 C PGS_C06HBa0153O03.1-2-_SGN-E356257- (1906 1514) Alignment (genomic DNA sequence = upper lines): AACCCTCTTT CTTTTACCC- TAATTAGTAT ATAATTAAGA ATAAAAGATG GCAATAATAC 1848 |||||||||| ||||||||| ||||| || |||||||||| |||||||||| |||||||||| AACCCTCTTT CTTTTACCCC AAATTAACAT ATAATTAAGA ATAAAAGATG GCAATAATAC 216 CCCACTAATT AACTTAAGGT TACCTCTTTT AACCCCCAAG GATTTTGAGT TATTAATATA 1788 |||||||||| |||||| ||| |||||||||| || |||||| || ||||||| |||||||||| CCCACTAATT AACTTAGGGT TACCTCTTTT AA-CCCCAA- GAATTTGAGT TATTAATATA 274 AACCCATGAA ATATATAATC ATAGCAGGAA TAGTCC-AAA ACGCCCCTTT AAAACTTAAC 1729 |||||| ||| || |||||| | | | ||| |||||| ||| ||| ||| || ||||| | AACCCACGAA ATCTATAATT AAGGAAAGAA TAGTCCAAAA ACGTCCC-TT AAAACGTGTA 333 CAGAAATCTG ACTCCAACTG GGA-TTGCAC AACCTGTGAC GGGCCGTCGT GCCTGCGACG 1670 |||||| | | || ||| ||| |||| | |||||||||| |||||||||| | ||| |||| AGGAAATCCG ATTCTGCCTG GGATTTGCGC AACCTGTGAC GGGCCGTCGT GACTGTGACG 393 GTCCGTCCTG CAGGTCGTCG CAAAGTTCAG AGACCCAATA TTTCCAC-CA AGGGTCTGTG 1611 |||||||||| |||||||||| ||| | |||| ||| | | | ||||||| | | ||| || GTCCGTCCTG CAGGTCGTCG CAAGGGTCAG AGAGTC-A-A TTTCCACTGA ACAATCTATG 451 ATGGTCCGTC ACACCTGTGA CGGTCCGTCC TGCCATTCCG TCACGAAGTT CAGAGAGTTG 1551 | |||||||| || ||||||| ||||||||| || ||||||| |||||||||| |||||||| | ACGGTCCGTC ACGCCTGTGA TGGTCCGTCC TGTCATTCCG TCACGAAGTT CAGAGAGTCG 511 ATTTTCAGTA CCCAATTTTA GATTTTCTAA GTGTTTT 1514 |||||||||| |||||||| | |||||||||| ||||||| ATTTTCAGTA CCCAATTTCA GATTTTCTAA GTGTTTT 548 hqPGS_C06HBa0153O03.1-2-_SGN-E356257- (1906 1514) ******************************************************************************** EST sequence 42 -strand 725 n (File: SGN-E546548-) 1 GGTACCGGAA CGTGGCACCC GATCCATATT CTATCCTGGT GTCGGAACGT GACACTCCGA 61 TCCTCATATT CATTCTATCC TGGTACCGGA ACGTGGCACC CGATCCCCTA ATCCATCAAG 121 CCTTCTTTTA CACTAAGGCA TCATCATTCT CATTATATAA TTTATCAAGC CTTCTTTCAT 181 ACTAAGGCAT CATCATTCTC ATTATATAAT ATATCAAGCG AATTAGGGTT CTTTCAAGAT 241 TTGGGATTCA ATTGCTTCAT CATGCTTTGT TAATTCATCG CAATTTCATA ATCATAATCA 301 TGCAAGCATA CAACTTAAGC ACATAGCAGG GTTTACAATA CTATCAACAC ATAATATTCA 361 CTATTAAGAG TTCACTACGA ATATCGTAAC ATAAACCATA ACCTACCTCC ACCGAAGAAT 421 TGAATCAACA AGCTATCTTC TCAAAATCCT TGCTATCCTC TTCGTTTCTC TCTCTCTACT 481 CGTTCGTTTC TCCTCTCTTT CTGTTCTTTT CTTTTGTTTT GTTTTATTCA AACCCTCCTT 541 CTTTTTACCC TAATTAAAAG TATAATTAAG TGTAAAGGAG GACAATAAAA CCCACTAATT 601 AACTTAAGGT TACCTCTTTT AACCCCCAAG TAATTAGACC TATTAATATT AACCCTCAAT 661 CTTTATAATT AAGGAAAGAA TAGTCCAAAA CGACCCCTAA AACGTGTAGA GGAATCCTAT 721 TTTGC Predicted gene structure (within gDNA segment 3378 to 1): Exon 1 2526 2411 ( 116 n); cDNA 1 120 ( 120 n); score: 0.789 Intron 1 2410 2349 ( 62 n); Pd: 0.000 (s: 0.67), Pa: 0.000 (s: 0.49) Exon 2 2348 2300 ( 49 n); cDNA 121 169 ( 49 n); score: 0.490 Intron 2 2299 2263 ( 37 n); Pd: 0.000 (s: 0.49), Pa: 0.000 (s: 0.50) Exon 3 2262 1747 ( 516 n); cDNA 170 691 ( 522 n); score: 0.696 MATCH C06HBa0153O03.1-2- SGN-E546548- 0.713 681 0.939 C PGS_C06HBa0153O03.1-2-_SGN-E546548- (2526 2411,2348 2300,2262 1747) Alignment (genomic DNA sequence = upper lines): GGTGTCGGAA CGTGACACTC TGATCCTCAT TCTATCCTGG TGTCGGAACG TGACACTCCG 2467 ||| ||||| |||| ||| | ||||| || |||||||||| |||||||||| |||||||||| GGTACCGGAA CGTGGCAC-C CGATCCATAT TCTATCCTGG TGTCGGAACG TGACACTCCG 59 ATCCTCATA- T-A--CTATC CTGGTACCGG AACGTGGCAC CCGATCC-AT ATTCTATCCT 2412 ||||||||| | | ||||| |||||||||| |||||||||| ||||||| | | || ||| ATCCTCATAT TCATTCTATC CTGGTACCGG AACGTGGCAC CCGATCCCCT AATCCATCAA 119 GGTGTCAGAA CGTGACACCC GATCCATATC CTATCCTGGT ACCGGAACGT GGCACCCGAT 2352 | G......... .......... .......... .......... .......... .......... 120 CCATATTCTA TCTTGGTGTC GGAACGTGAC ACCCGATCTA TATTCTATCC TGGTACCGGA 2292 |||| | | | | | | | | | | || || | |||| | ...CCTTCTT TTACACTAAG GCATCATCAT TCTCATTATA TAATTTATCA AG........ 169 ACGTGGCACC CGATCCCCTA ATCTCACCAC TTTCGTTCAT -C-AAGCCTT CTTTTATACC 2234 | ||| ||||| | ||| | | | | || | .......... .......... .........C CTTCTTTCAT ACTAAGGCAT C-ATCATTCT 199 AAGGCATCAT CATTAACAAA GTAGATTAGG GTTTCTTTTC AAGATTTGGG ATTCAATGGC 2174 | || || || | || | |||||| | ||| |||| |||||||||| ||||||| || CA-TTAT-AT AATATATCAA GCGAATTAGG G-TTC-TTTC AAGATTTGGG ATTCAATTGC 255 TTCATCATGC -TTATTTATT CA-C--AATT ACATAATCAC ATCATTCATG CAAGCATACA 2118 |||||||||| || || ||| || | |||| |||||| | || | ||||| |||||||||| TTCATCATGC TTTGTTAATT CATCGCAATT TCATAAT--C AT-AATCATG CAAGCATACA 312 A-TTAAGCAT ATAG-AATGT TTACAATACT ACTAACACAT ATCATTCGCT ATTAAGAGTT 2060 | ||||||| |||| | || |||||||||| | ||||||| | |||| || |||||||||| ACTTAAGCAC ATAGCAGGGT TTACAATACT ATCAACACAT AATATTCACT ATTAAGAGTT 372 TGCTACGAAT AGTATGAAAT -AACCATAAC CTACCTCCAC TGAAG-ATT- AGTGATTAAG 2003 |||||||| | | | || ||||||||| |||||||||| |||| ||| | | | ||| CACTACGAAT ATCGTAACAT AAACCATAAC CTACCTCCAC CGAAGAATTG AATCAACAAG 432 CAAGAAATTC CCAAGGCTTT TGTTCCTTCT TCTCGTTCGA TC-CTC-CCT CAATTCGTTT 1945 | | || || ||| ||| ||| || ||| || ||| || | ||||||| CTATCTTCTC AAAATCCTTG CTATCC-TCT TC-GTTTCTC TCTCTCTACT C-GTTCGTTT 489 CT-CTTTCCC TCTCTT-TGT TC-TTTCTAT TTTCTTATTC CAACCCTCTT TC-TTTTACC 1889 || || || ||| || | | || ||| | | | | |||||| ||||||| | || ||||||| CTCCTCTCTT TCTGTTCTTT TCTTTTGTTT TGTTTTATTC AAACCCTCCT TCTTTTTACC 549 CTAATTAGTA -TATAATTAA GAATAAAAGA TGGCAATAAT ACCCCACTAA TTAACTTAAG 1830 ||||||| | ||||||||| | |||| || | |||||| | |||||||| |||||||||| CTAATTAAAA GTATAATTAA GTGTAAAGGA GGACAATAA- AACCCACTAA TTAACTTAAG 608 GTTACCTCTT TTAACCCCCA AGGATTTTGA GTTATTAATA TAAACCCATG AAATATATAA 1770 |||||||||| |||||||||| || | || || |||||||| | ||||| | | ||||| GTTACCTCTT TTAACCCCCA AGTAATTAGA CCTATTAATA TTAACCCTCA ATCTTTATAA 668 TCATAGCAGG AATAGTCCAA AAC 1747 | | | | | |||||||||| ||| TTAAGGAAAG AATAGTCCAA AAC 691 hqPGS_C06HBa0153O03.1-2-_SGN-E546548- (2526 2411) ******************************************************************************** EST sequence 100 +strand 840 n (File: SGN-E542084+) 1 TTTTTTTTTT TAGGGGAAAA TTTCTTACTT CTATAAATGT CACGACCCAA ATCGGATCGC 61 GACTGGCACC CACACTTACC CTGCTATGTG AGCGAACCAA CCAATCCAAA CCTTAACATT 121 TCAATGTAAT ATCAACATAA AGTAATGCGG AAGACTTAAA CTTATTAATG AAAACCAATT 181 CAATAACTAT TATTTCCCAA AATCTGGAAG TCATCATCAT AAGAACATCT ACTTCAAATT 241 ACTAAATCTA AGAGTTTCTA AGAAGCTAAA AAATACATAA AAGCTAGTCC ATGCCGGAAC 301 TTCAAGACAT CAAGACATGA AGAGGAAGAT CCAGTCCAAT CTAGAAAGCA TTAGCTCACC 361 CTGATATCCG AAGTAATGAA GACTGGCTAG AGTTACTGTT GAGTCGAAGA TGACGGCACG 421 TTTGCTAAAA TCAGTGGACG GAGGAGAAGG GAAAGCACAC CGGGAATGAG AAGAAGCTGA 481 AGGAGGAACC AAAGAGGAAT CCCATTGCAA AGTAAATGAG AGTGTAAGCT AGCAGACGCG 541 ATGGAAGAGC TTACGCAGAA ATAACACTCT CATTTGGTGA TTTAGTTTGG AGATCATCTG 601 AGACCTTCGT GTTGGACAAC ATCATCCATG AAGATGTCAT TAGAAAAGTT AGATGCTTTA 661 TATACATGTT GATAGTTCCT GACTACTCTA TTTCTTTTTC AGAAAGCCCC GAAATTTCTC 721 AGATGATAAA TGCTGTCTGT TTTGGAAAAC CATCTCTATG CAAAGATGAT GTTTGCTGCA 781 TTGAGGTGTC AATATTGGGA ATTTCAAGAA AATTATGCCT TGTAGAATAT GTACAGCAAC Predicted gene structure (within gDNA segment 11251 to 2175): Exon 1 10296 10138 ( 159 n); cDNA 32 188 ( 157 n); score: 0.887 Intron 1 10137 3095 (7043 n); Pd: 0.000 (s: 0.90), Pa: 0.000 (s: 0.90) Exon 2 3094 2856 ( 239 n); cDNA 189 426 ( 238 n); score: 0.854 PPA cDNA 11 1 MATCH C06HBa0153O03.1-2- SGN-E542084+ 0.867 398 0.474 C PGS_C06HBa0153O03.1-2-_SGN-E542084+ (10296 10138,3094 2856) Alignment (genomic DNA sequence = upper lines): TAACTATGTC ACGACCCAAA TCCGGGCCGC GTCTGGCACC CACACTTACC CTCCTATGTG 10237 || ||||| |||||||||| | ||| ||| | |||||||| |||||||||| || ||||||| TATAAATGTC ACGACCCAAA T-CGGATCGC GACTGGCACC CACACTTACC CTGCTATGTG 90 AGCGAACCAA CCAATCTAAA CCTTAACATT TCAATATAAT ATAACCAGAA AGTAATGCGG 10177 |||||||||| |||||| ||| |||||||||| ||||| |||| || | || || |||||||||| AGCGAACCAA CCAATCCAAA CCTTAACATT TCAATGTAAT ATCAACATAA AGTAATGCGG 150 AAGACTTAAA CTCATTAAAT AAAGACCAAT TCATTAACTT CTAAAATTCA ACATCTATTA 10117 |||||||||| || ||| ||| || |||||| ||| ||||| AAGACTTAAA CTTATT-AAT GAAAACCAAT TCAATAACT. .......... .......... 188 TTCCCCCAAA ATCTGGAAGT CATCATCACA AGAACATCTA CGATCAAATG ACTAAACTAA 10057 .......... .......... .......... .......... .......... .......... 188 GAGTATTCTA AAAGCTAAAA ATACATAAGA AGCTAGTCCA TGCCGGAAGT TCAAGGCATC 9997 .......... .......... .......... .......... .......... .......... 188 AAGACTTGAA GAAGAAGACC CAGTCCAAGC TAGAAGCATT AGCTCACCCT GAATATCCGG 9937 .......... .......... .......... .......... .......... .......... 188 TATGACGAAG ACTGGCTAGA ATCACTGCTG AGTTGAAGAT GACGGAACGT TTGCTGCACT 9877 .......... .......... .......... .......... .......... .......... 188 CCACAAATAA CAAGAAGAAA ACATAAAAGT AGGGGTCAGT ACAAAACACG GGTACTGAGT 9817 .......... .......... .......... .......... .......... .......... 188 AGATATCATC GGCCAACTCA AAATAGAAAA CAGTATGTAT TAAGCAATAT CATAAAATCA 9757 .......... .......... .......... .......... .......... .......... 188 ATTAATATCC TTAGCATGCA GCATTTACAG TTACCATAAC CCTTGGTTAC AACACCAAGC 9697 .......... .......... .......... .......... .......... .......... 188 ACATCAATGA GGACTCACAC CTCCTCATCA CACTCATTTG GGAATTTAGT TCATTAGATT 9637 .......... .......... .......... .......... .......... .......... 188 GGATATATTA ACATATTTCA AGATTCATTA TCTTTATTCC CCTCGTGTCG GTACATGACA 9577 .......... .......... .......... .......... .......... .......... 188 CTCCGCTCCT CAATATACTA TCCTGGTGTC GGAACGTGAC ACTCTGATCC TCATTCTATC 9517 .......... .......... .......... .......... .......... .......... 188 CTGGTGTCGG AACGTGACAC TCCGATCCTC ATATACTATC CTGGTACCGG AACGTGGTAC 9457 .......... .......... .......... .......... .......... .......... 188 CCGATCCATA TTCTATCCTG GTGTCAGAAC GTGACACCCG ATCCATATCC TATCCTGGTA 9397 .......... .......... .......... .......... .......... .......... 188 CCGGAACGTG GCACCCGATC AATATTCTAT CTTGGTGTCG GAACGTGACA CCCGATCCAT 9337 .......... .......... .......... .......... .......... .......... 188 ATTCTATCCT GGTACCGAAA CGTGGCACCG GATCCCCTAA TCTCATCACT TTCGTTCATC 9277 .......... .......... .......... .......... .......... .......... 188 AAGCCTTCTT TTATACCAAG GCATCATCAT TAACAAAGTA GATTAGGGTT TCTTTTCAAG 9217 .......... .......... .......... .......... .......... .......... 188 ATTTGGGATT CAATGGCTTC ATCATACTTA TTTATTCACA ATTACATAAT CACATCATTC 9157 .......... .......... .......... .......... .......... .......... 188 ATGCAAGCAT ACAATTAAGC ATATAGAAGG TTTACAATAC TACTAACACA TATCATTCGC 9097 .......... .......... .......... .......... .......... .......... 188 TATTAAGAGT TTGCTACGAA TAGCATGAAA TAACCATAAC CTACCTCCAC TGAAGATTAG 9037 .......... .......... .......... .......... .......... .......... 188 TGATTAAGCA AGAAATTCCC AAGGCTTTTG TTCCTTCTTC TCGTTCGATC CTCCCTCAAT 8977 .......... .......... .......... .......... .......... .......... 188 TCGTTTCTCT TTCCCTCTCT TTGTTCTTTC TATTTTCTTA TTCCAACCCT CTTTCTTTTA 8917 .......... .......... .......... .......... .......... .......... 188 CCCTAATTAG TATATAATTA AGAATAAAAG ATGACAATAA TACCCCACTA ATTAACTTAA 8857 .......... .......... .......... .......... .......... .......... 188 GGTTACCTCT TTTAACCCCC AAGGATTTTG AGTTATTAAT ATAAACCCAT GAAATATATA 8797 .......... .......... .......... .......... .......... .......... 188 ATCATAGCAG GAATAGTCCA AAACGCCCCT TTAAAACTTA ACCAGAAATC TGACTCCAAC 8737 .......... .......... .......... .......... .......... .......... 188 TGGGATTGCG CAACCTGTGA CGGGCCGTCG TGCCTGGGAC GGTCCGTCCT GCAGGTCGTC 8677 .......... .......... .......... .......... .......... .......... 188 GCAAAGTTCA GAGACCCAAT ATTTCCACCA AGGGTCTGTG ACGGTCCGTC ACACCTGTGA 8617 .......... .......... .......... .......... .......... .......... 188 CGGTCCGTCC TGCCATTCCG TCACGAAGTT CAGAGAGTCG ATTTTCTGTA CCCAATTTTA 8557 .......... .......... .......... .......... .......... .......... 188 GATTTTCTAA GTGTTTTGAA ACGAGACCCT GCGACGGTCC GTCGTGCCCA TGACGGTCCG 8497 .......... .......... .......... .......... .......... .......... 188 TCATTGGGTT CGTCGCCTCA GCCTGTTTTT CCAGAAATAA AATCTGCTGC TCAAAACGAC 8437 .......... .......... .......... .......... .......... .......... 188 TAAACAGGTC GTTACAATAG ATACCAATTT ACCCATCGTT CGTCCCCGAA CGATCACAAG 8377 .......... .......... .......... .......... .......... .......... 188 AAGGAAAACA AGGGCGAAAA GGAGTACCTG AATCTGTAAA CAGATGTGGG TATTTTTCTC 8317 .......... .......... .......... .......... .......... .......... 188 GCATATCCGC CTCCTTCTCC CAAGTGGCTT CATCAACGGG TCGATTCTTC CATTGCACCT 8257 .......... .......... .......... .......... .......... .......... 188 TGATGGATGC AATCTCTCTT GACCTCAACT TGCGAACTTC TCTATCTAAA ATAGCAACAG 8197 .......... .......... .......... .......... .......... .......... 188 GCTCCTCCTC ATAAGACAAG TTCTCATCAA GCAAAACTGA ATCCCAACGG ATAATGTAAT 8137 .......... .......... .......... .......... .......... .......... 188 TTCCATTCCC ATGATATCTT TTCAACATAG ACACATGGAA TACCGGATGT ACTCCGGACA 8077 .......... .......... .......... .......... .......... .......... 188 GCCCTGGAGG CAAGGCTAAC TCATAAGCCA CCTCTCCTAC TCGCTTAAGT ACTTCAAATG 8017 .......... .......... .......... .......... .......... .......... 188 GTCCAATGTA CCTTGGACTT AGTTTACCCC TTTTTCCGAA CCGCATCACC CCTTTCATGG 7957 .......... .......... .......... .......... .......... .......... 188 GCGAAACTTT CAACAAGACT TGTTCACCTT CCATGAACTC TAAGTCTCTA ACCTTTCGAT 7897 .......... .......... .......... .......... .......... .......... 188 CTGCATATTC TTTTTGTCTA CTTTGCGACG CTAACAACTT TTCTTGAATA GATTTCACTT 7837 .......... .......... .......... .......... .......... .......... 188 TATCTAACGA TTCTCTCAAA AGGTCAGTAC CCCAAGGCCT AACCTCAAAT GCATCAAACC 7777 .......... .......... .......... .......... .......... .......... 188 AACCAATGGG AGACCTACAT CTCCTACCAT ACAATGCTTC AAATGGAGCC ATATCAATGC 7717 .......... .......... .......... .......... .......... .......... 188 TTGAGTGATA GCTATTATTG TATGAAAACT CCGCTAAGGG TAGGAAGCTA TCCCAATGAC 7657 .......... .......... .......... .......... .......... .......... 188 CACCAAACTC TATCACACAC GCACGAAGCA TATCTTCCAA CACTTGAATC GTCCTTTCAG 7597 .......... .......... .......... .......... .......... .......... 188 ACTGACCATC GGTCTGAGGA TGGAACGCAG TACTAAGGTC CAACCTAGTA CCCAATTCTG 7537 .......... .......... .......... .......... .......... .......... 188 CATGCAATGT TTTCCAAAAC TTAGAAGTAA ACTGCGTACC CCTATCTGAT ATGATGGATA 7477 .......... .......... .......... .......... .......... .......... 188 GTGGAACCCC ATGCAATCGA ACGATTTCTG AGATATAGAT CTTGGCTAAC TTCTCTGCAT 7417 .......... .......... .......... .......... .......... .......... 188 TGTAAGTCAC CTTTACCGGA ATGAAATGAG CAGATTTAGT TAACCTATCA ACAATCACCC 7357 .......... .......... .......... .......... .......... .......... 188 AAATGGAGTC ATACTTACCC ATTGTCCTTG GAAGACCAAC CACAAAGTCC ATTGCAATTC 7297 .......... .......... .......... .......... .......... .......... 188 TTTCCCACTT CCATTCCGGA ATGGGCATTC TCTGAAGTGT TCCTCCGGGC CTTTGGTGTT 7237 .......... .......... .......... .......... .......... .......... 188 CATACTTTAC TTGTTGACAG TTTGGACACT TGGCAATAAA GTCAACAATA TCACGCTTCA 7177 .......... .......... .......... .......... .......... .......... 188 TTCTACTCCA CCAAAAGTGT TGTTTTAGGT CACGATACAT CTTGGTTGCA CTTGGATGTA 7117 .......... .......... .......... .......... .......... .......... 188 TAGAATACCT TGAACTATGA GCCTCTGTCA GAATAGTGTT GATTAAATCA TTGACGGGGT 7057 .......... .......... .......... .......... .......... .......... 188 ACACATACCC TTCCCTTGAT TCTCAAAACA CCTTCCTCAT CGATTTGTGC TTCCTTAGCC 6997 .......... .......... .......... .......... .......... .......... 188 TCTCCTCGCA ATACCTTATC TTGGATTCTT CTTAGTTTCT CATCATCAAA CTGTTTTCCC 6937 .......... .......... .......... .......... .......... .......... 188 TTAATTTTGT CAAGAAAAGA AGATCTTGAC TCCACACTAG CCAACAATCC TCCCTTCTCA 6877 .......... .......... .......... .......... .......... .......... 188 TTTACTTCTA ATATCATCAA GTCATTAGCT AGAGTCTGAA CCTCTCTAGC CAATGGGCGT 6817 .......... .......... .......... .......... .......... .......... 188 CTAGAAGCTT GCAAGTGAGC TAGACTTCCC ATGCTTCCCA CCTTTCTACT TAAAGCATCC 6757 .......... .......... .......... .......... .......... .......... 188 GCTACAACAT TAGCCTTCCC CGGATGATAC AAAATAGTGA TATCGTAGTC CTTTAGTAAC 6697 .......... .......... .......... .......... .......... .......... 188 TCCATCCATC TCCTCTGTCT TAAGTTCAAA TCTTTCTGAG TAAAGACATA CTGTAGGCTA 6637 .......... .......... .......... .......... .......... .......... 188 CGATGATCCG TATAGATCTC ACACTTAACC CCATATAAAT AGTGTCTCCA TTGCTTTAAT 6577 .......... .......... .......... .......... .......... .......... 188 GCAAACACCA CCGCAGCCAA TTCCAAATCG TGGGTCGGAT AGTTACGTTC ATGCACCTTT 6517 .......... .......... .......... .......... .......... .......... 188 AGTTGCCTTG AAGCATAAGC AATCACACTC TTCTCTTGCA TTAGTACAAC ACCCAAACCA 6457 .......... .......... .......... .......... .......... .......... 188 GAATAGGATG CATCACAATA AACAATGAAG TTCTTACCCT CTACTGGCAA GGTAAGGATA 6397 .......... .......... .......... .......... .......... .......... 188 GGTGCGGTAG TCAACAAAGT CTTGAGCTTC TGAAAGCTTT CCTCACATTC GTCCGACCAT 6337 .......... .......... .......... .......... .......... .......... 188 ACAAATGGAA CATTCTGCTT AGTCAAGTTC GTCAATTGGG AAGCAATAGA AGAGAATCCC 6277 .......... .......... .......... .......... .......... .......... 188 TTGACAAATC GACGGTAGTA GCTAGCTAAC CCAACAAAGC TCCTTATTTC TGACACATTA 6217 .......... .......... .......... .......... .......... .......... 188 GTAGGTCTTA CCCAATTCTT CACTGTCTCA ATCTTAGAAG GATCCACCAT CACTCCATCC 6157 .......... .......... .......... .......... .......... .......... 188 TTAGAAACCA CGTGCCCCAA GAAGGACACT GCATCTAGCC AAAACTCACA CTTAGAGAAT 6097 .......... .......... .......... .......... .......... .......... 188 TTGGCATAAA GCTTTTTCTC CCTCAACATT TCCAATACCA TTCTCAAATG CTCTTCATGT 6037 .......... .......... .......... .......... .......... .......... 188 TCCTTCTTGC TCTTTGAGTA TACCAATATA TCATCAATAA ATACGATCAC GAAGAGGTCC 5977 .......... .......... .......... .......... .......... .......... 188 AAATATGGCT TAAAAATCCC GTTCATCAAG CTCATGAACG CAACAGGGGC GTTCATAAGA 5917 .......... .......... .......... .......... .......... .......... 188 CCAAAAGACA TCACTACAAA TTTGTAATGC CCATACCTCG TTCGAAAAGC AGTCTTTGGC 5857 .......... .......... .......... .......... .......... .......... 188 ACATCCGTTG CCCGTATTTT CAATTGATGA TAACCGGATC TCAAGTCAAT CTTAGAGAAG 5797 .......... .......... .......... .......... .......... .......... 188 ACACAAGCAC CTTGTAACTG ATCGAACAAG TCATCAATGC GGGGAAGAGG ATACTTGTTC 5737 .......... .......... .......... .......... .......... .......... 188 TTTATGGTTA CCTTGTTTAG TTGTCTGTAG TCTATACACA TTCGAAAGCT CCCATCCTTC 5677 .......... .......... .......... .......... .......... .......... 188 TTCTTTACAA ACAAAACCGG AGCACCCCAA GGAGATGCAC TTGGTCTAAT AAAGCCTTTG 5617 .......... .......... .......... .......... .......... .......... 188 TTCAATAACT CTTGAAGTTG TGCCTTTAAC TCTCTTAACT CTGCGGGAGC CATTCTATAA 5557 .......... .......... .......... .......... .......... .......... 188 GGGGGTATAG AAATGGGGCG TGTGCCCGGT TCTAGATCGA TACAGAAGTC AATATCCCTA 5497 .......... .......... .......... .......... .......... .......... 188 TCTGGTGGCA TACCAGGAAG ATCTGCAGGG AACACATCCA GAAACTCACG GACTACTGAA 5437 .......... .......... .......... .......... .......... .......... 188 ACCGACTCAA TCGAAGGCAC TTGGGTAGTG TCATCCTTGA GATGTGCCAA GAAAGCTAAA 5377 .......... .......... .......... .......... .......... .......... 188 CAACCTTTAC TAACCATTTT CTTAGCACGA AGAAAGGAGA TGATATGCAC CGGATTGGAA 5317 .......... .......... .......... .......... .......... .......... 188 GCGTTGTCAC CCTCCCACAC TAACGGATCT GTCCCAGGCT TGGCTAACGT CACCGTTTTA 5257 .......... .......... .......... .......... .......... .......... 188 GCATTACAAT CCAAGATCGC AAATTGCGGA GAAAGCCAAG TCATACCTAG AATTACATCA 5197 .......... .......... .......... .......... .......... .......... 188 AAATCATCCA TTTCTAAGAT AACCAAATCT ACATAAGTGT TGCTCCCTAC AAAGTTCACC 5137 .......... .......... .......... .......... .......... .......... 188 AAAAAAGACC TATACACCTT TTCAACTACC ACAGATTCAC CCACCGGAGT AGAAACACGA 5077 .......... .......... .......... .......... .......... .......... 188 ATAGGCATAT CAAGTAATTC ACAATGTAAA TTTAGACCAT TAGCAAATGA GGAAGATACA 5017 .......... .......... .......... .......... .......... .......... 188 TAAGAAAATG TGGATCCAGG ATCAAACAAT ACAGAGGCCA TGCAATCACA AACCAGAAGA 4957 .......... .......... .......... .......... .......... .......... 188 TTACCTGTGA TGACAGCATC AGATGCCTCC GCTTCAGACC GCCCAGGGAA AGCGTAACAA 4897 .......... .......... .......... .......... .......... .......... 188 TGGGCCCTAT CGTTCGTCTG TCCGTTGCCC CTAACTTGTT GTGATGTAGT GGCTCCAGTT 4837 .......... .......... .......... .......... .......... .......... 188 TGCCCATCAC CTTGGCCGTT TTGGTTACCA CCATTTCCTT GACCACCACG TCCTCCAGAA 4777 .......... .......... .......... .......... .......... .......... 188 TAACGGCCTC TGCCATGACC ACCTCTACCT CTAACATTTG GAGGTCTGTA ACTCTGTTTT 4717 .......... .......... .......... .......... .......... .......... 188 GGACAATATC TCTTAATATG TTCGATCTCC CCACATCCAT AACACTTTCT GGGTTCATGC 4657 .......... .......... .......... .......... .......... .......... 188 ATAGGTCTCT CAGAGAAGTG TTGACCGGTC GGAGGTGGAC CCCCAACTAC AGTCTGTAGT 4597 .......... .......... .......... .......... .......... .......... 188 GAAGACTGAA TTGGTCGGAC TGAGTAACTT CCCGAACCCT GTCCTCTAGT GTAAGCACCA 4537 .......... .......... .......... .......... .......... .......... 188 TTAAACTCAC CTCCCTTTCG AAACCTTTTT GATGTCAATG TCGGGGTGAA GTCGTCTGGC 4477 .......... .......... .......... .......... .......... .......... 188 TTCACTCCTT CCACTTCTAT CACAAAGTCT ACCACCTCTT GGAAGGATTT TGCCGTTGCC 4417 .......... .......... .......... .......... .......... .......... 188 ACTATCTGTA AGGCCGAAAT CCGCAATTCT GACCTCAACC CCTTCACAAA CCGGCGAATT 4357 .......... .......... .......... .......... .......... .......... 188 CGCTCTTGTG GACTGAAACA CAGTTGGGTG GCATACCGGG ATAATGCACG AAACTTAGCC 4297 .......... .......... .......... .......... .......... .......... 188 TCATATGCAT TGACCGACAT CCTACCTTGC TCTAGGCTCA AGAACTCATC CCTTTTCCTA 4237 .......... .......... .......... .......... .......... .......... 188 TCCCTCAAAG TTCGGGGGAT ATACTTCTCC ATAAACAAGC TAGAGAATGA GGCCCAAGTC 4177 .......... .......... .......... .......... .......... .......... 188 ATAGGTGGTG CCTCTGTTGG TTGACACTCA ATATGTGACC GCCACCACAT TTTGGCATTA 4117 .......... .......... .......... .......... .......... .......... 188 CCTTGAAACT GATAACTTAC GAACTCAACA CCAAACCGTT CTACTATACC CATCTTGTGT 4057 .......... .......... .......... .......... .......... .......... 188 AGTAGCTCAT GACAGTCAAC CAGAAAATCG TAAGCATCCT CAGATTCCGC ACCCTTGAAT 3997 .......... .......... .......... .......... .......... .......... 188 ACTGGAGGTT TCAATTTCAA GAACTTACTG AAAAGTTCAT GCTGATCATT TGTCATTATA 3937 .......... .......... .......... .......... .......... .......... 188 GGCCCAGTAG TCAGACGTGG AAACGTGCCT ATTTCCAATG GAACATCCAT GCGGGGAGCC 3877 .......... .......... .......... .......... .......... .......... 188 ATAGTAGCCG CATGTTGTAC CTCCGGAGCC TGAGGTGCTG GTGTAGAAAA CACTGGGGTG 3817 .......... .......... .......... .......... .......... .......... 188 TCTGGCCCTG ATCATATAAC CCGCTAAGAT AAGCCAGAAC CTGATTGATC ATCTCTGGGG 3757 .......... .......... .......... .......... .......... .......... 188 TAGGTTGGGG TGGCAATCCC TCATTCTGCA CTTGTTCAGT TTCCCCATCG TCCCCTTCTC 3697 .......... .......... .......... .......... .......... .......... 188 TTATTACTTC CTCAGTCGGT GGAGGAGTCA CTGCCCTAGT ATCAGATGGG CTAGGCGCTC 3637 .......... .......... .......... .......... .......... .......... 188 GTCCTCTTCC CCTAGAGGAC GTCCTCCCAC TACCTCTACC ATGGCCCCTT GCCGCTGTTC 3577 .......... .......... .......... .......... .......... .......... 188 TTCCTCGAGC CACAGCCCCA GTGGTTGGCT CAGTTGTTTC TTGTCTGGCC GGTATTGGTG 3517 .......... .......... .......... .......... .......... .......... 188 TTGGCGTAGT CGTTGCTCTA GTTCTAACCA TCTGTGAAAG AGAGTGAAGA TGGTCAGATA 3457 .......... .......... .......... .......... .......... .......... 188 CTAATTCGTA TCGCCTAGAT ACCAATTGGA CTCAAGTAGT AGCACGAAAG AAAGAATGAG 3397 .......... .......... .......... .......... .......... .......... 188 AGAGTGAAAT TTTCCTAAAG TCTTATAGCC TCTCAAGAAA AAGTAAAGGC GTCCCCCTAC 3337 .......... .......... .......... .......... .......... .......... 188 CGTTCCTTAA GACTCTACTA GACCTGTTCT TGTGTGATGA GACCAACGAA CCTAATGCTC 3277 .......... .......... .......... .......... .......... .......... 188 TGATACCAAG TTTGTCACGA CCCAAATCCA GGCCACGACT GGCACCCACA CTTACCCTCC 3217 .......... .......... .......... .......... .......... .......... 188 TATGTGAGCG AACCAACCAA TCTAAACCTT AACATTTCAA TATAATATAA CCAGAAAGTA 3157 .......... .......... .......... .......... .......... .......... 188 ATGCGGAAGA CTTAAAATCA TTAAATAAAG ACCAATTCAT TAACTTCTAA AATTCAACAT 3097 .......... .......... .......... .......... .......... .......... 188 CTATTATTCC CCCAAAATCT GGAAGTCATC ATCACAAGAA CATCTACGAT CAAATGACTA 3037 |||||| |||||||||| |||||||||| |||| ||||| ||||||| | ||||| |||| ..ATTATT-T CCCAAAATCT GGAAGTCATC ATCATAAGAA CATCTAC-TT CAAATTACTA 244 AA-CTAAGAG TATTCTAA-A AGCT-AAAAA TACATAAGAA GCTAGTCCAT GCCGGAAGTT 2980 || ||||||| | |||||| | |||| ||||| ||||||| || |||||||||| ||||||| || AATCTAAGAG T-TTCTAAGA AGCTAAAAAA TACATAA-AA GCTAGTCCAT GCCGGAACTT 302 CAAGGCATCA AGACTTGAAG AAGAAGACCC AGTCCAAGCT AG-AAGCATT AGCTCACCCT 2921 |||| ||||| |||| ||||| | ||||| || ||||||| || || ||||||| |||||||||| CAAGACATCA AGACATGAAG AGGAAGATCC AGTCCAATCT AGAAAGCATT AGCTCACCCT 362 GAATATCCGG TATGACGAAG ACTGGCTAGA ATCACTGCTG AGTTGAAGAT GACGGAACGT 2861 | ||||||| | | |||| |||||||||| | |||| || ||| |||||| ||||| |||| G-ATATCCGA AGTAATGAAG ACTGGCTAGA GTTACTGTTG AGTCGAAGAT GACGGCACGT 421 TTGCT 2856 ||||| TTGCT 426 hqPGS_C06HBa0153O03.1-2-_SGN-E542084+ (10296 10138,3094 2856) ******************************************************************************** EST sequence 33 -strand 481 n (File: SGN-E246710-) 1 AGTCCAAGCT AGAAGCATTA GCTCACCCTG AATTTCCGAT GTAGTAAGAC TGGCTTGAAT 61 TACTGTTGAG TTGAACACGA TGGCACGTTT GCTGCACTCC ACAAATAAAC AAGAAGAGAA 121 CATAAAAGTA GGGGTCAGTA CAAAACACGG GTACTGAGTA GATATCATCG GCCAACTCAA 181 AATAGAAATC AATATATATA CCAAGTAATA TCATAAAATC AACTATGATA CTCAACATGT 241 AGCAACAACA AATACTATAT CATTAACAAT TACCGTCAAG TTCACACATG AGGACTCAAG 301 CCTCAATACC ATACTCATTT GGGAATCATG TTCATTAGAT TGAGTATATT AACATCTTTC 361 AAGATTCATT ATCTTTATTT CTCTTGTGTC GGTACGTGAC ACTCCGCTCC CTCAATATTC 421 ATTAATCCTC TTGTGTCGGT ACGTGACACT CCGATCCCCT AAATCTATAT GTCGGTTTGT 481 G Predicted gene structure (within gDNA segment 3559 to 1232): Exon 1 2949 2542 ( 408 n); cDNA 1 410 ( 410 n); score: 0.817 MATCH C06HBa0153O03.1-2- SGN-E246710- 0.817 408 0.848 C PGS_C06HBa0153O03.1-2-_SGN-E246710- (2949 2542) Alignment (genomic DNA sequence = upper lines): AGTCCAAGCT AGAAGCATTA GCTCACCCTG AATATCCGGT ATGACGAAGA CTGGCTAGAA 2890 |||||||||| |||||||||| |||||||||| ||| |||| | | | |||| |||||| ||| AGTCCAAGCT AGAAGCATTA GCTCACCCTG AATTTCCGAT GT-AGTAAGA CTGGCTTGAA 59 TCACTGCTGA GTTGAAGATG ACGGAACGTT TGCTGCACTC CACAAAT-AA CAAGAAGAAA 2831 | |||| ||| |||||| | | | || ||||| |||||||||| ||||||| || |||||||| | TTACTGTTGA GTTGAACACG ATGGCACGTT TGCTGCACTC CACAAATAAA CAAGAAGAGA 119 ACATAAAAGT AGGGGTCAGT ACAAAACACG GGTACTGAGT AGATATCATC GGCCAACTCA 2771 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACATAAAAGT AGGGGTCAGT ACAAAACACG GGTACTGAGT AGATATCATC GGCCAACTCA 179 AAATAGAAAA CAGTATGTAT --TAAGCAAT ATCATAAAAT CAATTAATAT CCTTAGCATG 2713 ||||||||| || ||| ||| ||| ||| |||||||||| ||| || || || | |||| AAATAGAAAT CAATATATAT ACCAAGTAAT ATCATAAAAT CAACTATGAT ACTCAACATG 239 CAGCATTTAC AGTTACCATA ACCCTTGGTT ACAACACCAA GCACATCAAT GAGGACTCAC 2653 |||| || | ||| ||| | | | | ||| | || || ||||||||| TAGCAACAAC AAATACTATA TCATTAACAA TTACCGTCAA GTTCACACAT GAGGACTCAA 299 ACCTCCTCAT CACACTCATT TGGGAATTTA GTTCATTAGA TTGGATATAT TAACATATTT 2593 |||| | || ||||||| ||||||| |||||||||| ||| ||||| |||||| ||| GCCTCAATAC CATACTCATT TGGGAATCAT GTTCATTAGA TTGAGTATAT TAACATCTTT 359 CAAGATTCAT TATCTTTATT CTCCTCGTGT CGGTACGTGA CACTCCGCTC C 2542 |||||||||| |||||||||| || |||| |||||||||| |||||||||| | CAAGATTCAT TATCTTTATT TCTCTTGTGT CGGTACGTGA CACTCCGCTC C 410 hqPGS_C06HBa0153O03.1-2-_SGN-E246710- (2949 2542) ******************************************************************************** EST sequence 46 -strand 236 n (File: SGN-E209683-) 1 CACAAATAAC AAGAAGATAA ACATAAAAGT AGGGGTCAGT ACAAACCACG GGTACTGAGT 61 AGATATCATC GGCCAACTCA AAATAGGGAA CAGTATGTAT TAAGCAATAT CATAAAATCA 121 ACTAATATCC TTAACATGCA GCATTTATAG TTACCATAAC CCTTGGTTAC AACACCAAGC 181 ACATCAATGA GGACTCACAC CTCCTCATCA TACTCATTTG GGAATTTAGT TCATTA Predicted gene structure (within gDNA segment 3459 to 2015): Exon 1 2849 2615 ( 235 n); cDNA 1 236 ( 236 n); score: 0.960 MATCH C06HBa0153O03.1-2- SGN-E209683- 0.960 235 0.996 C PGS_C06HBa0153O03.1-2-_SGN-E209683- (2849 2615) Alignment (genomic DNA sequence = upper lines): CACAAATAAC AAGAAGA-AA ACATAAAAGT AGGGGTCAGT ACAAAACACG GGTACTGAGT 2791 |||||||||| ||||||| || |||||||||| |||||||||| ||||| |||| |||||||||| CACAAATAAC AAGAAGATAA ACATAAAAGT AGGGGTCAGT ACAAACCACG GGTACTGAGT 60 AGATATCATC GGCCAACTCA AAATAGAAAA CAGTATGTAT TAAGCAATAT CATAAAATCA 2731 |||||||||| |||||||||| |||||| || |||||||||| |||||||||| |||||||||| AGATATCATC GGCCAACTCA AAATAGGGAA CAGTATGTAT TAAGCAATAT CATAAAATCA 120 ATTAATATCC TTAGCATGCA GCATTTACAG TTACCATAAC CCTTGGTTAC AACACCAAGC 2671 | |||||||| ||| |||||| ||||||| || |||||||||| |||||||||| |||||||||| ACTAATATCC TTAACATGCA GCATTTATAG TTACCATAAC CCTTGGTTAC AACACCAAGC 180 ACATCAATGA GGACTCACAC CTCCTCATCA CACTCATTTG GGAATTTAGT TCATTA 2615 |||||||||| |||||||||| |||||||||| ||||||||| |||||||||| |||||| ACATCAATGA GGACTCACAC CTCCTCATCA TACTCATTTG GGAATTTAGT TCATTA 236 hqPGS_C06HBa0153O03.1-2-_SGN-E209683- (2849 2615) ******************************************************************************** EST sequence 16 -strand 729 n (File: SGN-E351546-) 1 AGTCGTTGCT CTAGTTCTAC CCATCTGGCA AGAGAGTGAG NATGGTCAGA TACCAATTCG 61 TATCGCTTAG ATACCAATTG ACTCGAAGTA GTAGCACGAA AGAAAGAATG AAAGAGTGAA 121 GTTTTCCTAA AGTCTTATAG CCTCTCAAGG AAAAGTAAAA GCGTCCCCCT ACCGTTCCTT 181 AAGACTCTAC TAGACTTGTT CTTGTGTGAT GAGACCAACG AACCTAATGC TCTGATACCA 241 AGTTTTGTCA CGACCCAAAT CCGGGCCGCC ACTGGCACCC ACACTTACCC TCCTATGTGA 301 GCGAACCAAC CAATCTAAAC CTTAACATTT CAATGTAATA GCAACAGAAA GTAATGCGGA 361 AGACTTAAAC TCATTAATAA AATCAATAAC TACTATTATT AAACATCTAT TATTCCCAAA 421 ACCTGGAAGT CATCATCACA AGAACATCTA CTTTAAACTA CTAATTCTAA GAGTTTCTAA 481 GAAGCTAAAA AATTACATAA GAAGCTAGTC CATGCCGGAA GTTCAAGGCA TCAAGACATG 541 AAGGAGAAGA TCCAGTCCAA GCTAGAAGCG TTAGCTCACC CTGAAGATCC GGTGTGACGA 601 AGACTGGCTT GAGTTACTGT TGAGTCGAAG ATGACGGCAC GTTTGCTGCA CTCCACAAAT 661 ACCAAGAAGA AAAACATAAA AGTAGGGGTC AGTACAAAAC ACGGCTACTG AGTAGATATC 721 ATCGGCCAA Predicted gene structure (within gDNA segment 4883 to 2175): Exon 1 3509 2775 ( 735 n); cDNA 1 729 ( 729 n); score: 0.899 MATCH C06HBa0153O03.1-2- SGN-E351546- 0.899 735 1.008 C PGS_C06HBa0153O03.1-2-_SGN-E351546- (3509 2775) Alignment (genomic DNA sequence = upper lines): AGTCGTTGCT CTAGTTCTAA CCATCTGTGA AAGAGAGTGA AGATGGTCAG ATACTAATTC 3450 |||||||||| ||||||||| ||||||| | |||||||||| |||||||| |||| ||||| AGTCGTTGCT CTAGTTCTAC CCATCTG-GC AAGAGAGTGA GNATGGTCAG ATACCAATTC 59 GTATCGCCTA GATACCAATT GGACTC-AAG TAGTAGCACG AAAGAAAGAA TGAGAGAGTG 3391 ||||||| || |||||||||| ||||| ||| |||||||||| |||||||||| ||| |||||| GTATCGCTTA GATACCAATT -GACTCGAAG TAGTAGCACG AAAGAAAGAA TGAAAGAGTG 118 AAATTTTCCT AAAGTCTTAT AGCCTCTCAA GAAAAAGTAA AGGCGTCCCC CTACCGTTCC 3331 || ||||||| |||||||||| |||||||||| | |||||||| | |||||||| |||||||||| AAGTTTTCCT AAAGTCTTAT AGCCTCTCAA GGAAAAGTAA AAGCGTCCCC CTACCGTTCC 178 TTAAGACTCT ACTAGACCTG TTCTTGTGTG ATGAGACCAA CGAACCTAAT GCTCTGATAC 3271 |||||||||| ||||||| || |||||||||| |||||||||| |||||||||| |||||||||| TTAAGACTCT ACTAGACTTG TTCTTGTGTG ATGAGACCAA CGAACCTAAT GCTCTGATAC 238 CAAG-TTTGT CACGACCCAA ATCCAGGCCA CGACTGGCAC CCACACTTAC CCTCCTATGT 3212 |||| ||||| |||||||||| |||| |||| | |||||||| |||||||||| |||||||||| CAAGTTTTGT CACGACCCAA ATCCGGGCCG CCACTGGCAC CCACACTTAC CCTCCTATGT 298 GAGCGAACCA ACCAATCTAA ACCTTAACAT TTCAATATAA TATAACCAGA AAGTAATGCG 3152 |||||||||| |||||||||| |||||||||| |||||| ||| || | |||| |||||||||| GAGCGAACCA ACCAATCTAA ACCTTAACAT TTCAATGTAA TAGCAACAGA AAGTAATGCG 358 GAAGACTTAA AATCATTAAA TAAAGACCAA TTCATTAACT TCTAAAATTC AACATCTATT 3092 |||||||||| | ||||| || | | | || ||| ||||| ||| ||| |||||||||| GAAGACTTAA ACTCATT-AA T--A-A--AA -TCAATAACT ACTATTATTA AACATCTATT 411 ATTCCCCCAA AATCTGGAAG TCATCATCAC AAGAACATCT ACGATCAAAT GACTAA-ACT 3033 ||| ||||| || ||||||| |||||||||| |||||||||| || | ||| ||||| || ATT--CCCAA AACCTGGAAG TCATCATCAC AAGAACATCT AC-TTTAAAC TACTAATTCT 468 AAGAGTATTC TAA-AAGCT- AAAAA-TACA TAAGAAGCTA GTCCATGCCG GAAGTTCAAG 2976 |||||| ||| ||| ||||| ||||| |||| |||||||||| |||||||||| |||||||||| AAGAGT-TTC TAAGAAGCTA AAAAATTACA TAAGAAGCTA GTCCATGCCG GAAGTTCAAG 527 GCATCAAGAC TTGAAGAAGA AGACCCAGTC CAAGCTAGAA GCATTAGCTC ACCCTGAATA 2916 |||||||||| ||||| ||| ||| |||||| |||||||||| || ||||||| |||||||| | GCATCAAGAC ATGAAGGAGA AGATCCAGTC CAAGCTAGAA GCGTTAGCTC ACCCTGAAGA 587 TCCGGTATGA CGAAGACTGG CTAGAATCAC TGCTGAGTTG AAGATGACGG AACGTTTGCT 2856 |||||| ||| |||||||||| || || | || || ||||| | |||||||||| ||||||||| TCCGGTGTGA CGAAGACTGG CTTGAGTTAC TGTTGAGTCG AAGATGACGG CACGTTTGCT 647 GCACTCCACA AATAACAAGA AG-AAAACAT AAAAGTAGGG GTCAGTACAA AACACGGGTA 2797 |||||||||| |||| ||||| || ||||||| |||||||||| |||||||||| ||||||| || GCACTCCACA AATACCAAGA AGAAAAACAT AAAAGTAGGG GTCAGTACAA AACACGGCTA 707 CTGAGTAGAT ATCATCGGCC AA 2775 |||||||||| |||||||||| || CTGAGTAGAT ATCATCGGCC AA 729 hqPGS_C06HBa0153O03.1-2-_SGN-E351546- (3509 2775) ******************************************************************************** EST sequence 39 -strand 655 n (File: SGN-E356696-) 1 CAATTGGACT CAAGTAGTAG CACGAAAGAA AGAATGAAAG AGTGAAGTTT TCCTAAAGTC 61 TTATAGCCTC TCAAGGAAAA GTAAAAGCGT CCCCCTACCG TTCCTTAAGA CTCTACTAGA 121 CTTGTTCTTG TGTGATGAGA CCAACGAACC TAATGCTCTG ATACCAAGTT TTGTCACGAC 181 CCAAATCCGG GCCGCCACTG GCACCCACAC TTACCCTCNT ATGTGAGCGA ACCAACCAAT 241 CTAAACCTTA ACATTTCAAT GTAATAGCAA CAGAAAGTAA TGCGGAAGAC TTAAACTCAT 301 TAATAAAATC AATAACTACT ATTATTAAAC ATCTATTATT CCCAAAACCT GGAAGTCATC 361 ATCACAAGAA CATCTACTTT AAACTACTAA TTCTAAGAGT TTCTAAGAAG CTAAAAAATT 421 ACATAAGAAG CTAGTCCATG CCGGAAGTTC AAGGCATCAA GACATGAAGG AGAAGATCCA 481 GTCCAAGCTA GAAGCGTTAG CTCACCCTGA AGATCCGGTG TGACGAAGAC TGGCTTGAGT 541 TACTGTTGAG TCGAAGATGA CGGCACGTTT GCTGCACTCC ACAAATAACA AGAAGAAAAA 601 CATAAAAGTA GGGGTCAGTA CAAAACACGG CTACTGAGTA GATATCATCG GCCAA Predicted gene structure (within gDNA segment 4044 to 2175): Exon 1 3434 2775 ( 660 n); cDNA 1 655 ( 655 n); score: 0.903 MATCH C06HBa0153O03.1-2- SGN-E356696- 0.903 660 1.008 C PGS_C06HBa0153O03.1-2-_SGN-E356696- (3434 2775) Alignment (genomic DNA sequence = upper lines): CAATTGGACT CAAGTAGTAG CACGAAAGAA AGAATGAGAG AGTGAAATTT TCCTAAAGTC 3375 |||||||||| |||||||||| |||||||||| ||||||| || |||||| ||| |||||||||| CAATTGGACT CAAGTAGTAG CACGAAAGAA AGAATGAAAG AGTGAAGTTT TCCTAAAGTC 60 TTATAGCCTC TCAAGAAAAA GTAAAGGCGT CCCCCTACCG TTCCTTAAGA CTCTACTAGA 3315 |||||||||| ||||| |||| ||||| |||| |||||||||| |||||||||| |||||||||| TTATAGCCTC TCAAGGAAAA GTAAAAGCGT CCCCCTACCG TTCCTTAAGA CTCTACTAGA 120 CCTGTTCTTG TGTGATGAGA CCAACGAACC TAATGCTCTG ATACCAAG-T TTGTCACGAC 3256 | |||||||| |||||||||| |||||||||| |||||||||| |||||||| | |||||||||| CTTGTTCTTG TGTGATGAGA CCAACGAACC TAATGCTCTG ATACCAAGTT TTGTCACGAC 180 CCAAATCCAG GCCACGACTG GCACCCACAC TTACCCTCCT ATGTGAGCGA ACCAACCAAT 3196 |||||||| | ||| | |||| |||||||||| |||||||| | |||||||||| |||||||||| CCAAATCCGG GCCGCCACTG GCACCCACAC TTACCCTCNT ATGTGAGCGA ACCAACCAAT 240 CTAAACCTTA ACATTTCAAT ATAATATAAC CAGAAAGTAA TGCGGAAGAC TTAAAATCAT 3136 |||||||||| |||||||||| ||||| | |||||||||| |||||||||| ||||| |||| CTAAACCTTA ACATTTCAAT GTAATAGCAA CAGAAAGTAA TGCGGAAGAC TTAAACTCAT 300 TAAATAAAGA CCAATTCATT AACTTCTAAA ATTCAACATC TATTATTCCC CCAAAATCTG 3076 | ||| | | || ||| | |||| ||| ||| |||||| ||||||| | |||||| ||| T-AAT--A-A --AA-TCAAT AACTACTATT ATTAAACATC TATTATT--C CCAAAACCTG 351 GAAGTCATCA TCACAAGAAC ATCTACGATC AAATGACTAA -ACTAAGAGT ATTCTAA-AA 3018 |||||||||| |||||||||| |||||| | ||| ||||| |||||||| |||||| || GAAGTCATCA TCACAAGAAC ATCTAC-TTT AAACTACTAA TTCTAAGAGT -TTCTAAGAA 409 GCT-AAAAA- TACATAAGAA GCTAGTCCAT GCCGGAAGTT CAAGGCATCA AGACTTGAAG 2960 ||| ||||| |||||||||| |||||||||| |||||||||| |||||||||| |||| ||||| GCTAAAAAAT TACATAAGAA GCTAGTCCAT GCCGGAAGTT CAAGGCATCA AGACATGAAG 469 AAGAAGACCC AGTCCAAGCT AGAAGCATTA GCTCACCCTG AATATCCGGT ATGACGAAGA 2900 |||||| || |||||||||| |||||| ||| |||||||||| || ||||||| ||||||||| GAGAAGATCC AGTCCAAGCT AGAAGCGTTA GCTCACCCTG AAGATCCGGT GTGACGAAGA 529 CTGGCTAGAA TCACTGCTGA GTTGAAGATG ACGGAACGTT TGCTGCACTC CACAAATAAC 2840 |||||| || | |||| ||| || ||||||| |||| ||||| |||||||||| |||||||||| CTGGCTTGAG TTACTGTTGA GTCGAAGATG ACGGCACGTT TGCTGCACTC CACAAATAAC 589 AAGAAG-AAA ACATAAAAGT AGGGGTCAGT ACAAAACACG GGTACTGAGT AGATATCATC 2781 |||||| ||| |||||||||| |||||||||| |||||||||| | |||||||| |||||||||| AAGAAGAAAA ACATAAAAGT AGGGGTCAGT ACAAAACACG GCTACTGAGT AGATATCATC 649 GGCCAA 2775 |||||| GGCCAA 655 hqPGS_C06HBa0153O03.1-2-_SGN-E356696- (3434 2775) ******************************************************************************** EST sequence 35 -strand 580 n (File: SGN-E356206-) 1 GAAAAGTAAA AGCGTCCCCN TACCGTCCCT TAAGACTCTA CTAGACTTGT TCTTGTGTGA 61 TGAGACCAAC GACCCTAATG CTCTGATACC AAGTTTTGTC ACGACCCAAA TCCGGGCCGC 121 CACTGGCACC CACACTTACC CTCCTATGTG AGCGAACCAA CCAATCTAAA CCTTAACATT 181 TCAATGTAAT AGCAACAGAA AGTAATGCGG AAGACTTAAA CTCATTAATA AAATCAATAA 241 CTACTATTAT TAAACATCTA TTATTCCCAA AACCTGGAAG TCATCATCAC AAGAACATCT 301 ACTTTAAACT ACTAATTCTA AGAGTTTCTA AGAAGCTAAA AAATTACATA AGAAGCTAGT 361 CCATGCCGGA AGTTCAAGGC ATCAAGACAT GAAGGAGAAG ATCCAGTCCA AGCTAGAAGC 421 GTTAGCTCAC CCTGAAGATC CGGTGTGACG AAGACTGGCT TGAGTTACTG TTGAGTCGAA 481 GATGACGGCA CGTTTGCTGC ACTCCACAAA TAACAAGAAG AAAAACATAA AAGTAGGGGT 541 CAGTACAAAA CACGGCTACT GAGTAGATAT CATCGGCCAA Predicted gene structure (within gDNA segment 4212 to 2175): Exon 1 3358 2775 ( 584 n); cDNA 2 580 ( 579 n); score: 0.892 MATCH C06HBa0153O03.1-2- SGN-E356206- 0.892 584 1.007 C PGS_C06HBa0153O03.1-2-_SGN-E356206- (3358 2775) Alignment (genomic DNA sequence = upper lines): AAAAGTAAAG GCGTCCCCCT ACCGTTCCTT AAGACTCTAC TAGACCTGTT CTTGTGTGAT 3299 ||||||||| |||||||| | ||||| |||| |||||||||| ||||| |||| |||||||||| AAAAGTAAAA GCGTCCCCNT ACCGTCCCTT AAGACTCTAC TAGACTTGTT CTTGTGTGAT 61 GAGACCAACG AACCTAATGC TCTGATACCA AG-TTTGTCA CGACCCAAAT CCAGGCCACG 3240 |||||||||| | |||||||| |||||||||| || ||||||| |||||||||| || |||| | GAGACCAACG ACCCTAATGC TCTGATACCA AGTTTTGTCA CGACCCAAAT CCGGGCCGCC 121 ACTGGCACCC ACACTTACCC TCCTATGTGA GCGAACCAAC CAATCTAAAC CTTAACATTT 3180 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACTGGCACCC ACACTTACCC TCCTATGTGA GCGAACCAAC CAATCTAAAC CTTAACATTT 181 CAATATAATA TAACCAGAAA GTAATGCGGA AGACTTAAAA TCATTAAATA AAGACCAATT 3120 |||| ||||| | |||||| |||||||||| ||||||||| ||||| ||| | | || | CAATGTAATA GCAACAGAAA GTAATGCGGA AGACTTAAAC TCATT-AAT- -A-A--AA-T 234 CATTAACTTC TAAAATTCAA CATCTATTAT TCCCCCAAAA TCTGGAAGTC ATCATCACAA 3060 || ||||| | || ||| || |||||||||| | ||||||| ||||||||| |||||||||| CAATAACTAC TATTATTAAA CATCTATTAT T--CCCAAAA CCTGGAAGTC ATCATCACAA 292 GAACATCTAC GATCAAATGA CTAA-ACTAA GAGTATTCTA A-AAGCT-AA AAA-TACATA 3004 |||||||||| | ||| | |||| |||| |||| ||||| | ||||| || ||| |||||| GAACATCTAC -TTTAAACTA CTAATTCTAA GAGT-TTCTA AGAAGCTAAA AAATTACATA 350 AGAAGCTAGT CCATGCCGGA AGTTCAAGGC ATCAAGACTT GAAGAAGAAG ACCCAGTCCA 2944 |||||||||| |||||||||| |||||||||| |||||||| | |||| ||||| | |||||||| AGAAGCTAGT CCATGCCGGA AGTTCAAGGC ATCAAGACAT GAAGGAGAAG ATCCAGTCCA 410 AGCTAGAAGC ATTAGCTCAC CCTGAATATC CGGTATGACG AAGACTGGCT AGAATCACTG 2884 |||||||||| ||||||||| |||||| ||| |||| ||||| |||||||||| || | |||| AGCTAGAAGC GTTAGCTCAC CCTGAAGATC CGGTGTGACG AAGACTGGCT TGAGTTACTG 470 CTGAGTTGAA GATGACGGAA CGTTTGCTGC ACTCCACAAA TAACAAGAAG -AAAACATAA 2825 ||||| ||| |||||||| | |||||||||| |||||||||| |||||||||| ||||||||| TTGAGTCGAA GATGACGGCA CGTTTGCTGC ACTCCACAAA TAACAAGAAG AAAAACATAA 530 AAGTAGGGGT CAGTACAAAA CACGGGTACT GAGTAGATAT CATCGGCCAA 2775 |||||||||| |||||||||| ||||| |||| |||||||||| |||||||||| AAGTAGGGGT CAGTACAAAA CACGGCTACT GAGTAGATAT CATCGGCCAA 580 hqPGS_C06HBa0153O03.1-2-_SGN-E356206- (3358 2775) ******************************************************************************** EST sequence 52 +strand 434 n (File: SGN-E222578+) 1 TTTTTTTTTT TTTTTTTTTA ATAAAAACCA ATTCAATAAC TATCAATATT CAACATCTAT 61 TATTCCCAAA ACCTGGAAGT CATCATCACA AGAACATCTA CTTTAAACTA CTAATTCTAA 121 GAGTTTCTAA AAGCTAAAAA TACATAAGAA GCTAGTCCAT GCCGGAGGTT CAAGGCATCA 181 AGACATGAAG GAGAAGATCC AGTCCAAGCT AGACGCGTTA GCTCACCCTG AAGATCCGGT 241 GTGACGAAGA CTGGCTTGAG TTACTGTTGA GTCGAAGATG ACGGCACGTT TGCTGCACTC 301 CACAACTTTC TAGATGGGGA CTTTCTTCAA GGCTTCGAGA TGGAAACTTG CTTGCAGAGC 361 TTCGAGTGTT ACCAGCTTCA AGATGGAGTT TCAGTGATGA GGCTTGCTAG TCTCGAGTTT 421 TTTTTTTTTT TTTT Predicted gene structure (within gDNA segment 4175 to 1): Exon 1 3126 2845 ( 282 n); cDNA 27 305 ( 279 n); score: 0.888 PPA cDNA 19 1 MATCH C06HBa0153O03.1-2- SGN-E222578+ 0.888 282 0.650 C PGS_C06HBa0153O03.1-2-_SGN-E222578+ (3126 2845) Alignment (genomic DNA sequence = upper lines): ACCAATTCAT TAACTTCTAA AATTCAACAT CTATTATTCC CCCAAAATCT GGAAGTCATC 3067 ||||||||| ||||| || ||||||||| |||||||| ||||||| || |||||||||| ACCAATTCAA TAACTATCAA TATTCAACAT CTATTATT-- CCCAAAACCT GGAAGTCATC 84 ATCACAAGAA CATCTACGAT CAAATGACTA A-ACTAAGAG TATTCTAAAA GCTAAAAATA 3008 |||||||||| ||||||| | ||| |||| | ||||||| | |||||||| |||||||||| ATCACAAGAA CATCTAC-TT TAAACTACTA ATTCTAAGAG T-TTCTAAAA GCTAAAAATA 142 CATAAGAAGC TAGTCCATGC CGGAAGTTCA AGGCATCAAG ACTTGAAGAA GAAGACCCAG 2948 |||||||||| |||||||||| |||| ||||| |||||||||| || ||||| | ||||| |||| CATAAGAAGC TAGTCCATGC CGGAGGTTCA AGGCATCAAG ACATGAAGGA GAAGATCCAG 202 TCCAAGCTAG AAGCATTAGC TCACCCTGAA TATCCGGTAT GACGAAGACT GGCTAGAATC 2888 |||||||||| | || ||||| |||||||||| ||||||| | |||||||||| |||| || | TCCAAGCTAG ACGCGTTAGC TCACCCTGAA GATCCGGTGT GACGAAGACT GGCTTGAGTT 262 ACTGCTGAGT TGAAGATGAC GGAACGTTTG CTGCACTCCA CAA 2845 |||| ||||| ||||||||| || ||||||| |||||||||| ||| ACTGTTGAGT CGAAGATGAC GGCACGTTTG CTGCACTCCA CAA 305 hqPGS_C06HBa0153O03.1-2-_SGN-E222578+ (3126 2845) ******************************************************************************** EST sequence 132 +strand 710 n (File: SGN-E392027+) 1 CCACAGCCCC AGTGGCTGGC TCAGTCGCAC CCTGTCCCGC CGGTGCTGGT GTTGATGCTG 61 GCGTAGTCGT TGCTCTAGTT CTAACCATCT GCGAAATAGA GTGAAGATGG TCAGATACCA 121 ATTTGTATCA CCTAGATACC AATTGGACCC AAGTAATAGC ACGAAAGAAG AAAGAATGGA 181 ATTTTCCAAA AGTCTTATAG CCTCTCAAGG AAAAGTAAAG GCATCCCCCT ACCGTTCCTT 241 AAGACTCTAC TAGACTCGTT CTTGTGTGAT GAGACCAACG AACCTAATGC TCTGATACCA 301 AGTTTGTCAC GACCAAAACC GGGTTGCGAC TGGCACCCAC ACTTACCCTC CTATGTGAGC 361 GAACCAACCA ATCTAACCTT AACATTTCAA TATAATATCA ACAGAAAGTA ATGTGGAAGA 421 CTTAAACTCA TTAAATACAG ACCAATTCAT TAACTTCTAA AATTCAACAT CTATTATTCC 481 CCAAAATCTG GAAGTCATCA CCACAAGAAC ATCTACGATC AAATGACTAA ACTAAGAGTA 541 GTCTAAAAGC TAAAAATACA TAAGAAGCTA GTCCATGCCG GAAGTTCAAG GCATCAAGAC 601 TTGAAGAAGA AGATCCAGTC CAAGCTAGAA GCATTAGCTC ACCCTGAATT TCCGATGTAG 661 TAAGACTGGC TTGAATTACT GTTGAGTTGA ACACGATGGC ACGTTTGCTG Predicted gene structure (within gDNA segment 4695 to 1697): Exon 1 3507 2855 ( 653 n); cDNA 67 710 ( 644 n); score: 0.928 MATCH C06HBa0153O03.1-2- SGN-E392027+ 0.928 653 0.920 C PGS_C06HBa0153O03.1-2-_SGN-E392027+ (3507 2855) Alignment (genomic DNA sequence = upper lines): TCGTTGCTCT AGTTCTAACC ATCTGTGAAA GAGAGTGAAG ATGGTCAGAT ACTAATTCGT 3448 |||||||||| |||||||||| ||||| |||| ||||||||| |||||||||| || |||| || TCGTTGCTCT AGTTCTAACC ATCTGCGAAA TAGAGTGAAG ATGGTCAGAT ACCAATTTGT 126 ATCGCCTAGA TACCAATTGG ACTCAAGTAG TAGCACGAAA GAAAGAATGA GAGAGTGAAA 3388 ||| |||||| |||||||||| || |||||| |||||||||| | ||||| | || | || || ATCACCTAGA TACCAATTGG ACCCAAGTAA TAGCACGAAA G-AAGAA--A GA-A-TGGAA 181 TTTTCCTAAA GTCTTATAGC CTCTCAAGAA AAAGTAAAGG CGTCCCCCTA CCGTTCCTTA 3328 |||||| ||| |||||||||| |||||||| | |||||||||| | |||||||| |||||||||| TTTTCCAAAA GTCTTATAGC CTCTCAAGGA AAAGTAAAGG CATCCCCCTA CCGTTCCTTA 241 AGACTCTACT AGACCTGTTC TTGTGTGATG AGACCAACGA ACCTAATGCT CTGATACCAA 3268 |||||||||| |||| |||| |||||||||| |||||||||| |||||||||| |||||||||| AGACTCTACT AGACTCGTTC TTGTGTGATG AGACCAACGA ACCTAATGCT CTGATACCAA 301 GTTTGTCACG ACCCAAATCC AGGCCACGAC TGGCACCCAC ACTTACCCTC CTATGTGAGC 3208 |||||||||| | ||||| || || |||| |||||||||| |||||||||| |||||||||| GTTTGTCACG A-CCAAAACC GGGTTGCGAC TGGCACCCAC ACTTACCCTC CTATGTGAGC 360 GAACCAACCA ATCTAAACCT TAACATTTCA ATATAATATA ACCAGAAAGT AATGCGGAAG 3148 |||||||||| |||| ||||| |||||||||| ||||||||| | |||||||| |||| ||||| GAACCAACCA ATCT-AACCT TAACATTTCA ATATAATATC AACAGAAAGT AATGTGGAAG 419 ACTTAAAATC ATTAAATAAA GACCAATTCA TTAACTTCTA AAATTCAACA TCTATTATTC 3088 ||||||| || |||||||| | |||||||||| |||||||||| |||||||||| ||||||||| ACTTAAACTC ATTAAATACA GACCAATTCA TTAACTTCTA AAATTCAACA TCTATTATT- 478 CCCCAAAATC TGGAAGTCAT CATCACAAGA ACATCTACGA TCAAATGACT AAACTAAGAG 3028 |||||||||| |||||||||| || ||||||| |||||||||| |||||||||| |||||||||| CCCCAAAATC TGGAAGTCAT CACCACAAGA ACATCTACGA TCAAATGACT AAACTAAGAG 538 TATTCTAAAA GCTAAAAATA CATAAGAAGC TAGTCCATGC CGGAAGTTCA AGGCATCAAG 2968 || ||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TAGTCTAAAA GCTAAAAATA CATAAGAAGC TAGTCCATGC CGGAAGTTCA AGGCATCAAG 598 ACTTGAAGAA GAAGACCCAG TCCAAGCTAG AAGCATTAGC TCACCCTGAA TATCCGGTAT 2908 |||||||||| ||||| |||| |||||||||| |||||||||| |||||||||| | |||| | | ACTTGAAGAA GAAGATCCAG TCCAAGCTAG AAGCATTAGC TCACCCTGAA TTTCCGATGT 658 GACGAAGACT GGCTAGAATC ACTGCTGAGT TGAAGATGAC GGAACGTTTG CTG 2855 | |||||| |||| |||| |||| ||||| |||| | || || ||||||| ||| -AGTAAGACT GGCTTGAATT ACTGTTGAGT TGAACACGAT GGCACGTTTG CTG 710 hqPGS_C06HBa0153O03.1-2-_SGN-E392027+ (3507 2855) ******************************************************************************** EST sequence 50 +strand 679 n (File: SGN-E370357+) 1 TTTTTTTTTT CTTACAATTA TATTATGAAT TCGATAATCT TTAATGTCAC GACCCAAATC 61 GAGCCGCAAG TGGCACCCAC ACTTACCCTC CTATGTGAGC GAACCAACCA ATACAAAATC 121 CAACATTTCA ATATAATGAC GGAATATAAT GCGGAAGACT TAAACTCATT AATGAAAATC 181 AATTAAATAA CTTCTAAAAA CTCAACAACT ATTATTATCC CCAAAATCTG GAAGTCATCA 241 TCACAAGAAC ATCTATCCTC AAATTACTAA TTCTAAGAGT ATCTAGAAAG CTAGAATAAC 301 TAAAAAGCTA GTCCATGCCG GAACTTCAAG GCATCAAGAC ATGAAGAAGA AGATCCAGTC 361 CAAGCTAGAA GCGTTAGCTC ACACTGAAAT CCGGTATAAT GAAGACTGGC TAGAGTTGCG 421 GTTGAGTTGA AGACGACGGT ACGTTTGCTT TATTCGAGTG TCAATTAATC ATTCGGCTGT 481 CACCCAAATA TTATTGATTG ATTACACCTC TGCCATTTGT AAAATTTTTC AAATTTGCCT 541 ACGGATGCAG AATTTTCCTC GAATTTCTGA TGTGTTTTCT TGTAAATAGT GGCCATTTGT 601 GTAAGTAAAT GCCCATTTCT CCTCCTACAA AGTCCAATTC CATTTTTCCC CCAATCCACC 661 ATGGCAACAC CACCTCCAA Predicted gene structure (within gDNA segment 4304 to 1): Exon 1 3264 2856 ( 409 n); cDNA 45 449 ( 405 n); score: 0.844 PPA cDNA 13 1 MATCH C06HBa0153O03.1-2- SGN-E370357+ 0.844 409 0.602 C PGS_C06HBa0153O03.1-2-_SGN-E370357+ (3264 2856) Alignment (genomic DNA sequence = upper lines): TGTCACGACC CAAATCCAGG CCACGACTGG CACCCACACT TACCCTCCTA TGTGAGCGAA 3205 |||||||||| |||||| | | || | | ||| |||||||||| |||||||||| |||||||||| TGTCACGACC CAAATCGA-G CCGCAAGTGG CACCCACACT TACCCTCCTA TGTGAGCGAA 103 CCAACCAATC TAAACCTTAA CATTTCAATA TAATATAACC AGAAAGTAAT GCGGAAGACT 3145 ||||||||| ||| || |||||||||| | | || | | ||| |||| |||||||||| CCAACCAATA CAAAATCCAA CATTTCAATA T-A-ATGA-C GGAATATAAT GCGGAAGACT 160 TAAAATCATT AAATAAAGAC CAATTCATTA ACTTCT-AAA ATTCAACATC TATTATT-CC 3087 |||| ||||| ||| || | ||||| | || |||||| ||| | |||||| | ||||||| | TAAACTCATT -AATGAAAAT CAATTAAATA ACTTCTAAAA ACTCAACAAC TATTATTATC 219 CCCAAAATCT GGAAGTCATC ATCACAAGAA CATCTACGAT CAAATGACTA A-ACTAAGAG 3028 |||||||||| |||||||||| |||||||||| |||||| | ||||| |||| | ||||||| CCCAAAATCT GGAAGTCATC ATCACAAGAA CATCTATCCT CAAATTACTA ATTCTAAGAG 279 TATTCTA-AA AGCTAAAAAT ACATAAGAAG CTAGTCCATG CCGGAAGTTC AAGGCATCAA 2969 || |||| || |||| | ||| | ||| ||| |||||||||| |||||| ||| |||||||||| TA-TCTAGAA AGCT-AGAAT AACTAAAAAG CTAGTCCATG CCGGAACTTC AAGGCATCAA 337 GACTTGAAGA AGAAGACCCA GTCCAAGCTA GAAGCATTAG CTCACCCTGA ATATCCGGTA 2909 ||| |||||| |||||| ||| |||||||||| ||||| |||| ||||| |||| | |||||||| GACATGAAGA AGAAGATCCA GTCCAAGCTA GAAGCGTTAG CTCACACTGA A-ATCCGGTA 396 TGACGAAGAC TGGCTAGAAT CACTGCTGAG TTGAAGATGA CGGAACGTTT GCT 2856 | | |||||| |||||||| | | | |||| ||||||| || ||| |||||| ||| TAATGAAGAC TGGCTAGAGT TGCGGTTGAG TTGAAGACGA CGGTACGTTT GCT 449 hqPGS_C06HBa0153O03.1-2-_SGN-E370357+ (3264 2856) ******************************************************************************** EST sequence 10 -strand 299 n (File: SGN-E373117-) 1 TTTTTTTTTT TCTAAAAACT CAACAACTAT TATTATCCCC AAAATCTGGA AGTCATCATC 61 ACAAGAACAT CTACTTCAAA TTACTAAATC TAAGAGTATC TAAGAAGCTA AAATACATAA 121 ACAGCTAGTC CATGCCGGAA CTTCAAGGCA TCAAGACATG AAGAAGAAGA TCCAGTCCAA 181 GCTAGAAGCG TTAGCTCACC CTGAAATCCG ATGTAATGAA GACTGGCTAG AGTTGCGGTT 241 GAGTTGAAGA CGACGGTACG TTTGCCAAAA TTACGACAGT ATTTGGACAA GCTAGAAGA Predicted gene structure (within gDNA segment 4057 to 1234): Exon 1 3121 2859 ( 263 n); cDNA 1 263 ( 263 n); score: 0.844 MATCH C06HBa0153O03.1-2- SGN-E373117- 0.844 263 0.880 C PGS_C06HBa0153O03.1-2-_SGN-E373117- (3121 2859) Alignment (genomic DNA sequence = upper lines): TTCATTAACT TCT-AAAATT CAACATCTAT TATT-CCCCC AAAATCTGGA AGTCATCATC 3064 || || | ||| |||| | ||||| |||| |||| |||| |||||||||| |||||||||| TTTTTTTTTT TCTAAAAACT CAACAACTAT TATTATCCCC AAAATCTGGA AGTCATCATC 60 ACAAGAACAT CTACGATCAA ATGACTAAA- CTAAGAGTAT TCTAA-AAGC TAAAAATACA 3006 |||||||||| |||| |||| || |||||| ||||||||| ||||| |||| | |||||||| ACAAGAACAT CTAC-TTCAA ATTACTAAAT CTAAGAGTA- TCTAAGAAGC T-AAAATACA 117 TAAGAAGCTA GTCCATGCCG GAAGTTCAAG GCATCAAGAC TTGAAGAAGA AGACCCAGTC 2946 ||| ||||| |||||||||| ||| |||||| |||||||||| ||||||||| ||| |||||| TAAACAGCTA GTCCATGCCG GAACTTCAAG GCATCAAGAC ATGAAGAAGA AGATCCAGTC 177 CAAGCTAGAA GCATTAGCTC ACCCTGAATA TCCGGTATGA CGAAGACTGG CTAGAATCAC 2886 |||||||||| || ||||||| |||||||| | |||| | | | ||||||||| ||||| | | CAAGCTAGAA GCGTTAGCTC ACCCTGAA-A TCCGATGTAA TGAAGACTGG CTAGAGTTGC 236 TGCTGAGTTG AAGATGACGG AACGTTT 2859 | ||||||| |||| ||||| |||||| GGTTGAGTTG AAGACGACGG TACGTTT 263 hqPGS_C06HBa0153O03.1-2-_SGN-E373117- (3121 2859) ******************************************************************************** EST sequence 88 +strand 299 n (File: SGN-E373116+) 1 TTTTTTTTTT CTAAAAACTC AACAACTATT ATTATCCCCA AAATCTGGAA GTCATCATCA 61 CAAGAACATC TACTTCAAAT TACTAAATCT AAGAGTATCT AAGAAGCTAA AATACATAAA 121 CAGCTAGTCC ATGCCGGAAC TTCAAGGCAT CAAGACATGA AGAAGAAGAT CCAGTCCAAG 181 CTAGAAGCGT TAGCTCACCC TGAAATCCGA TGTAATGAAG ACTGGCTAGA GTTGCGGTTG 241 AGTTGAAGAC GACGGTACGT TTGCCAAAAT TACGACAGTA TTTGGACAAG CTAGAAGAG Predicted gene structure (within gDNA segment 4037 to 1214): Exon 1 3121 2859 ( 263 n); cDNA 1 262 ( 262 n); score: 0.842 MATCH C06HBa0153O03.1-2- SGN-E373116+ 0.842 263 0.880 C PGS_C06HBa0153O03.1-2-_SGN-E373116+ (3121 2859) Alignment (genomic DNA sequence = upper lines): TTCATTAACT TCTAAAATTC AACATCTATT ATT-CCCCCA AAATCTGGAA GTCATCATCA 3063 || || | |||| || |||| ||||| ||| ||||| |||||||||| |||||||||| TTTTTTTTTT CTAAAAACTC AACAACTATT ATTATCCCCA AAATCTGGAA GTCATCATCA 60 CAAGAACATC TACGATCAAA TGACTAAA-C TAAGAGTATT CTAA-AAGCT AAAAATACAT 3005 |||||||||| ||| ||||| | |||||| | |||||||| | |||| ||||| ||||||||| CAAGAACATC TAC-TTCAAA TTACTAAATC TAAGAGTA-T CTAAGAAGCT -AAAATACAT 117 AAGAAGCTAG TCCATGCCGG AAGTTCAAGG CATCAAGACT TGAAGAAGAA GACCCAGTCC 2945 || |||||| |||||||||| || ||||||| ||||||||| |||||||||| || ||||||| AAACAGCTAG TCCATGCCGG AACTTCAAGG CATCAAGACA TGAAGAAGAA GATCCAGTCC 177 AAGCTAGAAG CATTAGCTCA CCCTGAATAT CCGGTATGAC GAAGACTGGC TAGAATCACT 2885 |||||||||| | |||||||| ||||||| || ||| | | | |||||||||| |||| | | AAGCTAGAAG CGTTAGCTCA CCCTGAA-AT CCGATGTAAT GAAGACTGGC TAGAGTTGCG 236 GCTGAGTTGA AGATGACGGA ACGTTT 2859 | |||||||| ||| ||||| |||||| GTTGAGTTGA AGACGACGGT ACGTTT 262 hqPGS_C06HBa0153O03.1-2-_SGN-E373116+ (3121 2859) ******************************************************************************** EST sequence 137 +strand 265 n (File: SGN-E216150+) 1 TTTTTTTTTT TTTTTTTTAA TAAAAATCAA TAATCAACTT GTATAACTCA AAACTTATCA 61 TTCCCCAAAA TCTGGAAGTC ATCATCACCA GAGCCTCTAT CATAAAATTA CTAAACTAAG 121 AGTATTCTAA GAAGCTAAAA ATACATACGA AGCTAGTCCA TGCCGGAAGT TCAAGGCATC 181 AAGACTTGAA GAAGAAGATC CAGTCAAACC TAGAAGCATT AGCTCACCCT GAATTTCCGA 241 TGTAGTAGGA CTGGCTTGAG TTACT Predicted gene structure (within gDNA segment 4730 to 1997): Exon 1 3112 2885 ( 228 n); cDNA 39 265 ( 227 n); score: 0.862 PPA cDNA 18 1 MATCH C06HBa0153O03.1-2- SGN-E216150+ 0.862 228 0.860 C PGS_C06HBa0153O03.1-2-_SGN-E216150+ (3112 2885) Alignment (genomic DNA sequence = upper lines): TTCTAAAATT CAACATCTAT TATTCCCCCA AAATCTGGAA GTCATCATCA CAAGAACATC 3053 || || || | ||| | ||| ||| ||||| |||||||||| |||||||||| | ||| | || TTGTATAACT CAAAACTTAT CATT-CCCCA AAATCTGGAA GTCATCATCA CCAGAGCCTC 97 TACGATCAAA TGACTAAACT AAGAGTATTC TAA-AAGCTA AAAATACATA AGAAGCTAGT 2994 || || ||| | |||||||| |||||||||| ||| |||||| |||||||||| ||||||||| TATCATAAAA TTACTAAACT AAGAGTATTC TAAGAAGCTA AAAATACATA CGAAGCTAGT 157 CCATGCCGGA AGTTCAAGGC ATCAAGACTT GAAGAAGAAG ACCCAGTCCA AGCTAGAAGC 2934 |||||||||| |||||||||| |||||||||| |||||||||| | |||||| | | |||||||| CCATGCCGGA AGTTCAAGGC ATCAAGACTT GAAGAAGAAG ATCCAGTCAA ACCTAGAAGC 217 ATTAGCTCAC CCTGAATATC CGGTATGACG AAGACTGGCT AGAATCACT 2885 |||||||||| ||||||| || || | | | | |||||||| || | ||| ATTAGCTCAC CCTGAATTTC CGATGT-AGT AGGACTGGCT TGAGTTACT 265 hqPGS_C06HBa0153O03.1-2-_SGN-E216150+ (3112 2885) ******************************************************************************** EST sequence 22 -strand 219 n (File: SGN-E298638-) 1 TTTTTTTTTT TCTAAAAACT CAACAACTAT TATTATCCCC AAAATCTGGC AGTCATCATC 61 ACAAGAACAT GTACTTCAAA TTACTAAATC TAAGAGTATC TAAGAAGCTA AAATACATAA 121 ACAGCTAGTC CATGCCGGAA CTTCAAGGCA TCAAGACATG AAGAAGAAGA TCCAGTCCAA 181 GCTAGAAGCG TTAGCTCACC CTGAAATCCG ATGTAATGA Predicted gene structure (within gDNA segment 4830 to 2034): Exon 1 3121 2903 ( 219 n); cDNA 1 219 ( 219 n); score: 0.836 MATCH C06HBa0153O03.1-2- SGN-E298638- 0.836 219 1.000 C PGS_C06HBa0153O03.1-2-_SGN-E298638- (3121 2903) Alignment (genomic DNA sequence = upper lines): TTCATTAACT TCT-AAAATT CAACATCTAT TATT-CCCCC AAAATCTGGA AGTCATCATC 3064 || || | ||| |||| | ||||| |||| |||| |||| ||||||||| |||||||||| TTTTTTTTTT TCTAAAAACT CAACAACTAT TATTATCCCC AAAATCTGGC AGTCATCATC 60 ACAAGAACAT CTACGATCAA ATGACTAAA- CTAAGAGTAT TCTAA-AAGC TAAAAATACA 3006 |||||||||| ||| |||| || |||||| ||||||||| ||||| |||| | |||||||| ACAAGAACAT GTAC-TTCAA ATTACTAAAT CTAAGAGTA- TCTAAGAAGC T-AAAATACA 117 TAAGAAGCTA GTCCATGCCG GAAGTTCAAG GCATCAAGAC TTGAAGAAGA AGACCCAGTC 2946 ||| ||||| |||||||||| ||| |||||| |||||||||| ||||||||| ||| |||||| TAAACAGCTA GTCCATGCCG GAACTTCAAG GCATCAAGAC ATGAAGAAGA AGATCCAGTC 177 CAAGCTAGAA GCATTAGCTC ACCCTGAATA TCCGGTATGA CGA 2903 |||||||||| || ||||||| |||||||| | |||| | | | || CAAGCTAGAA GCGTTAGCTC ACCCTGAA-A TCCGATGTAA TGA 219 hqPGS_C06HBa0153O03.1-2-_SGN-E298638- (3121 2903) ******************************************************************************** EST sequence 14 -strand 402 n (File: SGN-E352844-) 1 TTTTTTTTAT AAAAACCAAT TCAATAACTA TTATTTCCCA AAATCTGGAA GTTATCATCA 61 CAAGAACATC TACTTCGAAT TACTAAATCT AAGAGTATCT AAGAAGCTAA AATACATAAA 121 CAGCTAGTCC ATGCCGGAAC TTCAAGGCAT CAAGACATGA AGAAGAAGAT CCAGTCCAAG 181 CTAGAAGCTT TGTTTTATCG AAAAAAGGTG ATTTTTCGAA AAGAGTTTGT TTTATTTTAA 241 AGTATTTTTC GACTTTAGGA GTCGCCACTT AATTTTTAAG AAAAATCAAG AAAACTCATT 301 CTCAAAACAA TTTAAACAGA AAAGTCGTTT TGAAAATATT TTTTAGGATT CGGGATTCTT 361 ATTAGCGTCT TAGGAAGGTG TTTAAGGCAC CTAAGACACT CC Predicted gene structure (within gDNA segment 4056 to 194): Exon 1 3121 2934 ( 188 n); cDNA 3 188 ( 186 n); score: 0.835 MATCH C06HBa0153O03.1-2- SGN-E352844- 0.835 188 0.468 C PGS_C06HBa0153O03.1-2-_SGN-E352844- (3121 2934) Alignment (genomic DNA sequence = upper lines): TTCATTAACT TCTAAAATTC AACATCTATT ATTCCCCCAA AATCTGGAAG TCATCATCAC 3062 || ||| ||||| || | ||||| ||| ||||| |||||||||| | |||||||| TTTTTTATAA AAACCAATTC AATAACTATT ATT-TCCCAA AATCTGGAAG TTATCATCAC 61 AAGAACATCT ACGATCAAAT GACTAAA-CT AAGAGTATTC TAA-AAGCTA AAAATACATA 3004 |||||||||| || || ||| |||||| || ||||||| || ||| ||||| |||||||||| AAGAACATCT AC-TTCGAAT TACTAAATCT AAGAGTA-TC TAAGAAGCT- AAAATACATA 118 AGAAGCTAGT CCATGCCGGA AGTTCAAGGC ATCAAGACTT GAAGAAGAAG ACCCAGTCCA 2944 | ||||||| |||||||||| | |||||||| |||||||| | |||||||||| | |||||||| AACAGCTAGT CCATGCCGGA ACTTCAAGGC ATCAAGACAT GAAGAAGAAG ATCCAGTCCA 178 AGCTAGAAGC 2934 |||||||||| AGCTAGAAGC 188 hqPGS_C06HBa0153O03.1-2-_SGN-E352844- (3121 2934) ******************************************************************************** EST sequence 31 -strand 666 n (File: SGN-E368629-) 1 TTTTTTTTTT TTTTTTTTTT TTTTTTATAA AAACCAATTC AATAACTATT ATTTCCCAAA 61 ATCTGGAAGT TATCATCACA AGAACATCTA CTTCGAATTA CTAAATCTAG AAGTATCTAA 121 GAGCCTAAAA TACATAACAC AGTTAGTCCA TGCCGAAACT TCAAGGCATC AAGACATAAA 181 GAAGAAGATC CAGTCCAAGC TAGAAGCTTT GTTTTATCGA AAAAAGGTGA TTTTTCGAAA 241 AGAGTTTGTT TTATTTTAAA GTATTTTTCG ACTTTAGGAG TCGCCACTTA ATTTTTAAGA 301 AAAATCAAGA AAACTCATTC TCAAAACAAT TTAAACAGAA AAGTCGTTTT GAAAATATTT 361 TTTAGGATTC GGGATTCTTA TTAGCGTCTT AGGAAGGTGT TTAAGGCACC TAAGACACTC 421 CGTTAAATAC GGTTTTCCAA CGACTAACTT ATTTGATTAT TTTTATTTTT ACCCTTTGCA 481 AATTTATTTG AACTTTTATC ACGATTTACT TAGCCAAACT TTGCAAATTT GAGATATTAA 541 TCTTTTAAGA TTCCGTCTTA GTTAAACTTT CTAAGCCTTA ACTCTCTAAG CAGACTTTCA 601 AATTTTAAAC CTCTATCGTT TCAAAACTTC AATTTTTATT TTTTAGTTTC ATAAAGCAAA 661 AGGCGT Predicted gene structure (within gDNA segment 4236 to 1): Exon 1 3106 2934 ( 173 n); cDNA 36 207 ( 172 n); score: 0.844 PPA cDNA 28 1 MATCH C06HBa0153O03.1-2- SGN-E368629- 0.844 173 0.260 C PGS_C06HBa0153O03.1-2-_SGN-E368629- (3106 2934) Alignment (genomic DNA sequence = upper lines): AATTCAACAT CTATTATTCC CCCAAAATCT GGAAGTCATC ATCACAAGAA CATCTACGAT 3047 ||||||| | |||||||| |||||||||| |||||| ||| |||||||||| ||||||| | AATTCAATAA CTATTATT-T CCCAAAATCT GGAAGTTATC ATCACAAGAA CATCTAC-TT 93 CAAATGACTA AA-CTAAGAG TATTCTAAAA GCTAAAAATA CATAAGA-AG CTAGTCCATG 2989 | ||| |||| || ||| || || ||||| | || |||||| ||||| | || ||||||||| CGAATTACTA AATCTAGAAG TA-TCTAAGA GCCTAAAATA CATAACACAG TTAGTCCATG 152 CCGGAAGTTC AAGGCATCAA GACTTGAAGA AGAAGACCCA GTCCAAGCTA GAAGC 2934 ||| || ||| |||||||||| ||| | |||| |||||| ||| |||||||||| ||||| CCGAAACTTC AAGGCATCAA GACATAAAGA AGAAGATCCA GTCCAAGCTA GAAGC 207 hqPGS_C06HBa0153O03.1-2-_SGN-E368629- (3106 2934) ******************************************************************************** EST sequence 26 -strand 620 n (File: SGN-E238551-) 1 CTATTATTTC CCAAAATCTG GAAGTTATCA TCACAAGAAC ATCTACTTCG AATTACTAAA 61 TCTAAGAGTA TCTAAGAAGC TAAAATACAT AAACAGCTAG TCCATGCCGG AACTTCAAGG 121 CATCAAGACA TGAAGAAGAA GATCCAGTCC AAGCTAGAAG CTTTGTTTTA TCGAAAAAAG 181 GTGATTTTTC GAAAAGAGTT TGTTTTATTT TAAAGTATTT TTCGACTTTA GGAGTCGCCA 241 CTTAATTTTT AAGAAAAATC AAGAAAACTC ATTCTCAAAA CAATTTAAAC AGAAAAGTCG 301 TTTTGAAAAT ATTTTTTAGG ATTCGGGATT CTTATTAGCG TCTTAGGAAG GTGTTTAAGG 361 CACCTAAGAC ACTCCGTTAA ATACGGTTTT CCAACGACTA ACTTATTTGA TTATTTTTAT 421 TTTTACCCTT TGCAAATTTA TTTGAACTTT TATCACGATT TACTTAGCCA AACTTTGCAA 481 ATTTGAGATA TTAATCTTTT AAGATTCCGT CTTAGTTAAA CTTTCTAAGC CTTAACTCTC 541 TAAGCAGACT TTCAAATTTT AAACCTCTAT CGTTTCAAAA CTTCAATTTT TATTTTTTAG 601 TTTCATAAAG CAAAAGGCGT Predicted gene structure (within gDNA segment 3786 to 1): Exon 1 3096 2934 ( 163 n); cDNA 1 161 ( 161 n); score: 0.880 MATCH C06HBa0153O03.1-2- SGN-E238551- 0.880 163 0.263 C PGS_C06HBa0153O03.1-2-_SGN-E238551- (3096 2934) Alignment (genomic DNA sequence = upper lines): CTATTATTCC CCCAAAATCT GGAAGTCATC ATCACAAGAA CATCTACGAT CAAATGACTA 3037 |||||||| |||||||||| |||||| ||| |||||||||| ||||||| | | ||| |||| CTATTATT-T CCCAAAATCT GGAAGTTATC ATCACAAGAA CATCTAC-TT CGAATTACTA 58 AA-CTAAGAG TATTCTAAAA GCTAAAAATA CATAAGAAGC TAGTCCATGC CGGAAGTTCA 2978 || ||||||| ||| | || ||| |||||| ||||| ||| |||||||||| ||||| |||| AATCTAAGAG TATCTAAGAA GCT-AAAATA CATAAACAGC TAGTCCATGC CGGAACTTCA 117 AGGCATCAAG ACTTGAAGAA GAAGACCCAG TCCAAGCTAG AAGC 2934 |||||||||| || ||||||| ||||| |||| |||||||||| |||| AGGCATCAAG ACATGAAGAA GAAGATCCAG TCCAAGCTAG AAGC 161 hqPGS_C06HBa0153O03.1-2-_SGN-E238551- (3096 2934) ******************************************************************************** EST sequence 24 -strand 286 n (File: SGN-E355114-) 1 CCACAGCCCC AGTGGCTGGC TCAGTTGTTT CTTGTCTGGC CGGTGTTGGT GTTGACGTGG 61 TCGTTGCTCT AGTTCTAACC ATCTGCAAAA GAGAGTGAAG ATGGTCAGAT ACCAATTTGT 121 ATCGCCTAGA TACCAATTGG ACTCAAGTAG TAGCACGAAA GAAAGAATGA AAGGGTGAAA 181 TTTTCCTAAA GTCTTATAGC CTCTCAAAGA AAAGTAAAGG CGTCCCCCTA CCGTTCCTAA 241 AGACTCTACT AGACCTGTTC TTGTGTGATG AGACCAACGA ACCTAA Predicted gene structure (within gDNA segment 4321 to 2682): Exon 1 3567 3282 ( 286 n); cDNA 1 286 ( 286 n); score: 0.955 MATCH C06HBa0153O03.1-2- SGN-E355114- 0.955 286 1.000 C PGS_C06HBa0153O03.1-2-_SGN-E355114- (3567 3282) Alignment (genomic DNA sequence = upper lines): CCACAGCCCC AGTGGTTGGC TCAGTTGTTT CTTGTCTGGC CGGTATTGGT GTTGGCGTAG 3508 |||||||||| ||||| |||| |||||||||| |||||||||| |||| ||||| |||| ||| | CCACAGCCCC AGTGGCTGGC TCAGTTGTTT CTTGTCTGGC CGGTGTTGGT GTTGACGTGG 60 TCGTTGCTCT AGTTCTAACC ATCTGTGAAA GAGAGTGAAG ATGGTCAGAT ACTAATTCGT 3448 |||||||||| |||||||||| ||||| ||| |||||||||| |||||||||| || |||| || TCGTTGCTCT AGTTCTAACC ATCTGCAAAA GAGAGTGAAG ATGGTCAGAT ACCAATTTGT 120 ATCGCCTAGA TACCAATTGG ACTCAAGTAG TAGCACGAAA GAAAGAATGA GAGAGTGAAA 3388 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| || |||||| ATCGCCTAGA TACCAATTGG ACTCAAGTAG TAGCACGAAA GAAAGAATGA AAGGGTGAAA 180 TTTTCCTAAA GTCTTATAGC CTCTCAAGAA AAAGTAAAGG CGTCCCCCTA CCGTTCCTTA 3328 |||||||||| |||||||||| ||||||| | |||||||||| |||||||||| |||||||| | TTTTCCTAAA GTCTTATAGC CTCTCAAAGA AAAGTAAAGG CGTCCCCCTA CCGTTCCTAA 240 AGACTCTACT AGACCTGTTC TTGTGTGATG AGACCAACGA ACCTAA 3282 |||||||||| |||||||||| |||||||||| |||||||||| |||||| AGACTCTACT AGACCTGTTC TTGTGTGATG AGACCAACGA ACCTAA 286 hqPGS_C06HBa0153O03.1-2-_SGN-E355114- (3567 3282) ******************************************************************************** EST sequence 116 +strand 694 n (File: SGN-E353359+) 1 TGTAGTCTAT GCACATTCAA AAACTGCCGA TCTTTACAAC AAAACCGGAG CACCCCAAGG 61 AGATGCACTT GGTCTAATGA AGCCTTTGTT CAATAACTCT TGAAGTTGTG CCTTTAACTC 121 TCTTAACTCT GCGGGAGCCA TTCTATAAGG GGGTATAAAA ATGGGGCGTG TGCCCGGTTC 181 GAGATCAATA CAGAAGTCAA TATCCCTATC CGGTGGCATA CCAGGATGAT CTGCAGGGAA 241 CACATCCATA AACTCACGAA CTACTGAAAC TGAGTCAATC GAAGGTACTT GGGTAGTGTT 301 ATCCTTGAGA TGTGCCAAGA AAGCTAAACA CCCTTTACTA ACCATTTTCT TAGCACGAAG 361 AAAGGAGATG ATACGCACCG GATTGGAAGT GTAGTCACCC TCCTACACTA ACAGATCTGT 421 CCCAGGCTTG GCTAACGTCA CGGTTTTAGC ATTACAATCC AAGATTGCAA AATTTGGAGA 481 AAGCCAAGTC ATACCCAGAA TTACATCGAA GTCAACCATT TCTTGCAAGG ATTTTGCCGT 541 AGCCGCTACC TGTAACGCTG AAATCCGCAA CTCTGACCTC AACCCTTTCA CAAAACGACG 601 AATCCTCTCT TGTGGACTGA AACAAAGTTG AGTGGCATAT CTGGATAGTG CACGAAACTT 661 AGCCTCATAT GCATTGACCT ACATCCTACC TTGC Predicted gene structure (within gDNA segment 6637 to 3522): Exon 1 5675 5185 ( 491 n); cDNA 31 520 ( 490 n); score: 0.943 Intron 1 5184 4441 ( 744 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0.88) Exon 2 4440 4267 ( 174 n); cDNA 521 694 ( 174 n); score: 0.897 MATCH C06HBa0153O03.1-2- SGN-E353359+ 0.931 665 0.958 C PGS_C06HBa0153O03.1-2-_SGN-E353359+ (5675 5185,4440 4267) Alignment (genomic DNA sequence = upper lines): TCTTTACAAA CAAAACCGGA GCACCCCAAG GAGATGCACT TGGTCTAATA AAGCCTTTGT 5616 ||||||| || |||||||||| |||||||||| |||||||||| ||||||||| |||||||||| TCTTTAC-AA CAAAACCGGA GCACCCCAAG GAGATGCACT TGGTCTAATG AAGCCTTTGT 89 TCAATAACTC TTGAAGTTGT GCCTTTAACT CTCTTAACTC TGCGGGAGCC ATTCTATAAG 5556 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCAATAACTC TTGAAGTTGT GCCTTTAACT CTCTTAACTC TGCGGGAGCC ATTCTATAAG 149 GGGGTATAGA AATGGGGCGT GTGCCCGGTT CTAGATCGAT ACAGAAGTCA ATATCCCTAT 5496 |||||||| | |||||||||| |||||||||| | ||||| || |||||||||| |||||||||| GGGGTATAAA AATGGGGCGT GTGCCCGGTT CGAGATCAAT ACAGAAGTCA ATATCCCTAT 209 CTGGTGGCAT ACCAGGAAGA TCTGCAGGGA ACACATCCAG AAACTCACGG ACTACTGAAA 5436 | |||||||| ||||||| || |||||||||| ||||||||| ||||||||| |||||||||| CCGGTGGCAT ACCAGGATGA TCTGCAGGGA ACACATCCAT AAACTCACGA ACTACTGAAA 269 CCGACTCAAT CGAAGGCACT TGGGTAGTGT CATCCTTGAG ATGTGCCAAG AAAGCTAAAC 5376 | || ||||| |||||| ||| |||||||||| ||||||||| |||||||||| |||||||||| CTGAGTCAAT CGAAGGTACT TGGGTAGTGT TATCCTTGAG ATGTGCCAAG AAAGCTAAAC 329 AACCTTTACT AACCATTTTC TTAGCACGAA GAAAGGAGAT GATATGCACC GGATTGGAAG 5316 | |||||||| |||||||||| |||||||||| |||||||||| |||| ||||| |||||||||| ACCCTTTACT AACCATTTTC TTAGCACGAA GAAAGGAGAT GATACGCACC GGATTGGAAG 389 CGTTGTCACC CTCCCACACT AACGGATCTG TCCCAGGCTT GGCTAACGTC ACCGTTTTAG 5256 || |||||| |||| ||||| ||| |||||| |||||||||| |||||||||| || ||||||| TGTAGTCACC CTCCTACACT AACAGATCTG TCCCAGGCTT GGCTAACGTC ACGGTTTTAG 449 CATTACAATC CAAGATCGCA AATTGCGGAG AAAGCCAAGT CATACCTAGA ATTACATCAA 5196 |||||||||| |||||| ||| || | |||| |||||||||| |||||| ||| |||||||| | CATTACAATC CAAGATTGCA AAATTTGGAG AAAGCCAAGT CATACCCAGA ATTACATCGA 509 AATCATCCAT TTCTAAGATA ACCAAATCTA CATAAGTGTT GCTCCCTACA AAGTTCACCA 5136 | ||| |||| | AGTCAACCAT T......... .......... .......... .......... .......... 520 AAAAAGACCT ATACACCTTT TCAACTACCA CAGATTCACC CACCGGAGTA GAAACACGAA 5076 .......... .......... .......... .......... .......... .......... 520 TAGGCATATC AAGTAATTCA CAATGTAAAT TTAGACCATT AGCAAATGAG GAAGATACAT 5016 .......... .......... .......... .......... .......... .......... 520 AAGAAAATGT GGATCCAGGA TCAAACAATA CAGAGGCCAT GCAATCACAA ACCAGAAGAT 4956 .......... .......... .......... .......... .......... .......... 520 TACCTGTGAT GACAGCATCA GATGCCTCCG CTTCAGACCG CCCAGGGAAA GCGTAACAAT 4896 .......... .......... .......... .......... .......... .......... 520 GGGCCCTATC GTTCGTCTGT CCGTTGCCCC TAACTTGTTG TGATGTAGTG GCTCCAGTTT 4836 .......... .......... .......... .......... .......... .......... 520 GCCCATCACC TTGGCCGTTT TGGTTACCAC CATTTCCTTG ACCACCACGT CCTCCAGAAT 4776 .......... .......... .......... .......... .......... .......... 520 AACGGCCTCT GCCATGACCA CCTCTACCTC TAACATTTGG AGGTCTGTAA CTCTGTTTTG 4716 .......... .......... .......... .......... .......... .......... 520 GACAATATCT CTTAATATGT TCGATCTCCC CACATCCATA ACACTTTCTG GGTTCATGCA 4656 .......... .......... .......... .......... .......... .......... 520 TAGGTCTCTC AGAGAAGTGT TGACCGGTCG GAGGTGGACC CCCAACTACA GTCTGTAGTG 4596 .......... .......... .......... .......... .......... .......... 520 AAGACTGAAT TGGTCGGACT GAGTAACTTC CCGAACCCTG TCCTCTAGTG TAAGCACCAT 4536 .......... .......... .......... .......... .......... .......... 520 TAAACTCACC TCCCTTTCGA AACCTTTTTG ATGTCAATGT CGGGGTGAAG TCGTCTGGCT 4476 .......... .......... .......... .......... .......... .......... 520 TCACTCCTTC CACTTCTATC ACAAAGTCTA CCACCTCTTG GAAGGATTTT GCCGTTGCCA 4416 ||||| ||||||||| ||||| ||| .......... .......... .......... .....TCTTG CAAGGATTTT GCCGTAGCCG 545 CTATCTGTAA GGCCGAAATC CGCAATTCTG ACCTCAACCC CTTCACAAAC CGGCGAATTC 4356 ||| |||||| || |||||| ||||| |||| |||||||||| |||||||| || ||||| | CTACCTGTAA CGCTGAAATC CGCAACTCTG ACCTCAACCC TTTCACAAAA CGACGAATCC 605 GCTCTTGTGG ACTGAAACAC AGTTGGGTGG CATACCGGGA TAATGCACGA AACTTAGCCT 4296 ||||||||| ||||||||| ||||| |||| |||| | ||| || ||||||| |||||||||| TCTCTTGTGG ACTGAAACAA AGTTGAGTGG CATATCTGGA TAGTGCACGA AACTTAGCCT 665 CATATGCATT GACCGACATC CTACCTTGC 4267 |||||||||| |||| ||||| ||||||||| CATATGCATT GACCTACATC CTACCTTGC 694 hqPGS_C06HBa0153O03.1-2-_SGN-E353359+ (5675 5185,4440 4267) ******************************************************************************** EST sequence 104 +strand 433 n (File: SGN-E352180+) 1 CCCTTGAAGA CCGGAGGTTT CAATTTCAAG AACTTACTGA AAAGTTCATG CTGATCATTT 61 GTCATTATAG GCCCAGTAGT CAAACGTGGA AATGTACCTA TGTGCAATGG AACATCCATG 121 CGGGGAGCCA TAGTAGCCGC ATGTTGTACT TCTGAAACCG GAGGTGTTGG CGCAGAAAAC 181 ACTGGAGGTG CTTGACCTTG ATCAGATAAA CCGCTAAGAT AAGCCAGAAC CTGATTGATC 241 ATCTCTGGGG TAGGTTGGGG TGGCATTTCC TCATTTTGCA CTTGTTCAGT TTCCCCATCC 301 TCCCCTTCTC TTATTACTTC CTCAGTCGGT GGAGGAGTCA CCGCCCTAGT ACCAGATGGG 361 CTCGGTTCTC GTTCTCTTCC TCTAGATGAC GTCCTCCCAC GACCTCTACC ACGGCCCCTT 421 GCCGCTGTTC TCC Predicted gene structure (within gDNA segment 4713 to 2136): Exon 1 4005 3576 ( 430 n); cDNA 1 431 ( 431 n); score: 0.913 MATCH C06HBa0153O03.1-2- SGN-E352180+ 0.913 430 0.993 C PGS_C06HBa0153O03.1-2-_SGN-E352180+ (4005 3576) Alignment (genomic DNA sequence = upper lines): CCCTTGAATA CTGGAGGTTT CAATTTCAAG AACTTACTGA AAAGTTCATG CTGATCATTT 3946 |||||||| | | |||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCCTTGAAGA CCGGAGGTTT CAATTTCAAG AACTTACTGA AAAGTTCATG CTGATCATTT 60 GTCATTATAG GCCCAGTAGT CAGACGTGGA AACGTGCCTA TTTCCAATGG AACATCCATG 3886 |||||||||| |||||||||| || ||||||| || || |||| | | |||||| |||||||||| GTCATTATAG GCCCAGTAGT CAAACGTGGA AATGTACCTA TGTGCAATGG AACATCCATG 120 CGGGGAGCCA TAGTAGCCGC ATGTTGTACC TCCGGAGCCT GAGGTGCTGG TGTAGAAAAC 3826 |||||||||| |||||||||| ||||||||| || | | || |||||| ||| | ||||||| CGGGGAGCCA TAGTAGCCGC ATGTTGTACT TCTGAAACCG GAGGTGTTGG CGCAGAAAAC 180 ACTGG-GGTG TCTGGCCCTG ATCATATAAC CCGCTAAGAT AAGCCAGAAC CTGATTGATC 3767 ||||| |||| || || || |||| |||| |||||||||| |||||||||| |||||||||| ACTGGAGGTG CTTGACCTTG ATCAGATAAA CCGCTAAGAT AAGCCAGAAC CTGATTGATC 240 ATCTCTGGGG TAGGTTGGGG TGGCAATCCC TCATTCTGCA CTTGTTCAGT TTCCCCATCG 3707 |||||||||| |||||||||| ||||| | || ||||| |||| |||||||||| ||||||||| ATCTCTGGGG TAGGTTGGGG TGGCATTTCC TCATTTTGCA CTTGTTCAGT TTCCCCATCC 300 TCCCCTTCTC TTATTACTTC CTCAGTCGGT GGAGGAGTCA CTGCCCTAGT ATCAGATGGG 3647 |||||||||| |||||||||| |||||||||| |||||||||| | |||||||| | |||||||| TCCCCTTCTC TTATTACTTC CTCAGTCGGT GGAGGAGTCA CCGCCCTAGT ACCAGATGGG 360 CTAGGCGCTC GTCCTCTTCC CCTAGAGGAC GTCCTCCCAC TACCTCTACC ATGGCCCCTT 3587 || || ||| || ||||||| ||||| ||| |||||||||| ||||||||| | |||||||| CTCGGTTCTC GTTCTCTTCC TCTAGATGAC GTCCTCCCAC GACCTCTACC ACGGCCCCTT 420 GCCGCTGTTC T 3576 |||||||||| | GCCGCTGTTC T 431 hqPGS_C06HBa0153O03.1-2-_SGN-E352180+ (4005 3576) ******************************************************************************** EST sequence 40 -strand 554 n (File: SGN-E329287-) 1 AGAATGATGC CCAAGTCATA CGTGGTGCCT CTGTTGGTTG ACACTCAACA TGTGACCGCC 61 ACCACATTTT GGCGTTCCCT TGAAACTGAT AAGTCACAAA CTCAACACCA AACCGTTCTA 121 CTATACCCAT CTTGTGTAGT AGCTCATGAC AGTCAACCAG AAAATCGTAG GCATCCTCAG 181 ATTCAACACC CTTGTAGACT TGAGGTTTCA ATTTCAAGAA CTTACTAAAA AGTTCATGCT 241 GATCATTTGT CATTATAGGC CCTGTAGTCA GACGGGGAAA CGTTCCTATT TCCAATGAGG 301 TATCGATGCG GGGAGTCACA GTAGGCGCAT GTTGTACCTC CGGAGCCTGA GGTGCTAGTG 361 CAGAAAACAC TGGAGGTGCT TGGCCTTGAT CAGATAACCC GCTAAGGTAA GCAAGAACCT 421 GATTGATCAT CTCTTGGGTA GGTTGGGGTG GCAATTCCTC ATTCTGCACT TGTTCATTCT 481 CCCCATCCTC ACCCTCTCTT ACCACTTCCT CAGTTGGTGG AGGTGTCACC GCCTTAGTAC 541 CAGATGGGCT AGGT Predicted gene structure (within gDNA segment 4992 to 2339): Exon 1 4193 3642 ( 552 n); cDNA 1 553 ( 553 n); score: 0.909 MATCH C06HBa0153O03.1-2- SGN-E329287- 0.909 552 0.996 C PGS_C06HBa0153O03.1-2-_SGN-E329287- (4193 3642) Alignment (genomic DNA sequence = upper lines): AGAATGAGGC CCAAGTCATA GGTGGTGCCT CTGTTGGTTG ACACTCAATA TGTGACCGCC 4134 ||||||| || |||||||||| ||||||||| |||||||||| |||||||| | |||||||||| AGAATGATGC CCAAGTCATA CGTGGTGCCT CTGTTGGTTG ACACTCAACA TGTGACCGCC 60 ACCACATTTT GGCATTACCT TGAAACTGAT AACTTACGAA CTCAACACCA AACCGTTCTA 4074 |||||||||| ||| || ||| |||||||||| || | || || |||||||||| |||||||||| ACCACATTTT GGCGTTCCCT TGAAACTGAT AAGTCACAAA CTCAACACCA AACCGTTCTA 120 CTATACCCAT CTTGTGTAGT AGCTCATGAC AGTCAACCAG AAAATCGTAA GCATCCTCAG 4014 |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| |||||||||| CTATACCCAT CTTGTGTAGT AGCTCATGAC AGTCAACCAG AAAATCGTAG GCATCCTCAG 180 ATTCCGCACC CTTGAATACT GGAGGTTTCA ATTTCAAGAA CTTACTGAAA AGTTCATGCT 3954 |||| |||| |||| | ||| ||||||||| |||||||||| |||||| ||| |||||||||| ATTCAACACC CTTGTAGACT TGAGGTTTCA ATTTCAAGAA CTTACTAAAA AGTTCATGCT 240 GATCATTTGT CATTATAGGC CCAGTAGTCA GACGTGGAAA CGTGCCTATT TCCAATGGAA 3894 |||||||||| |||||||||| || ||||||| |||| ||||| ||| |||||| ||||||| GATCATTTGT CATTATAGGC CCTGTAGTCA GACGGGGAAA CGTTCCTATT TCCAATGAGG 300 CATCCATGCG GGGAGCCATA GTAGCCGCAT GTTGTACCTC CGGAGCCTGA GGTGCTGGTG 3834 ||| ||||| ||||| || | |||| ||||| |||||||||| |||||||||| |||||| ||| TATCGATGCG GGGAGTCACA GTAGGCGCAT GTTGTACCTC CGGAGCCTGA GGTGCTAGTG 360 TAGAAAACAC TGG-GGTGTC TGGCCCTGAT CATATAACCC GCTAAGATAA GCCAGAACCT 3775 ||||||||| ||| |||| ||||| |||| || ||||||| |||||| ||| || ||||||| CAGAAAACAC TGGAGGTGCT TGGCCTTGAT CAGATAACCC GCTAAGGTAA GCAAGAACCT 420 GATTGATCAT CTCTGGGGTA GGTTGGGGTG GCAATCCCTC ATTCTGCACT TGTTCAGTTT 3715 |||||||||| |||| ||||| |||||||||| ||||| |||| |||||||||| |||||| | | GATTGATCAT CTCTTGGGTA GGTTGGGGTG GCAATTCCTC ATTCTGCACT TGTTCATTCT 480 CCCCATCGTC CCCTTCTCTT ATTACTTCCT CAGTCGGTGG AGGAGTCACT GCCCTAGTAT 3655 ||||||| || || |||||| | ||||||| |||| ||||| ||| ||||| ||| ||||| CCCCATCCTC ACCCTCTCTT ACCACTTCCT CAGTTGGTGG AGGTGTCACC GCCTTAGTAC 540 CAGATGGGCT AGG 3642 |||||||||| ||| CAGATGGGCT AGG 553 hqPGS_C06HBa0153O03.1-2-_SGN-E329287- (4193 3642) ******************************************************************************** EST sequence 54 +strand 716 n (File: SGN-E577713+) 1 CGGTGGATAC CTAGGCACCC AGAGACGAGG AAGGGCGTAG TAATCGACGA AATGCTTCGG 61 GGAGTTGAAA ATAAGCATAG ATCCGGAGAT TCCCGAATAG GGCAACCTTT CGAACTGCTG 121 CTGAATCCAT GGGCAGGCAA GAGACAACCT GGCGAACTGA AACATCTTAG TAGCCAGAGG 181 AAAAGAAAGC AAATAAGGAA GATACATAAG AAAACGTGGA TCCAGGATCA AACAATACAG 241 AAGCCATGCA ATCACAAACC AGAAGATTAC CTGTGATGAC AACATCAGAT GCCTCCGCTT 301 CACACCGCCC AGGGAAAGCG TAACAATGGG CCCTATCGTT CGTCTGTCCG TTGCCCCTAC 361 CATGTTGTGA TGTAGTGGCC CAAGTTTGCC CATTACCTCT GCCGTTTTGG TGACCACCAT 421 TACCTCGACC ACCACGTCCT CCAGAATAAC GGCCTCTACC ATGACCACCT CTACCTCTAG 481 CTATTGGGGG TCTATAACTT TGTCTGGGAC AATTTTTCCT AATATGTCCA ATCTCCCCAC 541 ATCCATACCA TTCTCTGGAG TCAATCCTAG GCCCCTCGGA GAAGTGTTGA CCGGTCTGAG 601 GTGGTCCCCC AACTACAGTC TGTAGTGAAG ACTCAATTGG TCGGACTGAG TAACTTCCCG 661 AACCCTGTCC TCTAGTGTAA GAACCATTAA ACTCACCTCC CTTTCGAAGC CTTTTT Predicted gene structure (within gDNA segment 7577 to 3825): Exon 1 5095 5073 ( 23 n); cDNA 163 185 ( 23 n); score: 0.652 Intron 1 5072 5038 ( 35 n); Pd: 0.729 (s: 0), Pa: 0.000 (s: 0.92) Exon 2 5037 4507 ( 531 n); cDNA 186 716 ( 531 n); score: 0.911 MATCH C06HBa0153O03.1-2- SGN-E577713+ 0.911 554 0.774 C PGS_C06HBa0153O03.1-2-_SGN-E577713+ (5095 5073,5037 4507) Alignment (genomic DNA sequence = upper lines): CACCGGAGTA GAAACACGAA TAGGCATATC AAGTAATTCA CAATGTAAAT TTAGACCATT 5036 || | |||| | | | ||| || CATCTTAGTA GCCAGAGGAA AAG....... .......... .......... ........AA 187 AGCAAATGAG GAAGATACAT AAGAAAATGT GGATCCAGGA TCAAACAATA CAGAGGCCAT 4976 ||||||| || |||||||||| ||||||| || |||||||||| |||||||||| |||| ||||| AGCAAATAAG GAAGATACAT AAGAAAACGT GGATCCAGGA TCAAACAATA CAGAAGCCAT 247 GCAATCACAA ACCAGAAGAT TACCTGTGAT GACAGCATCA GATGCCTCCG CTTCAGACCG 4916 |||||||||| |||||||||| |||||||||| |||| ||||| |||||||||| ||||| |||| GCAATCACAA ACCAGAAGAT TACCTGTGAT GACAACATCA GATGCCTCCG CTTCACACCG 307 CCCAGGGAAA GCGTAACAAT GGGCCCTATC GTTCGTCTGT CCGTTGCCCC TAACTTGTTG 4856 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| || | ||||| CCCAGGGAAA GCGTAACAAT GGGCCCTATC GTTCGTCTGT CCGTTGCCCC TACCATGTTG 367 TGATGTAGTG GCTCCAGTTT GCCCATCACC TTGGCCGTTT TGGTTACCAC CATTTCCTTG 4796 |||||||||| || | ||||| |||||| ||| | ||||||| |||| ||||| |||| ||| | TGATGTAGTG GCCCAAGTTT GCCCATTACC TCTGCCGTTT TGGTGACCAC CATTACCTCG 427 ACCACCACGT CCTCCAGAAT AACGGCCTCT GCCATGACCA CCTCTACCTC TAACATTTGG 4736 |||||||||| |||||||||| |||||||||| ||||||||| |||||||||| || | |||| ACCACCACGT CCTCCAGAAT AACGGCCTCT ACCATGACCA CCTCTACCTC TAGCTATTGG 487 AGGTCTGTAA CTCTGTTTTG GACAATATCT CTTAATATGT TCGATCTCCC CACATCCATA 4676 ||||| ||| || ||| | | |||||| | | | |||||||| | ||||||| |||||||||| GGGTCTATAA CTTTGTCTGG GACAATTTTT CCTAATATGT CCAATCTCCC CACATCCATA 547 ACACTTTCTG GGTTCATGCA TAGGTCTCTC AGAGAAGTGT TGACCGGTCG GAGGTGGACC 4616 || | |||| | ||| | |||| | ||| ||||||||| ||||||||| ||||||| || CCATTCTCTG GAGTCAATCC TAGGCCCCTC GGAGAAGTGT TGACCGGTCT GAGGTGGTCC 607 CCCAACTACA GTCTGTAGTG AAGACTGAAT TGGTCGGACT GAGTAACTTC CCGAACCCTG 4556 |||||||||| |||||||||| |||||| ||| |||||||||| |||||||||| |||||||||| CCCAACTACA GTCTGTAGTG AAGACTCAAT TGGTCGGACT GAGTAACTTC CCGAACCCTG 667 TCCTCTAGTG TAAGCACCAT TAAACTCACC TCCCTTTCGA AACCTTTTT 4507 |||||||||| |||| ||||| |||||||||| |||||||||| | ||||||| TCCTCTAGTG TAAGAACCAT TAAACTCACC TCCCTTTCGA AGCCTTTTT 716 hqPGS_C06HBa0153O03.1-2-_SGN-E577713+ (5095 5073,5037 4507) ******************************************************************************** EST sequence 128 +strand 720 n (File: SGN-E356614+) 1 GCGAATCCGC TCTTGAGGAC TGAAACAGAG TTGAGTGGCA TATCTGGATA GTGCACGAAA 61 CTTAGCCTCA TAAGCATTAA CTGACACCCT ACCTTGCTCT AGGCTCAAGA ACTCATCCCT 121 TTTCTTATCC CTCAAAGTCC GGGGGATATA CTTCTCCATA AACAAACTAT AGAATGAGGC 181 CCAAGTCATA GGTGATGCCT CTATTGGTTG ACACTCGATA TGTGACCGCC ACCACATTTG 241 GGCGTTCCCT TGAAACTGAT AAGTCACAAA CTCCACACCA AACCGTTCTA CTATACCCAT 301 CTTGTGTAGT AGCTCGTGAC AGTCAACCAG AAAATCATAA GCATCCTCCG ATTCAGCACC 361 CTTGAAGACC GGAGGTTTCA ATTTCAAGAA CTTACTGAAA AGTTCATGCT GATCATTTGT 421 CATTATAGGG ACAGTAGTCA AACGTGGAAA TGTGCCTATG TCCAATGGAA CATCCATGCG 481 GGGAGCCATA GTAGCCGCAT GTTGTACTTC TGAAACCGGA GGTGTTGGCG CAGAAAACAC 541 TGGAGGTGCT TGACCTTGAT CAGATAACCC GCTAAGATAA GCCAGAACCT GATTGATCAT 601 CTCTGGGGTA GGTTGGGGTG GCATTTCCTC ATTTTGCACT TGNTCAGTTT CCCCATCCCT 661 CCCTTTCTCT ATTACTTCCT CAGTCAGTGG AAGAGTCACT GCCCTAGTAT CAGATGGGCT Predicted gene structure (within gDNA segment 5422 to 3035): Exon 1 4363 3645 ( 719 n); cDNA 1 720 ( 720 n); score: 0.912 MATCH C06HBa0153O03.1-2- SGN-E356614+ 0.912 719 0.999 C PGS_C06HBa0153O03.1-2-_SGN-E356614+ (4363 3645) Alignment (genomic DNA sequence = upper lines): GCGAATTCGC TCTTGTGGAC TGAAACACAG TTGGGTGGCA TACCGGGATA ATGCACGAAA 4304 |||||| ||| ||||| |||| ||||||| || ||| |||||| || | ||||| ||||||||| GCGAATCCGC TCTTGAGGAC TGAAACAGAG TTGAGTGGCA TATCTGGATA GTGCACGAAA 60 CTTAGCCTCA TATGCATTGA CCGACATCCT ACCTTGCTCT AGGCTCAAGA ACTCATCCCT 4244 |||||||||| || ||||| | | |||| ||| |||||||||| |||||||||| |||||||||| CTTAGCCTCA TAAGCATTAA CTGACACCCT ACCTTGCTCT AGGCTCAAGA ACTCATCCCT 120 TTTCCTATCC CTCAAAGTTC GGGGGATATA CTTCTCCATA AACAAGCTAG AGAATGAGGC 4184 |||| ||||| |||||||| | |||||||||| |||||||||| ||||| ||| |||||||||| TTTCTTATCC CTCAAAGTCC GGGGGATATA CTTCTCCATA AACAAACTAT AGAATGAGGC 180 CCAAGTCATA GGTGGTGCCT CTGTTGGTTG ACACTCAATA TGTGACCGCC ACCACATTTT 4124 |||||||||| |||| ||||| || ||||||| |||||| ||| |||||||||| ||||||||| CCAAGTCATA GGTGATGCCT CTATTGGTTG ACACTCGATA TGTGACCGCC ACCACATTTG 240 GGCATTACCT TGAAACTGAT AACTTACGAA CTCAACACCA AACCGTTCTA CTATACCCAT 4064 ||| || ||| |||||||||| || | || || ||| |||||| |||||||||| |||||||||| GGCGTTCCCT TGAAACTGAT AAGTCACAAA CTCCACACCA AACCGTTCTA CTATACCCAT 300 CTTGTGTAGT AGCTCATGAC AGTCAACCAG AAAATCGTAA GCATCCTCAG ATTCCGCACC 4004 |||||||||| ||||| |||| |||||||||| |||||| ||| |||||||| | |||| ||||| CTTGTGTAGT AGCTCGTGAC AGTCAACCAG AAAATCATAA GCATCCTCCG ATTCAGCACC 360 CTTGAATACT GGAGGTTTCA ATTTCAAGAA CTTACTGAAA AGTTCATGCT GATCATTTGT 3944 |||||| || |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTGAAGACC GGAGGTTTCA ATTTCAAGAA CTTACTGAAA AGTTCATGCT GATCATTTGT 420 CATTATAGGC CCAGTAGTCA GACGTGGAAA CGTGCCTATT TCCAATGGAA CATCCATGCG 3884 ||||||||| ||||||||| ||||||||| |||||||| |||||||||| |||||||||| CATTATAGGG ACAGTAGTCA AACGTGGAAA TGTGCCTATG TCCAATGGAA CATCCATGCG 480 GGGAGCCATA GTAGCCGCAT GTTGTACCTC CGGAGCCTGA GGTGCTGGTG TAGAAAACAC 3824 |||||||||| |||||||||| ||||||| || | | || || |||| ||| | ||||||||| GGGAGCCATA GTAGCCGCAT GTTGTACTTC TGAAACCGGA GGTGTTGGCG CAGAAAACAC 540 TGG-GGTGTC TGGCCCTGAT CATATAACCC GCTAAGATAA GCCAGAACCT GATTGATCAT 3765 ||| |||| || || |||| || ||||||| |||||||||| |||||||||| |||||||||| TGGAGGTGCT TGACCTTGAT CAGATAACCC GCTAAGATAA GCCAGAACCT GATTGATCAT 600 CTCTGGGGTA GGTTGGGGTG GCAATCCCTC ATTCTGCACT TGTTCAGTTT CCCCAT-CGT 3706 |||||||||| |||||||||| ||| | |||| ||| |||||| || ||||||| |||||| | | CTCTGGGGTA GGTTGGGGTG GCATTTCCTC ATTTTGCACT TGNTCAGTTT CCCCATCCCT 660 CCCCTTCTCT TATTACTTCC TCAGTCGGTG GAGGAGTCAC TGCCCTAGTA TCAGATGGGC 3646 ||| ||||| |||||||||| |||||| ||| || ||||||| |||||||||| |||||||||| CCCTTTCTC- TATTACTTCC TCAGTCAGTG GAAGAGTCAC TGCCCTAGTA TCAGATGGGC 719 T 3645 | T 720 hqPGS_C06HBa0153O03.1-2-_SGN-E356614+ (4363 3645) ******************************************************************************** EST sequence 69 +strand 664 n (File: SGN-E352401+) 1 AATCCGCTCT TGAGGACTGA AACAGAGTTG AGTGGCATAT CTGGATAGTG CACGAAACTT 61 AGCCTCATAA GCATTAACTG ACACCCTACC TTGCTCTAGG CTCAAGAACT CATCCCTTTT 121 CTTATCCCTC AAAGTCCGGG GGATATACTT CTCCATAAAC AAACTATAGA ATGAGGCCCA 181 AGTCATAGGT GATGCCTCTA TTGGTTGACA CTCGATATGT GACCGCCACC ACATTTGGGC 241 GTTCCCTTGA AACTGATAAG TCACAAACTC CACACCAAAC CGTTCTACTA TACCCATCTT 301 GTGTAGTAGC TCGTGACAGT CAACCAGAAA ATCATAAGCA TCCTCCGATT CAGCACCCTT 361 GAAGACCGGA GGTTTCAATT TCAAGAACTT ACTGAAAAGT TCATGCTGAT CATTTGTCAT 421 TATAGGGACA GTAGTCAAAC GTGGAAATGT GCCTATGTCC AATGGAACAT CCATGCGGGG 481 AGCCATAGTA GCCGCATGTT GTACTTCTGA AACCGGAGGT GTTGGCGCAG AAAACACTGG 541 AGGTGCTTGA CCTTGATCAG ATAACCCGCT AAGATAAGCC AGAACCTGAT TGATCATCTC 601 TGGGGTAGGT TGGGGTGCCA TTTCCCTCAT TTGCACTTGT TCAGTTTCCC CATCCTCCCC 661 TTCT Predicted gene structure (within gDNA segment 5392 to 2998): Exon 1 4360 3698 ( 663 n); cDNA 1 664 ( 664 n); score: 0.913 MATCH C06HBa0153O03.1-2- SGN-E352401+ 0.913 663 0.998 C PGS_C06HBa0153O03.1-2-_SGN-E352401+ (4360 3698) Alignment (genomic DNA sequence = upper lines): AATTCGCTCT TGTGGACTGA AACACAGTTG GGTGGCATAC CGGGATAATG CACGAAACTT 4301 ||| |||||| || ||||||| |||| ||||| |||||||| | ||||| || |||||||||| AATCCGCTCT TGAGGACTGA AACAGAGTTG AGTGGCATAT CTGGATAGTG CACGAAACTT 60 AGCCTCATAT GCATTGACCG ACATCCTACC TTGCTCTAGG CTCAAGAACT CATCCCTTTT 4241 ||||||||| ||||| || | ||| |||||| |||||||||| |||||||||| |||||||||| AGCCTCATAA GCATTAACTG ACACCCTACC TTGCTCTAGG CTCAAGAACT CATCCCTTTT 120 CCTATCCCTC AAAGTTCGGG GGATATACTT CTCCATAAAC AAGCTAGAGA ATGAGGCCCA 4181 | |||||||| ||||| |||| |||||||||| |||||||||| || ||| ||| |||||||||| CTTATCCCTC AAAGTCCGGG GGATATACTT CTCCATAAAC AAACTATAGA ATGAGGCCCA 180 AGTCATAGGT GGTGCCTCTG TTGGTTGACA CTCAATATGT GACCGCCACC ACATTTTGGC 4121 |||||||||| | ||||||| |||||||||| ||| |||||| |||||||||| |||||| ||| AGTCATAGGT GATGCCTCTA TTGGTTGACA CTCGATATGT GACCGCCACC ACATTTGGGC 240 ATTACCTTGA AACTGATAAC TTACGAACTC AACACCAAAC CGTTCTACTA TACCCATCTT 4061 || |||||| ||||||||| | || ||||| ||||||||| |||||||||| |||||||||| GTTCCCTTGA AACTGATAAG TCACAAACTC CACACCAAAC CGTTCTACTA TACCCATCTT 300 GTGTAGTAGC TCATGACAGT CAACCAGAAA ATCGTAAGCA TCCTCAGATT CCGCACCCTT 4001 |||||||||| || ||||||| |||||||||| ||| |||||| ||||| |||| | |||||||| GTGTAGTAGC TCGTGACAGT CAACCAGAAA ATCATAAGCA TCCTCCGATT CAGCACCCTT 360 GAATACTGGA GGTTTCAATT TCAAGAACTT ACTGAAAAGT TCATGCTGAT CATTTGTCAT 3941 ||| || ||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAAGACCGGA GGTTTCAATT TCAAGAACTT ACTGAAAAGT TCATGCTGAT CATTTGTCAT 420 TATAGGCCCA GTAGTCAGAC GTGGAAACGT GCCTATTTCC AATGGAACAT CCATGCGGGG 3881 |||||| || ||||||| || ||||||| || |||||| ||| |||||||||| |||||||||| TATAGGGACA GTAGTCAAAC GTGGAAATGT GCCTATGTCC AATGGAACAT CCATGCGGGG 480 AGCCATAGTA GCCGCATGTT GTACCTCCGG AGCCTGAGGT GCTGGTGTAG AAAACACTGG 3821 |||||||||| |||||||||| |||| || | | || ||||| | ||| | || |||||||||| AGCCATAGTA GCCGCATGTT GTACTTCTGA AACCGGAGGT GTTGGCGCAG AAAACACTGG 540 -GGTGTCTGG CCCTGATCAT ATAACCCGCT AAGATAAGCC AGAACCTGAT TGATCATCTC 3762 |||| || || |||||| |||||||||| |||||||||| |||||||||| |||||||||| AGGTGCTTGA CCTTGATCAG ATAACCCGCT AAGATAAGCC AGAACCTGAT TGATCATCTC 600 TGGGGTAGGT TGGGGTGGCA -ATCCCTCAT TCTGCACTTG TTCAGTTTCC CCATCGTCCC 3703 |||||||||| ||||||| || |||||||| | |||||||| |||||||||| ||||| |||| TGGGGTAGGT TGGGGTGCCA TTTCCCTCAT T-TGCACTTG TTCAGTTTCC CCATCCTCCC 659 CTTCT 3698 ||||| CTTCT 664 hqPGS_C06HBa0153O03.1-2-_SGN-E352401+ (4360 3698) ******************************************************************************** EST sequence 124 +strand 713 n (File: SGN-E349404+) 1 GGGGTTCTGT CCTCTAGAAT AAGAACCATT ATACTCTCCT CCCCTTCTAA ACCTCTTCGA 61 TGTCGATGTC GTGGTGAAGT CATCCGGTTT CACTCCTTCC ACCTCAATCA CAAAGTCTAC 121 CACCTCTTGA AAGGATGTTG CTGTTGCCGC TATCTGTAAG GCTGAAATCC GCAATTCTGA 181 TCTCAACCCC TTCACAAAAC GGCGGATCCG TTCTTGTGGA CTAAAACAGA GTTGGGTGGC 241 GTATCTGGAT AGTGCGCGAA ATTTAGCCTC ATACGCGTTT ACTGTCATCC TTCCCTGTTC 301 AAGACTCAAG AACTCATCCC TTTTCCTATC TCTCAAAGTC CTTGGGATAT ATTTCTCCAT 361 GAACAAACTA GAGAATGATT CCCAAGTCAT AGGTGGTGCC TCTGTTGGTT GACACTCAAC 421 ATGTGAACGC CACCACATCT TGGCGTTCCC TTGGAACTGA TAGCTCACGA ACTCGACGCC 481 AAACCTCTCT ACTATCCCCA TTTTGTGTAG TAGCTCATGA CAATCAACCA GAAAATCATA 541 AGCATCTTCA GATTCTGCAC CCTTGAAGAC TGGAGGTTTC AGTTTCAAGA ACTTACTGAA 601 AAGTTCATGT TGATCGTTTG TCATAATAGG CCCAGTAGTC AGACGTGGAA ATGTGCCTAT 661 TCCCAATGGT GCATCTATGC GGGGAGCCAT AGTAGCCGCA TGTCGTATCT CTG Predicted gene structure (within gDNA segment 6631 to 2684): Exon 1 4558 3854 ( 705 n); cDNA 7 711 ( 705 n); score: 0.882 MATCH C06HBa0153O03.1-2- SGN-E349404+ 0.882 705 0.989 C PGS_C06HBa0153O03.1-2-_SGN-E349404+ (4558 3854) Alignment (genomic DNA sequence = upper lines): CTGTCCTCTA GTGTAAGCAC CATTAAACTC ACCTCCCTTT CGAAACCTTT TTGATGTCAA 4499 |||||||||| | |||| || ||||| |||| |||||| || | |||||| | | |||||| | CTGTCCTCTA GAATAAGAAC CATTATACTC TCCTCCCCTT CTAAACCTCT TCGATGTCGA 66 TGTCGGGGTG AAGTCGTCTG GCTTCACTCC TTCCACTTCT ATCACAAAGT CTACCACCTC 4439 ||||| |||| ||||| || | | |||||||| |||||| || |||||||||| |||||||||| TGTCGTGGTG AAGTCATCCG GTTTCACTCC TTCCACCTCA ATCACAAAGT CTACCACCTC 126 TTGGAAGGAT TTTGCCGTTG CCACTATCTG TAAGGCCGAA ATCCGCAATT CTGACCTCAA 4379 ||| |||||| |||| |||| || ||||||| |||||| ||| |||||||||| |||| ||||| TTGAAAGGAT GTTGCTGTTG CCGCTATCTG TAAGGCTGAA ATCCGCAATT CTGATCTCAA 186 CCCCTTCACA AACCGGCGAA TTCGCTCTTG TGGACTGAAA CACAGTTGGG TGGCATACCG 4319 |||||||||| || ||||| | | || ||||| |||||| ||| || ||||||| |||| || | CCCCTTCACA AAACGGCGGA TCCGTTCTTG TGGACTAAAA CAGAGTTGGG TGGCGTATCT 246 GGATAATGCA CGAAACTTAG CCTCATATGC ATTGACCGAC ATCCTACCTT GCTCTAGGCT 4259 ||||| ||| ||||| |||| ||||||| || || || | | ||||| || | | || || || GGATAGTGCG CGAAATTTAG CCTCATACGC GTTTACTGTC ATCCTTCCCT GTTCAAGACT 306 CAAGAACTCA TCCCTTTTCC TATCCCTCAA AGTTCGGGGG ATATACTTCT CCATAAACAA 4199 |||||||||| |||||||||| |||| ||||| ||| | ||| ||||| |||| |||| ||||| CAAGAACTCA TCCCTTTTCC TATCTCTCAA AGTCCTTGGG ATATATTTCT CCATGAACAA 366 GCTAGAGAAT GAGGCCCAAG TCATAGGTGG TGCCTCTGTT GGTTGACACT CAATATGTGA 4139 ||||||||| || |||||| |||||||||| |||||||||| |||||||||| ||| |||||| ACTAGAGAAT GATTCCCAAG TCATAGGTGG TGCCTCTGTT GGTTGACACT CAACATGTGA 426 CCGCCACCAC ATTTTGGCAT TACCTTGAAA CTGATAACTT ACGAACTCAA CACCAAACCG 4079 ||||||||| || ||||| | | ||||| || |||||| || |||||||| | | ||||||| ACGCCACCAC ATCTTGGCGT TCCCTTGGAA CTGATAGCTC ACGAACTCGA CGCCAAACCT 486 TTCTACTATA CCCATCTTGT GTAGTAGCTC ATGACAGTCA ACCAGAAAAT CGTAAGCATC 4019 |||||||| ||||| |||| |||||||||| |||||| ||| |||||||||| | |||||||| CTCTACTATC CCCATTTTGT GTAGTAGCTC ATGACAATCA ACCAGAAAAT CATAAGCATC 546 CTCAGATTCC GCACCCTTGA ATACTGGAGG TTTCAATTTC AAGAACTTAC TGAAAAGTTC 3959 |||||||| |||||||||| | |||||||| ||||| |||| |||||||||| |||||||||| TTCAGATTCT GCACCCTTGA AGACTGGAGG TTTCAGTTTC AAGAACTTAC TGAAAAGTTC 606 ATGCTGATCA TTTGTCATTA TAGGCCCAGT AGTCAGACGT GGAAACGTGC CTATTTCCAA 3899 ||| ||||| |||||||| | |||||||||| |||||||||| ||||| |||| ||||| |||| ATGTTGATCG TTTGTCATAA TAGGCCCAGT AGTCAGACGT GGAAATGTGC CTATTCCCAA 666 TGGAACATCC ATGCGGGGAG CCATAGTAGC CGCATGTTGT ACCTC 3854 ||| |||| |||||||||| |||||||||| ||||||| || | ||| TGGTGCATCT ATGCGGGGAG CCATAGTAGC CGCATGTCGT ATCTC 711 hqPGS_C06HBa0153O03.1-2-_SGN-E349404+ (4558 3854) ******************************************************************************** EST sequence 101 +strand 679 n (File: SGN-E351625+) 1 GGGGTTCTGT CCTCTAGAAT AAGAACCATT ATACTCTCCT CCCCTTCTAA ACCTCTTCGA 61 TGTCGATGTC GTGGTGAAGT CATCCGGTTT CACTCCTTCC ACCTCAATCA CAAAGTCTAC 121 CACCTCTTGA AAGGATGTTG CTGTTGCCGC TATCTGTAAG GCTGAAATCC GCAATTCTGA 181 TCTCAACCCC TTCACAAAAC GGCGGATCCG TTCTTGTGGA CTAAAACAGA GTTGGGTGGC 241 GTATCTGGAT AGTGCGCGAA ATTTAGCCTC ATACGCGTTT ACTGTCATCC TTCCCTGTTC 301 AAGACTCAAG AACTCATCCC TTTTCCTATC TCTCAAAGTC CTTGGGATAT ATTTCTCCAT 361 GAACAAACTA GAGAATGATT CCCAAGTCAT AGGTGGTGCC TCTGTTGGTT GACACTCAAC 421 ATGTGAACGC CACCACATCT TGGCGTTCCC TTGGAACTGA TAGCTCACGA ACTCGACGCC 481 AAACCTCTCT ACTATCCCCA TTTTGTGTAG TAGCTCATGA CAATCAACCA GAAAATCATA 541 AGCATCTTCA GATTCTGCAC CCTTGAAGAC TGGAGGTTTC AGTTTCAAGA ACTTACTGAA 601 AAGTTCATGT TGATCGTTTG TCATAATAAG CCCAGTAGTC AGACGTGGAA ATGTGCCTAT 661 TCCCAATGGT GCATCTATG Predicted gene structure (within gDNA segment 6631 to 936): Exon 1 4558 3886 ( 673 n); cDNA 7 679 ( 673 n); score: 0.878 MATCH C06HBa0153O03.1-2- SGN-E351625+ 0.878 673 0.991 C PGS_C06HBa0153O03.1-2-_SGN-E351625+ (4558 3886) Alignment (genomic DNA sequence = upper lines): CTGTCCTCTA GTGTAAGCAC CATTAAACTC ACCTCCCTTT CGAAACCTTT TTGATGTCAA 4499 |||||||||| | |||| || ||||| |||| |||||| || | |||||| | | |||||| | CTGTCCTCTA GAATAAGAAC CATTATACTC TCCTCCCCTT CTAAACCTCT TCGATGTCGA 66 TGTCGGGGTG AAGTCGTCTG GCTTCACTCC TTCCACTTCT ATCACAAAGT CTACCACCTC 4439 ||||| |||| ||||| || | | |||||||| |||||| || |||||||||| |||||||||| TGTCGTGGTG AAGTCATCCG GTTTCACTCC TTCCACCTCA ATCACAAAGT CTACCACCTC 126 TTGGAAGGAT TTTGCCGTTG CCACTATCTG TAAGGCCGAA ATCCGCAATT CTGACCTCAA 4379 ||| |||||| |||| |||| || ||||||| |||||| ||| |||||||||| |||| ||||| TTGAAAGGAT GTTGCTGTTG CCGCTATCTG TAAGGCTGAA ATCCGCAATT CTGATCTCAA 186 CCCCTTCACA AACCGGCGAA TTCGCTCTTG TGGACTGAAA CACAGTTGGG TGGCATACCG 4319 |||||||||| || ||||| | | || ||||| |||||| ||| || ||||||| |||| || | CCCCTTCACA AAACGGCGGA TCCGTTCTTG TGGACTAAAA CAGAGTTGGG TGGCGTATCT 246 GGATAATGCA CGAAACTTAG CCTCATATGC ATTGACCGAC ATCCTACCTT GCTCTAGGCT 4259 ||||| ||| ||||| |||| ||||||| || || || | | ||||| || | | || || || GGATAGTGCG CGAAATTTAG CCTCATACGC GTTTACTGTC ATCCTTCCCT GTTCAAGACT 306 CAAGAACTCA TCCCTTTTCC TATCCCTCAA AGTTCGGGGG ATATACTTCT CCATAAACAA 4199 |||||||||| |||||||||| |||| ||||| ||| | ||| ||||| |||| |||| ||||| CAAGAACTCA TCCCTTTTCC TATCTCTCAA AGTCCTTGGG ATATATTTCT CCATGAACAA 366 GCTAGAGAAT GAGGCCCAAG TCATAGGTGG TGCCTCTGTT GGTTGACACT CAATATGTGA 4139 ||||||||| || |||||| |||||||||| |||||||||| |||||||||| ||| |||||| ACTAGAGAAT GATTCCCAAG TCATAGGTGG TGCCTCTGTT GGTTGACACT CAACATGTGA 426 CCGCCACCAC ATTTTGGCAT TACCTTGAAA CTGATAACTT ACGAACTCAA CACCAAACCG 4079 ||||||||| || ||||| | | ||||| || |||||| || |||||||| | | ||||||| ACGCCACCAC ATCTTGGCGT TCCCTTGGAA CTGATAGCTC ACGAACTCGA CGCCAAACCT 486 TTCTACTATA CCCATCTTGT GTAGTAGCTC ATGACAGTCA ACCAGAAAAT CGTAAGCATC 4019 |||||||| ||||| |||| |||||||||| |||||| ||| |||||||||| | |||||||| CTCTACTATC CCCATTTTGT GTAGTAGCTC ATGACAATCA ACCAGAAAAT CATAAGCATC 546 CTCAGATTCC GCACCCTTGA ATACTGGAGG TTTCAATTTC AAGAACTTAC TGAAAAGTTC 3959 |||||||| |||||||||| | |||||||| ||||| |||| |||||||||| |||||||||| TTCAGATTCT GCACCCTTGA AGACTGGAGG TTTCAGTTTC AAGAACTTAC TGAAAAGTTC 606 ATGCTGATCA TTTGTCATTA TAGGCCCAGT AGTCAGACGT GGAAACGTGC CTATTTCCAA 3899 ||| ||||| |||||||| | || ||||||| |||||||||| ||||| |||| ||||| |||| ATGTTGATCG TTTGTCATAA TAAGCCCAGT AGTCAGACGT GGAAATGTGC CTATTCCCAA 666 TGGAACATCC ATG 3886 ||| |||| ||| TGGTGCATCT ATG 679 hqPGS_C06HBa0153O03.1-2-_SGN-E351625+ (4558 3886) ******************************************************************************** EST sequence 131 +strand 612 n (File: SGN-E357065+) 1 GGGTTCTGTC CTCTAGAATA AGAACCATTA TACTCTCCTC CCCTTCTAAA CCTCTTCGAT 61 GTCGATGTCG TGGTGAAGTC ATCCGGTTTC ACTCCTTCCA CCTCAATCAC AAAGTCTACC 121 ACCTCTTGAA AGGATGTTGC TGTTGCCGCT ATCTGTAAGG CTGAAATCCG CAATTCTGAT 181 CTCAACCCCT TCACAAAACG GCGGATCCGT TCTTGTGGAC TAAAACAGAG TTGGGTGGCG 241 TATCTGGATA GTGCGCGAAA TTTAGCCTCA TACGCGTTTA CTGTCATCCT TCCCTGTTCA 301 AGACTCAAGA ACTCATCCCT TTTCCTATCT CTCAAAGTCC TTGGGATATA TTTCTCCATG 361 AACAAACTAG AGAATGATTC CCAAGTCATA GGTGGTGCCT CTGTTGGTTG ACACTCAACA 421 TGTGAACGCC ACCACATCTT GGCGTTCCCT TGGAACTGAT AGCTCACGAA CTCGACGCCA 481 AACCTCTCTA CTATCCCCAT TTTGTGTAGT AGCTCATGAC AATCAACCAG AAAATCATAA 541 GCATCTTCAG ATTCTGCACC CTTGAAGACT GGAGGTTTCA GTTTCAAGAA CTTACTGAAA 601 AGTCATGTTG AT Predicted gene structure (within gDNA segment 6621 to 1596): Exon 1 4558 3951 ( 608 n); cDNA 6 612 ( 607 n); score: 0.877 MATCH C06HBa0153O03.1-2- SGN-E357065+ 0.877 608 0.993 C PGS_C06HBa0153O03.1-2-_SGN-E357065+ (4558 3951) Alignment (genomic DNA sequence = upper lines): CTGTCCTCTA GTGTAAGCAC CATTAAACTC ACCTCCCTTT CGAAACCTTT TTGATGTCAA 4499 |||||||||| | |||| || ||||| |||| |||||| || | |||||| | | |||||| | CTGTCCTCTA GAATAAGAAC CATTATACTC TCCTCCCCTT CTAAACCTCT TCGATGTCGA 65 TGTCGGGGTG AAGTCGTCTG GCTTCACTCC TTCCACTTCT ATCACAAAGT CTACCACCTC 4439 ||||| |||| ||||| || | | |||||||| |||||| || |||||||||| |||||||||| TGTCGTGGTG AAGTCATCCG GTTTCACTCC TTCCACCTCA ATCACAAAGT CTACCACCTC 125 TTGGAAGGAT TTTGCCGTTG CCACTATCTG TAAGGCCGAA ATCCGCAATT CTGACCTCAA 4379 ||| |||||| |||| |||| || ||||||| |||||| ||| |||||||||| |||| ||||| TTGAAAGGAT GTTGCTGTTG CCGCTATCTG TAAGGCTGAA ATCCGCAATT CTGATCTCAA 185 CCCCTTCACA AACCGGCGAA TTCGCTCTTG TGGACTGAAA CACAGTTGGG TGGCATACCG 4319 |||||||||| || ||||| | | || ||||| |||||| ||| || ||||||| |||| || | CCCCTTCACA AAACGGCGGA TCCGTTCTTG TGGACTAAAA CAGAGTTGGG TGGCGTATCT 245 GGATAATGCA CGAAACTTAG CCTCATATGC ATTGACCGAC ATCCTACCTT GCTCTAGGCT 4259 ||||| ||| ||||| |||| ||||||| || || || | | ||||| || | | || || || GGATAGTGCG CGAAATTTAG CCTCATACGC GTTTACTGTC ATCCTTCCCT GTTCAAGACT 305 CAAGAACTCA TCCCTTTTCC TATCCCTCAA AGTTCGGGGG ATATACTTCT CCATAAACAA 4199 |||||||||| |||||||||| |||| ||||| ||| | ||| ||||| |||| |||| ||||| CAAGAACTCA TCCCTTTTCC TATCTCTCAA AGTCCTTGGG ATATATTTCT CCATGAACAA 365 GCTAGAGAAT GAGGCCCAAG TCATAGGTGG TGCCTCTGTT GGTTGACACT CAATATGTGA 4139 ||||||||| || |||||| |||||||||| |||||||||| |||||||||| ||| |||||| ACTAGAGAAT GATTCCCAAG TCATAGGTGG TGCCTCTGTT GGTTGACACT CAACATGTGA 425 CCGCCACCAC ATTTTGGCAT TACCTTGAAA CTGATAACTT ACGAACTCAA CACCAAACCG 4079 ||||||||| || ||||| | | ||||| || |||||| || |||||||| | | ||||||| ACGCCACCAC ATCTTGGCGT TCCCTTGGAA CTGATAGCTC ACGAACTCGA CGCCAAACCT 485 TTCTACTATA CCCATCTTGT GTAGTAGCTC ATGACAGTCA ACCAGAAAAT CGTAAGCATC 4019 |||||||| ||||| |||| |||||||||| |||||| ||| |||||||||| | |||||||| CTCTACTATC CCCATTTTGT GTAGTAGCTC ATGACAATCA ACCAGAAAAT CATAAGCATC 545 CTCAGATTCC GCACCCTTGA ATACTGGAGG TTTCAATTTC AAGAACTTAC TGAAAAGTTC 3959 |||||||| |||||||||| | |||||||| ||||| |||| |||||||||| ||||||| || TTCAGATTCT GCACCCTTGA AGACTGGAGG TTTCAGTTTC AAGAACTTAC TGAAAAG-TC 604 ATGCTGAT 3951 ||| |||| ATGTTGAT 612 hqPGS_C06HBa0153O03.1-2-_SGN-E357065+ (4558 3951) ******************************************************************************** EST sequence 70 +strand 524 n (File: SGN-E352365+) 1 GGGGTTCTGT CCTCTAGAAT AAGAACCATT ATACTCTCCT CCCCTTCTAA ACCTCTTCGA 61 TGTCGATGTC GTGGTGAAGT CATCCGGTTT CACTCCTTCC ACCTCAATCA CAAAGTCTAC 121 CACCTCTTGA AAGGATGTTG CTGTTGCCGC TATCTGTAAG GCTGAAATCC GCAATTCTGA 181 TCTCAACCCC TTCACAAAAC GGCGGATCCG TTCTTGTGGA CTAAAACAGA GTTGGGTGGC 241 GTATCTGGAT AGTGCGCGAA ATTTAGCCTC ATACGCGTTT ACTGTCATCC TTCCCTGTTC 301 AAGACTCAAG AACTCATCCC TTTTCCTATC TCTCAAAGTC CTTGGGATAT ATTTCTCCAT 361 GAACAAACTA GAGAATGATT CCCAAGTCAT AGGTGGTGCC TCTGTTGGTT GACACTCAAC 421 ATGTGAACGC CACCACATCT TGGCGTTCCC TTGGAACTGA TAGCTCACGA ACTCGACGCC 481 AAACCTCTCT ACTATCCCCA TTTTGTGTAG TAGCTCATGA CAAT Predicted gene structure (within gDNA segment 6631 to 2486): Exon 1 4558 4043 ( 516 n); cDNA 7 522 ( 516 n); score: 0.870 MATCH C06HBa0153O03.1-2- SGN-E352365+ 0.870 516 0.985 C PGS_C06HBa0153O03.1-2-_SGN-E352365+ (4558 4043) Alignment (genomic DNA sequence = upper lines): CTGTCCTCTA GTGTAAGCAC CATTAAACTC ACCTCCCTTT CGAAACCTTT TTGATGTCAA 4499 |||||||||| | |||| || ||||| |||| |||||| || | |||||| | | |||||| | CTGTCCTCTA GAATAAGAAC CATTATACTC TCCTCCCCTT CTAAACCTCT TCGATGTCGA 66 TGTCGGGGTG AAGTCGTCTG GCTTCACTCC TTCCACTTCT ATCACAAAGT CTACCACCTC 4439 ||||| |||| ||||| || | | |||||||| |||||| || |||||||||| |||||||||| TGTCGTGGTG AAGTCATCCG GTTTCACTCC TTCCACCTCA ATCACAAAGT CTACCACCTC 126 TTGGAAGGAT TTTGCCGTTG CCACTATCTG TAAGGCCGAA ATCCGCAATT CTGACCTCAA 4379 ||| |||||| |||| |||| || ||||||| |||||| ||| |||||||||| |||| ||||| TTGAAAGGAT GTTGCTGTTG CCGCTATCTG TAAGGCTGAA ATCCGCAATT CTGATCTCAA 186 CCCCTTCACA AACCGGCGAA TTCGCTCTTG TGGACTGAAA CACAGTTGGG TGGCATACCG 4319 |||||||||| || ||||| | | || ||||| |||||| ||| || ||||||| |||| || | CCCCTTCACA AAACGGCGGA TCCGTTCTTG TGGACTAAAA CAGAGTTGGG TGGCGTATCT 246 GGATAATGCA CGAAACTTAG CCTCATATGC ATTGACCGAC ATCCTACCTT GCTCTAGGCT 4259 ||||| ||| ||||| |||| ||||||| || || || | | ||||| || | | || || || GGATAGTGCG CGAAATTTAG CCTCATACGC GTTTACTGTC ATCCTTCCCT GTTCAAGACT 306 CAAGAACTCA TCCCTTTTCC TATCCCTCAA AGTTCGGGGG ATATACTTCT CCATAAACAA 4199 |||||||||| |||||||||| |||| ||||| ||| | ||| ||||| |||| |||| ||||| CAAGAACTCA TCCCTTTTCC TATCTCTCAA AGTCCTTGGG ATATATTTCT CCATGAACAA 366 GCTAGAGAAT GAGGCCCAAG TCATAGGTGG TGCCTCTGTT GGTTGACACT CAATATGTGA 4139 ||||||||| || |||||| |||||||||| |||||||||| |||||||||| ||| |||||| ACTAGAGAAT GATTCCCAAG TCATAGGTGG TGCCTCTGTT GGTTGACACT CAACATGTGA 426 CCGCCACCAC ATTTTGGCAT TACCTTGAAA CTGATAACTT ACGAACTCAA CACCAAACCG 4079 ||||||||| || ||||| | | ||||| || |||||| || |||||||| | | ||||||| ACGCCACCAC ATCTTGGCGT TCCCTTGGAA CTGATAGCTC ACGAACTCGA CGCCAAACCT 486 TTCTACTATA CCCATCTTGT GTAGTAGCTC ATGACA 4043 |||||||| ||||| |||| |||||||||| |||||| CTCTACTATC CCCATTTTGT GTAGTAGCTC ATGACA 522 hqPGS_C06HBa0153O03.1-2-_SGN-E352365+ (4558 4043) ******************************************************************************** EST sequence 129 +strand 790 n (File: SGN-E356912+) 1 CAGGATCAAA CAATACGGAA GCCATGCAAT CACAAACTAG AAGATTACCT GTGATGACAG 61 CATCAGATGC CTCCGCTTCA GACCGCCCAG GGAAAGCGTA ACAATGGGCC CTATCGTTTG 121 TCTGCCCATT GCCCCTACCA TGTTGTGATG TAGTGGCCCC AGTTTGCCCA TTACCTCTGC 181 CGTTTTGGTG ACCACCATTA CCTCGACCAC CACGTCCTCC AGAATAACGG CCTCTACCAT 241 GACCACCTCT ACCTCTAGCT ATTGGGGGTC TATAACTTGG TCCAGGACAA TTTATCCTAA 301 TATGTCCAAT CTCCCCACAT CCATAACATT CTCTGGAGTC ACTCATAGGC CCCTTGGAGA 361 AGTGTTGACC GGTCTGAGGT GGACCCCCAA CTACAGTCTG TAGTGAAGAC TGAATGGGTC 421 GAGCCGAGTA ACCTCCGGAA CCTTGTCCTC TAGAGTAAGA ACCCTTAAAC TCACCTCCCT 481 TTCGAAACCT CTTTGATGTT GATGTCGTGG TGAAGTCGTC TGGCTTCACT CCTTCCACTT 541 CTATCACAAA GTCTACCACT TCTTGGAAGG ATTTTGCCGT TGCCGCTACC TGTAAGGCCG 601 AAATCCGCAA TTCTGACCTC AACCCCTTCA CAAACCGGAG AATCCGCTCT TGAGGACTGA 661 AACAGAGTTG AGTGGCATAT CCGGATAGTG CACGGAACTT AGCCTCATAA GCATTAACCG 721 ACATCCTACC TTGCTCTAGG CTCAGGAACT CATCTCTTTT CCTATCCCTC AAAGTCCGGG 781 GGATATACTT Predicted gene structure (within gDNA segment 5780 to 3466): Exon 1 5000 4211 ( 790 n); cDNA 1 790 ( 790 n); score: 0.908 MATCH C06HBa0153O03.1-2- SGN-E356912+ 0.908 790 1.000 C PGS_C06HBa0153O03.1-2-_SGN-E356912+ (5000 4211) Alignment (genomic DNA sequence = upper lines): CAGGATCAAA CAATACAGAG GCCATGCAAT CACAAACCAG AAGATTACCT GTGATGACAG 4941 |||||||||| |||||| || |||||||||| ||||||| || |||||||||| |||||||||| CAGGATCAAA CAATACGGAA GCCATGCAAT CACAAACTAG AAGATTACCT GTGATGACAG 60 CATCAGATGC CTCCGCTTCA GACCGCCCAG GGAAAGCGTA ACAATGGGCC CTATCGTTCG 4881 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||| | CATCAGATGC CTCCGCTTCA GACCGCCCAG GGAAAGCGTA ACAATGGGCC CTATCGTTTG 120 TCTGTCCGTT GCCCCTAACT TGTTGTGATG TAGTGGCTCC AGTTTGCCCA TCACCTTGGC 4821 |||| || || ||||||| | |||||||||| ||||||| || |||||||||| | |||| || TCTGCCCATT GCCCCTACCA TGTTGTGATG TAGTGGCCCC AGTTTGCCCA TTACCTCTGC 180 CGTTTTGGTT ACCACCATTT CCTTGACCAC CACGTCCTCC AGAATAACGG CCTCTGCCAT 4761 ||||||||| ||||||||| ||| |||||| |||||||||| |||||||||| ||||| |||| CGTTTTGGTG ACCACCATTA CCTCGACCAC CACGTCCTCC AGAATAACGG CCTCTACCAT 240 GACCACCTCT ACCTCTAACA TTTGGAGGTC TGTAACTCTG TTTTGGACAA TATCTCTTAA 4701 |||||||||| ||||||| | |||| |||| | ||||| | | |||||| | | || ||| GACCACCTCT ACCTCTAGCT ATTGGGGGTC TATAACTTGG TCCAGGACAA TTTATCCTAA 300 TATGTTCGAT CTCCCCACAT CCATAACACT TTCTGGGTTC ATGCATAGGT CTCTCAGAGA 4641 ||||| | || |||||||||| |||||||| | ||||| || | |||||| | || |||| TATGTCCAAT CTCCCCACAT CCATAACATT CTCTGGAGTC ACTCATAGGC CCCTTGGAGA 360 AGTGTTGACC GGTCGGAGGT GGACCCCCAA CTACAGTCTG TAGTGAAGAC TGAATTGGTC 4581 |||||||||| |||| ||||| |||||||||| |||||||||| |||||||||| ||||| |||| AGTGTTGACC GGTCTGAGGT GGACCCCCAA CTACAGTCTG TAGTGAAGAC TGAATGGGTC 420 GGACTGAGTA ACTTCCCGAA CCCTGTCCTC TAGTGTAAGC ACCATTAAAC TCACCTCCCT 4521 | | ||||| || ||| ||| || ||||||| ||| ||||| ||| |||||| |||||||||| GAGCCGAGTA ACCTCCGGAA CCTTGTCCTC TAGAGTAAGA ACCCTTAAAC TCACCTCCCT 480 TTCGAAACCT TTTTGATGTC AATGTCGGGG TGAAGTCGTC TGGCTTCACT CCTTCCACTT 4461 |||||||||| |||||||| |||||| || |||||||||| |||||||||| |||||||||| TTCGAAACCT CTTTGATGTT GATGTCGTGG TGAAGTCGTC TGGCTTCACT CCTTCCACTT 540 CTATCACAAA GTCTACCACC TCTTGGAAGG ATTTTGCCGT TGCCACTATC TGTAAGGCCG 4401 |||||||||| ||||||||| |||||||||| |||||||||| |||| ||| | |||||||||| CTATCACAAA GTCTACCACT TCTTGGAAGG ATTTTGCCGT TGCCGCTACC TGTAAGGCCG 600 AAATCCGCAA TTCTGACCTC AACCCCTTCA CAAACCGGCG AATTCGCTCT TGTGGACTGA 4341 |||||||||| |||||||||| |||||||||| |||||||| | ||| |||||| || ||||||| AAATCCGCAA TTCTGACCTC AACCCCTTCA CAAACCGGAG AATCCGCTCT TGAGGACTGA 660 AACACAGTTG GGTGGCATAC CGGGATAATG CACGAAACTT AGCCTCATAT GCATTGACCG 4281 |||| ||||| |||||||| | ||||| || |||| ||||| ||||||||| ||||| |||| AACAGAGTTG AGTGGCATAT CCGGATAGTG CACGGAACTT AGCCTCATAA GCATTAACCG 720 ACATCCTACC TTGCTCTAGG CTCAAGAACT CATCCCTTTT CCTATCCCTC AAAGTTCGGG 4221 |||||||||| |||||||||| |||| ||||| |||| ||||| |||||||||| ||||| |||| ACATCCTACC TTGCTCTAGG CTCAGGAACT CATCTCTTTT CCTATCCCTC AAAGTCCGGG 780 GGATATACTT 4211 |||||||||| GGATATACTT 790 hqPGS_C06HBa0153O03.1-2-_SGN-E356912+ (5000 4211) ******************************************************************************** EST sequence 126 +strand 698 n (File: SGN-E356209+) 1 TTCAGACCGC CCAGGGAAAG CGTAACAATG GGCCCTATCG TTCGTCTGTC CGTTGCCCCT 61 ACCATGTTGT GATGTAGTGG CCCCAGTTTG CCCATTACCT CTGCCATTTT GGTGACCACC 121 ATTACCTCGA CCACCACGTC CTCCAGAATA ACGGCCTCTA CCATGACCAC CTCTACCTCT 181 AGCTATTGGG GGTCTATAAC TTTGTCTGGG ACAATTTTTC CTAATATGTC CAATCTCCCC 241 ACATCCATAA CATTCTCTGG AGTCAAGCAT AGGCCCCTCG GAGAAGTGTT AACCGGTCTG 301 AGGTGGTCTC CCAACTACAG TCTGTAGTGA AGACTGAATT GGTCGGACTG AGTAACTTCC 361 CGAACCCTGT CCTCTAGTGT AAGAACCATT AAACTCACCT CCCTTCCGAA ACCTTTTTGA 421 TGTCGATGTT GTGGTGAAGT CGTCTGGCTT CACTCCTTCC ACTTCTATCA CAAAGTCTAC 481 CACTTCTTGG AAGGATTTTG CCGTTGCCGC TATCTGTAAG GACGAAATCC GCAATTCTGA 541 CCTCAACCCC TTCACAAACC GGCGAATCCG CTCTTGAGGA CTGAAACAGA GTTGAGTGGC 601 ATATCTGGAT AGTGCACGAA ACTTAGCCTC ATAAGCATTA ACCGACATCC TACCTTGCTC 661 TAGGCTCAAG AACTCATCCC TTTTCCTATC CCTCAAAG Predicted gene structure (within gDNA segment 5524 to 3617): Exon 1 4924 4227 ( 698 n); cDNA 1 698 ( 698 n); score: 0.924 MATCH C06HBa0153O03.1-2- SGN-E356209+ 0.924 698 1.000 C PGS_C06HBa0153O03.1-2-_SGN-E356209+ (4924 4227) Alignment (genomic DNA sequence = upper lines): TTCAGACCGC CCAGGGAAAG CGTAACAATG GGCCCTATCG TTCGTCTGTC CGTTGCCCCT 4865 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCAGACCGC CCAGGGAAAG CGTAACAATG GGCCCTATCG TTCGTCTGTC CGTTGCCCCT 60 AACTTGTTGT GATGTAGTGG CTCCAGTTTG CCCATCACCT TGGCCGTTTT GGTTACCACC 4805 | | |||||| |||||||||| | |||||||| ||||| |||| ||| |||| ||| |||||| ACCATGTTGT GATGTAGTGG CCCCAGTTTG CCCATTACCT CTGCCATTTT GGTGACCACC 120 ATTTCCTTGA CCACCACGTC CTCCAGAATA ACGGCCTCTG CCATGACCAC CTCTACCTCT 4745 ||| ||| || |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| ATTACCTCGA CCACCACGTC CTCCAGAATA ACGGCCTCTA CCATGACCAC CTCTACCTCT 180 AACATTTGGA GGTCTGTAAC TCTGTTTTGG ACAATATCTC TTAATATGTT CGATCTCCCC 4685 | | |||| ||||| |||| | ||| | || ||||| | || |||||||| | |||||||| AGCTATTGGG GGTCTATAAC TTTGTCTGGG ACAATTTTTC CTAATATGTC CAATCTCCCC 240 ACATCCATAA CACTTTCTGG GTTCATGCAT AGGTCTCTCA GAGAAGTGTT GACCGGTCGG 4625 |||||||||| || | ||||| ||| |||| ||| | ||| |||||||||| ||||||| | ACATCCATAA CATTCTCTGG AGTCAAGCAT AGGCCCCTCG GAGAAGTGTT AACCGGTCTG 300 AGGTGGACCC CCAACTACAG TCTGTAGTGA AGACTGAATT GGTCGGACTG AGTAACTTCC 4565 |||||| | | |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGGTGGTCTC CCAACTACAG TCTGTAGTGA AGACTGAATT GGTCGGACTG AGTAACTTCC 360 CGAACCCTGT CCTCTAGTGT AAGCACCATT AAACTCACCT CCCTTTCGAA ACCTTTTTGA 4505 |||||||||| |||||||||| ||| |||||| |||||||||| ||||| |||| |||||||||| CGAACCCTGT CCTCTAGTGT AAGAACCATT AAACTCACCT CCCTTCCGAA ACCTTTTTGA 420 TGTCAATGTC GGGGTGAAGT CGTCTGGCTT CACTCCTTCC ACTTCTATCA CAAAGTCTAC 4445 |||| |||| | |||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGTCGATGTT GTGGTGAAGT CGTCTGGCTT CACTCCTTCC ACTTCTATCA CAAAGTCTAC 480 CACCTCTTGG AAGGATTTTG CCGTTGCCAC TATCTGTAAG GCCGAAATCC GCAATTCTGA 4385 ||| |||||| |||||||||| |||||||| | |||||||||| | |||||||| |||||||||| CACTTCTTGG AAGGATTTTG CCGTTGCCGC TATCTGTAAG GACGAAATCC GCAATTCTGA 540 CCTCAACCCC TTCACAAACC GGCGAATTCG CTCTTGTGGA CTGAAACACA GTTGGGTGGC 4325 |||||||||| |||||||||| ||||||| || |||||| ||| |||||||| | |||| ||||| CCTCAACCCC TTCACAAACC GGCGAATCCG CTCTTGAGGA CTGAAACAGA GTTGAGTGGC 600 ATACCGGGAT AATGCACGAA ACTTAGCCTC ATATGCATTG ACCGACATCC TACCTTGCTC 4265 ||| | |||| | |||||||| |||||||||| ||| ||||| |||||||||| |||||||||| ATATCTGGAT AGTGCACGAA ACTTAGCCTC ATAAGCATTA ACCGACATCC TACCTTGCTC 660 TAGGCTCAAG AACTCATCCC TTTTCCTATC CCTCAAAG 4227 |||||||||| |||||||||| |||||||||| |||||||| TAGGCTCAAG AACTCATCCC TTTTCCTATC CCTCAAAG 698 hqPGS_C06HBa0153O03.1-2-_SGN-E356209+ (4924 4227) ******************************************************************************** EST sequence 96 +strand 763 n (File: SGN-E214046+) 1 ACGAATAGGC ATATCAAGAA ATTCACAATG TAAATTTAGA CCATTAGCAA ATGAGGAAGA 61 TACATAAGAA AACGTGGATC CAGGATCAAA CAATACGGAA GCCATGCAAT CACAAACAAG 121 AAGATTACCT GTGATGACAG CATCAGATGC CTCCGCTTTA GACCGCCCAG GGAAAGCGTA 181 ACAATGGGCC CTATAGTTTG TTTGTCCATT GCCCCTACCA TGTCGTGATG TAGTGGCCCC 241 AGTTTGTCCA TTACCTCTGC CGTTTTGGTG ACCACCATTG CCTCGACCAC CACGTCCTCC 301 AAAATAACGG CCTCTACCAT GACAACCTCT ACCTCTAACT ATTGGGGGTC TATAACTTGG 361 TCCGGGAAAA TTTCTCCTAA TATGTCCAAT CTCCCCACAT CCATAACATT CTCTAGAGTC 421 ACTCATAGGC CCCTCGGAGA AGTGTTGACC GGTCTGAGGT GGACCCCCAA CTACAGTTTG 481 TAGTGAAGAC TGAATGGGTC GAACTGAGTA ACCTCCAGAA CCTTGTCCTC TAGAGTAAGA 541 ACCCTTAAAC TCATCTCCCC TTTGAAACCT CATCGATGTC GATGTCGTGG TGAAGTCGTC 601 TGGTTTCACT CCTTCCACTT CTATCAAAAA GTCTACCACT TCTTGGAAGG ATTTTGCCGT 661 TGCCGCTACC TGTAAGGGCG AAATCCGCAA TTCTGACCTC AACCCCTTCA CAACCGGCGA 721 ATCCGCTCTT GAGGGACTGA ACAGAGTTGA GTGGCATATC TGG Predicted gene structure (within gDNA segment 5680 to 3258): Exon 1 5080 4317 ( 764 n); cDNA 1 763 ( 763 n); score: 0.892 MATCH C06HBa0153O03.1-2- SGN-E214046+ 0.892 764 1.001 C PGS_C06HBa0153O03.1-2-_SGN-E214046+ (5080 4317) Alignment (genomic DNA sequence = upper lines): ACGAATAGGC ATATCAAGTA ATTCACAATG TAAATTTAGA CCATTAGCAA ATGAGGAAGA 5021 |||||||||| |||||||| | |||||||||| |||||||||| |||||||||| |||||||||| ACGAATAGGC ATATCAAGAA ATTCACAATG TAAATTTAGA CCATTAGCAA ATGAGGAAGA 60 TACATAAGAA AATGTGGATC CAGGATCAAA CAATACAGAG GCCATGCAAT CACAAACCAG 4961 |||||||||| || ||||||| |||||||||| |||||| || |||||||||| ||||||| || TACATAAGAA AACGTGGATC CAGGATCAAA CAATACGGAA GCCATGCAAT CACAAACAAG 120 AAGATTACCT GTGATGACAG CATCAGATGC CTCCGCTTCA GACCGCCCAG GGAAAGCGTA 4901 |||||||||| |||||||||| |||||||||| |||||||| | |||||||||| |||||||||| AAGATTACCT GTGATGACAG CATCAGATGC CTCCGCTTTA GACCGCCCAG GGAAAGCGTA 180 ACAATGGGCC CTATCGTTCG TCTGTCCGTT GCCCCTAACT TGTTGTGATG TAGTGGCTCC 4841 |||||||||| |||| ||| | | ||||| || ||||||| | ||| |||||| ||||||| || ACAATGGGCC CTATAGTTTG TTTGTCCATT GCCCCTACCA TGTCGTGATG TAGTGGCCCC 240 AGTTTGCCCA TCACCTTGGC CGTTTTGGTT ACCACCATTT CCTTGACCAC CACGTCCTCC 4781 |||||| ||| | |||| || ||||||||| ||||||||| ||| |||||| |||||||||| AGTTTGTCCA TTACCTCTGC CGTTTTGGTG ACCACCATTG CCTCGACCAC CACGTCCTCC 300 AGAATAACGG CCTCTGCCAT GACCACCTCT ACCTCTAACA TTTGGAGGTC TGTAACTCTG 4721 | |||||||| ||||| |||| ||| |||||| ||||||||| |||| |||| | ||||| | AAAATAACGG CCTCTACCAT GACAACCTCT ACCTCTAACT ATTGGGGGTC TATAACTTGG 360 TTTTGGACAA TATCTCTTAA TATGTTCGAT CTCCCCACAT CCATAACACT TTCTGGGTTC 4661 | ||| || | |||| ||| ||||| | || |||||||||| |||||||| | ||| | || TCCGGGAAAA TTTCTCCTAA TATGTCCAAT CTCCCCACAT CCATAACATT CTCTAGAGTC 420 ATGCATAGGT CTCTCAGAGA AGTGTTGACC GGTCGGAGGT GGACCCCCAA CTACAGTCTG 4601 | |||||| | ||| |||| |||||||||| |||| ||||| |||||||||| ||||||| || ACTCATAGGC CCCTCGGAGA AGTGTTGACC GGTCTGAGGT GGACCCCCAA CTACAGTTTG 480 TAGTGAAGAC TGAATTGGTC GGACTGAGTA ACTTCCCGAA CCCTGTCCTC TAGTGTAAGC 4541 |||||||||| ||||| |||| | |||||||| || ||| ||| || ||||||| ||| ||||| TAGTGAAGAC TGAATGGGTC GAACTGAGTA ACCTCCAGAA CCTTGTCCTC TAGAGTAAGA 540 ACCATTAAAC TCACCTCCCT TTCGAAACCT TTTTGATGTC AATGTCGGGG TGAAGTCGTC 4481 ||| |||||| ||| ||||| || ||||||| | |||||| |||||| || |||||||||| ACCCTTAAAC TCATCTCCCC TTTGAAACCT CATCGATGTC GATGTCGTGG TGAAGTCGTC 600 TGGCTTCACT CCTTCCACTT CTATCACAAA GTCTACCACC TCTTGGAAGG ATTTTGCCGT 4421 ||| |||||| |||||||||| |||||| ||| ||||||||| |||||||||| |||||||||| TGGTTTCACT CCTTCCACTT CTATCAAAAA GTCTACCACT TCTTGGAAGG ATTTTGCCGT 660 TGCCACTATC TGTAAGGCCG AAATCCGCAA TTCTGACCTC AACCCCTTCA CAAACCGGCG 4361 |||| ||| | ||||||| || |||||||||| |||||||||| |||||||||| | |||||||| TGCCGCTACC TGTAAGGGCG AAATCCGCAA TTCTGACCTC AACCCCTTCA C-AACCGGCG 719 AATTCGCTCT TG-TGGACTG AAACACAGTT GGGTGGCATA CCGGG 4317 ||| |||||| || |||||| |||| |||| | |||||||| | || AATCCGCTCT TGAGGGACTG -AACAGAGTT GAGTGGCATA TCTGG 763 hqPGS_C06HBa0153O03.1-2-_SGN-E214046+ (5080 4317) ******************************************************************************** EST sequence 91 +strand 533 n (File: SGN-E353805+) 1 GGATCAACAA TACGGAAGCC ATGCAATCAC AAACTAGAAG ATTACCTGTG ATGACAGCAT 61 CAGATGCCTC CGCTTCAGAC CGCCCAGGGA AAGCGTAACA ATGGGCCCTA TCGTTTGTCT 121 GCCCATTGCC CCTACCATGT TGTGATGTAG TGGCCCCAGT TTGCCCATTA CCTCTGCCGT 181 TTTGGTGACC ACCATTACCT CGACCACCAC GTCCTCCAGA ATAACGGCCT CTACCATGAC 241 CACCTCTACC TCTAGCTATT GGGGGTCTAT AACTTGGTCC AGGACAATTT ATCCTAATAT 301 GTCCAATCTC CCCACATCCA TAACATTCTC TGGAGTCACT CATAGGCCCC TTGGAGAAGT 361 GTTGACCGGT CTGAGGTGGA CCCCCAACTA CAGTCTGTAG TGAAGACTGA ATGGGTCGAG 421 CCGAGTAACC TCCGGAACCT TGTCCTCTAA AGTAAGAACC CTTAAACTCA CCTCCCTTTC 481 GAAACCTCTT TGATGTTGAT GTCGTGGTGA AGTCGTCTGG CTTCACTCCT TCC Predicted gene structure (within gDNA segment 5750 to 3855): Exon 1 4998 4465 ( 534 n); cDNA 1 533 ( 533 n); score: 0.891 MATCH C06HBa0153O03.1-2- SGN-E353805+ 0.891 534 1.002 C PGS_C06HBa0153O03.1-2-_SGN-E353805+ (4998 4465) Alignment (genomic DNA sequence = upper lines): GGATCAAACA ATACAGAGGC CATGCAATCA CAAACCAGAA GATTACCTGT GATGACAGCA 4939 ||||| |||| |||| || || |||||||||| ||||| |||| |||||||||| |||||||||| GGATC-AACA ATACGGAAGC CATGCAATCA CAAACTAGAA GATTACCTGT GATGACAGCA 59 TCAGATGCCT CCGCTTCAGA CCGCCCAGGG AAAGCGTAAC AATGGGCCCT ATCGTTCGTC 4879 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||| ||| TCAGATGCCT CCGCTTCAGA CCGCCCAGGG AAAGCGTAAC AATGGGCCCT ATCGTTTGTC 119 TGTCCGTTGC CCCTAACTTG TTGTGATGTA GTGGCTCCAG TTTGCCCATC ACCTTGGCCG 4819 || || |||| ||||| | || |||||||||| ||||| |||| ||||||||| |||| |||| TGCCCATTGC CCCTACCATG TTGTGATGTA GTGGCCCCAG TTTGCCCATT ACCTCTGCCG 179 TTTTGGTTAC CACCATTTCC TTGACCACCA CGTCCTCCAG AATAACGGCC TCTGCCATGA 4759 ||||||| || ||||||| || | |||||||| |||||||||| |||||||||| ||| |||||| TTTTGGTGAC CACCATTACC TCGACCACCA CGTCCTCCAG AATAACGGCC TCTACCATGA 239 CCACCTCTAC CTCTAACATT TGGAGGTCTG TAACTCTGTT TTGGACAATA TCTCTTAATA 4699 |||||||||| ||||| | | ||| ||||| ||||| || ||||||| | || ||||| CCACCTCTAC CTCTAGCTAT TGGGGGTCTA TAACTTGGTC CAGGACAATT TATCCTAATA 299 TGTTCGATCT CCCCACATCC ATAACACTTT CTGGGTTCAT GCATAGGTCT CTCAGAGAAG 4639 ||| | |||| |||||||||| |||||| | | |||| ||| |||||| | || |||||| TGTCCAATCT CCCCACATCC ATAACATTCT CTGGAGTCAC TCATAGGCCC CTTGGAGAAG 359 TGTTGACCGG TCGGAGGTGG ACCCCCAACT ACAGTCTGTA GTGAAGACTG AATTGGTCGG 4579 |||||||||| || ||||||| |||||||||| |||||||||| |||||||||| ||| ||||| TGTTGACCGG TCTGAGGTGG ACCCCCAACT ACAGTCTGTA GTGAAGACTG AATGGGTCGA 419 ACTGAGTAAC TTCCCGAACC CTGTCCTCTA GTGTAAGCAC CATTAAACTC ACCTCCCTTT 4519 | ||||||| ||| ||||| ||||||||| ||||| || | |||||||| |||||||||| GCCGAGTAAC CTCCGGAACC TTGTCCTCTA AAGTAAGAAC CCTTAAACTC ACCTCCCTTT 479 CGAAACCTTT TTGATGTCAA TGTCGGGGTG AAGTCGTCTG GCTTCACTCC TTCC 4465 |||||||| | ||||||| | ||||| |||| |||||||||| |||||||||| |||| CGAAACCTCT TTGATGTTGA TGTCGTGGTG AAGTCGTCTG GCTTCACTCC TTCC 533 hqPGS_C06HBa0153O03.1-2-_SGN-E353805+ (4998 4465) ******************************************************************************** EST sequence 98 +strand 559 n (File: SGN-E244046+) 1 AAACACGAAT AGGCATATCA AGTAATTCAC AATGTAAAGT TAGACCATTA GCAAATGAGG 61 AAGATACATA AGAAAATGTG GATCCAGGAT CAAACAATAC AGAAGCCATG CAATCACAAA 121 CCAAAAGATT ACCTGTGATG ACAACATCAG ATTCCTCCGC TTCAGACCGC CCAGGGAAAG 181 CGTAACAATG GGCCCTATCA TTCTTATGTC TGTTGCCCCT ACCATGTTGT GATGTAGTGG 241 CTCCAGTTTG CCCATCACCT CGGCCGTTTT GGTGACCACC ATTTCCTCGA CCGCCACGTC 301 CTCCAGAATA ACGGACTCTG CCATGACCAC CTCTACCTCT AACCTTTGGA GGTCTGTAAC 361 TCTGTTTTGG ACAATATCTC TTAATATGTC CAATCTCCCC ACACCCATAG CACTCTCTGG 421 GTTCATGCAT AAGTCTCTCA GAGAAGTGTT GACCGGTCGA AGGTGGACCC CCAACTACAG 481 TCTGTAGTGA AGACTGAATT GGTCGGACTG AGTAACTTCC CGAACCCTGT CCTCTAGTGT 541 AAGCACCATT AAACTCACC Predicted gene structure (within gDNA segment 5684 to 3916): Exon 1 5084 4526 ( 559 n); cDNA 1 559 ( 559 n); score: 0.957 MATCH C06HBa0153O03.1-2- SGN-E244046+ 0.957 559 1.000 C PGS_C06HBa0153O03.1-2-_SGN-E244046+ (5084 4526) Alignment (genomic DNA sequence = upper lines): AAACACGAAT AGGCATATCA AGTAATTCAC AATGTAAATT TAGACCATTA GCAAATGAGG 5025 |||||||||| |||||||||| |||||||||| |||||||| | |||||||||| |||||||||| AAACACGAAT AGGCATATCA AGTAATTCAC AATGTAAAGT TAGACCATTA GCAAATGAGG 60 AAGATACATA AGAAAATGTG GATCCAGGAT CAAACAATAC AGAGGCCATG CAATCACAAA 4965 |||||||||| |||||||||| |||||||||| |||||||||| ||| |||||| |||||||||| AAGATACATA AGAAAATGTG GATCCAGGAT CAAACAATAC AGAAGCCATG CAATCACAAA 120 CCAGAAGATT ACCTGTGATG ACAGCATCAG ATGCCTCCGC TTCAGACCGC CCAGGGAAAG 4905 ||| |||||| |||||||||| ||| |||||| || ||||||| |||||||||| |||||||||| CCAAAAGATT ACCTGTGATG ACAACATCAG ATTCCTCCGC TTCAGACCGC CCAGGGAAAG 180 CGTAACAATG GGCCCTATCG TTCGTCTGTC CGTTGCCCCT AACTTGTTGT GATGTAGTGG 4845 |||||||||| ||||||||| ||| | |||| ||||||||| | | |||||| |||||||||| CGTAACAATG GGCCCTATCA TTCTTATGTC TGTTGCCCCT ACCATGTTGT GATGTAGTGG 240 CTCCAGTTTG CCCATCACCT TGGCCGTTTT GGTTACCACC ATTTCCTTGA CCACCACGTC 4785 |||||||||| |||||||||| ||||||||| ||| |||||| ||||||| || || ||||||| CTCCAGTTTG CCCATCACCT CGGCCGTTTT GGTGACCACC ATTTCCTCGA CCGCCACGTC 300 CTCCAGAATA ACGGCCTCTG CCATGACCAC CTCTACCTCT AACATTTGGA GGTCTGTAAC 4725 |||||||||| |||| ||||| |||||||||| |||||||||| ||| |||||| |||||||||| CTCCAGAATA ACGGACTCTG CCATGACCAC CTCTACCTCT AACCTTTGGA GGTCTGTAAC 360 TCTGTTTTGG ACAATATCTC TTAATATGTT CGATCTCCCC ACATCCATAA CACTTTCTGG 4665 |||||||||| |||||||||| ||||||||| | |||||||| ||| ||||| |||| ||||| TCTGTTTTGG ACAATATCTC TTAATATGTC CAATCTCCCC ACACCCATAG CACTCTCTGG 420 GTTCATGCAT AGGTCTCTCA GAGAAGTGTT GACCGGTCGG AGGTGGACCC CCAACTACAG 4605 |||||||||| | |||||||| |||||||||| ||||||||| |||||||||| |||||||||| GTTCATGCAT AAGTCTCTCA GAGAAGTGTT GACCGGTCGA AGGTGGACCC CCAACTACAG 480 TCTGTAGTGA AGACTGAATT GGTCGGACTG AGTAACTTCC CGAACCCTGT CCTCTAGTGT 4545 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCTGTAGTGA AGACTGAATT GGTCGGACTG AGTAACTTCC CGAACCCTGT CCTCTAGTGT 540 AAGCACCATT AAACTCACC 4526 |||||||||| ||||||||| AAGCACCATT AAACTCACC 559 hqPGS_C06HBa0153O03.1-2-_SGN-E244046+ (5084 4526) ******************************************************************************** EST sequence 123 +strand 543 n (File: SGN-E355026+) 1 GAACACATCC AAAAATTCTC AAACTACCAA AACTAATTCA ATTGAGGGTA CTTGGATAGT 61 GTCATCCTTG AGATGTGCCA AGAAGGATAG ACAACCCTTA CTAACCATCT TCTGAGCACA 121 AAGAAAGGAG ATAATACGCA CCGGATTGGA AGTGTAGTCA CCCTCCCACA CTAACGGATC 181 TGTCCCAGGC TTGGCTAACG TCACCGTTTT AGCATTACAA TCCAAGATCG CAAAATTGGG 241 AGAAAGCCAA GTCATACCCA GAATTACATC AAAGTCACCC ATTTCTAAAG TAACCAAATC 301 TAGATAAGTG TTGCTCCCCA CGAAATTCAC CAAACAAGAC CTATGTACCT TTTCAACTAT 361 CACAGACTCA CCCACCGGAG TAGAAACACG AATAGGCATG TCAAGTAATT CACAATGCAA 421 TTTTAGACCA TTAGCAAATG ATGAAGATAC ATATGAAAAC GTGGAGCCAG GGTCAAATAA 481 TACAGATGCC ATGCAATCAC AAACCAAGAG ATTACCTGTG ATAACAGCAT CAGACGCCTC 541 TGC Predicted gene structure (within gDNA segment 7471 to 3181): Exon 1 5467 4925 ( 543 n); cDNA 1 543 ( 543 n); score: 0.895 MATCH C06HBa0153O03.1-2- SGN-E355026+ 0.895 543 1.000 C PGS_C06HBa0153O03.1-2-_SGN-E355026+ (5467 4925) Alignment (genomic DNA sequence = upper lines): GAACACATCC AGAAACTCAC GGACTACTGA AACCGACTCA ATCGAAGGCA CTTGGGTAGT 5408 |||||||||| | ||| || | ||||| | ||| | ||| || || || | ||||| |||| GAACACATCC AAAAATTCTC AAACTACCAA AACTAATTCA ATTGAGGGTA CTTGGATAGT 60 GTCATCCTTG AGATGTGCCA AGAAAGCTAA ACAACCTTTA CTAACCATTT TCTTAGCACG 5348 |||||||||| |||||||||| |||| | || |||||| ||| |||||||| | ||| ||||| GTCATCCTTG AGATGTGCCA AGAAGGATAG ACAACCCTTA CTAACCATCT TCTGAGCACA 120 AAGAAAGGAG ATGATATGCA CCGGATTGGA AGCGTTGTCA CCCTCCCACA CTAACGGATC 5288 |||||||||| || ||| ||| |||||||||| || || |||| |||||||||| |||||||||| AAGAAAGGAG ATAATACGCA CCGGATTGGA AGTGTAGTCA CCCTCCCACA CTAACGGATC 180 TGTCCCAGGC TTGGCTAACG TCACCGTTTT AGCATTACAA TCCAAGATCG CAAATTGCGG 5228 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||| | || TGTCCCAGGC TTGGCTAACG TCACCGTTTT AGCATTACAA TCCAAGATCG CAAAATTGGG 240 AGAAAGCCAA GTCATACCTA GAATTACATC AAAATCATCC ATTTCTAAGA TAACCAAATC 5168 |||||||||| |||||||| | |||||||||| ||| ||| || |||||||| |||||||||| AGAAAGCCAA GTCATACCCA GAATTACATC AAAGTCACCC ATTTCTAAAG TAACCAAATC 300 TACATAAGTG TTGCTCCCTA CAAAGTTCAC CAAAAAAGAC CTATACACCT TTTCAACTAC 5108 || ||||||| |||||||| | | || ||||| |||| ||||| |||| |||| ||||||||| TAGATAAGTG TTGCTCCCCA CGAAATTCAC CAAACAAGAC CTATGTACCT TTTCAACTAT 360 CACAGATTCA CCCACCGGAG TAGAAACACG AATAGGCATA TCAAGTAATT CACAATGTAA 5048 |||||| ||| |||||||||| |||||||||| ||||||||| |||||||||| ||||||| || CACAGACTCA CCCACCGGAG TAGAAACACG AATAGGCATG TCAAGTAATT CACAATGCAA 420 ATTTAGACCA TTAGCAAATG AGGAAGATAC ATAAGAAAAT GTGGATCCAG GATCAAACAA 4988 ||||||||| |||||||||| | |||||||| ||| ||||| ||||| |||| | ||||| || TTTTAGACCA TTAGCAAATG ATGAAGATAC ATATGAAAAC GTGGAGCCAG GGTCAAATAA 480 TACAGAGGCC ATGCAATCAC AAACCAGAAG ATTACCTGTG ATGACAGCAT CAGATGCCTC 4928 |||||| ||| |||||||||| |||||| || |||||||||| || ||||||| |||| ||||| TACAGATGCC ATGCAATCAC AAACCAAGAG ATTACCTGTG ATAACAGCAT CAGACGCCTC 540 CGC 4925 || TGC 543 hqPGS_C06HBa0153O03.1-2-_SGN-E355026+ (5467 4925) ******************************************************************************** EST sequence 120 +strand 761 n (File: SGN-E355244+) 1 CGGAAACTCC CATCCTTCTT CTTCACAAAC AAAATCGGAG CACCCCAAGG AGATGCACTT 61 GGTCTAATGA AACCTTTGCT CAACAACTCT TGAAGTTGGG CCTTTAACTC TCTTAACTCT 121 ACGGGAGCCA TTCTATAAGG GGGTATAGAA ATGGGGCGTG TGCCCGGTTC AAGATCGATA 181 CAGAAGTCAA TATCCCTATC TGGTGGCATA CCAGGAAGAT CTGCAGGGAA CACATCCAAA 241 AACTCACAGA CTATAAAAAC CGACTCAATC GAAGGTACTT GGGTAGTGTC ATCCTTGAGA 301 TGTGCCAAGA AAGCTAAACA CCCTTTACTA ACCATTTTCT TAGCACGAAG AAAGGAGATG 361 ATGCACACCG GATTGGAAGT GTAGTCACCC TCCCACACTA ACGGATCTGT CCCAGGCTTG 421 GCTAATGTCA CCGTTTTAGC ATTACAATCC AAGATCGCAA ATTGCGGAGA AAGCCAAGTC 481 ATACCCAGAA TTACATCAAA ATCAACCATT TCTAAGATAA CCAAATCTAC ATAAGTGTTG 541 CTCCCCACAA AGTTCACCAA ACAAGACCTA TATACCTTTT CAACTACCAC AGACTCACCC 601 ACCGGAGTAG AAACACGAAT AGGCATATCA AGTAATTCAC AATGTAAATT TAGACCGTTA 661 GCAAATGAGG AAGATACANT AGAAAACGTG GATCCCAGGA TCAACAATAC AGAAGCCATG 721 CAATCACAAT CGAAAGATTA CCTGTGATGA CAGCATCTAA T Predicted gene structure (within gDNA segment 6348 to 3577): Exon 1 5694 4933 ( 762 n); cDNA 1 761 ( 761 n); score: 0.943 MATCH C06HBa0153O03.1-2- SGN-E355244+ 0.943 762 1.001 C PGS_C06HBa0153O03.1-2-_SGN-E355244+ (5694 4933) Alignment (genomic DNA sequence = upper lines): CGAAAGCTCC CATCCTTCTT CTTTACAAAC AAAACCGGAG CACCCCAAGG AGATGCACTT 5635 || || |||| |||||||||| ||| |||||| |||| ||||| |||||||||| |||||||||| CGGAAACTCC CATCCTTCTT CTTCACAAAC AAAATCGGAG CACCCCAAGG AGATGCACTT 60 GGTCTAATAA AGCCTTTGTT CAATAACTCT TGAAGTTGTG CCTTTAACTC TCTTAACTCT 5575 |||||||| | | |||||| | ||| |||||| |||||||| | |||||||||| |||||||||| GGTCTAATGA AACCTTTGCT CAACAACTCT TGAAGTTGGG CCTTTAACTC TCTTAACTCT 120 GCGGGAGCCA TTCTATAAGG GGGTATAGAA ATGGGGCGTG TGCCCGGTTC TAGATCGATA 5515 ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| ACGGGAGCCA TTCTATAAGG GGGTATAGAA ATGGGGCGTG TGCCCGGTTC AAGATCGATA 180 CAGAAGTCAA TATCCCTATC TGGTGGCATA CCAGGAAGAT CTGCAGGGAA CACATCCAGA 5455 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||| | CAGAAGTCAA TATCCCTATC TGGTGGCATA CCAGGAAGAT CTGCAGGGAA CACATCCAAA 240 AACTCACGGA CTACTGAAAC CGACTCAATC GAAGGCACTT GGGTAGTGTC ATCCTTGAGA 5395 ||||||| || ||| |||| |||||||||| ||||| |||| |||||||||| |||||||||| AACTCACAGA CTATAAAAAC CGACTCAATC GAAGGTACTT GGGTAGTGTC ATCCTTGAGA 300 TGTGCCAAGA AAGCTAAACA ACCTTTACTA ACCATTTTCT TAGCACGAAG AAAGGAGATG 5335 |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| TGTGCCAAGA AAGCTAAACA CCCTTTACTA ACCATTTTCT TAGCACGAAG AAAGGAGATG 360 ATATGCACCG GATTGGAAGC GTTGTCACCC TCCCACACTA ACGGATCTGT CCCAGGCTTG 5275 || ||||| ||||||||| || ||||||| |||||||||| |||||||||| |||||||||| ATGCACACCG GATTGGAAGT GTAGTCACCC TCCCACACTA ACGGATCTGT CCCAGGCTTG 420 GCTAACGTCA CCGTTTTAGC ATTACAATCC AAGATCGCAA ATTGCGGAGA AAGCCAAGTC 5215 ||||| |||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCTAATGTCA CCGTTTTAGC ATTACAATCC AAGATCGCAA ATTGCGGAGA AAGCCAAGTC 480 ATACCTAGAA TTACATCAAA ATCATCCATT TCTAAGATAA CCAAATCTAC ATAAGTGTTG 5155 ||||| |||| |||||||||| |||| ||||| |||||||||| |||||||||| |||||||||| ATACCCAGAA TTACATCAAA ATCAACCATT TCTAAGATAA CCAAATCTAC ATAAGTGTTG 540 CTCCCTACAA AGTTCACCAA AAAAGACCTA TACACCTTTT CAACTACCAC AGATTCACCC 5095 ||||| |||| |||||||||| | |||||||| || ||||||| |||||||||| ||| |||||| CTCCCCACAA AGTTCACCAA ACAAGACCTA TATACCTTTT CAACTACCAC AGACTCACCC 600 ACCGGAGTAG AAACACGAAT AGGCATATCA AGTAATTCAC AATGTAAATT TAGACCATTA 5035 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||| ||| ACCGGAGTAG AAACACGAAT AGGCATATCA AGTAATTCAC AATGTAAATT TAGACCGTTA 660 GCAAATGAGG AAGATACATA AGAAAATGTG GAT-CCAGGA TCAAACAATA CAGAGGCCAT 4976 |||||||||| |||||||| |||||| ||| ||| |||||| || ||||||| |||| ||||| GCAAATGAGG AAGATACANT AGAAAACGTG GATCCCAGGA TC-AACAATA CAGAAGCCAT 719 GCAATCACAA ACCAGAAGAT TACCTGTGAT GACAGCATCA GAT 4933 |||||||||| | | ||||| |||||||||| ||||||||| || GCAATCACAA TCGA-AAGAT TACCTGTGAT GACAGCATCT AAT 761 hqPGS_C06HBa0153O03.1-2-_SGN-E355244+ (5694 4933) ******************************************************************************** EST sequence 92 +strand 331 n (File: SGN-E352716+) 1 GATCTGTCCC AGGCTTGGCT AACGTCACAG TTTTAGCATT ACAATCCAAG ATTGCAAAAT 61 TTGGAGAAAG CCAAGTCATA CCCAGAATTA CATCAAAATC AACCATTTCT AAGATAACCA 121 AGTCTACATA AGTATTGCTC CCCACAAAAG TCACCAGACC AGACCTATAT ACTTTTTCAA 181 CTATCACAGA CTCACCCACC GGAGTAGAAA CACGAATAGG CATGTCAAGC AATTCATAAT 241 TTAATTTAAG ACCAGTAGCA AATGAGGAAG ATACATATGA AAATGTGGAT CCAGGGTCAA 301 ATAATACAAA AGCCATGCAA TCACAAAAAA A Predicted gene structure (within gDNA segment 5891 to 4315): Exon 1 5291 4965 ( 327 n); cDNA 1 327 ( 327 n); score: 0.905 MATCH C06HBa0153O03.1-2- SGN-E352716+ 0.905 327 0.988 C PGS_C06HBa0153O03.1-2-_SGN-E352716+ (5291 4965) Alignment (genomic DNA sequence = upper lines): GATCTGTCCC AGGCTTGGCT AACGTCACCG TTTTAGCATT ACAATCCAAG ATCGCAAATT 5232 |||||||||| |||||||||| |||||||| | |||||||||| |||||||||| || ||||| | GATCTGTCCC AGGCTTGGCT AACGTCACAG TTTTAGCATT ACAATCCAAG ATTGCAAAAT 60 GCGGAGAAAG CCAAGTCATA CCTAGAATTA CATCAAAATC ATCCATTTCT AAGATAACCA 5172 |||||||| |||||||||| || ||||||| |||||||||| | |||||||| |||||||||| TTGGAGAAAG CCAAGTCATA CCCAGAATTA CATCAAAATC AACCATTTCT AAGATAACCA 120 AATCTACATA AGTGTTGCTC CCTACAAAGT TCACCAAAAA AGACCTATAC ACCTTTTCAA 5112 | |||||||| ||| |||||| || ||||| |||||| | ||||||||| || ||||||| AGTCTACATA AGTATTGCTC CCCACAAAAG TCACCAGACC AGACCTATAT ACTTTTTCAA 180 CTACCACAGA TTCACCCACC GGAGTAGAAA CACGAATAGG CATATCAAGT AATTCACAAT 5052 ||| |||||| ||||||||| |||||||||| |||||||||| ||| ||||| |||||| ||| CTATCACAGA CTCACCCACC GGAGTAGAAA CACGAATAGG CATGTCAAGC AATTCATAAT 240 GTAAATTTAG ACCATTAGCA AATGAGGAAG ATACATAAGA AAATGTGGAT CCAGGATCAA 4992 ||| || || |||| ||||| |||||||||| ||||||| || |||||||||| ||||| |||| TTAATTTAAG ACCAGTAGCA AATGAGGAAG ATACATATGA AAATGTGGAT CCAGGGTCAA 300 ACAATACAGA GGCCATGCAA TCACAAA 4965 | |||||| | ||||||||| ||||||| ATAATACAAA AGCCATGCAA TCACAAA 327 hqPGS_C06HBa0153O03.1-2-_SGN-E352716+ (5291 4965) ******************************************************************************** EST sequence 20 -strand 659 n (File: SGN-E352117-) 1 CCCAAGGGGA TGCACTTGGC NTAATGAAGC CCTTACNTAA CAACTCTTGA AGTTGGGCCT 61 TTAACTCCCC CAACTCAGCG GGAGCCATTC TATAAGGGGG TATGGAAATG GGGTGATCAC 121 CCGGCTCCAG ATCAATGCAA AAGTCAATAT CCCTATCCGG TGGCATACCA GGAAGGTCTG 181 CAGGAAACAC ATCCAGAAAC TCACGGACTA TCGAAACTGA CTCAATTGAA GGTACTTGGG 241 TAGTATCATC CCTGAGGTGT GCCAAGAAAG CTAAACACCC TTTACTAACC ATTCTCTTAG 301 CACAAAGAAA GGAGATGATA CGAACTGGAG TGGAAGTGTA GTCACCCTCC CACACTAACG 361 CATCTATCTC AGGCTTGGCC AACGTTACAG TTTTAGCATT ACAATCCAAG ATTGCAAAAT 421 TTGGAGACAG CCAAGTCATA CCCAGAATTA CATCGAAGTC AACCATTTAT AGGATAACCA 481 AGTCTACATG AGTATTGCTC CCACAAATGT CACAAGACAA GACCTATACA CCTTTTCAAC 541 AATCACAGAC TCACCCACCG GAGTAGAACA CGAATAGGTA TGTCAAGCGA TTCACTATGT 601 AAATTAAGAA CAGTAGCAAA TGAGGAAGAT ACATATGAAA ATGTGGAGGC AGGATCAAA Predicted gene structure (within gDNA segment 7521 to 3573): Exon 1 5651 4991 ( 661 n); cDNA 1 659 ( 659 n); score: 0.868 MATCH C06HBa0153O03.1-2- SGN-E352117- 0.868 661 1.003 C PGS_C06HBa0153O03.1-2-_SGN-E352117- (5651 4991) Alignment (genomic DNA sequence = upper lines): CCCAAGGAGA TGCACTTGGT CTAATAAAGC CTTTGTTCAA TAACTCTTGA AGTTGTGCCT 5592 ||||||| || ||||||||| |||| |||| | || || ||||||||| ||||| |||| CCCAAGGGGA TGCACTTGGC NTAATGAAGC CCTTACNTAA CAACTCTTGA AGTTGGGCCT 60 TTAACTCTCT TAACTCTGCG GGAGCCATTC TATAAGGGGG TATAGAAATG GGGCGTGTGC 5532 ||||||| | ||||| ||| |||||||||| |||||||||| ||| |||||| ||| | | TTAACTCCCC CAACTCAGCG GGAGCCATTC TATAAGGGGG TATGGAAATG GGGTGATCAC 120 CCGGTTCTAG ATCGATACAG AAGTCAATAT CCCTATCTGG TGGCATACCA GGAAGATCTG 5472 |||| || || ||| || || |||||||||| ||||||| || |||||||||| ||||| |||| CCGGCTCCAG ATCAATGCAA AAGTCAATAT CCCTATCCGG TGGCATACCA GGAAGGTCTG 180 CAGGGAACAC ATCCAGAAAC TCACGGACTA CTGAAACCGA CTCAATCGAA GGCACTTGGG 5412 |||| ||||| |||||||||| |||||||||| ||||| || |||||| ||| || ||||||| CAGGAAACAC ATCCAGAAAC TCACGGACTA TCGAAACTGA CTCAATTGAA GGTACTTGGG 240 TAGTGTCATC CTTGAGATGT GCCAAGAAAG CTAAACAACC TTTACTAACC ATTTTCTTAG 5352 |||| ||||| | |||| ||| |||||||||| ||||||| || |||||||||| ||| |||||| TAGTATCATC CCTGAGGTGT GCCAAGAAAG CTAAACACCC TTTACTAACC ATTCTCTTAG 300 CACGAAGAAA GGAGATGATA TGCACCGGAT TGGAAGCGTT GTCACCCTCC CACACTAACG 5292 ||| |||||| |||||||||| | || ||| |||||| || |||||||||| |||||||||| CACAAAGAAA GGAGATGATA CGAACTGGAG TGGAAGTGTA GTCACCCTCC CACACTAACG 360 GATCTGTCCC AGGCTTGGCT AACGTCACCG TTTTAGCATT ACAATCCAAG ATCGCAAATT 5232 |||| || | ||||||||| ||||| || | |||||||||| |||||||||| || ||||| | CATCTATCTC AGGCTTGGCC AACGTTACAG TTTTAGCATT ACAATCCAAG ATTGCAAAAT 420 GCGGAGAAAG CCAAGTCATA CCTAGAATTA CATCAAAATC ATCCATTTCT AAGATAACCA 5172 ||||| || |||||||||| || ||||||| |||| || || | |||||| | | |||||||| TTGGAGACAG CCAAGTCATA CCCAGAATTA CATCGAAGTC AACCATTTAT AGGATAACCA 480 AATCTACATA AGTGTTGCTC CCTACAAAGT TCACCAAAAA AGACCTATAC ACCTTTTCAA 5112 | ||||||| ||| |||||| || ||||| |||| | | | |||||||||| |||||||||| AGTCTACATG AGTATTGCTC CC-ACAAATG TCACAAGACA AGACCTATAC ACCTTTTCAA 539 CTACCACAGA TTCACCCACC GGAGTAGAAA CACGAATAGG CATATCAAGT AATTCACAAT 5052 | | |||||| ||||||||| ||||||| || |||||||||| || ||||| |||||| || CAATCACAGA CTCACCCACC GGAGTAG-AA CACGAATAGG TATGTCAAGC GATTCACTAT 598 GTAAATTTAG ACCATTAGCA AATGAGGAAG ATACATAAGA AAATGTGGAT CCAGGATCAA 4992 ||||||| || | || ||||| |||||||||| ||||||| || ||||||||| ||||||||| GTAAATTAAG AACAGTAGCA AATGAGGAAG ATACATATGA AAATGTGGAG GCAGGATCAA 658 A 4991 | A 659 hqPGS_C06HBa0153O03.1-2-_SGN-E352117- (5651 4991) ******************************************************************************** EST sequence 97 +strand 661 n (File: SGN-E351414+) 1 CGGAAACTCC CATCCTTCTT CTTCACAAAC AAAATCGGAG CACCCCAAGG AGATGCACTT 61 GGTCTAATGA AACCTTTGCT CAACAACTCT TGAAGTTGGG CCTTTAACTC TCTTAACTCT 121 ACGGGAGCCA TTCTATAAGG GGGTATAGAA ATGGGGCGTG TGCCCGGTTC AAGATCGATA 181 CAGAAGTCAA TATCCCTATC TGGTGGCATA CCAGGAAGAT CTGCAGGGAA CACATCCAAA 241 AACTCACAGA CTATAAAAAC CGACTCAATC GAAGGTACTT GGGTAGTGTC ATCCTTGAGA 301 TGTGCCAAGA AAGCTAAACA CCCTTTACTA ACCATTTTCT TAGCACGAAG AAAGGAGATG 361 ATGCACACCG GATTGGAAGT GTAGTCACCC TCCCACACTA ACGGATCTGT CCCAGGCTTG 421 GCTAATGTCA CCGTTTTAGC ATTACAATCC AAGATCGCAA ATTGCGGAGA AAGCCAAGTC 481 ATACCCAGAA TTACATCAAA ATCAACCATT TCTAAGATAA CCAAATCTAC ATAAGTGTTG 541 CTCCCCACAA AGTTCACCAA ACAAGACCTA TATACCTTTT CAACTACCAC AGACTCACCC 601 ACCGGAGTAG AAACACGAAT AGGCATATCA AGTAATTCAC AATGTAAATT TAGACCGTTA 661 G Predicted gene structure (within gDNA segment 6348 to 4379): Exon 1 5694 5034 ( 661 n); cDNA 1 661 ( 661 n); score: 0.953 MATCH C06HBa0153O03.1-2- SGN-E351414+ 0.953 661 1.000 C PGS_C06HBa0153O03.1-2-_SGN-E351414+ (5694 5034) Alignment (genomic DNA sequence = upper lines): CGAAAGCTCC CATCCTTCTT CTTTACAAAC AAAACCGGAG CACCCCAAGG AGATGCACTT 5635 || || |||| |||||||||| ||| |||||| |||| ||||| |||||||||| |||||||||| CGGAAACTCC CATCCTTCTT CTTCACAAAC AAAATCGGAG CACCCCAAGG AGATGCACTT 60 GGTCTAATAA AGCCTTTGTT CAATAACTCT TGAAGTTGTG CCTTTAACTC TCTTAACTCT 5575 |||||||| | | |||||| | ||| |||||| |||||||| | |||||||||| |||||||||| GGTCTAATGA AACCTTTGCT CAACAACTCT TGAAGTTGGG CCTTTAACTC TCTTAACTCT 120 GCGGGAGCCA TTCTATAAGG GGGTATAGAA ATGGGGCGTG TGCCCGGTTC TAGATCGATA 5515 ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| ACGGGAGCCA TTCTATAAGG GGGTATAGAA ATGGGGCGTG TGCCCGGTTC AAGATCGATA 180 CAGAAGTCAA TATCCCTATC TGGTGGCATA CCAGGAAGAT CTGCAGGGAA CACATCCAGA 5455 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||| | CAGAAGTCAA TATCCCTATC TGGTGGCATA CCAGGAAGAT CTGCAGGGAA CACATCCAAA 240 AACTCACGGA CTACTGAAAC CGACTCAATC GAAGGCACTT GGGTAGTGTC ATCCTTGAGA 5395 ||||||| || ||| |||| |||||||||| ||||| |||| |||||||||| |||||||||| AACTCACAGA CTATAAAAAC CGACTCAATC GAAGGTACTT GGGTAGTGTC ATCCTTGAGA 300 TGTGCCAAGA AAGCTAAACA ACCTTTACTA ACCATTTTCT TAGCACGAAG AAAGGAGATG 5335 |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| TGTGCCAAGA AAGCTAAACA CCCTTTACTA ACCATTTTCT TAGCACGAAG AAAGGAGATG 360 ATATGCACCG GATTGGAAGC GTTGTCACCC TCCCACACTA ACGGATCTGT CCCAGGCTTG 5275 || ||||| ||||||||| || ||||||| |||||||||| |||||||||| |||||||||| ATGCACACCG GATTGGAAGT GTAGTCACCC TCCCACACTA ACGGATCTGT CCCAGGCTTG 420 GCTAACGTCA CCGTTTTAGC ATTACAATCC AAGATCGCAA ATTGCGGAGA AAGCCAAGTC 5215 ||||| |||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCTAATGTCA CCGTTTTAGC ATTACAATCC AAGATCGCAA ATTGCGGAGA AAGCCAAGTC 480 ATACCTAGAA TTACATCAAA ATCATCCATT TCTAAGATAA CCAAATCTAC ATAAGTGTTG 5155 ||||| |||| |||||||||| |||| ||||| |||||||||| |||||||||| |||||||||| ATACCCAGAA TTACATCAAA ATCAACCATT TCTAAGATAA CCAAATCTAC ATAAGTGTTG 540 CTCCCTACAA AGTTCACCAA AAAAGACCTA TACACCTTTT CAACTACCAC AGATTCACCC 5095 ||||| |||| |||||||||| | |||||||| || ||||||| |||||||||| ||| |||||| CTCCCCACAA AGTTCACCAA ACAAGACCTA TATACCTTTT CAACTACCAC AGACTCACCC 600 ACCGGAGTAG AAACACGAAT AGGCATATCA AGTAATTCAC AATGTAAATT TAGACCATTA 5035 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||| ||| ACCGGAGTAG AAACACGAAT AGGCATATCA AGTAATTCAC AATGTAAATT TAGACCGTTA 660 G 5034 | G 661 hqPGS_C06HBa0153O03.1-2-_SGN-E351414+ (5694 5034) ******************************************************************************** EST sequence 67 +strand 560 n (File: SGN-E242765+) 1 TGGTTCTAGC ATTAGGACAT CATAGCTCAT TTGATTATTT CTCATCTCAT AATTAGTATT 61 TAGTATTCCC TCAATTTAAT AATTTCATTA AAGTGTTCAT AGAGACTTAT CTCTTCATTA 121 GCTTTACACT ATAAAAGGTG AGTAAGTGTT GGTAATATTT ACTTAGGCTT ATTTGCTATT 181 GAAACCGACT CAATCGAAGG TACTTGGGTA GTGTCATCAT TGAGATGTGC TAAGAAAGAT 241 AAACACCTTT TACTAATCAT TTTCTTAGCA AGAAGAAAGG AGACGATGCG GACCGGATTG 301 GAAGTGTAGT CACCCTCTCA CACTAACGGG TCTATCCCAG GCTTGGCTAA CGTCACCGTT 361 TTAGCATTAC AATCCAAGAT TGCAAATTGC GGAGAAAGCA AAGTCATACC CAGAATCACA 421 TCAAAATCAT CCATTGCCAA GATAACCAAA TCTACATAAG TGTTGCTCCC CACAAAGTTT 481 ACGACACAAG ACCTATATAC TTTTTCAACT ACCTCAGACT CACCCACCGG AGTAGAAACA 541 CGAATAGGCA TATCAAGAAA Predicted gene structure (within gDNA segment 7830 to 4423): Exon 1 5443 5060 ( 384 n); cDNA 177 560 ( 384 n); score: 0.911 MATCH C06HBa0153O03.1-2- SGN-E242765+ 0.911 384 0.686 C PGS_C06HBa0153O03.1-2-_SGN-E242765+ (5443 5060) Alignment (genomic DNA sequence = upper lines): TACTGAAACC GACTCAATCG AAGGCACTTG GGTAGTGTCA TCCTTGAGAT GTGCCAAGAA 5384 || ||||||| |||||||||| |||| ||||| |||||||||| || ||||||| |||| ||||| TATTGAAACC GACTCAATCG AAGGTACTTG GGTAGTGTCA TCATTGAGAT GTGCTAAGAA 236 AGCTAAACAA CCTTTACTAA CCATTTTCTT AGCACGAAGA AAGGAGATGA TATGCACCGG 5324 || |||||| | |||||||| ||||||||| |||| ||||| ||||||| || | | ||||| AGATAAACAC CTTTTACTAA TCATTTTCTT AGCAAGAAGA AAGGAGACGA TGCGGACCGG 296 ATTGGAAGCG TTGTCACCCT CCCACACTAA CGGATCTGTC CCAGGCTTGG CTAACGTCAC 5264 |||||||| | | |||||||| | |||||||| ||| ||| || |||||||||| |||||||||| ATTGGAAGTG TAGTCACCCT CTCACACTAA CGGGTCTATC CCAGGCTTGG CTAACGTCAC 356 CGTTTTAGCA TTACAATCCA AGATCGCAAA TTGCGGAGAA AGCCAAGTCA TACCTAGAAT 5204 |||||||||| |||||||||| |||| ||||| |||||||||| ||| |||||| |||| ||||| CGTTTTAGCA TTACAATCCA AGATTGCAAA TTGCGGAGAA AGCAAAGTCA TACCCAGAAT 416 TACATCAAAA TCATCCATTT CTAAGATAAC CAAATCTACA TAAGTGTTGC TCCCTACAAA 5144 ||||||||| ||||||||| | |||||||| |||||||||| |||||||||| |||| ||||| CACATCAAAA TCATCCATTG CCAAGATAAC CAAATCTACA TAAGTGTTGC TCCCCACAAA 476 GTTCACCAAA AAAGACCTAT ACACCTTTTC AACTACCACA GATTCACCCA CCGGAGTAGA 5084 ||| || | | ||||||||| | || ||||| ||||||| || || ||||||| |||||||||| GTTTACGACA CAAGACCTAT ATACTTTTTC AACTACCTCA GACTCACCCA CCGGAGTAGA 536 AACACGAATA GGCATATCAA GTAA 5060 |||||||||| |||||||||| | || AACACGAATA GGCATATCAA GAAA 560 hqPGS_C06HBa0153O03.1-2-_SGN-E242765+ (5443 5060) ******************************************************************************** EST sequence 119 +strand 658 n (File: SGN-E355232+) 1 TCAATGCGAG GAAGAGGATA CTTGTTCTTA ATAGTTACCT TGTTCAGTTG TCTGTAGTCT 61 ATGCACATTC TAAAACTTCC ATCCTTCTTC TTCACAAATA AAACTGGAGC ACCCCAAGGA 121 GATGCACTTG GTCTAATGAA GACCTTGCTC AACAACTCTT GAAGTTGAGC CTTTAACTCT 181 CTTAATTCGG CGGGAGCCAT TCTATAAGGG GGTATAGAAA TGGGGCGTGT GCCTGGTTCA 241 AGATCGATAC AGAAGTCAAT ATCCCTATCT GGTGGCATAC CAGGAAGATC TGCAGGAAAC 301 ACGTCCAAAA ACTCGCGGAC TACCGAAACC GACTCAATTG AGGGTACTTG AGTAGTGTCA 361 TCCTTGAGAT GTGCCAAGAA AGCTGAACAC CCTTTACTAA CCATTTTCTT AGCACGAAGA 421 AAGGATATGA TACGCACCAG ATTGGAAGTG TAGTCACCCT CCCACACTAA CGGATCTGTC 481 CCAGGCTTGG CTAACGTCAC CGTTTTAGCA TTACAATCCA AGATCGCAAA ATTTGGAGAA 541 AGCCAAGTCA TACCCAGAAT TACATCAATG TCACCCATTT CTAAAAAACC AAATCTACAT 601 AAGTGTTGCT CCCCACGAAA TTCNACAAAA CAGAACTATG TACCTTTTCA ACTACCAC Predicted gene structure (within gDNA segment 6444 to 3344): Exon 1 5763 5105 ( 659 n); cDNA 1 658 ( 658 n); score: 0.917 MATCH C06HBa0153O03.1-2- SGN-E355232+ 0.917 659 1.002 C PGS_C06HBa0153O03.1-2-_SGN-E355232+ (5763 5105) Alignment (genomic DNA sequence = upper lines): TCAATGCGGG GAAGAGGATA CTTGTTCTTT ATGGTTACCT TGTTTAGTTG TCTGTAGTCT 5704 |||||||| | |||||||||| ||||||||| || ||||||| |||| ||||| |||||||||| TCAATGCGAG GAAGAGGATA CTTGTTCTTA ATAGTTACCT TGTTCAGTTG TCTGTAGTCT 60 ATACACATTC GAAAGCTCCC ATCCTTCTTC TTTACAAACA AAACCGGAGC ACCCCAAGGA 5644 || ||||||| ||| || || |||||||||| || ||||| | |||| ||||| |||||||||| ATGCACATTC TAAAACTTCC ATCCTTCTTC TTCACAAATA AAACTGGAGC ACCCCAAGGA 120 GATGCACTTG GTCTAATAAA GCCTTTGTTC AATAACTCTT GAAGTTGTGC CTTTAACTCT 5584 |||||||||| ||||||| || | | ||| || || ||||||| ||||||| || |||||||||| GATGCACTTG GTCTAATGAA GACCTTGCTC AACAACTCTT GAAGTTGAGC CTTTAACTCT 180 CTTAACTCTG CGGGAGCCAT TCTATAAGGG GGTATAGAAA TGGGGCGTGT GCCCGGTTCT 5524 ||||| || | |||||||||| |||||||||| |||||||||| |||||||||| ||| ||||| CTTAATTCGG CGGGAGCCAT TCTATAAGGG GGTATAGAAA TGGGGCGTGT GCCTGGTTCA 240 AGATCGATAC AGAAGTCAAT ATCCCTATCT GGTGGCATAC CAGGAAGATC TGCAGGGAAC 5464 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||| ||| AGATCGATAC AGAAGTCAAT ATCCCTATCT GGTGGCATAC CAGGAAGATC TGCAGGAAAC 300 ACATCCAGAA ACTCACGGAC TACTGAAACC GACTCAATCG AAGGCACTTG GGTAGTGTCA 5404 || |||| || |||| ||||| ||| |||||| |||||||| | | || ||||| ||||||||| ACGTCCAAAA ACTCGCGGAC TACCGAAACC GACTCAATTG AGGGTACTTG AGTAGTGTCA 360 TCCTTGAGAT GTGCCAAGAA AGCTAAACAA CCTTTACTAA CCATTTTCTT AGCACGAAGA 5344 |||||||||| |||||||||| |||| |||| |||||||||| |||||||||| |||||||||| TCCTTGAGAT GTGCCAAGAA AGCTGAACAC CCTTTACTAA CCATTTTCTT AGCACGAAGA 420 AAGGAGATGA TATGCACCGG ATTGGAAGCG TTGTCACCCT CCCACACTAA CGGATCTGTC 5284 ||||| |||| || ||||| | |||||||| | | |||||||| |||||||||| |||||||||| AAGGATATGA TACGCACCAG ATTGGAAGTG TAGTCACCCT CCCACACTAA CGGATCTGTC 480 CCAGGCTTGG CTAACGTCAC CGTTTTAGCA TTACAATCCA AGATCGCAAA TTGCGGAGAA 5224 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| | |||||| CCAGGCTTGG CTAACGTCAC CGTTTTAGCA TTACAATCCA AGATCGCAAA ATTTGGAGAA 540 AGCCAAGTCA TACCTAGAAT TACATCAAAA TCATCCATTT CTAAGATAAC CAAATCTACA 5164 |||||||||| |||| ||||| |||||||| ||| |||||| |||| | ||| |||||||||| AGCCAAGTCA TACCCAGAAT TACATCAATG TCACCCATTT CTAA-AAAAC CAAATCTACA 599 TAAGTGTTGC TCCCTACAAA GTTCACCAAA AAAGACCTAT ACACCTTTTC AACTACCAC 5105 |||||||||| |||| || || ||| |||| | ||| |||| |||||||| ||||||||| TAAGTGTTGC TCCCCACGAA ATTCNACAAA ACAGAACTAT GTACCTTTTC AACTACCAC 658 hqPGS_C06HBa0153O03.1-2-_SGN-E355232+ (5763 5105) ******************************************************************************** EST sequence 125 +strand 679 n (File: SGN-E368762+) 1 GTCTCACCCA ATTCTTCACT GTCTCAATCT TACAAGGATC CACCATCACT CCATCCTTAG 61 AAACCACGTG CCCCAAGAAG GACACTGCAT CTATCCAAAA CTCACACTTA GAGAACTTGG 121 CATAAAGCTT TTTCTCCCTC AACATTTCTA ATACAATTCT CAAATGCTCC TCATGTTCCT 181 TCTTACTATT AGAGTATATC AGTATATCAT CAATAAACAT GATGACAAAG AGATCCAAAT 241 ATGGCTTAAA ATCCCGTTCA TCAAGCTCAT GAACGCAGCA GGGGCATTCG TAAGACCAAA 301 AGACATCACT ACAAATTCGT AATGCCCATA CCTGGTTCTA AAAGCAGTCT TTGGCACATC 361 CGTTGCCCGT ATTTTCAATT GATGATAACC GGATCTCAAG TCAATCTTAG AGAAGACACA 421 AGCACCTTGT AACTGATCGA ACAAGTCATC AATGCGAGGA AGTGGATACT TGTTCTTAAT 481 AGTTACCTTG TTCAACTGCC GGTAGTCTAT GCACATCCGA AAACTCCCAT CCTTCTTCTT 541 CACAAACCAA ACCGGAGCAC CCCAAGGAGA TGCACTTGGT CTAATGAAGA CTTTGCTCAA 601 AAACTCTTGA AGTTGGGCCT TTAACTCTCT TAACTCCGCG GGAGCCATTC TATAAGGGGG 661 TATAGAAATG GGGCGAGTG Predicted gene structure (within gDNA segment 6857 to 4887): Exon 1 6212 5533 ( 680 n); cDNA 1 679 ( 679 n); score: 0.934 MATCH C06HBa0153O03.1-2- SGN-E368762+ 0.934 680 1.001 C PGS_C06HBa0153O03.1-2-_SGN-E368762+ (6212 5533) Alignment (genomic DNA sequence = upper lines): GTCTTACCCA ATTCTTCACT GTCTCAATCT TAGAAGGATC CACCATCACT CCATCCTTAG 6153 |||| ||||| |||||||||| |||||||||| || ||||||| |||||||||| |||||||||| GTCTCACCCA ATTCTTCACT GTCTCAATCT TACAAGGATC CACCATCACT CCATCCTTAG 60 AAACCACGTG CCCCAAGAAG GACACTGCAT CTAGCCAAAA CTCACACTTA GAGAATTTGG 6093 |||||||||| |||||||||| |||||||||| ||| |||||| |||||||||| ||||| |||| AAACCACGTG CCCCAAGAAG GACACTGCAT CTATCCAAAA CTCACACTTA GAGAACTTGG 120 CATAAAGCTT TTTCTCCCTC AACATTTCCA ATACCATTCT CAAATGCTCT TCATGTTCCT 6033 |||||||||| |||||||||| |||||||| | |||| ||||| ||||||||| |||||||||| CATAAAGCTT TTTCTCCCTC AACATTTCTA ATACAATTCT CAAATGCTCC TCATGTTCCT 180 TCTTGCTCTT TGAGTATACC AATATATCAT CAATAAATAC GATCACGAAG AGGTCCAAAT 5973 |||| || || ||||||| | | |||||||| ||||||| | ||| || ||| || ||||||| TCTTACTATT AGAGTATATC AGTATATCAT CAATAAACAT GATGACAAAG AGATCCAAAT 240 ATGGCTTAAA AATCCCGTTC ATCAAGCTCA TGAACGCAAC AGGGGCGTTC ATAAGACCAA 5913 ||||||| || |||||||||| |||||||||| |||||||| | |||||| ||| ||||||||| ATGGCTT-AA AATCCCGTTC ATCAAGCTCA TGAACGCAGC AGGGGCATTC GTAAGACCAA 299 AAGACATCAC TACAAATTTG TAATGCCCAT ACCTCGTTCG AAAAGCAGTC TTTGGCACAT 5853 |||||||||| |||||||| | |||||||||| |||| |||| |||||||||| |||||||||| AAGACATCAC TACAAATTCG TAATGCCCAT ACCTGGTTCT AAAAGCAGTC TTTGGCACAT 359 CCGTTGCCCG TATTTTCAAT TGATGATAAC CGGATCTCAA GTCAATCTTA GAGAAGACAC 5793 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCGTTGCCCG TATTTTCAAT TGATGATAAC CGGATCTCAA GTCAATCTTA GAGAAGACAC 419 AAGCACCTTG TAACTGATCG AACAAGTCAT CAATGCGGGG AAGAGGATAC TTGTTCTTTA 5733 |||||||||| |||||||||| |||||||||| ||||||| || ||| |||||| |||||||| | AAGCACCTTG TAACTGATCG AACAAGTCAT CAATGCGAGG AAGTGGATAC TTGTTCTTAA 479 TGGTTACCTT GTTTAGTTGT CTGTAGTCTA TACACATTCG AAAGCTCCCA TCCTTCTTCT 5673 | |||||||| ||| | || | |||||||| | ||||| || ||| |||||| |||||||||| TAGTTACCTT GTTCAACTGC CGGTAGTCTA TGCACATCCG AAAACTCCCA TCCTTCTTCT 539 TTACAAACAA AACCGGAGCA CCCCAAGGAG ATGCACTTGG TCTAATAAAG CCTTTGTTCA 5613 | |||||| | |||||||||| |||||||||| |||||||||| |||||| ||| ||||| ||| TCACAAACCA AACCGGAGCA CCCCAAGGAG ATGCACTTGG TCTAATGAAG ACTTTGCTCA 599 ATAACTCTTG AAGTTGTGCC TTTAACTCTC TTAACTCTGC GGGAGCCATT CTATAAGGGG 5553 | |||||||| |||||| ||| |||||||||| ||||||| || |||||||||| |||||||||| AAAACTCTTG AAGTTGGGCC TTTAACTCTC TTAACTCCGC GGGAGCCATT CTATAAGGGG 659 GTATAGAAAT GGGGCGTGTG 5533 |||||||||| |||||| ||| GTATAGAAAT GGGGCGAGTG 679 hqPGS_C06HBa0153O03.1-2-_SGN-E368762+ (6212 5533) ******************************************************************************** EST sequence 68 +strand 712 n (File: SGN-E379315+) 1 GAATCCCTTG ACAAATCGGC GGTAGTAGCT AGCTAACCCA ACAAAGCTCC TTATTTCTGA 61 CACATTAGTA GGTCTTACCC AATTCTTCAC TGTCTCAATC TTAGAAGGAT CCACCATCAC 121 TCCATCCTTA GAAACCACGT TCCCCAAGAA GGACACTACA TCTAGCCAAA ACTCACACTT 181 AGAGAACTTG GCATAAAACT TTTTCTCCCT CAACATTTCC AATACCATTC TCAAATGCTC 241 TTCATGTTCC TCCTTACTTT TAGAGTATAT CAGTATATCA TCAATAAATA CGATCACAAA 301 GAGATCCAAA TATGGCTTAA AAATCCCTTT TATTAAACTC ATAAACGCAG CAGGGGCATT 361 CATAAGACCA AAAGACATCA CTACAAATTT GTAATGCCCA TACCTGGTTC TAAAAGCAGT 421 CTTTGGCACA TCCGTTGCCC GTATTTTCAA TTGATGATAA CCGGATCTCA AGTCAATCTT 481 AGAAAAGACA CAAGCACCTT GTAACTGATC GAACAAGTCA TTAATGCGAG GAAGAGGATA 541 CTTGTTCTTT ATGGTTACCT TGTTNCACTG CCGGTAGCCT ATGCACATCC GAAAACTCCA 601 ATCCTTCTTC TTTACAAACA AAACCGGAGC ACCCCAAGGA GATGCAGTTG GTCTAATGAA 661 GCCTTTGCTC ACAACTCTTT GAAGTGGTCT TTTAACTCTC TTAACTCTGC GG Predicted gene structure (within gDNA segment 6883 to 4368): Exon 1 6283 5571 ( 713 n); cDNA 1 712 ( 712 n); score: 0.935 MATCH C06HBa0153O03.1-2- SGN-E379315+ 0.935 713 1.001 C PGS_C06HBa0153O03.1-2-_SGN-E379315+ (6283 5571) Alignment (genomic DNA sequence = upper lines): GAATCCCTTG ACAAATCGAC GGTAGTAGCT AGCTAACCCA ACAAAGCTCC TTATTTCTGA 6224 |||||||||| |||||||| | |||||||||| |||||||||| |||||||||| |||||||||| GAATCCCTTG ACAAATCGGC GGTAGTAGCT AGCTAACCCA ACAAAGCTCC TTATTTCTGA 60 CACATTAGTA GGTCTTACCC AATTCTTCAC TGTCTCAATC TTAGAAGGAT CCACCATCAC 6164 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CACATTAGTA GGTCTTACCC AATTCTTCAC TGTCTCAATC TTAGAAGGAT CCACCATCAC 120 TCCATCCTTA GAAACCACGT GCCCCAAGAA GGACACTGCA TCTAGCCAAA ACTCACACTT 6104 |||||||||| |||||||||| ||||||||| ||||||| || |||||||||| |||||||||| TCCATCCTTA GAAACCACGT TCCCCAAGAA GGACACTACA TCTAGCCAAA ACTCACACTT 180 AGAGAATTTG GCATAAAGCT TTTTCTCCCT CAACATTTCC AATACCATTC TCAAATGCTC 6044 |||||| ||| ||||||| || |||||||||| |||||||||| |||||||||| |||||||||| AGAGAACTTG GCATAAAACT TTTTCTCCCT CAACATTTCC AATACCATTC TCAAATGCTC 240 TTCATGTTCC TTCTTGCTCT TTGAGTATAC CAATATATCA TCAATAAATA CGATCACGAA 5984 |||||||||| | ||| || | | ||||||| || ||||||| |||||||||| ||||||| || TTCATGTTCC TCCTTACTTT TAGAGTATAT CAGTATATCA TCAATAAATA CGATCACAAA 300 GAGGTCCAAA TATGGCTTAA AAATCCCGTT CATCAAGCTC ATGAACGCAA CAGGGGCGTT 5924 ||| |||||| |||||||||| ||||||| || || || ||| || |||||| ||||||| || GAGATCCAAA TATGGCTTAA AAATCCCTTT TATTAAACTC ATAAACGCAG CAGGGGCATT 360 CATAAGACCA AAAGACATCA CTACAAATTT GTAATGCCCA TACCTCGTTC GAAAAGCAGT 5864 |||||||||| |||||||||| |||||||||| |||||||||| ||||| |||| ||||||||| CATAAGACCA AAAGACATCA CTACAAATTT GTAATGCCCA TACCTGGTTC TAAAAGCAGT 420 CTTTGGCACA TCCGTTGCCC GTATTTTCAA TTGATGATAA CCGGATCTCA AGTCAATCTT 5804 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTTGGCACA TCCGTTGCCC GTATTTTCAA TTGATGATAA CCGGATCTCA AGTCAATCTT 480 AGAGAAGACA CAAGCACCTT GTAACTGATC GAACAAGTCA TCAATGCGGG GAAGAGGATA 5744 ||| |||||| |||||||||| |||||||||| |||||||||| | |||||| | |||||||||| AGAAAAGACA CAAGCACCTT GTAACTGATC GAACAAGTCA TTAATGCGAG GAAGAGGATA 540 CTTGTTCTTT ATGGTTACCT TGTTTAGTTG TCTGTAGTCT ATACACATTC GAAAGCTCCC 5684 |||||||||| |||||||||| |||| || | |||| || || ||||| | |||| |||| CTTGTTCTTT ATGGTTACCT TGTTNCACTG CCGGTAGCCT ATGCACATCC GAAAACTCCA 600 ATCCTTCTTC TTTACAAACA AAACCGGAGC ACCCCAAGGA GATGCACTTG GTCTAATAAA 5624 |||||||||| |||||||||| |||||||||| |||||||||| |||||| ||| ||||||| || ATCCTTCTTC TTTACAAACA AAACCGGAGC ACCCCAAGGA GATGCAGTTG GTCTAATGAA 660 GCCTTTGTTC AATAACTC-T TGAAGTTGTG CCTTTAACTC TCTTAACTCT GCGG 5571 ||||||| || | ||||| | |||||| || | |||||||| |||||||||| |||| GCCTTTGCTC -ACAACTCTT TGAAGTGGT- CTTTTAACTC TCTTAACTCT GCGG 712 hqPGS_C06HBa0153O03.1-2-_SGN-E379315+ (6283 5571) ******************************************************************************** EST sequence 49 +strand 709 n (File: SGN-E578271+) 1 GTTCTTTATG GTTACCTTGT TTAGTTGTCT GTAGTCTATA TACATTCGAA AACTCCCATC 61 CTTCTTCTTT ACAAACAAAA CCGGAGCACC CAAGGGGATG CACTTGGTCT AATGAAGCCT 121 TTGTTGCTAC AAAGATATGA CCTATATATC ATATCTTGAC TGGTTCTTTA GATCCAGATA 181 ATGCGAAGTG ATGGGTTGGT TATTAGTTCT ATAGTTTTTA GTTCATACTA TGTGGGCTGG 241 GTTTTTTTAA TCCTAACCCT AACAAAACCC ACGAGTCACA CACTAAGCAT AGCAATTATA 301 TCAAATGGTC AATCGAATTT TTATTCAACC TTATAGAATT AAGAATTAGA AAGAATTAAG 361 AATTAGAAAT GTTCCCCTTG ATTAGAAAAA GAATGAATTG GTCTTTTTTT TTGTTCAATC 421 ATTGGATAGA AGGGAAAGAC AAGTAGTAAA ATTATTCCTC GTCTAGAAAT ATCCAAATTT 481 TGATGCCCAA TATTCCATAG ATAGTTCGAA CTGTATAAGA GCAATAATCA ATTTTAGCTC 541 GAATCGTTTG TAGGGGAACC CTGCCTTCTC TGATCCATTC GACACGTGCA ATTTCTTTTC 601 CGTCGATACG CCCCGCAATT TGTATTTGAA TTCCTTGTGT ATCCGCTTGT TCTGTTAATT 661 CAATAGCCTT TTTCATTGCT TTTCGAAATG AAACTCTATT CTTTAATTG Predicted gene structure (within gDNA segment 6340 to 1): Exon 1 5740 5614 ( 127 n); cDNA 1 126 ( 126 n); score: 0.953 Intron 1 5613 960 (4654 n); Pd: 0.000 (s: 0.92), Pa: 0.745 (s: 0) Exon 2 959 953 ( 7 n); cDNA 127 133 ( 7 n); score: 1.000 MATCH C06HBa0153O03.1-2- SGN-E578271+ 0.953 134 0.189 C PGS_C06HBa0153O03.1-2-_SGN-E578271+ (5740 5614,959 953) Alignment (genomic DNA sequence = upper lines): GTTCTTTATG GTTACCTTGT TTAGTTGTCT GTAGTCTATA CACATTCGAA AGCTCCCATC 5681 |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| | |||||||| GTTCTTTATG GTTACCTTGT TTAGTTGTCT GTAGTCTATA TACATTCGAA AACTCCCATC 60 CTTCTTCTTT ACAAACAAAA CCGGAGCACC CCAAGGAGAT GCACTTGGTC TAATAAAGCC 5621 |||||||||| |||||||||| |||||||| | |||||| ||| |||||||||| |||| ||||| CTTCTTCTTT ACAAACAAAA CCGGAGCA-C CCAAGGGGAT GCACTTGGTC TAATGAAGCC 119 TTTGTTCAAT AACTCTTGAA GTTGTGCCTT TAACTCTCTT AACTCTGCGG GAGCCATTCT 5561 |||||| TTTGTTG... .......... .......... .......... .......... .......... 126 ATAAGGGGGT ATAGAAATGG GGCGTGTGCC CGGTTCTAGA TCGATACAGA AGTCAATATC 5501 .......... .......... .......... .......... .......... .......... 126 CCTATCTGGT GGCATACCAG GAAGATCTGC AGGGAACACA TCCAGAAACT CACGGACTAC 5441 .......... .......... .......... .......... .......... .......... 126 TGAAACCGAC TCAATCGAAG GCACTTGGGT AGTGTCATCC TTGAGATGTG CCAAGAAAGC 5381 .......... .......... .......... .......... .......... .......... 126 TAAACAACCT TTACTAACCA TTTTCTTAGC ACGAAGAAAG GAGATGATAT GCACCGGATT 5321 .......... .......... .......... .......... .......... .......... 126 GGAAGCGTTG TCACCCTCCC ACACTAACGG ATCTGTCCCA GGCTTGGCTA ACGTCACCGT 5261 .......... .......... .......... .......... .......... .......... 126 TTTAGCATTA CAATCCAAGA TCGCAAATTG CGGAGAAAGC CAAGTCATAC CTAGAATTAC 5201 .......... .......... .......... .......... .......... .......... 126 ATCAAAATCA TCCATTTCTA AGATAACCAA ATCTACATAA GTGTTGCTCC CTACAAAGTT 5141 .......... .......... .......... .......... .......... .......... 126 CACCAAAAAA GACCTATACA CCTTTTCAAC TACCACAGAT TCACCCACCG GAGTAGAAAC 5081 .......... .......... .......... .......... .......... .......... 126 ACGAATAGGC ATATCAAGTA ATTCACAATG TAAATTTAGA CCATTAGCAA ATGAGGAAGA 5021 .......... .......... .......... .......... .......... .......... 126 TACATAAGAA AATGTGGATC CAGGATCAAA CAATACAGAG GCCATGCAAT CACAAACCAG 4961 .......... .......... .......... .......... .......... .......... 126 AAGATTACCT GTGATGACAG CATCAGATGC CTCCGCTTCA GACCGCCCAG GGAAAGCGTA 4901 .......... .......... .......... .......... .......... .......... 126 ACAATGGGCC CTATCGTTCG TCTGTCCGTT GCCCCTAACT TGTTGTGATG TAGTGGCTCC 4841 .......... .......... .......... .......... .......... .......... 126 AGTTTGCCCA TCACCTTGGC CGTTTTGGTT ACCACCATTT CCTTGACCAC CACGTCCTCC 4781 .......... .......... .......... .......... .......... .......... 126 AGAATAACGG CCTCTGCCAT GACCACCTCT ACCTCTAACA TTTGGAGGTC TGTAACTCTG 4721 .......... .......... .......... .......... .......... .......... 126 TTTTGGACAA TATCTCTTAA TATGTTCGAT CTCCCCACAT CCATAACACT TTCTGGGTTC 4661 .......... .......... .......... .......... .......... .......... 126 ATGCATAGGT CTCTCAGAGA AGTGTTGACC GGTCGGAGGT GGACCCCCAA CTACAGTCTG 4601 .......... .......... .......... .......... .......... .......... 126 TAGTGAAGAC TGAATTGGTC GGACTGAGTA ACTTCCCGAA CCCTGTCCTC TAGTGTAAGC 4541 .......... .......... .......... .......... .......... .......... 126 ACCATTAAAC TCACCTCCCT TTCGAAACCT TTTTGATGTC AATGTCGGGG TGAAGTCGTC 4481 .......... .......... .......... .......... .......... .......... 126 TGGCTTCACT CCTTCCACTT CTATCACAAA GTCTACCACC TCTTGGAAGG ATTTTGCCGT 4421 .......... .......... .......... .......... .......... .......... 126 TGCCACTATC TGTAAGGCCG AAATCCGCAA TTCTGACCTC AACCCCTTCA CAAACCGGCG 4361 .......... .......... .......... .......... .......... .......... 126 AATTCGCTCT TGTGGACTGA AACACAGTTG GGTGGCATAC CGGGATAATG CACGAAACTT 4301 .......... .......... .......... .......... .......... .......... 126 AGCCTCATAT GCATTGACCG ACATCCTACC TTGCTCTAGG CTCAAGAACT CATCCCTTTT 4241 .......... .......... .......... .......... .......... .......... 126 CCTATCCCTC AAAGTTCGGG GGATATACTT CTCCATAAAC AAGCTAGAGA ATGAGGCCCA 4181 .......... .......... .......... .......... .......... .......... 126 AGTCATAGGT GGTGCCTCTG TTGGTTGACA CTCAATATGT GACCGCCACC ACATTTTGGC 4121 .......... .......... .......... .......... .......... .......... 126 ATTACCTTGA AACTGATAAC TTACGAACTC AACACCAAAC CGTTCTACTA TACCCATCTT 4061 .......... .......... .......... .......... .......... .......... 126 GTGTAGTAGC TCATGACAGT CAACCAGAAA ATCGTAAGCA TCCTCAGATT CCGCACCCTT 4001 .......... .......... .......... .......... .......... .......... 126 GAATACTGGA GGTTTCAATT TCAAGAACTT ACTGAAAAGT TCATGCTGAT CATTTGTCAT 3941 .......... .......... .......... .......... .......... .......... 126 TATAGGCCCA GTAGTCAGAC GTGGAAACGT GCCTATTTCC AATGGAACAT CCATGCGGGG 3881 .......... .......... .......... .......... .......... .......... 126 AGCCATAGTA GCCGCATGTT GTACCTCCGG AGCCTGAGGT GCTGGTGTAG AAAACACTGG 3821 .......... .......... .......... .......... .......... .......... 126 GGTGTCTGGC CCTGATCATA TAACCCGCTA AGATAAGCCA GAACCTGATT GATCATCTCT 3761 .......... .......... .......... .......... .......... .......... 126 GGGGTAGGTT GGGGTGGCAA TCCCTCATTC TGCACTTGTT CAGTTTCCCC ATCGTCCCCT 3701 .......... .......... .......... .......... .......... .......... 126 TCTCTTATTA CTTCCTCAGT CGGTGGAGGA GTCACTGCCC TAGTATCAGA TGGGCTAGGC 3641 .......... .......... .......... .......... .......... .......... 126 GCTCGTCCTC TTCCCCTAGA GGACGTCCTC CCACTACCTC TACCATGGCC CCTTGCCGCT 3581 .......... .......... .......... .......... .......... .......... 126 GTTCTTCCTC GAGCCACAGC CCCAGTGGTT GGCTCAGTTG TTTCTTGTCT GGCCGGTATT 3521 .......... .......... .......... .......... .......... .......... 126 GGTGTTGGCG TAGTCGTTGC TCTAGTTCTA ACCATCTGTG AAAGAGAGTG AAGATGGTCA 3461 .......... .......... .......... .......... .......... .......... 126 GATACTAATT CGTATCGCCT AGATACCAAT TGGACTCAAG TAGTAGCACG AAAGAAAGAA 3401 .......... .......... .......... .......... .......... .......... 126 TGAGAGAGTG AAATTTTCCT AAAGTCTTAT AGCCTCTCAA GAAAAAGTAA AGGCGTCCCC 3341 .......... .......... .......... .......... .......... .......... 126 CTACCGTTCC TTAAGACTCT ACTAGACCTG TTCTTGTGTG ATGAGACCAA CGAACCTAAT 3281 .......... .......... .......... .......... .......... .......... 126 GCTCTGATAC CAAGTTTGTC ACGACCCAAA TCCAGGCCAC GACTGGCACC CACACTTACC 3221 .......... .......... .......... .......... .......... .......... 126 CTCCTATGTG AGCGAACCAA CCAATCTAAA CCTTAACATT TCAATATAAT ATAACCAGAA 3161 .......... .......... .......... .......... .......... .......... 126 AGTAATGCGG AAGACTTAAA ATCATTAAAT AAAGACCAAT TCATTAACTT CTAAAATTCA 3101 .......... .......... .......... .......... .......... .......... 126 ACATCTATTA TTCCCCCAAA ATCTGGAAGT CATCATCACA AGAACATCTA CGATCAAATG 3041 .......... .......... .......... .......... .......... .......... 126 ACTAAACTAA GAGTATTCTA AAAGCTAAAA ATACATAAGA AGCTAGTCCA TGCCGGAAGT 2981 .......... .......... .......... .......... .......... .......... 126 TCAAGGCATC AAGACTTGAA GAAGAAGACC CAGTCCAAGC TAGAAGCATT AGCTCACCCT 2921 .......... .......... .......... .......... .......... .......... 126 GAATATCCGG TATGACGAAG ACTGGCTAGA ATCACTGCTG AGTTGAAGAT GACGGAACGT 2861 .......... .......... .......... .......... .......... .......... 126 TTGCTGCACT CCACAAATAA CAAGAAGAAA ACATAAAAGT AGGGGTCAGT ACAAAACACG 2801 .......... .......... .......... .......... .......... .......... 126 GGTACTGAGT AGATATCATC GGCCAACTCA AAATAGAAAA CAGTATGTAT TAAGCAATAT 2741 .......... .......... .......... .......... .......... .......... 126 CATAAAATCA ATTAATATCC TTAGCATGCA GCATTTACAG TTACCATAAC CCTTGGTTAC 2681 .......... .......... .......... .......... .......... .......... 126 AACACCAAGC ACATCAATGA GGACTCACAC CTCCTCATCA CACTCATTTG GGAATTTAGT 2621 .......... .......... .......... .......... .......... .......... 126 TCATTAGATT GGATATATTA ACATATTTCA AGATTCATTA TCTTTATTCT CCTCGTGTCG 2561 .......... .......... .......... .......... .......... .......... 126 GTACGTGACA CTCCGCTCCT CAATATACTA TCCTGGTGTC GGAACGTGAC ACTCTGATCC 2501 .......... .......... .......... .......... .......... .......... 126 TCATTCTATC CTGGTGTCGG AACGTGACAC TCCGATCCTC ATATACTATC CTGGTACCGG 2441 .......... .......... .......... .......... .......... .......... 126 AACGTGGCAC CCGATCCATA TTCTATCCTG GTGTCAGAAC GTGACACCCG ATCCATATCC 2381 .......... .......... .......... .......... .......... .......... 126 TATCCTGGTA CCGGAACGTG GCACCCGATC CATATTCTAT CTTGGTGTCG GAACGTGACA 2321 .......... .......... .......... .......... .......... .......... 126 CCCGATCTAT ATTCTATCCT GGTACCGGAA CGTGGCACCC GATCCCCTAA TCTCACCACT 2261 .......... .......... .......... .......... .......... .......... 126 TTCGTTCATC AAGCCTTCTT TTATACCAAG GCATCATCAT TAACAAAGTA GATTAGGGTT 2201 .......... .......... .......... .......... .......... .......... 126 TCTTTTCAAG ATTTGGGATT CAATGGCTTC ATCATGCTTA TTTATTCACA ATTACATAAT 2141 .......... .......... .......... .......... .......... .......... 126 CACATCATTC ATGCAAGCAT ACAATTAAGC ATATAGAATG TTTACAATAC TACTAACACA 2081 .......... .......... .......... .......... .......... .......... 126 TATCATTCGC TATTAAGAGT TTGCTACGAA TAGTATGAAA TAACCATAAC CTACCTCCAC 2021 .......... .......... .......... .......... .......... .......... 126 TGAAGATTAG TGATTAAGCA AGAAATTCCC AAGGCTTTTG TTCCTTCTTC TCGTTCGATC 1961 .......... .......... .......... .......... .......... .......... 126 CTCCCTCAAT TCGTTTCTCT TTCCCTCTCT TTGTTCTTTC TATTTTCTTA TTCCAACCCT 1901 .......... .......... .......... .......... .......... .......... 126 CTTTCTTTTA CCCTAATTAG TATATAATTA AGAATAAAAG ATGGCAATAA TACCCCACTA 1841 .......... .......... .......... .......... .......... .......... 126 ATTAACTTAA GGTTACCTCT TTTAACCCCC AAGGATTTTG AGTTATTAAT ATAAACCCAT 1781 .......... .......... .......... .......... .......... .......... 126 GAAATATATA ATCATAGCAG GAATAGTCCA AAACGCCCCT TTAAAACTTA ACCAGAAATC 1721 .......... .......... .......... .......... .......... .......... 126 TGACTCCAAC TGGGATTGCA CAACCTGTGA CGGGCCGTCG TGCCTGCGAC GGTCCGTCCT 1661 .......... .......... .......... .......... .......... .......... 126 GCAGGTCGTC GCAAAGTTCA GAGACCCAAT ATTTCCACCA AGGGTCTGTG ATGGTCCGTC 1601 .......... .......... .......... .......... .......... .......... 126 ACACCTGTGA CGGTCCGTCC TGCCATTCCG TCACGAAGTT CAGAGAGTTG ATTTTCAGTA 1541 .......... .......... .......... .......... .......... .......... 126 CCCAATTTTA GATTTTCTAA GTGTTTTGAA ACGAGACCCT GCGACGGTCT GTCGTGCCCA 1481 .......... .......... .......... .......... .......... .......... 126 TGACGGTCCG TCGTTGGGTT CGTCGCCTCA GCCTGTTTTT CCAGAAATAA AATCCGCTGC 1421 .......... .......... .......... .......... .......... .......... 126 TCAAAACGAC TAAACAGGTC GTTACAAACT ATCATTATAG TTTTGATACA ATAATGACAT 1361 .......... .......... .......... .......... .......... .......... 126 TAATGTCCTA CTAACAGTAT ACATAATTCC TGAATTCATA ACGGTGATAA AAAAAAATTA 1301 .......... .......... .......... .......... .......... .......... 126 AAGTAATGGA ATAAACAATT TTAAAATCAA AACACTTTCA TAATTTAACA ATATGTGTTG 1241 .......... .......... .......... .......... .......... .......... 126 TTATGTTAAT AAATACTTCA AACATCTTGT TTAACTCCAA AAAAAAAAAT AAAATCAAAC 1181 .......... .......... .......... .......... .......... .......... 126 ATTTAAAACT AACTAGCCTA CATTAACAAT TTCATCTTCA AATGATGGGT TTAATACATT 1121 .......... .......... .......... .......... .......... .......... 126 CTTCATTCTT GGAGGGTCAT CACTGTTGCT AACATATCCA GCATTCATCT TGTCGAGACC 1061 .......... .......... .......... .......... .......... .......... 126 ATACTTCAAT AAAAGTATCG CATATCTTTT GCCAAGATAG CTACTCTCAA AACTATTACA 1001 .......... .......... .......... .......... .......... .......... 126 TGACATATTA ATTTTTTCAC TCAGAAATTC TGCATATACA GCTACAAA 953 ||||||| .......... .......... .......... .......... .CTACAAA 133 hqPGS_C06HBa0153O03.1-2-_SGN-E578271+ (5740 5614) ******************************************************************************** EST sequence 90 +strand 596 n (File: SGN-E375319+) 1 GAATCCCTTG ACAAATCGGC GGTAGTAGCT AGCTAACCCA ACAAAGCTCC TTATTTCTGA 61 CACATTAGTA GGTCTTACCC AATTCTTCAC TGTCTCAATC TTAGAAGGAT CCACCATCAC 121 TCCATCCTTA GAAACCACGT TCCCCAAGAA GGACACTACA TCTAGCCAAA ACTCACACTT 181 AGAGAACTTG GCATAAAACT TTTTCTCCCT CAACATTTCC AATACCATTC TCAAATGCTC 241 TTCATGTTCC TCCTTACTTT TAGAGTATAT CAGTATATCA TCAATAAATA CGATCACAAA 301 GAGATCCAAA TATGGCTTAA AAATCCCTTT TATTAAACTC ATAAACGCAG CAGGGGCATT 361 CATAAGACCA AAAGACATCA CTACAAATTT GTAATGCCCA TACCTGGTTC TAAAAGCAGT 421 CTTTGGCACA TCCGTTGCCC GTATTTTCAA TTGATGATAA CCGGATCTCA AGTCAATCTT 481 AGAAAAGACA CAAGCACCTT GTAACTGATC GAACAAGTCA TTAATGCGAG GAAGAGGATA 541 CTTGTTCTTT ATGGTTACCT TGTTCAACTG CCGGTAGCCT ATGCACATCC GAAAAC Predicted gene structure (within gDNA segment 6883 to 4790): Exon 1 6283 5690 ( 594 n); cDNA 1 594 ( 594 n); score: 0.944 MATCH C06HBa0153O03.1-2- SGN-E375319+ 0.944 594 0.997 C PGS_C06HBa0153O03.1-2-_SGN-E375319+ (6283 5690) Alignment (genomic DNA sequence = upper lines): GAATCCCTTG ACAAATCGAC GGTAGTAGCT AGCTAACCCA ACAAAGCTCC TTATTTCTGA 6224 |||||||||| |||||||| | |||||||||| |||||||||| |||||||||| |||||||||| GAATCCCTTG ACAAATCGGC GGTAGTAGCT AGCTAACCCA ACAAAGCTCC TTATTTCTGA 60 CACATTAGTA GGTCTTACCC AATTCTTCAC TGTCTCAATC TTAGAAGGAT CCACCATCAC 6164 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CACATTAGTA GGTCTTACCC AATTCTTCAC TGTCTCAATC TTAGAAGGAT CCACCATCAC 120 TCCATCCTTA GAAACCACGT GCCCCAAGAA GGACACTGCA TCTAGCCAAA ACTCACACTT 6104 |||||||||| |||||||||| ||||||||| ||||||| || |||||||||| |||||||||| TCCATCCTTA GAAACCACGT TCCCCAAGAA GGACACTACA TCTAGCCAAA ACTCACACTT 180 AGAGAATTTG GCATAAAGCT TTTTCTCCCT CAACATTTCC AATACCATTC TCAAATGCTC 6044 |||||| ||| ||||||| || |||||||||| |||||||||| |||||||||| |||||||||| AGAGAACTTG GCATAAAACT TTTTCTCCCT CAACATTTCC AATACCATTC TCAAATGCTC 240 TTCATGTTCC TTCTTGCTCT TTGAGTATAC CAATATATCA TCAATAAATA CGATCACGAA 5984 |||||||||| | ||| || | | ||||||| || ||||||| |||||||||| ||||||| || TTCATGTTCC TCCTTACTTT TAGAGTATAT CAGTATATCA TCAATAAATA CGATCACAAA 300 GAGGTCCAAA TATGGCTTAA AAATCCCGTT CATCAAGCTC ATGAACGCAA CAGGGGCGTT 5924 ||| |||||| |||||||||| ||||||| || || || ||| || |||||| ||||||| || GAGATCCAAA TATGGCTTAA AAATCCCTTT TATTAAACTC ATAAACGCAG CAGGGGCATT 360 CATAAGACCA AAAGACATCA CTACAAATTT GTAATGCCCA TACCTCGTTC GAAAAGCAGT 5864 |||||||||| |||||||||| |||||||||| |||||||||| ||||| |||| ||||||||| CATAAGACCA AAAGACATCA CTACAAATTT GTAATGCCCA TACCTGGTTC TAAAAGCAGT 420 CTTTGGCACA TCCGTTGCCC GTATTTTCAA TTGATGATAA CCGGATCTCA AGTCAATCTT 5804 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTTGGCACA TCCGTTGCCC GTATTTTCAA TTGATGATAA CCGGATCTCA AGTCAATCTT 480 AGAGAAGACA CAAGCACCTT GTAACTGATC GAACAAGTCA TCAATGCGGG GAAGAGGATA 5744 ||| |||||| |||||||||| |||||||||| |||||||||| | |||||| | |||||||||| AGAAAAGACA CAAGCACCTT GTAACTGATC GAACAAGTCA TTAATGCGAG GAAGAGGATA 540 CTTGTTCTTT ATGGTTACCT TGTTTAGTTG TCTGTAGTCT ATACACATTC GAAA 5690 |||||||||| |||||||||| |||| | || | |||| || || ||||| | |||| CTTGTTCTTT ATGGTTACCT TGTTCAACTG CCGGTAGCCT ATGCACATCC GAAA 594 hqPGS_C06HBa0153O03.1-2-_SGN-E375319+ (6283 5690) ******************************************************************************** EST sequence 75 +strand 526 n (File: SGN-E204434+) 1 GAATCCCTTG ACAAATCGGC GGTAGTAGCT AGCTAACCCA ACAAAGCTCC TTATTTCTGA 61 CACATTAGTA GGTCTTACCC AATTCTTCAC TGTCTCAATC TTAGAAGGAT CCACCATCAC 121 TCCATCCTTA GAAACCACGT TCCCCAAGAA GGACACTACA TCTAGCCAAA ACTCACACTT 181 AGAGAACTTG GCATAAAACT TTTTCTCCCT CAACATTTCC AATACCATTC TCAAATGCTC 241 TTCATGTTCC TCCTTACTTT TAGAGTATAT CAGTATATCA TCAATAAATA CGATCACAAA 301 GAGATCCAAA TATGGCTTAA AAATCCCTTT TATTAAACTC ATAAACGCAG CAGGGGCATT 361 CATAAGACCA AAAGACATCA CTACAAATTT GTAATGCCCA TACCTGGTTC TAAAAGCAGT 421 CTTTGGCACA TCCGTTGCCC GTATTTTCAA TTGATGATAA CCGGATCTCA AGTCAATCTT 481 AGAAAAGACA CAAGCACCTT GTAACTGATC GAACAAGTCA TTAATG Predicted gene structure (within gDNA segment 6883 to 5103): Exon 1 6283 5758 ( 526 n); cDNA 1 526 ( 526 n); score: 0.954 MATCH C06HBa0153O03.1-2- SGN-E204434+ 0.954 526 1.000 C PGS_C06HBa0153O03.1-2-_SGN-E204434+ (6283 5758) Alignment (genomic DNA sequence = upper lines): GAATCCCTTG ACAAATCGAC GGTAGTAGCT AGCTAACCCA ACAAAGCTCC TTATTTCTGA 6224 |||||||||| |||||||| | |||||||||| |||||||||| |||||||||| |||||||||| GAATCCCTTG ACAAATCGGC GGTAGTAGCT AGCTAACCCA ACAAAGCTCC TTATTTCTGA 60 CACATTAGTA GGTCTTACCC AATTCTTCAC TGTCTCAATC TTAGAAGGAT CCACCATCAC 6164 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CACATTAGTA GGTCTTACCC AATTCTTCAC TGTCTCAATC TTAGAAGGAT CCACCATCAC 120 TCCATCCTTA GAAACCACGT GCCCCAAGAA GGACACTGCA TCTAGCCAAA ACTCACACTT 6104 |||||||||| |||||||||| ||||||||| ||||||| || |||||||||| |||||||||| TCCATCCTTA GAAACCACGT TCCCCAAGAA GGACACTACA TCTAGCCAAA ACTCACACTT 180 AGAGAATTTG GCATAAAGCT TTTTCTCCCT CAACATTTCC AATACCATTC TCAAATGCTC 6044 |||||| ||| ||||||| || |||||||||| |||||||||| |||||||||| |||||||||| AGAGAACTTG GCATAAAACT TTTTCTCCCT CAACATTTCC AATACCATTC TCAAATGCTC 240 TTCATGTTCC TTCTTGCTCT TTGAGTATAC CAATATATCA TCAATAAATA CGATCACGAA 5984 |||||||||| | ||| || | | ||||||| || ||||||| |||||||||| ||||||| || TTCATGTTCC TCCTTACTTT TAGAGTATAT CAGTATATCA TCAATAAATA CGATCACAAA 300 GAGGTCCAAA TATGGCTTAA AAATCCCGTT CATCAAGCTC ATGAACGCAA CAGGGGCGTT 5924 ||| |||||| |||||||||| ||||||| || || || ||| || |||||| ||||||| || GAGATCCAAA TATGGCTTAA AAATCCCTTT TATTAAACTC ATAAACGCAG CAGGGGCATT 360 CATAAGACCA AAAGACATCA CTACAAATTT GTAATGCCCA TACCTCGTTC GAAAAGCAGT 5864 |||||||||| |||||||||| |||||||||| |||||||||| ||||| |||| ||||||||| CATAAGACCA AAAGACATCA CTACAAATTT GTAATGCCCA TACCTGGTTC TAAAAGCAGT 420 CTTTGGCACA TCCGTTGCCC GTATTTTCAA TTGATGATAA CCGGATCTCA AGTCAATCTT 5804 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTTGGCACA TCCGTTGCCC GTATTTTCAA TTGATGATAA CCGGATCTCA AGTCAATCTT 480 AGAGAAGACA CAAGCACCTT GTAACTGATC GAACAAGTCA TCAATG 5758 ||| |||||| |||||||||| |||||||||| |||||||||| | |||| AGAAAAGACA CAAGCACCTT GTAACTGATC GAACAAGTCA TTAATG 526 hqPGS_C06HBa0153O03.1-2-_SGN-E204434+ (6283 5758) ******************************************************************************** EST sequence 134 +strand 358 n (File: SGN-E240817+) 1 GTCTCACCCA TTTCTTCACT GTCTCAATCT TACCAGGATC CACCATCACT CCATCCTTAG 61 AAACCACGTG CCCCAAGAAG GACACTGCAT CTATCCAAAA CTCACACTTA GAGAACTTGG 121 CATAAAGCTT TTTCTCCCTC AACATTTCTA ATACAATTCT CAAATGCTCC TCATGTTCCT 181 TCTTACTATT AGAGGATATC AGTATATCAT CAATGAACAT GATGACGCAG AGATCCAAAT 241 ATGGCTTAAA ATCCCGTTCA TCAAGCTCAT GAACGCAGCA GGGGCATTCG TAAGACCAAA 301 AGACATCACT ACAAATTCGT AATGCCCATA CCTGGTTCTA AAAGCAGTCT TTGGCACA Predicted gene structure (within gDNA segment 6911 to 4875): Exon 1 6212 5854 ( 359 n); cDNA 1 358 ( 358 n); score: 0.922 MATCH C06HBa0153O03.1-2- SGN-E240817+ 0.922 359 1.003 C PGS_C06HBa0153O03.1-2-_SGN-E240817+ (6212 5854) Alignment (genomic DNA sequence = upper lines): GTCTTACCCA ATTCTTCACT GTCTCAATCT TAGAAGGATC CACCATCACT CCATCCTTAG 6153 |||| ||||| ||||||||| |||||||||| || |||||| |||||||||| |||||||||| GTCTCACCCA TTTCTTCACT GTCTCAATCT TACCAGGATC CACCATCACT CCATCCTTAG 60 AAACCACGTG CCCCAAGAAG GACACTGCAT CTAGCCAAAA CTCACACTTA GAGAATTTGG 6093 |||||||||| |||||||||| |||||||||| ||| |||||| |||||||||| ||||| |||| AAACCACGTG CCCCAAGAAG GACACTGCAT CTATCCAAAA CTCACACTTA GAGAACTTGG 120 CATAAAGCTT TTTCTCCCTC AACATTTCCA ATACCATTCT CAAATGCTCT TCATGTTCCT 6033 |||||||||| |||||||||| |||||||| | |||| ||||| ||||||||| |||||||||| CATAAAGCTT TTTCTCCCTC AACATTTCTA ATACAATTCT CAAATGCTCC TCATGTTCCT 180 TCTTGCTCTT TGAGTATACC AATATATCAT CAATAAATAC GATCACGAAG AGGTCCAAAT 5973 |||| || || ||| ||| | | |||||||| |||| || | ||| ||| || || ||||||| TCTTACTATT AGAGGATATC AGTATATCAT CAATGAACAT GATGACGCAG AGATCCAAAT 240 ATGGCTTAAA AATCCCGTTC ATCAAGCTCA TGAACGCAAC AGGGGCGTTC ATAAGACCAA 5913 ||||||| || |||||||||| |||||||||| |||||||| | |||||| ||| ||||||||| ATGGCTT-AA AATCCCGTTC ATCAAGCTCA TGAACGCAGC AGGGGCATTC GTAAGACCAA 299 AAGACATCAC TACAAATTTG TAATGCCCAT ACCTCGTTCG AAAAGCAGTC TTTGGCACA 5854 |||||||||| |||||||| | |||||||||| |||| |||| |||||||||| ||||||||| AAGACATCAC TACAAATTCG TAATGCCCAT ACCTGGTTCT AAAAGCAGTC TTTGGCACA 358 hqPGS_C06HBa0153O03.1-2-_SGN-E240817+ (6212 5854) ******************************************************************************** EST sequence 106 +strand 587 n (File: SGN-E352950+) 1 CCTCTGTAAG AATAGTGTGA ATCAAATCGT CGGCACGGGG TACACATACC TTTCCCTTAA 61 TCCTCAAAAC ACCTTCCTTA TCGATTTTTG CTTCTTTAGC CTCTCCTCGC ATTACCTTAT 121 CTCGAATCCA GATCAGTTTC TCATCGGTAA ACTGCTTTCC TTTAATCTTA TCAAGAAAGG 181 AAGATCTTGC CTCCACACAA GCCAAAAATT CTCCCTTCTC ATTTAATTCT AGCCTCATAA 241 GGTCATTAGC CAGAGTCTGA ACCTCTCTAG CCAATGGGCA TCTAGAAACC TGCAAGTGAG 301 CTAGACTTCC CATGCTCCCT GCCTTTCTAC TTAAAGCATC TGCCACAACA TTAGCCTTTC 361 CCGGATGATA CAAGATAGTG ATATCATAGT CCTTCAGTAG CTCCATCGAT CTCCTCTGTC 421 TCAAATTCAA ATCTTTCTGA GTAAAGACAT ACTGTAAACT ACGATGATCC ATATAGACTT 481 CACACTTAAC CCCATAGAGA TAATGTCTCT ATTACTTTAA TGCAATCACT ACTGCCGCTA 541 ACTCCAAATC ATGGGTCGGA TAATTACGTT CATGCACCTT TAATTGC Predicted gene structure (within gDNA segment 8003 to 4722): Exon 1 7095 6511 ( 585 n); cDNA 1 587 ( 587 n); score: 0.865 MATCH C06HBa0153O03.1-2- SGN-E352950+ 0.865 585 0.997 C PGS_C06HBa0153O03.1-2-_SGN-E352950+ (7095 6511) Alignment (genomic DNA sequence = upper lines): CCTCTGTCAG AATAGTGTTG ATTAAATCAT -TG-ACGGGG TACACATACC CTTCCCTTGA 7038 ||||||| || |||||||| || ||||| | | |||||| |||||||||| ||||||| | CCTCTGTAAG AATAGTGTGA ATCAAATCGT CGGCACGGGG TACACATACC TTTCCCTTAA 60 TTCTCAAAAC ACCTTCCTCA TCGATTTGTG CTTCCTTAGC CTCTCCTCGC AATACCTTAT 6978 | |||||||| |||||||| | ||||||| || |||| ||||| |||||||||| | |||||||| TCCTCAAAAC ACCTTCCTTA TCGATTTTTG CTTCTTTAGC CTCTCCTCGC ATTACCTTAT 120 CTTGGATTCT TCTTAGTTTC TCATCATCAA ACTGTTTTCC CTTAATTTTG TCAAGAAAAG 6918 || | || | | |||||| ||||| || |||| ||||| ||||| || |||||||| | CTCGAATCCA GATCAGTTTC TCATCGGTAA ACTGCTTTCC TTTAATCTTA TCAAGAAAGG 180 AAGATCTTGA CTCCACACTA GCCAACAATC CTCCCTTCTC ATTTACTTCT AATATCATCA 6858 ||||||||| |||||||| | ||||| ||| |||||||||| ||||| |||| | |||| | AAGATCTTGC CTCCACACAA GCCAAAAATT CTCCCTTCTC ATTTAATTCT AGCCTCATAA 240 AGTCATTAGC TAGAGTCTGA ACCTCTCTAG CCAATGGGCG TCTAGAAGCT TGCAAGTGAG 6798 ||||||||| ||||||||| |||||||||| ||||||||| ||||||| | |||||||||| GGTCATTAGC CAGAGTCTGA ACCTCTCTAG CCAATGGGCA TCTAGAAACC TGCAAGTGAG 300 CTAGACTTCC CATGCTTCCC ACCTTTCTAC TTAAAGCATC CGCTACAACA TTAGCCTTCC 6738 |||||||||| |||||| || ||||||||| |||||||||| || |||||| |||||||| | CTAGACTTCC CATGCTCCCT GCCTTTCTAC TTAAAGCATC TGCCACAACA TTAGCCTTTC 360 CCGGATGATA CAAAATAGTG ATATCGTAGT CCTTTAGTAA CTCCATCCAT CTCCTCTGTC 6678 |||||||||| ||| |||||| ||||| |||| |||| |||| ||||||| || |||||||||| CCGGATGATA CAAGATAGTG ATATCATAGT CCTTCAGTAG CTCCATCGAT CTCCTCTGTC 420 TTAAGTTCAA ATCTTTCTGA GTAAAGACAT ACTGTAGGCT ACGATGATCC GTATAGATCT 6618 | || ||||| |||||||||| |||||||||| |||||| || |||||||||| |||||| | TCAAATTCAA ATCTTTCTGA GTAAAGACAT ACTGTAAACT ACGATGATCC ATATAGACTT 480 CACACTTAAC CCCATATAAA TAGTGTCTCC ATTGCTTTAA TGCAAACACC ACCGCAGCCA 6558 |||||||||| |||||| | | || |||||| ||| |||||| ||||| ||| || || || | CACACTTAAC CCCATAGAGA TAATGTCTCT ATTACTTTAA TGCAATCACT ACTGCCGCTA 540 ATTCCAAATC GTGGGTCGGA TAGTTACGTT CATGCACCTT TAGTTGC 6511 | |||||||| ||||||||| || ||||||| |||||||||| || |||| ACTCCAAATC ATGGGTCGGA TAATTACGTT CATGCACCTT TAATTGC 587 hqPGS_C06HBa0153O03.1-2-_SGN-E352950+ (7095 6511) ******************************************************************************** EST sequence 130 +strand 587 n (File: SGN-E357100+) 1 CCTCTGTAAG AATAGTGTGA ATCAAATCGT CGGCACGGGG TACACATACC TTTCCCTTAA 61 TCCTCAAAAC ACCTTCCTTA TCGATTTTTG CTTCTTTAGC CTCTCCTCGC ATTACCTTAT 121 CTCGAATCCA GATCAGTTTC TCATCGGTAA ACTGCTTTCC TTTAATCTTA TCAAGAAAGG 181 AAGATCTTGC CTCCACACAA GCCAAAAATT CTCCCTTCTC ATTTAATTCT AGCCTCATAA 241 GGTCATTAGC CAGAGTCTGA ACCTCTCTAG CCAATGGGCA TCTAGAAACC TGCAAGTGAG 301 CTAGACTTCC CATGCTCCCT GCCTTTCTAC TTAAAGCATC TGCCACAACA TTAGCCTTTC 361 CCGGATGATA CAAGATAGTG ATATCATAGT CCTTCAGTAG CTCCATCGAT CTCCTCTGTC 421 TCAAATTCAA ATCTTTCTGA GTAAAGACAT ACTGTAAACT ACGATGATCC ATATAGACTT 481 CACACTTAAC CCCATAGAGA TAATGTCTCT ATTACTTTAA TGCAATCACT ACTGCCGCTA 541 ACTCCAAATC ATGGGTCGGA TAATTACGTT CATGCACCTT TAATTGC Predicted gene structure (within gDNA segment 8003 to 4722): Exon 1 7095 6511 ( 585 n); cDNA 1 587 ( 587 n); score: 0.865 MATCH C06HBa0153O03.1-2- SGN-E357100+ 0.865 585 0.997 C PGS_C06HBa0153O03.1-2-_SGN-E357100+ (7095 6511) Alignment (genomic DNA sequence = upper lines): CCTCTGTCAG AATAGTGTTG ATTAAATCAT -TG-ACGGGG TACACATACC CTTCCCTTGA 7038 ||||||| || |||||||| || ||||| | | |||||| |||||||||| ||||||| | CCTCTGTAAG AATAGTGTGA ATCAAATCGT CGGCACGGGG TACACATACC TTTCCCTTAA 60 TTCTCAAAAC ACCTTCCTCA TCGATTTGTG CTTCCTTAGC CTCTCCTCGC AATACCTTAT 6978 | |||||||| |||||||| | ||||||| || |||| ||||| |||||||||| | |||||||| TCCTCAAAAC ACCTTCCTTA TCGATTTTTG CTTCTTTAGC CTCTCCTCGC ATTACCTTAT 120 CTTGGATTCT TCTTAGTTTC TCATCATCAA ACTGTTTTCC CTTAATTTTG TCAAGAAAAG 6918 || | || | | |||||| ||||| || |||| ||||| ||||| || |||||||| | CTCGAATCCA GATCAGTTTC TCATCGGTAA ACTGCTTTCC TTTAATCTTA TCAAGAAAGG 180 AAGATCTTGA CTCCACACTA GCCAACAATC CTCCCTTCTC ATTTACTTCT AATATCATCA 6858 ||||||||| |||||||| | ||||| ||| |||||||||| ||||| |||| | |||| | AAGATCTTGC CTCCACACAA GCCAAAAATT CTCCCTTCTC ATTTAATTCT AGCCTCATAA 240 AGTCATTAGC TAGAGTCTGA ACCTCTCTAG CCAATGGGCG TCTAGAAGCT TGCAAGTGAG 6798 ||||||||| ||||||||| |||||||||| ||||||||| ||||||| | |||||||||| GGTCATTAGC CAGAGTCTGA ACCTCTCTAG CCAATGGGCA TCTAGAAACC TGCAAGTGAG 300 CTAGACTTCC CATGCTTCCC ACCTTTCTAC TTAAAGCATC CGCTACAACA TTAGCCTTCC 6738 |||||||||| |||||| || ||||||||| |||||||||| || |||||| |||||||| | CTAGACTTCC CATGCTCCCT GCCTTTCTAC TTAAAGCATC TGCCACAACA TTAGCCTTTC 360 CCGGATGATA CAAAATAGTG ATATCGTAGT CCTTTAGTAA CTCCATCCAT CTCCTCTGTC 6678 |||||||||| ||| |||||| ||||| |||| |||| |||| ||||||| || |||||||||| CCGGATGATA CAAGATAGTG ATATCATAGT CCTTCAGTAG CTCCATCGAT CTCCTCTGTC 420 TTAAGTTCAA ATCTTTCTGA GTAAAGACAT ACTGTAGGCT ACGATGATCC GTATAGATCT 6618 | || ||||| |||||||||| |||||||||| |||||| || |||||||||| |||||| | TCAAATTCAA ATCTTTCTGA GTAAAGACAT ACTGTAAACT ACGATGATCC ATATAGACTT 480 CACACTTAAC CCCATATAAA TAGTGTCTCC ATTGCTTTAA TGCAAACACC ACCGCAGCCA 6558 |||||||||| |||||| | | || |||||| ||| |||||| ||||| ||| || || || | CACACTTAAC CCCATAGAGA TAATGTCTCT ATTACTTTAA TGCAATCACT ACTGCCGCTA 540 ATTCCAAATC GTGGGTCGGA TAGTTACGTT CATGCACCTT TAGTTGC 6511 | |||||||| ||||||||| || ||||||| |||||||||| || |||| ACTCCAAATC ATGGGTCGGA TAATTACGTT CATGCACCTT TAATTGC 587 hqPGS_C06HBa0153O03.1-2-_SGN-E357100+ (7095 6511) ******************************************************************************** EST sequence 93 +strand 554 n (File: SGN-E352647+) 1 CCTCTGTAAG AATAGTGTGA ATCAAATCGT CGGCACGGGG TACACATACC TTTCCCTTAA 61 TCCTCAAAAC ACCTTCCTTA TCGATTTTTG CTTCTTTAGC CTCTCCTCGC ATTACCTTAT 121 CTCGAATCCA GATCAGTTTC TCATCGGTAA ACTGCTTTCC TTTAATCTTA TCAAGAAAGG 181 AAGATCTTGC CTCCACACAA GCCAAAAATT CTCCCTTCTC ATTTAATTCT AGCCTCATAA 241 GGTCATTAGC CAGAGTCTGA ACCTCTCTAG CCAATGGGCA TCTAGAAACC TGCAAGTGAG 301 CTAGACTTCC CATGCTCCCT GCCTTTCTAC TTAAAGCATC TGCCACAACA TTAGCCTTTC 361 CCGGATGATA CAAGATAGTG ATATCATAGT CCTTCAGTAG CTCCATCGAT CTCCTCTGTC 421 TCAAATTCAA ATCTTTCTGA GTAAAGACAT ACTGTAAACT ACGATGATCC ATATAGACTT 481 CACACTTAAA CCCATAGAGA TAATGGCTCT ATTACTTTAA TGCAATCACT ACTGCCGCTA 541 ACTCCAAATC ATGG Predicted gene structure (within gDNA segment 8003 to 5052): Exon 1 7095 6544 ( 552 n); cDNA 1 554 ( 554 n); score: 0.857 MATCH C06HBa0153O03.1-2- SGN-E352647+ 0.857 552 0.996 C PGS_C06HBa0153O03.1-2-_SGN-E352647+ (7095 6544) Alignment (genomic DNA sequence = upper lines): CCTCTGTCAG AATAGTGTTG ATTAAATCAT -TG-ACGGGG TACACATACC CTTCCCTTGA 7038 ||||||| || |||||||| || ||||| | | |||||| |||||||||| ||||||| | CCTCTGTAAG AATAGTGTGA ATCAAATCGT CGGCACGGGG TACACATACC TTTCCCTTAA 60 TTCTCAAAAC ACCTTCCTCA TCGATTTGTG CTTCCTTAGC CTCTCCTCGC AATACCTTAT 6978 | |||||||| |||||||| | ||||||| || |||| ||||| |||||||||| | |||||||| TCCTCAAAAC ACCTTCCTTA TCGATTTTTG CTTCTTTAGC CTCTCCTCGC ATTACCTTAT 120 CTTGGATTCT TCTTAGTTTC TCATCATCAA ACTGTTTTCC CTTAATTTTG TCAAGAAAAG 6918 || | || | | |||||| ||||| || |||| ||||| ||||| || |||||||| | CTCGAATCCA GATCAGTTTC TCATCGGTAA ACTGCTTTCC TTTAATCTTA TCAAGAAAGG 180 AAGATCTTGA CTCCACACTA GCCAACAATC CTCCCTTCTC ATTTACTTCT AATATCATCA 6858 ||||||||| |||||||| | ||||| ||| |||||||||| ||||| |||| | |||| | AAGATCTTGC CTCCACACAA GCCAAAAATT CTCCCTTCTC ATTTAATTCT AGCCTCATAA 240 AGTCATTAGC TAGAGTCTGA ACCTCTCTAG CCAATGGGCG TCTAGAAGCT TGCAAGTGAG 6798 ||||||||| ||||||||| |||||||||| ||||||||| ||||||| | |||||||||| GGTCATTAGC CAGAGTCTGA ACCTCTCTAG CCAATGGGCA TCTAGAAACC TGCAAGTGAG 300 CTAGACTTCC CATGCTTCCC ACCTTTCTAC TTAAAGCATC CGCTACAACA TTAGCCTTCC 6738 |||||||||| |||||| || ||||||||| |||||||||| || |||||| |||||||| | CTAGACTTCC CATGCTCCCT GCCTTTCTAC TTAAAGCATC TGCCACAACA TTAGCCTTTC 360 CCGGATGATA CAAAATAGTG ATATCGTAGT CCTTTAGTAA CTCCATCCAT CTCCTCTGTC 6678 |||||||||| ||| |||||| ||||| |||| |||| |||| ||||||| || |||||||||| CCGGATGATA CAAGATAGTG ATATCATAGT CCTTCAGTAG CTCCATCGAT CTCCTCTGTC 420 TTAAGTTCAA ATCTTTCTGA GTAAAGACAT ACTGTAGGCT ACGATGATCC GTATAGATCT 6618 | || ||||| |||||||||| |||||||||| |||||| || |||||||||| |||||| | TCAAATTCAA ATCTTTCTGA GTAAAGACAT ACTGTAAACT ACGATGATCC ATATAGACTT 480 CACACTTAAC CCCATATAAA TAGTGTCTCC ATTGCTTTAA TGCAAACACC ACCGCAGCCA 6558 ||||||||| |||||| | | || || ||| ||| |||||| ||||| ||| || || || | CACACTTAAA CCCATAGAGA TAATGGCTCT ATTACTTTAA TGCAATCACT ACTGCCGCTA 540 ATTCCAAATC GTGG 6544 | |||||||| ||| ACTCCAAATC ATGG 554 hqPGS_C06HBa0153O03.1-2-_SGN-E352647+ (7095 6544) ******************************************************************************** EST sequence 105 +strand 542 n (File: SGN-E353207+) 1 TCGGCATGTA CTGTTTTCCA AAACTTAGAA GTAAATTGCG TACCTCTATC TGATATGATG 61 GAGAGTGGAA CTCCGTGCAA TCGAACAATT TCTAAGATGT AAAGTTTGGC TAACTTCTCT 121 GCATTGTAAG TCACCTTAAC CGAAATAGAG TGAGCAGACT TAGTTAATCT GTCAACAATC 181 ACCCAAATAG AGTCATACCT ACCCATCGTC TTTGGATGAC TAACCACGAA GTCCATTGCA 241 ATTCTCTCCC ACTTCCATTC AGGAATGGGC ATTCTCTGAA GTGTCCCTCC AGGCCTCTGG 301 TGTTCATACT TTACTTGCTG ACAATTCGGG CATTGAGCAA CAAAATTAAC AATGTCACGC 361 TTCATCCTAC TCCACCAAAA ATGTTGCTTT AGGTCACGAT ACATCTTGTT TGCACCAGGA 421 TGTATAGAGT ACTTCGAACT ATGAGCCTAT ATAAGAATAG TGTGAATCAA ATCATCGACG 481 CGGGGTACAC ATACTCCTTC CCTTATTCTC AAAACACCTT CCTCATCGAT TTTTGCTTCT 541 TT Predicted gene structure (within gDNA segment 8239 to 4364): Exon 1 7540 7001 ( 540 n); cDNA 1 542 ( 542 n); score: 0.868 MATCH C06HBa0153O03.1-2- SGN-E353207+ 0.868 540 0.996 C PGS_C06HBa0153O03.1-2-_SGN-E353207+ (7540 7001) Alignment (genomic DNA sequence = upper lines): TCTGCATGCA ATGTTTTCCA AAACTTAGAA GTAAACTGCG TACCCCTATC TGATATGATG 7481 || ||||| | ||||||||| |||||||||| ||||| |||| |||| ||||| |||||||||| TCGGCATGTA CTGTTTTCCA AAACTTAGAA GTAAATTGCG TACCTCTATC TGATATGATG 60 GATAGTGGAA CCCCATGCAA TCGAACGATT TCTGAGATAT AGATCTTGGC TAACTTCTCT 7421 || ||||||| | || ||||| |||||| ||| ||| |||| | | | ||||| |||||||||| GAGAGTGGAA CTCCGTGCAA TCGAACAATT TCTAAGATGT AAAGTTTGGC TAACTTCTCT 120 GCATTGTAAG TCACCTTTAC CGGAATGAAA TGAGCAGATT TAGTTAACCT ATCAACAATC 7361 |||||||||| ||||||| || || ||| | |||||||| | ||||||| || ||||||||| GCATTGTAAG TCACCTTAAC CGAAATAGAG TGAGCAGACT TAGTTAATCT GTCAACAATC 180 ACCCAAATGG AGTCATACTT ACCCATTGTC CTTGGAAGAC CAACCACAAA GTCCATTGCA 7301 |||||||| | |||||||| | |||||| ||| ||||| ||| |||||| || |||||||||| ACCCAAATAG AGTCATACCT ACCCATCGTC TTTGGATGAC TAACCACGAA GTCCATTGCA 240 ATTCTTTCCC ACTTCCATTC CGGAATGGGC ATTCTCTGAA GTGTTCCTCC GGGCCTTTGG 7241 ||||| |||| |||||||||| ||||||||| |||||||||| |||| ||||| ||||| ||| ATTCTCTCCC ACTTCCATTC AGGAATGGGC ATTCTCTGAA GTGTCCCTCC AGGCCTCTGG 300 TGTTCATACT TTACTTGTTG ACAGTTTGGA CACTTGGCAA TAAAGTCAAC AATATCACGC 7181 |||||||||| ||||||| || ||| || || || | |||| ||| | ||| ||| |||||| TGTTCATACT TTACTTGCTG ACAATTCGGG CATTGAGCAA CAAAATTAAC AATGTCACGC 360 TTCATTCTAC TCCACCAAAA GTGTTGTTTT AGGTCACGAT ACATCTTGGT TGCACTTGGA 7121 ||||| |||| |||||||||| ||||| ||| |||||||||| |||||||| | ||||| ||| TTCATCCTAC TCCACCAAAA ATGTTGCTTT AGGTCACGAT ACATCTTGTT TGCACCAGGA 420 TGTATAGAAT ACCTTGAACT ATGAGCCTCT GTCAGAATAG TGTTGATTAA ATCATTGA-- 7063 |||||||| | || | ||||| |||||||| | | ||||||| ||| || || ||||| || TGTATAGAGT ACTTCGAACT ATGAGCCTAT ATAAGAATAG TGTGAATCAA ATCATCGACG 480 CGGGGTACAC ATAC-CCTTC CCTTGATTCT CAAAACACCT TCCTCATCGA TTTGTGCTTC 7004 |||||||||| |||| ||||| |||| ||||| |||||||||| |||||||||| ||| |||||| CGGGGTACAC ATACTCCTTC CCTT-ATTCT CAAAACACCT TCCTCATCGA TTTTTGCTTC 539 CTT 7001 || TTT 542 hqPGS_C06HBa0153O03.1-2-_SGN-E353207+ (7540 7001) ******************************************************************************** EST sequence 48 +strand 654 n (File: SGN-E578131+) 1 GTAAACTGCA TACCTCTATC TGATATGATG GAGAGTGGAA CTCCATGCAA TCGAACGATT 61 TCTGAGATAT AGATTCTGAC TAACTTCTCT ACATTGTAGG TCACCTTGAC CGGAATGCAA 121 TGAGCAGACT TAGTCAATCT GTCAACAATT ACACAAATGG AGTCATACTT ACCCATTGTC 181 TTTGGAAGGC CAACCACGAA GTCCATTGCA ATTCTCTCCC ACTTCCATTC AGGAATGGGC 241 ATTCTCTGAA GTGTCCGTCC AGGCCTTTGC TGTTCATACT TTACATGCTG ACCATTCGGT 301 TAGAAATGCT AATGGACGCT ACTTTCCCAC TAGTTCAAAA TTTGTGAAGC CACGGGCTTG 361 CTTGAAGATT GGTCTTGGTC TCTTAACAAA ACAGATCCGC TGATATGTTT CTAATTATGC 421 AGAGGATTTT GCAAGAAAGA GTTGTTAGTA GCCGGGAAGC TGAACTCAAT AGATTGAAGC 481 AGGAGAGACG AGAAAGGATC AGCCAGATAA TTCAATCAAG GAAGCAGGAG AGGGAAGCTA 541 AGAGGAAAAT GTTATTCTTT CTGCGAACTG AGGAGGAGCG TCAAAAGAGG TTACTGGAAG 601 AGGAGGAAGC CCGCAAACGT GAAGGTTCTC CTGTGGGAAG CCGGACTTGC TAAG Predicted gene structure (within gDNA segment 8245 to 2647): Exon 1 7510 7211 ( 300 n); cDNA 1 300 ( 300 n); score: 0.893 Intron 1 7210 5088 (2123 n); Pd: 0.000 (s: 0.80), Pa: 0.000 (s: 0) Exon 2 5087 5075 ( 13 n); cDNA 301 313 ( 13 n); score: 0.769 MATCH C06HBa0153O03.1-2- SGN-E578131+ 0.893 313 0.479 C PGS_C06HBa0153O03.1-2-_SGN-E578131+ (7510 7211,5087 5075) Alignment (genomic DNA sequence = upper lines): GTAAACTGCG TACCCCTATC TGATATGATG GATAGTGGAA CCCCATGCAA TCGAACGATT 7451 ||||||||| |||| ||||| |||||||||| || ||||||| | |||||||| |||||||||| GTAAACTGCA TACCTCTATC TGATATGATG GAGAGTGGAA CTCCATGCAA TCGAACGATT 60 TCTGAGATAT AGATCTTGGC TAACTTCTCT GCATTGTAAG TCACCTTTAC CGGAATGAAA 7391 |||||||||| |||| || | |||||||||| ||||||| | ||||||| || ||||||| || TCTGAGATAT AGATTCTGAC TAACTTCTCT ACATTGTAGG TCACCTTGAC CGGAATGCAA 120 TGAGCAGATT TAGTTAACCT ATCAACAATC ACCCAAATGG AGTCATACTT ACCCATTGTC 7331 |||||||| | |||| || || |||||||| || ||||||| |||||||||| |||||||||| TGAGCAGACT TAGTCAATCT GTCAACAATT ACACAAATGG AGTCATACTT ACCCATTGTC 180 CTTGGAAGAC CAACCACAAA GTCCATTGCA ATTCTTTCCC ACTTCCATTC CGGAATGGGC 7271 ||||||| | ||||||| || |||||||||| ||||| |||| |||||||||| ||||||||| TTTGGAAGGC CAACCACGAA GTCCATTGCA ATTCTCTCCC ACTTCCATTC AGGAATGGGC 240 ATTCTCTGAA GTGTTCCTCC GGGCCTTTGG TGTTCATACT TTACTTGTTG ACAGTTTGGA 7211 |||||||||| |||| | ||| |||||||| |||||||||| |||| || || || || || ATTCTCTGAA GTGTCCGTCC AGGCCTTTGC TGTTCATACT TTACATGCTG ACCATTCGGT 300 CACTTGGCAA TAAAGTCAAC AATATCACGC TTCATTCTAC TCCACCAAAA GTGTTGTTTT 7151 .......... .......... .......... .......... .......... .......... 300 AGGTCACGAT ACATCTTGGT TGCACTTGGA TGTATAGAAT ACCTTGAACT ATGAGCCTCT 7091 .......... .......... .......... .......... .......... .......... 300 GTCAGAATAG TGTTGATTAA ATCATTGACG GGGTACACAT ACCCTTCCCT TGATTCTCAA 7031 .......... .......... .......... .......... .......... .......... 300 AACACCTTCC TCATCGATTT GTGCTTCCTT AGCCTCTCCT CGCAATACCT TATCTTGGAT 6971 .......... .......... .......... .......... .......... .......... 300 TCTTCTTAGT TTCTCATCAT CAAACTGTTT TCCCTTAATT TTGTCAAGAA AAGAAGATCT 6911 .......... .......... .......... .......... .......... .......... 300 TGACTCCACA CTAGCCAACA ATCCTCCCTT CTCATTTACT TCTAATATCA TCAAGTCATT 6851 .......... .......... .......... .......... .......... .......... 300 AGCTAGAGTC TGAACCTCTC TAGCCAATGG GCGTCTAGAA GCTTGCAAGT GAGCTAGACT 6791 .......... .......... .......... .......... .......... .......... 300 TCCCATGCTT CCCACCTTTC TACTTAAAGC ATCCGCTACA ACATTAGCCT TCCCCGGATG 6731 .......... .......... .......... .......... .......... .......... 300 ATACAAAATA GTGATATCGT AGTCCTTTAG TAACTCCATC CATCTCCTCT GTCTTAAGTT 6671 .......... .......... .......... .......... .......... .......... 300 CAAATCTTTC TGAGTAAAGA CATACTGTAG GCTACGATGA TCCGTATAGA TCTCACACTT 6611 .......... .......... .......... .......... .......... .......... 300 AACCCCATAT AAATAGTGTC TCCATTGCTT TAATGCAAAC ACCACCGCAG CCAATTCCAA 6551 .......... .......... .......... .......... .......... .......... 300 ATCGTGGGTC GGATAGTTAC GTTCATGCAC CTTTAGTTGC CTTGAAGCAT AAGCAATCAC 6491 .......... .......... .......... .......... .......... .......... 300 ACTCTTCTCT TGCATTAGTA CAACACCCAA ACCAGAATAG GATGCATCAC AATAAACAAT 6431 .......... .......... .......... .......... .......... .......... 300 GAAGTTCTTA CCCTCTACTG GCAAGGTAAG GATAGGTGCG GTAGTCAACA AAGTCTTGAG 6371 .......... .......... .......... .......... .......... .......... 300 CTTCTGAAAG CTTTCCTCAC ATTCGTCCGA CCATACAAAT GGAACATTCT GCTTAGTCAA 6311 .......... .......... .......... .......... .......... .......... 300 GTTCGTCAAT TGGGAAGCAA TAGAAGAGAA TCCCTTGACA AATCGACGGT AGTAGCTAGC 6251 .......... .......... .......... .......... .......... .......... 300 TAACCCAACA AAGCTCCTTA TTTCTGACAC ATTAGTAGGT CTTACCCAAT TCTTCACTGT 6191 .......... .......... .......... .......... .......... .......... 300 CTCAATCTTA GAAGGATCCA CCATCACTCC ATCCTTAGAA ACCACGTGCC CCAAGAAGGA 6131 .......... .......... .......... .......... .......... .......... 300 CACTGCATCT AGCCAAAACT CACACTTAGA GAATTTGGCA TAAAGCTTTT TCTCCCTCAA 6071 .......... .......... .......... .......... .......... .......... 300 CATTTCCAAT ACCATTCTCA AATGCTCTTC ATGTTCCTTC TTGCTCTTTG AGTATACCAA 6011 .......... .......... .......... .......... .......... .......... 300 TATATCATCA ATAAATACGA TCACGAAGAG GTCCAAATAT GGCTTAAAAA TCCCGTTCAT 5951 .......... .......... .......... .......... .......... .......... 300 CAAGCTCATG AACGCAACAG GGGCGTTCAT AAGACCAAAA GACATCACTA CAAATTTGTA 5891 .......... .......... .......... .......... .......... .......... 300 ATGCCCATAC CTCGTTCGAA AAGCAGTCTT TGGCACATCC GTTGCCCGTA TTTTCAATTG 5831 .......... .......... .......... .......... .......... .......... 300 ATGATAACCG GATCTCAAGT CAATCTTAGA GAAGACACAA GCACCTTGTA ACTGATCGAA 5771 .......... .......... .......... .......... .......... .......... 300 CAAGTCATCA ATGCGGGGAA GAGGATACTT GTTCTTTATG GTTACCTTGT TTAGTTGTCT 5711 .......... .......... .......... .......... .......... .......... 300 GTAGTCTATA CACATTCGAA AGCTCCCATC CTTCTTCTTT ACAAACAAAA CCGGAGCACC 5651 .......... .......... .......... .......... .......... .......... 300 CCAAGGAGAT GCACTTGGTC TAATAAAGCC TTTGTTCAAT AACTCTTGAA GTTGTGCCTT 5591 .......... .......... .......... .......... .......... .......... 300 TAACTCTCTT AACTCTGCGG GAGCCATTCT ATAAGGGGGT ATAGAAATGG GGCGTGTGCC 5531 .......... .......... .......... .......... .......... .......... 300 CGGTTCTAGA TCGATACAGA AGTCAATATC CCTATCTGGT GGCATACCAG GAAGATCTGC 5471 .......... .......... .......... .......... .......... .......... 300 AGGGAACACA TCCAGAAACT CACGGACTAC TGAAACCGAC TCAATCGAAG GCACTTGGGT 5411 .......... .......... .......... .......... .......... .......... 300 AGTGTCATCC TTGAGATGTG CCAAGAAAGC TAAACAACCT TTACTAACCA TTTTCTTAGC 5351 .......... .......... .......... .......... .......... .......... 300 ACGAAGAAAG GAGATGATAT GCACCGGATT GGAAGCGTTG TCACCCTCCC ACACTAACGG 5291 .......... .......... .......... .......... .......... .......... 300 ATCTGTCCCA GGCTTGGCTA ACGTCACCGT TTTAGCATTA CAATCCAAGA TCGCAAATTG 5231 .......... .......... .......... .......... .......... .......... 300 CGGAGAAAGC CAAGTCATAC CTAGAATTAC ATCAAAATCA TCCATTTCTA AGATAACCAA 5171 .......... .......... .......... .......... .......... .......... 300 ATCTACATAA GTGTTGCTCC CTACAAAGTT CACCAAAAAA GACCTATACA CCTTTTCAAC 5111 .......... .......... .......... .......... .......... .......... 300 TACCACAGAT TCACCCACCG GAGTAGAAAC ACGAAT 5075 |||||| | ||| .......... .......... ...TAGAAAT GCTAAT 313 hqPGS_C06HBa0153O03.1-2-_SGN-E578131+ (7510 7211) ******************************************************************************** EST sequence 47 +strand 188 n (File: SGN-E577888+) 1 CTACATCTCC TACCATACAA TGCTTCAAAT GGAGCCATAT CAATGCTTGA TTGATAGCTA 61 TTATTGTATG AAAACTCCGC TAAGGGTAGG AAGCTATCCC AATGACCACC AAACTCTATC 121 ACACACGCAC GAAGCATATT CTCCAACACT TGAATCGTTC GCTCAGACTG ACCATCGGGA 181 GAACTAGT Predicted gene structure (within gDNA segment 8362 to 6695): Exon 1 7762 7586 ( 177 n); cDNA 1 177 ( 177 n); score: 0.966 MATCH C06HBa0153O03.1-2- SGN-E577888+ 0.966 177 0.941 C PGS_C06HBa0153O03.1-2-_SGN-E577888+ (7762 7586) Alignment (genomic DNA sequence = upper lines): CTACATCTCC TACCATACAA TGCTTCAAAT GGAGCCATAT CAATGCTTGA GTGATAGCTA 7703 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| CTACATCTCC TACCATACAA TGCTTCAAAT GGAGCCATAT CAATGCTTGA TTGATAGCTA 60 TTATTGTATG AAAACTCCGC TAAGGGTAGG AAGCTATCCC AATGACCACC AAACTCTATC 7643 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTATTGTATG AAAACTCCGC TAAGGGTAGG AAGCTATCCC AATGACCACC AAACTCTATC 120 ACACACGCAC GAAGCATATC TTCCAACACT TGAATCGTCC TTTCAGACTG ACCATCG 7586 |||||||||| ||||||||| ||||||||| |||||||| | |||||||| ||||||| ACACACGCAC GAAGCATATT CTCCAACACT TGAATCGTTC GCTCAGACTG ACCATCG 177 hqPGS_C06HBa0153O03.1-2-_SGN-E577888+ (7762 7586) ******************************************************************************** EST sequence 23 -strand 763 n (File: SGN-E354383-) 1 AGAGAGTCGA TTTTCATATC CAAATTCAGA AATTCTAAGT ATGCTGAAAC GATGCACCTT 61 CGACGGGCCG TCGTGCCTGT GACGGTCCGT CGCAGTGCCC GTGGTCTTGG CCAGTTTTTC 121 CAGAATTAAA ATCTGCTGCT CAAAACGACT AAACAGGTCG TTACAATAGA TACCAATTTA 181 CCCATCGTTC GTCCCCGAAC GATCAAATGA AGAAAAACAA TGGCGAAAAG GAGTACCTGA 241 ATCTGTAAAC AGATGTGGGT ATTTTTCTTG CATATCTGCC TCCTTCTCCC AAGTAGCTTC 301 TTCAACCGGT CGATTCTTCC ATTGAACTTT GATGGATGCA ATTTCTCTCG ACCTCAACTT 361 GCGAACCTCT CTATCTAAAA TAGCAACTGG TTCCTCCTCA TAAGTCCAAT TCTCACCAAG 421 CGAAACTGAA TCCCAACGGA TGATGTAGTT TCCATCTCCA TGGTATCTTT TCAACATGGA 481 TACATGAAAT ACCGGATGTA CTCCGGACAG CCCTGGAGGT AAGGCTAATT CATAAGCCAC 541 CTCCCCTACT CGCTTTAGTA CCTCAAATGG TCCAATATAC CTTGGACTTA ACTTACCTCT 601 TTTACGAAAC CGCATCACCC CTTTCATTGG CGAGACTTTC AACAAGACTT GTTCGCCCTC 661 CATGAACTCT AAGTCTCTAA CCTTTCGATC TGCATATTCT TTTTGTCTAC TTTGCGCCGC 721 TAGAAGCTTT TCTTGAATAG ACTTCACTTT CTCCATCGAA TCT Predicted gene structure (within gDNA segment 10329 to 6800): Exon 1 8585 7823 ( 763 n); cDNA 1 763 ( 763 n); score: 0.889 MATCH C06HBa0153O03.1-2- SGN-E354383- 0.889 763 1.000 C PGS_C06HBa0153O03.1-2-_SGN-E354383- (8585 7823) Alignment (genomic DNA sequence = upper lines): AGAGAGTCGA TTTTCTGTAC CCAATTTTAG ATTTTCTAAG TGTTTTGAAA CGA-GACCCT 8527 |||||||||| ||||| || |||| || || | ||||||| | | ||||| ||| | ||| AGAGAGTCGA TTTTC-ATAT CCAAATTCAG AAATTCTAAG TATGCTGAAA CGATGCACCT 59 GCGACGGTCC GTCGTGCCCA TGACGGTCCG TCATTGGGTT CGTCGCCTCA GCCTGTTTTT 8467 |||||| || |||||||| |||||||||| || | | ||| | || ||| |||||| TCGACGGGCC GTCGTGCCTG TGACGGTCCG TCGCAGTGCC CGTGGTCTTG GCCAGTTTTT 119 CCAGAAATAA AATCTGCTGC TCAAAACGAC TAAACAGGTC GTTACAATAG ATACCAATTT 8407 |||||| ||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCAGAATTAA AATCTGCTGC TCAAAACGAC TAAACAGGTC GTTACAATAG ATACCAATTT 179 ACCCATCGTT CGTCCCCGAA CGATCACAAG AAGGAAAACA AGGGCGAAAA GGAGTACCTG 8347 |||||||||| |||||||||| |||||| | | ||| |||||| | |||||||| |||||||||| ACCCATCGTT CGTCCCCGAA CGATCAAATG AAGAAAAACA ATGGCGAAAA GGAGTACCTG 239 AATCTGTAAA CAGATGTGGG TATTTTTCTC GCATATCCGC CTCCTTCTCC CAAGTGGCTT 8287 |||||||||| |||||||||| ||||||||| ||||||| || |||||||||| ||||| |||| AATCTGTAAA CAGATGTGGG TATTTTTCTT GCATATCTGC CTCCTTCTCC CAAGTAGCTT 299 CATCAACGGG TCGATTCTTC CATTGCACCT TGATGGATGC AATCTCTCTT GACCTCAACT 8227 | ||||| || |||||||||| ||||| || | |||||||||| ||| ||||| |||||||||| CTTCAACCGG TCGATTCTTC CATTGAACTT TGATGGATGC AATTTCTCTC GACCTCAACT 359 TGCGAACTTC TCTATCTAAA ATAGCAACAG GCTCCTCCTC ATAAGACAAG TTCTCATCAA 8167 ||||||| || |||||||||| |||||||| | | |||||||| ||||| | | |||||| ||| TGCGAACCTC TCTATCTAAA ATAGCAACTG GTTCCTCCTC ATAAGTCCAA TTCTCACCAA 419 GCAAAACTGA ATCCCAACGG ATAATGTAAT TTCCATTCCC ATGATATCTT TTCAACATAG 8107 || ||||||| |||||||||| || ||||| | |||||| || ||| |||||| |||||||| | GCGAAACTGA ATCCCAACGG ATGATGTAGT TTCCATCTCC ATGGTATCTT TTCAACATGG 479 ACACATGGAA TACCGGATGT ACTCCGGACA GCCCTGGAGG CAAGGCTAAC TCATAAGCCA 8047 | ||||| || |||||||||| |||||||||| |||||||||| |||||||| |||||||||| ATACATGAAA TACCGGATGT ACTCCGGACA GCCCTGGAGG TAAGGCTAAT TCATAAGCCA 539 CCTCTCCTAC TCGCTTAAGT ACTTCAAATG GTCCAATGTA CCTTGGACTT AGTTTACCCC 7987 |||| ||||| |||||| ||| || ||||||| ||||||| || |||||||||| | ||||| | CCTCCCCTAC TCGCTTTAGT ACCTCAAATG GTCCAATATA CCTTGGACTT AACTTACCTC 599 TTTTTCCGAA CCGCATCACC CCTTTCATGG GCGAAACTTT CAACAAGACT TGTTCACCTT 7927 |||| | || |||||||||| |||||||| | |||| ||||| |||||||||| ||||| || | TTTTACGAAA CCGCATCACC CCTTTCATTG GCGAGACTTT CAACAAGACT TGTTCGCCCT 659 CCATGAACTC TAAGTCTCTA ACCTTTCGAT CTGCATATTC TTTTTGTCTA CTTTGCGACG 7867 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||| || CCATGAACTC TAAGTCTCTA ACCTTTCGAT CTGCATATTC TTTTTGTCTA CTTTGCGCCG 719 CTAACAACTT TTCTTGAATA GATTTCACTT TATCTAACGA TTCT 7823 ||| | ||| |||||||||| || ||||||| | || | ||| ||| CTAGAAGCTT TTCTTGAATA GACTTCACTT TCTCCATCGA ATCT 763 hqPGS_C06HBa0153O03.1-2-_SGN-E354383- (8585 7823) ******************************************************************************** EST sequence 11 -strand 542 n (File: SGN-E252199-) 1 CGACCCAGCC TGGGATTACG CAGTCTGTGA CGGTCCGTCC TGCACGTCCG TCACAGAGTT 61 CAGAGACTAG ATTTTTACCA AGGGTCTGTG ACGGCCCATC ACGCCTGTGA CGGTCCGTCC 121 TGCCATTCCG TCACGAAGTT CAGAGAGTCG ATTTCAGTAC CCAAATTTCA GAATTCTAAG 181 TGTTTTGGAA CGAGACCCCC TCGACGGTCC GTCGTGGGAT CCGTCGTCTC AGTCAGTTTT 241 TCCAGAAATA AAATCTGTTA CTCAAAACGA CTAAACAGGT CGTTACAATA GATACCAATT 301 TACCCATCGT TCGTCCCCGA ACGATCACAA GAAGAAAAAC AAGGGCGAAA AGGAGTACCT 361 GAATCTGTAA ACAGGTATGG GTATCTTTCT CGCATATCAA CTTCCTTCTC CCAAGTGGAT 421 TCTTCAACTG GTCGATTCTT CCATTGAACT TTGATAGATG CAATCTCCCT TGACCTCAAT 481 TTGCGGACTT CTCTATCTAA AATGGCAACA GGCTCCTCCT CATAAGACAA ATTCTCATCA 541 AG Predicted gene structure (within gDNA segment 10020 to 7458): Exon 1 8703 8166 ( 538 n); cDNA 25 542 ( 518 n); score: 0.872 MATCH C06HBa0153O03.1-2- SGN-E252199- 0.872 538 0.993 C PGS_C06HBa0153O03.1-2-_SGN-E252199- (8703 8166) Alignment (genomic DNA sequence = upper lines): CTGGGACGGT CCGTCCTGCA GGTC-GTCGC AAAGTTCAGA GACCCAATAT TTCCACCAAG 8645 ||| |||||| |||||||||| ||| ||| | | |||||||| ||| || | || |||||| CTGTGACGGT CCGTCCTGCA CGTCCGTCAC AGAGTTCAGA GACTAGATTT TT--ACCAAG 82 GGTCTGTGAC GGTCCGTCAC ACCTGTGACG GTCCGTCCTG CCATTCCGTC ACGAAGTTCA 8585 |||||||||| || || |||| ||||||||| |||||||||| |||||||||| |||||||||| GGTCTGTGAC GGCCCATCAC GCCTGTGACG GTCCGTCCTG CCATTCCGTC ACGAAGTTCA 142 GAGAGTCGAT TTTCTGTACC C-AATTTTAG ATTTTCTAAG TGTTTTGAAA CGAGACCCTG 8526 ||||||||| |||| ||||| | ||||| || | ||||||| ||||||| || ||||| | GAGAGTCGA- TTTCAGTACC CAAATTTCAG A-ATTCTAAG TGTTTTGGAA CGAGA--C-- 196 CGACGGTCCG TCGTGCCCAT GACGGTCCGT CATTGGGTTC GTCGCCTCAG CCTGTTTTTC 8466 | | || || | ||||||||| | | || | | |||| ||||| | ||||||| C--C---CC- TC--G----- -ACGGTCCGT CGTGGGATCC GTCGTCTCAG TCAGTTTTTC 242 CAGAAATAAA ATCTGCTGCT CAAAACGACT AAACAGGTCG TTACAATAGA TACCAATTTA 8406 |||||||||| ||||| | || |||||||||| |||||||||| |||||||||| |||||||||| CAGAAATAAA ATCTGTTACT CAAAACGACT AAACAGGTCG TTACAATAGA TACCAATTTA 302 CCCATCGTTC GTCCCCGAAC GATCACAAGA AGGAAAACAA GGGCGAAAAG GAGTACCTGA 8346 |||||||||| |||||||||| |||||||||| || ||||||| |||||||||| |||||||||| CCCATCGTTC GTCCCCGAAC GATCACAAGA AGAAAAACAA GGGCGAAAAG GAGTACCTGA 362 ATCTGTAAAC AGATGTGGGT ATTTTTCTCG CATATCCGCC TCCTTCTCCC AAGTGGCTTC 8286 |||||||||| || | ||||| || ||||||| |||||| | |||||||||| |||||| ||| ATCTGTAAAC AGGTATGGGT ATCTTTCTCG CATATCAACT TCCTTCTCCC AAGTGGATTC 422 ATCAACGGGT CGATTCTTCC ATTGCACCTT GATGGATGCA ATCTCTCTTG ACCTCAACTT 8226 ||||| ||| |||||||||| |||| || || ||| |||||| ||||| |||| ||||||| || TTCAACTGGT CGATTCTTCC ATTGAACTTT GATAGATGCA ATCTCCCTTG ACCTCAATTT 482 GCGAACTTCT CTATCTAAAA TAGCAACAGG CTCCTCCTCA TAAGACAAGT TCTCATCAAG 8166 ||| |||||| |||||||||| | |||||||| |||||||||| |||||||| | |||||||||| GCGGACTTCT CTATCTAAAA TGGCAACAGG CTCCTCCTCA TAAGACAAAT TCTCATCAAG 542 hqPGS_C06HBa0153O03.1-2-_SGN-E252199- (8703 8166) ******************************************************************************** EST sequence 18 -strand 515 n (File: SGN-E242359-) 1 AGTATGTATT AAGCAATATC ATAAAATTAA CTAATATCCT TAGCATGCAG CATTTGCAAT 61 TACCATAACC CTTGGTTGCA TCACCAAGCA CATCAATGAG GACTCACACC TCCTCATCAT 121 ACTTATTTGG GAATTTAGTT CATTGGATTG CATATATTAA CATATTTCAA GATTCATCAT 181 ATTTATTCCC CTCGTGTCCT TACGTGACAC TCCACTCCTC AATATACTAT CCTGGCACCG 241 GAACGTGGCA CCCGATCCAT ATTCTATCCT GGTGTCAGAA CGTGACACCC GATCCATATT 301 CTATCCTGGT GTCGGAACGT GACACCCGAT CCATATTCTA TCCTGGTACC GGAACGTGGC 361 ACCCGATCCA TATTCTATCC TGGTGTCGGA ACGTGACACC CGATCCATAT TCTATCCTGG 421 TACCGGAACG TGGCACCCGA TCCCCTAATC TCACCACTTT CGTTCATCAA GCCTTCTTTT 481 ATACCAAGGC ATCATTATTA ACAAAGTAGA TTAGG Predicted gene structure (within gDNA segment 11633 to 7300): Exon 1 9785 9497 ( 289 n); cDNA 1 288 ( 288 n); score: 0.913 Intron 1 9496 9457 ( 40 n); Pd: 0.000 (s: 0.88), Pa: 0.000 (s: 0.96) Exon 2 9456 9230 ( 227 n); cDNA 289 515 ( 227 n); score: 0.965 MATCH C06HBa0153O03.1-2- SGN-E242359- 0.936 516 1.002 C PGS_C06HBa0153O03.1-2-_SGN-E242359- (9785 9497,9456 9230) Alignment (genomic DNA sequence = upper lines): AGTATGTATT AAGCAATATC ATAAAATCAA TTAATATCCT TAGCATGCAG CATTTACAGT 9726 |||||||||| |||||||||| ||||||| || ||||||||| |||||||||| ||||| || | AGTATGTATT AAGCAATATC ATAAAATTAA CTAATATCCT TAGCATGCAG CATTTGCAAT 60 TACCATAACC CTTGGTTACA ACACCAAGCA CATCAATGAG GACTCACACC TCCTCATCAC 9666 |||||||||| ||||||| || ||||||||| |||||||||| |||||||||| ||||||||| TACCATAACC CTTGGTTGCA TCACCAAGCA CATCAATGAG GACTCACACC TCCTCATCAT 120 ACTCATTTGG GAATTTAGTT CATTAGATTG GATATATTAA CATATTTCAA GATTCATTAT 9606 ||| |||||| |||||||||| |||| ||||| ||||||||| |||||||||| ||||||| || ACTTATTTGG GAATTTAGTT CATTGGATTG CATATATTAA CATATTTCAA GATTCATCAT 180 CTTTATTCCC CTCGTGTCGG TACATGACAC TCCGCTCCTC AATATACTAT CCTGGTGTCG 9546 ||||||||| |||||||| ||| |||||| ||| |||||| |||||||||| ||||| || ATTTATTCCC CTCGTGTCCT TACGTGACAC TCCACTCCTC AATATACTAT CCTGGCACCG 240 GAACGTGACA CTCTGATCCT CATTCTATCC TGGTGTCGGA ACGTGACACT CCGATCCTCA 9486 ||||||| || | | ||||| ||||||||| ||||||| || ||||||||| GAACGTGGCA C-CCGATCCA TATTCTATCC TGGTGTCAGA ACGTGACAC. .......... 288 TATACTATCC TGGTACCGGA ACGTGGTACC CGATCCATAT TCTATCCTGG TGTCAGAACG 9426 | |||||||||| |||||||||| |||| ||||| .......... .......... .........C CGATCCATAT TCTATCCTGG TGTCGGAACG 319 TGACACCCGA TCCATATCCT ATCCTGGTAC CGGAACGTGG CACCCGATCA ATATTCTATC 9366 |||||||||| ||||||| || |||||||||| |||||||||| ||||||||| |||||||||| TGACACCCGA TCCATATTCT ATCCTGGTAC CGGAACGTGG CACCCGATCC ATATTCTATC 379 TTGGTGTCGG AACGTGACAC CCGATCCATA TTCTATCCTG GTACCGAAAC GTGGCACCGG 9306 ||||||||| |||||||||| |||||||||| |||||||||| |||||| ||| |||||||| | CTGGTGTCGG AACGTGACAC CCGATCCATA TTCTATCCTG GTACCGGAAC GTGGCACCCG 439 ATCCCCTAAT CTCATCACTT TCGTTCATCA AGCCTTCTTT TATACCAAGG CATCATCATT 9246 |||||||||| |||| ||||| |||||||||| |||||||||| |||||||||| |||||| ||| ATCCCCTAAT CTCACCACTT TCGTTCATCA AGCCTTCTTT TATACCAAGG CATCATTATT 499 AACAAAGTAG ATTAGG 9230 |||||||||| |||||| AACAAAGTAG ATTAGG 515 hqPGS_C06HBa0153O03.1-2-_SGN-E242359- (9785 9497,9456 9230) ******************************************************************************** EST sequence 95 +strand 686 n (File: SGN-E241789+) 1 ATGTCTGATC TTCAAGGACA TCAAGACATG AATGAGAGAG AATCCAGTCC GAGCTAGGAA 61 CAATAGCTCA CCCTGAAATC TGACGTGATG AAGACTGGTT AGAGTTGCGG TTGAGTTGAA 121 GACGACGGTA CGTTTGCTGC ACTCCACAAT TAACAAAAAG AAAACATAAA AGTAGGGGTC 181 AGTACAAACA CGAGTACTGA GTAGATATCA TCGGCCAACT CAGAATAGAG AACAATATAT 241 ATCAAATAAT AAAATAAAAT CAACCATAAC ACTTAACAGG TGACAACAAC AAGTACCATA 301 ACCATTGGGC ACAACCCAAG AACATCTATG AGGACTCAAG CCTCCACACC ATACTCATTT 361 GGGAAACAGG TTCATTAAAT TGAGTACATT AACATAATTC AAGATTCATT CTTTTTACTA 421 TCGTGGTGTC GGAACGTGAT ACTCCGATCC CCTAATGCTA CGTGTCGGTT CGTGACACCC 481 GATCCCCTAA TACTACGTGT CGGTTCGTTA CACCCGATCT CCTAATACTA CGTGCCGATT 541 CGTGACACCC GATCCATTAA TACTATGTGT CGGTTCGTGA CACCCGATCC ATTAATACTA 601 CGTGTCGGTT CGTGACACCC GATCCCCTAA CCTCATTCTT TTAGTTCATC AAGCCTTCTT 661 TTATACCAAG ACATCATCAT TAACAA Predicted gene structure (within gDNA segment 11633 to 4547): Exon 1 10017 9474 ( 544 n); cDNA 1 534 ( 534 n); score: 0.773 Intron 1 9473 9397 ( 77 n); Pd: 0.000 (s: 0.68), Pa: 0.000 (s: 0.74) Exon 2 9396 9241 ( 156 n); cDNA 535 686 ( 152 n); score: 0.788 MATCH C06HBa0153O03.1-2- SGN-E241789+ 0.776 700 1.020 C PGS_C06HBa0153O03.1-2-_SGN-E241789+ (10017 9474,9396 9241) Alignment (genomic DNA sequence = upper lines): ATGCCGGAAG TTCAAGG-CA TCAAGACTTG AAGAAGA-AG -ACCCAGTCC AAGCTAGAAG 9961 ||| | || ||||||| || ||||||| || || ||| || | ||||||| |||||| | ATGTCTGATC TTCAAGGACA TCAAGACATG AATGAGAGAG AATCCAGTCC GAGCTAGGAA 60 CATTAGCTCA CCCTGAATAT CCGGTATGAC GAAGACTGGC TAGAATCACT GCTGAGTTGA 9901 || ||||||| ||||||| || | | ||| ||||||||| |||| | | | |||||||| CAATAGCTCA CCCTGAA-AT CTGACGTGAT GAAGACTGGT TAGAGTTGCG GTTGAGTTGA 119 AGATGACGGA ACGTTTGCTG CACTCCACAA ATAACAAGAA GAAAACATAA AAGTAGGGGT 9841 ||| ||||| |||||||||| |||||||||| |||||| || |||||||||| |||||||||| AGACGACGGT ACGTTTGCTG CACTCCACAA TTAACAAAAA GAAAACATAA AAGTAGGGGT 179 CAGTACAAAA CACGGGTACT GAGTAGATAT CATCGGCCAA CTCAAAATAG AAAACAGTAT 9781 |||||| ||| |||| ||||| |||||||||| |||||||||| |||| ||||| | |||| ||| CAGTAC-AAA CACGAGTACT GAGTAGATAT CATCGGCCAA CTCAGAATAG AGAACAATAT 238 GTATTAAGCA ATATCATAAA ATCAATTAAT ATCCTTAGCA TGCAGCATTT ACAGTTACCA 9721 ||| || | ||| ||||| ||||| | | |||| || | || ||| ||||| ATATCAAATA ATAAAATAAA ATCAACCATA ACACTTAACA GGTGACAACA ACAAGTACCA 298 TAACCCTTGG TTACAACACC AAGCACATCA ATGAGGACTC ACACCTCCTC ATCACACTCA 9661 ||||| |||| ||||| || ||| ||||| |||||||||| | ||||| | | || ||||| TAACCATTGG GCACAAC-CC AAGAACATCT ATGAGGACTC AAGCCTCCAC ACCATACTCA 357 TTTGGGAATT TAGTTCATTA GATTGGATAT ATTAACATAT TTCAAGATTC ATTATCTTTA 9601 |||||||| |||||||| |||| || ||||||||| |||||||||| ||| | |||| TTTGGGAAAC AGGTTCATTA AATTGAGTAC ATTAACATAA TTCAAGATTC ATTCTTTTTA 417 TTCCCCTCGT GTCGGTACAT GACACTCCGC TCCTCAATAT ACTATCCTGG TGTCGGAACG 9541 | | | || ||||| || | || |||||| ||| | | || ||| | | |||||| || CTATCGTGGT GTCGGAACGT GATACTCCGA TCCCCTA-AT GCTA--C--G TGTCGGTTCG 472 TGACACTCTG ATCCTCATTC TATCCTGGTG TCGGAACGTG ACACTCCGAT CCTCATATAC 9481 |||||| | | |||| | | || | ||| |||| ||| |||| ||||| | | |||| TGACAC-CCG ATCC-CCTAA TA-CTACGTG TCGGTTCGTT ACAC-CCGAT CTCCTAATAC 528 TATCCTGGTA CCGGAACGTG GTACCCGATC CATATTCTAT CCTGGTGTCA GAACGTGACA 9421 || | || TA-CGTG... .......... .......... .......... .......... .......... 534 CCCGATCCAT ATCCTATCCT GGTACCGGAA CGTGGCACCC GATCAATATT CTATCTTGGT 9361 ||| |||| ||||| |||| || | || || || .......... .......... ....CCGATT CGTGACACCC GATCCAT-TA ATA-CTATGT 568 GTCGGAACGT GACACCCGAT CCATATTCTA TCCTGGTACC GAAACGTGGC ACCGGATCCC 9301 ||||| ||| |||||||||| |||| | || | || | | |||| | ||| |||||| GTCGGTTCGT GACACCCGAT CCAT-TAATA -CTACGTGTC GGTTCGTGAC ACCCGATCCC 626 CTAATCTCAT CACTTTCGTT CATCAAGCCT TCTTTTATAC CAAGGCATCA TCATTAACAA 9241 |||| ||||| ||| ||| |||||||||| |||||||||| |||| ||||| |||||||||| CTAACCTCAT TCTTTTAGTT CATCAAGCCT TCTTTTATAC CAAGACATCA TCATTAACAA 686 hqPGS_C06HBa0153O03.1-2-_SGN-E241789+ (10017 9474,9396 9241) ******************************************************************************** EST sequence 3 -strand 679 n (File: SGN-E550127-) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTCCT TTTCTTTTTC TTATCAAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTGCACAA CCATGAATTA ATGAAAAAAT TATGACATAA 661 AATATAAAAA ATTACTCAT Predicted gene structure (within gDNA segment 10084 to 5010): Exon 1 8976 8421 ( 556 n); cDNA 2 556 ( 555 n); score: 0.859 MATCH C06HBa0153O03.1-2- SGN-E550127- 0.859 556 0.819 C PGS_C06HBa0153O03.1-2-_SGN-E550127- (8976 8421) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTCT TTGT-TCTTT CTATTTT-CT TATTCCAACC CTCTTTCTTT 8919 ||| ||||| ||| |||||| || | || || | |||| || ||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTCCTT TTCTTTTTCT TATCAAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGACAAT AATACCCCAC TAATTAACTT 8859 |||||||||| || ||||||| |||||||||| ||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATG-GAAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 8799 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 8739 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CGCAACCTGT GACGGGCCGT CGTGCCTGGG ACGGTCCGTC CTGCAGGTCG 8679 ||||||||| |||||||||| || || |||| |||||||| | |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGACGGTCC GTCACACCTG 8620 |||||| ||| ||||||| | | |||||||| | ||| |||| |||||||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TCGATTTTCT GTACCCAATT 8560 |||||||||| || ||||||| |||| ||||| |||||||||| |||||||| |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCCGTCGTGC CCATGACGGT 8500 | ||| |||| |||||||||| |||||||||| | |||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCATTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCTGC TGCTCAAAAC 8440 ||||| | || | ||||| | ||| |||||| |||||| ||| |||||||||| | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTCGTTACA 8421 |||||||||| ||||||||| GACTAAACAG GTCGTTACA 556 hqPGS_C06HBa0153O03.1-2-_SGN-E550127- (8976 8421) ******************************************************************************** EST sequence 5 -strand 673 n (File: SGN-E550140-) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTACTCN 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATGTTATCAA CCATGAATTA ACAAAAAATT AGACCAAAAA 661 TATAAAAAAT TAC Predicted gene structure (within gDNA segment 10084 to 5070): Exon 1 8976 8421 ( 556 n); cDNA 2 556 ( 555 n); score: 0.861 MATCH C06HBa0153O03.1-2- SGN-E550140- 0.861 556 0.826 C PGS_C06HBa0153O03.1-2-_SGN-E550140- (8976 8421) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 8919 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGACAAT AATACCCCAC TAATTAACTT 8859 |||||||||| || ||||||| |||||||||| ||||| ||| |||| ||||| |||||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATG-GAAT AATAACCCAC TAATTACTCN 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 8799 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 8739 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CGCAACCTGT GACGGGCCGT CGTGCCTGGG ACGGTCCGTC CTGCAGGTCG 8679 ||||||||| |||||||||| || || |||| |||||||| | |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGACGGTCC GTCACACCTG 8620 |||||| ||| ||||||| | | |||||||| | ||| |||| |||||||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TCGATTTTCT GTACCCAATT 8560 |||||||||| || ||||||| |||| ||||| |||||||||| |||||||| |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCCGTCGTGC CCATGACGGT 8500 | ||| |||| |||||||||| |||||||||| | |||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCATTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCTGC TGCTCAAAAC 8440 ||||| | || | ||||| | ||| |||||| |||||| ||| |||||||||| | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTCGTTACA 8421 |||||||||| ||||||||| GACTAAACAG GTCGTTACA 556 hqPGS_C06HBa0153O03.1-2-_SGN-E550140- (8976 8421) ******************************************************************************** EST sequence 7 -strand 681 n (File: SGN-E389553-) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCGTTCC TACTTAAATA TTATTATTAT TTTACGATTT 601 ATAACACTAT TAGAAACAAA GATTTTCTCA ACCATGAATT AATGAAAAAA TTATGGAATA 661 AAATATAAAA AATTACTCAT T Predicted gene structure (within gDNA segment 10084 to 4990): Exon 1 8976 8421 ( 556 n); cDNA 2 556 ( 555 n); score: 0.864 MATCH C06HBa0153O03.1-2- SGN-E389553- 0.864 556 0.816 C PGS_C06HBa0153O03.1-2-_SGN-E389553- (8976 8421) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 8919 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGACAAT AATACCCCAC TAATTAACTT 8859 |||||||||| || ||||||| |||||||||| ||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATG-GAAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 8799 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 8739 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CGCAACCTGT GACGGGCCGT CGTGCCTGGG ACGGTCCGTC CTGCAGGTCG 8679 ||||||||| |||||||||| || || |||| |||||||| | |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGACGGTCC GTCACACCTG 8620 |||||| ||| ||||||| | | |||||||| | ||| |||| |||||||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TCGATTTTCT GTACCCAATT 8560 |||||||||| || ||||||| |||| ||||| |||||||||| |||||||| |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCCGTCGTGC CCATGACGGT 8500 | ||| |||| |||||||||| |||||||||| | |||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCATTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCTGC TGCTCAAAAC 8440 ||||| | || | ||||| | ||| |||||| |||||| ||| |||||||||| | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTCGTTACA 8421 |||||||||| ||||||||| GACTAAACAG GTCGTTACA 556 hqPGS_C06HBa0153O03.1-2-_SGN-E389553- (8976 8421) ******************************************************************************** EST sequence 74 +strand 720 n (File: SGN-E389834+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTA CGTCGACTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAGAACGAC 541 TAAACAGGAC GTTACATTTA TGATCGTCCT ACTTAAATAT CATTATTATT TTACGATTTA 601 TAACACTATT AGAAACGAAG ATTTTCTCGA CCATGAATTA ATGAAAAAAT ATGCCATGAA 661 ATATAAAAAT TTACTCGTTC TTCATTGAGC TATTCGTGAA AAAAAAAAAA AAATCGAGGG Predicted gene structure (within gDNA segment 10074 to 4590): Exon 1 8976 8421 ( 556 n); cDNA 2 556 ( 555 n); score: 0.861 Intron 1 8420 6641 (1780 n); Pd: 0.000 (s: 0.92), Pa: 0.147 (s: 0) Exon 2 6640 6628 ( 13 n); cDNA 557 569 ( 13 n); score: 0.615 PPA cDNA 699 714 MATCH C06HBa0153O03.1-2- SGN-E389834+ 0.861 569 0.790 C PGS_C06HBa0153O03.1-2-_SGN-E389834+ (8976 8421,6640 6628) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 8919 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGACAAT AATACCCCAC TAATTAACTT 8859 |||||||||| || ||||||| |||||||||| ||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATG-GAAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 8799 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 8739 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CGCAACCTGT GACGGGCCGT CGTGCCTGGG ACGGTCCGTC CTGCAGGTCG 8679 ||||||||| |||||||||| || || |||| |||||||| | |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGACGGTCC GTCACACCTG 8620 |||||| ||| ||||||| | | |||||||| | ||| |||| |||||||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TCGATTTTCT GTACCCAATT 8560 |||||||||| || ||||||| |||| ||||| |||||||||| |||||||| |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCCGTCGTGC CCATGACGGT 8500 | ||| |||| |||||||||| |||||||||| | |||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCATTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCTGC TGCTCAAAAC 8440 ||||| | || | ||||| | ||| |||||| |||||| ||| |||||||||| | |||| ||| CCGTCGTGGG TTACGTCGAC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAGAAC 537 GACTAAACAG GTCGTTACAA TAGATACCAA TTTACCCATC GTTCGTCCCC GAACGATCAC 8380 |||||||||| | ||||||| GACTAAACAG GACGTTACA. .......... .......... .......... .......... 556 AAGAAGGAAA ACAAGGGCGA AAAGGAGTAC CTGAATCTGT AAACAGATGT GGGTATTTTT 8320 .......... .......... .......... .......... .......... .......... 556 CTCGCATATC CGCCTCCTTC TCCCAAGTGG CTTCATCAAC GGGTCGATTC TTCCATTGCA 8260 .......... .......... .......... .......... .......... .......... 556 CCTTGATGGA TGCAATCTCT CTTGACCTCA ACTTGCGAAC TTCTCTATCT AAAATAGCAA 8200 .......... .......... .......... .......... .......... .......... 556 CAGGCTCCTC CTCATAAGAC AAGTTCTCAT CAAGCAAAAC TGAATCCCAA CGGATAATGT 8140 .......... .......... .......... .......... .......... .......... 556 AATTTCCATT CCCATGATAT CTTTTCAACA TAGACACATG GAATACCGGA TGTACTCCGG 8080 .......... .......... .......... .......... .......... .......... 556 ACAGCCCTGG AGGCAAGGCT AACTCATAAG CCACCTCTCC TACTCGCTTA AGTACTTCAA 8020 .......... .......... .......... .......... .......... .......... 556 ATGGTCCAAT GTACCTTGGA CTTAGTTTAC CCCTTTTTCC GAACCGCATC ACCCCTTTCA 7960 .......... .......... .......... .......... .......... .......... 556 TGGGCGAAAC TTTCAACAAG ACTTGTTCAC CTTCCATGAA CTCTAAGTCT CTAACCTTTC 7900 .......... .......... .......... .......... .......... .......... 556 GATCTGCATA TTCTTTTTGT CTACTTTGCG ACGCTAACAA CTTTTCTTGA ATAGATTTCA 7840 .......... .......... .......... .......... .......... .......... 556 CTTTATCTAA CGATTCTCTC AAAAGGTCAG TACCCCAAGG CCTAACCTCA AATGCATCAA 7780 .......... .......... .......... .......... .......... .......... 556 ACCAACCAAT GGGAGACCTA CATCTCCTAC CATACAATGC TTCAAATGGA GCCATATCAA 7720 .......... .......... .......... .......... .......... .......... 556 TGCTTGAGTG ATAGCTATTA TTGTATGAAA ACTCCGCTAA GGGTAGGAAG CTATCCCAAT 7660 .......... .......... .......... .......... .......... .......... 556 GACCACCAAA CTCTATCACA CACGCACGAA GCATATCTTC CAACACTTGA ATCGTCCTTT 7600 .......... .......... .......... .......... .......... .......... 556 CAGACTGACC ATCGGTCTGA GGATGGAACG CAGTACTAAG GTCCAACCTA GTACCCAATT 7540 .......... .......... .......... .......... .......... .......... 556 CTGCATGCAA TGTTTTCCAA AACTTAGAAG TAAACTGCGT ACCCCTATCT GATATGATGG 7480 .......... .......... .......... .......... .......... .......... 556 ATAGTGGAAC CCCATGCAAT CGAACGATTT CTGAGATATA GATCTTGGCT AACTTCTCTG 7420 .......... .......... .......... .......... .......... .......... 556 CATTGTAAGT CACCTTTACC GGAATGAAAT GAGCAGATTT AGTTAACCTA TCAACAATCA 7360 .......... .......... .......... .......... .......... .......... 556 CCCAAATGGA GTCATACTTA CCCATTGTCC TTGGAAGACC AACCACAAAG TCCATTGCAA 7300 .......... .......... .......... .......... .......... .......... 556 TTCTTTCCCA CTTCCATTCC GGAATGGGCA TTCTCTGAAG TGTTCCTCCG GGCCTTTGGT 7240 .......... .......... .......... .......... .......... .......... 556 GTTCATACTT TACTTGTTGA CAGTTTGGAC ACTTGGCAAT AAAGTCAACA ATATCACGCT 7180 .......... .......... .......... .......... .......... .......... 556 TCATTCTACT CCACCAAAAG TGTTGTTTTA GGTCACGATA CATCTTGGTT GCACTTGGAT 7120 .......... .......... .......... .......... .......... .......... 556 GTATAGAATA CCTTGAACTA TGAGCCTCTG TCAGAATAGT GTTGATTAAA TCATTGACGG 7060 .......... .......... .......... .......... .......... .......... 556 GGTACACATA CCCTTCCCTT GATTCTCAAA ACACCTTCCT CATCGATTTG TGCTTCCTTA 7000 .......... .......... .......... .......... .......... .......... 556 GCCTCTCCTC GCAATACCTT ATCTTGGATT CTTCTTAGTT TCTCATCATC AAACTGTTTT 6940 .......... .......... .......... .......... .......... .......... 556 CCCTTAATTT TGTCAAGAAA AGAAGATCTT GACTCCACAC TAGCCAACAA TCCTCCCTTC 6880 .......... .......... .......... .......... .......... .......... 556 TCATTTACTT CTAATATCAT CAAGTCATTA GCTAGAGTCT GAACCTCTCT AGCCAATGGG 6820 .......... .......... .......... .......... .......... .......... 556 CGTCTAGAAG CTTGCAAGTG AGCTAGACTT CCCATGCTTC CCACCTTTCT ACTTAAAGCA 6760 .......... .......... .......... .......... .......... .......... 556 TCCGCTACAA CATTAGCCTT CCCCGGATGA TACAAAATAG TGATATCGTA GTCCTTTAGT 6700 .......... .......... .......... .......... .......... .......... 556 AACTCCATCC ATCTCCTCTG TCTTAAGTTC AAATCTTTCT GAGTAAAGAC ATACTGTAGG 6640 .......... .......... .......... .......... .......... .........T 557 CTACGATGAT CC 6628 || ||| | || TTATGATCGT CC 569 hqPGS_C06HBa0153O03.1-2-_SGN-E389834+ (8976 8421) ******************************************************************************** EST sequence 62 +strand 732 n (File: SGN-E550201+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCNA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA AAAAAAAACT 721 CGAGGGGGGG CC Predicted gene structure (within gDNA segment 10074 to 5070): Exon 1 8976 8421 ( 556 n); cDNA 2 556 ( 555 n); score: 0.864 PPA cDNA 699 718 MATCH C06HBa0153O03.1-2- SGN-E550201+ 0.864 556 0.760 C PGS_C06HBa0153O03.1-2-_SGN-E550201+ (8976 8421) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 8919 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGACAAT AATACCCCAC TAATTAACTT 8859 |||||||||| || ||||||| |||||||||| ||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATG-GAAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 8799 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 8739 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CGCAACCTGT GACGGGCCGT CGTGCCTGGG ACGGTCCGTC CTGCAGGTCG 8679 ||||||||| |||||||||| || || |||| |||||||| | |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGACGGTCC GTCACACCTG 8620 |||||| ||| ||||||| | | |||||||| | ||| |||| |||||||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TCGATTTTCT GTACCCAATT 8560 |||||||||| || ||||||| |||| ||||| |||||||||| |||||||| |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCCGTCGTGC CCATGACGGT 8500 | ||| |||| |||||||||| |||||||||| | |||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCATTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCTGC TGCTCAAAAC 8440 ||||| | || | ||||| | ||| |||||| |||||| ||| |||||||||| | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTCGTTACA 8421 |||||||||| ||||||||| GACTAAACAG GTCGTTACA 556 hqPGS_C06HBa0153O03.1-2-_SGN-E550201+ (8976 8421) ******************************************************************************** EST sequence 64 +strand 709 n (File: SGN-E550207+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTNCA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAA Predicted gene structure (within gDNA segment 10074 to 5300): Exon 1 8976 8421 ( 556 n); cDNA 2 556 ( 555 n); score: 0.864 PPA cDNA 699 709 MATCH C06HBa0153O03.1-2- SGN-E550207+ 0.864 556 0.784 C PGS_C06HBa0153O03.1-2-_SGN-E550207+ (8976 8421) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 8919 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGACAAT AATACCCCAC TAATTAACTT 8859 |||||||||| || ||||||| |||||||||| ||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATG-GAAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 8799 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 8739 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CGCAACCTGT GACGGGCCGT CGTGCCTGGG ACGGTCCGTC CTGCAGGTCG 8679 ||||||||| |||||||||| || || |||| |||||||| | |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGACGGTCC GTCACACCTG 8620 |||||| ||| ||||||| | | |||||||| | ||| |||| |||||||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TCGATTTTCT GTACCCAATT 8560 |||||||||| || ||||||| |||| ||||| |||||||||| |||||||| |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCCGTCGTGC CCATGACGGT 8500 | ||| |||| |||||||||| |||||||||| | |||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCATTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCTGC TGCTCAAAAC 8440 ||||| | || | ||||| | ||| |||||| |||||| ||| |||||||||| | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTCGTTACA 8421 |||||||||| ||||||||| GACTAAACAG GTCGTTACA 556 hqPGS_C06HBa0153O03.1-2-_SGN-E550207+ (8976 8421) ******************************************************************************** EST sequence 66 +strand 715 n (File: SGN-E550335+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAATCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT 481 CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT CAAAACGACT 541 AAACAGGTCG TTACATTTAT GTTCTTCCTA CTTAAATATT ATTATTATTT TACGATTTAT 601 AACACTATTA GAAACAAAGA TTTTCTCNAC CATGAATTAA TGAAAAAATT ATGCCATAAA 661 ATATAAAAAA TTTACTCATT TTTCATTGAG CTAATTCATA AAAAAAAAAA AAAAA Predicted gene structure (within gDNA segment 10064 to 5230): Exon 1 8976 8421 ( 556 n); cDNA 2 555 ( 554 n); score: 0.862 PPA cDNA 698 715 MATCH C06HBa0153O03.1-2- SGN-E550335+ 0.862 556 0.778 C PGS_C06HBa0153O03.1-2-_SGN-E550335+ (8976 8421) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 8919 ||| ||||| ||| ||||| || ||||||| | |||||| |||| ||||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATT-CAACC CTCCTTCTTT 60 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGACAAT AATACCCCAC TAATTAACTT 8859 |||||||||| || ||||||| |||||||||| ||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATG-GAAT AATAACCCAC TAATTTACTC 119 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 8799 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 179 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 8739 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 238 ACTGGGATTG CGCAACCTGT GACGGGCCGT CGTGCCTGGG ACGGTCCGTC CTGCAGGTCG 8679 ||||||||| |||||||||| || || |||| |||||||| | |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 298 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGACGGTCC GTCACACCTG 8620 |||||| ||| ||||||| | | |||||||| | ||| ||| |||||||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAATCT GTGACGGTCC GTCACGCCCG 356 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TCGATTTTCT GTACCCAATT 8560 |||||||||| || ||||||| |||| ||||| |||||||||| |||||||| |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 416 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCCGTCGTGC CCATGACGGT 8500 | ||| |||| |||||||||| |||||||||| | |||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 476 CCGTCATTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCTGC TGCTCAAAAC 8440 ||||| | || | ||||| | ||| |||||| |||||| ||| |||||||||| | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 536 GACTAAACAG GTCGTTACA 8421 |||||||||| ||||||||| GACTAAACAG GTCGTTACA 555 hqPGS_C06HBa0153O03.1-2-_SGN-E550335+ (8976 8421) ******************************************************************************** EST sequence 72 +strand 714 n (File: SGN-E390013+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACNAAG ATTTTCTCAA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA AAAA Predicted gene structure (within gDNA segment 10074 to 5250): Exon 1 8976 8421 ( 556 n); cDNA 2 556 ( 555 n); score: 0.864 PPA cDNA 699 714 MATCH C06HBa0153O03.1-2- SGN-E390013+ 0.864 556 0.779 C PGS_C06HBa0153O03.1-2-_SGN-E390013+ (8976 8421) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 8919 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGACAAT AATACCCCAC TAATTAACTT 8859 |||||||||| || ||||||| |||||||||| ||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATG-GAAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 8799 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 8739 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CGCAACCTGT GACGGGCCGT CGTGCCTGGG ACGGTCCGTC CTGCAGGTCG 8679 ||||||||| |||||||||| || || |||| |||||||| | |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGACGGTCC GTCACACCTG 8620 |||||| ||| ||||||| | | |||||||| | ||| |||| |||||||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TCGATTTTCT GTACCCAATT 8560 |||||||||| || ||||||| |||| ||||| |||||||||| |||||||| |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCCGTCGTGC CCATGACGGT 8500 | ||| |||| |||||||||| |||||||||| | |||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCATTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCTGC TGCTCAAAAC 8440 ||||| | || | ||||| | ||| |||||| |||||| ||| |||||||||| | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTCGTTACA 8421 |||||||||| ||||||||| GACTAAACAG GTCGTTACA 556 hqPGS_C06HBa0153O03.1-2-_SGN-E390013+ (8976 8421) ******************************************************************************** EST sequence 77 +strand 717 n (File: SGN-E550484+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCAA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA AAAAAAA Predicted gene structure (within gDNA segment 10074 to 5220): Exon 1 8976 8421 ( 556 n); cDNA 2 556 ( 555 n); score: 0.864 PPA cDNA 699 717 MATCH C06HBa0153O03.1-2- SGN-E550484+ 0.864 556 0.775 C PGS_C06HBa0153O03.1-2-_SGN-E550484+ (8976 8421) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 8919 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGACAAT AATACCCCAC TAATTAACTT 8859 |||||||||| || ||||||| |||||||||| ||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATG-GAAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 8799 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 8739 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CGCAACCTGT GACGGGCCGT CGTGCCTGGG ACGGTCCGTC CTGCAGGTCG 8679 ||||||||| |||||||||| || || |||| |||||||| | |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGACGGTCC GTCACACCTG 8620 |||||| ||| ||||||| | | |||||||| | ||| |||| |||||||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TCGATTTTCT GTACCCAATT 8560 |||||||||| || ||||||| |||| ||||| |||||||||| |||||||| |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCCGTCGTGC CCATGACGGT 8500 | ||| |||| |||||||||| |||||||||| | |||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCATTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCTGC TGCTCAAAAC 8440 ||||| | || | ||||| | ||| |||||| |||||| ||| |||||||||| | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTCGTTACA 8421 |||||||||| ||||||||| GACTAAACAG GTCGTTACA 556 hqPGS_C06HBa0153O03.1-2-_SGN-E550484+ (8976 8421) ******************************************************************************** EST sequence 79 +strand 713 n (File: SGN-E550211+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT 481 CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT CAAAACGACT 541 AAACAGGTCG TTACATTTAT GTTCTTCCTA CTTAAATATT ATTATTATTT TACGATTTAT 601 AACACTATTA GAAACAAAGA TTTTCTCAAC CATGAATTAA TGAAAAAATT ATGCCATAAA 661 ATATAAAAAA TTTACTCATT TTTCATTGAG CTAATTCATA AAAAAAAAAA AAA Predicted gene structure (within gDNA segment 10064 to 5250): Exon 1 8976 8421 ( 556 n); cDNA 2 555 ( 554 n); score: 0.864 PPA cDNA 698 713 MATCH C06HBa0153O03.1-2- SGN-E550211+ 0.864 556 0.780 C PGS_C06HBa0153O03.1-2-_SGN-E550211+ (8976 8421) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 8919 ||| ||||| ||| ||||| || ||||||| | |||||| |||| ||||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATT-CAACC CTCCTTCTTT 60 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGACAAT AATACCCCAC TAATTAACTT 8859 |||||||||| || ||||||| |||||||||| ||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATG-GAAT AATAACCCAC TAATTTACTC 119 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 8799 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 179 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 8739 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 238 ACTGGGATTG CGCAACCTGT GACGGGCCGT CGTGCCTGGG ACGGTCCGTC CTGCAGGTCG 8679 ||||||||| |||||||||| || || |||| |||||||| | |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 298 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGACGGTCC GTCACACCTG 8620 |||||| ||| ||||||| | | |||||||| | ||| |||| |||||||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 356 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TCGATTTTCT GTACCCAATT 8560 |||||||||| || ||||||| |||| ||||| |||||||||| |||||||| |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 416 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCCGTCGTGC CCATGACGGT 8500 | ||| |||| |||||||||| |||||||||| | |||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 476 CCGTCATTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCTGC TGCTCAAAAC 8440 ||||| | || | ||||| | ||| |||||| |||||| ||| |||||||||| | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 536 GACTAAACAG GTCGTTACA 8421 |||||||||| ||||||||| GACTAAACAG GTCGTTACA 555 hqPGS_C06HBa0153O03.1-2-_SGN-E550211+ (8976 8421) ******************************************************************************** EST sequence 81 +strand 713 n (File: SGN-E550464+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GNTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCTA CCATGAATTA ATGAAAAATT ATGCCATAAG 661 ATATAAAAAA TTTACTCATT TTTCATTGAG CTAATTCATA AAAAAAAAAA AAA Predicted gene structure (within gDNA segment 10074 to 5260): Exon 1 8976 8421 ( 556 n); cDNA 2 556 ( 555 n); score: 0.862 PPA cDNA 698 713 MATCH C06HBa0153O03.1-2- SGN-E550464+ 0.862 556 0.780 C PGS_C06HBa0153O03.1-2-_SGN-E550464+ (8976 8421) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 8919 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGACAAT AATACCCCAC TAATTAACTT 8859 |||||||||| || ||||||| |||||||||| ||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATG-GAAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 8799 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 8739 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CGCAACCTGT GACGGGCCGT CGTGCCTGGG ACGGTCCGTC CTGCAGGTCG 8679 ||||||||| |||||||||| || || |||| |||||||| | |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGACGGTCC GTCACACCTG 8620 |||||| ||| ||||||| | | |||||||| | ||| |||| |||||||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TCGATTTTCT GTACCCAATT 8560 |||||||||| || ||||||| |||| ||||| |||||||||| |||||||| |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCCGTCGTGC CCATGACGGT 8500 | ||| |||| |||||||||| |||||||||| | |||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCATTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCTGC TGCTCAAAAC 8440 ||||| | || | ||||| | ||| |||||| |||||| ||| |||||||||| | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTCGTTACA 8421 |||||||||| |||| |||| GACTAAACAG GTCGNTACA 556 hqPGS_C06HBa0153O03.1-2-_SGN-E550464+ (8976 8421) ******************************************************************************** EST sequence 83 +strand 713 n (File: SGN-E549941+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA TATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCGA CCNATGATTA ATGAAAAATT ATGCCATCAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA AAA Predicted gene structure (within gDNA segment 10074 to 5260): Exon 1 8976 8421 ( 556 n); cDNA 2 556 ( 555 n); score: 0.862 PPA cDNA 699 713 MATCH C06HBa0153O03.1-2- SGN-E549941+ 0.862 556 0.780 C PGS_C06HBa0153O03.1-2-_SGN-E549941+ (8976 8421) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 8919 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGACAAT AATACCCCAC TAATTAACTT 8859 |||||||||| || ||||||| |||||||||| ||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATG-GAAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 8799 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 8739 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CGCAACCTGT GACGGGCCGT CGTGCCTGGG ACGGTCCGTC CTGCAGGTCG 8679 ||||||||| |||||||||| || || |||| |||||||| | |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGACGGTCC GTCACACCTG 8620 |||||| ||| ||||||| | | |||||||| | ||| |||| |||||||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TCGATTTTCT GTACCCAATT 8560 |||||||||| || ||||||| |||| ||||| |||||||||| |||||||| |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCCGTCGTGC CCATGACGGT 8500 | ||| |||| |||||||||| |||||||||| | |||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCATTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCTGC TGCTCAAAAC 8440 ||||| | || | ||||| | ||| |||||| |||||| ||| ||| |||||| | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAATATCTGC TACTCAAAAC 537 GACTAAACAG GTCGTTACA 8421 |||||||||| ||||||||| GACTAAACAG GTCGTTACA 556 hqPGS_C06HBa0153O03.1-2-_SGN-E549941+ (8976 8421) ******************************************************************************** EST sequence 85 +strand 714 n (File: SGN-E550025+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCAA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA AAAA Predicted gene structure (within gDNA segment 10074 to 5250): Exon 1 8976 8421 ( 556 n); cDNA 2 556 ( 555 n); score: 0.864 PPA cDNA 699 714 MATCH C06HBa0153O03.1-2- SGN-E550025+ 0.864 556 0.779 C PGS_C06HBa0153O03.1-2-_SGN-E550025+ (8976 8421) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 8919 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGACAAT AATACCCCAC TAATTAACTT 8859 |||||||||| || ||||||| |||||||||| ||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATG-GAAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 8799 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 8739 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CGCAACCTGT GACGGGCCGT CGTGCCTGGG ACGGTCCGTC CTGCAGGTCG 8679 ||||||||| |||||||||| || || |||| |||||||| | |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGACGGTCC GTCACACCTG 8620 |||||| ||| ||||||| | | |||||||| | ||| |||| |||||||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TCGATTTTCT GTACCCAATT 8560 |||||||||| || ||||||| |||| ||||| |||||||||| |||||||| |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCCGTCGTGC CCATGACGGT 8500 | ||| |||| |||||||||| |||||||||| | |||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCATTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCTGC TGCTCAAAAC 8440 ||||| | || | ||||| | ||| |||||| |||||| ||| |||||||||| | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTCGTTACA 8421 |||||||||| ||||||||| GACTAAACAG GTCGTTACA 556 hqPGS_C06HBa0153O03.1-2-_SGN-E550025+ (8976 8421) ******************************************************************************** EST sequence 87 +strand 558 n (File: SGN-E231589+) 1 TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA TAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT 481 CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT CAAAACGACT 541 AAACAGGTCG TTACATTT Predicted gene structure (within gDNA segment 10064 to 6800): Exon 1 8976 8421 ( 556 n); cDNA 1 555 ( 555 n); score: 0.862 MATCH C06HBa0153O03.1-2- SGN-E231589+ 0.862 556 0.996 C PGS_C06HBa0153O03.1-2-_SGN-E231589+ (8976 8421) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 8919 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 60 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGACAAT AATACCCCAC TAATTAACTT 8859 |||||||||| || ||||||| |||||||||| ||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATG-GAAT AATAACCCAC TAATTTACTC 119 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 8799 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 179 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 8739 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 238 ACTGGGATTG CGCAACCTGT GACGGGCCGT CGTGCCTGGG ACGGTCCGTC CTGCAGGTCG 8679 ||||||||| |||||||||| || || |||| |||||||| | |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 298 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGACGGTCC GTCACACCTG 8620 |||||| ||| || |||| | | |||||||| | ||| |||| |||||||||| ||||| || | TCGCAAGGTT CATAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 356 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TCGATTTTCT GTACCCAATT 8560 |||||||||| || ||||||| |||| ||||| |||||||||| |||||||| |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 416 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCCGTCGTGC CCATGACGGT 8500 | ||| |||| |||||||||| |||||||||| | |||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 476 CCGTCATTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCTGC TGCTCAAAAC 8440 ||||| | || | ||||| | ||| |||||| |||||| ||| |||||||||| | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 536 GACTAAACAG GTCGTTACA 8421 |||||||||| ||||||||| GACTAAACAG GTCGTTACA 555 hqPGS_C06HBa0153O03.1-2-_SGN-E231589+ (8976 8421) ******************************************************************************** EST sequence 103 +strand 649 n (File: SGN-E374999+) 1 TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT 481 CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT CCAAACGACT 541 AAACAGGTCG TTACATTTAT GTTCTTCCTA CTTAAATATT ATTATTATTT TACGATTTAT 601 AACACTATTA GAAACAAAGA TTTTCTTCAC CCTGAATTAA TGAAAAAAT Predicted gene structure (within gDNA segment 10064 to 5890): Exon 1 8976 8421 ( 556 n); cDNA 1 555 ( 555 n); score: 0.862 MATCH C06HBa0153O03.1-2- SGN-E374999+ 0.862 556 0.857 C PGS_C06HBa0153O03.1-2-_SGN-E374999+ (8976 8421) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 8919 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 60 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGACAAT AATACCCCAC TAATTAACTT 8859 |||||||||| || ||||||| |||||||||| ||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATG-GAAT AATAACCCAC TAATTTACTC 119 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 8799 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 179 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 8739 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 238 ACTGGGATTG CGCAACCTGT GACGGGCCGT CGTGCCTGGG ACGGTCCGTC CTGCAGGTCG 8679 ||||||||| |||||||||| || || |||| |||||||| | |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 298 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGACGGTCC GTCACACCTG 8620 |||||| ||| ||||||| | | |||||||| | ||| |||| |||||||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 356 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TCGATTTTCT GTACCCAATT 8560 |||||||||| || ||||||| |||| ||||| |||||||||| |||||||| |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 416 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCCGTCGTGC CCATGACGGT 8500 | ||| |||| |||||||||| |||||||||| | |||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 476 CCGTCATTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCTGC TGCTCAAAAC 8440 ||||| | || | ||||| | ||| |||||| |||||| ||| |||||||||| | ||| |||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCCAAAC 536 GACTAAACAG GTCGTTACA 8421 |||||||||| ||||||||| GACTAAACAG GTCGTTACA 555 hqPGS_C06HBa0153O03.1-2-_SGN-E374999+ (8976 8421) ******************************************************************************** EST sequence 108 +strand 711 n (File: SGN-E396039+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AAAAACAAAG ATTTTCTCCA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AAATAAAAAA AATTTACTCA TTTTTTCTTG GAGCTAATTC AAAAAAAAAA A Predicted gene structure (within gDNA segment 10074 to 5280): Exon 1 8976 8421 ( 556 n); cDNA 2 556 ( 555 n); score: 0.864 PPA cDNA 661 672 MATCH C06HBa0153O03.1-2- SGN-E396039+ 0.864 556 0.782 C PGS_C06HBa0153O03.1-2-_SGN-E396039+ (8976 8421) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 8919 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGACAAT AATACCCCAC TAATTAACTT 8859 |||||||||| || ||||||| |||||||||| ||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATG-GAAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 8799 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 8739 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CGCAACCTGT GACGGGCCGT CGTGCCTGGG ACGGTCCGTC CTGCAGGTCG 8679 ||||||||| |||||||||| || || |||| |||||||| | |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGACGGTCC GTCACACCTG 8620 |||||| ||| ||||||| | | |||||||| | ||| |||| |||||||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TCGATTTTCT GTACCCAATT 8560 |||||||||| || ||||||| |||| ||||| |||||||||| |||||||| |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCCGTCGTGC CCATGACGGT 8500 | ||| |||| |||||||||| |||||||||| | |||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCATTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCTGC TGCTCAAAAC 8440 ||||| | || | ||||| | ||| |||||| |||||| ||| |||||||||| | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTCGTTACA 8421 |||||||||| ||||||||| GACTAAACAG GTCGTTACA 556 hqPGS_C06HBa0153O03.1-2-_SGN-E396039+ (8976 8421) ******************************************************************************** EST sequence 110 +strand 618 n (File: SGN-E396054+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAA Predicted gene structure (within gDNA segment 10074 to 6210): Exon 1 8976 8421 ( 556 n); cDNA 2 556 ( 555 n); score: 0.864 MATCH C06HBa0153O03.1-2- SGN-E396054+ 0.864 556 0.900 C PGS_C06HBa0153O03.1-2-_SGN-E396054+ (8976 8421) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 8919 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGACAAT AATACCCCAC TAATTAACTT 8859 |||||||||| || ||||||| |||||||||| ||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATG-GAAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 8799 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 8739 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CGCAACCTGT GACGGGCCGT CGTGCCTGGG ACGGTCCGTC CTGCAGGTCG 8679 ||||||||| |||||||||| || || |||| |||||||| | |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGACGGTCC GTCACACCTG 8620 |||||| ||| ||||||| | | |||||||| | ||| |||| |||||||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TCGATTTTCT GTACCCAATT 8560 |||||||||| || ||||||| |||| ||||| |||||||||| |||||||| |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCCGTCGTGC CCATGACGGT 8500 | ||| |||| |||||||||| |||||||||| | |||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCATTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCTGC TGCTCAAAAC 8440 ||||| | || | ||||| | ||| |||||| |||||| ||| |||||||||| | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTCGTTACA 8421 |||||||||| ||||||||| GACTAAACAG GTCGTTACA 556 hqPGS_C06HBa0153O03.1-2-_SGN-E396054+ (8976 8421) ******************************************************************************** EST sequence 112 +strand 711 n (File: SGN-E396056+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAA ATTTTCTCAC CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATAATAAAA ATTTACTCAT TTTTTCTTTG AGCTAATTCA TAAAAAAAAA A Predicted gene structure (within gDNA segment 10074 to 5280): Exon 1 8976 8421 ( 556 n); cDNA 2 556 ( 555 n); score: 0.864 PPA cDNA 700 711 MATCH C06HBa0153O03.1-2- SGN-E396056+ 0.864 556 0.782 C PGS_C06HBa0153O03.1-2-_SGN-E396056+ (8976 8421) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 8919 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGACAAT AATACCCCAC TAATTAACTT 8859 |||||||||| || ||||||| |||||||||| ||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATG-GAAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 8799 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 8739 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CGCAACCTGT GACGGGCCGT CGTGCCTGGG ACGGTCCGTC CTGCAGGTCG 8679 ||||||||| |||||||||| || || |||| |||||||| | |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGACGGTCC GTCACACCTG 8620 |||||| ||| ||||||| | | |||||||| | ||| |||| |||||||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TCGATTTTCT GTACCCAATT 8560 |||||||||| || ||||||| |||| ||||| |||||||||| |||||||| |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCCGTCGTGC CCATGACGGT 8500 | ||| |||| |||||||||| |||||||||| | |||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCATTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCTGC TGCTCAAAAC 8440 ||||| | || | ||||| | ||| |||||| |||||| ||| |||||||||| | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTCGTTACA 8421 |||||||||| ||||||||| GACTAAACAG GTCGTTACA 556 hqPGS_C06HBa0153O03.1-2-_SGN-E396056+ (8976 8421) ******************************************************************************** EST sequence 114 +strand 610 n (File: SGN-E396058+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTGTTT CCAAAAATAA AATCTGCTAC TCACAACGAC 541 TAAACAGGTC GTTACATTTA GGTTCTTCAT AGTTAACTAT TATTATTATT TTACGATTTA 601 TAACACTATT Predicted gene structure (within gDNA segment 10074 to 6290): Exon 1 8976 8421 ( 556 n); cDNA 2 556 ( 555 n); score: 0.861 MATCH C06HBa0153O03.1-2- SGN-E396058+ 0.861 556 0.911 C PGS_C06HBa0153O03.1-2-_SGN-E396058+ (8976 8421) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 8919 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGACAAT AATACCCCAC TAATTAACTT 8859 |||||||||| || ||||||| |||||||||| ||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATG-GAAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 8799 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 8739 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CGCAACCTGT GACGGGCCGT CGTGCCTGGG ACGGTCCGTC CTGCAGGTCG 8679 ||||||||| |||||||||| || || |||| |||||||| | |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGACGGTCC GTCACACCTG 8620 |||||| ||| ||||||| | | |||||||| | ||| |||| |||||||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TCGATTTTCT GTACCCAATT 8560 |||||||||| || ||||||| |||| ||||| |||||||||| |||||||| |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCCGTCGTGC CCATGACGGT 8500 | ||| |||| |||||||||| |||||||||| | |||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCATTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCTGC TGCTCAAAAC 8440 ||||| | || | ||||| | ||| ||||| |||||| ||| |||||||||| | |||| ||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTG TTTCCAAAAA TAAAATCTGC TACTCACAAC 537 GACTAAACAG GTCGTTACA 8421 |||||||||| ||||||||| GACTAAACAG GTCGTTACA 556 hqPGS_C06HBa0153O03.1-2-_SGN-E396058+ (8976 8421) ******************************************************************************** EST sequence 122 +strand 690 n (File: SGN-E377133+) 1 TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT 481 CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT CAAAACGACT 541 AAACAGGTCG TTACATTTAT GTTCTTCCTA CTTAAATATT ATTATTATTT TACGATTTAT 601 AACACTATTA GAAACAAAGA TTTTCTCAAC CATGAATTAA TGAAAAAATT ATGCCATAAA 661 ATATAAAAAA TTTACTCATT TTTCATTGAG Predicted gene structure (within gDNA segment 10064 to 5480): Exon 1 8976 8421 ( 556 n); cDNA 1 555 ( 555 n); score: 0.864 MATCH C06HBa0153O03.1-2- SGN-E377133+ 0.864 556 0.806 C PGS_C06HBa0153O03.1-2-_SGN-E377133+ (8976 8421) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 8919 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 60 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGACAAT AATACCCCAC TAATTAACTT 8859 |||||||||| || ||||||| |||||||||| ||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATG-GAAT AATAACCCAC TAATTTACTC 119 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 8799 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 179 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 8739 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 238 ACTGGGATTG CGCAACCTGT GACGGGCCGT CGTGCCTGGG ACGGTCCGTC CTGCAGGTCG 8679 ||||||||| |||||||||| || || |||| |||||||| | |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 298 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGACGGTCC GTCACACCTG 8620 |||||| ||| ||||||| | | |||||||| | ||| |||| |||||||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 356 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TCGATTTTCT GTACCCAATT 8560 |||||||||| || ||||||| |||| ||||| |||||||||| |||||||| |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 416 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCCGTCGTGC CCATGACGGT 8500 | ||| |||| |||||||||| |||||||||| | |||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 476 CCGTCATTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCTGC TGCTCAAAAC 8440 ||||| | || | ||||| | ||| |||||| |||||| ||| |||||||||| | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 536 GACTAAACAG GTCGTTACA 8421 |||||||||| ||||||||| GACTAAACAG GTCGTTACA 555 hqPGS_C06HBa0153O03.1-2-_SGN-E377133+ (8976 8421) ******************************************************************************** EST sequence 56 +strand 729 n (File: SGN-E550212+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCAA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA AAAAAACTCG 721 GGGGGGGGC Predicted gene structure (within gDNA segment 10074 to 5100): Exon 1 8976 8421 ( 556 n); cDNA 2 556 ( 555 n); score: 0.864 PPA cDNA 699 716 MATCH C06HBa0153O03.1-2- SGN-E550212+ 0.864 556 0.763 C PGS_C06HBa0153O03.1-2-_SGN-E550212+ (8976 8421) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 8919 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGACAAT AATACCCCAC TAATTAACTT 8859 |||||||||| || ||||||| |||||||||| ||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATG-GAAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 8799 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 8739 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CGCAACCTGT GACGGGCCGT CGTGCCTGGG ACGGTCCGTC CTGCAGGTCG 8679 ||||||||| |||||||||| || || |||| |||||||| | |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGACGGTCC GTCACACCTG 8620 |||||| ||| ||||||| | | |||||||| | ||| |||| |||||||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TCGATTTTCT GTACCCAATT 8560 |||||||||| || ||||||| |||| ||||| |||||||||| |||||||| |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCCGTCGTGC CCATGACGGT 8500 | ||| |||| |||||||||| |||||||||| | |||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCATTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCTGC TGCTCAAAAC 8440 ||||| | || | ||||| | ||| |||||| |||||| ||| |||||||||| | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTCGTTACA 8421 |||||||||| ||||||||| GACTAAACAG GTCGTTACA 556 hqPGS_C06HBa0153O03.1-2-_SGN-E550212+ (8976 8421) ******************************************************************************** EST sequence 58 +strand 710 n (File: SGN-E550065+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCAA CCATGAATTA ATGAAAAAAT TATGCCATGA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTGATTCAT AAGAAAAAAA Predicted gene structure (within gDNA segment 10074 to 5290): Exon 1 8976 8421 ( 556 n); cDNA 2 556 ( 555 n); score: 0.864 MATCH C06HBa0153O03.1-2- SGN-E550065+ 0.864 556 0.783 C PGS_C06HBa0153O03.1-2-_SGN-E550065+ (8976 8421) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 8919 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGACAAT AATACCCCAC TAATTAACTT 8859 |||||||||| || ||||||| |||||||||| ||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATG-GAAT AATAACCCAC TAATTTACTC 120 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 8799 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 180 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 8739 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 239 ACTGGGATTG CGCAACCTGT GACGGGCCGT CGTGCCTGGG ACGGTCCGTC CTGCAGGTCG 8679 ||||||||| |||||||||| || || |||| |||||||| | |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 299 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGACGGTCC GTCACACCTG 8620 |||||| ||| ||||||| | | |||||||| | ||| |||| |||||||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TCGATTTTCT GTACCCAATT 8560 |||||||||| || ||||||| |||| ||||| |||||||||| |||||||| |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 417 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCCGTCGTGC CCATGACGGT 8500 | ||| |||| |||||||||| |||||||||| | |||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCATTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCTGC TGCTCAAAAC 8440 ||||| | || | ||||| | ||| |||||| |||||| ||| |||||||||| | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTCGTTACA 8421 |||||||||| ||||||||| GACTAAACAG GTCGTTACA 556 hqPGS_C06HBa0153O03.1-2-_SGN-E550065+ (8976 8421) ******************************************************************************** EST sequence 60 +strand 726 n (File: SGN-E550322+) 1 TCGCACCAGA TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC 61 CTCCTTCTTT TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT 121 AATTTACTCA AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA 181 CTAACTTTAT AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC 241 CGACCCAGAC TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT 301 GCAGGTCGTC GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA 361 CGCCCGTGAC GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC 421 CCAATTTCAG AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT 481 GACGGTCCGT CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT 541 CAAAACGACT AAACAGGTCG TTACATTTAT GTTCTTCCTA CTTAAATATT ATTATTATTT 601 TACGATTTAT AACACTATTA GAAACAAAGA TTTTCTCAAC CATGAATTAA TGAAAAAATT 661 ATGCCATAAA ATATAAAAAA TTTACTCATT TTTCATTGAG CTAATTCATA AAAAAAAAAA 721 AAAAAC Predicted gene structure (within gDNA segment 10164 to 5220): Exon 1 8954 8421 ( 534 n); cDNA 35 565 ( 531 n); score: 0.874 PPA cDNA 708 725 MATCH C06HBa0153O03.1-2- SGN-E550322+ 0.874 534 0.736 C PGS_C06HBa0153O03.1-2-_SGN-E550322+ (8954 8421) Alignment (genomic DNA sequence = upper lines): GTTCTTTCTA TTTTCTTATT CCAACCCTCT TTCTTTTACC CTAATTAGTA TATAATTAAG 8895 ||||||| |||||||||| | ||||||| |||||||||| |||||||| | |||||||||| GTTCTTTTCT TTTTCTTATT CAAACCCTCC TTCTTTTACC CTAATTAGCA TATAATTAAG 94 AATAAAAGAT GACAATAATA CCCCACTAAT TAACTTAAGG TTACCTCTTT TAACCCCCAA 8835 |||||||||| | ||||||| ||||||||| | ||| |||| |||||||||| ||||||||| AATAAAAGAT G-GAATAATA ACCCACTAAT TTACTCAAGG TTACCTCTTT TAACCCCCAG 153 GGATTTTGAG TTATTAATAT AAACCCATGA AATATATAAT CATAGCAGGA ATAGTCCAAA 8775 | | || || ||||||| || ||||||| | | | |||||| | || |||| |||||||||| GTAATTAGAC TTATTAACAT AAACCCACTA ACTTTATAAT TAAAGTAGGA ATAGTCCAAA 213 ACGCCCCTTT AAAACTTAAC CAGAAATCTG ACTCCAACTG GGATTGCGCA ACCTGTGACG 8715 ||| ||| || ||||| | ||||||| | || | |||| ||||| |||| |||||||| | ACGTCCC-TT AAAACGTGTA AAGAAATCCG ACCCAGACTG GGATTACGCA ACCTGTGATG 272 GGCCGTCGTG CCTGGGACGG TCCGTCCTGC AGGTCGTCGC AAAGTTCAGA GACCCAATAT 8655 | |||||||| |||| ||||| |||||||||| |||||||||| || ||||||| ||| | | || GCCCGTCGTG CCTGCGACGG TCCGTCCTGC AGGTCGTCGC AAGGTTCAGA GACTC-A-AT 330 TTCCACC-AA GGGTCTGTGA CGGTCCGTCA CACCTGTGAC GGTCCGTCCT GCCATTCCGT 8596 ||||||| || | |||||||| |||||||||| | || ||||| |||||||| | |||||||||| TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC GGTCCGTCGT GCCATTCCGT 390 CACGAAGTTC AGAGAGTCGA TTTTCTGTAC CCAATTTTAG ATTTTCTAAG TGTTTTGAAA 8536 ||||||||| |||||||||| |||| |||| ||||||| || | |||||||| |||||||||| TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG AATTTCTAAG TGTTTTGAAA 450 CGAGACCCTG CGACGGTCCG TCGTGCCCAT GACGGTCCGT CATTGGGTTC GTCGCCTCAG 8476 |||||| | ||||||||| |||||| ||| |||||||||| | | || | | |||| |||| CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT CGTGGGTTCC GTCGTCTCAA 510 CCTGTTTTTC CAGAAATAAA ATCTGCTGCT CAAAACGACT AAACAGGTCG TTACA 8421 |||||||||| || ||||||| ||||||| || |||||||||| |||||||||| ||||| CCTGTTTTTC CAAAAATAAA ATCTGCTACT CAAAACGACT AAACAGGTCG TTACA 565 hqPGS_C06HBa0153O03.1-2-_SGN-E550322+ (8954 8421) ******************************************************************************** EST sequence 27 -strand 658 n (File: SGN-E377132-) 1 TTCCTTCTTT TACCCTAATT AGCATATATT TAAGAATAAA AGATGGAATA ATAACCCACT 61 AATTTACTCA AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA 121 CTAACTTTAT AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC 181 CGACCCAGAC TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT 241 GCAGGTCGTC GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA 301 CGCCCGTGAC GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC 361 CCAATTTCAG AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT 421 GACGGTCCGT CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT 481 CAAAACGACT AAACAGGTCG TTACATTTAT GTTCTTCCTA CTTAAATATT ATTATTATTT 541 TACGATTTAT AACACTATTA GAAACAAAGA TTTTCTCAAC CATGAATTAA TGAAAAAATT 601 ATGCCATAAA ATATAAAAAA TTTACTCATT TTTCATTGAG CTAATTCATA AAAAAAAA Predicted gene structure (within gDNA segment 9574 to 5310): Exon 1 8927 8421 ( 507 n); cDNA 2 505 ( 504 n); score: 0.873 PPA cDNA 648 658 MATCH C06HBa0153O03.1-2- SGN-E377132- 0.873 507 0.771 C PGS_C06HBa0153O03.1-2-_SGN-E377132- (8927 8421) Alignment (genomic DNA sequence = upper lines): TCTTTCTTTT ACCCTAATTA GTATATAATT AAGAATAAAA GATGACAATA ATACCCCACT 8868 || ||||||| |||||||||| | ||||| || |||||||||| |||| |||| ||| |||||| TCCTTCTTTT ACCCTAATTA GCATATATTT AAGAATAAAA GATG-GAATA ATAACCCACT 60 AATTAACTTA AGGTTACCTC TTTTAACCCC CAAGGATTTT GAGTTATTAA TATAAACCCA 8808 |||| ||| | |||||||||| |||||||||| || | | || || ||||||| ||||||||| AATTTACTCA AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA 120 TGAAATATAT AATCATAGCA GGAATAGTCC AAAACGCCCC TTTAAAACTT AACCAGAAAT 8748 || | ||| ||| | || | |||||||||| |||||| ||| ||||||| | |||||| CTAACTTTAT AATTAAAGTA GGAATAGTCC AAAACGTCCC -TTAAAACGT GTAAAGAAAT 179 CTGACTCCAA CTGGGATTGC GCAACCTGTG ACGGGCCGTC GTGCCTGGGA CGGTCCGTCC 8688 | ||| | | |||||||| | |||||||||| | || ||||| ||||||| || |||||||||| CCGACCCAGA CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC 239 TGCAGGTCGT CGCAAAGTTC AGAGACCCAA TATTTCCACC -AAGGGTCTG TGACGGTCCG 8629 |||||||||| ||||| |||| |||||| | | ||||||||| ||| ||||| |||||||||| TGCAGGTCGT CGCAAGGTTC AGAGACTC-A -ATTTCCACC AAAGAGTCTG TGACGGTCCG 297 TCACACCTGT GACGGTCCGT CCTGCCATTC CGTCACGAAG TTCAGAGAGT CGATTTTCTG 8569 |||| || || |||||||||| | |||||||| ||| |||||| |||||||||| ||||||| | TCACGCCCGT GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG 357 TACCCAATTT TAGATTTTCT AAGTGTTTTG AAACGAGACC CTGCGACGGT CCGTCGTGCC 8509 |||||||||| ||| ||||| |||||||||| ||||||||| | ||||||| || |||||| TACCCAATTT CAGAATTTCT AAGTGTTTTG AAACGAGACT CCTCGACGGT CCATCGTGCT 417 CATGACGGTC CGTCATTGGG TTCGTCGCCT CAGCCTGTTT TTCCAGAAAT AAAATCTGCT 8449 |||||||||| |||| | || | ||||| || || ||||||| ||||| |||| |||||||||| CATGACGGTC CGTCGTGGGT TCCGTCGTCT CAACCTGTTT TTCCAAAAAT AAAATCTGCT 477 GCTCAAAACG ACTAAACAGG TCGTTACA 8421 ||||||||| |||||||||| |||||||| ACTCAAAACG ACTAAACAGG TCGTTACA 505 hqPGS_C06HBa0153O03.1-2-_SGN-E377132- (8927 8421) ******************************************************************************** EST sequence 127 +strand 548 n (File: SGN-E356257+) 1 AAAACACTTA GAAAATCTGA AATTGGGTAC TGAAAATCGA CTCTCTGAAC TTCGTGACGG 61 AATGACAGGA CGGACCATCA CAGGCGTGAC GGACCGTCAT AGATTGTTCA GTGGAAATTG 121 ACTCTCTGAC CCTTGCGACG ACCTGCAGGA CGGACCGTCA CAGTCACGAC GGCCCGTCAC 181 AGGTTGCGCA AATCCCAGGC AGAATCGGAT TTCCTTACAC GTTTTAAGGG ACGTTTTTGG 241 ACTATTCTTT CCTTAATTAT AGATTTCGTG GGTTTATATT AATAACTCAA ATTCTTGGGG 301 TTAAAAGAGG TAACCCTAAG TTAATTAGTG GGGTATTATT GCCATCTTTT ATTCTTAATT 361 ATATGTTAAT TTGGGGTAAA AGAAAGAGGG TTTTGAAACG AGACCCTGCG ATGGTCCGTC 421 GTGCCCATGA CGGTCCGTCG TGGGGTCCGT CGCTTCTGCC AGTTTTTCCA GAAATAAAGT 481 CTGCTGCTCA AAACGACTAA ACGGGTCGTT ACATTATTTG ACTCTATCAC TTTAATTTTC 541 TAGTTAAC Predicted gene structure (within gDNA segment 11633 to 7362): Exon 1 10521 10471 ( 51 n); cDNA 337 385 ( 49 n); score: 0.667 Intron 1 10470 8549 (1922 n); Pd: 0.000 (s: 0.66), Pa: 0.000 (s: 0.94) Exon 2 8548 8421 ( 128 n); cDNA 386 513 ( 128 n); score: 0.914 MATCH C06HBa0153O03.1-2- SGN-E356257+ 0.844 179 0.327 C PGS_C06HBa0153O03.1-2-_SGN-E356257+ (10521 10471,8548 8421) Alignment (genomic DNA sequence = upper lines): TACTGAATAT ATCTTATTTG TACAAAATAT ATTAATTTTT TTTAAAAAAA AATGGCGTGT 10462 || || || | ||||| || | |||| ||||||| ||||| || | TATTG-CCAT CTTTTATTCT TA-ATTATAT GTTAATTTGG GGTAAAAGAA A......... 385 ATCAAGTGTA ATAACTAGTA TCACTGTTTG TGATATTATA TATTTTGATA TTTCATGTAT 10402 .......... .......... .......... .......... .......... .......... 385 CATATACATA ATTTAATTAC GTATATAATT TTCCTTTTGT TGTATCATAC AATATATTTA 10342 .......... .......... .......... .......... .......... .......... 385 CGTATATATA ATAAGTGTTT TGTTATGACA ATAATTACAT TAGTGTAACT ATGTCACGAC 10282 .......... .......... .......... .......... .......... .......... 385 CCAAATCCGG GCCGCGTCTG GCACCCACAC TTACCCTCCT ATGTGAGCGA ACCAACCAAT 10222 .......... .......... .......... .......... .......... .......... 385 CTAAACCTTA ACATTTCAAT ATAATATAAC CAGAAAGTAA TGCGGAAGAC TTAAACTCAT 10162 .......... .......... .......... .......... .......... .......... 385 TAAATAAAGA CCAATTCATT AACTTCTAAA ATTCAACATC TATTATTCCC CCAAAATCTG 10102 .......... .......... .......... .......... .......... .......... 385 GAAGTCATCA TCACAAGAAC ATCTACGATC AAATGACTAA ACTAAGAGTA TTCTAAAAGC 10042 .......... .......... .......... .......... .......... .......... 385 TAAAAATACA TAAGAAGCTA GTCCATGCCG GAAGTTCAAG GCATCAAGAC TTGAAGAAGA 9982 .......... .......... .......... .......... .......... .......... 385 AGACCCAGTC CAAGCTAGAA GCATTAGCTC ACCCTGAATA TCCGGTATGA CGAAGACTGG 9922 .......... .......... .......... .......... .......... .......... 385 CTAGAATCAC TGCTGAGTTG AAGATGACGG AACGTTTGCT GCACTCCACA AATAACAAGA 9862 .......... .......... .......... .......... .......... .......... 385 AGAAAACATA AAAGTAGGGG TCAGTACAAA ACACGGGTAC TGAGTAGATA TCATCGGCCA 9802 .......... .......... .......... .......... .......... .......... 385 ACTCAAAATA GAAAACAGTA TGTATTAAGC AATATCATAA AATCAATTAA TATCCTTAGC 9742 .......... .......... .......... .......... .......... .......... 385 ATGCAGCATT TACAGTTACC ATAACCCTTG GTTACAACAC CAAGCACATC AATGAGGACT 9682 .......... .......... .......... .......... .......... .......... 385 CACACCTCCT CATCACACTC ATTTGGGAAT TTAGTTCATT AGATTGGATA TATTAACATA 9622 .......... .......... .......... .......... .......... .......... 385 TTTCAAGATT CATTATCTTT ATTCCCCTCG TGTCGGTACA TGACACTCCG CTCCTCAATA 9562 .......... .......... .......... .......... .......... .......... 385 TACTATCCTG GTGTCGGAAC GTGACACTCT GATCCTCATT CTATCCTGGT GTCGGAACGT 9502 .......... .......... .......... .......... .......... .......... 385 GACACTCCGA TCCTCATATA CTATCCTGGT ACCGGAACGT GGTACCCGAT CCATATTCTA 9442 .......... .......... .......... .......... .......... .......... 385 TCCTGGTGTC AGAACGTGAC ACCCGATCCA TATCCTATCC TGGTACCGGA ACGTGGCACC 9382 .......... .......... .......... .......... .......... .......... 385 CGATCAATAT TCTATCTTGG TGTCGGAACG TGACACCCGA TCCATATTCT ATCCTGGTAC 9322 .......... .......... .......... .......... .......... .......... 385 CGAAACGTGG CACCGGATCC CCTAATCTCA TCACTTTCGT TCATCAAGCC TTCTTTTATA 9262 .......... .......... .......... .......... .......... .......... 385 CCAAGGCATC ATCATTAACA AAGTAGATTA GGGTTTCTTT TCAAGATTTG GGATTCAATG 9202 .......... .......... .......... .......... .......... .......... 385 GCTTCATCAT ACTTATTTAT TCACAATTAC ATAATCACAT CATTCATGCA AGCATACAAT 9142 .......... .......... .......... .......... .......... .......... 385 TAAGCATATA GAAGGTTTAC AATACTACTA ACACATATCA TTCGCTATTA AGAGTTTGCT 9082 .......... .......... .......... .......... .......... .......... 385 ACGAATAGCA TGAAATAACC ATAACCTACC TCCACTGAAG ATTAGTGATT AAGCAAGAAA 9022 .......... .......... .......... .......... .......... .......... 385 TTCCCAAGGC TTTTGTTCCT TCTTCTCGTT CGATCCTCCC TCAATTCGTT TCTCTTTCCC 8962 .......... .......... .......... .......... .......... .......... 385 TCTCTTTGTT CTTTCTATTT TCTTATTCCA ACCCTCTTTC TTTTACCCTA ATTAGTATAT 8902 .......... .......... .......... .......... .......... .......... 385 AATTAAGAAT AAAAGATGAC AATAATACCC CACTAATTAA CTTAAGGTTA CCTCTTTTAA 8842 .......... .......... .......... .......... .......... .......... 385 CCCCCAAGGA TTTTGAGTTA TTAATATAAA CCCATGAAAT ATATAATCAT AGCAGGAATA 8782 .......... .......... .......... .......... .......... .......... 385 GTCCAAAACG CCCCTTTAAA ACTTAACCAG AAATCTGACT CCAACTGGGA TTGCGCAACC 8722 .......... .......... .......... .......... .......... .......... 385 TGTGACGGGC CGTCGTGCCT GGGACGGTCC GTCCTGCAGG TCGTCGCAAA GTTCAGAGAC 8662 .......... .......... .......... .......... .......... .......... 385 CCAATATTTC CACCAAGGGT CTGTGACGGT CCGTCACACC TGTGACGGTC CGTCCTGCCA 8602 .......... .......... .......... .......... .......... .......... 385 TTCCGTCACG AAGTTCAGAG AGTCGATTTT CTGTACCCAA TTTTAGATTT TCTAAGTGTT 8542 || ||| .......... .......... .......... .......... .......... ...GAGGGTT 392 TTGAAACGAG ACCCTGCGAC GGTCCGTCGT GCCCATGACG GTCCGTCATT GGGTTCGTCG 8482 |||||||||| ||||||||| |||||||||| |||||||||| ||||||| | |||| ||||| TTGAAACGAG ACCCTGCGAT GGTCCGTCGT GCCCATGACG GTCCGTCGTG GGGTCCGTCG 452 CCTCAGCCTG TTTTTCCAGA AATAAAATCT GCTGCTCAAA ACGACTAAAC AGGTCGTTAC 8422 | || ||| | |||||||||| |||||| ||| |||||||||| |||||||||| ||||||||| CTTCTGCCAG TTTTTCCAGA AATAAAGTCT GCTGCTCAAA ACGACTAAAC GGGTCGTTAC 512 A 8421 | A 513 hqPGS_C06HBa0153O03.1-2-_SGN-E356257+ (8548 8421) ******************************************************************************** EST sequence 29 -strand 565 n (File: SGN-E275667-) 1 GAGACATCTG TGACGGACCG TCGTGCCTGT GACGGTCCGT CGTGGGTTCC GTTGTTTCAG 61 CCAATTTTCC AGAAATAAAA TCTGCTGCTC AAAACGACTA AACAGGTCGT TACAGTAATG 121 AAAAAAAAGA AGAAAGAGAA TAAAGAAAGA AGAAAGAAAG AGAAAAGGGA AGAAGAAGAA 181 AGAGAAAAAG AAAAAGAAAA AGAAAATGAA ATTGATAAAA TAAGAAAAAT AAAAATAAAA 241 ATTAATACGT GGCAGATTAT AATTGATGCG TAATTGAACT TCTTTTTTTG CAAGTGAGGA 301 TGGTTAAAAA ATGAGATATT TACAACACTT TAAAATTATT TAAGGGAGTA ATAAAATGTC 361 CGCTAAGTTA AGATATCTTT TTAATAATTT AAAATAACTT TAATGGTATT TTTATATCTT 421 TTCTCAAATA TTAAACTTTT TTAGAATACA CTCAATTCGC CTCAATGTCT TTTAAAATTT 481 TGACATTATG AATATCGACA TCACATGGTG CATATGCAAA AGTAATCACA TTATTATTGA 541 ATAGATCTAA TCTTTCATTG AGCGC Predicted gene structure (within gDNA segment 9720 to 3311): Exon 1 8534 8421 ( 114 n); cDNA 1 114 ( 114 n); score: 0.846 MATCH C06HBa0153O03.1-2- SGN-E275667- 0.846 114 0.202 C PGS_C06HBa0153O03.1-2-_SGN-E275667- (8534 8421) Alignment (genomic DNA sequence = upper lines): GAGACC-CTG CGACGGTCCG TCGTGCCCAT GACGGTCCGT CATTGGGTTC GTCGCCTCAG 8476 ||||| ||| ||||| ||| ||||||| | |||||||||| | | || | | || | |||| GAGACATCTG TGACGGACCG TCGTGCCTGT GACGGTCCGT CGTGGGTTCC GTTGTTTCAG 60 CCTGTTTTTC CAGAAATAAA ATCTGCTGCT CAAAACGACT AAACAGGTCG TTACA 8421 || ||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||| CC-AATTTTC CAGAAATAAA ATCTGCTGCT CAAAACGACT AAACAGGTCG TTACA 114 hqPGS_C06HBa0153O03.1-2-_SGN-E275667- (8534 8421) ******************************************************************************** EST sequence 140 +strand 545 n (File: SGN-E241959+) 1 TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT 481 CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT CAAAACGACT 541 AAACA Predicted gene structure (within gDNA segment 10064 to 6930): Exon 1 8976 8431 ( 546 n); cDNA 1 545 ( 545 n); score: 0.862 MATCH C06HBa0153O03.1-2- SGN-E241959+ 0.862 546 1.002 C PGS_C06HBa0153O03.1-2-_SGN-E241959+ (8976 8431) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 8919 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 60 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGACAAT AATACCCCAC TAATTAACTT 8859 |||||||||| || ||||||| |||||||||| ||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATG-GAAT AATAACCCAC TAATTTACTC 119 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 8799 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 179 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 8739 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 238 ACTGGGATTG CGCAACCTGT GACGGGCCGT CGTGCCTGGG ACGGTCCGTC CTGCAGGTCG 8679 ||||||||| |||||||||| || || |||| |||||||| | |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 298 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGACGGTCC GTCACACCTG 8620 |||||| ||| ||||||| | | |||||||| | ||| |||| |||||||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 356 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TCGATTTTCT GTACCCAATT 8560 |||||||||| || ||||||| |||| ||||| |||||||||| |||||||| |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 416 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCCGTCGTGC CCATGACGGT 8500 | ||| |||| |||||||||| |||||||||| | |||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 476 CCGTCATTGG GTTCGTCGCC TCAGCCTGTT TTTCCAGAAA TAAAATCTGC TGCTCAAAAC 8440 ||||| | || | ||||| | ||| |||||| |||||| ||| |||||||||| | |||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 536 GACTAAACA 8431 ||||||||| GACTAAACA 545 hqPGS_C06HBa0153O03.1-2-_SGN-E241959+ (8976 8431) ******************************************************************************** EST sequence 43 -strand 660 n (File: SGN-E349296-) 1 AATATTATCA ATACATATTA TTCGCTATTA AGAGCTTACT ACGAATATCG TAAGAGAAAC 61 CATAACCTAC CTCCACCGAA GATTCGTGAT CAAGCAAGTG ATTTCCCAAG CTTTGTGTTT 121 TTTCCTCTCG TTCGATCCTC TTTCTCGTTC GACTTTCTCT CTCTTTCTCT TGTTCTTTCT 181 ATTTTCTTTA TTCAAACCCT CTTTCTTTTA CCCTAATTAG TATATAATTA AGAATAAAAT 241 ATGGCAATAA TAACCCACTA ATTAACTTAA GGTTACCTCT TTTAACCCCC AAGTAATTAG 301 ACTTATTAAC ATTAACCCAC TAACTTTATA ATTAAAGCAG GAATAGTCAA AAACGTCCCT 361 TAAAACAATT GAGGAATTCC GACTCAGACT GGGATTTACG CAGCCTGTGA CAGCCCGTTG 421 TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC GCAAGGTTCA GAGACTGGAT TTTCACTGAA 481 GACTCTGTGA TGGTCCATCA CGCCTGTGAC GGTCCGTCTT GCCATTCCGT TACGAAGTTC 541 AGAGAGTCGA TTTTCAGTAC CCAATTTCAG ATTTCCTAAG TGTTTTGAAA TGAGACCCTG 601 CGACGGTCCG TCGTGCCCAT GATGGTCCGT CGTGGGGTCC GTCATTTCTG CCAGTTTTTC Predicted gene structure (within gDNA segment 11256 to 7524): Exon 1 9121 8466 ( 656 n); cDNA 1 660 ( 660 n); score: 0.828 MATCH C06HBa0153O03.1-2- SGN-E349296- 0.828 656 0.994 C PGS_C06HBa0153O03.1-2-_SGN-E349296- (9121 8466) Alignment (genomic DNA sequence = upper lines): AATACTACTA ACACATATCA TTCGCTATTA AGAGTTTGCT ACGAATAGCA TGA-AATAAC 9063 |||| || | | |||||| | |||||||||| |||| || || ||||||| | | | | ||| AATATTATCA ATACATATTA TTCGCTATTA AGAGCTTACT ACGAATATCG TAAGAGAAAC 60 CATAACCTAC CTCCACTGAA GATTAGTGAT TAAGCAAGAA ATTCCCAAGG CTTT-TGTTC 9004 |||||||||| |||||| ||| |||| ||||| ||||||| ||| || | | |||| |||| CATAACCTAC CTCCACCGAA GATTCGTGAT CAAGCAAGTG ATTTCCCAAG CTTTGTGTTT 120 CTTCTTCTCG TTCGATCCTC CCTC-AATTC GTTTCTCTTT CCCTCTCT-T TGTTCTTTCT 8946 ||| ||||| |||||||||| || ||| | | ||| | | || ||| | |||||||||| TTTCCTCTCG TTCGATCCTC TTTCTCGTTC GACTTTCTCT CTCTTTCTCT TGTTCTTTCT 180 ATTTTC-TTA TTCCAACCCT CTTTCTTTTA CCCTAATTAG TATATAATTA AGAATAAAAG 8887 |||||| ||| ||| |||||| |||||||||| |||||||||| |||||||||| ||||||||| ATTTTCTTTA TTCAAACCCT CTTTCTTTTA CCCTAATTAG TATATAATTA AGAATAAAAT 240 ATGACAATAA TACCCCACTA ATTAACTTAA GGTTACCTCT TTTAACCCCC AAGGATTTTG 8827 ||| |||||| || ||||||| |||||||||| |||||||||| |||||||||| ||| | || | ATGGCAATAA TAACCCACTA ATTAACTTAA GGTTACCTCT TTTAACCCCC AAGTAATTAG 300 AGTTATTAAT ATAAACCCAT GAAATATATA ATCATAGCAG GAATAGTCCA AAACGCCCCT 8767 | ||||||| || |||||| || | |||| || | ||||| |||||||| | ||||| ||| ACTTATTAAC ATTAACCCAC TAACTTTATA ATTAAAGCAG GAATAGTCAA AAACGTCCC- 359 TTAAAACTTA ACCAGAAATC TGACTCCAAC TGGGA-TTGC GCAACCTGTG ACGGGCCGTC 8708 ||||||| ||| || ||||| || ||||| || | ||| |||||| || | |||| TTAAAACAAT TGAGGAATTC CGACTCAGAC TGGGATTTAC GCAGCCTGTG ACAGCCCGTT 419 GTGCCTGGGA CGGTCCGTCC TGCAGGTCGT CGCAAAGTTC AGAGACCCAA TATTTCCACC 8648 ||||||| || |||||||||| |||||||||| ||||| |||| |||||| | | |||| GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT CGCAAGGTTC AGAGACTGGA T-TTTCACTG 478 AAGGGTCTGT GACGGTCCGT CACACCTGTG ACGGTCCGTC CTGCCATTCC GTCACGAAGT 8588 ||| ||||| || ||||| | ||| |||||| |||||||||| ||||||||| || ||||||| AAGACTCTGT GATGGTCCAT CACGCCTGTG ACGGTCCGTC TTGCCATTCC GTTACGAAGT 538 TCAGAGAGTC GATTTTCTGT ACCCAATTTT AGATTTTCTA AGTGTTTTGA AACGAGACCC 8528 |||||||||| ||||||| || ||||||||| |||||| ||| |||||||||| || ||||||| TCAGAGAGTC GATTTTCAGT ACCCAATTTC AGATTTCCTA AGTGTTTTGA AATGAGACCC 598 TGCGACGGTC CGTCGTGCCC ATGACGGTCC GTCATTGGGT TCGTCGCCTC AGCCTGTTTT 8468 |||||||||| |||||||||| |||| ||||| ||| | |||| |||| || ||| ||||| TGCGACGGTC CGTCGTGCCC ATGATGGTCC GTCGTGGGGT CCGTCATTTC TGCCAGTTTT 658 TC 8466 || TC 660 hqPGS_C06HBa0153O03.1-2-_SGN-E349296- (9121 8466) ******************************************************************************** EST sequence 118 +strand 472 n (File: SGN-E236652+) 1 TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GA Predicted gene structure (within gDNA segment 10064 to 7660): Exon 1 8976 8504 ( 473 n); cDNA 1 472 ( 472 n); score: 0.857 MATCH C06HBa0153O03.1-2- SGN-E236652+ 0.857 473 1.002 C PGS_C06HBa0153O03.1-2-_SGN-E236652+ (8976 8504) Alignment (genomic DNA sequence = upper lines): TCGTTTCTCT TTCCCTCTC- TT-TGTTCTT TCTATTTTCT TATTCCAACC CTCTTTCTTT 8919 ||| ||||| ||| ||||| || ||||||| | |||||| ||||| |||| ||| |||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 60 TACCCTAATT AGTATATAAT TAAGAATAAA AGATGACAAT AATACCCCAC TAATTAACTT 8859 |||||||||| || ||||||| |||||||||| ||||| ||| |||| ||||| ||||| ||| TACCCTAATT AGCATATAAT TAAGAATAAA AGATG-GAAT AATAACCCAC TAATTTACTC 119 AAGGTTACCT CTTTTAACCC CCAAGGATTT TGAGTTATTA ATATAAACCC ATGAAATATA 8799 |||||||||| |||||||||| ||| | | || || |||||| | |||||||| | || | || AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 179 TAATCATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAA TCTGACTCCA 8739 |||| | || |||||||||| ||||||| || | ||||||| | ||||| || ||| | TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA TCCGACCCAG 238 ACTGGGATTG CGCAACCTGT GACGGGCCGT CGTGCCTGGG ACGGTCCGTC CTGCAGGTCG 8679 ||||||||| |||||||||| || || |||| |||||||| | |||||||||| |||||||||| ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGTCG 298 TCGCAAAGTT CAGAGACCCA ATATTTCCAC C-AAGGGTCT GTGACGGTCC GTCACACCTG 8620 |||||| ||| ||||||| | | |||||||| | ||| |||| |||||||||| ||||| || | TCGCAAGGTT CAGAGACTC- A-ATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 356 TGACGGTCCG TCCTGCCATT CCGTCACGAA GTTCAGAGAG TCGATTTTCT GTACCCAATT 8560 |||||||||| || ||||||| |||| ||||| |||||||||| |||||||| |||||||||| TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCAATT 416 TTAGATTTTC TAAGTGTTTT GAAACGAGAC CCTGCGACGG TCCGTCGTGC CCATGA 8504 | ||| |||| |||||||||| |||||||||| | |||||| ||| |||||| ||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGA 472 hqPGS_C06HBa0153O03.1-2-_SGN-E236652+ (8976 8504) ******************************************************************************** EST sequence 36 -strand 548 n (File: SGN-E356257-) 1 GTTAACTAGA AAATTAAAGT GATAGAGTCA AATAATGTAA CGACCCGTTT AGTCGTTTTG 61 AGCAGCAGAC TTTATTTCTG GAAAAACTGG CAGAAGCGAC GGACCCCACG ACGGACCGTC 121 ATGGGCACGA CGGACCATCG CAGGGTCTCG TTTCAAAACC CTCTTTCTTT TACCCCAAAT 181 TAACATATAA TTAAGAATAA AAGATGGCAA TAATACCCCA CTAATTAACT TAGGGTTACC 241 TCTTTTAACC CCAAGAATTT GAGTTATTAA TATAAACCCA CGAAATCTAT AATTAAGGAA 301 AGAATAGTCC AAAAACGTCC CTTAAAACGT GTAAGGAAAT CCGATTCTGC CTGGGATTTG 361 CGCAACCTGT GACGGGCCGT CGTGACTGTG ACGGTCCGTC CTGCAGGTCG TCGCAAGGGT 421 CAGAGAGTCA ATTTCCACTG AACAATCTAT GACGGTCCGT CACGCCTGTG ATGGTCCGTC 481 CTGTCATTCC GTCACGAAGT TCAGAGAGTC GATTTTCAGT ACCCAATTTC AGATTTTCTA 541 AGTGTTTT Predicted gene structure (within gDNA segment 11102 to 7940): Exon 1 8932 8540 ( 393 n); cDNA 157 548 ( 392 n); score: 0.858 MATCH C06HBa0153O03.1-2- SGN-E356257- 0.858 393 0.717 C PGS_C06HBa0153O03.1-2-_SGN-E356257- (8932 8540) Alignment (genomic DNA sequence = upper lines): AACCCTCTTT CTTTTACC-C TAATTAGTAT ATAATTAAGA ATAAAAGATG ACAATAATAC 8874 |||||||||| |||||||| | ||||| || |||||||||| |||||||||| ||||||||| AACCCTCTTT CTTTTACCCC AAATTAACAT ATAATTAAGA ATAAAAGATG GCAATAATAC 216 CCCACTAATT AACTTAAGGT TACCTCTTTT AACCCCCAAG GATTTTGAGT TATTAATATA 8814 |||||||||| |||||| ||| |||||||||| || |||||| || ||||||| |||||||||| CCCACTAATT AACTTAGGGT TACCTCTTTT AA-CCCCAA- GAATTTGAGT TATTAATATA 274 AACCCATGAA ATATATAATC ATAGCAGGAA TAGTCC-AAA ACGCCCCTTT AAAACTTAAC 8755 |||||| ||| || |||||| | | | ||| |||||| ||| ||| ||| || ||||| | AACCCACGAA ATCTATAATT AAGGAAAGAA TAGTCCAAAA ACGTCCC-TT AAAACGTGTA 333 CAGAAATCTG ACTCCAACTG GGA-TTGCGC AACCTGTGAC GGGCCGTCGT GCCTGGGACG 8696 |||||| | | || ||| ||| |||||| |||||||||| |||||||||| | ||| |||| AGGAAATCCG ATTCTGCCTG GGATTTGCGC AACCTGTGAC GGGCCGTCGT GACTGTGACG 393 GTCCGTCCTG CAGGTCGTCG CAAAGTTCAG AGACCCAATA TTTCCAC-CA AGGGTCTGTG 8637 |||||||||| |||||||||| ||| | |||| ||| | | | ||||||| | | ||| || GTCCGTCCTG CAGGTCGTCG CAAGGGTCAG AGAGTC-A-A TTTCCACTGA ACAATCTATG 451 ACGGTCCGTC ACACCTGTGA CGGTCCGTCC TGCCATTCCG TCACGAAGTT CAGAGAGTCG 8577 |||||||||| || ||||||| ||||||||| || ||||||| |||||||||| |||||||||| ACGGTCCGTC ACGCCTGTGA TGGTCCGTCC TGTCATTCCG TCACGAAGTT CAGAGAGTCG 511 ATTTTCTGTA CCCAATTTTA GATTTTCTAA GTGTTTT 8540 |||||| ||| |||||||| | |||||||||| ||||||| ATTTTCAGTA CCCAATTTCA GATTTTCTAA GTGTTTT 548 hqPGS_C06HBa0153O03.1-2-_SGN-E356257- (8932 8540) ******************************************************************************** EST sequence 136 +strand 730 n (File: SGN-E546506+) 1 TTTTTTTTTT TTTTTTTTAA TAAAAACAAT TCAATACTAT TATTATTATC CCCAAAATCT 61 GGAAGTCATC ATCACAAGAA CATCTATCTC AAATTACTTA ACTAGGAATG TCTAAGAACA 121 AAATAACTAA AAAGCTAGTC CATGCCGGAA ATTCAAGGCA TCAAGACTTG AAGAAGAAGA 181 CCCAGTCCAA GCTAGACGCA TTAGCTCACC CTGAATTTTC CGATGAAGTG AAGACTGGCT 241 AGATCTACTG TTGAGTTGAA GTTGACGGAA CGTTTGCTGC ATTACACAAA TAACAAAGAG 301 GAAAACATGA AAGTAGGGGT CAGTACAACC ACACGTACTG AGTAGATATC ATCGGCCAAC 361 TCAAAATAGG GAACAGTATA TATCAATAAT AATGTAAATC AACTACAATA CTCAACATGT 421 AGCAATAACA CCATGAATTC ATCAATAACT ACAACCGAGT TCACACATGA GGACTCAAGC 481 CTCAATACCA TACTCATTTG GGAATTAAGT TCATTAGATT GAGTATATTC ATTATCTTTC 541 AAGATTCATT ATCTTTCTTC CTCTTGTGTC GGTACGTGAC ACTCCGATCC TCTATTTCTA 601 TCCTGGTGCC GGAACGTGGC ACTCCGATCC TCATTCTATC CTGGTACCGG AACGTGGCAC 661 CCGATCCATT TTCTATCCTG GTGTCGGAAC GTGACACTCC GATCCTCATA TTCTATCCTG 721 GTACCGGAAC Predicted gene structure (within gDNA segment 11203 to 8650): Exon 1 10113 9563 ( 551 n); cDNA 50 595 ( 546 n); score: 0.808 Intron 1 9562 9524 ( 39 n); Pd: 0.000 (s: 0.88), Pa: 0.000 (s: 0.92) Exon 2 9523 9413 ( 111 n); cDNA 596 705 ( 110 n); score: 0.914 Intron 2 9412 9341 ( 72 n); Pd: 0.900 (s: 0.89), Pa: 0.000 (s: 0) Exon 3 9340 9316 ( 25 n); cDNA 706 730 ( 25 n); score: 0.920 PPA cDNA 18 1 MATCH C06HBa0153O03.1-2- SGN-E546506+ 0.826 687 0.941 C PGS_C06HBa0153O03.1-2-_SGN-E546506+ (10113 9563,9523 9413,9340 9316) Alignment (genomic DNA sequence = upper lines): CCCCAAAATC TGGAAGTCAT CATCACAAGA ACATCTACGA TCAAATGACT AAACTAAGAG 10054 |||||||||| |||||||||| |||||||||| ||||||| |||||| ||| ||||| || CCCCAAAATC TGGAAGTCAT CATCACAAGA ACATCTA-TC TCAAATTACT TAACTAGGAA 108 TATTCTAAAA GCTAAAAATA CATAAGAAGC TAGTCCATGC CGGAAGTTCA AGGCATCAAG 9994 | ||||| | | |||||| ||| |||| |||||||||| ||||| |||| |||||||||| T-GTCTAAGA AC--AAAATA ACTAAAAAGC TAGTCCATGC CGGAAATTCA AGGCATCAAG 165 ACTTGAAGAA GAAGACCCAG TCCAAGCTAG AAGCATTAGC TCACCCTGAA -TATCCGGTA 9935 |||||||||| |||||||||| |||||||||| | |||||||| |||||||||| | |||| | ACTTGAAGAA GAAGACCCAG TCCAAGCTAG ACGCATTAGC TCACCCTGAA TTTTCCGATG 225 TGACGAAGAC TGGCTAGAAT CACTGCTGAG TTGAAGATGA CGGAACGTTT GCTGCACTCC 9875 |||||| |||||||| |||| |||| |||||| ||| |||||||||| |||||| | | AAGTGAAGAC TGGCTAGATC TACTGTTGAG TTGAAGTTGA CGGAACGTTT GCTGCATTAC 285 ACAAATAAC- AAGAAGAAAA CATAAAAGTA GGGGTCAGTA CAAAACACGG GTACTGAGTA 9816 ||||||||| |||| ||||| ||| |||||| |||||||||| | || ||| |||||||||| ACAAATAACA AAGAGGAAAA CATGAAAGTA GGGGTCAGTA C-AACCACAC GTACTGAGTA 344 GATATCATCG GCCAACTCAA AATAGAAAAC AGTATGTATT AAGCAATATC ATAAAATCAA 9756 |||||||||| |||||||||| ||||| ||| ||||| ||| || |||| | ||||||| GATATCATCG GCCAACTCAA AATAGGGAAC AGTATATATC AA-TAATAAT GT-AAATCAA 402 TTAATATCCT TAGCATGCAG CATTTACAGT TACCATAACC CTTGGTTACA AC-ACCAAGC 9697 || || || | |||| || || | ||| | | | | | | || || ||| || CTACAATACT CAACATGTAG CAATAACA-C CATGA-ATTC ATCAATAACT ACAACCGAGT 460 ACATCAATGA GGACTCACAC CTCCTCATCA CACTCATTTG GGAATTTAGT TCATTAGATT 9637 || |||| ||||||| | ||| | || ||||||||| |||||| ||| |||||||||| TCACACATGA GGACTCAAGC CTCAATACCA TACTCATTTG GGAATTAAGT TCATTAGATT 520 GGATATATT- AACATATTTC AAGATTCATT ATCTTTATTC CCCTCGTGTC GGTACATGAC 9578 | |||||| | || |||| |||||||||| |||||| ||| | || ||||| ||||| |||| GAGTATATTC ATTATCTTTC AAGATTCATT ATCTTTCTTC CTCTTGTGTC GGTACGTGAC 580 ACTCCGCTCC TCAATATACT ATCCTGGTGT CGGAACGTGA CACTCTGATC CTCATTCTAT 9518 |||||| ||| || || |||||| ACTCCGATCC TCTAT..... .......... .......... .......... ....TTCTAT 601 CCTGGTGTCG GAACGTGACA CTCCGATCCT CATATACTAT CCTGGTACCG GAACGTGGTA 9458 ||||||| || ||||||| || |||||||||| ||| | |||| |||||||||| |||||||| | CCTGGTGCCG GAACGTGGCA CTCCGATCCT CAT-T-CTAT CCTGGTACCG GAACGTGGCA 659 CCCGATCCAT ATTCTATCCT GGTGTCAGAA CGTGACAC-C CGATCCATAT CCTATCCTGG 9399 |||||||||| ||||||||| |||||| ||| |||||||| | |||||| CCCGATCCAT TTTCTATCCT GGTGTCGGAA CGTGACACTC CGATCC.... .......... 705 TACCGGAACG TGGCACCCGA TCAATATTCT ATCTTGGTGT CGGAACGTGA CACCCGATCC 9339 | .......... .......... .......... .......... .......... ........TC 707 ATATTCTATC CTGGTACCGA AAC 9316 |||||||||| ||||||||| ||| ATATTCTATC CTGGTACCGG AAC 730 hqPGS_C06HBa0153O03.1-2-_SGN-E546506+ (10113 9563,9523 9413,9340 9316) ******************************************************************************** EST sequence 41 -strand 725 n (File: SGN-E546548-) 1 GGTACCGGAA CGTGGCACCC GATCCATATT CTATCCTGGT GTCGGAACGT GACACTCCGA 61 TCCTCATATT CATTCTATCC TGGTACCGGA ACGTGGCACC CGATCCCCTA ATCCATCAAG 121 CCTTCTTTTA CACTAAGGCA TCATCATTCT CATTATATAA TTTATCAAGC CTTCTTTCAT 181 ACTAAGGCAT CATCATTCTC ATTATATAAT ATATCAAGCG AATTAGGGTT CTTTCAAGAT 241 TTGGGATTCA ATTGCTTCAT CATGCTTTGT TAATTCATCG CAATTTCATA ATCATAATCA 301 TGCAAGCATA CAACTTAAGC ACATAGCAGG GTTTACAATA CTATCAACAC ATAATATTCA 361 CTATTAAGAG TTCACTACGA ATATCGTAAC ATAAACCATA ACCTACCTCC ACCGAAGAAT 421 TGAATCAACA AGCTATCTTC TCAAAATCCT TGCTATCCTC TTCGTTTCTC TCTCTCTACT 481 CGTTCGTTTC TCCTCTCTTT CTGTTCTTTT CTTTTGTTTT GTTTTATTCA AACCCTCCTT 541 CTTTTTACCC TAATTAAAAG TATAATTAAG TGTAAAGGAG GACAATAAAA CCCACTAATT 601 AACTTAAGGT TACCTCTTTT AACCCCCAAG TAATTAGACC TATTAATATT AACCCTCAAT 661 CTTTATAATT AAGGAAAGAA TAGTCCAAAA CGACCCCTAA AACGTGTAGA GGAATCCTAT 721 TTTGC Predicted gene structure (within gDNA segment 10404 to 7284): Exon 1 9552 9481 ( 72 n); cDNA 1 71 ( 71 n); score: 0.889 Intron 1 9480 9448 ( 33 n); Pd: 0.000 (s: 0.94), Pa: 0.000 (s: 0) Exon 2 9447 9413 ( 35 n); cDNA 72 106 ( 35 n); score: 0.886 Intron 2 9412 9354 ( 59 n); Pd: 0.900 (s: 0), Pa: 0.000 (s: 0.40) Exon 3 9353 8773 ( 581 n); cDNA 107 691 ( 585 n); score: 0.665 MATCH C06HBa0153O03.1-2- SGN-E546548- 0.690 688 0.949 C PGS_C06HBa0153O03.1-2-_SGN-E546548- (9552 9481,9447 9413,9353 8773) Alignment (genomic DNA sequence = upper lines): GGTGTCGGAA CGTGACACTC TGATCCTCAT TCTATCCTGG TGTCGGAACG TGACACTCCG 9493 ||| ||||| |||| ||| | ||||| || |||||||||| |||||||||| |||||||||| GGTACCGGAA CGTGGCAC-C CGATCCATAT TCTATCCTGG TGTCGGAACG TGACACTCCG 59 ATCCTCATAT ACTATCCTGG TACCGGAACG TGGTACCCGA TCCATATTCT ATCCTGGTGT 9433 |||||||||| | ||||| |||||||| ATCCTCATAT TC........ .......... .......... .....ATTCT ATCCTGGTAC 86 CAGAACGTGA CACCCGATCC ATATCCTATC CTGGTACCGG AACGTGGCAC CCGATCAATA 9373 | ||||||| |||||||||| CGGAACGTGG CACCCGATCC .......... .......... .......... .......... 106 TTCTATCTTG GTGTCGGAAC GTGA-CACCC GATCCATATT CTATCCTGGT ACCGAAACGT 9314 | | | | | | || | || || || | | | | .......... .........C CTAATCCATC AAGCCTTCTT TTACACTAAG GCATCATCAT 147 GGCACCGGAT CCCCTAATCT CATCACTTTC GTTCAT-C-A AGCCTTCTTT TATACCAAGG 9256 | | || | | | || | ||| ||||| | | || | || | | || -TCTCATTAT ATAAT-TTAT CA-AGCCTTC TTTCATACTA AGGCATCATC ATTCTCATTA 204 CATCATCATT AACAAAGTAG ATTAGGGTTT CTTTTCAAGA TTTGGGATTC AATGGCTTCA 9196 || || | | | ||| | ||||||| || | |||||||| |||||||||| ||| |||||| TATAAT-A-T ATCAAGCGA- ATTAGGG-TT C-TTTCAAGA TTTGGGATTC AATTGCTTCA 259 TCATACTTAT TT-ATTCA-C --AATTACAT AATCACATCA TTCATGCAAG CATACAA-TT 9141 |||| ||| || ||||| | |||| ||| ||||| | | ||||||||| ||||||| || TCATGCTTTG TTAATTCATC GCAATTTCAT AATCATA--A -TCATGCAAG CATACAACTT 316 AAGCATATAG -AAGGTTTAC AATACTACTA ACACATATCA TTCGCTATTA AGAGTTTGCT 9082 ||||| |||| | ||||||| ||||||| | ||||||| | ||| |||||| |||||| || AAGCACATAG CAGGGTTTAC AATACTATCA ACACATAATA TTCACTATTA AGAGTTCACT 376 ACGAATAGCA TGAAAT-AAC CATAACCTAC CTCCACTGAA G-ATT-AGTG ATTAAGCAAG 9025 ||||||| | | | || ||| |||||||||| |||||| ||| | ||| | | | |||| | ACGAATATCG TAACATAAAC CATAACCTAC CTCCACCGAA GAATTGAATC AACAAGCTAT 436 AAATTCCCAA GGCTTTTGTT CCTTCTTCTC GTTCGATC-C TC-CCTCAAT TCGTTTCT-C 8968 || || ||| | || ||||| ||| || | || ||| | |||||||| | CTTCTCAAAA TCCTTGCTAT CC-TCTTC-G TTTCTCTCTC TCTACTC-GT TCGTTTCTCC 493 TTTCCCTCTC TT-TGTTC-T TTCTATTTTC TTATTCCAAC CCTCTTTC-T TTTACCCTAA 8911 | || ||| || | ||| | || | || | |||||| ||| |||| ||| | |||||||||| TCTCTTTCTG TTCTTTTCTT TTGTTTTGTT TTATTCAAAC CCTCCTTCTT TTTACCCTAA 553 TTAGTA-TAT AATTAAGAAT AAAAGATGAC AATAATACCC CACTAATTAA CTTAAGGTTA 8852 ||| | ||| ||||||| | ||| || ||| ||||| | || |||||||||| |||||||||| TTAAAAGTAT AATTAAGTGT AAAGGAGGAC AATAA-AACC CACTAATTAA CTTAAGGTTA 612 CCTCTTTTAA CCCCCAAGGA TTTTGAGTTA TTAATATAAA CCCATGAAAT ATATAATCAT 8792 |||||||||| |||||||| | || || || ||||||| || ||| | | |||||| | CCTCTTTTAA CCCCCAAGTA ATTAGACCTA TTAATATTAA CCCTCAATCT TTATAATTAA 672 AGCAGGAATA GTCCAAAAC 8773 | | ||||| ||||||||| GGAAAGAATA GTCCAAAAC 691 hqPGS_C06HBa0153O03.1-2-_SGN-E546548- (9552 9481,9447 9413) ******************************************************************************** EST sequence 32 -strand 481 n (File: SGN-E246710-) 1 AGTCCAAGCT AGAAGCATTA GCTCACCCTG AATTTCCGAT GTAGTAAGAC TGGCTTGAAT 61 TACTGTTGAG TTGAACACGA TGGCACGTTT GCTGCACTCC ACAAATAAAC AAGAAGAGAA 121 CATAAAAGTA GGGGTCAGTA CAAAACACGG GTACTGAGTA GATATCATCG GCCAACTCAA 181 AATAGAAATC AATATATATA CCAAGTAATA TCATAAAATC AACTATGATA CTCAACATGT 241 AGCAACAACA AATACTATAT CATTAACAAT TACCGTCAAG TTCACACATG AGGACTCAAG 301 CCTCAATACC ATACTCATTT GGGAATCATG TTCATTAGAT TGAGTATATT AACATCTTTC 361 AAGATTCATT ATCTTTATTT CTCTTGTGTC GGTACGTGAC ACTCCGCTCC CTCAATATTC 421 ATTAATCCTC TTGTGTCGGT ACGTGACACT CCGATCCCCT AAATCTATAT GTCGGTTTGT 481 G Predicted gene structure (within gDNA segment 11340 to 6566): Exon 1 9975 9568 ( 408 n); cDNA 1 410 ( 410 n); score: 0.817 MATCH C06HBa0153O03.1-2- SGN-E246710- 0.817 408 0.848 C PGS_C06HBa0153O03.1-2-_SGN-E246710- (9975 9568) Alignment (genomic DNA sequence = upper lines): AGTCCAAGCT AGAAGCATTA GCTCACCCTG AATATCCGGT ATGACGAAGA CTGGCTAGAA 9916 |||||||||| |||||||||| |||||||||| ||| |||| | | | |||| |||||| ||| AGTCCAAGCT AGAAGCATTA GCTCACCCTG AATTTCCGAT GT-AGTAAGA CTGGCTTGAA 59 TCACTGCTGA GTTGAAGATG ACGGAACGTT TGCTGCACTC CACAAAT-AA CAAGAAGAAA 9857 | |||| ||| |||||| | | | || ||||| |||||||||| ||||||| || |||||||| | TTACTGTTGA GTTGAACACG ATGGCACGTT TGCTGCACTC CACAAATAAA CAAGAAGAGA 119 ACATAAAAGT AGGGGTCAGT ACAAAACACG GGTACTGAGT AGATATCATC GGCCAACTCA 9797 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACATAAAAGT AGGGGTCAGT ACAAAACACG GGTACTGAGT AGATATCATC GGCCAACTCA 179 AAATAGAAAA CAGTATGTAT --TAAGCAAT ATCATAAAAT CAATTAATAT CCTTAGCATG 9739 ||||||||| || ||| ||| ||| ||| |||||||||| ||| || || || | |||| AAATAGAAAT CAATATATAT ACCAAGTAAT ATCATAAAAT CAACTATGAT ACTCAACATG 239 CAGCATTTAC AGTTACCATA ACCCTTGGTT ACAACACCAA GCACATCAAT GAGGACTCAC 9679 |||| || | ||| ||| | | | | ||| | || || ||||||||| TAGCAACAAC AAATACTATA TCATTAACAA TTACCGTCAA GTTCACACAT GAGGACTCAA 299 ACCTCCTCAT CACACTCATT TGGGAATTTA GTTCATTAGA TTGGATATAT TAACATATTT 9619 |||| | || ||||||| ||||||| |||||||||| ||| ||||| |||||| ||| GCCTCAATAC CATACTCATT TGGGAATCAT GTTCATTAGA TTGAGTATAT TAACATCTTT 359 CAAGATTCAT TATCTTTATT CCCCTCGTGT CGGTACATGA CACTCCGCTC C 9568 |||||||||| |||||||||| | || |||| |||||| ||| |||||||||| | CAAGATTCAT TATCTTTATT TCTCTTGTGT CGGTACGTGA CACTCCGCTC C 410 hqPGS_C06HBa0153O03.1-2-_SGN-E246710- (9975 9568) ******************************************************************************** EST sequence 45 -strand 236 n (File: SGN-E209683-) 1 CACAAATAAC AAGAAGATAA ACATAAAAGT AGGGGTCAGT ACAAACCACG GGTACTGAGT 61 AGATATCATC GGCCAACTCA AAATAGGGAA CAGTATGTAT TAAGCAATAT CATAAAATCA 121 ACTAATATCC TTAACATGCA GCATTTATAG TTACCATAAC CCTTGGTTAC AACACCAAGC 181 ACATCAATGA GGACTCACAC CTCCTCATCA TACTCATTTG GGAATTTAGT TCATTA Predicted gene structure (within gDNA segment 10485 to 9041): Exon 1 9875 9641 ( 235 n); cDNA 1 236 ( 236 n); score: 0.960 MATCH C06HBa0153O03.1-2- SGN-E209683- 0.960 235 0.996 C PGS_C06HBa0153O03.1-2-_SGN-E209683- (9875 9641) Alignment (genomic DNA sequence = upper lines): CACAAATAAC AAGAAGA-AA ACATAAAAGT AGGGGTCAGT ACAAAACACG GGTACTGAGT 9817 |||||||||| ||||||| || |||||||||| |||||||||| ||||| |||| |||||||||| CACAAATAAC AAGAAGATAA ACATAAAAGT AGGGGTCAGT ACAAACCACG GGTACTGAGT 60 AGATATCATC GGCCAACTCA AAATAGAAAA CAGTATGTAT TAAGCAATAT CATAAAATCA 9757 |||||||||| |||||||||| |||||| || |||||||||| |||||||||| |||||||||| AGATATCATC GGCCAACTCA AAATAGGGAA CAGTATGTAT TAAGCAATAT CATAAAATCA 120 ATTAATATCC TTAGCATGCA GCATTTACAG TTACCATAAC CCTTGGTTAC AACACCAAGC 9697 | |||||||| ||| |||||| ||||||| || |||||||||| |||||||||| |||||||||| ACTAATATCC TTAACATGCA GCATTTATAG TTACCATAAC CCTTGGTTAC AACACCAAGC 180 ACATCAATGA GGACTCACAC CTCCTCATCA CACTCATTTG GGAATTTAGT TCATTA 9641 |||||||||| |||||||||| |||||||||| ||||||||| |||||||||| |||||| ACATCAATGA GGACTCACAC CTCCTCATCA TACTCATTTG GGAATTTAGT TCATTA 236 hqPGS_C06HBa0153O03.1-2-_SGN-E209683- (9875 9641) ******************************************************************************** EST sequence 15 -strand 729 n (File: SGN-E351546-) 1 AGTCGTTGCT CTAGTTCTAC CCATCTGGCA AGAGAGTGAG NATGGTCAGA TACCAATTCG 61 TATCGCTTAG ATACCAATTG ACTCGAAGTA GTAGCACGAA AGAAAGAATG AAAGAGTGAA 121 GTTTTCCTAA AGTCTTATAG CCTCTCAAGG AAAAGTAAAA GCGTCCCCCT ACCGTTCCTT 181 AAGACTCTAC TAGACTTGTT CTTGTGTGAT GAGACCAACG AACCTAATGC TCTGATACCA 241 AGTTTTGTCA CGACCCAAAT CCGGGCCGCC ACTGGCACCC ACACTTACCC TCCTATGTGA 301 GCGAACCAAC CAATCTAAAC CTTAACATTT CAATGTAATA GCAACAGAAA GTAATGCGGA 361 AGACTTAAAC TCATTAATAA AATCAATAAC TACTATTATT AAACATCTAT TATTCCCAAA 421 ACCTGGAAGT CATCATCACA AGAACATCTA CTTTAAACTA CTAATTCTAA GAGTTTCTAA 481 GAAGCTAAAA AATTACATAA GAAGCTAGTC CATGCCGGAA GTTCAAGGCA TCAAGACATG 541 AAGGAGAAGA TCCAGTCCAA GCTAGAAGCG TTAGCTCACC CTGAAGATCC GGTGTGACGA 601 AGACTGGCTT GAGTTACTGT TGAGTCGAAG ATGACGGCAC GTTTGCTGCA CTCCACAAAT 661 ACCAAGAAGA AAAACATAAA AGTAGGGGTC AGTACAAAAC ACGGCTACTG AGTAGATATC 721 ATCGGCCAA Predicted gene structure (within gDNA segment 11633 to 9201): Exon 1 10290 9801 ( 490 n); cDNA 246 729 ( 484 n); score: 0.889 MATCH C06HBa0153O03.1-2- SGN-E351546- 0.889 490 0.672 C PGS_C06HBa0153O03.1-2-_SGN-E351546- (10290 9801) Alignment (genomic DNA sequence = upper lines): TGTCACGACC CAAATCCGGG CCGCGTCTGG CACCCACACT TACCCTCCTA TGTGAGCGAA 10231 |||||||||| |||||||||| |||| |||| |||||||||| |||||||||| |||||||||| TGTCACGACC CAAATCCGGG CCGCCACTGG CACCCACACT TACCCTCCTA TGTGAGCGAA 305 CCAACCAATC TAAACCTTAA CATTTCAATA TAATATAACC AGAAAGTAAT GCGGAAGACT 10171 |||||||||| |||||||||| ||||||||| ||||| | | |||||||||| |||||||||| CCAACCAATC TAAACCTTAA CATTTCAATG TAATAGCAAC AGAAAGTAAT GCGGAAGACT 365 TAAACTCATT AAATAAAGAC CAATTCATTA ACTTCTAAAA TTCAACATCT ATTATTCCCC 10111 |||||||||| ||| | | || ||| || ||| ||| | || ||||||| |||||| || TAAACTCATT -AAT--A-A- -AA-TCAATA ACTACTATTA TTAAACATCT ATTATT--CC 416 CAAAATCTGG AAGTCATCAT CACAAGAACA TCTACGATCA AATGACTAA- ACTAAGAGTA 10052 ||||| |||| |||||||||| |||||||||| ||||| | | || ||||| |||||||| CAAAACCTGG AAGTCATCAT CACAAGAACA TCTAC-TTTA AACTACTAAT TCTAAGAGT- 474 TTCTAA-AAG CT-AAAAA-T ACATAAGAAG CTAGTCCATG CCGGAAGTTC AAGGCATCAA 9995 |||||| ||| || ||||| | |||||||||| |||||||||| |||||||||| |||||||||| TTCTAAGAAG CTAAAAAATT ACATAAGAAG CTAGTCCATG CCGGAAGTTC AAGGCATCAA 534 GACTTGAAGA AGAAGACCCA GTCCAAGCTA GAAGCATTAG CTCACCCTGA ATATCCGGTA 9935 ||| ||||| |||||| ||| |||||||||| ||||| |||| |||||||||| | ||||||| GACATGAAGG AGAAGATCCA GTCCAAGCTA GAAGCGTTAG CTCACCCTGA AGATCCGGTG 594 TGACGAAGAC TGGCTAGAAT CACTGCTGAG TTGAAGATGA CGGAACGTTT GCTGCACTCC 9875 |||||||||| ||||| || | |||| |||| | |||||||| ||| |||||| |||||||||| TGACGAAGAC TGGCTTGAGT TACTGTTGAG TCGAAGATGA CGGCACGTTT GCTGCACTCC 654 ACAAATAACA AGAAG-AAAA CATAAAAGTA GGGGTCAGTA CAAAACACGG GTACTGAGTA 9816 ||||||| || ||||| |||| |||||||||| |||||||||| |||||||||| ||||||||| ACAAATACCA AGAAGAAAAA CATAAAAGTA GGGGTCAGTA CAAAACACGG CTACTGAGTA 714 GATATCATCG GCCAA 9801 |||||||||| ||||| GATATCATCG GCCAA 729 hqPGS_C06HBa0153O03.1-2-_SGN-E351546- (10290 9801) ******************************************************************************** EST sequence 34 -strand 580 n (File: SGN-E356206-) 1 GAAAAGTAAA AGCGTCCCCN TACCGTCCCT TAAGACTCTA CTAGACTTGT TCTTGTGTGA 61 TGAGACCAAC GACCCTAATG CTCTGATACC AAGTTTTGTC ACGACCCAAA TCCGGGCCGC 121 CACTGGCACC CACACTTACC CTCCTATGTG AGCGAACCAA CCAATCTAAA CCTTAACATT 181 TCAATGTAAT AGCAACAGAA AGTAATGCGG AAGACTTAAA CTCATTAATA AAATCAATAA 241 CTACTATTAT TAAACATCTA TTATTCCCAA AACCTGGAAG TCATCATCAC AAGAACATCT 301 ACTTTAAACT ACTAATTCTA AGAGTTTCTA AGAAGCTAAA AAATTACATA AGAAGCTAGT 361 CCATGCCGGA AGTTCAAGGC ATCAAGACAT GAAGGAGAAG ATCCAGTCCA AGCTAGAAGC 421 GTTAGCTCAC CCTGAAGATC CGGTGTGACG AAGACTGGCT TGAGTTACTG TTGAGTCGAA 481 GATGACGGCA CGTTTGCTGC ACTCCACAAA TAACAAGAAG AAAAACATAA AAGTAGGGGT 541 CAGTACAAAA CACGGCTACT GAGTAGATAT CATCGGCCAA Predicted gene structure (within gDNA segment 11633 to 9201): Exon 1 10290 9801 ( 490 n); cDNA 97 580 ( 484 n); score: 0.891 MATCH C06HBa0153O03.1-2- SGN-E356206- 0.891 490 0.845 C PGS_C06HBa0153O03.1-2-_SGN-E356206- (10290 9801) Alignment (genomic DNA sequence = upper lines): TGTCACGACC CAAATCCGGG CCGCGTCTGG CACCCACACT TACCCTCCTA TGTGAGCGAA 10231 |||||||||| |||||||||| |||| |||| |||||||||| |||||||||| |||||||||| TGTCACGACC CAAATCCGGG CCGCCACTGG CACCCACACT TACCCTCCTA TGTGAGCGAA 156 CCAACCAATC TAAACCTTAA CATTTCAATA TAATATAACC AGAAAGTAAT GCGGAAGACT 10171 |||||||||| |||||||||| ||||||||| ||||| | | |||||||||| |||||||||| CCAACCAATC TAAACCTTAA CATTTCAATG TAATAGCAAC AGAAAGTAAT GCGGAAGACT 216 TAAACTCATT AAATAAAGAC CAATTCATTA ACTTCTAAAA TTCAACATCT ATTATTCCCC 10111 |||||||||| ||| | | || ||| || ||| ||| | || ||||||| |||||| || TAAACTCATT -AAT--A-A- -AA-TCAATA ACTACTATTA TTAAACATCT ATTATT--CC 267 CAAAATCTGG AAGTCATCAT CACAAGAACA TCTACGATCA AATGACTAA- ACTAAGAGTA 10052 ||||| |||| |||||||||| |||||||||| ||||| | | || ||||| |||||||| CAAAACCTGG AAGTCATCAT CACAAGAACA TCTAC-TTTA AACTACTAAT TCTAAGAGT- 325 TTCTAA-AAG CT-AAAAA-T ACATAAGAAG CTAGTCCATG CCGGAAGTTC AAGGCATCAA 9995 |||||| ||| || ||||| | |||||||||| |||||||||| |||||||||| |||||||||| TTCTAAGAAG CTAAAAAATT ACATAAGAAG CTAGTCCATG CCGGAAGTTC AAGGCATCAA 385 GACTTGAAGA AGAAGACCCA GTCCAAGCTA GAAGCATTAG CTCACCCTGA ATATCCGGTA 9935 ||| ||||| |||||| ||| |||||||||| ||||| |||| |||||||||| | ||||||| GACATGAAGG AGAAGATCCA GTCCAAGCTA GAAGCGTTAG CTCACCCTGA AGATCCGGTG 445 TGACGAAGAC TGGCTAGAAT CACTGCTGAG TTGAAGATGA CGGAACGTTT GCTGCACTCC 9875 |||||||||| ||||| || | |||| |||| | |||||||| ||| |||||| |||||||||| TGACGAAGAC TGGCTTGAGT TACTGTTGAG TCGAAGATGA CGGCACGTTT GCTGCACTCC 505 ACAAATAACA AGAAG-AAAA CATAAAAGTA GGGGTCAGTA CAAAACACGG GTACTGAGTA 9816 |||||||||| ||||| |||| |||||||||| |||||||||| |||||||||| ||||||||| ACAAATAACA AGAAGAAAAA CATAAAAGTA GGGGTCAGTA CAAAACACGG CTACTGAGTA 565 GATATCATCG GCCAA 9801 |||||||||| ||||| GATATCATCG GCCAA 580 hqPGS_C06HBa0153O03.1-2-_SGN-E356206- (10290 9801) ******************************************************************************** EST sequence 38 -strand 655 n (File: SGN-E356696-) 1 CAATTGGACT CAAGTAGTAG CACGAAAGAA AGAATGAAAG AGTGAAGTTT TCCTAAAGTC 61 TTATAGCCTC TCAAGGAAAA GTAAAAGCGT CCCCCTACCG TTCCTTAAGA CTCTACTAGA 121 CTTGTTCTTG TGTGATGAGA CCAACGAACC TAATGCTCTG ATACCAAGTT TTGTCACGAC 181 CCAAATCCGG GCCGCCACTG GCACCCACAC TTACCCTCNT ATGTGAGCGA ACCAACCAAT 241 CTAAACCTTA ACATTTCAAT GTAATAGCAA CAGAAAGTAA TGCGGAAGAC TTAAACTCAT 301 TAATAAAATC AATAACTACT ATTATTAAAC ATCTATTATT CCCAAAACCT GGAAGTCATC 361 ATCACAAGAA CATCTACTTT AAACTACTAA TTCTAAGAGT TTCTAAGAAG CTAAAAAATT 421 ACATAAGAAG CTAGTCCATG CCGGAAGTTC AAGGCATCAA GACATGAAGG AGAAGATCCA 481 GTCCAAGCTA GAAGCGTTAG CTCACCCTGA AGATCCGGTG TGACGAAGAC TGGCTTGAGT 541 TACTGTTGAG TCGAAGATGA CGGCACGTTT GCTGCACTCC ACAAATAACA AGAAGAAAAA 601 CATAAAAGTA GGGGTCAGTA CAAAACACGG CTACTGAGTA GATATCATCG GCCAA Predicted gene structure (within gDNA segment 11633 to 9201): Exon 1 10290 9801 ( 490 n); cDNA 172 655 ( 484 n); score: 0.889 MATCH C06HBa0153O03.1-2- SGN-E356696- 0.889 490 0.748 C PGS_C06HBa0153O03.1-2-_SGN-E356696- (10290 9801) Alignment (genomic DNA sequence = upper lines): TGTCACGACC CAAATCCGGG CCGCGTCTGG CACCCACACT TACCCTCCTA TGTGAGCGAA 10231 |||||||||| |||||||||| |||| |||| |||||||||| ||||||| || |||||||||| TGTCACGACC CAAATCCGGG CCGCCACTGG CACCCACACT TACCCTCNTA TGTGAGCGAA 231 CCAACCAATC TAAACCTTAA CATTTCAATA TAATATAACC AGAAAGTAAT GCGGAAGACT 10171 |||||||||| |||||||||| ||||||||| ||||| | | |||||||||| |||||||||| CCAACCAATC TAAACCTTAA CATTTCAATG TAATAGCAAC AGAAAGTAAT GCGGAAGACT 291 TAAACTCATT AAATAAAGAC CAATTCATTA ACTTCTAAAA TTCAACATCT ATTATTCCCC 10111 |||||||||| ||| | | || ||| || ||| ||| | || ||||||| |||||| || TAAACTCATT -AAT--A-A- -AA-TCAATA ACTACTATTA TTAAACATCT ATTATT--CC 342 CAAAATCTGG AAGTCATCAT CACAAGAACA TCTACGATCA AATGACTAA- ACTAAGAGTA 10052 ||||| |||| |||||||||| |||||||||| ||||| | | || ||||| |||||||| CAAAACCTGG AAGTCATCAT CACAAGAACA TCTAC-TTTA AACTACTAAT TCTAAGAGT- 400 TTCTAA-AAG CT-AAAAA-T ACATAAGAAG CTAGTCCATG CCGGAAGTTC AAGGCATCAA 9995 |||||| ||| || ||||| | |||||||||| |||||||||| |||||||||| |||||||||| TTCTAAGAAG CTAAAAAATT ACATAAGAAG CTAGTCCATG CCGGAAGTTC AAGGCATCAA 460 GACTTGAAGA AGAAGACCCA GTCCAAGCTA GAAGCATTAG CTCACCCTGA ATATCCGGTA 9935 ||| ||||| |||||| ||| |||||||||| ||||| |||| |||||||||| | ||||||| GACATGAAGG AGAAGATCCA GTCCAAGCTA GAAGCGTTAG CTCACCCTGA AGATCCGGTG 520 TGACGAAGAC TGGCTAGAAT CACTGCTGAG TTGAAGATGA CGGAACGTTT GCTGCACTCC 9875 |||||||||| ||||| || | |||| |||| | |||||||| ||| |||||| |||||||||| TGACGAAGAC TGGCTTGAGT TACTGTTGAG TCGAAGATGA CGGCACGTTT GCTGCACTCC 580 ACAAATAACA AGAAG-AAAA CATAAAAGTA GGGGTCAGTA CAAAACACGG GTACTGAGTA 9816 |||||||||| ||||| |||| |||||||||| |||||||||| |||||||||| ||||||||| ACAAATAACA AGAAGAAAAA CATAAAAGTA GGGGTCAGTA CAAAACACGG CTACTGAGTA 640 GATATCATCG GCCAA 9801 |||||||||| ||||| GATATCATCG GCCAA 655 hqPGS_C06HBa0153O03.1-2-_SGN-E356696- (10290 9801) ******************************************************************************** EST sequence 53 +strand 434 n (File: SGN-E222578+) 1 TTTTTTTTTT TTTTTTTTTA ATAAAAACCA ATTCAATAAC TATCAATATT CAACATCTAT 61 TATTCCCAAA ACCTGGAAGT CATCATCACA AGAACATCTA CTTTAAACTA CTAATTCTAA 121 GAGTTTCTAA AAGCTAAAAA TACATAAGAA GCTAGTCCAT GCCGGAGGTT CAAGGCATCA 181 AGACATGAAG GAGAAGATCC AGTCCAAGCT AGACGCGTTA GCTCACCCTG AAGATCCGGT 241 GTGACGAAGA CTGGCTTGAG TTACTGTTGA GTCGAAGATG ACGGCACGTT TGCTGCACTC 301 CACAACTTTC TAGATGGGGA CTTTCTTCAA GGCTTCGAGA TGGAAACTTG CTTGCAGAGC 361 TTCGAGTGTT ACCAGCTTCA AGATGGAGTT TCAGTGATGA GGCTTGCTAG TCTCGAGTTT 421 TTTTTTTTTT TTTT Predicted gene structure (within gDNA segment 11201 to 6882): Exon 1 10152 9871 ( 282 n); cDNA 27 305 ( 279 n); score: 0.888 PPA cDNA 19 1 MATCH C06HBa0153O03.1-2- SGN-E222578+ 0.888 282 0.650 C PGS_C06HBa0153O03.1-2-_SGN-E222578+ (10152 9871) Alignment (genomic DNA sequence = upper lines): ACCAATTCAT TAACTTCTAA AATTCAACAT CTATTATTCC CCCAAAATCT GGAAGTCATC 10093 ||||||||| ||||| || ||||||||| |||||||||| | |||| || |||||||||| ACCAATTCAA TAACTATCAA TATTCAACAT CTATTATTCC -C-AAAACCT GGAAGTCATC 84 ATCACAAGAA CATCTACGAT CAAATGACTA A-ACTAAGAG TATTCTAAAA GCTAAAAATA 10034 |||||||||| ||||||| | ||| |||| | ||||||| | |||||||| |||||||||| ATCACAAGAA CATCTAC-TT TAAACTACTA ATTCTAAGAG T-TTCTAAAA GCTAAAAATA 142 CATAAGAAGC TAGTCCATGC CGGAAGTTCA AGGCATCAAG ACTTGAAGAA GAAGACCCAG 9974 |||||||||| |||||||||| |||| ||||| |||||||||| || ||||| | ||||| |||| CATAAGAAGC TAGTCCATGC CGGAGGTTCA AGGCATCAAG ACATGAAGGA GAAGATCCAG 202 TCCAAGCTAG AAGCATTAGC TCACCCTGAA TATCCGGTAT GACGAAGACT GGCTAGAATC 9914 |||||||||| | || ||||| |||||||||| ||||||| | |||||||||| |||| || | TCCAAGCTAG ACGCGTTAGC TCACCCTGAA GATCCGGTGT GACGAAGACT GGCTTGAGTT 262 ACTGCTGAGT TGAAGATGAC GGAACGTTTG CTGCACTCCA CAA 9871 |||| ||||| ||||||||| || ||||||| |||||||||| ||| ACTGTTGAGT CGAAGATGAC GGCACGTTTG CTGCACTCCA CAA 305 hqPGS_C06HBa0153O03.1-2-_SGN-E222578+ (10152 9871) ******************************************************************************** EST sequence 51 +strand 679 n (File: SGN-E370357+) 1 TTTTTTTTTT CTTACAATTA TATTATGAAT TCGATAATCT TTAATGTCAC GACCCAAATC 61 GAGCCGCAAG TGGCACCCAC ACTTACCCTC CTATGTGAGC GAACCAACCA ATACAAAATC 121 CAACATTTCA ATATAATGAC GGAATATAAT GCGGAAGACT TAAACTCATT AATGAAAATC 181 AATTAAATAA CTTCTAAAAA CTCAACAACT ATTATTATCC CCAAAATCTG GAAGTCATCA 241 TCACAAGAAC ATCTATCCTC AAATTACTAA TTCTAAGAGT ATCTAGAAAG CTAGAATAAC 301 TAAAAAGCTA GTCCATGCCG GAACTTCAAG GCATCAAGAC ATGAAGAAGA AGATCCAGTC 361 CAAGCTAGAA GCGTTAGCTC ACACTGAAAT CCGGTATAAT GAAGACTGGC TAGAGTTGCG 421 GTTGAGTTGA AGACGACGGT ACGTTTGCTT TATTCGAGTG TCAATTAATC ATTCGGCTGT 481 CACCCAAATA TTATTGATTG ATTACACCTC TGCCATTTGT AAAATTTTTC AAATTTGCCT 541 ACGGATGCAG AATTTTCCTC GAATTTCTGA TGTGTTTTCT TGTAAATAGT GGCCATTTGT 601 GTAAGTAAAT GCCCATTTCT CCTCCTACAA AGTCCAATTC CATTTTTCCC CCAATCCACC 661 ATGGCAACAC CACCTCCAA Predicted gene structure (within gDNA segment 11321 to 6280): Exon 1 10291 9874 ( 418 n); cDNA 44 457 ( 414 n); score: 0.840 Intron 1 9873 7846 (2028 n); Pd: 0.000 (s: 0.78), Pa: 0.776 (s: 0) Exon 2 7845 7816 ( 30 n); cDNA 458 487 ( 30 n); score: 0.567 PPA cDNA 13 1 MATCH C06HBa0153O03.1-2- SGN-E370357+ 0.840 448 0.660 C PGS_C06HBa0153O03.1-2-_SGN-E370357+ (10291 9874,7845 7816) Alignment (genomic DNA sequence = upper lines): ATGTCACGAC CCAAATCCGG GCCGCGTCTG GCACCCACAC TTACCCTCCT ATGTGAGCGA 10232 |||||||||| |||||| || ||||| || |||||||||| |||||||||| |||||||||| ATGTCACGAC CCAAAT-CGA GCCGCAAGTG GCACCCACAC TTACCCTCCT ATGTGAGCGA 102 ACCAACCAAT CTAAACCTTA ACATTTCAAT ATAATATAAC CAGAAAGTAA TGCGGAAGAC 10172 |||||||||| ||| | |||||||||| || | || | | ||| ||| |||||||||| ACCAACCAAT ACAAAATCCA ACATTTCAAT AT-A-ATGA- CGGAATATAA TGCGGAAGAC 159 TTAAACTCAT TAAATAAAGA CCAATTCATT AACTTCT-AA AATTCAACAT CTATTATT-C 10114 |||||||||| | ||| || | ||||| | | ||||||| || || |||||| |||||||| TTAAACTCAT T-AATGAAAA TCAATTAAAT AACTTCTAAA AACTCAACAA CTATTATTAT 218 CCCCAAAATC TGGAAGTCAT CATCACAAGA ACATCTACGA TCAAATGACT AA-ACTAAGA 10055 |||||||||| |||||||||| |||||||||| ||||||| |||||| ||| || |||||| CCCCAAAATC TGGAAGTCAT CATCACAAGA ACATCTATCC TCAAATTACT AATTCTAAGA 278 GTATTCTA-A AAGCTAAAAA TACATAAGAA GCTAGTCCAT GCCGGAAGTT CAAGGCATCA 9996 ||| |||| | ||||| | || || ||| || |||||||||| ||||||| || |||||||||| GTA-TCTAGA AAGCT-AGAA TAACTAAAAA GCTAGTCCAT GCCGGAACTT CAAGGCATCA 336 AGACTTGAAG AAGAAGACCC AGTCCAAGCT AGAAGCATTA GCTCACCCTG AATATCCGGT 9936 |||| ||||| ||||||| || |||||||||| |||||| ||| |||||| ||| || ||||||| AGACATGAAG AAGAAGATCC AGTCCAAGCT AGAAGCGTTA GCTCACACTG AA-ATCCGGT 395 ATGACGAAGA CTGGCTAGAA TCACTGCTGA GTTGAAGATG ACGGAACGTT TGCTGCACTC 9876 || | ||||| ||||||||| | | | ||| |||||||| | |||| ||||| |||| | || ATAATGAAGA CTGGCTAGAG TTGCGGTTGA GTTGAAGACG ACGGTACGTT TGCTTTATTC 455 CACAAATAAC AAGAAGAAAA CATAAAAGTA GGGGTCAGTA CAAAACACGG GTACTGAGTA 9816 | GA........ .......... .......... .......... .......... .......... 457 GATATCATCG GCCAACTCAA AATAGAAAAC AGTATGTATT AAGCAATATC ATAAAATCAA 9756 .......... .......... .......... .......... .......... .......... 457 TTAATATCCT TAGCATGCAG CATTTACAGT TACCATAACC CTTGGTTACA ACACCAAGCA 9696 .......... .......... .......... .......... .......... .......... 457 CATCAATGAG GACTCACACC TCCTCATCAC ACTCATTTGG GAATTTAGTT CATTAGATTG 9636 .......... .......... .......... .......... .......... .......... 457 GATATATTAA CATATTTCAA GATTCATTAT CTTTATTCCC CTCGTGTCGG TACATGACAC 9576 .......... .......... .......... .......... .......... .......... 457 TCCGCTCCTC AATATACTAT CCTGGTGTCG GAACGTGACA CTCTGATCCT CATTCTATCC 9516 .......... .......... .......... .......... .......... .......... 457 TGGTGTCGGA ACGTGACACT CCGATCCTCA TATACTATCC TGGTACCGGA ACGTGGTACC 9456 .......... .......... .......... .......... .......... .......... 457 CGATCCATAT TCTATCCTGG TGTCAGAACG TGACACCCGA TCCATATCCT ATCCTGGTAC 9396 .......... .......... .......... .......... .......... .......... 457 CGGAACGTGG CACCCGATCA ATATTCTATC TTGGTGTCGG AACGTGACAC CCGATCCATA 9336 .......... .......... .......... .......... .......... .......... 457 TTCTATCCTG GTACCGAAAC GTGGCACCGG ATCCCCTAAT CTCATCACTT TCGTTCATCA 9276 .......... .......... .......... .......... .......... .......... 457 AGCCTTCTTT TATACCAAGG CATCATCATT AACAAAGTAG ATTAGGGTTT CTTTTCAAGA 9216 .......... .......... .......... .......... .......... .......... 457 TTTGGGATTC AATGGCTTCA TCATACTTAT TTATTCACAA TTACATAATC ACATCATTCA 9156 .......... .......... .......... .......... .......... .......... 457 TGCAAGCATA CAATTAAGCA TATAGAAGGT TTACAATACT ACTAACACAT ATCATTCGCT 9096 .......... .......... .......... .......... .......... .......... 457 ATTAAGAGTT TGCTACGAAT AGCATGAAAT AACCATAACC TACCTCCACT GAAGATTAGT 9036 .......... .......... .......... .......... .......... .......... 457 GATTAAGCAA GAAATTCCCA AGGCTTTTGT TCCTTCTTCT CGTTCGATCC TCCCTCAATT 8976 .......... .......... .......... .......... .......... .......... 457 CGTTTCTCTT TCCCTCTCTT TGTTCTTTCT ATTTTCTTAT TCCAACCCTC TTTCTTTTAC 8916 .......... .......... .......... .......... .......... .......... 457 CCTAATTAGT ATATAATTAA GAATAAAAGA TGACAATAAT ACCCCACTAA TTAACTTAAG 8856 .......... .......... .......... .......... .......... .......... 457 GTTACCTCTT TTAACCCCCA AGGATTTTGA GTTATTAATA TAAACCCATG AAATATATAA 8796 .......... .......... .......... .......... .......... .......... 457 TCATAGCAGG AATAGTCCAA AACGCCCCTT TAAAACTTAA CCAGAAATCT GACTCCAACT 8736 .......... .......... .......... .......... .......... .......... 457 GGGATTGCGC AACCTGTGAC GGGCCGTCGT GCCTGGGACG GTCCGTCCTG CAGGTCGTCG 8676 .......... .......... .......... .......... .......... .......... 457 CAAAGTTCAG AGACCCAATA TTTCCACCAA GGGTCTGTGA CGGTCCGTCA CACCTGTGAC 8616 .......... .......... .......... .......... .......... .......... 457 GGTCCGTCCT GCCATTCCGT CACGAAGTTC AGAGAGTCGA TTTTCTGTAC CCAATTTTAG 8556 .......... .......... .......... .......... .......... .......... 457 ATTTTCTAAG TGTTTTGAAA CGAGACCCTG CGACGGTCCG TCGTGCCCAT GACGGTCCGT 8496 .......... .......... .......... .......... .......... .......... 457 CATTGGGTTC GTCGCCTCAG CCTGTTTTTC CAGAAATAAA ATCTGCTGCT CAAAACGACT 8436 .......... .......... .......... .......... .......... .......... 457 AAACAGGTCG TTACAATAGA TACCAATTTA CCCATCGTTC GTCCCCGAAC GATCACAAGA 8376 .......... .......... .......... .......... .......... .......... 457 AGGAAAACAA GGGCGAAAAG GAGTACCTGA ATCTGTAAAC AGATGTGGGT ATTTTTCTCG 8316 .......... .......... .......... .......... .......... .......... 457 CATATCCGCC TCCTTCTCCC AAGTGGCTTC ATCAACGGGT CGATTCTTCC ATTGCACCTT 8256 .......... .......... .......... .......... .......... .......... 457 GATGGATGCA ATCTCTCTTG ACCTCAACTT GCGAACTTCT CTATCTAAAA TAGCAACAGG 8196 .......... .......... .......... .......... .......... .......... 457 CTCCTCCTCA TAAGACAAGT TCTCATCAAG CAAAACTGAA TCCCAACGGA TAATGTAATT 8136 .......... .......... .......... .......... .......... .......... 457 TCCATTCCCA TGATATCTTT TCAACATAGA CACATGGAAT ACCGGATGTA CTCCGGACAG 8076 .......... .......... .......... .......... .......... .......... 457 CCCTGGAGGC AAGGCTAACT CATAAGCCAC CTCTCCTACT CGCTTAAGTA CTTCAAATGG 8016 .......... .......... .......... .......... .......... .......... 457 TCCAATGTAC CTTGGACTTA GTTTACCCCT TTTTCCGAAC CGCATCACCC CTTTCATGGG 7956 .......... .......... .......... .......... .......... .......... 457 CGAAACTTTC AACAAGACTT GTTCACCTTC CATGAACTCT AAGTCTCTAA CCTTTCGATC 7896 .......... .......... .......... .......... .......... .......... 457 TGCATATTCT TTTTGTCTAC TTTGCGACGC TAACAACTTT TCTTGAATAG ATTTCACTTT 7836 | ||| || .......... .......... .......... .......... .......... GTGTCAATTA 467 ATCTAACGAT TCTCTCAAAA 7816 ||| || | || | || ATCATTCGGC TGTCACCCAA 487 hqPGS_C06HBa0153O03.1-2-_SGN-E370357+ (10291 9874) ******************************************************************************** EST sequence 133 +strand 710 n (File: SGN-E392027+) 1 CCACAGCCCC AGTGGCTGGC TCAGTCGCAC CCTGTCCCGC CGGTGCTGGT GTTGATGCTG 61 GCGTAGTCGT TGCTCTAGTT CTAACCATCT GCGAAATAGA GTGAAGATGG TCAGATACCA 121 ATTTGTATCA CCTAGATACC AATTGGACCC AAGTAATAGC ACGAAAGAAG AAAGAATGGA 181 ATTTTCCAAA AGTCTTATAG CCTCTCAAGG AAAAGTAAAG GCATCCCCCT ACCGTTCCTT 241 AAGACTCTAC TAGACTCGTT CTTGTGTGAT GAGACCAACG AACCTAATGC TCTGATACCA 301 AGTTTGTCAC GACCAAAACC GGGTTGCGAC TGGCACCCAC ACTTACCCTC CTATGTGAGC 361 GAACCAACCA ATCTAACCTT AACATTTCAA TATAATATCA ACAGAAAGTA ATGTGGAAGA 421 CTTAAACTCA TTAAATACAG ACCAATTCAT TAACTTCTAA AATTCAACAT CTATTATTCC 481 CCAAAATCTG GAAGTCATCA CCACAAGAAC ATCTACGATC AAATGACTAA ACTAAGAGTA 541 GTCTAAAAGC TAAAAATACA TAAGAAGCTA GTCCATGCCG GAAGTTCAAG GCATCAAGAC 601 TTGAAGAAGA AGATCCAGTC CAAGCTAGAA GCATTAGCTC ACCCTGAATT TCCGATGTAG 661 TAAGACTGGC TTGAATTACT GTTGAGTTGA ACACGATGGC ACGTTTGCTG Predicted gene structure (within gDNA segment 11633 to 8723): Exon 1 10295 9881 ( 415 n); cDNA 300 710 ( 411 n); score: 0.930 MATCH C06HBa0153O03.1-2- SGN-E392027+ 0.930 415 0.585 C PGS_C06HBa0153O03.1-2-_SGN-E392027+ (10295 9881) Alignment (genomic DNA sequence = upper lines): AACTATGTCA CGACCCAAAT CCGGGCCGCG TCTGGCACCC ACACTTACCC TCCTATGTGA 10236 || | ||||| ||| ||||| ||||| ||| ||||||||| |||||||||| |||||||||| AAGTTTGTCA CGA-CCAAAA CCGGGTTGCG ACTGGCACCC ACACTTACCC TCCTATGTGA 358 GCGAACCAAC CAATCTAAAC CTTAACATTT CAATATAATA TAACCAGAAA GTAATGCGGA 10176 |||||||||| |||||| ||| |||||||||| |||||||||| | | |||||| |||||| ||| GCGAACCAAC CAATCT-AAC CTTAACATTT CAATATAATA TCAACAGAAA GTAATGTGGA 417 AGACTTAAAC TCATTAAATA AAGACCAATT CATTAACTTC TAAAATTCAA CATCTATTAT 10116 |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| AGACTTAAAC TCATTAAATA CAGACCAATT CATTAACTTC TAAAATTCAA CATCTATTAT 477 TCCCCCAAAA TCTGGAAGTC ATCATCACAA GAACATCTAC GATCAAATGA CTAAACTAAG 10056 | |||||||| |||||||||| |||| ||||| |||||||||| |||||||||| |||||||||| T-CCCCAAAA TCTGGAAGTC ATCACCACAA GAACATCTAC GATCAAATGA CTAAACTAAG 536 AGTATTCTAA AAGCTAAAAA TACATAAGAA GCTAGTCCAT GCCGGAAGTT CAAGGCATCA 9996 |||| ||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGTAGTCTAA AAGCTAAAAA TACATAAGAA GCTAGTCCAT GCCGGAAGTT CAAGGCATCA 596 AGACTTGAAG AAGAAGACCC AGTCCAAGCT AGAAGCATTA GCTCACCCTG AATATCCGGT 9936 |||||||||| ||||||| || |||||||||| |||||||||| |||||||||| ||| |||| | AGACTTGAAG AAGAAGATCC AGTCCAAGCT AGAAGCATTA GCTCACCCTG AATTTCCGAT 656 ATGACGAAGA CTGGCTAGAA TCACTGCTGA GTTGAAGATG ACGGAACGTT TGCTG 9881 | | |||| |||||| ||| | |||| ||| |||||| | | | || ||||| ||||| GT-AGTAAGA CTGGCTTGAA TTACTGTTGA GTTGAACACG ATGGCACGTT TGCTG 710 hqPGS_C06HBa0153O03.1-2-_SGN-E392027+ (10295 9881) ******************************************************************************** EST sequence 9 -strand 299 n (File: SGN-E373117-) 1 TTTTTTTTTT TCTAAAAACT CAACAACTAT TATTATCCCC AAAATCTGGA AGTCATCATC 61 ACAAGAACAT CTACTTCAAA TTACTAAATC TAAGAGTATC TAAGAAGCTA AAATACATAA 121 ACAGCTAGTC CATGCCGGAA CTTCAAGGCA TCAAGACATG AAGAAGAAGA TCCAGTCCAA 181 GCTAGAAGCG TTAGCTCACC CTGAAATCCG ATGTAATGAA GACTGGCTAG AGTTGCGGTT 241 GAGTTGAAGA CGACGGTACG TTTGCCAAAA TTACGACAGT ATTTGGACAA GCTAGAAGA Predicted gene structure (within gDNA segment 11083 to 8260): Exon 1 10147 9885 ( 263 n); cDNA 1 263 ( 263 n); score: 0.844 MATCH C06HBa0153O03.1-2- SGN-E373117- 0.844 263 0.880 C PGS_C06HBa0153O03.1-2-_SGN-E373117- (10147 9885) Alignment (genomic DNA sequence = upper lines): TTCATTAACT TCT-AAAATT CAACATCTAT TATT-CCCCC AAAATCTGGA AGTCATCATC 10090 || || | ||| |||| | ||||| |||| |||| |||| |||||||||| |||||||||| TTTTTTTTTT TCTAAAAACT CAACAACTAT TATTATCCCC AAAATCTGGA AGTCATCATC 60 ACAAGAACAT CTACGATCAA ATGACTAAA- CTAAGAGTAT TCTAA-AAGC TAAAAATACA 10032 |||||||||| |||| |||| || |||||| ||||||||| ||||| |||| | |||||||| ACAAGAACAT CTAC-TTCAA ATTACTAAAT CTAAGAGTA- TCTAAGAAGC T-AAAATACA 117 TAAGAAGCTA GTCCATGCCG GAAGTTCAAG GCATCAAGAC TTGAAGAAGA AGACCCAGTC 9972 ||| ||||| |||||||||| ||| |||||| |||||||||| ||||||||| ||| |||||| TAAACAGCTA GTCCATGCCG GAACTTCAAG GCATCAAGAC ATGAAGAAGA AGATCCAGTC 177 CAAGCTAGAA GCATTAGCTC ACCCTGAATA TCCGGTATGA CGAAGACTGG CTAGAATCAC 9912 |||||||||| || ||||||| |||||||| | |||| | | | ||||||||| ||||| | | CAAGCTAGAA GCGTTAGCTC ACCCTGAA-A TCCGATGTAA TGAAGACTGG CTAGAGTTGC 236 TGCTGAGTTG AAGATGACGG AACGTTT 9885 | ||||||| |||| ||||| |||||| GGTTGAGTTG AAGACGACGG TACGTTT 263 hqPGS_C06HBa0153O03.1-2-_SGN-E373117- (10147 9885) ******************************************************************************** EST sequence 89 +strand 299 n (File: SGN-E373116+) 1 TTTTTTTTTT CTAAAAACTC AACAACTATT ATTATCCCCA AAATCTGGAA GTCATCATCA 61 CAAGAACATC TACTTCAAAT TACTAAATCT AAGAGTATCT AAGAAGCTAA AATACATAAA 121 CAGCTAGTCC ATGCCGGAAC TTCAAGGCAT CAAGACATGA AGAAGAAGAT CCAGTCCAAG 181 CTAGAAGCGT TAGCTCACCC TGAAATCCGA TGTAATGAAG ACTGGCTAGA GTTGCGGTTG 241 AGTTGAAGAC GACGGTACGT TTGCCAAAAT TACGACAGTA TTTGGACAAG CTAGAAGAG Predicted gene structure (within gDNA segment 11063 to 8240): Exon 1 10147 9885 ( 263 n); cDNA 1 262 ( 262 n); score: 0.842 MATCH C06HBa0153O03.1-2- SGN-E373116+ 0.842 263 0.880 C PGS_C06HBa0153O03.1-2-_SGN-E373116+ (10147 9885) Alignment (genomic DNA sequence = upper lines): TTCATTAACT TCTAAAATTC AACATCTATT ATT-CCCCCA AAATCTGGAA GTCATCATCA 10089 || || | |||| || |||| ||||| ||| ||||| |||||||||| |||||||||| TTTTTTTTTT CTAAAAACTC AACAACTATT ATTATCCCCA AAATCTGGAA GTCATCATCA 60 CAAGAACATC TACGATCAAA TGACTAAA-C TAAGAGTATT CTAA-AAGCT AAAAATACAT 10031 |||||||||| ||| ||||| | |||||| | |||||||| | |||| ||||| ||||||||| CAAGAACATC TAC-TTCAAA TTACTAAATC TAAGAGTA-T CTAAGAAGCT -AAAATACAT 117 AAGAAGCTAG TCCATGCCGG AAGTTCAAGG CATCAAGACT TGAAGAAGAA GACCCAGTCC 9971 || |||||| |||||||||| || ||||||| ||||||||| |||||||||| || ||||||| AAACAGCTAG TCCATGCCGG AACTTCAAGG CATCAAGACA TGAAGAAGAA GATCCAGTCC 177 AAGCTAGAAG CATTAGCTCA CCCTGAATAT CCGGTATGAC GAAGACTGGC TAGAATCACT 9911 |||||||||| | |||||||| ||||||| || ||| | | | |||||||||| |||| | | AAGCTAGAAG CGTTAGCTCA CCCTGAA-AT CCGATGTAAT GAAGACTGGC TAGAGTTGCG 236 GCTGAGTTGA AGATGACGGA ACGTTT 9885 | |||||||| ||| ||||| |||||| GTTGAGTTGA AGACGACGGT ACGTTT 262 hqPGS_C06HBa0153O03.1-2-_SGN-E373116+ (10147 9885) ******************************************************************************** EST sequence 138 +strand 265 n (File: SGN-E216150+) 1 TTTTTTTTTT TTTTTTTTAA TAAAAATCAA TAATCAACTT GTATAACTCA AAACTTATCA 61 TTCCCCAAAA TCTGGAAGTC ATCATCACCA GAGCCTCTAT CATAAAATTA CTAAACTAAG 121 AGTATTCTAA GAAGCTAAAA ATACATACGA AGCTAGTCCA TGCCGGAAGT TCAAGGCATC 181 AAGACTTGAA GAAGAAGATC CAGTCAAACC TAGAAGCATT AGCTCACCCT GAATTTCCGA 241 TGTAGTAGGA CTGGCTTGAG TTACT Predicted gene structure (within gDNA segment 11633 to 9023): Exon 1 10782 10771 ( 12 n); cDNA 1 12 ( 12 n); score: 0.917 Intron 1 10770 10166 ( 605 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.70) Exon 2 10165 9911 ( 255 n); cDNA 13 265 ( 253 n); score: 0.845 MATCH C06HBa0153O03.1-2- SGN-E216150+ 0.845 267 1.008 C PGS_C06HBa0153O03.1-2-_SGN-E216150+ (10782 10771,10165 9911) Alignment (genomic DNA sequence = upper lines): TTTTCTTTTT TTGTATCATA CAATTCTTTA TTTTTTGTTA AACTCTATTT AAATTTGACA 10723 |||| ||||| || TTTTTTTTTT TT........ .......... .......... .......... .......... 12 TTATATTTAT GTATCAAATA CTTTATTTTA TAATCCTTAT TCAAACTTAC TTTAATCAAT 10663 .......... .......... .......... .......... .......... .......... 12 ATTAATTGCA GAAATTTCAC CCAAATTATT TTCAGTTCTA ATTTAATTTA ATTTAGTAAT 10603 .......... .......... .......... .......... .......... .......... 12 ATAGTATCAT GTGTGTAGAT ACTTAAAAAA AATTCATATT TTGAAGGGCA AATAATTGTT 10543 .......... .......... .......... .......... .......... .......... 12 CGTAATATAT ATAAATCATA GTACTGAATA TATCTTATTT GTACAAAATA TATTAATTTT 10483 .......... .......... .......... .......... .......... .......... 12 TTTTAAAAAA AAATGGCGTG TATCAAGTGT AATAACTAGT ATCACTGTTT GTGATATTAT 10423 .......... .......... .......... .......... .......... .......... 12 ATATTTTGAT ATTTCATGTA TCATATACAT AATTTAATTA CGTATATAAT TTTCCTTTTG 10363 .......... .......... .......... .......... .......... .......... 12 TTGTATCATA CAATATATTT ACGTATATAT AATAAGTGTT TTGTTATGAC AATAATTACA 10303 .......... .......... .......... .......... .......... .......... 12 TTAGTGTAAC TATGTCACGA CCCAAATCCG GGCCGCGTCT GGCACCCACA CTTACCCTCC 10243 .......... .......... .......... .......... .......... .......... 12 TATGTGAGCG AACCAACCAA TCTAAACCTT AACATTTCAA TATAATATAA CCAGAAAGTA 10183 .......... .......... .......... .......... .......... .......... 12 ATGCGGAAGA CTTAAACTCA TTAAATAAAG ACCAATTCAT TAACTTCTAA AATTCAACAT 10123 | || |||||| | ||| | || ||||| || || |||| | .......... .......TTT TTTAATAAAA ATCAA-TAAT CAACTTGTAT AACTCAAAAC 54 CTATTATTCC CCCAAAATCT GGAAGTCATC ATCACAAGAA CATCTACGAT CAAATGACTA 10063 ||| ||| | |||||||||| |||||||||| ||||| ||| | |||| || |||| |||| TTATCATT-C CCCAAAATCT GGAAGTCATC ATCACCAGAG CCTCTATCAT AAAATTACTA 113 AACTAAGAGT ATTCTAA-AA GCTAAAAATA CATAAGAAGC TAGTCCATGC CGGAAGTTCA 10004 |||||||||| ||||||| || |||||||||| |||| ||||| |||||||||| |||||||||| AACTAAGAGT ATTCTAAGAA GCTAAAAATA CATACGAAGC TAGTCCATGC CGGAAGTTCA 173 AGGCATCAAG ACTTGAAGAA GAAGACCCAG TCCAAGCTAG AAGCATTAGC TCACCCTGAA 9944 |||||||||| |||||||||| ||||| |||| || || |||| |||||||||| |||||||||| AGGCATCAAG ACTTGAAGAA GAAGATCCAG TCAAACCTAG AAGCATTAGC TCACCCTGAA 233 TATCCGGTAT GACGAAGACT GGCTAGAATC ACT 9911 | |||| | | | | |||| |||| || | ||| TTTCCGATGT -AGTAGGACT GGCTTGAGTT ACT 265 hqPGS_C06HBa0153O03.1-2-_SGN-E216150+ (10165 9911) ******************************************************************************** EST sequence 21 -strand 219 n (File: SGN-E298638-) 1 TTTTTTTTTT TCTAAAAACT CAACAACTAT TATTATCCCC AAAATCTGGC AGTCATCATC 61 ACAAGAACAT GTACTTCAAA TTACTAAATC TAAGAGTATC TAAGAAGCTA AAATACATAA 121 ACAGCTAGTC CATGCCGGAA CTTCAAGGCA TCAAGACATG AAGAAGAAGA TCCAGTCCAA 181 GCTAGAAGCG TTAGCTCACC CTGAAATCCG ATGTAATGA Predicted gene structure (within gDNA segment 11633 to 9060): Exon 1 10147 9929 ( 219 n); cDNA 1 219 ( 219 n); score: 0.836 MATCH C06HBa0153O03.1-2- SGN-E298638- 0.836 219 1.000 C PGS_C06HBa0153O03.1-2-_SGN-E298638- (10147 9929) Alignment (genomic DNA sequence = upper lines): TTCATTAACT TCT-AAAATT CAACATCTAT TATT-CCCCC AAAATCTGGA AGTCATCATC 10090 || || | ||| |||| | ||||| |||| |||| |||| ||||||||| |||||||||| TTTTTTTTTT TCTAAAAACT CAACAACTAT TATTATCCCC AAAATCTGGC AGTCATCATC 60 ACAAGAACAT CTACGATCAA ATGACTAAA- CTAAGAGTAT TCTAA-AAGC TAAAAATACA 10032 |||||||||| ||| |||| || |||||| ||||||||| ||||| |||| | |||||||| ACAAGAACAT GTAC-TTCAA ATTACTAAAT CTAAGAGTA- TCTAAGAAGC T-AAAATACA 117 TAAGAAGCTA GTCCATGCCG GAAGTTCAAG GCATCAAGAC TTGAAGAAGA AGACCCAGTC 9972 ||| ||||| |||||||||| ||| |||||| |||||||||| ||||||||| ||| |||||| TAAACAGCTA GTCCATGCCG GAACTTCAAG GCATCAAGAC ATGAAGAAGA AGATCCAGTC 177 CAAGCTAGAA GCATTAGCTC ACCCTGAATA TCCGGTATGA CGA 9929 |||||||||| || ||||||| |||||||| | |||| | | | || CAAGCTAGAA GCGTTAGCTC ACCCTGAA-A TCCGATGTAA TGA 219 hqPGS_C06HBa0153O03.1-2-_SGN-E298638- (10147 9929) ******************************************************************************** EST sequence 30 -strand 666 n (File: SGN-E368629-) 1 TTTTTTTTTT TTTTTTTTTT TTTTTTATAA AAACCAATTC AATAACTATT ATTTCCCAAA 61 ATCTGGAAGT TATCATCACA AGAACATCTA CTTCGAATTA CTAAATCTAG AAGTATCTAA 121 GAGCCTAAAA TACATAACAC AGTTAGTCCA TGCCGAAACT TCAAGGCATC AAGACATAAA 181 GAAGAAGATC CAGTCCAAGC TAGAAGCTTT GTTTTATCGA AAAAAGGTGA TTTTTCGAAA 241 AGAGTTTGTT TTATTTTAAA GTATTTTTCG ACTTTAGGAG TCGCCACTTA ATTTTTAAGA 301 AAAATCAAGA AAACTCATTC TCAAAACAAT TTAAACAGAA AAGTCGTTTT GAAAATATTT 361 TTTAGGATTC GGGATTCTTA TTAGCGTCTT AGGAAGGTGT TTAAGGCACC TAAGACACTC 421 CGTTAAATAC GGTTTTCCAA CGACTAACTT ATTTGATTAT TTTTATTTTT ACCCTTTGCA 481 AATTTATTTG AACTTTTATC ACGATTTACT TAGCCAAACT TTGCAAATTT GAGATATTAA 541 TCTTTTAAGA TTCCGTCTTA GTTAAACTTT CTAAGCCTTA ACTCTCTAAG CAGACTTTCA 601 AATTTTAAAC CTCTATCGTT TCAAAACTTC AATTTTTATT TTTTAGTTTC ATAAAGCAAA 661 AGGCGT Predicted gene structure (within gDNA segment 11262 to 4170): Exon 1 10132 9960 ( 173 n); cDNA 36 207 ( 172 n); score: 0.844 PPA cDNA 28 1 MATCH C06HBa0153O03.1-2- SGN-E368629- 0.844 173 0.260 C PGS_C06HBa0153O03.1-2-_SGN-E368629- (10132 9960) Alignment (genomic DNA sequence = upper lines): AATTCAACAT CTATTATTCC CCCAAAATCT GGAAGTCATC ATCACAAGAA CATCTACGAT 10073 ||||||| | |||||||| |||||||||| |||||| ||| |||||||||| ||||||| | AATTCAATAA CTATTATT-T CCCAAAATCT GGAAGTTATC ATCACAAGAA CATCTAC-TT 93 CAAATGACTA AA-CTAAGAG TATTCTAAAA GCTAAAAATA CATAAGA-AG CTAGTCCATG 10015 | ||| |||| || ||| || || ||||| | || |||||| ||||| | || ||||||||| CGAATTACTA AATCTAGAAG TA-TCTAAGA GCCTAAAATA CATAACACAG TTAGTCCATG 152 CCGGAAGTTC AAGGCATCAA GACTTGAAGA AGAAGACCCA GTCCAAGCTA GAAGC 9960 ||| || ||| |||||||||| ||| | |||| |||||| ||| |||||||||| ||||| CCGAAACTTC AAGGCATCAA GACATAAAGA AGAAGATCCA GTCCAAGCTA GAAGC 207 hqPGS_C06HBa0153O03.1-2-_SGN-E368629- (10132 9960) ******************************************************************************** EST sequence 13 -strand 402 n (File: SGN-E352844-) 1 TTTTTTTTAT AAAAACCAAT TCAATAACTA TTATTTCCCA AAATCTGGAA GTTATCATCA 61 CAAGAACATC TACTTCGAAT TACTAAATCT AAGAGTATCT AAGAAGCTAA AATACATAAA 121 CAGCTAGTCC ATGCCGGAAC TTCAAGGCAT CAAGACATGA AGAAGAAGAT CCAGTCCAAG 181 CTAGAAGCTT TGTTTTATCG AAAAAAGGTG ATTTTTCGAA AAGAGTTTGT TTTATTTTAA 241 AGTATTTTTC GACTTTAGGA GTCGCCACTT AATTTTTAAG AAAAATCAAG AAAACTCATT 301 CTCAAAACAA TTTAAACAGA AAAGTCGTTT TGAAAATATT TTTTAGGATT CGGGATTCTT 361 ATTAGCGTCT TAGGAAGGTG TTTAAGGCAC CTAAGACACT CC Predicted gene structure (within gDNA segment 11082 to 7220): Exon 1 10132 9960 ( 173 n); cDNA 18 188 ( 171 n); score: 0.879 MATCH C06HBa0153O03.1-2- SGN-E352844- 0.879 173 0.430 C PGS_C06HBa0153O03.1-2-_SGN-E352844- (10132 9960) Alignment (genomic DNA sequence = upper lines): AATTCAACAT CTATTATTCC CCCAAAATCT GGAAGTCATC ATCACAAGAA CATCTACGAT 10073 ||||||| | |||||||| |||||||||| |||||| ||| |||||||||| ||||||| | AATTCAATAA CTATTATT-T CCCAAAATCT GGAAGTTATC ATCACAAGAA CATCTAC-TT 75 CAAATGACTA AA-CTAAGAG TATTCTAA-A AGCTAAAAAT ACATAAGAAG CTAGTCCATG 10015 | ||| |||| || ||||||| || ||||| | |||| ||||| |||||| || |||||||||| CGAATTACTA AATCTAAGAG TA-TCTAAGA AGCT-AAAAT ACATAAACAG CTAGTCCATG 133 CCGGAAGTTC AAGGCATCAA GACTTGAAGA AGAAGACCCA GTCCAAGCTA GAAGC 9960 |||||| ||| |||||||||| ||| |||||| |||||| ||| |||||||||| ||||| CCGGAACTTC AAGGCATCAA GACATGAAGA AGAAGATCCA GTCCAAGCTA GAAGC 188 hqPGS_C06HBa0153O03.1-2-_SGN-E352844- (10132 9960) ******************************************************************************** EST sequence 25 -strand 620 n (File: SGN-E238551-) 1 CTATTATTTC CCAAAATCTG GAAGTTATCA TCACAAGAAC ATCTACTTCG AATTACTAAA 61 TCTAAGAGTA TCTAAGAAGC TAAAATACAT AAACAGCTAG TCCATGCCGG AACTTCAAGG 121 CATCAAGACA TGAAGAAGAA GATCCAGTCC AAGCTAGAAG CTTTGTTTTA TCGAAAAAAG 181 GTGATTTTTC GAAAAGAGTT TGTTTTATTT TAAAGTATTT TTCGACTTTA GGAGTCGCCA 241 CTTAATTTTT AAGAAAAATC AAGAAAACTC ATTCTCAAAA CAATTTAAAC AGAAAAGTCG 301 TTTTGAAAAT ATTTTTTAGG ATTCGGGATT CTTATTAGCG TCTTAGGAAG GTGTTTAAGG 361 CACCTAAGAC ACTCCGTTAA ATACGGTTTT CCAACGACTA ACTTATTTGA TTATTTTTAT 421 TTTTACCCTT TGCAAATTTA TTTGAACTTT TATCACGATT TACTTAGCCA AACTTTGCAA 481 ATTTGAGATA TTAATCTTTT AAGATTCCGT CTTAGTTAAA CTTTCTAAGC CTTAACTCTC 541 TAAGCAGACT TTCAAATTTT AAACCTCTAT CGTTTCAAAA CTTCAATTTT TATTTTTTAG 601 TTTCATAAAG CAAAAGGCGT Predicted gene structure (within gDNA segment 10812 to 4170): Exon 1 10122 9960 ( 163 n); cDNA 1 161 ( 161 n); score: 0.880 MATCH C06HBa0153O03.1-2- SGN-E238551- 0.880 163 0.263 C PGS_C06HBa0153O03.1-2-_SGN-E238551- (10122 9960) Alignment (genomic DNA sequence = upper lines): CTATTATTCC CCCAAAATCT GGAAGTCATC ATCACAAGAA CATCTACGAT CAAATGACTA 10063 |||||||| |||||||||| |||||| ||| |||||||||| ||||||| | | ||| |||| CTATTATT-T CCCAAAATCT GGAAGTTATC ATCACAAGAA CATCTAC-TT CGAATTACTA 58 AA-CTAAGAG TATTCTAAAA GCTAAAAATA CATAAGAAGC TAGTCCATGC CGGAAGTTCA 10004 || ||||||| ||| | || ||| |||||| ||||| ||| |||||||||| ||||| |||| AATCTAAGAG TATCTAAGAA GCT-AAAATA CATAAACAGC TAGTCCATGC CGGAACTTCA 117 AGGCATCAAG ACTTGAAGAA GAAGACCCAG TCCAAGCTAG AAGC 9960 |||||||||| || ||||||| ||||| |||| |||||||||| |||| AGGCATCAAG ACATGAAGAA GAAGATCCAG TCCAAGCTAG AAGC 161 hqPGS_C06HBa0153O03.1-2-_SGN-E238551- (10122 9960) Total number of EST alignments reported: 138 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 11633: PGL 1 (- strand): 10296 748 AGS-1 (1282 940,864 748) SCR (e 0.948 d 0.000 a 0.000,e 0.957) Exon 1 1282 940 ( 343 n); score: 0.948 Intron 1 939 865 ( 75 n); Pd: 0.000 Pa: 0.000 Exon 2 864 748 ( 117 n); score: 0.957 PGS (1282 940,864 748) SGN-E556422- 3-phase translation of AGS-1 (-strand): . . . . . . 1282 TTTTAAAATCAAAACACTTTCATAATTTAACAATATGTGTTGTTATGTTAATAAATACTT F - N Q N T F I I - Q Y V L L C - - I L F K I K T L S - F N N M C C Y V N K Y F L K S K H F H N L T I C V V M L I N T . . . . . . 1222 CAAACATCTTGTTTAACTCCAAAAAAAAAAATAAAATCAAACATTTAAAACTAACTAGCC Q T S C L T P K K K I K S N I - N - L A K H L V - L Q K K K - N Q T F K T N - P S N I L F N S K K K N K I K H L K L T S . . . . . . 1162 TACATTAACAATTTCATCTTCAAATGATGGGTTTAATACATTCTTCATTCTTGGAGGGTC Y I N N F I F K - W V - Y I L H S W R V T L T I S S S N D G F N T F F I L G G S L H - Q F H L Q M M G L I H S S F L E G . . . . . . 1102 ATCACTGTTGCTAACATATCCAGCATTCATCTTGTCGAGACCATACTTCAATAAAAGTAT I T V A N I S S I H L V E T I L Q - K Y S L L L T Y P A F I L S R P Y F N K S I H H C C - H I Q H S S C R D H T S I K V . . . . . . 1042 CGCATATCTTTTGCCAAGATAGCTACTCTCAAAACTATTACATGACATATTAATTTTTTC R I S F A K I A T L K T I T - H I N F F A Y L L P R - L L S K L L H D I L I F S S H I F C Q D S Y S Q N Y Y M T Y - F F . . . . . : . 982 ACTCAGAAATTCTGCATATACAGCTACAAACAGTCCGCAATCA : AGGCTATCACTTATTTG T Q K F C I Y S Y K Q S A I : K A I T Y L L R N S A Y T A T N S P Q S : R L S L I C H S E I L H I Q L Q T V R N Q : G Y H L F . . . . . . 847 TTGCATGTTGTCTTGAGCAAATTCCACATTAAACGAATGTTGTGGATTTAAAAGTTCTCC L H V V L S K F H I K R M L W I - K F S C M L S - A N S T L N E C C G F K S S P V A C C L E Q I P H - T N V V D L K V L . . . . 787 AGTTTTCATGTCTTTATATGCATCCAATGCTGCCCAATCT S F H V F I C I Q C C P I V F M S L Y A S N A A Q S Q F S C L Y M H P M L P N Maximal non-overlapping open reading frames (>= 64 codons): none AGS-2 (2095 1395) SCR (e 0.829) Exon 1 2095 1395 ( 701 n); score: 0.829 PGS (1950 1395) SGN-E231589+ PGS (1950 1401) SGN-E389553- PGS (1950 1401) SGN-E550127- PGS (1950 1401) SGN-E550140- PGS (1950 1401) SGN-E550201+ PGS (1950 1401) SGN-E550207+ PGS (1950 1401) SGN-E550335+ PGS (1950 1401) SGN-E390013+ PGS (1950 1401) SGN-E550484+ PGS (1950 1401) SGN-E550211+ PGS (1950 1401) SGN-E550464+ PGS (1950 1401) SGN-E549941+ PGS (1950 1401) SGN-E550025+ PGS (1950 1401) SGN-E374999+ PGS (1950 1401) SGN-E396039+ PGS (1950 1401) SGN-E396056+ PGS (1950 1401) SGN-E377133+ PGS (1950 1401) SGN-E550212+ PGS (1950 1401) SGN-E550065+ PGS (1928 1401) SGN-E550322+ PGS (1901 1401) SGN-E377132- PGS (1677 1401) SGN-E252199- PGS (1950 1404) SGN-E389834+ PGS (1950 1404) SGN-E396054+ PGS (1950 1404) SGN-E396058+ PGS (1950 1405) SGN-E241959+ PGS (2095 1440) SGN-E349296- PGS (1950 1478) SGN-E236652+ PGS (1950 1497) SGN-E396070+ PGS (1906 1514) SGN-E356257- 3-phase translation of AGS-2 (-strand): . . . . . . 2095 AATACTACTAACACATATCATTCGCTATTAAGAGTTTGCTACGAATAGTATGAAATAACC N T T N T Y H S L L R V C Y E - Y E I T I L L T H I I R Y - E F A T N S M K - P Y Y - H I S F A I K S L L R I V - N N . . . . . . 2035 ATAACCTACCTCCACTGAAGATTAGTGATTAAGCAAGAAATTCCCAAGGCTTTTGTTCCT I T Y L H - R L V I K Q E I P K A F V P - P T S T E D - - L S K K F P R L L F L H N L P P L K I S D - A R N S Q G F C S . . . . . . 1975 TCTTCTCGTTCGATCCTCCCTCAATTCGTTTCTCTTTCCCTCTCTTTGTTCTTTCTATTT S S R S I L P Q F V S L S L S L F F L F L L V R S S L N S F L F P S L C S F Y F F F S F D P P S I R F S F P L F V L S I . . . . . . 1915 TCTTATTCCAACCCTCTTTCTTTTACCCTAATTAGTATATAATTAAGAATAAAAGATGGC S Y S N P L S F T L I S I - L R I K D G L I P T L F L L P - L V Y N - E - K M A F L F Q P S F F Y P N - Y I I K N K R W . . . . . . 1855 AATAATACCCCACTAATTAACTTAAGGTTACCTCTTTTAACCCCCAAGGATTTTGAGTTA N N T P L I N L R L P L L T P K D F E L I I P H - L T - G Y L F - P P R I L S Y Q - Y P T N - L K V T S F N P Q G F - V . . . . . . 1795 TTAATATAAACCCATGAAATATATAATCATAGCAGGAATAGTCCAAAACGCCCCTTTAAA L I - T H E I Y N H S R N S P K R P F K - Y K P M K Y I I I A G I V Q N A P L K I N I N P - N I - S - Q E - S K T P L - . . . . . . 1735 ACTTAACCAGAAATCTGACTCCAACTGGGATTGCACAACCTGTGACGGGCCGTCGTGCCT T - P E I - L Q L G L H N L - R A V V P L N Q K S D S N W D C T T C D G P S C L N L T R N L T P T G I A Q P V T G R R A . . . . . . 1675 GCGACGGTCCGTCCTGCAGGTCGTCGCAAAGTTCAGAGACCCAATATTTCCACCAAGGGT A T V R P A G R R K V Q R P N I S T K G R R S V L Q V V A K F R D P I F P P R V C D G P S C R S S Q S S E T Q Y F H Q G . . . . . . 1615 CTGTGATGGTCCGTCACACCTGTGACGGTCCGTCCTGCCATTCCGTCACGAAGTTCAGAG L - W S V T P V T V R P A I P S R S S E C D G P S H L - R S V L P F R H E V Q R S V M V R H T C D G P S C H S V T K F R . . . . . . 1555 AGTTGATTTTCAGTACCCAATTTTAGATTTTCTAAGTGTTTTGAAACGAGACCCTGCGAC S - F S V P N F R F S K C F E T R P C D V D F Q Y P I L D F L S V L K R D P A T E L I F S T Q F - I F - V F - N E T L R . . . . . . 1495 GGTCTGTCGTGCCCATGACGGTCCGTCGTTGGGTTCGTCGCCTCAGCCTGTTTTTCCAGA G L S C P - R S V V G F V A S A C F S R V C R A H D G P S L G S S P Q P V F P E R S V V P M T V R R W V R R L S L F F Q . . . . . 1435 AATAAAATCCGCTGCTCAAAACGACTAAACAGGTCGTTACA N K I R C S K R L N R S L I K S A A Q N D - T G R Y K - N P L L K T T K Q V V T Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-2-_PGL-1_AGS-2_PPS_1 (1736 1530) (frame '0'; 204 bp, 68 residues) 1 NLTRNLTPTG IAQPVTGRRA CDGPSCRSSQ SSETQYFHQG SVMVRHTCDG PSCHSVTKFR 61 ELIFSTQF- >C06HBa0153O03.1-2-_PGL-1_AGS-2_PPS_2 (1791 1591) (frame '2'; 198 bp, 66 residues) 1 YKPMKYIIIA GIVQNAPLKL NQKSDSNWDC TTCDGPSCLR RSVLQVVAKF RDPIFPPRVC 61 DGPSHL- 3-phase translation of AGS-2 (+strand): . . . . . . 1395 TGTAACGACCTGTTTAGTCGTTTTGAGCAGCGGATTTTATTTCTGGAAAAACAGGCTGAG C N D L F S R F E Q R I L F L E K Q A E V T T C L V V L S S G F Y F W K N R L R - R P V - S F - A A D F I S G K T G - . . . . . . 1455 GCGACGAACCCAACGACGGACCGTCATGGGCACGACAGACCGTCGCAGGGTCTCGTTTCA A T N P T T D R H G H D R P S Q G L V S R R T Q R R T V M G T T D R R R V S F Q G D E P N D G P S W A R Q T V A G S R F . . . . . . 1515 AAACACTTAGAAAATCTAAAATTGGGTACTGAAAATCAACTCTCTGAACTTCGTGACGGA K H L E N L K L G T E N Q L S E L R D G N T - K I - N W V L K I N S L N F V T E K T L R K S K I G Y - K S T L - T S - R . . . . . . 1575 ATGGCAGGACGGACCGTCACAGGTGTGACGGACCATCACAGACCCTTGGTGGAAATATTG M A G R T V T G V T D H H R P L V E I L W Q D G P S Q V - R T I T D P W W K Y W N G R T D R H R C D G P S Q T L G G N I . . . . . . 1635 GGTCTCTGAACTTTGCGACGACCTGCAGGACGGACCGTCGCAGGCACGACGGCCCGTCAC G L - T L R R P A G R T V A G T T A R H V S E L C D D L Q D G P S Q A R R P V T G S L N F A T T C R T D R R R H D G P S . . . . . . 1695 AGGTTGTGCAATCCCAGTTGGAGTCAGATTTCTGGTTAAGTTTTAAAGGGGCGTTTTGGA R L C N P S W S Q I S G - V L K G R F G G C A I P V G V R F L V K F - R G V L D Q V V Q S Q L E S D F W L S F K G A F W . . . . . . 1755 CTATTCCTGCTATGATTATATATTTCATGGGTTTATATTAATAACTCAAAATCCTTGGGG L F L L - L Y I S W V Y I N N S K S L G Y S C Y D Y I F H G F I L I T Q N P W G T I P A M I I Y F M G L Y - - L K I L G . . . . . . 1815 GTTAAAAGAGGTAACCTTAAGTTAATTAGTGGGGTATTATTGCCATCTTTTATTCTTAAT V K R G N L K L I S G V L L P S F I L N L K E V T L S - L V G Y Y C H L L F L I G - K R - P - V N - W G I I A I F Y S - . . . . . . 1875 TATATACTAATTAGGGTAAAAGAAAGAGGGTTGGAATAAGAAAATAGAAAGAACAAAGAG Y I L I R V K E R G L E - E N R K N K E I Y - L G - K K E G W N K K I E R T K R L Y T N - G K R K R V G I R K - K E Q R . . . . . . 1935 AGGGAAAGAGAAACGAATTGAGGGAGGATCGAACGAGAAGAAGGAACAAAAGCCTTGGGA R E R E T N - G R I E R E E G T K A L G G K E K R I E G G S N E K K E Q K P W E E G K R N E L R E D R T R R R N K S L G . . . . . . 1995 ATTTCTTGCTTAATCACTAATCTTCAGTGGAGGTAGGTTATGGTTATTTCATACTATTCG I S C L I T N L Q W R - V M V I S Y Y S F L A - S L I F S G G R L W L F H T I R N F L L N H - S S V E V G Y G Y F I L F . . . . . 2055 TAGCAAACTCTTAATAGCGAATGATATGTGTTAGTAGTATT - Q T L N S E - Y V L V V S K L L I A N D M C - - Y V A N S - - R M I C V S S I Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-2+_PGL-1_AGS-2_PPS_1 (1395 1643) (frame '1'; 246 bp, 82 residues) 1 CNDLFSRFEQ RILFLEKQAE ATNPTTDRHG HDRPSQGLVS KHLENLKLGT ENQLSELRDG 61 MAGRTVTGVT DHHRPLVEIL GL- >C06HBa0153O03.1-2+_PGL-1_AGS-2_PPS_2 (1571 1795) (frame '0'; 222 bp, 74 residues) 1 RNGRTDRHRC DGPSQTLGGN IGSLNFATTC RTDRRRHDGP SQVVQSQLES DFWLSFKGAF 61 WTIPAMIIYF MGLY- AGS-3 (3567 2471,2430 2204) SCR (e 0.899 d 0.000 a 0.000,e 0.978) Exon 1 3567 2471 (1097 n); score: 0.899 Intron 1 2470 2431 ( 40 n); Pd: 0.000 Pa: 0.000 Exon 2 2430 2204 ( 227 n); score: 0.978 PGS (2759 2471,2430 2204) SGN-E242359- PGS (2949 2542) SGN-E246710- PGS (2849 2615) SGN-E209683- PGS (3509 2775) SGN-E351546- PGS (3434 2775) SGN-E356696- PGS (3358 2775) SGN-E356206- PGS (3126 2845) SGN-E222578+ PGS (3507 2855) SGN-E392027+ PGS (3264 2856) SGN-E370357+ PGS (3121 2859) SGN-E373117- PGS (3121 2859) SGN-E373116+ PGS (3112 2885) SGN-E216150+ PGS (3121 2903) SGN-E298638- PGS (3121 2934) SGN-E352844- PGS (3106 2934) SGN-E368629- PGS (3096 2934) SGN-E238551- PGS (3567 3282) SGN-E355114- 3-phase translation of AGS-3 (-strand): . . . . . . 3567 CCACAGCCCCAGTGGTTGGCTCAGTTGTTTCTTGTCTGGCCGGTATTGGTGTTGGCGTAG P Q P Q W L A Q L F L V W P V L V L A - H S P S G W L S C F L S G R Y W C W R S T A P V V G S V V S C L A G I G V G V . . . . . . 3507 TCGTTGCTCTAGTTCTAACCATCTGTGAAAGAGAGTGAAGATGGTCAGATACTAATTCGT S L L - F - P S V K E S E D G Q I L I R R C S S S N H L - K R V K M V R Y - F V V V A L V L T I C E R E - R W S D T N S . . . . . . 3447 ATCGCCTAGATACCAATTGGACTCAAGTAGTAGCACGAAAGAAAGAATGAGAGAGTGAAA I A - I P I G L K - - H E R K N E R V K S P R Y Q L D S S S S T K E R M R E - N Y R L D T N W T Q V V A R K K E - E S E . . . . . . 3387 TTTTCCTAAAGTCTTATAGCCTCTCAAGAAAAAGTAAAGGCGTCCCCCTACCGTTCCTTA F S - S L I A S Q E K V K A S P Y R S L F P K V L - P L K K K - R R P P T V P - I F L K S Y S L S R K S K G V P L P F L . . . . . . 3327 AGACTCTACTAGACCTGTTCTTGTGTGATGAGACCAACGAACCTAATGCTCTGATACCAA R L Y - T C S C V M R P T N L M L - Y Q D S T R P V L V - - D Q R T - C S D T K K T L L D L F L C D E T N E P N A L I P . . . . . . 3267 GTTTGTCACGACCCAAATCCAGGCCACGACTGGCACCCACACTTACCCTCCTATGTGAGC V C H D P N P G H D W H P H L P S Y V S F V T T Q I Q A T T G T H T Y P P M - A S L S R P K S R P R L A P T L T L L C E . . . . . . 3207 GAACCAACCAATCTAAACCTTAACATTTCAATATAATATAACCAGAAAGTAATGCGGAAG E P T N L N L N I S I - Y N Q K V M R K N Q P I - T L T F Q Y N I T R K - C G R R T N Q S K P - H F N I I - P E S N A E . . . . . . 3147 ACTTAAAATCATTAAATAAAGACCAATTCATTAACTTCTAAAATTCAACATCTATTATTC T - N H - I K T N S L T S K I Q H L L F L K I I K - R P I H - L L K F N I Y Y S D L K S L N K D Q F I N F - N S T S I I . . . . . . 3087 CCCCAAAATCTGGAAGTCATCATCACAAGAACATCTACGATCAAATGACTAAACTAAGAG P Q N L E V I I T R T S T I K - L N - E P K I W K S S S Q E H L R S N D - T K S P P K S G S H H H K N I Y D Q M T K L R . . . . . . 3027 TATTCTAAAAGCTAAAAATACATAAGAAGCTAGTCCATGCCGGAAGTTCAAGGCATCAAG Y S K S - K Y I R S - S M P E V Q G I K I L K A K N T - E A S P C R K F K A S R V F - K L K I H K K L V H A G S S R H Q . . . . . . 2967 ACTTGAAGAAGAAGACCCAGTCCAAGCTAGAAGCATTAGCTCACCCTGAATATCCGGTAT T - R R R P S P S - K H - L T L N I R Y L E E E D P V Q A R S I S S P - I S G M D L K K K T Q S K L E A L A H P E Y P V . . . . . . 2907 GACGAAGACTGGCTAGAATCACTGCTGAGTTGAAGATGACGGAACGTTTGCTGCACTCCA D E D W L E S L L S - R - R N V C C T P T K T G - N H C - V E D D G T F A A L H - R R L A R I T A E L K M T E R L L H S . . . . . . 2847 CAAATAACAAGAAGAAAACATAAAAGTAGGGGTCAGTACAAAACACGGGTACTGAGTAGA Q I T R R K H K S R G Q Y K T R V L S R K - Q E E N I K V G V S T K H G Y - V D T N N K K K T - K - G S V Q N T G T E - . . . . . . 2787 TATCATCGGCCAACTCAAAATAGAAAACAGTATGTATTAAGCAATATCATAAAATCAATT Y H R P T Q N R K Q Y V L S N I I K S I I I G Q L K I E N S M Y - A I S - N Q L I S S A N S K - K T V C I K Q Y H K I N . . . . . . 2727 AATATCCTTAGCATGCAGCATTTACAGTTACCATAACCCTTGGTTACAACACCAAGCACA N I L S M Q H L Q L P - P L V T T P S T I S L A C S I Y S Y H N P W L Q H Q A H - Y P - H A A F T V T I T L G Y N T K H . . . . . . 2667 TCAATGAGGACTCACACCTCCTCATCACACTCATTTGGGAATTTAGTTCATTAGATTGGA S M R T H T S S S H S F G N L V H - I G Q - G L T P P H H T H L G I - F I R L D I N E D S H L L I T L I W E F S S L D W . . . . . . 2607 TATATTAACATATTTCAAGATTCATTATCTTTATTCTCCTCGTGTCGGTACGTGACACTC Y I N I F Q D S L S L F S S C R Y V T L I L T Y F K I H Y L Y S P R V G T - H S I Y - H I S R F I I F I L L V S V R D T . . . . . . 2547 CGCTCCTCAATATACTATCCTGGTGTCGGAACGTGACACTCTGATCCTCATTCTATCCTG R S S I Y Y P G V G T - H S D P H S I L A P Q Y T I L V S E R D T L I L I L S W P L L N I L S W C R N V T L - S S F Y P . . : . . . . 2487 GTGTCGGAACGTGACAC : CCGATCCATATTCTATCCTGGTGTCAGAACGTGACACCCGATC V S E R D T : R S I F Y P G V R T - H P I C R N V T : P D P Y S I L V S E R D T R S G V G T - H : P I H I L S W C Q N V T P D . . . . . . 2387 CATATCCTATCCTGGTACCGGAACGTGGCACCCGATCCATATTCTATCTTGGTGTCGGAA H I L S W Y R N V A P D P Y S I L V S E I S Y P G T G T W H P I H I L S W C R N P Y P I L V P E R G T R S I F Y L G V G . . . . . . 2327 CGTGACACCCGATCTATATTCTATCCTGGTACCGGAACGTGGCACCCGATCCCCTAATCT R D T R S I F Y P G T G T W H P I P - S V T P D L Y S I L V P E R G T R S P N L T - H P I Y I L S W Y R N V A P D P L I . . . . . . 2267 CACCACTTTCGTTCATCAAGCCTTCTTTTATACCAAGGCATCATCATTAACAAAGTAGAT H H F R S S S L L L Y Q G I I I N K V D T T F V H Q A F F Y T K A S S L T K - I S P L S F I K P S F I P R H H H - Q S R . 2207 TAGG - R L Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-2-_PGL-1_AGS-3_PPS_1 (2552 2471,2430 2210) (frame '2'; 300 bp, 100 residues) 1 HSAPQYTILV SERDTLILIL SWCRNVTPDP YSILVSERDT RSISYPGTGT WHPIHILSWC 61 RNVTPDLYSI LVPERGTRSP NLTTFVHQAF FYTKASSLTK - >C06HBa0153O03.1-2-_PGL-1_AGS-3_PPS_2 (3397 3185) (frame '0'; 210 bp, 70 residues) 1 ESEIFLKSYS LSRKSKGVPL PFLKTLLDLF LCDETNEPNA LIPSLSRPKS RPRLAPTLTL 61 LCERTNQSKP - AGS-4 (10296 10138,3094 2448,2370 2215) SCR (e 0.887 d 0.000 a 0.000,e 0.777 d 0.000 a 0.000,e 0.795) Exon 1 10296 10138 ( 159 n); score: 0.887 Intron 1 10137 3095 (7043 n); Pd: 0.000 Pa: 0.000 Exon 2 3094 2448 ( 647 n); score: 0.777 Intron 2 2447 2371 ( 77 n); Pd: 0.000 Pa: 0.000 Exon 3 2370 2215 ( 156 n); score: 0.795 PGS (2991 2448,2370 2215) SGN-E241789+ PGS (10296 10138,3094 2856) SGN-E542084+ 3-phase translation of AGS-4 (-strand): . . . . . . 10296 TAACTATGTCACGACCCAAATCCGGGCCGCGTCTGGCACCCACACTTACCCTCCTATGTG - L C H D P N P G R V W H P H L P S Y V N Y V T T Q I R A A S G T H T Y P P M - T M S R P K S G P R L A P T L T L L C . . . . . . 10236 AGCGAACCAACCAATCTAAACCTTAACATTTCAATATAATATAACCAGAAAGTAATGCGG S E P T N L N L N I S I - Y N Q K V M R A N Q P I - T L T F Q Y N I T R K - C G E R T N Q S K P - H F N I I - P E S N A . . . . : . . 10176 AAGACTTAAACTCATTAAATAAAGACCAATTCATTAACT : ATTATTCCCCCAAAATCTGGA K T - T H - I K T N S L T : I I P P K S G R L K L I K - R P I H - L : L F P Q N L E E D L N S L N K D Q F I N : Y Y S P K I W . . . . . . 3073 AGTCATCATCACAAGAACATCTACGATCAAATGACTAAACTAAGAGTATTCTAAAAGCTA S H H H K N I Y D Q M T K L R V F - K L V I I T R T S T I K - L N - E Y S K S - K S S S Q E H L R S N D - T K S I L K A . . . . . . 3013 AAAATACATAAGAAGCTAGTCCATGCCGGAAGTTCAAGGCATCAAGACTTGAAGAAGAAG K I H K K L V H A G S S R H Q D L K K K K Y I R S - S M P E V Q G I K T - R R R K N T - E A S P C R K F K A S R L E E E . . . . . . 2953 ACCCAGTCCAAGCTAGAAGCATTAGCTCACCCTGAATATCCGGTATGACGAAGACTGGCT T Q S K L E A L A H P E Y P V - R R L A P S P S - K H - L T L N I R Y D E D W L D P V Q A R S I S S P - I S G M T K T G . . . . . . 2893 AGAATCACTGCTGAGTTGAAGATGACGGAACGTTTGCTGCACTCCACAAATAACAAGAAG R I T A E L K M T E R L L H S T N N K K E S L L S - R - R N V C C T P Q I T R R - N H C - V E D D G T F A A L H K - Q E . . . . . . 2833 AAAACATAAAAGTAGGGGTCAGTACAAAACACGGGTACTGAGTAGATATCATCGGCCAAC K T - K - G S V Q N T G T E - I S S A N K H K S R G Q Y K T R V L S R Y H R P T E N I K V G V S T K H G Y - V D I I G Q . . . . . . 2773 TCAAAATAGAAAACAGTATGTATTAAGCAATATCATAAAATCAATTAATATCCTTAGCAT S K - K T V C I K Q Y H K I N - Y P - H Q N R K Q Y V L S N I I K S I N I L S M L K I E N S M Y - A I S - N Q L I S L A . . . . . . 2713 GCAGCATTTACAGTTACCATAACCCTTGGTTACAACACCAAGCACATCAATGAGGACTCA A A F T V T I T L G Y N T K H I N E D S Q H L Q L P - P L V T T P S T S M R T H C S I Y S Y H N P W L Q H Q A H Q - G L . . . . . . 2653 CACCTCCTCATCACACTCATTTGGGAATTTAGTTCATTAGATTGGATATATTAACATATT H L L I T L I W E F S S L D W I Y - H I T S S S H S F G N L V H - I G Y I N I F T P P H H T H L G I - F I R L D I L T Y . . . . . . 2593 TCAAGATTCATTATCTTTATTCTCCTCGTGTCGGTACGTGACACTCCGCTCCTCAATATA S R F I I F I L L V S V R D T P L L N I Q D S L S L F S S C R Y V T L R S S I Y F K I H Y L Y S P R V G T - H S A P Q Y . . . . . . 2533 CTATCCTGGTGTCGGAACGTGACACTCTGATCCTCATTCTATCCTGGTGTCGGAACGTGA L S W C R N V T L - S S F Y P G V G T - Y P G V G T - H S D P H S I L V S E R D T I L V S E R D T L I L I L S W C R N V . . . : . . . 2473 CACTCCGATCCTCATATACTATCCTG : CCGGAACGTGGCACCCGATCCATATTCTATCTTG H S D P H I L S C : R N V A P D P Y S I L T P I L I Y Y P : A G T W H P I H I L S W T L R S S Y T I L : P E R G T R S I F Y L . . . . . . 2336 GTGTCGGAACGTGACACCCGATCTATATTCTATCCTGGTACCGGAACGTGGCACCCGATC V S E R D T R S I F Y P G T G T W H P I C R N V T P D L Y S I L V P E R G T R S G V G T - H P I Y I L S W Y R N V A P D . . . . . . 2276 CCCTAATCTCACCACTTTCGTTCATCAAGCCTTCTTTTATACCAAGGCATCATCATTAAC P - S H H F R S S S L L L Y Q G I I I N P N L T T F V H Q A F F Y T K A S S L T P L I S P L S F I K P S F I P R H H H - . 2216 AA Q Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-2-_PGL-1_AGS-4_PPS_1 (2511 2448,2370 2216) (frame '2'; 219 bp, 73 residues) 1 HSDPHSILVS ERDTPILIYY PAGTWHPIHI LSWCRNVTPD LYSILVPERG TRSPNLTTFV 61 HQAFFYTKAS SLT AGS-5 (2793 2494,2459 2258) SCR (e 0.757 d 0.000 a 0.000,e 0.735) Exon 1 2793 2494 ( 300 n); score: 0.757 Intron 1 2493 2460 ( 34 n); Pd: 0.000 Pa: 0.000 Exon 2 2459 2258 ( 202 n); score: 0.735 PGS (2793 2494,2459 2258) SGN-E349977- 3-phase translation of AGS-5 (-strand): . . . . . . 2793 AGTAGATATCATCGGCCAACTCAAAATAGAAAACAGTATGTATTAAGCAATATCATAAAA S R Y H R P T Q N R K Q Y V L S N I I K V D I I G Q L K I E N S M Y - A I S - N - I S S A N S K - K T V C I K Q Y H K . . . . . . 2733 TCAATTAATATCCTTAGCATGCAGCATTTACAGTTACCATAACCCTTGGTTACAACACCA S I N I L S M Q H L Q L P - P L V T T P Q L I S L A C S I Y S Y H N P W L Q H Q I N - Y P - H A A F T V T I T L G Y N T . . . . . . 2673 AGCACATCAATGAGGACTCACACCTCCTCATCACACTCATTTGGGAATTTAGTTCATTAG S T S M R T H T S S S H S F G N L V H - A H Q - G L T P P H H T H L G I - F I R K H I N E D S H L L I T L I W E F S S L . . . . . . 2613 ATTGGATATATTAACATATTTCAAGATTCATTATCTTTATTCTCCTCGTGTCGGTACGTG I G Y I N I F Q D S L S L F S S C R Y V L D I L T Y F K I H Y L Y S P R V G T - D W I Y - H I S R F I I F I L L V S V R . . . . . . : 2553 ACACTCCGCTCCTCAATATACTATCCTGGTGTCGGAACGTGACACTCTGATCCTCATTCT : T L R S S I Y Y P G V G T - H S D P H S : H S A P Q Y T I L V S E R D T L I L I L : D T P L L N I L S W C R N V T L - S S F : . . . . . . 2459 TATACTATCCTGGTACCGGAACGTGGCACCCGATCCATATTCTATCCTGGTGTCAGAACG Y T I L V P E R G T R S I F Y P G V R T I L S W Y R N V A P D P Y S I L V S E R L Y Y P G T G T W H P I H I L S W C Q N . . . . . . 2399 TGACACCCGATCCATATCCTATCCTGGTACCGGAACGTGGCACCCGATCCATATTCTATC - H P I H I L S W Y R N V A P D P Y S I D T R S I S Y P G T G T W H P I H I L S V T P D P Y P I L V P E R G T R S I F Y . . . . . . 2339 TTGGTGTCGGAACGTGACACCCGATCTATATTCTATCCTGGTACCGGAACGTGGCACCCG L V S E R D T R S I F Y P G T G T W H P W C R N V T P D L Y S I L V P E R G T R L G V G T - H P I Y I L S W Y R N V A P . . . 2279 ATCCCCTAATCTCACCACTTTC I P - S H H F S P N L T T F D P L I S P L Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-2-_PGL-1_AGS-5_PPS_1 (2552 2494,2459 2258) (frame '2'; 261 bp, 87 residues) 1 HSAPQYTILV SERDTLILIL ILSWYRNVAP DPYSILVSER DTRSISYPGT GTWHPIHILS 61 WCRNVTPDLY SILVPERGTR SPNLTTF AGS-6 (3087 2537,2497 2387,2315 2290) SCR (e 0.808 d 0.000 a 0.000,e 0.923 d 0.900 a 0.000,e 0.962) Exon 1 3087 2537 ( 551 n); score: 0.808 Intron 1 2536 2498 ( 39 n); Pd: 0.000 Pa: 0.000 Exon 2 2497 2387 ( 111 n); score: 0.923 Intron 2 2386 2316 ( 71 n); Pd: 0.900 Pa: 0.000 Exon 3 2315 2290 ( 26 n); score: 0.962 PGS (3087 2537,2497 2387,2315 2290) SGN-E546506+ 3-phase translation of AGS-6 (-strand): . . . . . . 3087 CCCCAAAATCTGGAAGTCATCATCACAAGAACATCTACGATCAAATGACTAAACTAAGAG P Q N L E V I I T R T S T I K - L N - E P K I W K S S S Q E H L R S N D - T K S P K S G S H H H K N I Y D Q M T K L R . . . . . . 3027 TATTCTAAAAGCTAAAAATACATAAGAAGCTAGTCCATGCCGGAAGTTCAAGGCATCAAG Y S K S - K Y I R S - S M P E V Q G I K I L K A K N T - E A S P C R K F K A S R V F - K L K I H K K L V H A G S S R H Q . . . . . . 2967 ACTTGAAGAAGAAGACCCAGTCCAAGCTAGAAGCATTAGCTCACCCTGAATATCCGGTAT T - R R R P S P S - K H - L T L N I R Y L E E E D P V Q A R S I S S P - I S G M D L K K K T Q S K L E A L A H P E Y P V . . . . . . 2907 GACGAAGACTGGCTAGAATCACTGCTGAGTTGAAGATGACGGAACGTTTGCTGCACTCCA D E D W L E S L L S - R - R N V C C T P T K T G - N H C - V E D D G T F A A L H - R R L A R I T A E L K M T E R L L H S . . . . . . 2847 CAAATAACAAGAAGAAAACATAAAAGTAGGGGTCAGTACAAAACACGGGTACTGAGTAGA Q I T R R K H K S R G Q Y K T R V L S R K - Q E E N I K V G V S T K H G Y - V D T N N K K K T - K - G S V Q N T G T E - . . . . . . 2787 TATCATCGGCCAACTCAAAATAGAAAACAGTATGTATTAAGCAATATCATAAAATCAATT Y H R P T Q N R K Q Y V L S N I I K S I I I G Q L K I E N S M Y - A I S - N Q L I S S A N S K - K T V C I K Q Y H K I N . . . . . . 2727 AATATCCTTAGCATGCAGCATTTACAGTTACCATAACCCTTGGTTACAACACCAAGCACA N I L S M Q H L Q L P - P L V T T P S T I S L A C S I Y S Y H N P W L Q H Q A H - Y P - H A A F T V T I T L G Y N T K H . . . . . . 2667 TCAATGAGGACTCACACCTCCTCATCACACTCATTTGGGAATTTAGTTCATTAGATTGGA S M R T H T S S S H S F G N L V H - I G Q - G L T P P H H T H L G I - F I R L D I N E D S H L L I T L I W E F S S L D W . . . . . . 2607 TATATTAACATATTTCAAGATTCATTATCTTTATTCTCCTCGTGTCGGTACGTGACACTC Y I N I F Q D S L S L F S S C R Y V T L I L T Y F K I H Y L Y S P R V G T - H S I Y - H I S R F I I F I L L V S V R D T . . : . . . . 2547 CGCTCCTCAAT : TTCTATCCTGGTGTCGGAACGTGACACTCCGATCCTCATATACTATCCT R S S I : S I L V S E R D T P I L I Y Y P A P Q : F L S W C R N V T L R S S Y T I L P L L N : F Y P G V G T - H S D P H I L S . . . . . . 2448 GGTACCGGAACGTGGCACCCGATCCATATTCTATCCTGGTGTCAGAACGTGACACCCGAT G T G T W H P I H I L S W C Q N V T P D V P E R G T R S I F Y P G V R T - H P I W Y R N V A P D P Y S I L V S E R D T R . : . . 2388 CC : TCTATATTCTATCCTGGTACCGGAAC P : L Y S I L V P E : L Y I L S W Y R N S : S I F Y P G T G Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-2-_PGL-1_AGS-6_PPS_1 (2613 2537,2497 2387,2315 2291) (frame '1'; 213 bp, 71 residues) 1 IGYINIFQDS LSLFSSCRYV TLRSSISILV SERDTPILIY YPGTGTWHPI HILSWCQNVT 61 PDPLYSILVP E AGS-7 (2526 2411) SCR (e 0.789) Exon 1 2526 2411 ( 116 n); score: 0.789 PGS (2526 2411) SGN-E546548- 3-phase translation of AGS-7 (-strand): . . . . . . 2526 GGTGTCGGAACGTGACACTCTGATCCTCATTCTATCCTGGTGTCGGAACGTGACACTCCG G V G T - H S D P H S I L V S E R D T P V S E R D T L I L I L S W C R N V T L R C R N V T L - S S F Y P G V G T - H S . . . . . . 2466 ATCCTCATATACTATCCTGGTACCGGAACGTGGCACCCGATCCATATTCTATCCTG I L I Y Y P G T G T W H P I H I L S S S Y T I L V P E R G T R S I F Y P D P H I L S W Y R N V A P D P Y S I L Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-7 (+strand): . . . . . . 2411 CAGGATAGAATATGGATCGGGTGCCACGTTCCGGTACCAGGATAGTATATGAGGATCGGA Q D R I W I G C H V P V P G - Y M R I G R I E Y G S G A T F R Y Q D S I - G S E G - N M D R V P R S G T R I V Y E D R . . . . . . 2471 GTGTCACGTTCCGACACCAGGATAGAATGAGGATCAGAGTGTCACGTTCCGACACC V S R S D T R I E - G S E C H V P T C H V P T P G - N E D Q S V T F R H S V T F R H Q D R M R I R V S R S D T Maximal non-overlapping open reading frames (>= 64 codons): none AGS-8 (6283 3576) SCR (e 0.912) Exon 1 6283 3576 (2708 n); score: 0.912 PGS (4005 3576) SGN-E352180+ PGS (4193 3642) SGN-E329287- PGS (4363 3645) SGN-E356614+ PGS (4360 3698) SGN-E352401+ PGS (4558 3854) SGN-E349404+ PGS (4558 3886) SGN-E351625+ PGS (4558 3951) SGN-E357065+ PGS (4558 4043) SGN-E352365+ PGS (5000 4211) SGN-E356912+ PGS (4924 4227) SGN-E356209+ PGS (5080 4317) SGN-E214046+ PGS (4998 4465) SGN-E353805+ PGS (5084 4526) SGN-E244046+ PGS (5467 4925) SGN-E355026+ PGS (5694 4933) SGN-E355244+ PGS (5291 4965) SGN-E352716+ PGS (5651 4991) SGN-E352117- PGS (5694 5034) SGN-E351414+ PGS (5443 5060) SGN-E242765+ PGS (5763 5105) SGN-E355232+ PGS (6212 5533) SGN-E368762+ PGS (6283 5571) SGN-E379315+ PGS (5740 5614) SGN-E578271+ PGS (6283 5690) SGN-E375319+ PGS (6283 5758) SGN-E204434+ PGS (6212 5854) SGN-E240817+ 3-phase translation of AGS-8 (-strand): . . . . . . 6283 GAATCCCTTGACAAATCGACGGTAGTAGCTAGCTAACCCAACAAAGCTCCTTATTTCTGA E S L D K S T V V A S - P N K A P Y F - N P L T N R R - - L A N P T K L L I S D I P - Q I D G S S - L T Q Q S S L F L . . . . . . 6223 CACATTAGTAGGTCTTACCCAATTCTTCACTGTCTCAATCTTAGAAGGATCCACCATCAC H I S R S Y P I L H C L N L R R I H H H T L V G L T Q F F T V S I L E G S T I T T H - - V L P N S S L S Q S - K D P P S . . . . . . 6163 TCCATCCTTAGAAACCACGTGCCCCAAGAAGGACACTGCATCTAGCCAAAACTCACACTT S I L R N H V P Q E G H C I - P K L T L P S L E T T C P K K D T A S S Q N S H L L H P - K P R A P R R T L H L A K T H T . . . . . . 6103 AGAGAATTTGGCATAAAGCTTTTTCTCCCTCAACATTTCCAATACCATTCTCAAATGCTC R E F G I K L F L P Q H F Q Y H S Q M L E N L A - S F F S L N I S N T I L K C S - R I W H K A F S P S T F P I P F S N A . . . . . . 6043 TTCATGTTCCTTCTTGCTCTTTGAGTATACCAATATATCATCAATAAATACGATCACGAA F M F L L A L - V Y Q Y I I N K Y D H E S C S F L L F E Y T N I S S I N T I T K L H V P S C S L S I P I Y H Q - I R S R . . . . . . 5983 GAGGTCCAAATATGGCTTAAAAATCCCGTTCATCAAGCTCATGAACGCAACAGGGGCGTT E V Q I W L K N P V H Q A H E R N R G V R S K Y G L K I P F I K L M N A T G A F R G P N M A - K S R S S S S - T Q Q G R . . . . . . 5923 CATAAGACCAAAAGACATCACTACAAATTTGTAATGCCCATACCTCGTTCGAAAAGCAGT H K T K R H H Y K F V M P I P R S K S S I R P K D I T T N L - C P Y L V R K A V S - D Q K T S L Q I C N A H T S F E K Q . . . . . . 5863 CTTTGGCACATCCGTTGCCCGTATTTTCAATTGATGATAACCGGATCTCAAGTCAATCTT L W H I R C P Y F Q L M I T G S Q V N L F G T S V A R I F N - - - P D L K S I L S L A H P L P V F S I D D N R I S S Q S . . . . . . 5803 AGAGAAGACACAAGCACCTTGTAACTGATCGAACAAGTCATCAATGCGGGGAAGAGGATA R E D T S T L - L I E Q V I N A G K R I E K T Q A P C N - S N K S S M R G R G Y - R R H K H L V T D R T S H Q C G E E D . . . . . . 5743 CTTGTTCTTTATGGTTACCTTGTTTAGTTGTCTGTAGTCTATACACATTCGAAAGCTCCC L V L Y G Y L V - L S V V Y T H S K A P L F F M V T L F S C L - S I H I R K L P T C S L W L P C L V V C S L Y T F E S S . . . . . . 5683 ATCCTTCTTCTTTACAAACAAAACCGGAGCACCCCAAGGAGATGCACTTGGTCTAATAAA I L L L Y K Q N R S T P R R C T W S N K S F F F T N K T G A P Q G D A L G L I K H P S S L Q T K P E H P K E M H L V - - . . . . . . 5623 GCCTTTGTTCAATAACTCTTGAAGTTGTGCCTTTAACTCTCTTAACTCTGCGGGAGCCAT A F V Q - L L K L C L - L S - L C G S H P L F N N S - S C A F N S L N S A G A I S L C S I T L E V V P L T L L T L R E P . . . . . . 5563 TCTATAAGGGGGTATAGAAATGGGGCGTGTGCCCGGTTCTAGATCGATACAGAAGTCAAT S I R G Y R N G A C A R F - I D T E V N L - G G I E M G R V P G S R S I Q K S I F Y K G V - K W G V C P V L D R Y R S Q . . . . . . 5503 ATCCCTATCTGGTGGCATACCAGGAAGATCTGCAGGGAACACATCCAGAAACTCACGGAC I P I W W H T R K I C R E H I Q K L T D S L S G G I P G R S A G N T S R N S R T Y P Y L V A Y Q E D L Q G T H P E T H G . . . . . . 5443 TACTGAAACCGACTCAATCGAAGGCACTTGGGTAGTGTCATCCTTGAGATGTGCCAAGAA Y - N R L N R R H L G S V I L E M C Q E T E T D S I E G T W V V S S L R C A K K L L K P T Q S K A L G - C H P - D V P R . . . . . . 5383 AGCTAAACAACCTTTACTAACCATTTTCTTAGCACGAAGAAAGGAGATGATATGCACCGG S - T T F T N H F L S T K K G D D M H R A K Q P L L T I F L A R R K E M I C T G K L N N L Y - P F S - H E E R R - Y A P . . . . . . 5323 ATTGGAAGCGTTGTCACCCTCCCACACTAACGGATCTGTCCCAGGCTTGGCTAACGTCAC I G S V V T L P H - R I C P R L G - R H L E A L S P S H T N G S V P G L A N V T D W K R C H P P T L T D L S Q A W L T S . . . . . . 5263 CGTTTTAGCATTACAATCCAAGATCGCAAATTGCGGAGAAAGCCAAGTCATACCTAGAAT R F S I T I Q D R K L R R K P S H T - N V L A L Q S K I A N C G E S Q V I P R I P F - H Y N P R S Q I A E K A K S Y L E . . . . . . 5203 TACATCAAAATCATCCATTTCTAAGATAACCAAATCTACATAAGTGTTGCTCCCTACAAA Y I K I I H F - D N Q I Y I S V A P Y K T S K S S I S K I T K S T - V L L P T K L H Q N H P F L R - P N L H K C C S L Q . . . . . . 5143 GTTCACCAAAAAAGACCTATACACCTTTTCAACTACCACAGATTCACCCACCGGAGTAGA V H Q K R P I H L F N Y H R F T H R S R F T K K D L Y T F S T T T D S P T G V E S S P K K T Y T P F Q L P Q I H P P E - . . . . . . 5083 AACACGAATAGGCATATCAAGTAATTCACAATGTAAATTTAGACCATTAGCAAATGAGGA N T N R H I K - F T M - I - T I S K - G T R I G I S S N S Q C K F R P L A N E E K H E - A Y Q V I H N V N L D H - Q M R . . . . . . 5023 AGATACATAAGAAAATGTGGATCCAGGATCAAACAATACAGAGGCCATGCAATCACAAAC R Y I R K C G S R I K Q Y R G H A I T N D T - E N V D P G S N N T E A M Q S Q T K I H K K M W I Q D Q T I Q R P C N H K . . . . . . 4963 CAGAAGATTACCTGTGATGACAGCATCAGATGCCTCCGCTTCAGACCGCCCAGGGAAAGC Q K I T C D D S I R C L R F R P P R E S R R L P V M T A S D A S A S D R P G K A P E D Y L - - Q H Q M P P L Q T A Q G K . . . . . . 4903 GTAACAATGGGCCCTATCGTTCGTCTGTCCGTTGCCCCTAACTTGTTGTGATGTAGTGGC V T M G P I V R L S V A P N L L - C S G - Q W A L S F V C P L P L T C C D V V A R N N G P Y R S S V R C P - L V V M - W . . . . . . 4843 TCCAGTTTGCCCATCACCTTGGCCGTTTTGGTTACCACCATTTCCTTGACCACCACGTCC S S L P I T L A V L V T T I S L T T T S P V C P S P W P F W L P P F P - P P R P L Q F A H H L G R F G Y H H F L D H H V . . . . . . 4783 TCCAGAATAACGGCCTCTGCCATGACCACCTCTACCTCTAACATTTGGAGGTCTGTAACT S R I T A S A M T T S T S N I W R S V T P E - R P L P - P P L P L T F G G L - L L Q N N G L C H D H L Y L - H L E V C N . . . . . . 4723 CTGTTTTGGACAATATCTCTTAATATGTTCGATCTCCCCACATCCATAACACTTTCTGGG L F W T I S L N M F D L P T S I T L S G C F G Q Y L L I C S I S P H P - H F L G S V L D N I S - Y V R S P H I H N T F W . . . . . . 4663 TTCATGCATAGGTCTCTCAGAGAAGTGTTGACCGGTCGGAGGTGGACCCCCAACTACAGT F M H R S L R E V L T G R R W T P N Y S S C I G L S E K C - P V G G G P P T T V V H A - V S Q R S V D R S E V D P Q L Q . . . . . . 4603 CTGTAGTGAAGACTGAATTGGTCGGACTGAGTAACTTCCCGAACCCTGTCCTCTAGTGTA L - - R L N W S D - V T S R T L S S S V C S E D - I G R T E - L P E P C P L V - S V V K T E L V G L S N F P N P V L - C . . . . . . 4543 AGCACCATTAAACTCACCTCCCTTTCGAAACCTTTTTGATGTCAATGTCGGGGTGAAGTC S T I K L T S L S K P F - C Q C R G E V A P L N S P P F R N L F D V N V G V K S K H H - T H L P F E T F L M S M S G - S . . . . . . 4483 GTCTGGCTTCACTCCTTCCACTTCTATCACAAAGTCTACCACCTCTTGGAAGGATTTTGC V W L H S F H F Y H K V Y H L L E G F C S G F T P S T S I T K S T T S W K D F A R L A S L L P L L S Q S L P P L G R I L . . . . . . 4423 CGTTGCCACTATCTGTAAGGCCGAAATCCGCAATTCTGACCTCAACCCCTTCACAAACCG R C H Y L - G R N P Q F - P Q P L H K P V A T I C K A E I R N S D L N P F T N R P L P L S V R P K S A I L T S T P S Q T . . . . . . 4363 GCGAATTCGCTCTTGTGGACTGAAACACAGTTGGGTGGCATACCGGGATAATGCACGAAA A N S L L W T E T Q L G G I P G - C T K R I R S C G L K H S W V A Y R D N A R N G E F A L V D - N T V G W H T G I M H E . . . . . . 4303 CTTAGCCTCATATGCATTGACCGACATCCTACCTTGCTCTAGGCTCAAGAACTCATCCCT L S L I C I D R H P T L L - A Q E L I P L A S Y A L T D I L P C S R L K N S S L T - P H M H - P T S Y L A L G S R T H P . . . . . . 4243 TTTCCTATCCCTCAAAGTTCGGGGGATATACTTCTCCATAAACAAGCTAGAGAATGAGGC F P I P Q S S G D I L L H K Q A R E - G F L S L K V R G I Y F S I N K L E N E A F S Y P S K F G G Y T S P - T S - R M R . . . . . . 4183 CCAAGTCATAGGTGGTGCCTCTGTTGGTTGACACTCAATATGTGACCGCCACCACATTTT P S H R W C L C W L T L N M - P P P H F Q V I G G A S V G - H S I C D R H H I L P K S - V V P L L V D T Q Y V T A T T F . . . . . . 4123 GGCATTACCTTGAAACTGATAACTTACGAACTCAACACCAAACCGTTCTACTATACCCAT G I T L K L I T Y E L N T K P F Y Y T H A L P - N - - L T N S T P N R S T I P I W H Y L E T D N L R T Q H Q T V L L Y P . . . . . . 4063 CTTGTGTAGTAGCTCATGACAGTCAACCAGAAAATCGTAAGCATCCTCAGATTCCGCACC L V - - L M T V N Q K I V S I L R F R T L C S S S - Q S T R K S - A S S D S A P S C V V A H D S Q P E N R K H P Q I P H . . . . . . 4003 CTTGAATACTGGAGGTTTCAATTTCAAGAACTTACTGAAAAGTTCATGCTGATCATTTGT L E Y W R F Q F Q E L T E K F M L I I C L N T G G F N F K N L L K S S C - S F V P - I L E V S I S R T Y - K V H A D H L . . . . . . 3943 CATTATAGGCCCAGTAGTCAGACGTGGAAACGTGCCTATTTCCAATGGAACATCCATGCG H Y R P S S Q T W K R A Y F Q W N I H A I I G P V V R R G N V P I S N G T S M R S L - A Q - S D V E T C L F P M E H P C . . . . . . 3883 GGGAGCCATAGTAGCCGCATGTTGTACCTCCGGAGCCTGAGGTGCTGGTGTAGAAAACAC G S H S S R M L Y L R S L R C W C R K H G A I V A A C C T S G A - G A G V E N T G E P - - P H V V P P E P E V L V - K T . . . . . . 3823 TGGGGTGTCTGGCCCTGATCATATAACCCGCTAAGATAAGCCAGAACCTGATTGATCATC W G V W P - S Y N P L R - A R T - L I I G V S G P D H I T R - D K P E P D - S S L G C L A L I I - P A K I S Q N L I D H . . . . . . 3763 TCTGGGGTAGGTTGGGGTGGCAATCCCTCATTCTGCACTTGTTCAGTTTCCCCATCGTCC S G V G W G G N P S F C T C S V S P S S L G - V G V A I P H S A L V Q F P H R P L W G R L G W Q S L I L H L F S F P I V . . . . . . 3703 CCTTCTCTTATTACTTCCTCAGTCGGTGGAGGAGTCACTGCCCTAGTATCAGATGGGCTA P S L I T S S V G G G V T A L V S D G L L L L L L P Q S V E E S L P - Y Q M G - P F S Y Y F L S R W R S H C P S I R W A . . . . . . 3643 GGCGCTCGTCCTCTTCCCCTAGAGGACGTCCTCCCACTACCTCTACCATGGCCCCTTGCC G A R P L P L E D V L P L P L P W P L A A L V L F P - R T S S H Y L Y H G P L P R R S S S S P R G R P P T T S T M A P C . 3583 GCTGTTCT A V L F R C S Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-2-_PGL-1_AGS-8_PPS_1 (5556 5161) (frame '2'; 393 bp, 131 residues) 1 GGIEMGRVPG SRSIQKSISL SGGIPGRSAG NTSRNSRTTE TDSIEGTWVV SSLRCAKKAK 61 QPLLTIFLAR RKEMICTGLE ALSPSHTNGS VPGLANVTVL ALQSKIANCG ESQVIPRITS 121 KSSISKITKS T- >C06HBa0153O03.1-2-_PGL-1_AGS-8_PPS_2 (4542 4153) (frame '2'; 387 bp, 129 residues) 1 APLNSPPFRN LFDVNVGVKS SGFTPSTSIT KSTTSWKDFA VATICKAEIR NSDLNPFTNR 61 RIRSCGLKHS WVAYRDNARN LASYALTDIL PCSRLKNSSL FLSLKVRGIY FSINKLENEA 121 QVIGGASVG- >C06HBa0153O03.1-2-_PGL-1_AGS-8_PPS_3 (4852 4598) (frame '1'; 252 bp, 84 residues) 1 CSGSSLPITL AVLVTTISLT TTSSRITASA MTTSTSNIWR SVTLFWTISL NMFDLPTSIT 61 LSGFMHRSLR EVLTGRRWTP NYSL- >C06HBa0153O03.1-2-_PGL-1_AGS-8_PPS_4 (4051 3806) (frame '1'; 243 bp, 81 residues) 1 LMTVNQKIVS ILRFRTLEYW RFQFQELTEK FMLIICHYRP SSQTWKRAYF QWNIHAGSHS 61 SRMLYLRSLR CWCRKHWGVW P- >C06HBa0153O03.1-2-_PGL-1_AGS-8_PPS_5 (6019 5780) (frame '1'; 237 bp, 79 residues) 1 VYQYIINKYD HEEVQIWLKN PVHQAHERNR GVHKTKRHHY KFVMPIPRSK SSLWHIRCPY 61 FQLMITGSQV NLREDTSTL- >C06HBa0153O03.1-2-_PGL-1_AGS-8_PPS_6 (3797 3576) (frame '0'; 222 bp, 74 residues) 1 PAKISQNLID HLWGRLGWQS LILHLFSFPI VPFSYYFLSR WRSHCPSIRW ARRSSSSPRG 61 RPPTTSTMAP CRCS 3-phase translation of AGS-8 (+strand): . . . . . . 3576 AGAACAGCGGCAAGGGGCCATGGTAGAGGTAGTGGGAGGACGTCCTCTAGGGGAAGAGGA R T A A R G H G R G S G R T S S R G R G E Q R Q G A M V E V V G G R P L G E E D N S G K G P W - R - W E D V L - G K R . . . . . . 3636 CGAGCGCCTAGCCCATCTGATACTAGGGCAGTGACTCCTCCACCGACTGAGGAAGTAATA R A P S P S D T R A V T P P P T E E V I E R L A H L I L G Q - L L H R L R K - - T S A - P I - Y - G S D S S T D - G S N . . . . . . 3696 AGAGAAGGGGACGATGGGGAAACTGAACAAGTGCAGAATGAGGGATTGCCACCCCAACCT R E G D D G E T E Q V Q N E G L P P Q P E K G T M G K L N K C R M R D C H P N L K R R G R W G N - T S A E - G I A T P T . . . . . . 3756 ACCCCAGAGATGATCAATCAGGTTCTGGCTTATCTTAGCGGGTTATATGATCAGGGCCAG T P E M I N Q V L A Y L S G L Y D Q G Q P Q R - S I R F W L I L A G Y M I R A R Y P R D D Q S G S G L S - R V I - S G P . . . . . . 3816 ACACCCCAGTGTTTTCTACACCAGCACCTCAGGCTCCGGAGGTACAACATGCGGCTACTA T P Q C F L H Q H L R L R R Y N M R L L H P S V F Y T S T S G S G G T T C G Y Y D T P V F S T P A P Q A P E V Q H A A T . . . . . . 3876 TGGCTCCCCGCATGGATGTTCCATTGGAAATAGGCACGTTTCCACGTCTGACTACTGGGC W L P A W M F H W K - A R F H V - L L G G S P H G C S I G N R H V S T S D Y W A M A P R M D V P L E I G T F P R L T T G . . . . . . 3936 CTATAATGACAAATGATCAGCATGAACTTTTCAGTAAGTTCTTGAAATTGAAACCTCCAG L - - Q M I S M N F S V S S - N - N L Q Y N D K - S A - T F Q - V L E I E T S S P I M T N D Q H E L F S K F L K L K P P . . . . . . 3996 TATTCAAGGGTGCGGAATCTGAGGATGCTTACGATTTTCTGGTTGACTGTCATGAGCTAC Y S R V R N L R M L T I F W L T V M S Y I Q G C G I - G C L R F S G - L S - A T V F K G A E S E D A Y D F L V D C H E L . . . . . . 4056 TACACAAGATGGGTATAGTAGAACGGTTTGGTGTTGAGTTCGTAAGTTATCAGTTTCAAG Y T R W V - - N G L V L S S - V I S F K T Q D G Y S R T V W C - V R K L S V S R L H K M G I V E R F G V E F V S Y Q F Q . . . . . . 4116 GTAATGCCAAAATGTGGTGGCGGTCACATATTGAGTGTCAACCAACAGAGGCACCACCTA V M P K C G G G H I L S V N Q Q R H H L - C Q N V V A V T Y - V S T N R G T T Y G N A K M W W R S H I E C Q P T E A P P . . . . . . 4176 TGACTTGGGCCTCATTCTCTAGCTTGTTTATGGAGAAGTATATCCCCCGAACTTTGAGGG - L G P H S L A C L W R S I S P E L - G D L G L I L - L V Y G E V Y P P N F E G M T W A S F S S L F M E K Y I P R T L R . . . . . . 4236 ATAGGAAAAGGGATGAGTTCTTGAGCCTAGAGCAAGGTAGGATGTCGGTCAATGCATATG I G K G M S S - A - S K V G C R S M H M - E K G - V L E P R A R - D V G Q C I - D R K R D E F L S L E Q G R M S V N A Y . . . . . . 4296 AGGCTAAGTTTCGTGCATTATCCCGGTATGCCACCCAACTGTGTTTCAGTCCACAAGAGC R L S F V H Y P G M P P N C V S V H K S G - V S C I I P V C H P T V F Q S T R A E A K F R A L S R Y A T Q L C F S P Q E . . . . . . 4356 GAATTCGCCGGTTTGTGAAGGGGTTGAGGTCAGAATTGCGGATTTCGGCCTTACAGATAG E F A G L - R G - G Q N C G F R P Y R - N S P V C E G V E V R I A D F G L T D S R I R R F V K G L R S E L R I S A L Q I . . . . . . 4416 TGGCAACGGCAAAATCCTTCCAAGAGGTGGTAGACTTTGTGATAGAAGTGGAAGGAGTGA W Q R Q N P S K R W - T L - - K W K E - G N G K I L P R G G R L C D R S G R S E V A T A K S F Q E V V D F V I E V E G V . . . . . . 4476 AGCCAGACGACTTCACCCCGACATTGACATCAAAAAGGTTTCGAAAGGGAGGTGAGTTTA S Q T T S P R H - H Q K G F E R E V S L A R R L H P D I D I K K V S K G R - V - K P D D F T P T L T S K R F R K G G E F . . . . . . 4536 ATGGTGCTTACACTAGAGGACAGGGTTCGGGAAGTTACTCAGTCCGACCAATTCAGTCTT M V L T L E D R V R E V T Q S D Q F S L W C L H - R T G F G K L L S P T N S V F N G A Y T R G Q G S G S Y S V R P I Q S . . . . . . 4596 CACTACAGACTGTAGTTGGGGGTCCACCTCCGACCGGTCAACACTTCTCTGAGAGACCTA H Y R L - L G V H L R P V N T S L R D L T T D C S W G S T S D R S T L L - E T Y S L Q T V V G G P P P T G Q H F S E R P . . . . . . 4656 TGCATGAACCCAGAAAGTGTTATGGATGTGGGGAGATCGAACATATTAAGAGATATTGTC C M N P E S V M D V G R S N I L R D I V A - T Q K V L W M W G D R T Y - E I L S M H E P R K C Y G C G E I E H I K R Y C . . . . . . 4716 CAAAACAGAGTTACAGACCTCCAAATGTTAGAGGTAGAGGTGGTCATGGCAGAGGCCGTT Q N R V T D L Q M L E V E V V M A E A V K T E L Q T S K C - R - R W S W Q R P L P K Q S Y R P P N V R G R G G H G R G R . . . . . . 4776 ATTCTGGAGGACGTGGTGGTCAAGGAAATGGTGGTAACCAAAACGGCCAAGGTGATGGGC I L E D V V V K E M V V T K T A K V M G F W R T W W S R K W W - P K R P R - W A Y S G G R G G Q G N G G N Q N G Q G D G . . . . . . 4836 AAACTGGAGCCACTACATCACAACAAGTTAGGGGCAACGGACAGACGAACGATAGGGCCC K L E P L H H N K L G A T D R R T I G P N W S H Y I T T S - G Q R T D E R - G P Q T G A T T S Q Q V R G N G Q T N D R A . . . . . . 4896 ATTGTTACGCTTTCCCTGGGCGGTCTGAAGCGGAGGCATCTGATGCTGTCATCACAGGTA I V T L S L G G L K R R H L M L S S Q V L L R F P W A V - S G G I - C C H H R - H C Y A F P G R S E A E A S D A V I T G . . . . . . 4956 ATCTTCTGGTTTGTGATTGCATGGCCTCTGTATTGTTTGATCCTGGATCCACATTTTCTT I F W F V I A W P L Y C L I L D P H F L S S G L - L H G L C I V - S W I H I F L N L L V C D C M A S V L F D P G S T F S . . . . . . 5016 ATGTATCTTCCTCATTTGCTAATGGTCTAAATTTACATTGTGAATTACTTGATATGCCTA M Y L P H L L M V - I Y I V N Y L I C L C I F L I C - W S K F T L - I T - Y A Y Y V S S S F A N G L N L H C E L L D M P . . . . . . 5076 TTCGTGTTTCTACTCCGGTGGGTGAATCTGTGGTAGTTGAAAAGGTGTATAGGTCTTTTT F V F L L R W V N L W - L K R C I G L F S C F Y S G G - I C G S - K G V - V F F I R V S T P V G E S V V V E K V Y R S F . . . . . . 5136 TGGTGAACTTTGTAGGGAGCAACACTTATGTAGATTTGGTTATCTTAGAAATGGATGATT W - T L - G A T L M - I W L S - K W M I G E L C R E Q H L C R F G Y L R N G - F L V N F V G S N T Y V D L V I L E M D D . . . . . . 5196 TTGATGTAATTCTAGGTATGACTTGGCTTTCTCCGCAATTTGCGATCTTGGATTGTAATG L M - F - V - L G F L R N L R S W I V M - C N S R Y D L A F S A I C D L G L - C F D V I L G M T W L S P Q F A I L D C N . . . . . . 5256 CTAAAACGGTGACGTTAGCCAAGCCTGGGACAGATCCGTTAGTGTGGGAGGGTGACAACG L K R - R - P S L G Q I R - C G R V T T - N G D V S Q A W D R S V S V G G - Q R A K T V T L A K P G T D P L V W E G D N . . . . . . 5316 CTTCCAATCCGGTGCATATCATCTCCTTTCTTCGTGCTAAGAAAATGGTTAGTAAAGGTT L P I R C I S S P F F V L R K W L V K V F Q S G A Y H L L S S C - E N G - - R L A S N P V H I I S F L R A K K M V S K G . . . . . . 5376 GTTTAGCTTTCTTGGCACATCTCAAGGATGACACTACCCAAGTGCCTTCGATTGAGTCGG V - L S W H I S R M T L P K C L R L S R F S F L G T S Q G - H Y P S A F D - V G C L A F L A H L K D D T T Q V P S I E S . . . . . . 5436 TTTCAGTAGTCCGTGAGTTTCTGGATGTGTTCCCTGCAGATCTTCCTGGTATGCCACCAG F Q - S V S F W M C S L Q I F L V C H Q F S S P - V S G C V P C R S S W Y A T R V S V V R E F L D V F P A D L P G M P P . . . . . . 5496 ATAGGGATATTGACTTCTGTATCGATCTAGAACCGGGCACACGCCCCATTTCTATACCCC I G I L T S V S I - N R A H A P F L Y P - G Y - L L Y R S R T G H T P H F Y T P D R D I D F C I D L E P G T R P I S I P . . . . . . 5556 CTTATAGAATGGCTCCCGCAGAGTTAAGAGAGTTAAAGGCACAACTTCAAGAGTTATTGA L I E W L P Q S - E S - R H N F K S Y - L - N G S R R V K R V K G T T S R V I E P Y R M A P A E L R E L K A Q L Q E L L . . . . . . 5616 ACAAAGGCTTTATTAGACCAAGTGCATCTCCTTGGGGTGCTCCGGTTTTGTTTGTAAAGA T K A L L D Q V H L L G V L R F C L - R Q R L Y - T K C I S L G C S G F V C K E N K G F I R P S A S P W G A P V L F V K . . . . . . 5676 AGAAGGATGGGAGCTTTCGAATGTGTATAGACTACAGACAACTAAACAAGGTAACCATAA R R M G A F E C V - T T D N - T R - P - E G W E L S N V Y R L Q T T K Q G N H K K K D G S F R M C I D Y R Q L N K V T I . . . . . . 5736 AGAACAAGTATCCTCTTCCCCGCATTGATGACTTGTTCGATCAGTTACAAGGTGCTTGTG R T S I L F P A L M T C S I S Y K V L V E Q V S S S P H - - L V R S V T R C L C K N K Y P L P R I D D L F D Q L Q G A C . . . . . . 5796 TCTTCTCTAAGATTGACTTGAGATCCGGTTATCATCAATTGAAAATACGGGCAACGGATG S S L R L T - D P V I I N - K Y G Q R M L L - D - L E I R L S S I E N T G N G C V F S K I D L R S G Y H Q L K I R A T D . . . . . . 5856 TGCCAAAGACTGCTTTTCGAACGAGGTATGGGCATTACAAATTTGTAGTGATGTCTTTTG C Q R L L F E R G M G I T N L - - C L L A K D C F S N E V W A L Q I C S D V F W V P K T A F R T R Y G H Y K F V V M S F . . . . . . 5916 GTCTTATGAACGCCCCTGTTGCGTTCATGAGCTTGATGAACGGGATTTTTAAGCCATATT V L - T P L L R S - A - - T G F L S H I S Y E R P C C V H E L D E R D F - A I F G L M N A P V A F M S L M N G I F K P Y . . . . . . 5976 TGGACCTCTTCGTGATCGTATTTATTGATGATATATTGGTATACTCAAAGAGCAAGAAGG W T S S - S Y L L M I Y W Y T Q R A R R G P L R D R I Y - - Y I G I L K E Q E G L D L F V I V F I D D I L V Y S K S K K . . . . . . 6036 AACATGAAGAGCATTTGAGAATGGTATTGGAAATGTTGAGGGAGAAAAAGCTTTATGCCA N M K S I - E W Y W K C - G R K S F M P T - R A F E N G I G N V E G E K A L C Q E H E E H L R M V L E M L R E K K L Y A . . . . . . 6096 AATTCTCTAAGTGTGAGTTTTGGCTAGATGCAGTGTCCTTCTTGGGGCACGTGGTTTCTA N S L S V S F G - M Q C P S W G T W F L I L - V - V L A R C S V L L G A R G F - K F S K C E F W L D A V S F L G H V V S . . . . . . 6156 AGGATGGAGTGATGGTGGATCCTTCTAAGATTGAGACAGTGAAGAATTGGGTAAGACCTA R M E - W W I L L R L R Q - R I G - D L G W S D G G S F - D - D S E E L G K T Y K D G V M V D P S K I E T V K N W V R P . . . . . . 6216 CTAATGTGTCAGAAATAAGGAGCTTTGTTGGGTTAGCTAGCTACTACCGTCGATTTGTCA L M C Q K - G A L L G - L A T T V D L S - C V R N K E L C W V S - L L P S I C Q T N V S E I R S F V G L A S Y Y R R F V . 6276 AGGGATTC R D G I K G F Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-2+_PGL-1_AGS-8_PPS_1 (3806 6283) (frame '0'; 2478 bp, 826 residues) 1 SGPDTPVFST PAPQAPEVQH AATMAPRMDV PLEIGTFPRL TTGPIMTNDQ HELFSKFLKL 61 KPPVFKGAES EDAYDFLVDC HELLHKMGIV ERFGVEFVSY QFQGNAKMWW RSHIECQPTE 121 APPMTWASFS SLFMEKYIPR TLRDRKRDEF LSLEQGRMSV NAYEAKFRAL SRYATQLCFS 181 PQERIRRFVK GLRSELRISA LQIVATAKSF QEVVDFVIEV EGVKPDDFTP TLTSKRFRKG 241 GEFNGAYTRG QGSGSYSVRP IQSSLQTVVG GPPPTGQHFS ERPMHEPRKC YGCGEIEHIK 301 RYCPKQSYRP PNVRGRGGHG RGRYSGGRGG QGNGGNQNGQ GDGQTGATTS QQVRGNGQTN 361 DRAHCYAFPG RSEAEASDAV ITGNLLVCDC MASVLFDPGS TFSYVSSSFA NGLNLHCELL 421 DMPIRVSTPV GESVVVEKVY RSFLVNFVGS NTYVDLVILE MDDFDVILGM TWLSPQFAIL 481 DCNAKTVTLA KPGTDPLVWE GDNASNPVHI ISFLRAKKMV SKGCLAFLAH LKDDTTQVPS 541 IESVSVVREF LDVFPADLPG MPPDRDIDFC IDLEPGTRPI SIPPYRMAPA ELRELKAQLQ 601 ELLNKGFIRP SASPWGAPVL FVKKKDGSFR MCIDYRQLNK VTIKNKYPLP RIDDLFDQLQ 661 GACVFSKIDL RSGYHQLKIR ATDVPKTAFR TRYGHYKFVV MSFGLMNAPV AFMSLMNGIF 721 KPYLDLFVIV FIDDILVYSK SKKEHEEHLR MVLEMLREKK LYAKFSKCEF WLDAVSFLGH 781 VVSKDGVMVD PSKIETVKNW VRPTNVSEIR SFVGLASYYR RFVKGF AGS-9 (5675 5185,4440 4267) SCR (e 0.943 d 0.000 a 0.000,e 0.897) Exon 1 5675 5185 ( 491 n); score: 0.943 Intron 1 5184 4441 ( 744 n); Pd: 0.000 Pa: 0.000 Exon 2 4440 4267 ( 174 n); score: 0.897 PGS (5675 5185,4440 4267) SGN-E353359+ 3-phase translation of AGS-9 (-strand): . . . . . . 5675 TCTTTACAAACAAAACCGGAGCACCCCAAGGAGATGCACTTGGTCTAATAAAGCCTTTGT S L Q T K P E H P K E M H L V - - S L C L Y K Q N R S T P R R C T W S N K A F V F T N K T G A P Q G D A L G L I K P L . . . . . . 5615 TCAATAACTCTTGAAGTTGTGCCTTTAACTCTCTTAACTCTGCGGGAGCCATTCTATAAG S I T L E V V P L T L L T L R E P F Y K Q - L L K L C L - L S - L C G S H S I R F N N S - S C A F N S L N S A G A I L - . . . . . . 5555 GGGGTATAGAAATGGGGCGTGTGCCCGGTTCTAGATCGATACAGAAGTCAATATCCCTAT G V - K W G V C P V L D R Y R S Q Y P Y G Y R N G A C A R F - I D T E V N I P I G G I E M G R V P G S R S I Q K S I S L . . . . . . 5495 CTGGTGGCATACCAGGAAGATCTGCAGGGAACACATCCAGAAACTCACGGACTACTGAAA L V A Y Q E D L Q G T H P E T H G L L K W W H T R K I C R E H I Q K L T D Y - N S G G I P G R S A G N T S R N S R T T E . . . . . . 5435 CCGACTCAATCGAAGGCACTTGGGTAGTGTCATCCTTGAGATGTGCCAAGAAAGCTAAAC P T Q S K A L G - C H P - D V P R K L N R L N R R H L G S V I L E M C Q E S - T T D S I E G T W V V S S L R C A K K A K . . . . . . 5375 AACCTTTACTAACCATTTTCTTAGCACGAAGAAAGGAGATGATATGCACCGGATTGGAAG N L Y - P F S - H E E R R - Y A P D W K T F T N H F L S T K K G D D M H R I G S Q P L L T I F L A R R K E M I C T G L E . . . . . . 5315 CGTTGTCACCCTCCCACACTAACGGATCTGTCCCAGGCTTGGCTAACGTCACCGTTTTAG R C H P P T L T D L S Q A W L T S P F - V V T L P H - R I C P R L G - R H R F S A L S P S H T N G S V P G L A N V T V L . . . . . . 5255 CATTACAATCCAAGATCGCAAATTGCGGAGAAAGCCAAGTCATACCTAGAATTACATCAA H Y N P R S Q I A E K A K S Y L E L H Q I T I Q D R K L R R K P S H T - N Y I K A L Q S K I A N C G E S Q V I P R I T S . . : . . . . 5195 AATCATCCATT : TCTTGGAAGGATTTTGCCGTTGCCACTATCTGTAAGGCCGAAATCCGCA N H P F : L G R I L P L P L S V R P K S A I I H : F L E G F C R C H Y L - G R N P Q K S S I : S W K D F A V A T I C K A E I R . . . . . . 4391 ATTCTGACCTCAACCCCTTCACAAACCGGCGAATTCGCTCTTGTGGACTGAAACACAGTT I L T S T P S Q T G E F A L V D - N T V F - P Q P L H K P A N S L L W T E T Q L N S D L N P F T N R R I R S C G L K H S . . . . . . 4331 GGGTGGCATACCGGGATAATGCACGAAACTTAGCCTCATATGCATTGACCGACATCCTAC G W H T G I M H E T - P H M H - P T S Y G G I P G - C T K L S L I C I D R H P T W V A Y R D N A R N L A S Y A L T D I L . 4271 CTTGC L L P C Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-2-_PGL-1_AGS-9_PPS_1 (5556 5185,4440 4267) (frame '0'; 546 bp, 182 residues) 1 GGIEMGRVPG SRSIQKSISL SGGIPGRSAG NTSRNSRTTE TDSIEGTWVV SSLRCAKKAK 61 QPLLTIFLAR RKEMICTGLE ALSPSHTNGS VPGLANVTVL ALQSKIANCG ESQVIPRITS 121 KSSISWKDFA VATICKAEIR NSDLNPFTNR RIRSCGLKHS WVAYRDNARN LASYALTDIL 181 PC AGS-10 (5095 5073,5037 4507) SCR (e 0.652 d 0.729 a 0.000,e 0.911) Exon 1 5095 5073 ( 23 n); score: 0.652 Intron 1 5072 5038 ( 35 n); Pd: 0.729 Pa: 0.000 Exon 2 5037 4507 ( 531 n); score: 0.911 PGS (5095 5073,5037 4507) SGN-E577713+ 3-phase translation of AGS-10 (-strand): . . . : . . . 5095 CACCGGAGTAGAAACACGAATAG : TTAGCAAATGAGGAAGATACATAAGAAAATGTGGATC H R S R N T N S : - Q M R K I H K K M W I T G V E T R I : V S K - G R Y I R K C G S P E - K H E - : L A N E E D T - E N V D . . . . . . 5000 CAGGATCAAACAATACAGAGGCCATGCAATCACAAACCAGAAGATTACCTGTGATGACAG Q D Q T I Q R P C N H K P E D Y L - - Q R I K Q Y R G H A I T N Q K I T C D D S P G S N N T E A M Q S Q T R R L P V M T . . . . . . 4940 CATCAGATGCCTCCGCTTCAGACCGCCCAGGGAAAGCGTAACAATGGGCCCTATCGTTCG H Q M P P L Q T A Q G K R N N G P Y R S I R C L R F R P P R E S V T M G P I V R A S D A S A S D R P G K A - Q W A L S F . . . . . . 4880 TCTGTCCGTTGCCCCTAACTTGTTGTGATGTAGTGGCTCCAGTTTGCCCATCACCTTGGC S V R C P - L V V M - W L Q F A H H L G L S V A P N L L - C S G S S L P I T L A V C P L P L T C C D V V A P V C P S P W . . . . . . 4820 CGTTTTGGTTACCACCATTTCCTTGACCACCACGTCCTCCAGAATAACGGCCTCTGCCAT R F G Y H H F L D H H V L Q N N G L C H V L V T T I S L T T T S S R I T A S A M P F W L P P F P - P P R P P E - R P L P . . . . . . 4760 GACCACCTCTACCTCTAACATTTGGAGGTCTGTAACTCTGTTTTGGACAATATCTCTTAA D H L Y L - H L E V C N S V L D N I S - T T S T S N I W R S V T L F W T I S L N - P P L P L T F G G L - L C F G Q Y L L . . . . . . 4700 TATGTTCGATCTCCCCACATCCATAACACTTTCTGGGTTCATGCATAGGTCTCTCAGAGA Y V R S P H I H N T F W V H A - V S Q R M F D L P T S I T L S G F M H R S L R E I C S I S P H P - H F L G S C I G L S E . . . . . . 4640 AGTGTTGACCGGTCGGAGGTGGACCCCCAACTACAGTCTGTAGTGAAGACTGAATTGGTC S V D R S E V D P Q L Q S V V K T E L V V L T G R R W T P N Y S L - - R L N W S K C - P V G G G P P T T V C S E D - I G . . . . . . 4580 GGACTGAGTAACTTCCCGAACCCTGTCCTCTAGTGTAAGCACCATTAAACTCACCTCCCT G L S N F P N P V L - C K H H - T H L P D - V T S R T L S S S V S T I K L T S L R T E - L P E P C P L V - A P L N S P P . . 4520 TTCGAAACCTTTTT F E T F S K P F F R N L F Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-2-_PGL-1_AGS-10_PPS_1 (4852 4598) (frame '2'; 252 bp, 84 residues) 1 CSGSSLPITL AVLVTTISLT TTSSRITASA MTTSTSNIWR SVTLFWTISL NMFDLPTSIT 61 LSGFMHRSLR EVLTGRRWTP NYSL- AGS-11 (7540 6511) SCR (e 0.865) Exon 1 7540 6511 (1030 n); score: 0.865 PGS (7095 6511) SGN-E352950+ PGS (7095 6511) SGN-E357100+ PGS (7095 6544) SGN-E352647+ PGS (7540 7001) SGN-E353207+ PGS (7510 7211) SGN-E578131+ 3-phase translation of AGS-11 (-strand): . . . . . . 7540 TCTGCATGCAATGTTTTCCAAAACTTAGAAGTAAACTGCGTACCCCTATCTGATATGATG S A C N V F Q N L E V N C V P L S D M M L H A M F S K T - K - T A Y P Y L I - W C M Q C F P K L R S K L R T P I - Y D . . . . . . 7480 GATAGTGGAACCCCATGCAATCGAACGATTTCTGAGATATAGATCTTGGCTAACTTCTCT D S G T P C N R T I S E I - I L A N F S I V E P H A I E R F L R Y R S W L T S L G - W N P M Q S N D F - D I D L G - L L . . . . . . 7420 GCATTGTAAGTCACCTTTACCGGAATGAAATGAGCAGATTTAGTTAACCTATCAACAATC A L - V T F T G M K - A D L V N L S T I H C K S P L P E - N E Q I - L T Y Q Q S C I V S H L Y R N E M S R F S - P I N N . . . . . . 7360 ACCCAAATGGAGTCATACTTACCCATTGTCCTTGGAAGACCAACCACAAAGTCCATTGCA T Q M E S Y L P I V L G R P T T K S I A P K W S H T Y P L S L E D Q P Q S P L Q H P N G V I L T H C P W K T N H K V H C . . . . . . 7300 ATTCTTTCCCACTTCCATTCCGGAATGGGCATTCTCTGAAGTGTTCCTCCGGGCCTTTGG I L S H F H S G M G I L - S V P P G L W F F P T S I P E W A F S E V F L R A F G N S F P L P F R N G H S L K C S S G P L . . . . . . 7240 TGTTCATACTTTACTTGTTGACAGTTTGGACACTTGGCAATAAAGTCAACAATATCACGC C S Y F T C - Q F G H L A I K S T I S R V H T L L V D S L D T W Q - S Q Q Y H A V F I L Y L L T V W T L G N K V N N I T . . . . . . 7180 TTCATTCTACTCCACCAAAAGTGTTGTTTTAGGTCACGATACATCTTGGTTGCACTTGGA F I L L H Q K C C F R S R Y I L V A L G S F Y S T K S V V L G H D T S W L H L D L H S T P P K V L F - V T I H L G C T W . . . . . . 7120 TGTATAGAATACCTTGAACTATGAGCCTCTGTCAGAATAGTGTTGATTAAATCATTGACG C I E Y L E L - A S V R I V L I K S L T V - N T L N Y E P L S E - C - L N H - R M Y R I P - T M S L C Q N S V D - I I D . . . . . . 7060 GGGTACACATACCCTTCCCTTGATTCTCAAAACACCTTCCTCATCGATTTGTGCTTCCTT G Y T Y P S L D S Q N T F L I D L C F L G T H T L P L I L K T P S S S I C A S L G V H I P F P - F S K H L P H R F V L P . . . . . . 7000 AGCCTCTCCTCGCAATACCTTATCTTGGATTCTTCTTAGTTTCTCATCATCAAACTGTTT S L S S Q Y L I L D S S - F L I I K L F A S P R N T L S W I L L S F S S S N C F - P L L A I P Y L G F F L V S H H Q T V . . . . . . 6940 TCCCTTAATTTTGTCAAGAAAAGAAGATCTTGACTCCACACTAGCCAACAATCCTCCCTT S L N F V K K R R S - L H T S Q Q S S L P L I L S R K E D L D S T L A N N P P F F P - F C Q E K K I L T P H - P T I L P . . . . . . 6880 CTCATTTACTTCTAATATCATCAAGTCATTAGCTAGAGTCTGAACCTCTCTAGCCAATGG L I Y F - Y H Q V I S - S L N L S S Q W S F T S N I I K S L A R V - T S L A N G S H L L L I S S S H - L E S E P L - P M . . . . . . 6820 GCGTCTAGAAGCTTGCAAGTGAGCTAGACTTCCCATGCTTCCCACCTTTCTACTTAAAGC A S R S L Q V S - T S H A S H L S T - S R L E A C K - A R L P M L P T F L L K A G V - K L A S E L D F P C F P P F Y L K . . . . . . 6760 ATCCGCTACAACATTAGCCTTCCCCGGATGATACAAAATAGTGATATCGTAGTCCTTTAG I R Y N I S L P R M I Q N S D I V V L - S A T T L A F P G - Y K I V I S - S F S H P L Q H - P S P D D T K - - Y R S P L . . . . . . 6700 TAACTCCATCCATCTCCTCTGTCTTAAGTTCAAATCTTTCTGAGTAAAGACATACTGTAG - L H P S P L S - V Q I F L S K D I L - N S I H L L C L K F K S F - V K T Y C R V T P S I S S V L S S N L S E - R H T V . . . . . . 6640 GCTACGATGATCCGTATAGATCTCACACTTAACCCCATATAAATAGTGTCTCCATTGCTT A T M I R I D L T L N P I - I V S P L L L R - S V - I S H L T P Y K - C L H C F G Y D D P Y R S H T - P H I N S V S I A . . . . . . 6580 TAATGCAAACACCACCGCAGCCAATTCCAAATCGTGGGTCGGATAGTTACGTTCATGCAC - C K H H R S Q F Q I V G R I V T F M H N A N T T A A N S K S W V G - L R S C T L M Q T P P Q P I P N R G S D S Y V H A . 6520 CTTTAGTTGC L - L F S C P L V Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-2-_PGL-1_AGS-11_PPS_2 (7373 7149) (frame '0'; 222 bp, 74 residues) 1 PINNHPNGVI LTHCPWKTNH KVHCNSFPLP FRNGHSLKCS SGPLVFILYL LTVWTLGNKV 61 NNITLHSTPP KVLF- >C06HBa0153O03.1-2-_PGL-1_AGS-11_PPS_1 (7062 6838) (frame '2'; 222 bp, 74 residues) 1 RGTHTLPLIL KTPSSSICAS LASPRNTLSW ILLSFSSSNC FPLILSRKED LDSTLANNPP 61 FSFTSNIIKS LARV- 3-phase translation of AGS-11 (+strand): . . . . . . 6511 GCAACTAAAGGTGCATGAACGTAACTATCCGACCCACGATTTGGAATTGGCTGCGGTGGT A T K G A - T - L S D P R F G I G C G G Q L K V H E R N Y P T H D L E L A A V V N - R C M N V T I R P T I W N W L R W . . . . . . 6571 GTTTGCATTAAAGCAATGGAGACACTATTTATATGGGGTTAAGTGTGAGATCTATACGGA V C I K A M E T L F I W G - V - D L Y G F A L K Q W R H Y L Y G V K C E I Y T D C L H - S N G D T I Y M G L S V R S I R . . . . . . 6631 TCATCGTAGCCTACAGTATGTCTTTACTCAGAAAGATTTGAACTTAAGACAGAGGAGATG S S - P T V C L Y S E R F E L K T E E M H R S L Q Y V F T Q K D L N L R Q R R W I I V A Y S M S L L R K I - T - D R G D . . . . . . 6691 GATGGAGTTACTAAAGGACTACGATATCACTATTTTGTATCATCCGGGGAAGGCTAATGT D G V T K G L R Y H Y F V S S G E G - C M E L L K D Y D I T I L Y H P G K A N V G W S Y - R T T I S L F C I I R G R L M . . . . . . 6751 TGTAGCGGATGCTTTAAGTAGAAAGGTGGGAAGCATGGGAAGTCTAGCTCACTTGCAAGC C S G C F K - K G G K H G K S S S L A S V A D A L S R K V G S M G S L A H L Q A L - R M L - V E R W E A W E V - L T C K . . . . . . 6811 TTCTAGACGCCCATTGGCTAGAGAGGTTCAGACTCTAGCTAATGACTTGATGATATTAGA F - T P I G - R G S D S S - - L D D I R S R R P L A R E V Q T L A N D L M I L E L L D A H W L E R F R L - L M T - - Y - . . . . . . 6871 AGTAAATGAGAAGGGAGGATTGTTGGCTAGTGTGGAGTCAAGATCTTCTTTTCTTGACAA S K - E G R I V G - C G V K I F F S - Q V N E K G G L L A S V E S R S S F L D K K - M R R E D C W L V W S Q D L L F L T . . . . . . 6931 AATTAAGGGAAAACAGTTTGATGATGAGAAACTAAGAAGAATCCAAGATAAGGTATTGCG N - G K T V - - - E T K K N P R - G I A I K G K Q F D D E K L R R I Q D K V L R K L R E N S L M M R N - E E S K I R Y C . . . . . . 6991 AGGAGAGGCTAAGGAAGCACAAATCGATGAGGAAGGTGTTTTGAGAATCAAGGGAAGGGT R R G - G S T N R - G R C F E N Q G K G G E A K E A Q I D E E G V L R I K G R V E E R L R K H K S M R K V F - E S R E G . . . . . . 7051 ATGTGTACCCCGTCAATGATTTAATCAACACTATTCTGACAGAGGCTCATAGTTCAAGGT M C T P S M I - S T L F - Q R L I V Q G C V P R Q - F N Q H Y S D R G S - F K V Y V Y P V N D L I N T I L T E A H S S R . . . . . . 7111 ATTCTATACATCCAAGTGCAACCAAGATGTATCGTGACCTAAAACAACACTTTTGGTGGA I L Y I Q V Q P R C I V T - N N T F G G F Y T S K C N Q D V S - P K T T L L V E Y S I H P S A T K M Y R D L K Q H F W W . . . . . . 7171 GTAGAATGAAGCGTGATATTGTTGACTTTATTGCCAAGTGTCCAAACTGTCAACAAGTAA V E - S V I L L T L L P S V Q T V N K - - N E A - Y C - L Y C Q V S K L S T S K S R M K R D I V D F I A K C P N C Q Q V . . . . . . 7231 AGTATGAACACCAAAGGCCCGGAGGAACACTTCAGAGAATGCCCATTCCGGAATGGAAGT S M N T K G P E E H F R E C P F R N G S V - T P K A R R N T S E N A H S G M E V K Y E H Q R P G G T L Q R M P I P E W K . . . . . . 7291 GGGAAAGAATTGCAATGGACTTTGTGGTTGGTCTTCCAAGGACAATGGGTAAGTATGACT G K E L Q W T L W L V F Q G Q W V S M T G K N C N G L C G W S S K D N G - V - L W E R I A M D F V V G L P R T M G K Y D . . . . . . 7351 CCATTTGGGTGATTGTTGATAGGTTAACTAAATCTGCTCATTTCATTCCGGTAAAGGTGA P F G - L L I G - L N L L I S F R - R - H L G D C - - V N - I C S F H S G K G D S I W V I V D R L T K S A H F I P V K V . . . . . . 7411 CTTACAATGCAGAGAAGTTAGCCAAGATCTATATCTCAGAAATCGTTCGATTGCATGGGG L T M Q R S - P R S I S Q K S F D C M G L Q C R E V S Q D L Y L R N R S I A W G T Y N A E K L A K I Y I S E I V R L H G . . . . . . 7471 TTCCACTATCCATCATATCAGATAGGGGTACGCAGTTTACTTCTAAGTTTTGGAAAACAT F H Y P S Y Q I G V R S L L L S F G K H S T I H H I R - G Y A V Y F - V L E N I V P L S I I S D R G T Q F T S K F W K T . 7531 TGCATGCAGA C M Q A C R L H A Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-2+_PGL-1_AGS-11_PPS_1 (6512 7069) (frame '2'; 555 bp, 185 residues) 1 QLKVHERNYP THDLELAAVV FALKQWRHYL YGVKCEIYTD HRSLQYVFTQ KDLNLRQRRW 61 MELLKDYDIT ILYHPGKANV VADALSRKVG SMGSLAHLQA SRRPLAREVQ TLANDLMILE 121 VNEKGGLLAS VESRSSFLDK IKGKQFDDEK LRRIQDKVLR GEAKEAQIDE EGVLRIKGRV 181 CVPRQ- AGS-12 (7762 7586) SCR (e 0.966) Exon 1 7762 7586 ( 177 n); score: 0.966 PGS (7762 7586) SGN-E577888+ 3-phase translation of AGS-12 (-strand): . . . . . . 7762 CTACATCTCCTACCATACAATGCTTCAAATGGAGCCATATCAATGCTTGAGTGATAGCTA L H L L P Y N A S N G A I S M L E - - L Y I S Y H T M L Q M E P Y Q C L S D S Y T S P T I Q C F K W S H I N A - V I A . . . . . . 7702 TTATTGTATGAAAACTCCGCTAAGGGTAGGAAGCTATCCCAATGACCACCAAACTCTATC L L Y E N S A K G R K L S Q - P P N S I Y C M K T P L R V G S Y P N D H Q T L S I I V - K L R - G - E A I P M T T K L Y . . . . . . 7642 ACACACGCACGAAGCATATCTTCCAACACTTGAATCGTCCTTTCAGACTGACCATCG T H A R S I S S N T - I V L S D - P S H T H E A Y L P T L E S S F Q T D H H T R T K H I F Q H L N R P F R L T I Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-12 (+strand): . . . . . . 7586 CGATGGTCAGTCTGAAAGGACGATTCAAGTGTTGGAAGATATGCTTCGTGCGTGTGTGAT R W S V - K D D S S V G R Y A S C V C D D G Q S E R T I Q V L E D M L R A C V I M V S L K G R F K C W K I C F V R V - . . . . . . 7646 AGAGTTTGGTGGTCATTGGGATAGCTTCCTACCCTTAGCGGAGTTTTCATACAATAATAG R V W W S L G - L P T L S G V F I Q - - E F G G H W D S F L P L A E F S Y N N S - S L V V I G I A S Y P - R S F H T I I . . . . . . 7706 CTATCACTCAAGCATTGATATGGCTCCATTTGAAGCATTGTATGGTAGGAGATGTAG L S L K H - Y G S I - S I V W - E M - Y H S S I D M A P F E A L Y G R R C A I T Q A L I W L H L K H C M V G D V Maximal non-overlapping open reading frames (>= 64 codons): none AGS-13 (9121 7823) SCR (e 0.889) Exon 1 9121 7823 (1299 n); score: 0.889 PGS (8585 7823) SGN-E354383- PGS (8703 8166) SGN-E252199- PGS (8976 8421) SGN-E550127- PGS (8976 8421) SGN-E550140- PGS (8976 8421) SGN-E389553- PGS (8976 8421) SGN-E389834+ PGS (8976 8421) SGN-E550201+ PGS (8976 8421) SGN-E550207+ PGS (8976 8421) SGN-E550335+ PGS (8976 8421) SGN-E390013+ PGS (8976 8421) SGN-E550484+ PGS (8976 8421) SGN-E550211+ PGS (8976 8421) SGN-E550464+ PGS (8976 8421) SGN-E549941+ PGS (8976 8421) SGN-E550025+ PGS (8976 8421) SGN-E231589+ PGS (8976 8421) SGN-E374999+ PGS (8976 8421) SGN-E396039+ PGS (8976 8421) SGN-E396054+ PGS (8976 8421) SGN-E396056+ PGS (8976 8421) SGN-E396058+ PGS (8976 8421) SGN-E377133+ PGS (8976 8421) SGN-E550212+ PGS (8976 8421) SGN-E550065+ PGS (8954 8421) SGN-E550322+ PGS (8927 8421) SGN-E377132- PGS (8548 8421) SGN-E356257+ PGS (8534 8421) SGN-E275667- PGS (8976 8431) SGN-E241959+ PGS (9121 8466) SGN-E349296- PGS (8976 8504) SGN-E236652+ PGS (8932 8540) SGN-E356257- 3-phase translation of AGS-13 (-strand): . . . . . . 9121 AATACTACTAACACATATCATTCGCTATTAAGAGTTTGCTACGAATAGCATGAAATAACC N T T N T Y H S L L R V C Y E - H E I T I L L T H I I R Y - E F A T N S M K - P Y Y - H I S F A I K S L L R I A - N N . . . . . . 9061 ATAACCTACCTCCACTGAAGATTAGTGATTAAGCAAGAAATTCCCAAGGCTTTTGTTCCT I T Y L H - R L V I K Q E I P K A F V P - P T S T E D - - L S K K F P R L L F L H N L P P L K I S D - A R N S Q G F C S . . . . . . 9001 TCTTCTCGTTCGATCCTCCCTCAATTCGTTTCTCTTTCCCTCTCTTTGTTCTTTCTATTT S S R S I L P Q F V S L S L S L F F L F L L V R S S L N S F L F P S L C S F Y F F F S F D P P S I R F S F P L F V L S I . . . . . . 8941 TCTTATTCCAACCCTCTTTCTTTTACCCTAATTAGTATATAATTAAGAATAAAAGATGAC S Y S N P L S F T L I S I - L R I K D D L I P T L F L L P - L V Y N - E - K M T F L F Q P S F F Y P N - Y I I K N K R - . . . . . . 8881 AATAATACCCCACTAATTAACTTAAGGTTACCTCTTTTAACCCCCAAGGATTTTGAGTTA N N T P L I N L R L P L L T P K D F E L I I P H - L T - G Y L F - P P R I L S Y Q - Y P T N - L K V T S F N P Q G F - V . . . . . . 8821 TTAATATAAACCCATGAAATATATAATCATAGCAGGAATAGTCCAAAACGCCCCTTTAAA L I - T H E I Y N H S R N S P K R P F K - Y K P M K Y I I I A G I V Q N A P L K I N I N P - N I - S - Q E - S K T P L - . . . . . . 8761 ACTTAACCAGAAATCTGACTCCAACTGGGATTGCGCAACCTGTGACGGGCCGTCGTGCCT T - P E I - L Q L G L R N L - R A V V P L N Q K S D S N W D C A T C D G P S C L N L T R N L T P T G I A Q P V T G R R A . . . . . . 8701 GGGACGGTCCGTCCTGCAGGTCGTCGCAAAGTTCAGAGACCCAATATTTCCACCAAGGGT G T V R P A G R R K V Q R P N I S T K G G R S V L Q V V A K F R D P I F P P R V W D G P S C R S S Q S S E T Q Y F H Q G . . . . . . 8641 CTGTGACGGTCCGTCACACCTGTGACGGTCCGTCCTGCCATTCCGTCACGAAGTTCAGAG L - R S V T P V T V R P A I P S R S S E C D G P S H L - R S V L P F R H E V Q R S V T V R H T C D G P S C H S V T K F R . . . . . . 8581 AGTCGATTTTCTGTACCCAATTTTAGATTTTCTAAGTGTTTTGAAACGAGACCCTGCGAC S R F S V P N F R F S K C F E T R P C D V D F L Y P I L D F L S V L K R D P A T E S I F C T Q F - I F - V F - N E T L R . . . . . . 8521 GGTCCGTCGTGCCCATGACGGTCCGTCATTGGGTTCGTCGCCTCAGCCTGTTTTTCCAGA G P S C P - R S V I G F V A S A C F S R V R R A H D G P S L G S S P Q P V F P E R S V V P M T V R H W V R R L S L F F Q . . . . . . 8461 AATAAAATCTGCTGCTCAAAACGACTAAACAGGTCGTTACAATAGATACCAATTTACCCA N K I C C S K R L N R S L Q - I P I Y P I K S A A Q N D - T G R Y N R Y Q F T H K - N L L L K T T K Q V V T I D T N L P . . . . . . 8401 TCGTTCGTCCCCGAACGATCACAAGAAGGAAAACAAGGGCGAAAAGGAGTACCTGAATCT S F V P E R S Q E G K Q G R K G V P E S R S S P N D H K K E N K G E K E Y L N L I V R P R T I T R R K T R A K R S T - I . . . . . . 8341 GTAAACAGATGTGGGTATTTTTCTCGCATATCCGCCTCCTTCTCCCAAGTGGCTTCATCA V N R C G Y F S R I S A S F S Q V A S S - T D V G I F L A Y P P P S P K W L H Q C K Q M W V F F S H I R L L L P S G F I . . . . . . 8281 ACGGGTCGATTCTTCCATTGCACCTTGATGGATGCAATCTCTCTTGACCTCAACTTGCGA T G R F F H C T L M D A I S L D L N L R R V D S S I A P - W M Q S L L T S T C E N G S I L P L H L D G C N L S - P Q L A . . . . . . 8221 ACTTCTCTATCTAAAATAGCAACAGGCTCCTCCTCATAAGACAAGTTCTCATCAAGCAAA T S L S K I A T G S S S - D K F S S S K L L Y L K - Q Q A P P H K T S S H Q A K N F S I - N S N R L L L I R Q V L I K Q . . . . . . 8161 ACTGAATCCCAACGGATAATGTAATTTCCATTCCCATGATATCTTTTCAACATAGACACA T E S Q R I M - F P F P - Y L F N I D T L N P N G - C N F H S H D I F S T - T H N - I P T D N V I S I P M I S F Q H R H . . . . . . 8101 TGGAATACCGGATGTACTCCGGACAGCCCTGGAGGCAAGGCTAACTCATAAGCCACCTCT W N T G C T P D S P G G K A N S - A T S G I P D V L R T A L E A R L T H K P P L M E Y R M Y S G Q P W R Q G - L I S H L . . . . . . 8041 CCTACTCGCTTAAGTACTTCAAATGGTCCAATGTACCTTGGACTTAGTTTACCCCTTTTT P T R L S T S N G P M Y L G L S L P L F L L A - V L Q M V Q C T L D L V Y P F F S Y S L K Y F K W S N V P W T - F T P F . . . . . . 7981 CCGAACCGCATCACCCCTTTCATGGGCGAAACTTTCAACAAGACTTGTTCACCTTCCATG P N R I T P F M G E T F N K T C S P S M R T A S P L S W A K L S T R L V H L P - S E P H H P F H G R N F Q Q D L F T F H . . . . . . 7921 AACTCTAAGTCTCTAACCTTTCGATCTGCATATTCTTTTTGTCTACTTTGCGACGCTAAC N S K S L T F R S A Y S F C L L C D A N T L S L - P F D L H I L F V Y F A T L T E L - V S N L S I C I F F L S T L R R - . . . . 7861 AACTTTTCTTGAATAGATTTCACTTTATCTAACGATTCT N F S - I D F T L S N D S T F L E - I S L Y L T I Q L F L N R F H F I - R F Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-2-_PGL-1_AGS-13_PPS_1 (8416 8183) (frame '1'; 231 bp, 77 residues) 1 IPIYPSFVPE RSQEGKQGRK GVPESVNRCG YFSRISASFS QVASSTGRFF HCTLMDAISL 61 DLNLRTSLSK IATGSSS- >C06HBa0153O03.1-2-_PGL-1_AGS-13_PPS_2 (8762 8556) (frame '0'; 204 bp, 68 residues) 1 NLTRNLTPTG IAQPVTGRRA WDGPSCRSSQ SSETQYFHQG SVTVRHTCDG PSCHSVTKFR 61 ESIFCTQF- >C06HBa0153O03.1-2-_PGL-1_AGS-13_PPS_3 (8050 7850) (frame '1'; 198 bp, 66 residues) 1 ATSPTRLSTS NGPMYLGLSL PLFPNRITPF MGETFNKTCS PSMNSKSLTF RSAYSFCLLC 61 DANNFS- 3-phase translation of AGS-13 (+strand): . . . . . . 7823 AGAATCGTTAGATAAAGTGAAATCTATTCAAGAAAAGTTGTTAGCGTCGCAAAGTAGACA R I V R - S E I Y S R K V V S V A K - T E S L D K V K S I Q E K L L A S Q S R Q N R - I K - N L F K K S C - R R K V D . . . . . . 7883 AAAAGAATATGCAGATCGAAAGGTTAGAGACTTAGAGTTCATGGAAGGTGAACAAGTCTT K R I C R S K G - R L R V H G R - T S L K E Y A D R K V R D L E F M E G E Q V L K K N M Q I E R L E T - S S W K V N K S . . . . . . 7943 GTTGAAAGTTTCGCCCATGAAAGGGGTGATGCGGTTCGGAAAAAGGGGTAAACTAAGTCC V E S F A H E R G D A V R K K G - T K S L K V S P M K G V M R F G K R G K L S P C - K F R P - K G - C G S E K G V N - V . . . . . . 8003 AAGGTACATTGGACCATTTGAAGTACTTAAGCGAGTAGGAGAGGTGGCTTATGAGTTAGC K V H W T I - S T - A S R R G G L - V S R Y I G P F E V L K R V G E V A Y E L A Q G T L D H L K Y L S E - E R W L M S - . . . . . . 8063 CTTGCCTCCAGGGCTGTCCGGAGTACATCCGGTATTCCATGTGTCTATGTTGAAAAGATA L A S R A V R S T S G I P C V Y V E K I L P P G L S G V H P V F H V S M L K R Y P C L Q G C P E Y I R Y S M C L C - K D . . . . . . 8123 TCATGGGAATGGAAATTACATTATCCGTTGGGATTCAGTTTTGCTTGATGAGAACTTGTC S W E W K L H Y P L G F S F A - - E L V H G N G N Y I I R W D S V L L D E N L S I M G M E I T L S V G I Q F C L M R T C . . . . . . 8183 TTATGAGGAGGAGCCTGTTGCTATTTTAGATAGAGAAGTTCGCAAGTTGAGGTCAAGAGA L - G G A C C Y F R - R S S Q V E V K R Y E E E P V A I L D R E V R K L R S R E L M R R S L L L F - I E K F A S - G Q E . . . . . . 8243 GATTGCATCCATCAAGGTGCAATGGAAGAATCGACCCGTTGATGAAGCCACTTGGGAGAA D C I H Q G A M E E S T R - - S H L G E I A S I K V Q W K N R P V D E A T W E K R L H P S R C N G R I D P L M K P L G R . . . . . . 8303 GGAGGCGGATATGCGAGAAAAATACCCACATCTGTTTACAGATTCAGGTACTCCTTTTCG G G G Y A R K I P T S V Y R F R Y S F S E A D M R E K Y P H L F T D S G T P F R R R R I C E K N T H I C L Q I Q V L L F . . . . . . 8363 CCCTTGTTTTCCTTCTTGTGATCGTTCGGGGACGAACGATGGGTAAATTGGTATCTATTG P L F S F L - S F G D E R W V N W Y L L P C F P S C D R S G T N D G - I G I Y C A L V F L L V I V R G R T M G K L V S I . . . . . . 8423 TAACGACCTGTTTAGTCGTTTTGAGCAGCAGATTTTATTTCTGGAAAAACAGGCTGAGGC - R P V - S F - A A D F I S G K T G - G N D L F S R F E Q Q I L F L E K Q A E A V T T C L V V L S S R F Y F W K N R L R . . . . . . 8483 GACGAACCCAATGACGGACCGTCATGGGCACGACGGACCGTCGCAGGGTCTCGTTTCAAA D E P N D G P S W A R R T V A G S R F K T N P M T D R H G H D G P S Q G L V S K R R T Q - R T V M G T T D R R R V S F Q . . . . . . 8543 ACACTTAGAAAATCTAAAATTGGGTACAGAAAATCGACTCTCTGAACTTCGTGACGGAAT T L R K S K I G Y R K S T L - T S - R N H L E N L K L G T E N R L S E L R D G M N T - K I - N W V Q K I D S L N F V T E . . . . . . 8603 GGCAGGACGGACCGTCACAGGTGTGACGGACCGTCACAGACCCTTGGTGGAAATATTGGG G R T D R H R C D G P S Q T L G G N I G A G R T V T G V T D R H R P L V E I L G W Q D G P S Q V - R T V T D P W W K Y W . . . . . . 8663 TCTCTGAACTTTGCGACGACCTGCAGGACGGACCGTCCCAGGCACGACGGCCCGTCACAG S L N F A T T C R T D R P R H D G P S Q L - T L R R P A G R T V P G T T A R H R V S E L C D D L Q D G P S Q A R R P V T . . . . . . 8723 GTTGCGCAATCCCAGTTGGAGTCAGATTTCTGGTTAAGTTTTAAAGGGGCGTTTTGGACT V A Q S Q L E S D F W L S F K G A F W T L R N P S W S Q I S G - V L K G R F G L G C A I P V G V R F L V K F - R G V L D . . . . . . 8783 ATTCCTGCTATGATTATATATTTCATGGGTTTATATTAATAACTCAAAATCCTTGGGGGT I P A M I I Y F M G L Y - - L K I L G G F L L - L Y I S W V Y I N N S K S L G V Y S C Y D Y I F H G F I L I T Q N P W G . . . . . . 8843 TAAAAGAGGTAACCTTAAGTTAATTAGTGGGGTATTATTGTCATCTTTTATTCTTAATTA - K R - P - V N - W G I I V I F Y S - L K R G N L K L I S G V L L S S F I L N Y L K E V T L S - L V G Y Y C H L L F L I . . . . . . 8903 TATACTAATTAGGGTAAAAGAAAGAGGGTTGGAATAAGAAAATAGAAAGAACAAAGAGAG Y T N - G K R K R V G I R K - K E Q R E I L I R V K E R G L E - E N R K N K E R I Y - L G - K K E G W N K K I E R T K R . . . . . . 8963 GGAAAGAGAAACGAATTGAGGGAGGATCGAACGAGAAGAAGGAACAAAAGCCTTGGGAAT G K R N E L R E D R T R R R N K S L G N E R E T N - G R I E R E E G T K A L G I G K E K R I E G G S N E K K E Q K P W E . . . . . . 9023 TTCTTGCTTAATCACTAATCTTCAGTGGAGGTAGGTTATGGTTATTTCATGCTATTCGTA F L L N H - S S V E V G Y G Y F M L F V S C L I T N L Q W R - V M V I S C Y S - F L A - S L I F S G G R L W L F H A I R . . . . 9083 GCAAACTCTTAATAGCGAATGATATGTGTTAGTAGTATT A N S - - R M I C V S S I Q T L N S E - Y V L V V S K L L I A N D M C - - Y Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-2+_PGL-1_AGS-13_PPS_1 (7824 8408) (frame '2'; 582 bp, 194 residues) 1 ESLDKVKSIQ EKLLASQSRQ KEYADRKVRD LEFMEGEQVL LKVSPMKGVM RFGKRGKLSP 61 RYIGPFEVLK RVGEVAYELA LPPGLSGVHP VFHVSMLKRY HGNGNYIIRW DSVLLDENLS 121 YEEEPVAILD REVRKLRSRE IASIKVQWKN RPVDEATWEK EADMREKYPH LFTDSGTPFR 181 PCFPSCDRSG TNDG- >C06HBa0153O03.1-2+_PGL-1_AGS-13_PPS_2 (8409 8669) (frame '2'; 258 bp, 86 residues) 1 IGIYCNDLFS RFEQQILFLE KQAEATNPMT DRHGHDGPSQ GLVSKHLENL KLGTENRLSE 61 LRDGMAGRTV TGVTDRHRPL VEILGL- AGS-14 (10295 9497,9456 9230) SCR (e 0.889 d 0.000 a 0.000,e 0.965) Exon 1 10295 9497 ( 799 n); score: 0.889 Intron 1 9496 9457 ( 40 n); Pd: 0.000 Pa: 0.000 Exon 2 9456 9230 ( 227 n); score: 0.965 PGS (9785 9497,9456 9230) SGN-E242359- PGS (9975 9568) SGN-E246710- PGS (9875 9641) SGN-E209683- PGS (10290 9801) SGN-E351546- PGS (10290 9801) SGN-E356206- PGS (10290 9801) SGN-E356696- PGS (10152 9871) SGN-E222578+ PGS (10291 9874) SGN-E370357+ PGS (10295 9881) SGN-E392027+ PGS (10147 9885) SGN-E373117- PGS (10147 9885) SGN-E373116+ PGS (10165 9911) SGN-E216150+ PGS (10147 9929) SGN-E298638- PGS (10132 9960) SGN-E368629- PGS (10132 9960) SGN-E352844- PGS (10122 9960) SGN-E238551- 3-phase translation of AGS-14 (-strand): . . . . . . 10295 AACTATGTCACGACCCAAATCCGGGCCGCGTCTGGCACCCACACTTACCCTCCTATGTGA N Y V T T Q I R A A S G T H T Y P P M - T M S R P K S G P R L A P T L T L L C E L C H D P N P G R V W H P H L P S Y V . . . . . . 10235 GCGAACCAACCAATCTAAACCTTAACATTTCAATATAATATAACCAGAAAGTAATGCGGA A N Q P I - T L T F Q Y N I T R K - C G R T N Q S K P - H F N I I - P E S N A E S E P T N L N L N I S I - Y N Q K V M R . . . . . . 10175 AGACTTAAACTCATTAAATAAAGACCAATTCATTAACTTCTAAAATTCAACATCTATTAT R L K L I K - R P I H - L L K F N I Y Y D L N S L N K D Q F I N F - N S T S I I K T - T H - I K T N S L T S K I Q H L L . . . . . . 10115 TCCCCCAAAATCTGGAAGTCATCATCACAAGAACATCTACGATCAAATGACTAAACTAAG S P K I W K S S S Q E H L R S N D - T K P P K S G S H H H K N I Y D Q M T K L R F P Q N L E V I I T R T S T I K - L N - . . . . . . 10055 AGTATTCTAAAAGCTAAAAATACATAAGAAGCTAGTCCATGCCGGAAGTTCAAGGCATCA S I L K A K N T - E A S P C R K F K A S V F - K L K I H K K L V H A G S S R H Q E Y S K S - K Y I R S - S M P E V Q G I . . . . . . 9995 AGACTTGAAGAAGAAGACCCAGTCCAAGCTAGAAGCATTAGCTCACCCTGAATATCCGGT R L E E E D P V Q A R S I S S P - I S G D L K K K T Q S K L E A L A H P E Y P V K T - R R R P S P S - K H - L T L N I R . . . . . . 9935 ATGACGAAGACTGGCTAGAATCACTGCTGAGTTGAAGATGACGGAACGTTTGCTGCACTC M T K T G - N H C - V E D D G T F A A L - R R L A R I T A E L K M T E R L L H S Y D E D W L E S L L S - R - R N V C C T . . . . . . 9875 CACAAATAACAAGAAGAAAACATAAAAGTAGGGGTCAGTACAAAACACGGGTACTGAGTA H K - Q E E N I K V G V S T K H G Y - V T N N K K K T - K - G S V Q N T G T E - P Q I T R R K H K S R G Q Y K T R V L S . . . . . . 9815 GATATCATCGGCCAACTCAAAATAGAAAACAGTATGTATTAAGCAATATCATAAAATCAA D I I G Q L K I E N S M Y - A I S - N Q I S S A N S K - K T V C I K Q Y H K I N R Y H R P T Q N R K Q Y V L S N I I K S . . . . . . 9755 TTAATATCCTTAGCATGCAGCATTTACAGTTACCATAACCCTTGGTTACAACACCAAGCA L I S L A C S I Y S Y H N P W L Q H Q A - Y P - H A A F T V T I T L G Y N T K H I N I L S M Q H L Q L P - P L V T T P S . . . . . . 9695 CATCAATGAGGACTCACACCTCCTCATCACACTCATTTGGGAATTTAGTTCATTAGATTG H Q - G L T P P H H T H L G I - F I R L I N E D S H L L I T L I W E F S S L D W T S M R T H T S S S H S F G N L V H - I . . . . . . 9635 GATATATTAACATATTTCAAGATTCATTATCTTTATTCCCCTCGTGTCGGTACATGACAC D I L T Y F K I H Y L Y S P R V G T - H I Y - H I S R F I I F I P L V S V H D T G Y I N I F Q D S L S L F P S C R Y M T . . . . . . 9575 TCCGCTCCTCAATATACTATCCTGGTGTCGGAACGTGACACTCTGATCCTCATTCTATCC S A P Q Y T I L V S E R D T L I L I L S P L L N I L S W C R N V T L - S S F Y P L R S S I Y Y P G V G T - H S D P H S I . . : . . . . 9515 TGGTGTCGGAACGTGACAC : CCGATCCATATTCTATCCTGGTGTCAGAACGTGACACCCGA W C R N V T : P D P Y S I L V S E R D T R G V G T - H : P I H I L S W C Q N V T P D L V S E R D T : R S I F Y P G V R T - H P . . . . . . 9415 TCCATATCCTATCCTGGTACCGGAACGTGGCACCCGATCAATATTCTATCTTGGTGTCGG S I S Y P G T G T W H P I N I L S W C R P Y P I L V P E R G T R S I F Y L G V G I H I L S W Y R N V A P D Q Y S I L V S . . . . . . 9355 AACGTGACACCCGATCCATATTCTATCCTGGTACCGAAACGTGGCACCGGATCCCCTAAT N V T P D P Y S I L V P K R G T G S P N T - H P I H I L S W Y R N V A P D P L I E R D T R S I F Y P G T E T W H R I P - . . . . . . 9295 CTCATCACTTTCGTTCATCAAGCCTTCTTTTATACCAAGGCATCATCATTAACAAAGTAG L I T F V H Q A F F Y T K A S S L T K - S S L S F I K P S F I P R H H H - Q S R S H H F R S S S L L L Y Q G I I I N K V . 9235 ATTAGG I R L D - Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-2-_PGL-1_AGS-14_PPS_1 (9578 9497,9456 9236) (frame '1'; 300 bp, 100 residues) 1 HSAPQYTILV SERDTLILIL SWCRNVTPDP YSILVSERDT RSISYPGTGT WHPINILSWC 61 RNVTPDPYSI LVPKRGTGSP NLITFVHQAF FYTKASSLTK - AGS-15 (10017 9474,9396 9241) SCR (e 0.773 d 0.000 a 0.000,e 0.788) Exon 1 10017 9474 ( 544 n); score: 0.773 Intron 1 9473 9397 ( 77 n); Pd: 0.000 Pa: 0.000 Exon 2 9396 9241 ( 156 n); score: 0.788 PGS (10017 9474,9396 9241) SGN-E241789+ 3-phase translation of AGS-15 (-strand): . . . . . . 10017 ATGCCGGAAGTTCAAGGCATCAAGACTTGAAGAAGAAGACCCAGTCCAAGCTAGAAGCAT M P E V Q G I K T - R R R P S P S - K H C R K F K A S R L E E E D P V Q A R S I A G S S R H Q D L K K K T Q S K L E A . . . . . . 9957 TAGCTCACCCTGAATATCCGGTATGACGAAGACTGGCTAGAATCACTGCTGAGTTGAAGA - L T L N I R Y D E D W L E S L L S - R S S P - I S G M T K T G - N H C - V E D L A H P E Y P V - R R L A R I T A E L K . . . . . . 9897 TGACGGAACGTTTGCTGCACTCCACAAATAACAAGAAGAAAACATAAAAGTAGGGGTCAG - R N V C C T P Q I T R R K H K S R G Q D G T F A A L H K - Q E E N I K V G V S M T E R L L H S T N N K K K T - K - G S . . . . . . 9837 TACAAAACACGGGTACTGAGTAGATATCATCGGCCAACTCAAAATAGAAAACAGTATGTA Y K T R V L S R Y H R P T Q N R K Q Y V T K H G Y - V D I I G Q L K I E N S M Y V Q N T G T E - I S S A N S K - K T V C . . . . . . 9777 TTAAGCAATATCATAAAATCAATTAATATCCTTAGCATGCAGCATTTACAGTTACCATAA L S N I I K S I N I L S M Q H L Q L P - - A I S - N Q L I S L A C S I Y S Y H N I K Q Y H K I N - Y P - H A A F T V T I . . . . . . 9717 CCCTTGGTTACAACACCAAGCACATCAATGAGGACTCACACCTCCTCATCACACTCATTT P L V T T P S T S M R T H T S S S H S F P W L Q H Q A H Q - G L T P P H H T H L T L G Y N T K H I N E D S H L L I T L I . . . . . . 9657 GGGAATTTAGTTCATTAGATTGGATATATTAACATATTTCAAGATTCATTATCTTTATTC G N L V H - I G Y I N I F Q D S L S L F G I - F I R L D I L T Y F K I H Y L Y S W E F S S L D W I Y - H I S R F I I F I . . . . . . 9597 CCCTCGTGTCGGTACATGACACTCCGCTCCTCAATATACTATCCTGGTGTCGGAACGTGA P S C R Y M T L R S S I Y Y P G V G T - P R V G T - H S A P Q Y T I L V S E R D P L V S V H D T P L L N I L S W C R N V . . . . . . 9537 CACTCTGATCCTCATTCTATCCTGGTGTCGGAACGTGACACTCCGATCCTCATATACTAT H S D P H S I L V S E R D T P I L I Y Y T L I L I L S W C R N V T L R S S Y T I T L - S S F Y P G V G T - H S D P H I L . : . . . . . 9477 CCTG : CCGGAACGTGGCACCCGATCAATATTCTATCTTGGTGTCGGAACGTGACACCCGAT P : A G T W H P I N I L S W C R N V T P D L : P E R G T R S I F Y L G V G T - H P I S C : R N V A P D Q Y S I L V S E R D T R . . . . . . 9340 CCATATTCTATCCTGGTACCGAAACGTGGCACCGGATCCCCTAATCTCATCACTTTCGTT P Y S I L V P K R G T G S P N L I T F V H I L S W Y R N V A P D P L I S S L S F S I F Y P G T E T W H R I P - S H H F R . . . . 9280 CATCAAGCCTTCTTTTATACCAAGGCATCATCATTAACAA H Q A F F Y T K A S S L T I K P S F I P R H H H - Q S S S L L L Y Q G I I I N Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-2-_PGL-1_AGS-15_PPS_1 (9537 9474,9396 9242) (frame '1'; 219 bp, 73 residues) 1 HSDPHSILVS ERDTPILIYY PAGTWHPINI LSWCRNVTPD PYSILVPKRG TGSPNLITFV 61 HQAFFYTKAS SLT AGS-16 (10113 9563,9523 9413,9340 9316) SCR (e 0.808 d 0.000 a 0.000,e 0.914 d 0.900 a 0.000,e 0.920) Exon 1 10113 9563 ( 551 n); score: 0.808 Intron 1 9562 9524 ( 39 n); Pd: 0.000 Pa: 0.000 Exon 2 9523 9413 ( 111 n); score: 0.914 Intron 2 9412 9341 ( 72 n); Pd: 0.900 Pa: 0.000 Exon 3 9340 9316 ( 25 n); score: 0.920 PGS (10113 9563,9523 9413,9340 9316) SGN-E546506+ 3-phase translation of AGS-16 (-strand): . . . . . . 10113 CCCCAAAATCTGGAAGTCATCATCACAAGAACATCTACGATCAAATGACTAAACTAAGAG P Q N L E V I I T R T S T I K - L N - E P K I W K S S S Q E H L R S N D - T K S P K S G S H H H K N I Y D Q M T K L R . . . . . . 10053 TATTCTAAAAGCTAAAAATACATAAGAAGCTAGTCCATGCCGGAAGTTCAAGGCATCAAG Y S K S - K Y I R S - S M P E V Q G I K I L K A K N T - E A S P C R K F K A S R V F - K L K I H K K L V H A G S S R H Q . . . . . . 9993 ACTTGAAGAAGAAGACCCAGTCCAAGCTAGAAGCATTAGCTCACCCTGAATATCCGGTAT T - R R R P S P S - K H - L T L N I R Y L E E E D P V Q A R S I S S P - I S G M D L K K K T Q S K L E A L A H P E Y P V . . . . . . 9933 GACGAAGACTGGCTAGAATCACTGCTGAGTTGAAGATGACGGAACGTTTGCTGCACTCCA D E D W L E S L L S - R - R N V C C T P T K T G - N H C - V E D D G T F A A L H - R R L A R I T A E L K M T E R L L H S . . . . . . 9873 CAAATAACAAGAAGAAAACATAAAAGTAGGGGTCAGTACAAAACACGGGTACTGAGTAGA Q I T R R K H K S R G Q Y K T R V L S R K - Q E E N I K V G V S T K H G Y - V D T N N K K K T - K - G S V Q N T G T E - . . . . . . 9813 TATCATCGGCCAACTCAAAATAGAAAACAGTATGTATTAAGCAATATCATAAAATCAATT Y H R P T Q N R K Q Y V L S N I I K S I I I G Q L K I E N S M Y - A I S - N Q L I S S A N S K - K T V C I K Q Y H K I N . . . . . . 9753 AATATCCTTAGCATGCAGCATTTACAGTTACCATAACCCTTGGTTACAACACCAAGCACA N I L S M Q H L Q L P - P L V T T P S T I S L A C S I Y S Y H N P W L Q H Q A H - Y P - H A A F T V T I T L G Y N T K H . . . . . . 9693 TCAATGAGGACTCACACCTCCTCATCACACTCATTTGGGAATTTAGTTCATTAGATTGGA S M R T H T S S S H S F G N L V H - I G Q - G L T P P H H T H L G I - F I R L D I N E D S H L L I T L I W E F S S L D W . . . . . . 9633 TATATTAACATATTTCAAGATTCATTATCTTTATTCCCCTCGTGTCGGTACATGACACTC Y I N I F Q D S L S L F P S C R Y M T L I L T Y F K I H Y L Y S P R V G T - H S I Y - H I S R F I I F I P L V S V H D T . . : . . . . 9573 CGCTCCTCAAT : TTCTATCCTGGTGTCGGAACGTGACACTCCGATCCTCATATACTATCCT R S S I : S I L V S E R D T P I L I Y Y P A P Q : F L S W C R N V T L R S S Y T I L P L L N : F Y P G V G T - H S D P H I L S . . . . . . 9474 GGTACCGGAACGTGGTACCCGATCCATATTCTATCCTGGTGTCAGAACGTGACACCCGAT G T G T W Y P I H I L S W C Q N V T P D V P E R G T R S I F Y P G V R T - H P I W Y R N V V P D P Y S I L V S E R D T R . : . . 9414 CC : CCATATTCTATCCTGGTACCGAAAC P : H I L S W Y R N : P I F Y P G T E S : P Y S I L V P K Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-2-_PGL-1_AGS-16_PPS_1 (9639 9563,9523 9413,9340 9316) (frame '1'; 213 bp, 71 residues) 1 IGYINIFQDS LSLFPSCRYM TLRSSISILV SERDTPILIY YPGTGTWYPI HILSWCQNVT 61 PDPHILSWYR N AGS-17 (9552 9481,9447 9413) SCR (e 0.889 d 0.000 a 0.000,e 0.886) Exon 1 9552 9481 ( 72 n); score: 0.889 Intron 1 9480 9448 ( 33 n); Pd: 0.000 Pa: 0.000 Exon 2 9447 9413 ( 35 n); score: 0.886 PGS (9552 9481,9447 9413) SGN-E546548- 3-phase translation of AGS-17 (-strand): . . . . . . 9552 GGTGTCGGAACGTGACACTCTGATCCTCATTCTATCCTGGTGTCGGAACGTGACACTCCG G V G T - H S D P H S I L V S E R D T P V S E R D T L I L I L S W C R N V T L R C R N V T L - S S F Y P G V G T - H S . . : . . . 9492 ATCCTCATATAC : ATTCTATCCTGGTGTCAGAACGTGACACCCGATCC I L I Y : I L S W C Q N V T P D S S Y T : F Y P G V R T - H P I D P H I : H S I L V S E R D T R S Maximal non-overlapping open reading frames (>= 64 codons): none ... finished at: Mon Aug 28 22:19:17 2006 ________________________________________________________________________________ Sequence 3: C06HBa0153O03.1-3, from 1 to 1910, both strands analyzed. ... started at: Mon Aug 28 22:19:17 2006 EST library file: /tmp/cxgn-bacpublish-resources-T4dD4R/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 5 HitsTableSize = 0 EST library file: /tmp/cxgn-bacpublish-resources-T4dD4R/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 4 HitsTableSize = 0 No significant EST matches were found. ... finished at: Mon Aug 28 22:19:27 2006 ________________________________________________________________________________ Sequence 4: C06HBa0153O03.1-4, from 1 to 2666, both strands analyzed. ... started at: Mon Aug 28 22:19:27 2006 EST library file: /tmp/cxgn-bacpublish-resources-T4dD4R/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 3 EST library file: /tmp/cxgn-bacpublish-resources-T4dD4R/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 2 ******************************************************************************** EST sequence 5 -strand 713 n (File: SGN-E541821-) 1 AGACATGCAT CATTATACTT GACATTGCAT TCCATTGCAT TGCACATACT TACCATTAGT 61 GAACTTGATA TCGTATTTCG CTGATGTTTT GATTGCTTTT CTGCGGAACT TGTGATTGAT 121 CAATATTGAG CTTGTTATTG CGGATATGTA ATTTGTTGAA GTATTGTTGT TGAGGATATG 181 TAATTTGTTG AAGTATTGTT GTTGAGGATA TGTAATTTGT TGAAGTGTTG TTGTTGAGGA 241 TATGTAATTT GCCTAAGTGT TGTTGTTGAG CTATGTGCTA TGTAAATTGT GAGCTGTTAG 301 GTTGGGTTGA TTTTAATGCA GGTTGTATTT GTGGAGGTTC GGTTGGGGGT GGTAGGAGTA 361 CCCGTATTTC ATCCCCTTAG CTTATGTTTA GAGGTTTACT TGCTGAGTAC CGTGTGGTTT 421 GGTACTCACC CCTTGCTTCT ACAAATTTTT GTAGGTTATG AGCCTGGATT TTTGTTGTAC 481 TTGTCAGTCT CTTCTTTTCC GAGGCTTCAT GGAGATTTGT GAGGTAGCTG TTTCCATCTC 541 AGCAGACTTT CTTCTCCATA ATTATGATCT TGTTCTATTC TAGAAACAAG TTCATTTGAG 601 ACTTGAGTTT TTTTCTTTTG AATCTTGTAA TACTTTAGAG GCTTGTACAC GTGACTACCC 661 AGGTCTTGGG GGTTTAATTA AGTTACTTAT ATTTTATTTC TGCAAAAAAA AAA Predicted gene structure (within gDNA segment 1 to 2483): Exon 1 363 393 ( 31 n); cDNA 154 183 ( 30 n); score: 0.677 Intron 1 394 1226 ( 833 n); Pd: 0.000 (s: 0), Pa: 0.459 (s: 0.78) Exon 2 1227 1748 ( 522 n); cDNA 184 704 ( 521 n); score: 0.915 MATCH C06HBa0153O03.1-4+ SGN-E541821- 0.915 553 0.776 C PGS_C06HBa0153O03.1-4+_SGN-E541821- (363 393,1227 1748) Alignment (genomic DNA sequence = upper lines): TGTTGATGGA ATGGTTTTAA GGATGAGCTT ACATGGAGGG ATGAACACCA ACTATAGTAG 422 |||||| | | || | || | |||| | | | TGTTGAAGTA TTGTTGTTGA GGAT-ATGTA A......... .......... .......... 183 GTCATAACAT AAGATTTTGG TAATATGGTT ATATTTCCTT CTTCTTCCTT ATGTTCGAGG 482 .......... .......... .......... .......... .......... .......... 183 ACGAACATAG TTTTTAGTGG TGGATAATGT AATGACCCAA AAAATCAGCT TTTTTGAAAA 542 .......... .......... .......... .......... .......... .......... 183 TGCTAAACTA TTCTTTTAAC TTTAGTTAAT TTGTTGATGG ATAATTATAG ATAATTAAAT 602 .......... .......... .......... .......... .......... .......... 183 TAATTGAAAG TTAATGGACT AAAGTGATAA TTTATTTAAG ACCATATACT ATTTAACTCT 662 .......... .......... .......... .......... .......... .......... 183 TATATATTAA AATAAGAAGT AGGGTTTGTC AGTTTAGTAC ATAATTTAGA AAAAAAGATG 722 .......... .......... .......... .......... .......... .......... 183 GAGGAAAAGA ACGCCACAAG ATCGATGATG GATAACATAG GTACTAACCC TTTTGTTACT 782 .......... .......... .......... .......... .......... .......... 183 ATCAATTATT AGCAATGATT AGAAATGGTT GCTCTTGTTA TTGATGATTC TAAACATTTT 842 .......... .......... .......... .......... .......... .......... 183 CAGTGAGTTG CTTTTGTGAT GTGAATGTAA TCATTGGCTA ATAATTTCAG CCAATCATTA 902 .......... .......... .......... .......... .......... .......... 183 GATAATGAAT GTGTATGGCA CCATTTCAAA AGGTTTTAAT TCTGCCTATA GGTTTTATAG 962 .......... .......... .......... .......... .......... .......... 183 TTACGGATTT TATATCTATT AGTTGGTTAT GTATGTTCAG TTCATGGTTC AAATGTTGGT 1022 .......... .......... .......... .......... .......... .......... 183 ATATTAAGAT AGGCTACTAT GCTATTTGGA GAAATTATAA CTTAATTCTA CTTCGAAAAA 1082 .......... .......... .......... .......... .......... .......... 183 AAAAGATATG ATAATTAAAA TGTTAATTGG CATCAAAATT GAGAGTTGAA TTATAAATTC 1142 .......... .......... .......... .......... .......... .......... 183 GGATAAATTT TAGGTCTAAG CTAGACCTAT GGTAATATGA GTGTTGTTAG TTCAAAATCT 1202 .......... .......... .......... .......... .......... .......... 183 TATTGTGATT ATTAATGTGA ATAGATTGCT TTGATCTTGT TATTGCAGAT ATGAAATTTG 1262 ||| | | |||| | ||| ||| ||| |||||| .......... .......... ....TTTGTT GAAGTATTGT TGTTGAGGAT ATGTAATTTG 219 TTGAAGTGTT GTTGTTGAGG ATATGTAATT TGTCTAAGTA TTGTTGTTGA GCTACGTGCT 1322 |||||||||| |||||||||| |||||||||| || |||||| |||||||||| |||| ||||| TTGAAGTGTT GTTGTTGAGG ATATGTAATT TGCCTAAGTG TTGTTGTTGA GCTATGTGCT 279 ATGTAAACTG TGAGTTGTTA GGTTGGGTTG ATTTTTAATG CAGGTTGTAG TTGTGGAGGT 1382 ||||||| || |||| ||||| |||||||||| | |||||||| ||||||||| |||||||||| ATGTAAATTG TGAGCTGTTA GGTTGGGTTG A-TTTTAATG CAGGTTGTAT TTGTGGAGGT 338 TCGGTTGGGG GTGGTAGGAG TACCCGTATT TCATCCCCTT AGCTTGTGTT TAGAGGTTTA 1442 |||||||||| |||||||||| |||||||||| |||||||||| ||||| |||| |||||||||| TCGGTTGGGG GTGGTAGGAG TACCCGTATT TCATCCCCTT AGCTTATGTT TAGAGGTTTA 398 CTTGCTGAGT ACCGTGTGGT TTGGTACTCA CCCTTTGCTT CTACAAAATT TTTGTAGGTT 1502 |||||||||| |||||||||| |||||||||| ||| |||||| |||| ||||| |||||||||| CTTGCTGAGT ACCGTGTGGT TTGGTACTCA CCCCTTGCTT CTAC-AAATT TTTGTAGGTT 457 ATGAGCCTGG ATTTTTGTTG TACTTGTCAT TCTCTTCTTT TCCGAGGATT CTTGGAGATT 1562 |||||||||| |||||||||| ||||||||| |||||||||| ||||||| || | |||||||| ATGAGCCTGG ATTTTTGTTG TACTTGTCAG TCTCTTCTTT TCCGAGGCTT CATGGAGATT 517 CGTCAGGTAG CTGTTTCCAT CGCAGCAGAC TTTCTTCTCC TTAATTATGA TCTTGTTCTA 1622 || |||||| |||||||||| | |||||||| |||||||||| ||||||||| |||||||||| TGTGAGGTAG CTGTTTCCAT CTCAGCAGAC TTTCTTCTCC ATAATTATGA TCTTGTTCTA 577 TTCTAGAAAT AAGTTCATTT GAGACTTGAG --TTTTTCTT TTGAATCAAT TGTAATACTT 1680 ||||||||| |||||||||| |||||||||| |||||||| ||||||| | |||||||||| TTCTAGAAAC AAGTTCATTT GAGACTTGAG TTTTTTTCTT TTGAATC--T TGTAATACTT 635 TAGAGGCTTG TACACGTGAC TA-CCAGGTT TTGGAGTTTT TATTAAGTTA CTTATATTTT 1739 |||||||||| |||||||||| || |||||| |||| | ||| ||||||||| |||||||||| TAGAGGCTTG TACACGTGAC TACCCAGGTC TTGGGGGTTT AATTAAGTTA CTTATATTTT 695 ATTTTCGCA 1748 |||| ||| ATTTCTGCA 704 hqPGS_C06HBa0153O03.1-4+_SGN-E541821- (1227 1748) ******************************************************************************** EST sequence 4 -strand 319 n (File: SGN-E577986-) 1 TTTTGCGGAA CTTGTGATTG TTGAATACTG AGCTTGTTAT TGAGGATATA TAATTTGTTG 61 AAGTGTTGTT GTTGAGGATA TGTGATTTGT TGAAGTGTGT TATTGAGGAT ATGTAAATTT 121 TTGGAAGTGT TGCTATTGAG GATATCTAAT TTGTTTAAGT GTTGATGTTG AGGATATGTA 181 ATTTGTTAAA GTGTTGTTGT GGAGGATATG TAATTTGTCT AAATGTTTTT GTAGAGTTAC 241 GTTCTATGTA AACTGTGAGT TGTTAGGTTG TGCTGATGTT TATGTAGGTT GTAGTTGTGG 301 AGGTTCGGTT GGGGTGGTA Predicted gene structure (within gDNA segment 137 to 2042): Exon 1 1228 1398 ( 171 n); cDNA 151 319 ( 169 n); score: 0.860 MATCH C06HBa0153O03.1-4+ SGN-E577986- 0.860 171 0.536 C PGS_C06HBa0153O03.1-4+_SGN-E577986- (1228 1398) Alignment (genomic DNA sequence = upper lines): TTGCTTTGAT CTTGTTATTG CAGATATGAA ATTTGTTGAA GTGTTGTTGT TGAGGATATG 1287 ||| || | ||| | ||| |||||| | ||||||| || |||||||||| ||||||||| TTGTTTAAGT GTTGATGTTG AGGATATGTA ATTTGTTAAA GTGTTGTTGT GGAGGATATG 210 TAATTTGTCT AAGTATTGTT GTTGAGCTAC GTGCTATGTA AACTGTGAGT TGTTAGGTTG 1347 |||||||||| || | || || || ||| ||| || ||||||| |||||||||| |||||||||| TAATTTGTCT AAATGTTTTT GTAGAGTTAC GTTCTATGTA AACTGTGAGT TGTTAGGTTG 270 GGTTGATTTT TAATGCAGGT TGTAGTTGTG GAGGTTCGGT TGGGGGTGGT A 1398 | |||| || | ||| |||| |||||||||| |||||||||| | |||||||| | TGCTGATGTT T-ATGTAGGT TGTAGTTGTG GAGGTTCGGT T-GGGGTGGT A 319 hqPGS_C06HBa0153O03.1-4+_SGN-E577986- (1228 1398) ******************************************************************************** EST sequence 1 +strand 597 n (File: SGN-E350035+) 1 GAAAAAAAAC TAGAAAAGGG TTGGGCGAGA ATAACGTCTT CTTCCGGTGG TTGGAAAAAA 61 ATGAATTTTT CTTCAGATTG CTTGAGTTGG AAGTTCAGCA AAAGGGGAAA ACCCAAGTCT 121 CGGAGTGAAT TCTCGATTGG TTAAGGTTAT GAGCCTGGAT TTTTGTTGTA CTTGTCATTC 181 TCTTCTTTTC CAAGGCTTCT TGGAGATTTG TCAGGTAGCT ATTTCCATCG CAGCAGACTT 241 TCTTCTCCTT AATTATGATC TTGTTCTATT CTAGAAACAA GTTCATTTGA GACTTGAGTT 301 TTTCTTTTGA ATCAATTGTA ATACTTTAGA GGCTTGTACA CGTGACTACC AGGTTTTGGG 361 GGGTCTTATT AAGTTACTTA TATTTTATTT CCGCACTTTA TGGTAATGGT TGAGTTTTAG 421 GCTGACTTGT CTTGGTGGGA TAAGACGAGT GCCATCACGT CCATTTTTGG GTCGTGACAC 481 ATCTACTAAA AGTATTTAGC TACTGGTGAA ATATGTAAAG ATGTTTGAGA CTTTCATTTC 541 CCTCATTCAG TTTCTTTCAT TTAATTCTTA CATGAAGTTT AAGTTCAAAA AAAAAAA Predicted gene structure (within gDNA segment 1 to 2666): Exon 1 1500 1835 ( 336 n); cDNA 146 482 ( 337 n); score: 0.942 PPA cDNA 587 597 MATCH C06HBa0153O03.1-4+ SGN-E350035+ 0.942 336 0.563 C PGS_C06HBa0153O03.1-4+_SGN-E350035+ (1500 1835) Alignment (genomic DNA sequence = upper lines): GTTATGAGCC TGGATTTTTG TTGTACTTGT CATTCTCTTC TTTTCCGAGG ATTCTTGGAG 1559 |||||||||| |||||||||| |||||||||| |||||||||| |||||| ||| ||||||||| GTTATGAGCC TGGATTTTTG TTGTACTTGT CATTCTCTTC TTTTCCAAGG CTTCTTGGAG 205 ATTCGTCAGG TAGCTGTTTC CATCGCAGCA GACTTTCTTC TCCTTAATTA TGATCTTGTT 1619 ||| |||||| ||||| |||| |||||||||| |||||||||| |||||||||| |||||||||| ATTTGTCAGG TAGCTATTTC CATCGCAGCA GACTTTCTTC TCCTTAATTA TGATCTTGTT 265 CTATTCTAGA AATAAGTTCA TTTGAGACTT GAGTTTTTCT TTTGAATCAA TTGTAATACT 1679 |||||||||| || ||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTATTCTAGA AACAAGTTCA TTTGAGACTT GAGTTTTTCT TTTGAATCAA TTGTAATACT 325 TTAGAGGCTT GTACACGTGA CTACCAGGTT TT-GGAGTTT TTATTAAGTT ACTTATATTT 1738 |||||||||| |||||||||| |||||||||| || || | | |||||||||| |||||||||| TTAGAGGCTT GTACACGTGA CTACCAGGTT TTGGGGGGTC TTATTAAGTT ACTTATATTT 385 TATTTTCGCA CTTTATTGTA ATGATTGAGT TTTAGGTTGA CTTGTCTTGG TGGGATAAGA 1798 ||||| |||| |||||| ||| ||| |||||| |||||| ||| |||||||||| |||||||||| TATTTCCGCA CTTTATGGTA ATGGTTGAGT TTTAGGCTGA CTTGTCTTGG TGGGATAAGA 445 CGAGTGTCAT CACACCCATT TTTGGGTCAT GACATAT 1835 |||||| ||| ||| ||||| |||||||| | |||| || CGAGTGCCAT CACGTCCATT TTTGGGTCGT GACACAT 482 hqPGS_C06HBa0153O03.1-4+_SGN-E350035+ (1500 1835) ******************************************************************************** EST sequence 2 +strand 576 n (File: SGN-E339561+) 1 AACTAGAAAA GGGTTGGGCG AGAATAACGT CTTCTTCCGG TGGTTGGAAA AAAATGAATT 61 TTTCTTCAGA TTGCTTGAGT TGGAAGTTCA GCAAAAGGGG AAAACCCAAG TCTCGGAGTG 121 AATTCTCGAT TGGTTAAGGT TATGAGCCTG GATTTTTGTT GTACTTGTCA TTCTCTTCTT 181 TTCCAAGGCT TCTTGGAGAT TTGTCAGGTA GCTATTTCCA TCGCAGCAGA CTTTCTTCTC 241 CTTAATTATG ATCTTGTTCT ATTCTAGAAA CAAGTTCATT TGAGACTTGA GTTTTTCTTT 301 TGAATCAATT GTAATACTTT AGAGGCTTGT ACACGTGACT ACCAGGTTTT GGGGGGTCTT 361 ATTAAGTTAC TTATATTTTA TTTCCGCACT TTATGGTAAT GGTTGAGTTT TAGGCTGACT 421 TGTCTTGGTG GGATAAGACG AGTGCCATCA CGTCCATTTT TGGGTCGTGA CACATCTACT 481 AAAAGTATTT AGCTACTGGT GAAATATGTA AAGATGTTTG AGACTTTCAT TTCCCTCATT 541 CAGTTTCTTT CATTTAATTC TTACATGAAG TTTAAG Predicted gene structure (within gDNA segment 1 to 2666): Exon 1 1500 1835 ( 336 n); cDNA 139 475 ( 337 n); score: 0.942 MATCH C06HBa0153O03.1-4+ SGN-E339561+ 0.942 336 0.583 C PGS_C06HBa0153O03.1-4+_SGN-E339561+ (1500 1835) Alignment (genomic DNA sequence = upper lines): GTTATGAGCC TGGATTTTTG TTGTACTTGT CATTCTCTTC TTTTCCGAGG ATTCTTGGAG 1559 |||||||||| |||||||||| |||||||||| |||||||||| |||||| ||| ||||||||| GTTATGAGCC TGGATTTTTG TTGTACTTGT CATTCTCTTC TTTTCCAAGG CTTCTTGGAG 198 ATTCGTCAGG TAGCTGTTTC CATCGCAGCA GACTTTCTTC TCCTTAATTA TGATCTTGTT 1619 ||| |||||| ||||| |||| |||||||||| |||||||||| |||||||||| |||||||||| ATTTGTCAGG TAGCTATTTC CATCGCAGCA GACTTTCTTC TCCTTAATTA TGATCTTGTT 258 CTATTCTAGA AATAAGTTCA TTTGAGACTT GAGTTTTTCT TTTGAATCAA TTGTAATACT 1679 |||||||||| || ||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTATTCTAGA AACAAGTTCA TTTGAGACTT GAGTTTTTCT TTTGAATCAA TTGTAATACT 318 TTAGAGGCTT GTACACGTGA CTACCAGGTT TT-GGAGTTT TTATTAAGTT ACTTATATTT 1738 |||||||||| |||||||||| |||||||||| || || | | |||||||||| |||||||||| TTAGAGGCTT GTACACGTGA CTACCAGGTT TTGGGGGGTC TTATTAAGTT ACTTATATTT 378 TATTTTCGCA CTTTATTGTA ATGATTGAGT TTTAGGTTGA CTTGTCTTGG TGGGATAAGA 1798 ||||| |||| |||||| ||| ||| |||||| |||||| ||| |||||||||| |||||||||| TATTTCCGCA CTTTATGGTA ATGGTTGAGT TTTAGGCTGA CTTGTCTTGG TGGGATAAGA 438 CGAGTGTCAT CACACCCATT TTTGGGTCAT GACATAT 1835 |||||| ||| ||| ||||| |||||||| | |||| || CGAGTGCCAT CACGTCCATT TTTGGGTCGT GACACAT 475 hqPGS_C06HBa0153O03.1-4+_SGN-E339561+ (1500 1835) ******************************************************************************** EST sequence 3 +strand 391 n (File: SGN-E344035+) 1 AGGAAAAAAA AATAGAAAAG AGTTGGGCGA GAATTACGTC TTCTTCCGGT GGTTGGAAAA 61 AATGAATTTT TCTTCAGATT GCTTGAGTTG GAAGTTCAGC AAAAGGGGAA AACCCAAGTC 121 TCGGAGTGAA TTCTCGATTG ATTGAGGTTA TGAGCCTGGA TTTTTGTTGT ACTTGTCATT 181 CTCTTCTTTT CCGAGGCTTC ATGGAGATTT GTCAGGTAGT TGTTTCCATC GCAGCAGACT 241 TTCTTCTCCG TAATTATGTT CTTGTTCTAT TCTAGAAACA AATTCATTTG AGACTTGAGT 301 TTTCTTTTGA ATCAATAGTA ATACTTTAGA GGCTTGTACA CGTGACAACC AGGTTTTGGG 361 TTATAATATA AGTCGATAGT AAAAAAAAAA A Predicted gene structure (within gDNA segment 1 to 2666): Exon 1 1500 1713 ( 214 n); cDNA 147 359 ( 213 n); score: 0.949 PPA cDNA 381 391 MATCH C06HBa0153O03.1-4+ SGN-E344035+ 0.949 214 0.547 C PGS_C06HBa0153O03.1-4+_SGN-E344035+ (1500 1713) Alignment (genomic DNA sequence = upper lines): GTTATGAGCC TGGATTTTTG TTGTACTTGT CATTCTCTTC TTTTCCGAGG ATTCTTGGAG 1559 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||| ||||| GTTATGAGCC TGGATTTTTG TTGTACTTGT CATTCTCTTC TTTTCCGAGG CTTCATGGAG 206 ATTCGTCAGG TAGCTGTTTC CATCGCAGCA GACTTTCTTC TCCTTAATTA TGATCTTGTT 1619 ||| |||||| ||| |||||| |||||||||| |||||||||| ||| |||||| || ||||||| ATTTGTCAGG TAGTTGTTTC CATCGCAGCA GACTTTCTTC TCCGTAATTA TGTTCTTGTT 266 CTATTCTAGA AATAAGTTCA TTTGAGACTT GAGTTTTTCT TTTGAATCAA TTGTAATACT 1679 |||||||||| || || |||| |||||||||| ||| |||||| |||||||||| | |||||||| CTATTCTAGA AACAAATTCA TTTGAGACTT GAG-TTTTCT TTTGAATCAA TAGTAATACT 325 TTAGAGGCTT GTACACGTGA CTACCAGGTT TTGG 1713 |||||||||| |||||||||| | |||||||| |||| TTAGAGGCTT GTACACGTGA CAACCAGGTT TTGG 359 hqPGS_C06HBa0153O03.1-4+_SGN-E344035+ (1500 1713) Total number of EST alignments reported: 5 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 2666: PGL 1 (+ strand): 1227 1835 AGS-1 (1227 1835) SCR (e 0.915) Exon 1 1227 1835 ( 609 n); score: 0.915 PGS (1227 1748) SGN-E541821- PGS (1228 1398) SGN-E577986- PGS (1500 1835) SGN-E350035+ PGS (1500 1835) SGN-E339561+ PGS (1500 1713) SGN-E344035+ 3-phase translation of AGS-1 (+strand): . . . . . . 1227 ATTGCTTTGATCTTGTTATTGCAGATATGAAATTTGTTGAAGTGTTGTTGTTGAGGATAT I A L I L L L Q I - N L L K C C C - G Y L L - S C Y C R Y E I C - S V V V E D M C F D L V I A D M K F V E V L L L R I . . . . . . 1287 GTAATTTGTCTAAGTATTGTTGTTGAGCTACGTGCTATGTAAACTGTGAGTTGTTAGGTT V I C L S I V V E L R A M - T V S C - V - F V - V L L L S Y V L C K L - V V R L C N L S K Y C C - A T C Y V N C E L L G . . . . . . 1347 GGGTTGATTTTTAATGCAGGTTGTAGTTGTGGAGGTTCGGTTGGGGGTGGTAGGAGTACC G L I F N A G C S C G G S V G G G R S T G - F L M Q V V V V E V R L G V V G V P W V D F - C R L - L W R F G W G W - E Y . . . . . . 1407 CGTATTTCATCCCCTTAGCTTGTGTTTAGAGGTTTACTTGCTGAGTACCGTGTGGTTTGG R I S S P - L V F R G L L A E Y R V V W V F H P L S L C L E V Y L L S T V W F G P Y F I P L A C V - R F T C - V P C G L . . . . . . 1467 TACTCACCCTTTGCTTCTACAAAATTTTTGTAGGTTATGAGCCTGGATTTTTGTTGTACT Y S P F A S T K F L - V M S L D F C C T T H P L L L Q N F C R L - A W I F V V L V L T L C F Y K I F V G Y E P G F L L Y . . . . . . 1527 TGTCATTCTCTTCTTTTCCGAGGATTCTTGGAGATTCGTCAGGTAGCTGTTTCCATCGCA C H S L L F R G F L E I R Q V A V S I A V I L F F S E D S W R F V R - L F P S Q L S F S S F P R I L G D S S G S C F H R . . . . . . 1587 GCAGACTTTCTTCTCCTTAATTATGATCTTGTTCTATTCTAGAAATAAGTTCATTTGAGA A D F L L L N Y D L V L F - K - V H L R Q T F F S L I M I L F Y S R N K F I - D S R L S S P - L - S C S I L E I S S F E . . . . . . 1647 CTTGAGTTTTTCTTTTGAATCAATTGTAATACTTTAGAGGCTTGTACACGTGACTACCAG L E F F F - I N C N T L E A C T R D Y Q L S F S F E S I V I L - R L V H V T T R T - V F L L N Q L - Y F R G L Y T - L P . . . . . . 1707 GTTTTGGAGTTTTTATTAAGTTACTTATATTTTATTTTCGCACTTTATTGTAATGATTGA V L E F L L S Y L Y F I F A L Y C N D - F W S F Y - V T Y I L F S H F I V M I E G F G V F I K L L I F Y F R T L L - - L . . . . . . 1767 GTTTTAGGTTGACTTGTCTTGGTGGGATAAGACGAGTGTCATCACACCCATTTTTGGGTC V L G - L V L V G - D E C H H T H F W V F - V D L S W W D K T S V I T P I F G S S F R L T C L G G I R R V S S H P F L G . 1827 ATGACATAT M T Y - H H D I Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-1 (-strand): . . . . . . 1835 ATATGTCATGACCCAAAAATGGGTGTGATGACACTCGTCTTATCCCACCAAGACAAGTCA I C H D P K M G V M T L V L S H Q D K S Y V M T Q K W V - - H S S Y P T K T S Q M S - P K N G C D D T R L I P P R Q V . . . . . . 1775 ACCTAAAACTCAATCATTACAATAAAGTGCGAAAATAAAATATAAGTAACTTAATAAAAA T - N S I I T I K C E N K I - V T - - K P K T Q S L Q - S A K I K Y K - L N K N N L K L N H Y N K V R K - N I S N L I K . . . . . . 1715 CTCCAAAACCTGGTAGTCACGTGTACAAGCCTCTAAAGTATTACAATTGATTCAAAAGAA L Q N L V V T C T S L - S I T I D S K E S K T W - S R V Q A S K V L Q L I Q K K T P K P G S H V Y K P L K Y Y N - F K R . . . . . . 1655 AAACTCAAGTCTCAAATGAACTTATTTCTAGAATAGAACAAGATCATAATTAAGGAGAAG K L K S Q M N L F L E - N K I I I K E K N S S L K - T Y F - N R T R S - L R R R K T Q V S N E L I S R I E Q D H N - G E . . . . . . 1595 AAAGTCTGCTGCGATGGAAACAGCTACCTGACGAATCTCCAAGAATCCTCGGAAAAGAAG K V C C D G N S Y L T N L Q E S S E K K K S A A M E T A T - R I S K N P R K R R E S L L R W K Q L P D E S P R I L G K E . . . . . . 1535 AGAATGACAAGTACAACAAAAATCCAGGCTCATAACCTACAAAAATTTTGTAGAAGCAAA R M T S T T K I Q A H N L Q K F C R S K E - Q V Q Q K S R L I T Y K N F V E A K E N D K Y N K N P G S - P T K I L - K Q . . . . . . 1475 GGGTGAGTACCAAACCACACGGTACTCAGCAAGTAAACCTCTAAACACAAGCTAAGGGGA G - V P N H T V L S K - T S K H K L R G G E Y Q T T R Y S A S K P L N T S - G D R V S T K P H G T Q Q V N L - T Q A K G . . . . . . 1415 TGAAATACGGGTACTCCTACCACCCCCAACCGAACCTCCACAACTACAACCTGCATTAAA - N T G T P T T P N R T S T T T T C I K E I R V L L P P P T E P P Q L Q P A L K M K Y G Y S Y H P Q P N L H N Y N L H - . . . . . . 1355 AATCAACCCAACCTAACAACTCACAGTTTACATAGCACGTAGCTCAACAACAATACTTAG N Q P N L T T H S L H S T - L N N N T - I N P T - Q L T V Y I A R S S T T I L R K S T Q P N N S Q F T - H V A Q Q Q Y L . . . . . . 1295 ACAAATTACATATCCTCAACAACAACACTTCAACAAATTTCATATCTGCAATAACAAGAT T N Y I S S T T T L Q Q I S Y L Q - Q D Q I T Y P Q Q Q H F N K F H I C N N K I D K L H I L N N N T S T N F I S A I T R . 1235 CAAAGCAAT Q S N K A S K Q Maximal non-overlapping open reading frames (>= 64 codons): none ... finished at: Mon Aug 28 22:19:40 2006 ________________________________________________________________________________ Sequence 5: C06HBa0153O03.1-5, from 1 to 7846, both strands analyzed. ... started at: Mon Aug 28 22:19:40 2006 EST library file: /tmp/cxgn-bacpublish-resources-T4dD4R/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 6 ... matches indexed, elapsed seconds = 6 HitsTableSize = 1 EST library file: /tmp/cxgn-bacpublish-resources-T4dD4R/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 6 ... matches indexed, elapsed seconds = 6 HitsTableSize = 33 ******************************************************************************** EST sequence 1 +strand 715 n (File: SGN-E389503+) 1 AATAGGAAGG AAACACTAGT GGAACATGAT CCACTACATC AACTCTAAAA CTAAGCTAGA 61 ATATAAAACA ACAGCATCCT CGAAATCATG AGGACCTACC AAAGCCGGTT GAATGCTAAA 121 CGTCCGAATA ACTGCAGCGA AGATCCTTTA GTTCATTCGT CCAGTTGTGG TTGCACCTAA 181 AATAGTAGAG TTTGTATAGG GTTAGTACAC ACTTGTACTA AGTATGGGTA TATGCAAGCA 241 CACACCAATG ACATGCATAA GAAAGAATAA TTCTTTCCTA ACAACATGGT TTTTGGAAGT 301 CAAGTTAGTG GACCTGCCAA TTTGTGATTA CGAAGGTTTC CAAATATGGA TTAGGAAAGT 361 CAGTGAGCTT TCTAGATTTA GAATAGGAAA GTTATGCCAT GAGAATAACG TACATCATCA 421 GTTTTTAACA TTTCCACATA GCACATAACA TTTACACATA GCACATATGA TATGGCATAA 481 AATACAATTT GCATGTAGCA CATCACTTCT TTAATATCAT TTATCATATG CCATGAGACC 541 TTTGAATCAT GGACTTAACA TTAAGAATTC CCAGAAATGA GGGCTCAACA AATGGGACCT 601 CAACTAGGGA GTCTTCATTA GCAAACACAT AGCCTGCTTC TTTCATTCAT ACGTACTACA 661 TTTCATTTCA TTCATAGGCC AGTGTAAACA CAAACTATAC CTAGGATGTA GTTTG Predicted gene structure (within gDNA segment 219 to 2530): Exon 1 1161 1552 ( 392 n); cDNA 1 390 ( 390 n); score: 0.847 Intron 1 1553 1584 ( 32 n); Pd: 0.057 (s: 0.88), Pa: 0.000 (s: 0.76) Exon 2 1585 1910 ( 326 n); cDNA 391 714 ( 324 n); score: 0.856 MATCH C06HBa0153O03.1-5+ SGN-E389503+ 0.851 718 1.004 C PGS_C06HBa0153O03.1-5+_SGN-E389503+ (1161 1552,1585 1910) Alignment (genomic DNA sequence = upper lines): AACAGGAAGG AAACGCTAGT GGAACATGCT CCACTAGCTC AACTCTAAAA CTAAGCTAGA 1220 || ||||||| |||| ||||| |||||||| | |||||| || |||||||||| |||||||||| AATAGGAAGG AAACACTAGT GGAACATGAT CCACTACATC AACTCTAAAA CTAAGCTAGA 60 ATATAAAACA GTGGCATCCT CGAAAGCATG ACGACCTACC AACTCCGAAC GAATGCTCGA 1280 |||||||||| ||||||| ||||| |||| | |||||||| || ||| ||||||| | ATATAAAACA ACAGCATCCT CGAAATCATG AGGACCTACC AAAGCCGGTT GAATGCTAAA 120 CGTTTGGATA ATTGCAACGA TGATCTTGTA GCTCTATCGT CCATCTGTGT CTGCACCTAA 1340 ||| | ||| | |||| ||| |||| | || | || |||| ||| |||| ||||||| | CGTCCGAATA ACTGCAGCGA AGATCCTTTA GTTCATTCGT CCAGTTGTGG TTGCACCT-A 179 AAATAGTAGA GTTTGTATAG GGTTAGTACA CACTTTTAAT AAGTATGGGT ATATGCAAGA 1400 |||||||||| |||||||||| |||||||||| ||||| || | |||||||||| ||||||||| AAATAGTAGA GTTTGTATAG GGTTAGTACA CACTTGTACT AAGTATGGGT ATATGCAAGC 239 ACACACCACG AATATGCATG AGAAAGAATA ACTCTTTCTT AACAACATGA CTTTTTGGAA 1460 |||||||| | |||||| |||||||||| | |||||| | ||||||||| ||||||||| ACACACCAAT GACATGCATA AGAAAGAATA ATTCTTTCCT AACAACATG- GTTTTTGGAA 298 GTCAAGTCAG TGGACTTGCC AAATTTAGAT TAGGAGAGTT ACCAAATTTG GAATAGGAAA 1520 ||||||| || ||||| |||| || || ||| || || ||| |||||| || || ||||||| GTCAAGTTAG TGGACCTGCC AATTTGTGAT TACGAAGGTT TCCAAATATG GATTAGGAAA 358 GTCAATGAGC TTTCCAAATT TGGAATAGGA AAGTCAGTAA GCTTTCCTGA TTTGGAATAG 1580 |||| ||||| |||| | ||| | |||||||| || GTCAGTGAGC TTTCTAGATT TAGAATAGGA AA........ .......... .......... 390 GCTAGTTATG CCATGAGTTT AACACACATC ATCATACTTT GCACCTTTGC ACACACCACA 1640 |||||| ||||||| | ||| ||||| |||| ||| || ||| | ||| | |||| ....GTTATG CCATGAGAAT AACGTACATC ATCA-GTTTT TAACATTTCC ACATAGCACA 445 TAACATTTAC ACATAGCACA TATCATATAG CACACTGCAC AATTTGCATG AAGCACATAT 1700 |||||||||| |||||||||| ||| |||| | || | || |||||||||| ||||||| TAACATTTAC ACATAGCACA TATGATATGG CATAAAATAC AATTTGCATG TAGCACATCA 505 TTTCTTTAAT ATCATTCATT CATATGCCAT AAGACCTTTG GATCATGGAC TTAATGTTAA 1760 ||||||||| |||||| | | |||||||||| ||||||||| ||||||||| |||| |||| CTTCTTTAAT ATCATTTA-T CATATGCCAT GAGACCTTTG AATCATGGAC TTAACATTAA 564 GACATCCCAT AAATGAGGTC TCAATAGATG GGACCTCAAC TAGAGAGTCT TCATTAGCAA 1820 || ||||| |||||||| | |||| | ||| |||||||||| ||| |||||| |||||||||| GAATTCCCAG AAATGAGGGC TCAACAAATG GGACCTCAAC TAGGGAGTCT TCATTAGCAA 624 ACACAGAATC TGTTTCGTTC ATTCATACGT ACTCCATTTC ATTTCATTCA TAGGCCAGTA 1880 ||||| | | || ||| ||| |||||||||| ||| |||||| |||||||||| ||||||||| ACACATAGCC TGCTTCTTTC ATTCATACGT ACTACATTTC ATTTCATTCA TAGGCCAGTG 684 TAAACACCAG CTCTACCTAG GATGTAGTTT 1910 ||||||| | || ||||||| |||||||||| TAAACACAAA CTATACCTAG GATGTAGTTT 714 hqPGS_C06HBa0153O03.1-5+_SGN-E389503+ (1161 1552,1585 1910) ******************************************************************************** EST sequence 9 -strand 687 n (File: SGN-E389994-) 1 TTAGTTCATT CGTCCAGTNG TGGTTGCACN TAAAATAGTA GAGTTTGTAT AGGGTTAGTA 61 CACACTTGTA CTAAGTATGG GTATATGCAA GCACACACCA AATGACATGC ATAAGAAAGA 121 ATAATTCTTT CCTAACAACA TGGTTTTGGG AAGTCAAGTT AGTGGACCTG CCAATTTGTG 181 ATTACGAAGG TTTCCAAATA TGGATTAGGA AAGTCAGTGA GCTTTCTAGA TTTAGAATAG 241 GAAAGTTATG CCATGAGAAT AACGTACATC ATCAGTTTTT AACATTTCCA CATAGCACAT 301 AACATTTACA CATAGCACAT ATGATATGGC ATAAAATACA ATTTGCATGT AGCACATCAC 361 TTCTTTAATA TCATTTATCA TATGCCATGA GACCTTTGAA TCATGGACTT AACATTAAGA 421 ATTCCCAGAA ATGAGGGCTC AACAAATGGG ACCTCAACTA GGGAGTCTTC ATTAGCAAAC 481 ACATAGCCTG CTTCTTTCAT TCATACGTAC TACATTTCAT TTCATTCATA GGCCAGTGTA 541 AACACAAACT ATACCTAGGA TGTAGTTTGA GACTTTCATT AGGTTCATTG TGCAATGACT 601 AAGAATGACC TAATGTCATT ACTTGAATCT AACTCACCTT GTGATTATCC TATCCAAACA 661 ACTTTGATAT CATTCATTTC ATTATGT Predicted gene structure (within gDNA segment 420 to 3052): Exon 1 1309 1552 ( 244 n); cDNA 2 244 ( 243 n); score: 0.834 Intron 1 1553 1584 ( 32 n); Pd: 0.057 (s: 0.88), Pa: 0.000 (s: 0.76) Exon 2 1585 2029 ( 445 n); cDNA 245 687 ( 443 n); score: 0.863 MATCH C06HBa0153O03.1-5+ SGN-E389994- 0.853 689 1.003 C PGS_C06HBa0153O03.1-5+_SGN-E389994- (1309 1552,1585 2029) Alignment (genomic DNA sequence = upper lines): TAGCTCTATC GTCCATCTGT GTCTGCACCT AAAAATAGTA GAGTTTGTAT AGGGTTAGTA 1368 ||| || || ||||| || | ||||| | ||||||||| |||||||||| |||||||||| TAGTTCATTC GTCCAGTNGT GGTTGCACNT -AAAATAGTA GAGTTTGTAT AGGGTTAGTA 60 CACACTTTTA ATAAGTATGG GTATATGCAA GAACACACCA CGA-ATATGC ATGAGAAAGA 1427 ||||||| || ||||||||| |||||||||| | |||||||| | |||| || ||||||| CACACTTGTA CTAAGTATGG GTATATGCAA GCACACACCA AATGACATGC ATAAGAAAGA 120 ATAACTCTTT CTTAACAACA TGACTTTTTG GAAGTCAAGT CAGTGGACTT GCCAAATTTA 1487 |||| ||||| | |||||||| || |||| | |||||||||| ||||||| | ||||| || ATAATTCTTT CCTAACAACA TG-GTTTTGG GAAGTCAAGT TAGTGGACCT GCCAATTTGT 179 GATTAGGAGA GTTACCAAAT TTGGAATAGG AAAGTCAATG AGCTTTCCAA ATTTGGAATA 1547 ||||| || ||| |||||| |||| |||| ||||||| || ||||||| | |||| ||||| GATTACGAAG GTTTCCAAAT ATGGATTAGG AAAGTCAGTG AGCTTTCTAG ATTTAGAATA 239 GGAAAGTCAG TAAGCTTTCC TGATTTGGAA TAGGCTAGTT ATGCCATGAG TTTAACACAC 1607 ||||| ||| |||||||||| |||| || GGAAA..... .......... .......... .......GTT ATGCCATGAG AATAACGTAC 267 ATCATCATAC TTTGCACCTT TGCACACACC ACATAACATT TACACATAGC ACATATCATA 1667 ||||||| ||| || || | |||| | | |||||||||| |||||||||| |||||| ||| ATCATCA-GT TTTTAACATT TCCACATAGC ACATAACATT TACACATAGC ACATATGATA 326 TAGCACACTG CACAATTTGC ATGAAGCACA TATTTTCTTT AATATCATTC ATTCATATGC 1727 | ||| | ||||||||| ||| |||||| | |||||| ||||||||| | |||||||| TGGCATAAAA TACAATTTGC ATGTAGCACA TCACTTCTTT AATATCATTT A-TCATATGC 385 CATAAGACCT TTGGATCATG GACTTAATGT TAAGACATCC CATAAATGAG GTCTCAATAG 1787 ||| |||||| ||| |||||| ||||||| | ||||| ||| || ||||||| | ||||| | CATGAGACCT TTGAATCATG GACTTAACAT TAAGAATTCC CAGAAATGAG GGCTCAACAA 445 ATGGGACCTC AACTAGAGAG TCTTCATTAG CAAACACAGA ATCTGTTTCG TTCATTCATA 1847 |||||||||| |||||| ||| |||||||||| |||||||| | ||| ||| |||||||||| ATGGGACCTC AACTAGGGAG TCTTCATTAG CAAACACATA GCCTGCTTCT TTCATTCATA 505 CGTACTCCAT TTCATTTCAT TCATAGGCCA GTATAAACAC CAGCTCTACC TAGGATGTAG 1907 |||||| ||| |||||||||| |||||||||| || ||||||| | || |||| |||||||||| CGTACTACAT TTCATTTCAT TCATAGGCCA GTGTAAACAC AAACTATACC TAGGATGTAG 565 TTTTAGACTT TCATTAAATT CGTCATGAAA TGACCAAGAA TGACCTAATG TCATTACTTG 1967 ||| |||||| |||||| || | | || || |||| ||||| |||||||||| |||||||||| TTTGAGACTT TCATTAGGTT CATTGTGCAA TGACTAAGAA TGACCTAATG TCATTACTTG 625 AATCTAACTC ACCTTTTGAT TACCCTATCC TAATACCTTT GCTATCATTC ATTTCATTAT 2027 |||||||||| ||||| |||| || ||||||| || | |||| | |||||||| |||||||||| AATCTAACTC ACCTTGTGAT TATCCTATCC AAACAACTTT GATATCATTC ATTTCATTAT 685 GT 2029 || GT 687 hqPGS_C06HBa0153O03.1-5+_SGN-E389994- (1309 1552,1585 2029) ******************************************************************************** EST sequence 31 -strand 806 n (File: SGN-E546254-) 1 GTGGGAAGAA TCTCGGGCTG ATAGCATTCG CGCTCTCCTT ATCCGGCCAC AATATACTAA 61 AGCTCTTTAT CGAAAAGCTG CCTCAAGTAT CAAGCTGGAA AGATGGGCTG ATGCTGTGAG 121 CGATTATGAG TTTCTGCGGC AGCAACTTCC AAGTAACAAA GTGGTTGCTG AAAATTTATC 181 CCATGCCCGA GCTGAATTAA GGAAGTCACG CAGAAAAGGT AATTTCATGG TGGAGTTGAT 241 TTCAGATCTT GACAAGTTCC GGGCTGCGAT TGCATCTGGT CAATCTGTGG TCTATTTTGA 301 TGAATTATCC AACCCAGAAA GCACGTGGAT GTCCTATATC ATGGATACCT TGAACGCCGA 361 ATATCCTTCA GTAACTTTTC TCGAGGTGGA CGTGAAACAG AGTCCAGCGA TCGCTACAGC 421 AGAGAAAATT AAAGTAGCAC CTAGGATTAA GCTTTACAAC AATGGCAGTC GTGTGGCGGA 481 AACGGTTGTG CTAACTCCAG ATTCGTTGGA GTTATTAATT AAGAACACCC TCGTATGTCC 541 CCCCGGATGG TTTGGTGTTA CCTCTGTCAC GACCGGAATC TAGACCCCAG ACGAGACCGG 601 CGTCGTTGAC CTCTCAGAGG TCGCAGACAA GCCTACTTAC GTCATTCTTA CTTTACATAG 661 ATAAATTTTA GCGAAAAATT TGAATTTTTT TTTTTGATTC TTTAATCATA CTTGCTGAAT 721 ATTCTTAACA TTTCATAATC GTTCATCATC AACCATCTAA CATTTAAGAG ATAAAAAAAA 781 AAAAAAAAAA ACTCGAGGGG GGCCCG Predicted gene structure (within gDNA segment 1 to 3046): Exon 1 888 1013 ( 126 n); cDNA 562 687 ( 126 n); score: 0.897 PPA cDNA 771 791 MATCH C06HBa0153O03.1-5+ SGN-E546254- 0.897 126 0.156 C PGS_C06HBa0153O03.1-5+_SGN-E546254- (888 1013) Alignment (genomic DNA sequence = upper lines): CTTTGTCACG ACCGGCATCT AGACCTCATA AGAGACCAGC GTCGATGACC TCTCAGAGGT 947 || ||||||| ||||| |||| ||||| || | |||||| || |||| ||||| |||||||||| CTCTGTCACG ACCGGAATCT AGACCCCAGA CGAGACCGGC GTCGTTGACC TCTCAGAGGT 621 CGCAGACAAG CCTACTTACG TCATTCTTAC TTTACATAGG TTAATTTTAG CGGAAAATTT 1007 |||||||||| |||||||||| |||||||||| ||||||||| | |||||||| || ||||||| CGCAGACAAG CCTACTTACG TCATTCTTAC TTTACATAGA TAAATTTTAG CGAAAAATTT 681 TTGTTT 1013 ||| GAATTT 687 hqPGS_C06HBa0153O03.1-5+_SGN-E546254- (888 1013) ******************************************************************************** EST sequence 22 -strand 617 n (File: SGN-E235464-) 1 TACTAAGTAT GGGTATATGC AAGCACACAC CAATGACATG CATAAGAAAG AATAATTCTT 61 TCCTAACAAC ATGGTTTTTG GAAGTCAAGT TAGTGGACCT GCCAATTTGT GATTACGAAG 121 GTTTCCAAAT ATGGATTAGG AAAGTCAGTG AGCTTTCTAG ATTTAGAATA GGAAAGTTAT 181 GCCATGAGAA TAACGTACAT CATCAGTTTT TAACATTTCC ACATAGCACA TAACATTTAC 241 ACATAGCACA TATGATATGG CATAAAATAC AATTTGCATG TAGCACATCA CTTCTTTAAT 301 ATCATTTATC ATATGCCATG AGACCTTTGA ATCATGGACT TAACATTAAG AATTCCCAGA 361 AATGAGGGCT CAACAAATGG GACCTCAACT AGGGAGTCTT CATTAGCAAA CACATAGCCT 421 GCTTCTTTCA TTCATACGTA CTACATTTCA TTTCATTCAT AGGCCAGTGT AAACACAAAC 481 TATACCTAGG ATGTAGTTTG AGACTTTCAT TAGGTTCATT GTGCAATGAC TAAGAATGAC 541 CTAATGTCAT TACTTGAATC TAACTCACCT TGTGATTATC CTATCCAAAC AACTTTGATA 601 TCATTCATTT CATTATG Predicted gene structure (within gDNA segment 1 to 3042): Exon 1 1377 1552 ( 176 n); cDNA 1 175 ( 175 n); score: 0.852 Intron 1 1553 1584 ( 32 n); Pd: 0.057 (s: 0.88), Pa: 0.000 (s: 0.76) Exon 2 1585 2028 ( 444 n); cDNA 176 617 ( 442 n); score: 0.863 MATCH C06HBa0153O03.1-5+ SGN-E235464- 0.860 620 1.005 C PGS_C06HBa0153O03.1-5+_SGN-E235464- (1377 1552,1585 2028) Alignment (genomic DNA sequence = upper lines): TAATAAGTAT GGGTATATGC AAGAACACAC CACGAATATG CATGAGAAAG AATAACTCTT 1436 || ||||||| |||||||||| ||| |||||| || | ||| ||| |||||| ||||| |||| TACTAAGTAT GGGTATATGC AAGCACACAC CAATGACATG CATAAGAAAG AATAATTCTT 60 TCTTAACAAC ATGACTTTTT GGAAGTCAAG TCAGTGGACT TGCCAAATTT AGATTAGGAG 1496 || ||||||| ||| ||||| |||||||||| | ||||||| |||||| || ||||| || TCCTAACAAC ATG-GTTTTT GGAAGTCAAG TTAGTGGACC TGCCAATTTG TGATTACGAA 119 AGTTACCAAA TTTGGAATAG GAAAGTCAAT GAGCTTTCCA AATTTGGAAT AGGAAAGTCA 1556 ||| ||||| | |||| ||| |||||||| | |||||||| | |||| |||| |||||| GGTTTCCAAA TATGGATTAG GAAAGTCAGT GAGCTTTCTA GATTTAGAAT AGGAAA.... 175 GTAAGCTTTC CTGATTTGGA ATAGGCTAGT TATGCCATGA GTTTAACACA CATCATCATA 1616 || |||||||||| | |||| | |||||||| .......... .......... ........GT TATGCCATGA GAATAACGTA CATCATCA-G 206 CTTTGCACCT TTGCACACAC CACATAACAT TTACACATAG CACATATCAT ATAGCACACT 1676 ||| || | || |||| | |||||||||| |||||||||| ||||||| || || ||| | TTTTTAACAT TTCCACATAG CACATAACAT TTACACATAG CACATATGAT ATGGCATAAA 266 GCACAATTTG CATGAAGCAC ATATTTTCTT TAATATCATT CATTCATATG CCATAAGACC 1736 |||||||| |||| ||||| || ||||| |||||||||| | ||||||| |||| ||||| ATACAATTTG CATGTAGCAC ATCACTTCTT TAATATCATT TA-TCATATG CCATGAGACC 325 TTTGGATCAT GGACTTAATG TTAAGACATC CCATAAATGA GGTCTCAATA GATGGGACCT 1796 |||| ||||| |||||||| |||||| || ||| |||||| || ||||| | ||||||||| TTTGAATCAT GGACTTAACA TTAAGAATTC CCAGAAATGA GGGCTCAACA AATGGGACCT 385 CAACTAGAGA GTCTTCATTA GCAAACACAG AATCTGTTTC GTTCATTCAT ACGTACTCCA 1856 ||||||| || |||||||||| ||||||||| | ||| ||| ||||||||| ||||||| || CAACTAGGGA GTCTTCATTA GCAAACACAT AGCCTGCTTC TTTCATTCAT ACGTACTACA 445 TTTCATTTCA TTCATAGGCC AGTATAAACA CCAGCTCTAC CTAGGATGTA GTTTTAGACT 1916 |||||||||| |||||||||| ||| |||||| | | || ||| |||||||||| |||| ||||| TTTCATTTCA TTCATAGGCC AGTGTAAACA CAAACTATAC CTAGGATGTA GTTTGAGACT 505 TTCATTAAAT TCGTCATGAA ATGACCAAGA ATGACCTAAT GTCATTACTT GAATCTAACT 1976 ||||||| | || | || | ||||| |||| |||||||||| |||||||||| |||||||||| TTCATTAGGT TCATTGTGCA ATGACTAAGA ATGACCTAAT GTCATTACTT GAATCTAACT 565 CACCTTTTGA TTACCCTATC CTAATACCTT TGCTATCATT CATTTCATTA TG 2028 |||||| ||| ||| |||||| | || | ||| || ||||||| |||||||||| || CACCTTGTGA TTATCCTATC CAAACAACTT TGATATCATT CATTTCATTA TG 617 hqPGS_C06HBa0153O03.1-5+_SGN-E235464- (1377 1552,1585 2028) ******************************************************************************** EST sequence 19 -strand 688 n (File: SGN-E395007-) 1 TTTATAGCAT TTTCAGCAAG GAACATGTCC GTTCAGTGAA ATTTGTTCAG ACCCATTTTA 61 TCCAGAAAGT TGTGTCTTTT GGAAAAAATA ACAACTTTTT TGAAAAATTG TGTCTGTTAG 121 GAAAATAACT ACTTTTTGGA AAGTAACGAC TTTTCGGAAA AGTAACGACT TTTCGGAATG 181 TTACTGTTAC CCGCAAATTT ATAAGAAATA ATATTAACAG GATTTATTTG ATTTAACAAA 241 AATTGATTGA ATAAATTTTG TCCAAAAAAT ATATCAATCA ATCACATCAT TTGCCAAATC 301 CAAATCCAAA TCCAAAGAAA TGACTTTTCA TTTGCACTTT GCAAATGACT TTTCATTTTC 361 CCTCCAAAGT AGTTCCCTCA CTTTTGATAT TCTCTTTTTT ATTTTTCCAT TCACACTTGT 421 TAAAACCCAA CATAAACATC ATTTTTGTAT GTGACACATA ACAAAGAAGA TTCCAGAATA 481 TTTAGGTCAT GTTTATCATT AGTATGCTGA TTTTAGTAGC AAGTTTAGTT GGTATATTCA 541 CGATACAACT ACTCCTAAGA AATTTGAAAC AATATAGATA GAAATAATGG GGACGTATAA 601 TTTAGGAAAA ATTAGTGGGT TGTAGAGAAT ATATGCAATT CGTGAAAATT GATACCAGCA 661 TATGTGTGTA CTACTTTTTG TGCAGGCA Predicted gene structure (within gDNA segment 3724 to 7846): Exon 1 4828 5129 ( 302 n); cDNA 1 315 ( 315 n); score: 0.800 Intron 1 5130 5258 ( 129 n); Pd: 0.000 (s: 0.98), Pa: 0.000 (s: 0.98) Exon 2 5259 5370 ( 112 n); cDNA 316 427 ( 112 n); score: 0.902 Intron 2 5371 5631 ( 261 n); Pd: 0.000 (s: 0.82), Pa: 0.977 (s: 0) Exon 3 5632 5657 ( 26 n); cDNA 428 452 ( 25 n); score: 0.654 MATCH C06HBa0153O03.1-5+ SGN-E395007- 0.827 440 0.640 C PGS_C06HBa0153O03.1-5+_SGN-E395007- (4828 5129,5259 5370,5632 5657) Alignment (genomic DNA sequence = upper lines): TTTATAGCCA TTTTCAGCAA GAAACGTGTA TGTTCAAAGA AATCTGTTCA GACCCGTTTT 4887 ||||||| || |||||||||| | ||| ||| ||||| || ||| |||||| ||||| |||| TTTATAG-CA TTTTCAGCAA GGAACATGTC CGTTCAGTGA AATTTGTTCA GACCCATTTT 59 ATCCAGAAAG TTGTGTCTTT TGGAAAAAAT AACAAGTTTT TGGAAAAA-T GTGTCCGTTA 4946 |||||||||| |||||||||| |||||||||| ||||| |||| | |||||| | ||||| |||| ATCCAGAAAG TTGTGTCTTT TGGAAAAAAT AACAACTTTT TTGAAAAATT GTGTCTGTTA 119 GGAAAATAAC GGCTTTTTGG AAAGTAAGGA CTTTTCGGAA AGAGTAATAA CTTTTCGG-A 5005 |||||||||| |||||||| ||||||| || |||||||||| | ||||| | |||||||| | GGAAAATAAC TACTTTTTGG AAAGTAACGA CTTTTCGGAA A-AGTAACGA CTTTTCGGAA 178 ----A-TGTT A--C-CA--- -T-TAAGACA TAATATTAAC AAGATTTATT TGATTTAACA 5052 | |||| | | || | ||||| | |||||||||| | |||||||| |||||||||| TGTTACTGTT ACCCGCAAAT TTATAAGAAA TAATATTAAC AGGATTTATT TGATTTAACA 238 AAAACTGATT AAATAAATTT TGTCCAAAAA ATTTATCAAT CAATCACATC ATTTGCCAAA 5112 |||| ||||| ||||||||| |||||||||| || ||||||| |||||||||| |||||||||| AAAATTGATT GAATAAATTT TGTCCAAAAA ATATATCAAT CAATCACATC ATTTGCCAAA 298 TCCAAATCCA AATCCAAATC CTAAGCCGAA GCCGAGCGAA CGACGACGAC GGCGCGAGGG 5172 |||||||||| ||||||| TCCAAATCCA AATCCAA... .......... .......... .......... .......... 315 GGCATCTTCT TCTTAGCTCT TTAAGAATTA ATGGAAGTGT TTCCTTATAT AAGGACAACA 5232 .......... .......... .......... .......... .......... .......... 315 ATTTCCCTTT CTTTTGATGA CATAGGAGAA ATGACTTTTC ATTTGCACTT TGTAAATGAC 5292 |||| |||||||||| |||||||||| || ||||||| .......... .......... ......AGAA ATGACTTTTC ATTTGCACTT TGCAAATGAC 349 TTTTCATTTT CCCTCCAAAA TAGTTCCCTT ACTTTTCATA TTCTCTCTTT TCTTTTCTCA 5352 |||||||||| ||||||||| ||||||||| |||||| ||| |||||| ||| | |||| || TTTTCATTTT CCCTCCAAAG TAGTTCCCTC ACTTTTGATA TTCTCTTTTT TATTTTTCCA 409 TTCACACATG TTAAATCTAA CAATCCCCCA CGTGAATAGG GAAGGCTATT GTTAAAACAT 5412 ||||||| || ||||| | TTCACACTTG TTAAAACC.. .......... .......... .......... .......... 427 ATGCATGAAA AACTTGTGTG TCTTCTGGTA AAGGCTAATC GCATCTGGAT AAGTAGATTT 5472 .......... .......... .......... .......... .......... .......... 427 CCCTTTAAAC TTTCCGTAGT GAACATATAT CGGATATACT CGGTCAATTG GTAGATTTGA 5532 .......... .......... .......... .......... .......... .......... 427 TATCTTTGAA CCGTCGAGCT TTGTTATATA CCTAGACAAC ATATGTCACA CAATCAACCC 5592 .......... .......... .......... .......... .......... .......... 427 TTGAACTGTT CTTAGTTCTC ATTGTTTTGT TCGTTTCAGC CACGAAAACA TCTTGGATAG 5652 | || ||||| || | | | .......... .......... .......... .........C AACATAAACA TCAT-TTTTG 447 TAAGT 5657 || || TATGT 452 hqPGS_C06HBa0153O03.1-5+_SGN-E395007- (4828 5129,5259 5370,5632 5657) ******************************************************************************** EST sequence 14 -strand 586 n (File: SGN-E250408-) 1 AAAATTGTGT CTGTTAGGAA AATAACTACT TTTTGGAAAG TAACGACTTT TCGGAAAAGT 61 AACGACTTTT CGGAATGTTA CTGTTACCCG CAAATTTATA AGAAATAATA TTAACAGGAT 121 TTATTTGATT TAACAAAAAT TGATTGAATA AATTTTGTCC AAAAAATATA TCAATCAATC 181 ACATCATTTG CCAAATCCAA ATCCAAATCC AAAGAAATGA CTTTTCATTT GCACTTTGCA 241 AATGACTTTT CATTTTCCCT CCAAAGTAGT TCCCTCACTT TTGATATTCT CTTTTTTATT 301 TTTCCATTCA CACTTGTTAA AACCCAACAT AAACATCATT TTTGTATGTG ACACATAACA 361 AAGAAGATTC CAGAATATTT AGGTCATGTT TATCATTAGT ATGCTGATTT TAGTAGCAAG 421 TTTAGTTGGT ATATTCACGA TACAACTACT CTTAAGAAAT TTGAAACAAT ATAGATAGAA 481 ATAATGGGGA CGTATAATTT AGGAAAAATT AGTGGGTTGT AGAGAATATA TGCAATTCGT 541 GAAAATTGAT ACCAGCATAT GTGTGTACTA CTTTTTGTGC AGGCAC Predicted gene structure (within gDNA segment 3255 to 7846): Exon 1 5016 5129 ( 114 n); cDNA 99 212 ( 114 n); score: 0.956 Intron 1 5130 5258 ( 129 n); Pd: 0.000 (s: 0.98), Pa: 0.000 (s: 0.98) Exon 2 5259 5370 ( 112 n); cDNA 213 324 ( 112 n); score: 0.902 Intron 2 5371 5631 ( 261 n); Pd: 0.000 (s: 0.82), Pa: 0.977 (s: 0) Exon 3 5632 5657 ( 26 n); cDNA 325 349 ( 25 n); score: 0.654 MATCH C06HBa0153O03.1-5+ SGN-E250408- 0.929 252 0.430 C PGS_C06HBa0153O03.1-5+_SGN-E250408- (5016 5129,5259 5370,5632 5657) Alignment (genomic DNA sequence = upper lines): TAAGACATAA TATTAACAAG ATTTATTTGA TTTAACAAAA ACTGATTAAA TAAATTTTGT 5075 ||||| |||| |||||||| | |||||||||| |||||||||| | ||||| || |||||||||| TAAGAAATAA TATTAACAGG ATTTATTTGA TTTAACAAAA ATTGATTGAA TAAATTTTGT 158 CCAAAAAATT TATCAATCAA TCACATCATT TGCCAAATCC AAATCCAAAT CCAAATCCTA 5135 ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||| CCAAAAAATA TATCAATCAA TCACATCATT TGCCAAATCC AAATCCAAAT CCAA...... 212 AGCCGAAGCC GAGCGAACGA CGACGACGGC GCGAGGGGGC ATCTTCTTCT TAGCTCTTTA 5195 .......... .......... .......... .......... .......... .......... 212 AGAATTAATG GAAGTGTTTC CTTATATAAG GACAACAATT TCCCTTTCTT TTGATGACAT 5255 .......... .......... .......... .......... .......... .......... 212 AGGAGAAATG ACTTTTCATT TGCACTTTGT AAATGACTTT TCATTTTCCC TCCAAAATAG 5315 ||||||| |||||||||| ||||||||| |||||||||| |||||||||| |||||| ||| ...AGAAATG ACTTTTCATT TGCACTTTGC AAATGACTTT TCATTTTCCC TCCAAAGTAG 269 TTCCCTTACT TTTCATATTC TCTCTTTTCT TTTCTCATTC ACACATGTTA AATCTAACAA 5375 |||||| ||| ||| |||||| ||| |||| | ||| ||||| |||| ||||| || | TTCCCTCACT TTTGATATTC TCTTTTTTAT TTTTCCATTC ACACTTGTTA AAACC..... 324 TCCCCCACGT GAATAGGGAA GGCTATTGTT AAAACATATG CATGAAAAAC TTGTGTGTCT 5435 .......... .......... .......... .......... .......... .......... 324 TCTGGTAAAG GCTAATCGCA TCTGGATAAG TAGATTTCCC TTTAAACTTT CCGTAGTGAA 5495 .......... .......... .......... .......... .......... .......... 324 CATATATCGG ATATACTCGG TCAATTGGTA GATTTGATAT CTTTGAACCG TCGAGCTTTG 5555 .......... .......... .......... .......... .......... .......... 324 TTATATACCT AGACAACATA TGTCACACAA TCAACCCTTG AACTGTTCTT AGTTCTCATT 5615 .......... .......... .......... .......... .......... .......... 324 GTTTTGTTCG TTTCAGCCAC GAAAACATCT TGGATAGTAA GT 5657 | || ||||||| | | ||| || .......... ......CAAC ATAAACATCA T-TTTTGTAT GT 349 hqPGS_C06HBa0153O03.1-5+_SGN-E250408- (5016 5129,5259 5370,5632 5657) ******************************************************************************** EST sequence 21 -strand 574 n (File: SGN-E548743-) 1 AATTTATTAG AGATAAATAT GAACAGGATT TATGTGATTT AACACAAGGT GATTAAATAA 61 ATTTTGTCCA AAAAATTTAT CAATCCAATC ACATCATTTG CCAAATCCAA ATCCAAAGCC 121 GAGCGATGAT GACGGCGCGA CGGGGGCATA TTCTTCTTAG CTCTTTAAGA AGTAATGGAA 181 GTGTTTCCTT ATATAACGAC AACAATTTCC CTTTCTTTAG TCGATATGGG AGAAATGAAT 241 TTTCATTTGC ACTTGGCAAA TGACTTTTCA GTTTCCCTCG AGAGTAGTTC CCTCACCTTT 301 CAGATTCTCT TTTTTTTTTC CCATTCACAC TTGCTAAACC CCAACATGAG CAAGGTATTA 361 TTATAATTTG GTCCAAAATG TGGTTAAGTT TTGGGAGTTT ATCTTGATCA AGTGATTAAT 421 GATATTTGAT ATATTAAGAG CAATCCCGAA CATATGTTTG AACTAGCCGA CCCAGATTAT 481 ATACAACATC CGTAGATGCA TAGGTTGGAT TACTTGCTCC ACATACTTCT TGCTACCAAA 541 GTTCAAGATA CCAAGTGAGA GACAGTTCTC TCCA Predicted gene structure (within gDNA segment 3958 to 7846): Exon 1 5018 5123 ( 106 n); cDNA 9 116 ( 108 n); score: 0.887 Intron 1 5124 5153 ( 30 n); Pd: 0.000 (s: 0.95), Pa: 0.000 (s: 0.40) ?? Exon 2 5154 5372 ( 219 n); cDNA 117 343 ( 227 n); score: 0.774 Intron 2 5373 5526 ( 154 n); Pd: 0.000 (s: 0.80), Pa: 0.635 (s: 0) Exon 3 5527 5531 ( 5 n); cDNA 344 348 ( 5 n); score: 0.600 Intron 3 5532 5928 ( 397 n); Pd: 0.900 (s: 0), Pa: 0.552 (s: 0) Exon 4 5929 5954 ( 26 n); cDNA 349 372 ( 24 n); score: 0.615 MATCH C06HBa0153O03.1-5+ SGN-E548743- 0.811 356 0.620 C PGS_C06HBa0153O03.1-5+_SGN-E548743- (5018 5123,5154 5372,5527 5531,5929 5954) Alignment (genomic DNA sequence = upper lines): AGACAT-AAT ATTAACAAGA TTTATTTGAT TTAACAAAAA CTGATTAAAT AAATTTTGTC 5076 ||| || ||| || |||| || ||||| |||| |||||| || ||||||||| |||||||||| AGAGATAAAT ATGAACAGGA TTTATGTGAT TTAACACAAG GTGATTAAAT AAATTTTGTC 68 CAAAAAATTT ATCAAT-CAA TCACATCATT TGCCAAATCC AAATCCAAAT CCAAATCCTA 5135 |||||||||| |||||| ||| |||||||||| |||||||||| |||||||| CAAAAAATTT ATCAATCCAA TCACATCATT TGCCAAATCC AAATCCAA.. .......... 116 AGCCGAAGCC GAGCGAAC-G ACGA-C---- --GACGGCGC GA-GGGGGCA TCTTCTTCTT 5186 | ||| | |||||||| || ||||||| | |||||||| .......... ........AG CCGAGCGATG ATGACGGCGC GACGGGGGCA TATTCTTCTT 158 AGCTCTTTAA GAATTAATGG AAGTGTTTCC TTATATAAGG ACAACAATTT CCCTTTCTTT 5246 |||||||||| ||| |||||| |||||||||| |||||||| | |||||||||| |||||||||| AGCTCTTTAA GAAGTAATGG AAGTGTTTCC TTATATAACG ACAACAATTT CCCTTTCTTT 218 TGATGACATA GGAGAAATGA CTTTTCATTT GCACTTTGTA AATGACTTTT CATTTTCCCT 5306 | || || |||||||||| ||||||||| |||||| | | |||||||||| || ||||||| AGTCGATATG GGAGAAATGA ATTTTCATTT GCACTTGGCA AATGACTTTT CAGTTTCCCT 278 CCAAAATAGT TCCCTTACTT TTCATATTCT CTCTTTTCTT TTCTCATTCA CACATGTTAA 5366 | | | |||| ||||| || | |||| ||||| || |||| || ||| |||||| ||| || ||| CGAGAGTAGT TCCCTCACCT TTCAGATTCT CT-TTTTTTT TTCCCATTCA CACTTGCTAA 337 ATCTAACAAT CCCCCACGTG AATAGGGAAG GCTATTGTTA AAACATATGC ATGAAAAACT 5426 | | | ACCCCA.... .......... .......... .......... .......... .......... 343 TGTGTGTCTT CTGGTAAAGG CTAATCGCAT CTGGATAAGT AGATTTCCCT TTAAACTTTC 5486 .......... .......... .......... .......... .......... .......... 343 CGTAGTGAAC ATATATCGGA TATACTCGGT CAATTGGTAG ATTTGATATC TTTGAACCGT 5546 | || .......... .......... .......... .......... ACATG..... .......... 348 CGAGCTTTGT TATATACCTA GACAACATAT GTCACACAAT CAACCCTTGA ACTGTTCTTA 5606 .......... .......... .......... .......... .......... .......... 348 GTTCTCATTG TTTTGTTCGT TTCAGCCACG AAAACATCTT GGATAGTAAG TGCTTAAAGA 5666 .......... .......... .......... .......... .......... .......... 348 GCTGGCCTTA CCGGATTCTC CTTGAAGCGG CTTACACTTC ACACTTACAT AGGTGATTTC 5726 .......... .......... .......... .......... .......... .......... 348 TAAATGTGTT ATCCCATAGA TATACCATTT GATATTCCAT GTATCAAACT TAGAAACCAT 5786 .......... .......... .......... .......... .......... .......... 348 TAAAAAGTCC TTACGTCTTT ATCCTTATTA CTAAATATTG TCTCATCATG AAAATGGACC 5846 .......... .......... .......... .......... .......... .......... 348 ATAAAATAAT AAGAATAATT TATTTTTTTC TTGACAATGT TGAACCGTCA TCAATGACTT 5906 .......... .......... .......... .......... .......... .......... 348 TGTTTTATCT CCTTGAACCT AGATCATGGG ATCTCCTGTA TTCTAGGT 5954 | || || || | | || | | ||| .......... .......... ..AGCAAGGT AT-TATTATA AT-TTGGT 372 hqPGS_C06HBa0153O03.1-5+_SGN-E548743- (5018 5123,5154 5372) ******************************************************************************** EST sequence 33 -strand 568 n (File: SGN-E301922-) 1 TTATTAGAGA TAAATATTAA CAGGATTTAT TTGATTTAAC AAAAAGTGAT TAAATAAATT 61 TTGTCCAAAA AATTTATCAA TCCAATCACA TCATTTGCCA AATCCAAATC CAAAGCCGAG 121 CGATGATGAC GGCGCGAGGG GGCATCTTCT TCTTAGCTCT TTAAAAAGTA ATGGAAGTGT 181 TTCCTTATAT AAAGACAACA ATTTCCCTTT CTTTTGCCGA TATGGGAGAA ATGAATTTTC 241 ATTTGCACTT TGCAAATGAC TTTTCAGTTT CCCTCCAGAG TAGTTCCCTC ACCTTTCATA 301 TTCTCTTTTT TTTTTCCCAT TCACACTTGC TAAACCCCAA CAAGAACAAG GTATTATTAT 361 AATTTGGTCC AAAATGTGGT TAAGATTTGG GAGTTTATCT TGAACAAGTG ATTAATGATA 421 TTTGATATAT AAAGAACAAT GCCGAAAATA TGTTTGAACT AGTCGATATA GATCAATCTT 481 TTAACTCGTT ACGTTTTTAG CTCAAAAATA TTTTTTTTAC TACCCCTTAC TCCAAAAAAA 541 AAAAAAAAAA AACTCGAGAC AGTTCTCT Predicted gene structure (within gDNA segment 4195 to 7846): Exon 1 5018 5123 ( 106 n); cDNA 6 113 ( 108 n); score: 0.925 Intron 1 5124 5153 ( 30 n); Pd: 0.000 (s: 0.95), Pa: 0.000 (s: 0.50) ?? Exon 2 5154 5366 ( 213 n); cDNA 114 333 ( 220 n); score: 0.812 PPA cDNA 534 553 MATCH C06HBa0153O03.1-5+ SGN-E301922- 0.850 319 0.562 C PGS_C06HBa0153O03.1-5+_SGN-E301922- (5018 5123,5154 5366) Alignment (genomic DNA sequence = upper lines): AGACAT-AAT ATTAACAAGA TTTATTTGAT TTAACAAAAA CTGATTAAAT AAATTTTGTC 5076 ||| || ||| ||||||| || |||||||||| |||||||||| ||||||||| |||||||||| AGAGATAAAT ATTAACAGGA TTTATTTGAT TTAACAAAAA GTGATTAAAT AAATTTTGTC 65 CAAAAAATTT ATCAAT-CAA TCACATCATT TGCCAAATCC AAATCCAAAT CCAAATCCTA 5135 |||||||||| |||||| ||| |||||||||| |||||||||| |||||||| CAAAAAATTT ATCAATCCAA TCACATCATT TGCCAAATCC AAATCCAA.. .......... 113 AGCCGAAGCC GAGCGAAC-G ACGA-CGA-- ----CGGCGC GAGGGGGCAT CTTCTTCTTA 5187 | ||| ||| |||||| |||||||||| |||||||||| .......... ........AG CCGAGCGATG ATGACGGCGC GAGGGGGCAT CTTCTTCTTA 155 GCTCTTTAAG AATTAATGGA AGTGTTTCCT TATATAAGGA CAACAATTTC CCTTTCTTTT 5247 ||||||||| || ||||||| |||||||||| ||||||| || |||||||||| |||||||||| GCTCTTTAAA AAGTAATGGA AGTGTTTCCT TATATAAAGA CAACAATTTC CCTTTCTTTT 215 GATGACATAG GAGAAATGAC TTTTCATTTG CACTTTGTAA ATGACTTTTC ATTTTCCCTC 5307 | || || | ||||||||| |||||||||| ||||||| || |||||||||| | |||||||| GCCGATATGG GAGAAATGAA TTTTCATTTG CACTTTGCAA ATGACTTTTC AGTTTCCCTC 275 CAAAATAGTT CCCTTACTTT TCATATTCTC TCTTTTCTTT TCTCATTCAC ACATGTTAA 5366 || | ||||| |||| || || |||||||||| | |||| ||| || ||||||| || || ||| CAGAGTAGTT CCCTCACCTT TCATATTCTC T-TTTTTTTT TCCCATTCAC ACTTGCTAA 333 hqPGS_C06HBa0153O03.1-5+_SGN-E301922- (5018 5123,5154 5366) ******************************************************************************** EST sequence 23 -strand 658 n (File: SGN-E542859-) 1 TGTCTGTTAG GAAAATAACG ACTTTTTGGA AAGTAACGAC TTTTCCGAAA GAGTAACAAC 61 TTTTCAAAAT GTTACCGTTA ACCGCAAATT TATTAGAGAT AAATATTAAC AGGATTTATT 121 TGATTTAACA AAAAGTGATT AAATAAATTT TGTCCAAAAA ATTTATCAAT CCAATCACAT 181 CATTTGCCAA ATCCAAATCC AAAGCCGAGC GATGATGACG GCGCGAGGGG GCATCTTCTT 241 CTTAGCTCTT TAAGAAGTAA TGGAAGTGTT TCCTTATATA ACGACAACAA TTTCCCTTTC 301 TTTTGCCGAT ATGGGAGAAA TGAATTTTCA TTTGCACTTT GCAAATGACT TTTCAGTTTC 361 CCTCCAGAGT AGTTCCCTCA CCTTTCATAT TCTCTTTTTT TTTTCCCATT CACACTTGCT 421 AAACCCCAAC AAGAACAAGG TATTATTATA ATTTGGTCCA AAATGTGGTT AAGATTTGGG 481 AGTTTATCTT GAACAAGTGA TTAATGATAT TTGATATATA AAGAACAATG CCGAAAATAT 541 GTTTGAACTA GTCGATATAG ATCAATCTTT TAACTCGTTA CGTTTTTAGC TCAAAAATAT 601 TTTTTTTACT AACAATTACT CCAAAAAAAA AAAAAAAAAA ACTCGAGACA GTTCTCTC Predicted gene structure (within gDNA segment 2705 to 7846): Exon 1 3637 3646 ( 10 n); cDNA 89 98 ( 10 n); score: 0.700 Intron 1 3647 5019 (1373 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.92) Exon 2 5020 5123 ( 104 n); cDNA 99 202 ( 104 n); score: 0.938 Intron 2 5124 5153 ( 30 n); Pd: 0.000 (s: 0.95), Pa: 0.000 (s: 0.50) ?? Exon 3 5154 5366 ( 213 n); cDNA 203 422 ( 220 n); score: 0.817 PPA cDNA 623 642 MATCH C06HBa0153O03.1-5+ SGN-E542859- 0.856 327 0.497 C PGS_C06HBa0153O03.1-5+_SGN-E542859- (3637 3646,5020 5123,5154 5366) Alignment (genomic DNA sequence = upper lines): TTGAGAAGAG GTGGAAAAAT TGAAAAGAGA AGCATAGTCA ATGCCAAGAC CCATAAAATG 3696 || | |||| TTTATTAGAG .......... .......... .......... .......... .......... 98 ATATGTCGAA GGCATTGTGC GGACAGCGGA CTAATTGTTG TCTACCTTTT GAGCTTGAAT 3756 .......... .......... .......... .......... .......... .......... 98 AGACATAAAA TGTGCTTGCC AAAATTTGCA ATCAGAACTC CAAAAAGGAA TTTATGGAAG 3816 .......... .......... .......... .......... .......... .......... 98 TGCAGTAAGG AGACATAGTC GGGTCGGCCA AAGCAACCTT CGGGTTTAGT AATTGTTAAG 3876 .......... .......... .......... .......... .......... .......... 98 GTATGTTCAG CGAGCTTGAG CTCAAAATAG TGATTTATCG AATACATTCA GCGACAGTAG 3936 .......... .......... .......... .......... .......... .......... 98 ATATCACCAA ATAAGTATGG TATAAAAATA CCAAGGGCCT TTTAAAATTC AAAAAGTCTA 3996 .......... .......... .......... .......... .......... .......... 98 AAACTTTGAT ACTTTAGAGC CTTTAAGATT CTTTTATCTT GCTTTAAATA ATTTTTAGTG 4056 .......... .......... .......... .......... .......... .......... 98 TTTGTGAGCC TCTTATGAGT GTTTAAAAAT GGATTTTGTC TCAACCCAAC ATACGTACCT 4116 .......... .......... .......... .......... .......... .......... 98 CTCAATTTTG GTGACCTTAA GGTTGGTTTT TGAATCCTTG TTCATATTTT TTAGTTTAAA 4176 .......... .......... .......... .......... .......... .......... 98 GTTTTTATAG GGTGTGAGCT TGAGCCCTCT AACTGATTTT GTGTGCCTGA TTTAAATGGG 4236 .......... .......... .......... .......... .......... .......... 98 TAGCATTGTA TTGTGTTATT GGAACTGTAC CCATGAGTTT TCAATTGTGT ATGCCGGTCA 4296 .......... .......... .......... .......... .......... .......... 98 AATAGCGAAA CGTGTGTGTT GGTTAAATTG TGTAATGATC AGAGGGTTTG ATATATTGTG 4356 .......... .......... .......... .......... .......... .......... 98 TTTTAATGAG TTTAAGGGTT CAGTTAATAA AGAATTAAAC AGAAGATTTC ATAATATAAA 4416 .......... .......... .......... .......... .......... .......... 98 CTCATACAAA TGTAGGAGTT CATTTAATTA TTTCGCCAAA TTTAAATGGT ATCTTCTAAT 4476 .......... .......... .......... .......... .......... .......... 98 TGTTATTCAA CCTGTATTTA TAAGGTTAAC CAATAAACAG AAATAGAAAT AAGATGAAAC 4536 .......... .......... .......... .......... .......... .......... 98 AGAAAATACA TAAAATACAA TAAGTAATCC GAGTCTACAA AAACTACTAT GTGTCCTTAA 4596 .......... .......... .......... .......... .......... .......... 98 GAAATTTAAT CCCCTCACTG TACACAAGGT TATGGATTAA TTTCTCCCAA GATAAAATGG 4656 .......... .......... .......... .......... .......... .......... 98 ATTAAACCTG TTAAAGAAAT AGCAGCACCT CAGATTTCTT TAACTAAAGC GAAATTCAGA 4716 .......... .......... .......... .......... .......... .......... 98 ACAACAACAA GTCACATAGA CTCAGTCGAT CGACACTTTG ATTTATTTGA GAGAAAAATA 4776 .......... .......... .......... .......... .......... .......... 98 TATGCAGAGA AGGAAAATTT TAGTGTTTGA AAAATCAAAA ATTGACTTCC TTTTATAGCC 4836 .......... .......... .......... .......... .......... .......... 98 ATTTTCAGCA AGAAACGTGT ATGTTCAAAG AAATCTGTTC AGACCCGTTT TATCCAGAAA 4896 .......... .......... .......... .......... .......... .......... 98 GTTGTGTCTT TTGGAAAAAA TAACAAGTTT TTGGAAAAAT GTGTCCGTTA GGAAAATAAC 4956 .......... .......... .......... .......... .......... .......... 98 GGCTTTTTGG AAAGTAAGGA CTTTTCGGAA AGAGTAATAA CTTTTCGGAA TGTTACCATT 5016 .......... .......... .......... .......... .......... .......... 98 AAGACATAAT ATTAACAAGA TTTATTTGAT TTAACAAAAA CTGATTAAAT AAATTTTGTC 5076 | | ||| ||||||| || |||||||||| |||||||||| ||||||||| |||||||||| ...ATA-AAT ATTAACAGGA TTTATTTGAT TTAACAAAAA GTGATTAAAT AAATTTTGTC 154 CAAAAAATTT ATCAAT-CAA TCACATCATT TGCCAAATCC AAATCCAAAT CCAAATCCTA 5135 |||||||||| |||||| ||| |||||||||| |||||||||| |||||||| CAAAAAATTT ATCAATCCAA TCACATCATT TGCCAAATCC AAATCCAA.. .......... 202 AGCCGAAGCC GAGCGAAC-- --GA-C---G ACGACGGCGC GAGGGGGCAT CTTCTTCTTA 5187 || | | | |||||||| |||||||||| |||||||||| .......... ........AG CCGAGCGATG ATGACGGCGC GAGGGGGCAT CTTCTTCTTA 244 GCTCTTTAAG AATTAATGGA AGTGTTTCCT TATATAAGGA CAACAATTTC CCTTTCTTTT 5247 |||||||||| || ||||||| |||||||||| ||||||| || |||||||||| |||||||||| GCTCTTTAAG AAGTAATGGA AGTGTTTCCT TATATAACGA CAACAATTTC CCTTTCTTTT 304 GATGACATAG GAGAAATGAC TTTTCATTTG CACTTTGTAA ATGACTTTTC ATTTTCCCTC 5307 | || || | ||||||||| |||||||||| ||||||| || |||||||||| | |||||||| GCCGATATGG GAGAAATGAA TTTTCATTTG CACTTTGCAA ATGACTTTTC AGTTTCCCTC 364 CAAAATAGTT CCCTTACTTT TCATATTCTC TCTTTTCTTT TCTCATTCAC ACATGTTAA 5366 || | ||||| |||| || || |||||||||| | |||| ||| || ||||||| || || ||| CAGAGTAGTT CCCTCACCTT TCATATTCTC T-TTTTTTTT TCCCATTCAC ACTTGCTAA 422 hqPGS_C06HBa0153O03.1-5+_SGN-E542859- (5020 5123,5154 5366) ******************************************************************************** EST sequence 34 -strand 598 n (File: SGN-E301820-) 1 TTTTCAAAAT GTTACCGTTA ACCGCAAATT TATTAGAGAT AAATATTAAC AGGATTTATT 61 TGATTTAACA AAAAGTGATT AAATAAATTT TGTCCAAAAA ATTTATCAAT CCAATCACAT 121 CATTTGCCAA ATCCAAATCC AAAGCCGAGC GATGATGACG GCGCGAGGGG GCATCTTCTT 181 CTTAGCTCTT TAAGAAGTAA TGGAAGTGTT TCCTTATATA ACGACAACAA TTTCCCTTTC 241 TTTTGCCGAT ATGGGAGAAA TGAATTTTCA TTTGCCCTTT GCAAATGACT TTTCAGTTTC 301 CCTCCAGAGT AGTTCCCTCA CCTTTCATAT TCTCTTTTTT TTTTCCCATT CACACTTGCT 361 AAACCCCAAC AAGAACAAGG TATTATTATA ATTTGGTCCA AAATGTGGTT AAGATTTGGG 421 AGTTTATCTT GTCCCAGTGA TTAATGATAT TTGATATATA AAGAACAATG CCGAAAATAT 481 GTTTGAACTA GTCGATATAG ATCAATCTTT TAACTCGTGA CGTTTTTAGC TCAAAAATAT 541 TTTTTCCCCC TAACCATTAC TCCAAAAAAA AAAAAAAAAA AACTCGAGAC AGTTCTCT Predicted gene structure (within gDNA segment 3905 to 7846): Exon 1 5024 5123 ( 100 n); cDNA 42 142 ( 101 n); score: 0.955 Intron 1 5124 5153 ( 30 n); Pd: 0.000 (s: 0.95), Pa: 0.000 (s: 0.50) ?? Exon 2 5154 5366 ( 213 n); cDNA 143 362 ( 220 n); score: 0.812 PPA cDNA 564 583 MATCH C06HBa0153O03.1-5+ SGN-E301820- 0.858 313 0.523 C PGS_C06HBa0153O03.1-5+_SGN-E301820- (5024 5123,5154 5366) Alignment (genomic DNA sequence = upper lines): AATATTAACA AGATTTATTT GATTTAACAA AAACTGATTA AATAAATTTT GTCCAAAAAA 5083 |||||||||| ||||||||| |||||||||| ||| |||||| |||||||||| |||||||||| AATATTAACA GGATTTATTT GATTTAACAA AAAGTGATTA AATAAATTTT GTCCAAAAAA 101 TTTATCAAT- CAATCACATC ATTTGCCAAA TCCAAATCCA AATCCAAATC CTAAGCCGAA 5142 ||||||||| |||||||||| |||||||||| |||||||||| | TTTATCAATC CAATCACATC ATTTGCCAAA TCCAAATCCA A......... .......... 142 GCCGAGCGAA C----GA-CG ACG---ACGG CGCGAGGGGG CATCTTCTTC TTAGCTCTTT 5194 || || | | |||| |||||||||| |||||||||| |||||||||| .......... .AGCCGAGCG ATGATGACGG CGCGAGGGGG CATCTTCTTC TTAGCTCTTT 191 AAGAATTAAT GGAAGTGTTT CCTTATATAA GGACAACAAT TTCCCTTTCT TTTGATGACA 5254 ||||| |||| |||||||||| |||||||||| ||||||||| |||||||||| |||| || | AAGAAGTAAT GGAAGTGTTT CCTTATATAA CGACAACAAT TTCCCTTTCT TTTGCCGATA 251 TAGGAGAAAT GACTTTTCAT TTGCACTTTG TAAATGACTT TTCATTTTCC CTCCAAAATA 5314 | |||||||| || ||||||| |||| ||||| ||||||||| |||| ||||| ||||| | || TGGGAGAAAT GAATTTTCAT TTGCCCTTTG CAAATGACTT TTCAGTTTCC CTCCAGAGTA 311 GTTCCCTTAC TTTTCATATT CTCTCTTTTC TTTTCTCATT CACACATGTT AA 5366 ||||||| || ||||||||| |||| |||| ||||| |||| ||||| || | || GTTCCCTCAC CTTTCATATT CTCT-TTTTT TTTTCCCATT CACACTTGCT AA 362 hqPGS_C06HBa0153O03.1-5+_SGN-E301820- (5024 5123,5154 5366) ******************************************************************************** EST sequence 15 -strand 417 n (File: SGN-E250410-) 1 TAAATTTTGT CCACATAATA TATCAATCAA TCACATCATT TGCCAAATCC AAATCCAAAT 61 CCAAAGCAAT GACTTTTCAT TTGCACTTTG CAAATGACTT TTCATTTTCC TTCCAAAGTA 121 GTTCCCTCAC TTTTGATATT CTCTTTTTTA TTTTTCCATT CACACTTGTT AAAACCCAAT 181 ATAAACATCA TTTTTGTATG TGACACATAA CAAAGAAGAT TCCAGAATAT TTAGGTCATG 241 TTTATCATTA GTATGCTGAT TTTAGTAGCA AGTTTAGTTG GTATATTCAC GATACAACTA 301 CTCTTAAGAA ATTTGAAACA ATATAGATAG TAATAATGGG GACGTATAAT TTAGGAAAAA 361 TTAGCGGGTT GTAGAGAATA TATGCTATTC GTGAAAATTG ATTCCAGCAT ATGTGTG Predicted gene structure (within gDNA segment 4276 to 7846): Exon 1 5066 5129 ( 64 n); cDNA 1 64 ( 64 n); score: 0.953 Intron 1 5130 5258 ( 129 n); Pd: 0.000 (s: 0.96), Pa: 0.000 (s: 0.94) Exon 2 5259 5376 ( 118 n); cDNA 65 182 ( 118 n); score: 0.864 MATCH C06HBa0153O03.1-5+ SGN-E250410- 0.896 182 0.436 C PGS_C06HBa0153O03.1-5+_SGN-E250410- (5066 5129,5259 5376) Alignment (genomic DNA sequence = upper lines): TAAATTTTGT CCAAAAAATT TATCAATCAA TCACATCATT TGCCAAATCC AAATCCAAAT 5125 |||||||||| ||| | ||| |||||||||| |||||||||| |||||||||| |||||||||| TAAATTTTGT CCACATAATA TATCAATCAA TCACATCATT TGCCAAATCC AAATCCAAAT 60 CCAAATCCTA AGCCGAAGCC GAGCGAACGA CGACGACGGC GCGAGGGGGC ATCTTCTTCT 5185 |||| CCAA...... .......... .......... .......... .......... .......... 64 TAGCTCTTTA AGAATTAATG GAAGTGTTTC CTTATATAAG GACAACAATT TCCCTTTCTT 5245 .......... .......... .......... .......... .......... .......... 64 TTGATGACAT AGGAGAAATG ACTTTTCATT TGCACTTTGT AAATGACTTT TCATTTTCCC 5305 || |||| |||||||||| ||||||||| |||||||||| ||||||||| .......... ...AGCAATG ACTTTTCATT TGCACTTTGC AAATGACTTT TCATTTTCCT 111 TCCAAAATAG TTCCCTTACT TTTCATATTC TCTCTTTTCT TTTCTCATTC ACACATGTTA 5365 |||||| ||| |||||| ||| ||| |||||| ||| |||| | ||| ||||| |||| ||||| TCCAAAGTAG TTCCCTCACT TTTGATATTC TCTTTTTTAT TTTTCCATTC ACACTTGTTA 171 AATCTAACAA T 5376 || | | | | AAACCCAATA T 182 hqPGS_C06HBa0153O03.1-5+_SGN-E250410- (5066 5129,5259 5376) ******************************************************************************** EST sequence 17 -strand 514 n (File: SGN-E255327-) 1 TAAAGTTTTT ATGTTTTTTT AATCGTTAAT CGAATGATTT TAAAAATTCA AGTAGAGTAG 61 AAAACCATAA AGGTTTTAAT CTCCAGAAAC CTCAAACACA GAAACTGTAT AAATTTTCTT 121 AAGATTGTTA CTCTTATAAT AACTTTTATT TATAAAGTTA ATAAACAGAA ACAGAAATAA 181 GATGAAGCAG AAAATAATAT AGAATACAAG AATTAATCCG AGTCCACAGA AACTACTATG 241 TGTCCTTAAG AAATTTAATT CCCTCACTGT ACCCAAGGTT ATGGATTAAT TTCTCTCAAG 301 ATAAAACGGA TTAAACCTGT TAAAGAAATA GCGGTACCTC AAACTTCTTT AACTTCAACG 361 AACTTAAGAA CAACAACAAG TCACACAGAC TCAGTCGATC GACACTTTGA TTTTATTTGA 421 GAGAAAAATA AATGCAGAGA AAGAAAAATT TTCAGTGTTT CAAAAATCAA AATTTGACTT 481 CCTTTTATAG CCATTTTCAG CAAGGAACAT GTCC Predicted gene structure (within gDNA segment 1077 to 5548): Exon 1 2088 2092 ( 5 n); cDNA 136 140 ( 5 n); score: 0.800 Intron 1 2093 4484 (2392 n); Pd: 0.311 (s: 0), Pa: 0.000 (s: 0.84) Exon 2 4485 4856 ( 372 n); cDNA 141 512 ( 372 n); score: 0.879 MATCH C06HBa0153O03.1-5+ SGN-E255327- 0.879 377 0.733 C PGS_C06HBa0153O03.1-5+_SGN-E255327- (2088 2092,4485 4856) Alignment (genomic DNA sequence = upper lines): ATCATGTGAG AATCAAAAAA ATCCAGTGTC TCCCCCACAC TAAAAATAGG TGGAGTCACC 2147 || || ATAAT..... .......... .......... .......... .......... .......... 140 GGCCAAAGTG ACCTAAAACA TGCTAGTGTA CATGGTAATT TAACCCCACC AGATCCCTAT 2207 .......... .......... .......... .......... .......... .......... 140 ATTGGTTGTA TAGGTTCGAA GACTAAGAGA TATAAGGAGA CCTATACCCA GCAAGCCGAA 2267 .......... .......... .......... .......... .......... .......... 140 AAGTCTCATC TCATGAGAGT TACGTGAACC TCTTCCCTTC CCGAAAGAAG GCATCACCGC 2327 .......... .......... .......... .......... .......... .......... 140 TCATAGCCAT CCTAGCGGTG CTCAGTATAA AGTTCCATTT CATTCATCTC ATACGTCATG 2387 .......... .......... .......... .......... .......... .......... 140 GAAGTTGGCT TCTAGTATGA GGACATCATA GCTCATTAGA TGATTTCTCC ATCTCATCAT 2447 .......... .......... .......... .......... .......... .......... 140 TAGTATTAAG TGTGTTAGGC TCAATACTTT CATTAGAGTG TTCATAGAGA CTGGTCTCTT 2507 .......... .......... .......... .......... .......... .......... 140 CATTATCTTA CACTCTCATA GATGAGTACG TTTTGGTAAC ATTTACTCGG GCTCATTTAA 2567 .......... .......... .......... .......... .......... .......... 140 TATTACTTTC ATCATATACA TTAGCCTCAT ATCATGTTGT CACCACATTC CTTAACATTA 2627 .......... .......... .......... .......... .......... .......... 140 GCACCTTTGC TTTTCATAGT TACTCACCTC TTACTACGTG AATGTTCCTT TCATCATATG 2687 .......... .......... .......... .......... .......... .......... 140 TCAATTTTTG GCCGTTTAAT GTGTTTTACA CCCTTACTTA CCCCTTCTAG GGTTTCATAA 2747 .......... .......... .......... .......... .......... .......... 140 TTTCATTATT CATTACATAA TTTCATTGTT AATTTCATCA TATGCCAATG CACTTGGACA 2807 .......... .......... .......... .......... .......... .......... 140 TTTAGTGTAT TTTACACTCA TACTTAACCT TCTAGGATTC ATCATTTCCT TACATACATC 2867 .......... .......... .......... .......... .......... .......... 140 TTAGGTTCAC TTACTTTTGA ACGTATGCTA GACTTATGAA TCTATACACA CAAGACATGG 2927 .......... .......... .......... .......... .......... .......... 140 GGCTTCATTC ATAAATTTTT AAGTGATTCA TACATAGGGA GACTAAGTCT CGACCCACAA 2987 .......... .......... .......... .......... .......... .......... 140 CCCCCACCTA CGAGGCGTGG TTCCACCCAA GGATCGTGCG CACATTCGCA GGTCATGTTG 3047 .......... .......... .......... .......... .......... .......... 140 TGTTTTGACA GCTTTTTGGG CCTTCTTTTG GGTCCTCCTC AAGGACCCTT GGGAGGTCCT 3107 .......... .......... .......... .......... .......... .......... 140 TAGGGTCACG CCTTGATGTT TAGGTCCTTA AACATTAATA CTAAGTAGGG GAGGTTATTT 3167 .......... .......... .......... .......... .......... .......... 140 TATGTCTCTT ATCTTTACGT TCAACCTCAT TTAGGACACT AGACTACATG TTAGGCTCTA 3227 .......... .......... .......... .......... .......... .......... 140 ACTTAGGTCA TTAGATTTTT GGGGTGTTAC AAATCATCTG CTCAAATAAC CAGTTAGGCC 3287 .......... .......... .......... .......... .......... .......... 140 AGAAGGAAGT TAGACTGAAG ATATGCTTGA AAAGATCTAT AACAAGGTTT AAGGGTCTGA 3347 .......... .......... .......... .......... .......... .......... 140 TAAATTGTTG AAGGAATTCA AAAATGACTT ATCCACTCTA TTCCCAGACG ATGACTTCTA 3407 .......... .......... .......... .......... .......... .......... 140 ACACAGTTTC CATTAAGTAA CTAGAGACCA TTCTAGGACA AATTGGCACT CTTCACAAGC 3467 .......... .......... .......... .......... .......... .......... 140 AAAGGCAAAT GGAAACATTT CCTAGAAATA CCATCCCAAA CCCCAATAAC TATTGTTCAG 3527 .......... .......... .......... .......... .......... .......... 140 TTAAAGAATT TTGTCGTCCC ATAACATTAA ATAAGCAATC GCTTAGGGGA CACCAAAGTT 3587 .......... .......... .......... .......... .......... .......... 140 TTTACCTTTG TTTTTATTGC TTTTAAATAA CATGTGTTCT GAGTGCAGGT TGAGAAGAGG 3647 .......... .......... .......... .......... .......... .......... 140 TGGAAAAATT GAAAAGAGAA GCATAGTCAA TGCCAAGACC CATAAAATGA TATGTCGAAG 3707 .......... .......... .......... .......... .......... .......... 140 GCATTGTGCG GACAGCGGAC TAATTGTTGT CTACCTTTTG AGCTTGAATA GACATAAAAT 3767 .......... .......... .......... .......... .......... .......... 140 GTGCTTGCCA AAATTTGCAA TCAGAACTCC AAAAAGGAAT TTATGGAAGT GCAGTAAGGA 3827 .......... .......... .......... .......... .......... .......... 140 GACATAGTCG GGTCGGCCAA AGCAACCTTC GGGTTTAGTA ATTGTTAAGG TATGTTCAGC 3887 .......... .......... .......... .......... .......... .......... 140 GAGCTTGAGC TCAAAATAGT GATTTATCGA ATACATTCAG CGACAGTAGA TATCACCAAA 3947 .......... .......... .......... .......... .......... .......... 140 TAAGTATGGT ATAAAAATAC CAAGGGCCTT TTAAAATTCA AAAAGTCTAA AACTTTGATA 4007 .......... .......... .......... .......... .......... .......... 140 CTTTAGAGCC TTTAAGATTC TTTTATCTTG CTTTAAATAA TTTTTAGTGT TTGTGAGCCT 4067 .......... .......... .......... .......... .......... .......... 140 CTTATGAGTG TTTAAAAATG GATTTTGTCT CAACCCAACA TACGTACCTC TCAATTTTGG 4127 .......... .......... .......... .......... .......... .......... 140 TGACCTTAAG GTTGGTTTTT GAATCCTTGT TCATATTTTT TAGTTTAAAG TTTTTATAGG 4187 .......... .......... .......... .......... .......... .......... 140 GTGTGAGCTT GAGCCCTCTA ACTGATTTTG TGTGCCTGAT TTAAATGGGT AGCATTGTAT 4247 .......... .......... .......... .......... .......... .......... 140 TGTGTTATTG GAACTGTACC CATGAGTTTT CAATTGTGTA TGCCGGTCAA ATAGCGAAAC 4307 .......... .......... .......... .......... .......... .......... 140 GTGTGTGTTG GTTAAATTGT GTAATGATCA GAGGGTTTGA TATATTGTGT TTTAATGAGT 4367 .......... .......... .......... .......... .......... .......... 140 TTAAGGGTTC AGTTAATAAA GAATTAAACA GAAGATTTCA TAATATAAAC TCATACAAAT 4427 .......... .......... .......... .......... .......... .......... 140 GTAGGAGTTC ATTTAATTAT TTCGCCAAAT TTAAATGGTA TCTTCTAATT GTTATTCAAC 4487 ||| .......... .......... .......... .......... .......... .......AAC 143 CTGTATTTAT AAGGTTAACC AATAAACAGA AATAGAAATA AGATGAAACA GAAAAT-ACA 4546 | ||||||| || ||||| |||||||| || ||||||| ||||||| || |||||| | | TTTTATTTAT AAAGTTAA-- --TAAACAGA AACAGAAATA AGATGAAGCA GAAAATAATA 199 TAAAATACAA TAAGTAATCC GAGTCTACAA AAACTACTAT GTGTCCTTAA GAAATTTAAT 4606 || ||||||| || |||||| ||||| ||| |||||||||| |||||||||| |||||||||| TAGAATACAA GAATTAATCC GAGTCCACAG AAACTACTAT GTGTCCTTAA GAAATTTAAT 259 CCCCTCACTG TACACAAGGT TATGGATTAA TTTCTCCCAA GATAAAATGG ATTAAACCTG 4666 ||||||||| ||| |||||| |||||||||| |||||| ||| ||||||| || |||||||||| TCCCTCACTG TACCCAAGGT TATGGATTAA TTTCTCTCAA GATAAAACGG ATTAAACCTG 319 TTAAAGAAAT AGCAGCACCT CAGATTTCTT TAACTAAAGC GAAATTCAGA ACAACAACAA 4726 |||||||||| ||| | |||| || | ||||| ||||| | | ||| || ||| |||||||||| TTAAAGAAAT AGCGGTACCT CAAACTTCTT TAACTTCAAC GAACTTAAGA ACAACAACAA 379 GTCACATAGA CTCAGTCGAT CGACACTTTG A-TTTATTTG AGAGAAAAAT ATATGCAGAG 4785 |||||| ||| |||||||||| |||||||||| | |||||||| |||||||||| | |||||||| GTCACACAGA CTCAGTCGAT CGACACTTTG ATTTTATTTG AGAGAAAAAT AAATGCAGAG 439 -AAGGAAAAT TTT-AGTGTT TGAAAAATCA AAAATTGACT TCCTTTTATA GCCATTTTCA 4843 ||| ||||| ||| |||||| | |||||||| ||| |||||| |||||||||| |||||||||| AAAGAAAAAT TTTCAGTGTT TCAAAAATCA AAATTTGACT TCCTTTTATA GCCATTTTCA 499 GCAAGAAACG TGT 4856 ||||| ||| ||| GCAAGGAACA TGT 512 hqPGS_C06HBa0153O03.1-5+_SGN-E255327- (4485 4856) ******************************************************************************** EST sequence 32 -strand 577 n (File: SGN-E369760-) 1 TAAGTATTTA AAATACTAAC AGTAAAGAGC AAATCTTTAC ACAACCTGTT TCTGATTTAA 61 CGATAAAGTT TTTATGTTTT TTTAATCGTT AATCGAATGA TTTTAAAAAT TCAAGTAGAG 121 TAGAAAACCA TAAAGGTTTT AATCTCCAGA AACCTCAAAC ACAGAAACTG TATAAATTTT 181 CTTAAGATTG TTACTCTTAT AATAACTTTT ATTTATAAAG TTAATAAACA GAAACAGAAA 241 TAAGATGAAG CAGAAAATAA TATAGAATAC AAGAATTAAT CCGAGTCCAC AGAAACTACT 301 ATGTGTCCTT AAGAAATTTA ATTCCCTCAC TGTACCCAAG GTTATGGATT AATTTCTCTC 361 AAGATAAAAC GGATTAAACC TGTTAAAGAA ATAGCGGTAC CTCAAACTTC TTTAACTTCA 421 ACGAACTTAA GAACAACAAC AAGTCACACA GACTCAGTCG ATCGACACTT TGATTTTATT 481 TGAGAGAAAA ATAAATGCAG AGAAAGAAAA ATTTTCAGTG TTTAAAAAAT CAAAATTTGA 541 CTTCCTTTTA TAGCCATTTT CAGCAAGGAA CATGTCC Predicted gene structure (within gDNA segment 1047 to 5548): Exon 1 2088 2092 ( 5 n); cDNA 199 203 ( 5 n); score: 0.800 Intron 1 2093 4484 (2392 n); Pd: 0.311 (s: 0), Pa: 0.000 (s: 0.84) Exon 2 4485 4856 ( 372 n); cDNA 204 575 ( 372 n); score: 0.879 MATCH C06HBa0153O03.1-5+ SGN-E369760- 0.879 377 0.653 C PGS_C06HBa0153O03.1-5+_SGN-E369760- (2088 2092,4485 4856) Alignment (genomic DNA sequence = upper lines): ATCATGTGAG AATCAAAAAA ATCCAGTGTC TCCCCCACAC TAAAAATAGG TGGAGTCACC 2147 || || ATAAT..... .......... .......... .......... .......... .......... 203 GGCCAAAGTG ACCTAAAACA TGCTAGTGTA CATGGTAATT TAACCCCACC AGATCCCTAT 2207 .......... .......... .......... .......... .......... .......... 203 ATTGGTTGTA TAGGTTCGAA GACTAAGAGA TATAAGGAGA CCTATACCCA GCAAGCCGAA 2267 .......... .......... .......... .......... .......... .......... 203 AAGTCTCATC TCATGAGAGT TACGTGAACC TCTTCCCTTC CCGAAAGAAG GCATCACCGC 2327 .......... .......... .......... .......... .......... .......... 203 TCATAGCCAT CCTAGCGGTG CTCAGTATAA AGTTCCATTT CATTCATCTC ATACGTCATG 2387 .......... .......... .......... .......... .......... .......... 203 GAAGTTGGCT TCTAGTATGA GGACATCATA GCTCATTAGA TGATTTCTCC ATCTCATCAT 2447 .......... .......... .......... .......... .......... .......... 203 TAGTATTAAG TGTGTTAGGC TCAATACTTT CATTAGAGTG TTCATAGAGA CTGGTCTCTT 2507 .......... .......... .......... .......... .......... .......... 203 CATTATCTTA CACTCTCATA GATGAGTACG TTTTGGTAAC ATTTACTCGG GCTCATTTAA 2567 .......... .......... .......... .......... .......... .......... 203 TATTACTTTC ATCATATACA TTAGCCTCAT ATCATGTTGT CACCACATTC CTTAACATTA 2627 .......... .......... .......... .......... .......... .......... 203 GCACCTTTGC TTTTCATAGT TACTCACCTC TTACTACGTG AATGTTCCTT TCATCATATG 2687 .......... .......... .......... .......... .......... .......... 203 TCAATTTTTG GCCGTTTAAT GTGTTTTACA CCCTTACTTA CCCCTTCTAG GGTTTCATAA 2747 .......... .......... .......... .......... .......... .......... 203 TTTCATTATT CATTACATAA TTTCATTGTT AATTTCATCA TATGCCAATG CACTTGGACA 2807 .......... .......... .......... .......... .......... .......... 203 TTTAGTGTAT TTTACACTCA TACTTAACCT TCTAGGATTC ATCATTTCCT TACATACATC 2867 .......... .......... .......... .......... .......... .......... 203 TTAGGTTCAC TTACTTTTGA ACGTATGCTA GACTTATGAA TCTATACACA CAAGACATGG 2927 .......... .......... .......... .......... .......... .......... 203 GGCTTCATTC ATAAATTTTT AAGTGATTCA TACATAGGGA GACTAAGTCT CGACCCACAA 2987 .......... .......... .......... .......... .......... .......... 203 CCCCCACCTA CGAGGCGTGG TTCCACCCAA GGATCGTGCG CACATTCGCA GGTCATGTTG 3047 .......... .......... .......... .......... .......... .......... 203 TGTTTTGACA GCTTTTTGGG CCTTCTTTTG GGTCCTCCTC AAGGACCCTT GGGAGGTCCT 3107 .......... .......... .......... .......... .......... .......... 203 TAGGGTCACG CCTTGATGTT TAGGTCCTTA AACATTAATA CTAAGTAGGG GAGGTTATTT 3167 .......... .......... .......... .......... .......... .......... 203 TATGTCTCTT ATCTTTACGT TCAACCTCAT TTAGGACACT AGACTACATG TTAGGCTCTA 3227 .......... .......... .......... .......... .......... .......... 203 ACTTAGGTCA TTAGATTTTT GGGGTGTTAC AAATCATCTG CTCAAATAAC CAGTTAGGCC 3287 .......... .......... .......... .......... .......... .......... 203 AGAAGGAAGT TAGACTGAAG ATATGCTTGA AAAGATCTAT AACAAGGTTT AAGGGTCTGA 3347 .......... .......... .......... .......... .......... .......... 203 TAAATTGTTG AAGGAATTCA AAAATGACTT ATCCACTCTA TTCCCAGACG ATGACTTCTA 3407 .......... .......... .......... .......... .......... .......... 203 ACACAGTTTC CATTAAGTAA CTAGAGACCA TTCTAGGACA AATTGGCACT CTTCACAAGC 3467 .......... .......... .......... .......... .......... .......... 203 AAAGGCAAAT GGAAACATTT CCTAGAAATA CCATCCCAAA CCCCAATAAC TATTGTTCAG 3527 .......... .......... .......... .......... .......... .......... 203 TTAAAGAATT TTGTCGTCCC ATAACATTAA ATAAGCAATC GCTTAGGGGA CACCAAAGTT 3587 .......... .......... .......... .......... .......... .......... 203 TTTACCTTTG TTTTTATTGC TTTTAAATAA CATGTGTTCT GAGTGCAGGT TGAGAAGAGG 3647 .......... .......... .......... .......... .......... .......... 203 TGGAAAAATT GAAAAGAGAA GCATAGTCAA TGCCAAGACC CATAAAATGA TATGTCGAAG 3707 .......... .......... .......... .......... .......... .......... 203 GCATTGTGCG GACAGCGGAC TAATTGTTGT CTACCTTTTG AGCTTGAATA GACATAAAAT 3767 .......... .......... .......... .......... .......... .......... 203 GTGCTTGCCA AAATTTGCAA TCAGAACTCC AAAAAGGAAT TTATGGAAGT GCAGTAAGGA 3827 .......... .......... .......... .......... .......... .......... 203 GACATAGTCG GGTCGGCCAA AGCAACCTTC GGGTTTAGTA ATTGTTAAGG TATGTTCAGC 3887 .......... .......... .......... .......... .......... .......... 203 GAGCTTGAGC TCAAAATAGT GATTTATCGA ATACATTCAG CGACAGTAGA TATCACCAAA 3947 .......... .......... .......... .......... .......... .......... 203 TAAGTATGGT ATAAAAATAC CAAGGGCCTT TTAAAATTCA AAAAGTCTAA AACTTTGATA 4007 .......... .......... .......... .......... .......... .......... 203 CTTTAGAGCC TTTAAGATTC TTTTATCTTG CTTTAAATAA TTTTTAGTGT TTGTGAGCCT 4067 .......... .......... .......... .......... .......... .......... 203 CTTATGAGTG TTTAAAAATG GATTTTGTCT CAACCCAACA TACGTACCTC TCAATTTTGG 4127 .......... .......... .......... .......... .......... .......... 203 TGACCTTAAG GTTGGTTTTT GAATCCTTGT TCATATTTTT TAGTTTAAAG TTTTTATAGG 4187 .......... .......... .......... .......... .......... .......... 203 GTGTGAGCTT GAGCCCTCTA ACTGATTTTG TGTGCCTGAT TTAAATGGGT AGCATTGTAT 4247 .......... .......... .......... .......... .......... .......... 203 TGTGTTATTG GAACTGTACC CATGAGTTTT CAATTGTGTA TGCCGGTCAA ATAGCGAAAC 4307 .......... .......... .......... .......... .......... .......... 203 GTGTGTGTTG GTTAAATTGT GTAATGATCA GAGGGTTTGA TATATTGTGT TTTAATGAGT 4367 .......... .......... .......... .......... .......... .......... 203 TTAAGGGTTC AGTTAATAAA GAATTAAACA GAAGATTTCA TAATATAAAC TCATACAAAT 4427 .......... .......... .......... .......... .......... .......... 203 GTAGGAGTTC ATTTAATTAT TTCGCCAAAT TTAAATGGTA TCTTCTAATT GTTATTCAAC 4487 ||| .......... .......... .......... .......... .......... .......AAC 206 CTGTATTTAT AAGGTTAACC AATAAACAGA AATAGAAATA AGATGAAACA GAAAAT-ACA 4546 | ||||||| || ||||| |||||||| || ||||||| ||||||| || |||||| | | TTTTATTTAT AAAGTTAA-- --TAAACAGA AACAGAAATA AGATGAAGCA GAAAATAATA 262 TAAAATACAA TAAGTAATCC GAGTCTACAA AAACTACTAT GTGTCCTTAA GAAATTTAAT 4606 || ||||||| || |||||| ||||| ||| |||||||||| |||||||||| |||||||||| TAGAATACAA GAATTAATCC GAGTCCACAG AAACTACTAT GTGTCCTTAA GAAATTTAAT 322 CCCCTCACTG TACACAAGGT TATGGATTAA TTTCTCCCAA GATAAAATGG ATTAAACCTG 4666 ||||||||| ||| |||||| |||||||||| |||||| ||| ||||||| || |||||||||| TCCCTCACTG TACCCAAGGT TATGGATTAA TTTCTCTCAA GATAAAACGG ATTAAACCTG 382 TTAAAGAAAT AGCAGCACCT CAGATTTCTT TAACTAAAGC GAAATTCAGA ACAACAACAA 4726 |||||||||| ||| | |||| || | ||||| ||||| | | ||| || ||| |||||||||| TTAAAGAAAT AGCGGTACCT CAAACTTCTT TAACTTCAAC GAACTTAAGA ACAACAACAA 442 GTCACATAGA CTCAGTCGAT CGACACTTTG A-TTTATTTG AGAGAAAAAT ATATGCAGAG 4785 |||||| ||| |||||||||| |||||||||| | |||||||| |||||||||| | |||||||| GTCACACAGA CTCAGTCGAT CGACACTTTG ATTTTATTTG AGAGAAAAAT AAATGCAGAG 502 -AAGGAAAAT TTT-AGTGTT TGAAAAATCA AAAATTGACT TCCTTTTATA GCCATTTTCA 4843 ||| ||||| ||| |||||| | |||||||| ||| |||||| |||||||||| |||||||||| AAAGAAAAAT TTTCAGTGTT TAAAAAATCA AAATTTGACT TCCTTTTATA GCCATTTTCA 562 GCAAGAAACG TGT 4856 ||||| ||| ||| GCAAGGAACA TGT 575 hqPGS_C06HBa0153O03.1-5+_SGN-E369760- (4485 4856) ******************************************************************************** EST sequence 8 -strand 606 n (File: SGN-E262710-) 1 TTTTTTTCTG TCACAATTGG CAGATAAATT AAGTATTTAA AATACTAACA GTAAAGAGCA 61 AATCTTTACA CAACCTGTTT CTGATTTAAC GATAAAGTTT TTATGTTTTT TTAATCGTTA 121 ATCGAATGAT TTTAAAAATT CAAGTAGAGT AGAAAACCAT AAAGGTTTTA ATCTCCAGAA 181 ACCTCAAACA CAGAAACTGT ATAAATTTTC TTAAGATTGT TACTCTTATA ATAACTTTTA 241 TTTATAAAGT TAATAAACAG AAACAGAAAT AAGATGAAGC AGAAAATAAT ATAGAATACA 301 AGAATTAATC CGAGTCCACA GAAACTACTA TGTGTCCTTA AGAAATTTAA TTCCCTCACT 361 GTACCCAAGG TTATGGATTA ATTTCTCTCA AGATAAAACG GATTAAACCT GTTAAAGAAA 421 TAGCGGTACC TCAAACTTCT TTAACTTCAA CGAACTTAAG AACAACAACA AGTCACACAG 481 ACTCAGTCGA TCGACACTTT GATTTTATTT GAGAGAAAAA TAAATGCAGA GAAAGAAAAA 541 TTTTCAGTGT TTAAAAAATC AAAATTTGAC TTCCTTTTAT AGCCATTTTC AGCAAGGAAC 601 ATGTCC Predicted gene structure (within gDNA segment 157 to 5548): Exon 1 876 891 ( 16 n); cDNA 223 238 ( 16 n); score: 0.812 Intron 1 892 4490 (3599 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.86) Exon 2 4491 4856 ( 366 n); cDNA 239 604 ( 366 n); score: 0.883 MATCH C06HBa0153O03.1-5+ SGN-E262710- 0.883 382 0.630 C PGS_C06HBa0153O03.1-5+_SGN-E262710- (876 891,4491 4856) Alignment (genomic DNA sequence = upper lines): CTCTAATAGA AACTTTGTCA CGACCGGCAT CTAGACCTCA TAAGAGACCA GCGTCGATGA 935 |||| ||| |||||| CTCTTATAAT AACTTT.... .......... .......... .......... .......... 238 CCTCTCAGAG GTCGCAGACA AGCCTACTTA CGTCATTCTT ACTTTACATA GGTTAATTTT 995 .......... .......... .......... .......... .......... .......... 238 AGCGGAAAAT TTTTGTTTAT AAACATACTT ATAAAACGAT TCTTATTAAT ACTAATTGTT 1055 .......... .......... .......... .......... .......... .......... 238 CACCATCAAC CATATAACAT TAGGAGATAA GCAACAGAAA TAAATAGTTC AATTCTTATT 1115 .......... .......... .......... .......... .......... .......... 238 AAGTGCATAA TTGCCAAAAC GGCACCAATA CATAGTCTGA ATAAAAACAG GAAGGAAACG 1175 .......... .......... .......... .......... .......... .......... 238 CTAGTGGAAC ATGCTCCACT AGCTCAACTC TAAAACTAAG CTAGAATATA AAACAGTGGC 1235 .......... .......... .......... .......... .......... .......... 238 ATCCTCGAAA GCATGACGAC CTACCAACTC CGAACGAATG CTCGACGTTT GGATAATTGC 1295 .......... .......... .......... .......... .......... .......... 238 AACGATGATC TTGTAGCTCT ATCGTCCATC TGTGTCTGCA CCTAAAAATA GTAGAGTTTG 1355 .......... .......... .......... .......... .......... .......... 238 TATAGGGTTA GTACACACTT TTAATAAGTA TGGGTATATG CAAGAACACA CCACGAATAT 1415 .......... .......... .......... .......... .......... .......... 238 GCATGAGAAA GAATAACTCT TTCTTAACAA CATGACTTTT TGGAAGTCAA GTCAGTGGAC 1475 .......... .......... .......... .......... .......... .......... 238 TTGCCAAATT TAGATTAGGA GAGTTACCAA ATTTGGAATA GGAAAGTCAA TGAGCTTTCC 1535 .......... .......... .......... .......... .......... .......... 238 AAATTTGGAA TAGGAAAGTC AGTAAGCTTT CCTGATTTGG AATAGGCTAG TTATGCCATG 1595 .......... .......... .......... .......... .......... .......... 238 AGTTTAACAC ACATCATCAT ACTTTGCACC TTTGCACACA CCACATAACA TTTACACATA 1655 .......... .......... .......... .......... .......... .......... 238 GCACATATCA TATAGCACAC TGCACAATTT GCATGAAGCA CATATTTTCT TTAATATCAT 1715 .......... .......... .......... .......... .......... .......... 238 TCATTCATAT GCCATAAGAC CTTTGGATCA TGGACTTAAT GTTAAGACAT CCCATAAATG 1775 .......... .......... .......... .......... .......... .......... 238 AGGTCTCAAT AGATGGGACC TCAACTAGAG AGTCTTCATT AGCAAACACA GAATCTGTTT 1835 .......... .......... .......... .......... .......... .......... 238 CGTTCATTCA TACGTACTCC ATTTCATTTC ATTCATAGGC CAGTATAAAC ACCAGCTCTA 1895 .......... .......... .......... .......... .......... .......... 238 CCTAGGATGT AGTTTTAGAC TTTCATTAAA TTCGTCATGA AATGACCAAG AATGACCTAA 1955 .......... .......... .......... .......... .......... .......... 238 TGTCATTACT TGAATCTAAC TCACCTTTTG ATTACCCTAT CCTAATACCT TTGCTATCAT 2015 .......... .......... .......... .......... .......... .......... 238 TCATTTCATT ATGTGCATCA TTTGAGGCTG GCCTCATTCT TTCATTGGAA ACTTTTACTT 2075 .......... .......... .......... .......... .......... .......... 238 TAACCGAAAT AGATCATGTG AGAATCAAAA AAATCCAGTG TCTCCCCCAC ACTAAAAATA 2135 .......... .......... .......... .......... .......... .......... 238 GGTGGAGTCA CCGGCCAAAG TGACCTAAAA CATGCTAGTG TACATGGTAA TTTAACCCCA 2195 .......... .......... .......... .......... .......... .......... 238 CCAGATCCCT ATATTGGTTG TATAGGTTCG AAGACTAAGA GATATAAGGA GACCTATACC 2255 .......... .......... .......... .......... .......... .......... 238 CAGCAAGCCG AAAAGTCTCA TCTCATGAGA GTTACGTGAA CCTCTTCCCT TCCCGAAAGA 2315 .......... .......... .......... .......... .......... .......... 238 AGGCATCACC GCTCATAGCC ATCCTAGCGG TGCTCAGTAT AAAGTTCCAT TTCATTCATC 2375 .......... .......... .......... .......... .......... .......... 238 TCATACGTCA TGGAAGTTGG CTTCTAGTAT GAGGACATCA TAGCTCATTA GATGATTTCT 2435 .......... .......... .......... .......... .......... .......... 238 CCATCTCATC ATTAGTATTA AGTGTGTTAG GCTCAATACT TTCATTAGAG TGTTCATAGA 2495 .......... .......... .......... .......... .......... .......... 238 GACTGGTCTC TTCATTATCT TACACTCTCA TAGATGAGTA CGTTTTGGTA ACATTTACTC 2555 .......... .......... .......... .......... .......... .......... 238 GGGCTCATTT AATATTACTT TCATCATATA CATTAGCCTC ATATCATGTT GTCACCACAT 2615 .......... .......... .......... .......... .......... .......... 238 TCCTTAACAT TAGCACCTTT GCTTTTCATA GTTACTCACC TCTTACTACG TGAATGTTCC 2675 .......... .......... .......... .......... .......... .......... 238 TTTCATCATA TGTCAATTTT TGGCCGTTTA ATGTGTTTTA CACCCTTACT TACCCCTTCT 2735 .......... .......... .......... .......... .......... .......... 238 AGGGTTTCAT AATTTCATTA TTCATTACAT AATTTCATTG TTAATTTCAT CATATGCCAA 2795 .......... .......... .......... .......... .......... .......... 238 TGCACTTGGA CATTTAGTGT ATTTTACACT CATACTTAAC CTTCTAGGAT TCATCATTTC 2855 .......... .......... .......... .......... .......... .......... 238 CTTACATACA TCTTAGGTTC ACTTACTTTT GAACGTATGC TAGACTTATG AATCTATACA 2915 .......... .......... .......... .......... .......... .......... 238 CACAAGACAT GGGGCTTCAT TCATAAATTT TTAAGTGATT CATACATAGG GAGACTAAGT 2975 .......... .......... .......... .......... .......... .......... 238 CTCGACCCAC AACCCCCACC TACGAGGCGT GGTTCCACCC AAGGATCGTG CGCACATTCG 3035 .......... .......... .......... .......... .......... .......... 238 CAGGTCATGT TGTGTTTTGA CAGCTTTTTG GGCCTTCTTT TGGGTCCTCC TCAAGGACCC 3095 .......... .......... .......... .......... .......... .......... 238 TTGGGAGGTC CTTAGGGTCA CGCCTTGATG TTTAGGTCCT TAAACATTAA TACTAAGTAG 3155 .......... .......... .......... .......... .......... .......... 238 GGGAGGTTAT TTTATGTCTC TTATCTTTAC GTTCAACCTC ATTTAGGACA CTAGACTACA 3215 .......... .......... .......... .......... .......... .......... 238 TGTTAGGCTC TAACTTAGGT CATTAGATTT TTGGGGTGTT ACAAATCATC TGCTCAAATA 3275 .......... .......... .......... .......... .......... .......... 238 ACCAGTTAGG CCAGAAGGAA GTTAGACTGA AGATATGCTT GAAAAGATCT ATAACAAGGT 3335 .......... .......... .......... .......... .......... .......... 238 TTAAGGGTCT GATAAATTGT TGAAGGAATT CAAAAATGAC TTATCCACTC TATTCCCAGA 3395 .......... .......... .......... .......... .......... .......... 238 CGATGACTTC TAACACAGTT TCCATTAAGT AACTAGAGAC CATTCTAGGA CAAATTGGCA 3455 .......... .......... .......... .......... .......... .......... 238 CTCTTCACAA GCAAAGGCAA ATGGAAACAT TTCCTAGAAA TACCATCCCA AACCCCAATA 3515 .......... .......... .......... .......... .......... .......... 238 ACTATTGTTC AGTTAAAGAA TTTTGTCGTC CCATAACATT AAATAAGCAA TCGCTTAGGG 3575 .......... .......... .......... .......... .......... .......... 238 GACACCAAAG TTTTTACCTT TGTTTTTATT GCTTTTAAAT AACATGTGTT CTGAGTGCAG 3635 .......... .......... .......... .......... .......... .......... 238 GTTGAGAAGA GGTGGAAAAA TTGAAAAGAG AAGCATAGTC AATGCCAAGA CCCATAAAAT 3695 .......... .......... .......... .......... .......... .......... 238 GATATGTCGA AGGCATTGTG CGGACAGCGG ACTAATTGTT GTCTACCTTT TGAGCTTGAA 3755 .......... .......... .......... .......... .......... .......... 238 TAGACATAAA ATGTGCTTGC CAAAATTTGC AATCAGAACT CCAAAAAGGA ATTTATGGAA 3815 .......... .......... .......... .......... .......... .......... 238 GTGCAGTAAG GAGACATAGT CGGGTCGGCC AAAGCAACCT TCGGGTTTAG TAATTGTTAA 3875 .......... .......... .......... .......... .......... .......... 238 GGTATGTTCA GCGAGCTTGA GCTCAAAATA GTGATTTATC GAATACATTC AGCGACAGTA 3935 .......... .......... .......... .......... .......... .......... 238 GATATCACCA AATAAGTATG GTATAAAAAT ACCAAGGGCC TTTTAAAATT CAAAAAGTCT 3995 .......... .......... .......... .......... .......... .......... 238 AAAACTTTGA TACTTTAGAG CCTTTAAGAT TCTTTTATCT TGCTTTAAAT AATTTTTAGT 4055 .......... .......... .......... .......... .......... .......... 238 GTTTGTGAGC CTCTTATGAG TGTTTAAAAA TGGATTTTGT CTCAACCCAA CATACGTACC 4115 .......... .......... .......... .......... .......... .......... 238 TCTCAATTTT GGTGACCTTA AGGTTGGTTT TTGAATCCTT GTTCATATTT TTTAGTTTAA 4175 .......... .......... .......... .......... .......... .......... 238 AGTTTTTATA GGGTGTGAGC TTGAGCCCTC TAACTGATTT TGTGTGCCTG ATTTAAATGG 4235 .......... .......... .......... .......... .......... .......... 238 GTAGCATTGT ATTGTGTTAT TGGAACTGTA CCCATGAGTT TTCAATTGTG TATGCCGGTC 4295 .......... .......... .......... .......... .......... .......... 238 AAATAGCGAA ACGTGTGTGT TGGTTAAATT GTGTAATGAT CAGAGGGTTT GATATATTGT 4355 .......... .......... .......... .......... .......... .......... 238 GTTTTAATGA GTTTAAGGGT TCAGTTAATA AAGAATTAAA CAGAAGATTT CATAATATAA 4415 .......... .......... .......... .......... .......... .......... 238 ACTCATACAA ATGTAGGAGT TCATTTAATT ATTTCGCCAA ATTTAAATGG TATCTTCTAA 4475 .......... .......... .......... .......... .......... .......... 238 TTGTTATTCA ACCTGTATTT ATAAGGTTAA CCAATAAACA GAAATAGAAA TAAGATGAAA 4535 ||||| |||| ||| |||||||| |||| ||||| ||||||||| .......... .....TATTT ATAAAGTT-- --AATAAACA GAAACAGAAA TAAGATGAAG 279 CAGAAAAT-A CATAAAATAC AATAAGTAAT CCGAGTCTAC AAAAACTACT ATGTGTCCTT 4594 |||||||| | ||| ||||| || || |||| ||||||| || | |||||||| |||||||||| CAGAAAATAA TATAGAATAC AAGAATTAAT CCGAGTCCAC AGAAACTACT ATGTGTCCTT 339 AAGAAATTTA ATCCCCTCAC TGTACACAAG GTTATGGATT AATTTCTCCC AAGATAAAAT 4654 |||||||||| || ||||||| ||||| |||| |||||||||| |||||||| | ||||||||| AAGAAATTTA ATTCCCTCAC TGTACCCAAG GTTATGGATT AATTTCTCTC AAGATAAAAC 399 GGATTAAACC TGTTAAAGAA ATAGCAGCAC CTCAGATTTC TTTAACTAAA GCGAAATTCA 4714 |||||||||| |||||||||| ||||| | || |||| | ||| ||||||| | |||| || | GGATTAAACC TGTTAAAGAA ATAGCGGTAC CTCAAACTTC TTTAACTTCA ACGAACTTAA 459 GAACAACAAC AAGTCACATA GACTCAGTCG ATCGACACTT TGA-TTTATT TGAGAGAAAA 4773 |||||||||| |||||||| | |||||||||| |||||||||| ||| |||||| |||||||||| GAACAACAAC AAGTCACACA GACTCAGTCG ATCGACACTT TGATTTTATT TGAGAGAAAA 519 ATATATGCAG AG-AAGGAAA ATTTT-AGTG TTTGAAAAAT CAAAAATTGA CTTCCTTTTA 4831 ||| |||||| || ||| ||| ||||| |||| ||| |||||| ||||| |||| |||||||||| ATAAATGCAG AGAAAGAAAA ATTTTCAGTG TTTAAAAAAT CAAAATTTGA CTTCCTTTTA 579 TAGCCATTTT CAGCAAGAAA CGTGT 4856 |||||||||| ||||||| || | ||| TAGCCATTTT CAGCAAGGAA CATGT 604 hqPGS_C06HBa0153O03.1-5+_SGN-E262710- (4491 4856) ******************************************************************************** EST sequence 18 -strand 591 n (File: SGN-E254845-) 1 GACTTGAATT AGTCTTTTTT TTCTGTCACA ATTGGCAGAT AAATTAAGTA TTTAAAATAC 61 TAACAGTAAA GAGCAATTCT TTACACAACC TGTTTCTGAT TTAACGATAA AGTTTTTATG 121 TTTTTTTAAT CGTTAATCGA ATGATTTTAA AAATTCAAGT AGAGTAGAAA ACCATAAAGG 181 TTTTATTCTC CAGAAACCTC AAACACAGAA ACTGTATAAA TTTTCTTAAG ATTGTTACTC 241 TTATAATAAC TTTTATTTAT AAAGTTAATA AACAGAAACA GAAATAAGAT GAAGCAGAAA 301 ATAATATAGA ATACAAGAAT TAATCCGAGT CCACAGAAAC TACTATGTGT CCTTAAGAAA 361 TTTAATTCCC TCACTGGTAC CCAAGGTTAT GGATTAATTT CTCTCAAGAT AAAACGGATT 421 AAACCTGTTA AAGAAATAGC GGTACCTCAA ACTTCTTTAA CTTCAACGAA CTTAAGAACA 481 ACAACAAGTC ACACAGACTC AGTCGATCGA CACTTTGATT TTATTTGAGA GAAAAATAAA 541 TGCACCGAAA GCCAAATTTT CAGTGTTTAA AAAATCAAAA TTTGACTTCC T Predicted gene structure (within gDNA segment 7 to 5906): Exon 1 876 891 ( 16 n); cDNA 238 253 ( 16 n); score: 0.812 Intron 1 892 4490 (3599 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.86) Exon 2 4491 4827 ( 337 n); cDNA 254 591 ( 338 n); score: 0.862 MATCH C06HBa0153O03.1-5+ SGN-E254845- 0.862 353 0.597 C PGS_C06HBa0153O03.1-5+_SGN-E254845- (876 891,4491 4827) Alignment (genomic DNA sequence = upper lines): CTCTAATAGA AACTTTGTCA CGACCGGCAT CTAGACCTCA TAAGAGACCA GCGTCGATGA 935 |||| ||| |||||| CTCTTATAAT AACTTT.... .......... .......... .......... .......... 253 CCTCTCAGAG GTCGCAGACA AGCCTACTTA CGTCATTCTT ACTTTACATA GGTTAATTTT 995 .......... .......... .......... .......... .......... .......... 253 AGCGGAAAAT TTTTGTTTAT AAACATACTT ATAAAACGAT TCTTATTAAT ACTAATTGTT 1055 .......... .......... .......... .......... .......... .......... 253 CACCATCAAC CATATAACAT TAGGAGATAA GCAACAGAAA TAAATAGTTC AATTCTTATT 1115 .......... .......... .......... .......... .......... .......... 253 AAGTGCATAA TTGCCAAAAC GGCACCAATA CATAGTCTGA ATAAAAACAG GAAGGAAACG 1175 .......... .......... .......... .......... .......... .......... 253 CTAGTGGAAC ATGCTCCACT AGCTCAACTC TAAAACTAAG CTAGAATATA AAACAGTGGC 1235 .......... .......... .......... .......... .......... .......... 253 ATCCTCGAAA GCATGACGAC CTACCAACTC CGAACGAATG CTCGACGTTT GGATAATTGC 1295 .......... .......... .......... .......... .......... .......... 253 AACGATGATC TTGTAGCTCT ATCGTCCATC TGTGTCTGCA CCTAAAAATA GTAGAGTTTG 1355 .......... .......... .......... .......... .......... .......... 253 TATAGGGTTA GTACACACTT TTAATAAGTA TGGGTATATG CAAGAACACA CCACGAATAT 1415 .......... .......... .......... .......... .......... .......... 253 GCATGAGAAA GAATAACTCT TTCTTAACAA CATGACTTTT TGGAAGTCAA GTCAGTGGAC 1475 .......... .......... .......... .......... .......... .......... 253 TTGCCAAATT TAGATTAGGA GAGTTACCAA ATTTGGAATA GGAAAGTCAA TGAGCTTTCC 1535 .......... .......... .......... .......... .......... .......... 253 AAATTTGGAA TAGGAAAGTC AGTAAGCTTT CCTGATTTGG AATAGGCTAG TTATGCCATG 1595 .......... .......... .......... .......... .......... .......... 253 AGTTTAACAC ACATCATCAT ACTTTGCACC TTTGCACACA CCACATAACA TTTACACATA 1655 .......... .......... .......... .......... .......... .......... 253 GCACATATCA TATAGCACAC TGCACAATTT GCATGAAGCA CATATTTTCT TTAATATCAT 1715 .......... .......... .......... .......... .......... .......... 253 TCATTCATAT GCCATAAGAC CTTTGGATCA TGGACTTAAT GTTAAGACAT CCCATAAATG 1775 .......... .......... .......... .......... .......... .......... 253 AGGTCTCAAT AGATGGGACC TCAACTAGAG AGTCTTCATT AGCAAACACA GAATCTGTTT 1835 .......... .......... .......... .......... .......... .......... 253 CGTTCATTCA TACGTACTCC ATTTCATTTC ATTCATAGGC CAGTATAAAC ACCAGCTCTA 1895 .......... .......... .......... .......... .......... .......... 253 CCTAGGATGT AGTTTTAGAC TTTCATTAAA TTCGTCATGA AATGACCAAG AATGACCTAA 1955 .......... .......... .......... .......... .......... .......... 253 TGTCATTACT TGAATCTAAC TCACCTTTTG ATTACCCTAT CCTAATACCT TTGCTATCAT 2015 .......... .......... .......... .......... .......... .......... 253 TCATTTCATT ATGTGCATCA TTTGAGGCTG GCCTCATTCT TTCATTGGAA ACTTTTACTT 2075 .......... .......... .......... .......... .......... .......... 253 TAACCGAAAT AGATCATGTG AGAATCAAAA AAATCCAGTG TCTCCCCCAC ACTAAAAATA 2135 .......... .......... .......... .......... .......... .......... 253 GGTGGAGTCA CCGGCCAAAG TGACCTAAAA CATGCTAGTG TACATGGTAA TTTAACCCCA 2195 .......... .......... .......... .......... .......... .......... 253 CCAGATCCCT ATATTGGTTG TATAGGTTCG AAGACTAAGA GATATAAGGA GACCTATACC 2255 .......... .......... .......... .......... .......... .......... 253 CAGCAAGCCG AAAAGTCTCA TCTCATGAGA GTTACGTGAA CCTCTTCCCT TCCCGAAAGA 2315 .......... .......... .......... .......... .......... .......... 253 AGGCATCACC GCTCATAGCC ATCCTAGCGG TGCTCAGTAT AAAGTTCCAT TTCATTCATC 2375 .......... .......... .......... .......... .......... .......... 253 TCATACGTCA TGGAAGTTGG CTTCTAGTAT GAGGACATCA TAGCTCATTA GATGATTTCT 2435 .......... .......... .......... .......... .......... .......... 253 CCATCTCATC ATTAGTATTA AGTGTGTTAG GCTCAATACT TTCATTAGAG TGTTCATAGA 2495 .......... .......... .......... .......... .......... .......... 253 GACTGGTCTC TTCATTATCT TACACTCTCA TAGATGAGTA CGTTTTGGTA ACATTTACTC 2555 .......... .......... .......... .......... .......... .......... 253 GGGCTCATTT AATATTACTT TCATCATATA CATTAGCCTC ATATCATGTT GTCACCACAT 2615 .......... .......... .......... .......... .......... .......... 253 TCCTTAACAT TAGCACCTTT GCTTTTCATA GTTACTCACC TCTTACTACG TGAATGTTCC 2675 .......... .......... .......... .......... .......... .......... 253 TTTCATCATA TGTCAATTTT TGGCCGTTTA ATGTGTTTTA CACCCTTACT TACCCCTTCT 2735 .......... .......... .......... .......... .......... .......... 253 AGGGTTTCAT AATTTCATTA TTCATTACAT AATTTCATTG TTAATTTCAT CATATGCCAA 2795 .......... .......... .......... .......... .......... .......... 253 TGCACTTGGA CATTTAGTGT ATTTTACACT CATACTTAAC CTTCTAGGAT TCATCATTTC 2855 .......... .......... .......... .......... .......... .......... 253 CTTACATACA TCTTAGGTTC ACTTACTTTT GAACGTATGC TAGACTTATG AATCTATACA 2915 .......... .......... .......... .......... .......... .......... 253 CACAAGACAT GGGGCTTCAT TCATAAATTT TTAAGTGATT CATACATAGG GAGACTAAGT 2975 .......... .......... .......... .......... .......... .......... 253 CTCGACCCAC AACCCCCACC TACGAGGCGT GGTTCCACCC AAGGATCGTG CGCACATTCG 3035 .......... .......... .......... .......... .......... .......... 253 CAGGTCATGT TGTGTTTTGA CAGCTTTTTG GGCCTTCTTT TGGGTCCTCC TCAAGGACCC 3095 .......... .......... .......... .......... .......... .......... 253 TTGGGAGGTC CTTAGGGTCA CGCCTTGATG TTTAGGTCCT TAAACATTAA TACTAAGTAG 3155 .......... .......... .......... .......... .......... .......... 253 GGGAGGTTAT TTTATGTCTC TTATCTTTAC GTTCAACCTC ATTTAGGACA CTAGACTACA 3215 .......... .......... .......... .......... .......... .......... 253 TGTTAGGCTC TAACTTAGGT CATTAGATTT TTGGGGTGTT ACAAATCATC TGCTCAAATA 3275 .......... .......... .......... .......... .......... .......... 253 ACCAGTTAGG CCAGAAGGAA GTTAGACTGA AGATATGCTT GAAAAGATCT ATAACAAGGT 3335 .......... .......... .......... .......... .......... .......... 253 TTAAGGGTCT GATAAATTGT TGAAGGAATT CAAAAATGAC TTATCCACTC TATTCCCAGA 3395 .......... .......... .......... .......... .......... .......... 253 CGATGACTTC TAACACAGTT TCCATTAAGT AACTAGAGAC CATTCTAGGA CAAATTGGCA 3455 .......... .......... .......... .......... .......... .......... 253 CTCTTCACAA GCAAAGGCAA ATGGAAACAT TTCCTAGAAA TACCATCCCA AACCCCAATA 3515 .......... .......... .......... .......... .......... .......... 253 ACTATTGTTC AGTTAAAGAA TTTTGTCGTC CCATAACATT AAATAAGCAA TCGCTTAGGG 3575 .......... .......... .......... .......... .......... .......... 253 GACACCAAAG TTTTTACCTT TGTTTTTATT GCTTTTAAAT AACATGTGTT CTGAGTGCAG 3635 .......... .......... .......... .......... .......... .......... 253 GTTGAGAAGA GGTGGAAAAA TTGAAAAGAG AAGCATAGTC AATGCCAAGA CCCATAAAAT 3695 .......... .......... .......... .......... .......... .......... 253 GATATGTCGA AGGCATTGTG CGGACAGCGG ACTAATTGTT GTCTACCTTT TGAGCTTGAA 3755 .......... .......... .......... .......... .......... .......... 253 TAGACATAAA ATGTGCTTGC CAAAATTTGC AATCAGAACT CCAAAAAGGA ATTTATGGAA 3815 .......... .......... .......... .......... .......... .......... 253 GTGCAGTAAG GAGACATAGT CGGGTCGGCC AAAGCAACCT TCGGGTTTAG TAATTGTTAA 3875 .......... .......... .......... .......... .......... .......... 253 GGTATGTTCA GCGAGCTTGA GCTCAAAATA GTGATTTATC GAATACATTC AGCGACAGTA 3935 .......... .......... .......... .......... .......... .......... 253 GATATCACCA AATAAGTATG GTATAAAAAT ACCAAGGGCC TTTTAAAATT CAAAAAGTCT 3995 .......... .......... .......... .......... .......... .......... 253 AAAACTTTGA TACTTTAGAG CCTTTAAGAT TCTTTTATCT TGCTTTAAAT AATTTTTAGT 4055 .......... .......... .......... .......... .......... .......... 253 GTTTGTGAGC CTCTTATGAG TGTTTAAAAA TGGATTTTGT CTCAACCCAA CATACGTACC 4115 .......... .......... .......... .......... .......... .......... 253 TCTCAATTTT GGTGACCTTA AGGTTGGTTT TTGAATCCTT GTTCATATTT TTTAGTTTAA 4175 .......... .......... .......... .......... .......... .......... 253 AGTTTTTATA GGGTGTGAGC TTGAGCCCTC TAACTGATTT TGTGTGCCTG ATTTAAATGG 4235 .......... .......... .......... .......... .......... .......... 253 GTAGCATTGT ATTGTGTTAT TGGAACTGTA CCCATGAGTT TTCAATTGTG TATGCCGGTC 4295 .......... .......... .......... .......... .......... .......... 253 AAATAGCGAA ACGTGTGTGT TGGTTAAATT GTGTAATGAT CAGAGGGTTT GATATATTGT 4355 .......... .......... .......... .......... .......... .......... 253 GTTTTAATGA GTTTAAGGGT TCAGTTAATA AAGAATTAAA CAGAAGATTT CATAATATAA 4415 .......... .......... .......... .......... .......... .......... 253 ACTCATACAA ATGTAGGAGT TCATTTAATT ATTTCGCCAA ATTTAAATGG TATCTTCTAA 4475 .......... .......... .......... .......... .......... .......... 253 TTGTTATTCA ACCTGTATTT ATAAGGTTAA CCAATAAACA GAAATAGAAA TAAGATGAAA 4535 ||||| |||| ||| |||||||| |||| ||||| ||||||||| .......... .....TATTT ATAAAGTT-- --AATAAACA GAAACAGAAA TAAGATGAAG 294 CAGAAAAT-A CATAAAATAC AATAAGTAAT CCGAGTCTAC AAAAACTACT ATGTGTCCTT 4594 |||||||| | ||| ||||| || || |||| ||||||| || | |||||||| |||||||||| CAGAAAATAA TATAGAATAC AAGAATTAAT CCGAGTCCAC AGAAACTACT ATGTGTCCTT 354 AAGAAATTTA ATCCCCTCAC T-GTACACAA GGTTATGGAT TAATTTCTCC CAAGATAAAA 4653 |||||||||| || ||||||| | |||| ||| |||||||||| ||||||||| |||||||||| AAGAAATTTA ATTCCCTCAC TGGTACCCAA GGTTATGGAT TAATTTCTCT CAAGATAAAA 414 TGGATTAAAC CTGTTAAAGA AATAGCAGCA CCTCAGATTT CTTTAACTAA AGCGAAATTC 4713 ||||||||| |||||||||| |||||| | | ||||| | || |||||||| | |||| || CGGATTAAAC CTGTTAAAGA AATAGCGGTA CCTCAAACTT CTTTAACTTC AACGAACTTA 474 AGAACAACAA CAAGTCACAT AGACTCAGTC GATCGACACT TTGA-TTTAT TTGAGAGAAA 4772 |||||||||| ||||||||| |||||||||| |||||||||| |||| ||||| |||||||||| AGAACAACAA CAAGTCACAC AGACTCAGTC GATCGACACT TTGATTTTAT TTGAGAGAAA 534 AATATATGCA GAG-AAGGAA AATTTT-AGT GTTTGAAAAA TCAAAAATTG ACTTCCT 4827 |||| ||||| | ||| | |||||| ||| |||| ||||| |||||| ||| ||||||| AATAAATGCA CCGAAAGCCA AATTTTCAGT GTTTAAAAAA TCAAAATTTG ACTTCCT 591 hqPGS_C06HBa0153O03.1-5+_SGN-E254845- (4491 4827) ******************************************************************************** EST sequence 26 -strand 653 n (File: SGN-E273518-) 1 GCAGCATGTG AACGATTATG TGCAACCATT GTTGTTGCAG CACTTACTGT TCCAGCATTT 61 GTTTGACTTG AATTAGTCTT TTTTTTCTGT CACAATTGGC AGATAAATTA AGTATTTAAA 121 ATACTAACAG TAAAGAGCAA ATCTTTACAC AACCTGTTTC TGATTTAACG ATAAAGTTTT 181 TATGTTTTTT TAATCGTTAA TCGAATGATT TTAAAAATTC AAGTAGAGTA GAAAACCATA 241 AAGGTTTTAA TCTCCAGAAA CCTCAAACAC AGAAACTGTA TAAATTTTCT TAAGATTGTT 301 ACTCTTATAA TAACTTTTAT TTATAAAGTT AATAAACAGA AACAGAAATA AGATGAAGCA 361 GAAAATAATA TAGAATACAA GAATTAATCC GAGTCCACAG AAACTACTAT GTGTCCTTAA 421 GAAATTTAAT TCCCTCACTG TACCCAAGGT TATGGATTAA TTTCTCTCAA GATAAAACGG 481 ATTAAACCTG TTAAAGAAAT AGCGGTACCT CAAACTTCTT TAACTTCAAC GAACTTAAGA 541 ACAACAACAA GTCACACAGA CTCAGTCGAT CGACACTTTG ATTTTATTTG AGAGAAAAAT 601 AAATGCAGAG AAAGAAAAAT TTTCAGTGTT TAAAAAATCA AAATTTGACT TCC Predicted gene structure (within gDNA segment 1 to 5896): Exon 1 876 891 ( 16 n); cDNA 302 317 ( 16 n); score: 0.812 Intron 1 892 4490 (3599 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.86) Exon 2 4491 4826 ( 336 n); cDNA 318 653 ( 336 n); score: 0.878 MATCH C06HBa0153O03.1-5+ SGN-E273518- 0.878 352 0.539 C PGS_C06HBa0153O03.1-5+_SGN-E273518- (876 891,4491 4826) Alignment (genomic DNA sequence = upper lines): CTCTAATAGA AACTTTGTCA CGACCGGCAT CTAGACCTCA TAAGAGACCA GCGTCGATGA 935 |||| ||| |||||| CTCTTATAAT AACTTT.... .......... .......... .......... .......... 317 CCTCTCAGAG GTCGCAGACA AGCCTACTTA CGTCATTCTT ACTTTACATA GGTTAATTTT 995 .......... .......... .......... .......... .......... .......... 317 AGCGGAAAAT TTTTGTTTAT AAACATACTT ATAAAACGAT TCTTATTAAT ACTAATTGTT 1055 .......... .......... .......... .......... .......... .......... 317 CACCATCAAC CATATAACAT TAGGAGATAA GCAACAGAAA TAAATAGTTC AATTCTTATT 1115 .......... .......... .......... .......... .......... .......... 317 AAGTGCATAA TTGCCAAAAC GGCACCAATA CATAGTCTGA ATAAAAACAG GAAGGAAACG 1175 .......... .......... .......... .......... .......... .......... 317 CTAGTGGAAC ATGCTCCACT AGCTCAACTC TAAAACTAAG CTAGAATATA AAACAGTGGC 1235 .......... .......... .......... .......... .......... .......... 317 ATCCTCGAAA GCATGACGAC CTACCAACTC CGAACGAATG CTCGACGTTT GGATAATTGC 1295 .......... .......... .......... .......... .......... .......... 317 AACGATGATC TTGTAGCTCT ATCGTCCATC TGTGTCTGCA CCTAAAAATA GTAGAGTTTG 1355 .......... .......... .......... .......... .......... .......... 317 TATAGGGTTA GTACACACTT TTAATAAGTA TGGGTATATG CAAGAACACA CCACGAATAT 1415 .......... .......... .......... .......... .......... .......... 317 GCATGAGAAA GAATAACTCT TTCTTAACAA CATGACTTTT TGGAAGTCAA GTCAGTGGAC 1475 .......... .......... .......... .......... .......... .......... 317 TTGCCAAATT TAGATTAGGA GAGTTACCAA ATTTGGAATA GGAAAGTCAA TGAGCTTTCC 1535 .......... .......... .......... .......... .......... .......... 317 AAATTTGGAA TAGGAAAGTC AGTAAGCTTT CCTGATTTGG AATAGGCTAG TTATGCCATG 1595 .......... .......... .......... .......... .......... .......... 317 AGTTTAACAC ACATCATCAT ACTTTGCACC TTTGCACACA CCACATAACA TTTACACATA 1655 .......... .......... .......... .......... .......... .......... 317 GCACATATCA TATAGCACAC TGCACAATTT GCATGAAGCA CATATTTTCT TTAATATCAT 1715 .......... .......... .......... .......... .......... .......... 317 TCATTCATAT GCCATAAGAC CTTTGGATCA TGGACTTAAT GTTAAGACAT CCCATAAATG 1775 .......... .......... .......... .......... .......... .......... 317 AGGTCTCAAT AGATGGGACC TCAACTAGAG AGTCTTCATT AGCAAACACA GAATCTGTTT 1835 .......... .......... .......... .......... .......... .......... 317 CGTTCATTCA TACGTACTCC ATTTCATTTC ATTCATAGGC CAGTATAAAC ACCAGCTCTA 1895 .......... .......... .......... .......... .......... .......... 317 CCTAGGATGT AGTTTTAGAC TTTCATTAAA TTCGTCATGA AATGACCAAG AATGACCTAA 1955 .......... .......... .......... .......... .......... .......... 317 TGTCATTACT TGAATCTAAC TCACCTTTTG ATTACCCTAT CCTAATACCT TTGCTATCAT 2015 .......... .......... .......... .......... .......... .......... 317 TCATTTCATT ATGTGCATCA TTTGAGGCTG GCCTCATTCT TTCATTGGAA ACTTTTACTT 2075 .......... .......... .......... .......... .......... .......... 317 TAACCGAAAT AGATCATGTG AGAATCAAAA AAATCCAGTG TCTCCCCCAC ACTAAAAATA 2135 .......... .......... .......... .......... .......... .......... 317 GGTGGAGTCA CCGGCCAAAG TGACCTAAAA CATGCTAGTG TACATGGTAA TTTAACCCCA 2195 .......... .......... .......... .......... .......... .......... 317 CCAGATCCCT ATATTGGTTG TATAGGTTCG AAGACTAAGA GATATAAGGA GACCTATACC 2255 .......... .......... .......... .......... .......... .......... 317 CAGCAAGCCG AAAAGTCTCA TCTCATGAGA GTTACGTGAA CCTCTTCCCT TCCCGAAAGA 2315 .......... .......... .......... .......... .......... .......... 317 AGGCATCACC GCTCATAGCC ATCCTAGCGG TGCTCAGTAT AAAGTTCCAT TTCATTCATC 2375 .......... .......... .......... .......... .......... .......... 317 TCATACGTCA TGGAAGTTGG CTTCTAGTAT GAGGACATCA TAGCTCATTA GATGATTTCT 2435 .......... .......... .......... .......... .......... .......... 317 CCATCTCATC ATTAGTATTA AGTGTGTTAG GCTCAATACT TTCATTAGAG TGTTCATAGA 2495 .......... .......... .......... .......... .......... .......... 317 GACTGGTCTC TTCATTATCT TACACTCTCA TAGATGAGTA CGTTTTGGTA ACATTTACTC 2555 .......... .......... .......... .......... .......... .......... 317 GGGCTCATTT AATATTACTT TCATCATATA CATTAGCCTC ATATCATGTT GTCACCACAT 2615 .......... .......... .......... .......... .......... .......... 317 TCCTTAACAT TAGCACCTTT GCTTTTCATA GTTACTCACC TCTTACTACG TGAATGTTCC 2675 .......... .......... .......... .......... .......... .......... 317 TTTCATCATA TGTCAATTTT TGGCCGTTTA ATGTGTTTTA CACCCTTACT TACCCCTTCT 2735 .......... .......... .......... .......... .......... .......... 317 AGGGTTTCAT AATTTCATTA TTCATTACAT AATTTCATTG TTAATTTCAT CATATGCCAA 2795 .......... .......... .......... .......... .......... .......... 317 TGCACTTGGA CATTTAGTGT ATTTTACACT CATACTTAAC CTTCTAGGAT TCATCATTTC 2855 .......... .......... .......... .......... .......... .......... 317 CTTACATACA TCTTAGGTTC ACTTACTTTT GAACGTATGC TAGACTTATG AATCTATACA 2915 .......... .......... .......... .......... .......... .......... 317 CACAAGACAT GGGGCTTCAT TCATAAATTT TTAAGTGATT CATACATAGG GAGACTAAGT 2975 .......... .......... .......... .......... .......... .......... 317 CTCGACCCAC AACCCCCACC TACGAGGCGT GGTTCCACCC AAGGATCGTG CGCACATTCG 3035 .......... .......... .......... .......... .......... .......... 317 CAGGTCATGT TGTGTTTTGA CAGCTTTTTG GGCCTTCTTT TGGGTCCTCC TCAAGGACCC 3095 .......... .......... .......... .......... .......... .......... 317 TTGGGAGGTC CTTAGGGTCA CGCCTTGATG TTTAGGTCCT TAAACATTAA TACTAAGTAG 3155 .......... .......... .......... .......... .......... .......... 317 GGGAGGTTAT TTTATGTCTC TTATCTTTAC GTTCAACCTC ATTTAGGACA CTAGACTACA 3215 .......... .......... .......... .......... .......... .......... 317 TGTTAGGCTC TAACTTAGGT CATTAGATTT TTGGGGTGTT ACAAATCATC TGCTCAAATA 3275 .......... .......... .......... .......... .......... .......... 317 ACCAGTTAGG CCAGAAGGAA GTTAGACTGA AGATATGCTT GAAAAGATCT ATAACAAGGT 3335 .......... .......... .......... .......... .......... .......... 317 TTAAGGGTCT GATAAATTGT TGAAGGAATT CAAAAATGAC TTATCCACTC TATTCCCAGA 3395 .......... .......... .......... .......... .......... .......... 317 CGATGACTTC TAACACAGTT TCCATTAAGT AACTAGAGAC CATTCTAGGA CAAATTGGCA 3455 .......... .......... .......... .......... .......... .......... 317 CTCTTCACAA GCAAAGGCAA ATGGAAACAT TTCCTAGAAA TACCATCCCA AACCCCAATA 3515 .......... .......... .......... .......... .......... .......... 317 ACTATTGTTC AGTTAAAGAA TTTTGTCGTC CCATAACATT AAATAAGCAA TCGCTTAGGG 3575 .......... .......... .......... .......... .......... .......... 317 GACACCAAAG TTTTTACCTT TGTTTTTATT GCTTTTAAAT AACATGTGTT CTGAGTGCAG 3635 .......... .......... .......... .......... .......... .......... 317 GTTGAGAAGA GGTGGAAAAA TTGAAAAGAG AAGCATAGTC AATGCCAAGA CCCATAAAAT 3695 .......... .......... .......... .......... .......... .......... 317 GATATGTCGA AGGCATTGTG CGGACAGCGG ACTAATTGTT GTCTACCTTT TGAGCTTGAA 3755 .......... .......... .......... .......... .......... .......... 317 TAGACATAAA ATGTGCTTGC CAAAATTTGC AATCAGAACT CCAAAAAGGA ATTTATGGAA 3815 .......... .......... .......... .......... .......... .......... 317 GTGCAGTAAG GAGACATAGT CGGGTCGGCC AAAGCAACCT TCGGGTTTAG TAATTGTTAA 3875 .......... .......... .......... .......... .......... .......... 317 GGTATGTTCA GCGAGCTTGA GCTCAAAATA GTGATTTATC GAATACATTC AGCGACAGTA 3935 .......... .......... .......... .......... .......... .......... 317 GATATCACCA AATAAGTATG GTATAAAAAT ACCAAGGGCC TTTTAAAATT CAAAAAGTCT 3995 .......... .......... .......... .......... .......... .......... 317 AAAACTTTGA TACTTTAGAG CCTTTAAGAT TCTTTTATCT TGCTTTAAAT AATTTTTAGT 4055 .......... .......... .......... .......... .......... .......... 317 GTTTGTGAGC CTCTTATGAG TGTTTAAAAA TGGATTTTGT CTCAACCCAA CATACGTACC 4115 .......... .......... .......... .......... .......... .......... 317 TCTCAATTTT GGTGACCTTA AGGTTGGTTT TTGAATCCTT GTTCATATTT TTTAGTTTAA 4175 .......... .......... .......... .......... .......... .......... 317 AGTTTTTATA GGGTGTGAGC TTGAGCCCTC TAACTGATTT TGTGTGCCTG ATTTAAATGG 4235 .......... .......... .......... .......... .......... .......... 317 GTAGCATTGT ATTGTGTTAT TGGAACTGTA CCCATGAGTT TTCAATTGTG TATGCCGGTC 4295 .......... .......... .......... .......... .......... .......... 317 AAATAGCGAA ACGTGTGTGT TGGTTAAATT GTGTAATGAT CAGAGGGTTT GATATATTGT 4355 .......... .......... .......... .......... .......... .......... 317 GTTTTAATGA GTTTAAGGGT TCAGTTAATA AAGAATTAAA CAGAAGATTT CATAATATAA 4415 .......... .......... .......... .......... .......... .......... 317 ACTCATACAA ATGTAGGAGT TCATTTAATT ATTTCGCCAA ATTTAAATGG TATCTTCTAA 4475 .......... .......... .......... .......... .......... .......... 317 TTGTTATTCA ACCTGTATTT ATAAGGTTAA CCAATAAACA GAAATAGAAA TAAGATGAAA 4535 ||||| |||| ||| |||||||| |||| ||||| ||||||||| .......... .....TATTT ATAAAGTT-- --AATAAACA GAAACAGAAA TAAGATGAAG 358 CAGAAAAT-A CATAAAATAC AATAAGTAAT CCGAGTCTAC AAAAACTACT ATGTGTCCTT 4594 |||||||| | ||| ||||| || || |||| ||||||| || | |||||||| |||||||||| CAGAAAATAA TATAGAATAC AAGAATTAAT CCGAGTCCAC AGAAACTACT ATGTGTCCTT 418 AAGAAATTTA ATCCCCTCAC TGTACACAAG GTTATGGATT AATTTCTCCC AAGATAAAAT 4654 |||||||||| || ||||||| ||||| |||| |||||||||| |||||||| | ||||||||| AAGAAATTTA ATTCCCTCAC TGTACCCAAG GTTATGGATT AATTTCTCTC AAGATAAAAC 478 GGATTAAACC TGTTAAAGAA ATAGCAGCAC CTCAGATTTC TTTAACTAAA GCGAAATTCA 4714 |||||||||| |||||||||| ||||| | || |||| | ||| ||||||| | |||| || | GGATTAAACC TGTTAAAGAA ATAGCGGTAC CTCAAACTTC TTTAACTTCA ACGAACTTAA 538 GAACAACAAC AAGTCACATA GACTCAGTCG ATCGACACTT TGA-TTTATT TGAGAGAAAA 4773 |||||||||| |||||||| | |||||||||| |||||||||| ||| |||||| |||||||||| GAACAACAAC AAGTCACACA GACTCAGTCG ATCGACACTT TGATTTTATT TGAGAGAAAA 598 ATATATGCAG AG-AAGGAAA ATTTT-AGTG TTTGAAAAAT CAAAAATTGA CTTCC 4826 ||| |||||| || ||| ||| ||||| |||| ||| |||||| ||||| |||| ||||| ATAAATGCAG AGAAAGAAAA ATTTTCAGTG TTTAAAAAAT CAAAATTTGA CTTCC 653 hqPGS_C06HBa0153O03.1-5+_SGN-E273518- (4491 4826) ******************************************************************************** EST sequence 5 -strand 707 n (File: SGN-E263584-) 1 CGTTCATCAG CCGAAGTTTC ATCTGACATA ACAGGAACAT TCTCATTGAT GAACTTCTGC 61 AGACTCAACG TAGTGAGATA GAAGAACATC TTTTGCTGCC ATCTCTTAAA GTCGACTCCA 121 GAAAACTTTG CAGGTTTCTC AGCCGGTGCT AAGGCAGCAT GTGAACGATT ATGTGCAACC 181 ATTGTTGTTG CAGCACTTAC TGTTCCAGCA TTTGTTTGAC TTGAATTAGT CTTTTTTTTC 241 TGTCACAATT GGCAGATAAA TTAAGTATTT AAAATACTAA CAGTAAAGAG CAAATCTTTA 301 CACAACCTGT TTCTGATTTA ACGATAAAGT TTTTATGTTT TTTTAATCGT TAATCGAATG 361 ATTTTAAAAA TTCAAGTAGA GTAGAAAACC ATAAAGGTTT TAATCTCCAG AAACCTCAAA 421 CACAGAAACT GTATAAATTT TCTTAAGATT GTTACTCTTA TAATAACTTT TATTTATAAA 481 GTTAATAAAC AGAAACAGAA ATAAGATGAA GCAGAAAATA ATATAGAATA CAAGAATTAA 541 TCCGAGTCCA CAGAAACTAC TATGTGTCCT TAAGAAATTT AATTCCCTCA CTGTACCCAA 601 GGTTATGGAT TAATTTCTCT CAAGATAAAA CGGATTAAAC CTGTTAAAGA AATAGCGGTA 661 CCTCAAACTT CTTTAACTTC AACGAACTTA AGAACAACAA CAAGTCA Predicted gene structure (within gDNA segment 1 to 5789): Exon 1 876 891 ( 16 n); cDNA 455 470 ( 16 n); score: 0.812 Intron 1 892 4490 (3599 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.86) Exon 2 4491 4730 ( 240 n); cDNA 471 707 ( 237 n); score: 0.881 MATCH C06HBa0153O03.1-5+ SGN-E263584- 0.881 256 0.362 C PGS_C06HBa0153O03.1-5+_SGN-E263584- (876 891,4491 4730) Alignment (genomic DNA sequence = upper lines): CTCTAATAGA AACTTTGTCA CGACCGGCAT CTAGACCTCA TAAGAGACCA GCGTCGATGA 935 |||| ||| |||||| CTCTTATAAT AACTTT.... .......... .......... .......... .......... 470 CCTCTCAGAG GTCGCAGACA AGCCTACTTA CGTCATTCTT ACTTTACATA GGTTAATTTT 995 .......... .......... .......... .......... .......... .......... 470 AGCGGAAAAT TTTTGTTTAT AAACATACTT ATAAAACGAT TCTTATTAAT ACTAATTGTT 1055 .......... .......... .......... .......... .......... .......... 470 CACCATCAAC CATATAACAT TAGGAGATAA GCAACAGAAA TAAATAGTTC AATTCTTATT 1115 .......... .......... .......... .......... .......... .......... 470 AAGTGCATAA TTGCCAAAAC GGCACCAATA CATAGTCTGA ATAAAAACAG GAAGGAAACG 1175 .......... .......... .......... .......... .......... .......... 470 CTAGTGGAAC ATGCTCCACT AGCTCAACTC TAAAACTAAG CTAGAATATA AAACAGTGGC 1235 .......... .......... .......... .......... .......... .......... 470 ATCCTCGAAA GCATGACGAC CTACCAACTC CGAACGAATG CTCGACGTTT GGATAATTGC 1295 .......... .......... .......... .......... .......... .......... 470 AACGATGATC TTGTAGCTCT ATCGTCCATC TGTGTCTGCA CCTAAAAATA GTAGAGTTTG 1355 .......... .......... .......... .......... .......... .......... 470 TATAGGGTTA GTACACACTT TTAATAAGTA TGGGTATATG CAAGAACACA CCACGAATAT 1415 .......... .......... .......... .......... .......... .......... 470 GCATGAGAAA GAATAACTCT TTCTTAACAA CATGACTTTT TGGAAGTCAA GTCAGTGGAC 1475 .......... .......... .......... .......... .......... .......... 470 TTGCCAAATT TAGATTAGGA GAGTTACCAA ATTTGGAATA GGAAAGTCAA TGAGCTTTCC 1535 .......... .......... .......... .......... .......... .......... 470 AAATTTGGAA TAGGAAAGTC AGTAAGCTTT CCTGATTTGG AATAGGCTAG TTATGCCATG 1595 .......... .......... .......... .......... .......... .......... 470 AGTTTAACAC ACATCATCAT ACTTTGCACC TTTGCACACA CCACATAACA TTTACACATA 1655 .......... .......... .......... .......... .......... .......... 470 GCACATATCA TATAGCACAC TGCACAATTT GCATGAAGCA CATATTTTCT TTAATATCAT 1715 .......... .......... .......... .......... .......... .......... 470 TCATTCATAT GCCATAAGAC CTTTGGATCA TGGACTTAAT GTTAAGACAT CCCATAAATG 1775 .......... .......... .......... .......... .......... .......... 470 AGGTCTCAAT AGATGGGACC TCAACTAGAG AGTCTTCATT AGCAAACACA GAATCTGTTT 1835 .......... .......... .......... .......... .......... .......... 470 CGTTCATTCA TACGTACTCC ATTTCATTTC ATTCATAGGC CAGTATAAAC ACCAGCTCTA 1895 .......... .......... .......... .......... .......... .......... 470 CCTAGGATGT AGTTTTAGAC TTTCATTAAA TTCGTCATGA AATGACCAAG AATGACCTAA 1955 .......... .......... .......... .......... .......... .......... 470 TGTCATTACT TGAATCTAAC TCACCTTTTG ATTACCCTAT CCTAATACCT TTGCTATCAT 2015 .......... .......... .......... .......... .......... .......... 470 TCATTTCATT ATGTGCATCA TTTGAGGCTG GCCTCATTCT TTCATTGGAA ACTTTTACTT 2075 .......... .......... .......... .......... .......... .......... 470 TAACCGAAAT AGATCATGTG AGAATCAAAA AAATCCAGTG TCTCCCCCAC ACTAAAAATA 2135 .......... .......... .......... .......... .......... .......... 470 GGTGGAGTCA CCGGCCAAAG TGACCTAAAA CATGCTAGTG TACATGGTAA TTTAACCCCA 2195 .......... .......... .......... .......... .......... .......... 470 CCAGATCCCT ATATTGGTTG TATAGGTTCG AAGACTAAGA GATATAAGGA GACCTATACC 2255 .......... .......... .......... .......... .......... .......... 470 CAGCAAGCCG AAAAGTCTCA TCTCATGAGA GTTACGTGAA CCTCTTCCCT TCCCGAAAGA 2315 .......... .......... .......... .......... .......... .......... 470 AGGCATCACC GCTCATAGCC ATCCTAGCGG TGCTCAGTAT AAAGTTCCAT TTCATTCATC 2375 .......... .......... .......... .......... .......... .......... 470 TCATACGTCA TGGAAGTTGG CTTCTAGTAT GAGGACATCA TAGCTCATTA GATGATTTCT 2435 .......... .......... .......... .......... .......... .......... 470 CCATCTCATC ATTAGTATTA AGTGTGTTAG GCTCAATACT TTCATTAGAG TGTTCATAGA 2495 .......... .......... .......... .......... .......... .......... 470 GACTGGTCTC TTCATTATCT TACACTCTCA TAGATGAGTA CGTTTTGGTA ACATTTACTC 2555 .......... .......... .......... .......... .......... .......... 470 GGGCTCATTT AATATTACTT TCATCATATA CATTAGCCTC ATATCATGTT GTCACCACAT 2615 .......... .......... .......... .......... .......... .......... 470 TCCTTAACAT TAGCACCTTT GCTTTTCATA GTTACTCACC TCTTACTACG TGAATGTTCC 2675 .......... .......... .......... .......... .......... .......... 470 TTTCATCATA TGTCAATTTT TGGCCGTTTA ATGTGTTTTA CACCCTTACT TACCCCTTCT 2735 .......... .......... .......... .......... .......... .......... 470 AGGGTTTCAT AATTTCATTA TTCATTACAT AATTTCATTG TTAATTTCAT CATATGCCAA 2795 .......... .......... .......... .......... .......... .......... 470 TGCACTTGGA CATTTAGTGT ATTTTACACT CATACTTAAC CTTCTAGGAT TCATCATTTC 2855 .......... .......... .......... .......... .......... .......... 470 CTTACATACA TCTTAGGTTC ACTTACTTTT GAACGTATGC TAGACTTATG AATCTATACA 2915 .......... .......... .......... .......... .......... .......... 470 CACAAGACAT GGGGCTTCAT TCATAAATTT TTAAGTGATT CATACATAGG GAGACTAAGT 2975 .......... .......... .......... .......... .......... .......... 470 CTCGACCCAC AACCCCCACC TACGAGGCGT GGTTCCACCC AAGGATCGTG CGCACATTCG 3035 .......... .......... .......... .......... .......... .......... 470 CAGGTCATGT TGTGTTTTGA CAGCTTTTTG GGCCTTCTTT TGGGTCCTCC TCAAGGACCC 3095 .......... .......... .......... .......... .......... .......... 470 TTGGGAGGTC CTTAGGGTCA CGCCTTGATG TTTAGGTCCT TAAACATTAA TACTAAGTAG 3155 .......... .......... .......... .......... .......... .......... 470 GGGAGGTTAT TTTATGTCTC TTATCTTTAC GTTCAACCTC ATTTAGGACA CTAGACTACA 3215 .......... .......... .......... .......... .......... .......... 470 TGTTAGGCTC TAACTTAGGT CATTAGATTT TTGGGGTGTT ACAAATCATC TGCTCAAATA 3275 .......... .......... .......... .......... .......... .......... 470 ACCAGTTAGG CCAGAAGGAA GTTAGACTGA AGATATGCTT GAAAAGATCT ATAACAAGGT 3335 .......... .......... .......... .......... .......... .......... 470 TTAAGGGTCT GATAAATTGT TGAAGGAATT CAAAAATGAC TTATCCACTC TATTCCCAGA 3395 .......... .......... .......... .......... .......... .......... 470 CGATGACTTC TAACACAGTT TCCATTAAGT AACTAGAGAC CATTCTAGGA CAAATTGGCA 3455 .......... .......... .......... .......... .......... .......... 470 CTCTTCACAA GCAAAGGCAA ATGGAAACAT TTCCTAGAAA TACCATCCCA AACCCCAATA 3515 .......... .......... .......... .......... .......... .......... 470 ACTATTGTTC AGTTAAAGAA TTTTGTCGTC CCATAACATT AAATAAGCAA TCGCTTAGGG 3575 .......... .......... .......... .......... .......... .......... 470 GACACCAAAG TTTTTACCTT TGTTTTTATT GCTTTTAAAT AACATGTGTT CTGAGTGCAG 3635 .......... .......... .......... .......... .......... .......... 470 GTTGAGAAGA GGTGGAAAAA TTGAAAAGAG AAGCATAGTC AATGCCAAGA CCCATAAAAT 3695 .......... .......... .......... .......... .......... .......... 470 GATATGTCGA AGGCATTGTG CGGACAGCGG ACTAATTGTT GTCTACCTTT TGAGCTTGAA 3755 .......... .......... .......... .......... .......... .......... 470 TAGACATAAA ATGTGCTTGC CAAAATTTGC AATCAGAACT CCAAAAAGGA ATTTATGGAA 3815 .......... .......... .......... .......... .......... .......... 470 GTGCAGTAAG GAGACATAGT CGGGTCGGCC AAAGCAACCT TCGGGTTTAG TAATTGTTAA 3875 .......... .......... .......... .......... .......... .......... 470 GGTATGTTCA GCGAGCTTGA GCTCAAAATA GTGATTTATC GAATACATTC AGCGACAGTA 3935 .......... .......... .......... .......... .......... .......... 470 GATATCACCA AATAAGTATG GTATAAAAAT ACCAAGGGCC TTTTAAAATT CAAAAAGTCT 3995 .......... .......... .......... .......... .......... .......... 470 AAAACTTTGA TACTTTAGAG CCTTTAAGAT TCTTTTATCT TGCTTTAAAT AATTTTTAGT 4055 .......... .......... .......... .......... .......... .......... 470 GTTTGTGAGC CTCTTATGAG TGTTTAAAAA TGGATTTTGT CTCAACCCAA CATACGTACC 4115 .......... .......... .......... .......... .......... .......... 470 TCTCAATTTT GGTGACCTTA AGGTTGGTTT TTGAATCCTT GTTCATATTT TTTAGTTTAA 4175 .......... .......... .......... .......... .......... .......... 470 AGTTTTTATA GGGTGTGAGC TTGAGCCCTC TAACTGATTT TGTGTGCCTG ATTTAAATGG 4235 .......... .......... .......... .......... .......... .......... 470 GTAGCATTGT ATTGTGTTAT TGGAACTGTA CCCATGAGTT TTCAATTGTG TATGCCGGTC 4295 .......... .......... .......... .......... .......... .......... 470 AAATAGCGAA ACGTGTGTGT TGGTTAAATT GTGTAATGAT CAGAGGGTTT GATATATTGT 4355 .......... .......... .......... .......... .......... .......... 470 GTTTTAATGA GTTTAAGGGT TCAGTTAATA AAGAATTAAA CAGAAGATTT CATAATATAA 4415 .......... .......... .......... .......... .......... .......... 470 ACTCATACAA ATGTAGGAGT TCATTTAATT ATTTCGCCAA ATTTAAATGG TATCTTCTAA 4475 .......... .......... .......... .......... .......... .......... 470 TTGTTATTCA ACCTGTATTT ATAAGGTTAA CCAATAAACA GAAATAGAAA TAAGATGAAA 4535 ||||| |||| ||| |||||||| |||| ||||| ||||||||| .......... .....TATTT ATAAAGTT-- --AATAAACA GAAACAGAAA TAAGATGAAG 511 CAGAAAAT-A CATAAAATAC AATAAGTAAT CCGAGTCTAC AAAAACTACT ATGTGTCCTT 4594 |||||||| | ||| ||||| || || |||| ||||||| || | |||||||| |||||||||| CAGAAAATAA TATAGAATAC AAGAATTAAT CCGAGTCCAC AGAAACTACT ATGTGTCCTT 571 AAGAAATTTA ATCCCCTCAC TGTACACAAG GTTATGGATT AATTTCTCCC AAGATAAAAT 4654 |||||||||| || ||||||| ||||| |||| |||||||||| |||||||| | ||||||||| AAGAAATTTA ATTCCCTCAC TGTACCCAAG GTTATGGATT AATTTCTCTC AAGATAAAAC 631 GGATTAAACC TGTTAAAGAA ATAGCAGCAC CTCAGATTTC TTTAACTAAA GCGAAATTCA 4714 |||||||||| |||||||||| ||||| | || |||| | ||| ||||||| | |||| || | GGATTAAACC TGTTAAAGAA ATAGCGGTAC CTCAAACTTC TTTAACTTCA ACGAACTTAA 691 GAACAACAAC AAGTCA 4730 |||||||||| |||||| GAACAACAAC AAGTCA 707 hqPGS_C06HBa0153O03.1-5+_SGN-E263584- (4491 4730) ******************************************************************************** EST sequence 11 -strand 706 n (File: SGN-E261066-) 1 AAAAAATCTG AGTGTGTCCA TGCTTCTGTT ACCAAGAATC GTTCATCAGC CGAAGTTTCA 61 TCTGACATAA CAGGAACATT CTCATTGATG AACTTCTGCA GACTCAACGT AGTGAGATAG 121 AAGAACATCT TTTGCTGCCA TCTCTTAAAG TCGACTCCAG AAATCTTTGC AGGTTTCTCA 181 GCCGGTGCTA AGGCAGCATG TGAACGATTA TGTGCAACCA TTGTTGTTGC AGCACTTACT 241 GTTCCAGCAT TTGTTTGACT TGAATTAGTC TTTTTTTTCT GTCACAATTG GCAGATAAAT 301 TAAGTATTTA AAATACTAAC AGTAAAGAGC AAATCTTTAC ACAACCTGTT TATGATTTAA 361 CGATAAAGTT TTTATGTTTT TTTAATCGTT AATCGAATGA TTTTAAAAAT TCAAGTAGAG 421 TAGAAAACCA TAAAGGTTTT AATCTCCAGA AACCTCAAAC ACAGAAACTG TATAAATTTT 481 CTTAAGATTG TTACTCTTAT AATAACTTTT ATTTATAAAG TTAATAAACA GAAACAGAAA 541 TAAGATGAAG CAGAAAATAA TATAGAATAC AAGAATTAAT CCGAGTCCAC AGAAACTACT 601 ATGTGTCCTT AAGAAATTTA ATTCCCTCAC TGTACCCAAG GTTATGGATT AATTTCTCTC 661 AAGATAAAAC GGATTAAACC TGTTAAAGAA ATAGCGGTAC CTCAAA Predicted gene structure (within gDNA segment 1 to 5389): Exon 1 876 891 ( 16 n); cDNA 494 509 ( 16 n); score: 0.812 Intron 1 892 4490 (3599 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.86) Exon 2 4491 4688 ( 198 n); cDNA 510 704 ( 195 n); score: 0.891 MATCH C06HBa0153O03.1-5+ SGN-E261066- 0.891 214 0.303 C PGS_C06HBa0153O03.1-5+_SGN-E261066- (876 891,4491 4688) Alignment (genomic DNA sequence = upper lines): CTCTAATAGA AACTTTGTCA CGACCGGCAT CTAGACCTCA TAAGAGACCA GCGTCGATGA 935 |||| ||| |||||| CTCTTATAAT AACTTT.... .......... .......... .......... .......... 509 CCTCTCAGAG GTCGCAGACA AGCCTACTTA CGTCATTCTT ACTTTACATA GGTTAATTTT 995 .......... .......... .......... .......... .......... .......... 509 AGCGGAAAAT TTTTGTTTAT AAACATACTT ATAAAACGAT TCTTATTAAT ACTAATTGTT 1055 .......... .......... .......... .......... .......... .......... 509 CACCATCAAC CATATAACAT TAGGAGATAA GCAACAGAAA TAAATAGTTC AATTCTTATT 1115 .......... .......... .......... .......... .......... .......... 509 AAGTGCATAA TTGCCAAAAC GGCACCAATA CATAGTCTGA ATAAAAACAG GAAGGAAACG 1175 .......... .......... .......... .......... .......... .......... 509 CTAGTGGAAC ATGCTCCACT AGCTCAACTC TAAAACTAAG CTAGAATATA AAACAGTGGC 1235 .......... .......... .......... .......... .......... .......... 509 ATCCTCGAAA GCATGACGAC CTACCAACTC CGAACGAATG CTCGACGTTT GGATAATTGC 1295 .......... .......... .......... .......... .......... .......... 509 AACGATGATC TTGTAGCTCT ATCGTCCATC TGTGTCTGCA CCTAAAAATA GTAGAGTTTG 1355 .......... .......... .......... .......... .......... .......... 509 TATAGGGTTA GTACACACTT TTAATAAGTA TGGGTATATG CAAGAACACA CCACGAATAT 1415 .......... .......... .......... .......... .......... .......... 509 GCATGAGAAA GAATAACTCT TTCTTAACAA CATGACTTTT TGGAAGTCAA GTCAGTGGAC 1475 .......... .......... .......... .......... .......... .......... 509 TTGCCAAATT TAGATTAGGA GAGTTACCAA ATTTGGAATA GGAAAGTCAA TGAGCTTTCC 1535 .......... .......... .......... .......... .......... .......... 509 AAATTTGGAA TAGGAAAGTC AGTAAGCTTT CCTGATTTGG AATAGGCTAG TTATGCCATG 1595 .......... .......... .......... .......... .......... .......... 509 AGTTTAACAC ACATCATCAT ACTTTGCACC TTTGCACACA CCACATAACA TTTACACATA 1655 .......... .......... .......... .......... .......... .......... 509 GCACATATCA TATAGCACAC TGCACAATTT GCATGAAGCA CATATTTTCT TTAATATCAT 1715 .......... .......... .......... .......... .......... .......... 509 TCATTCATAT GCCATAAGAC CTTTGGATCA TGGACTTAAT GTTAAGACAT CCCATAAATG 1775 .......... .......... .......... .......... .......... .......... 509 AGGTCTCAAT AGATGGGACC TCAACTAGAG AGTCTTCATT AGCAAACACA GAATCTGTTT 1835 .......... .......... .......... .......... .......... .......... 509 CGTTCATTCA TACGTACTCC ATTTCATTTC ATTCATAGGC CAGTATAAAC ACCAGCTCTA 1895 .......... .......... .......... .......... .......... .......... 509 CCTAGGATGT AGTTTTAGAC TTTCATTAAA TTCGTCATGA AATGACCAAG AATGACCTAA 1955 .......... .......... .......... .......... .......... .......... 509 TGTCATTACT TGAATCTAAC TCACCTTTTG ATTACCCTAT CCTAATACCT TTGCTATCAT 2015 .......... .......... .......... .......... .......... .......... 509 TCATTTCATT ATGTGCATCA TTTGAGGCTG GCCTCATTCT TTCATTGGAA ACTTTTACTT 2075 .......... .......... .......... .......... .......... .......... 509 TAACCGAAAT AGATCATGTG AGAATCAAAA AAATCCAGTG TCTCCCCCAC ACTAAAAATA 2135 .......... .......... .......... .......... .......... .......... 509 GGTGGAGTCA CCGGCCAAAG TGACCTAAAA CATGCTAGTG TACATGGTAA TTTAACCCCA 2195 .......... .......... .......... .......... .......... .......... 509 CCAGATCCCT ATATTGGTTG TATAGGTTCG AAGACTAAGA GATATAAGGA GACCTATACC 2255 .......... .......... .......... .......... .......... .......... 509 CAGCAAGCCG AAAAGTCTCA TCTCATGAGA GTTACGTGAA CCTCTTCCCT TCCCGAAAGA 2315 .......... .......... .......... .......... .......... .......... 509 AGGCATCACC GCTCATAGCC ATCCTAGCGG TGCTCAGTAT AAAGTTCCAT TTCATTCATC 2375 .......... .......... .......... .......... .......... .......... 509 TCATACGTCA TGGAAGTTGG CTTCTAGTAT GAGGACATCA TAGCTCATTA GATGATTTCT 2435 .......... .......... .......... .......... .......... .......... 509 CCATCTCATC ATTAGTATTA AGTGTGTTAG GCTCAATACT TTCATTAGAG TGTTCATAGA 2495 .......... .......... .......... .......... .......... .......... 509 GACTGGTCTC TTCATTATCT TACACTCTCA TAGATGAGTA CGTTTTGGTA ACATTTACTC 2555 .......... .......... .......... .......... .......... .......... 509 GGGCTCATTT AATATTACTT TCATCATATA CATTAGCCTC ATATCATGTT GTCACCACAT 2615 .......... .......... .......... .......... .......... .......... 509 TCCTTAACAT TAGCACCTTT GCTTTTCATA GTTACTCACC TCTTACTACG TGAATGTTCC 2675 .......... .......... .......... .......... .......... .......... 509 TTTCATCATA TGTCAATTTT TGGCCGTTTA ATGTGTTTTA CACCCTTACT TACCCCTTCT 2735 .......... .......... .......... .......... .......... .......... 509 AGGGTTTCAT AATTTCATTA TTCATTACAT AATTTCATTG TTAATTTCAT CATATGCCAA 2795 .......... .......... .......... .......... .......... .......... 509 TGCACTTGGA CATTTAGTGT ATTTTACACT CATACTTAAC CTTCTAGGAT TCATCATTTC 2855 .......... .......... .......... .......... .......... .......... 509 CTTACATACA TCTTAGGTTC ACTTACTTTT GAACGTATGC TAGACTTATG AATCTATACA 2915 .......... .......... .......... .......... .......... .......... 509 CACAAGACAT GGGGCTTCAT TCATAAATTT TTAAGTGATT CATACATAGG GAGACTAAGT 2975 .......... .......... .......... .......... .......... .......... 509 CTCGACCCAC AACCCCCACC TACGAGGCGT GGTTCCACCC AAGGATCGTG CGCACATTCG 3035 .......... .......... .......... .......... .......... .......... 509 CAGGTCATGT TGTGTTTTGA CAGCTTTTTG GGCCTTCTTT TGGGTCCTCC TCAAGGACCC 3095 .......... .......... .......... .......... .......... .......... 509 TTGGGAGGTC CTTAGGGTCA CGCCTTGATG TTTAGGTCCT TAAACATTAA TACTAAGTAG 3155 .......... .......... .......... .......... .......... .......... 509 GGGAGGTTAT TTTATGTCTC TTATCTTTAC GTTCAACCTC ATTTAGGACA CTAGACTACA 3215 .......... .......... .......... .......... .......... .......... 509 TGTTAGGCTC TAACTTAGGT CATTAGATTT TTGGGGTGTT ACAAATCATC TGCTCAAATA 3275 .......... .......... .......... .......... .......... .......... 509 ACCAGTTAGG CCAGAAGGAA GTTAGACTGA AGATATGCTT GAAAAGATCT ATAACAAGGT 3335 .......... .......... .......... .......... .......... .......... 509 TTAAGGGTCT GATAAATTGT TGAAGGAATT CAAAAATGAC TTATCCACTC TATTCCCAGA 3395 .......... .......... .......... .......... .......... .......... 509 CGATGACTTC TAACACAGTT TCCATTAAGT AACTAGAGAC CATTCTAGGA CAAATTGGCA 3455 .......... .......... .......... .......... .......... .......... 509 CTCTTCACAA GCAAAGGCAA ATGGAAACAT TTCCTAGAAA TACCATCCCA AACCCCAATA 3515 .......... .......... .......... .......... .......... .......... 509 ACTATTGTTC AGTTAAAGAA TTTTGTCGTC CCATAACATT AAATAAGCAA TCGCTTAGGG 3575 .......... .......... .......... .......... .......... .......... 509 GACACCAAAG TTTTTACCTT TGTTTTTATT GCTTTTAAAT AACATGTGTT CTGAGTGCAG 3635 .......... .......... .......... .......... .......... .......... 509 GTTGAGAAGA GGTGGAAAAA TTGAAAAGAG AAGCATAGTC AATGCCAAGA CCCATAAAAT 3695 .......... .......... .......... .......... .......... .......... 509 GATATGTCGA AGGCATTGTG CGGACAGCGG ACTAATTGTT GTCTACCTTT TGAGCTTGAA 3755 .......... .......... .......... .......... .......... .......... 509 TAGACATAAA ATGTGCTTGC CAAAATTTGC AATCAGAACT CCAAAAAGGA ATTTATGGAA 3815 .......... .......... .......... .......... .......... .......... 509 GTGCAGTAAG GAGACATAGT CGGGTCGGCC AAAGCAACCT TCGGGTTTAG TAATTGTTAA 3875 .......... .......... .......... .......... .......... .......... 509 GGTATGTTCA GCGAGCTTGA GCTCAAAATA GTGATTTATC GAATACATTC AGCGACAGTA 3935 .......... .......... .......... .......... .......... .......... 509 GATATCACCA AATAAGTATG GTATAAAAAT ACCAAGGGCC TTTTAAAATT CAAAAAGTCT 3995 .......... .......... .......... .......... .......... .......... 509 AAAACTTTGA TACTTTAGAG CCTTTAAGAT TCTTTTATCT TGCTTTAAAT AATTTTTAGT 4055 .......... .......... .......... .......... .......... .......... 509 GTTTGTGAGC CTCTTATGAG TGTTTAAAAA TGGATTTTGT CTCAACCCAA CATACGTACC 4115 .......... .......... .......... .......... .......... .......... 509 TCTCAATTTT GGTGACCTTA AGGTTGGTTT TTGAATCCTT GTTCATATTT TTTAGTTTAA 4175 .......... .......... .......... .......... .......... .......... 509 AGTTTTTATA GGGTGTGAGC TTGAGCCCTC TAACTGATTT TGTGTGCCTG ATTTAAATGG 4235 .......... .......... .......... .......... .......... .......... 509 GTAGCATTGT ATTGTGTTAT TGGAACTGTA CCCATGAGTT TTCAATTGTG TATGCCGGTC 4295 .......... .......... .......... .......... .......... .......... 509 AAATAGCGAA ACGTGTGTGT TGGTTAAATT GTGTAATGAT CAGAGGGTTT GATATATTGT 4355 .......... .......... .......... .......... .......... .......... 509 GTTTTAATGA GTTTAAGGGT TCAGTTAATA AAGAATTAAA CAGAAGATTT CATAATATAA 4415 .......... .......... .......... .......... .......... .......... 509 ACTCATACAA ATGTAGGAGT TCATTTAATT ATTTCGCCAA ATTTAAATGG TATCTTCTAA 4475 .......... .......... .......... .......... .......... .......... 509 TTGTTATTCA ACCTGTATTT ATAAGGTTAA CCAATAAACA GAAATAGAAA TAAGATGAAA 4535 ||||| |||| ||| |||||||| |||| ||||| ||||||||| .......... .....TATTT ATAAAGTT-- --AATAAACA GAAACAGAAA TAAGATGAAG 550 CAGAAAAT-A CATAAAATAC AATAAGTAAT CCGAGTCTAC AAAAACTACT ATGTGTCCTT 4594 |||||||| | ||| ||||| || || |||| ||||||| || | |||||||| |||||||||| CAGAAAATAA TATAGAATAC AAGAATTAAT CCGAGTCCAC AGAAACTACT ATGTGTCCTT 610 AAGAAATTTA ATCCCCTCAC TGTACACAAG GTTATGGATT AATTTCTCCC AAGATAAAAT 4654 |||||||||| || ||||||| ||||| |||| |||||||||| |||||||| | ||||||||| AAGAAATTTA ATTCCCTCAC TGTACCCAAG GTTATGGATT AATTTCTCTC AAGATAAAAC 670 GGATTAAACC TGTTAAAGAA ATAGCAGCAC CTCA 4688 |||||||||| |||||||||| ||||| | || |||| GGATTAAACC TGTTAAAGAA ATAGCGGTAC CTCA 704 hqPGS_C06HBa0153O03.1-5+_SGN-E261066- (4491 4688) ******************************************************************************** EST sequence 24 -strand 635 n (File: SGN-E276669-) 1 CAGGAACATT CTCATTGATG AACTTCTGCA GACTCAACGT AGTGAGATAG AAGAACATCT 61 TTTGCTGCCA TCTCTTAAAG TCGACTCCAG AAAACTTTGC AGGTTTCTCA GCCGGTGCTA 121 AGGCAGCATG TGAACGATTA TGTGCAACCA TTGTTGTTGC AGCACTTACT GTTCCAGCAT 181 TTGTTTGACT TGAATTAGTC TTTTTTTTCT GTCACAATTG GCAGATAAAT TAAGTATTTA 241 AAATACTAAC AGTAAAGAGC AAATCTTTAC ACAACCTGTT TCTGATTTAA CGATAAAGTT 301 TTTATGTTTT TTTAATCGTT AATCGAATGA TTTTAAAAAT TCAAGTAGAG TAGAAAACCA 361 TAAAGGTTTT AATCTCCAGA AACCTCAAAC ACAGAAACTG TATAAATTTT CTTAAGATTG 421 TTACTCTTAT AATAACTTTT ATTTATAAAG TTAATAAACA GAAACAGAAA TAAGATGAAG 481 CAGAAAATAA TATAGAATAC AAGAATTAAT CCGAGTCCAC AGAAACTACT ATGTGTCCTT 541 AAGAAATTTA ATTCCCTCAC TGTACCCAAG GTTATGGATT AATTTCTCTC AAGATAAAAC 601 GGATTAAACC TGTTAAAGAA ATAGCGGTAC CTCAA Predicted gene structure (within gDNA segment 1 to 5379): Exon 1 876 891 ( 16 n); cDNA 424 439 ( 16 n); score: 0.812 Intron 1 892 4490 (3599 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.86) Exon 2 4491 4688 ( 198 n); cDNA 440 634 ( 195 n); score: 0.891 MATCH C06HBa0153O03.1-5+ SGN-E276669- 0.891 214 0.337 C PGS_C06HBa0153O03.1-5+_SGN-E276669- (876 891,4491 4688) Alignment (genomic DNA sequence = upper lines): CTCTAATAGA AACTTTGTCA CGACCGGCAT CTAGACCTCA TAAGAGACCA GCGTCGATGA 935 |||| ||| |||||| CTCTTATAAT AACTTT.... .......... .......... .......... .......... 439 CCTCTCAGAG GTCGCAGACA AGCCTACTTA CGTCATTCTT ACTTTACATA GGTTAATTTT 995 .......... .......... .......... .......... .......... .......... 439 AGCGGAAAAT TTTTGTTTAT AAACATACTT ATAAAACGAT TCTTATTAAT ACTAATTGTT 1055 .......... .......... .......... .......... .......... .......... 439 CACCATCAAC CATATAACAT TAGGAGATAA GCAACAGAAA TAAATAGTTC AATTCTTATT 1115 .......... .......... .......... .......... .......... .......... 439 AAGTGCATAA TTGCCAAAAC GGCACCAATA CATAGTCTGA ATAAAAACAG GAAGGAAACG 1175 .......... .......... .......... .......... .......... .......... 439 CTAGTGGAAC ATGCTCCACT AGCTCAACTC TAAAACTAAG CTAGAATATA AAACAGTGGC 1235 .......... .......... .......... .......... .......... .......... 439 ATCCTCGAAA GCATGACGAC CTACCAACTC CGAACGAATG CTCGACGTTT GGATAATTGC 1295 .......... .......... .......... .......... .......... .......... 439 AACGATGATC TTGTAGCTCT ATCGTCCATC TGTGTCTGCA CCTAAAAATA GTAGAGTTTG 1355 .......... .......... .......... .......... .......... .......... 439 TATAGGGTTA GTACACACTT TTAATAAGTA TGGGTATATG CAAGAACACA CCACGAATAT 1415 .......... .......... .......... .......... .......... .......... 439 GCATGAGAAA GAATAACTCT TTCTTAACAA CATGACTTTT TGGAAGTCAA GTCAGTGGAC 1475 .......... .......... .......... .......... .......... .......... 439 TTGCCAAATT TAGATTAGGA GAGTTACCAA ATTTGGAATA GGAAAGTCAA TGAGCTTTCC 1535 .......... .......... .......... .......... .......... .......... 439 AAATTTGGAA TAGGAAAGTC AGTAAGCTTT CCTGATTTGG AATAGGCTAG TTATGCCATG 1595 .......... .......... .......... .......... .......... .......... 439 AGTTTAACAC ACATCATCAT ACTTTGCACC TTTGCACACA CCACATAACA TTTACACATA 1655 .......... .......... .......... .......... .......... .......... 439 GCACATATCA TATAGCACAC TGCACAATTT GCATGAAGCA CATATTTTCT TTAATATCAT 1715 .......... .......... .......... .......... .......... .......... 439 TCATTCATAT GCCATAAGAC CTTTGGATCA TGGACTTAAT GTTAAGACAT CCCATAAATG 1775 .......... .......... .......... .......... .......... .......... 439 AGGTCTCAAT AGATGGGACC TCAACTAGAG AGTCTTCATT AGCAAACACA GAATCTGTTT 1835 .......... .......... .......... .......... .......... .......... 439 CGTTCATTCA TACGTACTCC ATTTCATTTC ATTCATAGGC CAGTATAAAC ACCAGCTCTA 1895 .......... .......... .......... .......... .......... .......... 439 CCTAGGATGT AGTTTTAGAC TTTCATTAAA TTCGTCATGA AATGACCAAG AATGACCTAA 1955 .......... .......... .......... .......... .......... .......... 439 TGTCATTACT TGAATCTAAC TCACCTTTTG ATTACCCTAT CCTAATACCT TTGCTATCAT 2015 .......... .......... .......... .......... .......... .......... 439 TCATTTCATT ATGTGCATCA TTTGAGGCTG GCCTCATTCT TTCATTGGAA ACTTTTACTT 2075 .......... .......... .......... .......... .......... .......... 439 TAACCGAAAT AGATCATGTG AGAATCAAAA AAATCCAGTG TCTCCCCCAC ACTAAAAATA 2135 .......... .......... .......... .......... .......... .......... 439 GGTGGAGTCA CCGGCCAAAG TGACCTAAAA CATGCTAGTG TACATGGTAA TTTAACCCCA 2195 .......... .......... .......... .......... .......... .......... 439 CCAGATCCCT ATATTGGTTG TATAGGTTCG AAGACTAAGA GATATAAGGA GACCTATACC 2255 .......... .......... .......... .......... .......... .......... 439 CAGCAAGCCG AAAAGTCTCA TCTCATGAGA GTTACGTGAA CCTCTTCCCT TCCCGAAAGA 2315 .......... .......... .......... .......... .......... .......... 439 AGGCATCACC GCTCATAGCC ATCCTAGCGG TGCTCAGTAT AAAGTTCCAT TTCATTCATC 2375 .......... .......... .......... .......... .......... .......... 439 TCATACGTCA TGGAAGTTGG CTTCTAGTAT GAGGACATCA TAGCTCATTA GATGATTTCT 2435 .......... .......... .......... .......... .......... .......... 439 CCATCTCATC ATTAGTATTA AGTGTGTTAG GCTCAATACT TTCATTAGAG TGTTCATAGA 2495 .......... .......... .......... .......... .......... .......... 439 GACTGGTCTC TTCATTATCT TACACTCTCA TAGATGAGTA CGTTTTGGTA ACATTTACTC 2555 .......... .......... .......... .......... .......... .......... 439 GGGCTCATTT AATATTACTT TCATCATATA CATTAGCCTC ATATCATGTT GTCACCACAT 2615 .......... .......... .......... .......... .......... .......... 439 TCCTTAACAT TAGCACCTTT GCTTTTCATA GTTACTCACC TCTTACTACG TGAATGTTCC 2675 .......... .......... .......... .......... .......... .......... 439 TTTCATCATA TGTCAATTTT TGGCCGTTTA ATGTGTTTTA CACCCTTACT TACCCCTTCT 2735 .......... .......... .......... .......... .......... .......... 439 AGGGTTTCAT AATTTCATTA TTCATTACAT AATTTCATTG TTAATTTCAT CATATGCCAA 2795 .......... .......... .......... .......... .......... .......... 439 TGCACTTGGA CATTTAGTGT ATTTTACACT CATACTTAAC CTTCTAGGAT TCATCATTTC 2855 .......... .......... .......... .......... .......... .......... 439 CTTACATACA TCTTAGGTTC ACTTACTTTT GAACGTATGC TAGACTTATG AATCTATACA 2915 .......... .......... .......... .......... .......... .......... 439 CACAAGACAT GGGGCTTCAT TCATAAATTT TTAAGTGATT CATACATAGG GAGACTAAGT 2975 .......... .......... .......... .......... .......... .......... 439 CTCGACCCAC AACCCCCACC TACGAGGCGT GGTTCCACCC AAGGATCGTG CGCACATTCG 3035 .......... .......... .......... .......... .......... .......... 439 CAGGTCATGT TGTGTTTTGA CAGCTTTTTG GGCCTTCTTT TGGGTCCTCC TCAAGGACCC 3095 .......... .......... .......... .......... .......... .......... 439 TTGGGAGGTC CTTAGGGTCA CGCCTTGATG TTTAGGTCCT TAAACATTAA TACTAAGTAG 3155 .......... .......... .......... .......... .......... .......... 439 GGGAGGTTAT TTTATGTCTC TTATCTTTAC GTTCAACCTC ATTTAGGACA CTAGACTACA 3215 .......... .......... .......... .......... .......... .......... 439 TGTTAGGCTC TAACTTAGGT CATTAGATTT TTGGGGTGTT ACAAATCATC TGCTCAAATA 3275 .......... .......... .......... .......... .......... .......... 439 ACCAGTTAGG CCAGAAGGAA GTTAGACTGA AGATATGCTT GAAAAGATCT ATAACAAGGT 3335 .......... .......... .......... .......... .......... .......... 439 TTAAGGGTCT GATAAATTGT TGAAGGAATT CAAAAATGAC TTATCCACTC TATTCCCAGA 3395 .......... .......... .......... .......... .......... .......... 439 CGATGACTTC TAACACAGTT TCCATTAAGT AACTAGAGAC CATTCTAGGA CAAATTGGCA 3455 .......... .......... .......... .......... .......... .......... 439 CTCTTCACAA GCAAAGGCAA ATGGAAACAT TTCCTAGAAA TACCATCCCA AACCCCAATA 3515 .......... .......... .......... .......... .......... .......... 439 ACTATTGTTC AGTTAAAGAA TTTTGTCGTC CCATAACATT AAATAAGCAA TCGCTTAGGG 3575 .......... .......... .......... .......... .......... .......... 439 GACACCAAAG TTTTTACCTT TGTTTTTATT GCTTTTAAAT AACATGTGTT CTGAGTGCAG 3635 .......... .......... .......... .......... .......... .......... 439 GTTGAGAAGA GGTGGAAAAA TTGAAAAGAG AAGCATAGTC AATGCCAAGA CCCATAAAAT 3695 .......... .......... .......... .......... .......... .......... 439 GATATGTCGA AGGCATTGTG CGGACAGCGG ACTAATTGTT GTCTACCTTT TGAGCTTGAA 3755 .......... .......... .......... .......... .......... .......... 439 TAGACATAAA ATGTGCTTGC CAAAATTTGC AATCAGAACT CCAAAAAGGA ATTTATGGAA 3815 .......... .......... .......... .......... .......... .......... 439 GTGCAGTAAG GAGACATAGT CGGGTCGGCC AAAGCAACCT TCGGGTTTAG TAATTGTTAA 3875 .......... .......... .......... .......... .......... .......... 439 GGTATGTTCA GCGAGCTTGA GCTCAAAATA GTGATTTATC GAATACATTC AGCGACAGTA 3935 .......... .......... .......... .......... .......... .......... 439 GATATCACCA AATAAGTATG GTATAAAAAT ACCAAGGGCC TTTTAAAATT CAAAAAGTCT 3995 .......... .......... .......... .......... .......... .......... 439 AAAACTTTGA TACTTTAGAG CCTTTAAGAT TCTTTTATCT TGCTTTAAAT AATTTTTAGT 4055 .......... .......... .......... .......... .......... .......... 439 GTTTGTGAGC CTCTTATGAG TGTTTAAAAA TGGATTTTGT CTCAACCCAA CATACGTACC 4115 .......... .......... .......... .......... .......... .......... 439 TCTCAATTTT GGTGACCTTA AGGTTGGTTT TTGAATCCTT GTTCATATTT TTTAGTTTAA 4175 .......... .......... .......... .......... .......... .......... 439 AGTTTTTATA GGGTGTGAGC TTGAGCCCTC TAACTGATTT TGTGTGCCTG ATTTAAATGG 4235 .......... .......... .......... .......... .......... .......... 439 GTAGCATTGT ATTGTGTTAT TGGAACTGTA CCCATGAGTT TTCAATTGTG TATGCCGGTC 4295 .......... .......... .......... .......... .......... .......... 439 AAATAGCGAA ACGTGTGTGT TGGTTAAATT GTGTAATGAT CAGAGGGTTT GATATATTGT 4355 .......... .......... .......... .......... .......... .......... 439 GTTTTAATGA GTTTAAGGGT TCAGTTAATA AAGAATTAAA CAGAAGATTT CATAATATAA 4415 .......... .......... .......... .......... .......... .......... 439 ACTCATACAA ATGTAGGAGT TCATTTAATT ATTTCGCCAA ATTTAAATGG TATCTTCTAA 4475 .......... .......... .......... .......... .......... .......... 439 TTGTTATTCA ACCTGTATTT ATAAGGTTAA CCAATAAACA GAAATAGAAA TAAGATGAAA 4535 ||||| |||| ||| |||||||| |||| ||||| ||||||||| .......... .....TATTT ATAAAGTT-- --AATAAACA GAAACAGAAA TAAGATGAAG 480 CAGAAAAT-A CATAAAATAC AATAAGTAAT CCGAGTCTAC AAAAACTACT ATGTGTCCTT 4594 |||||||| | ||| ||||| || || |||| ||||||| || | |||||||| |||||||||| CAGAAAATAA TATAGAATAC AAGAATTAAT CCGAGTCCAC AGAAACTACT ATGTGTCCTT 540 AAGAAATTTA ATCCCCTCAC TGTACACAAG GTTATGGATT AATTTCTCCC AAGATAAAAT 4654 |||||||||| || ||||||| ||||| |||| |||||||||| |||||||| | ||||||||| AAGAAATTTA ATTCCCTCAC TGTACCCAAG GTTATGGATT AATTTCTCTC AAGATAAAAC 600 GGATTAAACC TGTTAAAGAA ATAGCAGCAC CTCA 4688 |||||||||| |||||||||| ||||| | || |||| GGATTAAACC TGTTAAAGAA ATAGCGGTAC CTCA 634 hqPGS_C06HBa0153O03.1-5+_SGN-E276669- (4491 4688) ******************************************************************************** EST sequence 10 -strand 397 n (File: SGN-E262800-) 1 TATAAATTTT CTTAAGATTG TTACTCTTAT AATAACTTTT ATTTATAAAG TTAATAAACA 61 GAAACAGAAA TAAGATGAAG CAGAAAATAA TATAGAATAC AAGAATTAAT CCGAGTCCAC 121 AGAAACTACT ATGTGTCCTT AAGAAATTTA ATTCCCTCAC TGTACCCAAG GTTATGGATT 181 AATTTCTCTC AAGATAAAAC GGATTAAACC TGTTAAAGAA ATAGCGGTAC CTCAAACTTC 241 TTTAACTTCA ACGAACTTAA GAACAACAAC AAGTCACACA GACTCAGTCG ATCGACACTT 301 TGATTTTATT TGAGAGAAAA ATAAATGCAG AGAAAGAAAA ATTTTCAGTG TTTAAAAAAT 361 CAAAATTTGA CTTCCTTTTA TAGCCATTTT CAGCAAG Predicted gene structure (within gDNA segment 2747 to 5448): Exon 1 4517 4848 ( 332 n); cDNA 62 397 ( 336 n); score: 0.892 MATCH C06HBa0153O03.1-5+ SGN-E262800- 0.892 332 0.836 C PGS_C06HBa0153O03.1-5+_SGN-E262800- (4517 4848) Alignment (genomic DNA sequence = upper lines): AAATAGAAAT AAGATGAAAC AGAAAAT-AC ATAAAATACA ATAAGTAATC CGAGTCTACA 4575 ||| |||||| |||||||| | ||||||| | ||| |||||| | || ||||| |||||| ||| AAACAGAAAT AAGATGAAGC AGAAAATAAT ATAGAATACA AGAATTAATC CGAGTCCACA 121 AAAACTACTA TGTGTCCTTA AGAAATTTAA TCCCCTCACT GTACACAAGG TTATGGATTA 4635 ||||||||| |||||||||| |||||||||| | |||||||| |||| ||||| |||||||||| GAAACTACTA TGTGTCCTTA AGAAATTTAA TTCCCTCACT GTACCCAAGG TTATGGATTA 181 ATTTCTCCCA AGATAAAATG GATTAAACCT GTTAAAGAAA TAGCAGCACC TCAGATTTCT 4695 ||||||| || |||||||| | |||||||||| |||||||||| |||| | ||| ||| | |||| ATTTCTCTCA AGATAAAACG GATTAAACCT GTTAAAGAAA TAGCGGTACC TCAAACTTCT 241 TTAACTAAAG CGAAATTCAG AACAACAACA AGTCACATAG ACTCAGTCGA TCGACACTTT 4755 |||||| | |||| || || |||||||||| ||||||| || |||||||||| |||||||||| TTAACTTCAA CGAACTTAAG AACAACAACA AGTCACACAG ACTCAGTCGA TCGACACTTT 301 GA-TTTATTT GAGAGAAAAA TATATGCAGA G-AAGGAAAA TTTT-AGTGT TTGAAAAATC 4812 || ||||||| |||||||||| || ||||||| | ||| |||| |||| ||||| || ||||||| GATTTTATTT GAGAGAAAAA TAAATGCAGA GAAAGAAAAA TTTTCAGTGT TTAAAAAATC 361 AAAAATTGAC TTCCTTTTAT AGCCATTTTC AGCAAG 4848 |||| ||||| |||||||||| |||||||||| |||||| AAAATTTGAC TTCCTTTTAT AGCCATTTTC AGCAAG 397 hqPGS_C06HBa0153O03.1-5+_SGN-E262800- (4517 4848) ******************************************************************************** EST sequence 27 -strand 329 n (File: SGN-E258205-) 1 AGATTGTTAC TCTTATAATA CCTTTTATTT ATAAAGTTAA TAAACAGAAC CAGAAATAAG 61 ATGAAGCAGA AAATAATATA GAATACAAGA ATTAATCCGA GTCCACAGAA ACTACTATGT 121 GTCCTTAAGA AATTTAATTC CCTCACTGTA CCCAAGGTTA TGGATTAATT TCTCTCAAGA 181 TAAAACGGAT TAAACCTGTT AAAGAAATAG CGGTACCTCA AACTTCTTTA ACTTCAACGA 241 ACTTAAGAAC AACAACAAGT CACACAGACT CAGTCGATCG ACACTTTGAT TTTATTTGAG 301 AGAAAAATAA ATGCAGAGAA AGAAAAATT Predicted gene structure (within gDNA segment 2887 to 5576): Exon 1 4517 4796 ( 280 n); cDNA 48 329 ( 282 n); score: 0.889 MATCH C06HBa0153O03.1-5+ SGN-E258205- 0.889 280 0.851 C PGS_C06HBa0153O03.1-5+_SGN-E258205- (4517 4796) Alignment (genomic DNA sequence = upper lines): AAATAGAAAT AAGATGAAAC AGAAAAT-AC ATAAAATACA ATAAGTAATC CGAGTCTACA 4575 || |||||| |||||||| | ||||||| | ||| |||||| | || ||||| |||||| ||| AACCAGAAAT AAGATGAAGC AGAAAATAAT ATAGAATACA AGAATTAATC CGAGTCCACA 107 AAAACTACTA TGTGTCCTTA AGAAATTTAA TCCCCTCACT GTACACAAGG TTATGGATTA 4635 ||||||||| |||||||||| |||||||||| | |||||||| |||| ||||| |||||||||| GAAACTACTA TGTGTCCTTA AGAAATTTAA TTCCCTCACT GTACCCAAGG TTATGGATTA 167 ATTTCTCCCA AGATAAAATG GATTAAACCT GTTAAAGAAA TAGCAGCACC TCAGATTTCT 4695 ||||||| || |||||||| | |||||||||| |||||||||| |||| | ||| ||| | |||| ATTTCTCTCA AGATAAAACG GATTAAACCT GTTAAAGAAA TAGCGGTACC TCAAACTTCT 227 TTAACTAAAG CGAAATTCAG AACAACAACA AGTCACATAG ACTCAGTCGA TCGACACTTT 4755 |||||| | |||| || || |||||||||| ||||||| || |||||||||| |||||||||| TTAACTTCAA CGAACTTAAG AACAACAACA AGTCACACAG ACTCAGTCGA TCGACACTTT 287 GA-TTTATTT GAGAGAAAAA TATATGCAGA GAAGGAAAAT TT 4796 || ||||||| |||||||||| || ||||||| ||| ||||| || GATTTTATTT GAGAGAAAAA TAAATGCAGA GAAAGAAAAA TT 329 hqPGS_C06HBa0153O03.1-5+_SGN-E258205- (4517 4796) ******************************************************************************** EST sequence 13 -strand 227 n (File: SGN-E261310-) 1 AATTTCTCTC AAGATAAAAC GGATTAAACC TGTTAAAGAA ATAGCGGTAC CTCAAACTTC 61 TTTAACTTCA ACGAACTTAA GAACAACAAC AAGTCACACA GACTCAGTCG ATCGACACTT 121 TGATTTTATT TGAGAGAAAA ATAAATGCAG AGAAAGAAAA ATTTTCAGTG TTTAAAAAAT 181 CAAAATTTGA CTTCCTTTTA TAGCCATTTT CAGCAAGGAA CATGTCC Predicted gene structure (within gDNA segment 3314 to 5548): Exon 1 4635 4856 ( 222 n); cDNA 1 225 ( 225 n); score: 0.885 MATCH C06HBa0153O03.1-5+ SGN-E261310- 0.885 222 0.978 C PGS_C06HBa0153O03.1-5+_SGN-E261310- (4635 4856) Alignment (genomic DNA sequence = upper lines): AATTTCTCCC AAGATAAAAT GGATTAAACC TGTTAAAGAA ATAGCAGCAC CTCAGATTTC 4694 |||||||| | ||||||||| |||||||||| |||||||||| ||||| | || |||| | ||| AATTTCTCTC AAGATAAAAC GGATTAAACC TGTTAAAGAA ATAGCGGTAC CTCAAACTTC 60 TTTAACTAAA GCGAAATTCA GAACAACAAC AAGTCACATA GACTCAGTCG ATCGACACTT 4754 ||||||| | |||| || | |||||||||| |||||||| | |||||||||| |||||||||| TTTAACTTCA ACGAACTTAA GAACAACAAC AAGTCACACA GACTCAGTCG ATCGACACTT 120 TGA-TTTATT TGAGAGAAAA ATATATGCAG AG-AAGGAAA ATTTT-AGTG TTTGAAAAAT 4811 ||| |||||| |||||||||| ||| |||||| || ||| ||| ||||| |||| ||| |||||| TGATTTTATT TGAGAGAAAA ATAAATGCAG AGAAAGAAAA ATTTTCAGTG TTTAAAAAAT 180 CAAAAATTGA CTTCCTTTTA TAGCCATTTT CAGCAAGAAA CGTGT 4856 ||||| |||| |||||||||| |||||||||| ||||||| || | ||| CAAAATTTGA CTTCCTTTTA TAGCCATTTT CAGCAAGGAA CATGT 225 hqPGS_C06HBa0153O03.1-5+_SGN-E261310- (4635 4856) ******************************************************************************** EST sequence 6 -strand 792 n (File: SGN-E351952-) 1 TGGAGTTTGA AGTCCTCCAA TAGGTTTCAA CTTAGAGAGA GAATTTAGAG AGAAAGGAGA 61 GCATGTAAAT TTTTTGAATT ATAAACATCA TTTGTGAAAT CCAAATCCAA AACCGAAGCC 121 GAGGCCGAGG CCAAGCGGCG GCGACGGCGT GAGGGGGCAC CTTCTTCTTA ACCCTTTAAG 181 AAGCACAGGA AGTGTTTCCT AATATAAGGA CAACAATTTC ATTTCTTCTA CCTATATGAG 241 AAAAATGACT TTTCATTTGT ACTTTGCAAA TGACTTTTCT TTTTTCCTCC AAAATAGGTT 301 CCCTCAATTT TCCATTTTTC TCTTCTCTTT TTCCATTCAC ATTTTGCTAA ACCCAAAAAT 361 CCCCCACATA AATGGGAAAT GTCTATTGTT AAAACATATG CATGAAAAAC TGAAGTGTCT 421 TGTGTGTAAG CGTTAGTCAC ATCTGGATAA GTAGGTTTCT CTTTAAACTT TCCACAGTGA 481 TCATATATCG GATATACTCG ATCAATCGGT AGATTTGATA TCTTTAAACC ATCGAGCTTT 541 GGTGTATACT TAGACAACAT AAGTCACACA ATCAACCCTT GAATTGTTCT TAGTTCTCAT 601 TATTTTGTTC ATTTCATCCA AGAACACATC TCAATTAGTA AGTGCTTAGA GAACTGGCCT 661 TACCGGATTC TCCTTCAAGT GACGTACACT TCACACTCAC ATAGGTGGTT TCTATATGTG 721 TTATCCCGTA GATACACTAT TTGATATACT ATGTATCAAA CTTAGAATCC ATTAAAAAGT 781 CCTTATGTCT TT Predicted gene structure (within gDNA segment 76 to 6469): Exon 1 5098 5806 ( 709 n); cDNA 85 792 ( 708 n); score: 0.836 MATCH C06HBa0153O03.1-5+ SGN-E351952- 0.836 709 0.895 C PGS_C06HBa0153O03.1-5+_SGN-E351952- (5098 5806) Alignment (genomic DNA sequence = upper lines): ACATCATTTG CCAAATCCAA ATCCAAATCC AAATCCTAAG CCGAAGCCGA GCGAACGACG 5157 |||||||||| |||||||| ||||||| || || || | | |||| ||| | ||| | || ACATCATTTG TGAAATCCAA ATCCAAAACC GAAGCCGAGG CCGAGGCCAA GCG---G-CG 140 ACGACGGCGC GAGGGGGCAT CTTCTTCTTA GCTCTTTAAG AATTAATGGA AGTGTTTCCT 5217 |||||||| ||||||||| |||||||||| | ||||||| || | ||| |||||||||| GCGACGGCGT GAGGGGGCAC CTTCTTCTTA ACCCTTTAAG AAGCACAGGA AGTGTTTCCT 200 TATATAAGGA CAACAATTTC CCTTTCTTTT GATGACATAG GAGAAATGAC TTTTCATTTG 5277 ||||||||| ||||||||| | |||||| | | || || ||||||| |||||||||| AATATAAGGA CAACAATTT- CATTTCTTCT ACCTATATGA GAAAAATGAC TTTTCATTTG 259 CACTTTGTAA ATGACTTTTC ATTTTCCCTC CAAAATA-GT TCCCTTACTT TTCATATTCT 5336 |||||| || |||||||||| |||| |||| ||||||| || ||||| | || ||| || | TACTTTGCAA ATGACTTTTC TTTTTTCCTC CAAAATAGGT TCCCTCAATT TTCCATTTTT 319 CTCTTTTCTT TTCTCATTCA CA-CATGTTA AATCTAACAA TCCCCCACGT GAATAGGGAA 5395 ||||| |||| || |||||| || || || || | || || |||||||| | ||| || || CTCTTCTCTT TTTCCATTCA CATTTTGCTA AACCCAAAAA TCCCCCACAT AAATGGGAAA 379 -GGCTATTGT TAAAACATAT GCATGAAAAA CTTGTGTGTC TTCTG-GTAA AGGCTAATCG 5453 | ||||||| |||||||||| |||||||||| || ||||| || || |||| | || || TGTCTATTGT TAAAACATAT GCATGAAAAA CTGAAGTGTC TTGTGTGTAA GCGTTAGTCA 439 CATCTGGATA AGTAGATTTC CCTTTAAACT TTCCGTAGTG AACATATATC GGATATACTC 5513 |||||||||| ||||| |||| ||||||||| |||| |||| | |||||||| |||||||||| CATCTGGATA AGTAGGTTTC TCTTTAAACT TTCCACAGTG ATCATATATC GGATATACTC 499 GGTCAATTGG TAGATTTGAT ATCTTTGAAC CGTCGAGCTT TGTTATATAC CTAGACAACA 5573 | ||||| || |||||||||| |||||| ||| | |||||||| || | ||||| ||||||||| GATCAATCGG TAGATTTGAT ATCTTTAAAC CATCGAGCTT TGGTGTATAC TTAGACAACA 559 TATGTCACAC AATCAACCCT TGAACTGTTC TTAGTTCTCA TTGTTTTGTT CGTTTCAGCC 5633 || ||||||| |||||||||| |||| ||||| |||||||||| || ||||||| | ||||| || TAAGTCACAC AATCAACCCT TGAATTGTTC TTAGTTCTCA TTATTTTGTT CATTTCATCC 619 ACGAAAACAT CTTGGATAGT AAGTGCTTAA AGAGCTGGCC TTACCGGATT CTCCTTGAAG 5693 | ||| |||| || |||| ||||||||| ||| |||||| |||||||||| |||||| ||| AAGAACACAT CTCAATTAGT AAGTGCTTAG AGAACTGGCC TTACCGGATT CTCCTTCAAG 679 CGGCTTACAC TTCACACTTA CATAGGTGAT TTCTAAATGT GTTATCCCAT AGATATACCA 5753 | | ||||| |||||||| | |||||||| | ||||| |||| |||||||| | ||||| || | TGACGTACAC TTCACACTCA CATAGGTGGT TTCTATATGT GTTATCCCGT AGATACACTA 739 TTTGATATTC CATGTATCAA ACTTAGAAAC CATTAAAAAG TCCTTACGTC TTT 5806 |||||||| | ||||||||| |||||||| | |||||||||| |||||| ||| ||| TTTGATATAC TATGTATCAA ACTTAGAATC CATTAAAAAG TCCTTATGTC TTT 792 hqPGS_C06HBa0153O03.1-5+_SGN-E351952- (5098 5806) ******************************************************************************** EST sequence 28 -strand 552 n (File: SGN-E357316-) 1 AAAAATGACT TTTCATTTGT ACTTTGCAAA TGACTTTTCT TTTTCCCTCC AAAATAGGTT 61 CCCTCAATTT TCCATTTTTC TCTTCTCTTT TTCCATTCAC ATTTTGCTAA ACCCAAAAAT 121 CCCCCACATA AATGGGAAAT GTCTATTGTT AAAACATATG CATGAAAAAC TGAAGTGTCT 181 TGTGTGTAAG CGTTAGTCAC ATCTGGATAA GTAGGTTTCT CTTTAAACTT TCCACAGTGA 241 TCATATATCG GATATACTCG ATCAATCGGT AGATTTGATA TCTTTAAACC ATCGAGCTTT 301 GGTGTATACT TAGACAACAT AAGTCACACA ATCAACCCTT GAATTGTTCT TAGTTCTCAT 361 TATTTTGTTC ATTTCATCCA AGAACACATC TCAATTAGTA AGTGCTTAGA GAACTGGCCT 421 TACCGGATTC TCCTTCAAGT GACGTACACT TCACACTCAC ATAGGTGGTT TCTATATGTG 481 TTATCCCGTA GATACACTAT TTGATATACT ATGTATCAAA CTTAGAATCC ATTAAAAAGT 541 CCTTATGTCT TT Predicted gene structure (within gDNA segment 2476 to 6469): Exon 1 5261 5806 ( 546 n); cDNA 3 552 ( 550 n); score: 0.852 MATCH C06HBa0153O03.1-5+ SGN-E357316- 0.852 546 0.989 C PGS_C06HBa0153O03.1-5+_SGN-E357316- (5261 5806) Alignment (genomic DNA sequence = upper lines): AAATGACTTT TCATTTGCAC TTTGTAAATG ACTTTTCATT TTCCCTCCAA AATA-GTTCC 5319 |||||||||| ||||||| || |||| ||||| ||||||| || |||||||||| |||| ||||| AAATGACTTT TCATTTGTAC TTTGCAAATG ACTTTTCTTT TTCCCTCCAA AATAGGTTCC 62 CTTACTTTTC ATATTCTCTC TTTTCTTTTC TCATTCACAC AT-GTTAAAT CTAACAATCC 5378 || | ||||| || |||| || |||||| |||||||| | | |||| | || ||||| CTCAATTTTC CATTTTTCTC TTCTCTTTTT CCATTCACAT TTTGCTAAAC CCAAAAATCC 122 CCCACGTGAA TAGGGAA-GG CTATTGTTAA AACATATGCA TGAAAAACTT GTGTGTCTTC 5437 ||||| | || | || || | |||||||||| |||||||||| ||||||||| ||||||| CCCACATAAA TGGGAAATGT CTATTGTTAA AACATATGCA TGAAAAACTG AAGTGTCTTG 182 TG-GTAAAGG CTAATCGCAT CTGGATAAGT AGATTTCCCT TTAAACTTTC CGTAGTGAAC 5496 || |||| | || || ||| |||||||||| || |||| || |||||||||| | ||||| | TGTGTAAGCG TTAGTCACAT CTGGATAAGT AGGTTTCTCT TTAAACTTTC CACAGTGATC 242 ATATATCGGA TATACTCGGT CAATTGGTAG ATTTGATATC TTTGAACCGT CGAGCTTTGT 5556 |||||||||| |||||||| | |||| ||||| |||||||||| ||| |||| | ||||||||| ATATATCGGA TATACTCGAT CAATCGGTAG ATTTGATATC TTTAAACCAT CGAGCTTTGG 302 TATATACCTA GACAACATAT GTCACACAAT CAACCCTTGA ACTGTTCTTA GTTCTCATTG 5616 | ||||| || ||||||||| |||||||||| |||||||||| | |||||||| ||||||||| TGTATACTTA GACAACATAA GTCACACAAT CAACCCTTGA ATTGTTCTTA GTTCTCATTA 362 TTTTGTTCGT TTCAGCCACG AAAACATCTT GGATAGTAAG TGCTTAAAGA GCTGGCCTTA 5676 |||||||| | |||| ||| | || |||||| ||||||| |||||| ||| ||||||||| TTTTGTTCAT TTCATCCAAG AACACATCTC AATTAGTAAG TGCTTAGAGA ACTGGCCTTA 422 CCGGATTCTC CTTGAAGCGG CTTACACTTC ACACTTACAT AGGTGATTTC TAAATGTGTT 5736 |||||||||| ||| ||| | | |||||||| ||||| |||| ||||| |||| || ||||||| CCGGATTCTC CTTCAAGTGA CGTACACTTC ACACTCACAT AGGTGGTTTC TATATGTGTT 482 ATCCCATAGA TATACCATTT GATATTCCAT GTATCAAACT TAGAAACCAT TAAAAAGTCC 5796 ||||| |||| || || |||| ||||| | || |||||||||| ||||| |||| |||||||||| ATCCCGTAGA TACACTATTT GATATACTAT GTATCAAACT TAGAATCCAT TAAAAAGTCC 542 TTACGTCTTT 5806 ||| |||||| TTATGTCTTT 552 hqPGS_C06HBa0153O03.1-5+_SGN-E357316- (5261 5806) ******************************************************************************** EST sequence 7 -strand 583 n (File: SGN-E251023-) 1 GGTACCATCA GCATGAGATA GACTCCATTT CAATAAACCA TCCCAATTAG GTCCCTCTTT 61 CGCCATTTTT GATTTCTTAC TTAGATTTTT CGGTGTTTTG TTTAGGGAAG AAGAAGAATT 121 TATTGAAGAG ATTTTGTAGA AACTAGAAGG TACTGGAACT TTGGAGAAGA AGGGATTTAT 181 TATCTTGGAG AAGGGTTTAT TATTTGTTTA TATTAAATTA AAATCGAGAT GGGTATATAA 241 TTACTTCAAA TTTAATATAG TGGTGGGCAT TCGGTATTTC GGTTCGGTTC GGGTTTTTTT 301 CGGTTTTTTC GGTTTTCGGT TTTTGTAAAT TGCGTATCGA ATACCGAATC GAAATATTTC 361 GGTTTGATTT TTATTAATTC GATTCGATTT TTTATTTCGG TTTTTTAATG GACCAAACTG 421 CTGCACGGTC GGCGGCTGGC TGCTGCTGAG TGCTGTCGCG GCTGGTTGCT GCTGTGCTGC 481 TGTCTGTGGC TGGCTGCTGC TGTGCTGCTG TCGCGGCTAC TGCTGCTTCC CTTGTTGTGG 541 ACGGCGGCTG GCGCCTGGAG CCTGGCGGCG CTGCTGAAGA CAA Predicted gene structure (within gDNA segment 3425 to 7846): Exon 1 4339 4373 ( 35 n); cDNA 192 224 ( 33 n); score: 0.686 Intron 1 4374 4844 ( 471 n); Pd: 0.000 (s: 0), Pa: 0.259 (s: 0) Exon 2 4845 4854 ( 10 n); cDNA 225 234 ( 10 n); score: 0.600 Intron 2 4855 5526 ( 672 n); Pd: 0.593 (s: 0), Pa: 0.635 (s: 0) Exon 3 5527 5531 ( 5 n); cDNA 235 239 ( 5 n); score: 0.600 Intron 3 5532 6566 (1035 n); Pd: 0.900 (s: 0), Pa: 0.850 (s: 0.80) Exon 4 6567 6730 ( 164 n); cDNA 240 402 ( 163 n); score: 0.726 MATCH C06HBa0153O03.1-5+ SGN-E251023- 0.726 214 0.367 C PGS_C06HBa0153O03.1-5+_SGN-E251023- (4339 4373,4845 4854,5527 5531,6567 6730) Alignment (genomic DNA sequence = upper lines): AGGGTTTGAT ATATTGTGTT TTAATGAGTT TAAGGGTTCA GTTAATAAAG AATTAAACAG 4398 ||||||| | || |||| || || | | || || AGGGTTTATT AT-TTGT-TT ATATTAAATT AAAAT..... .......... .......... 224 AAGATTTCAT AATATAAACT CATACAAATG TAGGAGTTCA TTTAATTATT TCGCCAAATT 4458 .......... .......... .......... .......... .......... .......... 224 TAAATGGTAT CTTCTAATTG TTATTCAACC TGTATTTATA AGGTTAACCA ATAAACAGAA 4518 .......... .......... .......... .......... .......... .......... 224 ATAGAAATAA GATGAAACAG AAAATACATA AAATACAATA AGTAATCCGA GTCTACAAAA 4578 .......... .......... .......... .......... .......... .......... 224 ACTACTATGT GTCCTTAAGA AATTTAATCC CCTCACTGTA CACAAGGTTA TGGATTAATT 4638 .......... .......... .......... .......... .......... .......... 224 TCTCCCAAGA TAAAATGGAT TAAACCTGTT AAAGAAATAG CAGCACCTCA GATTTCTTTA 4698 .......... .......... .......... .......... .......... .......... 224 ACTAAAGCGA AATTCAGAAC AACAACAAGT CACATAGACT CAGTCGATCG ACACTTTGAT 4758 .......... .......... .......... .......... .......... .......... 224 TTATTTGAGA GAAAAATATA TGCAGAGAAG GAAAATTTTA GTGTTTGAAA AATCAAAAAT 4818 .......... .......... .......... .......... .......... .......... 224 TGACTTCCTT TTATAGCCAT TTTCAGCAAG AAACGTGTAT GTTCAAAGAA ATCTGTTCAG 4878 | || | || .......... .......... ......CGAG ATGGGT.... .......... .......... 234 ACCCGTTTTA TCCAGAAAGT TGTGTCTTTT GGAAAAAATA ACAAGTTTTT GGAAAAATGT 4938 .......... .......... .......... .......... .......... .......... 234 GTCCGTTAGG AAAATAACGG CTTTTTGGAA AGTAAGGACT TTTCGGAAAG AGTAATAACT 4998 .......... .......... .......... .......... .......... .......... 234 TTTCGGAATG TTACCATTAA GACATAATAT TAACAAGATT TATTTGATTT AACAAAAACT 5058 .......... .......... .......... .......... .......... .......... 234 GATTAAATAA ATTTTGTCCA AAAAATTTAT CAATCAATCA CATCATTTGC CAAATCCAAA 5118 .......... .......... .......... .......... .......... .......... 234 TCCAAATCCA AATCCTAAGC CGAAGCCGAG CGAACGACGA CGACGGCGCG AGGGGGCATC 5178 .......... .......... .......... .......... .......... .......... 234 TTCTTCTTAG CTCTTTAAGA ATTAATGGAA GTGTTTCCTT ATATAAGGAC AACAATTTCC 5238 .......... .......... .......... .......... .......... .......... 234 CTTTCTTTTG ATGACATAGG AGAAATGACT TTTCATTTGC ACTTTGTAAA TGACTTTTCA 5298 .......... .......... .......... .......... .......... .......... 234 TTTTCCCTCC AAAATAGTTC CCTTACTTTT CATATTCTCT CTTTTCTTTT CTCATTCACA 5358 .......... .......... .......... .......... .......... .......... 234 CATGTTAAAT CTAACAATCC CCCACGTGAA TAGGGAAGGC TATTGTTAAA ACATATGCAT 5418 .......... .......... .......... .......... .......... .......... 234 GAAAAACTTG TGTGTCTTCT GGTAAAGGCT AATCGCATCT GGATAAGTAG ATTTCCCTTT 5478 .......... .......... .......... .......... .......... .......... 234 AAACTTTCCG TAGTGAACAT ATATCGGATA TACTCGGTCA ATTGGTAGAT TTGATATCTT 5538 || | .......... .......... .......... .......... ........AT ATA....... 239 TGAACCGTCG AGCTTTGTTA TATACCTAGA CAACATATGT CACACAATCA ACCCTTGAAC 5598 .......... .......... .......... .......... .......... .......... 239 TGTTCTTAGT TCTCATTGTT TTGTTCGTTT CAGCCACGAA AACATCTTGG ATAGTAAGTG 5658 .......... .......... .......... .......... .......... .......... 239 CTTAAAGAGC TGGCCTTACC GGATTCTCCT TGAAGCGGCT TACACTTCAC ACTTACATAG 5718 .......... .......... .......... .......... .......... .......... 239 GTGATTTCTA AATGTGTTAT CCCATAGATA TACCATTTGA TATTCCATGT ATCAAACTTA 5778 .......... .......... .......... .......... .......... .......... 239 GAAACCATTA AAAAGTCCTT ACGTCTTTAT CCTTATTACT AAATATTGTC TCATCATGAA 5838 .......... .......... .......... .......... .......... .......... 239 AATGGACCAT AAAATAATAA GAATAATTTA TTTTTTTCTT GACAATGTTG AACCGTCATC 5898 .......... .......... .......... .......... .......... .......... 239 AATGACTTTG TTTTATCTCC TTGAACCTAG ATCATGGGAT CTCCTGTATT CTAGGTAGAG 5958 .......... .......... .......... .......... .......... .......... 239 GTACCGCCAC GATGACTTTT TCTCATCCAT AGTCCCATTC TCATCGATGA TTTCTCAACT 6018 .......... .......... .......... .......... .......... .......... 239 CCCTCACTAT TTAGGCCTTT TGAAAGTGGA TCTGACACAT TATCTTTTGA CTTCACATAG 6078 .......... .......... .......... .......... .......... .......... 239 TCAATTGTGA TAATTCAACT AGCGAGTAGT TATCTCACAT AGTCATGTCT TCGTCTTATG 6138 .......... .......... .......... .......... .......... .......... 239 TGACGAGACT TGTCGTTATA CATAATGCTT CCAGCCCTTC CTATTGCAGC TTGACTATCA 6198 .......... .......... .......... .......... .......... .......... 239 CAGTGTATGC ATATAGGGGC CATTGATATG GGCCAAAATG GAATATCTTC TAAGAAATTG 6258 .......... .......... .......... .......... .......... .......... 239 TGAAACCATT CAGCTTCTTC ACCTTCCTTG TCTAAAGCAA TGAATTCAGA CTTCATTATA 6318 .......... .......... .......... .......... .......... .......... 239 GAGCAAGCTA TACATGTTTG TTTGGATGAT TTCAAAGATA TTGCTCCTCC ACCAATAGTA 6378 .......... .......... .......... .......... .......... .......... 239 AAAACGTATC CACTTGTGGA TTTTTTTTCA GTTGACCCAG TAATCAAATT TGCATCACTA 6438 .......... .......... .......... .......... .......... .......... 239 TATCCTTCAA GAACGGCTAG ATATGTGTTG TAATGTAAAG CATACTTTTG AGTATCATCT 6498 .......... .......... .......... .......... .......... .......... 239 AAGTATCTCA AAACTCTCTT CATTGCCAAC CAATGACCTT GATTAGGATT ACTTGTGTAT 6558 .......... .......... .......... .......... .......... .......... 239 CGACTCAGTT TACTGATAGC GCAAGCTATA GTGGTGGGCA TTCGGTATTT CGGTTCGGTT 6618 | |||| | || |||| |||||||||| |||||||||| |||||||||| ........AT TACTTCAAAT TTAA--TATA GTGGTGGGCA TTCGGTATTT CGGTTCGGTT 289 C-GG--TTTT T----TT-TT CGGTTTTCGG TTTTTGTAAA TTGCGTACCG AATACCGAAC 6670 | || |||| | || || |||||||||| |||||||||| ||||||| || ||||||||| CGGGTTTTTT TCGGTTTTTT CGGTTTTCGG TTTTTGTAAA TTGCGTATCG AATACCGAAT 349 CGAAATATTT TGGTTCGGTT CGGTTTTTGT TAATTCGGTT CGGTTTTTAT TAATTCGGTT 6730 ||||||| | | | ||||| | ||||| | ||||||| || | | |||| | || ||||||| CGAAATA--T T--T-CGGTT TGATTTTTAT TAATTCGATT C-GATTTT-T TATTTCGGTT 402 hqPGS_C06HBa0153O03.1-5+_SGN-E251023- (6567 6730) ******************************************************************************** EST sequence 29 -strand 624 n (File: SGN-E391663-) 1 ACCATCCCAA TTAGGTCCCT CTTTCGCCAT TTTTGATTTT TTACTTAGAT TTTTTGGGTG 61 TTTTGTTTAG GGAAGAAGAA GAATTTATTG AAGAGATTTT GTAGAAACTA GAAGGTACTG 121 GAACTTTGGA GAAGAAGGGA TTTATTATCT TGGAGAAGGG TTTATTATTT GTTTATATTA 181 AATTAAAATC GAGATGGGTA TATACTTACT TCAAATTTAA TATAGTGGTG GGCATTCGGT 241 ATTTCGGTTC GGTTCGGGTT TTTTTCGGTT TTTTCGGTTT TCGGTTTTTG TAAATTGCGT 301 ATCGAATACC GAATCGAAAT ATTTCGGTTT GATTTTTATT AATTCGATTC GATTTTTTAT 361 TTCGGTTTTT TAATGGACCA AACTGCTGCA CGGTCGGCGG CTGGCTGCTG CTGAGTGCTG 421 TCGCGGCTGG TTGCTGCTGT GCTGCTGTCT GTGGCTGGCT GCTGCTGTGC TGCTGTCGCG 481 GCTACTGCTG CTTCCCTTGT TGTGGACGGC GGCTGGCGCC TGGCGCCTGG CGGCGCTGCT 541 GAAGACAAGA GTGGTTGGAG CTGAAGAGTA GTTGGAATAG AAGAGGAAGA GGAGAAGATA 601 AGAGGAAGAA GAGCTGAAGA GGAG Predicted gene structure (within gDNA segment 3175 to 7846): Exon 1 4339 4373 ( 35 n); cDNA 157 189 ( 33 n); score: 0.686 Intron 1 4374 4844 ( 471 n); Pd: 0.000 (s: 0), Pa: 0.259 (s: 0) Exon 2 4845 4854 ( 10 n); cDNA 190 199 ( 10 n); score: 0.600 Intron 2 4855 5526 ( 672 n); Pd: 0.593 (s: 0), Pa: 0.635 (s: 0) Exon 3 5527 5531 ( 5 n); cDNA 200 204 ( 5 n); score: 0.600 Intron 3 5532 6566 (1035 n); Pd: 0.900 (s: 0), Pa: 0.850 (s: 0.80) Exon 4 6567 6730 ( 164 n); cDNA 205 367 ( 163 n); score: 0.726 MATCH C06HBa0153O03.1-5+ SGN-E391663- 0.726 214 0.343 C PGS_C06HBa0153O03.1-5+_SGN-E391663- (4339 4373,4845 4854,5527 5531,6567 6730) Alignment (genomic DNA sequence = upper lines): AGGGTTTGAT ATATTGTGTT TTAATGAGTT TAAGGGTTCA GTTAATAAAG AATTAAACAG 4398 ||||||| | || |||| || || | | || || AGGGTTTATT AT-TTGT-TT ATATTAAATT AAAAT..... .......... .......... 189 AAGATTTCAT AATATAAACT CATACAAATG TAGGAGTTCA TTTAATTATT TCGCCAAATT 4458 .......... .......... .......... .......... .......... .......... 189 TAAATGGTAT CTTCTAATTG TTATTCAACC TGTATTTATA AGGTTAACCA ATAAACAGAA 4518 .......... .......... .......... .......... .......... .......... 189 ATAGAAATAA GATGAAACAG AAAATACATA AAATACAATA AGTAATCCGA GTCTACAAAA 4578 .......... .......... .......... .......... .......... .......... 189 ACTACTATGT GTCCTTAAGA AATTTAATCC CCTCACTGTA CACAAGGTTA TGGATTAATT 4638 .......... .......... .......... .......... .......... .......... 189 TCTCCCAAGA TAAAATGGAT TAAACCTGTT AAAGAAATAG CAGCACCTCA GATTTCTTTA 4698 .......... .......... .......... .......... .......... .......... 189 ACTAAAGCGA AATTCAGAAC AACAACAAGT CACATAGACT CAGTCGATCG ACACTTTGAT 4758 .......... .......... .......... .......... .......... .......... 189 TTATTTGAGA GAAAAATATA TGCAGAGAAG GAAAATTTTA GTGTTTGAAA AATCAAAAAT 4818 .......... .......... .......... .......... .......... .......... 189 TGACTTCCTT TTATAGCCAT TTTCAGCAAG AAACGTGTAT GTTCAAAGAA ATCTGTTCAG 4878 | || | || .......... .......... ......CGAG ATGGGT.... .......... .......... 199 ACCCGTTTTA TCCAGAAAGT TGTGTCTTTT GGAAAAAATA ACAAGTTTTT GGAAAAATGT 4938 .......... .......... .......... .......... .......... .......... 199 GTCCGTTAGG AAAATAACGG CTTTTTGGAA AGTAAGGACT TTTCGGAAAG AGTAATAACT 4998 .......... .......... .......... .......... .......... .......... 199 TTTCGGAATG TTACCATTAA GACATAATAT TAACAAGATT TATTTGATTT AACAAAAACT 5058 .......... .......... .......... .......... .......... .......... 199 GATTAAATAA ATTTTGTCCA AAAAATTTAT CAATCAATCA CATCATTTGC CAAATCCAAA 5118 .......... .......... .......... .......... .......... .......... 199 TCCAAATCCA AATCCTAAGC CGAAGCCGAG CGAACGACGA CGACGGCGCG AGGGGGCATC 5178 .......... .......... .......... .......... .......... .......... 199 TTCTTCTTAG CTCTTTAAGA ATTAATGGAA GTGTTTCCTT ATATAAGGAC AACAATTTCC 5238 .......... .......... .......... .......... .......... .......... 199 CTTTCTTTTG ATGACATAGG AGAAATGACT TTTCATTTGC ACTTTGTAAA TGACTTTTCA 5298 .......... .......... .......... .......... .......... .......... 199 TTTTCCCTCC AAAATAGTTC CCTTACTTTT CATATTCTCT CTTTTCTTTT CTCATTCACA 5358 .......... .......... .......... .......... .......... .......... 199 CATGTTAAAT CTAACAATCC CCCACGTGAA TAGGGAAGGC TATTGTTAAA ACATATGCAT 5418 .......... .......... .......... .......... .......... .......... 199 GAAAAACTTG TGTGTCTTCT GGTAAAGGCT AATCGCATCT GGATAAGTAG ATTTCCCTTT 5478 .......... .......... .......... .......... .......... .......... 199 AAACTTTCCG TAGTGAACAT ATATCGGATA TACTCGGTCA ATTGGTAGAT TTGATATCTT 5538 || | .......... .......... .......... .......... ........AT ATA....... 204 TGAACCGTCG AGCTTTGTTA TATACCTAGA CAACATATGT CACACAATCA ACCCTTGAAC 5598 .......... .......... .......... .......... .......... .......... 204 TGTTCTTAGT TCTCATTGTT TTGTTCGTTT CAGCCACGAA AACATCTTGG ATAGTAAGTG 5658 .......... .......... .......... .......... .......... .......... 204 CTTAAAGAGC TGGCCTTACC GGATTCTCCT TGAAGCGGCT TACACTTCAC ACTTACATAG 5718 .......... .......... .......... .......... .......... .......... 204 GTGATTTCTA AATGTGTTAT CCCATAGATA TACCATTTGA TATTCCATGT ATCAAACTTA 5778 .......... .......... .......... .......... .......... .......... 204 GAAACCATTA AAAAGTCCTT ACGTCTTTAT CCTTATTACT AAATATTGTC TCATCATGAA 5838 .......... .......... .......... .......... .......... .......... 204 AATGGACCAT AAAATAATAA GAATAATTTA TTTTTTTCTT GACAATGTTG AACCGTCATC 5898 .......... .......... .......... .......... .......... .......... 204 AATGACTTTG TTTTATCTCC TTGAACCTAG ATCATGGGAT CTCCTGTATT CTAGGTAGAG 5958 .......... .......... .......... .......... .......... .......... 204 GTACCGCCAC GATGACTTTT TCTCATCCAT AGTCCCATTC TCATCGATGA TTTCTCAACT 6018 .......... .......... .......... .......... .......... .......... 204 CCCTCACTAT TTAGGCCTTT TGAAAGTGGA TCTGACACAT TATCTTTTGA CTTCACATAG 6078 .......... .......... .......... .......... .......... .......... 204 TCAATTGTGA TAATTCAACT AGCGAGTAGT TATCTCACAT AGTCATGTCT TCGTCTTATG 6138 .......... .......... .......... .......... .......... .......... 204 TGACGAGACT TGTCGTTATA CATAATGCTT CCAGCCCTTC CTATTGCAGC TTGACTATCA 6198 .......... .......... .......... .......... .......... .......... 204 CAGTGTATGC ATATAGGGGC CATTGATATG GGCCAAAATG GAATATCTTC TAAGAAATTG 6258 .......... .......... .......... .......... .......... .......... 204 TGAAACCATT CAGCTTCTTC ACCTTCCTTG TCTAAAGCAA TGAATTCAGA CTTCATTATA 6318 .......... .......... .......... .......... .......... .......... 204 GAGCAAGCTA TACATGTTTG TTTGGATGAT TTCAAAGATA TTGCTCCTCC ACCAATAGTA 6378 .......... .......... .......... .......... .......... .......... 204 AAAACGTATC CACTTGTGGA TTTTTTTTCA GTTGACCCAG TAATCAAATT TGCATCACTA 6438 .......... .......... .......... .......... .......... .......... 204 TATCCTTCAA GAACGGCTAG ATATGTGTTG TAATGTAAAG CATACTTTTG AGTATCATCT 6498 .......... .......... .......... .......... .......... .......... 204 AAGTATCTCA AAACTCTCTT CATTGCCAAC CAATGACCTT GATTAGGATT ACTTGTGTAT 6558 .......... .......... .......... .......... .......... .......... 204 CGACTCAGTT TACTGATAGC GCAAGCTATA GTGGTGGGCA TTCGGTATTT CGGTTCGGTT 6618 | |||| | || |||| |||||||||| |||||||||| |||||||||| ........CT TACTTCAAAT TTAA--TATA GTGGTGGGCA TTCGGTATTT CGGTTCGGTT 254 C-GG--TTTT T----TT-TT CGGTTTTCGG TTTTTGTAAA TTGCGTACCG AATACCGAAC 6670 | || |||| | || || |||||||||| |||||||||| ||||||| || ||||||||| CGGGTTTTTT TCGGTTTTTT CGGTTTTCGG TTTTTGTAAA TTGCGTATCG AATACCGAAT 314 CGAAATATTT TGGTTCGGTT CGGTTTTTGT TAATTCGGTT CGGTTTTTAT TAATTCGGTT 6730 ||||||| | | | ||||| | ||||| | ||||||| || | | |||| | || ||||||| CGAAATA--T T--T-CGGTT TGATTTTTAT TAATTCGATT C-GATTTT-T TATTTCGGTT 367 hqPGS_C06HBa0153O03.1-5+_SGN-E391663- (6567 6730) ******************************************************************************** EST sequence 3 -strand 525 n (File: SGN-E331585-) 1 TTCGCCATTT TTGATTTCTT ACTTAGATTT TTCGGTGTTT TGTTTAGGGA AGAAGAAGAA 61 TTTATTGAAG AGATTTTGTA GAAACTAGAA GGTACTGGAA CTTTGGAGAA GAAGGGATTT 121 ATTATCTTGG AGAAGGGTTT ATTATTTGTT TATATTAAAT TAAAATCGAG ATGGGTATAT 181 AATTACTTCA AATTTAATAT AGTGGTGGGC ATTCGGTATT TCGGTTCGGT TCGGGTTTTT 241 TTCGGTTTTT TCGGTTTTCG GTTTTTGTAA ATTGCGTATC GAATACCGAA TCGAAATATT 301 TCGGTTTGAT TTTTATTAAT TCGATTCGAT TTTTTATTTC GGTTTTTTAA TGGACCAAAC 361 TGCTGCACGG TCGGCGGCTG GCTGCTGCTG AGTGCTGTCG CGGCTGGTTG CTGCTGTGCT 421 GCTGTCTGTG GCTGGCTGCT GCTGTGCTGC TGTCGCGGCT ACTGCTGCTT CCCTTGTTGT 481 GGACGGCGGC TGGCGCCTGG CGCCTGGCGG CGCTGCTGAA GACAA Predicted gene structure (within gDNA segment 3405 to 7846): Exon 1 4339 4373 ( 35 n); cDNA 134 166 ( 33 n); score: 0.686 Intron 1 4374 4844 ( 471 n); Pd: 0.000 (s: 0), Pa: 0.259 (s: 0) Exon 2 4845 4854 ( 10 n); cDNA 167 176 ( 10 n); score: 0.600 Intron 2 4855 5526 ( 672 n); Pd: 0.593 (s: 0), Pa: 0.635 (s: 0) Exon 3 5527 5531 ( 5 n); cDNA 177 181 ( 5 n); score: 0.600 Intron 3 5532 6566 (1035 n); Pd: 0.900 (s: 0), Pa: 0.850 (s: 0.80) Exon 4 6567 6730 ( 164 n); cDNA 182 344 ( 163 n); score: 0.726 MATCH C06HBa0153O03.1-5+ SGN-E331585- 0.726 214 0.408 C PGS_C06HBa0153O03.1-5+_SGN-E331585- (4339 4373,4845 4854,5527 5531,6567 6730) Alignment (genomic DNA sequence = upper lines): AGGGTTTGAT ATATTGTGTT TTAATGAGTT TAAGGGTTCA GTTAATAAAG AATTAAACAG 4398 ||||||| | || |||| || || | | || || AGGGTTTATT AT-TTGT-TT ATATTAAATT AAAAT..... .......... .......... 166 AAGATTTCAT AATATAAACT CATACAAATG TAGGAGTTCA TTTAATTATT TCGCCAAATT 4458 .......... .......... .......... .......... .......... .......... 166 TAAATGGTAT CTTCTAATTG TTATTCAACC TGTATTTATA AGGTTAACCA ATAAACAGAA 4518 .......... .......... .......... .......... .......... .......... 166 ATAGAAATAA GATGAAACAG AAAATACATA AAATACAATA AGTAATCCGA GTCTACAAAA 4578 .......... .......... .......... .......... .......... .......... 166 ACTACTATGT GTCCTTAAGA AATTTAATCC CCTCACTGTA CACAAGGTTA TGGATTAATT 4638 .......... .......... .......... .......... .......... .......... 166 TCTCCCAAGA TAAAATGGAT TAAACCTGTT AAAGAAATAG CAGCACCTCA GATTTCTTTA 4698 .......... .......... .......... .......... .......... .......... 166 ACTAAAGCGA AATTCAGAAC AACAACAAGT CACATAGACT CAGTCGATCG ACACTTTGAT 4758 .......... .......... .......... .......... .......... .......... 166 TTATTTGAGA GAAAAATATA TGCAGAGAAG GAAAATTTTA GTGTTTGAAA AATCAAAAAT 4818 .......... .......... .......... .......... .......... .......... 166 TGACTTCCTT TTATAGCCAT TTTCAGCAAG AAACGTGTAT GTTCAAAGAA ATCTGTTCAG 4878 | || | || .......... .......... ......CGAG ATGGGT.... .......... .......... 176 ACCCGTTTTA TCCAGAAAGT TGTGTCTTTT GGAAAAAATA ACAAGTTTTT GGAAAAATGT 4938 .......... .......... .......... .......... .......... .......... 176 GTCCGTTAGG AAAATAACGG CTTTTTGGAA AGTAAGGACT TTTCGGAAAG AGTAATAACT 4998 .......... .......... .......... .......... .......... .......... 176 TTTCGGAATG TTACCATTAA GACATAATAT TAACAAGATT TATTTGATTT AACAAAAACT 5058 .......... .......... .......... .......... .......... .......... 176 GATTAAATAA ATTTTGTCCA AAAAATTTAT CAATCAATCA CATCATTTGC CAAATCCAAA 5118 .......... .......... .......... .......... .......... .......... 176 TCCAAATCCA AATCCTAAGC CGAAGCCGAG CGAACGACGA CGACGGCGCG AGGGGGCATC 5178 .......... .......... .......... .......... .......... .......... 176 TTCTTCTTAG CTCTTTAAGA ATTAATGGAA GTGTTTCCTT ATATAAGGAC AACAATTTCC 5238 .......... .......... .......... .......... .......... .......... 176 CTTTCTTTTG ATGACATAGG AGAAATGACT TTTCATTTGC ACTTTGTAAA TGACTTTTCA 5298 .......... .......... .......... .......... .......... .......... 176 TTTTCCCTCC AAAATAGTTC CCTTACTTTT CATATTCTCT CTTTTCTTTT CTCATTCACA 5358 .......... .......... .......... .......... .......... .......... 176 CATGTTAAAT CTAACAATCC CCCACGTGAA TAGGGAAGGC TATTGTTAAA ACATATGCAT 5418 .......... .......... .......... .......... .......... .......... 176 GAAAAACTTG TGTGTCTTCT GGTAAAGGCT AATCGCATCT GGATAAGTAG ATTTCCCTTT 5478 .......... .......... .......... .......... .......... .......... 176 AAACTTTCCG TAGTGAACAT ATATCGGATA TACTCGGTCA ATTGGTAGAT TTGATATCTT 5538 || | .......... .......... .......... .......... ........AT ATA....... 181 TGAACCGTCG AGCTTTGTTA TATACCTAGA CAACATATGT CACACAATCA ACCCTTGAAC 5598 .......... .......... .......... .......... .......... .......... 181 TGTTCTTAGT TCTCATTGTT TTGTTCGTTT CAGCCACGAA AACATCTTGG ATAGTAAGTG 5658 .......... .......... .......... .......... .......... .......... 181 CTTAAAGAGC TGGCCTTACC GGATTCTCCT TGAAGCGGCT TACACTTCAC ACTTACATAG 5718 .......... .......... .......... .......... .......... .......... 181 GTGATTTCTA AATGTGTTAT CCCATAGATA TACCATTTGA TATTCCATGT ATCAAACTTA 5778 .......... .......... .......... .......... .......... .......... 181 GAAACCATTA AAAAGTCCTT ACGTCTTTAT CCTTATTACT AAATATTGTC TCATCATGAA 5838 .......... .......... .......... .......... .......... .......... 181 AATGGACCAT AAAATAATAA GAATAATTTA TTTTTTTCTT GACAATGTTG AACCGTCATC 5898 .......... .......... .......... .......... .......... .......... 181 AATGACTTTG TTTTATCTCC TTGAACCTAG ATCATGGGAT CTCCTGTATT CTAGGTAGAG 5958 .......... .......... .......... .......... .......... .......... 181 GTACCGCCAC GATGACTTTT TCTCATCCAT AGTCCCATTC TCATCGATGA TTTCTCAACT 6018 .......... .......... .......... .......... .......... .......... 181 CCCTCACTAT TTAGGCCTTT TGAAAGTGGA TCTGACACAT TATCTTTTGA CTTCACATAG 6078 .......... .......... .......... .......... .......... .......... 181 TCAATTGTGA TAATTCAACT AGCGAGTAGT TATCTCACAT AGTCATGTCT TCGTCTTATG 6138 .......... .......... .......... .......... .......... .......... 181 TGACGAGACT TGTCGTTATA CATAATGCTT CCAGCCCTTC CTATTGCAGC TTGACTATCA 6198 .......... .......... .......... .......... .......... .......... 181 CAGTGTATGC ATATAGGGGC CATTGATATG GGCCAAAATG GAATATCTTC TAAGAAATTG 6258 .......... .......... .......... .......... .......... .......... 181 TGAAACCATT CAGCTTCTTC ACCTTCCTTG TCTAAAGCAA TGAATTCAGA CTTCATTATA 6318 .......... .......... .......... .......... .......... .......... 181 GAGCAAGCTA TACATGTTTG TTTGGATGAT TTCAAAGATA TTGCTCCTCC ACCAATAGTA 6378 .......... .......... .......... .......... .......... .......... 181 AAAACGTATC CACTTGTGGA TTTTTTTTCA GTTGACCCAG TAATCAAATT TGCATCACTA 6438 .......... .......... .......... .......... .......... .......... 181 TATCCTTCAA GAACGGCTAG ATATGTGTTG TAATGTAAAG CATACTTTTG AGTATCATCT 6498 .......... .......... .......... .......... .......... .......... 181 AAGTATCTCA AAACTCTCTT CATTGCCAAC CAATGACCTT GATTAGGATT ACTTGTGTAT 6558 .......... .......... .......... .......... .......... .......... 181 CGACTCAGTT TACTGATAGC GCAAGCTATA GTGGTGGGCA TTCGGTATTT CGGTTCGGTT 6618 | |||| | || |||| |||||||||| |||||||||| |||||||||| ........AT TACTTCAAAT TTAA--TATA GTGGTGGGCA TTCGGTATTT CGGTTCGGTT 231 C-GG--TTTT T----TT-TT CGGTTTTCGG TTTTTGTAAA TTGCGTACCG AATACCGAAC 6670 | || |||| | || || |||||||||| |||||||||| ||||||| || ||||||||| CGGGTTTTTT TCGGTTTTTT CGGTTTTCGG TTTTTGTAAA TTGCGTATCG AATACCGAAT 291 CGAAATATTT TGGTTCGGTT CGGTTTTTGT TAATTCGGTT CGGTTTTTAT TAATTCGGTT 6730 ||||||| | | | ||||| | ||||| | ||||||| || | | |||| | || ||||||| CGAAATA--T T--T-CGGTT TGATTTTTAT TAATTCGATT C-GATTTT-T TATTTCGGTT 344 hqPGS_C06HBa0153O03.1-5+_SGN-E331585- (6567 6730) ******************************************************************************** EST sequence 20 -strand 487 n (File: SGN-E255789-) 1 GAAGAGATTT TGTAGAAACT AGAAGGTACT GGAACTTTGG AGAAGAAGGG ATTTATTATC 61 TTGGAGAAGG GTTTATTATT TGTTTATATT AAATTAAAAT CGAGATGGGT ATATAATTAC 121 TTCAAATTTA ATATAGTGGT GGGCATTCGG TATTTCGGTT CGGTTCGGGT TTTTTTCGGT 181 TTTTTCGGTT TTCGGTTTTT GTAAATTGCG TATCGAATAC CGAATCGAAA TATTTCGGTT 241 TGATTTTTAT TAATTCGATT CGATTTTTTA TTTCGGTTTT TTAATGGACC AAACTGCTGC 301 ACGGTCGGCG GCTGGCTGCT GCTGAGTGCT GTCGCGGCTG GTTGCTGCTG TGCTGCTGTC 361 TGTGGCTGGC TGCTGCTGTG CTGCTGTCGC GGCTACTGCT GCTTCCCTTG TTGTGGACGG 421 CGGCTGGCGC CTGGCGCCTG GCGGCGCTGC TGAAGACAAG AGTGGTTGGA GCTGAAGAGT 481 AGTTGGA Predicted gene structure (within gDNA segment 4065 to 7846): Exon 1 4847 4854 ( 8 n); cDNA 103 110 ( 8 n); score: 0.625 Intron 1 4855 5526 ( 672 n); Pd: 0.593 (s: 0), Pa: 0.635 (s: 0) Exon 2 5527 5531 ( 5 n); cDNA 111 115 ( 5 n); score: 0.600 Intron 2 5532 6566 (1035 n); Pd: 0.900 (s: 0), Pa: 0.850 (s: 0.80) Exon 3 6567 6730 ( 164 n); cDNA 116 278 ( 163 n); score: 0.726 MATCH C06HBa0153O03.1-5+ SGN-E255789- 0.726 177 0.363 C PGS_C06HBa0153O03.1-5+_SGN-E255789- (4847 4854,5527 5531,6567 6730) Alignment (genomic DNA sequence = upper lines): AGAAACGTGT ATGTTCAAAG AAATCTGTTC AGACCCGTTT TATCCAGAAA GTTGTGTCTT 4906 ||| || AGATGGGT.. .......... .......... .......... .......... .......... 110 TTGGAAAAAA TAACAAGTTT TTGGAAAAAT GTGTCCGTTA GGAAAATAAC GGCTTTTTGG 4966 .......... .......... .......... .......... .......... .......... 110 AAAGTAAGGA CTTTTCGGAA AGAGTAATAA CTTTTCGGAA TGTTACCATT AAGACATAAT 5026 .......... .......... .......... .......... .......... .......... 110 ATTAACAAGA TTTATTTGAT TTAACAAAAA CTGATTAAAT AAATTTTGTC CAAAAAATTT 5086 .......... .......... .......... .......... .......... .......... 110 ATCAATCAAT CACATCATTT GCCAAATCCA AATCCAAATC CAAATCCTAA GCCGAAGCCG 5146 .......... .......... .......... .......... .......... .......... 110 AGCGAACGAC GACGACGGCG CGAGGGGGCA TCTTCTTCTT AGCTCTTTAA GAATTAATGG 5206 .......... .......... .......... .......... .......... .......... 110 AAGTGTTTCC TTATATAAGG ACAACAATTT CCCTTTCTTT TGATGACATA GGAGAAATGA 5266 .......... .......... .......... .......... .......... .......... 110 CTTTTCATTT GCACTTTGTA AATGACTTTT CATTTTCCCT CCAAAATAGT TCCCTTACTT 5326 .......... .......... .......... .......... .......... .......... 110 TTCATATTCT CTCTTTTCTT TTCTCATTCA CACATGTTAA ATCTAACAAT CCCCCACGTG 5386 .......... .......... .......... .......... .......... .......... 110 AATAGGGAAG GCTATTGTTA AAACATATGC ATGAAAAACT TGTGTGTCTT CTGGTAAAGG 5446 .......... .......... .......... .......... .......... .......... 110 CTAATCGCAT CTGGATAAGT AGATTTCCCT TTAAACTTTC CGTAGTGAAC ATATATCGGA 5506 .......... .......... .......... .......... .......... .......... 110 TATACTCGGT CAATTGGTAG ATTTGATATC TTTGAACCGT CGAGCTTTGT TATATACCTA 5566 || | .......... .......... ATATA..... .......... .......... .......... 115 GACAACATAT GTCACACAAT CAACCCTTGA ACTGTTCTTA GTTCTCATTG TTTTGTTCGT 5626 .......... .......... .......... .......... .......... .......... 115 TTCAGCCACG AAAACATCTT GGATAGTAAG TGCTTAAAGA GCTGGCCTTA CCGGATTCTC 5686 .......... .......... .......... .......... .......... .......... 115 CTTGAAGCGG CTTACACTTC ACACTTACAT AGGTGATTTC TAAATGTGTT ATCCCATAGA 5746 .......... .......... .......... .......... .......... .......... 115 TATACCATTT GATATTCCAT GTATCAAACT TAGAAACCAT TAAAAAGTCC TTACGTCTTT 5806 .......... .......... .......... .......... .......... .......... 115 ATCCTTATTA CTAAATATTG TCTCATCATG AAAATGGACC ATAAAATAAT AAGAATAATT 5866 .......... .......... .......... .......... .......... .......... 115 TATTTTTTTC TTGACAATGT TGAACCGTCA TCAATGACTT TGTTTTATCT CCTTGAACCT 5926 .......... .......... .......... .......... .......... .......... 115 AGATCATGGG ATCTCCTGTA TTCTAGGTAG AGGTACCGCC ACGATGACTT TTTCTCATCC 5986 .......... .......... .......... .......... .......... .......... 115 ATAGTCCCAT TCTCATCGAT GATTTCTCAA CTCCCTCACT ATTTAGGCCT TTTGAAAGTG 6046 .......... .......... .......... .......... .......... .......... 115 GATCTGACAC ATTATCTTTT GACTTCACAT AGTCAATTGT GATAATTCAA CTAGCGAGTA 6106 .......... .......... .......... .......... .......... .......... 115 GTTATCTCAC ATAGTCATGT CTTCGTCTTA TGTGACGAGA CTTGTCGTTA TACATAATGC 6166 .......... .......... .......... .......... .......... .......... 115 TTCCAGCCCT TCCTATTGCA GCTTGACTAT CACAGTGTAT GCATATAGGG GCCATTGATA 6226 .......... .......... .......... .......... .......... .......... 115 TGGGCCAAAA TGGAATATCT TCTAAGAAAT TGTGAAACCA TTCAGCTTCT TCACCTTCCT 6286 .......... .......... .......... .......... .......... .......... 115 TGTCTAAAGC AATGAATTCA GACTTCATTA TAGAGCAAGC TATACATGTT TGTTTGGATG 6346 .......... .......... .......... .......... .......... .......... 115 ATTTCAAAGA TATTGCTCCT CCACCAATAG TAAAAACGTA TCCACTTGTG GATTTTTTTT 6406 .......... .......... .......... .......... .......... .......... 115 CAGTTGACCC AGTAATCAAA TTTGCATCAC TATATCCTTC AAGAACGGCT AGATATGTGT 6466 .......... .......... .......... .......... .......... .......... 115 TGTAATGTAA AGCATACTTT TGAGTATCAT CTAAGTATCT CAAAACTCTC TTCATTGCCA 6526 .......... .......... .......... .......... .......... .......... 115 ACCAATGACC TTGATTAGGA TTACTTGTGT ATCGACTCAG TTTACTGATA GCGCAAGCTA 6586 ||||| | || || .......... .......... .......... .......... ATTACTTCAA ATTTAA--TA 133 TAGTGGTGGG CATTCGGTAT TTCGGTTCGG TTC-GG--TT TTT----TT- TTCGGTTTTC 6638 |||||||||| |||||||||| |||||||||| ||| || || ||| || |||||||||| TAGTGGTGGG CATTCGGTAT TTCGGTTCGG TTCGGGTTTT TTTCGGTTTT TTCGGTTTTC 193 GGTTTTTGTA AATTGCGTAC CGAATACCGA ACCGAAATAT TTTGGTTCGG TTCGGTTTTT 6698 |||||||||| ||||||||| |||||||||| | ||||||| || | ||| || | ||||| GGTTTTTGTA AATTGCGTAT CGAATACCGA ATCGAAATA- -TT--T-CGG TTTGATTTTT 248 GTTAATTCGG TTCGGTTTTT ATTAATTCGG TT 6730 |||||||| ||| | |||| ||| ||||| || ATTAATTCGA TTC-GATTTT -TTATTTCGG TT 278 hqPGS_C06HBa0153O03.1-5+_SGN-E255789- (6567 6730) ******************************************************************************** EST sequence 16 -strand 533 n (File: SGN-E269527-) 1 AAGTGTAATC TCCTTCATTC TCTTAATCAC ATCAACAGTT TGAGCTTGCA TAGCCTCCAT 61 GAACCACCTC CTGTCCTCCT CACTCAAATT GCGAGGAGGA GGTTTGGTAC CATCAGCATG 121 AGATAGACTC CATTTCAATA AACCATCCCA ATTAGGTCCC TCTTTCGCCA TTTTTGATTT 181 CTTACTTAGA TTTTTCGGTG TTTTGTTTAG GGAAGAAGAA GAATTTATTG AAGAGATTTT 241 GTAGAAACTA GAAGGTACTG GAACTTTGGA GAAGAAGGGA TTTATTATCT TGGAGAAGGG 301 TTTATTATTT GTTTATATTA AATTAAAATC GAGATGGGTA TATAATTACT TCAAATTTAA 361 TATAGTGGTG GGCATTCGGT ATTTCGGTTC GGTTCGGGTT TTTTTCGGTT TTTTCGGTTT 421 TCGGTTTTTG TAAATTGCGT ATCGAATACC GAATCGAAAT ATTTCGGTTT GATTTTTATT 481 AATTCGATTC GATTTTTTAT TTCGGTTTTT TAATGGACCA AACTGCTGCA CGG Predicted gene structure (within gDNA segment 2375 to 7846): Exon 1 4339 4373 ( 35 n); cDNA 297 329 ( 33 n); score: 0.686 Intron 1 4374 4844 ( 471 n); Pd: 0.000 (s: 0), Pa: 0.259 (s: 0) Exon 2 4845 4854 ( 10 n); cDNA 330 339 ( 10 n); score: 0.600 Intron 2 4855 5526 ( 672 n); Pd: 0.593 (s: 0), Pa: 0.635 (s: 0) Exon 3 5527 5531 ( 5 n); cDNA 340 344 ( 5 n); score: 0.600 Intron 3 5532 6566 (1035 n); Pd: 0.900 (s: 0), Pa: 0.850 (s: 0.80) Exon 4 6567 6685 ( 119 n); cDNA 345 469 ( 125 n); score: 0.723 MATCH C06HBa0153O03.1-5+ SGN-E269527- 0.723 169 0.317 C PGS_C06HBa0153O03.1-5+_SGN-E269527- (4339 4373,4845 4854,5527 5531,6567 6685) Alignment (genomic DNA sequence = upper lines): AGGGTTTGAT ATATTGTGTT TTAATGAGTT TAAGGGTTCA GTTAATAAAG AATTAAACAG 4398 ||||||| | || |||| || || | | || || AGGGTTTATT AT-TTGT-TT ATATTAAATT AAAAT..... .......... .......... 329 AAGATTTCAT AATATAAACT CATACAAATG TAGGAGTTCA TTTAATTATT TCGCCAAATT 4458 .......... .......... .......... .......... .......... .......... 329 TAAATGGTAT CTTCTAATTG TTATTCAACC TGTATTTATA AGGTTAACCA ATAAACAGAA 4518 .......... .......... .......... .......... .......... .......... 329 ATAGAAATAA GATGAAACAG AAAATACATA AAATACAATA AGTAATCCGA GTCTACAAAA 4578 .......... .......... .......... .......... .......... .......... 329 ACTACTATGT GTCCTTAAGA AATTTAATCC CCTCACTGTA CACAAGGTTA TGGATTAATT 4638 .......... .......... .......... .......... .......... .......... 329 TCTCCCAAGA TAAAATGGAT TAAACCTGTT AAAGAAATAG CAGCACCTCA GATTTCTTTA 4698 .......... .......... .......... .......... .......... .......... 329 ACTAAAGCGA AATTCAGAAC AACAACAAGT CACATAGACT CAGTCGATCG ACACTTTGAT 4758 .......... .......... .......... .......... .......... .......... 329 TTATTTGAGA GAAAAATATA TGCAGAGAAG GAAAATTTTA GTGTTTGAAA AATCAAAAAT 4818 .......... .......... .......... .......... .......... .......... 329 TGACTTCCTT TTATAGCCAT TTTCAGCAAG AAACGTGTAT GTTCAAAGAA ATCTGTTCAG 4878 | || | || .......... .......... ......CGAG ATGGGT.... .......... .......... 339 ACCCGTTTTA TCCAGAAAGT TGTGTCTTTT GGAAAAAATA ACAAGTTTTT GGAAAAATGT 4938 .......... .......... .......... .......... .......... .......... 339 GTCCGTTAGG AAAATAACGG CTTTTTGGAA AGTAAGGACT TTTCGGAAAG AGTAATAACT 4998 .......... .......... .......... .......... .......... .......... 339 TTTCGGAATG TTACCATTAA GACATAATAT TAACAAGATT TATTTGATTT AACAAAAACT 5058 .......... .......... .......... .......... .......... .......... 339 GATTAAATAA ATTTTGTCCA AAAAATTTAT CAATCAATCA CATCATTTGC CAAATCCAAA 5118 .......... .......... .......... .......... .......... .......... 339 TCCAAATCCA AATCCTAAGC CGAAGCCGAG CGAACGACGA CGACGGCGCG AGGGGGCATC 5178 .......... .......... .......... .......... .......... .......... 339 TTCTTCTTAG CTCTTTAAGA ATTAATGGAA GTGTTTCCTT ATATAAGGAC AACAATTTCC 5238 .......... .......... .......... .......... .......... .......... 339 CTTTCTTTTG ATGACATAGG AGAAATGACT TTTCATTTGC ACTTTGTAAA TGACTTTTCA 5298 .......... .......... .......... .......... .......... .......... 339 TTTTCCCTCC AAAATAGTTC CCTTACTTTT CATATTCTCT CTTTTCTTTT CTCATTCACA 5358 .......... .......... .......... .......... .......... .......... 339 CATGTTAAAT CTAACAATCC CCCACGTGAA TAGGGAAGGC TATTGTTAAA ACATATGCAT 5418 .......... .......... .......... .......... .......... .......... 339 GAAAAACTTG TGTGTCTTCT GGTAAAGGCT AATCGCATCT GGATAAGTAG ATTTCCCTTT 5478 .......... .......... .......... .......... .......... .......... 339 AAACTTTCCG TAGTGAACAT ATATCGGATA TACTCGGTCA ATTGGTAGAT TTGATATCTT 5538 || | .......... .......... .......... .......... ........AT ATA....... 344 TGAACCGTCG AGCTTTGTTA TATACCTAGA CAACATATGT CACACAATCA ACCCTTGAAC 5598 .......... .......... .......... .......... .......... .......... 344 TGTTCTTAGT TCTCATTGTT TTGTTCGTTT CAGCCACGAA AACATCTTGG ATAGTAAGTG 5658 .......... .......... .......... .......... .......... .......... 344 CTTAAAGAGC TGGCCTTACC GGATTCTCCT TGAAGCGGCT TACACTTCAC ACTTACATAG 5718 .......... .......... .......... .......... .......... .......... 344 GTGATTTCTA AATGTGTTAT CCCATAGATA TACCATTTGA TATTCCATGT ATCAAACTTA 5778 .......... .......... .......... .......... .......... .......... 344 GAAACCATTA AAAAGTCCTT ACGTCTTTAT CCTTATTACT AAATATTGTC TCATCATGAA 5838 .......... .......... .......... .......... .......... .......... 344 AATGGACCAT AAAATAATAA GAATAATTTA TTTTTTTCTT GACAATGTTG AACCGTCATC 5898 .......... .......... .......... .......... .......... .......... 344 AATGACTTTG TTTTATCTCC TTGAACCTAG ATCATGGGAT CTCCTGTATT CTAGGTAGAG 5958 .......... .......... .......... .......... .......... .......... 344 GTACCGCCAC GATGACTTTT TCTCATCCAT AGTCCCATTC TCATCGATGA TTTCTCAACT 6018 .......... .......... .......... .......... .......... .......... 344 CCCTCACTAT TTAGGCCTTT TGAAAGTGGA TCTGACACAT TATCTTTTGA CTTCACATAG 6078 .......... .......... .......... .......... .......... .......... 344 TCAATTGTGA TAATTCAACT AGCGAGTAGT TATCTCACAT AGTCATGTCT TCGTCTTATG 6138 .......... .......... .......... .......... .......... .......... 344 TGACGAGACT TGTCGTTATA CATAATGCTT CCAGCCCTTC CTATTGCAGC TTGACTATCA 6198 .......... .......... .......... .......... .......... .......... 344 CAGTGTATGC ATATAGGGGC CATTGATATG GGCCAAAATG GAATATCTTC TAAGAAATTG 6258 .......... .......... .......... .......... .......... .......... 344 TGAAACCATT CAGCTTCTTC ACCTTCCTTG TCTAAAGCAA TGAATTCAGA CTTCATTATA 6318 .......... .......... .......... .......... .......... .......... 344 GAGCAAGCTA TACATGTTTG TTTGGATGAT TTCAAAGATA TTGCTCCTCC ACCAATAGTA 6378 .......... .......... .......... .......... .......... .......... 344 AAAACGTATC CACTTGTGGA TTTTTTTTCA GTTGACCCAG TAATCAAATT TGCATCACTA 6438 .......... .......... .......... .......... .......... .......... 344 TATCCTTCAA GAACGGCTAG ATATGTGTTG TAATGTAAAG CATACTTTTG AGTATCATCT 6498 .......... .......... .......... .......... .......... .......... 344 AAGTATCTCA AAACTCTCTT CATTGCCAAC CAATGACCTT GATTAGGATT ACTTGTGTAT 6558 .......... .......... .......... .......... .......... .......... 344 CGACTCAGTT TACTGATAGC GCAAGCTATA GTGGTGGGCA TTCGGTATTT CGGTTCGGTT 6618 | |||| | || |||| |||||||||| |||||||||| |||||||||| ........AT TACTTCAAAT TTAA--TATA GTGGTGGGCA TTCGGTATTT CGGTTCGGTT 394 C-GG--TTTT T----TTT-T CGGTTTTCGG TTTTTGTAAA TTGCGTACCG AATACCGAAC 6670 | || |||| | ||| | |||||||||| |||||||||| ||||||| || ||||||||| CGGGTTTTTT TCGGTTTTTT CGGTTTTCGG TTTTTGTAAA TTGCGTATCG AATACCGAAT 454 CGAAATATTT TGGTT 6685 |||||||||| |||| CGAAATATTT CGGTT 469 hqPGS_C06HBa0153O03.1-5+_SGN-E269527- (6567 6685) ******************************************************************************** EST sequence 4 -strand 237 n (File: SGN-E250948-) 1 TGTAGAAACT AGAAGGTACT GGAACTTTGG AGAAGAAGGG ATTTATTATC TTGGAGAAGG 61 GTTTATTATT TGTTTATATT AAATTAAAAT CGAGATGGGT ATATAATTAC TTCAAATTTA 121 ATATAGTGGT GGGCATTCGG TATTTCGGTT CGGTTCGGGT TTTTTTCGGT TTTTTCGGTT 181 TTCGGTTTTT GTAAATTGCG TATCGAATAC CGAATCGAAA TATTTCGGTT TGATTTT Predicted gene structure (within gDNA segment 4165 to 7607): Exon 1 4847 4854 ( 8 n); cDNA 93 100 ( 8 n); score: 0.625 Intron 1 4855 5526 ( 672 n); Pd: 0.593 (s: 0), Pa: 0.635 (s: 0) Exon 2 5527 5531 ( 5 n); cDNA 101 105 ( 5 n); score: 0.600 Intron 2 5532 6566 (1035 n); Pd: 0.900 (s: 0), Pa: 0.850 (s: 0.80) Exon 3 6567 6637 ( 71 n); cDNA 106 173 ( 68 n); score: 0.831 MATCH C06HBa0153O03.1-5+ SGN-E250948- 0.831 84 0.354 C PGS_C06HBa0153O03.1-5+_SGN-E250948- (4847 4854,5527 5531,6567 6637) Alignment (genomic DNA sequence = upper lines): AGAAACGTGT ATGTTCAAAG AAATCTGTTC AGACCCGTTT TATCCAGAAA GTTGTGTCTT 4906 ||| || AGATGGGT.. .......... .......... .......... .......... .......... 100 TTGGAAAAAA TAACAAGTTT TTGGAAAAAT GTGTCCGTTA GGAAAATAAC GGCTTTTTGG 4966 .......... .......... .......... .......... .......... .......... 100 AAAGTAAGGA CTTTTCGGAA AGAGTAATAA CTTTTCGGAA TGTTACCATT AAGACATAAT 5026 .......... .......... .......... .......... .......... .......... 100 ATTAACAAGA TTTATTTGAT TTAACAAAAA CTGATTAAAT AAATTTTGTC CAAAAAATTT 5086 .......... .......... .......... .......... .......... .......... 100 ATCAATCAAT CACATCATTT GCCAAATCCA AATCCAAATC CAAATCCTAA GCCGAAGCCG 5146 .......... .......... .......... .......... .......... .......... 100 AGCGAACGAC GACGACGGCG CGAGGGGGCA TCTTCTTCTT AGCTCTTTAA GAATTAATGG 5206 .......... .......... .......... .......... .......... .......... 100 AAGTGTTTCC TTATATAAGG ACAACAATTT CCCTTTCTTT TGATGACATA GGAGAAATGA 5266 .......... .......... .......... .......... .......... .......... 100 CTTTTCATTT GCACTTTGTA AATGACTTTT CATTTTCCCT CCAAAATAGT TCCCTTACTT 5326 .......... .......... .......... .......... .......... .......... 100 TTCATATTCT CTCTTTTCTT TTCTCATTCA CACATGTTAA ATCTAACAAT CCCCCACGTG 5386 .......... .......... .......... .......... .......... .......... 100 AATAGGGAAG GCTATTGTTA AAACATATGC ATGAAAAACT TGTGTGTCTT CTGGTAAAGG 5446 .......... .......... .......... .......... .......... .......... 100 CTAATCGCAT CTGGATAAGT AGATTTCCCT TTAAACTTTC CGTAGTGAAC ATATATCGGA 5506 .......... .......... .......... .......... .......... .......... 100 TATACTCGGT CAATTGGTAG ATTTGATATC TTTGAACCGT CGAGCTTTGT TATATACCTA 5566 || | .......... .......... ATATA..... .......... .......... .......... 105 GACAACATAT GTCACACAAT CAACCCTTGA ACTGTTCTTA GTTCTCATTG TTTTGTTCGT 5626 .......... .......... .......... .......... .......... .......... 105 TTCAGCCACG AAAACATCTT GGATAGTAAG TGCTTAAAGA GCTGGCCTTA CCGGATTCTC 5686 .......... .......... .......... .......... .......... .......... 105 CTTGAAGCGG CTTACACTTC ACACTTACAT AGGTGATTTC TAAATGTGTT ATCCCATAGA 5746 .......... .......... .......... .......... .......... .......... 105 TATACCATTT GATATTCCAT GTATCAAACT TAGAAACCAT TAAAAAGTCC TTACGTCTTT 5806 .......... .......... .......... .......... .......... .......... 105 ATCCTTATTA CTAAATATTG TCTCATCATG AAAATGGACC ATAAAATAAT AAGAATAATT 5866 .......... .......... .......... .......... .......... .......... 105 TATTTTTTTC TTGACAATGT TGAACCGTCA TCAATGACTT TGTTTTATCT CCTTGAACCT 5926 .......... .......... .......... .......... .......... .......... 105 AGATCATGGG ATCTCCTGTA TTCTAGGTAG AGGTACCGCC ACGATGACTT TTTCTCATCC 5986 .......... .......... .......... .......... .......... .......... 105 ATAGTCCCAT TCTCATCGAT GATTTCTCAA CTCCCTCACT ATTTAGGCCT TTTGAAAGTG 6046 .......... .......... .......... .......... .......... .......... 105 GATCTGACAC ATTATCTTTT GACTTCACAT AGTCAATTGT GATAATTCAA CTAGCGAGTA 6106 .......... .......... .......... .......... .......... .......... 105 GTTATCTCAC ATAGTCATGT CTTCGTCTTA TGTGACGAGA CTTGTCGTTA TACATAATGC 6166 .......... .......... .......... .......... .......... .......... 105 TTCCAGCCCT TCCTATTGCA GCTTGACTAT CACAGTGTAT GCATATAGGG GCCATTGATA 6226 .......... .......... .......... .......... .......... .......... 105 TGGGCCAAAA TGGAATATCT TCTAAGAAAT TGTGAAACCA TTCAGCTTCT TCACCTTCCT 6286 .......... .......... .......... .......... .......... .......... 105 TGTCTAAAGC AATGAATTCA GACTTCATTA TAGAGCAAGC TATACATGTT TGTTTGGATG 6346 .......... .......... .......... .......... .......... .......... 105 ATTTCAAAGA TATTGCTCCT CCACCAATAG TAAAAACGTA TCCACTTGTG GATTTTTTTT 6406 .......... .......... .......... .......... .......... .......... 105 CAGTTGACCC AGTAATCAAA TTTGCATCAC TATATCCTTC AAGAACGGCT AGATATGTGT 6466 .......... .......... .......... .......... .......... .......... 105 TGTAATGTAA AGCATACTTT TGAGTATCAT CTAAGTATCT CAAAACTCTC TTCATTGCCA 6526 .......... .......... .......... .......... .......... .......... 105 ACCAATGACC TTGATTAGGA TTACTTGTGT ATCGACTCAG TTTACTGATA GCGCAAGCTA 6586 ||||| | || || .......... .......... .......... .......... ATTACTTCAA ATTTAA--TA 123 TAGTGGTGGG CATTCGGTAT TTCGGTTCGG TTCGGTTTTT TTTTCGGTTT T 6637 |||||||||| |||||||||| |||||||||| ||||| ||| |||||||||| | TAGTGGTGGG CATTCGGTAT TTCGGTTCGG TTCGG-GTTT TTTTCGGTTT T 173 hqPGS_C06HBa0153O03.1-5+_SGN-E250948- (6567 6637) ******************************************************************************** EST sequence 30 -strand 672 n (File: SGN-E396524-) 1 GAGGAGGAGG TTTGGTACCA TCAGCATGAG ATAGACTCCA TTTCAATAAA CCATCCCAAT 61 TAGGTCCCTC TTTCGCCATT TTTGATTTCT TACTTAGATT TTTCGGTGTT TTGTTTAGGG 121 AAGAAGAAGA ATTTATTGAA GAGATTTTGT AGAAACTAGA AGGTACTGGA ACTTTGGAGA 181 AGAAGGGATT TATTATCTTG GAGAAGGGTT TATTATTTGT TTATATTAAA TTAAAATCGA 241 GATGGGTATA TAACTACTTC AAATTTAATA TAGTGGTGGG CATTCGGTAT TTCGGTTCGG 301 TTCGGGTTTT TTTCGGTTTT TTCGGTTTTC GGTTTTTGTA AATTGCGTAT CGAATACCGA 361 ATCGAAATAT TTCGGTTTGA TTTTTATTAA TTCGATTCGA TTTTTTATTT CGGTTTTTTA 421 ATGGACCAAA CTGCTGCACG GTCGGCGGCT GGCTGCTGCT GAGTGCTGTC GCGGCTGGTT 481 GCTGCTGTGC TGCTGTCTGT GGCTGGCTGC TGCTGTGCTG CTGTCGCGGC TACTGCTGCT 541 TCCCTTGTTG TGGACGGCGG CTGGCGCCTG GCGCCTGGCG GCGCTGCTGA AGACAAGAGT 601 GGTTGGAGCT GAAGAGTAGT TGGAATAGAA GAGGAAGAGG AGAAGATAAG AGGAAGAAGA 661 GCTGAAGAGG AG Predicted gene structure (within gDNA segment 2695 to 7846): Exon 1 4339 4373 ( 35 n); cDNA 205 237 ( 33 n); score: 0.686 Intron 1 4374 4844 ( 471 n); Pd: 0.000 (s: 0), Pa: 0.259 (s: 0) Exon 2 4845 4854 ( 10 n); cDNA 238 247 ( 10 n); score: 0.600 Intron 2 4855 6355 (1501 n); Pd: 0.593 (s: 0), Pa: 0.242 (s: 0) Exon 3 6356 6375 ( 20 n); cDNA 248 267 ( 20 n); score: 0.600 Intron 3 6376 6583 ( 208 n); Pd: 0.221 (s: 0), Pa: 0.000 (s: 0.64) Exon 4 6584 6730 ( 147 n); cDNA 268 415 ( 148 n); score: 0.755 MATCH C06HBa0153O03.1-5+ SGN-E396524- 0.755 212 0.315 C PGS_C06HBa0153O03.1-5+_SGN-E396524- (4339 4373,4845 4854,6356 6375,6584 6730) Alignment (genomic DNA sequence = upper lines): AGGGTTTGAT ATATTGTGTT TTAATGAGTT TAAGGGTTCA GTTAATAAAG AATTAAACAG 4398 ||||||| | || |||| || || | | || || AGGGTTTATT AT-TTGT-TT ATATTAAATT AAAAT..... .......... .......... 237 AAGATTTCAT AATATAAACT CATACAAATG TAGGAGTTCA TTTAATTATT TCGCCAAATT 4458 .......... .......... .......... .......... .......... .......... 237 TAAATGGTAT CTTCTAATTG TTATTCAACC TGTATTTATA AGGTTAACCA ATAAACAGAA 4518 .......... .......... .......... .......... .......... .......... 237 ATAGAAATAA GATGAAACAG AAAATACATA AAATACAATA AGTAATCCGA GTCTACAAAA 4578 .......... .......... .......... .......... .......... .......... 237 ACTACTATGT GTCCTTAAGA AATTTAATCC CCTCACTGTA CACAAGGTTA TGGATTAATT 4638 .......... .......... .......... .......... .......... .......... 237 TCTCCCAAGA TAAAATGGAT TAAACCTGTT AAAGAAATAG CAGCACCTCA GATTTCTTTA 4698 .......... .......... .......... .......... .......... .......... 237 ACTAAAGCGA AATTCAGAAC AACAACAAGT CACATAGACT CAGTCGATCG ACACTTTGAT 4758 .......... .......... .......... .......... .......... .......... 237 TTATTTGAGA GAAAAATATA TGCAGAGAAG GAAAATTTTA GTGTTTGAAA AATCAAAAAT 4818 .......... .......... .......... .......... .......... .......... 237 TGACTTCCTT TTATAGCCAT TTTCAGCAAG AAACGTGTAT GTTCAAAGAA ATCTGTTCAG 4878 | || | || .......... .......... ......CGAG ATGGGT.... .......... .......... 247 ACCCGTTTTA TCCAGAAAGT TGTGTCTTTT GGAAAAAATA ACAAGTTTTT GGAAAAATGT 4938 .......... .......... .......... .......... .......... .......... 247 GTCCGTTAGG AAAATAACGG CTTTTTGGAA AGTAAGGACT TTTCGGAAAG AGTAATAACT 4998 .......... .......... .......... .......... .......... .......... 247 TTTCGGAATG TTACCATTAA GACATAATAT TAACAAGATT TATTTGATTT AACAAAAACT 5058 .......... .......... .......... .......... .......... .......... 247 GATTAAATAA ATTTTGTCCA AAAAATTTAT CAATCAATCA CATCATTTGC CAAATCCAAA 5118 .......... .......... .......... .......... .......... .......... 247 TCCAAATCCA AATCCTAAGC CGAAGCCGAG CGAACGACGA CGACGGCGCG AGGGGGCATC 5178 .......... .......... .......... .......... .......... .......... 247 TTCTTCTTAG CTCTTTAAGA ATTAATGGAA GTGTTTCCTT ATATAAGGAC AACAATTTCC 5238 .......... .......... .......... .......... .......... .......... 247 CTTTCTTTTG ATGACATAGG AGAAATGACT TTTCATTTGC ACTTTGTAAA TGACTTTTCA 5298 .......... .......... .......... .......... .......... .......... 247 TTTTCCCTCC AAAATAGTTC CCTTACTTTT CATATTCTCT CTTTTCTTTT CTCATTCACA 5358 .......... .......... .......... .......... .......... .......... 247 CATGTTAAAT CTAACAATCC CCCACGTGAA TAGGGAAGGC TATTGTTAAA ACATATGCAT 5418 .......... .......... .......... .......... .......... .......... 247 GAAAAACTTG TGTGTCTTCT GGTAAAGGCT AATCGCATCT GGATAAGTAG ATTTCCCTTT 5478 .......... .......... .......... .......... .......... .......... 247 AAACTTTCCG TAGTGAACAT ATATCGGATA TACTCGGTCA ATTGGTAGAT TTGATATCTT 5538 .......... .......... .......... .......... .......... .......... 247 TGAACCGTCG AGCTTTGTTA TATACCTAGA CAACATATGT CACACAATCA ACCCTTGAAC 5598 .......... .......... .......... .......... .......... .......... 247 TGTTCTTAGT TCTCATTGTT TTGTTCGTTT CAGCCACGAA AACATCTTGG ATAGTAAGTG 5658 .......... .......... .......... .......... .......... .......... 247 CTTAAAGAGC TGGCCTTACC GGATTCTCCT TGAAGCGGCT TACACTTCAC ACTTACATAG 5718 .......... .......... .......... .......... .......... .......... 247 GTGATTTCTA AATGTGTTAT CCCATAGATA TACCATTTGA TATTCCATGT ATCAAACTTA 5778 .......... .......... .......... .......... .......... .......... 247 GAAACCATTA AAAAGTCCTT ACGTCTTTAT CCTTATTACT AAATATTGTC TCATCATGAA 5838 .......... .......... .......... .......... .......... .......... 247 AATGGACCAT AAAATAATAA GAATAATTTA TTTTTTTCTT GACAATGTTG AACCGTCATC 5898 .......... .......... .......... .......... .......... .......... 247 AATGACTTTG TTTTATCTCC TTGAACCTAG ATCATGGGAT CTCCTGTATT CTAGGTAGAG 5958 .......... .......... .......... .......... .......... .......... 247 GTACCGCCAC GATGACTTTT TCTCATCCAT AGTCCCATTC TCATCGATGA TTTCTCAACT 6018 .......... .......... .......... .......... .......... .......... 247 CCCTCACTAT TTAGGCCTTT TGAAAGTGGA TCTGACACAT TATCTTTTGA CTTCACATAG 6078 .......... .......... .......... .......... .......... .......... 247 TCAATTGTGA TAATTCAACT AGCGAGTAGT TATCTCACAT AGTCATGTCT TCGTCTTATG 6138 .......... .......... .......... .......... .......... .......... 247 TGACGAGACT TGTCGTTATA CATAATGCTT CCAGCCCTTC CTATTGCAGC TTGACTATCA 6198 .......... .......... .......... .......... .......... .......... 247 CAGTGTATGC ATATAGGGGC CATTGATATG GGCCAAAATG GAATATCTTC TAAGAAATTG 6258 .......... .......... .......... .......... .......... .......... 247 TGAAACCATT CAGCTTCTTC ACCTTCCTTG TCTAAAGCAA TGAATTCAGA CTTCATTATA 6318 .......... .......... .......... .......... .......... .......... 247 GAGCAAGCTA TACATGTTTG TTTGGATGAT TTCAAAGATA TTGCTCCTCC ACCAATAGTA 6378 ||| | || || | | || .......... .......... .......... .......ATA TAACTACTTC AAATTTA... 267 AAAACGTATC CACTTGTGGA TTTTTTTTCA GTTGACCCAG TAATCAAATT TGCATCACTA 6438 .......... .......... .......... .......... .......... .......... 267 TATCCTTCAA GAACGGCTAG ATATGTGTTG TAATGTAAAG CATACTTTTG AGTATCATCT 6498 .......... .......... .......... .......... .......... .......... 267 AAGTATCTCA AAACTCTCTT CATTGCCAAC CAATGACCTT GATTAGGATT ACTTGTGTAT 6558 .......... .......... .......... .......... .......... .......... 267 CGACTCAGTT TACTGATAGC GCAAGCTATA GTGGTGGGCA TTCGGTATTT CGGTTCGGTT 6618 |||| |||||||||| |||||||||| |||||||||| .......... .......... .....ATATA GTGGTGGGCA TTCGGTATTT CGGTTCGGTT 302 C-GG--TTTT T---T-TT-T CGGTTTTCGG TTTTTGTAAA TTGCGTACCG AATACCGAAC 6670 | || |||| | | || | |||||||||| |||||||||| ||||||| || ||||||||| CGGGTTTTTT TCGGTTTTTT CGGTTTTCGG TTTTTGTAAA TTGCGTATCG AATACCGAAT 362 CGAAATATTT TGGTTCGGTT CGGTTTTTGT TAATTCGGTT CGGTTTTTAT TAATTCGGTT 6730 |||||||||| || | || | ||||| | ||||||| || | | |||| | || ||||||| CGAAATATTT CGG-T---TT -GATTTTTAT TAATTCGATT C-GATTTT-T TATTTCGGTT 415 hqPGS_C06HBa0153O03.1-5+_SGN-E396524- (6584 6730) ******************************************************************************** EST sequence 12 -strand 460 n (File: SGN-E302012-) 1 GGGTTTATTA TTTGTTTATA TTAAATTAAA ATCGAGATGG GTATATAATT ACTTCAAATT 61 TAATATAGTG GTGGGCATTC GGTATTTCGG TTCGGTTCGG GTTTTTTTCG GTTTTTTCGG 121 TTTTCGGTTT TTGTAAATTG CGTATCGAAT ACCGAATCGA AATATTTCGG TTTGATTTTT 181 ATTAATTCGA TTCGATTTTT TATTTCGGTT TTTTAATGGA CCAAACTGCT GCACGGTCGG 241 CGGCTGGCTG CTGCTGAGTG CTGTCGCGGC TGGTTGCTGC TGTGCTGCTG TCTGTGGCTG 301 GCTGCTGCTG TGCTGCTGTC GCGGCTACTG CTGCTTCCCT TGTTGTGGAC GGCGGCTGGC 361 GCCTGGCGCC TGGCGGCGCT GCTGAAGACA AGAGTGGTTG GAGCTGAAGA GTAGTTGGAA 421 TAGAAGAGGA AGAGGAGAAG ATAAGAGGAA GAAGAGCTGA Predicted gene structure (within gDNA segment 5345 to 7846): Exon 1 6625 6730 ( 106 n); cDNA 112 210 ( 99 n); score: 0.858 MATCH C06HBa0153O03.1-5+ SGN-E302012- 0.858 106 0.230 C PGS_C06HBa0153O03.1-5+_SGN-E302012- (6625 6730) Alignment (genomic DNA sequence = upper lines): TTTTTTCGGT TTTCGGTTTT TGTAAATTGC GTACCGAATA CCGAACCGAA ATATTTTGGT 6684 |||||||||| |||||||||| |||||||||| ||| |||||| ||||| |||| |||||| ||| TTTTTTCGGT TTTCGGTTTT TGTAAATTGC GTATCGAATA CCGAATCGAA ATATTTCGGT 171 TCGGTTCGGT TTTTGTTAAT TCGGTTCGGT TTTTATTAAT TCGGTT 6730 | | | | |||| ||||| ||| ||| | |||| ||| | |||||| T---T--GAT TTTTATTAAT TCGATTC-GA TTTT-TTATT TCGGTT 210 hqPGS_C06HBa0153O03.1-5+_SGN-E302012- (6625 6730) ******************************************************************************** EST sequence 25 -strand 435 n (File: SGN-E247286-) 1 TCGAGATGGG TATATAATTA CTTCAAATTT AATATAGTGG TGGGCATTCG GTATTTCGGT 61 TCGGTTCGGG TTTTTTTCGG TTTTTTCGGT TTTCGGTTTT TGTAAATTGC GTATCGAATA 121 CCGAATCGAA ATATTTCGGT TTGATTTTTA TTAATTCGAT TCGATTTTTT ATTTCGGTTT 181 TTTAATGGAC CAAACTGCTG CACGGTCGGC GGCTGGCTGC TGCTGAGTGC TGTCGCGGCT 241 GGTTGCTGCT GTGCTGCTGT CTGTGGCTGG CTGCTGCTGT GCTGCTGTCG CGGCTACTGC 301 TGCTTCCCTT GTTGTGGACG GCGGCTGGCG CCTGGCGCCT GGCGGCGCTG CTGAAGACAA 361 GAGTGGTTGG AGCTGAAGAG TAGTTGGAAT AGAAGAGGAA GAGGAGAAGA TAAGAGGAAG 421 AAGAGCTGAA GAGGA Predicted gene structure (within gDNA segment 5655 to 7846): Exon 1 6625 6730 ( 106 n); cDNA 81 179 ( 99 n); score: 0.858 MATCH C06HBa0153O03.1-5+ SGN-E247286- 0.858 106 0.244 C PGS_C06HBa0153O03.1-5+_SGN-E247286- (6625 6730) Alignment (genomic DNA sequence = upper lines): TTTTTTCGGT TTTCGGTTTT TGTAAATTGC GTACCGAATA CCGAACCGAA ATATTTTGGT 6684 |||||||||| |||||||||| |||||||||| ||| |||||| ||||| |||| |||||| || TTTTTTCGGT TTTCGGTTTT TGTAAATTGC GTATCGAATA CCGAATCGAA ATATTTCGG- 139 TCGGTTCGGT TTTTGTTAAT TCGGTTCGGT TTTTATTAAT TCGGTT 6730 | || | | |||| ||||| ||| ||| | |||| ||| | |||||| T---TT-GAT TTTTATTAAT TCGATTC-GA TTTT-TTATT TCGGTT 179 hqPGS_C06HBa0153O03.1-5+_SGN-E247286- (6625 6730) ******************************************************************************** EST sequence 2 -strand 726 n (File: SGN-E577892-) 1 AATGACTGTA TCTGGAACGA TCTCACTATG ATTTGCTACA GCATCCAATT CAGTACTCAA 61 ACCTGTATCG AGAGATGAGG AGAAGCCAGC GGAAGCTCGG ATTAGAGGAA ACGACCGGCG 121 AGGCTTGAAA ATCAAATTCG GCTTCGCAAA ATTAACTGCA TTTCTCAGTT TCCTGTCCTG 181 GAGAGCTGAG GAAGATGAGA GAGAGGAGTA TAGCGGAGAA TGCAAAGAGG ATGACATTTT 241 CAACTGAGAG AGAGAGAGAT GCGTACCGAA TACCGAACCG AAATATTTCG GTTCGGTTTG 301 GTTTTTGTTA ATTCGGTTCG GTTTTTATTA ATTCGGTTCG GTTTTTTATT TCGGTTTTTT 361 AATGGGCCTG TTTAGTGGGC TTTTTAAACT TAACAATTTT TTCAATTTTT TGTTTGATTT 421 TTCAACTTAA TGGGCTTTGA TATTTTAACT TAATGGGCTT TGATATTTCA ACTTAGGGTT 481 TCTTATTTTT TAAAAAAAGA TTATTTAATA TTAATAATTA TTAAATATAA TATAATTTAT 541 AAATTATAAT TTAATATTTC GGTTTAAACC GAAATACCGA ATTTCAAAGT AACATGTACC 601 GAAAATCGAA CCGAAATGCC GAAAATACAA AAAATTAAAC CGAATACCGA ACCGAAAACC 661 GAAATACCGA AACCGAAATT CTGAAAAATT TCGGTTCGGT TCGGTGTTTC GGTTTTCCGG 721 TTTTTA Predicted gene structure (within gDNA segment 2852 to 7790): Exon 1 3639 3669 ( 31 n); cDNA 206 233 ( 28 n); score: 0.645 Intron 1 3670 4834 (1165 n); Pd: 0.000 (s: 0), Pa: 0.636 (s: 0) Exon 2 4835 4854 ( 20 n); cDNA 234 251 ( 18 n); score: 0.650 Intron 2 4855 5952 (1098 n); Pd: 0.593 (s: 0), Pa: 0.861 (s: 0) Exon 3 5953 5958 ( 6 n); cDNA 252 256 ( 5 n); score: 0.833 Intron 3 5959 6648 ( 690 n); Pd: 0.849 (s: 0), Pa: 0.000 (s: 0.92) Exon 4 6649 7118 ( 470 n); cDNA 257 726 ( 470 n); score: 0.943 MATCH C06HBa0153O03.1-5+ SGN-E577892- 0.943 527 0.726 C PGS_C06HBa0153O03.1-5+_SGN-E577892- (3639 3669,4835 4854,5953 5958,6649 7118) Alignment (genomic DNA sequence = upper lines): GAGAAGAGGT GGAAAAATTG AAAAGAGAAG CATAGTCAAT GCCAAGACCC ATAAAATGAT 3698 ||| | | | || | || || |||||| | GAGTATA-GC GG-AGAA-TG CAAAGAGGAT G......... .......... .......... 233 ATGTCGAAGG CATTGTGCGG ACAGCGGACT AATTGTTGTC TACCTTTTGA GCTTGAATAG 3758 .......... .......... .......... .......... .......... .......... 233 ACATAAAATG TGCTTGCCAA AATTTGCAAT CAGAACTCCA AAAAGGAATT TATGGAAGTG 3818 .......... .......... .......... .......... .......... .......... 233 CAGTAAGGAG ACATAGTCGG GTCGGCCAAA GCAACCTTCG GGTTTAGTAA TTGTTAAGGT 3878 .......... .......... .......... .......... .......... .......... 233 ATGTTCAGCG AGCTTGAGCT CAAAATAGTG ATTTATCGAA TACATTCAGC GACAGTAGAT 3938 .......... .......... .......... .......... .......... .......... 233 ATCACCAAAT AAGTATGGTA TAAAAATACC AAGGGCCTTT TAAAATTCAA AAAGTCTAAA 3998 .......... .......... .......... .......... .......... .......... 233 ACTTTGATAC TTTAGAGCCT TTAAGATTCT TTTATCTTGC TTTAAATAAT TTTTAGTGTT 4058 .......... .......... .......... .......... .......... .......... 233 TGTGAGCCTC TTATGAGTGT TTAAAAATGG ATTTTGTCTC AACCCAACAT ACGTACCTCT 4118 .......... .......... .......... .......... .......... .......... 233 CAATTTTGGT GACCTTAAGG TTGGTTTTTG AATCCTTGTT CATATTTTTT AGTTTAAAGT 4178 .......... .......... .......... .......... .......... .......... 233 TTTTATAGGG TGTGAGCTTG AGCCCTCTAA CTGATTTTGT GTGCCTGATT TAAATGGGTA 4238 .......... .......... .......... .......... .......... .......... 233 GCATTGTATT GTGTTATTGG AACTGTACCC ATGAGTTTTC AATTGTGTAT GCCGGTCAAA 4298 .......... .......... .......... .......... .......... .......... 233 TAGCGAAACG TGTGTGTTGG TTAAATTGTG TAATGATCAG AGGGTTTGAT ATATTGTGTT 4358 .......... .......... .......... .......... .......... .......... 233 TTAATGAGTT TAAGGGTTCA GTTAATAAAG AATTAAACAG AAGATTTCAT AATATAAACT 4418 .......... .......... .......... .......... .......... .......... 233 CATACAAATG TAGGAGTTCA TTTAATTATT TCGCCAAATT TAAATGGTAT CTTCTAATTG 4478 .......... .......... .......... .......... .......... .......... 233 TTATTCAACC TGTATTTATA AGGTTAACCA ATAAACAGAA ATAGAAATAA GATGAAACAG 4538 .......... .......... .......... .......... .......... .......... 233 AAAATACATA AAATACAATA AGTAATCCGA GTCTACAAAA ACTACTATGT GTCCTTAAGA 4598 .......... .......... .......... .......... .......... .......... 233 AATTTAATCC CCTCACTGTA CACAAGGTTA TGGATTAATT TCTCCCAAGA TAAAATGGAT 4658 .......... .......... .......... .......... .......... .......... 233 TAAACCTGTT AAAGAAATAG CAGCACCTCA GATTTCTTTA ACTAAAGCGA AATTCAGAAC 4718 .......... .......... .......... .......... .......... .......... 233 AACAACAAGT CACATAGACT CAGTCGATCG ACACTTTGAT TTATTTGAGA GAAAAATATA 4778 .......... .......... .......... .......... .......... .......... 233 TGCAGAGAAG GAAAATTTTA GTGTTTGAAA AATCAAAAAT TGACTTCCTT TTATAGCCAT 4838 ||| .......... .......... .......... .......... .......... ......ACAT 237 TTTCAGCAAG AAACGTGTAT GTTCAAAGAA ATCTGTTCAG ACCCGTTTTA TCCAGAAAGT 4898 ||||| | | | | | TTTCAACT-G AGA-GA.... .......... .......... .......... .......... 251 TGTGTCTTTT GGAAAAAATA ACAAGTTTTT GGAAAAATGT GTCCGTTAGG AAAATAACGG 4958 .......... .......... .......... .......... .......... .......... 251 CTTTTTGGAA AGTAAGGACT TTTCGGAAAG AGTAATAACT TTTCGGAATG TTACCATTAA 5018 .......... .......... .......... .......... .......... .......... 251 GACATAATAT TAACAAGATT TATTTGATTT AACAAAAACT GATTAAATAA ATTTTGTCCA 5078 .......... .......... .......... .......... .......... .......... 251 AAAAATTTAT CAATCAATCA CATCATTTGC CAAATCCAAA TCCAAATCCA AATCCTAAGC 5138 .......... .......... .......... .......... .......... .......... 251 CGAAGCCGAG CGAACGACGA CGACGGCGCG AGGGGGCATC TTCTTCTTAG CTCTTTAAGA 5198 .......... .......... .......... .......... .......... .......... 251 ATTAATGGAA GTGTTTCCTT ATATAAGGAC AACAATTTCC CTTTCTTTTG ATGACATAGG 5258 .......... .......... .......... .......... .......... .......... 251 AGAAATGACT TTTCATTTGC ACTTTGTAAA TGACTTTTCA TTTTCCCTCC AAAATAGTTC 5318 .......... .......... .......... .......... .......... .......... 251 CCTTACTTTT CATATTCTCT CTTTTCTTTT CTCATTCACA CATGTTAAAT CTAACAATCC 5378 .......... .......... .......... .......... .......... .......... 251 CCCACGTGAA TAGGGAAGGC TATTGTTAAA ACATATGCAT GAAAAACTTG TGTGTCTTCT 5438 .......... .......... .......... .......... .......... .......... 251 GGTAAAGGCT AATCGCATCT GGATAAGTAG ATTTCCCTTT AAACTTTCCG TAGTGAACAT 5498 .......... .......... .......... .......... .......... .......... 251 ATATCGGATA TACTCGGTCA ATTGGTAGAT TTGATATCTT TGAACCGTCG AGCTTTGTTA 5558 .......... .......... .......... .......... .......... .......... 251 TATACCTAGA CAACATATGT CACACAATCA ACCCTTGAAC TGTTCTTAGT TCTCATTGTT 5618 .......... .......... .......... .......... .......... .......... 251 TTGTTCGTTT CAGCCACGAA AACATCTTGG ATAGTAAGTG CTTAAAGAGC TGGCCTTACC 5678 .......... .......... .......... .......... .......... .......... 251 GGATTCTCCT TGAAGCGGCT TACACTTCAC ACTTACATAG GTGATTTCTA AATGTGTTAT 5738 .......... .......... .......... .......... .......... .......... 251 CCCATAGATA TACCATTTGA TATTCCATGT ATCAAACTTA GAAACCATTA AAAAGTCCTT 5798 .......... .......... .......... .......... .......... .......... 251 ACGTCTTTAT CCTTATTACT AAATATTGTC TCATCATGAA AATGGACCAT AAAATAATAA 5858 .......... .......... .......... .......... .......... .......... 251 GAATAATTTA TTTTTTTCTT GACAATGTTG AACCGTCATC AATGACTTTG TTTTATCTCC 5918 .......... .......... .......... .......... .......... .......... 251 TTGAACCTAG ATCATGGGAT CTCCTGTATT CTAGGTAGAG GTACCGCCAC GATGACTTTT 5978 | |||| .......... .......... .......... ....G-AGAG .......... .......... 256 TCTCATCCAT AGTCCCATTC TCATCGATGA TTTCTCAACT CCCTCACTAT TTAGGCCTTT 6038 .......... .......... .......... .......... .......... .......... 256 TGAAAGTGGA TCTGACACAT TATCTTTTGA CTTCACATAG TCAATTGTGA TAATTCAACT 6098 .......... .......... .......... .......... .......... .......... 256 AGCGAGTAGT TATCTCACAT AGTCATGTCT TCGTCTTATG TGACGAGACT TGTCGTTATA 6158 .......... .......... .......... .......... .......... .......... 256 CATAATGCTT CCAGCCCTTC CTATTGCAGC TTGACTATCA CAGTGTATGC ATATAGGGGC 6218 .......... .......... .......... .......... .......... .......... 256 CATTGATATG GGCCAAAATG GAATATCTTC TAAGAAATTG TGAAACCATT CAGCTTCTTC 6278 .......... .......... .......... .......... .......... .......... 256 ACCTTCCTTG TCTAAAGCAA TGAATTCAGA CTTCATTATA GAGCAAGCTA TACATGTTTG 6338 .......... .......... .......... .......... .......... .......... 256 TTTGGATGAT TTCAAAGATA TTGCTCCTCC ACCAATAGTA AAAACGTATC CACTTGTGGA 6398 .......... .......... .......... .......... .......... .......... 256 TTTTTTTTCA GTTGACCCAG TAATCAAATT TGCATCACTA TATCCTTCAA GAACGGCTAG 6458 .......... .......... .......... .......... .......... .......... 256 ATATGTGTTG TAATGTAAAG CATACTTTTG AGTATCATCT AAGTATCTCA AAACTCTCTT 6518 .......... .......... .......... .......... .......... .......... 256 CATTGCCAAC CAATGACCTT GATTAGGATT ACTTGTGTAT CGACTCAGTT TACTGATAGC 6578 .......... .......... .......... .......... .......... .......... 256 GCAAGCTATA GTGGTGGGCA TTCGGTATTT CGGTTCGGTT CGGTTTTTTT TTCGGTTTTC 6638 .......... .......... .......... .......... .......... .......... 256 GGTTTTTGTA AATTGCGTAC CGAATACCGA ACCGAAATAT TTTGGTTCGG TTCGGTTTTT 6698 | ||||||| |||||||||| |||||||||| || ||||||| || ||||||| .......... AGATGCGTAC CGAATACCGA ACCGAAATAT TTCGGTTCGG TTTGGTTTTT 306 GTTAATTCGG TTCGGTTTTT ATTAATTCGG TTCAATTTTT TATTTTGGTT TTTTAATGGG 6758 |||||||||| |||||||||| |||||||||| ||| ||||| ||||| |||| |||||||||| GTTAATTCGG TTCGGTTTTT ATTAATTCGG TTCGGTTTTT TATTTCGGTT TTTTAATGGG 366 CCTGTTTAGT GGGCTTTTTA AACTTAACAA TTTTTTCAAT TTTTCGTGAG ATTATACAAT 6818 |||||||||| |||||||||| |||||||||| |||||||||| |||| || | ||| | ||| CCTGTTTAGT GGGCTTTTTA AACTTAACAA TTTTTTCAAT TTTTTGTTTG ATTTTTCAAC 426 TTTTTGGGCT TTGATATTTC AACTTAATGG GCTTTGATAT TTCAACTTAG GGTTTCTTAT 6878 || |||||| ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTAATGGGCT TTGATATTTT AACTTAATGG GCTTTGATAT TTCAACTTAG GGTTTCTTAT 486 TTGTTTTAAA AAGATTATTT AATATTAATA ATTATTAAAT ATAATATAAT TTATAAATTA 6938 || || ||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTTTAAAAA AAGATTATTT AATATTAATA ATTATTAAAT ATAATATAAT TTATAAATTA 546 TAATTTAATA TTTCGGTTTA AACCGAAATA CCGAATTCCA AAGTAACATG TACCGAAAAC 6998 |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| ||||||||| TAATTTAATA TTTCGGTTTA AACCGAAATA CCGAATTTCA AAGTAACATG TACCGAAAAT 606 CGAACCGAAA TGCCGAAAAT ACAAAAAATT AAACCGAATA CTGAACCGAA AACCGAAATA 7058 |||||||||| |||||||||| |||||||||| |||||||||| | |||||||| |||||||||| CGAACCGAAA TGCCGAAAAT ACAAAAAATT AAACCGAATA CCGAACCGAA AACCGAAATA 666 CCAAAACCGA AATTCTGAAA AATTTTGGTT CGGTTCGGTG TTTCGGTTTT CCACTCTTTA 7118 || ||||||| |||||||||| ||||| |||| |||||||||| |||||||||| || | |||| CCGAAACCGA AATTCTGAAA AATTTCGGTT CGGTTCGGTG TTTCGGTTTT CCGGTTTTTA 726 hqPGS_C06HBa0153O03.1-5+_SGN-E577892- (6649 7118) Total number of EST alignments reported: 34 ________________________________________________________________________________ Predicted gene locations (3) in segment 1 to 7846: PGL 1 (+ strand): 888 2029 AGS-1 (888 1013) SCR (e 0.897) Exon 1 888 1013 ( 126 n); score: 0.897 PGS (888 1013) SGN-E546254- 3-phase translation of AGS-1 (+strand): . . . . . . 888 CTTTGTCACGACCGGCATCTAGACCTCATAAGAGACCAGCGTCGATGACCTCTCAGAGGT L C H D R H L D L I R D Q R R - P L R G F V T T G I - T S - E T S V D D L S E V L S R P A S R P H K R P A S M T S Q R . . . . . . 948 CGCAGACAAGCCTACTTACGTCATTCTTACTTTACATAGGTTAATTTTAGCGGAAAATTT R R Q A Y L R H S Y F T - V N F S G K F A D K P T Y V I L T L H R L I L A E N F S Q T S L L T S F L L Y I G - F - R K I . 1008 TTGTTT L F C F V Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-1 (-strand): . . . . . . 1013 AAACAAAAATTTTCCGCTAAAATTAACCTATGTAAAGTAAGAATGACGTAAGTAGGCTTG K Q K F S A K I N L C K V R M T - V G L N K N F P L K L T Y V K - E - R K - A C T K I F R - N - P M - S K N D V S R L . . . . . . 953 TCTGCGACCTCTGAGAGGTCATCGACGCTGGTCTCTTATGAGGTCTAGATGCCGGTCGTG S A T S E R S S T L V S Y E V - M P V V L R P L R G H R R W S L M R S R C R S - V C D L - E V I D A G L L - G L D A G R . 893 ACAAAG T K Q D K Maximal non-overlapping open reading frames (>= 64 codons): none AGS-2 (1161 1552,1585 2029) SCR (e 0.847 d 0.057 a 0.000,e 0.863) Exon 1 1161 1552 ( 392 n); score: 0.847 Intron 1 1553 1584 ( 32 n); Pd: 0.057 Pa: 0.000 Exon 2 1585 2029 ( 445 n); score: 0.863 PGS (1161 1552,1585 1910) SGN-E389503+ PGS (1309 1552,1585 2029) SGN-E389994- PGS (1377 1552,1585 2028) SGN-E235464- 3-phase translation of AGS-2 (+strand): . . . . . . 1161 AACAGGAAGGAAACGCTAGTGGAACATGCTCCACTAGCTCAACTCTAAAACTAAGCTAGA N R K E T L V E H A P L A Q L - N - A R T G R K R - W N M L H - L N S K T K L E Q E G N A S G T C S T S S T L K L S - . . . . . . 1221 ATATAAAACAGTGGCATCCTCGAAAGCATGACGACCTACCAACTCCGAACGAATGCTCGA I - N S G I L E S M T T Y Q L R T N A R Y K T V A S S K A - R P T N S E R M L D N I K Q W H P R K H D D L P T P N E C S . . . . . . 1281 CGTTTGGATAATTGCAACGATGATCTTGTAGCTCTATCGTCCATCTGTGTCTGCACCTAA R L D N C N D D L V A L S S I C V C T - V W I I A T M I L - L Y R P S V S A P K T F G - L Q R - S C S S I V H L C L H L . . . . . . 1341 AAATAGTAGAGTTTGTATAGGGTTAGTACACACTTTTAATAAGTATGGGTATATGCAAGA K - - S L Y R V S T H F - - V W V Y A R N S R V C I G L V H T F N K Y G Y M Q E K I V E F V - G - Y T L L I S M G I C K . . . . . . 1401 ACACACCACGAATATGCATGAGAAAGAATAACTCTTTCTTAACAACATGACTTTTTGGAA T H H E Y A - E R I T L S - Q H D F L E H T T N M H E K E - L F L N N M T F W K N T P R I C M R K N N S F L T T - L F G . . . . . . 1461 GTCAAGTCAGTGGACTTGCCAAATTTAGATTAGGAGAGTTACCAAATTTGGAATAGGAAA V K S V D L P N L D - E S Y Q I W N R K S S Q W T C Q I - I R R V T K F G I G K S Q V S G L A K F R L G E L P N L E - E . . . . : . . 1521 GTCAATGAGCTTTCCAAATTTGGAATAGGAAA : GTTATGCCATGAGTTTAACACACATCAT V N E L S K F G I G K : L C H E F N T H H S M S F P N L E - E : S Y A M S L T H I I S Q - A F Q I W N R K : V M P - V - H T S . . . . . . 1613 CATACTTTGCACCTTTGCACACACCACATAACATTTACACATAGCACATATCATATAGCA H T L H L C T H H I T F T H S T Y H I A I L C T F A H T T - H L H I A H I I - H S Y F A P L H T P H N I Y T - H I S Y S . . . . . . 1673 CACTGCACAATTTGCATGAAGCACATATTTTCTTTAATATCATTCATTCATATGCCATAA H C T I C M K H I F S L I S F I H M P - T A Q F A - S T Y F L - Y H S F I C H K T L H N L H E A H I F F N I I H S Y A I . . . . . . 1733 GACCTTTGGATCATGGACTTAATGTTAAGACATCCCATAAATGAGGTCTCAATAGATGGG D L W I M D L M L R H P I N E V S I D G T F G S W T - C - D I P - M R S Q - M G R P L D H G L N V K T S H K - G L N R W . . . . . . 1793 ACCTCAACTAGAGAGTCTTCATTAGCAAACACAGAATCTGTTTCGTTCATTCATACGTAC T S T R E S S L A N T E S V S F I H T Y P Q L E S L H - Q T Q N L F R S F I R T D L N - R V F I S K H R I C F V H S Y V . . . . . . 1853 TCCATTTCATTTCATTCATAGGCCAGTATAAACACCAGCTCTACCTAGGATGTAGTTTTA S I S F H S - A S I N T S S T - D V V L P F H F I H R P V - T P A L P R M - F - L H F I S F I G Q Y K H Q L Y L G C S F . . . . . . 1913 GACTTTCATTAAATTCGTCATGAAATGACCAAGAATGACCTAATGTCATTACTTGAATCT D F H - I R H E M T K N D L M S L L E S T F I K F V M K - P R M T - C H Y L N L R L S L N S S - N D Q E - P N V I T - I . . . . . . 1973 AACTCACCTTTTGATTACCCTATCCTAATACCTTTGCTATCATTCATTTCATTATGT N S P F D Y P I L I P L L S F I S L C T H L L I T L S - Y L C Y H S F H Y - L T F - L P Y P N T F A I I H F I M Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-5+_PGL-1_AGS-2_PPS_1 (1494 1552,1585 1732) (frame '1'; 204 bp, 68 residues) 1 ESYQIWNRKV NELSKFGIGK LCHEFNTHHH TLHLCTHHIT FTHSTYHIAH CTICMKHIFS 61 LISFIHMP- PGL 2 (+ strand): 4485 5806 AGS-1 (4485 5129,5259 5370,5632 5657) SCR (e 0.883 d 0.000 a 0.000,e 0.902 d 0.000 a 0.977,e 0.654) Exon 1 4485 5129 ( 645 n); score: 0.883 Intron 1 5130 5258 ( 129 n); Pd: 0.000 Pa: 0.000 Exon 2 5259 5370 ( 112 n); score: 0.902 Intron 2 5371 5631 ( 261 n); Pd: 0.000 Pa: 0.977 Exon 3 5632 5657 ( 26 n); score: 0.654 PGS (4485 4856) SGN-E255327- PGS (4485 4856) SGN-E369760- PGS (4491 4856) SGN-E262710- PGS (4491 4827) SGN-E254845- PGS (4491 4826) SGN-E273518- PGS (4491 4730) SGN-E263584- PGS (4491 4688) SGN-E261066- PGS (4491 4688) SGN-E276669- PGS (4517 4848) SGN-E262800- PGS (4517 4796) SGN-E258205- PGS (4635 4856) SGN-E261310- PGS (4828 5129,5259 5370,5632 5657) SGN-E395007- PGS (5016 5129,5259 5370,5632 5657) SGN-E250408- PGS (5066 5129,5259 5376) SGN-E250410- 3-phase translation of AGS-1 (+strand): . . . . . . 4485 AACCTGTATTTATAAGGTTAACCAATAAACAGAAATAGAAATAAGATGAAACAGAAAATA N L Y L - G - P I N R N R N K M K Q K I T C I Y K V N Q - T E I E I R - N R K Y P V F I R L T N K Q K - K - D E T E N . . . . . . 4545 CATAAAATACAATAAGTAATCCGAGTCTACAAAAACTACTATGTGTCCTTAAGAAATTTA H K I Q - V I R V Y K N Y Y V S L R N L I K Y N K - S E S T K T T M C P - E I - T - N T I S N P S L Q K L L C V L K K F . . . . . . 4605 ATCCCCTCACTGTACACAAGGTTATGGATTAATTTCTCCCAAGATAAAATGGATTAAACC I P S L Y T R L W I N F S Q D K M D - T S P H C T Q G Y G L I S P K I K W I K P N P L T V H K V M D - F L P R - N G L N . . . . . . 4665 TGTTAAAGAAATAGCAGCACCTCAGATTTCTTTAACTAAAGCGAAATTCAGAACAACAAC C - R N S S T S D F F N - S E I Q N N N V K E I A A P Q I S L T K A K F R T T T L L K K - Q H L R F L - L K R N S E Q Q . . . . . . 4725 AAGTCACATAGACTCAGTCGATCGACACTTTGATTTATTTGAGAGAAAAATATATGCAGA K S H R L S R S T L - F I - E K N I C R S H I D S V D R H F D L F E R K I Y A E Q V T - T Q S I D T L I Y L R E K Y M Q . . . . . . 4785 GAAGGAAAATTTTAGTGTTTGAAAAATCAAAAATTGACTTCCTTTTATAGCCATTTTCAG E G K F - C L K N Q K L T S F Y S H F Q K E N F S V - K I K N - L P F I A I F S R R K I L V F E K S K I D F L L - P F S . . . . . . 4845 CAAGAAACGTGTATGTTCAAAGAAATCTGTTCAGACCCGTTTTATCCAGAAAGTTGTGTC Q E T C M F K E I C S D P F Y P E S C V K K R V C S K K S V Q T R F I Q K V V S A R N V Y V Q R N L F R P V L S R K L C . . . . . . 4905 TTTTGGAAAAAATAACAAGTTTTTGGAAAAATGTGTCCGTTAGGAAAATAACGGCTTTTT F W K K - Q V F G K M C P L G K - R L F F G K N N K F L E K C V R - E N N G F L L L E K I T S F W K N V S V R K I T A F . . . . . . 4965 GGAAAGTAAGGACTTTTCGGAAAGAGTAATAACTTTTCGGAATGTTACCATTAAGACATA G K - G L F G K S N N F S E C Y H - D I E S K D F S E R V I T F R N V T I K T - W K V R T F R K E - - L F G M L P L R H . . . . . . 5025 ATATTAACAAGATTTATTTGATTTAACAAAAACTGATTAAATAAATTTTGTCCAAAAAAT I L T R F I - F N K N - L N K F C P K N Y - Q D L F D L T K T D - I N F V Q K I N I N K I Y L I - Q K L I K - I L S K K . . . . . : . 5085 TTATCAATCAATCACATCATTTGCCAAATCCAAATCCAAATCCAA : AGAAATGACTTTTCA L S I N H I I C Q I Q I Q I Q : R N D F S Y Q S I T S F A K S K S K S K : E M T F H F I N Q S H H L P N P N P N P : K K - L F . . . . . . 5274 TTTGCACTTTGTAAATGACTTTTCATTTTCCCTCCAAAATAGTTCCCTTACTTTTCATAT F A L C K - L F I F P P K - F P Y F S Y L H F V N D F S F S L Q N S S L T F H I I C T L - M T F H F P S K I V P L L F I . . . . : . . 5334 TCTCTCTTTTCTTTTCTCATTCACACATGTTAAATCT : CCACGAAAACATCTTGGATAGTA S L F S F L I H T C - I : S T K T S W I V L S F L F S F T H V K S : P R K H L G - - F S L F F S H S H M L N L : H E N I L D S . 5655 AGT S K Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-5+_PGL-2_AGS-1_PPS_1 (4606 4806) (frame '2'; 198 bp, 66 residues) 1 SPHCTQGYGL ISPKIKWIKP VKEIAAPQIS LTKAKFRTTT SHIDSVDRHF DLFERKIYAE 61 KENFSV- >C06HBa0153O03.1-5+_PGL-2_AGS-1_PPS_2 (5065 5129,5259 5370,5632 5652) (frame '2'; 195 bp, 65 residues) 1 INFVQKIYQS ITSFAKSKSK SKEMTFHLHF VNDFSFSLQN SSLTFHILSF LFSFTHVKSP 61 RKHLG- AGS-2 (5018 5123,5154 5806) SCR (e 0.955 d 0.000 a 0.000,e 0.852) Exon 1 5018 5123 ( 106 n); score: 0.955 Intron 1 5124 5153 ( 30 n); Pd: 0.000 Pa: 0.000 Exon 2 5154 5806 ( 653 n); score: 0.852 PGS (5018 5123,5154 5372) SGN-E548743- PGS (5018 5123,5154 5366) SGN-E301922- PGS (5020 5123,5154 5366) SGN-E542859- PGS (5024 5123,5154 5366) SGN-E301820- PGS (5261 5806) SGN-E357316- 3-phase translation of AGS-2 (+strand): . . . . . . 5018 AGACATAATATTAACAAGATTTATTTGATTTAACAAAAACTGATTAAATAAATTTTGTCC R H N I N K I Y L I - Q K L I K - I L S D I I L T R F I - F N K N - L N K F C P T - Y - Q D L F D L T K T D - I N F V . . . . . : . 5078 AAAAAATTTATCAATCAATCACATCATTTGCCAAATCCAAATCCAA : GACGACGACGGCGC K K F I N Q S H H L P N P N P : R R R R R K N L S I N H I I C Q I Q I Q : D D D G A Q K I Y Q S I T S F A K S K S K : T T T A . . . . . . 5168 GAGGGGGCATCTTCTTCTTAGCTCTTTAAGAATTAATGGAAGTGTTTCCTTATATAAGGA E G A S S S - L F K N - W K C F L I - G R G H L L L S S L R I N G S V S L Y K D R G G I F F L A L - E L M E V F P Y I R . . . . . . 5228 CAACAATTTCCCTTTCTTTTGATGACATAGGAGAAATGACTTTTCATTTGCACTTTGTAA Q Q F P F L L M T - E K - L F I C T L - N N F P F F - - H R R N D F S F A L C K T T I S L S F D D I G E M T F H L H F V . . . . . . 5288 ATGACTTTTCATTTTCCCTCCAAAATAGTTCCCTTACTTTTCATATTCTCTCTTTTCTTT M T F H F P S K I V P L L F I F S L F F - L F I F P P K - F P Y F S Y S L F S F N D F S F S L Q N S S L T F H I L S F L . . . . . . 5348 TCTCATTCACACATGTTAAATCTAACAATCCCCCACGTGAATAGGGAAGGCTATTGTTAA S H S H M L N L T I P H V N R E G Y C - L I H T C - I - Q S P T - I G K A I V K F S F T H V K S N N P P R E - G R L L L . . . . . . 5408 AACATATGCATGAAAAACTTGTGTGTCTTCTGGTAAAGGCTAATCGCATCTGGATAAGTA N I C M K N L C V F W - R L I A S G - V T Y A - K T C V S S G K G - S H L D K - K H M H E K L V C L L V K A N R I W I S . . . . . . 5468 GATTTCCCTTTAAACTTTCCGTAGTGAACATATATCGGATATACTCGGTCAATTGGTAGA D F P L N F P - - T Y I G Y T R S I G R I S L - T F R S E H I S D I L G Q L V D R F P F K L S V V N I Y R I Y S V N W - . . . . . . 5528 TTTGATATCTTTGAACCGTCGAGCTTTGTTATATACCTAGACAACATATGTCACACAATC F D I F E P S S F V I Y L D N I C H T I L I S L N R R A L L Y T - T T Y V T Q S I - Y L - T V E L C Y I P R Q H M S H N . . . . . . 5588 AACCCTTGAACTGTTCTTAGTTCTCATTGTTTTGTTCGTTTCAGCCACGAAAACATCTTG N P - T V L S S H C F V R F S H E N I L T L E L F L V L I V L F V S A T K T S W Q P L N C S - F S L F C S F Q P R K H L . . . . . . 5648 GATAGTAAGTGCTTAAAGAGCTGGCCTTACCGGATTCTCCTTGAAGCGGCTTACACTTCA D S K C L K S W P Y R I L L E A A Y T S I V S A - R A G L T G F S L K R L T L H G - - V L K E L A L P D S P - S G L H F . . . . . . 5708 CACTTACATAGGTGATTTCTAAATGTGTTATCCCATAGATATACCATTTGATATTCCATG H L H R - F L N V L S H R Y T I - Y S M T Y I G D F - M C Y P I D I P F D I P C T L T - V I S K C V I P - I Y H L I F H . . . . 5768 TATCAAACTTAGAAACCATTAAAAAGTCCTTACGTCTTT Y Q T - K P L K S P Y V F I K L R N H - K V L T S V S N L E T I K K S L R L Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-5+_PGL-2_AGS-2_PPS_1 (5197 5391) (frame '0'; 192 bp, 64 residues) 1 ELMEVFPYIR TTISLSFDDI GEMTFHLHFV NDFSFSLQNS SLTFHILSFL FSFTHVKSNN 61 PPRE- AGS-3 (5098 5806) SCR (e 0.836) Exon 1 5098 5806 ( 709 n); score: 0.836 PGS (5098 5806) SGN-E351952- 3-phase translation of AGS-3 (+strand): . . . . . . 5098 ACATCATTTGCCAAATCCAAATCCAAATCCAAATCCTAAGCCGAAGCCGAGCGAACGACG T S F A K S K S K S K S - A E A E R T T H H L P N P N P N P N P K P K P S E R R I I C Q I Q I Q I Q I L S R S R A N D . . . . . . 5158 ACGACGGCGCGAGGGGGCATCTTCTTCTTAGCTCTTTAAGAATTAATGGAAGTGTTTCCT T T A R G G I F F L A L - E L M E V F P R R R E G A S S S - L F K N - W K C F L D D G A R G H L L L S S L R I N G S V S . . . . . . 5218 TATATAAGGACAACAATTTCCCTTTCTTTTGATGACATAGGAGAAATGACTTTTCATTTG Y I R T T I S L S F D D I G E M T F H L I - G Q Q F P F L L M T - E K - L F I C L Y K D N N F P F F - - H R R N D F S F . . . . . . 5278 CACTTTGTAAATGACTTTTCATTTTCCCTCCAAAATAGTTCCCTTACTTTTCATATTCTC H F V N D F S F S L Q N S S L T F H I L T L - M T F H F P S K I V P L L F I F S A L C K - L F I F P P K - F P Y F S Y S . . . . . . 5338 TCTTTTCTTTTCTCATTCACACATGTTAAATCTAACAATCCCCCACGTGAATAGGGAAGG S F L F S F T H V K S N N P P R E - G R L F F S H S H M L N L T I P H V N R E G L F S F L I H T C - I - Q S P T - I G K . . . . . . 5398 CTATTGTTAAAACATATGCATGAAAAACTTGTGTGTCTTCTGGTAAAGGCTAATCGCATC L L L K H M H E K L V C L L V K A N R I Y C - N I C M K N L C V F W - R L I A S A I V K T Y A - K T C V S S G K G - S H . . . . . . 5458 TGGATAAGTAGATTTCCCTTTAAACTTTCCGTAGTGAACATATATCGGATATACTCGGTC W I S R F P F K L S V V N I Y R I Y S V G - V D F P L N F P - - T Y I G Y T R S L D K - I S L - T F R S E H I S D I L G . . . . . . 5518 AATTGGTAGATTTGATATCTTTGAACCGTCGAGCTTTGTTATATACCTAGACAACATATG N W - I - Y L - T V E L C Y I P R Q H M I G R F D I F E P S S F V I Y L D N I C Q L V D L I S L N R R A L L Y T - T T Y . . . . . . 5578 TCACACAATCAACCCTTGAACTGTTCTTAGTTCTCATTGTTTTGTTCGTTTCAGCCACGA S H N Q P L N C S - F S L F C S F Q P R H T I N P - T V L S S H C F V R F S H E V T Q S T L E L F L V L I V L F V S A T . . . . . . 5638 AAACATCTTGGATAGTAAGTGCTTAAAGAGCTGGCCTTACCGGATTCTCCTTGAAGCGGC K H L G - - V L K E L A L P D S P - S G N I L D S K C L K S W P Y R I L L E A A K T S W I V S A - R A G L T G F S L K R . . . . . . 5698 TTACACTTCACACTTACATAGGTGATTTCTAAATGTGTTATCCCATAGATATACCATTTG L H F T L T - V I S K C V I P - I Y H L Y T S H L H R - F L N V L S H R Y T I - L T L H T Y I G D F - M C Y P I D I P F . . . . . 5758 ATATTCCATGTATCAAACTTAGAAACCATTAAAAAGTCCTTACGTCTTT I F H V S N L E T I K K S L R L Y S M Y Q T - K P L K S P Y V F D I P C I K L R N H - K V L T S Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-5+_PGL-2_AGS-3_PPS_1 (5197 5391) (frame '1'; 192 bp, 64 residues) 1 ELMEVFPYIR TTISLSFDDI GEMTFHLHFV NDFSFSLQNS SLTFHILSFL FSFTHVKSNN 61 PPRE- 3-phase translation of AGS-3 (-strand): . . . . . . 5806 AAAGACGTAAGGACTTTTTAATGGTTTCTAAGTTTGATACATGGAATATCAAATGGTATA K D V R T F - W F L S L I H G I S N G I K T - G L F N G F - V - Y M E Y Q M V Y R R K D F L M V S K F D T W N I K W Y . . . . . . 5746 TCTATGGGATAACACATTTAGAAATCACCTATGTAAGTGTGAAGTGTAAGCCGCTTCAAG S M G - H I - K S P M - V - S V S R F K L W D N T F R N H L C K C E V - A A S R I Y G I T H L E I T Y V S V K C K P L Q . . . . . . 5686 GAGAATCCGGTAAGGCCAGCTCTTTAAGCACTTACTATCCAAGATGTTTTCGTGGCTGAA E N P V R P A L - A L T I Q D V F V A E R I R - G Q L F K H L L S K M F S W L K G E S G K A S S L S T Y Y P R C F R G - . . . . . . 5626 ACGAACAAAACAATGAGAACTAAGAACAGTTCAAGGGTTGATTGTGTGACATATGTTGTC T N K T M R T K N S S R V D C V T Y V V R T K Q - E L R T V Q G L I V - H M L S N E Q N N E N - E Q F K G - L C D I C C . . . . . . 5566 TAGGTATATAACAAAGCTCGACGGTTCAAAGATATCAAATCTACCAATTGACCGAGTATA - V Y N K A R R F K D I K S T N - P S I R Y I T K L D G S K I S N L P I D R V Y L G I - Q S S T V Q R Y Q I Y Q L T E Y . . . . . . 5506 TCCGATATATGTTCACTACGGAAAGTTTAAAGGGAAATCTACTTATCCAGATGCGATTAG S D I C S L R K V - R E I Y L S R C D - P I Y V H Y G K F K G K S T Y P D A I S I R Y M F T T E S L K G N L L I Q M R L . . . . . . 5446 CCTTTACCAGAAGACACACAAGTTTTTCATGCATATGTTTTAACAATAGCCTTCCCTATT P L P E D T Q V F H A Y V L T I A F P I L Y Q K T H K F F M H M F - Q - P S L F A F T R R H T S F S C I C F N N S L P Y . . . . . . 5386 CACGTGGGGGATTGTTAGATTTAACATGTGTGAATGAGAAAAGAAAAGAGAGAATATGAA H V G D C - I - H V - M R K E K R E Y E T W G I V R F N M C E - E K K R E N M K S R G G L L D L T C V N E K R K E R I - . . . . . . 5326 AAGTAAGGGAACTATTTTGGAGGGAAAATGAAAAGTCATTTACAAAGTGCAAATGAAAAG K - G N Y F G G K M K S H L Q S A N E K S K G T I L E G K - K V I Y K V Q M K S K V R E L F W R E N E K S F T K C K - K . . . . . . 5266 TCATTTCTCCTATGTCATCAAAAGAAAGGGAAATTGTTGTCCTTATATAAGGAAACACTT S F L L C H Q K K G K L L S L Y K E T L H F S Y V I K R K G N C C P Y I R K H F V I S P M S S K E R E I V V L I - G N T . . . . . . 5206 CCATTAATTCTTAAAGAGCTAAGAAGAAGATGCCCCCTCGCGCCGTCGTCGTCGTTCGCT P L I L K E L R R R C P L A P S S S F A H - F L K S - E E D A P S R R R R R S L S I N S - R A K K K M P P R A V V V V R . . . . . 5146 CGGCTTCGGCTTAGGATTTGGATTTGGATTTGGATTTGGCAAATGATGT R L R L R I W I W I W I W Q M M G F G L G F G F G F G F G K - C S A S A - D L D L D L D L A N D Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-5-_PGL-2_AGS-3_PPS_1 (5555 5328) (frame '0'; 225 bp, 75 residues) 1 QSSTVQRYQI YQLTEYIRYM FTTESLKGNL LIQMRLAFTR RHTSFSCICF NNSLPYSRGG 61 LLDLTCVNEK RKERI- >C06HBa0153O03.1-5-_PGL-2_AGS-3_PPS_2 (5320 5099) (frame '1'; 222 bp, 74 residues) 1 GNYFGGKMKS HLQSANEKSF LLCHQKKGKL LSLYKETLPL ILKELRRRCP LAPSSSFARL 61 RLRIWIWIWI WQMM PGL 3 (+ strand): 6567 7118 AGS-1 (6567 7118) SCR (e 0.943) Exon 1 6567 7118 ( 552 n); score: 0.943 PGS (6567 6730) SGN-E251023- PGS (6567 6730) SGN-E391663- PGS (6567 6730) SGN-E331585- PGS (6567 6730) SGN-E255789- PGS (6567 6685) SGN-E269527- PGS (6567 6637) SGN-E250948- PGS (6584 6730) SGN-E396524- PGS (6625 6730) SGN-E302012- PGS (6625 6730) SGN-E247286- PGS (6649 7118) SGN-E577892- 3-phase translation of AGS-1 (+strand): . . . . . . 6567 TTTACTGATAGCGCAAGCTATAGTGGTGGGCATTCGGTATTTCGGTTCGGTTCGGTTTTT F T D S A S Y S G G H S V F R F G S V F L L I A Q A I V V G I R Y F G S V R F F Y - - R K L - W W A F G I S V R F G F . . . . . . 6627 TTTTCGGTTTTCGGTTTTTGTAAATTGCGTACCGAATACCGAACCGAAATATTTTGGTTC F S V F G F C K L R T E Y R T E I F W F F R F S V F V N C V P N T E P K Y F G S F F G F R F L - I A Y R I P N R N I L V . . . . . . 6687 GGTTCGGTTTTTGTTAATTCGGTTCGGTTTTTATTAATTCGGTTCAATTTTTTATTTTGG G S V F V N S V R F L L I R F N F L F W V R F L L I R F G F Y - F G S I F Y F G R F G F C - F G S V F I N S V Q F F I L . . . . . . 6747 TTTTTTAATGGGCCTGTTTAGTGGGCTTTTTAAACTTAACAATTTTTTCAATTTTTCGTG F F N G P V - W A F - T - Q F F Q F F V F L M G L F S G L F K L N N F F N F S - V F - W A C L V G F L N L T I F S I F R . . . . . . 6807 AGATTATACAATTTTTTGGGCTTTGATATTTCAACTTAATGGGCTTTGATATTTCAACTT R L Y N F L G F D I S T - W A L I F Q L D Y T I F W A L I F Q L N G L - Y F N L E I I Q F F G L - Y F N L M G F D I S T . . . . . . 6867 AGGGTTTCTTATTTGTTTTAAAAAGATTATTTAATATTAATAATTATTAAATATAATATA R V S Y L F - K D Y L I L I I I K Y N I G F L I C F K K I I - Y - - L L N I I - - G F L F V L K R L F N I N N Y - I - Y . . . . . . 6927 ATTTATAAATTATAATTTAATATTTCGGTTTAAACCGAAATACCGAATTCCAAAGTAACA I Y K L - F N I S V - T E I P N S K V T F I N Y N L I F R F K P K Y R I P K - H N L - I I I - Y F G L N R N T E F Q S N . . . . . . 6987 TGTACCGAAAACCGAACCGAAATGCCGAAAATACAAAAAATTAAACCGAATACTGAACCG C T E N R T E M P K I Q K I K P N T E P V P K T E P K C R K Y K K L N R I L N R M Y R K P N R N A E N T K N - T E Y - T . . . . . . 7047 AAAACCGAAATACCAAAACCGAAATTCTGAAAAATTTTGGTTCGGTTCGGTGTTTCGGTT K T E I P K P K F - K I L V R F G V S V K P K Y Q N R N S E K F W F G S V F R F E N R N T K T E I L K N F G S V R C F G . . 7107 TTCCACTCTTTA F H S L S T L F P L F Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-5+_PGL-3_AGS-1_PPS_1 (6567 6767) (frame '1'; 198 bp, 66 residues) 1 FTDSASYSGG HSVFRFGSVF FSVFGFCKLR TEYRTEIFWF GSVFVNSVRF LLIRFNFLFW 61 FFNGPV- 3-phase translation of AGS-1 (-strand): . . . . . . 7118 TAAAGAGTGGAAAACCGAAACACCGAACCGAACCAAAATTTTTCAGAATTTCGGTTTTGG - R V E N R N T E P N Q N F S E F R F W K E W K T E T P N R T K I F Q N F G F G K S G K P K H R T E P K F F R I S V L . . . . . . 7058 TATTTCGGTTTTCGGTTCAGTATTCGGTTTAATTTTTTGTATTTTCGGCATTTCGGTTCG Y F G F R F S I R F N F L Y F R H F G S I S V F G S V F G L I F C I F G I S V R V F R F S V Q Y S V - F F V F S A F R F . . . . . . 6998 GTTTTCGGTACATGTTACTTTGGAATTCGGTATTTCGGTTTAAACCGAAATATTAAATTA V F G T C Y F G I R Y F G L N R N I K L F S V H V T L E F G I S V - T E I L N Y G F R Y M L L W N S V F R F K P K Y - I . . . . . . 6938 TAATTTATAAATTATATTATATTTAATAATTATTAATATTAAATAATCTTTTTAAAACAA - F I N Y I I F N N Y - Y - I I F L K Q N L - I I L Y L I I I N I K - S F - N K I I Y K L Y Y I - - L L I L N N L F K T . . . . . . 6878 ATAAGAAACCCTAAGTTGAAATATCAAAGCCCATTAAGTTGAAATATCAAAGCCCAAAAA I R N P K L K Y Q S P L S - N I K A Q K - E T L S - N I K A H - V E I S K P K K N K K P - V E I S K P I K L K Y Q S P K . . . . . . 6818 ATTGTATAATCTCACGAAAAATTGAAAAAATTGTTAAGTTTAAAAAGCCCACTAAACAGG I V - S H E K L K K L L S L K S P L N R L Y N L T K N - K N C - V - K A H - T G N C I I S R K I E K I V K F K K P T K Q . . . . . . 6758 CCCATTAAAAAACCAAAATAAAAAATTGAACCGAATTAATAAAAACCGAACCGAATTAAC P I K K P K - K I E P N - - K P N R I N P L K N Q N K K L N R I N K N R T E L T A H - K T K I K N - T E L I K T E P N - . . . . . . 6698 AAAAACCGAACCGAACCAAAATATTTCGGTTCGGTATTCGGTACGCAATTTACAAAAACC K N R T E P K Y F G S V F G T Q F T K T K T E P N Q N I S V R Y S V R N L Q K P Q K P N R T K I F R F G I R Y A I Y K N . . . . . . 6638 GAAAACCGAAAAAAAAACCGAACCGAACCGAAATACCGAATGCCCACCACTATAGCTTGC E N R K K N R T E P K Y R M P T T I A C K T E K K T E P N R N T E C P P L - L A R K P K K K P N R T E I P N A H H Y S L . . 6578 GCTATCAGTAAA A I S K L S V R Y Q - Maximal non-overlapping open reading frames (>= 64 codons): none ... finished at: Mon Aug 28 22:20:55 2006 ________________________________________________________________________________ Sequence 6: C06HBa0153O03.1-6, from 1 to 3430, both strands analyzed. ... started at: Mon Aug 28 22:20:55 2006 EST library file: /tmp/cxgn-bacpublish-resources-T4dD4R/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 3 EST library file: /tmp/cxgn-bacpublish-resources-T4dD4R/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 7 ******************************************************************************** EST sequence 4 +strand 597 n (File: SGN-E350035+) 1 GAAAAAAAAC TAGAAAAGGG TTGGGCGAGA ATAACGTCTT CTTCCGGTGG TTGGAAAAAA 61 ATGAATTTTT CTTCAGATTG CTTGAGTTGG AAGTTCAGCA AAAGGGGAAA ACCCAAGTCT 121 CGGAGTGAAT TCTCGATTGG TTAAGGTTAT GAGCCTGGAT TTTTGTTGTA CTTGTCATTC 181 TCTTCTTTTC CAAGGCTTCT TGGAGATTTG TCAGGTAGCT ATTTCCATCG CAGCAGACTT 241 TCTTCTCCTT AATTATGATC TTGTTCTATT CTAGAAACAA GTTCATTTGA GACTTGAGTT 301 TTTCTTTTGA ATCAATTGTA ATACTTTAGA GGCTTGTACA CGTGACTACC AGGTTTTGGG 361 GGGTCTTATT AAGTTACTTA TATTTTATTT CCGCACTTTA TGGTAATGGT TGAGTTTTAG 421 GCTGACTTGT CTTGGTGGGA TAAGACGAGT GCCATCACGT CCATTTTTGG GTCGTGACAC 481 ATCTACTAAA AGTATTTAGC TACTGGTGAA ATATGTAAAG ATGTTTGAGA CTTTCATTTC 541 CCTCATTCAG TTTCTTTCAT TTAATTCTTA CATGAAGTTT AAGTTCAAAA AAAAAAA Predicted gene structure (within gDNA segment 3430 to 1): Exon 1 2442 2373 ( 70 n); cDNA 77 145 ( 69 n); score: 0.757 Intron 1 2372 1404 ( 969 n); Pd: 0.978 (s: 0.72), Pa: 0.000 (s: 0.94) Exon 2 1403 1071 ( 333 n); cDNA 146 479 ( 334 n); score: 0.941 PPA cDNA 587 597 MATCH C06HBa0153O03.1-6- SGN-E350035+ 0.909 403 0.675 C PGS_C06HBa0153O03.1-6-_SGN-E350035+ (2442 2373,1403 1071) Alignment (genomic DNA sequence = upper lines): ATTGCTTTGA GTTTGGAGTT CAACAAAATG GGAAGGCTCA AGTTTTAAAG TGAATTCTTG 2383 ||||| |||| ||| | |||| || ||||| | |||| | || ||| | || |||||||| | ATTGC-TTGA GTTGGAAGTT CAGCAAAAGG GGAAAACCCA AGTCTCGGAG TGAATTCTCG 135 ACTAATTGAG GCAAGTGGAT TTCTAAACTC TTGTTAAGTG TATGAAATGT GTGTATTTCC 2323 | | || || ATTGGTTAAG .......... .......... .......... .......... .......... 145 TTGTGGTATA TGTTTGGGAG TAACGGGATT GGTGATGGAT TGACTTGCCC ACATTGATTA 2263 .......... .......... .......... .......... .......... .......... 145 ATTCTAATAA TGAAAAAAAG GGATAACAAA AGGCAATGTG ATTAATTGTT TATGTGTGAT 2203 .......... .......... .......... .......... .......... .......... 145 GTGTTGAGAA AGGGTTGAAC GCTTGTTGAA TCATTGTTGA TGTGATTTCA TGTTTGTGTG 2143 .......... .......... .......... .......... .......... .......... 145 GTTGTGAACT CTGCAATGTT ATGAAAATGG TCATCTCCTC ATTATTTGTG TGAACGTGTC 2083 .......... .......... .......... .......... .......... .......... 145 ATCTGCATTA TTATGAGGCA TTGGTTGTGA CAATTGTTAT GTGCATTGAG AAAGAATGGG 2023 .......... .......... .......... .......... .......... .......... 145 TACTTAAGAG GATGTACCAT TTCGAGGGAC GTATCGCGCG CCGCGATGGA TACTATATTT 1963 .......... .......... .......... .......... .......... .......... 145 CGAGGGACAT ATCGCGCGTC GCGATGGATA CTATATTTCG AGAGACGTAT CGCGCGCCAT 1903 .......... .......... .......... .......... .......... .......... 145 GATGGTTACT ATTATCGAGG GTCATGTCGT GCTCCGCGAC AGATGCATGA ACAGATATGT 1843 .......... .......... .......... .......... .......... .......... 145 CCCCTATGGG TCCCGGACTG AGAGACAGCG AGTGTATGTC ACTAGGTCAG ACATGCATCA 1783 .......... .......... .......... .......... .......... .......... 145 TTATACTTGA CATTGTATTC CATTGTATTG CACATATTTA TCATTAGTGA ACTTGATATC 1723 .......... .......... .......... .......... .......... .......... 145 GTGTTTTGCT GATCTTATGA TTACCATTCT GTGGAACTTG TGATTGATCA ATATTGAGCT 1663 .......... .......... .......... .......... .......... .......... 145 TGTTATTGCG AATATGTAAT TTGTTGAAGT GTTGTTGTTG AGGATATGTA ATTTGTCTAA 1603 .......... .......... .......... .......... .......... .......... 145 GTGTTGTTGT TGAGCTACGT GCTGTGTAAA CTGTGAGTTG TTAGGTTGGG TTGATTTTTA 1543 .......... .......... .......... .......... .......... .......... 145 ATGCAGGTTG TAGTTGTGGA GGTCCGGTTG GGGGTGGTAG GAGTACCCGT ATTTCATCCC 1483 .......... .......... .......... .......... .......... .......... 145 TTTAGCTTGT GTTTAGAGGT TTACTTGCTG AGTACCGTGT GGTTTGGTAC TCACCCCTTG 1423 .......... .......... .......... .......... .......... .......... 145 CTTCTACAAA TTTTTGTAGG TTATGAGCCT GGATTTTTGT TGTACTTGTC ATTCTCTTCT 1363 | |||||||||| |||||||||| |||||||||| |||||||||| .......... .........G TTATGAGCCT GGATTTTTGT TGTACTTGTC ATTCTCTTCT 186 TTTTTTAGGC TTCTTGGAGA TTTGTCAAGT AGTTGTTTCC ATCGCAGCAG ACTTTTTTCT 1303 ||| |||| |||||||||| ||||||| || || | ||||| |||||||||| ||||| |||| TTTCCAAGGC TTCTTGGAGA TTTGTCAGGT AGCTATTTCC ATCGCAGCAG ACTTTCTTCT 246 CCTTTATTAT GATATTGTTC TATTCTAGAA ACAAGTTCAT TTGAGACTTG ATTTTTTCTT 1243 |||| ||||| ||| |||||| |||||||||| |||||||||| |||||||||| | |||||||| CCTTAATTAT GATCTTGTTC TATTCTAGAA ACAAGTTCAT TTGAGACTTG AGTTTTTCTT 306 TTGAATCAAT TGTAATACTT TAGAGGCTTG TACACGTGAC TACCAGGTTT T-GGGGTTTT 1184 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| | |||| | | TTGAATCAAT TGTAATACTT TAGAGGCTTG TACACGTGAC TACCAGGTTT TGGGGGGTCT 366 TATTAAGTTA CTTATATTTT ATTTCCGCAC TTTATTGTAA TGATTGAGTT TTAGGCTGAC 1124 |||||||||| |||||||||| |||||||||| ||||| |||| || ||||||| |||||||||| TATTAAGTTA CTTATATTTT ATTTCCGCAC TTTATGGTAA TGGTTGAGTT TTAGGCTGAC 426 TTGTCTTGGT GGGATAAGAC GAGTGCCATC ACGCCTATTT TTGGGTCGTA ACA 1071 |||||||||| |||||||||| |||||||||| ||| | |||| ||||||||| ||| TTGTCTTGGT GGGATAAGAC GAGTGCCATC ACGTCCATTT TTGGGTCGTG ACA 479 hqPGS_C06HBa0153O03.1-6-_SGN-E350035+ (2442 2373,1403 1071) ******************************************************************************** EST sequence 5 +strand 576 n (File: SGN-E339561+) 1 AACTAGAAAA GGGTTGGGCG AGAATAACGT CTTCTTCCGG TGGTTGGAAA AAAATGAATT 61 TTTCTTCAGA TTGCTTGAGT TGGAAGTTCA GCAAAAGGGG AAAACCCAAG TCTCGGAGTG 121 AATTCTCGAT TGGTTAAGGT TATGAGCCTG GATTTTTGTT GTACTTGTCA TTCTCTTCTT 181 TTCCAAGGCT TCTTGGAGAT TTGTCAGGTA GCTATTTCCA TCGCAGCAGA CTTTCTTCTC 241 CTTAATTATG ATCTTGTTCT ATTCTAGAAA CAAGTTCATT TGAGACTTGA GTTTTTCTTT 301 TGAATCAATT GTAATACTTT AGAGGCTTGT ACACGTGACT ACCAGGTTTT GGGGGGTCTT 361 ATTAAGTTAC TTATATTTTA TTTCCGCACT TTATGGTAAT GGTTGAGTTT TAGGCTGACT 421 TGTCTTGGTG GGATAAGACG AGTGCCATCA CGTCCATTTT TGGGTCGTGA CACATCTACT 481 AAAAGTATTT AGCTACTGGT GAAATATGTA AAGATGTTTG AGACTTTCAT TTCCCTCATT 541 CAGTTTCTTT CATTTAATTC TTACATGAAG TTTAAG Predicted gene structure (within gDNA segment 3365 to 1): Exon 1 2442 2373 ( 70 n); cDNA 70 138 ( 69 n); score: 0.757 Intron 1 2372 1404 ( 969 n); Pd: 0.978 (s: 0.72), Pa: 0.000 (s: 0.94) Exon 2 1403 1071 ( 333 n); cDNA 139 472 ( 334 n); score: 0.941 MATCH C06HBa0153O03.1-6- SGN-E339561+ 0.909 403 0.700 C PGS_C06HBa0153O03.1-6-_SGN-E339561+ (2442 2373,1403 1071) Alignment (genomic DNA sequence = upper lines): ATTGCTTTGA GTTTGGAGTT CAACAAAATG GGAAGGCTCA AGTTTTAAAG TGAATTCTTG 2383 ||||| |||| ||| | |||| || ||||| | |||| | || ||| | || |||||||| | ATTGC-TTGA GTTGGAAGTT CAGCAAAAGG GGAAAACCCA AGTCTCGGAG TGAATTCTCG 128 ACTAATTGAG GCAAGTGGAT TTCTAAACTC TTGTTAAGTG TATGAAATGT GTGTATTTCC 2323 | | || || ATTGGTTAAG .......... .......... .......... .......... .......... 138 TTGTGGTATA TGTTTGGGAG TAACGGGATT GGTGATGGAT TGACTTGCCC ACATTGATTA 2263 .......... .......... .......... .......... .......... .......... 138 ATTCTAATAA TGAAAAAAAG GGATAACAAA AGGCAATGTG ATTAATTGTT TATGTGTGAT 2203 .......... .......... .......... .......... .......... .......... 138 GTGTTGAGAA AGGGTTGAAC GCTTGTTGAA TCATTGTTGA TGTGATTTCA TGTTTGTGTG 2143 .......... .......... .......... .......... .......... .......... 138 GTTGTGAACT CTGCAATGTT ATGAAAATGG TCATCTCCTC ATTATTTGTG TGAACGTGTC 2083 .......... .......... .......... .......... .......... .......... 138 ATCTGCATTA TTATGAGGCA TTGGTTGTGA CAATTGTTAT GTGCATTGAG AAAGAATGGG 2023 .......... .......... .......... .......... .......... .......... 138 TACTTAAGAG GATGTACCAT TTCGAGGGAC GTATCGCGCG CCGCGATGGA TACTATATTT 1963 .......... .......... .......... .......... .......... .......... 138 CGAGGGACAT ATCGCGCGTC GCGATGGATA CTATATTTCG AGAGACGTAT CGCGCGCCAT 1903 .......... .......... .......... .......... .......... .......... 138 GATGGTTACT ATTATCGAGG GTCATGTCGT GCTCCGCGAC AGATGCATGA ACAGATATGT 1843 .......... .......... .......... .......... .......... .......... 138 CCCCTATGGG TCCCGGACTG AGAGACAGCG AGTGTATGTC ACTAGGTCAG ACATGCATCA 1783 .......... .......... .......... .......... .......... .......... 138 TTATACTTGA CATTGTATTC CATTGTATTG CACATATTTA TCATTAGTGA ACTTGATATC 1723 .......... .......... .......... .......... .......... .......... 138 GTGTTTTGCT GATCTTATGA TTACCATTCT GTGGAACTTG TGATTGATCA ATATTGAGCT 1663 .......... .......... .......... .......... .......... .......... 138 TGTTATTGCG AATATGTAAT TTGTTGAAGT GTTGTTGTTG AGGATATGTA ATTTGTCTAA 1603 .......... .......... .......... .......... .......... .......... 138 GTGTTGTTGT TGAGCTACGT GCTGTGTAAA CTGTGAGTTG TTAGGTTGGG TTGATTTTTA 1543 .......... .......... .......... .......... .......... .......... 138 ATGCAGGTTG TAGTTGTGGA GGTCCGGTTG GGGGTGGTAG GAGTACCCGT ATTTCATCCC 1483 .......... .......... .......... .......... .......... .......... 138 TTTAGCTTGT GTTTAGAGGT TTACTTGCTG AGTACCGTGT GGTTTGGTAC TCACCCCTTG 1423 .......... .......... .......... .......... .......... .......... 138 CTTCTACAAA TTTTTGTAGG TTATGAGCCT GGATTTTTGT TGTACTTGTC ATTCTCTTCT 1363 | |||||||||| |||||||||| |||||||||| |||||||||| .......... .........G TTATGAGCCT GGATTTTTGT TGTACTTGTC ATTCTCTTCT 179 TTTTTTAGGC TTCTTGGAGA TTTGTCAAGT AGTTGTTTCC ATCGCAGCAG ACTTTTTTCT 1303 ||| |||| |||||||||| ||||||| || || | ||||| |||||||||| ||||| |||| TTTCCAAGGC TTCTTGGAGA TTTGTCAGGT AGCTATTTCC ATCGCAGCAG ACTTTCTTCT 239 CCTTTATTAT GATATTGTTC TATTCTAGAA ACAAGTTCAT TTGAGACTTG ATTTTTTCTT 1243 |||| ||||| ||| |||||| |||||||||| |||||||||| |||||||||| | |||||||| CCTTAATTAT GATCTTGTTC TATTCTAGAA ACAAGTTCAT TTGAGACTTG AGTTTTTCTT 299 TTGAATCAAT TGTAATACTT TAGAGGCTTG TACACGTGAC TACCAGGTTT T-GGGGTTTT 1184 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| | |||| | | TTGAATCAAT TGTAATACTT TAGAGGCTTG TACACGTGAC TACCAGGTTT TGGGGGGTCT 359 TATTAAGTTA CTTATATTTT ATTTCCGCAC TTTATTGTAA TGATTGAGTT TTAGGCTGAC 1124 |||||||||| |||||||||| |||||||||| ||||| |||| || ||||||| |||||||||| TATTAAGTTA CTTATATTTT ATTTCCGCAC TTTATGGTAA TGGTTGAGTT TTAGGCTGAC 419 TTGTCTTGGT GGGATAAGAC GAGTGCCATC ACGCCTATTT TTGGGTCGTA ACA 1071 |||||||||| |||||||||| |||||||||| ||| | |||| ||||||||| ||| TTGTCTTGGT GGGATAAGAC GAGTGCCATC ACGTCCATTT TTGGGTCGTG ACA 472 hqPGS_C06HBa0153O03.1-6-_SGN-E339561+ (2442 2373,1403 1071) ******************************************************************************** EST sequence 8 +strand 391 n (File: SGN-E344035+) 1 AGGAAAAAAA AATAGAAAAG AGTTGGGCGA GAATTACGTC TTCTTCCGGT GGTTGGAAAA 61 AATGAATTTT TCTTCAGATT GCTTGAGTTG GAAGTTCAGC AAAAGGGGAA AACCCAAGTC 121 TCGGAGTGAA TTCTCGATTG ATTGAGGTTA TGAGCCTGGA TTTTTGTTGT ACTTGTCATT 181 CTCTTCTTTT CCGAGGCTTC ATGGAGATTT GTCAGGTAGT TGTTTCCATC GCAGCAGACT 241 TTCTTCTCCG TAATTATGTT CTTGTTCTAT TCTAGAAACA AATTCATTTG AGACTTGAGT 301 TTTCTTTTGA ATCAATAGTA ATACTTTAGA GGCTTGTACA CGTGACAACC AGGTTTTGGG 361 TTATAATATA AGTCGATAGT AAAAAAAAAA A Predicted gene structure (within gDNA segment 3430 to 143): Exon 1 2442 2373 ( 70 n); cDNA 78 146 ( 69 n); score: 0.786 Intron 1 2372 1404 ( 969 n); Pd: 0.978 (s: 0.76), Pa: 0.000 (s: 0.94) Exon 2 1403 1189 ( 215 n); cDNA 147 360 ( 214 n); score: 0.930 PPA cDNA 381 391 MATCH C06HBa0153O03.1-6- SGN-E344035+ 0.895 285 0.729 C PGS_C06HBa0153O03.1-6-_SGN-E344035+ (2442 2373,1403 1189) Alignment (genomic DNA sequence = upper lines): ATTGCTTTGA GTTTGGAGTT CAACAAAATG GGAAGGCTCA AGTTTTAAAG TGAATTCTTG 2383 ||||| |||| ||| | |||| || ||||| | |||| | || ||| | || |||||||| | ATTGC-TTGA GTTGGAAGTT CAGCAAAAGG GGAAAACCCA AGTCTCGGAG TGAATTCTCG 136 ACTAATTGAG GCAAGTGGAT TTCTAAACTC TTGTTAAGTG TATGAAATGT GTGTATTTCC 2323 | | |||||| ATTGATTGAG .......... .......... .......... .......... .......... 146 TTGTGGTATA TGTTTGGGAG TAACGGGATT GGTGATGGAT TGACTTGCCC ACATTGATTA 2263 .......... .......... .......... .......... .......... .......... 146 ATTCTAATAA TGAAAAAAAG GGATAACAAA AGGCAATGTG ATTAATTGTT TATGTGTGAT 2203 .......... .......... .......... .......... .......... .......... 146 GTGTTGAGAA AGGGTTGAAC GCTTGTTGAA TCATTGTTGA TGTGATTTCA TGTTTGTGTG 2143 .......... .......... .......... .......... .......... .......... 146 GTTGTGAACT CTGCAATGTT ATGAAAATGG TCATCTCCTC ATTATTTGTG TGAACGTGTC 2083 .......... .......... .......... .......... .......... .......... 146 ATCTGCATTA TTATGAGGCA TTGGTTGTGA CAATTGTTAT GTGCATTGAG AAAGAATGGG 2023 .......... .......... .......... .......... .......... .......... 146 TACTTAAGAG GATGTACCAT TTCGAGGGAC GTATCGCGCG CCGCGATGGA TACTATATTT 1963 .......... .......... .......... .......... .......... .......... 146 CGAGGGACAT ATCGCGCGTC GCGATGGATA CTATATTTCG AGAGACGTAT CGCGCGCCAT 1903 .......... .......... .......... .......... .......... .......... 146 GATGGTTACT ATTATCGAGG GTCATGTCGT GCTCCGCGAC AGATGCATGA ACAGATATGT 1843 .......... .......... .......... .......... .......... .......... 146 CCCCTATGGG TCCCGGACTG AGAGACAGCG AGTGTATGTC ACTAGGTCAG ACATGCATCA 1783 .......... .......... .......... .......... .......... .......... 146 TTATACTTGA CATTGTATTC CATTGTATTG CACATATTTA TCATTAGTGA ACTTGATATC 1723 .......... .......... .......... .......... .......... .......... 146 GTGTTTTGCT GATCTTATGA TTACCATTCT GTGGAACTTG TGATTGATCA ATATTGAGCT 1663 .......... .......... .......... .......... .......... .......... 146 TGTTATTGCG AATATGTAAT TTGTTGAAGT GTTGTTGTTG AGGATATGTA ATTTGTCTAA 1603 .......... .......... .......... .......... .......... .......... 146 GTGTTGTTGT TGAGCTACGT GCTGTGTAAA CTGTGAGTTG TTAGGTTGGG TTGATTTTTA 1543 .......... .......... .......... .......... .......... .......... 146 ATGCAGGTTG TAGTTGTGGA GGTCCGGTTG GGGGTGGTAG GAGTACCCGT ATTTCATCCC 1483 .......... .......... .......... .......... .......... .......... 146 TTTAGCTTGT GTTTAGAGGT TTACTTGCTG AGTACCGTGT GGTTTGGTAC TCACCCCTTG 1423 .......... .......... .......... .......... .......... .......... 146 CTTCTACAAA TTTTTGTAGG TTATGAGCCT GGATTTTTGT TGTACTTGTC ATTCTCTTCT 1363 | |||||||||| |||||||||| |||||||||| |||||||||| .......... .........G TTATGAGCCT GGATTTTTGT TGTACTTGTC ATTCTCTTCT 187 TTTTTTAGGC TTCTTGGAGA TTTGTCAAGT AGTTGTTTCC ATCGCAGCAG ACTTTTTTCT 1303 ||| |||| ||| |||||| ||||||| || |||||||||| |||||||||| ||||| |||| TTTCCGAGGC TTCATGGAGA TTTGTCAGGT AGTTGTTTCC ATCGCAGCAG ACTTTCTTCT 247 CCTTTATTAT GATATTGTTC TATTCTAGAA ACAAGTTCAT TTGAGACTTG ATTTTTTCTT 1243 || | ||||| | | |||||| |||||||||| |||| ||||| |||||||||| | ||||||| CCGTAATTAT GTTCTTGTTC TATTCTAGAA ACAAATTCAT TTGAGACTTG A-GTTTTCTT 306 TTGAATCAAT TGTAATACTT TAGAGGCTTG TACACGTGAC TACCAGGTTT TGGG 1189 |||||||||| ||||||||| |||||||||| |||||||||| ||||||||| |||| TTGAATCAAT AGTAATACTT TAGAGGCTTG TACACGTGAC AACCAGGTTT TGGG 360 hqPGS_C06HBa0153O03.1-6-_SGN-E344035+ (2442 2373,1403 1189) ******************************************************************************** EST sequence 9 +strand 574 n (File: SGN-E354202+) 1 TTGCATCTTT ATCGTTATTC AAACAAAGGG AGTAAAACTT TGGAGTAAAT GAGGGAAAAA 61 AAGACTCAAC TGAGGTCATT TTTTTGAATA GATTACTTTG AGTTGAAAGT TCAACGAAAG 121 AGGAAGGCTC AAGTTTCAGA ATGAATTCTT GATTGATTGA GTGTTATGAA AATGGGTGTG 181 TCCTCTTTAT TTGTGTGAAC ATGTCATTTG CATTATTTTA AGACATGATT GTGACAAGTG 241 TTATGTGCAT TGAGAAAGAA TGAAAACTTA AAGAGGATGT ACCATTTCGA GGGACGTATC 301 GCGCGCCGCG ATGGTTATTA TTATCGAGGG TCGTGTCATG CGCCGCAACA GATGCATAGA 361 TAGATATGTC CTACATGGGT CCCGGCCTGA GAGACAGCGG GTGTGTATCA CTAGGTTACT 421 AGTCCGGACC TTTTATGATA TATTCTCTTC TTTTCCGAGG CTTCCTGGAG ATTTGTGAGG 481 TAGTTGTTGA TCATCTCAGC ATCCCTCCTT ACTCCTATTT ATGATTTTGT TCTAGTCTAG 541 AAACATATCA TATGAGACTT ATATTTTCTT TTGA Predicted gene structure (within gDNA segment 3430 to 1): Exon 1 2449 2373 ( 77 n); cDNA 85 161 ( 77 n); score: 0.844 Intron 1 2372 2127 ( 246 n); Pd: 0.978 (s: 0.84), Pa: 0.000 (s: 0.86) Exon 2 2126 1974 ( 153 n); cDNA 162 314 ( 153 n); score: 0.879 Intron 2 1973 1898 ( 76 n); Pd: 0.000 (s: 0.93), Pa: 0.000 (s: 0.84) Exon 3 1897 1798 ( 100 n); cDNA 315 414 ( 100 n); score: 0.850 Intron 3 1797 1404 ( 394 n); Pd: 0.961 (s: 0.86), Pa: 0.000 (s: 0.68) Exon 4 1403 1239 ( 165 n); cDNA 415 574 ( 160 n); score: 0.745 MATCH C06HBa0153O03.1-6- SGN-E354202+ 0.823 495 0.862 C PGS_C06HBa0153O03.1-6-_SGN-E354202+ (2449 2373,2126 1974,1897 1798,1403 1239) Alignment (genomic DNA sequence = upper lines): TGAATAGATT GCTTTGAGTT TGGAGTTCAA CAAAATGGGA AGGCTCAAGT TTTAAAGTGA 2390 |||||||||| ||||||||| ||||||| | ||| ||| |||||||||| || | | ||| TGAATAGATT ACTTTGAGTT GAAAGTTCAA CGAAAGAGGA AGGCTCAAGT TTCAGAATGA 144 ATTCTTGACT AATTGAGGCA AGTGGATTTC TAAACTCTTG TTAAGTGTAT GAAATGTGTG 2330 |||||||| | |||||| ATTCTTGATT GATTGAG... .......... .......... .......... .......... 161 TATTTCCTTG TGGTATATGT TTGGGAGTAA CGGGATTGGT GATGGATTGA CTTGCCCACA 2270 .......... .......... .......... .......... .......... .......... 161 TTGATTAATT CTAATAATGA AAAAAAGGGA TAACAAAAGG CAATGTGATT AATTGTTTAT 2210 .......... .......... .......... .......... .......... .......... 161 GTGTGATGTG TTGAGAAAGG GTTGAACGCT TGTTGAATCA TTGTTGATGT GATTTCATGT 2150 .......... .......... .......... .......... .......... .......... 161 TTGTGTGGTT GTGAACTCTG CAATGTTATG AAAATGGTCA TCTCCTCATT ATTTGTGTGA 2090 ||||||| ||||||| | ||||| || |||||||||| .......... .......... ...TGTTATG AAAATGGGTG TGTCCTCTTT ATTTGTGTGA 198 ACGTGTCATC TGCATTATTA TGAGGCATTG GTTGTGACAA TTGTTATGTG CATTGAGAAA 2030 || |||||| ||||||||| | || || || ||||||||| ||||||||| |||||||||| ACATGTCATT TGCATTATTT TAAGACA-TG ATTGTGACAA GTGTTATGTG CATTGAGAAA 257 GAATGGGTAC TT-AAGAGGA TGTACCATTT CGAGGGACGT ATCGCGCGCC GCGATGGATA 1971 ||||| || || ||||||| |||||||||| |||||||||| |||||||||| ||||||| GAATGAAAAC TTAAAGAGGA TGTACCATTT CGAGGGACGT ATCGCGCGCC GCGATGG... 314 CTATATTTCG AGGGACATAT CGCGCGTCGC GATGGATACT ATATTTCGAG AGACGTATCG 1911 .......... .......... .......... .......... .......... .......... 314 CGCGCCATGA TGGTTACTAT TATCGAGGGT CATGTCGTGC TCCGCGACAG ATGCATGAAC 1851 ||| ||| |||||||||| | |||| ||| |||| |||| |||||| | .......... ...TTATTAT TATCGAGGGT CGTGTCATGC GCCGCAACAG ATGCATAGAT 361 AGATATGTCC CCTATGGGTC CCGGACTGAG AGACAGCGAG TGTATGTCAC TAGGTCAGAC 1791 |||||||||| ||||||| |||| ||||| |||||||| | ||| | |||| ||| AGATATGTCC TACATGGGTC CCGGCCTGAG AGACAGCGGG TGTGTATCAC TAG....... 414 ATGCATCATT ATACTTGACA TTGTATTCCA TTGTATTGCA CATATTTATC ATTAGTGAAC 1731 .......... .......... .......... .......... .......... .......... 414 TTGATATCGT GTTTTGCTGA TCTTATGATT ACCATTCTGT GGAACTTGTG ATTGATCAAT 1671 .......... .......... .......... .......... .......... .......... 414 ATTGAGCTTG TTATTGCGAA TATGTAATTT GTTGAAGTGT TGTTGTTGAG GATATGTAAT 1611 .......... .......... .......... .......... .......... .......... 414 TTGTCTAAGT GTTGTTGTTG AGCTACGTGC TGTGTAAACT GTGAGTTGTT AGGTTGGGTT 1551 .......... .......... .......... .......... .......... .......... 414 GATTTTTAAT GCAGGTTGTA GTTGTGGAGG TCCGGTTGGG GGTGGTAGGA GTACCCGTAT 1491 .......... .......... .......... .......... .......... .......... 414 TTCATCCCTT TAGCTTGTGT TTAGAGGTTT ACTTGCTGAG TACCGTGTGG TTTGGTACTC 1431 .......... .......... .......... .......... .......... .......... 414 ACCCCTTGCT TCTACAAATT TTTGTAGGTT ATGAGCCTGG ATTTTTGTTG TACTTGTCAT 1371 ||| | || | || | ||| || | | | || .......... .......... .......GTT ACTAGTCCGG ACCTTTTATG -A--TAT-AT 443 TCTCTTCTTT TTTTAGGCTT CTTGGAGATT TGTCAAGTAG TTGTT-TCCA TCGCAGCA-G 1313 |||||||||| | |||||| | |||||||| ||| | |||| ||||| || || ||||| TCTCTTCTTT TCCGAGGCTT CCTGGAGATT TGTGAGGTAG TTGTTGATCA TCTCAGCATC 503 ACTTTTTTCT CCTTTATTAT GATATTGTTC TATTCTAGAA ACAAGTTCAT TTGAGACTTG 1253 || || || ||| | |||| ||| |||||| || ||||||| || | |||| |||||||| CCTCCTTACT CCTAT-TTAT GATTTTGTTC TAGTCTAGAA AC-ATATCAT ATGAGACTT- 560 ATTTTTTCTT TTGA 1239 || ||||||| |||| ATATTTTCTT TTGA 574 hqPGS_C06HBa0153O03.1-6-_SGN-E354202+ (2449 2373,2126 1974,1897 1798,1403 1239) ******************************************************************************** EST sequence 3 +strand 557 n (File: SGN-E243101+) 1 CAAATTACAT ATCCTCAACA ACAACACTTC AACAAATTAC ATATCCTCAA CAAGAATAGT 61 TCAACAAATT ATATATCCGC CATAACAAGC TCAATATTAA TCAATCACAA GTTCTGCAGA 121 AAGGCAATAA GAAGATCGAG AAAATACGAT ATCAAGTTCA CTAATGATAA GTGTGTAATG 181 CAATGGAATG CAATGTCAAG TATAATGATG CATGTCTGAC CTAGTGATAT ACACCCGCTG 241 TCTCTCAGTC CGGGACCCTT GGGGGACATA TCTGTCCATG CATCTGTCGC GGCGCACGAC 301 ACGATCCTCG ATAATAGTAA CCATCACGGC GCGCGATACG TTCCTCGAAA TATAGTATCC 361 ATCGCGGCGC GCGATACGTC CCTCGAAATG GTACATCCTC TTAAGTACCC ATTATTTCTC 421 AATGCACATA ACACTTGCCA CAACCAATGC CTCAGAATAA TGCAGATGAC AAGTTCACAC 481 ATATAATGAG GAGATGACCA TTTTCATAAC ATCCTACAGT TCACAACCAC ACAAACATCA 541 AGTTACATCA ACAATGA Predicted gene structure (within gDNA segment 989 to 2953): Exon 1 1589 1931 ( 343 n); cDNA 14 354 ( 341 n); score: 0.895 Intron 1 1932 1969 ( 38 n); Pd: 0.900 (s: 0.92), Pa: 0.000 (s: 1.00) Exon 2 1970 2172 ( 203 n); cDNA 355 557 ( 203 n); score: 0.936 MATCH C06HBa0153O03.1-6+ SGN-E243101+ 0.910 546 0.980 C PGS_C06HBa0153O03.1-6+_SGN-E243101+ (1589 1931,1970 2172) Alignment (genomic DNA sequence = upper lines): CTCAACAACA ACACTTAGAC AAATTACATA TCCTCAACAA CAACACTTCA ACAAATTACA 1648 |||||||||| |||||| || |||||||||| |||||||||| || | |||| |||||||| | CTCAACAACA ACACTTCAAC AAATTACATA TCCTCAACAA GAATAGTTCA ACAAATTATA 73 TATTCGCAAT AACAAGCTCA ATATTGATCA ATCACAAGTT CCACAGAATG GTAATCATAA 1708 ||| ||| || |||||||||| ||||| |||| |||||||||| | ||||| | | ||| | || TATCCGCCAT AACAAGCTCA ATATTAATCA ATCACAAGTT CTGCAGAAAG GCAATAAGAA 133 GATCAGCAAA ACACGATATC AAGTTCACTA ATGATAAATA TGTGCAATAC AATGGAATAC 1768 |||| ||| | |||||||| |||||||||| ||||| || |||| ||| | |||||||| | GATCGAGAAA ATACGATATC AAGTTCACTA ATGAT-AA-G TGTGTAATGC AATGGAATGC 191 AATGTCAAGT ATAATGATGC ATGTCTGACC TAGTGACATA CACTCGCTGT CTCTCAGTCC 1828 |||||||||| |||||||||| |||||||||| |||||| ||| ||| |||||| |||||||||| AATGTCAAGT ATAATGATGC ATGTCTGACC TAGTGATATA CACCCGCTGT CTCTCAGTCC 251 GGGACCCATA GGGGACATAT CTGTTCATGC ATCTGTCGCG GAGCACGACA TGACCCTCGA 1888 ||||||| | |||||||||| |||| ||||| |||||||||| | |||||||| || |||||| GGGACCCTTG GGGGACATAT CTGTCCATGC ATCTGTCGCG GCGCACGACA CGATCCTCGA 311 TAATAGTAAC CATCATGGCG CGCGATACGT CTCTCGAAAT ATAGTATCCA TCGCGACGCG 1948 |||||||||| ||||| |||| |||||||||| |||||||| ||| TAATAGTAAC CATCACGGCG CGCGATACGT TCCTCGAAAT ATA....... .......... 354 CGATATGTCC CTCGAAATAT AGTATCCATC GCGGCGCGCG ATACGTCCCT CGAAATGGTA 2008 ||||||||| |||||||||| |||||||||| |||||||||| .......... .......... .GTATCCATC GCGGCGCGCG ATACGTCCCT CGAAATGGTA 393 CATCCTCTTA AGTACCCATT CTTTCTCAAT GCACATAACA ATTGTCACAA CCAATGCCTC 2068 |||||||||| |||||||||| ||||||||| |||||||||| ||| ||||| |||||||||| CATCCTCTTA AGTACCCATT ATTTCTCAAT GCACATAACA CTTGCCACAA CCAATGCCTC 453 ATAATAATGC AGATGACACG TTCACACAAA TAATGAGGAG ATGACCATTT TCATAACATT 2128 | |||||||| |||||||| | |||||||| | |||||||||| |||||||||| ||||||||| AGAATAATGC AGATGACAAG TTCACACATA TAATGAGGAG ATGACCATTT TCATAACATC 513 GCAGAGTTCA CAACCACACA AACATGAAAT CACATCAACA ATGA 2172 | |||||| |||||||||| ||||| || | ||||||||| |||| CTACAGTTCA CAACCACACA AACATCAAGT TACATCAACA ATGA 557 hqPGS_C06HBa0153O03.1-6+_SGN-E243101+ (1589 1931,1970 2172) ******************************************************************************** EST sequence 2 +strand 713 n (File: SGN-E541821+) 1 TTTTTTTTTT GCAGAAATAA AATATAAGTA ACTTAATTAA ACCCCCAAGA CCTGGGTAGT 61 CACGTGTACA AGCCTCTAAA GTATTACAAG ATTCAAAAGA AAAAAACTCA AGTCTCAAAT 121 GAACTTGTTT CTAGAATAGA ACAAGATCAT AATTATGGAG AAGAAAGTCT GCTGAGATGG 181 AAACAGCTAC CTCACAAATC TCCATGAAGC CTCGGAAAAG AAGAGACTGA CAAGTACAAC 241 AAAAATCCAG GCTCATAACC TACAAAAATT TGTAGAAGCA AGGGGTGAGT ACCAAACCAC 301 ACGGTACTCA GCAAGTAAAC CTCTAAACAT AAGCTAAGGG GATGAAATAC GGGTACTCCT 361 ACCACCCCCA ACCGAACCTC CACAAATACA ACCTGCATTA AAATCAACCC AACCTAACAG 421 CTCACAATTT ACATAGCACA TAGCTCAACA ACAACACTTA GGCAAATTAC ATATCCTCAA 481 CAACAACACT TCAACAAATT ACATATCCTC AACAACAATA CTTCAACAAA TTACATATCC 541 TCAACAACAA TACTTCAACA AATTACATAT CCGCAATAAC AAGCTCAATA TTGATCAATC 601 ACAAGTTCCG CAGAAAAGCA ATCAAAACAT CAGCGAAATA CGATATCAAG TTCACTAATG 661 GTAAGTATGT GCAATGCAAT GGAATGCAAT GTCAAGTATA ATGATGCATG TCT Predicted gene structure (within gDNA segment 429 to 3340): Exon 1 1155 1663 ( 509 n); cDNA 10 518 ( 509 n); score: 0.922 Intron 1 1664 2132 ( 469 n); Pd: 0.000 (s: 0.94), Pa: 0.737 (s: 0.62) Exon 2 2133 2180 ( 48 n); cDNA 519 561 ( 43 n); score: 0.625 MATCH C06HBa0153O03.1-6+ SGN-E541821+ 0.922 557 0.781 C PGS_C06HBa0153O03.1-6+_SGN-E541821+ (1155 1663,2133 2180) Alignment (genomic DNA sequence = upper lines): TGCGGAAATA AAATATAAGT AACTTAATAA AAACCCCAAA ACCT-GGTAG TCACGTGTAC 1213 ||| |||||| |||||||||| |||||||| | || |||||| |||| ||||| |||||||||| TGCAGAAATA AAATATAAGT AACTTAATTA AACCCCCAAG ACCTGGGTAG TCACGTGTAC 69 AAGCCTCTAA AGTATTACAA TTGATTCAAA AGAAAAA-A- TCAAGTCTCA AATGAACTTG 1271 |||||||||| |||||||||| |||||||| ||||||| | |||||||||| |||||||||| AAGCCTCTAA AGTATTACAA --GATTCAAA AGAAAAAAAC TCAAGTCTCA AATGAACTTG 127 TTTCTAGAAT AGAACAATAT CATAATAAAG GAGAAAAAAG TCTGCTGCGA TGGAAACAAC 1331 |||||||||| ||||||| || |||||| | | ||||| |||| ||||||| || |||||||| | TTTCTAGAAT AGAACAAGAT CATAATTATG GAGAAGAAAG TCTGCTGAGA TGGAAACAGC 187 TACTTGACAA ATCTCCAAGA AGCCTAAAAA AAGAAGAGAA TGACAAGTAC AACAAAAATC 1391 ||| | |||| ||||||| || ||||| || ||||||||| |||||||||| |||||||||| TACCTCACAA ATCTCCATGA AGCCTCGGAA AAGAAGAGAC TGACAAGTAC AACAAAAATC 247 CAGGCTCATA ACCTACAAAA ATTTGTAGAA GCAAGGGGTG AGTACCAAAC CACACGGTAC 1451 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAGGCTCATA ACCTACAAAA ATTTGTAGAA GCAAGGGGTG AGTACCAAAC CACACGGTAC 307 TCAGCAAGTA AACCTCTAAA CACAAGCTAA AGGGATGAAA TACGGGTACT CCTACCACCC 1511 |||||||||| |||||||||| || ||||||| ||||||||| |||||||||| |||||||||| TCAGCAAGTA AACCTCTAAA CATAAGCTAA GGGGATGAAA TACGGGTACT CCTACCACCC 367 CCAACCGGAC CTCCACAACT ACAACCTGCA TTAAAAATCA ACCCAACCTA ACAACTCACA 1571 ||||||| || |||||||| | |||||||||| || ||||||| |||||||||| ||| |||||| CCAACCGAAC CTCCACAAAT ACAACCTGCA TT-AAAATCA ACCCAACCTA ACAGCTCACA 426 GTTTACACAG CACGTAGCTC AACAACAACA CTTAGACAAA TTACATATCC TCAACAACAA 1631 |||||| || ||| |||||| |||||||||| ||||| |||| |||||||||| |||||||||| ATTTACATAG CACATAGCTC AACAACAACA CTTAGGCAAA TTACATATCC TCAACAACAA 486 CACTTCAACA AATTACATAT TCGCAATAAC AAGCTCAATA TTGATCAATC ACAAGTTCCA 1691 |||||||||| |||||||||| | ||| ||| || CACTTCAACA AATTACATAT CCTCAACAAC AA........ .......... .......... 518 CAGAATGGTA ATCATAAGAT CAGCAAAACA CGATATCAAG TTCACTAATG ATAAATATGT 1751 .......... .......... .......... .......... .......... .......... 518 GCAATACAAT GGAATACAAT GTCAAGTATA ATGATGCATG TCTGACCTAG TGACATACAC 1811 .......... .......... .......... .......... .......... .......... 518 TCGCTGTCTC TCAGTCCGGG ACCCATAGGG GACATATCTG TTCATGCATC TGTCGCGGAG 1871 .......... .......... .......... .......... .......... .......... 518 CACGACATGA CCCTCGATAA TAGTAACCAT CATGGCGCGC GATACGTCTC TCGAAATATA 1931 .......... .......... .......... .......... .......... .......... 518 GTATCCATCG CGACGCGCGA TATGTCCCTC GAAATATAGT ATCCATCGCG GCGCGCGATA 1991 .......... .......... .......... .......... .......... .......... 518 CGTCCCTCGA AATGGTACAT CCTCTTAAGT ACCCATTCTT TCTCAATGCA CATAACAATT 2051 .......... .......... .......... .......... .......... .......... 518 GTCACAACCA ATGCCTCATA ATAATGCAGA TGACACGTTC ACACAAATAA TGAGGAGATG 2111 .......... .......... .......... .......... .......... .......... 518 ACCATTTTCA TAACATTGCA GAGTTCACAA CCACACAAAC ATGAAATCAC ATCAACAATG 2171 | ||| || | || || | | | | ||||||| .......... .......... .TACT-TCAA -CA-AATTAC AT--ATCCTC AACAACAATA 552 ATTCAACAA 2180 |||||||| CTTCAACAA 561 hqPGS_C06HBa0153O03.1-6+_SGN-E541821+ (1155 1663) ******************************************************************************** EST sequence 1 +strand 319 n (File: SGN-E577986+) 1 TACCACCCCA ACCGAACCTC CACAACTACA ACCTACATAA ACATCAGCAC AACCTAACAA 61 CTCACAGTTT ACATAGAACG TAACTCTACA AAAACATTTA GACAAATTAC ATATCCTCCA 121 CAACAACACT TTAACAAATT ACATATCCTC AACATCAACA CTTAAACAAA TTAGATATCC 181 TCAATAGCAA CACTTCCAAA AATTTACATA TCCTCAATAA CACACTTCAA CAAATCACAT 241 ATCCTCAACA ACAACACTTC AACAAATTAT ATATCCTCAA TAACAAGCTC AGTATTCAAC 301 AATCACAAGT TCCGCAAAA Predicted gene structure (within gDNA segment 770 to 3356): Exon 1 1504 1664 ( 161 n); cDNA 1 159 ( 159 n); score: 0.876 Intron 1 1665 2259 ( 595 n); Pd: 0.000 (s: 0.86), Pa: 0.958 (s: 0) Exon 2 2260 2269 ( 10 n); cDNA 160 169 ( 10 n); score: 0.800 MATCH C06HBa0153O03.1-6+ SGN-E577986+ 0.876 171 0.536 C PGS_C06HBa0153O03.1-6+_SGN-E577986+ (1504 1664,2260 2269) Alignment (genomic DNA sequence = upper lines): TACCACCCCC AACCGGACCT CCACAACTAC AACCTGCATT AAAAATCAAC CCAACCTAAC 1563 ||||| |||| ||||| |||| |||||||||| ||||| || | ||| |||| | ||||||||| TACCA-CCCC AACCGAACCT CCACAACTAC AACCTACA-T AAACATCAGC ACAACCTAAC 58 AACTCACAGT TTACACAGCA CGTAGCTCAA CAACAACACT TAGACAAATT ACATATCCTC 1623 |||||||||| ||||| || | |||| ||| | ||| |||| | |||||||||| |||||||||| AACTCACAGT TTACATAGAA CGTAACTCTA CAAAAACATT TAGACAAATT ACATATCCTC 118 AACAACAACA CTTCAACAAA TTACATATTC GCAATAACAA GCTCAATATT GATCAATCAC 1683 ||||||||| ||| |||||| |||||||| | ||| | ||| CACAACAACA CTTTAACAAA TTACATATCC TCAACATCAA C......... .......... 159 AAGTTCCACA GAATGGTAAT CATAAGATCA GCAAAACACG ATATCAAGTT CACTAATGAT 1743 .......... .......... .......... .......... .......... .......... 159 AAATATGTGC AATACAATGG AATACAATGT CAAGTATAAT GATGCATGTC TGACCTAGTG 1803 .......... .......... .......... .......... .......... .......... 159 ACATACACTC GCTGTCTCTC AGTCCGGGAC CCATAGGGGA CATATCTGTT CATGCATCTG 1863 .......... .......... .......... .......... .......... .......... 159 TCGCGGAGCA CGACATGACC CTCGATAATA GTAACCATCA TGGCGCGCGA TACGTCTCTC 1923 .......... .......... .......... .......... .......... .......... 159 GAAATATAGT ATCCATCGCG ACGCGCGATA TGTCCCTCGA AATATAGTAT CCATCGCGGC 1983 .......... .......... .......... .......... .......... .......... 159 GCGCGATACG TCCCTCGAAA TGGTACATCC TCTTAAGTAC CCATTCTTTC TCAATGCACA 2043 .......... .......... .......... .......... .......... .......... 159 TAACAATTGT CACAACCAAT GCCTCATAAT AATGCAGATG ACACGTTCAC ACAAATAATG 2103 .......... .......... .......... .......... .......... .......... 159 AGGAGATGAC CATTTTCATA ACATTGCAGA GTTCACAACC ACACAAACAT GAAATCACAT 2163 .......... .......... .......... .......... .......... .......... 159 CAACAATGAT TCAACAAGCG TTCAACCCTT TCTCAACACA TCACACATAA ACAATTAATC 2223 .......... .......... .......... .......... .......... .......... 159 ACATTGCCTT TTGTTATCCC TTTTTTTCAT TATTAGAATT AATCAA 2269 | || || ||| .......... .......... .......... ......ACTT AAACAA 169 hqPGS_C06HBa0153O03.1-6+_SGN-E577986+ (1504 1664) ******************************************************************************** EST sequence 6 -strand 681 n (File: SGN-E539191-) 1 TTCAACAAAT TATATATCCT CAATAACAAG CTCAGTATTC ATCAATCACA AATTTCGCAG 61 AAATTCAATC ACAAGATCAG CAAAACACAA TATCAAGTTC ACCAATGATA AGTATGTGTA 121 ATGCATTGTC AAGTATAATG ATGCATGTCT GACCTAGTGA TACACACCCG CTGTCTCTCA 181 GTCTGGGAAC CATGGGGGAC ATATCTGTCC ATGCATCTGT CGCGGCGCAC GACACGACCC 241 TCGATAATAG TAACCATCGC GGCGCGCGAT ACGTCCCTCA AAATATAGTA TCCATCGCGG 301 CATGCGATAC GTCCCTCGAA ATATAGTATC CATCGCGGCG CGCGATACGT CCCTCGAAAT 361 TGTACATCCT CTTAAGTACT CATTCTTTCT CAATGCACAT AACACTTGTC ACAACCAATG 421 CCTCATAATA ATGCAGATGG CAAGTTCACA CAAATAATGA GGAGATGACC ATTTTCATAA 481 CATTGCACAA TTCAAAACCA CACAAACATG AAATCACATC AACAATGATT CAACAATCCT 541 TCATCCCTTT CTGAACACAT CACACTTAAA TATTTAATCA CATTGCCTTT TTTTTTATCC 601 TTTTTTTTTC ATCATTAAAT TAATCAATGT GGACAAGTCA ATCCATCACC AATCCCGTTA 661 CTCCCAAACA TATACCACAA G Predicted gene structure (within gDNA segment 1 to 2923): Exon 1 1635 2323 ( 689 n); cDNA 1 681 ( 681 n); score: 0.896 MATCH C06HBa0153O03.1-6+ SGN-E539191- 0.896 689 1.012 C PGS_C06HBa0153O03.1-6+_SGN-E539191- (1635 2323) Alignment (genomic DNA sequence = upper lines): TTCAACAAAT TACATATTCG CAATAACAAG CTCAATATTG ATCAATCACA AGTTCCACAG 1694 |||||||||| || |||| | |||||||||| |||| |||| |||||||||| | || | ||| TTCAACAAAT TATATATCCT CAATAACAAG CTCAGTATTC ATCAATCACA AATTTCGCAG 60 AATGGTAATC ATAAGATCAG CAAAACACGA TATCAAGTTC ACTAATGATA AATATGTGCA 1754 || |||| | |||||||| |||||||| | |||||||||| || ||||||| | |||||| AAATTCAATC ACAAGATCAG CAAAACACAA TATCAAGTTC ACCAATGATA AGTATGTG-- 118 ATACAATGGA ATACAATGTC AAGTATAATG ATGCATGTCT GACCTAGTGA CATACACTCG 1814 | ||| | || |||| |||||||||| |||||||||| |||||||||| | |||| || -T--AAT-GC AT----TGTC AAGTATAATG ATGCATGTCT GACCTAGTGA TACACACCCG 170 CTGTCTCTCA GTCCGGGACC CATAGGGGAC ATATCTGTTC ATGCATCTGT CGCGGAGCAC 1874 |||||||||| ||| |||| | ||| |||||| |||||||| | |||||||||| ||||| |||| CTGTCTCTCA GTCTGGGAAC CATGGGGGAC ATATCTGTCC ATGCATCTGT CGCGGCGCAC 230 GACATGACCC TCGATAATAG TAACCATCAT GGCGCGCGAT ACGTCTCTCG AAATATAGTA 1934 |||| ||||| |||||||||| |||||||| |||||||||| ||||| ||| |||||||||| GACACGACCC TCGATAATAG TAACCATCGC GGCGCGCGAT ACGTCCCTCA AAATATAGTA 290 TCCATCGCGA CGCGCGATAT GTCCCTCGAA ATATAGTATC CATCGCGGCG CGCGATACGT 1994 ||||||||| | |||||| |||||||||| |||||||||| |||||||||| |||||||||| TCCATCGCGG CATGCGATAC GTCCCTCGAA ATATAGTATC CATCGCGGCG CGCGATACGT 350 CCCTCGAAAT GGTACATCCT CTTAAGTACC CATTCTTTCT CAATGCACAT AACAATTGTC 2054 |||||||||| ||||||||| ||||||||| |||||||||| |||||||||| |||| ||||| CCCTCGAAAT TGTACATCCT CTTAAGTACT CATTCTTTCT CAATGCACAT AACACTTGTC 410 ACAACCAATG CCTCATAATA ATGCAGATGA CACGTTCACA CAAATAATGA GGAGATGACC 2114 |||||||||| |||||||||| ||||||||| || ||||||| |||||||||| |||||||||| ACAACCAATG CCTCATAATA ATGCAGATGG CAAGTTCACA CAAATAATGA GGAGATGACC 470 ATTTTCATAA CATTGCAGAG TTCACAACCA CACAAACATG AAATCACATC AACAATGATT 2174 |||||||||| ||||||| | |||| ||||| |||||||||| |||||||||| |||||||||| ATTTTCATAA CATTGCACAA TTCAAAACCA CACAAACATG AAATCACATC AACAATGATT 530 CAACAAGCGT TCAACCCTTT CTCAACACAT CACACATAAA CAATTAATCA CATTGCC--T 2232 |||||| | | ||| |||||| || ||||||| ||||| |||| | ||||||| ||||||| | CAACAATCCT TCATCCCTTT CTGAACACAT CACACTTAAA TATTTAATCA CATTGCCTTT 590 TTTGTTATCC -CTTTTTTTC ATTATTAGAA TTAATCAATG TGGGCAAGTC AATCCATCAC 2291 ||| |||||| |||||||| || |||| || |||||||||| ||| |||||| |||||||||| TTTTTTATCC TTTTTTTTTC ATCATTA-AA TTAATCAATG TGGACAAGTC AATCCATCAC 649 CAATCCCGTT ACTCCCAAAC ATATACCACA AG 2323 |||||||||| |||||||||| |||||||||| || CAATCCCGTT ACTCCCAAAC ATATACCACA AG 681 hqPGS_C06HBa0153O03.1-6+_SGN-E539191- (1635 2323) ******************************************************************************** EST sequence 10 -strand 397 n (File: SGN-E278592-) 1 TCAAAATATA GTATCCATCG CGGCATGCGA TACGTCCCTC GAAATATAGT ATCCATCGCG 61 GCGCGCGATA CGTCCCTCGA AATTGTACAT CCTCTTAAGT ACTCATTCTT TCTCAATGCA 121 CATAACACTT GTCACAACCA ATGCCTCATA ATAATGCAGA TGGCAAGTTC ACACAAATAA 181 TGAGGAGATC ACCATTTTCA TAACATTGCA CAATTCAAAA CCACACAAAC ATGAAATCAC 241 ATCAACAATG ATTCAACAAT CCTTCATCCC TTTCTGAACA CATCACACTT AAATATTTAA 301 TCACATTGCC TTTTTTTTTA TCCTTTTTTT TTCATCATTA AATTAATCAA TGTGGACAAG 361 TCAATCCATC ACCAATCCCG TGACTCCCCC ACATATA Predicted gene structure (within gDNA segment 889 to 3430): Exon 1 1922 2316 ( 395 n); cDNA 1 397 ( 397 n); score: 0.908 MATCH C06HBa0153O03.1-6+ SGN-E278592- 0.908 395 0.995 C PGS_C06HBa0153O03.1-6+_SGN-E278592- (1922 2316) Alignment (genomic DNA sequence = upper lines): TCGAAATATA GTATCCATCG CGACGCGCGA TATGTCCCTC GAAATATAGT ATCCATCGCG 1981 || ||||||| |||||||||| || | |||| || ||||||| |||||||||| |||||||||| TCAAAATATA GTATCCATCG CGGCATGCGA TACGTCCCTC GAAATATAGT ATCCATCGCG 60 GCGCGCGATA CGTCCCTCGA AATGGTACAT CCTCTTAAGT ACCCATTCTT TCTCAATGCA 2041 |||||||||| |||||||||| ||| |||||| |||||||||| || ||||||| |||||||||| GCGCGCGATA CGTCCCTCGA AATTGTACAT CCTCTTAAGT ACTCATTCTT TCTCAATGCA 120 CATAACAATT GTCACAACCA ATGCCTCATA ATAATGCAGA TGACACGTTC ACACAAATAA 2101 ||||||| || |||||||||| |||||||||| |||||||||| || || |||| |||||||||| CATAACACTT GTCACAACCA ATGCCTCATA ATAATGCAGA TGGCAAGTTC ACACAAATAA 180 TGAGGAGATG ACCATTTTCA TAACATTGCA GAGTTCACAA CCACACAAAC ATGAAATCAC 2161 ||||||||| |||||||||| |||||||||| | |||| || |||||||||| |||||||||| TGAGGAGATC ACCATTTTCA TAACATTGCA CAATTCAAAA CCACACAAAC ATGAAATCAC 240 ATCAACAATG ATTCAACAAG CGTTCAACCC TTTCTCAACA CATCACACAT AAACAATTAA 2221 |||||||||| ||||||||| | |||| ||| ||||| |||| |||||||| | ||| | |||| ATCAACAATG ATTCAACAAT CCTTCATCCC TTTCTGAACA CATCACACTT AAATATTTAA 300 TCACATTGCC TTTTGTT--A TCC-CTTTTT TTCATTATTA GAATTAATCA ATGTGGGCAA 2278 |||||||||| |||| || | ||| ||||| ||||| |||| ||||||||| |||||| ||| TCACATTGCC TTTTTTTTTA TCCTTTTTTT TTCATCATTA -AATTAATCA ATGTGGACAA 359 GTCAATCCAT CACCAATCCC GTTACTCCCA AACATATA 2316 |||||||||| |||||||||| || |||||| ||||||| GTCAATCCAT CACCAATCCC GTGACTCCCC CACATATA 397 hqPGS_C06HBa0153O03.1-6+_SGN-E278592- (1922 2316) Total number of EST alignments reported: 9 ________________________________________________________________________________ Predicted gene locations (2) in segment 1 to 3430: PGL 1 (- strand): 2449 1071 AGS-1 (2442 2373,1403 1071) SCR (e 0.786 d 0.978 a 0.000,e 0.941) Exon 1 2442 2373 ( 70 n); score: 0.786 Intron 1 2372 1404 ( 969 n); Pd: 0.978 Pa: 0.000 Exon 2 1403 1071 ( 333 n); score: 0.941 PGS (2442 2373,1403 1071) SGN-E350035+ PGS (2442 2373,1403 1071) SGN-E339561+ PGS (2442 2373,1403 1189) SGN-E344035+ 3-phase translation of AGS-1 (-strand): . . . . . . 2442 ATTGCTTTGAGTTTGGAGTTCAACAAAATGGGAAGGCTCAAGTTTTAAAGTGAATTCTTG I A L S L E F N K M G R L K F - S E F L L L - V W S S T K W E G S S F K V N S - C F E F G V Q Q N G K A Q V L K - I L . : . . . . . 2382 ACTAATTGAG : GTTATGAGCCTGGATTTTTGTTGTACTTGTCATTCTCTTCTTTTTTTAGG T N - : G Y E P G F L L Y L S F S S F F R L I E : V M S L D F C C T C H S L L F L G D - L R : L - A W I F V V L V I L F F F - . . . . . . 1353 CTTCTTGGAGATTTGTCAAGTAGTTGTTTCCATCGCAGCAGACTTTTTTCTCCTTTATTA L L G D L S S S C F H R S R L F S P L L F L E I C Q V V V S I A A D F F L L Y Y A S W R F V K - L F P S Q Q T F F S F I . . . . . . 1293 TGATATTGTTCTATTCTAGAAACAAGTTCATTTGAGACTTGATTTTTTCTTTTGAATCAA - Y C S I L E T S S F E T - F F L L N Q D I V L F - K Q V H L R L D F F F - I N M I L F Y S R N K F I - D L I F S F E S . . . . . . 1233 TTGTAATACTTTAGAGGCTTGTACACGTGACTACCAGGTTTTGGGGTTTTTATTAAGTTA L - Y F R G L Y T - L P G F G V F I K L C N T L E A C T R D Y Q V L G F L L S Y I V I L - R L V H V T T R F W G F Y - V . . . . . . 1173 CTTATATTTTATTTCCGCACTTTATTGTAATGATTGAGTTTTAGGCTGACTTGTCTTGGT L I F Y F R T L L - - L S F R L T C L G L Y F I S A L Y C N D - V L G - L V L V T Y I L F P H F I V M I E F - A D L S W . . . . . 1113 GGGATAAGACGAGTGCCATCACGCCTATTTTTGGGTCGTAACA G I R R V P S R L F L G R N G - D E C H H A Y F W V V T W D K T S A I T P I F G S - Maximal non-overlapping open reading frames (>= 64 codons): none AGS-2 (2449 2373,2126 1974,1897 1798,1403 1239) SCR (e 0.844 d 0.978 a 0.000,e 0.879 d 0.000 a 0.000,e 0.850 d 0.961 a 0.000,e 0.745) Exon 1 2449 2373 ( 77 n); score: 0.844 Intron 1 2372 2127 ( 246 n); Pd: 0.978 Pa: 0.000 Exon 2 2126 1974 ( 153 n); score: 0.879 Intron 2 1973 1898 ( 76 n); Pd: 0.000 Pa: 0.000 Exon 3 1897 1798 ( 100 n); score: 0.850 Intron 3 1797 1404 ( 394 n); Pd: 0.961 Pa: 0.000 Exon 4 1403 1239 ( 165 n); score: 0.745 PGS (2449 2373,2126 1974,1897 1798,1403 1239) SGN-E354202+ 3-phase translation of AGS-2 (-strand): . . . . . . 2449 TGAATAGATTGCTTTGAGTTTGGAGTTCAACAAAATGGGAAGGCTCAAGTTTTAAAGTGA - I D C F E F G V Q Q N G K A Q V L K - E - I A L S L E F N K M G R L K F - S E N R L L - V W S S T K W E G S S F K V . . : . . . . 2389 ATTCTTGACTAATTGAG : TGTTATGAAAATGGTCATCTCCTCATTATTTGTGTGAACGTGT I L D - L S : V M K M V I S S L F V - T C F L T N - : V L - K W S S P H Y L C E R V N S - L I E : C Y E N G H L L I I C V N V . . . . . . 2083 CATCTGCATTATTATGAGGCATTGGTTGTGACAATTGTTATGTGCATTGAGAAAGAATGG H L H Y Y E A L V V T I V M C I E K E W I C I I M R H W L - Q L L C A L R K N G S S A L L - G I G C D N C Y V H - E R M . . . . . : . 2023 GTACTTAAGAGGATGTACCATTTCGAGGGACGTATCGCGCGCCGCGATGG : TTACTATTAT V L K R M Y H F E G R I A R R D G : Y Y Y Y L R G C T I S R D V S R A A M : V T I I G T - E D V P F R G T Y R A P R W : L L L . . . . . . 1887 CGAGGGTCATGTCGTGCTCCGCGACAGATGCATGAACAGATATGTCCCCTATGGGTCCCG R G S C R A P R Q M H E Q I C P L W V P E G H V V L R D R C M N R Y V P Y G S R S R V M S C S A T D A - T D M S P M G P . . . : . . . 1827 GACTGAGAGACAGCGAGTGTATGTCACTAG : GTTATGAGCCTGGATTTTTGTTGTACTTGT D - E T A S V C H - : V M S L D F C C T C T E R Q R V Y V T R : L - A W I F V V L V G L R D S E C M S L : G Y E P G F L L Y L . . . . . . 1373 CATTCTCTTCTTTTTTTAGGCTTCTTGGAGATTTGTCAAGTAGTTGTTTCCATCGCAGCA H S L L F L G F L E I C Q V V V S I A A I L F F F - A S W R F V K - L F P S Q Q S F S S F F R L L G D L S S S C F H R S . . . . . . 1313 GACTTTTTTCTCCTTTATTATGATATTGTTCTATTCTAGAAACAAGTTCATTTGAGACTT D F F L L Y Y D I V L F - K Q V H L R L T F F S F I M I L F Y S R N K F I - D L R L F S P L L - Y C S I L E T S S F E T . . 1253 GATTTTTTCTTTTGA D F F F - I F S F - F F L L Maximal non-overlapping open reading frames (>= 64 codons): none PGL 2 (+ strand): 1155 2323 AGS-1 (1155 1931,1970 2172) SCR (e 0.922 d 0.900 a 0.000,e 0.936) Exon 1 1155 1931 ( 777 n); score: 0.922 Intron 1 1932 1969 ( 38 n); Pd: 0.900 Pa: 0.000 Exon 2 1970 2172 ( 203 n); score: 0.936 PGS (1155 1663) SGN-E541821+ PGS (1504 1664) SGN-E577986+ PGS (1589 1931,1970 2172) SGN-E243101+ 3-phase translation of AGS-1 (+strand): . . . . . . 1155 TGCGGAAATAAAATATAAGTAACTTAATAAAAACCCCAAAACCTGGTAGTCACGTGTACA C G N K I - V T - - K P Q N L V V T C T A E I K Y K - L N K N P K T W - S R V Q R K - N I S N L I K T P K P G S H V Y . . . . . . 1215 AGCCTCTAAAGTATTACAATTGATTCAAAAGAAAAAATCAAGTCTCAAATGAACTTGTTT S L - S I T I D S K E K I K S Q M N L F A S K V L Q L I Q K K K S S L K - T C F K P L K Y Y N - F K R K N Q V S N E L V . . . . . . 1275 CTAGAATAGAACAATATCATAATAAAGGAGAAAAAAGTCTGCTGCGATGGAAACAACTAC L E - N N I I I K E K K V C C D G N N Y - N R T I S - - R R K K S A A M E T T T S R I E Q Y H N K G E K S L L R W K Q L . . . . . . 1335 TTGACAAATCTCCAAGAAGCCTAAAAAAAGAAGAGAATGACAAGTACAACAAAAATCCAG L T N L Q E A - K K K R M T S T T K I Q - Q I S K K P K K R R E - Q V Q Q K S R L D K S P R S L K K E E N D K Y N K N P . . . . . . 1395 GCTCATAACCTACAAAAATTTGTAGAAGCAAGGGGTGAGTACCAAACCACACGGTACTCA A H N L Q K F V E A R G E Y Q T T R Y S L I T Y K N L - K Q G V S T K P H G T Q G S - P T K I C R S K G - V P N H T V L . . . . . . 1455 GCAAGTAAACCTCTAAACACAAGCTAAAGGGATGAAATACGGGTACTCCTACCACCCCCA A S K P L N T S - R D E I R V L L P P P Q V N L - T Q A K G M K Y G Y S Y H P Q S K - T S K H K L K G - N T G T P T T P . . . . . . 1515 ACCGGACCTCCACAACTACAACCTGCATTAAAAATCAACCCAACCTAACAACTCACAGTT T G P P Q L Q P A L K I N P T - Q L T V P D L H N Y N L H - K S T Q P N N S Q F N R T S T T T T C I K N Q P N L T T H S . . . . . . 1575 TACACAGCACGTAGCTCAACAACAACACTTAGACAAATTACATATCCTCAACAACAACAC Y T A R S S T T T L R Q I T Y P Q Q Q H T Q H V A Q Q Q H L D K L H I L N N N T L H S T - L N N N T - T N Y I S S T T T . . . . . . 1635 TTCAACAAATTACATATTCGCAATAACAAGCTCAATATTGATCAATCACAAGTTCCACAG F N K L H I R N N K L N I D Q S Q V P Q S T N Y I F A I T S S I L I N H K F H R L Q Q I T Y S Q - Q A Q Y - S I T S S T . . . . . . 1695 AATGGTAATCATAAGATCAGCAAAACACGATATCAAGTTCACTAATGATAAATATGTGCA N G N H K I S K T R Y Q V H - - - I C A M V I I R S A K H D I K F T N D K Y V Q E W - S - D Q Q N T I S S S L M I N M C . . . . . . 1755 ATACAATGGAATACAATGTCAAGTATAATGATGCATGTCTGACCTAGTGACATACACTCG I Q W N T M S S I M M H V - P S D I H S Y N G I Q C Q V - - C M S D L V T Y T R N T M E Y N V K Y N D A C L T - - H T L . . . . . . 1815 CTGTCTCTCAGTCCGGGACCCATAGGGGACATATCTGTTCATGCATCTGTCGCGGAGCAC L S L S P G P I G D I S V H A S V A E H C L S V R D P - G T Y L F M H L S R S T A V S Q S G T H R G H I C S C I C R G A . . . . . . : 1875 GACATGACCCTCGATAATAGTAACCATCATGGCGCGCGATACGTCTCTCGAAATATA : GTA D M T L D N S N H H G A R Y V S R N I : V T - P S I I V T I M A R D T S L E I - : Y R H D P R - - - P S W R A I R L S K Y : S . . . . . . 1973 TCCATCGCGGCGCGCGATACGTCCCTCGAAATGGTACATCCTCTTAAGTACCCATTCTTT S I A A R D T S L E M V H P L K Y P F F P S R R A I R P S K W Y I L L S T H S F I H R G A R Y V P R N G T S S - V P I L . . . . . . 2033 CTCAATGCACATAACAATTGTCACAACCAATGCCTCATAATAATGCAGATGACACGTTCA L N A H N N C H N Q C L I I M Q M T R S S M H I T I V T T N A S - - C R - H V H S Q C T - Q L S Q P M P H N N A D D T F . . . . . . 2093 CACAAATAATGAGGAGATGACCATTTTCATAACATTGCAGAGTTCACAACCACACAAACA H K - - G D D H F H N I A E F T T T Q T T N N E E M T I F I T L Q S S Q P H K H T Q I M R R - P F S - H C R V H N H T N . . 2153 TGAAATCACATCAACAATGA - N H I N N E I T S T M M K S H Q Q - Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-6+_PGL-2_AGS-1_PPS_1 (1797 1931,1970 2101) (frame '1'; 264 bp, 88 residues) 1 PSDIHSLSLS PGPIGDISVH ASVAEHDMTL DNSNHHGARY VSRNIVSIAA RDTSLEMVHP 61 LKYPFFLNAH NNCHNQCLII MQMTRSHK- >C06HBa0153O03.1-6+_PGL-2_AGS-1_PPS_2 (1546 1782) (frame '2'; 234 bp, 78 residues) 1 KSTQPNNSQF TQHVAQQQHL DKLHILNNNT STNYIFAITS SILINHKFHR MVIIRSAKHD 61 IKFTNDKYVQ YNGIQCQV- AGS-2 (1635 2323) SCR (e 0.896) Exon 1 1635 2323 ( 689 n); score: 0.896 PGS (1635 2323) SGN-E539191- PGS (1922 2316) SGN-E278592- 3-phase translation of AGS-2 (+strand): . . . . . . 1635 TTCAACAAATTACATATTCGCAATAACAAGCTCAATATTGATCAATCACAAGTTCCACAG F N K L H I R N N K L N I D Q S Q V P Q S T N Y I F A I T S S I L I N H K F H R Q Q I T Y S Q - Q A Q Y - S I T S S T . . . . . . 1695 AATGGTAATCATAAGATCAGCAAAACACGATATCAAGTTCACTAATGATAAATATGTGCA N G N H K I S K T R Y Q V H - - - I C A M V I I R S A K H D I K F T N D K Y V Q E W - S - D Q Q N T I S S S L M I N M C . . . . . . 1755 ATACAATGGAATACAATGTCAAGTATAATGATGCATGTCTGACCTAGTGACATACACTCG I Q W N T M S S I M M H V - P S D I H S Y N G I Q C Q V - - C M S D L V T Y T R N T M E Y N V K Y N D A C L T - - H T L . . . . . . 1815 CTGTCTCTCAGTCCGGGACCCATAGGGGACATATCTGTTCATGCATCTGTCGCGGAGCAC L S L S P G P I G D I S V H A S V A E H C L S V R D P - G T Y L F M H L S R S T A V S Q S G T H R G H I C S C I C R G A . . . . . . 1875 GACATGACCCTCGATAATAGTAACCATCATGGCGCGCGATACGTCTCTCGAAATATAGTA D M T L D N S N H H G A R Y V S R N I V T - P S I I V T I M A R D T S L E I - Y R H D P R - - - P S W R A I R L S K Y S . . . . . . 1935 TCCATCGCGACGCGCGATATGTCCCTCGAAATATAGTATCCATCGCGGCGCGCGATACGT S I A T R D M S L E I - Y P S R R A I R P S R R A I C P S K Y S I H R G A R Y V I H R D A R Y V P R N I V S I A A R D T . . . . . . 1995 CCCTCGAAATGGTACATCCTCTTAAGTACCCATTCTTTCTCAATGCACATAACAATTGTC P S K W Y I L L S T H S F S M H I T I V P R N G T S S - V P I L S Q C T - Q L S S L E M V H P L K Y P F F L N A H N N C . . . . . . 2055 ACAACCAATGCCTCATAATAATGCAGATGACACGTTCACACAAATAATGAGGAGATGACC T T N A S - - C R - H V H T N N E E M T Q P M P H N N A D D T F T Q I M R R - P H N Q C L I I M Q M T R S H K - - G D D . . . . . . 2115 ATTTTCATAACATTGCAGAGTTCACAACCACACAAACATGAAATCACATCAACAATGATT I F I T L Q S S Q P H K H E I T S T M I F S - H C R V H N H T N M K S H Q Q - F H F H N I A E F T T T Q T - N H I N N D . . . . . . 2175 CAACAAGCGTTCAACCCTTTCTCAACACATCACACATAAACAATTAATCACATTGCCTTT Q Q A F N P F S T H H T - T I N H I A F N K R S T L S Q H I T H K Q L I T L P F S T S V Q P F L N T S H I N N - S H C L . . . . . . 2235 TGTTATCCCTTTTTTTCATTATTAGAATTAATCAATGTGGGCAAGTCAATCCATCACCAA C Y P F F S L L E L I N V G K S I H H Q V I P F F H Y - N - S M W A S Q S I T N L L S L F F I I R I N Q C G Q V N P S P . . . 2295 TCCCGTTACTCCCAAACATATACCACAAG S R Y S Q T Y T T P V T P K H I P Q I P L L P N I Y H K Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-6+_PGL-2_AGS-2_PPS_1 (1898 2101) (frame '0'; 201 bp, 67 residues) 1 PSWRAIRLSK YSIHRDARYV PRNIVSIAAR DTSLEMVHPL KYPFFLNAHN NCHNQCLIIM 61 QMTRSHK- 3-phase translation of AGS-2 (-strand): . . . . . . 2323 CTTGTGGTATATGTTTGGGAGTAACGGGATTGGTGATGGATTGACTTGCCCACATTGATT L V V Y V W E - R D W - W I D L P T L I L W Y M F G S N G I G D G L T C P H - L C G I C L G V T G L V M D - L A H I D . . . . . . 2263 AATTCTAATAATGAAAAAAAGGGATAACAAAAGGCAATGTGATTAATTGTTTATGTGTGA N S N N E K K G - Q K A M - L I V Y V - I L I M K K R D N K R Q C D - L F M C D - F - - - K K G I T K G N V I N C L C V . . . . . . 2203 TGTGTTGAGAAAGGGTTGAACGCTTGTTGAATCATTGTTGATGTGATTTCATGTTTGTGT C V E K G L N A C - I I V D V I S C L C V L R K G - T L V E S L L M - F H V C V M C - E R V E R L L N H C - C D F M F V . . . . . . 2143 GGTTGTGAACTCTGCAATGTTATGAAAATGGTCATCTCCTCATTATTTGTGTGAACGTGT G C E L C N V M K M V I S S L F V - T C V V N S A M L - K W S S P H Y L C E R V W L - T L Q C Y E N G H L L I I C V N V . . . . . . 2083 CATCTGCATTATTATGAGGCATTGGTTGTGACAATTGTTATGTGCATTGAGAAAGAATGG H L H Y Y E A L V V T I V M C I E K E W I C I I M R H W L - Q L L C A L R K N G S S A L L - G I G C D N C Y V H - E R M . . . . . . 2023 GTACTTAAGAGGATGTACCATTTCGAGGGACGTATCGCGCGCCGCGATGGATACTATATT V L K R M Y H F E G R I A R R D G Y Y I Y L R G C T I S R D V S R A A M D T I F G T - E D V P F R G T Y R A P R W I L Y . . . . . . 1963 TCGAGGGACATATCGCGCGTCGCGATGGATACTATATTTCGAGAGACGTATCGCGCGCCA S R D I S R V A M D T I F R E T Y R A P R G T Y R A S R W I L Y F E R R I A R H F E G H I A R R D G Y Y I S R D V S R A . . . . . . 1903 TGATGGTTACTATTATCGAGGGTCATGTCGTGCTCCGCGACAGATGCATGAACAGATATG - W L L L S R V M S C S A T D A - T D M D G Y Y Y R G S C R A P R Q M H E Q I C M M V T I I E G H V V L R D R C M N R Y . . . . . . 1843 TCCCCTATGGGTCCCGGACTGAGAGACAGCGAGTGTATGTCACTAGGTCAGACATGCATC S P M G P G L R D S E C M S L G Q T C I P L W V P D - E T A S V C H - V R H A S V P Y G S R T E R Q R V Y V T R S D M H . . . . . . 1783 ATTATACTTGACATTGTATTCCATTGTATTGCACATATTTATCATTAGTGAACTTGATAT I I L D I V F H C I A H I Y H - - T - Y L Y L T L Y S I V L H I F I I S E L D I H Y T - H C I P L Y C T Y L S L V N L I . . . . . . 1723 CGTGTTTTGCTGATCTTATGATTACCATTCTGTGGAACTTGTGATTGATCAATATTGAGC R V L L I L - L P F C G T C D - S I L S V F C - S Y D Y H S V E L V I D Q Y - A S C F A D L M I T I L W N L - L I N I E . . . 1663 TTGTTATTGCGAATATGTAATTTGTTGAA L L L R I C N L L C Y C E Y V I C - L V I A N M - F V E Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-6-_PGL-2_AGS-2_PPS_1 (2015 1773) (frame '0'; 240 bp, 80 residues) 1 EDVPFRGTYR APRWILYFEG HIARRDGYYI SRDVSRAMMV TIIEGHVVLR DRCMNRYVPY 61 GSRTERQRVY VTRSDMHHYT - >C06HBa0153O03.1-6-_PGL-2_AGS-2_PPS_2 (2052 1822) (frame '2'; 228 bp, 76 residues) 1 QLLCALRKNG YLRGCTISRD VSRAAMDTIF RGTYRASRWI LYFERRIARH DGYYYRGSCR 61 APRQMHEQIC PLWVPD- ... finished at: Mon Aug 28 22:21:16 2006 ________________________________________________________________________________ Sequence 7: C06HBa0153O03.1-7, from 1 to 8193, both strands analyzed. ... started at: Mon Aug 28 22:21:16 2006 EST library file: /tmp/cxgn-bacpublish-resources-T4dD4R/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 6 ... matches indexed, elapsed seconds = 6 HitsTableSize = 36 EST library file: /tmp/cxgn-bacpublish-resources-T4dD4R/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 104 ******************************************************************************** EST sequence 108 +strand 711 n (File: SGN-E396039+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AAAAACAAAG ATTTTCTCCA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AAATAAAAAA AATTTACTCA TTTTTTCTTG GAGCTAATTC AAAAAAAAAA A Predicted gene structure (within gDNA segment 2672 to 1): Exon 1 1580 1265 ( 316 n); cDNA 1 311 ( 311 n); score: 0.834 Intron 1 1264 1164 ( 101 n); Pd: 0.900 (s: 0.81), Pa: 0.000 (s: 0.52) Exon 2 1163 886 ( 278 n); cDNA 312 566 ( 255 n); score: 0.743 Intron 2 885 823 ( 63 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0.72) Exon 3 822 773 ( 50 n); cDNA 567 614 ( 48 n); score: 0.720 PPA cDNA 661 672 MATCH C06HBa0153O03.1-7- SGN-E396039+ 0.786 644 0.906 C PGS_C06HBa0153O03.1-7-_SGN-E396039+ (1580 1265,1163 886,822 773) Alignment (genomic DNA sequence = upper lines): ATCGATTC-C CTTCTCTCTC TTTCTCTTGT TCTTTCTATT TTCTTTATTC AAACCCTCTT 1522 |||||||| | |||||||||| ||||| || | ||||| || ||| |||||| |||||||| | ATCGATTCTC CTTCTCTCTC TTTCTGTT-C T-TTTCT-TT TTC-TTATTC AAACCCTCCT 56 TCTTTTACCC TAATTAGTAT ATAATTAAGA ATAAAAGATG ACAATAATAG CCCACTAATT 1462 |||||||||| ||||||| || |||||||||| |||||||||| ||||||| |||||||||| TCTTTTACCC TAATTAGCAT ATAATTAAGA ATAAAAGATG -GAATAATAA CCCACTAATT 115 AACTTAAGGT TACCTCTTTT ATTCCCCCAA GAAATT-GAG TTATTAATAT AGACCCACGA 1403 ||| ||||| |||||||||| | |||||| | |||| || ||||||| || | |||||| | TACTCAAGGT TACCTCTTTT A-ACCCCCAG GTAATTAGAC TTATTAACAT AAACCCACTA 174 AATATATAAT TATAGCAGGA ATAGTCCAAA ACGCCCCTTT AAAACTTAAC CAGAATTCCG 1343 | | |||||| || || |||| |||||||||| ||| ||| || ||||| | |||| |||| ACTTTATAAT TAAAGTAGGA ATAGTCCAAA ACGTCCC-TT AAAACGTGTA AAGAAATCCG 233 ACTTCAACTG GGATTACGCA ACCTGTGACG GCCCGTCGCG CCTGCGACGG TCCGTCATGC 1283 || |||| |||||||||| |||||||| | |||||||| | |||||||||| |||||| ||| ACCCAGACTG GGATTACGCA ACCTGTGATG GCCCGTCGTG CCTGCGACGG TCCGTCCTGC 293 AGGTTCGTC- AGAGATTCGA TTTCCTTAAG GAGTCTGTGA CGGCCCGTCG TACCTACGAC 1224 ||| ||||| || ||| AGG-TCGTCG CAAGGTTCA. .......... .......... .......... .......... 311 GGTCCGTCCT GCATTTCCGT CACGACGTTC AGAGAATCGT TCCCTGTACC AAATTCTCAA 1164 .......... .......... .......... .......... .......... .......... 311 GAGTTGGAGT GTTTTGAAAC GGTGGATCAC GACGGTTCAT CGTGCCTGTG ACGGTCCGTC 1104 ||| | | || | | | | |||||| | | | ||| ||| |||||||||| GAGACTCAAT -TTCCACCAA AGAGTCT-GT GACGGTCCGT CACGCCCGTG ACGGTCCGTC 369 CTGCAGGTCC GTCACAGAGT TCAGAGAGTC -AATTTCAGC ACCCAAATTT CAGAATTTCT 1045 ||| ||| || || ||| |||||||||| | ||| || |||| ||||| |||||||||| GTGCCATTCC GTTACGAAGT TCAGAGAGTC GATTTTTAGT ACCC-AATTT CAGAATTTCT 428 AAGTGTTTTG GGACGAAACA CCCTCGACGG TCCGTCGTGC CCATGACGTT CCGTCATGCC 985 |||||||||| |||| || |||||| || || |||| || | | | | AAGTGTTTTG AAACGAGAC- TCCTCGA--- --CG--GT-- CCAT--CG-T --G-C-T--- 468 CATGACGTTC CGTCGTGGGT TCCGTCGTCT CAGCCTGTTT TTCCAGAAAT AAAATCTGCT 925 ||||||| || |||||||||| |||||||||| || ||||||| ||||| |||| |||||||||| CATGACGGTC CGTCGTGGGT TCCGTCGTCT CAACCTGTTT TTCCAAAAAT AAAATCTGCT 528 GCTCAAAACA ACTAAACAGG TCGTTACAAA ATATTTTTTA TAAATATTTT GACTTTTTAT 865 |||||||| |||||||||| ||||||| | ||| || | ACTCAAAACG ACTAAACAGG TCGTTAC-AT TTATGTTCT. .......... .......... 566 CTTATTAATT TTTATATTTT TTTAATCTAG CTATTTAATT TTTCTTAATT ATTATTATTA 805 || || || | | ||||| .......... .......... .......... .......... ..TCCTACTT A--AATATTA 582 TTATTATTTT ATAACAAAAA AAATAATTAA AA 773 |||||||||| | | | | | | ||||| || TTATTATTTT ACGATTTATA ACACTATTAA AA 614 hqPGS_C06HBa0153O03.1-7-_SGN-E396039+ (1580 1265,1163 886,822 773) ******************************************************************************** EST sequence 52 +strand 726 n (File: SGN-E550322+) 1 TCGCACCAGA TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC 61 CTCCTTCTTT TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT 121 AATTTACTCA AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA 181 CTAACTTTAT AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC 241 CGACCCAGAC TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT 301 GCAGGTCGTC GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA 361 CGCCCGTGAC GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC 421 CCAATTTCAG AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT 481 GACGGTCCGT CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT 541 CAAAACGACT AAACAGGTCG TTACATTTAT GTTCTTCCTA CTTAAATATT ATTATTATTT 601 TACGATTTAT AACACTATTA GAAACAAAGA TTTTCTCAAC CATGAATTAA TGAAAAAATT 661 ATGCCATAAA ATATAAAAAA TTTACTCATT TTTCATTGAG CTAATTCATA AAAAAAAAAA 721 AAAAAC Predicted gene structure (within gDNA segment 2762 to 1): Exon 1 1589 1265 ( 325 n); cDNA 1 320 ( 320 n); score: 0.823 Intron 1 1264 1164 ( 101 n); Pd: 0.900 (s: 0.81), Pa: 0.000 (s: 0.52) Exon 2 1163 886 ( 278 n); cDNA 321 575 ( 255 n); score: 0.743 Intron 2 885 827 ( 59 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0) Exon 3 826 789 ( 38 n); cDNA 576 614 ( 39 n); score: 0.724 PPA cDNA 708 725 MATCH C06HBa0153O03.1-7- SGN-E550322+ 0.786 641 0.883 C PGS_C06HBa0153O03.1-7-_SGN-E550322+ (1589 1265,1163 886,826 789) Alignment (genomic DNA sequence = upper lines): TCTCTCTCTA TCGATTC-CC TTCTCTCTCT TTCTCTTGTT CTTTCTATTT TCTTTATTCA 1531 || | | | ||||||| || |||||||||| |||| || | ||||| ||| || ||||||| TCGCACCAGA TCGATTCTCC TTCTCTCTCT TTCTGTT-CT -TTTCT-TTT TC-TTATTCA 56 AACCCTCTTT CTTTTACCCT AATTAGTATA TAATTAAGAA TAAAAGATGA CAATAATAGC 1471 ||||||| || |||||||||| |||||| ||| |||||||||| ||||||||| ||||||| | AACCCTCCTT CTTTTACCCT AATTAGCATA TAATTAAGAA TAAAAGATGG -AATAATAAC 115 CCACTAATTA ACTTAAGGTT ACCTCTTTTA TTCCCCCAAG AAATT-GAGT TATTAATATA 1412 ||||||||| ||| |||||| |||||||||| |||||| | |||| || | |||||| ||| CCACTAATTT ACTCAAGGTT ACCTCTTTTA -ACCCCCAGG TAATTAGACT TATTAACATA 174 GACCCACGAA ATATATAATT ATAGCAGGAA TAGTCCAAAA CGCCCCTTTA AAACTTAACC 1352 |||||| || | ||||||| | || ||||| |||||||||| || ||| ||| |||| | AACCCACTAA CTTTATAATT AAAGTAGGAA TAGTCCAAAA CGTCCC-TTA AAACGTGTAA 233 AGAATTCCGA CTTCAACTGG GATTACGCAA CCTGTGACGG CCCGTCGCGC CTGCGACGGT 1292 |||| ||||| | ||||| |||||||||| ||||||| || ||||||| || |||||||||| AGAAATCCGA CCCAGACTGG GATTACGCAA CCTGTGATGG CCCGTCGTGC CTGCGACGGT 293 CCGTCATGCA GGTTCGTC-A GAGATTCGAT TTCCTTAAGG AGTCTGTGAC GGCCCGTCGT 1233 ||||| |||| || ||||| || ||| CCGTCCTGCA GG-TCGTCGC AAGGTTCA.. .......... .......... .......... 320 ACCTACGACG GTCCGTCCTG CATTTCCGTC ACGACGTTCA GAGAATCGTT CCCTGTACCA 1173 .......... .......... .......... .......... .......... .......... 320 AATTCTCAAG AGTTGGAGTG TTTTGAAACG GTGGATCACG ACGGTTCATC GTGCCTGTGA 1113 | || | | || | | | | | ||||| | || ||| |||| .........G AGACTCAAT- TTCCACCAAA GAGTCT-GTG ACGGTCCGTC ACGCCCGTGA 369 CGGTCCGTCC TGCAGGTCCG TCACAGAGTT CAGAGAGTC- AATTTCAGCA CCCAAATTTC 1054 ||||||||| ||| |||| | || |||| ||||||||| | ||| || | ||| |||||| CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCC-AATTTC 428 AGAATTTCTA AGTGTTTTGG GACGAAACAC CCTCGACGGT CCGTCGTGCC CATGACGTTC 994 |||||||||| ||||||||| |||| || |||||| || || | ||| || | AGAATTTCTA AGTGTTTTGA AACGAGAC-T CCTCGA---- -CG--GT--C CAT--CG-T- 474 CGTCATGCCC ATGACGTTCC GTCGTGGGTT CCGTCGTCTC AGCCTGTTTT TCCAGAAATA 934 | | | | |||||| ||| |||||||||| |||||||||| | |||||||| |||| ||||| -G-C-T---C ATGACGGTCC GTCGTGGGTT CCGTCGTCTC AACCTGTTTT TCCAAAAATA 528 AAATCTGCTG CTCAAAACAA CTAAACAGGT CGTTACAAAA TATTTTTTAT AAATATTTTG 874 ||||||||| |||||||| | |||||||||| |||||| | ||| || | AAATCTGCTA CTCAAAACGA CTAAACAGGT CGTTAC-ATT TATGTTCT.. .......... 575 ACTTTTTATC TTATTAATTT TTATATTTTT TTAATCTAGC TATTTAATTT TTCTTAATTA 814 | | ||||| || .......... .......... .......... .......... .......TCC TACTTAAATA 588 TTATTATTAT TAT-TATTTT ATAACA 789 |||||||||| | | ||| |||||| TTATTATTAT TTTACGATTT ATAACA 614 hqPGS_C06HBa0153O03.1-7-_SGN-E550322+ (1589 1265,1163 886,826 789) ******************************************************************************** EST sequence 50 +strand 729 n (File: SGN-E550212+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCAA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA AAAAAACTCG 721 GGGGGGGGC Predicted gene structure (within gDNA segment 2672 to 1): Exon 1 1580 1265 ( 316 n); cDNA 1 311 ( 311 n); score: 0.834 Intron 1 1264 1164 ( 101 n); Pd: 0.900 (s: 0.81), Pa: 0.000 (s: 0.52) Exon 2 1163 886 ( 278 n); cDNA 312 566 ( 255 n); score: 0.743 Intron 2 885 827 ( 59 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0) Exon 3 826 789 ( 38 n); cDNA 567 605 ( 39 n); score: 0.724 PPA cDNA 699 716 MATCH C06HBa0153O03.1-7- SGN-E550212+ 0.791 632 0.867 C PGS_C06HBa0153O03.1-7-_SGN-E550212+ (1580 1265,1163 886,826 789) Alignment (genomic DNA sequence = upper lines): ATCGATTC-C CTTCTCTCTC TTTCTCTTGT TCTTTCTATT TTCTTTATTC AAACCCTCTT 1522 |||||||| | |||||||||| ||||| || | ||||| || ||| |||||| |||||||| | ATCGATTCTC CTTCTCTCTC TTTCTGTT-C T-TTTCT-TT TTC-TTATTC AAACCCTCCT 56 TCTTTTACCC TAATTAGTAT ATAATTAAGA ATAAAAGATG ACAATAATAG CCCACTAATT 1462 |||||||||| ||||||| || |||||||||| |||||||||| ||||||| |||||||||| TCTTTTACCC TAATTAGCAT ATAATTAAGA ATAAAAGATG -GAATAATAA CCCACTAATT 115 AACTTAAGGT TACCTCTTTT ATTCCCCCAA GAAATT-GAG TTATTAATAT AGACCCACGA 1403 ||| ||||| |||||||||| | |||||| | |||| || ||||||| || | |||||| | TACTCAAGGT TACCTCTTTT A-ACCCCCAG GTAATTAGAC TTATTAACAT AAACCCACTA 174 AATATATAAT TATAGCAGGA ATAGTCCAAA ACGCCCCTTT AAAACTTAAC CAGAATTCCG 1343 | | |||||| || || |||| |||||||||| ||| ||| || ||||| | |||| |||| ACTTTATAAT TAAAGTAGGA ATAGTCCAAA ACGTCCC-TT AAAACGTGTA AAGAAATCCG 233 ACTTCAACTG GGATTACGCA ACCTGTGACG GCCCGTCGCG CCTGCGACGG TCCGTCATGC 1283 || |||| |||||||||| |||||||| | |||||||| | |||||||||| |||||| ||| ACCCAGACTG GGATTACGCA ACCTGTGATG GCCCGTCGTG CCTGCGACGG TCCGTCCTGC 293 AGGTTCGTC- AGAGATTCGA TTTCCTTAAG GAGTCTGTGA CGGCCCGTCG TACCTACGAC 1224 ||| ||||| || ||| AGG-TCGTCG CAAGGTTCA. .......... .......... .......... .......... 311 GGTCCGTCCT GCATTTCCGT CACGACGTTC AGAGAATCGT TCCCTGTACC AAATTCTCAA 1164 .......... .......... .......... .......... .......... .......... 311 GAGTTGGAGT GTTTTGAAAC GGTGGATCAC GACGGTTCAT CGTGCCTGTG ACGGTCCGTC 1104 ||| | | || | | | | |||||| | | | ||| ||| |||||||||| GAGACTCAAT -TTCCACCAA AGAGTCT-GT GACGGTCCGT CACGCCCGTG ACGGTCCGTC 369 CTGCAGGTCC GTCACAGAGT TCAGAGAGTC -AATTTCAGC ACCCAAATTT CAGAATTTCT 1045 ||| ||| || || ||| |||||||||| | ||| || |||| ||||| |||||||||| GTGCCATTCC GTTACGAAGT TCAGAGAGTC GATTTTTAGT ACCC-AATTT CAGAATTTCT 428 AAGTGTTTTG GGACGAAACA CCCTCGACGG TCCGTCGTGC CCATGACGTT CCGTCATGCC 985 |||||||||| |||| || |||||| || || |||| || | | | | AAGTGTTTTG AAACGAGAC- TCCTCGA--- --CG--GT-- CCAT--CG-T --G-C-T--- 468 CATGACGTTC CGTCGTGGGT TCCGTCGTCT CAGCCTGTTT TTCCAGAAAT AAAATCTGCT 925 ||||||| || |||||||||| |||||||||| || ||||||| ||||| |||| |||||||||| CATGACGGTC CGTCGTGGGT TCCGTCGTCT CAACCTGTTT TTCCAAAAAT AAAATCTGCT 528 GCTCAAAACA ACTAAACAGG TCGTTACAAA ATATTTTTTA TAAATATTTT GACTTTTTAT 865 |||||||| |||||||||| ||||||| | ||| || | ACTCAAAACG ACTAAACAGG TCGTTAC-AT TTATGTTCT. .......... .......... 566 CTTATTAATT TTTATATTTT TTTAATCTAG CTATTTAATT TTTCTTAATT ATTATTATTA 805 | | ||||| | |||||||||| .......... .......... .......... ........TC CTACTTAAAT ATTATTATTA 588 TTAT-TATTT TATAACA 789 || | || ||||||| TTTTACGATT TATAACA 605 hqPGS_C06HBa0153O03.1-7-_SGN-E550212+ (1580 1265,1163 886,826 789) ******************************************************************************** EST sequence 51 +strand 710 n (File: SGN-E550065+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCAA CCATGAATTA ATGAAAAAAT TATGCCATGA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTGATTCAT AAGAAAAAAA Predicted gene structure (within gDNA segment 2672 to 1): Exon 1 1580 1265 ( 316 n); cDNA 1 311 ( 311 n); score: 0.834 Intron 1 1264 1164 ( 101 n); Pd: 0.900 (s: 0.81), Pa: 0.000 (s: 0.52) Exon 2 1163 886 ( 278 n); cDNA 312 566 ( 255 n); score: 0.743 Intron 2 885 827 ( 59 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0) Exon 3 826 789 ( 38 n); cDNA 567 605 ( 39 n); score: 0.724 MATCH C06HBa0153O03.1-7- SGN-E550065+ 0.791 632 0.890 C PGS_C06HBa0153O03.1-7-_SGN-E550065+ (1580 1265,1163 886,826 789) Alignment (genomic DNA sequence = upper lines): ATCGATTC-C CTTCTCTCTC TTTCTCTTGT TCTTTCTATT TTCTTTATTC AAACCCTCTT 1522 |||||||| | |||||||||| ||||| || | ||||| || ||| |||||| |||||||| | ATCGATTCTC CTTCTCTCTC TTTCTGTT-C T-TTTCT-TT TTC-TTATTC AAACCCTCCT 56 TCTTTTACCC TAATTAGTAT ATAATTAAGA ATAAAAGATG ACAATAATAG CCCACTAATT 1462 |||||||||| ||||||| || |||||||||| |||||||||| ||||||| |||||||||| TCTTTTACCC TAATTAGCAT ATAATTAAGA ATAAAAGATG -GAATAATAA CCCACTAATT 115 AACTTAAGGT TACCTCTTTT ATTCCCCCAA GAAATT-GAG TTATTAATAT AGACCCACGA 1403 ||| ||||| |||||||||| | |||||| | |||| || ||||||| || | |||||| | TACTCAAGGT TACCTCTTTT A-ACCCCCAG GTAATTAGAC TTATTAACAT AAACCCACTA 174 AATATATAAT TATAGCAGGA ATAGTCCAAA ACGCCCCTTT AAAACTTAAC CAGAATTCCG 1343 | | |||||| || || |||| |||||||||| ||| ||| || ||||| | |||| |||| ACTTTATAAT TAAAGTAGGA ATAGTCCAAA ACGTCCC-TT AAAACGTGTA AAGAAATCCG 233 ACTTCAACTG GGATTACGCA ACCTGTGACG GCCCGTCGCG CCTGCGACGG TCCGTCATGC 1283 || |||| |||||||||| |||||||| | |||||||| | |||||||||| |||||| ||| ACCCAGACTG GGATTACGCA ACCTGTGATG GCCCGTCGTG CCTGCGACGG TCCGTCCTGC 293 AGGTTCGTC- AGAGATTCGA TTTCCTTAAG GAGTCTGTGA CGGCCCGTCG TACCTACGAC 1224 ||| ||||| || ||| AGG-TCGTCG CAAGGTTCA. .......... .......... .......... .......... 311 GGTCCGTCCT GCATTTCCGT CACGACGTTC AGAGAATCGT TCCCTGTACC AAATTCTCAA 1164 .......... .......... .......... .......... .......... .......... 311 GAGTTGGAGT GTTTTGAAAC GGTGGATCAC GACGGTTCAT CGTGCCTGTG ACGGTCCGTC 1104 ||| | | || | | | | |||||| | | | ||| ||| |||||||||| GAGACTCAAT -TTCCACCAA AGAGTCT-GT GACGGTCCGT CACGCCCGTG ACGGTCCGTC 369 CTGCAGGTCC GTCACAGAGT TCAGAGAGTC -AATTTCAGC ACCCAAATTT CAGAATTTCT 1045 ||| ||| || || ||| |||||||||| | ||| || |||| ||||| |||||||||| GTGCCATTCC GTTACGAAGT TCAGAGAGTC GATTTTTAGT ACCC-AATTT CAGAATTTCT 428 AAGTGTTTTG GGACGAAACA CCCTCGACGG TCCGTCGTGC CCATGACGTT CCGTCATGCC 985 |||||||||| |||| || |||||| || || |||| || | | | | AAGTGTTTTG AAACGAGAC- TCCTCGA--- --CG--GT-- CCAT--CG-T --G-C-T--- 468 CATGACGTTC CGTCGTGGGT TCCGTCGTCT CAGCCTGTTT TTCCAGAAAT AAAATCTGCT 925 ||||||| || |||||||||| |||||||||| || ||||||| ||||| |||| |||||||||| CATGACGGTC CGTCGTGGGT TCCGTCGTCT CAACCTGTTT TTCCAAAAAT AAAATCTGCT 528 GCTCAAAACA ACTAAACAGG TCGTTACAAA ATATTTTTTA TAAATATTTT GACTTTTTAT 865 |||||||| |||||||||| ||||||| | ||| || | ACTCAAAACG ACTAAACAGG TCGTTAC-AT TTATGTTCT. .......... .......... 566 CTTATTAATT TTTATATTTT TTTAATCTAG CTATTTAATT TTTCTTAATT ATTATTATTA 805 | | ||||| | |||||||||| .......... .......... .......... ........TC CTACTTAAAT ATTATTATTA 588 TTAT-TATTT TATAACA 789 || | || ||||||| TTTTACGATT TATAACA 605 hqPGS_C06HBa0153O03.1-7-_SGN-E550065+ (1580 1265,1163 886,826 789) ******************************************************************************** EST sequence 54 +strand 732 n (File: SGN-E550201+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCNA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA AAAAAAAACT 721 CGAGGGGGGG CC Predicted gene structure (within gDNA segment 2672 to 1): Exon 1 1580 1265 ( 316 n); cDNA 1 311 ( 311 n); score: 0.834 Intron 1 1264 1164 ( 101 n); Pd: 0.900 (s: 0.81), Pa: 0.000 (s: 0.52) Exon 2 1163 886 ( 278 n); cDNA 312 566 ( 255 n); score: 0.743 Intron 2 885 827 ( 59 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0) Exon 3 826 789 ( 38 n); cDNA 567 605 ( 39 n); score: 0.724 PPA cDNA 699 718 MATCH C06HBa0153O03.1-7- SGN-E550201+ 0.791 632 0.863 C PGS_C06HBa0153O03.1-7-_SGN-E550201+ (1580 1265,1163 886,826 789) Alignment (genomic DNA sequence = upper lines): ATCGATTC-C CTTCTCTCTC TTTCTCTTGT TCTTTCTATT TTCTTTATTC AAACCCTCTT 1522 |||||||| | |||||||||| ||||| || | ||||| || ||| |||||| |||||||| | ATCGATTCTC CTTCTCTCTC TTTCTGTT-C T-TTTCT-TT TTC-TTATTC AAACCCTCCT 56 TCTTTTACCC TAATTAGTAT ATAATTAAGA ATAAAAGATG ACAATAATAG CCCACTAATT 1462 |||||||||| ||||||| || |||||||||| |||||||||| ||||||| |||||||||| TCTTTTACCC TAATTAGCAT ATAATTAAGA ATAAAAGATG -GAATAATAA CCCACTAATT 115 AACTTAAGGT TACCTCTTTT ATTCCCCCAA GAAATT-GAG TTATTAATAT AGACCCACGA 1403 ||| ||||| |||||||||| | |||||| | |||| || ||||||| || | |||||| | TACTCAAGGT TACCTCTTTT A-ACCCCCAG GTAATTAGAC TTATTAACAT AAACCCACTA 174 AATATATAAT TATAGCAGGA ATAGTCCAAA ACGCCCCTTT AAAACTTAAC CAGAATTCCG 1343 | | |||||| || || |||| |||||||||| ||| ||| || ||||| | |||| |||| ACTTTATAAT TAAAGTAGGA ATAGTCCAAA ACGTCCC-TT AAAACGTGTA AAGAAATCCG 233 ACTTCAACTG GGATTACGCA ACCTGTGACG GCCCGTCGCG CCTGCGACGG TCCGTCATGC 1283 || |||| |||||||||| |||||||| | |||||||| | |||||||||| |||||| ||| ACCCAGACTG GGATTACGCA ACCTGTGATG GCCCGTCGTG CCTGCGACGG TCCGTCCTGC 293 AGGTTCGTC- AGAGATTCGA TTTCCTTAAG GAGTCTGTGA CGGCCCGTCG TACCTACGAC 1224 ||| ||||| || ||| AGG-TCGTCG CAAGGTTCA. .......... .......... .......... .......... 311 GGTCCGTCCT GCATTTCCGT CACGACGTTC AGAGAATCGT TCCCTGTACC AAATTCTCAA 1164 .......... .......... .......... .......... .......... .......... 311 GAGTTGGAGT GTTTTGAAAC GGTGGATCAC GACGGTTCAT CGTGCCTGTG ACGGTCCGTC 1104 ||| | | || | | | | |||||| | | | ||| ||| |||||||||| GAGACTCAAT -TTCCACCAA AGAGTCT-GT GACGGTCCGT CACGCCCGTG ACGGTCCGTC 369 CTGCAGGTCC GTCACAGAGT TCAGAGAGTC -AATTTCAGC ACCCAAATTT CAGAATTTCT 1045 ||| ||| || || ||| |||||||||| | ||| || |||| ||||| |||||||||| GTGCCATTCC GTTACGAAGT TCAGAGAGTC GATTTTTAGT ACCC-AATTT CAGAATTTCT 428 AAGTGTTTTG GGACGAAACA CCCTCGACGG TCCGTCGTGC CCATGACGTT CCGTCATGCC 985 |||||||||| |||| || |||||| || || |||| || | | | | AAGTGTTTTG AAACGAGAC- TCCTCGA--- --CG--GT-- CCAT--CG-T --G-C-T--- 468 CATGACGTTC CGTCGTGGGT TCCGTCGTCT CAGCCTGTTT TTCCAGAAAT AAAATCTGCT 925 ||||||| || |||||||||| |||||||||| || ||||||| ||||| |||| |||||||||| CATGACGGTC CGTCGTGGGT TCCGTCGTCT CAACCTGTTT TTCCAAAAAT AAAATCTGCT 528 GCTCAAAACA ACTAAACAGG TCGTTACAAA ATATTTTTTA TAAATATTTT GACTTTTTAT 865 |||||||| |||||||||| ||||||| | ||| || | ACTCAAAACG ACTAAACAGG TCGTTAC-AT TTATGTTCT. .......... .......... 566 CTTATTAATT TTTATATTTT TTTAATCTAG CTATTTAATT TTTCTTAATT ATTATTATTA 805 | | ||||| | |||||||||| .......... .......... .......... ........TC CTACTTAAAT ATTATTATTA 588 TTAT-TATTT TATAACA 789 || | || ||||||| TTTTACGATT TATAACA 605 hqPGS_C06HBa0153O03.1-7-_SGN-E550201+ (1580 1265,1163 886,826 789) ******************************************************************************** EST sequence 55 +strand 709 n (File: SGN-E550207+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTNCA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAA Predicted gene structure (within gDNA segment 2672 to 1): Exon 1 1580 1265 ( 316 n); cDNA 1 311 ( 311 n); score: 0.834 Intron 1 1264 1164 ( 101 n); Pd: 0.900 (s: 0.81), Pa: 0.000 (s: 0.52) Exon 2 1163 886 ( 278 n); cDNA 312 566 ( 255 n); score: 0.743 Intron 2 885 827 ( 59 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0) Exon 3 826 789 ( 38 n); cDNA 567 605 ( 39 n); score: 0.724 PPA cDNA 699 709 MATCH C06HBa0153O03.1-7- SGN-E550207+ 0.791 632 0.891 C PGS_C06HBa0153O03.1-7-_SGN-E550207+ (1580 1265,1163 886,826 789) Alignment (genomic DNA sequence = upper lines): ATCGATTC-C CTTCTCTCTC TTTCTCTTGT TCTTTCTATT TTCTTTATTC AAACCCTCTT 1522 |||||||| | |||||||||| ||||| || | ||||| || ||| |||||| |||||||| | ATCGATTCTC CTTCTCTCTC TTTCTGTT-C T-TTTCT-TT TTC-TTATTC AAACCCTCCT 56 TCTTTTACCC TAATTAGTAT ATAATTAAGA ATAAAAGATG ACAATAATAG CCCACTAATT 1462 |||||||||| ||||||| || |||||||||| |||||||||| ||||||| |||||||||| TCTTTTACCC TAATTAGCAT ATAATTAAGA ATAAAAGATG -GAATAATAA CCCACTAATT 115 AACTTAAGGT TACCTCTTTT ATTCCCCCAA GAAATT-GAG TTATTAATAT AGACCCACGA 1403 ||| ||||| |||||||||| | |||||| | |||| || ||||||| || | |||||| | TACTCAAGGT TACCTCTTTT A-ACCCCCAG GTAATTAGAC TTATTAACAT AAACCCACTA 174 AATATATAAT TATAGCAGGA ATAGTCCAAA ACGCCCCTTT AAAACTTAAC CAGAATTCCG 1343 | | |||||| || || |||| |||||||||| ||| ||| || ||||| | |||| |||| ACTTTATAAT TAAAGTAGGA ATAGTCCAAA ACGTCCC-TT AAAACGTGTA AAGAAATCCG 233 ACTTCAACTG GGATTACGCA ACCTGTGACG GCCCGTCGCG CCTGCGACGG TCCGTCATGC 1283 || |||| |||||||||| |||||||| | |||||||| | |||||||||| |||||| ||| ACCCAGACTG GGATTACGCA ACCTGTGATG GCCCGTCGTG CCTGCGACGG TCCGTCCTGC 293 AGGTTCGTC- AGAGATTCGA TTTCCTTAAG GAGTCTGTGA CGGCCCGTCG TACCTACGAC 1224 ||| ||||| || ||| AGG-TCGTCG CAAGGTTCA. .......... .......... .......... .......... 311 GGTCCGTCCT GCATTTCCGT CACGACGTTC AGAGAATCGT TCCCTGTACC AAATTCTCAA 1164 .......... .......... .......... .......... .......... .......... 311 GAGTTGGAGT GTTTTGAAAC GGTGGATCAC GACGGTTCAT CGTGCCTGTG ACGGTCCGTC 1104 ||| | | || | | | | |||||| | | | ||| ||| |||||||||| GAGACTCAAT -TTCCACCAA AGAGTCT-GT GACGGTCCGT CACGCCCGTG ACGGTCCGTC 369 CTGCAGGTCC GTCACAGAGT TCAGAGAGTC -AATTTCAGC ACCCAAATTT CAGAATTTCT 1045 ||| ||| || || ||| |||||||||| | ||| || |||| ||||| |||||||||| GTGCCATTCC GTTACGAAGT TCAGAGAGTC GATTTTTAGT ACCC-AATTT CAGAATTTCT 428 AAGTGTTTTG GGACGAAACA CCCTCGACGG TCCGTCGTGC CCATGACGTT CCGTCATGCC 985 |||||||||| |||| || |||||| || || |||| || | | | | AAGTGTTTTG AAACGAGAC- TCCTCGA--- --CG--GT-- CCAT--CG-T --G-C-T--- 468 CATGACGTTC CGTCGTGGGT TCCGTCGTCT CAGCCTGTTT TTCCAGAAAT AAAATCTGCT 925 ||||||| || |||||||||| |||||||||| || ||||||| ||||| |||| |||||||||| CATGACGGTC CGTCGTGGGT TCCGTCGTCT CAACCTGTTT TTCCAAAAAT AAAATCTGCT 528 GCTCAAAACA ACTAAACAGG TCGTTACAAA ATATTTTTTA TAAATATTTT GACTTTTTAT 865 |||||||| |||||||||| ||||||| | ||| || | ACTCAAAACG ACTAAACAGG TCGTTAC-AT TTATGTTCT. .......... .......... 566 CTTATTAATT TTTATATTTT TTTAATCTAG CTATTTAATT TTTCTTAATT ATTATTATTA 805 | | ||||| | |||||||||| .......... .......... .......... ........TC CTACTTAAAT ATTATTATTA 588 TTAT-TATTT TATAACA 789 || | || ||||||| TTTTACGATT TATAACA 605 hqPGS_C06HBa0153O03.1-7-_SGN-E550207+ (1580 1265,1163 886,826 789) ******************************************************************************** EST sequence 56 +strand 715 n (File: SGN-E550335+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAATCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT 481 CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT CAAAACGACT 541 AAACAGGTCG TTACATTTAT GTTCTTCCTA CTTAAATATT ATTATTATTT TACGATTTAT 601 AACACTATTA GAAACAAAGA TTTTCTCNAC CATGAATTAA TGAAAAAATT ATGCCATAAA 661 ATATAAAAAA TTTACTCATT TTTCATTGAG CTAATTCATA AAAAAAAAAA AAAAA Predicted gene structure (within gDNA segment 2662 to 1): Exon 1 1580 1265 ( 316 n); cDNA 1 310 ( 310 n); score: 0.831 Intron 1 1264 1161 ( 104 n); Pd: 0.900 (s: 0.81), Pa: 0.000 (s: 0.50) Exon 2 1160 886 ( 275 n); cDNA 311 565 ( 255 n); score: 0.738 Intron 2 885 827 ( 59 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0) Exon 3 826 789 ( 38 n); cDNA 566 604 ( 39 n); score: 0.724 PPA cDNA 698 715 MATCH C06HBa0153O03.1-7- SGN-E550335+ 0.788 629 0.880 C PGS_C06HBa0153O03.1-7-_SGN-E550335+ (1580 1265,1160 886,826 789) Alignment (genomic DNA sequence = upper lines): ATCGATTC-C CTTCTCTCTC TTTCTCTTGT TCTTTCTATT TTCTTTATTC AAACCCTCTT 1522 |||||||| | |||||||||| ||||| || | ||||| || ||| |||||| ||||||| | ATCGATTCTC CTTCTCTCTC TTTCTGTT-C T-TTTCT-TT TTC-TTATTC -AACCCTCCT 55 TCTTTTACCC TAATTAGTAT ATAATTAAGA ATAAAAGATG ACAATAATAG CCCACTAATT 1462 |||||||||| ||||||| || |||||||||| |||||||||| ||||||| |||||||||| TCTTTTACCC TAATTAGCAT ATAATTAAGA ATAAAAGATG -GAATAATAA CCCACTAATT 114 AACTTAAGGT TACCTCTTTT ATTCCCCCAA GAAATT-GAG TTATTAATAT AGACCCACGA 1403 ||| ||||| |||||||||| | |||||| | |||| || ||||||| || | |||||| | TACTCAAGGT TACCTCTTTT A-ACCCCCAG GTAATTAGAC TTATTAACAT AAACCCACTA 173 AATATATAAT TATAGCAGGA ATAGTCCAAA ACGCCCCTTT AAAACTTAAC CAGAATTCCG 1343 | | |||||| || || |||| |||||||||| ||| ||| || ||||| | |||| |||| ACTTTATAAT TAAAGTAGGA ATAGTCCAAA ACGTCCC-TT AAAACGTGTA AAGAAATCCG 232 ACTTCAACTG GGATTACGCA ACCTGTGACG GCCCGTCGCG CCTGCGACGG TCCGTCATGC 1283 || |||| |||||||||| |||||||| | |||||||| | |||||||||| |||||| ||| ACCCAGACTG GGATTACGCA ACCTGTGATG GCCCGTCGTG CCTGCGACGG TCCGTCCTGC 292 AGGTTCGTC- AGAGATTCGA TTTCCTTAAG GAGTCTGTGA CGGCCCGTCG TACCTACGAC 1224 ||| ||||| || ||| AGG-TCGTCG CAAGGTTCA. .......... .......... .......... .......... 310 GGTCCGTCCT GCATTTCCGT CACGACGTTC AGAGAATCGT TCCCTGTACC AAATTCTCAA 1164 .......... .......... .......... .......... .......... .......... 310 GAGTTGGAGT GTTTTGAAAC GGTGGATC-A CGACGGTTCA TCGTGCCTGT GACGGTCCGT 1105 | ||| | | | ||| |||||| | || ||| || |||||||||| ...GAGACTC AATTTCCACC AAAGAATCTG TGACGGTCCG TCACGCCCGT GACGGTCCGT 367 CCTGCAGGTC CGTCACAGAG TTCAGAGAGT C-AATTTCAG CACCCAAATT TCAGAATTTC 1046 | ||| || ||| || || |||||||||| | | ||| || |||| |||| |||||||||| CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCC-AATT TCAGAATTTC 426 TAAGTGTTTT GGGACGAAAC ACCCTCGACG GTCCGTCGTG CCCATGACGT TCCGTCATGC 986 |||||||||| | |||| || |||||| || || |||| || | | | | TAAGTGTTTT GAAACGAGAC -TCCTCGA-- ---CG--GT- -CCAT--CG- T--G-C-T-- 467 CCATGACGTT CCGTCGTGGG TTCCGTCGTC TCAGCCTGTT TTTCCAGAAA TAAAATCTGC 926 ||||||| | |||||||||| |||||||||| ||| |||||| |||||| ||| |||||||||| -CATGACGGT CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC 526 TGCTCAAAAC AACTAAACAG GTCGTTACAA AATATTTTTT ATAAATATTT TGACTTTTTA 866 | |||||||| ||||||||| |||||||| | ||| || | TACTCAAAAC GACTAAACAG GTCGTTAC-A TTTATGTTCT .......... .......... 565 TCTTATTAAT TTTTATATTT TTTTAATCTA GCTATTTAAT TTTTCTTAAT TATTATTATT 806 | | ||||| |||||||||| .......... .......... .......... .........T CCTACTTAAA TATTATTATT 586 ATTAT-TATT TTATAACA 789 ||| | | |||||||| ATTTTACGAT TTATAACA 604 hqPGS_C06HBa0153O03.1-7-_SGN-E550335+ (1580 1265,1160 886,826 789) ******************************************************************************** EST sequence 74 +strand 714 n (File: SGN-E390013+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACNAAG ATTTTCTCAA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA AAAA Predicted gene structure (within gDNA segment 2672 to 1): Exon 1 1580 1265 ( 316 n); cDNA 1 311 ( 311 n); score: 0.834 Intron 1 1264 1164 ( 101 n); Pd: 0.900 (s: 0.81), Pa: 0.000 (s: 0.52) Exon 2 1163 886 ( 278 n); cDNA 312 566 ( 255 n); score: 0.743 Intron 2 885 827 ( 59 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0) Exon 3 826 789 ( 38 n); cDNA 567 605 ( 39 n); score: 0.724 PPA cDNA 699 714 MATCH C06HBa0153O03.1-7- SGN-E390013+ 0.791 632 0.885 C PGS_C06HBa0153O03.1-7-_SGN-E390013+ (1580 1265,1163 886,826 789) Alignment (genomic DNA sequence = upper lines): ATCGATTC-C CTTCTCTCTC TTTCTCTTGT TCTTTCTATT TTCTTTATTC AAACCCTCTT 1522 |||||||| | |||||||||| ||||| || | ||||| || ||| |||||| |||||||| | ATCGATTCTC CTTCTCTCTC TTTCTGTT-C T-TTTCT-TT TTC-TTATTC AAACCCTCCT 56 TCTTTTACCC TAATTAGTAT ATAATTAAGA ATAAAAGATG ACAATAATAG CCCACTAATT 1462 |||||||||| ||||||| || |||||||||| |||||||||| ||||||| |||||||||| TCTTTTACCC TAATTAGCAT ATAATTAAGA ATAAAAGATG -GAATAATAA CCCACTAATT 115 AACTTAAGGT TACCTCTTTT ATTCCCCCAA GAAATT-GAG TTATTAATAT AGACCCACGA 1403 ||| ||||| |||||||||| | |||||| | |||| || ||||||| || | |||||| | TACTCAAGGT TACCTCTTTT A-ACCCCCAG GTAATTAGAC TTATTAACAT AAACCCACTA 174 AATATATAAT TATAGCAGGA ATAGTCCAAA ACGCCCCTTT AAAACTTAAC CAGAATTCCG 1343 | | |||||| || || |||| |||||||||| ||| ||| || ||||| | |||| |||| ACTTTATAAT TAAAGTAGGA ATAGTCCAAA ACGTCCC-TT AAAACGTGTA AAGAAATCCG 233 ACTTCAACTG GGATTACGCA ACCTGTGACG GCCCGTCGCG CCTGCGACGG TCCGTCATGC 1283 || |||| |||||||||| |||||||| | |||||||| | |||||||||| |||||| ||| ACCCAGACTG GGATTACGCA ACCTGTGATG GCCCGTCGTG CCTGCGACGG TCCGTCCTGC 293 AGGTTCGTC- AGAGATTCGA TTTCCTTAAG GAGTCTGTGA CGGCCCGTCG TACCTACGAC 1224 ||| ||||| || ||| AGG-TCGTCG CAAGGTTCA. .......... .......... .......... .......... 311 GGTCCGTCCT GCATTTCCGT CACGACGTTC AGAGAATCGT TCCCTGTACC AAATTCTCAA 1164 .......... .......... .......... .......... .......... .......... 311 GAGTTGGAGT GTTTTGAAAC GGTGGATCAC GACGGTTCAT CGTGCCTGTG ACGGTCCGTC 1104 ||| | | || | | | | |||||| | | | ||| ||| |||||||||| GAGACTCAAT -TTCCACCAA AGAGTCT-GT GACGGTCCGT CACGCCCGTG ACGGTCCGTC 369 CTGCAGGTCC GTCACAGAGT TCAGAGAGTC -AATTTCAGC ACCCAAATTT CAGAATTTCT 1045 ||| ||| || || ||| |||||||||| | ||| || |||| ||||| |||||||||| GTGCCATTCC GTTACGAAGT TCAGAGAGTC GATTTTTAGT ACCC-AATTT CAGAATTTCT 428 AAGTGTTTTG GGACGAAACA CCCTCGACGG TCCGTCGTGC CCATGACGTT CCGTCATGCC 985 |||||||||| |||| || |||||| || || |||| || | | | | AAGTGTTTTG AAACGAGAC- TCCTCGA--- --CG--GT-- CCAT--CG-T --G-C-T--- 468 CATGACGTTC CGTCGTGGGT TCCGTCGTCT CAGCCTGTTT TTCCAGAAAT AAAATCTGCT 925 ||||||| || |||||||||| |||||||||| || ||||||| ||||| |||| |||||||||| CATGACGGTC CGTCGTGGGT TCCGTCGTCT CAACCTGTTT TTCCAAAAAT AAAATCTGCT 528 GCTCAAAACA ACTAAACAGG TCGTTACAAA ATATTTTTTA TAAATATTTT GACTTTTTAT 865 |||||||| |||||||||| ||||||| | ||| || | ACTCAAAACG ACTAAACAGG TCGTTAC-AT TTATGTTCT. .......... .......... 566 CTTATTAATT TTTATATTTT TTTAATCTAG CTATTTAATT TTTCTTAATT ATTATTATTA 805 | | ||||| | |||||||||| .......... .......... .......... ........TC CTACTTAAAT ATTATTATTA 588 TTAT-TATTT TATAACA 789 || | || ||||||| TTTTACGATT TATAACA 605 hqPGS_C06HBa0153O03.1-7-_SGN-E390013+ (1580 1265,1163 886,826 789) ******************************************************************************** EST sequence 79 +strand 717 n (File: SGN-E550484+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCAA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA AAAAAAA Predicted gene structure (within gDNA segment 2672 to 1): Exon 1 1580 1265 ( 316 n); cDNA 1 311 ( 311 n); score: 0.834 Intron 1 1264 1164 ( 101 n); Pd: 0.900 (s: 0.81), Pa: 0.000 (s: 0.52) Exon 2 1163 886 ( 278 n); cDNA 312 566 ( 255 n); score: 0.743 Intron 2 885 827 ( 59 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0) Exon 3 826 789 ( 38 n); cDNA 567 605 ( 39 n); score: 0.724 PPA cDNA 699 717 MATCH C06HBa0153O03.1-7- SGN-E550484+ 0.791 632 0.881 C PGS_C06HBa0153O03.1-7-_SGN-E550484+ (1580 1265,1163 886,826 789) Alignment (genomic DNA sequence = upper lines): ATCGATTC-C CTTCTCTCTC TTTCTCTTGT TCTTTCTATT TTCTTTATTC AAACCCTCTT 1522 |||||||| | |||||||||| ||||| || | ||||| || ||| |||||| |||||||| | ATCGATTCTC CTTCTCTCTC TTTCTGTT-C T-TTTCT-TT TTC-TTATTC AAACCCTCCT 56 TCTTTTACCC TAATTAGTAT ATAATTAAGA ATAAAAGATG ACAATAATAG CCCACTAATT 1462 |||||||||| ||||||| || |||||||||| |||||||||| ||||||| |||||||||| TCTTTTACCC TAATTAGCAT ATAATTAAGA ATAAAAGATG -GAATAATAA CCCACTAATT 115 AACTTAAGGT TACCTCTTTT ATTCCCCCAA GAAATT-GAG TTATTAATAT AGACCCACGA 1403 ||| ||||| |||||||||| | |||||| | |||| || ||||||| || | |||||| | TACTCAAGGT TACCTCTTTT A-ACCCCCAG GTAATTAGAC TTATTAACAT AAACCCACTA 174 AATATATAAT TATAGCAGGA ATAGTCCAAA ACGCCCCTTT AAAACTTAAC CAGAATTCCG 1343 | | |||||| || || |||| |||||||||| ||| ||| || ||||| | |||| |||| ACTTTATAAT TAAAGTAGGA ATAGTCCAAA ACGTCCC-TT AAAACGTGTA AAGAAATCCG 233 ACTTCAACTG GGATTACGCA ACCTGTGACG GCCCGTCGCG CCTGCGACGG TCCGTCATGC 1283 || |||| |||||||||| |||||||| | |||||||| | |||||||||| |||||| ||| ACCCAGACTG GGATTACGCA ACCTGTGATG GCCCGTCGTG CCTGCGACGG TCCGTCCTGC 293 AGGTTCGTC- AGAGATTCGA TTTCCTTAAG GAGTCTGTGA CGGCCCGTCG TACCTACGAC 1224 ||| ||||| || ||| AGG-TCGTCG CAAGGTTCA. .......... .......... .......... .......... 311 GGTCCGTCCT GCATTTCCGT CACGACGTTC AGAGAATCGT TCCCTGTACC AAATTCTCAA 1164 .......... .......... .......... .......... .......... .......... 311 GAGTTGGAGT GTTTTGAAAC GGTGGATCAC GACGGTTCAT CGTGCCTGTG ACGGTCCGTC 1104 ||| | | || | | | | |||||| | | | ||| ||| |||||||||| GAGACTCAAT -TTCCACCAA AGAGTCT-GT GACGGTCCGT CACGCCCGTG ACGGTCCGTC 369 CTGCAGGTCC GTCACAGAGT TCAGAGAGTC -AATTTCAGC ACCCAAATTT CAGAATTTCT 1045 ||| ||| || || ||| |||||||||| | ||| || |||| ||||| |||||||||| GTGCCATTCC GTTACGAAGT TCAGAGAGTC GATTTTTAGT ACCC-AATTT CAGAATTTCT 428 AAGTGTTTTG GGACGAAACA CCCTCGACGG TCCGTCGTGC CCATGACGTT CCGTCATGCC 985 |||||||||| |||| || |||||| || || |||| || | | | | AAGTGTTTTG AAACGAGAC- TCCTCGA--- --CG--GT-- CCAT--CG-T --G-C-T--- 468 CATGACGTTC CGTCGTGGGT TCCGTCGTCT CAGCCTGTTT TTCCAGAAAT AAAATCTGCT 925 ||||||| || |||||||||| |||||||||| || ||||||| ||||| |||| |||||||||| CATGACGGTC CGTCGTGGGT TCCGTCGTCT CAACCTGTTT TTCCAAAAAT AAAATCTGCT 528 GCTCAAAACA ACTAAACAGG TCGTTACAAA ATATTTTTTA TAAATATTTT GACTTTTTAT 865 |||||||| |||||||||| ||||||| | ||| || | ACTCAAAACG ACTAAACAGG TCGTTAC-AT TTATGTTCT. .......... .......... 566 CTTATTAATT TTTATATTTT TTTAATCTAG CTATTTAATT TTTCTTAATT ATTATTATTA 805 | | ||||| | |||||||||| .......... .......... .......... ........TC CTACTTAAAT ATTATTATTA 588 TTAT-TATTT TATAACA 789 || | || ||||||| TTTTACGATT TATAACA 605 hqPGS_C06HBa0153O03.1-7-_SGN-E550484+ (1580 1265,1163 886,826 789) ******************************************************************************** EST sequence 80 +strand 713 n (File: SGN-E550211+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT 481 CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT CAAAACGACT 541 AAACAGGTCG TTACATTTAT GTTCTTCCTA CTTAAATATT ATTATTATTT TACGATTTAT 601 AACACTATTA GAAACAAAGA TTTTCTCAAC CATGAATTAA TGAAAAAATT ATGCCATAAA 661 ATATAAAAAA TTTACTCATT TTTCATTGAG CTAATTCATA AAAAAAAAAA AAA Predicted gene structure (within gDNA segment 2662 to 1): Exon 1 1580 1265 ( 316 n); cDNA 1 310 ( 310 n); score: 0.831 Intron 1 1264 1164 ( 101 n); Pd: 0.900 (s: 0.81), Pa: 0.000 (s: 0.52) Exon 2 1163 886 ( 278 n); cDNA 311 565 ( 255 n); score: 0.743 Intron 2 885 827 ( 59 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0) Exon 3 826 789 ( 38 n); cDNA 566 604 ( 39 n); score: 0.724 PPA cDNA 698 713 MATCH C06HBa0153O03.1-7- SGN-E550211+ 0.790 632 0.886 C PGS_C06HBa0153O03.1-7-_SGN-E550211+ (1580 1265,1163 886,826 789) Alignment (genomic DNA sequence = upper lines): ATCGATTC-C CTTCTCTCTC TTTCTCTTGT TCTTTCTATT TTCTTTATTC AAACCCTCTT 1522 |||||||| | |||||||||| ||||| || | ||||| || ||| |||||| ||||||| | ATCGATTCTC CTTCTCTCTC TTTCTGTT-C T-TTTCT-TT TTC-TTATTC -AACCCTCCT 55 TCTTTTACCC TAATTAGTAT ATAATTAAGA ATAAAAGATG ACAATAATAG CCCACTAATT 1462 |||||||||| ||||||| || |||||||||| |||||||||| ||||||| |||||||||| TCTTTTACCC TAATTAGCAT ATAATTAAGA ATAAAAGATG -GAATAATAA CCCACTAATT 114 AACTTAAGGT TACCTCTTTT ATTCCCCCAA GAAATT-GAG TTATTAATAT AGACCCACGA 1403 ||| ||||| |||||||||| | |||||| | |||| || ||||||| || | |||||| | TACTCAAGGT TACCTCTTTT A-ACCCCCAG GTAATTAGAC TTATTAACAT AAACCCACTA 173 AATATATAAT TATAGCAGGA ATAGTCCAAA ACGCCCCTTT AAAACTTAAC CAGAATTCCG 1343 | | |||||| || || |||| |||||||||| ||| ||| || ||||| | |||| |||| ACTTTATAAT TAAAGTAGGA ATAGTCCAAA ACGTCCC-TT AAAACGTGTA AAGAAATCCG 232 ACTTCAACTG GGATTACGCA ACCTGTGACG GCCCGTCGCG CCTGCGACGG TCCGTCATGC 1283 || |||| |||||||||| |||||||| | |||||||| | |||||||||| |||||| ||| ACCCAGACTG GGATTACGCA ACCTGTGATG GCCCGTCGTG CCTGCGACGG TCCGTCCTGC 292 AGGTTCGTC- AGAGATTCGA TTTCCTTAAG GAGTCTGTGA CGGCCCGTCG TACCTACGAC 1224 ||| ||||| || ||| AGG-TCGTCG CAAGGTTCA. .......... .......... .......... .......... 310 GGTCCGTCCT GCATTTCCGT CACGACGTTC AGAGAATCGT TCCCTGTACC AAATTCTCAA 1164 .......... .......... .......... .......... .......... .......... 310 GAGTTGGAGT GTTTTGAAAC GGTGGATCAC GACGGTTCAT CGTGCCTGTG ACGGTCCGTC 1104 ||| | | || | | | | |||||| | | | ||| ||| |||||||||| GAGACTCAAT -TTCCACCAA AGAGTCT-GT GACGGTCCGT CACGCCCGTG ACGGTCCGTC 368 CTGCAGGTCC GTCACAGAGT TCAGAGAGTC -AATTTCAGC ACCCAAATTT CAGAATTTCT 1045 ||| ||| || || ||| |||||||||| | ||| || |||| ||||| |||||||||| GTGCCATTCC GTTACGAAGT TCAGAGAGTC GATTTTTAGT ACCC-AATTT CAGAATTTCT 427 AAGTGTTTTG GGACGAAACA CCCTCGACGG TCCGTCGTGC CCATGACGTT CCGTCATGCC 985 |||||||||| |||| || |||||| || || |||| || | | | | AAGTGTTTTG AAACGAGAC- TCCTCGA--- --CG--GT-- CCAT--CG-T --G-C-T--- 467 CATGACGTTC CGTCGTGGGT TCCGTCGTCT CAGCCTGTTT TTCCAGAAAT AAAATCTGCT 925 ||||||| || |||||||||| |||||||||| || ||||||| ||||| |||| |||||||||| CATGACGGTC CGTCGTGGGT TCCGTCGTCT CAACCTGTTT TTCCAAAAAT AAAATCTGCT 527 GCTCAAAACA ACTAAACAGG TCGTTACAAA ATATTTTTTA TAAATATTTT GACTTTTTAT 865 |||||||| |||||||||| ||||||| | ||| || | ACTCAAAACG ACTAAACAGG TCGTTAC-AT TTATGTTCT. .......... .......... 565 CTTATTAATT TTTATATTTT TTTAATCTAG CTATTTAATT TTTCTTAATT ATTATTATTA 805 | | ||||| | |||||||||| .......... .......... .......... ........TC CTACTTAAAT ATTATTATTA 587 TTAT-TATTT TATAACA 789 || | || ||||||| TTTTACGATT TATAACA 604 hqPGS_C06HBa0153O03.1-7-_SGN-E550211+ (1580 1265,1163 886,826 789) ******************************************************************************** EST sequence 84 +strand 713 n (File: SGN-E549941+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA TATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCGA CCNATGATTA ATGAAAAATT ATGCCATCAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA AAA Predicted gene structure (within gDNA segment 2672 to 1): Exon 1 1580 1265 ( 316 n); cDNA 1 311 ( 311 n); score: 0.834 Intron 1 1264 1164 ( 101 n); Pd: 0.900 (s: 0.81), Pa: 0.000 (s: 0.52) Exon 2 1163 886 ( 278 n); cDNA 312 566 ( 255 n); score: 0.739 Intron 2 885 827 ( 59 n); Pd: 0.000 (s: 0.84), Pa: 0.000 (s: 0) Exon 3 826 789 ( 38 n); cDNA 567 605 ( 39 n); score: 0.724 PPA cDNA 699 713 MATCH C06HBa0153O03.1-7- SGN-E549941+ 0.790 632 0.886 C PGS_C06HBa0153O03.1-7-_SGN-E549941+ (1580 1265,1163 886,826 789) Alignment (genomic DNA sequence = upper lines): ATCGATTC-C CTTCTCTCTC TTTCTCTTGT TCTTTCTATT TTCTTTATTC AAACCCTCTT 1522 |||||||| | |||||||||| ||||| || | ||||| || ||| |||||| |||||||| | ATCGATTCTC CTTCTCTCTC TTTCTGTT-C T-TTTCT-TT TTC-TTATTC AAACCCTCCT 56 TCTTTTACCC TAATTAGTAT ATAATTAAGA ATAAAAGATG ACAATAATAG CCCACTAATT 1462 |||||||||| ||||||| || |||||||||| |||||||||| ||||||| |||||||||| TCTTTTACCC TAATTAGCAT ATAATTAAGA ATAAAAGATG -GAATAATAA CCCACTAATT 115 AACTTAAGGT TACCTCTTTT ATTCCCCCAA GAAATT-GAG TTATTAATAT AGACCCACGA 1403 ||| ||||| |||||||||| | |||||| | |||| || ||||||| || | |||||| | TACTCAAGGT TACCTCTTTT A-ACCCCCAG GTAATTAGAC TTATTAACAT AAACCCACTA 174 AATATATAAT TATAGCAGGA ATAGTCCAAA ACGCCCCTTT AAAACTTAAC CAGAATTCCG 1343 | | |||||| || || |||| |||||||||| ||| ||| || ||||| | |||| |||| ACTTTATAAT TAAAGTAGGA ATAGTCCAAA ACGTCCC-TT AAAACGTGTA AAGAAATCCG 233 ACTTCAACTG GGATTACGCA ACCTGTGACG GCCCGTCGCG CCTGCGACGG TCCGTCATGC 1283 || |||| |||||||||| |||||||| | |||||||| | |||||||||| |||||| ||| ACCCAGACTG GGATTACGCA ACCTGTGATG GCCCGTCGTG CCTGCGACGG TCCGTCCTGC 293 AGGTTCGTC- AGAGATTCGA TTTCCTTAAG GAGTCTGTGA CGGCCCGTCG TACCTACGAC 1224 ||| ||||| || ||| AGG-TCGTCG CAAGGTTCA. .......... .......... .......... .......... 311 GGTCCGTCCT GCATTTCCGT CACGACGTTC AGAGAATCGT TCCCTGTACC AAATTCTCAA 1164 .......... .......... .......... .......... .......... .......... 311 GAGTTGGAGT GTTTTGAAAC GGTGGATCAC GACGGTTCAT CGTGCCTGTG ACGGTCCGTC 1104 ||| | | || | | | | |||||| | | | ||| ||| |||||||||| GAGACTCAAT -TTCCACCAA AGAGTCT-GT GACGGTCCGT CACGCCCGTG ACGGTCCGTC 369 CTGCAGGTCC GTCACAGAGT TCAGAGAGTC -AATTTCAGC ACCCAAATTT CAGAATTTCT 1045 ||| ||| || || ||| |||||||||| | ||| || |||| ||||| |||||||||| GTGCCATTCC GTTACGAAGT TCAGAGAGTC GATTTTTAGT ACCC-AATTT CAGAATTTCT 428 AAGTGTTTTG GGACGAAACA CCCTCGACGG TCCGTCGTGC CCATGACGTT CCGTCATGCC 985 |||||||||| |||| || |||||| || || |||| || | | | | AAGTGTTTTG AAACGAGAC- TCCTCGA--- --CG--GT-- CCAT--CG-T --G-C-T--- 468 CATGACGTTC CGTCGTGGGT TCCGTCGTCT CAGCCTGTTT TTCCAGAAAT AAAATCTGCT 925 ||||||| || |||||||||| |||||||||| || ||||||| ||||| |||| || ||||||| CATGACGGTC CGTCGTGGGT TCCGTCGTCT CAACCTGTTT TTCCAAAAAT AATATCTGCT 528 GCTCAAAACA ACTAAACAGG TCGTTACAAA ATATTTTTTA TAAATATTTT GACTTTTTAT 865 |||||||| |||||||||| ||||||| | ||| || | ACTCAAAACG ACTAAACAGG TCGTTAC-AT TTATGTTCT. .......... .......... 566 CTTATTAATT TTTATATTTT TTTAATCTAG CTATTTAATT TTTCTTAATT ATTATTATTA 805 | | ||||| | |||||||||| .......... .......... .......... ........TC CTACTTAAAT ATTATTATTA 588 TTAT-TATTT TATAACA 789 || | || ||||||| TTTTACGATT TATAACA 605 hqPGS_C06HBa0153O03.1-7-_SGN-E549941+ (1580 1265,1163 886,826 789) ******************************************************************************** EST sequence 85 +strand 714 n (File: SGN-E550025+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCAA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA AAAA Predicted gene structure (within gDNA segment 2672 to 1): Exon 1 1580 1265 ( 316 n); cDNA 1 311 ( 311 n); score: 0.834 Intron 1 1264 1164 ( 101 n); Pd: 0.900 (s: 0.81), Pa: 0.000 (s: 0.52) Exon 2 1163 886 ( 278 n); cDNA 312 566 ( 255 n); score: 0.743 Intron 2 885 827 ( 59 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0) Exon 3 826 789 ( 38 n); cDNA 567 605 ( 39 n); score: 0.724 PPA cDNA 699 714 MATCH C06HBa0153O03.1-7- SGN-E550025+ 0.791 632 0.885 C PGS_C06HBa0153O03.1-7-_SGN-E550025+ (1580 1265,1163 886,826 789) Alignment (genomic DNA sequence = upper lines): ATCGATTC-C CTTCTCTCTC TTTCTCTTGT TCTTTCTATT TTCTTTATTC AAACCCTCTT 1522 |||||||| | |||||||||| ||||| || | ||||| || ||| |||||| |||||||| | ATCGATTCTC CTTCTCTCTC TTTCTGTT-C T-TTTCT-TT TTC-TTATTC AAACCCTCCT 56 TCTTTTACCC TAATTAGTAT ATAATTAAGA ATAAAAGATG ACAATAATAG CCCACTAATT 1462 |||||||||| ||||||| || |||||||||| |||||||||| ||||||| |||||||||| TCTTTTACCC TAATTAGCAT ATAATTAAGA ATAAAAGATG -GAATAATAA CCCACTAATT 115 AACTTAAGGT TACCTCTTTT ATTCCCCCAA GAAATT-GAG TTATTAATAT AGACCCACGA 1403 ||| ||||| |||||||||| | |||||| | |||| || ||||||| || | |||||| | TACTCAAGGT TACCTCTTTT A-ACCCCCAG GTAATTAGAC TTATTAACAT AAACCCACTA 174 AATATATAAT TATAGCAGGA ATAGTCCAAA ACGCCCCTTT AAAACTTAAC CAGAATTCCG 1343 | | |||||| || || |||| |||||||||| ||| ||| || ||||| | |||| |||| ACTTTATAAT TAAAGTAGGA ATAGTCCAAA ACGTCCC-TT AAAACGTGTA AAGAAATCCG 233 ACTTCAACTG GGATTACGCA ACCTGTGACG GCCCGTCGCG CCTGCGACGG TCCGTCATGC 1283 || |||| |||||||||| |||||||| | |||||||| | |||||||||| |||||| ||| ACCCAGACTG GGATTACGCA ACCTGTGATG GCCCGTCGTG CCTGCGACGG TCCGTCCTGC 293 AGGTTCGTC- AGAGATTCGA TTTCCTTAAG GAGTCTGTGA CGGCCCGTCG TACCTACGAC 1224 ||| ||||| || ||| AGG-TCGTCG CAAGGTTCA. .......... .......... .......... .......... 311 GGTCCGTCCT GCATTTCCGT CACGACGTTC AGAGAATCGT TCCCTGTACC AAATTCTCAA 1164 .......... .......... .......... .......... .......... .......... 311 GAGTTGGAGT GTTTTGAAAC GGTGGATCAC GACGGTTCAT CGTGCCTGTG ACGGTCCGTC 1104 ||| | | || | | | | |||||| | | | ||| ||| |||||||||| GAGACTCAAT -TTCCACCAA AGAGTCT-GT GACGGTCCGT CACGCCCGTG ACGGTCCGTC 369 CTGCAGGTCC GTCACAGAGT TCAGAGAGTC -AATTTCAGC ACCCAAATTT CAGAATTTCT 1045 ||| ||| || || ||| |||||||||| | ||| || |||| ||||| |||||||||| GTGCCATTCC GTTACGAAGT TCAGAGAGTC GATTTTTAGT ACCC-AATTT CAGAATTTCT 428 AAGTGTTTTG GGACGAAACA CCCTCGACGG TCCGTCGTGC CCATGACGTT CCGTCATGCC 985 |||||||||| |||| || |||||| || || |||| || | | | | AAGTGTTTTG AAACGAGAC- TCCTCGA--- --CG--GT-- CCAT--CG-T --G-C-T--- 468 CATGACGTTC CGTCGTGGGT TCCGTCGTCT CAGCCTGTTT TTCCAGAAAT AAAATCTGCT 925 ||||||| || |||||||||| |||||||||| || ||||||| ||||| |||| |||||||||| CATGACGGTC CGTCGTGGGT TCCGTCGTCT CAACCTGTTT TTCCAAAAAT AAAATCTGCT 528 GCTCAAAACA ACTAAACAGG TCGTTACAAA ATATTTTTTA TAAATATTTT GACTTTTTAT 865 |||||||| |||||||||| ||||||| | ||| || | ACTCAAAACG ACTAAACAGG TCGTTAC-AT TTATGTTCT. .......... .......... 566 CTTATTAATT TTTATATTTT TTTAATCTAG CTATTTAATT TTTCTTAATT ATTATTATTA 805 | | ||||| | |||||||||| .......... .......... .......... ........TC CTACTTAAAT ATTATTATTA 588 TTAT-TATTT TATAACA 789 || | || ||||||| TTTTACGATT TATAACA 605 hqPGS_C06HBa0153O03.1-7-_SGN-E550025+ (1580 1265,1163 886,826 789) ******************************************************************************** EST sequence 110 +strand 711 n (File: SGN-E396056+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAA ATTTTCTCAC CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATAATAAAA ATTTACTCAT TTTTTCTTTG AGCTAATTCA TAAAAAAAAA A Predicted gene structure (within gDNA segment 2672 to 1): Exon 1 1580 1265 ( 316 n); cDNA 1 311 ( 311 n); score: 0.834 Intron 1 1264 1164 ( 101 n); Pd: 0.900 (s: 0.81), Pa: 0.000 (s: 0.52) Exon 2 1163 886 ( 278 n); cDNA 312 566 ( 255 n); score: 0.743 Intron 2 885 827 ( 59 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0) Exon 3 826 789 ( 38 n); cDNA 567 605 ( 39 n); score: 0.724 PPA cDNA 700 711 MATCH C06HBa0153O03.1-7- SGN-E396056+ 0.791 632 0.889 C PGS_C06HBa0153O03.1-7-_SGN-E396056+ (1580 1265,1163 886,826 789) Alignment (genomic DNA sequence = upper lines): ATCGATTC-C CTTCTCTCTC TTTCTCTTGT TCTTTCTATT TTCTTTATTC AAACCCTCTT 1522 |||||||| | |||||||||| ||||| || | ||||| || ||| |||||| |||||||| | ATCGATTCTC CTTCTCTCTC TTTCTGTT-C T-TTTCT-TT TTC-TTATTC AAACCCTCCT 56 TCTTTTACCC TAATTAGTAT ATAATTAAGA ATAAAAGATG ACAATAATAG CCCACTAATT 1462 |||||||||| ||||||| || |||||||||| |||||||||| ||||||| |||||||||| TCTTTTACCC TAATTAGCAT ATAATTAAGA ATAAAAGATG -GAATAATAA CCCACTAATT 115 AACTTAAGGT TACCTCTTTT ATTCCCCCAA GAAATT-GAG TTATTAATAT AGACCCACGA 1403 ||| ||||| |||||||||| | |||||| | |||| || ||||||| || | |||||| | TACTCAAGGT TACCTCTTTT A-ACCCCCAG GTAATTAGAC TTATTAACAT AAACCCACTA 174 AATATATAAT TATAGCAGGA ATAGTCCAAA ACGCCCCTTT AAAACTTAAC CAGAATTCCG 1343 | | |||||| || || |||| |||||||||| ||| ||| || ||||| | |||| |||| ACTTTATAAT TAAAGTAGGA ATAGTCCAAA ACGTCCC-TT AAAACGTGTA AAGAAATCCG 233 ACTTCAACTG GGATTACGCA ACCTGTGACG GCCCGTCGCG CCTGCGACGG TCCGTCATGC 1283 || |||| |||||||||| |||||||| | |||||||| | |||||||||| |||||| ||| ACCCAGACTG GGATTACGCA ACCTGTGATG GCCCGTCGTG CCTGCGACGG TCCGTCCTGC 293 AGGTTCGTC- AGAGATTCGA TTTCCTTAAG GAGTCTGTGA CGGCCCGTCG TACCTACGAC 1224 ||| ||||| || ||| AGG-TCGTCG CAAGGTTCA. .......... .......... .......... .......... 311 GGTCCGTCCT GCATTTCCGT CACGACGTTC AGAGAATCGT TCCCTGTACC AAATTCTCAA 1164 .......... .......... .......... .......... .......... .......... 311 GAGTTGGAGT GTTTTGAAAC GGTGGATCAC GACGGTTCAT CGTGCCTGTG ACGGTCCGTC 1104 ||| | | || | | | | |||||| | | | ||| ||| |||||||||| GAGACTCAAT -TTCCACCAA AGAGTCT-GT GACGGTCCGT CACGCCCGTG ACGGTCCGTC 369 CTGCAGGTCC GTCACAGAGT TCAGAGAGTC -AATTTCAGC ACCCAAATTT CAGAATTTCT 1045 ||| ||| || || ||| |||||||||| | ||| || |||| ||||| |||||||||| GTGCCATTCC GTTACGAAGT TCAGAGAGTC GATTTTTAGT ACCC-AATTT CAGAATTTCT 428 AAGTGTTTTG GGACGAAACA CCCTCGACGG TCCGTCGTGC CCATGACGTT CCGTCATGCC 985 |||||||||| |||| || |||||| || || |||| || | | | | AAGTGTTTTG AAACGAGAC- TCCTCGA--- --CG--GT-- CCAT--CG-T --G-C-T--- 468 CATGACGTTC CGTCGTGGGT TCCGTCGTCT CAGCCTGTTT TTCCAGAAAT AAAATCTGCT 925 ||||||| || |||||||||| |||||||||| || ||||||| ||||| |||| |||||||||| CATGACGGTC CGTCGTGGGT TCCGTCGTCT CAACCTGTTT TTCCAAAAAT AAAATCTGCT 528 GCTCAAAACA ACTAAACAGG TCGTTACAAA ATATTTTTTA TAAATATTTT GACTTTTTAT 865 |||||||| |||||||||| ||||||| | ||| || | ACTCAAAACG ACTAAACAGG TCGTTAC-AT TTATGTTCT. .......... .......... 566 CTTATTAATT TTTATATTTT TTTAATCTAG CTATTTAATT TTTCTTAATT ATTATTATTA 805 | | ||||| | |||||||||| .......... .......... .......... ........TC CTACTTAAAT ATTATTATTA 588 TTAT-TATTT TATAACA 789 || | || ||||||| TTTTACGATT TATAACA 605 hqPGS_C06HBa0153O03.1-7-_SGN-E396056+ (1580 1265,1163 886,826 789) ******************************************************************************** EST sequence 119 +strand 690 n (File: SGN-E377133+) 1 TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT 481 CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT CAAAACGACT 541 AAACAGGTCG TTACATTTAT GTTCTTCCTA CTTAAATATT ATTATTATTT TACGATTTAT 601 AACACTATTA GAAACAAAGA TTTTCTCAAC CATGAATTAA TGAAAAAATT ATGCCATAAA 661 ATATAAAAAA TTTACTCATT TTTCATTGAG Predicted gene structure (within gDNA segment 2662 to 1): Exon 1 1579 1265 ( 315 n); cDNA 1 310 ( 310 n); score: 0.833 Intron 1 1264 1164 ( 101 n); Pd: 0.900 (s: 0.81), Pa: 0.000 (s: 0.52) Exon 2 1163 886 ( 278 n); cDNA 311 565 ( 255 n); score: 0.743 Intron 2 885 827 ( 59 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0) Exon 3 826 789 ( 38 n); cDNA 566 604 ( 39 n); score: 0.724 MATCH C06HBa0153O03.1-7- SGN-E377133+ 0.791 631 0.914 C PGS_C06HBa0153O03.1-7-_SGN-E377133+ (1579 1265,1163 886,826 789) Alignment (genomic DNA sequence = upper lines): TCGATTC-CC TTCTCTCTCT TTCTCTTGTT CTTTCTATTT TCTTTATTCA AACCCTCTTT 1521 ||||||| || |||||||||| |||| || | ||||| ||| || ||||||| ||||||| || TCGATTCTCC TTCTCTCTCT TTCTGTT-CT -TTTCT-TTT TC-TTATTCA AACCCTCCTT 56 CTTTTACCCT AATTAGTATA TAATTAAGAA TAAAAGATGA CAATAATAGC CCACTAATTA 1461 |||||||||| |||||| ||| |||||||||| ||||||||| ||||||| | ||||||||| CTTTTACCCT AATTAGCATA TAATTAAGAA TAAAAGATG- GAATAATAAC CCACTAATTT 115 ACTTAAGGTT ACCTCTTTTA TTCCCCCAAG AAATT-GAGT TATTAATATA GACCCACGAA 1402 ||| |||||| |||||||||| |||||| | |||| || | |||||| ||| |||||| || ACTCAAGGTT ACCTCTTTTA -ACCCCCAGG TAATTAGACT TATTAACATA AACCCACTAA 174 ATATATAATT ATAGCAGGAA TAGTCCAAAA CGCCCCTTTA AAACTTAACC AGAATTCCGA 1342 | ||||||| | || ||||| |||||||||| || ||| ||| |||| | |||| ||||| CTTTATAATT AAAGTAGGAA TAGTCCAAAA CGTCCC-TTA AAACGTGTAA AGAAATCCGA 233 CTTCAACTGG GATTACGCAA CCTGTGACGG CCCGTCGCGC CTGCGACGGT CCGTCATGCA 1282 | ||||| |||||||||| ||||||| || ||||||| || |||||||||| ||||| |||| CCCAGACTGG GATTACGCAA CCTGTGATGG CCCGTCGTGC CTGCGACGGT CCGTCCTGCA 293 GGTTCGTC-A GAGATTCGAT TTCCTTAAGG AGTCTGTGAC GGCCCGTCGT ACCTACGACG 1223 || ||||| || ||| GG-TCGTCGC AAGGTTCA.. .......... .......... .......... .......... 310 GTCCGTCCTG CATTTCCGTC ACGACGTTCA GAGAATCGTT CCCTGTACCA AATTCTCAAG 1163 | .......... .......... .......... .......... .......... .........G 311 AGTTGGAGTG TTTTGAAACG GTGGATCACG ACGGTTCATC GTGCCTGTGA CGGTCCGTCC 1103 || | | || | | | | | ||||| | || ||| |||| ||||||||| AGACTCAAT- TTCCACCAAA GAGTCT-GTG ACGGTCCGTC ACGCCCGTGA CGGTCCGTCG 369 TGCAGGTCCG TCACAGAGTT CAGAGAGTC- AATTTCAGCA CCCAAATTTC AGAATTTCTA 1044 ||| |||| | || |||| ||||||||| | ||| || | ||| |||||| |||||||||| TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCC-AATTTC AGAATTTCTA 428 AGTGTTTTGG GACGAAACAC CCTCGACGGT CCGTCGTGCC CATGACGTTC CGTCATGCCC 984 ||||||||| |||| || |||||| || || | ||| || | | | | | AGTGTTTTGA AACGAGAC-T CCTCGA---- -CG--GT--C CAT--CG-T- -G-C-T---C 468 ATGACGTTCC GTCGTGGGTT CCGTCGTCTC AGCCTGTTTT TCCAGAAATA AAATCTGCTG 924 |||||| ||| |||||||||| |||||||||| | |||||||| |||| ||||| ||||||||| ATGACGGTCC GTCGTGGGTT CCGTCGTCTC AACCTGTTTT TCCAAAAATA AAATCTGCTA 528 CTCAAAACAA CTAAACAGGT CGTTACAAAA TATTTTTTAT AAATATTTTG ACTTTTTATC 864 |||||||| | |||||||||| |||||| | ||| || | CTCAAAACGA CTAAACAGGT CGTTAC-ATT TATGTTCT.. .......... .......... 565 TTATTAATTT TTATATTTTT TTAATCTAGC TATTTAATTT TTCTTAATTA TTATTATTAT 804 | | ||||| || |||||||||| .......... .......... .......... .......TCC TACTTAAATA TTATTATTAT 588 TAT-TATTTT ATAACA 789 | | ||| |||||| TTTACGATTT ATAACA 604 hqPGS_C06HBa0153O03.1-7-_SGN-E377133+ (1579 1265,1163 886,826 789) ******************************************************************************** EST sequence 24 -strand 658 n (File: SGN-E377132-) 1 TTCCTTCTTT TACCCTAATT AGCATATATT TAAGAATAAA AGATGGAATA ATAACCCACT 61 AATTTACTCA AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA 121 CTAACTTTAT AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC 181 CGACCCAGAC TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT 241 GCAGGTCGTC GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA 301 CGCCCGTGAC GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC 361 CCAATTTCAG AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT 421 GACGGTCCGT CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT 481 CAAAACGACT AAACAGGTCG TTACATTTAT GTTCTTCCTA CTTAAATATT ATTATTATTT 541 TACGATTTAT AACACTATTA GAAACAAAGA TTTTCTCAAC CATGAATTAA TGAAAAAATT 601 ATGCCATAAA ATATAAAAAA TTTACTCATT TTTCATTGAG CTAATTCATA AAAAAAAA Predicted gene structure (within gDNA segment 2172 to 1): Exon 1 1525 1265 ( 261 n); cDNA 2 260 ( 259 n); score: 0.828 Intron 1 1264 1164 ( 101 n); Pd: 0.900 (s: 0.81), Pa: 0.000 (s: 0.52) Exon 2 1163 886 ( 278 n); cDNA 261 515 ( 255 n); score: 0.743 Intron 2 885 827 ( 59 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0) Exon 3 826 789 ( 38 n); cDNA 516 554 ( 39 n); score: 0.724 PPA cDNA 648 658 MATCH C06HBa0153O03.1-7- SGN-E377132- 0.784 577 0.877 C PGS_C06HBa0153O03.1-7-_SGN-E377132- (1525 1265,1163 886,826 789) Alignment (genomic DNA sequence = upper lines): TCTTTCTTTT ACCCTAATTA GTATATAATT AAGAATAAAA GATGACAATA ATAGCCCACT 1466 || ||||||| |||||||||| | ||||| || |||||||||| |||| |||| ||| |||||| TCCTTCTTTT ACCCTAATTA GCATATATTT AAGAATAAAA GATG-GAATA ATAACCCACT 60 AATTAACTTA AGGTTACCTC TTTTATTCCC CCAAGAAATT -GAGTTATTA ATATAGACCC 1407 |||| ||| | |||||||||| ||||| ||| ||| | |||| || |||||| | ||| |||| AATTTACTCA AGGTTACCTC TTTTA-ACCC CCAGGTAATT AGACTTATTA ACATAAACCC 119 ACGAAATATA TAATTATAGC AGGAATAGTC CAAAACGCCC CTTTAAAACT TAACCAGAAT 1347 || || | || |||||| || |||||||||| ||||||| || | ||||||| | |||| ACTAACTTTA TAATTAAAGT AGGAATAGTC CAAAACGTCC C-TTAAAACG TGTAAAGAAA 178 TCCGACTTCA ACTGGGATTA CGCAACCTGT GACGGCCCGT CGCGCCTGCG ACGGTCCGTC 1287 |||||| |||||||||| |||||||||| || ||||||| || ||||||| |||||||||| TCCGACCCAG ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC 238 ATGCAGGTTC GTC-AGAGAT TCGATTTCCT TAAGGAGTCT GTGACGGCCC GTCGTACCTA 1228 |||||| || ||| || | || CTGCAGG-TC GTCGCAAGGT TCA....... .......... .......... .......... 260 CGACGGTCCG TCCTGCATTT CCGTCACGAC GTTCAGAGAA TCGTTCCCTG TACCAAATTC 1168 .......... .......... .......... .......... .......... .......... 260 TCAAGAGTTG GAGTGTTTTG AAACGGTGGA TCACGACGGT TCATCGTGCC TGTGACGGTC 1108 ||| | | || | | | | |||||| | || ||| ||||||||| ....GAGACT CAAT-TTCCA CCAAAGAGTC T-GTGACGGT CCGTCACGCC CGTGACGGTC 314 CGTCCTGCAG GTCCGTCACA GAGTTCAGAG AGTC-AATTT CAGCACCCAA ATTTCAGAAT 1049 |||| ||| ||||| || ||||||||| |||| | ||| || |||| | |||||||||| CGTCGTGCCA TTCCGTTACG AAGTTCAGAG AGTCGATTTT TAGTACCC-A ATTTCAGAAT 373 TTCTAAGTGT TTTGGGACGA AACACCCTCG ACGGTCCGTC GTGCCCATGA CGTTCCGTCA 989 |||||||||| |||| |||| || ||||| | || || |||| || | | | TTCTAAGTGT TTTGAAACGA GAC-TCCTCG A-----CG-- GT--CCAT-- CG-T--G-C- 416 TGCCCATGAC GTTCCGTCGT GGGTTCCGTC GTCTCAGCCT GTTTTTCCAG AAATAAAATC 929 | |||||| | |||||||| |||||||||| |||||| ||| ||||||||| |||||||||| T---CATGAC GGTCCGTCGT GGGTTCCGTC GTCTCAACCT GTTTTTCCAA AAATAAAATC 473 TGCTGCTCAA AACAACTAAA CAGGTCGTTA CAAAATATTT TTTATAAATA TTTTGACTTT 869 |||| ||||| ||| |||||| |||||||||| | | ||| | | | TGCTACTCAA AACGACTAAA CAGGTCGTTA C-ATTTATGT TCT....... .......... 515 TTATCTTATT AATTTTTATA TTTTTTTAAT CTAGCTATTT AATTTTTCTT AATTATTATT 809 | | ||| || ||||||| .......... .......... .......... .......... ..TCCTACTT AAATATTATT 533 ATTATTAT-T ATTTTATAAC A 789 |||||| | |||||||| | ATTATTTTAC GATTTATAAC A 554 hqPGS_C06HBa0153O03.1-7-_SGN-E377132- (1525 1265,1163 886,826 789) ******************************************************************************** EST sequence 4 -strand 679 n (File: SGN-E550127-) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTCCT TTTCTTTTTC TTATCAAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTGCACAA CCATGAATTA ATGAAAAAAT TATGACATAA 661 AATATAAAAA ATTACTCAT Predicted gene structure (within gDNA segment 2682 to 1): Exon 1 1580 1265 ( 316 n); cDNA 1 311 ( 311 n); score: 0.824 Intron 1 1264 1164 ( 101 n); Pd: 0.900 (s: 0.81), Pa: 0.000 (s: 0.52) Exon 2 1163 886 ( 278 n); cDNA 312 566 ( 255 n); score: 0.743 Intron 2 885 827 ( 59 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0) Exon 3 826 803 ( 24 n); cDNA 567 590 ( 24 n); score: 0.833 MATCH C06HBa0153O03.1-7- SGN-E550127- 0.786 618 0.910 C PGS_C06HBa0153O03.1-7-_SGN-E550127- (1580 1265,1163 886,826 803) Alignment (genomic DNA sequence = upper lines): ATCGATTC-C CTTCTCTCTC TTTCTCTTGT TCTTTCTATT TTCTTTATTC AAACCCTCTT 1522 |||||||| | |||||||||| ||||| | | ||||| || ||| |||| |||||||| | ATCGATTCTC CTTCTCTCTC TTTCT-GTCC T-TTTCT-TT TTC-TTATCA AAACCCTCCT 56 TCTTTTACCC TAATTAGTAT ATAATTAAGA ATAAAAGATG ACAATAATAG CCCACTAATT 1462 |||||||||| ||||||| || |||||||||| |||||||||| ||||||| |||||||||| TCTTTTACCC TAATTAGCAT ATAATTAAGA ATAAAAGATG -GAATAATAA CCCACTAATT 115 AACTTAAGGT TACCTCTTTT ATTCCCCCAA GAAATT-GAG TTATTAATAT AGACCCACGA 1403 ||| ||||| |||||||||| | |||||| | |||| || ||||||| || | |||||| | TACTCAAGGT TACCTCTTTT A-ACCCCCAG GTAATTAGAC TTATTAACAT AAACCCACTA 174 AATATATAAT TATAGCAGGA ATAGTCCAAA ACGCCCCTTT AAAACTTAAC CAGAATTCCG 1343 | | |||||| || || |||| |||||||||| ||| ||| || ||||| | |||| |||| ACTTTATAAT TAAAGTAGGA ATAGTCCAAA ACGTCCC-TT AAAACGTGTA AAGAAATCCG 233 ACTTCAACTG GGATTACGCA ACCTGTGACG GCCCGTCGCG CCTGCGACGG TCCGTCATGC 1283 || |||| |||||||||| |||||||| | |||||||| | |||||||||| |||||| ||| ACCCAGACTG GGATTACGCA ACCTGTGATG GCCCGTCGTG CCTGCGACGG TCCGTCCTGC 293 AGGTTCGTC- AGAGATTCGA TTTCCTTAAG GAGTCTGTGA CGGCCCGTCG TACCTACGAC 1224 ||| ||||| || ||| AGG-TCGTCG CAAGGTTCA. .......... .......... .......... .......... 311 GGTCCGTCCT GCATTTCCGT CACGACGTTC AGAGAATCGT TCCCTGTACC AAATTCTCAA 1164 .......... .......... .......... .......... .......... .......... 311 GAGTTGGAGT GTTTTGAAAC GGTGGATCAC GACGGTTCAT CGTGCCTGTG ACGGTCCGTC 1104 ||| | | || | | | | |||||| | | | ||| ||| |||||||||| GAGACTCAAT -TTCCACCAA AGAGTCT-GT GACGGTCCGT CACGCCCGTG ACGGTCCGTC 369 CTGCAGGTCC GTCACAGAGT TCAGAGAGTC -AATTTCAGC ACCCAAATTT CAGAATTTCT 1045 ||| ||| || || ||| |||||||||| | ||| || |||| ||||| |||||||||| GTGCCATTCC GTTACGAAGT TCAGAGAGTC GATTTTTAGT ACCC-AATTT CAGAATTTCT 428 AAGTGTTTTG GGACGAAACA CCCTCGACGG TCCGTCGTGC CCATGACGTT CCGTCATGCC 985 |||||||||| |||| || |||||| || || |||| || | | | | AAGTGTTTTG AAACGAGAC- TCCTCGA--- --CG--GT-- CCAT--CG-T --G-C-T--- 468 CATGACGTTC CGTCGTGGGT TCCGTCGTCT CAGCCTGTTT TTCCAGAAAT AAAATCTGCT 925 ||||||| || |||||||||| |||||||||| || ||||||| ||||| |||| |||||||||| CATGACGGTC CGTCGTGGGT TCCGTCGTCT CAACCTGTTT TTCCAAAAAT AAAATCTGCT 528 GCTCAAAACA ACTAAACAGG TCGTTACAAA ATATTTTTTA TAAATATTTT GACTTTTTAT 865 |||||||| |||||||||| ||||||| | ||| || | ACTCAAAACG ACTAAACAGG TCGTTAC-AT TTATGTTCT. .......... .......... 566 CTTATTAATT TTTATATTTT TTTAATCTAG CTATTTAATT TTTCTTAATT ATTATTATTA 805 | | ||||| | |||||||||| .......... .......... .......... ........TC CTACTTAAAT ATTATTATTA 588 TT 803 || TT 590 hqPGS_C06HBa0153O03.1-7-_SGN-E550127- (1580 1265,1163 886,826 803) ******************************************************************************** EST sequence 5 -strand 673 n (File: SGN-E550140-) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTACTCN 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATGTTATCAA CCATGAATTA ACAAAAAATT AGACCAAAAA 661 TATAAAAAAT TAC Predicted gene structure (within gDNA segment 2682 to 1): Exon 1 1580 1265 ( 316 n); cDNA 1 311 ( 311 n); score: 0.828 Intron 1 1264 1164 ( 101 n); Pd: 0.900 (s: 0.81), Pa: 0.000 (s: 0.52) Exon 2 1163 886 ( 278 n); cDNA 312 566 ( 255 n); score: 0.743 Intron 2 885 827 ( 59 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0) Exon 3 826 803 ( 24 n); cDNA 567 590 ( 24 n); score: 0.833 MATCH C06HBa0153O03.1-7- SGN-E550140- 0.788 618 0.918 C PGS_C06HBa0153O03.1-7-_SGN-E550140- (1580 1265,1163 886,826 803) Alignment (genomic DNA sequence = upper lines): ATCGATTC-C CTTCTCTCTC TTTCTCTTGT TCTTTCTATT TTCTTTATTC AAACCCTCTT 1522 |||||||| | |||||||||| ||||| || | ||||| || ||| |||||| |||||||| | ATCGATTCTC CTTCTCTCTC TTTCTGTT-C T-TTTCT-TT TTC-TTATTC AAACCCTCCT 56 TCTTTTACCC TAATTAGTAT ATAATTAAGA ATAAAAGATG ACAATAATAG CCCACTAATT 1462 |||||||||| ||||||| || |||||||||| |||||||||| ||||||| |||||||||| TCTTTTACCC TAATTAGCAT ATAATTAAGA ATAAAAGATG -GAATAATAA CCCACTAATT 115 AACTTAAGGT TACCTCTTTT ATTCCCCCAA GAAATT-GAG TTATTAATAT AGACCCACGA 1403 | ||||| |||||||||| | |||||| | |||| || ||||||| || | |||||| | ACTCNAAGGT TACCTCTTTT A-ACCCCCAG GTAATTAGAC TTATTAACAT AAACCCACTA 174 AATATATAAT TATAGCAGGA ATAGTCCAAA ACGCCCCTTT AAAACTTAAC CAGAATTCCG 1343 | | |||||| || || |||| |||||||||| ||| ||| || ||||| | |||| |||| ACTTTATAAT TAAAGTAGGA ATAGTCCAAA ACGTCCC-TT AAAACGTGTA AAGAAATCCG 233 ACTTCAACTG GGATTACGCA ACCTGTGACG GCCCGTCGCG CCTGCGACGG TCCGTCATGC 1283 || |||| |||||||||| |||||||| | |||||||| | |||||||||| |||||| ||| ACCCAGACTG GGATTACGCA ACCTGTGATG GCCCGTCGTG CCTGCGACGG TCCGTCCTGC 293 AGGTTCGTC- AGAGATTCGA TTTCCTTAAG GAGTCTGTGA CGGCCCGTCG TACCTACGAC 1224 ||| ||||| || ||| AGG-TCGTCG CAAGGTTCA. .......... .......... .......... .......... 311 GGTCCGTCCT GCATTTCCGT CACGACGTTC AGAGAATCGT TCCCTGTACC AAATTCTCAA 1164 .......... .......... .......... .......... .......... .......... 311 GAGTTGGAGT GTTTTGAAAC GGTGGATCAC GACGGTTCAT CGTGCCTGTG ACGGTCCGTC 1104 ||| | | || | | | | |||||| | | | ||| ||| |||||||||| GAGACTCAAT -TTCCACCAA AGAGTCT-GT GACGGTCCGT CACGCCCGTG ACGGTCCGTC 369 CTGCAGGTCC GTCACAGAGT TCAGAGAGTC -AATTTCAGC ACCCAAATTT CAGAATTTCT 1045 ||| ||| || || ||| |||||||||| | ||| || |||| ||||| |||||||||| GTGCCATTCC GTTACGAAGT TCAGAGAGTC GATTTTTAGT ACCC-AATTT CAGAATTTCT 428 AAGTGTTTTG GGACGAAACA CCCTCGACGG TCCGTCGTGC CCATGACGTT CCGTCATGCC 985 |||||||||| |||| || |||||| || || |||| || | | | | AAGTGTTTTG AAACGAGAC- TCCTCGA--- --CG--GT-- CCAT--CG-T --G-C-T--- 468 CATGACGTTC CGTCGTGGGT TCCGTCGTCT CAGCCTGTTT TTCCAGAAAT AAAATCTGCT 925 ||||||| || |||||||||| |||||||||| || ||||||| ||||| |||| |||||||||| CATGACGGTC CGTCGTGGGT TCCGTCGTCT CAACCTGTTT TTCCAAAAAT AAAATCTGCT 528 GCTCAAAACA ACTAAACAGG TCGTTACAAA ATATTTTTTA TAAATATTTT GACTTTTTAT 865 |||||||| |||||||||| ||||||| | ||| || | ACTCAAAACG ACTAAACAGG TCGTTAC-AT TTATGTTCT. .......... .......... 566 CTTATTAATT TTTATATTTT TTTAATCTAG CTATTTAATT TTTCTTAATT ATTATTATTA 805 | | ||||| | |||||||||| .......... .......... .......... ........TC CTACTTAAAT ATTATTATTA 588 TT 803 || TT 590 hqPGS_C06HBa0153O03.1-7-_SGN-E550140- (1580 1265,1163 886,826 803) ******************************************************************************** EST sequence 14 -strand 542 n (File: SGN-E252199-) 1 CGACCCAGCC TGGGATTACG CAGTCTGTGA CGGTCCGTCC TGCACGTCCG TCACAGAGTT 61 CAGAGACTAG ATTTTTACCA AGGGTCTGTG ACGGCCCATC ACGCCTGTGA CGGTCCGTCC 121 TGCCATTCCG TCACGAAGTT CAGAGAGTCG ATTTCAGTAC CCAAATTTCA GAATTCTAAG 181 TGTTTTGGAA CGAGACCCCC TCGACGGTCC GTCGTGGGAT CCGTCGTCTC AGTCAGTTTT 241 TCCAGAAATA AAATCTGTTA CTCAAAACGA CTAAACAGGT CGTTACAATA GATACCAATT 301 TACCCATCGT TCGTCCCCGA ACGATCACAA GAAGAAAAAC AAGGGCGAAA AGGAGTACCT 361 GAATCTGTAA ACAGGTATGG GTATCTTTCT CGCATATCAA CTTCCTTCTC CCAAGTGGAT 421 TCTTCAACTG GTCGATTCTT CCATTGAACT TTGATAGATG CAATCTCCCT TGACCTCAAT 481 TTGCGGACTT CTCTATCTAA AATGGCAACA GGCTCCTCCT CATAAGACAA ATTCTCATCA 541 AG Predicted gene structure (within gDNA segment 1968 to 1): Exon 1 1133 1012 ( 122 n); cDNA 90 210 ( 121 n); score: 0.877 Intron 1 1011 974 ( 38 n); Pd: 0.000 (s: 0.92), Pa: 0.000 (s: 0.90) Exon 2 973 891 ( 83 n); cDNA 211 293 ( 83 n); score: 0.904 MATCH C06HBa0153O03.1-7- SGN-E252199- 0.888 205 0.378 C PGS_C06HBa0153O03.1-7-_SGN-E252199- (1133 1012,973 891) Alignment (genomic DNA sequence = upper lines): GACGGTTCAT CGTGCCTGTG ACGGTCCGTC CTGCAGGTCC GTCACAGAGT TCAGAGAGTC 1074 ||||| ||| | ||||||| |||||||||| |||| ||| ||||| ||| |||||||||| GACGGCCCAT CACGCCTGTG ACGGTCCGTC CTGCCATTCC GTCACGAAGT TCAGAGAGTC 149 AATTTCAGCA CCCAAATTTC AGAATTTCTA AGTGTTTTGG GACGAAACAC CCTCGACGGT 1014 ||||||| | |||||||||| |||| ||||| |||||||||| |||| || | |||||||||| GATTTCAGTA CCCAAATTTC AGAA-TTCTA AGTGTTTTGG AACGAGACCC CCTCGACGGT 208 CCGTCGTGCC CATGACGTTC CGTCATGCCC ATGACGTTCC GTCGTGGGTT CCGTCGTCTC 954 || |||||||| | |||||||||| CC........ .......... .......... .......... GTCGTGGGAT CCGTCGTCTC 230 AGCCTGTTTT TCCAGAAATA AAATCTGCTG CTCAAAACAA CTAAACAGGT CGTTACAAAA 894 || | ||||| |||||||||| ||||||| | |||||||| | |||||||||| |||||||| | AGTCAGTTTT TCCAGAAATA AAATCTGTTA CTCAAAACGA CTAAACAGGT CGTTACAATA 290 TAT 891 || GAT 293 hqPGS_C06HBa0153O03.1-7-_SGN-E252199- (1133 1012,973 891) ******************************************************************************** EST sequence 9 -strand 681 n (File: SGN-E389553-) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCGTTCC TACTTAAATA TTATTATTAT TTTACGATTT 601 ATAACACTAT TAGAAACAAA GATTTTCTCA ACCATGAATT AATGAAAAAA TTATGGAATA 661 AAATATAAAA AATTACTCAT T Predicted gene structure (within gDNA segment 2682 to 1): Exon 1 1580 1265 ( 316 n); cDNA 1 311 ( 311 n); score: 0.834 Intron 1 1264 1164 ( 101 n); Pd: 0.900 (s: 0.81), Pa: 0.000 (s: 0.52) Exon 2 1163 903 ( 261 n); cDNA 312 550 ( 239 n); score: 0.745 Intron 2 902 316 ( 587 n); Pd: 0.812 (s: 0.92), Pa: 0.000 (s: 0.69) Exon 3 315 274 ( 42 n); cDNA 551 590 ( 40 n); score: 0.690 MATCH C06HBa0153O03.1-7- SGN-E389553- 0.794 619 0.909 C PGS_C06HBa0153O03.1-7-_SGN-E389553- (1580 1265,1163 903,315 274) Alignment (genomic DNA sequence = upper lines): ATCGATTC-C CTTCTCTCTC TTTCTCTTGT TCTTTCTATT TTCTTTATTC AAACCCTCTT 1522 |||||||| | |||||||||| ||||| || | ||||| || ||| |||||| |||||||| | ATCGATTCTC CTTCTCTCTC TTTCTGTT-C T-TTTCT-TT TTC-TTATTC AAACCCTCCT 56 TCTTTTACCC TAATTAGTAT ATAATTAAGA ATAAAAGATG ACAATAATAG CCCACTAATT 1462 |||||||||| ||||||| || |||||||||| |||||||||| ||||||| |||||||||| TCTTTTACCC TAATTAGCAT ATAATTAAGA ATAAAAGATG -GAATAATAA CCCACTAATT 115 AACTTAAGGT TACCTCTTTT ATTCCCCCAA GAAATT-GAG TTATTAATAT AGACCCACGA 1403 ||| ||||| |||||||||| | |||||| | |||| || ||||||| || | |||||| | TACTCAAGGT TACCTCTTTT A-ACCCCCAG GTAATTAGAC TTATTAACAT AAACCCACTA 174 AATATATAAT TATAGCAGGA ATAGTCCAAA ACGCCCCTTT AAAACTTAAC CAGAATTCCG 1343 | | |||||| || || |||| |||||||||| ||| ||| || ||||| | |||| |||| ACTTTATAAT TAAAGTAGGA ATAGTCCAAA ACGTCCC-TT AAAACGTGTA AAGAAATCCG 233 ACTTCAACTG GGATTACGCA ACCTGTGACG GCCCGTCGCG CCTGCGACGG TCCGTCATGC 1283 || |||| |||||||||| |||||||| | |||||||| | |||||||||| |||||| ||| ACCCAGACTG GGATTACGCA ACCTGTGATG GCCCGTCGTG CCTGCGACGG TCCGTCCTGC 293 AGGTTCGTC- AGAGATTCGA TTTCCTTAAG GAGTCTGTGA CGGCCCGTCG TACCTACGAC 1224 ||| ||||| || ||| AGG-TCGTCG CAAGGTTCA. .......... .......... .......... .......... 311 GGTCCGTCCT GCATTTCCGT CACGACGTTC AGAGAATCGT TCCCTGTACC AAATTCTCAA 1164 .......... .......... .......... .......... .......... .......... 311 GAGTTGGAGT GTTTTGAAAC GGTGGATCAC GACGGTTCAT CGTGCCTGTG ACGGTCCGTC 1104 ||| | | || | | | | |||||| | | | ||| ||| |||||||||| GAGACTCAAT -TTCCACCAA AGAGTCT-GT GACGGTCCGT CACGCCCGTG ACGGTCCGTC 369 CTGCAGGTCC GTCACAGAGT TCAGAGAGTC -AATTTCAGC ACCCAAATTT CAGAATTTCT 1045 ||| ||| || || ||| |||||||||| | ||| || |||| ||||| |||||||||| GTGCCATTCC GTTACGAAGT TCAGAGAGTC GATTTTTAGT ACCC-AATTT CAGAATTTCT 428 AAGTGTTTTG GGACGAAACA CCCTCGACGG TCCGTCGTGC CCATGACGTT CCGTCATGCC 985 |||||||||| |||| || |||||| || || |||| || | | | | AAGTGTTTTG AAACGAGAC- TCCTCGA--- --CG--GT-- CCAT--CG-T --G-C-T--- 468 CATGACGTTC CGTCGTGGGT TCCGTCGTCT CAGCCTGTTT TTCCAGAAAT AAAATCTGCT 925 ||||||| || |||||||||| |||||||||| || ||||||| ||||| |||| |||||||||| CATGACGGTC CGTCGTGGGT TCCGTCGTCT CAACCTGTTT TTCCAAAAAT AAAATCTGCT 528 GCTCAAAACA ACTAAACAGG TCGTTACAAA ATATTTTTTA TAAATATTTT GACTTTTTAT 865 |||||||| |||||||||| || ACTCAAAACG ACTAAACAGG TC........ .......... .......... .......... 550 CTTATTAATT TTTATATTTT TTTAATCTAG CTATTTAATT TTTCTTAATT ATTATTATTA 805 .......... .......... .......... .......... .......... .......... 550 TTATTATTTT ATAACAAAAA AAATAATTAA AAGAAATTAA CCTACCCATT ATCCCACCTT 745 .......... .......... .......... .......... .......... .......... 550 CACTTCACCA TTTCTCTCCA TTCACCCCAT ACCCCACTCC ACTAATCCAA TCTCACATAC 685 .......... .......... .......... .......... .......... .......... 550 ACACATACAC ACATATAAAT ATATATAAAT TAATAATGAG AGGAGAAGAG AAAAGAAGAA 625 .......... .......... .......... .......... .......... .......... 550 AAAATTGTAC AAAGTAAAAG AGAAGAAAAA ATTTCAGAAA TCCAAAGAGA AAATCAGCAA 565 .......... .......... .......... .......... .......... .......... 550 AAAAGAGGAG AAAAAAACGT GTAAAGGAAA GGAGGAGAAA AAACGTGTAA AGGAAAAGGA 505 .......... .......... .......... .......... .......... .......... 550 AAAAAAATAC ACACACAAAA AGGAAGAAAA TCACCTTAGA AAAATAAAAA ATCAGCAAAT 445 .......... .......... .......... .......... .......... .......... 550 ATACTCACAT TTTATTTCAT ATTTTGCTTT CACTCACAAA CAAAAGTTGA AGTTGAGATT 385 .......... .......... .......... .......... .......... .......... 550 TACTTTCGTG GTTTCGAATT CGTGGTTCCT TATTCGGATT ATTTTCTTTT GGTGGAATTT 325 .......... .......... .......... .......... .......... .......... 550 TTGCAAGAGG TACCATTTAA TTTTTATTGA TTAATTTAAT ATCATGATTA T 274 | | ||||| | | || || || || ||| || || |||| | .........G TTACATTT-A TGTTCGTT-C CTACTTAAAT ATTATTATTA T 590 hqPGS_C06HBa0153O03.1-7-_SGN-E389553- (1580 1265,1163 903) ******************************************************************************** EST sequence 82 +strand 713 n (File: SGN-E550464+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GNTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCTA CCATGAATTA ATGAAAAATT ATGCCATAAG 661 ATATAAAAAA TTTACTCATT TTTCATTGAG CTAATTCATA AAAAAAAAAA AAA Predicted gene structure (within gDNA segment 2672 to 1): Exon 1 1580 1265 ( 316 n); cDNA 1 311 ( 311 n); score: 0.834 Intron 1 1264 1164 ( 101 n); Pd: 0.900 (s: 0.81), Pa: 0.000 (s: 0.52) Exon 2 1163 903 ( 261 n); cDNA 312 550 ( 239 n); score: 0.745 Intron 2 902 853 ( 50 n); Pd: 0.812 (s: 0.92), Pa: 0.000 (s: 0.60) Exon 3 852 783 ( 70 n); cDNA 551 619 ( 69 n); score: 0.621 PPA cDNA 698 713 MATCH C06HBa0153O03.1-7- SGN-E550464+ 0.775 647 0.907 C PGS_C06HBa0153O03.1-7-_SGN-E550464+ (1580 1265,1163 903,852 783) Alignment (genomic DNA sequence = upper lines): ATCGATTC-C CTTCTCTCTC TTTCTCTTGT TCTTTCTATT TTCTTTATTC AAACCCTCTT 1522 |||||||| | |||||||||| ||||| || | ||||| || ||| |||||| |||||||| | ATCGATTCTC CTTCTCTCTC TTTCTGTT-C T-TTTCT-TT TTC-TTATTC AAACCCTCCT 56 TCTTTTACCC TAATTAGTAT ATAATTAAGA ATAAAAGATG ACAATAATAG CCCACTAATT 1462 |||||||||| ||||||| || |||||||||| |||||||||| ||||||| |||||||||| TCTTTTACCC TAATTAGCAT ATAATTAAGA ATAAAAGATG -GAATAATAA CCCACTAATT 115 AACTTAAGGT TACCTCTTTT ATTCCCCCAA GAAATT-GAG TTATTAATAT AGACCCACGA 1403 ||| ||||| |||||||||| | |||||| | |||| || ||||||| || | |||||| | TACTCAAGGT TACCTCTTTT A-ACCCCCAG GTAATTAGAC TTATTAACAT AAACCCACTA 174 AATATATAAT TATAGCAGGA ATAGTCCAAA ACGCCCCTTT AAAACTTAAC CAGAATTCCG 1343 | | |||||| || || |||| |||||||||| ||| ||| || ||||| | |||| |||| ACTTTATAAT TAAAGTAGGA ATAGTCCAAA ACGTCCC-TT AAAACGTGTA AAGAAATCCG 233 ACTTCAACTG GGATTACGCA ACCTGTGACG GCCCGTCGCG CCTGCGACGG TCCGTCATGC 1283 || |||| |||||||||| |||||||| | |||||||| | |||||||||| |||||| ||| ACCCAGACTG GGATTACGCA ACCTGTGATG GCCCGTCGTG CCTGCGACGG TCCGTCCTGC 293 AGGTTCGTC- AGAGATTCGA TTTCCTTAAG GAGTCTGTGA CGGCCCGTCG TACCTACGAC 1224 ||| ||||| || ||| AGG-TCGTCG CAAGGTTCA. .......... .......... .......... .......... 311 GGTCCGTCCT GCATTTCCGT CACGACGTTC AGAGAATCGT TCCCTGTACC AAATTCTCAA 1164 .......... .......... .......... .......... .......... .......... 311 GAGTTGGAGT GTTTTGAAAC GGTGGATCAC GACGGTTCAT CGTGCCTGTG ACGGTCCGTC 1104 ||| | | || | | | | |||||| | | | ||| ||| |||||||||| GAGACTCAAT -TTCCACCAA AGAGTCT-GT GACGGTCCGT CACGCCCGTG ACGGTCCGTC 369 CTGCAGGTCC GTCACAGAGT TCAGAGAGTC -AATTTCAGC ACCCAAATTT CAGAATTTCT 1045 ||| ||| || || ||| |||||||||| | ||| || |||| ||||| |||||||||| GTGCCATTCC GTTACGAAGT TCAGAGAGTC GATTTTTAGT ACCC-AATTT CAGAATTTCT 428 AAGTGTTTTG GGACGAAACA CCCTCGACGG TCCGTCGTGC CCATGACGTT CCGTCATGCC 985 |||||||||| |||| || |||||| || || |||| || | | | | AAGTGTTTTG AAACGAGAC- TCCTCGA--- --CG--GT-- CCAT--CG-T --G-C-T--- 468 CATGACGTTC CGTCGTGGGT TCCGTCGTCT CAGCCTGTTT TTCCAGAAAT AAAATCTGCT 925 ||||||| || |||||||||| |||||||||| || ||||||| ||||| |||| |||||||||| CATGACGGTC CGTCGTGGGT TCCGTCGTCT CAACCTGTTT TTCCAAAAAT AAAATCTGCT 528 GCTCAAAACA ACTAAACAGG TCGTTACAAA ATATTTTTTA TAAATATTTT GACTTTTTAT 865 |||||||| |||||||||| || ACTCAAAACG ACTAAACAGG TC........ .......... .......... .......... 550 CTTATTAATT TTTATATTTT TTTAATCTAG CTATTTAATT TTTCTTAATT ATTATTATTA 805 || || | | ||| ||| |||| | || || ||| ||| ||| | .......... ..GNTACATT TATGTTCTTC CTACTTAAAT ATTATT-ATT ATT-TTACGA 596 -TTATTATTT TATAACAAAA AAA 783 |||| | ||| | ||| ||| TTTATAACAC TATTAGAAAC AAA 619 hqPGS_C06HBa0153O03.1-7-_SGN-E550464+ (1580 1265,1163 903) ******************************************************************************** EST sequence 102 +strand 649 n (File: SGN-E374999+) 1 TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT 481 CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT CCAAACGACT 541 AAACAGGTCG TTACATTTAT GTTCTTCCTA CTTAAATATT ATTATTATTT TACGATTTAT 601 AACACTATTA GAAACAAAGA TTTTCTTCAC CCTGAATTAA TGAAAAAAT Predicted gene structure (within gDNA segment 2662 to 1): Exon 1 1579 1265 ( 315 n); cDNA 1 310 ( 310 n); score: 0.833 Intron 1 1264 1164 ( 101 n); Pd: 0.900 (s: 0.81), Pa: 0.000 (s: 0.52) Exon 2 1163 903 ( 261 n); cDNA 311 549 ( 239 n); score: 0.741 MATCH C06HBa0153O03.1-7- SGN-E374999+ 0.792 576 0.888 C PGS_C06HBa0153O03.1-7-_SGN-E374999+ (1579 1265,1163 903) Alignment (genomic DNA sequence = upper lines): TCGATTC-CC TTCTCTCTCT TTCTCTTGTT CTTTCTATTT TCTTTATTCA AACCCTCTTT 1521 ||||||| || |||||||||| |||| || | ||||| ||| || ||||||| ||||||| || TCGATTCTCC TTCTCTCTCT TTCTGTT-CT -TTTCT-TTT TC-TTATTCA AACCCTCCTT 56 CTTTTACCCT AATTAGTATA TAATTAAGAA TAAAAGATGA CAATAATAGC CCACTAATTA 1461 |||||||||| |||||| ||| |||||||||| ||||||||| ||||||| | ||||||||| CTTTTACCCT AATTAGCATA TAATTAAGAA TAAAAGATG- GAATAATAAC CCACTAATTT 115 ACTTAAGGTT ACCTCTTTTA TTCCCCCAAG AAATT-GAGT TATTAATATA GACCCACGAA 1402 ||| |||||| |||||||||| |||||| | |||| || | |||||| ||| |||||| || ACTCAAGGTT ACCTCTTTTA -ACCCCCAGG TAATTAGACT TATTAACATA AACCCACTAA 174 ATATATAATT ATAGCAGGAA TAGTCCAAAA CGCCCCTTTA AAACTTAACC AGAATTCCGA 1342 | ||||||| | || ||||| |||||||||| || ||| ||| |||| | |||| ||||| CTTTATAATT AAAGTAGGAA TAGTCCAAAA CGTCCC-TTA AAACGTGTAA AGAAATCCGA 233 CTTCAACTGG GATTACGCAA CCTGTGACGG CCCGTCGCGC CTGCGACGGT CCGTCATGCA 1282 | ||||| |||||||||| ||||||| || ||||||| || |||||||||| ||||| |||| CCCAGACTGG GATTACGCAA CCTGTGATGG CCCGTCGTGC CTGCGACGGT CCGTCCTGCA 293 GGTTCGTC-A GAGATTCGAT TTCCTTAAGG AGTCTGTGAC GGCCCGTCGT ACCTACGACG 1223 || ||||| || ||| GG-TCGTCGC AAGGTTCA.. .......... .......... .......... .......... 310 GTCCGTCCTG CATTTCCGTC ACGACGTTCA GAGAATCGTT CCCTGTACCA AATTCTCAAG 1163 | .......... .......... .......... .......... .......... .........G 311 AGTTGGAGTG TTTTGAAACG GTGGATCACG ACGGTTCATC GTGCCTGTGA CGGTCCGTCC 1103 || | | || | | | | | ||||| | || ||| |||| ||||||||| AGACTCAAT- TTCCACCAAA GAGTCT-GTG ACGGTCCGTC ACGCCCGTGA CGGTCCGTCG 369 TGCAGGTCCG TCACAGAGTT CAGAGAGTC- AATTTCAGCA CCCAAATTTC AGAATTTCTA 1044 ||| |||| | || |||| ||||||||| | ||| || | ||| |||||| |||||||||| TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCC-AATTTC AGAATTTCTA 428 AGTGTTTTGG GACGAAACAC CCTCGACGGT CCGTCGTGCC CATGACGTTC CGTCATGCCC 984 ||||||||| |||| || |||||| || || | ||| || | | | | | AGTGTTTTGA AACGAGAC-T CCTCGA---- -CG--GT--C CAT--CG-T- -G-C-T---C 468 ATGACGTTCC GTCGTGGGTT CCGTCGTCTC AGCCTGTTTT TCCAGAAATA AAATCTGCTG 924 |||||| ||| |||||||||| |||||||||| | |||||||| |||| ||||| ||||||||| ATGACGGTCC GTCGTGGGTT CCGTCGTCTC AACCTGTTTT TCCAAAAATA AAATCTGCTA 528 CTCAAAACAA CTAAACAGGT C 903 ||| |||| | |||||||||| | CTCCAAACGA CTAAACAGGT C 549 hqPGS_C06HBa0153O03.1-7-_SGN-E374999+ (1579 1265,1163 903) ******************************************************************************** EST sequence 75 +strand 720 n (File: SGN-E389834+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTA CGTCGACTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAGAACGAC 541 TAAACAGGAC GTTACATTTA TGATCGTCCT ACTTAAATAT CATTATTATT TTACGATTTA 601 TAACACTATT AGAAACGAAG ATTTTCTCGA CCATGAATTA ATGAAAAAAT ATGCCATGAA 661 ATATAAAAAT TTACTCGTTC TTCATTGAGC TATTCGTGAA AAAAAAAAAA AAATCGAGGG Predicted gene structure (within gDNA segment 2672 to 1): Exon 1 1580 1265 ( 316 n); cDNA 1 311 ( 311 n); score: 0.834 Intron 1 1264 1164 ( 101 n); Pd: 0.900 (s: 0.81), Pa: 0.000 (s: 0.52) Exon 2 1163 906 ( 258 n); cDNA 312 547 ( 236 n); score: 0.731 PPA cDNA 699 714 MATCH C06HBa0153O03.1-7- SGN-E389834+ 0.787 574 0.797 C PGS_C06HBa0153O03.1-7-_SGN-E389834+ (1580 1265,1163 906) Alignment (genomic DNA sequence = upper lines): ATCGATTC-C CTTCTCTCTC TTTCTCTTGT TCTTTCTATT TTCTTTATTC AAACCCTCTT 1522 |||||||| | |||||||||| ||||| || | ||||| || ||| |||||| |||||||| | ATCGATTCTC CTTCTCTCTC TTTCTGTT-C T-TTTCT-TT TTC-TTATTC AAACCCTCCT 56 TCTTTTACCC TAATTAGTAT ATAATTAAGA ATAAAAGATG ACAATAATAG CCCACTAATT 1462 |||||||||| ||||||| || |||||||||| |||||||||| ||||||| |||||||||| TCTTTTACCC TAATTAGCAT ATAATTAAGA ATAAAAGATG -GAATAATAA CCCACTAATT 115 AACTTAAGGT TACCTCTTTT ATTCCCCCAA GAAATT-GAG TTATTAATAT AGACCCACGA 1403 ||| ||||| |||||||||| | |||||| | |||| || ||||||| || | |||||| | TACTCAAGGT TACCTCTTTT A-ACCCCCAG GTAATTAGAC TTATTAACAT AAACCCACTA 174 AATATATAAT TATAGCAGGA ATAGTCCAAA ACGCCCCTTT AAAACTTAAC CAGAATTCCG 1343 | | |||||| || || |||| |||||||||| ||| ||| || ||||| | |||| |||| ACTTTATAAT TAAAGTAGGA ATAGTCCAAA ACGTCCC-TT AAAACGTGTA AAGAAATCCG 233 ACTTCAACTG GGATTACGCA ACCTGTGACG GCCCGTCGCG CCTGCGACGG TCCGTCATGC 1283 || |||| |||||||||| |||||||| | |||||||| | |||||||||| |||||| ||| ACCCAGACTG GGATTACGCA ACCTGTGATG GCCCGTCGTG CCTGCGACGG TCCGTCCTGC 293 AGGTTCGTC- AGAGATTCGA TTTCCTTAAG GAGTCTGTGA CGGCCCGTCG TACCTACGAC 1224 ||| ||||| || ||| AGG-TCGTCG CAAGGTTCA. .......... .......... .......... .......... 311 GGTCCGTCCT GCATTTCCGT CACGACGTTC AGAGAATCGT TCCCTGTACC AAATTCTCAA 1164 .......... .......... .......... .......... .......... .......... 311 GAGTTGGAGT GTTTTGAAAC GGTGGATCAC GACGGTTCAT CGTGCCTGTG ACGGTCCGTC 1104 ||| | | || | | | | |||||| | | | ||| ||| |||||||||| GAGACTCAAT -TTCCACCAA AGAGTCT-GT GACGGTCCGT CACGCCCGTG ACGGTCCGTC 369 CTGCAGGTCC GTCACAGAGT TCAGAGAGTC -AATTTCAGC ACCCAAATTT CAGAATTTCT 1045 ||| ||| || || ||| |||||||||| | ||| || |||| ||||| |||||||||| GTGCCATTCC GTTACGAAGT TCAGAGAGTC GATTTTTAGT ACCC-AATTT CAGAATTTCT 428 AAGTGTTTTG GGACGAAACA CCCTCGACGG TCCGTCGTGC CCATGACGTT CCGTCATGCC 985 |||||||||| |||| || |||||| || || |||| || | | | | AAGTGTTTTG AAACGAGAC- TCCTCGA--- --CG--GT-- CCAT--CG-T --G-C-T--- 468 CATGACGTTC CGTCGTGGGT TCCGTCGTCT CAGCCTGTTT TTCCAGAAAT AAAATCTGCT 925 ||||||| || |||||||||| | ||||| || || ||||||| ||||| |||| |||||||||| CATGACGGTC CGTCGTGGGT TACGTCGACT CAACCTGTTT TTCCAAAAAT AAAATCTGCT 528 GCTCAAAACA ACTAAACAG 906 |||| ||| ||||||||| ACTCAGAACG ACTAAACAG 547 hqPGS_C06HBa0153O03.1-7-_SGN-E389834+ (1580 1265,1163 906) ******************************************************************************** EST sequence 109 +strand 618 n (File: SGN-E396054+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAA Predicted gene structure (within gDNA segment 2672 to 1): Exon 1 1580 1265 ( 316 n); cDNA 1 311 ( 311 n); score: 0.834 Intron 1 1264 1164 ( 101 n); Pd: 0.900 (s: 0.81), Pa: 0.000 (s: 0.52) Exon 2 1163 906 ( 258 n); cDNA 312 547 ( 236 n); score: 0.742 MATCH C06HBa0153O03.1-7- SGN-E396054+ 0.793 574 0.929 C PGS_C06HBa0153O03.1-7-_SGN-E396054+ (1580 1265,1163 906) Alignment (genomic DNA sequence = upper lines): ATCGATTC-C CTTCTCTCTC TTTCTCTTGT TCTTTCTATT TTCTTTATTC AAACCCTCTT 1522 |||||||| | |||||||||| ||||| || | ||||| || ||| |||||| |||||||| | ATCGATTCTC CTTCTCTCTC TTTCTGTT-C T-TTTCT-TT TTC-TTATTC AAACCCTCCT 56 TCTTTTACCC TAATTAGTAT ATAATTAAGA ATAAAAGATG ACAATAATAG CCCACTAATT 1462 |||||||||| ||||||| || |||||||||| |||||||||| ||||||| |||||||||| TCTTTTACCC TAATTAGCAT ATAATTAAGA ATAAAAGATG -GAATAATAA CCCACTAATT 115 AACTTAAGGT TACCTCTTTT ATTCCCCCAA GAAATT-GAG TTATTAATAT AGACCCACGA 1403 ||| ||||| |||||||||| | |||||| | |||| || ||||||| || | |||||| | TACTCAAGGT TACCTCTTTT A-ACCCCCAG GTAATTAGAC TTATTAACAT AAACCCACTA 174 AATATATAAT TATAGCAGGA ATAGTCCAAA ACGCCCCTTT AAAACTTAAC CAGAATTCCG 1343 | | |||||| || || |||| |||||||||| ||| ||| || ||||| | |||| |||| ACTTTATAAT TAAAGTAGGA ATAGTCCAAA ACGTCCC-TT AAAACGTGTA AAGAAATCCG 233 ACTTCAACTG GGATTACGCA ACCTGTGACG GCCCGTCGCG CCTGCGACGG TCCGTCATGC 1283 || |||| |||||||||| |||||||| | |||||||| | |||||||||| |||||| ||| ACCCAGACTG GGATTACGCA ACCTGTGATG GCCCGTCGTG CCTGCGACGG TCCGTCCTGC 293 AGGTTCGTC- AGAGATTCGA TTTCCTTAAG GAGTCTGTGA CGGCCCGTCG TACCTACGAC 1224 ||| ||||| || ||| AGG-TCGTCG CAAGGTTCA. .......... .......... .......... .......... 311 GGTCCGTCCT GCATTTCCGT CACGACGTTC AGAGAATCGT TCCCTGTACC AAATTCTCAA 1164 .......... .......... .......... .......... .......... .......... 311 GAGTTGGAGT GTTTTGAAAC GGTGGATCAC GACGGTTCAT CGTGCCTGTG ACGGTCCGTC 1104 ||| | | || | | | | |||||| | | | ||| ||| |||||||||| GAGACTCAAT -TTCCACCAA AGAGTCT-GT GACGGTCCGT CACGCCCGTG ACGGTCCGTC 369 CTGCAGGTCC GTCACAGAGT TCAGAGAGTC -AATTTCAGC ACCCAAATTT CAGAATTTCT 1045 ||| ||| || || ||| |||||||||| | ||| || |||| ||||| |||||||||| GTGCCATTCC GTTACGAAGT TCAGAGAGTC GATTTTTAGT ACCC-AATTT CAGAATTTCT 428 AAGTGTTTTG GGACGAAACA CCCTCGACGG TCCGTCGTGC CCATGACGTT CCGTCATGCC 985 |||||||||| |||| || |||||| || || |||| || | | | | AAGTGTTTTG AAACGAGAC- TCCTCGA--- --CG--GT-- CCAT--CG-T --G-C-T--- 468 CATGACGTTC CGTCGTGGGT TCCGTCGTCT CAGCCTGTTT TTCCAGAAAT AAAATCTGCT 925 ||||||| || |||||||||| |||||||||| || ||||||| ||||| |||| |||||||||| CATGACGGTC CGTCGTGGGT TCCGTCGTCT CAACCTGTTT TTCCAAAAAT AAAATCTGCT 528 GCTCAAAACA ACTAAACAG 906 |||||||| ||||||||| ACTCAAAACG ACTAAACAG 547 hqPGS_C06HBa0153O03.1-7-_SGN-E396054+ (1580 1265,1163 906) ******************************************************************************** EST sequence 112 +strand 610 n (File: SGN-E396058+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTGTTT CCAAAAATAA AATCTGCTAC TCACAACGAC 541 TAAACAGGTC GTTACATTTA GGTTCTTCAT AGTTAACTAT TATTATTATT TTACGATTTA 601 TAACACTATT Predicted gene structure (within gDNA segment 2672 to 1): Exon 1 1580 1265 ( 316 n); cDNA 1 311 ( 311 n); score: 0.834 Intron 1 1264 1164 ( 101 n); Pd: 0.900 (s: 0.81), Pa: 0.000 (s: 0.52) Exon 2 1163 906 ( 258 n); cDNA 312 547 ( 236 n); score: 0.734 MATCH C06HBa0153O03.1-7- SGN-E396058+ 0.789 574 0.941 C PGS_C06HBa0153O03.1-7-_SGN-E396058+ (1580 1265,1163 906) Alignment (genomic DNA sequence = upper lines): ATCGATTC-C CTTCTCTCTC TTTCTCTTGT TCTTTCTATT TTCTTTATTC AAACCCTCTT 1522 |||||||| | |||||||||| ||||| || | ||||| || ||| |||||| |||||||| | ATCGATTCTC CTTCTCTCTC TTTCTGTT-C T-TTTCT-TT TTC-TTATTC AAACCCTCCT 56 TCTTTTACCC TAATTAGTAT ATAATTAAGA ATAAAAGATG ACAATAATAG CCCACTAATT 1462 |||||||||| ||||||| || |||||||||| |||||||||| ||||||| |||||||||| TCTTTTACCC TAATTAGCAT ATAATTAAGA ATAAAAGATG -GAATAATAA CCCACTAATT 115 AACTTAAGGT TACCTCTTTT ATTCCCCCAA GAAATT-GAG TTATTAATAT AGACCCACGA 1403 ||| ||||| |||||||||| | |||||| | |||| || ||||||| || | |||||| | TACTCAAGGT TACCTCTTTT A-ACCCCCAG GTAATTAGAC TTATTAACAT AAACCCACTA 174 AATATATAAT TATAGCAGGA ATAGTCCAAA ACGCCCCTTT AAAACTTAAC CAGAATTCCG 1343 | | |||||| || || |||| |||||||||| ||| ||| || ||||| | |||| |||| ACTTTATAAT TAAAGTAGGA ATAGTCCAAA ACGTCCC-TT AAAACGTGTA AAGAAATCCG 233 ACTTCAACTG GGATTACGCA ACCTGTGACG GCCCGTCGCG CCTGCGACGG TCCGTCATGC 1283 || |||| |||||||||| |||||||| | |||||||| | |||||||||| |||||| ||| ACCCAGACTG GGATTACGCA ACCTGTGATG GCCCGTCGTG CCTGCGACGG TCCGTCCTGC 293 AGGTTCGTC- AGAGATTCGA TTTCCTTAAG GAGTCTGTGA CGGCCCGTCG TACCTACGAC 1224 ||| ||||| || ||| AGG-TCGTCG CAAGGTTCA. .......... .......... .......... .......... 311 GGTCCGTCCT GCATTTCCGT CACGACGTTC AGAGAATCGT TCCCTGTACC AAATTCTCAA 1164 .......... .......... .......... .......... .......... .......... 311 GAGTTGGAGT GTTTTGAAAC GGTGGATCAC GACGGTTCAT CGTGCCTGTG ACGGTCCGTC 1104 ||| | | || | | | | |||||| | | | ||| ||| |||||||||| GAGACTCAAT -TTCCACCAA AGAGTCT-GT GACGGTCCGT CACGCCCGTG ACGGTCCGTC 369 CTGCAGGTCC GTCACAGAGT TCAGAGAGTC -AATTTCAGC ACCCAAATTT CAGAATTTCT 1045 ||| ||| || || ||| |||||||||| | ||| || |||| ||||| |||||||||| GTGCCATTCC GTTACGAAGT TCAGAGAGTC GATTTTTAGT ACCC-AATTT CAGAATTTCT 428 AAGTGTTTTG GGACGAAACA CCCTCGACGG TCCGTCGTGC CCATGACGTT CCGTCATGCC 985 |||||||||| |||| || |||||| || || |||| || | | | | AAGTGTTTTG AAACGAGAC- TCCTCGA--- --CG--GT-- CCAT--CG-T --G-C-T--- 468 CATGACGTTC CGTCGTGGGT TCCGTCGTCT CAGCCTGTTT TTCCAGAAAT AAAATCTGCT 925 ||||||| || |||||||||| |||||||||| || ||||| | ||||| |||| |||||||||| CATGACGGTC CGTCGTGGGT TCCGTCGTCT CAACCTGTGT TTCCAAAAAT AAAATCTGCT 528 GCTCAAAACA ACTAAACAG 906 |||| ||| ||||||||| ACTCACAACG ACTAAACAG 547 hqPGS_C06HBa0153O03.1-7-_SGN-E396058+ (1580 1265,1163 906) ******************************************************************************** EST sequence 113 +strand 454 n (File: SGN-E396070+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCGGGGGGGG 421 GAGTTTCTAA TTGTTTTGAA ACTAGACTCC TCGA Predicted gene structure (within gDNA segment 2672 to 1): Exon 1 1580 1294 ( 287 n); cDNA 1 282 ( 282 n); score: 0.847 Intron 1 1293 1111 ( 183 n); Pd: 0.000 (s: 0.88), Pa: 0.000 (s: 0.84) Exon 2 1110 1059 ( 52 n); cDNA 283 331 ( 49 n); score: 0.846 Intron 2 1058 1009 ( 50 n); Pd: 0.000 (s: 0.84), Pa: 0.000 (s: 0.80) Exon 3 1008 968 ( 41 n); cDNA 332 372 ( 41 n); score: 0.805 MATCH C06HBa0153O03.1-7- SGN-E396070+ 0.847 380 0.837 C PGS_C06HBa0153O03.1-7-_SGN-E396070+ (1580 1294,1110 1059,1008 968) Alignment (genomic DNA sequence = upper lines): ATCGATTC-C CTTCTCTCTC TTTCTCTTGT TCTTTCTATT TTCTTTATTC AAACCCTCTT 1522 |||||||| | |||||||||| ||||| || | ||||| || ||| |||||| |||||||| | ATCGATTCTC CTTCTCTCTC TTTCTGTT-C T-TTTCT-TT TTC-TTATTC AAACCCTCCT 56 TCTTTTACCC TAATTAGTAT ATAATTAAGA ATAAAAGATG ACAATAATAG CCCACTAATT 1462 |||||||||| ||||||| || |||||||||| |||||||||| ||||||| |||||||||| TCTTTTACCC TAATTAGCAT ATAATTAAGA ATAAAAGATG -GAATAATAA CCCACTAATT 115 AACTTAAGGT TACCTCTTTT ATTCCCCCAA GAAATT-GAG TTATTAATAT AGACCCACGA 1403 ||| ||||| |||||||||| | |||||| | |||| || ||||||| || | |||||| | TACTCAAGGT TACCTCTTTT A-ACCCCCAG GTAATTAGAC TTATTAACAT AAACCCACTA 174 AATATATAAT TATAGCAGGA ATAGTCCAAA ACGCCCCTTT AAAACTTAAC CAGAATTCCG 1343 | | |||||| || || |||| |||||||||| ||| ||| || ||||| | |||| |||| ACTTTATAAT TAAAGTAGGA ATAGTCCAAA ACGTCCC-TT AAAACGTGTA AAGAAATCCG 233 ACTTCAACTG GGATTACGCA ACCTGTGACG GCCCGTCGCG CCTGCGACGG TCCGTCATGC 1283 || |||| |||||||||| |||||||| | |||||||| | ||||||||| ACCCAGACTG GGATTACGCA ACCTGTGATG GCCCGTCGTG CCTGCGACG. .......... 282 AGGTTCGTCA GAGATTCGAT TTCCTTAAGG AGTCTGTGAC GGCCCGTCGT ACCTACGACG 1223 .......... .......... .......... .......... .......... .......... 282 GTCCGTCCTG CATTTCCGTC ACGACGTTCA GAGAATCGTT CCCTGTACCA AATTCTCAAG 1163 .......... .......... .......... .......... .......... .......... 282 AGTTGGAGTG TTTTGAAACG GTGGATCACG ACGGTTCATC GTGCCTGTGA CGGTCCGTCC 1103 |||||||| .......... .......... .......... .......... .......... ..GTCCGTCC 290 TGCAGGTCCG TCACAGAGTT CAGAGAGTCA ATTTCAGCAC CCAAATTTCA GAATTTCTAA 1043 ||||||| || || || ||| |||||| ||| ||||| ||| | || TGCAGGT-CG TCGCAAGGTT CAGAGACTCA ATTTC--CAC CAAA...... .......... 331 GTGTTTTGGG ACGAAACACC CTCGACGGTC CGTCGTGCCC ATGACGTTCC GTCATGCCCA 983 | | | ||||| ||| |||| |||| .......... .......... .......... ....GAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGTTCCG TCGTG 968 ||||| |||| ||||| TGACGGTCCG TCGTG 372 hqPGS_C06HBa0153O03.1-7-_SGN-E396070+ (1580 1294,1110 1059,1008 968) ******************************************************************************** EST sequence 34 -strand 660 n (File: SGN-E349296-) 1 AATATTATCA ATACATATTA TTCGCTATTA AGAGCTTACT ACGAATATCG TAAGAGAAAC 61 CATAACCTAC CTCCACCGAA GATTCGTGAT CAAGCAAGTG ATTTCCCAAG CTTTGTGTTT 121 TTTCCTCTCG TTCGATCCTC TTTCTCGTTC GACTTTCTCT CTCTTTCTCT TGTTCTTTCT 181 ATTTTCTTTA TTCAAACCCT CTTTCTTTTA CCCTAATTAG TATATAATTA AGAATAAAAT 241 ATGGCAATAA TAACCCACTA ATTAACTTAA GGTTACCTCT TTTAACCCCC AAGTAATTAG 301 ACTTATTAAC ATTAACCCAC TAACTTTATA ATTAAAGCAG GAATAGTCAA AAACGTCCCT 361 TAAAACAATT GAGGAATTCC GACTCAGACT GGGATTTACG CAGCCTGTGA CAGCCCGTTG 421 TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC GCAAGGTTCA GAGACTGGAT TTTCACTGAA 481 GACTCTGTGA TGGTCCATCA CGCCTGTGAC GGTCCGTCTT GCCATTCCGT TACGAAGTTC 541 AGAGAGTCGA TTTTCAGTAC CCAATTTCAG ATTTCCTAAG TGTTTTGAAA TGAGACCCTG 601 CGACGGTCCG TCGTGCCCAT GATGGTCCGT CGTGGGGTCC GTCATTTCTG CCAGTTTTTC Predicted gene structure (within gDNA segment 2844 to 1): Exon 1 1726 1265 ( 462 n); cDNA 1 460 ( 460 n); score: 0.820 Intron 1 1264 1160 ( 105 n); Pd: 0.900 (s: 0.77), Pa: 0.000 (s: 0.54) Exon 2 1159 990 ( 170 n); cDNA 461 631 ( 171 n); score: 0.732 MATCH C06HBa0153O03.1-7- SGN-E349296- 0.797 632 0.958 C PGS_C06HBa0153O03.1-7-_SGN-E349296- (1726 1265,1159 990) Alignment (genomic DNA sequence = upper lines): AATACTATCA ATACATATCA TTCTCTATTA AGAGTTTACT ATGAA-A--G CATGA-AAAC 1671 |||| ||||| |||||||| | ||| |||||| |||| ||||| | ||| | | | || |||| AATATTATCA ATACATATTA TTCGCTATTA AGAGCTTACT ACGAATATCG TAAGAGAAAC 60 CATAACCTAC CTCCACCGAA GATTCGTGAT CAAGCAAGCA AATTTTCTCA AAGCTTTGTG 1611 |||||||||| |||||||||| |||||||||| |||||||| | |||| | |||||||||| CATAACCTAC CTCCACCGAA GATTCGTGAT CAAGCAAG-T GA-TTTC-CC AAGCTTTGTG 117 TTTTTCCCCT TCTCGATCGT CTCTCTCTCT ATCGAT-TCC CTTCTCTCTC TTTCTCTTGT 1552 ||||| || ||||| ||| || |||| | ||| | | ||||||||| |||||||||| TTTTT-TCC- TCTCGTTCG- ATC-CTCTTT CTCGTTCGAC TTTCTCTCTC TTTCTCTTGT 173 TCTTTCTATT TTCTTTATTC AAACCCTCTT TCTTTTACCC TAATTAGTAT ATAATTAAGA 1492 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCTTTCTATT TTCTTTATTC AAACCCTCTT TCTTTTACCC TAATTAGTAT ATAATTAAGA 233 ATAAAAGATG ACAATAATAG CCCACTAATT AACTTAAGGT TACCTCTTTT ATTCCCCCAA 1432 |||||| ||| |||||||| |||||||||| |||||||||| |||||||||| | ||||||| ATAAAATATG GCAATAATAA CCCACTAATT AACTTAAGGT TACCTCTTTT A-ACCCCCAA 292 GAAATT-GAG TTATTAATAT AGACCCACGA AATATATAAT TATAGCAGGA ATAGTCCAAA 1373 | |||| || ||||||| || |||||| | | | |||||| || ||||||| |||||| ||| GTAATTAGAC TTATTAACAT TAACCCACTA ACTTTATAAT TAAAGCAGGA ATAGTCAAAA 352 ACGCCCCTTT AAAACTTAAC CAGAATTCCG ACTTCAACTG GGA-TTACGC AACCTGTGAC 1314 ||| ||| || ||||| |||||||| ||| |||| ||| |||||| | |||||||| ACGTCCC-TT AAAACAATTG AGGAATTCCG ACTCAGACTG GGATTTACGC AGCCTGTGAC 411 GGCCCGTCGC GCCTGCGACG GTCCGTCATG CAGGTTCGTC -AGAGATTCG ATTTCCTTAA 1255 |||||| | |||||||||| ||||||| || |||| ||||| || ||| AGCCCGTTGT GCCTGCGACG GTCCGTCCTG CAGG-TCGTC GCAAGGTTCA .......... 460 GGAGTCTGTG ACGGCCCGTC GTACCTACGA CGGTCCGTCC TGCATTTCCG TCACGACGTT 1195 .......... .......... .......... .......... .......... .......... 460 CAGAGAATCG TTCCCTGTAC CAAATTCTCA AGAGTTGGAG TGTTTTGAAA CGGTGGA-TC 1136 || || || | | | || || .......... .......... .......... .....GAGAC TGGATTTTCA CTGAAGACTC 485 -ACGACGGTT CATCGTGCCT GTGACGGTCC GTCCTGCAGG TCCGTCACAG AGTTCAGAGA 1077 || ||| |||| |||| |||||||||| ||| ||| ||||| || |||||||||| TGTGATGGTC CATCACGCCT GTGACGGTCC GTCTTGCCAT TCCGTTACGA AGTTCAGAGA 545 GTC-AATTTC AGCACCCAAA TTTCAGAATT TCTAAGTGTT TTGGGACGAA ACACCCTCGA 1018 ||| | |||| || |||| || ||||||| || ||||||||| ||| | || || || ||| GTCGATTTTC AGTACCC-AA TTTCAGATTT CCTAAGTGTT TTGAAATGAG AC-CCTGCGA 603 CGGTCCGTCG TGCCCATGAC GTTCCGTC 990 |||||||||| ||||||||| | |||||| CGGTCCGTCG TGCCCATGAT GGTCCGTC 631 hqPGS_C06HBa0153O03.1-7-_SGN-E349296- (1726 1265,1159 990) ******************************************************************************** EST sequence 87 +strand 558 n (File: SGN-E231589+) 1 TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA TAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT 481 CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT CAAAACGACT 541 AAACAGGTCG TTACATTT Predicted gene structure (within gDNA segment 2662 to 1): Exon 1 1579 1265 ( 315 n); cDNA 1 310 ( 310 n); score: 0.833 Intron 1 1264 1161 ( 104 n); Pd: 0.900 (s: 0.81), Pa: 0.000 (s: 0.50) Exon 2 1160 990 ( 171 n); cDNA 311 481 ( 171 n); score: 0.731 MATCH C06HBa0153O03.1-7- SGN-E231589+ 0.797 486 0.871 C PGS_C06HBa0153O03.1-7-_SGN-E231589+ (1579 1265,1160 990) Alignment (genomic DNA sequence = upper lines): TCGATTC-CC TTCTCTCTCT TTCTCTTGTT CTTTCTATTT TCTTTATTCA AACCCTCTTT 1521 ||||||| || |||||||||| |||| || | ||||| ||| || ||||||| ||||||| || TCGATTCTCC TTCTCTCTCT TTCTGTT-CT -TTTCT-TTT TC-TTATTCA AACCCTCCTT 56 CTTTTACCCT AATTAGTATA TAATTAAGAA TAAAAGATGA CAATAATAGC CCACTAATTA 1461 |||||||||| |||||| ||| |||||||||| ||||||||| ||||||| | ||||||||| CTTTTACCCT AATTAGCATA TAATTAAGAA TAAAAGATG- GAATAATAAC CCACTAATTT 115 ACTTAAGGTT ACCTCTTTTA TTCCCCCAAG AAATT-GAGT TATTAATATA GACCCACGAA 1402 ||| |||||| |||||||||| |||||| | |||| || | |||||| ||| |||||| || ACTCAAGGTT ACCTCTTTTA -ACCCCCAGG TAATTAGACT TATTAACATA AACCCACTAA 174 ATATATAATT ATAGCAGGAA TAGTCCAAAA CGCCCCTTTA AAACTTAACC AGAATTCCGA 1342 | ||||||| | || ||||| |||||||||| || ||| ||| |||| | |||| ||||| CTTTATAATT AAAGTAGGAA TAGTCCAAAA CGTCCC-TTA AAACGTGTAA AGAAATCCGA 233 CTTCAACTGG GATTACGCAA CCTGTGACGG CCCGTCGCGC CTGCGACGGT CCGTCATGCA 1282 | ||||| |||||||||| ||||||| || ||||||| || |||||||||| ||||| |||| CCCAGACTGG GATTACGCAA CCTGTGATGG CCCGTCGTGC CTGCGACGGT CCGTCCTGCA 293 GGTTCGTC-A GAGATTCGAT TTCCTTAAGG AGTCTGTGAC GGCCCGTCGT ACCTACGACG 1223 || ||||| || ||| GG-TCGTCGC AAGGTTCA.. .......... .......... .......... .......... 310 GTCCGTCCTG CATTTCCGTC ACGACGTTCA GAGAATCGTT CCCTGTACCA AATTCTCAAG 1163 .......... .......... .......... .......... .......... .......... 310 AGTTGGAGTG TTTTGAAACG GTGGATC-AC GACGGTTCAT CGTGCCTGTG ACGGTCCGTC 1104 | | ||| | | | || |||||| | | | ||| ||| |||||||||| ..TAGACTCA ATTTCCACCA AAGAGTCTGT GACGGTCCGT CACGCCCGTG ACGGTCCGTC 368 CTGCAGGTCC GTCACAGAGT TCAGAGAGTC -AATTTCAGC ACCCAAATTT CAGAATTTCT 1045 ||| ||| || || ||| |||||||||| | ||| || |||| ||||| |||||||||| GTGCCATTCC GTTACGAAGT TCAGAGAGTC GATTTTTAGT ACCC-AATTT CAGAATTTCT 427 AAGTGTTTTG GGACGAAACA CCCTCGACGG TCCGTCGTGC CCATGACGTT CCGTC 990 |||||||||| |||| || ||||||||| ||| |||||| ||||||| | ||||| AAGTGTTTTG AAACGAGAC- TCCTCGACGG TCCATCGTGC TCATGACGGT CCGTC 481 hqPGS_C06HBa0153O03.1-7-_SGN-E231589+ (1579 1265,1160 990) ******************************************************************************** EST sequence 140 +strand 545 n (File: SGN-E241959+) 1 TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT 481 CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT CAAAACGACT 541 AAACA Predicted gene structure (within gDNA segment 2662 to 1): Exon 1 1579 1265 ( 315 n); cDNA 1 310 ( 310 n); score: 0.833 Intron 1 1264 1164 ( 101 n); Pd: 0.900 (s: 0.81), Pa: 0.000 (s: 0.52) Exon 2 1163 990 ( 174 n); cDNA 311 481 ( 171 n); score: 0.739 MATCH C06HBa0153O03.1-7- SGN-E241959+ 0.800 489 0.897 C PGS_C06HBa0153O03.1-7-_SGN-E241959+ (1579 1265,1163 990) Alignment (genomic DNA sequence = upper lines): TCGATTC-CC TTCTCTCTCT TTCTCTTGTT CTTTCTATTT TCTTTATTCA AACCCTCTTT 1521 ||||||| || |||||||||| |||| || | ||||| ||| || ||||||| ||||||| || TCGATTCTCC TTCTCTCTCT TTCTGTT-CT -TTTCT-TTT TC-TTATTCA AACCCTCCTT 56 CTTTTACCCT AATTAGTATA TAATTAAGAA TAAAAGATGA CAATAATAGC CCACTAATTA 1461 |||||||||| |||||| ||| |||||||||| ||||||||| ||||||| | ||||||||| CTTTTACCCT AATTAGCATA TAATTAAGAA TAAAAGATG- GAATAATAAC CCACTAATTT 115 ACTTAAGGTT ACCTCTTTTA TTCCCCCAAG AAATT-GAGT TATTAATATA GACCCACGAA 1402 ||| |||||| |||||||||| |||||| | |||| || | |||||| ||| |||||| || ACTCAAGGTT ACCTCTTTTA -ACCCCCAGG TAATTAGACT TATTAACATA AACCCACTAA 174 ATATATAATT ATAGCAGGAA TAGTCCAAAA CGCCCCTTTA AAACTTAACC AGAATTCCGA 1342 | ||||||| | || ||||| |||||||||| || ||| ||| |||| | |||| ||||| CTTTATAATT AAAGTAGGAA TAGTCCAAAA CGTCCC-TTA AAACGTGTAA AGAAATCCGA 233 CTTCAACTGG GATTACGCAA CCTGTGACGG CCCGTCGCGC CTGCGACGGT CCGTCATGCA 1282 | ||||| |||||||||| ||||||| || ||||||| || |||||||||| ||||| |||| CCCAGACTGG GATTACGCAA CCTGTGATGG CCCGTCGTGC CTGCGACGGT CCGTCCTGCA 293 GGTTCGTC-A GAGATTCGAT TTCCTTAAGG AGTCTGTGAC GGCCCGTCGT ACCTACGACG 1223 || ||||| || ||| GG-TCGTCGC AAGGTTCA.. .......... .......... .......... .......... 310 GTCCGTCCTG CATTTCCGTC ACGACGTTCA GAGAATCGTT CCCTGTACCA AATTCTCAAG 1163 | .......... .......... .......... .......... .......... .........G 311 AGTTGGAGTG TTTTGAAACG GTGGATCACG ACGGTTCATC GTGCCTGTGA CGGTCCGTCC 1103 || | | || | | | | | ||||| | || ||| |||| ||||||||| AGACTCAAT- TTCCACCAAA GAGTCT-GTG ACGGTCCGTC ACGCCCGTGA CGGTCCGTCG 369 TGCAGGTCCG TCACAGAGTT CAGAGAGTC- AATTTCAGCA CCCAAATTTC AGAATTTCTA 1044 ||| |||| | || |||| ||||||||| | ||| || | ||| |||||| |||||||||| TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCC-AATTTC AGAATTTCTA 428 AGTGTTTTGG GACGAAACAC CCTCGACGGT CCGTCGTGCC CATGACGTTC CGTC 990 ||||||||| |||| || |||||||||| || |||||| ||||||| || |||| AGTGTTTTGA AACGAGAC-T CCTCGACGGT CCATCGTGCT CATGACGGTC CGTC 481 hqPGS_C06HBa0153O03.1-7-_SGN-E241959+ (1579 1265,1163 990) ******************************************************************************** EST sequence 116 +strand 472 n (File: SGN-E236652+) 1 TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GA Predicted gene structure (within gDNA segment 2662 to 1): Exon 1 1579 1265 ( 315 n); cDNA 1 310 ( 310 n); score: 0.833 Intron 1 1264 1164 ( 101 n); Pd: 0.900 (s: 0.81), Pa: 0.000 (s: 0.52) Exon 2 1163 999 ( 165 n); cDNA 311 472 ( 162 n); score: 0.730 MATCH C06HBa0153O03.1-7- SGN-E236652+ 0.798 480 1.017 C PGS_C06HBa0153O03.1-7-_SGN-E236652+ (1579 1265,1163 999) Alignment (genomic DNA sequence = upper lines): TCGATTC-CC TTCTCTCTCT TTCTCTTGTT CTTTCTATTT TCTTTATTCA AACCCTCTTT 1521 ||||||| || |||||||||| |||| || | ||||| ||| || ||||||| ||||||| || TCGATTCTCC TTCTCTCTCT TTCTGTT-CT -TTTCT-TTT TC-TTATTCA AACCCTCCTT 56 CTTTTACCCT AATTAGTATA TAATTAAGAA TAAAAGATGA CAATAATAGC CCACTAATTA 1461 |||||||||| |||||| ||| |||||||||| ||||||||| ||||||| | ||||||||| CTTTTACCCT AATTAGCATA TAATTAAGAA TAAAAGATG- GAATAATAAC CCACTAATTT 115 ACTTAAGGTT ACCTCTTTTA TTCCCCCAAG AAATT-GAGT TATTAATATA GACCCACGAA 1402 ||| |||||| |||||||||| |||||| | |||| || | |||||| ||| |||||| || ACTCAAGGTT ACCTCTTTTA -ACCCCCAGG TAATTAGACT TATTAACATA AACCCACTAA 174 ATATATAATT ATAGCAGGAA TAGTCCAAAA CGCCCCTTTA AAACTTAACC AGAATTCCGA 1342 | ||||||| | || ||||| |||||||||| || ||| ||| |||| | |||| ||||| CTTTATAATT AAAGTAGGAA TAGTCCAAAA CGTCCC-TTA AAACGTGTAA AGAAATCCGA 233 CTTCAACTGG GATTACGCAA CCTGTGACGG CCCGTCGCGC CTGCGACGGT CCGTCATGCA 1282 | ||||| |||||||||| ||||||| || ||||||| || |||||||||| ||||| |||| CCCAGACTGG GATTACGCAA CCTGTGATGG CCCGTCGTGC CTGCGACGGT CCGTCCTGCA 293 GGTTCGTC-A GAGATTCGAT TTCCTTAAGG AGTCTGTGAC GGCCCGTCGT ACCTACGACG 1223 || ||||| || ||| GG-TCGTCGC AAGGTTCA.. .......... .......... .......... .......... 310 GTCCGTCCTG CATTTCCGTC ACGACGTTCA GAGAATCGTT CCCTGTACCA AATTCTCAAG 1163 | .......... .......... .......... .......... .......... .........G 311 AGTTGGAGTG TTTTGAAACG GTGGATCACG ACGGTTCATC GTGCCTGTGA CGGTCCGTCC 1103 || | | || | | | | | ||||| | || ||| |||| ||||||||| AGACTCAAT- TTCCACCAAA GAGTCT-GTG ACGGTCCGTC ACGCCCGTGA CGGTCCGTCG 369 TGCAGGTCCG TCACAGAGTT CAGAGAGTC- AATTTCAGCA CCCAAATTTC AGAATTTCTA 1044 ||| |||| | || |||| ||||||||| | ||| || | ||| |||||| |||||||||| TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCC-AATTTC AGAATTTCTA 428 AGTGTTTTGG GACGAAACAC CCTCGACGGT CCGTCGTGCC CATGA 999 ||||||||| |||| || |||||||||| || |||||| ||||| AGTGTTTTGA AACGAGAC-T CCTCGACGGT CCATCGTGCT CATGA 472 hqPGS_C06HBa0153O03.1-7-_SGN-E236652+ (1579 1265,1163 999) ******************************************************************************** EST sequence 29 -strand 548 n (File: SGN-E356257-) 1 GTTAACTAGA AAATTAAAGT GATAGAGTCA AATAATGTAA CGACCCGTTT AGTCGTTTTG 61 AGCAGCAGAC TTTATTTCTG GAAAAACTGG CAGAAGCGAC GGACCCCACG ACGGACCGTC 121 ATGGGCACGA CGGACCATCG CAGGGTCTCG TTTCAAAACC CTCTTTCTTT TACCCCAAAT 181 TAACATATAA TTAAGAATAA AAGATGGCAA TAATACCCCA CTAATTAACT TAGGGTTACC 241 TCTTTTAACC CCAAGAATTT GAGTTATTAA TATAAACCCA CGAAATCTAT AATTAAGGAA 301 AGAATAGTCC AAAAACGTCC CTTAAAACGT GTAAGGAAAT CCGATTCTGC CTGGGATTTG 361 CGCAACCTGT GACGGGCCGT CGTGACTGTG ACGGTCCGTC CTGCAGGTCG TCGCAAGGGT 421 CAGAGAGTCA ATTTCCACTG AACAATCTAT GACGGTCCGT CACGCCTGTG ATGGTCCGTC 481 CTGTCATTCC GTCACGAAGT TCAGAGAGTC GATTTTCAGT ACCCAATTTC AGATTTTCTA 541 AGTGTTTT Predicted gene structure (within gDNA segment 3691 to 1): Exon 1 1531 1265 ( 267 n); cDNA 156 422 ( 267 n); score: 0.805 Intron 1 1264 1160 ( 105 n); Pd: 0.900 (s: 0.72), Pa: 0.000 (s: 0.52) Exon 2 1159 1036 ( 124 n); cDNA 423 548 ( 126 n); score: 0.714 MATCH C06HBa0153O03.1-7- SGN-E356257- 0.776 391 0.714 C PGS_C06HBa0153O03.1-7-_SGN-E356257- (1531 1265,1159 1036) Alignment (genomic DNA sequence = upper lines): AAACCCTCTT TCTTTTA-CC CTAATTAGTA TATAATTAAG AATAAAAGAT GACAATAATA 1473 |||||||||| ||||||| || | ||||| | |||||||||| |||||||||| | |||||||| AAACCCTCTT TCTTTTACCC CAAATTAACA TATAATTAAG AATAAAAGAT GGCAATAATA 215 GCCCACTAAT TAACTTAAGG TTACCTCTTT TATTCCCCCA AGAAATTGAG TTATTAATAT 1413 ||||||||| ||||||| || |||||||||| || ||||| |||| ||||| |||||||||| CCCCACTAAT TAACTTAGGG TTACCTCTTT TA--ACCCCA AGAATTTGAG TTATTAATAT 273 AGACCCACGA AATATATAAT TATAGCAGGA ATAGTCC-AA AACGCCCCTT TAAAACTTAA 1354 | |||||||| ||| |||||| || | | || ||||||| || |||| ||| | |||||| | AAACCCACGA AATCTATAAT TAAGGAAAGA ATAGTCCAAA AACGTCCC-T TAAAACGTGT 332 CCAGAATTCC GACTTCAACT GGGA-TTACG CAACCTGTGA CGGCCCGTCG CGCCTGCGAC 1295 ||| ||| || | || |||| || || |||||||||| ||| |||||| | ||| ||| AAGGAAATCC GATTCTGCCT GGGATTTGCG CAACCTGTGA CGGGCCGTCG TGACTGTGAC 392 GGTCCGTCAT GCAGGTTCGT C-AGAGATTC GATTTCCTTA AGGAGTCTGT GACGGCCCGT 1236 |||||||| | ||||| |||| | || || GGTCCGTCCT GCAGG-TCGT CGCAAGGGTC A......... .......... .......... 422 CGTACCTACG ACGGTCCGTC CTGCATTTCC GTCACGACGT TCAGAGAATC GTTCCCTGTA 1176 .......... .......... .......... .......... .......... .......... 422 CCAAATTCTC AAGAGTTGGA GT-GTTTTGA AACGGTGGAT C-ACGACGGT TCATCGTGCC 1118 || || ||| | | || | | |||||| | || ||| .......... ......GAGA GTCAATTTCC ACTGAACAAT CTATGACGGT CCGTCACGCC 466 TGTGACGGTC CGTCCTGCAG GTCCGTCACA GAGTTCAGAG AGTC-AATTT CAGCACCCAA 1059 ||||| |||| ||||||| |||||||| ||||||||| |||| | ||| ||| |||| | TGTGATGGTC CGTCCTGTCA TTCCGTCACG AAGTTCAGAG AGTCGATTTT CAGTACCC-A 525 ATTTCAGAAT TTCTAAGTGT TTT 1036 |||||||| | |||||||||| ||| ATTTCAGATT TTCTAAGTGT TTT 548 hqPGS_C06HBa0153O03.1-7-_SGN-E356257- (1531 1265,1159 1036) ******************************************************************************** EST sequence 25 -strand 565 n (File: SGN-E275667-) 1 GAGACATCTG TGACGGACCG TCGTGCCTGT GACGGTCCGT CGTGGGTTCC GTTGTTTCAG 61 CCAATTTTCC AGAAATAAAA TCTGCTGCTC AAAACGACTA AACAGGTCGT TACAGTAATG 121 AAAAAAAAGA AGAAAGAGAA TAAAGAAAGA AGAAAGAAAG AGAAAAGGGA AGAAGAAGAA 181 AGAGAAAAAG AAAAAGAAAA AGAAAATGAA ATTGATAAAA TAAGAAAAAT AAAAATAAAA 241 ATTAATACGT GGCAGATTAT AATTGATGCG TAATTGAACT TCTTTTTTTG CAAGTGAGGA 301 TGGTTAAAAA ATGAGATATT TACAACACTT TAAAATTATT TAAGGGAGTA ATAAAATGTC 361 CGCTAAGTTA AGATATCTTT TTAATAATTT AAAATAACTT TAATGGTATT TTTATATCTT 421 TTCTCAAATA TTAAACTTTT TTAGAATACA CTCAATTCGC CTCAATGTCT TTTAAAATTT 481 TGACATTATG AATATCGACA TCACATGGTG CATATGCAAA AGTAATCACA TTATTATTGA 541 ATAGATCTAA TCTTTCATTG AGCGC Predicted gene structure (within gDNA segment 1936 to 1): Exon 1 1001 903 ( 99 n); cDNA 11 108 ( 98 n); score: 0.879 Intron 1 902 666 ( 237 n); Pd: 0.812 (s: 0.92), Pa: 0.000 (s: 0.69) Exon 2 665 621 ( 45 n); cDNA 109 151 ( 43 n); score: 0.689 Intron 2 620 539 ( 82 n); Pd: 0.000 (s: 0.69), Pa: 0.000 (s: 0.62) Exon 3 538 455 ( 84 n); cDNA 152 235 ( 84 n); score: 0.655 MATCH C06HBa0153O03.1-7- SGN-E275667- 0.776 228 0.404 C PGS_C06HBa0153O03.1-7-_SGN-E275667- (1001 903,665 621,538 455) Alignment (genomic DNA sequence = upper lines): TGACGTTCCG TCATGCCCAT GACGTTCCGT CGTGGGTTCC GTCGTCTCAG CCTGTTTTTC 942 ||||| ||| || |||| | |||| ||||| |||||||||| || || |||| || ||||| TGACGGACCG TCGTGCCTGT GACGGTCCGT CGTGGGTTCC GTTGTTTCAG CC-AATTTTC 69 CAGAAATAAA ATCTGCTGCT CAAAACAACT AAACAGGTCG TTACAAAATA TTTTTTATAA 882 |||||||||| |||||||||| |||||| ||| ||||||||| CAGAAATAAA ATCTGCTGCT CAAAACGACT AAACAGGTC. .......... .......... 108 ATATTTTGAC TTTTTATCTT ATTAATTTTT ATATTTTTTT AATCTAGCTA TTTAATTTTT 822 .......... .......... .......... .......... .......... .......... 108 CTTAATTATT ATTATTATTA TTATTTTATA ACAAAAAAAA TAATTAAAAG AAATTAACCT 762 .......... .......... .......... .......... .......... .......... 108 ACCCATTATC CCACCTTCAC TTCACCATTT CTCTCCATTC ACCCCATACC CCACTCCACT 702 .......... .......... .......... .......... .......... .......... 108 AATCCAATCT CACATACACA CATACACACA TATAAATATA TATAAATTAA TAATGAGAGG 642 || | ||| || || ||| | .......... .......... .......... ......GTTA CAGTAATGAA AAAAAAGAAG 132 AGAAGAGAAA AGAAGAAAAA ATTGTACAAA GTAAAAGAGA AGAAAAAATT TCAGAAATCC 582 | ||||||| | |||||| | | A-AAGAGAAT A-AAGAAAGA A......... .......... .......... .......... 151 AAAGAGAAAA TCAGCAAAAA AGAGGAGAAA AAAACGTGTA AAGGAAAGGA GGAGAAAAAA 522 ||||| | ||||||| .......... .......... .......... .......... ...GAAAGAA AGAGAAAAGG 168 CGTGTAAAGG AAAAGGAAAA AAAATACACA CACAAAAAGG AAGAAAATCA CCTTAGAAAA 462 | | | | ||| ||||| | || | | | |||| | || || | | |||||| GAAGAAGAAG AAAGAGAAAA AGAAAAAGAA AAAGAAAATG AAATTGATAA AATAAGAAAA 228 ATAAAAA 455 ||||||| ATAAAAA 235 hqPGS_C06HBa0153O03.1-7-_SGN-E275667- (1001 903) ******************************************************************************** EST sequence 33 -strand 725 n (File: SGN-E546548-) 1 GGTACCGGAA CGTGGCACCC GATCCATATT CTATCCTGGT GTCGGAACGT GACACTCCGA 61 TCCTCATATT CATTCTATCC TGGTACCGGA ACGTGGCACC CGATCCCCTA ATCCATCAAG 121 CCTTCTTTTA CACTAAGGCA TCATCATTCT CATTATATAA TTTATCAAGC CTTCTTTCAT 181 ACTAAGGCAT CATCATTCTC ATTATATAAT ATATCAAGCG AATTAGGGTT CTTTCAAGAT 241 TTGGGATTCA ATTGCTTCAT CATGCTTTGT TAATTCATCG CAATTTCATA ATCATAATCA 301 TGCAAGCATA CAACTTAAGC ACATAGCAGG GTTTACAATA CTATCAACAC ATAATATTCA 361 CTATTAAGAG TTCACTACGA ATATCGTAAC ATAAACCATA ACCTACCTCC ACCGAAGAAT 421 TGAATCAACA AGCTATCTTC TCAAAATCCT TGCTATCCTC TTCGTTTCTC TCTCTCTACT 481 CGTTCGTTTC TCCTCTCTTT CTGTTCTTTT CTTTTGTTTT GTTTTATTCA AACCCTCCTT 541 CTTTTTACCC TAATTAAAAG TATAATTAAG TGTAAAGGAG GACAATAAAA CCCACTAATT 601 AACTTAAGGT TACCTCTTTT AACCCCCAAG TAATTAGACC TATTAATATT AACCCTCAAT 661 CTTTATAATT AAGGAAAGAA TAGTCCAAAA CGACCCCTAA AACGTGTAGA GGAATCCTAT 721 TTTGC Predicted gene structure (within gDNA segment 5498 to 1): Exon 1 2309 2168 ( 142 n); cDNA 2 134 ( 133 n); score: 0.669 Intron 1 2167 1961 ( 207 n); Pd: 0.000 (s: 0.66), Pa: 0.000 (s: 0.74) Exon 2 1960 1884 ( 77 n); cDNA 135 209 ( 75 n); score: 0.753 Intron 2 1883 1850 ( 34 n); Pd: 0.000 (s: 0.84), Pa: 0.000 (s: 0.74) Exon 3 1849 1371 ( 479 n); cDNA 210 691 ( 482 n); score: 0.743 MATCH C06HBa0153O03.1-7- SGN-E546548- 0.729 698 0.963 C PGS_C06HBa0153O03.1-7-_SGN-E546548- (2309 2168,1960 1884,1849 1371) Alignment (genomic DNA sequence = upper lines): GTATATTAAC ATCTTTCAAG ATTCATGATC TTTATTTCTC TTGTGTCGGT ACGTGACACT 2250 ||| ||| | | | || ||| || | || || | | ||||||| |||||||||| GTACCGGAAC GTGGCACCCG ATCCAT-AT- TCTA--TC-C TGGTGTCGGA ACGTGACACT 56 CCGCTCCCTC ATATTCATTA ATCCTCTTGT GTCGGTACGT GACACTCCGA TCCCCTAAAT 2190 ||| | |||| ||||||||| | || || ||| |||| | ||| |||| |||||| ||| CCGAT-CCTC ATATTCATT- CTATCCTGGT ACCGGAACGT GGCAC-CCGA TCCCCT-AAT 112 CTACGTGTCG GTTCGTGACA CCCGATCCCC TAAATCTACG TATCGGTTCG TGACACCCGT 2130 | | | | | ||| | CCATCAAGCC TTCTTTTACA CT........ .......... .......... .......... 134 TCCCCTAAAT CTACATGTCG GTTCGTGACA CCCGGTCCCC TAATTCTACG TGTCGGTTCG 2070 .......... .......... .......... .......... .......... .......... 134 TGACACCCAA TCCCCTAATT CTACGTGTCG GTTCGTGACA CCCGATCCCC TAATACTACG 2010 .......... .......... .......... .......... .......... .......... 134 TGTCGGTTCA TGACACCCGA TCCCCTAATA CTACGTGTCG GTTCGTGACA CCCGATCCCC 1950 | ||| | .......... .......... .......... .......... .........A AGGCATCATC 145 TAATCTCCTT CTATCAATTC ATCAAGCCTT CTTTCTTACC AAGGCATCAT CCATCCCATT 1890 | |||| || ||| |||| |||||||||| ||||| ||| |||||||||| | || |||| -ATTCTCATT ATAT-AATTT ATCAAGCCTT CTTTCATACT AAGGCATCAT CATTCTCATT 203 ATTTTAGTTC ATCACGCCTT TTTTTATACC AAGGTCTCAT TATTAACAAA GAGATTAGGA 1830 || | | || || ||| |||||| ATATAA.... .......... .......... .......... TA-TATCAAG CGAATTAGGG 228 TT-TTACAAG ATTTGGGATT CAATAACTTC ATCATGC-TT AAT-ATA-AT CACAATTATA 1774 || || |||| |||||||||| |||| |||| ||||||| || | || || | ||||| | TTCTTTCAAG ATTTGGGATT CAATTGCTTC ATCATGCTTT GTTAATTCAT CGCAATTTCA 288 TAATCATGTT CATGCATGCA TACAA-TTAA GCACATAGCA GGGTTTACAA TACTATCAAT 1715 ||||||| | |||||| ||| ||||| |||| |||||||||| |||||||||| ||||||||| TAATCATAAT CATGCAAGCA TACAACTTAA GCACATAGCA GGGTTTACAA TACTATCAAC 348 ACATATCATT CTCTATTAAG AGTTTACTAT GAA-A--GCA TGA-AAACCA TAACCTACCT 1659 ||||| ||| | |||||||| |||| |||| ||| | | | | |||||| |||||||||| ACATAATATT CACTATTAAG AGTTCACTAC GAATATCGTA ACATAAACCA TAACCTACCT 408 CCACCGAAGA TTCGTGATCA AGCAAGCAAA TTTTCTCAAA GCTTTGTGTT TTTCCCCTTC 1599 |||||||||| | | |||| | ||||| | | |||||||| | || | ||| |||| CCACCGAAGA ATTG-AATCA A-CAAGC-TA TCTTCTCAAA -ATCCTTG-C TATCCTCTTC 463 -TCGATCGTC TCTCT-CTCT ATCGATTCCC TTCTCTCTCT TTCTCTTGTT CTTTCTATTT 1541 | || || ||||| ||| ||| ||| | ||||| ||| | |||| || |||| ||| GTTTCTC-TC TCTCTACTCG TTCGTTTCTC CTCTCTTTCT GT-TCTT-TT CTTTTGTTTT 520 TCTTTATTCA AACCCTCTTT C-TTTTACCC TAATTAGTA- TATAATTAAG AATAAAAGAT 1483 |||||||| ||||||| || | |||||||| |||||| | |||||||||| |||| || GTTTTATTCA AACCCTCCTT CTTTTTACCC TAATTAAAAG TATAATTAAG TGTAAAGGAG 580 GACAATAATA GCCCACTAAT TAACTTAAGG TTACCTCTTT TATTCCCCCA AGAAATT-GA 1424 |||||||| | ||||||||| |||||||||| |||||||||| || |||||| || |||| || GACAATAA-A ACCCACTAAT TAACTTAAGG TTACCTCTTT TA-ACCCCCA AGTAATTAGA 638 GTTATTAATA TAGACCCACG AAATATATAA TTATAGCAGG AATAGTCCAA AAC 1371 |||||||| | |||| | | | ||||| ||| | | | |||||||||| ||| CCTATTAATA TTAACCCTCA ATCTTTATAA TTAAGGAAAG AATAGTCCAA AAC 691 hqPGS_C06HBa0153O03.1-7-_SGN-E546548- (1960 1884,1849 1371) ******************************************************************************** EST sequence 1 -strand 605 n (File: SGN-E347579-) 1 ATCCCCTAAT TCTACGTGTC GGTTCGTGAC ACCCGATCCC CTAATTCTAC GTGTCGGTTC 61 GTGACACCTG ATCCCCTAAT CTACGTGCCG GTTCGTGACA CCCGATCCCC TAATTCTACG 121 TGCCAGTTCG TGACACCCGA TCCCCTAATT CTACGTGTCG GTTCGTGACA CCCGATCCCC 181 TGCATGTGTC GGTACGTGAC ACTCCGATCC ACTAATATCA TTCTGTAAAT CATCAGGCCT 241 TCTCTATACC AAGGCATCAT CAATCCCATT ACTTTTATTC ATCAAGCCTT CTTCTATACC 301 AAGGCATCAT CATTAATAAG AGATTAGATT TTTATCAAGA TTTGGGATTC AATAACTTCA 361 TCATGCTTAA TATAATCACA ATTATATAAT CACGTTCATG CATGCATACA ATTAAGCATA 421 TAGCAGGGTT TACAATACTA CCAATACATA TCATTCTCTA TTAAGAGTTT ACTATGAAAG 481 CATGAAAACC ATAACCTACC TCCACCGAAG ATTAGTGATC AAGCAAGCAA ATTTTTCTCC 541 AAGCTTTGTT TCTCCCTTCT CGTTCGATTC TTCCTCTCTC TCTTGTTCTT TCTATTTTCT 601 TTATT Predicted gene structure (within gDNA segment 2670 to 325): Exon 1 2200 2018 ( 183 n); cDNA 1 182 ( 182 n); score: 0.929 Intron 1 2017 1979 ( 39 n); Pd: 0.000 (s: 0.96), Pa: 0.000 (s: 0.77) Exon 2 1978 1593 ( 386 n); cDNA 183 565 ( 383 n); score: 0.885 MATCH C06HBa0153O03.1-7- SGN-E347579- 0.899 569 0.940 C PGS_C06HBa0153O03.1-7-_SGN-E347579- (2200 2018,1978 1593) Alignment (genomic DNA sequence = upper lines): ATCCCCTAAA TCTACGTGTC GGTTCGTGAC ACCCGATCCC CTAAATCTAC GTATCGGTTC 2141 ||||||||| |||||||||| |||||||||| |||||||||| |||| ||||| || ||||||| ATCCCCTAAT TCTACGTGTC GGTTCGTGAC ACCCGATCCC CTAATTCTAC GTGTCGGTTC 60 GTGACACCCG TTCCCCTAAA TCTACATGTC GGTTCGTGAC ACCCGGTCCC CTAATTCTAC 2081 |||||||| | |||||| || ||||| || | |||||||||| ||||| |||| |||||||||| GTGACACCTG ATCCCCT-AA TCTACGTGCC GGTTCGTGAC ACCCGATCCC CTAATTCTAC 119 GTGTCGGTTC GTGACACCCA ATCCCCTAAT TCTACGTGTC GGTTCGTGAC ACCCGATCCC 2021 ||| | |||| ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTGCCAGTTC GTGACACCCG ATCCCCTAAT TCTACGTGTC GGTTCGTGAC ACCCGATCCC 179 CTAATACTAC GTGTCGGTTC ATGACACCCG ATCCCCTAAT ACTACGTGTC GGTTCGTGAC 1961 || | ||||| ||| |||||| CTG....... .......... .......... .......... ..CATGTGTC GGTACGTGAC 200 AC-CCGATCC CCTAATCTCC TTCTATCAAT TCATCAAGCC TTCTTTCTTA CCAAGGCATC 1902 || ||||||| ||||| || |||| | || |||||| ||| |||| | || |||||||||| ACTCCGATCC ACTAATATCA TTCTGT-AAA TCATCAGGCC TTCTCT-ATA CCAAGGCATC 258 ATCCATCCCA TTATTTTAGT TCATCACGCC TTTTTTTATA CCAAGGTCTC ATTATTAACA 1842 ||| |||||| ||| ||| | |||||| ||| || || |||| |||||| || || ||||| ATCAATCCCA TTACTTTTAT TCATCAAGCC TTCTTCTATA CCAAGGCATC ATCATTAA-T 317 AAGAGATTAG GATTTTA-CA AGATTTGGGA TTCAATAACT TCATCATGCT TAATATAATC 1783 |||||||||| ||||| || |||||||||| |||||||||| |||||||||| |||||||||| AAGAGATTAG ATTTTTATCA AGATTTGGGA TTCAATAACT TCATCATGCT TAATATAATC 377 ACAATTATAT AATCATGTTC ATGCATGCAT ACAATTAAGC ACATAGCAGG GTTTACAATA 1723 |||||||||| ||||| |||| |||||||||| |||||||||| | |||||||| |||||||||| ACAATTATAT AATCACGTTC ATGCATGCAT ACAATTAAGC ATATAGCAGG GTTTACAATA 437 CTATCAATAC ATATCATTCT CTATTAAGAG TTTACTATGA AAGCATGAAA ACCATAACCT 1663 ||| |||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTACCAATAC ATATCATTCT CTATTAAGAG TTTACTATGA AAGCATGAAA ACCATAACCT 497 ACCTCCACCG AAGATTCGTG ATCAAGCAAG CAAA-TTTTC TCAAAGCTTT GTGTTTTTCC 1604 |||||||||| |||||| ||| |||||||||| |||| ||||| || |||| || ||||| | | ACCTCCACCG AAGATTAGTG ATCAAGCAAG CAAATTTTTC TCCAAGC-TT -TGTTTCT-C 554 CCTTCTCGAT C 1593 |||||||| | | CCTTCTCGTT C 565 hqPGS_C06HBa0153O03.1-7-_SGN-E347579- (2200 2018,1978 1593) ******************************************************************************** EST sequence 98 +strand 686 n (File: SGN-E241789+) 1 ATGTCTGATC TTCAAGGACA TCAAGACATG AATGAGAGAG AATCCAGTCC GAGCTAGGAA 61 CAATAGCTCA CCCTGAAATC TGACGTGATG AAGACTGGTT AGAGTTGCGG TTGAGTTGAA 121 GACGACGGTA CGTTTGCTGC ACTCCACAAT TAACAAAAAG AAAACATAAA AGTAGGGGTC 181 AGTACAAACA CGAGTACTGA GTAGATATCA TCGGCCAACT CAGAATAGAG AACAATATAT 241 ATCAAATAAT AAAATAAAAT CAACCATAAC ACTTAACAGG TGACAACAAC AAGTACCATA 301 ACCATTGGGC ACAACCCAAG AACATCTATG AGGACTCAAG CCTCCACACC ATACTCATTT 361 GGGAAACAGG TTCATTAAAT TGAGTACATT AACATAATTC AAGATTCATT CTTTTTACTA 421 TCGTGGTGTC GGAACGTGAT ACTCCGATCC CCTAATGCTA CGTGTCGGTT CGTGACACCC 481 GATCCCCTAA TACTACGTGT CGGTTCGTTA CACCCGATCT CCTAATACTA CGTGCCGATT 541 CGTGACACCC GATCCATTAA TACTATGTGT CGGTTCGTGA CACCCGATCC ATTAATACTA 601 CGTGTCGGTT CGTGACACCC GATCCCCTAA CCTCATTCTT TTAGTTCATC AAGCCTTCTT 661 TTATACCAAG ACATCATCAT TAACAA Predicted gene structure (within gDNA segment 4456 to 1): Exon 1 2692 2262 ( 431 n); cDNA 1 428 ( 428 n); score: 0.781 Intron 1 2261 2219 ( 43 n); Pd: 0.933 (s: 0.78), Pa: 0.000 (s: 0.92) Exon 2 2218 2157 ( 62 n); cDNA 429 490 ( 62 n); score: 0.935 Intron 2 2156 2087 ( 70 n); Pd: 0.000 (s: 0.96), Pa: 0.000 (s: 0.86) Exon 3 2086 1907 ( 180 n); cDNA 491 670 ( 180 n); score: 0.889 MATCH C06HBa0153O03.1-7- SGN-E241789+ 0.824 673 0.981 C PGS_C06HBa0153O03.1-7-_SGN-E241789+ (2692 2262,2218 2157,2086 1907) Alignment (genomic DNA sequence = upper lines): ATGCCGGAAG TTCAAGG-CA TCAAGACTTG AAGAAGA-AG -ATCCAGTCC AAGCTAGAGG 2636 ||| | || ||||||| || ||||||| || || ||| || ||||||||| |||||| ATGTCTGATC TTCAAGGACA TCAAGACATG AATGAGAGAG AATCCAGTCC GAGCTAGGAA 60 CATTAGCTTA CCCTGAATTT TCGATGTAGT -AAGACTGGC TTGAATTACT GTTGAGTTGA 2577 || ||||| | ||||||| | | || || | |||||||| | || || | |||||||||| CAATAGCTCA CCCTGAAATC T-GACGTGAT GAAGACTGGT TAGAGTTGCG GTTGAGTTGA 119 GGACGATGAC ACGTTTGCTG CACTCCACAA ATAAACAAGA AGAAAACATA AAAGTAGGGG 2517 ||||| | |||||||||| |||||||| | || ||||| | |||||||||| |||||||||| AGACGACGGT ACGTTTGCTG CACTCCAC-A ATTAACAAAA AGAAAACATA AAAGTAGGGG 178 TCAGTACAAA ACACGGGTAC TGAGTAGATA TCATCGGCCA ACTACAAATA GAAAACAATA 2457 ||||||| || ||||| |||| |||||||||| |||||||||| ||| |||| || ||||||| TCAGTAC-AA ACACGAGTAC TGAGTAGATA TCATCGGCCA ACTCAGAATA GAGAACAATA 237 TATACCAAGT AATATCATAA AATCAACTAT GATACTCAAC ATGTAGCAAC AACAAGCACT 2397 |||| ||| | |||| |||| ||||||| || | ||| ||| | || |||| |||||| || TATATCAAAT AATAAAATAA AATCAACCAT AACACTTAAC AGGTGACAAC AACAAGTACC 297 AT-CTCATTA ACAGTTACCG TCAAGTTCAC ACATGAGGAC TCAAGCCTCA ATACCATACT 2338 || |||| ||| |||| || |||||||| ||||||||| | |||||||| ATAACCATTG GGCACAACC- -CAAGAACAT CTATGAGGAC TCAAGCCTCC ACACCATACT 355 CATTTGGGAA TCATGTTCAT TAGATTGAGT ATATTAACAT CTTTCAAGAT TCATGATCTT 2278 |||||||||| || |||||| || ||||||| | |||||||| |||||||| |||| |||| CATTTGGGAA ACAGGTTCAT TAAATTGAGT ACATTAACAT AATTCAAGAT TCAT--TCTT 413 TATTTCTCTT GTGTCGGTAC GTGACACTCC GCTCCCTCAT ATTCATTAAT CCTCTTGTGT 2218 | || || | ||| | | T-TTACTATC GTGGTG.... .......... .......... .......... .........T 429 CGGTACGTGA CACTCCGATC CCCTAAATCT ACGTGTCGGT TCGTGACACC CGATCCCCTA 2158 ||| |||||| ||||||||| |||||| || |||||||||| |||||||||| |||||||||| CGGAACGTGA TACTCCGATC CCCTAATGCT ACGTGTCGGT TCGTGACACC CGATCCCCTA 489 AATCTACGTA TCGGTTCGTG ACACCCGTTC CCCTAAATCT ACATGTCGGT TCGTGACACC 2098 | A......... .......... .......... .......... .......... .......... 490 CGGTCCCCTA ATTCTACGTG TCGGTTCGTG ACACCCAATC CCCTAATTCT ACGTGTCGGT 2038 | ||||||| ||||||||| |||||| ||| |||||| || ||||| || | .......... .TACTACGTG TCGGTTCGTT ACACCCGATC TCCTAATACT ACGTGCCGAT 539 TCGTGACACC CGATCCCCTA ATACTACGTG TCGGTTCATG ACACCCGATC CCCTAATACT 1978 |||||||||| |||||| || |||||| ||| ||||||| || |||||||||| | ||||||| TCGTGACACC CGATCCATTA ATACTATGTG TCGGTTCGTG ACACCCGATC CATTAATACT 599 ACGTGTCGGT TCGTGACACC CGATCCCCTA ATCTCCTTCT ATCAATTCAT CAAGCCTTCT 1918 |||||||||| |||||||||| |||||||||| | ||| |||| | | ||||| |||||||||| ACGTGTCGGT TCGTGACACC CGATCCCCTA ACCTCATTCT TTTAGTTCAT CAAGCCTTCT 659 TTCTTACCAA G 1907 || |||||| | TTTATACCAA G 670 hqPGS_C06HBa0153O03.1-7-_SGN-E241789+ (2692 2262,2218 2157,2086 1907) ******************************************************************************** EST sequence 3 -strand 731 n (File: SGN-E578076-) 1 GCATCATCAA TCCCATTATT TTAGTTCATC ACGCCTTCTT TTATACCAAG GCCTCATTAT 61 GAACAAAGAG ATTAAGATTT TGCAAGATTT GGGATTCAAT AACTTCATCA TGCTTATATA 121 ATCACAATTA TATAGTTACA TTCATGCAAG CATACAATTA AGCACATAGC AGGGTTTACA 181 ATATTATCAA TACATATCAT TCTCTATTAA GAGTTTACTA CGAATATCGT AAGAGAAACC 241 ATAACCTACC TCCACCGAAG AATTGCGATC AACAAGTTAT CTTCTCAAAA TCCTTGCTAT 301 CCTCTTCGTT TCTCTTTCTT TTTCTGTTTT CTCTTTGTTC TTTCTATTTT TCTTATTCAA 361 ACGTCCTAAC GAGCAAAACA GAGAACAAAA CAACCCTAAA AATTTCAACT TTTTTCGGTT 421 TCCCGACTTC CAATTTACCA GAGATATAGA TAATTCACTG AAATTGAACA AGGGTTAAGA 481 GCAGAAGAAA TTTACGTTGT GATTAATTGG GGCAAAGCGT CGAACAGTTG AACTGCAAAT 541 TTGTTCTTCA GTTATAGATA CAAAAGATAG AGTCTTATAT GAGTTAAAGA AGACGTAGAG 601 TATACCCTAG TAAGCGAGCT GACCACGGCG GAATGAGGTG GTGAGTTGGT GGGTTTCGTC 661 GGTCAACCAA GAAATGAAAA GGAAATTGAA GTATGAAAAA CTACAGAAAA ATGACGCGTT 721 TGGCCGAGAA A Predicted gene structure (within gDNA segment 3254 to 1): Exon 1 1906 1554 ( 353 n); cDNA 1 348 ( 348 n); score: 0.830 Intron 1 1553 594 ( 960 n); Pd: 0.968 (s: 0.64), Pa: 0.000 (s: 0.66) Exon 2 593 547 ( 47 n); cDNA 349 394 ( 46 n); score: 0.660 MATCH C06HBa0153O03.1-7- SGN-E578076- 0.830 400 0.547 C PGS_C06HBa0153O03.1-7-_SGN-E578076- (1906 1554,593 547) Alignment (genomic DNA sequence = upper lines): GCATCATCCA TCCCATTATT TTAGTTCATC ACGCCTTTTT TTATACCAAG GTCTCATTAT 1847 |||||||| | |||||||||| |||||||||| ||||||| || |||||||||| | |||||||| GCATCATCAA TCCCATTATT TTAGTTCATC ACGCCTTCTT TTATACCAAG GCCTCATTAT 60 TAACAAAGAG ATTAGGATTT TACAAGATTT GGGATTCAAT AACTTCATCA TGCTTAATAT 1787 ||||||||| |||| ||||| | |||||||| |||||||||| |||||||||| ||||| |||| GAACAAAGAG ATTAAGATTT TGCAAGATTT GGGATTCAAT AACTTCATCA TGCTT-ATAT 119 AATCACAATT ATATAATCAT GTTCATGCAT GCATACAATT AAGCACATAG CAGGGTTTAC 1727 |||||||||| ||||| | | |||||||| |||||||||| |||||||||| |||||||||| AATCACAATT ATATAGTTAC ATTCATGCAA GCATACAATT AAGCACATAG CAGGGTTTAC 179 AATACTATCA ATACATATCA TTCTCTATTA AGAGTTTACT ATGAA-A--G CATGA-AAAC 1671 |||| ||||| |||||||||| |||||||||| |||||||||| | ||| | | | || |||| AATATTATCA ATACATATCA TTCTCTATTA AGAGTTTACT ACGAATATCG TAAGAGAAAC 239 CATAACCTAC CTCCACCGAA GATTCGTGAT CAAGCAAGCA AATTTTCTCA AAGCTTTGTG 1611 |||||||||| |||||||||| || | | ||| ||| |||| || |||||| || | || CATAACCTAC CTCCACCGAA GAATTGCGAT CAA-CAAG-T TATCTTCTCA AA-ATCCTTG 296 TTTTTCCCCT TCTCGATCGT CTCTCTCTCT ATCGATTCCC TTCTCTCTCT TTCTCTTGTT 1551 | ||| | | ||| | | || ||| | | | ||| | || | | ||| |||| || -CTATCCTC- T-TCGTTTCT CTTTCTTTTT CTGTTTTCTC TT-TGT-TCT TTCTATT... 348 CTTTCTATTT TCTTTATTCA AACCCTCTTT CTTTTACCCT AATTAGTATA TAATTAAGAA 1491 .......... .......... .......... .......... .......... .......... 348 TAAAAGATGA CAATAATAGC CCACTAATTA ACTTAAGGTT ACCTCTTTTA TTCCCCCAAG 1431 .......... .......... .......... .......... .......... .......... 348 AAATTGAGTT ATTAATATAG ACCCACGAAA TATATAATTA TAGCAGGAAT AGTCCAAAAC 1371 .......... .......... .......... .......... .......... .......... 348 GCCCCTTTAA AACTTAACCA GAATTCCGAC TTCAACTGGG ATTACGCAAC CTGTGACGGC 1311 .......... .......... .......... .......... .......... .......... 348 CCGTCGCGCC TGCGACGGTC CGTCATGCAG GTTCGTCAGA GATTCGATTT CCTTAAGGAG 1251 .......... .......... .......... .......... .......... .......... 348 TCTGTGACGG CCCGTCGTAC CTACGACGGT CCGTCCTGCA TTTCCGTCAC GACGTTCAGA 1191 .......... .......... .......... .......... .......... .......... 348 GAATCGTTCC CTGTACCAAA TTCTCAAGAG TTGGAGTGTT TTGAAACGGT GGATCACGAC 1131 .......... .......... .......... .......... .......... .......... 348 GGTTCATCGT GCCTGTGACG GTCCGTCCTG CAGGTCCGTC ACAGAGTTCA GAGAGTCAAT 1071 .......... .......... .......... .......... .......... .......... 348 TTCAGCACCC AAATTTCAGA ATTTCTAAGT GTTTTGGGAC GAAACACCCT CGACGGTCCG 1011 .......... .......... .......... .......... .......... .......... 348 TCGTGCCCAT GACGTTCCGT CATGCCCATG ACGTTCCGTC GTGGGTTCCG TCGTCTCAGC 951 .......... .......... .......... .......... .......... .......... 348 CTGTTTTTCC AGAAATAAAA TCTGCTGCTC AAAACAACTA AACAGGTCGT TACAAAATAT 891 .......... .......... .......... .......... .......... .......... 348 TTTTTATAAA TATTTTGACT TTTTATCTTA TTAATTTTTA TATTTTTTTA ATCTAGCTAT 831 .......... .......... .......... .......... .......... .......... 348 TTAATTTTTC TTAATTATTA TTATTATTAT TATTTTATAA CAAAAAAAAT AATTAAAAGA 771 .......... .......... .......... .......... .......... .......... 348 AATTAACCTA CCCATTATCC CACCTTCACT TCACCATTTC TCTCCATTCA CCCCATACCC 711 .......... .......... .......... .......... .......... .......... 348 CACTCCACTA ATCCAATCTC ACATACACAC ATACACACAT ATAAATATAT ATAAATTAAT 651 .......... .......... .......... .......... .......... .......... 348 AATGAGAGGA GAAGAGAAAA GAAGAAAAAA TTGTACAAAG TAAAAGAGAA GAAAAAATTT 591 ||| .......... .......... .......... .......... .......... .......TTT 351 CAGAAATCCA AAGAGAAAAT CAGCAAAAAA GAGGAGAAAA AAAC 547 | | || | | | || ||||||| | ||| | |||| ||| C-TTATTCAA ACGTCCTAAC GAGCAAAACA GAGAACAAAA CAAC 394 hqPGS_C06HBa0153O03.1-7-_SGN-E578076- (1906 1554) ******************************************************************************** EST sequence 35 -strand 717 n (File: SGN-E349726-) 1 TTATGTTCAT TAGATTGAGT ATATAACATC TTTCAAGATT CATTGTCTTT ATTTCTCTTG 61 TGTCGGTACG TGACATTCCG CTCCNTCATA TTCATTAATC TTCTTGTGTC GGTACGTGAT 121 ACTCTGATCC CCTAAATCTA CGTGTCGGAA CGTGACACTC CGATCCCCTA AATCTACGTG 181 TCGGTTCGTG ACACCTGATC CCCTAAATCT ACGTGTCGGT TCGTGACACC CGATCCCCTA 241 AATCTACGTG TCGGTTCGTG ACACCCGATC CCCTAAATCT ACGTGTCGGT TCGTGACACC 301 CGATCCCCTA ATCTATGTGT TGGTTCGTGA CACCTGATCC CTTAATCTAC GTGTCGGTTT 361 GTGACACCCG ATCCCCTAAT TCTACGTGTC AGTTCGTGAC ACCCGATCCC CTAATCTCAT 421 TCTATCAATT CATCAAGCCT TCTCCCTTAC CAAGGCATCA TCAATCTCAT TACTTTAGTT 481 CATCAAGCCT TCTCCCTTAC CAAGGCATCA TCATTAAAAA GAGATTAGGT TTTTACAAGA 541 TTTGGGATTC AATAACTTCA TCATGCTTAT ATAATAACAA TTATATAGTT ACATTCATGC 601 AAGCATACAA TTAAGCACAT AGCAGGGTTT ACAATATTAT CAATACATAT CATTCTCTAT 661 TAAGAGTTTA CTACGAATAT CGTAAGAGAA ACAATAACCT ACCTCCACCG AAGACTA Predicted gene structure (within gDNA segment 2955 to 646): Exon 1 2287 1687 ( 601 n); cDNA 86 672 ( 587 n); score: 0.894 MATCH C06HBa0153O03.1-7- SGN-E349726- 0.894 601 0.838 C PGS_C06HBa0153O03.1-7-_SGN-E349726- (2287 1687) Alignment (genomic DNA sequence = upper lines): TCATGATCTT TATTTCTCTT GTGTCGGTAC GTGACACTCC GCTCCCTCAT ATTCATTAAT 2228 |||| || | || | |||| |||||||||| |||| |||| | |||| | | | ||| TCATATTCAT TAATCTTCTT GTGTCGGTAC GTGATACTCT GATCCC-C-- --T-A--AAT 137 CCTCTTGTGT CGGTACGTGA CACTCCGATC CCCTAAATCT ACGTGTCGGT TCGTGACACC 2168 || |||| ||| |||||| |||||||||| |||||||||| |||||||||| |||||||||| -CT-ACGTGT CGGAACGTGA CACTCCGATC CCCTAAATCT ACGTGTCGGT TCGTGACACC 195 CGATCCCCTA AATCTACGTA TCGGTTCGTG ACACCCGTTC CCCTAAATCT ACATGTCGGT 2108 ||||||||| ||||||||| |||||||||| ||||||| || |||||||||| || ||||||| TGATCCCCTA AATCTACGTG TCGGTTCGTG ACACCCGATC CCCTAAATCT ACGTGTCGGT 255 TCGTGACACC CGGTCCCCTA ATTCTACGTG TCGGTTCGTG ACACCCAATC CCCTAATTCT 2048 |||||||||| || ||||||| | |||||||| |||||||||| |||||| ||| |||||| ||| TCGTGACACC CGATCCCCTA AATCTACGTG TCGGTTCGTG ACACCCGATC CCCTAA-TCT 314 ACGTGTCGGT TCGTGACACC CGATCCCCTA ATACTACGTG TCGGTTCATG ACACCCGATC 1988 | |||| ||| |||||||||| |||||| || || ||||||| |||||| || |||||||||| ATGTGTTGGT TCGTGACACC TGATCCCTTA AT-CTACGTG TCGGTTTGTG ACACCCGATC 373 CCCTAATACT ACGTGTCGGT TCGTGACACC CGATCCCCTA ATCTCCTTCT ATCAATTCAT 1928 ||||||| || ||||||| || |||||||||| |||||||||| ||||| |||| |||||||||| CCCTAATTCT ACGTGTCAGT TCGTGACACC CGATCCCCTA ATCTCATTCT ATCAATTCAT 433 CAAGCCTTCT TTCTTACCAA GGCATCATCC ATCCCATTAT TTTAGTTCAT CACGCCTTTT 1868 |||||||||| |||||||| ||||||||| ||| ||||| |||||||||| || ||||| | CAAGCCTTCT CCCTTACCAA GGCATCATCA ATCTCATTAC TTTAGTTCAT CAAGCCTTCT 493 TTTATACCAA GGTCTCATTA TTAACAAAGA GATTAGGATT TTACAAGATT TGGGATTCAA 1808 |||||| || |||| | |||| ||||| ||||||| || |||||||||| |||||||||| CCCTTACCAA GGCATCATCA TTAA-AAAGA GATTAGGTTT TTACAAGATT TGGGATTCAA 552 TAACTTCATC ATGCTTAATA TAATCACAAT TATATAATCA TGTTCATGCA TGCATACAAT 1748 |||||||||| |||||| ||| |||| ||||| |||||| | | |||||||| ||||||||| TAACTTCATC ATGCTT-ATA TAATAACAAT TATATAGTTA CATTCATGCA AGCATACAAT 611 TAAGCACATA GCAGGGTTTA CAATACTATC AATACATATC ATTCTCTATT AAGAGTTTAC 1688 |||||||||| |||||||||| ||||| |||| |||||||||| |||||||||| |||||||||| TAAGCACATA GCAGGGTTTA CAATATTATC AATACATATC ATTCTCTATT AAGAGTTTAC 671 T 1687 | T 672 hqPGS_C06HBa0153O03.1-7-_SGN-E349726- (2287 1687) ******************************************************************************** EST sequence 31 -strand 402 n (File: SGN-E357559-) 1 TGTGTTGGTT CGTGACACCT GATCCCTTAA TCTACGTGTC GGTTTGTGAC ACCCGATCCC 61 CTAATTCTAC GTGTCAGTTC GTGACACCCG ATCCCCTAAT CTCATTCTAT CAATTCATCA 121 AGCCTTCTCC CTTACCAAGG CATCATCAAT CTCATTACTT TAGTTCATCA AGCCTTCTCC 181 CTTACCAAGG CATCATCATT AAAAAGAGAT TAGGTTTTTA CAAGATTTGG GATTCAATAA 241 CTTCATCATG CTTATATAAT AACAATTATA TAGTTACATT CATGCAAGCA TACAATTAAG 301 CACATAGCAG GGTTTACAAT ATTATCAATA CATATCATTC TCTATTAAGA GTTTACTACG 361 AATATCGTAA GAGAAACAAT AACCTACCTC CACCGAAGAC TA Predicted gene structure (within gDNA segment 3082 to 646): Exon 1 2045 1687 ( 359 n); cDNA 2 357 ( 356 n); score: 0.908 MATCH C06HBa0153O03.1-7- SGN-E357559- 0.908 359 0.893 C PGS_C06HBa0153O03.1-7-_SGN-E357559- (2045 1687) Alignment (genomic DNA sequence = upper lines): GTGTCGGTTC GTGACACCCG ATCCCCTAAT ACTACGTGTC GGTTCATGAC ACCCGATCCC 1986 |||| ||||| |||||||| | ||||| |||| ||||||||| |||| |||| |||||||||| GTGTTGGTTC GTGACACCTG ATCCCTTAAT -CTACGTGTC GGTTTGTGAC ACCCGATCCC 60 CTAATACTAC GTGTCGGTTC GTGACACCCG ATCCCCTAAT CTCCTTCTAT CAATTCATCA 1926 ||||| |||| ||||| |||| |||||||||| |||||||||| ||| |||||| |||||||||| CTAATTCTAC GTGTCAGTTC GTGACACCCG ATCCCCTAAT CTCATTCTAT CAATTCATCA 120 AGCCTTCTTT CTTACCAAGG CATCATCCAT CCCATTATTT TAGTTCATCA CGCCTTTTTT 1866 |||||||| |||||||||| ||||||| || | ||||| || |||||||||| ||||| | AGCCTTCTCC CTTACCAAGG CATCATCAAT CTCATTACTT TAGTTCATCA AGCCTTCTCC 180 TATACCAAGG TCTCATTATT AACAAAGAGA TTAGGATTTT ACAAGATTTG GGATTCAATA 1806 |||||||| |||| ||| || ||||||| ||||| |||| |||||||||| |||||||||| CTTACCAAGG CATCATCATT AA-AAAGAGA TTAGGTTTTT ACAAGATTTG GGATTCAATA 239 ACTTCATCAT GCTTAATATA ATCACAATTA TATAATCATG TTCATGCATG CATACAATTA 1746 |||||||||| |||| ||||| || ||||||| |||| | | |||||||| | |||||||||| ACTTCATCAT GCTT-ATATA ATAACAATTA TATAGTTACA TTCATGCAAG CATACAATTA 298 AGCACATAGC AGGGTTTACA ATACTATCAA TACATATCAT TCTCTATTAA GAGTTTACT 1687 |||||||||| |||||||||| ||| |||||| |||||||||| |||||||||| ||||||||| AGCACATAGC AGGGTTTACA ATATTATCAA TACATATCAT TCTCTATTAA GAGTTTACT 357 hqPGS_C06HBa0153O03.1-7-_SGN-E357559- (2045 1687) ******************************************************************************** EST sequence 2 -strand 774 n (File: SGN-E349977-) 1 AGTAGATATC ATCGCTAACT CAAAATAGGG AACAATATAT ATCAATAATA ATGTAAATCA 61 ACTACAATAC TCATCATGTA GCAATAGCAA TTTCTTNATC ATTAACAATT ACCGTCAAGT 121 TCACACATGA GGACTCAAGC CTCAATACCA TACTCATTTG GGAATTAAGT TCATTAGATT 181 GAGTATATTC ATTATCTTTC AAGATTCATT ATCTTTCTTC CTCTTGTGTC GGTACGTGAC 241 ACTCCGCTCC TCTATTTCTA TCCTGGTGCC GGAACGTGGC ACTCCGATCC TCATATTCAT 301 TCTATCCTGG TACCGGAACG TGGCACCCGA TCCTCATATT CTATCCTGGT GTCGGAACGT 361 AACACTCCGA TCCTCATATT CATTCTATCC TGGTACCGGA ACGTGGCACC CGATCCTCAT 421 ATTCTATCCT GGTGTCGGAA CGTGACACTC CGATCCTCAT ATTCTATCCT GGTGTCGGAA 481 CGTGACACTC CGATCCTCAT ATTCATTCTA TCCTGGTACC GAAACGTGGC ACCCGATCCC 541 CTAATTCATC AAGCCTTCTT CTACACTAAG GCATCATCAT TCTCATTATA TAATTTATCA 601 AGCCTTCTCT CATACTAAGG CCTCATCAAT CTTATTATAT AATATATCAA GTGAATTAGG 661 GTTCTTTCAA GATTTGGGAT TCAATAGCTT CATCATGCTT TGTTAATTCA TAACAATTTC 721 ATAATCATAA TCATGCAAGC ATACCAATAA GCATATAGAC AGGTTTACAA CATC Predicted gene structure (within gDNA segment 4073 to 1): Exon 1 2494 1850 ( 645 n); cDNA 1 626 ( 626 n); score: 0.692 MATCH C06HBa0153O03.1-7- SGN-E349977- 0.692 645 0.833 C PGS_C06HBa0153O03.1-7-_SGN-E349977- (2494 1850) Alignment (genomic DNA sequence = upper lines): AGTAGATATC ATCGGCCAAC TACAAATAGA AAACAATATA TACCAAGTAA TATCATAAAA 2435 |||||||||| ||| || ||| | |||||| ||||||||| || ||| ||| || | ||| AGTAGATATC ATC-GCTAAC TCAAAATAGG GAACAATATA TATCAA-TAA TAATGT-AAA 57 TCAACTATGA TACTCAACAT GTAGCAACAA CAAGCACTAT CTCATTAACA GTTACCGTCA 2375 ||||||| | |||||| ||| ||||||| | ||| || ||||||||| ||||||||| TCAACTACAA TACTCATCAT GTAGCAATAG CAATTTCTTN ATCATTAACA ATTACCGTCA 117 AGTTCACACA TGAGGACTCA AGCCTCAATA CCATACTCAT TTGGGAATCA TGTTCATTAG 2315 |||||||||| |||||||||| |||||||||| |||||||||| |||||||| | ||||||||| AGTTCACACA TGAGGACTCA AGCCTCAATA CCATACTCAT TTGGGAATTA AGTTCATTAG 177 ATTGAGTATA TT-AACATCT TTCAAGATTC ATGATCTTTA TTTCTCTTGT GTCGGTACGT 2256 |||||||||| || | |||| |||||||||| || |||||| || ||||||| |||||||||| ATTGAGTATA TTCATTATCT TTCAAGATTC ATTATCTTTC TTCCTCTTGT GTCGGTACGT 237 GACACTCCGC TCCCTCATAT TCATTAATCC TCTTGTGTCG GTACGTGACA CTCCGATCCC 2196 |||||||||| | |||| ||| | | || | || ||| || | ||||| || ||||||||| GACACTCCGC T-CCTC-TAT T--TCTAT-C -CTGGTGCCG GAACGTGGCA CTCCGATCCT 291 CTAAATCTAC GTGTCGGTTC GTGACACCCG ATCCCCTAAA TCTACGTATC GGTTCGTGAC 2136 | | || | | | || || || ||| | | | | | | || | | | C-ATAT-T-C AT-TC-TATC CTGGTACCGG AACGTGGCAC CCGA-TCCTC ATATTCT-A- 343 ACCCGTTCCC CTAAATCTAC ATGTCGGTTC GTGACACCCG GTCCCCTAAT TCTACGTGTC 2076 || | | | || || | || | || | | | | | || || || || | TCCTGGTGTC GGAACGTAAC A-CTCCGATC CTCATATTC- AT--TCT-AT CCT-GGTACC 397 GGTTCGTGAC ACCCAATCCC CTAATTCTA- -C--GTGTCG GTTCGTGACA C-CCGATCCC 2021 || |||| | |||| |||| | |||||| | |||||| | ||||||| | ||||||| GGAACGTGGC ACCCGATCCT CATATTCTAT CCTGGTGTCG GAACGTGACA CTCCGATCCT 457 CTAATACTA- -C--GTGTCG GTTCATGACA C-CCGATCCC CTAATACTAC GTGTCGGTTC 1966 | || ||| | |||||| | | ||||| | ||||| | || ||| | | | || || CATATTCTAT CCTGGTGTCG GAACGTGACA CTCCGAT--C CTCATA-TTC AT-TCTA-TC 512 GTGACACCCG ATCCCCTAAT CTCCTTCTAT CAATTCATCA AGCCTTCTTT CTTACCAAGG 1906 || | ||| | | | | || ||||||||| ||||||||| || |||| CTGGTA-CCG AAACGTGGCA CCCGATCCCC TAATTCATCA AGCCTTCTTC TACACTAAGG 571 CATCATCCAT CCCATTATTT TAGTTCATCA CGCCTTTTTT TATACCAAGG TCTCAT 1850 ||||||| | | ||||| | || || |||| ||||| | | |||| |||| ||||| CATCATCATT CTCATTA-TA TAATTTATCA AGCCTTCTCT CATACTAAGG CCTCAT 626 hqPGS_C06HBa0153O03.1-7-_SGN-E349977- (2494 1850) ******************************************************************************** EST sequence 128 +strand 337 n (File: SGN-E357033+) 1 ACTGAATAGA TATCATCGCC CAACTCAAAA TAGAAATCAA TATATATCAA GNATTATCAT 61 AAAATCAACT ATGATACTCA ACATGTAGCA ACAACAAGCA CTATATCATT AACAATTACC 121 GTCACGTTCA CACATGAGGA CTCAAGCCTC AATACCATAC TCATTTGGGA ATCATGTTCA 181 TTAGATTTAG TATATTAACA TCTTTCAAGA TTCATTATCT TTATTTCTCT TGTGACGGAA 241 CATGACATTC CGCTCCCTCA TATTCATTAA TCCTCTTGTG TGAACACATG ACACTCCGAT 301 CCCCTAAATC TACATGACAG TTTCATGAAC GCTATCC Predicted gene structure (within gDNA segment 3584 to 1336): Exon 1 2498 2162 ( 337 n); cDNA 1 337 ( 337 n); score: 0.907 MATCH C06HBa0153O03.1-7- SGN-E357033+ 0.907 337 1.000 C PGS_C06HBa0153O03.1-7-_SGN-E357033+ (2498 2162) Alignment (genomic DNA sequence = upper lines): ACTGAGTAGA TATCATCGGC CAACTACAAA TAGAAAACAA TATATACCAA GTAATATCAT 2439 ||||| |||| |||||||| | ||||| ||| |||||| ||| |||||| ||| | | |||||| ACTGAATAGA TATCATCGCC CAACTCAAAA TAGAAATCAA TATATATCAA GNATTATCAT 60 AAAATCAACT ATGATACTCA ACATGTAGCA ACAACAAGCA CTATCTCATT AACAGTTACC 2379 |||||||||| |||||||||| |||||||||| |||||||||| |||| ||||| |||| ||||| AAAATCAACT ATGATACTCA ACATGTAGCA ACAACAAGCA CTATATCATT AACAATTACC 120 GTCAAGTTCA CACATGAGGA CTCAAGCCTC AATACCATAC TCATTTGGGA ATCATGTTCA 2319 |||| ||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTCACGTTCA CACATGAGGA CTCAAGCCTC AATACCATAC TCATTTGGGA ATCATGTTCA 180 TTAGATTGAG TATATTAACA TCTTTCAAGA TTCATGATCT TTATTTCTCT TGTGTCGGTA 2259 ||||||| || |||||||||| |||||||||| ||||| |||| |||||||||| |||| ||| | TTAGATTTAG TATATTAACA TCTTTCAAGA TTCATTATCT TTATTTCTCT TGTGACGGAA 240 CGTGACACTC CGCTCCCTCA TATTCATTAA TCCTCTTGTG TCGGTACGTG ACACTCCGAT 2199 | ||||| || |||||||||| |||||||||| |||||||||| | || || |||||||||| CATGACATTC CGCTCCCTCA TATTCATTAA TCCTCTTGTG TGAACACATG ACACTCCGAT 300 CCCCTAAATC TACGTGTC-G GTTCGTGACA CCCGATCC 2162 |||||||||| ||| || | | ||| ||| | | | |||| CCCCTAAATC TACATGACAG TTTCATGA-A CGCTATCC 337 hqPGS_C06HBa0153O03.1-7-_SGN-E357033+ (2498 2162) ******************************************************************************** EST sequence 32 -strand 239 n (File: SGN-E391780-) 1 ATCAACAAAT ACTATATCAT TAACAATTAC CGTCAAGTTC ACACATGAGG ACTCAAGCCT 61 CAATACCATA CTCATTTGGG AATCATGTTC ATTAGATTGA GTATATTAAC ATCTTTCAAG 121 ATTCATTATC TTTATTTCTC TTGTGTCGGT ACGTGACACT CCGCTCCCTC AATATTCATT 181 AATCCTCTTG TGTCGGTACG TGACACTCCG ATCCCCTAAA TCTATATGTC GGTTTGTGA Predicted gene structure (within gDNA segment 3253 to 1437): Exon 1 2407 2172 ( 236 n); cDNA 3 239 ( 237 n); score: 0.956 MATCH C06HBa0153O03.1-7- SGN-E391780- 0.956 236 0.987 C PGS_C06HBa0153O03.1-7-_SGN-E391780- (2407 2172) Alignment (genomic DNA sequence = upper lines): CAACAAGCAC TATCTCATTA ACAGTTACCG TCAAGTTCAC ACATGAGGAC TCAAGCCTCA 2348 |||||| || ||| |||||| ||| |||||| |||||||||| |||||||||| |||||||||| CAACAAATAC TATATCATTA ACAATTACCG TCAAGTTCAC ACATGAGGAC TCAAGCCTCA 62 ATACCATACT CATTTGGGAA TCATGTTCAT TAGATTGAGT ATATTAACAT CTTTCAAGAT 2288 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATACCATACT CATTTGGGAA TCATGTTCAT TAGATTGAGT ATATTAACAT CTTTCAAGAT 122 TCATGATCTT TATTTCTCTT GTGTCGGTAC GTGACACTCC GCTCCCTC-A TATTCATTAA 2229 |||| ||||| |||||||||| |||||||||| |||||||||| |||||||| | |||||||||| TCATTATCTT TATTTCTCTT GTGTCGGTAC GTGACACTCC GCTCCCTCAA TATTCATTAA 182 TCCTCTTGTG TCGGTACGTG ACACTCCGAT CCCCTAAATC TACGTGTCGG TTCGTGA 2172 |||||||||| |||||||||| |||||||||| |||||||||| || |||||| || |||| TCCTCTTGTG TCGGTACGTG ACACTCCGAT CCCCTAAATC TATATGTCGG TTTGTGA 239 hqPGS_C06HBa0153O03.1-7-_SGN-E391780- (2407 2172) ******************************************************************************** EST sequence 27 -strand 481 n (File: SGN-E246710-) 1 AGTCCAAGCT AGAAGCATTA GCTCACCCTG AATTTCCGAT GTAGTAAGAC TGGCTTGAAT 61 TACTGTTGAG TTGAACACGA TGGCACGTTT GCTGCACTCC ACAAATAAAC AAGAAGAGAA 121 CATAAAAGTA GGGGTCAGTA CAAAACACGG GTACTGAGTA GATATCATCG GCCAACTCAA 181 AATAGAAATC AATATATATA CCAAGTAATA TCATAAAATC AACTATGATA CTCAACATGT 241 AGCAACAACA AATACTATAT CATTAACAAT TACCGTCAAG TTCACACATG AGGACTCAAG 301 CCTCAATACC ATACTCATTT GGGAATCATG TTCATTAGAT TGAGTATATT AACATCTTTC 361 AAGATTCATT ATCTTTATTT CTCTTGTGTC GGTACGTGAC ACTCCGCTCC CTCAATATTC 421 ATTAATCCTC TTGTGTCGGT ACGTGACACT CCGATCCCCT AAATCTATAT GTCGGTTTGT 481 G Predicted gene structure (within gDNA segment 3584 to 1447): Exon 1 2650 2173 ( 478 n); cDNA 1 481 ( 481 n); score: 0.947 MATCH C06HBa0153O03.1-7- SGN-E246710- 0.947 478 0.994 C PGS_C06HBa0153O03.1-7-_SGN-E246710- (2650 2173) Alignment (genomic DNA sequence = upper lines): AGTCCAAGCT AGAGGCATTA GCTTACCCTG AATTTTCGAT GTAGTAAGAC TGGCTTGAAT 2591 |||||||||| ||| |||||| ||| |||||| ||||| |||| |||||||||| |||||||||| AGTCCAAGCT AGAAGCATTA GCTCACCCTG AATTTCCGAT GTAGTAAGAC TGGCTTGAAT 60 TACTGTTGAG TTGAGGACGA TGACACGTTT GCTGCACTCC ACAAATAAAC AAGAAGAAAA 2531 |||||||||| |||| |||| || ||||||| |||||||||| |||||||||| ||||||| || TACTGTTGAG TTGAACACGA TGGCACGTTT GCTGCACTCC ACAAATAAAC AAGAAGAGAA 120 CATAAAAGTA GGGGTCAGTA CAAAACACGG GTACTGAGTA GATATCATCG GCCAACTACA 2471 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||| | CATAAAAGTA GGGGTCAGTA CAAAACACGG GTACTGAGTA GATATCATCG GCCAACTCAA 180 AATAGAAAAC -A-ATATATA CCAAGTAATA TCATAAAATC AACTATGATA CTCAACATGT 2413 |||||||| | | ||||||| |||||||||| |||||||||| |||||||||| |||||||||| AATAGAAATC AATATATATA CCAAGTAATA TCATAAAATC AACTATGATA CTCAACATGT 240 AGCAACAACA AGCACTATCT CATTAACAGT TACCGTCAAG TTCACACATG AGGACTCAAG 2353 |||||||||| | ||||| | |||||||| | |||||||||| |||||||||| |||||||||| AGCAACAACA AATACTATAT CATTAACAAT TACCGTCAAG TTCACACATG AGGACTCAAG 300 CCTCAATACC ATACTCATTT GGGAATCATG TTCATTAGAT TGAGTATATT AACATCTTTC 2293 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCTCAATACC ATACTCATTT GGGAATCATG TTCATTAGAT TGAGTATATT AACATCTTTC 360 AAGATTCATG ATCTTTATTT CTCTTGTGTC GGTACGTGAC ACTCCGCTCC CTC-ATATTC 2234 ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||| |||||| AAGATTCATT ATCTTTATTT CTCTTGTGTC GGTACGTGAC ACTCCGCTCC CTCAATATTC 420 ATTAATCCTC TTGTGTCGGT ACGTGACACT CCGATCCCCT AAATCTACGT GTCGGTTCGT 2174 |||||||||| |||||||||| |||||||||| |||||||||| ||||||| | ||||||| || ATTAATCCTC TTGTGTCGGT ACGTGACACT CCGATCCCCT AAATCTATAT GTCGGTTTGT 480 G 2173 | G 481 hqPGS_C06HBa0153O03.1-7-_SGN-E246710- (2650 2173) ******************************************************************************** EST sequence 137 +strand 730 n (File: SGN-E546506+) 1 TTTTTTTTTT TTTTTTTTAA TAAAAACAAT TCAATACTAT TATTATTATC CCCAAAATCT 61 GGAAGTCATC ATCACAAGAA CATCTATCTC AAATTACTTA ACTAGGAATG TCTAAGAACA 121 AAATAACTAA AAAGCTAGTC CATGCCGGAA ATTCAAGGCA TCAAGACTTG AAGAAGAAGA 181 CCCAGTCCAA GCTAGACGCA TTAGCTCACC CTGAATTTTC CGATGAAGTG AAGACTGGCT 241 AGATCTACTG TTGAGTTGAA GTTGACGGAA CGTTTGCTGC ATTACACAAA TAACAAAGAG 301 GAAAACATGA AAGTAGGGGT CAGTACAACC ACACGTACTG AGTAGATATC ATCGGCCAAC 361 TCAAAATAGG GAACAGTATA TATCAATAAT AATGTAAATC AACTACAATA CTCAACATGT 421 AGCAATAACA CCATGAATTC ATCAATAACT ACAACCGAGT TCACACATGA GGACTCAAGC 481 CTCAATACCA TACTCATTTG GGAATTAAGT TCATTAGATT GAGTATATTC ATTATCTTTC 541 AAGATTCATT ATCTTTCTTC CTCTTGTGTC GGTACGTGAC ACTCCGATCC TCTATTTCTA 601 TCCTGGTGCC GGAACGTGGC ACTCCGATCC TCATTCTATC CTGGTACCGG AACGTGGCAC 661 CCGATCCATT TTCTATCCTG GTGTCGGAAC GTGACACTCC GATCCTCATA TTCTATCCTG 721 GTACCGGAAC Predicted gene structure (within gDNA segment 3887 to 197): Exon 1 2787 2197 ( 591 n); cDNA 51 630 ( 580 n); score: 0.837 PPA cDNA 18 1 MATCH C06HBa0153O03.1-7- SGN-E546506+ 0.837 591 0.810 C PGS_C06HBa0153O03.1-7-_SGN-E546506+ (2787 2197) Alignment (genomic DNA sequence = upper lines): CCCAAAATCT GGAAGTCATC ATCACAAGAA CATCTACGAT CAAATGACTA AACTAAGAGT 2728 |||||||||| |||||||||| |||||||||| |||||| | ||||| ||| ||||| || | CCCAAAATCT GGAAGTCATC ATCACAAGAA CATCTA-TCT CAAATTACTT AACTAGGAAT 109 ATTCTAAAAG CTAAAAATAC ATAAGAAGTT AGTCCATGCC GGAAGTTCAA GGCATCAAGA 2668 ||||| | | |||||| ||| ||| | |||||||||| |||| ||||| |||||||||| -GTCTAAGAA C--AAAATAA CTAAAAAGCT AGTCCATGCC GGAAATTCAA GGCATCAAGA 166 CTTGAAGAAG AAGATCCAGT CCAAGCTAGA GGCATTAGCT TACCCTGAAT TTT-CGATGT 2609 |||||||||| |||| ||||| |||||||||| ||||||||| ||||||||| ||| ||||| CTTGAAGAAG AAGACCCAGT CCAAGCTAGA CGCATTAGCT CACCCTGAAT TTTCCGATGA 226 AGT-AAGACT GGCTTGAATT ACTGTTGAGT TGAGGACGAT GACACGTTTG CTGCACTCCA 2550 ||| |||||| |||| || | |||||||||| ||| | || | ||||||| ||||| | || AGTGAAGACT GGCTAGATCT ACTGTTGAGT TGAAGTTGAC GGAACGTTTG CTGCATTACA 286 CAAATAAACA AGAAGAAAAC ATAAAAGTAG GGGTCAGTAC AAAACACGGG TACTGAGTAG 2490 ||||||| | ||| |||||| || ||||||| |||||||||| || ||| | |||||||||| CAAATAACAA AGAGGAAAAC ATGAAAGTAG GGGTCAGTAC -AACCACACG TACTGAGTAG 345 ATATCATCGG CCAACTACAA ATAGAAAACA ATATATACCA AGTAATATCA TAAAATCAAC 2430 |||||||||| |||||| || |||| |||| |||||| || | ||||| | |||||||| ATATCATCGG CCAACTCAAA ATAGGGAACA GTATATATCA A-TAATAATG T-AAATCAAC 403 TATGATACTC AACATGTAGC AACAACAAGC ACTATCTCAT TAACAGTTAC CGTCAAGTTC 2370 || |||||| |||||||||| || ||| | | | | |||| || | ||| | ||||| TACAATACTC AACATGTAGC AATAAC-ACC ATGAATTCAT CAATAACTAC AACCGAGTTC 462 ACACATGAGG ACTCAAGCCT CAATACCATA CTCATTTGGG AATCATGTTC ATTAGATTGA 2310 |||||||||| |||||||||| |||||||||| |||||||||| ||| | |||| |||||||||| ACACATGAGG ACTCAAGCCT CAATACCATA CTCATTTGGG AATTAAGTTC ATTAGATTGA 522 GTATATT-AA CATCTTTCAA GATTCATGAT CTTTATTTCT CTTGTGTCGG TACGTGACAC 2251 ||||||| | ||||||||| ||||||| || |||| || || |||||||||| |||||||||| GTATATTCAT TATCTTTCAA GATTCATTAT CTTTCTTCCT CTTGTGTCGG TACGTGACAC 582 TCCGCTCCCT CATATTCATT AATCCTCTTG TGTCGGTACG TGACACTCCG ATCC 2197 |||| | ||| | |||| | || | || | || ||| ||| || ||||||| |||| TCCGAT-CCT C-TATT--TC TAT-C-CTGG TGCCGGAACG TGGCACTCCG ATCC 630 hqPGS_C06HBa0153O03.1-7-_SGN-E546506+ (2787 2197) ******************************************************************************** EST sequence 36 -strand 236 n (File: SGN-E209683-) 1 CACAAATAAC AAGAAGATAA ACATAAAAGT AGGGGTCAGT ACAAACCACG GGTACTGAGT 61 AGATATCATC GGCCAACTCA AAATAGGGAA CAGTATGTAT TAAGCAATAT CATAAAATCA 121 ACTAATATCC TTAACATGCA GCATTTATAG TTACCATAAC CCTTGGTTAC AACACCAAGC 181 ACATCAATGA GGACTCACAC CTCCTCATCA TACTCATTTG GGAATTTAGT TCATTA Predicted gene structure (within gDNA segment 3323 to 294): Exon 1 2551 2316 ( 236 n); cDNA 1 236 ( 236 n); score: 0.769 MATCH C06HBa0153O03.1-7- SGN-E209683- 0.769 236 1.000 C PGS_C06HBa0153O03.1-7-_SGN-E209683- (2551 2316) Alignment (genomic DNA sequence = upper lines): CACAAATAAA CAAGAAGA-A AACATAAAAG TAGGGGTCAG TACAAAACAC GGGTACTGAG 2493 ||||||| || |||||||| | |||||||||| |||||||||| |||||| ||| |||||||||| CACAAAT-AA CAAGAAGATA AACATAAAAG TAGGGGTCAG TACAAACCAC GGGTACTGAG 59 TAGATATCAT CGGCCAACTA CAAATAGAAA ACAATATATA CCAAGTAATA TCATAAAATC 2433 |||||||||| ||||||||| |||||| | ||| ||| || ||| |||| |||||||||| TAGATATCAT CGGCCAACTC AAAATAGGGA ACAGTATGTA TTAAGCAATA TCATAAAATC 119 AACTATGATA CTCAACATGT AGCAACAACA AGCACTATCT CATTAACAGT TACCGTCAAG 2373 ||||| || || |||||| |||| | | || || | | | | |||| AACTAATATC CTTAACATGC AGCATTTATA GTTACCATAA CCCTTGGTTA CAACACCAAG 179 TTCACACATG AGGACTCAAG CCTCAATACC ATACTCATTT GGGAATCATG TTCATTA 2316 || ||| |||||||| |||| | | |||||||||| |||||| | ||||||| CACATCAATG AGGACTCACA CCTCCTCATC ATACTCATTT GGGAATTTAG TTCATTA 236 hqPGS_C06HBa0153O03.1-7-_SGN-E209683- (2551 2316) ******************************************************************************** EST sequence 18 -strand 729 n (File: SGN-E351546-) 1 AGTCGTTGCT CTAGTTCTAC CCATCTGGCA AGAGAGTGAG NATGGTCAGA TACCAATTCG 61 TATCGCTTAG ATACCAATTG ACTCGAAGTA GTAGCACGAA AGAAAGAATG AAAGAGTGAA 121 GTTTTCCTAA AGTCTTATAG CCTCTCAAGG AAAAGTAAAA GCGTCCCCCT ACCGTTCCTT 181 AAGACTCTAC TAGACTTGTT CTTGTGTGAT GAGACCAACG AACCTAATGC TCTGATACCA 241 AGTTTTGTCA CGACCCAAAT CCGGGCCGCC ACTGGCACCC ACACTTACCC TCCTATGTGA 301 GCGAACCAAC CAATCTAAAC CTTAACATTT CAATGTAATA GCAACAGAAA GTAATGCGGA 361 AGACTTAAAC TCATTAATAA AATCAATAAC TACTATTATT AAACATCTAT TATTCCCAAA 421 ACCTGGAAGT CATCATCACA AGAACATCTA CTTTAAACTA CTAATTCTAA GAGTTTCTAA 481 GAAGCTAAAA AATTACATAA GAAGCTAGTC CATGCCGGAA GTTCAAGGCA TCAAGACATG 541 AAGGAGAAGA TCCAGTCCAA GCTAGAAGCG TTAGCTCACC CTGAAGATCC GGTGTGACGA 601 AGACTGGCTT GAGTTACTGT TGAGTCGAAG ATGACGGCAC GTTTGCTGCA CTCCACAAAT 661 ACCAAGAAGA AAAACATAAA AGTAGGGGTC AGTACAAAAC ACGGCTACTG AGTAGATATC 721 ATCGGCCAA Predicted gene structure (within gDNA segment 6848 to 1876): Exon 1 6017 6011 ( 7 n); cDNA 230 236 ( 7 n); score: 0.714 Intron 1 6010 2973 (3038 n); Pd: 0.000 (s: 0), Pa: 0.990 (s: 0.77) Exon 2 2972 2476 ( 497 n); cDNA 237 729 ( 493 n); score: 0.849 MATCH C06HBa0153O03.1-7- SGN-E351546- 0.849 504 0.691 C PGS_C06HBa0153O03.1-7-_SGN-E351546- (6017 6011,2972 2476) Alignment (genomic DNA sequence = upper lines): CTATGCTATC AAGAAGAAGG ATGAGATTGA GAGGGTTGCC AAGGCCAATC GTTGAGGGTG 5958 || || | CTCTGAT... .......... .......... .......... .......... .......... 236 CAGTATGGAT ATTTACTATT GGTTGGACAG TTTTGCTTCG AAACGTTGTT TGTTCTTTAT 5898 .......... .......... .......... .......... .......... .......... 236 TTTTTAGTTG CTAGAAAGGC ATTTTGGAAC TAGTAACGAG TTTTTCTGTT TGGGAAACTT 5838 .......... .......... .......... .......... .......... .......... 236 GGAGTACATT GGTATTGAAT ATTATATGGG GGAATTAGCA AAAAGCAAAA TTGAGCTTGC 5778 .......... .......... .......... .......... .......... .......... 236 TGTTATTCTC AGTTGAAATG TCTGGTTGTT GTGTTTCTGC TTCTTCTTTA TCAAGCTTTT 5718 .......... .......... .......... .......... .......... .......... 236 GTTCGCATAG GTTGAACAAT CTTGTTGGTG ACAATGCGGA CGGACATTAT TTTCTCCTTT 5658 .......... .......... .......... .......... .......... .......... 236 TTAGCTCAAT CTCAGCTGAG CATTACATTT TTATTTCTAA AAAGATAGAT TATGTTTTTG 5598 .......... .......... .......... .......... .......... .......... 236 CAGTTAACTG CTATCAAGTT TTTTCGTTGA TTTTTTAAAA AAGTTAATAA AGGTAGACAA 5538 .......... .......... .......... .......... .......... .......... 236 GTAAAGTCTG GAAGCAAGAT TTGGACGTTA GATGTTTGTG CAGAACTTGC AAAGTTGGAA 5478 .......... .......... .......... .......... .......... .......... 236 AATGTAGAAA TGTTTGGAGT TTAAATGTTC ATACAAGTTT TTGGAATAGC CTTCAAATAT 5418 .......... .......... .......... .......... .......... .......... 236 TACTTTTGTT TCATTTACCA TACAACTATT TTTTAGTAAC GTACTTATTC AATTCACTTA 5358 .......... .......... .......... .......... .......... .......... 236 ATTAAAATTA TAATCTTGTT CTTTGTTTTT GTTTTTTAAA TAGATTAATT TATTAAGAAG 5298 .......... .......... .......... .......... .......... .......... 236 GAATTTCTGT GTTAAGTAGT ATCCTTTTTT TGGTACAATT TTCATATTAT TTTTTGGGAA 5238 .......... .......... .......... .......... .......... .......... 236 AGGGATCAAA ACGTTTGTAC TTTTAATGAA GAAACTCAAA TCAACAAAAA TAACCCATTC 5178 .......... .......... .......... .......... .......... .......... 236 ACACAAAATA TCCAGATTCC TCTCTAATCT GACCCACGAA TAATTGTAAC CCAATTGCTT 5118 .......... .......... .......... .......... .......... .......... 236 ACCTTCTTCA TTGTGAAAAA ATTTGAACTA TATGAATTTT CCTCAAATTT CAACTCCATT 5058 .......... .......... .......... .......... .......... .......... 236 AATCATTTCT CGTATTTGAA ACTCCAGGGC TCTTTCTTTG TCTTTAAACG TAAAATGGAG 4998 .......... .......... .......... .......... .......... .......... 236 TCATTGACCA GCTATGTGAT TTAATCGCGG AGAAACTAGC ATGGATATTT GGTAGACGCC 4938 .......... .......... .......... .......... .......... .......... 236 CGCCCTCCCT CCCACAAGAA GCCCTACTTG TTGGATCTCC TAAGGTTTAC AGGTCATACT 4878 .......... .......... .......... .......... .......... .......... 236 CGTCGTTGCA CGATTTCTCT CTAAATGCTC TAATCATTCT GATGAAATGC CCAAATTGGC 4818 .......... .......... .......... .......... .......... .......... 236 ATCGCCATCG CTGATGACTT CACCGTAAAA CACAAACTCA ATCTGTTGTT TTCGGGTCAA 4758 .......... .......... .......... .......... .......... .......... 236 TATTTGACCA ATGTGTCACA ACGATTTAGA GGAACGGCGA ATAAGAACAA CTGGACAGAA 4698 .......... .......... .......... .......... .......... .......... 236 CCTGAAATGA GTTTTGCTGA ATCTATTTAT TGGTTCGAAA ACTCATTCCC GCAATTTGAA 4638 .......... .......... .......... .......... .......... .......... 236 GGAGATGGAG AATTTTTTTT TTTTGGAAGG AGATGAAGAA GATTTGAGAG GGGCATTTAA 4578 .......... .......... .......... .......... .......... .......... 236 TTTATTTGAT TTAGTGGGTC GGATTAAGAA GAAATTGAGT AATTTTATTG AAGTTGAATT 4518 .......... .......... .......... .......... .......... .......... 236 ATTTTTGTTC GATTTGAGAT TTTTGAGGTC ACGAATAACA AGTTTGCTGA CTACAATGAT 4458 .......... .......... .......... .......... .......... .......... 236 ATAAATAAAA GGAAACTTTC ACACGTAACC ACTTAAAAAT GACATAATTA CTCTTCATAG 4398 .......... .......... .......... .......... .......... .......... 236 CTATGGTTTA ATGATAAAAT TCGTAACTAC ATATTATATG GAGAAGAGAG GTGAACGAGA 4338 .......... .......... .......... .......... .......... .......... 236 CTCTGAGAGA GGAAAAAATG GGAGAAAGAT GAATTGTATA TGTATATAAT TGTATATTAT 4278 .......... .......... .......... .......... .......... .......... 236 ACATATGCAT TTGTATATAT GACAAGCAAG ATTGGGAAAG GAAGAAGGGA TGCAAGCGAG 4218 .......... .......... .......... .......... .......... .......... 236 ATTGAGAGAG GGCAGAGAGA GAGAGGTGAA TTGCATATGT AGATAGGTTA AATAATTATA 4158 .......... .......... .......... .......... .......... .......... 236 CATATGTATT TGTATACCTA ACGAATTATA CATATACAAA AGTGACTAAT TATGCAAATT 4098 .......... .......... .......... .......... .......... .......... 236 TGAAATCAGC CCAAATAATT AATGTATAAT GTTAGTCACG AATGGTAATT ATACCAAACT 4038 .......... .......... .......... .......... .......... .......... 236 ATAACTATGA TGAGTAATTA TATAATATAA ATTTACTTAA CCACGTAACT TTTCCTATAA 3978 .......... .......... .......... .......... .......... .......... 236 ATAACTTGAT ATTAATGATA AGGGTAAAGT CAAATAAATA ATAAAAGGGG TATATTTGAC 3918 .......... .......... .......... .......... .......... .......... 236 CCTTTTACTT ATTTTTCTAA ATAGTCACTA TGTTTTGAAA ATTGCCCACA CAAATTACTT 3858 .......... .......... .......... .......... .......... .......... 236 TTATTTTTTT TAGAGATTAT AGTATCACTC AACTATTACT TCACGTTTTA ATTTTTACGA 3798 .......... .......... .......... .......... .......... .......... 236 GTTAATTTGA CCATGGAGTT CATACAAAAA TAATGACTTT TTTTTTGTGT TTCAAGATAA 3738 .......... .......... .......... .......... .......... .......... 236 GCCACAGAAA TTTGGGTGGT CGTAAATAAT TTCTTTAGGG GTAAAATGAA TATTTTAAAA 3678 .......... .......... .......... .......... .......... .......... 236 TTAAATTGTT ACTAAATATA GAAACATATC ATTTTTTAGA CTGATTAAAA AAGTAGTATA 3618 .......... .......... .......... .......... .......... .......... 236 AATTGAGACA AGTATTGCTT TTCTTTAAAA ATATACTTAA ACCTTAAGGT CCTGTTTGGA 3558 .......... .......... .......... .......... .......... .......... 236 AGGACACTCT GATAACTGAA TTTGGTGGAA TTACATTGTC TGTCCTTTTT GGTTGAACAT 3498 .......... .......... .......... .......... .......... .......... 236 AGTAATTACT TGGACAGCAG GTAATTGGTG TAATTGGCAA GAGATAATTA CACTCTCAAA 3438 .......... .......... .......... .......... .......... .......... 236 TTTACAGGCA AAGACTGTGA ATTGCTGGTA ATTATATGGT GTAATTACCA ATTGATTACT 3378 .......... .......... .......... .......... .......... .......... 236 TTTTGATTTT TTTTAATTCT ATATTTATTT TTTTGTTTTA ATTTTTTTTC ACTTCTATTA 3318 .......... .......... .......... .......... .......... .......... 236 TATTAATTTT ATTTTTATTT TTATTATTCT AAAATGTTCC AGAAGTGTAT GTTTTATTTT 3258 .......... .......... .......... .......... .......... .......... 236 TTATTATTAT ATTTCATTTT CAACCTTACT TCAAATGATT CTATGCAATC TTTTTGTATT 3198 .......... .......... .......... .......... .......... .......... 236 ATCTATTTTT TTATTTTATG CTTATTTTCT CATTCGCATA ATTTTATTAG TATTCTACTT 3138 .......... .......... .......... .......... .......... .......... 236 TTCAAATTAA TTTATTGTTA AGTAAGGTTA TATATTCATG TCTTCTTTGT TTCTAAAAAA 3078 .......... .......... .......... .......... .......... .......... 236 AATCATTTTT ATGTAAGCTA TGTTAAAATT GGCTATAAGA GTATTGTGTT AAATTGTTGT 3018 .......... .......... .......... .......... .......... .......... 236 AGACATGTAT TTTTTGACCG AATTTAAATT TTATACCTTT TTTAGAGCTT AAATATGTCA 2958 | | | | ||||| .......... .......... .......... .......... .....A-CCA AGTTTTGTCA 250 CGACCCAAA- CCGGGTTGCG ACTGGCACCC ACACTTTCCC TCCTATGTGA GCGAACCAAC 2899 ||||||||| ||||| || |||||||||| |||||| ||| |||||||||| |||||||||| CGACCCAAAT CCGGGCCGCC ACTGGCACCC ACACTTACCC TCCTATGTGA GCGAACCAAC 310 CAATCT-AAC CTTAACATTT CAATATAATA TCAACAGAAA GTAATGCGGA AGACTTAAAC 2840 |||||| ||| |||||||||| |||| ||||| ||||||||| |||||||||| |||||||||| CAATCTAAAC CTTAACATTT CAATGTAATA GCAACAGAAA GTAATGCGGA AGACTTAAAC 370 TCATCAAATA AAGACCAATT CATTAACTTC TAAAATTCAA CATCTATTAT TTCCCAAAAT 2780 |||| ||| | | || | || ||||| | || ||| || ||||||||| ||||||||| TCAT-TAAT- -A-A--AA-T CAATAACTAC TATTATTAAA CATCTATTA- TTCCCAAAAC 422 CTGGAAGTCA TCATCACAAG AACATCTACG ATCAAATGAC TAA-ACTAAG AGTATTCTAA 2721 |||||||||| |||||||||| ||||||||| | ||| || ||| ||||| ||| |||||| CTGGAAGTCA TCATCACAAG AACATCTAC- TTTAAACTAC TAATTCTAAG AGT-TTCTAA 480 -AAGCT-AAA AA-TACATAA GAAGTTAGTC CATGCCGGAA GTTCAAGGCA TCAAGACTTG 2664 ||||| ||| || ||||||| |||| ||||| |||||||||| |||||||||| ||||||| || GAAGCTAAAA AATTACATAA GAAGCTAGTC CATGCCGGAA GTTCAAGGCA TCAAGACATG 540 AAGAAGAAGA TCCAGTCCAA GCTAGAGGCA TTAGCTTACC CTGAATTTTC GATGT-AGTA 2605 ||| |||||| |||||||||| |||||| || |||||| ||| ||||| | | | ||| | | AAGGAGAAGA TCCAGTCCAA GCTAGAAGCG TTAGCTCACC CTGAAGATCC GGTGTGACGA 600 AGACTGGCTT GAATTACTGT TGAGTTGAGG ACGATGACAC GTTTGCTGCA CTCCACAAAT 2545 |||||||||| || ||||||| ||||| || | | || | ||| |||||||||| |||||||||| AGACTGGCTT GAGTTACTGT TGAGTCGAAG ATGACGGCAC GTTTGCTGCA CTCCACAAAT 660 AAACAAGAAG -AAAACATAA AAGTAGGGGT CAGTACAAAA CACGGGTACT GAGTAGATAT 2486 | ||||||| ||||||||| |||||||||| |||||||||| ||||| |||| |||||||||| -ACCAAGAAG AAAAACATAA AAGTAGGGGT CAGTACAAAA CACGGCTACT GAGTAGATAT 719 CATCGGCCAA 2476 |||||||||| CATCGGCCAA 729 hqPGS_C06HBa0153O03.1-7-_SGN-E351546- (2972 2476) ******************************************************************************** EST sequence 28 -strand 580 n (File: SGN-E356206-) 1 GAAAAGTAAA AGCGTCCCCN TACCGTCCCT TAAGACTCTA CTAGACTTGT TCTTGTGTGA 61 TGAGACCAAC GACCCTAATG CTCTGATACC AAGTTTTGTC ACGACCCAAA TCCGGGCCGC 121 CACTGGCACC CACACTTACC CTCCTATGTG AGCGAACCAA CCAATCTAAA CCTTAACATT 181 TCAATGTAAT AGCAACAGAA AGTAATGCGG AAGACTTAAA CTCATTAATA AAATCAATAA 241 CTACTATTAT TAAACATCTA TTATTCCCAA AACCTGGAAG TCATCATCAC AAGAACATCT 301 ACTTTAAACT ACTAATTCTA AGAGTTTCTA AGAAGCTAAA AAATTACATA AGAAGCTAGT 361 CCATGCCGGA AGTTCAAGGC ATCAAGACAT GAAGGAGAAG ATCCAGTCCA AGCTAGAAGC 421 GTTAGCTCAC CCTGAAGATC CGGTGTGACG AAGACTGGCT TGAGTTACTG TTGAGTCGAA 481 GATGACGGCA CGTTTGCTGC ACTCCACAAA TAACAAGAAG AAAAACATAA AAGTAGGGGT 541 CAGTACAAAA CACGGCTACT GAGTAGATAT CATCGGCCAA Predicted gene structure (within gDNA segment 4758 to 1876): Exon 1 2962 2476 ( 487 n); cDNA 97 580 ( 484 n); score: 0.860 MATCH C06HBa0153O03.1-7- SGN-E356206- 0.860 487 0.840 C PGS_C06HBa0153O03.1-7-_SGN-E356206- (2962 2476) Alignment (genomic DNA sequence = upper lines): TGTCACGACC CAAA-CCGGG TTGCGACTGG CACCCACACT TTCCCTCCTA TGTGAGCGAA 2904 |||||||||| |||| ||||| || ||||| |||||||||| | |||||||| |||||||||| TGTCACGACC CAAATCCGGG CCGCCACTGG CACCCACACT TACCCTCCTA TGTGAGCGAA 156 CCAACCAATC T-AACCTTAA CATTTCAATA TAATATCAAC AGAAAGTAAT GCGGAAGACT 2845 |||||||||| | |||||||| ||||||||| ||||| |||| |||||||||| |||||||||| CCAACCAATC TAAACCTTAA CATTTCAATG TAATAGCAAC AGAAAGTAAT GCGGAAGACT 216 TAAACTCATC AAATAAAGAC CAATTCATTA ACTTCTAAAA TTCAACATCT ATTATTTCCC 2785 ||||||||| ||| | | || ||| || ||| ||| | || ||||||| |||| ||||| TAAACTCAT- TAAT--A-A- -AA-TCAATA ACTACTATTA TTAAACATCT ATTA-TTCCC 268 AAAATCTGGA AGTCATCATC ACAAGAACAT CTACGATCAA ATGACTAA-A CTAAGAGTAT 2726 |||| ||||| |||||||||| |||||||||| |||| | || | ||||| |||||||| | AAAACCTGGA AGTCATCATC ACAAGAACAT CTAC-TTTAA ACTACTAATT CTAAGAGT-T 326 TCTAA-AAGC T-AAAAA-TA CATAAGAAGT TAGTCCATGC CGGAAGTTCA AGGCATCAAG 2669 ||||| |||| | ||||| || ||||||||| |||||||||| |||||||||| |||||||||| TCTAAGAAGC TAAAAAATTA CATAAGAAGC TAGTCCATGC CGGAAGTTCA AGGCATCAAG 386 ACTTGAAGAA GAAGATCCAG TCCAAGCTAG AGGCATTAGC TTACCCTGAA TTTTCGATGT 2609 || ||||| | |||||||||| |||||||||| | || ||||| | |||||||| | || ||| ACATGAAGGA GAAGATCCAG TCCAAGCTAG AAGCGTTAGC TCACCCTGAA GATCCGGTGT 446 -AGTAAGACT GGCTTGAATT ACTGTTGAGT TGAGGACGAT GACACGTTTG CTGCACTCCA 2550 | |||||| ||||||| || |||||||||| || || || | |||||||| |||||||||| GACGAAGACT GGCTTGAGTT ACTGTTGAGT CGAAGATGAC GGCACGTTTG CTGCACTCCA 506 CAAATAAACA AGAAG-AAAA CATAAAAGTA GGGGTCAGTA CAAAACACGG GTACTGAGTA 2491 ||||| |||| ||||| |||| |||||||||| |||||||||| |||||||||| ||||||||| CAAAT-AACA AGAAGAAAAA CATAAAAGTA GGGGTCAGTA CAAAACACGG CTACTGAGTA 565 GATATCATCG GCCAA 2476 |||||||||| ||||| GATATCATCG GCCAA 580 hqPGS_C06HBa0153O03.1-7-_SGN-E356206- (2962 2476) ******************************************************************************** EST sequence 30 -strand 655 n (File: SGN-E356696-) 1 CAATTGGACT CAAGTAGTAG CACGAAAGAA AGAATGAAAG AGTGAAGTTT TCCTAAAGTC 61 TTATAGCCTC TCAAGGAAAA GTAAAAGCGT CCCCCTACCG TTCCTTAAGA CTCTACTAGA 121 CTTGTTCTTG TGTGATGAGA CCAACGAACC TAATGCTCTG ATACCAAGTT TTGTCACGAC 181 CCAAATCCGG GCCGCCACTG GCACCCACAC TTACCCTCNT ATGTGAGCGA ACCAACCAAT 241 CTAAACCTTA ACATTTCAAT GTAATAGCAA CAGAAAGTAA TGCGGAAGAC TTAAACTCAT 301 TAATAAAATC AATAACTACT ATTATTAAAC ATCTATTATT CCCAAAACCT GGAAGTCATC 361 ATCACAAGAA CATCTACTTT AAACTACTAA TTCTAAGAGT TTCTAAGAAG CTAAAAAATT 421 ACATAAGAAG CTAGTCCATG CCGGAAGTTC AAGGCATCAA GACATGAAGG AGAAGATCCA 481 GTCCAAGCTA GAAGCGTTAG CTCACCCTGA AGATCCGGTG TGACGAAGAC TGGCTTGAGT 541 TACTGTTGAG TCGAAGATGA CGGCACGTTT GCTGCACTCC ACAAATAACA AGAAGAAAAA 601 CATAAAAGTA GGGGTCAGTA CAAAACACGG CTACTGAGTA GATATCATCG GCCAA Predicted gene structure (within gDNA segment 5508 to 1876): Exon 1 2962 2476 ( 487 n); cDNA 172 655 ( 484 n); score: 0.858 MATCH C06HBa0153O03.1-7- SGN-E356696- 0.858 487 0.744 C PGS_C06HBa0153O03.1-7-_SGN-E356696- (2962 2476) Alignment (genomic DNA sequence = upper lines): TGTCACGACC CAAA-CCGGG TTGCGACTGG CACCCACACT TTCCCTCCTA TGTGAGCGAA 2904 |||||||||| |||| ||||| || ||||| |||||||||| | ||||| || |||||||||| TGTCACGACC CAAATCCGGG CCGCCACTGG CACCCACACT TACCCTCNTA TGTGAGCGAA 231 CCAACCAATC T-AACCTTAA CATTTCAATA TAATATCAAC AGAAAGTAAT GCGGAAGACT 2845 |||||||||| | |||||||| ||||||||| ||||| |||| |||||||||| |||||||||| CCAACCAATC TAAACCTTAA CATTTCAATG TAATAGCAAC AGAAAGTAAT GCGGAAGACT 291 TAAACTCATC AAATAAAGAC CAATTCATTA ACTTCTAAAA TTCAACATCT ATTATTTCCC 2785 ||||||||| ||| | | || ||| || ||| ||| | || ||||||| |||| ||||| TAAACTCAT- TAAT--A-A- -AA-TCAATA ACTACTATTA TTAAACATCT ATTA-TTCCC 343 AAAATCTGGA AGTCATCATC ACAAGAACAT CTACGATCAA ATGACTAA-A CTAAGAGTAT 2726 |||| ||||| |||||||||| |||||||||| |||| | || | ||||| |||||||| | AAAACCTGGA AGTCATCATC ACAAGAACAT CTAC-TTTAA ACTACTAATT CTAAGAGT-T 401 TCTAA-AAGC T-AAAAA-TA CATAAGAAGT TAGTCCATGC CGGAAGTTCA AGGCATCAAG 2669 ||||| |||| | ||||| || ||||||||| |||||||||| |||||||||| |||||||||| TCTAAGAAGC TAAAAAATTA CATAAGAAGC TAGTCCATGC CGGAAGTTCA AGGCATCAAG 461 ACTTGAAGAA GAAGATCCAG TCCAAGCTAG AGGCATTAGC TTACCCTGAA TTTTCGATGT 2609 || ||||| | |||||||||| |||||||||| | || ||||| | |||||||| | || ||| ACATGAAGGA GAAGATCCAG TCCAAGCTAG AAGCGTTAGC TCACCCTGAA GATCCGGTGT 521 -AGTAAGACT GGCTTGAATT ACTGTTGAGT TGAGGACGAT GACACGTTTG CTGCACTCCA 2550 | |||||| ||||||| || |||||||||| || || || | |||||||| |||||||||| GACGAAGACT GGCTTGAGTT ACTGTTGAGT CGAAGATGAC GGCACGTTTG CTGCACTCCA 581 CAAATAAACA AGAAG-AAAA CATAAAAGTA GGGGTCAGTA CAAAACACGG GTACTGAGTA 2491 ||||| |||| ||||| |||| |||||||||| |||||||||| |||||||||| ||||||||| CAAAT-AACA AGAAGAAAAA CATAAAAGTA GGGGTCAGTA CAAAACACGG CTACTGAGTA 640 GATATCATCG GCCAA 2476 |||||||||| ||||| GATATCATCG GCCAA 655 hqPGS_C06HBa0153O03.1-7-_SGN-E356696- (2962 2476) ******************************************************************************** EST sequence 47 +strand 434 n (File: SGN-E222578+) 1 TTTTTTTTTT TTTTTTTTTA ATAAAAACCA ATTCAATAAC TATCAATATT CAACATCTAT 61 TATTCCCAAA ACCTGGAAGT CATCATCACA AGAACATCTA CTTTAAACTA CTAATTCTAA 121 GAGTTTCTAA AAGCTAAAAA TACATAAGAA GCTAGTCCAT GCCGGAGGTT CAAGGCATCA 181 AGACATGAAG GAGAAGATCC AGTCCAAGCT AGACGCGTTA GCTCACCCTG AAGATCCGGT 241 GTGACGAAGA CTGGCTTGAG TTACTGTTGA GTCGAAGATG ACGGCACGTT TGCTGCACTC 301 CACAACTTTC TAGATGGGGA CTTTCTTCAA GGCTTCGAGA TGGAAACTTG CTTGCAGAGC 361 TTCGAGTGTT ACCAGCTTCA AGATGGAGTT TCAGTGATGA GGCTTGCTAG TCTCGAGTTT 421 TTTTTTTTTT TTTT Predicted gene structure (within gDNA segment 3875 to 1): Exon 1 3311 3293 ( 19 n); cDNA 1 19 ( 19 n); score: 0.842 Intron 1 3292 2834 ( 459 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.86) Exon 2 2833 2547 ( 287 n); cDNA 20 305 ( 286 n); score: 0.864 MATCH C06HBa0153O03.1-7- SGN-E222578+ 0.864 306 0.705 C PGS_C06HBa0153O03.1-7-_SGN-E222578+ (3311 3293,2833 2547) Alignment (genomic DNA sequence = upper lines): TTTTATTTTT ATTTTTATTA TTCTAAAATG TTCCAGAAGT GTATGTTTTA TTTTTTATTA 3252 |||| ||||| ||||| || TTTTTTTTTT TTTTTTTTT. .......... .......... .......... .......... 19 TTATATTTCA TTTTCAACCT TACTTCAAAT GATTCTATGC AATCTTTTTG TATTATCTAT 3192 .......... .......... .......... .......... .......... .......... 19 TTTTTTATTT TATGCTTATT TTCTCATTCG CATAATTTTA TTAGTATTCT ACTTTTCAAA 3132 .......... .......... .......... .......... .......... .......... 19 TTAATTTATT GTTAAGTAAG GTTATATATT CATGTCTTCT TTGTTTCTAA AAAAAATCAT 3072 .......... .......... .......... .......... .......... .......... 19 TTTTATGTAA GCTATGTTAA AATTGGCTAT AAGAGTATTG TGTTAAATTG TTGTAGACAT 3012 .......... .......... .......... .......... .......... .......... 19 GTATTTTTTG ACCGAATTTA AATTTTATAC CTTTTTTAGA GCTTAAATAT GTCACGACCC 2952 .......... .......... .......... .......... .......... .......... 19 AAACCGGGTT GCGACTGGCA CCCACACTTT CCCTCCTATG TGAGCGAACC AACCAATCTA 2892 .......... .......... .......... .......... .......... .......... 19 ACCTTAACAT TTCAATATAA TATCAACAGA AAGTAATGCG GAAGACTTAA ACTCATCAAA 2832 || .......... .......... .......... .......... .......... ........AA 21 TAAAGACCAA TTCATTAACT TCTAAAATTC AACATCTATT ATTTCCCAAA ATCTGGAAGT 2772 |||| ||||| |||| ||||| || |||| |||||||||| ||| |||||| | |||||||| TAAAAACCAA TTCAATAACT ATCAATATTC AACATCTATT ATT-CCCAAA ACCTGGAAGT 80 CATCATCACA AGAACATCTA CGATCAAATG ACTAA-ACTA AGAGTATTCT AAAAGCTAAA 2713 |||||||||| |||||||||| | | ||| ||||| ||| ||||| |||| |||||||||| CATCATCACA AGAACATCTA C-TTTAAACT ACTAATTCTA AGAGT-TTCT AAAAGCTAAA 138 AATACATAAG AAGTTAGTCC ATGCCGGAAG TTCAAGGCAT CAAGACTTGA AGAAGAAGAT 2653 |||||||||| ||| |||||| |||||||| | |||||||||| |||||| ||| || ||||||| AATACATAAG AAGCTAGTCC ATGCCGGAGG TTCAAGGCAT CAAGACATGA AGGAGAAGAT 198 CCAGTCCAAG CTAGAGGCAT TAGCTTACCC TGAATTTTCG ATGT-AGTAA GACTGGCTTG 2594 |||||||||| ||||| || | ||||| |||| |||| | || ||| | || |||||||||| CCAGTCCAAG CTAGACGCGT TAGCTCACCC TGAAGATCCG GTGTGACGAA GACTGGCTTG 258 AATTACTGTT GAGTTGAGGA CGATGACACG TTTGCTGCAC TCCACAA 2547 | |||||||| |||| || || || | |||| |||||||||| ||||||| AGTTACTGTT GAGTCGAAGA TGACGGCACG TTTGCTGCAC TCCACAA 305 hqPGS_C06HBa0153O03.1-7-_SGN-E222578+ (2833 2547) ******************************************************************************** EST sequence 131 +strand 710 n (File: SGN-E392027+) 1 CCACAGCCCC AGTGGCTGGC TCAGTCGCAC CCTGTCCCGC CGGTGCTGGT GTTGATGCTG 61 GCGTAGTCGT TGCTCTAGTT CTAACCATCT GCGAAATAGA GTGAAGATGG TCAGATACCA 121 ATTTGTATCA CCTAGATACC AATTGGACCC AAGTAATAGC ACGAAAGAAG AAAGAATGGA 181 ATTTTCCAAA AGTCTTATAG CCTCTCAAGG AAAAGTAAAG GCATCCCCCT ACCGTTCCTT 241 AAGACTCTAC TAGACTCGTT CTTGTGTGAT GAGACCAACG AACCTAATGC TCTGATACCA 301 AGTTTGTCAC GACCAAAACC GGGTTGCGAC TGGCACCCAC ACTTACCCTC CTATGTGAGC 361 GAACCAACCA ATCTAACCTT AACATTTCAA TATAATATCA ACAGAAAGTA ATGTGGAAGA 421 CTTAAACTCA TTAAATACAG ACCAATTCAT TAACTTCTAA AATTCAACAT CTATTATTCC 481 CCAAAATCTG GAAGTCATCA CCACAAGAAC ATCTACGATC AAATGACTAA ACTAAGAGTA 541 GTCTAAAAGC TAAAAATACA TAAGAAGCTA GTCCATGCCG GAAGTTCAAG GCATCAAGAC 601 TTGAAGAAGA AGATCCAGTC CAAGCTAGAA GCATTAGCTC ACCCTGAATT TCCGATGTAG 661 TAAGACTGGC TTGAATTACT GTTGAGTTGA ACACGATGGC ACGTTTGCTG Predicted gene structure (within gDNA segment 6701 to 1767): Exon 1 6017 6011 ( 7 n); cDNA 290 296 ( 7 n); score: 0.714 Intron 1 6010 2973 (3038 n); Pd: 0.000 (s: 0), Pa: 0.990 (s: 0.88) Exon 2 2972 2557 ( 416 n); cDNA 297 710 ( 414 n); score: 0.952 MATCH C06HBa0153O03.1-7- SGN-E392027+ 0.952 423 0.596 C PGS_C06HBa0153O03.1-7-_SGN-E392027+ (6017 6011,2972 2557) Alignment (genomic DNA sequence = upper lines): CTATGCTATC AAGAAGAAGG ATGAGATTGA GAGGGTTGCC AAGGCCAATC GTTGAGGGTG 5958 || || | CTCTGAT... .......... .......... .......... .......... .......... 296 CAGTATGGAT ATTTACTATT GGTTGGACAG TTTTGCTTCG AAACGTTGTT TGTTCTTTAT 5898 .......... .......... .......... .......... .......... .......... 296 TTTTTAGTTG CTAGAAAGGC ATTTTGGAAC TAGTAACGAG TTTTTCTGTT TGGGAAACTT 5838 .......... .......... .......... .......... .......... .......... 296 GGAGTACATT GGTATTGAAT ATTATATGGG GGAATTAGCA AAAAGCAAAA TTGAGCTTGC 5778 .......... .......... .......... .......... .......... .......... 296 TGTTATTCTC AGTTGAAATG TCTGGTTGTT GTGTTTCTGC TTCTTCTTTA TCAAGCTTTT 5718 .......... .......... .......... .......... .......... .......... 296 GTTCGCATAG GTTGAACAAT CTTGTTGGTG ACAATGCGGA CGGACATTAT TTTCTCCTTT 5658 .......... .......... .......... .......... .......... .......... 296 TTAGCTCAAT CTCAGCTGAG CATTACATTT TTATTTCTAA AAAGATAGAT TATGTTTTTG 5598 .......... .......... .......... .......... .......... .......... 296 CAGTTAACTG CTATCAAGTT TTTTCGTTGA TTTTTTAAAA AAGTTAATAA AGGTAGACAA 5538 .......... .......... .......... .......... .......... .......... 296 GTAAAGTCTG GAAGCAAGAT TTGGACGTTA GATGTTTGTG CAGAACTTGC AAAGTTGGAA 5478 .......... .......... .......... .......... .......... .......... 296 AATGTAGAAA TGTTTGGAGT TTAAATGTTC ATACAAGTTT TTGGAATAGC CTTCAAATAT 5418 .......... .......... .......... .......... .......... .......... 296 TACTTTTGTT TCATTTACCA TACAACTATT TTTTAGTAAC GTACTTATTC AATTCACTTA 5358 .......... .......... .......... .......... .......... .......... 296 ATTAAAATTA TAATCTTGTT CTTTGTTTTT GTTTTTTAAA TAGATTAATT TATTAAGAAG 5298 .......... .......... .......... .......... .......... .......... 296 GAATTTCTGT GTTAAGTAGT ATCCTTTTTT TGGTACAATT TTCATATTAT TTTTTGGGAA 5238 .......... .......... .......... .......... .......... .......... 296 AGGGATCAAA ACGTTTGTAC TTTTAATGAA GAAACTCAAA TCAACAAAAA TAACCCATTC 5178 .......... .......... .......... .......... .......... .......... 296 ACACAAAATA TCCAGATTCC TCTCTAATCT GACCCACGAA TAATTGTAAC CCAATTGCTT 5118 .......... .......... .......... .......... .......... .......... 296 ACCTTCTTCA TTGTGAAAAA ATTTGAACTA TATGAATTTT CCTCAAATTT CAACTCCATT 5058 .......... .......... .......... .......... .......... .......... 296 AATCATTTCT CGTATTTGAA ACTCCAGGGC TCTTTCTTTG TCTTTAAACG TAAAATGGAG 4998 .......... .......... .......... .......... .......... .......... 296 TCATTGACCA GCTATGTGAT TTAATCGCGG AGAAACTAGC ATGGATATTT GGTAGACGCC 4938 .......... .......... .......... .......... .......... .......... 296 CGCCCTCCCT CCCACAAGAA GCCCTACTTG TTGGATCTCC TAAGGTTTAC AGGTCATACT 4878 .......... .......... .......... .......... .......... .......... 296 CGTCGTTGCA CGATTTCTCT CTAAATGCTC TAATCATTCT GATGAAATGC CCAAATTGGC 4818 .......... .......... .......... .......... .......... .......... 296 ATCGCCATCG CTGATGACTT CACCGTAAAA CACAAACTCA ATCTGTTGTT TTCGGGTCAA 4758 .......... .......... .......... .......... .......... .......... 296 TATTTGACCA ATGTGTCACA ACGATTTAGA GGAACGGCGA ATAAGAACAA CTGGACAGAA 4698 .......... .......... .......... .......... .......... .......... 296 CCTGAAATGA GTTTTGCTGA ATCTATTTAT TGGTTCGAAA ACTCATTCCC GCAATTTGAA 4638 .......... .......... .......... .......... .......... .......... 296 GGAGATGGAG AATTTTTTTT TTTTGGAAGG AGATGAAGAA GATTTGAGAG GGGCATTTAA 4578 .......... .......... .......... .......... .......... .......... 296 TTTATTTGAT TTAGTGGGTC GGATTAAGAA GAAATTGAGT AATTTTATTG AAGTTGAATT 4518 .......... .......... .......... .......... .......... .......... 296 ATTTTTGTTC GATTTGAGAT TTTTGAGGTC ACGAATAACA AGTTTGCTGA CTACAATGAT 4458 .......... .......... .......... .......... .......... .......... 296 ATAAATAAAA GGAAACTTTC ACACGTAACC ACTTAAAAAT GACATAATTA CTCTTCATAG 4398 .......... .......... .......... .......... .......... .......... 296 CTATGGTTTA ATGATAAAAT TCGTAACTAC ATATTATATG GAGAAGAGAG GTGAACGAGA 4338 .......... .......... .......... .......... .......... .......... 296 CTCTGAGAGA GGAAAAAATG GGAGAAAGAT GAATTGTATA TGTATATAAT TGTATATTAT 4278 .......... .......... .......... .......... .......... .......... 296 ACATATGCAT TTGTATATAT GACAAGCAAG ATTGGGAAAG GAAGAAGGGA TGCAAGCGAG 4218 .......... .......... .......... .......... .......... .......... 296 ATTGAGAGAG GGCAGAGAGA GAGAGGTGAA TTGCATATGT AGATAGGTTA AATAATTATA 4158 .......... .......... .......... .......... .......... .......... 296 CATATGTATT TGTATACCTA ACGAATTATA CATATACAAA AGTGACTAAT TATGCAAATT 4098 .......... .......... .......... .......... .......... .......... 296 TGAAATCAGC CCAAATAATT AATGTATAAT GTTAGTCACG AATGGTAATT ATACCAAACT 4038 .......... .......... .......... .......... .......... .......... 296 ATAACTATGA TGAGTAATTA TATAATATAA ATTTACTTAA CCACGTAACT TTTCCTATAA 3978 .......... .......... .......... .......... .......... .......... 296 ATAACTTGAT ATTAATGATA AGGGTAAAGT CAAATAAATA ATAAAAGGGG TATATTTGAC 3918 .......... .......... .......... .......... .......... .......... 296 CCTTTTACTT ATTTTTCTAA ATAGTCACTA TGTTTTGAAA ATTGCCCACA CAAATTACTT 3858 .......... .......... .......... .......... .......... .......... 296 TTATTTTTTT TAGAGATTAT AGTATCACTC AACTATTACT TCACGTTTTA ATTTTTACGA 3798 .......... .......... .......... .......... .......... .......... 296 GTTAATTTGA CCATGGAGTT CATACAAAAA TAATGACTTT TTTTTTGTGT TTCAAGATAA 3738 .......... .......... .......... .......... .......... .......... 296 GCCACAGAAA TTTGGGTGGT CGTAAATAAT TTCTTTAGGG GTAAAATGAA TATTTTAAAA 3678 .......... .......... .......... .......... .......... .......... 296 TTAAATTGTT ACTAAATATA GAAACATATC ATTTTTTAGA CTGATTAAAA AAGTAGTATA 3618 .......... .......... .......... .......... .......... .......... 296 AATTGAGACA AGTATTGCTT TTCTTTAAAA ATATACTTAA ACCTTAAGGT CCTGTTTGGA 3558 .......... .......... .......... .......... .......... .......... 296 AGGACACTCT GATAACTGAA TTTGGTGGAA TTACATTGTC TGTCCTTTTT GGTTGAACAT 3498 .......... .......... .......... .......... .......... .......... 296 AGTAATTACT TGGACAGCAG GTAATTGGTG TAATTGGCAA GAGATAATTA CACTCTCAAA 3438 .......... .......... .......... .......... .......... .......... 296 TTTACAGGCA AAGACTGTGA ATTGCTGGTA ATTATATGGT GTAATTACCA ATTGATTACT 3378 .......... .......... .......... .......... .......... .......... 296 TTTTGATTTT TTTTAATTCT ATATTTATTT TTTTGTTTTA ATTTTTTTTC ACTTCTATTA 3318 .......... .......... .......... .......... .......... .......... 296 TATTAATTTT ATTTTTATTT TTATTATTCT AAAATGTTCC AGAAGTGTAT GTTTTATTTT 3258 .......... .......... .......... .......... .......... .......... 296 TTATTATTAT ATTTCATTTT CAACCTTACT TCAAATGATT CTATGCAATC TTTTTGTATT 3198 .......... .......... .......... .......... .......... .......... 296 ATCTATTTTT TTATTTTATG CTTATTTTCT CATTCGCATA ATTTTATTAG TATTCTACTT 3138 .......... .......... .......... .......... .......... .......... 296 TTCAAATTAA TTTATTGTTA AGTAAGGTTA TATATTCATG TCTTCTTTGT TTCTAAAAAA 3078 .......... .......... .......... .......... .......... .......... 296 AATCATTTTT ATGTAAGCTA TGTTAAAATT GGCTATAAGA GTATTGTGTT AAATTGTTGT 3018 .......... .......... .......... .......... .......... .......... 296 AGACATGTAT TTTTTGACCG AATTTAAATT TTATACCTTT TTTAGAGCTT AAATATGTCA 2958 | | || | ||||| .......... .......... .......... .......... .....A-C-C AAGTTTGTCA 309 CGACCCAAAC CGGGTTGCGA CTGGCACCCA CACTTTCCCT CCTATGTGAG CGAACCAACC 2898 ||||| |||| |||||||||| |||||||||| ||||| |||| |||||||||| |||||||||| CGACCAAAAC CGGGTTGCGA CTGGCACCCA CACTTACCCT CCTATGTGAG CGAACCAACC 369 AATCTAACCT TAACATTTCA ATATAATATC AACAGAAAGT AATGCGGAAG ACTTAAACTC 2838 |||||||||| |||||||||| |||||||||| |||||||||| |||| ||||| |||||||||| AATCTAACCT TAACATTTCA ATATAATATC AACAGAAAGT AATGTGGAAG ACTTAAACTC 429 ATCAAATAAA GACCAATTCA TTAACTTCTA AAATTCAACA TCTATTATTT CCCAAAATCT 2778 || ||||| | |||||||||| |||||||||| |||||||||| ||||||||| |||||||||| ATTAAATACA GACCAATTCA TTAACTTCTA AAATTCAACA TCTATTATTC CCCAAAATCT 489 GGAAGTCATC ATCACAAGAA CATCTACGAT CAAATGACTA AACTAAGAGT ATTCTAAAAG 2718 |||||||||| | |||||||| |||||||||| |||||||||| |||||||||| | |||||||| GGAAGTCATC ACCACAAGAA CATCTACGAT CAAATGACTA AACTAAGAGT AGTCTAAAAG 549 CTAAAAATAC ATAAGAAGTT AGTCCATGCC GGAAGTTCAA GGCATCAAGA CTTGAAGAAG 2658 |||||||||| |||||||| | |||||||||| |||||||||| |||||||||| |||||||||| CTAAAAATAC ATAAGAAGCT AGTCCATGCC GGAAGTTCAA GGCATCAAGA CTTGAAGAAG 609 AAGATCCAGT CCAAGCTAGA GGCATTAGCT TACCCTGAAT TTTCGATGTA GTAAGACTGG 2598 |||||||||| |||||||||| ||||||||| ||||||||| || ||||||| |||||||||| AAGATCCAGT CCAAGCTAGA AGCATTAGCT CACCCTGAAT TTCCGATGTA GTAAGACTGG 669 CTTGAATTAC TGTTGAGTTG AGGACGATGA CACGTTTGCT G 2557 |||||||||| |||||||||| | |||||| |||||||||| | CTTGAATTAC TGTTGAGTTG AACACGATGG CACGTTTGCT G 710 hqPGS_C06HBa0153O03.1-7-_SGN-E392027+ (2972 2557) ******************************************************************************** EST sequence 46 +strand 679 n (File: SGN-E370357+) 1 TTTTTTTTTT CTTACAATTA TATTATGAAT TCGATAATCT TTAATGTCAC GACCCAAATC 61 GAGCCGCAAG TGGCACCCAC ACTTACCCTC CTATGTGAGC GAACCAACCA ATACAAAATC 121 CAACATTTCA ATATAATGAC GGAATATAAT GCGGAAGACT TAAACTCATT AATGAAAATC 181 AATTAAATAA CTTCTAAAAA CTCAACAACT ATTATTATCC CCAAAATCTG GAAGTCATCA 241 TCACAAGAAC ATCTATCCTC AAATTACTAA TTCTAAGAGT ATCTAGAAAG CTAGAATAAC 301 TAAAAAGCTA GTCCATGCCG GAACTTCAAG GCATCAAGAC ATGAAGAAGA AGATCCAGTC 361 CAAGCTAGAA GCGTTAGCTC ACACTGAAAT CCGGTATAAT GAAGACTGGC TAGAGTTGCG 421 GTTGAGTTGA AGACGACGGT ACGTTTGCTT TATTCGAGTG TCAATTAATC ATTCGGCTGT 481 CACCCAAATA TTATTGATTG ATTACACCTC TGCCATTTGT AAAATTTTTC AAATTTGCCT 541 ACGGATGCAG AATTTTCCTC GAATTTCTGA TGTGTTTTCT TGTAAATAGT GGCCATTTGT 601 GTAAGTAAAT GCCCATTTCT CCTCCTACAA AGTCCAATTC CATTTTTCCC CCAATCCACC 661 ATGGCAACAC CACCTCCAA Predicted gene structure (within gDNA segment 4371 to 1): Exon 1 3192 3158 ( 35 n); cDNA 2 36 ( 35 n); score: 0.657 Intron 1 3157 2973 ( 185 n); Pd: 0.000 (s: 0), Pa: 0.990 (s: 0.80) Exon 2 2972 2558 ( 415 n); cDNA 37 449 ( 413 n); score: 0.804 MATCH C06HBa0153O03.1-7- SGN-E370357+ 0.804 450 0.663 C PGS_C06HBa0153O03.1-7-_SGN-E370357+ (3192 3158,2972 2558) Alignment (genomic DNA sequence = upper lines): TTTTTTTATT TTATGCTTAT TTTCTCATTC GCATAATTTT ATTAGTATTC TACTTTTCAA 3133 ||||||| | ||| |||| || | | | ||| TTTTTTTTTC TTACAATTAT ATTATGAATT CGATA..... .......... .......... 36 ATTAATTTAT TGTTAAGTAA GGTTATATAT TCATGTCTTC TTTGTTTCTA AAAAAAATCA 3073 .......... .......... .......... .......... .......... .......... 36 TTTTTATGTA AGCTATGTTA AAATTGGCTA TAAGAGTATT GTGTTAAATT GTTGTAGACA 3013 .......... .......... .......... .......... .......... .......... 36 TGTATTTTTT GACCGAATTT AAATTTTATA CCTTTTTTAG AGCTTAAATA TGTCACGACC 2953 | ||| | | |||||||||| .......... .......... .......... .......... ATCTT-TA-A TGTCACGACC 54 CAAACCGGGT TGCGACTGGC ACCCACACTT TCCCTCCTAT GTGAGCGAAC CAACCAAT-C 2894 |||| || | || | |||| |||||||||| ||||||||| |||||||||| |||||||| | CAAATCGAGC CGCAAGTGGC ACCCACACTT ACCCTCCTAT GTGAGCGAAC CAACCAATAC 114 TAACCTTAAC ATTTCAATAT AATATCAACA GAAAGTAATG CGGAAGACTT AAACTCATCA 2834 || ||| |||||||||| ||| || ||| ||||| |||||||||| |||||||| AAAATCCAAC ATTTCAATAT AAT---GACG GAATATAATG CGGAAGACTT AAACTCAT-T 170 AATAAAGACC AATTCATTAA CTTCT-AAAA TTCAACATCT ATTATT-T-C CCAAAATCTG 2777 ||| || | | |||| | ||| ||||| |||| |||||| || |||||| | | |||||||||| AATGAAAATC AATTAAATAA CTTCTAAAAA CTCAACAACT ATTATTATCC CCAAAATCTG 230 GAAGTCATCA TCACAAGAAC ATCTACGATC AAATGACTAA -ACTAAGAGT ATTCTA-AAA 2719 |||||||||| |||||||||| ||||| || |||| ||||| |||||||| | |||| ||| GAAGTCATCA TCACAAGAAC ATCTATCCTC AAATTACTAA TTCTAAGAGT A-TCTAGAAA 289 GCTAAAAATA CATAAGAAGT TAGTCCATGC CGGAAGTTCA AGGCATCAAG ACTTGAAGAA 2659 ||| | |||| ||| ||| |||||||||| ||||| |||| |||||||||| || ||||||| GCT-AGAATA ACTAAAAAGC TAGTCCATGC CGGAACTTCA AGGCATCAAG ACATGAAGAA 348 GAAGATCCAG TCCAAGCTAG AGGCATTAGC TTACCCTGAA TTTTCGATGT AGT-AAGACT 2600 |||||||||| |||||||||| | || ||||| | || ||||| | || | | | | |||||| GAAGATCCAG TCCAAGCTAG AAGCGTTAGC TCACACTGAA -ATCCGGTAT AATGAAGACT 407 GGCTTGAATT ACTGTTGAGT TGAGGACGAT GACACGTTTG CT 2558 |||| || || | ||||||| ||| ||||| | ||||||| || GGCTAGAGTT GCGGTTGAGT TGAAGACGAC GGTACGTTTG CT 449 hqPGS_C06HBa0153O03.1-7-_SGN-E370357+ (2972 2558) ******************************************************************************** EST sequence 99 +strand 840 n (File: SGN-E542084+) 1 TTTTTTTTTT TAGGGGAAAA TTTCTTACTT CTATAAATGT CACGACCCAA ATCGGATCGC 61 GACTGGCACC CACACTTACC CTGCTATGTG AGCGAACCAA CCAATCCAAA CCTTAACATT 121 TCAATGTAAT ATCAACATAA AGTAATGCGG AAGACTTAAA CTTATTAATG AAAACCAATT 181 CAATAACTAT TATTTCCCAA AATCTGGAAG TCATCATCAT AAGAACATCT ACTTCAAATT 241 ACTAAATCTA AGAGTTTCTA AGAAGCTAAA AAATACATAA AAGCTAGTCC ATGCCGGAAC 301 TTCAAGACAT CAAGACATGA AGAGGAAGAT CCAGTCCAAT CTAGAAAGCA TTAGCTCACC 361 CTGATATCCG AAGTAATGAA GACTGGCTAG AGTTACTGTT GAGTCGAAGA TGACGGCACG 421 TTTGCTAAAA TCAGTGGACG GAGGAGAAGG GAAAGCACAC CGGGAATGAG AAGAAGCTGA 481 AGGAGGAACC AAAGAGGAAT CCCATTGCAA AGTAAATGAG AGTGTAAGCT AGCAGACGCG 541 ATGGAAGAGC TTACGCAGAA ATAACACTCT CATTTGGTGA TTTAGTTTGG AGATCATCTG 601 AGACCTTCGT GTTGGACAAC ATCATCCATG AAGATGTCAT TAGAAAAGTT AGATGCTTTA 661 TATACATGTT GATAGTTCCT GACTACTCTA TTTCTTTTTC AGAAAGCCCC GAAATTTCTC 721 AGATGATAAA TGCTGTCTGT TTTGGAAAAC CATCTCTATG CAAAGATGAT GTTTGCTGCA 781 TTGAGGTGTC AATATTGGGA ATTTCAAGAA AATTATGCCT TGTAGAATAT GTACAGCAAC Predicted gene structure (within gDNA segment 4121 to 1): Exon 1 2968 2558 ( 411 n); cDNA 32 426 ( 395 n); score: 0.822 PPA cDNA 11 1 MATCH C06HBa0153O03.1-7- SGN-E542084+ 0.822 411 0.489 C PGS_C06HBa0153O03.1-7-_SGN-E542084+ (2968 2558) Alignment (genomic DNA sequence = upper lines): TAAATATGTC ACGACCCAAA CCGGGTTGCG ACTGGCACCC ACACTTTCCC TCCTATGTGA 2909 || | ||||| |||||||||| ||| | ||| |||||||||| |||||| ||| | |||||||| TATAAATGTC ACGACCCAAA TCGGATCGCG ACTGGCACCC ACACTTACCC TGCTATGTGA 91 GCGAACCAAC CAAT-CTAAC CTTAACATTT CAATATAATA TCAACAGAAA GTAATGCGGA 2850 |||||||||| |||| | ||| |||||||||| |||| ||||| |||||| ||| |||||||||| GCGAACCAAC CAATCCAAAC CTTAACATTT CAATGTAATA TCAACATAAA GTAATGCGGA 151 AGACTTAAAC TCATCAAATA AAGACCAATT CATTAACTTC TAAAATTCAA CATCTATTAT 2790 |||||||||| | || ||| || ||||||| | || | || ||||||| AGACTTAAAC TTAT-TAATG AAAACCAATT C---AA---- T--AA----- ---CTATTAT 193 TTCCCAAAAT CTGGAAGTCA TCATCACAAG AACATCTACG ATCAAATGAC TAAA-CTAAG 2731 |||||||||| |||||||||| |||||| ||| ||||||||| |||||| || |||| ||||| TTCCCAAAAT CTGGAAGTCA TCATCATAAG AACATCTAC- TTCAAATTAC TAAATCTAAG 252 AGTATTCTAA -AAGCT-AAA AATACATAAG AAGTTAGTCC ATGCCGGAAG TTCAAGGCAT 2673 ||| |||||| ||||| ||| ||||||||| ||| |||||| ||||||||| |||||| ||| AGT-TTCTAA GAAGCTAAAA AATACATAA- AAGCTAGTCC ATGCCGGAAC TTCAAGACAT 310 CAAGACTTGA AGAAGAAGAT CCAGTCCAAG CTAG-AGGCA TTAGCTTACC CTGAATTTTC 2614 |||||| ||| ||| |||||| ||||||||| |||| | ||| |||||| ||| ||| || | | CAAGACATGA AGAGGAAGAT CCAGTCCAAT CTAGAAAGCA TTAGCTCACC CTG-ATATCC 369 GATGTAGT-A AGACTGGCTT GAATTACTGT TGAGTTGAGG ACGATGACAC GTTTGCT 2558 || ||| | | ||||||||| || ||||||| ||||| || | | || | ||| ||||||| GAAGTAATGA AGACTGGCTA GAGTTACTGT TGAGTCGAAG ATGACGGCAC GTTTGCT 426 hqPGS_C06HBa0153O03.1-7-_SGN-E542084+ (2968 2558) ******************************************************************************** EST sequence 12 -strand 299 n (File: SGN-E373117-) 1 TTTTTTTTTT TCTAAAAACT CAACAACTAT TATTATCCCC AAAATCTGGA AGTCATCATC 61 ACAAGAACAT CTACTTCAAA TTACTAAATC TAAGAGTATC TAAGAAGCTA AAATACATAA 121 ACAGCTAGTC CATGCCGGAA CTTCAAGGCA TCAAGACATG AAGAAGAAGA TCCAGTCCAA 181 GCTAGAAGCG TTAGCTCACC CTGAAATCCG ATGTAATGAA GACTGGCTAG AGTTGCGGTT 241 GAGTTGAAGA CGACGGTACG TTTGCCAAAA TTACGACAGT ATTTGGACAA GCTAGAAGA Predicted gene structure (within gDNA segment 3767 to 908): Exon 1 2821 2561 ( 261 n); cDNA 1 263 ( 263 n); score: 0.820 MATCH C06HBa0153O03.1-7- SGN-E373117- 0.820 261 0.873 C PGS_C06HBa0153O03.1-7-_SGN-E373117- (2821 2561) Alignment (genomic DNA sequence = upper lines): TTCATTAACT TCT-AAAATT CAACATCTAT TATT-TCC-C AAAATCTGGA AGTCATCATC 2765 || || | ||| |||| | ||||| |||| |||| ||| | |||||||||| |||||||||| TTTTTTTTTT TCTAAAAACT CAACAACTAT TATTATCCCC AAAATCTGGA AGTCATCATC 60 ACAAGAACAT CTACGATCAA ATGACTAAA- CTAAGAGTAT TCTAA-AAGC TAAAAATACA 2707 |||||||||| |||| |||| || |||||| ||||||||| ||||| |||| | |||||||| ACAAGAACAT CTAC-TTCAA ATTACTAAAT CTAAGAGTA- TCTAAGAAGC T-AAAATACA 117 TAAGAAGTTA GTCCATGCCG GAAGTTCAAG GCATCAAGAC TTGAAGAAGA AGATCCAGTC 2647 ||| || || |||||||||| ||| |||||| |||||||||| ||||||||| |||||||||| TAAACAGCTA GTCCATGCCG GAACTTCAAG GCATCAAGAC ATGAAGAAGA AGATCCAGTC 177 CAAGCTAGAG GCATTAGCTT ACCCTGAATT TTCGATGTAG T-AAGACTGG CTTGAATTAC 2588 ||||||||| || |||||| |||||||| | ||||||| | |||||||| || || || | CAAGCTAGAA GCGTTAGCTC ACCCTGAA-A TCCGATGTAA TGAAGACTGG CTAGAGTTGC 236 TGTTGAGTTG AGGACGATGA CACGTTT 2561 ||||||||| | ||||| | |||||| GGTTGAGTTG AAGACGACGG TACGTTT 263 hqPGS_C06HBa0153O03.1-7-_SGN-E373117- (2821 2561) ******************************************************************************** EST sequence 89 +strand 299 n (File: SGN-E373116+) 1 TTTTTTTTTT CTAAAAACTC AACAACTATT ATTATCCCCA AAATCTGGAA GTCATCATCA 61 CAAGAACATC TACTTCAAAT TACTAAATCT AAGAGTATCT AAGAAGCTAA AATACATAAA 121 CAGCTAGTCC ATGCCGGAAC TTCAAGGCAT CAAGACATGA AGAAGAAGAT CCAGTCCAAG 181 CTAGAAGCGT TAGCTCACCC TGAAATCCGA TGTAATGAAG ACTGGCTAGA GTTGCGGTTG 241 AGTTGAAGAC GACGGTACGT TTGCCAAAAT TACGACAGTA TTTGGACAAG CTAGAAGAG Predicted gene structure (within gDNA segment 3747 to 888): Exon 1 2821 2561 ( 261 n); cDNA 1 262 ( 262 n); score: 0.818 MATCH C06HBa0153O03.1-7- SGN-E373116+ 0.818 261 0.873 C PGS_C06HBa0153O03.1-7-_SGN-E373116+ (2821 2561) Alignment (genomic DNA sequence = upper lines): TTCATTAACT TCTAAAATTC AACATCTATT ATT-TCCC-A AAATCTGGAA GTCATCATCA 2764 || || | |||| || |||| ||||| ||| |||| | |||||||||| |||||||||| TTTTTTTTTT CTAAAAACTC AACAACTATT ATTATCCCCA AAATCTGGAA GTCATCATCA 60 CAAGAACATC TACGATCAAA TGACTAAA-C TAAGAGTATT CTAA-AAGCT AAAAATACAT 2706 |||||||||| ||| ||||| | |||||| | |||||||| | |||| ||||| ||||||||| CAAGAACATC TAC-TTCAAA TTACTAAATC TAAGAGTA-T CTAAGAAGCT -AAAATACAT 117 AAGAAGTTAG TCCATGCCGG AAGTTCAAGG CATCAAGACT TGAAGAAGAA GATCCAGTCC 2646 || || ||| |||||||||| || ||||||| ||||||||| |||||||||| |||||||||| AAACAGCTAG TCCATGCCGG AACTTCAAGG CATCAAGACA TGAAGAAGAA GATCCAGTCC 177 AAGCTAGAGG CATTAGCTTA CCCTGAATTT TCGATGTAGT -AAGACTGGC TTGAATTACT 2587 |||||||| | | |||||| | ||||||| | ||||||| | ||||||||| | || || | AAGCTAGAAG CGTTAGCTCA CCCTGAA-AT CCGATGTAAT GAAGACTGGC TAGAGTTGCG 236 GTTGAGTTGA GGACGATGAC ACGTTT 2561 |||||||||| ||||| | |||||| GTTGAGTTGA AGACGACGGT ACGTTT 262 hqPGS_C06HBa0153O03.1-7-_SGN-E373116+ (2821 2561) ******************************************************************************** EST sequence 138 +strand 265 n (File: SGN-E216150+) 1 TTTTTTTTTT TTTTTTTTAA TAAAAATCAA TAATCAACTT GTATAACTCA AAACTTATCA 61 TTCCCCAAAA TCTGGAAGTC ATCATCACCA GAGCCTCTAT CATAAAATTA CTAAACTAAG 121 AGTATTCTAA GAAGCTAAAA ATACATACGA AGCTAGTCCA TGCCGGAAGT TCAAGGCATC 181 AAGACTTGAA GAAGAAGATC CAGTCAAACC TAGAAGCATT AGCTCACCCT GAATTTCCGA 241 TGTAGTAGGA CTGGCTTGAG TTACT Predicted gene structure (within gDNA segment 4431 to 1437): Exon 1 3310 3293 ( 18 n); cDNA 1 18 ( 18 n); score: 0.833 Intron 1 3292 2834 ( 459 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.74) Exon 2 2833 2587 ( 247 n); cDNA 19 265 ( 247 n); score: 0.872 MATCH C06HBa0153O03.1-7- SGN-E216150+ 0.872 265 1.000 C PGS_C06HBa0153O03.1-7-_SGN-E216150+ (3310 3293,2833 2587) Alignment (genomic DNA sequence = upper lines): TTTATTTTTA TTTTTATTAT TCTAAAATGT TCCAGAAGTG TATGTTTTAT TTTTTATTAT 3251 ||| ||||| ||||| || TTTTTTTTTT TTTTTTTT.. .......... .......... .......... .......... 18 TATATTTCAT TTTCAACCTT ACTTCAAATG ATTCTATGCA ATCTTTTTGT ATTATCTATT 3191 .......... .......... .......... .......... .......... .......... 18 TTTTTATTTT ATGCTTATTT TCTCATTCGC ATAATTTTAT TAGTATTCTA CTTTTCAAAT 3131 .......... .......... .......... .......... .......... .......... 18 TAATTTATTG TTAAGTAAGG TTATATATTC ATGTCTTCTT TGTTTCTAAA AAAAATCATT 3071 .......... .......... .......... .......... .......... .......... 18 TTTATGTAAG CTATGTTAAA ATTGGCTATA AGAGTATTGT GTTAAATTGT TGTAGACATG 3011 .......... .......... .......... .......... .......... .......... 18 TATTTTTTGA CCGAATTTAA ATTTTATACC TTTTTTAGAG CTTAAATATG TCACGACCCA 2951 .......... .......... .......... .......... .......... .......... 18 AACCGGGTTG CGACTGGCAC CCACACTTTC CCTCCTATGT GAGCGAACCA ACCAATCTAA 2891 .......... .......... .......... .......... .......... .......... 18 CCTTAACATT TCAATATAAT ATCAACAGAA AGTAATGCGG AAGACTTAAA CTCATCAAAT 2831 ||| .......... .......... .......... .......... .......... .......AAT 21 AAAGACCAAT TCATTAACTT CTAAAATTCA ACATCTATTA TTTCCCAAAA TCTGGAAGTC 2771 ||| | ||| | || ||||| || || ||| | | ||| | || ||||||| |||||||||| AAAAATCAA- TAATCAACTT GTATAACTCA AAACTTATCA TTCCCCAAAA TCTGGAAGTC 80 ATCATCACAA GAACATCTAC GATCAAATGA CTAAACTAAG AGTATTCTAA -AAGCTAAAA 2712 |||||||| | || | |||| || |||| | |||||||||| |||||||||| ||||||||| ATCATCACCA GAGCCTCTAT CATAAAATTA CTAAACTAAG AGTATTCTAA GAAGCTAAAA 140 ATACATAAGA AGTTAGTCCA TGCCGGAAGT TCAAGGCATC AAGACTTGAA GAAGAAGATC 2652 ||||||| || || ||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATACATACGA AGCTAGTCCA TGCCGGAAGT TCAAGGCATC AAGACTTGAA GAAGAAGATC 200 CAGTCCAAGC TAGAGGCATT AGCTTACCCT GAATTTTCGA TGTAGTAAGA CTGGCTTGAA 2592 ||||| || | |||| ||||| |||| ||||| |||||| ||| ||||||| || ||||||||| CAGTCAAACC TAGAAGCATT AGCTCACCCT GAATTTCCGA TGTAGTAGGA CTGGCTTGAG 260 TTACT 2587 ||||| TTACT 265 hqPGS_C06HBa0153O03.1-7-_SGN-E216150+ (2833 2587) ******************************************************************************** EST sequence 15 -strand 402 n (File: SGN-E352844-) 1 TTTTTTTTAT AAAAACCAAT TCAATAACTA TTATTTCCCA AAATCTGGAA GTTATCATCA 61 CAAGAACATC TACTTCGAAT TACTAAATCT AAGAGTATCT AAGAAGCTAA AATACATAAA 121 CAGCTAGTCC ATGCCGGAAC TTCAAGGCAT CAAGACATGA AGAAGAAGAT CCAGTCCAAG 181 CTAGAAGCTT TGTTTTATCG AAAAAAGGTG ATTTTTCGAA AAGAGTTTGT TTTATTTTAA 241 AGTATTTTTC GACTTTAGGA GTCGCCACTT AATTTTTAAG AAAAATCAAG AAAACTCATT 301 CTCAAAACAA TTTAAACAGA AAAGTCGTTT TGAAAATATT TTTTAGGATT CGGGATTCTT 361 ATTAGCGTCT TAGGAAGGTG TTTAAGGCAC CTAAGACACT CC Predicted gene structure (within gDNA segment 3676 to 1): Exon 1 2821 2638 ( 184 n); cDNA 3 185 ( 183 n); score: 0.842 MATCH C06HBa0153O03.1-7- SGN-E352844- 0.842 184 0.458 C PGS_C06HBa0153O03.1-7-_SGN-E352844- (2821 2638) Alignment (genomic DNA sequence = upper lines): TTCATTAACT TCTAAAATTC AACATCTATT ATTTCCCAAA ATCTGGAAGT CATCATCACA 2762 || ||| ||||| || | ||||| |||||||||| |||||||||| ||||||||| TTTTTTATAA AAACCAATTC AATAACTATT ATTTCCCAAA ATCTGGAAGT TATCATCACA 62 AGAACATCTA CGATCAAATG ACTAAA-CTA AGAGTATTCT AA-AAGCTAA AAATACATAA 2704 |||||||||| | || ||| |||||| ||| |||||| ||| || ||||| | |||||||||| AGAACATCTA C-TTCGAATT ACTAAATCTA AGAGTA-TCT AAGAAGCT-A AAATACATAA 119 GAAGTTAGTC CATGCCGGAA GTTCAAGGCA TCAAGACTTG AAGAAGAAGA TCCAGTCCAA 2644 || ||||| |||||||||| ||||||||| ||||||| || |||||||||| |||||||||| ACAGCTAGTC CATGCCGGAA CTTCAAGGCA TCAAGACATG AAGAAGAAGA TCCAGTCCAA 179 GCTAGA 2638 |||||| GCTAGA 185 hqPGS_C06HBa0153O03.1-7-_SGN-E352844- (2821 2638) ******************************************************************************** EST sequence 26 -strand 666 n (File: SGN-E368629-) 1 TTTTTTTTTT TTTTTTTTTT TTTTTTATAA AAACCAATTC AATAACTATT ATTTCCCAAA 61 ATCTGGAAGT TATCATCACA AGAACATCTA CTTCGAATTA CTAAATCTAG AAGTATCTAA 121 GAGCCTAAAA TACATAACAC AGTTAGTCCA TGCCGAAACT TCAAGGCATC AAGACATAAA 181 GAAGAAGATC CAGTCCAAGC TAGAAGCTTT GTTTTATCGA AAAAAGGTGA TTTTTCGAAA 241 AGAGTTTGTT TTATTTTAAA GTATTTTTCG ACTTTAGGAG TCGCCACTTA ATTTTTAAGA 301 AAAATCAAGA AAACTCATTC TCAAAACAAT TTAAACAGAA AAGTCGTTTT GAAAATATTT 361 TTTAGGATTC GGGATTCTTA TTAGCGTCTT AGGAAGGTGT TTAAGGCACC TAAGACACTC 421 CGTTAAATAC GGTTTTCCAA CGACTAACTT ATTTGATTAT TTTTATTTTT ACCCTTTGCA 481 AATTTATTTG AACTTTTATC ACGATTTACT TAGCCAAACT TTGCAAATTT GAGATATTAA 541 TCTTTTAAGA TTCCGTCTTA GTTAAACTTT CTAAGCCTTA ACTCTCTAAG CAGACTTTCA 601 AATTTTAAAC CTCTATCGTT TCAAAACTTC AATTTTTATT TTTTAGTTTC ATAAAGCAAA 661 AGGCGT Predicted gene structure (within gDNA segment 3856 to 1): Exon 1 2806 2638 ( 169 n); cDNA 36 204 ( 169 n); score: 0.864 PPA cDNA 28 1 MATCH C06HBa0153O03.1-7- SGN-E368629- 0.864 169 0.254 C PGS_C06HBa0153O03.1-7-_SGN-E368629- (2806 2638) Alignment (genomic DNA sequence = upper lines): AATTCAACAT CTATTATTTC CCAAAATCTG GAAGTCATCA TCACAAGAAC ATCTACGATC 2747 ||||||| | |||||||||| |||||||||| ||||| |||| |||||||||| |||||| || AATTCAATAA CTATTATTTC CCAAAATCTG GAAGTTATCA TCACAAGAAC ATCTAC-TTC 94 AAATGACTAA A-CTAAGAGT ATTCTAAAAG CTAAAAATAC ATAAGA-AGT TAGTCCATGC 2689 ||| ||||| | ||| ||| | ||||| || | ||||||| |||| | ||| |||||||||| GAATTACTAA ATCTAGAAGT A-TCTAAGAG CCTAAAATAC ATAACACAGT TAGTCCATGC 153 CGGAAGTTCA AGGCATCAAG ACTTGAAGAA GAAGATCCAG TCCAAGCTAG A 2638 || || |||| |||||||||| || | ||||| |||||||||| |||||||||| | CGAAACTTCA AGGCATCAAG ACATAAAGAA GAAGATCCAG TCCAAGCTAG A 204 hqPGS_C06HBa0153O03.1-7-_SGN-E368629- (2806 2638) ******************************************************************************** EST sequence 23 -strand 620 n (File: SGN-E238551-) 1 CTATTATTTC CCAAAATCTG GAAGTTATCA TCACAAGAAC ATCTACTTCG AATTACTAAA 61 TCTAAGAGTA TCTAAGAAGC TAAAATACAT AAACAGCTAG TCCATGCCGG AACTTCAAGG 121 CATCAAGACA TGAAGAAGAA GATCCAGTCC AAGCTAGAAG CTTTGTTTTA TCGAAAAAAG 181 GTGATTTTTC GAAAAGAGTT TGTTTTATTT TAAAGTATTT TTCGACTTTA GGAGTCGCCA 241 CTTAATTTTT AAGAAAAATC AAGAAAACTC ATTCTCAAAA CAATTTAAAC AGAAAAGTCG 301 TTTTGAAAAT ATTTTTTAGG ATTCGGGATT CTTATTAGCG TCTTAGGAAG GTGTTTAAGG 361 CACCTAAGAC ACTCCGTTAA ATACGGTTTT CCAACGACTA ACTTATTTGA TTATTTTTAT 421 TTTTACCCTT TGCAAATTTA TTTGAACTTT TATCACGATT TACTTAGCCA AACTTTGCAA 481 ATTTGAGATA TTAATCTTTT AAGATTCCGT CTTAGTTAAA CTTTCTAAGC CTTAACTCTC 541 TAAGCAGACT TTCAAATTTT AAACCTCTAT CGTTTCAAAA CTTCAATTTT TATTTTTTAG 601 TTTCATAAAG CAAAAGGCGT Predicted gene structure (within gDNA segment 3406 to 1): Exon 1 2796 2638 ( 159 n); cDNA 1 158 ( 158 n); score: 0.890 MATCH C06HBa0153O03.1-7- SGN-E238551- 0.890 159 0.256 C PGS_C06HBa0153O03.1-7-_SGN-E238551- (2796 2638) Alignment (genomic DNA sequence = upper lines): CTATTATTTC CCAAAATCTG GAAGTCATCA TCACAAGAAC ATCTACGATC AAATGACTAA 2737 |||||||||| |||||||||| ||||| |||| |||||||||| |||||| || ||| ||||| CTATTATTTC CCAAAATCTG GAAGTTATCA TCACAAGAAC ATCTAC-TTC GAATTACTAA 59 A-CTAAGAGT ATTCTAAAAG CTAAAAATAC ATAAGAAGTT AGTCCATGCC GGAAGTTCAA 2678 | |||||||| || | ||| || ||||||| |||| || | |||||||||| |||| ||||| ATCTAAGAGT ATCTAAGAAG CT-AAAATAC ATAAACAGCT AGTCCATGCC GGAACTTCAA 118 GGCATCAAGA CTTGAAGAAG AAGATCCAGT CCAAGCTAGA 2638 |||||||||| | |||||||| |||||||||| |||||||||| GGCATCAAGA CATGAAGAAG AAGATCCAGT CCAAGCTAGA 158 hqPGS_C06HBa0153O03.1-7-_SGN-E238551- (2796 2638) ******************************************************************************** EST sequence 19 -strand 572 n (File: SGN-E351583-) 1 TAAGCATGCT ATGGAAATTA TCCATTTGTT GGCTGCCCTA AACCCAATCC AAGTGATGGT 61 TGATGCTGTT ATCAACAGTG GCCCAAGAGA AGATGCAACT CGTATAGGTT CTGCTGGTGT 121 TGTGAGGCGA CAAGCTGTTG ATATTTCTCC ACTCCGTCGT GTCAACCAAG CAATATATCT 181 CCTCACAACT GGTGCACGTG AGAGTGCTTT CAGGAACATC AAGCCCATAG CAGAATCCCT 241 TGCAGATGAA CTCATTAATG CTGCCAAGGG ATTTTCCACC AGTTATGCTA TCAAGAAGAA 301 GGATGAGATT GAGAGGGTTG CCAAGGCCAA TCGTTGAGGG TGCAGTATGG ATATTTACTA 361 TTGGTTGGCC AGTTTTGCTT CGAAACGTTG TTTGTTCTTT ATTTTTTAGT TGCTAGAAAG 421 GCATTTTGGA ATTAGTAACG AGTTTTTTTG TTTGGGAAAC TTGGAGTCCA TTGGTATTGA 481 ATATTATATG GGGGAATTAG CAAAAAGCAA AATTGAGCTT GCTGTTATTC TCAAAAAAAA 541 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AA Predicted gene structure (within gDNA segment 7188 to 4777): Exon 1 6578 6501 ( 78 n); cDNA 1 78 ( 78 n); score: 0.949 Intron 1 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.98) Exon 2 6321 6118 ( 204 n); cDNA 79 282 ( 204 n); score: 0.975 Intron 2 6117 6018 ( 100 n); Pd: 1.000 (s: 0.94), Pa: 1.000 (s: 0.98) Exon 3 6017 5767 ( 251 n); cDNA 283 533 ( 251 n); score: 0.980 PPA cDNA 534 572 MATCH C06HBa0153O03.1-7- SGN-E351583- 0.974 533 0.932 C PGS_C06HBa0153O03.1-7-_SGN-E351583- (6578 6501,6321 6118,6017 5767) Alignment (genomic DNA sequence = upper lines): TAAGCATGCT ATGGAAATTA TCCATCTGTT GACTGACCTA AACCCAATCC AAGTGATTGT 6519 |||||||||| |||||||||| ||||| |||| | ||| |||| |||||||||| ||||||| || TAAGCATGCT ATGGAAATTA TCCATTTGTT GGCTGCCCTA AACCCAATCC AAGTGATGGT 60 TGATGCTGTT ATCAACAGGT TTAGAGATTA TTCTGATTTT TGCATATTTA TTAGCTCGAG 6459 |||||||||| |||||||| TGATGCTGTT ATCAACAG.. .......... .......... .......... .......... 78 TTTTTCTTGC TGAGGTCTTG TTAATTAGAA GATTTTCATA CCATGTCTTC TTTGTTCCAT 6399 .......... .......... .......... .......... .......... .......... 78 TTCCATGTCG CGGCATACTT GAGATATTGT AGTCATTCTC ATTTTTTCCT TCCCATATTC 6339 .......... .......... .......... .......... .......... .......... 78 TTACCTATGT GATGCAGTGG ACCAAGAGAA GATGCAACTC GTATAGGTTC TGCTGGTGTT 6279 ||| ||||||||| |||||||||| |||||||||| |||||||||| .......... .......TGG CCCAAGAGAA GATGCAACTC GTATAGGTTC TGCTGGTGTT 121 GTGAGGCGAC AAGCTGTTGA TATTTCTCCA CTCCGTCGTG TCAACCAAGC AATATATCTC 6219 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTGAGGCGAC AAGCTGTTGA TATTTCTCCA CTCCGTCGTG TCAACCAAGC AATATATCTC 181 CTCACAACTG GTGCACGTGA GAGTGCTTTC AGGAACATCA AGACCATAGC AGAATGCCTT 6159 |||||||||| |||||||||| |||||||||| |||||||||| || ||||||| ||||| |||| CTCACAACTG GTGCACGTGA GAGTGCTTTC AGGAACATCA AGCCCATAGC AGAATCCCTT 241 GCAGATGAAC TCATTAATGC TGCCAAGGGA TCTTCCAACA GGTAATCTTT TCTATTGCCA 6099 |||||||||| |||||||||| |||||||||| | ||||| || | GCAGATGAAC TCATTAATGC TGCCAAGGGA TTTTCCACCA G......... .......... 282 TCTTTTTACT CCTATATGCG TTTAATCCTT AATAAATGTA ACTATATTCT CTGCCTACTT 6039 .......... .......... .......... .......... .......... .......... 282 ATTCATTCTA TGTACGTGTA GCTATGCTAT CAAGAAGAAG GATGAGATTG AGAGGGTTGC 5979 |||||||| |||||||||| |||||||||| |||||||||| .......... .......... .TTATGCTAT CAAGAAGAAG GATGAGATTG AGAGGGTTGC 321 CAAGGCCAAT CGTTGAGGGT GCAGTATGGA TATTTACTAT TGGTTGGACA GTTTTGCTTC 5919 |||||||||| |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| CAAGGCCAAT CGTTGAGGGT GCAGTATGGA TATTTACTAT TGGTTGGCCA GTTTTGCTTC 381 GAAACGTTGT TTGTTCTTTA TTTTTTAGTT GCTAGAAAGG CATTTTGGAA CTAGTAACGA 5859 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| GAAACGTTGT TTGTTCTTTA TTTTTTAGTT GCTAGAAAGG CATTTTGGAA TTAGTAACGA 441 GTTTTTCTGT TTGGGAAACT TGGAGTACAT TGGTATTGAA TATTATATGG GGGAATTAGC 5799 |||||| ||| |||||||||| |||||| ||| |||||||||| |||||||||| |||||||||| GTTTTTTTGT TTGGGAAACT TGGAGTCCAT TGGTATTGAA TATTATATGG GGGAATTAGC 501 AAAAAGCAAA ATTGAGCTTG CTGTTATTCT CA 5767 |||||||||| |||||||||| |||||||||| || AAAAAGCAAA ATTGAGCTTG CTGTTATTCT CA 533 hqPGS_C06HBa0153O03.1-7-_SGN-E351583- (6578 6501,6321 6118,6017 5767) ******************************************************************************** EST sequence 11 -strand 334 n (File: SGN-E217332-) 1 GGAACATCAA GACCATAGCA GAATGCCTTG CAGATGAACT CATTAATGCT GCCAAGGGAT 61 CTTCCAACAG CTATGCTATC AAGAAGAAGG ATGAGATTGA GAGGGTTGCC AAGGCCAATC 121 GTTGAGGGTG CAGTATGGAT ATTTACTATT GGTTGGACAG TTTTGCTTCG AAACGTTGTT 181 TGTTCTTTAT TTTTTAGTTG CTAGAAAGGC ATTTTGGAAC TAGTAACGAG TTTTTCTGTT 241 TGGGAAACTT GGAGTACATT GGTATTGAAT ATTATATGGG GGAATTAGCA AAAAGCAAAA 301 TTGAGCTTGC TGTTAAAAAA AAAAAAAAAA AAAA Predicted gene structure (within gDNA segment 6797 to 4983): Exon 1 6187 6118 ( 70 n); cDNA 1 70 ( 70 n); score: 1.000 Intron 1 6117 6018 ( 100 n); Pd: 1.000 (s: 1.00), Pa: 1.000 (s: 1.00) Exon 2 6017 5774 ( 244 n); cDNA 71 314 ( 244 n); score: 1.000 PPA cDNA 315 334 MATCH C06HBa0153O03.1-7- SGN-E217332- 1.000 314 0.940 C PGS_C06HBa0153O03.1-7-_SGN-E217332- (6187 6118,6017 5774) Alignment (genomic DNA sequence = upper lines): GGAACATCAA GACCATAGCA GAATGCCTTG CAGATGAACT CATTAATGCT GCCAAGGGAT 6128 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGAACATCAA GACCATAGCA GAATGCCTTG CAGATGAACT CATTAATGCT GCCAAGGGAT 60 CTTCCAACAG GTAATCTTTT CTATTGCCAT CTTTTTACTC CTATATGCGT TTAATCCTTA 6068 |||||||||| CTTCCAACAG .......... .......... .......... .......... .......... 70 ATAAATGTAA CTATATTCTC TGCCTACTTA TTCATTCTAT GTACGTGTAG CTATGCTATC 6008 |||||||||| .......... .......... .......... .......... .......... CTATGCTATC 80 AAGAAGAAGG ATGAGATTGA GAGGGTTGCC AAGGCCAATC GTTGAGGGTG CAGTATGGAT 5948 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAGAAGAAGG ATGAGATTGA GAGGGTTGCC AAGGCCAATC GTTGAGGGTG CAGTATGGAT 140 ATTTACTATT GGTTGGACAG TTTTGCTTCG AAACGTTGTT TGTTCTTTAT TTTTTAGTTG 5888 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTTACTATT GGTTGGACAG TTTTGCTTCG AAACGTTGTT TGTTCTTTAT TTTTTAGTTG 200 CTAGAAAGGC ATTTTGGAAC TAGTAACGAG TTTTTCTGTT TGGGAAACTT GGAGTACATT 5828 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTAGAAAGGC ATTTTGGAAC TAGTAACGAG TTTTTCTGTT TGGGAAACTT GGAGTACATT 260 GGTATTGAAT ATTATATGGG GGAATTAGCA AAAAGCAAAA TTGAGCTTGC TGTT 5774 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||| GGTATTGAAT ATTATATGGG GGAATTAGCA AAAAGCAAAA TTGAGCTTGC TGTT 314 hqPGS_C06HBa0153O03.1-7-_SGN-E217332- (6187 6118,6017 5774) ******************************************************************************** EST sequence 72 +strand 683 n (File: SGN-E262562+) 1 AAAAATGGAA GAAGCTTCAG TAGTAGCAGT GGACAACCAA AAGCCGCAGC AAGAGAAGCC 61 TCACACTGAT GTTTTGCTTT TCAATCGTTG GTCATATGAT GATGTTCAGA TTGCTGATAT 121 TTCTGTTGAG GATTACATAA CTGCTACTGC TAACAAGCAT CCTACATATA CACCACACAC 181 AGCTGGGAGG TACCAAGCCA AGCGGTTTAG AAAGGCTCAA TGCCCAATTG TGGAGAGGTT 241 GACCAACTCA CTGATGATGC ACGGAAGGAA CAACGGGAAG AAGTTGATGG CCGTTCGTAT 301 TATTAAGCAT GCTATGGAAA TTATCCATCT GTTGACTGAC CTAAACCCAA TCCAAGTGAT 361 TGTTGATGCT GTTATCAACA GTGGACCAAG AGAAGATGCA ACTCGTATAG GTTCTGCTGG 421 TGTTGTGAGG CGACAAGCTG TTGATATTTC TCCACTCCGT CGTGTCAACC AAGCAATATA 481 TCTCCTCACA ACTGGTGCAC GTGAGAGTGC TTTCAGGAAC ATCAAGACCA TAGCAGAATG 541 CCTTGCAGAT GAACTCATTA ATGCTGCCAA GGGATCTTCC AACAGCTATG CTATCAAGAA 601 GAAGGATGAG ATTGAGAGGG TTGCCAANGC CAATCGTTGA GGGTGCAGTA TGGATATTTA 661 CTATTGGTTG GACAGTTTTG CTT Predicted gene structure (within gDNA segment 8193 to 5310): Exon 1 8022 7914 ( 109 n); cDNA 1 109 ( 109 n); score: 1.000 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 1.00), Pa: 1.000 (s: 1.00) Exon 2 6772 6501 ( 272 n); cDNA 110 381 ( 272 n); score: 1.000 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 1.00), Pa: 0.999 (s: 1.00) Exon 3 6321 6118 ( 204 n); cDNA 382 585 ( 204 n); score: 1.000 Intron 3 6117 6018 ( 100 n); Pd: 1.000 (s: 1.00), Pa: 1.000 (s: 0.98) Exon 4 6017 5920 ( 98 n); cDNA 586 683 ( 98 n); score: 0.990 MATCH C06HBa0153O03.1-7- SGN-E262562+ 0.999 683 1.000 C PGS_C06HBa0153O03.1-7-_SGN-E262562+ (8022 7914,6772 6501,6321 6118,6017 5920) Alignment (genomic DNA sequence = upper lines): AAAAATGGAA GAAGCTTCAG TAGTAGCAGT GGACAACCAA AAGCCGCAGC AAGAGAAGCC 7963 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAAATGGAA GAAGCTTCAG TAGTAGCAGT GGACAACCAA AAGCCGCAGC AAGAGAAGCC 60 TCACACTGAT GTTTTGCTTT TCAATCGTTG GTCATATGAT GATGTTCAGG TTTGTTTGTT 7903 |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| TCACACTGAT GTTTTGCTTT TCAATCGTTG GTCATATGAT GATGTTCAG. .......... 109 TCCCTTTCAA TTTTATTCCT CTCCAGTTCC TATATCTTTT CATTATTTGC CTAACATTAA 7843 .......... .......... .......... .......... .......... .......... 109 TGTCGAATTG GATGAAACTT GGCATTTTCG AAATCATAAG ATGAACATTT GAATTATTTT 7783 .......... .......... .......... .......... .......... .......... 109 GTTTCTTGCG TTAGCTAAAC TCTAATTGTA GTGTAGCAGA GGTGATATAT CAGTAAGGGT 7723 .......... .......... .......... .......... .......... .......... 109 GGGCATGGTA GGGTAGATAC CGAAACCAAA ATTTTTCACT CAATGGTTTC AATATCATGA 7663 .......... .......... .......... .......... .......... .......... 109 CATTTGATAT TATTTATAAT GTATATCGAA TCACCAAATA CTTTAACAGA GTGTATAGTT 7603 .......... .......... .......... .......... .......... .......... 109 AGGTATCCAA TTCATTTATC GTATTATAAT ACTAACAAAT ATATTAACTA GTATTAGTTC 7543 .......... .......... .......... .......... .......... .......... 109 AAAGTTGTTT AGACATTGAA AGCTTTGACT ACTCTTTTCT TGTTAGAATT GTCCTTTTTG 7483 .......... .......... .......... .......... .......... .......... 109 TGTAATTGAT TAAGTGATGG AATTGCTTCT TCTTTCTTTT GAATATTTTT ACATGAGTAA 7423 .......... .......... .......... .......... .......... .......... 109 GATCTTTATA TGATATAATT AAGAAGTTTC TAAAGAAACC AAAACATAAT TCTCTATTTA 7363 .......... .......... .......... .......... .......... .......... 109 TATGAGTATA TGTAAGTCGA AGTCGAACAA ACAATGGTTA CCAACCAAAA GTTAAAAAGT 7303 .......... .......... .......... .......... .......... .......... 109 ATCGGCACAT AATGGTTTAA TTTGATATGG TAATGGTATA GTACTTTTAA AAATCAAAAT 7243 .......... .......... .......... .......... .......... .......... 109 TATTGAACCA AAGTTTTCAA TATTGTATCA TACCTTTCCA TGCTCATCCC TACATATCAG 7183 .......... .......... .......... .......... .......... .......... 109 TTCTCAAGTC CAATGCATTG AATACTTAAC CATGGTTAGG AAACTTGAAA CACTATGCAC 7123 .......... .......... .......... .......... .......... .......... 109 GACACTGCTT AGGTATGTCT ATCAACTATA AAGCCTGCTG GCTTGATCTT CTTATTCAAA 7063 .......... .......... .......... .......... .......... .......... 109 GAAACATGCA TGCTAAACAT GATATGATTA AGTTGAACAG AATAGTGTTG GTTTCCCCAA 7003 .......... .......... .......... .......... .......... .......... 109 TCCATAACAA GCCAACTGGG ACAACCTTAC AGAAGGTGTG CCTATTCATC ATTGTTGCCT 6943 .......... .......... .......... .......... .......... .......... 109 TGTAAATGAT GGATTTATAC AACTGAAAAT TACTTGCTGA GAGTTCAGGG AAATCCTTGT 6883 .......... .......... .......... .......... .......... .......... 109 TGGTTAAGTT GGAAATGTAA TTGTAGGTGG ATTCTTCATT GGAATGCTCA AAGGAGAAAT 6823 .......... .......... .......... .......... .......... .......... 109 TCAGTATATG ATCTCTTGAA TTCTCTCTTA AATGTTATTA TCTCATGCAG ATTGCTGATA 6763 |||||||||| .......... .......... .......... .......... .......... ATTGCTGATA 119 TTTCTGTTGA GGATTACATA ACTGCTACTG CTAACAAGCA TCCTACATAT ACACCACACA 6703 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTCTGTTGA GGATTACATA ACTGCTACTG CTAACAAGCA TCCTACATAT ACACCACACA 179 CAGCTGGGAG GTACCAAGCC AAGCGGTTTA GAAAGGCTCA ATGCCCAATT GTGGAGAGGT 6643 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAGCTGGGAG GTACCAAGCC AAGCGGTTTA GAAAGGCTCA ATGCCCAATT GTGGAGAGGT 239 TGACCAACTC ACTGATGATG CACGGAAGGA ACAACGGGAA GAAGTTGATG GCCGTTCGTA 6583 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGACCAACTC ACTGATGATG CACGGAAGGA ACAACGGGAA GAAGTTGATG GCCGTTCGTA 299 TTATTAAGCA TGCTATGGAA ATTATCCATC TGTTGACTGA CCTAAACCCA ATCCAAGTGA 6523 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTATTAAGCA TGCTATGGAA ATTATCCATC TGTTGACTGA CCTAAACCCA ATCCAAGTGA 359 TTGTTGATGC TGTTATCAAC AGGTTTAGAG ATTATTCTGA TTTTTGCATA TTTATTAGCT 6463 |||||||||| |||||||||| || TTGTTGATGC TGTTATCAAC AG........ .......... .......... .......... 381 CGAGTTTTTC TTGCTGAGGT CTTGTTAATT AGAAGATTTT CATACCATGT CTTCTTTGTT 6403 .......... .......... .......... .......... .......... .......... 381 CCATTTCCAT GTCGCGGCAT ACTTGAGATA TTGTAGTCAT TCTCATTTTT TCCTTCCCAT 6343 .......... .......... .......... .......... .......... .......... 381 ATTCTTACCT ATGTGATGCA GTGGACCAAG AGAAGATGCA ACTCGTATAG GTTCTGCTGG 6283 ||||||||| |||||||||| |||||||||| |||||||||| .......... .......... .TGGACCAAG AGAAGATGCA ACTCGTATAG GTTCTGCTGG 420 TGTTGTGAGG CGACAAGCTG TTGATATTTC TCCACTCCGT CGTGTCAACC AAGCAATATA 6223 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGTTGTGAGG CGACAAGCTG TTGATATTTC TCCACTCCGT CGTGTCAACC AAGCAATATA 480 TCTCCTCACA ACTGGTGCAC GTGAGAGTGC TTTCAGGAAC ATCAAGACCA TAGCAGAATG 6163 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCTCCTCACA ACTGGTGCAC GTGAGAGTGC TTTCAGGAAC ATCAAGACCA TAGCAGAATG 540 CCTTGCAGAT GAACTCATTA ATGCTGCCAA GGGATCTTCC AACAGGTAAT CTTTTCTATT 6103 |||||||||| |||||||||| |||||||||| |||||||||| ||||| CCTTGCAGAT GAACTCATTA ATGCTGCCAA GGGATCTTCC AACAG..... .......... 585 GCCATCTTTT TACTCCTATA TGCGTTTAAT CCTTAATAAA TGTAACTATA TTCTCTGCCT 6043 .......... .......... .......... .......... .......... .......... 585 ACTTATTCAT TCTATGTACG TGTAGCTATG CTATCAAGAA GAAGGATGAG ATTGAGAGGG 5983 ||||| |||||||||| |||||||||| |||||||||| .......... .......... .....CTATG CTATCAAGAA GAAGGATGAG ATTGAGAGGG 620 TTGCCAAGGC CAATCGTTGA GGGTGCAGTA TGGATATTTA CTATTGGTTG GACAGTTTTG 5923 ||||||| || |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGCCAANGC CAATCGTTGA GGGTGCAGTA TGGATATTTA CTATTGGTTG GACAGTTTTG 680 CTT 5920 ||| CTT 683 hqPGS_C06HBa0153O03.1-7-_SGN-E262562+ (8022 7914,6772 6501,6321 6118,6017 5920) ******************************************************************************** EST sequence 124 +strand 673 n (File: SGN-E287025+) 1 AAAAACAGGA AAAATGGAAG AAGCTTCAGT AGTAGCAGTG GACAACCAAA AGCCGCAGCA 61 AGAGAAGCCT CACACTGATG TTTTGCTTTT CAATCGTTGG TCATATGATG ATGTTCAGAT 121 TGCTGATATT TCTGTTGAGG ATTACATAAC TGCTACTGCT AACAAGCATC CTACATATAC 181 ACCACACACA GCTGGGAGGT ACCAAGCCAA GCGGTTTAGA AAGGCTCAAT GCCCAATTGT 241 GGAGAGGTTG ACCAACTCAC TGATGATGCA CGGAAGGAAC AACGGGAAGA AGTTGATGGC 301 CGTTCGTATT ATTAAGCATG CTATGGAAAT TATCCATCTG TTGACTGACC TAAACCCAAT 361 CCAAGTGATT GTTGATGCTG TTATCAACAG TGGACCGAGA GAAGATGCAA CTCGTATAGG 421 TTCTGCTGGT GTTGTGAGGC GACAAGCTGT TGATATTTCT CCACTCCGTC GTGTCAACCA 481 AGCAATATAT CTCCTCACAA CTGGTGCACG TGAGAGTGCT TTCAGGAACA TCAAGACCAT 541 AGCAGAATGC CTTGCAGATG AACTCATTAA TGCTGCCAAG GGATCTTCCA ACAGCTATGC 601 TATCAAGAAG AAGGATGAGA TTGAGAGGGT TGCCAAGGCC AATCGTTGAG GGTGCAGTAT 661 GGATATTTAC TAT Predicted gene structure (within gDNA segment 8193 to 5329): Exon 1 8031 7914 ( 118 n); cDNA 1 118 ( 118 n); score: 1.000 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 1.00), Pa: 1.000 (s: 1.00) Exon 2 6772 6501 ( 272 n); cDNA 119 390 ( 272 n); score: 1.000 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 1.00), Pa: 0.999 (s: 0.98) Exon 3 6321 6118 ( 204 n); cDNA 391 594 ( 204 n); score: 0.995 Intron 3 6117 6018 ( 100 n); Pd: 1.000 (s: 1.00), Pa: 1.000 (s: 1.00) Exon 4 6017 5939 ( 79 n); cDNA 595 673 ( 79 n); score: 1.000 MATCH C06HBa0153O03.1-7- SGN-E287025+ 0.999 673 1.000 C PGS_C06HBa0153O03.1-7-_SGN-E287025+ (8031 7914,6772 6501,6321 6118,6017 5939) Alignment (genomic DNA sequence = upper lines): AAAAACAGGA AAAATGGAAG AAGCTTCAGT AGTAGCAGTG GACAACCAAA AGCCGCAGCA 7972 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAAACAGGA AAAATGGAAG AAGCTTCAGT AGTAGCAGTG GACAACCAAA AGCCGCAGCA 60 AGAGAAGCCT CACACTGATG TTTTGCTTTT CAATCGTTGG TCATATGATG ATGTTCAGGT 7912 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||| AGAGAAGCCT CACACTGATG TTTTGCTTTT CAATCGTTGG TCATATGATG ATGTTCAG.. 118 TTGTTTGTTT CCCTTTCAAT TTTATTCCTC TCCAGTTCCT ATATCTTTTC ATTATTTGCC 7852 .......... .......... .......... .......... .......... .......... 118 TAACATTAAT GTCGAATTGG ATGAAACTTG GCATTTTCGA AATCATAAGA TGAACATTTG 7792 .......... .......... .......... .......... .......... .......... 118 AATTATTTTG TTTCTTGCGT TAGCTAAACT CTAATTGTAG TGTAGCAGAG GTGATATATC 7732 .......... .......... .......... .......... .......... .......... 118 AGTAAGGGTG GGCATGGTAG GGTAGATACC GAAACCAAAA TTTTTCACTC AATGGTTTCA 7672 .......... .......... .......... .......... .......... .......... 118 ATATCATGAC ATTTGATATT ATTTATAATG TATATCGAAT CACCAAATAC TTTAACAGAG 7612 .......... .......... .......... .......... .......... .......... 118 TGTATAGTTA GGTATCCAAT TCATTTATCG TATTATAATA CTAACAAATA TATTAACTAG 7552 .......... .......... .......... .......... .......... .......... 118 TATTAGTTCA AAGTTGTTTA GACATTGAAA GCTTTGACTA CTCTTTTCTT GTTAGAATTG 7492 .......... .......... .......... .......... .......... .......... 118 TCCTTTTTGT GTAATTGATT AAGTGATGGA ATTGCTTCTT CTTTCTTTTG AATATTTTTA 7432 .......... .......... .......... .......... .......... .......... 118 CATGAGTAAG ATCTTTATAT GATATAATTA AGAAGTTTCT AAAGAAACCA AAACATAATT 7372 .......... .......... .......... .......... .......... .......... 118 CTCTATTTAT ATGAGTATAT GTAAGTCGAA GTCGAACAAA CAATGGTTAC CAACCAAAAG 7312 .......... .......... .......... .......... .......... .......... 118 TTAAAAAGTA TCGGCACATA ATGGTTTAAT TTGATATGGT AATGGTATAG TACTTTTAAA 7252 .......... .......... .......... .......... .......... .......... 118 AATCAAAATT ATTGAACCAA AGTTTTCAAT ATTGTATCAT ACCTTTCCAT GCTCATCCCT 7192 .......... .......... .......... .......... .......... .......... 118 ACATATCAGT TCTCAAGTCC AATGCATTGA ATACTTAACC ATGGTTAGGA AACTTGAAAC 7132 .......... .......... .......... .......... .......... .......... 118 ACTATGCACG ACACTGCTTA GGTATGTCTA TCAACTATAA AGCCTGCTGG CTTGATCTTC 7072 .......... .......... .......... .......... .......... .......... 118 TTATTCAAAG AAACATGCAT GCTAAACATG ATATGATTAA GTTGAACAGA ATAGTGTTGG 7012 .......... .......... .......... .......... .......... .......... 118 TTTCCCCAAT CCATAACAAG CCAACTGGGA CAACCTTACA GAAGGTGTGC CTATTCATCA 6952 .......... .......... .......... .......... .......... .......... 118 TTGTTGCCTT GTAAATGATG GATTTATACA ACTGAAAATT ACTTGCTGAG AGTTCAGGGA 6892 .......... .......... .......... .......... .......... .......... 118 AATCCTTGTT GGTTAAGTTG GAAATGTAAT TGTAGGTGGA TTCTTCATTG GAATGCTCAA 6832 .......... .......... .......... .......... .......... .......... 118 AGGAGAAATT CAGTATATGA TCTCTTGAAT TCTCTCTTAA ATGTTATTAT CTCATGCAGA 6772 | .......... .......... .......... .......... .......... .........A 119 TTGCTGATAT TTCTGTTGAG GATTACATAA CTGCTACTGC TAACAAGCAT CCTACATATA 6712 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGCTGATAT TTCTGTTGAG GATTACATAA CTGCTACTGC TAACAAGCAT CCTACATATA 179 CACCACACAC AGCTGGGAGG TACCAAGCCA AGCGGTTTAG AAAGGCTCAA TGCCCAATTG 6652 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CACCACACAC AGCTGGGAGG TACCAAGCCA AGCGGTTTAG AAAGGCTCAA TGCCCAATTG 239 TGGAGAGGTT GACCAACTCA CTGATGATGC ACGGAAGGAA CAACGGGAAG AAGTTGATGG 6592 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGGAGAGGTT GACCAACTCA CTGATGATGC ACGGAAGGAA CAACGGGAAG AAGTTGATGG 299 CCGTTCGTAT TATTAAGCAT GCTATGGAAA TTATCCATCT GTTGACTGAC CTAAACCCAA 6532 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCGTTCGTAT TATTAAGCAT GCTATGGAAA TTATCCATCT GTTGACTGAC CTAAACCCAA 359 TCCAAGTGAT TGTTGATGCT GTTATCAACA GGTTTAGAGA TTATTCTGAT TTTTGCATAT 6472 |||||||||| |||||||||| |||||||||| | TCCAAGTGAT TGTTGATGCT GTTATCAACA G......... .......... .......... 390 TTATTAGCTC GAGTTTTTCT TGCTGAGGTC TTGTTAATTA GAAGATTTTC ATACCATGTC 6412 .......... .......... .......... .......... .......... .......... 390 TTCTTTGTTC CATTTCCATG TCGCGGCATA CTTGAGATAT TGTAGTCATT CTCATTTTTT 6352 .......... .......... .......... .......... .......... .......... 390 CCTTCCCATA TTCTTACCTA TGTGATGCAG TGGACCAAGA GAAGATGCAA CTCGTATAGG 6292 |||||| ||| |||||||||| |||||||||| .......... .......... .......... TGGACCGAGA GAAGATGCAA CTCGTATAGG 420 TTCTGCTGGT GTTGTGAGGC GACAAGCTGT TGATATTTCT CCACTCCGTC GTGTCAACCA 6232 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCTGCTGGT GTTGTGAGGC GACAAGCTGT TGATATTTCT CCACTCCGTC GTGTCAACCA 480 AGCAATATAT CTCCTCACAA CTGGTGCACG TGAGAGTGCT TTCAGGAACA TCAAGACCAT 6172 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGCAATATAT CTCCTCACAA CTGGTGCACG TGAGAGTGCT TTCAGGAACA TCAAGACCAT 540 AGCAGAATGC CTTGCAGATG AACTCATTAA TGCTGCCAAG GGATCTTCCA ACAGGTAATC 6112 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||| AGCAGAATGC CTTGCAGATG AACTCATTAA TGCTGCCAAG GGATCTTCCA ACAG...... 594 TTTTCTATTG CCATCTTTTT ACTCCTATAT GCGTTTAATC CTTAATAAAT GTAACTATAT 6052 .......... .......... .......... .......... .......... .......... 594 TCTCTGCCTA CTTATTCATT CTATGTACGT GTAGCTATGC TATCAAGAAG AAGGATGAGA 5992 |||||| |||||||||| |||||||||| .......... .......... .......... ....CTATGC TATCAAGAAG AAGGATGAGA 620 TTGAGAGGGT TGCCAAGGCC AATCGTTGAG GGTGCAGTAT GGATATTTAC TAT 5939 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||| TTGAGAGGGT TGCCAAGGCC AATCGTTGAG GGTGCAGTAT GGATATTTAC TAT 673 hqPGS_C06HBa0153O03.1-7-_SGN-E287025+ (8031 7914,6772 6501,6321 6118,6017 5939) ******************************************************************************** EST sequence 38 +strand 639 n (File: SGN-E347850+) 1 TTCAGTAGTA GCAGTGGACA ACCAAAAGCC GCAGCAAGAG AAGCCTCACA CTGATGTTTT 61 GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGATTGCT GATATTTCTG TTGAGGATTA 121 CATAACTGCT ACTGCTAACA AGCATCCTAC ATATACACCA CACACAGCTG GGAGGTACCA 181 AGCCAAGCGG TTTAGAAAGG CTCAATGCCC AATTGTGGAG AGGTTGACCA ACTCACTGAT 241 GATGCACGGA AGGAACAACG GGAAGAAGTT GATGGCCGTT CGTATTATTA AGCATGCTAT 301 GGAAATTATC CATCTGTTGA CTGACCTAAA CCCAATCCAA GTGATTGTTG ATGCTGTTAT 361 CAACAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT GCTGGTGTTG TGAGGCGACA 421 AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT CAACCAAGCA ATATATCTCC TCACAACTGG 481 TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCA GAATGCCTTG CAGATGAACT 541 CATTAATGCT GCCAAGGGAT CTTNCAACAG CTATGCTATC AAGAAGAAGG ATGAGATTGA 601 GAGGGTTGCC AAGGCCAATC GTTGAGGGTG CAGTATGGA Predicted gene structure (within gDNA segment 8193 to 5339): Exon 1 8007 7914 ( 94 n); cDNA 1 94 ( 94 n); score: 1.000 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 1.00), Pa: 1.000 (s: 1.00) Exon 2 6772 6501 ( 272 n); cDNA 95 366 ( 272 n); score: 1.000 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 1.00), Pa: 0.999 (s: 1.00) Exon 3 6321 6118 ( 204 n); cDNA 367 570 ( 204 n); score: 0.995 Intron 3 6117 6018 ( 100 n); Pd: 1.000 (s: 0.98), Pa: 1.000 (s: 1.00) Exon 4 6017 5949 ( 69 n); cDNA 571 639 ( 69 n); score: 1.000 MATCH C06HBa0153O03.1-7- SGN-E347850+ 0.998 639 1.000 C PGS_C06HBa0153O03.1-7-_SGN-E347850+ (8007 7914,6772 6501,6321 6118,6017 5949) Alignment (genomic DNA sequence = upper lines): TTCAGTAGTA GCAGTGGACA ACCAAAAGCC GCAGCAAGAG AAGCCTCACA CTGATGTTTT 7948 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCAGTAGTA GCAGTGGACA ACCAAAAGCC GCAGCAAGAG AAGCCTCACA CTGATGTTTT 60 GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT TTGTTTCCCT TTCAATTTTA 7888 |||||||||| |||||||||| |||||||||| |||| GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAG...... .......... .......... 94 TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC ATTAATGTCG AATTGGATGA 7828 .......... .......... .......... .......... .......... .......... 94 AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT ATTTTGTTTC TTGCGTTAGC 7768 .......... .......... .......... .......... .......... .......... 94 TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA AGGGTGGGCA TGGTAGGGTA 7708 .......... .......... .......... .......... .......... .......... 94 GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT CATGACATTT GATATTATTT 7648 .......... .......... .......... .......... .......... .......... 94 ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA TAGTTAGGTA TCCAATTCAT 7588 .......... .......... .......... .......... .......... .......... 94 TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT AGTTCAAAGT TGTTTAGACA 7528 .......... .......... .......... .......... .......... .......... 94 TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT TTTTGTGTAA TTGATTAAGT 7468 .......... .......... .......... .......... .......... .......... 94 GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG AGTAAGATCT TTATATGATA 7408 .......... .......... .......... .......... .......... .......... 94 TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT ATTTATATGA GTATATGTAA 7348 .......... .......... .......... .......... .......... .......... 94 GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA AAAGTATCGG CACATAATGG 7288 .......... .......... .......... .......... .......... .......... 94 TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC AAAATTATTG AACCAAAGTT 7228 .......... .......... .......... .......... .......... .......... 94 TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT ATCAGTTCTC AAGTCCAATG 7168 .......... .......... .......... .......... .......... .......... 94 CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA TGCACGACAC TGCTTAGGTA 7108 .......... .......... .......... .......... .......... .......... 94 TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT TCAAAGAAAC ATGCATGCTA 7048 .......... .......... .......... .......... .......... .......... 94 AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC CCCAATCCAT AACAAGCCAA 6988 .......... .......... .......... .......... .......... .......... 94 CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT TGCCTTGTAA ATGATGGATT 6928 .......... .......... .......... .......... .......... .......... 94 TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC CTTGTTGGTT AAGTTGGAAA 6868 .......... .......... .......... .......... .......... .......... 94 TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA GAAATTCAGT ATATGATCTC 6808 .......... .......... .......... .......... .......... .......... 94 TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC TGATATTTCT GTTGAGGATT 6748 ||||| |||||||||| |||||||||| .......... .......... .......... .....ATTGC TGATATTTCT GTTGAGGATT 119 ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC ACACACAGCT GGGAGGTACC 6688 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC ACACACAGCT GGGAGGTACC 179 AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA GAGGTTGACC AACTCACTGA 6628 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA GAGGTTGACC AACTCACTGA 239 TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT TCGTATTATT AAGCATGCTA 6568 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT TCGTATTATT AAGCATGCTA 299 TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA AGTGATTGTT GATGCTGTTA 6508 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA AGTGATTGTT GATGCTGTTA 359 TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT TAGCTCGAGT TTTTCTTGCT 6448 ||||||| TCAACAG... .......... .......... .......... .......... .......... 366 GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT TTGTTCCATT TCCATGTCGC 6388 .......... .......... .......... .......... .......... .......... 366 GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT CCCATATTCT TACCTATGTG 6328 .......... .......... .......... .......... .......... .......... 366 ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT GCTGGTGTTG TGAGGCGACA 6268 |||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ......TGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT GCTGGTGTTG TGAGGCGACA 420 AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT CAACCAAGCA ATATATCTCC TCACAACTGG 6208 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT CAACCAAGCA ATATATCTCC TCACAACTGG 480 TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCA GAATGCCTTG CAGATGAACT 6148 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCA GAATGCCTTG CAGATGAACT 540 CATTAATGCT GCCAAGGGAT CTTCCAACAG GTAATCTTTT CTATTGCCAT CTTTTTACTC 6088 |||||||||| |||||||||| ||| |||||| CATTAATGCT GCCAAGGGAT CTTNCAACAG .......... .......... .......... 570 CTATATGCGT TTAATCCTTA ATAAATGTAA CTATATTCTC TGCCTACTTA TTCATTCTAT 6028 .......... .......... .......... .......... .......... .......... 570 GTACGTGTAG CTATGCTATC AAGAAGAAGG ATGAGATTGA GAGGGTTGCC AAGGCCAATC 5968 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| .......... CTATGCTATC AAGAAGAAGG ATGAGATTGA GAGGGTTGCC AAGGCCAATC 620 GTTGAGGGTG CAGTATGGA 5949 |||||||||| ||||||||| GTTGAGGGTG CAGTATGGA 639 hqPGS_C06HBa0153O03.1-7-_SGN-E347850+ (8007 7914,6772 6501,6321 6118,6017 5949) ******************************************************************************** EST sequence 65 +strand 734 n (File: SGN-E348651+) 1 AAAGACACTT CTCCCCGGAA GGTGAATTAG AGCAGGCAAG AGAAGTAGAA GAAGAAATGG 61 ACGCAGGTGT AGTTGCTGCC CCCGCCCCGG CCGCCGCCGT CGATGCAAGC AAAGAGAATA 121 AGGTTCACAC TGATGTCATG CTTTTCAATC GCTGGAGCTA TGATGGAGTT GAGATCAATG 181 ACATGTCTGT TGAGGATTAC ATCACCGCAA CTGCTAACAA GCACCCAGTT TACATGCCAC 241 ACACAGCTGG TAGATACCAG GCCAAGCGTT TCAGGAAGGC TCAGTGCCCA ATCGTTGAGA 301 GGCTCACAAA TTCTCTCATG ATGCACGGAA GGAACAACGG AAAGAAGCTC ATGGCTGTTC 361 GTATTATTAA GCATGCAATG GAGATCATTC ATTTGTTGAC TGACCAAAAC CCAATTCAAG 421 TCATTGTTGA TGCTGTTATC AACAGTGGGC CAAGGGAAGA TGCAACACGT ATTGGTTCTG 481 CTGGTGTTGT CAGACGTCAA GCTGTTGATA TTTCTCCACT CCNGTCGTGT TAACCAAGCA 541 ATTTATTTGC TGACAACTGG TGCACGTTGA GAGTGCTTTC AGGAACATCA AGACCATAGN 601 CTGAGTGCCT TGCTGATGAA CTCATCAATG CTGCCAAGGG GTTCTTCAAA TAGCTATGCT 661 ATTAAGAAGA AGGACGAGAT TGAAAGGGTT GCCAAGCCCA ATCGTTAAGA AGATTGTTGT 721 TGGAGCAACT TTTT Predicted gene structure (within gDNA segment 8193 to 4210): Exon 1 7967 7914 ( 54 n); cDNA 120 173 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 174 445 ( 272 n); score: 0.842 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6118 ( 204 n); cDNA 446 653 ( 208 n); score: 0.858 Intron 3 6117 6018 ( 100 n); Pd: 1.000 (s: 0.83), Pa: 1.000 (s: 0.92) Exon 4 6017 5956 ( 62 n); cDNA 654 716 ( 63 n); score: 0.798 Intron 4 5955 5891 ( 65 n); Pd: 0.439 (s: 0.77), Pa: 0.474 (s: 0) Exon 5 5890 5873 ( 18 n); cDNA 717 734 ( 18 n); score: 0.611 MATCH C06HBa0153O03.1-7- SGN-E348651+ 0.839 610 0.831 C PGS_C06HBa0153O03.1-7-_SGN-E348651+ (7967 7914,6772 6501,6321 6118,6017 5956,5890 5873) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 173 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 173 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 173 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 173 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 173 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 173 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 173 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 173 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 173 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 173 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 173 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 173 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 173 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 173 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 173 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 173 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 173 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 173 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 173 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 178 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 238 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 |||||||||| || || |||| | |||||||| || || ||| ||||| |||| |||| || || ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 298 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 358 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 418 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 445 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 445 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 445 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 479 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCC-GTCGTG TCAACCAAGC 6229 |||||||||| | || || || |||||||||| |||||||||| ||| |||||| | |||||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCNGTCGTG TTAACCAAGC 539 AATATATCTC CTCACAACTG GTGCACG-TG AGAGTGCTTT CAGGAACATC AAGACCATAG 6170 ||| ||| | || ||||||| ||||||| || |||||||||| |||||||||| |||||||||| AATTTATTTG CTGACAACTG GTGCACGTTG AGAGTGCTTT CAGGAACATC AAGACCATAG 599 -CAGAATGCC TTGCAGATGA ACTCATTAAT GCTGCCAA-G GGATCTTCCA ACAGGTAATC 6112 | || |||| |||| ||||| |||||| ||| |||||||| | || ||||| | | || NCTGAGTGCC TTGCTGATGA ACTCATCAAT GCTGCCAAGG GGTTCTTCAA ATAG...... 653 TTTTCTATTG CCATCTTTTT ACTCCTATAT GCGTTTAATC CTTAATAAAT GTAACTATAT 6052 .......... .......... .......... .......... .......... .......... 653 TCTCTGCCTA CTTATTCATT CTATGTACGT GTAGCTATGC TATCAAGAAG AAGGATGAGA 5992 |||||| ||| |||||| ||||| |||| .......... .......... .......... ....CTATGC TATTAAGAAG AAGGACGAGA 679 TTGAGAGGGT TGCCAAGGCC AATCGTTGAG -GGTGCAGTA TGGATATTTA CTATTGGTTG 5933 |||| ||||| ||||||| || ||||||| || | TTGAAAGGGT TGCCAAGCCC AATCGTTAAG AAGATTG... .......... .......... 716 GACAGTTTTG CTTCGAAACG TTGTTTGTTC TTTATTTTTT AGTTGCTAGA AAGGCATTTT 5873 ||| | || | |||| .......... .......... .......... .......... ..TTGTTGGA GCAACTTTTT 734 hqPGS_C06HBa0153O03.1-7-_SGN-E348651+ (7967 7914,6772 6501,6321 6118,6017 5956) ******************************************************************************** EST sequence 83 +strand 733 n (File: SGN-E339213+) 1 AGAAAGACAC TTCTCCCCGG AAGGTGAATT AGAGCAGGCA AGAGAAGTAG AAGAAGAAAT 61 GGACGCAGGT GTAGTTGCTG CCCCCGCCCC GGCCGCCGCC GTCGATGCAA GCAAAGAGAA 121 TAAGGTTCAC ACTGATGTCA TGCTTTTCAA TCGCTGGAGC TATGATGGAG TTGAGATCAA 181 TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 241 ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 301 GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 361 TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 421 AGTCATTGTT GATGCTGTTA TCAACAGTGG GCCAAGGGAA GATGCAACAC GTATTGGTTC 481 TGCTGGTGTT GTCAGACGTC AAGCTGTTGA TATTTCTCCA CTCCGTCGTG TTAACCAAGC 541 AATTTATTTG CTGACAACTG GTGCACGTGA GAGTGCTTTC AGGAACATCA AGACCATAGC 601 TGAGTGCCTT GCTGATGAAC TCATCAATGC TGCCAAGGGT TCTTCAAATA GCTATGCTAT 661 TAAGAAGAAA GACGAGATTG AAAGGGTTGC CAAGGCCAAT CGTTAAGAGA TTGTTGTTGG 721 AGCAACTTTT TCG Predicted gene structure (within gDNA segment 8193 to 4229): Exon 1 7967 7914 ( 54 n); cDNA 122 175 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 176 447 ( 272 n); score: 0.842 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6118 ( 204 n); cDNA 448 651 ( 204 n); score: 0.907 Intron 3 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0.92) Exon 4 6017 5956 ( 62 n); cDNA 652 713 ( 62 n); score: 0.839 Intron 4 5955 5891 ( 65 n); Pd: 0.439 (s: 0.82), Pa: 0.474 (s: 0) Exon 5 5890 5873 ( 18 n); cDNA 714 731 ( 18 n); score: 0.611 MATCH C06HBa0153O03.1-7- SGN-E339213+ 0.860 610 0.832 C PGS_C06HBa0153O03.1-7-_SGN-E339213+ (7967 7914,6772 6501,6321 6118,6017 5956,5890 5873) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 175 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 175 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 175 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 175 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 175 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 175 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 175 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 175 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 175 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 175 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 175 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 175 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 175 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 175 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 175 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 175 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 175 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 175 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 175 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 180 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 240 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 |||||||||| || || |||| | |||||||| || || ||| ||||| |||| |||| || || ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 300 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 360 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 420 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 447 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 447 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 447 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 481 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT CAACCAAGCA 6228 |||||||||| | || || || |||||||||| |||||||||| |||||||||| ||||||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT TAACCAAGCA 541 ATATATCTCC TCACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCA 6168 || ||| | | | |||||||| |||||||||| |||||||||| |||||||||| ||||||||| ATTTATTTGC TGACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCT 601 GAATGCCTTG CAGATGAACT CATTAATGCT GCCAAGGGAT CTTCCAACAG GTAATCTTTT 6108 || ||||||| | |||||||| ||| |||||| |||||||| | |||| || || GAGTGCCTTG CTGATGAACT CATCAATGCT GCCAAGGGTT CTTCAAATAG .......... 651 CTATTGCCAT CTTTTTACTC CTATATGCGT TTAATCCTTA ATAAATGTAA CTATATTCTC 6048 .......... .......... .......... .......... .......... .......... 651 TGCCTACTTA TTCATTCTAT GTACGTGTAG CTATGCTATC AAGAAGAAGG ATGAGATTGA 5988 ||||||||| |||||||| | | |||||||| .......... .......... .......... CTATGCTATT AAGAAGAAAG ACGAGATTGA 681 GAGGGTTGCC AAGGCCAATC GTTGAGGGTG CAGTATGGAT ATTTACTATT GGTTGGACAG 5928 ||||||||| |||||||||| ||| || | AAGGGTTGCC AAGGCCAATC GTTAAGAGAT TG........ .......... .......... 713 TTTTGCTTCG AAACGTTGTT TGTTCTTTAT TTTTTAGTTG CTAGAAAGGC ATTTT 5873 ||| | || | |||| .......... .......... .......... .......TTG TTGGAGCAAC TTTTT 731 hqPGS_C06HBa0153O03.1-7-_SGN-E339213+ (7967 7914,6772 6501,6321 6118,6017 5956) ******************************************************************************** EST sequence 111 +strand 575 n (File: SGN-E353078+) 1 GGAAAGAAGC TCATGGCTGT TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG 61 ACTGACCAAA ACCCAATTCA AGTCATTGTT GATGCTGTTA TCAACAGTGG GCCAAGGGAA 121 GATGCAACAC GTATTGGTTC TGCTGGTGTT GTCAGACGTC AAGCTGTTGA TATTTCTCCA 181 CTCCGTCGTG TTAACCAAGC AATTTATTTG CTGACAACTG GTGCACGTGA GAGTGCTTTC 241 AGGAACATCA AGACCATAGC TGAGTGCCTT GCTGATGAAC TCATCAATGC TGCCAAGGGT 301 TCTTCAAATA GCTATGCTAT TAAGAAGAAG GACGAGATTG AAAGGGTTGC CAAGGCCAAT 361 CGTTAAGAGA TTGTTGTTGG AGCAACTTTT TCGAGAGAAC TTTTGGGTTA TGTATTTTCT 421 CAGNTCTGTT TTCATGTAGG CATTATAGCA TCTGCTACTC CTTATGGATT TAGTTTCTTG 481 GAGGATTTAT GTTTGGTATT GTTATAAATG TTAAATTTTG AAGTTCCTTT ATTCGGGTTC 541 TCAGTAGAGT TTCGTTAAAC ACGGTAAAAA AAAAA Predicted gene structure (within gDNA segment 8193 to 2409): Exon 1 6607 6501 ( 107 n); cDNA 1 107 ( 107 n); score: 0.888 Intron 1 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 2 6321 6118 ( 204 n); cDNA 108 311 ( 204 n); score: 0.907 Intron 2 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0.94) Exon 3 6017 5961 ( 57 n); cDNA 312 368 ( 57 n); score: 0.912 Intron 3 5960 5328 ( 633 n); Pd: 0.081 (s: 0.90), Pa: 0.000 (s: 0.62) Exon 4 5327 5280 ( 48 n); cDNA 369 414 ( 46 n); score: 0.625 PPA cDNA 566 575 MATCH C06HBa0153O03.1-7- SGN-E353078+ 0.902 416 0.723 C PGS_C06HBa0153O03.1-7-_SGN-E353078+ (6607 6501,6321 6118,6017 5961,5327 5280) Alignment (genomic DNA sequence = upper lines): GGGAAGAAGT TGATGGCCGT TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG 6548 || |||||| | ||||| || |||||||||| |||||||| | |||| || || ||| ||||| GGAAAGAAGC TCATGGCTGT TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG 60 ACTGACCTAA ACCCAATCCA AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT 6488 ||||||| || ||||||| || ||| |||||| |||||||||| ||||||| ACTGACCAAA ACCCAATTCA AGTCATTGTT GATGCTGTTA TCAACAG... .......... 107 TCTGATTTTT GCATATTTAT TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG 6428 .......... .......... .......... .......... .......... .......... 107 ATTTTCATAC CATGTCTTCT TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA 6368 .......... .......... .......... .......... .......... .......... 107 GTCATTCTCA TTTTTTCCTT CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG 6308 ||| ||||| |||| .......... .......... .......... .......... ......TGGG CCAAGGGAAG 121 ATGCAACTCG TATAGGTTCT GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC 6248 ||||||| || ||| |||||| |||||||||| | || || || |||||||||| |||||||||| ATGCAACACG TATTGGTTCT GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC 181 TCCGTCGTGT CAACCAAGCA ATATATCTCC TCACAACTGG TGCACGTGAG AGTGCTTTCA 6188 |||||||||| ||||||||| || ||| | | | |||||||| |||||||||| |||||||||| TCCGTCGTGT TAACCAAGCA ATTTATTTGC TGACAACTGG TGCACGTGAG AGTGCTTTCA 241 GGAACATCAA GACCATAGCA GAATGCCTTG CAGATGAACT CATTAATGCT GCCAAGGGAT 6128 |||||||||| ||||||||| || ||||||| | |||||||| ||| |||||| |||||||| | GGAACATCAA GACCATAGCT GAGTGCCTTG CTGATGAACT CATCAATGCT GCCAAGGGTT 301 CTTCCAACAG GTAATCTTTT CTATTGCCAT CTTTTTACTC CTATATGCGT TTAATCCTTA 6068 |||| || || CTTCAAATAG .......... .......... .......... .......... .......... 311 ATAAATGTAA CTATATTCTC TGCCTACTTA TTCATTCTAT GTACGTGTAG CTATGCTATC 6008 ||||||||| .......... .......... .......... .......... .......... CTATGCTATT 321 AAGAAGAAGG ATGAGATTGA GAGGGTTGCC AAGGCCAATC GTTGAGGGTG CAGTATGGAT 5948 |||||||||| | |||||||| ||||||||| |||||||||| ||| || AAGAAGAAGG ACGAGATTGA AAGGGTTGCC AAGGCCAATC GTTAAGA... .......... 368 ATTTACTATT GGTTGGACAG TTTTGCTTCG AAACGTTGTT TGTTCTTTAT TTTTTAGTTG 5888 .......... .......... .......... .......... .......... .......... 368 CTAGAAAGGC ATTTTGGAAC TAGTAACGAG TTTTTCTGTT TGGGAAACTT GGAGTACATT 5828 .......... .......... .......... .......... .......... .......... 368 GGTATTGAAT ATTATATGGG GGAATTAGCA AAAAGCAAAA TTGAGCTTGC TGTTATTCTC 5768 .......... .......... .......... .......... .......... .......... 368 AGTTGAAATG TCTGGTTGTT GTGTTTCTGC TTCTTCTTTA TCAAGCTTTT GTTCGCATAG 5708 .......... .......... .......... .......... .......... .......... 368 GTTGAACAAT CTTGTTGGTG ACAATGCGGA CGGACATTAT TTTCTCCTTT TTAGCTCAAT 5648 .......... .......... .......... .......... .......... .......... 368 CTCAGCTGAG CATTACATTT TTATTTCTAA AAAGATAGAT TATGTTTTTG CAGTTAACTG 5588 .......... .......... .......... .......... .......... .......... 368 CTATCAAGTT TTTTCGTTGA TTTTTTAAAA AAGTTAATAA AGGTAGACAA GTAAAGTCTG 5528 .......... .......... .......... .......... .......... .......... 368 GAAGCAAGAT TTGGACGTTA GATGTTTGTG CAGAACTTGC AAAGTTGGAA AATGTAGAAA 5468 .......... .......... .......... .......... .......... .......... 368 TGTTTGGAGT TTAAATGTTC ATACAAGTTT TTGGAATAGC CTTCAAATAT TACTTTTGTT 5408 .......... .......... .......... .......... .......... .......... 368 TCATTTACCA TACAACTATT TTTTAGTAAC GTACTTATTC AATTCACTTA ATTAAAATTA 5348 .......... .......... .......... .......... .......... .......... 368 TAATCTTGTT CTTTGTTTTT GTTTTTTAAA TAGATTAATT TATTAAGAAG GAATTTCTGT 5288 | || || | || || | | || || ||| || || .......... .......... GATTGTT-GT TGGAGCAACT T-TTTCGAGA GAACTTTTGG 406 GTTAAGTA 5280 |||| ||| GTTATGTA 414 hqPGS_C06HBa0153O03.1-7-_SGN-E353078+ (6607 6501,6321 6118,6017 5961) ******************************************************************************** EST sequence 66 +strand 767 n (File: SGN-E348708+) 1 TACAGCAGAA AGACACTTCT CCCCGGAAGG TGAATTAGAG CAGGCAAGAG AAGTAGAAGA 61 AGAAATGGAC GCAGGTGTAG TTGCTGCCCC CGCCCCGGCC GCCGCCGTCG ATGCAAGCAA 121 AGAGAATAAG GTTCACACTG ATGTCATGCT TTTCAATCGC TGGAGCTATG ATGGAGTTGA 181 GATCAATGAC ATGTCTGTTG AGGATTACAT CACCGCAACT GCTAACAAGC ACCCAGTTTA 241 CATGCCACAC ACAGCTGGTA GATACCAGGC CAAGCGTTTC AGGAAGGCTC AGTGCCCAAT 301 CGTTGAGAGG CTCACAAATT CTCTCATGAT GCACGGAAGG AACAACGGAA AGAAGCTCAT 361 GGCTGTTCGT ATTATTAAGC ATGCAATGGA GATCATTCAT TTGTTGACTG ACCAAAACCC 421 AATTCAAGTC ATTGTTGATG CTGTTATCAA CAGTGGGCCA AGGGAAGATG CAACACGTAT 481 TGGTTCTGCT GGTGTTGTCA GACGTCAAGC TGTTGATATT TCTCCACTCC GTCGTGTTAA 541 CCAAGCAATT TATTTGCTGA CAACTGGTGC ACGTGAGAGT GCTTTCAGGA ACATCAAGAC 601 CATAGCTGAG TGCCTTGCTG ATGAACTCAT CAATGCTGCC AAGGGTTCTT CAAATAGCTA 661 TGCTATTAAG AAGAAGGACG AGATTGAAAG GGTTGCCAAG GCCAATCGTT AAGAGATTGT 721 TGTTGGAGCA ACTTTTTCGA GAGACTNTTT GGTTATGTTA TTTTCTC Predicted gene structure (within gDNA segment 8193 to 3949): Exon 1 7967 7914 ( 54 n); cDNA 128 181 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 182 453 ( 272 n); score: 0.842 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6118 ( 204 n); cDNA 454 657 ( 204 n); score: 0.907 Intron 3 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0.94) Exon 4 6017 5962 ( 56 n); cDNA 658 713 ( 56 n); score: 0.929 MATCH C06HBa0153O03.1-7- SGN-E348708+ 0.869 586 0.764 C PGS_C06HBa0153O03.1-7-_SGN-E348708+ (7967 7914,6772 6501,6321 6118,6017 5962) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 181 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 181 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 181 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 181 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 181 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 181 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 181 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 181 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 181 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 181 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 181 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 181 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 181 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 181 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 181 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 181 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 181 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 181 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 181 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 186 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 246 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 |||||||||| || || |||| | |||||||| || || ||| ||||| |||| |||| || || ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 306 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 366 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 426 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 453 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 453 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 453 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 487 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT CAACCAAGCA 6228 |||||||||| | || || || |||||||||| |||||||||| |||||||||| ||||||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT TAACCAAGCA 547 ATATATCTCC TCACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCA 6168 || ||| | | | |||||||| |||||||||| |||||||||| |||||||||| ||||||||| ATTTATTTGC TGACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCT 607 GAATGCCTTG CAGATGAACT CATTAATGCT GCCAAGGGAT CTTCCAACAG GTAATCTTTT 6108 || ||||||| | |||||||| ||| |||||| |||||||| | |||| || || GAGTGCCTTG CTGATGAACT CATCAATGCT GCCAAGGGTT CTTCAAATAG .......... 657 CTATTGCCAT CTTTTTACTC CTATATGCGT TTAATCCTTA ATAAATGTAA CTATATTCTC 6048 .......... .......... .......... .......... .......... .......... 657 TGCCTACTTA TTCATTCTAT GTACGTGTAG CTATGCTATC AAGAAGAAGG ATGAGATTGA 5988 ||||||||| |||||||||| | |||||||| .......... .......... .......... CTATGCTATT AAGAAGAAGG ACGAGATTGA 687 GAGGGTTGCC AAGGCCAATC GTTGAG 5962 ||||||||| |||||||||| ||| || AAGGGTTGCC AAGGCCAATC GTTAAG 713 hqPGS_C06HBa0153O03.1-7-_SGN-E348708+ (7967 7914,6772 6501,6321 6118,6017 5962) ******************************************************************************** EST sequence 68 +strand 722 n (File: SGN-E348496+) 1 ACACTTCTCC CCGGAAGGTG AATTAGAGCA GGCAAGAGAA GTAGAAGAAG AAATGGACGC 61 AGGTGTAGTT GCTGCCCCCG CCCCGGCCGC CGCCGTCGAT GCAAGCAAAG AGAATAAGGT 121 TCACACTGAT GTCATGCTTT TCAATCGCTG GAGCTATGAT GGAGTTGAGA TCAATGACAT 181 GTCTGTTGAG GATTACATCA CCGCAACTGC TAACAAGCAC CCAGTTTACA TGCCACACAC 241 AGCTGGTAGA TACCAGGCCA AGCGTTTCAG GAAGGCTCAG TGCCCAATCG TTGAGAGGCT 301 CACAAATTCT CTCATGATGC ACGGAAGGAA CAACGGAAAG AAGCTCATGG CTGTTCGTAT 361 TATTAAGCAT GCAATGGAGA TCATTCATTT GTTGACTGAC CAAAACCCAA TTCAAGTCAT 421 TGTTGATGCT GTTATCAACA GTGGGCCAAG GGAAGATGCA ACACGTATTG GTTCTGCTGG 481 TGTTGTCAGA CGTCAAGCTG TTGATATTTC TCCACTCCGT CGTGTTAACC AAGCAATTTA 541 TTTGCTGACA ACTGGTGCAC GTGAGAGTGC TTTCAGGAAC ATCAAGACCA TAGCTGAGTG 601 CCTTGCTGAT GAACTCATCA ATGCTGCCAA GGGGTTCTTC AAATAGCTAT GCTATTAAGA 661 AGAAGGACGA GATTGAAAGG GTTGCCAAGG CCAATCGTTA AGAGATTTGT TGTTGGAGCA 721 AC Predicted gene structure (within gDNA segment 8193 to 4279): Exon 1 7967 7914 ( 54 n); cDNA 116 169 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 170 441 ( 272 n); score: 0.842 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6118 ( 204 n); cDNA 442 646 ( 205 n); score: 0.895 Intron 3 6117 6018 ( 100 n); Pd: 1.000 (s: 0.83), Pa: 1.000 (s: 0.94) Exon 4 6017 5962 ( 56 n); cDNA 647 702 ( 56 n); score: 0.929 MATCH C06HBa0153O03.1-7- SGN-E348496+ 0.864 586 0.812 C PGS_C06HBa0153O03.1-7-_SGN-E348496+ (7967 7914,6772 6501,6321 6118,6017 5962) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 169 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 169 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 169 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 169 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 169 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 169 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 169 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 169 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 169 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 169 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 169 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 169 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 169 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 169 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 169 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 169 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 169 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 169 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 169 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 174 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 234 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 |||||||||| || || |||| | |||||||| || || ||| ||||| |||| |||| || || ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 294 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 354 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 414 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 441 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 441 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 441 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 475 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT CAACCAAGCA 6228 |||||||||| | || || || |||||||||| |||||||||| |||||||||| ||||||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT TAACCAAGCA 535 ATATATCTCC TCACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCA 6168 || ||| | | | |||||||| |||||||||| |||||||||| |||||||||| ||||||||| ATTTATTTGC TGACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCT 595 GAATGCCTTG CAGATGAACT CATTAATGCT GCCAA-GGGA TCTTCCAACA GGTAATCTTT 6109 || ||||||| | |||||||| ||| |||||| ||||| ||| ||||| || | | GAGTGCCTTG CTGATGAACT CATCAATGCT GCCAAGGGGT TCTTCAAATA G......... 646 TCTATTGCCA TCTTTTTACT CCTATATGCG TTTAATCCTT AATAAATGTA ACTATATTCT 6049 .......... .......... .......... .......... .......... .......... 646 CTGCCTACTT ATTCATTCTA TGTACGTGTA GCTATGCTAT CAAGAAGAAG GATGAGATTG 5989 ||||||||| ||||||||| || ||||||| .......... .......... .......... .CTATGCTAT TAAGAAGAAG GACGAGATTG 675 AGAGGGTTGC CAAGGCCAAT CGTTGAG 5962 | |||||||| |||||||||| |||| || AAAGGGTTGC CAAGGCCAAT CGTTAAG 702 hqPGS_C06HBa0153O03.1-7-_SGN-E348496+ (7967 7914,6772 6501,6321 6118,6017 5962) ******************************************************************************** EST sequence 77 +strand 821 n (File: SGN-E539246+) 1 GCAGAAAGAC ACTTCTCCCC GGAAGGTGAA TTAGAGCAGG CAAGAGAAGT AGAAGAAGAA 61 ATGGACGCAG GTGTAGTTGC TGCCCCCGCC CCGGCCGCCG CCGTCGATGC AAGCAAAGAG 121 AATAAGGTTC ACACTGATGT CATGCTTTTC AATCGCTGGA GCTATGATGG AGTTGAGATC 181 AATGACATGT CTGTTGAGGA TTACATCACC GCAACTGCTA ACAAGCACCC AGTTTACATG 241 CCACACACAG CTGGTAGATA CCAGGCCAAG CGTTTCAGGA AGGCTCAGTG CCCAATCGTT 301 GAGAGGCTCA CAAATTCTCT CATGATGCAC GGAAGGAACA ACGGAAAGAA GCTCATGGCT 361 GTTCGTATTA TTAAGCATGC AATGGAGATC ATTCATTTGT TGACTGACCA AAACCCAATT 421 CAAGTCATTG TTGATGCTGT TATCAACAGT GGGCCAAGGG AAGATGCAAC ACGTATTGGT 481 TCTGCTGGTG TTGTCAGACG TCAAGCTGTT GATATTTCTC CACTCCGTCG TGTTAACCAA 541 GCAATTTATT TGCTGACAAC TGGTGCACGT GAGAGTGCTT TCAGGAACAT CAAGACCATA 601 GCTGAGTGCC TTGCTGATGA ACTCATCAAT GCTGCCAAGG GTTCTTCAAA TAGCTATGCT 661 ATTAAGAAGA AGGACGAGAT TGAAAGGGTT GCCAAGGCCA ATCGTTAAGA GATTGTTGTT 721 GGAGCAACTT TTTCGAGAGA CTTTTTGGTT ATGTTATTTT CTCAGTTCTG TTTTCATGTA 781 GGCATTATAG CATCTGCTAC TCCTTATGGA TTTAGTTTCT T Predicted gene structure (within gDNA segment 8193 to 3369): Exon 1 7967 7914 ( 54 n); cDNA 124 177 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 178 449 ( 272 n); score: 0.842 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6118 ( 204 n); cDNA 450 653 ( 204 n); score: 0.907 Intron 3 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0.94) Exon 4 6017 5962 ( 56 n); cDNA 654 709 ( 56 n); score: 0.929 MATCH C06HBa0153O03.1-7- SGN-E539246+ 0.869 586 0.714 C PGS_C06HBa0153O03.1-7-_SGN-E539246+ (7967 7914,6772 6501,6321 6118,6017 5962) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 177 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 177 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 177 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 177 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 177 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 177 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 177 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 177 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 177 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 177 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 177 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 177 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 177 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 177 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 177 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 177 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 177 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 177 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 177 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 182 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 242 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 |||||||||| || || |||| | |||||||| || || ||| ||||| |||| |||| || || ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 302 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 362 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 422 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 449 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 449 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 449 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 483 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT CAACCAAGCA 6228 |||||||||| | || || || |||||||||| |||||||||| |||||||||| ||||||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT TAACCAAGCA 543 ATATATCTCC TCACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCA 6168 || ||| | | | |||||||| |||||||||| |||||||||| |||||||||| ||||||||| ATTTATTTGC TGACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCT 603 GAATGCCTTG CAGATGAACT CATTAATGCT GCCAAGGGAT CTTCCAACAG GTAATCTTTT 6108 || ||||||| | |||||||| ||| |||||| |||||||| | |||| || || GAGTGCCTTG CTGATGAACT CATCAATGCT GCCAAGGGTT CTTCAAATAG .......... 653 CTATTGCCAT CTTTTTACTC CTATATGCGT TTAATCCTTA ATAAATGTAA CTATATTCTC 6048 .......... .......... .......... .......... .......... .......... 653 TGCCTACTTA TTCATTCTAT GTACGTGTAG CTATGCTATC AAGAAGAAGG ATGAGATTGA 5988 ||||||||| |||||||||| | |||||||| .......... .......... .......... CTATGCTATT AAGAAGAAGG ACGAGATTGA 683 GAGGGTTGCC AAGGCCAATC GTTGAG 5962 ||||||||| |||||||||| ||| || AAGGGTTGCC AAGGCCAATC GTTAAG 709 hqPGS_C06HBa0153O03.1-7-_SGN-E539246+ (7967 7914,6772 6501,6321 6118,6017 5962) ******************************************************************************** EST sequence 88 +strand 750 n (File: SGN-E339028+) 1 AGAAAGACAC TTCTCCCCGG AAGGTGAATT AGAGCAGGCA AGAGAAGTAG AAGAAGAAAT 61 GGACGCAGGT GTAGTTGCTG CCCCCGCCCC GGCCGCCGCC GTCGATGCAA GCAAAGAGAA 121 TAAGGTTCAC ACTGATGTCA TGCTTTTCAA TCGCTGGAGC TATGATGGAG TTGAGATCAA 181 TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 241 ACACACAGAT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 301 GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 361 TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 421 AGTCATTGTT GATGCTGTTA TCAACAGTGG GCCAAGGGAA GATGCAACAC GTATTGGTTC 481 TGCTGGTGTT GTCAGACGTC AAGCTGTTGA TATTTCTCCA CTCCGTCGTG TTAACCAAGC 541 AATTTATTTG CTGACAACTG GTGCACGTGA GAGTGCTTTC AGGAACATCA AGACCATAGC 601 TGAGTGCCTT GCTGATGAAC TCATCAATGC TGCCAAGGGG TTCTTCAAAT AGCTATGCTA 661 TTAAGAAGAA GGACGAGATT GAAAGGGTTG CCAAGGCCAA TCGTTAAGAG ATTGTTGTTG 721 GAGCAACTTT TTCGAGAGAC TTTTTGGGTA Predicted gene structure (within gDNA segment 8193 to 4059): Exon 1 7967 7914 ( 54 n); cDNA 122 175 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 176 447 ( 272 n); score: 0.838 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6118 ( 204 n); cDNA 448 652 ( 205 n); score: 0.895 Intron 3 6117 6018 ( 100 n); Pd: 1.000 (s: 0.83), Pa: 1.000 (s: 0.94) Exon 4 6017 5962 ( 56 n); cDNA 653 708 ( 56 n); score: 0.929 MATCH C06HBa0153O03.1-7- SGN-E339028+ 0.863 586 0.781 C PGS_C06HBa0153O03.1-7-_SGN-E339028+ (7967 7914,6772 6501,6321 6118,6017 5962) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 175 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 175 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 175 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 175 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 175 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 175 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 175 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 175 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 175 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 175 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 175 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 175 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 175 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 175 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 175 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 175 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 175 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 175 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 175 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 180 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 240 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 |||||||| | || || |||| | |||||||| || || ||| ||||| |||| |||| || || ACACACAGAT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 300 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 360 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 420 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 447 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 447 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 447 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 481 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT CAACCAAGCA 6228 |||||||||| | || || || |||||||||| |||||||||| |||||||||| ||||||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT TAACCAAGCA 541 ATATATCTCC TCACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCA 6168 || ||| | | | |||||||| |||||||||| |||||||||| |||||||||| ||||||||| ATTTATTTGC TGACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCT 601 GAATGCCTTG CAGATGAACT CATTAATGCT GCCAA-GGGA TCTTCCAACA GGTAATCTTT 6109 || ||||||| | |||||||| ||| |||||| ||||| ||| ||||| || | | GAGTGCCTTG CTGATGAACT CATCAATGCT GCCAAGGGGT TCTTCAAATA G......... 652 TCTATTGCCA TCTTTTTACT CCTATATGCG TTTAATCCTT AATAAATGTA ACTATATTCT 6049 .......... .......... .......... .......... .......... .......... 652 CTGCCTACTT ATTCATTCTA TGTACGTGTA GCTATGCTAT CAAGAAGAAG GATGAGATTG 5989 ||||||||| ||||||||| || ||||||| .......... .......... .......... .CTATGCTAT TAAGAAGAAG GACGAGATTG 681 AGAGGGTTGC CAAGGCCAAT CGTTGAG 5962 | |||||||| |||||||||| |||| || AAAGGGTTGC CAAGGCCAAT CGTTAAG 708 hqPGS_C06HBa0153O03.1-7-_SGN-E339028+ (7967 7914,6772 6501,6321 6118,6017 5962) ******************************************************************************** EST sequence 139 +strand 644 n (File: SGN-E347192+) 1 TGGAGTTGAG ATCAATGACA TGTCTGTTGA GGATTACATC ACCGCAACTG CTAACAAGCA 61 CCCAGTTTAC ATGCCACACA CAGCTGGTAG ATACCAGGCC AAGCGTTTCA GGAAGGCTCA 121 GTGCCCAATC GTTGAGAGGC TCACAAATTC TCTCATGATG CACGGAAGGA ACAACGGAAA 181 GAAGCTCATG GCTGTTCGTA TTATTAAGCA TGCAATGGAG ATCATTCATT TGTTGACTGA 241 CCAAAACCCA ATTCAAGTCA TTGTTGATGC TGTTATCAAC AGTGGGCCAA GGGAAGATGC 301 AACACGTATT GGTTCTGCTG GTGTTGTCAG ACGTCAAGCT GTTGATATTT CTCCACTCCG 361 TCGTGTTAAC CAAGCAATTT ATTTGCTGAC AACTGGTGCA CGTGAGAGTG CTTTCAGGAA 421 CATCAAGACC ATAGCTGAGT GCCTTGCTGA TGAACTCATC AATGCTGCCA AGGGTTCTTC 481 AAATAGCTAT GCTATTAAGA AGAAGGACGA GATTGAAAGG GTTGCCAAGG CCAATCGTTA 541 AGAGATTGTT GTTGGAGCAA CTTTTTCGAG AGACTTTTTG GTTATGTTAT TTTCTCAGNT 601 CTGTTTTCAT GTAGGCATTA TAGCATCTGC TACTCCTTAT GGAT Predicted gene structure (within gDNA segment 8193 to 3469): Exon 1 7923 7914 ( 10 n); cDNA 1 10 ( 10 n); score: 0.700 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 11 282 ( 272 n); score: 0.842 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6118 ( 204 n); cDNA 283 486 ( 204 n); score: 0.907 Intron 3 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0.94) Exon 4 6017 5962 ( 56 n); cDNA 487 542 ( 56 n); score: 0.929 MATCH C06HBa0153O03.1-7- SGN-E347192+ 0.876 542 0.842 C PGS_C06HBa0153O03.1-7-_SGN-E347192+ (7923 7914,6772 6501,6321 6118,6017 5962) Alignment (genomic DNA sequence = upper lines): TGATGTTCAG GTTTGTTTGT TTCCCTTTCA ATTTTATTCC TCTCCAGTTC CTATATCTTT 7864 || ||| || TGGAGTTGAG .......... .......... .......... .......... .......... 10 TCATTATTTG CCTAACATTA ATGTCGAATT GGATGAAACT TGGCATTTTC GAAATCATAA 7804 .......... .......... .......... .......... .......... .......... 10 GATGAACATT TGAATTATTT TGTTTCTTGC GTTAGCTAAA CTCTAATTGT AGTGTAGCAG 7744 .......... .......... .......... .......... .......... .......... 10 AGGTGATATA TCAGTAAGGG TGGGCATGGT AGGGTAGATA CCGAAACCAA AATTTTTCAC 7684 .......... .......... .......... .......... .......... .......... 10 TCAATGGTTT CAATATCATG ACATTTGATA TTATTTATAA TGTATATCGA ATCACCAAAT 7624 .......... .......... .......... .......... .......... .......... 10 ACTTTAACAG AGTGTATAGT TAGGTATCCA ATTCATTTAT CGTATTATAA TACTAACAAA 7564 .......... .......... .......... .......... .......... .......... 10 TATATTAACT AGTATTAGTT CAAAGTTGTT TAGACATTGA AAGCTTTGAC TACTCTTTTC 7504 .......... .......... .......... .......... .......... .......... 10 TTGTTAGAAT TGTCCTTTTT GTGTAATTGA TTAAGTGATG GAATTGCTTC TTCTTTCTTT 7444 .......... .......... .......... .......... .......... .......... 10 TGAATATTTT TACATGAGTA AGATCTTTAT ATGATATAAT TAAGAAGTTT CTAAAGAAAC 7384 .......... .......... .......... .......... .......... .......... 10 CAAAACATAA TTCTCTATTT ATATGAGTAT ATGTAAGTCG AAGTCGAACA AACAATGGTT 7324 .......... .......... .......... .......... .......... .......... 10 ACCAACCAAA AGTTAAAAAG TATCGGCACA TAATGGTTTA ATTTGATATG GTAATGGTAT 7264 .......... .......... .......... .......... .......... .......... 10 AGTACTTTTA AAAATCAAAA TTATTGAACC AAAGTTTTCA ATATTGTATC ATACCTTTCC 7204 .......... .......... .......... .......... .......... .......... 10 ATGCTCATCC CTACATATCA GTTCTCAAGT CCAATGCATT GAATACTTAA CCATGGTTAG 7144 .......... .......... .......... .......... .......... .......... 10 GAAACTTGAA ACACTATGCA CGACACTGCT TAGGTATGTC TATCAACTAT AAAGCCTGCT 7084 .......... .......... .......... .......... .......... .......... 10 GGCTTGATCT TCTTATTCAA AGAAACATGC ATGCTAAACA TGATATGATT AAGTTGAACA 7024 .......... .......... .......... .......... .......... .......... 10 GAATAGTGTT GGTTTCCCCA ATCCATAACA AGCCAACTGG GACAACCTTA CAGAAGGTGT 6964 .......... .......... .......... .......... .......... .......... 10 GCCTATTCAT CATTGTTGCC TTGTAAATGA TGGATTTATA CAACTGAAAA TTACTTGCTG 6904 .......... .......... .......... .......... .......... .......... 10 AGAGTTCAGG GAAATCCTTG TTGGTTAAGT TGGAAATGTA ATTGTAGGTG GATTCTTCAT 6844 .......... .......... .......... .......... .......... .......... 10 TGGAATGCTC AAAGGAGAAA TTCAGTATAT GATCTCTTGA ATTCTCTCTT AAATGTTATT 6784 .......... .......... .......... .......... .......... .......... 10 ATCTCATGCA GATTGCTGAT ATTTCTGTTG AGGATTACAT AACTGCTACT GCTAACAAGC 6724 || ||| || ||||||| |||||||||| || || ||| |||||||||| .......... .ATCAATGAC ATGTCTGTTG AGGATTACAT CACCGCAACT GCTAACAAGC 59 ATCCTACATA TACACCACAC ACAGCTGGGA GGTACCAAGC CAAGCGGTTT AGAAAGGCTC 6664 | || || | |||||| |||||||| | | ||||| || |||||| || || ||||||| ACCCAGTTTA CATGCCACAC ACAGCTGGTA GATACCAGGC CAAGCGTTTC AGGAAGGCTC 119 AATGCCCAAT TGTGGAGAGG TTGACCAACT CACTGATGAT GCACGGAAGG AACAACGGGA 6604 | |||||||| || |||||| | || || | | || ||||| |||||||||| |||||||| | AGTGCCCAAT CGTTGAGAGG CTCACAAATT CTCTCATGAT GCACGGAAGG AACAACGGAA 179 AGAAGTTGAT GGCCGTTCGT ATTATTAAGC ATGCTATGGA AATTATCCAT CTGTTGACTG 6544 ||||| | || ||| |||||| |||||||||| |||| ||||| || || ||| ||||||||| AGAAGCTCAT GGCTGTTCGT ATTATTAAGC ATGCAATGGA GATCATTCAT TTGTTGACTG 239 ACCTAAACCC AATCCAAGTG ATTGTTGATG CTGTTATCAA CAGGTTTAGA GATTATTCTG 6484 ||| |||||| ||| ||||| |||||||||| |||||||||| ||| ACCAAAACCC AATTCAAGTC ATTGTTGATG CTGTTATCAA CAG....... .......... 282 ATTTTTGCAT ATTTATTAGC TCGAGTTTTT CTTGCTGAGG TCTTGTTAAT TAGAAGATTT 6424 .......... .......... .......... .......... .......... .......... 282 TCATACCATG TCTTCTTTGT TCCATTTCCA TGTCGCGGCA TACTTGAGAT ATTGTAGTCA 6364 .......... .......... .......... .......... .......... .......... 282 TTCTCATTTT TTCCTTCCCA TATTCTTACC TATGTGATGC AGTGGACCAA GAGAAGATGC 6304 ||| |||| | |||||||| .......... .......... .......... .......... ..TGGGCCAA GGGAAGATGC 300 AACTCGTATA GGTTCTGCTG GTGTTGTGAG GCGACAAGCT GTTGATATTT CTCCACTCCG 6244 ||| ||||| |||||||||| ||||||| || || |||||| |||||||||| |||||||||| AACACGTATT GGTTCTGCTG GTGTTGTCAG ACGTCAAGCT GTTGATATTT CTCCACTCCG 360 TCGTGTCAAC CAAGCAATAT ATCTCCTCAC AACTGGTGCA CGTGAGAGTG CTTTCAGGAA 6184 |||||| ||| |||||||| | || | || || |||||||||| |||||||||| |||||||||| TCGTGTTAAC CAAGCAATTT ATTTGCTGAC AACTGGTGCA CGTGAGAGTG CTTTCAGGAA 420 CATCAAGACC ATAGCAGAAT GCCTTGCAGA TGAACTCATT AATGCTGCCA AGGGATCTTC 6124 |||||||||| ||||| || | ||||||| || ||||||||| |||||||||| |||| ||||| CATCAAGACC ATAGCTGAGT GCCTTGCTGA TGAACTCATC AATGCTGCCA AGGGTTCTTC 480 CAACAGGTAA TCTTTTCTAT TGCCATCTTT TTACTCCTAT ATGCGTTTAA TCCTTAATAA 6064 || || AAATAG.... .......... .......... .......... .......... .......... 486 ATGTAACTAT ATTCTCTGCC TACTTATTCA TTCTATGTAC GTGTAGCTAT GCTATCAAGA 6004 |||| ||||| |||| .......... .......... .......... .......... ......CTAT GCTATTAAGA 500 AGAAGGATGA GATTGAGAGG GTTGCCAAGG CCAATCGTTG AG 5962 ||||||| || |||||| ||| |||||||||| ||||||||| || AGAAGGACGA GATTGAAAGG GTTGCCAAGG CCAATCGTTA AG 542 hqPGS_C06HBa0153O03.1-7-_SGN-E347192+ (6772 6501,6321 6118,6017 5962) ******************************************************************************** EST sequence 103 +strand 799 n (File: SGN-E556577+) 1 GACATGTCTG TTGAGGATTA CATCACCGCA ACTGCTAACA AGCACCCAGT TTACATGCCA 61 CACACAGCTG GTAGATACCA GGCCAAGCGT TTCAGGAAGG CTCAGTGCCC AATCGTTGAG 121 AGGCTCACAA ATTCTCTCAT GATGCACGGA AGGAACAACG GAAAGAAGCT CATGGCTGTT 181 CGTATTATTA AGCATGCAAT GGAGATCATT CATTTGTTGA CTGACCAAAA CCCAATTCAA 241 GTCATTGTTG ATGCTGTTAT CAACAGTGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 301 GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT TAACCAAGCA 361 ATTTATTTGC TGACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCT 421 GAGTGCCTTG CTGATGAACT CATCAATGCT GCCAAGGGTT CTTCAAATAG CTATGCTATT 481 AAGAAGAAGG ACGAGATTGA AAGGGTTGCC AAGGCCAATC GTTAAGAGAT TGTTGTTGGA 541 GCAACTTTTT CGAGAGACTT TTTGGTTATG TTATTTTCTC AGTTCTGTTT TCATGTAGGC 601 ATTATAGCAT CTGCTACTCC TTATGGATTT AGTTTCTTGG AGGATTTATG TTTGGTATTG 661 TTATAAATGT TAAATTTTGA AGTTCCTTTA TTCGGGTTCT CAGTAGAGTT TCGTTAAACA 721 CGGTAAAAAA NANNAAAAAA AAAAAAAAAA AAAAAAAAAA AANAAAAAAA AAAAAAAAAA 781 AAAAAAAAAA GTGGGGGGG Predicted gene structure (within gDNA segment 8193 to 1759): Exon 1 6766 6501 ( 266 n); cDNA 1 266 ( 266 n); score: 0.850 Intron 1 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 2 6321 6118 ( 204 n); cDNA 267 470 ( 204 n); score: 0.907 Intron 2 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0.94) Exon 3 6017 5962 ( 56 n); cDNA 471 526 ( 56 n); score: 0.929 PPA cDNA 749 790 MATCH C06HBa0153O03.1-7- SGN-E556577+ 0.880 526 0.658 C PGS_C06HBa0153O03.1-7-_SGN-E556577+ (6766 6501,6321 6118,6017 5962) Alignment (genomic DNA sequence = upper lines): GATATTTCTG TTGAGGATTA CATAACTGCT ACTGCTAACA AGCATCCTAC ATATACACCA 6707 || || |||| |||||||||| ||| || || |||||||||| |||| || || | ||| GACATGTCTG TTGAGGATTA CATCACCGCA ACTGCTAACA AGCACCCAGT TTACATGCCA 60 CACACAGCTG GGAGGTACCA AGCCAAGCGG TTTAGAAAGG CTCAATGCCC AATTGTGGAG 6647 |||||||||| | || ||||| |||||||| || || |||| |||| ||||| ||| || ||| CACACAGCTG GTAGATACCA GGCCAAGCGT TTCAGGAAGG CTCAGTGCCC AATCGTTGAG 120 AGGTTGACCA ACTCACTGAT GATGCACGGA AGGAACAACG GGAAGAAGTT GATGGCCGTT 6587 ||| | || | | || || || |||||||||| |||||||||| | |||||| | ||||| ||| AGGCTCACAA ATTCTCTCAT GATGCACGGA AGGAACAACG GAAAGAAGCT CATGGCTGTT 180 CGTATTATTA AGCATGCTAT GGAAATTATC CATCTGTTGA CTGACCTAAA CCCAATCCAA 6527 |||||||||| ||||||| || ||| || || ||| |||||| |||||| ||| |||||| ||| CGTATTATTA AGCATGCAAT GGAGATCATT CATTTGTTGA CTGACCAAAA CCCAATTCAA 240 GTGATTGTTG ATGCTGTTAT CAACAGGTTT AGAGATTATT CTGATTTTTG CATATTTATT 6467 || ||||||| |||||||||| |||||| GTCATTGTTG ATGCTGTTAT CAACAG.... .......... .......... .......... 266 AGCTCGAGTT TTTCTTGCTG AGGTCTTGTT AATTAGAAGA TTTTCATACC ATGTCTTCTT 6407 .......... .......... .......... .......... .......... .......... 266 TGTTCCATTT CCATGTCGCG GCATACTTGA GATATTGTAG TCATTCTCAT TTTTTCCTTC 6347 .......... .......... .......... .......... .......... .......... 266 CCATATTCTT ACCTATGTGA TGCAGTGGAC CAAGAGAAGA TGCAACTCGT ATAGGTTCTG 6287 ||| | |||| ||||| |||||| ||| || ||||||| .......... .......... .....TGGGC CAAGGGAAGA TGCAACACGT ATTGGTTCTG 301 CTGGTGTTGT GAGGCGACAA GCTGTTGATA TTTCTCCACT CCGTCGTGTC AACCAAGCAA 6227 |||||||||| || || ||| |||||||||| |||||||||| ||||||||| |||||||||| CTGGTGTTGT CAGACGTCAA GCTGTTGATA TTTCTCCACT CCGTCGTGTT AACCAAGCAA 361 TATATCTCCT CACAACTGGT GCACGTGAGA GTGCTTTCAG GAACATCAAG ACCATAGCAG 6167 | ||| | || ||||||||| |||||||||| |||||||||| |||||||||| |||||||| | TTTATTTGCT GACAACTGGT GCACGTGAGA GTGCTTTCAG GAACATCAAG ACCATAGCTG 421 AATGCCTTGC AGATGAACTC ATTAATGCTG CCAAGGGATC TTCCAACAGG TAATCTTTTC 6107 | |||||||| ||||||||| || ||||||| ||||||| || ||| || || AGTGCCTTGC TGATGAACTC ATCAATGCTG CCAAGGGTTC TTCAAATAG. .......... 470 TATTGCCATC TTTTTACTCC TATATGCGTT TAATCCTTAA TAAATGTAAC TATATTCTCT 6047 .......... .......... .......... .......... .......... .......... 470 GCCTACTTAT TCATTCTATG TACGTGTAGC TATGCTATCA AGAAGAAGGA TGAGATTGAG 5987 | |||||||| | |||||||||| |||||||| .......... .......... .........C TATGCTATTA AGAAGAAGGA CGAGATTGAA 501 AGGGTTGCCA AGGCCAATCG TTGAG 5962 |||||||||| |||||||||| || || AGGGTTGCCA AGGCCAATCG TTAAG 526 hqPGS_C06HBa0153O03.1-7-_SGN-E556577+ (6766 6501,6321 6118,6017 5962) ******************************************************************************** EST sequence 58 +strand 626 n (File: SGN-E325615+) 1 ATCACCGCAA CTGCTAACAA GCACCCAGTT TACATGCCAC ACACAGCTGG TAGATACCAG 61 GCCAAGCGTT TCAGGAAGGC TCAGTGCCCA ATCGTTGAGA GGCTCACAAA TTCTCTCATG 121 ATGCACGGAA GGAACAACGG AAAGAAGCTC ATGGCTGTTC GTATTATTAA GCATGCAATG 181 GAGATCATTC ATTTGTTGAC TGACCAAAAC CCAATTCAAG TCATTGTTGA TGCTGTTATC 241 AACAGTGGGC CAAGGGAAGA TGCAACACGT ATTGGTTCTG CTGGTGTTGT CAGACGTCAA 301 GCTGTTGATA TTTCTCCACT CCGTCGTGTT AACCAAGCAA TTTATTTGCT GACAACTGGT 361 GCACGTGAGA GTGCTTTCAG GAACATCAAG ACCATAGCTG AGTGCCTTGC TGATGAACTC 421 ATCAATGCTG CCAAGGGTTC TTCAAATAGC TATGCTATTA AGAAGAAGGA CGAGATTGAA 481 AGGGTTGCCA AGGCCAATCG TTAAGAGATT GTTGTTGGAG CAACTTTTTC GAGAGACTTT 541 TTGGTTATGT TATTTTCTCA GNTCTGTTTT CATGTAGGCA TTATAGCATC TGCTACTCCT 601 TATGGATTTA GTTTCTTGGA GGATTA Predicted gene structure (within gDNA segment 8193 to 3279): Exon 1 6745 6501 ( 245 n); cDNA 1 245 ( 245 n); score: 0.845 Intron 1 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 2 6321 6118 ( 204 n); cDNA 246 449 ( 204 n); score: 0.907 Intron 2 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0.94) Exon 3 6017 5962 ( 56 n); cDNA 450 505 ( 56 n); score: 0.929 MATCH C06HBa0153O03.1-7- SGN-E325615+ 0.879 505 0.807 C PGS_C06HBa0153O03.1-7-_SGN-E325615+ (6745 6501,6321 6118,6017 5962) Alignment (genomic DNA sequence = upper lines): ATAACTGCTA CTGCTAACAA GCATCCTACA TATACACCAC ACACAGCTGG GAGGTACCAA 6686 || || || | |||||||||| ||| || || | |||| |||||||||| || ||||| ATCACCGCAA CTGCTAACAA GCACCCAGTT TACATGCCAC ACACAGCTGG TAGATACCAG 60 GCCAAGCGGT TTAGAAAGGC TCAATGCCCA ATTGTGGAGA GGTTGACCAA CTCACTGATG 6626 |||||||| | | || ||||| ||| |||||| || || |||| || | || || || || ||| GCCAAGCGTT TCAGGAAGGC TCAGTGCCCA ATCGTTGAGA GGCTCACAAA TTCTCTCATG 120 ATGCACGGAA GGAACAACGG GAAGAAGTTG ATGGCCGTTC GTATTATTAA GCATGCTATG 6566 |||||||||| |||||||||| |||||| | ||||| |||| |||||||||| |||||| ||| ATGCACGGAA GGAACAACGG AAAGAAGCTC ATGGCTGTTC GTATTATTAA GCATGCAATG 180 GAAATTATCC ATCTGTTGAC TGACCTAAAC CCAATCCAAG TGATTGTTGA TGCTGTTATC 6506 || || || | || ||||||| ||||| |||| ||||| |||| | |||||||| |||||||||| GAGATCATTC ATTTGTTGAC TGACCAAAAC CCAATTCAAG TCATTGTTGA TGCTGTTATC 240 AACAGGTTTA GAGATTATTC TGATTTTTGC ATATTTATTA GCTCGAGTTT TTCTTGCTGA 6446 ||||| AACAG..... .......... .......... .......... .......... .......... 245 GGTCTTGTTA ATTAGAAGAT TTTCATACCA TGTCTTCTTT GTTCCATTTC CATGTCGCGG 6386 .......... .......... .......... .......... .......... .......... 245 CATACTTGAG ATATTGTAGT CATTCTCATT TTTTCCTTCC CATATTCTTA CCTATGTGAT 6326 .......... .......... .......... .......... .......... .......... 245 GCAGTGGACC AAGAGAAGAT GCAACTCGTA TAGGTTCTGC TGGTGTTGTG AGGCGACAAG 6266 ||| || ||| |||||| ||||| |||| | |||||||| ||||||||| || || |||| ....TGGGCC AAGGGAAGAT GCAACACGTA TTGGTTCTGC TGGTGTTGTC AGACGTCAAG 301 CTGTTGATAT TTCTCCACTC CGTCGTGTCA ACCAAGCAAT ATATCTCCTC ACAACTGGTG 6206 |||||||||| |||||||||| |||||||| | |||||||||| ||| | || |||||||||| CTGTTGATAT TTCTCCACTC CGTCGTGTTA ACCAAGCAAT TTATTTGCTG ACAACTGGTG 361 CACGTGAGAG TGCTTTCAGG AACATCAAGA CCATAGCAGA ATGCCTTGCA GATGAACTCA 6146 |||||||||| |||||||||| |||||||||| ||||||| || |||||||| |||||||||| CACGTGAGAG TGCTTTCAGG AACATCAAGA CCATAGCTGA GTGCCTTGCT GATGAACTCA 421 TTAATGCTGC CAAGGGATCT TCCAACAGGT AATCTTTTCT ATTGCCATCT TTTTACTCCT 6086 | |||||||| |||||| ||| || || || TCAATGCTGC CAAGGGTTCT TCAAATAG.. .......... .......... .......... 449 ATATGCGTTT AATCCTTAAT AAATGTAACT ATATTCTCTG CCTACTTATT CATTCTATGT 6026 .......... .......... .......... .......... .......... .......... 449 ACGTGTAGCT ATGCTATCAA GAAGAAGGAT GAGATTGAGA GGGTTGCCAA GGCCAATCGT 5966 || ||||||| || ||||||||| |||||||| | |||||||||| |||||||||| ........CT ATGCTATTAA GAAGAAGGAC GAGATTGAAA GGGTTGCCAA GGCCAATCGT 501 TGAG 5962 | || TAAG 505 hqPGS_C06HBa0153O03.1-7-_SGN-E325615+ (6745 6501,6321 6118,6017 5962) ******************************************************************************** EST sequence 134 +strand 579 n (File: SGN-E283479+) 1 AAGCACCCAG TTTACATGCC ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG 61 GCTCAGTGCC CAATCGTTGA GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC 121 GGAAAGAAGC TCATGGCTGT TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG 181 ACTGACCAAA ACCCAATTCA AGTCATTGTT GATGCTGTTA TCAACAGTGG GCCAAGGGAA 241 GATGCAACAC GTATTGGTTC TGCTGGTGTT GTCAGACGTC AAGCTGTTGA TATTTCTCCA 301 CTCCGTCGTG TTAACCAAGC AATTTATTTG CTGACAACTG GTGCACGTGA GAGTGCTTTC 361 AGGAACATCA AGACCATAGC TGAGTGCCTT GCTGATGAAC TCATCAATGC TGCCAAGGGT 421 TCTTCAAATA GCTATGCTAT TAAGAAGAAG GACGAGATTG AAAGGGTTGC CAAGGCCAAT 481 CGTTAAGAGA TTGTTGTTGG AGCAACTTTT TCGAGAGACT TTTTGGNTAT GTTATTTTCT 541 CAGATCTGTT TTCATGTAGG CATTATAACA TCTGCTACT Predicted gene structure (within gDNA segment 8193 to 3569): Exon 1 6727 6501 ( 227 n); cDNA 1 227 ( 227 n); score: 0.846 Intron 1 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 2 6321 6118 ( 204 n); cDNA 228 431 ( 204 n); score: 0.907 Intron 2 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0.94) Exon 3 6017 5962 ( 56 n); cDNA 432 487 ( 56 n); score: 0.929 MATCH C06HBa0153O03.1-7- SGN-E283479+ 0.881 487 0.841 C PGS_C06HBa0153O03.1-7-_SGN-E283479+ (6727 6501,6321 6118,6017 5962) Alignment (genomic DNA sequence = upper lines): AAGCATCCTA CATATACACC ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG 6668 ||||| || || | || |||||||||| || || |||| | |||||||| || || ||| AAGCACCCAG TTTACATGCC ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG 60 GCTCAATGCC CAATTGTGGA GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC 6608 ||||| |||| |||| || || |||| | || || || || | |||||||||| |||||||||| GCTCAGTGCC CAATCGTTGA GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC 120 GGGAAGAAGT TGATGGCCGT TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG 6548 || |||||| | ||||| || |||||||||| |||||||| | |||| || || ||| ||||| GGAAAGAAGC TCATGGCTGT TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG 180 ACTGACCTAA ACCCAATCCA AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT 6488 ||||||| || ||||||| || ||| |||||| |||||||||| ||||||| ACTGACCAAA ACCCAATTCA AGTCATTGTT GATGCTGTTA TCAACAG... .......... 227 TCTGATTTTT GCATATTTAT TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG 6428 .......... .......... .......... .......... .......... .......... 227 ATTTTCATAC CATGTCTTCT TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA 6368 .......... .......... .......... .......... .......... .......... 227 GTCATTCTCA TTTTTTCCTT CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG 6308 ||| ||||| |||| .......... .......... .......... .......... ......TGGG CCAAGGGAAG 241 ATGCAACTCG TATAGGTTCT GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC 6248 ||||||| || ||| |||||| |||||||||| | || || || |||||||||| |||||||||| ATGCAACACG TATTGGTTCT GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC 301 TCCGTCGTGT CAACCAAGCA ATATATCTCC TCACAACTGG TGCACGTGAG AGTGCTTTCA 6188 |||||||||| ||||||||| || ||| | | | |||||||| |||||||||| |||||||||| TCCGTCGTGT TAACCAAGCA ATTTATTTGC TGACAACTGG TGCACGTGAG AGTGCTTTCA 361 GGAACATCAA GACCATAGCA GAATGCCTTG CAGATGAACT CATTAATGCT GCCAAGGGAT 6128 |||||||||| ||||||||| || ||||||| | |||||||| ||| |||||| |||||||| | GGAACATCAA GACCATAGCT GAGTGCCTTG CTGATGAACT CATCAATGCT GCCAAGGGTT 421 CTTCCAACAG GTAATCTTTT CTATTGCCAT CTTTTTACTC CTATATGCGT TTAATCCTTA 6068 |||| || || CTTCAAATAG .......... .......... .......... .......... .......... 431 ATAAATGTAA CTATATTCTC TGCCTACTTA TTCATTCTAT GTACGTGTAG CTATGCTATC 6008 ||||||||| .......... .......... .......... .......... .......... CTATGCTATT 441 AAGAAGAAGG ATGAGATTGA GAGGGTTGCC AAGGCCAATC GTTGAG 5962 |||||||||| | |||||||| ||||||||| |||||||||| ||| || AAGAAGAAGG ACGAGATTGA AAGGGTTGCC AAGGCCAATC GTTAAG 487 hqPGS_C06HBa0153O03.1-7-_SGN-E283479+ (6727 6501,6321 6118,6017 5962) ******************************************************************************** EST sequence 6 -strand 582 n (File: SGN-E296307-) 1 AGCACCCAGT TTACATGCCA CACACAGCTG GTAGATACCA GGCCAAGCGT TTCAGGAAGG 61 CTCAGTGCCC AATCGTTGAG AGGCTCACAA ATTCTCTCAT GATGCACGGA AGGAACAACG 121 GAAAGAAGCT CATGGCTGTT CGTATTATTA AGCATGCAAT GGAGATCATT CATTTGTTGA 181 CTGACCAAAA CCCAATTCAA GTCATTGTTG ATGCTGTTAT CAACAGTGGG CCAAGGGAAG 241 ATGCAACACG TATTGGTTCT GCTGGTGTTG TCACACGTCA AGCTGTTGAT ATTTCTCCAC 301 TCCGTCGTGT TAACCAAGCA ATTTATTTGC TGACATCTGG TGCACGTGAG AGTGCTTTCA 361 GGAACATCAA GACCATAGCT GAGTGCCTTG CTGATGAACT CATCAATGCT GCCAAGGGTT 421 CTTCAAATAG CTATGCTATT AAGAAGAAGG ACGAGATTGA AAGGGTTGCC AAGGCCAATC 481 GTTAAGAGAT TGTTGTTGGA GCAACTTTTT CGAGAGACTT TTTGGTTATG TTATTTTCTC 541 AGTTCTGTTT TCATGTAGGC ATTATAGCAT CTGCTACTCC TT Predicted gene structure (within gDNA segment 8193 to 3539): Exon 1 6726 6501 ( 226 n); cDNA 1 226 ( 226 n); score: 0.845 Intron 1 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.86) Exon 2 6321 6118 ( 204 n); cDNA 227 430 ( 204 n); score: 0.897 Intron 2 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0.94) Exon 3 6017 5962 ( 56 n); cDNA 431 486 ( 56 n); score: 0.929 MATCH C06HBa0153O03.1-7- SGN-E296307- 0.877 486 0.835 C PGS_C06HBa0153O03.1-7-_SGN-E296307- (6726 6501,6321 6118,6017 5962) Alignment (genomic DNA sequence = upper lines): AGCATCCTAC ATATACACCA CACACAGCTG GGAGGTACCA AGCCAAGCGG TTTAGAAAGG 6667 |||| || || | ||| |||||||||| | || ||||| |||||||| || || |||| AGCACCCAGT TTACATGCCA CACACAGCTG GTAGATACCA GGCCAAGCGT TTCAGGAAGG 60 CTCAATGCCC AATTGTGGAG AGGTTGACCA ACTCACTGAT GATGCACGGA AGGAACAACG 6607 |||| ||||| ||| || ||| ||| | || | | || || || |||||||||| |||||||||| CTCAGTGCCC AATCGTTGAG AGGCTCACAA ATTCTCTCAT GATGCACGGA AGGAACAACG 120 GGAAGAAGTT GATGGCCGTT CGTATTATTA AGCATGCTAT GGAAATTATC CATCTGTTGA 6547 | |||||| | ||||| ||| |||||||||| ||||||| || ||| || || ||| |||||| GAAAGAAGCT CATGGCTGTT CGTATTATTA AGCATGCAAT GGAGATCATT CATTTGTTGA 180 CTGACCTAAA CCCAATCCAA GTGATTGTTG ATGCTGTTAT CAACAGGTTT AGAGATTATT 6487 |||||| ||| |||||| ||| || ||||||| |||||||||| |||||| CTGACCAAAA CCCAATTCAA GTCATTGTTG ATGCTGTTAT CAACAG.... .......... 226 CTGATTTTTG CATATTTATT AGCTCGAGTT TTTCTTGCTG AGGTCTTGTT AATTAGAAGA 6427 .......... .......... .......... .......... .......... .......... 226 TTTTCATACC ATGTCTTCTT TGTTCCATTT CCATGTCGCG GCATACTTGA GATATTGTAG 6367 .......... .......... .......... .......... .......... .......... 226 TCATTCTCAT TTTTTCCTTC CCATATTCTT ACCTATGTGA TGCAGTGGAC CAAGAGAAGA 6307 ||| | |||| ||||| .......... .......... .......... .......... .....TGGGC CAAGGGAAGA 241 TGCAACTCGT ATAGGTTCTG CTGGTGTTGT GAGGCGACAA GCTGTTGATA TTTCTCCACT 6247 |||||| ||| || ||||||| |||||||||| | || ||| |||||||||| |||||||||| TGCAACACGT ATTGGTTCTG CTGGTGTTGT CACACGTCAA GCTGTTGATA TTTCTCCACT 301 CCGTCGTGTC AACCAAGCAA TATATCTCCT CACAACTGGT GCACGTGAGA GTGCTTTCAG 6187 ||||||||| |||||||||| | ||| | || ||| ||||| |||||||||| |||||||||| CCGTCGTGTT AACCAAGCAA TTTATTTGCT GACATCTGGT GCACGTGAGA GTGCTTTCAG 361 GAACATCAAG ACCATAGCAG AATGCCTTGC AGATGAACTC ATTAATGCTG CCAAGGGATC 6127 |||||||||| |||||||| | | |||||||| ||||||||| || ||||||| ||||||| || GAACATCAAG ACCATAGCTG AGTGCCTTGC TGATGAACTC ATCAATGCTG CCAAGGGTTC 421 TTCCAACAGG TAATCTTTTC TATTGCCATC TTTTTACTCC TATATGCGTT TAATCCTTAA 6067 ||| || || TTCAAATAG. .......... .......... .......... .......... .......... 430 TAAATGTAAC TATATTCTCT GCCTACTTAT TCATTCTATG TACGTGTAGC TATGCTATCA 6007 | |||||||| | .......... .......... .......... .......... .........C TATGCTATTA 441 AGAAGAAGGA TGAGATTGAG AGGGTTGCCA AGGCCAATCG TTGAG 5962 |||||||||| |||||||| |||||||||| |||||||||| || || AGAAGAAGGA CGAGATTGAA AGGGTTGCCA AGGCCAATCG TTAAG 486 hqPGS_C06HBa0153O03.1-7-_SGN-E296307- (6726 6501,6321 6118,6017 5962) ******************************************************************************** EST sequence 7 -strand 582 n (File: SGN-E296445-) 1 AGCACCCAGT TTACATGCCA CACACAGCTG GTAGATACCA GGCCAAGCGT TTCAGGAAGG 61 CTCAGTGCCC AATCGTTGAG AGGCTCACAA ATTCTCTCAT GATGCACGGA AGGAACAACG 121 GAAAGAAGCT CATGGCTGTT CGTATTATTA AGCATGCAAT GGAGATCATT CATTTGTTGA 181 CTGACCAAAA CCCAATTCAA GTCATTGTTG ATGCTGTTAT CAACAGTGGG CCAAGGGAAG 241 ATGCAACACG TATTGGTTCT GCTGGTGTTG TCACACGTCA AGCTGTTGAT ATTTCTCCAC 301 TCCGTCGTGT TAACCAAGCA ATTTATTTGC TGACATCTGG TGCACGTGAG AGTGCTTTCA 361 GGAACATCAA GACCATAGCT GAGTGCCTTG CTGATGAACT CATCAATGCT GCCAAGGGTT 421 CTTCAAATAG CTATGCTATT AAGAAGAAGG ACGAGATTGA AAGGGTTGCC AAGGCCAATC 481 GTTAAGAGAT TGTTGTTGGA GCAACTTTTT CGAGAGACTT TTTGGTTATG TTATTTTCTC 541 AGTTCTGTTT TCATGTAGGC ATTATAGCAT CTGCTACTCC TT Predicted gene structure (within gDNA segment 8193 to 3539): Exon 1 6726 6501 ( 226 n); cDNA 1 226 ( 226 n); score: 0.845 Intron 1 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.86) Exon 2 6321 6118 ( 204 n); cDNA 227 430 ( 204 n); score: 0.897 Intron 2 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0.94) Exon 3 6017 5962 ( 56 n); cDNA 431 486 ( 56 n); score: 0.929 MATCH C06HBa0153O03.1-7- SGN-E296445- 0.877 486 0.835 C PGS_C06HBa0153O03.1-7-_SGN-E296445- (6726 6501,6321 6118,6017 5962) Alignment (genomic DNA sequence = upper lines): AGCATCCTAC ATATACACCA CACACAGCTG GGAGGTACCA AGCCAAGCGG TTTAGAAAGG 6667 |||| || || | ||| |||||||||| | || ||||| |||||||| || || |||| AGCACCCAGT TTACATGCCA CACACAGCTG GTAGATACCA GGCCAAGCGT TTCAGGAAGG 60 CTCAATGCCC AATTGTGGAG AGGTTGACCA ACTCACTGAT GATGCACGGA AGGAACAACG 6607 |||| ||||| ||| || ||| ||| | || | | || || || |||||||||| |||||||||| CTCAGTGCCC AATCGTTGAG AGGCTCACAA ATTCTCTCAT GATGCACGGA AGGAACAACG 120 GGAAGAAGTT GATGGCCGTT CGTATTATTA AGCATGCTAT GGAAATTATC CATCTGTTGA 6547 | |||||| | ||||| ||| |||||||||| ||||||| || ||| || || ||| |||||| GAAAGAAGCT CATGGCTGTT CGTATTATTA AGCATGCAAT GGAGATCATT CATTTGTTGA 180 CTGACCTAAA CCCAATCCAA GTGATTGTTG ATGCTGTTAT CAACAGGTTT AGAGATTATT 6487 |||||| ||| |||||| ||| || ||||||| |||||||||| |||||| CTGACCAAAA CCCAATTCAA GTCATTGTTG ATGCTGTTAT CAACAG.... .......... 226 CTGATTTTTG CATATTTATT AGCTCGAGTT TTTCTTGCTG AGGTCTTGTT AATTAGAAGA 6427 .......... .......... .......... .......... .......... .......... 226 TTTTCATACC ATGTCTTCTT TGTTCCATTT CCATGTCGCG GCATACTTGA GATATTGTAG 6367 .......... .......... .......... .......... .......... .......... 226 TCATTCTCAT TTTTTCCTTC CCATATTCTT ACCTATGTGA TGCAGTGGAC CAAGAGAAGA 6307 ||| | |||| ||||| .......... .......... .......... .......... .....TGGGC CAAGGGAAGA 241 TGCAACTCGT ATAGGTTCTG CTGGTGTTGT GAGGCGACAA GCTGTTGATA TTTCTCCACT 6247 |||||| ||| || ||||||| |||||||||| | || ||| |||||||||| |||||||||| TGCAACACGT ATTGGTTCTG CTGGTGTTGT CACACGTCAA GCTGTTGATA TTTCTCCACT 301 CCGTCGTGTC AACCAAGCAA TATATCTCCT CACAACTGGT GCACGTGAGA GTGCTTTCAG 6187 ||||||||| |||||||||| | ||| | || ||| ||||| |||||||||| |||||||||| CCGTCGTGTT AACCAAGCAA TTTATTTGCT GACATCTGGT GCACGTGAGA GTGCTTTCAG 361 GAACATCAAG ACCATAGCAG AATGCCTTGC AGATGAACTC ATTAATGCTG CCAAGGGATC 6127 |||||||||| |||||||| | | |||||||| ||||||||| || ||||||| ||||||| || GAACATCAAG ACCATAGCTG AGTGCCTTGC TGATGAACTC ATCAATGCTG CCAAGGGTTC 421 TTCCAACAGG TAATCTTTTC TATTGCCATC TTTTTACTCC TATATGCGTT TAATCCTTAA 6067 ||| || || TTCAAATAG. .......... .......... .......... .......... .......... 430 TAAATGTAAC TATATTCTCT GCCTACTTAT TCATTCTATG TACGTGTAGC TATGCTATCA 6007 | |||||||| | .......... .......... .......... .......... .........C TATGCTATTA 441 AGAAGAAGGA TGAGATTGAG AGGGTTGCCA AGGCCAATCG TTGAG 5962 |||||||||| |||||||| |||||||||| |||||||||| || || AGAAGAAGGA CGAGATTGAA AGGGTTGCCA AGGCCAATCG TTAAG 486 hqPGS_C06HBa0153O03.1-7-_SGN-E296445- (6726 6501,6321 6118,6017 5962) ******************************************************************************** EST sequence 37 +strand 646 n (File: SGN-E350031+) 1 TTTACATGCC ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC 61 CAATCGTTGA GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC 121 TCATGGCTGT TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA 181 ACCCAATTCA AGTCATTGTT GATGCTGTTA TCAACAGTGG GCCAAGGGAA GATGCAACAC 241 GTATTGGTTC TGCTGGTGTT GTCAGACGTC AAGCTGTTGA TATTTCTCCA CTCCGTCGTG 301 TTAACCAAGC AATTTATTTG CTGACAACTG GTGCACGTGA GAGTGCTTTC AGGAACATCA 361 AGACCATAGC TGAGTGCCTT GCTGATGAAC TCATCAATGC TGCCAAGGGT TCTTCAAATA 421 GCTATGCTAT TAAGAAGAAG GACGAGATTG AAAGGGTTGC CAAGGCCAAT CGTTAAGAGA 481 TTGTTGTTGG AGCAACTTTT TCGAGAGACT TTTTGGTTAT GTTATTTTCT CAGTTCTGTT 541 TTCATGTAGG CATTATAGCA TCTGCTACTC CTTATGGATT TAGTTTCTTG GAGGATTTAT 601 GGTTTGGTAT TGTTATAAAT GTTAAATTTT TGAAGTTCCT TTATTC Predicted gene structure (within gDNA segment 8118 to 2799): Exon 1 6715 6501 ( 215 n); cDNA 3 217 ( 215 n); score: 0.860 Intron 1 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 2 6321 6118 ( 204 n); cDNA 218 421 ( 204 n); score: 0.907 Intron 2 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0.94) Exon 3 6017 5962 ( 56 n); cDNA 422 477 ( 56 n); score: 0.929 MATCH C06HBa0153O03.1-7- SGN-E350031+ 0.888 475 0.735 C PGS_C06HBa0153O03.1-7-_SGN-E350031+ (6715 6501,6321 6118,6017 5962) Alignment (genomic DNA sequence = upper lines): TATACACCAC ACACAGCTGG GAGGTACCAA GCCAAGCGGT TTAGAAAGGC TCAATGCCCA 6656 || | |||| |||||||||| || ||||| |||||||| | | || ||||| ||| |||||| TACATGCCAC ACACAGCTGG TAGATACCAG GCCAAGCGTT TCAGGAAGGC TCAGTGCCCA 62 ATTGTGGAGA GGTTGACCAA CTCACTGATG ATGCACGGAA GGAACAACGG GAAGAAGTTG 6596 || || |||| || | || || || || ||| |||||||||| |||||||||| |||||| | ATCGTTGAGA GGCTCACAAA TTCTCTCATG ATGCACGGAA GGAACAACGG AAAGAAGCTC 122 ATGGCCGTTC GTATTATTAA GCATGCTATG GAAATTATCC ATCTGTTGAC TGACCTAAAC 6536 ||||| |||| |||||||||| |||||| ||| || || || | || ||||||| ||||| |||| ATGGCTGTTC GTATTATTAA GCATGCAATG GAGATCATTC ATTTGTTGAC TGACCAAAAC 182 CCAATCCAAG TGATTGTTGA TGCTGTTATC AACAGGTTTA GAGATTATTC TGATTTTTGC 6476 ||||| |||| | |||||||| |||||||||| ||||| CCAATTCAAG TCATTGTTGA TGCTGTTATC AACAG..... .......... .......... 217 ATATTTATTA GCTCGAGTTT TTCTTGCTGA GGTCTTGTTA ATTAGAAGAT TTTCATACCA 6416 .......... .......... .......... .......... .......... .......... 217 TGTCTTCTTT GTTCCATTTC CATGTCGCGG CATACTTGAG ATATTGTAGT CATTCTCATT 6356 .......... .......... .......... .......... .......... .......... 217 TTTTCCTTCC CATATTCTTA CCTATGTGAT GCAGTGGACC AAGAGAAGAT GCAACTCGTA 6296 ||| || ||| |||||| ||||| |||| .......... .......... .......... ....TGGGCC AAGGGAAGAT GCAACACGTA 243 TAGGTTCTGC TGGTGTTGTG AGGCGACAAG CTGTTGATAT TTCTCCACTC CGTCGTGTCA 6236 | |||||||| ||||||||| || || |||| |||||||||| |||||||||| |||||||| | TTGGTTCTGC TGGTGTTGTC AGACGTCAAG CTGTTGATAT TTCTCCACTC CGTCGTGTTA 303 ACCAAGCAAT ATATCTCCTC ACAACTGGTG CACGTGAGAG TGCTTTCAGG AACATCAAGA 6176 |||||||||| ||| | || |||||||||| |||||||||| |||||||||| |||||||||| ACCAAGCAAT TTATTTGCTG ACAACTGGTG CACGTGAGAG TGCTTTCAGG AACATCAAGA 363 CCATAGCAGA ATGCCTTGCA GATGAACTCA TTAATGCTGC CAAGGGATCT TCCAACAGGT 6116 ||||||| || |||||||| |||||||||| | |||||||| |||||| ||| || || || CCATAGCTGA GTGCCTTGCT GATGAACTCA TCAATGCTGC CAAGGGTTCT TCAAATAG.. 421 AATCTTTTCT ATTGCCATCT TTTTACTCCT ATATGCGTTT AATCCTTAAT AAATGTAACT 6056 .......... .......... .......... .......... .......... .......... 421 ATATTCTCTG CCTACTTATT CATTCTATGT ACGTGTAGCT ATGCTATCAA GAAGAAGGAT 5996 || ||||||| || ||||||||| .......... .......... .......... ........CT ATGCTATTAA GAAGAAGGAC 443 GAGATTGAGA GGGTTGCCAA GGCCAATCGT TGAG 5962 |||||||| | |||||||||| |||||||||| | || GAGATTGAAA GGGTTGCCAA GGCCAATCGT TAAG 477 hqPGS_C06HBa0153O03.1-7-_SGN-E350031+ (6715 6501,6321 6118,6017 5962) ******************************************************************************** EST sequence 93 +strand 595 n (File: SGN-E342485+) 1 AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA GAGGCTCACA AATTCTCTCA 61 TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT TCGTATTATT AAGCATGCAA 121 TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA AGTCATTGTT GATGCTGTTA 181 TCAACAGTGG GCCAAGGGAA GATGCAACAC GTATTGGTTC TGCTGGTGTT GTCAGACGTC 241 AAGCTGTTGA TATTTCTCCA CTCCGTCGTG TTAACCAAGC AATTTATTTG CTGACAACTG 301 GTGCACGTGA GAGTGCTNTC AGGAACATCA AGACCATAGC TGAGTGCCTT GCTGATGAAC 361 TCATCAATGC TGCCAAGGGT TCTTCAAATA GCTATGCTAT TAAGAAGAAG GACGAGATTG 421 AAAGGGTTGC CAAGGCCAAT CGTTAAGAGA ATTGTTGTTG GAGCAACTTT TTCGAAAGAC 481 TTTTTGGTTA TGTTATTTTT CTCAGTTCTG TTTTCATGTA GGCATTATAG CATCTGCTAC 541 TCCTTATGGA TTTAGTTCTT GGAGGATTTA TGTTTGGATT TGTATAAATG TTAAA Predicted gene structure (within gDNA segment 7818 to 3009): Exon 1 6685 6501 ( 185 n); cDNA 3 187 ( 185 n); score: 0.870 Intron 1 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 2 6321 6118 ( 204 n); cDNA 188 391 ( 204 n); score: 0.902 Intron 2 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0.94) Exon 3 6017 5962 ( 56 n); cDNA 392 447 ( 56 n); score: 0.929 MATCH C06HBa0153O03.1-7- SGN-E342485+ 0.892 445 0.748 C PGS_C06HBa0153O03.1-7-_SGN-E342485+ (6685 6501,6321 6118,6017 5962) Alignment (genomic DNA sequence = upper lines): GCCAAGCGGT TTAGAAAGGC TCAATGCCCA ATTGTGGAGA GGTTGACCAA CTCACTGATG 6626 |||||||| | | || ||||| ||| |||||| || || |||| || | || || || || ||| GCCAAGCGTT TCAGGAAGGC TCAGTGCCCA ATCGTTGAGA GGCTCACAAA TTCTCTCATG 62 ATGCACGGAA GGAACAACGG GAAGAAGTTG ATGGCCGTTC GTATTATTAA GCATGCTATG 6566 |||||||||| |||||||||| |||||| | ||||| |||| |||||||||| |||||| ||| ATGCACGGAA GGAACAACGG AAAGAAGCTC ATGGCTGTTC GTATTATTAA GCATGCAATG 122 GAAATTATCC ATCTGTTGAC TGACCTAAAC CCAATCCAAG TGATTGTTGA TGCTGTTATC 6506 || || || | || ||||||| ||||| |||| ||||| |||| | |||||||| |||||||||| GAGATCATTC ATTTGTTGAC TGACCAAAAC CCAATTCAAG TCATTGTTGA TGCTGTTATC 182 AACAGGTTTA GAGATTATTC TGATTTTTGC ATATTTATTA GCTCGAGTTT TTCTTGCTGA 6446 ||||| AACAG..... .......... .......... .......... .......... .......... 187 GGTCTTGTTA ATTAGAAGAT TTTCATACCA TGTCTTCTTT GTTCCATTTC CATGTCGCGG 6386 .......... .......... .......... .......... .......... .......... 187 CATACTTGAG ATATTGTAGT CATTCTCATT TTTTCCTTCC CATATTCTTA CCTATGTGAT 6326 .......... .......... .......... .......... .......... .......... 187 GCAGTGGACC AAGAGAAGAT GCAACTCGTA TAGGTTCTGC TGGTGTTGTG AGGCGACAAG 6266 ||| || ||| |||||| ||||| |||| | |||||||| ||||||||| || || |||| ....TGGGCC AAGGGAAGAT GCAACACGTA TTGGTTCTGC TGGTGTTGTC AGACGTCAAG 243 CTGTTGATAT TTCTCCACTC CGTCGTGTCA ACCAAGCAAT ATATCTCCTC ACAACTGGTG 6206 |||||||||| |||||||||| |||||||| | |||||||||| ||| | || |||||||||| CTGTTGATAT TTCTCCACTC CGTCGTGTTA ACCAAGCAAT TTATTTGCTG ACAACTGGTG 303 CACGTGAGAG TGCTTTCAGG AACATCAAGA CCATAGCAGA ATGCCTTGCA GATGAACTCA 6146 |||||||||| |||| ||||| |||||||||| ||||||| || |||||||| |||||||||| CACGTGAGAG TGCTNTCAGG AACATCAAGA CCATAGCTGA GTGCCTTGCT GATGAACTCA 363 TTAATGCTGC CAAGGGATCT TCCAACAGGT AATCTTTTCT ATTGCCATCT TTTTACTCCT 6086 | |||||||| |||||| ||| || || || TCAATGCTGC CAAGGGTTCT TCAAATAG.. .......... .......... .......... 391 ATATGCGTTT AATCCTTAAT AAATGTAACT ATATTCTCTG CCTACTTATT CATTCTATGT 6026 .......... .......... .......... .......... .......... .......... 391 ACGTGTAGCT ATGCTATCAA GAAGAAGGAT GAGATTGAGA GGGTTGCCAA GGCCAATCGT 5966 || ||||||| || ||||||||| |||||||| | |||||||||| |||||||||| ........CT ATGCTATTAA GAAGAAGGAC GAGATTGAAA GGGTTGCCAA GGCCAATCGT 443 TGAG 5962 | || TAAG 447 hqPGS_C06HBa0153O03.1-7-_SGN-E342485+ (6685 6501,6321 6118,6017 5962) ******************************************************************************** EST sequence 61 +strand 674 n (File: SGN-E350864+) 1 GAAGGCTCAG TGCCCAATCG TTGAGAGGCT CACAAATTCT CTCATGATGC ACGGAAGGAA 61 CAACGGAAAG AAGCTCATGG CTGTTCGTAT TATTAAGCAT GCAATGGAGA TCATTCATTT 121 GTTGACTGAC CAAAACCCAA TTCAAGTCAT TGTTGATGCT GTTATCAACA GTGGGCCAAG 181 GGAAGATGCA ACACGTATTG GTTCTGCTGG TGTTGTCAGA CGTCAAGCTG TTGATATTTC 241 TCCACTCCGT CGTGTTAACC AAGCAATTTA TTTGCTGACA ACTGGTGCAC GTGAGAGTGC 301 TTTCAGGAAC ATCAAGACCA TAGCTGAGTG CCTTGCTGAT GAACTCATCA ATGCTGCCAA 361 GGGTTCTTCA AATAGCTATG CTATTAAGAA GAAGGACGAG ATTGAAAGGG TTGCCAAGGC 421 CAATCGTTAA GAGATTGTTG TTGGAGCAAC TTTTTCGAGA GACTTTTTGG TTATGTTATT 481 TTCTCAGTTC TGTTTTCATG TAGGCATTAT AGCATCTGCT ACTCCTTATG GATTTAGTTT 541 CTTGGAGGAT TTATGTTTGG TATTGTTATA AATGTTAAAT TTTGAAGTTC CTTTATTCGG 601 GTTCTCAGTA GAGTTTCGTT AAACACGGTA TTTTGTGATT TTCCTTAGAT GTTTTGAGAT 661 ACTTCCCAAA AAAA Predicted gene structure (within gDNA segment 7658 to 2059): Exon 1 6670 6501 ( 170 n); cDNA 2 171 ( 170 n); score: 0.876 Intron 1 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 2 6321 6118 ( 204 n); cDNA 172 375 ( 204 n); score: 0.907 Intron 2 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0.94) Exon 3 6017 5962 ( 56 n); cDNA 376 431 ( 56 n); score: 0.929 MATCH C06HBa0153O03.1-7- SGN-E350864+ 0.898 430 0.638 C PGS_C06HBa0153O03.1-7-_SGN-E350864+ (6670 6501,6321 6118,6017 5962) Alignment (genomic DNA sequence = upper lines): AAGGCTCAAT GCCCAATTGT GGAGAGGTTG ACCAACTCAC TGATGATGCA CGGAAGGAAC 6611 |||||||| | ||||||| || |||||| | || || || | | |||||||| |||||||||| AAGGCTCAGT GCCCAATCGT TGAGAGGCTC ACAAATTCTC TCATGATGCA CGGAAGGAAC 61 AACGGGAAGA AGTTGATGGC CGTTCGTATT ATTAAGCATG CTATGGAAAT TATCCATCTG 6551 ||||| |||| || | ||||| ||||||||| |||||||||| | ||||| || || ||| || AACGGAAAGA AGCTCATGGC TGTTCGTATT ATTAAGCATG CAATGGAGAT CATTCATTTG 121 TTGACTGACC TAAACCCAAT CCAAGTGATT GTTGATGCTG TTATCAACAG GTTTAGAGAT 6491 |||||||||| ||||||||| ||||| ||| |||||||||| |||||||||| TTGACTGACC AAAACCCAAT TCAAGTCATT GTTGATGCTG TTATCAACAG .......... 171 TATTCTGATT TTTGCATATT TATTAGCTCG AGTTTTTCTT GCTGAGGTCT TGTTAATTAG 6431 .......... .......... .......... .......... .......... .......... 171 AAGATTTTCA TACCATGTCT TCTTTGTTCC ATTTCCATGT CGCGGCATAC TTGAGATATT 6371 .......... .......... .......... .......... .......... .......... 171 GTAGTCATTC TCATTTTTTC CTTCCCATAT TCTTACCTAT GTGATGCAGT GGACCAAGAG 6311 | || ||||| | .......... .......... .......... .......... .........T GGGCCAAGGG 182 AAGATGCAAC TCGTATAGGT TCTGCTGGTG TTGTGAGGCG ACAAGCTGTT GATATTTCTC 6251 |||||||||| ||||| ||| |||||||||| |||| || || ||||||||| |||||||||| AAGATGCAAC ACGTATTGGT TCTGCTGGTG TTGTCAGACG TCAAGCTGTT GATATTTCTC 242 CACTCCGTCG TGTCAACCAA GCAATATATC TCCTCACAAC TGGTGCACGT GAGAGTGCTT 6191 |||||||||| ||| |||||| ||||| ||| | || ||||| |||||||||| |||||||||| CACTCCGTCG TGTTAACCAA GCAATTTATT TGCTGACAAC TGGTGCACGT GAGAGTGCTT 302 TCAGGAACAT CAAGACCATA GCAGAATGCC TTGCAGATGA ACTCATTAAT GCTGCCAAGG 6131 |||||||||| |||||||||| || || |||| |||| ||||| |||||| ||| |||||||||| TCAGGAACAT CAAGACCATA GCTGAGTGCC TTGCTGATGA ACTCATCAAT GCTGCCAAGG 362 GATCTTCCAA CAGGTAATCT TTTCTATTGC CATCTTTTTA CTCCTATATG CGTTTAATCC 6071 | ||||| || || GTTCTTCAAA TAG....... .......... .......... .......... .......... 375 TTAATAAATG TAACTATATT CTCTGCCTAC TTATTCATTC TATGTACGTG TAGCTATGCT 6011 ||||||| .......... .......... .......... .......... .......... ...CTATGCT 382 ATCAAGAAGA AGGATGAGAT TGAGAGGGTT GCCAAGGCCA ATCGTTGAG 5962 || ||||||| |||| ||||| ||| |||||| |||||||||| |||||| || ATTAAGAAGA AGGACGAGAT TGAAAGGGTT GCCAAGGCCA ATCGTTAAG 431 hqPGS_C06HBa0153O03.1-7-_SGN-E350864+ (6670 6501,6321 6118,6017 5962) ******************************************************************************** EST sequence 127 +strand 599 n (File: SGN-E357024+) 1 GGAAAGAAGC TCATGGCTGT TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG 61 ACTGACCAAA ACCCAATTCA AGTCATTGTT GATGCTGTTA TCAACAGTGG GCCAAGGGAA 121 GATGCAACAC GTATTGGTTC TGCTGGTGTT GTCAGACGTC AAGCTGTTGA TATTTCTCCA 181 CTCCGTCGTG TTAACCAAGC AATTTATTTG CTGACAACTG GTGCACGTGA GAGTGCTTTC 241 AGGAACATCA AGACCATAGC TGAGTGCCTT GCTGATGAAC TCATCAATGC TGCCAAGGGT 301 TCTTCAAATA GCTATGCTAT TAAGAAGAAG GACGAGATTG AAAGGGTTGC CAAGGCCAAT 361 CGTTAAGAGA TTGTTGTTGG AGCAACTTTT TCGAGAGACT TTTTGGTTAT GTTATTTTCT 421 CAGTTCTGTT TTCATGTAGG CATTATAGCA TCTGCTACTC CTTATGGATT TAGTTTCTTG 481 GAGGATTTAT GTTTGGTATT GTTATAAATG TTAAATTTTG AAGTTCCTTT ATTCGGGTTC 541 TCAGTAGAGT TTCGTTAAAC ACGGTATTTT GTGATTTTCT TTAGATGTTT TGAAGATAC Predicted gene structure (within gDNA segment 8193 to 2169): Exon 1 6607 6501 ( 107 n); cDNA 1 107 ( 107 n); score: 0.888 Intron 1 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 2 6321 6118 ( 204 n); cDNA 108 311 ( 204 n); score: 0.907 Intron 2 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0.94) Exon 3 6017 5962 ( 56 n); cDNA 312 367 ( 56 n); score: 0.929 MATCH C06HBa0153O03.1-7- SGN-E357024+ 0.905 367 0.613 C PGS_C06HBa0153O03.1-7-_SGN-E357024+ (6607 6501,6321 6118,6017 5962) Alignment (genomic DNA sequence = upper lines): GGGAAGAAGT TGATGGCCGT TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG 6548 || |||||| | ||||| || |||||||||| |||||||| | |||| || || ||| ||||| GGAAAGAAGC TCATGGCTGT TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG 60 ACTGACCTAA ACCCAATCCA AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT 6488 ||||||| || ||||||| || ||| |||||| |||||||||| ||||||| ACTGACCAAA ACCCAATTCA AGTCATTGTT GATGCTGTTA TCAACAG... .......... 107 TCTGATTTTT GCATATTTAT TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG 6428 .......... .......... .......... .......... .......... .......... 107 ATTTTCATAC CATGTCTTCT TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA 6368 .......... .......... .......... .......... .......... .......... 107 GTCATTCTCA TTTTTTCCTT CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG 6308 ||| ||||| |||| .......... .......... .......... .......... ......TGGG CCAAGGGAAG 121 ATGCAACTCG TATAGGTTCT GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC 6248 ||||||| || ||| |||||| |||||||||| | || || || |||||||||| |||||||||| ATGCAACACG TATTGGTTCT GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC 181 TCCGTCGTGT CAACCAAGCA ATATATCTCC TCACAACTGG TGCACGTGAG AGTGCTTTCA 6188 |||||||||| ||||||||| || ||| | | | |||||||| |||||||||| |||||||||| TCCGTCGTGT TAACCAAGCA ATTTATTTGC TGACAACTGG TGCACGTGAG AGTGCTTTCA 241 GGAACATCAA GACCATAGCA GAATGCCTTG CAGATGAACT CATTAATGCT GCCAAGGGAT 6128 |||||||||| ||||||||| || ||||||| | |||||||| ||| |||||| |||||||| | GGAACATCAA GACCATAGCT GAGTGCCTTG CTGATGAACT CATCAATGCT GCCAAGGGTT 301 CTTCCAACAG GTAATCTTTT CTATTGCCAT CTTTTTACTC CTATATGCGT TTAATCCTTA 6068 |||| || || CTTCAAATAG .......... .......... .......... .......... .......... 311 ATAAATGTAA CTATATTCTC TGCCTACTTA TTCATTCTAT GTACGTGTAG CTATGCTATC 6008 ||||||||| .......... .......... .......... .......... .......... CTATGCTATT 321 AAGAAGAAGG ATGAGATTGA GAGGGTTGCC AAGGCCAATC GTTGAG 5962 |||||||||| | |||||||| ||||||||| |||||||||| ||| || AAGAAGAAGG ACGAGATTGA AAGGGTTGCC AAGGCCAATC GTTAAG 367 hqPGS_C06HBa0153O03.1-7-_SGN-E357024+ (6607 6501,6321 6118,6017 5962) ******************************************************************************** EST sequence 104 +strand 567 n (File: SGN-E334977+) 1 AGAAGCTCAT GGCTGTTCGT ATTATTAAGC ATGCAATGGA GATCATTCAT TTGTTGACTG 61 ACCAAAACCC AATTCAAGTC ATTGTTGATG CTGTTATCAA CAGTGGGCCA AGGGAAGATG 121 CAACACGTAT TGGTTCTGCT GGTGTTGTCA GACGTCAAGC TGTTGATATT TCTCCACTCC 181 GTCGTGTTAA CCAAGCAATT TATTTGCTGA CAACTGGTGC ACGTGAGAGT GCTTTCAGGA 241 ACATCAAGAC CATAGCTGAG TGCCTTGCTG ATGAACTCAT CAATGCTGCC AAGGGTTCTT 301 CAAATAGCTA TGCTATTAAG AAGAAGGACG AGATTGAAAG GGTTGCCAAG GCCAATCGTT 361 AAGAGATTGT TGTTGGAGCA ACTTTTTCGA GAGACTTTTT GGTTATGTTA TTTTCTCAGT 421 TCTGTTTTCA TGTAGGCATT ATAGCATCTG CTACTCCTTA TGGATTTAGT TTCTTGGAGG 481 ATTTATGTTT GGTATTGTTA TAAATGTTAA ATTTTGAAGT TCCTTTATTC GGGTTCTCAG 541 TAGAGTTTCG TTAAACAAAA AAAAAAA Predicted gene structure (within gDNA segment 8193 to 2449): Exon 1 6603 6501 ( 103 n); cDNA 1 103 ( 103 n); score: 0.893 Intron 1 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 2 6321 6118 ( 204 n); cDNA 104 307 ( 204 n); score: 0.907 Intron 2 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0.94) Exon 3 6017 5962 ( 56 n); cDNA 308 363 ( 56 n); score: 0.929 PPA cDNA 553 567 MATCH C06HBa0153O03.1-7- SGN-E334977+ 0.906 363 0.640 C PGS_C06HBa0153O03.1-7-_SGN-E334977+ (6603 6501,6321 6118,6017 5962) Alignment (genomic DNA sequence = upper lines): AGAAGTTGAT GGCCGTTCGT ATTATTAAGC ATGCTATGGA AATTATCCAT CTGTTGACTG 6544 ||||| | || ||| |||||| |||||||||| |||| ||||| || || ||| ||||||||| AGAAGCTCAT GGCTGTTCGT ATTATTAAGC ATGCAATGGA GATCATTCAT TTGTTGACTG 60 ACCTAAACCC AATCCAAGTG ATTGTTGATG CTGTTATCAA CAGGTTTAGA GATTATTCTG 6484 ||| |||||| ||| ||||| |||||||||| |||||||||| ||| ACCAAAACCC AATTCAAGTC ATTGTTGATG CTGTTATCAA CAG....... .......... 103 ATTTTTGCAT ATTTATTAGC TCGAGTTTTT CTTGCTGAGG TCTTGTTAAT TAGAAGATTT 6424 .......... .......... .......... .......... .......... .......... 103 TCATACCATG TCTTCTTTGT TCCATTTCCA TGTCGCGGCA TACTTGAGAT ATTGTAGTCA 6364 .......... .......... .......... .......... .......... .......... 103 TTCTCATTTT TTCCTTCCCA TATTCTTACC TATGTGATGC AGTGGACCAA GAGAAGATGC 6304 ||| |||| | |||||||| .......... .......... .......... .......... ..TGGGCCAA GGGAAGATGC 121 AACTCGTATA GGTTCTGCTG GTGTTGTGAG GCGACAAGCT GTTGATATTT CTCCACTCCG 6244 ||| ||||| |||||||||| ||||||| || || |||||| |||||||||| |||||||||| AACACGTATT GGTTCTGCTG GTGTTGTCAG ACGTCAAGCT GTTGATATTT CTCCACTCCG 181 TCGTGTCAAC CAAGCAATAT ATCTCCTCAC AACTGGTGCA CGTGAGAGTG CTTTCAGGAA 6184 |||||| ||| |||||||| | || | || || |||||||||| |||||||||| |||||||||| TCGTGTTAAC CAAGCAATTT ATTTGCTGAC AACTGGTGCA CGTGAGAGTG CTTTCAGGAA 241 CATCAAGACC ATAGCAGAAT GCCTTGCAGA TGAACTCATT AATGCTGCCA AGGGATCTTC 6124 |||||||||| ||||| || | ||||||| || ||||||||| |||||||||| |||| ||||| CATCAAGACC ATAGCTGAGT GCCTTGCTGA TGAACTCATC AATGCTGCCA AGGGTTCTTC 301 CAACAGGTAA TCTTTTCTAT TGCCATCTTT TTACTCCTAT ATGCGTTTAA TCCTTAATAA 6064 || || AAATAG.... .......... .......... .......... .......... .......... 307 ATGTAACTAT ATTCTCTGCC TACTTATTCA TTCTATGTAC GTGTAGCTAT GCTATCAAGA 6004 |||| ||||| |||| .......... .......... .......... .......... ......CTAT GCTATTAAGA 321 AGAAGGATGA GATTGAGAGG GTTGCCAAGG CCAATCGTTG AG 5962 ||||||| || |||||| ||| |||||||||| ||||||||| || AGAAGGACGA GATTGAAAGG GTTGCCAAGG CCAATCGTTA AG 363 hqPGS_C06HBa0153O03.1-7-_SGN-E334977+ (6603 6501,6321 6118,6017 5962) ******************************************************************************** EST sequence 8 -strand 549 n (File: SGN-E292322-) 1 CGTATTATTA AGCATGCAAT GGAGATCATT CATTTGTTGA CTGACCAAAA CCCAATTCAA 61 GTCATTGTTG ATGCTGTTAT CAACAGTGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 121 GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT TAACCAAGCA 181 ATTTATTTGC TGACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GGCCATAGCT 241 GAGTGCCTTG CTGATGAACT CATCAATGCT GCCAAGGGTT CTTCAAATAG CTATGCTATT 301 AAGAAGAAGG ACGAGATTGA AAGGGTTGCC AAGGCCAATC GTTAAGAGAT TGTTGTTGGA 361 GCAACTTTTT CGAGAGACTT TTTGGTTATG TTATTTTCTC AGTTCTGTTT TCATGTAGGC 421 ATTATAGCAT CTGCTACTCC TTATGGATTT AGTTTCTTGG AGGATTTATG TTTGGTATTG 481 TTATAAATGT TAAATTTTGA AGTTCCTTTA TTCGGGTTCT CAGTAGAGTT TCGTTAAAAA 541 AAAAAAAAA Predicted gene structure (within gDNA segment 8043 to 2397): Exon 1 6586 6501 ( 86 n); cDNA 1 86 ( 86 n); score: 0.907 Intron 1 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 2 6321 6118 ( 204 n); cDNA 87 290 ( 204 n); score: 0.902 Intron 2 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0.94) Exon 3 6017 5962 ( 56 n); cDNA 291 346 ( 56 n); score: 0.929 PPA cDNA 536 549 MATCH C06HBa0153O03.1-7- SGN-E292322- 0.908 346 0.630 C PGS_C06HBa0153O03.1-7-_SGN-E292322- (6586 6501,6321 6118,6017 5962) Alignment (genomic DNA sequence = upper lines): CGTATTATTA AGCATGCTAT GGAAATTATC CATCTGTTGA CTGACCTAAA CCCAATCCAA 6527 |||||||||| ||||||| || ||| || || ||| |||||| |||||| ||| |||||| ||| CGTATTATTA AGCATGCAAT GGAGATCATT CATTTGTTGA CTGACCAAAA CCCAATTCAA 60 GTGATTGTTG ATGCTGTTAT CAACAGGTTT AGAGATTATT CTGATTTTTG CATATTTATT 6467 || ||||||| |||||||||| |||||| GTCATTGTTG ATGCTGTTAT CAACAG.... .......... .......... .......... 86 AGCTCGAGTT TTTCTTGCTG AGGTCTTGTT AATTAGAAGA TTTTCATACC ATGTCTTCTT 6407 .......... .......... .......... .......... .......... .......... 86 TGTTCCATTT CCATGTCGCG GCATACTTGA GATATTGTAG TCATTCTCAT TTTTTCCTTC 6347 .......... .......... .......... .......... .......... .......... 86 CCATATTCTT ACCTATGTGA TGCAGTGGAC CAAGAGAAGA TGCAACTCGT ATAGGTTCTG 6287 ||| | |||| ||||| |||||| ||| || ||||||| .......... .......... .....TGGGC CAAGGGAAGA TGCAACACGT ATTGGTTCTG 121 CTGGTGTTGT GAGGCGACAA GCTGTTGATA TTTCTCCACT CCGTCGTGTC AACCAAGCAA 6227 |||||||||| || || ||| |||||||||| |||||||||| ||||||||| |||||||||| CTGGTGTTGT CAGACGTCAA GCTGTTGATA TTTCTCCACT CCGTCGTGTT AACCAAGCAA 181 TATATCTCCT CACAACTGGT GCACGTGAGA GTGCTTTCAG GAACATCAAG ACCATAGCAG 6167 | ||| | || ||||||||| |||||||||| |||||||||| |||||||||| ||||||| | TTTATTTGCT GACAACTGGT GCACGTGAGA GTGCTTTCAG GAACATCAAG GCCATAGCTG 241 AATGCCTTGC AGATGAACTC ATTAATGCTG CCAAGGGATC TTCCAACAGG TAATCTTTTC 6107 | |||||||| ||||||||| || ||||||| ||||||| || ||| || || AGTGCCTTGC TGATGAACTC ATCAATGCTG CCAAGGGTTC TTCAAATAG. .......... 290 TATTGCCATC TTTTTACTCC TATATGCGTT TAATCCTTAA TAAATGTAAC TATATTCTCT 6047 .......... .......... .......... .......... .......... .......... 290 GCCTACTTAT TCATTCTATG TACGTGTAGC TATGCTATCA AGAAGAAGGA TGAGATTGAG 5987 | |||||||| | |||||||||| |||||||| .......... .......... .........C TATGCTATTA AGAAGAAGGA CGAGATTGAA 321 AGGGTTGCCA AGGCCAATCG TTGAG 5962 |||||||||| |||||||||| || || AGGGTTGCCA AGGCCAATCG TTAAG 346 hqPGS_C06HBa0153O03.1-7-_SGN-E292322- (6586 6501,6321 6118,6017 5962) ******************************************************************************** EST sequence 62 +strand 525 n (File: SGN-E350734+) 1 TTAAGCATGC AATGGAGATC ATTCATTTGT TGACTGACCA AAACCCAATT CAAGTCATTG 61 TTGATGCTGT TATCAACAGT GGGCCAAGGG AAGATGCAAC ACGTATTGGT TCTGCTGGTG 121 TTGTCAGACG TCAAGCTGTT GATATTTCTC CACTCCGTCG TGTTAACCAA GCAATTTATT 181 TGCTGACAAC TGGTGCACGT GAGAGTGCTT TCAGGAACAT CAAGACCATA GCTGAGTGCC 241 TTGCTGATGA ACTCATCAAT GCTGCCAAGG GTTCTTCAAA TAGCTATGCT ATTAAGAAGA 301 AGGACGAGAT TGAAAGGGTT GCCAAGGCCA ATCGTTAAGA GATTGTTGTT GGAGCAACTT 361 TTTCGAGAGA CTTTTTGGTT ATGTTATTTT CTCAGTTCTG TTTTCATGTA GGCATTATAG 421 CATCTGCTAC TCCTTATGGA TTTAGTTTCT TGGAGGATTT ATGTTTGGTA TTGTTATAAA 481 TGTTAAATTT TGAAGTTCCT TTATTCGGGT TCTCAAAAAA AAAAA Predicted gene structure (within gDNA segment 7963 to 2629): Exon 1 6579 6501 ( 79 n); cDNA 1 79 ( 79 n); score: 0.899 Intron 1 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 2 6321 6118 ( 204 n); cDNA 80 283 ( 204 n); score: 0.907 Intron 2 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0.94) Exon 3 6017 5962 ( 56 n); cDNA 284 339 ( 56 n); score: 0.929 PPA cDNA 515 525 MATCH C06HBa0153O03.1-7- SGN-E350734+ 0.909 339 0.646 C PGS_C06HBa0153O03.1-7-_SGN-E350734+ (6579 6501,6321 6118,6017 5962) Alignment (genomic DNA sequence = upper lines): TTAAGCATGC TATGGAAATT ATCCATCTGT TGACTGACCT AAACCCAATC CAAGTGATTG 6520 |||||||||| ||||| || || ||| ||| ||||||||| ||||||||| ||||| |||| TTAAGCATGC AATGGAGATC ATTCATTTGT TGACTGACCA AAACCCAATT CAAGTCATTG 60 TTGATGCTGT TATCAACAGG TTTAGAGATT ATTCTGATTT TTGCATATTT ATTAGCTCGA 6460 |||||||||| ||||||||| TTGATGCTGT TATCAACAG. .......... .......... .......... .......... 79 GTTTTTCTTG CTGAGGTCTT GTTAATTAGA AGATTTTCAT ACCATGTCTT CTTTGTTCCA 6400 .......... .......... .......... .......... .......... .......... 79 TTTCCATGTC GCGGCATACT TGAGATATTG TAGTCATTCT CATTTTTTCC TTCCCATATT 6340 .......... .......... .......... .......... .......... .......... 79 CTTACCTATG TGATGCAGTG GACCAAGAGA AGATGCAACT CGTATAGGTT CTGCTGGTGT 6280 || | ||||| || ||||||||| ||||| |||| |||||||||| .......... ........TG GGCCAAGGGA AGATGCAACA CGTATTGGTT CTGCTGGTGT 121 TGTGAGGCGA CAAGCTGTTG ATATTTCTCC ACTCCGTCGT GTCAACCAAG CAATATATCT 6220 ||| || || |||||||||| |||||||||| |||||||||| || ||||||| |||| ||| | TGTCAGACGT CAAGCTGTTG ATATTTCTCC ACTCCGTCGT GTTAACCAAG CAATTTATTT 181 CCTCACAACT GGTGCACGTG AGAGTGCTTT CAGGAACATC AAGACCATAG CAGAATGCCT 6160 || |||||| |||||||||| |||||||||| |||||||||| |||||||||| | || ||||| GCTGACAACT GGTGCACGTG AGAGTGCTTT CAGGAACATC AAGACCATAG CTGAGTGCCT 241 TGCAGATGAA CTCATTAATG CTGCCAAGGG ATCTTCCAAC AGGTAATCTT TTCTATTGCC 6100 ||| |||||| ||||| |||| |||||||||| ||||| || || TGCTGATGAA CTCATCAATG CTGCCAAGGG TTCTTCAAAT AG........ .......... 283 ATCTTTTTAC TCCTATATGC GTTTAATCCT TAATAAATGT AACTATATTC TCTGCCTACT 6040 .......... .......... .......... .......... .......... .......... 283 TATTCATTCT ATGTACGTGT AGCTATGCTA TCAAGAAGAA GGATGAGATT GAGAGGGTTG 5980 |||||||| | |||||||| ||| |||||| || ||||||| .......... .......... ..CTATGCTA TTAAGAAGAA GGACGAGATT GAAAGGGTTG 321 CCAAGGCCAA TCGTTGAG 5962 |||||||||| ||||| || CCAAGGCCAA TCGTTAAG 339 hqPGS_C06HBa0153O03.1-7-_SGN-E350734+ (6579 6501,6321 6118,6017 5962) ******************************************************************************** EST sequence 67 +strand 513 n (File: SGN-E348674+) 1 AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA AGTCATTGTT 61 GATGCTGTTA TCAACAGTGG GCCAAGGGAA GATGCAACAC GTATTGGTTC TGCTGGTGTT 121 GTCAGACGTC AAGCTGTTGA TATTTCTCCA CTCCGTCGTG TTAACCAAGC AATTTATTTG 181 CTGACAACTG GTGCACGTGA GAGTGCTTTC AGGAACATCA AGACCATAGC TGAGTGCCTT 241 GCTGATGAAC TCATCAATGC TGCCAAGGGT TCTTCAAATA GCTATGCTAT TAAGAAGAAG 301 GACGAGATTG AAAGGGTTGC CAAGGCCAAT CGTTAAGAGA TTGTTGTTGG AGCAACTTTT 361 TCGAGAGACT TTTTGGTTAT GTTATTTTCT CAGTTCTGTT TTCATGTAGG CATTATAGCA 421 TCTGCTACTC CTTATGGATT TAGTTTCTTG GAGGATTTAT GTTTGGTATT GTTATAAATG 481 TTAAATTTTG AAGTTCCTTT TANAAAAAAA AAA Predicted gene structure (within gDNA segment 7943 to 2729): Exon 1 6577 6501 ( 77 n); cDNA 1 77 ( 77 n); score: 0.896 Intron 1 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 2 6321 6118 ( 204 n); cDNA 78 281 ( 204 n); score: 0.907 Intron 2 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0.94) Exon 3 6017 5962 ( 56 n); cDNA 282 337 ( 56 n); score: 0.929 PPA cDNA 502 513 MATCH C06HBa0153O03.1-7- SGN-E348674+ 0.908 337 0.657 C PGS_C06HBa0153O03.1-7-_SGN-E348674+ (6577 6501,6321 6118,6017 5962) Alignment (genomic DNA sequence = upper lines): AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA AGTGATTGTT 6518 |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || ||| |||||| AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA AGTCATTGTT 60 GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT TAGCTCGAGT 6458 |||||||||| ||||||| GATGCTGTTA TCAACAG... .......... .......... .......... .......... 77 TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT TTGTTCCATT 6398 .......... .......... .......... .......... .......... .......... 77 TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT CCCATATTCT 6338 .......... .......... .......... .......... .......... .......... 77 TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT GCTGGTGTTG 6278 ||| ||||| |||| ||||||| || ||| |||||| |||||||||| .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT GCTGGTGTTG 121 TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT CAACCAAGCA ATATATCTCC 6218 | || || || |||||||||| |||||||||| |||||||||| ||||||||| || ||| | | TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT TAACCAAGCA ATTTATTTGC 181 TCACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCA GAATGCCTTG 6158 | |||||||| |||||||||| |||||||||| |||||||||| ||||||||| || ||||||| TGACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCT GAGTGCCTTG 241 CAGATGAACT CATTAATGCT GCCAAGGGAT CTTCCAACAG GTAATCTTTT CTATTGCCAT 6098 | |||||||| ||| |||||| |||||||| | |||| || || CTGATGAACT CATCAATGCT GCCAAGGGTT CTTCAAATAG .......... .......... 281 CTTTTTACTC CTATATGCGT TTAATCCTTA ATAAATGTAA CTATATTCTC TGCCTACTTA 6038 .......... .......... .......... .......... .......... .......... 281 TTCATTCTAT GTACGTGTAG CTATGCTATC AAGAAGAAGG ATGAGATTGA GAGGGTTGCC 5978 ||||||||| |||||||||| | |||||||| ||||||||| .......... .......... CTATGCTATT AAGAAGAAGG ACGAGATTGA AAGGGTTGCC 321 AAGGCCAATC GTTGAG 5962 |||||||||| ||| || AAGGCCAATC GTTAAG 337 hqPGS_C06HBa0153O03.1-7-_SGN-E348674+ (6577 6501,6321 6118,6017 5962) ******************************************************************************** EST sequence 22 -strand 495 n (File: SGN-E283583-) 1 AGATCATTCA TTTGTTGACT GACCAAACCC CAATTCAAGT CATTGTTGAT GCTGTTATCA 61 ACAGTGGGCC AAGGGAAGAT GCAACACGTA TTGGTTCTGC TGGTGTTGTC AGACGTCAAG 121 CTGTTGATAT TTCTCCACTC CGTCGTGTTA ACCAAGCAAT TTATTTGCTG ACAACTGGTG 181 CACGTGAGAG TGCTTTCAGG AACATCAAGA CCATAGCTGA GTGCCTTGCT GATGAACTCA 241 TCAATGCTGC CAAGGGTTCT TCAAATAGCT ATGCTATTAA GAAGAAGGAC GAGATTGAAA 301 GGGTTGCCAA GGCCAATCGT TAAGAGATTG TTGTTGGAGC AACTTTTTCG AGAGACTTTT 361 TGGTTATGTT ATTTTCTCAG TTCTGTTTTC ATGTAGGCAT TATAGCATCT GCTACTCCTT 421 ATGGATTTAG TTTCTTGGAG GATTTATGTT TGGTATTGTT ATAAATGTTA AATTTTGAAG 481 TTCCTTTAAA AAAAA Predicted gene structure (within gDNA segment 7823 to 2789): Exon 1 6562 6501 ( 62 n); cDNA 3 64 ( 62 n); score: 0.887 Intron 1 6500 6322 ( 179 n); Pd: 0.992 (s: 0.92), Pa: 0.999 (s: 0.88) Exon 2 6321 6118 ( 204 n); cDNA 65 268 ( 204 n); score: 0.907 Intron 2 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0.94) Exon 3 6017 5962 ( 56 n); cDNA 269 324 ( 56 n); score: 0.929 MATCH C06HBa0153O03.1-7- SGN-E283583- 0.907 322 0.651 C PGS_C06HBa0153O03.1-7-_SGN-E283583- (6562 6501,6321 6118,6017 5962) Alignment (genomic DNA sequence = upper lines): ATTATCCATC TGTTGACTGA CCTAAACCCA ATCCAAGTGA TTGTTGATGC TGTTATCAAC 6503 || || ||| |||||||||| || || |||| || ||||| | |||||||||| |||||||||| ATCATTCATT TGTTGACTGA CCAAACCCCA ATTCAAGTCA TTGTTGATGC TGTTATCAAC 62 AGGTTTAGAG ATTATTCTGA TTTTTGCATA TTTATTAGCT CGAGTTTTTC TTGCTGAGGT 6443 || AG........ .......... .......... .......... .......... .......... 64 CTTGTTAATT AGAAGATTTT CATACCATGT CTTCTTTGTT CCATTTCCAT GTCGCGGCAT 6383 .......... .......... .......... .......... .......... .......... 64 ACTTGAGATA TTGTAGTCAT TCTCATTTTT TCCTTCCCAT ATTCTTACCT ATGTGATGCA 6323 .......... .......... .......... .......... .......... .......... 64 GTGGACCAAG AGAAGATGCA ACTCGTATAG GTTCTGCTGG TGTTGTGAGG CGACAAGCTG 6263 ||| ||||| ||||||||| || ||||| | |||||||||| |||||| || || ||||||| .TGGGCCAAG GGAAGATGCA ACACGTATTG GTTCTGCTGG TGTTGTCAGA CGTCAAGCTG 123 TTGATATTTC TCCACTCCGT CGTGTCAACC AAGCAATATA TCTCCTCACA ACTGGTGCAC 6203 |||||||||| |||||||||| ||||| |||| ||||||| || | | || ||| |||||||||| TTGATATTTC TCCACTCCGT CGTGTTAACC AAGCAATTTA TTTGCTGACA ACTGGTGCAC 183 GTGAGAGTGC TTTCAGGAAC ATCAAGACCA TAGCAGAATG CCTTGCAGAT GAACTCATTA 6143 |||||||||| |||||||||| |||||||||| |||| || || |||||| ||| |||||||| | GTGAGAGTGC TTTCAGGAAC ATCAAGACCA TAGCTGAGTG CCTTGCTGAT GAACTCATCA 243 ATGCTGCCAA GGGATCTTCC AACAGGTAAT CTTTTCTATT GCCATCTTTT TACTCCTATA 6083 |||||||||| ||| ||||| || || ATGCTGCCAA GGGTTCTTCA AATAG..... .......... .......... .......... 268 TGCGTTTAAT CCTTAATAAA TGTAACTATA TTCTCTGCCT ACTTATTCAT TCTATGTACG 6023 .......... .......... .......... .......... .......... .......... 268 TGTAGCTATG CTATCAAGAA GAAGGATGAG ATTGAGAGGG TTGCCAAGGC CAATCGTTGA 5963 ||||| |||| ||||| |||||| ||| ||||| |||| |||||||||| |||||||| | .....CTATG CTATTAAGAA GAAGGACGAG ATTGAAAGGG TTGCCAAGGC CAATCGTTAA 323 G 5962 | G 324 hqPGS_C06HBa0153O03.1-7-_SGN-E283583- (6562 6501,6321 6118,6017 5962) ******************************************************************************** EST sequence 86 +strand 477 n (File: SGN-E272853+) 1 GATCATTCAT TTGTTGACTG ACCAAAACCC AATTCAAGTC ATTGTTGATG CTGTTATCAA 61 CAGTGGGCCA AGGGAAGATG CAACACGTAT TGGTTCTGCT GGTGTTGTCA GACGTCAAGC 121 TGTTGATATT TCTCCACTCC GTCGTGTTAA CCAAGCAATT TATTTGCTGA CAACTGGTGC 181 ACGTGAGAGT GCTTTCAGGA ACATCAAGAC CATAGCTGAG TGCCTTGCTG ATGAACTCAT 241 CAATGCTGCC AAGGGTTCTT CAAATAGCTA TGCTATTAAG AAGAAGGACG AGATTGAAAG 301 GGTTGCCAAG GCCAATCGTT AAGAGATTGT TGTTGGAGCA ACTTTTTCGA GAGACTNTTT 361 GGTTATGTTA TTTTCTCAGT TCTGTTTTCA TGTAGGCATT ATAGCATCTG CTACTCCTTA 421 TGGATTTAGT TTCTTGGAGG AATTTATGTT GGTATTGTTA TAAATGTTAA ATTTTGA Predicted gene structure (within gDNA segment 7803 to 2949): Exon 1 6562 6501 ( 62 n); cDNA 2 63 ( 62 n); score: 0.903 Intron 1 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 2 6321 6118 ( 204 n); cDNA 64 267 ( 204 n); score: 0.907 Intron 2 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0.94) Exon 3 6017 5962 ( 56 n); cDNA 268 323 ( 56 n); score: 0.929 MATCH C06HBa0153O03.1-7- SGN-E272853+ 0.910 322 0.675 C PGS_C06HBa0153O03.1-7-_SGN-E272853+ (6562 6501,6321 6118,6017 5962) Alignment (genomic DNA sequence = upper lines): ATTATCCATC TGTTGACTGA CCTAAACCCA ATCCAAGTGA TTGTTGATGC TGTTATCAAC 6503 || || ||| |||||||||| || ||||||| || ||||| | |||||||||| |||||||||| ATCATTCATT TGTTGACTGA CCAAAACCCA ATTCAAGTCA TTGTTGATGC TGTTATCAAC 61 AGGTTTAGAG ATTATTCTGA TTTTTGCATA TTTATTAGCT CGAGTTTTTC TTGCTGAGGT 6443 || AG........ .......... .......... .......... .......... .......... 63 CTTGTTAATT AGAAGATTTT CATACCATGT CTTCTTTGTT CCATTTCCAT GTCGCGGCAT 6383 .......... .......... .......... .......... .......... .......... 63 ACTTGAGATA TTGTAGTCAT TCTCATTTTT TCCTTCCCAT ATTCTTACCT ATGTGATGCA 6323 .......... .......... .......... .......... .......... .......... 63 GTGGACCAAG AGAAGATGCA ACTCGTATAG GTTCTGCTGG TGTTGTGAGG CGACAAGCTG 6263 ||| ||||| ||||||||| || ||||| | |||||||||| |||||| || || ||||||| .TGGGCCAAG GGAAGATGCA ACACGTATTG GTTCTGCTGG TGTTGTCAGA CGTCAAGCTG 122 TTGATATTTC TCCACTCCGT CGTGTCAACC AAGCAATATA TCTCCTCACA ACTGGTGCAC 6203 |||||||||| |||||||||| ||||| |||| ||||||| || | | || ||| |||||||||| TTGATATTTC TCCACTCCGT CGTGTTAACC AAGCAATTTA TTTGCTGACA ACTGGTGCAC 182 GTGAGAGTGC TTTCAGGAAC ATCAAGACCA TAGCAGAATG CCTTGCAGAT GAACTCATTA 6143 |||||||||| |||||||||| |||||||||| |||| || || |||||| ||| |||||||| | GTGAGAGTGC TTTCAGGAAC ATCAAGACCA TAGCTGAGTG CCTTGCTGAT GAACTCATCA 242 ATGCTGCCAA GGGATCTTCC AACAGGTAAT CTTTTCTATT GCCATCTTTT TACTCCTATA 6083 |||||||||| ||| ||||| || || ATGCTGCCAA GGGTTCTTCA AATAG..... .......... .......... .......... 267 TGCGTTTAAT CCTTAATAAA TGTAACTATA TTCTCTGCCT ACTTATTCAT TCTATGTACG 6023 .......... .......... .......... .......... .......... .......... 267 TGTAGCTATG CTATCAAGAA GAAGGATGAG ATTGAGAGGG TTGCCAAGGC CAATCGTTGA 5963 ||||| |||| ||||| |||||| ||| ||||| |||| |||||||||| |||||||| | .....CTATG CTATTAAGAA GAAGGACGAG ATTGAAAGGG TTGCCAAGGC CAATCGTTAA 322 G 5962 | G 323 hqPGS_C06HBa0153O03.1-7-_SGN-E272853+ (6562 6501,6321 6118,6017 5962) ******************************************************************************** EST sequence 10 -strand 500 n (File: SGN-E539243-) 1 TTCAAGTCAT TGTTGATGCT GTTATCAACA GTGGGCCAAG GGAAGATGCA ACACGTATTG 61 GTTCTGCTGG TGTTGTCAGA CGTCAAGCTG TTGATATTTC TCCACTCCGT CGTGTTAACC 121 AAGCAATTTA TTTGCTGACA ACTGGTGCAC GTGAGAGTGC TTTCAGGAAC ATCAAGACCA 181 TAGCTGAGTG CCTTGCTGAT GAACTCATCA ATGCTGCCAA GGGTTCTTCA AATAGCTATG 241 CTATTAAGAA GAAGGACGAG ATTGAAAGGG TTGCCAAGGC CAATCGTTAA GAGATTGTTG 301 TTGGAGCAAC TTTTTCGAGA GACTTTTTGG TTATGTTATT TTCTCAGTTC TGTTTTCATG 361 TAGGCATTAT AGCATCTGCT ACTCCTTATG GATTTAGTTT CTTGGAGGAT TTATGTTTGG 421 TATTGTTATA AATGTTAAAT TTTGAAGTTC CTTTATTCGG GTTCTCAGTA GAGTTTCGTT 481 AAACACGGTA AAAAAGAAAA Predicted gene structure (within gDNA segment 7493 to 2409): Exon 1 6529 6501 ( 29 n); cDNA 3 31 ( 29 n); score: 0.966 Intron 1 6500 6322 ( 179 n); Pd: 0.992 (s: 0), Pa: 0.999 (s: 0.88) Exon 2 6321 6118 ( 204 n); cDNA 32 235 ( 204 n); score: 0.907 Intron 2 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0.94) Exon 3 6017 5962 ( 56 n); cDNA 236 291 ( 56 n); score: 0.929 PPA cDNA 490 500 MATCH C06HBa0153O03.1-7- SGN-E539243- 0.912 289 0.578 C PGS_C06HBa0153O03.1-7-_SGN-E539243- (6529 6501,6321 6118,6017 5962) Alignment (genomic DNA sequence = upper lines): CAAGTGATTG TTGATGCTGT TATCAACAGG TTTAGAGATT ATTCTGATTT TTGCATATTT 6470 ||||| |||| |||||||||| ||||||||| CAAGTCATTG TTGATGCTGT TATCAACAG. .......... .......... .......... 31 ATTAGCTCGA GTTTTTCTTG CTGAGGTCTT GTTAATTAGA AGATTTTCAT ACCATGTCTT 6410 .......... .......... .......... .......... .......... .......... 31 CTTTGTTCCA TTTCCATGTC GCGGCATACT TGAGATATTG TAGTCATTCT CATTTTTTCC 6350 .......... .......... .......... .......... .......... .......... 31 TTCCCATATT CTTACCTATG TGATGCAGTG GACCAAGAGA AGATGCAACT CGTATAGGTT 6290 || | ||||| || ||||||||| ||||| |||| .......... .......... ........TG GGCCAAGGGA AGATGCAACA CGTATTGGTT 63 CTGCTGGTGT TGTGAGGCGA CAAGCTGTTG ATATTTCTCC ACTCCGTCGT GTCAACCAAG 6230 |||||||||| ||| || || |||||||||| |||||||||| |||||||||| || ||||||| CTGCTGGTGT TGTCAGACGT CAAGCTGTTG ATATTTCTCC ACTCCGTCGT GTTAACCAAG 123 CAATATATCT CCTCACAACT GGTGCACGTG AGAGTGCTTT CAGGAACATC AAGACCATAG 6170 |||| ||| | || |||||| |||||||||| |||||||||| |||||||||| |||||||||| CAATTTATTT GCTGACAACT GGTGCACGTG AGAGTGCTTT CAGGAACATC AAGACCATAG 183 CAGAATGCCT TGCAGATGAA CTCATTAATG CTGCCAAGGG ATCTTCCAAC AGGTAATCTT 6110 | || ||||| ||| |||||| ||||| |||| |||||||||| ||||| || || CTGAGTGCCT TGCTGATGAA CTCATCAATG CTGCCAAGGG TTCTTCAAAT AG........ 235 TTCTATTGCC ATCTTTTTAC TCCTATATGC GTTTAATCCT TAATAAATGT AACTATATTC 6050 .......... .......... .......... .......... .......... .......... 235 TCTGCCTACT TATTCATTCT ATGTACGTGT AGCTATGCTA TCAAGAAGAA GGATGAGATT 5990 |||||||| | |||||||| ||| |||||| .......... .......... .......... ..CTATGCTA TTAAGAAGAA GGACGAGATT 263 GAGAGGGTTG CCAAGGCCAA TCGTTGAG 5962 || ||||||| |||||||||| ||||| || GAAAGGGTTG CCAAGGCCAA TCGTTAAG 291 hqPGS_C06HBa0153O03.1-7-_SGN-E539243- (6529 6501,6321 6118,6017 5962) ******************************************************************************** EST sequence 21 -strand 510 n (File: SGN-E244303-) 1 AGTCATTGTT GATGCTGTTA TCAACAGTGG GCCAAGGGAA GATGCAACAC GTATTGGTTC 61 TGCTGGTGTT GTCAGACGTC AAGCTGTTGA TATTTCTCCA CTCCGTCGTG TTAACCAAGC 121 AATTTATTTG CTGACAACTG GTGCACGTGA GAGTGCTTTC AGGAACATCA AGACCATAGC 181 TGAGTGCCTT GCTGATGAAC TCATCAATGC TGCCAAGGGT TCTTCAAATA GCTATGCTAT 241 TAAGAAGAAG GACGAGATTG AAAGGGTTGC CAAGGCCAAT CGTTAAGAGA TTGTTGTTGG 301 AGCAACTTTT TCGAGAGACT TTTTGGTTAT GTTATTTTCT CAGTTCTGTT TTCATGTAGG 361 CATTATAGCA TCTGCTACTC CTTATGGATT TAGTTTCTTG GAGGATTTAT GTTTGGTATT 421 GTTATAAATG TTAAATTTTG AAGTTCCTTT ATTCGGGTTC TCAGTAGAGT TTCGTTAAAC 481 ACGGTATTTT GTGATTTTCT TTAAAAAAAA Predicted gene structure (within gDNA segment 7453 to 2269): Exon 1 6527 6501 ( 27 n); cDNA 1 27 ( 27 n); score: 0.963 Intron 1 6500 6322 ( 179 n); Pd: 0.992 (s: 0), Pa: 0.999 (s: 0.88) Exon 2 6321 6118 ( 204 n); cDNA 28 231 ( 204 n); score: 0.907 Intron 2 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0.94) Exon 3 6017 5962 ( 56 n); cDNA 232 287 ( 56 n); score: 0.929 MATCH C06HBa0153O03.1-7- SGN-E244303- 0.912 287 0.563 C PGS_C06HBa0153O03.1-7-_SGN-E244303- (6527 6501,6321 6118,6017 5962) Alignment (genomic DNA sequence = upper lines): AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 27 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 27 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 27 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 61 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT CAACCAAGCA 6228 |||||||||| | || || || |||||||||| |||||||||| |||||||||| ||||||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT TAACCAAGCA 121 ATATATCTCC TCACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCA 6168 || ||| | | | |||||||| |||||||||| |||||||||| |||||||||| ||||||||| ATTTATTTGC TGACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCT 181 GAATGCCTTG CAGATGAACT CATTAATGCT GCCAAGGGAT CTTCCAACAG GTAATCTTTT 6108 || ||||||| | |||||||| ||| |||||| |||||||| | |||| || || GAGTGCCTTG CTGATGAACT CATCAATGCT GCCAAGGGTT CTTCAAATAG .......... 231 CTATTGCCAT CTTTTTACTC CTATATGCGT TTAATCCTTA ATAAATGTAA CTATATTCTC 6048 .......... .......... .......... .......... .......... .......... 231 TGCCTACTTA TTCATTCTAT GTACGTGTAG CTATGCTATC AAGAAGAAGG ATGAGATTGA 5988 ||||||||| |||||||||| | |||||||| .......... .......... .......... CTATGCTATT AAGAAGAAGG ACGAGATTGA 261 GAGGGTTGCC AAGGCCAATC GTTGAG 5962 ||||||||| |||||||||| ||| || AAGGGTTGCC AAGGCCAATC GTTAAG 287 hqPGS_C06HBa0153O03.1-7-_SGN-E244303- (6527 6501,6321 6118,6017 5962) ******************************************************************************** EST sequence 16 -strand 469 n (File: SGN-E226355-) 1 GTCATTGTTG ATGCTGTTAT CAACAGTGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 61 GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT TAACCAAGCA 121 ATTTATTTGC TGACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCT 181 GAGTGCCTTG CTGATGAACT CATCAATGCT GCCAAGGGTT CTTCAAATAG CTATGCTATT 241 AAGAAGAAGG ACGAGATTGA AAGGGTTGCC AAGGCCAATC GTTAAGAGAT TGTTGTTGGA 301 GCAACTTTTT CGAGAGACTT TTTGGTTATG TTATTTTCTC AGTTCTGTTT TCATGTAGGC 361 ATTATAGCAT CTGCTACTCC TTATGGATTT AGTTTCTTGG AGGATTTATG TTTGGTATTG 421 TTATAAATGT TAAATTTTGA AGTTCCTTTT AAAAAAAAAA AAAAAAAAA Predicted gene structure (within gDNA segment 7443 to 2669): Exon 1 6526 6501 ( 26 n); cDNA 1 26 ( 26 n); score: 0.962 Intron 1 6500 6322 ( 179 n); Pd: 0.992 (s: 0), Pa: 0.999 (s: 0.88) Exon 2 6321 6118 ( 204 n); cDNA 27 230 ( 204 n); score: 0.907 Intron 2 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0.94) Exon 3 6017 5962 ( 56 n); cDNA 231 286 ( 56 n); score: 0.929 PPA cDNA 451 469 MATCH C06HBa0153O03.1-7- SGN-E226355- 0.912 286 0.610 C PGS_C06HBa0153O03.1-7-_SGN-E226355- (6526 6501,6321 6118,6017 5962) Alignment (genomic DNA sequence = upper lines): GTGATTGTTG ATGCTGTTAT CAACAGGTTT AGAGATTATT CTGATTTTTG CATATTTATT 6467 || ||||||| |||||||||| |||||| GTCATTGTTG ATGCTGTTAT CAACAG.... .......... .......... .......... 26 AGCTCGAGTT TTTCTTGCTG AGGTCTTGTT AATTAGAAGA TTTTCATACC ATGTCTTCTT 6407 .......... .......... .......... .......... .......... .......... 26 TGTTCCATTT CCATGTCGCG GCATACTTGA GATATTGTAG TCATTCTCAT TTTTTCCTTC 6347 .......... .......... .......... .......... .......... .......... 26 CCATATTCTT ACCTATGTGA TGCAGTGGAC CAAGAGAAGA TGCAACTCGT ATAGGTTCTG 6287 ||| | |||| ||||| |||||| ||| || ||||||| .......... .......... .....TGGGC CAAGGGAAGA TGCAACACGT ATTGGTTCTG 61 CTGGTGTTGT GAGGCGACAA GCTGTTGATA TTTCTCCACT CCGTCGTGTC AACCAAGCAA 6227 |||||||||| || || ||| |||||||||| |||||||||| ||||||||| |||||||||| CTGGTGTTGT CAGACGTCAA GCTGTTGATA TTTCTCCACT CCGTCGTGTT AACCAAGCAA 121 TATATCTCCT CACAACTGGT GCACGTGAGA GTGCTTTCAG GAACATCAAG ACCATAGCAG 6167 | ||| | || ||||||||| |||||||||| |||||||||| |||||||||| |||||||| | TTTATTTGCT GACAACTGGT GCACGTGAGA GTGCTTTCAG GAACATCAAG ACCATAGCTG 181 AATGCCTTGC AGATGAACTC ATTAATGCTG CCAAGGGATC TTCCAACAGG TAATCTTTTC 6107 | |||||||| ||||||||| || ||||||| ||||||| || ||| || || AGTGCCTTGC TGATGAACTC ATCAATGCTG CCAAGGGTTC TTCAAATAG. .......... 230 TATTGCCATC TTTTTACTCC TATATGCGTT TAATCCTTAA TAAATGTAAC TATATTCTCT 6047 .......... .......... .......... .......... .......... .......... 230 GCCTACTTAT TCATTCTATG TACGTGTAGC TATGCTATCA AGAAGAAGGA TGAGATTGAG 5987 | |||||||| | |||||||||| |||||||| .......... .......... .........C TATGCTATTA AGAAGAAGGA CGAGATTGAA 261 AGGGTTGCCA AGGCCAATCG TTGAG 5962 |||||||||| |||||||||| || || AGGGTTGCCA AGGCCAATCG TTAAG 286 hqPGS_C06HBa0153O03.1-7-_SGN-E226355- (6526 6501,6321 6118,6017 5962) ******************************************************************************** EST sequence 39 +strand 516 n (File: SGN-E348200+) 1 ATTGATGGCT TTTCTTCCTC ATTTTTTGAT CATTTAGTGG GCCAAGGGAA GATGCAACAC 61 GTATTGGTTC TGCTGGTGTT GTCAGACGTC AAGCTGTTGA TATTTCTCCA CTCCGTCGTG 121 TTAACCAAGC AATTTATTTG CTGACAACTG GTGCACGTGA GAGTGCTTTC AGGAACATCA 181 AGACCATAGC TGAGTGCCTT GCTGATGAAC TCATCAATGC TGCCAAGGGT TCTTCAAATA 241 GCTATGCTAT TAAGAAGAAG GACGAGATTG AAAGGGTTGC CAAGGCCAAT CGTTAAGAGA 301 TTGTTGTTGG AGCAACTTTT TCGAGAGACT TTTTGGTTAT GTTATTTTCT CAGTTCTGTT 361 TTCATGTAGG CATTATAGCA TCTGCTACTC CTTATGGATT TAGTTTCTTG GAGGATTTAT 421 GTTTGGTATT GTTATAAATG TTAAATTTTG AAGTTCCTTT NNNAAAAAAA AAAANAAAAA 481 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA Predicted gene structure (within gDNA segment 7543 to 2299): Exon 1 6321 6118 ( 204 n); cDNA 38 241 ( 204 n); score: 0.907 Intron 1 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0.94) Exon 2 6017 5962 ( 56 n); cDNA 242 297 ( 56 n); score: 0.929 PPA cDNA 466 516 MATCH C06HBa0153O03.1-7- SGN-E348200+ 0.912 260 0.504 C PGS_C06HBa0153O03.1-7-_SGN-E348200+ (6321 6118,6017 5962) Alignment (genomic DNA sequence = upper lines): TGGACCAAGA GAAGATGCAA CTCGTATAGG TTCTGCTGGT GTTGTGAGGC GACAAGCTGT 6262 ||| ||||| |||||||||| | ||||| || |||||||||| ||||| || | | |||||||| TGGGCCAAGG GAAGATGCAA CACGTATTGG TTCTGCTGGT GTTGTCAGAC GTCAAGCTGT 97 TGATATTTCT CCACTCCGTC GTGTCAACCA AGCAATATAT CTCCTCACAA CTGGTGCACG 6202 |||||||||| |||||||||| |||| ||||| |||||| ||| | || |||| |||||||||| TGATATTTCT CCACTCCGTC GTGTTAACCA AGCAATTTAT TTGCTGACAA CTGGTGCACG 157 TGAGAGTGCT TTCAGGAACA TCAAGACCAT AGCAGAATGC CTTGCAGATG AACTCATTAA 6142 |||||||||| |||||||||| |||||||||| ||| || ||| ||||| |||| ||||||| || TGAGAGTGCT TTCAGGAACA TCAAGACCAT AGCTGAGTGC CTTGCTGATG AACTCATCAA 217 TGCTGCCAAG GGATCTTCCA ACAGGTAATC TTTTCTATTG CCATCTTTTT ACTCCTATAT 6082 |||||||||| || ||||| | | || TGCTGCCAAG GGTTCTTCAA ATAG...... .......... .......... .......... 241 GCGTTTAATC CTTAATAAAT GTAACTATAT TCTCTGCCTA CTTATTCATT CTATGTACGT 6022 .......... .......... .......... .......... .......... .......... 241 GTAGCTATGC TATCAAGAAG AAGGATGAGA TTGAGAGGGT TGCCAAGGCC AATCGTTGAG 5962 |||||| ||| |||||| ||||| |||| |||| ||||| |||||||||| ||||||| || ....CTATGC TATTAAGAAG AAGGACGAGA TTGAAAGGGT TGCCAAGGCC AATCGTTAAG 297 hqPGS_C06HBa0153O03.1-7-_SGN-E348200+ (6321 6118,6017 5962) ******************************************************************************** EST sequence 20 -strand 438 n (File: SGN-E229056-) 1 CAAGGGAAGA TGCAACACGT ATTGTTCTGC TGGTGTTGTC AGACGTCAAG CTGTTGATAT 61 TTCTCCACTC CGTCGTGTTA ACCAAGCAAT TTATTTGCTG ACAACTGGTG CACGTGAGAG 121 TGCTTTCAGG AACATCAAGA CCATAGCTGA GTGCCTTGCT GATGAACTCA TCAATGCTGC 181 CAAGGGTTCT TCAAATAGCT ATGCTATTAA GAAGAAGGAC GAGATTGAAA GGGTTGCCAA 241 GGCCAATCGT TAAGAGATTG TTGTTGGAGC AACTTTTTCG AGAGACTTTT TGGTTATGTT 301 ATTTTCTCAG TTCTGTTTTC ATGTAGGCAT TATAGCATCT GCTACTCCTT ATGGATTTAG 361 TTTCTTGGAG GATTTATGTT TGGTATTGTT ATAAATGTTA AATTTTGAAG TTCCTTTAAA 421 AAAAAAAAAA AAAAAAAA Predicted gene structure (within gDNA segment 7132 to 2659): Exon 1 6316 6118 ( 199 n); cDNA 1 198 ( 198 n); score: 0.905 Intron 1 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0.94) Exon 2 6017 5962 ( 56 n); cDNA 199 254 ( 56 n); score: 0.929 PPA cDNA 418 438 MATCH C06HBa0153O03.1-7- SGN-E229056- 0.910 255 0.582 C PGS_C06HBa0153O03.1-7-_SGN-E229056- (6316 6118,6017 5962) Alignment (genomic DNA sequence = upper lines): CAAGAGAAGA TGCAACTCGT ATAGGTTCTG CTGGTGTTGT GAGGCGACAA GCTGTTGATA 6257 |||| ||||| |||||| ||| || |||||| |||||||||| || || ||| |||||||||| CAAGGGAAGA TGCAACACGT AT-TGTTCTG CTGGTGTTGT CAGACGTCAA GCTGTTGATA 59 TTTCTCCACT CCGTCGTGTC AACCAAGCAA TATATCTCCT CACAACTGGT GCACGTGAGA 6197 |||||||||| ||||||||| |||||||||| | ||| | || ||||||||| |||||||||| TTTCTCCACT CCGTCGTGTT AACCAAGCAA TTTATTTGCT GACAACTGGT GCACGTGAGA 119 GTGCTTTCAG GAACATCAAG ACCATAGCAG AATGCCTTGC AGATGAACTC ATTAATGCTG 6137 |||||||||| |||||||||| |||||||| | | |||||||| ||||||||| || ||||||| GTGCTTTCAG GAACATCAAG ACCATAGCTG AGTGCCTTGC TGATGAACTC ATCAATGCTG 179 CCAAGGGATC TTCCAACAGG TAATCTTTTC TATTGCCATC TTTTTACTCC TATATGCGTT 6077 ||||||| || ||| || || CCAAGGGTTC TTCAAATAG. .......... .......... .......... .......... 198 TAATCCTTAA TAAATGTAAC TATATTCTCT GCCTACTTAT TCATTCTATG TACGTGTAGC 6017 | .......... .......... .......... .......... .......... .........C 199 TATGCTATCA AGAAGAAGGA TGAGATTGAG AGGGTTGCCA AGGCCAATCG TTGAG 5962 |||||||| | |||||||||| |||||||| |||||||||| |||||||||| || || TATGCTATTA AGAAGAAGGA CGAGATTGAA AGGGTTGCCA AGGCCAATCG TTAAG 254 hqPGS_C06HBa0153O03.1-7-_SGN-E229056- (6316 6118,6017 5962) ******************************************************************************** EST sequence 17 -strand 420 n (File: SGN-E226752-) 1 GTATTGGTTC TGCTGGTGTT GTCAGACGTC AAGCTGTTGA TATTTCTCCA CTCCGTCGTG 61 TTAACCAAGC AATTTATTTG CTGACAACTG GTGCACGTGA GAGTGCTTTC AGGAACATCA 121 AGACCATAGC TGAGTGCCTT GCTGATGAAC TCATCAATGC TGCCAAGGGT TCTTCAAATA 181 GCTATGCTAT TAAGAAGAAG GACGAGATTG AAAGGGTTGC CAAGGCCAAT CGTTAAGAGA 241 TTGTTGTTGG AGCAACTTTT TCGAGAGACT TTTTGGTTAT GTTATTTTCT CAGTTCTGTT 301 TTCATGTAGG CATTATAGCA TCTGCTACTC CTTATGGATT TAGTTTCTTG GAGGATTTAT 361 GTTTGGTATT GTTATAAATG TTAAATTTTG AAGTTCCTTT TAAAAAAAAA AAAAAAAAAA Predicted gene structure (within gDNA segment 6953 to 2669): Exon 1 6298 6118 ( 181 n); cDNA 1 181 ( 181 n); score: 0.912 Intron 1 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0.94) Exon 2 6017 5962 ( 56 n); cDNA 182 237 ( 56 n); score: 0.929 PPA cDNA 402 420 MATCH C06HBa0153O03.1-7- SGN-E226752- 0.916 237 0.564 C PGS_C06HBa0153O03.1-7-_SGN-E226752- (6298 6118,6017 5962) Alignment (genomic DNA sequence = upper lines): GTATAGGTTC TGCTGGTGTT GTGAGGCGAC AAGCTGTTGA TATTTCTCCA CTCCGTCGTG 6239 |||| ||||| |||||||||| || || || | |||||||||| |||||||||| |||||||||| GTATTGGTTC TGCTGGTGTT GTCAGACGTC AAGCTGTTGA TATTTCTCCA CTCCGTCGTG 60 TCAACCAAGC AATATATCTC CTCACAACTG GTGCACGTGA GAGTGCTTTC AGGAACATCA 6179 | |||||||| ||| ||| | || ||||||| |||||||||| |||||||||| |||||||||| TTAACCAAGC AATTTATTTG CTGACAACTG GTGCACGTGA GAGTGCTTTC AGGAACATCA 120 AGACCATAGC AGAATGCCTT GCAGATGAAC TCATTAATGC TGCCAAGGGA TCTTCCAACA 6119 |||||||||| || |||||| || ||||||| |||| ||||| ||||||||| ||||| || | AGACCATAGC TGAGTGCCTT GCTGATGAAC TCATCAATGC TGCCAAGGGT TCTTCAAATA 180 GGTAATCTTT TCTATTGCCA TCTTTTTACT CCTATATGCG TTTAATCCTT AATAAATGTA 6059 | G......... .......... .......... .......... .......... .......... 181 ACTATATTCT CTGCCTACTT ATTCATTCTA TGTACGTGTA GCTATGCTAT CAAGAAGAAG 5999 ||||||||| ||||||||| .......... .......... .......... .......... .CTATGCTAT TAAGAAGAAG 200 GATGAGATTG AGAGGGTTGC CAAGGCCAAT CGTTGAG 5962 || ||||||| | |||||||| |||||||||| |||| || GACGAGATTG AAAGGGTTGC CAAGGCCAAT CGTTAAG 237 hqPGS_C06HBa0153O03.1-7-_SGN-E226752- (6298 6118,6017 5962) ******************************************************************************** EST sequence 48 +strand 409 n (File: SGN-E294720+) 1 TCTGCTGGTG TTGTCAGACG TCAAGCTGTT GATATTTCTC CACTCCGTCG TGTTAACCAA 61 GCAATTTATT TGCTGACAAC TGGTGCACGT GAGAGTGCTT TCAGGAACAT CAAGACCATA 121 GCTGAGTGCC TTGCTGATGA ACTCATCAAT GCTGCCAAGG GTTCTTCAAA TAGCTATGCT 181 ATTAAGAAGA AGGACGAGAT TGAAAGGGTT GCCAAGGCCA ATCGTTAAGA GATTGTTGTT 241 GGAGCAACTT TTTCGAGAGA CTTTTTGGTT ATGTTATTTT CTCAGTTCTG TTTTCATGTA 301 GGCATTATAG CATCTGCTAC TCCTTATGGA TTTAGTTTCT TGGAGGATTT ATGTTTGGTA 361 TTGTTATAAA TGTTAAATTT TGAAGTTCCT TTNANAAAAA AAAAAAAAA Predicted gene structure (within gDNA segment 7079 to 2689): Exon 1 6290 6118 ( 173 n); cDNA 1 173 ( 173 n); score: 0.913 Intron 1 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0.94) Exon 2 6017 5962 ( 56 n); cDNA 174 229 ( 56 n); score: 0.929 PPA cDNA 394 409 MATCH C06HBa0153O03.1-7- SGN-E294720+ 0.917 229 0.560 C PGS_C06HBa0153O03.1-7-_SGN-E294720+ (6290 6118,6017 5962) Alignment (genomic DNA sequence = upper lines): TCTGCTGGTG TTGTGAGGCG ACAAGCTGTT GATATTTCTC CACTCCGTCG TGTCAACCAA 6231 |||||||||| |||| || || ||||||||| |||||||||| |||||||||| ||| |||||| TCTGCTGGTG TTGTCAGACG TCAAGCTGTT GATATTTCTC CACTCCGTCG TGTTAACCAA 60 GCAATATATC TCCTCACAAC TGGTGCACGT GAGAGTGCTT TCAGGAACAT CAAGACCATA 6171 ||||| ||| | || ||||| |||||||||| |||||||||| |||||||||| |||||||||| GCAATTTATT TGCTGACAAC TGGTGCACGT GAGAGTGCTT TCAGGAACAT CAAGACCATA 120 GCAGAATGCC TTGCAGATGA ACTCATTAAT GCTGCCAAGG GATCTTCCAA CAGGTAATCT 6111 || || |||| |||| ||||| |||||| ||| |||||||||| | ||||| || || GCTGAGTGCC TTGCTGATGA ACTCATCAAT GCTGCCAAGG GTTCTTCAAA TAG....... 173 TTTCTATTGC CATCTTTTTA CTCCTATATG CGTTTAATCC TTAATAAATG TAACTATATT 6051 .......... .......... .......... .......... .......... .......... 173 CTCTGCCTAC TTATTCATTC TATGTACGTG TAGCTATGCT ATCAAGAAGA AGGATGAGAT 5991 ||||||| || ||||||| |||| ||||| .......... .......... .......... ...CTATGCT ATTAAGAAGA AGGACGAGAT 200 TGAGAGGGTT GCCAAGGCCA ATCGTTGAG 5962 ||| |||||| |||||||||| |||||| || TGAAAGGGTT GCCAAGGCCA ATCGTTAAG 229 hqPGS_C06HBa0153O03.1-7-_SGN-E294720+ (6290 6118,6017 5962) ******************************************************************************** EST sequence 49 +strand 409 n (File: SGN-E294721+) 1 TCTGCTGGTG TTGTCAGACG TCAAGCTGTT GATATTTCTC CACTCCGTCG TGTTAACCAA 61 GCAATTTATT TGCTGACAAC TGGTGCACGT GAGAGTGCTT TCAGGAACAT CAAGACCATA 121 GCTGAGTGCC TTGCTGATGA ACTCATCAAT GCTGCCAAGG GTTCTTCAAA TAGCTATGCT 181 ATTAAGAAGA AGGACGAGAT TGAAAGGGTT GCCAAGGCCA ATCGTTAAGA GATTGTTGTT 241 GGAGCAACTT TTTCGAGAGA CTTTTTGGTT ATGTTATTTT CTCAGTTCTG TTTTCATGTA 301 GGCATTATAG CATCTGCTAC TCCTTATGGA TTTAGTTTCT TGGAGGATTT ATGTTTGGTA 361 TTGTTATAAA TGTTAAATTT TGAAGTTCCT TTNANAAAAA AAAAAAAAA Predicted gene structure (within gDNA segment 7079 to 2689): Exon 1 6290 6118 ( 173 n); cDNA 1 173 ( 173 n); score: 0.913 Intron 1 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0.94) Exon 2 6017 5962 ( 56 n); cDNA 174 229 ( 56 n); score: 0.929 PPA cDNA 394 409 MATCH C06HBa0153O03.1-7- SGN-E294721+ 0.917 229 0.560 C PGS_C06HBa0153O03.1-7-_SGN-E294721+ (6290 6118,6017 5962) Alignment (genomic DNA sequence = upper lines): TCTGCTGGTG TTGTGAGGCG ACAAGCTGTT GATATTTCTC CACTCCGTCG TGTCAACCAA 6231 |||||||||| |||| || || ||||||||| |||||||||| |||||||||| ||| |||||| TCTGCTGGTG TTGTCAGACG TCAAGCTGTT GATATTTCTC CACTCCGTCG TGTTAACCAA 60 GCAATATATC TCCTCACAAC TGGTGCACGT GAGAGTGCTT TCAGGAACAT CAAGACCATA 6171 ||||| ||| | || ||||| |||||||||| |||||||||| |||||||||| |||||||||| GCAATTTATT TGCTGACAAC TGGTGCACGT GAGAGTGCTT TCAGGAACAT CAAGACCATA 120 GCAGAATGCC TTGCAGATGA ACTCATTAAT GCTGCCAAGG GATCTTCCAA CAGGTAATCT 6111 || || |||| |||| ||||| |||||| ||| |||||||||| | ||||| || || GCTGAGTGCC TTGCTGATGA ACTCATCAAT GCTGCCAAGG GTTCTTCAAA TAG....... 173 TTTCTATTGC CATCTTTTTA CTCCTATATG CGTTTAATCC TTAATAAATG TAACTATATT 6051 .......... .......... .......... .......... .......... .......... 173 CTCTGCCTAC TTATTCATTC TATGTACGTG TAGCTATGCT ATCAAGAAGA AGGATGAGAT 5991 ||||||| || ||||||| |||| ||||| .......... .......... .......... ...CTATGCT ATTAAGAAGA AGGACGAGAT 200 TGAGAGGGTT GCCAAGGCCA ATCGTTGAG 5962 ||| |||||| |||||||||| |||||| || TGAAAGGGTT GCCAAGGCCA ATCGTTAAG 229 hqPGS_C06HBa0153O03.1-7-_SGN-E294721+ (6290 6118,6017 5962) ******************************************************************************** EST sequence 43 +strand 698 n (File: SGN-E348271+) 1 ACACTTCTCC CCGGAAGGTG AATTAGAGCA GGCAAGAGAA GTAGAAGAAG AAATGGACGC 61 AGGTGTAGTT GCTGCCCCCG CCCCGGCCGC CGCCGTCGAT GCAAGCAAAG AGAATAAGGT 121 TCACACTGAT GTCATGCTTT TCAATCGCTG GAGCTATGAT GGAGTTGAGA TCAATGACAT 181 GTCTGTTGAG GATTACATCA CCGCAACTGC TAACAAGCAC CCAGTTTACA TGCCACACAC 241 AGCTGGTAGA TACCAGGCCA AGCGTTTCAG GAAGGCTCAG TGCCCAATCG TTGAGAGGCT 301 CACAAATTCT CTCATGATGC ACGGAAGGAA CAACGGAAAG AAGCTCATGG CTGTTCGTAT 361 TATTAAGCAT GCAATGGAGA TCATTCATTT GTTGACTGAC CAAAACCCAA TTCAAGTCAT 421 TGTTGATGCT GTTATCAACA GTGGGCCAAG GGAAGATGCA ACACGTATTG GTTCTGCTGG 481 TGTTGTCAGA CGTCAAGCTG TTGATATTTC TCCACTCCGT CGTGTTAACC AAGCAATTTA 541 TTTGCTGACA ACTGGTGCAC GTGAGAGTGC TTTCAGGAAC ATCAAGACCA TAGCTGAGTG 601 CCTTGCTGAT GAACTCATCA ATGCTGCCAA AGGTTCTTCA AATAGCTATG CTATTAAGAA 661 GAAGGACGAG ATTGAAAGGG TTGCCAAGGC CAATCGTT Predicted gene structure (within gDNA segment 8193 to 4519): Exon 1 7967 7914 ( 54 n); cDNA 116 169 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 170 441 ( 272 n); score: 0.842 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6118 ( 204 n); cDNA 442 645 ( 204 n); score: 0.902 Intron 3 6117 6018 ( 100 n); Pd: 1.000 (s: 0.86), Pa: 1.000 (s: 0.94) Exon 4 6017 5965 ( 53 n); cDNA 646 698 ( 53 n); score: 0.943 MATCH C06HBa0153O03.1-7- SGN-E348271+ 0.868 583 0.835 C PGS_C06HBa0153O03.1-7-_SGN-E348271+ (7967 7914,6772 6501,6321 6118,6017 5965) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 169 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 169 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 169 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 169 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 169 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 169 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 169 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 169 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 169 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 169 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 169 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 169 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 169 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 169 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 169 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 169 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 169 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 169 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 169 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 174 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 234 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 |||||||||| || || |||| | |||||||| || || ||| ||||| |||| |||| || || ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 294 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 354 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 414 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 441 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 441 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 441 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 475 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT CAACCAAGCA 6228 |||||||||| | || || || |||||||||| |||||||||| |||||||||| ||||||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT TAACCAAGCA 535 ATATATCTCC TCACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCA 6168 || ||| | | | |||||||| |||||||||| |||||||||| |||||||||| ||||||||| ATTTATTTGC TGACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCT 595 GAATGCCTTG CAGATGAACT CATTAATGCT GCCAAGGGAT CTTCCAACAG GTAATCTTTT 6108 || ||||||| | |||||||| ||| |||||| ||||| || | |||| || || GAGTGCCTTG CTGATGAACT CATCAATGCT GCCAAAGGTT CTTCAAATAG .......... 645 CTATTGCCAT CTTTTTACTC CTATATGCGT TTAATCCTTA ATAAATGTAA CTATATTCTC 6048 .......... .......... .......... .......... .......... .......... 645 TGCCTACTTA TTCATTCTAT GTACGTGTAG CTATGCTATC AAGAAGAAGG ATGAGATTGA 5988 ||||||||| |||||||||| | |||||||| .......... .......... .......... CTATGCTATT AAGAAGAAGG ACGAGATTGA 675 GAGGGTTGCC AAGGCCAATC GTT 5965 ||||||||| |||||||||| ||| AAGGGTTGCC AAGGCCAATC GTT 698 hqPGS_C06HBa0153O03.1-7-_SGN-E348271+ (7967 7914,6772 6501,6321 6118,6017 5965) ******************************************************************************** EST sequence 91 +strand 709 n (File: SGN-E342444+) 1 AGCAGAAAGA CACTTCTCCC CGGAAGGTGA ATTAGAGCAG GCAAGAGAAG TAGAAGAAGA 61 AATGGACGCA GGTGTAGTTG CTGCCCCCGC CCCGGCCGCC GCCGTCGATG CAAGCAAAGA 121 GAATAAGGTT CACACTGATG TCATGCTTTT CAATCGCTGG AGCTATGATG GAGTTGAGAT 181 CAATGACATG TCTGTTGAGG ATTACATCAC CGCAACTGCT AACAAGCACC CAGTTTACAT 241 GCCACACACA GCTGGTAGAT ACCAGGCCAA GCGTTTCAGG AAGGCTCAGT GCCCAATCGT 301 TGAGAGGCTC ACAAATTCTC TCATGATGCA CGGAAGGAAC AACGGAAAGA AGCTCATGGC 361 TGTTCGTATT ATTAAGCATG CAATGGAGAT CATTCATTTG TTGACTGACC AAAACCCAAT 421 TCAAGTCATT GTTGATGCTG TTATCAACAG TGGGCCAAGG GAAGATGCAA CACGTATTGG 481 TTCTGCTGGT GTTGTCAGAC GTCAAGCTGT TGATATTTCT CCACTCCGTC GTGTTAACCA 541 AGCAATTTAT TTGCTGACAA CTGGTGCACG TGAGAGTGCT TTCAGGAACA TCAAGACCAT 601 AGCTGAGTGC CTTGCTGATG AACTCATCAA TGCTGCCAAG GGGTTCTTCA AATAGCTATG 661 CTATTAAGAA GAAGGACGAG ATTGAAAGGG TTGCCAAGGC CAATCGTTA Predicted gene structure (within gDNA segment 8193 to 4499): Exon 1 7967 7914 ( 54 n); cDNA 125 178 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 179 450 ( 272 n); score: 0.842 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6118 ( 204 n); cDNA 451 655 ( 205 n); score: 0.895 Intron 3 6117 6018 ( 100 n); Pd: 1.000 (s: 0.83), Pa: 1.000 (s: 0.94) Exon 4 6017 5965 ( 53 n); cDNA 656 708 ( 53 n); score: 0.943 MATCH C06HBa0153O03.1-7- SGN-E342444+ 0.865 583 0.822 C PGS_C06HBa0153O03.1-7-_SGN-E342444+ (7967 7914,6772 6501,6321 6118,6017 5965) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 178 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 178 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 178 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 178 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 178 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 178 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 178 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 178 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 178 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 178 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 178 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 178 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 178 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 178 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 178 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 178 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 178 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 178 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 178 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 183 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 243 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 |||||||||| || || |||| | |||||||| || || ||| ||||| |||| |||| || || ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 303 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 363 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 423 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 450 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 450 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 450 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 484 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT CAACCAAGCA 6228 |||||||||| | || || || |||||||||| |||||||||| |||||||||| ||||||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT TAACCAAGCA 544 ATATATCTCC TCACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCA 6168 || ||| | | | |||||||| |||||||||| |||||||||| |||||||||| ||||||||| ATTTATTTGC TGACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCT 604 GAATGCCTTG CAGATGAACT CATTAATGCT GCCAA-GGGA TCTTCCAACA GGTAATCTTT 6109 || ||||||| | |||||||| ||| |||||| ||||| ||| ||||| || | | GAGTGCCTTG CTGATGAACT CATCAATGCT GCCAAGGGGT TCTTCAAATA G......... 655 TCTATTGCCA TCTTTTTACT CCTATATGCG TTTAATCCTT AATAAATGTA ACTATATTCT 6049 .......... .......... .......... .......... .......... .......... 655 CTGCCTACTT ATTCATTCTA TGTACGTGTA GCTATGCTAT CAAGAAGAAG GATGAGATTG 5989 ||||||||| ||||||||| || ||||||| .......... .......... .......... .CTATGCTAT TAAGAAGAAG GACGAGATTG 684 AGAGGGTTGC CAAGGCCAAT CGTT 5965 | |||||||| |||||||||| |||| AAAGGGTTGC CAAGGCCAAT CGTT 708 hqPGS_C06HBa0153O03.1-7-_SGN-E342444+ (7967 7914,6772 6501,6321 6118,6017 5965) ******************************************************************************** EST sequence 13 -strand 523 n (File: SGN-E393920-) 1 CCAAAACCCA ATTCAAGTCA TTGTTGATGC TGTTATCAAC AGTGGGCCAA GGGAAGATGC 61 AACACGTATT GGTTCTGCTG GTGTTGTCAG ACGTCAAGCT GTTGATATTT TTCCATTCCG 121 TCGTGTTAAC CAAGCAATTT ATTTGCTGAC AACTGGTGCA CGTGAGAGTG CTTTCAGGAA 181 CATCAAGACC ATAGCTGAGT GCCTTGCTGA TGAATTCATC AATGCTGCCA AGGGTTCTTC 241 AAATAGTTAT GTTATTAAGA AGAAGGACGA GATTGAAAGG GTTGCCAAGG CCAATCGTTA 301 ATAGATTGTT GTTGGAGCAA CTTTTTCGAG AGACTTTTTG GTTATGTTAT TTTTTCAGTT 361 CTGTTTTCAT GTAGGCATTA TAGCATTTGT TACTCCTTAT GGATTTAGTT TCTTGGAGGA 421 TTTATGTTTG GTATTGTTAT AAATGTTAAA TTTTGAAGTT CCTTTATTCG GGTTTTCAGT 481 AGAGTTTCGT TAACCAAAAA AAAAAAAAAA AAAAAAAAAA AAA Predicted gene structure (within gDNA segment 7603 to 2289): Exon 1 6542 6501 ( 42 n); cDNA 1 42 ( 42 n); score: 0.929 Intron 1 6500 6322 ( 179 n); Pd: 0.992 (s: 0.93), Pa: 0.999 (s: 0.88) Exon 2 6321 6118 ( 204 n); cDNA 43 246 ( 204 n); score: 0.892 Intron 2 6117 6018 ( 100 n); Pd: 1.000 (s: 0.86), Pa: 1.000 (s: 0.90) Exon 3 6017 5965 ( 53 n); cDNA 247 299 ( 53 n); score: 0.906 PPA cDNA 496 523 MATCH C06HBa0153O03.1-7- SGN-E393920- 0.895 299 0.572 C PGS_C06HBa0153O03.1-7-_SGN-E393920- (6542 6501,6321 6118,6017 5965) Alignment (genomic DNA sequence = upper lines): CCTAAACCCA ATCCAAGTGA TTGTTGATGC TGTTATCAAC AGGTTTAGAG ATTATTCTGA 6483 || ||||||| || ||||| | |||||||||| |||||||||| || CCAAAACCCA ATTCAAGTCA TTGTTGATGC TGTTATCAAC AG........ .......... 42 TTTTTGCATA TTTATTAGCT CGAGTTTTTC TTGCTGAGGT CTTGTTAATT AGAAGATTTT 6423 .......... .......... .......... .......... .......... .......... 42 CATACCATGT CTTCTTTGTT CCATTTCCAT GTCGCGGCAT ACTTGAGATA TTGTAGTCAT 6363 .......... .......... .......... .......... .......... .......... 42 TCTCATTTTT TCCTTCCCAT ATTCTTACCT ATGTGATGCA GTGGACCAAG AGAAGATGCA 6303 ||| ||||| ||||||||| .......... .......... .......... .......... .TGGGCCAAG GGAAGATGCA 61 ACTCGTATAG GTTCTGCTGG TGTTGTGAGG CGACAAGCTG TTGATATTTC TCCACTCCGT 6243 || ||||| | |||||||||| |||||| || || ||||||| ||||||||| |||| ||||| ACACGTATTG GTTCTGCTGG TGTTGTCAGA CGTCAAGCTG TTGATATTTT TCCATTCCGT 121 CGTGTCAACC AAGCAATATA TCTCCTCACA ACTGGTGCAC GTGAGAGTGC TTTCAGGAAC 6183 ||||| |||| ||||||| || | | || ||| |||||||||| |||||||||| |||||||||| CGTGTTAACC AAGCAATTTA TTTGCTGACA ACTGGTGCAC GTGAGAGTGC TTTCAGGAAC 181 ATCAAGACCA TAGCAGAATG CCTTGCAGAT GAACTCATTA ATGCTGCCAA GGGATCTTCC 6123 |||||||||| |||| || || |||||| ||| ||| |||| | |||||||||| ||| ||||| ATCAAGACCA TAGCTGAGTG CCTTGCTGAT GAATTCATCA ATGCTGCCAA GGGTTCTTCA 241 AACAGGTAAT CTTTTCTATT GCCATCTTTT TACTCCTATA TGCGTTTAAT CCTTAATAAA 6063 || || AATAG..... .......... .......... .......... .......... .......... 246 TGTAACTATA TTCTCTGCCT ACTTATTCAT TCTATGTACG TGTAGCTATG CTATCAAGAA 6003 |||| ||| ||||| .......... .......... .......... .......... .....TTATG TTATTAAGAA 261 GAAGGATGAG ATTGAGAGGG TTGCCAAGGC CAATCGTT 5965 |||||| ||| ||||| |||| |||||||||| |||||||| GAAGGACGAG ATTGAAAGGG TTGCCAAGGC CAATCGTT 299 hqPGS_C06HBa0153O03.1-7-_SGN-E393920- (6542 6501,6321 6118,6017 5965) ******************************************************************************** EST sequence 105 +strand 710 n (File: SGN-E335041+) 1 ATACAGCAGA AAGACACTTC TCCCCGGAAG GTGAATTAGA GCAGGCAAGA GAAGTAGAAG 61 AAGAAATGGA CGCAGGTGTA GTTGCTGCCC CCGCCCCGGC CGCCGCCGTC GATGCAAGCA 121 AAGAGAATAA GGTTCACACT GATGTCATGC TTTTCAATCG CTGGAGCTAT GATGGAGTTG 181 AGATCAATGA CATGTCTGTT GAGGATTACA TCACCGCAAC TGCTAACAAG CACCCAGTTT 241 ACATGCCACA CACAGCTGGT AGATACCAGG CCAAGCGTTT CAGGAAGGCT CAGTGCCCAA 301 TCGTTGAGAG GCTCACAAAT TCTCTCATGA TGCACGGAAG GAACAACGGA AAGAAGCTCA 361 TGGCTGTTCG TATTATTAAG CATGCAATGG AGATCATTCA TTTGTTGACT GACCAAAACC 421 CAATTCAAGT CATTGTTGAT GCTGTTATCA ACAGTGGGCC AAGGGAAGAT GCAACACGTA 481 TTGGTTCTGC TGGTGTTGTC AGACGTCAAG CTGTTGATAT TTCTCCACTC CGTCGTGTTA 541 ACCAAGCAAT TTATTTGCTG ACAACTGGTG CACGTGAGAG TGCTTTCAGG AACATCAAGA 601 CCATAGCTGA GTGCCTTGCT GATGAACTCA TCAATGCTGC CAAGGGTTCT TCAAATAGCT 661 ATGCTATTAA GAAGAAAGAC GAGATTGAAA GGGTTGCCAA GGCCAATCGT Predicted gene structure (within gDNA segment 8193 to 4529): Exon 1 7967 7914 ( 54 n); cDNA 129 182 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 183 454 ( 272 n); score: 0.842 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6118 ( 204 n); cDNA 455 658 ( 204 n); score: 0.907 Intron 3 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0.92) Exon 4 6017 5966 ( 52 n); cDNA 659 710 ( 52 n); score: 0.923 MATCH C06HBa0153O03.1-7- SGN-E335041+ 0.868 582 0.820 C PGS_C06HBa0153O03.1-7-_SGN-E335041+ (7967 7914,6772 6501,6321 6118,6017 5966) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 182 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 182 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 182 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 182 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 182 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 182 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 182 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 182 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 182 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 182 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 182 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 182 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 182 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 182 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 182 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 182 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 182 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 182 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 182 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 187 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 247 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 |||||||||| || || |||| | |||||||| || || ||| ||||| |||| |||| || || ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 307 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 367 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 427 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 454 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 454 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 454 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 488 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT CAACCAAGCA 6228 |||||||||| | || || || |||||||||| |||||||||| |||||||||| ||||||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT TAACCAAGCA 548 ATATATCTCC TCACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCA 6168 || ||| | | | |||||||| |||||||||| |||||||||| |||||||||| ||||||||| ATTTATTTGC TGACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCT 608 GAATGCCTTG CAGATGAACT CATTAATGCT GCCAAGGGAT CTTCCAACAG GTAATCTTTT 6108 || ||||||| | |||||||| ||| |||||| |||||||| | |||| || || GAGTGCCTTG CTGATGAACT CATCAATGCT GCCAAGGGTT CTTCAAATAG .......... 658 CTATTGCCAT CTTTTTACTC CTATATGCGT TTAATCCTTA ATAAATGTAA CTATATTCTC 6048 .......... .......... .......... .......... .......... .......... 658 TGCCTACTTA TTCATTCTAT GTACGTGTAG CTATGCTATC AAGAAGAAGG ATGAGATTGA 5988 ||||||||| |||||||| | | |||||||| .......... .......... .......... CTATGCTATT AAGAAGAAAG ACGAGATTGA 688 GAGGGTTGCC AAGGCCAATC GT 5966 ||||||||| |||||||||| || AAGGGTTGCC AAGGCCAATC GT 710 hqPGS_C06HBa0153O03.1-7-_SGN-E335041+ (7967 7914,6772 6501,6321 6118,6017 5966) ******************************************************************************** EST sequence 63 +strand 518 n (File: SGN-E251833+) 1 GACATGTCTG TTGATGATTA CATCACCGCA ACTGCTAACA AGCACCCAGT TTACATGCCA 61 CACACAGCTG GTAGATACCA TGCCAAGCGT TTCAGGAAGG CTCAGTGCCC AATCGTTGAG 121 AGGCTCACAA ATTCTCTCAT GATGCACGGA AGGAACAACG GAAAGAAGCT CATGGCTGTT 181 CGTATTATTA AGCATGCAAT GGAGATCATT CATTTGTTGA CTGACCAAAA CCCAATTCAA 241 GTCATTGTTG ATGCTGTTAT CAACAGTGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 301 GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT TAACCAAGCA 361 ATTTATTTGC TGACAACTGG TGCACGTGAG AGTGCTTTCA AGAAACATAA AACCATAACT 421 GAGTGCCTTG CTGATGAACT CATCAATGCT GCCAAGGGTT CTTCAAATAG CTATGCTATT 481 AAGAAGAAGG ACGAGATTGA AAGGGTTGCC AAGGCCAA Predicted gene structure (within gDNA segment 8193 to 3948): Exon 1 6766 6501 ( 266 n); cDNA 1 266 ( 266 n); score: 0.846 Intron 1 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 2 6321 6118 ( 204 n); cDNA 267 470 ( 204 n); score: 0.875 Intron 2 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0.94) Exon 3 6017 5970 ( 48 n); cDNA 471 518 ( 48 n); score: 0.938 MATCH C06HBa0153O03.1-7- SGN-E251833+ 0.859 518 1.000 C PGS_C06HBa0153O03.1-7-_SGN-E251833+ (6766 6501,6321 6118,6017 5970) Alignment (genomic DNA sequence = upper lines): GATATTTCTG TTGAGGATTA CATAACTGCT ACTGCTAACA AGCATCCTAC ATATACACCA 6707 || || |||| |||| ||||| ||| || || |||||||||| |||| || || | ||| GACATGTCTG TTGATGATTA CATCACCGCA ACTGCTAACA AGCACCCAGT TTACATGCCA 60 CACACAGCTG GGAGGTACCA AGCCAAGCGG TTTAGAAAGG CTCAATGCCC AATTGTGGAG 6647 |||||||||| | || ||||| |||||||| || || |||| |||| ||||| ||| || ||| CACACAGCTG GTAGATACCA TGCCAAGCGT TTCAGGAAGG CTCAGTGCCC AATCGTTGAG 120 AGGTTGACCA ACTCACTGAT GATGCACGGA AGGAACAACG GGAAGAAGTT GATGGCCGTT 6587 ||| | || | | || || || |||||||||| |||||||||| | |||||| | ||||| ||| AGGCTCACAA ATTCTCTCAT GATGCACGGA AGGAACAACG GAAAGAAGCT CATGGCTGTT 180 CGTATTATTA AGCATGCTAT GGAAATTATC CATCTGTTGA CTGACCTAAA CCCAATCCAA 6527 |||||||||| ||||||| || ||| || || ||| |||||| |||||| ||| |||||| ||| CGTATTATTA AGCATGCAAT GGAGATCATT CATTTGTTGA CTGACCAAAA CCCAATTCAA 240 GTGATTGTTG ATGCTGTTAT CAACAGGTTT AGAGATTATT CTGATTTTTG CATATTTATT 6467 || ||||||| |||||||||| |||||| GTCATTGTTG ATGCTGTTAT CAACAG.... .......... .......... .......... 266 AGCTCGAGTT TTTCTTGCTG AGGTCTTGTT AATTAGAAGA TTTTCATACC ATGTCTTCTT 6407 .......... .......... .......... .......... .......... .......... 266 TGTTCCATTT CCATGTCGCG GCATACTTGA GATATTGTAG TCATTCTCAT TTTTTCCTTC 6347 .......... .......... .......... .......... .......... .......... 266 CCATATTCTT ACCTATGTGA TGCAGTGGAC CAAGAGAAGA TGCAACTCGT ATAGGTTCTG 6287 ||| | |||| ||||| |||||| ||| || ||||||| .......... .......... .....TGGGC CAAGGGAAGA TGCAACACGT ATTGGTTCTG 301 CTGGTGTTGT GAGGCGACAA GCTGTTGATA TTTCTCCACT CCGTCGTGTC AACCAAGCAA 6227 |||||||||| || || ||| |||||||||| |||||||||| ||||||||| |||||||||| CTGGTGTTGT CAGACGTCAA GCTGTTGATA TTTCTCCACT CCGTCGTGTT AACCAAGCAA 361 TATATCTCCT CACAACTGGT GCACGTGAGA GTGCTTTC-A GGAACATCAA GACCATAGCA 6168 | ||| | || ||||||||| |||||||||| |||||||| | | ||||| || |||||| | TTTATTTGCT GACAACTGGT GCACGTGAGA GTGCTTTCAA GAAACAT-AA AACCATAACT 420 GAATGCCTTG CAGATGAACT CATTAATGCT GCCAAGGGAT CTTCCAACAG GTAATCTTTT 6108 || ||||||| | |||||||| ||| |||||| |||||||| | |||| || || GAGTGCCTTG CTGATGAACT CATCAATGCT GCCAAGGGTT CTTCAAATAG .......... 470 CTATTGCCAT CTTTTTACTC CTATATGCGT TTAATCCTTA ATAAATGTAA CTATATTCTC 6048 .......... .......... .......... .......... .......... .......... 470 TGCCTACTTA TTCATTCTAT GTACGTGTAG CTATGCTATC AAGAAGAAGG ATGAGATTGA 5988 ||||||||| |||||||||| | |||||||| .......... .......... .......... CTATGCTATT AAGAAGAAGG ACGAGATTGA 500 GAGGGTTGCC AAGGCCAA 5970 ||||||||| |||||||| AAGGGTTGCC AAGGCCAA 518 hqPGS_C06HBa0153O03.1-7-_SGN-E251833+ (6766 6501,6321 6118,6017 5970) ******************************************************************************** EST sequence 122 +strand 702 n (File: SGN-E334390+) 1 CAGCATACAG CAGAAAGACA CTTCTCCCCG GAAGGTGAAT TAGAGCAGGC AAGAGAAGTA 61 GAAGAAGAAA TGGACGCAGG TGTAGTTGCT GCCCCCGCCC CGGCCGCCGC CGTCGATGCA 121 AGCAAAGAGA ATAAGGTTCA CACTGATGTC ATGCTTTTCA ATCGCTGGAG CTATGATGGA 181 GTTGAGATCA ATGACATGTC TGTTGAGGAT TACATCACCG CAACTGCTAA CAAGCACCCA 241 GTTTACATGC CACACACAGC TGGTAGATAC CAGGCCAAGC GTTTCAGGAA GGCTCAGTGC 301 CCAATCGTTG AGAGGCTCAC AAATTCTCTC ATGATGCACG GAAGGAACAA CGGAAAGAAG 361 CTCATGGCTG TTCGTATTAT TAAGCATGCA ATGGAGATCA TTCATTTGTT GACTGACCAA 421 AACCCAATTC AAGTCATTGT TGATGCTGTT ATCAACAGTG GGCCAAGGGA AGATGCAACA 481 CGTATTGGTT CTGCTGGTGT TGTCAGACGT CAAGCTGTTG ATATTTCTCC ACTCCGTCGT 541 GTTAACCAAG CAATTTATTT GCTGACAACT GGTGCACGTG AGAGTGCTTT CAGGGACATC 601 AAGACCATAG CTGAGTGCCT TGCTGATGAA CTCATCAATG CTGNCANGGG TTCTTCAAAT 661 AGCTATGCTA TTAAGAAGAA GGACGAGAAT TGAAGGGTTG CC Predicted gene structure (within gDNA segment 8193 to 4649): Exon 1 7967 7914 ( 54 n); cDNA 133 186 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 187 458 ( 272 n); score: 0.842 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6118 ( 204 n); cDNA 459 662 ( 204 n); score: 0.892 Intron 3 6117 6018 ( 100 n); Pd: 1.000 (s: 0.84), Pa: 1.000 (s: 0.85) Exon 4 6017 5978 ( 40 n); cDNA 663 702 ( 40 n); score: 0.850 MATCH C06HBa0153O03.1-7- SGN-E334390+ 0.857 570 0.812 C PGS_C06HBa0153O03.1-7-_SGN-E334390+ (7967 7914,6772 6501,6321 6118,6017 5978) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 186 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 186 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 186 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 186 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 186 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 186 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 186 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 186 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 186 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 186 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 186 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 186 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 186 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 186 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 186 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 186 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 186 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 186 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 186 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 191 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 251 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 |||||||||| || || |||| | |||||||| || || ||| ||||| |||| |||| || || ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 311 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 371 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 431 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 458 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 458 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 458 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 492 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT CAACCAAGCA 6228 |||||||||| | || || || |||||||||| |||||||||| |||||||||| ||||||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT TAACCAAGCA 552 ATATATCTCC TCACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCA 6168 || ||| | | | |||||||| |||||||||| |||||||||| || ||||||| ||||||||| ATTTATTTGC TGACAACTGG TGCACGTGAG AGTGCTTTCA GGGACATCAA GACCATAGCT 612 GAATGCCTTG CAGATGAACT CATTAATGCT GCCAAGGGAT CTTCCAACAG GTAATCTTTT 6108 || ||||||| | |||||||| ||| |||||| | || ||| | |||| || || GAGTGCCTTG CTGATGAACT CATCAATGCT GNCANGGGTT CTTCAAATAG .......... 662 CTATTGCCAT CTTTTTACTC CTATATGCGT TTAATCCTTA ATAAATGTAA CTATATTCTC 6048 .......... .......... .......... .......... .......... .......... 662 TGCCTACTTA TTCATTCTAT GTACGTGTAG CTATGCTATC AAGAAGAAGG ATGAGATTGA 5988 ||||||||| |||||||||| | |||| | .......... .......... .......... CTATGCTATT AAGAAGAAGG ACGAGAATTG 692 GAGGGTTGCC 5978 ||||||||| AAGGGTTGCC 702 hqPGS_C06HBa0153O03.1-7-_SGN-E334390+ (7967 7914,6772 6501,6321 6118,6017 5978) ******************************************************************************** EST sequence 133 +strand 692 n (File: SGN-E346329+) 1 GCAGAAAGAC ACTTCTCCCC GGAAGGTGAA TTAGAGCAGG CAAGAGAAGT AGAAGAAGAA 61 ATGGACGCAG GTGTAGTTGC TGCCCCCGCC CCGGCCGCCG CCGTCGATGC AAGCAAAGAG 121 AATAAGGTTC ACACTGATGT CATGCTTTTC AATCGCTGGA GCTATGATGG AGTTGAGATC 181 AATGACATGT CTGTTGAGGA TTACATCACC GCAACTGCTA ACAAGCACCC AGTTTACATG 241 CCACACACAG CTGGTAGATA CCAGGCCAAG CGTTTCAGGA AGGCTCAGTG CCCAATCGTT 301 GAGAGGCTCA CAAATTCTCT CATGATGCAC GGAAGGAACA ACGGAAAGAA GCTCATGGCT 361 GTTCGTATTA TTAAGCATGC AATGGAGATC ATTCATTTGT TGACTGACCA AAACCCAATT 421 CAAGTCATTG TTGATGCTGT TATCAACAGT GGGCCAAGGG AAGATGCAAC ACGTATTGGT 481 TCTGCTGGTG TTGTCAGACG TCAAGCTGTT GATATTTCTC CACTCCGTCG TGTTAACCAA 541 GCAATTTATT TGCTGACAAC TGGTGCACGT GAGAGTGCTT TCAGGAACAT CAAGACCATA 601 GCTGAGTGCC TTGCTGATGA ACTCATCAAT GCTGCCAAGG GTTCTTCAAA TAGCTATGCT 661 ATTAAGAAGA AGGACGAGAT TGAAAGGGTT GC Predicted gene structure (within gDNA segment 8193 to 4659): Exon 1 7967 7914 ( 54 n); cDNA 124 177 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 178 449 ( 272 n); score: 0.842 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6118 ( 204 n); cDNA 450 653 ( 204 n); score: 0.907 Intron 3 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0) Exon 4 6017 5979 ( 39 n); cDNA 654 692 ( 39 n); score: 0.923 MATCH C06HBa0153O03.1-7- SGN-E346329+ 0.862 569 0.822 C PGS_C06HBa0153O03.1-7-_SGN-E346329+ (7967 7914,6772 6501,6321 6118,6017 5979) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 177 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 177 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 177 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 177 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 177 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 177 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 177 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 177 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 177 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 177 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 177 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 177 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 177 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 177 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 177 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 177 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 177 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 177 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 177 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 182 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 242 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 |||||||||| || || |||| | |||||||| || || ||| ||||| |||| |||| || || ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 302 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 362 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 422 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 449 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 449 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 449 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 483 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT CAACCAAGCA 6228 |||||||||| | || || || |||||||||| |||||||||| |||||||||| ||||||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT TAACCAAGCA 543 ATATATCTCC TCACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCA 6168 || ||| | | | |||||||| |||||||||| |||||||||| |||||||||| ||||||||| ATTTATTTGC TGACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCT 603 GAATGCCTTG CAGATGAACT CATTAATGCT GCCAAGGGAT CTTCCAACAG GTAATCTTTT 6108 || ||||||| | |||||||| ||| |||||| |||||||| | |||| || || GAGTGCCTTG CTGATGAACT CATCAATGCT GCCAAGGGTT CTTCAAATAG .......... 653 CTATTGCCAT CTTTTTACTC CTATATGCGT TTAATCCTTA ATAAATGTAA CTATATTCTC 6048 .......... .......... .......... .......... .......... .......... 653 TGCCTACTTA TTCATTCTAT GTACGTGTAG CTATGCTATC AAGAAGAAGG ATGAGATTGA 5988 ||||||||| |||||||||| | |||||||| .......... .......... .......... CTATGCTATT AAGAAGAAGG ACGAGATTGA 683 GAGGGTTGC 5979 |||||||| AAGGGTTGC 692 hqPGS_C06HBa0153O03.1-7-_SGN-E346329+ (7967 7914,6772 6501,6321 6118,6017 5979) ******************************************************************************** EST sequence 136 +strand 669 n (File: SGN-E345879+) 1 GCAGAAAGAC ACTTCTCCCC GGAAGGTGAA TTAGAGCAGG CAAGAGAAGT AGAAGAAGAA 61 ATGGACGCAG GTGTAGTTGC TGCCCCCGCC CCGGCCGCCG CCGTCGATGC AAGCAAAGAG 121 AATAAGGTTC ACACTGATGT CATGCTTTTC AATCGCTGGA GCTATGATGG AGTTGAGATC 181 AATGACATGT CTGTTGAGGA TTACATCACC GCAACTGCTA ACAAGCACCC AGTTTACATG 241 CCACACACAG CTGGTAGATA CCAGGCCAAG CGTTTCAGGA AGGCTCAGTG CCCAATCGTT 301 GAGAGGCTCA CAAATTCTCT CATGATGCAC GGAAGGAACA ACGGAAAGAA GCTCATGGCT 361 GTTCGTATTA TTAAGCATGC AATGGAGATC ATTCATTTGT TGACTGACCA AAACCCAATT 421 CAAGTCATTG TTGATGCTGT TATCAACAGT GGGCCAAGGG AAGATGCAAC ACGTATTGGT 481 TCTGCTGGTG TTGTCAGACG TCAAGCTGTT GATATTTCTC CACTCCNGTC GTGTTAACCA 541 AGCAATTTAT TTGCTGACAA CTGGGTGCAC GTGAGAGTGC TTTCAGGAAC ATCAAGACCA 601 TAGCTGAGTG CCTTGCTGAT GAACTCATCA ATGCTGCCAA GGGGTTCTTC AAATAGCTAT 661 GCTATTAAG Predicted gene structure (within gDNA segment 8193 to 4909): Exon 1 7967 7914 ( 54 n); cDNA 124 177 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 178 449 ( 272 n); score: 0.842 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6118 ( 204 n); cDNA 450 656 ( 207 n); score: 0.870 Intron 3 6117 6018 ( 100 n); Pd: 1.000 (s: 0.83), Pa: 1.000 (s: 0) Exon 4 6017 6005 ( 13 n); cDNA 657 669 ( 13 n); score: 0.923 MATCH C06HBa0153O03.1-7- SGN-E345879+ 0.848 543 0.812 C PGS_C06HBa0153O03.1-7-_SGN-E345879+ (7967 7914,6772 6501,6321 6118,6017 6005) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 177 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 177 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 177 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 177 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 177 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 177 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 177 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 177 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 177 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 177 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 177 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 177 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 177 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 177 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 177 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 177 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 177 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 177 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 177 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 182 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 242 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 |||||||||| || || |||| | |||||||| || || ||| ||||| |||| |||| || || ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 302 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 362 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 422 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 449 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 449 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 449 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 483 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCC-GTCGTG TCAACCAAGC 6229 |||||||||| | || || || |||||||||| |||||||||| ||| |||||| | |||||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCNGTCGTG TTAACCAAGC 543 AATATATCTC CTCACAACT- GGTGCACGTG AGAGTGCTTT CAGGAACATC AAGACCATAG 6170 ||| ||| | || |||||| |||||||||| |||||||||| |||||||||| |||||||||| AATTTATTTG CTGACAACTG GGTGCACGTG AGAGTGCTTT CAGGAACATC AAGACCATAG 603 CAGAATGCCT TGCAGATGAA CTCATTAATG CTGCCAA-GG GATCTTCCAA CAGGTAATCT 6111 | || ||||| ||| |||||| ||||| |||| ||||||| || | ||||| || || CTGAGTGCCT TGCTGATGAA CTCATCAATG CTGCCAAGGG GTTCTTCAAA TAG....... 656 TTTCTATTGC CATCTTTTTA CTCCTATATG CGTTTAATCC TTAATAAATG TAACTATATT 6051 .......... .......... .......... .......... .......... .......... 656 CTCTGCCTAC TTATTCATTC TATGTACGTG TAGCTATGCT ATCAAG 6005 ||||||| || ||| .......... .......... .......... ...CTATGCT ATTAAG 669 hqPGS_C06HBa0153O03.1-7-_SGN-E345879+ (7967 7914,6772 6501,6321 6118,6017 6005) ******************************************************************************** EST sequence 90 +strand 674 n (File: SGN-E328566+) 1 AGCAGCATAC AGCAGAAAGA CACTTCTCCC CGGAAGGTGA ATTAGAGCAG GCAAGAGAAG 61 TAGAAGAAGA AATGGACGCA GGTGTAGTTG CTGCCCCCGC CCCGGCCGCC GCCGTCGATG 121 CAAGCAAAGA GAATAAGGTT CACACTGATG TCATGCTTTT CAATCGCTGG AGCTATGATG 181 GAGTTGAGAT CAATGACATG TCTGTTGAGG ATTACATCAC CGCAACTGCT AACAAGCACC 241 CAGTTTACAT GCCACACACA GCTGGTAGAT ACCAGGCCAA GCGTTTCAGG AAGGCTCAGT 301 GCCCAATCGT TGAGAGGCTC ACAAATTCTC TCATGATGCA CGGAAGGAAC AACGGAAAGA 361 AGCTCATGGC TGTTCGTATT ATTAAGCATG CAATGGAGAT CATTCATTTG TTGACTGACC 421 AAAACCCAAT TCAAGTCATT GTTGATGCTG TTATCAACAG TGGGCCAAGG GAAGATGCAA 481 CACGTATTGG TTCTGCTGGT GTTGTCAGAC GTCAAGCTGT TGATATTTCT CCACTCCGTC 541 GTGTTAACCA AGCAATTTAT TTGCTGACAA CTGGTGCACG TGAGAGTGCT TTCAGGAACA 601 TCAAGACCAT AGCTGAGTGC CTTGCTGATG AACTCATCAA TGCTGCCAAG GGTTCTTCAA 661 ATAGCTATGC TATT Predicted gene structure (within gDNA segment 8193 to 4949): Exon 1 7967 7914 ( 54 n); cDNA 135 188 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 189 460 ( 272 n); score: 0.842 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6118 ( 204 n); cDNA 461 664 ( 204 n); score: 0.907 Intron 3 6117 6018 ( 100 n); Pd: 1.000 (s: 0.88), Pa: 1.000 (s: 0) Exon 4 6017 6009 ( 9 n); cDNA 665 673 ( 9 n); score: 1.000 MATCH C06HBa0153O03.1-7- SGN-E328566+ 0.862 539 0.800 C PGS_C06HBa0153O03.1-7-_SGN-E328566+ (7967 7914,6772 6501,6321 6118,6017 6009) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 188 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 188 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 188 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 188 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 188 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 188 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 188 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 188 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 188 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 188 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 188 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 188 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 188 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 188 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 188 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 188 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 188 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 188 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 188 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 193 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 253 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 |||||||||| || || |||| | |||||||| || || ||| ||||| |||| |||| || || ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 313 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 373 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 433 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 460 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 460 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 460 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 494 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT CAACCAAGCA 6228 |||||||||| | || || || |||||||||| |||||||||| |||||||||| ||||||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT TAACCAAGCA 554 ATATATCTCC TCACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCA 6168 || ||| | | | |||||||| |||||||||| |||||||||| |||||||||| ||||||||| ATTTATTTGC TGACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCT 614 GAATGCCTTG CAGATGAACT CATTAATGCT GCCAAGGGAT CTTCCAACAG GTAATCTTTT 6108 || ||||||| | |||||||| ||| |||||| |||||||| | |||| || || GAGTGCCTTG CTGATGAACT CATCAATGCT GCCAAGGGTT CTTCAAATAG .......... 664 CTATTGCCAT CTTTTTACTC CTATATGCGT TTAATCCTTA ATAAATGTAA CTATATTCTC 6048 .......... .......... .......... .......... .......... .......... 664 TGCCTACTTA TTCATTCTAT GTACGTGTAG CTATGCTAT 6009 ||||||||| .......... .......... .......... CTATGCTAT 673 hqPGS_C06HBa0153O03.1-7-_SGN-E328566+ (7967 7914,6772 6501,6321 6118,6017 6009) ******************************************************************************** EST sequence 120 +strand 614 n (File: SGN-E320513+) 1 TTAGACCTAT CAGAAAAACA GGAAAAATGG AAGAAGCTTC AGTAGTAGCA GTGGACAACC 61 AAAAGCCGCA GCAAGAGAAG CCTCACACTG ATGTTTTGCT TTTCAATCGT TGGTCATATG 121 ATGATGTTCA GATTGCTGAT ATTTCTGTTG AGGATTACAT AACTGCTACT GCTAACAAGC 181 ATCCTACATA TACACCACAC ACAGCTGGGA GGTACCAAGC CAAGCGGTTT AGAAAGGCTC 241 AATGCCCAAT TGTGGAGAGG TTGACCAACT CACTGATGAT GCACGGAAGG AACAACGGGA 301 AGAAGTTGAT GGCCGTTCGT ATTATTAAGC ATGCTATGGA AATTATCCAT CTGTTGACTG 361 ACCTAAACCC AATCCAAGTG ATTGTTGATG CTGTTATCAA CAGTGGACCA AGAGAAGATG 421 CAACTCGTAT AGGTTCTGCT GGTGTTGTGA GGCGACAAGC TGTTGATATT TCTCCACTCC 481 GTCGTGTCAA CCAAGCAATA TATCTCCTCA CAACTGGTGC ACGTGAGAGT GCTTTCAGGA 541 ACATCAAGAC CATAGCAGAA TGCCTTGCAG ATGAACTCAT TAATGCTGCC AAGGGATCTT 601 CCCACAGCTA TGCT Predicted gene structure (within gDNA segment 8193 to 5393): Exon 1 8044 7914 ( 131 n); cDNA 1 131 ( 131 n); score: 1.000 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 1.00), Pa: 1.000 (s: 1.00) Exon 2 6772 6501 ( 272 n); cDNA 132 403 ( 272 n); score: 1.000 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 1.00), Pa: 0.999 (s: 1.00) Exon 3 6321 6118 ( 204 n); cDNA 404 607 ( 204 n); score: 0.995 Intron 3 6117 6018 ( 100 n); Pd: 1.000 (s: 0.98), Pa: 1.000 (s: 0) Exon 4 6017 6011 ( 7 n); cDNA 608 614 ( 7 n); score: 1.000 MATCH C06HBa0153O03.1-7- SGN-E320513+ 0.998 614 1.000 C PGS_C06HBa0153O03.1-7-_SGN-E320513+ (8044 7914,6772 6501,6321 6118,6017 6011) Alignment (genomic DNA sequence = upper lines): TTAGACCTAT CAGAAAAACA GGAAAAATGG AAGAAGCTTC AGTAGTAGCA GTGGACAACC 7985 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTAGACCTAT CAGAAAAACA GGAAAAATGG AAGAAGCTTC AGTAGTAGCA GTGGACAACC 60 AAAAGCCGCA GCAAGAGAAG CCTCACACTG ATGTTTTGCT TTTCAATCGT TGGTCATATG 7925 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAAGCCGCA GCAAGAGAAG CCTCACACTG ATGTTTTGCT TTTCAATCGT TGGTCATATG 120 ATGATGTTCA GGTTTGTTTG TTTCCCTTTC AATTTTATTC CTCTCCAGTT CCTATATCTT 7865 |||||||||| | ATGATGTTCA G......... .......... .......... .......... .......... 131 TTCATTATTT GCCTAACATT AATGTCGAAT TGGATGAAAC TTGGCATTTT CGAAATCATA 7805 .......... .......... .......... .......... .......... .......... 131 AGATGAACAT TTGAATTATT TTGTTTCTTG CGTTAGCTAA ACTCTAATTG TAGTGTAGCA 7745 .......... .......... .......... .......... .......... .......... 131 GAGGTGATAT ATCAGTAAGG GTGGGCATGG TAGGGTAGAT ACCGAAACCA AAATTTTTCA 7685 .......... .......... .......... .......... .......... .......... 131 CTCAATGGTT TCAATATCAT GACATTTGAT ATTATTTATA ATGTATATCG AATCACCAAA 7625 .......... .......... .......... .......... .......... .......... 131 TACTTTAACA GAGTGTATAG TTAGGTATCC AATTCATTTA TCGTATTATA ATACTAACAA 7565 .......... .......... .......... .......... .......... .......... 131 ATATATTAAC TAGTATTAGT TCAAAGTTGT TTAGACATTG AAAGCTTTGA CTACTCTTTT 7505 .......... .......... .......... .......... .......... .......... 131 CTTGTTAGAA TTGTCCTTTT TGTGTAATTG ATTAAGTGAT GGAATTGCTT CTTCTTTCTT 7445 .......... .......... .......... .......... .......... .......... 131 TTGAATATTT TTACATGAGT AAGATCTTTA TATGATATAA TTAAGAAGTT TCTAAAGAAA 7385 .......... .......... .......... .......... .......... .......... 131 CCAAAACATA ATTCTCTATT TATATGAGTA TATGTAAGTC GAAGTCGAAC AAACAATGGT 7325 .......... .......... .......... .......... .......... .......... 131 TACCAACCAA AAGTTAAAAA GTATCGGCAC ATAATGGTTT AATTTGATAT GGTAATGGTA 7265 .......... .......... .......... .......... .......... .......... 131 TAGTACTTTT AAAAATCAAA ATTATTGAAC CAAAGTTTTC AATATTGTAT CATACCTTTC 7205 .......... .......... .......... .......... .......... .......... 131 CATGCTCATC CCTACATATC AGTTCTCAAG TCCAATGCAT TGAATACTTA ACCATGGTTA 7145 .......... .......... .......... .......... .......... .......... 131 GGAAACTTGA AACACTATGC ACGACACTGC TTAGGTATGT CTATCAACTA TAAAGCCTGC 7085 .......... .......... .......... .......... .......... .......... 131 TGGCTTGATC TTCTTATTCA AAGAAACATG CATGCTAAAC ATGATATGAT TAAGTTGAAC 7025 .......... .......... .......... .......... .......... .......... 131 AGAATAGTGT TGGTTTCCCC AATCCATAAC AAGCCAACTG GGACAACCTT ACAGAAGGTG 6965 .......... .......... .......... .......... .......... .......... 131 TGCCTATTCA TCATTGTTGC CTTGTAAATG ATGGATTTAT ACAACTGAAA ATTACTTGCT 6905 .......... .......... .......... .......... .......... .......... 131 GAGAGTTCAG GGAAATCCTT GTTGGTTAAG TTGGAAATGT AATTGTAGGT GGATTCTTCA 6845 .......... .......... .......... .......... .......... .......... 131 TTGGAATGCT CAAAGGAGAA ATTCAGTATA TGATCTCTTG AATTCTCTCT TAAATGTTAT 6785 .......... .......... .......... .......... .......... .......... 131 TATCTCATGC AGATTGCTGA TATTTCTGTT GAGGATTACA TAACTGCTAC TGCTAACAAG 6725 |||||||| |||||||||| |||||||||| |||||||||| |||||||||| .......... ..ATTGCTGA TATTTCTGTT GAGGATTACA TAACTGCTAC TGCTAACAAG 179 CATCCTACAT ATACACCACA CACAGCTGGG AGGTACCAAG CCAAGCGGTT TAGAAAGGCT 6665 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CATCCTACAT ATACACCACA CACAGCTGGG AGGTACCAAG CCAAGCGGTT TAGAAAGGCT 239 CAATGCCCAA TTGTGGAGAG GTTGACCAAC TCACTGATGA TGCACGGAAG GAACAACGGG 6605 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAATGCCCAA TTGTGGAGAG GTTGACCAAC TCACTGATGA TGCACGGAAG GAACAACGGG 299 AAGAAGTTGA TGGCCGTTCG TATTATTAAG CATGCTATGG AAATTATCCA TCTGTTGACT 6545 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAGAAGTTGA TGGCCGTTCG TATTATTAAG CATGCTATGG AAATTATCCA TCTGTTGACT 359 GACCTAAACC CAATCCAAGT GATTGTTGAT GCTGTTATCA ACAGGTTTAG AGATTATTCT 6485 |||||||||| |||||||||| |||||||||| |||||||||| |||| GACCTAAACC CAATCCAAGT GATTGTTGAT GCTGTTATCA ACAG...... .......... 403 GATTTTTGCA TATTTATTAG CTCGAGTTTT TCTTGCTGAG GTCTTGTTAA TTAGAAGATT 6425 .......... .......... .......... .......... .......... .......... 403 TTCATACCAT GTCTTCTTTG TTCCATTTCC ATGTCGCGGC ATACTTGAGA TATTGTAGTC 6365 .......... .......... .......... .......... .......... .......... 403 ATTCTCATTT TTTCCTTCCC ATATTCTTAC CTATGTGATG CAGTGGACCA AGAGAAGATG 6305 ||||||| |||||||||| .......... .......... .......... .......... ...TGGACCA AGAGAAGATG 420 CAACTCGTAT AGGTTCTGCT GGTGTTGTGA GGCGACAAGC TGTTGATATT TCTCCACTCC 6245 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAACTCGTAT AGGTTCTGCT GGTGTTGTGA GGCGACAAGC TGTTGATATT TCTCCACTCC 480 GTCGTGTCAA CCAAGCAATA TATCTCCTCA CAACTGGTGC ACGTGAGAGT GCTTTCAGGA 6185 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTCGTGTCAA CCAAGCAATA TATCTCCTCA CAACTGGTGC ACGTGAGAGT GCTTTCAGGA 540 ACATCAAGAC CATAGCAGAA TGCCTTGCAG ATGAACTCAT TAATGCTGCC AAGGGATCTT 6125 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACATCAAGAC CATAGCAGAA TGCCTTGCAG ATGAACTCAT TAATGCTGCC AAGGGATCTT 600 CCAACAGGTA ATCTTTTCTA TTGCCATCTT TTTACTCCTA TATGCGTTTA ATCCTTAATA 6065 || |||| CCCACAG... .......... .......... .......... .......... .......... 607 AATGTAACTA TATTCTCTGC CTACTTATTC ATTCTATGTA CGTGTAGCTA TGCT 6011 ||| |||| .......... .......... .......... .......... .......CTA TGCT 614 hqPGS_C06HBa0153O03.1-7-_SGN-E320513+ (8044 7914,6772 6501,6321 6118,6017 6011) ******************************************************************************** EST sequence 69 +strand 614 n (File: SGN-E292203+) 1 GCAAGAGAAG TAGAAGAAGA AATGGACGCA GGTGTAGTTG CTGCCCCCGC CCCGGCCGCC 61 GCCGTCGATG CAAGCAAAGA GAATAAGGTT CACACTGATG TCATGCTTTT CAATCGCTGG 121 AGCTATGATG GAGTTGAGAT CAATGACATG TCTGTTGAGG ATTACATCAC CGCAACTGCT 181 AACAAGCACC CAGTTTACAT GCCACACACA GCTGGTAGAT ACCAGGCCAA GCGTTTCAGG 241 AAGGCTCAGT GCCCAATCGT TGAGAGGCTC ACAAATTCTC TCATGATGCA CGGAAGGAAC 301 AACGGAAAGA AGCTCATGGC TGTTCGTATT ATTAAGCATG CAATGGAGAT CATTCATTTG 361 TTGACTGACC AAAACCCAAT TCAAGTCATT GTTGATGCTG TTATCAACAG TGGGCCAAGG 421 GAAGATGCAA CACGTATTGG TTCTGCTGGT GTTGTCAGAC GTCAAGCTGT TGATATTTCT 481 CCACTCCGTC GTGTTAACCA AGCAATTTAT TTGCTGACAA CTGGTGCACG TGAGAGTGCT 541 TTCAGGAACA TCAAGACCAT AGCTGAGTGC CTTGCTGATG AACTCATCAA TGCTGCCAAA 601 GGTTCTTCAA ATAG Predicted gene structure (within gDNA segment 8193 to 5049): Exon 1 7967 7914 ( 54 n); cDNA 85 138 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 139 410 ( 272 n); score: 0.842 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6118 ( 204 n); cDNA 411 614 ( 204 n); score: 0.902 MATCH C06HBa0153O03.1-7- SGN-E292203+ 0.860 530 0.863 C PGS_C06HBa0153O03.1-7-_SGN-E292203+ (7967 7914,6772 6501,6321 6118) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 138 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 138 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 138 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 138 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 138 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 138 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 138 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 138 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 138 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 138 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 138 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 138 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 138 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 138 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 138 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 138 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 138 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 138 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 138 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 143 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 203 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 |||||||||| || || |||| | |||||||| || || ||| ||||| |||| |||| || || ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 263 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 323 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 383 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 410 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 410 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 410 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 444 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT CAACCAAGCA 6228 |||||||||| | || || || |||||||||| |||||||||| |||||||||| ||||||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT TAACCAAGCA 504 ATATATCTCC TCACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCA 6168 || ||| | | | |||||||| |||||||||| |||||||||| |||||||||| ||||||||| ATTTATTTGC TGACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCT 564 GAATGCCTTG CAGATGAACT CATTAATGCT GCCAAGGGAT CTTCCAACAG 6118 || ||||||| | |||||||| ||| |||||| ||||| || | |||| || || GAGTGCCTTG CTGATGAACT CATCAATGCT GCCAAAGGTT CTTCAAATAG 614 hqPGS_C06HBa0153O03.1-7-_SGN-E292203+ (7967 7914,6772 6501,6321 6118) ******************************************************************************** EST sequence 53 +strand 440 n (File: SGN-E325676+) 1 ATCACCGCAA CTGCTAACAA GCACCCAGTT TACATGCCAC ACACAGCTGG TAGATACCAG 61 GCCAAGCGTT TCAGGAAGGC TCAGTGCCCA ATCGTTGAGA GGCTCACAAA TTCTCTCATG 121 ATGCACGGAA GGAACAACGG AAAGAAGCTC ATGGCTGTTC GTATTATTAA GCATGCAATG 181 GAGATCATTC ATTTGTTGAC TGACCAAAAC CCAATTCAAG TCATTGTTGA TGCTGTTATC 241 AACAGTGGGC CAAGGGAAGA TGCGACACGT ATTGGATCTG CTGGTGTTGT CAGACGTCAA 301 GCTGTTGATA TTTCTCCACT CCGTCGTGTT AACCAAGCAA TTTATTTGCT GACAACTGGT 361 GCACGTGAGA GTGCTTTCAG GAACATCAAG AACATAGCTG AGTGCCTTGC TGATGAACTC 421 ATCAATGCTG CCAAGGGTTC Predicted gene structure (within gDNA segment 8193 to 5076): Exon 1 6745 6501 ( 245 n); cDNA 1 245 ( 245 n); score: 0.845 Intron 1 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.84) Exon 2 6321 6127 ( 195 n); cDNA 246 440 ( 195 n); score: 0.897 MATCH C06HBa0153O03.1-7- SGN-E325676+ 0.868 440 1.000 C PGS_C06HBa0153O03.1-7-_SGN-E325676+ (6745 6501,6321 6127) Alignment (genomic DNA sequence = upper lines): ATAACTGCTA CTGCTAACAA GCATCCTACA TATACACCAC ACACAGCTGG GAGGTACCAA 6686 || || || | |||||||||| ||| || || | |||| |||||||||| || ||||| ATCACCGCAA CTGCTAACAA GCACCCAGTT TACATGCCAC ACACAGCTGG TAGATACCAG 60 GCCAAGCGGT TTAGAAAGGC TCAATGCCCA ATTGTGGAGA GGTTGACCAA CTCACTGATG 6626 |||||||| | | || ||||| ||| |||||| || || |||| || | || || || || ||| GCCAAGCGTT TCAGGAAGGC TCAGTGCCCA ATCGTTGAGA GGCTCACAAA TTCTCTCATG 120 ATGCACGGAA GGAACAACGG GAAGAAGTTG ATGGCCGTTC GTATTATTAA GCATGCTATG 6566 |||||||||| |||||||||| |||||| | ||||| |||| |||||||||| |||||| ||| ATGCACGGAA GGAACAACGG AAAGAAGCTC ATGGCTGTTC GTATTATTAA GCATGCAATG 180 GAAATTATCC ATCTGTTGAC TGACCTAAAC CCAATCCAAG TGATTGTTGA TGCTGTTATC 6506 || || || | || ||||||| ||||| |||| ||||| |||| | |||||||| |||||||||| GAGATCATTC ATTTGTTGAC TGACCAAAAC CCAATTCAAG TCATTGTTGA TGCTGTTATC 240 AACAGGTTTA GAGATTATTC TGATTTTTGC ATATTTATTA GCTCGAGTTT TTCTTGCTGA 6446 ||||| AACAG..... .......... .......... .......... .......... .......... 245 GGTCTTGTTA ATTAGAAGAT TTTCATACCA TGTCTTCTTT GTTCCATTTC CATGTCGCGG 6386 .......... .......... .......... .......... .......... .......... 245 CATACTTGAG ATATTGTAGT CATTCTCATT TTTTCCTTCC CATATTCTTA CCTATGTGAT 6326 .......... .......... .......... .......... .......... .......... 245 GCAGTGGACC AAGAGAAGAT GCAACTCGTA TAGGTTCTGC TGGTGTTGTG AGGCGACAAG 6266 ||| || ||| |||||| || || |||| | || ||||| ||||||||| || || |||| ....TGGGCC AAGGGAAGAT GCGACACGTA TTGGATCTGC TGGTGTTGTC AGACGTCAAG 301 CTGTTGATAT TTCTCCACTC CGTCGTGTCA ACCAAGCAAT ATATCTCCTC ACAACTGGTG 6206 |||||||||| |||||||||| |||||||| | |||||||||| ||| | || |||||||||| CTGTTGATAT TTCTCCACTC CGTCGTGTTA ACCAAGCAAT TTATTTGCTG ACAACTGGTG 361 CACGTGAGAG TGCTTTCAGG AACATCAAGA CCATAGCAGA ATGCCTTGCA GATGAACTCA 6146 |||||||||| |||||||||| |||||||||| |||||| || |||||||| |||||||||| CACGTGAGAG TGCTTTCAGG AACATCAAGA ACATAGCTGA GTGCCTTGCT GATGAACTCA 421 TTAATGCTGC CAAGGGATC 6127 | |||||||| |||||| || TCAATGCTGC CAAGGGTTC 440 hqPGS_C06HBa0153O03.1-7-_SGN-E325676+ (6745 6501,6321 6127) ******************************************************************************** EST sequence 59 +strand 638 n (File: SGN-E304505+) 1 ACAGCAGAAA GACACTTCTC CCCGGAAGGT GAATTAGAGC AGGCAAGAGA AGTAGAAGAA 61 GAAATGGACG CAGGTGTAGT TGCTGCCCCC GCCCCGGCCG CCGCCGTCGA TGCAAGCAAA 121 GAGAATAAGG TTCACACTGA TGTCATGCTT TTCAATCGCT GGAGCTATGA TGGAGTTGAG 181 ATCAATGACA TGTCTGTTGA GGATTACATC ACCGCAACTG CTAACAAGCA CCCAGTTTAC 241 ATGCCACACA CAGCTGGTAG ATACCAGGCC AAGCGTTTCA GGAAGGCTCA GTGCCCAATC 301 GTTGAGAGGC TCACAAATTC TCTCATGATG CACGGAAGGA ACAACGGAAA GAAGCTCATG 361 GCTGTTCGTA TTATTAAGCA TGCAATGGAG ATCATTCATT TGTTGACTGA CCAAAACCCA 421 ATTCAAGTCA TTGTTGATGC TGTTATCAAC AGTGGGCCAA GGGAAGATGC AACACGTATT 481 GGTTCTGCTG GTGTTGTCAG ACGTCAAGCT GTTGATATTT CTCCACTCCG TCGTGTTAAC 541 CAAGCAATTT ATTTGCTGAC AACTGGTGCA CGTGAGAGTG CTTTCAGGAA CATCAAGACC 601 ATAGCTGAGT GCCTTGCTGA TGAACTCATC AATGCTGC Predicted gene structure (within gDNA segment 8193 to 5229): Exon 1 7967 7914 ( 54 n); cDNA 127 180 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 181 452 ( 272 n); score: 0.842 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6136 ( 186 n); cDNA 453 638 ( 186 n); score: 0.914 MATCH C06HBa0153O03.1-7- SGN-E304505+ 0.863 512 0.803 C PGS_C06HBa0153O03.1-7-_SGN-E304505+ (7967 7914,6772 6501,6321 6136) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 180 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 180 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 180 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 180 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 180 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 180 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 180 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 180 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 180 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 180 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 180 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 180 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 180 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 180 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 180 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 180 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 180 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 180 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 180 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 185 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 245 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 |||||||||| || || |||| | |||||||| || || ||| ||||| |||| |||| || || ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 305 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 365 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 425 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 452 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 452 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 452 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 486 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT CAACCAAGCA 6228 |||||||||| | || || || |||||||||| |||||||||| |||||||||| ||||||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT TAACCAAGCA 546 ATATATCTCC TCACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCA 6168 || ||| | | | |||||||| |||||||||| |||||||||| |||||||||| ||||||||| ATTTATTTGC TGACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCT 606 GAATGCCTTG CAGATGAACT CATTAATGCT GC 6136 || ||||||| | |||||||| ||| |||||| || GAGTGCCTTG CTGATGAACT CATCAATGCT GC 638 hqPGS_C06HBa0153O03.1-7-_SGN-E304505+ (7967 7914,6772 6501,6321 6136) ******************************************************************************** EST sequence 123 +strand 602 n (File: SGN-E311109+) 1 TAGAGCATGC AAGAGAAGTA GAAGAAGAAA TGGACGCAGG TGTAGTTGCT GCCCCCGCCC 61 CGGCCGCCGC CGTCGATGCA AGCAAAGAGA ATAAGGTTCA CACTGATGTC ATGCTTTTCA 121 ATCGCTGGAG CTATGATGGA GTTGAGATCA ATGACATGTC TGTTGAGGAT TACATCACCG 181 CAACTGCTAA CAAGCACCCA GTTTACATGC CACACACAGC TGGTAGATAC CAGGCCAAGC 241 GTTTCAGGAA GGCTCAGTGC CCAATCGTTG AGAGGCTCAC AAATTCTCTC ATGATGCACG 301 GAAGGAACAA CGGAAAGAAG CTCATGGCTG TTCGTATTAT TAAGCATGCA ATGGAGATCA 361 TTCATTTGTT GACTGACCAA AACCCAATTC AAGTCATTGT TGATGCTGTT ATCAACAGTG 421 GGCCAAGGGA AGATGCAACA CGTATTGGTT CTGCTGGTGT TGTCAGACGT CAAGCTGTTG 481 ATATTTCTCC ACTCCGTCGT GTTAACCAAG CAATTTATTT GCTGACAACT GGTGCACGTG 541 AGAGTGCTTT CAGGAACATC AAGACCATAG CTGAGTGCCT TGCTGATGAA CTTATCAATG 601 CT Predicted gene structure (within gDNA segment 8193 to 5249): Exon 1 7967 7914 ( 54 n); cDNA 93 146 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 147 418 ( 272 n); score: 0.842 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6138 ( 184 n); cDNA 419 602 ( 184 n); score: 0.908 MATCH C06HBa0153O03.1-7- SGN-E311109+ 0.861 510 0.847 C PGS_C06HBa0153O03.1-7-_SGN-E311109+ (7967 7914,6772 6501,6321 6138) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 146 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 146 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 146 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 146 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 146 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 146 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 146 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 146 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 146 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 146 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 146 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 146 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 146 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 146 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 146 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 146 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 146 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 146 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 146 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 151 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 211 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 |||||||||| || || |||| | |||||||| || || ||| ||||| |||| |||| || || ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 271 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 331 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 391 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 418 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 418 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 418 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 452 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT CAACCAAGCA 6228 |||||||||| | || || || |||||||||| |||||||||| |||||||||| ||||||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT TAACCAAGCA 512 ATATATCTCC TCACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCA 6168 || ||| | | | |||||||| |||||||||| |||||||||| |||||||||| ||||||||| ATTTATTTGC TGACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCT 572 GAATGCCTTG CAGATGAACT CATTAATGCT 6138 || ||||||| | |||||||| || |||||| GAGTGCCTTG CTGATGAACT TATCAATGCT 602 hqPGS_C06HBa0153O03.1-7-_SGN-E311109+ (7967 7914,6772 6501,6321 6138) ******************************************************************************** EST sequence 73 +strand 588 n (File: SGN-E272678+) 1 GCTTAGACCT ATCAGAAAAA CAGGAAAAAT GGAAGAAGCT TCAGTAGTAG CAGTGGACAA 61 CCAAAAGCCG CAGCAAGAGA AGCCTCACAC TGATGTTTTG CTTTTCAATC GTTGGTCATA 121 TGATGATGTT CAGATTGCTG ATATTTCTGT TGAGGATTAC ATAACTGCTA CTGCTAACAA 181 GCATCCTACA TATACACCAC ACACAGCTGG GAGGTACCAA GCCAAGCGGT TTAGAAAGGC 241 TCAATGCCCA ATTGTGGAGA GGTTGACCAA CTCACTGATG ATGCACGGAA GGAACAACGG 301 GAAGAAGTTG ATGGCCGTTC GTATTATTAA GCATGCTATG GAAATTATCC ATCTGTTGAC 361 TGACCTAAAC CCAATCCAAG TGATTGTTGA TGCTGTTATC AACAGTGGAC CAAGAGAAGA 421 TGCAACTCGT ATAGGTTCTG CTGGTGTTGT GAGGCGACAA GCTGTTGATA TTTCTCCACT 481 CCGTCGTGTC AACCAAGCAA TATATCTCCT CACAACTGGT GCACGTGAGA GTGCTTTCAG 541 GAACATCAAG ACCATAGCAG AATGCCTTGC AGATGAACTC ATTAATGC Predicted gene structure (within gDNA segment 8193 to 5529): Exon 1 8045 7914 ( 132 n); cDNA 2 133 ( 132 n); score: 1.000 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 1.00), Pa: 1.000 (s: 1.00) Exon 2 6772 6501 ( 272 n); cDNA 134 405 ( 272 n); score: 1.000 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 1.00), Pa: 0.999 (s: 1.00) Exon 3 6321 6139 ( 183 n); cDNA 406 588 ( 183 n); score: 1.000 MATCH C06HBa0153O03.1-7- SGN-E272678+ 1.000 587 0.998 C PGS_C06HBa0153O03.1-7-_SGN-E272678+ (8045 7914,6772 6501,6321 6139) Alignment (genomic DNA sequence = upper lines): CTTAGACCTA TCAGAAAAAC AGGAAAAATG GAAGAAGCTT CAGTAGTAGC AGTGGACAAC 7986 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTAGACCTA TCAGAAAAAC AGGAAAAATG GAAGAAGCTT CAGTAGTAGC AGTGGACAAC 61 CAAAAGCCGC AGCAAGAGAA GCCTCACACT GATGTTTTGC TTTTCAATCG TTGGTCATAT 7926 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAAAAGCCGC AGCAAGAGAA GCCTCACACT GATGTTTTGC TTTTCAATCG TTGGTCATAT 121 GATGATGTTC AGGTTTGTTT GTTTCCCTTT CAATTTTATT CCTCTCCAGT TCCTATATCT 7866 |||||||||| || GATGATGTTC AG........ .......... .......... .......... .......... 133 TTTCATTATT TGCCTAACAT TAATGTCGAA TTGGATGAAA CTTGGCATTT TCGAAATCAT 7806 .......... .......... .......... .......... .......... .......... 133 AAGATGAACA TTTGAATTAT TTTGTTTCTT GCGTTAGCTA AACTCTAATT GTAGTGTAGC 7746 .......... .......... .......... .......... .......... .......... 133 AGAGGTGATA TATCAGTAAG GGTGGGCATG GTAGGGTAGA TACCGAAACC AAAATTTTTC 7686 .......... .......... .......... .......... .......... .......... 133 ACTCAATGGT TTCAATATCA TGACATTTGA TATTATTTAT AATGTATATC GAATCACCAA 7626 .......... .......... .......... .......... .......... .......... 133 ATACTTTAAC AGAGTGTATA GTTAGGTATC CAATTCATTT ATCGTATTAT AATACTAACA 7566 .......... .......... .......... .......... .......... .......... 133 AATATATTAA CTAGTATTAG TTCAAAGTTG TTTAGACATT GAAAGCTTTG ACTACTCTTT 7506 .......... .......... .......... .......... .......... .......... 133 TCTTGTTAGA ATTGTCCTTT TTGTGTAATT GATTAAGTGA TGGAATTGCT TCTTCTTTCT 7446 .......... .......... .......... .......... .......... .......... 133 TTTGAATATT TTTACATGAG TAAGATCTTT ATATGATATA ATTAAGAAGT TTCTAAAGAA 7386 .......... .......... .......... .......... .......... .......... 133 ACCAAAACAT AATTCTCTAT TTATATGAGT ATATGTAAGT CGAAGTCGAA CAAACAATGG 7326 .......... .......... .......... .......... .......... .......... 133 TTACCAACCA AAAGTTAAAA AGTATCGGCA CATAATGGTT TAATTTGATA TGGTAATGGT 7266 .......... .......... .......... .......... .......... .......... 133 ATAGTACTTT TAAAAATCAA AATTATTGAA CCAAAGTTTT CAATATTGTA TCATACCTTT 7206 .......... .......... .......... .......... .......... .......... 133 CCATGCTCAT CCCTACATAT CAGTTCTCAA GTCCAATGCA TTGAATACTT AACCATGGTT 7146 .......... .......... .......... .......... .......... .......... 133 AGGAAACTTG AAACACTATG CACGACACTG CTTAGGTATG TCTATCAACT ATAAAGCCTG 7086 .......... .......... .......... .......... .......... .......... 133 CTGGCTTGAT CTTCTTATTC AAAGAAACAT GCATGCTAAA CATGATATGA TTAAGTTGAA 7026 .......... .......... .......... .......... .......... .......... 133 CAGAATAGTG TTGGTTTCCC CAATCCATAA CAAGCCAACT GGGACAACCT TACAGAAGGT 6966 .......... .......... .......... .......... .......... .......... 133 GTGCCTATTC ATCATTGTTG CCTTGTAAAT GATGGATTTA TACAACTGAA AATTACTTGC 6906 .......... .......... .......... .......... .......... .......... 133 TGAGAGTTCA GGGAAATCCT TGTTGGTTAA GTTGGAAATG TAATTGTAGG TGGATTCTTC 6846 .......... .......... .......... .......... .......... .......... 133 ATTGGAATGC TCAAAGGAGA AATTCAGTAT ATGATCTCTT GAATTCTCTC TTAAATGTTA 6786 .......... .......... .......... .......... .......... .......... 133 TTATCTCATG CAGATTGCTG ATATTTCTGT TGAGGATTAC ATAACTGCTA CTGCTAACAA 6726 ||||||| |||||||||| |||||||||| |||||||||| |||||||||| .......... ...ATTGCTG ATATTTCTGT TGAGGATTAC ATAACTGCTA CTGCTAACAA 180 GCATCCTACA TATACACCAC ACACAGCTGG GAGGTACCAA GCCAAGCGGT TTAGAAAGGC 6666 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCATCCTACA TATACACCAC ACACAGCTGG GAGGTACCAA GCCAAGCGGT TTAGAAAGGC 240 TCAATGCCCA ATTGTGGAGA GGTTGACCAA CTCACTGATG ATGCACGGAA GGAACAACGG 6606 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCAATGCCCA ATTGTGGAGA GGTTGACCAA CTCACTGATG ATGCACGGAA GGAACAACGG 300 GAAGAAGTTG ATGGCCGTTC GTATTATTAA GCATGCTATG GAAATTATCC ATCTGTTGAC 6546 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAAGAAGTTG ATGGCCGTTC GTATTATTAA GCATGCTATG GAAATTATCC ATCTGTTGAC 360 TGACCTAAAC CCAATCCAAG TGATTGTTGA TGCTGTTATC AACAGGTTTA GAGATTATTC 6486 |||||||||| |||||||||| |||||||||| |||||||||| ||||| TGACCTAAAC CCAATCCAAG TGATTGTTGA TGCTGTTATC AACAG..... .......... 405 TGATTTTTGC ATATTTATTA GCTCGAGTTT TTCTTGCTGA GGTCTTGTTA ATTAGAAGAT 6426 .......... .......... .......... .......... .......... .......... 405 TTTCATACCA TGTCTTCTTT GTTCCATTTC CATGTCGCGG CATACTTGAG ATATTGTAGT 6366 .......... .......... .......... .......... .......... .......... 405 CATTCTCATT TTTTCCTTCC CATATTCTTA CCTATGTGAT GCAGTGGACC AAGAGAAGAT 6306 |||||| |||||||||| .......... .......... .......... .......... ....TGGACC AAGAGAAGAT 421 GCAACTCGTA TAGGTTCTGC TGGTGTTGTG AGGCGACAAG CTGTTGATAT TTCTCCACTC 6246 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCAACTCGTA TAGGTTCTGC TGGTGTTGTG AGGCGACAAG CTGTTGATAT TTCTCCACTC 481 CGTCGTGTCA ACCAAGCAAT ATATCTCCTC ACAACTGGTG CACGTGAGAG TGCTTTCAGG 6186 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CGTCGTGTCA ACCAAGCAAT ATATCTCCTC ACAACTGGTG CACGTGAGAG TGCTTTCAGG 541 AACATCAAGA CCATAGCAGA ATGCCTTGCA GATGAACTCA TTAATGC 6139 |||||||||| |||||||||| |||||||||| |||||||||| ||||||| AACATCAAGA CCATAGCAGA ATGCCTTGCA GATGAACTCA TTAATGC 588 hqPGS_C06HBa0153O03.1-7-_SGN-E272678+ (8045 7914,6772 6501,6321 6139) ******************************************************************************** EST sequence 97 +strand 573 n (File: SGN-E298350+) 1 AAAAACAGGA AAAATGGAAG AAGCTTCAGT AGTAGCAGTG GACAACCAAA AGCCGCAGCA 61 AGAGAAGCCT CACACTGATG TTTTGCTTTT CAATCGTTGG TCATATGATG ATGTTCAGAT 121 TGCTGATATT TCTGTTGAGG ATTACATAAC TGCTACTGCT AACAAGCATC CTACATATAC 181 ACCACACACA GCTGGGAGGT ACCAAGCCAA GCGGTTTAGA AAGGCTCAAT GCCCAATTGT 241 GGAGAGGTTG ACCAACTCAC TGATGATGCA CGGAAGGAAC AACGGGAAGA AGTTGATGGC 301 CGTTCGTATT ATTAAGCATG CTATGGAAAT TATCCATCTG TTGACTGACC TAAACCCAAT 361 CCAAGTGATT GTTGATGCTG TTATCAACAG TGGACCAAGA GAAGATGCAA CTCGTATAGG 421 TTCTGCTGGT GTTGTGAGGC GACAAGCTGT TGATATTTCT CCACTCCGTC GTGTCAACCA 481 AGCAATATAT CTCCTCACAA CTGGTGCACG TGAGAGTGCT TTCAGGAACA TCAAGACCAT 541 AGCAGAATGC CTTGCAGATG AACTCATTAA TGC Predicted gene structure (within gDNA segment 8193 to 5529): Exon 1 8031 7914 ( 118 n); cDNA 1 118 ( 118 n); score: 1.000 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 1.00), Pa: 1.000 (s: 1.00) Exon 2 6772 6501 ( 272 n); cDNA 119 390 ( 272 n); score: 1.000 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 1.00), Pa: 0.999 (s: 1.00) Exon 3 6321 6139 ( 183 n); cDNA 391 573 ( 183 n); score: 1.000 MATCH C06HBa0153O03.1-7- SGN-E298350+ 1.000 573 1.000 C PGS_C06HBa0153O03.1-7-_SGN-E298350+ (8031 7914,6772 6501,6321 6139) Alignment (genomic DNA sequence = upper lines): AAAAACAGGA AAAATGGAAG AAGCTTCAGT AGTAGCAGTG GACAACCAAA AGCCGCAGCA 7972 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAAACAGGA AAAATGGAAG AAGCTTCAGT AGTAGCAGTG GACAACCAAA AGCCGCAGCA 60 AGAGAAGCCT CACACTGATG TTTTGCTTTT CAATCGTTGG TCATATGATG ATGTTCAGGT 7912 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||| AGAGAAGCCT CACACTGATG TTTTGCTTTT CAATCGTTGG TCATATGATG ATGTTCAG.. 118 TTGTTTGTTT CCCTTTCAAT TTTATTCCTC TCCAGTTCCT ATATCTTTTC ATTATTTGCC 7852 .......... .......... .......... .......... .......... .......... 118 TAACATTAAT GTCGAATTGG ATGAAACTTG GCATTTTCGA AATCATAAGA TGAACATTTG 7792 .......... .......... .......... .......... .......... .......... 118 AATTATTTTG TTTCTTGCGT TAGCTAAACT CTAATTGTAG TGTAGCAGAG GTGATATATC 7732 .......... .......... .......... .......... .......... .......... 118 AGTAAGGGTG GGCATGGTAG GGTAGATACC GAAACCAAAA TTTTTCACTC AATGGTTTCA 7672 .......... .......... .......... .......... .......... .......... 118 ATATCATGAC ATTTGATATT ATTTATAATG TATATCGAAT CACCAAATAC TTTAACAGAG 7612 .......... .......... .......... .......... .......... .......... 118 TGTATAGTTA GGTATCCAAT TCATTTATCG TATTATAATA CTAACAAATA TATTAACTAG 7552 .......... .......... .......... .......... .......... .......... 118 TATTAGTTCA AAGTTGTTTA GACATTGAAA GCTTTGACTA CTCTTTTCTT GTTAGAATTG 7492 .......... .......... .......... .......... .......... .......... 118 TCCTTTTTGT GTAATTGATT AAGTGATGGA ATTGCTTCTT CTTTCTTTTG AATATTTTTA 7432 .......... .......... .......... .......... .......... .......... 118 CATGAGTAAG ATCTTTATAT GATATAATTA AGAAGTTTCT AAAGAAACCA AAACATAATT 7372 .......... .......... .......... .......... .......... .......... 118 CTCTATTTAT ATGAGTATAT GTAAGTCGAA GTCGAACAAA CAATGGTTAC CAACCAAAAG 7312 .......... .......... .......... .......... .......... .......... 118 TTAAAAAGTA TCGGCACATA ATGGTTTAAT TTGATATGGT AATGGTATAG TACTTTTAAA 7252 .......... .......... .......... .......... .......... .......... 118 AATCAAAATT ATTGAACCAA AGTTTTCAAT ATTGTATCAT ACCTTTCCAT GCTCATCCCT 7192 .......... .......... .......... .......... .......... .......... 118 ACATATCAGT TCTCAAGTCC AATGCATTGA ATACTTAACC ATGGTTAGGA AACTTGAAAC 7132 .......... .......... .......... .......... .......... .......... 118 ACTATGCACG ACACTGCTTA GGTATGTCTA TCAACTATAA AGCCTGCTGG CTTGATCTTC 7072 .......... .......... .......... .......... .......... .......... 118 TTATTCAAAG AAACATGCAT GCTAAACATG ATATGATTAA GTTGAACAGA ATAGTGTTGG 7012 .......... .......... .......... .......... .......... .......... 118 TTTCCCCAAT CCATAACAAG CCAACTGGGA CAACCTTACA GAAGGTGTGC CTATTCATCA 6952 .......... .......... .......... .......... .......... .......... 118 TTGTTGCCTT GTAAATGATG GATTTATACA ACTGAAAATT ACTTGCTGAG AGTTCAGGGA 6892 .......... .......... .......... .......... .......... .......... 118 AATCCTTGTT GGTTAAGTTG GAAATGTAAT TGTAGGTGGA TTCTTCATTG GAATGCTCAA 6832 .......... .......... .......... .......... .......... .......... 118 AGGAGAAATT CAGTATATGA TCTCTTGAAT TCTCTCTTAA ATGTTATTAT CTCATGCAGA 6772 | .......... .......... .......... .......... .......... .........A 119 TTGCTGATAT TTCTGTTGAG GATTACATAA CTGCTACTGC TAACAAGCAT CCTACATATA 6712 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGCTGATAT TTCTGTTGAG GATTACATAA CTGCTACTGC TAACAAGCAT CCTACATATA 179 CACCACACAC AGCTGGGAGG TACCAAGCCA AGCGGTTTAG AAAGGCTCAA TGCCCAATTG 6652 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CACCACACAC AGCTGGGAGG TACCAAGCCA AGCGGTTTAG AAAGGCTCAA TGCCCAATTG 239 TGGAGAGGTT GACCAACTCA CTGATGATGC ACGGAAGGAA CAACGGGAAG AAGTTGATGG 6592 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGGAGAGGTT GACCAACTCA CTGATGATGC ACGGAAGGAA CAACGGGAAG AAGTTGATGG 299 CCGTTCGTAT TATTAAGCAT GCTATGGAAA TTATCCATCT GTTGACTGAC CTAAACCCAA 6532 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCGTTCGTAT TATTAAGCAT GCTATGGAAA TTATCCATCT GTTGACTGAC CTAAACCCAA 359 TCCAAGTGAT TGTTGATGCT GTTATCAACA GGTTTAGAGA TTATTCTGAT TTTTGCATAT 6472 |||||||||| |||||||||| |||||||||| | TCCAAGTGAT TGTTGATGCT GTTATCAACA G......... .......... .......... 390 TTATTAGCTC GAGTTTTTCT TGCTGAGGTC TTGTTAATTA GAAGATTTTC ATACCATGTC 6412 .......... .......... .......... .......... .......... .......... 390 TTCTTTGTTC CATTTCCATG TCGCGGCATA CTTGAGATAT TGTAGTCATT CTCATTTTTT 6352 .......... .......... .......... .......... .......... .......... 390 CCTTCCCATA TTCTTACCTA TGTGATGCAG TGGACCAAGA GAAGATGCAA CTCGTATAGG 6292 |||||||||| |||||||||| |||||||||| .......... .......... .......... TGGACCAAGA GAAGATGCAA CTCGTATAGG 420 TTCTGCTGGT GTTGTGAGGC GACAAGCTGT TGATATTTCT CCACTCCGTC GTGTCAACCA 6232 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCTGCTGGT GTTGTGAGGC GACAAGCTGT TGATATTTCT CCACTCCGTC GTGTCAACCA 480 AGCAATATAT CTCCTCACAA CTGGTGCACG TGAGAGTGCT TTCAGGAACA TCAAGACCAT 6172 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGCAATATAT CTCCTCACAA CTGGTGCACG TGAGAGTGCT TTCAGGAACA TCAAGACCAT 540 AGCAGAATGC CTTGCAGATG AACTCATTAA TGC 6139 |||||||||| |||||||||| |||||||||| ||| AGCAGAATGC CTTGCAGATG AACTCATTAA TGC 573 hqPGS_C06HBa0153O03.1-7-_SGN-E298350+ (8031 7914,6772 6501,6321 6139) ******************************************************************************** EST sequence 81 +strand 625 n (File: SGN-E335097+) 1 GACACTTCTC CCCGGAAGGT GAATTAGAGC AGGCAAGAGA AGTAGAAGAA GAAATGGACG 61 CAGGTGTAGT TGCTGCCCCC GCCCCGGCCG CCGCCGTCGA TGCAAGCAAA GAGAATAAGG 121 TTCACACTGA TGTCATGCTT TTCAATCGCT GGAGCTATGA TGGAGTTGAG ATCAATGACA 181 TGTCTGTTGA GGATTACATC ACCGCAACTG CTAACAAGCA CCCAGTTTAC ATGCCACACA 241 CAGCTGGTAG ATACCAGGCC AAGCGTTTCA GGAAGGCTCA GTGCCCAATC GTTGAGAGGC 301 TCACAAATTC TCTCATGATG CACGGAAGGA ACAACGGAAA GAAGCTCATG GCTGTTCGTA 361 TTATTAAGCA TGCAATGGAG ATCATTCATT TGTTGACTGA CCAAAACCCA ATTCAAGTCA 421 TTGTTGATGC TGTTATCAAC AGTGGGCCAA GGGAAGATGC AACACGTATT GGTTCTGCTG 481 GTGTTGTCAG ACGTCAAGCT GTTGATATTT CTCCACTCCG TCGTGTTAAC CAAGCAATTT 541 ATTTGCTGAC AACTGGTGCA CGTGAGAGTG CTTTCAGGAA CATCAAGACC ATAGCTGAGT 601 GCCTTGCTGA TGAACTCATC AATGC Predicted gene structure (within gDNA segment 8193 to 5259): Exon 1 7967 7914 ( 54 n); cDNA 117 170 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 171 442 ( 272 n); score: 0.842 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6139 ( 183 n); cDNA 443 625 ( 183 n); score: 0.913 MATCH C06HBa0153O03.1-7- SGN-E335097+ 0.862 509 0.814 C PGS_C06HBa0153O03.1-7-_SGN-E335097+ (7967 7914,6772 6501,6321 6139) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 170 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 170 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 170 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 170 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 170 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 170 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 170 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 170 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 170 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 170 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 170 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 170 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 170 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 170 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 170 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 170 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 170 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 170 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 170 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 175 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 235 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 |||||||||| || || |||| | |||||||| || || ||| ||||| |||| |||| || || ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 295 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 355 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 415 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 442 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 442 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 442 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 476 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT CAACCAAGCA 6228 |||||||||| | || || || |||||||||| |||||||||| |||||||||| ||||||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT TAACCAAGCA 536 ATATATCTCC TCACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCA 6168 || ||| | | | |||||||| |||||||||| |||||||||| |||||||||| ||||||||| ATTTATTTGC TGACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCT 596 GAATGCCTTG CAGATGAACT CATTAATGC 6139 || ||||||| | |||||||| ||| ||||| GAGTGCCTTG CTGATGAACT CATCAATGC 625 hqPGS_C06HBa0153O03.1-7-_SGN-E335097+ (7967 7914,6772 6501,6321 6139) ******************************************************************************** EST sequence 60 +strand 629 n (File: SGN-E348912+) 1 AGAAAGACAC TTCTCCCCGG AAGGTGAATT AGAGCAGGCA AGAGAAGTAG AAGAAGAAAT 61 GGACGCAGGT GTAGTTGCTG CCCCCGCCCC GGCCGCCGCC GTCGATGCAA GCAAAGAGAA 121 TAAGGTTCAC ACTGATGTCA TGCTTTTCAA TCGCTGGAGC TATGATGGAG TTGAGATCAA 181 TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 241 ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 301 GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 361 TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 421 AGTCATTGTT GATGCTGTTA TCAACAGTGG GCCAAGGGAA GATGCAACAC GTATTGGTTC 481 TGCTGGTGTT GTCAGACGTC AAGCTGTTGA TATTTCTCCA CTCCGTCGTG TTAACCAAGC 541 AATTTATTTG CTGACAACTG GTGCACGTGA GAGTGCTTTC AGGAACATCA AGACCATAGC 601 TGAGTGCCTT GCTGATGAAC TCATCAATG Predicted gene structure (within gDNA segment 8193 to 5269): Exon 1 7967 7914 ( 54 n); cDNA 122 175 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 176 447 ( 272 n); score: 0.842 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6140 ( 182 n); cDNA 448 629 ( 182 n); score: 0.912 MATCH C06HBa0153O03.1-7- SGN-E348912+ 0.862 508 0.808 C PGS_C06HBa0153O03.1-7-_SGN-E348912+ (7967 7914,6772 6501,6321 6140) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 175 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 175 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 175 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 175 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 175 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 175 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 175 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 175 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 175 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 175 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 175 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 175 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 175 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 175 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 175 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 175 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 175 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 175 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 175 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 180 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 240 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 |||||||||| || || |||| | |||||||| || || ||| ||||| |||| |||| || || ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 300 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 360 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 420 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 447 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 447 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 447 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 481 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT CAACCAAGCA 6228 |||||||||| | || || || |||||||||| |||||||||| |||||||||| ||||||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT TAACCAAGCA 541 ATATATCTCC TCACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCA 6168 || ||| | | | |||||||| |||||||||| |||||||||| |||||||||| ||||||||| ATTTATTTGC TGACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCT 601 GAATGCCTTG CAGATGAACT CATTAATG 6140 || ||||||| | |||||||| ||| |||| GAGTGCCTTG CTGATGAACT CATCAATG 629 hqPGS_C06HBa0153O03.1-7-_SGN-E348912+ (7967 7914,6772 6501,6321 6140) ******************************************************************************** EST sequence 92 +strand 621 n (File: SGN-E335466+) 1 AGACACTTCT CCCCGGAAGG TGAATTAGAG CAGGCAAGAG AAGTAGAAGA AGAAATGGAC 61 GCAGGTGTAG TTGCTGCCCC CGCCCCGGCC GCCGCCGTCG ATGCAAGCAA AGAGAATAAG 121 GTTCACACTG ATGTCATGCT TTTCAATCGC TGGAGCTATG ATGGAGTTGA GATCAATGAC 181 ATGTCTGTTG AGGATTACAT CACCGCAACT GCTAACAAGC ACCCAGTTTA CATGCCACAC 241 ACAGCTGGTA GATACCAGGC CAAGCGTTTC AGGAAGGCTC AGTGCCCAAT CGTTGAGAGG 301 CTCACAAATT CTCTCATGAT GCACGGAAGG AACAACGGAA AGAAGCTCAT GGCTGTTCGT 361 ATTATTAAGC ATGCAATGGA GATCATTCAT TTGTTGACTG ACCAAAACCC AATTCAAGTC 421 ATTGTTGATG CTGTTATCAA CAGTGGGCCA AGGGAAGATG CAACACGTAT TGGTTCTGCT 481 GGTGTTGTCA GACGTCAAGC TGTTGATATT TCTCCACTCC GTCGTGTTAA CCAAGCAATT 541 TATTTGCTGA CAACTGGTGC ACGTGAGAGT GCTTTCAGGA ACATCAAGAC CATAACTGAG 601 TGCCTTGCTG ATGAACTCAT C Predicted gene structure (within gDNA segment 8193 to 5291): Exon 1 7967 7914 ( 54 n); cDNA 118 171 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 172 443 ( 272 n); score: 0.842 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6145 ( 177 n); cDNA 444 620 ( 177 n); score: 0.910 MATCH C06HBa0153O03.1-7- SGN-E335466+ 0.861 503 0.810 C PGS_C06HBa0153O03.1-7-_SGN-E335466+ (7967 7914,6772 6501,6321 6145) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 171 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 171 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 171 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 171 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 171 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 171 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 171 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 171 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 171 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 171 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 171 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 171 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 171 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 171 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 171 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 171 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 171 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 171 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 171 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 176 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 236 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 |||||||||| || || |||| | |||||||| || || ||| ||||| |||| |||| || || ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 296 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 356 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 416 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 443 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 443 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 443 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 477 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT CAACCAAGCA 6228 |||||||||| | || || || |||||||||| |||||||||| |||||||||| ||||||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT TAACCAAGCA 537 ATATATCTCC TCACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCA 6168 || ||| | | | |||||||| |||||||||| |||||||||| |||||||||| ||||||| | ATTTATTTGC TGACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAACT 597 GAATGCCTTG CAGATGAACT CAT 6145 || ||||||| | |||||||| ||| GAGTGCCTTG CTGATGAACT CAT 620 hqPGS_C06HBa0153O03.1-7-_SGN-E335466+ (7967 7914,6772 6501,6321 6145) ******************************************************************************** EST sequence 107 +strand 630 n (File: SGN-E276058+) 1 AGCATACAGT ATAAAGACAC TTCTCCCCGG AAGGTGAATT AGAGCAGGCA AGAGAAGTAG 61 AAGAAGAAAT GGACGCAGGT GTAGTTGCTG CCCCCGCCCC GGCCGCCGCC GTCGATGCAA 121 GCAAAGAGAA TAAGGTTCAC ACTGATGTCA TGCTTTTCAA TCGCTGGAGC TATGATGGAG 181 TTGAGATCAA TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG 241 TTTACATGCC ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC 301 CAATCGTTGA GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC 361 TCATGGCTGT TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA 421 ACCCAATTCA AGTCATTGTT GATGCTGTTA TCAACAGTGG GCCAAGGGAA GATGCAACAC 481 GTATTGGTTC TGCTGGTGTT GTCAGACGTC AAGCTGTTGA TATTTCTCCA CTCCGTCGTG 541 TTAACCAAGC AATTTATTTG CTGACAACTG GTGCACGTGA GAGTGCTTTC AGGAACATCA 601 AGACCATAGC TGAGTGCCTT GCTGATGAAC Predicted gene structure (within gDNA segment 8193 to 5359): Exon 1 7967 7914 ( 54 n); cDNA 132 185 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 186 457 ( 272 n); score: 0.842 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6149 ( 173 n); cDNA 458 630 ( 173 n); score: 0.913 MATCH C06HBa0153O03.1-7- SGN-E276058+ 0.862 499 0.792 C PGS_C06HBa0153O03.1-7-_SGN-E276058+ (7967 7914,6772 6501,6321 6149) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 185 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 185 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 185 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 185 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 185 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 185 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 185 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 185 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 185 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 185 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 185 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 185 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 185 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 185 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 185 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 185 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 185 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 185 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 185 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 190 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 250 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 |||||||||| || || |||| | |||||||| || || ||| ||||| |||| |||| || || ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 310 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 370 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 430 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 457 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 457 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 457 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 491 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT CAACCAAGCA 6228 |||||||||| | || || || |||||||||| |||||||||| |||||||||| ||||||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT TAACCAAGCA 551 ATATATCTCC TCACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCA 6168 || ||| | | | |||||||| |||||||||| |||||||||| |||||||||| ||||||||| ATTTATTTGC TGACAACTGG TGCACGTGAG AGTGCTTTCA GGAACATCAA GACCATAGCT 611 GAATGCCTTG CAGATGAAC 6149 || ||||||| | ||||||| GAGTGCCTTG CTGATGAAC 630 hqPGS_C06HBa0153O03.1-7-_SGN-E276058+ (7967 7914,6772 6501,6321 6149) ******************************************************************************** EST sequence 118 +strand 471 n (File: SGN-E283582+) 1 ATCGCTTGAG CTATGATGGA GTTGAGATCA ATGACATGTC TGTTGAGGAT TACATCACCG 61 CAACTGCTAA CAAGCACCCA GTTTACATGC CACACACAGC TGGTAGATAC CAGGCCAAGC 121 GTTTCAGGAA GGCTCAGTGC CCAATCGTTG AGAGGCTCAC AAATTCTCTC ATGATGCACG 181 GAAGGAACAA CGGAAAGAAG CTCATGGCTG TTCGTATTAT TAAGCATGCA ATGGAGATCA 241 TTCATTTGTT GACTGACCAA AACCCAATTC AAGTCATTGT TGATGCTGTT ATCAACAGTG 301 GGCCAAGGGA AGATGCAACA CGTATTGGTT CTGCTGGTGT TGTCAGACGT CAAGCTGTTG 361 ATATTTCTCC ACTCCGTCGT GTTAACCAAG CAATTTATTT GCTGACAACT GGTGCACGTG 421 AGAGTGCTTT CAGGAACATC AAGACCATAG CTGAGTGCCT TGCTGATGAA C Predicted gene structure (within gDNA segment 8193 to 5359): Exon 1 7939 7914 ( 26 n); cDNA 1 26 ( 26 n); score: 0.692 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 27 298 ( 272 n); score: 0.842 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6149 ( 173 n); cDNA 299 471 ( 173 n); score: 0.913 MATCH C06HBa0153O03.1-7- SGN-E283582+ 0.870 471 1.000 C PGS_C06HBa0153O03.1-7-_SGN-E283582+ (7939 7914,6772 6501,6321 6149) Alignment (genomic DNA sequence = upper lines): ATCGTTGGTC ATATGATGAT GTTCAGGTTT GTTTGTTTCC CTTTCAATTT TATTCCTCTC 7880 |||| | | ||||||| ||| || ATCGCTTGAG CTATGATGGA GTTGAG.... .......... .......... .......... 26 CAGTTCCTAT ATCTTTTCAT TATTTGCCTA ACATTAATGT CGAATTGGAT GAAACTTGGC 7820 .......... .......... .......... .......... .......... .......... 26 ATTTTCGAAA TCATAAGATG AACATTTGAA TTATTTTGTT TCTTGCGTTA GCTAAACTCT 7760 .......... .......... .......... .......... .......... .......... 26 AATTGTAGTG TAGCAGAGGT GATATATCAG TAAGGGTGGG CATGGTAGGG TAGATACCGA 7700 .......... .......... .......... .......... .......... .......... 26 AACCAAAATT TTTCACTCAA TGGTTTCAAT ATCATGACAT TTGATATTAT TTATAATGTA 7640 .......... .......... .......... .......... .......... .......... 26 TATCGAATCA CCAAATACTT TAACAGAGTG TATAGTTAGG TATCCAATTC ATTTATCGTA 7580 .......... .......... .......... .......... .......... .......... 26 TTATAATACT AACAAATATA TTAACTAGTA TTAGTTCAAA GTTGTTTAGA CATTGAAAGC 7520 .......... .......... .......... .......... .......... .......... 26 TTTGACTACT CTTTTCTTGT TAGAATTGTC CTTTTTGTGT AATTGATTAA GTGATGGAAT 7460 .......... .......... .......... .......... .......... .......... 26 TGCTTCTTCT TTCTTTTGAA TATTTTTACA TGAGTAAGAT CTTTATATGA TATAATTAAG 7400 .......... .......... .......... .......... .......... .......... 26 AAGTTTCTAA AGAAACCAAA ACATAATTCT CTATTTATAT GAGTATATGT AAGTCGAAGT 7340 .......... .......... .......... .......... .......... .......... 26 CGAACAAACA ATGGTTACCA ACCAAAAGTT AAAAAGTATC GGCACATAAT GGTTTAATTT 7280 .......... .......... .......... .......... .......... .......... 26 GATATGGTAA TGGTATAGTA CTTTTAAAAA TCAAAATTAT TGAACCAAAG TTTTCAATAT 7220 .......... .......... .......... .......... .......... .......... 26 TGTATCATAC CTTTCCATGC TCATCCCTAC ATATCAGTTC TCAAGTCCAA TGCATTGAAT 7160 .......... .......... .......... .......... .......... .......... 26 ACTTAACCAT GGTTAGGAAA CTTGAAACAC TATGCACGAC ACTGCTTAGG TATGTCTATC 7100 .......... .......... .......... .......... .......... .......... 26 AACTATAAAG CCTGCTGGCT TGATCTTCTT ATTCAAAGAA ACATGCATGC TAAACATGAT 7040 .......... .......... .......... .......... .......... .......... 26 ATGATTAAGT TGAACAGAAT AGTGTTGGTT TCCCCAATCC ATAACAAGCC AACTGGGACA 6980 .......... .......... .......... .......... .......... .......... 26 ACCTTACAGA AGGTGTGCCT ATTCATCATT GTTGCCTTGT AAATGATGGA TTTATACAAC 6920 .......... .......... .......... .......... .......... .......... 26 TGAAAATTAC TTGCTGAGAG TTCAGGGAAA TCCTTGTTGG TTAAGTTGGA AATGTAATTG 6860 .......... .......... .......... .......... .......... .......... 26 TAGGTGGATT CTTCATTGGA ATGCTCAAAG GAGAAATTCA GTATATGATC TCTTGAATTC 6800 .......... .......... .......... .......... .......... .......... 26 TCTCTTAAAT GTTATTATCT CATGCAGATT GCTGATATTT CTGTTGAGGA TTACATAACT 6740 || ||| || | |||||||||| |||||| || .......... .......... .......ATC AATGACATGT CTGTTGAGGA TTACATCACC 59 GCTACTGCTA ACAAGCATCC TACATATACA CCACACACAG CTGGGAGGTA CCAAGCCAAG 6680 || ||||||| ||||||| || || | |||||||||| |||| || || ||| |||||| GCAACTGCTA ACAAGCACCC AGTTTACATG CCACACACAG CTGGTAGATA CCAGGCCAAG 119 CGGTTTAGAA AGGCTCAATG CCCAATTGTG GAGAGGTTGA CCAACTCACT GATGATGCAC 6620 || || || | ||||||| || |||||| || |||||| | | | || || || ||||||||| CGTTTCAGGA AGGCTCAGTG CCCAATCGTT GAGAGGCTCA CAAATTCTCT CATGATGCAC 179 GGAAGGAACA ACGGGAAGAA GTTGATGGCC GTTCGTATTA TTAAGCATGC TATGGAAATT 6560 |||||||||| |||| ||||| | | ||||| |||||||||| |||||||||| ||||| || GGAAGGAACA ACGGAAAGAA GCTCATGGCT GTTCGTATTA TTAAGCATGC AATGGAGATC 239 ATCCATCTGT TGACTGACCT AAACCCAATC CAAGTGATTG TTGATGCTGT TATCAACAGG 6500 || ||| ||| ||||||||| ||||||||| ||||| |||| |||||||||| ||||||||| ATTCATTTGT TGACTGACCA AAACCCAATT CAAGTCATTG TTGATGCTGT TATCAACAG. 298 TTTAGAGATT ATTCTGATTT TTGCATATTT ATTAGCTCGA GTTTTTCTTG CTGAGGTCTT 6440 .......... .......... .......... .......... .......... .......... 298 GTTAATTAGA AGATTTTCAT ACCATGTCTT CTTTGTTCCA TTTCCATGTC GCGGCATACT 6380 .......... .......... .......... .......... .......... .......... 298 TGAGATATTG TAGTCATTCT CATTTTTTCC TTCCCATATT CTTACCTATG TGATGCAGTG 6320 || .......... .......... .......... .......... .......... ........TG 300 GACCAAGAGA AGATGCAACT CGTATAGGTT CTGCTGGTGT TGTGAGGCGA CAAGCTGTTG 6260 | ||||| || ||||||||| ||||| |||| |||||||||| ||| || || |||||||||| GGCCAAGGGA AGATGCAACA CGTATTGGTT CTGCTGGTGT TGTCAGACGT CAAGCTGTTG 360 ATATTTCTCC ACTCCGTCGT GTCAACCAAG CAATATATCT CCTCACAACT GGTGCACGTG 6200 |||||||||| |||||||||| || ||||||| |||| ||| | || |||||| |||||||||| ATATTTCTCC ACTCCGTCGT GTTAACCAAG CAATTTATTT GCTGACAACT GGTGCACGTG 420 AGAGTGCTTT CAGGAACATC AAGACCATAG CAGAATGCCT TGCAGATGAA C 6149 |||||||||| |||||||||| |||||||||| | || ||||| ||| |||||| | AGAGTGCTTT CAGGAACATC AAGACCATAG CTGAGTGCCT TGCTGATGAA C 471 hqPGS_C06HBa0153O03.1-7-_SGN-E283582+ (6772 6501,6321 6149) ******************************************************************************** EST sequence 45 +strand 430 n (File: SGN-E284368+) 1 TCTGTTGAGG ATTACATCAC CGCAACTGCT AACAAGCACC CAGTTTACAT GCCACACACA 61 GCTGGTAGAT ACCAGGCCAA GCGTTTCAGG AAGGCTCAGT GCCCAATCGT TGAGAGGCTC 121 ACAAATTCTC TCATGATGCA CGGAAGGAAC AACGGAAAGA AGCTCATGGC TGTTCGTATT 181 ATTAAGCATG CAATGGAGAT CATTCATTTG TTGACTGACC AAAACCCAAT TCAAGTCATT 241 GTTGATGCTG TTATCAACAG TGGGCCAAGG GAAGATGCAA CACGTATTGG TTCTGCTGGT 301 GTTGTCAGAC GTCAAGCTGT TGATATTTCT CCACTCCGTC GTGTTAACCA AGCAATTTAT 361 TTGCTGACAA CTGGTGCACG TGAGAGTGCT TTCAGGAACA TCAAGACCAT AGCTGAGTGC 421 CTTGCTGATG Predicted gene structure (within gDNA segment 8193 to 5389): Exon 1 6760 6501 ( 260 n); cDNA 1 260 ( 260 n); score: 0.854 Intron 1 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 2 6321 6152 ( 170 n); cDNA 261 430 ( 170 n); score: 0.912 MATCH C06HBa0153O03.1-7- SGN-E284368+ 0.877 430 1.000 C PGS_C06HBa0153O03.1-7-_SGN-E284368+ (6760 6501,6321 6152) Alignment (genomic DNA sequence = upper lines): TCTGTTGAGG ATTACATAAC TGCTACTGCT AACAAGCATC CTACATATAC ACCACACACA 6701 |||||||||| ||||||| || || |||||| |||||||| | | || | ||||||||| TCTGTTGAGG ATTACATCAC CGCAACTGCT AACAAGCACC CAGTTTACAT GCCACACACA 60 GCTGGGAGGT ACCAAGCCAA GCGGTTTAGA AAGGCTCAAT GCCCAATTGT GGAGAGGTTG 6641 ||||| || | |||| ||||| ||| || || |||||||| | ||||||| || |||||| | GCTGGTAGAT ACCAGGCCAA GCGTTTCAGG AAGGCTCAGT GCCCAATCGT TGAGAGGCTC 120 ACCAACTCAC TGATGATGCA CGGAAGGAAC AACGGGAAGA AGTTGATGGC CGTTCGTATT 6581 || || || | | |||||||| |||||||||| ||||| |||| || | ||||| ||||||||| ACAAATTCTC TCATGATGCA CGGAAGGAAC AACGGAAAGA AGCTCATGGC TGTTCGTATT 180 ATTAAGCATG CTATGGAAAT TATCCATCTG TTGACTGACC TAAACCCAAT CCAAGTGATT 6521 |||||||||| | ||||| || || ||| || |||||||||| ||||||||| ||||| ||| ATTAAGCATG CAATGGAGAT CATTCATTTG TTGACTGACC AAAACCCAAT TCAAGTCATT 240 GTTGATGCTG TTATCAACAG GTTTAGAGAT TATTCTGATT TTTGCATATT TATTAGCTCG 6461 |||||||||| |||||||||| GTTGATGCTG TTATCAACAG .......... .......... .......... .......... 260 AGTTTTTCTT GCTGAGGTCT TGTTAATTAG AAGATTTTCA TACCATGTCT TCTTTGTTCC 6401 .......... .......... .......... .......... .......... .......... 260 ATTTCCATGT CGCGGCATAC TTGAGATATT GTAGTCATTC TCATTTTTTC CTTCCCATAT 6341 .......... .......... .......... .......... .......... .......... 260 TCTTACCTAT GTGATGCAGT GGACCAAGAG AAGATGCAAC TCGTATAGGT TCTGCTGGTG 6281 | || ||||| | |||||||||| ||||| ||| |||||||||| .......... .........T GGGCCAAGGG AAGATGCAAC ACGTATTGGT TCTGCTGGTG 301 TTGTGAGGCG ACAAGCTGTT GATATTTCTC CACTCCGTCG TGTCAACCAA GCAATATATC 6221 |||| || || ||||||||| |||||||||| |||||||||| ||| |||||| ||||| ||| TTGTCAGACG TCAAGCTGTT GATATTTCTC CACTCCGTCG TGTTAACCAA GCAATTTATT 361 TCCTCACAAC TGGTGCACGT GAGAGTGCTT TCAGGAACAT CAAGACCATA GCAGAATGCC 6161 | || ||||| |||||||||| |||||||||| |||||||||| |||||||||| || || |||| TGCTGACAAC TGGTGCACGT GAGAGTGCTT TCAGGAACAT CAAGACCATA GCTGAGTGCC 421 TTGCAGATG 6152 |||| |||| TTGCTGATG 430 hqPGS_C06HBa0153O03.1-7-_SGN-E284368+ (6760 6501,6321 6152) ******************************************************************************** EST sequence 71 +strand 516 n (File: SGN-E338649+) 1 AAATGGAAGA AGCTTCAGTA GTAGCAGTGG ACCAACCAAA AGCCGCAGCA AGAGAAGCCT 61 CACACTGATG TTTTGCTTTT CAATCGTTGG TCATATGATG ATGTTCAGAT TGCTGATATT 121 TCTGTTGAGG ATTACATAAC TGCTACTGCT AACAAGCATC CTACATATAC CACCACACAC 181 AGCTGGGAGG TACCCAAGCC AAGCGGTTTA GAAAGGCTCA ATGCCCAATT GTGGAGAGGT 241 TGACCAACTC ACTGATGATG CACGGAAGGA ACAACGGGAA GAAGTTGATG GCCGTTCGTA 301 TTATTAAGCA TGCTATGGAA ATTATCCATC TGTTGACTGA CCTAAACCCA ATCCAAGTGA 361 TTGTTGATGC TGTTATCAAC AGTGGACCAA GAGAAGATGC AACTCGTATA GGTTCTGCTG 421 GTGTTGTGAG GCGACAAGCT GTTGATATTT CTCCACTCCG TCGTGTCAAC CAAGCAATAT 481 ATCTCCTCAC AACTGGTGCA CGTGAGAGTG CTTTCA Predicted gene structure (within gDNA segment 8193 to 5578): Exon 1 8020 7914 ( 107 n); cDNA 1 108 ( 108 n); score: 0.977 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 1.00), Pa: 1.000 (s: 1.00) Exon 2 6772 6501 ( 272 n); cDNA 109 382 ( 274 n); score: 0.982 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 1.00), Pa: 0.999 (s: 1.00) Exon 3 6321 6188 ( 134 n); cDNA 383 516 ( 134 n); score: 1.000 MATCH C06HBa0153O03.1-7- SGN-E338649+ 0.985 513 0.994 C PGS_C06HBa0153O03.1-7-_SGN-E338649+ (8020 7914,6772 6501,6321 6188) Alignment (genomic DNA sequence = upper lines): AAATGGAAGA AGCTTCAGTA GTAGCAGTGG A-CAACCAAA AGCCGCAGCA AGAGAAGCCT 7962 |||||||||| |||||||||| |||||||||| | |||||||| |||||||||| |||||||||| AAATGGAAGA AGCTTCAGTA GTAGCAGTGG ACCAACCAAA AGCCGCAGCA AGAGAAGCCT 60 CACACTGATG TTTTGCTTTT CAATCGTTGG TCATATGATG ATGTTCAGGT TTGTTTGTTT 7902 |||||||||| |||||||||| |||||||||| |||||||||| |||||||| CACACTGATG TTTTGCTTTT CAATCGTTGG TCATATGATG ATGTTCAG.. .......... 108 CCCTTTCAAT TTTATTCCTC TCCAGTTCCT ATATCTTTTC ATTATTTGCC TAACATTAAT 7842 .......... .......... .......... .......... .......... .......... 108 GTCGAATTGG ATGAAACTTG GCATTTTCGA AATCATAAGA TGAACATTTG AATTATTTTG 7782 .......... .......... .......... .......... .......... .......... 108 TTTCTTGCGT TAGCTAAACT CTAATTGTAG TGTAGCAGAG GTGATATATC AGTAAGGGTG 7722 .......... .......... .......... .......... .......... .......... 108 GGCATGGTAG GGTAGATACC GAAACCAAAA TTTTTCACTC AATGGTTTCA ATATCATGAC 7662 .......... .......... .......... .......... .......... .......... 108 ATTTGATATT ATTTATAATG TATATCGAAT CACCAAATAC TTTAACAGAG TGTATAGTTA 7602 .......... .......... .......... .......... .......... .......... 108 GGTATCCAAT TCATTTATCG TATTATAATA CTAACAAATA TATTAACTAG TATTAGTTCA 7542 .......... .......... .......... .......... .......... .......... 108 AAGTTGTTTA GACATTGAAA GCTTTGACTA CTCTTTTCTT GTTAGAATTG TCCTTTTTGT 7482 .......... .......... .......... .......... .......... .......... 108 GTAATTGATT AAGTGATGGA ATTGCTTCTT CTTTCTTTTG AATATTTTTA CATGAGTAAG 7422 .......... .......... .......... .......... .......... .......... 108 ATCTTTATAT GATATAATTA AGAAGTTTCT AAAGAAACCA AAACATAATT CTCTATTTAT 7362 .......... .......... .......... .......... .......... .......... 108 ATGAGTATAT GTAAGTCGAA GTCGAACAAA CAATGGTTAC CAACCAAAAG TTAAAAAGTA 7302 .......... .......... .......... .......... .......... .......... 108 TCGGCACATA ATGGTTTAAT TTGATATGGT AATGGTATAG TACTTTTAAA AATCAAAATT 7242 .......... .......... .......... .......... .......... .......... 108 ATTGAACCAA AGTTTTCAAT ATTGTATCAT ACCTTTCCAT GCTCATCCCT ACATATCAGT 7182 .......... .......... .......... .......... .......... .......... 108 TCTCAAGTCC AATGCATTGA ATACTTAACC ATGGTTAGGA AACTTGAAAC ACTATGCACG 7122 .......... .......... .......... .......... .......... .......... 108 ACACTGCTTA GGTATGTCTA TCAACTATAA AGCCTGCTGG CTTGATCTTC TTATTCAAAG 7062 .......... .......... .......... .......... .......... .......... 108 AAACATGCAT GCTAAACATG ATATGATTAA GTTGAACAGA ATAGTGTTGG TTTCCCCAAT 7002 .......... .......... .......... .......... .......... .......... 108 CCATAACAAG CCAACTGGGA CAACCTTACA GAAGGTGTGC CTATTCATCA TTGTTGCCTT 6942 .......... .......... .......... .......... .......... .......... 108 GTAAATGATG GATTTATACA ACTGAAAATT ACTTGCTGAG AGTTCAGGGA AATCCTTGTT 6882 .......... .......... .......... .......... .......... .......... 108 GGTTAAGTTG GAAATGTAAT TGTAGGTGGA TTCTTCATTG GAATGCTCAA AGGAGAAATT 6822 .......... .......... .......... .......... .......... .......... 108 CAGTATATGA TCTCTTGAAT TCTCTCTTAA ATGTTATTAT CTCATGCAGA TTGCTGATAT 6762 | |||||||||| .......... .......... .......... .......... .........A TTGCTGATAT 119 TTCTGTTGAG GATTACATAA CTGCTACTGC TAACAAGCAT CCTACATATA -CACCACACA 6703 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| TTCTGTTGAG GATTACATAA CTGCTACTGC TAACAAGCAT CCTACATATA CCACCACACA 179 CAGCTGGGAG GTA-CCAAGC CAAGCGGTTT AGAAAGGCTC AATGCCCAAT TGTGGAGAGG 6644 |||||||||| ||| |||||| |||||||||| |||||||||| |||||||||| |||||||||| CAGCTGGGAG GTACCCAAGC CAAGCGGTTT AGAAAGGCTC AATGCCCAAT TGTGGAGAGG 239 TTGACCAACT CACTGATGAT GCACGGAAGG AACAACGGGA AGAAGTTGAT GGCCGTTCGT 6584 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGACCAACT CACTGATGAT GCACGGAAGG AACAACGGGA AGAAGTTGAT GGCCGTTCGT 299 ATTATTAAGC ATGCTATGGA AATTATCCAT CTGTTGACTG ACCTAAACCC AATCCAAGTG 6524 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTATTAAGC ATGCTATGGA AATTATCCAT CTGTTGACTG ACCTAAACCC AATCCAAGTG 359 ATTGTTGATG CTGTTATCAA CAGGTTTAGA GATTATTCTG ATTTTTGCAT ATTTATTAGC 6464 |||||||||| |||||||||| ||| ATTGTTGATG CTGTTATCAA CAG....... .......... .......... .......... 382 TCGAGTTTTT CTTGCTGAGG TCTTGTTAAT TAGAAGATTT TCATACCATG TCTTCTTTGT 6404 .......... .......... .......... .......... .......... .......... 382 TCCATTTCCA TGTCGCGGCA TACTTGAGAT ATTGTAGTCA TTCTCATTTT TTCCTTCCCA 6344 .......... .......... .......... .......... .......... .......... 382 TATTCTTACC TATGTGATGC AGTGGACCAA GAGAAGATGC AACTCGTATA GGTTCTGCTG 6284 |||||||| |||||||||| |||||||||| |||||||||| .......... .......... ..TGGACCAA GAGAAGATGC AACTCGTATA GGTTCTGCTG 420 GTGTTGTGAG GCGACAAGCT GTTGATATTT CTCCACTCCG TCGTGTCAAC CAAGCAATAT 6224 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTGTTGTGAG GCGACAAGCT GTTGATATTT CTCCACTCCG TCGTGTCAAC CAAGCAATAT 480 ATCTCCTCAC AACTGGTGCA CGTGAGAGTG CTTTCA 6188 |||||||||| |||||||||| |||||||||| |||||| ATCTCCTCAC AACTGGTGCA CGTGAGAGTG CTTTCA 516 hqPGS_C06HBa0153O03.1-7-_SGN-E338649+ (8020 7914,6772 6501,6321 6188) ******************************************************************************** EST sequence 117 +strand 570 n (File: SGN-E244404+) 1 GAAAGACACT TCTCCCCGGA AGGTGAATTA GAGCAGGCAA GAGAAGTATA AGAAGAAATG 61 GACGCAGGTG TAGTTGCTGC CCCCGCCCCG GCCGCCGCCG TCGATGCAAG CAAAGAGAAT 121 AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAGATCAAT 181 GACATGTCTG TTGAGGATTA CATCACCGCA ACTGCTAACA AGCACCCAGT TTACATGCCA 241 CACACAGCTG GTAGATACCA GGCCAAGCGT TTCAGGAAGG CTCAGTGCCC AATCGTTGAG 301 AGGCTCACAA ATTCTCTCAT GATGCACGGA AGGAACAACG GAAAGAAGCT CATGGCTGTT 361 CGTATTATTA AGCATGCAAT GGAGATCATT CATTTGTTGA CTGACCAAAA CCCAATTCAA 421 GTCATTGTTG ATGCTGTTAT CAACAGTGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 481 GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGG TAACCAAGCA 541 ATTTATTTGC TGACAACTGG TGCACGTGAG Predicted gene structure (within gDNA segment 8193 to 5219): Exon 1 7967 7914 ( 54 n); cDNA 121 174 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 175 446 ( 272 n); score: 0.842 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6198 ( 124 n); cDNA 447 570 ( 124 n); score: 0.895 MATCH C06HBa0153O03.1-7- SGN-E244404+ 0.851 450 0.789 C PGS_C06HBa0153O03.1-7-_SGN-E244404+ (7967 7914,6772 6501,6321 6198) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 174 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 174 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 174 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 174 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 174 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 174 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 174 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 174 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 174 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 174 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 174 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 174 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 174 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 174 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 174 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 174 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 174 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 174 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 174 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 179 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 239 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 |||||||||| || || |||| | |||||||| || || ||| ||||| |||| |||| || || ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 299 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 359 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 419 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 446 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 446 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 446 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 480 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT CAACCAAGCA 6228 |||||||||| | || || || |||||||||| |||||||||| ||||||||| ||||||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGG TAACCAAGCA 540 ATATATCTCC TCACAACTGG TGCACGTGAG 6198 || ||| | | | |||||||| |||||||||| ATTTATTTGC TGACAACTGG TGCACGTGAG 570 hqPGS_C06HBa0153O03.1-7-_SGN-E244404+ (7967 7914,6772 6501,6321 6198) ******************************************************************************** EST sequence 115 +strand 567 n (File: SGN-E319578+) 1 AAAGACACTT CTCCCCGGAA GGTGAATTAT AGCAGGCAAG AGAAGTAGAA GAAGAAATGG 61 ACGCAGGTGT AGTTGCTGCC CCCGCCCCGG CCGCCGCCGT CGATGCAAGC AAAGAGAATA 121 AGGTTCACAC TGATGTCATG CTTTTCAATC GCTGGAGCTA TGATGGAGTT GAGATCAATG 181 ACATGTCTGT TGAGGATTAC ATCACCGCAA CTGCTAACAA GCACCCAGTT TACATGCCAC 241 ACACAGCTGG TAGATACCAG GCCAAGCGTT TCAGGAAGGC TCAGTGCCCA ATCGTTGAGA 301 GGCTCACAAA TTCTCTCATG ATGCACGGAA GGAACAACGG AAAGAAGCTC ATGGCTGTTC 361 GTATTATTAA GCATGCAATG GAGATCATTC ATTTGTTGAC TGACCAAAAC CCAATTCAAG 421 TCATTGTTGA TGCTGTTATC AACAGTGGGC CAAGGGAAGA TGCAACACGT ATTGGTTCTG 481 CTGGTGTTGT CAGACGTCAA GCTGTTGATA TTTCTCCACT CCGTCGTGTT AACCAAGCAA 541 TTTATTTGCT GACAACTGGT GCACGTG Predicted gene structure (within gDNA segment 8193 to 5248): Exon 1 7967 7914 ( 54 n); cDNA 120 173 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 174 445 ( 272 n); score: 0.842 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6200 ( 122 n); cDNA 446 567 ( 122 n); score: 0.902 MATCH C06HBa0153O03.1-7- SGN-E319578+ 0.853 448 0.790 C PGS_C06HBa0153O03.1-7-_SGN-E319578+ (7967 7914,6772 6501,6321 6200) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 173 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 173 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 173 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 173 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 173 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 173 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 173 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 173 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 173 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 173 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 173 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 173 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 173 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 173 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 173 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 173 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 173 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 173 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 173 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 178 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 238 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 |||||||||| || || |||| | |||||||| || || ||| ||||| |||| |||| || || ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 298 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 358 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 418 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 445 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 445 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 445 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 479 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT CAACCAAGCA 6228 |||||||||| | || || || |||||||||| |||||||||| |||||||||| ||||||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT TAACCAAGCA 539 ATATATCTCC TCACAACTGG TGCACGTG 6200 || ||| | | | |||||||| |||||||| ATTTATTTGC TGACAACTGG TGCACGTG 567 hqPGS_C06HBa0153O03.1-7-_SGN-E319578+ (7967 7914,6772 6501,6321 6200) ******************************************************************************** EST sequence 121 +strand 563 n (File: SGN-E308687+) 1 ACACTTCTCC CCGGAAGGTG AATTAGAGCA GGCAAGAGAA GTAGAAGAAG AAATGGACGC 61 AGGTGTAGTT GCTGCCCCCG CCCCGGCCGC CGCCGTCGAT GCAAGCAAAG AGAATAAGGT 121 TCACACTGAT GTCATGCTTT TCAATCGCTG GAGCTATGAT GGAGTTGAGA TCAATGACAT 181 GTCTGTTGAG GATTACATCA CCGCAACTGC TAACAAGCAC CCAGTTTACA TGCCACACAC 241 AGCTGGTAGA TACCAGGCCA AGCGTTTCAG GAAGGCTCAG TGCCCAATCG TTGAGAGGCT 301 CACAAATTCT CTCATGATGC ACGGAAGGAA CAACGGAAAG AAGCTCATGG CTGTTCGTAT 361 TATTAAGCAT GCAATGGAGA TCATTCATTT GTTGACTGAC CAAAACCCAA TTCAAGTCAT 421 TGTTGATGCT GTTATCAACA GTGGGCCAAG GGAAGATGCA ACACGTATTG GTTCTGCTGG 481 TGTTGTCAGA CGTCAAGCTG TTGATATTTC TCCACTCCGT CGTGTTAACC AAGCAATTTA 541 TTTGCTGACA ACTGGTGCAC GTG Predicted gene structure (within gDNA segment 8193 to 5248): Exon 1 7967 7914 ( 54 n); cDNA 116 169 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 170 441 ( 272 n); score: 0.842 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6200 ( 122 n); cDNA 442 563 ( 122 n); score: 0.902 MATCH C06HBa0153O03.1-7- SGN-E308687+ 0.853 448 0.796 C PGS_C06HBa0153O03.1-7-_SGN-E308687+ (7967 7914,6772 6501,6321 6200) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 169 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 169 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 169 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 169 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 169 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 169 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 169 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 169 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 169 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 169 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 169 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 169 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 169 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 169 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 169 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 169 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 169 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 169 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 169 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 174 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 234 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 |||||||||| || || |||| | |||||||| || || ||| ||||| |||| |||| || || ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 294 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 354 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 414 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 441 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 441 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 441 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 475 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT CAACCAAGCA 6228 |||||||||| | || || || |||||||||| |||||||||| |||||||||| ||||||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT TAACCAAGCA 535 ATATATCTCC TCACAACTGG TGCACGTG 6200 || ||| | | | |||||||| |||||||| ATTTATTTGC TGACAACTGG TGCACGTG 563 hqPGS_C06HBa0153O03.1-7-_SGN-E308687+ (7967 7914,6772 6501,6321 6200) ******************************************************************************** EST sequence 125 +strand 567 n (File: SGN-E288197+) 1 AAAGACACTT CTTCCCGGAA GGTGAATTAT AGCAGGCAAG AGAAGTATAA GATGAAATGG 61 ACGCAAGTGT AGTTGCTGTC CCCGCCCCGG CCGCCGACGT CGATGCAAGC AAAGAGAATA 121 AGGTTCACAC TGATGTCATG CTTTTCAATC GCTGGAGCTA TGATGGAGTT GAGATCAATG 181 ACATGTCTGT TGAGGATTAC ATCACCGCAA CTGCTAACAA GCACCCAGTT TACATGCCAC 241 ACACAGCTGG TAGATACCAA GCCAAGCGTT TCATGAAGGC TCAGTGCCCA ATCGTTGAGA 301 GGCTCACAAA TTCTCTCATG ATGCACGGAA GGAACAACGG AAAGAAGCTC ATGGCTGTTC 361 GTATTATTAA GCATGCAATG GAGATCATTC ATTTGTTGAC TGACCATAAC CCATTTCAAG 421 TCATTGTTGA TGCTGATATC AACAGTGGGC CAAGGGAAGA TGCAACACGT ATTGGTTCTG 481 CTGGTGTTGT CAGACGTCAA GCTGTTGATA TTTCTCCACT CCGTCGTGTT AACCAAGCAA 541 TTTATTTGCT GACAACTGGT GCACGTG Predicted gene structure (within gDNA segment 8193 to 5248): Exon 1 7967 7914 ( 54 n); cDNA 120 173 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 174 445 ( 272 n); score: 0.831 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.88), Pa: 0.999 (s: 0.88) Exon 3 6321 6200 ( 122 n); cDNA 446 567 ( 122 n); score: 0.902 MATCH C06HBa0153O03.1-7- SGN-E288197+ 0.846 448 0.790 C PGS_C06HBa0153O03.1-7-_SGN-E288197+ (7967 7914,6772 6501,6321 6200) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 173 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 173 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 173 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 173 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 173 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 173 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 173 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 173 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 173 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 173 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 173 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 173 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 173 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 173 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 173 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 173 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 173 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 173 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 173 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 178 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 238 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 |||||||||| || || |||| |||||||||| || | ||| ||||| |||| |||| || || ACACACAGCT GGTAGATACC AAGCCAAGCG TTTCATGAAG GCTCAGTGCC CAATCGTTGA 298 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 358 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| | ||||| | || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCATA ACCCATTTCA 418 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| ||||||| || ||||||| AGTCATTGTT GATGCTGATA TCAACAG... .......... .......... .......... 445 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 445 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 445 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 479 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT CAACCAAGCA 6228 |||||||||| | || || || |||||||||| |||||||||| |||||||||| ||||||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT TAACCAAGCA 539 ATATATCTCC TCACAACTGG TGCACGTG 6200 || ||| | | | |||||||| |||||||| ATTTATTTGC TGACAACTGG TGCACGTG 567 hqPGS_C06HBa0153O03.1-7-_SGN-E288197+ (7967 7914,6772 6501,6321 6200) ******************************************************************************** EST sequence 132 +strand 567 n (File: SGN-E290714+) 1 AGAAAGACAC TTCTCCCCGG AAGGTGAATT AGAGCAGGCA AGAGAAGTAG AAGAAGAAAT 61 GGACGCAGGT GTAGTTGCTG CCCCCGCCCC GGCCGCCGCC GTCGATGCAA GCAAAGAGAA 121 TAAGGTTCAC ACTGATGTCA TGCTTTTCAA TCGCTGGAGC TATGATGGAG TTGAGATCAA 181 TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 241 ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 301 GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 361 TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 421 AGTCATTGTT GATGCTGTTA TCAACAGTGG GCCAAGGGAA GATGCAACAC GTATTGGTTC 481 TGCTGGTGTT GTCAGACGTC AAGCTGTTGA TATTTCTCCA CTCCGTCGTG TTAACCAAGC 541 AATTTATTTG CTGACAACTG GTGCACG Predicted gene structure (within gDNA segment 8193 to 5268): Exon 1 7967 7914 ( 54 n); cDNA 122 175 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 176 447 ( 272 n); score: 0.842 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6202 ( 120 n); cDNA 448 567 ( 120 n); score: 0.900 MATCH C06HBa0153O03.1-7- SGN-E290714+ 0.852 446 0.787 C PGS_C06HBa0153O03.1-7-_SGN-E290714+ (7967 7914,6772 6501,6321 6202) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 175 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 175 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 175 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 175 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 175 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 175 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 175 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 175 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 175 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 175 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 175 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 175 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 175 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 175 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 175 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 175 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 175 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 175 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 175 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 180 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 240 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 |||||||||| || || |||| | |||||||| || || ||| ||||| |||| |||| || || ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 300 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 360 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 420 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 447 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 447 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 447 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 481 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT CAACCAAGCA 6228 |||||||||| | || || || |||||||||| |||||||||| |||||||||| ||||||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT TAACCAAGCA 541 ATATATCTCC TCACAACTGG TGCACG 6202 || ||| | | | |||||||| |||||| ATTTATTTGC TGACAACTGG TGCACG 567 hqPGS_C06HBa0153O03.1-7-_SGN-E290714+ (7967 7914,6772 6501,6321 6202) ******************************************************************************** EST sequence 94 +strand 561 n (File: SGN-E394069+) 1 AAAGACACTT CTCCCCGGAA GGTGAATTAG AGCAGGCAAG AGAAGTAGAA GAAGAAATGG 61 ACGCAGGTGT AGTTGCTGCC CCCGCCCCGG CCGCCGCCGT CGATGCAAGC AAAGAGAATA 121 AGGTTCACAC TGATGTCATG CTTTTCAATC GCTGGAGCTA TGATGGAGTT GAGATCAATG 181 ACATGTCTGT TGAGGATTAC ATCACCGCAA CTGCTAACAA GCACCCAGTT TACATGCCAC 241 ACACAGCTGG TAGATACCAG GCCAAGCGTT TCAGGAAGGC TCAGTGCCCA ATCGTTGAGA 301 GGCTCACAAA TTCTCTCATG ATGCACGGAA GGAACAACGG AAAGAAGCTC ATGGCTGTTC 361 GTATTATTAA GCATGCAATG GAGATCATTC ATTTGTTGAC TGACCAAAAC CCAATTCAAG 421 TCATTGTTGA TGCTGTTATC AACAGTGGGC CAAGGGAAGA TGCAACACGT ATTGGTTCTG 481 CTGGTGTTGT CAGACGTCAA GCTGTTGATA TTTCTCCACT CCGTCGTGTT AACCAAGCAA 541 TTTTTTTGCT GACAACTGGT G Predicted gene structure (within gDNA segment 8193 to 5308): Exon 1 7967 7914 ( 54 n); cDNA 120 173 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 174 445 ( 272 n); score: 0.842 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6206 ( 116 n); cDNA 446 561 ( 116 n); score: 0.888 MATCH C06HBa0153O03.1-7- SGN-E394069+ 0.848 442 0.788 C PGS_C06HBa0153O03.1-7-_SGN-E394069+ (7967 7914,6772 6501,6321 6206) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 173 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 173 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 173 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 173 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 173 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 173 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 173 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 173 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 173 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 173 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 173 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 173 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 173 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 173 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 173 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 173 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 173 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 173 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 173 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 178 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 238 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 |||||||||| || || |||| | |||||||| || || ||| ||||| |||| |||| || || ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 298 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 358 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 418 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 445 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 445 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 445 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 479 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT CAACCAAGCA 6228 |||||||||| | || || || |||||||||| |||||||||| |||||||||| ||||||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT TAACCAAGCA 539 ATATATCTCC TCACAACTGG TG 6206 || | | | | | |||||||| || ATTTTTTTGC TGACAACTGG TG 561 hqPGS_C06HBa0153O03.1-7-_SGN-E394069+ (7967 7914,6772 6501,6321 6206) ******************************************************************************** EST sequence 135 +strand 567 n (File: SGN-E246370+) 1 AAGACACCTT CTCCCCCGGA AGGTGAATTA GAGCAGGCCA AGAGAAGTAG AAGAAGAAAT 61 GGACCGCCAG GTGTAGTTGC TGCCCCCCGC CCCGGCCGCC GCCGTCGATG CAAGCAAAGA 121 GAATAAGGTT CACACCTGAT GTCATGCCTT TTCAATCGCT GGAGCTATGA TGGAGTTGAG 181 ATCAATGACC ATGTCTGTTG AGGATTACAT CACCGCAACC TGCTAACAAG CACCCAGTTT 241 ACATGCCACA CACAGCTGGT AGATACCAGG CCAAGCGTTT CAGGAAGGCT CAGTGCCCAA 301 TCGTTGAGAG GCTCACAAAT TCTCTCATGA TGCACGGAAG GAACAACGGA AAGAAGCTCA 361 TGGCTGTTCG TATTATTAAG CATGCAATGG AGATCATTCA TTTGTTGACT GACCAAAACC 421 CAATTCAAGT CATTGTTGAT GCTGTTATCA ACAGTGGGCC AAGGGAAGAT GCAACACGTA 481 TTGGTTCTGC TGGTGTTGTC AGACGTCAAG CTGTTGATAT TTCTCCACTC CGTCGTGTTA 541 ACCAAGCAAT TTATTTGCTG ACAACTG Predicted gene structure (within gDNA segment 8193 to 5338): Exon 1 7967 7914 ( 54 n); cDNA 125 180 ( 56 n); score: 0.704 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.71), Pa: 1.000 (s: 0.73) Exon 2 6772 6501 ( 272 n); cDNA 181 454 ( 274 n); score: 0.824 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6209 ( 113 n); cDNA 455 567 ( 113 n); score: 0.894 MATCH C06HBa0153O03.1-7- SGN-E246370+ 0.827 439 0.774 C PGS_C06HBa0153O03.1-7-_SGN-E246370+ (7967 7914,6772 6501,6321 6209) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA -CTGATGTTT TG-CTTTTCA ATCGTTGGTC ATATGATGAT GTTCAGGTTT 7910 ||| ||||| ||||||| || ||||||| |||| ||| ||||||| ||| || AAGGTTCACA CCTGATGTCA TGCCTTTTCA ATCGCTGGAG CTATGATGGA GTTGAG.... 180 GTTTGTTTCC CTTTCAATTT TATTCCTCTC CAGTTCCTAT ATCTTTTCAT TATTTGCCTA 7850 .......... .......... .......... .......... .......... .......... 180 ACATTAATGT CGAATTGGAT GAAACTTGGC ATTTTCGAAA TCATAAGATG AACATTTGAA 7790 .......... .......... .......... .......... .......... .......... 180 TTATTTTGTT TCTTGCGTTA GCTAAACTCT AATTGTAGTG TAGCAGAGGT GATATATCAG 7730 .......... .......... .......... .......... .......... .......... 180 TAAGGGTGGG CATGGTAGGG TAGATACCGA AACCAAAATT TTTCACTCAA TGGTTTCAAT 7670 .......... .......... .......... .......... .......... .......... 180 ATCATGACAT TTGATATTAT TTATAATGTA TATCGAATCA CCAAATACTT TAACAGAGTG 7610 .......... .......... .......... .......... .......... .......... 180 TATAGTTAGG TATCCAATTC ATTTATCGTA TTATAATACT AACAAATATA TTAACTAGTA 7550 .......... .......... .......... .......... .......... .......... 180 TTAGTTCAAA GTTGTTTAGA CATTGAAAGC TTTGACTACT CTTTTCTTGT TAGAATTGTC 7490 .......... .......... .......... .......... .......... .......... 180 CTTTTTGTGT AATTGATTAA GTGATGGAAT TGCTTCTTCT TTCTTTTGAA TATTTTTACA 7430 .......... .......... .......... .......... .......... .......... 180 TGAGTAAGAT CTTTATATGA TATAATTAAG AAGTTTCTAA AGAAACCAAA ACATAATTCT 7370 .......... .......... .......... .......... .......... .......... 180 CTATTTATAT GAGTATATGT AAGTCGAAGT CGAACAAACA ATGGTTACCA ACCAAAAGTT 7310 .......... .......... .......... .......... .......... .......... 180 AAAAAGTATC GGCACATAAT GGTTTAATTT GATATGGTAA TGGTATAGTA CTTTTAAAAA 7250 .......... .......... .......... .......... .......... .......... 180 TCAAAATTAT TGAACCAAAG TTTTCAATAT TGTATCATAC CTTTCCATGC TCATCCCTAC 7190 .......... .......... .......... .......... .......... .......... 180 ATATCAGTTC TCAAGTCCAA TGCATTGAAT ACTTAACCAT GGTTAGGAAA CTTGAAACAC 7130 .......... .......... .......... .......... .......... .......... 180 TATGCACGAC ACTGCTTAGG TATGTCTATC AACTATAAAG CCTGCTGGCT TGATCTTCTT 7070 .......... .......... .......... .......... .......... .......... 180 ATTCAAAGAA ACATGCATGC TAAACATGAT ATGATTAAGT TGAACAGAAT AGTGTTGGTT 7010 .......... .......... .......... .......... .......... .......... 180 TCCCCAATCC ATAACAAGCC AACTGGGACA ACCTTACAGA AGGTGTGCCT ATTCATCATT 6950 .......... .......... .......... .......... .......... .......... 180 GTTGCCTTGT AAATGATGGA TTTATACAAC TGAAAATTAC TTGCTGAGAG TTCAGGGAAA 6890 .......... .......... .......... .......... .......... .......... 180 TCCTTGTTGG TTAAGTTGGA AATGTAATTG TAGGTGGATT CTTCATTGGA ATGCTCAAAG 6830 .......... .......... .......... .......... .......... .......... 180 GAGAAATTCA GTATATGATC TCTTGAATTC TCTCTTAAAT GTTATTATCT CATGCAGATT 6770 || .......... .......... .......... .......... .......... .......ATC 183 GCTGA-TATT TCTGTTGAGG ATTACATAAC TGCTA-CTGC TAACAAGCAT CCTACATATA 6712 ||| || |||||||||| ||||||| || || | |||| ||||||||| || || | AATGACCATG TCTGTTGAGG ATTACATCAC CGCAACCTGC TAACAAGCAC CCAGTTTACA 243 CACCACACAC AGCTGGGAGG TACCAAGCCA AGCGGTTTAG AAAGGCTCAA TGCCCAATTG 6652 |||||||| |||||| || ||||| |||| |||| || || |||||||| |||||||| | TGCCACACAC AGCTGGTAGA TACCAGGCCA AGCGTTTCAG GAAGGCTCAG TGCCCAATCG 303 TGGAGAGGTT GACCAACTCA CTGATGATGC ACGGAAGGAA CAACGGGAAG AAGTTGATGG 6592 | |||||| | || || || || ||||||| |||||||||| |||||| ||| ||| | |||| TTGAGAGGCT CACAAATTCT CTCATGATGC ACGGAAGGAA CAACGGAAAG AAGCTCATGG 363 CCGTTCGTAT TATTAAGCAT GCTATGGAAA TTATCCATCT GTTGACTGAC CTAAACCCAA 6532 | |||||||| |||||||||| || ||||| | | || ||| | |||||||||| | |||||||| CTGTTCGTAT TATTAAGCAT GCAATGGAGA TCATTCATTT GTTGACTGAC CAAAACCCAA 423 TCCAAGTGAT TGTTGATGCT GTTATCAACA GGTTTAGAGA TTATTCTGAT TTTTGCATAT 6472 | ||||| || |||||||||| |||||||||| | TTCAAGTCAT TGTTGATGCT GTTATCAACA G......... .......... .......... 454 TTATTAGCTC GAGTTTTTCT TGCTGAGGTC TTGTTAATTA GAAGATTTTC ATACCATGTC 6412 .......... .......... .......... .......... .......... .......... 454 TTCTTTGTTC CATTTCCATG TCGCGGCATA CTTGAGATAT TGTAGTCATT CTCATTTTTT 6352 .......... .......... .......... .......... .......... .......... 454 CCTTCCCATA TTCTTACCTA TGTGATGCAG TGGACCAAGA GAAGATGCAA CTCGTATAGG 6292 ||| ||||| |||||||||| | ||||| || .......... .......... .......... TGGGCCAAGG GAAGATGCAA CACGTATTGG 484 TTCTGCTGGT GTTGTGAGGC GACAAGCTGT TGATATTTCT CCACTCCGTC GTGTCAACCA 6232 |||||||||| ||||| || | | |||||||| |||||||||| |||||||||| |||| ||||| TTCTGCTGGT GTTGTCAGAC GTCAAGCTGT TGATATTTCT CCACTCCGTC GTGTTAACCA 544 AGCAATATAT CTCCTCACAA CTG 6209 |||||| ||| | || |||| ||| AGCAATTTAT TTGCTGACAA CTG 567 hqPGS_C06HBa0153O03.1-7-_SGN-E246370+ (7967 7914,6772 6501,6321 6209) ******************************************************************************** EST sequence 95 +strand 559 n (File: SGN-E205675+) 1 AGAAAGACAC TTCTCCCCGG AAGGTGAATT AGATCAGGCA ATAAAAGGGG AAGAAGAAAT 61 GGACGCAGGT GTAGTTGCTG CCCCCGCCCC GGCCGCCGCC GTCGATGCAA GCAAAGAGAA 121 TAAGGTTCAC ACTGATGTCA TGCTTTTCAA TCGCTGGAGC TATGATGGAG TTGAGATCAA 181 TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 241 ACACACAACT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 301 GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 361 TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 421 AGTCATTGTT GATGCTGTTA TCAACAGTGG GCCAAGGGAA GATGCAACAC GTATTGGTTC 481 TGCTGGTGTT GTCAGACGTC AAGCTGTTGA TATTTCTCCA CTCCGTCGTG TTAACCAAGC 541 AATTTATTTG CTGACAACT Predicted gene structure (within gDNA segment 8193 to 5348): Exon 1 7967 7914 ( 54 n); cDNA 122 175 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 176 447 ( 272 n); score: 0.838 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6210 ( 112 n); cDNA 448 559 ( 112 n); score: 0.893 MATCH C06HBa0153O03.1-7- SGN-E205675+ 0.847 438 0.784 C PGS_C06HBa0153O03.1-7-_SGN-E205675+ (7967 7914,6772 6501,6321 6210) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 175 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 175 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 175 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 175 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 175 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 175 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 175 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 175 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 175 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 175 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 175 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 175 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 175 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 175 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 175 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 175 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 175 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 175 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 175 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 180 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 240 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 ||||||| || || || |||| | |||||||| || || ||| ||||| |||| |||| || || ACACACAACT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 300 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 360 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 420 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 447 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 447 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 447 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 481 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT CAACCAAGCA 6228 |||||||||| | || || || |||||||||| |||||||||| |||||||||| ||||||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT TAACCAAGCA 541 ATATATCTCC TCACAACT 6210 || ||| | | | |||||| ATTTATTTGC TGACAACT 559 hqPGS_C06HBa0153O03.1-7-_SGN-E205675+ (7967 7914,6772 6501,6321 6210) ******************************************************************************** EST sequence 100 +strand 555 n (File: SGN-E324787+) 1 CAGAAAGACA CTTCTCCCCG GAAGGTGAAT TAGAGCAGGC AAGAGAAGTG GAAGAAGAAA 61 TGGACGCAGG TGTAGTTGCT GCCCCCGCCC CGGCCGCCGC CGTCGATGCA AGCAAAGAGA 121 ATAAGGTTCA CACTGATGTC ATGCTTTTCA ATCGCTGGAG CTATGATGGA GTTGAGATCA 181 ATGACATGTC TGTTGAGGAT TACATCACCG CAACTGCTAA CAAGCACCCA GTTTACATGC 241 CACACACAGC TGGTAGATAC CAGGCCAAGC GTTTCAGGAA GGCTCAGTGC CCAATCGTTG 301 AGAGGCTCAC AAATTCTCTC ATGATGCACG GAAGGAACAA CGGAAAGAAG CTCATGGCTG 361 TTCGTATTAT TAAGCATGCA ATGGAGATCA TTCATTTGTT GACTGACCAA AACCCAATTC 421 AAGTCATTGT TGATGCTGTT ATCAACAGTG GGCCAAGGGA AGATGCAACA CGTATTGGTT 481 CTGCTGGTGT TGTCAGACGT CAAGCTGTTG ATATTTCTCC ACTCCGTCGT GTTAACCAAG 541 CAATTTATTT GCTGA Predicted gene structure (within gDNA segment 8193 to 5398): Exon 1 7967 7914 ( 54 n); cDNA 123 176 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 177 448 ( 272 n); score: 0.842 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6217 ( 105 n); cDNA 449 553 ( 105 n); score: 0.895 MATCH C06HBa0153O03.1-7- SGN-E324787+ 0.849 431 0.777 C PGS_C06HBa0153O03.1-7-_SGN-E324787+ (7967 7914,6772 6501,6321 6217) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 176 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 176 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 176 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 176 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 176 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 176 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 176 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 176 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 176 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 176 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 176 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 176 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 176 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 176 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 176 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 176 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 176 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 176 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 176 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 181 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 241 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 |||||||||| || || |||| | |||||||| || || ||| ||||| |||| |||| || || ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 301 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 361 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 421 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 448 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 448 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 448 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 482 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT CAACCAAGCA 6228 |||||||||| | || || || |||||||||| |||||||||| |||||||||| ||||||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGTCGTGT TAACCAAGCA 542 ATATATCTCC T 6217 || ||| | | | ATTTATTTGC T 553 hqPGS_C06HBa0153O03.1-7-_SGN-E324787+ (7967 7914,6772 6501,6321 6217) ******************************************************************************** EST sequence 44 +strand 484 n (File: SGN-E550930+) 1 GAAAAATGGA AGAAGCTTCA GTAGTAGCAG TGGACAACCA AAAGCCGCAG CAAGAGAAGC 61 CTCACACTGA TGTTTTGCTT TTCAATCGTT GGTCATATGA TGATGTTCAG ATTGCTGATA 121 TTTCTGTTGA GGATTACATA ACTGCTACTG CTAACAAGCA TCCTACATAT ACACCACACA 181 CAGCTGGGAG GTACCAAGCC AAGCGGTTTA GAAAGGCTCA ATGCCCAATT GTGGAGAGGT 241 TGACCAACTC ACTGATGATG CACGGAAGGA ACAACGGGAA GAAGTTGATG GCCGTTCGTA 301 TTATTAAGCA TGCTATGGAA ATTATCCATC TGTTGACTGA CCTAAACCCA ATCCAAGTGA 361 TTGTTGATGC TGTTATCAAC AGTGGACCAA GAGAAGATGC AACTCGTATA GGTTCTGCTG 421 GTGTTGTGAG GCGACNAGCT GTTGATATTT CTCCACTCCG TCGTGGGTCA CCAGCATATA 481 TCTC Predicted gene structure (within gDNA segment 8193 to 5439): Exon 1 8023 7914 ( 110 n); cDNA 1 110 ( 110 n); score: 1.000 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 1.00), Pa: 1.000 (s: 1.00) Exon 2 6772 6501 ( 272 n); cDNA 111 382 ( 272 n); score: 1.000 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 1.00), Pa: 0.999 (s: 1.00) Exon 3 6321 6219 ( 103 n); cDNA 383 484 ( 102 n); score: 0.922 MATCH C06HBa0153O03.1-7- SGN-E550930+ 0.984 485 1.002 C PGS_C06HBa0153O03.1-7-_SGN-E550930+ (8023 7914,6772 6501,6321 6219) Alignment (genomic DNA sequence = upper lines): GAAAAATGGA AGAAGCTTCA GTAGTAGCAG TGGACAACCA AAAGCCGCAG CAAGAGAAGC 7964 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAAAAATGGA AGAAGCTTCA GTAGTAGCAG TGGACAACCA AAAGCCGCAG CAAGAGAAGC 60 CTCACACTGA TGTTTTGCTT TTCAATCGTT GGTCATATGA TGATGTTCAG GTTTGTTTGT 7904 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTCACACTGA TGTTTTGCTT TTCAATCGTT GGTCATATGA TGATGTTCAG .......... 110 TTCCCTTTCA ATTTTATTCC TCTCCAGTTC CTATATCTTT TCATTATTTG CCTAACATTA 7844 .......... .......... .......... .......... .......... .......... 110 ATGTCGAATT GGATGAAACT TGGCATTTTC GAAATCATAA GATGAACATT TGAATTATTT 7784 .......... .......... .......... .......... .......... .......... 110 TGTTTCTTGC GTTAGCTAAA CTCTAATTGT AGTGTAGCAG AGGTGATATA TCAGTAAGGG 7724 .......... .......... .......... .......... .......... .......... 110 TGGGCATGGT AGGGTAGATA CCGAAACCAA AATTTTTCAC TCAATGGTTT CAATATCATG 7664 .......... .......... .......... .......... .......... .......... 110 ACATTTGATA TTATTTATAA TGTATATCGA ATCACCAAAT ACTTTAACAG AGTGTATAGT 7604 .......... .......... .......... .......... .......... .......... 110 TAGGTATCCA ATTCATTTAT CGTATTATAA TACTAACAAA TATATTAACT AGTATTAGTT 7544 .......... .......... .......... .......... .......... .......... 110 CAAAGTTGTT TAGACATTGA AAGCTTTGAC TACTCTTTTC TTGTTAGAAT TGTCCTTTTT 7484 .......... .......... .......... .......... .......... .......... 110 GTGTAATTGA TTAAGTGATG GAATTGCTTC TTCTTTCTTT TGAATATTTT TACATGAGTA 7424 .......... .......... .......... .......... .......... .......... 110 AGATCTTTAT ATGATATAAT TAAGAAGTTT CTAAAGAAAC CAAAACATAA TTCTCTATTT 7364 .......... .......... .......... .......... .......... .......... 110 ATATGAGTAT ATGTAAGTCG AAGTCGAACA AACAATGGTT ACCAACCAAA AGTTAAAAAG 7304 .......... .......... .......... .......... .......... .......... 110 TATCGGCACA TAATGGTTTA ATTTGATATG GTAATGGTAT AGTACTTTTA AAAATCAAAA 7244 .......... .......... .......... .......... .......... .......... 110 TTATTGAACC AAAGTTTTCA ATATTGTATC ATACCTTTCC ATGCTCATCC CTACATATCA 7184 .......... .......... .......... .......... .......... .......... 110 GTTCTCAAGT CCAATGCATT GAATACTTAA CCATGGTTAG GAAACTTGAA ACACTATGCA 7124 .......... .......... .......... .......... .......... .......... 110 CGACACTGCT TAGGTATGTC TATCAACTAT AAAGCCTGCT GGCTTGATCT TCTTATTCAA 7064 .......... .......... .......... .......... .......... .......... 110 AGAAACATGC ATGCTAAACA TGATATGATT AAGTTGAACA GAATAGTGTT GGTTTCCCCA 7004 .......... .......... .......... .......... .......... .......... 110 ATCCATAACA AGCCAACTGG GACAACCTTA CAGAAGGTGT GCCTATTCAT CATTGTTGCC 6944 .......... .......... .......... .......... .......... .......... 110 TTGTAAATGA TGGATTTATA CAACTGAAAA TTACTTGCTG AGAGTTCAGG GAAATCCTTG 6884 .......... .......... .......... .......... .......... .......... 110 TTGGTTAAGT TGGAAATGTA ATTGTAGGTG GATTCTTCAT TGGAATGCTC AAAGGAGAAA 6824 .......... .......... .......... .......... .......... .......... 110 TTCAGTATAT GATCTCTTGA ATTCTCTCTT AAATGTTATT ATCTCATGCA GATTGCTGAT 6764 ||||||||| .......... .......... .......... .......... .......... .ATTGCTGAT 119 ATTTCTGTTG AGGATTACAT AACTGCTACT GCTAACAAGC ATCCTACATA TACACCACAC 6704 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTTCTGTTG AGGATTACAT AACTGCTACT GCTAACAAGC ATCCTACATA TACACCACAC 179 ACAGCTGGGA GGTACCAAGC CAAGCGGTTT AGAAAGGCTC AATGCCCAAT TGTGGAGAGG 6644 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACAGCTGGGA GGTACCAAGC CAAGCGGTTT AGAAAGGCTC AATGCCCAAT TGTGGAGAGG 239 TTGACCAACT CACTGATGAT GCACGGAAGG AACAACGGGA AGAAGTTGAT GGCCGTTCGT 6584 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGACCAACT CACTGATGAT GCACGGAAGG AACAACGGGA AGAAGTTGAT GGCCGTTCGT 299 ATTATTAAGC ATGCTATGGA AATTATCCAT CTGTTGACTG ACCTAAACCC AATCCAAGTG 6524 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTATTAAGC ATGCTATGGA AATTATCCAT CTGTTGACTG ACCTAAACCC AATCCAAGTG 359 ATTGTTGATG CTGTTATCAA CAGGTTTAGA GATTATTCTG ATTTTTGCAT ATTTATTAGC 6464 |||||||||| |||||||||| ||| ATTGTTGATG CTGTTATCAA CAG....... .......... .......... .......... 382 TCGAGTTTTT CTTGCTGAGG TCTTGTTAAT TAGAAGATTT TCATACCATG TCTTCTTTGT 6404 .......... .......... .......... .......... .......... .......... 382 TCCATTTCCA TGTCGCGGCA TACTTGAGAT ATTGTAGTCA TTCTCATTTT TTCCTTCCCA 6344 .......... .......... .......... .......... .......... .......... 382 TATTCTTACC TATGTGATGC AGTGGACCAA GAGAAGATGC AACTCGTATA GGTTCTGCTG 6284 |||||||| |||||||||| |||||||||| |||||||||| .......... .......... ..TGGACCAA GAGAAGATGC AACTCGTATA GGTTCTGCTG 420 GTGTTGTGAG GCGACAAGCT GTTGATATTT CTCCACTCCG TCGTGTCAAC CAAGCAATAT 6224 |||||||||| ||||| |||| |||||||||| |||||||||| ||||| | ||| |||| GTGTTGTGAG GCGACNAGCT GTTGATATTT CTCCACTCCG TCGTGGGTCA CCAGC-ATAT 479 ATCTC 6219 ||||| ATCTC 484 hqPGS_C06HBa0153O03.1-7-_SGN-E550930+ (8023 7914,6772 6501,6321 6219) ******************************************************************************** EST sequence 64 +strand 480 n (File: SGN-E269754+) 1 AAAAATGGAA GATGCTTCAG TAGTTGCAGT GGACAACCAA AAGCCGCAGC AAGAGAAGCC 61 TCACACTGAT GTTTTGCTTT TCAATCGTTG GTCATATGAT GATGTTCAGA TTGCTGATAT 121 TTCTGTTGAG GATTACATAA CTGCTACTGC TAACAAGCAT CCTACATATA CACCACACAC 181 AGCTGGGAGG TACCAAGCCA AGCGGTTTAG AAAGGCTCAA TGCCCAATTG TGGAGAGGTT 241 GACCAACTCA CTGATGATGC ACGGAAGGAA CAACGGGAAG AAGTTGATGG CCGTTCGTAT 301 TATTAAGCAT GCTATGGAAA TTATCCATCT GTTGACTGAC CTAAACCCAA TCCAAGTGAT 361 TGTTGATGCT GTTATCAACA GTGGACCAAG AGAAGATGCA ACTCGTATAG GTTCTGCTGG 421 TGTTGTGAGG CGACAAGCTG TTGATATTTC TCCACTCCGT CGTGTCAACC AAGCAATATA Predicted gene structure (within gDNA segment 8193 to 5613): Exon 1 8022 7914 ( 109 n); cDNA 1 109 ( 109 n); score: 0.982 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 1.00), Pa: 1.000 (s: 1.00) Exon 2 6772 6501 ( 272 n); cDNA 110 381 ( 272 n); score: 1.000 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 1.00), Pa: 0.999 (s: 1.00) Exon 3 6321 6223 ( 99 n); cDNA 382 480 ( 99 n); score: 1.000 MATCH C06HBa0153O03.1-7- SGN-E269754+ 0.996 480 1.000 C PGS_C06HBa0153O03.1-7-_SGN-E269754+ (8022 7914,6772 6501,6321 6223) Alignment (genomic DNA sequence = upper lines): AAAAATGGAA GAAGCTTCAG TAGTAGCAGT GGACAACCAA AAGCCGCAGC AAGAGAAGCC 7963 |||||||||| || ||||||| |||| ||||| |||||||||| |||||||||| |||||||||| AAAAATGGAA GATGCTTCAG TAGTTGCAGT GGACAACCAA AAGCCGCAGC AAGAGAAGCC 60 TCACACTGAT GTTTTGCTTT TCAATCGTTG GTCATATGAT GATGTTCAGG TTTGTTTGTT 7903 |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| TCACACTGAT GTTTTGCTTT TCAATCGTTG GTCATATGAT GATGTTCAG. .......... 109 TCCCTTTCAA TTTTATTCCT CTCCAGTTCC TATATCTTTT CATTATTTGC CTAACATTAA 7843 .......... .......... .......... .......... .......... .......... 109 TGTCGAATTG GATGAAACTT GGCATTTTCG AAATCATAAG ATGAACATTT GAATTATTTT 7783 .......... .......... .......... .......... .......... .......... 109 GTTTCTTGCG TTAGCTAAAC TCTAATTGTA GTGTAGCAGA GGTGATATAT CAGTAAGGGT 7723 .......... .......... .......... .......... .......... .......... 109 GGGCATGGTA GGGTAGATAC CGAAACCAAA ATTTTTCACT CAATGGTTTC AATATCATGA 7663 .......... .......... .......... .......... .......... .......... 109 CATTTGATAT TATTTATAAT GTATATCGAA TCACCAAATA CTTTAACAGA GTGTATAGTT 7603 .......... .......... .......... .......... .......... .......... 109 AGGTATCCAA TTCATTTATC GTATTATAAT ACTAACAAAT ATATTAACTA GTATTAGTTC 7543 .......... .......... .......... .......... .......... .......... 109 AAAGTTGTTT AGACATTGAA AGCTTTGACT ACTCTTTTCT TGTTAGAATT GTCCTTTTTG 7483 .......... .......... .......... .......... .......... .......... 109 TGTAATTGAT TAAGTGATGG AATTGCTTCT TCTTTCTTTT GAATATTTTT ACATGAGTAA 7423 .......... .......... .......... .......... .......... .......... 109 GATCTTTATA TGATATAATT AAGAAGTTTC TAAAGAAACC AAAACATAAT TCTCTATTTA 7363 .......... .......... .......... .......... .......... .......... 109 TATGAGTATA TGTAAGTCGA AGTCGAACAA ACAATGGTTA CCAACCAAAA GTTAAAAAGT 7303 .......... .......... .......... .......... .......... .......... 109 ATCGGCACAT AATGGTTTAA TTTGATATGG TAATGGTATA GTACTTTTAA AAATCAAAAT 7243 .......... .......... .......... .......... .......... .......... 109 TATTGAACCA AAGTTTTCAA TATTGTATCA TACCTTTCCA TGCTCATCCC TACATATCAG 7183 .......... .......... .......... .......... .......... .......... 109 TTCTCAAGTC CAATGCATTG AATACTTAAC CATGGTTAGG AAACTTGAAA CACTATGCAC 7123 .......... .......... .......... .......... .......... .......... 109 GACACTGCTT AGGTATGTCT ATCAACTATA AAGCCTGCTG GCTTGATCTT CTTATTCAAA 7063 .......... .......... .......... .......... .......... .......... 109 GAAACATGCA TGCTAAACAT GATATGATTA AGTTGAACAG AATAGTGTTG GTTTCCCCAA 7003 .......... .......... .......... .......... .......... .......... 109 TCCATAACAA GCCAACTGGG ACAACCTTAC AGAAGGTGTG CCTATTCATC ATTGTTGCCT 6943 .......... .......... .......... .......... .......... .......... 109 TGTAAATGAT GGATTTATAC AACTGAAAAT TACTTGCTGA GAGTTCAGGG AAATCCTTGT 6883 .......... .......... .......... .......... .......... .......... 109 TGGTTAAGTT GGAAATGTAA TTGTAGGTGG ATTCTTCATT GGAATGCTCA AAGGAGAAAT 6823 .......... .......... .......... .......... .......... .......... 109 TCAGTATATG ATCTCTTGAA TTCTCTCTTA AATGTTATTA TCTCATGCAG ATTGCTGATA 6763 |||||||||| .......... .......... .......... .......... .......... ATTGCTGATA 119 TTTCTGTTGA GGATTACATA ACTGCTACTG CTAACAAGCA TCCTACATAT ACACCACACA 6703 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTCTGTTGA GGATTACATA ACTGCTACTG CTAACAAGCA TCCTACATAT ACACCACACA 179 CAGCTGGGAG GTACCAAGCC AAGCGGTTTA GAAAGGCTCA ATGCCCAATT GTGGAGAGGT 6643 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAGCTGGGAG GTACCAAGCC AAGCGGTTTA GAAAGGCTCA ATGCCCAATT GTGGAGAGGT 239 TGACCAACTC ACTGATGATG CACGGAAGGA ACAACGGGAA GAAGTTGATG GCCGTTCGTA 6583 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGACCAACTC ACTGATGATG CACGGAAGGA ACAACGGGAA GAAGTTGATG GCCGTTCGTA 299 TTATTAAGCA TGCTATGGAA ATTATCCATC TGTTGACTGA CCTAAACCCA ATCCAAGTGA 6523 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTATTAAGCA TGCTATGGAA ATTATCCATC TGTTGACTGA CCTAAACCCA ATCCAAGTGA 359 TTGTTGATGC TGTTATCAAC AGGTTTAGAG ATTATTCTGA TTTTTGCATA TTTATTAGCT 6463 |||||||||| |||||||||| || TTGTTGATGC TGTTATCAAC AG........ .......... .......... .......... 381 CGAGTTTTTC TTGCTGAGGT CTTGTTAATT AGAAGATTTT CATACCATGT CTTCTTTGTT 6403 .......... .......... .......... .......... .......... .......... 381 CCATTTCCAT GTCGCGGCAT ACTTGAGATA TTGTAGTCAT TCTCATTTTT TCCTTCCCAT 6343 .......... .......... .......... .......... .......... .......... 381 ATTCTTACCT ATGTGATGCA GTGGACCAAG AGAAGATGCA ACTCGTATAG GTTCTGCTGG 6283 ||||||||| |||||||||| |||||||||| |||||||||| .......... .......... .TGGACCAAG AGAAGATGCA ACTCGTATAG GTTCTGCTGG 420 TGTTGTGAGG CGACAAGCTG TTGATATTTC TCCACTCCGT CGTGTCAACC AAGCAATATA 6223 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGTTGTGAGG CGACAAGCTG TTGATATTTC TCCACTCCGT CGTGTCAACC AAGCAATATA 480 hqPGS_C06HBa0153O03.1-7-_SGN-E269754+ (8022 7914,6772 6501,6321 6223) ******************************************************************************** EST sequence 40 +strand 495 n (File: SGN-E248667+) 1 GAGCTTAGAC CTATCAGAAA AACAGGAAAA ATGGAAGAAG CTTCAGTAGT AGCAGTGGAC 61 AACCAAAAGC CGCAGCAAGA GAAGCCTCAC ACCTGATGTT TTGCTTTTCA ATCGTTGGTC 121 ATATGATGAT GTTCAGATTG CTGATATTTC TGTTGAGGAT TACATAACTG CTACTGCTAA 181 CAAGCATCCT ACATATACAC CACACACAGC TGGGAGGTAC CAAGCCAAGC GGTTTAGAAA 241 GGCTCAATGC CCAATTGTGG AGAGGTTGAC CAACTCACTG ATGATGCACG GAAGGAACAA 301 CGGGAAGAAG TTGATGGCCG TTCGTATTAT TAAGCATGCT ATGGAAATTA TCCATCTGTT 361 GACTGACCTA AACCCAATCC AAGTGATTGT TGATGCTGTT ATCAACAGTG GACCAAGAGA 421 AGATGCAACT CGTATAGGTT CTGCTGGTGT TGTGAGGCGA CAAGCTGTTG ATATTTCTCC 481 ACTCCGTCGT GTCAA Predicted gene structure (within gDNA segment 8193 to 5625): Exon 1 8045 7914 ( 132 n); cDNA 4 136 ( 133 n); score: 0.981 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.95), Pa: 1.000 (s: 1.00) Exon 2 6772 6501 ( 272 n); cDNA 137 408 ( 272 n); score: 1.000 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 1.00), Pa: 0.999 (s: 1.00) Exon 3 6321 6235 ( 87 n); cDNA 409 495 ( 87 n); score: 1.000 MATCH C06HBa0153O03.1-7- SGN-E248667+ 0.995 491 0.992 C PGS_C06HBa0153O03.1-7-_SGN-E248667+ (8045 7914,6772 6501,6321 6235) Alignment (genomic DNA sequence = upper lines): CTTAGACCTA TCAGAAAAAC AGGAAAAATG GAAGAAGCTT CAGTAGTAGC AGTGGACAAC 7986 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTAGACCTA TCAGAAAAAC AGGAAAAATG GAAGAAGCTT CAGTAGTAGC AGTGGACAAC 63 CAAAAGCCGC AGCAAGAGAA GCCTCACA-C TGATGTTTTG CTTTTCAATC GTTGGTCATA 7927 |||||||||| |||||||||| |||||||| | |||||||||| |||||||||| |||||||||| CAAAAGCCGC AGCAAGAGAA GCCTCACACC TGATGTTTTG CTTTTCAATC GTTGGTCATA 123 TGATGATGTT CAGGTTTGTT TGTTTCCCTT TCAATTTTAT TCCTCTCCAG TTCCTATATC 7867 |||||||||| ||| TGATGATGTT CAG....... .......... .......... .......... .......... 136 TTTTCATTAT TTGCCTAACA TTAATGTCGA ATTGGATGAA ACTTGGCATT TTCGAAATCA 7807 .......... .......... .......... .......... .......... .......... 136 TAAGATGAAC ATTTGAATTA TTTTGTTTCT TGCGTTAGCT AAACTCTAAT TGTAGTGTAG 7747 .......... .......... .......... .......... .......... .......... 136 CAGAGGTGAT ATATCAGTAA GGGTGGGCAT GGTAGGGTAG ATACCGAAAC CAAAATTTTT 7687 .......... .......... .......... .......... .......... .......... 136 CACTCAATGG TTTCAATATC ATGACATTTG ATATTATTTA TAATGTATAT CGAATCACCA 7627 .......... .......... .......... .......... .......... .......... 136 AATACTTTAA CAGAGTGTAT AGTTAGGTAT CCAATTCATT TATCGTATTA TAATACTAAC 7567 .......... .......... .......... .......... .......... .......... 136 AAATATATTA ACTAGTATTA GTTCAAAGTT GTTTAGACAT TGAAAGCTTT GACTACTCTT 7507 .......... .......... .......... .......... .......... .......... 136 TTCTTGTTAG AATTGTCCTT TTTGTGTAAT TGATTAAGTG ATGGAATTGC TTCTTCTTTC 7447 .......... .......... .......... .......... .......... .......... 136 TTTTGAATAT TTTTACATGA GTAAGATCTT TATATGATAT AATTAAGAAG TTTCTAAAGA 7387 .......... .......... .......... .......... .......... .......... 136 AACCAAAACA TAATTCTCTA TTTATATGAG TATATGTAAG TCGAAGTCGA ACAAACAATG 7327 .......... .......... .......... .......... .......... .......... 136 GTTACCAACC AAAAGTTAAA AAGTATCGGC ACATAATGGT TTAATTTGAT ATGGTAATGG 7267 .......... .......... .......... .......... .......... .......... 136 TATAGTACTT TTAAAAATCA AAATTATTGA ACCAAAGTTT TCAATATTGT ATCATACCTT 7207 .......... .......... .......... .......... .......... .......... 136 TCCATGCTCA TCCCTACATA TCAGTTCTCA AGTCCAATGC ATTGAATACT TAACCATGGT 7147 .......... .......... .......... .......... .......... .......... 136 TAGGAAACTT GAAACACTAT GCACGACACT GCTTAGGTAT GTCTATCAAC TATAAAGCCT 7087 .......... .......... .......... .......... .......... .......... 136 GCTGGCTTGA TCTTCTTATT CAAAGAAACA TGCATGCTAA ACATGATATG ATTAAGTTGA 7027 .......... .......... .......... .......... .......... .......... 136 ACAGAATAGT GTTGGTTTCC CCAATCCATA ACAAGCCAAC TGGGACAACC TTACAGAAGG 6967 .......... .......... .......... .......... .......... .......... 136 TGTGCCTATT CATCATTGTT GCCTTGTAAA TGATGGATTT ATACAACTGA AAATTACTTG 6907 .......... .......... .......... .......... .......... .......... 136 CTGAGAGTTC AGGGAAATCC TTGTTGGTTA AGTTGGAAAT GTAATTGTAG GTGGATTCTT 6847 .......... .......... .......... .......... .......... .......... 136 CATTGGAATG CTCAAAGGAG AAATTCAGTA TATGATCTCT TGAATTCTCT CTTAAATGTT 6787 .......... .......... .......... .......... .......... .......... 136 ATTATCTCAT GCAGATTGCT GATATTTCTG TTGAGGATTA CATAACTGCT ACTGCTAACA 6727 |||||| |||||||||| |||||||||| |||||||||| |||||||||| .......... ....ATTGCT GATATTTCTG TTGAGGATTA CATAACTGCT ACTGCTAACA 182 AGCATCCTAC ATATACACCA CACACAGCTG GGAGGTACCA AGCCAAGCGG TTTAGAAAGG 6667 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGCATCCTAC ATATACACCA CACACAGCTG GGAGGTACCA AGCCAAGCGG TTTAGAAAGG 242 CTCAATGCCC AATTGTGGAG AGGTTGACCA ACTCACTGAT GATGCACGGA AGGAACAACG 6607 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTCAATGCCC AATTGTGGAG AGGTTGACCA ACTCACTGAT GATGCACGGA AGGAACAACG 302 GGAAGAAGTT GATGGCCGTT CGTATTATTA AGCATGCTAT GGAAATTATC CATCTGTTGA 6547 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGAAGAAGTT GATGGCCGTT CGTATTATTA AGCATGCTAT GGAAATTATC CATCTGTTGA 362 CTGACCTAAA CCCAATCCAA GTGATTGTTG ATGCTGTTAT CAACAGGTTT AGAGATTATT 6487 |||||||||| |||||||||| |||||||||| |||||||||| |||||| CTGACCTAAA CCCAATCCAA GTGATTGTTG ATGCTGTTAT CAACAG.... .......... 408 CTGATTTTTG CATATTTATT AGCTCGAGTT TTTCTTGCTG AGGTCTTGTT AATTAGAAGA 6427 .......... .......... .......... .......... .......... .......... 408 TTTTCATACC ATGTCTTCTT TGTTCCATTT CCATGTCGCG GCATACTTGA GATATTGTAG 6367 .......... .......... .......... .......... .......... .......... 408 TCATTCTCAT TTTTTCCTTC CCATATTCTT ACCTATGTGA TGCAGTGGAC CAAGAGAAGA 6307 ||||| |||||||||| .......... .......... .......... .......... .....TGGAC CAAGAGAAGA 423 TGCAACTCGT ATAGGTTCTG CTGGTGTTGT GAGGCGACAA GCTGTTGATA TTTCTCCACT 6247 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGCAACTCGT ATAGGTTCTG CTGGTGTTGT GAGGCGACAA GCTGTTGATA TTTCTCCACT 483 CCGTCGTGTC AA 6235 |||||||||| || CCGTCGTGTC AA 495 hqPGS_C06HBa0153O03.1-7-_SGN-E248667+ (8045 7914,6772 6501,6321 6235) ******************************************************************************** EST sequence 96 +strand 467 n (File: SGN-E286676+) 1 GAAAAATGGA AGAAGCTTCA GTAGTAGCAG TGGACAACCA AAAGCCGCAG CAAGAGAAGC 61 CTCACACTGA TGTTTTGCTT TTCAATCGTT GGTCATATGA TGATGTTCAG ATTGCTGATA 121 TTTCTGTTGA GGATTACATA ACTGCTACTG CTAACAAGCA TCCTACATAT ACACCACACA 181 CAGCTGGGAG GTACCAAGCC AAGCGGTTTA GAAAGGCTCA ATGCCCAATT GTGGAGAGGT 241 TGACCAACTC ACTGATGATG CACGGAAGGA ACAACGGGAA GAAGTTGATG GCCGTTCGTA 301 TTATTAAGCA TGCTATGGAA ATTATCCATC TGTTGACTGA CCTAAACCCA ATCCAAGTGA 361 TTGTTGATGC TGTTATCAAC AGTGGACCAA GAGAAGATGC AACTCGTATA GGTTCTGCTG 421 GTGTTGTGAG GCGACAAGCT GTTGATATTT CTCCACTCCG TCGTGTC Predicted gene structure (within gDNA segment 8193 to 5627): Exon 1 8023 7914 ( 110 n); cDNA 1 110 ( 110 n); score: 1.000 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 1.00), Pa: 1.000 (s: 1.00) Exon 2 6772 6501 ( 272 n); cDNA 111 382 ( 272 n); score: 1.000 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 1.00), Pa: 0.999 (s: 1.00) Exon 3 6321 6237 ( 85 n); cDNA 383 467 ( 85 n); score: 1.000 MATCH C06HBa0153O03.1-7- SGN-E286676+ 1.000 467 1.000 C PGS_C06HBa0153O03.1-7-_SGN-E286676+ (8023 7914,6772 6501,6321 6237) Alignment (genomic DNA sequence = upper lines): GAAAAATGGA AGAAGCTTCA GTAGTAGCAG TGGACAACCA AAAGCCGCAG CAAGAGAAGC 7964 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAAAAATGGA AGAAGCTTCA GTAGTAGCAG TGGACAACCA AAAGCCGCAG CAAGAGAAGC 60 CTCACACTGA TGTTTTGCTT TTCAATCGTT GGTCATATGA TGATGTTCAG GTTTGTTTGT 7904 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTCACACTGA TGTTTTGCTT TTCAATCGTT GGTCATATGA TGATGTTCAG .......... 110 TTCCCTTTCA ATTTTATTCC TCTCCAGTTC CTATATCTTT TCATTATTTG CCTAACATTA 7844 .......... .......... .......... .......... .......... .......... 110 ATGTCGAATT GGATGAAACT TGGCATTTTC GAAATCATAA GATGAACATT TGAATTATTT 7784 .......... .......... .......... .......... .......... .......... 110 TGTTTCTTGC GTTAGCTAAA CTCTAATTGT AGTGTAGCAG AGGTGATATA TCAGTAAGGG 7724 .......... .......... .......... .......... .......... .......... 110 TGGGCATGGT AGGGTAGATA CCGAAACCAA AATTTTTCAC TCAATGGTTT CAATATCATG 7664 .......... .......... .......... .......... .......... .......... 110 ACATTTGATA TTATTTATAA TGTATATCGA ATCACCAAAT ACTTTAACAG AGTGTATAGT 7604 .......... .......... .......... .......... .......... .......... 110 TAGGTATCCA ATTCATTTAT CGTATTATAA TACTAACAAA TATATTAACT AGTATTAGTT 7544 .......... .......... .......... .......... .......... .......... 110 CAAAGTTGTT TAGACATTGA AAGCTTTGAC TACTCTTTTC TTGTTAGAAT TGTCCTTTTT 7484 .......... .......... .......... .......... .......... .......... 110 GTGTAATTGA TTAAGTGATG GAATTGCTTC TTCTTTCTTT TGAATATTTT TACATGAGTA 7424 .......... .......... .......... .......... .......... .......... 110 AGATCTTTAT ATGATATAAT TAAGAAGTTT CTAAAGAAAC CAAAACATAA TTCTCTATTT 7364 .......... .......... .......... .......... .......... .......... 110 ATATGAGTAT ATGTAAGTCG AAGTCGAACA AACAATGGTT ACCAACCAAA AGTTAAAAAG 7304 .......... .......... .......... .......... .......... .......... 110 TATCGGCACA TAATGGTTTA ATTTGATATG GTAATGGTAT AGTACTTTTA AAAATCAAAA 7244 .......... .......... .......... .......... .......... .......... 110 TTATTGAACC AAAGTTTTCA ATATTGTATC ATACCTTTCC ATGCTCATCC CTACATATCA 7184 .......... .......... .......... .......... .......... .......... 110 GTTCTCAAGT CCAATGCATT GAATACTTAA CCATGGTTAG GAAACTTGAA ACACTATGCA 7124 .......... .......... .......... .......... .......... .......... 110 CGACACTGCT TAGGTATGTC TATCAACTAT AAAGCCTGCT GGCTTGATCT TCTTATTCAA 7064 .......... .......... .......... .......... .......... .......... 110 AGAAACATGC ATGCTAAACA TGATATGATT AAGTTGAACA GAATAGTGTT GGTTTCCCCA 7004 .......... .......... .......... .......... .......... .......... 110 ATCCATAACA AGCCAACTGG GACAACCTTA CAGAAGGTGT GCCTATTCAT CATTGTTGCC 6944 .......... .......... .......... .......... .......... .......... 110 TTGTAAATGA TGGATTTATA CAACTGAAAA TTACTTGCTG AGAGTTCAGG GAAATCCTTG 6884 .......... .......... .......... .......... .......... .......... 110 TTGGTTAAGT TGGAAATGTA ATTGTAGGTG GATTCTTCAT TGGAATGCTC AAAGGAGAAA 6824 .......... .......... .......... .......... .......... .......... 110 TTCAGTATAT GATCTCTTGA ATTCTCTCTT AAATGTTATT ATCTCATGCA GATTGCTGAT 6764 ||||||||| .......... .......... .......... .......... .......... .ATTGCTGAT 119 ATTTCTGTTG AGGATTACAT AACTGCTACT GCTAACAAGC ATCCTACATA TACACCACAC 6704 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTTCTGTTG AGGATTACAT AACTGCTACT GCTAACAAGC ATCCTACATA TACACCACAC 179 ACAGCTGGGA GGTACCAAGC CAAGCGGTTT AGAAAGGCTC AATGCCCAAT TGTGGAGAGG 6644 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACAGCTGGGA GGTACCAAGC CAAGCGGTTT AGAAAGGCTC AATGCCCAAT TGTGGAGAGG 239 TTGACCAACT CACTGATGAT GCACGGAAGG AACAACGGGA AGAAGTTGAT GGCCGTTCGT 6584 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGACCAACT CACTGATGAT GCACGGAAGG AACAACGGGA AGAAGTTGAT GGCCGTTCGT 299 ATTATTAAGC ATGCTATGGA AATTATCCAT CTGTTGACTG ACCTAAACCC AATCCAAGTG 6524 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTATTAAGC ATGCTATGGA AATTATCCAT CTGTTGACTG ACCTAAACCC AATCCAAGTG 359 ATTGTTGATG CTGTTATCAA CAGGTTTAGA GATTATTCTG ATTTTTGCAT ATTTATTAGC 6464 |||||||||| |||||||||| ||| ATTGTTGATG CTGTTATCAA CAG....... .......... .......... .......... 382 TCGAGTTTTT CTTGCTGAGG TCTTGTTAAT TAGAAGATTT TCATACCATG TCTTCTTTGT 6404 .......... .......... .......... .......... .......... .......... 382 TCCATTTCCA TGTCGCGGCA TACTTGAGAT ATTGTAGTCA TTCTCATTTT TTCCTTCCCA 6344 .......... .......... .......... .......... .......... .......... 382 TATTCTTACC TATGTGATGC AGTGGACCAA GAGAAGATGC AACTCGTATA GGTTCTGCTG 6284 |||||||| |||||||||| |||||||||| |||||||||| .......... .......... ..TGGACCAA GAGAAGATGC AACTCGTATA GGTTCTGCTG 420 GTGTTGTGAG GCGACAAGCT GTTGATATTT CTCCACTCCG TCGTGTC 6237 |||||||||| |||||||||| |||||||||| |||||||||| ||||||| GTGTTGTGAG GCGACAAGCT GTTGATATTT CTCCACTCCG TCGTGTC 467 hqPGS_C06HBa0153O03.1-7-_SGN-E286676+ (8023 7914,6772 6501,6321 6237) ******************************************************************************** EST sequence 78 +strand 474 n (File: SGN-E277497+) 1 AAAAACAGGA AAAATGGAAG AAGCTTCAGT AGTAGCAGTG GACAACCAAA AGCCGCAGCA 61 AGAGAAGCCT CACACTGATG TTTTGCTTTT CAATCGTTGG TCATATGATG ATGTTCAGAT 121 TGCTGATATT TCTGTTGAGG ATTACATAAC TGCTACTGCT AACAAGCATC CTACATATAC 181 ACCACACACA GCTGGGAGGT ACCAAGCCAA GCGGTTTAGA AAGGCTCAAT GCCCAATTGT 241 GGAGAGGTTG ACCAACTCAC TGATGATGCA CGGAAGGAAC AACGGGAAGA AGTTGATGGC 301 CGTTCGTATT ATTAAGCATG CTATGGAAAT TATCCATCTG TTGACTGACC TAAACCCAAT 361 CCAAGTGATT GTTGATGCTG TTATCAACAG TGGACCAAGA GAAGATGCAA CTCGTATAGG 421 TTCTGCTGGT GTTGTGAGGC GACAAGCTGT TGATATTTCT CCACTCCGTC GTGT Predicted gene structure (within gDNA segment 8193 to 5628): Exon 1 8031 7914 ( 118 n); cDNA 1 118 ( 118 n); score: 1.000 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 1.00), Pa: 1.000 (s: 1.00) Exon 2 6772 6501 ( 272 n); cDNA 119 390 ( 272 n); score: 1.000 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 1.00), Pa: 0.999 (s: 1.00) Exon 3 6321 6238 ( 84 n); cDNA 391 474 ( 84 n); score: 1.000 MATCH C06HBa0153O03.1-7- SGN-E277497+ 1.000 474 1.000 C PGS_C06HBa0153O03.1-7-_SGN-E277497+ (8031 7914,6772 6501,6321 6238) Alignment (genomic DNA sequence = upper lines): AAAAACAGGA AAAATGGAAG AAGCTTCAGT AGTAGCAGTG GACAACCAAA AGCCGCAGCA 7972 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAAACAGGA AAAATGGAAG AAGCTTCAGT AGTAGCAGTG GACAACCAAA AGCCGCAGCA 60 AGAGAAGCCT CACACTGATG TTTTGCTTTT CAATCGTTGG TCATATGATG ATGTTCAGGT 7912 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||| AGAGAAGCCT CACACTGATG TTTTGCTTTT CAATCGTTGG TCATATGATG ATGTTCAG.. 118 TTGTTTGTTT CCCTTTCAAT TTTATTCCTC TCCAGTTCCT ATATCTTTTC ATTATTTGCC 7852 .......... .......... .......... .......... .......... .......... 118 TAACATTAAT GTCGAATTGG ATGAAACTTG GCATTTTCGA AATCATAAGA TGAACATTTG 7792 .......... .......... .......... .......... .......... .......... 118 AATTATTTTG TTTCTTGCGT TAGCTAAACT CTAATTGTAG TGTAGCAGAG GTGATATATC 7732 .......... .......... .......... .......... .......... .......... 118 AGTAAGGGTG GGCATGGTAG GGTAGATACC GAAACCAAAA TTTTTCACTC AATGGTTTCA 7672 .......... .......... .......... .......... .......... .......... 118 ATATCATGAC ATTTGATATT ATTTATAATG TATATCGAAT CACCAAATAC TTTAACAGAG 7612 .......... .......... .......... .......... .......... .......... 118 TGTATAGTTA GGTATCCAAT TCATTTATCG TATTATAATA CTAACAAATA TATTAACTAG 7552 .......... .......... .......... .......... .......... .......... 118 TATTAGTTCA AAGTTGTTTA GACATTGAAA GCTTTGACTA CTCTTTTCTT GTTAGAATTG 7492 .......... .......... .......... .......... .......... .......... 118 TCCTTTTTGT GTAATTGATT AAGTGATGGA ATTGCTTCTT CTTTCTTTTG AATATTTTTA 7432 .......... .......... .......... .......... .......... .......... 118 CATGAGTAAG ATCTTTATAT GATATAATTA AGAAGTTTCT AAAGAAACCA AAACATAATT 7372 .......... .......... .......... .......... .......... .......... 118 CTCTATTTAT ATGAGTATAT GTAAGTCGAA GTCGAACAAA CAATGGTTAC CAACCAAAAG 7312 .......... .......... .......... .......... .......... .......... 118 TTAAAAAGTA TCGGCACATA ATGGTTTAAT TTGATATGGT AATGGTATAG TACTTTTAAA 7252 .......... .......... .......... .......... .......... .......... 118 AATCAAAATT ATTGAACCAA AGTTTTCAAT ATTGTATCAT ACCTTTCCAT GCTCATCCCT 7192 .......... .......... .......... .......... .......... .......... 118 ACATATCAGT TCTCAAGTCC AATGCATTGA ATACTTAACC ATGGTTAGGA AACTTGAAAC 7132 .......... .......... .......... .......... .......... .......... 118 ACTATGCACG ACACTGCTTA GGTATGTCTA TCAACTATAA AGCCTGCTGG CTTGATCTTC 7072 .......... .......... .......... .......... .......... .......... 118 TTATTCAAAG AAACATGCAT GCTAAACATG ATATGATTAA GTTGAACAGA ATAGTGTTGG 7012 .......... .......... .......... .......... .......... .......... 118 TTTCCCCAAT CCATAACAAG CCAACTGGGA CAACCTTACA GAAGGTGTGC CTATTCATCA 6952 .......... .......... .......... .......... .......... .......... 118 TTGTTGCCTT GTAAATGATG GATTTATACA ACTGAAAATT ACTTGCTGAG AGTTCAGGGA 6892 .......... .......... .......... .......... .......... .......... 118 AATCCTTGTT GGTTAAGTTG GAAATGTAAT TGTAGGTGGA TTCTTCATTG GAATGCTCAA 6832 .......... .......... .......... .......... .......... .......... 118 AGGAGAAATT CAGTATATGA TCTCTTGAAT TCTCTCTTAA ATGTTATTAT CTCATGCAGA 6772 | .......... .......... .......... .......... .......... .........A 119 TTGCTGATAT TTCTGTTGAG GATTACATAA CTGCTACTGC TAACAAGCAT CCTACATATA 6712 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGCTGATAT TTCTGTTGAG GATTACATAA CTGCTACTGC TAACAAGCAT CCTACATATA 179 CACCACACAC AGCTGGGAGG TACCAAGCCA AGCGGTTTAG AAAGGCTCAA TGCCCAATTG 6652 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CACCACACAC AGCTGGGAGG TACCAAGCCA AGCGGTTTAG AAAGGCTCAA TGCCCAATTG 239 TGGAGAGGTT GACCAACTCA CTGATGATGC ACGGAAGGAA CAACGGGAAG AAGTTGATGG 6592 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGGAGAGGTT GACCAACTCA CTGATGATGC ACGGAAGGAA CAACGGGAAG AAGTTGATGG 299 CCGTTCGTAT TATTAAGCAT GCTATGGAAA TTATCCATCT GTTGACTGAC CTAAACCCAA 6532 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCGTTCGTAT TATTAAGCAT GCTATGGAAA TTATCCATCT GTTGACTGAC CTAAACCCAA 359 TCCAAGTGAT TGTTGATGCT GTTATCAACA GGTTTAGAGA TTATTCTGAT TTTTGCATAT 6472 |||||||||| |||||||||| |||||||||| | TCCAAGTGAT TGTTGATGCT GTTATCAACA G......... .......... .......... 390 TTATTAGCTC GAGTTTTTCT TGCTGAGGTC TTGTTAATTA GAAGATTTTC ATACCATGTC 6412 .......... .......... .......... .......... .......... .......... 390 TTCTTTGTTC CATTTCCATG TCGCGGCATA CTTGAGATAT TGTAGTCATT CTCATTTTTT 6352 .......... .......... .......... .......... .......... .......... 390 CCTTCCCATA TTCTTACCTA TGTGATGCAG TGGACCAAGA GAAGATGCAA CTCGTATAGG 6292 |||||||||| |||||||||| |||||||||| .......... .......... .......... TGGACCAAGA GAAGATGCAA CTCGTATAGG 420 TTCTGCTGGT GTTGTGAGGC GACAAGCTGT TGATATTTCT CCACTCCGTC GTGT 6238 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||| TTCTGCTGGT GTTGTGAGGC GACAAGCTGT TGATATTTCT CCACTCCGTC GTGT 474 hqPGS_C06HBa0153O03.1-7-_SGN-E277497+ (8031 7914,6772 6501,6321 6238) ******************************************************************************** EST sequence 114 +strand 520 n (File: SGN-E319581+) 1 ACACTTCTCC CCGGAAGGTG AATTAGAGCA GGCAAGAGAA GTAGAAGAAG AAATGGACGC 61 AGGTGTAGTT GCTGCCCCCG CCCCGGCCGC CGCCGTCGAT GCAAGCAAAG AGAATAAGGT 121 TCACACTGAT GTCATGCTTT TCAATCGCTG GAGCTATGAT GGAGTTGAGA TCAATGACAT 181 GTCTGTTGAG GATTACATCA CCGCAACTGC TAACAAGCAC CCAGTTTACA TGCCACACAC 241 AGCTGGTAGA TACCAGGCCA AGCGTTTCAG GAAGGCTCAG TGCCCAATCG TTGAGAGGCT 301 CACAAATTCT CTCATGATGC ACGGAAGGAA CAACGGAAAG AAGCTCATGG CTGTTCGTAT 361 TATTAAGCAT GCAATGGAGA TCATTCATTT GTTGACTGAC CAAAACCCAA TTCAAGTCAT 421 TGTTGATGCT GTTATCAACA GTGGGCCAAG GGAAGATGCA ACACGTATTG GTTCTGCTGG 481 TGTTGTCAGA CGTCAAGCTG TTGATATTTC TCCACTCCGT Predicted gene structure (within gDNA segment 8193 to 5633): Exon 1 7967 7914 ( 54 n); cDNA 116 169 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 170 441 ( 272 n); score: 0.842 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6243 ( 79 n); cDNA 442 520 ( 79 n); score: 0.911 MATCH C06HBa0153O03.1-7- SGN-E319581+ 0.849 405 0.779 C PGS_C06HBa0153O03.1-7-_SGN-E319581+ (7967 7914,6772 6501,6321 6243) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 169 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 169 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 169 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 169 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 169 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 169 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 169 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 169 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 169 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 169 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 169 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 169 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 169 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 169 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 169 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 169 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 169 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 169 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 169 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 174 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 234 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 |||||||||| || || |||| | |||||||| || || ||| ||||| |||| |||| || || ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 294 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 354 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 414 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 441 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 441 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 441 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 475 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCCGT 6243 |||||||||| | || || || |||||||||| |||||||||| ||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCGT 520 hqPGS_C06HBa0153O03.1-7-_SGN-E319581+ (7967 7914,6772 6501,6321 6243) ******************************************************************************** EST sequence 70 +strand 518 n (File: SGN-E306428+) 1 CACTTCTCCC CGGAAGGTGA ATTAGAGCAG GCAAGAGAAG TAGAAGAAGA AATGGACGCA 61 GGTGTAGTTG CTGCCCCCGC CCCGGCCGCC GCCGTCGATG CAAGCAAAGA GAATAAGGTT 121 CACACTGATG TCATGCTTTT CAATCGCTGG AGCTATGATG GAGTTGAGAT CAATGACATG 181 TCTGTTGAGG ATTACATCAC CGCAACTGCT AACAAGCACC CAGTTTACAT GCCACACACA 241 GCTGGTAGAT ACCAGGCCAA GCGTTTCAGG AAGGCTCAGT GCCCAATCGT TGAGAGGCTC 301 ACAAATTCTC TCATGATGCA CGGAAGGAAC AACGGAAAGA AGCTCATGGC TGTTCGTATT 361 ATTAAGCATG CAATGGAGAT CATTCATTTG TTGACTGACC AAAACCCAAT TCAAGTCATT 421 GTTGATGCTG TTATCAACAG TGGGCCAAGG GAAGATGCAA CACGTATTGG TTCTGCTGGT 481 GTTGTCAGAC GTCAAGCTGT TGATATTTCT CCACTCCG Predicted gene structure (within gDNA segment 8193 to 5634): Exon 1 7967 7914 ( 54 n); cDNA 115 168 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 169 440 ( 272 n); score: 0.842 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6244 ( 78 n); cDNA 441 518 ( 78 n); score: 0.910 MATCH C06HBa0153O03.1-7- SGN-E306428+ 0.849 404 0.780 C PGS_C06HBa0153O03.1-7-_SGN-E306428+ (7967 7914,6772 6501,6321 6244) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 168 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 168 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 168 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 168 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 168 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 168 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 168 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 168 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 168 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 168 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 168 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 168 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 168 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 168 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 168 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 168 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 168 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 168 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 168 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 173 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 233 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 |||||||||| || || |||| | |||||||| || || ||| ||||| |||| |||| || || ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 293 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 353 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 413 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 440 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 440 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 440 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 474 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCTCCAC TCCG 6244 |||||||||| | || || || |||||||||| |||||||||| |||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCTCCAC TCCG 518 hqPGS_C06HBa0153O03.1-7-_SGN-E306428+ (7967 7914,6772 6501,6321 6244) ******************************************************************************** EST sequence 106 +strand 531 n (File: SGN-E276054+) 1 AGCATACAGC ATAAAGACAC TTCTCCCCGG AAGGTGAATT AGAGCATGCA AGAGAAGTAT 61 AAGAAGAAAT GGACGCAGGT GTAGTTGCTG TCCCCGTCCC GGTCGCCGTC GTTGATGCAA 121 GCAAAGAGAA TATGGTTCAC ACTGATGTCA TGCTTTTCAA TCGCTGGAGC TATGATGGAG 181 TTGAGATCAA TGACATGTCT GTTGAGGATT ACATCACCGT ATCTGCTAAC AAGCACCCAG 241 TTTACATGCC ACACACAGAT GGTAGATACC ACGTCAAGCG TTTCAGGAAG GCTCAGTGCC 301 CAATCGTTGA GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC 361 TCATGGCTGT TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA 421 ACCCAATTCA AGTCATTGGT GATGCTGTTA TCAACAGTGG GCCAAGGGAA GATGCAACAC 481 GTATTGGTTC TGCTGGTGTT GTCAGACGTC AAGCTGTTGA TATTTCTCCA C Predicted gene structure (within gDNA segment 8193 to 5638): Exon 1 7962 7914 ( 49 n); cDNA 137 185 ( 49 n); score: 0.816 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.82), Pa: 1.000 (s: 0.80) Exon 2 6772 6501 ( 272 n); cDNA 186 457 ( 272 n); score: 0.824 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.92), Pa: 0.999 (s: 0.88) Exon 3 6321 6248 ( 74 n); cDNA 458 531 ( 74 n); score: 0.905 MATCH C06HBa0153O03.1-7- SGN-E276054+ 0.841 395 0.744 C PGS_C06HBa0153O03.1-7-_SGN-E276054+ (7962 7914,6772 6501,6321 6248) Alignment (genomic DNA sequence = upper lines): TCACACTGAT GTTTTGCTTT TCAATCGTTG GTCATATGAT GATGTTCAGG TTTGTTTGTT 7903 |||||||||| || |||||| ||||||| || | |||||| | ||| || TCACACTGAT GTCATGCTTT TCAATCGCTG GAGCTATGAT GGAGTTGAG. .......... 185 TCCCTTTCAA TTTTATTCCT CTCCAGTTCC TATATCTTTT CATTATTTGC CTAACATTAA 7843 .......... .......... .......... .......... .......... .......... 185 TGTCGAATTG GATGAAACTT GGCATTTTCG AAATCATAAG ATGAACATTT GAATTATTTT 7783 .......... .......... .......... .......... .......... .......... 185 GTTTCTTGCG TTAGCTAAAC TCTAATTGTA GTGTAGCAGA GGTGATATAT CAGTAAGGGT 7723 .......... .......... .......... .......... .......... .......... 185 GGGCATGGTA GGGTAGATAC CGAAACCAAA ATTTTTCACT CAATGGTTTC AATATCATGA 7663 .......... .......... .......... .......... .......... .......... 185 CATTTGATAT TATTTATAAT GTATATCGAA TCACCAAATA CTTTAACAGA GTGTATAGTT 7603 .......... .......... .......... .......... .......... .......... 185 AGGTATCCAA TTCATTTATC GTATTATAAT ACTAACAAAT ATATTAACTA GTATTAGTTC 7543 .......... .......... .......... .......... .......... .......... 185 AAAGTTGTTT AGACATTGAA AGCTTTGACT ACTCTTTTCT TGTTAGAATT GTCCTTTTTG 7483 .......... .......... .......... .......... .......... .......... 185 TGTAATTGAT TAAGTGATGG AATTGCTTCT TCTTTCTTTT GAATATTTTT ACATGAGTAA 7423 .......... .......... .......... .......... .......... .......... 185 GATCTTTATA TGATATAATT AAGAAGTTTC TAAAGAAACC AAAACATAAT TCTCTATTTA 7363 .......... .......... .......... .......... .......... .......... 185 TATGAGTATA TGTAAGTCGA AGTCGAACAA ACAATGGTTA CCAACCAAAA GTTAAAAAGT 7303 .......... .......... .......... .......... .......... .......... 185 ATCGGCACAT AATGGTTTAA TTTGATATGG TAATGGTATA GTACTTTTAA AAATCAAAAT 7243 .......... .......... .......... .......... .......... .......... 185 TATTGAACCA AAGTTTTCAA TATTGTATCA TACCTTTCCA TGCTCATCCC TACATATCAG 7183 .......... .......... .......... .......... .......... .......... 185 TTCTCAAGTC CAATGCATTG AATACTTAAC CATGGTTAGG AAACTTGAAA CACTATGCAC 7123 .......... .......... .......... .......... .......... .......... 185 GACACTGCTT AGGTATGTCT ATCAACTATA AAGCCTGCTG GCTTGATCTT CTTATTCAAA 7063 .......... .......... .......... .......... .......... .......... 185 GAAACATGCA TGCTAAACAT GATATGATTA AGTTGAACAG AATAGTGTTG GTTTCCCCAA 7003 .......... .......... .......... .......... .......... .......... 185 TCCATAACAA GCCAACTGGG ACAACCTTAC AGAAGGTGTG CCTATTCATC ATTGTTGCCT 6943 .......... .......... .......... .......... .......... .......... 185 TGTAAATGAT GGATTTATAC AACTGAAAAT TACTTGCTGA GAGTTCAGGG AAATCCTTGT 6883 .......... .......... .......... .......... .......... .......... 185 TGGTTAAGTT GGAAATGTAA TTGTAGGTGG ATTCTTCATT GGAATGCTCA AAGGAGAAAT 6823 .......... .......... .......... .......... .......... .......... 185 TCAGTATATG ATCTCTTGAA TTCTCTCTTA AATGTTATTA TCTCATGCAG ATTGCTGATA 6763 || ||| | .......... .......... .......... .......... .......... ATCAATGACA 195 TTTCTGTTGA GGATTACATA ACTGCTACTG CTAACAAGCA TCCTACATAT ACACCACACA 6703 | |||||||| ||||||||| || | ||| |||||||||| || || | ||||||| TGTCTGTTGA GGATTACATC ACCGTATCTG CTAACAAGCA CCCAGTTTAC ATGCCACACA 255 CAGCTGGGAG GTACCAAGCC AAGCGGTTTA GAAAGGCTCA ATGCCCAATT GTGGAGAGGT 6643 ||| ||| || ||||| | | ||||| || | | |||||||| |||||||| || |||||| CAGATGGTAG ATACCACGTC AAGCGTTTCA GGAAGGCTCA GTGCCCAATC GTTGAGAGGC 315 TGACCAACTC ACTGATGATG CACGGAAGGA ACAACGGGAA GAAGTTGATG GCCGTTCGTA 6583 | || || || || |||||| |||||||||| ||||||| || |||| | ||| || ||||||| TCACAAATTC TCTCATGATG CACGGAAGGA ACAACGGAAA GAAGCTCATG GCTGTTCGTA 375 TTATTAAGCA TGCTATGGAA ATTATCCATC TGTTGACTGA CCTAAACCCA ATCCAAGTGA 6523 |||||||||| ||| ||||| || || ||| |||||||||| || ||||||| || ||||| | TTATTAAGCA TGCAATGGAG ATCATTCATT TGTTGACTGA CCAAAACCCA ATTCAAGTCA 435 TTGTTGATGC TGTTATCAAC AGGTTTAGAG ATTATTCTGA TTTTTGCATA TTTATTAGCT 6463 ||| |||||| |||||||||| || TTGGTGATGC TGTTATCAAC AG........ .......... .......... .......... 457 CGAGTTTTTC TTGCTGAGGT CTTGTTAATT AGAAGATTTT CATACCATGT CTTCTTTGTT 6403 .......... .......... .......... .......... .......... .......... 457 CCATTTCCAT GTCGCGGCAT ACTTGAGATA TTGTAGTCAT TCTCATTTTT TCCTTCCCAT 6343 .......... .......... .......... .......... .......... .......... 457 ATTCTTACCT ATGTGATGCA GTGGACCAAG AGAAGATGCA ACTCGTATAG GTTCTGCTGG 6283 ||| ||||| ||||||||| || ||||| | |||||||||| .......... .......... .TGGGCCAAG GGAAGATGCA ACACGTATTG GTTCTGCTGG 496 TGTTGTGAGG CGACAAGCTG TTGATATTTC TCCAC 6248 |||||| || || ||||||| |||||||||| ||||| TGTTGTCAGA CGTCAAGCTG TTGATATTTC TCCAC 531 hqPGS_C06HBa0153O03.1-7-_SGN-E276054+ (7962 7914,6772 6501,6321 6248) ******************************************************************************** EST sequence 101 +strand 515 n (File: SGN-E319734+) 1 AAAGACACTT CTCCCCGGAA GGTGAATTAG AGCAGGCAAG AGAAGTAGAA GAAGAAATGG 61 ACGCAGGTGT AGTTGCTGCC CCCGCCCCGG CCGCCGCCGT CGATGCAAGC AAAGAGAATA 121 AGGTTCACAC TGATGTCATG CTTTTCAATC GCTGGAGCTA TGATGGAGTT GAGATCAATG 181 ACATGTCTGT TGAGGATTAC ATCACCGCAA CTGCTAACAA GCACCCAGTT TACATGCCAC 241 ACACAGCTGG TAGATACCAG GCCAAGCGTT TCAGGAAGGC TCAGTGCCCA ATCGTTGAGA 301 GGCTCACAAA TTCTCTCATG ATGCACGGAA GGAACAACGG AAAGAAGCTC ATGGCTGTTC 361 GTATTATTAA GCATGCAATG GAGATCATTC ATTTGTTGAC TGACCAAAAC CCAATTCAAG 421 TCATTGTTGA TGCTGTTATC AACAGTGGGC CAAGGGAAGA TGCAACACGT ATTGGTTCTG 481 CTGGTGTTGT CAGACGTCAA GCTGTTGATA TTTCT Predicted gene structure (within gDNA segment 8193 to 5642): Exon 1 7967 7914 ( 54 n); cDNA 120 173 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 174 445 ( 272 n); score: 0.842 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6252 ( 70 n); cDNA 446 515 ( 70 n); score: 0.900 MATCH C06HBa0153O03.1-7- SGN-E319734+ 0.846 396 0.769 C PGS_C06HBa0153O03.1-7-_SGN-E319734+ (7967 7914,6772 6501,6321 6252) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 173 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 173 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 173 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 173 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 173 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 173 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 173 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 173 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 173 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 173 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 173 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 173 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 173 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 173 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 173 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 173 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 173 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 173 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 173 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 178 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 238 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 |||||||||| || || |||| | |||||||| || || ||| ||||| |||| |||| || || ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 298 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 358 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 418 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 445 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 445 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 445 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 479 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTCT 6252 |||||||||| | || || || |||||||||| |||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTCT 515 hqPGS_C06HBa0153O03.1-7-_SGN-E319734+ (7967 7914,6772 6501,6321 6252) ******************************************************************************** EST sequence 126 +strand 469 n (File: SGN-E322500+) 1 TAGAAGAAGA AATGGACGCA GGTGTAGTTG CTGCCCCCGC CCCGGCCGCC GCCGTCGATG 61 CAAGCAAAGA GAATAAGGTT CACACTGATG TCATGCTTTT CAATCGCTGG AGCTATGATG 121 GAGTTGAGAT CAATGACATG TCTGTTGAGG ATTACATCAC CGCAACTGCT AACAAGCACC 181 CAGTTTACAT GCCACACACA GCTGGTAGAT ACCAGGCCAA GCGTTTCAGG AAGGCTCAGT 241 GCCCAATCGT TGAGAGGCTC ACAAATTCTC TCATGATGCA CGGAAGGAAC AACGGAAAGA 301 AGCTCATGGC TGTTCGTATT ATTAAGCATG CAATGGAGAT CATTCATTTG TTGACTGACC 361 AAAACCCAAT TCAAGTCATT GTTGATGCTG TTATCAACAG TGGGCCAAGG GAAGATGCAA 421 CACGTATTGG TTCTGCTGGT GTTGTCAGAC GTCAAGCTGT TGATATTTC Predicted gene structure (within gDNA segment 8193 to 5643): Exon 1 7967 7914 ( 54 n); cDNA 75 128 ( 54 n); score: 0.796 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.80), Pa: 1.000 (s: 0.84) Exon 2 6772 6501 ( 272 n); cDNA 129 400 ( 272 n); score: 0.842 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 0.94), Pa: 0.999 (s: 0.88) Exon 3 6321 6253 ( 69 n); cDNA 401 469 ( 69 n); score: 0.899 MATCH C06HBa0153O03.1-7- SGN-E322500+ 0.846 395 0.842 C PGS_C06HBa0153O03.1-7-_SGN-E322500+ (7967 7914,6772 6501,6321 6253) Alignment (genomic DNA sequence = upper lines): AAGCCTCACA CTGATGTTTT GCTTTTCAAT CGTTGGTCAT ATGATGATGT TCAGGTTTGT 7908 ||| ||||| ||||||| | |||||||||| || ||| | |||||| || | || AAGGTTCACA CTGATGTCAT GCTTTTCAAT CGCTGGAGCT ATGATGGAGT TGAG...... 128 TTGTTTCCCT TTCAATTTTA TTCCTCTCCA GTTCCTATAT CTTTTCATTA TTTGCCTAAC 7848 .......... .......... .......... .......... .......... .......... 128 ATTAATGTCG AATTGGATGA AACTTGGCAT TTTCGAAATC ATAAGATGAA CATTTGAATT 7788 .......... .......... .......... .......... .......... .......... 128 ATTTTGTTTC TTGCGTTAGC TAAACTCTAA TTGTAGTGTA GCAGAGGTGA TATATCAGTA 7728 .......... .......... .......... .......... .......... .......... 128 AGGGTGGGCA TGGTAGGGTA GATACCGAAA CCAAAATTTT TCACTCAATG GTTTCAATAT 7668 .......... .......... .......... .......... .......... .......... 128 CATGACATTT GATATTATTT ATAATGTATA TCGAATCACC AAATACTTTA ACAGAGTGTA 7608 .......... .......... .......... .......... .......... .......... 128 TAGTTAGGTA TCCAATTCAT TTATCGTATT ATAATACTAA CAAATATATT AACTAGTATT 7548 .......... .......... .......... .......... .......... .......... 128 AGTTCAAAGT TGTTTAGACA TTGAAAGCTT TGACTACTCT TTTCTTGTTA GAATTGTCCT 7488 .......... .......... .......... .......... .......... .......... 128 TTTTGTGTAA TTGATTAAGT GATGGAATTG CTTCTTCTTT CTTTTGAATA TTTTTACATG 7428 .......... .......... .......... .......... .......... .......... 128 AGTAAGATCT TTATATGATA TAATTAAGAA GTTTCTAAAG AAACCAAAAC ATAATTCTCT 7368 .......... .......... .......... .......... .......... .......... 128 ATTTATATGA GTATATGTAA GTCGAAGTCG AACAAACAAT GGTTACCAAC CAAAAGTTAA 7308 .......... .......... .......... .......... .......... .......... 128 AAAGTATCGG CACATAATGG TTTAATTTGA TATGGTAATG GTATAGTACT TTTAAAAATC 7248 .......... .......... .......... .......... .......... .......... 128 AAAATTATTG AACCAAAGTT TTCAATATTG TATCATACCT TTCCATGCTC ATCCCTACAT 7188 .......... .......... .......... .......... .......... .......... 128 ATCAGTTCTC AAGTCCAATG CATTGAATAC TTAACCATGG TTAGGAAACT TGAAACACTA 7128 .......... .......... .......... .......... .......... .......... 128 TGCACGACAC TGCTTAGGTA TGTCTATCAA CTATAAAGCC TGCTGGCTTG ATCTTCTTAT 7068 .......... .......... .......... .......... .......... .......... 128 TCAAAGAAAC ATGCATGCTA AACATGATAT GATTAAGTTG AACAGAATAG TGTTGGTTTC 7008 .......... .......... .......... .......... .......... .......... 128 CCCAATCCAT AACAAGCCAA CTGGGACAAC CTTACAGAAG GTGTGCCTAT TCATCATTGT 6948 .......... .......... .......... .......... .......... .......... 128 TGCCTTGTAA ATGATGGATT TATACAACTG AAAATTACTT GCTGAGAGTT CAGGGAAATC 6888 .......... .......... .......... .......... .......... .......... 128 CTTGTTGGTT AAGTTGGAAA TGTAATTGTA GGTGGATTCT TCATTGGAAT GCTCAAAGGA 6828 .......... .......... .......... .......... .......... .......... 128 GAAATTCAGT ATATGATCTC TTGAATTCTC TCTTAAATGT TATTATCTCA TGCAGATTGC 6768 || .......... .......... .......... .......... .......... .....ATCAA 133 TGATATTTCT GTTGAGGATT ACATAACTGC TACTGCTAAC AAGCATCCTA CATATACACC 6708 ||| || ||| |||||||||| |||| || || ||||||||| ||||| || || | || TGACATGTCT GTTGAGGATT ACATCACCGC AACTGCTAAC AAGCACCCAG TTTACATGCC 193 ACACACAGCT GGGAGGTACC AAGCCAAGCG GTTTAGAAAG GCTCAATGCC CAATTGTGGA 6648 |||||||||| || || |||| | |||||||| || || ||| ||||| |||| |||| || || ACACACAGCT GGTAGATACC AGGCCAAGCG TTTCAGGAAG GCTCAGTGCC CAATCGTTGA 253 GAGGTTGACC AACTCACTGA TGATGCACGG AAGGAACAAC GGGAAGAAGT TGATGGCCGT 6588 |||| | || || || || | |||||||||| |||||||||| || |||||| | ||||| || GAGGCTCACA AATTCTCTCA TGATGCACGG AAGGAACAAC GGAAAGAAGC TCATGGCTGT 313 TCGTATTATT AAGCATGCTA TGGAAATTAT CCATCTGTTG ACTGACCTAA ACCCAATCCA 6528 |||||||||| |||||||| | |||| || || ||| ||||| ||||||| || ||||||| || TCGTATTATT AAGCATGCAA TGGAGATCAT TCATTTGTTG ACTGACCAAA ACCCAATTCA 373 AGTGATTGTT GATGCTGTTA TCAACAGGTT TAGAGATTAT TCTGATTTTT GCATATTTAT 6468 ||| |||||| |||||||||| ||||||| AGTCATTGTT GATGCTGTTA TCAACAG... .......... .......... .......... 400 TAGCTCGAGT TTTTCTTGCT GAGGTCTTGT TAATTAGAAG ATTTTCATAC CATGTCTTCT 6408 .......... .......... .......... .......... .......... .......... 400 TTGTTCCATT TCCATGTCGC GGCATACTTG AGATATTGTA GTCATTCTCA TTTTTTCCTT 6348 .......... .......... .......... .......... .......... .......... 400 CCCATATTCT TACCTATGTG ATGCAGTGGA CCAAGAGAAG ATGCAACTCG TATAGGTTCT 6288 ||| ||||| |||| ||||||| || ||| |||||| .......... .......... ......TGGG CCAAGGGAAG ATGCAACACG TATTGGTTCT 434 GCTGGTGTTG TGAGGCGACA AGCTGTTGAT ATTTC 6253 |||||||||| | || || || |||||||||| ||||| GCTGGTGTTG TCAGACGTCA AGCTGTTGAT ATTTC 469 hqPGS_C06HBa0153O03.1-7-_SGN-E322500+ (7967 7914,6772 6501,6321 6253) ******************************************************************************** EST sequence 76 +strand 457 n (File: SGN-E295487+) 1 AAAAACAGGA AAAATGGAAG AAGCTTCAGT AGTAGCAGTG GACAACCAAA AGCCGCAGCA 61 AGAGAAGCCT CACACTGATG TTTTGCTTTT CAATCGTTGG TCATATGATG ATGTTCAGAT 121 TGCTGATATT TCTGTTGAGG ATTACATAAC TGCTACTGCT AACAAGCATC CTACATATAC 181 ACCACACACA GCTGGGAGGT ACCAAGCCAA GCGGTTTAGA AAGGCTCAAT GCCCAATTGT 241 GGAGAGGTTG ACCAACTCAC TGATGATGCA CGGAAGGAAC AACGGGAAGA AGTTGATGGC 301 CGTTCGTATT ATTAAGCATG CTATGGAAAT TATCCATCTG TTGACTGACC TAAACCCAAT 361 CCAAGTGATT GTTGATGCTG TTATCAACAG TGGACCAAGA GAAGATGCAA CTCGTATAGG 421 TTCTGCTGGT GTTGTGAGGC GACAAGCTGT TGATATT Predicted gene structure (within gDNA segment 8193 to 5645): Exon 1 8031 7914 ( 118 n); cDNA 1 118 ( 118 n); score: 1.000 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 1.00), Pa: 1.000 (s: 1.00) Exon 2 6772 6501 ( 272 n); cDNA 119 390 ( 272 n); score: 1.000 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 1.00), Pa: 0.999 (s: 1.00) Exon 3 6321 6255 ( 67 n); cDNA 391 457 ( 67 n); score: 1.000 MATCH C06HBa0153O03.1-7- SGN-E295487+ 1.000 457 1.000 C PGS_C06HBa0153O03.1-7-_SGN-E295487+ (8031 7914,6772 6501,6321 6255) Alignment (genomic DNA sequence = upper lines): AAAAACAGGA AAAATGGAAG AAGCTTCAGT AGTAGCAGTG GACAACCAAA AGCCGCAGCA 7972 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAAACAGGA AAAATGGAAG AAGCTTCAGT AGTAGCAGTG GACAACCAAA AGCCGCAGCA 60 AGAGAAGCCT CACACTGATG TTTTGCTTTT CAATCGTTGG TCATATGATG ATGTTCAGGT 7912 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||| AGAGAAGCCT CACACTGATG TTTTGCTTTT CAATCGTTGG TCATATGATG ATGTTCAG.. 118 TTGTTTGTTT CCCTTTCAAT TTTATTCCTC TCCAGTTCCT ATATCTTTTC ATTATTTGCC 7852 .......... .......... .......... .......... .......... .......... 118 TAACATTAAT GTCGAATTGG ATGAAACTTG GCATTTTCGA AATCATAAGA TGAACATTTG 7792 .......... .......... .......... .......... .......... .......... 118 AATTATTTTG TTTCTTGCGT TAGCTAAACT CTAATTGTAG TGTAGCAGAG GTGATATATC 7732 .......... .......... .......... .......... .......... .......... 118 AGTAAGGGTG GGCATGGTAG GGTAGATACC GAAACCAAAA TTTTTCACTC AATGGTTTCA 7672 .......... .......... .......... .......... .......... .......... 118 ATATCATGAC ATTTGATATT ATTTATAATG TATATCGAAT CACCAAATAC TTTAACAGAG 7612 .......... .......... .......... .......... .......... .......... 118 TGTATAGTTA GGTATCCAAT TCATTTATCG TATTATAATA CTAACAAATA TATTAACTAG 7552 .......... .......... .......... .......... .......... .......... 118 TATTAGTTCA AAGTTGTTTA GACATTGAAA GCTTTGACTA CTCTTTTCTT GTTAGAATTG 7492 .......... .......... .......... .......... .......... .......... 118 TCCTTTTTGT GTAATTGATT AAGTGATGGA ATTGCTTCTT CTTTCTTTTG AATATTTTTA 7432 .......... .......... .......... .......... .......... .......... 118 CATGAGTAAG ATCTTTATAT GATATAATTA AGAAGTTTCT AAAGAAACCA AAACATAATT 7372 .......... .......... .......... .......... .......... .......... 118 CTCTATTTAT ATGAGTATAT GTAAGTCGAA GTCGAACAAA CAATGGTTAC CAACCAAAAG 7312 .......... .......... .......... .......... .......... .......... 118 TTAAAAAGTA TCGGCACATA ATGGTTTAAT TTGATATGGT AATGGTATAG TACTTTTAAA 7252 .......... .......... .......... .......... .......... .......... 118 AATCAAAATT ATTGAACCAA AGTTTTCAAT ATTGTATCAT ACCTTTCCAT GCTCATCCCT 7192 .......... .......... .......... .......... .......... .......... 118 ACATATCAGT TCTCAAGTCC AATGCATTGA ATACTTAACC ATGGTTAGGA AACTTGAAAC 7132 .......... .......... .......... .......... .......... .......... 118 ACTATGCACG ACACTGCTTA GGTATGTCTA TCAACTATAA AGCCTGCTGG CTTGATCTTC 7072 .......... .......... .......... .......... .......... .......... 118 TTATTCAAAG AAACATGCAT GCTAAACATG ATATGATTAA GTTGAACAGA ATAGTGTTGG 7012 .......... .......... .......... .......... .......... .......... 118 TTTCCCCAAT CCATAACAAG CCAACTGGGA CAACCTTACA GAAGGTGTGC CTATTCATCA 6952 .......... .......... .......... .......... .......... .......... 118 TTGTTGCCTT GTAAATGATG GATTTATACA ACTGAAAATT ACTTGCTGAG AGTTCAGGGA 6892 .......... .......... .......... .......... .......... .......... 118 AATCCTTGTT GGTTAAGTTG GAAATGTAAT TGTAGGTGGA TTCTTCATTG GAATGCTCAA 6832 .......... .......... .......... .......... .......... .......... 118 AGGAGAAATT CAGTATATGA TCTCTTGAAT TCTCTCTTAA ATGTTATTAT CTCATGCAGA 6772 | .......... .......... .......... .......... .......... .........A 119 TTGCTGATAT TTCTGTTGAG GATTACATAA CTGCTACTGC TAACAAGCAT CCTACATATA 6712 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGCTGATAT TTCTGTTGAG GATTACATAA CTGCTACTGC TAACAAGCAT CCTACATATA 179 CACCACACAC AGCTGGGAGG TACCAAGCCA AGCGGTTTAG AAAGGCTCAA TGCCCAATTG 6652 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CACCACACAC AGCTGGGAGG TACCAAGCCA AGCGGTTTAG AAAGGCTCAA TGCCCAATTG 239 TGGAGAGGTT GACCAACTCA CTGATGATGC ACGGAAGGAA CAACGGGAAG AAGTTGATGG 6592 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGGAGAGGTT GACCAACTCA CTGATGATGC ACGGAAGGAA CAACGGGAAG AAGTTGATGG 299 CCGTTCGTAT TATTAAGCAT GCTATGGAAA TTATCCATCT GTTGACTGAC CTAAACCCAA 6532 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCGTTCGTAT TATTAAGCAT GCTATGGAAA TTATCCATCT GTTGACTGAC CTAAACCCAA 359 TCCAAGTGAT TGTTGATGCT GTTATCAACA GGTTTAGAGA TTATTCTGAT TTTTGCATAT 6472 |||||||||| |||||||||| |||||||||| | TCCAAGTGAT TGTTGATGCT GTTATCAACA G......... .......... .......... 390 TTATTAGCTC GAGTTTTTCT TGCTGAGGTC TTGTTAATTA GAAGATTTTC ATACCATGTC 6412 .......... .......... .......... .......... .......... .......... 390 TTCTTTGTTC CATTTCCATG TCGCGGCATA CTTGAGATAT TGTAGTCATT CTCATTTTTT 6352 .......... .......... .......... .......... .......... .......... 390 CCTTCCCATA TTCTTACCTA TGTGATGCAG TGGACCAAGA GAAGATGCAA CTCGTATAGG 6292 |||||||||| |||||||||| |||||||||| .......... .......... .......... TGGACCAAGA GAAGATGCAA CTCGTATAGG 420 TTCTGCTGGT GTTGTGAGGC GACAAGCTGT TGATATT 6255 |||||||||| |||||||||| |||||||||| ||||||| TTCTGCTGGT GTTGTGAGGC GACAAGCTGT TGATATT 457 hqPGS_C06HBa0153O03.1-7-_SGN-E295487+ (8031 7914,6772 6501,6321 6255) ******************************************************************************** EST sequence 57 +strand 449 n (File: SGN-E304731+) 1 GAAAAATGGA AGAAGCTTCA GTAGTAGCAG TGGACAACCA AAAGCCGCAG CAAGAGAAGC 61 CTCACACTGA TGTTTTGCTT TTCAATCGTT GGTCATATGA TGATGTTCAG ATTGCTGATA 121 TTTCTGTTGA GGATTACATA ACTGCTACTG CTAACAAGCA TCCTACATAT ACACCACACA 181 CAGCTGGGAG GTACCAAGCC AAGCGGTTTA GAAAGGCTCA ATGCCCAATT GTGGAGAGGT 241 TGACCAACTC ACTGATGATG CACGGAAGGA ACAACGGGAA GAAGTTGATG GCCGTTCGTA 301 TTATTAAGCA TGCTATGGAA ATTATCCATC TGTTGACTGA CCTAAACCCA ATCCAAGTGA 361 TTGTTGATGC TGTTATCAAC AGTGGACCAA GAGAAGATGC AACTCGTATA GGTTCTGCTG 421 GTGTTGTGAG GCGACAAGCT GTTGATATT Predicted gene structure (within gDNA segment 8193 to 5645): Exon 1 8023 7914 ( 110 n); cDNA 1 110 ( 110 n); score: 1.000 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 1.00), Pa: 1.000 (s: 1.00) Exon 2 6772 6501 ( 272 n); cDNA 111 382 ( 272 n); score: 1.000 Intron 2 6500 6322 ( 179 n); Pd: 0.992 (s: 1.00), Pa: 0.999 (s: 1.00) Exon 3 6321 6255 ( 67 n); cDNA 383 449 ( 67 n); score: 1.000 MATCH C06HBa0153O03.1-7- SGN-E304731+ 1.000 449 1.000 C PGS_C06HBa0153O03.1-7-_SGN-E304731+ (8023 7914,6772 6501,6321 6255) Alignment (genomic DNA sequence = upper lines): GAAAAATGGA AGAAGCTTCA GTAGTAGCAG TGGACAACCA AAAGCCGCAG CAAGAGAAGC 7964 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAAAAATGGA AGAAGCTTCA GTAGTAGCAG TGGACAACCA AAAGCCGCAG CAAGAGAAGC 60 CTCACACTGA TGTTTTGCTT TTCAATCGTT GGTCATATGA TGATGTTCAG GTTTGTTTGT 7904 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTCACACTGA TGTTTTGCTT TTCAATCGTT GGTCATATGA TGATGTTCAG .......... 110 TTCCCTTTCA ATTTTATTCC TCTCCAGTTC CTATATCTTT TCATTATTTG CCTAACATTA 7844 .......... .......... .......... .......... .......... .......... 110 ATGTCGAATT GGATGAAACT TGGCATTTTC GAAATCATAA GATGAACATT TGAATTATTT 7784 .......... .......... .......... .......... .......... .......... 110 TGTTTCTTGC GTTAGCTAAA CTCTAATTGT AGTGTAGCAG AGGTGATATA TCAGTAAGGG 7724 .......... .......... .......... .......... .......... .......... 110 TGGGCATGGT AGGGTAGATA CCGAAACCAA AATTTTTCAC TCAATGGTTT CAATATCATG 7664 .......... .......... .......... .......... .......... .......... 110 ACATTTGATA TTATTTATAA TGTATATCGA ATCACCAAAT ACTTTAACAG AGTGTATAGT 7604 .......... .......... .......... .......... .......... .......... 110 TAGGTATCCA ATTCATTTAT CGTATTATAA TACTAACAAA TATATTAACT AGTATTAGTT 7544 .......... .......... .......... .......... .......... .......... 110 CAAAGTTGTT TAGACATTGA AAGCTTTGAC TACTCTTTTC TTGTTAGAAT TGTCCTTTTT 7484 .......... .......... .......... .......... .......... .......... 110 GTGTAATTGA TTAAGTGATG GAATTGCTTC TTCTTTCTTT TGAATATTTT TACATGAGTA 7424 .......... .......... .......... .......... .......... .......... 110 AGATCTTTAT ATGATATAAT TAAGAAGTTT CTAAAGAAAC CAAAACATAA TTCTCTATTT 7364 .......... .......... .......... .......... .......... .......... 110 ATATGAGTAT ATGTAAGTCG AAGTCGAACA AACAATGGTT ACCAACCAAA AGTTAAAAAG 7304 .......... .......... .......... .......... .......... .......... 110 TATCGGCACA TAATGGTTTA ATTTGATATG GTAATGGTAT AGTACTTTTA AAAATCAAAA 7244 .......... .......... .......... .......... .......... .......... 110 TTATTGAACC AAAGTTTTCA ATATTGTATC ATACCTTTCC ATGCTCATCC CTACATATCA 7184 .......... .......... .......... .......... .......... .......... 110 GTTCTCAAGT CCAATGCATT GAATACTTAA CCATGGTTAG GAAACTTGAA ACACTATGCA 7124 .......... .......... .......... .......... .......... .......... 110 CGACACTGCT TAGGTATGTC TATCAACTAT AAAGCCTGCT GGCTTGATCT TCTTATTCAA 7064 .......... .......... .......... .......... .......... .......... 110 AGAAACATGC ATGCTAAACA TGATATGATT AAGTTGAACA GAATAGTGTT GGTTTCCCCA 7004 .......... .......... .......... .......... .......... .......... 110 ATCCATAACA AGCCAACTGG GACAACCTTA CAGAAGGTGT GCCTATTCAT CATTGTTGCC 6944 .......... .......... .......... .......... .......... .......... 110 TTGTAAATGA TGGATTTATA CAACTGAAAA TTACTTGCTG AGAGTTCAGG GAAATCCTTG 6884 .......... .......... .......... .......... .......... .......... 110 TTGGTTAAGT TGGAAATGTA ATTGTAGGTG GATTCTTCAT TGGAATGCTC AAAGGAGAAA 6824 .......... .......... .......... .......... .......... .......... 110 TTCAGTATAT GATCTCTTGA ATTCTCTCTT AAATGTTATT ATCTCATGCA GATTGCTGAT 6764 ||||||||| .......... .......... .......... .......... .......... .ATTGCTGAT 119 ATTTCTGTTG AGGATTACAT AACTGCTACT GCTAACAAGC ATCCTACATA TACACCACAC 6704 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTTCTGTTG AGGATTACAT AACTGCTACT GCTAACAAGC ATCCTACATA TACACCACAC 179 ACAGCTGGGA GGTACCAAGC CAAGCGGTTT AGAAAGGCTC AATGCCCAAT TGTGGAGAGG 6644 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACAGCTGGGA GGTACCAAGC CAAGCGGTTT AGAAAGGCTC AATGCCCAAT TGTGGAGAGG 239 TTGACCAACT CACTGATGAT GCACGGAAGG AACAACGGGA AGAAGTTGAT GGCCGTTCGT 6584 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGACCAACT CACTGATGAT GCACGGAAGG AACAACGGGA AGAAGTTGAT GGCCGTTCGT 299 ATTATTAAGC ATGCTATGGA AATTATCCAT CTGTTGACTG ACCTAAACCC AATCCAAGTG 6524 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTATTAAGC ATGCTATGGA AATTATCCAT CTGTTGACTG ACCTAAACCC AATCCAAGTG 359 ATTGTTGATG CTGTTATCAA CAGGTTTAGA GATTATTCTG ATTTTTGCAT ATTTATTAGC 6464 |||||||||| |||||||||| ||| ATTGTTGATG CTGTTATCAA CAG....... .......... .......... .......... 382 TCGAGTTTTT CTTGCTGAGG TCTTGTTAAT TAGAAGATTT TCATACCATG TCTTCTTTGT 6404 .......... .......... .......... .......... .......... .......... 382 TCCATTTCCA TGTCGCGGCA TACTTGAGAT ATTGTAGTCA TTCTCATTTT TTCCTTCCCA 6344 .......... .......... .......... .......... .......... .......... 382 TATTCTTACC TATGTGATGC AGTGGACCAA GAGAAGATGC AACTCGTATA GGTTCTGCTG 6284 |||||||| |||||||||| |||||||||| |||||||||| .......... .......... ..TGGACCAA GAGAAGATGC AACTCGTATA GGTTCTGCTG 420 GTGTTGTGAG GCGACAAGCT GTTGATATT 6255 |||||||||| |||||||||| ||||||||| GTGTTGTGAG GCGACAAGCT GTTGATATT 449 hqPGS_C06HBa0153O03.1-7-_SGN-E304731+ (8023 7914,6772 6501,6321 6255) ******************************************************************************** EST sequence 42 +strand 228 n (File: SGN-E239858+) 1 TTAGAAAGGC TCAATGCCCA ATTGTGGAGA GGTTGACCAA CTCACTGATG ATGCACGGAA 61 GGAACAACGG GAAGAAGTTG ATGGCCGTTC GTATTATTAA GCATGCTATG GAAATTATCC 121 ATCTGTTGAC TGACCTAAAC CCAATCCAAG TGATTGTTGA TGCTGTTATC AACAGTGGAC 181 CAAGAGAAGA TGCAACTCGT ATAGGTTCTG CTGGTGTTGT GAGGCGAC Predicted gene structure (within gDNA segment 7275 to 5659): Exon 1 6675 6501 ( 175 n); cDNA 1 175 ( 175 n); score: 1.000 Intron 1 6500 6322 ( 179 n); Pd: 0.992 (s: 1.00), Pa: 0.999 (s: 1.00) Exon 2 6321 6269 ( 53 n); cDNA 176 228 ( 53 n); score: 1.000 MATCH C06HBa0153O03.1-7- SGN-E239858+ 1.000 228 1.000 C PGS_C06HBa0153O03.1-7-_SGN-E239858+ (6675 6501,6321 6269) Alignment (genomic DNA sequence = upper lines): TTAGAAAGGC TCAATGCCCA ATTGTGGAGA GGTTGACCAA CTCACTGATG ATGCACGGAA 6616 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTAGAAAGGC TCAATGCCCA ATTGTGGAGA GGTTGACCAA CTCACTGATG ATGCACGGAA 60 GGAACAACGG GAAGAAGTTG ATGGCCGTTC GTATTATTAA GCATGCTATG GAAATTATCC 6556 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGAACAACGG GAAGAAGTTG ATGGCCGTTC GTATTATTAA GCATGCTATG GAAATTATCC 120 ATCTGTTGAC TGACCTAAAC CCAATCCAAG TGATTGTTGA TGCTGTTATC AACAGGTTTA 6496 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||| ATCTGTTGAC TGACCTAAAC CCAATCCAAG TGATTGTTGA TGCTGTTATC AACAG..... 175 GAGATTATTC TGATTTTTGC ATATTTATTA GCTCGAGTTT TTCTTGCTGA GGTCTTGTTA 6436 .......... .......... .......... .......... .......... .......... 175 ATTAGAAGAT TTTCATACCA TGTCTTCTTT GTTCCATTTC CATGTCGCGG CATACTTGAG 6376 .......... .......... .......... .......... .......... .......... 175 ATATTGTAGT CATTCTCATT TTTTCCTTCC CATATTCTTA CCTATGTGAT GCAGTGGACC 6316 |||||| .......... .......... .......... .......... .......... ....TGGACC 181 AAGAGAAGAT GCAACTCGTA TAGGTTCTGC TGGTGTTGTG AGGCGAC 6269 |||||||||| |||||||||| |||||||||| |||||||||| ||||||| AAGAGAAGAT GCAACTCGTA TAGGTTCTGC TGGTGTTGTG AGGCGAC 228 hqPGS_C06HBa0153O03.1-7-_SGN-E239858+ (6675 6501,6321 6269) ******************************************************************************** EST sequence 129 +strand 345 n (File: SGN-E320800+) 1 TTAGACCTAT CAGAAAAACA GGAAAAATGG AAGAAGCTTC AGTAGTAGCA GTGGACAACC 61 AAAAGCCGCA GCAAGAGAAG CCTCACACTG ATGTTTTGCT TTTCAATCGT TGGTCATATG 121 ATGATGTTCA GATTGCTGAT ATTTCTGTTG AGGATTACAT AACTGCTACT GCTAACAAGC 181 ATCCTACATA TACACCACAC ACAGCTGGGA GGTACCAAGC CAAGCGGTTT AGAAAGGCTC 241 AATGCCCAAT TGTGGAGAGG TTGACCAACT CACTGATGAT GCACGGAAGG AACACCGGGA 301 AGAAGTTGAT GGCCGTTCGT ATTATTAAGC ATGCTATGGA AATTA Predicted gene structure (within gDNA segment 8193 to 5949): Exon 1 8044 7914 ( 131 n); cDNA 1 131 ( 131 n); score: 1.000 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 1.00), Pa: 1.000 (s: 1.00) Exon 2 6772 6559 ( 214 n); cDNA 132 345 ( 214 n); score: 0.995 MATCH C06HBa0153O03.1-7- SGN-E320800+ 0.997 345 1.000 C PGS_C06HBa0153O03.1-7-_SGN-E320800+ (8044 7914,6772 6559) Alignment (genomic DNA sequence = upper lines): TTAGACCTAT CAGAAAAACA GGAAAAATGG AAGAAGCTTC AGTAGTAGCA GTGGACAACC 7985 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTAGACCTAT CAGAAAAACA GGAAAAATGG AAGAAGCTTC AGTAGTAGCA GTGGACAACC 60 AAAAGCCGCA GCAAGAGAAG CCTCACACTG ATGTTTTGCT TTTCAATCGT TGGTCATATG 7925 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAAGCCGCA GCAAGAGAAG CCTCACACTG ATGTTTTGCT TTTCAATCGT TGGTCATATG 120 ATGATGTTCA GGTTTGTTTG TTTCCCTTTC AATTTTATTC CTCTCCAGTT CCTATATCTT 7865 |||||||||| | ATGATGTTCA G......... .......... .......... .......... .......... 131 TTCATTATTT GCCTAACATT AATGTCGAAT TGGATGAAAC TTGGCATTTT CGAAATCATA 7805 .......... .......... .......... .......... .......... .......... 131 AGATGAACAT TTGAATTATT TTGTTTCTTG CGTTAGCTAA ACTCTAATTG TAGTGTAGCA 7745 .......... .......... .......... .......... .......... .......... 131 GAGGTGATAT ATCAGTAAGG GTGGGCATGG TAGGGTAGAT ACCGAAACCA AAATTTTTCA 7685 .......... .......... .......... .......... .......... .......... 131 CTCAATGGTT TCAATATCAT GACATTTGAT ATTATTTATA ATGTATATCG AATCACCAAA 7625 .......... .......... .......... .......... .......... .......... 131 TACTTTAACA GAGTGTATAG TTAGGTATCC AATTCATTTA TCGTATTATA ATACTAACAA 7565 .......... .......... .......... .......... .......... .......... 131 ATATATTAAC TAGTATTAGT TCAAAGTTGT TTAGACATTG AAAGCTTTGA CTACTCTTTT 7505 .......... .......... .......... .......... .......... .......... 131 CTTGTTAGAA TTGTCCTTTT TGTGTAATTG ATTAAGTGAT GGAATTGCTT CTTCTTTCTT 7445 .......... .......... .......... .......... .......... .......... 131 TTGAATATTT TTACATGAGT AAGATCTTTA TATGATATAA TTAAGAAGTT TCTAAAGAAA 7385 .......... .......... .......... .......... .......... .......... 131 CCAAAACATA ATTCTCTATT TATATGAGTA TATGTAAGTC GAAGTCGAAC AAACAATGGT 7325 .......... .......... .......... .......... .......... .......... 131 TACCAACCAA AAGTTAAAAA GTATCGGCAC ATAATGGTTT AATTTGATAT GGTAATGGTA 7265 .......... .......... .......... .......... .......... .......... 131 TAGTACTTTT AAAAATCAAA ATTATTGAAC CAAAGTTTTC AATATTGTAT CATACCTTTC 7205 .......... .......... .......... .......... .......... .......... 131 CATGCTCATC CCTACATATC AGTTCTCAAG TCCAATGCAT TGAATACTTA ACCATGGTTA 7145 .......... .......... .......... .......... .......... .......... 131 GGAAACTTGA AACACTATGC ACGACACTGC TTAGGTATGT CTATCAACTA TAAAGCCTGC 7085 .......... .......... .......... .......... .......... .......... 131 TGGCTTGATC TTCTTATTCA AAGAAACATG CATGCTAAAC ATGATATGAT TAAGTTGAAC 7025 .......... .......... .......... .......... .......... .......... 131 AGAATAGTGT TGGTTTCCCC AATCCATAAC AAGCCAACTG GGACAACCTT ACAGAAGGTG 6965 .......... .......... .......... .......... .......... .......... 131 TGCCTATTCA TCATTGTTGC CTTGTAAATG ATGGATTTAT ACAACTGAAA ATTACTTGCT 6905 .......... .......... .......... .......... .......... .......... 131 GAGAGTTCAG GGAAATCCTT GTTGGTTAAG TTGGAAATGT AATTGTAGGT GGATTCTTCA 6845 .......... .......... .......... .......... .......... .......... 131 TTGGAATGCT CAAAGGAGAA ATTCAGTATA TGATCTCTTG AATTCTCTCT TAAATGTTAT 6785 .......... .......... .......... .......... .......... .......... 131 TATCTCATGC AGATTGCTGA TATTTCTGTT GAGGATTACA TAACTGCTAC TGCTAACAAG 6725 |||||||| |||||||||| |||||||||| |||||||||| |||||||||| .......... ..ATTGCTGA TATTTCTGTT GAGGATTACA TAACTGCTAC TGCTAACAAG 179 CATCCTACAT ATACACCACA CACAGCTGGG AGGTACCAAG CCAAGCGGTT TAGAAAGGCT 6665 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CATCCTACAT ATACACCACA CACAGCTGGG AGGTACCAAG CCAAGCGGTT TAGAAAGGCT 239 CAATGCCCAA TTGTGGAGAG GTTGACCAAC TCACTGATGA TGCACGGAAG GAACAACGGG 6605 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||| |||| CAATGCCCAA TTGTGGAGAG GTTGACCAAC TCACTGATGA TGCACGGAAG GAACACCGGG 299 AAGAAGTTGA TGGCCGTTCG TATTATTAAG CATGCTATGG AAATTA 6559 |||||||||| |||||||||| |||||||||| |||||||||| |||||| AAGAAGTTGA TGGCCGTTCG TATTATTAAG CATGCTATGG AAATTA 345 hqPGS_C06HBa0153O03.1-7-_SGN-E320800+ (8044 7914,6772 6559) ******************************************************************************** EST sequence 41 +strand 362 n (File: SGN-E280221+) 1 GCACGAGCGG CACGAGCTTA GACCTATCAG AAAAACAGGA AAAATGGAAG AAGCTTCAGT 61 AGTAGCAGTG CGCCAACCAA AAGCCGCAGC AAGAGAAGCC TCACACTGAT GTTTTGCTTT 121 TCAATCGTTG GTCATATGAT GATGTTCAGA TTGCTGATAT TTCTGTTGAG GATTACATAA 181 CTGCTACTGC TAACAAGCAT CCTACATATA CACCACACAC AGCTGGGAGG TACCAAGCCA 241 AGCGGTTTAG AAAGGCTCAA TGCCCAATTG TGGAGAGGTT GACCAACTCA CTGATGATGC 301 ACGGAAGGAA CAACGGGAAG AAGTTGATGG CCGTTCGTAT TATTAAGCAT GCTATGGAAA 361 TT Predicted gene structure (within gDNA segment 8193 to 5950): Exon 1 8054 7914 ( 141 n); cDNA 8 149 ( 142 n); score: 0.947 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 1.00), Pa: 1.000 (s: 1.00) Exon 2 6772 6560 ( 213 n); cDNA 150 362 ( 213 n); score: 1.000 MATCH C06HBa0153O03.1-7- SGN-E280221+ 0.979 354 0.978 C PGS_C06HBa0153O03.1-7-_SGN-E280221+ (8054 7914,6772 6560) Alignment (genomic DNA sequence = upper lines): CGACACCTCC TTAGACCTAT CAGAAAAACA GGAAAAATGG AAGAAGCTTC AGTAGTAGCA 7995 || ||| | |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CGGCACGAGC TTAGACCTAT CAGAAAAACA GGAAAAATGG AAGAAGCTTC AGTAGTAGCA 67 GTG-GACAAC CAAAAGCCGC AGCAAGAGAA GCCTCACACT GATGTTTTGC TTTTCAATCG 7936 ||| | |||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTGCGCCAAC CAAAAGCCGC AGCAAGAGAA GCCTCACACT GATGTTTTGC TTTTCAATCG 127 TTGGTCATAT GATGATGTTC AGGTTTGTTT GTTTCCCTTT CAATTTTATT CCTCTCCAGT 7876 |||||||||| |||||||||| || TTGGTCATAT GATGATGTTC AG........ .......... .......... .......... 149 TCCTATATCT TTTCATTATT TGCCTAACAT TAATGTCGAA TTGGATGAAA CTTGGCATTT 7816 .......... .......... .......... .......... .......... .......... 149 TCGAAATCAT AAGATGAACA TTTGAATTAT TTTGTTTCTT GCGTTAGCTA AACTCTAATT 7756 .......... .......... .......... .......... .......... .......... 149 GTAGTGTAGC AGAGGTGATA TATCAGTAAG GGTGGGCATG GTAGGGTAGA TACCGAAACC 7696 .......... .......... .......... .......... .......... .......... 149 AAAATTTTTC ACTCAATGGT TTCAATATCA TGACATTTGA TATTATTTAT AATGTATATC 7636 .......... .......... .......... .......... .......... .......... 149 GAATCACCAA ATACTTTAAC AGAGTGTATA GTTAGGTATC CAATTCATTT ATCGTATTAT 7576 .......... .......... .......... .......... .......... .......... 149 AATACTAACA AATATATTAA CTAGTATTAG TTCAAAGTTG TTTAGACATT GAAAGCTTTG 7516 .......... .......... .......... .......... .......... .......... 149 ACTACTCTTT TCTTGTTAGA ATTGTCCTTT TTGTGTAATT GATTAAGTGA TGGAATTGCT 7456 .......... .......... .......... .......... .......... .......... 149 TCTTCTTTCT TTTGAATATT TTTACATGAG TAAGATCTTT ATATGATATA ATTAAGAAGT 7396 .......... .......... .......... .......... .......... .......... 149 TTCTAAAGAA ACCAAAACAT AATTCTCTAT TTATATGAGT ATATGTAAGT CGAAGTCGAA 7336 .......... .......... .......... .......... .......... .......... 149 CAAACAATGG TTACCAACCA AAAGTTAAAA AGTATCGGCA CATAATGGTT TAATTTGATA 7276 .......... .......... .......... .......... .......... .......... 149 TGGTAATGGT ATAGTACTTT TAAAAATCAA AATTATTGAA CCAAAGTTTT CAATATTGTA 7216 .......... .......... .......... .......... .......... .......... 149 TCATACCTTT CCATGCTCAT CCCTACATAT CAGTTCTCAA GTCCAATGCA TTGAATACTT 7156 .......... .......... .......... .......... .......... .......... 149 AACCATGGTT AGGAAACTTG AAACACTATG CACGACACTG CTTAGGTATG TCTATCAACT 7096 .......... .......... .......... .......... .......... .......... 149 ATAAAGCCTG CTGGCTTGAT CTTCTTATTC AAAGAAACAT GCATGCTAAA CATGATATGA 7036 .......... .......... .......... .......... .......... .......... 149 TTAAGTTGAA CAGAATAGTG TTGGTTTCCC CAATCCATAA CAAGCCAACT GGGACAACCT 6976 .......... .......... .......... .......... .......... .......... 149 TACAGAAGGT GTGCCTATTC ATCATTGTTG CCTTGTAAAT GATGGATTTA TACAACTGAA 6916 .......... .......... .......... .......... .......... .......... 149 AATTACTTGC TGAGAGTTCA GGGAAATCCT TGTTGGTTAA GTTGGAAATG TAATTGTAGG 6856 .......... .......... .......... .......... .......... .......... 149 TGGATTCTTC ATTGGAATGC TCAAAGGAGA AATTCAGTAT ATGATCTCTT GAATTCTCTC 6796 .......... .......... .......... .......... .......... .......... 149 TTAAATGTTA TTATCTCATG CAGATTGCTG ATATTTCTGT TGAGGATTAC ATAACTGCTA 6736 ||||||| |||||||||| |||||||||| |||||||||| .......... .......... ...ATTGCTG ATATTTCTGT TGAGGATTAC ATAACTGCTA 186 CTGCTAACAA GCATCCTACA TATACACCAC ACACAGCTGG GAGGTACCAA GCCAAGCGGT 6676 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTGCTAACAA GCATCCTACA TATACACCAC ACACAGCTGG GAGGTACCAA GCCAAGCGGT 246 TTAGAAAGGC TCAATGCCCA ATTGTGGAGA GGTTGACCAA CTCACTGATG ATGCACGGAA 6616 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTAGAAAGGC TCAATGCCCA ATTGTGGAGA GGTTGACCAA CTCACTGATG ATGCACGGAA 306 GGAACAACGG GAAGAAGTTG ATGGCCGTTC GTATTATTAA GCATGCTATG GAAATT 6560 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||| GGAACAACGG GAAGAAGTTG ATGGCCGTTC GTATTATTAA GCATGCTATG GAAATT 362 hqPGS_C06HBa0153O03.1-7-_SGN-E280221+ (8054 7914,6772 6560) ******************************************************************************** EST sequence 130 +strand 228 n (File: SGN-E277843+) 1 GCACGAGCGG CACGAGCTTA GACCTATCAG AAAAACAGGA AAAATGGAAG AAGCTTGGGT 61 AGTAGCAGTG GACAACCATG AGCCGGAGCA GGAGAAGCCT CACACTGATG TTTTGCTATT 121 TAATGGTTGG TCATATGATG ATGTTCAGAT TGCTGATGTA GCTGTTGATG ATGACATCAT 181 TGCTACTGAT AACGGGTATG GTTCTTATAC TCCAGACACA TATGGGAT Predicted gene structure (within gDNA segment 8193 to 6504): Exon 1 8054 7914 ( 141 n); cDNA 8 148 ( 141 n); score: 0.908 Intron 1 7913 6773 (1141 n); Pd: 0.998 (s: 0.94), Pa: 1.000 (s: 0.78) Exon 2 6772 6694 ( 79 n); cDNA 149 227 ( 79 n); score: 0.759 MATCH C06HBa0153O03.1-7- SGN-E277843+ 0.855 220 0.965 C PGS_C06HBa0153O03.1-7-_SGN-E277843+ (8054 7914,6772 6694) Alignment (genomic DNA sequence = upper lines): CGACACCTCC TTAGACCTAT CAGAAAAACA GGAAAAATGG AAGAAGCTTC AGTAGTAGCA 7995 || ||| | |||||||||| |||||||||| |||||||||| ||||||||| ||||||||| CGGCACGAGC TTAGACCTAT CAGAAAAACA GGAAAAATGG AAGAAGCTTG GGTAGTAGCA 67 GTGGACAACC AAAAGCCGCA GCAAGAGAAG CCTCACACTG ATGTTTTGCT TTTCAATCGT 7935 |||||||||| | ||||| | ||| |||||| |||||||||| |||||||||| || ||| || GTGGACAACC ATGAGCCGGA GCAGGAGAAG CCTCACACTG ATGTTTTGCT ATTTAATGGT 127 TGGTCATATG ATGATGTTCA GGTTTGTTTG TTTCCCTTTC AATTTTATTC CTCTCCAGTT 7875 |||||||||| |||||||||| | TGGTCATATG ATGATGTTCA G......... .......... .......... .......... 148 CCTATATCTT TTCATTATTT GCCTAACATT AATGTCGAAT TGGATGAAAC TTGGCATTTT 7815 .......... .......... .......... .......... .......... .......... 148 CGAAATCATA AGATGAACAT TTGAATTATT TTGTTTCTTG CGTTAGCTAA ACTCTAATTG 7755 .......... .......... .......... .......... .......... .......... 148 TAGTGTAGCA GAGGTGATAT ATCAGTAAGG GTGGGCATGG TAGGGTAGAT ACCGAAACCA 7695 .......... .......... .......... .......... .......... .......... 148 AAATTTTTCA CTCAATGGTT TCAATATCAT GACATTTGAT ATTATTTATA ATGTATATCG 7635 .......... .......... .......... .......... .......... .......... 148 AATCACCAAA TACTTTAACA GAGTGTATAG TTAGGTATCC AATTCATTTA TCGTATTATA 7575 .......... .......... .......... .......... .......... .......... 148 ATACTAACAA ATATATTAAC TAGTATTAGT TCAAAGTTGT TTAGACATTG AAAGCTTTGA 7515 .......... .......... .......... .......... .......... .......... 148 CTACTCTTTT CTTGTTAGAA TTGTCCTTTT TGTGTAATTG ATTAAGTGAT GGAATTGCTT 7455 .......... .......... .......... .......... .......... .......... 148 CTTCTTTCTT TTGAATATTT TTACATGAGT AAGATCTTTA TATGATATAA TTAAGAAGTT 7395 .......... .......... .......... .......... .......... .......... 148 TCTAAAGAAA CCAAAACATA ATTCTCTATT TATATGAGTA TATGTAAGTC GAAGTCGAAC 7335 .......... .......... .......... .......... .......... .......... 148 AAACAATGGT TACCAACCAA AAGTTAAAAA GTATCGGCAC ATAATGGTTT AATTTGATAT 7275 .......... .......... .......... .......... .......... .......... 148 GGTAATGGTA TAGTACTTTT AAAAATCAAA ATTATTGAAC CAAAGTTTTC AATATTGTAT 7215 .......... .......... .......... .......... .......... .......... 148 CATACCTTTC CATGCTCATC CCTACATATC AGTTCTCAAG TCCAATGCAT TGAATACTTA 7155 .......... .......... .......... .......... .......... .......... 148 ACCATGGTTA GGAAACTTGA AACACTATGC ACGACACTGC TTAGGTATGT CTATCAACTA 7095 .......... .......... .......... .......... .......... .......... 148 TAAAGCCTGC TGGCTTGATC TTCTTATTCA AAGAAACATG CATGCTAAAC ATGATATGAT 7035 .......... .......... .......... .......... .......... .......... 148 TAAGTTGAAC AGAATAGTGT TGGTTTCCCC AATCCATAAC AAGCCAACTG GGACAACCTT 6975 .......... .......... .......... .......... .......... .......... 148 ACAGAAGGTG TGCCTATTCA TCATTGTTGC CTTGTAAATG ATGGATTTAT ACAACTGAAA 6915 .......... .......... .......... .......... .......... .......... 148 ATTACTTGCT GAGAGTTCAG GGAAATCCTT GTTGGTTAAG TTGGAAATGT AATTGTAGGT 6855 .......... .......... .......... .......... .......... .......... 148 GGATTCTTCA TTGGAATGCT CAAAGGAGAA ATTCAGTATA TGATCTCTTG AATTCTCTCT 6795 .......... .......... .......... .......... .......... .......... 148 TAAATGTTAT TATCTCATGC AGATTGCTGA TATTTCTGTT GAGGATTACA TAACTGCTAC 6735 |||||||| | | ||||| || ||| ||| | | |||||| .......... .......... ..ATTGCTGA TGTAGCTGTT GATGATGACA TCATTGCTAC 186 TGCTAACAAG CATCCTACAT ATACACCACA CACAGCTGGG A 6694 || |||| | || | | | |||| ||| | |||| |||| | TGATAACGGG TATGGTTCTT ATACTCCAGA CACATATGGG A 227 hqPGS_C06HBa0153O03.1-7-_SGN-E277843+ (8054 7914,6772 6694) Total number of EST alignments reported: 140 ________________________________________________________________________________ Predicted gene locations (2) in segment 1 to 8193: PGL 1 (- strand): 2972 773 AGS-1 (2972 2262,2218 2157,2086 1884,1849 1265,1163 886,822 773) SCR (e 0.849 d 0.933 a 0.000,e 0.935 d 0.000 a 0.000,e 0.889 d 0.000 a 0.000,e 0.743 d 0.900 a 0.000,e 0.745 d 0.000 a 0.000,e 0.720) Exon 1 2972 2262 ( 711 n); score: 0.849 Intron 1 2261 2219 ( 43 n); Pd: 0.933 Pa: 0.000 Exon 2 2218 2157 ( 62 n); score: 0.935 Intron 2 2156 2087 ( 70 n); Pd: 0.000 Pa: 0.000 Exon 3 2086 1884 ( 203 n); score: 0.889 Intron 3 1883 1850 ( 34 n); Pd: 0.000 Pa: 0.000 Exon 4 1849 1265 ( 585 n); score: 0.743 Intron 4 1264 1164 ( 101 n); Pd: 0.900 Pa: 0.000 Exon 5 1163 886 ( 278 n); score: 0.745 Intron 5 885 823 ( 63 n); Pd: 0.000 Pa: 0.000 Exon 6 822 773 ( 50 n); score: 0.720 PGS (1580 1265,1163 886,822 773) SGN-E396039+ PGS (1580 1265,1163 903) SGN-E389553- PGS (1580 1265,1163 903) SGN-E550464+ PGS (1579 1265,1163 903) SGN-E374999+ PGS (1001 903) SGN-E275667- PGS (1580 1265,1163 906) SGN-E389834+ PGS (1580 1265,1163 906) SGN-E396054+ PGS (1580 1265,1163 906) SGN-E396058+ PGS (1579 1265,1163 990) SGN-E241959+ PGS (1579 1265,1163 999) SGN-E236652+ PGS (1960 1884,1849 1371) SGN-E546548- PGS (2692 2262,2218 2157,2086 1907) SGN-E241789+ PGS (2551 2316) SGN-E209683- PGS (2972 2476) SGN-E351546- PGS (2962 2476) SGN-E356206- PGS (2962 2476) SGN-E356696- PGS (2833 2547) SGN-E222578+ PGS (2972 2557) SGN-E392027+ PGS (2972 2558) SGN-E370357+ PGS (2968 2558) SGN-E542084+ PGS (2821 2561) SGN-E373117- PGS (2821 2561) SGN-E373116+ PGS (2833 2587) SGN-E216150+ PGS (2821 2638) SGN-E352844- PGS (2806 2638) SGN-E368629- PGS (2796 2638) SGN-E238551- 3-phase translation of AGS-1 (-strand): . . . . . . 2972 AGCTTAAATATGTCACGACCCAAACCGGGTTGCGACTGGCACCCACACTTTCCCTCCTAT S L N M S R P K P G C D W H P H F P S Y A - I C H D P N R V A T G T H T F P P M L K Y V T T Q T G L R L A P T L S L L . . . . . . 2912 GTGAGCGAACCAACCAATCTAACCTTAACATTTCAATATAATATCAACAGAAAGTAATGC V S E P T N L T L T F Q Y N I N R K - C - A N Q P I - P - H F N I I S T E S N A C E R T N Q S N L N I S I - Y Q Q K V M . . . . . . 2852 GGAAGACTTAAACTCATCAAATAAAGACCAATTCATTAACTTCTAAAATTCAACATCTAT G R L K L I K - R P I H - L L K F N I Y E D L N S S N K D Q F I N F - N S T S I R K T - T H Q I K T N S L T S K I Q H L . . . . . . 2792 TATTTCCCAAAATCTGGAAGTCATCATCACAAGAACATCTACGATCAAATGACTAAACTA Y F P K S G S H H H K N I Y D Q M T K L I S Q N L E V I I T R T S T I K - L N - L F P K I W K S S S Q E H L R S N D - T . . . . . . 2732 AGAGTATTCTAAAAGCTAAAAATACATAAGAAGTTAGTCCATGCCGGAAGTTCAAGGCAT R V F - K L K I H K K L V H A G S S R H E Y S K S - K Y I R S - S M P E V Q G I K S I L K A K N T - E V S P C R K F K A . . . . . . 2672 CAAGACTTGAAGAAGAAGATCCAGTCCAAGCTAGAGGCATTAGCTTACCCTGAATTTTCG Q D L K K K I Q S K L E A L A Y P E F S K T - R R R S S P S - R H - L T L N F R S R L E E E D P V Q A R G I S L P - I F . . . . . . 2612 ATGTAGTAAGACTGGCTTGAATTACTGTTGAGTTGAGGACGATGACACGTTTGCTGCACT M - - D W L E L L L S - G R - H V C C T C S K T G L N Y C - V E D D D T F A A L D V V R L A - I T V E L R T M T R L L H . . . . . . 2552 CCACAAATAAACAAGAAGAAAACATAAAAGTAGGGGTCAGTACAAAACACGGGTACTGAG P Q I N K K K T - K - G S V Q N T G T E H K - T R R K H K S R G Q Y K T R V L S S T N K Q E E N I K V G V S T K H G Y - . . . . . . 2492 TAGATATCATCGGCCAACTACAAATAGAAAACAATATATACCAAGTAATATCATAAAATC - I S S A N Y K - K T I Y T K - Y H K I R Y H R P T T N R K Q Y I P S N I I K S V D I I G Q L Q I E N N I Y Q V I S - N . . . . . . 2432 AACTATGATACTCAACATGTAGCAACAACAAGCACTATCTCATTAACAGTTACCGTCAAG N Y D T Q H V A T T S T I S L T V T V K T M I L N M - Q Q Q A L S H - Q L P S S Q L - Y S T C S N N K H Y L I N S Y R Q . . . . . . 2372 TTCACACATGAGGACTCAAGCCTCAATACCATACTCATTTGGGAATCATGTTCATTAGAT F T H E D S S L N T I L I W E S C S L D S H M R T Q A S I P Y S F G N H V H - I V H T - G L K P Q Y H T H L G I M F I R . . . . . . : 2312 TGAGTATATTAACATCTTTCAAGATTCATGATCTTTATTTCTCTTGTGTCG : TCGGTACGT - V Y - H L S R F M I F I S L V S : S V R E Y I N I F Q D S - S L F L L C R : R Y V L S I L T S F K I H D L Y F S C V : V G T . . . . . . : 2209 GACACTCCGATCCCCTAAATCTACGTGTCGGTTCGTGACACCCGATCCCCTAA : TTCTACG D T P I P - I Y V S V R D T R S P N : S T T L R S P K S T C R F V T P D P L : I L R - H S D P L N L R V G S - H P I P - : F Y . . . . . . 2079 TGTCGGTTCGTGACACCCAATCCCCTAATTCTACGTGTCGGTTCGTGACACCCGATCCCC C R F V T P N P L I L R V G S - H P I P V G S - H P I P - F Y V S V R D T R S P V S V R D T Q S P N S T C R F V T P D P . . . . . . 2019 TAATACTACGTGTCGGTTCATGACACCCGATCCCCTAATACTACGTGTCGGTTCGTGACA - Y Y V S V H D T R S P N T T C R F V T N T T C R F M T P D P L I L R V G S - H L I L R V G S - H P I P - Y Y V S V R D . . . . . . 1959 CCCGATCCCCTAATCTCCTTCTATCAATTCATCAAGCCTTCTTTCTTACCAAGGCATCAT P D P L I S F Y Q F I K P S F L P R H H P I P - S P S I N S S S L L S Y Q G I I T R S P N L L L S I H Q A F F L T K A S . . : . . . . 1899 CCATCCCATTATTTTA : TATTAACAAAGAGATTAGGATTTTACAAGATTTGGGATTCAATA P S H Y F : I L T K R L G F Y K I W D S I H P I I L : Y - Q R D - D F T R F G I Q - S I P L F Y : I N K E I R I L Q D L G F N . . . . . . 1805 ACTTCATCATGCTTAATATAATCACAATTATATAATCATGTTCATGCATGCATACAATTA T S S C L I - S Q L Y N H V H A C I Q L L H H A - Y N H N Y I I M F M H A Y N - N F I M L N I I T I I - S C S C M H T I . . . . . . 1745 AGCACATAGCAGGGTTTACAATACTATCAATACATATCATTCTCTATTAAGAGTTTACTA S T - Q G L Q Y Y Q Y I S F S I K S L L A H S R V Y N T I N T Y H S L L R V Y Y K H I A G F T I L S I H I I L Y - E F T . . . . . . 1685 TGAAAGCATGAAAACCATAACCTACCTCCACCGAAGATTCGTGATCAAGCAAGCAAATTT - K H E N H N L P P P K I R D Q A S K F E S M K T I T Y L H R R F V I K Q A N F M K A - K P - P T S T E D S - S S K Q I . . . . . . 1625 TCTCAAAGCTTTGTGTTTTTCCCCTTCTCGATCGTCTCTCTCTCTATCGATTCCCTTCTC S Q S F V F F P F S I V S L S I D S L L L K A L C F S P S R S S L S L S I P F S F S K L C V F P L L D R L S L Y R F P S . . . . . . 1565 TCTCTTTCTCTTGTTCTTTCTATTTTCTTTATTCAAACCCTCTTTCTTTTACCCTAATTA S L S L V L S I F F I Q T L F L L P - L L F L L F F L F S L F K P S F F Y P N - L S F S C S F Y F L Y S N P L S F T L I . . . . . . 1505 GTATATAATTAAGAATAAAAGATGACAATAATAGCCCACTAATTAACTTAAGGTTACCTC V Y N - E - K M T I I A H - L T - G Y L Y I I K N K R - Q - - P T N - L K V T S S I - L R I K D D N N S P L I N L R L P . . . . . . 1445 TTTTATTCCCCCAAGAAATTGAGTTATTAATATAGACCCACGAAATATATAATTATAGCA F Y S P K K L S Y - Y R P T K Y I I I A F I P P R N - V I N I D P R N I - L - Q L L F P Q E I E L L I - T H E I Y N Y S . . . . . . 1385 GGAATAGTCCAAAACGCCCCTTTAAAACTTAACCAGAATTCCGACTTCAACTGGGATTAC G I V Q N A P L K L N Q N S D F N W D Y E - S K T P L - N L T R I P T S T G I T R N S P K R P F K T - P E F R L Q L G L . . . . . . 1325 GCAACCTGTGACGGCCCGTCGCGCCTGCGACGGTCCGTCATGCAGGTTCGTCAGAGATTC A T C D G P S R L R R S V M Q V R Q R F Q P V T A R R A C D G P S C R F V R D S R N L - R P V A P A T V R H A G S S E I . : . . . . . 1265 G : GAGTTGGAGTGTTTTGAAACGGTGGATCACGACGGTTCATCGTGCCTGTGACGGTCCGT : G V G V F - N G G S R R F I V P V T V R : E L E C F E T V D H D G S S C L - R S V R : S W S V L K R W I T T V H R A C D G P . . . . . . 1104 CCTGCAGGTCCGTCACAGAGTTCAGAGAGTCAATTTCAGCACCCAAATTTCAGAATTTCT P A G P S Q S S E S Q F Q H P N F R I S L Q V R H R V Q R V N F S T Q I S E F L S C R S V T E F R E S I S A P K F Q N F . . . . . . 1044 AAGTGTTTTGGGACGAAACACCCTCGACGGTCCGTCGTGCCCATGACGTTCCGTCATGCC K C F G T K H P R R S V V P M T F R H A S V L G R N T L D G P S C P - R S V M P - V F W D E T P S T V R R A H D V P S C . . . . . . 984 CATGACGTTCCGTCGTGGGTTCCGTCGTCTCAGCCTGTTTTTCCAGAAATAAAATCTGCT H D V P S W V P S S Q P V F P E I K S A M T F R R G F R R L S L F F Q K - N L L P - R S V V G S V V S A C F S R N K I C . . . . : . . 924 GCTCAAAACAACTAAACAGGTCGTTACAAAATATTTTTT : TCTTAATTATTATTATTATTA A Q N N - T G R Y K I F F : S - L L L L L L K T T K Q V V T K Y F F : L N Y Y Y Y Y C S K Q L N R S L Q N I F : F L I I I I I . . . 801 TTATTTTATAACAAAAAAAATAATTAAAA L F Y N K K N N - Y F I T K K I I K I I L - Q K K - L K Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-7-_PGL-1_AGS-1_PPS_1 (1744 1505) (frame '2'; 237 bp, 79 residues) 1 AHSRVYNTIN TYHSLLRVYY ESMKTITYLH RRFVIKQANF LKALCFSPSR SSLSLSIPFS 61 LFLLFFLFSL FKPSFFYPN- >C06HBa0153O03.1-7-_PGL-1_AGS-1_PPS_2 (1146 910) (frame '1'; 234 bp, 78 residues) 1 NGGSRRFIVP VTVRPAGPSQ SSESQFQHPN FRISKCFGTK HPRRSVVPMT FRHAHDVPSW 61 VPSSQPVFPE IKSAAQNN- >C06HBa0153O03.1-7-_PGL-1_AGS-1_PPS_3 (2016 1884,1849 1785) (frame '1'; 195 bp, 65 residues) 1 YYVSVHDTRS PNTTCRFVTP DPLISFYQFI KPSFLPRHHP SHYFILTKRL GFYKIWDSIT 61 SSCLI- AGS-2 (2787 2018,1978 1265,1163 886,826 789) SCR (e 0.947 d 0.000 a 0.000,e 0.834 d 0.900 a 0.000,e 0.743 d 0.000 a 0.000,e 0.833) Exon 1 2787 2018 ( 770 n); score: 0.947 Intron 1 2017 1979 ( 39 n); Pd: 0.000 Pa: 0.000 Exon 2 1978 1265 ( 714 n); score: 0.834 Intron 2 1264 1164 ( 101 n); Pd: 0.900 Pa: 0.000 Exon 3 1163 886 ( 278 n); score: 0.743 Intron 3 885 827 ( 59 n); Pd: 0.000 Pa: 0.000 Exon 4 826 789 ( 38 n); score: 0.833 PGS (1589 1265,1163 886,826 789) SGN-E550322+ PGS (1580 1265,1163 886,826 789) SGN-E550212+ PGS (1580 1265,1163 886,826 789) SGN-E550065+ PGS (1580 1265,1163 886,826 789) SGN-E550201+ PGS (1580 1265,1163 886,826 789) SGN-E550207+ PGS (1580 1265,1163 886,826 789) SGN-E390013+ PGS (1580 1265,1163 886,826 789) SGN-E550484+ PGS (1580 1265,1163 886,826 789) SGN-E550211+ PGS (1580 1265,1163 886,826 789) SGN-E549941+ PGS (1580 1265,1163 886,826 789) SGN-E550025+ PGS (1580 1265,1163 886,826 789) SGN-E396056+ PGS (1579 1265,1163 886,826 789) SGN-E377133+ PGS (1525 1265,1163 886,826 789) SGN-E377132- PGS (1580 1265,1163 886,826 803) SGN-E550127- PGS (1580 1265,1163 886,826 803) SGN-E550140- PGS (1906 1554) SGN-E578076- PGS (2200 2018,1978 1593) SGN-E347579- PGS (2498 2162) SGN-E357033+ PGS (2407 2172) SGN-E391780- PGS (2650 2173) SGN-E246710- PGS (2787 2197) SGN-E546506+ 3-phase translation of AGS-2 (-strand): . . . . . . 2787 CCCAAAATCTGGAAGTCATCATCACAAGAACATCTACGATCAAATGACTAAACTAAGAGT P K I W K S S S Q E H L R S N D - T K S P K S G S H H H K N I Y D Q M T K L R V Q N L E V I I T R T S T I K - L N - E . . . . . . 2727 ATTCTAAAAGCTAAAAATACATAAGAAGTTAGTCCATGCCGGAAGTTCAAGGCATCAAGA I L K A K N T - E V S P C R K F K A S R F - K L K I H K K L V H A G S S R H Q D Y S K S - K Y I R S - S M P E V Q G I K . . . . . . 2667 CTTGAAGAAGAAGATCCAGTCCAAGCTAGAGGCATTAGCTTACCCTGAATTTTCGATGTA L E E E D P V Q A R G I S L P - I F D V L K K K I Q S K L E A L A Y P E F S M - T - R R R S S P S - R H - L T L N F R C . . . . . . 2607 GTAAGACTGGCTTGAATTACTGTTGAGTTGAGGACGATGACACGTTTGCTGCACTCCACA V R L A - I T V E L R T M T R L L H S T - D W L E L L L S - G R - H V C C T P Q S K T G L N Y C - V E D D D T F A A L H . . . . . . 2547 AATAAACAAGAAGAAAACATAAAAGTAGGGGTCAGTACAAAACACGGGTACTGAGTAGAT N K Q E E N I K V G V S T K H G Y - V D I N K K K T - K - G S V Q N T G T E - I K - T R R K H K S R G Q Y K T R V L S R . . . . . . 2487 ATCATCGGCCAACTACAAATAGAAAACAATATATACCAAGTAATATCATAAAATCAACTA I I G Q L Q I E N N I Y Q V I S - N Q L S S A N Y K - K T I Y T K - Y H K I N Y Y H R P T T N R K Q Y I P S N I I K S T . . . . . . 2427 TGATACTCAACATGTAGCAACAACAAGCACTATCTCATTAACAGTTACCGTCAAGTTCAC - Y S T C S N N K H Y L I N S Y R Q V H D T Q H V A T T S T I S L T V T V K F T M I L N M - Q Q Q A L S H - Q L P S S S . . . . . . 2367 ACATGAGGACTCAAGCCTCAATACCATACTCATTTGGGAATCATGTTCATTAGATTGAGT T - G L K P Q Y H T H L G I M F I R L S H E D S S L N T I L I W E S C S L D - V H M R T Q A S I P Y S F G N H V H - I E . . . . . . 2307 ATATTAACATCTTTCAAGATTCATGATCTTTATTTCTCTTGTGTCGGTACGTGACACTCC I L T S F K I H D L Y F S C V G T - H S Y - H L S R F M I F I S L V S V R D T P Y I N I F Q D S - S L F L L C R Y V T L . . . . . . 2247 GCTCCCTCATATTCATTAATCCTCTTGTGTCGGTACGTGACACTCCGATCCCCTAAATCT A P S Y S L I L L C R Y V T L R S P K S L P H I H - S S C V G T - H S D P L N L R S L I F I N P L V S V R D T P I P - I . . . . . . 2187 ACGTGTCGGTTCGTGACACCCGATCCCCTAAATCTACGTATCGGTTCGTGACACCCGTTC T C R F V T P D P L N L R I G S - H P F R V G S - H P I P - I Y V S V R D T R S Y V S V R D T R S P K S T Y R F V T P V . . . . . . 2127 CCCTAAATCTACATGTCGGTTCGTGACACCCGGTCCCCTAATTCTACGTGTCGGTTCGTG P - I Y M S V R D T R S P N S T C R F V P K S T C R F V T P G P L I L R V G S - P L N L H V G S - H P V P - F Y V S V R . . . . . : . 2067 ACACCCAATCCCCTAATTCTACGTGTCGGTTCGTGACACCCGATCCCCTA : TACGTGTCGG T P N P L I L R V G S - H P I P Y : T C R H P I P - F Y V S V R D T R S P : I R V G D T Q S P N S T C R F V T P D P L : Y V S . . . . . . 1968 TTCGTGACACCCGATCCCCTAATCTCCTTCTATCAATTCATCAAGCCTTCTTTCTTACCA F V T P D P L I S F Y Q F I K P S F L P S - H P I P - S P S I N S S S L L S Y Q V R D T R S P N L L L S I H Q A F F L T . . . . . . 1908 AGGCATCATCCATCCCATTATTTTAGTTCATCACGCCTTTTTTTATACCAAGGTCTCATT R H H P S H Y F S S S R L F L Y Q G L I G I I H P I I L V H H A F F Y T K V S L K A S S I P L F - F I T P F F I P R S H . . . . . . 1848 ATTAACAAAGAGATTAGGATTTTACAAGATTTGGGATTCAATAACTTCATCATGCTTAAT I N K E I R I L Q D L G F N N F I M L N L T K R L G F Y K I W D S I T S S C L I Y - Q R D - D F T R F G I Q - L H H A - . . . . . . 1788 ATAATCACAATTATATAATCATGTTCATGCATGCATACAATTAAGCACATAGCAGGGTTT I I T I I - S C S C M H T I K H I A G F - S Q L Y N H V H A C I Q L S T - Q G L Y N H N Y I I M F M H A Y N - A H S R V . . . . . . 1728 ACAATACTATCAATACATATCATTCTCTATTAAGAGTTTACTATGAAAGCATGAAAACCA T I L S I H I I L Y - E F T M K A - K P Q Y Y Q Y I S F S I K S L L - K H E N H Y N T I N T Y H S L L R V Y Y E S M K T . . . . . . 1668 TAACCTACCTCCACCGAAGATTCGTGATCAAGCAAGCAAATTTTCTCAAAGCTTTGTGTT - P T S T E D S - S S K Q I F S K L C V N L P P P K I R D Q A S K F S Q S F V F I T Y L H R R F V I K Q A N F L K A L C . . . . . . 1608 TTTCCCCTTCTCGATCGTCTCTCTCTCTATCGATTCCCTTCTCTCTCTTTCTCTTGTTCT F P L L D R L S L Y R F P S L S F S C S F P F S I V S L S I D S L L S L S L V L F S P S R S S L S L S I P F S L F L L F . . . . . . 1548 TTCTATTTTCTTTATTCAAACCCTCTTTCTTTTACCCTAATTAGTATATAATTAAGAATA F Y F L Y S N P L S F T L I S I - L R I S I F F I Q T L F L L P - L V Y N - E - F L F S L F K P S F F Y P N - Y I I K N . . . . . . 1488 AAAGATGACAATAATAGCCCACTAATTAACTTAAGGTTACCTCTTTTATTCCCCCAAGAA K D D N N S P L I N L R L P L L F P Q E K M T I I A H - L T - G Y L F Y S P K K K R - Q - - P T N - L K V T S F I P P R . . . . . . 1428 ATTGAGTTATTAATATAGACCCACGAAATATATAATTATAGCAGGAATAGTCCAAAACGC I E L L I - T H E I Y N Y S R N S P K R L S Y - Y R P T K Y I I I A G I V Q N A N - V I N I D P R N I - L - Q E - S K T . . . . . . 1368 CCCTTTAAAACTTAACCAGAATTCCGACTTCAACTGGGATTACGCAACCTGTGACGGCCC P F K T - P E F R L Q L G L R N L - R P P L K L N Q N S D F N W D Y A T C D G P P L - N L T R I P T S T G I T Q P V T A . . . . . : . 1308 GTCGCGCCTGCGACGGTCCGTCATGCAGGTTCGTCAGAGATTCG : GAGTTGGAGTGTTTTG V A P A T V R H A G S S E I R : S W S V L S R L R R S V M Q V R Q R F : G V G V F - R R A C D G P S C R F V R D S : E L E C F . . . . . . 1147 AAACGGTGGATCACGACGGTTCATCGTGCCTGTGACGGTCCGTCCTGCAGGTCCGTCACA K R W I T T V H R A C D G P S C R S V T N G G S R R F I V P V T V R P A G P S Q E T V D H D G S S C L - R S V L Q V R H . . . . . . 1087 GAGTTCAGAGAGTCAATTTCAGCACCCAAATTTCAGAATTTCTAAGTGTTTTGGGACGAA E F R E S I S A P K F Q N F - V F W D E S S E S Q F Q H P N F R I S K C F G T K R V Q R V N F S T Q I S E F L S V L G R . . . . . . 1027 ACACCCTCGACGGTCCGTCGTGCCCATGACGTTCCGTCATGCCCATGACGTTCCGTCGTG T P S T V R R A H D V P S C P - R S V V H P R R S V V P M T F R H A H D V P S W N T L D G P S C P - R S V M P M T F R R . . . . . . 967 GGTTCCGTCGTCTCAGCCTGTTTTTCCAGAAATAAAATCTGCTGCTCAAAACAACTAAAC G S V V S A C F S R N K I C C S K Q L N V P S S Q P V F P E I K S A A Q N N - T G F R R L S L F F Q K - N L L L K T T K . . . : . . . 907 AGGTCGTTACAAAATATTTTTT : TTTTTCTTAATTATTATTATTATTATTATTTTATAACA R S L Q N I F : F F L N Y Y Y Y Y Y F I T G R Y K I F F : F F L I I I I I I I L - Q V V T K Y F F : F S - L L L L L L F Y N Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-7-_PGL-1_AGS-2_PPS_1 (1744 1505) (frame '0'; 237 bp, 79 residues) 1 AHSRVYNTIN TYHSLLRVYY ESMKTITYLH RRFVIKQANF LKALCFSPSR SSLSLSIPFS 61 LFLLFFLFSL FKPSFFYPN- >C06HBa0153O03.1-7-_PGL-1_AGS-2_PPS_2 (1146 910) (frame '2'; 234 bp, 78 residues) 1 NGGSRRFIVP VTVRPAGPSQ SSESQFQHPN FRISKCFGTK HPRRSVVPMT FRHAHDVPSW 61 VPSSQPVFPE IKSAAQNN- >C06HBa0153O03.1-7-_PGL-1_AGS-2_PPS_3 (2031 2018,1978 1771) (frame '1'; 219 bp, 73 residues) 1 HPIPYTCRFV TPDPLISFYQ FIKPSFLPRH HPSHYFSSSR LFLYQGLIIN KEIRILQDLG 61 FNNFIMLNII TII- AGS-3 (1580 1265,1160 886,826 789) SCR (e 0.833 d 0.900 a 0.000,e 0.738 d 0.000 a 0.000,e 0.724) Exon 1 1580 1265 ( 316 n); score: 0.833 Intron 1 1264 1161 ( 104 n); Pd: 0.900 Pa: 0.000 Exon 2 1160 886 ( 275 n); score: 0.738 Intron 2 885 827 ( 59 n); Pd: 0.000 Pa: 0.000 Exon 3 826 789 ( 38 n); score: 0.724 PGS (1580 1265,1160 886,826 789) SGN-E550335+ PGS (1579 1265,1160 990) SGN-E231589+ 3-phase translation of AGS-3 (-strand): . . . . . . 1580 ATCGATTCCCTTCTCTCTCTTTCTCTTGTTCTTTCTATTTTCTTTATTCAAACCCTCTTT I D S L L S L S L V L S I F F I Q T L F S I P F S L F L L F F L F S L F K P S F R F P S L S F S C S F Y F L Y S N P L . . . . . . 1520 CTTTTACCCTAATTAGTATATAATTAAGAATAAAAGATGACAATAATAGCCCACTAATTA L L P - L V Y N - E - K M T I I A H - L F Y P N - Y I I K N K R - Q - - P T N - S F T L I S I - L R I K D D N N S P L I . . . . . . 1460 ACTTAAGGTTACCTCTTTTATTCCCCCAAGAAATTGAGTTATTAATATAGACCCACGAAA T - G Y L F Y S P K K L S Y - Y R P T K L K V T S F I P P R N - V I N I D P R N N L R L P L L F P Q E I E L L I - T H E . . . . . . 1400 TATATAATTATAGCAGGAATAGTCCAAAACGCCCCTTTAAAACTTAACCAGAATTCCGAC Y I I I A G I V Q N A P L K L N Q N S D I - L - Q E - S K T P L - N L T R I P T I Y N Y S R N S P K R P F K T - P E F R . . . . . . 1340 TTCAACTGGGATTACGCAACCTGTGACGGCCCGTCGCGCCTGCGACGGTCCGTCATGCAG F N W D Y A T C D G P S R L R R S V M Q S T G I T Q P V T A R R A C D G P S C R L Q L G L R N L - R P V A P A T V R H A . . : . . . . 1280 GTTCGTCAGAGATTCG : TTGGAGTGTTTTGAAACGGTGGATCACGACGGTTCATCGTGCCT V R Q R F : V G V F - N G G S R R F I V P F V R D S : L E C F E T V D H D G S S C L G S S E I R : W S V L K R W I T T V H R A . . . . . . 1116 GTGACGGTCCGTCCTGCAGGTCCGTCACAGAGTTCAGAGAGTCAATTTCAGCACCCAAAT V T V R P A G P S Q S S E S Q F Q H P N - R S V L Q V R H R V Q R V N F S T Q I C D G P S C R S V T E F R E S I S A P K . . . . . . 1056 TTCAGAATTTCTAAGTGTTTTGGGACGAAACACCCTCGACGGTCCGTCGTGCCCATGACG F R I S K C F G T K H P R R S V V P M T S E F L S V L G R N T L D G P S C P - R F Q N F - V F W D E T P S T V R R A H D . . . . . . 996 TTCCGTCATGCCCATGACGTTCCGTCGTGGGTTCCGTCGTCTCAGCCTGTTTTTCCAGAA F R H A H D V P S W V P S S Q P V F P E S V M P M T F R R G F R R L S L F F Q K V P S C P - R S V V G S V V S A C F S R . . . . . . : 936 ATAAAATCTGCTGCTCAAAACAACTAAACAGGTCGTTACAAAATATTTTTT : TTTTTCTTA I K S A A Q N N - T G R Y K I F F : F F L - N L L L K T T K Q V V T K Y F F : F S - N K I C C S K Q L N R S L Q N I F : F F L . . . 817 ATTATTATTATTATTATTATTTTATAACA I I I I I I I L - L L L L L L F Y N N Y Y Y Y Y Y F I T Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-7-_PGL-1_AGS-3_PPS_1 (1146 910) (frame '1'; 234 bp, 78 residues) 1 NGGSRRFIVP VTVRPAGPSQ SSESQFQHPN FRISKCFGTK HPRRSVVPMT FRHAHDVPSW 61 VPSSQPVFPE IKSAAQNN- AGS-4 (2494 1265,1159 1012,973 891) SCR (e 0.894 d 0.900 a 0.000,e 0.732 d 0.000 a 0.000,e 0.904) Exon 1 2494 1265 (1230 n); score: 0.894 Intron 1 1264 1160 ( 105 n); Pd: 0.900 Pa: 0.000 Exon 2 1159 1012 ( 148 n); score: 0.732 Intron 2 1011 974 ( 38 n); Pd: 0.000 Pa: 0.000 Exon 3 973 891 ( 83 n); score: 0.904 PGS (1133 1012,973 891) SGN-E252199- PGS (1726 1265,1159 990) SGN-E349296- PGS (1531 1265,1159 1036) SGN-E356257- PGS (2287 1687) SGN-E349726- PGS (2045 1687) SGN-E357559- PGS (2494 1850) SGN-E349977- 3-phase translation of AGS-4 (-strand): . . . . . . 2494 AGTAGATATCATCGGCCAACTACAAATAGAAAACAATATATACCAAGTAATATCATAAAA S R Y H R P T T N R K Q Y I P S N I I K V D I I G Q L Q I E N N I Y Q V I S - N - I S S A N Y K - K T I Y T K - Y H K . . . . . . 2434 TCAACTATGATACTCAACATGTAGCAACAACAAGCACTATCTCATTAACAGTTACCGTCA S T M I L N M - Q Q Q A L S H - Q L P S Q L - Y S T C S N N K H Y L I N S Y R Q I N Y D T Q H V A T T S T I S L T V T V . . . . . . 2374 AGTTCACACATGAGGACTCAAGCCTCAATACCATACTCATTTGGGAATCATGTTCATTAG S S H M R T Q A S I P Y S F G N H V H - V H T - G L K P Q Y H T H L G I M F I R K F T H E D S S L N T I L I W E S C S L . . . . . . 2314 ATTGAGTATATTAACATCTTTCAAGATTCATGATCTTTATTTCTCTTGTGTCGGTACGTG I E Y I N I F Q D S - S L F L L C R Y V L S I L T S F K I H D L Y F S C V G T - D - V Y - H L S R F M I F I S L V S V R . . . . . . 2254 ACACTCCGCTCCCTCATATTCATTAATCCTCTTGTGTCGGTACGTGACACTCCGATCCCC T L R S L I F I N P L V S V R D T P I P H S A P S Y S L I L L C R Y V T L R S P D T P L P H I H - S S C V G T - H S D P . . . . . . 2194 TAAATCTACGTGTCGGTTCGTGACACCCGATCCCCTAAATCTACGTATCGGTTCGTGACA - I Y V S V R D T R S P K S T Y R F V T K S T C R F V T P D P L N L R I G S - H L N L R V G S - H P I P - I Y V S V R D . . . . . . 2134 CCCGTTCCCCTAAATCTACATGTCGGTTCGTGACACCCGGTCCCCTAATTCTACGTGTCG P V P L N L H V G S - H P V P - F Y V S P F P - I Y M S V R D T R S P N S T C R T R S P K S T C R F V T P G P L I L R V . . . . . . 2074 GTTCGTGACACCCAATCCCCTAATTCTACGTGTCGGTTCGTGACACCCGATCCCCTAATA V R D T Q S P N S T C R F V T P D P L I F V T P N P L I L R V G S - H P I P - Y G S - H P I P - F Y V S V R D T R S P N . . . . . . 2014 CTACGTGTCGGTTCATGACACCCGATCCCCTAATACTACGTGTCGGTTCGTGACACCCGA L R V G S - H P I P - Y Y V S V R D T R Y V S V H D T R S P N T T C R F V T P D T T C R F M T P D P L I L R V G S - H P . . . . . . 1954 TCCCCTAATCTCCTTCTATCAATTCATCAAGCCTTCTTTCTTACCAAGGCATCATCCATC S P N L L L S I H Q A F F L T K A S S I P L I S F Y Q F I K P S F L P R H H P S I P - S P S I N S S S L L S Y Q G I I H . . . . . . 1894 CCATTATTTTAGTTCATCACGCCTTTTTTTATACCAAGGTCTCATTATTAACAAAGAGAT P L F - F I T P F F I P R S H Y - Q R D H Y F S S S R L F L Y Q G L I I N K E I P I I L V H H A F F Y T K V S L L T K R . . . . . . 1834 TAGGATTTTACAAGATTTGGGATTCAATAACTTCATCATGCTTAATATAATCACAATTAT - D F T R F G I Q - L H H A - Y N H N Y R I L Q D L G F N N F I M L N I I T I I L G F Y K I W D S I T S S C L I - S Q L . . . . . . 1774 ATAATCATGTTCATGCATGCATACAATTAAGCACATAGCAGGGTTTACAATACTATCAAT I I M F M H A Y N - A H S R V Y N T I N - S C S C M H T I K H I A G F T I L S I Y N H V H A C I Q L S T - Q G L Q Y Y Q . . . . . . 1714 ACATATCATTCTCTATTAAGAGTTTACTATGAAAGCATGAAAACCATAACCTACCTCCAC T Y H S L L R V Y Y E S M K T I T Y L H H I I L Y - E F T M K A - K P - P T S T Y I S F S I K S L L - K H E N H N L P P . . . . . . 1654 CGAAGATTCGTGATCAAGCAAGCAAATTTTCTCAAAGCTTTGTGTTTTTCCCCTTCTCGA R R F V I K Q A N F L K A L C F S P S R E D S - S S K Q I F S K L C V F P L L D P K I R D Q A S K F S Q S F V F F P F S . . . . . . 1594 TCGTCTCTCTCTCTATCGATTCCCTTCTCTCTCTTTCTCTTGTTCTTTCTATTTTCTTTA S S L S L S I P F S L F L L F F L F S L R L S L Y R F P S L S F S C S F Y F L Y I V S L S I D S L L S L S L V L S I F F . . . . . . 1534 TTCAAACCCTCTTTCTTTTACCCTAATTAGTATATAATTAAGAATAAAAGATGACAATAA F K P S F F Y P N - Y I I K N K R - Q - S N P L S F T L I S I - L R I K D D N N I Q T L F L L P - L V Y N - E - K M T I . . . . . . 1474 TAGCCCACTAATTAACTTAAGGTTACCTCTTTTATTCCCCCAAGAAATTGAGTTATTAAT - P T N - L K V T S F I P P R N - V I N S P L I N L R L P L L F P Q E I E L L I I A H - L T - G Y L F Y S P K K L S Y - . . . . . . 1414 ATAGACCCACGAAATATATAATTATAGCAGGAATAGTCCAAAACGCCCCTTTAAAACTTA I D P R N I - L - Q E - S K T P L - N L - T H E I Y N Y S R N S P K R P F K T - Y R P T K Y I I I A G I V Q N A P L K L . . . . . . 1354 ACCAGAATTCCGACTTCAACTGGGATTACGCAACCTGTGACGGCCCGTCGCGCCTGCGAC T R I P T S T G I T Q P V T A R R A C D P E F R L Q L G L R N L - R P V A P A T N Q N S D F N W D Y A T C D G P S R L R . . . : . . . 1294 GGTCCGTCATGCAGGTTCGTCAGAGATTCG : TGGAGTGTTTTGAAACGGTGGATCACGACG G P S C R F V R D S : W S V L K R W I T T V R H A G S S E I R : G V F - N G G S R R R S V M Q V R Q R F : V E C F E T V D H D . . . . . . 1129 GTTCATCGTGCCTGTGACGGTCCGTCCTGCAGGTCCGTCACAGAGTTCAGAGAGTCAATT V H R A C D G P S C R S V T E F R E S I F I V P V T V R P A G P S Q S S E S Q F G S S C L - R S V L Q V R H R V Q R V N . . . . . . : 1069 TCAGCACCCAAATTTCAGAATTTCTAAGTGTTTTGGGACGAAACACCCTCGACGGTCC : GT S A P K F Q N F - V F W D E T P S T V : R Q H P N F R I S K C F G T K H P R R S : V F S T Q I S E F L S V L G R N T L D G P : . . . . . . 971 CGTGGGTTCCGTCGTCTCAGCCTGTTTTTCCAGAAATAAAATCTGCTGCTCAAAACAACT R G F R R L S L F F Q K - N L L L K T T V G S V V S A C F S R N K I C C S K Q L S W V P S S Q P V F P E I K S A A Q N N . . . 911 AAACAGGTCGTTACAAAATAT K Q V V T K Y N R S L Q N - T G R Y K I Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-7-_PGL-1_AGS-4_PPS_1 (2016 1771) (frame '2'; 243 bp, 81 residues) 1 YYVSVHDTRS PNTTCRFVTP DPLISFYQFI KPSFLPRHHP SHYFSSSRLF LYQGLIINKE 61 IRILQDLGFN NFIMLNIITI I- >C06HBa0153O03.1-7-_PGL-1_AGS-4_PPS_2 (1744 1505) (frame '1'; 237 bp, 79 residues) 1 AHSRVYNTIN TYHSLLRVYY ESMKTITYLH RRFVIKQANF LKALCFSPSR SSLSLSIPFS 61 LFLLFFLFSL FKPSFFYPN- >C06HBa0153O03.1-7-_PGL-1_AGS-4_PPS_3 (1146 1012,973 893) (frame '2'; 216 bp, 72 residues) 1 NGGSRRFIVP VTVRPAGPSQ SSESQFQHPN FRISKCFGTK HPRRSVVGSV VSACFSRNKI 61 CCSKQLNRSL QN AGS-5 (1580 1294,1110 1059,1008 968) SCR (e 0.847 d 0.000 a 0.000,e 0.846 d 0.000 a 0.000,e 0.805) Exon 1 1580 1294 ( 287 n); score: 0.847 Intron 1 1293 1111 ( 183 n); Pd: 0.000 Pa: 0.000 Exon 2 1110 1059 ( 52 n); score: 0.846 Intron 2 1058 1009 ( 50 n); Pd: 0.000 Pa: 0.000 Exon 3 1008 968 ( 41 n); score: 0.805 PGS (1580 1294,1110 1059,1008 968) SGN-E396070+ 3-phase translation of AGS-5 (-strand): . . . . . . 1580 ATCGATTCCCTTCTCTCTCTTTCTCTTGTTCTTTCTATTTTCTTTATTCAAACCCTCTTT I D S L L S L S L V L S I F F I Q T L F S I P F S L F L L F F L F S L F K P S F R F P S L S F S C S F Y F L Y S N P L . . . . . . 1520 CTTTTACCCTAATTAGTATATAATTAAGAATAAAAGATGACAATAATAGCCCACTAATTA L L P - L V Y N - E - K M T I I A H - L F Y P N - Y I I K N K R - Q - - P T N - S F T L I S I - L R I K D D N N S P L I . . . . . . 1460 ACTTAAGGTTACCTCTTTTATTCCCCCAAGAAATTGAGTTATTAATATAGACCCACGAAA T - G Y L F Y S P K K L S Y - Y R P T K L K V T S F I P P R N - V I N I D P R N N L R L P L L F P Q E I E L L I - T H E . . . . . . 1400 TATATAATTATAGCAGGAATAGTCCAAAACGCCCCTTTAAAACTTAACCAGAATTCCGAC Y I I I A G I V Q N A P L K L N Q N S D I - L - Q E - S K T P L - N L T R I P T I Y N Y S R N S P K R P F K T - P E F R . . . . . : . 1340 TTCAACTGGGATTACGCAACCTGTGACGGCCCGTCGCGCCTGCGACG : GTCCGTCCTGCAG F N W D Y A T C D G P S R L R R : S V L Q S T G I T Q P V T A R R A C D : G P S C R L Q L G L R N L - R P V A P A T : V R P A . . . . : . . 1097 GTCCGTCACAGAGTTCAGAGAGTCAATTTCAGCACCCAA : GTGCCCATGACGTTCCGTCAT V R H R V Q R V N F S T Q : V P M T F R H S V T E F R E S I S A P K : C P - R S V M G P S Q S S E S Q F Q H P : S A H D V P S . . 987 GCCCATGACGTTCCGTCGTG A H D V P S P M T F R R C P - R S V V Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-7-_PGL-1_AGS-5_PPS_1 (1415 1294,1110 1059,1008 970) (frame '1'; 213 bp, 71 residues) 1 YRPTKYIIIA GIVQNAPLKL NQNSDFNWDY ATCDGPSRLR RSVLQVRHRV QRVNFSTQVP 61 MTFRHAHDVP S PGL 2 (- strand): 8054 5767 AGS-1 (8054 7914,6772 6501,6321 6118,6017 5767) SCR (e 1.000 d 0.998 a 1.000,e 1.000 d 0.992 a 0.999,e 1.000 d 1.000 a 1.000,e 1.000) Exon 1 8054 7914 ( 141 n); score: 1.000 Intron 1 7913 6773 (1141 n); Pd: 0.998 Pa: 1.000 Exon 2 6772 6501 ( 272 n); score: 1.000 Intron 2 6500 6322 ( 179 n); Pd: 0.992 Pa: 0.999 Exon 3 6321 6118 ( 204 n); score: 1.000 Intron 3 6117 6018 ( 100 n); Pd: 1.000 Pa: 1.000 Exon 4 6017 5767 ( 251 n); score: 1.000 PGS (6578 6501,6321 6118,6017 5767) SGN-E351583- PGS (6187 6118,6017 5774) SGN-E217332- PGS (8022 7914,6772 6501,6321 6118,6017 5920) SGN-E262562+ PGS (8031 7914,6772 6501,6321 6118,6017 5939) SGN-E287025+ PGS (8007 7914,6772 6501,6321 6118,6017 5949) SGN-E347850+ PGS (7967 7914,6772 6501,6321 6118,6017 5956) SGN-E348651+ PGS (7967 7914,6772 6501,6321 6118,6017 5956) SGN-E339213+ PGS (6607 6501,6321 6118,6017 5961) SGN-E353078+ PGS (7967 7914,6772 6501,6321 6118,6017 5962) SGN-E348708+ PGS (7967 7914,6772 6501,6321 6118,6017 5962) SGN-E348496+ PGS (7967 7914,6772 6501,6321 6118,6017 5962) SGN-E539246+ PGS (7967 7914,6772 6501,6321 6118,6017 5962) SGN-E339028+ PGS (6772 6501,6321 6118,6017 5962) SGN-E347192+ PGS (6766 6501,6321 6118,6017 5962) SGN-E556577+ PGS (6745 6501,6321 6118,6017 5962) SGN-E325615+ PGS (6727 6501,6321 6118,6017 5962) SGN-E283479+ PGS (6726 6501,6321 6118,6017 5962) SGN-E296307- PGS (6726 6501,6321 6118,6017 5962) SGN-E296445- PGS (6715 6501,6321 6118,6017 5962) SGN-E350031+ PGS (6685 6501,6321 6118,6017 5962) SGN-E342485+ PGS (6670 6501,6321 6118,6017 5962) SGN-E350864+ PGS (6607 6501,6321 6118,6017 5962) SGN-E357024+ PGS (6603 6501,6321 6118,6017 5962) SGN-E334977+ PGS (6586 6501,6321 6118,6017 5962) SGN-E292322- PGS (6579 6501,6321 6118,6017 5962) SGN-E350734+ PGS (6577 6501,6321 6118,6017 5962) SGN-E348674+ PGS (6562 6501,6321 6118,6017 5962) SGN-E283583- PGS (6562 6501,6321 6118,6017 5962) SGN-E272853+ PGS (6529 6501,6321 6118,6017 5962) SGN-E539243- PGS (6527 6501,6321 6118,6017 5962) SGN-E244303- PGS (6526 6501,6321 6118,6017 5962) SGN-E226355- PGS (6321 6118,6017 5962) SGN-E348200+ PGS (6316 6118,6017 5962) SGN-E229056- PGS (6298 6118,6017 5962) SGN-E226752- PGS (6290 6118,6017 5962) SGN-E294720+ PGS (6290 6118,6017 5962) SGN-E294721+ PGS (7967 7914,6772 6501,6321 6118,6017 5965) SGN-E348271+ PGS (7967 7914,6772 6501,6321 6118,6017 5965) SGN-E342444+ PGS (6542 6501,6321 6118,6017 5965) SGN-E393920- PGS (7967 7914,6772 6501,6321 6118,6017 5966) SGN-E335041+ PGS (6766 6501,6321 6118,6017 5970) SGN-E251833+ PGS (7967 7914,6772 6501,6321 6118,6017 5978) SGN-E334390+ PGS (7967 7914,6772 6501,6321 6118,6017 5979) SGN-E346329+ PGS (7967 7914,6772 6501,6321 6118,6017 6005) SGN-E345879+ PGS (7967 7914,6772 6501,6321 6118,6017 6009) SGN-E328566+ PGS (8044 7914,6772 6501,6321 6118,6017 6011) SGN-E320513+ PGS (7967 7914,6772 6501,6321 6118) SGN-E292203+ PGS (6745 6501,6321 6127) SGN-E325676+ PGS (7967 7914,6772 6501,6321 6136) SGN-E304505+ PGS (7967 7914,6772 6501,6321 6138) SGN-E311109+ PGS (8045 7914,6772 6501,6321 6139) SGN-E272678+ PGS (8031 7914,6772 6501,6321 6139) SGN-E298350+ PGS (7967 7914,6772 6501,6321 6139) SGN-E335097+ PGS (7967 7914,6772 6501,6321 6140) SGN-E348912+ PGS (7967 7914,6772 6501,6321 6145) SGN-E335466+ PGS (7967 7914,6772 6501,6321 6149) SGN-E276058+ PGS (6772 6501,6321 6149) SGN-E283582+ PGS (6760 6501,6321 6152) SGN-E284368+ PGS (8020 7914,6772 6501,6321 6188) SGN-E338649+ PGS (7967 7914,6772 6501,6321 6198) SGN-E244404+ PGS (7967 7914,6772 6501,6321 6200) SGN-E319578+ PGS (7967 7914,6772 6501,6321 6200) SGN-E308687+ PGS (7967 7914,6772 6501,6321 6200) SGN-E288197+ PGS (7967 7914,6772 6501,6321 6202) SGN-E290714+ PGS (7967 7914,6772 6501,6321 6206) SGN-E394069+ PGS (7967 7914,6772 6501,6321 6209) SGN-E246370+ PGS (7967 7914,6772 6501,6321 6210) SGN-E205675+ PGS (7967 7914,6772 6501,6321 6217) SGN-E324787+ PGS (8023 7914,6772 6501,6321 6219) SGN-E550930+ PGS (8022 7914,6772 6501,6321 6223) SGN-E269754+ PGS (8045 7914,6772 6501,6321 6235) SGN-E248667+ PGS (8023 7914,6772 6501,6321 6237) SGN-E286676+ PGS (8031 7914,6772 6501,6321 6238) SGN-E277497+ PGS (7967 7914,6772 6501,6321 6243) SGN-E319581+ PGS (7967 7914,6772 6501,6321 6244) SGN-E306428+ PGS (7962 7914,6772 6501,6321 6248) SGN-E276054+ PGS (7967 7914,6772 6501,6321 6252) SGN-E319734+ PGS (7967 7914,6772 6501,6321 6253) SGN-E322500+ PGS (8031 7914,6772 6501,6321 6255) SGN-E295487+ PGS (8023 7914,6772 6501,6321 6255) SGN-E304731+ PGS (6675 6501,6321 6269) SGN-E239858+ PGS (8044 7914,6772 6559) SGN-E320800+ PGS (8054 7914,6772 6560) SGN-E280221+ PGS (8054 7914,6772 6694) SGN-E277843+ 3-phase translation of AGS-1 (-strand): . . . . . . 8054 CGACACCTCCTTAGACCTATCAGAAAAACAGGAAAAATGGAAGAAGCTTCAGTAGTAGCA R H L L R P I R K T G K M E E A S V V A D T S L D L S E K Q E K W K K L Q - - Q T P P - T Y Q K N R K N G R S F S S S . . . . . . 7994 GTGGACAACCAAAAGCCGCAGCAAGAGAAGCCTCACACTGATGTTTTGCTTTTCAATCGT V D N Q K P Q Q E K P H T D V L L F N R W T T K S R S K R S L T L M F C F S I V S G Q P K A A A R E A S H - C F A F Q S . . . : . . . 7934 TGGTCATATGATGATGTTCAG : ATTGCTGATATTTCTGTTGAGGATTACATAACTGCTACT W S Y D D V Q : I A D I S V E D Y I T A T G H M M M F R : L L I F L L R I T - L L L L V I - - C S : D C - Y F C - G L H N C Y . . . . . . 6733 GCTAACAAGCATCCTACATATACACCACACACAGCTGGGAGGTACCAAGCCAAGCGGTTT A N K H P T Y T P H T A G R Y Q A K R F L T S I L H I H H T Q L G G T K P S G L C - Q A S Y I Y T T H S W E V P S Q A V . . . . . . 6673 AGAAAGGCTCAATGCCCAATTGTGGAGAGGTTGACCAACTCACTGATGATGCACGGAAGG R K A Q C P I V E R L T N S L M M H G R E R L N A Q L W R G - P T H - - C T E G - K G S M P N C G E V D Q L T D D A R K . . . . . . 6613 AACAACGGGAAGAAGTTGATGGCCGTTCGTATTATTAAGCATGCTATGGAAATTATCCAT N N G K K L M A V R I I K H A M E I I H T T G R S - W P F V L L S M L W K L S I E Q R E E V D G R S Y Y - A C Y G N Y P . . . . . . : 6553 CTGTTGACTGACCTAAACCCAATCCAAGTGATTGTTGATGCTGTTATCAACAG : TGGACCA L L T D L N P I Q V I V D A V I N S : G P C - L T - T Q S K - L L M L L S T : V D Q S V D - P K P N P S D C - C C Y Q Q : W T . . . . . . 6314 AGAGAAGATGCAACTCGTATAGGTTCTGCTGGTGTTGTGAGGCGACAAGCTGTTGATATT R E D A T R I G S A G V V R R Q A V D I E K M Q L V - V L L V L - G D K L L I F K R R C N S Y R F C W C C E A T S C - Y . . . . . . 6254 TCTCCACTCCGTCGTGTCAACCAAGCAATATATCTCCTCACAACTGGTGCACGTGAGAGT S P L R R V N Q A I Y L L T T G A R E S L H S V V S T K Q Y I S S Q L V H V R V F S T P S C Q P S N I S P H N W C T - E . . . . . . 6194 GCTTTCAGGAACATCAAGACCATAGCAGAATGCCTTGCAGATGAACTCATTAATGCTGCC A F R N I K T I A E C L A D E L I N A A L S G T S R P - Q N A L Q M N S L M L P C F Q E H Q D H S R M P C R - T H - C C . . : . . . . 6134 AAGGGATCTTCCAACAG : CTATGCTATCAAGAAGAAGGATGAGATTGAGAGGGTTGCCAAG K G S S N S : Y A I K K K D E I E R V A K R D L P T : A M L S R R R M R L R G L P R Q G I F Q Q : L C Y Q E E G - D - E G C Q . . . . . . 5974 GCCAATCGTTGAGGGTGCAGTATGGATATTTACTATTGGTTGGACAGTTTTGCTTCGAAA A N R - G C S M D I Y Y W L D S F A S K P I V E G A V W I F T I G W T V L L R N G Q S L R V Q Y G Y L L L V G Q F C F E . . . . . . 5914 CGTTGTTTGTTCTTTATTTTTTAGTTGCTAGAAAGGCATTTTGGAACTAGTAACGAGTTT R C L F F I F - L L E R H F G T S N E F V V C S L F F S C - K G I L E L V T S F T L F V L Y F L V A R K A F W N - - R V . . . . . . 5854 TTCTGTTTGGGAAACTTGGAGTACATTGGTATTGAATATTATATGGGGGAATTAGCAAAA F C L G N L E Y I G I E Y Y M G E L A K S V W E T W S T L V L N I I W G N - Q K F L F G K L G V H W Y - I L Y G G I S K . . . 5794 AGCAAAATTGAGCTTGCTGTTATTCTCA S K I E L A V I L A K L S L L L F S K Q N - A C C Y S Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-7-_PGL-2_AGS-1_PPS_1 (8054 7914,6772 6501,6321 6118,6017 5963) (frame '1'; 669 bp, 223 residues) 1 RHLLRPIRKT GKMEEASVVA VDNQKPQQEK PHTDVLLFNR WSYDDVQIAD ISVEDYITAT 61 ANKHPTYTPH TAGRYQAKRF RKAQCPIVER LTNSLMMHGR NNGKKLMAVR IIKHAMEIIH 121 LLTDLNPIQV IVDAVINSGP REDATRIGSA GVVRRQAVDI SPLRRVNQAI YLLTTGARES 181 AFRNIKTIAE CLADELINAA KGSSNSYAIK KKDEIERVAK ANR- ... finished at: Mon Aug 28 22:23:22 2006 ________________________________________________________________________________ Sequence 8: C06HBa0153O03.1-8, from 1 to 3825, both strands analyzed. ... started at: Mon Aug 28 22:23:22 2006 EST library file: /tmp/cxgn-bacpublish-resources-T4dD4R/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 24 EST library file: /tmp/cxgn-bacpublish-resources-T4dD4R/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 6 ... matches indexed, elapsed seconds = 7 HitsTableSize = 7 ******************************************************************************** EST sequence 25 -strand 489 n (File: SGN-E553348-) 1 TTTTTTAAAA AAGGGCCCCC GAAATTTTTT TTACCCCCCC CTTGGGGGGA AATGGGGCCC 61 CCCCCCCCCC CCCAGGTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT 121 TTTTTAACGA TTAAAAGAAC ATAAAAACAT TATCATTAAA TCAGAAACAA TTTGTGTAAA 181 GATTTGTTCT TTACTGTTAG TATTTCAAAT ACTTAACTTA TATTCCATTT GTAACAGAAA 241 AAAGAAAAAT GACATATTCA TGTCAAACAA ATGCTGGAAC AGTAAGTGCT GCAACAGTAA 301 GTGCTGCAAC ATCATCGGTT GCACATTATC GTTCAAATAT TGCCTTAGCG TCGGCTGAGA 361 AACCTGCAAA ATTTTTTGGA GTTGACTTTA AGAGATGGCA ACAAAAGGTG TCCTTCTATC 421 TCACTACGTT GAGTCTGCAG AAGTTCATTA ATGAGAATGT TCCTGTTATG TCAGATGAAA 481 CTCCGCCTG Predicted gene structure (within gDNA segment 1 to 2870): Exon 1 735 773 ( 39 n); cDNA 85 123 ( 39 n); score: 0.769 Intron 1 774 1902 (1129 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.92) Exon 2 1903 2270 ( 368 n); cDNA 124 489 ( 366 n); score: 0.905 MATCH C06HBa0153O03.1-8+ SGN-E553348- 0.905 407 0.832 C PGS_C06HBa0153O03.1-8+_SGN-E553348- (735 773,1903 2270) Alignment (genomic DNA sequence = upper lines): TTTTTCTTAT TTTTTTCCTT CTTTATTTCT TTCTTCTTTA AAAATGACAT GCCATCTTTG 794 ||||| || | |||||| || ||| ||| | || || ||| TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTT. .......... .......... 123 ACGGAACTTC TCCACCTTCG GTGCCTCCAT TTTTGTCAGA ACATTCTATT TCTATTATTC 854 .......... .......... .......... .......... .......... .......... 123 CTTTAATTCT CAACTTTAAA ACCTTAACTA AATATTTTTT TTTACATTTT AAATAATTTT 914 .......... .......... .......... .......... .......... .......... 123 ATAATCCCAT TAGATTTAAA GGTGATTAGT GTTGAGTTTA GCAAGTGTGA ATGAGAAAAG 974 .......... .......... .......... .......... .......... .......... 123 AAAAGAGAGA ATATGAAAAG TGAGGGAACT ATTTTGGAGG GAAAATGAAA AGTCATTTGC 1034 .......... .......... .......... .......... .......... .......... 123 AAAGTGCAAC GAAAAATCAT TTCTCCCATA TTGGCAAAAG AAAGGGAAAT TGTTGTCCTT 1094 .......... .......... .......... .......... .......... .......... 123 ATATAAGGAA ATACTTCCAT TACTTCTTAA AGAGCTAAGA AGAAGATGCC CCTCACATCG 1154 .......... .......... .......... .......... .......... .......... 123 TCATTGCTCG CCCGGCTTCG GCTTCGGCTT CGGCTTCGGA TTTAGATTTG GCAAATGGTT 1214 .......... .......... .......... .......... .......... .......... 123 TGATTGATAA ATTTTTTGGA CAAAATTTAT TTAATCAGTT TTTGTTAAAT CAAATAAATC 1274 .......... .......... .......... .......... .......... .......... 123 CTGTTAATAT TATCTCTTAT AAATTTGCGG ATAACGGTAA CATTTCGAAA AGTTGTTACT 1334 .......... .......... .......... .......... .......... .......... 123 CTTTCCGATA AGTCGTTAAT TTTTGAAAAG CCGTTATTTT TCTAACAGAC ACATTTTTCT 1394 .......... .......... .......... .......... .......... .......... 123 GAAAAGTTGT TATTTTTTCC AAAAGACACA ACTTTCTTGA TAAAACGGGT CTGAACAGAT 1454 .......... .......... .......... .......... .......... .......... 123 TTCTCTGAAC AGACACGTTT CTTGCTGAAA GTGGCTATAA AAGGAAGTCA ATTTTTGATT 1514 .......... .......... .......... .......... .......... .......... 123 TTTCAAACAC TGAAATTTTC CTTCTCTGCA TATATTTTTC TCTCAAATGA ATCAAAGTGT 1574 .......... .......... .......... .......... .......... .......... 123 CGATCGACTG AATTTGTGTG ACTTGTTGCT GTTCTGAAGT TCGTTGAAGT TAAAGAAATT 1634 .......... .......... .......... .......... .......... .......... 123 TGAGGTACCG CTATTTCTTT AACAGGCTTA ATCCATCTTA TCTTGGGAGA AATTAATCCA 1694 .......... .......... .......... .......... .......... .......... 123 TAACCGTGGG TACAATGAGG GGATTAAATT TCTTAAGGAC ACACAGTAGT TTCTGTGGAC 1754 .......... .......... .......... .......... .......... .......... 123 TCGAATTACT TCTTGTATTT ATGTATTTTG TGTTTCATCT TATTTCTGTT TCTGTTAAGA 1814 .......... .......... .......... .......... .......... .......... 123 AATTTAGTAA GTTTATGTAT TTAAGGTTTC TTGAGATGAA AACCTTTATG GTTTTCTACT 1874 .......... .......... .......... .......... .......... .......... 123 CTGCTTGAGT TTTTTAAAAT TCATTCGATT AACGATTAAA AGAACATAAA AACTTTATCG 1934 || |||||||||| |||||||||| ||| ||||| .......... .......... ........TT AACGATTAAA AGAACATAAA AACATTATCA 155 TTAAATCAGA AACAGTCTGT GTAACGATTT GTTCTTTACT GTTAGTATTT CAAATACTTA 1994 |||||||||| |||| | ||| |||| ||||| |||||||||| |||||||||| |||||||||| TTAAATCAGA AACAATTTGT GTAAAGATTT GTTCTTTACT GTTAGTATTT CAAATACTTA 215 AGTTATGTGC CATTTGTGAC AGAAAAAAAG AAAAAATTAC TAATTCAAAT CAAACAAATG 2054 | |||| | | ||||||| || || ||||||| |||||| || ||||| | |||||||||| ACTTATATTC CATTTGTAAC AG-AAAAAAG -AAAAATGAC ATATTCATGT CAAACAAATG 273 TTGGAACAGT AAGTGCTACA AGAGTTTGTG CTGCAATAAC ATCGGTTGCA CATAATAATT 2114 ||||||||| ||||||| || | ||| ||| |||||| | | |||||||||| ||| || || CTGGAACAGT AAGTGCTGCA ACAGTAAGTG CTGCAACATC ATCGGTTGCA CATTATCGTT 333 CAAATGCTGC CTTAGCGCCG GCTGAGAAAC CTGCAAAATT TTCTGGAGTC GACTTTAAGA 2174 ||||| ||| ||||||| || |||||||||| |||||||||| || |||||| |||||||||| CAAATATTGC CTTAGCGTCG GCTGAGAAAC CTGCAAAATT TTTTGGAGTT GACTTTAAGA 393 GATGGCAGCA GAAGATGTTC TTCTATCTCA CTACGTTGAG TCTGCAGAAG TTCATTAATG 2234 ||||||| || ||| ||| | |||||||||| |||||||||| |||||||||| |||||||||| GATGGCAACA AAAGGTGTCC TTCTATCTCA CTACGTTGAG TCTGCAGAAG TTCATTAATG 453 AGAATGTTCC TGTTATGTCA GATGAAACTC CGCCTG 2270 |||||||||| |||||||||| |||||||||| |||||| AGAATGTTCC TGTTATGTCA GATGAAACTC CGCCTG 489 hqPGS_C06HBa0153O03.1-8+_SGN-E553348- (735 773,1903 2270) ******************************************************************************** EST sequence 14 +strand 688 n (File: SGN-E395007+) 1 TGCCTGCACA AAAAGTAGTA CACACATATG CTGGTATCAA TTTTCACGAA TTGCATATAT 61 TCTCTACAAC CCACTAATTT TTCCTAAATT ATACGTCCCC ATTATTTCTA TCTATATTGT 121 TTCAAATTTC TTAGGAGTAG TTGTATCGTG AATATACCAA CTAAACTTGC TACTAAAATC 181 AGCATACTAA TGATAAACAT GACCTAAATA TTCTGGAATC TTCTTTGTTA TGTGTCACAT 241 ACAAAAATGA TGTTTATGTT GGGTTTTAAC AAGTGTGAAT GGAAAAATAA AAAAGAGAAT 301 ATCAAAAGTG AGGGAACTAC TTTGGAGGGA AAATGAAAAG TCATTTGCAA AGTGCAAATG 361 AAAAGTCATT TCTTTGGATT TGGATTTGGA TTTGGCAAAT GATGTGATTG ATTGATATAT 421 TTTTTGGACA AAATTTATTC AATCAATTTT TGTTAAATCA AATAAATCCT GTTAATATTA 481 TTTCTTATAA ATTTGCGGGT AACAGTAACA TTCCGAAAAG TCGTTACTTT TCCGAAAAGT 541 CGTTACTTTC CAAAAAGTAG TTATTTTCCT AACAGACACA ATTTTTCAAA AAAGTTGTTA 601 TTTTTTCCAA AAGACACAAC TTTCTGGATA AAATGGGTCT GAACAAATTT CACTGAACGG 661 ACATGTTCCT TGCTGAAAAT GCTATAAA Predicted gene structure (within gDNA segment 1 to 2671): Exon 1 937 1056 ( 120 n); cDNA 251 371 ( 121 n); score: 0.858 Intron 1 1057 1181 ( 125 n); Pd: 0.000 (s: 0.91), Pa: 0.000 (s: 0.67) Exon 2 1182 1495 ( 314 n); cDNA 372 688 ( 317 n); score: 0.858 MATCH C06HBa0153O03.1-8+ SGN-E395007+ 0.858 434 0.631 C PGS_C06HBa0153O03.1-8+_SGN-E395007+ (937 1056,1182 1495) Alignment (genomic DNA sequence = upper lines): TGATTAGTGT TGAG-TTTAG CAAGTGTGAA TGAGAAAAGA AAAGAGAGAA TATGAAAAGT 995 || ||| ||| || | |||| |||||||||| || |||| | ||| |||||| ||| |||||| TGTTTA-TGT TGGGTTTTAA CAAGTGTGAA TGGAAAAATA AAAAAGAGAA TATCAAAAGT 309 GAGGGAACTA TTTTGGAGGG AAAATGAAAA GTCATTTGCA AAGTGC-AAC GAAAAATCAT 1054 |||||||||| ||||||||| |||||||||| |||||||||| |||||| || ||||| |||| GAGGGAACTA CTTTGGAGGG AAAATGAAAA GTCATTTGCA AAGTGCAAAT GAAAAGTCAT 369 TTCTCCCATA TTGGCAAAAG AAAGGGAAAT TGTTGTCCTT ATATAAGGAA ATACTTCCAT 1114 || TT........ .......... .......... .......... .......... .......... 371 TACTTCTTAA AGAGCTAAGA AGAAGATGCC CCTCACATCG TCATTGCTCG CCCGGCTTCG 1174 .......... .......... .......... .......... .......... .......... 371 GCTTCGGCTT CGGCTTCGGA TTTAGATTTG GCAAATG--G T--TTGATTG ATAAATTTTT 1230 ||| || || ||| ||| |||||| ||||||| | | ||||||| ||| |||||| .......CTT TGGATTTGGA TTTGGATTTG GCAAATGATG TGATTGATTG ATATATTTTT 424 TGGACAAAAT TTATTTAATC AGTTTTTGTT AAATCAAATA AATCCTGTTA ATATTATCTC 1290 |||||||||| ||||| |||| | |||||||| |||||||||| |||||||||| ||||||| || TGGACAAAAT TTATTCAATC AATTTTTGTT AAATCAAATA AATCCTGTTA ATATTATTTC 484 TTATAAATTT GCGGATAACG GTAACATTTC GAAAAGTTGT TACTCTTTCC GATAAGTCGT 1350 |||||||||| |||| |||| |||||||| | ||||||| || |||| ||||| || ||||||| TTATAAATTT GCGGGTAACA GTAACATTCC GAAAAGTCGT TACT-TTTCC GAAAAGTCGT 543 TAATTTTTGA AAAGCCGTTA TTTTTCTAAC AGACAC-ATT TTTCTGAAAA GTTGTTATTT 1409 || ||| | |||| |||| |||| ||||| |||||| ||| |||| |||| |||||||||| TACTTTCCAA AAAGTAGTTA TTTTCCTAAC AGACACAATT TTTCAAAAAA GTTGTTATTT 603 TTTCCAAAAG ACACAACTTT CTTGATAAAA CGGGTCTGAA CAGATTTCTC TGAACAGACA 1469 |||||||||| |||||||||| || ||||||| ||||||||| || ||||| | ||||| |||| TTTCCAAAAG ACACAACTTT CTGGATAAAA TGGGTCTGAA CAAATTTCAC TGAACGGACA 663 CGTTTCTTGC TGAAAGTGGC TATAAA 1495 ||| ||||| ||||| | || |||||| TGTTCCTTGC TGAAAAT-GC TATAAA 688 hqPGS_C06HBa0153O03.1-8+_SGN-E395007+ (937 1056,1182 1495) ******************************************************************************** EST sequence 11 +strand 586 n (File: SGN-E250408+) 1 GTGCCTGCAC AAAAAGTAGT ACACACATAT GCTGGTATCA ATTTTCACGA ATTGCATATA 61 TTCTCTACAA CCCACTAATT TTTCCTAAAT TATACGTCCC CATTATTTCT ATCTATATTG 121 TTTCAAATTT CTTAAGAGTA GTTGTATCGT GAATATACCA ACTAAACTTG CTACTAAAAT 181 CAGCATACTA ATGATAAACA TGACCTAAAT ATTCTGGAAT CTTCTTTGTT ATGTGTCACA 241 TACAAAAATG ATGTTTATGT TGGGTTTTAA CAAGTGTGAA TGGAAAAATA AAAAAGAGAA 301 TATCAAAAGT GAGGGAACTA CTTTGGAGGG AAAATGAAAA GTCATTTGCA AAGTGCAAAT 361 GAAAAGTCAT TTCTTTGGAT TTGGATTTGG ATTTGGCAAA TGATGTGATT GATTGATATA 421 TTTTTTGGAC AAAATTTATT CAATCAATTT TTGTTAAATC AAATAAATCC TGTTAATATT 481 ATTTCTTATA AATTTGCGGG TAACAGTAAC ATTCCGAAAA GTCGTTACTT TTCCGAAAAG 541 TCGTTACTTT CCAAAAAGTA GTTATTTTCC TAACAGACAC AATTTT Predicted gene structure (within gDNA segment 1 to 2784): Exon 1 937 1056 ( 120 n); cDNA 252 372 ( 121 n); score: 0.858 Intron 1 1057 1181 ( 125 n); Pd: 0.000 (s: 0.91), Pa: 0.000 (s: 0.67) Exon 2 1182 1392 ( 211 n); cDNA 373 586 ( 214 n); score: 0.848 MATCH C06HBa0153O03.1-8+ SGN-E250408+ 0.852 331 0.565 C PGS_C06HBa0153O03.1-8+_SGN-E250408+ (937 1056,1182 1392) Alignment (genomic DNA sequence = upper lines): TGATTAGTGT TGAG-TTTAG CAAGTGTGAA TGAGAAAAGA AAAGAGAGAA TATGAAAAGT 995 || ||| ||| || | |||| |||||||||| || |||| | ||| |||||| ||| |||||| TGTTTA-TGT TGGGTTTTAA CAAGTGTGAA TGGAAAAATA AAAAAGAGAA TATCAAAAGT 310 GAGGGAACTA TTTTGGAGGG AAAATGAAAA GTCATTTGCA AAGTGC-AAC GAAAAATCAT 1054 |||||||||| ||||||||| |||||||||| |||||||||| |||||| || ||||| |||| GAGGGAACTA CTTTGGAGGG AAAATGAAAA GTCATTTGCA AAGTGCAAAT GAAAAGTCAT 370 TTCTCCCATA TTGGCAAAAG AAAGGGAAAT TGTTGTCCTT ATATAAGGAA ATACTTCCAT 1114 || TT........ .......... .......... .......... .......... .......... 372 TACTTCTTAA AGAGCTAAGA AGAAGATGCC CCTCACATCG TCATTGCTCG CCCGGCTTCG 1174 .......... .......... .......... .......... .......... .......... 372 GCTTCGGCTT CGGCTTCGGA TTTAGATTTG GCAAATG--G T--TTGATTG ATAAATTTTT 1230 ||| || || ||| ||| |||||| ||||||| | | ||||||| ||| |||||| .......CTT TGGATTTGGA TTTGGATTTG GCAAATGATG TGATTGATTG ATATATTTTT 425 TGGACAAAAT TTATTTAATC AGTTTTTGTT AAATCAAATA AATCCTGTTA ATATTATCTC 1290 |||||||||| ||||| |||| | |||||||| |||||||||| |||||||||| ||||||| || TGGACAAAAT TTATTCAATC AATTTTTGTT AAATCAAATA AATCCTGTTA ATATTATTTC 485 TTATAAATTT GCGGATAACG GTAACATTTC GAAAAGTTGT TACTCTTTCC GATAAGTCGT 1350 |||||||||| |||| |||| |||||||| | ||||||| || |||| ||||| || ||||||| TTATAAATTT GCGGGTAACA GTAACATTCC GAAAAGTCGT TACT-TTTCC GAAAAGTCGT 544 TAATTTTTGA AAAGCCGTTA TTTTTCTAAC AGACACATTT TT 1392 || ||| | |||| |||| |||| ||||| ||||||| || || TACTTTCCAA AAAGTAGTTA TTTTCCTAAC AGACACAATT TT 586 hqPGS_C06HBa0153O03.1-8+_SGN-E250408+ (937 1056,1182 1392) ******************************************************************************** EST sequence 30 -strand 757 n (File: SGN-E542858-) 1 GAACAGATTT CACAGAACAG ACACGTTCCT TGCTGAAAAT TGCTATAAAG GAAGTCAATT 61 TTGATTTTCA AACACTGAAA ATTTTCCTTC TCNGTATTAT TTTTCTCTAA AAAAAATCAA 121 AGTGTCGATC GACTGAGTCT GTGTGACTTG TTGCTGTTCT GAAGTTTGCT GAAGTTAAAG 181 AAGTTTGAGA AAAAAAAAGA CTAACTCAAG TCAAACAAAT GCTGGAATAG TAAGTGCTGC 241 AACAACATCG GCTGCACATA ATCATTCAGA TGCTATCTTA GCGGCGGCTG AGAAACCTGC 301 AGAGTTTTCT GGAGTCGACT TTGAGAGATG GCAGCAAAAG ATGTTCTTCT ATCTCACTAC 361 GTTGAGTCTG CAGAAGTTCA TTAATGAGAA TGTTCCTGTT TATCAGATGA AACTCCGGCT 421 GATGAACGAT TCTTGGTAAC AGAAGCATGG ACACACTCAG ATTTTTTGTG TAAAAATTAT 481 ATTTTGAGTG GTCTGCAAGA TGAACAACAA TGCCAAAACC TCAAAGAACT CTTGGATGCT 541 TTAGAAAAGA AGTACAAAAC AGAAGATGCC GGAATGAAGA AATTCATTGT GGTAAAATTT 601 TTGGACTATA AGATGATAGA CAATAAGACT GTCGTCACCC AAGTTCAAGA ATTGCAGGTC 661 ATAATCCATG ATCTGTGCTG AATGTATAAA TTTATTTAAT GCCTATGTTA GAAATATTAA 721 GTTTTCCCTT AATAAATTTA TTTAATTAAA AAAAAAA Predicted gene structure (within gDNA segment 1 to 3745): Exon 1 1447 1638 ( 192 n); cDNA 1 189 ( 189 n); score: 0.888 Intron 1 1639 2034 ( 396 n); Pd: 0.746 (s: 0.94), Pa: 0.000 (s: 0.54) Exon 2 2035 2616 ( 582 n); cDNA 190 750 ( 561 n); score: 0.858 MATCH C06HBa0153O03.1-8+ SGN-E542858- 0.866 774 1.022 C PGS_C06HBa0153O03.1-8+_SGN-E542858- (1447 1638,2035 2616) Alignment (genomic DNA sequence = upper lines): GAACAGATTT CTCTGAACAG ACACGTTTCT TGCTGAAAGT GGCTATAAAA GGAAGTCAAT 1506 |||||||||| | | |||||| ||||||| || |||||||| | |||||||| ||||||||| GAACAGATTT CACAGAACAG ACACGTTCCT TGCTGAAAAT TGCTATAAA- GGAAGTCAA- 58 TTTTGATTTT TCAAACACTG -AAATTTTCC TTCTCTGCAT ATATTTTTCT CTCAAATGAA 1565 |||||| ||| |||||||||| ||||||||| ||||| | || ||||||||| || ||| || TTTTGA-TTT TCAAACACTG AAAATTTTCC TTCTCNGTAT -TATTTTTCT CTAAAAAAAA 116 TCAAAGTGTC GATCGACTGA ATTTGTGTGA CTTGTTGCTG TTCTGAAGTT CGTTGAAGTT 1625 |||||||||| |||||||||| | ||||||| |||||||||| |||||||||| | ||||||| TCAAAGTGTC GATCGACTGA GTCTGTGTGA CTTGTTGCTG TTCTGAAGTT TGCTGAAGTT 176 AAAGAAATTT GAGGTACCGC TATTTCTTTA ACAGGCTTAA TCCATCTTAT CTTGGGAGAA 1685 |||||| ||| ||| AAAGAAGTTT GAG....... .......... .......... .......... .......... 189 ATTAATCCAT AACCGTGGGT ACAATGAGGG GATTAAATTT CTTAAGGACA CACAGTAGTT 1745 .......... .......... .......... .......... .......... .......... 189 TCTGTGGACT CGAATTACTT CTTGTATTTA TGTATTTTGT GTTTCATCTT ATTTCTGTTT 1805 .......... .......... .......... .......... .......... .......... 189 CTGTTAAGAA ATTTAGTAAG TTTATGTATT TAAGGTTTCT TGAGATGAAA ACCTTTATGG 1865 .......... .......... .......... .......... .......... .......... 189 TTTTCTACTC TGCTTGAGTT TTTTAAAATT CATTCGATTA ACGATTAAAA GAACATAAAA 1925 .......... .......... .......... .......... .......... .......... 189 ACTTTATCGT TAAATCAGAA ACAGTCTGTG TAACGATTTG TTCTTTACTG TTAGTATTTC 1985 .......... .......... .......... .......... .......... .......... 189 AAATACTTAA GTTATGTGCC ATTTGTGACA GAAAAAAAGA AAAAATTACT AATTCAAA-T 2044 || ||| .......... .......... .......... .......... .........A AAAAAAAAGA 200 CAAACAAATG TTGGAACAGT AAGTGCTACA AGAGTTTGTG CTGCAATAAC ATCGGTTGCA 2104 | ||| | | | |||| || |||| | | ||| ||| |||||| ||| ||||| |||| CTAACTCAAG -TCAAACA-- AA-TGCTGGA ATAGTAAGTG CTGCAACAAC ATCGGCTGCA 256 CATAATAATT CAAATGCTGC CTTAGCGCCG GCTGAGAAAC CTGCAAAATT TTCTGGAGTC 2164 |||||| ||| || ||||| ||||||| || |||||||||| ||||| | || |||||||||| CATAATCATT CAGATGCTAT CTTAGCGGCG GCTGAGAAAC CTGCAGAGTT TTCTGGAGTC 316 GACTTTAAGA GATGGCAGCA GAAGATGTTC TTCTATCTCA CTACGTTGAG TCTGCAGAAG 2224 |||||| ||| |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| GACTTTGAGA GATGGCAGCA AAAGATGTTC TTCTATCTCA CTACGTTGAG TCTGCAGAAG 376 TTCATTAATG AGAATGTTCC TGTTATGTCA GATGAAACTC CGCCTGATGA ACGATTCTTG 2284 |||||||||| |||||||||| |||| | ||| |||||||||| || ||||||| |||||||||| TTCATTAATG AGAATGTTCC TGTT-TATCA GATGAAACTC CGGCTGATGA ACGATTCTTG 435 GTAACACAAG CATGGACACA CTCAGATTTT TTGTGTAAAA ATTATATTTT GAGTGGCCTA 2344 |||||| ||| |||||||||| |||||||||| |||||||||| |||||||||| |||||| || GTAACAGAAG CATGGACACA CTCAGATTTT TTGTGTAAAA ATTATATTTT GAGTGGTCTG 495 CAAGATGATC TGTACAATGT GTACAGCAAT GTCAAAACCT TAAAAGAACT CTGGGATGCT 2404 |||||||| || | | |||| | ||||||| | |||||||| || ||||||| CAAGATGA-- ---AC----- --A-A-CAAT GCCAAAACC- TCAAAGAACT CTTGGATGCT 540 TTAGAAAAGA AGTACAAAAC AGAAGATGCC AGAATGAAGA AATTCATCAT GGCAAAATTT 2464 |||||||||| |||||||||| |||||||||| ||||||||| ||||||| | || ||||||| TTAGAAAAGA AGTACAAAAC AGAAGATGCC GGAATGAAGA AATTCATTGT GGTAAAATTT 600 CTGGACTATA AGATGATAGA CAGTAAGACT GTAGTCACCC AAGTTCAAGA ACTGCAGGTC 2524 ||||||||| |||||||||| || ||||||| || ||||||| |||||||||| | |||||||| TTGGACTATA AGATGATAGA CAATAAGACT GTCGTCACCC AAGTTCAAGA ATTGCAGGTC 660 ATAATCCATG ATCTCCTTGC TGAAGGTATA AATTTATTTA ATACCTATGT TAAAAATATT 2584 |||||||||| |||| ||| |||| ||||| |||||||||| || ||||||| || ||||||| ATAATCCATG ATCT--GTGC TGAATGTATA AATTTATTTA ATGCCTATGT TAGAAATATT 718 AAGTTTTTCC TTAATACTCA CATAATCTTC AA 2616 ||||||| || |||||| || || || AAGTTTTCCC TTAATAAATT TATTTAATTA AA 750 hqPGS_C06HBa0153O03.1-8+_SGN-E542858- (1447 1638,2035 2616) ******************************************************************************** EST sequence 16 +strand 658 n (File: SGN-E542859+) 1 GAGAGAACTG TCTCGAGTTT TTTTTTTTTT TTTTTTGGAG TAATTGTTAG TAAAAAAAAT 61 ATTTTTGAGC TAAAAACGTA ACGAGTTAAA AGATTGATCT ATATCGACTA GTTCAAACAT 121 ATTTTCGGCA TTGTTCTTTA TATATCAAAT ATCATTAATC ACTTGTTCAA GATAAACTCC 181 CAAATCTTAA CCACATTTTG GACCAAATTA TAATAATACC TTGTTCTTGT TGGGGTTTAG 241 CAAGTGTGAA TGGGAAAAAA AAAAGAGAAT ATGAAAGGTG AGGGAACTAC TCTGGAGGGA 301 AACTGAAAAG TCATTTGCAA AGTGCAAATG AAAATTCATT TCTCCCATAT CGGCAAAAGA 361 AAGGGAAATT GTTGTCGTTA TATAAGGAAA CACTTCCATT ACTTCTTAAA GAGCTAAGAA 421 GAAGATGCCC CCTCGCGCCG TCATCATCGC TCGGCTTTGG ATTTGGATTT GGCAAATGAT 481 GTGATTGGAT TGATAAATTT TTTGGACAAA ATTTATTTAA TCACTTTTTG TTAAATCAAA 541 TAAATCCTGT TAATATTTAT CTCTAATAAA TTTGCGGTTA ACGGTAACAT TTTGAAAAGT 601 TGTTACTCTT TCGGAAAAGT CGTTACTTTC CAAAAAGTCG TTATTTTCCT AACAGACA Predicted gene structure (within gDNA segment 1 to 2905): Exon 1 936 1385 ( 450 n); cDNA 223 658 ( 436 n); score: 0.856 PPA cDNA 36 17 MATCH C06HBa0153O03.1-8+ SGN-E542859+ 0.856 450 0.684 C PGS_C06HBa0153O03.1-8+_SGN-E542859+ (936 1385) Alignment (genomic DNA sequence = upper lines): GTGATTAGTG TTGAGTTTAG CAAGTGTGAA TGAGAAAAGA AAAGAGAGAA TATGAAAAGT 995 || || || | | |||||| |||||||||| || ||||| | ||| |||||| ||||||| || GTTCTT-GT- TGGGGTTTAG CAAGTGTGAA TGGGAAAA-A AAAAAGAGAA TATGAAAGGT 279 GAGGGAACTA TTTTGGAGGG AAAATGAAAA GTCATTTGCA AAGTGC-AAC GAAAAATCAT 1054 |||||||||| | ||||||| ||| |||||| |||||||||| |||||| || ||||| |||| GAGGGAACTA CTCTGGAGGG AAACTGAAAA GTCATTTGCA AAGTGCAAAT GAAAATTCAT 339 TTCTCCCATA TTGGCAAAAG AAAGGGAAAT TGTTGTCCTT ATATAAGGAA ATACTTCCAT 1114 |||||||||| | |||||||| |||||||||| ||||||| || |||||||||| | |||||||| TTCTCCCATA TCGGCAAAAG AAAGGGAAAT TGTTGTCGTT ATATAAGGAA ACACTTCCAT 399 TACTTCTTAA AGAGCTAAGA AGAAGATGCC CCTCACATCG TCATTGCTCG CCCGGCTTCG 1174 |||||||||| |||||||||| |||||||| | || | | ||| | || || | | || TACTTCTTAA AGAGCTAAGA AGAAGATG-C CC-C-C-TCG -C---GC-CG -TCATCATC- 448 GCTTCGGCTT CGGCTTCGGA TTTAGATTTG GCAAATGGTT TGATTGATAA ATTTTTTGGA 1234 || ||||||| || || ||| ||| | | | || || ||||||||| |||||||||| GC-TCGGCTT TGGATTTGGA TTTGGCAAAT G-ATGTGATT GGATTGATAA ATTTTTTGGA 506 CAAAATTTAT TTAATCAGTT TTTGTTAAAT CAAATAAATC CTGTTAATA- TTATCTCTTA 1293 |||||||||| ||||||| || |||||||||| |||||||||| ||||||||| |||||||| | CAAAATTTAT TTAATCACTT TTTGTTAAAT CAAATAAATC CTGTTAATAT TTATCTCTAA 566 TAAATTTGCG GATAACGGTA ACATTTCGAA AAGTTGTTAC TCTTTCCGAT AAGTCGTTAA 1353 |||||||||| | |||||||| |||||| ||| |||||||||| |||||| || ||||||||| TAAATTTGCG GTTAACGGTA ACATTTTGAA AAGTTGTTAC TCTTTCGGAA AAGTCGTTAC 626 TTTTTGAAAA GCCGTTATTT TTCTAACAGA CA 1385 ||| |||| | |||||||| | |||||||| || TTTCCAAAAA GTCGTTATTT TCCTAACAGA CA 658 hqPGS_C06HBa0153O03.1-8+_SGN-E542859+ (936 1385) ******************************************************************************** EST sequence 24 +strand 598 n (File: SGN-E301820+) 1 AGAGAACTGT CTCGAGTTTT TTTTTTTTTT TTTTTGGAGT AATGGTTAGG GGGAAAAAAT 61 ATTTTTGAGC TAAAAACGTC ACGAGTTAAA AGATTGATCT ATATCGACTA GTTCAAACAT 121 ATTTTCGGCA TTGTTCTTTA TATATCAAAT ATCATTAATC ACTGGGACAA GATAAACTCC 181 CAAATCTTAA CCACATTTTG GACCAAATTA TAATAATACC TTGTTCTTGT TGGGGTTTAG 241 CAAGTGTGAA TGGGAAAAAA AAAAGAGAAT ATGAAAGGTG AGGGAACTAC TCTGGAGGGA 301 AACTGAAAAG TCATTTGCAA AGGGCAAATG AAAATTCATT TCTCCCATAT CGGCAAAAGA 361 AAGGGAAATT GTTGTCGTTA TATAAGGAAA CACTTCCATT ACTTCTTAAA GAGCTAAGAA 421 GAAGATGCCC CCTCGCGCCG TCATCATCGC TCGGCTTTGG ATTTGGATTT GGCAAATGAT 481 GTGATTGGAT TGATAAATTT TTTGGACAAA ATTTATTTAA TCACTTTTTG TTAAATCAAA 541 TAAATCCTGT TAATATTTAT CTCTAATAAA TTTGCGGTTA ACGGTAACAT TTTGAAAA Predicted gene structure (within gDNA segment 1 to 2305): Exon 1 936 1325 ( 390 n); cDNA 223 598 ( 376 n); score: 0.851 PPA cDNA 35 16 MATCH C06HBa0153O03.1-8+ SGN-E301820+ 0.851 390 0.652 C PGS_C06HBa0153O03.1-8+_SGN-E301820+ (936 1325) Alignment (genomic DNA sequence = upper lines): GTGATTAGTG TTGAGTTTAG CAAGTGTGAA TGAGAAAAGA AAAGAGAGAA TATGAAAAGT 995 || || || | | |||||| |||||||||| || ||||| | ||| |||||| ||||||| || GTTCTT-GT- TGGGGTTTAG CAAGTGTGAA TGGGAAAA-A AAAAAGAGAA TATGAAAGGT 279 GAGGGAACTA TTTTGGAGGG AAAATGAAAA GTCATTTGCA AAGTGC-AAC GAAAAATCAT 1054 |||||||||| | ||||||| ||| |||||| |||||||||| ||| || || ||||| |||| GAGGGAACTA CTCTGGAGGG AAACTGAAAA GTCATTTGCA AAGGGCAAAT GAAAATTCAT 339 TTCTCCCATA TTGGCAAAAG AAAGGGAAAT TGTTGTCCTT ATATAAGGAA ATACTTCCAT 1114 |||||||||| | |||||||| |||||||||| ||||||| || |||||||||| | |||||||| TTCTCCCATA TCGGCAAAAG AAAGGGAAAT TGTTGTCGTT ATATAAGGAA ACACTTCCAT 399 TACTTCTTAA AGAGCTAAGA AGAAGATGCC CCTCACATCG TCATTGCTCG CCCGGCTTCG 1174 |||||||||| |||||||||| |||||||| | || | | ||| | || || | | || TACTTCTTAA AGAGCTAAGA AGAAGATG-C CC-C-C-TCG -C---GC-CG -TCATCATC- 448 GCTTCGGCTT CGGCTTCGGA TTTAGATTTG GCAAATGGTT TGATTGATAA ATTTTTTGGA 1234 || ||||||| || || ||| ||| | | | || || ||||||||| |||||||||| GC-TCGGCTT TGGATTTGGA TTTGGCAAAT G-ATGTGATT GGATTGATAA ATTTTTTGGA 506 CAAAATTTAT TTAATCAGTT TTTGTTAAAT CAAATAAATC CTGTTAATA- TTATCTCTTA 1293 |||||||||| ||||||| || |||||||||| |||||||||| ||||||||| |||||||| | CAAAATTTAT TTAATCACTT TTTGTTAAAT CAAATAAATC CTGTTAATAT TTATCTCTAA 566 TAAATTTGCG GATAACGGTA ACATTTCGAA AA 1325 |||||||||| | |||||||| |||||| ||| || TAAATTTGCG GTTAACGGTA ACATTTTGAA AA 598 hqPGS_C06HBa0153O03.1-8+_SGN-E301820+ (936 1325) ******************************************************************************** EST sequence 23 +strand 568 n (File: SGN-E301922+) 1 AGAGAACTGT CTCGAGTTTT TTTTTTTTTT TTTTTGGAGT AAGGGGTAGT AAAAAAAATA 61 TTTTTGAGCT AAAAACGTAA CGAGTTAAAA GATTGATCTA TATCGACTAG TTCAAACATA 121 TTTTCGGCAT TGTTCTTTAT ATATCAAATA TCATTAATCA CTTGTTCAAG ATAAACTCCC 181 AAATCTTAAC CACATTTTGG ACCAAATTAT AATAATACCT TGTTCTTGTT GGGGTTTAGC 241 AAGTGTGAAT GGGAAAAAAA AAAGAGAATA TGAAAGGTGA GGGAACTACT CTGGAGGGAA 301 ACTGAAAAGT CATTTGCAAA GTGCAAATGA AAATTCATTT CTCCCATATC GGCAAAAGAA 361 AGGGAAATTG TTGTCTTTAT ATAAGGAAAC ACTTCCATTA CTTTTTAAAG AGCTAAGAAG 421 AAGATGCCCC CTCGCGCCGT CATCATCGCT CGGCTTTGGA TTTGGATTTG GCAAATGATG 481 TGATTGGATT GATAAATTTT TTGGACAAAA TTTATTTAAT CACTTTTTGT TAAATCAAAT 541 AAATCCTGTT AATATTTATC TCTAATAA Predicted gene structure (within gDNA segment 1 to 2015): Exon 1 936 1296 ( 361 n); cDNA 222 568 ( 347 n); score: 0.845 PPA cDNA 35 16 MATCH C06HBa0153O03.1-8+ SGN-E301922+ 0.845 361 0.636 C PGS_C06HBa0153O03.1-8+_SGN-E301922+ (936 1296) Alignment (genomic DNA sequence = upper lines): GTGATTAGTG TTGAGTTTAG CAAGTGTGAA TGAGAAAAGA AAAGAGAGAA TATGAAAAGT 995 || || || | | |||||| |||||||||| || ||||| | ||| |||||| ||||||| || GTTCTT-GT- TGGGGTTTAG CAAGTGTGAA TGGGAAAA-A AAAAAGAGAA TATGAAAGGT 278 GAGGGAACTA TTTTGGAGGG AAAATGAAAA GTCATTTGCA AAGTGC-AAC GAAAAATCAT 1054 |||||||||| | ||||||| ||| |||||| |||||||||| |||||| || ||||| |||| GAGGGAACTA CTCTGGAGGG AAACTGAAAA GTCATTTGCA AAGTGCAAAT GAAAATTCAT 338 TTCTCCCATA TTGGCAAAAG AAAGGGAAAT TGTTGTCCTT ATATAAGGAA ATACTTCCAT 1114 |||||||||| | |||||||| |||||||||| ||||||| || |||||||||| | |||||||| TTCTCCCATA TCGGCAAAAG AAAGGGAAAT TGTTGTCTTT ATATAAGGAA ACACTTCCAT 398 TACTTCTTAA AGAGCTAAGA AGAAGATGCC CCTCACATCG TCATTGCTCG CCCGGCTTCG 1174 ||||| |||| |||||||||| |||||||| | || | | ||| | || || | | || TACTTTTTAA AGAGCTAAGA AGAAGATG-C CC-C-C-TCG -C---GC-CG -TCATCATC- 447 GCTTCGGCTT CGGCTTCGGA TTTAGATTTG GCAAATGGTT TGATTGATAA ATTTTTTGGA 1234 || ||||||| || || ||| ||| | | | || || ||||||||| |||||||||| GC-TCGGCTT TGGATTTGGA TTTGGCAAAT G-ATGTGATT GGATTGATAA ATTTTTTGGA 505 CAAAATTTAT TTAATCAGTT TTTGTTAAAT CAAATAAATC CTGTTAATA- TTATCTCTTA 1293 |||||||||| ||||||| || |||||||||| |||||||||| ||||||||| |||||||| | CAAAATTTAT TTAATCACTT TTTGTTAAAT CAAATAAATC CTGTTAATAT TTATCTCTAA 565 TAA 1296 ||| TAA 568 hqPGS_C06HBa0153O03.1-8+_SGN-E301922+ (936 1296) ******************************************************************************** EST sequence 15 +strand 574 n (File: SGN-E548743+) 1 TGGAGAGAAC TGTCTCTCAC TTGGTATCTT GAACTTTGGT AGCAAGAAGT ATGTGGAGCA 61 AGTAATCCAA CCTATGCATC TACGGATGTT GTATATAATC TGGGTCGGCT AGTTCAAACA 121 TATGTTCGGG ATTGCTCTTA ATATATCAAA TATCATTAAT CACTTGATCA AGATAAACTC 181 CCAAAACTTA ACCACATTTT GGACCAAATT ATAATAATAC CTTGCTCATG TTGGGGTTTA 241 GCAAGTGTGA ATGGGAAAAA AAAAAGAGAA TCTGAAAGGT GAGGGAACTA CTCTCGAGGG 301 AAACTGAAAA GTCATTTGCC AAGTGCAAAT GAAAATTCAT TTCTCCCATA TCGACTAAAG 361 AAAGGGAAAT TGTTGTCGTT ATATAAGGAA ACACTTCCAT TACTTCTTAA AGAGCTAAGA 421 AGAATATGCC CCCGTCGCGC CGTCATCATC GCTCGGCTTT GGATTTGGAT TTGGCAAATG 481 ATGTGATTGG ATTGATAAAT TTTTTGGACA AAATTTATTT AATCACCTTG TGTTAAATCA 541 CATAAATCCT GTTCATATTT ATCTCTAATA AATT Predicted gene structure (within gDNA segment 1 to 2351): Exon 1 937 1299 ( 363 n); cDNA 223 574 ( 352 n); score: 0.820 MATCH C06HBa0153O03.1-8+ SGN-E548743+ 0.820 363 0.632 C PGS_C06HBa0153O03.1-8+_SGN-E548743+ (937 1299) Alignment (genomic DNA sequence = upper lines): TGATTAGTGT TG-AGTTTAG CAAGTGTGAA TGAGAAAAGA AAAGAGAGAA TATGAAAAGT 995 || | | ||| || |||||| |||||||||| || ||||| | ||| |||||| | ||||| || TGCTCA-TGT TGGGGTTTAG CAAGTGTGAA TGGGAAAA-A AAAAAGAGAA TCTGAAAGGT 280 GAGGGAACTA TTTTGGAGGG AAAATGAAAA GTCATTTGCA AAGTGC-AAC GAAAAATCAT 1054 |||||||||| | | ||||| ||| |||||| ||||||||| |||||| || ||||| |||| GAGGGAACTA CTCTCGAGGG AAACTGAAAA GTCATTTGCC AAGTGCAAAT GAAAATTCAT 340 TTCTCCCATA TTGGCAAAAG AAAGGGAAAT TGTTGTCCTT ATATAAGGAA ATACTTCCAT 1114 |||||||||| | | | |||| |||||||||| ||||||| || |||||||||| | |||||||| TTCTCCCATA TCGACTAAAG AAAGGGAAAT TGTTGTCGTT ATATAAGGAA ACACTTCCAT 400 TACTTCTTAA AGAGCTAAGA AGAAGATGCC CCTCACATCG TCATTGCTCG CCCGGCTTCG 1174 |||||||||| |||||||||| |||| ||| | || | | ||| | || || | | || TACTTCTTAA AGAGCTAAGA AGAATATG-C CC-C-CGTCG -C---GC-CG -TCATCATC- 450 GCTTCGGCTT CGGCTTCGGA TTTAGATTTG GCAAATGGTT TGATTGATAA ATTTTTTGGA 1234 || ||||||| || || ||| ||| | | | || || ||||||||| |||||||||| GC-TCGGCTT TGGATTTGGA TTTGGCAAAT G-ATGTGATT GGATTGATAA ATTTTTTGGA 508 CAAAATTTAT TTAATCAGTT TTTGTTAAAT CAAATAAATC CTGTTAATA- TTATCTCTTA 1293 |||||||||| ||||||| | | |||||||| || ||||||| ||||| ||| |||||||| | CAAAATTTAT TTAATCACCT TGTGTTAAAT CACATAAATC CTGTTCATAT TTATCTCTAA 568 TAAATT 1299 |||||| TAAATT 574 hqPGS_C06HBa0153O03.1-8+_SGN-E548743+ (937 1299) ******************************************************************************** EST sequence 18 +strand 462 n (File: SGN-E236009+) 1 GTTTCTGTGG ACTCGGATTA ATTCTTCTGT TTTAAATTTT TTCTGCTTCA TCTTGTTTCT 61 ATTTCTATTC ATTAACTTCA TAAATACAAG TTATTGTAAG AATAACAATT TTATAACAAT 121 CTTAAGAAAT TTAACAGTTT TTGTATTTGA GGTTTCTGGA GATTAAAACC TTTATGGTTG 181 GTTTTCTATT CTGCTTGAAT TTTTAAAATT TATTCGATTA ACGATTAAAA GAACATAAAA 241 ACTTTATCGT TAAATCAGAA ACAGTTTGTG TAAAGATTTG TTTTTTATTG TTAATATTTT 301 AAATACTTAA TTTATCTGCC ATTTGTGACA GAAAAAAATG ACTAATTCAA GTCAAACAAA 361 TGCTGGAACA GTAAGTGCTG CAACAACAAC GGTTGCACAT AATTCTTCTG GAGTCGACTT 421 TAAGAGATGA CAACAAAAGA TGTTCTTCTA TCTCACTACG TT Predicted gene structure (within gDNA segment 1 to 2821): Exon 1 318 357 ( 40 n); cDNA 83 122 ( 40 n); score: 0.675 Intron 1 358 1809 (1452 n); Pd: 0.000 (s: 0.68), Pa: 0.000 (s: 0.86) Exon 2 1810 2089 ( 280 n); cDNA 123 397 ( 275 n); score: 0.829 Intron 2 2090 2147 ( 58 n); Pd: 0.000 (s: 0.78), Pa: 0.000 (s: 0.87) Exon 3 2148 2211 ( 64 n); cDNA 398 462 ( 65 n); score: 0.898 MATCH C06HBa0153O03.1-8+ SGN-E236009+ 0.842 384 0.831 C PGS_C06HBa0153O03.1-8+_SGN-E236009+ (318 357,1810 2089,2148 2211) Alignment (genomic DNA sequence = upper lines): AATACAAGTA TTTCTAGTAG TAATATTATC AACACACTCT ATTTTATACT ATTATTATAT 377 ||||||||| || || | ||| | | | | ||| ||| AATACAAGTT ATTGTAAGAA TAACAATTTT ATAACAATCT .......... .......... 122 ACACACCCTA TCAAACGAAA CGTGACAAAA TTATATTAAT TGCATTTTGA TTACCCGAGA 437 .......... .......... .......... .......... .......... .......... 122 TTTTACAAGC ATATTTGAGC AAGAGGTAAT ATATCATATG AATCCTCTTA CCTTGAATGC 497 .......... .......... .......... .......... .......... .......... 122 GACTTGACTA AAATTCTTAG TGCTAACACT CGTTTATTAT AGAGAAAAAA ACATTATTAT 557 .......... .......... .......... .......... .......... .......... 122 TCCATTAAAG TAAGGTCTAT CATTCAAAAA AGATTCTTGG ATACTAAATT GGTGTCATAA 617 .......... .......... .......... .......... .......... .......... 122 TATCCCCTTT TGGCAATCTA ACACAGAGAG TTAGAAACTT AAAATTGTGA CTTAATATTA 677 .......... .......... .......... .......... .......... .......... 122 GTTTTATGAG TATTTTATTA TGAAGTGTAT TTTCTTATTT TTCTGATAAT TTAACTGTTT 737 .......... .......... .......... .......... .......... .......... 122 TTCTTATTTT TTTCCTTCTT TATTTCTTTC TTCTTTAAAA ATGACATGCC ATCTTTGACG 797 .......... .......... .......... .......... .......... .......... 122 GAACTTCTCC ACCTTCGGTG CCTCCATTTT TGTCAGAACA TTCTATTTCT ATTATTCCTT 857 .......... .......... .......... .......... .......... .......... 122 TAATTCTCAA CTTTAAAACC TTAACTAAAT ATTTTTTTTT ACATTTTAAA TAATTTTATA 917 .......... .......... .......... .......... .......... .......... 122 ATCCCATTAG ATTTAAAGGT GATTAGTGTT GAGTTTAGCA AGTGTGAATG AGAAAAGAAA 977 .......... .......... .......... .......... .......... .......... 122 AGAGAGAATA TGAAAAGTGA GGGAACTATT TTGGAGGGAA AATGAAAAGT CATTTGCAAA 1037 .......... .......... .......... .......... .......... .......... 122 GTGCAACGAA AAATCATTTC TCCCATATTG GCAAAAGAAA GGGAAATTGT TGTCCTTATA 1097 .......... .......... .......... .......... .......... .......... 122 TAAGGAAATA CTTCCATTAC TTCTTAAAGA GCTAAGAAGA AGATGCCCCT CACATCGTCA 1157 .......... .......... .......... .......... .......... .......... 122 TTGCTCGCCC GGCTTCGGCT TCGGCTTCGG CTTCGGATTT AGATTTGGCA AATGGTTTGA 1217 .......... .......... .......... .......... .......... .......... 122 TTGATAAATT TTTTGGACAA AATTTATTTA ATCAGTTTTT GTTAAATCAA ATAAATCCTG 1277 .......... .......... .......... .......... .......... .......... 122 TTAATATTAT CTCTTATAAA TTTGCGGATA ACGGTAACAT TTCGAAAAGT TGTTACTCTT 1337 .......... .......... .......... .......... .......... .......... 122 TCCGATAAGT CGTTAATTTT TGAAAAGCCG TTATTTTTCT AACAGACACA TTTTTCTGAA 1397 .......... .......... .......... .......... .......... .......... 122 AAGTTGTTAT TTTTTCCAAA AGACACAACT TTCTTGATAA AACGGGTCTG AACAGATTTC 1457 .......... .......... .......... .......... .......... .......... 122 TCTGAACAGA CACGTTTCTT GCTGAAAGTG GCTATAAAAG GAAGTCAATT TTTGATTTTT 1517 .......... .......... .......... .......... .......... .......... 122 CAAACACTGA AATTTTCCTT CTCTGCATAT ATTTTTCTCT CAAATGAATC AAAGTGTCGA 1577 .......... .......... .......... .......... .......... .......... 122 TCGACTGAAT TTGTGTGACT TGTTGCTGTT CTGAAGTTCG TTGAAGTTAA AGAAATTTGA 1637 .......... .......... .......... .......... .......... .......... 122 GGTACCGCTA TTTCTTTAAC AGGCTTAATC CATCTTATCT TGGGAGAAAT TAATCCATAA 1697 .......... .......... .......... .......... .......... .......... 122 CCGTGGGTAC AATGAGGGGA TTAAATTTCT TAAGGACACA CAGTAGTTTC TGTGGACTCG 1757 .......... .......... .......... .......... .......... .......... 122 AATTACTTCT TGTATTTATG TATTTTGTGT TTCATCTTAT TTCTGTTTCT GTTAAGAAAT 1817 |||||||| .......... .......... .......... .......... .......... ..TAAGAAAT 130 TTAGTAAGTT TATGTATTTA AGGTTTCTTG AGATGAAAAC CTTTA----T GGTTTTCTAC 1873 ||| | ||| | ||||||| |||||||| | |||| ||||| ||||| | ||||||||| TTAACA-GTT TTTGTATTTG AGGTTTCTGG AGATTAAAAC CTTTATGGTT GGTTTTCTAT 189 TCTGCTTGAG TTTTTTAAAA TTCATTCGAT TAACGATTAA AAGAACATAA AAACTTTATC 1933 ||||||||| ||||||||| || ||||||| |||||||||| |||||||||| |||||||||| TCTGCTTGA- ATTTTTAAAA TTTATTCGAT TAACGATTAA AAGAACATAA AAACTTTATC 248 GTTAAATCAG AAACAGTCTG TGTAACGATT TGTTCTTTAC TGTTAGTATT TCAAATACTT 1993 |||||||||| ||||||| || ||||| |||| |||| |||| ||||| |||| | |||||||| GTTAAATCAG AAACAGTTTG TGTAAAGATT TGTTTTTTAT TGTTAATATT TTAAATACTT 308 AAGTTATGTG CCATTTGTGA CAGAAAAAAA GAAAAAATTA CTAATTCAAA TCAAACAAAT 2053 || |||| || |||||||||| ||| | ||||||| | ||||||||| |||||||||| AATTTATCTG CCATTTGTGA CAG------A -AAAAAATGA CTAATTCAAG TCAAACAAAT 361 GTTGGAACAG TAAGTGCTAC AAGAGTTTGT GCTGCAATAA CATCGGTTGC ACATAATAAT 2113 | |||||||| |||||||| | || | | |||| GCTGGAACAG TAAGTGCTGC AACAACAACG GTTGCA.... .......... .......... 397 TCAAATGCTG CCTTAGCGCC GGCTGAGAAA CCTGCAAAAT T-TTCTGGAG TCGACTTTAA 2172 || ||| | |||||||| |||||||||| .......... .......... .......... ....CATAAT TCTTCTGGAG TCGACTTTAA 423 GAGATGGCAG CAGAAGATGT TCTTCTATCT CACTACGTT 2211 |||||| || || ||||||| |||||||||| ||||||||| GAGATGACAA CAAAAGATGT TCTTCTATCT CACTACGTT 462 hqPGS_C06HBa0153O03.1-8+_SGN-E236009+ (1810 2089,2148 2211) ******************************************************************************** EST sequence 5 +strand 727 n (File: SGN-E262550+) 1 TTATACAGTT TCTGTGTTTG AGGTTTCTGG AGATTAAAAC CTTTATGGTT TTCTACTCTA 61 CTTGAATTTT TAAAATCATT CGATTAACGA TTAAAAAAAC ATAAAAACTT TATCGTTAAA 121 TCAGAAACAG GTTGTGTAAA GATTTGCTCT TTACTGTTAG TATTTTAAAT ACTTAATTTA 181 TCTGCCAATT GTGACAGAAA AAAAAGACTA ATTCAAGTCA AACAAATGCT GGGACAGTAA 241 GTGCTGCAAC AACAATGGTT GCACATAATC GTTCACATGC TGCCTTAGCA CCGGCTGAGA 301 AACTGCAAAG TTTTCTGGAG TCGACTTTAA GAGATGGCAG CAAAAGATGT TCTTCTATCT 361 CACTACGTTG AGTCTGCAGA AGTTCATCAA TGAGAATGTT CCTGTTATGT CAGATGAAAC 421 TTCGGCTGAT GAACGATTCT TGGTAACAGA AGCATGGACA CACTCAGATT TTTTGTGTAA 481 AATTATATTT TGAGTGGTCT GCAAGATGAT CTGTANCATG TGTACAGCAA TGCAAAAACC 541 TCAAAAGAAC TCTGGGATGC TTTAGAAAAG AAGTACAAAC AGAAGATGCC GGAAATGAGA 601 AAATCATTGT GGGCAAATTT CTAGACTTTT AGATGATAGA CAGTAAGACT GTCGTCACCC 661 AAGTTTCAGA ATTGCAGGTT ATAATCCATG ATCTCCTTGC TGAAAGGATG ATTGTGAATG 721 ATGCTTT Predicted gene structure (within gDNA segment 903 to 3825): Exon 1 1818 2549 ( 732 n); cDNA 1 705 ( 705 n); score: 0.881 Intron 1 2550 2621 ( 72 n); Pd: 1.000 (s: 0.90), Pa: 0.986 (s: 0) Exon 2 2622 2643 ( 22 n); cDNA 706 727 ( 22 n); score: 0.864 MATCH C06HBa0153O03.1-8+ SGN-E262550+ 0.881 754 1.037 C PGS_C06HBa0153O03.1-8+_SGN-E262550+ (1818 2549,2622 2643) Alignment (genomic DNA sequence = upper lines): TTAGTAAGTT TATGTATTTA AGGTTTCTTG AGATGAAAAC CTTTATGGTT TTCTACTCTG 1877 ||| |||| | ||| ||| |||||||| | |||| ||||| |||||||||| ||||||||| TTATACAGTT TCTGTGTTTG AGGTTTCTGG AGATTAAAAC CTTTATGGTT TTCTACTCTA 60 CTTGAGTTTT TTAAAATTCA TTCGATTAAC GATTAAAAGA ACATAAAAAC TTTATCGTTA 1937 ||||| ||| |||||| ||| |||||||||| |||||||| | |||||||||| |||||||||| CTTGA-ATTT TTAAAA-TCA TTCGATTAAC GATTAAAAAA ACATAAAAAC TTTATCGTTA 118 AATCAGAAAC AGTCTGTGTA ACGATTTGTT CTTTACTGTT AGTATTTCAA ATACTTAAGT 1997 |||||||||| || |||||| | |||||| | |||||||||| ||||||| || |||||||| | AATCAGAAAC AGGTTGTGTA AAGATTTGCT CTTTACTGTT AGTATTTTAA ATACTTAATT 178 TATGTGCCAT TTGTGACAGA AAAAAAGAAA AAATTACTAA TTCAAATCAA ACAAATGTTG 2057 ||| ||||| ||||||||| || ||| ||| ||||| ||||| |||| ||||||| || TATCTGCCAA TTGTGACAG- ----AA-AAA AAA-GACTAA TTCAAGTCAA ACAAATGCTG 231 GAACAGTAAG TGCTACAAGA GTTTGTGCTG CAATAACATC GGTTGCACAT AATAATTCAA 2117 | |||||||| |||| | | | ||| | || |||||||||| ||| |||| GGACAGTAAG TGCTGC-A-A ---------- CAA-CA-AT- GGTTGCACAT AATCGTTCAC 276 ATGCTGCCTT AGCGCCGGCT GAGAAACCTG CAAAATTTTC TGGAGTCGAC TTTAAGAGAT 2177 |||||||||| ||| |||||| |||||| ||| |||| ||||| |||||||||| |||||||||| ATGCTGCCTT AGCACCGGCT GAGAAA-CTG CAAAGTTTTC TGGAGTCGAC TTTAAGAGAT 335 GGCAGCAGAA GATGTTCTTC TATCTCACTA CGTTGAGTCT GCAGAAGTTC ATTAATGAGA 2237 ||||||| || |||||||||| |||||||||| |||||||||| |||||||||| || ||||||| GGCAGCAAAA GATGTTCTTC TATCTCACTA CGTTGAGTCT GCAGAAGTTC ATCAATGAGA 395 ATGTTCCTGT TATGTCAGAT GAAACTCCGC CTGATGAACG ATTCTTGGTA ACACAAGCAT 2297 |||||||||| |||||||||| |||||| || |||||||||| |||||||||| ||| |||||| ATGTTCCTGT TATGTCAGAT GAAACTTCGG CTGATGAACG ATTCTTGGTA ACAGAAGCAT 455 GGACACACTC AGATTTTTTG TGTAAAAATT ATATTTTGAG TGGCCTACAA GATGATCTGT 2357 |||||||||| |||||||||| ||| |||||| |||||||||| ||| || ||| |||||||||| GGACACACTC AGATTTTTTG TGT-AAAATT ATATTTTGAG TGGTCTGCAA GATGATCTGT 514 ACAATGTGTA CAGCAATGTC AAAACCTTAA AAGAACTCTG GGATGCTTTA GAAAAGAAGT 2417 | ||||||| |||||||| ||||||| || |||||||||| |||||||||| |||||||||| ANCATGTGTA CAGCAATGCA AAAACCTCAA AAGAACTCTG GGATGCTTTA GAAAAGAAGT 574 ACAAAACAGA AGATGCCAGA ATGAAGAAAT TCATCATGGC AAAATTTCTG GACTATAAGA 2477 || ||||||| ||||||| || | ||||| |||| ||| |||||||| |||| | ||| AC-AAACAGA AGATGCCGGA AATGAGAAAA TCATTGTGGG CAAATTTCTA GACTTTTAGA 633 TGATAGACAG TAAGACTGTA GTCACCCAAG TTCAAGAACT GCAGGTCATA ATCCATGATC 2537 |||||||||| ||||||||| |||||||||| || |||| | |||||| ||| |||||||||| TGATAGACAG TAAGACTGTC GTCACCCAAG TTTCAGAATT GCAGGTTATA ATCCATGATC 693 TCCTTGCTGA AGGTATAAAT TTATTTAATA CCTATGTTAA AAATATTAAG TTTTTCCTTA 2597 |||||||||| | TCCTTGCTGA AA........ .......... .......... .......... .......... 705 ATACTCACAT AATCTTCAAT GTAGGATTGA TTGTGAATGA TGCCTT 2643 | ||| |||||||||| ||| || .......... .......... ....GGATGA TTGTGAATGA TGCTTT 727 hqPGS_C06HBa0153O03.1-8+_SGN-E262550+ (1818 2549,2622 2643) ******************************************************************************** EST sequence 2 +strand 772 n (File: SGN-E261540+) 1 TAAAACCTTT ATGGTTTTCT ACTCTACTTG AATTTTTAAA ATCATTCGAT TAACGATTAA 61 AAAAACATAA AAACTTTATC GTTAAATCAG AAACAGGTTG TGTAAAGATT TGCTCTTTAC 121 TGTTAGTATT TTAAATACTT AATTTATCTG CCAATTGTGA CAGAAAAAAA AGACTAATTC 181 AAGTCAAACA AATGCTGGAA CAGTAAGTGC TGCAACAACA ATGGTTGCAC ATAATCGTTC 241 ACATGCTGCC TTAGCACCGG CTGAGAAACC TGCAAAGTTT TCTGGAGTCG ACTTTAAGAG 301 ATGGCAGCAA AAGATGTTCT TCTATCTCAC TACGTTGAGT CTGCAGAAGT TCATCAATGA 361 GAATGTTCCT GTTATGTCAG ATGAAACTTC GGCTGATGAA CGATTCTTGG TAACAGAAGC 421 ATGGACACAC TCAGATTTTT TGTGTAAAAA TTATATTTTG AGTGGTCTGC AAGATGATCT 481 GTACAATGTG TACAGCAATG CAAAAACCTC AAAAGAACTC TGGGATGCTT TAGAAAAGAA 541 GTACAAAACA GAAGATGCCG GAATGAAGAA ATTCATTGTG GCAAAATTTC TAGACTTTAA 601 GATGATAGAC AGTAAGACTG TCGTCACCCA AGTTCAAGAA TTGCAGGTTA TAATCCATGA 661 TCTCCTTGCT GAAGGATTGA TTGTGAATGA TGCTTTTCAA GTGGCTGCAA TTATTGAAAA 721 GTTACCTCCT ATTGTGGAAG GACTTTAAAA ACTACTTGAA ACACAAACGC AA Predicted gene structure (within gDNA segment 1243 to 3797): Exon 1 1853 2549 ( 697 n); cDNA 2 674 ( 673 n); score: 0.910 Intron 1 2550 2621 ( 72 n); Pd: 1.000 (s: 0.96), Pa: 0.986 (s: 0.96) Exon 2 2622 2718 ( 97 n); cDNA 675 772 ( 98 n); score: 0.933 MATCH C06HBa0153O03.1-8+ SGN-E261540+ 0.912 794 1.028 C PGS_C06HBa0153O03.1-8+_SGN-E261540+ (1853 2549,2622 2718) Alignment (genomic DNA sequence = upper lines): AAAACCTTTA TGGTTTTCTA CTCTGCTTGA GTTTTTTAAA ATTCATTCGA TTAACGATTA 1912 |||||||||| |||||||||| |||| ||||| |||||||| | |||||||| |||||||||| AAAACCTTTA TGGTTTTCTA CTCTACTTGA -ATTTTTAAA A-TCATTCGA TTAACGATTA 59 AAAGAACATA AAAACTTTAT CGTTAAATCA GAAACAGTCT GTGTAACGAT TTGTTCTTTA 1972 ||| |||||| |||||||||| |||||||||| ||||||| | |||||| ||| ||| |||||| AAAAAACATA AAAACTTTAT CGTTAAATCA GAAACAGGTT GTGTAAAGAT TTGCTCTTTA 119 CTGTTAGTAT TTCAAATACT TAAGTTATGT GCCATTTGTG ACAGAAAAAA AGAAAAAATT 2032 |||||||||| || ||||||| ||| |||| | |||| ||||| |||| | | |||||| CTGTTAGTAT TTTAAATACT TAATTTATCT GCCAATTGTG ACAG-----A A-AAAAAA-G 172 ACTAATTCAA ATCAAACAAA TGTTGGAACA GTAAGTGCTA CAAGAGTTTG TGCTGCAATA 2092 |||||||||| ||||||||| || ||||||| ||||||||| | | | ||| ACTAATTCAA GTCAAACAAA TGCTGGAACA GTAAGTGCTG C-A-A----- -----CAA-C 219 ACATCGGTTG CACATAATAA TTCAAATGCT GCCTTAGCGC CGGCTGAGAA ACCTGCAAAA 2152 | || ||||| |||||||| |||| ||||| |||||||| | |||||||||| ||||||||| A-AT-GGTTG CACATAATCG TTCACATGCT GCCTTAGCAC CGGCTGAGAA ACCTGCAAAG 277 TTTTCTGGAG TCGACTTTAA GAGATGGCAG CAGAAGATGT TCTTCTATCT CACTACGTTG 2212 |||||||||| |||||||||| |||||||||| || ||||||| |||||||||| |||||||||| TTTTCTGGAG TCGACTTTAA GAGATGGCAG CAAAAGATGT TCTTCTATCT CACTACGTTG 337 AGTCTGCAGA AGTTCATTAA TGAGAATGTT CCTGTTATGT CAGATGAAAC TCCGCCTGAT 2272 |||||||||| ||||||| || |||||||||| |||||||||| |||||||||| | || ||||| AGTCTGCAGA AGTTCATCAA TGAGAATGTT CCTGTTATGT CAGATGAAAC TTCGGCTGAT 397 GAACGATTCT TGGTAACACA AGCATGGACA CACTCAGATT TTTTGTGTAA AAATTATATT 2332 |||||||||| |||||||| | |||||||||| |||||||||| |||||||||| |||||||||| GAACGATTCT TGGTAACAGA AGCATGGACA CACTCAGATT TTTTGTGTAA AAATTATATT 457 TTGAGTGGCC TACAAGATGA TCTGTACAAT GTGTACAGCA ATGTCAAAAC CTTAAAAGAA 2392 |||||||| | | |||||||| |||||||||| |||||||||| ||| ||||| || ||||||| TTGAGTGGTC TGCAAGATGA TCTGTACAAT GTGTACAGCA ATGCAAAAAC CTCAAAAGAA 517 CTCTGGGATG CTTTAGAAAA GAAGTACAAA ACAGAAGATG CCAGAATGAA GAAATTCATC 2452 |||||||||| |||||||||| |||||||||| |||||||||| || ||||||| ||||||||| CTCTGGGATG CTTTAGAAAA GAAGTACAAA ACAGAAGATG CCGGAATGAA GAAATTCATT 577 ATGGCAAAAT TTCTGGACTA TAAGATGATA GACAGTAAGA CTGTAGTCAC CCAAGTTCAA 2512 ||||||||| |||| |||| |||||||||| |||||||||| |||| ||||| |||||||||| GTGGCAAAAT TTCTAGACTT TAAGATGATA GACAGTAAGA CTGTCGTCAC CCAAGTTCAA 637 GAACTGCAGG TCATAATCCA TGATCTCCTT GCTGAAGGTA TAAATTTATT TAATACCTAT 2572 ||| |||||| | |||||||| |||||||||| ||||||| GAATTGCAGG TTATAATCCA TGATCTCCTT GCTGAAG... .......... .......... 674 GTTAAAAATA TTAAGTTTTT CCTTAATACT CACATAATCT TCAATGTAGG ATTGATTGTG 2632 | |||||||||| .......... .......... .......... .......... .........G ATTGATTGTG 685 AATGATGCCT TTCAAGTGGC TGCAATTATT GAAAACTTAC CTCC-ATTGT TGAAGGACTT 2691 |||||||| | |||||||||| |||||||||| ||||| |||| |||| ||||| ||||||||| AATGATGCTT TTCAAGTGGC TGCAATTATT GAAAAGTTAC CTCCTATTGT GGAAGGACTT 745 CAAAAACTAC TTGAAACACA AACGCAA 2718 ||||||||| |||||||||| ||||||| TAAAAACTAC TTGAAACACA AACGCAA 772 hqPGS_C06HBa0153O03.1-8+_SGN-E261540+ (1853 2549,2622 2718) ******************************************************************************** EST sequence 3 +strand 608 n (File: SGN-E251204+) 1 TCAAGTCAAA CAAATGCTGG AACAGTAAGT GCTGCAACAA CAATGGTTGC ACATAATCGT 61 TCACATGCTG CCTTAGCACC GGCTGAGAAA CCTGCAAAGT TTTCTGGAGT CGACTTTAAG 121 AGATGGCAGC AAAAGATGTT CTTCTATCTC ACTACGTTGA GTCTGCAGAA GTTCATCAAT 181 GAGAATGTTC CTGTTATGTC AGATGAAACT TCGGCTGATG AACGATTCTT GGTAACAGAA 241 GCATGGACAC ACTCAGATTT TTTGTGTAAA AATTATATTT TGAGTGGTCT GCAAGATGAT 301 CTGTACAATG TGTACAGCAA TGCAAAAACC TCAAAAGAAC TCTGGGATGC TTTAGAAAAG 361 AAGTACAAAA CAGAAGATGC CGGAATGAAG AAATTCATTG TGGCAAAATT TCTAGACTTT 421 AAGATGATAG ACAGTAAGAC TGTCGTCACC CAAGTTCAAG AATTGCAGGT TATAATCCAT 481 GATCTCCTTG CTGAAGGATT GATTGTGAAT GATGCTTTTC AAGTGGCTGC AATTATTGAA 541 AAGTTACCTC TATTGTGGAA GGACTTTAAA AACTACTTGA AACACAAACG CAAGGAGATG 601 ATTGTTGA Predicted gene structure (within gDNA segment 752 to 3406): Exon 1 1734 1739 ( 6 n); cDNA 7 12 ( 6 n); score: 0.833 Intron 1 1740 2063 ( 324 n); Pd: 0.683 (s: 0), Pa: 0.331 (s: 0.76) Exon 2 2064 2549 ( 486 n); cDNA 13 496 ( 484 n); score: 0.932 Intron 2 2550 2621 ( 72 n); Pd: 1.000 (s: 0.96), Pa: 0.986 (s: 0.96) Exon 3 2622 2733 ( 112 n); cDNA 497 608 ( 112 n); score: 0.946 MATCH C06HBa0153O03.1-8+ SGN-E251204+ 0.935 604 0.993 C PGS_C06HBa0153O03.1-8+_SGN-E251204+ (1734 1739,2064 2549,2622 2733) Alignment (genomic DNA sequence = upper lines): CACACAGTAG TTTCTGTGGA CTCGAATTAC TTCTTGTATT TATGTATTTT GTGTTTCATC 1793 || ||| CAAACA.... .......... .......... .......... .......... .......... 12 TTATTTCTGT TTCTGTTAAG AAATTTAGTA AGTTTATGTA TTTAAGGTTT CTTGAGATGA 1853 .......... .......... .......... .......... .......... .......... 12 AAACCTTTAT GGTTTTCTAC TCTGCTTGAG TTTTTTAAAA TTCATTCGAT TAACGATTAA 1913 .......... .......... .......... .......... .......... .......... 12 AAGAACATAA AAACTTTATC GTTAAATCAG AAACAGTCTG TGTAACGATT TGTTCTTTAC 1973 .......... .......... .......... .......... .......... .......... 12 TGTTAGTATT TCAAATACTT AAGTTATGTG CCATTTGTGA CAGAAAAAAA GAAAAAATTA 2033 .......... .......... .......... .......... .......... .......... 12 CTAATTCAAA TCAAACAAAT GTTGGAACAG TAAGTGCTAC AAGAGTTTGT GCTGCAATAA 2093 || |||| || ||| || ||||||| || .......... .......... .......... -AA-TGCTGG AACAGTAAGT GCTGCAACAA 40 CATCGGTTGC ACATAATAAT TCAAATGCTG CCTTAGCGCC GGCTGAGAAA CCTGCAAAAT 2153 || |||||| ||||||| | ||| |||||| ||||||| || |||||||||| |||||||| | CAATGGTTGC ACATAATCGT TCACATGCTG CCTTAGCACC GGCTGAGAAA CCTGCAAAGT 100 TTTCTGGAGT CGACTTTAAG AGATGGCAGC AGAAGATGTT CTTCTATCTC ACTACGTTGA 2213 |||||||||| |||||||||| |||||||||| | |||||||| |||||||||| |||||||||| TTTCTGGAGT CGACTTTAAG AGATGGCAGC AAAAGATGTT CTTCTATCTC ACTACGTTGA 160 GTCTGCAGAA GTTCATTAAT GAGAATGTTC CTGTTATGTC AGATGAAACT CCGCCTGATG 2273 |||||||||| |||||| ||| |||||||||| |||||||||| |||||||||| || |||||| GTCTGCAGAA GTTCATCAAT GAGAATGTTC CTGTTATGTC AGATGAAACT TCGGCTGATG 220 AACGATTCTT GGTAACACAA GCATGGACAC ACTCAGATTT TTTGTGTAAA AATTATATTT 2333 |||||||||| ||||||| || |||||||||| |||||||||| |||||||||| |||||||||| AACGATTCTT GGTAACAGAA GCATGGACAC ACTCAGATTT TTTGTGTAAA AATTATATTT 280 TGAGTGGCCT ACAAGATGAT CTGTACAATG TGTACAGCAA TGTCAAAACC TTAAAAGAAC 2393 ||||||| || ||||||||| |||||||||| |||||||||| || |||||| | |||||||| TGAGTGGTCT GCAAGATGAT CTGTACAATG TGTACAGCAA TGCAAAAACC TCAAAAGAAC 340 TCTGGGATGC TTTAGAAAAG AAGTACAAAA CAGAAGATGC CAGAATGAAG AAATTCATCA 2453 |||||||||| |||||||||| |||||||||| |||||||||| | |||||||| |||||||| TCTGGGATGC TTTAGAAAAG AAGTACAAAA CAGAAGATGC CGGAATGAAG AAATTCATTG 400 TGGCAAAATT TCTGGACTAT AAGATGATAG ACAGTAAGAC TGTAGTCACC CAAGTTCAAG 2513 |||||||||| ||| |||| | |||||||||| |||||||||| ||| |||||| |||||||||| TGGCAAAATT TCTAGACTTT AAGATGATAG ACAGTAAGAC TGTCGTCACC CAAGTTCAAG 460 AACTGCAGGT CATAATCCAT GATCTCCTTG CTGAAGGTAT AAATTTATTT AATACCTATG 2573 || ||||||| ||||||||| |||||||||| |||||| AATTGCAGGT TATAATCCAT GATCTCCTTG CTGAAG.... .......... .......... 496 TTAAAAATAT TAAGTTTTTC CTTAATACTC ACATAATCTT CAATGTAGGA TTGATTGTGA 2633 || |||||||||| .......... .......... .......... .......... ........GA TTGATTGTGA 508 ATGATGCCTT TCAAGTGGCT GCAATTATTG AAAACTTACC TCCATTGTTG AAGGACTTCA 2693 ||||||| || |||||||||| |||||||||| |||| ||||| || ||||| | |||||||| | ATGATGCTTT TCAAGTGGCT GCAATTATTG AAAAGTTACC TCTATTGTGG AAGGACTTTA 568 AAAACTACTT GAAACACAAA CGCAAGGAGA TGACTGTTGA 2733 |||||||||| |||||||||| |||||||||| ||| |||||| AAAACTACTT GAAACACAAA CGCAAGGAGA TGATTGTTGA 608 hqPGS_C06HBa0153O03.1-8+_SGN-E251204+ (2064 2549,2622 2733) ******************************************************************************** EST sequence 6 +strand 606 n (File: SGN-E262710+) 1 GGACATGTTC CTTGCTGAAA ATGGCTATAA AAGGAAGTCA AATTTTGATT TTTTAAACAC 61 TGAAAATTTT TCTTTCTCTG CATTTATTTT TCTCTCAAAT AAAATCAAAG TGTCGATCGA 121 CTGAGTCTGT GTGACTTGTT GTTGTTCTTA AGTTCGTTGA AGTTAAAGAA GTTTGAGGTA 181 CCGCTATTTC TTTAACAGGT TTAATCCGTT TTATCTTGAG AGAAATTAAT CCATAACCTT 241 GGGTACAGTG AGGGAATTAA ATTTCTTAAG GACACATAGT AGTTTCTGTG GACTCGGATT 301 AATTCTTGTA TTCTATATTA TTTTCTGCTT CATCTTATTT CTGTTTCTGT TTATTAACTT 361 TATAAATAAA AGTTATTATA AGAGTAACAA TCTTAAGAAA ATTTATACAG TTTCTGTGTT 421 TGAGGTTTCT GGAGATTAAA ACCTTTATGG TTTTCTACTC TACTTGAATT TTTAAAATCA 481 TTCGATTAAC GATTAAAAAA ACATAAAAAC TTTATCGTTA AATCAGAAAC AGGTTGTGTA 541 AAGATTTGCT CTTTACTGTT AGTATTTTAA ATACTTAATT TATCTGCCAA TTGTGACAGA 601 AAAAAA Predicted gene structure (within gDNA segment 107 to 2984): Exon 1 1466 1824 ( 359 n); cDNA 2 365 ( 364 n); score: 0.876 MATCH C06HBa0153O03.1-8+ SGN-E262710+ 0.876 359 0.592 C PGS_C06HBa0153O03.1-8+_SGN-E262710+ (1466 1824) Alignment (genomic DNA sequence = upper lines): GACACGTTTC TTGCTGAAAG TGGCTATAAA AGGAAGTCAA TTTTTGATTT TTCAAACACT 1525 |||| ||| | ||||||||| |||||||||| |||||||||| ||||||||| || ||||||| GACATGTTCC TTGCTGAAAA TGGCTATAAA AGGAAGTCAA ATTTTGATTT TTTAAACACT 61 G-AAA-TTTT CCTTCTCTGC ATATATTTTT CTCTCAAATG -AATCAAAGT GTCGATCGAC 1582 | ||| |||| | |||||||| || ||||||| ||||||||| ||||||||| |||||||||| GAAAATTTTT CTTTCTCTGC ATTTATTTTT CTCTCAAATA AAATCAAAGT GTCGATCGAC 121 TGAATTTGTG TGACTTGTTG CTGTTCTGAA GTTCGTTGAA GTTAAAGAAA TTTGAGGTAC 1642 ||| | |||| |||||||||| |||||| || |||||||||| ||||||||| |||||||||| TGAGTCTGTG TGACTTGTTG TTGTTCTTAA GTTCGTTGAA GTTAAAGAAG TTTGAGGTAC 181 CGCTATTTCT TTAACAGGCT TAATCCATCT TATCTTGGGA GAAATTAATC CATAACCGTG 1702 |||||||||| |||||||| | |||||| | | ||||||| || |||||||||| ||||||| || CGCTATTTCT TTAACAGGTT TAATCCGTTT TATCTTGAGA GAAATTAATC CATAACCTTG 241 GGTACAATGA GGGGATTAAA TTTCTTAAGG ACACACAGTA GTTTCTGTGG ACTCGAATTA 1762 |||||| ||| ||| |||||| |||||||||| ||||| |||| |||||||||| ||||| |||| GGTACAGTGA GGGAATTAAA TTTCTTAAGG ACACATAGTA GTTTCTGTGG ACTCGGATTA 301 CTTCTTGTAT T-TAT-GTAT TTTGTGTTTC ATCTTATTTC TGTTTCTGTT AAGAAATTTA 1820 ||||||||| | ||| ||| ||| || ||| |||||||||| |||||||||| | || || ATTCTTGTAT TCTATATTAT TTTCTGCTTC ATCTTATTTC TGTTTCTGTT TATTAACTTT 361 GTAA 1824 ||| ATAA 365 hqPGS_C06HBa0153O03.1-8+_SGN-E262710+ (1466 1824) ******************************************************************************** EST sequence 12 +strand 514 n (File: SGN-E255327+) 1 GGACATGTTC CTTGCTGAAA ATGGCTATAA AAGGAAGTCA AATTTTGATT TTTGAAACAC 61 TGAAAATTTT TCTTTCTCTG CATTTATTTT TCTCTCAAAT AAAATCAAAG TGTCGATCGA 121 CTGAGTCTGT GTGACTTGTT GTTGTTCTTA AGTTCGTTGA AGTTAAAGAA GTTTGAGGTA 181 CCGCTATTTC TTTAACAGGT TTAATCCGTT TTATCTTGAG AGAAATTAAT CCATAACCTT 241 GGGTACAGTG AGGGAATTAA ATTTCTTAAG GACACATAGT AGTTTCTGTG GACTCGGATT 301 AATTCTTGTA TTCTATATTA TTTTCTGCTT CATCTTATTT CTGTTTCTGT TTATTAACTT 361 TATAAATAAA AGTTATTATA AGAGTAACAA TCTTAAGAAA ATTTATACAG TTTCTGTGTT 421 TGAGGTTTCT GGAGATTAAA ACCTTTATGG TTTTCTACTC TACTTGAATT TTTAAAATCA 481 TTCGATTAAC GATTAAAAAA ACATAAAAAC TTTA Predicted gene structure (within gDNA segment 107 to 2685): Exon 1 1466 1824 ( 359 n); cDNA 2 365 ( 364 n); score: 0.876 MATCH C06HBa0153O03.1-8+ SGN-E255327+ 0.876 359 0.698 C PGS_C06HBa0153O03.1-8+_SGN-E255327+ (1466 1824) Alignment (genomic DNA sequence = upper lines): GACACGTTTC TTGCTGAAAG TGGCTATAAA AGGAAGTCAA TTTTTGATTT TTCAAACACT 1525 |||| ||| | ||||||||| |||||||||| |||||||||| ||||||||| || ||||||| GACATGTTCC TTGCTGAAAA TGGCTATAAA AGGAAGTCAA ATTTTGATTT TTGAAACACT 61 G-AAA-TTTT CCTTCTCTGC ATATATTTTT CTCTCAAATG -AATCAAAGT GTCGATCGAC 1582 | ||| |||| | |||||||| || ||||||| ||||||||| ||||||||| |||||||||| GAAAATTTTT CTTTCTCTGC ATTTATTTTT CTCTCAAATA AAATCAAAGT GTCGATCGAC 121 TGAATTTGTG TGACTTGTTG CTGTTCTGAA GTTCGTTGAA GTTAAAGAAA TTTGAGGTAC 1642 ||| | |||| |||||||||| |||||| || |||||||||| ||||||||| |||||||||| TGAGTCTGTG TGACTTGTTG TTGTTCTTAA GTTCGTTGAA GTTAAAGAAG TTTGAGGTAC 181 CGCTATTTCT TTAACAGGCT TAATCCATCT TATCTTGGGA GAAATTAATC CATAACCGTG 1702 |||||||||| |||||||| | |||||| | | ||||||| || |||||||||| ||||||| || CGCTATTTCT TTAACAGGTT TAATCCGTTT TATCTTGAGA GAAATTAATC CATAACCTTG 241 GGTACAATGA GGGGATTAAA TTTCTTAAGG ACACACAGTA GTTTCTGTGG ACTCGAATTA 1762 |||||| ||| ||| |||||| |||||||||| ||||| |||| |||||||||| ||||| |||| GGTACAGTGA GGGAATTAAA TTTCTTAAGG ACACATAGTA GTTTCTGTGG ACTCGGATTA 301 CTTCTTGTAT T-TAT-GTAT TTTGTGTTTC ATCTTATTTC TGTTTCTGTT AAGAAATTTA 1820 ||||||||| | ||| ||| ||| || ||| |||||||||| |||||||||| | || || ATTCTTGTAT TCTATATTAT TTTCTGCTTC ATCTTATTTC TGTTTCTGTT TATTAACTTT 361 GTAA 1824 ||| ATAA 365 hqPGS_C06HBa0153O03.1-8+_SGN-E255327+ (1466 1824) ******************************************************************************** EST sequence 22 +strand 577 n (File: SGN-E369760+) 1 GGACATGTTC CTTGCTGAAA ATGGCTATAA AAGGAAGTCA AATTTTGATT TTTTAAACAC 61 TGAAAATTTT TCTTTCTCTG CATTTATTTT TCTCTCAAAT AAAATCAAAG TGTCGATCGA 121 CTGAGTCTGT GTGACTTGTT GTTGTTCTTA AGTTCGTTGA AGTTAAAGAA GTTTGAGGTA 181 CCGCTATTTC TTTAACAGGT TTAATCCGTT TTATCTTGAG AGAAATTAAT CCATAACCTT 241 GGGTACAGTG AGGGAATTAA ATTTCTTAAG GACACATAGT AGTTTCTGTG GACTCGGATT 301 AATTCTTGTA TTCTATATTA TTTTCTGCTT CATCTTATTT CTGTTTCTGT TTATTAACTT 361 TATAAATAAA AGTTATTATA AGAGTAACAA TCTTAAGAAA ATTTATACAG TTTCTGTGTT 421 TGAGGTTTCT GGAGATTAAA ACCTTTATGG TTTTCTACTC TACTTGAATT TTTAAAATCA 481 TTCGATTAAC GATTAAAAAA ACATAAAAAC TTTATCGTTA AATCAGAAAC AGGTTGTGTA 541 AAGATTTGCT CTTTACTGTT AGTATTTTAA ATACTTA Predicted gene structure (within gDNA segment 107 to 2694): Exon 1 1466 1824 ( 359 n); cDNA 2 365 ( 364 n); score: 0.876 MATCH C06HBa0153O03.1-8+ SGN-E369760+ 0.876 359 0.622 C PGS_C06HBa0153O03.1-8+_SGN-E369760+ (1466 1824) Alignment (genomic DNA sequence = upper lines): GACACGTTTC TTGCTGAAAG TGGCTATAAA AGGAAGTCAA TTTTTGATTT TTCAAACACT 1525 |||| ||| | ||||||||| |||||||||| |||||||||| ||||||||| || ||||||| GACATGTTCC TTGCTGAAAA TGGCTATAAA AGGAAGTCAA ATTTTGATTT TTTAAACACT 61 G-AAA-TTTT CCTTCTCTGC ATATATTTTT CTCTCAAATG -AATCAAAGT GTCGATCGAC 1582 | ||| |||| | |||||||| || ||||||| ||||||||| ||||||||| |||||||||| GAAAATTTTT CTTTCTCTGC ATTTATTTTT CTCTCAAATA AAATCAAAGT GTCGATCGAC 121 TGAATTTGTG TGACTTGTTG CTGTTCTGAA GTTCGTTGAA GTTAAAGAAA TTTGAGGTAC 1642 ||| | |||| |||||||||| |||||| || |||||||||| ||||||||| |||||||||| TGAGTCTGTG TGACTTGTTG TTGTTCTTAA GTTCGTTGAA GTTAAAGAAG TTTGAGGTAC 181 CGCTATTTCT TTAACAGGCT TAATCCATCT TATCTTGGGA GAAATTAATC CATAACCGTG 1702 |||||||||| |||||||| | |||||| | | ||||||| || |||||||||| ||||||| || CGCTATTTCT TTAACAGGTT TAATCCGTTT TATCTTGAGA GAAATTAATC CATAACCTTG 241 GGTACAATGA GGGGATTAAA TTTCTTAAGG ACACACAGTA GTTTCTGTGG ACTCGAATTA 1762 |||||| ||| ||| |||||| |||||||||| ||||| |||| |||||||||| ||||| |||| GGTACAGTGA GGGAATTAAA TTTCTTAAGG ACACATAGTA GTTTCTGTGG ACTCGGATTA 301 CTTCTTGTAT T-TAT-GTAT TTTGTGTTTC ATCTTATTTC TGTTTCTGTT AAGAAATTTA 1820 ||||||||| | ||| ||| ||| || ||| |||||||||| |||||||||| | || || ATTCTTGTAT TCTATATTAT TTTCTGCTTC ATCTTATTTC TGTTTCTGTT TATTAACTTT 361 GTAA 1824 ||| ATAA 365 hqPGS_C06HBa0153O03.1-8+_SGN-E369760+ (1466 1824) ******************************************************************************** EST sequence 10 +strand 227 n (File: SGN-E261310+) 1 GGACATGTTC CTTGCTGAAA ATGGCTATAA AAGGAAGTCA AATTTTGATT TTTTAAACAC 61 TGAAAATTTT TCTTTCTCTG CATTTATTTT TCTCTCAAAT AAAATCAAAG TGTCGATCGA 121 CTGAGTCTGT GTGACTTGTT GTTGTTCTTA AGTTCGTTGA AGTTAAAGAA GTTTGAGGTA 181 CCGCTATTTC TTTAACAGGT TTAATCCGTT TTATCTTGAG AGAAATT Predicted gene structure (within gDNA segment 107 to 2550): Exon 1 1466 1688 ( 223 n); cDNA 2 227 ( 226 n); score: 0.890 MATCH C06HBa0153O03.1-8+ SGN-E261310+ 0.890 223 0.982 C PGS_C06HBa0153O03.1-8+_SGN-E261310+ (1466 1688) Alignment (genomic DNA sequence = upper lines): GACACGTTTC TTGCTGAAAG TGGCTATAAA AGGAAGTCAA TTTTTGATTT TTCAAACACT 1525 |||| ||| | ||||||||| |||||||||| |||||||||| ||||||||| || ||||||| GACATGTTCC TTGCTGAAAA TGGCTATAAA AGGAAGTCAA ATTTTGATTT TTTAAACACT 61 G-AAA-TTTT CCTTCTCTGC ATATATTTTT CTCTCAAATG -AATCAAAGT GTCGATCGAC 1582 | ||| |||| | |||||||| || ||||||| ||||||||| ||||||||| |||||||||| GAAAATTTTT CTTTCTCTGC ATTTATTTTT CTCTCAAATA AAATCAAAGT GTCGATCGAC 121 TGAATTTGTG TGACTTGTTG CTGTTCTGAA GTTCGTTGAA GTTAAAGAAA TTTGAGGTAC 1642 ||| | |||| |||||||||| |||||| || |||||||||| ||||||||| |||||||||| TGAGTCTGTG TGACTTGTTG TTGTTCTTAA GTTCGTTGAA GTTAAAGAAG TTTGAGGTAC 181 CGCTATTTCT TTAACAGGCT TAATCCATCT TATCTTGGGA GAAATT 1688 |||||||||| |||||||| | |||||| | | ||||||| || |||||| CGCTATTTCT TTAACAGGTT TAATCCGTTT TATCTTGAGA GAAATT 227 hqPGS_C06HBa0153O03.1-8+_SGN-E261310+ (1466 1688) ******************************************************************************** EST sequence 7 +strand 397 n (File: SGN-E262800+) 1 CTTGCTGAAA ATGGCTATAA AAGGAAGTCA AATTTTGATT TTTTAAACAC TGAAAATTTT 61 TCTTTCTCTG CATTTATTTT TCTCTCAAAT AAAATCAAAG TGTCGATCGA CTGAGTCTGT 121 GTGACTTGTT GTTGTTCTTA AGTTCGTTGA AGTTAAAGAA GTTTGAGGTA CCGCTATTTC 181 TTTAACAGGT TTAATCCGTT TTATCTTGAG AGAAATTAAT CCATAACCTT GGGTACAGTG 241 AGGGAATTAA ATTTCTTAAG GACACATAGT AGTTTCTGTG GACTCGGATT AATTCTTGTA 301 TTCTATATTA TTTTCTGCTT CATCTTATTT CTGTTTCTGT TTATTAACTT TATAAATAAA 361 AGTTATTATA AGAGTAACAA TCTTAAGAAA ATTTATA Predicted gene structure (within gDNA segment 207 to 3477): Exon 1 1475 1807 ( 333 n); cDNA 1 338 ( 338 n); score: 0.890 MATCH C06HBa0153O03.1-8+ SGN-E262800+ 0.890 333 0.839 C PGS_C06HBa0153O03.1-8+_SGN-E262800+ (1475 1807) Alignment (genomic DNA sequence = upper lines): CTTGCTGAAA GTGGCTATAA AAGGAAGTCA ATTTTTGATT TTTCAAACAC TG-AAA-TTT 1532 |||||||||| ||||||||| |||||||||| | |||||||| ||| |||||| || ||| ||| CTTGCTGAAA ATGGCTATAA AAGGAAGTCA AATTTTGATT TTTTAAACAC TGAAAATTTT 60 TCCTTCTCTG CATATATTTT TCTCTCAAAT -GAATCAAAG TGTCGATCGA CTGAATTTGT 1591 || ||||||| ||| |||||| |||||||||| |||||||| |||||||||| |||| | ||| TCTTTCTCTG CATTTATTTT TCTCTCAAAT AAAATCAAAG TGTCGATCGA CTGAGTCTGT 120 GTGACTTGTT GCTGTTCTGA AGTTCGTTGA AGTTAAAGAA ATTTGAGGTA CCGCTATTTC 1651 |||||||||| | |||||| | |||||||||| |||||||||| ||||||||| |||||||||| GTGACTTGTT GTTGTTCTTA AGTTCGTTGA AGTTAAAGAA GTTTGAGGTA CCGCTATTTC 180 TTTAACAGGC TTAATCCATC TTATCTTGGG AGAAATTAAT CCATAACCGT GGGTACAATG 1711 ||||||||| ||||||| | |||||||| | |||||||||| |||||||| | ||||||| || TTTAACAGGT TTAATCCGTT TTATCTTGAG AGAAATTAAT CCATAACCTT GGGTACAGTG 240 AGGGGATTAA ATTTCTTAAG GACACACAGT AGTTTCTGTG GACTCGAATT ACTTCTTGTA 1771 |||| ||||| |||||||||| |||||| ||| |||||||||| |||||| ||| | |||||||| AGGGAATTAA ATTTCTTAAG GACACATAGT AGTTTCTGTG GACTCGGATT AATTCTTGTA 300 TT-TAT-GTA TTTTGTGTTT CATCTTATTT CTGTTTCT 1807 || ||| || |||| || || |||||||||| |||||||| TTCTATATTA TTTTCTGCTT CATCTTATTT CTGTTTCT 338 hqPGS_C06HBa0153O03.1-8+_SGN-E262800+ (1475 1807) ******************************************************************************** EST sequence 13 +strand 591 n (File: SGN-E254845+) 1 AGGAAGTCAA ATTTTGATTT TTTAAACACT GAAAATTTGG CTTTCGGTGC ATTTATTTTT 61 CTCTCAAATA AAATCAAAGT GTCGATCGAC TGAGTCTGTG TGACTTGTTG TTGTTCTTAA 121 GTTCGTTGAA GTTAAAGAAG TTTGAGGTAC CGCTATTTCT TTAACAGGTT TAATCCGTTT 181 TATCTTGAGA GAAATTAATC CATAACCTTG GGTACCAGTG AGGGAATTAA ATTTCTTAAG 241 GACACATAGT AGTTTCTGTG GACTCGGATT AATTCTTGTA TTCTATATTA TTTTCTGCTT 301 CATCTTATTT CTGTTTCTGT TTATTAACTT TATAAATAAA AGTTATTATA AGAGTAACAA 361 TCTTAAGAAA ATTTATACAG TTTCTGTGTT TGAGGTTTCT GGAGAATAAA ACCTTTATGG 421 TTTTCTACTC TACTTGAATT TTTAAAATCA TTCGATTAAC GATTAAAAAA ACATAAAAAC 481 TTTATCGTTA AATCAGAAAC AGGTTGTGTA AAGAATTGCT CTTTACTGTT AGTATTTTAA 541 ATACTTAATT TATCTGCCAA TTGTGACAGA AAAAAAAGAC TAATTCAAGT C Predicted gene structure (within gDNA segment 417 to 3134): Exon 1 1496 1824 ( 329 n); cDNA 1 335 ( 335 n); score: 0.857 MATCH C06HBa0153O03.1-8+ SGN-E254845+ 0.857 329 0.557 C PGS_C06HBa0153O03.1-8+_SGN-E254845+ (1496 1824) Alignment (genomic DNA sequence = upper lines): AGGAAGTCAA TTTTTGATTT TTCAAACACT G-AAATTT-T CCTTCTCTGC ATATATTTTT 1553 |||||||||| ||||||||| || ||||||| | |||||| | ||| ||| || ||||||| AGGAAGTCAA ATTTTGATTT TTTAAACACT GAAAATTTGG CTTTCGGTGC ATTTATTTTT 60 CTCTCAAAT- GAATCAAAGT GTCGATCGAC TGAATTTGTG TGACTTGTTG CTGTTCTGAA 1612 ||||||||| ||||||||| |||||||||| ||| | |||| |||||||||| |||||| || CTCTCAAATA AAATCAAAGT GTCGATCGAC TGAGTCTGTG TGACTTGTTG TTGTTCTTAA 120 GTTCGTTGAA GTTAAAGAAA TTTGAGGTAC CGCTATTTCT TTAACAGGCT TAATCCATCT 1672 |||||||||| ||||||||| |||||||||| |||||||||| |||||||| | |||||| | | GTTCGTTGAA GTTAAAGAAG TTTGAGGTAC CGCTATTTCT TTAACAGGTT TAATCCGTTT 180 TATCTTGGGA GAAATTAATC CATAACCGTG GGTA-CAATG AGGGGATTAA ATTTCTTAAG 1731 ||||||| || |||||||||| ||||||| || |||| || || |||| ||||| |||||||||| TATCTTGAGA GAAATTAATC CATAACCTTG GGTACCAGTG AGGGAATTAA ATTTCTTAAG 240 GACACACAGT AGTTTCTGTG GACTCGAATT ACTTCTTGTA TT-TAT-GTA TTTTGTGTTT 1789 |||||| ||| |||||||||| |||||| ||| | |||||||| || ||| || |||| || || GACACATAGT AGTTTCTGTG GACTCGGATT AATTCTTGTA TTCTATATTA TTTTCTGCTT 300 CATCTTATTT CTGTTTCTGT TAAGAAATTT AGTAA 1824 |||||||||| |||||||||| | | || || ||| CATCTTATTT CTGTTTCTGT TTATTAACTT TATAA 335 hqPGS_C06HBa0153O03.1-8+_SGN-E254845+ (1496 1824) ******************************************************************************** EST sequence 19 +strand 653 n (File: SGN-E273518+) 1 GGAAGTCAAA TTTTGATTTT TTAAACACTG AAAATTTTTC TTTCTCTGCA TTTATTTTTC 61 TCTCAAATAA AATCAAAGTG TCGATCGACT GAGTCTGTGT GACTTGTTGT TGTTCTTAAG 121 TTCGTTGAAG TTAAAGAAGT TTGAGGTACC GCTATTTCTT TAACAGGTTT AATCCGTTTT 181 ATCTTGAGAG AAATTAATCC ATAACCTTGG GTACAGTGAG GGAATTAAAT TTCTTAAGGA 241 CACATAGTAG TTTCTGTGGA CTCGGATTAA TTCTTGTATT CTATATTATT TTCTGCTTCA 301 TCTTATTTCT GTTTCTGTTT ATTAACTTTA TAAATAAAAG TTATTATAAG AGTAACAATC 361 TTAAGAAAAT TTATACAGTT TCTGTGTTTG AGGTTTCTGG AGATTAAAAC CTTTATGGTT 421 TTCTACTCTA CTTGAATTTT TAAAATCATT CGATTAACGA TTAAAAAAAC ATAAAAACTT 481 TATCGTTAAA TCAGAAACAG GTTGTGTAAA GATTTGCTCT TTACTGTTAG TATTTTAAAT 541 ACTTAATTTA TCTGCCAATT GTGACAGAAA AAAAAGACTA ATTCAAGTCA AACAAATGCT 601 GGAACAGTAA GTGCTGCAAC AACAATGGTT GCACATAATC GTTCACATGC TGC Predicted gene structure (within gDNA segment 427 to 3825): Exon 1 1497 1824 ( 328 n); cDNA 1 333 ( 333 n); score: 0.873 Intron 1 1825 3152 (1328 n); Pd: 0.000 (s: 0.77), Pa: 0.987 (s: 0) Exon 2 3153 3162 ( 10 n); cDNA 334 342 ( 9 n); score: 0.800 MATCH C06HBa0153O03.1-8+ SGN-E273518+ 0.873 338 0.518 C PGS_C06HBa0153O03.1-8+_SGN-E273518+ (1497 1824,3153 3162) Alignment (genomic DNA sequence = upper lines): GGAAGTCAAT TTTTGATTTT TCAAACACTG -AAA-TTTTC CTTCTCTGCA TATATTTTTC 1554 ||||||||| |||||||||| | |||||||| ||| ||||| ||||||||| | |||||||| GGAAGTCAAA TTTTGATTTT TTAAACACTG AAAATTTTTC TTTCTCTGCA TTTATTTTTC 60 TCTCAAAT-G AATCAAAGTG TCGATCGACT GAATTTGTGT GACTTGTTGC TGTTCTGAAG 1613 |||||||| |||||||||| |||||||||| || | ||||| ||||||||| |||||| ||| TCTCAAATAA AATCAAAGTG TCGATCGACT GAGTCTGTGT GACTTGTTGT TGTTCTTAAG 120 TTCGTTGAAG TTAAAGAAAT TTGAGGTACC GCTATTTCTT TAACAGGCTT AATCCATCTT 1673 |||||||||| |||||||| | |||||||||| |||||||||| ||||||| || ||||| | || TTCGTTGAAG TTAAAGAAGT TTGAGGTACC GCTATTTCTT TAACAGGTTT AATCCGTTTT 180 ATCTTGGGAG AAATTAATCC ATAACCGTGG GTACAATGAG GGGATTAAAT TTCTTAAGGA 1733 |||||| ||| |||||||||| |||||| ||| ||||| |||| || ||||||| |||||||||| ATCTTGAGAG AAATTAATCC ATAACCTTGG GTACAGTGAG GGAATTAAAT TTCTTAAGGA 240 CACACAGTAG TTTCTGTGGA CTCGAATTAC TTCTTGTATT -TAT-GTATT TTGTGTTTCA 1791 |||| ||||| |||||||||| |||| |||| |||||||||| ||| |||| || || |||| CACATAGTAG TTTCTGTGGA CTCGGATTAA TTCTTGTATT CTATATTATT TTCTGCTTCA 300 TCTTATTTCT GTTTCTGTTA AGAAATTTAG TAAGTTTATG TATTTAAGGT TTCTTGAGAT 1851 |||||||||| ||||||||| | || || ||| TCTTATTTCT GTTTCTGTTT ATTAACTTTA TAA....... .......... .......... 333 GAAAACCTTT ATGGTTTTCT ACTCTGCTTG AGTTTTTTAA AATTCATTCG ATTAACGATT 1911 .......... .......... .......... .......... .......... .......... 333 AAAAGAACAT AAAAACTTTA TCGTTAAATC AGAAACAGTC TGTGTAACGA TTTGTTCTTT 1971 .......... .......... .......... .......... .......... .......... 333 ACTGTTAGTA TTTCAAATAC TTAAGTTATG TGCCATTTGT GACAGAAAAA AAGAAAAAAT 2031 .......... .......... .......... .......... .......... .......... 333 TACTAATTCA AATCAAACAA ATGTTGGAAC AGTAAGTGCT ACAAGAGTTT GTGCTGCAAT 2091 .......... .......... .......... .......... .......... .......... 333 AACATCGGTT GCACATAATA ATTCAAATGC TGCCTTAGCG CCGGCTGAGA AACCTGCAAA 2151 .......... .......... .......... .......... .......... .......... 333 ATTTTCTGGA GTCGACTTTA AGAGATGGCA GCAGAAGATG TTCTTCTATC TCACTACGTT 2211 .......... .......... .......... .......... .......... .......... 333 GAGTCTGCAG AAGTTCATTA ATGAGAATGT TCCTGTTATG TCAGATGAAA CTCCGCCTGA 2271 .......... .......... .......... .......... .......... .......... 333 TGAACGATTC TTGGTAACAC AAGCATGGAC ACACTCAGAT TTTTTGTGTA AAAATTATAT 2331 .......... .......... .......... .......... .......... .......... 333 TTTGAGTGGC CTACAAGATG ATCTGTACAA TGTGTACAGC AATGTCAAAA CCTTAAAAGA 2391 .......... .......... .......... .......... .......... .......... 333 ACTCTGGGAT GCTTTAGAAA AGAAGTACAA AACAGAAGAT GCCAGAATGA AGAAATTCAT 2451 .......... .......... .......... .......... .......... .......... 333 CATGGCAAAA TTTCTGGACT ATAAGATGAT AGACAGTAAG ACTGTAGTCA CCCAAGTTCA 2511 .......... .......... .......... .......... .......... .......... 333 AGAACTGCAG GTCATAATCC ATGATCTCCT TGCTGAAGGT ATAAATTTAT TTAATACCTA 2571 .......... .......... .......... .......... .......... .......... 333 TGTTAAAAAT ATTAAGTTTT TCCTTAATAC TCACATAATC TTCAATGTAG GATTGATTGT 2631 .......... .......... .......... .......... .......... .......... 333 GAATGATGCC TTTCAAGTGG CTGCAATTAT TGAAAACTTA CCTCCATTGT TGAAGGACTT 2691 .......... .......... .......... .......... .......... .......... 333 CAAAAACTAC TTGAAACACA AACGCAAGGA GATGACTGTT GAAGATCTCA TAGTAAGGTT 2751 .......... .......... .......... .......... .......... .......... 333 GAGAATCGAA GATGATAATA AGGCTGCAGA AAAGAGGTCA CATCGTAATT CAACAATATT 2811 .......... .......... .......... .......... .......... .......... 333 TGGAGTAAAT TTTGTTGAAG AAGATCCCAC AAAATTAAAA AAAAGAAAGA AAACATCTGG 2871 .......... .......... .......... .......... .......... .......... 333 TCCAAAAAGC AATCCTCCTA AGAAGAAATT CAATGGAAAC TGCTTCAACT GTGGTAAACA 2931 .......... .......... .......... .......... .......... .......... 333 TGGTCATAGA GCTACTGAAT GCCGGGGTCC AAAGTAGGAC AAGAAAAAGA AGGATCAAGC 2991 .......... .......... .......... .......... .......... .......... 333 AAACTTGGCT GAATCCAAAG GAGAAATGGA CGATCTCTGT GCAATGCTTT TAAAATGTAA 3051 .......... .......... .......... .......... .......... .......... 333 CTTGGTTGGA AATCCAAGAG AATGGTGGAT AGATTCTGGT GCCTCATGCC ATGTTTGTGC 3111 .......... .......... .......... .......... .......... .......... 333 CAACAAAGAA TTATTTTAAT CATATACTTC AACACTTACA GATGAAAAAT T 3162 || |||| | | .......... .......... .......... .......... .AT-AAAAGT T 342 hqPGS_C06HBa0153O03.1-8+_SGN-E273518+ (1497 1824) ******************************************************************************** EST sequence 21 +strand 329 n (File: SGN-E258205+) 1 AATTTTTCTT TCTCTGCATT TATTTTTCTC TCAAATAAAA TCAAAGTGTC GATCGACTGA 61 GTCTGTGTGA CTTGTTGTTG TTCTTAAGTT CGTTGAAGTT AAAGAAGTTT GAGGTACCGC 121 TATTTCTTTA ACAGGTTTAA TCCGTTTTAT CTTGAGAGAA ATTAATCCAT AACCTTGGGT 181 ACAGTGAGGG AATTAAATTT CTTAAGGACA CATAGTAGTT TCTGTGGACT CGGATTAATT 241 CTTGTATTCT ATATTATTTT CTGCTTCATC TTATTTCTGG TTCTGTTTAT TAACTTTATA 301 AATAAAAGGT ATTATAAGAG TAACAATCT Predicted gene structure (within gDNA segment 747 to 3337): Exon 1 1527 1807 ( 281 n); cDNA 1 284 ( 284 n); score: 0.891 MATCH C06HBa0153O03.1-8+ SGN-E258205+ 0.891 281 0.854 C PGS_C06HBa0153O03.1-8+_SGN-E258205+ (1527 1807) Alignment (genomic DNA sequence = upper lines): AAATTTTCCT TCTCTGCATA TATTTTTCTC TCAAAT-GAA TCAAAGTGTC GATCGACTGA 1585 || ||||| | ||||||||| |||||||||| |||||| || |||||||||| |||||||||| AATTTTTCTT TCTCTGCATT TATTTTTCTC TCAAATAAAA TCAAAGTGTC GATCGACTGA 60 ATTTGTGTGA CTTGTTGCTG TTCTGAAGTT CGTTGAAGTT AAAGAAATTT GAGGTACCGC 1645 | ||||||| ||||||| || |||| ||||| |||||||||| |||||| ||| |||||||||| GTCTGTGTGA CTTGTTGTTG TTCTTAAGTT CGTTGAAGTT AAAGAAGTTT GAGGTACCGC 120 TATTTCTTTA ACAGGCTTAA TCCATCTTAT CTTGGGAGAA ATTAATCCAT AACCGTGGGT 1705 |||||||||| ||||| |||| ||| | |||| |||| ||||| |||||||||| |||| ||||| TATTTCTTTA ACAGGTTTAA TCCGTTTTAT CTTGAGAGAA ATTAATCCAT AACCTTGGGT 180 ACAATGAGGG GATTAAATTT CTTAAGGACA CACAGTAGTT TCTGTGGACT CGAATTACTT 1765 ||| |||||| ||||||||| |||||||||| || ||||||| |||||||||| || |||| || ACAGTGAGGG AATTAAATTT CTTAAGGACA CATAGTAGTT TCTGTGGACT CGGATTAATT 240 CTTGTATT-T AT-GTATTTT GTGTTTCATC TTATTTCTGT TTCT 1807 |||||||| | || |||||| || |||||| ||||||||| |||| CTTGTATTCT ATATTATTTT CTGCTTCATC TTATTTCTGG TTCT 284 hqPGS_C06HBa0153O03.1-8+_SGN-E258205+ (1527 1807) ******************************************************************************** EST sequence 9 +strand 706 n (File: SGN-E261066+) 1 TTTGAGGTAC CGCTATTTCT TTAACAGGTT TAATCCGTTT TATCTTGAGA GAAATTAATC 61 CATAACCTTG GGTACAGTGA GGGAATTAAA TTTCTTAAGG ACACATAGTA GTTTCTGTGG 121 ACTCGGATTA ATTCTTGTAT TCTATATTAT TTTCTGCTTC ATCTTATTTC TGTTTCTGTT 181 TATTAACTTT ATAAATAAAA GTTATTATAA GAGTAACAAT CTTAAGAAAA TTTATACAGT 241 TTCTGTGTTT GAGGTTTCTG GAGATTAAAA CCTTTATGGT TTTCTACTCT ACTTGAATTT 301 TTAAAATCAT TCGATTAACG ATTAAAAAAA CATAAAAACT TTATCGTTAA ATCATAAACA 361 GGTTGTGTAA AGATTTGCTC TTTACTGTTA GTATTTTAAA TACTTAATTT ATCTGCCAAT 421 TGTGACAGAA AAAAAAGACT AATTCAAGTC AAACAAATGC TGGAACAGTA AGTGCTGCAA 481 CAACAATGGT TGCACATAAT CGTTCACATG CTGCCTTAGC ACCGGCTGAG AAACCTGCAA 541 AGATTTCTGG AGTCGACTTT AAGAGATGGC AGCAAAAGAT GTTCTTCTAT CTCACTACGT 601 TGAGTCTGCA GAAGTTCATC AATGAGAATG TTCCTGTTAT GTCAGATGAA ACTTCGGCTG 661 ATGAACGATT CTTGGTAACA GAAGCATGGA CACACTCAGA TTTTTT Predicted gene structure (within gDNA segment 601 to 2926): Exon 1 1458 1500 ( 43 n); cDNA 175 216 ( 42 n); score: 0.605 Intron 1 1501 1803 ( 303 n); Pd: 0.665 (s: 0.60), Pa: 0.000 (s: 0.70) Exon 2 1804 2316 ( 513 n); cDNA 217 706 ( 490 n); score: 0.870 MATCH C06HBa0153O03.1-8+ SGN-E261066+ 0.870 556 0.788 C PGS_C06HBa0153O03.1-8+_SGN-E261066+ (1458 1500,1804 2316) Alignment (genomic DNA sequence = upper lines): TCTGAACAGA CACGTTTCTT GCTGAAAGTG GCTATAAAAG GAAGTCAATT TTTGATTTTT 1517 |||| | || ||| | | ||||| ||||| || || TCTGTTTATT AAC-TTTATA AATAAAAGTT ATTATAAGAG TAA....... .......... 216 CAAACACTGA AATTTTCCTT CTCTGCATAT ATTTTTCTCT CAAATGAATC AAAGTGTCGA 1577 .......... .......... .......... .......... .......... .......... 216 TCGACTGAAT TTGTGTGACT TGTTGCTGTT CTGAAGTTCG TTGAAGTTAA AGAAATTTGA 1637 .......... .......... .......... .......... .......... .......... 216 GGTACCGCTA TTTCTTTAAC AGGCTTAATC CATCTTATCT TGGGAGAAAT TAATCCATAA 1697 .......... .......... .......... .......... .......... .......... 216 CCGTGGGTAC AATGAGGGGA TTAAATTTCT TAAGGACACA CAGTAGTTTC TGTGGACTCG 1757 .......... .......... .......... .......... .......... .......... 216 AATTACTTCT TGTATTTATG TATTTTGTGT TTCATCTTAT TTCTGTTTCT GTTAAGAAAT 1817 | |||||||| .......... .......... .......... .......... ......CAAT CTTAAGAAAA 230 TTAGTA-AGT TTATGTATTT AAGGTTTCTT GAGATGAAAA CCTTTATGGT TTTCTACTCT 1876 || || ||| || ||| ||| |||||||| ||||| |||| |||||||||| |||||||||| TTTATACAGT TTCTGTGTTT GAGGTTTCTG GAGATTAAAA CCTTTATGGT TTTCTACTCT 290 GCTTGAGTTT TTTAAAATTC ATTCGATTAA CGATTAAAAG AACATAAAAA CTTTATCGTT 1936 ||||| || ||||||| || |||||||||| ||||||||| |||||||||| |||||||||| ACTTGA-ATT TTTAAAA-TC ATTCGATTAA CGATTAAAAA AACATAAAAA CTTTATCGTT 348 AAATCAGAAA CAGTCTGTGT AACGATTTGT TCTTTACTGT TAGTATTTCA AATACTTAAG 1996 |||||| ||| ||| ||||| || |||||| |||||||||| |||||||| | ||||||||| AAATCATAAA CAGGTTGTGT AAAGATTTGC TCTTTACTGT TAGTATTTTA AATACTTAAT 408 TTATGTGCCA TTTGTGACAG AAAAAAAGAA AAAATTACTA ATTCAAATCA AACAAATGTT 2056 |||| ||||| ||||||||| || || |||| |||| |||||| ||| |||||||| | TTATCTGCCA ATTGTGACAG -----AA-AA AAAA-GACTA ATTCAAGTCA AACAAATGCT 461 GGAACAGTAA GTGCTACAAG AGTTTGTGCT GCAATAACAT CGGTTGCACA TAATAATTCA 2116 |||||||||| ||||| | | | ||| | || ||||||||| |||| |||| GGAACAGTAA GTGCTGC-A- A--------- -CAA-CA-AT -GGTTGCACA TAATCGTTCA 506 AATGCTGCCT TAGCGCCGGC TGAGAAACCT GCAAAATTTT CTGGAGTCGA CTTTAAGAGA 2176 ||||||||| |||| ||||| |||||||||| ||||| ||| |||||||||| |||||||||| CATGCTGCCT TAGCACCGGC TGAGAAACCT GCAAAGATTT CTGGAGTCGA CTTTAAGAGA 566 TGGCAGCAGA AGATGTTCTT CTATCTCACT ACGTTGAGTC TGCAGAAGTT CATTAATGAG 2236 |||||||| | |||||||||| |||||||||| |||||||||| |||||||||| ||| |||||| TGGCAGCAAA AGATGTTCTT CTATCTCACT ACGTTGAGTC TGCAGAAGTT CATCAATGAG 626 AATGTTCCTG TTATGTCAGA TGAAACTCCG CCTGATGAAC GATTCTTGGT AACACAAGCA 2296 |||||||||| |||||||||| ||||||| || ||||||||| |||||||||| |||| ||||| AATGTTCCTG TTATGTCAGA TGAAACTTCG GCTGATGAAC GATTCTTGGT AACAGAAGCA 686 TGGACACACT CAGATTTTTT 2316 |||||||||| |||||||||| TGGACACACT CAGATTTTTT 706 hqPGS_C06HBa0153O03.1-8+_SGN-E261066+ (1804 2316) ******************************************************************************** EST sequence 4 +strand 707 n (File: SGN-E263584+) 1 TGACTTGTTG TTGTTCTTAA GTTCGTTGAA GTTAAAGAAG TTTGAGGTAC CGCTATTTCT 61 TTAACAGGTT TAATCCGTTT TATCTTGAGA GAAATTAATC CATAACCTTG GGTACAGTGA 121 GGGAATTAAA TTTCTTAAGG ACACATAGTA GTTTCTGTGG ACTCGGATTA ATTCTTGTAT 181 TCTATATTAT TTTCTGCTTC ATCTTATTTC TGTTTCTGTT TATTAACTTT ATAAATAAAA 241 GTTATTATAA GAGTAACAAT CTTAAGAAAA TTTATACAGT TTCTGTGTTT GAGGTTTCTG 301 GAGATTAAAA CCTTTATGGT TTTCTACTCT ACTTGAATTT TTAAAATCAT TCGATTAACG 361 ATTAAAAAAA CATAAAAACT TTATCGTTAA ATCAGAAACA GGTTGTGTAA AGATTTGCTC 421 TTTACTGTTA GTATTTTAAA TACTTAATTT ATCTGCCAAT TGTGACAGAA AAAAAAGACT 481 AATTCAAGTC AAACAAATGC TGGAACAGTA AGTGCTGCAA CAACAATGGT TGCACATAAT 541 CGTTCACATG CTGCCTTAGC ACCGGCTGAG AAACCTGCAA AGTTTTCTGG AGTCGACTTT 601 AAGAGATGGC AGCAAAAGAT GTTCTTCTAT CTCACTACGT TGAGTCTGCA GAAGTTCATC 661 AATGAGAATG TTCCTGTTAT GTCAGATGAA ACTTCGGCTG ATGAACG Predicted gene structure (within gDNA segment 831 to 3013): Exon 1 1458 1500 ( 43 n); cDNA 215 256 ( 42 n); score: 0.605 Intron 1 1501 1803 ( 303 n); Pd: 0.665 (s: 0.60), Pa: 0.000 (s: 0.70) Exon 2 1804 2277 ( 474 n); cDNA 257 707 ( 451 n); score: 0.866 MATCH C06HBa0153O03.1-8+ SGN-E263584+ 0.866 517 0.731 C PGS_C06HBa0153O03.1-8+_SGN-E263584+ (1458 1500,1804 2277) Alignment (genomic DNA sequence = upper lines): TCTGAACAGA CACGTTTCTT GCTGAAAGTG GCTATAAAAG GAAGTCAATT TTTGATTTTT 1517 |||| | || ||| | | ||||| ||||| || || TCTGTTTATT AAC-TTTATA AATAAAAGTT ATTATAAGAG TAA....... .......... 256 CAAACACTGA AATTTTCCTT CTCTGCATAT ATTTTTCTCT CAAATGAATC AAAGTGTCGA 1577 .......... .......... .......... .......... .......... .......... 256 TCGACTGAAT TTGTGTGACT TGTTGCTGTT CTGAAGTTCG TTGAAGTTAA AGAAATTTGA 1637 .......... .......... .......... .......... .......... .......... 256 GGTACCGCTA TTTCTTTAAC AGGCTTAATC CATCTTATCT TGGGAGAAAT TAATCCATAA 1697 .......... .......... .......... .......... .......... .......... 256 CCGTGGGTAC AATGAGGGGA TTAAATTTCT TAAGGACACA CAGTAGTTTC TGTGGACTCG 1757 .......... .......... .......... .......... .......... .......... 256 AATTACTTCT TGTATTTATG TATTTTGTGT TTCATCTTAT TTCTGTTTCT GTTAAGAAAT 1817 | |||||||| .......... .......... .......... .......... ......CAAT CTTAAGAAAA 270 TTAGTA-AGT TTATGTATTT AAGGTTTCTT GAGATGAAAA CCTTTATGGT TTTCTACTCT 1876 || || ||| || ||| ||| |||||||| ||||| |||| |||||||||| |||||||||| TTTATACAGT TTCTGTGTTT GAGGTTTCTG GAGATTAAAA CCTTTATGGT TTTCTACTCT 330 GCTTGAGTTT TTTAAAATTC ATTCGATTAA CGATTAAAAG AACATAAAAA CTTTATCGTT 1936 ||||| || ||||||| || |||||||||| ||||||||| |||||||||| |||||||||| ACTTGA-ATT TTTAAAA-TC ATTCGATTAA CGATTAAAAA AACATAAAAA CTTTATCGTT 388 AAATCAGAAA CAGTCTGTGT AACGATTTGT TCTTTACTGT TAGTATTTCA AATACTTAAG 1996 |||||||||| ||| ||||| || |||||| |||||||||| |||||||| | ||||||||| AAATCAGAAA CAGGTTGTGT AAAGATTTGC TCTTTACTGT TAGTATTTTA AATACTTAAT 448 TTATGTGCCA TTTGTGACAG AAAAAAAGAA AAAATTACTA ATTCAAATCA AACAAATGTT 2056 |||| ||||| ||||||||| || || |||| |||| |||||| ||| |||||||| | TTATCTGCCA ATTGTGACAG -----AA-AA AAAA-GACTA ATTCAAGTCA AACAAATGCT 501 GGAACAGTAA GTGCTACAAG AGTTTGTGCT GCAATAACAT CGGTTGCACA TAATAATTCA 2116 |||||||||| ||||| | | | ||| | || ||||||||| |||| |||| GGAACAGTAA GTGCTGC-A- A--------- -CAA-CA-AT -GGTTGCACA TAATCGTTCA 546 AATGCTGCCT TAGCGCCGGC TGAGAAACCT GCAAAATTTT CTGGAGTCGA CTTTAAGAGA 2176 ||||||||| |||| ||||| |||||||||| ||||| |||| |||||||||| |||||||||| CATGCTGCCT TAGCACCGGC TGAGAAACCT GCAAAGTTTT CTGGAGTCGA CTTTAAGAGA 606 TGGCAGCAGA AGATGTTCTT CTATCTCACT ACGTTGAGTC TGCAGAAGTT CATTAATGAG 2236 |||||||| | |||||||||| |||||||||| |||||||||| |||||||||| ||| |||||| TGGCAGCAAA AGATGTTCTT CTATCTCACT ACGTTGAGTC TGCAGAAGTT CATCAATGAG 666 AATGTTCCTG TTATGTCAGA TGAAACTCCG CCTGATGAAC G 2277 |||||||||| |||||||||| ||||||| || ||||||||| | AATGTTCCTG TTATGTCAGA TGAAACTTCG GCTGATGAAC G 707 hqPGS_C06HBa0153O03.1-8+_SGN-E263584+ (1804 2277) ******************************************************************************** EST sequence 17 +strand 635 n (File: SGN-E276669+) 1 TTGAGGTACC GCTATTTCTT TAACAGGTTT AATCCGTTTT ATCTTGAGAG AAATTAATCC 61 ATAACCTTGG GTACAGTGAG GGAATTAAAT TTCTTAAGGA CACATAGTAG TTTCTGTGGA 121 CTCGGATTAA TTCTTGTATT CTATATTATT TTCTGCTTCA TCTTATTTCT GTTTCTGTTT 181 ATTAACTTTA TAAATAAAAG TTATTATAAG AGTAACAATC TTAAGAAAAT TTATACAGTT 241 TCTGTGTTTG AGGTTTCTGG AGATTAAAAC CTTTATGGTT TTCTACTCTA CTTGAATTTT 301 TAAAATCATT CGATTAACGA TTAAAAAAAC ATAAAAACTT TATCGTTAAA TCAGAAACAG 361 GTTGTGTAAA GATTTGCTCT TTACTGTTAG TATTTTAAAT ACTTAATTTA TCTGCCAATT 421 GTGACAGAAA AAAAAGACTA ATTCAAGTCA AACAAATGCT GGAACAGTAA GTGCTGCAAC 481 AACAATGGTT GCACATAATC GTTCACATGC TGCCTTAGCA CCGGCTGAGA AACCTGCAAA 541 GTTTTCTGGA GTCGACTTTA AGAGATGGCA GCAAAAGATG TTCTTCTATC TCACTACGTT 601 GAGTCTGCAG AAGTTCATCA ATGAGAATGT TCCTG Predicted gene structure (within gDNA segment 611 to 2856): Exon 1 1458 1500 ( 43 n); cDNA 174 215 ( 42 n); score: 0.605 Intron 1 1501 1803 ( 303 n); Pd: 0.665 (s: 0.60), Pa: 0.000 (s: 0.70) Exon 2 1804 2246 ( 443 n); cDNA 216 635 ( 420 n); score: 0.861 MATCH C06HBa0153O03.1-8+ SGN-E276669+ 0.861 486 0.765 C PGS_C06HBa0153O03.1-8+_SGN-E276669+ (1458 1500,1804 2246) Alignment (genomic DNA sequence = upper lines): TCTGAACAGA CACGTTTCTT GCTGAAAGTG GCTATAAAAG GAAGTCAATT TTTGATTTTT 1517 |||| | || ||| | | ||||| ||||| || || TCTGTTTATT AAC-TTTATA AATAAAAGTT ATTATAAGAG TAA....... .......... 215 CAAACACTGA AATTTTCCTT CTCTGCATAT ATTTTTCTCT CAAATGAATC AAAGTGTCGA 1577 .......... .......... .......... .......... .......... .......... 215 TCGACTGAAT TTGTGTGACT TGTTGCTGTT CTGAAGTTCG TTGAAGTTAA AGAAATTTGA 1637 .......... .......... .......... .......... .......... .......... 215 GGTACCGCTA TTTCTTTAAC AGGCTTAATC CATCTTATCT TGGGAGAAAT TAATCCATAA 1697 .......... .......... .......... .......... .......... .......... 215 CCGTGGGTAC AATGAGGGGA TTAAATTTCT TAAGGACACA CAGTAGTTTC TGTGGACTCG 1757 .......... .......... .......... .......... .......... .......... 215 AATTACTTCT TGTATTTATG TATTTTGTGT TTCATCTTAT TTCTGTTTCT GTTAAGAAAT 1817 | |||||||| .......... .......... .......... .......... ......CAAT CTTAAGAAAA 229 TTAGTA-AGT TTATGTATTT AAGGTTTCTT GAGATGAAAA CCTTTATGGT TTTCTACTCT 1876 || || ||| || ||| ||| |||||||| ||||| |||| |||||||||| |||||||||| TTTATACAGT TTCTGTGTTT GAGGTTTCTG GAGATTAAAA CCTTTATGGT TTTCTACTCT 289 GCTTGAGTTT TTTAAAATTC ATTCGATTAA CGATTAAAAG AACATAAAAA CTTTATCGTT 1936 ||||| || ||||||| || |||||||||| ||||||||| |||||||||| |||||||||| ACTTGA-ATT TTTAAAA-TC ATTCGATTAA CGATTAAAAA AACATAAAAA CTTTATCGTT 347 AAATCAGAAA CAGTCTGTGT AACGATTTGT TCTTTACTGT TAGTATTTCA AATACTTAAG 1996 |||||||||| ||| ||||| || |||||| |||||||||| |||||||| | ||||||||| AAATCAGAAA CAGGTTGTGT AAAGATTTGC TCTTTACTGT TAGTATTTTA AATACTTAAT 407 TTATGTGCCA TTTGTGACAG AAAAAAAGAA AAAATTACTA ATTCAAATCA AACAAATGTT 2056 |||| ||||| ||||||||| || || |||| |||| |||||| ||| |||||||| | TTATCTGCCA ATTGTGACAG -----AA-AA AAAA-GACTA ATTCAAGTCA AACAAATGCT 460 GGAACAGTAA GTGCTACAAG AGTTTGTGCT GCAATAACAT CGGTTGCACA TAATAATTCA 2116 |||||||||| ||||| | | | ||| | || ||||||||| |||| |||| GGAACAGTAA GTGCTGC-A- A--------- -CAA-CA-AT -GGTTGCACA TAATCGTTCA 505 AATGCTGCCT TAGCGCCGGC TGAGAAACCT GCAAAATTTT CTGGAGTCGA CTTTAAGAGA 2176 ||||||||| |||| ||||| |||||||||| ||||| |||| |||||||||| |||||||||| CATGCTGCCT TAGCACCGGC TGAGAAACCT GCAAAGTTTT CTGGAGTCGA CTTTAAGAGA 565 TGGCAGCAGA AGATGTTCTT CTATCTCACT ACGTTGAGTC TGCAGAAGTT CATTAATGAG 2236 |||||||| | |||||||||| |||||||||| |||||||||| |||||||||| ||| |||||| TGGCAGCAAA AGATGTTCTT CTATCTCACT ACGTTGAGTC TGCAGAAGTT CATCAATGAG 625 AATGTTCCTG 2246 |||||||||| AATGTTCCTG 635 hqPGS_C06HBa0153O03.1-8+_SGN-E276669+ (1804 2246) ******************************************************************************** EST sequence 1 +strand 387 n (File: SGN-E241550+) 1 AATTTCTTAA GGACACACAG TAGTTTCTGT GGACTCGGAT TAATTCTTCT GTTTTAAATT 61 TTTTCTGCTT CATCTTGTTT CTATTTCTAT TCATTAACTT CATAAATACA AGTTATTGTA 121 AGAATAACAA TTTTATAACA ATCTTAAGAA ATTTAACAGT TTTTGTATTT GAGGTTTCTG 181 GAGATTAAAA CCTTTATGGT TGGGTTTCTA TTCTGCTTGA ATTTTTAAAA TTTATTCGAT 241 TAACGATTAA AAGAACATAA AAACTTTATC GTTTAATCAG AAACAGTTTG TGTAAAGAAT 301 TGTTTTTTAT TGTTAATATT TTAAATACTT AATTTATCTG CCATTTGTGA CAGAAAAAAA 361 TGACTAATTC AAGTCAAACA AATGCTG Predicted gene structure (within gDNA segment 1121 to 3686): Exon 1 1809 2054 ( 246 n); cDNA 144 387 ( 244 n); score: 0.825 MATCH C06HBa0153O03.1-8+ SGN-E241550+ 0.825 246 0.636 C PGS_C06HBa0153O03.1-8+_SGN-E241550+ (1809 2054) Alignment (genomic DNA sequence = upper lines): TTAAGAAATT TAGTAAGTTT ATGTATTTAA GGTTTCTTGA GATGAAAACC TTTATGGTT- 1867 |||||||||| || ||||| ||||||| | ||||||| || ||| |||||| ||||||||| TTAAGAAATT TA-ACAGTTT TTGTATTTGA GGTTTCTGGA GATTAAAACC TTTATGGTTG 202 ---TTCTACT CTGCTTGAGT TTTTTAAAAT TCATTCGATT AACGATTAAA AGAACATAAA 1924 ||||| | |||||||| | |||| ||||| | |||||||| |||||||||| |||||||||| GGTTTCTATT CTGCTTGAAT TTTT-AAAAT TTATTCGATT AACGATTAAA AGAACATAAA 261 AACTTTATCG TTAAATCAGA AACAGTCTGT GTAACGATTT GTTCTTTACT GTTAGTATTT 1984 |||||||||| || ||||||| |||||| ||| |||| || || ||| |||| | |||| ||||| AACTTTATCG TTTAATCAGA AACAGTTTGT GTAAAGAATT GTTTTTTATT GTTAATATTT 321 CAAATACTTA AGTTATGTGC CATTTGTGAC AGAAAAAAAG AAAAAATTAC TAATTCAAAT 2044 ||||||||| | |||| ||| |||||||||| ||||||||| | |||| | || ||||| TAAATACTTA ATTTATCTGC CATTTGTGAC AGAAAAAAAT GACTAATT-C -AAGTCAAA- 378 CAAACAAATG 2054 |||| || CAAA-TGCTG 387 hqPGS_C06HBa0153O03.1-8+_SGN-E241550+ (1809 2054) ******************************************************************************** EST sequence 8 +strand 611 n (File: SGN-E253427+) 1 TGGACTCGGA TTAATTCTTG TATTCTATAT TATTTTCTGC TTCATCTTAT TTCTGTTTCT 61 GGTTATTAAC TTTATAAATA AAAGTTATTA TAAGAGTAAC AATCTTAAGA AAATTTATAC 121 AGTTTCTGTG TTTGAGGTTT CTGGAGATTA AAACCTTTAT GGTTTTCTAC TCTACTTGAA 181 TTTTTAAAAT CATTCGATTA ACGATTAAAA AAACATAAAA ACTTTATCGT TAAATCAGAA 241 ACAGGTTGTG TAAAGATTTG CTCTTTACTG TTAGTATTTT AAATACTTAA TTTATCTGCC 301 AATTGTGACA GAAAAAAAAG ACTAATTCAA GTCAAACAAA TGCTGGAACA GTAAGTGCTG 361 CAACAACAAT GGTTGCACAT AATCGTTCAC ATGCTGCCTT AGCACCGGCT GAGAAACCTG 421 CAAAGTTTTC TGGAGTCGAC TTTAAGAGAT GGCAGCAAAA GATGTTCTTC TATCTCACTA 481 CGTTGAGTCT GCAGAAGTTC ATCAATGAGA ATGTTCCTGT TATGTCAGAT GAAACTTCGG 541 CTGATGAACG ATTCTTGGTA ACAGAAGCAT GGACACACTC AGATTTTTTG TGTAAAAATT 601 ATATTTTGAG T Predicted gene structure (within gDNA segment 1 to 2948): Exon 1 1814 2338 ( 525 n); cDNA 111 611 ( 501 n); score: 0.890 MATCH C06HBa0153O03.1-8+ SGN-E253427+ 0.890 525 0.859 C PGS_C06HBa0153O03.1-8+_SGN-E253427+ (1814 2338) Alignment (genomic DNA sequence = upper lines): AAATTTAGTA AGTTTATGTA TTTAAGGTTT CTTGAGATGA AAACCTTTAT GGTTTTCTAC 1873 ||||||| ||||| ||| ||| |||||| || ||||| | |||||||||| |||||||||| AAATTTATAC AGTTTCTGTG TTTGAGGTTT CTGGAGATTA AAACCTTTAT GGTTTTCTAC 170 TCTGCTTGAG TTTTTTAAAA TTCATTCGAT TAACGATTAA AAGAACATAA AAACTTTATC 1933 ||| ||||| ||||||||| ||||||||| |||||||||| || ||||||| |||||||||| TCTACTTGA- ATTTTTAAAA -TCATTCGAT TAACGATTAA AAAAACATAA AAACTTTATC 228 GTTAAATCAG AAACAGTCTG TGTAACGATT TGTTCTTTAC TGTTAGTATT TCAAATACTT 1993 |||||||||| |||||| || ||||| |||| || ||||||| |||||||||| | |||||||| GTTAAATCAG AAACAGGTTG TGTAAAGATT TGCTCTTTAC TGTTAGTATT TTAAATACTT 288 AAGTTATGTG CCATTTGTGA CAGAAAAAAA GAAAAAATTA CTAATTCAAA TCAAACAAAT 2053 || |||| || ||| |||||| ||| || |||||| | ||||||||| |||||||||| AATTTATCTG CCAATTGTGA CAG-----AA -AAAAAA-GA CTAATTCAAG TCAAACAAAT 341 GTTGGAACAG TAAGTGCTAC AAGAGTTTGT GCTGCAATAA CATCGGTTGC ACATAATAAT 2113 | |||||||| |||||||| | | | ||| | || |||||| ||||||| | GCTGGAACAG TAAGTGCTGC -A-A------ ----CAA-CA -AT-GGTTGC ACATAATCGT 386 TCAAATGCTG CCTTAGCGCC GGCTGAGAAA CCTGCAAAAT TTTCTGGAGT CGACTTTAAG 2173 ||| |||||| ||||||| || |||||||||| |||||||| | |||||||||| |||||||||| TCACATGCTG CCTTAGCACC GGCTGAGAAA CCTGCAAAGT TTTCTGGAGT CGACTTTAAG 446 AGATGGCAGC AGAAGATGTT CTTCTATCTC ACTACGTTGA GTCTGCAGAA GTTCATTAAT 2233 |||||||||| | |||||||| |||||||||| |||||||||| |||||||||| |||||| ||| AGATGGCAGC AAAAGATGTT CTTCTATCTC ACTACGTTGA GTCTGCAGAA GTTCATCAAT 506 GAGAATGTTC CTGTTATGTC AGATGAAACT CCGCCTGATG AACGATTCTT GGTAACACAA 2293 |||||||||| |||||||||| |||||||||| || |||||| |||||||||| ||||||| || GAGAATGTTC CTGTTATGTC AGATGAAACT TCGGCTGATG AACGATTCTT GGTAACAGAA 566 GCATGGACAC ACTCAGATTT TTTGTGTAAA AATTATATTT TGAGT 2338 |||||||||| |||||||||| |||||||||| |||||||||| ||||| GCATGGACAC ACTCAGATTT TTTGTGTAAA AATTATATTT TGAGT 611 hqPGS_C06HBa0153O03.1-8+_SGN-E253427+ (1814 2338) ******************************************************************************** EST sequence 20 +strand 443 n (File: SGN-E258047+) 1 TTATACAGTT TCTGTGTTTG AGGTTTCTGG AGATTAAAAC CTTTATGGTT TTCTACTCTA 61 CTTGAATTTT TAAAATCATT CGATTAACGA TTAAAAAAAC ATAAAAACTT TATCGTTAAA 121 TCAGAAACAG GTTGTGTAAA GATTTGCTCT TTACTGTTAG TATTTTAAAT ACTTAATTTA 181 TCTGCCAATT GTGACAGAAA AAAAAGACTA ATTCAAGTCA AACAAATGCT GGAACAGTAA 241 GTGCTGCAAC AACAATGGTT GCACATAATC GTTCACATGC TGCCTTAGCA CCGGCTGAGA 301 AACCTGCAAA GTTTTCTGGA GTCGACTTTA AGAGATGGCA GCAAAAGATG TTCTTCTATC 361 TCACTACGTT GAGTCTGCAG AAGTTCATCA ATGAGAATGT TCCTGTTATG TCAGATGAAA 421 CTTCGGCTGA TGAACGATTC TTG Predicted gene structure (within gDNA segment 903 to 3083): Exon 1 1818 2284 ( 467 n); cDNA 1 443 ( 443 n); score: 0.878 MATCH C06HBa0153O03.1-8+ SGN-E258047+ 0.878 467 1.054 C PGS_C06HBa0153O03.1-8+_SGN-E258047+ (1818 2284) Alignment (genomic DNA sequence = upper lines): TTAGTAAGTT TATGTATTTA AGGTTTCTTG AGATGAAAAC CTTTATGGTT TTCTACTCTG 1877 ||| |||| | ||| ||| |||||||| | |||| ||||| |||||||||| ||||||||| TTATACAGTT TCTGTGTTTG AGGTTTCTGG AGATTAAAAC CTTTATGGTT TTCTACTCTA 60 CTTGAGTTTT TTAAAATTCA TTCGATTAAC GATTAAAAGA ACATAAAAAC TTTATCGTTA 1937 ||||| ||| |||||| ||| |||||||||| |||||||| | |||||||||| |||||||||| CTTGA-ATTT TTAAAA-TCA TTCGATTAAC GATTAAAAAA ACATAAAAAC TTTATCGTTA 118 AATCAGAAAC AGTCTGTGTA ACGATTTGTT CTTTACTGTT AGTATTTCAA ATACTTAAGT 1997 |||||||||| || |||||| | |||||| | |||||||||| ||||||| || |||||||| | AATCAGAAAC AGGTTGTGTA AAGATTTGCT CTTTACTGTT AGTATTTTAA ATACTTAATT 178 TATGTGCCAT TTGTGACAGA AAAAAAGAAA AAATTACTAA TTCAAATCAA ACAAATGTTG 2057 ||| ||||| ||||||||| || ||| ||| ||||| ||||| |||| ||||||| || TATCTGCCAA TTGTGACAG- ----AA-AAA AAA-GACTAA TTCAAGTCAA ACAAATGCTG 231 GAACAGTAAG TGCTACAAGA GTTTGTGCTG CAATAACATC GGTTGCACAT AATAATTCAA 2117 |||||||||| |||| | | | ||| | || |||||||||| ||| |||| GAACAGTAAG TGCTGC-A-A ---------- CAA-CA-AT- GGTTGCACAT AATCGTTCAC 276 ATGCTGCCTT AGCGCCGGCT GAGAAACCTG CAAAATTTTC TGGAGTCGAC TTTAAGAGAT 2177 |||||||||| ||| |||||| |||||||||| |||| ||||| |||||||||| |||||||||| ATGCTGCCTT AGCACCGGCT GAGAAACCTG CAAAGTTTTC TGGAGTCGAC TTTAAGAGAT 336 GGCAGCAGAA GATGTTCTTC TATCTCACTA CGTTGAGTCT GCAGAAGTTC ATTAATGAGA 2237 ||||||| || |||||||||| |||||||||| |||||||||| |||||||||| || ||||||| GGCAGCAAAA GATGTTCTTC TATCTCACTA CGTTGAGTCT GCAGAAGTTC ATCAATGAGA 396 ATGTTCCTGT TATGTCAGAT GAAACTCCGC CTGATGAACG ATTCTTG 2284 |||||||||| |||||||||| |||||| || |||||||||| ||||||| ATGTTCCTGT TATGTCAGAT GAAACTTCGG CTGATGAACG ATTCTTG 443 hqPGS_C06HBa0153O03.1-8+_SGN-E258047+ (1818 2284) ******************************************************************************** EST sequence 31 -strand 718 n (File: SGN-E369759-) 1 TAAGGTTGAG AATCGAAGAG GATAATAAGG CTGCAGAAAA GAGGTCACGT GGTAATTCAG 61 CAATATCTGG AGTAAATTTT GTTGAAGAAG ATTCCACAAA ATTCAAGAAA AGAAAGAAAG 121 CATCTGGTCC AAAACGCAAT CCTCCTAAGA AGAAATTCAA TGGAAACTGC TTTAATTGTG 181 GTAAACATGG TCATAGGGCT AATGAATGCC GGGGTCCTAA GAAGGACAAG AAAAAGAAGG 241 ATCAAGCAAA CTTGGCTGAA TCCAAAGGAG AAATGGACGA TCTCTGTGCA ATGCTTTCAG 301 AATGTAACTT GGTTGGAAAT CCAAGAGAAT GGTGGATAGA TTCTGGTGCC TCATGCCATG 361 TTTGTGCCAA CAAAGAATTA TTTTCATCAT ATACTCTAGC ACTTACAGAT GAAAAATTAT 421 TTATGGCAAA CTCCGCTGTT GCAAAGGTGG AAGGAACTGG CAAAGTCCTA TTAAAGATGA 481 CATCAGGCAA GGTAGTGACT TTGAATATGG TCTCATATGT TCCAGAATTG AGAAATAATT 541 TAGTTTCAAT TCCAATTCTG ACCAAGAATG GATTTAAATG TGTATTTGTT TCTGATAAAG 601 TAGTAGTAAG CAAAAATGAT ATGTATGTAG GAAAAGACTA CCTTAGTGAT GGCCTTTTCA 661 AACTCAATGT AATTGCAGTT GATATGAATA AAGATTTTGC TTCTTCTTAA AAAAAAAA Predicted gene structure (within gDNA segment 2135 to 3825): Exon 1 2745 3450 ( 706 n); cDNA 1 709 ( 709 n); score: 0.940 MATCH C06HBa0153O03.1-8+ SGN-E369759- 0.940 706 0.983 C PGS_C06HBa0153O03.1-8+_SGN-E369759- (2745 3450) Alignment (genomic DNA sequence = upper lines): TAAGGTTGAG AATCGAAGAT GATAATAAGG CTGCAGAAAA GAGGTCACAT CGTAATTCAA 2804 |||||||||| ||||||||| |||||||||| |||||||||| |||||||| | |||||||| TAAGGTTGAG AATCGAAGAG GATAATAAGG CTGCAGAAAA GAGGTCACGT GGTAATTCAG 60 CAATATTTGG AGTAAATTTT GTTGAAGAAG ATCCCACAAA ATTAAAAAAA AGAAAGAAAA 2864 |||||| ||| |||||||||| |||||||||| || ||||||| ||| || ||| ||||||||| CAATATCTGG AGTAAATTTT GTTGAAGAAG ATTCCACAAA ATTCAAGAAA AGAAAGAAAG 120 CATCTGGTCC AAAAAGCAAT CCTCCTAAGA AGAAATTCAA TGGAAACTGC TTCAACTGTG 2924 |||||||||| |||| ||||| |||||||||| |||||||||| |||||||||| || || |||| CATCTGGTCC AAAACGCAAT CCTCCTAAGA AGAAATTCAA TGGAAACTGC TTTAATTGTG 180 GTAAACATGG TCATAGAGCT ACTGAATGCC GGGGTCCAAA GTAGGACAAG AAAAAGAAGG 2984 |||||||||| |||||| ||| | |||||||| ||||||| || | |||||||| |||||||||| GTAAACATGG TCATAGGGCT AATGAATGCC GGGGTCCTAA GAAGGACAAG AAAAAGAAGG 240 ATCAAGCAAA CTTGGCTGAA TCCAAAGGAG AAATGGACGA TCTCTGTGCA ATGCTTTTAA 3044 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||| | ATCAAGCAAA CTTGGCTGAA TCCAAAGGAG AAATGGACGA TCTCTGTGCA ATGCTTTCAG 300 AATGTAACTT GGTTGGAAAT CCAAGAGAAT GGTGGATAGA TTCTGGTGCC TCATGCCATG 3104 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AATGTAACTT GGTTGGAAAT CCAAGAGAAT GGTGGATAGA TTCTGGTGCC TCATGCCATG 360 TTTGTGCCAA CAAAGAATTA TTTTAATCAT ATACTTCAAC ACTTACAGAT GAAAAATTGT 3164 |||||||||| |||||||||| |||| ||||| ||||| | | |||||||||| |||||||| | TTTGTGCCAA CAAAGAATTA TTTTCATCAT ATACTCTAGC ACTTACAGAT GAAAAATTAT 420 TTATGGCAAA CTCCGCTGTT GCAAAGGTGG AAGGAACTGG CAAAGTCCTA TTAAAGATGA 3224 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTATGGCAAA CTCCGCTGTT GCAAAGGTGG AAGGAACTGG CAAAGTCCTA TTAAAGATGA 480 CATCAGGCAA GGTGGTGACT TTGAATAGAG TCTAATATGT TCCTGAATTG ATTAAGAATT 3284 |||||||||| ||| |||||| ||||||| | ||| |||||| ||| |||||| | || |||| CATCAGGCAA GGTAGTGACT TTGAATATGG TCTCATATGT TCCAGAATTG AGAAATAATT 540 TAGTTTCAAT TCCAGTTCTG ACCAAGAATG GATTTAAATG TGTATTTGTT TCTGATAAAG 3344 |||||||||| |||| ||||| |||||||||| |||||||||| |||||||||| |||||||||| TAGTTTCAAT TCCAATTCTG ACCAAGAATG GATTTAAATG TGTATTTGTT TCTGATAAAG 600 TAGTAGTAAG CAAAAATGAT ATGTATGTAG GAAAAGGCTA CCTTAGTGAT GG-C--TTCA 3401 |||||||||| |||||||||| |||||||||| |||||| ||| |||||||||| || | |||| TAGTAGTAAG CAAAAATGAT ATGTATGTAG GAAAAGACTA CCTTAGTGAT GGCCTTTTCA 660 AACTCGATGT AATTGCAGTT GATATGAATA AAGATTTTGA TTCTTCTTA 3450 ||||| |||| |||||||||| |||||||||| ||||||||| ||||||||| AACTCAATGT AATTGCAGTT GATATGAATA AAGATTTTGC TTCTTCTTA 709 hqPGS_C06HBa0153O03.1-8+_SGN-E369759- (2745 3450) ******************************************************************************** EST sequence 28 -strand 585 n (File: SGN-E395006-) 1 AAGAAATTCA ATGGAAACTG CTTTAATTGT GGTAAACATG GTCATAGGGG TAATGAATGC 61 CGGGGTCCTA AGAAGGACAA GAAAAAGAAG GATCAAGCAA ACTTGGCTGA ATCCAAAGGA 121 GAAATGGACG ATCTCTGTGC AATGCTTTCA GAATGTAACT TGGTTGGAAA TCCAAGAGAA 181 TGGTGGATAG ATTCTGGTGC CTCATGCCAT GTTTGTGCCA ACAAAGAATT ATTTTCATCA 241 TATACTCTAG CACTTACAGA TGAAAAATTA TTTATGGCAA ACTCCGCTGT TGCAAAGGTG 301 GAAGGAACTG GCAAAGTCCT ATTAAAGATG ACATCAGGCA AGGTAGTGAC TTTGAATATG 361 GTCTCATATG TTCCAGAATT GAGAAATAAT TTAGTTTCAA TTCCAATTCT GACCAAGAAT 421 GGATTTAAAT GTGTATTTGT TTTTGATAAA GTAGTAGTAA GCAAAAATGA TATGTATGTA 481 GGAAAAGACT ACCTTAGTGA TGGCCTTTTC AAACTCAATG TAATTGCAGT TGATATGAAT 541 AAAGATTTTG CTTCTTTTTA AAAAAAAAAA AAAAAAAAAA CTCGA Predicted gene structure (within gDNA segment 1627 to 3825): Exon 1 2894 3450 ( 557 n); cDNA 1 560 ( 560 n); score: 0.936 PPA cDNA 561 581 MATCH C06HBa0153O03.1-8+ SGN-E395006- 0.936 557 0.952 C PGS_C06HBa0153O03.1-8+_SGN-E395006- (2894 3450) Alignment (genomic DNA sequence = upper lines): AAGAAATTCA ATGGAAACTG CTTCAACTGT GGTAAACATG GTCATAGAGC TACTGAATGC 2953 |||||||||| |||||||||| ||| || ||| |||||||||| ||||||| | || ||||||| AAGAAATTCA ATGGAAACTG CTTTAATTGT GGTAAACATG GTCATAGGGG TAATGAATGC 60 CGGGGTCCAA AGTAGGACAA GAAAAAGAAG GATCAAGCAA ACTTGGCTGA ATCCAAAGGA 3013 |||||||| | || ||||||| |||||||||| |||||||||| |||||||||| |||||||||| CGGGGTCCTA AGAAGGACAA GAAAAAGAAG GATCAAGCAA ACTTGGCTGA ATCCAAAGGA 120 GAAATGGACG ATCTCTGTGC AATGCTTTTA AAATGTAACT TGGTTGGAAA TCCAAGAGAA 3073 |||||||||| |||||||||| |||||||| | ||||||||| |||||||||| |||||||||| GAAATGGACG ATCTCTGTGC AATGCTTTCA GAATGTAACT TGGTTGGAAA TCCAAGAGAA 180 TGGTGGATAG ATTCTGGTGC CTCATGCCAT GTTTGTGCCA ACAAAGAATT ATTTTAATCA 3133 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||| |||| TGGTGGATAG ATTCTGGTGC CTCATGCCAT GTTTGTGCCA ACAAAGAATT ATTTTCATCA 240 TATACTTCAA CACTTACAGA TGAAAAATTG TTTATGGCAA ACTCCGCTGT TGCAAAGGTG 3193 |||||| | |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| TATACTCTAG CACTTACAGA TGAAAAATTA TTTATGGCAA ACTCCGCTGT TGCAAAGGTG 300 GAAGGAACTG GCAAAGTCCT ATTAAAGATG ACATCAGGCA AGGTGGTGAC TTTGAATAGA 3253 |||||||||| |||||||||| |||||||||| |||||||||| |||| ||||| |||||||| GAAGGAACTG GCAAAGTCCT ATTAAAGATG ACATCAGGCA AGGTAGTGAC TTTGAATATG 360 GTCTAATATG TTCCTGAATT GATTAAGAAT TTAGTTTCAA TTCCAGTTCT GACCAAGAAT 3313 |||| ||||| |||| ||||| || || ||| |||||||||| ||||| |||| |||||||||| GTCTCATATG TTCCAGAATT GAGAAATAAT TTAGTTTCAA TTCCAATTCT GACCAAGAAT 420 GGATTTAAAT GTGTATTTGT TTCTGATAAA GTAGTAGTAA GCAAAAATGA TATGTATGTA 3373 |||||||||| |||||||||| || ||||||| |||||||||| |||||||||| |||||||||| GGATTTAAAT GTGTATTTGT TTTTGATAAA GTAGTAGTAA GCAAAAATGA TATGTATGTA 480 GGAAAAGGCT ACCTTAGTGA TGG-C--TTC AAACTCGATG TAATTGCAGT TGATATGAAT 3430 ||||||| || |||||||||| ||| | ||| |||||| ||| |||||||||| |||||||||| GGAAAAGACT ACCTTAGTGA TGGCCTTTTC AAACTCAATG TAATTGCAGT TGATATGAAT 540 AAAGATTTTG ATTCTTCTTA 3450 |||||||||| ||||| ||| AAAGATTTTG CTTCTTTTTA 560 hqPGS_C06HBa0153O03.1-8+_SGN-E395006- (2894 3450) ******************************************************************************** EST sequence 26 -strand 562 n (File: SGN-E250407-) 1 AATTGTGGTA AACATGGTCA TAGGGCTAAT GAAAGCCGGG GTCTTAAGAA GGACAAGATA 61 AAGAAGGATC AAGCAAACTT GGGTGAATCC AAAGGAGAAA TGGACGATCT CTGTGCAATG 121 CTTTCAGAAT GTAACTTGGT TGGAAATCCA AGAGAATGGT GGATAGATTC TGGTGCCTCA 181 TGCCATGTTT GTGCCAACAA AGAATTATTT TCATCATATA CTCTAGCAGT TACAGATGAG 241 AAATTATTTA TGGCCAACTC CGCTGTTGCA AAGGTGTAAG GAACTGGCAA AGTCCTATTA 301 AAGATGACAT CAGGCAAGGT AGTGACTTTG AATATGGTCT CATATGTTCC AGAATTGAGA 361 AATAATTTAG TTTCAATTCC AATTCTGACC AAGAATGGAT TTAAATGTGT ATTTGTTTCT 421 GATAAAGTAG TAGTAAGCAA AAATGATATG TATGTAGGAA AAGACTACCT TAGTGATGGC 481 CTTTTCAAAC TCAATGTAAT TGCAGTTCAT ATGAATAAAG ATTTTGCTTC TTATTAAAAA 541 AAAAAAAAAA AAAAGACTCG AC Predicted gene structure (within gDNA segment 1777 to 3825): Exon 1 2918 3450 ( 533 n); cDNA 1 536 ( 536 n); score: 0.922 PPA cDNA 537 556 MATCH C06HBa0153O03.1-8+ SGN-E250407- 0.922 533 0.948 C PGS_C06HBa0153O03.1-8+_SGN-E250407- (2918 3450) Alignment (genomic DNA sequence = upper lines): AACTGTGGTA AACATGGTCA TAGAGCTACT GAATGCCGGG GTCCAAAGTA GGACAAGAAA 2977 || ||||||| |||||||||| ||| |||| | ||| |||||| ||| ||| | |||||||| | AATTGTGGTA AACATGGTCA TAGGGCTAAT GAAAGCCGGG GTCTTAAGAA GGACAAGATA 60 AAGAAGGATC AAGCAAACTT GGCTGAATCC AAAGGAGAAA TGGACGATCT CTGTGCAATG 3037 |||||||||| |||||||||| || ||||||| |||||||||| |||||||||| |||||||||| AAGAAGGATC AAGCAAACTT GGGTGAATCC AAAGGAGAAA TGGACGATCT CTGTGCAATG 120 CTTTTAAAAT GTAACTTGGT TGGAAATCCA AGAGAATGGT GGATAGATTC TGGTGCCTCA 3097 |||| | ||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTTCAGAAT GTAACTTGGT TGGAAATCCA AGAGAATGGT GGATAGATTC TGGTGCCTCA 180 TGCCATGTTT GTGCCAACAA AGAATTATTT TAATCATATA CTTCAACACT TACAGATGAA 3157 |||||||||| |||||||||| |||||||||| | |||||||| || | || | ||||||||| TGCCATGTTT GTGCCAACAA AGAATTATTT TCATCATATA CTCTAGCAGT TACAGATGAG 240 AAATTGTTTA TGGCAAACTC CGCTGTTGCA AAGGTGGAAG GAACTGGCAA AGTCCTATTA 3217 ||||| |||| |||| ||||| |||||||||| |||||| ||| |||||||||| |||||||||| AAATTATTTA TGGCCAACTC CGCTGTTGCA AAGGTGTAAG GAACTGGCAA AGTCCTATTA 300 AAGATGACAT CAGGCAAGGT GGTGACTTTG AATAGAGTCT AATATGTTCC TGAATTGATT 3277 |||||||||| |||||||||| ||||||||| |||| |||| ||||||||| ||||||| AAGATGACAT CAGGCAAGGT AGTGACTTTG AATATGGTCT CATATGTTCC AGAATTGAGA 360 AAGAATTTAG TTTCAATTCC AGTTCTGACC AAGAATGGAT TTAAATGTGT ATTTGTTTCT 3337 || ||||||| |||||||||| | |||||||| |||||||||| |||||||||| |||||||||| AATAATTTAG TTTCAATTCC AATTCTGACC AAGAATGGAT TTAAATGTGT ATTTGTTTCT 420 GATAAAGTAG TAGTAAGCAA AAATGATATG TATGTAGGAA AAGGCTACCT TAGTGATGG- 3396 |||||||||| |||||||||| |||||||||| |||||||||| ||| |||||| ||||||||| GATAAAGTAG TAGTAAGCAA AAATGATATG TATGTAGGAA AAGACTACCT TAGTGATGGC 480 CT--TCAAAC TCGATGTAAT TGCAGTTGAT ATGAATAAAG ATTTTGATTC TTCTTA 3450 || |||||| || ||||||| ||||||| || |||||||||| |||||| ||| || ||| CTTTTCAAAC TCAATGTAAT TGCAGTTCAT ATGAATAAAG ATTTTGCTTC TTATTA 536 hqPGS_C06HBa0153O03.1-8+_SGN-E250407- (2918 3450) ******************************************************************************** EST sequence 27 -strand 524 n (File: SGN-E375520-) 1 ATGCCGGGGT CCTAAGAAGG ACAAGAAAAA GAAGGATCAA GCAAACTTGG CTGAATCCAA 61 AGGAGAAATG GACGATCTCT GTGCAATGCT TTCAGAATGT AACTTGGTTG GAAATCCAAG 121 AGAATGGTGG ATAGATTTTG GTGCCTCATG CCATGTTTGT GCCAACAAAG AATTATTTTC 181 ATCATATACT TTAGCACTTA CAGATGAAAA ATTATTTATG GCAAACTCCG CTGTTGCAAA 241 GGTGGAAGGA ACTGGCAAAG TCCTATTAAA GATGACATCA GGCAAGGTAG TGACTTTGAA 301 TATGGTTTCA TATGTTCCAG AATTGAGAAA TAATTTAGTT TCAATTCCAA TTTTGACCAA 361 GAATGGATTT AAATGTGTAT TTGTTTTTGA TAAAGTAGTA GTAAGCAAAA ATGATATGTA 421 TGTAGGAAAA GACTACCTTA GTGATGGCCT TTTCAAACTC AATGTAATTG CAGTTGATAT 481 GAATAAAGAT TTTGCTTTTT TTTAAAAAAA AAAAAAAAAA AAAA Predicted gene structure (within gDNA segment 2187 to 3825): Exon 1 2950 3440 ( 491 n); cDNA 1 494 ( 494 n); score: 0.938 PPA cDNA 504 524 MATCH C06HBa0153O03.1-8+ SGN-E375520- 0.938 491 0.937 C PGS_C06HBa0153O03.1-8+_SGN-E375520- (2950 3440) Alignment (genomic DNA sequence = upper lines): ATGCCGGGGT CCAAAGTAGG ACAAGAAAAA GAAGGATCAA GCAAACTTGG CTGAATCCAA 3009 |||||||||| || ||| ||| |||||||||| |||||||||| |||||||||| |||||||||| ATGCCGGGGT CCTAAGAAGG ACAAGAAAAA GAAGGATCAA GCAAACTTGG CTGAATCCAA 60 AGGAGAAATG GACGATCTCT GTGCAATGCT TTTAAAATGT AACTTGGTTG GAAATCCAAG 3069 |||||||||| |||||||||| |||||||||| || | ||||| |||||||||| |||||||||| AGGAGAAATG GACGATCTCT GTGCAATGCT TTCAGAATGT AACTTGGTTG GAAATCCAAG 120 AGAATGGTGG ATAGATTCTG GTGCCTCATG CCATGTTTGT GCCAACAAAG AATTATTTTA 3129 |||||||||| ||||||| || |||||||||| |||||||||| |||||||||| ||||||||| AGAATGGTGG ATAGATTTTG GTGCCTCATG CCATGTTTGT GCCAACAAAG AATTATTTTC 180 ATCATATACT TCAACACTTA CAGATGAAAA ATTGTTTATG GCAAACTCCG CTGTTGCAAA 3189 |||||||||| | | |||||| |||||||||| ||| |||||| |||||||||| |||||||||| ATCATATACT TTAGCACTTA CAGATGAAAA ATTATTTATG GCAAACTCCG CTGTTGCAAA 240 GGTGGAAGGA ACTGGCAAAG TCCTATTAAA GATGACATCA GGCAAGGTGG TGACTTTGAA 3249 |||||||||| |||||||||| |||||||||| |||||||||| |||||||| | |||||||||| GGTGGAAGGA ACTGGCAAAG TCCTATTAAA GATGACATCA GGCAAGGTAG TGACTTTGAA 300 TAGAGTCTAA TATGTTCCTG AATTGATTAA GAATTTAGTT TCAATTCCAG TTCTGACCAA 3309 || || | | |||||||| | |||||| || ||||||||| ||||||||| || ||||||| TATGGTTTCA TATGTTCCAG AATTGAGAAA TAATTTAGTT TCAATTCCAA TTTTGACCAA 360 GAATGGATTT AAATGTGTAT TTGTTTCTGA TAAAGTAGTA GTAAGCAAAA ATGATATGTA 3369 |||||||||| |||||||||| |||||| ||| |||||||||| |||||||||| |||||||||| GAATGGATTT AAATGTGTAT TTGTTTTTGA TAAAGTAGTA GTAAGCAAAA ATGATATGTA 420 TGTAGGAAAA GGCTACCTTA GTGATGG-C- -TTCAAACTC GATGTAATTG CAGTTGATAT 3426 |||||||||| | |||||||| ||||||| | ||||||||| ||||||||| |||||||||| TGTAGGAAAA GACTACCTTA GTGATGGCCT TTTCAAACTC AATGTAATTG CAGTTGATAT 480 GAATAAAGAT TTTG 3440 |||||||||| |||| GAATAAAGAT TTTG 494 hqPGS_C06HBa0153O03.1-8+_SGN-E375520- (2950 3440) ******************************************************************************** EST sequence 29 -strand 236 n (File: SGN-E398572-) 1 GTTTTATTTG TTCCCGATTT GGGAAGTAAT TTAGTTTCAA TTCCAATTTT GACCAAGAAT 61 GGATTTAAAG GGGTTTTTGT TTTTGATAAA GTAGTAGTAA GCAAAAATGA TATGTATGTA 121 GGAAAAGACT ACCTTGGGGA TGGCCTTTTC AAACTCAATG TAATTGCAGT TGATATGAAT 181 AAAGATTTTG CTTTTTTTTT AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA Predicted gene structure (within gDNA segment 2203 to 3825): Exon 1 3254 3440 ( 187 n); cDNA 1 190 ( 190 n); score: 0.853 PPA cDNA 201 236 MATCH C06HBa0153O03.1-8+ SGN-E398572- 0.853 187 0.792 C PGS_C06HBa0153O03.1-8+_SGN-E398572- (3254 3440) Alignment (genomic DNA sequence = upper lines): GTCTAATATG TTCCTGAATT GATTAAGAAT TTAGTTTCAA TTCCAGTTCT GACCAAGAAT 3313 || | || || |||| || || | | ||| |||||||||| ||||| || | |||||||||| GTTTTATTTG TTCCCGATTT GGGAAGTAAT TTAGTTTCAA TTCCAATTTT GACCAAGAAT 60 GGATTTAAAT GTGTATTTGT TTCTGATAAA GTAGTAGTAA GCAAAAATGA TATGTATGTA 3373 ||||||||| | || ||||| || ||||||| |||||||||| |||||||||| |||||||||| GGATTTAAAG GGGTTTTTGT TTTTGATAAA GTAGTAGTAA GCAAAAATGA TATGTATGTA 120 GGAAAAGGCT ACCTTAGTGA TGG-C--TTC AAACTCGATG TAATTGCAGT TGATATGAAT 3430 ||||||| || ||||| | || ||| | ||| |||||| ||| |||||||||| |||||||||| GGAAAAGACT ACCTTGGGGA TGGCCTTTTC AAACTCAATG TAATTGCAGT TGATATGAAT 180 AAAGATTTTG 3440 |||||||||| AAAGATTTTG 190 hqPGS_C06HBa0153O03.1-8+_SGN-E398572- (3254 3440) Total number of EST alignments reported: 31 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 3825: PGL 1 (+ strand): 735 3450 AGS-1 (735 773,1903 2549,2622 2733) SCR (e 0.769 d 0.000 a 0.000,e 0.932 d 1.000 a 0.986,e 0.946) Exon 1 735 773 ( 39 n); score: 0.769 Intron 1 774 1902 (1129 n); Pd: 0.000 Pa: 0.000 Exon 2 1903 2549 ( 647 n); score: 0.932 Intron 2 2550 2621 ( 72 n); Pd: 1.000 Pa: 0.986 Exon 3 2622 2733 ( 112 n); score: 0.946 PGS (735 773,1903 2270) SGN-E553348- PGS (2064 2549,2622 2733) SGN-E251204+ 3-phase translation of AGS-1 (+strand): . . . . : . . 735 TTTTTCTTATTTTTTTCCTTCTTTATTTCTTTCTTCTTT : TTAACGATTAAAAGAACATAA F F L F F S F F I S F F F : L T I K R T - F S Y F F P S L F L S S F : - R L K E H K F L I F F L L Y F F L L : F N D - K N I . . . . . . 1924 AAACTTTATCGTTAAATCAGAAACAGTCTGTGTAACGATTTGTTCTTTACTGTTAGTATT K L Y R - I R N S L C N D L F F T V S I N F I V K S E T V C V T I C S L L L V F K T L S L N Q K Q S V - R F V L Y C - Y . . . . . . 1984 TCAAATACTTAAGTTATGTGCCATTTGTGACAGAAAAAAAGAAAAAATTACTAATTCAAA S N T - V M C H L - Q K K R K N Y - F K Q I L K L C A I C D R K K E K I T N S N F K Y L S Y V P F V T E K K K K L L I Q . . . . . . 2044 TCAAACAAATGTTGGAACAGTAAGTGCTACAAGAGTTTGTGCTGCAATAACATCGGTTGC S N K C W N S K C Y K S L C C N N I G C Q T N V G T V S A T R V C A A I T S V A I K Q M L E Q - V L Q E F V L Q - H R L . . . . . . 2104 ACATAATAATTCAAATGCTGCCTTAGCGCCGGCTGAGAAACCTGCAAAATTTTCTGGAGT T - - F K C C L S A G - E T C K I F W S H N N S N A A L A P A E K P A K F S G V H I I I Q M L P - R R L R N L Q N F L E . . . . . . 2164 CGACTTTAAGAGATGGCAGCAGAAGATGTTCTTCTATCTCACTACGTTGAGTCTGCAGAA R L - E M A A E D V L L S H Y V E S A E D F K R W Q Q K M F F Y L T T L S L Q K S T L R D G S R R C S S I S L R - V C R . . . . . . 2224 GTTCATTAATGAGAATGTTCCTGTTATGTCAGATGAAACTCCGCCTGATGAACGATTCTT V H - - E C S C Y V R - N S A - - T I L F I N E N V P V M S D E T P P D E R F L S S L M R M F L L C Q M K L R L M N D S . . . . . . 2284 GGTAACACAAGCATGGACACACTCAGATTTTTTGTGTAAAAATTATATTTTGAGTGGCCT G N T S M D T L R F F V - K L Y F E W P V T Q A W T H S D F L C K N Y I L S G L W - H K H G H T Q I F C V K I I F - V A . . . . . . 2344 ACAAGATGATCTGTACAATGTGTACAGCAATGTCAAAACCTTAAAAGAACTCTGGGATGC T R - S V Q C V Q Q C Q N L K R T L G C Q D D L Y N V Y S N V K T L K E L W D A Y K M I C T M C T A M S K P - K N S G M . . . . . . 2404 TTTAGAAAAGAAGTACAAAACAGAAGATGCCAGAATGAAGAAATTCATCATGGCAAAATT F R K E V Q N R R C Q N E E I H H G K I L E K K Y K T E D A R M K K F I M A K F L - K R S T K Q K M P E - R N S S W Q N . . . . . . 2464 TCTGGACTATAAGATGATAGACAGTAAGACTGTAGTCACCCAAGTTCAAGAACTGCAGGT S G L - D D R Q - D C S H P S S R T A G L D Y K M I D S K T V V T Q V Q E L Q V F W T I R - - T V R L - S P K F K N C R . . . : . . . 2524 CATAATCCATGATCTCCTTGCTGAAG : GATTGATTGTGAATGATGCCTTTCAAGTGGCTGC H N P - S P C - R : I D C E - C L S S G C I I H D L L A E : G L I V N D A F Q V A A S - S M I S L L K : D - L - M M P F K W L . . . . . . 2656 AATTATTGAAAACTTACCTCCATTGTTGAAGGACTTCAAAAACTACTTGAAACACAAACG N Y - K L T S I V E G L Q K L L E T Q T I I E N L P P L L K D F K N Y L K H K R Q L L K T Y L H C - R T S K T T - N T N . . 2716 CAAGGAGATGACTGTTGA Q G D D C - K E M T V A R R - L L Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-8+_PGL-1_AGS-1_PPS_1 (1907 2549,2622 2731) (frame '2'; 753 bp, 251 residues) 1 RLKEHKNFIV KSETVCVTIC SLLLVFQILK LCAICDRKKE KITNSNQTNV GTVSATRVCA 61 AITSVAHNNS NAALAPAEKP AKFSGVDFKR WQQKMFFYLT TLSLQKFINE NVPVMSDETP 121 PDERFLVTQA WTHSDFLCKN YILSGLQDDL YNVYSNVKTL KELWDALEKK YKTEDARMKK 181 FIMAKFLDYK MIDSKTVVTQ VQELQVIIHD LLAEGLIVND AFQVAAIIEN LPPLLKDFKN 241 YLKHKRKEMT V AGS-2 (936 1385) SCR (e 0.856) Exon 1 936 1385 ( 450 n); score: 0.856 PGS (936 1385) SGN-E542859+ PGS (936 1325) SGN-E301820+ PGS (936 1296) SGN-E301922+ PGS (937 1299) SGN-E548743+ 3-phase translation of AGS-2 (+strand): . . . . . . 936 GTGATTAGTGTTGAGTTTAGCAAGTGTGAATGAGAAAAGAAAAGAGAGAATATGAAAAGT V I S V E F S K C E - E K K R E N M K S - L V L S L A S V N E K R K E R I - K V D - C - V - Q V - M R K E K R E Y E K . . . . . . 996 GAGGGAACTATTTTGGAGGGAAAATGAAAAGTCATTTGCAAAGTGCAACGAAAAATCATT E G T I L E G K - K V I C K V Q R K I I R E L F W R E N E K S F A K C N E K S F - G N Y F G G K M K S H L Q S A T K N H . . . . . . 1056 TCTCCCATATTGGCAAAAGAAAGGGAAATTGTTGTCCTTATATAAGGAAATACTTCCATT S P I L A K E R E I V V L I - G N T S I L P Y W Q K K G K L L S L Y K E I L P L F S H I G K R K G N C C P Y I R K Y F H . . . . . . 1116 ACTTCTTAAAGAGCTAAGAAGAAGATGCCCCTCACATCGTCATTGCTCGCCCGGCTTCGG T S - R A K K K M P L T S S L L A R L R L L K E L R R R C P S H R H C S P G F G Y F L K S - E E D A P H I V I A R P A S . . . . . . 1176 CTTCGGCTTCGGCTTCGGATTTAGATTTGGCAAATGGTTTGATTGATAAATTTTTTGGAC L R L R L R I - I W Q M V - L I N F L D F G F G F G F R F G K W F D - - I F W T A S A S A S D L D L A N G L I D K F F G . . . . . . 1236 AAAATTTATTTAATCAGTTTTTGTTAAATCAAATAAATCCTGTTAATATTATCTCTTATA K I Y L I S F C - I K - I L L I L S L I K F I - S V F V K S N K S C - Y Y L L - Q N L F N Q F L L N Q I N P V N I I S Y . . . . . . 1296 AATTTGCGGATAACGGTAACATTTCGAAAAGTTGTTACTCTTTCCGATAAGTCGTTAATT N L R I T V T F R K V V T L S D K S L I I C G - R - H F E K L L L F P I S R - F K F A D N G N I S K S C Y S F R - V V N . . . 1356 TTTGAAAAGCCGTTATTTTTCTAACAGACA F E K P L F F - Q T L K S R Y F S N R F - K A V I F L T D Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-8+_PGL-1_AGS-2_PPS_1 (991 1221) (frame '2'; 228 bp, 76 residues) 1 KVRELFWREN EKSFAKCNEK SFLPYWQKKG KLLSLYKEIL PLLLKELRRR CPSHRHCSPG 61 FGFGFGFGFR FGKWFD- >C06HBa0153O03.1-8+_PGL-1_AGS-2_PPS_2 (1133 1345) (frame '0'; 210 bp, 70 residues) 1 EEDAPHIVIA RPASASASAS DLDLANGLID KFFGQNLFNQ FLLNQINPVN IISYKFADNG 61 NISKSCYSFR - 3-phase translation of AGS-2 (-strand): . . . . . . 1385 TGTCTGTTAGAAAAATAACGGCTTTTCAAAAATTAACGACTTATCGGAAAGAGTAACAAC C L L E K - R L F K N - R L I G K S N N V C - K N N G F S K I N D L S E R V T T S V R K I T A F Q K L T T Y R K E - Q . . . . . . 1325 TTTTCGAAATGTTACCGTTATCCGCAAATTTATAAGAGATAATATTAACAGGATTTATTT F S K C Y R Y P Q I Y K R - Y - Q D L F F R N V T V I R K F I R D N I N R I Y L L F E M L P L S A N L - E I I L T G F I . . . . . . 1265 GATTTAACAAAAACTGATTAAATAAATTTTGTCCAAAAAATTTATCAATCAAACCATTTG D L T K T D - I N F V Q K I Y Q S N H L I - Q K L I K - I L S K K F I N Q T I C - F N K N - L N K F C P K N L S I K P F . . . . . . 1205 CCAAATCTAAATCCGAAGCCGAAGCCGAAGCCGAAGCCGGGCGAGCAATGACGATGTGAG P N L N P K P K P K P K P G E Q - R C E Q I - I R S R S R S R S R A S N D D V R A K S K S E A E A E A E A G R A M T M - . . . . . . 1145 GGGCATCTTCTTCTTAGCTCTTTAAGAAGTAATGGAAGTATTTCCTTATATAAGGACAAC G H L L L S S L R S N G S I S L Y K D N G I F F L A L - E V M E V F P Y I R T T G A S S S - L F K K - W K Y F L I - G Q . . . . . . 1085 AATTTCCCTTTCTTTTGCCAATATGGGAGAAATGATTTTTCGTTGCACTTTGCAAATGAC N F P F F C Q Y G R N D F S L H F A N D I S L S F A N M G E M I F R C T L Q M T Q F P F L L P I W E K - F F V A L C K - . . . . . . 1025 TTTTCATTTTCCCTCCAAAATAGTTCCCTCACTTTTCATATTCTCTCTTTTCTTTTCTCA F S F S L Q N S S L T F H I L S F L F S F H F P S K I V P S L F I F S L F F S H L F I F P P K - F P H F S Y S L F S F L . . . 965 TTCACACTTGCTAAACTCAACACTAATCAC F T L A K L N T N H S H L L N S T L I I H T C - T Q H - S Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-8-_PGL-1_AGS-2_PPS_1 (1154 936) (frame '1'; 219 bp, 73 residues) 1 RCEGHLLLSS LRSNGSISLY KDNNFPFFCQ YGRNDFSLHF ANDFSFSLQN SSLTFHILSF 61 LFSFTLAKLN TNH AGS-3 (937 1056,1182 1638,2035 2616) SCR (e 0.858 d 0.000 a 0.000,e 0.858 d 0.746 a 0.000,e 0.858) Exon 1 937 1056 ( 120 n); score: 0.858 Intron 1 1057 1181 ( 125 n); Pd: 0.000 Pa: 0.000 Exon 2 1182 1638 ( 457 n); score: 0.858 Intron 2 1639 2034 ( 396 n); Pd: 0.746 Pa: 0.000 Exon 3 2035 2616 ( 582 n); score: 0.858 PGS (937 1056,1182 1495) SGN-E395007+ PGS (937 1056,1182 1392) SGN-E250408+ PGS (1447 1638,2035 2616) SGN-E542858- 3-phase translation of AGS-3 (+strand): . . . . . . 937 TGATTAGTGTTGAGTTTAGCAAGTGTGAATGAGAAAAGAAAAGAGAGAATATGAAAAGTG - L V L S L A S V N E K R K E R I - K V D - C - V - Q V - M R K E K R E Y E K - I S V E F S K C E - E K K R E N M K S . . . . . . : 997 AGGGAACTATTTTGGAGGGAAAATGAAAAGTCATTTGCAAAGTGCAACGAAAAATCATTT : R E L F W R E N E K S F A K C N E K S F : G N Y F G G K M K S H L Q S A T K N H F : E G T I L E G K - K V I C K V Q R K I I : . . . . . . 1182 CTTCGGCTTCGGATTTAGATTTGGCAAATGGTTTGATTGATAAATTTTTTGGACAAAATT L R L R I - I W Q M V - L I N F L D K I F G F G F R F G K W F D - - I F W T K F S S A S D L D L A N G L I D K F F G Q N . . . . . . 1242 TATTTAATCAGTTTTTGTTAAATCAAATAAATCCTGTTAATATTATCTCTTATAAATTTG Y L I S F C - I K - I L L I L S L I N L I - S V F V K S N K S C - Y Y L L - I C L F N Q F L L N Q I N P V N I I S Y K F . . . . . . 1302 CGGATAACGGTAACATTTCGAAAAGTTGTTACTCTTTCCGATAAGTCGTTAATTTTTGAA R I T V T F R K V V T L S D K S L I F E G - R - H F E K L L L F P I S R - F L K A D N G N I S K S C Y S F R - V V N F - . . . . . . 1362 AAGCCGTTATTTTTCTAACAGACACATTTTTCTGAAAAGTTGTTATTTTTTCCAAAAGAC K P L F F - Q T H F S E K L L F F P K D S R Y F S N R H I F L K S C Y F F Q K T K A V I F L T D T F F - K V V I F S K R . . . . . . 1422 ACAACTTTCTTGATAAAACGGGTCTGAACAGATTTCTCTGAACAGACACGTTTCTTGCTG T T F L I K R V - T D F S E Q T R F L L Q L S - - N G S E Q I S L N R H V S C - H N F L D K T G L N R F L - T D T F L A . . . . . . 1482 AAAGTGGCTATAAAAGGAAGTCAATTTTTGATTTTTCAAACACTGAAATTTTCCTTCTCT K V A I K G S Q F L I F Q T L K F S F S K W L - K E V N F - F F K H - N F P S L E S G Y K R K S I F D F S N T E I F L L . . . . . . 1542 GCATATATTTTTCTCTCAAATGAATCAAAGTGTCGATCGACTGAATTTGTGTGACTTGTT A Y I F L S N E S K C R S T E F V - L V H I F F S Q M N Q S V D R L N L C D L L C I Y F S L K - I K V S I D - I C V T C . . . . : . . 1602 GCTGTTCTGAAGTTCGTTGAAGTTAAAGAAATTTGAG : TAATTCAAATCAAACAAATGTTG A V L K F V E V K E I - : V I Q I K Q M L L F - S S L K L K K F E : - F K S N K C W C C S E V R - S - R N L S : N S N Q T N V . . . . . . 2058 GAACAGTAAGTGCTACAAGAGTTTGTGCTGCAATAACATCGGTTGCACATAATAATTCAA E Q - V L Q E F V L Q - H R L H I I I Q N S K C Y K S L C C N N I G C T - - F K G T V S A T R V C A A I T S V A H N N S . . . . . . 2118 ATGCTGCCTTAGCGCCGGCTGAGAAACCTGCAAAATTTTCTGGAGTCGACTTTAAGAGAT M L P - R R L R N L Q N F L E S T L R D C C L S A G - E T C K I F W S R L - E M N A A L A P A E K P A K F S G V D F K R . . . . . . 2178 GGCAGCAGAAGATGTTCTTCTATCTCACTACGTTGAGTCTGCAGAAGTTCATTAATGAGA G S R R C S S I S L R - V C R S S L M R A A E D V L L S H Y V E S A E V H - - E W Q Q K M F F Y L T T L S L Q K F I N E . . . . . . 2238 ATGTTCCTGTTATGTCAGATGAAACTCCGCCTGATGAACGATTCTTGGTAACACAAGCAT M F L L C Q M K L R L M N D S W - H K H C S C Y V R - N S A - - T I L G N T S M N V P V M S D E T P P D E R F L V T Q A . . . . . . 2298 GGACACACTCAGATTTTTTGTGTAAAAATTATATTTTGAGTGGCCTACAAGATGATCTGT G H T Q I F C V K I I F - V A Y K M I C D T L R F F V - K L Y F E W P T R - S V W T H S D F L C K N Y I L S G L Q D D L . . . . . . 2358 ACAATGTGTACAGCAATGTCAAAACCTTAAAAGAACTCTGGGATGCTTTAGAAAAGAAGT T M C T A M S K P - K N S G M L - K R S Q C V Q Q C Q N L K R T L G C F R K E V Y N V Y S N V K T L K E L W D A L E K K . . . . . . 2418 ACAAAACAGAAGATGCCAGAATGAAGAAATTCATCATGGCAAAATTTCTGGACTATAAGA T K Q K M P E - R N S S W Q N F W T I R Q N R R C Q N E E I H H G K I S G L - D Y K T E D A R M K K F I M A K F L D Y K . . . . . . 2478 TGATAGACAGTAAGACTGTAGTCACCCAAGTTCAAGAACTGCAGGTCATAATCCATGATC - - T V R L - S P K F K N C R S - S M I D R Q - D C S H P S S R T A G H N P - S M I D S K T V V T Q V Q E L Q V I I H D . . . . . . 2538 TCCTTGCTGAAGGTATAAATTTATTTAATACCTATGTTAAAAATATTAAGTTTTTCCTTA S L L K V - I Y L I P M L K I L S F S L P C - R Y K F I - Y L C - K Y - V F P - L L A E G I N L F N T Y V K N I K F F L . . 2598 ATACTCACATAATCTTCAA I L T - S S Y S H N L Q N T H I I F Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-8+_PGL-1_AGS-3_PPS_1 (1628 1638,2035 2614) (frame '0'; 591 bp, 197 residues) 1 RNLSNSNQTN VGTVSATRVC AAITSVAHNN SNAALAPAEK PAKFSGVDFK RWQQKMFFYL 61 TTLSLQKFIN ENVPVMSDET PPDERFLVTQ AWTHSDFLCK NYILSGLQDD LYNVYSNVKT 121 LKELWDALEK KYKTEDARMK KFIMAKFLDY KMIDSKTVVT QVQELQVIIH DLLAEGINLF 181 NTYVKNIKFF LNTHIIF >C06HBa0153O03.1-8+_PGL-1_AGS-3_PPS_2 (1023 1056,1182 1345) (frame '0'; 195 bp, 65 residues) 1 KVICKVQRKI ISSASDLDLA NGLIDKFFGQ NLFNQFLLNQ INPVNIISYK FADNGNISKS 61 CYSFR- AGS-4 (1466 2549,2622 2718) SCR (e 0.881 d 1.000 a 0.986,e 0.933) Exon 1 1466 2549 (1084 n); score: 0.881 Intron 1 2550 2621 ( 72 n); Pd: 1.000 Pa: 0.986 Exon 2 2622 2718 ( 97 n); score: 0.933 PGS (1466 1824) SGN-E262710+ PGS (1466 1824) SGN-E255327+ PGS (1466 1824) SGN-E369760+ PGS (1466 1688) SGN-E261310+ PGS (1475 1807) SGN-E262800+ PGS (1496 1824) SGN-E254845+ PGS (1497 1824) SGN-E273518+ PGS (1527 1807) SGN-E258205+ PGS (1804 2316) SGN-E261066+ PGS (1804 2277) SGN-E263584+ PGS (1804 2246) SGN-E276669+ PGS (1809 2054) SGN-E241550+ PGS (1814 2338) SGN-E253427+ PGS (1818 2549,2622 2643) SGN-E262550+ PGS (1818 2284) SGN-E258047+ PGS (1853 2549,2622 2718) SGN-E261540+ 3-phase translation of AGS-4 (+strand): . . . . . . 1466 GACACGTTTCTTGCTGAAAGTGGCTATAAAAGGAAGTCAATTTTTGATTTTTCAAACACT D T F L A E S G Y K R K S I F D F S N T T R F L L K V A I K G S Q F L I F Q T L H V S C - K W L - K E V N F - F F K H . . . . . . 1526 GAAATTTTCCTTCTCTGCATATATTTTTCTCTCAAATGAATCAAAGTGTCGATCGACTGA E I F L L C I Y F S L K - I K V S I D - K F S F S A Y I F L S N E S K C R S T E - N F P S L H I F F S Q M N Q S V D R L . . . . . . 1586 ATTTGTGTGACTTGTTGCTGTTCTGAAGTTCGTTGAAGTTAAAGAAATTTGAGGTACCGC I C V T C C C S E V R - S - R N L R Y R F V - L V A V L K F V E V K E I - G T A N L C D L L L F - S S L K L K K F E V P . . . . . . 1646 TATTTCTTTAACAGGCTTAATCCATCTTATCTTGGGAGAAATTAATCCATAACCGTGGGT Y F F N R L N P S Y L G R N - S I T V G I S L T G L I H L I L G E I N P - P W V L F L - Q A - S I L S W E K L I H N R G . . . . . . 1706 ACAATGAGGGGATTAAATTTCTTAAGGACACACAGTAGTTTCTGTGGACTCGAATTACTT T M R G L N F L R T H S S F C G L E L L Q - G D - I S - G H T V V S V D S N Y F Y N E G I K F L K D T Q - F L W T R I T . . . . . . 1766 CTTGTATTTATGTATTTTGTGTTTCATCTTATTTCTGTTTCTGTTAAGAAATTTAGTAAG L V F M Y F V F H L I S V S V K K F S K L Y L C I L C F I L F L F L L R N L V S S C I Y V F C V S S Y F C F C - E I - - . . . . . . 1826 TTTATGTATTTAAGGTTTCTTGAGATGAAAACCTTTATGGTTTTCTACTCTGCTTGAGTT F M Y L R F L E M K T F M V F Y S A - V L C I - G F L R - K P L W F S T L L E F V Y V F K V S - D E N L Y G F L L C L S . . . . . . 1886 TTTTAAAATTCATTCGATTAACGATTAAAAGAACATAAAAACTTTATCGTTAAATCAGAA F - N S F D - R L K E H K N F I V K S E F K I H S I N D - K N I K T L S L N Q K F L K F I R L T I K R T - K L Y R - I R . . . . . . 1946 ACAGTCTGTGTAACGATTTGTTCTTTACTGTTAGTATTTCAAATACTTAAGTTATGTGCC T V C V T I C S L L L V F Q I L K L C A Q S V - R F V L Y C - Y F K Y L S Y V P N S L C N D L F F T V S I S N T - V M C . . . . . . 2006 ATTTGTGACAGAAAAAAAGAAAAAATTACTAATTCAAATCAAACAAATGTTGGAACAGTA I C D R K K E K I T N S N Q T N V G T V F V T E K K K K L L I Q I K Q M L E Q - H L - Q K K R K N Y - F K S N K C W N S . . . . . . 2066 AGTGCTACAAGAGTTTGTGCTGCAATAACATCGGTTGCACATAATAATTCAAATGCTGCC S A T R V C A A I T S V A H N N S N A A V L Q E F V L Q - H R L H I I I Q M L P K C Y K S L C C N N I G C T - - F K C C . . . . . . 2126 TTAGCGCCGGCTGAGAAACCTGCAAAATTTTCTGGAGTCGACTTTAAGAGATGGCAGCAG L A P A E K P A K F S G V D F K R W Q Q - R R L R N L Q N F L E S T L R D G S R L S A G - E T C K I F W S R L - E M A A . . . . . . 2186 AAGATGTTCTTCTATCTCACTACGTTGAGTCTGCAGAAGTTCATTAATGAGAATGTTCCT K M F F Y L T T L S L Q K F I N E N V P R C S S I S L R - V C R S S L M R M F L E D V L L S H Y V E S A E V H - - E C S . . . . . . 2246 GTTATGTCAGATGAAACTCCGCCTGATGAACGATTCTTGGTAACACAAGCATGGACACAC V M S D E T P P D E R F L V T Q A W T H L C Q M K L R L M N D S W - H K H G H T C Y V R - N S A - - T I L G N T S M D T . . . . . . 2306 TCAGATTTTTTGTGTAAAAATTATATTTTGAGTGGCCTACAAGATGATCTGTACAATGTG S D F L C K N Y I L S G L Q D D L Y N V Q I F C V K I I F - V A Y K M I C T M C L R F F V - K L Y F E W P T R - S V Q C . . . . . . 2366 TACAGCAATGTCAAAACCTTAAAAGAACTCTGGGATGCTTTAGAAAAGAAGTACAAAACA Y S N V K T L K E L W D A L E K K Y K T T A M S K P - K N S G M L - K R S T K Q V Q Q C Q N L K R T L G C F R K E V Q N . . . . . . 2426 GAAGATGCCAGAATGAAGAAATTCATCATGGCAAAATTTCTGGACTATAAGATGATAGAC E D A R M K K F I M A K F L D Y K M I D K M P E - R N S S W Q N F W T I R - - T R R C Q N E E I H H G K I S G L - D D R . . . . . . 2486 AGTAAGACTGTAGTCACCCAAGTTCAAGAACTGCAGGTCATAATCCATGATCTCCTTGCT S K T V V T Q V Q E L Q V I I H D L L A V R L - S P K F K N C R S - S M I S L L Q - D C S H P S S R T A G H N P - S P C . : . . . . . 2546 GAAG : GATTGATTGTGAATGATGCCTTTCAAGTGGCTGCAATTATTGAAAACTTACCTCCA E : G L I V N D A F Q V A A I I E N L P P K : D - L - M M P F K W L Q L L K T Y L H - R : I D C E - C L S S G C N Y - K L T S . . . . . 2678 TTGTTGAAGGACTTCAAAAACTACTTGAAACACAAACGCAA L L K D F K N Y L K H K R C - R T S K T T - N T N A I V E G L Q K L L E T Q T Q Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-8+_PGL-1_AGS-4_PPS_1 (1907 2549,2622 2716) (frame '1'; 738 bp, 246 residues) 1 RLKEHKNFIV KSETVCVTIC SLLLVFQILK LCAICDRKKE KITNSNQTNV GTVSATRVCA 61 AITSVAHNNS NAALAPAEKP AKFSGVDFKR WQQKMFFYLT TLSLQKFINE NVPVMSDETP 121 PDERFLVTQA WTHSDFLCKN YILSGLQDDL YNVYSNVKTL KELWDALEKK YKTEDARMKK 181 FIMAKFLDYK MIDSKTVVTQ VQELQVIIHD LLAEGLIVND AFQVAAIIEN LPPLLKDFKN 241 YLKHKR AGS-5 (1810 2089,2148 2211) SCR (e 0.829 d 0.000 a 0.000,e 0.898) Exon 1 1810 2089 ( 280 n); score: 0.829 Intron 1 2090 2147 ( 58 n); Pd: 0.000 Pa: 0.000 Exon 2 2148 2211 ( 64 n); score: 0.898 PGS (1810 2089,2148 2211) SGN-E236009+ 3-phase translation of AGS-5 (+strand): . . . . . . 1810 TAAGAAATTTAGTAAGTTTATGTATTTAAGGTTTCTTGAGATGAAAACCTTTATGGTTTT - E I - - V Y V F K V S - D E N L Y G F K K F S K F M Y L R F L E M K T F M V F R N L V S L C I - G F L R - K P L W F . . . . . . 1870 CTACTCTGCTTGAGTTTTTTAAAATTCATTCGATTAACGATTAAAAGAACATAAAAACTT L L C L S F L K F I R L T I K R T - K L Y S A - V F - N S F D - R L K E H K N F S T L L E F F K I H S I N D - K N I K T . . . . . . 1930 TATCGTTAAATCAGAAACAGTCTGTGTAACGATTTGTTCTTTACTGTTAGTATTTCAAAT Y R - I R N S L C N D L F F T V S I S N I V K S E T V C V T I C S L L L V F Q I L S L N Q K Q S V - R F V L Y C - Y F K . . . . . . 1990 ACTTAAGTTATGTGCCATTTGTGACAGAAAAAAAGAAAAAATTACTAATTCAAATCAAAC T - V M C H L - Q K K R K N Y - F K S N L K L C A I C D R K K E K I T N S N Q T Y L S Y V P F V T E K K K K L L I Q I K . . . . : . . 2050 AAATGTTGGAACAGTAAGTGCTACAAGAGTTTGTGCTGCA : CAAAATTTTCTGGAGTCGAC K C W N S K C Y K S L C C : T K F S G V D N V G T V S A T R V C A A : Q N F L E S T Q M L E Q - V L Q E F V L H : K I F W S R . . . . . 2168 TTTAAGAGATGGCAGCAGAAGATGTTCTTCTATCTCACTACGTT F K R W Q Q K M F F Y L T T L R D G S R R C S S I S L R L - E M A A E D V L L S H Y V Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-8+_PGL-1_AGS-5_PPS_1 (1907 2089,2148 2210) (frame '2'; 246 bp, 82 residues) 1 RLKEHKNFIV KSETVCVTIC SLLLVFQILK LCAICDRKKE KITNSNQTNV GTVSATRVCA 61 AQNFLESTLR DGSRRCSSIS LR AGS-6 (2745 3450) SCR (e 0.940) Exon 1 2745 3450 ( 706 n); score: 0.940 PGS (2745 3450) SGN-E369759- PGS (2894 3450) SGN-E395006- PGS (2918 3450) SGN-E250407- PGS (2950 3440) SGN-E375520- PGS (3254 3440) SGN-E398572- 3-phase translation of AGS-6 (+strand): . . . . . . 2745 TAAGGTTGAGAATCGAAGATGATAATAAGGCTGCAGAAAAGAGGTCACATCGTAATTCAA - G - E S K M I I R L Q K R G H I V I Q K V E N R R - - - G C R K E V T S - F N R L R I E D D N K A A E K R S H R N S . . . . . . 2805 CAATATTTGGAGTAAATTTTGTTGAAGAAGATCCCACAAAATTAAAAAAAAGAAAGAAAA Q Y L E - I L L K K I P Q N - K K E R K N I W S K F C - R R S H K I K K K K E N T I F G V N F V E E D P T K L K K R K K . . . . . . 2865 CATCTGGTCCAAAAAGCAATCCTCCTAAGAAGAAATTCAATGGAAACTGCTTCAACTGTG H L V Q K A I L L R R N S M E T A S T V I W S K K Q S S - E E I Q W K L L Q L W T S G P K S N P P K K K F N G N C F N C . . . . . . 2925 GTAAACATGGTCATAGAGCTACTGAATGCCGGGGTCCAAAGTAGGACAAGAAAAAGAAGG V N M V I E L L N A G V Q S R T R K R R - T W S - S Y - M P G S K V G Q E K E G G K H G H R A T E C R G P K - D K K K K . . . . . . 2985 ATCAAGCAAACTTGGCTGAATCCAAAGGAGAAATGGACGATCTCTGTGCAATGCTTTTAA I K Q T W L N P K E K W T I S V Q C F - S S K L G - I Q R R N G R S L C N A F K D Q A N L A E S K G E M D D L C A M L L . . . . . . 3045 AATGTAACTTGGTTGGAAATCCAAGAGAATGGTGGATAGATTCTGGTGCCTCATGCCATG N V T W L E I Q E N G G - I L V P H A M M - L G W K S K R M V D R F W C L M P C K C N L V G N P R E W W I D S G A S C H . . . . . . 3105 TTTGTGCCAACAAAGAATTATTTTAATCATATACTTCAACACTTACAGATGAAAAATTGT F V P T K N Y F N H I L Q H L Q M K N C L C Q Q R I I L I I Y F N T Y R - K I V V C A N K E L F - S Y T S T L T D E K L . . . . . . 3165 TTATGGCAAACTCCGCTGTTGCAAAGGTGGAAGGAACTGGCAAAGTCCTATTAAAGATGA L W Q T P L L Q R W K E L A K S Y - R - Y G K L R C C K G G R N W Q S P I K D D F M A N S A V A K V E G T G K V L L K M . . . . . . 3225 CATCAGGCAAGGTGGTGACTTTGAATAGAGTCTAATATGTTCCTGAATTGATTAAGAATT H Q A R W - L - I E S N M F L N - L R I I R Q G G D F E - S L I C S - I D - E F T S G K V V T L N R V - Y V P E L I K N . . . . . . 3285 TAGTTTCAATTCCAGTTCTGACCAAGAATGGATTTAAATGTGTATTTGTTTCTGATAAAG - F Q F Q F - P R M D L N V Y L F L I K S F N S S S D Q E W I - M C I C F - - S L V S I P V L T K N G F K C V F V S D K . . . . . . 3345 TAGTAGTAAGCAAAAATGATATGTATGTAGGAAAAGGCTACCTTAGTGATGGCTTCAAAC - - - A K M I C M - E K A T L V M A S N S S K Q K - Y V C R K R L P - - W L Q T V V V S K N D M Y V G K G Y L S D G F K . . . . . 3405 TCGATGTAATTGCAGTTGATATGAATAAAGATTTTGATTCTTCTTA S M - L Q L I - I K I L I L L R C N C S - Y E - R F - F F L L D V I A V D M N K D F D S S Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-8+_PGL-1_AGS-6_PPS_1 (2747 2968) (frame '0'; 219 bp, 73 residues) 1 RLRIEDDNKA AEKRSHRNST IFGVNFVEED PTKLKKRKKT SGPKSNPPKK KFNGNCFNCG 61 KHGHRATECR GPK- >C06HBa0153O03.1-8+_PGL-1_AGS-6_PPS_2 (2850 3044) (frame '1'; 192 bp, 64 residues) 1 KKERKHLVQK AILLRRNSME TASTVVNMVI ELLNAGVQSR TRKRRIKQTW LNPKEKWTIS 61 VQCF- 3-phase translation of AGS-6 (-strand): . . . . . . 3450 TAAGAAGAATCAAAATCTTTATTCATATCAACTGCAATTACATCGAGTTTGAAGCCATCA - E E S K S L F I S T A I T S S L K P S K K N Q N L Y S Y Q L Q L H R V - S H H R R I K I F I H I N C N Y I E F E A I . . . . . . 3390 CTAAGGTAGCCTTTTCCTACATACATATCATTTTTGCTTACTACTACTTTATCAGAAACA L R - P F P T Y I S F L L T T T L S E T - G S L F L H T Y H F C L L L L Y Q K Q T K V A F S Y I H I I F A Y Y Y F I R N . . . . . . 3330 AATACACATTTAAATCCATTCTTGGTCAGAACTGGAATTGAAACTAAATTCTTAATCAAT N T H L N P F L V R T G I E T K F L I N I H I - I H S W S E L E L K L N S - S I K Y T F K S I L G Q N W N - N - I L N Q . . . . . . 3270 TCAGGAACATATTAGACTCTATTCAAAGTCACCACCTTGCCTGATGTCATCTTTAATAGG S G T Y - T L F K V T T L P D V I F N R Q E H I R L Y S K S P P C L M S S L I G F R N I L D S I Q S H H L A - C H L - - . . . . . . 3210 ACTTTGCCAGTTCCTTCCACCTTTGCAACAGCGGAGTTTGCCATAAACAATTTTTCATCT T L P V P S T F A T A E F A I N N F S S L C Q F L P P L Q Q R S L P - T I F H L D F A S S F H L C N S G V C H K Q F F I . . . . . . 3150 GTAAGTGTTGAAGTATATGATTAAAATAATTCTTTGTTGGCACAAACATGGCATGAGGCA V S V E V Y D - N N S L L A Q T W H E A - V L K Y M I K I I L C W H K H G M R H C K C - S I - L K - F F V G T N M A - G . . . . . . 3090 CCAGAATCTATCCACCATTCTCTTGGATTTCCAACCAAGTTACATTTTAAAAGCATTGCA P E S I H H S L G F P T K L H F K S I A Q N L S T I L L D F Q P S Y I L K A L H T R I Y P P F S W I S N Q V T F - K H C . . . . . . 3030 CAGAGATCGTCCATTTCTCCTTTGGATTCAGCCAAGTTTGCTTGATCCTTCTTTTTCTTG Q R S S I S P L D S A K F A - S F F F L R D R P F L L W I Q P S L L D P S F S C T E I V H F S F G F S Q V C L I L L F L . . . . . . 2970 TCCTACTTTGGACCCCGGCATTCAGTAGCTCTATGACCATGTTTACCACAGTTGAAGCAG S Y F G P R H S V A L - P C L P Q L K Q P T L D P G I Q - L Y D H V Y H S - S S V L L W T P A F S S S M T M F T T V E A . . . . . . 2910 TTTCCATTGAATTTCTTCTTAGGAGGATTGCTTTTTGGACCAGATGTTTTCTTTCTTTTT F P L N F F L G G L L F G P D V F F L F F H - I S S - E D C F L D Q M F S F F F V S I E F L L R R I A F W T R C F L S F . . . . . . 2850 TTTAATTTTGTGGGATCTTCTTCAACAAAATTTACTCCAAATATTGTTGAATTACGATGT F N F V G S S S T K F T P N I V E L R C L I L W D L L Q Q N L L Q I L L N Y D V F - F C G I F F N K I Y S K Y C - I T M . . . . . 2790 GACCTCTTTTCTGCAGCCTTATTATCATCTTCGATTCTCAACCTTA D L F S A A L L S S S I L N L T S F L Q P Y Y H L R F S T L - P L F C S L I I I F D S Q P Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-8-_PGL-1_AGS-6_PPS_1 (3146 2943) (frame '2'; 201 bp, 67 residues) 1 VLKYMIKIIL CWHKHGMRHQ NLSTILLDFQ PSYILKALHR DRPFLLWIQP SLLDPSFSCP 61 TLDPGIQ- >C06HBa0153O03.1-8-_PGL-1_AGS-6_PPS_2 (3040 2846) (frame '0'; 192 bp, 64 residues) 1 KHCTEIVHFS FGFSQVCLIL LFLVLLWTPA FSSSMTMFTT VEAVSIEFLL RRIAFWTRCF 61 LSFF- ... finished at: Mon Aug 28 22:23:57 2006 ________________________________________________________________________________ Sequence 9: C06HBa0153O03.1-9, from 1 to 3226, both strands analyzed. ... started at: Mon Aug 28 22:23:57 2006 EST library file: /tmp/cxgn-bacpublish-resources-T4dD4R/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 6 ... matches indexed, elapsed seconds = 7 HitsTableSize = 1 EST library file: /tmp/cxgn-bacpublish-resources-T4dD4R/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 0 ******************************************************************************** EST sequence 1 +strand 726 n (File: SGN-E577892+) 1 TAAAAACCGG AAAACCGAAA CACCGAACCG AACCGAAATT TTTCAGAATT TCGGTTTCGG 61 TATTTCGGTT TTCGGTTCGG TATTCGGTTT AATTTTTTGT ATTTTCGGCA TTTCGGTTCG 121 ATTTTCGGTA CATGTTACTT TGAAATTCGG TATTTCGGTT TAAACCGAAA TATTAAATTA 181 TAATTTATAA ATTATATTAT ATTTAATAAT TATTAATATT AAATAATCTT TTTTTAAAAA 241 ATAAGAAACC CTAAGTTGAA ATATCAAAGC CCATTAAGTT AAAATATCAA AGCCCATTAA 301 GTTGAAAAAT CAAACAAAAA ATTGAAAAAA TTGTTAAGTT TAAAAAGCCC ACTAAACAGG 361 CCCATTAAAA AACCGAAATA AAAAACCGAA CCGAATTAAT AAAAACCGAA CCGAATTAAC 421 AAAAACCAAA CCGAACCGAA ATATTTCGGT TCGGTATTCG GTACGCATCT CTCTCTCTCT 481 CAGTTGAAAA TGTCATCCTC TTTGCATTCT CCGCTATACT CCTCTCTCTC ATCTTCCTCA 541 GCTCTCCAGG ACAGGAAACT GAGAAATGCA GTTAATTTTG CGAAGCCGAA TTTGATTTTC 601 AAGCCTCGCC GGTCGTTTCC TCTAATCCGA GCTTCCGCTG GCTTCTCCTC ATCTCTCGAT 661 ACAGGTTTGA GTACTGAATT GGATGCTGTA GCAAATCATA GTGAGATCGT TCCAGATACA 721 GTCATT Predicted gene structure (within gDNA segment 1 to 3226): Exon 1 241 337 ( 97 n); cDNA 371 467 ( 97 n); score: 0.959 Intron 1 338 925 ( 588 n); Pd: 0.000 (s: 0.92), Pa: 0.000 (s: 0) Exon 2 926 937 ( 12 n); cDNA 468 479 ( 12 n); score: 0.917 MATCH C06HBa0153O03.1-9+ SGN-E577892+ 0.959 109 0.150 C PGS_C06HBa0153O03.1-9+_SGN-E577892+ (241 337,926 937) Alignment (genomic DNA sequence = upper lines): AACCGAAATA AAAAACCGAA CCGAATTAAT AAAAACCGAA CCGAATTAAC AAAAATCGAA 300 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||| | || AACCGAAATA AAAAACCGAA CCGAATTAAT AAAAACCGAA CCGAATTAAC AAAAACCAAA 430 CCGAACCGAA ATATTTTGGT TCGGTATTCG GTACGTAATT CTAAAAAACC GAAAAAACCG 360 |||||||||| |||||| ||| |||||||||| ||||| | CCGAACCGAA ATATTTCGGT TCGGTATTCG GTACGCA... .......... .......... 467 AAGAAAAAAA AACCGAACCG AACCGAAATA CCGAATGCCC ACCTCTAGCA GAAATGGTGA 420 .......... .......... .......... .......... .......... .......... 467 ATGAAAGGTC ATTTTTAAAA CAAAAGATAG ATGAAGGGAT TTTTAAATCT TTTCCTATAG 480 .......... .......... .......... .......... .......... .......... 467 TTTAGGAGTA TTTTTGACTC TTTCCATTAT TTAAAAGATG TTTGAGCTAG ATGTCCATTG 540 .......... .......... .......... .......... .......... .......... 467 AAATTTAACA TTTGACTTAT CGCAAATGGA AATTATCATT GCAAGTGAAT AATATATCAA 600 .......... .......... .......... .......... .......... .......... 467 AATTTCAAGA GAAATATATA TTTATGTTTA GAGGAAAACT GAACTCAAGG GTGGACCTAC 660 .......... .......... .......... .......... .......... .......... 467 AGAGGGTTAA TGAACCCATA TTGTTGAAAA ATAGCATTGT ATATAGATTT GTATTTAATC 720 .......... .......... .......... .......... .......... .......... 467 AATTTTTAAA GTGTATAAAT TAACTTTTGA CCCCACTCAT GCAAGTGTGA CCCTTAGCCG 780 .......... .......... .......... .......... .......... .......... 467 GGTGGTGAAG AGGGTTCAAT TTTTCCAGCC AGCCCAAGTT TGAATTTCTT AATAGACTAC 840 .......... .......... .......... .......... .......... .......... 467 TACAATATAA AAATAGAGAA AATGGGTAAA AACCTCTCCA ATCTATATTT GAATTTTTAA 900 .......... .......... .......... .......... .......... .......... 467 CTACACACTT TAACTTTACG GGAGTTCTAT CTCTCTC 937 ||| | ||||||| .......... .......... .....TCTCT CTCTCTC 479 hqPGS_C06HBa0153O03.1-9+_SGN-E577892+ (241 337) Total number of EST alignments reported: 1 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 3226: PGL 1 (+ strand): 241 337 AGS-1 (241 337) SCR (e 0.959) Exon 1 241 337 ( 97 n); score: 0.959 PGS (241 337) SGN-E577892+ 3-phase translation of AGS-1 (+strand): . . . . . . 241 AACCGAAATAAAAAACCGAACCGAATTAATAAAAACCGAACCGAATTAACAAAAATCGAA N R N K K P N R I N K N R T E L T K I E T E I K N R T E L I K T E P N - Q K S N P K - K T E P N - - K P N R I N K N R . . . . 301 CCGAACCGAAATATTTTGGTTCGGTATTCGGTACGTA P N R N I L V R Y S V R R T E I F W F G I R Y V T E P K Y F G S V F G T Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-1 (-strand): . . . . . . 337 TACGTACCGAATACCGAACCAAAATATTTCGGTTCGGTTCGATTTTTGTTAATTCGGTTC Y V P N T E P K Y F G S V R F L L I R F T Y R I P N Q N I S V R F D F C - F G S R T E Y R T K I F R F G S I F V N S V . . . . 277 GGTTTTTATTAATTCGGTTCGGTTTTTTATTTCGGTT G F Y - F G S V F Y F G V F I N S V R F F I S V R F L L I R F G F L F R Maximal non-overlapping open reading frames (>= 64 codons): none ... finished at: Mon Aug 28 22:24:09 2006 ________________________________________________________________________________ Sequence 10: C06HBa0153O03.1-10, from 1 to 11761, both strands analyzed. ... started at: Mon Aug 28 22:24:09 2006 EST library file: /tmp/cxgn-bacpublish-resources-T4dD4R/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 1 EST library file: /tmp/cxgn-bacpublish-resources-T4dD4R/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 9 ******************************************************************************** EST sequence 9 +strand 680 n (File: SGN-E238572+) 1 ACTAAACCTT TTTTCGGATA TCCAGCATCC TCATACTGCG ACTTTCGATT TCTAAACACG 61 AGAGTTCTGG ACCATCCAAC ACTCCACCTC ATACAACATA GTTGGTCTAA CTACTACTTT 121 GTACAATTTG TCTTTAAGTT TGGTGGCACC TTCTTATCAC ACAAGACTCC CCAATTCAGT 181 ATTGTGGACG GCGAGAGGCG ACAAGGGCCT GCCTCGCCAC ACGGCGAGGC GAGCGGCGAG 241 GCGCTTGCCT TTTTGAATAA AGGCGCCAAT TAATACCAAA AATTAAAAAT ATTTAACTGC 301 ATATTTTGTC CAAAATCTTA ATAGCAATAA CACATATTAA CAAATATTCA ATTCAAAAAC 361 CAATAGTAGA TACTAAAATG TCTAAAACTT TAAAGACCAA ACAATTCAAA ATACAATAAA 421 CCAAAACTTT AAAAGTCTAT TCTTCTTCAA ATTATCCAAC CCTTGAGTGT CATAATTCCA 481 TCCAATGTCC CTCTTTTGCT AGCAACTGCC ATTGATTATT AAATTCAATT AATTCGCTAT 541 TTAGGAATAT TGGAATAAAT AATTAGGAAA ATGGAAAGAT TTTTTTTTTA ATTAACTGCC 601 TATTATTAAA AAAAATTACA CAAAGCAGTT AGCAACTAAG CACTGCTTAC AGCCTTGCAC 661 CAAAAAAACA CAACTGAAAA Predicted gene structure (within gDNA segment 3719 to 1): Exon 1 722 482 ( 241 n); cDNA 228 466 ( 239 n); score: 0.882 MATCH C06HBa0153O03.1-10- SGN-E238572+ 0.882 241 0.354 C PGS_C06HBa0153O03.1-10-_SGN-E238572+ (722 482) Alignment (genomic DNA sequence = upper lines): GGCGAGAAGC GA-CCGCTTG CCTTTTTGAA TCAAGGCTCC AATTAATACC AAAAATTAAA 664 |||||| || || |||||| |||||||||| | ||||| || |||||||||| |||||||||| GGCGAGCGGC GAGGCGCTTG CCTTTTTGAA TAAAGGCGCC AATTAATACC AAAAATTAAA 287 AATATTTAAT TTCATATATA AATATCCAAA ATCTTAATAG TAATAACACA TATTAGCAAA 604 ||||||||| | ||||| | | |||||| |||||||||| ||||||||| ||||| |||| AATATTTAAC TGCATAT-T- -TTGTCCAAA ATCTTAATAG CAATAACACA TATTAACAAA 344 TATTCAATTC AAAAACCAAT AGTAGATACT AAAAAGTCTA AAACTTTAAA GACCAAATAA 544 |||||||||| |||||||||| |||||||||| |||| ||||| |||||||||| ||||||| || TATTCAATTC AAAAACCAAT AGTAGATACT AAAATGTCTA AAACTTTAAA GACCAAACAA 404 TTCAAAATAC AAT--ACTAA AACTTTAAAA GTCTATTCTT CTTCAAAAAC TATCCAACTC 486 |||||||||| ||| || || |||||||||| |||||||||| |||| ||| |||||||| | TTCAAAATAC AATAAACCAA AACTTTAAAA GTCTATTCTT CTTC--AAAT TATCCAACCC 462 TTGA 482 |||| TTGA 466 hqPGS_C06HBa0153O03.1-10-_SGN-E238572+ (722 482) ******************************************************************************** EST sequence 10 +strand 640 n (File: SGN-E368646+) 1 ACTAAACCTT TTTTCGGATA TCCAGCATCC TCATACTGCG ACTTTCGATT TCTAAACACG 61 AGAGTTCTGG ACCATCCAAC ACTCCACCTC ATACAACATA GTTGGTCTAA CTACTACTTT 121 GTACAATTTG TCTTTAAGTT TGGTGGCACC TTCTTATCAC ACAAGACTCC CCAATTCAGT 181 ATTGTGGACG GCGAGAGGCG ACAAGGGCCT GCCTCGCCAC ACGGCGAGGC GAGCGGCGAG 241 GCGCTTGCCT TTTTGAATAA AGGCGCCAAT TAATACCAAA AATTAAAAAT ATTTAACTGC 301 ATATTTTGTC CAAAATCTTA ATAGCAATAA CACATATTAA CAAATATTCA ATTCAAAAAC 361 CAATAGTAGA TACTAAAATG TCTAAAACTT TAAAGACCAA ACAATTCAAA ATACAATAAA 421 CCAAAACTTT AAAAGTCTAT TCTTCTTCAA ATTATCCAAC CCTTGAGTGT CATAATTCCA 481 TCCAATGTCC CTCTTTTGCT AGCAACTGCC ATTGATTATT AAATTCAATT AATTCGCTAT 541 TTAGGAATAT TGGAATAAAT AATTAGGAAA ATGGAAAGAT TTTTTTTTTA ATTAACTGCC 601 TATAATTAAA AAAAATTACA CAAAGCAGTA AGCAACTTAG Predicted gene structure (within gDNA segment 3719 to 1): Exon 1 722 482 ( 241 n); cDNA 228 466 ( 239 n); score: 0.882 MATCH C06HBa0153O03.1-10- SGN-E368646+ 0.882 241 0.377 C PGS_C06HBa0153O03.1-10-_SGN-E368646+ (722 482) Alignment (genomic DNA sequence = upper lines): GGCGAGAAGC GA-CCGCTTG CCTTTTTGAA TCAAGGCTCC AATTAATACC AAAAATTAAA 664 |||||| || || |||||| |||||||||| | ||||| || |||||||||| |||||||||| GGCGAGCGGC GAGGCGCTTG CCTTTTTGAA TAAAGGCGCC AATTAATACC AAAAATTAAA 287 AATATTTAAT TTCATATATA AATATCCAAA ATCTTAATAG TAATAACACA TATTAGCAAA 604 ||||||||| | ||||| | | |||||| |||||||||| ||||||||| ||||| |||| AATATTTAAC TGCATAT-T- -TTGTCCAAA ATCTTAATAG CAATAACACA TATTAACAAA 344 TATTCAATTC AAAAACCAAT AGTAGATACT AAAAAGTCTA AAACTTTAAA GACCAAATAA 544 |||||||||| |||||||||| |||||||||| |||| ||||| |||||||||| ||||||| || TATTCAATTC AAAAACCAAT AGTAGATACT AAAATGTCTA AAACTTTAAA GACCAAACAA 404 TTCAAAATAC AAT--ACTAA AACTTTAAAA GTCTATTCTT CTTCAAAAAC TATCCAACTC 486 |||||||||| ||| || || |||||||||| |||||||||| |||| ||| |||||||| | TTCAAAATAC AATAAACCAA AACTTTAAAA GTCTATTCTT CTTC--AAAT TATCCAACCC 462 TTGA 482 |||| TTGA 466 hqPGS_C06HBa0153O03.1-10-_SGN-E368646+ (722 482) ******************************************************************************** EST sequence 1 -strand 429 n (File: SGN-E368645-) 1 CAAAAAATTA AAAAATATTT AACTGCATAA TTTTGTCCAA AAATCTTAAT AGCAATAACA 61 CATATTAACA AATATTCAAT TCAAAAACCA ATAGTAGATA CTAAAATGTC TAAAACTTTA 121 AAGACCAAAC AATTCAAAAT ACAATAAACC AAAACTTTAA AAGTCTATTC TTCTTCAAAT 181 TATCCAACCC TTGAGTGTCA TAATTCCATC CAATGTCCCT CTTTTGCTAG CAACTGCCAT 241 TGATTATTAA ATTCAATTAA TTCGCTATTT AGGAATATTG GAATAAATAA TTAGGAAAAT 301 GGAAAGATTT TTTTTTTAAT TAACTGCCTA TAATTAAAAA AAATTACACA AAGCAGTAAG 361 CAACTAAGCA CTGCTTACAG CCTTGCACCA AAAAAGCACA ACTGAAAAAA AAAACTGAAG 421 AAGAAAAAA Predicted gene structure (within gDNA segment 1897 to 1): Exon 1 673 482 ( 192 n); cDNA 3 194 ( 192 n); score: 0.862 PPA cDNA 405 415 MATCH C06HBa0153O03.1-10- SGN-E368645- 0.862 192 0.448 C PGS_C06HBa0153O03.1-10-_SGN-E368645- (673 482) Alignment (genomic DNA sequence = upper lines): AAAAATTAAA -AATATTTAA TTTCATATAT AAATATCCAA AATCTTAATA GTAATAACAC 615 |||||||||| ||||||||| | |||| || | || |||||||||| | |||||||| AAAAATTAAA AAATATTTAA CTGCATA-AT TTTGTCCAAA AATCTTAATA GCAATAACAC 61 ATATTAGCAA ATATTCAATT CAAAAACCAA TAGTAGATAC TAAAAAGTCT AAAACTTTAA 555 |||||| ||| |||||||||| |||||||||| |||||||||| ||||| |||| |||||||||| ATATTAACAA ATATTCAATT CAAAAACCAA TAGTAGATAC TAAAATGTCT AAAACTTTAA 121 AGACCAAATA ATTCAAAATA CAAT--ACTA AAACTTTAAA AGTCTATTCT TCTTCAAAAA 497 |||||||| | |||||||||| |||| || | |||||||||| |||||||||| ||||| ||| AGACCAAACA ATTCAAAATA CAATAAACCA AAACTTTAAA AGTCTATTCT TCTTC--AAA 179 CTATCCAACT CTTGA 482 |||||||| ||||| TTATCCAACC CTTGA 194 hqPGS_C06HBa0153O03.1-10-_SGN-E368645- (673 482) ******************************************************************************** EST sequence 5 +strand 730 n (File: SGN-E374433+) 1 TATGAGCAGC ACCTCATCCT TTTCTCTCCT CCCACTCAAC TCAAACCCCT CTACATCTAC 61 CACTGCCGCT GTCGTCGGCA ACTCACCCGC CGTTCTTCAT AGTGTCAACT GTTGTACAGA 121 TTATCTATTT AGGATCGTTT CTGGCCATCG GATAAGACAA GTGGCTGTAC GGAATAGCAA 181 TCGAACGGCT GAGGTTACTT CATCCTCTGA TTCTGTAACT GATTTAGAGT CGGCGGCGAG 241 TGTGGTGAGG AAATTCTATG CCGGAATAAA TAGGCGGGAT TTGGACTCTG TCGAAGAACT 301 TATTGCTGAG GATTGTGTGT ATGAAGACCT TGTATTTCCT CAACCTTTCG TTGGCCGTAA 361 GGCAATTCTA GATTTTTTCA AGAAGTTCAC TGATTCAGTT GGTTCTGATC TGCAATTTGT 421 TATTGACGAT ATATCCAAGG AGGATTCATC AGCTGTTGGA GTCACATGGC ACTTGGAATG 481 GAGGGGAAGA CCTTTTCCTT TTAGCAAAGG ATGCAGCTTT TATCGATTGG AAGTGGTGAA 541 TGGCCAGATG AAAATACTTT ATGGCAGAGA CAGTGTGGAA CCTGCAGTCA AGCCGGGGGA 601 GACGGCATTG GTTGCGATAA GAGGCGTGGC ATGGCTGTTG CAAAAATTTC CCCAATTGGC 661 AGATCGGTTG TGATTGAAGA AGTATGATTG TTCTTATAAG ATTGAAGTTA ATTTATGAGG 721 GAAAAAAAAA Predicted gene structure (within gDNA segment 10216 to 2592): Exon 1 9616 9256 ( 361 n); cDNA 1 361 ( 361 n); score: 1.000 Intron 1 9255 5969 (3287 n); Pd: 0.992 (s: 1.00), Pa: 0.980 (s: 1.00) Exon 2 5968 5854 ( 115 n); cDNA 362 476 ( 115 n); score: 1.000 Intron 2 5853 5754 ( 100 n); Pd: 0.971 (s: 1.00), Pa: 0.967 (s: 1.00) Exon 3 5753 5672 ( 82 n); cDNA 477 558 ( 82 n); score: 1.000 Intron 3 5671 4436 (1236 n); Pd: 0.927 (s: 1.00), Pa: 0.985 (s: 1.00) Exon 4 4435 4384 ( 52 n); cDNA 559 610 ( 52 n); score: 1.000 MATCH C06HBa0153O03.1-10- SGN-E374433+ 1.000 610 0.836 C PGS_C06HBa0153O03.1-10-_SGN-E374433+ (9616 9256,5968 5854,5753 5672,4435 4384) Alignment (genomic DNA sequence = upper lines): TATGAGCAGC ACCTCATCCT TTTCTCTCCT CCCACTCAAC TCAAACCCCT CTACATCTAC 9557 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TATGAGCAGC ACCTCATCCT TTTCTCTCCT CCCACTCAAC TCAAACCCCT CTACATCTAC 60 CACTGCCGCT GTCGTCGGCA ACTCACCCGC CGTTCTTCAT AGTGTCAACT GTTGTACAGA 9497 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CACTGCCGCT GTCGTCGGCA ACTCACCCGC CGTTCTTCAT AGTGTCAACT GTTGTACAGA 120 TTATCTATTT AGGATCGTTT CTGGCCATCG GATAAGACAA GTGGCTGTAC GGAATAGCAA 9437 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTATCTATTT AGGATCGTTT CTGGCCATCG GATAAGACAA GTGGCTGTAC GGAATAGCAA 180 TCGAACGGCT GAGGTTACTT CATCCTCTGA TTCTGTAACT GATTTAGAGT CGGCGGCGAG 9377 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCGAACGGCT GAGGTTACTT CATCCTCTGA TTCTGTAACT GATTTAGAGT CGGCGGCGAG 240 TGTGGTGAGG AAATTCTATG CCGGAATAAA TAGGCGGGAT TTGGACTCTG TCGAAGAACT 9317 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGTGGTGAGG AAATTCTATG CCGGAATAAA TAGGCGGGAT TTGGACTCTG TCGAAGAACT 300 TATTGCTGAG GATTGTGTGT ATGAAGACCT TGTATTTCCT CAACCTTTCG TTGGCCGTAA 9257 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TATTGCTGAG GATTGTGTGT ATGAAGACCT TGTATTTCCT CAACCTTTCG TTGGCCGTAA 360 GGTTAGTTAT CAACCTGGAC TAGTTGTATT TTGAAGACTT TTTTCTGTTG CATTTTGTTT 9197 | G......... .......... .......... .......... .......... .......... 361 GACTATTTTC TCTCTGGTAT ATTTAGTTTA AGATGTGAGA AAAGTTCAAA CTAAGCTAGG 9137 .......... .......... .......... .......... .......... .......... 361 CTTTTCATTC AATAAAACTA AAGATGAGCT GAAACCCTTT GAAAAGAGGC CAAGATTAAA 9077 .......... .......... .......... .......... .......... .......... 361 CTGAAAATGG AAATTACATT TGGGGATAAT AACCAAGTTC GTACTTTCTT CATCCCATTT 9017 .......... .......... .......... .......... .......... .......... 361 TATGTGACAC CCTTAGATAT TTGTGTGTTT GAGATTCATT TCATTAAAGA TAAAAGGAAA 8957 .......... .......... .......... .......... .......... .......... 361 ACTTAAAAGT TAAATTGTTG CTAATCATAG TAAGGTAATA CTTCATTTTG AGACGAACTA 8897 .......... .......... .......... .......... .......... .......... 361 AAAAGGAAAT GCTCTTACAT AAAATGGAAG AGAGGGAGTA ATTAAATTTA TAGGGATTTT 8837 .......... .......... .......... .......... .......... .......... 361 GCCAAGAGAA ACATCTCCAT TTCAAAAGAT ATGATCCTAC TTTCATCAAA TTAAAAAGTC 8777 .......... .......... .......... .......... .......... .......... 361 CGCTGGTTGA TGTAAAGATT AACACCGCCC ATGTTGCATA GCATGGAGCT GAATGAATTC 8717 .......... .......... .......... .......... .......... .......... 361 ACATTCAAAT TTACGTGAGA TCAAAATAAC ATCAAAGTGT TCGATGAATG TTTTGGGAAC 8657 .......... .......... .......... .......... .......... .......... 361 TTCAACTGCA AACAAATGTC CAAGAGCAAT GGTTACTGCA AATTTATCAG ACTCTCTACT 8597 .......... .......... .......... .......... .......... .......... 361 ACTCTTAGAA GTGTAAAAAG TATGGAGGAA TGCAAAATCA TCATCATTAT ATAGTGCAAT 8537 .......... .......... .......... .......... .......... .......... 361 ATTTGGGCCC TCTCCAGTTA CAGCTACAAC ATCAACTTGA ATGATTCGCA CACATACAAC 8477 .......... .......... .......... .......... .......... .......... 361 AACAGTGAAA CAACCAACAT TTTGCAAATG TCAGAATAAC CAATAATAAC CTCCTGATGC 8417 .......... .......... .......... .......... .......... .......... 361 CCATCATAGT TCTTTATTAT GTGCTCCCTA CCTAGTGGCA GAATCAGGAT TTTCATTAAG 8357 .......... .......... .......... .......... .......... .......... 361 GGGTTCGATG GTTGTAGTAA AACGCAATAT TTATTCCCTC TCCAGTTACA GCTACAACAT 8297 .......... .......... .......... .......... .......... .......... 361 CAACTTGAAT GATTTGCACT CCTGCAACAA CAATGAAACA ACCAATATTT GCGAATGTTA 8237 .......... .......... .......... .......... .......... .......... 361 GAATAACCAA TTATAACCTC CTGATGCCCA TCATAGTTTT TTATTATGCG TTCCCTATAC 8177 .......... .......... .......... .......... .......... .......... 361 CAATTAATTG GTAGTAATCT TCTGACTACC CAACCAGCCA GTGGTGGAAC CAGGATGTTT 8117 .......... .......... .......... .......... .......... .......... 361 AGTAGGGCTT GAACACGTAA CCTCATGGAA TTTTCTTATG CCCTTAACCA ATAAACTAAA 8057 .......... .......... .......... .......... .......... .......... 361 TCTTCAACTT GTTTCAAGAG GTGTCAATAC TTGTATATAT ATTTACTAAA CCAAAATATG 7997 .......... .......... .......... .......... .......... .......... 361 ACTTCTATAT ACAATGTAAC TTTCTGACGA AGGGGTTTCG CTTCACACCT CTTGGCCAAG 7937 .......... .......... .......... .......... .......... .......... 361 GGTGGGTGCG CCCCTGGATC CAGCTTGTCT CATGTCCTAC ATGGGCTCAA ATAGAGGAAC 7877 .......... .......... .......... .......... .......... .......... 361 TAGATGCTGT GTTCAACCAG GAAAGGGCCT GCTTATCTCC CCATTAATAA TAAGATGTGC 7817 .......... .......... .......... .......... .......... .......... 361 ATCCTTTTGT AAAATCTCTA CATCTAGTTC ATCCAAGGCA TTTGGAACGG TGGAAATAAC 7757 .......... .......... .......... .......... .......... .......... 361 ATAAGCTCCA ACTGAATCAT TTCCTAATTC AGTATCTGTT TTTTTTTTTT TAATGAAAAT 7697 .......... .......... .......... .......... .......... .......... 361 CTGCTTTGAA GGTATTCTAG ACTTCTTTGT TGTCAGGCAG AACCACTGTA GTAGGAAGAA 7637 .......... .......... .......... .......... .......... .......... 361 CCCAAGGTCT TTGCCTTTTT ATATCTTTAG TTAGAAACTC CAGTGTTTGC TCTTCATCCC 7577 .......... .......... .......... .......... .......... .......... 361 AATTACCATA TTTAGGTATT TTGTGATTAT ATCAATTGCC TATGACCAGG ACACCTAGAC 7517 .......... .......... .......... .......... .......... .......... 361 ATGTGAATAT AAACAAGTGA TTCATCACTA AAATATAGCT GAAGTTTGAG AAACATGAAA 7457 .......... .......... .......... .......... .......... .......... 361 ATTGACTCTC AGAAACTCAA ACCCGATCAA GTATCTCCCA ACTGAATCTG AAAGCTTCCT 7397 .......... .......... .......... .......... .......... .......... 361 AATTGCCTCA AGTATTGAAC AAAACCCACA CCAGCCTTCT CTGAAGTTTA CCATATGTGC 7337 .......... .......... .......... .......... .......... .......... 361 TCATAACAGA AACTATTAGG AAGAGATGCT TCAAGTTTCT TGCGGTTCAT ACTTTTCTAT 7277 .......... .......... .......... .......... .......... .......... 361 AAAGAATTTC AAAGTATTGT TCAATAACAA CAATCATCCT TGAGTCAACA TGATAGAAAT 7217 .......... .......... .......... .......... .......... .......... 361 CACAACTTCT TTGTCTTAGC TGCTCATGTG ACATAGCATA GGGCCAAGGA CCAAGATCTC 7157 .......... .......... .......... .......... .......... .......... 361 ATCTTGCCAC CTCAATAACT CAGGGCTTCA GTGTTTCTCC ACCTTTAGTA GAGCTATTTC 7097 .......... .......... .......... .......... .......... .......... 361 ACAATTTTTT CGCCACAGTG CACTTGCCCA TTAAAATCTT GACGAGGATA CTCAATATGA 7037 .......... .......... .......... .......... .......... .......... 361 AAAAGGCAAA CTGTCAGCTA GCTTCCACAA CTCCATCATT CCTACATCAA TGAGTATTAT 6977 .......... .......... .......... .......... .......... .......... 361 AGCAGGAAGA AAATTATGGC CCAAACTATG AACACAAACA ACTTTTACAA ACTGCATTTG 6917 .......... .......... .......... .......... .......... .......... 361 GAGATTTGGA ATCACATCTT TGTAAAAAGT CATTTTGTTT CAATGTCCAT CTCAGTATTG 6857 .......... .......... .......... .......... .......... .......... 361 GAATTATCAC TATTAATAGT AATTCTATTC TCCTCTTTGT TGTTGTTCAA GCCTTTTCCA 6797 .......... .......... .......... .......... .......... .......... 361 TAATAGAGAA ATCACCTGAA AGCCCAGTTT TCTTCTTAGC CTTGTTACTG CTTCCATCCT 6737 .......... .......... .......... .......... .......... .......... 361 CTTATTTTTT CACCTTCTTG CACCTCTTTA AAGTTCCATT TACTATAAAT TGAAGAACCA 6677 .......... .......... .......... .......... .......... .......... 361 GTTCTCCATC ATCAACAACC TTATTGGCGT CAGACAAACT ATTCTCGATG CTCGATAATT 6617 .......... .......... .......... .......... .......... .......... 361 CTAATTCCTT GGGTTTGATG CACTTGAGCT TATGGTCTTC CAAAAACAAT CCCTCTACTT 6557 .......... .......... .......... .......... .......... .......... 361 CCATGAGGTA GTGGTAAGGT CTGCTACACT CTACCCTCCT GAGACCCTAC TTAGTGCGAT 6497 .......... .......... .......... .......... .......... .......... 361 TTCTCTGGAT ATGTTGTTGT ATCTGTTTGC TTTTCGTGAG TGAAGAATTT GGCCGTAGAG 6437 .......... .......... .......... .......... .......... .......... 361 TTCATTATCT TTGTATAGAT TTCTCTGTTT TGAGTTGTGA TCCACCTCAT CTGAAAGTTC 6377 .......... .......... .......... .......... .......... .......... 361 TAAAATTTTG TATGAGGTGA CACCCTATGT TGCTCGGACT TTTCAACAGT ATCATCGGTG 6317 .......... .......... .......... .......... .......... .......... 361 CGTGTCCGAT TCTTCAAAAG TGGTGCATTT TTGCAGAATT TGACACCGGT GAGGCATCTA 6257 .......... .......... .......... .......... .......... .......... 361 AAGTGAGGGG TCCGCACAAC TTACAACTCT TTGGGCATGG GATTAGAACG AAATCCATCT 6197 .......... .......... .......... .......... .......... .......... 361 CTAATGAACG GACACCCAAT GTCGGTTTAC TCTGTTTTGT TACTCCATGC TTCACATGTC 6137 .......... .......... .......... .......... .......... .......... 361 TAGTACTCAG CATGGTGGCG AAGTGTTAGT CCCATATAAG ATAAGTGTAT TTGTGATTAT 6077 .......... .......... .......... .......... .......... .......... 361 ATTCGCTTAA AGAAAATCAC TACTTGAGCT AATTTTTGGA ATTATATCAG GCCTATGTCC 6017 .......... .......... .......... .......... .......... .......... 361 TTTTTGCTCA TCACTAAGCT TATGTTTGAT CTCAAACTAT TAATGCAGGC AATTCTAGAT 5957 || |||||||||| .......... .......... .......... .......... ........GC AATTCTAGAT 373 TTTTTCAAGA AGTTCACTGA TTCAGTTGGT TCTGATCTGC AATTTGTTAT TGACGATATA 5897 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTTTCAAGA AGTTCACTGA TTCAGTTGGT TCTGATCTGC AATTTGTTAT TGACGATATA 433 TCCAAGGAGG ATTCATCAGC TGTTGGAGTC ACATGGCACT TGGGTATGGA AAAACTTGTA 5837 |||||||||| |||||||||| |||||||||| |||||||||| ||| TCCAAGGAGG ATTCATCAGC TGTTGGAGTC ACATGGCACT TGG....... .......... 476 ATACTTTATC TTGTTTTCCA TACTGTTTAC TAGGAAGATT GATCTTCATT GTCCAGGGTA 5777 .......... .......... .......... .......... .......... .......... 476 ATTGATTTTG AATATGATAA CAGAATGGAG GGGAAGACCT TTTCCTTTTA GCAAAGGATG 5717 ||||||| |||||||||| |||||||||| |||||||||| .......... .......... ...AATGGAG GGGAAGACCT TTTCCTTTTA GCAAAGGATG 513 CAGCTTTTAT CGATTGGAAG TGGTGAATGG CCAGATGAAA ATACTGTTCG TAAAACAATA 5657 |||||||||| |||||||||| |||||||||| |||||||||| ||||| CAGCTTTTAT CGATTGGAAG TGGTGAATGG CCAGATGAAA ATACT..... .......... 558 TCCCAACCCA CTAAATACTG GATGTGTTTT TATATTTTAT TATCTGCTGA ACACTTCAAT 5597 .......... .......... .......... .......... .......... .......... 558 TTTATCTAAT TGGTCGAACA AATTACTTCA TTGAAGAAGA TCGAGAAGCA CAATTATATG 5537 .......... .......... .......... .......... .......... .......... 558 TTCATTGTAG TTAGCATAGA GGATTGTTCA ATATTCCATA TTTTTAAACT ATAATCAATA 5477 .......... .......... .......... .......... .......... .......... 558 AAGGGTGTGG CCTAGTAGTC AATGACGTGG ATTGAGAACC ATGAGATCAT AGGTATAAAT 5417 .......... .......... .......... .......... .......... .......... 558 TCCAGCAGAG GCAAAAAACA CTGGCTGATC TATTGTGTCC ATGGCTGGTG GGATGTGACT 5357 .......... .......... .......... .......... .......... .......... 558 GATATCCCGT GGAGTTAGTT GAGGTGTCCG AATGTTGGCG CGGACATTAA ATTTGTCAAA 5297 .......... .......... .......... .......... .......... .......... 558 GGAAAATAAA AAATCGACCA GCAACTCCTA AACCCAAATT AATTGGATTC GTCTATATGA 5237 .......... .......... .......... .......... .......... .......... 558 ATCTTTCGTA TCTCTTTTTC TCTATTCAGT GTGTGCTGAA GTGAACATAT TTGAATAAAC 5177 .......... .......... .......... .......... .......... .......... 558 TCAATATTTT GAGAATTCAT TAGTCAGTTT ATTACATTGA CATCAATCTA CCATAACCTT 5117 .......... .......... .......... .......... .......... .......... 558 TTACTTTTAT TGCGCTTGAA ACTGACCATA CTTTGCTAAG TTAACATAGA TGTAGGTGTG 5057 .......... .......... .......... .......... .......... .......... 558 GAGCATTAAC TTAATTTATA AATACAGAAT TTAACTTGCT TGCTAGTTAT GAGCATCATC 4997 .......... .......... .......... .......... .......... .......... 558 TTTTAGCTTG ATTTTACATT GTTCCATAAG GATCTTCATG TAGTTATAAC TTATAATCCT 4937 .......... .......... .......... .......... .......... .......... 558 TGTGTTAGCC AACTATTAGA AGCACAAGAT TGTTGGTTCC CATTCTCTAT TTGATCAGGT 4877 .......... .......... .......... .......... .......... .......... 558 TTACACTTAT ATTAATATAT ATGAGATAGA TTTGATGACC TAGATCCTGC TCCTGATCTG 4817 .......... .......... .......... .......... .......... .......... 558 TACACTATTA AGGCTGTGAC TTTTGTTAAT ACATGATGAC AAGGCTCTTC AGAACTCCGG 4757 .......... .......... .......... .......... .......... .......... 558 ACTAAAAAAA GTCAGTAACT AGTACAGAGA GAATTTGTAA AGATAATTAA CTCTTTATAG 4697 .......... .......... .......... .......... .......... .......... 558 ATGAAGGGTA AATTATTAAT TTCTCTATCC TTTCAGTGTT TATTGTATTT GGCTGTAGAA 4637 .......... .......... .......... .......... .......... .......... 558 AATGACTTGT AGATTTTGTA TGTTCTCTTG TAAGGTCCTG GGTTTTTCTT CTTGGTTTTG 4577 .......... .......... .......... .......... .......... .......... 558 AAATTTCTCA GCAGCTCTCT TTGGGAGAAG GGTCTTGATA ATCACTTGCA ACCAAAAAAG 4517 .......... .......... .......... .......... .......... .......... 558 ACTACCATTC TGTCTTAGGC TTACATTTTA AGAGCAAAAA AAAGTTTTGA GTGTAGACTA 4457 .......... .......... .......... .......... .......... .......... 558 AATGTTAATT TTCTTAAGCA GTTATGGCAG AGACAGTGTG GAACCTGCAG TCAAGCCGGG 4397 ||||||||| |||||||||| |||||||||| |||||||||| .......... .......... .TTATGGCAG AGACAGTGTG GAACCTGCAG TCAAGCCGGG 597 GGAGACGGCA TTG 4384 |||||||||| ||| GGAGACGGCA TTG 610 hqPGS_C06HBa0153O03.1-10-_SGN-E374433+ (9616 9256,5968 5854,5753 5672,4435 4384) ******************************************************************************** EST sequence 8 +strand 604 n (File: SGN-E320389+) 1 TTACTTCATC CTCTGATTCT GTAACTGATT TAGAGTCGGC GGCGAGTGTG GTGAGGAAAT 61 TCTATGCCGG AATAAATAGG CGGGATTTGG ACTCTGTCGA AGAACTTATT GCTGAGGATT 121 GTGTGTATGA AGACCTTGTA TTTCCTCAAC CTTTCGTTGG CCGTAAGGCA ATTCTAGATT 181 TTTTCAAGAA GTTCACTGAT TCAGTTGGTT CTGATCTGCA ATTTGTTATT GACGATATAT 241 CCAAGGAGGA TTCATCAGCT GTTGGAGTCA CATGGCACTT GGAATGGAGG GGAAGACCTT 301 TTCCTTTTAG CAAAGGATGC AGCTTTTATC GATTGGAAGT GGTGAATGGC CAGATGAAAA 361 TACTTTATGG CAGAGACAGT GTGGAACCTG CAGTCAAGCC GGGGGAGACG GCATTGGTTG 421 CGATAAGAGG CGTGGCATGG CTGTTGCAAA AATTTCCCCA ATTGGCAGAT CGGTTGTGAT 481 TGAAGAAGTA TGATTGGTCT TATAAGATTG AAAGTTATTT ATGAGGGGAA AAAAAATCAG 541 TATATACATG TCATTACTTG TAATGTAAGC CTGAAGGCTT TACTAGTATG TACATAATGA 601 ATTT Predicted gene structure (within gDNA segment 10022 to 1912): Exon 1 9422 9256 ( 167 n); cDNA 1 167 ( 167 n); score: 1.000 Intron 1 9255 5969 (3287 n); Pd: 0.992 (s: 1.00), Pa: 0.980 (s: 1.00) Exon 2 5968 5854 ( 115 n); cDNA 168 282 ( 115 n); score: 1.000 Intron 2 5853 5754 ( 100 n); Pd: 0.971 (s: 1.00), Pa: 0.967 (s: 1.00) Exon 3 5753 5672 ( 82 n); cDNA 283 364 ( 82 n); score: 1.000 Intron 3 5671 4436 (1236 n); Pd: 0.927 (s: 1.00), Pa: 0.985 (s: 1.00) Exon 4 4435 4384 ( 52 n); cDNA 365 416 ( 52 n); score: 1.000 MATCH C06HBa0153O03.1-10- SGN-E320389+ 1.000 416 0.689 C PGS_C06HBa0153O03.1-10-_SGN-E320389+ (9422 9256,5968 5854,5753 5672,4435 4384) Alignment (genomic DNA sequence = upper lines): TTACTTCATC CTCTGATTCT GTAACTGATT TAGAGTCGGC GGCGAGTGTG GTGAGGAAAT 9363 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTACTTCATC CTCTGATTCT GTAACTGATT TAGAGTCGGC GGCGAGTGTG GTGAGGAAAT 60 TCTATGCCGG AATAAATAGG CGGGATTTGG ACTCTGTCGA AGAACTTATT GCTGAGGATT 9303 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCTATGCCGG AATAAATAGG CGGGATTTGG ACTCTGTCGA AGAACTTATT GCTGAGGATT 120 GTGTGTATGA AGACCTTGTA TTTCCTCAAC CTTTCGTTGG CCGTAAGGTT AGTTATCAAC 9243 |||||||||| |||||||||| |||||||||| |||||||||| ||||||| GTGTGTATGA AGACCTTGTA TTTCCTCAAC CTTTCGTTGG CCGTAAG... .......... 167 CTGGACTAGT TGTATTTTGA AGACTTTTTT CTGTTGCATT TTGTTTGACT ATTTTCTCTC 9183 .......... .......... .......... .......... .......... .......... 167 TGGTATATTT AGTTTAAGAT GTGAGAAAAG TTCAAACTAA GCTAGGCTTT TCATTCAATA 9123 .......... .......... .......... .......... .......... .......... 167 AAACTAAAGA TGAGCTGAAA CCCTTTGAAA AGAGGCCAAG ATTAAACTGA AAATGGAAAT 9063 .......... .......... .......... .......... .......... .......... 167 TACATTTGGG GATAATAACC AAGTTCGTAC TTTCTTCATC CCATTTTATG TGACACCCTT 9003 .......... .......... .......... .......... .......... .......... 167 AGATATTTGT GTGTTTGAGA TTCATTTCAT TAAAGATAAA AGGAAAACTT AAAAGTTAAA 8943 .......... .......... .......... .......... .......... .......... 167 TTGTTGCTAA TCATAGTAAG GTAATACTTC ATTTTGAGAC GAACTAAAAA GGAAATGCTC 8883 .......... .......... .......... .......... .......... .......... 167 TTACATAAAA TGGAAGAGAG GGAGTAATTA AATTTATAGG GATTTTGCCA AGAGAAACAT 8823 .......... .......... .......... .......... .......... .......... 167 CTCCATTTCA AAAGATATGA TCCTACTTTC ATCAAATTAA AAAGTCCGCT GGTTGATGTA 8763 .......... .......... .......... .......... .......... .......... 167 AAGATTAACA CCGCCCATGT TGCATAGCAT GGAGCTGAAT GAATTCACAT TCAAATTTAC 8703 .......... .......... .......... .......... .......... .......... 167 GTGAGATCAA AATAACATCA AAGTGTTCGA TGAATGTTTT GGGAACTTCA ACTGCAAACA 8643 .......... .......... .......... .......... .......... .......... 167 AATGTCCAAG AGCAATGGTT ACTGCAAATT TATCAGACTC TCTACTACTC TTAGAAGTGT 8583 .......... .......... .......... .......... .......... .......... 167 AAAAAGTATG GAGGAATGCA AAATCATCAT CATTATATAG TGCAATATTT GGGCCCTCTC 8523 .......... .......... .......... .......... .......... .......... 167 CAGTTACAGC TACAACATCA ACTTGAATGA TTCGCACACA TACAACAACA GTGAAACAAC 8463 .......... .......... .......... .......... .......... .......... 167 CAACATTTTG CAAATGTCAG AATAACCAAT AATAACCTCC TGATGCCCAT CATAGTTCTT 8403 .......... .......... .......... .......... .......... .......... 167 TATTATGTGC TCCCTACCTA GTGGCAGAAT CAGGATTTTC ATTAAGGGGT TCGATGGTTG 8343 .......... .......... .......... .......... .......... .......... 167 TAGTAAAACG CAATATTTAT TCCCTCTCCA GTTACAGCTA CAACATCAAC TTGAATGATT 8283 .......... .......... .......... .......... .......... .......... 167 TGCACTCCTG CAACAACAAT GAAACAACCA ATATTTGCGA ATGTTAGAAT AACCAATTAT 8223 .......... .......... .......... .......... .......... .......... 167 AACCTCCTGA TGCCCATCAT AGTTTTTTAT TATGCGTTCC CTATACCAAT TAATTGGTAG 8163 .......... .......... .......... .......... .......... .......... 167 TAATCTTCTG ACTACCCAAC CAGCCAGTGG TGGAACCAGG ATGTTTAGTA GGGCTTGAAC 8103 .......... .......... .......... .......... .......... .......... 167 ACGTAACCTC ATGGAATTTT CTTATGCCCT TAACCAATAA ACTAAATCTT CAACTTGTTT 8043 .......... .......... .......... .......... .......... .......... 167 CAAGAGGTGT CAATACTTGT ATATATATTT ACTAAACCAA AATATGACTT CTATATACAA 7983 .......... .......... .......... .......... .......... .......... 167 TGTAACTTTC TGACGAAGGG GTTTCGCTTC ACACCTCTTG GCCAAGGGTG GGTGCGCCCC 7923 .......... .......... .......... .......... .......... .......... 167 TGGATCCAGC TTGTCTCATG TCCTACATGG GCTCAAATAG AGGAACTAGA TGCTGTGTTC 7863 .......... .......... .......... .......... .......... .......... 167 AACCAGGAAA GGGCCTGCTT ATCTCCCCAT TAATAATAAG ATGTGCATCC TTTTGTAAAA 7803 .......... .......... .......... .......... .......... .......... 167 TCTCTACATC TAGTTCATCC AAGGCATTTG GAACGGTGGA AATAACATAA GCTCCAACTG 7743 .......... .......... .......... .......... .......... .......... 167 AATCATTTCC TAATTCAGTA TCTGTTTTTT TTTTTTTAAT GAAAATCTGC TTTGAAGGTA 7683 .......... .......... .......... .......... .......... .......... 167 TTCTAGACTT CTTTGTTGTC AGGCAGAACC ACTGTAGTAG GAAGAACCCA AGGTCTTTGC 7623 .......... .......... .......... .......... .......... .......... 167 CTTTTTATAT CTTTAGTTAG AAACTCCAGT GTTTGCTCTT CATCCCAATT ACCATATTTA 7563 .......... .......... .......... .......... .......... .......... 167 GGTATTTTGT GATTATATCA ATTGCCTATG ACCAGGACAC CTAGACATGT GAATATAAAC 7503 .......... .......... .......... .......... .......... .......... 167 AAGTGATTCA TCACTAAAAT ATAGCTGAAG TTTGAGAAAC ATGAAAATTG ACTCTCAGAA 7443 .......... .......... .......... .......... .......... .......... 167 ACTCAAACCC GATCAAGTAT CTCCCAACTG AATCTGAAAG CTTCCTAATT GCCTCAAGTA 7383 .......... .......... .......... .......... .......... .......... 167 TTGAACAAAA CCCACACCAG CCTTCTCTGA AGTTTACCAT ATGTGCTCAT AACAGAAACT 7323 .......... .......... .......... .......... .......... .......... 167 ATTAGGAAGA GATGCTTCAA GTTTCTTGCG GTTCATACTT TTCTATAAAG AATTTCAAAG 7263 .......... .......... .......... .......... .......... .......... 167 TATTGTTCAA TAACAACAAT CATCCTTGAG TCAACATGAT AGAAATCACA ACTTCTTTGT 7203 .......... .......... .......... .......... .......... .......... 167 CTTAGCTGCT CATGTGACAT AGCATAGGGC CAAGGACCAA GATCTCATCT TGCCACCTCA 7143 .......... .......... .......... .......... .......... .......... 167 ATAACTCAGG GCTTCAGTGT TTCTCCACCT TTAGTAGAGC TATTTCACAA TTTTTTCGCC 7083 .......... .......... .......... .......... .......... .......... 167 ACAGTGCACT TGCCCATTAA AATCTTGACG AGGATACTCA ATATGAAAAA GGCAAACTGT 7023 .......... .......... .......... .......... .......... .......... 167 CAGCTAGCTT CCACAACTCC ATCATTCCTA CATCAATGAG TATTATAGCA GGAAGAAAAT 6963 .......... .......... .......... .......... .......... .......... 167 TATGGCCCAA ACTATGAACA CAAACAACTT TTACAAACTG CATTTGGAGA TTTGGAATCA 6903 .......... .......... .......... .......... .......... .......... 167 CATCTTTGTA AAAAGTCATT TTGTTTCAAT GTCCATCTCA GTATTGGAAT TATCACTATT 6843 .......... .......... .......... .......... .......... .......... 167 AATAGTAATT CTATTCTCCT CTTTGTTGTT GTTCAAGCCT TTTCCATAAT AGAGAAATCA 6783 .......... .......... .......... .......... .......... .......... 167 CCTGAAAGCC CAGTTTTCTT CTTAGCCTTG TTACTGCTTC CATCCTCTTA TTTTTTCACC 6723 .......... .......... .......... .......... .......... .......... 167 TTCTTGCACC TCTTTAAAGT TCCATTTACT ATAAATTGAA GAACCAGTTC TCCATCATCA 6663 .......... .......... .......... .......... .......... .......... 167 ACAACCTTAT TGGCGTCAGA CAAACTATTC TCGATGCTCG ATAATTCTAA TTCCTTGGGT 6603 .......... .......... .......... .......... .......... .......... 167 TTGATGCACT TGAGCTTATG GTCTTCCAAA AACAATCCCT CTACTTCCAT GAGGTAGTGG 6543 .......... .......... .......... .......... .......... .......... 167 TAAGGTCTGC TACACTCTAC CCTCCTGAGA CCCTACTTAG TGCGATTTCT CTGGATATGT 6483 .......... .......... .......... .......... .......... .......... 167 TGTTGTATCT GTTTGCTTTT CGTGAGTGAA GAATTTGGCC GTAGAGTTCA TTATCTTTGT 6423 .......... .......... .......... .......... .......... .......... 167 ATAGATTTCT CTGTTTTGAG TTGTGATCCA CCTCATCTGA AAGTTCTAAA ATTTTGTATG 6363 .......... .......... .......... .......... .......... .......... 167 AGGTGACACC CTATGTTGCT CGGACTTTTC AACAGTATCA TCGGTGCGTG TCCGATTCTT 6303 .......... .......... .......... .......... .......... .......... 167 CAAAAGTGGT GCATTTTTGC AGAATTTGAC ACCGGTGAGG CATCTAAAGT GAGGGGTCCG 6243 .......... .......... .......... .......... .......... .......... 167 CACAACTTAC AACTCTTTGG GCATGGGATT AGAACGAAAT CCATCTCTAA TGAACGGACA 6183 .......... .......... .......... .......... .......... .......... 167 CCCAATGTCG GTTTACTCTG TTTTGTTACT CCATGCTTCA CATGTCTAGT ACTCAGCATG 6123 .......... .......... .......... .......... .......... .......... 167 GTGGCGAAGT GTTAGTCCCA TATAAGATAA GTGTATTTGT GATTATATTC GCTTAAAGAA 6063 .......... .......... .......... .......... .......... .......... 167 AATCACTACT TGAGCTAATT TTTGGAATTA TATCAGGCCT ATGTCCTTTT TGCTCATCAC 6003 .......... .......... .......... .......... .......... .......... 167 TAAGCTTATG TTTGATCTCA AACTATTAAT GCAGGCAATT CTAGATTTTT TCAAGAAGTT 5943 |||||| |||||||||| |||||||||| .......... .......... .......... ....GCAATT CTAGATTTTT TCAAGAAGTT 193 CACTGATTCA GTTGGTTCTG ATCTGCAATT TGTTATTGAC GATATATCCA AGGAGGATTC 5883 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CACTGATTCA GTTGGTTCTG ATCTGCAATT TGTTATTGAC GATATATCCA AGGAGGATTC 253 ATCAGCTGTT GGAGTCACAT GGCACTTGGG TATGGAAAAA CTTGTAATAC TTTATCTTGT 5823 |||||||||| |||||||||| ||||||||| ATCAGCTGTT GGAGTCACAT GGCACTTGG. .......... .......... .......... 282 TTTCCATACT GTTTACTAGG AAGATTGATC TTCATTGTCC AGGGTAATTG ATTTTGAATA 5763 .......... .......... .......... .......... .......... .......... 282 TGATAACAGA ATGGAGGGGA AGACCTTTTC CTTTTAGCAA AGGATGCAGC TTTTATCGAT 5703 | |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| .........A ATGGAGGGGA AGACCTTTTC CTTTTAGCAA AGGATGCAGC TTTTATCGAT 333 TGGAAGTGGT GAATGGCCAG ATGAAAATAC TGTTCGTAAA ACAATATCCC AACCCACTAA 5643 |||||||||| |||||||||| |||||||||| | TGGAAGTGGT GAATGGCCAG ATGAAAATAC T......... .......... .......... 364 ATACTGGATG TGTTTTTATA TTTTATTATC TGCTGAACAC TTCAATTTTA TCTAATTGGT 5583 .......... .......... .......... .......... .......... .......... 364 CGAACAAATT ACTTCATTGA AGAAGATCGA GAAGCACAAT TATATGTTCA TTGTAGTTAG 5523 .......... .......... .......... .......... .......... .......... 364 CATAGAGGAT TGTTCAATAT TCCATATTTT TAAACTATAA TCAATAAAGG GTGTGGCCTA 5463 .......... .......... .......... .......... .......... .......... 364 GTAGTCAATG ACGTGGATTG AGAACCATGA GATCATAGGT ATAAATTCCA GCAGAGGCAA 5403 .......... .......... .......... .......... .......... .......... 364 AAAACACTGG CTGATCTATT GTGTCCATGG CTGGTGGGAT GTGACTGATA TCCCGTGGAG 5343 .......... .......... .......... .......... .......... .......... 364 TTAGTTGAGG TGTCCGAATG TTGGCGCGGA CATTAAATTT GTCAAAGGAA AATAAAAAAT 5283 .......... .......... .......... .......... .......... .......... 364 CGACCAGCAA CTCCTAAACC CAAATTAATT GGATTCGTCT ATATGAATCT TTCGTATCTC 5223 .......... .......... .......... .......... .......... .......... 364 TTTTTCTCTA TTCAGTGTGT GCTGAAGTGA ACATATTTGA ATAAACTCAA TATTTTGAGA 5163 .......... .......... .......... .......... .......... .......... 364 ATTCATTAGT CAGTTTATTA CATTGACATC AATCTACCAT AACCTTTTAC TTTTATTGCG 5103 .......... .......... .......... .......... .......... .......... 364 CTTGAAACTG ACCATACTTT GCTAAGTTAA CATAGATGTA GGTGTGGAGC ATTAACTTAA 5043 .......... .......... .......... .......... .......... .......... 364 TTTATAAATA CAGAATTTAA CTTGCTTGCT AGTTATGAGC ATCATCTTTT AGCTTGATTT 4983 .......... .......... .......... .......... .......... .......... 364 TACATTGTTC CATAAGGATC TTCATGTAGT TATAACTTAT AATCCTTGTG TTAGCCAACT 4923 .......... .......... .......... .......... .......... .......... 364 ATTAGAAGCA CAAGATTGTT GGTTCCCATT CTCTATTTGA TCAGGTTTAC ACTTATATTA 4863 .......... .......... .......... .......... .......... .......... 364 ATATATATGA GATAGATTTG ATGACCTAGA TCCTGCTCCT GATCTGTACA CTATTAAGGC 4803 .......... .......... .......... .......... .......... .......... 364 TGTGACTTTT GTTAATACAT GATGACAAGG CTCTTCAGAA CTCCGGACTA AAAAAAGTCA 4743 .......... .......... .......... .......... .......... .......... 364 GTAACTAGTA CAGAGAGAAT TTGTAAAGAT AATTAACTCT TTATAGATGA AGGGTAAATT 4683 .......... .......... .......... .......... .......... .......... 364 ATTAATTTCT CTATCCTTTC AGTGTTTATT GTATTTGGCT GTAGAAAATG ACTTGTAGAT 4623 .......... .......... .......... .......... .......... .......... 364 TTTGTATGTT CTCTTGTAAG GTCCTGGGTT TTTCTTCTTG GTTTTGAAAT TTCTCAGCAG 4563 .......... .......... .......... .......... .......... .......... 364 CTCTCTTTGG GAGAAGGGTC TTGATAATCA CTTGCAACCA AAAAAGACTA CCATTCTGTC 4503 .......... .......... .......... .......... .......... .......... 364 TTAGGCTTAC ATTTTAAGAG CAAAAAAAAG TTTTGAGTGT AGACTAAATG TTAATTTTCT 4443 .......... .......... .......... .......... .......... .......... 364 TAAGCAGTTA TGGCAGAGAC AGTGTGGAAC CTGCAGTCAA GCCGGGGGAG ACGGCATTG 4384 ||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| .......TTA TGGCAGAGAC AGTGTGGAAC CTGCAGTCAA GCCGGGGGAG ACGGCATTG 416 hqPGS_C06HBa0153O03.1-10-_SGN-E320389+ (9422 9256,5968 5854,5753 5672,4435 4384) ******************************************************************************** EST sequence 4 +strand 459 n (File: SGN-E340888+) 1 GCAGCTTTTA TCGATTGGAA GTGGTGAATG GCCAGATGAA AATACTTTAT GGCAGAGACA 61 GTGTGGAACC TGCAGTCAAG CCGGGGGAGA CGGCATTGGT TGCGATAAGA GGCGTGGCAT 121 GGCTGTTGCA AAAATTTCCC CAATTGGCAG ATCGGTTGTG ATTGAAGAAG TATGATTGGT 181 CTTATAAGAT TGAAGTTAAT TTATGAGGGG AAAAAAAATC AGTATATACA TGTCATTACT 241 TGTAATGTAA GCCTGAAAGG CTTTACTAGT ATGTACATAA TGAATTTGTG CATAGTTACT 301 TGATAAAAAT ATTCATTTCC ACCTTGTATG TCATGGTGAA TAAATTAAAT TTTACTATTC 361 AGATTTACAG TAATATTTNT AGGGAGAAGT GCACANNNNA NAAAAAAAAA AAAAAAAAAA 421 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAA Predicted gene structure (within gDNA segment 6317 to 1): Exon 1 5717 5672 ( 46 n); cDNA 1 46 ( 46 n); score: 1.000 Intron 1 5671 4436 (1236 n); Pd: 0.927 (s: 1.00), Pa: 0.985 (s: 1.00) Exon 2 4435 4384 ( 52 n); cDNA 47 98 ( 52 n); score: 1.000 PPA cDNA 409 459 MATCH C06HBa0153O03.1-10- SGN-E340888+ 1.000 98 0.214 C PGS_C06HBa0153O03.1-10-_SGN-E340888+ (5717 5672,4435 4384) Alignment (genomic DNA sequence = upper lines): GCAGCTTTTA TCGATTGGAA GTGGTGAATG GCCAGATGAA AATACTGTTC GTAAAACAAT 5658 |||||||||| |||||||||| |||||||||| |||||||||| |||||| GCAGCTTTTA TCGATTGGAA GTGGTGAATG GCCAGATGAA AATACT.... .......... 46 ATCCCAACCC ACTAAATACT GGATGTGTTT TTATATTTTA TTATCTGCTG AACACTTCAA 5598 .......... .......... .......... .......... .......... .......... 46 TTTTATCTAA TTGGTCGAAC AAATTACTTC ATTGAAGAAG ATCGAGAAGC ACAATTATAT 5538 .......... .......... .......... .......... .......... .......... 46 GTTCATTGTA GTTAGCATAG AGGATTGTTC AATATTCCAT ATTTTTAAAC TATAATCAAT 5478 .......... .......... .......... .......... .......... .......... 46 AAAGGGTGTG GCCTAGTAGT CAATGACGTG GATTGAGAAC CATGAGATCA TAGGTATAAA 5418 .......... .......... .......... .......... .......... .......... 46 TTCCAGCAGA GGCAAAAAAC ACTGGCTGAT CTATTGTGTC CATGGCTGGT GGGATGTGAC 5358 .......... .......... .......... .......... .......... .......... 46 TGATATCCCG TGGAGTTAGT TGAGGTGTCC GAATGTTGGC GCGGACATTA AATTTGTCAA 5298 .......... .......... .......... .......... .......... .......... 46 AGGAAAATAA AAAATCGACC AGCAACTCCT AAACCCAAAT TAATTGGATT CGTCTATATG 5238 .......... .......... .......... .......... .......... .......... 46 AATCTTTCGT ATCTCTTTTT CTCTATTCAG TGTGTGCTGA AGTGAACATA TTTGAATAAA 5178 .......... .......... .......... .......... .......... .......... 46 CTCAATATTT TGAGAATTCA TTAGTCAGTT TATTACATTG ACATCAATCT ACCATAACCT 5118 .......... .......... .......... .......... .......... .......... 46 TTTACTTTTA TTGCGCTTGA AACTGACCAT ACTTTGCTAA GTTAACATAG ATGTAGGTGT 5058 .......... .......... .......... .......... .......... .......... 46 GGAGCATTAA CTTAATTTAT AAATACAGAA TTTAACTTGC TTGCTAGTTA TGAGCATCAT 4998 .......... .......... .......... .......... .......... .......... 46 CTTTTAGCTT GATTTTACAT TGTTCCATAA GGATCTTCAT GTAGTTATAA CTTATAATCC 4938 .......... .......... .......... .......... .......... .......... 46 TTGTGTTAGC CAACTATTAG AAGCACAAGA TTGTTGGTTC CCATTCTCTA TTTGATCAGG 4878 .......... .......... .......... .......... .......... .......... 46 TTTACACTTA TATTAATATA TATGAGATAG ATTTGATGAC CTAGATCCTG CTCCTGATCT 4818 .......... .......... .......... .......... .......... .......... 46 GTACACTATT AAGGCTGTGA CTTTTGTTAA TACATGATGA CAAGGCTCTT CAGAACTCCG 4758 .......... .......... .......... .......... .......... .......... 46 GACTAAAAAA AGTCAGTAAC TAGTACAGAG AGAATTTGTA AAGATAATTA ACTCTTTATA 4698 .......... .......... .......... .......... .......... .......... 46 GATGAAGGGT AAATTATTAA TTTCTCTATC CTTTCAGTGT TTATTGTATT TGGCTGTAGA 4638 .......... .......... .......... .......... .......... .......... 46 AAATGACTTG TAGATTTTGT ATGTTCTCTT GTAAGGTCCT GGGTTTTTCT TCTTGGTTTT 4578 .......... .......... .......... .......... .......... .......... 46 GAAATTTCTC AGCAGCTCTC TTTGGGAGAA GGGTCTTGAT AATCACTTGC AACCAAAAAA 4518 .......... .......... .......... .......... .......... .......... 46 GACTACCATT CTGTCTTAGG CTTACATTTT AAGAGCAAAA AAAAGTTTTG AGTGTAGACT 4458 .......... .......... .......... .......... .......... .......... 46 AAATGTTAAT TTTCTTAAGC AGTTATGGCA GAGACAGTGT GGAACCTGCA GTCAAGCCGG 4398 |||||||| |||||||||| |||||||||| |||||||||| .......... .......... ..TTATGGCA GAGACAGTGT GGAACCTGCA GTCAAGCCGG 84 GGGAGACGGC ATTG 4384 |||||||||| |||| GGGAGACGGC ATTG 98 hqPGS_C06HBa0153O03.1-10-_SGN-E340888+ (5717 5672,4435 4384) ******************************************************************************** EST sequence 6 +strand 626 n (File: SGN-E320286+) 1 TTAGCTGCGC CTCATAATCA CAACAGTAGC ATCAAACTGC AATTGCAGCT ATGGTGATGC 61 TTATGAGCAG CACCTCATCC TTTTCTCTCC TCCCACTCAA CTCAAACCCC TCTACATCTA 121 CCACTGCCGC TGTCGTCGGC AACTCACCCG CCGTTCTTCA TAGTGTCAAC TGTTGTACAG 181 ATTATCTATT TAGGATCGTT TCTGGCCATC GGATAAGACA AGTGGCTGTA CGGAATAGCA 241 ATCGAACGGC TGAGGTTACT TCATCCTCTG ATTCTGTAAC TGATTTAGAG TCGGCGGCGA 301 GTGTGGTGAG GAAATTCTAT GCCGGAATAA ATAGGCGGGA TTTGGACTCT GTCGAAGAAC 361 TTATTGCTGA GGATTGTGTG TATGAAGACC TTGTATTTCC TCAACCTTTC GTTGGCCGGA 421 AGGCAATTCT AGATTTTTTC AAGAAGTTCA CTGATTCAGT TGGGTCTGAT CTGCAATTTG 481 TTATTGACGA TATATCCAAG GAGGATTCAT CAGCTGTTGG AGTCACATGG CACTTGGAAT 541 GGANGGGAAG ACCTTTTCCT TTTAGCAAAG GATGCAGCTT TTATCGATTG GAAGTGGTGA 601 ATGGCCAGAT GAAAATACTT TATGGC Predicted gene structure (within gDNA segment 10277 to 4992): Exon 1 9677 9256 ( 422 n); cDNA 1 422 ( 422 n); score: 0.998 Intron 1 9255 5969 (3287 n); Pd: 0.992 (s: 0.98), Pa: 0.980 (s: 0.98) Exon 2 5968 5854 ( 115 n); cDNA 423 537 ( 115 n); score: 0.991 Intron 2 5853 5754 ( 100 n); Pd: 0.971 (s: 1.00), Pa: 0.967 (s: 0.98) Exon 3 5753 5672 ( 82 n); cDNA 538 619 ( 82 n); score: 0.988 MATCH C06HBa0153O03.1-10- SGN-E320286+ 0.995 619 0.989 C PGS_C06HBa0153O03.1-10-_SGN-E320286+ (9677 9256,5968 5854,5753 5672) Alignment (genomic DNA sequence = upper lines): TTAGCTGCGC CTCATAATCA CAACAGTAGC ATCAAACTGC AATTGCAGCT ATGGTGATGC 9618 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTAGCTGCGC CTCATAATCA CAACAGTAGC ATCAAACTGC AATTGCAGCT ATGGTGATGC 60 TTATGAGCAG CACCTCATCC TTTTCTCTCC TCCCACTCAA CTCAAACCCC TCTACATCTA 9558 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTATGAGCAG CACCTCATCC TTTTCTCTCC TCCCACTCAA CTCAAACCCC TCTACATCTA 120 CCACTGCCGC TGTCGTCGGC AACTCACCCG CCGTTCTTCA TAGTGTCAAC TGTTGTACAG 9498 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCACTGCCGC TGTCGTCGGC AACTCACCCG CCGTTCTTCA TAGTGTCAAC TGTTGTACAG 180 ATTATCTATT TAGGATCGTT TCTGGCCATC GGATAAGACA AGTGGCTGTA CGGAATAGCA 9438 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTATCTATT TAGGATCGTT TCTGGCCATC GGATAAGACA AGTGGCTGTA CGGAATAGCA 240 ATCGAACGGC TGAGGTTACT TCATCCTCTG ATTCTGTAAC TGATTTAGAG TCGGCGGCGA 9378 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATCGAACGGC TGAGGTTACT TCATCCTCTG ATTCTGTAAC TGATTTAGAG TCGGCGGCGA 300 GTGTGGTGAG GAAATTCTAT GCCGGAATAA ATAGGCGGGA TTTGGACTCT GTCGAAGAAC 9318 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTGTGGTGAG GAAATTCTAT GCCGGAATAA ATAGGCGGGA TTTGGACTCT GTCGAAGAAC 360 TTATTGCTGA GGATTGTGTG TATGAAGACC TTGTATTTCC TCAACCTTTC GTTGGCCGTA 9258 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||| | TTATTGCTGA GGATTGTGTG TATGAAGACC TTGTATTTCC TCAACCTTTC GTTGGCCGGA 420 AGGTTAGTTA TCAACCTGGA CTAGTTGTAT TTTGAAGACT TTTTTCTGTT GCATTTTGTT 9198 || AG........ .......... .......... .......... .......... .......... 422 TGACTATTTT CTCTCTGGTA TATTTAGTTT AAGATGTGAG AAAAGTTCAA ACTAAGCTAG 9138 .......... .......... .......... .......... .......... .......... 422 GCTTTTCATT CAATAAAACT AAAGATGAGC TGAAACCCTT TGAAAAGAGG CCAAGATTAA 9078 .......... .......... .......... .......... .......... .......... 422 ACTGAAAATG GAAATTACAT TTGGGGATAA TAACCAAGTT CGTACTTTCT TCATCCCATT 9018 .......... .......... .......... .......... .......... .......... 422 TTATGTGACA CCCTTAGATA TTTGTGTGTT TGAGATTCAT TTCATTAAAG ATAAAAGGAA 8958 .......... .......... .......... .......... .......... .......... 422 AACTTAAAAG TTAAATTGTT GCTAATCATA GTAAGGTAAT ACTTCATTTT GAGACGAACT 8898 .......... .......... .......... .......... .......... .......... 422 AAAAAGGAAA TGCTCTTACA TAAAATGGAA GAGAGGGAGT AATTAAATTT ATAGGGATTT 8838 .......... .......... .......... .......... .......... .......... 422 TGCCAAGAGA AACATCTCCA TTTCAAAAGA TATGATCCTA CTTTCATCAA ATTAAAAAGT 8778 .......... .......... .......... .......... .......... .......... 422 CCGCTGGTTG ATGTAAAGAT TAACACCGCC CATGTTGCAT AGCATGGAGC TGAATGAATT 8718 .......... .......... .......... .......... .......... .......... 422 CACATTCAAA TTTACGTGAG ATCAAAATAA CATCAAAGTG TTCGATGAAT GTTTTGGGAA 8658 .......... .......... .......... .......... .......... .......... 422 CTTCAACTGC AAACAAATGT CCAAGAGCAA TGGTTACTGC AAATTTATCA GACTCTCTAC 8598 .......... .......... .......... .......... .......... .......... 422 TACTCTTAGA AGTGTAAAAA GTATGGAGGA ATGCAAAATC ATCATCATTA TATAGTGCAA 8538 .......... .......... .......... .......... .......... .......... 422 TATTTGGGCC CTCTCCAGTT ACAGCTACAA CATCAACTTG AATGATTCGC ACACATACAA 8478 .......... .......... .......... .......... .......... .......... 422 CAACAGTGAA ACAACCAACA TTTTGCAAAT GTCAGAATAA CCAATAATAA CCTCCTGATG 8418 .......... .......... .......... .......... .......... .......... 422 CCCATCATAG TTCTTTATTA TGTGCTCCCT ACCTAGTGGC AGAATCAGGA TTTTCATTAA 8358 .......... .......... .......... .......... .......... .......... 422 GGGGTTCGAT GGTTGTAGTA AAACGCAATA TTTATTCCCT CTCCAGTTAC AGCTACAACA 8298 .......... .......... .......... .......... .......... .......... 422 TCAACTTGAA TGATTTGCAC TCCTGCAACA ACAATGAAAC AACCAATATT TGCGAATGTT 8238 .......... .......... .......... .......... .......... .......... 422 AGAATAACCA ATTATAACCT CCTGATGCCC ATCATAGTTT TTTATTATGC GTTCCCTATA 8178 .......... .......... .......... .......... .......... .......... 422 CCAATTAATT GGTAGTAATC TTCTGACTAC CCAACCAGCC AGTGGTGGAA CCAGGATGTT 8118 .......... .......... .......... .......... .......... .......... 422 TAGTAGGGCT TGAACACGTA ACCTCATGGA ATTTTCTTAT GCCCTTAACC AATAAACTAA 8058 .......... .......... .......... .......... .......... .......... 422 ATCTTCAACT TGTTTCAAGA GGTGTCAATA CTTGTATATA TATTTACTAA ACCAAAATAT 7998 .......... .......... .......... .......... .......... .......... 422 GACTTCTATA TACAATGTAA CTTTCTGACG AAGGGGTTTC GCTTCACACC TCTTGGCCAA 7938 .......... .......... .......... .......... .......... .......... 422 GGGTGGGTGC GCCCCTGGAT CCAGCTTGTC TCATGTCCTA CATGGGCTCA AATAGAGGAA 7878 .......... .......... .......... .......... .......... .......... 422 CTAGATGCTG TGTTCAACCA GGAAAGGGCC TGCTTATCTC CCCATTAATA ATAAGATGTG 7818 .......... .......... .......... .......... .......... .......... 422 CATCCTTTTG TAAAATCTCT ACATCTAGTT CATCCAAGGC ATTTGGAACG GTGGAAATAA 7758 .......... .......... .......... .......... .......... .......... 422 CATAAGCTCC AACTGAATCA TTTCCTAATT CAGTATCTGT TTTTTTTTTT TTAATGAAAA 7698 .......... .......... .......... .......... .......... .......... 422 TCTGCTTTGA AGGTATTCTA GACTTCTTTG TTGTCAGGCA GAACCACTGT AGTAGGAAGA 7638 .......... .......... .......... .......... .......... .......... 422 ACCCAAGGTC TTTGCCTTTT TATATCTTTA GTTAGAAACT CCAGTGTTTG CTCTTCATCC 7578 .......... .......... .......... .......... .......... .......... 422 CAATTACCAT ATTTAGGTAT TTTGTGATTA TATCAATTGC CTATGACCAG GACACCTAGA 7518 .......... .......... .......... .......... .......... .......... 422 CATGTGAATA TAAACAAGTG ATTCATCACT AAAATATAGC TGAAGTTTGA GAAACATGAA 7458 .......... .......... .......... .......... .......... .......... 422 AATTGACTCT CAGAAACTCA AACCCGATCA AGTATCTCCC AACTGAATCT GAAAGCTTCC 7398 .......... .......... .......... .......... .......... .......... 422 TAATTGCCTC AAGTATTGAA CAAAACCCAC ACCAGCCTTC TCTGAAGTTT ACCATATGTG 7338 .......... .......... .......... .......... .......... .......... 422 CTCATAACAG AAACTATTAG GAAGAGATGC TTCAAGTTTC TTGCGGTTCA TACTTTTCTA 7278 .......... .......... .......... .......... .......... .......... 422 TAAAGAATTT CAAAGTATTG TTCAATAACA ACAATCATCC TTGAGTCAAC ATGATAGAAA 7218 .......... .......... .......... .......... .......... .......... 422 TCACAACTTC TTTGTCTTAG CTGCTCATGT GACATAGCAT AGGGCCAAGG ACCAAGATCT 7158 .......... .......... .......... .......... .......... .......... 422 CATCTTGCCA CCTCAATAAC TCAGGGCTTC AGTGTTTCTC CACCTTTAGT AGAGCTATTT 7098 .......... .......... .......... .......... .......... .......... 422 CACAATTTTT TCGCCACAGT GCACTTGCCC ATTAAAATCT TGACGAGGAT ACTCAATATG 7038 .......... .......... .......... .......... .......... .......... 422 AAAAAGGCAA ACTGTCAGCT AGCTTCCACA ACTCCATCAT TCCTACATCA ATGAGTATTA 6978 .......... .......... .......... .......... .......... .......... 422 TAGCAGGAAG AAAATTATGG CCCAAACTAT GAACACAAAC AACTTTTACA AACTGCATTT 6918 .......... .......... .......... .......... .......... .......... 422 GGAGATTTGG AATCACATCT TTGTAAAAAG TCATTTTGTT TCAATGTCCA TCTCAGTATT 6858 .......... .......... .......... .......... .......... .......... 422 GGAATTATCA CTATTAATAG TAATTCTATT CTCCTCTTTG TTGTTGTTCA AGCCTTTTCC 6798 .......... .......... .......... .......... .......... .......... 422 ATAATAGAGA AATCACCTGA AAGCCCAGTT TTCTTCTTAG CCTTGTTACT GCTTCCATCC 6738 .......... .......... .......... .......... .......... .......... 422 TCTTATTTTT TCACCTTCTT GCACCTCTTT AAAGTTCCAT TTACTATAAA TTGAAGAACC 6678 .......... .......... .......... .......... .......... .......... 422 AGTTCTCCAT CATCAACAAC CTTATTGGCG TCAGACAAAC TATTCTCGAT GCTCGATAAT 6618 .......... .......... .......... .......... .......... .......... 422 TCTAATTCCT TGGGTTTGAT GCACTTGAGC TTATGGTCTT CCAAAAACAA TCCCTCTACT 6558 .......... .......... .......... .......... .......... .......... 422 TCCATGAGGT AGTGGTAAGG TCTGCTACAC TCTACCCTCC TGAGACCCTA CTTAGTGCGA 6498 .......... .......... .......... .......... .......... .......... 422 TTTCTCTGGA TATGTTGTTG TATCTGTTTG CTTTTCGTGA GTGAAGAATT TGGCCGTAGA 6438 .......... .......... .......... .......... .......... .......... 422 GTTCATTATC TTTGTATAGA TTTCTCTGTT TTGAGTTGTG ATCCACCTCA TCTGAAAGTT 6378 .......... .......... .......... .......... .......... .......... 422 CTAAAATTTT GTATGAGGTG ACACCCTATG TTGCTCGGAC TTTTCAACAG TATCATCGGT 6318 .......... .......... .......... .......... .......... .......... 422 GCGTGTCCGA TTCTTCAAAA GTGGTGCATT TTTGCAGAAT TTGACACCGG TGAGGCATCT 6258 .......... .......... .......... .......... .......... .......... 422 AAAGTGAGGG GTCCGCACAA CTTACAACTC TTTGGGCATG GGATTAGAAC GAAATCCATC 6198 .......... .......... .......... .......... .......... .......... 422 TCTAATGAAC GGACACCCAA TGTCGGTTTA CTCTGTTTTG TTACTCCATG CTTCACATGT 6138 .......... .......... .......... .......... .......... .......... 422 CTAGTACTCA GCATGGTGGC GAAGTGTTAG TCCCATATAA GATAAGTGTA TTTGTGATTA 6078 .......... .......... .......... .......... .......... .......... 422 TATTCGCTTA AAGAAAATCA CTACTTGAGC TAATTTTTGG AATTATATCA GGCCTATGTC 6018 .......... .......... .......... .......... .......... .......... 422 CTTTTTGCTC ATCACTAAGC TTATGTTTGA TCTCAAACTA TTAATGCAGG CAATTCTAGA 5958 | |||||||||| .......... .......... .......... .......... .........G CAATTCTAGA 433 TTTTTTCAAG AAGTTCACTG ATTCAGTTGG TTCTGATCTG CAATTTGTTA TTGACGATAT 5898 |||||||||| |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| TTTTTTCAAG AAGTTCACTG ATTCAGTTGG GTCTGATCTG CAATTTGTTA TTGACGATAT 493 ATCCAAGGAG GATTCATCAG CTGTTGGAGT CACATGGCAC TTGGGTATGG AAAAACTTGT 5838 |||||||||| |||||||||| |||||||||| |||||||||| |||| ATCCAAGGAG GATTCATCAG CTGTTGGAGT CACATGGCAC TTGG...... .......... 537 AATACTTTAT CTTGTTTTCC ATACTGTTTA CTAGGAAGAT TGATCTTCAT TGTCCAGGGT 5778 .......... .......... .......... .......... .......... .......... 537 AATTGATTTT GAATATGATA ACAGAATGGA GGGGAAGACC TTTTCCTTTT AGCAAAGGAT 5718 |||||| ||||||||| |||||||||| |||||||||| .......... .......... ....AATGGA NGGGAAGACC TTTTCCTTTT AGCAAAGGAT 573 GCAGCTTTTA TCGATTGGAA GTGGTGAATG GCCAGATGAA AATACT 5672 |||||||||| |||||||||| |||||||||| |||||||||| |||||| GCAGCTTTTA TCGATTGGAA GTGGTGAATG GCCAGATGAA AATACT 619 hqPGS_C06HBa0153O03.1-10-_SGN-E320286+ (9677 9256,5968 5854,5753 5672) ******************************************************************************** EST sequence 3 +strand 562 n (File: SGN-E294549+) 1 TATGAGCAGC ACCTCATCCT TTTCTCTCCT CCCACTCAAC TCAAACCCCT CTACATCTAC 61 CACTGCCGCT GTCGTCGGCA ACTCACCCGC CGTTCTTCAT AGTGTCAACT GTTGTACAGA 121 TTATCTATTT AGGATCGTTT CTGGCCATCG GATAAGACAA GTGGCTGTAC GGAATAGCAA 181 TCGAACGGCT GAGGTTACTT CATCCTCTGA TTCTGTAACT GATTTAGAGT CGGCGGCGAG 241 TGTGGTGAGG AAATTCTATG CCGGAATAAA TAGGCGGGAT TTGGACTCTG TCGAAGAACT 301 TATTGCTGAG GATTGTGTGT ATGAAGACCT TGTATTTCCT CAACCTTTCG TTGGCCGTAA 361 GGCAATTCTA GATTTTTTCA AGAAGTTCAC TGATTCAGTT GGTTCTGATC TGCAATTTGT 421 TATTGACGAT ATATCCAAGG AGGATTCATC AGCTGTTGGA GTCACATGGC ACTTGGAATG 481 GAGGGGAAGA CCTTTTCCTT TTAGCAAAGG ATGCAGCTTT TATCGATTGG AAGTGGTGAA 541 TGGCCAGATG AAAATACTTT AT Predicted gene structure (within gDNA segment 10216 to 5022): Exon 1 9616 9256 ( 361 n); cDNA 1 361 ( 361 n); score: 1.000 Intron 1 9255 5969 (3287 n); Pd: 0.992 (s: 1.00), Pa: 0.980 (s: 1.00) Exon 2 5968 5854 ( 115 n); cDNA 362 476 ( 115 n); score: 1.000 Intron 2 5853 5754 ( 100 n); Pd: 0.971 (s: 1.00), Pa: 0.967 (s: 1.00) Exon 3 5753 5672 ( 82 n); cDNA 477 558 ( 82 n); score: 1.000 MATCH C06HBa0153O03.1-10- SGN-E294549+ 1.000 558 0.993 C PGS_C06HBa0153O03.1-10-_SGN-E294549+ (9616 9256,5968 5854,5753 5672) Alignment (genomic DNA sequence = upper lines): TATGAGCAGC ACCTCATCCT TTTCTCTCCT CCCACTCAAC TCAAACCCCT CTACATCTAC 9557 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TATGAGCAGC ACCTCATCCT TTTCTCTCCT CCCACTCAAC TCAAACCCCT CTACATCTAC 60 CACTGCCGCT GTCGTCGGCA ACTCACCCGC CGTTCTTCAT AGTGTCAACT GTTGTACAGA 9497 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CACTGCCGCT GTCGTCGGCA ACTCACCCGC CGTTCTTCAT AGTGTCAACT GTTGTACAGA 120 TTATCTATTT AGGATCGTTT CTGGCCATCG GATAAGACAA GTGGCTGTAC GGAATAGCAA 9437 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTATCTATTT AGGATCGTTT CTGGCCATCG GATAAGACAA GTGGCTGTAC GGAATAGCAA 180 TCGAACGGCT GAGGTTACTT CATCCTCTGA TTCTGTAACT GATTTAGAGT CGGCGGCGAG 9377 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCGAACGGCT GAGGTTACTT CATCCTCTGA TTCTGTAACT GATTTAGAGT CGGCGGCGAG 240 TGTGGTGAGG AAATTCTATG CCGGAATAAA TAGGCGGGAT TTGGACTCTG TCGAAGAACT 9317 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGTGGTGAGG AAATTCTATG CCGGAATAAA TAGGCGGGAT TTGGACTCTG TCGAAGAACT 300 TATTGCTGAG GATTGTGTGT ATGAAGACCT TGTATTTCCT CAACCTTTCG TTGGCCGTAA 9257 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TATTGCTGAG GATTGTGTGT ATGAAGACCT TGTATTTCCT CAACCTTTCG TTGGCCGTAA 360 GGTTAGTTAT CAACCTGGAC TAGTTGTATT TTGAAGACTT TTTTCTGTTG CATTTTGTTT 9197 | G......... .......... .......... .......... .......... .......... 361 GACTATTTTC TCTCTGGTAT ATTTAGTTTA AGATGTGAGA AAAGTTCAAA CTAAGCTAGG 9137 .......... .......... .......... .......... .......... .......... 361 CTTTTCATTC AATAAAACTA AAGATGAGCT GAAACCCTTT GAAAAGAGGC CAAGATTAAA 9077 .......... .......... .......... .......... .......... .......... 361 CTGAAAATGG AAATTACATT TGGGGATAAT AACCAAGTTC GTACTTTCTT CATCCCATTT 9017 .......... .......... .......... .......... .......... .......... 361 TATGTGACAC CCTTAGATAT TTGTGTGTTT GAGATTCATT TCATTAAAGA TAAAAGGAAA 8957 .......... .......... .......... .......... .......... .......... 361 ACTTAAAAGT TAAATTGTTG CTAATCATAG TAAGGTAATA CTTCATTTTG AGACGAACTA 8897 .......... .......... .......... .......... .......... .......... 361 AAAAGGAAAT GCTCTTACAT AAAATGGAAG AGAGGGAGTA ATTAAATTTA TAGGGATTTT 8837 .......... .......... .......... .......... .......... .......... 361 GCCAAGAGAA ACATCTCCAT TTCAAAAGAT ATGATCCTAC TTTCATCAAA TTAAAAAGTC 8777 .......... .......... .......... .......... .......... .......... 361 CGCTGGTTGA TGTAAAGATT AACACCGCCC ATGTTGCATA GCATGGAGCT GAATGAATTC 8717 .......... .......... .......... .......... .......... .......... 361 ACATTCAAAT TTACGTGAGA TCAAAATAAC ATCAAAGTGT TCGATGAATG TTTTGGGAAC 8657 .......... .......... .......... .......... .......... .......... 361 TTCAACTGCA AACAAATGTC CAAGAGCAAT GGTTACTGCA AATTTATCAG ACTCTCTACT 8597 .......... .......... .......... .......... .......... .......... 361 ACTCTTAGAA GTGTAAAAAG TATGGAGGAA TGCAAAATCA TCATCATTAT ATAGTGCAAT 8537 .......... .......... .......... .......... .......... .......... 361 ATTTGGGCCC TCTCCAGTTA CAGCTACAAC ATCAACTTGA ATGATTCGCA CACATACAAC 8477 .......... .......... .......... .......... .......... .......... 361 AACAGTGAAA CAACCAACAT TTTGCAAATG TCAGAATAAC CAATAATAAC CTCCTGATGC 8417 .......... .......... .......... .......... .......... .......... 361 CCATCATAGT TCTTTATTAT GTGCTCCCTA CCTAGTGGCA GAATCAGGAT TTTCATTAAG 8357 .......... .......... .......... .......... .......... .......... 361 GGGTTCGATG GTTGTAGTAA AACGCAATAT TTATTCCCTC TCCAGTTACA GCTACAACAT 8297 .......... .......... .......... .......... .......... .......... 361 CAACTTGAAT GATTTGCACT CCTGCAACAA CAATGAAACA ACCAATATTT GCGAATGTTA 8237 .......... .......... .......... .......... .......... .......... 361 GAATAACCAA TTATAACCTC CTGATGCCCA TCATAGTTTT TTATTATGCG TTCCCTATAC 8177 .......... .......... .......... .......... .......... .......... 361 CAATTAATTG GTAGTAATCT TCTGACTACC CAACCAGCCA GTGGTGGAAC CAGGATGTTT 8117 .......... .......... .......... .......... .......... .......... 361 AGTAGGGCTT GAACACGTAA CCTCATGGAA TTTTCTTATG CCCTTAACCA ATAAACTAAA 8057 .......... .......... .......... .......... .......... .......... 361 TCTTCAACTT GTTTCAAGAG GTGTCAATAC TTGTATATAT ATTTACTAAA CCAAAATATG 7997 .......... .......... .......... .......... .......... .......... 361 ACTTCTATAT ACAATGTAAC TTTCTGACGA AGGGGTTTCG CTTCACACCT CTTGGCCAAG 7937 .......... .......... .......... .......... .......... .......... 361 GGTGGGTGCG CCCCTGGATC CAGCTTGTCT CATGTCCTAC ATGGGCTCAA ATAGAGGAAC 7877 .......... .......... .......... .......... .......... .......... 361 TAGATGCTGT GTTCAACCAG GAAAGGGCCT GCTTATCTCC CCATTAATAA TAAGATGTGC 7817 .......... .......... .......... .......... .......... .......... 361 ATCCTTTTGT AAAATCTCTA CATCTAGTTC ATCCAAGGCA TTTGGAACGG TGGAAATAAC 7757 .......... .......... .......... .......... .......... .......... 361 ATAAGCTCCA ACTGAATCAT TTCCTAATTC AGTATCTGTT TTTTTTTTTT TAATGAAAAT 7697 .......... .......... .......... .......... .......... .......... 361 CTGCTTTGAA GGTATTCTAG ACTTCTTTGT TGTCAGGCAG AACCACTGTA GTAGGAAGAA 7637 .......... .......... .......... .......... .......... .......... 361 CCCAAGGTCT TTGCCTTTTT ATATCTTTAG TTAGAAACTC CAGTGTTTGC TCTTCATCCC 7577 .......... .......... .......... .......... .......... .......... 361 AATTACCATA TTTAGGTATT TTGTGATTAT ATCAATTGCC TATGACCAGG ACACCTAGAC 7517 .......... .......... .......... .......... .......... .......... 361 ATGTGAATAT AAACAAGTGA TTCATCACTA AAATATAGCT GAAGTTTGAG AAACATGAAA 7457 .......... .......... .......... .......... .......... .......... 361 ATTGACTCTC AGAAACTCAA ACCCGATCAA GTATCTCCCA ACTGAATCTG AAAGCTTCCT 7397 .......... .......... .......... .......... .......... .......... 361 AATTGCCTCA AGTATTGAAC AAAACCCACA CCAGCCTTCT CTGAAGTTTA CCATATGTGC 7337 .......... .......... .......... .......... .......... .......... 361 TCATAACAGA AACTATTAGG AAGAGATGCT TCAAGTTTCT TGCGGTTCAT ACTTTTCTAT 7277 .......... .......... .......... .......... .......... .......... 361 AAAGAATTTC AAAGTATTGT TCAATAACAA CAATCATCCT TGAGTCAACA TGATAGAAAT 7217 .......... .......... .......... .......... .......... .......... 361 CACAACTTCT TTGTCTTAGC TGCTCATGTG ACATAGCATA GGGCCAAGGA CCAAGATCTC 7157 .......... .......... .......... .......... .......... .......... 361 ATCTTGCCAC CTCAATAACT CAGGGCTTCA GTGTTTCTCC ACCTTTAGTA GAGCTATTTC 7097 .......... .......... .......... .......... .......... .......... 361 ACAATTTTTT CGCCACAGTG CACTTGCCCA TTAAAATCTT GACGAGGATA CTCAATATGA 7037 .......... .......... .......... .......... .......... .......... 361 AAAAGGCAAA CTGTCAGCTA GCTTCCACAA CTCCATCATT CCTACATCAA TGAGTATTAT 6977 .......... .......... .......... .......... .......... .......... 361 AGCAGGAAGA AAATTATGGC CCAAACTATG AACACAAACA ACTTTTACAA ACTGCATTTG 6917 .......... .......... .......... .......... .......... .......... 361 GAGATTTGGA ATCACATCTT TGTAAAAAGT CATTTTGTTT CAATGTCCAT CTCAGTATTG 6857 .......... .......... .......... .......... .......... .......... 361 GAATTATCAC TATTAATAGT AATTCTATTC TCCTCTTTGT TGTTGTTCAA GCCTTTTCCA 6797 .......... .......... .......... .......... .......... .......... 361 TAATAGAGAA ATCACCTGAA AGCCCAGTTT TCTTCTTAGC CTTGTTACTG CTTCCATCCT 6737 .......... .......... .......... .......... .......... .......... 361 CTTATTTTTT CACCTTCTTG CACCTCTTTA AAGTTCCATT TACTATAAAT TGAAGAACCA 6677 .......... .......... .......... .......... .......... .......... 361 GTTCTCCATC ATCAACAACC TTATTGGCGT CAGACAAACT ATTCTCGATG CTCGATAATT 6617 .......... .......... .......... .......... .......... .......... 361 CTAATTCCTT GGGTTTGATG CACTTGAGCT TATGGTCTTC CAAAAACAAT CCCTCTACTT 6557 .......... .......... .......... .......... .......... .......... 361 CCATGAGGTA GTGGTAAGGT CTGCTACACT CTACCCTCCT GAGACCCTAC TTAGTGCGAT 6497 .......... .......... .......... .......... .......... .......... 361 TTCTCTGGAT ATGTTGTTGT ATCTGTTTGC TTTTCGTGAG TGAAGAATTT GGCCGTAGAG 6437 .......... .......... .......... .......... .......... .......... 361 TTCATTATCT TTGTATAGAT TTCTCTGTTT TGAGTTGTGA TCCACCTCAT CTGAAAGTTC 6377 .......... .......... .......... .......... .......... .......... 361 TAAAATTTTG TATGAGGTGA CACCCTATGT TGCTCGGACT TTTCAACAGT ATCATCGGTG 6317 .......... .......... .......... .......... .......... .......... 361 CGTGTCCGAT TCTTCAAAAG TGGTGCATTT TTGCAGAATT TGACACCGGT GAGGCATCTA 6257 .......... .......... .......... .......... .......... .......... 361 AAGTGAGGGG TCCGCACAAC TTACAACTCT TTGGGCATGG GATTAGAACG AAATCCATCT 6197 .......... .......... .......... .......... .......... .......... 361 CTAATGAACG GACACCCAAT GTCGGTTTAC TCTGTTTTGT TACTCCATGC TTCACATGTC 6137 .......... .......... .......... .......... .......... .......... 361 TAGTACTCAG CATGGTGGCG AAGTGTTAGT CCCATATAAG ATAAGTGTAT TTGTGATTAT 6077 .......... .......... .......... .......... .......... .......... 361 ATTCGCTTAA AGAAAATCAC TACTTGAGCT AATTTTTGGA ATTATATCAG GCCTATGTCC 6017 .......... .......... .......... .......... .......... .......... 361 TTTTTGCTCA TCACTAAGCT TATGTTTGAT CTCAAACTAT TAATGCAGGC AATTCTAGAT 5957 || |||||||||| .......... .......... .......... .......... ........GC AATTCTAGAT 373 TTTTTCAAGA AGTTCACTGA TTCAGTTGGT TCTGATCTGC AATTTGTTAT TGACGATATA 5897 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTTTCAAGA AGTTCACTGA TTCAGTTGGT TCTGATCTGC AATTTGTTAT TGACGATATA 433 TCCAAGGAGG ATTCATCAGC TGTTGGAGTC ACATGGCACT TGGGTATGGA AAAACTTGTA 5837 |||||||||| |||||||||| |||||||||| |||||||||| ||| TCCAAGGAGG ATTCATCAGC TGTTGGAGTC ACATGGCACT TGG....... .......... 476 ATACTTTATC TTGTTTTCCA TACTGTTTAC TAGGAAGATT GATCTTCATT GTCCAGGGTA 5777 .......... .......... .......... .......... .......... .......... 476 ATTGATTTTG AATATGATAA CAGAATGGAG GGGAAGACCT TTTCCTTTTA GCAAAGGATG 5717 ||||||| |||||||||| |||||||||| |||||||||| .......... .......... ...AATGGAG GGGAAGACCT TTTCCTTTTA GCAAAGGATG 513 CAGCTTTTAT CGATTGGAAG TGGTGAATGG CCAGATGAAA ATACT 5672 |||||||||| |||||||||| |||||||||| |||||||||| ||||| CAGCTTTTAT CGATTGGAAG TGGTGAATGG CCAGATGAAA ATACT 558 hqPGS_C06HBa0153O03.1-10-_SGN-E294549+ (9616 9256,5968 5854,5753 5672) ******************************************************************************** EST sequence 2 +strand 504 n (File: SGN-E250902+) 1 CTCACCCACC GTTCTTCATA GCGTCAACTG TAGTACAGAT TATCTATCTC GGATCGCTTC 61 TGGCCATCGG ATAATACAAG TGGCTGTACG GAATAGCAAT CGAACGGCTG AAGATACTTC 121 ATCCTCTGAT TCTGTCACTG ATTTAAAGTC TGCGGCTAGT GTGGTGAGGA AATTCTATGC 181 CGGAATAAAT AGGCGGGATT TGGACTCTGT CGAAGAACTT ATTGCTGAGG ATTGTGTGTA 241 TGAAGACCTT GTATTTCCTC AACCTTTCGT TGGCCGTAAG GCAATTCTAG ATTTTTTCAA 301 TAAGTTCACT GATTCAGTTG GTTCTGATCT GCAATTAGTT ATTGACGATA TATCCGACGA 361 GGATTCATCA GCTGTTGGAG TCACATGGCA CTTGGAATGG AGGGGAAGAC CTTTTCCTTT 421 TAGCAAAAGA TGCAGCTTTT ATCGATTGGA AGTGGTGAAT GGCCAGATGA AAATACTTTA 481 TGGCAGAGAC AGTGTGGAAC CTGC Predicted gene structure (within gDNA segment 10648 to 4792): Exon 1 9535 9256 ( 280 n); cDNA 1 280 ( 280 n); score: 0.954 Intron 1 9255 5969 (3287 n); Pd: 0.992 (s: 1.00), Pa: 0.980 (s: 0.98) Exon 2 5968 5854 ( 115 n); cDNA 281 395 ( 115 n); score: 0.965 Intron 2 5853 5754 ( 100 n); Pd: 0.971 (s: 0.96), Pa: 0.967 (s: 0.98) Exon 3 5753 5672 ( 82 n); cDNA 396 477 ( 82 n); score: 0.988 MATCH C06HBa0153O03.1-10- SGN-E250902+ 0.962 477 0.946 C PGS_C06HBa0153O03.1-10-_SGN-E250902+ (9535 9256,5968 5854,5753 5672) Alignment (genomic DNA sequence = upper lines): CTCACCCGCC GTTCTTCATA GTGTCAACTG TTGTACAGAT TATCTATTTA GGATCGTTTC 9476 ||||||| || |||||||||| | |||||||| | |||||||| ||||||| | |||||| ||| CTCACCCACC GTTCTTCATA GCGTCAACTG TAGTACAGAT TATCTATCTC GGATCGCTTC 60 TGGCCATCGG ATAAGACAAG TGGCTGTACG GAATAGCAAT CGAACGGCTG AGGTTACTTC 9416 |||||||||| |||| ||||| |||||||||| |||||||||| |||||||||| | | |||||| TGGCCATCGG ATAATACAAG TGGCTGTACG GAATAGCAAT CGAACGGCTG AAGATACTTC 120 ATCCTCTGAT TCTGTAACTG ATTTAGAGTC GGCGGCGAGT GTGGTGAGGA AATTCTATGC 9356 |||||||||| ||||| |||| ||||| |||| ||||| ||| |||||||||| |||||||||| ATCCTCTGAT TCTGTCACTG ATTTAAAGTC TGCGGCTAGT GTGGTGAGGA AATTCTATGC 180 CGGAATAAAT AGGCGGGATT TGGACTCTGT CGAAGAACTT ATTGCTGAGG ATTGTGTGTA 9296 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CGGAATAAAT AGGCGGGATT TGGACTCTGT CGAAGAACTT ATTGCTGAGG ATTGTGTGTA 240 TGAAGACCTT GTATTTCCTC AACCTTTCGT TGGCCGTAAG GTTAGTTATC AACCTGGACT 9236 |||||||||| |||||||||| |||||||||| |||||||||| TGAAGACCTT GTATTTCCTC AACCTTTCGT TGGCCGTAAG .......... .......... 280 AGTTGTATTT TGAAGACTTT TTTCTGTTGC ATTTTGTTTG ACTATTTTCT CTCTGGTATA 9176 .......... .......... .......... .......... .......... .......... 280 TTTAGTTTAA GATGTGAGAA AAGTTCAAAC TAAGCTAGGC TTTTCATTCA ATAAAACTAA 9116 .......... .......... .......... .......... .......... .......... 280 AGATGAGCTG AAACCCTTTG AAAAGAGGCC AAGATTAAAC TGAAAATGGA AATTACATTT 9056 .......... .......... .......... .......... .......... .......... 280 GGGGATAATA ACCAAGTTCG TACTTTCTTC ATCCCATTTT ATGTGACACC CTTAGATATT 8996 .......... .......... .......... .......... .......... .......... 280 TGTGTGTTTG AGATTCATTT CATTAAAGAT AAAAGGAAAA CTTAAAAGTT AAATTGTTGC 8936 .......... .......... .......... .......... .......... .......... 280 TAATCATAGT AAGGTAATAC TTCATTTTGA GACGAACTAA AAAGGAAATG CTCTTACATA 8876 .......... .......... .......... .......... .......... .......... 280 AAATGGAAGA GAGGGAGTAA TTAAATTTAT AGGGATTTTG CCAAGAGAAA CATCTCCATT 8816 .......... .......... .......... .......... .......... .......... 280 TCAAAAGATA TGATCCTACT TTCATCAAAT TAAAAAGTCC GCTGGTTGAT GTAAAGATTA 8756 .......... .......... .......... .......... .......... .......... 280 ACACCGCCCA TGTTGCATAG CATGGAGCTG AATGAATTCA CATTCAAATT TACGTGAGAT 8696 .......... .......... .......... .......... .......... .......... 280 CAAAATAACA TCAAAGTGTT CGATGAATGT TTTGGGAACT TCAACTGCAA ACAAATGTCC 8636 .......... .......... .......... .......... .......... .......... 280 AAGAGCAATG GTTACTGCAA ATTTATCAGA CTCTCTACTA CTCTTAGAAG TGTAAAAAGT 8576 .......... .......... .......... .......... .......... .......... 280 ATGGAGGAAT GCAAAATCAT CATCATTATA TAGTGCAATA TTTGGGCCCT CTCCAGTTAC 8516 .......... .......... .......... .......... .......... .......... 280 AGCTACAACA TCAACTTGAA TGATTCGCAC ACATACAACA ACAGTGAAAC AACCAACATT 8456 .......... .......... .......... .......... .......... .......... 280 TTGCAAATGT CAGAATAACC AATAATAACC TCCTGATGCC CATCATAGTT CTTTATTATG 8396 .......... .......... .......... .......... .......... .......... 280 TGCTCCCTAC CTAGTGGCAG AATCAGGATT TTCATTAAGG GGTTCGATGG TTGTAGTAAA 8336 .......... .......... .......... .......... .......... .......... 280 ACGCAATATT TATTCCCTCT CCAGTTACAG CTACAACATC AACTTGAATG ATTTGCACTC 8276 .......... .......... .......... .......... .......... .......... 280 CTGCAACAAC AATGAAACAA CCAATATTTG CGAATGTTAG AATAACCAAT TATAACCTCC 8216 .......... .......... .......... .......... .......... .......... 280 TGATGCCCAT CATAGTTTTT TATTATGCGT TCCCTATACC AATTAATTGG TAGTAATCTT 8156 .......... .......... .......... .......... .......... .......... 280 CTGACTACCC AACCAGCCAG TGGTGGAACC AGGATGTTTA GTAGGGCTTG AACACGTAAC 8096 .......... .......... .......... .......... .......... .......... 280 CTCATGGAAT TTTCTTATGC CCTTAACCAA TAAACTAAAT CTTCAACTTG TTTCAAGAGG 8036 .......... .......... .......... .......... .......... .......... 280 TGTCAATACT TGTATATATA TTTACTAAAC CAAAATATGA CTTCTATATA CAATGTAACT 7976 .......... .......... .......... .......... .......... .......... 280 TTCTGACGAA GGGGTTTCGC TTCACACCTC TTGGCCAAGG GTGGGTGCGC CCCTGGATCC 7916 .......... .......... .......... .......... .......... .......... 280 AGCTTGTCTC ATGTCCTACA TGGGCTCAAA TAGAGGAACT AGATGCTGTG TTCAACCAGG 7856 .......... .......... .......... .......... .......... .......... 280 AAAGGGCCTG CTTATCTCCC CATTAATAAT AAGATGTGCA TCCTTTTGTA AAATCTCTAC 7796 .......... .......... .......... .......... .......... .......... 280 ATCTAGTTCA TCCAAGGCAT TTGGAACGGT GGAAATAACA TAAGCTCCAA CTGAATCATT 7736 .......... .......... .......... .......... .......... .......... 280 TCCTAATTCA GTATCTGTTT TTTTTTTTTT AATGAAAATC TGCTTTGAAG GTATTCTAGA 7676 .......... .......... .......... .......... .......... .......... 280 CTTCTTTGTT GTCAGGCAGA ACCACTGTAG TAGGAAGAAC CCAAGGTCTT TGCCTTTTTA 7616 .......... .......... .......... .......... .......... .......... 280 TATCTTTAGT TAGAAACTCC AGTGTTTGCT CTTCATCCCA ATTACCATAT TTAGGTATTT 7556 .......... .......... .......... .......... .......... .......... 280 TGTGATTATA TCAATTGCCT ATGACCAGGA CACCTAGACA TGTGAATATA AACAAGTGAT 7496 .......... .......... .......... .......... .......... .......... 280 TCATCACTAA AATATAGCTG AAGTTTGAGA AACATGAAAA TTGACTCTCA GAAACTCAAA 7436 .......... .......... .......... .......... .......... .......... 280 CCCGATCAAG TATCTCCCAA CTGAATCTGA AAGCTTCCTA ATTGCCTCAA GTATTGAACA 7376 .......... .......... .......... .......... .......... .......... 280 AAACCCACAC CAGCCTTCTC TGAAGTTTAC CATATGTGCT CATAACAGAA ACTATTAGGA 7316 .......... .......... .......... .......... .......... .......... 280 AGAGATGCTT CAAGTTTCTT GCGGTTCATA CTTTTCTATA AAGAATTTCA AAGTATTGTT 7256 .......... .......... .......... .......... .......... .......... 280 CAATAACAAC AATCATCCTT GAGTCAACAT GATAGAAATC ACAACTTCTT TGTCTTAGCT 7196 .......... .......... .......... .......... .......... .......... 280 GCTCATGTGA CATAGCATAG GGCCAAGGAC CAAGATCTCA TCTTGCCACC TCAATAACTC 7136 .......... .......... .......... .......... .......... .......... 280 AGGGCTTCAG TGTTTCTCCA CCTTTAGTAG AGCTATTTCA CAATTTTTTC GCCACAGTGC 7076 .......... .......... .......... .......... .......... .......... 280 ACTTGCCCAT TAAAATCTTG ACGAGGATAC TCAATATGAA AAAGGCAAAC TGTCAGCTAG 7016 .......... .......... .......... .......... .......... .......... 280 CTTCCACAAC TCCATCATTC CTACATCAAT GAGTATTATA GCAGGAAGAA AATTATGGCC 6956 .......... .......... .......... .......... .......... .......... 280 CAAACTATGA ACACAAACAA CTTTTACAAA CTGCATTTGG AGATTTGGAA TCACATCTTT 6896 .......... .......... .......... .......... .......... .......... 280 GTAAAAAGTC ATTTTGTTTC AATGTCCATC TCAGTATTGG AATTATCACT ATTAATAGTA 6836 .......... .......... .......... .......... .......... .......... 280 ATTCTATTCT CCTCTTTGTT GTTGTTCAAG CCTTTTCCAT AATAGAGAAA TCACCTGAAA 6776 .......... .......... .......... .......... .......... .......... 280 GCCCAGTTTT CTTCTTAGCC TTGTTACTGC TTCCATCCTC TTATTTTTTC ACCTTCTTGC 6716 .......... .......... .......... .......... .......... .......... 280 ACCTCTTTAA AGTTCCATTT ACTATAAATT GAAGAACCAG TTCTCCATCA TCAACAACCT 6656 .......... .......... .......... .......... .......... .......... 280 TATTGGCGTC AGACAAACTA TTCTCGATGC TCGATAATTC TAATTCCTTG GGTTTGATGC 6596 .......... .......... .......... .......... .......... .......... 280 ACTTGAGCTT ATGGTCTTCC AAAAACAATC CCTCTACTTC CATGAGGTAG TGGTAAGGTC 6536 .......... .......... .......... .......... .......... .......... 280 TGCTACACTC TACCCTCCTG AGACCCTACT TAGTGCGATT TCTCTGGATA TGTTGTTGTA 6476 .......... .......... .......... .......... .......... .......... 280 TCTGTTTGCT TTTCGTGAGT GAAGAATTTG GCCGTAGAGT TCATTATCTT TGTATAGATT 6416 .......... .......... .......... .......... .......... .......... 280 TCTCTGTTTT GAGTTGTGAT CCACCTCATC TGAAAGTTCT AAAATTTTGT ATGAGGTGAC 6356 .......... .......... .......... .......... .......... .......... 280 ACCCTATGTT GCTCGGACTT TTCAACAGTA TCATCGGTGC GTGTCCGATT CTTCAAAAGT 6296 .......... .......... .......... .......... .......... .......... 280 GGTGCATTTT TGCAGAATTT GACACCGGTG AGGCATCTAA AGTGAGGGGT CCGCACAACT 6236 .......... .......... .......... .......... .......... .......... 280 TACAACTCTT TGGGCATGGG ATTAGAACGA AATCCATCTC TAATGAACGG ACACCCAATG 6176 .......... .......... .......... .......... .......... .......... 280 TCGGTTTACT CTGTTTTGTT ACTCCATGCT TCACATGTCT AGTACTCAGC ATGGTGGCGA 6116 .......... .......... .......... .......... .......... .......... 280 AGTGTTAGTC CCATATAAGA TAAGTGTATT TGTGATTATA TTCGCTTAAA GAAAATCACT 6056 .......... .......... .......... .......... .......... .......... 280 ACTTGAGCTA ATTTTTGGAA TTATATCAGG CCTATGTCCT TTTTGCTCAT CACTAAGCTT 5996 .......... .......... .......... .......... .......... .......... 280 ATGTTTGATC TCAAACTATT AATGCAGGCA ATTCTAGATT TTTTCAAGAA GTTCACTGAT 5936 ||| |||||||||| ||||||| || |||||||||| .......... .......... .......GCA ATTCTAGATT TTTTCAATAA GTTCACTGAT 313 TCAGTTGGTT CTGATCTGCA ATTTGTTATT GACGATATAT CCAAGGAGGA TTCATCAGCT 5876 |||||||||| |||||||||| ||| |||||| |||||||||| || | ||||| |||||||||| TCAGTTGGTT CTGATCTGCA ATTAGTTATT GACGATATAT CCGACGAGGA TTCATCAGCT 373 GTTGGAGTCA CATGGCACTT GGGTATGGAA AAACTTGTAA TACTTTATCT TGTTTTCCAT 5816 |||||||||| |||||||||| || GTTGGAGTCA CATGGCACTT GG........ .......... .......... .......... 395 ACTGTTTACT AGGAAGATTG ATCTTCATTG TCCAGGGTAA TTGATTTTGA ATATGATAAC 5756 .......... .......... .......... .......... .......... .......... 395 AGAATGGAGG GGAAGACCTT TTCCTTTTAG CAAAGGATGC AGCTTTTATC GATTGGAAGT 5696 |||||||| |||||||||| |||||||||| |||| ||||| |||||||||| |||||||||| ..AATGGAGG GGAAGACCTT TTCCTTTTAG CAAAAGATGC AGCTTTTATC GATTGGAAGT 453 GGTGAATGGC CAGATGAAAA TACT 5672 |||||||||| |||||||||| |||| GGTGAATGGC CAGATGAAAA TACT 477 hqPGS_C06HBa0153O03.1-10-_SGN-E250902+ (9535 9256,5968 5854,5753 5672) ******************************************************************************** EST sequence 7 +strand 563 n (File: SGN-E320063+) 1 TTAGCTGCGC CTCATAATCA CAACAGTAGC ATCAAACTGC AATTGCAGCT ATGGTGATGC 61 TTATGAGCAG CACCTCATCC TTTTCTCTCC TCCCACTCAA CTCAAACCCC TCTACATCTA 121 CCACTGCCGC TGTCGTCGGC AACTCACCCG CCGTTCTTCA TAGTGTCAAC TGTTGTACAG 181 ATTATCTATT TAGGATCGTT TCTGGCCATC GGATAAGACA AGTGGCTGTA CGGAATAGCA 241 ATCGAACGGC TGAGGTTACT TCATCCTCTG ATTCTGTAAC TGATTTAAAG TCNGCGGCGA 301 GTGTGGTGAG GAAATTCTAT GCCGGAATAA ATAGGCGGGA TTTGGACTCT GTCGAAGAAC 361 TTATTGCTGA GGATTGTGTG TATGAAGACC TTGTATTTCC TCAACCTTTC GTTGGGCGTA 421 AGGCAATTCT AGATTTTTTC AAGAAGTTCA CTGATTCAGT TGGTTCTGAT CTGCAATTTG 481 TTATTGACGA TATATCCAAG GAGGATTCAT CAGCTGTTGG AGTCACATGG CACTTGGAAT 541 GGGAGGGAAG ACCTTTTCCT TTT Predicted gene structure (within gDNA segment 10277 to 4984): Exon 1 9677 9256 ( 422 n); cDNA 1 422 ( 422 n); score: 0.993 Intron 1 9255 5969 (3287 n); Pd: 0.992 (s: 0.98), Pa: 0.980 (s: 1.00) Exon 2 5968 5854 ( 115 n); cDNA 423 537 ( 115 n); score: 1.000 Intron 2 5853 5754 ( 100 n); Pd: 0.971 (s: 1.00), Pa: 0.967 (s: 0) Exon 3 5753 5728 ( 26 n); cDNA 538 563 ( 26 n); score: 0.923 MATCH C06HBa0153O03.1-10- SGN-E320063+ 0.994 563 1.000 C PGS_C06HBa0153O03.1-10-_SGN-E320063+ (9677 9256,5968 5854,5753 5728) Alignment (genomic DNA sequence = upper lines): TTAGCTGCGC CTCATAATCA CAACAGTAGC ATCAAACTGC AATTGCAGCT ATGGTGATGC 9618 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTAGCTGCGC CTCATAATCA CAACAGTAGC ATCAAACTGC AATTGCAGCT ATGGTGATGC 60 TTATGAGCAG CACCTCATCC TTTTCTCTCC TCCCACTCAA CTCAAACCCC TCTACATCTA 9558 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTATGAGCAG CACCTCATCC TTTTCTCTCC TCCCACTCAA CTCAAACCCC TCTACATCTA 120 CCACTGCCGC TGTCGTCGGC AACTCACCCG CCGTTCTTCA TAGTGTCAAC TGTTGTACAG 9498 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCACTGCCGC TGTCGTCGGC AACTCACCCG CCGTTCTTCA TAGTGTCAAC TGTTGTACAG 180 ATTATCTATT TAGGATCGTT TCTGGCCATC GGATAAGACA AGTGGCTGTA CGGAATAGCA 9438 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTATCTATT TAGGATCGTT TCTGGCCATC GGATAAGACA AGTGGCTGTA CGGAATAGCA 240 ATCGAACGGC TGAGGTTACT TCATCCTCTG ATTCTGTAAC TGATTTAGAG TCGGCGGCGA 9378 |||||||||| |||||||||| |||||||||| |||||||||| ||||||| || || ||||||| ATCGAACGGC TGAGGTTACT TCATCCTCTG ATTCTGTAAC TGATTTAAAG TCNGCGGCGA 300 GTGTGGTGAG GAAATTCTAT GCCGGAATAA ATAGGCGGGA TTTGGACTCT GTCGAAGAAC 9318 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTGTGGTGAG GAAATTCTAT GCCGGAATAA ATAGGCGGGA TTTGGACTCT GTCGAAGAAC 360 TTATTGCTGA GGATTGTGTG TATGAAGACC TTGTATTTCC TCAACCTTTC GTTGGCCGTA 9258 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||| |||| TTATTGCTGA GGATTGTGTG TATGAAGACC TTGTATTTCC TCAACCTTTC GTTGGGCGTA 420 AGGTTAGTTA TCAACCTGGA CTAGTTGTAT TTTGAAGACT TTTTTCTGTT GCATTTTGTT 9198 || AG........ .......... .......... .......... .......... .......... 422 TGACTATTTT CTCTCTGGTA TATTTAGTTT AAGATGTGAG AAAAGTTCAA ACTAAGCTAG 9138 .......... .......... .......... .......... .......... .......... 422 GCTTTTCATT CAATAAAACT AAAGATGAGC TGAAACCCTT TGAAAAGAGG CCAAGATTAA 9078 .......... .......... .......... .......... .......... .......... 422 ACTGAAAATG GAAATTACAT TTGGGGATAA TAACCAAGTT CGTACTTTCT TCATCCCATT 9018 .......... .......... .......... .......... .......... .......... 422 TTATGTGACA CCCTTAGATA TTTGTGTGTT TGAGATTCAT TTCATTAAAG ATAAAAGGAA 8958 .......... .......... .......... .......... .......... .......... 422 AACTTAAAAG TTAAATTGTT GCTAATCATA GTAAGGTAAT ACTTCATTTT GAGACGAACT 8898 .......... .......... .......... .......... .......... .......... 422 AAAAAGGAAA TGCTCTTACA TAAAATGGAA GAGAGGGAGT AATTAAATTT ATAGGGATTT 8838 .......... .......... .......... .......... .......... .......... 422 TGCCAAGAGA AACATCTCCA TTTCAAAAGA TATGATCCTA CTTTCATCAA ATTAAAAAGT 8778 .......... .......... .......... .......... .......... .......... 422 CCGCTGGTTG ATGTAAAGAT TAACACCGCC CATGTTGCAT AGCATGGAGC TGAATGAATT 8718 .......... .......... .......... .......... .......... .......... 422 CACATTCAAA TTTACGTGAG ATCAAAATAA CATCAAAGTG TTCGATGAAT GTTTTGGGAA 8658 .......... .......... .......... .......... .......... .......... 422 CTTCAACTGC AAACAAATGT CCAAGAGCAA TGGTTACTGC AAATTTATCA GACTCTCTAC 8598 .......... .......... .......... .......... .......... .......... 422 TACTCTTAGA AGTGTAAAAA GTATGGAGGA ATGCAAAATC ATCATCATTA TATAGTGCAA 8538 .......... .......... .......... .......... .......... .......... 422 TATTTGGGCC CTCTCCAGTT ACAGCTACAA CATCAACTTG AATGATTCGC ACACATACAA 8478 .......... .......... .......... .......... .......... .......... 422 CAACAGTGAA ACAACCAACA TTTTGCAAAT GTCAGAATAA CCAATAATAA CCTCCTGATG 8418 .......... .......... .......... .......... .......... .......... 422 CCCATCATAG TTCTTTATTA TGTGCTCCCT ACCTAGTGGC AGAATCAGGA TTTTCATTAA 8358 .......... .......... .......... .......... .......... .......... 422 GGGGTTCGAT GGTTGTAGTA AAACGCAATA TTTATTCCCT CTCCAGTTAC AGCTACAACA 8298 .......... .......... .......... .......... .......... .......... 422 TCAACTTGAA TGATTTGCAC TCCTGCAACA ACAATGAAAC AACCAATATT TGCGAATGTT 8238 .......... .......... .......... .......... .......... .......... 422 AGAATAACCA ATTATAACCT CCTGATGCCC ATCATAGTTT TTTATTATGC GTTCCCTATA 8178 .......... .......... .......... .......... .......... .......... 422 CCAATTAATT GGTAGTAATC TTCTGACTAC CCAACCAGCC AGTGGTGGAA CCAGGATGTT 8118 .......... .......... .......... .......... .......... .......... 422 TAGTAGGGCT TGAACACGTA ACCTCATGGA ATTTTCTTAT GCCCTTAACC AATAAACTAA 8058 .......... .......... .......... .......... .......... .......... 422 ATCTTCAACT TGTTTCAAGA GGTGTCAATA CTTGTATATA TATTTACTAA ACCAAAATAT 7998 .......... .......... .......... .......... .......... .......... 422 GACTTCTATA TACAATGTAA CTTTCTGACG AAGGGGTTTC GCTTCACACC TCTTGGCCAA 7938 .......... .......... .......... .......... .......... .......... 422 GGGTGGGTGC GCCCCTGGAT CCAGCTTGTC TCATGTCCTA CATGGGCTCA AATAGAGGAA 7878 .......... .......... .......... .......... .......... .......... 422 CTAGATGCTG TGTTCAACCA GGAAAGGGCC TGCTTATCTC CCCATTAATA ATAAGATGTG 7818 .......... .......... .......... .......... .......... .......... 422 CATCCTTTTG TAAAATCTCT ACATCTAGTT CATCCAAGGC ATTTGGAACG GTGGAAATAA 7758 .......... .......... .......... .......... .......... .......... 422 CATAAGCTCC AACTGAATCA TTTCCTAATT CAGTATCTGT TTTTTTTTTT TTAATGAAAA 7698 .......... .......... .......... .......... .......... .......... 422 TCTGCTTTGA AGGTATTCTA GACTTCTTTG TTGTCAGGCA GAACCACTGT AGTAGGAAGA 7638 .......... .......... .......... .......... .......... .......... 422 ACCCAAGGTC TTTGCCTTTT TATATCTTTA GTTAGAAACT CCAGTGTTTG CTCTTCATCC 7578 .......... .......... .......... .......... .......... .......... 422 CAATTACCAT ATTTAGGTAT TTTGTGATTA TATCAATTGC CTATGACCAG GACACCTAGA 7518 .......... .......... .......... .......... .......... .......... 422 CATGTGAATA TAAACAAGTG ATTCATCACT AAAATATAGC TGAAGTTTGA GAAACATGAA 7458 .......... .......... .......... .......... .......... .......... 422 AATTGACTCT CAGAAACTCA AACCCGATCA AGTATCTCCC AACTGAATCT GAAAGCTTCC 7398 .......... .......... .......... .......... .......... .......... 422 TAATTGCCTC AAGTATTGAA CAAAACCCAC ACCAGCCTTC TCTGAAGTTT ACCATATGTG 7338 .......... .......... .......... .......... .......... .......... 422 CTCATAACAG AAACTATTAG GAAGAGATGC TTCAAGTTTC TTGCGGTTCA TACTTTTCTA 7278 .......... .......... .......... .......... .......... .......... 422 TAAAGAATTT CAAAGTATTG TTCAATAACA ACAATCATCC TTGAGTCAAC ATGATAGAAA 7218 .......... .......... .......... .......... .......... .......... 422 TCACAACTTC TTTGTCTTAG CTGCTCATGT GACATAGCAT AGGGCCAAGG ACCAAGATCT 7158 .......... .......... .......... .......... .......... .......... 422 CATCTTGCCA CCTCAATAAC TCAGGGCTTC AGTGTTTCTC CACCTTTAGT AGAGCTATTT 7098 .......... .......... .......... .......... .......... .......... 422 CACAATTTTT TCGCCACAGT GCACTTGCCC ATTAAAATCT TGACGAGGAT ACTCAATATG 7038 .......... .......... .......... .......... .......... .......... 422 AAAAAGGCAA ACTGTCAGCT AGCTTCCACA ACTCCATCAT TCCTACATCA ATGAGTATTA 6978 .......... .......... .......... .......... .......... .......... 422 TAGCAGGAAG AAAATTATGG CCCAAACTAT GAACACAAAC AACTTTTACA AACTGCATTT 6918 .......... .......... .......... .......... .......... .......... 422 GGAGATTTGG AATCACATCT TTGTAAAAAG TCATTTTGTT TCAATGTCCA TCTCAGTATT 6858 .......... .......... .......... .......... .......... .......... 422 GGAATTATCA CTATTAATAG TAATTCTATT CTCCTCTTTG TTGTTGTTCA AGCCTTTTCC 6798 .......... .......... .......... .......... .......... .......... 422 ATAATAGAGA AATCACCTGA AAGCCCAGTT TTCTTCTTAG CCTTGTTACT GCTTCCATCC 6738 .......... .......... .......... .......... .......... .......... 422 TCTTATTTTT TCACCTTCTT GCACCTCTTT AAAGTTCCAT TTACTATAAA TTGAAGAACC 6678 .......... .......... .......... .......... .......... .......... 422 AGTTCTCCAT CATCAACAAC CTTATTGGCG TCAGACAAAC TATTCTCGAT GCTCGATAAT 6618 .......... .......... .......... .......... .......... .......... 422 TCTAATTCCT TGGGTTTGAT GCACTTGAGC TTATGGTCTT CCAAAAACAA TCCCTCTACT 6558 .......... .......... .......... .......... .......... .......... 422 TCCATGAGGT AGTGGTAAGG TCTGCTACAC TCTACCCTCC TGAGACCCTA CTTAGTGCGA 6498 .......... .......... .......... .......... .......... .......... 422 TTTCTCTGGA TATGTTGTTG TATCTGTTTG CTTTTCGTGA GTGAAGAATT TGGCCGTAGA 6438 .......... .......... .......... .......... .......... .......... 422 GTTCATTATC TTTGTATAGA TTTCTCTGTT TTGAGTTGTG ATCCACCTCA TCTGAAAGTT 6378 .......... .......... .......... .......... .......... .......... 422 CTAAAATTTT GTATGAGGTG ACACCCTATG TTGCTCGGAC TTTTCAACAG TATCATCGGT 6318 .......... .......... .......... .......... .......... .......... 422 GCGTGTCCGA TTCTTCAAAA GTGGTGCATT TTTGCAGAAT TTGACACCGG TGAGGCATCT 6258 .......... .......... .......... .......... .......... .......... 422 AAAGTGAGGG GTCCGCACAA CTTACAACTC TTTGGGCATG GGATTAGAAC GAAATCCATC 6198 .......... .......... .......... .......... .......... .......... 422 TCTAATGAAC GGACACCCAA TGTCGGTTTA CTCTGTTTTG TTACTCCATG CTTCACATGT 6138 .......... .......... .......... .......... .......... .......... 422 CTAGTACTCA GCATGGTGGC GAAGTGTTAG TCCCATATAA GATAAGTGTA TTTGTGATTA 6078 .......... .......... .......... .......... .......... .......... 422 TATTCGCTTA AAGAAAATCA CTACTTGAGC TAATTTTTGG AATTATATCA GGCCTATGTC 6018 .......... .......... .......... .......... .......... .......... 422 CTTTTTGCTC ATCACTAAGC TTATGTTTGA TCTCAAACTA TTAATGCAGG CAATTCTAGA 5958 | |||||||||| .......... .......... .......... .......... .........G CAATTCTAGA 433 TTTTTTCAAG AAGTTCACTG ATTCAGTTGG TTCTGATCTG CAATTTGTTA TTGACGATAT 5898 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTTTTCAAG AAGTTCACTG ATTCAGTTGG TTCTGATCTG CAATTTGTTA TTGACGATAT 493 ATCCAAGGAG GATTCATCAG CTGTTGGAGT CACATGGCAC TTGGGTATGG AAAAACTTGT 5838 |||||||||| |||||||||| |||||||||| |||||||||| |||| ATCCAAGGAG GATTCATCAG CTGTTGGAGT CACATGGCAC TTGG...... .......... 537 AATACTTTAT CTTGTTTTCC ATACTGTTTA CTAGGAAGAT TGATCTTCAT TGTCCAGGGT 5778 .......... .......... .......... .......... .......... .......... 537 AATTGATTTT GAATATGATA ACAGAATGGA GGGGAAGACC TTTTCCTTTT 5728 ||||| ||||||||| |||||||||| .......... .......... ....AATGGG AGGGAAGACC TTTTCCTTTT 563 hqPGS_C06HBa0153O03.1-10-_SGN-E320063+ (9677 9256,5968 5854,5753 5728) Total number of EST alignments reported: 10 ________________________________________________________________________________ Predicted gene locations (2) in segment 1 to 11761: PGL 1 (- strand): 722 482 AGS-1 (722 482) SCR (e 0.882) Exon 1 722 482 ( 241 n); score: 0.882 PGS (722 482) SGN-E238572+ PGS (722 482) SGN-E368646+ PGS (673 482) SGN-E368645- 3-phase translation of AGS-1 (-strand): . . . . . . 722 GGCGAGAAGCGACCGCTTGCCTTTTTGAATCAAGGCTCCAATTAATACCAAAAATTAAAA G E K R P L A F L N Q G S N - Y Q K L K A R S D R L P F - I K A P I N T K N - K R E A T A C L F E S R L Q L I P K I K . . . . . . 662 ATATTTAATTTCATATATAAATATCCAAAATCTTAATAGTAATAACACATATTAGCAAAT I F N F I Y K Y P K S - - - - H I L A N Y L I S Y I N I Q N L N S N N T Y - Q I N I - F H I - I S K I L I V I T H I S K . . . . . . 602 ATTCAATTCAAAAACCAATAGTAGATACTAAAAAGTCTAAAACTTTAAAGACCAAATAAT I Q F K N Q - - I L K S L K L - R P N N F N S K T N S R Y - K V - N F K D Q I I Y S I Q K P I V D T K K S K T L K T K - . . . . . . 542 TCAAAATACAATACTAAAACTTTAAAAGTCTATTCTTCTTCAAAAACTATCCAACTCTTG S K Y N T K T L K V Y S S S K T I Q L L Q N T I L K L - K S I L L Q K L S N S - F K I Q Y - N F K S L F F F K N Y P T L . 482 A Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-1 (+strand): . . . . . . 482 TCAAGAGTTGGATAGTTTTTGAAGAAGAATAGACTTTTAAAGTTTTAGTATTGTATTTTG S R V G - F L K K N R L L K F - Y C I L Q E L D S F - R R I D F - S F S I V F - K S W I V F E E E - T F K V L V L Y F . . . . . . 542 AATTATTTGGTCTTTAAAGTTTTAGACTTTTTAGTATCTACTATTGGTTTTTGAATTGAA N Y L V F K V L D F L V S T I G F - I E I I W S L K F - T F - Y L L L V F E L N E L F G L - S F R L F S I Y Y W F L N - . . . . . . 602 TATTTGCTAATATGTGTTATTACTATTAAGATTTTGGATATTTATATATGAAATTAAATA Y L L I C V I T I K I L D I Y I - N - I I C - Y V L L L L R F W I F I Y E I K Y I F A N M C Y Y Y - D F G Y L Y M K L N . . . . . . 662 TTTTTAATTTTTGGTATTAATTGGAGCCTTGATTCAAAAAGGCAAGCGGTCGCTTCTCGC F L I F G I N W S L D S K R Q A V A S R F - F L V L I G A L I Q K G K R S L L A I F N F W Y - L E P - F K K A S G R F S . 722 C Maximal non-overlapping open reading frames (>= 64 codons): none PGL 2 (- strand): 9677 4384 AGS-1 (9677 9256,5968 5854,5753 5672,4435 4384) SCR (e 0.998 d 0.992 a 0.980,e 1.000 d 0.971 a 0.967,e 1.000 d 0.927 a 0.985,e 1.000) Exon 1 9677 9256 ( 422 n); score: 0.998 Intron 1 9255 5969 (3287 n); Pd: 0.992 Pa: 0.980 Exon 2 5968 5854 ( 115 n); score: 1.000 Intron 2 5853 5754 ( 100 n); Pd: 0.971 Pa: 0.967 Exon 3 5753 5672 ( 82 n); score: 1.000 Intron 3 5671 4436 (1236 n); Pd: 0.927 Pa: 0.985 Exon 4 4435 4384 ( 52 n); score: 1.000 PGS (9616 9256,5968 5854,5753 5672,4435 4384) SGN-E374433+ PGS (9422 9256,5968 5854,5753 5672,4435 4384) SGN-E320389+ PGS (5717 5672,4435 4384) SGN-E340888+ PGS (9677 9256,5968 5854,5753 5672) SGN-E320286+ PGS (9616 9256,5968 5854,5753 5672) SGN-E294549+ PGS (9535 9256,5968 5854,5753 5672) SGN-E250902+ PGS (9677 9256,5968 5854,5753 5728) SGN-E320063+ 3-phase translation of AGS-1 (-strand): . . . . . . 9677 TTAGCTGCGCCTCATAATCACAACAGTAGCATCAAACTGCAATTGCAGCTATGGTGATGC L A A P H N H N S S I K L Q L Q L W - C - L R L I I T T V A S N C N C S Y G D A S C A S - S Q Q - H Q T A I A A M V M . . . . . . 9617 TTATGAGCAGCACCTCATCCTTTTCTCTCCTCCCACTCAACTCAAACCCCTCTACATCTA L - A A P H P F L S S H S T Q T P L H L Y E Q H L I L F S P P T Q L K P L Y I Y L M S S T S S F S L L P L N S N P S T S . . . . . . 9557 CCACTGCCGCTGTCGTCGGCAACTCACCCGCCGTTCTTCATAGTGTCAACTGTTGTACAG P L P L S S A T H P P F F I V S T V V Q H C R C R R Q L T R R S S - C Q L L Y R T T A A V V G N S P A V L H S V N C C T . . . . . . 9497 ATTATCTATTTAGGATCGTTTCTGGCCATCGGATAAGACAAGTGGCTGTACGGAATAGCA I I Y L G S F L A I G - D K W L Y G I A L S I - D R F W P S D K T S G C T E - Q D Y L F R I V S G H R I R Q V A V R N S . . . . . . 9437 ATCGAACGGCTGAGGTTACTTCATCCTCTGATTCTGTAACTGATTTAGAGTCGGCGGCGA I E R L R L L H P L I L - L I - S R R R S N G - G Y F I L - F C N - F R V G G E N R T A E V T S S S D S V T D L E S A A . . . . . . 9377 GTGTGGTGAGGAAATTCTATGCCGGAATAAATAGGCGGGATTTGGACTCTGTCGAAGAAC V W - G N S M P E - I G G I W T L S K N C G E E I L C R N K - A G F G L C R R T S V V R K F Y A G I N R R D L D S V E E . . . . . . 9317 TTATTGCTGAGGATTGTGTGTATGAAGACCTTGTATTTCCTCAACCTTTCGTTGGCCGTA L L L R I V C M K T L Y F L N L S L A V Y C - G L C V - R P C I S S T F R W P - L I A E D C V Y E D L V F P Q P F V G R . : . . . . . 9257 AG : GCAATTCTAGATTTTTTCAAGAAGTTCACTGATTCAGTTGGTTCTGATCTGCAATTTG R : Q F - I F S R S S L I Q L V L I C N L : G N S R F F Q E V H - F S W F - S A I C K : A I L D F F K K F T D S V G S D L Q F . . . . . . : 5910 TTATTGACGATATATCCAAGGAGGATTCATCAGCTGTTGGAGTCACATGGCACTTGG : AAT L L T I Y P R R I H Q L L E S H G T W : N Y - R Y I Q G G F I S C W S H M A L G : M V I D D I S K E D S S A V G V T W H L : E . . . . . . 5750 GGAGGGGAAGACCTTTTCCTTTTAGCAAAGGATGCAGCTTTTATCGATTGGAAGTGGTGA G G E D L F L L A K D A A F I D W K W - E G K T F S F - Q R M Q L L S I G S G E W R G R P F P F S K G C S F Y R L E V V . . : . . . . 5690 ATGGCCAGATGAAAATACT : TTATGGCAGAGACAGTGTGGAACCTGCAGTCAAGCCGGGGG M A R - K Y : F M A E T V W N L Q S S R G W P D E N T : L W Q R Q C G T C S Q A G G N G Q M K I L : Y G R D S V E P A V K P G . . 4394 AGACGGCATTG R R H D G I E T A L Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0153O03.1-10-_PGL-2_AGS-1_PPS_1 (9648 9256,5968 5854,5753 5672,4435 4384) (frame '0'; 642 bp, 214 residues) 1 HQTAIAAMVM LMSSTSSFSL LPLNSNPSTS TTAAVVGNSP AVLHSVNCCT DYLFRIVSGH 61 RIRQVAVRNS NRTAEVTSSS DSVTDLESAA SVVRKFYAGI NRRDLDSVEE LIAEDCVYED 121 LVFPQPFVGR KAILDFFKKF TDSVGSDLQF VIDDISKEDS SAVGVTWHLE WRGRPFPFSK 181 GCSFYRLEVV NGQMKILYGR DSVEPAVKPG ETAL ... finished at: Mon Aug 28 22:24:29 2006