GeneSeqer. Version of March 12, 2006. Date run: Mon Aug 28 21:47:04 2006 (Bayesian) Splice site model (species): Arabidopsis thaliana Fast search parameters: MinMatchLen 16, MinQualityHSP 30, MinQualityCHAIN 50. Total number of ESTs: 34829 Total sequence length: 35392868 Minimum sequence length: 63 Maximum sequence length: 5381 Length distribution (number of sequences of specified length): < 100: 4 < 200: 53 < 300: 143 < 400: 353 < 500: 791 < 600: 1674 < 700: 2583 < 800: 4023 < 900: 6481 < 1000: 5706 >=1000: 13018 Input file : /tmp/bac-submission-temp-vzQej/C06HBa0120H21/C06HBa0120H21.seq.screen ________________________________________________________________________________ Sequence 1: C06HBa0120H21.1-1, from 1 to 8769, both strands analyzed. ... started at: Mon Aug 28 21:57:16 2006 EST library file: /tmp/cxgn-bacpublish-resources-ZuEZzO/lycopersicum_combined_unigene_seqs; matching gDNA +strand ... ... found all matches, elapsed seconds = 2 ... matches indexed, elapsed seconds = 2 HitsTableSize = 3 EST library file: /tmp/cxgn-bacpublish-resources-ZuEZzO/lycopersicum_combined_unigene_seqs; matching gDNA -strand ... ... found all matches, elapsed seconds = 2 ... matches indexed, elapsed seconds = 2 HitsTableSize = 0 ******************************************************************************** EST sequence 1 +strand 955 n (File: SGN-U335600+) 1 NAGTAGCTGG AGCTCCCCGC GGTGGCGGCC GCTCTTCACT AGTGGATCCC CCGGGCTGCA 61 GGAATTCGGC ACGAGGCCGA TCATCTTTGT CGATCACTTT ATGAAACCCT AGATCTCGGT 121 CACCACTCTA CTGCAAACCA TTTGCAAGGA GTGCTTTTTT GTTCATCTCT AATACAAATA 181 TAAAATGTTG CGAATATGGC ATACCTGCTG CCTTCAGAAT GTTGTGAGCA TGGCAGACCC 241 GCTGCCTTCA GTTCCTATTA TATGCATAGA ATTGTTACAT ACACATATAG AAGAGAATAA 301 TATTCCAAAG TTGGTAAAAT GGCTGATGAT AGTAAGGCAG CTGAGAACCA TAACCTCACT 361 TCGGATAGTA ACTTATCTTC AGAAAGCGGT AATGAGGTAT CAATTGATTC CTTAGCACGA 421 AAGGTACAAG AATCTCTCTC ACTGTCGAAG AAGCATAAGT TTTGGGAGAC CCAACCAGTG 481 GGCCAGTTCA AGGATCTTGG AAATTCAAGC TTGCCTGAAG GGCCTATTGA ACCTCCAACA 541 CCCTTTACTG AAGTTAAACA GGAGCCTTTA CATCTTCCCA AGCCCTATGA GGGGACCCCC 601 TTGTGATTTG GACTCCAAAA ATATGTGGAA TGAGGGCTAT CTTCTGCTAA CAAAAAATAT 661 GTAAAAAAAT GATGAGAAAC TGTTTACGGT CCATTACTTC AAAGAAATTC CTTCCAGGGG 721 CACAAAATCC CCCCAGTTAC TAATAAAAAC TGGCCATTTG GATCCGAAGA TGAAAACCCC 781 CAAAAAATTT GGTTCTTTTT TTTCCAGGGG GACCTGCCAA AAAACTGGCT AAAAAATAAC 841 CCCGGGATCA TGGCAAAAAA AAAATTTCCT GGTGGTTCGT AAGAAACTTT AATTAAAAGA 901 AAACTGCCCC TTGTTATTGA AAAAGGGAGG CCCAAGGAAA GGTCCTTTGT AAAAA Predicted gene structure (within gDNA segment 1 to 5406): Exon 1 291 437 ( 147 n); cDNA 147 293 ( 147 n); score: 1.000 Intron 1 438 651 ( 214 n); Pd: 0.000 (s: 1.00), Pa: 0.997 (s: 1.00) Exon 2 652 1302 ( 651 n); cDNA 294 954 ( 661 n); score: 0.816 MATCH C06HBa0120H21.1-1+ SGN-U335600+ 0.850 798 0.836 C PGS_C06HBa0120H21.1-1+_SGN-U335600+ (291 437,652 1302) Alignment (genomic DNA sequence = upper lines): AGGAGTGCTT TTTTGTTCAT CTCTAATACA AATATAAAAT GTTGCGAATA TGGCATACCT 350 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGGAGTGCTT TTTTGTTCAT CTCTAATACA AATATAAAAT GTTGCGAATA TGGCATACCT 206 GCTGCCTTCA GAATGTTGTG AGCATGGCAG ACCCGCTGCC TTCAGTTCCT ATTATATGCA 410 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCTGCCTTCA GAATGTTGTG AGCATGGCAG ACCCGCTGCC TTCAGTTCCT ATTATATGCA 266 TAGAATTGTT ACATACACAT ATAGAAGGTC CGAACCCATT TCACAAAGAA GTAGTGGATT 470 |||||||||| |||||||||| ||||||| TAGAATTGTT ACATACACAT ATAGAAG... .......... .......... .......... 293 TCAGTGGTAC TCTGTATTTT ACTGACTTAA AACTCCTGGA TGGGGCTTTG GTTGTGGAAT 530 .......... .......... .......... .......... .......... .......... 293 GTTTTTAACA CTAATGGTCT CTGACAGCAT GTGTGCTGTA AATGGCTTAT AGATTTGCTA 590 .......... .......... .......... .......... .......... .......... 293 ATATGTTCGC TTTATGTGTT ATATACTTAG TGCTGGATTT AATAATTTAT TGTTATGGCA 650 .......... .......... .......... .......... .......... .......... 293 GAGAATAATA TTCCAAAGTT GGTAAAATGG CTGATGATAG TAAGGCAGCT GAGAACCATA 710 ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| .AGAATAATA TTCCAAAGTT GGTAAAATGG CTGATGATAG TAAGGCAGCT GAGAACCATA 352 ACCTCACTTC GGATAGTAAC TTATCTTCAG AAAGCGGTAA TGAGGTATCA ATTGATTCCT 770 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACCTCACTTC GGATAGTAAC TTATCTTCAG AAAGCGGTAA TGAGGTATCA ATTGATTCCT 412 TAGCACGAAA GGTACAAGAA TCTCTCTCAC TGTCGAAGAG GCATAAGTTT TGGGAGACCC 830 |||||||||| |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| TAGCACGAAA GGTACAAGAA TCTCTCTCAC TGTCGAAGAA GCATAAGTTT TGGGAGACCC 472 AACCAGTGGG CCAGTTCAAG GATCTTGGAG ATTCAAGCTT GCCTGAAGGT CCTATTGAAC 890 |||||||||| |||||||||| ||||||||| |||||||||| ||||||||| |||||||||| AACCAGTGGG CCAGTTCAAG GATCTTGGAA ATTCAAGCTT GCCTGAAGGG CCTATTGAAC 532 CTCCAACACC CTTAACTGAA GTTAAACAGG AGCCTTACAA TCTTCCTAGT CCATATGAGT 950 |||||||||| ||| |||||| |||||||||| |||||| | |||||| | || |||||| CTCCAACACC CTTTACTGAA GTTAAACAGG AGCCTTTACA TCTTCCCAAG CCCTATGAGG 592 GGACCACC-T GTGATTTGGA CTCAGAAGAT ATGTGTAATG AGGTCTATCT TCTGCTAACA 1009 ||||| || | |||||||||| ||| || || ||||| |||| ||| |||||| |||||||||| GGACCCCCTT GTGATTTGGA CTCCAAAAAT ATGTGGAATG AGGGCTATCT TCTGCTAACA 652 AACAATTATG T-AGAAGATG ATGAGAACAT GTTTAGGTTC AATTAC-TCA AAGGAATTTC 1067 || || |||| | | || ||| ||||||| | ||||| | || ||||| ||| ||| |||| | AA-AAATATG TAAAAAAATG ATGAGAAACT GTTTACGGTC CATTACTTCA AAGAAATTCC 711 TTCGATGGGC ACTACATCCT CCAGGTTACT -ATAAGAGCT GGCACATAGG AGTCAGAG-T 1125 ||| | |||| || | |||| || |||||| |||| | || ||| | || | | || | TTCCAGGGGC ACAAAATCCC CCCAGTTACT AATAAAAACT GGCCATTTGG ATCCGAAGAT 771 GAAGA-CCTC AAAAAAATTG GTTGCTTTTA TCACAGGGGT ACCTGCAAAG ATACGTGCT- 1183 ||| | || | |||||| ||| ||| |||| | |||||| |||||| || | || ||| GAAAACCCCC AAAAAATTTG GTTCTTTTTT TTCCAGGGGG ACCTGCCAAA AAACTGGCTA 831 AGAGATAACG TCGTGATCAT GGCAGAGATA AACTTT-CTG TGTGTTCATA AGAAGC-TTA 1241 | | ||||| || |||||| |||| | | | || ||| ||| |||| || |||| | ||| AAAAATAACC CCGGGATCAT GGCAAAAAAA AAATTTCCTG GTGGTTCGTA AGAAACTTTA 891 GATCAAAGAG ACTTGCTCC- TGTCA-TGAT AAAGGAGGTC ACAAGGAGAG TTCACTTGGA 1299 | ||||| | ||| || ||| | ||| ||||| | | |||||| || || ||| | ATTAAAAGAA AACTGCCCCT TGTTATTGAA AAAGGGAGGC CCAAGGAAAG GTCCTTTGTA 951 AAA 1302 ||| AAA 954 hqPGS_C06HBa0120H21.1-1+_SGN-U335600+ (291 437,652 1302) ******************************************************************************** EST sequence 2 +strand 1645 n (File: SGN-U316325+) 1 CGGCCGAATT GGTCGATCAC TTTATGAAAC CCTAGATCTC GGTCACCACT CTACTGCAAA 61 CCATTTGCAA GAGAATAATA TTCCAAAGTT GGTAAAATGG CTGATGATAG TAAGGCAGCT 121 GAGAACCATA ACCTCACTTC GGATAGTAAC TTATCTTCAG AAAGCGGTAA TGAGGTATCA 181 ATTGATTCCT TAGCACGAAA GGTACAAGAA TCTCTCTCAC TGTCGAAGAG GCATAAGTTT 241 TGGGAGACCC AACCAGTGGG CCAGTTCAAG GATCTTGGAG ATTCAAGCTT GCCTGAAGGT 301 CCTATTGAAC CTCCAACACC CTTAACTGAA GTTAAACAGG AGCCTTACAA TCTTCCTAGT 361 CCATATGAGT GGACCACCTG TGATTTGGAC TCAGAAGATA TGTGTAATGA GGTCTATCTT 421 CTGCTAACAA ACAATTATGT AGAAGATGAT GAGAACATGT TTAGGTTCAA TTACTCAAAG 481 GAATTTCTTC GATGGGCACT ACATCCTCCA GGTTACTATA AGAGCTGGCA CATAGGAGTC 541 AGAGTGAAGA CCTCAAAAAA ATTGGTTGCT TTTATCACAG GGGTACCTGC AAAGATACGT 601 GCTAGAGATA ACGTCGTGAT CATGGCAGAG ATAAACTTTC TGTGTGTTCA TAAGAAGCTT 661 AGATCAAAGA GACTTGCTCC TGTCATGATA AAGGAGGTCA CAAGGAGAGT TCACTTGGAA 721 AACATTTGGC AAGCTGCTTA TACAGCTGGG GTGGTTCTTC CAACACCTGT ATCCACATGT 781 CAATATTGGC ATAGATCTTT GAATCCAAAG AAGCTTATCG ATGTCGGATT TTCTAGGCTT 841 GGTGCGAGGA TGACAATGAG CCGCACAATA AAGCTTTACA AGTTACCTGA TCAGACCGTC 901 ACACCTGGGT TCAGAAAGAT GGAGCTCCAT GATGTTCCTG CTGTTACTCG ATTGCTTAGG 961 GATTACTTGA AGCACTTTGT GGTTGCACCT GATTTTGATG AGAATGACGT AGAGCACTGG 1021 CTTCTGCCAA AGGAGGGTGT TGTTGACAGC TATCTGGTTG AAAGCCCCGA GTCTCATGAA 1081 ATCACTGACT TCTGCAGTTT TTACACCCTC CCTTCATCAA TTCTTGGTAA TCAGAATTAT 1141 TCCACTCTGA AGGCTGCTTA TTCTTATTAC AACGTTTCAA CTAAAACTCC ATGGATTCAG 1201 TTGATGAATG ATGCTCTCAT AGTGGCTAAG AAAAAGGATT TTGACGTTTT CAATGCTCTA 1261 GATGTTATGC ATAACGAAAC TTTCTTGAAG GAACTAAAGT TTGGCCCCGG TGATGGGCAA 1321 CTCCACTACT ATCTCTACAA CTATCGAATA AAGCATGTTT TAAGACCATC AGAACTTGGG 1381 CTTGTACTGT TGTAATTTGT GCTTGGGAGA TTATTGCCTG CCTGGGAAGC ATTTTTTTTG 1441 GTGATGTGCT CGGTCTCATT TTTTAAGCCC CCTTTGCCCT GTTAAACTTT GTTTGCATCC 1501 ATGAATTGAT CAAGGCACAA GTGTGAACAT AGTCTTTGCA CTTAAATCGA TTGTACTTTA 1561 TGATGTCTGA TCTTATATAA TTCATCCATG TTGATCCCAA GATCTTAAAT ATTCAGAACC 1621 GTGGATAAAG AATTTTTTCT CTTTA Predicted gene structure (within gDNA segment 1 to 2835): Exon 1 652 2225 (1574 n); cDNA 72 1645 (1574 n); score: 1.000 MATCH C06HBa0120H21.1-1+ SGN-U316325+ 1.000 1574 0.957 C PGS_C06HBa0120H21.1-1+_SGN-U316325+ (652 2225) Alignment (genomic DNA sequence = upper lines): AGAATAATAT TCCAAAGTTG GTAAAATGGC TGATGATAGT AAGGCAGCTG AGAACCATAA 711 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGAATAATAT TCCAAAGTTG GTAAAATGGC TGATGATAGT AAGGCAGCTG AGAACCATAA 131 CCTCACTTCG GATAGTAACT TATCTTCAGA AAGCGGTAAT GAGGTATCAA TTGATTCCTT 771 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCTCACTTCG GATAGTAACT TATCTTCAGA AAGCGGTAAT GAGGTATCAA TTGATTCCTT 191 AGCACGAAAG GTACAAGAAT CTCTCTCACT GTCGAAGAGG CATAAGTTTT GGGAGACCCA 831 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGCACGAAAG GTACAAGAAT CTCTCTCACT GTCGAAGAGG CATAAGTTTT GGGAGACCCA 251 ACCAGTGGGC CAGTTCAAGG ATCTTGGAGA TTCAAGCTTG CCTGAAGGTC CTATTGAACC 891 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACCAGTGGGC CAGTTCAAGG ATCTTGGAGA TTCAAGCTTG CCTGAAGGTC CTATTGAACC 311 TCCAACACCC TTAACTGAAG TTAAACAGGA GCCTTACAAT CTTCCTAGTC CATATGAGTG 951 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCCAACACCC TTAACTGAAG TTAAACAGGA GCCTTACAAT CTTCCTAGTC CATATGAGTG 371 GACCACCTGT GATTTGGACT CAGAAGATAT GTGTAATGAG GTCTATCTTC TGCTAACAAA 1011 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GACCACCTGT GATTTGGACT CAGAAGATAT GTGTAATGAG GTCTATCTTC TGCTAACAAA 431 CAATTATGTA GAAGATGATG AGAACATGTT TAGGTTCAAT TACTCAAAGG AATTTCTTCG 1071 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAATTATGTA GAAGATGATG AGAACATGTT TAGGTTCAAT TACTCAAAGG AATTTCTTCG 491 ATGGGCACTA CATCCTCCAG GTTACTATAA GAGCTGGCAC ATAGGAGTCA GAGTGAAGAC 1131 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATGGGCACTA CATCCTCCAG GTTACTATAA GAGCTGGCAC ATAGGAGTCA GAGTGAAGAC 551 CTCAAAAAAA TTGGTTGCTT TTATCACAGG GGTACCTGCA AAGATACGTG CTAGAGATAA 1191 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTCAAAAAAA TTGGTTGCTT TTATCACAGG GGTACCTGCA AAGATACGTG CTAGAGATAA 611 CGTCGTGATC ATGGCAGAGA TAAACTTTCT GTGTGTTCAT AAGAAGCTTA GATCAAAGAG 1251 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CGTCGTGATC ATGGCAGAGA TAAACTTTCT GTGTGTTCAT AAGAAGCTTA GATCAAAGAG 671 ACTTGCTCCT GTCATGATAA AGGAGGTCAC AAGGAGAGTT CACTTGGAAA ACATTTGGCA 1311 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACTTGCTCCT GTCATGATAA AGGAGGTCAC AAGGAGAGTT CACTTGGAAA ACATTTGGCA 731 AGCTGCTTAT ACAGCTGGGG TGGTTCTTCC AACACCTGTA TCCACATGTC AATATTGGCA 1371 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGCTGCTTAT ACAGCTGGGG TGGTTCTTCC AACACCTGTA TCCACATGTC AATATTGGCA 791 TAGATCTTTG AATCCAAAGA AGCTTATCGA TGTCGGATTT TCTAGGCTTG GTGCGAGGAT 1431 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TAGATCTTTG AATCCAAAGA AGCTTATCGA TGTCGGATTT TCTAGGCTTG GTGCGAGGAT 851 GACAATGAGC CGCACAATAA AGCTTTACAA GTTACCTGAT CAGACCGTCA CACCTGGGTT 1491 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GACAATGAGC CGCACAATAA AGCTTTACAA GTTACCTGAT CAGACCGTCA CACCTGGGTT 911 CAGAAAGATG GAGCTCCATG ATGTTCCTGC TGTTACTCGA TTGCTTAGGG ATTACTTGAA 1551 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAGAAAGATG GAGCTCCATG ATGTTCCTGC TGTTACTCGA TTGCTTAGGG ATTACTTGAA 971 GCACTTTGTG GTTGCACCTG ATTTTGATGA GAATGACGTA GAGCACTGGC TTCTGCCAAA 1611 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCACTTTGTG GTTGCACCTG ATTTTGATGA GAATGACGTA GAGCACTGGC TTCTGCCAAA 1031 GGAGGGTGTT GTTGACAGCT ATCTGGTTGA AAGCCCCGAG TCTCATGAAA TCACTGACTT 1671 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGAGGGTGTT GTTGACAGCT ATCTGGTTGA AAGCCCCGAG TCTCATGAAA TCACTGACTT 1091 CTGCAGTTTT TACACCCTCC CTTCATCAAT TCTTGGTAAT CAGAATTATT CCACTCTGAA 1731 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTGCAGTTTT TACACCCTCC CTTCATCAAT TCTTGGTAAT CAGAATTATT CCACTCTGAA 1151 GGCTGCTTAT TCTTATTACA ACGTTTCAAC TAAAACTCCA TGGATTCAGT TGATGAATGA 1791 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGCTGCTTAT TCTTATTACA ACGTTTCAAC TAAAACTCCA TGGATTCAGT TGATGAATGA 1211 TGCTCTCATA GTGGCTAAGA AAAAGGATTT TGACGTTTTC AATGCTCTAG ATGTTATGCA 1851 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGCTCTCATA GTGGCTAAGA AAAAGGATTT TGACGTTTTC AATGCTCTAG ATGTTATGCA 1271 TAACGAAACT TTCTTGAAGG AACTAAAGTT TGGCCCCGGT GATGGGCAAC TCCACTACTA 1911 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TAACGAAACT TTCTTGAAGG AACTAAAGTT TGGCCCCGGT GATGGGCAAC TCCACTACTA 1331 TCTCTACAAC TATCGAATAA AGCATGTTTT AAGACCATCA GAACTTGGGC TTGTACTGTT 1971 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCTCTACAAC TATCGAATAA AGCATGTTTT AAGACCATCA GAACTTGGGC TTGTACTGTT 1391 GTAATTTGTG CTTGGGAGAT TATTGCCTGC CTGGGAAGCA TTTTTTTTGG TGATGTGCTC 2031 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTAATTTGTG CTTGGGAGAT TATTGCCTGC CTGGGAAGCA TTTTTTTTGG TGATGTGCTC 1451 GGTCTCATTT TTTAAGCCCC CTTTGCCCTG TTAAACTTTG TTTGCATCCA TGAATTGATC 2091 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGTCTCATTT TTTAAGCCCC CTTTGCCCTG TTAAACTTTG TTTGCATCCA TGAATTGATC 1511 AAGGCACAAG TGTGAACATA GTCTTTGCAC TTAAATCGAT TGTACTTTAT GATGTCTGAT 2151 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAGGCACAAG TGTGAACATA GTCTTTGCAC TTAAATCGAT TGTACTTTAT GATGTCTGAT 1571 CTTATATAAT TCATCCATGT TGATCCCAAG ATCTTAAATA TTCAGAACCG TGGATAAAGA 2211 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTATATAAT TCATCCATGT TGATCCCAAG ATCTTAAATA TTCAGAACCG TGGATAAAGA 1631 ATTTTTTCTC TTTA 2225 |||||||||| |||| ATTTTTTCTC TTTA 1645 hqPGS_C06HBa0120H21.1-1+_SGN-U316325+ (652 2225) ******************************************************************************** EST sequence 3 +strand 1651 n (File: SGN-U316326+) 1 TGAAACCCTA GATCTGCGTC GCCGGCCGGG ATTCGAAGAC GCTGCTAGAG AATAATACAC 61 TGAAGCTGGT GAAATGGCTG ACAACAGTAA GTCAACCGAG AACCATAACC AAAGTTCTGA 121 TGATAATTTA GCTCCAGAGA ATGGTAATGA GGCCATTGAT TCCTTGGCAC GAAAGGTCCA 181 AGAATCTCTG TCGCTTGCAA AGAGACATAA GTTCTGGGAG ACCCAACCAG TGAGACAGTT 241 CAAGGATCTT GGGGATTCAA GCTTGCCTGA AGGCCCCATT GAACCTCCAA CTCCTTTGTC 301 AGAAGTTAAG CAGGAGCCTT ACAACCTTCC AAGTCAGTAT GAGTGGACCA CCTGTGATAT 361 GGATTCGGAG GAGATGTGCA ATGAAGTCTA TGTTCTCCTA ACAAACAACT ATGTTGAGGA 421 TGATGAGAAC ATGTTTAGGT TCAATTACTC TAAAGAATTT CTTCGGTGGG CACTTCGCCC 481 TCCAGGTTTC TATAGGAGCT GGCACATTGG AGTCAGAGTG AAGACCTCAA AAAAGTTGGT 541 TGCTTTTATT ACAGGGGTAC CTGCTAGGAT ACGCGTTCGA GATAATGTTG TGATGATGGC 601 AGAGATCAAT TTCCTTTGTG TTCATAAGAA ACTTAGATCA AAAAGACTTG CTCCTGTTAT 661 GATAAAAGAG GTCACAAGGA GGGTTCATAT GGAGAATATC TGGCAAGCTG CTTATACAGC 721 TGGGGTGGTC CTACCGACAC CTGTATCAAC CTGTCAATAT TGGCATAGAT CTCTGAATCC 781 AAAGAAGCTA ATTGATGTTG GGTTTTCCAG GCTTGGTGCA AGGATGACAA TGAGCCGAAC 841 AATAAAGCTG TATAAGTTAC CTGATCAGAC TGTCACACCT GGGTTCAGGA AGATGGAGCC 901 CCATGATGTT CCTGCAGTTA CTCGATTGCT CAGGAATTAC TTGAAGCAGT TTGTGGTTGC 961 ACCTGACTTT GATGAAAATG ATGTGGAACA CTGGCTTCTG CCAAAGGAAG GTGTTATTGA 1021 CAGTTATCTG GTTGAAAGTC CTCAAACTCA TGAAATCACC GACTTCTGCA GTTTTTACAC 1081 ACTCCCTTCA TCAATTCTTG GTAACCAGAA TCATACCACT CTAAAAGCTG CTTATTCGTA 1141 TTACAATGTC TCTACAAAGA CTCCATTGAT TCAGTTGATG AATGATGCCC TTATTGTGGC 1201 AAAACAGAAG GACTTTGATG TTTTCAATGC CCTAGATGTT ATGCAGAACG ATAGTTTCTT 1261 AAAGGAACTG AAGTTTGGCC CCGGTGATGG GAAACTCCAC TACTATCTCT ACAATTATCG 1321 AACAAAGCAT GTTTTAAGAT CATCAGAGCT TGGGCTTGTA CTCTTGTAGT TGGAGTTCAC 1381 AAAACTGTTT GTTACTGGGG ACAGAGCAGC ATTTTTGGTA ATTTTGTTTC GGTGTAGAAC 1441 TCTCTATTTT GCTCTATCTG TCTCTTGTCC CTCTCAACTG GACTTTGTAG TTCCTGATCT 1501 TGCCTGTTTC ATTGTAACTA TGGATAAGTT CCAGTTCAAC TCTTTTTGTG GGTAAACTAT 1561 CACATCTTTC TATGTCCTTC CTAAATCCAT TACTTGCTTG ATTAAAGTTG ACATTAATAA 1621 AGCAGTGTTA TACCTCCTTC AAAAAAAAAA A Predicted gene structure (within gDNA segment 1 to 5675): Exon 1 652 1977 (1326 n); cDNA 49 1371 (1323 n); score: 0.873 PPA cDNA 1641 1651 MATCH C06HBa0120H21.1-1+ SGN-U316326+ 0.873 1326 0.803 C PGS_C06HBa0120H21.1-1+_SGN-U316326+ (652 1977) Alignment (genomic DNA sequence = upper lines): AGAATAATAT TCCAAAGTTG GTAAAATGGC TGATGATAGT AAGGCAGCTG AGAACCATAA 711 ||||||||| | ||| || || ||||||| ||| | ||| ||| || | | |||||||||| AGAATAATAC ACTGAAGCTG GTGAAATGGC TGACAACAGT AAGTCAACCG AGAACCATAA 108 CCTCACTTCG GATAGTAACT TATCTTCAGA AAGCGGTAAT GAGGTATCAA TTGATTCCTT 771 || | ||| ||| ||| | || || |||| | |||||| |||| | | |||||||||| CCAAAGTTCT GATGATAATT TAGCTCCAGA GAATGGTAAT GAGG--CC-A TTGATTCCTT 165 AGCACGAAAG GTACAAGAAT CTCTCTCACT GTCGAAGAGG CATAAGTTTT GGGAGACCCA 831 ||||||||| || ||||||| |||| || || | ||||| |||||||| | |||||||||| GGCACGAAAG GTCCAAGAAT CTCTGTCGCT TGCAAAGAGA CATAAGTTCT GGGAGACCCA 225 ACCAGTGGGC CAGTTCAAGG ATCTTGGAGA TTCAAGCTTG CCTGAAGGTC CTATTGAACC 891 ||||||| | |||||||||| ||||||| || |||||||||| |||||||| | | |||||||| ACCAGTGAGA CAGTTCAAGG ATCTTGGGGA TTCAAGCTTG CCTGAAGGCC CCATTGAACC 285 TCCAACACCC TTAACTGAAG TTAAACAGGA GCCTTACAAT CTTCCTAGTC CATATGAGTG 951 |||||| || || | |||| |||| ||||| ||||||||| ||||| |||| |||||||| TCCAACTCCT TTGTCAGAAG TTAAGCAGGA GCCTTACAAC CTTCCAAGTC AGTATGAGTG 345 GACCACCTGT GATTTGGACT CAGAAGATAT GTGTAATGAG GTCTATCTTC TGCTAACAAA 1011 |||||||||| ||| |||| | | || || || ||| ||||| |||||| ||| | |||||||| GACCACCTGT GATATGGATT CGGAGGAGAT GTGCAATGAA GTCTATGTTC TCCTAACAAA 405 CAATTATGTA GAAGATGATG AGAACATGTT TAGGTTCAAT TACTCAAAGG AATTTCTTCG 1071 ||| ||||| || ||||||| |||||||||| |||||||||| ||||| || | |||||||||| CAACTATGTT GAGGATGATG AGAACATGTT TAGGTTCAAT TACTCTAAAG AATTTCTTCG 465 ATGGGCACTA CATCCTCCAG GTTACTATAA GAGCTGGCAC ATAGGAGTCA GAGTGAAGAC 1131 |||||||| | ||||||| ||| ||||| |||||||||| || ||||||| |||||||||| GTGGGCACTT CGCCCTCCAG GTTTCTATAG GAGCTGGCAC ATTGGAGTCA GAGTGAAGAC 525 CTCAAAAAAA TTGGTTGCTT TTATCACAGG GGTACCTGCA AAGATACGTG CTAGAGATAA 1191 ||||||||| |||||||||| |||| ||||| ||||||||| | |||||| | | ||||||| CTCAAAAAAG TTGGTTGCTT TTATTACAGG GGTACCTGCT AGGATACGCG TTCGAGATAA 585 CGTCGTGATC ATGGCAGAGA TAAACTTTCT GTGTGTTCAT AAGAAGCTTA GATCAAAGAG 1251 || ||||| |||||||||| | || || || ||||||||| ||||| |||| ||||||| || TGTTGTGATG ATGGCAGAGA TCAATTTCCT TTGTGTTCAT AAGAAACTTA GATCAAAAAG 645 ACTTGCTCCT GTCATGATAA AGGAGGTCAC AAGGAGAGTT CACTTGGAAA ACATTTGGCA 1311 |||||||||| || ||||||| | |||||||| |||||| ||| || |||| | | || ||||| ACTTGCTCCT GTTATGATAA AAGAGGTCAC AAGGAGGGTT CATATGGAGA ATATCTGGCA 705 AGCTGCTTAT ACAGCTGGGG TGGTTCTTCC AACACCTGTA TCCACATGTC AATATTGGCA 1371 |||||||||| |||||||||| |||| || || ||||||||| || || |||| |||||||||| AGCTGCTTAT ACAGCTGGGG TGGTCCTACC GACACCTGTA TCAACCTGTC AATATTGGCA 765 TAGATCTTTG AATCCAAAGA AGCTTATCGA TGTCGGATTT TCTAGGCTTG GTGCGAGGAT 1431 ||||||| || |||||||||| |||| || || ||| || ||| || ||||||| |||| ||||| TAGATCTCTG AATCCAAAGA AGCTAATTGA TGTTGGGTTT TCCAGGCTTG GTGCAAGGAT 825 GACAATGAGC CGCACAATAA AGCTTTACAA GTTACCTGAT CAGACCGTCA CACCTGGGTT 1491 |||||||||| || ||||||| |||| || || |||||||||| ||||| |||| |||||||||| GACAATGAGC CGAACAATAA AGCTGTATAA GTTACCTGAT CAGACTGTCA CACCTGGGTT 885 CAGAAAGATG GAGCTCCATG ATGTTCCTGC TGTTACTCGA TTGCTTAGGG ATTACTTGAA 1551 ||| |||||| |||| ||||| |||||||||| ||||||||| ||||| ||| |||||||||| CAGGAAGATG GAGCCCCATG ATGTTCCTGC AGTTACTCGA TTGCTCAGGA ATTACTTGAA 945 GCACTTTGTG GTTGCACCTG ATTTTGATGA GAATGACGTA GAGCACTGGC TTCTGCCAAA 1611 ||| |||||| |||||||||| | |||||||| ||||| || || ||||||| |||||||||| GCAGTTTGTG GTTGCACCTG ACTTTGATGA AAATGATGTG GAACACTGGC TTCTGCCAAA 1005 GGAGGGTGTT GTTGACAGCT ATCTGGTTGA AAGCCCCGAG TCTCATGAAA TCACTGACTT 1671 ||| |||||| ||||||| | |||||||||| ||| || | ||||||||| |||| ||||| GGAAGGTGTT ATTGACAGTT ATCTGGTTGA AAGTCCTCAA ACTCATGAAA TCACCGACTT 1065 CTGCAGTTTT TACACCCTCC CTTCATCAAT TCTTGGTAAT CAGAATTATT CCACTCTGAA 1731 |||||||||| ||||| |||| |||||||||| ||||||||| |||||| || ||||||| || CTGCAGTTTT TACACACTCC CTTCATCAAT TCTTGGTAAC CAGAATCATA CCACTCTAAA 1125 GGCTGCTTAT TCTTATTACA ACGTTTCAAC TAAAACTCCA TGGATTCAGT TGATGAATGA 1791 ||||||||| || ||||||| | || || || || |||||| | |||||||| |||||||||| AGCTGCTTAT TCGTATTACA ATGTCTCTAC AAAGACTCCA TTGATTCAGT TGATGAATGA 1185 TGCTCTCATA GTGGCTAAGA AAAAGGATTT TGACGTTTTC AATGCTCTAG ATGTTATGCA 1851 ||| || || ||||| || | ||||| || ||| |||||| ||||| |||| |||||||||| TGCCCTTATT GTGGCAAAAC AGAAGGACTT TGATGTTTTC AATGCCCTAG ATGTTATGCA 1245 TAACGAAACT TTCTTGAAGG AACTAAAGTT TGGCCCCGGT GATGGGCAAC TCCACTACTA 1911 ||||| | | ||||| |||| |||| ||||| |||||||||| |||||| ||| |||||||||| GAACGATAGT TTCTTAAAGG AACTGAAGTT TGGCCCCGGT GATGGGAAAC TCCACTACTA 1305 TCTCTACAAC TATCGAATAA AGCATGTTTT AAGACCATCA GAACTTGGGC TTGTACTGTT 1971 ||||||||| ||||||| || |||||||||| |||| ||||| || ||||||| ||||||| || TCTCTACAAT TATCGAACAA AGCATGTTTT AAGATCATCA GAGCTTGGGC TTGTACTCTT 1365 GTAATT 1977 ||| || GTAGTT 1371 hqPGS_C06HBa0120H21.1-1+_SGN-U316326+ (652 1977) Total number of EST alignments reported: 3 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 8769: PGL 1 (+ strand): 291 2225 AGS-1 (291 437,652 2225) SCR (e 1.000 d 0.000 a 0.997,e 1.000) Exon 1 291 437 ( 147 n); score: 1.000 Intron 1 438 651 ( 214 n); Pd: 0.000 Pa: 0.997 Exon 2 652 2225 (1574 n); score: 1.000 PGS (291 437,652 1302) SGN-U335600+ PGS (652 2225) SGN-U316325+ PGS (652 1977) SGN-U316326+ 3-phase translation of AGS-1 (+strand): . . . . . . 291 AGGAGTGCTTTTTTGTTCATCTCTAATACAAATATAAAATGTTGCGAATATGGCATACCT R S A F L F I S N T N I K C C E Y G I P G V L F C S S L I Q I - N V A N M A Y L E C F F V H L - Y K Y K M L R I W H T . . . . . . 351 GCTGCCTTCAGAATGTTGTGAGCATGGCAGACCCGCTGCCTTCAGTTCCTATTATATGCA A A F R M L - A W Q T R C L Q F L L Y A L P S E C C E H G R P A A F S S Y Y M H C C L Q N V V S M A D P L P S V P I I C . . . : . . . 411 TAGAATTGTTACATACACATATAGAAG : AGAATAATATTCCAAAGTTGGTAAAATGGCTGA - N C Y I H I - K : R I I F Q S W - N G - R I V T Y T Y R R : E - Y S K V G K M A D I E L L H T H I E : E N N I P K L V K W L . . . . . . 685 TGATAGTAAGGCAGCTGAGAACCATAACCTCACTTCGGATAGTAACTTATCTTCAGAAAG - - - G S - E P - P H F G - - L I F R K D S K A A E N H N L T S D S N L S S E S M I V R Q L R T I T S L R I V T Y L Q K . . . . . . 745 CGGTAATGAGGTATCAATTGATTCCTTAGCACGAAAGGTACAAGAATCTCTCTCACTGTC R - - G I N - F L S T K G T R I S L T V G N E V S I D S L A R K V Q E S L S L S A V M R Y Q L I P - H E R Y K N L S H C . . . . . . 805 GAAGAGGCATAAGTTTTGGGAGACCCAACCAGTGGGCCAGTTCAAGGATCTTGGAGATTC E E A - V L G D P T S G P V Q G S W R F K R H K F W E T Q P V G Q F K D L G D S R R G I S F G R P N Q W A S S R I L E I . . . . . . 865 AAGCTTGCCTGAAGGTCCTATTGAACCTCCAACACCCTTAACTGAAGTTAAACAGGAGCC K L A - R S Y - T S N T L N - S - T G A S L P E G P I E P P T P L T E V K Q E P Q A C L K V L L N L Q H P - L K L N R S . . . . . . 925 TTACAATCTTCCTAGTCCATATGAGTGGACCACCTGTGATTTGGACTCAGAAGATATGTG L Q S S - S I - V D H L - F G L R R Y V Y N L P S P Y E W T T C D L D S E D M C L T I F L V H M S G P P V I W T Q K I C . . . . . . 985 TAATGAGGTCTATCTTCTGCTAACAAACAATTATGTAGAAGATGATGAGAACATGTTTAG - - G L S S A N K Q L C R R - - E H V - N E V Y L L L T N N Y V E D D E N M F R V M R S I F C - Q T I M - K M M R T C L . . . . . . 1045 GTTCAATTACTCAAAGGAATTTCTTCGATGGGCACTACATCCTCCAGGTTACTATAAGAG V Q L L K G I S S M G T T S S R L L - E F N Y S K E F L R W A L H P P G Y Y K S G S I T Q R N F F D G H Y I L Q V T I R . . . . . . 1105 CTGGCACATAGGAGTCAGAGTGAAGACCTCAAAAAAATTGGTTGCTTTTATCACAGGGGT L A H R S Q S E D L K K I G C F Y H R G W H I G V R V K T S K K L V A F I T G V A G T - E S E - R P Q K N W L L L S Q G . . . . . . 1165 ACCTGCAAAGATACGTGCTAGAGATAACGTCGTGATCATGGCAGAGATAAACTTTCTGTG T C K D T C - R - R R D H G R D K L S V P A K I R A R D N V V I M A E I N F L C Y L Q R Y V L E I T S - S W Q R - T F C . . . . . . 1225 TGTTCATAAGAAGCTTAGATCAAAGAGACTTGCTCCTGTCATGATAAAGGAGGTCACAAG C S - E A - I K E T C S C H D K G G H K V H K K L R S K R L A P V M I K E V T R V F I R S L D Q R D L L L S - - R R S Q . . . . . . 1285 GAGAGTTCACTTGGAAAACATTTGGCAAGCTGCTTATACAGCTGGGGTGGTTCTTCCAAC E S S L G K H L A S C L Y S W G G S S N R V H L E N I W Q A A Y T A G V V L P T G E F T W K T F G K L L I Q L G W F F Q . . . . . . 1345 ACCTGTATCCACATGTCAATATTGGCATAGATCTTTGAATCCAAAGAAGCTTATCGATGT T C I H M S I L A - I F E S K E A Y R C P V S T C Q Y W H R S L N P K K L I D V H L Y P H V N I G I D L - I Q R S L S M . . . . . . 1405 CGGATTTTCTAGGCTTGGTGCGAGGATGACAATGAGCCGCACAATAAAGCTTTACAAGTT R I F - A W C E D D N E P H N K A L Q V G F S R L G A R M T M S R T I K L Y K L S D F L G L V R G - Q - A A Q - S F T S . . . . . . 1465 ACCTGATCAGACCGTCACACCTGGGTTCAGAAAGATGGAGCTCCATGATGTTCCTGCTGT T - S D R H T W V Q K D G A P - C S C C P D Q T V T P G F R K M E L H D V P A V Y L I R P S H L G S E R W S S M M F L L . . . . . . 1525 TACTCGATTGCTTAGGGATTACTTGAAGCACTTTGTGGTTGCACCTGATTTTGATGAGAA Y S I A - G L L E A L C G C T - F - - E T R L L R D Y L K H F V V A P D F D E N L L D C L G I T - S T L W L H L I L M R . . . . . . 1585 TGACGTAGAGCACTGGCTTCTGCCAAAGGAGGGTGTTGTTGACAGCTATCTGGTTGAAAG - R R A L A S A K G G C C - Q L S G - K D V E H W L L P K E G V V D S Y L V E S M T - S T G F C Q R R V L L T A I W L K . . . . . . 1645 CCCCGAGTCTCATGAAATCACTGACTTCTGCAGTTTTTACACCCTCCCTTCATCAATTCT P R V S - N H - L L Q F L H P P F I N S P E S H E I T D F C S F Y T L P S S I L A P S L M K S L T S A V F T P S L H Q F . . . . . . 1705 TGGTAATCAGAATTATTCCACTCTGAAGGCTGCTTATTCTTATTACAACGTTTCAACTAA W - S E L F H S E G C L F L L Q R F N - G N Q N Y S T L K A A Y S Y Y N V S T K L V I R I I P L - R L L I L I T T F Q L . . . . . . 1765 AACTCCATGGATTCAGTTGATGAATGATGCTCTCATAGTGGCTAAGAAAAAGGATTTTGA N S M D S V D E - C S H S G - E K G F - T P W I Q L M N D A L I V A K K K D F D K L H G F S - - M M L S - W L R K R I L . . . . . . 1825 CGTTTTCAATGCTCTAGATGTTATGCATAACGAAACTTTCTTGAAGGAACTAAAGTTTGG R F Q C S R C Y A - R N F L E G T K V W V F N A L D V M H N E T F L K E L K F G T F S M L - M L C I T K L S - R N - S L . . . . . . 1885 CCCCGGTGATGGGCAACTCCACTACTATCTCTACAACTATCGAATAAAGCATGTTTTAAG P R - W A T P L L S L Q L S N K A C F K P G D G Q L H Y Y L Y N Y R I K H V L R A P V M G N S T T I S T T I E - S M F - . . . . . . 1945 ACCATCAGAACTTGGGCTTGTACTGTTGTAATTTGTGCTTGGGAGATTATTGCCTGCCTG T I R T W A C T V V I C A W E I I A C L P S E L G L V L L - F V L G R L L P A W D H Q N L G L Y C C N L C L G D Y C L P . . . . . . 2005 GGAAGCATTTTTTTTGGTGATGTGCTCGGTCTCATTTTTTAAGCCCCCTTTGCCCTGTTA G S I F F G D V L G L I F - A P F A L L E A F F L V M C S V S F F K P P L P C - G K H F F W - C A R S H F L S P L C P V . . . . . . 2065 AACTTTGTTTGCATCCATGAATTGATCAAGGCACAAGTGTGAACATAGTCTTTGCACTTA N F V C I H E L I K A Q V - T - S L H L T L F A S M N - S R H K C E H S L C T - K L C L H P - I D Q G T S V N I V F A L . . . . . . 2125 AATCGATTGTACTTTATGATGTCTGATCTTATATAATTCATCCATGTTGATCCCAAGATC N R L Y F M M S D L I - F I H V D P K I I D C T L - C L I L Y N S S M L I P R S K S I V L Y D V - S Y I I H P C - S Q D . . . . . 2185 TTAAATATTCAGAACCGTGGATAAAGAATTTTTTCTCTTTA L N I Q N R G - R I F S L - I F R T V D K E F F L F L K Y S E P W I K N F F S L Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0120H21.1-1+_PGL-1_AGS-1_PPS_1 (659 1975) (frame '2'; 1314 bp, 438 residues) 1 YSKVGKMADD SKAAENHNLT SDSNLSSESG NEVSIDSLAR KVQESLSLSK RHKFWETQPV 61 GQFKDLGDSS LPEGPIEPPT PLTEVKQEPY NLPSPYEWTT CDLDSEDMCN EVYLLLTNNY 121 VEDDENMFRF NYSKEFLRWA LHPPGYYKSW HIGVRVKTSK KLVAFITGVP AKIRARDNVV 181 IMAEINFLCV HKKLRSKRLA PVMIKEVTRR VHLENIWQAA YTAGVVLPTP VSTCQYWHRS 241 LNPKKLIDVG FSRLGARMTM SRTIKLYKLP DQTVTPGFRK MELHDVPAVT RLLRDYLKHF 301 VVAPDFDEND VEHWLLPKEG VVDSYLVESP ESHEITDFCS FYTLPSSILG NQNYSTLKAA 361 YSYYNVSTKT PWIQLMNDAL IVAKKKDFDV FNALDVMHNE TFLKELKFGP GDGQLHYYLY 421 NYRIKHVLRP SELGLVLL- ... finished at: Mon Aug 28 21:57:26 2006 ________________________________________________________________________________ Sequence 2: C06HBa0120H21.1-2, from 1 to 738, both strands analyzed. ... started at: Mon Aug 28 21:57:26 2006 EST library file: /tmp/cxgn-bacpublish-resources-ZuEZzO/lycopersicum_combined_unigene_seqs; matching gDNA +strand ... ... found all matches, elapsed seconds = 2 ... matches indexed, elapsed seconds = 2 HitsTableSize = 0 EST library file: /tmp/cxgn-bacpublish-resources-ZuEZzO/lycopersicum_combined_unigene_seqs; matching gDNA -strand ... ... found all matches, elapsed seconds = 1 ... matches indexed, elapsed seconds = 2 HitsTableSize = 0 No significant EST matches were found. ... finished at: Mon Aug 28 21:57:30 2006 ________________________________________________________________________________ Sequence 3: C06HBa0120H21.1-3, from 1 to 10330, both strands analyzed. ... started at: Mon Aug 28 21:57:30 2006 EST library file: /tmp/cxgn-bacpublish-resources-ZuEZzO/lycopersicum_combined_unigene_seqs; matching gDNA +strand ... ... found all matches, elapsed seconds = 2 ... matches indexed, elapsed seconds = 2 HitsTableSize = 0 EST library file: /tmp/cxgn-bacpublish-resources-ZuEZzO/lycopersicum_combined_unigene_seqs; matching gDNA -strand ... ... found all matches, elapsed seconds = 2 ... matches indexed, elapsed seconds = 2 HitsTableSize = 2 ******************************************************************************** EST sequence 2 +strand 2290 n (File: SGN-U322161+) 1 GAAGTTTCTG ATTGAAATAC GTATGGGCCG ACTATGAAAT ACACTGTATA TTCATTCAGT 61 TCCCTCTGAT GAATTTATCT ACAACACTAC GTACGACGCT GTATCATGGG GATCGACTAG 121 GGTGGGTGGG GACAAGAAGA TAGTATTACG AAAGCAAAAA GACAGAACTA AACGTATATA 181 TATATATATA AAATAATAGA AGGAAGAAAT TGGGGTTAAG AGATAGAATC AGGGCCAGAT 241 TATTCAAAAT GGGATGATAA TAAAGCATAC ACTTGAAGTT GATGCTGCTT CATTTTGCAC 301 CTTCTTGAGG CAAAACTCAT GAAGGAACAA AACCCCAAAA AAGAATTATT CAAAAAGCTG 361 AAATCTTGAT TTTTCTTTTT TTTGATTCAT TTTCATTCTA TGTTGGATTG AATTTTTCAG 421 GGTTTGTGTG GTGAAGTTTT TTTTTTTTTT CAAATTTAAA TTTTTAGTCT CCGATGGAGC 481 CTCGTGTTGG TAATAAGTTC AGGCTTGGCC GGAAAATTGG TAGCGGTTCT TTTGGGGAGA 541 TCTACCTCGG TGGTAATGTT CAAACTAATG AAGAGGTTGC TATCAAGCTG GAAAATGTGA 601 AAACAAAGCA TCCTCAACTG TTGTACGAAG CAAAGCTGTA TAAAATACTA CAAGGAGGAA 661 CTGGAGTCCC CAATTTAAAA TGGTTTGGAG TTGAAGGAGA TTACAATGTC CTTGTGATGG 721 ATTTACTGGG ACCTAGTCTT GAAGATCTTT TCAACTTCTG CAATAGGAAA ATGTCCTTGA 781 AGACCGTTCT CATGCTTGCA GATCAGATGA TCAATCGGGT TGAGTTTGTT CATGCCAAAT 841 CTTTTCTTCA TCGAGATATA AAACCTGACA ACTTTCTTAT GGGATTAGGA AGACGTGCAA 901 ATCAGGTCTA TGTTATTGAT TTTGGGCTGG CTAAGAAGTA CAGAGACTCA TCAACTCATC 961 AGCATATTCC GTATAGAGAA AATAAAAATT TGACAGGAAC TGCTAGATAT GCAAGCATGA 1021 ACACTCACCT CGGCATTGAA CAAAGTCGAA GGGATGATTT GGAATCATTG GGTTTTGTTC 1081 TGATGTACTT TTTAAGAGGA AGTCTCCCTT GGCAGGGGCT GAAAGCAGGC AATAAGAAAC 1141 AGAAGTATGA GAGGATCAGT GAGAAGAAAG TTTCAACGTC AATAGAGACC TTGTGTCGAG 1201 GCTATCCTGC AGAGTTTGCA TCATATTTTC ATTACTGTCG ATCACTGAGA TTTGATGATA 1261 AACCAGATTA TGCTTATCTC AAGAGAATTT TCCGTGACCT TTTCATTCGT GAAGGGTTTC 1321 AGTTTGATTA TGTATTTGAC TGGACCATAT TGAAATATCA GCAATCACAG CTTGCCAATA 1381 TTCCATCTCG TGCTCTTGGC GGTACTGCTG GGCCAAGTTC AGGGACGCCT CATGGTCTTG 1441 CTAATGTCGA AAAGAAATCA GGTGGTGAAG AAGGACGGCC AACTGGTTGG TCTTCATCAA 1501 ATCTGACACG TAATAGGAGC ACAGGGCTCA ATTTCAATGC TGGAAGCTTA TTGAAGCAAA 1561 AAGGCACAGT TGCTAATGAT TTATCTGTGG GTAAAGAGTT ACCTAGTTCT AATTTTTTCC 1621 GGTCAAGTGG ATCAGCAAGG CAACCTAATG TCTCTAGCAG TCGAGACCCA GTGATTACTG 1681 GGGGTGAACC TGACCTCTCC CGCCCTCAGA CACTAGATGC AGCAGGCGCA GCATCACTGC 1741 GTAAAATATT TAATACTACC CAGAAGACTT CACCAGTTGT GTCTTCAGAG CACAAGCGCA 1801 GCTCCTCCAC AAGAAACACA AACCTAAAGA ATTTAGAGTC TGCCATCAAA GGAATAGAGG 1861 GTTTAAGCTT TCGATGATGA GGTACTGGAT TAGTAGCTCT GCTTTGTCAC AATTCCCCCT 1921 CTACTGTATA TCTTGGCACA GCAAACACAC CAACATGGCG GAGTATGAGT TCTGATATTA 1981 GTTGTTTCCA GGAGGAACCA TAAACAATGC AACCCCCGCA AACTCACAAA TCCCAGTTTA 2041 TGTTTTGTCC ATACAGACAC AGTTATAGGC ACTTATCGTA TTCTTTTTCG TCTCTATCTC 2101 TCCTGTTCTT GTTCTATCGT GTTATTCATA TTTATCTTAT GTTGTGAATT ATGAAGAGGC 2161 CCATATATAA TTGCCGATTT ATATGGTCCA CGAGATGTTT GACGCTAGCA GATTTTTTTG 2221 CTTTGGACAT GAGCAAACTC TCTTTTGTTT CAAGCTATTT AAATATCAAT CAAAAAAAAA 2281 AAAAAAAAAA Predicted gene structure (within gDNA segment 8874 to 263): Exon 1 8274 7726 ( 549 n); cDNA 1 549 ( 549 n); score: 1.000 Intron 1 7725 7627 ( 99 n); Pd: 0.995 (s: 1.00), Pa: 0.990 (s: 1.00) Exon 2 7626 7586 ( 41 n); cDNA 550 590 ( 41 n); score: 1.000 Intron 2 7585 6591 ( 995 n); Pd: 1.000 (s: 1.00), Pa: 0.996 (s: 1.00) Exon 3 6590 6521 ( 70 n); cDNA 591 660 ( 70 n); score: 1.000 Intron 3 6520 6439 ( 82 n); Pd: 0.919 (s: 1.00), Pa: 0.951 (s: 1.00) Exon 4 6438 6290 ( 149 n); cDNA 661 809 ( 149 n); score: 1.000 Intron 4 6289 6216 ( 74 n); Pd: 0.949 (s: 1.00), Pa: 0.931 (s: 1.00) Exon 5 6215 6120 ( 96 n); cDNA 810 905 ( 96 n); score: 1.000 Intron 5 6119 6036 ( 84 n); Pd: 0.880 (s: 1.00), Pa: 0.920 (s: 1.00) Exon 6 6035 5965 ( 71 n); cDNA 906 976 ( 71 n); score: 1.000 Intron 6 5964 4642 (1323 n); Pd: 0.231 (s: 1.00), Pa: 0.988 (s: 1.00) Exon 7 4641 4580 ( 62 n); cDNA 977 1038 ( 62 n); score: 1.000 Intron 7 4579 4498 ( 82 n); Pd: 1.000 (s: 1.00), Pa: 0.940 (s: 1.00) Exon 8 4497 4434 ( 64 n); cDNA 1039 1102 ( 64 n); score: 1.000 Intron 8 4433 4342 ( 92 n); Pd: 0.999 (s: 1.00), Pa: 0.987 (s: 1.00) Exon 9 4341 4257 ( 85 n); cDNA 1103 1187 ( 85 n); score: 1.000 Intron 9 4256 4173 ( 84 n); Pd: 0.995 (s: 1.00), Pa: 0.000 (s: 1.00) Exon 10 4172 4046 ( 127 n); cDNA 1188 1314 ( 127 n); score: 1.000 Intron 10 4045 3907 ( 139 n); Pd: 0.998 (s: 1.00), Pa: 0.614 (s: 1.00) Exon 11 3906 3824 ( 83 n); cDNA 1315 1397 ( 83 n); score: 1.000 Intron 11 3823 3751 ( 73 n); Pd: 0.974 (s: 1.00), Pa: 0.995 (s: 1.00) Exon 12 3750 3687 ( 64 n); cDNA 1398 1461 ( 64 n); score: 1.000 Intron 12 3686 3385 ( 302 n); Pd: 0.862 (s: 1.00), Pa: 0.990 (s: 1.00) Exon 13 3384 3248 ( 137 n); cDNA 1462 1598 ( 137 n); score: 1.000 Intron 13 3247 1727 (1521 n); Pd: 0.998 (s: 1.00), Pa: 0.880 (s: 1.00) Exon 14 1726 1053 ( 674 n); cDNA 1599 2272 ( 674 n); score: 0.997 PPA cDNA 2273 2290 MATCH C06HBa0120H21.1-3- SGN-U322161+ 0.999 2272 0.992 C PGS_C06HBa0120H21.1-3-_SGN-U322161+ (8274 7726,7626 7586,6590 6521,6438 6290,6215 6120,6035 5965,4641 4580,4497 4434,4341 4257,4172 4046,3906 3824,3750 3687,3384 3248,1726 1053) Alignment (genomic DNA sequence = upper lines): GAAGTTTCTG ATTGAAATAC GTATGGGCCG ACTATGAAAT ACACTGTATA TTCATTCAGT 8215 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAAGTTTCTG ATTGAAATAC GTATGGGCCG ACTATGAAAT ACACTGTATA TTCATTCAGT 60 TCCCTCTGAT GAATTTATCT ACAACACTAC GTACGACGCT GTATCATGGG GATCGACTAG 8155 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCCCTCTGAT GAATTTATCT ACAACACTAC GTACGACGCT GTATCATGGG GATCGACTAG 120 GGTGGGTGGG GACAAGAAGA TAGTATTACG AAAGCAAAAA GACAGAACTA AACGTATATA 8095 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGTGGGTGGG GACAAGAAGA TAGTATTACG AAAGCAAAAA GACAGAACTA AACGTATATA 180 TATATATATA AAATAATAGA AGGAAGAAAT TGGGGTTAAG AGATAGAATC AGGGCCAGAT 8035 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TATATATATA AAATAATAGA AGGAAGAAAT TGGGGTTAAG AGATAGAATC AGGGCCAGAT 240 TATTCAAAAT GGGATGATAA TAAAGCATAC ACTTGAAGTT GATGCTGCTT CATTTTGCAC 7975 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TATTCAAAAT GGGATGATAA TAAAGCATAC ACTTGAAGTT GATGCTGCTT CATTTTGCAC 300 CTTCTTGAGG CAAAACTCAT GAAGGAACAA AACCCCAAAA AAGAATTATT CAAAAAGCTG 7915 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTCTTGAGG CAAAACTCAT GAAGGAACAA AACCCCAAAA AAGAATTATT CAAAAAGCTG 360 AAATCTTGAT TTTTCTTTTT TTTGATTCAT TTTCATTCTA TGTTGGATTG AATTTTTCAG 7855 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAATCTTGAT TTTTCTTTTT TTTGATTCAT TTTCATTCTA TGTTGGATTG AATTTTTCAG 420 GGTTTGTGTG GTGAAGTTTT TTTTTTTTTT CAAATTTAAA TTTTTAGTCT CCGATGGAGC 7795 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGTTTGTGTG GTGAAGTTTT TTTTTTTTTT CAAATTTAAA TTTTTAGTCT CCGATGGAGC 480 CTCGTGTTGG TAATAAGTTC AGGCTTGGCC GGAAAATTGG TAGCGGTTCT TTTGGGGAGA 7735 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTCGTGTTGG TAATAAGTTC AGGCTTGGCC GGAAAATTGG TAGCGGTTCT TTTGGGGAGA 540 TCTACCTCGG TATTTGAACC CTTCATGTTT TTTATTTTTT TGCCTTTCAG GTTTTGTTTT 7675 ||||||||| TCTACCTCG. .......... .......... .......... .......... .......... 549 TATTTTTAGG GTTAATTTAA TTGGTGATTT GTTTATATGT TCTTCAAGGT GGTAATGTTC 7615 || |||||||||| .......... .......... .......... .......... ........GT GGTAATGTTC 561 AAACTAATGA AGAGGTTGCT ATCAAGCTGG TAAGTTGAAA TTATCTCTCT GTGTGTGTTT 7555 |||||||||| |||||||||| ||||||||| AAACTAATGA AGAGGTTGCT ATCAAGCTG. .......... .......... .......... 590 TTATTTGTCT AAACTTGTTA TAGGAGTATT GATATGTTTT TTAATTGCTA AAGCATATGG 7495 .......... .......... .......... .......... .......... .......... 590 TTTTGCTATG AAAAAACTTG AGAATTTGTC ATTGTTTTGA TGATGGATGG TTTTTGGTTA 7435 .......... .......... .......... .......... .......... .......... 590 GCAATTAAAA CACAATTATA TGCATGTGTT ATTATCCGTG TGTTTAGTTG CTAATTATTA 7375 .......... .......... .......... .......... .......... .......... 590 TTAGGGTGGA AGGGGGTAGA TTAAGAACAA CTAGATAGAT ACCAGTGTAA TTCTATAAGC 7315 .......... .......... .......... .......... .......... .......... 590 GAGGTTTGGG CAGGGTAGGA TGTATGTAGT CCTTATCCCC TACCTTTTGC CGGGAGAGGA 7255 .......... .......... .......... .......... .......... .......... 590 TTAAACTCAA TCTTTTATAG AGAACACGTT TAGTTATATT TTATCATGCC TTCTCCGTTG 7195 .......... .......... .......... .......... .......... .......... 590 CTAGTGTGGT CCTTTTCCAT CGCGAAGTAC TTGTAGCTGT CTTTTGATGA ATGTCAGAGT 7135 .......... .......... .......... .......... .......... .......... 590 AACATTGGAA TCCAGGATCT TTGGTTGTTC TAATACCAAG TTAATATGAG GATGCAACAC 7075 .......... .......... .......... .......... .......... .......... 590 TAAACTTGAG ATAATAAGTG GAGTCACCCA AAACCGTTAT AAACACCCCT GCATGTTGAA 7015 .......... .......... .......... .......... .......... .......... 590 GACATAATTA AGCAAATAAA CTTGTGCCCG CCCTCTGATT GCTTAGATTA TTAGATTAAA 6955 .......... .......... .......... .......... .......... .......... 590 AAGGCTCTCA CACTCAATAG GGTTTTGACC ACTTGACCAT TTGGTCTTAT GGATCACAGT 6895 .......... .......... .......... .......... .......... .......... 590 AGAGGAAATG TAACCGTTAT TACCTAGCCT ACAATTTGAG ACTCCAAGTA GTGAATTTGC 6835 .......... .......... .......... .......... .......... .......... 590 ATTAACTAAA GCAAATTTGT GGTTTTGAAG AAATTGCAAC AGTTTCACTG CTTACATTTG 6775 .......... .......... .......... .......... .......... .......... 590 TCTTTCTAGA ATTTTTTATA TAAAACTTGA TATCTATTGT AAACTGAAGT TTTTGCTGAT 6715 .......... .......... .......... .......... .......... .......... 590 ACTCATGATA TCATTCTACT TCTGCAAATG CTTAGTCATT CGTGAACATC CCTTCAATTA 6655 .......... .......... .......... .......... .......... .......... 590 TGTCATAGAT GATTTTTTTA GTTTGCAAGA TTAAGTTGCT GATTTTCTTA TTTCTTGTAC 6595 .......... .......... .......... .......... .......... .......... 590 TCAGGAAAAT GTGAAAACAA AGCATCCTCA ACTGTTGTAC GAAGCAAAGC TGTATAAAAT 6535 |||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ....GAAAAT GTGAAAACAA AGCATCCTCA ACTGTTGTAC GAAGCAAAGC TGTATAAAAT 646 ACTACAAGGA GGAAGTAATT TCTCATCTTC ATGTTGCCTG TTCTCTTCAT TAACTACTCA 6475 |||||||||| |||| ACTACAAGGA GGAA...... .......... .......... .......... .......... 660 ATGTTATAAG TGGTATTTAT CTTTTTATAT TGGTAGCTGG AGTCCCCAAT TTAAAATGGT 6415 |||| |||||||||| |||||||||| .......... .......... .......... ......CTGG AGTCCCCAAT TTAAAATGGT 684 TTGGAGTTGA AGGAGATTAC AATGTCCTTG TGATGGATTT ACTGGGACCT AGTCTTGAAG 6355 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGGAGTTGA AGGAGATTAC AATGTCCTTG TGATGGATTT ACTGGGACCT AGTCTTGAAG 744 ATCTTTTCAA CTTCTGCAAT AGGAAAATGT CCTTGAAGAC CGTTCTCATG CTTGCAGATC 6295 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATCTTTTCAA CTTCTGCAAT AGGAAAATGT CCTTGAAGAC CGTTCTCATG CTTGCAGATC 804 AGATGGTGTG TGATGTTTTG TACCACTTAT TGATTCCAAT TACAATTTAA AATCTCTTCT 6235 ||||| AGATG..... .......... .......... .......... .......... .......... 809 CACCCTTATC AATTTTCAGA TCAATCGGGT TGAGTTTGTT CATGCCAAAT CTTTTCTTCA 6175 | |||||||||| |||||||||| |||||||||| |||||||||| .......... .........A TCAATCGGGT TGAGTTTGTT CATGCCAAAT CTTTTCTTCA 850 TCGAGATATA AAACCTGACA ACTTTCTTAT GGGATTAGGA AGACGTGCAA ATCAGGTATT 6115 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||| TCGAGATATA AAACCTGACA ACTTTCTTAT GGGATTAGGA AGACGTGCAA ATCAG..... 905 GGGATTAAAG CTCTTACTTT GCTGCTCAAT TCACCTATTG CAGGCATGGT ATCTTCATAA 6055 .......... .......... .......... .......... .......... .......... 905 CCACGATTTA TTAATATAGG TCTATGTTAT TGATTTTGGG CTGGCTAAGA AGTACAGAGA 5995 | |||||||||| |||||||||| |||||||||| |||||||||| .......... .........G TCTATGTTAT TGATTTTGGG CTGGCTAAGA AGTACAGAGA 946 CTCATCAACT CATCAGCATA TTCCGTATAG GTCAGCTCCC TTGAAGCACT ATGAGGTTGA 5935 |||||||||| |||||||||| |||||||||| CTCATCAACT CATCAGCATA TTCCGTATAG .......... .......... .......... 976 TATCTGTTTT GTTTAAGAAG ATGAAGTACT CGAAATATGG TGTTCATGCA TATGATATGC 5875 .......... .......... .......... .......... .......... .......... 976 CTATAGGAGC TTTGTGTTGT GGTCTGTTTG CTCTTTCGCT ATAGAGCAAT AGAAAAGAGT 5815 .......... .......... .......... .......... .......... .......... 976 GAATTCTTTC AGTTTGCATC ATTTTTAAAC TCCTATGAAA TTATCCCCTC ACTTTTTTTA 5755 .......... .......... .......... .......... .......... .......... 976 ATAATTGTGG ACTTGTGGTT TCTGAGAAAA GCTTGCACAC ACCGAGTAAT CCCATAGAGT 5695 .......... .......... .......... .......... .......... .......... 976 ACTTTCTACC TCCCAACAGC ACAGAGCACC AGGTAACTTT GTCCACCAAG GCATAGACAG 5635 .......... .......... .......... .......... .......... .......... 976 ATGAGAAGAA ATTACCTAGA CTTAGTCTTT TGATCATGTA GTCCAAAAGC ATGTGTCAGT 5575 .......... .......... .......... .......... .......... .......... 976 ATAGGTGCTT CCTTATTTTG TGGGGGGTTT TCTTTATGAA TTAACAGGAT ATAAGAAGAG 5515 .......... .......... .......... .......... .......... .......... 976 TAGAAACACA CATTTTTATC TGTATTGCAT GTAAGATAAG TCATCAAAGC TATTTAATGA 5455 .......... .......... .......... .......... .......... .......... 976 CTTCACAAAG GTATCAGAAT GCATGAACTA GCAAATTTAA TAAACTAACT TAATCTCTGT 5395 .......... .......... .......... .......... .......... .......... 976 TCGGGAATAA TCATATCCAC TCCTTTTCAC TGTTACTACT TTCATATTGT ACACCTTCTC 5335 .......... .......... .......... .......... .......... .......... 976 ACTACTAACT TCTAGTTCGT TAATAACTAT CCCTTCAATA TGATCAAACT GTTTTTTGGT 5275 .......... .......... .......... .......... .......... .......... 976 TTGCCTGGCA CTGCTGGTCA AGACTTAAGA GGAACGAAGC GACAACTGGG TAGTTTAAAC 5215 .......... .......... .......... .......... .......... .......... 976 AAGAAATATT TTATCACTTC GTAAAAGACT GAAAAATTTA CGGATAGGGA GAAGAAACTG 5155 .......... .......... .......... .......... .......... .......... 976 ATATCCTTCT TAAATTTTTT CTTTAGTTAA AGCCTTAATA TCTTAACCCT TTTCATTAAT 5095 .......... .......... .......... .......... .......... .......... 976 AACTGTGATT AGGTTATGGC ATTGGAAAAT GATTATCGTT CTAGTTTCCT TCGACTGGGA 5035 .......... .......... .......... .......... .......... .......... 976 CAGGGAAATA TCAGTAAATG ATATGAAACC AGATCCTGTT GATCTTTCCC AACTGTTTTA 4975 .......... .......... .......... .......... .......... .......... 976 TATTTTGGTC ATTCATCATA CATTTAACTA ATAGAAAGAC TTTTTGGCAT CAAAGATGCA 4915 .......... .......... .......... .......... .......... .......... 976 GTGCCATCAT GCTTACCATA GAGGAAGTGA TCTCTCTAAT TTATAGAGCT TGTGCAATTC 4855 .......... .......... .......... .......... .......... .......... 976 ACCGAAATTT TAACAGTGAA TCTGGAGATT TTCTGTATTT TGTAAATTAC AGTTGGTCAT 4795 .......... .......... .......... .......... .......... .......... 976 TTGTATTTCG AGCTTGTGAA GAAGTATATT TGTATGTTGA ATCATTGATT AATCACATTG 4735 .......... .......... .......... .......... .......... .......... 976 GATAATTGTA CTCTTTATTA GTCAATTATC TTACAATGAG ATCCTCCTCC AGGGCATGGA 4675 .......... .......... .......... .......... .......... .......... 976 TGTATGATTT ATTTTCTTTT TAATGTGATA CAGAGAAAAT AAAAATTTGA CAGGAACTGC 4615 ||||||| |||||||||| |||||||||| .......... .......... .......... ...AGAAAAT AAAAATTTGA CAGGAACTGC 1003 TAGATATGCA AGCATGAACA CTCACCTCGG CATTGGTGAG CTTTTCTTTT ATTTTTCCAA 4555 |||||||||| |||||||||| |||||||||| ||||| TAGATATGCA AGCATGAACA CTCACCTCGG CATTG..... .......... .......... 1038 TTTGTTCCCA ATTGTTTATA TGGACTGTTT ACTCTGTGTT TTCTATTGAC CATGTAGAAC 4495 ||| .......... .......... .......... .......... .......... .......AAC 1041 AAAGTCGAAG GGATGATTTG GAATCATTGG GTTTTGTTCT GATGTACTTT TTAAGAGGAA 4435 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAGTCGAAG GGATGATTTG GAATCATTGG GTTTTGTTCT GATGTACTTT TTAAGAGGAA 1101 GGTAAATATT TCTCCTTCCC TACATTTTCT TTTATTTAGT GTCTTTACTT CTCTTTCATG 4375 | G......... .......... .......... .......... .......... .......... 1102 GTAATCTTAC ATTATCCTTT CAAACACATG TAGTCTCCCT TGGCAGGGGC TGAAAGCAGG 4315 ||||||| |||||||||| |||||||||| .......... .......... .......... ...TCTCCCT TGGCAGGGGC TGAAAGCAGG 1129 CAATAAGAAA CAGAAGTATG AGAGGATCAG TGAGAAGAAA GTTTCAACGT CAATAGAGGT 4255 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||| CAATAAGAAA CAGAAGTATG AGAGGATCAG TGAGAAGAAA GTTTCAACGT CAATAGAG.. 1187 AATTGATGCT CCTCCATTCT TCATATGCTA GCTATTTAGT TTTTTATTCA CTAAATTGAT 4195 .......... .......... .......... .......... .......... .......... 1187 ACTAGTGCAA ATTTTCCTAT AGACCTTGTG TCGAGGCTAT CCTGCAGAGT TTGCATCATA 4135 |||||||| |||||||||| |||||||||| |||||||||| .......... .......... ..ACCTTGTG TCGAGGCTAT CCTGCAGAGT TTGCATCATA 1225 TTTTCATTAC TGTCGATCAC TGAGATTTGA TGATAAACCA GATTATGCTT ATCTCAAGAG 4075 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTTCATTAC TGTCGATCAC TGAGATTTGA TGATAAACCA GATTATGCTT ATCTCAAGAG 1285 AATTTTCCGT GACCTTTTCA TTCGTGAAGG TATGATCATT TTGCTTTCCA TGCCTTCTCT 4015 |||||||||| |||||||||| ||||||||| AATTTTCCGT GACCTTTTCA TTCGTGAAG. .......... .......... .......... 1314 ATTTCATCTG TACATAGATG TTACAACAAG TACCTCAATG ACAATGTTCA CAGGATGACT 3955 .......... .......... .......... .......... .......... .......... 1314 TCCATTTTCC TTCCCTTTAT TCTTTTTTCT TTTTCCCATC TATTTCAGGG TTTCAGTTTG 3895 || |||||||||| .......... .......... .......... .......... ........GG TTTCAGTTTG 1326 ATTATGTATT TGACTGGACC ATATTGAAAT ATCAGCAATC ACAGCTTGCC AATATTCCAT 3835 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTATGTATT TGACTGGACC ATATTGAAAT ATCAGCAATC ACAGCTTGCC AATATTCCAT 1386 CTCGTGCTCT TGTAAGTGAC TTAAATATAT TAGCTTATGG CATTGGTCTC AAAATATAAA 3775 |||||||||| | CTCGTGCTCT T......... .......... .......... .......... .......... 1397 AATGAAATGT TGATTTTGAA ACAGGGCGGT ACTGCTGGGC CAAGTTCAGG GACGCCTCAT 3715 |||||| |||||||||| |||||||||| |||||||||| .......... .......... ....GGCGGT ACTGCTGGGC CAAGTTCAGG GACGCCTCAT 1433 GGTCTTGCTA ATGTCGAAAA GAAATCAGGT ATGTTTGGTC TCGATATCCT CCTCCCTCAA 3655 |||||||||| |||||||||| |||||||| GGTCTTGCTA ATGTCGAAAA GAAATCAG.. .......... .......... .......... 1461 CCACAAAAAA GAAGGAAAGA TTAACAAGAA AAAGTTAATT AGAAGACTTA GCAATCAGTC 3595 .......... .......... .......... .......... .......... .......... 1461 TGTGCTAACT AGTGTGGCAT AAAGTGGTTC TATGTATCTT GTCTCTTATG TATACGTGAA 3535 .......... .......... .......... .......... .......... .......... 1461 AAGTTACAAT TGGAATCCAT TTCAATTGGT CCATTATGTC ATTTATTTTG GATGGCATGC 3475 .......... .......... .......... .......... .......... .......... 1461 CAGATGTTCT GTTCAGGAAA GTACATTACC TCTAAATATG CTAGCTTCTT ATTCTCTGGG 3415 .......... .......... .......... .......... .......... .......... 1461 AGCTATATAT GCTTGCTACT CAATATGCAG GTGGTGAAGA AGGACGGCCA ACTGGTTGGT 3355 |||||||||| |||||||||| |||||||||| .......... .......... .......... GTGGTGAAGA AGGACGGCCA ACTGGTTGGT 1491 CTTCATCAAA TCTGACACGT AATAGGAGCA CAGGGCTCAA TTTCAATGCT GGAAGCTTAT 3295 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTCATCAAA TCTGACACGT AATAGGAGCA CAGGGCTCAA TTTCAATGCT GGAAGCTTAT 1551 TGAAGCAAAA AGGCACAGTT GCTAATGATT TATCTGTGGG TAAAGAGGTT AGTGCTCTTC 3235 |||||||||| |||||||||| |||||||||| |||||||||| ||||||| TGAAGCAAAA AGGCACAGTT GCTAATGATT TATCTGTGGG TAAAGAG... .......... 1598 TTCCCCCCTC TCCCTATTGG TTATACCCTT TCTCCTGGTT GGGATGCGTT TGGGGAAGTG 3175 .......... .......... .......... .......... .......... .......... 1598 TAGTACTATT GCATATAAAT TCTGTGGCAA CTTTACTTGT TACAAAAACG ATGTGAGTGC 3115 .......... .......... .......... .......... .......... .......... 1598 ATTTGACTTT ATTCAGATAT AGGACAAAAG TTCTTTTGCA ATCTTGATTA GTCAAGTACC 3055 .......... .......... .......... .......... .......... .......... 1598 TCGGCATGAG GAATTAGATG CAGTTTCCAA AGAAGAAGTA AAAAGTTCCT GAACTTGTGT 2995 .......... .......... .......... .......... .......... .......... 1598 GGCAATTTCA ATTTACGGAG CAATTCACCA AGTAGAGAGG ATAAAGTGTG CAATCTCTTT 2935 .......... .......... .......... .......... .......... .......... 1598 GAATAAAATT TTCTTGAATC AATTTTACAT TTTATGTTAT CCTGAAAATA ATTTGATGGA 2875 .......... .......... .......... .......... .......... .......... 1598 TAGCAGGTAG GCTGCATTTC TCCTTTCTCA AAAAAAAAAA AAACGAAGAA GAAAAAGAAC 2815 .......... .......... .......... .......... .......... .......... 1598 AAGAAAAAAA GGGTAGGCTG CAACTCTTTC TGGTGCTCTA GGCTTTTATT TTGGGTGAGG 2755 .......... .......... .......... .......... .......... .......... 1598 GGATACAGAA TAGATGTGTA AGGATGGAAG TGGAATAATG AGAGGATGAA ATGTTGTGCA 2695 .......... .......... .......... .......... .......... .......... 1598 GCGAGCTCTC ACGTGGGTAC TTTGGAGGGA CAGAAACAAG AGATTTTGGA AGCAAAGGAA 2635 .......... .......... .......... .......... .......... .......... 1598 GATGACTTCA TAAAGCTTAG GGTTTTTCTT TCATTTTGCT CAACTTTTGG ATTGCCCACG 2575 .......... .......... .......... .......... .......... .......... 1598 GAGTTCCTAC ATAGATGATT GAGAGTCTTT AATAAAGAAC CATGTTTTAT GAACCTATTC 2515 .......... .......... .......... .......... .......... .......... 1598 TGCTTATGGA TTATCTGTCA CACACTGTTC CCAAGAACTG CCAAAGTCAA AACAGCAAAA 2455 .......... .......... .......... .......... .......... .......... 1598 TTCTTGGATC ATATCAATGG GTGCACAACT GCATGTTTCT CTGGACCATC ATAGTGTACT 2395 .......... .......... .......... .......... .......... .......... 1598 TGCACATATC TATTCAGAGG AAGGTCTTTT CATAATGGAT GATCCTCAAA ACTATAATGT 2335 .......... .......... .......... .......... .......... .......... 1598 AGATGATTGA TGGAAGAAAT ATGAAACATA TCCAAAATTC TCGCTGATGG AAATAGACCC 2275 .......... .......... .......... .......... .......... .......... 1598 GACACAGATA TCTCTTGTAT ACTGGGGCGT CCTCCACAAC ATTAGCAAAA TACTATCCAC 2215 .......... .......... .......... .......... .......... .......... 1598 AAAAGGAGTG GAAGTGGGAG TAAAATAGAA GTAGCTAAAA GGCATTGAAC CCCCACCCAA 2155 .......... .......... .......... .......... .......... .......... 1598 AATAAACACC CTTCTCTCCA TCTGCTGCCT TTTGAATTAC CTTCACCGAG TGGCACTGAT 2095 .......... .......... .......... .......... .......... .......... 1598 GATTACATAT AGGTTAAGAG GGCCAATAAG GTGGTGTTGG TTCGTCTATA TGACAGAAAT 2035 .......... .......... .......... .......... .......... .......... 1598 GGATGGTGAC TGCATAATTA GGGGTGGCAT TTATCTTTTT CTCTTCTAAC ATTTCCTTTC 1975 .......... .......... .......... .......... .......... .......... 1598 CTCTATTACT TCCACATATT TGTGCTTGCC CTTCATGGAA CATGAGAATA ATTTGTATTC 1915 .......... .......... .......... .......... .......... .......... 1598 ATTTGGTAGT TCTTGAATAT AATGGCCAAA CTGAGAGCAA ATTGGTGGTA TACACTAAGA 1855 .......... .......... .......... .......... .......... .......... 1598 AACAGAATTT AGTACAACTA AGGATCTATT ATGCTTATAC TTTGGGGAAT GTGGTTGAAA 1795 .......... .......... .......... .......... .......... .......... 1598 TTTACTCTTG TTTTATGACC TATATCTACC AGTAAACACT ACAGACTTTG ACTGAGTTTG 1735 .......... .......... .......... .......... .......... .......... 1598 TTTTGCAGTT ACCTAGTTCT AATTTTTTCC GGTCAAGTGG ATCAGCAAGG CAACCTAATG 1675 || |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ........TT ACCTAGTTCT AATTTTTTCC GGTCAAGTGG ATCAGCAAGG CAACCTAATG 1650 TCTCTAGCAG TCGAGACCCA GTGATTACTG GGGGTGAACC TGACCTCTCC CGCCCTCAGA 1615 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCTCTAGCAG TCGAGACCCA GTGATTACTG GGGGTGAACC TGACCTCTCC CGCCCTCAGA 1710 CACTAGATGC AGCAGGCGCA GCATCACTGC GTAAAATATT TAATACTACC CAGAAGACTT 1555 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CACTAGATGC AGCAGGCGCA GCATCACTGC GTAAAATATT TAATACTACC CAGAAGACTT 1770 CACCAGTTGT GTCTTCAGAG CACAAGCGCA GCTCCTCCAC AAGAAACACA AACCTAAAGA 1495 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CACCAGTTGT GTCTTCAGAG CACAAGCGCA GCTCCTCCAC AAGAAACACA AACCTAAAGA 1830 ATTTAGAGTC TGCCATCAAA GGAATAGAGG GTTTAAGCTT TCGATGATGA GGTACTGGAT 1435 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTTAGAGTC TGCCATCAAA GGAATAGAGG GTTTAAGCTT TCGATGATGA GGTACTGGAT 1890 TAGTAGCTCT GCTTTGTCAC AATTCCCCCT CTACTGTATA TCTTGGCACA GCAAACACAC 1375 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TAGTAGCTCT GCTTTGTCAC AATTCCCCCT CTACTGTATA TCTTGGCACA GCAAACACAC 1950 CAACATGGCG GAGTATGAGT TCTGATATTA GTTGTTTCCA GGAGGAACCA TAAACAATGC 1315 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAACATGGCG GAGTATGAGT TCTGATATTA GTTGTTTCCA GGAGGAACCA TAAACAATGC 2010 AACCCCCGCA AACTCACAAA TCCCAGTTTA TGTTTTGTCC ATACAGACAC AGTTATAGGC 1255 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AACCCCCGCA AACTCACAAA TCCCAGTTTA TGTTTTGTCC ATACAGACAC AGTTATAGGC 2070 ACTTATCGTA TTCTTTTTCG TCTCTATCTC TCCTGTTCTT GTTCTATCGT GTTATTCATA 1195 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACTTATCGTA TTCTTTTTCG TCTCTATCTC TCCTGTTCTT GTTCTATCGT GTTATTCATA 2130 TTTATCTTAT GTTGTGAATT ATGAAGAGGC CCATATATAA TTGCCGATTT ATATGGTCCA 1135 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTATCTTAT GTTGTGAATT ATGAAGAGGC CCATATATAA TTGCCGATTT ATATGGTCCA 2190 CGAGATGTTT GACGCTAGCA GATTCTTCTG CTTTGGACAT GAGCAAACTC TCTTTTGTTT 1075 |||||||||| |||||||||| |||| || || |||||||||| |||||||||| |||||||||| CGAGATGTTT GACGCTAGCA GATTTTTTTG CTTTGGACAT GAGCAAACTC TCTTTTGTTT 2250 CAAGCTATTT AAATATCAAT CA 1053 |||||||||| |||||||||| || CAAGCTATTT AAATATCAAT CA 2272 hqPGS_C06HBa0120H21.1-3-_SGN-U322161+ (8274 7726,7626 7586,6590 6521,6438 6290,6215 6120,6035 5965,4641 4580,4497 4434,4341 4257,4172 4046,3906 3824,3750 3687,3384 3248,1726 1053) ******************************************************************************** EST sequence 1 +strand 815 n (File: SGN-U337544+) 1 GCTCCACTCG CGGTGGCGGC CGCTCTAGAA CTAGTGGATC CCCCGGGCTG CAGGGAGAAT 61 TTTCCGTGAC CTTTTCATTC GTGAAGGGTT TCAGTTTGAT TATGTATTTG ACTGGACCAT 121 ATTGAAATAT CAGCAATCAC AGCTTGCCAA TATTCCATCT CGTGCTCTTG GCGGTACTGC 181 TGGGCCAAGT TCAGGGACGC CTCATGGTCT TGCTAATGTC GAAAAGAAAT CAGGTATGTT 241 TGGTCTCGAT ATCCTCCTCC CTCAACCACA AAAAAGAAGG AAAGATTAAC AAGAAAAAGT 301 TAATTAGAAG ACTTAGCAAT CAGTCTGTGC TAACTAGTGT GGCATAAAGT GGTTCTATGT 361 ATCTTGTCTC TTATGTATAC GTGAAAAGTT ACAATTGGAA TCCATTTCAA TTGGTCCATT 421 ATGTCATTTA TTTTGGATGG CATGCCAGAT GTTCTGTTCA GGAAAGTACA TTACCTCTAA 481 ATATGCTAGC TTCTTATTCT CTGGGAGCTA TATATGCTTG CTACTCAATA TGCAGGTGGT 541 GAAGAAGGAC GGCCAACTGG TTGGTCTTCA TCAAATCTGA CACGTAATAG GAGCACAGGG 601 CTCAATTTCA ATGCTGGAAG CTTATTGAAG CAAAAAGGCA CAGTTGCTAA TGATTTATCT 661 GTGGGTAAAG AGTTACCTAA TTCTAATTTT TTTCCGTCAA AGGGATCAGC AAAGCAACCT 721 AAATGTTTTA GCAGTCCAGA CCCAGTGATT ACTTGTGGGT GAACCTGACC TTTTCCGGCC 781 TCAGAAACTA GATGCAACAA GCGCCGCATC ACTGG Predicted gene structure (within gDNA segment 5217 to 1208): Exon 1 4076 4046 ( 31 n); cDNA 56 86 ( 31 n); score: 1.000 Intron 1 4045 3907 ( 139 n); Pd: 0.998 (s: 0), Pa: 0.614 (s: 1.00) Exon 2 3906 3824 ( 83 n); cDNA 87 169 ( 83 n); score: 1.000 Intron 2 3823 3751 ( 73 n); Pd: 0.974 (s: 1.00), Pa: 0.995 (s: 1.00) Exon 3 3750 3248 ( 503 n); cDNA 170 672 ( 503 n); score: 1.000 Intron 3 3247 1727 (1521 n); Pd: 0.998 (s: 1.00), Pa: 0.880 (s: 0.83) Exon 4 1726 1586 ( 141 n); cDNA 673 814 ( 142 n); score: 0.844 MATCH C06HBa0120H21.1-3- SGN-U337544+ 0.970 758 0.930 C PGS_C06HBa0120H21.1-3-_SGN-U337544+ (4076 4046,3906 3824,3750 3248,1726 1586) Alignment (genomic DNA sequence = upper lines): AGAATTTTCC GTGACCTTTT CATTCGTGAA GGTATGATCA TTTTGCTTTC CATGCCTTCT 4017 |||||||||| |||||||||| |||||||||| | AGAATTTTCC GTGACCTTTT CATTCGTGAA G......... .......... .......... 86 CTATTTCATC TGTACATAGA TGTTACAACA AGTACCTCAA TGACAATGTT CACAGGATGA 3957 .......... .......... .......... .......... .......... .......... 86 CTTCCATTTT CCTTCCCTTT ATTCTTTTTT CTTTTTCCCA TCTATTTCAG GGTTTCAGTT 3897 |||||||||| .......... .......... .......... .......... .......... GGTTTCAGTT 96 TGATTATGTA TTTGACTGGA CCATATTGAA ATATCAGCAA TCACAGCTTG CCAATATTCC 3837 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGATTATGTA TTTGACTGGA CCATATTGAA ATATCAGCAA TCACAGCTTG CCAATATTCC 156 ATCTCGTGCT CTTGTAAGTG ACTTAAATAT ATTAGCTTAT GGCATTGGTC TCAAAATATA 3777 |||||||||| ||| ATCTCGTGCT CTT....... .......... .......... .......... .......... 169 AAAATGAAAT GTTGATTTTG AAACAGGGCG GTACTGCTGG GCCAAGTTCA GGGACGCCTC 3717 |||| |||||||||| |||||||||| |||||||||| .......... .......... ......GGCG GTACTGCTGG GCCAAGTTCA GGGACGCCTC 203 ATGGTCTTGC TAATGTCGAA AAGAAATCAG GTATGTTTGG TCTCGATATC CTCCTCCCTC 3657 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATGGTCTTGC TAATGTCGAA AAGAAATCAG GTATGTTTGG TCTCGATATC CTCCTCCCTC 263 AACCACAAAA AAGAAGGAAA GATTAACAAG AAAAAGTTAA TTAGAAGACT TAGCAATCAG 3597 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AACCACAAAA AAGAAGGAAA GATTAACAAG AAAAAGTTAA TTAGAAGACT TAGCAATCAG 323 TCTGTGCTAA CTAGTGTGGC ATAAAGTGGT TCTATGTATC TTGTCTCTTA TGTATACGTG 3537 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCTGTGCTAA CTAGTGTGGC ATAAAGTGGT TCTATGTATC TTGTCTCTTA TGTATACGTG 383 AAAAGTTACA ATTGGAATCC ATTTCAATTG GTCCATTATG TCATTTATTT TGGATGGCAT 3477 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAAGTTACA ATTGGAATCC ATTTCAATTG GTCCATTATG TCATTTATTT TGGATGGCAT 443 GCCAGATGTT CTGTTCAGGA AAGTACATTA CCTCTAAATA TGCTAGCTTC TTATTCTCTG 3417 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCCAGATGTT CTGTTCAGGA AAGTACATTA CCTCTAAATA TGCTAGCTTC TTATTCTCTG 503 GGAGCTATAT ATGCTTGCTA CTCAATATGC AGGTGGTGAA GAAGGACGGC CAACTGGTTG 3357 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGAGCTATAT ATGCTTGCTA CTCAATATGC AGGTGGTGAA GAAGGACGGC CAACTGGTTG 563 GTCTTCATCA AATCTGACAC GTAATAGGAG CACAGGGCTC AATTTCAATG CTGGAAGCTT 3297 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTCTTCATCA AATCTGACAC GTAATAGGAG CACAGGGCTC AATTTCAATG CTGGAAGCTT 623 ATTGAAGCAA AAAGGCACAG TTGCTAATGA TTTATCTGTG GGTAAAGAGG TTAGTGCTCT 3237 |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| ATTGAAGCAA AAAGGCACAG TTGCTAATGA TTTATCTGTG GGTAAAGAG. .......... 672 TCTTCCCCCC TCTCCCTATT GGTTATACCC TTTCTCCTGG TTGGGATGCG TTTGGGGAAG 3177 .......... .......... .......... .......... .......... .......... 672 TGTAGTACTA TTGCATATAA ATTCTGTGGC AACTTTACTT GTTACAAAAA CGATGTGAGT 3117 .......... .......... .......... .......... .......... .......... 672 GCATTTGACT TTATTCAGAT ATAGGACAAA AGTTCTTTTG CAATCTTGAT TAGTCAAGTA 3057 .......... .......... .......... .......... .......... .......... 672 CCTCGGCATG AGGAATTAGA TGCAGTTTCC AAAGAAGAAG TAAAAAGTTC CTGAACTTGT 2997 .......... .......... .......... .......... .......... .......... 672 GTGGCAATTT CAATTTACGG AGCAATTCAC CAAGTAGAGA GGATAAAGTG TGCAATCTCT 2937 .......... .......... .......... .......... .......... .......... 672 TTGAATAAAA TTTTCTTGAA TCAATTTTAC ATTTTATGTT ATCCTGAAAA TAATTTGATG 2877 .......... .......... .......... .......... .......... .......... 672 GATAGCAGGT AGGCTGCATT TCTCCTTTCT CAAAAAAAAA AAAAACGAAG AAGAAAAAGA 2817 .......... .......... .......... .......... .......... .......... 672 ACAAGAAAAA AAGGGTAGGC TGCAACTCTT TCTGGTGCTC TAGGCTTTTA TTTTGGGTGA 2757 .......... .......... .......... .......... .......... .......... 672 GGGGATACAG AATAGATGTG TAAGGATGGA AGTGGAATAA TGAGAGGATG AAATGTTGTG 2697 .......... .......... .......... .......... .......... .......... 672 CAGCGAGCTC TCACGTGGGT ACTTTGGAGG GACAGAAACA AGAGATTTTG GAAGCAAAGG 2637 .......... .......... .......... .......... .......... .......... 672 AAGATGACTT CATAAAGCTT AGGGTTTTTC TTTCATTTTG CTCAACTTTT GGATTGCCCA 2577 .......... .......... .......... .......... .......... .......... 672 CGGAGTTCCT ACATAGATGA TTGAGAGTCT TTAATAAAGA ACCATGTTTT ATGAACCTAT 2517 .......... .......... .......... .......... .......... .......... 672 TCTGCTTATG GATTATCTGT CACACACTGT TCCCAAGAAC TGCCAAAGTC AAAACAGCAA 2457 .......... .......... .......... .......... .......... .......... 672 AATTCTTGGA TCATATCAAT GGGTGCACAA CTGCATGTTT CTCTGGACCA TCATAGTGTA 2397 .......... .......... .......... .......... .......... .......... 672 CTTGCACATA TCTATTCAGA GGAAGGTCTT TTCATAATGG ATGATCCTCA AAACTATAAT 2337 .......... .......... .......... .......... .......... .......... 672 GTAGATGATT GATGGAAGAA ATATGAAACA TATCCAAAAT TCTCGCTGAT GGAAATAGAC 2277 .......... .......... .......... .......... .......... .......... 672 CCGACACAGA TATCTCTTGT ATACTGGGGC GTCCTCCACA ACATTAGCAA AATACTATCC 2217 .......... .......... .......... .......... .......... .......... 672 ACAAAAGGAG TGGAAGTGGG AGTAAAATAG AAGTAGCTAA AAGGCATTGA ACCCCCACCC 2157 .......... .......... .......... .......... .......... .......... 672 AAAATAAACA CCCTTCTCTC CATCTGCTGC CTTTTGAATT ACCTTCACCG AGTGGCACTG 2097 .......... .......... .......... .......... .......... .......... 672 ATGATTACAT ATAGGTTAAG AGGGCCAATA AGGTGGTGTT GGTTCGTCTA TATGACAGAA 2037 .......... .......... .......... .......... .......... .......... 672 ATGGATGGTG ACTGCATAAT TAGGGGTGGC ATTTATCTTT TTCTCTTCTA ACATTTCCTT 1977 .......... .......... .......... .......... .......... .......... 672 TCCTCTATTA CTTCCACATA TTTGTGCTTG CCCTTCATGG AACATGAGAA TAATTTGTAT 1917 .......... .......... .......... .......... .......... .......... 672 TCATTTGGTA GTTCTTGAAT ATAATGGCCA AACTGAGAGC AAATTGGTGG TATACACTAA 1857 .......... .......... .......... .......... .......... .......... 672 GAAACAGAAT TTAGTACAAC TAAGGATCTA TTATGCTTAT ACTTTGGGGA ATGTGGTTGA 1797 .......... .......... .......... .......... .......... .......... 672 AATTTACTCT TGTTTTATGA CCTATATCTA CCAGTAAACA CTACAGACTT TGACTGAGTT 1737 .......... .......... .......... .......... .......... .......... 672 TGTTTTGCAG TTACCTAGTT CTAATTTTTT CCGGTCAAGT GGATCAGCAA GGCAACCT-A 1678 ||||||| || |||||||||| | ||||| |||||||||| ||||||| | .......... TTACCTAATT CTAATTTTTT TCCGTCAAAG GGATCAGCAA AGCAACCTAA 722 ATGTCTCTAG CAGTCGAGAC CCAGTGATTA C-TGGGGGTG AACCTGACCT CTCCCGCCCT 1619 |||| | ||| ||||| |||| |||||||||| | || ||||| |||||||||| | ||| ||| ATGT-TTTAG CAGTCCAGAC CCAGTGATTA CTTGTGGGTG AACCTGACCT TTTCCGGCCT 781 CAGACACTAG ATGCAGCAGG CGCAGCATCA CTG 1586 |||| ||||| ||||| || | ||| |||||| ||| CAGAAACTAG ATGCAACAAG CGCCGCATCA CTG 814 hqPGS_C06HBa0120H21.1-3-_SGN-U337544+ (4076 4046,3906 3824,3750 3248,1726 1586) Total number of EST alignments reported: 2 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 10330: PGL 1 (- strand): 8274 1053 AGS-1 (8274 7726,7626 7586,6590 6521,6438 6290,6215 6120,6035 5965,4641 4580,4497 4434,4341 4257,4172 4046,3906 3824,3750 3687,3384 3248,1726 1053) SCR (e 1.000 d 0.995 a 0.990,e 1.000 d 1.000 a 0.996,e 1.000 d 0.919 a 0.951,e 1.000 d 0.949 a 0.931,e 1.000 d 0.880 a 0.920,e 1.000 d 0.231 a 0.988,e 1.000 d 1.000 a 0.940,e 1.000 d 0.999 a 0.987,e 1.000 d 0.995 a 0.000,e 1.000 d 0.998 a 0.614,e 1.000 d 0.974 a 0.995,e 1.000 d 0.862 a 0.990,e 1.000 d 0.998 a 0.880,e 0.997) Exon 1 8274 7726 ( 549 n); score: 1.000 Intron 1 7725 7627 ( 99 n); Pd: 0.995 Pa: 0.990 Exon 2 7626 7586 ( 41 n); score: 1.000 Intron 2 7585 6591 ( 995 n); Pd: 1.000 Pa: 0.996 Exon 3 6590 6521 ( 70 n); score: 1.000 Intron 3 6520 6439 ( 82 n); Pd: 0.919 Pa: 0.951 Exon 4 6438 6290 ( 149 n); score: 1.000 Intron 4 6289 6216 ( 74 n); Pd: 0.949 Pa: 0.931 Exon 5 6215 6120 ( 96 n); score: 1.000 Intron 5 6119 6036 ( 84 n); Pd: 0.880 Pa: 0.920 Exon 6 6035 5965 ( 71 n); score: 1.000 Intron 6 5964 4642 (1323 n); Pd: 0.231 Pa: 0.988 Exon 7 4641 4580 ( 62 n); score: 1.000 Intron 7 4579 4498 ( 82 n); Pd: 1.000 Pa: 0.940 Exon 8 4497 4434 ( 64 n); score: 1.000 Intron 8 4433 4342 ( 92 n); Pd: 0.999 Pa: 0.987 Exon 9 4341 4257 ( 85 n); score: 1.000 Intron 9 4256 4173 ( 84 n); Pd: 0.995 Pa: 0.000 Exon 10 4172 4046 ( 127 n); score: 1.000 Intron 10 4045 3907 ( 139 n); Pd: 0.998 Pa: 0.614 Exon 11 3906 3824 ( 83 n); score: 1.000 Intron 11 3823 3751 ( 73 n); Pd: 0.974 Pa: 0.995 Exon 12 3750 3687 ( 64 n); score: 1.000 Intron 12 3686 3385 ( 302 n); Pd: 0.862 Pa: 0.990 Exon 13 3384 3248 ( 137 n); score: 1.000 Intron 13 3247 1727 (1521 n); Pd: 0.998 Pa: 0.880 Exon 14 1726 1053 ( 674 n); score: 0.997 PGS (8274 7726,7626 7586,6590 6521,6438 6290,6215 6120,6035 5965,4641 4580,4497 4434,4341 4257,4172 4046,3906 3824,3750 3687,3384 3248,1726 1053) SGN-U322161+ 3-phase translation of AGS-1 (-strand): . . . . . . 8274 GAAGTTTCTGATTGAAATACGTATGGGCCGACTATGAAATACACTGTATATTCATTCAGT E V S D - N T Y G P T M K Y T V Y S F S K F L I E I R M G R L - N T L Y I H S V S F - L K Y V W A D Y E I H C I F I Q . . . . . . 8214 TCCCTCTGATGAATTTATCTACAACACTACGTACGACGCTGTATCATGGGGATCGACTAG S L - - I Y L Q H Y V R R C I M G I D - P S D E F I Y N T T Y D A V S W G S T R F P L M N L S T T L R T T L Y H G D R L . . . . . . 8154 GGTGGGTGGGGACAAGAAGATAGTATTACGAAAGCAAAAAGACAGAACTAAACGTATATA G G W G Q E D S I T K A K R Q N - T Y I V G G D K K I V L R K Q K D R T K R I Y G W V G T R R - Y Y E S K K T E L N V Y . . . . . . 8094 TATATATATAAAATAATAGAAGGAAGAAATTGGGGTTAAGAGATAGAATCAGGGCCAGAT Y I Y K I I E G R N W G - E I E S G P D I Y I K - - K E E I G V K R - N Q G Q I I Y I - N N R R K K L G L R D R I R A R . . . . . . 8034 TATTCAAAATGGGATGATAATAAAGCATACACTTGAAGTTGATGCTGCTTCATTTTGCAC Y S K W D D N K A Y T - S - C C F I L H I Q N G M I I K H T L E V D A A S F C T L F K M G - - - S I H L K L M L L H F A . . . . . . 7974 CTTCTTGAGGCAAAACTCATGAAGGAACAAAACCCCAAAAAAGAATTATTCAAAAAGCTG L L E A K L M K E Q N P K K E L F K K L F L R Q N S - R N K T P K K N Y S K S - P S - G K T H E G T K P Q K R I I Q K A . . . . . . 7914 AAATCTTGATTTTTCTTTTTTTTGATTCATTTTCATTCTATGTTGGATTGAATTTTTCAG K S - F F F F L I H F H S M L D - I F Q N L D F S F F - F I F I L C W I E F F R E I L I F L F F D S F S F Y V G L N F S . . . . . . 7854 GGTTTGTGTGGTGAAGTTTTTTTTTTTTTTCAAATTTAAATTTTTAGTCTCCGATGGAGC G L C G E V F F F F Q I - I F S L R W S V C V V K F F F F F K F K F L V S D G A G F V W - S F F F F S N L N F - S P M E . . . . . . 7794 CTCGTGTTGGTAATAAGTTCAGGCTTGGCCGGAAAATTGGTAGCGGTTCTTTTGGGGAGA L V L V I S S G L A G K L V A V L L G R S C W - - V Q A W P E N W - R F F W G D P R V G N K F R L G R K I G S G S F G E . : . . . . : . 7734 TCTACCTCG : GTGGTAATGTTCAAACTAATGAAGAGGTTGCTATCAAGCTG : GAAAATGTGA S T S : V V M F K L M K R L L S S W : K M - L P R : W - C S N - - R G C Y Q A : G K C E I Y L : G G N V Q T N E E V A I K L : E N V . . . . . . : 6580 AAACAAAGCATCCTCAACTGTTGTACGAAGCAAAGCTGTATAAAATACTACAAGGAGGAA : K Q S I L N C C T K Q S C I K Y Y K E E : N K A S S T V V R S K A V - N T T R R N : K T K H P Q L L Y E A K L Y K I L Q G G : . . . . . . 6438 CTGGAGTCCCCAATTTAAAATGGTTTGGAGTTGAAGGAGATTACAATGTCCTTGTGATGG L E S P I - N G L E L K E I T M S L - W W S P Q F K M V W S - R R L Q C P C D G T G V P N L K W F G V E G D Y N V L V M . . . . . . 6378 ATTTACTGGGACCTAGTCTTGAAGATCTTTTCAACTTCTGCAATAGGAAAATGTCCTTGA I Y W D L V L K I F S T S A I G K C P - F T G T - S - R S F Q L L Q - E N V L E D L L G P S L E D L F N F C N R K M S L . . . : . . . 6318 AGACCGTTCTCATGCTTGCAGATCAGATG : ATCAATCGGGTTGAGTTTGTTCATGCCAAAT R P F S C L Q I R - : S I G L S L F M P N D R S H A C R S D : D Q S G - V C S C Q I K T V L M L A D Q M : I N R V E F V H A K . . . . . . 6184 CTTTTCTTCATCGAGATATAAAACCTGACAACTTTCTTATGGGATTAGGAAGACGTGCAA L F F I E I - N L T T F L W D - E D V Q F S S S R Y K T - Q L S Y G I R K T C K S F L H R D I K P D N F L M G L G R R A . : . . . . . 6124 ATCAG : GTCTATGTTATTGATTTTGGGCTGGCTAAGAAGTACAGAGACTCATCAACTCATC I R : S M L L I L G W L R S T E T H Q L I S : G L C Y - F W A G - E V Q R L I N S S N Q : V Y V I D F G L A K K Y R D S S T H . . : . . . . 5980 AGCATATTCCGTATAG : AGAAAATAAAAATTTGACAGGAACTGCTAGATATGCAAGCATGA S I F R I : E K I K I - Q E L L D M Q A - A Y S V - : R K - K F D R N C - I C K H E Q H I P Y R : E N K N L T G T A R Y A S M . . : . . . . 4597 ACACTCACCTCGGCATTG : AACAAAGTCGAAGGGATGATTTGGAATCATTGGGTTTTGTTC T L T S A L : N K V E G M I W N H W V L F H S P R H - : T K S K G - F G I I G F C S N T H L G I : E Q S R R D D L E S L G F V . . . : . . . 4455 TGATGTACTTTTTAAGAGGAAG : TCTCCCTTGGCAGGGGCTGAAAGCAGGCAATAAGAAAC - C T F - E E : V S L G R G - K Q A I R N D V L F K R K : S P L A G A E S R Q - E T L M Y F L R G S : L P W Q G L K A G N K K . . . . . : . 4303 AGAAGTATGAGAGGATCAGTGAGAAGAAAGTTTCAACGTCAATAGAG : ACCTTGTGTCGAG R S M R G S V R R K F Q R Q - R : P C V E E V - E D Q - E E S F N V N R : D L V S R Q K Y E R I S E K K V S T S I E : T L C R . . . . . . 4159 GCTATCCTGCAGAGTTTGCATCATATTTTCATTACTGTCGATCACTGAGATTTGATGATA A I L Q S L H H I F I T V D H - D L M I L S C R V C I I F S L L S I T E I - - - G Y P A E F A S Y F H Y C R S L R F D D . . . . . . : 4099 AACCAGATTATGCTTATCTCAAGAGAATTTTCCGTGACCTTTTCATTCGTGAAG : GGTTTC N Q I M L I S R E F S V T F S F V K : G F T R L C L S Q E N F P - P F H S - R : V S K P D Y A Y L K R I F R D L F I R E : G F . . . . . . 3900 AGTTTGATTATGTATTTGACTGGACCATATTGAAATATCAGCAATCACAGCTTGCCAATA S L I M Y L T G P Y - N I S N H S L P I V - L C I - L D H I E I S A I T A C Q Y Q F D Y V F D W T I L K Y Q Q S Q L A N . . : . . . . 3840 TTCCATCTCGTGCTCTT : GGCGGTACTGCTGGGCCAAGTTCAGGGACGCCTCATGGTCTTG F H L V L L : A V L L G Q V Q G R L M V L S I S C S : W R Y C W A K F R D A S W S C I P S R A L : G G T A G P S S G T P H G L . . . : . . . 3707 CTAATGTCGAAAAGAAATCAG : GTGGTGAAGAAGGACGGCCAACTGGTTGGTCTTCATCAA L M S K R N Q : V V K K D G Q L V G L H Q - C R K E I R : W - R R T A N W L V F I K A N V E K K S : G G E E G R P T G W S S S . . . . . . 3345 ATCTGACACGTAATAGGAGCACAGGGCTCAATTTCAATGCTGGAAGCTTATTGAAGCAAA I - H V I G A Q G S I S M L E A Y - S K S D T - - E H R A Q F Q C W K L I E A K N L T R N R S T G L N F N A G S L L K Q . . . . : . . 3285 AAGGCACAGTTGCTAATGATTTATCTGTGGGTAAAGAG : TTACCTAGTTCTAATTTTTTCC K A Q L L M I Y L W V K S : Y L V L I F S R H S C - - F I C G - R : V T - F - F F P K G T V A N D L S V G K E : L P S S N F F . . . . . . 1704 GGTCAAGTGGATCAGCAAGGCAACCTAATGTCTCTAGCAGTCGAGACCCAGTGATTACTG G Q V D Q Q G N L M S L A V E T Q - L L V K W I S K A T - C L - Q S R P S D Y W R S S G S A R Q P N V S S S R D P V I T . . . . . . 1644 GGGGTGAACCTGACCTCTCCCGCCCTCAGACACTAGATGCAGCAGGCGCAGCATCACTGC G V N L T S P A L R H - M Q Q A Q H H C G - T - P L P P S D T R C S R R S I T A G G E P D L S R P Q T L D A A G A A S L . . . . . . 1584 GTAAAATATTTAATACTACCCAGAAGACTTCACCAGTTGTGTCTTCAGAGCACAAGCGCA V K Y L I L P R R L H Q L C L Q S T S A - N I - Y Y P E D F T S C V F R A Q A Q R K I F N T T Q K T S P V V S S E H K R . . . . . . 1524 GCTCCTCCACAAGAAACACAAACCTAAAGAATTTAGAGTCTGCCATCAAAGGAATAGAGG A P P Q E T Q T - R I - S L P S K E - R L L H K K H K P K E F R V C H Q R N R G S S S T R N T N L K N L E S A I K G I E . . . . . . 1464 GTTTAAGCTTTCGATGATGAGGTACTGGATTAGTAGCTCTGCTTTGTCACAATTCCCCCT V - A F D D E V L D - - L C F V T I P P F K L S M M R Y W I S S S A L S Q F P L G L S F R - - G T G L V A L L C H N S P . . . . . . 1404 CTACTGTATATCTTGGCACAGCAAACACACCAACATGGCGGAGTATGAGTTCTGATATTA L L Y I L A Q Q T H Q H G G V - V L I L Y C I S W H S K H T N M A E Y E F - Y - S T V Y L G T A N T P T W R S M S S D I . . . . . . 1344 GTTGTTTCCAGGAGGAACCATAAACAATGCAACCCCCGCAAACTCACAAATCCCAGTTTA V V S R R N H K Q C N P R K L T N P S L L F P G G T I N N A T P A N S Q I P V Y S C F Q E E P - T M Q P P Q T H K S Q F . . . . . . 1284 TGTTTTGTCCATACAGACACAGTTATAGGCACTTATCGTATTCTTTTTCGTCTCTATCTC C F V H T D T V I G T Y R I L F R L Y L V L S I Q T Q L - A L I V F F F V S I S M F C P Y R H S Y R H L S Y S F S S L S . . . . . . 1224 TCCTGTTCTTGTTCTATCGTGTTATTCATATTTATCTTATGTTGTGAATTATGAAGAGGC S C S C S I V L F I F I L C C E L - R G P V L V L S C Y S Y L S Y V V N Y E E A L L F L F Y R V I H I Y L M L - I M K R . . . . . . 1164 CCATATATAATTGCCGATTTATATGGTCCACGAGATGTTTGACGCTAGCAGATTCTTCTG P Y I I A D L Y G P R D V - R - Q I L L H I - L P I Y M V H E M F D A S R F F C P I Y N C R F I W S T R C L T L A D S S . . . . . . 1104 CTTTGGACATGAGCAAACTCTCTTTTGTTTCAAGCTATTTAAATATCAATCA L W T - A N S L L F Q A I - I S I F G H E Q T L F C F K L F K Y Q S A L D M S K L S F V S S Y L N I N Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0120H21.1-3-_PGL-1_AGS-1_PPS_1 (7807 7726,7626 7586,6590 6521,6438 6290,6215 6120,6035 5965,4641 4580,4497 4434,4341 4257,4172 4046,3906 3824,3750 3687,3384 3248,1726 1448) (frame '0'; 1407 bp, 469 residues) 1 SPMEPRVGNK FRLGRKIGSG SFGEIYLGGN VQTNEEVAIK LENVKTKHPQ LLYEAKLYKI 61 LQGGTGVPNL KWFGVEGDYN VLVMDLLGPS LEDLFNFCNR KMSLKTVLML ADQMINRVEF 121 VHAKSFLHRD IKPDNFLMGL GRRANQVYVI DFGLAKKYRD SSTHQHIPYR ENKNLTGTAR 181 YASMNTHLGI EQSRRDDLES LGFVLMYFLR GSLPWQGLKA GNKKQKYERI SEKKVSTSIE 241 TLCRGYPAEF ASYFHYCRSL RFDDKPDYAY LKRIFRDLFI REGFQFDYVF DWTILKYQQS 301 QLANIPSRAL GGTAGPSSGT PHGLANVEKK SGGEEGRPTG WSSSNLTRNR STGLNFNAGS 361 LLKQKGTVAN DLSVGKELPS SNFFRSSGSA RQPNVSSSRD PVITGGEPDL SRPQTLDAAG 421 AASLRKIFNT TQKTSPVVSS EHKRSSSTRN TNLKNLESAI KGIEGLSFR- AGS-2 (4076 4046,3906 3824,3750 3248,1726 1586) SCR (e 1.000 d 0.998 a 0.614,e 1.000 d 0.974 a 0.995,e 1.000 d 0.998 a 0.880,e 0.844) Exon 1 4076 4046 ( 31 n); score: 1.000 Intron 1 4045 3907 ( 139 n); Pd: 0.998 Pa: 0.614 Exon 2 3906 3824 ( 83 n); score: 1.000 Intron 2 3823 3751 ( 73 n); Pd: 0.974 Pa: 0.995 Exon 3 3750 3248 ( 503 n); score: 1.000 Intron 3 3247 1727 (1521 n); Pd: 0.998 Pa: 0.880 Exon 4 1726 1586 ( 141 n); score: 0.844 PGS (4076 4046,3906 3824,3750 3248,1726 1586) SGN-U337544+ 3-phase translation of AGS-2 (-strand): . . . . : . . 4076 AGAATTTTCCGTGACCTTTTCATTCGTGAAG : GGTTTCAGTTTGATTATGTATTTGACTGG R I F R D L F I R E : G F Q F D Y V F D W E F S V T F S F V K : G F S L I M Y L T G N F P - P F H S - R : V S V - L C I - L . . . . . . : 3877 ACCATATTGAAATATCAGCAATCACAGCTTGCCAATATTCCATCTCGTGCTCTT : GGCGGT T I L K Y Q Q S Q L A N I P S R A L : G G P Y - N I S N H S L P I F H L V L L : A V D H I E I S A I T A C Q Y S I S C S : W R . . . . . . 3744 ACTGCTGGGCCAAGTTCAGGGACGCCTCATGGTCTTGCTAATGTCGAAAAGAAATCAGGT T A G P S S G T P H G L A N V E K K S G L L G Q V Q G R L M V L L M S K R N Q V Y C W A K F R D A S W S C - C R K E I R . . . . . . 3684 ATGTTTGGTCTCGATATCCTCCTCCCTCAACCACAAAAAAGAAGGAAAGATTAACAAGAA M F G L D I L L P Q P Q K R R K D - Q E C L V S I S S S L N H K K E G K I N K K Y V W S R Y P P P S T T K K K E R L T R . . . . . . 3624 AAAGTTAATTAGAAGACTTAGCAATCAGTCTGTGCTAACTAGTGTGGCATAAAGTGGTTC K V N - K T - Q S V C A N - C G I K W F K L I R R L S N Q S V L T S V A - S G S K S - L E D L A I S L C - L V W H K V V . . . . . . 3564 TATGTATCTTGTCTCTTATGTATACGTGAAAAGTTACAATTGGAATCCATTTCAATTGGT Y V S C L L C I R E K L Q L E S I S I G M Y L V S Y V Y V K S Y N W N P F Q L V L C I L S L M Y T - K V T I G I H F N W . . . . . . 3504 CCATTATGTCATTTATTTTGGATGGCATGCCAGATGTTCTGTTCAGGAAAGTACATTACC P L C H L F W M A C Q M F C S G K Y I T H Y V I Y F G W H A R C S V Q E S T L P S I M S F I L D G M P D V L F R K V H Y . . . . . . 3444 TCTAAATATGCTAGCTTCTTATTCTCTGGGAGCTATATATGCTTGCTACTCAATATGCAG S K Y A S F L F S G S Y I C L L L N M Q L N M L A S Y S L G A I Y A C Y S I C R L - I C - L L I L W E L Y M L A T Q Y A . . . . . . 3384 GTGGTGAAGAAGGACGGCCAACTGGTTGGTCTTCATCAAATCTGACACGTAATAGGAGCA V V K K D G Q L V G L H Q I - H V I G A W - R R T A N W L V F I K S D T - - E H G G E E G R P T G W S S S N L T R N R S . . . . . . 3324 CAGGGCTCAATTTCAATGCTGGAAGCTTATTGAAGCAAAAAGGCACAGTTGCTAATGATT Q G S I S M L E A Y - S K K A Q L L M I R A Q F Q C W K L I E A K R H S C - - F T G L N F N A G S L L K Q K G T V A N D . . : . . . . 3264 TATCTGTGGGTAAAGAG : TTACCTAGTTCTAATTTTTTCCGGTCAAGTGGATCAGCAAGGC Y L W V K S : Y L V L I F S G Q V D Q Q G I C G - R : V T - F - F F P V K W I S K A L S V G K E : L P S S N F F R S S G S A R . . . . . . 1683 AACCTAATGTCTCTAGCAGTCGAGACCCAGTGATTACTGGGGGTGAACCTGACCTCTCCC N L M S L A V E T Q - L L G V N L T S P T - C L - Q S R P S D Y W G - T - P L P Q P N V S S S R D P V I T G G E P D L S . . . . 1623 GCCCTCAGACACTAGATGCAGCAGGCGCAGCATCACTG A L R H - M Q Q A Q H H P S D T R C S R R S I T R P Q T L D A A G A A S L Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0120H21.1-3-_PGL-1_AGS-2_PPS_1 (3430 3248,1726 1586) (frame '0'; 324 bp, 108 residues) 1 LLILWELYML ATQYAGGEEG RPTGWSSSNL TRNRSTGLNF NAGSLLKQKG TVANDLSVGK 61 ELPSSNFFRS SGSARQPNVS SSRDPVITGG EPDLSRPQTL DAAGAASL >C06HBa0120H21.1-3-_PGL-1_AGS-2_PPS_2 (4076 4046,3906 3824,3750 3631) (frame '1'; 231 bp, 77 residues) 1 RIFRDLFIRE GFQFDYVFDW TILKYQQSQL ANIPSRALGG TAGPSSGTPH GLANVEKKSG 61 MFGLDILLPQ PQKRRKD- ... finished at: Mon Aug 28 21:57:39 2006 ________________________________________________________________________________ Sequence 4: C06HBa0120H21.1-4, from 1 to 12097, both strands analyzed. ... started at: Mon Aug 28 21:57:39 2006 EST library file: /tmp/cxgn-bacpublish-resources-ZuEZzO/lycopersicum_combined_unigene_seqs; matching gDNA +strand ... ... found all matches, elapsed seconds = 2 ... matches indexed, elapsed seconds = 2 HitsTableSize = 3 EST library file: /tmp/cxgn-bacpublish-resources-ZuEZzO/lycopersicum_combined_unigene_seqs; matching gDNA -strand ... ... found all matches, elapsed seconds = 3 ... matches indexed, elapsed seconds = 3 HitsTableSize = 5 ******************************************************************************** EST sequence 7 +strand 2470 n (File: SGN-U321764+) 1 GATTTTCGTA ATTATTGATA CAGACAAAGG TGTTTAAACG GACATCGTAA TGGAGTGTGT 61 GTGAGAGAGA ACCCATTTTG AGGAATTCGG GGGCAATTTC AGTTTCTCGG TGGCGGAAGA 121 CGCGCAGAGA CTTCACCCTT TCAATTCCGT TGACACATAT ACAGAGGTGT CGTTTCTGAT 181 AATCAATTTC ACTTTCTCTT TGCTACGATC GCTACATTCT CTCTCTCTCT TTTGTGTAGA 241 CTGAACCCGC GCTGAACTGC ATTTCGCCTA TAAATTAATA TATTTCTTAG GAAGATGCAG 301 CAACCCGTGC AGCCAAGGTC TTCTGCCAAT GGATATGGCC GTCGTAAAGT TGATAGAGAA 361 ATGGGTACTA AGTTGGAGAA TAAAGCGCAA TCTGGAAAAA CTACTTCTCG TCAATTTACA 421 GGTAAAGGGG GAGCATATCA AAGCCTGTCA CATGATCGAC TAGTTTATTT CACTACCTGT 481 CTTGTTGGAC ATCAAGTGGA AGTACAAGTG ATGGACGGAT CAGTGTTTTC AGGGATACTT 541 CATGCGACAA ACGCTGAAAA AGATTTTGGT ATCATTCTGA AAATGGCGCA GTTGATAAAA 601 GATAGCTCTG AGGGGATGAA GAGTAGTTCT GAAACTTTTA GCAAGCCTCC ATTAAAGACT 661 TTGATAATAC CGGGTAAAGA GTTTGCTCAA GTTACAGCAA AGGGTGTGCC TACAACTCTA 721 GACGGTTTCA GAACAGAATT CATGCTGGAA CAGCAGCAGG AACTTTTGAC TGATTCATGC 781 ATTTCACAAT CTCGGCATAT TGAGGTAGAG CGGCAATTGG AACGCTGGGT ACCTGATGAT 841 GATGCTCCTG AATGTCCTGA ACTGGACAAT ATATTTGATG GCCATTGGAA TAGGGGCTGG 901 GATCAGTTTC AAGCCAATGA AACACTGTTT GGAGTAAAAA GCACATTTGA TGAGGACCTT 961 TATACGACAA AGCTTGAGAG AGGTCCTCAG ATGAGTGAGT TGGAAAAAGA AGCTCTAAGA 1021 ATAGCTAGAG AAATTGAGGG TGAGGATACA CGTGATCTTC ATCTAGCAGA GGAGAGAGGG 1081 ATCCAACTTC ATGAGAACCT AGAAGTGGAC GAGGAAACCA GATTTTCCGC AGTTGTTAGA 1141 GAGATTGATG ATAGCGGCTA TGACAACTGT GAGGACATCC TGTTGGATTC ACGTAATGAT 1201 GAGACATTTC AAGGTATATC TAGTGCTATG GGGAAGTCAT TTACTGACAT GGGCAGAAGG 1261 AAAATGAATG ATGGTGCACA AGTTTCATTA AGATCTTCCT TCATGGATGA AGTGCAATCT 1321 TCCAAGCTAA GTACCAGTAG GGATGTCTAC CAGACTTGTT ACGATGATCA TGCGAAACAG 1381 TCATCAGCTG AAGTTGTCCT TAAAGGTGGC TCTATCTTAA ACAGGGGTCG CAAAACTCTG 1441 TTTAGTGAGC ATGCTGGAGC AAGTTGGAAT AAGGAGGATA CAAGAAATCA AATGACGGAT 1501 GAAGTTGCTC AAACGTCAGT ATTGGAAGAT TCAATGTCTT CTTCAAGAAT GAAAATGGAG 1561 ACCTCTGATG GGGGTAGATT GTCTCCAGAC ATCTCTGCAT TGCATGTTCA TCCAGCGGAC 1621 CAGGATATGA TCACAAGTTC TTCTAGAGAG AAGTTTGAGG GTGCGGTGTC TTCCAAGATT 1681 CAAGGGGCTC CACAATCTGC TAATTCTCGT GTACGACCTA GTAGTTCTGT TCTTTCCGGT 1741 TCTGATGGAA CAGGTGCTGC CTCAACGTCA GCTGACAATG GATTATCACG AACCTCTTCT 1801 GTAAATTCAT TTTCGTCAGA AAAATCCACA TTGAATCCAC ATGCTAAGGA ATTTAAATTA 1861 AATCCTAATG CAAAGAGTTT CATGCCATTT CAATCACCTT TGAGACCTGC TTCTCCGGTG 1921 TCTGATAGTT CCTTCTATTA TCCAGCTGGT GTGGCTACTG TTCCCAATGT GCATGGCATG 1981 CCTGTTGGGG TAGGTCCTTC ATTTTCTCCA CATCAGCCTG TTATGTTTAA TCCACAAGCT 2041 ACACCTGTAC CACAACAATT TTTTCATCCA AATGGACCAC AGTATGGGCA GCAGATGATG 2101 ATTGGTCCCC CTCGGCAAGT AGTCTATATG CCGAATTACC CCGCTGAAAT GCGACGAGAC 2161 TACTAATCAG TTGGCAAACC ATATTGCGTG GTGGGTTGAA CCGATGGATG CTGACATGAG 2221 ATTTCATGGA TTGGTGGAGG AGGTTTAGCT GGTTGATGAA GGGGGATTCC AATGATTTGA 2281 TTAGAGCTTT TCCTTATACT GGGGTATCAG TAATTGTTAC TTTGTCATAA TCATTAGATT 2341 TGTTAACTTT CAGATTTACA GTCTTTCTTG AAGTTAACTG TGGTGTTTCC TTGGTATGCT 2401 GCTGTTGATA TTTCTTCTCT TTGATCTGTA TTCCTAATAT TGTATGTTTC TCGACAAAAA 2461 AAAAAAAAAA Predicted gene structure (within gDNA segment 5875 to 1): Exon 1 5269 5039 ( 231 n); cDNA 2 232 ( 231 n); score: 1.000 Intron 1 5038 2123 (2916 n); Pd: 0.980 (s: 1.00), Pa: 0.855 (s: 1.00) Exon 2 2122 1934 ( 189 n); cDNA 233 421 ( 189 n); score: 1.000 Intron 2 1933 1864 ( 70 n); Pd: 0.614 (s: 1.00), Pa: 0.357 (s: 1.00) Exon 3 1863 1717 ( 147 n); cDNA 422 568 ( 147 n); score: 1.000 Intron 3 1716 1459 ( 258 n); Pd: 0.969 (s: 1.00), Pa: 0.994 (s: 1.00) Exon 4 1458 1325 ( 134 n); cDNA 569 702 ( 134 n); score: 1.000 Intron 4 1324 1089 ( 236 n); Pd: 0.980 (s: 1.00), Pa: 0.990 (s: 1.00) Exon 5 1088 898 ( 191 n); cDNA 703 893 ( 191 n); score: 0.990 PPA cDNA 2454 2470 MATCH C06HBa0120H21.1-4- SGN-U321764+ 0.998 892 0.361 C PGS_C06HBa0120H21.1-4-_SGN-U321764+ (5269 5039,2122 1934,1863 1717,1458 1325,1088 898) Alignment (genomic DNA sequence = upper lines): ATTTTCGTAA TTATTGATAC AGACAAAGGT GTTTAAACGG ACATCGTAAT GGAGTGTGTG 5210 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTTTCGTAA TTATTGATAC AGACAAAGGT GTTTAAACGG ACATCGTAAT GGAGTGTGTG 61 TGAGAGAGAA CCCATTTTGA GGAATTCGGG GGCAATTTCA GTTTCTCGGT GGCGGAAGAC 5150 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGAGAGAGAA CCCATTTTGA GGAATTCGGG GGCAATTTCA GTTTCTCGGT GGCGGAAGAC 121 GCGCAGAGAC TTCACCCTTT CAATTCCGTT GACACATATA CAGAGGTGTC GTTTCTGATA 5090 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCGCAGAGAC TTCACCCTTT CAATTCCGTT GACACATATA CAGAGGTGTC GTTTCTGATA 181 ATCAATTTCA CTTTCTCTTT GCTACGATCG CTACATTCTC TCTCTCTCTT TGTAAGTATC 5030 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| | ATCAATTTCA CTTTCTCTTT GCTACGATCG CTACATTCTC TCTCTCTCTT T......... 232 GCCTTTTTAA TCTACTTGTT TATTGATGGA TCATGTAGTT TGAATCGCAC AAAAATATGG 4970 .......... .......... .......... .......... .......... .......... 232 GTCTAGGGTT TTCTAATCGA GCTCAGGCAA AAGAAGTTGT TCTTTTTTTG TTTTAAAGTT 4910 .......... .......... .......... .......... .......... .......... 232 GGGATTTCAA TTTATTACTT GTGAATTTGT GAAGCAGCTG TAAAAGCTGT GATTTTGGGT 4850 .......... .......... .......... .......... .......... .......... 232 TAGCTCCTAC TGCGTTGATT GTCTTGTTTG GGAAGTTGGG TGCTTTGGAG AAGGTTTCTT 4790 .......... .......... .......... .......... .......... .......... 232 TTATCGGAAA TGCAGTTATT CTTGATTATA CGTGCTTTTT GTGCCAAATT GTTTGTTTTG 4730 .......... .......... .......... .......... .......... .......... 232 TTTCTGGCTC TTGTTGTTTC AAGTTTATAA AAATCAATAA ATGAACCTAA ATATAGCGGA 4670 .......... .......... .......... .......... .......... .......... 232 ATGGATATAG ATGTTGCTGA ACCAACTAGT CTAGGATGAG AGTTGTTGCT GTATATTAAC 4610 .......... .......... .......... .......... .......... .......... 232 TCACAGTGGT GAATAGGTAC CTGTAGACTC CATTTGTCAC TTGGTATGCT TATGTTTAGT 4550 .......... .......... .......... .......... .......... .......... 232 ACAGAAAAAG AAGGGAAGAG ACAAGAAAAA GATCAGAGTG GACAGAAGAC GTTGAGCTTA 4490 .......... .......... .......... .......... .......... .......... 232 AGTTTCTAAT GAAGGGAAAA TAGATTCACA GCATGCCGTT TCTTGGTTAC TGTTAGTTGG 4430 .......... .......... .......... .......... .......... .......... 232 TTTGTCAAGA TTCTGGTAGA TGGGCTCCCC TTATTCGAGC GAGGTTGTTC CCGAAAGGTC 4370 .......... .......... .......... .......... .......... .......... 232 CTCAGCTAAA AAATAGAGTA ATCAAAGCAA AATAACGACA ACTATATAGC AAAATAAACT 4310 .......... .......... .......... .......... .......... .......... 232 TAGCAAATGA AGCAACATGT AGTTGTAACT CGTAACAATG AAGAATAAGA TACTATGTGT 4250 .......... .......... .......... .......... .......... .......... 232 ATACTAAAAT AATACTACAG AATAAATGGA AGATGAGAAG AGGAGACTAA CCCCTGCCTC 4190 .......... .......... .......... .......... .......... .......... 232 TCCCACATAA AATTACCAAA CACTCTGCTA TCTTCTAAAT TTCAATAATT GATCATTTAA 4130 .......... .......... .......... .......... .......... .......... 232 AAGCTTAATA GCAGCTGAGG TTTTAGCTAG TTCACAATTG TTCCTTCACC GAGTCTTTGA 4070 .......... .......... .......... .......... .......... .......... 232 TCTTGGGCTT ATTAATGAAA ATCTTACTTG ATTAAAGAAA AATACCACTC TGATGTTGAG 4010 .......... .......... .......... .......... .......... .......... 232 CTTAACGTCA TTTTCCCTTT CTTTTCCCAG AACCCTGCCA GTAAAAAGAT TTGTAAGAGC 3950 .......... .......... .......... .......... .......... .......... 232 TCACTTCATC GGACTCAAAT TTAATAAAGC ATTTCCTCAA ATGTTAACTC TTGGTTTTTC 3890 .......... .......... .......... .......... .......... .......... 232 CTGGTAGATT GGGTTTTTCT GTCGTCTGGA ACTAGGCTCT TGTTGGTCCT AGTTTATGTT 3830 .......... .......... .......... .......... .......... .......... 232 TTATTTCCAA TCATTTTATT GGGGTGATCG TTTTAGAACT CAATTTATCA AATATCAAGA 3770 .......... .......... .......... .......... .......... .......... 232 AAATTATTTT TTGTTGCCGG TGCTAAGTCC TTTGATCTCA TCGAATATGC TCCATCCCAC 3710 .......... .......... .......... .......... .......... .......... 232 AATATATGGT GCAATTGGGT AGAAAGAAAA ATAAACTTCG TAAGGCAGTA GATAATCAGC 3650 .......... .......... .......... .......... .......... .......... 232 TGATGAGTCC TAATATGTTA TATGGTTATG CCTAAAGATG ATTGCTGCAT CAGAAGGATC 3590 .......... .......... .......... .......... .......... .......... 232 TGGGAATATA TTCTGGAGAG GGAGTCAGTT GGACCAACTC TATGAATTTT CTGTAACTCT 3530 .......... .......... .......... .......... .......... .......... 232 ATATTTTAAG TTTCATTTAA GATTTGTGGC TATCTGCAAA ACAGCAGTAA TGTTATTATC 3470 .......... .......... .......... .......... .......... .......... 232 CTGGGTCGAT ATAACTTGAA AAATACAAAA AATTATAAAG GAAGCGAGAT TATAAAAGGG 3410 .......... .......... .......... .......... .......... .......... 232 AAGCAACCAC AGTGATATGA AGGGGCGGTG AGAACTTGAG ATGTTCTGAA AGTGAAACAG 3350 .......... .......... .......... .......... .......... .......... 232 GGCTGAAATA TAGGAACTAG TTGGAGCCAA AATCACTGTT GGATGACAAA ATCATTTAAC 3290 .......... .......... .......... .......... .......... .......... 232 AAAAGAGGGA GTCCCTTCAA AATTGCTTGT TGGGGAGATT TCAGGGTGTT CACCAAGATA 3230 .......... .......... .......... .......... .......... .......... 232 TCCCAATAGG TTCAGAACTA AGAAGATGGT TAACCACATT TGGAAGGTTT TTTTTTTTTT 3170 .......... .......... .......... .......... .......... .......... 232 TTGTGTGTGT GTGTGTGTGG GGGGGGGGGG GTTCTCAAAA GGAAAGGGTT TAGTATTATC 3110 .......... .......... .......... .......... .......... .......... 232 TTTGAACTAA CATTGGATGA GGAATCTCAG TGAATCAAAA GAGTATGAAG ATGGAACCTA 3050 .......... .......... .......... .......... .......... .......... 232 GGCTGGATTG GTGGTCACAA ATGTCGGGTT TCATATCTAA CAACGATATA CCAAAGTAGT 2990 .......... .......... .......... .......... .......... .......... 232 GTTGGATTAG CTATTTAGGT TATTGTGTAT GCTTCTACCC TATGGAACCA GGATACTTTT 2930 .......... .......... .......... .......... .......... .......... 232 AGAAAATTGG TGAAATTGCG TGGTTGATTG AAAATAGAGG AAGAAACTAT TTTGAACAGG 2870 .......... .......... .......... .......... .......... .......... 232 CTACCTAAAA TGGGCTTGAA TTCTTGTCAA AAGCCTCTTA ATTATATTCC GGGGACCATC 2810 .......... .......... .......... .......... .......... .......... 232 TCAGTGGATA CTGGAGAGTT TGAGTACAAT ATATCCATTT GGGTTTGAAA GCACTACAAT 2750 .......... .......... .......... .......... .......... .......... 232 TTGTCAGTCA CAAAGCCTCA CTGTCCTGGA AAATATTTTT CACTGGAATT TCTGGTGAAC 2690 .......... .......... .......... .......... .......... .......... 232 AAATGGACCT CTGAGGGGTT TTTGAGGAGA CGCATTTGAA ATTCAGATCT GATGGTGAGG 2630 .......... .......... .......... .......... .......... .......... 232 TAACTAGGAT CACGCATAAC GAGTTTGGCT TGGGCAAGCA ACAATGTTTA AAGTTATTTA 2570 .......... .......... .......... .......... .......... .......... 232 ACGAGTCCAA TCCTTTGGGC GTCTCTTACA TATAAAAGAA CCTACTATTC TAATACCTGA 2510 .......... .......... .......... .......... .......... .......... 232 ACCAACGTTC AAATAGCAAA ACAATTGCTG AAACCCTCTA TTTGTCATAA GATAATTAAA 2450 .......... .......... .......... .......... .......... .......... 232 TGGCCTGCAT ATATTAACAT TCAATTCTCA CCTTGCATTG CAGAGGACAC GGCCTTCTAC 2390 .......... .......... .......... .......... .......... .......... 232 TTTGTATCTT GTTAATCACA GTCCTATAAA GTGCCTTAAT TCCTAGTCAT TGTTCCAAAG 2330 .......... .......... .......... .......... .......... .......... 232 AATCATTATG TTTAGTGTTA TATAATTCTT TTTCCTGGTT TCTGTATACT AGCTACTGGA 2270 .......... .......... .......... .......... .......... .......... 232 AGATCCAGGT CTAGTGTCTT AGTTGTTGCC TTGGTGTGTA TTGTGCTGTT TGGTACTTTG 2210 .......... .......... .......... .......... .......... .......... 232 ACATGCTTGC AAAAAAGATG CTTCATTCTA AATATTCAAT CTTTAGCAGT TACGGTTATT 2150 .......... .......... .......... .......... .......... .......... 232 ATCTTATTGG AGTATTGTCT TCTGCAGTGT GTAGACTGAA CCCGCGCTGA ACTGCATTTC 2090 ||| |||||||||| |||||||||| |||||||||| .......... .......... .......TGT GTAGACTGAA CCCGCGCTGA ACTGCATTTC 265 GCCTATAAAT TAATATATTT CTTAGGAAGA TGCAGCAACC CGTGCAGCCA AGGTCTTCTG 2030 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCCTATAAAT TAATATATTT CTTAGGAAGA TGCAGCAACC CGTGCAGCCA AGGTCTTCTG 325 CCAATGGATA TGGCCGTCGT AAAGTTGATA GAGAAATGGG TACTAAGTTG GAGAATAAAG 1970 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCAATGGATA TGGCCGTCGT AAAGTTGATA GAGAAATGGG TACTAAGTTG GAGAATAAAG 385 CGCAATCTGG AAAAACTACT TCTCGTCAAT TTACAGGTAT AGGTGAGAGC CGCTGACTTA 1910 |||||||||| |||||||||| |||||||||| |||||| CGCAATCTGG AAAAACTACT TCTCGTCAAT TTACAG.... .......... .......... 421 CGATTGTCTA TGATTTCTTC TGTAGTTCTA ATTCCAAGTG TTCTAGGTAA AGGGGGAGCA 1850 |||| |||||||||| .......... .......... .......... .......... ......GTAA AGGGGGAGCA 435 TATCAAAGCC TGTCACATGA TCGACTAGTT TATTTCACTA CCTGTCTTGT TGGACATCAA 1790 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TATCAAAGCC TGTCACATGA TCGACTAGTT TATTTCACTA CCTGTCTTGT TGGACATCAA 495 GTGGAAGTAC AAGTGATGGA CGGATCAGTG TTTTCAGGGA TACTTCATGC GACAAACGCT 1730 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTGGAAGTAC AAGTGATGGA CGGATCAGTG TTTTCAGGGA TACTTCATGC GACAAACGCT 555 GAAAAAGATT TTGGTATGTT ATAAAAGTTT AACTTAGACT GCAGGTCTTT GGGTATTAAT 1670 |||||||||| ||| GAAAAAGATT TTG....... .......... .......... .......... .......... 568 TGAAGACTTT AACCATTGTT TTGGCATATG TTCAACTATT TTTTTTGAGT CAAAGGCAAC 1610 .......... .......... .......... .......... .......... .......... 568 CAATTGTATA AAATCATAAT TTAGCACATA AAAGAAATGC TAAGTTAAGA AATCTCCAAA 1550 .......... .......... .......... .......... .......... .......... 568 GTATACATAC AACCAAAACA GGAGCCCCTA TTCAAGATAT ACTTCAACTG TGTTATATTC 1490 .......... .......... .......... .......... .......... .......... 568 AACATTGGCT TATTCCCTTC ACTTGACACA GGTATCATTC TGAAAATGGC GCAGTTGATA 1430 ||||||||| |||||||||| |||||||||| .......... .......... .......... .GTATCATTC TGAAAATGGC GCAGTTGATA 597 AAAGATAGCT CTGAGGGGAT GAAGAGTAGT TCTGAAACTT TTAGCAAGCC TCCATTAAAG 1370 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAGATAGCT CTGAGGGGAT GAAGAGTAGT TCTGAAACTT TTAGCAAGCC TCCATTAAAG 657 ACTTTGATAA TACCGGGTAA AGAGTTTGCT CAAGTTACAG CAAAGGTTTG TGTGCAACTA 1310 |||||||||| |||||||||| |||||||||| |||||||||| ||||| ACTTTGATAA TACCGGGTAA AGAGTTTGCT CAAGTTACAG CAAAG..... .......... 702 CTATAATTCT TTCAGGCAAT TATTTATCCA TTGCCTAATT TCAACAATGG CAATACTTTA 1250 .......... .......... .......... .......... .......... .......... 702 AAATTTAAAT AATCTACCAA GCAAATCAGA TCACAGTATA TGATGGTGAT GGTTTTTCAT 1190 .......... .......... .......... .......... .......... .......... 702 GGAAGGTCTT TATATATAAT TTCAAATTAA TTTTTGACAC TGATCGTTGT GATTGCAATT 1130 .......... .......... .......... .......... .......... .......... 702 AGTTTTAGTT ATAATAATGA TGCTCTATTG CTTTGGAGTA GGGTGTGCCT ACAACTCTAG 1070 ||||||||| |||||||||| .......... .......... .......... .......... .GGTGTGCCT ACAACTCTAG 721 ACGGTTTCAG AACAGAATTC ATGCTGGAAC AGCAGCAGGA ACTTTTGACT GATTCATGCA 1010 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACGGTTTCAG AACAGAATTC ATGCTGGAAC AGCAGCAGGA ACTTTTGACT GATTCATGCA 781 TTTCACAATC TCGGCATATT GAGGTAGAGC GGCAATTGGA ACGCTGGGTA CCTGATGATG 950 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTCACAATC TCGGCATATT GAGGTAGAGC GGCAATTGGA ACGCTGGGTA CCTGATGATG 841 ATGCTCCTGA ATGTCCTGAT CTGGACAATA TATTTGATGA CCATTGGAAT AG 898 |||||||||| ||||||||| |||||||||| ||||||||| |||||||||| || ATGCTCCTGA ATGTCCTGAA CTGGACAATA TATTTGATGG CCATTGGAAT AG 893 hqPGS_C06HBa0120H21.1-4-_SGN-U321764+ (5269 5039,2122 1934,1863 1717,1458 1325,1088 898) ******************************************************************************** EST sequence 5 +strand 895 n (File: SGN-U337451+) 1 TCCCCGCGGT GGCGGCCGCT CTAGAACTAG TGGATCCCCC GGGCTGCAGG AATTCGGCAC 61 GAGGGGTACC TGATGATGAT GCTCCTGAAT GTCCTGATCT GGACAATATA TTTGATGACC 121 ATTGGAATAG GGGCTGGGAT CAGTTTCAAG CCAATGAAAC ACTGTTTGGA GTAAAAAGCA 181 CATTTGATGA GGACCTTTAT ACGACAAAGC TTGAGAGAGG TCCTCAGATG AGTGAGTTGG 241 AAAAAGAAGC TCTAAGAATA GCTAGAGAAA TTGAAGGTGA GGATACACGT GATCTTCATC 301 TAGCAGAGGA GAGAGGGATC CAACTTCATG AGAACCTAGA AGTGGACGAG GAAACCAGAT 361 TTTCCGCAGT TGTTAGAGAG ATTGATGATA GCGGCTATGA CAACTGTGAG GACATCCTGT 421 TGGATTCACG TAATGATGAG ACATTTCAAG GTATATCTAG TGCTATGGGG AAGTCATTTA 481 CTGACATGGG CAGAAGGAAA ATGAATGATG GTGCACAAGT TTCATTAAGA TCTTCCTTCA 541 TGGTATAATT TTGTTTACCC ACATTCATTA GTCTTTAAGA ATTGTTTGTT GCGGTACTGA 601 GGCTTTTTCC TTCTGTTAAG ATAGGGATGA CAGAGGAAGA TATTGTCATA TTAGTCAAGT 661 ATCTCAAGCT AGTCGGAAAA TATATCAATT GGTTGTTGTC TGCTATTGCT GGTGAATTTA 721 AGAGTTATCC ACGTCTTAGA TACCAAGCAC TTTCAGTGTG AATCATGTGG TTTAAGAGCC 781 TATAAATAAT ACTTTAGCTT CTGTGTACTC GACTATAGTA AATATAAGCT AATTATATGA 841 CGCATTTTAG TATTTCTATT CCTATCTTGC CCCTAGGATG AAGTGCAATC TTCCA Predicted gene structure (within gDNA segment 2194 to 1): Exon 1 964 898 ( 67 n); cDNA 64 130 ( 67 n); score: 1.000 MATCH C06HBa0120H21.1-4- SGN-U337451+ 1.000 67 0.075 C PGS_C06HBa0120H21.1-4-_SGN-U337451+ (964 898) Alignment (genomic DNA sequence = upper lines): GGGTACCTGA TGATGATGCT CCTGAATGTC CTGATCTGGA CAATATATTT GATGACCATT 905 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGGTACCTGA TGATGATGCT CCTGAATGTC CTGATCTGGA CAATATATTT GATGACCATT 123 GGAATAG 898 ||||||| GGAATAG 130 hqPGS_C06HBa0120H21.1-4-_SGN-U337451+ (964 898) ******************************************************************************** EST sequence 6 +strand 2527 n (File: SGN-U320342+) 1 TTCAGCCGTT GCCGGCGGCG GCTTCCGGTG GTCTCTGTTC TTATAGAATA TAGGAAGCTC 61 TCTCCGTATG TTATGTTAAT TGAAGTAAAA TAAAAGATAA AGAGATGTCG TTATTACGAA 121 GAAGAAAAGC ACCTGAAAAT GAAATTCAAC CGGATGTGGA GCCAAAGCTT GATGCGGAAG 181 AAGACGATAA GAAGAGTAGG AAGAAGAATG TGAAGTCTGG AAAGAAGAAG AAATGGTCGT 241 GTATCGATAA TTGCTGTTGG TTCGTAGGAT GCATATGTTG TGTGTGGTGG ATTTTGTTGT 301 TTTTATACAA TGCTATGCCG GCGTCGTTCC CGCAGTATGT TACGGAGGCG ATTACAGGCC 361 CGTTACCTGA TCCACCGGGC ATCAAGTTGC AGAAGGAAGG GTTGAAGGCT AAGCATCCGG 421 TGGTATTTAT TCCTGGAATT GTCACGTGTG GACTTGAACT GTGGGAAGGA CATCAGTGTG 481 CTGAAGGATT GTTTCGGAAG CGGCTTTGGG GCGGGACTTT TGGAGAAGTG TACAAAAGAC 541 CACTATGCTG GGTGAACCAC ATGACATTAG ATAATGAAAC TGGGATGGAT CCTCCTGGTA 601 TTAGAGTTAG GCCAGTTAGT GGACTTGTTG CTGCAGATTA CTTTGCTCCA GGATACTTTG 661 TCTGGGCAGT TTTGATTGCT AACTTGGCTC GAATTGGATA CGAGGAGAAA ACCATGTATA 721 TGGCTGCATA TGATTGGAGG CTTGCCTTTC AGAACACCGA GGTGCGAGAC CAGACCCTAA 781 GTCGGATAAA AAGCAATATA GAACTGATGG TTGCAACCAG TGGCAAGAAG GCAGTAATTG 841 TTCCACATTC CATGGGGGTT GTATACTTTT TGCATTTTAT GAAGTGGGTG GAGGCACCAG 901 CTCCTGTGGG TGGTGGTGGT GGACCCGATT GGTGCGCCAA GCACATTAAA GCTGTGATGA 961 ATATTGGTGG GCCATTGCTA GGTGTTCCAA AATCTATAGC TGGCCTTTTC TCAGCTGAAG 1021 CACGGGATAT TGCTGTTGCC AGGGCTCTGG CTCCAGGTGT TCTGGACACG GATATGTTCC 1081 ATCTTCAAAC ATTAGAGCAC ATAATGAAGA TGTCACGAAC CTGGGATGCA ACCATGTCAA 1141 TGATACCAAG AGGAGGGGAC ACGATCTGGG GCGGTCTTGA ATGGTCACCC GAGGAAGGCT 1201 ATTCTCCTTG CAGAAGTAAG TCCAGAGACG ATGCTGCTCA GAATTCAGGA CATCACGAGA 1261 ATCAAACTAC AGATTCTAAA GCAAAATATT ACAGTTATGG AAGGATGATG TCCTTTGGAA 1321 AGGATGCAGC AGAAGCTCAT CCATCAGATC TTAAGAGGAT TGACTTCAGG GATGCTGTTA 1381 AGGGCGGTAA TGTTGCAAAT AACACCTGTG ATGTGTGGAA CGAGTACCAA GACATGGGTG 1441 TTAGTGGCAC TAAAGCAGTG GAAGAGTACA AGGTTTATAC AGCTGGAGAG ATTGTGGATT 1501 TGCTCAACTT TGTTGCCCCG AAAATGATGG CCCGTGGTAA TGCTCACTTT TCATATGGGA 1561 TAGCTGATGA TTTGGATGAT CCTAAGTATT CACACTACAA ATATTGGTCA AATCCATTGG 1621 AGACAAAGTT ACCAAATGCT CCTGACATGG AGATCTATTC ACTGTATGGA GTTGGCATTG 1681 AAACTGAAAG AGCATATGTT TACAAGCGGA TACCTACAGC AGGATGCAAT ATTCCATTCC 1741 AGATTGATAC TTCAGCTGAT GATAATGATG AAGGTAGCTG CTTGAAATCT GGTGTTTACA 1801 CGGTAGATGG TGATGAGACT GTGCCTGCAT TAAGCGCAGG ATTCATGTGT GCAAAAGGAT 1861 GGCGAGGAAA AACTAGATTT AATCCCTCTG GTATCAAAAC TTATATAAGG GAGTATTTTC 1921 ATGCTCCCCC TGCAAACCTT CTTGAGGGTC GTGGTACACA GAGTGGTGCA CATGTTGATA 1981 TAATGGGAAA TTTCGCTTTG ATAGAGGATG TAATGAGAGT TGCTGCTGGT GGAACAAGTA 2041 AAAACTTGGG AGGTGATCAA GTTTACTCAG ATATCTTCAA GTGGTCTGAG AAGATCAATT 2101 TACGTTTGTG ATTATCTGTC GAAACTCACC AGTTTCATTG TGTCAACATA CCAGCTTGAC 2161 ATGTTATTTA TGAACTTACA TACCCTTTCC ACTGGATGCC ATTCAAATAT TCAGAAATGA 2221 ACAGGTATTT TTCCAGATAT AGTGCAGGAT TTATTACTTC CGGTGGTGTA AATTGTAACA 2281 TTAACGCAGC TCATCTCGCT ATGCAACAAC AAAGAATAAT AGGTGTGTAG TAGGGATTGA 2341 GTTTGATGTT GTACCATTAA AGCTTATAAT TAGCAGTGTA AGTGAAAGCT GTCGTACTAC 2401 AGGTGTTCAT CCTCTACATT TTATTTGTCC ATTTCGACTT TTTATTTGTC CCTTTTTGCT 2461 TATTTTTGAC AAATAAAAAA AAGATTGATA TTTTTTTTAC CTTTTAAAAA AAAAAAAAAA 2521 AAACTCG Predicted gene structure (within gDNA segment 12097 to 2452): Exon 1 10484 10135 ( 350 n); cDNA 191 538 ( 348 n); score: 0.791 Intron 1 10134 9751 ( 384 n); Pd: 0.210 (s: 0.86), Pa: 0.966 (s: 0.76) Exon 2 9750 9528 ( 223 n); cDNA 539 761 ( 223 n); score: 0.843 Intron 2 9527 9432 ( 96 n); Pd: 0.992 (s: 0.84), Pa: 0.952 (s: 0.92) Exon 3 9431 9148 ( 284 n); cDNA 762 1042 ( 281 n); score: 0.856 Intron 3 9147 8643 ( 505 n); Pd: 0.996 (s: 0.78), Pa: 0.915 (s: 0.68) Exon 4 8642 8315 ( 328 n); cDNA 1043 1370 ( 328 n); score: 0.750 Intron 4 8314 8002 ( 313 n); Pd: 0.959 (s: 0.76), Pa: 1.000 (s: 0.76) Exon 5 8001 7745 ( 257 n); cDNA 1371 1627 ( 257 n); score: 0.833 Intron 5 7744 7628 ( 117 n); Pd: 0.966 (s: 0.90), Pa: 0.934 (s: 0.88) Exon 6 7627 7154 ( 474 n); cDNA 1628 2104 ( 477 n); score: 0.834 Intron 6 7153 3883 (3271 n); Pd: 0.000 (s: 0.88), Pa: 0.696 (s: 0) Exon 7 3882 3867 ( 16 n); cDNA 2105 2120 ( 16 n); score: 0.750 PPA cDNA 2506 2524 MATCH C06HBa0120H21.1-4- SGN-U320342+ 0.816 1932 0.765 C PGS_C06HBa0120H21.1-4-_SGN-U320342+ (10484 10135,9750 9528,9431 9148,8642 8315,8001 7745,7627 7154,3882 3867) Alignment (genomic DNA sequence = upper lines): GAAGATGATA AGAAAAGTAA GAGGAAGATT GCAACGAAGC AGAAATGGTC ATGTATCGAT 10425 ||||| | || ||| | || |||| | | || |||| |||||||||| ||||||||| GAAGA-G-TA GGAAGAAGAA TGTGAAGTCT GGAAAGAAGA AGAAATGGTC GTGTATCGAT 248 AGTTGTTGTT GGTTTGTAGG ATACATTTGT ACTGTATGGT GGATTTTATT ATTTTTGTAC 10365 | ||| |||| |||| ||||| || ||| ||| ||| |||| ||||||| || ||||| ||| AATTGCTGTT GGTTCGTAGG ATGCATATGT TGTGTGTGGT GGATTTTGTT GTTTTTATAC 308 AATGCTATGC CAGCTTCGTT TCCACAGTAC GTAACAGAGA AGATTAATGG GCCAGTAGCT 10305 |||||||||| | || ||||| || ||||| || || ||| ||||| || || || || AATGCTATGC CGGCGTCGTT CCCGCAGTAT GTTACGGAGG CGATTACAGG CCCGTTACCT 368 GATCCTCCTG GCGTAAAGCT ACGAAATGAA GGGCTAAAGG TTAAACATCC AGTAGTTTTT 10245 ||||| || | || | ||| | | || ||| ||| | |||| ||| ||||| || || ||| GATCCACCGG GCATCAAGTT GCAGAAGGAA GGGTTGAAGG CTAAGCATCC GGTGGTATTT 428 GTACCTGGGA TTGTTACTTG TGGCCTTGAG CTATGGGAGG GACATCAGTG TGCTGAAGGA 10185 | ||||| | |||| || || ||| ||||| || ||||| | |||||||||| |||||||||| ATTCCTGGAA TTGTCACGTG TGGACTTGAA CTGTGGGAAG GACATCAGTG TGCTGAAGGA 488 TTGTTTCGAA AGCGGTTATG GGGTGGTACT TTTGGCGAAG TGTATAAAAG GTCAGAGACG 10125 |||||||| | ||||| | || ||| || ||| ||||| |||| |||| ||||| TTGTTTCGGA AGCGGCTTTG GGGCGGGACT TTTGGAGAAG TGTACAAAAG .......... 538 AATCCAGAAT TTGAAATTGT TAGGTTCAAT CTATAAAGTT GTTATGATCA AATTTCATAA 10065 .......... .......... .......... .......... .......... .......... 538 AAGATTATGA GTTCAAACTT CATGTTTTTC GGAATTTTAA TGAAAATAAA CTTATGCTCT 10005 .......... .......... .......... .......... .......... .......... 538 GTGTTAAAAG TATTGAGTTC AGATGAACCT GGTATGTTTT TATAGATCGA TGCAGGGGAA 9945 .......... .......... .......... .......... .......... .......... 538 TAAACTTCAT TGGTATTAAT AGTATGAGAT GAGATCTAGG ATATGGGTAC TATCTCAAAA 9885 .......... .......... .......... .......... .......... .......... 538 TAAATAAACT TATATTTGAT AAAATTTTTA AATATATATA GATGATATGA GTTATGGTGC 9825 .......... .......... .......... .......... .......... .......... 538 TACTGAATAG CTACAATTGC AAAAAGGAAA AAAGGTTTAT TTTAATGATT ATCTGTTTGT 9765 .......... .......... .......... .......... .......... .......... 538 GTGATGGTAA CCAGACCGTT TTGTTGGGCG GAACACATGT CATTGGACAA TGAATCTGGG 9705 ||| | || |||| | | |||||| |||| || || |||| ||||| .......... ....ACCACT ATGCTGGGTG AACCACATGA CATTAGATAA TGAAACTGGG 584 TTGGATCCTC CGGGAATACG GGTTAGGCCA GTTGCTGGAC TTGTTGCAGC AGATTACTTT 9645 ||||||||| | || || | ||||||||| ||| ||||| ||||||| || |||||||||| ATGGATCCTC CTGGTATTAG AGTTAGGCCA GTTAGTGGAC TTGTTGCTGC AGATTACTTT 644 GCACCAGGAT ATTTTGTGTG GGCAGTTTTG ATTGCTAATT TGGCGCGAAT AGGATATGAG 9585 || ||||||| | ||||| || |||||||||| |||||||| | |||| ||||| ||||| ||| GCTCCAGGAT ACTTTGTCTG GGCAGTTTTG ATTGCTAACT TGGCTCGAAT TGGATACGAG 704 GAGAAAACGA TGTATATGGC TGCATATGAC TGGAGACTAT CCATTCAGAA TACTGAGGTA 9525 |||||||| | |||||||||| ||||||||| ||||| || || ||||||| || ||| GAGAAAACCA TGTATATGGC TGCATATGAT TGGAGGCTTG CCTTTCAGAA CACCGAG... 761 TAGATTTAAC TTTCTGTATG CTTGAGCATT GTTGTTTCCT TAATCCACTT AAAAAGTCGT 9465 .......... .......... .......... .......... .......... .......... 761 TGTAATGTGT CAATGGTGAA ATCATTTCAT CAGGTGCGCG ACCAGACACT AAGCCAGATA 9405 ||||| | ||||||| || ||| | |||| .......... .......... .......... ...GTGCGAG ACCAGACCCT AAGTCGGATA 788 AAAAGCAATA TAGAACTGAT GGTTGCAACT AATGGAGGCA ATAAGGCAGT AATTGTTCCA 9345 |||||||||| |||||||||| ||||||||| | | |||| | |||||||| |||||||||| AAAAGCAATA TAGAACTGAT GGTTGCAAC- CA--GTGGCA AGAAGGCAGT AATTGTTCCA 845 CATTCTATGG GAGCTATTTA CTTTTTGTAT TTCATGAAGT GGGTCGAGGC ACCAGCTCCG 9285 ||||| |||| | | | | || ||||||| || || ||||||| |||| ||||| ||||||||| CATTCCATGG GGGTTGTATA CTTTTTGCAT TTTATGAAGT GGGTGGAGGC ACCAGCTCCT 905 ATGGGTGGTG GTGGTGGTCC TGATTGGTGT GCCAAACATA TTAAAGCAGT GATGAATATT 9225 ||||||||| ||||||| || |||||||| ||||| || | ||||||| || |||||||||| GTGGGTGGTG GTGGTGGACC CGATTGGTGC GCCAAGCACA TTAAAGCTGT GATGAATATT 965 GGTGCGCCGT TTCTAGGTGT TCCTAAAGCA TTAGCTGCAC TTTTCTCAGC TGAAGCTCGA 9165 |||| ||| | | |||||||| ||| ||| | |||||| | |||||||||| |||||| || GGTGGGCCAT TGCTAGGTGT TCCAAAATCT ATAGCTGGCC TTTTCTCAGC TGAAGCACGG 1025 GATGTCGCTA TTGCAAGGTA AAATTGGTTC ATGTGGATGT TTTCTCCTTG ACAACCAACT 9105 ||| | ||| |||| || GATATTGCTG TTGCCAG... .......... .......... .......... .......... 1042 GAAGAAATAG TACATCTGGT ATAGAGCTAA ATCATTTCTT GGCTTCTGCA CTTGACTGAG 9045 .......... .......... .......... .......... .......... .......... 1042 AAGCAATACC TCAAAATTAA TCCTAAAATG CTCCTCTAGT TCCGAGTGTT GGGTTTTTAA 8985 .......... .......... .......... .......... .......... .......... 1042 AGGTTTGCTA ATCTGAATAC AGGAGATAAC ACCGAGCTTG GTAATTAAGG CCGTGAGAAA 8925 .......... .......... .......... .......... .......... .......... 1042 TTCTTTCCAT ATCTGTTTCT TGAGCTAGAT TTGAGTATGT GTTGGTGCTT TGACGTGTGA 8865 .......... .......... .......... .......... .......... .......... 1042 GTAGTAATTG CCGTTACACA TGTGGTTTAC ATCCTTATTT CCTTTTCCCT CAACTTTGCA 8805 .......... .......... .......... .......... .......... .......... 1042 CTACGAATCA TTGATCTTCT AAGAGATGCT ATTTCATAGC CTTTTTATCT TTCAACACTA 8745 .......... .......... .......... .......... .......... .......... 1042 GATGTTATGT ACAAAATACA AGTCCTTGAA TGCTTCCGCG TGTAGTCTGT TCCTTCTTCA 8685 .......... .......... .......... .......... .......... .......... 1042 TTGGTAGTCT TCTTGATCAT CACTAGTTTA CTTAAATTTC AGGAGTAAAG CATCAGTTGT 8625 | | | | ||| ||| .......... .......... .......... .......... ..GGCTCTGG CTCCAGGTGT 1060 TATGGACAAG GATTTATTTC GTATTCAAAC ACTACCACAT TTAATGAGGA TGCTTCGGAC 8565 | |||||| | ||| | || | | ||||||| | || || |||||| || || || || TCTGGACACG GATATGTTCC ATCTTCAAAC ATTAGAGCAC ATAATGAAGA TGTCACGAAC 1120 TTGGGATTCA ACCATGTCTA TGTTACCAAA AGGAGGAGAG ACGATTTGGG GTGGTCTTGA 8505 |||||| || |||||||| | || |||||| |||||| || ||||| |||| | |||||||| CTGGGATGCA ACCATGTCAA TGATACCAAG AGGAGGGGAC ACGATCTGGG GCGGTCTTGA 1180 CTGGTCTCCA GAAGAAGGCT ATTCTCCTCG CAAAAGAAAA CTAAGGGACA AAACTAGTCA 8445 ||||| || || ||||||| |||||||| | || ||| || || ||| | || ||| ATGGTCACCC GAGGAAGGCT ATTCTCCTTG CAGAAGTAAG TCCAGAGACG ATGCTGCTCA 1240 TACGTCAAGC CATCAGGACA ATCAAACTGT AGAATCTAAA GGAAAACATG TTAATTATGG 8385 | ||| | ||||| || | |||||||| ||| |||||| | |||| || | |||||| GAATTCAGGA CATCACGAGA ATCAAACTAC AGATTCTAAA GCAAAATATT ACAGTTATGG 1300 AAGGATGATA TCATTTGGAA AGGTTGCAGC ACAGAAACCT TCATCAGATA TTACTAGGAT 8325 ||||||||| || ||||||| ||| |||||| | | | | |||||||| ||| ||||| AAGGATGATG TCCTTTGGAA AGGATGCAGC AGAAGCTCAT CCATCAGATC TTAAGAGGAT 1360 TGACTTCAGA GTAATGTTAA CCAGAAACCA ATGCTCCTTA TTTATCGTTC TCTACATATT 8265 ||||||||| TGACTTCAGG .......... .......... .......... .......... .......... 1370 TCTATGCTTT CTTCTATTCG TTTCAACATG TTAAATAAGG CTCACAAACA TATATTTTTG 8205 .......... .......... .......... .......... .......... .......... 1370 TGCTTGAGAA AGACCTTTGT GAGGCATAGC TTTGGTAGAC CTGTCATCTA ACATTTTGAT 8145 .......... .......... .......... .......... .......... .......... 1370 ACTTATAAAG CAACAAAGAA TAACTACAAC AATTCAATCG AAACAAAAAT ATGATGAACA 8085 .......... .......... .......... .......... .......... .......... 1370 TTAACTCAGT AGTACAGTAG GAAATTATCT CTTCCCCACT TTCTTCTTGT TGTCTCTTAG 8025 .......... .......... .......... .......... .......... .......... 1370 TAGCTGTGTT TGTTTACATG CAGGGTGCAG TGAAGGGCAC GAACAAAGCA AATAACACAT 7965 | ||| | | |||||| || ||| |||||||| | .......... .......... ...GATGCTG TTAAGGGCGG TAATGTTGCA AATAACACCT 1407 GTGATGTGTG GACGGAGTAC TATGACATGG GCGTTGCTGG TATAAAAGCT GTGGAAGAAT 7905 |||||||||| || |||||| | ||||||| | ||| ||| | ||||| |||||||| | GTGATGTGTG GAACGAGTAC CAAGACATGG GTGTTAGTGG CACTAAAGCA GTGGAAGAGT 1467 ACAAGGTTTA TACAGCTGGA GATATATTGG ATCTACTCCA CTTTGTTGCC CCAAAGATGA 7845 |||||||||| |||||||||| || || ||| || | ||| | |||||||||| || || |||| ACAAGGTTTA TACAGCTGGA GAGATTGTGG ATTTGCTCAA CTTTGTTGCC CCGAAAATGA 1527 TGGCTCGTGG AGGCGCTCAT TTTTCATATG GGATAGCTGA AGATTTGGAT GATCCAATGT 7785 |||| ||||| ||||| |||||||||| |||||||||| ||||||||| ||||| | || TGGCCCGTGG TAATGCTCAC TTTTCATATG GGATAGCTGA TGATTTGGAT GATCCTAAGT 1587 ATTCACACTA CAAATACTGG TCAAATCCGT TGGAAACAAA GTGAGTACTT TTCATTTGAA 7725 |||||||||| |||||| ||| |||||||| | |||| ||||| ATTCACACTA CAAATATTGG TCAAATCCAT TGGAGACAAA .......... .......... 1627 CTCTGTCTCC GTACTTGTTG TCTTTGTACA AAGGATATAG TCTACGAGTA GAAAATGACC 7665 .......... .......... .......... .......... .......... .......... 1627 TCGAGTTCAT TTTATCTTTA CCTTGTTAAC ACTACAGGCT ACCAAATGCT CCTGAAATGG 7605 | | |||||||||| ||||| |||| .......... .......... .......... .......GTT ACCAAATGCT CCTGACATGG 1650 AGCTTTATTC GATGTATGGA GTTGGCATTC CAACTGAAAG AGCATATGTT TACGGGCAAA 7545 || | ||||| |||||||| ||||||||| ||||||||| |||||||||| ||| || | AGATCTATTC ACTGTATGGA GTTGGCATTG AAACTGAAAG AGCATATGTT TACAAGCGGA 1710 CACCAATAGC ACAATGTCAT ATTCCATTCC AGATTGAAAC TTCAGCTGAT GA-AGGGAAT 7486 ||| | ||| | ||| || |||||||||| ||||||| || |||||||||| || | || TACCTACAGC AGGATGCAAT ATTCCATTCC AGATTGATAC TTCAGCTGAT GATAATGATG 1770 GA-GT-GTTG TATGAAGAAT GGTGTTTTGA CTGTTGATGG CGATGAGACG GTGCCTATTT 7428 | || | || |||| | ||||||| | | || ||||| |||||||| |||||| | AAGGTAGCTG CTTGAAATCT GGTGTTTACA CGGTAGATGG TGATGAGACT GTGCCTGCAT 1830 TAAGTGCAGG CTTCATGTGT GCAAAAGGAT GGCGCGGAAA AACTAGATTT AATCCATCAG 7368 |||| ||||| ||||||||| |||||||||| |||| ||||| |||||||||| ||||| || | TAAGCGCAGG ATTCATGTGT GCAAAAGGAT GGCGAGGAAA AACTAGATTT AATCCCTCTG 1890 GAATCAAAAC TTATACAAGG GAGTATGATC ATGCTCCTCC CGCAAACCTT CTTGAGGGTC 7308 | |||||||| ||||| |||| |||||| || ||||||| || ||||||||| |||||||||| GTATCAAAAC TTATATAAGG GAGTATTTTC ATGCTCCCCC TGCAAACCTT CTTGAGGGTC 1950 GTGGTACACA GAGTGGAAAT CATGTTGATA TAATGGGAAA TTTCGCTTTG ATTGAAGATA 7248 |||||||||| |||||| |||||||||| |||||||||| |||||||||| || || ||| GTGGTACACA GAGTGGTGCA CATGTTGATA TAATGGGAAA TTTCGCTTTG ATAGAGGATG 2010 TCATGAGAGT TGCAGCTGGT GCAACGGGCA AAGACTTGGG AGGTGATCAA GTTCACTCGG 7188 | |||||||| ||| |||||| | ||| | | || ||||||| |||||||||| ||| |||| | TAATGAGAGT TGCTGCTGGT GGAACAAGTA AAAACTTGGG AGGTGATCAA GTTTACTCAG 2070 ACATCTTTAA GTGGTCTGAG AAGATTGATT TACGTCTTTA GGGGAACACT GGGTGCACTG 7128 | ||||| || |||||||||| ||||| ||| |||| ATATCTTCAA GTGGTCTGAG AAGATCAATT TACG...... .......... .......... 2104 CTTTTCTCTT ATATGATGAA GTACTTGACT CAACGGTTAG AATATCACTT TCATATCTAC 7068 .......... .......... .......... .......... .......... .......... 2104 ATAATTGTGA TGGTGATCTA TCTTTACGTC TTTAAGTTTC ATCATGTCAA GATGTCTGCT 7008 .......... .......... .......... .......... .......... .......... 2104 TCGTGTATAA TCACAAAGTT ATATGCTGAT ATGAGTTCAT CTGGACGCCA CACAAACCCT 6948 .......... .......... .......... .......... .......... .......... 2104 TCCTGTCAAA TGTATGGACA ACAACATACC CTGTATAATC CCACTCATGG GAGATACTAA 6888 .......... .......... .......... .......... .......... .......... 2104 ATTGTGAAAT TATGTCATGA ATCTACCACT GACGTTTGGA GAGCCCTAGC AACACTTAAC 6828 .......... .......... .......... .......... .......... .......... 2104 TTTTATCCGA CTATGTAGAC GATTAAATGG ATTCCAGATA AAATGCAGGA TGTTTTGTGT 6768 .......... .......... .......... .......... .......... .......... 2104 ATCTCGTAGT GTAATATGTC AAAGTTGGAG CTAGTTGCCC AAATCAGGAA CAATTAGGGA 6708 .......... .......... .......... .......... .......... .......... 2104 TAGAACTGAT ATTGTACCAA CATAGTTTTC TCAAAGATAT TGTATTTCAA ATGTGTTGTT 6648 .......... .......... .......... .......... .......... .......... 2104 ATACAGACTT GTGGAATAGC TACTCAATGA ATGAACAACT TTTCTTCTTT ATTTGTGATA 6588 .......... .......... .......... .......... .......... .......... 2104 ACATGAATTT CTTGGATTCT TCAGTTCAGT TGAATGTACT TATTATAATG GAGTATTCTT 6528 .......... .......... .......... .......... .......... .......... 2104 AAAAAAAAGT TGATTTTCCA TTATTATACG TCGAAATAAT ACTGTAGAAA TGCATGATAA 6468 .......... .......... .......... .......... .......... .......... 2104 GTCTATGTAT AATAGACTTT TATAGTCAAA TCCTTTTCCG AACTCCACAA GTAAAGAAAA 6408 .......... .......... .......... .......... .......... .......... 2104 CTTAATGTAT CGAGTTCAGA AGAAACTGAA AAATTGAAAA AAGAAATGGC CTATGAAAAA 6348 .......... .......... .......... .......... .......... .......... 2104 ATGATTGGAT AGAGAGGAGG TCTGACATAT GGGTTTGGGA ATACATCACT AAAAGGCATC 6288 .......... .......... .......... .......... .......... .......... 2104 AGTGAAGCTC ATCGGATTCT TTGTCAAATA GGTATATATA TACTATCACA TAATGTTTGA 6228 .......... .......... .......... .......... .......... .......... 2104 CTTTGCAAGT GTAACAGATT ATTACGGCAG TCGAGTCAGA ATTTTTAATA AGGGATTTAA 6168 .......... .......... .......... .......... .......... .......... 2104 ATTTTGAAAA AATAGACATA TGGATACTAT AGAAATCTAA AAAATTGTTT TAACCATGTA 6108 .......... .......... .......... .......... .......... .......... 2104 AATGATTAAT TTTTCGTCAA AAGGGATTTG AAATGAACAA CCTGACTATA ACGTAGCTTC 6048 .......... .......... .......... .......... .......... .......... 2104 GCCACTGATT ACTATATAAT TGCTAAATTT AATAACTTGA AAAATAAAGC AAACAACCTT 5988 .......... .......... .......... .......... .......... .......... 2104 TTATAACGAG TCAAATTTTA TTTATATTAT AAAAACTCAT TCATTATAGT GTATATGTCT 5928 .......... .......... .......... .......... .......... .......... 2104 AATACACGCA AGCTACAGTT TATATTATAA ATAATATAGA AATAATCAAT TTAGTTATGC 5868 .......... .......... .......... .......... .......... .......... 2104 GCTTAGAAAT GATCAAAATA CGTTTTGAAT ATAAAAAGAA GAAATTGAGG AACATGTCAA 5808 .......... .......... .......... .......... .......... .......... 2104 TTTTTATGTT TGATTTATTT AATGAAAGAT TATAAAAGGA ATATTGTTTT ATTTAAAATA 5748 .......... .......... .......... .......... .......... .......... 2104 CGAACTTTAA AATGTCAATT TATATGTTCC ATGTAAGCAA ATTTTTGTCA TTTTTAATTG 5688 .......... .......... .......... .......... .......... .......... 2104 GATCATATAA ATCGTTGTTT AAGAATAGTT ATATTATTAA TCCTAATAAT GCATTTTTAC 5628 .......... .......... .......... .......... .......... .......... 2104 TTTATTTAAA ATGACATATA AATTTCTGCA TATGTCTATC ATGCAACAAG AAAAATAAAA 5568 .......... .......... .......... .......... .......... .......... 2104 CCCAAACAAA TAACGTAACC AAAGAAAAAA GATTTCTCAT TTAGGATTAA GAAAATTTAT 5508 .......... .......... .......... .......... .......... .......... 2104 GTACGCTGAT AAAATTAAAA ATGAAGAAAA CTTTCTAATT CGTTAAGTAA TATTTTTTAA 5448 .......... .......... .......... .......... .......... .......... 2104 CTCTAATTAG CTACTAAATA ATATCCAATT CTATAGATGT AAATATCCTT GCATATATGC 5388 .......... .......... .......... .......... .......... .......... 2104 TCTGATCATT TTTGCTATTC GCTATCTTAT TATTATTATT TTGGTTTGTT TTTTGAAAAT 5328 .......... .......... .......... .......... .......... .......... 2104 GAACTTTAAT TTAGAAGAGA TAGTTATATA TATTCTTCGT AATTTCACTG AATTTATTAT 5268 .......... .......... .......... .......... .......... .......... 2104 TTTCGTAATT ATTGATACAG ACAAAGGTGT TTAAACGGAC ATCGTAATGG AGTGTGTGTG 5208 .......... .......... .......... .......... .......... .......... 2104 AGAGAGAACC CATTTTGAGG AATTCGGGGG CAATTTCAGT TTCTCGGTGG CGGAAGACGC 5148 .......... .......... .......... .......... .......... .......... 2104 GCAGAGACTT CACCCTTTCA ATTCCGTTGA CACATATACA GAGGTGTCGT TTCTGATAAT 5088 .......... .......... .......... .......... .......... .......... 2104 CAATTTCACT TTCTCTTTGC TACGATCGCT ACATTCTCTC TCTCTCTTTG TAAGTATCGC 5028 .......... .......... .......... .......... .......... .......... 2104 CTTTTTAATC TACTTGTTTA TTGATGGATC ATGTAGTTTG AATCGCACAA AAATATGGGT 4968 .......... .......... .......... .......... .......... .......... 2104 CTAGGGTTTT CTAATCGAGC TCAGGCAAAA GAAGTTGTTC TTTTTTTGTT TTAAAGTTGG 4908 .......... .......... .......... .......... .......... .......... 2104 GATTTCAATT TATTACTTGT GAATTTGTGA AGCAGCTGTA AAAGCTGTGA TTTTGGGTTA 4848 .......... .......... .......... .......... .......... .......... 2104 GCTCCTACTG CGTTGATTGT CTTGTTTGGG AAGTTGGGTG CTTTGGAGAA GGTTTCTTTT 4788 .......... .......... .......... .......... .......... .......... 2104 ATCGGAAATG CAGTTATTCT TGATTATACG TGCTTTTTGT GCCAAATTGT TTGTTTTGTT 4728 .......... .......... .......... .......... .......... .......... 2104 TCTGGCTCTT GTTGTTTCAA GTTTATAAAA ATCAATAAAT GAACCTAAAT ATAGCGGAAT 4668 .......... .......... .......... .......... .......... .......... 2104 GGATATAGAT GTTGCTGAAC CAACTAGTCT AGGATGAGAG TTGTTGCTGT ATATTAACTC 4608 .......... .......... .......... .......... .......... .......... 2104 ACAGTGGTGA ATAGGTACCT GTAGACTCCA TTTGTCACTT GGTATGCTTA TGTTTAGTAC 4548 .......... .......... .......... .......... .......... .......... 2104 AGAAAAAGAA GGGAAGAGAC AAGAAAAAGA TCAGAGTGGA CAGAAGACGT TGAGCTTAAG 4488 .......... .......... .......... .......... .......... .......... 2104 TTTCTAATGA AGGGAAAATA GATTCACAGC ATGCCGTTTC TTGGTTACTG TTAGTTGGTT 4428 .......... .......... .......... .......... .......... .......... 2104 TGTCAAGATT CTGGTAGATG GGCTCCCCTT ATTCGAGCGA GGTTGTTCCC GAAAGGTCCT 4368 .......... .......... .......... .......... .......... .......... 2104 CAGCTAAAAA ATAGAGTAAT CAAAGCAAAA TAACGACAAC TATATAGCAA AATAAACTTA 4308 .......... .......... .......... .......... .......... .......... 2104 GCAAATGAAG CAACATGTAG TTGTAACTCG TAACAATGAA GAATAAGATA CTATGTGTAT 4248 .......... .......... .......... .......... .......... .......... 2104 ACTAAAATAA TACTACAGAA TAAATGGAAG ATGAGAAGAG GAGACTAACC CCTGCCTCTC 4188 .......... .......... .......... .......... .......... .......... 2104 CCACATAAAA TTACCAAACA CTCTGCTATC TTCTAAATTT CAATAATTGA TCATTTAAAA 4128 .......... .......... .......... .......... .......... .......... 2104 GCTTAATAGC AGCTGAGGTT TTAGCTAGTT CACAATTGTT CCTTCACCGA GTCTTTGATC 4068 .......... .......... .......... .......... .......... .......... 2104 TTGGGCTTAT TAATGAAAAT CTTACTTGAT TAAAGAAAAA TACCACTCTG ATGTTGAGCT 4008 .......... .......... .......... .......... .......... .......... 2104 TAACGTCATT TTCCCTTTCT TTTCCCAGAA CCCTGCCAGT AAAAAGATTT GTAAGAGCTC 3948 .......... .......... .......... .......... .......... .......... 2104 ACTTCATCGG ACTCAAATTT AATAAAGCAT TTCCTCAAAT GTTAACTCTT GGTTTTTCCT 3888 .......... .......... .......... .......... .......... .......... 2104 GGTAGATTGG GTTTTTCTGT C 3867 ||| | || ||||| | .....TTTGT GATTATCTGT C 2120 hqPGS_C06HBa0120H21.1-4-_SGN-U320342+ (10484 10135,9750 9528,9431 9148,8642 8315,8001 7745,7627 7154) ******************************************************************************** EST sequence 4 +strand 890 n (File: SGN-U335137+) 1 AGAGCTGGAG CTCCCCGCGG TGGCGGCCGC TCTAGAACTA GTGGATCCCC CGGGCTGCAG 61 GGAAAAACAT ATATCTTATT GCTATACACT TATAATTATG CAATATACAT ACATTTTAAT 121 TCGGTTCAAT TATATGCAAA GCAAATTTTA TAATAATATT GCAGCGAAAT AGGCAGCGAA 181 TTATACAATT GTAGTTAAAT AGGATAGCGA ATTATACAAT TGCAGCGAAA TAGGCCAGCG 241 AATTATACAA TTTAGGTCAG CGAATTATAC AATTTTATGT TTGCTATGGA GCGCAATTAT 301 GCAAACTTTG TTATAGCATA CAAATATGAA TTTTTTGTTT GTTATATGTG AAAGTTGCCC 361 ATTAATTAAA GAGTCAATAA AATAGCGGTC TAATCAAGTC CTAAGCAGCC CAAGGTTTGA 421 CCCGTCAACC CAAAAATGAA TAATATTATG GGAAAAGGGT ATGATATACC CCTCAACTTT 481 GTCATTTGAA ACTGATATAC CCCTCGTTAT GAAAGTGGCT CATATATACC CTTACCGTTA 541 TACAAATGGT TCACATATAC CTCTGCCGTT ACAAAATGAG CTCATATATA CCCTTCATTT 601 AACGAAAGTG AAAAAATTAC TTTTAAATTT ATAATTTTAC TTTAATTTTA AAAAAAAATA 661 TATTTAGGGG CATATATGAT TCTTTTTTCA AAGTTCAAGG TATATTTCAA ATTTTTTTAA 721 TACATAAATT ATTTTTTTGA CTTATTTATT ATAATTATTT GAGTTNATTA ATCTTGTTTT 781 TTTTCTTNCA TTCCCNTAGT GTAAGAAAAA AATTTAAAAC ATTTTTTATG TCTATATGGA 841 ATTTAATTTT TGGTATCGAA GAAAAGAATG GTCATNCTAC ATAGTTTTAC Predicted gene structure (within gDNA segment 12097 to 3724): Exon 1 11540 11481 ( 60 n); cDNA 145 201 ( 57 n); score: 0.783 Intron 1 11480 11329 ( 152 n); Pd: 0.000 (s: 0.80), Pa: 0.000 (s: 0.88) Exon 2 11328 11256 ( 73 n); cDNA 202 274 ( 73 n); score: 0.890 Intron 2 11255 11226 ( 30 n); Pd: 0.000 (s: 0.90), Pa: 0.000 (s: 0.78) ?? Exon 3 11225 11140 ( 86 n); cDNA 275 363 ( 89 n); score: 0.808 Intron 3 11139 8643 (2497 n); Pd: 0.000 (s: 0.86), Pa: 0.915 (s: 0) Exon 4 8642 8627 ( 16 n); cDNA 364 379 ( 16 n); score: 0.625 Intron 4 8626 7774 ( 853 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0) Exon 5 7773 7745 ( 29 n); cDNA 380 408 ( 29 n); score: 0.586 Intron 5 7744 5248 (2497 n); Pd: 0.966 (s: 0), Pa: 0.992 (s: 0) Exon 6 5247 5225 ( 23 n); cDNA 409 429 ( 21 n); score: 0.609 Intron 6 5224 4992 ( 233 n); Pd: 0.000 (s: 0), Pa: 0.993 (s: 0.54) Exon 7 4991 4944 ( 48 n); cDNA 430 476 ( 47 n); score: 0.542 Intron 7 4943 4579 ( 365 n); Pd: 0.789 (s: 0.54), Pa: 0.000 (s: 0) Exon 8 4578 4567 ( 12 n); cDNA 477 488 ( 12 n); score: 0.833 MATCH C06HBa0120H21.1-4- SGN-U335137+ 0.829 347 0.390 C PGS_C06HBa0120H21.1-4-_SGN-U335137+ (11540 11481,11328 11256,11225 11140,8642 8627,7773 7745,5247 5225,4991 4944,4578 4567) Alignment (genomic DNA sequence = upper lines): ATTTTACTTC AAATATTGCA AAGAAAAAGG CCAACGAATT ATACAATTGT GAATTATACA 11481 |||||| | ||||||||| |||| ||| || |||||| |||||||||| | ||| | | ATTTTA-TAA TAATATTGCA GCGAAATAGG -CAGCGAATT ATACAATTGT -AGTTAAATA 201 ATTGCAGTGA AATACAATTT TCTCTAGCTT TATACAACAG AAGTGTATAT ATTGTGTTTC 11421 .......... .......... .......... .......... .......... .......... 201 TGTTTTTATA TAAAGCGAGA AAAACATATA TCTTCTTGCT ATACACTTAT AATTATGCAA 11361 .......... .......... .......... .......... .......... .......... 201 TATACGTACA TTTTAATTCG ATTCAACTGT ATGCAAAGCA AATTATACAA TTGCAGCGAA 11301 | | ||| |||||||||| |||||||||| .......... .......... .......... ..GGATAGCG AATTATACAA TTGCAGCGAA 229 ATAAGTCAGT GAATTATACA ATTTAGGCCA TCGAATTATA CAATTGTATA TGTATAGCGA 11241 ||| | ||| |||||||||| ||||||| || ||||||||| ||||| ATAGGCCAGC GAATTATACA ATTTAGGTCA GCGAATTATA CAATT..... .......... 274 ATTATACATT TTCTA---TG TTTGCTATGG AGCGCAATTA TACAAACTTT GCTATAGCGT 11184 || |||||||||| |||||||||| | |||||||| | |||||| | .......... .....TTATG TTTGCTATGG AGCGCAATTA TGCAAACTTT GTTATAGCAT 319 ACAAATATAA ATTTTTTATT TGCTATATGT GAAAATTGTC CTTTTTAATT TTTGCCTTTC 11124 |||||||| | ||||||| || || ||||||| |||| ||| | | || ACAAATATGA ATTTTTTGTT TGTTATATGT GAAAGTTGCC CATT...... .......... 363 GCGCTTTTAA GTAACAAAAA GCTTAGCCCA AAATATTCCT AACCTTGAAA AAAGGCACAA 11064 .......... .......... .......... .......... .......... .......... 363 GTGCAGCGTA TATACATAAG TTATGCCTTA TAAGACATGA ATTAGTTCCT ACAGAACTTT 11004 .......... .......... .......... .......... .......... .......... 363 TATCGATAGA GGCATAAAAT TAGTTCTAGG CTTATGTCAT AAGCAGTTCA TAACCAACAA 10944 .......... .......... .......... .......... .......... .......... 363 ATTATATCTC ATCACTAAGA CATAACTTGG TTCGTATGAA ACTTATATCA ATAGAGGCAT 10884 .......... .......... .......... .......... .......... .......... 363 AAGTCCGTGC CTCAAGAGGC AGTAACAAAA TTAGATACTA CAAAACTTAT ATCGTACAAA 10824 .......... .......... .......... .......... .......... .......... 363 ATAAAAGTTG TTTAAACTCA TGCATTACGA AATAAATTAG GTCATAGTCA AGGACTGATT 10764 .......... .......... .......... .......... .......... .......... 363 AACCTGTAAA TACAAAAAAA CTTGATGAAA CACCATAATG TTTTGGTTAG GTTCAATTCA 10704 .......... .......... .......... .......... .......... .......... 363 AAATTAACAA AAATTTAATT AAGCACAAGT GTTATTTTTC CCTCCACTAA TCTAAAATCT 10644 .......... .......... .......... .......... .......... .......... 363 TGATCCACTA TAATAGGAAG AGCTATTACA CATAACATAC TTGTATCAAT CCCAAACAAA 10584 .......... .......... .......... .......... .......... .......... 363 AAATAATACT ATTAAAAATT TGTTTTGATT ATGGAGTTAA TTAGAAGAAG AAATGCACCT 10524 .......... .......... .......... .......... .......... .......... 363 CATCAAAATG AAATTCCTCC CCTGGAACTA CTTGACGAGG AAGATGATAA GAAAAGTAAG 10464 .......... .......... .......... .......... .......... .......... 363 AGGAAGATTG CAACGAAGCA GAAATGGTCA TGTATCGATA GTTGTTGTTG GTTTGTAGGA 10404 .......... .......... .......... .......... .......... .......... 363 TACATTTGTA CTGTATGGTG GATTTTATTA TTTTTGTACA ATGCTATGCC AGCTTCGTTT 10344 .......... .......... .......... .......... .......... .......... 363 CCACAGTACG TAACAGAGAA GATTAATGGG CCAGTAGCTG ATCCTCCTGG CGTAAAGCTA 10284 .......... .......... .......... .......... .......... .......... 363 CGAAATGAAG GGCTAAAGGT TAAACATCCA GTAGTTTTTG TACCTGGGAT TGTTACTTGT 10224 .......... .......... .......... .......... .......... .......... 363 GGCCTTGAGC TATGGGAGGG ACATCAGTGT GCTGAAGGAT TGTTTCGAAA GCGGTTATGG 10164 .......... .......... .......... .......... .......... .......... 363 GGTGGTACTT TTGGCGAAGT GTATAAAAGG TCAGAGACGA ATCCAGAATT TGAAATTGTT 10104 .......... .......... .......... .......... .......... .......... 363 AGGTTCAATC TATAAAGTTG TTATGATCAA ATTTCATAAA AGATTATGAG TTCAAACTTC 10044 .......... .......... .......... .......... .......... .......... 363 ATGTTTTTCG GAATTTTAAT GAAAATAAAC TTATGCTCTG TGTTAAAAGT ATTGAGTTCA 9984 .......... .......... .......... .......... .......... .......... 363 GATGAACCTG GTATGTTTTT ATAGATCGAT GCAGGGGAAT AAACTTCATT GGTATTAATA 9924 .......... .......... .......... .......... .......... .......... 363 GTATGAGATG AGATCTAGGA TATGGGTACT ATCTCAAAAT AAATAAACTT ATATTTGATA 9864 .......... .......... .......... .......... .......... .......... 363 AAATTTTTAA ATATATATAG ATGATATGAG TTATGGTGCT ACTGAATAGC TACAATTGCA 9804 .......... .......... .......... .......... .......... .......... 363 AAAAGGAAAA AAGGTTTATT TTAATGATTA TCTGTTTGTG TGATGGTAAC CAGACCGTTT 9744 .......... .......... .......... .......... .......... .......... 363 TGTTGGGCGG AACACATGTC ATTGGACAAT GAATCTGGGT TGGATCCTCC GGGAATACGG 9684 .......... .......... .......... .......... .......... .......... 363 GTTAGGCCAG TTGCTGGACT TGTTGCAGCA GATTACTTTG CACCAGGATA TTTTGTGTGG 9624 .......... .......... .......... .......... .......... .......... 363 GCAGTTTTGA TTGCTAATTT GGCGCGAATA GGATATGAGG AGAAAACGAT GTATATGGCT 9564 .......... .......... .......... .......... .......... .......... 363 GCATATGACT GGAGACTATC CATTCAGAAT ACTGAGGTAT AGATTTAACT TTCTGTATGC 9504 .......... .......... .......... .......... .......... .......... 363 TTGAGCATTG TTGTTTCCTT AATCCACTTA AAAAGTCGTT GTAATGTGTC AATGGTGAAA 9444 .......... .......... .......... .......... .......... .......... 363 TCATTTCATC AGGTGCGCGA CCAGACACTA AGCCAGATAA AAAGCAATAT AGAACTGATG 9384 .......... .......... .......... .......... .......... .......... 363 GTTGCAACTA ATGGAGGCAA TAAGGCAGTA ATTGTTCCAC ATTCTATGGG AGCTATTTAC 9324 .......... .......... .......... .......... .......... .......... 363 TTTTTGTATT TCATGAAGTG GGTCGAGGCA CCAGCTCCGA TGGGTGGTGG TGGTGGTCCT 9264 .......... .......... .......... .......... .......... .......... 363 GATTGGTGTG CCAAACATAT TAAAGCAGTG ATGAATATTG GTGCGCCGTT TCTAGGTGTT 9204 .......... .......... .......... .......... .......... .......... 363 CCTAAAGCAT TAGCTGCACT TTTCTCAGCT GAAGCTCGAG ATGTCGCTAT TGCAAGGTAA 9144 .......... .......... .......... .......... .......... .......... 363 AATTGGTTCA TGTGGATGTT TTCTCCTTGA CAACCAACTG AAGAAATAGT ACATCTGGTA 9084 .......... .......... .......... .......... .......... .......... 363 TAGAGCTAAA TCATTTCTTG GCTTCTGCAC TTGACTGAGA AGCAATACCT CAAAATTAAT 9024 .......... .......... .......... .......... .......... .......... 363 CCTAAAATGC TCCTCTAGTT CCGAGTGTTG GGTTTTTAAA GGTTTGCTAA TCTGAATACA 8964 .......... .......... .......... .......... .......... .......... 363 GGAGATAACA CCGAGCTTGG TAATTAAGGC CGTGAGAAAT TCTTTCCATA TCTGTTTCTT 8904 .......... .......... .......... .......... .......... .......... 363 GAGCTAGATT TGAGTATGTG TTGGTGCTTT GACGTGTGAG TAGTAATTGC CGTTACACAT 8844 .......... .......... .......... .......... .......... .......... 363 GTGGTTTACA TCCTTATTTC CTTTTCCCTC AACTTTGCAC TACGAATCAT TGATCTTCTA 8784 .......... .......... .......... .......... .......... .......... 363 AGAGATGCTA TTTCATAGCC TTTTTATCTT TCAACACTAG ATGTTATGTA CAAAATACAA 8724 .......... .......... .......... .......... .......... .......... 363 GTCCTTGAAT GCTTCCGCGT GTAGTCTGTT CCTTCTTCAT TGGTAGTCTT CTTGATCATC 8664 .......... .......... .......... .......... .......... .......... 363 ACTAGTTTAC TTAAATTTCA GGAGTAAAGC ATCAGTTGTT ATGGACAAGG ATTTATTTCG 8604 | ||||| ||| | .......... .......... .AATTAAAGA GTCAATA... .......... .......... 379 TATTCAAACA CTACCACATT TAATGAGGAT GCTTCGGACT TGGGATTCAA CCATGTCTAT 8544 .......... .......... .......... .......... .......... .......... 379 GTTACCAAAA GGAGGAGAGA CGATTTGGGG TGGTCTTGAC TGGTCTCCAG AAGAAGGCTA 8484 .......... .......... .......... .......... .......... .......... 379 TTCTCCTCGC AAAAGAAAAC TAAGGGACAA AACTAGTCAT ACGTCAAGCC ATCAGGACAA 8424 .......... .......... .......... .......... .......... .......... 379 TCAAACTGTA GAATCTAAAG GAAAACATGT TAATTATGGA AGGATGATAT CATTTGGAAA 8364 .......... .......... .......... .......... .......... .......... 379 GGTTGCAGCA CAGAAACCTT CATCAGATAT TACTAGGATT GACTTCAGAG TAATGTTAAC 8304 .......... .......... .......... .......... .......... .......... 379 CAGAAACCAA TGCTCCTTAT TTATCGTTCT CTACATATTT CTATGCTTTC TTCTATTCGT 8244 .......... .......... .......... .......... .......... .......... 379 TTCAACATGT TAAATAAGGC TCACAAACAT ATATTTTTGT GCTTGAGAAA GACCTTTGTG 8184 .......... .......... .......... .......... .......... .......... 379 AGGCATAGCT TTGGTAGACC TGTCATCTAA CATTTTGATA CTTATAAAGC AACAAAGAAT 8124 .......... .......... .......... .......... .......... .......... 379 AACTACAACA ATTCAATCGA AACAAAAATA TGATGAACAT TAACTCAGTA GTACAGTAGG 8064 .......... .......... .......... .......... .......... .......... 379 AAATTATCTC TTCCCCACTT TCTTCTTGTT GTCTCTTAGT AGCTGTGTTT GTTTACATGC 8004 .......... .......... .......... .......... .......... .......... 379 AGGGTGCAGT GAAGGGCACG AACAAAGCAA ATAACACATG TGATGTGTGG ACGGAGTACT 7944 .......... .......... .......... .......... .......... .......... 379 ATGACATGGG CGTTGCTGGT ATAAAAGCTG TGGAAGAATA CAAGGTTTAT ACAGCTGGAG 7884 .......... .......... .......... .......... .......... .......... 379 ATATATTGGA TCTACTCCAC TTTGTTGCCC CAAAGATGAT GGCTCGTGGA GGCGCTCATT 7824 .......... .......... .......... .......... .......... .......... 379 TTTCATATGG GATAGCTGAA GATTTGGATG ATCCAATGTA TTCACACTAC AAATACTGGT 7764 ||||| ||| .......... .......... .......... .......... .......... AAATAGCGGT 389 CAAATCCGTT GGAAACAAAG TGAGTACTTT TCATTTGAAC TCTGTCTCCG TACTTGTTGT 7704 | |||| | || | CTAATCAAGT CCTAAGCAG. .......... .......... .......... .......... 408 CTTTGTACAA AGGATATAGT CTACGAGTAG AAAATGACCT CGAGTTCATT TTATCTTTAC 7644 .......... .......... .......... .......... .......... .......... 408 CTTGTTAACA CTACAGGCTA CCAAATGCTC CTGAAATGGA GCTTTATTCG ATGTATGGAG 7584 .......... .......... .......... .......... .......... .......... 408 TTGGCATTCC AACTGAAAGA GCATATGTTT ACGGGCAAAC ACCAATAGCA CAATGTCATA 7524 .......... .......... .......... .......... .......... .......... 408 TTCCATTCCA GATTGAAACT TCAGCTGATG AAGGGAATGA GTGTTGTATG AAGAATGGTG 7464 .......... .......... .......... .......... .......... .......... 408 TTTTGACTGT TGATGGCGAT GAGACGGTGC CTATTTTAAG TGCAGGCTTC ATGTGTGCAA 7404 .......... .......... .......... .......... .......... .......... 408 AAGGATGGCG CGGAAAAACT AGATTTAATC CATCAGGAAT CAAAACTTAT ACAAGGGAGT 7344 .......... .......... .......... .......... .......... .......... 408 ATGATCATGC TCCTCCCGCA AACCTTCTTG AGGGTCGTGG TACACAGAGT GGAAATCATG 7284 .......... .......... .......... .......... .......... .......... 408 TTGATATAAT GGGAAATTTC GCTTTGATTG AAGATATCAT GAGAGTTGCA GCTGGTGCAA 7224 .......... .......... .......... .......... .......... .......... 408 CGGGCAAAGA CTTGGGAGGT GATCAAGTTC ACTCGGACAT CTTTAAGTGG TCTGAGAAGA 7164 .......... .......... .......... .......... .......... .......... 408 TTGATTTACG TCTTTAGGGG AACACTGGGT GCACTGCTTT TCTCTTATAT GATGAAGTAC 7104 .......... .......... .......... .......... .......... .......... 408 TTGACTCAAC GGTTAGAATA TCACTTTCAT ATCTACATAA TTGTGATGGT GATCTATCTT 7044 .......... .......... .......... .......... .......... .......... 408 TACGTCTTTA AGTTTCATCA TGTCAAGATG TCTGCTTCGT GTATAATCAC AAAGTTATAT 6984 .......... .......... .......... .......... .......... .......... 408 GCTGATATGA GTTCATCTGG ACGCCACACA AACCCTTCCT GTCAAATGTA TGGACAACAA 6924 .......... .......... .......... .......... .......... .......... 408 CATACCCTGT ATAATCCCAC TCATGGGAGA TACTAAATTG TGAAATTATG TCATGAATCT 6864 .......... .......... .......... .......... .......... .......... 408 ACCACTGACG TTTGGAGAGC CCTAGCAACA CTTAACTTTT ATCCGACTAT GTAGACGATT 6804 .......... .......... .......... .......... .......... .......... 408 AAATGGATTC CAGATAAAAT GCAGGATGTT TTGTGTATCT CGTAGTGTAA TATGTCAAAG 6744 .......... .......... .......... .......... .......... .......... 408 TTGGAGCTAG TTGCCCAAAT CAGGAACAAT TAGGGATAGA ACTGATATTG TACCAACATA 6684 .......... .......... .......... .......... .......... .......... 408 GTTTTCTCAA AGATATTGTA TTTCAAATGT GTTGTTATAC AGACTTGTGG AATAGCTACT 6624 .......... .......... .......... .......... .......... .......... 408 CAATGAATGA ACAACTTTTC TTCTTTATTT GTGATAACAT GAATTTCTTG GATTCTTCAG 6564 .......... .......... .......... .......... .......... .......... 408 TTCAGTTGAA TGTACTTATT ATAATGGAGT ATTCTTAAAA AAAAGTTGAT TTTCCATTAT 6504 .......... .......... .......... .......... .......... .......... 408 TATACGTCGA AATAATACTG TAGAAATGCA TGATAAGTCT ATGTATAATA GACTTTTATA 6444 .......... .......... .......... .......... .......... .......... 408 GTCAAATCCT TTTCCGAACT CCACAAGTAA AGAAAACTTA ATGTATCGAG TTCAGAAGAA 6384 .......... .......... .......... .......... .......... .......... 408 ACTGAAAAAT TGAAAAAAGA AATGGCCTAT GAAAAAATGA TTGGATAGAG AGGAGGTCTG 6324 .......... .......... .......... .......... .......... .......... 408 ACATATGGGT TTGGGAATAC ATCACTAAAA GGCATCAGTG AAGCTCATCG GATTCTTTGT 6264 .......... .......... .......... .......... .......... .......... 408 CAAATAGGTA TATATATACT ATCACATAAT GTTTGACTTT GCAAGTGTAA CAGATTATTA 6204 .......... .......... .......... .......... .......... .......... 408 CGGCAGTCGA GTCAGAATTT TTAATAAGGG ATTTAAATTT TGAAAAAATA GACATATGGA 6144 .......... .......... .......... .......... .......... .......... 408 TACTATAGAA ATCTAAAAAA TTGTTTTAAC CATGTAAATG ATTAATTTTT CGTCAAAAGG 6084 .......... .......... .......... .......... .......... .......... 408 GATTTGAAAT GAACAACCTG ACTATAACGT AGCTTCGCCA CTGATTACTA TATAATTGCT 6024 .......... .......... .......... .......... .......... .......... 408 AAATTTAATA ACTTGAAAAA TAAAGCAAAC AACCTTTTAT AACGAGTCAA ATTTTATTTA 5964 .......... .......... .......... .......... .......... .......... 408 TATTATAAAA ACTCATTCAT TATAGTGTAT ATGTCTAATA CACGCAAGCT ACAGTTTATA 5904 .......... .......... .......... .......... .......... .......... 408 TTATAAATAA TATAGAAATA ATCAATTTAG TTATGCGCTT AGAAATGATC AAAATACGTT 5844 .......... .......... .......... .......... .......... .......... 408 TTGAATATAA AAAGAAGAAA TTGAGGAACA TGTCAATTTT TATGTTTGAT TTATTTAATG 5784 .......... .......... .......... .......... .......... .......... 408 AAAGATTATA AAAGGAATAT TGTTTTATTT AAAATACGAA CTTTAAAATG TCAATTTATA 5724 .......... .......... .......... .......... .......... .......... 408 TGTTCCATGT AAGCAAATTT TTGTCATTTT TAATTGGATC ATATAAATCG TTGTTTAAGA 5664 .......... .......... .......... .......... .......... .......... 408 ATAGTTATAT TATTAATCCT AATAATGCAT TTTTACTTTA TTTAAAATGA CATATAAATT 5604 .......... .......... .......... .......... .......... .......... 408 TCTGCATATG TCTATCATGC AACAAGAAAA ATAAAACCCA AACAAATAAC GTAACCAAAG 5544 .......... .......... .......... .......... .......... .......... 408 AAAAAAGATT TCTCATTTAG GATTAAGAAA ATTTATGTAC GCTGATAAAA TTAAAAATGA 5484 .......... .......... .......... .......... .......... .......... 408 AGAAAACTTT CTAATTCGTT AAGTAATATT TTTTAACTCT AATTAGCTAC TAAATAATAT 5424 .......... .......... .......... .......... .......... .......... 408 CCAATTCTAT AGATGTAAAT ATCCTTGCAT ATATGCTCTG ATCATTTTTG CTATTCGCTA 5364 .......... .......... .......... .......... .......... .......... 408 TCTTATTATT ATTATTTTGG TTTGTTTTTT GAAAATGAAC TTTAATTTAG AAGAGATAGT 5304 .......... .......... .......... .......... .......... .......... 408 TATATATATT CTTCGTAATT TCACTGAATT TATTATTTTC GTAATTATTG ATACAGACAA 5244 | | .......... .......... .......... .......... .......... ......CCCA 412 AGGTGTTTAA ACGGACATCG TAATGGAGTG TGTGTGAGAG AGAACCCATT TTGAGGAATT 5184 | | |||| | | | || | A-G-GTTTGA CCCGTCAAC. .......... .......... .......... .......... 429 CGGGGGCAAT TTCAGTTTCT CGGTGGCGGA AGACGCGCAG AGACTTCACC CTTTCAATTC 5124 .......... .......... .......... .......... .......... .......... 429 CGTTGACACA TATACAGAGG TGTCGTTTCT GATAATCAAT TTCACTTTCT CTTTGCTACG 5064 .......... .......... .......... .......... .......... .......... 429 ATCGCTACAT TCTCTCTCTC TCTTTGTAAG TATCGCCTTT TTAATCTACT TGTTTATTGA 5004 .......... .......... .......... .......... .......... .......... 429 TGGATCATGT AGTTTGAATC GCACAAAAAT ATGGGTCTAG GGTTTTCTAA TCGAGCTCAG 4944 || | | || | | ||||| || ||| | | | | |||| .......... ..CCAAAAAT GAATAATATT ATGGGAAAAG GGTATGAT-A TACCCCTCAA 476 GCAAAAGAAG TTGTTCTTTT TTTGTTTTAA AGTTGGGATT TCAATTTATT ACTTGTGAAT 4884 .......... .......... .......... .......... .......... .......... 476 TTGTGAAGCA GCTGTAAAAG CTGTGATTTT GGGTTAGCTC CTACTGCGTT GATTGTCTTG 4824 .......... .......... .......... .......... .......... .......... 476 TTTGGGAAGT TGGGTGCTTT GGAGAAGGTT TCTTTTATCG GAAATGCAGT TATTCTTGAT 4764 .......... .......... .......... .......... .......... .......... 476 TATACGTGCT TTTTGTGCCA AATTGTTTGT TTTGTTTCTG GCTCTTGTTG TTTCAAGTTT 4704 .......... .......... .......... .......... .......... .......... 476 ATAAAAATCA ATAAATGAAC CTAAATATAG CGGAATGGAT ATAGATGTTG CTGAACCAAC 4644 .......... .......... .......... .......... .......... .......... 476 TAGTCTAGGA TGAGAGTTGT TGCTGTATAT TAACTCACAG TGGTGAATAG GTACCTGTAG 4584 .......... .......... .......... .......... .......... .......... 476 ACTCCATTTG TCACTTG 4567 |||| ||| ||| .....CTTTG TCATTTG 488 hqPGS_C06HBa0120H21.1-4-_SGN-U335137+ (11540 11481,11328 11256,11225 11140) ******************************************************************************** EST sequence 1 -strand 1136 n (File: SGN-U325194-) 1 TTTTTTTTTA GGCAGCGAAT TATACAATTG TGCATTATAC AATTGCAGTG AAATAGGATA 61 GCGAATTATA CAATTGTATA TGTATAGCGA ATTATACAAT TTTATGTTTG CTATGGAACG 121 CAATTATGCG AAGTTTGCTA TAGCATACAA ATATGAATTT TTTGTTTGCT GTATGTGAAA 181 AAAGTTGCCC AAAAATAAAT GAGAGTTAGA TTTAAATGAA ATGGACACAT TGATTTCTCG 241 TCTAAAAGCA TTAAAGGTGG TTGATTCTTA CTCGACTAAG TGCAGTAGCT TCCCCAAGTC 301 CATATAACGA ATCAATCAAC AGTGTCCGCA GTGAAGCTGG ACCTCTAGCC ATGTCCATCC 361 CTGTTTCACT TGCGACACCA AATATAGCCA ATGCAGAAGC TGTTGCTTCC ACAGCATGGG 421 ATGGATCAAC AGCGACAAAG GCAGCAATGA GGGCAGTGAC TGAACATCCA GATGCAGTAA 481 TCTTTTGCAA CATAGGCACC CCATTGTGAA CACAAACAAC CCTATCACCG TCTGTGATAT 541 AGTCAACAGC TCCAGAAACA GCGACTACAC TACCACTTTG TTGAGCGATG GACTTAGCAG 601 CTTCAACAGC ATCACTTGAA CCATGTACAC TATCCACACC CTTGGAATTT GAGTCAACAC 661 AACCCTTGAA AAGAGCAAGA ATCTCAGACC CATTGCCTCT AACAACACTA GGTTTTAAAG 721 CAAGTAGCTC CAAACACGCC TTTAACCGGA AACTTGACCC ACCGGCAGCA ACAGGATCAA 781 GCACCCAAGG TTTTTTCGAT TGATTCGCCA CTTGTGCCGC TAATTTCATA CCGGGTAGCC 841 AGTCCGGAGT GAGGGTTCCG GCATTTATGC AAACTCCAAG GGCTTTTGGA GTAAATTCTG 901 GGATTTCTTC AACGGAATGA ATCATCGCCG GTGATGCGCC GGCGGATAAC AGAGTATTGG 961 CCATGAAATC CATTGAAACG AAGTTGGTTA TACATTGAAT TAATGGGGAT TTTTGACGAA 1021 CTAATTCGAA ATGTGTCCAT GCTTGTTTGG GCCAATCAAA AGGGAGTTCT TGAGCTCCCT 1081 TTTGGGGTTC AATGGTGTTA CTACTTTTGT CTTCTTCCAT AGGAGTCTAT AAGCGA Predicted gene structure (within gDNA segment 12097 to 1): Exon 1 11509 11466 ( 44 n); cDNA 13 56 ( 44 n); score: 0.932 Intron 1 11465 11275 ( 191 n); Pd: 0.000 (s: 0.93), Pa: 0.000 (s: 0.90) Exon 2 11274 11150 ( 125 n); cDNA 57 180 ( 124 n); score: 0.896 Intron 2 11149 10338 ( 812 n); Pd: 0.000 (s: 0.90), Pa: 0.937 (s: 0) Exon 3 10337 10311 ( 27 n); cDNA 181 206 ( 26 n); score: 0.519 Intron 3 10310 8025 (2286 n); Pd: 0.000 (s: 0), Pa: 0.940 (s: 0) Exon 4 8024 8020 ( 5 n); cDNA 207 211 ( 5 n); score: 0.800 Intron 4 8019 6085 (1935 n); Pd: 0.000 (s: 0), Pa: 0.779 (s: 0) Exon 5 6084 6056 ( 29 n); cDNA 212 239 ( 28 n); score: 0.586 Intron 5 6055 4873 (1183 n); Pd: 0.197 (s: 0), Pa: 0.861 (s: 0) Exon 6 4872 4862 ( 11 n); cDNA 240 250 ( 11 n); score: 0.727 Intron 6 4861 1934 (2928 n); Pd: 0.000 (s: 0), Pa: 0.620 (s: 0) Exon 7 1933 1928 ( 6 n); cDNA 251 256 ( 6 n); score: 0.667 Intron 7 1927 807 (1121 n); Pd: 0.776 (s: 0), Pa: 0.265 (s: 0) Exon 8 806 798 ( 9 n); cDNA 257 265 ( 9 n); score: 0.889 MATCH C06HBa0120H21.1-4- SGN-U325194- 0.896 256 0.225 C PGS_C06HBa0120H21.1-4-_SGN-U325194- (11509 11466,11274 11150,10337 10311,8024 8020,6084 6056,4872 4862,1933 1928,806 798) Alignment (genomic DNA sequence = upper lines): CAACGAATTA TACAATTGTG AATTATACAA TTGCAGTGAA ATACAATTTT CTCTAGCTTT 11450 || ||||||| |||||||||| ||||||||| |||||||||| ||| CAGCGAATTA TACAATTGTG CATTATACAA TTGCAGTGAA ATAG...... .......... 56 ATACAACAGA AGTGTATATA TTGTGTTTCT GTTTTTATAT AAAGCGAGAA AAACATATAT 11390 .......... .......... .......... .......... .......... .......... 56 CTTCTTGCTA TACACTTATA ATTATGCAAT ATACGTACAT TTTAATTCGA TTCAACTGTA 11330 .......... .......... .......... .......... .......... .......... 56 TGCAAAGCAA ATTATACAAT TGCAGCGAAA TAAGTCAGTG AATTATACAA TTTAGGCCAT 11270 | | .......... .......... .......... .......... .......... .....GATAG 61 CGAATTATAC AATTGTATAT GTATAGCGAA TTATACATTT TCTATGTTTG CTATGGAGCG 11210 |||||||||| |||||||||| |||||||||| ||||||| || | |||||||| ||||||| || CGAATTATAC AATTGTATAT GTATAGCGAA TTATACAATT T-TATGTTTG CTATGGAACG 120 CAATTATACA AACTTTGCTA TAGCGTACAA ATATAAATTT TTTATTTGCT ATATGTGAAA 11150 ||||||| | || ||||||| |||| ||||| |||| ||||| ||| |||||| ||||||||| CAATTATGCG AAGTTTGCTA TAGCATACAA ATATGAATTT TTTGTTTGCT GTATGTGAAA 180 ATTGTCCTTT TTAATTTTTG CCTTTCGCGC TTTTAAGTAA CAAAAAGCTT AGCCCAAAAT 11090 .......... .......... .......... .......... .......... .......... 180 ATTCCTAACC TTGAAAAAAG GCACAAGTGC AGCGTATATA CATAAGTTAT GCCTTATAAG 11030 .......... .......... .......... .......... .......... .......... 180 ACATGAATTA GTTCCTACAG AACTTTTATC GATAGAGGCA TAAAATTAGT TCTAGGCTTA 10970 .......... .......... .......... .......... .......... .......... 180 TGTCATAAGC AGTTCATAAC CAACAAATTA TATCTCATCA CTAAGACATA ACTTGGTTCG 10910 .......... .......... .......... .......... .......... .......... 180 TATGAAACTT ATATCAATAG AGGCATAAGT CCGTGCCTCA AGAGGCAGTA ACAAAATTAG 10850 .......... .......... .......... .......... .......... .......... 180 ATACTACAAA ACTTATATCG TACAAAATAA AAGTTGTTTA AACTCATGCA TTACGAAATA 10790 .......... .......... .......... .......... .......... .......... 180 AATTAGGTCA TAGTCAAGGA CTGATTAACC TGTAAATACA AAAAAACTTG ATGAAACACC 10730 .......... .......... .......... .......... .......... .......... 180 ATAATGTTTT GGTTAGGTTC AATTCAAAAT TAACAAAAAT TTAATTAAGC ACAAGTGTTA 10670 .......... .......... .......... .......... .......... .......... 180 TTTTTCCCTC CACTAATCTA AAATCTTGAT CCACTATAAT AGGAAGAGCT ATTACACATA 10610 .......... .......... .......... .......... .......... .......... 180 ACATACTTGT ATCAATCCCA AACAAAAAAT AATACTATTA AAAATTTGTT TTGATTATGG 10550 .......... .......... .......... .......... .......... .......... 180 AGTTAATTAG AAGAAGAAAT GCACCTCATC AAAATGAAAT TCCTCCCCTG GAACTACTTG 10490 .......... .......... .......... .......... .......... .......... 180 ACGAGGAAGA TGATAAGAAA AGTAAGAGGA AGATTGCAAC GAAGCAGAAA TGGTCATGTA 10430 .......... .......... .......... .......... .......... .......... 180 TCGATAGTTG TTGTTGGTTT GTAGGATACA TTTGTACTGT ATGGTGGATT TTATTATTTT 10370 .......... .......... .......... .......... .......... .......... 180 TGTACAATGC TATGCCAGCT TCGTTTCCAC AGTACGTAAC AGAGAAGATT AATGGGCCAG 10310 | || | | || || |||| | .......... .......... .......... ..AAAGTTGC CCA-AAAATA AATGAGAGT. 206 TAGCTGATCC TCCTGGCGTA AAGCTACGAA ATGAAGGGCT AAAGGTTAAA CATCCAGTAG 10250 .......... .......... .......... .......... .......... .......... 206 TTTTTGTACC TGGGATTGTT ACTTGTGGCC TTGAGCTATG GGAGGGACAT CAGTGTGCTG 10190 .......... .......... .......... .......... .......... .......... 206 AAGGATTGTT TCGAAAGCGG TTATGGGGTG GTACTTTTGG CGAAGTGTAT AAAAGGTCAG 10130 .......... .......... .......... .......... .......... .......... 206 AGACGAATCC AGAATTTGAA ATTGTTAGGT TCAATCTATA AAGTTGTTAT GATCAAATTT 10070 .......... .......... .......... .......... .......... .......... 206 CATAAAAGAT TATGAGTTCA AACTTCATGT TTTTCGGAAT TTTAATGAAA ATAAACTTAT 10010 .......... .......... .......... .......... .......... .......... 206 GCTCTGTGTT AAAAGTATTG AGTTCAGATG AACCTGGTAT GTTTTTATAG ATCGATGCAG 9950 .......... .......... .......... .......... .......... .......... 206 GGGAATAAAC TTCATTGGTA TTAATAGTAT GAGATGAGAT CTAGGATATG GGTACTATCT 9890 .......... .......... .......... .......... .......... .......... 206 CAAAATAAAT AAACTTATAT TTGATAAAAT TTTTAAATAT ATATAGATGA TATGAGTTAT 9830 .......... .......... .......... .......... .......... .......... 206 GGTGCTACTG AATAGCTACA ATTGCAAAAA GGAAAAAAGG TTTATTTTAA TGATTATCTG 9770 .......... .......... .......... .......... .......... .......... 206 TTTGTGTGAT GGTAACCAGA CCGTTTTGTT GGGCGGAACA CATGTCATTG GACAATGAAT 9710 .......... .......... .......... .......... .......... .......... 206 CTGGGTTGGA TCCTCCGGGA ATACGGGTTA GGCCAGTTGC TGGACTTGTT GCAGCAGATT 9650 .......... .......... .......... .......... .......... .......... 206 ACTTTGCACC AGGATATTTT GTGTGGGCAG TTTTGATTGC TAATTTGGCG CGAATAGGAT 9590 .......... .......... .......... .......... .......... .......... 206 ATGAGGAGAA AACGATGTAT ATGGCTGCAT ATGACTGGAG ACTATCCATT CAGAATACTG 9530 .......... .......... .......... .......... .......... .......... 206 AGGTATAGAT TTAACTTTCT GTATGCTTGA GCATTGTTGT TTCCTTAATC CACTTAAAAA 9470 .......... .......... .......... .......... .......... .......... 206 GTCGTTGTAA TGTGTCAATG GTGAAATCAT TTCATCAGGT GCGCGACCAG ACACTAAGCC 9410 .......... .......... .......... .......... .......... .......... 206 AGATAAAAAG CAATATAGAA CTGATGGTTG CAACTAATGG AGGCAATAAG GCAGTAATTG 9350 .......... .......... .......... .......... .......... .......... 206 TTCCACATTC TATGGGAGCT ATTTACTTTT TGTATTTCAT GAAGTGGGTC GAGGCACCAG 9290 .......... .......... .......... .......... .......... .......... 206 CTCCGATGGG TGGTGGTGGT GGTCCTGATT GGTGTGCCAA ACATATTAAA GCAGTGATGA 9230 .......... .......... .......... .......... .......... .......... 206 ATATTGGTGC GCCGTTTCTA GGTGTTCCTA AAGCATTAGC TGCACTTTTC TCAGCTGAAG 9170 .......... .......... .......... .......... .......... .......... 206 CTCGAGATGT CGCTATTGCA AGGTAAAATT GGTTCATGTG GATGTTTTCT CCTTGACAAC 9110 .......... .......... .......... .......... .......... .......... 206 CAACTGAAGA AATAGTACAT CTGGTATAGA GCTAAATCAT TTCTTGGCTT CTGCACTTGA 9050 .......... .......... .......... .......... .......... .......... 206 CTGAGAAGCA ATACCTCAAA ATTAATCCTA AAATGCTCCT CTAGTTCCGA GTGTTGGGTT 8990 .......... .......... .......... .......... .......... .......... 206 TTTAAAGGTT TGCTAATCTG AATACAGGAG ATAACACCGA GCTTGGTAAT TAAGGCCGTG 8930 .......... .......... .......... .......... .......... .......... 206 AGAAATTCTT TCCATATCTG TTTCTTGAGC TAGATTTGAG TATGTGTTGG TGCTTTGACG 8870 .......... .......... .......... .......... .......... .......... 206 TGTGAGTAGT AATTGCCGTT ACACATGTGG TTTACATCCT TATTTCCTTT TCCCTCAACT 8810 .......... .......... .......... .......... .......... .......... 206 TTGCACTACG AATCATTGAT CTTCTAAGAG ATGCTATTTC ATAGCCTTTT TATCTTTCAA 8750 .......... .......... .......... .......... .......... .......... 206 CACTAGATGT TATGTACAAA ATACAAGTCC TTGAATGCTT CCGCGTGTAG TCTGTTCCTT 8690 .......... .......... .......... .......... .......... .......... 206 CTTCATTGGT AGTCTTCTTG ATCATCACTA GTTTACTTAA ATTTCAGGAG TAAAGCATCA 8630 .......... .......... .......... .......... .......... .......... 206 GTTGTTATGG ACAAGGATTT ATTTCGTATT CAAACACTAC CACATTTAAT GAGGATGCTT 8570 .......... .......... .......... .......... .......... .......... 206 CGGACTTGGG ATTCAACCAT GTCTATGTTA CCAAAAGGAG GAGAGACGAT TTGGGGTGGT 8510 .......... .......... .......... .......... .......... .......... 206 CTTGACTGGT CTCCAGAAGA AGGCTATTCT CCTCGCAAAA GAAAACTAAG GGACAAAACT 8450 .......... .......... .......... .......... .......... .......... 206 AGTCATACGT CAAGCCATCA GGACAATCAA ACTGTAGAAT CTAAAGGAAA ACATGTTAAT 8390 .......... .......... .......... .......... .......... .......... 206 TATGGAAGGA TGATATCATT TGGAAAGGTT GCAGCACAGA AACCTTCATC AGATATTACT 8330 .......... .......... .......... .......... .......... .......... 206 AGGATTGACT TCAGAGTAAT GTTAACCAGA AACCAATGCT CCTTATTTAT CGTTCTCTAC 8270 .......... .......... .......... .......... .......... .......... 206 ATATTTCTAT GCTTTCTTCT ATTCGTTTCA ACATGTTAAA TAAGGCTCAC AAACATATAT 8210 .......... .......... .......... .......... .......... .......... 206 TTTTGTGCTT GAGAAAGACC TTTGTGAGGC ATAGCTTTGG TAGACCTGTC ATCTAACATT 8150 .......... .......... .......... .......... .......... .......... 206 TTGATACTTA TAAAGCAACA AAGAATAACT ACAACAATTC AATCGAAACA AAAATATGAT 8090 .......... .......... .......... .......... .......... .......... 206 GAACATTAAC TCAGTAGTAC AGTAGGAAAT TATCTCTTCC CCACTTTCTT CTTGTTGTCT 8030 .......... .......... .......... .......... .......... .......... 206 CTTAGTAGCT GTGTTTGTTT ACATGCAGGG TGCAGTGAAG GGCACGAACA AAGCAAATAA 7970 ||| | .....TAGAT .......... .......... .......... .......... .......... 211 CACATGTGAT GTGTGGACGG AGTACTATGA CATGGGCGTT GCTGGTATAA AAGCTGTGGA 7910 .......... .......... .......... .......... .......... .......... 211 AGAATACAAG GTTTATACAG CTGGAGATAT ATTGGATCTA CTCCACTTTG TTGCCCCAAA 7850 .......... .......... .......... .......... .......... .......... 211 GATGATGGCT CGTGGAGGCG CTCATTTTTC ATATGGGATA GCTGAAGATT TGGATGATCC 7790 .......... .......... .......... .......... .......... .......... 211 AATGTATTCA CACTACAAAT ACTGGTCAAA TCCGTTGGAA ACAAAGTGAG TACTTTTCAT 7730 .......... .......... .......... .......... .......... .......... 211 TTGAACTCTG TCTCCGTACT TGTTGTCTTT GTACAAAGGA TATAGTCTAC GAGTAGAAAA 7670 .......... .......... .......... .......... .......... .......... 211 TGACCTCGAG TTCATTTTAT CTTTACCTTG TTAACACTAC AGGCTACCAA ATGCTCCTGA 7610 .......... .......... .......... .......... .......... .......... 211 AATGGAGCTT TATTCGATGT ATGGAGTTGG CATTCCAACT GAAAGAGCAT ATGTTTACGG 7550 .......... .......... .......... .......... .......... .......... 211 GCAAACACCA ATAGCACAAT GTCATATTCC ATTCCAGATT GAAACTTCAG CTGATGAAGG 7490 .......... .......... .......... .......... .......... .......... 211 GAATGAGTGT TGTATGAAGA ATGGTGTTTT GACTGTTGAT GGCGATGAGA CGGTGCCTAT 7430 .......... .......... .......... .......... .......... .......... 211 TTTAAGTGCA GGCTTCATGT GTGCAAAAGG ATGGCGCGGA AAAACTAGAT TTAATCCATC 7370 .......... .......... .......... .......... .......... .......... 211 AGGAATCAAA ACTTATACAA GGGAGTATGA TCATGCTCCT CCCGCAAACC TTCTTGAGGG 7310 .......... .......... .......... .......... .......... .......... 211 TCGTGGTACA CAGAGTGGAA ATCATGTTGA TATAATGGGA AATTTCGCTT TGATTGAAGA 7250 .......... .......... .......... .......... .......... .......... 211 TATCATGAGA GTTGCAGCTG GTGCAACGGG CAAAGACTTG GGAGGTGATC AAGTTCACTC 7190 .......... .......... .......... .......... .......... .......... 211 GGACATCTTT AAGTGGTCTG AGAAGATTGA TTTACGTCTT TAGGGGAACA CTGGGTGCAC 7130 .......... .......... .......... .......... .......... .......... 211 TGCTTTTCTC TTATATGATG AAGTACTTGA CTCAACGGTT AGAATATCAC TTTCATATCT 7070 .......... .......... .......... .......... .......... .......... 211 ACATAATTGT GATGGTGATC TATCTTTACG TCTTTAAGTT TCATCATGTC AAGATGTCTG 7010 .......... .......... .......... .......... .......... .......... 211 CTTCGTGTAT AATCACAAAG TTATATGCTG ATATGAGTTC ATCTGGACGC CACACAAACC 6950 .......... .......... .......... .......... .......... .......... 211 CTTCCTGTCA AATGTATGGA CAACAACATA CCCTGTATAA TCCCACTCAT GGGAGATACT 6890 .......... .......... .......... .......... .......... .......... 211 AAATTGTGAA ATTATGTCAT GAATCTACCA CTGACGTTTG GAGAGCCCTA GCAACACTTA 6830 .......... .......... .......... .......... .......... .......... 211 ACTTTTATCC GACTATGTAG ACGATTAAAT GGATTCCAGA TAAAATGCAG GATGTTTTGT 6770 .......... .......... .......... .......... .......... .......... 211 GTATCTCGTA GTGTAATATG TCAAAGTTGG AGCTAGTTGC CCAAATCAGG AACAATTAGG 6710 .......... .......... .......... .......... .......... .......... 211 GATAGAACTG ATATTGTACC AACATAGTTT TCTCAAAGAT ATTGTATTTC AAATGTGTTG 6650 .......... .......... .......... .......... .......... .......... 211 TTATACAGAC TTGTGGAATA GCTACTCAAT GAATGAACAA CTTTTCTTCT TTATTTGTGA 6590 .......... .......... .......... .......... .......... .......... 211 TAACATGAAT TTCTTGGATT CTTCAGTTCA GTTGAATGTA CTTATTATAA TGGAGTATTC 6530 .......... .......... .......... .......... .......... .......... 211 TTAAAAAAAA GTTGATTTTC CATTATTATA CGTCGAAATA ATACTGTAGA AATGCATGAT 6470 .......... .......... .......... .......... .......... .......... 211 AAGTCTATGT ATAATAGACT TTTATAGTCA AATCCTTTTC CGAACTCCAC AAGTAAAGAA 6410 .......... .......... .......... .......... .......... .......... 211 AACTTAATGT ATCGAGTTCA GAAGAAACTG AAAAATTGAA AAAAGAAATG GCCTATGAAA 6350 .......... .......... .......... .......... .......... .......... 211 AAATGATTGG ATAGAGAGGA GGTCTGACAT ATGGGTTTGG GAATACATCA CTAAAAGGCA 6290 .......... .......... .......... .......... .......... .......... 211 TCAGTGAAGC TCATCGGATT CTTTGTCAAA TAGGTATATA TATACTATCA CATAATGTTT 6230 .......... .......... .......... .......... .......... .......... 211 GACTTTGCAA GTGTAACAGA TTATTACGGC AGTCGAGTCA GAATTTTTAA TAAGGGATTT 6170 .......... .......... .......... .......... .......... .......... 211 AAATTTTGAA AAAATAGACA TATGGATACT ATAGAAATCT AAAAAATTGT TTTAACCATG 6110 .......... .......... .......... .......... .......... .......... 211 TAAATGATTA ATTTTTCGTC AAAAGGGATT TGAAATGAAC AACCTGACTA TAACGTAGCT 6050 | ||||||| || | ||| | | | .......... .......... .....TTAAA TGAAATGGAC ACATTGA-TT TCTC...... 239 TCGCCACTGA TTACTATATA ATTGCTAAAT TTAATAACTT GAAAAATAAA GCAAACAACC 5990 .......... .......... .......... .......... .......... .......... 239 TTTTATAACG AGTCAAATTT TATTTATATT ATAAAAACTC ATTCATTATA GTGTATATGT 5930 .......... .......... .......... .......... .......... .......... 239 CTAATACACG CAAGCTACAG TTTATATTAT AAATAATATA GAAATAATCA ATTTAGTTAT 5870 .......... .......... .......... .......... .......... .......... 239 GCGCTTAGAA ATGATCAAAA TACGTTTTGA ATATAAAAAG AAGAAATTGA GGAACATGTC 5810 .......... .......... .......... .......... .......... .......... 239 AATTTTTATG TTTGATTTAT TTAATGAAAG ATTATAAAAG GAATATTGTT TTATTTAAAA 5750 .......... .......... .......... .......... .......... .......... 239 TACGAACTTT AAAATGTCAA TTTATATGTT CCATGTAAGC AAATTTTTGT CATTTTTAAT 5690 .......... .......... .......... .......... .......... .......... 239 TGGATCATAT AAATCGTTGT TTAAGAATAG TTATATTATT AATCCTAATA ATGCATTTTT 5630 .......... .......... .......... .......... .......... .......... 239 ACTTTATTTA AAATGACATA TAAATTTCTG CATATGTCTA TCATGCAACA AGAAAAATAA 5570 .......... .......... .......... .......... .......... .......... 239 AACCCAAACA AATAACGTAA CCAAAGAAAA AAGATTTCTC ATTTAGGATT AAGAAAATTT 5510 .......... .......... .......... .......... .......... .......... 239 ATGTACGCTG ATAAAATTAA AAATGAAGAA AACTTTCTAA TTCGTTAAGT AATATTTTTT 5450 .......... .......... .......... .......... .......... .......... 239 AACTCTAATT AGCTACTAAA TAATATCCAA TTCTATAGAT GTAAATATCC TTGCATATAT 5390 .......... .......... .......... .......... .......... .......... 239 GCTCTGATCA TTTTTGCTAT TCGCTATCTT ATTATTATTA TTTTGGTTTG TTTTTTGAAA 5330 .......... .......... .......... .......... .......... .......... 239 ATGAACTTTA ATTTAGAAGA GATAGTTATA TATATTCTTC GTAATTTCAC TGAATTTATT 5270 .......... .......... .......... .......... .......... .......... 239 ATTTTCGTAA TTATTGATAC AGACAAAGGT GTTTAAACGG ACATCGTAAT GGAGTGTGTG 5210 .......... .......... .......... .......... .......... .......... 239 TGAGAGAGAA CCCATTTTGA GGAATTCGGG GGCAATTTCA GTTTCTCGGT GGCGGAAGAC 5150 .......... .......... .......... .......... .......... .......... 239 GCGCAGAGAC TTCACCCTTT CAATTCCGTT GACACATATA CAGAGGTGTC GTTTCTGATA 5090 .......... .......... .......... .......... .......... .......... 239 ATCAATTTCA CTTTCTCTTT GCTACGATCG CTACATTCTC TCTCTCTCTT TGTAAGTATC 5030 .......... .......... .......... .......... .......... .......... 239 GCCTTTTTAA TCTACTTGTT TATTGATGGA TCATGTAGTT TGAATCGCAC AAAAATATGG 4970 .......... .......... .......... .......... .......... .......... 239 GTCTAGGGTT TTCTAATCGA GCTCAGGCAA AAGAAGTTGT TCTTTTTTTG TTTTAAAGTT 4910 .......... .......... .......... .......... .......... .......... 239 GGGATTTCAA TTTATTACTT GTGAATTTGT GAAGCAGCTG TAAAAGCTGT GATTTTGGGT 4850 | ||||||| .......... .......... .......... .......GTC TAAAAGCA.. .......... 250 TAGCTCCTAC TGCGTTGATT GTCTTGTTTG GGAAGTTGGG TGCTTTGGAG AAGGTTTCTT 4790 .......... .......... .......... .......... .......... .......... 250 TTATCGGAAA TGCAGTTATT CTTGATTATA CGTGCTTTTT GTGCCAAATT GTTTGTTTTG 4730 .......... .......... .......... .......... .......... .......... 250 TTTCTGGCTC TTGTTGTTTC AAGTTTATAA AAATCAATAA ATGAACCTAA ATATAGCGGA 4670 .......... .......... .......... .......... .......... .......... 250 ATGGATATAG ATGTTGCTGA ACCAACTAGT CTAGGATGAG AGTTGTTGCT GTATATTAAC 4610 .......... .......... .......... .......... .......... .......... 250 TCACAGTGGT GAATAGGTAC CTGTAGACTC CATTTGTCAC TTGGTATGCT TATGTTTAGT 4550 .......... .......... .......... .......... .......... .......... 250 ACAGAAAAAG AAGGGAAGAG ACAAGAAAAA GATCAGAGTG GACAGAAGAC GTTGAGCTTA 4490 .......... .......... .......... .......... .......... .......... 250 AGTTTCTAAT GAAGGGAAAA TAGATTCACA GCATGCCGTT TCTTGGTTAC TGTTAGTTGG 4430 .......... .......... .......... .......... .......... .......... 250 TTTGTCAAGA TTCTGGTAGA TGGGCTCCCC TTATTCGAGC GAGGTTGTTC CCGAAAGGTC 4370 .......... .......... .......... .......... .......... .......... 250 CTCAGCTAAA AAATAGAGTA ATCAAAGCAA AATAACGACA ACTATATAGC AAAATAAACT 4310 .......... .......... .......... .......... .......... .......... 250 TAGCAAATGA AGCAACATGT AGTTGTAACT CGTAACAATG AAGAATAAGA TACTATGTGT 4250 .......... .......... .......... .......... .......... .......... 250 ATACTAAAAT AATACTACAG AATAAATGGA AGATGAGAAG AGGAGACTAA CCCCTGCCTC 4190 .......... .......... .......... .......... .......... .......... 250 TCCCACATAA AATTACCAAA CACTCTGCTA TCTTCTAAAT TTCAATAATT GATCATTTAA 4130 .......... .......... .......... .......... .......... .......... 250 AAGCTTAATA GCAGCTGAGG TTTTAGCTAG TTCACAATTG TTCCTTCACC GAGTCTTTGA 4070 .......... .......... .......... .......... .......... .......... 250 TCTTGGGCTT ATTAATGAAA ATCTTACTTG ATTAAAGAAA AATACCACTC TGATGTTGAG 4010 .......... .......... .......... .......... .......... .......... 250 CTTAACGTCA TTTTCCCTTT CTTTTCCCAG AACCCTGCCA GTAAAAAGAT TTGTAAGAGC 3950 .......... .......... .......... .......... .......... .......... 250 TCACTTCATC GGACTCAAAT TTAATAAAGC ATTTCCTCAA ATGTTAACTC TTGGTTTTTC 3890 .......... .......... .......... .......... .......... .......... 250 CTGGTAGATT GGGTTTTTCT GTCGTCTGGA ACTAGGCTCT TGTTGGTCCT AGTTTATGTT 3830 .......... .......... .......... .......... .......... .......... 250 TTATTTCCAA TCATTTTATT GGGGTGATCG TTTTAGAACT CAATTTATCA AATATCAAGA 3770 .......... .......... .......... .......... .......... .......... 250 AAATTATTTT TTGTTGCCGG TGCTAAGTCC TTTGATCTCA TCGAATATGC TCCATCCCAC 3710 .......... .......... .......... .......... .......... .......... 250 AATATATGGT GCAATTGGGT AGAAAGAAAA ATAAACTTCG TAAGGCAGTA GATAATCAGC 3650 .......... .......... .......... .......... .......... .......... 250 TGATGAGTCC TAATATGTTA TATGGTTATG CCTAAAGATG ATTGCTGCAT CAGAAGGATC 3590 .......... .......... .......... .......... .......... .......... 250 TGGGAATATA TTCTGGAGAG GGAGTCAGTT GGACCAACTC TATGAATTTT CTGTAACTCT 3530 .......... .......... .......... .......... .......... .......... 250 ATATTTTAAG TTTCATTTAA GATTTGTGGC TATCTGCAAA ACAGCAGTAA TGTTATTATC 3470 .......... .......... .......... .......... .......... .......... 250 CTGGGTCGAT ATAACTTGAA AAATACAAAA AATTATAAAG GAAGCGAGAT TATAAAAGGG 3410 .......... .......... .......... .......... .......... .......... 250 AAGCAACCAC AGTGATATGA AGGGGCGGTG AGAACTTGAG ATGTTCTGAA AGTGAAACAG 3350 .......... .......... .......... .......... .......... .......... 250 GGCTGAAATA TAGGAACTAG TTGGAGCCAA AATCACTGTT GGATGACAAA ATCATTTAAC 3290 .......... .......... .......... .......... .......... .......... 250 AAAAGAGGGA GTCCCTTCAA AATTGCTTGT TGGGGAGATT TCAGGGTGTT CACCAAGATA 3230 .......... .......... .......... .......... .......... .......... 250 TCCCAATAGG TTCAGAACTA AGAAGATGGT TAACCACATT TGGAAGGTTT TTTTTTTTTT 3170 .......... .......... .......... .......... .......... .......... 250 TTGTGTGTGT GTGTGTGTGG GGGGGGGGGG GTTCTCAAAA GGAAAGGGTT TAGTATTATC 3110 .......... .......... .......... .......... .......... .......... 250 TTTGAACTAA CATTGGATGA GGAATCTCAG TGAATCAAAA GAGTATGAAG ATGGAACCTA 3050 .......... .......... .......... .......... .......... .......... 250 GGCTGGATTG GTGGTCACAA ATGTCGGGTT TCATATCTAA CAACGATATA CCAAAGTAGT 2990 .......... .......... .......... .......... .......... .......... 250 GTTGGATTAG CTATTTAGGT TATTGTGTAT GCTTCTACCC TATGGAACCA GGATACTTTT 2930 .......... .......... .......... .......... .......... .......... 250 AGAAAATTGG TGAAATTGCG TGGTTGATTG AAAATAGAGG AAGAAACTAT TTTGAACAGG 2870 .......... .......... .......... .......... .......... .......... 250 CTACCTAAAA TGGGCTTGAA TTCTTGTCAA AAGCCTCTTA ATTATATTCC GGGGACCATC 2810 .......... .......... .......... .......... .......... .......... 250 TCAGTGGATA CTGGAGAGTT TGAGTACAAT ATATCCATTT GGGTTTGAAA GCACTACAAT 2750 .......... .......... .......... .......... .......... .......... 250 TTGTCAGTCA CAAAGCCTCA CTGTCCTGGA AAATATTTTT CACTGGAATT TCTGGTGAAC 2690 .......... .......... .......... .......... .......... .......... 250 AAATGGACCT CTGAGGGGTT TTTGAGGAGA CGCATTTGAA ATTCAGATCT GATGGTGAGG 2630 .......... .......... .......... .......... .......... .......... 250 TAACTAGGAT CACGCATAAC GAGTTTGGCT TGGGCAAGCA ACAATGTTTA AAGTTATTTA 2570 .......... .......... .......... .......... .......... .......... 250 ACGAGTCCAA TCCTTTGGGC GTCTCTTACA TATAAAAGAA CCTACTATTC TAATACCTGA 2510 .......... .......... .......... .......... .......... .......... 250 ACCAACGTTC AAATAGCAAA ACAATTGCTG AAACCCTCTA TTTGTCATAA GATAATTAAA 2450 .......... .......... .......... .......... .......... .......... 250 TGGCCTGCAT ATATTAACAT TCAATTCTCA CCTTGCATTG CAGAGGACAC GGCCTTCTAC 2390 .......... .......... .......... .......... .......... .......... 250 TTTGTATCTT GTTAATCACA GTCCTATAAA GTGCCTTAAT TCCTAGTCAT TGTTCCAAAG 2330 .......... .......... .......... .......... .......... .......... 250 AATCATTATG TTTAGTGTTA TATAATTCTT TTTCCTGGTT TCTGTATACT AGCTACTGGA 2270 .......... .......... .......... .......... .......... .......... 250 AGATCCAGGT CTAGTGTCTT AGTTGTTGCC TTGGTGTGTA TTGTGCTGTT TGGTACTTTG 2210 .......... .......... .......... .......... .......... .......... 250 ACATGCTTGC AAAAAAGATG CTTCATTCTA AATATTCAAT CTTTAGCAGT TACGGTTATT 2150 .......... .......... .......... .......... .......... .......... 250 ATCTTATTGG AGTATTGTCT TCTGCAGTGT GTAGACTGAA CCCGCGCTGA ACTGCATTTC 2090 .......... .......... .......... .......... .......... .......... 250 GCCTATAAAT TAATATATTT CTTAGGAAGA TGCAGCAACC CGTGCAGCCA AGGTCTTCTG 2030 .......... .......... .......... .......... .......... .......... 250 CCAATGGATA TGGCCGTCGT AAAGTTGATA GAGAAATGGG TACTAAGTTG GAGAATAAAG 1970 .......... .......... .......... .......... .......... .......... 250 CGCAATCTGG AAAAACTACT TCTCGTCAAT TTACAGGTAT AGGTGAGAGC CGCTGACTTA 1910 || || .......... .......... .......... ......TTAA AG........ .......... 256 CGATTGTCTA TGATTTCTTC TGTAGTTCTA ATTCCAAGTG TTCTAGGTAA AGGGGGAGCA 1850 .......... .......... .......... .......... .......... .......... 256 TATCAAAGCC TGTCACATGA TCGACTAGTT TATTTCACTA CCTGTCTTGT TGGACATCAA 1790 .......... .......... .......... .......... .......... .......... 256 GTGGAAGTAC AAGTGATGGA CGGATCAGTG TTTTCAGGGA TACTTCATGC GACAAACGCT 1730 .......... .......... .......... .......... .......... .......... 256 GAAAAAGATT TTGGTATGTT ATAAAAGTTT AACTTAGACT GCAGGTCTTT GGGTATTAAT 1670 .......... .......... .......... .......... .......... .......... 256 TGAAGACTTT AACCATTGTT TTGGCATATG TTCAACTATT TTTTTTGAGT CAAAGGCAAC 1610 .......... .......... .......... .......... .......... .......... 256 CAATTGTATA AAATCATAAT TTAGCACATA AAAGAAATGC TAAGTTAAGA AATCTCCAAA 1550 .......... .......... .......... .......... .......... .......... 256 GTATACATAC AACCAAAACA GGAGCCCCTA TTCAAGATAT ACTTCAACTG TGTTATATTC 1490 .......... .......... .......... .......... .......... .......... 256 AACATTGGCT TATTCCCTTC ACTTGACACA GGTATCATTC TGAAAATGGC GCAGTTGATA 1430 .......... .......... .......... .......... .......... .......... 256 AAAGATAGCT CTGAGGGGAT GAAGAGTAGT TCTGAAACTT TTAGCAAGCC TCCATTAAAG 1370 .......... .......... .......... .......... .......... .......... 256 ACTTTGATAA TACCGGGTAA AGAGTTTGCT CAAGTTACAG CAAAGGTTTG TGTGCAACTA 1310 .......... .......... .......... .......... .......... .......... 256 CTATAATTCT TTCAGGCAAT TATTTATCCA TTGCCTAATT TCAACAATGG CAATACTTTA 1250 .......... .......... .......... .......... .......... .......... 256 AAATTTAAAT AATCTACCAA GCAAATCAGA TCACAGTATA TGATGGTGAT GGTTTTTCAT 1190 .......... .......... .......... .......... .......... .......... 256 GGAAGGTCTT TATATATAAT TTCAAATTAA TTTTTGACAC TGATCGTTGT GATTGCAATT 1130 .......... .......... .......... .......... .......... .......... 256 AGTTTTAGTT ATAATAATGA TGCTCTATTG CTTTGGAGTA GGGTGTGCCT ACAACTCTAG 1070 .......... .......... .......... .......... .......... .......... 256 ACGGTTTCAG AACAGAATTC ATGCTGGAAC AGCAGCAGGA ACTTTTGACT GATTCATGCA 1010 .......... .......... .......... .......... .......... .......... 256 TTTCACAATC TCGGCATATT GAGGTAGAGC GGCAATTGGA ACGCTGGGTA CCTGATGATG 950 .......... .......... .......... .......... .......... .......... 256 ATGCTCCTGA ATGTCCTGAT CTGGACAATA TATTTGATGA CCATTGGAAT AGGTTGCTAG 890 .......... .......... .......... .......... .......... .......... 256 CTCACATCAT TTCCGTTGAT TGTTTATGTT TTAGATATCT GTAGTATGTA TTGCACGGAT 830 .......... .......... .......... .......... .......... .......... 256 CCTTCAAATC CTTGACGTAC CAGGTGATTG AT 798 ||| ||| || .......... .......... ...GTGGTTG AT 265 hqPGS_C06HBa0120H21.1-4-_SGN-U325194- (11509 11466,11274 11150) ******************************************************************************** EST sequence 3 +strand 1511 n (File: SGN-U323393+) 1 CAAACCAAAA CATTTCCATT TATCTTTTGC CTTTGCCATT TCTTATTTCT AACAAATATT 61 CGAATTTCTT TTGTTTCATC GATTTTATTT AGATTTCACC TTTTCTTTTT CGAAAAAATG 121 AAATTAGAAA ATGGTCAAAA AATTGGGAGG GTTCATGAGA GAGCTGAGGG TCCAGCGAAA 181 ATTTTAGCCA TTGGGACAGC AACTCCTTTT CATTGGGTTG ATCAAACCTC GTATCCTGAT 241 TATTATTTCA AAGTTACGAA TAATGAGCAT TTGGTGGACC TCAAAGAAAA ATTTAGACGT 301 ATTTGCAGCA GAACAATGAT TAGGAAAAGG CATATGCTTT TAACAGAAGA AATATTAAAG 361 AAAAATCCTA ATTTGTGTTC TTATAATGGG CCTTCCCTTG ATATTAGGCA AGACATTTTG 421 GTCTCAGAAA TACCCAAACT TGGTAAAGAG GCTGCCCTTA GGGCCATTGA TGAATGGGCT 481 CAGCCCAAAT CAAATATTAC CCATTTAGTC TTTTGTACTA GAAGTGGTGT GGACATGCCT 541 GGTGCGGATT ACCAATTAAT TAACTTATTG GGCCTAAGCC CATCGGTTCA ACGATTCATG 601 ATGTATCAAC AAGGTTGTTT TGCCGGTGGC ACGATGCTCC GGTTAGCCAA GGACTTAGCT 661 GAGAACAACA AGGGTGCTAG GGTGCTTGTT GTGTGTGCCG AGAGCTCAGC GATAGGGTTT 721 CGCGGGCCGA GTGAAGCTTA TCCCGATAAC CTTATCGCGC AAGCATTGTT CGGAGACGGT 781 GCGGTCGCGG TTATAATCGG GTCGGACCCT AAAATGGGCC TGGAGAGGCC CGTTTTCGAG 841 ATTGTCTCGG CGGGCCAGAC GTTTGTACCT AACGGGGATT GCCACCTCGC GTTACACTTA 901 CGCGAGATGG GCCTTACGTT TCATTGTACC AGAGACGTAC CACCGGCCAT CGCGAAAAAT 961 GTGGAGAGTT GCTTAATAAA GGCGTTTGAA CCGTTAGGCA TTTCAGATTG GAACTCGGTG 1021 TTTTGGATTC TTCATCCAGG AGGTAATGCG ATTGTGGACC AAGTCGAAAA CATATTGGGC 1081 CTCGAGCCCG ATAGGTTACG GGCCACGAGA AATATCCTTC GAGAATACGG TAACTTGTCG 1141 AGTGCATGTG TTTTATTCAT ATTGGATGAG ATAAGAAAAA AATCTGCTAG AGATGGGCTG 1201 AAGACTACTG GAGATGGGCT GGACTTGGGA GTCCTTTTAT CATTTGGGCC TGGCCTTACA 1261 ATTGAGACCG TTGTGCTTCG TAGTATGCCC ATTTAAATAA TGGGCCCATT ATGTCTTTTT 1321 ACTTGGGCCT GGCCTTTTAT GATGTTTCAT TTCATTTGGT TGAAGTTTTT TTCATCTATA 1381 TGCATGTATT AATAAATAAT TGAGTAAAAT ATCCACCATC TCAATAGATA TTATAAAGGG 1441 AAAATTGTAT ATAATTGCAA ACTAATAACC TAAATTAAAT GGAATAGCTA GGGTTTAATT 1501 TAAAAAAAAA A Predicted gene structure (within gDNA segment 12097 to 10987): Exon 1 12092 11990 ( 103 n); cDNA 1329 1427 ( 99 n); score: 0.592 Intron 1 11989 11807 ( 183 n); Pd: 0.900 (s: 0.58), Pa: 0.000 (s: 0.88) Exon 2 11806 11732 ( 75 n); cDNA 1428 1502 ( 75 n); score: 0.907 MATCH C06HBa0120H21.1-4- SGN-U323393+ 0.725 178 0.160 G PGS_C06HBa0120H21.1-4-_SGN-U323393+ (12092 11990,11806 11732) Alignment (genomic DNA sequence = upper lines): CTCTCAACTA ATTATGTGTA TAGTTCATTT CGTTTTCACT TTGTTTCTTA GTAATATTCA 12033 || | | || |||| | | ||||||| | || | | || |||| | | |||| || CTGGCCTTTT ATGATGT-TT CATTTCATTT -GGTTGAAGT TTTTTTCAT- CT-ATATGCA 1384 ACGATTTATT CAATTTTGAT TAAATTGATG ATCGAATCAA TAAATATCCT AATTATTCGG 11973 ||| || | |||| |||| | | | |||| || TGTATTAATA AATAATTGAG TAAAATATCC ACCATCTCAA TAG....... .......... 1427 TAAAAACTTG GCATTTCTCC TGATCTGTAC AACCGAGATC TCCTTTAGCC CAACTACTAA 11913 .......... .......... .......... .......... .......... .......... 1427 GGCCCATCAT GGTATGGTGG GCCGAAGCAA TGGATCAAAA TATTTGGCCC TAAAACATTT 11853 .......... .......... .......... .......... .......... .......... 1427 TCAATGAAAC TCTGGTCATT CGTATGAATA CGTCTATTTT AGAGTGATTT TTAATTGGGA 11793 || | | | |||| .......... .......... .......... .......... ......ATAT TATAAAGGGA 1441 AAATTGTATA TAATAGCAAA CTAATAACCT AAATTAAATG GAATAGCTAG GGTTTGATTT 11733 |||||||||| |||| ||||| |||||||||| |||||||||| |||||||||| ||||| |||| AAATTGTATA TAATTGCAAA CTAATAACCT AAATTAAATG GAATAGCTAG GGTTTAATTT 1501 A 11732 | A 1502 hqPGS_C06HBa0120H21.1-4-_SGN-U323393+ (12092 11990,11806 11732) ******************************************************************************** EST sequence 2 -strand 2089 n (File: SGN-U315404-) 1 TTTTTTTTTT TTTTTTTTTC CTATCAAATG TGATACTGGG GGAATATATA TATATATATA 61 TAATGTTAAT TAATCGAAAT TGAGAAGCTG AAATTTCACC TATAACATCA AGTATACACG 121 TTACCTATGC CCTGGTTGTT TATAGCTGAA TTTCAAGCCC CTTTTCTTAT TCTATTATTA 181 CACCCCTAAT GAAATGATTA AGGAACTTAT TAAAGCAAAA CAACTAAATT CTTATTTTGC 241 AATCTTTTCT TGTCATTCAC AAGCGATAAT TCTAAATCTT TTTCCTTCTC TCCATTTTAT 301 ACAAGCTCTT GGGAAAATTG TATATAATAG CAAACTAGTA ATCTAACTAA ATGGAGTAGC 361 TAGAGTTTGA TTTAATTGTG CTCCATAGCA AACGTTAGCT AAAGTTTGTC AGGCGTCTCC 421 CTCCCAAAAA TCTCGCTCGC CACTCTCCAT TCTCGCTCGC CTATCTCGCT ATATACACAG 481 AAGTGTATAA ATTCTGTTTC TATTTTGTAT AAAGCGAGAG AAAATTGTAT ATACACATGT 541 AAAAATATAT ATCTTCGTGT TATACACTTA ATTATACAAA TTACAAACAT TTTACTTCAA 601 ATATTGCAGA GAAAAAGGCC AACGAATTAT ACAATTGTGA ATTATATAAT TGCAGTGAAA 661 TACAATTTTC TCTAACTTTA TACAACAGAA GTGTATATAT TGTGTTTTTG TTTTTGTATA 721 AAGCGAGACA AAAACATATA TATTCTTGCC ATACACTTAT AATTATGCAA TATACATACA 781 TTTTAATTCG ATTCAACGGT ATGCAAAGCA AATTTATAAA AATATTGCAG CGAAATAAGC 841 AGCGATTTAT ACGATTGTGC ATTACACAAT CGCAGTGAAT GAAGATAGCG AATTATACAA 901 TTGCAGCGAA ATAGGCCATC AAATTATACA ATTTAGGCCA GCGAATTATA CAATTGTATA 961 TGTATAGCGA ATTATACAGT TTTATGTTTG CTATGGAGCG CAATTATGCA AACTTTGTTA 1021 TAACATACAA ATATGAATTT TTTATTTGCT ATACGTGAAA GTTGCCCTTA ATTAAATGCT 1081 CCACAGTCTA TACGAATATT TCCATTCTTA CCCGTCTTCA CACCAACTCG GCCCAATTTA 1141 GTCATCGCAT TAACAAATGC GGTTTCAAAA ACTTTAAAAT TACTGGCCCA CAAATTAACA 1201 GTACCTTTAG ACCTTTGGTC TGTGAACAAA ACTTGATCTG ATGTGAATAG GCCCATTCCG 1261 TTTTGCAAGT TTTGGAAATA CACATTGTCA AAAGCCCTAG GTGTTATTGG GTCCATGTTG 1321 ATGGCTATTC TTGGGTCCAC ATTTTTCGGA CACATTTGTT CTAATTGGGC TGCATACGTC 1381 TTGTTGAGAC TTGGATCCAC TGGGTTTTTA GGGTTAAAGT TGAAAATTCG GTTCGAGAAT 1441 TGGTCGCAGT GAGAAAATCC AACAGTATGG GCCGCAGATA AGGCAATCAT ATCAGCCTGA 1501 TTTAAACCAT GAGAGGCAAA CATTGTATTG AGTTGATCCA AATTGAAAGT AGGTTTAGGC 1561 AACTTTCCCC CTACATTTGT AGATTTTGAT GTCAAACCAT CTAATCTCCC CAATTCCACT 1621 GCATACCCCG GTCCACCGGA TAGTTGAATA ACATCTCGAG TGGCTAAGGC AAGAATATCA 1681 GCACAAGAAA CTTTATTTTT ACAACTTGGG ATCGCATCAA CTGCGGCTTT TGCTTTGATA 1741 ACTGTGTCAA ATCCATCTCC AGCCAATGAA AGATTATCTG GGTGATCTTT TTCTGCTGTG 1801 TTCCCTGCCG TTGATGCTAT TATCACCGAT GCATCACAAC CCTCAACAAA GCAATCATGG 1861 AAGAAAAGAC GAAGAACAGC CGGAATTGTG ACAAACGTTT GTTTGAATTT CTGGTTAACA 1921 ACATTACGCA CAATGGATTC AACATTAGGA CAAGTTTGGG CATAAAAATT GGTTTTGAGT 1981 TGAGCATCTA CCAAGTTTGG CATGAAAATA CTAACACAAG AAATTGACAA AAATGATGTA 2041 ATAAAAACTT GTAAATAACC CATATTTTTT TCTATATATA ATCAAAAAT Predicted gene structure (within gDNA segment 12097 to 938): Exon 1 11798 11233 ( 566 n); cDNA 309 899 ( 591 n); score: 0.785 Intron 1 11232 8002 (3231 n); Pd: 0.000 (s: 0.62), Pa: 1.000 (s: 0) Exon 2 8001 7970 ( 32 n); cDNA 900 931 ( 32 n); score: 0.562 Intron 2 7969 6564 (1406 n); Pd: 0.000 (s: 0), Pa: 0.120 (s: 0) Exon 3 6563 6536 ( 28 n); cDNA 932 958 ( 27 n); score: 0.536 Intron 3 6535 4992 (1544 n); Pd: 0.900 (s: 0), Pa: 0.993 (s: 0.64) Exon 4 4991 4945 ( 47 n); cDNA 959 1002 ( 44 n); score: 0.638 PPA cDNA 19 1 MATCH C06HBa0120H21.1-4- SGN-U315404- 0.785 673 0.322 C PGS_C06HBa0120H21.1-4-_SGN-U315404- (11798 11233,8001 7970,6563 6536,4991 4945) Alignment (genomic DNA sequence = upper lines): TTGGGAAAAT TGTATATAAT AGCAAACTAA TAACCTAAAT TAAATGGAAT AGCTAGGGTT 11739 |||||||||| |||||||||| ||||||||| ||| || || |||||||| | |||||| ||| TTGGGAAAAT TGTATATAAT AGCAAACTAG TAATCT-AAC TAAATGGAGT AGCTAGAGTT 367 TGATTTAATT GTGCTCCATA GC-AACATTA GCTAAAATTT GCCA-GTGTC TCTCTCCCAA 11681 |||||||||| |||||||||| || ||| ||| |||||| ||| | || | ||| || ||||||| TGATTTAATT GTGCTCCATA GCAAACGTTA GCTAAAGTTT GTCAGGCGTC TCCCTCCCAA 427 AAATCTCGCT CG---C-CT- C--TCTCG-- ---CT-T-T- -----ATACA CAGAAGTGTA 11641 |||||||||| || | || | ||||| || | | ||||| |||||||||| AAATCTCGCT CGCCACTCTC CATTCTCGCT CGCCTATCTC GCTATATACA CAGAAGTGTA 487 T-AATTCTGT TACTGTTTTG TATAAAGCGA GAGAAAATTG TATATACACA TGCAAAAATG 11582 | |||||||| | || ||||| |||||||||| |||||||||| |||||||||| || |||||| TAAATTCTGT TTCTATTTTG TATAAAGCGA GAGAAAATTG TATATACACA TGTAAAAATA 547 TATCTCTTCG TGTTATACAC TTAATTATAC AATTTACAAA CATTTTACTT CAAATATTGC 11522 ||| |||||| |||||||||| |||||||||| || ||||||| |||||||||| |||||||||| TATATCTTCG TGTTATACAC TTAATTATAC AAATTACAAA CATTTTACTT CAAATATTGC 607 AAAGAAAAAG GCCAACGAAT TATACAATTG TGAATTATAC AATTGCAGTG AAATACAATT 11462 | |||||||| |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| AGAGAAAAAG GCCAACGAAT TATACAATTG TGAATTATAT AATTGCAGTG AAATACAATT 667 TTCTCTAGCT TTATACAACA GAAGTGTATA TATTGTGTTT CTGTTTTTAT ATAAAGCGAG 11402 ||||||| || |||||||||| |||||||||| |||||||||| ||||||| | |||||||||| TTCTCTAACT TTATACAACA GAAGTGTATA TATTGTGTTT TTGTTTTTGT ATAAAGCGAG 727 --AAAAACAT ATATCTTCTT GCTATACACT TATAATTATG CAATATACGT ACATTTTAAT 11344 |||||||| |||| ||||| || ||||||| |||||||||| |||||||| | |||||||||| ACAAAAACAT ATATATTCTT GCCATACACT TATAATTATG CAATATACAT ACATTTTAAT 787 TCGATTCAAC TGTATGCAAA GCAAA-TTAT --ACA-ATTG CAGCGAAATA AGTCAGTGAA 11288 |||||||||| ||||||||| ||||| |||| | | |||| |||||||||| || ||| || TCGATTCAAC GGTATGCAAA GCAAATTTAT AAAAATATTG CAGCGAAATA AG-CAGCGAT 846 TTATACAATT TAGGCCATCG AATTATACAA TTGTATATGT ATAGCGAATT ATACATTTTC 11228 |||||| | | | | ||| | || | || || |||||||||| ||||| TTATACGA-T T-GTGCATTA CACAATCGCA GTGAATGAAG ATAGCGAATT ATACA..... 899 TATGTTTGCT ATGGAGCGCA ATTATACAAA CTTTGCTATA GCGTACAAAT ATAAATTTTT 11168 .......... .......... .......... .......... .......... .......... 899 TATTTGCTAT ATGTGAAAAT TGTCCTTTTT AATTTTTGCC TTTCGCGCTT TTAAGTAACA 11108 .......... .......... .......... .......... .......... .......... 899 AAAAGCTTAG CCCAAAATAT TCCTAACCTT GAAAAAAGGC ACAAGTGCAG CGTATATACA 11048 .......... .......... .......... .......... .......... .......... 899 TAAGTTATGC CTTATAAGAC ATGAATTAGT TCCTACAGAA CTTTTATCGA TAGAGGCATA 10988 .......... .......... .......... .......... .......... .......... 899 AAATTAGTTC TAGGCTTATG TCATAAGCAG TTCATAACCA ACAAATTATA TCTCATCACT 10928 .......... .......... .......... .......... .......... .......... 899 AAGACATAAC TTGGTTCGTA TGAAACTTAT ATCAATAGAG GCATAAGTCC GTGCCTCAAG 10868 .......... .......... .......... .......... .......... .......... 899 AGGCAGTAAC AAAATTAGAT ACTACAAAAC TTATATCGTA CAAAATAAAA GTTGTTTAAA 10808 .......... .......... .......... .......... .......... .......... 899 CTCATGCATT ACGAAATAAA TTAGGTCATA GTCAAGGACT GATTAACCTG TAAATACAAA 10748 .......... .......... .......... .......... .......... .......... 899 AAAACTTGAT GAAACACCAT AATGTTTTGG TTAGGTTCAA TTCAAAATTA ACAAAAATTT 10688 .......... .......... .......... .......... .......... .......... 899 AATTAAGCAC AAGTGTTATT TTTCCCTCCA CTAATCTAAA ATCTTGATCC ACTATAATAG 10628 .......... .......... .......... .......... .......... .......... 899 GAAGAGCTAT TACACATAAC ATACTTGTAT CAATCCCAAA CAAAAAATAA TACTATTAAA 10568 .......... .......... .......... .......... .......... .......... 899 AATTTGTTTT GATTATGGAG TTAATTAGAA GAAGAAATGC ACCTCATCAA AATGAAATTC 10508 .......... .......... .......... .......... .......... .......... 899 CTCCCCTGGA ACTACTTGAC GAGGAAGATG ATAAGAAAAG TAAGAGGAAG ATTGCAACGA 10448 .......... .......... .......... .......... .......... .......... 899 AGCAGAAATG GTCATGTATC GATAGTTGTT GTTGGTTTGT AGGATACATT TGTACTGTAT 10388 .......... .......... .......... .......... .......... .......... 899 GGTGGATTTT ATTATTTTTG TACAATGCTA TGCCAGCTTC GTTTCCACAG TACGTAACAG 10328 .......... .......... .......... .......... .......... .......... 899 AGAAGATTAA TGGGCCAGTA GCTGATCCTC CTGGCGTAAA GCTACGAAAT GAAGGGCTAA 10268 .......... .......... .......... .......... .......... .......... 899 AGGTTAAACA TCCAGTAGTT TTTGTACCTG GGATTGTTAC TTGTGGCCTT GAGCTATGGG 10208 .......... .......... .......... .......... .......... .......... 899 AGGGACATCA GTGTGCTGAA GGATTGTTTC GAAAGCGGTT ATGGGGTGGT ACTTTTGGCG 10148 .......... .......... .......... .......... .......... .......... 899 AAGTGTATAA AAGGTCAGAG ACGAATCCAG AATTTGAAAT TGTTAGGTTC AATCTATAAA 10088 .......... .......... .......... .......... .......... .......... 899 GTTGTTATGA TCAAATTTCA TAAAAGATTA TGAGTTCAAA CTTCATGTTT TTCGGAATTT 10028 .......... .......... .......... .......... .......... .......... 899 TAATGAAAAT AAACTTATGC TCTGTGTTAA AAGTATTGAG TTCAGATGAA CCTGGTATGT 9968 .......... .......... .......... .......... .......... .......... 899 TTTTATAGAT CGATGCAGGG GAATAAACTT CATTGGTATT AATAGTATGA GATGAGATCT 9908 .......... .......... .......... .......... .......... .......... 899 AGGATATGGG TACTATCTCA AAATAAATAA ACTTATATTT GATAAAATTT TTAAATATAT 9848 .......... .......... .......... .......... .......... .......... 899 ATAGATGATA TGAGTTATGG TGCTACTGAA TAGCTACAAT TGCAAAAAGG AAAAAAGGTT 9788 .......... .......... .......... .......... .......... .......... 899 TATTTTAATG ATTATCTGTT TGTGTGATGG TAACCAGACC GTTTTGTTGG GCGGAACACA 9728 .......... .......... .......... .......... .......... .......... 899 TGTCATTGGA CAATGAATCT GGGTTGGATC CTCCGGGAAT ACGGGTTAGG CCAGTTGCTG 9668 .......... .......... .......... .......... .......... .......... 899 GACTTGTTGC AGCAGATTAC TTTGCACCAG GATATTTTGT GTGGGCAGTT TTGATTGCTA 9608 .......... .......... .......... .......... .......... .......... 899 ATTTGGCGCG AATAGGATAT GAGGAGAAAA CGATGTATAT GGCTGCATAT GACTGGAGAC 9548 .......... .......... .......... .......... .......... .......... 899 TATCCATTCA GAATACTGAG GTATAGATTT AACTTTCTGT ATGCTTGAGC ATTGTTGTTT 9488 .......... .......... .......... .......... .......... .......... 899 CCTTAATCCA CTTAAAAAGT CGTTGTAATG TGTCAATGGT GAAATCATTT CATCAGGTGC 9428 .......... .......... .......... .......... .......... .......... 899 GCGACCAGAC ACTAAGCCAG ATAAAAAGCA ATATAGAACT GATGGTTGCA ACTAATGGAG 9368 .......... .......... .......... .......... .......... .......... 899 GCAATAAGGC AGTAATTGTT CCACATTCTA TGGGAGCTAT TTACTTTTTG TATTTCATGA 9308 .......... .......... .......... .......... .......... .......... 899 AGTGGGTCGA GGCACCAGCT CCGATGGGTG GTGGTGGTGG TCCTGATTGG TGTGCCAAAC 9248 .......... .......... .......... .......... .......... .......... 899 ATATTAAAGC AGTGATGAAT ATTGGTGCGC CGTTTCTAGG TGTTCCTAAA GCATTAGCTG 9188 .......... .......... .......... .......... .......... .......... 899 CACTTTTCTC AGCTGAAGCT CGAGATGTCG CTATTGCAAG GTAAAATTGG TTCATGTGGA 9128 .......... .......... .......... .......... .......... .......... 899 TGTTTTCTCC TTGACAACCA ACTGAAGAAA TAGTACATCT GGTATAGAGC TAAATCATTT 9068 .......... .......... .......... .......... .......... .......... 899 CTTGGCTTCT GCACTTGACT GAGAAGCAAT ACCTCAAAAT TAATCCTAAA ATGCTCCTCT 9008 .......... .......... .......... .......... .......... .......... 899 AGTTCCGAGT GTTGGGTTTT TAAAGGTTTG CTAATCTGAA TACAGGAGAT AACACCGAGC 8948 .......... .......... .......... .......... .......... .......... 899 TTGGTAATTA AGGCCGTGAG AAATTCTTTC CATATCTGTT TCTTGAGCTA GATTTGAGTA 8888 .......... .......... .......... .......... .......... .......... 899 TGTGTTGGTG CTTTGACGTG TGAGTAGTAA TTGCCGTTAC ACATGTGGTT TACATCCTTA 8828 .......... .......... .......... .......... .......... .......... 899 TTTCCTTTTC CCTCAACTTT GCACTACGAA TCATTGATCT TCTAAGAGAT GCTATTTCAT 8768 .......... .......... .......... .......... .......... .......... 899 AGCCTTTTTA TCTTTCAACA CTAGATGTTA TGTACAAAAT ACAAGTCCTT GAATGCTTCC 8708 .......... .......... .......... .......... .......... .......... 899 GCGTGTAGTC TGTTCCTTCT TCATTGGTAG TCTTCTTGAT CATCACTAGT TTACTTAAAT 8648 .......... .......... .......... .......... .......... .......... 899 TTCAGGAGTA AAGCATCAGT TGTTATGGAC AAGGATTTAT TTCGTATTCA AACACTACCA 8588 .......... .......... .......... .......... .......... .......... 899 CATTTAATGA GGATGCTTCG GACTTGGGAT TCAACCATGT CTATGTTACC AAAAGGAGGA 8528 .......... .......... .......... .......... .......... .......... 899 GAGACGATTT GGGGTGGTCT TGACTGGTCT CCAGAAGAAG GCTATTCTCC TCGCAAAAGA 8468 .......... .......... .......... .......... .......... .......... 899 AAACTAAGGG ACAAAACTAG TCATACGTCA AGCCATCAGG ACAATCAAAC TGTAGAATCT 8408 .......... .......... .......... .......... .......... .......... 899 AAAGGAAAAC ATGTTAATTA TGGAAGGATG ATATCATTTG GAAAGGTTGC AGCACAGAAA 8348 .......... .......... .......... .......... .......... .......... 899 CCTTCATCAG ATATTACTAG GATTGACTTC AGAGTAATGT TAACCAGAAA CCAATGCTCC 8288 .......... .......... .......... .......... .......... .......... 899 TTATTTATCG TTCTCTACAT ATTTCTATGC TTTCTTCTAT TCGTTTCAAC ATGTTAAATA 8228 .......... .......... .......... .......... .......... .......... 899 AGGCTCACAA ACATATATTT TTGTGCTTGA GAAAGACCTT TGTGAGGCAT AGCTTTGGTA 8168 .......... .......... .......... .......... .......... .......... 899 GACCTGTCAT CTAACATTTT GATACTTATA AAGCAACAAA GAATAACTAC AACAATTCAA 8108 .......... .......... .......... .......... .......... .......... 899 TCGAAACAAA AATATGATGA ACATTAACTC AGTAGTACAG TAGGAAATTA TCTCTTCCCC 8048 .......... .......... .......... .......... .......... .......... 899 ACTTTCTTCT TGTTGTCTCT TAGTAGCTGT GTTTGTTTAC ATGCAGGGTG CAGTGAAGGG 7988 || ||| ||| .......... .......... .......... .......... ......ATTG CAGCGAAATA 913 CACGAACAAA GCAAATAACA CATGTGATGT GTGGACGGAG TACTATGACA TGGGCGTTGC 7928 | | |||| | | || GGCCATCAAA TTATACAA.. .......... .......... .......... .......... 931 TGGTATAAAA GCTGTGGAAG AATACAAGGT TTATACAGCT GGAGATATAT TGGATCTACT 7868 .......... .......... .......... .......... .......... .......... 931 CCACTTTGTT GCCCCAAAGA TGATGGCTCG TGGAGGCGCT CATTTTTCAT ATGGGATAGC 7808 .......... .......... .......... .......... .......... .......... 931 TGAAGATTTG GATGATCCAA TGTATTCACA CTACAAATAC TGGTCAAATC CGTTGGAAAC 7748 .......... .......... .......... .......... .......... .......... 931 AAAGTGAGTA CTTTTCATTT GAACTCTGTC TCCGTACTTG TTGTCTTTGT ACAAAGGATA 7688 .......... .......... .......... .......... .......... .......... 931 TAGTCTACGA GTAGAAAATG ACCTCGAGTT CATTTTATCT TTACCTTGTT AACACTACAG 7628 .......... .......... .......... .......... .......... .......... 931 GCTACCAAAT GCTCCTGAAA TGGAGCTTTA TTCGATGTAT GGAGTTGGCA TTCCAACTGA 7568 .......... .......... .......... .......... .......... .......... 931 AAGAGCATAT GTTTACGGGC AAACACCAAT AGCACAATGT CATATTCCAT TCCAGATTGA 7508 .......... .......... .......... .......... .......... .......... 931 AACTTCAGCT GATGAAGGGA ATGAGTGTTG TATGAAGAAT GGTGTTTTGA CTGTTGATGG 7448 .......... .......... .......... .......... .......... .......... 931 CGATGAGACG GTGCCTATTT TAAGTGCAGG CTTCATGTGT GCAAAAGGAT GGCGCGGAAA 7388 .......... .......... .......... .......... .......... .......... 931 AACTAGATTT AATCCATCAG GAATCAAAAC TTATACAAGG GAGTATGATC ATGCTCCTCC 7328 .......... .......... .......... .......... .......... .......... 931 CGCAAACCTT CTTGAGGGTC GTGGTACACA GAGTGGAAAT CATGTTGATA TAATGGGAAA 7268 .......... .......... .......... .......... .......... .......... 931 TTTCGCTTTG ATTGAAGATA TCATGAGAGT TGCAGCTGGT GCAACGGGCA AAGACTTGGG 7208 .......... .......... .......... .......... .......... .......... 931 AGGTGATCAA GTTCACTCGG ACATCTTTAA GTGGTCTGAG AAGATTGATT TACGTCTTTA 7148 .......... .......... .......... .......... .......... .......... 931 GGGGAACACT GGGTGCACTG CTTTTCTCTT ATATGATGAA GTACTTGACT CAACGGTTAG 7088 .......... .......... .......... .......... .......... .......... 931 AATATCACTT TCATATCTAC ATAATTGTGA TGGTGATCTA TCTTTACGTC TTTAAGTTTC 7028 .......... .......... .......... .......... .......... .......... 931 ATCATGTCAA GATGTCTGCT TCGTGTATAA TCACAAAGTT ATATGCTGAT ATGAGTTCAT 6968 .......... .......... .......... .......... .......... .......... 931 CTGGACGCCA CACAAACCCT TCCTGTCAAA TGTATGGACA ACAACATACC CTGTATAATC 6908 .......... .......... .......... .......... .......... .......... 931 CCACTCATGG GAGATACTAA ATTGTGAAAT TATGTCATGA ATCTACCACT GACGTTTGGA 6848 .......... .......... .......... .......... .......... .......... 931 GAGCCCTAGC AACACTTAAC TTTTATCCGA CTATGTAGAC GATTAAATGG ATTCCAGATA 6788 .......... .......... .......... .......... .......... .......... 931 AAATGCAGGA TGTTTTGTGT ATCTCGTAGT GTAATATGTC AAAGTTGGAG CTAGTTGCCC 6728 .......... .......... .......... .......... .......... .......... 931 AAATCAGGAA CAATTAGGGA TAGAACTGAT ATTGTACCAA CATAGTTTTC TCAAAGATAT 6668 .......... .......... .......... .......... .......... .......... 931 TGTATTTCAA ATGTGTTGTT ATACAGACTT GTGGAATAGC TACTCAATGA ATGAACAACT 6608 .......... .......... .......... .......... .......... .......... 931 TTTCTTCTTT ATTTGTGATA ACATGAATTT CTTGGATTCT TCAGTTCAGT TGAATGTACT 6548 || || | | | | .......... .......... .......... .......... ....TTTAGG CCAGCG-AAT 946 TATTATAATG GAGTATTCTT AAAAAAAAGT TGATTTTCCA TTATTATACG TCGAAATAAT 6488 ||| | || | TATACAATTG TA........ .......... .......... .......... .......... 958 ACTGTAGAAA TGCATGATAA GTCTATGTAT AATAGACTTT TATAGTCAAA TCCTTTTCCG 6428 .......... .......... .......... .......... .......... .......... 958 AACTCCACAA GTAAAGAAAA CTTAATGTAT CGAGTTCAGA AGAAACTGAA AAATTGAAAA 6368 .......... .......... .......... .......... .......... .......... 958 AAGAAATGGC CTATGAAAAA ATGATTGGAT AGAGAGGAGG TCTGACATAT GGGTTTGGGA 6308 .......... .......... .......... .......... .......... .......... 958 ATACATCACT AAAAGGCATC AGTGAAGCTC ATCGGATTCT TTGTCAAATA GGTATATATA 6248 .......... .......... .......... .......... .......... .......... 958 TACTATCACA TAATGTTTGA CTTTGCAAGT GTAACAGATT ATTACGGCAG TCGAGTCAGA 6188 .......... .......... .......... .......... .......... .......... 958 ATTTTTAATA AGGGATTTAA ATTTTGAAAA AATAGACATA TGGATACTAT AGAAATCTAA 6128 .......... .......... .......... .......... .......... .......... 958 AAAATTGTTT TAACCATGTA AATGATTAAT TTTTCGTCAA AAGGGATTTG AAATGAACAA 6068 .......... .......... .......... .......... .......... .......... 958 CCTGACTATA ACGTAGCTTC GCCACTGATT ACTATATAAT TGCTAAATTT AATAACTTGA 6008 .......... .......... .......... .......... .......... .......... 958 AAAATAAAGC AAACAACCTT TTATAACGAG TCAAATTTTA TTTATATTAT AAAAACTCAT 5948 .......... .......... .......... .......... .......... .......... 958 TCATTATAGT GTATATGTCT AATACACGCA AGCTACAGTT TATATTATAA ATAATATAGA 5888 .......... .......... .......... .......... .......... .......... 958 AATAATCAAT TTAGTTATGC GCTTAGAAAT GATCAAAATA CGTTTTGAAT ATAAAAAGAA 5828 .......... .......... .......... .......... .......... .......... 958 GAAATTGAGG AACATGTCAA TTTTTATGTT TGATTTATTT AATGAAAGAT TATAAAAGGA 5768 .......... .......... .......... .......... .......... .......... 958 ATATTGTTTT ATTTAAAATA CGAACTTTAA AATGTCAATT TATATGTTCC ATGTAAGCAA 5708 .......... .......... .......... .......... .......... .......... 958 ATTTTTGTCA TTTTTAATTG GATCATATAA ATCGTTGTTT AAGAATAGTT ATATTATTAA 5648 .......... .......... .......... .......... .......... .......... 958 TCCTAATAAT GCATTTTTAC TTTATTTAAA ATGACATATA AATTTCTGCA TATGTCTATC 5588 .......... .......... .......... .......... .......... .......... 958 ATGCAACAAG AAAAATAAAA CCCAAACAAA TAACGTAACC AAAGAAAAAA GATTTCTCAT 5528 .......... .......... .......... .......... .......... .......... 958 TTAGGATTAA GAAAATTTAT GTACGCTGAT AAAATTAAAA ATGAAGAAAA CTTTCTAATT 5468 .......... .......... .......... .......... .......... .......... 958 CGTTAAGTAA TATTTTTTAA CTCTAATTAG CTACTAAATA ATATCCAATT CTATAGATGT 5408 .......... .......... .......... .......... .......... .......... 958 AAATATCCTT GCATATATGC TCTGATCATT TTTGCTATTC GCTATCTTAT TATTATTATT 5348 .......... .......... .......... .......... .......... .......... 958 TTGGTTTGTT TTTTGAAAAT GAACTTTAAT TTAGAAGAGA TAGTTATATA TATTCTTCGT 5288 .......... .......... .......... .......... .......... .......... 958 AATTTCACTG AATTTATTAT TTTCGTAATT ATTGATACAG ACAAAGGTGT TTAAACGGAC 5228 .......... .......... .......... .......... .......... .......... 958 ATCGTAATGG AGTGTGTGTG AGAGAGAACC CATTTTGAGG AATTCGGGGG CAATTTCAGT 5168 .......... .......... .......... .......... .......... .......... 958 TTCTCGGTGG CGGAAGACGC GCAGAGACTT CACCCTTTCA ATTCCGTTGA CACATATACA 5108 .......... .......... .......... .......... .......... .......... 958 GAGGTGTCGT TTCTGATAAT CAATTTCACT TTCTCTTTGC TACGATCGCT ACATTCTCTC 5048 .......... .......... .......... .......... .......... .......... 958 TCTCTCTTTG TAAGTATCGC CTTTTTAATC TACTTGTTTA TTGATGGATC ATGTAGTTTG 4988 | || .......... .......... .......... .......... .......... ......TATG 962 AATCGCACAA AAATATGGGT CTAGGGTTTT CTAATCGAGC TCA 4945 || || || ||| | | || |||| || || |||| || TATAGC-GAA TTATACAGTT TTA-TGTTTG CT-ATGGAGC GCA 1002 hqPGS_C06HBa0120H21.1-4-_SGN-U315404- (11798 11233) Total number of EST alignments reported: 7 ________________________________________________________________________________ Predicted gene locations (3) in segment 1 to 12097: PGL 1 (- strand): 5269 898 AGS-1 (5269 5039,2122 1934,1863 1717,1458 1325,1088 898) SCR (e 1.000 d 0.980 a 0.855,e 1.000 d 0.614 a 0.357,e 1.000 d 0.969 a 0.994,e 1.000 d 0.980 a 0.990,e 0.990) Exon 1 5269 5039 ( 231 n); score: 1.000 Intron 1 5038 2123 (2916 n); Pd: 0.980 Pa: 0.855 Exon 2 2122 1934 ( 189 n); score: 1.000 Intron 2 1933 1864 ( 70 n); Pd: 0.614 Pa: 0.357 Exon 3 1863 1717 ( 147 n); score: 1.000 Intron 3 1716 1459 ( 258 n); Pd: 0.969 Pa: 0.994 Exon 4 1458 1325 ( 134 n); score: 1.000 Intron 4 1324 1089 ( 236 n); Pd: 0.980 Pa: 0.990 Exon 5 1088 898 ( 191 n); score: 0.990 PGS (5269 5039,2122 1934,1863 1717,1458 1325,1088 898) SGN-U321764+ PGS (964 898) SGN-U337451+ 3-phase translation of AGS-1 (-strand): . . . . . . 5269 ATTTTCGTAATTATTGATACAGACAAAGGTGTTTAAACGGACATCGTAATGGAGTGTGTG I F V I I D T D K G V - T D I V M E C V F S - L L I Q T K V F K R T S - W S V C F R N Y - Y R Q R C L N G H R N G V C . . . . . . 5209 TGAGAGAGAACCCATTTTGAGGAATTCGGGGGCAATTTCAGTTTCTCGGTGGCGGAAGAC - E R T H F E E F G G N F S F S V A E D E R E P I L R N S G A I S V S R W R K T V R E N P F - G I R G Q F Q F L G G G R . . . . . . 5149 GCGCAGAGACTTCACCCTTTCAATTCCGTTGACACATATACAGAGGTGTCGTTTCTGATA A Q R L H P F N S V D T Y T E V S F L I R R D F T L S I P L T H I Q R C R F - - R A E T S P F Q F R - H I Y R G V V S D . . . . . . : 5089 ATCAATTTCACTTTCTCTTTGCTACGATCGCTACATTCTCTCTCTCTCTTT : TGTGTAGAC I N F T F S L L R S L H S L S L F : C V D S I S L S L C Y D R Y I L S L S F : V - T N Q F H F L F A T I A T F S L S L : L C R . . . . . . 2113 TGAACCCGCGCTGAACTGCATTTCGCCTATAAATTAATATATTTCTTAGGAAGATGCAGC - T R A E L H F A Y K L I Y F L G R C S E P A L N C I S P I N - Y I S - E D A A L N P R - T A F R L - I N I F L R K M Q . . . . . . 2053 AACCCGTGCAGCCAAGGTCTTCTGCCAATGGATATGGCCGTCGTAAAGTTGATAGAGAAA N P C S Q G L L P M D M A V V K L I E K T R A A K V F C Q W I W P S - S - - R N Q P V Q P R S S A N G Y G R R K V D R E . . . . . . : 1993 TGGGTACTAAGTTGGAGAATAAAGCGCAATCTGGAAAAACTACTTCTCGTCAATTTACAG : W V L S W R I K R N L E K L L L V N L Q : G Y - V G E - S A I W K N Y F S S I Y R : M G T K L E N K A Q S G K T T S R Q F T : . . . . . . 1863 GTAAAGGGGGAGCATATCAAAGCCTGTCACATGATCGACTAGTTTATTTCACTACCTGTC V K G E H I K A C H M I D - F I S L P V - R G S I S K P V T - S T S L F H Y L S G K G G A Y Q S L S H D R L V Y F T T C . . . . . . 1803 TTGTTGGACATCAAGTGGAAGTACAAGTGATGGACGGATCAGTGTTTTCAGGGATACTTC L L D I K W K Y K - W T D Q C F Q G Y F C W T S S G S T S D G R I S V F R D T S L V G H Q V E V Q V M D G S V F S G I L . . . : . . . 1743 ATGCGACAAACGCTGAAAAAGATTTTG : GTATCATTCTGAAAATGGCGCAGTTGATAAAAG M R Q T L K K I L : V S F - K W R S - - K C D K R - K R F W : Y H S E N G A V D K R H A T N A E K D F : G I I L K M A Q L I K . . . . . . 1425 ATAGCTCTGAGGGGATGAAGAGTAGTTCTGAAACTTTTAGCAAGCCTCCATTAAAGACTT I A L R G - R V V L K L L A S L H - R L - L - G D E E - F - N F - Q A S I K D F D S S E G M K S S S E T F S K P P L K T . . . . . : . 1365 TGATAATACCGGGTAAAGAGTTTGCTCAAGTTACAGCAAAG : GGTGTGCCTACAACTCTAG - - Y R V K S L L K L Q Q R : V C L Q L - D N T G - R V C S S Y S K : G C A Y N S R L I I P G K E F A Q V T A K : G V P T T L . . . . . . 1069 ACGGTTTCAGAACAGAATTCATGCTGGAACAGCAGCAGGAACTTTTGACTGATTCATGCA T V S E Q N S C W N S S R N F - L I H A R F Q N R I H A G T A A G T F D - F M H D G F R T E F M L E Q Q Q E L L T D S C . . . . . . 1009 TTTCACAATCTCGGCATATTGAGGTAGAGCGGCAATTGGAACGCTGGGTACCTGATGATG F H N L G I L R - S G N W N A G Y L M M F T I S A Y - G R A A I G T L G T - - - I S Q S R H I E V E R Q L E R W V P D D . . . . . . 949 ATGCTCCTGAATGTCCTGATCTGGACAATATATTTGATGACCATTGGAATAG M L L N V L I W T I Y L M T I G I C S - M S - S G Q Y I - - P L E - D A P E C P D L D N I F D D H W N Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0120H21.1-4-_PGL-1_AGS-1_PPS_1 (2081 1934,1863 1717,1458 1325,1088 900) (frame '0'; 618 bp, 206 residues) 1 INIFLRKMQQ PVQPRSSANG YGRRKVDREM GTKLENKAQS GKTTSRQFTG KGGAYQSLSH 61 DRLVYFTTCL VGHQVEVQVM DGSVFSGILH ATNAEKDFGI ILKMAQLIKD SSEGMKSSSE 121 TFSKPPLKTL IIPGKEFAQV TAKGVPTTLD GFRTEFMLEQ QQELLTDSCI SQSRHIEVER 181 QLERWVPDDD APECPDLDNI FDDHWN PGL 2 (- strand): 10484 7154 AGS-1 (10484 10135,9750 9528,9431 9148,8642 8315,8001 7745,7627 7154) SCR (e 0.791 d 0.210 a 0.966,e 0.843 d 0.992 a 0.952,e 0.856 d 0.996 a 0.915,e 0.750 d 0.959 a 1.000,e 0.833 d 0.966 a 0.934,e 0.834) Exon 1 10484 10135 ( 350 n); score: 0.791 Intron 1 10134 9751 ( 384 n); Pd: 0.210 Pa: 0.966 Exon 2 9750 9528 ( 223 n); score: 0.843 Intron 2 9527 9432 ( 96 n); Pd: 0.992 Pa: 0.952 Exon 3 9431 9148 ( 284 n); score: 0.856 Intron 3 9147 8643 ( 505 n); Pd: 0.996 Pa: 0.915 Exon 4 8642 8315 ( 328 n); score: 0.750 Intron 4 8314 8002 ( 313 n); Pd: 0.959 Pa: 1.000 Exon 5 8001 7745 ( 257 n); score: 0.833 Intron 5 7744 7628 ( 117 n); Pd: 0.966 Pa: 0.934 Exon 6 7627 7154 ( 474 n); score: 0.834 PGS (10484 10135,9750 9528,9431 9148,8642 8315,8001 7745,7627 7154) SGN-U320342+ 3-phase translation of AGS-1 (-strand): . . . . . . 10484 GAAGATGATAAGAAAAGTAAGAGGAAGATTGCAACGAAGCAGAAATGGTCATGTATCGAT E D D K K S K R K I A T K Q K W S C I D K M I R K V R G R L Q R S R N G H V S I R - - E K - E E D C N E A E M V M Y R . . . . . . 10424 AGTTGTTGTTGGTTTGTAGGATACATTTGTACTGTATGGTGGATTTTATTATTTTTGTAC S C C W F V G Y I C T V W W I L L F L Y V V V G L - D T F V L Y G G F Y Y F C T - L L L V C R I H L Y C M V D F I I F V . . . . . . 10364 AATGCTATGCCAGCTTCGTTTCCACAGTACGTAACAGAGAAGATTAATGGGCCAGTAGCT N A M P A S F P Q Y V T E K I N G P V A M L C Q L R F H S T - Q R R L M G Q - L Q C Y A S F V S T V R N R E D - W A S S . . . . . . 10304 GATCCTCCTGGCGTAAAGCTACGAAATGAAGGGCTAAAGGTTAAACATCCAGTAGTTTTT D P P G V K L R N E G L K V K H P V V F I L L A - S Y E M K G - R L N I Q - F L - S S W R K A T K - R A K G - T S S S F . . . . . . 10244 GTACCTGGGATTGTTACTTGTGGCCTTGAGCTATGGGAGGGACATCAGTGTGCTGAAGGA V P G I V T C G L E L W E G H Q C A E G Y L G L L L V A L S Y G R D I S V L K D C T W D C Y L W P - A M G G T S V C - R . . . . . : . 10184 TTGTTTCGAAAGCGGTTATGGGGTGGTACTTTTGGCGAAGTGTATAAAAG : ACCGTTTTGT L F R K R L W G G T F G E V Y K R : P F C C F E S G Y G V V L L A K C I K : D R F V I V S K A V M G W Y F W R S V - K : T V L . . . . . . 9740 TGGGCGGAACACATGTCATTGGACAATGAATCTGGGTTGGATCCTCCGGGAATACGGGTT W A E H M S L D N E S G L D P P G I R V G R N T C H W T M N L G W I L R E Y G L L G G T H V I G Q - I W V G S S G N T G . . . . . . 9680 AGGCCAGTTGCTGGACTTGTTGCAGCAGATTACTTTGCACCAGGATATTTTGTGTGGGCA R P V A G L V A A D Y F A P G Y F V W A G Q L L D L L Q Q I T L H Q D I L C G Q - A S C W T C C S R L L C T R I F C V G . . . . . . 9620 GTTTTGATTGCTAATTTGGCGCGAATAGGATATGAGGAGAAAACGATGTATATGGCTGCA V L I A N L A R I G Y E E K T M Y M A A F - L L I W R E - D M R R K R C I W L H S F D C - F G A N R I - G E N D V Y G C . . . . : . . 9560 TATGACTGGAGACTATCCATTCAGAATACTGAG : GTGCGCGACCAGACACTAAGCCAGATA Y D W R L S I Q N T E : V R D Q T L S Q I M T G D Y P F R I L R : C A T R H - A R - I - L E T I H S E Y - : G A R P D T K P D . . . . . . 9404 AAAAGCAATATAGAACTGATGGTTGCAACTAATGGAGGCAATAAGGCAGTAATTGTTCCA K S N I E L M V A T N G G N K A V I V P K A I - N - W L Q L M E A I R Q - L F H K K Q Y R T D G C N - W R Q - G S N C S . . . . . . 9344 CATTCTATGGGAGCTATTTACTTTTTGTATTTCATGAAGTGGGTCGAGGCACCAGCTCCG H S M G A I Y F L Y F M K W V E A P A P I L W E L F T F C I S - S G S R H Q L R T F Y G S Y L L F V F H E V G R G T S S . . . . . . 9284 ATGGGTGGTGGTGGTGGTCCTGATTGGTGTGCCAAACATATTAAAGCAGTGATGAATATT M G G G G G P D W C A K H I K A V M N I W V V V V V L I G V P N I L K Q - - I L D G W W W W S - L V C Q T Y - S S D E Y . . . . . . 9224 GGTGCGCCGTTTCTAGGTGTTCCTAAAGCATTAGCTGCACTTTTCTCAGCTGAAGCTCGA G A P F L G V P K A L A A L F S A E A R V R R F - V F L K H - L H F S Q L K L E W C A V S R C S - S I S C T F L S - S S . . : . . . . 9164 GATGTCGCTATTGCAAG : GAGTAAAGCATCAGTTGTTATGGACAAGGATTTATTTCGTATT D V A I A R : S K A S V V M D K D L F R I M S L L Q : G V K H Q L L W T R I Y F V F R C R Y C K : E - S I S C Y G Q G F I S Y . . . . . . 8599 CAAACACTACCACATTTAATGAGGATGCTTCGGACTTGGGATTCAACCATGTCTATGTTA Q T L P H L M R M L R T W D S T M S M L K H Y H I - - G C F G L G I Q P C L C Y S N T T T F N E D A S D L G F N H V Y V . . . . . . 8539 CCAAAAGGAGGAGAGACGATTTGGGGTGGTCTTGACTGGTCTCCAGAAGAAGGCTATTCT P K G G E T I W G G L D W S P E E G Y S Q K E E R R F G V V L T G L Q K K A I L T K R R R D D L G W S - L V S R R R L F . . . . . . 8479 CCTCGCAAAAGAAAACTAAGGGACAAAACTAGTCATACGTCAAGCCATCAGGACAATCAA P R K R K L R D K T S H T S S H Q D N Q L A K E N - G T K L V I R Q A I R T I K S S Q K K T K G Q N - S Y V K P S G Q S . . . . . . 8419 ACTGTAGAATCTAAAGGAAAACATGTTAATTATGGAAGGATGATATCATTTGGAAAGGTT T V E S K G K H V N Y G R M I S F G K V L - N L K E N M L I M E G - Y H L E R L N C R I - R K T C - L W K D D I I W K G . . . . . : . 8359 GCAGCACAGAAACCTTCATCAGATATTACTAGGATTGACTTCAGA : GGTGCAGTGAAGGGC A A Q K P S S D I T R I D F R : G A V K G Q H R N L H Q I L L G L T S E : V Q - R A C S T E T F I R Y Y - D - L Q : R C S E G . . . . . . 7986 ACGAACAAAGCAAATAACACATGTGATGTGTGGACGGAGTACTATGACATGGGCGTTGCT T N K A N N T C D V W T E Y Y D M G V A R T K Q I T H V M C G R S T M T W A L L H E Q S K - H M - C V D G V L - H G R C . . . . . . 7926 GGTATAAAAGCTGTGGAAGAATACAAGGTTTATACAGCTGGAGATATATTGGATCTACTC G I K A V E E Y K V Y T A G D I L D L L V - K L W K N T R F I Q L E I Y W I Y S W Y K S C G R I Q G L Y S W R Y I G S T . . . . . . 7866 CACTTTGTTGCCCCAAAGATGATGGCTCGTGGAGGCGCTCATTTTTCATATGGGATAGCT H F V A P K M M A R G G A H F S Y G I A T L L P Q R - W L V E A L I F H M G - L P L C C P K D D G S W R R S F F I W D S . . . . . . 7806 GAAGATTTGGATGATCCAATGTATTCACACTACAAATACTGGTCAAATCCGTTGGAAACA E D L D D P M Y S H Y K Y W S N P L E T K I W M I Q C I H T T N T G Q I R W K Q - R F G - S N V F T L Q I L V K S V G N . : . . . . . 7746 AA : GCTACCAAATGCTCCTGAAATGGAGCTTTATTCGATGTATGGAGTTGGCATTCCAACT K : L P N A P E M E L Y S M Y G V G I P T : S Y Q M L L K W S F I R C M E L A F Q L K : A T K C S - N G A L F D V W S W H S N . . . . . . 7569 GAAAGAGCATATGTTTACGGGCAAACACCAATAGCACAATGTCATATTCCATTCCAGATT E R A Y V Y G Q T P I A Q C H I P F Q I K E H M F T G K H Q - H N V I F H S R L - K S I C L R A N T N S T M S Y S I P D . . . . . . 7509 GAAACTTCAGCTGATGAAGGGAATGAGTGTTGTATGAAGAATGGTGTTTTGACTGTTGAT E T S A D E G N E C C M K N G V L T V D K L Q L M K G M S V V - R M V F - L L M - N F S - - R E - V L Y E E W C F D C - . . . . . . 7449 GGCGATGAGACGGTGCCTATTTTAAGTGCAGGCTTCATGTGTGCAAAAGGATGGCGCGGA G D E T V P I L S A G F M C A K G W R G A M R R C L F - V Q A S C V Q K D G A E W R - D G A Y F K C R L H V C K R M A R . . . . . . 7389 AAAACTAGATTTAATCCATCAGGAATCAAAACTTATACAAGGGAGTATGATCATGCTCCT K T R F N P S G I K T Y T R E Y D H A P K L D L I H Q E S K L I Q G S M I M L L K N - I - S I R N Q N L Y K G V - S C S . . . . . . 7329 CCCGCAAACCTTCTTGAGGGTCGTGGTACACAGAGTGGAAATCATGTTGATATAATGGGA P A N L L E G R G T Q S G N H V D I M G P Q T F L R V V V H R V E I M L I - W E S R K P S - G S W Y T E W K S C - Y N G . . . . . . 7269 AATTTCGCTTTGATTGAAGATATCATGAGAGTTGCAGCTGGTGCAACGGGCAAAGACTTG N F A L I E D I M R V A A G A T G K D L I S L - L K I S - E L Q L V Q R A K T W K F R F D - R Y H E S C S W C N G Q R L . . . . . . 7209 GGAGGTGATCAAGTTCACTCGGACATCTTTAAGTGGTCTGAGAAGATTGATTTACG G G D Q V H S D I F K W S E K I D L E V I K F T R T S L S G L R R L I Y G R - S S S L G H L - V V - E D - F T Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0120H21.1-4-_PGL-2_AGS-1_PPS_1 (10484 10135,9750 9528,9431 9148,8642 8315,8001 7745,7627 7156) (frame '1'; 1914 bp, 638 residues) 1 EDDKKSKRKI ATKQKWSCID SCCWFVGYIC TVWWILLFLY NAMPASFPQY VTEKINGPVA 61 DPPGVKLRNE GLKVKHPVVF VPGIVTCGLE LWEGHQCAEG LFRKRLWGGT FGEVYKRPFC 121 WAEHMSLDNE SGLDPPGIRV RPVAGLVAAD YFAPGYFVWA VLIANLARIG YEEKTMYMAA 181 YDWRLSIQNT EVRDQTLSQI KSNIELMVAT NGGNKAVIVP HSMGAIYFLY FMKWVEAPAP 241 MGGGGGPDWC AKHIKAVMNI GAPFLGVPKA LAALFSAEAR DVAIARSKAS VVMDKDLFRI 301 QTLPHLMRML RTWDSTMSML PKGGETIWGG LDWSPEEGYS PRKRKLRDKT SHTSSHQDNQ 361 TVESKGKHVN YGRMISFGKV AAQKPSSDIT RIDFRGAVKG TNKANNTCDV WTEYYDMGVA 421 GIKAVEEYKV YTAGDILDLL HFVAPKMMAR GGAHFSYGIA EDLDDPMYSH YKYWSNPLET 481 KLPNAPEMEL YSMYGVGIPT ERAYVYGQTP IAQCHIPFQI ETSADEGNEC CMKNGVLTVD 541 GDETVPILSA GFMCAKGWRG KTRFNPSGIK TYTREYDHAP PANLLEGRGT QSGNHVDIMG 601 NFALIEDIMR VAAGATGKDL GGDQVHSDIF KWSEKIDL PGL 3 (- strand): 12092 11140 AGS-1 (11540 11481,11328 11256,11225 11140) SCR (e 0.783 d 0.000 a 0.000,e 0.890 d 0.000 a 0.000,e 0.808) Exon 1 11540 11481 ( 60 n); score: 0.783 Intron 1 11480 11329 ( 152 n); Pd: 0.000 Pa: 0.000 Exon 2 11328 11256 ( 73 n); score: 0.890 Intron 2 11255 11226 ( 30 n); Pd: 0.000 Pa: 0.000 Exon 3 11225 11140 ( 86 n); score: 0.808 PGS (11540 11481,11328 11256,11225 11140) SGN-U335137+ 3-phase translation of AGS-1 (-strand): . . . . . . : 11540 ATTTTACTTCAAATATTGCAAAGAAAAAGGCCAACGAATTATACAATTGTGAATTATACA : I L L Q I L Q R K R P T N Y T I V N Y T : F Y F K Y C K E K G Q R I I Q L - I I Q : F T S N I A K K K A N E L Y N C E L Y : . . . . . . 11328 GCAAAGCAAATTATACAATTGCAGCGAAATAAGTCAGTGAATTATACAATTTAGGCCATC A K Q I I Q L Q R N K S V N Y T I - A I Q S K L Y N C S E I S Q - I I Q F R P S S K A N Y T I A A K - V S E L Y N L G H . . : . . . . 11268 GAATTATACAATT : TGTTTGCTATGGAGCGCAATTATACAAACTTTGCTATAGCGTACAAA E L Y N : L F A M E R N Y T N F A I A Y K N Y T I : C L L W S A I I Q T L L - R T N R I I Q F : V C Y G A Q L Y K L C Y S V Q . . . . 11178 TATAAATTTTTTATTTGCTATATGTGAAAATTGTCCTTT Y K F F I C Y M - K L S F I N F L F A I C E N C P I - I F Y L L Y V K I V L Maximal non-overlapping open reading frames (>= 64 codons): none AGS-2 (11509 11466,11274 11150) SCR (e 0.932 d 0.000 a 0.000,e 0.896) Exon 1 11509 11466 ( 44 n); score: 0.932 Intron 1 11465 11275 ( 191 n); Pd: 0.000 Pa: 0.000 Exon 2 11274 11150 ( 125 n); score: 0.896 PGS (11509 11466,11274 11150) SGN-U325194- 3-phase translation of AGS-2 (-strand): . . . . . : . 11509 CAACGAATTATACAATTGTGAATTATACAATTGCAGTGAAATAC : GCCATCGAATTATACA Q R I I Q L - I I Q L Q - N T : P S N Y T N E L Y N C E L Y N C S E I : R H R I I Q T N Y T I V N Y T I A V K Y : A I E L Y . . . . . . 11258 ATTGTATATGTATAGCGAATTATACATTTTCTATGTTTGCTATGGAGCGCAATTATACAA I V Y V - R I I H F L C L L W S A I I Q L Y M Y S E L Y I F Y V C Y G A Q L Y K N C I C I A N Y T F S M F A M E R N Y T . . . . . 11198 ACTTTGCTATAGCGTACAAATATAAATTTTTTATTTGCTATATGTGAAA T L L - R T N I N F L F A I C E L C Y S V Q I - I F Y L L Y V K N F A I A Y K Y K F F I C Y M - Maximal non-overlapping open reading frames (>= 64 codons): none AGS-3 (12092 11990,11806 11233) SCR (e 0.592 d 0.900 a 0.000,e 0.907) Exon 1 12092 11990 ( 103 n); score: 0.592 Intron 1 11989 11807 ( 183 n); Pd: 0.900 Pa: 0.000 Exon 2 11806 11233 ( 574 n); score: 0.907 PGS (11798 11233) SGN-U315404- PGS (12092 11990,11806 11732) SGN-U323393+ 3-phase translation of AGS-3 (-strand): . . . . . . 12092 CTCTCAACTAATTATGTGTATAGTTCATTTCGTTTTCACTTTGTTTCTTAGTAATATTCA L S T N Y V Y S S F R F H F V S - - Y S S Q L I M C I V H F V F T L F L S N I Q L N - L C V - F I S F S L C F L V I F . . . . . : . 12032 ACGATTTATTCAATTTTGATTAAATTGATGATCGAATCAATAA : ATTTTTAATTGGGAAAA T I Y S I L I K L M I E S I : N F - L G K R F I Q F - L N - - S N Q - : I F N W E N N D L F N F D - I D D R I N K : F L I G K . . . . . . 11789 TTGTATATAATAGCAAACTAATAACCTAAATTAAATGGAATAGCTAGGGTTTGATTTAAT L Y I I A N - - P K L N G I A R V - F N C I - - Q T N N L N - M E - L G F D L I I V Y N S K L I T - I K W N S - G L I - . . . . . . 11729 TGTGCTCCATAGCAACATTAGCTAAAATTTGCCAGTGTCTCTCTCCCAAAAATCTCGCTC C A P - Q H - L K F A S V S L P K I S L V L H S N I S - N L P V S L S Q K S R S L C S I A T L A K I C Q C L S P K N L A . . . . . . 11669 GCCTCTCTCGCTTTATACACAGAAGTGTATAATTCTGTTACTGTTTTGTATAAAGCGAGA A S L A L Y T E V Y N S V T V L Y K A R P L S L Y T Q K C I I L L L F C I K R E R L S R F I H R S V - F C Y C F V - S E . . . . . . 11609 GAAAATTGTATATACACATGCAAAAATGTATCTCTTCGTGTTATACACTTAATTATACAA E N C I Y T C K N V S L R V I H L I I Q K I V Y T H A K M Y L F V L Y T - L Y N R K L Y I H M Q K C I S S C Y T L N Y T . . . . . . 11549 TTTACAAACATTTTACTTCAAATATTGCAAAGAAAAAGGCCAACGAATTATACAATTGTG F T N I L L Q I L Q R K R P T N Y T I V L Q T F Y F K Y C K E K G Q R I I Q L - I Y K H F T S N I A K K K A N E L Y N C . . . . . . 11489 AATTATACAATTGCAGTGAAATACAATTTTCTCTAGCTTTATACAACAGAAGTGTATATA N Y T I A V K Y N F L - L Y T T E V Y I I I Q L Q - N T I F S S F I Q Q K C I Y E L Y N C S E I Q F S L A L Y N R S V Y . . . . . . 11429 TTGTGTTTCTGTTTTTATATAAAGCGAGAAAAACATATATCTTCTTGCTATACACTTATA L C F C F Y I K R E K H I S S C Y T L I C V S V F I - S E K N I Y L L A I H L - I V F L F L Y K A R K T Y I F L L Y T Y . . . . . . 11369 ATTATGCAATATACGTACATTTTAATTCGATTCAACTGTATGCAAAGCAAATTATACAAT I M Q Y T Y I L I R F N C M Q S K L Y N L C N I R T F - F D S T V C K A N Y T I N Y A I Y V H F N S I Q L Y A K Q I I Q . . . . . . 11309 TGCAGCGAAATAAGTCAGTGAATTATACAATTTAGGCCATCGAATTATACAATTGTATAT C S E I S Q - I I Q F R P S N Y T I V Y A A K - V S E L Y N L G H R I I Q L Y M L Q R N K S V N Y T I - A I E L Y N C I . . 11249 GTATAGCGAATTATACA V - R I I Y S E L Y C I A N Y T Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0120H21.1-4-_PGL-3_AGS-3_PPS_1 (11616 11275) (frame '0'; 339 bp, 113 residues) 1 SERKLYIHMQ KCISSCYTLN YTIYKHFTSN IAKKKANELY NCELYNCSEI QFSLALYNRS 61 VYIVFLFLYK ARKTYIFLLY TYNYAIYVHF NSIQLYAKQI IQLQRNKSVN YTI- ... finished at: Mon Aug 28 21:58:34 2006 ________________________________________________________________________________ Sequence 5: C06HBa0120H21.1-5, from 1 to 2316, both strands analyzed. ... started at: Mon Aug 28 21:58:34 2006 EST library file: /tmp/cxgn-bacpublish-resources-ZuEZzO/lycopersicum_combined_unigene_seqs; matching gDNA +strand ... ... found all matches, elapsed seconds = 2 ... matches indexed, elapsed seconds = 2 HitsTableSize = 1 EST library file: /tmp/cxgn-bacpublish-resources-ZuEZzO/lycopersicum_combined_unigene_seqs; matching gDNA -strand ... ... found all matches, elapsed seconds = 1 ... matches indexed, elapsed seconds = 2 HitsTableSize = 0 ******************************************************************************** EST sequence 1 +strand 1791 n (File: SGN-U320278+) 1 AAACTCATTT CCCTGTTTGG TTTCATTGAC ACATTTTGTT CAACACCTTG TAAGCAATAA 61 AAATTGACAC CAAACTGATC AAATTTCATC GGATTTAGTA TAACAGCTAT GGCTACTTGC 121 GGAATTGACT GGAAATCTGT TCTGCCAAAC TGTTTTAAGG GCAATAATGT TCGTTCGGAG 181 GCGAAGGTGA TGGAGAACAG TAAACAGATG AATTCTGATC ATCATAGATT AGCTTTTTCT 241 GATATAAGTA CTGATTCTAG ATCAGTACTT ATATCATTGG ATGACCTTTC ATCGAACGCT 301 GTCATTGGTT CAAATCTTCA TGTATTCACA TATGAGGAAC TTAAACTCAT CACTAGTGAT 361 TTCTCCTCAG CTAATTTTCT CGGTAAAGGT GGATTTGGAC CCGTTCACAA GGGGTTTATT 421 GATGACAAGA TTAAGCCTGG TTTGGATGCT CAACCTGTTG CTGTTAAATT GCTTGATTTG 481 GATGGAAATC AGGGCCATCA AGAATGGCTG ACTGAAGTGG TTTTTTTGGG GCAATTGAGG 541 CATCATCATC TAGTGAAGTT GATTGGATAT TGTTGGGAGG AGGAGCAGAG ACTTCTCGTT 601 TACGAGTATA TGGCAAGGGG AAACCTCGAG GATCAACTAT TTTCGAGATA TTCGAGTTGT 661 TTGCCATGGT TGACCAGAAT AAAAATTATG GTTGGTGCTG CAAAGGGACT GGCTTTCCTT 721 CACGGAGAAG AAAAACCAGT AATCTACCGC GATTTTAAGG CTTCTAACAT CCTCTTAGAC 781 TCGGATTACA GAGCCAAACT ATCTGATTTT GGGCTAGCAA AGGATGGACC GGAAGGCGAT 841 GACACACATG TCTCAACTCG TGTGATGGGC ACTCATGGTT ATGCTGCTCC GGAGTACATC 901 ATGACTGGTC ATTTGACAAG CAAGAGCGAT GTGTACAGCT TCGGAGTAGT TTTGTTAGAA 961 CTTATAACAG GACGACGAGC CATGGACAAG AAACGCCCCC TCAAAGAGCG AATCTTGGTG 1021 GATTGGGCAA GACCAATGCT AAGGGATCCA CACAAGCTTG ACAGAATAAT GGACCCACGG 1081 CTTGAAGGTC AGTACTCGAC ACAAGGAGCC AAGAAGGTAG CTGCATTGGC TTATCAATGC 1141 TTAAGCCACC ACCCCAGGTC TAGGCCTACT ATGAGCAACA TAGTGAAGAT CTTGGAACCA 1201 GTCTTGGACA TGAAGGATAT ACCAATGGGC CCATTTGTTT ACGTCGTTCC CTCCTCAAAA 1261 CCTGACAAGG GAACAGAAAT TGGTGAATTG AAGACTAAAG TGAACGATGA AAACAAGGCG 1321 GGTGTAAGGG AAAACGAAGT AGATAATGCA GGGGAAAACA GAGAGGATGG TAATGCTAAG 1381 CAACGGAGAG TCGGACACAG GTATAAACAC AGGCTAAAGA CTGATGCTTC TGTTTACTCA 1441 GATACTCATT TGTATCACAA AACTGTAAAG CATGAAAGAA CAAACAAACT AAATTCTTAT 1501 TGATCAATCG CAACGAATAG AAGAAAGGGA AAAGAGATTT AAAATGTAAA ACATATGTGA 1561 ATTAGATAGT ATAATTAAGT ACTCCAATTG TATTTTTTTT TTTTTTTGTT AAATCAGTTT 1621 ACATTTTAGT CTCAGTTTTG GGAGAAAAAA AAATCCTTTT AGTGTTTTCG TTTTTGTTGT 1681 TGAAAGCAAT GTACATAGAA GATTCTTATG ACTTTTATCA GTTGTAAAAA GTTGTTTGAA 1741 CAAGAAAAAG TTATATATGA AAACGAAAAA GAATCATTCT TCTCTTCTCT T Predicted gene structure (within gDNA segment 268 to 2316): Exon 1 868 1377 ( 510 n); cDNA 1 510 ( 510 n); score: 1.000 Intron 1 1378 1472 ( 95 n); Pd: 0.975 (s: 1.00), Pa: 0.964 (s: 1.00) Exon 2 1473 1608 ( 136 n); cDNA 511 646 ( 136 n); score: 1.000 Intron 2 1609 1705 ( 97 n); Pd: 0.989 (s: 1.00), Pa: 0.923 (s: 1.00) Exon 3 1706 1842 ( 137 n); cDNA 647 783 ( 137 n); score: 1.000 Intron 3 1843 1927 ( 85 n); Pd: 0.996 (s: 1.00), Pa: 0.901 (s: 1.00) Exon 4 1928 2051 ( 124 n); cDNA 784 907 ( 124 n); score: 1.000 Intron 4 2052 2164 ( 113 n); Pd: 0.999 (s: 1.00), Pa: 0.998 (s: 1.00) Exon 5 2165 2316 ( 152 n); cDNA 908 1059 ( 152 n); score: 1.000 MATCH C06HBa0120H21.1-5+ SGN-U320278+ 1.000 1059 0.591 C PGS_C06HBa0120H21.1-5+_SGN-U320278+ (868 1377,1473 1608,1706 1842,1928 2051,2165 2316) Alignment (genomic DNA sequence = upper lines): AAACTCATTT CCCTGTTTGG TTTCATTGAC ACATTTTGTT CAACACCTTG TAAGCAATAA 927 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAACTCATTT CCCTGTTTGG TTTCATTGAC ACATTTTGTT CAACACCTTG TAAGCAATAA 60 AAATTGACAC CAAACTGATC AAATTTCATC GGATTTAGTA TAACAGCTAT GGCTACTTGC 987 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAATTGACAC CAAACTGATC AAATTTCATC GGATTTAGTA TAACAGCTAT GGCTACTTGC 120 GGAATTGACT GGAAATCTGT TCTGCCAAAC TGTTTTAAGG GCAATAATGT TCGTTCGGAG 1047 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGAATTGACT GGAAATCTGT TCTGCCAAAC TGTTTTAAGG GCAATAATGT TCGTTCGGAG 180 GCGAAGGTGA TGGAGAACAG TAAACAGATG AATTCTGATC ATCATAGATT AGCTTTTTCT 1107 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCGAAGGTGA TGGAGAACAG TAAACAGATG AATTCTGATC ATCATAGATT AGCTTTTTCT 240 GATATAAGTA CTGATTCTAG ATCAGTACTT ATATCATTGG ATGACCTTTC ATCGAACGCT 1167 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GATATAAGTA CTGATTCTAG ATCAGTACTT ATATCATTGG ATGACCTTTC ATCGAACGCT 300 GTCATTGGTT CAAATCTTCA TGTATTCACA TATGAGGAAC TTAAACTCAT CACTAGTGAT 1227 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTCATTGGTT CAAATCTTCA TGTATTCACA TATGAGGAAC TTAAACTCAT CACTAGTGAT 360 TTCTCCTCAG CTAATTTTCT CGGTAAAGGT GGATTTGGAC CCGTTCACAA GGGGTTTATT 1287 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCTCCTCAG CTAATTTTCT CGGTAAAGGT GGATTTGGAC CCGTTCACAA GGGGTTTATT 420 GATGACAAGA TTAAGCCTGG TTTGGATGCT CAACCTGTTG CTGTTAAATT GCTTGATTTG 1347 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GATGACAAGA TTAAGCCTGG TTTGGATGCT CAACCTGTTG CTGTTAAATT GCTTGATTTG 480 GATGGAAATC AGGGCCATCA AGAATGGCTG GTGAGCAATT TGTTTTGGCC TGAAAATTGA 1407 |||||||||| |||||||||| |||||||||| GATGGAAATC AGGGCCATCA AGAATGGCTG .......... .......... .......... 510 TTTTTTTTGG TTGTAACTAC TGTTTCTGAT TAGTAGTAAT AGTTTTAGAA TTGTTGAATT 1467 .......... .......... .......... .......... .......... .......... 510 TTCAGACTGA AGTGGTTTTT TTGGGGCAAT TGAGGCATCA TCATCTAGTG AAGTTGATTG 1527 ||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| .....ACTGA AGTGGTTTTT TTGGGGCAAT TGAGGCATCA TCATCTAGTG AAGTTGATTG 565 GATATTGTTG GGAGGAGGAG CAGAGACTTC TCGTTTACGA GTATATGGCA AGGGGAAACC 1587 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GATATTGTTG GGAGGAGGAG CAGAGACTTC TCGTTTACGA GTATATGGCA AGGGGAAACC 625 TCGAGGATCA ACTATTTTCG AGTAAGCATT GATTTTCTTG TATGATCGAT TGTAAAATGA 1647 |||||||||| |||||||||| | TCGAGGATCA ACTATTTTCG A......... .......... .......... .......... 646 ACAAAAACTT TTGTCCCCGT TCTCGGTTTC TGATTTTTCA TGTGATGTAT CAAAACAGGA 1707 || .......... .......... .......... .......... .......... ........GA 648 TATTCGAGTT GTTTGCCATG GTTGACCAGA ATAAAAATTA TGGTTGGTGC TGCAAAGGGA 1767 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TATTCGAGTT GTTTGCCATG GTTGACCAGA ATAAAAATTA TGGTTGGTGC TGCAAAGGGA 708 CTGGCTTTCC TTCACGGAGA AGAAAAACCA GTAATCTACC GCGATTTTAA GGCTTCTAAC 1827 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTGGCTTTCC TTCACGGAGA AGAAAAACCA GTAATCTACC GCGATTTTAA GGCTTCTAAC 768 ATCCTCTTAG ACTCGGTAAG TTATCTACGA AGTATTGTTT TGACATTGAA AAGTTCAACA 1887 |||||||||| ||||| ATCCTCTTAG ACTCG..... .......... .......... .......... .......... 783 CCTTTTTCAG CTAGAAACTC ATAATGCTAT GAATTATCAG GATTACAGAG CCAAACTATC 1947 |||||||||| |||||||||| .......... .......... .......... .......... GATTACAGAG CCAAACTATC 803 TGATTTTGGG CTAGCAAAGG ATGGACCGGA AGGCGATGAC ACACATGTCT CAACTCGTGT 2007 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGATTTTGGG CTAGCAAAGG ATGGACCGGA AGGCGATGAC ACACATGTCT CAACTCGTGT 863 GATGGGCACT CATGGTTATG CTGCTCCGGA GTACATCATG ACTGGTAATT TTTTAGATTA 2067 |||||||||| |||||||||| |||||||||| |||||||||| |||| GATGGGCACT CATGGTTATG CTGCTCCGGA GTACATCATG ACTG...... .......... 907 TCAGTGTACT TTAAGATGTT ATGTGTCTCA TTTATCGCGT TACTTCTTAG TAGTCCCACT 2127 .......... .......... .......... .......... .......... .......... 907 TATTACGAAT ACTTATACTT TCATTTTCTT TAAACAGGTC ATTTGACAAG CAAGAGCGAT 2187 ||| |||||||||| |||||||||| .......... .......... .......... .......GTC ATTTGACAAG CAAGAGCGAT 930 GTGTACAGCT TCGGAGTAGT TTTGTTAGAA CTTATAACAG GACGACGAGC CATGGACAAG 2247 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTGTACAGCT TCGGAGTAGT TTTGTTAGAA CTTATAACAG GACGACGAGC CATGGACAAG 990 AAACGCCCCC TCAAAGAGCG AATCTTGGTG GATTGGGCAA GACCAATGCT AAGGGATCCA 2307 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAACGCCCCC TCAAAGAGCG AATCTTGGTG GATTGGGCAA GACCAATGCT AAGGGATCCA 1050 CACAAGCTT 2316 ||||||||| CACAAGCTT 1059 hqPGS_C06HBa0120H21.1-5+_SGN-U320278+ (868 1377,1473 1608,1706 1842,1928 2051,2165 2316) Total number of EST alignments reported: 1 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 2316: PGL 1 (+ strand): 868 2316 AGS-1 (868 1377,1473 1608,1706 1842,1928 2051,2165 2316) SCR (e 1.000 d 0.975 a 0.964,e 1.000 d 0.989 a 0.923,e 1.000 d 0.996 a 0.901,e 1.000 d 0.999 a 0.998,e 1.000) Exon 1 868 1377 ( 510 n); score: 1.000 Intron 1 1378 1472 ( 95 n); Pd: 0.975 Pa: 0.964 Exon 2 1473 1608 ( 136 n); score: 1.000 Intron 2 1609 1705 ( 97 n); Pd: 0.989 Pa: 0.923 Exon 3 1706 1842 ( 137 n); score: 1.000 Intron 3 1843 1927 ( 85 n); Pd: 0.996 Pa: 0.901 Exon 4 1928 2051 ( 124 n); score: 1.000 Intron 4 2052 2164 ( 113 n); Pd: 0.999 Pa: 0.998 Exon 5 2165 2316 ( 152 n); score: 1.000 PGS (868 1377,1473 1608,1706 1842,1928 2051,2165 2316) SGN-U320278+ 3-phase translation of AGS-1 (+strand): . . . . . . 868 AAACTCATTTCCCTGTTTGGTTTCATTGACACATTTTGTTCAACACCTTGTAAGCAATAA K L I S L F G F I D T F C S T P C K Q - N S F P C L V S L T H F V Q H L V S N K T H F P V W F H - H I L F N T L - A I . . . . . . 928 AAATTGACACCAAACTGATCAAATTTCATCGGATTTAGTATAACAGCTATGGCTACTTGC K L T P N - S N F I G F S I T A M A T C N - H Q T D Q I S S D L V - Q L W L L A K I D T K L I K F H R I - Y N S Y G Y L . . . . . . 988 GGAATTGACTGGAAATCTGTTCTGCCAAACTGTTTTAAGGGCAATAATGTTCGTTCGGAG G I D W K S V L P N C F K G N N V R S E E L T G N L F C Q T V L R A I M F V R R R N - L E I C S A K L F - G Q - C S F G . . . . . . 1048 GCGAAGGTGATGGAGAACAGTAAACAGATGAATTCTGATCATCATAGATTAGCTTTTTCT A K V M E N S K Q M N S D H H R L A F S R R - W R T V N R - I L I I I D - L F L G E G D G E Q - T D E F - S S - I S F F . . . . . . 1108 GATATAAGTACTGATTCTAGATCAGTACTTATATCATTGGATGACCTTTCATCGAACGCT D I S T D S R S V L I S L D D L S S N A I - V L I L D Q Y L Y H W M T F H R T L - Y K Y - F - I S T Y I I G - P F I E R . . . . . . 1168 GTCATTGGTTCAAATCTTCATGTATTCACATATGAGGAACTTAAACTCATCACTAGTGAT V I G S N L H V F T Y E E L K L I T S D S L V Q I F M Y S H M R N L N S S L V I C H W F K S S C I H I - G T - T H H - - . . . . . . 1228 TTCTCCTCAGCTAATTTTCTCGGTAAAGGTGGATTTGGACCCGTTCACAAGGGGTTTATT F S S A N F L G K G G F G P V H K G F I S P Q L I F S V K V D L D P F T R G L L F L L S - F S R - R W I W T R S Q G V Y . . . . . . 1288 GATGACAAGATTAAGCCTGGTTTGGATGCTCAACCTGTTGCTGTTAAATTGCTTGATTTG D D K I K P G L D A Q P V A V K L L D L M T R L S L V W M L N L L L L N C L I W - - Q D - A W F G C S T C C C - I A - F . . . : . . . 1348 GATGGAAATCAGGGCCATCAAGAATGGCTG : ACTGAAGTGGTTTTTTTGGGGCAATTGAGG D G N Q G H Q E W L : T E V V F L G Q L R M E I R A I K N G - : L K W F F W G N - G G W K S G P S R M A : D - S G F F G A I E . . . . . . 1503 CATCATCATCTAGTGAAGTTGATTGGATATTGTTGGGAGGAGGAGCAGAGACTTCTCGTT H H H L V K L I G Y C W E E E Q R L L V I I I - - S - L D I V G R R S R D F S F A S S S S E V D W I L L G G G A E T S R . . . . . : . 1563 TACGAGTATATGGCAAGGGGAAACCTCGAGGATCAACTATTTTCGA : GATATTCGAGTTGT Y E Y M A R G N L E D Q L F S : R Y S S C T S I W Q G E T S R I N Y F R : D I R V V L R V Y G K G K P R G S T I F E : I F E L . . . . . . 1720 TTGCCATGGTTGACCAGAATAAAAATTATGGTTGGTGCTGCAAAGGGACTGGCTTTCCTT L P W L T R I K I M V G A A K G L A F L C H G - P E - K L W L V L Q R D W L S F F A M V D Q N K N Y G W C C K G T G F P . . . . . . 1780 CACGGAGAAGAAAAACCAGTAATCTACCGCGATTTTAAGGCTTCTAACATCCTCTTAGAC H G E E K P V I Y R D F K A S N I L L D T E K K N Q - S T A I L R L L T S S - T S R R R K T S N L P R F - G F - H P L R . : . . . . . 1840 TCG : GATTACAGAGCCAAACTATCTGATTTTGGGCTAGCAAAGGATGGACCGGAAGGCGAT S : D Y R A K L S D F G L A K D G P E G D R : I T E P N Y L I L G - Q R M D R K A M L : G L Q S Q T I - F W A S K G W T G R R . . . . . . 1985 GACACACATGTCTCAACTCGTGTGATGGGCACTCATGGTTATGCTGCTCCGGAGTACATC D T H V S T R V M G T H G Y A A P E Y I T H M S Q L V - W A L M V M L L R S T S - H T C L N S C D G H S W L C C S G V H . : . . . . . 2045 ATGACTG : GTCATTTGACAAGCAAGAGCGATGTGTACAGCTTCGGAGTAGTTTTGTTAGAA M T : G H L T S K S D V Y S F G V V L L E - L : V I - Q A R A M C T A S E - F C - N H D W : S F D K Q E R C V Q L R S S F V R . . . . . . 2218 CTTATAACAGGACGACGAGCCATGGACAAGAAACGCCCCCTCAAAGAGCGAATCTTGGTG L I T G R R A M D K K R P L K E R I L V L - Q D D E P W T R N A P S K S E S W W T Y N R T T S H G Q E T P P Q R A N L G . . . . 2278 GATTGGGCAAGACCAATGCTAAGGGATCCACACAAGCTT D W A R P M L R D P H K L I G Q D Q C - G I H T S G L G K T N A K G S T Q A Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0120H21.1-5+_PGL-1_AGS-1_PPS_1 (946 1377,1473 1608,1706 1842,1928 2051,2165 2316) (frame '1'; 981 bp, 327 residues) 1 SNFIGFSITA MATCGIDWKS VLPNCFKGNN VRSEAKVMEN SKQMNSDHHR LAFSDISTDS 61 RSVLISLDDL SSNAVIGSNL HVFTYEELKL ITSDFSSANF LGKGGFGPVH KGFIDDKIKP 121 GLDAQPVAVK LLDLDGNQGH QEWLTEVVFL GQLRHHHLVK LIGYCWEEEQ RLLVYEYMAR 181 GNLEDQLFSR YSSCLPWLTR IKIMVGAAKG LAFLHGEEKP VIYRDFKASN ILLDSDYRAK 241 LSDFGLAKDG PEGDDTHVST RVMGTHGYAA PEYIMTGHLT SKSDVYSFGV VLLELITGRR 301 AMDKKRPLKE RILVDWARPM LRDPHKL ... finished at: Mon Aug 28 21:58:39 2006 ________________________________________________________________________________ Sequence 6: C06HBa0120H21.1-6, from 1 to 2937, both strands analyzed. ... started at: Mon Aug 28 21:58:39 2006 EST library file: /tmp/cxgn-bacpublish-resources-ZuEZzO/lycopersicum_combined_unigene_seqs; matching gDNA +strand ... ... found all matches, elapsed seconds = 1 ... matches indexed, elapsed seconds = 2 HitsTableSize = 3 EST library file: /tmp/cxgn-bacpublish-resources-ZuEZzO/lycopersicum_combined_unigene_seqs; matching gDNA -strand ... ... found all matches, elapsed seconds = 2 ... matches indexed, elapsed seconds = 2 HitsTableSize = 0 ******************************************************************************** EST sequence 1 +strand 711 n (File: SGN-U312486+) 1 GTAGATCCGA GGAGTTTCAG ACAAAAAACC CTAACCCCCA ATCGCTTCTC TTTCCCTCAG 61 ATCTTTTACT CTTCTTTATC AGTCATGGCT GGCTTGGCAC CGGAAGGTTC TCAATTTGAT 121 GCTCGTCAAT TTGATGCTAA AATGACAGAG CTGCTTGGGA CTGAACAGCA GGAGTTCTTT 181 ACATCATATG ATGAAGTTCA TGACAGTTTC GATGCCATGG GTTTGCAAGA AAATCTTCTT 241 AGGGGCATCT ATGCCTATGG TTTTGAGAAG CCATCTGCTA TCCAGCAAAG GGGTATTGTT 301 CCTTTTTGCA AGGGTCTTGA TGTGATTCAA CAGGCACAAT CTGGTACAGG AAAGACAGCA 361 ACTTTCTGCT CTGGGATTCT CCAGCAGCTT GATTACAGCT TAGTCGAATG TCAGGCTCTG 421 GTTCTGGCTC CAACCCGTGA GCTTGCCCAA CAGATTGAGA AGGTTATGCG AGCACTTGGT 481 GACTATCTTG GTGTGAAGGT TCATGCCTGT GTAGGAGGTA CCAGTGTCCG TGAGGATCAG 541 CGTATCCTTC AAAGTGGTGT TCATGTGGTT GTTGGTACTC CTGGTCGTGT ATTTGATATG 601 TTGCGTAGGC AGTCTCTTCG CCCTGACCAC ATCAAGATGT TTGTTCTGGA TGAAGCTAAT 661 GAAATGCTCT CAAGAGGTTT CAAGGATCAA ATTTATGATA TTTTCCAATT G Predicted gene structure (within gDNA segment 1 to 2937): Exon 1 134 206 ( 73 n); cDNA 80 152 ( 73 n); score: 0.808 Intron 1 207 337 ( 131 n); Pd: 0.996 (s: 0.78), Pa: 0.990 (s: 0.68) Exon 2 338 444 ( 107 n); cDNA 153 259 ( 107 n); score: 0.804 Intron 2 445 785 ( 341 n); Pd: 0.969 (s: 0.90), Pa: 0.520 (s: 0.86) Exon 3 786 1216 ( 431 n); cDNA 260 690 ( 431 n); score: 0.803 Intron 3 1217 1641 ( 425 n); Pd: 0.999 (s: 0.80), Pa: 0.978 (s: 0) Exon 4 1642 1662 ( 21 n); cDNA 691 711 ( 21 n); score: 0.857 MATCH C06HBa0120H21.1-6+ SGN-U312486+ 0.804 632 0.889 C PGS_C06HBa0120H21.1-6+_SGN-U312486+ (134 206,338 444,786 1216,1642 1662) Alignment (genomic DNA sequence = upper lines): CAGTCATGGC ACGCTTGGCA CCAGAAGGAG CTCAATTTGA TGCTCGACAG TTCGATTCTA 193 |||||||||| |||||||| || ||||| |||||||||| |||||| || || ||| ||| CAGTCATGGC TGGCTTGGCA CCGGAAGGTT CTCAATTTGA TGCTCGTCAA TTTGATGCTA 139 AGATGAATGA TTTGTAAGTT ATCATGATAT TCGTTGCACA GTATGATTTT AGTCATTTCT 253 | |||| || | AAATGACAGA GCT....... .......... .......... .......... .......... 152 TTCTTGAATC AATACAAATT TTATATGTTA AATGCTGAAA ATTGTACTCG TCTCTATTAC 313 .......... .......... .......... .......... .......... .......... 152 TTATATTATA TATATATGGT GCAGACTTGC CGCTGAGGGA AAAGATTTCT TTACTTCATA 373 |||| |||| | || |||| |||| ||||| .......... .......... ....GCTTGG GACTGAACAG CAGGAGTTCT TTACATCATA 188 TGACGAGGTG TATGACAGTT TCGATGCTAT GGGTCTGCAA GAAAATCTTC TCAGGGGCAT 433 ||| || || ||||||||| ||||||| || |||| ||||| |||||||||| | |||||||| TGATGAAGTT CATGACAGTT TCGATGCCAT GGGTTTGCAA GAAAATCTTC TTAGGGGCAT 248 TTATGCTTAT GGTACTATGT AAACCTTTTA AGTTATTGTT CTCAAGTCTA GGAAAATTCT 493 ||||| ||| | CTATGCCTAT G......... .......... .......... .......... .......... 259 TCGGTTTTTT GTTTCTTTGC ATGATACATC CAGTCCGTTT CAATTTGCTT GTCTTACTTT 553 .......... .......... .......... .......... .......... .......... 259 CCTTTTTTGC AACTCTTTAA TTTCAACTTT TCACGTGACA TGTTTAAGAT CACAAGATTC 613 .......... .......... .......... .......... .......... .......... 259 AAAAGTATTT TTTACTTTCC TAAACTTTGT GTCATGTCAA AGCCAGACAA ACAAATTGAA 673 .......... .......... .......... .......... .......... .......... 259 ACAGAGGGAG TACCATACAG TTGAGAGTTC TCTGAAAAAA AAAGTTTATT TTTTTTAAAT 733 .......... .......... .......... .......... .......... .......... 259 TGTTGATTTT GTGCCTGATT CCGCCTTTTT TGCTGCATGC AATCTTTTAA AGGTTTTGAG 793 |||||||| .......... .......... .......... .......... .......... ..GTTTTGAG 267 AAACCATCTG CCATTCAGCA GAGGGGTATC GTACCATTTT GCAAGGGACT TGATGTAATT 853 || ||||||| | || ||||| |||||||| || || |||| ||||||| || |||||| ||| AAGCCATCTG CTATCCAGCA AAGGGGTATT GTTCCTTTTT GCAAGGGTCT TGATGTGATT 327 CAGCAAGCTC AATCTGGCAC AGGGAAGACA GCTACTTTTT GTTCTGGAAT TTTGCAGCAA 913 || || || | ||||||| || ||| |||||| || ||||| | | ||||| || | | ||||| CAACAGGCAC AATCTGGTAC AGGAAAGACA GCAACTTTCT GCTCTGGGAT TCTCCAGCAG 387 CTTGATTATG GTTTAATTCA ATGCCAATCA TTGGTGTTGG CACCTACTCG AGAACTTGCA 973 |||||||| | ||| | | ||| || | |||| ||| | || || || || ||||| CTTGATTACA GCTTAGTCGA ATGTCAGGCT CTGGTTCTGG CTCCAACCCG TGAGCTTGCC 447 CAGCAGATTG AGAAGGTTAT GCGAGCACTT GGTGACTATC TTGGGGTTAA GGTCCATGCT 1033 || ||||||| |||||||||| |||||||||| |||||||||| |||| || || ||| ||||| CAACAGATTG AGAAGGTTAT GCGAGCACTT GGTGACTATC TTGGTGTGAA GGTTCATGCC 507 TGTGTAGGTG GCACTAGTGT CAGGGAGGAT CAACGCATTC TCGCAGCTGG TGTTCATGTT 1093 |||||||| | | || ||||| | | |||||| || || || | | | ||| ||||||||| TGTGTAGGAG GTACCAGTGT CCGTGAGGAT CAGCGTATCC TTCAAAGTGG TGTTCATGTG 567 GTTGTTGGCA CCCCTGGACG TGTGTTTGAC ATGTTGCGAA GACAGTCCCT CCGTCCTGAT 1153 |||||||| | | ||||| || ||| ||||| |||||||| | | ||||| || || ||||| GTTGTTGGTA CTCCTGGTCG TGTATTTGAT ATGTTGCGTA GGCAGTCTCT TCGCCCTGAC 627 TGCCTCAGAA TGTTTGTGCT AGACGAGGCT GATGAAATGC TGTCACGGGG TTTTAAGGAT 1213 | ||| | ||||||| || || || ||| ||||||||| | ||| | || ||| |||||| CACATCAAGA TGTTTGTTCT GGATGAAGCT AATGAAATGC TCTCAAGAGG TTTCAAGGAT 687 CAGGTACTTC AACCTCTTCA ATTAGATTTA TAATTACCCC TTAGTAGACG ATTTGAACAA 1273 || CAA....... .......... .......... .......... .......... .......... 690 GTTCAGGTTT GAGGTCCCTC GGTTTGCTAA TCTGGGATTA TCGCTGGAAT TAATTCTTTT 1333 .......... .......... .......... .......... .......... .......... 690 GAAGAAATCA ATATAGAAGA TACAGATTTA GTTTACGTTT AGATGTTCTT GTACTCCGTT 1393 .......... .......... .......... .......... .......... .......... 690 AACCAGTCAT CTTAATCATT GTCGGGAATT AGCTATTGAG ATAAGTTACA CACTAGTACC 1453 .......... .......... .......... .......... .......... .......... 690 CCTCCCATTT ATTTGGTTCC TAAGGACATA ATGTAAATTG ATCATCCCGT TGAATTTGTT 1513 .......... .......... .......... .......... .......... .......... 690 CCTTCATTTG CTGATTGCGA TTAGTAAAAC TAAATACAGA GTTTATGACC TTTCCCTCCG 1573 .......... .......... .......... .......... .......... .......... 690 ATTTGATTTG CTGTCGACTT TGAGAAATTA GTAATTTGGA ACCTTTATCC ATGTGTCTAT 1633 .......... .......... .......... .......... .......... .......... 690 ATTGGCAGAT ATATGATATT TTCCAGATG 1662 || ||||||||| ||||| || ........AT TTATGATATT TTCCAATTG 711 hqPGS_C06HBa0120H21.1-6+_SGN-U312486+ (134 206,338 444,786 1216,1642 1662) ******************************************************************************** EST sequence 3 +strand 852 n (File: SGN-U346056+) 1 TTTTAACTGG AGCTCCCCGC GGTGGCGGCC GCTCTAGAAC TAGTGGATCC CCCGGGCTGC 61 AGGAATTCGG CACGAGGGAA AAAACAAACA AAAAAATTAA GTGTTTAATT TTTGCTAAGC 121 TCATTCTCAC TTTCTTATAC AACGCTATTA CCGTCGACGT TCCTAATTTT TTACTAATCA 181 TCATGGCACG CTTGGCACCA GAAGGAGCTC AATTTGATGC TCGACAGTTC GATTCTAAGA 241 TGAATGATTT ACTTGCCGCT GAGGGAAAAG ATTTCTTTAC TTCATATGAC GAGGTGTATG 301 ACAGTTTCGA TGCTATGGGT CTGCAAGAAA ATCTTCTCAG GGGCATTTAT GCTTATGGTT 361 TTGAGAAACC ATCTGCCATT CAGCAGAGGG GTATCGTACC ATTTTGCAAG GGACTTGATG 421 TAATTCAGCA AGCTCAATCT GGCACAGGGA AGACAGCTAC TTTTTGTTCT GGAATTTTGC 481 AGCAACTTGA TTATGGTTTA ATTCAATGCC AATCATTGGT GTTGGCACCT ACTCGAGAAC 541 TTGCACAGCA GATTGAGAAG GTTATGCGAG CACTTGGTGA CTATCTTGGG GTTAAGGTCC 601 ATGCTTGTGT AGGTGGCACT AGTGTCAGGG NAGGATCACG CATTCTCGCA GCTGGTGGTC 661 ATGTTGTTGT TGGCACCCCT GGACGTGTGT TTGACATGTT GCGAAGACAG TCCCCTCGTC 721 CTTGATGCCT CAGAAATGTT GTGCTAGACG AGGCTGATGA AATGCTGTCA CGGGGTTTTT 781 AGGATCAGAT ATATGATATT TTTCCAGATG CTGCCCTACC AAAGTCAAGT CGGGAGTGTT 841 TTCTGCGACA TG Predicted gene structure (within gDNA segment 1 to 2340): Exon 1 137 206 ( 70 n); cDNA 181 250 ( 70 n); score: 1.000 Intron 1 207 337 ( 131 n); Pd: 0.996 (s: 1.00), Pa: 0.990 (s: 1.00) Exon 2 338 444 ( 107 n); cDNA 251 357 ( 107 n); score: 1.000 Intron 2 445 785 ( 341 n); Pd: 0.969 (s: 1.00), Pa: 0.520 (s: 1.00) Exon 3 786 1216 ( 431 n); cDNA 358 788 ( 431 n); score: 0.969 Intron 3 1217 1641 ( 425 n); Pd: 0.999 (s: 0.98), Pa: 0.978 (s: 0.82) Exon 4 1642 1699 ( 58 n); cDNA 789 848 ( 60 n); score: 0.853 MATCH C06HBa0120H21.1-6+ SGN-U346056+ 0.967 666 0.782 C PGS_C06HBa0120H21.1-6+_SGN-U346056+ (137 206,338 444,786 1216,1642 1699) Alignment (genomic DNA sequence = upper lines): TCATGGCACG CTTGGCACCA GAAGGAGCTC AATTTGATGC TCGACAGTTC GATTCTAAGA 196 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCATGGCACG CTTGGCACCA GAAGGAGCTC AATTTGATGC TCGACAGTTC GATTCTAAGA 240 TGAATGATTT GTAAGTTATC ATGATATTCG TTGCACAGTA TGATTTTAGT CATTTCTTTC 256 |||||||||| TGAATGATTT .......... .......... .......... .......... .......... 250 TTGAATCAAT ACAAATTTTA TATGTTAAAT GCTGAAAATT GTACTCGTCT CTATTACTTA 316 .......... .......... .......... .......... .......... .......... 250 TATTATATAT ATATGGTGCA GACTTGCCGC TGAGGGAAAA GATTTCTTTA CTTCATATGA 376 ||||||||| |||||||||| |||||||||| |||||||||| .......... .......... .ACTTGCCGC TGAGGGAAAA GATTTCTTTA CTTCATATGA 289 CGAGGTGTAT GACAGTTTCG ATGCTATGGG TCTGCAAGAA AATCTTCTCA GGGGCATTTA 436 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CGAGGTGTAT GACAGTTTCG ATGCTATGGG TCTGCAAGAA AATCTTCTCA GGGGCATTTA 349 TGCTTATGGT ACTATGTAAA CCTTTTAAGT TATTGTTCTC AAGTCTAGGA AAATTCTTCG 496 |||||||| TGCTTATG.. .......... .......... .......... .......... .......... 357 GTTTTTTGTT TCTTTGCATG ATACATCCAG TCCGTTTCAA TTTGCTTGTC TTACTTTCCT 556 .......... .......... .......... .......... .......... .......... 357 TTTTTGCAAC TCTTTAATTT CAACTTTTCA CGTGACATGT TTAAGATCAC AAGATTCAAA 616 .......... .......... .......... .......... .......... .......... 357 AGTATTTTTT ACTTTCCTAA ACTTTGTGTC ATGTCAAAGC CAGACAAACA AATTGAAACA 676 .......... .......... .......... .......... .......... .......... 357 GAGGGAGTAC CATACAGTTG AGAGTTCTCT GAAAAAAAAA GTTTATTTTT TTTAAATTGT 736 .......... .......... .......... .......... .......... .......... 357 TGATTTTGTG CCTGATTCCG CCTTTTTTGC TGCATGCAAT CTTTTAAAGG TTTTGAGAAA 796 | |||||||||| .......... .......... .......... .......... .........G TTTTGAGAAA 368 CCATCTGCCA TTCAGCAGAG GGGTATCGTA CCATTTTGCA AGGGACTTGA TGTAATTCAG 856 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCATCTGCCA TTCAGCAGAG GGGTATCGTA CCATTTTGCA AGGGACTTGA TGTAATTCAG 428 CAAGCTCAAT CTGGCACAGG GAAGACAGCT ACTTTTTGTT CTGGAATTTT GCAGCAACTT 916 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAAGCTCAAT CTGGCACAGG GAAGACAGCT ACTTTTTGTT CTGGAATTTT GCAGCAACTT 488 GATTATGGTT TAATTCAATG CCAATCATTG GTGTTGGCAC CTACTCGAGA ACTTGCACAG 976 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GATTATGGTT TAATTCAATG CCAATCATTG GTGTTGGCAC CTACTCGAGA ACTTGCACAG 548 CAGATTGAGA AGGTTATGCG AGCACTTGGT GACTATCTTG GGGTTAAGGT CCATGCTTGT 1036 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAGATTGAGA AGGTTATGCG AGCACTTGGT GACTATCTTG GGGTTAAGGT CCATGCTTGT 608 GTAGGTGGCA CTAGTGTCAG GG-AGGATCA ACGCATTCTC GCAGCTGGTG TTCATGTTGT 1095 |||||||||| |||||||||| || |||||| |||||||||| |||||||||| ||||||||| GTAGGTGGCA CTAGTGTCAG GGNAGGATC- ACGCATTCTC GCAGCTGGTG GTCATGTTGT 667 TGTTGGCACC CCTGGACGTG TGTTTGACAT GTTGCGAAGA CAGTCCCTCC GTCCTGATTG 1155 |||||||||| |||||||||| |||||||||| |||||||||| ||||||| | ||||| || TGTTGGCACC CCTGGACGTG TGTTTGACAT GTTGCGAAGA CAGTCCCCTC GTCCTTGATG 727 CCTCAGAATG TTTGTGCTAG ACGAGGCTGA TGAAATGCTG TCACGGGGTT TTAAGGATCA 1215 |||||||| ||||||||| |||||||||| |||||||||| |||||||||| || ||||||| CCTCAGAAAT GTTGTGCTAG ACGAGGCTGA TGAAATGCTG TCACGGGGTT TTTAGGATCA 787 GGTACTTCAA CCTCTTCAAT TAGATTTATA ATTACCCCTT AGTAGACGAT TTGAACAAGT 1275 | G......... .......... .......... .......... .......... .......... 788 TCAGGTTTGA GGTCCCTCGG TTTGCTAATC TGGGATTATC GCTGGAATTA ATTCTTTTGA 1335 .......... .......... .......... .......... .......... .......... 788 AGAAATCAAT ATAGAAGATA CAGATTTAGT TTACGTTTAG ATGTTCTTGT ACTCCGTTAA 1395 .......... .......... .......... .......... .......... .......... 788 CCAGTCATCT TAATCATTGT CGGGAATTAG CTATTGAGAT AAGTTACACA CTAGTACCCC 1455 .......... .......... .......... .......... .......... .......... 788 TCCCATTTAT TTGGTTCCTA AGGACATAAT GTAAATTGAT CATCCCGTTG AATTTGTTCC 1515 .......... .......... .......... .......... .......... .......... 788 TTCATTTGCT GATTGCGATT AGTAAAACTA AATACAGAGT TTATGACCTT TCCCTCCGAT 1575 .......... .......... .......... .......... .......... .......... 788 TTGATTTGCT GTCGACTTTG AGAAATTAGT AATTTGGAAC CTTTATCCAT GTGTCTATAT 1635 .......... .......... .......... .......... .......... .......... 788 TGGCAGATAT ATGATA-TTT TCCAGATGCT G-CCTACCAA AGTTCAAGTC -GGAGTGTTT 1692 |||| |||||| ||| |||||||||| | |||||||| || ||||||| ||||||||| ......ATAT ATGATATTTT TCCAGATGCT GCCCTACCAA AG-TCAAGTC GGGAGTGTTT 841 TCTGCGA 1699 ||||||| TCTGCGA 848 hqPGS_C06HBa0120H21.1-6+_SGN-U346056+ (137 206,338 444,786 1216,1642 1699) ******************************************************************************** EST sequence 2 +strand 1216 n (File: SGN-U343435+) 1 TGATCACGCG GGCGGCCGCT CAGAACTAGT GGATCCCCCG GGCTGCAGGA ATTCGGCACG 61 AGGCGTAGCC GTGACCACAC GGTTTCAGCT ACACATGGAG ATATGGACCA GAACACTAGA 121 GACATAATCA TGCGTGAGTT TCGCTCTGGT TCTTCCCGTG TCCTTATCAC AACTGATCTG 181 TTGGCTCGTG GTATCGATGT ACAACAAGTA TCCCTTGTTA TCAACTATGA TCTCCCAACT 241 CAGCCGGAGA ATTATCTCCA TCGTATTGGA AGAAGTGGAA GGTTTGGAAG GAAAGGAGTT 301 GCGATCAACT TTGTGACAAC AGACGACGAA AGAATGTTGT TTGATATTCA AAAATTCTAC 361 AATGTGGTCA TCGAAGAACT CCCTTCAAAC GTCGCTGACC TCCTCTGAAA ACTTCATCTT 421 TATAAGGTGA GTACCAAGCT TATAGTATAG AAGAAATCAT TTTAAACTAC CATTATTATC 481 TATCGTCAAA AACGCACCCC TGGCATTAAT GTTGCTAAGA ATTTTGCAGT CAGTATGTAG 541 TAAGGTCTTG GTTTTATTTC ATTTCCAAAT TCTTATACTT CTTTGGCATT AATTTTTAGT 601 GTAGTTAAGT TCTTGATTAA TGTTGTGTTC TTGTTTTAGC ACAGAAATAT TAATGATAAA 661 CTGAGAAATC TTGATAAAAA AAAAAAAAAA AAAAAACTCC CCCCGGGCCA ATTTCGGCAC 721 AAGGGGAAAA AAAGGGTTCC CCTTTTTTGG TTCCCCCCCT AAGGGGCTTT TTTCTTCGGT 781 CTTCCCCAAG CAAAAAACAT TTTTTTCCCT TTCAAAAAAA ACCCCTATTT TTACTTCAAA 841 ACCAATTAAA ACCCCCCTTG GTTGTTGCCA AAAAACTTCT CTACACACAT TGAAAAATCA 901 AAAAAGGCGT TTTTTTTCCC CTAAAAACAC CCAAAAAAAA AAACCCTATT TTGGTTTTGG 961 GAGAATTCTT TACGGTTTTA GAGGTGCCCC CCCCCCAAAG CCAAAAAAAA CCCCCCCCCC 1021 AAAAAAAATT TCAAAAAACT TGTTTCCTCA GAAAAATATT TCAAAAAACC CGGGGCCCCC 1081 GTTATTTTAT AAAATTCTCC CACCCCCAAA TTTTTAAAAA AATTCTGAAC TCGAAACGCT 1141 GAGAAAAACC GAAAAACCCC TTAGAATTAT TGGGCACCAA ATAGCCACAC GCCATATTAA 1201 TCTTACAGAG AAATAG Predicted gene structure (within gDNA segment 676 to 2937): Exon 1 1938 2545 ( 608 n); cDNA 69 676 ( 608 n); score: 1.000 MATCH C06HBa0120H21.1-6+ SGN-U343435+ 1.000 608 0.500 C PGS_C06HBa0120H21.1-6+_SGN-U343435+ (1938 2545) Alignment (genomic DNA sequence = upper lines): CCGTGACCAC ACGGTTTCAG CTACACATGG AGATATGGAC CAGAACACTA GAGACATAAT 1997 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCGTGACCAC ACGGTTTCAG CTACACATGG AGATATGGAC CAGAACACTA GAGACATAAT 128 CATGCGTGAG TTTCGCTCTG GTTCTTCCCG TGTCCTTATC ACAACTGATC TGTTGGCTCG 2057 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CATGCGTGAG TTTCGCTCTG GTTCTTCCCG TGTCCTTATC ACAACTGATC TGTTGGCTCG 188 TGGTATCGAT GTACAACAAG TATCCCTTGT TATCAACTAT GATCTCCCAA CTCAGCCGGA 2117 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGGTATCGAT GTACAACAAG TATCCCTTGT TATCAACTAT GATCTCCCAA CTCAGCCGGA 248 GAATTATCTC CATCGTATTG GAAGAAGTGG AAGGTTTGGA AGGAAAGGAG TTGCGATCAA 2177 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAATTATCTC CATCGTATTG GAAGAAGTGG AAGGTTTGGA AGGAAAGGAG TTGCGATCAA 308 CTTTGTGACA ACAGACGACG AAAGAATGTT GTTTGATATT CAAAAATTCT ACAATGTGGT 2237 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTTGTGACA ACAGACGACG AAAGAATGTT GTTTGATATT CAAAAATTCT ACAATGTGGT 368 CATCGAAGAA CTCCCTTCAA ACGTCGCTGA CCTCCTCTGA AAACTTCATC TTTATAAGGT 2297 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CATCGAAGAA CTCCCTTCAA ACGTCGCTGA CCTCCTCTGA AAACTTCATC TTTATAAGGT 428 GAGTACCAAG CTTATAGTAT AGAAGAAATC ATTTTAAACT ACCATTATTA TCTATCGTCA 2357 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAGTACCAAG CTTATAGTAT AGAAGAAATC ATTTTAAACT ACCATTATTA TCTATCGTCA 488 AAAACGCACC CCTGGCATTA ATGTTGCTAA GAATTTTGCA GTCAGTATGT AGTAAGGTCT 2417 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAACGCACC CCTGGCATTA ATGTTGCTAA GAATTTTGCA GTCAGTATGT AGTAAGGTCT 548 TGGTTTTATT TCATTTCCAA ATTCTTATAC TTCTTTGGCA TTAATTTTTA GTGTAGTTAA 2477 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGGTTTTATT TCATTTCCAA ATTCTTATAC TTCTTTGGCA TTAATTTTTA GTGTAGTTAA 608 GTTCTTGATT AATGTTGTGT TCTTGTTTTA GCACAGAAAT ATTAATGATA AACTGAGAAA 2537 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTTCTTGATT AATGTTGTGT TCTTGTTTTA GCACAGAAAT ATTAATGATA AACTGAGAAA 668 TCTTGATA 2545 |||||||| TCTTGATA 676 hqPGS_C06HBa0120H21.1-6+_SGN-U343435+ (1938 2545) Total number of EST alignments reported: 3 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 2937: PGL 1 (+ strand): 134 2545 AGS-1 (134 206,338 444,786 1216,1642 1699) SCR (e 1.000 d 0.996 a 0.990,e 1.000 d 0.969 a 0.520,e 0.969 d 0.999 a 0.978,e 0.853) Exon 1 134 206 ( 73 n); score: 1.000 Intron 1 207 337 ( 131 n); Pd: 0.996 Pa: 0.990 Exon 2 338 444 ( 107 n); score: 1.000 Intron 2 445 785 ( 341 n); Pd: 0.969 Pa: 0.520 Exon 3 786 1216 ( 431 n); score: 0.969 Intron 3 1217 1641 ( 425 n); Pd: 0.999 Pa: 0.978 Exon 4 1642 1699 ( 58 n); score: 0.853 PGS (134 206,338 444,786 1216,1642 1662) SGN-U312486+ PGS (137 206,338 444,786 1216,1642 1699) SGN-U346056+ 3-phase translation of AGS-1 (+strand): . . . . . . 134 CAGTCATGGCACGCTTGGCACCAGAAGGAGCTCAATTTGATGCTCGACAGTTCGATTCTA Q S W H A W H Q K E L N L M L D S S I L S H G T L G T R R S S I - C S T V R F - V M A R L A P E G A Q F D A R Q F D S . . : . . . . 194 AGATGAATGATTT : ACTTGCCGCTGAGGGAAAAGATTTCTTTACTTCATATGACGAGGTGT R - M I : Y L P L R E K I S L L H M T R C D E - F : T C R - G K R F L Y F I - R G V K M N D L : L A A E G K D F F T S Y D E V . . . . . . : 385 ATGACAGTTTCGATGCTATGGGTCTGCAAGAAAATCTTCTCAGGGGCATTTATGCTTATG : M T V S M L W V C K K I F S G A F M L M : - Q F R C Y G S A R K S S Q G H L C L W : Y D S F D A M G L Q E N L L R G I Y A Y : . . . . . . 786 GTTTTGAGAAACCATCTGCCATTCAGCAGAGGGGTATCGTACCATTTTGCAAGGGACTTG V L R N H L P F S R G V S Y H F A R D L F - E T I C H S A E G Y R T I L Q G T - G F E K P S A I Q Q R G I V P F C K G L . . . . . . 846 ATGTAATTCAGCAAGCTCAATCTGGCACAGGGAAGACAGCTACTTTTTGTTCTGGAATTT M - F S K L N L A Q G R Q L L F V L E F C N S A S S I W H R E D S Y F L F W N F D V I Q Q A Q S G T G K T A T F C S G I . . . . . . 906 TGCAGCAACTTGATTATGGTTTAATTCAATGCCAATCATTGGTGTTGGCACCTACTCGAG C S N L I M V - F N A N H W C W H L L E A A T - L W F N S M P I I G V G T Y S R L Q Q L D Y G L I Q C Q S L V L A P T R . . . . . . 966 AACTTGCACAGCAGATTGAGAAGGTTATGCGAGCACTTGGTGACTATCTTGGGGTTAAGG N L H S R L R R L C E H L V T I L G L R T C T A D - E G Y A S T W - L S W G - G E L A Q Q I E K V M R A L G D Y L G V K . . . . . . 1026 TCCATGCTTGTGTAGGTGGCACTAGTGTCAGGGAGGATCAACGCATTCTCGCAGCTGGTG S M L V - V A L V S G R I N A F S Q L V P C L C R W H - C Q G G S T H S R S W C V H A C V G G T S V R E D Q R I L A A G . . . . . . 1086 TTCATGTTGTTGTTGGCACCCCTGGACGTGTGTTTGACATGTTGCGAAGACAGTCCCTCC F M L L L A P L D V C L T C C E D S P S S C C C W H P W T C V - H V A K T V P P V H V V V G T P G R V F D M L R R Q S L . . . . . . 1146 GTCCTGATTGCCTCAGAATGTTTGTGCTAGACGAGGCTGATGAAATGCTGTCACGGGGTT V L I A S E C L C - T R L M K C C H G V S - L P Q N V C A R R G - - N A V T G F R P D C L R M F V L D E A D E M L S R G . . : . . . . 1206 TTAAGGATCAG : ATATATGATATTTTCCAGATGCTGCCTACCAAAGTTCAAGTCGGAGTGT L R I R : Y M I F S R C C L P K F K S E C - G S : D I - Y F P D A A Y Q S S S R S V F K D Q : I Y D I F Q M L P T K V Q V G V . 1691 TTTCTGCGA F L R F C F S A Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0120H21.1-6+_PGL-1_AGS-1_PPS_1 (136 206,338 444,786 1216,1642 1698) (frame '0'; 666 bp, 222 residues) 1 VMARLAPEGA QFDARQFDSK MNDLLAAEGK DFFTSYDEVY DSFDAMGLQE NLLRGIYAYG 61 FEKPSAIQQR GIVPFCKGLD VIQQAQSGTG KTATFCSGIL QQLDYGLIQC QSLVLAPTRE 121 LAQQIEKVMR ALGDYLGVKV HACVGGTSVR EDQRILAAGV HVVVGTPGRV FDMLRRQSLR 181 PDCLRMFVLD EADEMLSRGF KDQIYDIFQM LPTKVQVGVF SA AGS-2 (1938 2545) SCR (e 1.000) Exon 1 1938 2545 ( 608 n); score: 1.000 PGS (1938 2545) SGN-U343435+ 3-phase translation of AGS-2 (+strand): . . . . . . 1938 CCGTGACCACACGGTTTCAGCTACACATGGAGATATGGACCAGAACACTAGAGACATAAT P - P H G F S Y T W R Y G P E H - R H N R D H T V S A T H G D M D Q N T R D I I V T T R F Q L H M E I W T R T L E T - . . . . . . 1998 CATGCGTGAGTTTCGCTCTGGTTCTTCCCGTGTCCTTATCACAACTGATCTGTTGGCTCG H A - V S L W F F P C P Y H N - S V G S M R E F R S G S S R V L I T T D L L A R S C V S F A L V L P V S L S Q L I C W L . . . . . . 2058 TGGTATCGATGTACAACAAGTATCCCTTGTTATCAACTATGATCTCCCAACTCAGCCGGA W Y R C T T S I P C Y Q L - S P N S A G G I D V Q Q V S L V I N Y D L P T Q P E V V S M Y N K Y P L L S T M I S Q L S R . . . . . . 2118 GAATTATCTCCATCGTATTGGAAGAAGTGGAAGGTTTGGAAGGAAAGGAGTTGCGATCAA E L S P S Y W K K W K V W K E R S C D Q N Y L H R I G R S G R F G R K G V A I N R I I S I V L E E V E G L E G K E L R S . . . . . . 2178 CTTTGTGACAACAGACGACGAAAGAATGTTGTTTGATATTCAAAAATTCTACAATGTGGT L C D N R R R K N V V - Y S K I L Q C G F V T T D D E R M L F D I Q K F Y N V V T L - Q Q T T K E C C L I F K N S T M W . . . . . . 2238 CATCGAAGAACTCCCTTCAAACGTCGCTGACCTCCTCTGAAAACTTCATCTTTATAAGGT H R R T P F K R R - P P L K T S S L - G I E E L P S N V A D L L - K L H L Y K V S S K N S L Q T S L T S S E N F I F I R . . . . . . 2298 GAGTACCAAGCTTATAGTATAGAAGAAATCATTTTAAACTACCATTATTATCTATCGTCA E Y Q A Y S I E E I I L N Y H Y Y L S S S T K L I V - K K S F - T T I I I Y R Q - V P S L - Y R R N H F K L P L L S I V . . . . . . 2358 AAAACGCACCCCTGGCATTAATGTTGCTAAGAATTTTGCAGTCAGTATGTAGTAAGGTCT K T H P W H - C C - E F C S Q Y V V R S K R T P G I N V A K N F A V S M - - G L K N A P L A L M L L R I L Q S V C S K V . . . . . . 2418 TGGTTTTATTTCATTTCCAAATTCTTATACTTCTTTGGCATTAATTTTTAGTGTAGTTAA W F Y F I S K F L Y F F G I N F - C S - G F I S F P N S Y T S L A L I F S V V K L V L F H F Q I L I L L W H - F L V - L . . . . . . 2478 GTTCTTGATTAATGTTGTGTTCTTGTTTTAGCACAGAAATATTAATGATAAACTGAGAAA V L D - C C V L V L A Q K Y - - - T E K F L I N V V F L F - H R N I N D K L R N S S - L M L C S C F S T E I L M I N - E . 2538 TCTTGATA S - L D I L I Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0120H21.1-6+_PGL-1_AGS-2_PPS_1 (1939 2277) (frame '2'; 336 bp, 112 residues) 1 RDHTVSATHG DMDQNTRDII MREFRSGSSR VLITTDLLAR GIDVQQVSLV INYDLPTQPE 61 NYLHRIGRSG RFGRKGVAIN FVTTDDERML FDIQKFYNVV IEELPSNVAD LL- 3-phase translation of AGS-2 (-strand): . . . . . . 2545 TATCAAGATTTCTCAGTTTATCATTAATATTTCTGTGCTAAAACAAGAACACAACATTAA Y Q D F S V Y H - Y F C A K T R T Q H - I K I S Q F I I N I S V L K Q E H N I N S R F L S L S L I F L C - N K N T T L . . . . . . 2485 TCAAGAACTTAACTACACTAAAAATTAATGCCAAAGAAGTATAAGAATTTGGAAATGAAA S R T - L H - K L M P K K Y K N L E M K Q E L N Y T K N - C Q R S I R I W K - N I K N L T T L K I N A K E V - E F G N E . . . . . . 2425 TAAAACCAAGACCTTACTACATACTGACTGCAAAATTCTTAGCAACATTAATGCCAGGGG - N Q D L T T Y - L Q N S - Q H - C Q G K T K T L L H T D C K I L S N I N A R G I K P R P Y Y I L T A K F L A T L M P G . . . . . . 2365 TGCGTTTTTGACGATAGATAATAATGGTAGTTTAAAATGATTTCTTCTATACTATAAGCT C V F D D R - - W - F K M I S S I L - A A F L T I D N N G S L K - F L L Y Y K L V R F - R - I I M V V - N D F F Y T I S . . . . . . 2305 TGGTACTCACCTTATAAAGATGAAGTTTTCAGAGGAGGTCAGCGACGTTTGAAGGGAGTT W Y S P Y K D E V F R G G Q R R L K G V G T H L I K M K F S E E V S D V - R E F L V L T L - R - S F Q R R S A T F E G S . . . . . . 2245 CTTCGATGACCACATTGTAGAATTTTTGAATATCAAACAACATTCTTTCGTCGTCTGTTG L R - P H C R I F E Y Q T T F F R R L L F D D H I V E F L N I K Q H S F V V C C S S M T T L - N F - I S N N I L S S S V . . . . . . 2185 TCACAAAGTTGATCGCAACTCCTTTCCTTCCAAACCTTCCACTTCTTCCAATACGATGGA S Q S - S Q L L S F Q T F H F F Q Y D G H K V D R N S F P S K P S T S S N T M E V T K L I A T P F L P N L P L L P I R W . . . . . . 2125 GATAATTCTCCGGCTGAGTTGGGAGATCATAGTTGATAACAAGGGATACTTGTTGTACAT D N S P A E L G D H S - - Q G I L V V H I I L R L S W E I I V D N K G Y L L Y I R - F S G - V G R S - L I T R D T C C T . . . . . . 2065 CGATACCACGAGCCAACAGATCAGTTGTGATAAGGACACGGGAAGAACCAGAGCGAAACT R Y H E P T D Q L - - G H G K N Q S E T D T T S Q Q I S C D K D T G R T R A K L S I P R A N R S V V I R T R E E P E R N . . . . . . 2005 CACGCATGATTATGTCTCTAGTGTTCTGGTCCATATCTCCATGTGTAGCTGAAACCGTGT H A - L C L - C S G P Y L H V - L K P C T H D Y V S S V L V H I S M C S - N R V S R M I M S L V F W S I S P C V A E T V . 1945 GGTCACGG G H V T W S R Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0120H21.1-6-_PGL-1_AGS-2_PPS_1 (2253 1954) (frame '2'; 297 bp, 99 residues) 1 REFFDDHIVE FLNIKQHSFV VCCHKVDRNS FPSKPSTSSN TMEIILRLSW EIIVDNKGYL 61 LYIDTTSQQI SCDKDTGRTR AKLTHDYVSS VLVHISMCS- ... finished at: Mon Aug 28 21:58:44 2006 ________________________________________________________________________________ Sequence 7: C06HBa0120H21.1-7, from 1 to 8182, both strands analyzed. ... started at: Mon Aug 28 21:58:44 2006 EST library file: /tmp/cxgn-bacpublish-resources-ZuEZzO/lycopersicum_combined_unigene_seqs; matching gDNA +strand ... ... found all matches, elapsed seconds = 2 ... matches indexed, elapsed seconds = 2 HitsTableSize = 0 EST library file: /tmp/cxgn-bacpublish-resources-ZuEZzO/lycopersicum_combined_unigene_seqs; matching gDNA -strand ... ... found all matches, elapsed seconds = 2 ... matches indexed, elapsed seconds = 2 HitsTableSize = 2 ******************************************************************************** EST sequence 1 +strand 871 n (File: SGN-U344890+) 1 TCCACCGCGG TGGCGGCCGC TCTAGAACTA GTGGATCCCC CGGGCTGCAG GAATTCGGCA 61 CGAGGCTCAG GTGGAGCCTG GTGTTCTAAT CACGTTTGTT TCTCTACCTG AAGGTGGAAA 121 CGATCTGAAA CGAATTCGAT TCAGCCGTGA GCTATTTAAC AAGTGGCAAG CTCAACGTTG 181 GTGGGCTGAG AATTATGACA AGGTCATGGA ATTATACAAT GTCCACAGGT TCAATCGACG 241 AACAGTACCG TTGCCAATCC CTCCGAGGTC TGAAGATGAG AACTTGAAGC TTGATTATGC 301 TGAGAGTAGT CCAGTAACAC CTCCACTAAC GAAAGAGCGT CTTCCTAGCC ACTTTCATCG 361 TTCAACAGGA GTGGAACACT TGTCATCAGG CTCTGTTGAA AGAGATCCAT CACAGGGTCA 421 TCATTATTAT GATGCAGGTG GTCTCACTTC GACCCCTAAG CTCTCCAACA TCAGTGCGAC 481 AAAAAGTGAG GCACCGTCAA TGGATGCTTC TGCACGGTCT AGCTCTTCAA GGGAGGCTGA 541 TCGCTCANGA GAGCTGTCTG TTAGCAATGC CAGTGACGTT GAGACTGAAT GGGTTGAAGA 601 GATGAGCCTG GAGTCTATTC ACGATCGAGC GTTGCCTGGT GTACTCGAGA GCTAAAAGAG 661 TTAGTTCATT CGTGAAGAAT TGGAGAATGG CTGCCGGTTG GGTGGGAAGA GAACAAACCG 721 GATACAGAAC GGTTTTGGGA TTAGAATAAT TGGAGTCTAT TCTCCCAATA GTTCTAATGC 781 TCCGGCCGGG GTCCATTGGG TACTATGGTT TTTTTTTTTA ACCAGGGTAA GTTTTTTTTT 841 TTGGGGGAAG GTTAAAACTT TTTGGTTGCC N Predicted gene structure (within gDNA segment 7277 to 1): Exon 1 6037 5958 ( 80 n); cDNA 65 144 ( 80 n); score: 1.000 Intron 1 5957 1857 (4101 n); Pd: 1.000 (s: 1.00), Pa: 0.989 (s: 1.00) Exon 2 1856 1721 ( 136 n); cDNA 145 280 ( 136 n); score: 1.000 Intron 2 1720 1576 ( 145 n); Pd: 0.709 (s: 1.00), Pa: 0.999 (s: 1.00) Exon 3 1575 1181 ( 395 n); cDNA 281 669 ( 389 n); score: 0.977 Intron 3 1180 507 ( 674 n); Pd: 0.999 (s: 0.88), Pa: 0.976 (s: 0.86) Exon 4 506 376 ( 131 n); cDNA 670 781 ( 112 n); score: 0.794 MATCH C06HBa0120H21.1-7- SGN-U344890+ 0.951 742 0.852 C PGS_C06HBa0120H21.1-7-_SGN-U344890+ (6037 5958,1856 1721,1575 1181,506 376) Alignment (genomic DNA sequence = upper lines): GCTCAGGTGG AGCCTGGTGT TCTAATCACG TTTGTTTCTC TACCTGAAGG TGGAAACGAT 5978 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCTCAGGTGG AGCCTGGTGT TCTAATCACG TTTGTTTCTC TACCTGAAGG TGGAAACGAT 124 CTGAAACGAA TTCGATTCAG GTATTCTTTT TCTCTGCTTT GATCTGTTCT TTGAAAGTTG 5918 |||||||||| |||||||||| CTGAAACGAA TTCGATTCAG .......... .......... .......... .......... 144 TTGTTTGTTT GCGAGTAATT AGTATTACTG TTTCAATTTT TCCTGCTTTT TTTTTCTGTG 5858 .......... .......... .......... .......... .......... .......... 144 TGGCGCCGAA TGACCTTTTG ATGGGTCGTT TTCTGGAATA TGATTTATAA TTGTACAGTA 5798 .......... .......... .......... .......... .......... .......... 144 GACTTGTAGT TGGATATATC TGCACTGTGC CAGAAATTAT GGAGTGGACC AAAAGTGGGT 5738 .......... .......... .......... .......... .......... .......... 144 CTTTTATCCA GGCTTAAAAT AGTGGCTTGC TGCCTTAACA AATTGAATCT ATCCATAGTA 5678 .......... .......... .......... .......... .......... .......... 144 CAAAAGAAGT ATAATACAAT ATTCAAGCAT GAGTATTTTT GCACACGTCA AATGGAAGAT 5618 .......... .......... .......... .......... .......... .......... 144 TTCATTAAGA TTATATGTTA GTATTAGATT TGAGAGCAAG TAATTGGTGA ACAACCAGGT 5558 .......... .......... .......... .......... .......... .......... 144 GGAGCGGATA GTCACTCCGG TTTAACCGGT TATATGTACA AGTCTTTAAT AATTTGTTAT 5498 .......... .......... .......... .......... .......... .......... 144 GTAAGATGGG ACACCTAATT ACCTGGCAGT AGAACAATAA ATCTTTCTAA ACAAGATATA 5438 .......... .......... .......... .......... .......... .......... 144 TATCATATTG ACAAGACTGA AACTATCCAA TTATGTTACG TTTAGATACG TCAATTAATA 5378 .......... .......... .......... .......... .......... .......... 144 ACATCATATA TCACTCCCTC TGACCCTATT AACATGAAAG CTGAATCTCA AATGATTTGG 5318 .......... .......... .......... .......... .......... .......... 144 CTTGATATAT ACTTAACTTG GTAAGAATAT GTAGAAATGA TCATTGGAAA TGGTGAAAAT 5258 .......... .......... .......... .......... .......... .......... 144 TGTATTAAAA ATATGATTGG TATGGAGATG TTGTTTAAGC TTTAAAGATT GGAATCTTTG 5198 .......... .......... .......... .......... .......... .......... 144 AATAAATAAA ATAAAATTAT TCGATGAATA ATGTTGTAAT TCAGTAGTGG ATGTAAAGAA 5138 .......... .......... .......... .......... .......... .......... 144 ATTAGAACTT AATGAGCAAA AATATGTCTT GTGTTTACAA TATTTTTGTT TGGAAAATTA 5078 .......... .......... .......... .......... .......... .......... 144 TTGGAAGATT AATTTGTCAA ATAAATTATT TAGATGATCT TTTGGAATGT AAATTGAATG 5018 .......... .......... .......... .......... .......... .......... 144 TTCTTTATTT TGTTACGTTG GTTTAGGGTT TACTATATTA TAACATGAAA AAAATTAACA 4958 .......... .......... .......... .......... .......... .......... 144 AGGAAAAATG AAAGATAAAC AATATAGACT CTGGGTGATG AGTGTAATTA GTAATTTTTT 4898 .......... .......... .......... .......... .......... .......... 144 GACATATAAA GAAAATAAAT CTTTTGCATT TCTTACTTCT ACCCATCAAC AAAAAGAATA 4838 .......... .......... .......... .......... .......... .......... 144 CTCAATGTGG ACTTTTTGTG TCTTCTTTGT AGAACCAATG TCAACGTGAC GAATCAATAC 4778 .......... .......... .......... .......... .......... .......... 144 ATTGAATCTT CCAATAAACC TTTGGACTTC TTAGGCTTTA AATACACTCA ACCCTCTTTC 4718 .......... .......... .......... .......... .......... .......... 144 CTTCTAATAT CACTCGTCCG TTATCTTACC AAGCAAAAAG ATCTTTAGTG TAGTAACATG 4658 .......... .......... .......... .......... .......... .......... 144 ACAATCTCTT CTTCCACAAT TGGAACTCCA TGTGATGTGC GAAGAAGATG GTACCATATG 4598 .......... .......... .......... .......... .......... .......... 144 GATTGGGAGA TCGAACTGTT GAAAGACGAA GTAAGGACCT TAATTATTTT GTGGAAGAAC 4538 .......... .......... .......... .......... .......... .......... 144 TCTGGGAGTT GAGGCATTGT TTCCATACGT ACATTGAAGA GGTTAAGAAT GCGAACAAGG 4478 .......... .......... .......... .......... .......... .......... 144 AAGAGGGTGG TAGCCATGGA TGTGCTACTC TATCAATGTA TGTGTGTATG TTTTTGTTTT 4418 .......... .......... .......... .......... .......... .......... 144 TGGTGTTGTT TCAACTTTAT GAAAACTCTA GATCTCACAG ATATCGAGCA ATTAGCAAAA 4358 .......... .......... .......... .......... .......... .......... 144 CATACATTAA GAAATTTTTA GATCAAATTG TTTTGTAGTC ATATTTTTTT AATATATATC 4298 .......... .......... .......... .......... .......... .......... 144 AAAGTAAATT TTTTTATAGG ATAAATTTAG GGCAAAATTA AGGCCCATAT TATAATTCAA 4238 .......... .......... .......... .......... .......... .......... 144 CTACTTTTTT TGTCAAGAAG TGAGAATGGG CCTACTCCAA AAAAAAATCA GGCTCAATAC 4178 .......... .......... .......... .......... .......... .......... 144 ATGATGCAAT CTGATACAAC AAAATTTTAA ACTAAACATA ACAAAATAAA GAACATTCAA 4118 .......... .......... .......... .......... .......... .......... 144 TTTACATTTT AAAAATCCAT CGAATAAATT TTCAAAAACT TCAGCCAAAA AACTGCTAAT 4058 .......... .......... .......... .......... .......... .......... 144 TACTAAATAT TTTCATGGAT AATTTCACTC TATCTATTCG AAGGTTCCAA TCTGCATTAT 3998 .......... .......... .......... .......... .......... .......... 144 CATACTTGTC ATATTTTCGG TACATGCTTC TACTAACTCC AATGGTCATT CTGGCCAAGT 3938 .......... .......... .......... .......... .......... .......... 144 TGAGTATATA CTTGTCTTTT CTTCTTAAGA ACAGCATAAA CATTGTCCAT TAAATTTATG 3878 .......... .......... .......... .......... .......... .......... 144 AACGAAAGAG AGCGAAAATA GCCGAATTTT GATGATGTAA AAGAAACAAC AATGGAAGAC 3818 .......... .......... .......... .......... .......... .......... 144 AAAGATTGGA AGAAATAGAG TGGATAAATA ATGGAAGAAA ATCATTTGGT ATTTGACAGG 3758 .......... .......... .......... .......... .......... .......... 144 GTTGTTATTG GCTAATGTTT TGGATAGTCA GGTAGCGGTT TGAAATTTAG AGGGCATTTT 3698 .......... .......... .......... .......... .......... .......... 144 GAGTGCAAAA AAATGTGAAT TTACCGATTC ACCCTTGTCA GCCCCTCTAA TTACGAAAAT 3638 .......... .......... .......... .......... .......... .......... 144 AACTCCAAAA AGCACTAGTT GTGACAAATA TAATATTCAC TAGAAATTAA CATCTCTTTC 3578 .......... .......... .......... .......... .......... .......... 144 CGGTCTTTAT AGATATAACT TGCTTCAGTT ATAGGTAAAT CAAGCCTAAT CATTCATGTT 3518 .......... .......... .......... .......... .......... .......... 144 TACTTGTCCT CGAGCTTTGA GGTTCGGCTT TCATGTTAAC TGGGAAGTGT TAGAGAGAGT 3458 .......... .......... .......... .......... .......... .......... 144 GACGATTTTG GTATATGATG TTAATGATTG GCCTATTTAA GCGTAACATA GATGGATAGC 3398 .......... .......... .......... .......... .......... .......... 144 TCCACTCTTG CCAATATCAT CTACATCTAG TTTAGGAGGG TCTTTTGTTT TATCGACAGG 3338 .......... .......... .......... .......... .......... .......... 144 CAGTTGTGTA TCCCCTCTTA CATATCAAAC TATTGAAGAC TTGTGCAAAT AACCGGGTAA 3278 .......... .......... .......... .......... .......... .......... 144 TAATGGAGGG TAGAACGGAC TGGTTGTCTG TAGAACATGT CTACATGGCT GCTGAGCAAT 3218 .......... .......... .......... .......... .......... .......... 144 TACTTGCTTT CAAATCTAAT ACTAACTGAC GGTCTTTAAT GTAATCTTCT ATCCGACGTA 3158 .......... .......... .......... .......... .......... .......... 144 GGTAGAAAAC ACCCATGGAC ATTTTTTCTG CTGACTCATT TTTATTATGC ATATGTGTTC 3098 .......... .......... .......... .......... .......... .......... 144 ACTGTTTAAT GCTACTGTGA AGTAAAATTG AACTAGTGGT TCTTGTGATA ACATGTGACC 3038 .......... .......... .......... .......... .......... .......... 144 TTTTTTTTTA ATCCTCTTAA GTATGCTAAG TTATCACAGC CCTTAGAGAA TTTCAGCTAA 2978 .......... .......... .......... .......... .......... .......... 144 GACCTGTGCC TAATTAATTA TGAGCATGAG TTTTCTTATT GTTGGTCATT TAGTCCATTA 2918 .......... .......... .......... .......... .......... .......... 144 GTAAGTGAAC TTTTGAATTT CTTGAGGAAA ATAGATACTT CTTCCGTGTC AATTTGTTTT 2858 .......... .......... .......... .......... .......... .......... 144 ACCTACTTCT CCTTTTTAGT CAATTTCAAA AGAACTCTTT CCCTTTTTAG TAACTCTTTA 2798 .......... .......... .......... .......... .......... .......... 144 GTTTCATCAT CTTCCCATAA GAACATTTTA GTACTTTTGA GTTTTGACAT ATCTTTAATT 2738 .......... .......... .......... .......... .......... .......... 144 TAAGACCACA AGTTTCAAAA ATCTTTTTTA TTTTCTGAAA CTCTGTGCCA AGTCAAAGTA 2678 .......... .......... .......... .......... .......... .......... 144 GGCCAAACAA ATTGAAATAA ACAGAGTAAC ATATAAGTAA TTGATTCATT TCTGTTTGGA 2618 .......... .......... .......... .......... .......... .......... 144 AATATATGGA TTCATATCTT TATCACGTCA TAACATTAGC AGAAACATTT TTGTTTTTAA 2558 .......... .......... .......... .......... .......... .......... 144 TTTTCATTTT GGCGACCAAT TTTGTTACTA CCTTAAACTA TAACGCGAAA AGTAAAGCGT 2498 .......... .......... .......... .......... .......... .......... 144 CTTTTTTTCC TTTCTAGTTT GCAGTTCAAT TCTTGAGTAT GTCAGTGTGG GCTTTCACCT 2438 .......... .......... .......... .......... .......... .......... 144 TCCCTTCATA GCATGCTTCT ATTTCTGCCT GAAATTATAC AAAGTTGACT AATTAGATTC 2378 .......... .......... .......... .......... .......... .......... 144 TGTTTATTGC ATGAGCTGCA GCTGAAAATT GTGTCTTAAC AAAATTGATT GTGATGTATT 2318 .......... .......... .......... .......... .......... .......... 144 AAAAGCATTG TGATCTTGCT CCTCCTGTCC CAGCACTATT TTCCTAACTC TGCCATTGCT 2258 .......... .......... .......... .......... .......... .......... 144 TCAATGTTAG AGTCAATACT ACTCTGAAAT GCTGTAATCA CTAATTGCCC GTCAAGTCCT 2198 .......... .......... .......... .......... .......... .......... 144 GAGTTTTGGA ATCTTGAATT GGAAGATAAG CTCAGCATGA TATTCTTTGG TTCAAAATGA 2138 .......... .......... .......... .......... .......... .......... 144 AACTTGATTG AAGTTGCAGA CCTCATTGGC TGTCTTTGTT GAGCTGTCAA CTCCCTTATA 2078 .......... .......... .......... .......... .......... .......... 144 TCACTACTAC TGACGACTGT TATTAACTGA AGGAGGATGG CAGTGATGTA ATGCTTTGCC 2018 .......... .......... .......... .......... .......... .......... 144 TTTTTTTATC TGAACTGCAA TAGTTGCAGC TTTCTCATGA GCATATTTTT GTATGCATGG 1958 .......... .......... .......... .......... .......... .......... 144 GTGACTGATC TTTATGATTG AATGCATCTT CCACCTTCCC CACAGGCCAT AATTGGTAAA 1898 .......... .......... .......... .......... .......... .......... 144 CTGGTGAAGA GTTCTAAAAT GTTACTGATA TTTTTTTTCA GCCGTGAGCT ATTTAACAAG 1838 ||||||||| |||||||||| .......... .......... .......... .......... .CCGTGAGCT ATTTAACAAG 163 TGGCAAGCTC AACGTTGGTG GGCTGAGAAT TATGACAAGG TCATGGAATT ATACAATGTC 1778 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGGCAAGCTC AACGTTGGTG GGCTGAGAAT TATGACAAGG TCATGGAATT ATACAATGTC 223 CACAGGTTCA ATCGACGAAC AGTACCGTTG CCAATCCCTC CGAGGTCTGA AGATGAGGTG 1718 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||| CACAGGTTCA ATCGACGAAC AGTACCGTTG CCAATCCCTC CGAGGTCTGA AGATGAG... 280 AGCCTTGTGG TTTAGTGGCA AATGTTGCAA CCAATCCTAT TAAATTGTTT GGAGCAGCAC 1658 .......... .......... .......... .......... .......... .......... 280 TCTTGCCTTC TTGCTCTTTT GTATTGCAAG TATATGTATT CATGTATGTT TCTTGTTCAT 1598 .......... .......... .......... .......... .......... .......... 280 TATATGTACC ATTTTGTTTC AGAACTTGAA GCTTGATTAT GCTGAGAGTA GTCCAGTAAC 1538 |||||||| |||||||||| |||||||||| |||||||||| .......... .......... ..AACTTGAA GCTTGATTAT GCTGAGAGTA GTCCAGTAAC 318 ACCTCCACTA ACGAAAGAGC GTCTTCCTAG CCACTTTCAT CGTTCAACAG GAGTGGAACA 1478 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACCTCCACTA ACGAAAGAGC GTCTTCCTAG CCACTTTCAT CGTTCAACAG GAGTGGAACA 378 CTTGTCATCA GGCTCTGTTG AAAGAGATCC ATCACAGGGT CATCATTATT ATGATGCAGG 1418 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTGTCATCA GGCTCTGTTG AAAGAGATCC ATCACAGGGT CATCATTATT ATGATGCAGG 438 TGGTCTCACT TCGACCCCTA AGCTCTCCAA CATCAGTGCG ACAAAAAGTG AGGCACCGTC 1358 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGGTCTCACT TCGACCCCTA AGCTCTCCAA CATCAGTGCG ACAAAAAGTG AGGCACCGTC 498 AATGGATGCT TCTGCACGGT CTAGCTCTTC AAGGGAGGCT GATCGCTCAG GAGAGCTGTC 1298 |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| |||||||||| AATGGATGCT TCTGCACGGT CTAGCTCTTC AAGGGAGGCT GATCGCTCAN GAGAGCTGTC 558 TGTTAGCAAT GCCAGTGACG TTGAGACTGA ATGGGTTGAA GAAGATGAGC CTGGAGTCTA 1238 |||||||||| |||||||||| |||||||||| |||||||||| | |||||||| |||||||||| TGTTAGCAAT GCCAGTGACG TTGAGACTGA ATGGGTTGAA G-AGATGAGC CTGGAGTCTA 617 TATCACGATT CGAGCGTTGC CTGGTGGTAC TCGAGAGCTT AGAAGAGTTA GGTTCAGGTT 1178 | |||||| | |||||||||| ||||| |||| |||||||| | | |||||||| ||||| T-TCACGA-T CGAGCGTTGC CTGGT-GTAC TCGAGAGC-T AAAAGAGTTA -GTTCAT... 669 AGTAATTCTT TTATGCGTTT ATCAGCTCCT ATTTTATAAC TGGACTATAT CAGAAGAACT 1118 .......... .......... .......... .......... .......... .......... 669 AAGTCTGTCT TTATAGTACC AACTTGCCGT TTGCTGGCTT TGAAAACTTT TACCTTGATT 1058 .......... .......... .......... .......... .......... .......... 669 TCTTCAACAT ACCTCGTCTT CATATTCTCA AACTGATTAA AATGGACACG AGCACAATGA 998 .......... .......... .......... .......... .......... .......... 669 ATAAGTAAAA GATTCCTTTT AAATAGATCT GAATATAATG TGGTCACTGA ATTGATCCAA 938 .......... .......... .......... .......... .......... .......... 669 CAGTCCATTT TCTTCAGCAC CGCGCTCTAC TCTATGTGCA TCAGTTCTTC TCTAGTGAAT 878 .......... .......... .......... .......... .......... .......... 669 GTCAGTGAGG ATAGAGGGTT GTGAATTTTT AGTTACATCC CATCAACTGA GTCATTTTAT 818 .......... .......... .......... .......... .......... .......... 669 CTAATACAAG AGTGAAATTT GTGCATCTTT GTTCCCTGAT TCATAAGGGC AATTTATCCG 758 .......... .......... .......... .......... .......... .......... 669 TCTACTAAAA AATGGTCGAT GGTTGAAATA CTGAGGGATT TGCTTTCATA CTAGTACTAG 698 .......... .......... .......... .......... .......... .......... 669 TATATCTAAG CTTGACATCC ATCTATATAT TTAAATGGGG ATGTTTTCTT GTTCTTTATA 638 .......... .......... .......... .......... .......... .......... 669 TGCGTTAAAC TGTTGAGTCA TTCCTAAGGC TGATCACTGC TCAGAAACAT AATTTTGTAT 578 .......... .......... .......... .......... .......... .......... 669 ATAGTTGTGT CTCGCGATTG AAAGATCAAG CAAGCTCTTT TCTGATGCGG AATGATTTTT 518 .......... .......... .......... .......... .......... .......... 669 TCTTTTTGCA GTCGTGAAAG ATTTGGAGAA ATGCATGCCA GGTTGTGGTG GGAAGAGAAC 458 ||||| ||| | |||||| | ||| |||| ||||| |||| |||||||||| .......... .TCGTG-AAG AATTGGAG-A ATGGCTGCC- GGTTG-GGTG GGAAGAGAAC 714 AGAGCCAGGA TCCAAGAACA GTATTTGTGA TTTAAGACAT AATTTGGAGT TCATATTCAT 398 | | || ||| | | ||||| || |||| || || ||| || || |||||| || ||||| | A-AACC-GGA TAC-AGAACG GT-TTTGGGA -TT-AGA-AT AA-TTGGAG- TC-TATTC-T 763 CCATAATTAG TATCTTAATG CT 376 || || ||| | || ||||| || CC-CAA-TAG T-TC-TAATG CT 781 hqPGS_C06HBa0120H21.1-7-_SGN-U344890+ (6037 5958,1856 1721,1575 1181,506 376) ******************************************************************************** EST sequence 2 +strand 850 n (File: SGN-U346045+) 1 CGTTGGAACT TTCGATCCNN NTANAAACGT CAACTTTGAT GCTACACCGC GGTGGCGGTT 61 GCTCTAGAAC TAGCGGATCC CCCGGGCTGC AGGAATTCGG CACGAGGGTG AGATCCACCT 121 CTCTCTCGTG ATCTCATCTT ATATCTATTT TCCGCCTTTA ATGCCTCGGT TGCGGAGTAG 181 TATTCTACTC TTTTTTTCAC AAAACCCAAT TTCATATTTC TCTCTCTCTT TCTTCTTCTT 241 CGCCGCATTC GAATTCAGCT TTTGTTTTTG TTTCAGGTTG ATCTGAAGAC TAGTATTTGT 301 GTTACGTTGT TTGGTTTTCA GATGTAAATC TGTCTTTATT GCTCAAAAAG ACTGACCCCC 361 TTCTGTTTAT TTTTTCCAGT TTTGGGGTTA AGTAGTGTGT AACAGTAGTC CTCTGTTTCT 421 ACTCTGTTTT TGTTTTGATG GTCTGTAAGT GATGAGTGCT TAGTTTGTCA GTAATAGGGT 481 ATTTCTTCTT AGTGGGTATT TTGCTAGGTT GAAAATTTGA GAATGTTGAC TTGTATAGCA 541 TGTTCGAAGC AGCTTAATAC TGGATCTCTG CGTGAACCAG AGGAAGATGA AACGGCTGCA 601 ACTCCCAGCA AAAAGCAAGC CATTAAAGCC CTTACCGCTC AGATCAAGGA TATGGCAGTT 661 AAAGCTTCAA GAGCGTATAA GAACTGTCAG CCATGTTCAG GGGGTTCAAA CAATAACCAG 721 AACCCTAACT ATGCTGATTC TGAAACTGGG TCTGTGTCTG GAAGATTTCA TTACTTCGAT 781 AAAAGGACTG GGAGGTTCAA ATCAACCACC AAAGGTGTGG GGGTAAGGAA ATGAAAGGAA 841 AGAATGGAAG Predicted gene structure (within gDNA segment 8182 to 5019): Exon 1 6962 6435 ( 528 n); cDNA 116 642 ( 527 n); score: 0.897 Intron 1 6434 6341 ( 94 n); Pd: 0.996 (s: 0.96), Pa: 0.999 (s: 0.96) Exon 2 6340 6136 ( 205 n); cDNA 643 850 ( 208 n); score: 0.900 MATCH C06HBa0120H21.1-7- SGN-U346045+ 0.898 733 0.862 C PGS_C06HBa0120H21.1-7-_SGN-U346045+ (6962 6435,6340 6136) Alignment (genomic DNA sequence = upper lines): CACCTCTCTC TCGTGATCTC ATCTTAATAT CTATTTTCCG CCTTTAACGC CTCGGTTACG 6903 |||||||||| |||||||||| ||||| |||| |||||||||| ||||||| || ||||||| || CACCTCTCTC TCGTGATCTC ATCTT-ATAT CTATTTTCCG CCTTTAATGC CTCGGTTGCG 174 GAGTAGTATT CTACTCTTTT TTTTTTTCAC AAAACCCAAT TTCATATTTC TCTCTCTCTC 6843 |||||||||| |||||| |||||||||| |||||||||| |||||| | | |||||||| GAGTAGTATT CTACTC---- TTTTTTTCAC AAAACCCAAT TTCATA--T- T-TCTCTCTC 226 TATCTTTCTT CTTCTTCGCC GTACTCGAAT TCAGCTTTTG TTTTTTGTTC CAGGTTGATC 6783 |||||||| |||||||||| | | |||||| |||||||||| |||||||| |||||||||| --TCTTTCTT CTTCTTCGCC GCATTCGAAT TCAGCTTTTG -TTTTTGTTT CAGGTTGATC 283 TGAAGACTAG TATTTGTGTT TCGTTGTTTG GTTTTCAGAT GTAAATCTGT CTTTATTGCT 6723 |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| TGAAGACTAG TATTTGTGTT ACGTTGTTTG GTTTTCAGAT GTAAATCTGT CTTTATTGCT 343 CAAAAAGACT GACCCCCTTC TGTTTAGTTT TTCCAGTTTT GGGGTTAAGT AGTGTGTAAC 6663 |||||||||| |||||||||| |||||| ||| |||||||||| |||||||||| |||||||||| CAAAAAGACT GACCCCCTTC TGTTTATTTT TTCCAGTTTT GGGGTTAAGT AGTGTGTAAC 403 ATTAGTCCTC TGTTTCTACT C--------- --TGATGGTC AGTAAGTGAT GAGTGCTTAG 6614 | |||||||| |||||||||| | |||||||| ||||||||| |||||||||| AGTAGTCCTC TGTTTCTACT CTGTTTTTGT TTTGATGGTC TGTAAGTGAT GAGTGCTTAG 463 TTTGTCATTA ATAGGGTATT TCTTCTTAGT GGGTATTTTG CTAGGTTGAA AATTTGAGGA 6554 ||||||| || |||||||||| |||||||||| |||||||||| |||||||||| |||||||| | TTTGTCAGTA ATAGGGTATT TCTTCTTAGT GGGTATTTTG CTAGGTTGAA AATTTGAGAA 523 TGTTAACTTG TATAGCATGT TCGAAGCAGC TTAATACTGG ATCTCTACGT GAACCAGAGG 6494 |||| ||||| |||||||||| |||||||||| |||||||||| |||||| ||| |||||||||| TGTTGACTTG TATAGCATGT TCGAAGCAGC TTAATACTGG ATCTCTGCGT GAACCAGAGG 583 AAGATGAAAC GGCTGCAACA CCCAGCAAAA AGCAAGCCAT TAAAGCCCTT ACTGCTCAGG 6434 |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| || |||||| AAGATGAAAC GGCTGCAACT CCCAGCAAAA AGCAAGCCAT TAAAGCCCTT ACCGCTCAG. 642 TGGTTTTTTT TTTCTCAATG AACTTGAAAA AGCTGTTGTT AAGTGCAAAA GCATGTATAT 6374 .......... .......... .......... .......... .......... .......... 642 TTAGCTGTGG TTTTGATGTT TTGTTTTTTT CAGATCAAGG ATATGGCAGT TAAAGCTTCA 6314 ||||||| |||||||||| |||||||||| .......... .......... .......... ...ATCAAGG ATATGGCAGT TAAAGCTTCA 669 GGAGCGTATA AGAACTGTAA GCCATGTTCA GGGGGTTCAA ACAATAACCA GAACCCTAAC 6254 ||||||||| |||||||| | |||||||||| |||||||||| |||||||||| |||||||||| AGAGCGTATA AGAACTGTCA GCCATGTTCA GGGGGTTCAA ACAATAACCA GAACCCTAAC 729 TATGCTGATT CTGAAACTGG GTCTGTGTCT GAAAGATTTC ATTACTCGTA TAAAAGGACT 6194 |||||||||| |||||||||| |||||||||| | |||||||| |||||| | |||||||||| TATGCTGATT CTGAAACTGG GTCTGTGTCT GGAAGATTTC ATTACTTCGA TAAAAGGACT 789 GGGA-GTTCA AATTCAACAC CAAGGGTGT- GGGGTAAGGA AATGAAA-GA GAGATTGAAA 6137 |||| ||||| ||| | ||| ||| ||||| |||||||||| ||||||| || ||| || || GGGAGGTTCA AATCAACCAC CAAAGGTGTG GGGGTAAGGA AATGAAAGGA AAGAATGGAA 849 G 6136 | G 850 hqPGS_C06HBa0120H21.1-7-_SGN-U346045+ (6962 6435,6340 6136) Total number of EST alignments reported: 2 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 8182: PGL 1 (- strand): 6962 376 AGS-1 (6037 5958,1856 1721,1575 1181,506 376) SCR (e 1.000 d 1.000 a 0.989,e 1.000 d 0.709 a 0.999,e 0.977 d 0.999 a 0.976,e 0.794) Exon 1 6037 5958 ( 80 n); score: 1.000 Intron 1 5957 1857 (4101 n); Pd: 1.000 Pa: 0.989 Exon 2 1856 1721 ( 136 n); score: 1.000 Intron 2 1720 1576 ( 145 n); Pd: 0.709 Pa: 0.999 Exon 3 1575 1181 ( 395 n); score: 0.977 Intron 3 1180 507 ( 674 n); Pd: 0.999 Pa: 0.976 Exon 4 506 376 ( 131 n); score: 0.794 PGS (6037 5958,1856 1721,1575 1181,506 376) SGN-U344890+ 3-phase translation of AGS-1 (-strand): . . . . . . 6037 GCTCAGGTGGAGCCTGGTGTTCTAATCACGTTTGTTTCTCTACCTGAAGGTGGAAACGAT A Q V E P G V L I T F V S L P E G G N D L R W S L V F - S R L F L Y L K V E T I S G G A W C S N H V C F S T - R W K R . . : . . . . 5977 CTGAAACGAATTCGATTCAG : CCGTGAGCTATTTAACAAGTGGCAAGCTCAACGTTGGTGG L K R I R F S : R E L F N K W Q A Q R W W - N E F D S : A V S Y L T S G K L N V G G S E T N S I Q : P - A I - Q V A S S T L V . . . . . . 1816 GCTGAGAATTATGACAAGGTCATGGAATTATACAATGTCCACAGGTTCAATCGACGAACA A E N Y D K V M E L Y N V H R F N R R T L R I M T R S W N Y T M S T G S I D E Q G - E L - Q G H G I I Q C P Q V Q S T N . . . . : . . 1756 GTACCGTTGCCAATCCCTCCGAGGTCTGAAGATGAG : AACTTGAAGCTTGATTATGCTGAG V P L P I P P R S E D E : N L K L D Y A E Y R C Q S L R G L K M R : T - S L I M L R S T V A N P S E V - R - : E L E A - L C - . . . . . . 1551 AGTAGTCCAGTAACACCTCCACTAACGAAAGAGCGTCTTCCTAGCCACTTTCATCGTTCA S S P V T P P L T K E R L P S H F H R S V V Q - H L H - R K S V F L A T F I V Q E - S S N T S T N E R A S S - P L S S F . . . . . . 1491 ACAGGAGTGGAACACTTGTCATCAGGCTCTGTTGAAAGAGATCCATCACAGGGTCATCAT T G V E H L S S G S V E R D P S Q G H H Q E W N T C H Q A L L K E I H H R V I I N R S G T L V I R L C - K R S I T G S S . . . . . . 1431 TATTATGATGCAGGTGGTCTCACTTCGACCCCTAAGCTCTCCAACATCAGTGCGACAAAA Y Y D A G G L T S T P K L S N I S A T K I M M Q V V S L R P L S S P T S V R Q K L L - C R W S H F D P - A L Q H Q C D K . . . . . . 1371 AGTGAGGCACCGTCAATGGATGCTTCTGCACGGTCTAGCTCTTCAAGGGAGGCTGATCGC S E A P S M D A S A R S S S S R E A D R V R H R Q W M L L H G L A L Q G R L I A K - G T V N G C F C T V - L F K G G - S . . . . . . 1311 TCAGGAGAGCTGTCTGTTAGCAATGCCAGTGACGTTGAGACTGAATGGGTTGAAGAAGAT S G E L S V S N A S D V E T E W V E E D Q E S C L L A M P V T L R L N G L K K M L R R A V C - Q C Q - R - D - M G - R R . . . . . . 1251 GAGCCTGGAGTCTATATCACGATTCGAGCGTTGCCTGGTGGTACTCGAGAGCTTAGAAGA E P G V Y I T I R A L P G G T R E L R R S L E S I S R F E R C L V V L E S L E E - A W S L Y H D S S V A W W Y S R A - K . . : . . . . 1191 GTTAGGTTCAG : TCGTGAAAGATTTGGAGAAATGCATGCCAGGTTGTGGTGGGAAGAGAAC V R F S : R E R F G E M H A R L W W E E N L G S : V V K D L E K C M P G C G G K R T S - V Q : S - K I W R N A C Q V V V G R E . . . . . . 457 AGAGCCAGGATCCAAGAACAGTATTTGTGATTTAAGACATAATTTGGAGTTCATATTCAT R A R I Q E Q Y L - F K T - F G V H I H E P G S K N S I C D L R H N L E F I F I Q S Q D P R T V F V I - D I I W S S Y S . . . 397 CCATAATTAGTATCTTAATGCT P - L V S - C H N - Y L N A S I I S I L M Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0120H21.1-7-_PGL-1_AGS-1_PPS_1 (6037 5958,1856 1721,1575 1181,506 428) (frame '1'; 687 bp, 229 residues) 1 AQVEPGVLIT FVSLPEGGND LKRIRFSREL FNKWQAQRWW AENYDKVMEL YNVHRFNRRT 61 VPLPIPPRSE DENLKLDYAE SSPVTPPLTK ERLPSHFHRS TGVEHLSSGS VERDPSQGHH 121 YYDAGGLTST PKLSNISATK SEAPSMDASA RSSSSREADR SGELSVSNAS DVETEWVEED 181 EPGVYITIRA LPGGTRELRR VRFSRERFGE MHARLWWEEN RARIQEQYL- AGS-2 (6962 6435,6340 6136) SCR (e 0.897 d 0.996 a 0.999,e 0.900) Exon 1 6962 6435 ( 528 n); score: 0.897 Intron 1 6434 6341 ( 94 n); Pd: 0.996 Pa: 0.999 Exon 2 6340 6136 ( 205 n); score: 0.900 PGS (6962 6435,6340 6136) SGN-U346045+ 3-phase translation of AGS-2 (-strand): . . . . . . 6962 CACCTCTCTCTCGTGATCTCATCTTAATATCTATTTTCCGCCTTTAACGCCTCGGTTACG H L S L V I S S - Y L F S A F N A S V T T S L S - S H L N I Y F P P L T P R L R P L S R D L I L I S I F R L - R L G Y . . . . . . 6902 GAGTAGTATTCTACTCTTTTTTTTTTTCACAAAACCCAATTTCATATTTCTCTCTCTCTC E - Y S T L F F F H K T Q F H I S L S L S S I L L F F F F T K P N F I F L S L S G V V F Y S F F F S Q N P I S Y F S L S . . . . . . 6842 TATCTTTCTTCTTCTTCGCCGTACTCGAATTCAGCTTTTGTTTTTTGTTCCAGGTTGATC Y L S S S S P Y S N S A F V F C S R L I I F L L L R R T R I Q L L F F V P G - S L S F F F F A V L E F S F C F L F Q V D . . . . . . 6782 TGAAGACTAGTATTTGTGTTTCGTTGTTTGGTTTTCAGATGTAAATCTGTCTTTATTGCT - R L V F V F R C L V F R C K S V F I A E D - Y L C F V V W F S D V N L S L L L L K T S I C V S L F G F Q M - I C L Y C . . . . . . 6722 CAAAAAGACTGACCCCCTTCTGTTTAGTTTTTCCAGTTTTGGGGTTAAGTAGTGTGTAAC Q K D - P P S V - F F Q F W G - V V C N K K T D P L L F S F S S F G V K - C V T S K R L T P F C L V F P V L G L S S V - . . . . . . 6662 ATTAGTCCTCTGTTTCTACTCTGATGGTCAGTAAGTGATGAGTGCTTAGTTTGTCATTAA I S P L F L L - W S V S D E C L V C H - L V L C F Y S D G Q - V M S A - F V I N H - S S V S T L M V S K - - V L S L S L . . . . . . 6602 TAGGGTATTTCTTCTTAGTGGGTATTTTGCTAGGTTGAAAATTTGAGGATGTTAACTTGT - G I S S - W V F C - V E N L R M L T C R V F L L S G Y F A R L K I - G C - L V I G Y F F L V G I L L G - K F E D V N L . . . . . . 6542 ATAGCATGTTCGAAGCAGCTTAATACTGGATCTCTACGTGAACCAGAGGAAGATGAAACG I A C S K Q L N T G S L R E P E E D E T - H V R S S L I L D L Y V N Q R K M K R Y S M F E A A - Y W I S T - T R G R - N . . . . . : . 6482 GCTGCAACACCCAGCAAAAAGCAAGCCATTAAAGCCCTTACTGCTCAG : ATCAAGGATATG A A T P S K K Q A I K A L T A Q : I K D M L Q H P A K S K P L K P L L L R : S R I W G C N T Q Q K A S H - S P Y C S : D Q G Y . . . . . . 6328 GCAGTTAAAGCTTCAGGAGCGTATAAGAACTGTAAGCCATGTTCAGGGGGTTCAAACAAT A V K A S G A Y K N C K P C S G G S N N Q L K L Q E R I R T V S H V Q G V Q T I G S - S F R S V - E L - A M F R G F K Q . . . . . . 6268 AACCAGAACCCTAACTATGCTGATTCTGAAACTGGGTCTGTGTCTGAAAGATTTCATTAC N Q N P N Y A D S E T G S V S E R F H Y T R T L T M L I L K L G L C L K D F I T - P E P - L C - F - N W V C V - K I S L . . . . . . 6208 TCGTATAAAAGGACTGGGAGTTCAAATTCAACACCAAGGGTGTGGGGTAAGGAAATGAAA S Y K R T G S S N S T P R V W G K E M K R I K G L G V Q I Q H Q G C G V R K - K L V - K D W E F K F N T K G V G - G N E . . 6148 GAGAGATTGAAAG E R L K R D - K R E I E Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0120H21.1-7-_PGL-1_AGS-2_PPS_1 (6569 6435,6340 6137) (frame '1'; 339 bp, 113 residues) 1 VENLRMLTCI ACSKQLNTGS LREPEEDETA ATPSKKQAIK ALTAQIKDMA VKASGAYKNC 61 KPCSGGSNNN QNPNYADSET GSVSERFHYS YKRTGSSNST PRVWGKEMKE RLK ... finished at: Mon Aug 28 21:58:50 2006 ________________________________________________________________________________ Sequence 8: C06HBa0120H21.1-8, from 1 to 16877, both strands analyzed. ... started at: Mon Aug 28 21:58:50 2006 EST library file: /tmp/cxgn-bacpublish-resources-ZuEZzO/lycopersicum_combined_unigene_seqs; matching gDNA +strand ... ... found all matches, elapsed seconds = 2 ... matches indexed, elapsed seconds = 3 HitsTableSize = 2 EST library file: /tmp/cxgn-bacpublish-resources-ZuEZzO/lycopersicum_combined_unigene_seqs; matching gDNA -strand ... ... found all matches, elapsed seconds = 2 ... matches indexed, elapsed seconds = 2 HitsTableSize = 5 ******************************************************************************** EST sequence 5 +strand 881 n (File: SGN-U325074+) 1 TCTCAGCAAA ATGTGAGTCC TCCTTCTCCG CCGCCGCCGC CGTCTAACCA ATCTCCTCCT 61 CCACCGCCTC CACCACCATC ACCAGGGCCT CCTCCTCCTC CATCCCAGCA AAAATACCAT 121 TCTCCACCAC CAACTAAATC TGTGAATTCT GCGACCACCT CGGAGAGTAA ACACTCTAAT 181 CATGATAAAA AACACCATAA CTCTTACGGG AAATCGCATC AACCAGCAAA GAAAAAGAAG 241 CCAAATTTGG GGAAGAAACT GGGGTTAGTG TTTGTGGGTG TTGCTGGGAT GTTGCAGGTG 301 TGTGTGGTGG CGTTCTTGCT AATAAAGAGA AGACAATTGT TAAAGGCTGG TAGTAGATTT 361 TGAATGAACA TTTGAATATG GATGTATATC AGTTAGTCTA ATTAATTCAG AATTTTACGA 421 GACCGCGAAA GGCAATGAAC ACGGCATTGA TGAATTAGAA GCATTGGGTT CATCTGACAT 481 GTAGATTTCT GCATTTGCAT TGGTGGCTAC TGTAAATTTG AGCATGTTGA GAAATACAGA 541 TGAATGTGCA GACTTCGGAT GCTTTGTTTC TAGTTGCTCG TCTCAAAGTT GTAAGAATGA 601 TATCTTGCTT CGTTTGTCAA TTTTAAGGAA CTGTACTGGA TTTGTTCTGG AGAACATAGT 661 GTTACGTCTT CTCTGTCCGG TGAGTAAAAA ATGTAGGGAA ACGAGCATAC TGTTGATGAA 721 GGCATGACCT GTCATCGTCG CAACTGAAAT ATGATTTCTA ATGTAGAGGT TTAACACTGT 781 AAAACTCTTT TAACTGTTGG TATAGTCTAA CTGTTGCATC TGATATGAAA ACTTTCTAAT 841 ACGCTGGCAA AATAATATCT ACCTTGATTC TTAAAAAAAA A Predicted gene structure (within gDNA segment 3956 to 887): Exon 1 3356 2837 ( 520 n); cDNA 1 520 ( 520 n); score: 1.000 Intron 1 2836 2444 ( 393 n); Pd: 0.937 (s: 1.00), Pa: 0.984 (s: 1.00) Exon 2 2443 2385 ( 59 n); cDNA 521 579 ( 59 n); score: 1.000 Intron 2 2384 1880 ( 505 n); Pd: 0.000 (s: 1.00), Pa: 0.994 (s: 1.00) Exon 3 1879 1581 ( 299 n); cDNA 580 878 ( 299 n); score: 0.993 MATCH C06HBa0120H21.1-8- SGN-U325074+ 0.998 878 0.997 C PGS_C06HBa0120H21.1-8-_SGN-U325074+ (3356 2837,2443 2385,1879 1581) Alignment (genomic DNA sequence = upper lines): TCTCAGCAAA ATGTGAGTCC TCCTTCTCCG CCGCCGCCGC CGTCTAACCA ATCTCCTCCT 3297 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCTCAGCAAA ATGTGAGTCC TCCTTCTCCG CCGCCGCCGC CGTCTAACCA ATCTCCTCCT 60 CCACCGCCTC CACCACCATC ACCAGGGCCT CCTCCTCCTC CATCCCAGCA AAAATACCAT 3237 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCACCGCCTC CACCACCATC ACCAGGGCCT CCTCCTCCTC CATCCCAGCA AAAATACCAT 120 TCTCCACCAC CAACTAAATC TGTGAATTCT GCGACCACCT CGGAGAGTAA ACACTCTAAT 3177 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCTCCACCAC CAACTAAATC TGTGAATTCT GCGACCACCT CGGAGAGTAA ACACTCTAAT 180 CATGATAAAA AACACCATAA CTCTTACGGG AAATCGCATC AACCAGCAAA GAAAAAGAAG 3117 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CATGATAAAA AACACCATAA CTCTTACGGG AAATCGCATC AACCAGCAAA GAAAAAGAAG 240 CCAAATTTGG GGAAGAAACT GGGGTTAGTG TTTGTGGGTG TTGCTGGGAT GTTGCAGGTG 3057 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCAAATTTGG GGAAGAAACT GGGGTTAGTG TTTGTGGGTG TTGCTGGGAT GTTGCAGGTG 300 TGTGTGGTGG CGTTCTTGCT AATAAAGAGA AGACAATTGT TAAAGGCTGG TAGTAGATTT 2997 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGTGTGGTGG CGTTCTTGCT AATAAAGAGA AGACAATTGT TAAAGGCTGG TAGTAGATTT 360 TGAATGAACA TTTGAATATG GATGTATATC AGTTAGTCTA ATTAATTCAG AATTTTACGA 2937 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGAATGAACA TTTGAATATG GATGTATATC AGTTAGTCTA ATTAATTCAG AATTTTACGA 420 GACCGCGAAA GGCAATGAAC ACGGCATTGA TGAATTAGAA GCATTGGGTT CATCTGACAT 2877 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GACCGCGAAA GGCAATGAAC ACGGCATTGA TGAATTAGAA GCATTGGGTT CATCTGACAT 480 GTAGATTTCT GCATTTGCAT TGGTGGCTAC TGTAAATTTG GTAATTCTTG ATCAATATGC 2817 |||||||||| |||||||||| |||||||||| |||||||||| GTAGATTTCT GCATTTGCAT TGGTGGCTAC TGTAAATTTG .......... .......... 520 ATATTTATCT TACGTTTTTC TCGTTTATTA TGAATGAGTA TCAATGCATC ATGTATTACT 2757 .......... .......... .......... .......... .......... .......... 520 AGTGTGAATG ATTTAGAACT CATTGTGATT TTTATGTGTT CCATTATTTC GAAATCTACT 2697 .......... .......... .......... .......... .......... .......... 520 TTCTCTGTAA TTGTGCTGCT GCTTTAAGTG AGCATTTATT CAACGGATGG ATCGATAAAA 2637 .......... .......... .......... .......... .......... .......... 520 TTAAACATTC CGGTAGTCTG AAGCTGCTAA CAGCTTTTGT CAAAAGTTAT GTATGATGAG 2577 .......... .......... .......... .......... .......... .......... 520 TCCCTTTTTG CTGGAATATT CCGAATATGT TTGTTATTAG GTTTGGTCTG CTCTTTAGCT 2517 .......... .......... .......... .......... .......... .......... 520 GAAATATACT GTACTGCTGG AAAGCTTTAT CCTTTTTCGA TGATAATGAA CTATAAAGAA 2457 .......... .......... .......... .......... .......... .......... 520 TGTACTTTTG CAGAGCATGT TGAGAAATAC AGATGAATGT GCAGACTTCG GATGCTTTGT 2397 ||||||| |||||||||| |||||||||| |||||||||| |||||||||| .......... ...AGCATGT TGAGAAATAC AGATGAATGT GCAGACTTCG GATGCTTTGT 567 TTCTAGTTGC TCGTATGTGT AAGAATCAAT TTGAAGTTCT CGATAAGATC ATTTGTAGGT 2337 |||||||||| || TTCTAGTTGC TC........ .......... .......... .......... .......... 579 CTCAATCAAG CATAACAATT CAATTAAGAA AGAGAAAAGG ATTATAGTGA AAACTCATGG 2277 .......... .......... .......... .......... .......... .......... 579 TGCTATGTGA ATTTTTACTC TCTTTTGCAT TTAACTGATC CTACCATTTT TACTAGTTTA 2217 .......... .......... .......... .......... .......... .......... 579 GAAGAAACTA ACTAATTCAA GTAACGGTTA ATGTAGATGA GGTGAACCTT TGGTGTTTGC 2157 .......... .......... .......... .......... .......... .......... 579 ACCTGGTTGC ACGTACCTTG ACTAATCCAC CGAACATTCC AAAGAATGGG GAGTAAATAG 2097 .......... .......... .......... .......... .......... .......... 579 ATACTCCGAC TTCAATTGAA TGCAGAACTA ATTCTTGATG ATGGGATTTG CTTGGTTTGT 2037 .......... .......... .......... .......... .......... .......... 579 CCTAGAATTT ACTTGCTATT TGTTCGCGGC TGCTAAAAAG TGTTTTGAAG TTCTGTCTAA 1977 .......... .......... .......... .......... .......... .......... 579 CAGCAAAAAT ACCTTAGTTA CTGCTTGTGC TTCCTTGTTG TGTTTCTGTA GGAATAATGC 1917 .......... .......... .......... .......... .......... .......... 579 CAACTCTTCA TGACTCCTGT GTTTTTTGGT ATTGCAGGTC TCAAAGTTGT AAGAATGATA 1857 ||| |||||||||| |||||||||| .......... .......... .......... .......GTC TCAAAGTTGT AAGAATGATA 602 TCTTGCTTCG TTTGTCAATT TTAAGGAACT GTACTGGATT TGTTCTGGAG AACATAGTGT 1797 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCTTGCTTCG TTTGTCAATT TTAAGGAACT GTACTGGATT TGTTCTGGAG AACATAGTGT 662 TACGTCTTCT CTGTCCGGTG AGTAAAAAAT GTAGGGAAAC GAGCATACTG TTGATGAAGG 1737 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TACGTCTTCT CTGTCCGGTG AGTAAAAAAT GTAGGGAAAC GAGCATACTG TTGATGAAGG 722 CATGACCTGT CATCGTCGCA ACTGAAATAT GATTTCTAAT GTAGAGGTTT AACACTGTAA 1677 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CATGACCTGT CATCGTCGCA ACTGAAATAT GATTTCTAAT GTAGAGGTTT AACACTGTAA 782 AACTCTTTTA ACTGTTGGTA TAGTCTAACT GTTGCATCTG ATATGAAAAC TTTCTAATAC 1617 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AACTCTTTTA ACTGTTGGTA TAGTCTAACT GTTGCATCTG ATATGAAAAC TTTCTAATAC 842 GCTGGCAAAA TAATATCTAC CTTGATTCTT GAATAA 1581 |||||||||| |||||||||| |||||||||| || || GCTGGCAAAA TAATATCTAC CTTGATTCTT AAAAAA 878 hqPGS_C06HBa0120H21.1-8-_SGN-U325074+ (3356 2837,2443 2385,1879 1581) ******************************************************************************** EST sequence 7 +strand 845 n (File: SGN-U339975+) 1 AGAGGGAGAN ACGCGCNGAT NNTTAGGTGA CANATCAGAA GACCCACCGG GAGCTCCACC 61 GCGTGTGAGC CGCTAGAGAC CCAGTGGATC CCCCGGGCTG CAGGAATTCG GCACGAGGGT 121 TAACCCAATT TACGACCCGA TTGTATCCCA TATCCGACCC GTATCCCTGC GTAATCACCT 181 CGACACCAAT CTATGATACC CTAGTTCTTG GGGGCATCAT AATCTTCAAC AGTTCCTCTT 241 TGATTCTTAC AAATTTCTGG AATAAATCAA CATTTTGGGA TCGTTTTTTT GGGCCCCATT 301 TGGTTGATTC CTCGAGATTA GAGAGACAAT TGGTGGGTTG CCAAATATTT TGCAGATTGA 361 TGAACCCTAT CTATTTATGC CTGATTGTTA AATTAGGGGA AAATTTTGCT TAAAATTGAA 421 AGAAAAACCC ATATGTTATG AAGAATTTTG GAGAAAGATT TTGAATTAAA AGGGGCTTTT 481 ATTAAAGGTA AAAAGGTCTT GTCATTAATA TAATTGACTC TAAATTAACC TGGAACTGAA 541 GTCCATCCAC AAGCAAAAAG GCCCATTTCT GTGGCAAAAC CAAGGCCCAA AAGTTCCAGT 601 GCAACCAGCC CCAAAATACC ATTCATTCCA GGCCCAAAAC ATTTAATCCG TACAGGCCCC 661 AAACTTCCAT CCAACCAGCC CCCAAAATGT CAGGTTAAAC CAGCCCAGTA TTGCCAGGCC 721 CAAACCCTGT GGCGCCATGG GGGAAAAAAT TTGGTTGCCC AACCCCCTGC AACAAATTCC 781 CAATTGGAGG CCAGTTTTGA ATGGGCCTGA CCTAATAAAG GGTGGATGTG GGCCCACTCA 841 AGCTG Predicted gene structure (within gDNA segment 5635 to 1): Exon 1 3878 3442 ( 437 n); cDNA 105 548 ( 444 n); score: 0.817 Intron 1 3441 3025 ( 417 n); Pd: 0.000 (s: 0.66), Pa: 0.000 (s: 0) Exon 2 3024 3008 ( 17 n); cDNA 549 564 ( 16 n); score: 0.588 Intron 2 3007 2873 ( 135 n); Pd: 0.167 (s: 0), Pa: 0.465 (s: 0) Exon 3 2872 2867 ( 6 n); cDNA 565 570 ( 6 n); score: 1.000 Intron 3 2866 1880 ( 987 n); Pd: 0.000 (s: 0), Pa: 0.994 (s: 0) Exon 4 1879 1869 ( 11 n); cDNA 571 581 ( 11 n); score: 0.545 Intron 4 1868 1366 ( 503 n); Pd: 0.848 (s: 0), Pa: 0.965 (s: 0) Exon 5 1365 1354 ( 12 n); cDNA 582 592 ( 11 n); score: 0.833 MATCH C06HBa0120H21.1-8- SGN-U339975+ 0.817 483 0.572 C PGS_C06HBa0120H21.1-8-_SGN-U339975+ (3878 3442,3024 3008,2872 2867,1879 1869,1365 1354) Alignment (genomic DNA sequence = upper lines): AAACAGGGGT TAAGGTTAAC CCAATTTACG ACCCGATTCT ATCCCATATC CGACCCGTTT 3819 || || | ||||||| |||||||||| |||||||| | |||||||||| |||||||| | AATTCGGCAC GAGGGTTAAC CCAATTTACG ACCCGATTGT ATCCCATATC CGACCCGTAT 164 CCCTGCGTAA TCACCTCGAC ACCAATCTAT GATGCCCTAG TTCTTGGGGG CATCATCATC 3759 |||||||||| |||||||||| |||||||||| ||| |||||| |||||||||| |||||| ||| CCCTGCGTAA TCACCTCGAC ACCAATCTAT GATACCCTAG TTCTTGGGGG CATCATAATC 224 TTCAACAGTT CCTCTTTGAT TCTTACAAAT TTCTGGAAGA AATCAACATT TTGGGATCGT 3699 |||||||||| |||||||||| |||||||||| |||||||| | |||||||||| |||||||||| TTCAACAGTT CCTCTTTGAT TCTTACAAAT TTCTGGAATA AATCAACATT TTGGGATCGT 284 TTTTTGGGGA CGCATTTGGT TGATTTCTCG AGATTAGAGA GACAATTGGT GGG-TGCCAT 3640 ||||| ||| | |||||||| ||||| |||| |||||||||| |||||||||| ||| ||||| TTTTTTGGGC CCCATTTGGT TGATTCCTCG AGATTAGAGA GACAATTGGT GGGTTGCCAA 344 TTATTTTGCA GATTGATGAA CCCTAGCTAT TTATGCCTGA TTG-TAAATT TGGTGAAGAT 3581 ||||||||| |||||||||| ||||| |||| |||||||||| ||| |||||| || ||| || ATATTTTGCA GATTGATGAA CCCTATCTAT TTATGCCTGA TTGTTAAATT AGGGGAAAAT 404 TTTGCTT--A ATTGAAGGAA AAACCCAGAT GTCA-GGAGG A-TTTGGTGA GTTGATTTGG 3525 ||||||| | |||||| ||| ||||||| || || | | || | ||||| || |||| TTTGCTTAAA ATTGAAAGAA AAACCCATAT GTTATGAAGA ATTTTGGAGA AAGATTTTGA 464 A-TGAAAGGG G-TTCAAATG AAGGTAAATG GGTTTTGAAG ATTCATATAG TTGACACCAA 3467 | | |||||| | || | | |||||||| ||| ||| ||| ||||| ||||| | || ATTAAAAGGG GCTTTTATTA AAGGTAAAAA GGTCTTG-TC ATTAATATAA TTGACTCTAA 523 ATTAGCTTTG TATTGAGGTT TAACGATTGA TTTGCTCATG TATGGATCCT TCATTGTTAA 3407 |||| | | | | ||| || | | ATTAACCTGG AACTGAAGTC CATCC..... .......... .......... .......... 548 TCTTAACCTT TTTGGGTGTT TGCTTTTGCT TCCTTGCTCT TGCCGTGCAA TCTCAGCAAA 3347 .......... .......... .......... .......... .......... .......... 548 ATGTGAGTCC TCCTTCTCCG CCGCCGCCGC CGTCTAACCA ATCTCCTCCT CCACCGCCTC 3287 .......... .......... .......... .......... .......... .......... 548 CACCACCATC ACCAGGGCCT CCTCCTCCTC CATCCCAGCA AAAATACCAT TCTCCACCAC 3227 .......... .......... .......... .......... .......... .......... 548 CAACTAAATC TGTGAATTCT GCGACCACCT CGGAGAGTAA ACACTCTAAT CATGATAAAA 3167 .......... .......... .......... .......... .......... .......... 548 AACACCATAA CTCTTACGGG AAATCGCATC AACCAGCAAA GAAAAAGAAG CCAAATTTGG 3107 .......... .......... .......... .......... .......... .......... 548 GGAAGAAACT GGGGTTAGTG TTTGTGGGTG TTGCTGGGAT GTTGCAGGTG TGTGTGGTGG 3047 .......... .......... .......... .......... .......... .......... 548 CGTTCTTGCT AATAAAGAGA AGACAATTGT TAAAGGCTGG TAGTAGATTT TGAATGAACA 2987 |||| |||||| .......... .......... ..ACAA-GCA AAAAGGCCC. .......... .......... 564 TTTGAATATG GATGTATATC AGTTAGTCTA ATTAATTCAG AATTTTACGA GACCGCGAAA 2927 .......... .......... .......... .......... .......... .......... 564 GGCAATGAAC ACGGCATTGA TGAATTAGAA GCATTGGGTT CATCTGACAT GTAGATTTCT 2867 |||||| .......... .......... .......... .......... .......... ....ATTTCT 570 GCATTTGCAT TGGTGGCTAC TGTAAATTTG GTAATTCTTG ATCAATATGC ATATTTATCT 2807 .......... .......... .......... .......... .......... .......... 570 TACGTTTTTC TCGTTTATTA TGAATGAGTA TCAATGCATC ATGTATTACT AGTGTGAATG 2747 .......... .......... .......... .......... .......... .......... 570 ATTTAGAACT CATTGTGATT TTTATGTGTT CCATTATTTC GAAATCTACT TTCTCTGTAA 2687 .......... .......... .......... .......... .......... .......... 570 TTGTGCTGCT GCTTTAAGTG AGCATTTATT CAACGGATGG ATCGATAAAA TTAAACATTC 2627 .......... .......... .......... .......... .......... .......... 570 CGGTAGTCTG AAGCTGCTAA CAGCTTTTGT CAAAAGTTAT GTATGATGAG TCCCTTTTTG 2567 .......... .......... .......... .......... .......... .......... 570 CTGGAATATT CCGAATATGT TTGTTATTAG GTTTGGTCTG CTCTTTAGCT GAAATATACT 2507 .......... .......... .......... .......... .......... .......... 570 GTACTGCTGG AAAGCTTTAT CCTTTTTCGA TGATAATGAA CTATAAAGAA TGTACTTTTG 2447 .......... .......... .......... .......... .......... .......... 570 CAGAGCATGT TGAGAAATAC AGATGAATGT GCAGACTTCG GATGCTTTGT TTCTAGTTGC 2387 .......... .......... .......... .......... .......... .......... 570 TCGTATGTGT AAGAATCAAT TTGAAGTTCT CGATAAGATC ATTTGTAGGT CTCAATCAAG 2327 .......... .......... .......... .......... .......... .......... 570 CATAACAATT CAATTAAGAA AGAGAAAAGG ATTATAGTGA AAACTCATGG TGCTATGTGA 2267 .......... .......... .......... .......... .......... .......... 570 ATTTTTACTC TCTTTTGCAT TTAACTGATC CTACCATTTT TACTAGTTTA GAAGAAACTA 2207 .......... .......... .......... .......... .......... .......... 570 ACTAATTCAA GTAACGGTTA ATGTAGATGA GGTGAACCTT TGGTGTTTGC ACCTGGTTGC 2147 .......... .......... .......... .......... .......... .......... 570 ACGTACCTTG ACTAATCCAC CGAACATTCC AAAGAATGGG GAGTAAATAG ATACTCCGAC 2087 .......... .......... .......... .......... .......... .......... 570 TTCAATTGAA TGCAGAACTA ATTCTTGATG ATGGGATTTG CTTGGTTTGT CCTAGAATTT 2027 .......... .......... .......... .......... .......... .......... 570 ACTTGCTATT TGTTCGCGGC TGCTAAAAAG TGTTTTGAAG TTCTGTCTAA CAGCAAAAAT 1967 .......... .......... .......... .......... .......... .......... 570 ACCTTAGTTA CTGCTTGTGC TTCCTTGTTG TGTTTCTGTA GGAATAATGC CAACTCTTCA 1907 .......... .......... .......... .......... .......... .......... 570 TGACTCCTGT GTTTTTTGGT ATTGCAGGTC TCAAAGTTGT AAGAATGATA TCTTGCTTCG 1847 || |||| .......... .......... .......GTG GCAAAACC.. .......... .......... 581 TTTGTCAATT TTAAGGAACT GTACTGGATT TGTTCTGGAG AACATAGTGT TACGTCTTCT 1787 .......... .......... .......... .......... .......... .......... 581 CTGTCCGGTG AGTAAAAAAT GTAGGGAAAC GAGCATACTG TTGATGAAGG CATGACCTGT 1727 .......... .......... .......... .......... .......... .......... 581 CATCGTCGCA ACTGAAATAT GATTTCTAAT GTAGAGGTTT AACACTGTAA AACTCTTTTA 1667 .......... .......... .......... .......... .......... .......... 581 ACTGTTGGTA TAGTCTAACT GTTGCATCTG ATATGAAAAC TTTCTAATAC GCTGGCAAAA 1607 .......... .......... .......... .......... .......... .......... 581 TAATATCTAC CTTGATTCTT GAATAATGTT GCTTTATTGC GTTATAATAT CTATCTACAT 1547 .......... .......... .......... .......... .......... .......... 581 AGTAAAAACA TATTTCTTAT GTTGAACGTG CAAAAACATG AGTTGCGAAG TTGAGTAACC 1487 .......... .......... .......... .......... .......... .......... 581 TGAGATTTCA GGTTCAAAAC TCAGCGGAGA CAAAAAAATA CTAGGTGATT CTTCCCATTT 1427 .......... .......... .......... .......... .......... .......... 581 GTTCTTGCCT TTGTGGACAC AGTTACCTGG TACTTGATAT CTGTTGTTGT GGAATTAGTA 1367 .......... .......... .......... .......... .......... .......... 581 GAAGTGCGCA AAA 1354 ||| || || ||| .AAG-GCCCA AAA 592 hqPGS_C06HBa0120H21.1-8-_SGN-U339975+ (3878 3442) ******************************************************************************** EST sequence 1 -strand 890 n (File: SGN-U335137-) 1 GTAAAACTAT GTAGNATGAC CATTCTTTTC TTCGATACCA AAAATTAAAT TCCATATAGA 61 CATAAAAAAT GTTTTAAATT TTTTTCTTAC ACTANGGGAA TGNAAGAAAA AAAACAAGAT 121 TAATNAACTC AAATAATTAT AATAAATAAG TCAAAAAAAT AATTTATGTA TTAAAAAAAT 181 TTGAAATATA CCTTGAACTT TGAAAAAAGA ATCATATATG CCCCTAAATA TATTTTTTTT 241 TAAAATTAAA GTAAAATTAT AAATTTAAAA GTAATTTTTT CACTTTCGTT AAATGAAGGG 301 TATATATGAG CTCATTTTGT AACGGCAGAG GTATATGTGA ACCATTTGTA TAACGGTAAG 361 GGTATATATG AGCCACTTTC ATAACGAGGG GTATATCAGT TTCAAATGAC AAAGTTGAGG 421 GGTATATCAT ACCCTTTTCC CATAATATTA TTCATTTTTG GGTTGACGGG TCAAACCTTG 481 GGCTGCTTAG GACTTGATTA GACCGCTATT TTATTGACTC TTTAATTAAT GGGCAACTTT 541 CACATATAAC AAACAAAAAA TTCATATTTG TATGCTATAA CAAAGTTTGC ATAATTGCGC 601 TCCATAGCAA ACATAAAATT GTATAATTCG CTGACCTAAA TTGTATAATT CGCTGGCCTA 661 TTTCGCTGCA ATTGTATAAT TCGCTATCCT ATTTAACTAC AATTGTATAA TTCGCTGCCT 721 ATTTCGCTGC AATATTATTA TAAAATTTGC TTTGCATATA ATTGAACCGA ATTAAAATGT 781 ATGTATATTG CATAATTATA AGTGTATAGC AATAAGATAT ATGTTTTTCC CTGCAGCCCG 841 GGGGATCCAC TAGTTCTAGA GCGGCCGCCA CCGCGGGGAG CTCCAGCTCT Predicted gene structure (within gDNA segment 10773 to 1988): Exon 1 10144 10110 ( 35 n); cDNA 436 470 ( 35 n); score: 0.543 Intron 1 10109 7694 (2416 n); Pd: 0.453 (s: 0), Pa: 0.952 (s: 0) Exon 2 7693 7680 ( 14 n); cDNA 471 484 ( 14 n); score: 0.571 Intron 2 7679 7185 ( 495 n); Pd: 0.000 (s: 0), Pa: 0.861 (s: 0) Exon 3 7184 7146 ( 39 n); cDNA 485 521 ( 37 n); score: 0.615 Intron 3 7145 4666 (2480 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.88) Exon 4 4665 4575 ( 91 n); cDNA 522 612 ( 91 n); score: 0.890 Intron 4 4574 4545 ( 30 n); Pd: 0.000 (s: 0.92), Pa: 0.000 (s: 0.72) ?? Exon 5 4544 4472 ( 73 n); cDNA 613 689 ( 77 n); score: 0.767 Intron 5 4471 4334 ( 138 n); Pd: 0.000 (s: 0.90), Pa: 0.000 (s: 0.84) Exon 6 4333 4275 ( 59 n); cDNA 690 746 ( 57 n); score: 0.831 Intron 6 4274 3534 ( 741 n); Pd: 0.000 (s: 0.82), Pa: 0.000 (s: 0) Exon 7 3533 3504 ( 30 n); cDNA 747 776 ( 30 n); score: 0.600 Intron 7 3503 3060 ( 444 n); Pd: 0.890 (s: 0), Pa: 0.993 (s: 0) Exon 8 3059 3052 ( 8 n); cDNA 777 784 ( 8 n); score: 0.750 MATCH C06HBa0120H21.1-8- SGN-U335137- 0.834 349 0.392 C PGS_C06HBa0120H21.1-8-_SGN-U335137- (10144 10110,7693 7680,7184 7146,4665 4575,4544 4472,4333 4275,3533 3504,3059 3052) Alignment (genomic DNA sequence = upper lines): TTGCGTATAA ACTTGTTAAA TTAAATGTCT AAGCGGTAGT CAATGTCGCG TTCTCTACTC 10085 || | |||| || || | || || | | | TTTCCCATAA TATTATTCAT TTTTGGGTTG ACGGG..... .......... .......... 470 CGTTTTTCGT AATTTGTTTT TTTCTGCTCA CTTTAGTGTG TGTGAAACCT AGGTTAGCCT 10025 .......... .......... .......... .......... .......... .......... 470 TTGAACCAAA ATGGGAAATA ACATTAAAAT TCTGAATACT TCTTCACAAC TTAAGTGGTT 9965 .......... .......... .......... .......... .......... .......... 470 GAAGCAAATG CCTACCATTT CTTCTCCGAA TTTCCCCTCT TACCCTCTTC AACAGCAGAG 9905 .......... .......... .......... .......... .......... .......... 470 GCAGCATCGA ATGCTTATAC CGAAATAATT AGCTTCACTT CAACACTTGT GTCAAAAATC 9845 .......... .......... .......... .......... .......... .......... 470 AATGCTTTTG AACCCAGAAG AGTTATCGCA AACTGCACAG CCGGCACAAC AGCAATTGGG 9785 .......... .......... .......... .......... .......... .......... 470 GTATCCGCAG ATGCATCAGA ATCAGCAGCT CCAACAGCAA CAACAACAAC CTCAGCAGGT 9725 .......... .......... .......... .......... .......... .......... 470 TTTGCATCAG CAGCAATCTT CTCCAGCAAT GAATTCCCCT GGTGGTCATA ATTTACTGAG 9665 .......... .......... .......... .......... .......... .......... 470 TTTGACTGGA TCAGAACCAG ATGCCACTGG ATCTGGGACA ACGACTCCTG GGAGTAGTTC 9605 .......... .......... .......... .......... .......... .......... 470 AAGCCAGGGG GCTGAAGCAA GCAATCAGTT TCTTGGGAAG AGAAAGATTC AGGATTTAGT 9545 .......... .......... .......... .......... .......... .......... 470 TTCACAGGTT TTCCCTTAAC CTTCTTAATC TCCAAAGCTT TCTGTTTTCC TTGGTGGTTA 9485 .......... .......... .......... .......... .......... .......... 470 CTCAAGCCAA CGTTGGAATC AATTAATTTT TATTTATTTT CACAATCAAT TAAAAAAAAT 9425 .......... .......... .......... .......... .......... .......... 470 TGGTCCACAA AGTATGTGAA AAGGCTGTTT TCGGCAGACC CTCGTTCAAG GGAAAAAAAT 9365 .......... .......... .......... .......... .......... .......... 470 AGCATACTTG TAAAAACAGT TAATTTCTAT ACATAAGTTT TGTGGTTATT AACACTGCAA 9305 .......... .......... .......... .......... .......... .......... 470 CTTTTTAACA TCTTTCCACA TATTATGATA ATGTTCCAGG TGGATCCTCA GGGAAGAGTT 9245 .......... .......... .......... .......... .......... .......... 470 GATCCTGAAG TTGAACAGTT TCTTTTAGAG ATCGCTGATG ACTTTATTGA TTCGGTGAGG 9185 .......... .......... .......... .......... .......... .......... 470 CATAGAAATG ATTTTTTCTT TGTTATCGTT ATCTTTTTTG TTAGGATTGC GTCAGGAGGT 9125 .......... .......... .......... .......... .......... .......... 470 TGGGAGACTG CGGGTTTCAT GTTTTGCATG CTGCCTTTCT CATTCATTCA CGGTTCAAGT 9065 .......... .......... .......... .......... .......... .......... 470 TCTAAGATTG TTAGCTTTGA TAACTGTGGT TGAATACATT GACATAGCCA GTAAAGAGGA 9005 .......... .......... .......... .......... .......... .......... 470 TTTTTTCTTG AAGGGTCATC GTATGCTGAT TCTGGACTCT TCCTTTCACA CTTACCCTTC 8945 .......... .......... .......... .......... .......... .......... 470 ACTCAAATGT AAACTAAAAA TTTTAAAAAT AATAATAAAA GACCCCCCTA TTTTCCCATT 8885 .......... .......... .......... .......... .......... .......... 470 TTTGAGTTGT AATGCCAAAA AAAATATCAA AGGCATGACA AGGAACCTTT TTTAAGAAGA 8825 .......... .......... .......... .......... .......... .......... 470 GTTTCCTTTT GAAATACTTT CTTGAGAAAC ATTGATTGTT TGTCATTTTC TTTGCTAAGT 8765 .......... .......... .......... .......... .......... .......... 470 TTCATGTTCC ATTCCTCTTT GTAGGTTACT ACATTTTCTT GCAATTTGGC GAAGCATCGG 8705 .......... .......... .......... .......... .......... .......... 470 AAATCTTCGA CTCTGGAGTC CAAAGATATA CTGTTACATT TAGGTTTGTG CCAGCAACTG 8645 .......... .......... .......... .......... .......... .......... 470 CAAAATATGC AGAGTTGTTT TACTTTGTTT TGTTTTTACA GAAAAAACAA GCTATTTATC 8585 .......... .......... .......... .......... .......... .......... 470 ATTATTTTAG AATATAAAAG GAGTCATTTC ATACTTCATC AAAGATGATA GAATTATAGA 8525 .......... .......... .......... .......... .......... .......... 470 AATAAAAGGT CCTAGTTGTT GCATATATTT TCATAATAAG TTCATTTTTA TTGCTTTCCG 8465 .......... .......... .......... .......... .......... .......... 470 TTAGTGAGTG TGTTTTTTAC ATGACTATAT GGTGAGATCC TCTGCATTTA GCCTATACAC 8405 .......... .......... .......... .......... .......... .......... 470 AGGTTTTCTA TCTGGAATGT AATTAGAGTT ACAAAATTTG ACACATACTC CTATTGACTC 8345 .......... .......... .......... .......... .......... .......... 470 AGACTGGGAC CAAACTTCGT TTTCATGTCA GTACAGTGTA CCAACAACCA TCAGTCTCCA 8285 .......... .......... .......... .......... .......... .......... 470 CAGTTTAATG TCTTTAGCTC TTTGAGCTTT TTAGCTACTT ATGCGTTCAG CCAACTAGTA 8225 .......... .......... .......... .......... .......... .......... 470 AAATTTGTAT TCCACCACCT TATAATGTAT GCTTCAAGAG TTGCTGGTGA GTATATGATG 8165 .......... .......... .......... .......... .......... .......... 470 AGAACAGTTT GAATTGCTAA GCTTTCTCTT TAGTTGTCAG GAGTGGCTTT GAAGTGAGGT 8105 .......... .......... .......... .......... .......... .......... 470 GTGACCTGAG TCCTGACTTA GTTAAATTGT TGAGTATCTG AATAGTCCTA GTTGCATCTG 8045 .......... .......... .......... .......... .......... .......... 470 AGAGAGTGAT TGAGTCACAC TGTCATTGTA TACAGTATAT ATAATGGTTT ACCTTTTCGT 7985 .......... .......... .......... .......... .......... .......... 470 GAATAGATTT TGACTTCCTC ATTTGTTTAG AACATGCACA ATGCAACTCT ATCTTATTCC 7925 .......... .......... .......... .......... .......... .......... 470 AACACATCAA TACCTGGTAA AATTTTGTTA GGCAACAGCT TTCTGCTTTG CCTAATGCAG 7865 .......... .......... .......... .......... .......... .......... 470 TTGATTTGCT ATTTTTGGCG GAAAGGCTGC TTCATGATCT CATCAATGCA GTTGTTCCTG 7805 .......... .......... .......... .......... .......... .......... 470 GGAATGTTTA CATTCGGTCT ATCACTGCTT AAGATGTTTT ATAACAGAAT TTGATTTAGA 7745 .......... .......... .......... .......... .......... .......... 470 AAGAACAAAA AATATGGTAT TAGTCTAAGT TAACTATGAT GTTCTTTACA GAGAAAGATT 7685 ||| || .......... .......... .......... .......... .......... .TCAAACCTT 479 GGAATTTGAC TGTCCCAGGT TTTTCAAGTG AGGATAAGAA ACACTGCCCT GAACATGTAA 7625 || | GGGCT..... .......... .......... .......... .......... .......... 484 GCTCTTCTTC CTCTCTAACT TTCATGGGTG GTATACTGGT CTATCTTATA TCTACAAATA 7565 .......... .......... .......... .......... .......... .......... 484 ATTGAGGAGT TCTGTAAGTT GTAACCAATA GTTAATTATG ATCCATGGCC TCTGAATACT 7505 .......... .......... .......... .......... .......... .......... 484 GGAAGTACTC GATAACATAT TTCTTGAAAA AGGAAAAGAA AGGCTCAATA ACTTAGCTTT 7445 .......... .......... .......... .......... .......... .......... 484 TTGCCCAAGT CTTGTTTATT ATAAATTTAA GTTTCTGATT ACAGTGCACT CATCGCGTTA 7385 .......... .......... .......... .......... .......... .......... 484 TAAGTACCTC TTGATCTCTT TCTGGTCAAT GATTGTTGTA ACTTTAAGCT GAGTGGAAAT 7325 .......... .......... .......... .......... .......... .......... 484 TCAGGCTGTT TTATTGACAA TATGTTTCTA AACTTTGTGC AGTCATCAGG TGATCTCTGC 7265 .......... .......... .......... .......... .......... .......... 484 AAAGAGCGTT TGGAAATGGT GAGTTGAGAG TTGTTTTGTG TTAATACTAC CTGCATTTAT 7205 .......... .......... .......... .......... .......... .......... 484 TTTAGGTGTA TGATTCATAG TCTTTGTGCT TGACTCAGAT ACCTGATATG ATGGAGGCTT 7145 ||| | || ||| | ||| || || | || || || .......... .......... GCTTAGGACT TGA-TTAGAC CGCT-ATTTT ATTGACTCT. 521 CACCACAAGC TGAAGCAAGT ACAAGCAGCA GCGCGAAGGA GATCGTAAGT CCAGGGCTGG 7085 .......... .......... .......... .......... .......... .......... 521 GTGACCAGGT TGGTTCGACT GACATAATCG GACCACCAAG TTCAGAGGAA TTGGCTTCAC 7025 .......... .......... .......... .......... .......... .......... 521 CATCTAATGG TGAAATATAG TTCAACTATA ACAAGATGTG ATGTTCCGTG GAGATTGGTA 6965 .......... .......... .......... .......... .......... .......... 521 TACCCCTTGC TTTACTGTAA TAAAACCTTT TTCAACTTTA GTTTGCTTGT CGACTGAAGA 6905 .......... .......... .......... .......... .......... .......... 521 ACGTGAACAT AAGTCTGTTA TGAATATTCA TGACCTTTTG TTGGTATGTT ATGTTAAGTT 6845 .......... .......... .......... .......... .......... .......... 521 TTCTTTATTA TTTACTATGT CCTCTTTACT TCCTGTATTG TTCTGTTAGC TGCTCCGGAA 6785 .......... .......... .......... .......... .......... .......... 521 TCTAAGACTG GGCACGGGAC TGAACCGAGA TGGTTTGATC GGGATGTTGA CCGGTATCGG 6725 .......... .......... .......... .......... .......... .......... 521 GATGAATCGA ACCGAAATTA TCGGGACGAA AACTCGGTTC CGTCTTGTCC CACTATATAC 6665 .......... .......... .......... .......... .......... .......... 521 CGGGATAGAA TTGGATTGGA CCGGACGATA CATTTAGCTA TTTTAAACAA TAATTTTTTT 6605 .......... .......... .......... .......... .......... .......... 521 TTGAATTACG AAAATATGAA TGTTTTTTTT TTTTTTTTTT TAAAGTTTAA TAGTTTATAA 6545 .......... .......... .......... .......... .......... .......... 521 AGTATTTAAG TTTTGAAATT TATAATGTTT ATTTGTAATT TTATTTTAAA GATATTTTTC 6485 .......... .......... .......... .......... .......... .......... 521 TAAAAATTTT ACTTAAAAGT ATGTGTATAT TCTACCCTTC CTAGATCCTG TTTTGGGATC 6425 .......... .......... .......... .......... .......... .......... 521 ATAATGGGTT TTTTGTTGTG TTATTTGCGT GATTGTTTTA CATTATACCA TTTTTGTCAT 6365 .......... .......... .......... .......... .......... .......... 521 GCCTTTTGCC ACCTCCTCAA ATCATGCGTC TAATCTCCCG TTTGGTCATA AATTTTTCAC 6305 .......... .......... .......... .......... .......... .......... 521 TTCCCTTTTT TTCAAAAATA TTTTAAATAC ACTGTTTCTG AAAATGAATA TTTTTTCAAG 6245 .......... .......... .......... .......... .......... .......... 521 TTTCAAAAAT TAGTTTACGA GTAGTTTTTC AAGCTTTTGA AGTCCTAGAC TCGCAAAACT 6185 .......... .......... .......... .......... .......... .......... 521 TCAACATAAA ATGCATATCC AACCATAACT TCATTCTCAA AAAATCATTT TTCATTTCAT 6125 .......... .......... .......... .......... .......... .......... 521 CTTGAAAACC CCATATTTTT GCATTACAAG AACATTTTAA GCAGCCTTAT TTAGGGCTTA 6065 .......... .......... .......... .......... .......... .......... 521 CCTTCCACCT TATGCTTATG ACTGGGCCGA TCTATCAAGC CCAGTCTTCT CCGGCCCAAC 6005 .......... .......... .......... .......... .......... .......... 521 TCTTCGCGTC TGTGTTGAGC CAATCATTTA TCAGGTTAAC TAAAAATCTA TTAGAATTAA 5945 .......... .......... .......... .......... .......... .......... 521 CTTAAAAGTT AATTAAATCT GTTTTTTAAT CGTTTTTAAC TTTTAGGAGT ATTCGTCAAA 5885 .......... .......... .......... .......... .......... .......... 521 TTTAATAATA AGATGTGTGT GTATATATAT ATATTGGATG GGAATTTAAT ATTGGAGTTT 5825 .......... .......... .......... .......... .......... .......... 521 GGAATCCCAT TTGCAAGCCA TGTTCACTTC ATGGCTTTAC ATTTCTTTGG ATTTATAGAC 5765 .......... .......... .......... .......... .......... .......... 521 TATATAATAT AATACCATTA GTATATGATT TTTTCCCCAA AAGTCTTAAT ATTAGCCTGT 5705 .......... .......... .......... .......... .......... .......... 521 CAGTGAGATG GTCGGGCTTC CTCGTAAAAG ACATTTTTTC TTCATATAAA AATATAATCC 5645 .......... .......... .......... .......... .......... .......... 521 CATAAAAAAA AACAAATGAG AAAAATATAT TTTCTTCTTC TTTCAACAGA CAAACTAAAT 5585 .......... .......... .......... .......... .......... .......... 521 TATATTACAT GAAGATTCAA TGTTACAAGA ATTTAACATG TTATCATTCA CAATAAACTT 5525 .......... .......... .......... .......... .......... .......... 521 TCCAAATTTT GAGTTGCCTT GAAATATCAT TAAAAATTAT GATGGAATGA ATATAATCTC 5465 .......... .......... .......... .......... .......... .......... 521 TTTGCTTTAA ATTAATTTTG AGCGTCGGCT AAAAAATATA TAGTCATCTA ATATATCTCA 5405 .......... .......... .......... .......... .......... .......... 521 CACAATACAT ATTTAAATTA ATTATTATAC AAATATCAAA TATAGAATAA GAGAGCGAAA 5345 .......... .......... .......... .......... .......... .......... 521 AATCAAATAA AAAGTAAAAA TACAACTTGA ATATCCTAAG TTTATAGGAT AAAATGCGAC 5285 .......... .......... .......... .......... .......... .......... 521 TTGTTAGCCT AATGCATAAG ACAACAATTC TATAGATTTT ACTCCCTTAT TTGGTTAAGT 5225 .......... .......... .......... .......... .......... .......... 521 TATCTCTAAC TAATAAGTAT TCTCTTTTAA CTCAAATAAC TTAGATTTTT TTTACTATGG 5165 .......... .......... .......... .......... .......... .......... 521 ATCTAAAAAT GAAGTATTTT TATGTTTAGT AATAGTTCAA TTCTAAAATG TTTATTTTAT 5105 .......... .......... .......... .......... .......... .......... 521 CTTTAATAGA ATGATTAATA GTTACACAAA TATTACTAAC TATTTTTAGA TCACAAAATT 5045 .......... .......... .......... .......... .......... .......... 521 TTAAATATTT TTTTTAAAAA AATTCATATC AAATCAAACT AGATCGAAAA AAAGTTAATT 4985 .......... .......... .......... .......... .......... .......... 521 TTCTTACATT ATGGATTATG AAACTTAAGT GAGTACCAAA AATCAAATCC AAAAACACAA 4925 .......... .......... .......... .......... .......... .......... 521 ACATGTAAAT GACTCTTACT TCTTCTTTTC TCTACGATGG TTTTTTACTT CATGTTTGTT 4865 .......... .......... .......... .......... .......... .......... 521 AATCATATTA AAATCTAATA TAAATACAAG TGATAATCTC ATTCAATACA AAAAATTTGT 4805 .......... .......... .......... .......... .......... .......... 521 ACTTTTTCTA GCTTTAGCAC AATTTCTTAC TTTATTATTT CGTTGTCAAT TATAAAAGCG 4745 .......... .......... .......... .......... .......... .......... 521 ATGCTTTTAA CTAAATTACT CTTATTAGCT TAAACTCGAG ACTTTCGATA ATGTGTACAA 4685 .......... .......... .......... .......... .......... .......... 521 AATGTGAAAT TATGTTGCAT TTTTTATTGG GTAACTTTCA CATATAGCAA GCAAAAAATT 4625 | | ||| ||| | |||||||| |||||| ||| ||||||||| .......... .........T TAATTAATGG GCAACTTTCA CATATAACAA ACAAAAAATT 562 CATATTTGTA TGCTATAGCA AACTTTGCAT AATTGCGCTT CATAACAAAC ATAAAACTGT 4565 |||||||||| ||||||| || || ||||||| ||||||||| |||| ||||| CATATTTGTA TGCTATAACA AAGTTTGCAT AATTGCGCTC CATAGCAAAC .......... 612 ATAATTTGCT ATATATATAC ----AATTAT ATAATTCGCT GGCCTAAATT GTATACTTCG 4509 |||| | |||||||||| | |||||||| ||||| |||| .......... .......... ATAAAATTGT ATAATTCGCT GACCTAAATT GTATAATTCG 652 CTGGCCTATT TCGCTGCAAT TGTATAATTT GTTTTGCATA CAGTTGAATC GAATTAAAAT 4449 |||||||||| |||||||||| ||||||||| | | | | CTGGCCTATT TCGCTGCAAT TGTATAATTC GCTATCC... .......... .......... 689 GTACGTATAT TGCATAATTA TAAGTGTATA GCAAGAAGAT ATATGTTTTT CTCGCTTTAT 4389 .......... .......... .......... .......... .......... .......... 689 ATAAAAACAG AAACACAATA TATACACTTC TGTTGTATAA AGCTAGAGAA AAGTGTATTT 4329 ||||| .......... .......... .......... .......... .......... .....TATTT 694 CACTGCAATT GTATAATTCG TTGGCCTTTT TCTCTGCAAT ATTTGAAGTA AAATGTTTGT 4269 ||| ||||| |||||||||| | |||| || || ||||||| ||| | || |||| AACTACAATT GTATAATTCG CT-GCCTATT TCGCTGCAAT ATTATTA-TA AAAT...... 746 AAATTGTATA CTTAAGTGTA TAACACGAAG ATATACATTT TTGCATGTGT ATATACAATT 4209 .......... .......... .......... .......... .......... .......... 746 TTCTCTCACT TTATACAAAA CAGAAATAGA ATTATGCACT TCTGTGCATA AAGCGAGAGA 4149 .......... .......... .......... .......... .......... .......... 746 GGCGAGCGAG AATGGAGAGT GGCGAGCGAG ATTTTTGAGA GAGAGACACT GACAAATGGA 4089 .......... .......... .......... .......... .......... .......... 746 GCACAATTAA ATCAAACCCT AGCTAGTCCA TTTAATTTAG GTTATTAATT TGCTATTATA 4029 .......... .......... .......... .......... .......... .......... 746 TACGATTTTC CCTTTTTTTA AGGTTTATTA GGTCATGGAA TTAATTTACC AAATTTGACC 3969 .......... .......... .......... .......... .......... .......... 746 CACCATTGAA CGGTTTCATT TCTCTCTCTA ACAGCCACTT CTCTCTCTTA ACTTCATCGT 3909 .......... .......... .......... .......... .......... .......... 746 CCCCCCATTT CTGTTTCCTC CATTTCTCAG AAACAGGGGT TAAGGTTAAC CCAATTTACG 3849 .......... .......... .......... .......... .......... .......... 746 ACCCGATTCT ATCCCATATC CGACCCGTTT CCCTGCGTAA TCACCTCGAC ACCAATCTAT 3789 .......... .......... .......... .......... .......... .......... 746 GATGCCCTAG TTCTTGGGGG CATCATCATC TTCAACAGTT CCTCTTTGAT TCTTACAAAT 3729 .......... .......... .......... .......... .......... .......... 746 TTCTGGAAGA AATCAACATT TTGGGATCGT TTTTTGGGGA CGCATTTGGT TGATTTCTCG 3669 .......... .......... .......... .......... .......... .......... 746 AGATTAGAGA GACAATTGGT GGGTGCCATT TATTTTGCAG ATTGATGAAC CCTAGCTATT 3609 .......... .......... .......... .......... .......... .......... 746 TATGCCTGAT TGTAAATTTG GTGAAGATTT TGCTTAATTG AAGGAAAAAC CCAGATGTCA 3549 .......... .......... .......... .......... .......... .......... 746 GGAGGATTTG GTGAGTTGAT TTGGATGAAA GGGGTTCAAA TGAAGGTAAA TGGGTTTTGA 3489 ||| | ||| || || | | || | || .......... .....TTGCT TTGCATATAA TTGAACCGAA TTAAA..... .......... 776 AGATTCATAT AGTTGACACC AAATTAGCTT TGTATTGAGG TTTAACGATT GATTTGCTCA 3429 .......... .......... .......... .......... .......... .......... 776 TGTATGGATC CTTCATTGTT AATCTTAACC TTTTTGGGTG TTTGCTTTTG CTTCCTTGCT 3369 .......... .......... .......... .......... .......... .......... 776 CTTGCCGTGC AATCTCAGCA AAATGTGAGT CCTCCTTCTC CGCCGCCGCC GCCGTCTAAC 3309 .......... .......... .......... .......... .......... .......... 776 CAATCTCCTC CTCCACCGCC TCCACCACCA TCACCAGGGC CTCCTCCTCC TCCATCCCAG 3249 .......... .......... .......... .......... .......... .......... 776 CAAAAATACC ATTCTCCACC ACCAACTAAA TCTGTGAATT CTGCGACCAC CTCGGAGAGT 3189 .......... .......... .......... .......... .......... .......... 776 AAACACTCTA ATCATGATAA AAAACACCAT AACTCTTACG GGAAATCGCA TCAACCAGCA 3129 .......... .......... .......... .......... .......... .......... 776 AAGAAAAAGA AGCCAAATTT GGGGAAGAAA CTGGGGTTAG TGTTTGTGGG TGTTGCTGGG 3069 .......... .......... .......... .......... .......... .......... 776 ATGTTGCAGG TGTGTGT 3052 ||| ||| .........A TGTATGT 784 hqPGS_C06HBa0120H21.1-8-_SGN-U335137- (4665 4575,4544 4472,4333 4275) ******************************************************************************** EST sequence 6 +strand 2089 n (File: SGN-U315404+) 1 ATTTTTGATT ATATATAGAA AAAAATATGG GTTATTTACA AGTTTTTATT ACATCATTTT 61 TGTCAATTTC TTGTGTTAGT ATTTTCATGC CAAACTTGGT AGATGCTCAA CTCAAAACCA 121 ATTTTTATGC CCAAACTTGT CCTAATGTTG AATCCATTGT GCGTAATGTT GTTAACCAGA 181 AATTCAAACA AACGTTTGTC ACAATTCCGG CTGTTCTTCG TCTTTTCTTC CATGATTGCT 241 TTGTTGAGGG TTGTGATGCA TCGGTGATAA TAGCATCAAC GGCAGGGAAC ACAGCAGAAA 301 AAGATCACCC AGATAATCTT TCATTGGCTG GAGATGGATT TGACACAGTT ATCAAAGCAA 361 AAGCCGCAGT TGATGCGATC CCAAGTTGTA AAAATAAAGT TTCTTGTGCT GATATTCTTG 421 CCTTAGCCAC TCGAGATGTT ATTCAACTAT CCGGTGGACC GGGGTATGCA GTGGAATTGG 481 GGAGATTAGA TGGTTTGACA TCAAAATCTA CAAATGTAGG GGGAAAGTTG CCTAAACCTA 541 CTTTCAATTT GGATCAACTC AATACAATGT TTGCCTCTCA TGGTTTAAAT CAGGCTGATA 601 TGATTGCCTT ATCTGCGGCC CATACTGTTG GATTTTCTCA CTGCGACCAA TTCTCGAACC 661 GAATTTTCAA CTTTAACCCT AAAAACCCAG TGGATCCAAG TCTCAACAAG ACGTATGCAG 721 CCCAATTAGA ACAAATGTGT CCGAAAAATG TGGACCCAAG AATAGCCATC AACATGGACC 781 CAATAACACC TAGGGCTTTT GACAATGTGT ATTTCCAAAA CTTGCAAAAC GGAATGGGCC 841 TATTCACATC AGATCAAGTT TTGTTCACAG ACCAAAGGTC TAAAGGTACT GTTAATTTGT 901 GGGCCAGTAA TTTTAAAGTT TTTGAAACCG CATTTGTTAA TGCGATGACT AAATTGGGCC 961 GAGTTGGTGT GAAGACGGGT AAGAATGGAA ATATTCGTAT AGACTGTGGA GCATTTAATT 1021 AAGGGCAACT TTCACGTATA GCAAATAAAA AATTCATATT TGTATGTTAT AACAAAGTTT 1081 GCATAATTGC GCTCCATAGC AAACATAAAA CTGTATAATT CGCTATACAT ATACAATTGT 1141 ATAATTCGCT GGCCTAAATT GTATAATTTG ATGGCCTATT TCGCTGCAAT TGTATAATTC 1201 GCTATCTTCA TTCACTGCGA TTGTGTAATG CACAATCGTA TAAATCGCTG CTTATTTCGC 1261 TGCAATATTT TTATAAATTT GCTTTGCATA CCGTTGAATC GAATTAAAAT GTATGTATAT 1321 TGCATAATTA TAAGTGTATG GCAAGAATAT ATATGTTTTT GTCTCGCTTT ATACAAAAAC 1381 AAAAACACAA TATATACACT TCTGTTGTAT AAAGTTAGAG AAAATTGTAT TTCACTGCAA 1441 TTATATAATT CACAATTGTA TAATTCGTTG GCCTTTTTCT CTGCAATATT TGAAGTAAAA 1501 TGTTTGTAAT TTGTATAATT AAGTGTATAA CACGAAGATA TATATTTTTA CATGTGTATA 1561 TACAATTTTC TCTCGCTTTA TACAAAATAG AAACAGAATT TATACACTTC TGTGTATATA 1621 GCGAGATAGG CGAGCGAGAA TGGAGAGTGG CGAGCGAGAT TTTTGGGAGG GAGACGCCTG 1681 ACAAACTTTA GCTAACGTTT GCTATGGAGC ACAATTAAAT CAAACTCTAG CTACTCCATT 1741 TAGTTAGATT ACTAGTTTGC TATTATATAC AATTTTCCCA AGAGCTTGTA TAAAATGGAG 1801 AGAAGGAAAA AGATTTAGAA TTATCGCTTG TGAATGACAA GAAAAGATTG CAAAATAAGA 1861 ATTTAGTTGT TTTGCTTTAA TAAGTTCCTT AATCATTTCA TTAGGGGTGT AATAATAGAA 1921 TAAGAAAAGG GGCTTGAAAT TCAGCTATAA ACAACCAGGG CATAGGTAAC GTGTATACTT 1981 GATGTTATAG GTGAAATTTC AGCTTCTCAA TTTCGATTAA TTAACATTAT ATATATATAT 2041 ATATATTCCC CCAGTATCAC ATTTGATAGG AAAAAAAAAA AAAAAAAAA Predicted gene structure (within gDNA segment 14677 to 1): Exon 1 11572 11550 ( 23 n); cDNA 1029 1051 ( 23 n); score: 0.696 Intron 1 11549 8755 (2795 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.64) Exon 2 8754 8662 ( 93 n); cDNA 1052 1139 ( 88 n); score: 0.602 Intron 2 8661 8227 ( 435 n); Pd: 0.802 (s: 0.56), Pa: 0.000 (s: 0) Exon 3 8226 8199 ( 28 n); cDNA 1140 1167 ( 28 n); score: 0.571 Intron 3 8198 7865 ( 334 n); Pd: 0.900 (s: 0), Pa: 0.817 (s: 0) Exon 4 7864 7848 ( 17 n); cDNA 1168 1183 ( 16 n); score: 0.706 Intron 4 7847 4575 (3273 n); Pd: 0.072 (s: 0), Pa: 0.000 (s: 0.58) Exon 5 4574 4089 ( 486 n); cDNA 1184 1690 ( 507 n); score: 0.765 Intron 5 4088 1880 (2209 n); Pd: 0.202 (s: 0.79), Pa: 0.994 (s: 0) Exon 6 1879 1869 ( 11 n); cDNA 1691 1699 ( 9 n); score: 0.727 PPA cDNA 2071 2089 MATCH C06HBa0120H21.1-8- SGN-U315404+ 0.739 658 0.315 C PGS_C06HBa0120H21.1-8-_SGN-U315404+ (11572 11550,8754 8662,8226 8199,7864 7848,4574 4089,1879 1869) Alignment (genomic DNA sequence = upper lines): CTTTCCCAGA TATCAAACAA TCATCTCAGA ACTGGTACAG ACAAGGGGAA TCTGAAGAAT 11513 ||||| | | || |||| || | CTTTCACGTA TAGCAAATAA AAA....... .......... .......... .......... 1051 AAAGTGCAAT TACGTGTCCT AATATCTTAG GCCCTATTTT TTGCATTAAG ATGAAGACAT 11453 .......... .......... .......... .......... .......... .......... 1051 TTGAATTTGA ATGCACATTA GAGTGATTAA GGTTGTTTGT TTTTTAACAT CTGTATATGC 11393 .......... .......... .......... .......... .......... .......... 1051 ATAATTTATT TATTTTTAAA CATAATTAAT ATACAATTCA AATAAAAAAG TTATTTAAAT 11333 .......... .......... .......... .......... .......... .......... 1051 ATTAGAAAAA TATATACAAT AAAATCATTA TTTGATAAAA AATGAATCAT TTATATTCGC 11273 .......... .......... .......... .......... .......... .......... 1051 TAGTAATGGT GTAACTGGAT TGCAGTGGCG GTGGTGATGG TGATGGTTGG TACTAGTGGC 11213 .......... .......... .......... .......... .......... .......... 1051 TAGTAATGGT GGTGGTGATA GTTATGTTGG TGAAGGTGGC TGGTGTTGTA GTGGTTGTTG 11153 .......... .......... .......... .......... .......... .......... 1051 CGATGGTGGT GGTGATGATG ATTGTTAGTA GGTTGATGAT AGTAGTTAGT GGTGGCGGAA 11093 .......... .......... .......... .......... .......... .......... 1051 GAATGTGAGG TGGTTGATGA TGGTGTTAGT GTTGGTGGTA GTTGGCGGAG GTTTTAGTTG 11033 .......... .......... .......... .......... .......... .......... 1051 TAATTGTGGA GGTGGTTGTT AGTAGGAATT GTCCAGAGTG GTGGTAGTGA CCGGTAGTGG 10973 .......... .......... .......... .......... .......... .......... 1051 TAGCGTCATT AGATATAATT AATGATGGAG GTGTTAGGTG TGGTATCGAC TAACAATAGT 10913 .......... .......... .......... .......... .......... .......... 1051 GGTTGACAGT GGTAGTTGAT GACAAAGGTG GTTGGTGCTT GGTGACGGTT CGTGTGGTTG 10853 .......... .......... .......... .......... .......... .......... 1051 ACCGTGGTTA TGGTGATGGT AGAGGTGGGC GGTTGTCGAC AAAGTGGTAC TTAATAATAT 10793 .......... .......... .......... .......... .......... .......... 1051 TACACTTTAT TCAATGTTTT AATGATTTAG ACCAATTCAG AACAAATAAA TGTTTAAATC 10733 .......... .......... .......... .......... .......... .......... 1051 TTAACGTAAA CAAATGCATT TAATGCCTAA GGTTTGAAAT ATTCAGATTT AGACCTCCAT 10673 .......... .......... .......... .......... .......... .......... 1051 GAAGTGCAAA ATATTGAACC CTAAGACTTA CTTTGCATGT GCTGATTATC CTTGGAAAAT 10613 .......... .......... .......... .......... .......... .......... 1051 AAATATGCCT GAGAACTATA AAGGGAAAAG TACTGCCCAA TACCTATGGG CCATCTAGTT 10553 .......... .......... .......... .......... .......... .......... 1051 TACAAAATAT GTATACAATG TATATGTAGT GTATGCTTAA TACTAAATAT TGTACATGCC 10493 .......... .......... .......... .......... .......... .......... 1051 TCTAGTGTAT ATTCTGGTCA TGTATGGTAA ATTAGATAGT CAAATAGTAT ATATTGGTAA 10433 .......... .......... .......... .......... .......... .......... 1051 CCATCCCAAC TGTAAATCCC CTTTTAATAA AAAATAGATC AACTGAAGGA CATCGCAATT 10373 .......... .......... .......... .......... .......... .......... 1051 GTTGCATTTA GGTCCATATG TATACCTTTG AGTTTTCTTG TTAAATCTCA ACCCTTAGTG 10313 .......... .......... .......... .......... .......... .......... 1051 TCATGTAAGT TATACATTAA AATATAGCTC CGAACAAATA TGTTTGTGTG ATTGTGATTA 10253 .......... .......... .......... .......... .......... .......... 1051 AAGTTATGTC TGCATGCTTT TGTATATGTG TTGAAACTGA GAGCATTTAA AGCTATCTTT 10193 .......... .......... .......... .......... .......... .......... 1051 ATGTGATTAT CTTCACTTGT TCAATTTTGC TTTCTCCTCT TGGGCAGGTT GCGTATAAAC 10133 .......... .......... .......... .......... .......... .......... 1051 TTGTTAAATT AAATGTCTAA GCGGTAGTCA ATGTCGCGTT CTCTACTCCG TTTTTCGTAA 10073 .......... .......... .......... .......... .......... .......... 1051 TTTGTTTTTT TCTGCTCACT TTAGTGTGTG TGAAACCTAG GTTAGCCTTT GAACCAAAAT 10013 .......... .......... .......... .......... .......... .......... 1051 GGGAAATAAC ATTAAAATTC TGAATACTTC TTCACAACTT AAGTGGTTGA AGCAAATGCC 9953 .......... .......... .......... .......... .......... .......... 1051 TACCATTTCT TCTCCGAATT TCCCCTCTTA CCCTCTTCAA CAGCAGAGGC AGCATCGAAT 9893 .......... .......... .......... .......... .......... .......... 1051 GCTTATACCG AAATAATTAG CTTCACTTCA ACACTTGTGT CAAAAATCAA TGCTTTTGAA 9833 .......... .......... .......... .......... .......... .......... 1051 CCCAGAAGAG TTATCGCAAA CTGCACAGCC GGCACAACAG CAATTGGGGT ATCCGCAGAT 9773 .......... .......... .......... .......... .......... .......... 1051 GCATCAGAAT CAGCAGCTCC AACAGCAACA ACAACAACCT CAGCAGGTTT TGCATCAGCA 9713 .......... .......... .......... .......... .......... .......... 1051 GCAATCTTCT CCAGCAATGA ATTCCCCTGG TGGTCATAAT TTACTGAGTT TGACTGGATC 9653 .......... .......... .......... .......... .......... .......... 1051 AGAACCAGAT GCCACTGGAT CTGGGACAAC GACTCCTGGG AGTAGTTCAA GCCAGGGGGC 9593 .......... .......... .......... .......... .......... .......... 1051 TGAAGCAAGC AATCAGTTTC TTGGGAAGAG AAAGATTCAG GATTTAGTTT CACAGGTTTT 9533 .......... .......... .......... .......... .......... .......... 1051 CCCTTAACCT TCTTAATCTC CAAAGCTTTC TGTTTTCCTT GGTGGTTACT CAAGCCAACG 9473 .......... .......... .......... .......... .......... .......... 1051 TTGGAATCAA TTAATTTTTA TTTATTTTCA CAATCAATTA AAAAAAATTG GTCCACAAAG 9413 .......... .......... .......... .......... .......... .......... 1051 TATGTGAAAA GGCTGTTTTC GGCAGACCCT CGTTCAAGGG AAAAAAATAG CATACTTGTA 9353 .......... .......... .......... .......... .......... .......... 1051 AAAACAGTTA ATTTCTATAC ATAAGTTTTG TGGTTATTAA CACTGCAACT TTTTAACATC 9293 .......... .......... .......... .......... .......... .......... 1051 TTTCCACATA TTATGATAAT GTTCCAGGTG GATCCTCAGG GAAGAGTTGA TCCTGAAGTT 9233 .......... .......... .......... .......... .......... .......... 1051 GAACAGTTTC TTTTAGAGAT CGCTGATGAC TTTATTGATT CGGTGAGGCA TAGAAATGAT 9173 .......... .......... .......... .......... .......... .......... 1051 TTTTTCTTTG TTATCGTTAT CTTTTTTGTT AGGATTGCGT CAGGAGGTTG GGAGACTGCG 9113 .......... .......... .......... .......... .......... .......... 1051 GGTTTCATGT TTTGCATGCT GCCTTTCTCA TTCATTCACG GTTCAAGTTC TAAGATTGTT 9053 .......... .......... .......... .......... .......... .......... 1051 AGCTTTGATA ACTGTGGTTG AATACATTGA CATAGCCAGT AAAGAGGATT TTTTCTTGAA 8993 .......... .......... .......... .......... .......... .......... 1051 GGGTCATCGT ATGCTGATTC TGGACTCTTC CTTTCACACT TACCCTTCAC TCAAATGTAA 8933 .......... .......... .......... .......... .......... .......... 1051 ACTAAAAATT TTAAAAATAA TAATAAAAGA CCCCCCTATT TTCCCATTTT TGAGTTGTAA 8873 .......... .......... .......... .......... .......... .......... 1051 TGCCAAAAAA AATATCAAAG GCATGACAAG GAACCTTTTT TAAGAAGAGT TTCCTTTTGA 8813 .......... .......... .......... .......... .......... .......... 1051 AATACTTTCT TGAGAAACAT TGATTGTTTG TCATTTTCTT TGCTAAGTTT CATGTTCCAT 8753 || .......... .......... .......... .......... .......... ........AT 1053 TCCTCTTTGT AGGTTACTAC ATTTTCTTGC AATTTGGCGA AGCATCGGAA ATCTTCGACT 8693 || | ||||| | |||| || | | |||| | | ||| ||| | || | | | | TCATATTTGT ATGTTATAAC AAAGT-TTGC ATAATTGCGC TCCATAGCAA A-CAT-AAAA 1110 CTGGAGTCCA AAGATATACT GTTACATTTA GGTTTGTGCC AGCAACTGCA AAATATGCAG 8633 ||| | | | ||||| |||| || | CTGTA-TAAT TCGCTATACA TATACAATT- G......... .......... .......... 1139 AGTTGTTTTA CTTTGTTTTG TTTTTACAGA AAAAACAAGC TATTTATCAT TATTTTAGAA 8573 .......... .......... .......... .......... .......... .......... 1139 TATAAAAGGA GTCATTTCAT ACTTCATCAA AGATGATAGA ATTATAGAAA TAAAAGGTCC 8513 .......... .......... .......... .......... .......... .......... 1139 TAGTTGTTGC ATATATTTTC ATAATAAGTT CATTTTTATT GCTTTCCGTT AGTGAGTGTG 8453 .......... .......... .......... .......... .......... .......... 1139 TTTTTTACAT GACTATATGG TGAGATCCTC TGCATTTAGC CTATACACAG GTTTTCTATC 8393 .......... .......... .......... .......... .......... .......... 1139 TGGAATGTAA TTAGAGTTAC AAAATTTGAC ACATACTCCT ATTGACTCAG ACTGGGACCA 8333 .......... .......... .......... .......... .......... .......... 1139 AACTTCGTTT TCATGTCAGT ACAGTGTACC AACAACCATC AGTCTCCACA GTTTAATGTC 8273 .......... .......... .......... .......... .......... .......... 1139 TTTAGCTCTT TGAGCTTTTT AGCTACTTAT GCGTTCAGCC AACTAGTAAA ATTTGTATTC 8213 || | ||| | | .......... .......... .......... .......... ......TATA ATTCGCTGGC 1153 CACCACCTTA TAATGTATGC TTCAAGAGTT GCTGGTGAGT ATATGATGAG AACAGTTTGA 8153 | | || |||| CTAAATTGTA TAAT...... .......... .......... .......... .......... 1167 ATTGCTAAGC TTTCTCTTTA GTTGTCAGGA GTGGCTTTGA AGTGAGGTGT GACCTGAGTC 8093 .......... .......... .......... .......... .......... .......... 1167 CTGACTTAGT TAAATTGTTG AGTATCTGAA TAGTCCTAGT TGCATCTGAG AGAGTGATTG 8033 .......... .......... .......... .......... .......... .......... 1167 AGTCACACTG TCATTGTATA CAGTATATAT AATGGTTTAC CTTTTCGTGA ATAGATTTTG 7973 .......... .......... .......... .......... .......... .......... 1167 ACTTCCTCAT TTGTTTAGAA CATGCACAAT GCAACTCTAT CTTATTCCAA CACATCAATA 7913 .......... .......... .......... .......... .......... .......... 1167 CCTGGTAAAA TTTTGTTAGG CAACAGCTTT CTGCTTTGCC TAATGCAGTT GATTTGCTAT 7853 || ||| ||| .......... .......... .......... .......... ........TT GATGGCCTA- 1178 TTTTGGCGGA AAGGCTGCTT CATGATCTCA TCAATGCAGT TGTTCCTGGG AATGTTTACA 7793 ||| | TTTCG..... .......... .......... .......... .......... .......... 1183 TTCGGTCTAT CACTGCTTAA GATGTTTTAT AACAGAATTT GATTTAGAAA GAACAAAAAA 7733 .......... .......... .......... .......... .......... .......... 1183 TATGGTATTA GTCTAAGTTA ACTATGATGT TCTTTACAGA GAAAGATTGG AATTTGACTG 7673 .......... .......... .......... .......... .......... .......... 1183 TCCCAGGTTT TTCAAGTGAG GATAAGAAAC ACTGCCCTGA ACATGTAAGC TCTTCTTCCT 7613 .......... .......... .......... .......... .......... .......... 1183 CTCTAACTTT CATGGGTGGT ATACTGGTCT ATCTTATATC TACAAATAAT TGAGGAGTTC 7553 .......... .......... .......... .......... .......... .......... 1183 TGTAAGTTGT AACCAATAGT TAATTATGAT CCATGGCCTC TGAATACTGG AAGTACTCGA 7493 .......... .......... .......... .......... .......... .......... 1183 TAACATATTT CTTGAAAAAG GAAAAGAAAG GCTCAATAAC TTAGCTTTTT GCCCAAGTCT 7433 .......... .......... .......... .......... .......... .......... 1183 TGTTTATTAT AAATTTAAGT TTCTGATTAC AGTGCACTCA TCGCGTTATA AGTACCTCTT 7373 .......... .......... .......... .......... .......... .......... 1183 GATCTCTTTC TGGTCAATGA TTGTTGTAAC TTTAAGCTGA GTGGAAATTC AGGCTGTTTT 7313 .......... .......... .......... .......... .......... .......... 1183 ATTGACAATA TGTTTCTAAA CTTTGTGCAG TCATCAGGTG ATCTCTGCAA AGAGCGTTTG 7253 .......... .......... .......... .......... .......... .......... 1183 GAAATGGTGA GTTGAGAGTT GTTTTGTGTT AATACTACCT GCATTTATTT TAGGTGTATG 7193 .......... .......... .......... .......... .......... .......... 1183 ATTCATAGTC TTTGTGCTTG ACTCAGATAC CTGATATGAT GGAGGCTTCA CCACAAGCTG 7133 .......... .......... .......... .......... .......... .......... 1183 AAGCAAGTAC AAGCAGCAGC GCGAAGGAGA TCGTAAGTCC AGGGCTGGGT GACCAGGTTG 7073 .......... .......... .......... .......... .......... .......... 1183 GTTCGACTGA CATAATCGGA CCACCAAGTT CAGAGGAATT GGCTTCACCA TCTAATGGTG 7013 .......... .......... .......... .......... .......... .......... 1183 AAATATAGTT CAACTATAAC AAGATGTGAT GTTCCGTGGA GATTGGTATA CCCCTTGCTT 6953 .......... .......... .......... .......... .......... .......... 1183 TACTGTAATA AAACCTTTTT CAACTTTAGT TTGCTTGTCG ACTGAAGAAC GTGAACATAA 6893 .......... .......... .......... .......... .......... .......... 1183 GTCTGTTATG AATATTCATG ACCTTTTGTT GGTATGTTAT GTTAAGTTTT CTTTATTATT 6833 .......... .......... .......... .......... .......... .......... 1183 TACTATGTCC TCTTTACTTC CTGTATTGTT CTGTTAGCTG CTCCGGAATC TAAGACTGGG 6773 .......... .......... .......... .......... .......... .......... 1183 CACGGGACTG AACCGAGATG GTTTGATCGG GATGTTGACC GGTATCGGGA TGAATCGAAC 6713 .......... .......... .......... .......... .......... .......... 1183 CGAAATTATC GGGACGAAAA CTCGGTTCCG TCTTGTCCCA CTATATACCG GGATAGAATT 6653 .......... .......... .......... .......... .......... .......... 1183 GGATTGGACC GGACGATACA TTTAGCTATT TTAAACAATA ATTTTTTTTT GAATTACGAA 6593 .......... .......... .......... .......... .......... .......... 1183 AATATGAATG TTTTTTTTTT TTTTTTTTTA AAGTTTAATA GTTTATAAAG TATTTAAGTT 6533 .......... .......... .......... .......... .......... .......... 1183 TTGAAATTTA TAATGTTTAT TTGTAATTTT ATTTTAAAGA TATTTTTCTA AAAATTTTAC 6473 .......... .......... .......... .......... .......... .......... 1183 TTAAAAGTAT GTGTATATTC TACCCTTCCT AGATCCTGTT TTGGGATCAT AATGGGTTTT 6413 .......... .......... .......... .......... .......... .......... 1183 TTGTTGTGTT ATTTGCGTGA TTGTTTTACA TTATACCATT TTTGTCATGC CTTTTGCCAC 6353 .......... .......... .......... .......... .......... .......... 1183 CTCCTCAAAT CATGCGTCTA ATCTCCCGTT TGGTCATAAA TTTTTCACTT CCCTTTTTTT 6293 .......... .......... .......... .......... .......... .......... 1183 CAAAAATATT TTAAATACAC TGTTTCTGAA AATGAATATT TTTTCAAGTT TCAAAAATTA 6233 .......... .......... .......... .......... .......... .......... 1183 GTTTACGAGT AGTTTTTCAA GCTTTTGAAG TCCTAGACTC GCAAAACTTC AACATAAAAT 6173 .......... .......... .......... .......... .......... .......... 1183 GCATATCCAA CCATAACTTC ATTCTCAAAA AATCATTTTT CATTTCATCT TGAAAACCCC 6113 .......... .......... .......... .......... .......... .......... 1183 ATATTTTTGC ATTACAAGAA CATTTTAAGC AGCCTTATTT AGGGCTTACC TTCCACCTTA 6053 .......... .......... .......... .......... .......... .......... 1183 TGCTTATGAC TGGGCCGATC TATCAAGCCC AGTCTTCTCC GGCCCAACTC TTCGCGTCTG 5993 .......... .......... .......... .......... .......... .......... 1183 TGTTGAGCCA ATCATTTATC AGGTTAACTA AAAATCTATT AGAATTAACT TAAAAGTTAA 5933 .......... .......... .......... .......... .......... .......... 1183 TTAAATCTGT TTTTTAATCG TTTTTAACTT TTAGGAGTAT TCGTCAAATT TAATAATAAG 5873 .......... .......... .......... .......... .......... .......... 1183 ATGTGTGTGT ATATATATAT ATTGGATGGG AATTTAATAT TGGAGTTTGG AATCCCATTT 5813 .......... .......... .......... .......... .......... .......... 1183 GCAAGCCATG TTCACTTCAT GGCTTTACAT TTCTTTGGAT TTATAGACTA TATAATATAA 5753 .......... .......... .......... .......... .......... .......... 1183 TACCATTAGT ATATGATTTT TTCCCCAAAA GTCTTAATAT TAGCCTGTCA GTGAGATGGT 5693 .......... .......... .......... .......... .......... .......... 1183 CGGGCTTCCT CGTAAAAGAC ATTTTTTCTT CATATAAAAA TATAATCCCA TAAAAAAAAA 5633 .......... .......... .......... .......... .......... .......... 1183 CAAATGAGAA AAATATATTT TCTTCTTCTT TCAACAGACA AACTAAATTA TATTACATGA 5573 .......... .......... .......... .......... .......... .......... 1183 AGATTCAATG TTACAAGAAT TTAACATGTT ATCATTCACA ATAAACTTTC CAAATTTTGA 5513 .......... .......... .......... .......... .......... .......... 1183 GTTGCCTTGA AATATCATTA AAAATTATGA TGGAATGAAT ATAATCTCTT TGCTTTAAAT 5453 .......... .......... .......... .......... .......... .......... 1183 TAATTTTGAG CGTCGGCTAA AAAATATATA GTCATCTAAT ATATCTCACA CAATACATAT 5393 .......... .......... .......... .......... .......... .......... 1183 TTAAATTAAT TATTATACAA ATATCAAATA TAGAATAAGA GAGCGAAAAA TCAAATAAAA 5333 .......... .......... .......... .......... .......... .......... 1183 AGTAAAAATA CAACTTGAAT ATCCTAAGTT TATAGGATAA AATGCGACTT GTTAGCCTAA 5273 .......... .......... .......... .......... .......... .......... 1183 TGCATAAGAC AACAATTCTA TAGATTTTAC TCCCTTATTT GGTTAAGTTA TCTCTAACTA 5213 .......... .......... .......... .......... .......... .......... 1183 ATAAGTATTC TCTTTTAACT CAAATAACTT AGATTTTTTT TACTATGGAT CTAAAAATGA 5153 .......... .......... .......... .......... .......... .......... 1183 AGTATTTTTA TGTTTAGTAA TAGTTCAATT CTAAAATGTT TATTTTATCT TTAATAGAAT 5093 .......... .......... .......... .......... .......... .......... 1183 GATTAATAGT TACACAAATA TTACTAACTA TTTTTAGATC ACAAAATTTT AAATATTTTT 5033 .......... .......... .......... .......... .......... .......... 1183 TTTAAAAAAA TTCATATCAA ATCAAACTAG ATCGAAAAAA AGTTAATTTT CTTACATTAT 4973 .......... .......... .......... .......... .......... .......... 1183 GGATTATGAA ACTTAAGTGA GTACCAAAAA TCAAATCCAA AAACACAAAC ATGTAAATGA 4913 .......... .......... .......... .......... .......... .......... 1183 CTCTTACTTC TTCTTTTCTC TACGATGGTT TTTTACTTCA TGTTTGTTAA TCATATTAAA 4853 .......... .......... .......... .......... .......... .......... 1183 ATCTAATATA AATACAAGTG ATAATCTCAT TCAATACAAA AAATTTGTAC TTTTTCTAGC 4793 .......... .......... .......... .......... .......... .......... 1183 TTTAGCACAA TTTCTTACTT TATTATTTCG TTGTCAATTA TAAAAGCGAT GCTTTTAACT 4733 .......... .......... .......... .......... .......... .......... 1183 AAATTACTCT TATTAGCTTA AACTCGAGAC TTTCGATAAT GTGTACAAAA TGTGAAATTA 4673 .......... .......... .......... .......... .......... .......... 1183 TGTTGCATTT TTTATTGGGT AACTTTCACA TATAGCAAGC AAAAAATTCA TATTTGTATG 4613 .......... .......... .......... .......... .......... .......... 1183 CTATAGCAAA CTTTGCATAA TTGCGCTTCA TAACAAACAT AAAACTGTAT AATTTGCTAT 4553 | || ||||| |||| ||||| .......... .......... .......... ........CT GCAATTGTAT AATTCGCTAT 1205 ATATATACAA TTATATAATT CGCTGGCCTA AATTGTATAC TTCGCTGGCC TATTTCGCTG 4493 | || || | || | | | | ||| ||||| ||||| || |||||||||| CTTCATTCAC TGCGATTGTG TAAT-G-CAC AATCGTATAA ATCGCT-GCT TATTTCGCTG 1262 C-A-A-TTGT AT-AATTTGT TTTGCATACA GTTGAATCGA ATTAAAATGT ACGTATATTG 4437 | | | || | || |||||| ||||||||| |||||||||| |||||||||| | |||||||| CAATATTTTT ATAAATTTGC TTTGCATACC GTTGAATCGA ATTAAAATGT ATGTATATTG 1322 CATAATTATA AGTGTATAGC AAGAAGATAT ATGTTTTT-- CTCGCTTTAT ATAAAAACAG 4379 |||||||||| ||||||| || ||||| |||| |||||||| |||||||||| | ||||||| CATAATTATA AGTGTATGGC AAGAATATAT ATGTTTTTGT CTCGCTTTAT ACAAAAACAA 1382 AAACACAATA TATACACTTC TGTTGTATAA AGCTAGAGAA AAGTGTATTT CACTG----- 4324 |||||||||| |||||||||| |||||||||| || ||||||| || ||||||| ||||| AAACACAATA TATACACTTC TGTTGTATAA AGTTAGAGAA AATTGTATTT CACTGCAATT 1442 --------CA --ATTGTATA ATTCGTTGGC CTTTTTCTCT GCAATATTTG AAGTAAAATG 4274 || |||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATATAATTCA CAATTGTATA ATTCGTTGGC CTTTTTCTCT GCAATATTTG AAGTAAAATG 1502 TTTGTAAATT GTATACTTAA GTGTATAACA CGAAGATATA CATTTTTGCA TGTGTATATA 4214 ||||||| || ||||| |||| |||||||||| |||||||||| |||||| || |||||||||| TTTGTAATTT GTATAATTAA GTGTATAACA CGAAGATATA TATTTTTACA TGTGTATATA 1562 CAATTTTCTC TCACTTTATA CAAAACAGAA ATAGAA-TTA TGCACTTCTG TGCATAAAGC 4155 |||||||||| || ||||||| ||||| |||| | |||| ||| | |||||||| || ||| ||| CAATTTTCTC TCGCTTTATA CAAAATAGAA ACAGAATTTA TACACTTCTG TGTATATAGC 1622 GAGAGAGGCG AGCGAGAATG GAGAGTGGCG AGCGAGATTT TTGAGAGAGA GAC-ACTGAC 4096 |||| ||||| |||||||||| |||||||||| |||||||||| ||| ||| || ||| ||||| GAGATAGGCG AGCGAGAATG GAGAGTGGCG AGCGAGATTT TTGGGAGGGA GACGCCTGAC 1682 AAA-TGGAGC ACAATTAAAT CAAACCCTAG CTAGTCCATT TAATTTAGGT TATTAATTTG 4037 ||| | | AAACTTTA.. .......... .......... .......... .......... .......... 1690 CTATTATATA CGATTTTCCC TTTTTTTAAG GTTTATTAGG TCATGGAATT AATTTACCAA 3977 .......... .......... .......... .......... .......... .......... 1690 ATTTGACCCA CCATTGAACG GTTTCATTTC TCTCTCTAAC AGCCACTTCT CTCTCTTAAC 3917 .......... .......... .......... .......... .......... .......... 1690 TTCATCGTCC CCCCATTTCT GTTTCCTCCA TTTCTCAGAA ACAGGGGTTA AGGTTAACCC 3857 .......... .......... .......... .......... .......... .......... 1690 AATTTACGAC CCGATTCTAT CCCATATCCG ACCCGTTTCC CTGCGTAATC ACCTCGACAC 3797 .......... .......... .......... .......... .......... .......... 1690 CAATCTATGA TGCCCTAGTT CTTGGGGGCA TCATCATCTT CAACAGTTCC TCTTTGATTC 3737 .......... .......... .......... .......... .......... .......... 1690 TTACAAATTT CTGGAAGAAA TCAACATTTT GGGATCGTTT TTTGGGGACG CATTTGGTTG 3677 .......... .......... .......... .......... .......... .......... 1690 ATTTCTCGAG ATTAGAGAGA CAATTGGTGG GTGCCATTTA TTTTGCAGAT TGATGAACCC 3617 .......... .......... .......... .......... .......... .......... 1690 TAGCTATTTA TGCCTGATTG TAAATTTGGT GAAGATTTTG CTTAATTGAA GGAAAAACCC 3557 .......... .......... .......... .......... .......... .......... 1690 AGATGTCAGG AGGATTTGGT GAGTTGATTT GGATGAAAGG GGTTCAAATG AAGGTAAATG 3497 .......... .......... .......... .......... .......... .......... 1690 GGTTTTGAAG ATTCATATAG TTGACACCAA ATTAGCTTTG TATTGAGGTT TAACGATTGA 3437 .......... .......... .......... .......... .......... .......... 1690 TTTGCTCATG TATGGATCCT TCATTGTTAA TCTTAACCTT TTTGGGTGTT TGCTTTTGCT 3377 .......... .......... .......... .......... .......... .......... 1690 TCCTTGCTCT TGCCGTGCAA TCTCAGCAAA ATGTGAGTCC TCCTTCTCCG CCGCCGCCGC 3317 .......... .......... .......... .......... .......... .......... 1690 CGTCTAACCA ATCTCCTCCT CCACCGCCTC CACCACCATC ACCAGGGCCT CCTCCTCCTC 3257 .......... .......... .......... .......... .......... .......... 1690 CATCCCAGCA AAAATACCAT TCTCCACCAC CAACTAAATC TGTGAATTCT GCGACCACCT 3197 .......... .......... .......... .......... .......... .......... 1690 CGGAGAGTAA ACACTCTAAT CATGATAAAA AACACCATAA CTCTTACGGG AAATCGCATC 3137 .......... .......... .......... .......... .......... .......... 1690 AACCAGCAAA GAAAAAGAAG CCAAATTTGG GGAAGAAACT GGGGTTAGTG TTTGTGGGTG 3077 .......... .......... .......... .......... .......... .......... 1690 TTGCTGGGAT GTTGCAGGTG TGTGTGGTGG CGTTCTTGCT AATAAAGAGA AGACAATTGT 3017 .......... .......... .......... .......... .......... .......... 1690 TAAAGGCTGG TAGTAGATTT TGAATGAACA TTTGAATATG GATGTATATC AGTTAGTCTA 2957 .......... .......... .......... .......... .......... .......... 1690 ATTAATTCAG AATTTTACGA GACCGCGAAA GGCAATGAAC ACGGCATTGA TGAATTAGAA 2897 .......... .......... .......... .......... .......... .......... 1690 GCATTGGGTT CATCTGACAT GTAGATTTCT GCATTTGCAT TGGTGGCTAC TGTAAATTTG 2837 .......... .......... .......... .......... .......... .......... 1690 GTAATTCTTG ATCAATATGC ATATTTATCT TACGTTTTTC TCGTTTATTA TGAATGAGTA 2777 .......... .......... .......... .......... .......... .......... 1690 TCAATGCATC ATGTATTACT AGTGTGAATG ATTTAGAACT CATTGTGATT TTTATGTGTT 2717 .......... .......... .......... .......... .......... .......... 1690 CCATTATTTC GAAATCTACT TTCTCTGTAA TTGTGCTGCT GCTTTAAGTG AGCATTTATT 2657 .......... .......... .......... .......... .......... .......... 1690 CAACGGATGG ATCGATAAAA TTAAACATTC CGGTAGTCTG AAGCTGCTAA CAGCTTTTGT 2597 .......... .......... .......... .......... .......... .......... 1690 CAAAAGTTAT GTATGATGAG TCCCTTTTTG CTGGAATATT CCGAATATGT TTGTTATTAG 2537 .......... .......... .......... .......... .......... .......... 1690 GTTTGGTCTG CTCTTTAGCT GAAATATACT GTACTGCTGG AAAGCTTTAT CCTTTTTCGA 2477 .......... .......... .......... .......... .......... .......... 1690 TGATAATGAA CTATAAAGAA TGTACTTTTG CAGAGCATGT TGAGAAATAC AGATGAATGT 2417 .......... .......... .......... .......... .......... .......... 1690 GCAGACTTCG GATGCTTTGT TTCTAGTTGC TCGTATGTGT AAGAATCAAT TTGAAGTTCT 2357 .......... .......... .......... .......... .......... .......... 1690 CGATAAGATC ATTTGTAGGT CTCAATCAAG CATAACAATT CAATTAAGAA AGAGAAAAGG 2297 .......... .......... .......... .......... .......... .......... 1690 ATTATAGTGA AAACTCATGG TGCTATGTGA ATTTTTACTC TCTTTTGCAT TTAACTGATC 2237 .......... .......... .......... .......... .......... .......... 1690 CTACCATTTT TACTAGTTTA GAAGAAACTA ACTAATTCAA GTAACGGTTA ATGTAGATGA 2177 .......... .......... .......... .......... .......... .......... 1690 GGTGAACCTT TGGTGTTTGC ACCTGGTTGC ACGTACCTTG ACTAATCCAC CGAACATTCC 2117 .......... .......... .......... .......... .......... .......... 1690 AAAGAATGGG GAGTAAATAG ATACTCCGAC TTCAATTGAA TGCAGAACTA ATTCTTGATG 2057 .......... .......... .......... .......... .......... .......... 1690 ATGGGATTTG CTTGGTTTGT CCTAGAATTT ACTTGCTATT TGTTCGCGGC TGCTAAAAAG 1997 .......... .......... .......... .......... .......... .......... 1690 TGTTTTGAAG TTCTGTCTAA CAGCAAAAAT ACCTTAGTTA CTGCTTGTGC TTCCTTGTTG 1937 .......... .......... .......... .......... .......... .......... 1690 TGTTTCTGTA GGAATAATGC CAACTCTTCA TGACTCCTGT GTTTTTTGGT ATTGCAGGTC 1877 | | .......... .......... .......... .......... .......... .......G-C 1692 TCAAAGTT 1869 | || ||| T-AACGTT 1699 hqPGS_C06HBa0120H21.1-8-_SGN-U315404+ (4574 4089) ******************************************************************************** EST sequence 2 -strand 680 n (File: SGN-U315405-) 1 ATCGAATTAA ATAAGCTATG TGTATATATT AAATGTAGAG GAGAAAGAGA GGGAATAATT 61 TAGAATTAAG GGTAAAATAA AATGGTTCAT ATTATATTTT TTAATGGGTC GGGTTAGGTG 121 GGTGGATCAC TAAATAGTAT TTTTTATTTT TTTAAAAAAT TTTGATTTGT TATATAAACA 181 ATTAACGTAA ATAAATTTTA ATTTAAAAAG TAATTTAATA GATTGAGTGA CTTGAGGAAA 241 AAATAAATAG TTGAGTCACT TTTGAGTTAA AATTCAAAGT TGAGTCACTT TTTGAGAAAT 301 GTATGGATAA TTGAATACTT TTAAGGAAAA ATTGAAAGTT GAGTGACTTT TTGAGAGAAA 361 GTATGGGTGG AATCCCCCGA GAAGAGTTAG AATGACCGTT CCCGCTCCTT CGGCGATACC 421 AAACAAATGA CTATTTGAAT CGGGGTTTTG AGCACAAAGA TTAGTTGAAT GACCATCTAA 481 AGTATTAACT CGGAGATAAA CTGATTTTGG AAGAACTATT TTCATATGTC CATGATACAT 541 TATTGGGCAA CTTTTACATA TAGCAAATAA AAAATTCATA TTTGTATGCT AGCAAACTTT 601 GCATAATTGC GCTGCATAGC AAACATAAAA CTGTATAATT TGCTATACAT ATACAATTGT 661 ATAATTCGCT GGCCTAAATT Predicted gene structure (within gDNA segment 11109 to 3919): Exon 1 7280 7247 ( 34 n); cDNA 496 528 ( 33 n); score: 0.588 Intron 1 7246 7167 ( 80 n); Pd: 0.909 (s: 0), Pa: 0.998 (s: 0) Exon 2 7166 7155 ( 12 n); cDNA 529 539 ( 11 n); score: 0.583 Intron 2 7154 4662 (2493 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.92) Exon 3 4661 4519 ( 143 n); cDNA 540 680 ( 141 n); score: 0.930 MATCH C06HBa0120H21.1-8- SGN-U315405- 0.930 189 0.278 C PGS_C06HBa0120H21.1-8-_SGN-U315405- (7280 7247,7166 7155,4661 4519) Alignment (genomic DNA sequence = upper lines): ATCAGGTGAT CTCTGCAAAG AGCGTTTGGA AATGGTGAGT TGAGAGTTGT TTTGTGTTAA 7221 || | |||| | || || | ||| | ||| ATAAACTGAT -TTTGGAAGA ACTATTTTCA TATG...... .......... .......... 528 TACTACCTGC ATTTATTTTA GGTGTATGAT TCATAGTCTT TGTGCTTGAC TCAGATACCT 7161 | | | .......... .......... .......... .......... .......... ....-TCCAT 533 GATATGATGG AGGCTTCACC ACAAGCTGAA GCAAGTACAA GCAGCAGCGC GAAGGAGATC 7101 |||| GATACA.... .......... .......... .......... .......... .......... 539 GTAAGTCCAG GGCTGGGTGA CCAGGTTGGT TCGACTGACA TAATCGGACC ACCAAGTTCA 7041 .......... .......... .......... .......... .......... .......... 539 GAGGAATTGG CTTCACCATC TAATGGTGAA ATATAGTTCA ACTATAACAA GATGTGATGT 6981 .......... .......... .......... .......... .......... .......... 539 TCCGTGGAGA TTGGTATACC CCTTGCTTTA CTGTAATAAA ACCTTTTTCA ACTTTAGTTT 6921 .......... .......... .......... .......... .......... .......... 539 GCTTGTCGAC TGAAGAACGT GAACATAAGT CTGTTATGAA TATTCATGAC CTTTTGTTGG 6861 .......... .......... .......... .......... .......... .......... 539 TATGTTATGT TAAGTTTTCT TTATTATTTA CTATGTCCTC TTTACTTCCT GTATTGTTCT 6801 .......... .......... .......... .......... .......... .......... 539 GTTAGCTGCT CCGGAATCTA AGACTGGGCA CGGGACTGAA CCGAGATGGT TTGATCGGGA 6741 .......... .......... .......... .......... .......... .......... 539 TGTTGACCGG TATCGGGATG AATCGAACCG AAATTATCGG GACGAAAACT CGGTTCCGTC 6681 .......... .......... .......... .......... .......... .......... 539 TTGTCCCACT ATATACCGGG ATAGAATTGG ATTGGACCGG ACGATACATT TAGCTATTTT 6621 .......... .......... .......... .......... .......... .......... 539 AAACAATAAT TTTTTTTTGA ATTACGAAAA TATGAATGTT TTTTTTTTTT TTTTTTTAAA 6561 .......... .......... .......... .......... .......... .......... 539 GTTTAATAGT TTATAAAGTA TTTAAGTTTT GAAATTTATA ATGTTTATTT GTAATTTTAT 6501 .......... .......... .......... .......... .......... .......... 539 TTTAAAGATA TTTTTCTAAA AATTTTACTT AAAAGTATGT GTATATTCTA CCCTTCCTAG 6441 .......... .......... .......... .......... .......... .......... 539 ATCCTGTTTT GGGATCATAA TGGGTTTTTT GTTGTGTTAT TTGCGTGATT GTTTTACATT 6381 .......... .......... .......... .......... .......... .......... 539 ATACCATTTT TGTCATGCCT TTTGCCACCT CCTCAAATCA TGCGTCTAAT CTCCCGTTTG 6321 .......... .......... .......... .......... .......... .......... 539 GTCATAAATT TTTCACTTCC CTTTTTTTCA AAAATATTTT AAATACACTG TTTCTGAAAA 6261 .......... .......... .......... .......... .......... .......... 539 TGAATATTTT TTCAAGTTTC AAAAATTAGT TTACGAGTAG TTTTTCAAGC TTTTGAAGTC 6201 .......... .......... .......... .......... .......... .......... 539 CTAGACTCGC AAAACTTCAA CATAAAATGC ATATCCAACC ATAACTTCAT TCTCAAAAAA 6141 .......... .......... .......... .......... .......... .......... 539 TCATTTTTCA TTTCATCTTG AAAACCCCAT ATTTTTGCAT TACAAGAACA TTTTAAGCAG 6081 .......... .......... .......... .......... .......... .......... 539 CCTTATTTAG GGCTTACCTT CCACCTTATG CTTATGACTG GGCCGATCTA TCAAGCCCAG 6021 .......... .......... .......... .......... .......... .......... 539 TCTTCTCCGG CCCAACTCTT CGCGTCTGTG TTGAGCCAAT CATTTATCAG GTTAACTAAA 5961 .......... .......... .......... .......... .......... .......... 539 AATCTATTAG AATTAACTTA AAAGTTAATT AAATCTGTTT TTTAATCGTT TTTAACTTTT 5901 .......... .......... .......... .......... .......... .......... 539 AGGAGTATTC GTCAAATTTA ATAATAAGAT GTGTGTGTAT ATATATATAT TGGATGGGAA 5841 .......... .......... .......... .......... .......... .......... 539 TTTAATATTG GAGTTTGGAA TCCCATTTGC AAGCCATGTT CACTTCATGG CTTTACATTT 5781 .......... .......... .......... .......... .......... .......... 539 CTTTGGATTT ATAGACTATA TAATATAATA CCATTAGTAT ATGATTTTTT CCCCAAAAGT 5721 .......... .......... .......... .......... .......... .......... 539 CTTAATATTA GCCTGTCAGT GAGATGGTCG GGCTTCCTCG TAAAAGACAT TTTTTCTTCA 5661 .......... .......... .......... .......... .......... .......... 539 TATAAAAATA TAATCCCATA AAAAAAAACA AATGAGAAAA ATATATTTTC TTCTTCTTTC 5601 .......... .......... .......... .......... .......... .......... 539 AACAGACAAA CTAAATTATA TTACATGAAG ATTCAATGTT ACAAGAATTT AACATGTTAT 5541 .......... .......... .......... .......... .......... .......... 539 CATTCACAAT AAACTTTCCA AATTTTGAGT TGCCTTGAAA TATCATTAAA AATTATGATG 5481 .......... .......... .......... .......... .......... .......... 539 GAATGAATAT AATCTCTTTG CTTTAAATTA ATTTTGAGCG TCGGCTAAAA AATATATAGT 5421 .......... .......... .......... .......... .......... .......... 539 CATCTAATAT ATCTCACACA ATACATATTT AAATTAATTA TTATACAAAT ATCAAATATA 5361 .......... .......... .......... .......... .......... .......... 539 GAATAAGAGA GCGAAAAATC AAATAAAAAG TAAAAATACA ACTTGAATAT CCTAAGTTTA 5301 .......... .......... .......... .......... .......... .......... 539 TAGGATAAAA TGCGACTTGT TAGCCTAATG CATAAGACAA CAATTCTATA GATTTTACTC 5241 .......... .......... .......... .......... .......... .......... 539 CCTTATTTGG TTAAGTTATC TCTAACTAAT AAGTATTCTC TTTTAACTCA AATAACTTAG 5181 .......... .......... .......... .......... .......... .......... 539 ATTTTTTTTA CTATGGATCT AAAAATGAAG TATTTTTATG TTTAGTAATA GTTCAATTCT 5121 .......... .......... .......... .......... .......... .......... 539 AAAATGTTTA TTTTATCTTT AATAGAATGA TTAATAGTTA CACAAATATT ACTAACTATT 5061 .......... .......... .......... .......... .......... .......... 539 TTTAGATCAC AAAATTTTAA ATATTTTTTT TAAAAAAATT CATATCAAAT CAAACTAGAT 5001 .......... .......... .......... .......... .......... .......... 539 CGAAAAAAAG TTAATTTTCT TACATTATGG ATTATGAAAC TTAAGTGAGT ACCAAAAATC 4941 .......... .......... .......... .......... .......... .......... 539 AAATCCAAAA ACACAAACAT GTAAATGACT CTTACTTCTT CTTTTCTCTA CGATGGTTTT 4881 .......... .......... .......... .......... .......... .......... 539 TTACTTCATG TTTGTTAATC ATATTAAAAT CTAATATAAA TACAAGTGAT AATCTCATTC 4821 .......... .......... .......... .......... .......... .......... 539 AATACAAAAA ATTTGTACTT TTTCTAGCTT TAGCACAATT TCTTACTTTA TTATTTCGTT 4761 .......... .......... .......... .......... .......... .......... 539 GTCAATTATA AAAGCGATGC TTTTAACTAA ATTACTCTTA TTAGCTTAAA CTCGAGACTT 4701 .......... .......... .......... .......... .......... .......... 539 TCGATAATGT GTACAAAATG TGAAATTATG TTGCATTTTT TATTGGGTAA CTTTCACATA 4641 | ||||||| || |||| ||||| .......... .......... .......... .........T TATTGGGCAA CTTTTACATA 560 TAGCAAGCAA AAAATTCATA TTTGTATGCT ATAGCAAACT TTGCATAATT GCGCTTCATA 4581 |||||| || |||||||||| ||||||||| ||||||||| |||||||||| ||||| |||| TAGCAAATAA AAAATTCATA TTTGTATGC- -TAGCAAACT TTGCATAATT GCGCTGCATA 618 ACAAACATAA AACTGTATAA TTTGCTATAT ATATACAATT ATATAATTCG CTGGCCTAAA 4521 ||||||||| |||||||||| ||||||||| |||||||||| ||||||||| |||||||||| GCAAACATAA AACTGTATAA TTTGCTATAC ATATACAATT GTATAATTCG CTGGCCTAAA 678 TT 4519 || TT 680 hqPGS_C06HBa0120H21.1-8-_SGN-U315405- (4661 4519) ******************************************************************************** EST sequence 3 +strand 1659 n (File: SGN-U322786+) 1 AAAATCCATT TTCCATCTCC GACAACACAC CCAGAACACA ATTATATTGA GCATATGGAT 61 ATTTAGCTAT TTTTGAAAAA ATGCTGAATT TTCTTATCAA AGCTTTTTCA ACGAACACAA 121 CAACAAATAC TCTTCCAATT TTTTTTTTAA CCATATCCTG TCGGCAATTG ATTGAGAAAT 181 CAGCAACCTG TTGATTTGAA TCAGAATTTT CTCCGGTCAA GCTTGTTGCA GGCACTGCCC 241 AGTTCAACTT GCAGTTTCAG CTTCTGGATT TCGGAATTCA ACTTGATCAG CATAATTGGA 301 CAAACTGTCA AGCTGCCTCT GTTGTCTGGA TAGTCTTGTT GCAGGCACTG CCTAGTTTAA 361 TTTGCAATTC CAACTTTTGG ATTCGCGGAG ATGTGGAGGG GTTGGATGTA GTAGTCACGA 421 GGGAGAGGTG GAGCTAGAGT AGGCTGAAAA AGTATTAAGG AAAAGTGATT AGACACAACA 481 TGATGCAGCT TCTGACCTTA GATAAGAGGG AGAGGGAATG GAGACTGCGG ACCGCCTAAA 541 AGGGCACGCC ATGATGGCAG GATTGATGCA GTCTTCTCTG TGTCATTTCC GTGGACAGTC 601 TTTTCAGGGA TGCAGGCAAT GGGAAACTGG GAATATTGGG ATCTCTCAAT TTGAGCTCTC 661 AACTCAGGGC ACATCGGGTC CTTGCATATG CCCAACAAAG GATGAACAAA GGTCAACTAC 721 GGAAGACGTT GTCTCAGCAA ACTTCCCTCA TCATCAGTCA GAAGAGTTAT CGCAAACTGC 781 ACAGCCGGCA CAACAGCAAT TGGGGTATCC GCAGATGCAT CAGAATCAGC AGCTCCAACA 841 GCAACAACAA CAACCTCAGC AGGTTTTGCA TCAGCAGCAA TCTTCTCCAG CAATGAATTC 901 CCCTGGTGGT CATAATTTAC TGAGTTTGAC TGGATCAGAA CCAGATGCCA CTGGATCTGG 961 GACAACGACT CCTGGGAGTA GTTCAAGCCA GGGGGCTGAA GCAAGCAATC AGTTTCTTGG 1021 GAAGAGAAAG ATTCAGGATT TAGTTTCACA GGTGGATCCT CAGGGAAGAG TTGATCCTGA 1081 AGTTGAACAG TTTCTTTTAG AGATCGCTGA TGACTTTATT GATTCGGTTA CTACATTTTC 1141 TTGCAATTTG GCGAAGCATC GGAAATCTTC GACTCTGGAG TCCAAAGATA TACTGTTACA 1201 TTTAGAGAAA GATTGGAATT TGACTGTCCC AGGTTTTTCA AGTGAGGATA AGAAACACTG 1261 CCCTGAACAT TCATCAGGTG ATCTCTGCAA AGAGCGTTTG GAAATGTCTT TGTGCTTGAC 1321 TCAGATACCT GATATGATGG AGGCTTCACC ACAAGCTGAA GCAAGTACAA GCAGCAGCGC 1381 GAAGGAGATC GTAAGTCCAG GGCTGGGTGA CCAGGTTGGT TCGACTGACA TAATCGGACC 1441 ACCAAGTTCA GAGGAATTGG CTTCACCATC TAATGGTGAA ATATAGTTCA ACTATAACAA 1501 GATGTGATGT TCCGTGGAGA TTGGTATACC CCTTGCTTTA CTGTAATAAA ACCTTTTTCA 1561 ACTTTAGTTT GCTTGTCGAC TGAAGAACGT GAACATAAGT CTGTTATGAA TATTCATGAC 1621 CTTTTGTTGG TATGTTATGT TAAGTTTTCT TTATTAAAA Predicted gene structure (within gDNA segment 16877 to 6195): Exon 1 16570 16292 ( 279 n); cDNA 1 273 ( 273 n); score: 0.950 Intron 1 16291 15001 (1291 n); Pd: 0.962 (s: 1.00), Pa: 0.936 (s: 1.00) Exon 2 15000 14889 ( 112 n); cDNA 274 385 ( 112 n); score: 1.000 Intron 2 14888 14791 ( 98 n); Pd: 0.907 (s: 1.00), Pa: 0.988 (s: 1.00) Exon 3 14790 14633 ( 158 n); cDNA 386 543 ( 158 n); score: 1.000 Intron 3 14632 13674 ( 959 n); Pd: 0.306 (s: 1.00), Pa: 0.999 (s: 1.00) Exon 4 13673 13456 ( 218 n); cDNA 544 761 ( 218 n); score: 1.000 Intron 4 13455 9828 (3628 n); Pd: 0.996 (s: 1.00), Pa: 0.181 (s: 1.00) Exon 5 9827 9538 ( 290 n); cDNA 762 1051 ( 290 n); score: 1.000 Intron 5 9537 9266 ( 272 n); Pd: 0.978 (s: 1.00), Pa: 0.994 (s: 1.00) Exon 6 9265 9191 ( 75 n); cDNA 1052 1126 ( 75 n); score: 1.000 Intron 6 9190 8741 ( 450 n); Pd: 0.964 (s: 1.00), Pa: 0.993 (s: 1.00) Exon 7 8740 8662 ( 79 n); cDNA 1127 1205 ( 79 n); score: 1.000 Intron 7 8661 7694 ( 968 n); Pd: 0.802 (s: 1.00), Pa: 0.952 (s: 1.00) Exon 8 7693 7629 ( 65 n); cDNA 1206 1270 ( 65 n); score: 1.000 Intron 8 7628 7283 ( 346 n); Pd: 0.986 (s: 1.00), Pa: 0.977 (s: 0) Exon 9 7282 7247 ( 36 n); cDNA 1271 1306 ( 36 n); score: 1.000 Intron 9 7246 7185 ( 62 n); Pd: 0.909 (s: 0), Pa: 0.861 (s: 1.00) Exon 10 7184 6835 ( 350 n); cDNA 1307 1656 ( 350 n); score: 1.000 MATCH C06HBa0120H21.1-8- SGN-U322786+ 0.991 1662 1.002 C PGS_C06HBa0120H21.1-8-_SGN-U322786+ (16570 16292,15000 14889,14790 14633,13673 13456,9827 9538,9265 9191,8740 8662,7693 7629,7282 7247,7184 6835) Alignment (genomic DNA sequence = upper lines): AAAATCCATT TTCCATCTCC CACAACACAC CCAGAACACA GTTATATTGA GCATATGGAT 16511 |||||||||| |||||||||| ||||||||| |||||||||| ||||||||| |||||||||| AAAATCCATT TTCCATCTCC GACAACACAC CCAGAACACA ATTATATTGA GCATATGGAT 60 ATTTAGCTAT TTTTTTTGAA AAATGCTGAA TTTTCTTATC CAAGCGTTTT CAATGAAACA 16451 |||||||||| |||| || |||||||||| |||||||||| |||| |||| ||| | |||| ATTTAGCTAT TTTT--GAAA AAATGCTGAA TTTTCTTATC AAAGCTTTTT CAACG-AACA 117 CAGCAACAAA TACTCTTCCA ATTTTATTTT TTTTAACCAT ATCCTGTCGG CAATTGATTG 16391 || ||||||| |||||||||| | || |||| |||||||||| |||||||||| |||||||||| CAACAACAAA TACTCTTCCA A--TT-TTTT TTTTAACCAT ATCCTGTCGG CAATTGATTG 174 AGAAATCAGC AACCTGTTGA TTTGAATCAG AATTTTCTCC GGTCAAGCTT GTTGCAGGCA 16331 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGAAATCAGC AACCTGTTGA TTTGAATCAG AATTTTCTCC GGTCAAGCTT GTTGCAGGCA 234 CTGCCCAGTT CAACTTGCAG TTTCAGCTTC TGGATTTCGG TAATGTCTTC TTACTTTCAC 16271 |||||||||| |||||||||| |||||||||| ||||||||| CTGCCCAGTT CAACTTGCAG TTTCAGCTTC TGGATTTCG. .......... .......... 273 TTCAGCACTT GTATGATTCC TCAGAAGGTG TAGAGTAACA AATTAAAATT TGAATTTCTA 16211 .......... .......... .......... .......... .......... .......... 273 AAAACTCTTC GCATATTTTA ATTAGCGGAG ATGTTGGATG TAGTAGTCAC AAGATAGAGG 16151 .......... .......... .......... .......... .......... .......... 273 TAGGCCGAAA AAGTATTAGA GTAAGTGATT AGACAAGAGA TATTGCAGGT TTTGACCTTA 16091 .......... .......... .......... .......... .......... .......... 273 GATAAGAGGG AGAGGGAATG GAGACCGCTT ATTAGGTAGA AGACAATTAG CTAGTGAAGT 16031 .......... .......... .......... .......... .......... .......... 273 GTTGTGCTTT CGAGGGTTCA AGCTTATGGG TATTCTTGTA ATTGTAGTTT CTGATTTGTG 15971 .......... .......... .......... .......... .......... .......... 273 TATTCATTTG TTTGTTGTGT TTTGGCTACT GCACTATTTT GTTGTTGTTC ATGTTCTTTT 15911 .......... .......... .......... .......... .......... .......... 273 CTTGTAGGCT TTGCATTGTT TTTCTTATTA GTTGTTATGT TTTTTGTCAC TGTTTTCTTC 15851 .......... .......... .......... .......... .......... .......... 273 TTCTTTATAC TTGAAATTGC TGCAAGTGAA CCGAGAGTCA ATCGGAAATA ACCTTTCTAC 15791 .......... .......... .......... .......... .......... .......... 273 CTCCACGAGG TAGTGGTAAG GCCTGCACAC TCTATCCTCC CCAGACATTG TGAGATTTCA 15731 .......... .......... .......... .......... .......... .......... 273 CTTGGTATGT TTTTGTTTTA AAGAAAAAAT ACATTCCATT TTTGTTTTCA TATGACAATA 15671 .......... .......... .......... .......... .......... .......... 273 TTTGGTTAGG AATATGGAGT AGTTCAACAT GTTTATTTGA CTTTAGATTA TATTAGTGGG 15611 .......... .......... .......... .......... .......... .......... 273 TACAGAAGAA CGCTCTAACG TGATTGTTGA GAAGTTAGTT TAATAGAAAA CATCACATCA 15551 .......... .......... .......... .......... .......... .......... 273 GAATTTACAA CAACATACCC AGTGTAATAT CCCACAAGTA GGGTCTGCGG AGGGTAGGAT 15491 .......... .......... .......... .......... .......... .......... 273 GTATGCATAT ATTACCCCTA CCTTTCTCGG GTAGGGAGCC TGTTTCTGAT AGAGACCCTC 15431 .......... .......... .......... .......... .......... .......... 273 GGCTCAAAAG GAATGTTATC AAAGCAGGAA TCACATCAAA ATTTGAAATC TCAAAAGTTC 15371 .......... .......... .......... .......... .......... .......... 273 ACTGGATTTA GCTGTATTGA AATAGAACAG AAACATTTCG CAATGTCCTC AAATAGGACT 15311 .......... .......... .......... .......... .......... .......... 273 TTCGTCATAT GAATCGCAAC TGACAGTAAC CGATTATGTG ACAAATTTTT GTCCTAGGCA 15251 .......... .......... .......... .......... .......... .......... 273 CACCATGATG GAAGGATTGA TGCAGTCTTC TTTGTGTCAT CGAGGGAATT TCCATGGACA 15191 .......... .......... .......... .......... .......... .......... 273 GTGTCAGGAA TGTAGCCAAT GGGAAACTGG GAATAATGGG ATCTCTTAAT TTCTTTCTTT 15131 .......... .......... .......... .......... .......... .......... 273 TGGAGAGCTT TTAAAGTTCT TATCAAAGCT TTTTCAACAA AATACAACTA TATAGACTCT 15071 .......... .......... .......... .......... .......... .......... 273 TTGAAAAGAT TTTCTCTGTC GGCAATTGAT TGAGAAAAGT TTGGAATTAT AACTCTTTGA 15011 .......... .......... .......... .......... .......... .......... 273 TTTGAATCAG GAATTCAACT TGATCAGCAT AATTGGACAA ACTGTCAAGC TGCCTCTGTT 14951 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| .......... GAATTCAACT TGATCAGCAT AATTGGACAA ACTGTCAAGC TGCCTCTGTT 323 GTCTGGATAG TCTTGTTGCA GGCACTGCCT AGTTTAATTT GCAATTCCAA CTTTTGGATT 14891 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTCTGGATAG TCTTGTTGCA GGCACTGCCT AGTTTAATTT GCAATTCCAA CTTTTGGATT 383 CGGTAATGAA TACTTACTTT CACTTCAGCA CTCGTATGAT TCCTATGTAC AGTAACAAAT 14831 || CG........ .......... .......... .......... .......... .......... 385 TAAAATTTGA ATTTCTAAAG TCTCTTCTTA TATTTGTCAG CGGAGATGTG GAGGGGTTGG 14771 |||||||||| |||||||||| .......... .......... .......... .......... CGGAGATGTG GAGGGGTTGG 405 ATGTAGTAGT CACGAGGGAG AGGTGGAGCT AGAGTAGGCT GAAAAAGTAT TAAGGAAAAG 14711 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATGTAGTAGT CACGAGGGAG AGGTGGAGCT AGAGTAGGCT GAAAAAGTAT TAAGGAAAAG 465 TGATTAGACA CAACATGATG CAGCTTCTGA CCTTAGATAA GAGGGAGAGG GAATGGAGAC 14651 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGATTAGACA CAACATGATG CAGCTTCTGA CCTTAGATAA GAGGGAGAGG GAATGGAGAC 525 TGCGGACCGC CTAAAAGGGT AGAAGATTAT AGCTAGTGAA CTGTTGTGCT TTCGAATTTC 14591 |||||||||| |||||||| TGCGGACCGC CTAAAAGG.. .......... .......... .......... .......... 543 GATGTTTCAA GCTTATTGGT ATTCTTATAG TTGTAGTTTT TGATATGTAT ATTCATTTGC 14531 .......... .......... .......... .......... .......... .......... 543 TTATTGTGGT TCGCCTACTG CACTATTTTG TTGTTGTTCA TGTTCTTTTC TTTGTTGGCT 14471 .......... .......... .......... .......... .......... .......... 543 TTGCACTGCT TTTCTAATTA GTTGTTATTT TTTTGGTCAC TGTTTTCTTC TTTATACCTG 14411 .......... .......... .......... .......... .......... .......... 543 AAATTGCTGC ACTTGAGCCA TGAGTCAACC GGAAACAACC TCTCTACCTC CACGAGGTAA 14351 .......... .......... .......... .......... .......... .......... 543 TGGTAAAGCC TACATACACT CTACCCTCCC TAGACGTCAC TTGTGGGATT TCACTTGGTA 14291 .......... .......... .......... .......... .......... .......... 543 TTTTGTAGTG TTTGTTGTTG TTGTTCTTCT TCTCATATTC ACTTCTAAGG CACTGTTTGG 14231 .......... .......... .......... .......... .......... .......... 543 ATAGAAAATG CTTTTTTTTT ATTTATGTTT TAAAGAAAAA ATACATTCCA TATTCGTGTT 14171 .......... .......... .......... .......... .......... .......... 543 CATATGACAA TATTAGGAAT GGAGTAGTTC AACATGTTAA TTTGACTTTA GATTATATTA 14111 .......... .......... .......... .......... .......... .......... 543 GTGGCAATAG ATTAAAGCTC TAACGTGATT GTTGAAAAGT TAGTTTAACA GAGAAAAAAT 14051 .......... .......... .......... .......... .......... .......... 543 CACATCAGAA TTTACAACAA CATACCCATT GTAATATCAC ACAAGTAGGG TCTGGGGAGG 13991 .......... .......... .......... .......... .......... .......... 543 TAGAATGTAC AACTGTGGAT GAAATGTTGT GCTAAGAGTA ACCTGAGTAT CAAGACCTTT 13931 .......... .......... .......... .......... .......... .......... 543 CTGAAATGAC TTCCAGAAAT GAGAGGTAAA CTAAGATACA TTTCCCCTAC CTTGGACGCT 13871 .......... .......... .......... .......... .......... .......... 543 GTTTTCGATA GAGACCCTTG GCTAAAAAGG AATGTTATCA AAGCAGGAAT CACATCAAAA 13811 .......... .......... .......... .......... .......... .......... 543 TTTGAAATCT CAAAAGTCCA GTGGATTAAG CTATATTGAA ATAGAACGGA TCATTTTGCA 13751 .......... .......... .......... .......... .......... .......... 543 ATGTCTCCAA ATAGGAATTT TGTCATATAA ATTGCAATTG AGACAGTAAT TGATTATGTA 13691 .......... .......... .......... .......... .......... .......... 543 ACAATTTTTT GTTTTAGGCA CGCCATGATG GCAGGATTGA TGCAGTCTTC TCTGTGTCAT 13631 ||| |||||||||| |||||||||| |||||||||| |||||||||| .......... .......GCA CGCCATGATG GCAGGATTGA TGCAGTCTTC TCTGTGTCAT 586 TTCCGTGGAC AGTCTTTTCA GGGATGCAGG CAATGGGAAA CTGGGAATAT TGGGATCTCT 13571 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCCGTGGAC AGTCTTTTCA GGGATGCAGG CAATGGGAAA CTGGGAATAT TGGGATCTCT 646 CAATTTGAGC TCTCAACTCA GGGCACATCG GGTCCTTGCA TATGCCCAAC AAAGGATGAA 13511 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAATTTGAGC TCTCAACTCA GGGCACATCG GGTCCTTGCA TATGCCCAAC AAAGGATGAA 706 CAAAGGTCAA CTACGGAAGA CGTTGTCTCA GCAAACTTCC CTCATCATCA GTCAGGTGAC 13451 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||| CAAAGGTCAA CTACGGAAGA CGTTGTCTCA GCAAACTTCC CTCATCATCA GTCAG..... 761 AAATTTTGAT ATCTCCAGCT TTATATGTAG TTGACTATAA AATCTGAGTT GTCATTTTTT 13391 .......... .......... .......... .......... .......... .......... 761 CTGCTGATCT TTTCACCACA CTGTGGCCGG CATAGATATG ACTTCCTTAG GCTATTTAGT 13331 .......... .......... .......... .......... .......... .......... 761 GTATTTGTTT TCATATTTGA GTTGAGGTAG CACCATCCAA AGACATTGTC ATAGCTTTGT 13271 .......... .......... .......... .......... .......... .......... 761 TTTACAACTT AATCAGGTCA TGAATTTAAT CTGTGGTTAA AGCCATCAAA CAGAAGAAAA 13211 .......... .......... .......... .......... .......... .......... 761 TTCATTGATG TTTTGTTTCT TTGGAAATTT CCTATCACAT GAGGTATTTT TCGTTGAGAA 13151 .......... .......... .......... .......... .......... .......... 761 GAGTCCTGCA CGAACTGAAC TAGATATAGC AGGTATTGTG TCTTAAAATC AGCTTGGAAT 13091 .......... .......... .......... .......... .......... .......... 761 TACTTCAATC ATTGGCCTCC TCCTTTCAAC TCGGGATGCT TAATGCTACC TTTTTTTTTA 13031 .......... .......... .......... .......... .......... .......... 761 TAAAGGTAAA GTTGTATTCT TCTGCATTAA AGGTATGCTG GCTACCTCCA AAAAGTTACA 12971 .......... .......... .......... .......... .......... .......... 761 AGCTATTTCC ATAATGACAT AGATCCTAGG AAATCTACCA ATTTGTTCTA CTTCTTCTAT 12911 .......... .......... .......... .......... .......... .......... 761 ACTATCTTCT TTACACATAA AAACATGTTA ATGCAATTTT CTTTAACCTT CGGAATAGTG 12851 .......... .......... .......... .......... .......... .......... 761 TTAGTAATAT TCTAATGCAC CTGAGATTTC TCTCTCCAAA TAGCCCACCA GACACAAATA 12791 .......... .......... .......... .......... .......... .......... 761 GGTACCCTAA TGCTGCTACC TGTTATTTAG TGCTACAGGC TAATTGCCAT AAAATCTCTT 12731 .......... .......... .......... .......... .......... .......... 761 ATTAGATCTT CTGTTGCAGG AGGTCAAGCA TTTTTCTATT GCACCCATTT ACTTTGCTAC 12671 .......... .......... .......... .......... .......... .......... 761 TTAAGTAGAT CTACTAAAGG AAAAAACCAT TCTTACTGTA ATTGTATTTT CTTTCTTCTT 12611 .......... .......... .......... .......... .......... .......... 761 GATTCAATGA TTAACAATTT GAAACTAGGT GTGGCTCTTT CTTAAAGATA ATAGGTTAAG 12551 .......... .......... .......... .......... .......... .......... 761 ATTTAATTGA AGTTAGCTAT TCCAAGATAT ATGGCACAAG TGCCTCCTAA TGTGGATGTA 12491 .......... .......... .......... .......... .......... .......... 761 AAGCTTTCTG GTAGGGGCAC GAAGCTTCTT ATAGTGGTTC TTAAAAAAGT ATTCTCACTG 12431 .......... .......... .......... .......... .......... .......... 761 TAATTTTATT GTCTTTCCTG TAGATTCAAT GATTAACTAC TTGAAACTAG GTTCTACTCT 12371 .......... .......... .......... .......... .......... .......... 761 CCTTAAAGAT CGTTGACTAA GATTTGATAG AAGTTAGCTA TTCCAAGAGA TGATGCGAGT 12311 .......... .......... .......... .......... .......... .......... 761 GCCTTTTAAC GGATGTGAAG TTTTTCGGTA GGGGCAAGAA GCTTCATAAA GCGGTTCTTA 12251 .......... .......... .......... .......... .......... .......... 761 AAAAACTATT CTCACTGTAA TTTTATTATC TTTCTTGTTG ATTCAATCAT AACAATTTGA 12191 .......... .......... .......... .......... .......... .......... 761 AACTAGGTCC TACTACTTAA AAATCATTGA TTAAGATTTG ATAGACGTTA GCTATTCCAA 12131 .......... .......... .......... .......... .......... .......... 761 GAGATATGGC GCGAGAGCCT CCTAATGGAT GTGAAGCTTT CTGGTAGGGA CAAGAAGCTT 12071 .......... .......... .......... .......... .......... .......... 761 TATGAAGTGG TTTCTGCTGG ATGAGATGAG TAGAAGTCTT GATTTCTTTT CTACAAAATA 12011 .......... .......... .......... .......... .......... .......... 761 AAAACATGAG TTTGCGCCCT CTTGCCCTAT CCTAGGTGGT CAGGTTTACT TGGAACCTAG 11951 .......... .......... .......... .......... .......... .......... 761 ACTGTCAGGA CTAGCTAGTT TCCCAGTGAA TTATTGAGGT GTGCCAGGCA GACCTGAACC 11891 .......... .......... .......... .......... .......... .......... 761 ATAAAAAGGA AGTTAGAGAA CTCTGTTTGG TGCTTGAAGT TAAAGAACTT GAGGCAGTTA 11831 .......... .......... .......... .......... .......... .......... 761 TTCATTTGGA TGTTCGATTA TCTTGTAGAT GAGTAGAATC ATCACCATCA ATAAGAAGAA 11771 .......... .......... .......... .......... .......... .......... 761 GAAGCTAGAG AATTCTTTTT GGTGTTTAAA GTTAAAAAAC TTGCTGCACA AAAATGCTTC 11711 .......... .......... .......... .......... .......... .......... 761 ATAAATTTGG CTTATCTTTT ATCATTTCAT GTTTGATTAT TTTGGGATCA ATAAAGATGA 11651 .......... .......... .......... .......... .......... .......... 761 TGGAAAAGTA TACCTCTTAC ACAATTAAAT ATTAAATACG TTTAAAATTC AAGTGTAATT 11591 .......... .......... .......... .......... .......... .......... 761 ACGTGTTCCT TCGTAGGCCT TTCCCAGATA TCAAACAATC ATCTCAGAAC TGGTACAGAC 11531 .......... .......... .......... .......... .......... .......... 761 AAGGGGAATC TGAAGAATAA AGTGCAATTA CGTGTCCTAA TATCTTAGGC CCTATTTTTT 11471 .......... .......... .......... .......... .......... .......... 761 GCATTAAGAT GAAGACATTT GAATTTGAAT GCACATTAGA GTGATTAAGG TTGTTTGTTT 11411 .......... .......... .......... .......... .......... .......... 761 TTTAACATCT GTATATGCAT AATTTATTTA TTTTTAAACA TAATTAATAT ACAATTCAAA 11351 .......... .......... .......... .......... .......... .......... 761 TAAAAAAGTT ATTTAAATAT TAGAAAAATA TATACAATAA AATCATTATT TGATAAAAAA 11291 .......... .......... .......... .......... .......... .......... 761 TGAATCATTT ATATTCGCTA GTAATGGTGT AACTGGATTG CAGTGGCGGT GGTGATGGTG 11231 .......... .......... .......... .......... .......... .......... 761 ATGGTTGGTA CTAGTGGCTA GTAATGGTGG TGGTGATAGT TATGTTGGTG AAGGTGGCTG 11171 .......... .......... .......... .......... .......... .......... 761 GTGTTGTAGT GGTTGTTGCG ATGGTGGTGG TGATGATGAT TGTTAGTAGG TTGATGATAG 11111 .......... .......... .......... .......... .......... .......... 761 TAGTTAGTGG TGGCGGAAGA ATGTGAGGTG GTTGATGATG GTGTTAGTGT TGGTGGTAGT 11051 .......... .......... .......... .......... .......... .......... 761 TGGCGGAGGT TTTAGTTGTA ATTGTGGAGG TGGTTGTTAG TAGGAATTGT CCAGAGTGGT 10991 .......... .......... .......... .......... .......... .......... 761 GGTAGTGACC GGTAGTGGTA GCGTCATTAG ATATAATTAA TGATGGAGGT GTTAGGTGTG 10931 .......... .......... .......... .......... .......... .......... 761 GTATCGACTA ACAATAGTGG TTGACAGTGG TAGTTGATGA CAAAGGTGGT TGGTGCTTGG 10871 .......... .......... .......... .......... .......... .......... 761 TGACGGTTCG TGTGGTTGAC CGTGGTTATG GTGATGGTAG AGGTGGGCGG TTGTCGACAA 10811 .......... .......... .......... .......... .......... .......... 761 AGTGGTACTT AATAATATTA CACTTTATTC AATGTTTTAA TGATTTAGAC CAATTCAGAA 10751 .......... .......... .......... .......... .......... .......... 761 CAAATAAATG TTTAAATCTT AACGTAAACA AATGCATTTA ATGCCTAAGG TTTGAAATAT 10691 .......... .......... .......... .......... .......... .......... 761 TCAGATTTAG ACCTCCATGA AGTGCAAAAT ATTGAACCCT AAGACTTACT TTGCATGTGC 10631 .......... .......... .......... .......... .......... .......... 761 TGATTATCCT TGGAAAATAA ATATGCCTGA GAACTATAAA GGGAAAAGTA CTGCCCAATA 10571 .......... .......... .......... .......... .......... .......... 761 CCTATGGGCC ATCTAGTTTA CAAAATATGT ATACAATGTA TATGTAGTGT ATGCTTAATA 10511 .......... .......... .......... .......... .......... .......... 761 CTAAATATTG TACATGCCTC TAGTGTATAT TCTGGTCATG TATGGTAAAT TAGATAGTCA 10451 .......... .......... .......... .......... .......... .......... 761 AATAGTATAT ATTGGTAACC ATCCCAACTG TAAATCCCCT TTTAATAAAA AATAGATCAA 10391 .......... .......... .......... .......... .......... .......... 761 CTGAAGGACA TCGCAATTGT TGCATTTAGG TCCATATGTA TACCTTTGAG TTTTCTTGTT 10331 .......... .......... .......... .......... .......... .......... 761 AAATCTCAAC CCTTAGTGTC ATGTAAGTTA TACATTAAAA TATAGCTCCG AACAAATATG 10271 .......... .......... .......... .......... .......... .......... 761 TTTGTGTGAT TGTGATTAAA GTTATGTCTG CATGCTTTTG TATATGTGTT GAAACTGAGA 10211 .......... .......... .......... .......... .......... .......... 761 GCATTTAAAG CTATCTTTAT GTGATTATCT TCACTTGTTC AATTTTGCTT TCTCCTCTTG 10151 .......... .......... .......... .......... .......... .......... 761 GGCAGGTTGC GTATAAACTT GTTAAATTAA ATGTCTAAGC GGTAGTCAAT GTCGCGTTCT 10091 .......... .......... .......... .......... .......... .......... 761 CTACTCCGTT TTTCGTAATT TGTTTTTTTC TGCTCACTTT AGTGTGTGTG AAACCTAGGT 10031 .......... .......... .......... .......... .......... .......... 761 TAGCCTTTGA ACCAAAATGG GAAATAACAT TAAAATTCTG AATACTTCTT CACAACTTAA 9971 .......... .......... .......... .......... .......... .......... 761 GTGGTTGAAG CAAATGCCTA CCATTTCTTC TCCGAATTTC CCCTCTTACC CTCTTCAACA 9911 .......... .......... .......... .......... .......... .......... 761 GCAGAGGCAG CATCGAATGC TTATACCGAA ATAATTAGCT TCACTTCAAC ACTTGTGTCA 9851 .......... .......... .......... .......... .......... .......... 761 AAAATCAATG CTTTTGAACC CAGAAGAGTT ATCGCAAACT GCACAGCCGG CACAACAGCA 9791 ||||||| |||||||||| |||||||||| |||||||||| .......... .......... ...AAGAGTT ATCGCAAACT GCACAGCCGG CACAACAGCA 798 ATTGGGGTAT CCGCAGATGC ATCAGAATCA GCAGCTCCAA CAGCAACAAC AACAACCTCA 9731 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTGGGGTAT CCGCAGATGC ATCAGAATCA GCAGCTCCAA CAGCAACAAC AACAACCTCA 858 GCAGGTTTTG CATCAGCAGC AATCTTCTCC AGCAATGAAT TCCCCTGGTG GTCATAATTT 9671 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCAGGTTTTG CATCAGCAGC AATCTTCTCC AGCAATGAAT TCCCCTGGTG GTCATAATTT 918 ACTGAGTTTG ACTGGATCAG AACCAGATGC CACTGGATCT GGGACAACGA CTCCTGGGAG 9611 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACTGAGTTTG ACTGGATCAG AACCAGATGC CACTGGATCT GGGACAACGA CTCCTGGGAG 978 TAGTTCAAGC CAGGGGGCTG AAGCAAGCAA TCAGTTTCTT GGGAAGAGAA AGATTCAGGA 9551 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TAGTTCAAGC CAGGGGGCTG AAGCAAGCAA TCAGTTTCTT GGGAAGAGAA AGATTCAGGA 1038 TTTAGTTTCA CAGGTTTTCC CTTAACCTTC TTAATCTCCA AAGCTTTCTG TTTTCCTTGG 9491 |||||||||| ||| TTTAGTTTCA CAG....... .......... .......... .......... .......... 1051 TGGTTACTCA AGCCAACGTT GGAATCAATT AATTTTTATT TATTTTCACA ATCAATTAAA 9431 .......... .......... .......... .......... .......... .......... 1051 AAAAATTGGT CCACAAAGTA TGTGAAAAGG CTGTTTTCGG CAGACCCTCG TTCAAGGGAA 9371 .......... .......... .......... .......... .......... .......... 1051 AAAAATAGCA TACTTGTAAA AACAGTTAAT TTCTATACAT AAGTTTTGTG GTTATTAACA 9311 .......... .......... .......... .......... .......... .......... 1051 CTGCAACTTT TTAACATCTT TCCACATATT ATGATAATGT TCCAGGTGGA TCCTCAGGGA 9251 ||||| |||||||||| .......... .......... .......... .......... .....GTGGA TCCTCAGGGA 1066 AGAGTTGATC CTGAAGTTGA ACAGTTTCTT TTAGAGATCG CTGATGACTT TATTGATTCG 9191 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGAGTTGATC CTGAAGTTGA ACAGTTTCTT TTAGAGATCG CTGATGACTT TATTGATTCG 1126 GTGAGGCATA GAAATGATTT TTTCTTTGTT ATCGTTATCT TTTTTGTTAG GATTGCGTCA 9131 .......... .......... .......... .......... .......... .......... 1126 GGAGGTTGGG AGACTGCGGG TTTCATGTTT TGCATGCTGC CTTTCTCATT CATTCACGGT 9071 .......... .......... .......... .......... .......... .......... 1126 TCAAGTTCTA AGATTGTTAG CTTTGATAAC TGTGGTTGAA TACATTGACA TAGCCAGTAA 9011 .......... .......... .......... .......... .......... .......... 1126 AGAGGATTTT TTCTTGAAGG GTCATCGTAT GCTGATTCTG GACTCTTCCT TTCACACTTA 8951 .......... .......... .......... .......... .......... .......... 1126 CCCTTCACTC AAATGTAAAC TAAAAATTTT AAAAATAATA ATAAAAGACC CCCCTATTTT 8891 .......... .......... .......... .......... .......... .......... 1126 CCCATTTTTG AGTTGTAATG CCAAAAAAAA TATCAAAGGC ATGACAAGGA ACCTTTTTTA 8831 .......... .......... .......... .......... .......... .......... 1126 AGAAGAGTTT CCTTTTGAAA TACTTTCTTG AGAAACATTG ATTGTTTGTC ATTTTCTTTG 8771 .......... .......... .......... .......... .......... .......... 1126 CTAAGTTTCA TGTTCCATTC CTCTTTGTAG GTTACTACAT TTTCTTGCAA TTTGGCGAAG 8711 |||||||||| |||||||||| |||||||||| .......... .......... .......... GTTACTACAT TTTCTTGCAA TTTGGCGAAG 1156 CATCGGAAAT CTTCGACTCT GGAGTCCAAA GATATACTGT TACATTTAGG TTTGTGCCAG 8651 |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| CATCGGAAAT CTTCGACTCT GGAGTCCAAA GATATACTGT TACATTTAG. .......... 1205 CAACTGCAAA ATATGCAGAG TTGTTTTACT TTGTTTTGTT TTTACAGAAA AAACAAGCTA 8591 .......... .......... .......... .......... .......... .......... 1205 TTTATCATTA TTTTAGAATA TAAAAGGAGT CATTTCATAC TTCATCAAAG ATGATAGAAT 8531 .......... .......... .......... .......... .......... .......... 1205 TATAGAAATA AAAGGTCCTA GTTGTTGCAT ATATTTTCAT AATAAGTTCA TTTTTATTGC 8471 .......... .......... .......... .......... .......... .......... 1205 TTTCCGTTAG TGAGTGTGTT TTTTACATGA CTATATGGTG AGATCCTCTG CATTTAGCCT 8411 .......... .......... .......... .......... .......... .......... 1205 ATACACAGGT TTTCTATCTG GAATGTAATT AGAGTTACAA AATTTGACAC ATACTCCTAT 8351 .......... .......... .......... .......... .......... .......... 1205 TGACTCAGAC TGGGACCAAA CTTCGTTTTC ATGTCAGTAC AGTGTACCAA CAACCATCAG 8291 .......... .......... .......... .......... .......... .......... 1205 TCTCCACAGT TTAATGTCTT TAGCTCTTTG AGCTTTTTAG CTACTTATGC GTTCAGCCAA 8231 .......... .......... .......... .......... .......... .......... 1205 CTAGTAAAAT TTGTATTCCA CCACCTTATA ATGTATGCTT CAAGAGTTGC TGGTGAGTAT 8171 .......... .......... .......... .......... .......... .......... 1205 ATGATGAGAA CAGTTTGAAT TGCTAAGCTT TCTCTTTAGT TGTCAGGAGT GGCTTTGAAG 8111 .......... .......... .......... .......... .......... .......... 1205 TGAGGTGTGA CCTGAGTCCT GACTTAGTTA AATTGTTGAG TATCTGAATA GTCCTAGTTG 8051 .......... .......... .......... .......... .......... .......... 1205 CATCTGAGAG AGTGATTGAG TCACACTGTC ATTGTATACA GTATATATAA TGGTTTACCT 7991 .......... .......... .......... .......... .......... .......... 1205 TTTCGTGAAT AGATTTTGAC TTCCTCATTT GTTTAGAACA TGCACAATGC AACTCTATCT 7931 .......... .......... .......... .......... .......... .......... 1205 TATTCCAACA CATCAATACC TGGTAAAATT TTGTTAGGCA ACAGCTTTCT GCTTTGCCTA 7871 .......... .......... .......... .......... .......... .......... 1205 ATGCAGTTGA TTTGCTATTT TTGGCGGAAA GGCTGCTTCA TGATCTCATC AATGCAGTTG 7811 .......... .......... .......... .......... .......... .......... 1205 TTCCTGGGAA TGTTTACATT CGGTCTATCA CTGCTTAAGA TGTTTTATAA CAGAATTTGA 7751 .......... .......... .......... .......... .......... .......... 1205 TTTAGAAAGA ACAAAAAATA TGGTATTAGT CTAAGTTAAC TATGATGTTC TTTACAGAGA 7691 ||| .......... .......... .......... .......... .......... .......AGA 1208 AAGATTGGAA TTTGACTGTC CCAGGTTTTT CAAGTGAGGA TAAGAAACAC TGCCCTGAAC 7631 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAGATTGGAA TTTGACTGTC CCAGGTTTTT CAAGTGAGGA TAAGAAACAC TGCCCTGAAC 1268 ATGTAAGCTC TTCTTCCTCT CTAACTTTCA TGGGTGGTAT ACTGGTCTAT CTTATATCTA 7571 || AT........ .......... .......... .......... .......... .......... 1270 CAAATAATTG AGGAGTTCTG TAAGTTGTAA CCAATAGTTA ATTATGATCC ATGGCCTCTG 7511 .......... .......... .......... .......... .......... .......... 1270 AATACTGGAA GTACTCGATA ACATATTTCT TGAAAAAGGA AAAGAAAGGC TCAATAACTT 7451 .......... .......... .......... .......... .......... .......... 1270 AGCTTTTTGC CCAAGTCTTG TTTATTATAA ATTTAAGTTT CTGATTACAG TGCACTCATC 7391 .......... .......... .......... .......... .......... .......... 1270 GCGTTATAAG TACCTCTTGA TCTCTTTCTG GTCAATGATT GTTGTAACTT TAAGCTGAGT 7331 .......... .......... .......... .......... .......... .......... 1270 GGAAATTCAG GCTGTTTTAT TGACAATATG TTTCTAAACT TTGTGCAGTC ATCAGGTGAT 7271 || |||||||||| .......... .......... .......... .......... ........TC ATCAGGTGAT 1282 CTCTGCAAAG AGCGTTTGGA AATGGTGAGT TGAGAGTTGT TTTGTGTTAA TACTACCTGC 7211 |||||||||| |||||||||| |||| CTCTGCAAAG AGCGTTTGGA AATG...... .......... .......... .......... 1306 ATTTATTTTA GGTGTATGAT TCATAGTCTT TGTGCTTGAC TCAGATACCT GATATGATGG 7151 |||| |||||||||| |||||||||| |||||||||| .......... .......... ......TCTT TGTGCTTGAC TCAGATACCT GATATGATGG 1340 AGGCTTCACC ACAAGCTGAA GCAAGTACAA GCAGCAGCGC GAAGGAGATC GTAAGTCCAG 7091 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGGCTTCACC ACAAGCTGAA GCAAGTACAA GCAGCAGCGC GAAGGAGATC GTAAGTCCAG 1400 GGCTGGGTGA CCAGGTTGGT TCGACTGACA TAATCGGACC ACCAAGTTCA GAGGAATTGG 7031 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGCTGGGTGA CCAGGTTGGT TCGACTGACA TAATCGGACC ACCAAGTTCA GAGGAATTGG 1460 CTTCACCATC TAATGGTGAA ATATAGTTCA ACTATAACAA GATGTGATGT TCCGTGGAGA 6971 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTCACCATC TAATGGTGAA ATATAGTTCA ACTATAACAA GATGTGATGT TCCGTGGAGA 1520 TTGGTATACC CCTTGCTTTA CTGTAATAAA ACCTTTTTCA ACTTTAGTTT GCTTGTCGAC 6911 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGGTATACC CCTTGCTTTA CTGTAATAAA ACCTTTTTCA ACTTTAGTTT GCTTGTCGAC 1580 TGAAGAACGT GAACATAAGT CTGTTATGAA TATTCATGAC CTTTTGTTGG TATGTTATGT 6851 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGAAGAACGT GAACATAAGT CTGTTATGAA TATTCATGAC CTTTTGTTGG TATGTTATGT 1640 TAAGTTTTCT TTATTA 6835 |||||||||| |||||| TAAGTTTTCT TTATTA 1656 hqPGS_C06HBa0120H21.1-8-_SGN-U322786+ (16570 16292,15000 14889,14790 14633,13673 13456,9827 9538,9265 9191,8740 8662,7693 7629,7282 7247,7184 6835) ******************************************************************************** EST sequence 4 +strand 896 n (File: SGN-U343351+) 1 AAAGCTGGAG CTCCCCGCGG TGGCGGCCGC TCTAGAACTA GTGGATCCCC CGGGCTGCAG 61 GAATTCGGCA CGAGGGTGGT TATGGTGATG GTAGAGGTGG GCGGTTGTCG ACAAAGTGGT 121 ACTTAATAAT ATTACACTTT ATTCAATGTT TTAATGATTT AGACCAATTC AGAACAAATA 181 AATGTTTAAA TCTTAACGTA AACAAATGCA TTTAATGCCT AAGGTTTGAA ATATTCAGAT 241 TTAGACCTCC ATGAAGTGCA AAATATTGAA CCCTAAGACT TACTTTGCAT GTGCTGATTA 301 TCCTTGGAAA ATAAATATGC CTGAGAACTA TAAAGGGAAA AGTACTGCCC AATACCTATG 361 GGCCATCTAG TTTACAAAAT ATGTATACAA TGTATATGTA GTGTATGCTT AATACTAAAT 421 ATTGTACATG CCTCTAGTGT ATATTCTGGT CATGTATGGT AAATTAGATA GTCAAATAGT 481 ATATATTGGT AACCATCCCA ACTGTAAATC CCCTTTTAAT AAAAAATAGA TCAACTGAAG 541 GACATCGCAA TTGTTGCATT TAGGTCCATA TGTATACCTT TGAGTTTTCT TGTTAAATCT 601 CAACCCTTAG TGTCATGTAA GTTATACATT AAAATATAGC TCCGAACAAA TATGTTTGNT 661 GTGATTGTGA TTAAAGTTAT GTCTGCATGC TTTTGTATAT GTGTTGAAAC TGAGAGCATT 721 TAAAGCTATC TTTATGTGAA TATCTTCACT TGGTCAATTT TGCTTTCTCC TTTGGGGCAG 781 GTTGCGTATA AACTTGGTAA ATTAATTGTC TAAGCGGTAN TCAATGTCGC GTTCTCTACT 841 CCGTTTTTCG AATTTGGTTT TTTTCTGCTC CCTTTAAGGG GGGGGAAACT AGGGTA Predicted gene structure (within gDNA segment 12199 to 9006): Exon 1 10849 10029 ( 821 n); cDNA 76 896 ( 821 n); score: 0.975 MATCH C06HBa0120H21.1-8- SGN-U343351+ 0.975 821 0.916 C PGS_C06HBa0120H21.1-8-_SGN-U343351+ (10849 10029) Alignment (genomic DNA sequence = upper lines): GTGGTTATGG TGATGGTAGA GGTGGGCGGT TGTCGACAAA GTGGTACTTA ATAATATTAC 10790 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTGGTTATGG TGATGGTAGA GGTGGGCGGT TGTCGACAAA GTGGTACTTA ATAATATTAC 135 ACTTTATTCA ATGTTTTAAT GATTTAGACC AATTCAGAAC AAATAAATGT TTAAATCTTA 10730 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACTTTATTCA ATGTTTTAAT GATTTAGACC AATTCAGAAC AAATAAATGT TTAAATCTTA 195 ACGTAAACAA ATGCATTTAA TGCCTAAGGT TTGAAATATT CAGATTTAGA CCTCCATGAA 10670 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACGTAAACAA ATGCATTTAA TGCCTAAGGT TTGAAATATT CAGATTTAGA CCTCCATGAA 255 GTGCAAAATA TTGAACCCTA AGACTTACTT TGCATGTGCT GATTATCCTT GGAAAATAAA 10610 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTGCAAAATA TTGAACCCTA AGACTTACTT TGCATGTGCT GATTATCCTT GGAAAATAAA 315 TATGCCTGAG AACTATAAAG GGAAAAGTAC TGCCCAATAC CTATGGGCCA TCTAGTTTAC 10550 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TATGCCTGAG AACTATAAAG GGAAAAGTAC TGCCCAATAC CTATGGGCCA TCTAGTTTAC 375 AAAATATGTA TACAATGTAT ATGTAGTGTA TGCTTAATAC TAAATATTGT ACATGCCTCT 10490 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAATATGTA TACAATGTAT ATGTAGTGTA TGCTTAATAC TAAATATTGT ACATGCCTCT 435 AGTGTATATT CTGGTCATGT ATGGTAAATT AGATAGTCAA ATAGTATATA TTGGTAACCA 10430 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGTGTATATT CTGGTCATGT ATGGTAAATT AGATAGTCAA ATAGTATATA TTGGTAACCA 495 TCCCAACTGT AAATCCCCTT TTAATAAAAA ATAGATCAAC TGAAGGACAT CGCAATTGTT 10370 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCCCAACTGT AAATCCCCTT TTAATAAAAA ATAGATCAAC TGAAGGACAT CGCAATTGTT 555 GCATTTAGGT CCATATGTAT ACCTTTGAGT TTTCTTGTTA AATCTCAACC CTTAGTGTCA 10310 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCATTTAGGT CCATATGTAT ACCTTTGAGT TTTCTTGTTA AATCTCAACC CTTAGTGTCA 615 TGTAAGTTAT ACATTAAAAT ATAGCTCCGA ACAAATATGT TTG-TGTGAT TGTGATTAAA 10251 |||||||||| |||||||||| |||||||||| |||||||||| ||| |||||| |||||||||| TGTAAGTTAT ACATTAAAAT ATAGCTCCGA ACAAATATGT TTGNTGTGAT TGTGATTAAA 675 GTTATGTCTG CATGCTTTTG TATATGTGTT GAAACTGAGA GCATTTAAAG CTATCTTTAT 10191 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTTATGTCTG CATGCTTTTG TATATGTGTT GAAACTGAGA GCATTTAAAG CTATCTTTAT 735 GTGATTATCT TCACTTGTTC AATTTTGCTT TCTCCTCTTG GGCAGGTTGC GTATAAACTT 10131 |||| ||||| ||||||| || |||||||||| |||||| | | |||||||||| |||||||||| GTGAATATCT TCACTTGGTC AATTTTGCTT TCTCCTTTGG GGCAGGTTGC GTATAAACTT 795 GTTAAATTAA ATGTCTAAGC GGTAGTCAAT GTCGCGTTCT CTACTCCGTT TTTCGTAATT 10071 | |||||||| ||||||||| |||| ||||| |||||||||| |||||||||| ||||| | || GGTAAATTAA TTGTCTAAGC GGTANTCAAT GTCGCGTTCT CTACTCCGTT TTTCGAATTT 855 TGTTTTTTTC TGCTCACTTT AGTGTGTGTG AAACCTAGGT TA 10029 ||||||||| ||||| |||| | | | | | ||| ||||| || GGTTTTTTTC TGCTCCCTTT AAGGGGGGGG AAA-CTAGGG TA 896 hqPGS_C06HBa0120H21.1-8-_SGN-U343351+ (10849 10029) Total number of EST alignments reported: 7 ________________________________________________________________________________ Predicted gene locations (2) in segment 1 to 16877: PGL 1 (- strand): 4665 1581 AGS-1 (3356 2837,2443 2385,1879 1581) SCR (e 1.000 d 0.937 a 0.984,e 1.000 d 0.000 a 0.994,e 0.993) Exon 1 3356 2837 ( 520 n); score: 1.000 Intron 1 2836 2444 ( 393 n); Pd: 0.937 Pa: 0.984 Exon 2 2443 2385 ( 59 n); score: 1.000 Intron 2 2384 1880 ( 505 n); Pd: 0.000 Pa: 0.994 Exon 3 1879 1581 ( 299 n); score: 0.993 PGS (3356 2837,2443 2385,1879 1581) SGN-U325074+ 3-phase translation of AGS-1 (-strand): . . . . . . 3356 TCTCAGCAAAATGTGAGTCCTCCTTCTCCGCCGCCGCCGCCGTCTAACCAATCTCCTCCT S Q Q N V S P P S P P P P P S N Q S P P L S K M - V L L L R R R R R L T N L L L S A K C E S S F S A A A A V - P I S S . . . . . . 3296 CCACCGCCTCCACCACCATCACCAGGGCCTCCTCCTCCTCCATCCCAGCAAAAATACCAT P P P P P P S P G P P P P P S Q Q K Y H H R L H H H H Q G L L L L H P S K N T I S T A S T T I T R A S S S S I P A K I P . . . . . . 3236 TCTCCACCACCAACTAAATCTGTGAATTCTGCGACCACCTCGGAGAGTAAACACTCTAAT S P P P T K S V N S A T T S E S K H S N L H H Q L N L - I L R P P R R V N T L I F S T T N - I C E F C D H L G E - T L - . . . . . . 3176 CATGATAAAAAACACCATAACTCTTACGGGAAATCGCATCAACCAGCAAAGAAAAAGAAG H D K K H H N S Y G K S H Q P A K K K K M I K N T I T L T G N R I N Q Q R K R S S - - K T P - L L R E I A S T S K E K E . . . . . . 3116 CCAAATTTGGGGAAGAAACTGGGGTTAGTGTTTGTGGGTGTTGCTGGGATGTTGCAGGTG P N L G K K L G L V F V G V A G M L Q V Q I W G R N W G - C L W V L L G C C R C A K F G E E T G V S V C G C C W D V A G . . . . . . 3056 TGTGTGGTGGCGTTCTTGCTAATAAAGAGAAGACAATTGTTAAAGGCTGGTAGTAGATTT C V V A F L L I K R R Q L L K A G S R F V W W R S C - - R E D N C - R L V V D F V C G G V L A N K E K T I V K G W - - I . . . . . . 2996 TGAATGAACATTTGAATATGGATGTATATCAGTTAGTCTAATTAATTCAGAATTTTACGA - M N I - I W M Y I S - S N - F R I L R E - T F E Y G C I S V S L I N S E F Y E L N E H L N M D V Y Q L V - L I Q N F T . . . . . . 2936 GACCGCGAAAGGCAATGAACACGGCATTGATGAATTAGAAGCATTGGGTTCATCTGACAT D R E R Q - T R H - - I R S I G F I - H T A K G N E H G I D E L E A L G S S D M R P R K A M N T A L M N - K H W V H L T . . . . : . . 2876 GTAGATTTCTGCATTTGCATTGGTGGCTACTGTAAATTTG : AGCATGTTGAGAAATACAGA V D F C I C I G G Y C K F : E H V E K Y R - I S A F A L V A T V N L : S M L R N T D C R F L H L H W W L L - I - : A C - E I Q . . . . : . . 2423 TGAATGTGCAGACTTCGGATGCTTTGTTTCTAGTTGCTC : GTCTCAAAGTTGTAAGAATGA - M C R L R M L C F - L L : V S K L - E - E C A D F G C F V S S C S : S Q S C K N D M N V Q T S D A L F L V A : R L K V V R M . . . . . . 1858 TATCTTGCTTCGTTTGTCAATTTTAAGGAACTGTACTGGATTTGTTCTGGAGAACATAGT Y L A S F V N F K E L Y W I C S G E H S I L L R L S I L R N C T G F V L E N I V I S C F V C Q F - G T V L D L F W R T - . . . . . . 1798 GTTACGTCTTCTCTGTCCGGTGAGTAAAAAATGTAGGGAAACGAGCATACTGTTGATGAA V T S S L S G E - K M - G N E H T V D E L R L L C P V S K K C R E T S I L L M K C Y V F S V R - V K N V G K R A Y C - - . . . . . . 1738 GGCATGACCTGTCATCGTCGCAACTGAAATATGATTTCTAATGTAGAGGTTTAACACTGT G M T C H R R N - N M I S N V E V - H C A - P V I V A T E I - F L M - R F N T V R H D L S S S Q L K Y D F - C R G L T L . . . . . . 1678 AAAACTCTTTTAACTGTTGGTATAGTCTAACTGTTGCATCTGATATGAAAACTTTCTAAT K T L L T V G I V - L L H L I - K L S N K L F - L L V - S N C C I - Y E N F L I - N S F N C W Y S L T V A S D M K T F - . . . . 1618 ACGCTGGCAAAATAATATCTACCTTGATTCTTGAATAA T L A K - Y L P - F L N R W Q N N I Y L D S - I Y A G K I I S T L I L E - Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0120H21.1-8-_PGL-1_AGS-1_PPS_1 (3356 2994) (frame '1'; 360 bp, 120 residues) 1 SQQNVSPPSP PPPPSNQSPP PPPPPPSPGP PPPPSQQKYH SPPPTKSVNS ATTSESKHSN 61 HDKKHHNSYG KSHQPAKKKK PNLGKKLGLV FVGVAGMLQV CVVAFLLIKR RQLLKAGSRF 121 - >C06HBa0120H21.1-8-_PGL-1_AGS-1_PPS_2 (2872 2837,2443 2385,1879 1732) (frame '2'; 240 bp, 80 residues) 1 ISAFALVATV NLSMLRNTDE CADFGCFVSS CSSQSCKNDI LLRLSILRNC TGFVLENIVL 61 RLLCPVSKKC RETSILLMKA - AGS-2 (3878 3442) SCR (e 0.817) Exon 1 3878 3442 ( 437 n); score: 0.817 PGS (3878 3442) SGN-U339975+ 3-phase translation of AGS-2 (-strand): . . . . . . 3878 AAACAGGGGTTAAGGTTAACCCAATTTACGACCCGATTCTATCCCATATCCGACCCGTTT K Q G L R L T Q F T T R F Y P I S D P F N R G - G - P N L R P D S I P Y P T R F T G V K V N P I Y D P I L S H I R P V . . . . . . 3818 CCCTGCGTAATCACCTCGACACCAATCTATGATGCCCTAGTTCTTGGGGGCATCATCATC P C V I T S T P I Y D A L V L G G I I I P A - S P R H Q S M M P - F L G A S S S S L R N H L D T N L - C P S S W G H H H . . . . . . 3758 TTCAACAGTTCCTCTTTGATTCTTACAAATTTCTGGAAGAAATCAACATTTTGGGATCGT F N S S S L I L T N F W K K S T F W D R S T V P L - F L Q I S G R N Q H F G I V L Q Q F L F D S Y K F L E E I N I L G S . . . . . . 3698 TTTTTGGGGACGCATTTGGTTGATTTCTCGAGATTAGAGAGACAATTGGTGGGTGCCATT F L G T H L V D F S R L E R Q L V G A I F W G R I W L I S R D - R D N W W V P F F F G D A F G - F L E I R E T I G G C H . . . . . . 3638 TATTTTGCAGATTGATGAACCCTAGCTATTTATGCCTGATTGTAAATTTGGTGAAGATTT Y F A D - - T L A I Y A - L - I W - R F I L Q I D E P - L F M P D C K F G E D F L F C R L M N P S Y L C L I V N L V K I . . . . . . 3578 TGCTTAATTGAAGGAAAAACCCAGATGTCAGGAGGATTTGGTGAGTTGATTTGGATGAAA C L I E G K T Q M S G G F G E L I W M K A - L K E K P R C Q E D L V S - F G - K L L N - R K N P D V R R I W - V D L D E . . . . . . 3518 GGGGTTCAAATGAAGGTAAATGGGTTTTGAAGATTCATATAGTTGACACCAAATTAGCTT G V Q M K V N G F - R F I - L T P N - L G F K - R - M G F E D S Y S - H Q I S F R G S N E G K W V L K I H I V D T K L A . . 3458 TGTATTGAGGTTTAACG C I E V - V L R F N L Y - G L T Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0120H21.1-8-_PGL-1_AGS-2_PPS_1 (3878 3624) (frame '1'; 252 bp, 84 residues) 1 KQGLRLTQFT TRFYPISDPF PCVITSTPIY DALVLGGIII FNSSSLILTN FWKKSTFWDR 61 FLGTHLVDFS RLERQLVGAI YFAD- 3-phase translation of AGS-2 (+strand): . . . . . . 3442 CGTTAAACCTCAATACAAAGCTAATTTGGTGTCAACTATATGAATCTTCAAAACCCATTT R - T S I Q S - F G V N Y M N L Q N P F V K P Q Y K A N L V S T I - I F K T H L L N L N T K L I W C Q L Y E S S K P I . . . . . . 3502 ACCTTCATTTGAACCCCTTTCATCCAAATCAACTCACCAAATCCTCCTGACATCTGGGTT T F I - T P F I Q I N S P N P P D I W V P S F E P L S S K S T H Q I L L T S G F Y L H L N P F H P N Q L T K S S - H L G . . . . . . 3562 TTTCCTTCAATTAAGCAAAATCTTCACCAAATTTACAATCAGGCATAAATAGCTAGGGTT F P S I K Q N L H Q I Y N Q A - I A R V F L Q L S K I F T K F T I R H K - L G F F S F N - A K S S P N L Q S G I N S - G . . . . . . 3622 CATCAATCTGCAAAATAAATGGCACCCACCAATTGTCTCTCTAATCTCGAGAAATCAACC H Q S A K - M A P T N C L S N L E K S T I N L Q N K W H P P I V S L I S R N Q P S S I C K I N G T H Q L S L - S R E I N . . . . . . 3682 AAATGCGTCCCCAAAAAACGATCCCAAAATGTTGATTTCTTCCAGAAATTTGTAAGAATC K C V P K K R S Q N V D F F Q K F V R I N A S P K N D P K M L I S S R N L - E S Q M R P Q K T I P K C - F L P E I C K N . . . . . . 3742 AAAGAGGAACTGTTGAAGATGATGATGCCCCCAAGAACTAGGGCATCATAGATTGGTGTC K E E L L K M M M P P R T R A S - I G V K R N C - R - - C P Q E L G H H R L V S Q R G T V E D D D A P K N - G I I D W C . . . . . . 3802 GAGGTGATTACGCAGGGAAACGGGTCGGATATGGGATAGAATCGGGTCGTAAATTGGGTT E V I T Q G N G S D M G - N R V V N W V R - L R R E T G R I W D R I G S - I G L R G D Y A G K R V G Y G I E S G R K L G . . 3862 AACCTTAACCCCTGTTT N L N P C T L T P V - P - P L F Maximal non-overlapping open reading frames (>= 64 codons): none AGS-3 (4661 4089) SCR (e 0.765) Exon 1 4661 4089 ( 573 n); score: 0.765 PGS (4574 4089) SGN-U315404+ PGS (4661 4519) SGN-U315405- 3-phase translation of AGS-3 (-strand): . . . . . . 4661 TTATTGGGTAACTTTCACATATAGCAAGCAAAAAATTCATATTTGTATGCTATAGCAAAC L L G N F H I - Q A K N S Y L Y A I A N Y W V T F T Y S K Q K I H I C M L - Q T I G - L S H I A S K K F I F V C Y S K . . . . . . 4601 TTTGCATAATTGCGCTTCATAACAAACATAAAACTGTATAATTTGCTATATATATACAAT F A - L R F I T N I K L Y N L L Y I Y N L H N C A S - Q T - N C I I C Y I Y T I L C I I A L H N K H K T V - F A I Y I Q . . . . . . 4541 TATATAATTCGCTGGCCTAAATTGTATACTTCGCTGGCCTATTTCGCTGCAATTGTATAA Y I I R W P K L Y T S L A Y F A A I V - I - F A G L N C I L R W P I S L Q L Y N L Y N S L A - I V Y F A G L F R C N C I . . . . . . 4481 TTTGTTTTGCATACAGTTGAATCGAATTAAAATGTACGTATATTGCATAATTATAAGTGT F V L H T V E S N - N V R I L H N Y K C L F C I Q L N R I K M Y V Y C I I I S V I C F A Y S - I E L K C T Y I A - L - V . . . . . . 4421 ATAGCAAGAAGATATATGTTTTTCTCGCTTTATATAAAAACAGAAACACAATATATACAC I A R R Y M F F S L Y I K T E T Q Y I H - Q E D I C F S R F I - K Q K H N I Y T Y S K K I Y V F L A L Y K N R N T I Y T . . . . . . 4361 TTCTGTTGTATAAAGCTAGAGAAAAGTGTATTTCACTGCAATTGTATAATTCGTTGGCCT F C C I K L E K S V F H C N C I I R W P S V V - S - R K V Y F T A I V - F V G L L L L Y K A R E K C I S L Q L Y N S L A . . . . . . 4301 TTTTCTCTGCAATATTTGAAGTAAAATGTTTGTAAATTGTATACTTAAGTGTATAACACG F S L Q Y L K - N V C K L Y T - V Y N T F L C N I - S K M F V N C I L K C I T R F F S A I F E V K C L - I V Y L S V - H . . . . . . 4241 AAGATATACATTTTTGCATGTGTATATACAATTTTCTCTCACTTTATACAAAACAGAAAT K I Y I F A C V Y T I F S H F I Q N R N R Y T F L H V Y I Q F S L T L Y K T E I E D I H F C M C I Y N F L S L Y T K Q K . . . . . . 4181 AGAATTATGCACTTCTGTGCATAAAGCGAGAGAGGCGAGCGAGAATGGAGAGTGGCGAGC R I M H F C A - S E R G E R E W R V A S E L C T S V H K A R E A S E N G E W R A - N Y A L L C I K R E R R A R M E S G E . . . . 4121 GAGATTTTTGAGAGAGAGACACTGACAAATGGA E I F E R E T L T N G R F L R E R H - Q M R D F - E R D T D K W Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-3 (+strand): . . . . . . 4089 TCCATTTGTCAGTGTCTCTCTCTCAAAAATCTCGCTCGCCACTCTCCATTCTCGCTCGCC S I C Q C L S L K N L A R H S P F S L A P F V S V S L S K I S L A T L H S R S P H L S V S L S Q K S R S P L S I L A R . . . . . . 4149 TCTCTCGCTTTATGCACAGAAGTGCATAATTCTATTTCTGTTTTGTATAAAGTGAGAGAA S L A L C T E V H N S I S V L Y K V R E L S L Y A Q K C I I L F L F C I K - E K L S R F M H R S A - F Y F C F V - S E R . . . . . . 4209 AATTGTATATACACATGCAAAAATGTATATCTTCGTGTTATACACTTAAGTATACAATTT N C I Y T C K N V Y L R V I H L S I Q F I V Y T H A K M Y I F V L Y T - V Y N L K L Y I H M Q K C I S S C Y T L K Y T I . . . . . . 4269 ACAAACATTTTACTTCAAATATTGCAGAGAAAAAGGCCAACGAATTATACAATTGCAGTG T N I L L Q I L Q R K R P T N Y T I A V Q T F Y F K Y C R E K G Q R I I Q L Q - Y K H F T S N I A E K K A N E L Y N C S . . . . . . 4329 AAATACACTTTTCTCTAGCTTTATACAACAGAAGTGTATATATTGTGTTTCTGTTTTTAT K Y T F L - L Y T T E V Y I L C F C F Y N T L F S S F I Q Q K C I Y C V S V F I E I H F S L A L Y N R S V Y I V F L F L . . . . . . 4389 ATAAAGCGAGAAAAACATATATCTTCTTGCTATACACTTATAATTATGCAATATACGTAC I K R E K H I S S C Y T L I I M Q Y T Y - S E K N I Y L L A I H L - L C N I R T Y K A R K T Y I F L L Y T Y N Y A I Y V . . . . . . 4449 ATTTTAATTCGATTCAACTGTATGCAAAACAAATTATACAATTGCAGCGAAATAGGCCAG I L I R F N C M Q N K L Y N C S E I G Q F - F D S T V C K T N Y T I A A K - A S H F N S I Q L Y A K Q I I Q L Q R N R P . . . . . . 4509 CGAAGTATACAATTTAGGCCAGCGAATTATATAATTGTATATATATAGCAAATTATACAG R S I Q F R P A N Y I I V Y I - Q I I Q E V Y N L G Q R I I - L Y I Y S K L Y S A K Y T I - A S E L Y N C I Y I A N Y T . . . . . . 4569 TTTTATGTTTGTTATGAAGCGCAATTATGCAAAGTTTGCTATAGCATACAAATATGAATT F Y V C Y E A Q L C K V C Y S I Q I - I F M F V M K R N Y A K F A I A Y K Y E F V L C L L - S A I M Q S L L - H T N M N . . . . 4629 TTTTGCTTGCTATATGTGAAAGTTACCCAATAA F C L L Y V K V T Q - F A C Y M - K L P N F L L A I C E S Y P I Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0120H21.1-8+_PGL-1_AGS-3_PPS_1 (4199 4525) (frame '0'; 324 bp, 108 residues) 1 SERKLYIHMQ KCISSCYTLK YTIYKHFTSN IAEKKANELY NCSEIHFSLA LYNRSVYIVF 61 LFLYKARKTY IFLLYTYNYA IYVHFNSIQL YAKQIIQLQR NRPAKYTI- AGS-4 (4665 4575,4544 4472,4333 4275) SCR (e 0.890 d 0.000 a 0.000,e 0.767 d 0.000 a 0.000,e 0.831) Exon 1 4665 4575 ( 91 n); score: 0.890 Intron 1 4574 4545 ( 30 n); Pd: 0.000 Pa: 0.000 Exon 2 4544 4472 ( 73 n); score: 0.767 Intron 2 4471 4334 ( 138 n); Pd: 0.000 Pa: 0.000 Exon 3 4333 4275 ( 59 n); score: 0.831 PGS (4665 4575,4544 4472,4333 4275) SGN-U335137- 3-phase translation of AGS-4 (-strand): . . . . . . 4665 TTTTTTATTGGGTAACTTTCACATATAGCAAGCAAAAAATTCATATTTGTATGCTATAGC F F I G - L S H I A S K K F I F V C Y S F L L G N F H I - Q A K N S Y L Y A I A F Y W V T F T Y S K Q K I H I C M L - . . . . : . . 4605 AAACTTTGCATAATTGCGCTTCATAACAAAC : AATTATATAATTCGCTGGCCTAAATTGTA K L C I I A L H N K : Q L Y N S L A - I V N F A - L R F I T N : N Y I I R W P K L Y Q T L H N C A S - Q T : I I - F A G L N C . . . . . : . 4515 TACTTCGCTGGCCTATTTCGCTGCAATTGTATAATTTGTTTTGC : TATTTCACTGCAATTG Y F A G L F R C N C I I C F A : I S L Q L T S L A Y F A A I V - F V L : L F H C N C I L R W P I S L Q L Y N L F C : Y F T A I . . . . . 4317 TATAATTCGTTGGCCTTTTTCTCTGCAATATTTGAAGTAAAAT Y N S L A F F S A I F E V K I I R W P F S L Q Y L K - N V - F V G L F L C N I - S K Maximal non-overlapping open reading frames (>= 64 codons): none PGL 2 (- strand): 16570 6835 AGS-1 (16570 16292,15000 14889,14790 14633,13673 13456,9827 9538,9265 9191,8740 8662,7693 7629,7282 7247,7184 6835) SCR (e 0.950 d 0.962 a 0.936,e 1.000 d 0.907 a 0.988,e 1.000 d 0.306 a 0.999,e 1.000 d 0.996 a 0.181,e 1.000 d 0.978 a 0.994,e 1.000 d 0.964 a 0.993,e 1.000 d 0.802 a 0.952,e 1.000 d 0.986 a 0.977,e 1.000 d 0.909 a 0.861,e 1.000) Exon 1 16570 16292 ( 279 n); score: 0.950 Intron 1 16291 15001 (1291 n); Pd: 0.962 Pa: 0.936 Exon 2 15000 14889 ( 112 n); score: 1.000 Intron 2 14888 14791 ( 98 n); Pd: 0.907 Pa: 0.988 Exon 3 14790 14633 ( 158 n); score: 1.000 Intron 3 14632 13674 ( 959 n); Pd: 0.306 Pa: 0.999 Exon 4 13673 13456 ( 218 n); score: 1.000 Intron 4 13455 9828 (3628 n); Pd: 0.996 Pa: 0.181 Exon 5 9827 9538 ( 290 n); score: 1.000 Intron 5 9537 9266 ( 272 n); Pd: 0.978 Pa: 0.994 Exon 6 9265 9191 ( 75 n); score: 1.000 Intron 6 9190 8741 ( 450 n); Pd: 0.964 Pa: 0.993 Exon 7 8740 8662 ( 79 n); score: 1.000 Intron 7 8661 7694 ( 968 n); Pd: 0.802 Pa: 0.952 Exon 8 7693 7629 ( 65 n); score: 1.000 Intron 8 7628 7283 ( 346 n); Pd: 0.986 Pa: 0.977 Exon 9 7282 7247 ( 36 n); score: 1.000 Intron 9 7246 7185 ( 62 n); Pd: 0.909 Pa: 0.861 Exon 10 7184 6835 ( 350 n); score: 1.000 PGS (16570 16292,15000 14889,14790 14633,13673 13456,9827 9538,9265 9191,8740 8662,7693 7629,7282 7247,7184 6835) SGN-U322786+ 3-phase translation of AGS-1 (-strand): . . . . . . 16570 AAAATCCATTTTCCATCTCCCACAACACACCCAGAACACAGTTATATTGAGCATATGGAT K I H F P S P T T H P E H S Y I E H M D K S I F H L P Q H T Q N T V I L S I W I N P F S I S H N T P R T Q L Y - A Y G . . . . . . 16510 ATTTAGCTATTTTTTTTGAAAAATGCTGAATTTTCTTATCCAAGCGTTTTCAATGAAACA I - L F F L K N A E F S Y P S V F N E T F S Y F F - K M L N F L I Q A F S M K H Y L A I F F E K C - I F L S K R F Q - N . . . . . . 16450 CAGCAACAAATACTCTTCCAATTTTATTTTTTTTAACCATATCCTGTCGGCAATTGATTG Q Q Q I L F Q F Y F F - P Y P V G N - L S N K Y S S N F I F F N H I L S A I D - T A T N T L P I L F F L T I S C R Q L I . . . . . . 16390 AGAAATCAGCAACCTGTTGATTTGAATCAGAATTTTCTCCGGTCAAGCTTGTTGCAGGCA R N Q Q P V D L N Q N F L R S S L L Q A E I S N L L I - I R I F S G Q A C C R H E K S A T C - F E S E F S P V K L V A G . . . . : . . 16330 CTGCCCAGTTCAACTTGCAGTTTCAGCTTCTGGATTTCG : GAATTCAACTTGATCAGCATA L P S S T C S F S F W I S : E F N L I S I C P V Q L A V S A S G F R : N S T - S A - T A Q F N L Q F Q L L D F : G I Q L D Q H . . . . . . 14979 ATTGGACAAACTGTCAAGCTGCCTCTGTTGTCTGGATAGTCTTGTTGCAGGCACTGCCTA I G Q T V K L P L L S G - S C C R H C L L D K L S S C L C C L D S L V A G T A - N W T N C Q A A S V V W I V L L Q A L P . . . . : . . 14919 GTTTAATTTGCAATTCCAACTTTTGGATTCG : CGGAGATGTGGAGGGGTTGGATGTAGTAG V - F A I P T F G F : A E M W R G W M - - F N L Q F Q L L D S : R R C G G V G C S S S L I C N S N F W I R : G D V E G L D V V . . . . . . 14761 TCACGAGGGAGAGGTGGAGCTAGAGTAGGCTGAAAAAGTATTAAGGAAAAGTGATTAGAC S R G R G G A R V G - K S I K E K - L D H E G E V E L E - A E K V L R K S D - T V T R E R W S - S R L K K Y - G K V I R . . . . . . 14701 ACAACATGATGCAGCTTCTGACCTTAGATAAGAGGGAGAGGGAATGGAGACTGCGGACCG T T - C S F - P - I R G R G N G D C G P Q H D A A S D L R - E G E G M E T A D R H N M M Q L L T L D K R E R E W R L R T . : . . . . . 14641 CCTAAAAGG : GCACGCCATGATGGCAGGATTGATGCAGTCTTCTCTGTGTCATTTCCGTGG P K R : A R H D G R I D A V F S V S F P W L K G : H A M M A G L M Q S S L C H F R G A - K : G T P - W Q D - C S L L C V I S V . . . . . . 13622 ACAGTCTTTTCAGGGATGCAGGCAATGGGAAACTGGGAATATTGGGATCTCTCAATTTGA T V F S G M Q A M G N W E Y W D L S I - Q S F Q G C R Q W E T G N I G I S Q F E D S L F R D A G N G K L G I L G S L N L . . . . . . 13562 GCTCTCAACTCAGGGCACATCGGGTCCTTGCATATGCCCAACAAAGGATGAACAAAGGTC A L N S G H I G S L H M P N K G - T K V L S T Q G T S G P C I C P T K D E Q R S S S Q L R A H R V L A Y A Q Q R M N K G . . . . . : . 13502 AACTACGGAAGACGTTGTCTCAGCAAACTTCCCTCATCATCAGTCAG : AAGAGTTATCGCA N Y G R R C L S K L P S S S V R : R V I A T T E D V V S A N F P H H Q S : E E L S Q Q L R K T L S Q Q T S L I I S Q : K S Y R . . . . . . 9814 AACTGCACAGCCGGCACAACAGCAATTGGGGTATCCGCAGATGCATCAGAATCAGCAGCT N C T A G T T A I G V S A D A S E S A A T A Q P A Q Q Q L G Y P Q M H Q N Q Q L K L H S R H N S N W G I R R C I R I S S . . . . . . 9754 CCAACAGCAACAACAACAACCTCAGCAGGTTTTGCATCAGCAGCAATCTTCTCCAGCAAT P T A T T T T S A G F A S A A I F S S N Q Q Q Q Q Q P Q Q V L H Q Q Q S S P A M S N S N N N N L S R F C I S S N L L Q Q . . . . . . 9694 GAATTCCCCTGGTGGTCATAATTTACTGAGTTTGACTGGATCAGAACCAGATGCCACTGG E F P W W S - F T E F D W I R T R C H W N S P G G H N L L S L T G S E P D A T G - I P L V V I I Y - V - L D Q N Q M P L . . . . . . 9634 ATCTGGGACAACGACTCCTGGGAGTAGTTCAAGCCAGGGGGCTGAAGCAAGCAATCAGTT I W D N D S W E - F K P G G - S K Q S V S G T T T P G S S S S Q G A E A S N Q F D L G Q R L L G V V Q A R G L K Q A I S . . . . : . . 9574 TCTTGGGAAGAGAAAGATTCAGGATTTAGTTTCACAG : GTGGATCCTCAGGGAAGAGTTGA S W E E K D S G F S F T : G G S S G K S - L G K R K I Q D L V S Q : V D P Q G R V D F L G R E R F R I - F H R : W I L R E E L . . . . . . : 9242 TCCTGAAGTTGAACAGTTTCTTTTAGAGATCGCTGATGACTTTATTGATTCG : GTTACTAC S - S - T V S F R D R - - L Y - F : G Y Y P E V E Q F L L E I A D D F I D S : V T T I L K L N S F F - R S L M T L L I R : L L . . . . . . 8732 ATTTTCTTGCAATTTGGCGAAGCATCGGAAATCTTCGACTCTGGAGTCCAAAGATATACT I F L Q F G E A S E I F D S G V Q R Y T F S C N L A K H R K S S T L E S K D I L H F L A I W R S I G N L R L W S P K I Y . . : . . . . 8672 GTTACATTTAG : AGAAAGATTGGAATTTGACTGTCCCAGGTTTTTCAAGTGAGGATAAGAA V T F R : E R L E F D C P R F F K - G - E L H L : E K D W N L T V P G F S S E D K K C Y I - : R K I G I - L S Q V F Q V R I R . . : . . . . : 7644 ACACTGCCCTGAACAT : TCATCAGGTGATCTCTGCAAAGAGCGTTTGGAAATG : TCTTTGTG T L P - T : F I R - S L Q R A F G N : V F V H C P E H : S S G D L C K E R L E M : S L C N T A L N I : H Q V I S A K S V W K C : L C . . . . . . 7176 CTTGACTCAGATACCTGATATGATGGAGGCTTCACCACAAGCTGAAGCAAGTACAAGCAG L D S D T - Y D G G F T T S - S K Y K Q L T Q I P D M M E A S P Q A E A S T S S A - L R Y L I - W R L H H K L K Q V Q A . . . . . . 7116 CAGCGCGAAGGAGATCGTAAGTCCAGGGCTGGGTGACCAGGTTGGTTCGACTGACATAAT Q R E G D R K S R A G - P G W F D - H N S A K E I V S P G L G D Q V G S T D I I A A R R R S - V Q G W V T R L V R L T - . . . . . . 7056 CGGACCACCAAGTTCAGAGGAATTGGCTTCACCATCTAATGGTGAAATATAGTTCAACTA R T T K F R G I G F T I - W - N I V Q L G P P S S E E L A S P S N G E I - F N Y S D H Q V Q R N W L H H L M V K Y S S T . . . . . . 6996 TAACAAGATGTGATGTTCCGTGGAGATTGGTATACCCCTTGCTTTACTGTAATAAAACCT - Q D V M F R G D W Y T P C F T V I K P N K M - C S V E I G I P L A L L - - N L I T R C D V P W R L V Y P L L Y C N K T . . . . . . 6936 TTTTCAACTTTAGTTTGCTTGTCGACTGAAGAACGTGAACATAAGTCTGTTATGAATATT F S T L V C L S T E E R E H K S V M N I F Q L - F A C R L K N V N I S L L - I F F F N F S L L V D - R T - T - V C Y E Y . . . . . 6876 CATGACCTTTTGTTGGTATGTTATGTTAAGTTTTCTTTATTA H D L L L V C Y V K F S L L M T F C W Y V M L S F L Y S - P F V G M L C - V F F I Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0120H21.1-8-_PGL-2_AGS-1_PPS_1 (14670 14633,13673 13456,9827 9538,9265 9191,8740 8662,7693 7629,7282 7247,7184 7005) (frame '2'; 978 bp, 326 residues) 1 EGEGMETADR LKGHAMMAGL MQSSLCHFRG QSFQGCRQWE TGNIGISQFE LSTQGTSGPC 61 ICPTKDEQRS TTEDVVSANF PHHQSEELSQ TAQPAQQQLG YPQMHQNQQL QQQQQQPQQV 121 LHQQQSSPAM NSPGGHNLLS LTGSEPDATG SGTTTPGSSS SQGAEASNQF LGKRKIQDLV 181 SQVDPQGRVD PEVEQFLLEI ADDFIDSVTT FSCNLAKHRK SSTLESKDIL LHLEKDWNLT 241 VPGFSSEDKK HCPEHSSGDL CKERLEMSLC LTQIPDMMEA SPQAEASTSS SAKEIVSPGL 301 GDQVGSTDII GPPSSEELAS PSNGEI- >C06HBa0120H21.1-8-_PGL-2_AGS-1_PPS_2 (16370 16292,15000 14889,14790 14739) (frame '0'; 240 bp, 80 residues) 1 FESEFSPVKL VAGTAQFNLQ FQLLDFGIQL DQHNWTNCQA ASVVWIVLLQ ALPSLICNSN 61 FWIRGDVEGL DVVVTRERWS - AGS-2 (10849 10029) SCR (e 0.975) Exon 1 10849 10029 ( 821 n); score: 0.975 PGS (10849 10029) SGN-U343351+ 3-phase translation of AGS-2 (-strand): . . . . . . 10849 GTGGTTATGGTGATGGTAGAGGTGGGCGGTTGTCGACAAAGTGGTACTTAATAATATTAC V V M V M V E V G G C R Q S G T - - Y Y W L W - W - R W A V V D K V V L N N I T G Y G D G R G G R L S T K W Y L I I L . . . . . . 10789 ACTTTATTCAATGTTTTAATGATTTAGACCAATTCAGAACAAATAAATGTTTAAATCTTA T L F N V L M I - T N S E Q I N V - I L L Y S M F - - F R P I Q N K - M F K S - H F I Q C F N D L D Q F R T N K C L N L . . . . . . 10729 ACGTAAACAAATGCATTTAATGCCTAAGGTTTGAAATATTCAGATTTAGACCTCCATGAA T - T N A F N A - G L K Y S D L D L H E R K Q M H L M P K V - N I Q I - T S M K N V N K C I - C L R F E I F R F R P P - . . . . . . 10669 GTGCAAAATATTGAACCCTAAGACTTACTTTGCATGTGCTGATTATCCTTGGAAAATAAA V Q N I E P - D L L C M C - L S L E N K C K I L N P K T Y F A C A D Y P W K I N S A K Y - T L R L T L H V L I I L G K - . . . . . . 10609 TATGCCTGAGAACTATAAAGGGAAAAGTACTGCCCAATACCTATGGGCCATCTAGTTTAC Y A - E L - R E K Y C P I P M G H L V Y M P E N Y K G K S T A Q Y L W A I - F T I C L R T I K G K V L P N T Y G P S S L . . . . . . 10549 AAAATATGTATACAATGTATATGTAGTGTATGCTTAATACTAAATATTGTACATGCCTCT K I C I Q C I C S V C L I L N I V H A S K Y V Y N V Y V V Y A - Y - I L Y M P L Q N M Y T M Y M - C M L N T K Y C T C L . . . . . . 10489 AGTGTATATTCTGGTCATGTATGGTAAATTAGATAGTCAAATAGTATATATTGGTAACCA S V Y S G H V W - I R - S N S I Y W - P V Y I L V M Y G K L D S Q I V Y I G N H - C I F W S C M V N - I V K - Y I L V T . . . . . . 10429 TCCCAACTGTAAATCCCCTTTTAATAAAAAATAGATCAACTGAAGGACATCGCAATTGTT S Q L - I P F - - K I D Q L K D I A I V P N C K S P F N K K - I N - R T S Q L L I P T V N P L L I K N R S T E G H R N C . . . . . . 10369 GCATTTAGGTCCATATGTATACCTTTGAGTTTTCTTGTTAAATCTCAACCCTTAGTGTCA A F R S I C I P L S F L V K S Q P L V S H L G P Y V Y L - V F L L N L N P - C H C I - V H M Y T F E F S C - I S T L S V . . . . . . 10309 TGTAAGTTATACATTAAAATATAGCTCCGAACAAATATGTTTGTGTGATTGTGATTAAAG C K L Y I K I - L R T N M F V - L - L K V S Y T L K Y S S E Q I C L C D C D - S M - V I H - N I A P N K Y V C V I V I K . . . . . . 10249 TTATGTCTGCATGCTTTTGTATATGTGTTGAAACTGAGAGCATTTAAAGCTATCTTTATG L C L H A F V Y V L K L R A F K A I F M Y V C M L L Y M C - N - E H L K L S L C V M S A C F C I C V E T E S I - S Y L Y . . . . . . 10189 TGATTATCTTCACTTGTTCAATTTTGCTTTCTCCTCTTGGGCAGGTTGCGTATAAACTTG - L S S L V Q F C F L L L G R L R I N L D Y L H L F N F A F S S W A G C V - T C V I I F T C S I L L S P L G Q V A Y K L . . . . . . 10129 TTAAATTAAATGTCTAAGCGGTAGTCAATGTCGCGTTCTCTACTCCGTTTTTCGTAATTT L N - M S K R - S M S R S L L R F S - F - I K C L S G S Q C R V L Y S V F R N L V K L N V - A V V N V A F S T P F F V I . . . . . 10069 GTTTTTTTCTGCTCACTTTAGTGTGTGTGAAACCTAGGTTA V F F C S L - C V - N L G F F S A H F S V C E T - V C F F L L T L V C V K P R L Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-2 (+strand): . . . . . . 10029 TAACCTAGGTTTCACACACACTAAAGTGAGCAGAAAAAAACAAATTACGAAAAACGGAGT - P R F H T H - S E Q K K T N Y E K R S N L G F T H T K V S R K K Q I T K N G V T - V S H T L K - A E K N K L R K T E . . . . . . 10089 AGAGAACGCGACATTGACTACCGCTTAGACATTTAATTTAACAAGTTTATACGCAACCTG R E R D I D Y R L D I - F N K F I R N L E N A T L T T A - T F N L T S L Y A T C - R T R H - L P L R H L I - Q V Y T Q P . . . . . . 10149 CCCAAGAGGAGAAAGCAAAATTGAACAAGTGAAGATAATCACATAAAGATAGCTTTAAAT P K R R K Q N - T S E D N H I K I A L N P R G E S K I E Q V K I I T - R - L - M A Q E E K A K L N K - R - S H K D S F K . . . . . . 10209 GCTCTCAGTTTCAACACATATACAAAAGCATGCAGACATAACTTTAATCACAATCACACA A L S F N T Y T K A C R H N F N H N H T L S V S T H I Q K H A D I T L I T I T Q C S Q F Q H I Y K S M Q T - L - S Q S H . . . . . . 10269 AACATATTTGTTCGGAGCTATATTTTAATGTATAACTTACATGACACTAAGGGTTGAGAT N I F V R S Y I L M Y N L H D T K G - D T Y L F G A I F - C I T Y M T L R V E I K H I C S E L Y F N V - L T - H - G L R . . . . . . 10329 TTAACAAGAAAACTCAAAGGTATACATATGGACCTAAATGCAACAATTGCGATGTCCTTC L T R K L K G I H M D L N A T I A M S F - Q E N S K V Y I W T - M Q Q L R C P S F N K K T Q R Y T Y G P K C N N C D V L . . . . . . 10389 AGTTGATCTATTTTTTATTAAAAGGGGATTTACAGTTGGGATGGTTACCAATATATACTA S - S I F Y - K G I Y S W D G Y Q Y I L V D L F F I K R G F T V G M V T N I Y Y Q L I Y F L L K G D L Q L G W L P I Y T . . . . . . 10449 TTTGACTATCTAATTTACCATACATGACCAGAATATACACTAGAGGCATGTACAATATTT F D Y L I Y H T - P E Y T L E A C T I F L T I - F T I H D Q N I H - R H V Q Y L I - L S N L P Y M T R I Y T R G M Y N I . . . . . . 10509 AGTATTAAGCATACACTACATATACATTGTATACATATTTTGTAAACTAGATGGCCCATA S I K H T L H I H C I H I L - T R W P I V L S I H Y I Y I V Y I F C K L D G P - - Y - A Y T T Y T L Y T Y F V N - M A H . . . . . . 10569 GGTATTGGGCAGTACTTTTCCCTTTATAGTTCTCAGGCATATTTATTTTCCAAGGATAAT G I G Q Y F S L Y S S Q A Y L F S K D N V L G S T F P F I V L R H I Y F P R I I R Y W A V L F P L - F S G I F I F Q G - . . . . . . 10629 CAGCACATGCAAAGTAAGTCTTAGGGTTCAATATTTTGCACTTCATGGAGGTCTAAATCT Q H M Q S K S - G S I F C T S W R S K S S T C K V S L R V Q Y F A L H G G L N L S A H A K - V L G F N I L H F M E V - I . . . . . . 10689 GAATATTTCAAACCTTAGGCATTAAATGCATTTGTTTACGTTAAGATTTAAACATTTATT E Y F K P - A L N A F V Y V K I - T F I N I S N L R H - M H L F T L R F K H L F - I F Q T L G I K C I C L R - D L N I Y . . . . . . 10749 TGTTCTGAATTGGTCTAAATCATTAAAACATTGAATAAAGTGTAATATTATTAAGTACCA C S E L V - I I K T L N K V - Y Y - V P V L N W S K S L K H - I K C N I I K Y H L F - I G L N H - N I E - S V I L L S T . . . . . 10809 CTTTGTCGACAACCGCCCACCTCTACCATCACCATAACCAC L C R Q P P T S T I T I T F V D N R P P L P S P - P T L S T T A H L Y H H H N H Maximal non-overlapping open reading frames (>= 64 codons): none ... finished at: Mon Aug 28 21:59:35 2006 ________________________________________________________________________________ Sequence 9: C06HBa0120H21.1-9, from 1 to 21437, both strands analyzed. ... started at: Mon Aug 28 21:59:35 2006 EST library file: /tmp/cxgn-bacpublish-resources-ZuEZzO/lycopersicum_combined_unigene_seqs; matching gDNA +strand ... ... found all matches, elapsed seconds = 2 ... matches indexed, elapsed seconds = 2 HitsTableSize = 7 EST library file: /tmp/cxgn-bacpublish-resources-ZuEZzO/lycopersicum_combined_unigene_seqs; matching gDNA -strand ... ... found all matches, elapsed seconds = 3 ... matches indexed, elapsed seconds = 3 HitsTableSize = 0 ******************************************************************************** EST sequence 6 +strand 2470 n (File: SGN-U321764+) 1 GATTTTCGTA ATTATTGATA CAGACAAAGG TGTTTAAACG GACATCGTAA TGGAGTGTGT 61 GTGAGAGAGA ACCCATTTTG AGGAATTCGG GGGCAATTTC AGTTTCTCGG TGGCGGAAGA 121 CGCGCAGAGA CTTCACCCTT TCAATTCCGT TGACACATAT ACAGAGGTGT CGTTTCTGAT 181 AATCAATTTC ACTTTCTCTT TGCTACGATC GCTACATTCT CTCTCTCTCT TTTGTGTAGA 241 CTGAACCCGC GCTGAACTGC ATTTCGCCTA TAAATTAATA TATTTCTTAG GAAGATGCAG 301 CAACCCGTGC AGCCAAGGTC TTCTGCCAAT GGATATGGCC GTCGTAAAGT TGATAGAGAA 361 ATGGGTACTA AGTTGGAGAA TAAAGCGCAA TCTGGAAAAA CTACTTCTCG TCAATTTACA 421 GGTAAAGGGG GAGCATATCA AAGCCTGTCA CATGATCGAC TAGTTTATTT CACTACCTGT 481 CTTGTTGGAC ATCAAGTGGA AGTACAAGTG ATGGACGGAT CAGTGTTTTC AGGGATACTT 541 CATGCGACAA ACGCTGAAAA AGATTTTGGT ATCATTCTGA AAATGGCGCA GTTGATAAAA 601 GATAGCTCTG AGGGGATGAA GAGTAGTTCT GAAACTTTTA GCAAGCCTCC ATTAAAGACT 661 TTGATAATAC CGGGTAAAGA GTTTGCTCAA GTTACAGCAA AGGGTGTGCC TACAACTCTA 721 GACGGTTTCA GAACAGAATT CATGCTGGAA CAGCAGCAGG AACTTTTGAC TGATTCATGC 781 ATTTCACAAT CTCGGCATAT TGAGGTAGAG CGGCAATTGG AACGCTGGGT ACCTGATGAT 841 GATGCTCCTG AATGTCCTGA ACTGGACAAT ATATTTGATG GCCATTGGAA TAGGGGCTGG 901 GATCAGTTTC AAGCCAATGA AACACTGTTT GGAGTAAAAA GCACATTTGA TGAGGACCTT 961 TATACGACAA AGCTTGAGAG AGGTCCTCAG ATGAGTGAGT TGGAAAAAGA AGCTCTAAGA 1021 ATAGCTAGAG AAATTGAGGG TGAGGATACA CGTGATCTTC ATCTAGCAGA GGAGAGAGGG 1081 ATCCAACTTC ATGAGAACCT AGAAGTGGAC GAGGAAACCA GATTTTCCGC AGTTGTTAGA 1141 GAGATTGATG ATAGCGGCTA TGACAACTGT GAGGACATCC TGTTGGATTC ACGTAATGAT 1201 GAGACATTTC AAGGTATATC TAGTGCTATG GGGAAGTCAT TTACTGACAT GGGCAGAAGG 1261 AAAATGAATG ATGGTGCACA AGTTTCATTA AGATCTTCCT TCATGGATGA AGTGCAATCT 1321 TCCAAGCTAA GTACCAGTAG GGATGTCTAC CAGACTTGTT ACGATGATCA TGCGAAACAG 1381 TCATCAGCTG AAGTTGTCCT TAAAGGTGGC TCTATCTTAA ACAGGGGTCG CAAAACTCTG 1441 TTTAGTGAGC ATGCTGGAGC AAGTTGGAAT AAGGAGGATA CAAGAAATCA AATGACGGAT 1501 GAAGTTGCTC AAACGTCAGT ATTGGAAGAT TCAATGTCTT CTTCAAGAAT GAAAATGGAG 1561 ACCTCTGATG GGGGTAGATT GTCTCCAGAC ATCTCTGCAT TGCATGTTCA TCCAGCGGAC 1621 CAGGATATGA TCACAAGTTC TTCTAGAGAG AAGTTTGAGG GTGCGGTGTC TTCCAAGATT 1681 CAAGGGGCTC CACAATCTGC TAATTCTCGT GTACGACCTA GTAGTTCTGT TCTTTCCGGT 1741 TCTGATGGAA CAGGTGCTGC CTCAACGTCA GCTGACAATG GATTATCACG AACCTCTTCT 1801 GTAAATTCAT TTTCGTCAGA AAAATCCACA TTGAATCCAC ATGCTAAGGA ATTTAAATTA 1861 AATCCTAATG CAAAGAGTTT CATGCCATTT CAATCACCTT TGAGACCTGC TTCTCCGGTG 1921 TCTGATAGTT CCTTCTATTA TCCAGCTGGT GTGGCTACTG TTCCCAATGT GCATGGCATG 1981 CCTGTTGGGG TAGGTCCTTC ATTTTCTCCA CATCAGCCTG TTATGTTTAA TCCACAAGCT 2041 ACACCTGTAC CACAACAATT TTTTCATCCA AATGGACCAC AGTATGGGCA GCAGATGATG 2101 ATTGGTCCCC CTCGGCAAGT AGTCTATATG CCGAATTACC CCGCTGAAAT GCGACGAGAC 2161 TACTAATCAG TTGGCAAACC ATATTGCGTG GTGGGTTGAA CCGATGGATG CTGACATGAG 2221 ATTTCATGGA TTGGTGGAGG AGGTTTAGCT GGTTGATGAA GGGGGATTCC AATGATTTGA 2281 TTAGAGCTTT TCCTTATACT GGGGTATCAG TAATTGTTAC TTTGTCATAA TCATTAGATT 2341 TGTTAACTTT CAGATTTACA GTCTTTCTTG AAGTTAACTG TGGTGTTTCC TTGGTATGCT 2401 GCTGTTGATA TTTCTTCTCT TTGATCTGTA TTCCTAATAT TGTATGTTTC TCGACAAAAA 2461 AAAAAAAAAA Predicted gene structure (within gDNA segment 1 to 4766): Exon 1 1721 2040 ( 320 n); cDNA 1529 1848 ( 320 n); score: 1.000 Intron 1 2041 2202 ( 162 n); Pd: 0.974 (s: 1.00), Pa: 0.984 (s: 1.00) Exon 2 2203 2343 ( 141 n); cDNA 1849 1989 ( 141 n); score: 1.000 Intron 2 2344 3076 ( 733 n); Pd: 0.997 (s: 1.00), Pa: 0.989 (s: 1.00) Exon 3 3077 3169 ( 93 n); cDNA 1990 2082 ( 93 n); score: 1.000 Intron 3 3170 3274 ( 105 n); Pd: 0.993 (s: 1.00), Pa: 0.924 (s: 1.00) Exon 4 3275 3337 ( 63 n); cDNA 2083 2145 ( 63 n); score: 1.000 Intron 4 3338 3725 ( 388 n); Pd: 0.995 (s: 1.00), Pa: 0.895 (s: 1.00) Exon 5 3726 4032 ( 307 n); cDNA 2146 2452 ( 307 n); score: 1.000 PPA cDNA 2454 2470 MATCH C06HBa0120H21.1-9+ SGN-U321764+ 1.000 924 0.374 C PGS_C06HBa0120H21.1-9+_SGN-U321764+ (1721 2040,2203 2343,3077 3169,3275 3337,3726 4032) Alignment (genomic DNA sequence = upper lines): ATTCAATGTC TTCTTCAAGA ATGAAAATGG AGACCTCTGA TGGGGGTAGA TTGTCTCCAG 1780 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTCAATGTC TTCTTCAAGA ATGAAAATGG AGACCTCTGA TGGGGGTAGA TTGTCTCCAG 1588 ACATCTCTGC ATTGCATGTT CATCCAGCGG ACCAGGATAT GATCACAAGT TCTTCTAGAG 1840 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACATCTCTGC ATTGCATGTT CATCCAGCGG ACCAGGATAT GATCACAAGT TCTTCTAGAG 1648 AGAAGTTTGA GGGTGCGGTG TCTTCCAAGA TTCAAGGGGC TCCACAATCT GCTAATTCTC 1900 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGAAGTTTGA GGGTGCGGTG TCTTCCAAGA TTCAAGGGGC TCCACAATCT GCTAATTCTC 1708 GTGTACGACC TAGTAGTTCT GTTCTTTCCG GTTCTGATGG AACAGGTGCT GCCTCAACGT 1960 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTGTACGACC TAGTAGTTCT GTTCTTTCCG GTTCTGATGG AACAGGTGCT GCCTCAACGT 1768 CAGCTGACAA TGGATTATCA CGAACCTCTT CTGTAAATTC ATTTTCGTCA GAAAAATCCA 2020 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAGCTGACAA TGGATTATCA CGAACCTCTT CTGTAAATTC ATTTTCGTCA GAAAAATCCA 1828 CATTGAATCC ACATGCTAAG GCAAGAACTA TTTCTGAATC TTCCGTTTTT CAACAAAACG 2080 |||||||||| |||||||||| CATTGAATCC ACATGCTAAG .......... .......... .......... .......... 1848 AAAAGAACAA TCATCCTGCA ATAATTTCCA TAGTTTCTAT TGCAGTTATA CCATTCCCCC 2140 .......... .......... .......... .......... .......... .......... 1848 GCCTTCCTTT GGCCAATCAT CTACATGTTT ATTTATTCTG CTTCTGTTGA TCACTTTTTC 2200 .......... .......... .......... .......... .......... .......... 1848 AGGAATTTAA ATTAAATCCT AATGCAAAGA GTTTCATGCC ATTTCAATCA CCTTTGAGAC 2260 |||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ..GAATTTAA ATTAAATCCT AATGCAAAGA GTTTCATGCC ATTTCAATCA CCTTTGAGAC 1906 CTGCTTCTCC GGTGTCTGAT AGTTCCTTCT ATTATCCAGC TGGTGTGGCT ACTGTTCCCA 2320 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTGCTTCTCC GGTGTCTGAT AGTTCCTTCT ATTATCCAGC TGGTGTGGCT ACTGTTCCCA 1966 ATGTGCATGG CATGCCTGTT GGGGTAAGGG ATGTGTGTAT ATTTTTAATT TTTTATAGTC 2380 |||||||||| |||||||||| ||| ATGTGCATGG CATGCCTGTT GGG....... .......... .......... .......... 1989 AATATATGCG GAGTTTGTTC TGACAGAAAA AAAAAACTGT ATGAATCTTA CCTGGTCTGT 2440 .......... .......... .......... .......... .......... .......... 1989 CTTTGGAAGT CAAAGCATGG TCGACTCCAT AGCTGTATTA CCTAAGTTAT ATTTTGGACA 2500 .......... .......... .......... .......... .......... .......... 1989 TGTACTTGTG TAGTGTCTAT CATCTTTATT TTGAGTCAAA TATTTGGTTA TGCTGGTAAA 2560 .......... .......... .......... .......... .......... .......... 1989 AAAGTTTATA GATTTGGTTT GTCGGTAACC CATTTTTTGG TGGTATTGTG TAAAATCTGC 2620 .......... .......... .......... .......... .......... .......... 1989 AAATGTGTTT TCCTAAGAAT GGTAATTTTC ATGAGGTGGA ATACCTATGG ATTCTGTCTT 2680 .......... .......... .......... .......... .......... .......... 1989 TTGTAATCAT GCTGTGGACT ATTTACCCTT CTCATTTTTC ATTGTGGAAA TGAAACTCTA 2740 .......... .......... .......... .......... .......... .......... 1989 TGTGCTGTTA CGGTAATTGG GGTAAAATGT TGTACTTCTC TTATCGGATA TAATTAATGT 2800 .......... .......... .......... .......... .......... .......... 1989 TGTATTGAAT ATACTACTCT CCCCAAAAAC CTCACTTGTG AGATTACAAT GTATATGTTG 2860 .......... .......... .......... .......... .......... .......... 1989 TTATTGAGAA AATTTGTTCT TTTGATCTGC CTATTGTTTT CATATAGTTG GAAGATAGCC 2920 .......... .......... .......... .......... .......... .......... 1989 GGAGAAGTTC ACAATTCTTT TGAAGCTACT GTCGCAAATA TTTTTCTTGT GTTTGTTAGT 2980 .......... .......... .......... .......... .......... .......... 1989 GTTTCACCTA AAGAGGTGCT ATGCCTCCCC CCCTCCCCCA CCCTCTTCAC ATTTTAATGC 3040 .......... .......... .......... .......... .......... .......... 1989 ACTATTGAAG GTTTGTTAAT CATATATGCA ATGCAGGTAG GTCCTTCATT TTCTCCACAT 3100 |||| |||||||||| |||||||||| .......... .......... .......... ......GTAG GTCCTTCATT TTCTCCACAT 2013 CAGCCTGTTA TGTTTAATCC ACAAGCTACA CCTGTACCAC AACAATTTTT TCATCCAAAT 3160 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAGCCTGTTA TGTTTAATCC ACAAGCTACA CCTGTACCAC AACAATTTTT TCATCCAAAT 2073 GGACCACAGG TTTGTCAGTT TTTTGGTTAC CAGCATATAT GTGTACTGGT TCTTCTATTA 3220 ||||||||| GGACCACAG. .......... .......... .......... .......... .......... 2082 TTGCATTTGA AATGGGTTTC AGAATATTCA AAGATCTCAT CTTTTTTGAT GCAGTATGGG 3280 |||||| .......... .......... .......... .......... .......... ....TATGGG 2088 CAGCAGATGA TGATTGGTCC CCCTCGGCAA GTAGTCTATA TGCCGAATTA CCCCGCTGTA 3340 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||| CAGCAGATGA TGATTGGTCC CCCTCGGCAA GTAGTCTATA TGCCGAATTA CCCCGCT... 2145 AGAAATATTT GACATTTTAC ATTTATTTTG TTGGCCTTTA TATCAGTAAC ACCAAAATAA 3400 .......... .......... .......... .......... .......... .......... 2145 TGATAAATTT ACGTAACAGC AGCCAAACAG TTAAGCTGTG CTCCAGTCTG TTGACTGTTT 3460 .......... .......... .......... .......... .......... .......... 2145 CATACATTAA TAACAGCATT TCTGATTGTT ATGCATATAA AATTGATTGT TTATATGTTT 3520 .......... .......... .......... .......... .......... .......... 2145 GGCCAAATAA CCCTTTTGTG CGTAAGTACA CTGGGTACTT GAGGAGTTTG AGTTTTCTTA 3580 .......... .......... .......... .......... .......... .......... 2145 TATTATGAGA AAGTCTCTTT TGTTTTTAGA GCGAGCCACA GTTCAGCTAG AGTCTTGGGT 3640 .......... .......... .......... .......... .......... .......... 2145 CATATAACAA ACTAGTGGAA GTGTCAACTC TCTCTCTCTT TTATCGTGGC TCAAAACTGA 3700 .......... .......... .......... .......... .......... .......... 2145 CTGTGGCTTT GTTGAGATAT TATAGGAAAT GCGACGAGAC TACTAATCAG TTGGCAAACC 3760 ||||| |||||||||| |||||||||| |||||||||| .......... .......... .....GAAAT GCGACGAGAC TACTAATCAG TTGGCAAACC 2180 ATATTGCGTG GTGGGTTGAA CCGATGGATG CTGACATGAG ATTTCATGGA TTGGTGGAGG 3820 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATATTGCGTG GTGGGTTGAA CCGATGGATG CTGACATGAG ATTTCATGGA TTGGTGGAGG 2240 AGGTTTAGCT GGTTGATGAA GGGGGATTCC AATGATTTGA TTAGAGCTTT TCCTTATACT 3880 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGGTTTAGCT GGTTGATGAA GGGGGATTCC AATGATTTGA TTAGAGCTTT TCCTTATACT 2300 GGGGTATCAG TAATTGTTAC TTTGTCATAA TCATTAGATT TGTTAACTTT CAGATTTACA 3940 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGGGTATCAG TAATTGTTAC TTTGTCATAA TCATTAGATT TGTTAACTTT CAGATTTACA 2360 GTCTTTCTTG AAGTTAACTG TGGTGTTTCC TTGGTATGCT GCTGTTGATA TTTCTTCTCT 4000 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTCTTTCTTG AAGTTAACTG TGGTGTTTCC TTGGTATGCT GCTGTTGATA TTTCTTCTCT 2420 TTGATCTGTA TTCCTAATAT TGTATGTTTC TC 4032 |||||||||| |||||||||| |||||||||| || TTGATCTGTA TTCCTAATAT TGTATGTTTC TC 2452 hqPGS_C06HBa0120H21.1-9+_SGN-U321764+ (1721 2040,2203 2343,3077 3169,3275 3337,3726 4032) ******************************************************************************** EST sequence 2 +strand 461 n (File: SGN-U335999+) 1 TCCCCNNTTG GACACCGCTG GACCTCCCTG CAGCGGCCGC GCGGTCTAGA ACTAGTGGAT 61 CCCCCGGGCT GCAGGAATTG GGCGCCTGAT CTTCTATTCA CTTATTAAAA CCCTTTTCCC 121 CATTCTTTGT TTCTACACAC AACTCAAAAT GCCCTTCTTG CTGTATTTAC CCCCCTTTCA 181 TCCATCCATC CGTACGCCAC TCTCATTTTC CTGCGAATTT CCTTCGAGGA TGCTCTTCTA 241 ATTAATGGAC GCCGGTGGAG GAGGAGAACA GTTTGATTCC CGAACTGTGG AAGATGTGTT 301 TGGGGATTTC AAGAGACGAC GAACTGCTTT GATTAAGGCT CTTACTGTTG ATGTGGAAGA 361 ATTTTATCAG CAGTGTGATC CTGAGAAGGA AAACTTGTGC TTGTATGGTC TCCCAAATGA 421 ACAATGGGAG GTCAATCTGC CTGCTGAAGA AGTACCACCT G Predicted gene structure (within gDNA segment 5110 to 10417): Exon 1 6662 6923 ( 262 n); cDNA 89 350 ( 262 n); score: 0.935 Intron 1 6924 7425 ( 502 n); Pd: 0.922 (s: 1.00), Pa: 0.997 (s: 0) Exon 2 7426 7458 ( 33 n); cDNA 351 383 ( 33 n); score: 1.000 Intron 2 7459 9729 (2271 n); Pd: 0.748 (s: 0), Pa: 0.842 (s: 1.00) Exon 3 9730 9807 ( 78 n); cDNA 384 461 ( 78 n); score: 1.000 MATCH C06HBa0120H21.1-9+ SGN-U335999+ 0.950 373 0.809 C PGS_C06HBa0120H21.1-9+_SGN-U335999+ (6662 6923,7426 7458,9730 9807) Alignment (genomic DNA sequence = upper lines): ATCTTCTCTT CACTTATTAA AACCCTTTTC CCCATTCTTT GTTTCTACAC ACAATTCAAA 6721 ||||||| || |||||||||| |||||||||| |||||||||| |||||||||| |||| ||||| ATCTTCTATT CACTTATTAA AACCCTTTTC CCCATTCTTT GTTTCTACAC ACAACTCAAA 148 ATCCCCTCCT CCCTCTCTTT CCCCCCCTTT GAACTCTGCA GCCGTACGCC ACTCTCATTT 6781 || |||| || || | ||| ||||||||| | | | || ||||||||| |||||||||| ATGCCCTTCT TGCTGTATTT ACCCCCCTTT CATCCATCCA TCCGTACGCC ACTCTCATTT 208 TCCTGCGAAT TTCCTTCGAG GTTGCTCTTC TGATTAATGG ACGCCGGTGG AGGAGGAGAA 6841 |||||||||| |||||||||| | |||||||| | |||||||| |||||||||| |||||||||| TCCTGCGAAT TTCCTTCGAG GATGCTCTTC TAATTAATGG ACGCCGGTGG AGGAGGAGAA 268 CAGTTTGATT CCCGAACTGT GGAAGATGTG TTTGGGGATT TCAAGAGACG ACGAACTGCT 6901 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAGTTTGATT CCCGAACTGT GGAAGATGTG TTTGGGGATT TCAAGAGACG ACGAACTGCT 328 TTGATTAAGG CTCTTACTGT TGGTGGGTTT TTTTTTTTTG GTGTGTGTTG TTTGGTTTTT 6961 |||||||||| |||||||||| || TTGATTAAGG CTCTTACTGT TG........ .......... .......... .......... 350 TGAATGTTTA TTAACTGTTC CATTAAAACC CTTGAGTTTT CATTGGATTT TTGTTGTGGT 7021 .......... .......... .......... .......... .......... .......... 350 GTGTGAGAAA TTGAGATGAG ATTTTCGATG TTTAGTAGTT TTTTCTTGAG AAAATCACAA 7081 .......... .......... .......... .......... .......... .......... 350 TCCCCTCCTC CTTTTTTCCC TTTGAATTCC GAAATAGTAG TAATGATGTT TCGCGTTATT 7141 .......... .......... .......... .......... .......... .......... 350 TCATGTGTTG TTTTTCATCT TGCTCGTTTT TTTTTGTGCA AATTTTATTT ATCGCTCTTT 7201 .......... .......... .......... .......... .......... .......... 350 GAAAATTGGA AACCCTAGAG TTTTTTCTTT TGGATGTGTT TGTGAAATTA AGATGAGATT 7261 .......... .......... .......... .......... .......... .......... 350 TATCAATCTT TTGAAGTTTT TGAGAAAATC ACAGTCCCAT TTTTTCAGTT TTTTTGAATT 7321 .......... .......... .......... .......... .......... .......... 350 CCCAAATATT AGTAATGATG TTTACCTTTG CTTCACTTTT TTTTTTTAAC TATTTGTCTT 7381 .......... .......... .......... .......... .......... .......... 350 TATTCTGTAT CTAATTGGTG TATTTTGTTT TTGTTATATT CCAGATGTGG AAGAATTTTA 7441 |||||| |||||||||| .......... .......... .......... .......... ....ATGTGG AAGAATTTTA 366 TCAGCAGTGT GATCCTGGTA AGGGTTTTTT CCAGTTGGAC CTTCTTCAGT ATAAAGTTTG 7501 |||||||||| ||||||| TCAGCAGTGT GATCCTG... .......... .......... .......... .......... 383 GAATTTTTTA CTTGCATAAG CTGTATGTAA TGACTGTCAT TGCATTTTTG TTTCATTTTG 7561 .......... .......... .......... .......... .......... .......... 383 GTTTTGGGTA TTAGGGAATT TGATATTGCT CTAGACGAAT GATAGCATCG TGTTCTGATA 7621 .......... .......... .......... .......... .......... .......... 383 GTGAATGAAT GAACTTGCTA CCTTTGCAGA GTTTTGTTTG TTCATTGGCA TCTTTTGTAT 7681 .......... .......... .......... .......... .......... .......... 383 GCAATGCAGT GTGTGTTCCT TTTTTCTGAA AAAGACTTTA AGGAAACAGG ATCAGTTTGT 7741 .......... .......... .......... .......... .......... .......... 383 TCTGAAGAAG TTAAGCAGTT TTGTGTTAGT TGTCATTTGC TGATAACTAT GTTAAAAAAA 7801 .......... .......... .......... .......... .......... .......... 383 TCTAGAACAG ACTCACACTT GCGGTTTTAT CCGATACCTA GTTGTTGACT TGACTTGGAT 7861 .......... .......... .......... .......... .......... .......... 383 AGTAAATGTG ATGTCTGATA CACTGGTTCA TTTGTTGCAT AGTTAAAGCT TCAACAAGCT 7921 .......... .......... .......... .......... .......... .......... 383 GCTGTAATTA TTTATTTAAG ATGCAGCCAC ATGCGATAAA TAAGCCATGT TGAAGGAACT 7981 .......... .......... .......... .......... .......... .......... 383 CCAGATATAC CTTGCCAATA CAAATGTATT GATTGATTAA TTAGATACTA GACAACATCT 8041 .......... .......... .......... .......... .......... .......... 383 GCTTCAGTGT TCTTTCTCCC CCTCTTTCTT CTTTCCCCTT GAACTGGCTC CGTTATCAGT 8101 .......... .......... .......... .......... .......... .......... 383 CATATGTATG TTAGGATGGT GATGACATAT GCAAAAGTGA ATCCAAAAAA ATTGAGTTTA 8161 .......... .......... .......... .......... .......... .......... 383 TGGGTTCTAG ATTCTAGATT CTATATTGTA GAAAGAGAAC TTATTTGGTT TTGGATAAGT 8221 .......... .......... .......... .......... .......... .......... 383 TTATACATAT TAAGTGGATT TGAAGGGTGT GTGGATTGGC TTTTTCCAAA GTGGCTTATT 8281 .......... .......... .......... .......... .......... .......... 383 GTCTAAAAGC TAAAAAACAT AAGTTGGGAA CGCCCAACTT TGGCTTTTGG CTTTCTTTGT 8341 .......... .......... .......... .......... .......... .......... 383 ACTTTTTCAG CCTAAAAGAA AGTGCTGATG TTTTCCAAAA GCTTCCAAAA CTAAAGAAGA 8401 .......... .......... .......... .......... .......... .......... 383 GCTTAAAAGG CAGACCTTCT TAATCCAAAC ACCCACTTAG CCTCTCTACC TCCAAGGTAG 8461 .......... .......... .......... .......... .......... .......... 383 GGGTAAGGTC TGCTAAGCTT TGTCCTCTCT ATTCCCGACT TTGTAGGATT ACATAGGGTG 8521 .......... .......... .......... .......... .......... .......... 383 TTTTGTTGTA TATCAATTGG GTTTCTTGAC ACAAATATAG AGTTTTCGCC AAGGTTATTG 8581 .......... .......... .......... .......... .......... .......... 383 AGTTCTATTG AATCTATAGC GGCATAGCCT CAACTTTAGC TGCACCCTTG AATATGTGAT 8641 .......... .......... .......... .......... .......... .......... 383 TGTGCCAATA CAAGTGGTCG GAGTAGGTAG AAAGGATTAG GGCGTGTTTT GGTCCAAGGA 8701 .......... .......... .......... .......... .......... .......... 383 ATTTATGTAG GAGCAGTCAA TTAGTTGTAC TACACATATA ATTAGGTCTC TTTGGAGAAA 8761 .......... .......... .......... .......... .......... .......... 383 ATTAGGCGGC ATTGGGTTTT CTGAATCAAC TATTATGTCA GAACTTTTAC GTATAATTAA 8821 .......... .......... .......... .......... .......... .......... 383 GTTGTCTCGA TAGACTTGTA ATTTTTTAGA ATCTATTCAG ATTTATCTAA AGATAGGATA 8881 .......... .......... .......... .......... .......... .......... 383 GTCTTAACAT TACTGTGCAT TGTACCTTTA TATGCCTATA AATACAAAAT TCTTATATTT 8941 .......... .......... .......... .......... .......... .......... 383 TATATACCAC TTTGTATGGT CTGTTATGTT TTTTCTTTTA TTTGCTATGC CTTGTTCTAA 9001 .......... .......... .......... .......... .......... .......... 383 TCTGTATTAT CTTTTCCAAG CTGACTTGTT ATGTGTTACT TGAGCCGAGG GTCTGTGTGC 9061 .......... .......... .......... .......... .......... .......... 383 ACTGGGTATG TTATTGTATA TGCCAGTAAA CATATTCATA AATTCTCAGT TTGTCATGTC 9121 .......... .......... .......... .......... .......... .......... 383 ATGTTCTCTG CAGTTTCTCC ATTTCATTAC ATTGGCCAAG TGCAAAAGCA AAAAATGATT 9181 .......... .......... .......... .......... .......... .......... 383 GAGCAAGTGT AAATGGTCTT GCGTATTTAT AAGTTCTGTA TTGCCATGTC TGTAATAAAT 9241 .......... .......... .......... .......... .......... .......... 383 ATTTACCTTC TCTAGAAGCA GAACATTACT TATAAATTAC AGGGCATTGA GCTTCAACAT 9301 .......... .......... .......... .......... .......... .......... 383 TTTTTTCATC TGAATTTTTG CTCCTTACTT CCCTAGCTGT GCTAAGTTTA TGGACCTAAA 9361 .......... .......... .......... .......... .......... .......... 383 TATAGCCATA CATGGATGCA TTTAGTGGTA ATTTATGGAT GACAAGATTT GCTCAAAGTT 9421 .......... .......... .......... .......... .......... .......... 383 TTCAAAAGTG AACTTGCTTT GCTACTGGAA GTGTCATATG TAATGGCACT GACTCATACT 9481 .......... .......... .......... .......... .......... .......... 383 ACTGCTCTGT GTGTCACATA TTAGTTGATT GATGACTTAG AATACTTAGT TCCTTATTGA 9541 .......... .......... .......... .......... .......... .......... 383 TACTCTTATC TTTCTCCTTT AATTGAGACT CAGTTTTTGG TAACATCTAC CGTGGTCCCC 9601 .......... .......... .......... .......... .......... .......... 383 TTCTTTGAAC AATTTTTTTA TTTACATTAA TGTGGTGTAC TTTACTTTTG TAATTTATTC 9661 .......... .......... .......... .......... .......... .......... 383 CATCTTATAT GAATGATTAT GTAGTTTCTA ATTCAAGCGG TTAAACATCT TCATTGTGTA 9721 .......... .......... .......... .......... .......... .......... 383 TTATCCAGAG AAGGAAAACT TGTGCTTGTA TGGTCTCCCA AATGAACAAT GGGAGGTCAA 9781 || |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ........AG AAGGAAAACT TGTGCTTGTA TGGTCTCCCA AATGAACAAT GGGAGGTCAA 435 TCTGCCTGCT GAAGAAGTAC CACCTG 9807 |||||||||| |||||||||| |||||| TCTGCCTGCT GAAGAAGTAC CACCTG 461 hqPGS_C06HBa0120H21.1-9+_SGN-U335999+ (6662 6923,7426 7458,9730 9807) ******************************************************************************** EST sequence 4 +strand 1311 n (File: SGN-U317215+) 1 CTTCTCTTCA CTTATTAAAA CCCTTTTCCC CATTCTTTGT TTCTACACAC AATTCAAAAT 61 CCCCTCCTCC CTCTCTTTCC CCCCCTTTGA ACTCTGCAGC CGTACGCCAC TCTCATTTTC 121 CTGCGAATTT CCTTCGAGGT TGCTCTTCTG ATTAATGGAC GCCGGTGGAG GAGGAGAACA 181 GTTTGATTCC CGAACTGTGG AAGATGTGTT TGGGGATTTC AAGAGACGAC GAACTGCTTT 241 GATTAAGGCT CTTACTGTTG ATGTGGAAGA ATTTTATCAG CAGTGTGATC CTGAGAAGGA 301 AAACTTGTGC TTGTATGGTC TCCCAAATGA ACAATGGGAG GTCAATCTGC CTGCTGAAGA 361 AGTACCACCT GAACTCCCTG AGCCTGCTCT TGGTATTAAC TTTGCTAGAG ATGGGATGGA 421 AGACAAGGAT TGGCTATCCT TAGTTGCTGT CCATAGTGAT TCCTGGCTGC TTTCTGTCGC 481 CTTCTATTTT GGTGCTAGAT TTGGGTTTGA TAAAGCCAGC AGGAAGAAGC TTTTCAACAT 541 GATAAATGAA CTGCCTACAA TATATGAAGT AGTGACTGGT GCCTCAAAGA AACAACAGAA 601 AGAAAAATCT TCTGGCCATA GTGGCAAGAA ATCCAAGTCA AATTCCAAGG CGAGGGCACA 661 AGACTATCAG GAGAAGTTAG CAAAGTTGCA GGCTAAAGAT GAAGAGGAGG AGGGTTTGGA 721 TGAGCAGGAG GACGAGGATG AGCATGGCGA GACACTGTGT GGTGCCTGTG GAGAAAATTA 781 TGCAGCAGAT GAATTTTGGA TATGCTGTGA CATTTGTGAA AAGTGGTTCC ACGGCAAGTG 841 TGTGAAGATC ACCCCTGCCA AGGCTGAGCA TATCAAGCAA TACAAGTGTC CGTCTTGTAG 901 CCACAAGAGA CCTCGAGCTG ACATATAAAA TTTGATAAAG TAGCATCTTC TGTTAGTCAG 961 GTTATTCAGG CTGTTTGGCT CTCTACCTCG TGGAGGTTTA ATAGCACTGT GGTTCATCTT 1021 TGTACACTGT TGGTTAGAAC ATGCAGCTGA TGAATAGGGA GGGTTCTATT GATGGTGGTT 1081 TAGTTATAAA AAAAGTAGTC TTAAAGTAGC AAAAACAGGT ACATACTATG AGTTTAAACA 1141 GTTGATGTCT CTTCTTTAGT TGTTTTAGTA GGAGGATTTG ATGTGGTCTA TTGAGCTTCC 1201 ACCAACTTTG GTGTACATGT CTCTTTGGTT TTTAAACACT TAATGTTGCC TATAATTGTG 1261 AATTATGGTC ATGTTTATTA CATACCCCAT AAAAAAAAAA AAAAAAAAAA A Predicted gene structure (within gDNA segment 6064 to 12982): Exon 1 6664 6923 ( 260 n); cDNA 1 260 ( 260 n); score: 1.000 Intron 1 6924 7425 ( 502 n); Pd: 0.922 (s: 1.00), Pa: 0.997 (s: 0) Exon 2 7426 7458 ( 33 n); cDNA 261 293 ( 33 n); score: 1.000 Intron 2 7459 9729 (2271 n); Pd: 0.748 (s: 0), Pa: 0.842 (s: 1.00) Exon 3 9730 9958 ( 229 n); cDNA 294 522 ( 229 n); score: 1.000 Intron 3 9959 11131 (1173 n); Pd: 0.999 (s: 1.00), Pa: 0.994 (s: 1.00) Exon 4 11132 11261 ( 130 n); cDNA 523 652 ( 130 n); score: 1.000 Intron 4 11262 11479 ( 218 n); Pd: 0.979 (s: 1.00), Pa: 0.970 (s: 1.00) Exon 5 11480 12115 ( 636 n); cDNA 653 1288 ( 636 n); score: 0.998 PPA cDNA 1289 1311 MATCH C06HBa0120H21.1-9+ SGN-U317215+ 0.999 1288 0.982 C PGS_C06HBa0120H21.1-9+_SGN-U317215+ (6664 6923,7426 7458,9730 9958,11132 11261,11480 12115) Alignment (genomic DNA sequence = upper lines): CTTCTCTTCA CTTATTAAAA CCCTTTTCCC CATTCTTTGT TTCTACACAC AATTCAAAAT 6723 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTCTCTTCA CTTATTAAAA CCCTTTTCCC CATTCTTTGT TTCTACACAC AATTCAAAAT 60 CCCCTCCTCC CTCTCTTTCC CCCCCTTTGA ACTCTGCAGC CGTACGCCAC TCTCATTTTC 6783 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCCCTCCTCC CTCTCTTTCC CCCCCTTTGA ACTCTGCAGC CGTACGCCAC TCTCATTTTC 120 CTGCGAATTT CCTTCGAGGT TGCTCTTCTG ATTAATGGAC GCCGGTGGAG GAGGAGAACA 6843 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTGCGAATTT CCTTCGAGGT TGCTCTTCTG ATTAATGGAC GCCGGTGGAG GAGGAGAACA 180 GTTTGATTCC CGAACTGTGG AAGATGTGTT TGGGGATTTC AAGAGACGAC GAACTGCTTT 6903 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTTTGATTCC CGAACTGTGG AAGATGTGTT TGGGGATTTC AAGAGACGAC GAACTGCTTT 240 GATTAAGGCT CTTACTGTTG GTGGGTTTTT TTTTTTTGGT GTGTGTTGTT TGGTTTTTTG 6963 |||||||||| |||||||||| GATTAAGGCT CTTACTGTTG .......... .......... .......... .......... 260 AATGTTTATT AACTGTTCCA TTAAAACCCT TGAGTTTTCA TTGGATTTTT GTTGTGGTGT 7023 .......... .......... .......... .......... .......... .......... 260 GTGAGAAATT GAGATGAGAT TTTCGATGTT TAGTAGTTTT TTCTTGAGAA AATCACAATC 7083 .......... .......... .......... .......... .......... .......... 260 CCCTCCTCCT TTTTTCCCTT TGAATTCCGA AATAGTAGTA ATGATGTTTC GCGTTATTTC 7143 .......... .......... .......... .......... .......... .......... 260 ATGTGTTGTT TTTCATCTTG CTCGTTTTTT TTTGTGCAAA TTTTATTTAT CGCTCTTTGA 7203 .......... .......... .......... .......... .......... .......... 260 AAATTGGAAA CCCTAGAGTT TTTTCTTTTG GATGTGTTTG TGAAATTAAG ATGAGATTTA 7263 .......... .......... .......... .......... .......... .......... 260 TCAATCTTTT GAAGTTTTTG AGAAAATCAC AGTCCCATTT TTTCAGTTTT TTTGAATTCC 7323 .......... .......... .......... .......... .......... .......... 260 CAAATATTAG TAATGATGTT TACCTTTGCT TCACTTTTTT TTTTTAACTA TTTGTCTTTA 7383 .......... .......... .......... .......... .......... .......... 260 TTCTGTATCT AATTGGTGTA TTTTGTTTTT GTTATATTCC AGATGTGGAA GAATTTTATC 7443 |||||||| |||||||||| .......... .......... .......... .......... ..ATGTGGAA GAATTTTATC 278 AGCAGTGTGA TCCTGGTAAG GGTTTTTTCC AGTTGGACCT TCTTCAGTAT AAAGTTTGGA 7503 |||||||||| ||||| AGCAGTGTGA TCCTG..... .......... .......... .......... .......... 293 ATTTTTTACT TGCATAAGCT GTATGTAATG ACTGTCATTG CATTTTTGTT TCATTTTGGT 7563 .......... .......... .......... .......... .......... .......... 293 TTTGGGTATT AGGGAATTTG ATATTGCTCT AGACGAATGA TAGCATCGTG TTCTGATAGT 7623 .......... .......... .......... .......... .......... .......... 293 GAATGAATGA ACTTGCTACC TTTGCAGAGT TTTGTTTGTT CATTGGCATC TTTTGTATGC 7683 .......... .......... .......... .......... .......... .......... 293 AATGCAGTGT GTGTTCCTTT TTTCTGAAAA AGACTTTAAG GAAACAGGAT CAGTTTGTTC 7743 .......... .......... .......... .......... .......... .......... 293 TGAAGAAGTT AAGCAGTTTT GTGTTAGTTG TCATTTGCTG ATAACTATGT TAAAAAAATC 7803 .......... .......... .......... .......... .......... .......... 293 TAGAACAGAC TCACACTTGC GGTTTTATCC GATACCTAGT TGTTGACTTG ACTTGGATAG 7863 .......... .......... .......... .......... .......... .......... 293 TAAATGTGAT GTCTGATACA CTGGTTCATT TGTTGCATAG TTAAAGCTTC AACAAGCTGC 7923 .......... .......... .......... .......... .......... .......... 293 TGTAATTATT TATTTAAGAT GCAGCCACAT GCGATAAATA AGCCATGTTG AAGGAACTCC 7983 .......... .......... .......... .......... .......... .......... 293 AGATATACCT TGCCAATACA AATGTATTGA TTGATTAATT AGATACTAGA CAACATCTGC 8043 .......... .......... .......... .......... .......... .......... 293 TTCAGTGTTC TTTCTCCCCC TCTTTCTTCT TTCCCCTTGA ACTGGCTCCG TTATCAGTCA 8103 .......... .......... .......... .......... .......... .......... 293 TATGTATGTT AGGATGGTGA TGACATATGC AAAAGTGAAT CCAAAAAAAT TGAGTTTATG 8163 .......... .......... .......... .......... .......... .......... 293 GGTTCTAGAT TCTAGATTCT ATATTGTAGA AAGAGAACTT ATTTGGTTTT GGATAAGTTT 8223 .......... .......... .......... .......... .......... .......... 293 ATACATATTA AGTGGATTTG AAGGGTGTGT GGATTGGCTT TTTCCAAAGT GGCTTATTGT 8283 .......... .......... .......... .......... .......... .......... 293 CTAAAAGCTA AAAAACATAA GTTGGGAACG CCCAACTTTG GCTTTTGGCT TTCTTTGTAC 8343 .......... .......... .......... .......... .......... .......... 293 TTTTTCAGCC TAAAAGAAAG TGCTGATGTT TTCCAAAAGC TTCCAAAACT AAAGAAGAGC 8403 .......... .......... .......... .......... .......... .......... 293 TTAAAAGGCA GACCTTCTTA ATCCAAACAC CCACTTAGCC TCTCTACCTC CAAGGTAGGG 8463 .......... .......... .......... .......... .......... .......... 293 GTAAGGTCTG CTAAGCTTTG TCCTCTCTAT TCCCGACTTT GTAGGATTAC ATAGGGTGTT 8523 .......... .......... .......... .......... .......... .......... 293 TTGTTGTATA TCAATTGGGT TTCTTGACAC AAATATAGAG TTTTCGCCAA GGTTATTGAG 8583 .......... .......... .......... .......... .......... .......... 293 TTCTATTGAA TCTATAGCGG CATAGCCTCA ACTTTAGCTG CACCCTTGAA TATGTGATTG 8643 .......... .......... .......... .......... .......... .......... 293 TGCCAATACA AGTGGTCGGA GTAGGTAGAA AGGATTAGGG CGTGTTTTGG TCCAAGGAAT 8703 .......... .......... .......... .......... .......... .......... 293 TTATGTAGGA GCAGTCAATT AGTTGTACTA CACATATAAT TAGGTCTCTT TGGAGAAAAT 8763 .......... .......... .......... .......... .......... .......... 293 TAGGCGGCAT TGGGTTTTCT GAATCAACTA TTATGTCAGA ACTTTTACGT ATAATTAAGT 8823 .......... .......... .......... .......... .......... .......... 293 TGTCTCGATA GACTTGTAAT TTTTTAGAAT CTATTCAGAT TTATCTAAAG ATAGGATAGT 8883 .......... .......... .......... .......... .......... .......... 293 CTTAACATTA CTGTGCATTG TACCTTTATA TGCCTATAAA TACAAAATTC TTATATTTTA 8943 .......... .......... .......... .......... .......... .......... 293 TATACCACTT TGTATGGTCT GTTATGTTTT TTCTTTTATT TGCTATGCCT TGTTCTAATC 9003 .......... .......... .......... .......... .......... .......... 293 TGTATTATCT TTTCCAAGCT GACTTGTTAT GTGTTACTTG AGCCGAGGGT CTGTGTGCAC 9063 .......... .......... .......... .......... .......... .......... 293 TGGGTATGTT ATTGTATATG CCAGTAAACA TATTCATAAA TTCTCAGTTT GTCATGTCAT 9123 .......... .......... .......... .......... .......... .......... 293 GTTCTCTGCA GTTTCTCCAT TTCATTACAT TGGCCAAGTG CAAAAGCAAA AAATGATTGA 9183 .......... .......... .......... .......... .......... .......... 293 GCAAGTGTAA ATGGTCTTGC GTATTTATAA GTTCTGTATT GCCATGTCTG TAATAAATAT 9243 .......... .......... .......... .......... .......... .......... 293 TTACCTTCTC TAGAAGCAGA ACATTACTTA TAAATTACAG GGCATTGAGC TTCAACATTT 9303 .......... .......... .......... .......... .......... .......... 293 TTTTCATCTG AATTTTTGCT CCTTACTTCC CTAGCTGTGC TAAGTTTATG GACCTAAATA 9363 .......... .......... .......... .......... .......... .......... 293 TAGCCATACA TGGATGCATT TAGTGGTAAT TTATGGATGA CAAGATTTGC TCAAAGTTTT 9423 .......... .......... .......... .......... .......... .......... 293 CAAAAGTGAA CTTGCTTTGC TACTGGAAGT GTCATATGTA ATGGCACTGA CTCATACTAC 9483 .......... .......... .......... .......... .......... .......... 293 TGCTCTGTGT GTCACATATT AGTTGATTGA TGACTTAGAA TACTTAGTTC CTTATTGATA 9543 .......... .......... .......... .......... .......... .......... 293 CTCTTATCTT TCTCCTTTAA TTGAGACTCA GTTTTTGGTA ACATCTACCG TGGTCCCCTT 9603 .......... .......... .......... .......... .......... .......... 293 CTTTGAACAA TTTTTTTATT TACATTAATG TGGTGTACTT TACTTTTGTA ATTTATTCCA 9663 .......... .......... .......... .......... .......... .......... 293 TCTTATATGA ATGATTATGT AGTTTCTAAT TCAAGCGGTT AAACATCTTC ATTGTGTATT 9723 .......... .......... .......... .......... .......... .......... 293 ATCCAGAGAA GGAAAACTTG TGCTTGTATG GTCTCCCAAA TGAACAATGG GAGGTCAATC 9783 |||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ......AGAA GGAAAACTTG TGCTTGTATG GTCTCCCAAA TGAACAATGG GAGGTCAATC 347 TGCCTGCTGA AGAAGTACCA CCTGAACTCC CTGAGCCTGC TCTTGGTATT AACTTTGCTA 9843 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGCCTGCTGA AGAAGTACCA CCTGAACTCC CTGAGCCTGC TCTTGGTATT AACTTTGCTA 407 GAGATGGGAT GGAAGACAAG GATTGGCTAT CCTTAGTTGC TGTCCATAGT GATTCCTGGC 9903 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAGATGGGAT GGAAGACAAG GATTGGCTAT CCTTAGTTGC TGTCCATAGT GATTCCTGGC 467 TGCTTTCTGT CGCCTTCTAT TTTGGTGCTA GATTTGGGTT TGATAAAGCC AGCAGGTACT 9963 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||| TGCTTTCTGT CGCCTTCTAT TTTGGTGCTA GATTTGGGTT TGATAAAGCC AGCAG..... 522 TTTTCTCATC TGAACTTTCA TTTGTTGTAA GCACACATAT TTCCATGCTT AATGGACGTT 10023 .......... .......... .......... .......... .......... .......... 522 AAGAGCATTA CTTGATTGAA ATATGTGATA AGTTTTGGAG ATATTTGCTT CTAGTATTTC 10083 .......... .......... .......... .......... .......... .......... 522 CTTTGATGAT TTTATATTTG TATTTGATCC CCAGTAAATC AAATTGTTTA TCTGTTTTCC 10143 .......... .......... .......... .......... .......... .......... 522 TGGTTGTTCT CTCAAGCAAT TCCATGCATC TATTTGATTT GTGATGTGTT GCATTGAAGT 10203 .......... .......... .......... .......... .......... .......... 522 AATCGTTTTT TGTTATTAGT AATATAGTTC AGACAGTGAT TGGTATCAAG CTGCCACTAC 10263 .......... .......... .......... .......... .......... .......... 522 TAGCTGTGCT ACTGTCCCAG AGGAGGTATA AATGTTAAGT CTGCTTTATT AAGTTGTACG 10323 .......... .......... .......... .......... .......... .......... 522 CAGAGATAAT TATTGGGCAT GGGCAGAACT GTGCTGGTTC AGTTAGAAAT ATAGATATTC 10383 .......... .......... .......... .......... .......... .......... 522 TAATTACAGG CTAAAAAGAT TCTTCCTCCA GCTTGCATTA TTTGTTATGT ATGGACAGAT 10443 .......... .......... .......... .......... .......... .......... 522 TTCAAATGGG ATGTAAATCA AATCAATAGA CAGTTGACCT CTGTTGCTTG GATATCTGTA 10503 .......... .......... .......... .......... .......... .......... 522 TGATTAACAT ACCTACGCGT TGTCAAACTT GTTTTTGGCT TAACACAGTC AATCAACGTA 10563 .......... .......... .......... .......... .......... .......... 522 CATCATTGTA TTCATGCTCA AGTATTTTCT TTGAAATTGT GAAAGAGCTC TTATTGAGCA 10623 .......... .......... .......... .......... .......... .......... 522 AGGTTCAATT TTAGTTCATT TCTGCATTTT GTGTTTGCTA CATGTAGGAT CTTTGTCAGA 10683 .......... .......... .......... .......... .......... .......... 522 GGGTTTGATG AATGAAAAAT GCTCCAACTT TTCACTTAGA AAAGTAAATT TGATGTTAAT 10743 .......... .......... .......... .......... .......... .......... 522 GGAAAGCCAT GTTCTGGTTA TTTTCCATTA GCCCTACTGC ATATAGGATT CAATAGTTTA 10803 .......... .......... .......... .......... .......... .......... 522 TTCACTACTT GAAAAATTAA GGCATTAGTG ACGGACTAAT TCCATAGCTT AACGACAAAA 10863 .......... .......... .......... .......... .......... .......... 522 GTCATTGCTA AACCTATTTA GTGATGGCAG TTAGCTACGA CCAAATTTAG TGACGAAATT 10923 .......... .......... .......... .......... .......... .......... 522 CAAAGCTGAT TCCTGTTGTT GTTTTTTGTA GTTATTAGGT TCACATTTCT ATTATCCACC 10983 .......... .......... .......... .......... .......... .......... 522 CATCATTGTG CCTCCTGTTT GAAACCCGCA TCAGGATATA CATTGTCAAT ATGGCCTCAG 11043 .......... .......... .......... .......... .......... .......... 522 TGGTGACAGA AGTCATCATA GTTCCCCAGC TCATGAACTT CACTGCGTTT TTTTTCTGTT 11103 .......... .......... .......... .......... .......... .......... 522 GCTCATAGTT TCTTTTTATC TTGTTCAGGA AGAAGCTTTT CAACATGATA AATGAACTGC 11163 || |||||||||| |||||||||| |||||||||| .......... .......... ........GA AGAAGCTTTT CAACATGATA AATGAACTGC 554 CTACAATATA TGAAGTAGTG ACTGGTGCCT CAAAGAAACA ACAGAAAGAA AAATCTTCTG 11223 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTACAATATA TGAAGTAGTG ACTGGTGCCT CAAAGAAACA ACAGAAAGAA AAATCTTCTG 614 GCCATAGTGG CAAGAAATCC AAGTCAAATT CCAAGGCGGT AAGTCTATCT TTGGACAATA 11283 |||||||||| |||||||||| |||||||||| |||||||| GCCATAGTGG CAAGAAATCC AAGTCAAATT CCAAGGCG.. .......... .......... 652 TGTTGGTTTG CTGGTCAGAA GTAGCATATT CGTTTTTCTG TTTTCCGGGA TATTAGAGAT 11343 .......... .......... .......... .......... .......... .......... 652 GTGAAGGAGA GCTTGTCCTT GGACTATCTA ATCTGATCTA ACCCTGTAAC AACGATACCT 11403 .......... .......... .......... .......... .......... .......... 652 ATTGTGAATA TTGTCTGACT ACATGGTGTC TTCACTTTTG GTTGATCATT TTCTTACTTT 11463 .......... .......... .......... .......... .......... .......... 652 TTGGTGGCAA AAGCAGAGGG CACAAGACTA TCAGGAGAAG TTAGCAAAGT TGCAGGCTAA 11523 |||| |||||||||| |||||||||| |||||||||| |||||||||| .......... ......AGGG CACAAGACTA TCAGGAGAAG TTAGCAAAGT TGCAGGCTAA 696 AGATGAAGAG GAGGAGGGTT TGGATGAGCA GGAGGACGAG GATGAGCATG GCGAGACACT 11583 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGATGAAGAG GAGGAGGGTT TGGATGAGCA GGAGGACGAG GATGAGCATG GCGAGACACT 756 GTGTGGTGCC TGTGGAGAAA ATTATGCAGC AGATGAATTT TGGATATGCT GTGACATTTG 11643 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTGTGGTGCC TGTGGAGAAA ATTATGCAGC AGATGAATTT TGGATATGCT GTGACATTTG 816 TGAAAAGTGG TTCCACGGCA AGTGTGTGAA GATCACCCCT GCCAAGGCTG AGCATATCAA 11703 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGAAAAGTGG TTCCACGGCA AGTGTGTGAA GATCACCCCT GCCAAGGCTG AGCATATCAA 876 GCAATACAAG TGTCCGTCTT GTAGCCACAA GAGACCTCGA GCTGACATAT AAAATTTGAT 11763 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCAATACAAG TGTCCGTCTT GTAGCCACAA GAGACCTCGA GCTGACATAT AAAATTTGAT 936 AAAGTAGCAT CTTCTGTTAG TCAGGTTATT CAGGCTGTTT GGCTCTCTAC CTCGTGGAGG 11823 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAGTAGCAT CTTCTGTTAG TCAGGTTATT CAGGCTGTTT GGCTCTCTAC CTCGTGGAGG 996 TTTAATAGCA CTGTGGTTCA TCTTTGTACA CTGTTGGTTA GAACATGCAG CTGATGAATA 11883 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTAATAGCA CTGTGGTTCA TCTTTGTACA CTGTTGGTTA GAACATGCAG CTGATGAATA 1056 GGGAGGGTTC TATTGATGGT GGTTTAGTTA TAAAAAAAGT AGTCTTAAAG TAGCAAAAAC 11943 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGGAGGGTTC TATTGATGGT GGTTTAGTTA TAAAAAAAGT AGTCTTAAAG TAGCAAAAAC 1116 AGGTACATAC TATGAGTTTA AACAGTTGAT GTCTCTTCTT TAGTTGTTTT AGTAGGAGGA 12003 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGGTACATAC TATGAGTTTA AACAGTTGAT GTCTCTTCTT TAGTTGTTTT AGTAGGAGGA 1176 TTTGATGTGG TCTATTGAGC TTCCACCAAC TTTGGTGTAC ATGTCTCTTT GGTTTTTAAA 12063 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTGATGTGG TCTATTGAGC TTCCACCAAC TTTGGTGTAC ATGTCTCTTT GGTTTTTAAA 1236 CACTTAATGT TGCCTATAAT TGTGAATTAT GGTCATGTTT ATTACATACA CC 12115 |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| || CACTTAATGT TGCCTATAAT TGTGAATTAT GGTCATGTTT ATTACATACC CC 1288 hqPGS_C06HBa0120H21.1-9+_SGN-U317215+ (6664 6923,7426 7458,9730 9958,11132 11261,11480 12115) ******************************************************************************** EST sequence 1 +strand 902 n (File: SGN-U335998+) 1 ACTAGCTGGA GCTCCCCGCG GTGGCGGCCG CTCTAGAACT AGTGGATCCC CCGGGCTGCA 61 GGAATTCGGC ACGAGGCCTC CTCCCTCTCT TTCCCCCCCT TTGAACTCTG CAGCCGTACG 121 CCACTCTCAT TTTCCTGCGA ATTTCCTTCG AGGTTGCTCT TCTGATTAAT GGACGCCGGT 181 GGAGGAGGAG AACAGTTTGA TTCCCGAACT GTGGAAGATG TGTTTGGGGA TTTCAAGAGA 241 CGACGAACTG CTTTGATTAA GGCTCTTACT GTTGATGTGG AAGAATTTTA TCAGCAGTGT 301 GATCCTGAGA AGGAAAACTT GTGCTTGTAT GGTCTCCCAA ATGAACAATG GGAGGTCAAT 361 CTGCCTGCTG AAGAAGTACC ACCTGAACTC CCTGAGCCTG CTCTTGGTAT TAACTTTGCT 421 AGAGATGGGA TGGAAGACAA GGATTGGCTA TCCTTAGTTG CTGTCCATAG TGATTCCTGG 481 CTGCTTTCTG TCGCCTTCTA TTTTGGTGCT AGATTTGGGT TTGATAAAGC CAGCAGGTAC 541 TTTTTCTCAT CTGAACTTTC ATTTGTTGTA AGCACACATA TTTCCATGCT TAATGGACGT 601 TAAGAGCATT ACTTGATTGA AATATGTGAT AAGTTTTGGA GATATTTGCT TCTAGTATTT 661 CCTTTGATGA TTTTATATTT GTATTTGATC CCCAGTAAAT CAAATTGTTT ATCTGTTTTC 721 CTGGGTGTTC TCTCAAGCAA TTCCATGCAT CTATTTGATT TGTGATGTGT TGCATTGAAG 781 TAATCGTTTT TTGGTATTAG TAATATAGNT CAGACAGTGA TTGGGTATCA GCTGCCACTA 841 CTAGCTGTGC TACTGGTCCC AGAGAGGTAT AAATGGTAAG TCTGCTTTAT AAAGTGTACG 901 CA Predicted gene structure (within gDNA segment 5267 to 11915): Exon 1 6726 6923 ( 198 n); cDNA 77 274 ( 198 n); score: 1.000 Intron 1 6924 7425 ( 502 n); Pd: 0.922 (s: 1.00), Pa: 0.997 (s: 0) Exon 2 7426 7458 ( 33 n); cDNA 275 307 ( 33 n); score: 1.000 Intron 2 7459 9729 (2271 n); Pd: 0.748 (s: 0), Pa: 0.842 (s: 1.00) Exon 3 9730 10325 ( 596 n); cDNA 308 902 ( 595 n); score: 0.978 MATCH C06HBa0120H21.1-9+ SGN-U335998+ 0.984 827 0.917 C PGS_C06HBa0120H21.1-9+_SGN-U335998+ (6726 6923,7426 7458,9730 10325) Alignment (genomic DNA sequence = upper lines): CCTCCTCCCT CTCTTTCCCC CCCTTTGAAC TCTGCAGCCG TACGCCACTC TCATTTTCCT 6785 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCTCCTCCCT CTCTTTCCCC CCCTTTGAAC TCTGCAGCCG TACGCCACTC TCATTTTCCT 136 GCGAATTTCC TTCGAGGTTG CTCTTCTGAT TAATGGACGC CGGTGGAGGA GGAGAACAGT 6845 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCGAATTTCC TTCGAGGTTG CTCTTCTGAT TAATGGACGC CGGTGGAGGA GGAGAACAGT 196 TTGATTCCCG AACTGTGGAA GATGTGTTTG GGGATTTCAA GAGACGACGA ACTGCTTTGA 6905 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGATTCCCG AACTGTGGAA GATGTGTTTG GGGATTTCAA GAGACGACGA ACTGCTTTGA 256 TTAAGGCTCT TACTGTTGGT GGGTTTTTTT TTTTTGGTGT GTGTTGTTTG GTTTTTTGAA 6965 |||||||||| |||||||| TTAAGGCTCT TACTGTTG.. .......... .......... .......... .......... 274 TGTTTATTAA CTGTTCCATT AAAACCCTTG AGTTTTCATT GGATTTTTGT TGTGGTGTGT 7025 .......... .......... .......... .......... .......... .......... 274 GAGAAATTGA GATGAGATTT TCGATGTTTA GTAGTTTTTT CTTGAGAAAA TCACAATCCC 7085 .......... .......... .......... .......... .......... .......... 274 CTCCTCCTTT TTTCCCTTTG AATTCCGAAA TAGTAGTAAT GATGTTTCGC GTTATTTCAT 7145 .......... .......... .......... .......... .......... .......... 274 GTGTTGTTTT TCATCTTGCT CGTTTTTTTT TGTGCAAATT TTATTTATCG CTCTTTGAAA 7205 .......... .......... .......... .......... .......... .......... 274 ATTGGAAACC CTAGAGTTTT TTCTTTTGGA TGTGTTTGTG AAATTAAGAT GAGATTTATC 7265 .......... .......... .......... .......... .......... .......... 274 AATCTTTTGA AGTTTTTGAG AAAATCACAG TCCCATTTTT TCAGTTTTTT TGAATTCCCA 7325 .......... .......... .......... .......... .......... .......... 274 AATATTAGTA ATGATGTTTA CCTTTGCTTC ACTTTTTTTT TTTAACTATT TGTCTTTATT 7385 .......... .......... .......... .......... .......... .......... 274 CTGTATCTAA TTGGTGTATT TTGTTTTTGT TATATTCCAG ATGTGGAAGA ATTTTATCAG 7445 |||||||||| |||||||||| .......... .......... .......... .......... ATGTGGAAGA ATTTTATCAG 294 CAGTGTGATC CTGGTAAGGG TTTTTTCCAG TTGGACCTTC TTCAGTATAA AGTTTGGAAT 7505 |||||||||| ||| CAGTGTGATC CTG....... .......... .......... .......... .......... 307 TTTTTACTTG CATAAGCTGT ATGTAATGAC TGTCATTGCA TTTTTGTTTC ATTTTGGTTT 7565 .......... .......... .......... .......... .......... .......... 307 TGGGTATTAG GGAATTTGAT ATTGCTCTAG ACGAATGATA GCATCGTGTT CTGATAGTGA 7625 .......... .......... .......... .......... .......... .......... 307 ATGAATGAAC TTGCTACCTT TGCAGAGTTT TGTTTGTTCA TTGGCATCTT TTGTATGCAA 7685 .......... .......... .......... .......... .......... .......... 307 TGCAGTGTGT GTTCCTTTTT TCTGAAAAAG ACTTTAAGGA AACAGGATCA GTTTGTTCTG 7745 .......... .......... .......... .......... .......... .......... 307 AAGAAGTTAA GCAGTTTTGT GTTAGTTGTC ATTTGCTGAT AACTATGTTA AAAAAATCTA 7805 .......... .......... .......... .......... .......... .......... 307 GAACAGACTC ACACTTGCGG TTTTATCCGA TACCTAGTTG TTGACTTGAC TTGGATAGTA 7865 .......... .......... .......... .......... .......... .......... 307 AATGTGATGT CTGATACACT GGTTCATTTG TTGCATAGTT AAAGCTTCAA CAAGCTGCTG 7925 .......... .......... .......... .......... .......... .......... 307 TAATTATTTA TTTAAGATGC AGCCACATGC GATAAATAAG CCATGTTGAA GGAACTCCAG 7985 .......... .......... .......... .......... .......... .......... 307 ATATACCTTG CCAATACAAA TGTATTGATT GATTAATTAG ATACTAGACA ACATCTGCTT 8045 .......... .......... .......... .......... .......... .......... 307 CAGTGTTCTT TCTCCCCCTC TTTCTTCTTT CCCCTTGAAC TGGCTCCGTT ATCAGTCATA 8105 .......... .......... .......... .......... .......... .......... 307 TGTATGTTAG GATGGTGATG ACATATGCAA AAGTGAATCC AAAAAAATTG AGTTTATGGG 8165 .......... .......... .......... .......... .......... .......... 307 TTCTAGATTC TAGATTCTAT ATTGTAGAAA GAGAACTTAT TTGGTTTTGG ATAAGTTTAT 8225 .......... .......... .......... .......... .......... .......... 307 ACATATTAAG TGGATTTGAA GGGTGTGTGG ATTGGCTTTT TCCAAAGTGG CTTATTGTCT 8285 .......... .......... .......... .......... .......... .......... 307 AAAAGCTAAA AAACATAAGT TGGGAACGCC CAACTTTGGC TTTTGGCTTT CTTTGTACTT 8345 .......... .......... .......... .......... .......... .......... 307 TTTCAGCCTA AAAGAAAGTG CTGATGTTTT CCAAAAGCTT CCAAAACTAA AGAAGAGCTT 8405 .......... .......... .......... .......... .......... .......... 307 AAAAGGCAGA CCTTCTTAAT CCAAACACCC ACTTAGCCTC TCTACCTCCA AGGTAGGGGT 8465 .......... .......... .......... .......... .......... .......... 307 AAGGTCTGCT AAGCTTTGTC CTCTCTATTC CCGACTTTGT AGGATTACAT AGGGTGTTTT 8525 .......... .......... .......... .......... .......... .......... 307 GTTGTATATC AATTGGGTTT CTTGACACAA ATATAGAGTT TTCGCCAAGG TTATTGAGTT 8585 .......... .......... .......... .......... .......... .......... 307 CTATTGAATC TATAGCGGCA TAGCCTCAAC TTTAGCTGCA CCCTTGAATA TGTGATTGTG 8645 .......... .......... .......... .......... .......... .......... 307 CCAATACAAG TGGTCGGAGT AGGTAGAAAG GATTAGGGCG TGTTTTGGTC CAAGGAATTT 8705 .......... .......... .......... .......... .......... .......... 307 ATGTAGGAGC AGTCAATTAG TTGTACTACA CATATAATTA GGTCTCTTTG GAGAAAATTA 8765 .......... .......... .......... .......... .......... .......... 307 GGCGGCATTG GGTTTTCTGA ATCAACTATT ATGTCAGAAC TTTTACGTAT AATTAAGTTG 8825 .......... .......... .......... .......... .......... .......... 307 TCTCGATAGA CTTGTAATTT TTTAGAATCT ATTCAGATTT ATCTAAAGAT AGGATAGTCT 8885 .......... .......... .......... .......... .......... .......... 307 TAACATTACT GTGCATTGTA CCTTTATATG CCTATAAATA CAAAATTCTT ATATTTTATA 8945 .......... .......... .......... .......... .......... .......... 307 TACCACTTTG TATGGTCTGT TATGTTTTTT CTTTTATTTG CTATGCCTTG TTCTAATCTG 9005 .......... .......... .......... .......... .......... .......... 307 TATTATCTTT TCCAAGCTGA CTTGTTATGT GTTACTTGAG CCGAGGGTCT GTGTGCACTG 9065 .......... .......... .......... .......... .......... .......... 307 GGTATGTTAT TGTATATGCC AGTAAACATA TTCATAAATT CTCAGTTTGT CATGTCATGT 9125 .......... .......... .......... .......... .......... .......... 307 TCTCTGCAGT TTCTCCATTT CATTACATTG GCCAAGTGCA AAAGCAAAAA ATGATTGAGC 9185 .......... .......... .......... .......... .......... .......... 307 AAGTGTAAAT GGTCTTGCGT ATTTATAAGT TCTGTATTGC CATGTCTGTA ATAAATATTT 9245 .......... .......... .......... .......... .......... .......... 307 ACCTTCTCTA GAAGCAGAAC ATTACTTATA AATTACAGGG CATTGAGCTT CAACATTTTT 9305 .......... .......... .......... .......... .......... .......... 307 TTCATCTGAA TTTTTGCTCC TTACTTCCCT AGCTGTGCTA AGTTTATGGA CCTAAATATA 9365 .......... .......... .......... .......... .......... .......... 307 GCCATACATG GATGCATTTA GTGGTAATTT ATGGATGACA AGATTTGCTC AAAGTTTTCA 9425 .......... .......... .......... .......... .......... .......... 307 AAAGTGAACT TGCTTTGCTA CTGGAAGTGT CATATGTAAT GGCACTGACT CATACTACTG 9485 .......... .......... .......... .......... .......... .......... 307 CTCTGTGTGT CACATATTAG TTGATTGATG ACTTAGAATA CTTAGTTCCT TATTGATACT 9545 .......... .......... .......... .......... .......... .......... 307 CTTATCTTTC TCCTTTAATT GAGACTCAGT TTTTGGTAAC ATCTACCGTG GTCCCCTTCT 9605 .......... .......... .......... .......... .......... .......... 307 TTGAACAATT TTTTTATTTA CATTAATGTG GTGTACTTTA CTTTTGTAAT TTATTCCATC 9665 .......... .......... .......... .......... .......... .......... 307 TTATATGAAT GATTATGTAG TTTCTAATTC AAGCGGTTAA ACATCTTCAT TGTGTATTAT 9725 .......... .......... .......... .......... .......... .......... 307 CCAGAGAAGG AAAACTTGTG CTTGTATGGT CTCCCAAATG AACAATGGGA GGTCAATCTG 9785 |||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ....AGAAGG AAAACTTGTG CTTGTATGGT CTCCCAAATG AACAATGGGA GGTCAATCTG 363 CCTGCTGAAG AAGTACCACC TGAACTCCCT GAGCCTGCTC TTGGTATTAA CTTTGCTAGA 9845 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCTGCTGAAG AAGTACCACC TGAACTCCCT GAGCCTGCTC TTGGTATTAA CTTTGCTAGA 423 GATGGGATGG AAGACAAGGA TTGGCTATCC TTAGTTGCTG TCCATAGTGA TTCCTGGCTG 9905 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GATGGGATGG AAGACAAGGA TTGGCTATCC TTAGTTGCTG TCCATAGTGA TTCCTGGCTG 483 CTTTCTGTCG CCTTCTATTT TGGTGCTAGA TTTGGGTTTG ATAAAGCCAG CAGGTACTTT 9965 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTTCTGTCG CCTTCTATTT TGGTGCTAGA TTTGGGTTTG ATAAAGCCAG CAGGTACTTT 543 TTCTCATCTG AACTTTCATT TGTTGTAAGC ACACATATTT CCATGCTTAA TGGACGTTAA 10025 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCTCATCTG AACTTTCATT TGTTGTAAGC ACACATATTT CCATGCTTAA TGGACGTTAA 603 GAGCATTACT TGATTGAAAT ATGTGATAAG TTTTGGAGAT ATTTGCTTCT AGTATTTCCT 10085 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAGCATTACT TGATTGAAAT ATGTGATAAG TTTTGGAGAT ATTTGCTTCT AGTATTTCCT 663 TTGATGATTT TATATTTGTA TTTGATCCCC AGTAAATCAA ATTGTTTATC TGTTTTCCTG 10145 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGATGATTT TATATTTGTA TTTGATCCCC AGTAAATCAA ATTGTTTATC TGTTTTCCTG 723 GTTGTTCTCT CAAGCAATTC CATGCATCTA TTTGATTTGT GATGTGTTGC ATTGAAGTAA 10205 | |||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGTGTTCTCT CAAGCAATTC CATGCATCTA TTTGATTTGT GATGTGTTGC ATTGAAGTAA 783 TCGTTTTTTG TTATTAGTAA TATAGTTCAG ACAGTGATT- GGTATCAAGC TGCCACTACT 10264 |||||||||| ||||||||| ||||| |||| ||||||||| |||||| ||| |||||||||| TCGTTTTTTG GTATTAGTAA TATAGNTCAG ACAGTGATTG GGTATC-AGC TGCCACTACT 842 AGCTGTGCTA CT-GTCCCAG AGGAGGTATA AATGTTAAGT CTGCTTTATT AAGTTGTACG 10323 |||||||||| || ||||||| | |||||||| |||| ||||| ||||||||| ||| |||||| AGCTGTGCTA CTGGTCCCAG A-GAGGTATA AATGGTAAGT CTGCTTTATA AAG-TGTACG 900 CA 10325 || CA 902 hqPGS_C06HBa0120H21.1-9+_SGN-U335998+ (6726 6923,7426 7458,9730 10325) ******************************************************************************** EST sequence 5 +strand 570 n (File: SGN-U330577+) 1 CTACTGCCTC ATAGCTCACT CTCTTTGCTT CAACAATGGC GGCTTCTTGC ATGCTCAGAT 61 CCTCTTTCCT TTCTCCTAAC CATAATCTTC ATCAACAATC CTCTCCTAAA TCTAACCGTG 121 CTTCCTTCTT CACTCCTATC AAAGCCACAT CTTCAACAGA TGATGCAATC TCAAAATCTC 181 CACAACTTCA GAAGCACCGC CGCCCTGCTG ACGAGAATAT CCGTGAGGAA GCCCGACGCG 241 ACGTATCTTC CCACAATTTC TCTGCTAGGT ATGTACCTTT TAATGCCGAT CCTAACTCCA 301 GCGAGTGGTA TCCTCTCGAT GAGATTATTT ATCGCAGCCG ATCAGGTGGC CTACTTGATG 361 TCCAACACGA TATGGACGCC CTCAAGAAAT TTGACGGCCA GTACTGGCGG TCCCTGTTTG 421 ATTCCAGGGT GGGCAAGACA ACATGGCCTT ATGGTTCTGG TGTTTGGTCC AAGAAGGAAT 481 GGGTCCTGCC TGAAATTGAC AGTGATGATA TTGTCAGCGC TTTTGAAGGA AATTCCAATC 541 TTTTTTGGGC TGAGCGTTAT GGGAAACAAT Predicted gene structure (within gDNA segment 12289 to 15132): Exon 1 12889 13156 ( 268 n); cDNA 1 268 ( 268 n); score: 1.000 Intron 1 13157 14220 (1064 n); Pd: 0.991 (s: 1.00), Pa: 1.000 (s: 1.00) Exon 2 14221 14522 ( 302 n); cDNA 269 570 ( 302 n); score: 0.997 MATCH C06HBa0120H21.1-9+ SGN-U330577+ 0.998 570 1.000 C PGS_C06HBa0120H21.1-9+_SGN-U330577+ (12889 13156,14221 14522) Alignment (genomic DNA sequence = upper lines): CTACTGCCTC ATAGCTCACT CTCTTTGCTT CAACAATGGC GGCTTCTTGC ATGCTCAGAT 12948 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTACTGCCTC ATAGCTCACT CTCTTTGCTT CAACAATGGC GGCTTCTTGC ATGCTCAGAT 60 CCTCTTTCCT TTCTCCTAAC CATAATCTTC ATCAACAATC CTCTCCTAAA TCTAACCGTG 13008 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCTCTTTCCT TTCTCCTAAC CATAATCTTC ATCAACAATC CTCTCCTAAA TCTAACCGTG 120 CTTCCTTCTT CACTCCTATC AAAGCCACAT CTTCAACAGA TGATGCAATC TCAAAATCTC 13068 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTCCTTCTT CACTCCTATC AAAGCCACAT CTTCAACAGA TGATGCAATC TCAAAATCTC 180 CACAACTTCA GAAGCACCGC CGCCCTGCTG ACGAGAATAT CCGTGAGGAA GCCCGACGCG 13128 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CACAACTTCA GAAGCACCGC CGCCCTGCTG ACGAGAATAT CCGTGAGGAA GCCCGACGCG 240 ACGTATCTTC CCACAATTTC TCTGCTAGGT ATGCGTACTA TGCTACCCAT TTAACATTCC 13188 |||||||||| |||||||||| |||||||| ACGTATCTTC CCACAATTTC TCTGCTAG.. .......... .......... .......... 268 TCTGTTATTG CGTTGTTCCG GTTTTGGTTT TCCCTTATTA TTTGAATGAG CGCATTTAAC 13248 .......... .......... .......... .......... .......... .......... 268 CTTCTATTCA TTCAATTTTT TTAGTTTTAG CTTTAACTGA ATCATTATTC TAAATGAATT 13308 .......... .......... .......... .......... .......... .......... 268 GAGAAACTAG AAAAATAGTG GAAATAGATA TTCCTTATTT AATCATCCTT CAAGATTCAA 13368 .......... .......... .......... .......... .......... .......... 268 TTTTTTTTCC AGCTTAATTT GTGGGTAGGT ATGTCATTGG GAAACTACCT GGGTTTGCAT 13428 .......... .......... .......... .......... .......... .......... 268 ACAAATACCT TCCCCAGACC CCATTCGCGT GACTACACTG AAGATGTGGT TGTTGTTGCT 13488 .......... .......... .......... .......... .......... .......... 268 TCGAGAAAAG TTGTTATTTT GGGGGTTCTA AATGGTGGTT TGTTGACTGG GAGTTTTCCA 13548 .......... .......... .......... .......... .......... .......... 268 ACGATGGGAC TATGGGGTTG CCTCATTACT GTGAATGTTG GAGAGATTGG ACTTCACTGG 13608 .......... .......... .......... .......... .......... .......... 268 CTCAAGAATG GAGAGGCTTT CTGCCTCCAC AGTATCTGAA TGAAGTACAG ATCGGTTTTT 13668 .......... .......... .......... .......... .......... .......... 268 TGAGTGTCTG GTATATGCCT TAACGACAAC ACCATAACCA ATATAATCCC ACAAGTGGGG 13728 .......... .......... .......... .......... .......... .......... 268 TCTATGGAGG GTAGGAGATG TACACAAACC TTACTTCTAC CTTTGCAGGG TAGAAGGAAA 13788 .......... .......... .......... .......... .......... .......... 268 GTTAAGGGTA TTTGCCTTCT GTGGGCGTAA ATGTGGAATC AGAAGGGCTG AAATGCCTAG 13848 .......... .......... .......... .......... .......... .......... 268 GAAATCTTAA GGATGAAGGG TGTGCGCGCT TGGTTTGATG CCCTCCACCC ATCAGTACTG 13908 .......... .......... .......... .......... .......... .......... 268 GACATCAGCA ATCAACACTC TTTTCACCTT TTATCTTCTG TTTTGATTGT AAAAATAATG 13968 .......... .......... .......... .......... .......... .......... 268 ATTATGTTAA GTTTTGTAAC AGCATTATCT CAGGGAACAA AATCACACAT TTCAAAGGTT 14028 .......... .......... .......... .......... .......... .......... 268 GAATAGGCTC AGTTAAATAA CCTTGGCATT TAAAGTGGAA TAATATAAAT TCACTAAATT 14088 .......... .......... .......... .......... .......... .......... 268 TGCTACTTCC CAACTGTAGC AAGCAAATCG TAGACATATA TATTTACATC AAGCTCCCAT 14148 .......... .......... .......... .......... .......... .......... 268 CTACTTTACC AAAATGTTGG TGCAATTGTA CTCTGCACCT ATTAACTTAA AATTTTGCAT 14208 .......... .......... .......... .......... .......... .......... 268 ATGCTTCTGC AGGTATGTAC CTTTTAATGC CGATCCTAAC TCCAGCGAGT GGTATCCTCT 14268 |||||||| |||||||||| |||||||||| |||||||||| |||||||||| .......... ..GTATGTAC CTTTTAATGC CGATCCTAAC TCCAGCGAGT GGTATCCTCT 316 CGATGAGATT ATTTATCGCA GCCGATCAGG TGGCCTACTT GATGTCCAAC ACGATATGGA 14328 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CGATGAGATT ATTTATCGCA GCCGATCAGG TGGCCTACTT GATGTCCAAC ACGATATGGA 376 CGCCCTCAAG AAATTTGACG GCCAGTACTG GCGGTCCCTG TTTGATTCCA GGGTGGGCAA 14388 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CGCCCTCAAG AAATTTGACG GCCAGTACTG GCGGTCCCTG TTTGATTCCA GGGTGGGCAA 436 GACAACATGG CCTTATGGTT CTGGTGTTTG GTCCAAGAAG GAATGGGTCC TGCCTGAAAT 14448 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GACAACATGG CCTTATGGTT CTGGTGTTTG GTCCAAGAAG GAATGGGTCC TGCCTGAAAT 496 TGACAGTGAT GATATTGTCA GTGCTTTTGA AGGAAATTCC AATCTTTTTT GGGCTGAGCG 14508 |||||||||| |||||||||| | |||||||| |||||||||| |||||||||| |||||||||| TGACAGTGAT GATATTGTCA GCGCTTTTGA AGGAAATTCC AATCTTTTTT GGGCTGAGCG 556 TTATGGGAAA CAAT 14522 |||||||||| |||| TTATGGGAAA CAAT 570 hqPGS_C06HBa0120H21.1-9+_SGN-U330577+ (12889 13156,14221 14522) ******************************************************************************** EST sequence 3 +strand 1895 n (File: SGN-U316421+) 1 TGGTCTTCTC AGCTTCACAA TCACACAACA ATGGCGGCTT CTTGCATGCT TAGATCCTCT 61 TTCCTCTCTC CTGGTCTTCC TCAACTTCAT CATCAATCAA CTTCAAAACC CAATAATGGT 121 ATTCATTTCT TCGCTCCGAT TAAAGCTACA GCCACAAACG ATGCGATTTC TCAACAGAAG 181 CACCGACGTC CTGCTGACGA GAACATCCGT GAAGAAGCGC GCCGTCACTG CTCATCTCAT 241 AATTTTTCAG CCAGGTATGT TCCTTTTAAT GCTGGTCCTA CTTCTGATGA ATGGTATTCT 301 CTAGATGAGA TTGTTTATCG GAGTCGGTCT GGTGGATTAC TTGATGTTCA GCATGATATG 361 GATGCCTTAA AGAAGTTTGA TGGTCAGTAT TGGCGATCCT TGTTTGATTC TCGTGTCGGT 421 AAGACCACTT GGCCTTATGG GTCCGGTGTT TGGTCTAAGA AGGAATGGGT TCTGCCTGAA 481 ATTGATAGTG ATGATATTGT TAGTGCTTTT GAAGGGAACT CGAATCTGTT TTGGGCTGAG 541 CGTTTTGGCA AACAGTTTCT AGGCATGACT GATTTGTGGG TCAAACACTG TGGGATTAGC 601 CATACTGGTA GTTTTAAGGA TCTTGGGATG ACTGTTTTGG TGAGTCAAGT TAATCGGTTG 661 CGCAAAATGC ATAAACCGGT CGTCGGTGTG GGGTGTGCTT CTACTGGAGA CACATCTGCG 721 GCCCTGTCGG CTTACTGTGC ATCTGCAGGC ATTCCATCAA TTGTATTTCT ACCTGCGAAT 781 AAGATTTCTA TGGCGCAACT GGTTCAACCT ATTGCAAATG GGGCTTTTGT GTTGAGTATT 841 GACACTGATT TTGATGGTTG TATGCAGTTG ATTCGTGAAG TTACTGCTGA GTTGCCTATT 901 TACTTGGCAA ATTCGTTGAA TAGTTTGAGG TTGGAAGGGC AAAAGACGGC AGCAATCGAG 961 ATTTTACAGC AGTTTGATTG GGAAGTTCCG GAGTGGGTTA TAGTTCCTGG TGGGAACTTA 1021 GGCAACATAT ATGCATTTTA TAAAGGTTTT CAGATGTGCA AAGAGTTGGG ACTTGTGGAT 1081 CGCATCCCTA GGCTTGTTTG TGCTCAAGCT GCCAACGCTA ATCCGCTCTA CTTGCATTAC 1141 AAATCTGGTT GGAAAGACTT CAAACCTGTG AAGGCGAACA CAACATTTGC ATCTGCTATA 1201 CAGATTGGTG ACCCAGTGTC TATAGACAGA GCTGTTTTTG CCCTGCAGAA CTGCAACGGG 1261 ATAGTTGAGG AGGCCACCGA GGAGGAGTTG ATGGATGCTA TGGCTCAGGC AGACTCCACT 1321 GGGATGTTCA TTTGCCCGCA TACTGGTGTG GCATTGACTG CGCTGTTCAA GCTGAGAAAC 1381 AGTGGAGTCA TTGCACCAAC TGATAGGACT GTGGTTGTGA GTACAGCTCA TGGATTGAAG 1441 TTTACTCAAT CCAAGATTGA TTACCACTCA AAGGAAATAA AAGACATGGA ATGTCGGTTT 1501 TCTAACCCAC CTGTGGAAGT GAAAGCAGAT TTTGGATCAG TTATGGATGT TCTGAAGAGC 1561 TATTTGTTGA GCCAAAATTC CAAGCTATGA TGTGTTGTCA GTTTTAAACA TAGTAAGTTT 1621 GTAATATGTA CTCCTTTTGT CAGCATATGG TCATTTATCG CCATTAATTT GCCGTATAGA 1681 AATGACCAGA TGGTGATTTT GTTGTTGGCA GAGTTGGGGA GAGAAAGGCT CTACATTACT 1741 TTTGAGAGTA CTACGAACTC GAGTCCTTTT ATGGAACTCT GTTTTTTGTA ACAGTCTGCT 1801 TTTTGTAGCA TTGCTTTGGT GCATCACAAT GGAAACTGCC TTAATTTGAC CTGTTTTGTT 1861 AAATTATATG ACCTTAAAGA GAAAAAAAAA AAAAA Predicted gene structure (within gDNA segment 6806 to 20431): Exon 1 12897 13156 ( 260 n); cDNA 4 254 ( 251 n); score: 0.662 Intron 1 13157 14220 (1064 n); Pd: 0.991 (s: 0.72), Pa: 1.000 (s: 0.76) Exon 2 14221 15567 (1347 n); cDNA 255 1601 (1347 n); score: 0.857 PPA cDNA 1882 1895 MATCH C06HBa0120H21.1-9+ SGN-U316421+ 0.825 1607 0.848 C PGS_C06HBa0120H21.1-9+_SGN-U316421+ (12897 13156,14221 15567) Alignment (genomic DNA sequence = upper lines): TCATAGCTCA CTCTCTTTGC TTCAACAATG GCGGCTTCTT GCATGCTCAG ATCCTCTTTC 12956 || | | || | | |||||||| |||||||||| ||||||| || |||||||||| TCTTCTCAGC TTCACAATCA CACAACAATG GCGGCTTCTT GCATGCTTAG ATCCTCTTTC 63 CTTTCTCCT- AAC--CATAA TCTTCATCAA CAATCCTCTC CTAAATCTAA CCGT-G-CTT 13011 || |||||| | | | | |||||||| ||||| || | ||| | || | | || CTCTCTCCTG GTCTTCCTCA ACTTCATCAT CAATCAACTT CAAAACCCAA TAATGGTATT 123 C-CTTCTTCA CTCCTATCAA AGCCACATCT TCAACAGATG ATGCAATCTC AAAATCTCCA 13070 | |||||| |||| || || ||| ||| | | ||| | | |||| || | ||| CATTTCTTCG CTCCGATTAA AGCTACAGC- -C-ACAAACG ATGCGAT-T- ----TCT--- 171 CAACTTCAGA AGCACCGCCG CCCTGCTGAC GAGAATATCC GTGAGGAAGC CCGACGCGAC 13130 ||| |||| ||||||| || ||||||||| ||||| |||| |||| ||||| || || || CAA---CAGA AGCACCGACG TCCTGCTGAC GAGAACATCC GTGAAGAAGC GCGCCGTCAC 228 GTATCTTCCC ACAATTTCTC TGCTAGGTAT GCGTACTATG CTACCCATTT AACATTCCTC 13190 || || | | ||||| || || || TGCTCATCTC ATAATTTTTC AGCCAG.... .......... .......... .......... 254 TGTTATTGCG TTGTTCCGGT TTTGGTTTTC CCTTATTATT TGAATGAGCG CATTTAACCT 13250 .......... .......... .......... .......... .......... .......... 254 TCTATTCATT CAATTTTTTT AGTTTTAGCT TTAACTGAAT CATTATTCTA AATGAATTGA 13310 .......... .......... .......... .......... .......... .......... 254 GAAACTAGAA AAATAGTGGA AATAGATATT CCTTATTTAA TCATCCTTCA AGATTCAATT 13370 .......... .......... .......... .......... .......... .......... 254 TTTTTTCCAG CTTAATTTGT GGGTAGGTAT GTCATTGGGA AACTACCTGG GTTTGCATAC 13430 .......... .......... .......... .......... .......... .......... 254 AAATACCTTC CCCAGACCCC ATTCGCGTGA CTACACTGAA GATGTGGTTG TTGTTGCTTC 13490 .......... .......... .......... .......... .......... .......... 254 GAGAAAAGTT GTTATTTTGG GGGTTCTAAA TGGTGGTTTG TTGACTGGGA GTTTTCCAAC 13550 .......... .......... .......... .......... .......... .......... 254 GATGGGACTA TGGGGTTGCC TCATTACTGT GAATGTTGGA GAGATTGGAC TTCACTGGCT 13610 .......... .......... .......... .......... .......... .......... 254 CAAGAATGGA GAGGCTTTCT GCCTCCACAG TATCTGAATG AAGTACAGAT CGGTTTTTTG 13670 .......... .......... .......... .......... .......... .......... 254 AGTGTCTGGT ATATGCCTTA ACGACAACAC CATAACCAAT ATAATCCCAC AAGTGGGGTC 13730 .......... .......... .......... .......... .......... .......... 254 TATGGAGGGT AGGAGATGTA CACAAACCTT ACTTCTACCT TTGCAGGGTA GAAGGAAAGT 13790 .......... .......... .......... .......... .......... .......... 254 TAAGGGTATT TGCCTTCTGT GGGCGTAAAT GTGGAATCAG AAGGGCTGAA ATGCCTAGGA 13850 .......... .......... .......... .......... .......... .......... 254 AATCTTAAGG ATGAAGGGTG TGCGCGCTTG GTTTGATGCC CTCCACCCAT CAGTACTGGA 13910 .......... .......... .......... .......... .......... .......... 254 CATCAGCAAT CAACACTCTT TTCACCTTTT ATCTTCTGTT TTGATTGTAA AAATAATGAT 13970 .......... .......... .......... .......... .......... .......... 254 TATGTTAAGT TTTGTAACAG CATTATCTCA GGGAACAAAA TCACACATTT CAAAGGTTGA 14030 .......... .......... .......... .......... .......... .......... 254 ATAGGCTCAG TTAAATAACC TTGGCATTTA AAGTGGAATA ATATAAATTC ACTAAATTTG 14090 .......... .......... .......... .......... .......... .......... 254 CTACTTCCCA ACTGTAGCAA GCAAATCGTA GACATATATA TTTACATCAA GCTCCCATCT 14150 .......... .......... .......... .......... .......... .......... 254 ACTTTACCAA AATGTTGGTG CAATTGTACT CTGCACCTAT TAACTTAAAA TTTTGCATAT 14210 .......... .......... .......... .......... .......... .......... 254 GCTTCTGCAG GTATGTACCT TTTAATGCCG ATCCTAACTC CAGCGAGTGG TATCCTCTCG 14270 |||||| ||| |||||||| | ||||| || || ||| ||| |||| | .......... GTATGTTCCT TTTAATGCTG GTCCTACTTC TGATGAATGG TATTCTCTAG 304 ATGAGATTAT TTATCGCAGC CGATCAGGTG GCCTACTTGA TGTCCAACAC GATATGGACG 14330 |||||||| | |||||| || || || |||| | ||||||| ||| || || |||||||| | ATGAGATTGT TTATCGGAGT CGGTCTGGTG GATTACTTGA TGTTCAGCAT GATATGGATG 364 CCCTCAAGAA ATTTGACGGC CAGTACTGGC GGTCCCTGTT TGATTCCAGG GTGGGCAAGA 14390 || | ||||| ||||| || ||||| |||| | ||| |||| |||||| | || || |||| CCTTAAAGAA GTTTGATGGT CAGTATTGGC GATCCTTGTT TGATTCTCGT GTCGGTAAGA 424 CAACATGGCC TTATGGTTCT GGTGTTTGGT CCAAGAAGGA ATGGGTCCTG CCTGAAATTG 14450 | || ||||| |||||| || |||||||||| | |||||||| |||||| ||| |||||||||| CCACTTGGCC TTATGGGTCC GGTGTTTGGT CTAAGAAGGA ATGGGTTCTG CCTGAAATTG 484 ACAGTGATGA TATTGTCAGT GCTTTTGAAG GAAATTCCAA TCTTTTTTGG GCTGAGCGTT 14510 | |||||||| |||||| ||| |||||||||| | || || || ||| |||||| |||||||||| ATAGTGATGA TATTGTTAGT GCTTTTGAAG GGAACTCGAA TCTGTTTTGG GCTGAGCGTT 544 ATGGGAAACA ATTCCTAGGC ATGAATGATT TGTGGGTCAA ACATTGTGGA ATCAGCCACA 14570 ||| ||||| || |||||| |||| ||||| |||||||||| ||| ||||| || ||||| | TTGGCAAACA GTTTCTAGGC ATGACTGATT TGTGGGTCAA ACACTGTGGG ATTAGCCATA 604 CCGGTAGCTT CAAGGATCTT GGCATGACCG TATTGGTGAG TCAAGTAAAT CGGTTGCGGA 14630 | ||||| || ||||||||| || ||||| | | |||||||| |||||| ||| |||||||| | CTGGTAGTTT TAAGGATCTT GGGATGACTG TTTTGGTGAG TCAAGTTAAT CGGTTGCGCA 664 AAATGCATAA ACCAGTTGTG GGTGTTGGCT GTGCTTCCAC TGGAGACACG TCTGCTGCGC 14690 |||||||||| ||| || || ||||| || | ||||||| || ||||||||| ||||| || | AAATGCATAA ACCGGTCGTC GGTGTGGGGT GTGCTTCTAC TGGAGACACA TCTGCGGCCC 724 TGTCAGCTTA CTGTGCATCT GCAGGCATTC CATCAATTGT GTTTTTACCT GCAAATAAGA 14750 |||| ||||| |||||||||| |||||||||| |||||||||| ||| ||||| || ||||||| TGTCGGCTTA CTGTGCATCT GCAGGCATTC CATCAATTGT ATTTCTACCT GCGAATAAGA 784 TATCTATGGC GCAACTGGTT CAACCAATAG CCAATGGGGC CTTTGTGTTG AGTATCGACA 14810 | |||||||| |||||||||| ||||| || | | |||||||| ||||||||| ||||| |||| TTTCTATGGC GCAACTGGTT CAACCTATTG CAAATGGGGC TTTTGTGTTG AGTATTGACA 844 CTGATTTTGA TGGTTGTATG CAGTTGATTC GCGAAGTCAC AGCTGAGTTG CCAATTTACT 14870 |||||||||| |||||||||| |||||||||| | ||||| || ||||||||| || ||||||| CTGATTTTGA TGGTTGTATG CAGTTGATTC GTGAAGTTAC TGCTGAGTTG CCTATTTACT 904 TGGCTAATTC CTTAAACAGT TTGAGGCTAG AAGGACAAAA GACTGCAGCA ATAGAGATAC 14930 |||| ||||| || || ||| |||||| | | |||| ||||| ||| |||||| || ||||| TGGCAAATTC GTTGAATAGT TTGAGGTTGG AAGGGCAAAA GACGGCAGCA ATCGAGATTT 964 TGCAGCAGTT TGAATGGGAA GTTCCAGACT GGGTGATAGT TCCCGGTGGT AACCTGGGCA 14990 | |||||||| ||| |||||| ||||| || | |||| ||||| ||| ||||| ||| | |||| TACAGCAGTT TGATTGGGAA GTTCCGGAGT GGGTTATAGT TCCTGGTGGG AACTTAGGCA 1024 ATATATATGC ATTTTACAAA GGTTTCCACA TGTGCAAGGA GCTGGGGCTT GTTGATCGTA 15050 | |||||||| |||||| ||| ||||| || | ||||||| || | |||| ||| || ||||| | ACATATATGC ATTTTATAAA GGTTTTCAGA TGTGCAAAGA GTTGGGACTT GTGGATCGCA 1084 TCCCAAGACT TGTTTGTGCT CAAGCAGCCA ATGCAAATCC ACTTTACGTG CATTATAAGT 15110 |||| || || |||||||||| ||||| |||| | || ||||| || ||| || ||||| || | TCCCTAGGCT TGTTTGTGCT CAAGCTGCCA ACGCTAATCC GCTCTACTTG CATTACAAAT 1144 CTGGTTGGAA AGATTTCAAA CCTGTTAAGG CAAATACAAC ATTTGCATCT GCTATACAGA 15170 |||||||||| ||| |||||| ||||| |||| | || ||||| |||||||||| |||||||||| CTGGTTGGAA AGACTTCAAA CCTGTGAAGG CGAACACAAC ATTTGCATCT GCTATACAGA 1204 TTGGTGACCC AGTATCTATT GATAGGGCTG TCTTTGCTCT AAAGAAGTCC AATGGGATAG 15230 |||||||||| ||| ||||| || || |||| | ||||| || |||| | | || ||||||| TTGGTGACCC AGTGTCTATA GACAGAGCTG TTTTTGCCCT GCAGAACTGC AACGGGATAG 1264 TGGAGGAGGC TACCGAGGAA GAGTTGATGG ATGCGATGGC TCAAGCTGAC TCAACTGGGA 15290 | |||||||| |||||||| |||||||||| |||| ||||| ||| || ||| || ||||||| TTGAGGAGGC CACCGAGGAG GAGTTGATGG ATGCTATGGC TCAGGCAGAC TCCACTGGGA 1324 TGTTCATATG CCCGCACACT GGTGTGGCAT TGACTGCACT CTCCAAGCTG AGAAAGGCTG 15350 ||||||| || |||||| ||| |||||||||| ||||||| || | ||||||| ||||| || TGTTCATTTG CCCGCATACT GGTGTGGCAT TGACTGCGCT GTTCAAGCTG AGAAACAGTG 1384 GAGTTATTGC GCCTACTGAT AGGACAGTGG TTGTGAGTAC AGCTCATGGG TTGAAGTTTA 15410 |||| ||||| || |||||| ||||| |||| |||||||||| ||||||||| |||||||||| GAGTCATTGC ACCAACTGAT AGGACTGTGG TTGTGAGTAC AGCTCATGGA TTGAAGTTTA 1444 CTCAATCCAA GGTTGATTAT CATTCTAAAG AAATAAAGAA CATGGAGTGT CGGTTTGCTA 15470 |||||||||| | ||||||| || || || | ||||||| | |||||| ||| |||||| ||| CTCAATCCAA GATTGATTAC CACTCAAAGG AAATAAAAGA CATGGAATGT CGGTTTTCTA 1504 ATCCCCCGGT ACAGGTGAAA GCAGACTTTG GATCTGTCAT GGATGTTCTG AAGAAGTATC 15530 | || || || | |||||| ||||| |||| |||| || || |||||||||| |||| ||| ACCCACCTGT GGAAGTGAAA GCAGATTTTG GATCAGTTAT GGATGTTCTG AAGAGCTATT 1564 TATTGAGCAA AAATTCCAAG TTCTAACTTT TTGGAAG 15567 | |||||| | |||||||||| | | | | ||| || TGTTGAGCCA AAATTCCAAG CTATGATGTG TTGTCAG 1601 hqPGS_C06HBa0120H21.1-9+_SGN-U316421+ (14221 15567) ******************************************************************************** EST sequence 7 +strand 1429 n (File: SGN-U322169+) 1 AATGATTTGT GGGTCAAACA TTGTGGAATC AGCCACACCG GTAGCTTCAA GGATCTTGGC 61 ATGACCGTAT TGGTGAGTCA AGTAAATCGG TTGCGGAAAA TGCATAAACC AGTTGTGGGT 121 GTTGGCTGTG CTTCCACTGG AGACACGTCT GCTGCGCTGT CAGCTTACTG TGCATCTGCA 181 GGCATTCCAT CAATTGTGTT TTTACCTGCA AATAAGATAT CTATGGCGCA ACTGGTTCAA 241 CCAATAGCCA ATGGGGCCTT TGTGTTGAGT ATCGACACTG ATTTTGATGG TTGTATGCAG 301 TTGATTCGCG AAGTCACAGC TGAGTTGCCA ATTTACTTGG CTAATTCCTT AAACAGTTTG 361 AGGCTAGAAG GACAAAAGAC TGCAGCAATA GATATACTGC AGCAGTTTGA ATGGGAAGTT 421 CCAGACTGGG TGATAGTTCC CGGTGGTAAC CTGGGCAATA TATATGCATT TTACAAAGGT 481 TTCCACATGT GCAAGGAGCT GGGGCTTGTT GATCGTATCC CAAGACTTGT TTGTGCTCAA 541 GCAGCCAATG CAAATCCACT TTACGTGCAT TATAAGTCTG GTTGGAAAGA TTTCAAACCT 601 GTTAAGGCAA ATACAACATT TGCATCTGCT ATACAGATTG GTGACCCAGT ATCTATTGAT 661 AGGGCTGTCT TTGCTCTAAA GAAGTCCAAT GGGATAGTGG AGGAGGCTAC CGAGGAAGAG 721 TTGATGGATG CGATGGCTCA AGCTGACTCA ACTGGGATGT TCATATGCCC GCACACTGGT 781 GTGGCATTGA CTGCACTCTC CAAGCTGAGA AAGGCTGGAG TTATTGCGCC TACTGATAGG 841 ACAGTGGTTG TGAGTACAGC TCATGGGTTG AAGTTTACTC AATCCAAGGT TGATTATCAT 901 TCTAAAGAAA TAAAGAACAT GGAGTGTCGG TTTGCTAATC CCCCGGTACA GGTGAAAGCA 961 GACTTTGGAT CTGTCATGGA TGTTCTGAAG AAGTATCTAT TGAGCAAAAA TTCCAAGTTC 1021 TAACTTTTTG GAAGAGTAAA TTCTGCTTAC CAAAATCATC ATCTTTTGGT TCATCTTCGT 1081 CCATGATGAA GAATTTGGTA TAATTAGGTG AAGTCATGAA AGGCTCTGAG TTTCCAATGA 1141 GAGAACTATG TTTTTAAGTA GGAACTTTTA GTGGTTGCAA CTTTCTACTA TTTTTATGCT 1201 CCTTTGTCTA ATGTTGGCGT TCTGAATATT TTGTAGCATT AGCATCTGCC TCAAATGAGT 1261 GGTGTTTTCT AGTTTGAGGG TGATAATGTC ATGAAAACTG AATAAATAGC GCCTCATATG 1321 GGATTTTTAT CTACCATCTG AATATATATC GTGAAAATCA TACCTTTAAA AAAAAAAAAA 1381 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAA Predicted gene structure (within gDNA segment 13934 to 17130): Exon 1 14534 15900 (1367 n); cDNA 1 1367 (1367 n); score: 0.999 PPA cDNA 1379 1429 MATCH C06HBa0120H21.1-9+ SGN-U322169+ 0.999 1367 0.957 C PGS_C06HBa0120H21.1-9+_SGN-U322169+ (14534 15900) Alignment (genomic DNA sequence = upper lines): AATGATTTGT GGGTCAAACA TTGTGGAATC AGCCACACCG GTAGCTTCAA GGATCTTGGC 14593 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AATGATTTGT GGGTCAAACA TTGTGGAATC AGCCACACCG GTAGCTTCAA GGATCTTGGC 60 ATGACCGTAT TGGTGAGTCA AGTAAATCGG TTGCGGAAAA TGCATAAACC AGTTGTGGGT 14653 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATGACCGTAT TGGTGAGTCA AGTAAATCGG TTGCGGAAAA TGCATAAACC AGTTGTGGGT 120 GTTGGCTGTG CTTCCACTGG AGACACGTCT GCTGCGCTGT CAGCTTACTG TGCATCTGCA 14713 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTTGGCTGTG CTTCCACTGG AGACACGTCT GCTGCGCTGT CAGCTTACTG TGCATCTGCA 180 GGCATTCCAT CAATTGTGTT TTTACCTGCA AATAAGATAT CTATGGCGCA ACTGGTTCAA 14773 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGCATTCCAT CAATTGTGTT TTTACCTGCA AATAAGATAT CTATGGCGCA ACTGGTTCAA 240 CCAATAGCCA ATGGGGCCTT TGTGTTGAGT ATCGACACTG ATTTTGATGG TTGTATGCAG 14833 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCAATAGCCA ATGGGGCCTT TGTGTTGAGT ATCGACACTG ATTTTGATGG TTGTATGCAG 300 TTGATTCGCG AAGTCACAGC TGAGTTGCCA ATTTACTTGG CTAATTCCTT AAACAGTTTG 14893 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGATTCGCG AAGTCACAGC TGAGTTGCCA ATTTACTTGG CTAATTCCTT AAACAGTTTG 360 AGGCTAGAAG GACAAAAGAC TGCAGCAATA GAGATACTGC AGCAGTTTGA ATGGGAAGTT 14953 |||||||||| |||||||||| |||||||||| || ||||||| |||||||||| |||||||||| AGGCTAGAAG GACAAAAGAC TGCAGCAATA GATATACTGC AGCAGTTTGA ATGGGAAGTT 420 CCAGACTGGG TGATAGTTCC CGGTGGTAAC CTGGGCAATA TATATGCATT TTACAAAGGT 15013 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCAGACTGGG TGATAGTTCC CGGTGGTAAC CTGGGCAATA TATATGCATT TTACAAAGGT 480 TTCCACATGT GCAAGGAGCT GGGGCTTGTT GATCGTATCC CAAGACTTGT TTGTGCTCAA 15073 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCCACATGT GCAAGGAGCT GGGGCTTGTT GATCGTATCC CAAGACTTGT TTGTGCTCAA 540 GCAGCCAATG CAAATCCACT TTACGTGCAT TATAAGTCTG GTTGGAAAGA TTTCAAACCT 15133 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCAGCCAATG CAAATCCACT TTACGTGCAT TATAAGTCTG GTTGGAAAGA TTTCAAACCT 600 GTTAAGGCAA ATACAACATT TGCATCTGCT ATACAGATTG GTGACCCAGT ATCTATTGAT 15193 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTTAAGGCAA ATACAACATT TGCATCTGCT ATACAGATTG GTGACCCAGT ATCTATTGAT 660 AGGGCTGTCT TTGCTCTAAA GAAGTCCAAT GGGATAGTGG AGGAGGCTAC CGAGGAAGAG 15253 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGGGCTGTCT TTGCTCTAAA GAAGTCCAAT GGGATAGTGG AGGAGGCTAC CGAGGAAGAG 720 TTGATGGATG CGATGGCTCA AGCTGACTCA ACTGGGATGT TCATATGCCC GCACACTGGT 15313 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGATGGATG CGATGGCTCA AGCTGACTCA ACTGGGATGT TCATATGCCC GCACACTGGT 780 GTGGCATTGA CTGCACTCTC CAAGCTGAGA AAGGCTGGAG TTATTGCGCC TACTGATAGG 15373 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTGGCATTGA CTGCACTCTC CAAGCTGAGA AAGGCTGGAG TTATTGCGCC TACTGATAGG 840 ACAGTGGTTG TGAGTACAGC TCATGGGTTG AAGTTTACTC AATCCAAGGT TGATTATCAT 15433 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACAGTGGTTG TGAGTACAGC TCATGGGTTG AAGTTTACTC AATCCAAGGT TGATTATCAT 900 TCTAAAGAAA TAAAGAACAT GGAGTGTCGG TTTGCTAATC CCCCGGTACA GGTGAAAGCA 15493 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCTAAAGAAA TAAAGAACAT GGAGTGTCGG TTTGCTAATC CCCCGGTACA GGTGAAAGCA 960 GACTTTGGAT CTGTCATGGA TGTTCTGAAG AAGTATCTAT TGAGCAAAAA TTCCAAGTTC 15553 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GACTTTGGAT CTGTCATGGA TGTTCTGAAG AAGTATCTAT TGAGCAAAAA TTCCAAGTTC 1020 TAACTTTTTG GAAGAGTAAA TTCTGCTTAC CAAAATCATC ATCTTTTGGT TCATCTTCGT 15613 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TAACTTTTTG GAAGAGTAAA TTCTGCTTAC CAAAATCATC ATCTTTTGGT TCATCTTCGT 1080 CCATGATGAA GAATTTGGTA TAATTAGGTG AAGTCATGAA AGGCTCTGAG TTTCCAATGA 15673 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCATGATGAA GAATTTGGTA TAATTAGGTG AAGTCATGAA AGGCTCTGAG TTTCCAATGA 1140 GAGAACTATG TTTTTAAGTA GGAACTTTTA GTGGTTGCAA CTTTCTACTA TTTTTATGCT 15733 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAGAACTATG TTTTTAAGTA GGAACTTTTA GTGGTTGCAA CTTTCTACTA TTTTTATGCT 1200 CCTTTGTCTA ATGTTGGCGT TCTGAATATT TTGTAGCATT AGCATCTGCC TCAAATGAGT 15793 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCTTTGTCTA ATGTTGGCGT TCTGAATATT TTGTAGCATT AGCATCTGCC TCAAATGAGT 1260 GGTGTTTTCT AGTTTGAGGG TGATAATGTC ATGAAAACTG AATAAATAGC GCCTCATATG 15853 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGTGTTTTCT AGTTTGAGGG TGATAATGTC ATGAAAACTG AATAAATAGC GCCTCATATG 1320 GGATTTTTAT CTACCATCTG AATATATATC GTGAAAATCA TACCTTT 15900 |||||||||| |||||||||| |||||||||| |||||||||| ||||||| GGATTTTTAT CTACCATCTG AATATATATC GTGAAAATCA TACCTTT 1367 hqPGS_C06HBa0120H21.1-9+_SGN-U322169+ (14534 15900) Total number of EST alignments reported: 7 ________________________________________________________________________________ Predicted gene locations (3) in segment 1 to 21437: PGL 1 (+ strand): 1721 4032 AGS-1 (1721 2040,2203 2343,3077 3169,3275 3337,3726 4032) SCR (e 1.000 d 0.974 a 0.984,e 1.000 d 0.997 a 0.989,e 1.000 d 0.993 a 0.924,e 1.000 d 0.995 a 0.895,e 1.000) Exon 1 1721 2040 ( 320 n); score: 1.000 Intron 1 2041 2202 ( 162 n); Pd: 0.974 Pa: 0.984 Exon 2 2203 2343 ( 141 n); score: 1.000 Intron 2 2344 3076 ( 733 n); Pd: 0.997 Pa: 0.989 Exon 3 3077 3169 ( 93 n); score: 1.000 Intron 3 3170 3274 ( 105 n); Pd: 0.993 Pa: 0.924 Exon 4 3275 3337 ( 63 n); score: 1.000 Intron 4 3338 3725 ( 388 n); Pd: 0.995 Pa: 0.895 Exon 5 3726 4032 ( 307 n); score: 1.000 PGS (1721 2040,2203 2343,3077 3169,3275 3337,3726 4032) SGN-U321764+ 3-phase translation of AGS-1 (+strand): . . . . . . 1721 ATTCAATGTCTTCTTCAAGAATGAAAATGGAGACCTCTGATGGGGGTAGATTGTCTCCAG I Q C L L Q E - K W R P L M G V D C L Q F N V F F K N E N G D L - W G - I V S R S M S S S R M K M E T S D G G R L S P . . . . . . 1781 ACATCTCTGCATTGCATGTTCATCCAGCGGACCAGGATATGATCACAAGTTCTTCTAGAG T S L H C M F I Q R T R I - S Q V L L E H L C I A C S S S G P G Y D H K F F - R D I S A L H V H P A D Q D M I T S S S R . . . . . . 1841 AGAAGTTTGAGGGTGCGGTGTCTTCCAAGATTCAAGGGGCTCCACAATCTGCTAATTCTC R S L R V R C L P R F K G L H N L L I L E V - G C G V F Q D S R G S T I C - F S E K F E G A V S S K I Q G A P Q S A N S . . . . . . 1901 GTGTACGACCTAGTAGTTCTGTTCTTTCCGGTTCTGATGGAACAGGTGCTGCCTCAACGT V Y D L V V L F F P V L M E Q V L P Q R C T T - - F C S F R F - W N R C C L N V R V R P S S S V L S G S D G T G A A S T . . . . . . 1961 CAGCTGACAATGGATTATCACGAACCTCTTCTGTAAATTCATTTTCGTCAGAAAAATCCA Q L T M D Y H E P L L - I H F R Q K N P S - Q W I I T N L F C K F I F V R K I H S A D N G L S R T S S V N S F S S E K S . . : . . . . 2021 CATTGAATCCACATGCTAAG : GAATTTAAATTAAATCCTAATGCAAAGAGTTTCATGCCAT H - I H M L R : N L N - I L M Q R V S C H I E S T C - : G I - I K S - C K E F H A I T L N P H A K : E F K L N P N A K S F M P . . . . . . 2243 TTCAATCACCTTTGAGACCTGCTTCTCCGGTGTCTGATAGTTCCTTCTATTATCCAGCTG F N H L - D L L L R C L I V P S I I Q L S I T F E T C F S G V - - F L L L S S W F Q S P L R P A S P V S D S S F Y Y P A . . . . . : . 2303 GTGTGGCTACTGTTCCCAATGTGCATGGCATGCCTGTTGGG : GTAGGTCCTTCATTTTCTC V W L L F P M C M A C L L G : - V L H F L C G Y C S Q C A W H A C W : G R S F I F S G V A T V P N V H G M P V G : V G P S F S . . . . . . 3096 CACATCAGCCTGTTATGTTTAATCCACAAGCTACACCTGTACCACAACAATTTTTTCATC H I S L L C L I H K L H L Y H N N F F I T S A C Y V - S T S Y T C T T T I F S S P H Q P V M F N P Q A T P V P Q Q F F H . . : . . . . 3156 CAAATGGACCACAG : TATGGGCAGCAGATGATGATTGGTCCCCCTCGGCAAGTAGTCTATA Q M D H S : M G S R - - L V P L G K - S I K W T T : V W A A D D D W S P S A S S L Y P N G P Q : Y G Q Q M M I G P P R Q V V Y . . : . . . . 3321 TGCCGAATTACCCCGCT : GAAATGCGACGAGACTACTAATCAGTTGGCAAACCATATTGCG C R I T P L : K C D E T T N Q L A N H I A A E L P R : - N A T R L L I S W Q T I L R M P N Y P A : E M R R D Y - S V G K P Y C . . . . . . 3769 TGGTGGGTTGAACCGATGGATGCTGACATGAGATTTCATGGATTGGTGGAGGAGGTTTAG W W V E P M D A D M R F H G L V E E V - G G L N R W M L T - D F M D W W R R F S V V G - T D G C - H E I S W I G G G G L . . . . . . 3829 CTGGTTGATGAAGGGGGATTCCAATGATTTGATTAGAGCTTTTCCTTATACTGGGGTATC L V D E G G F Q - F D - S F S L Y W G I W L M K G D S N D L I R A F P Y T G V S A G - - R G I P M I - L E L F L I L G Y . . . . . . 3889 AGTAATTGTTACTTTGTCATAATCATTAGATTTGTTAACTTTCAGATTTACAGTCTTTCT S N C Y F V I I I R F V N F Q I Y S L S V I V T L S - S L D L L T F R F T V F L Q - L L L C H N H - I C - L S D L Q S F . . . . . . 3949 TGAAGTTAACTGTGGTGTTTCCTTGGTATGCTGCTGTTGATATTTCTTCTCTTTGATCTG - S - L W C F L G M L L L I F L L F D L E V N C G V S L V C C C - Y F F S L I C L K L T V V F P W Y A A V D I S S L - S . . . 4009 TATTCCTAATATTGTATGTTTCTC Y S - Y C M F L I P N I V C F V F L I L Y V S Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0120H21.1-9+_PGL-1_AGS-1_PPS_1 (1723 2040,2203 2343,3077 3169,3275 3337,3726 3746) (frame '0'; 633 bp, 211 residues) 1 SMSSSRMKME TSDGGRLSPD ISALHVHPAD QDMITSSSRE KFEGAVSSKI QGAPQSANSR 61 VRPSSSVLSG SDGTGAASTS ADNGLSRTSS VNSFSSEKST LNPHAKEFKL NPNAKSFMPF 121 QSPLRPASPV SDSSFYYPAG VATVPNVHGM PVGVGPSFSP HQPVMFNPQA TPVPQQFFHP 181 NGPQYGQQMM IGPPRQVVYM PNYPAEMRRD Y- PGL 2 (+ strand): 6662 12115 AGS-1 (6662 6923,7426 7458,9730 9958,11132 11261,11480 12115) SCR (e 1.000 d 0.922 a 0.997,e 1.000 d 0.748 a 0.842,e 1.000 d 0.999 a 0.994,e 1.000 d 0.979 a 0.970,e 0.998) Exon 1 6662 6923 ( 262 n); score: 1.000 Intron 1 6924 7425 ( 502 n); Pd: 0.922 Pa: 0.997 Exon 2 7426 7458 ( 33 n); score: 1.000 Intron 2 7459 9729 (2271 n); Pd: 0.748 Pa: 0.842 Exon 3 9730 9958 ( 229 n); score: 1.000 Intron 3 9959 11131 (1173 n); Pd: 0.999 Pa: 0.994 Exon 4 11132 11261 ( 130 n); score: 1.000 Intron 4 11262 11479 ( 218 n); Pd: 0.979 Pa: 0.970 Exon 5 11480 12115 ( 636 n); score: 0.998 PGS (6662 6923,7426 7458,9730 9807) SGN-U335999+ PGS (6664 6923,7426 7458,9730 9958,11132 11261,11480 12115) SGN-U317215+ 3-phase translation of AGS-1 (+strand): . . . . . . 6662 ATCTTCTCTTCACTTATTAAAACCCTTTTCCCCATTCTTTGTTTCTACACACAATTCAAA I F S S L I K T L F P I L C F Y T Q F K S S L H L L K P F S P F F V S T H N S K L L F T Y - N P F P H S L F L H T I Q . . . . . . 6722 ATCCCCTCCTCCCTCTCTTTCCCCCCCTTTGAACTCTGCAGCCGTACGCCACTCTCATTT I P S S L S F P P F E L C S R T P L S F S P P P S L S P P L N S A A V R H S H F N P L L P L F P P L - T L Q P Y A T L I . . . . . . 6782 TCCTGCGAATTTCCTTCGAGGTTGCTCTTCTGATTAATGGACGCCGGTGGAGGAGGAGAA S C E F P S R L L F - L M D A G G G G E P A N F L R G C S S D - W T P V E E E N F L R I S F E V A L L I N G R R W R R R . . . . . . 6842 CAGTTTGATTCCCGAACTGTGGAAGATGTGTTTGGGGATTTCAAGAGACGACGAACTGCT Q F D S R T V E D V F G D F K R R R T A S L I P E L W K M C L G I S R D D E L L T V - F P N C G R C V W G F Q E T T N C . . . : . . . : 6902 TTGATTAAGGCTCTTACTGTTG : ATGTGGAAGAATTTTATCAGCAGTGTGATCCTG : AGAAG L I K A L T V : D V E E F Y Q Q C D P : E K - L R L L L L : M W K N F I S S V I L : R R F D - G S Y C - : C G R I L S A V - S - : E . . . . . . 9735 GAAAACTTGTGCTTGTATGGTCTCCCAAATGAACAATGGGAGGTCAATCTGCCTGCTGAA E N L C L Y G L P N E Q W E V N L P A E K T C A C M V S Q M N N G R S I C L L K G K L V L V W S P K - T M G G Q S A C - . . . . . . 9795 GAAGTACCACCTGAACTCCCTGAGCCTGCTCTTGGTATTAACTTTGCTAGAGATGGGATG E V P P E L P E P A L G I N F A R D G M K Y H L N S L S L L L V L T L L E M G W R S T T - T P - A C S W Y - L C - R W D . . . . . . 9855 GAAGACAAGGATTGGCTATCCTTAGTTGCTGTCCATAGTGATTCCTGGCTGCTTTCTGTC E D K D W L S L V A V H S D S W L L S V K T R I G Y P - L L S I V I P G C F L S G R Q G L A I L S C C P - - F L A A F C . . . . . : . 9915 GCCTTCTATTTTGGTGCTAGATTTGGGTTTGATAAAGCCAGCAG : GAAGAAGCTTTTCAAC A F Y F G A R F G F D K A S R : K K L F N P S I L V L D L G L I K P A : G R S F S T R L L F W C - I W V - - S Q Q : E E A F Q . . . . . . 11148 ATGATAAATGAACTGCCTACAATATATGAAGTAGTGACTGGTGCCTCAAAGAAACAACAG M I N E L P T I Y E V V T G A S K K Q Q - - M N C L Q Y M K - - L V P Q R N N R H D K - T A Y N I - S S D W C L K E T T . . . . . . : 11208 AAAGAAAAATCTTCTGGCCATAGTGGCAAGAAATCCAAGTCAAATTCCAAGGCG : AGGGCA K E K S S G H S G K K S K S N S K A : R A K K N L L A I V A R N P S Q I P R R : G H E R K I F W P - W Q E I Q V K F Q G : E G . . . . . . 11486 CAAGACTATCAGGAGAAGTTAGCAAAGTTGCAGGCTAAAGATGAAGAGGAGGAGGGTTTG Q D Y Q E K L A K L Q A K D E E E E G L K T I R R S - Q S C R L K M K R R R V W T R L S G E V S K V A G - R - R G G G F . . . . . . 11546 GATGAGCAGGAGGACGAGGATGAGCATGGCGAGACACTGTGTGGTGCCTGTGGAGAAAAT D E Q E D E D E H G E T L C G A C G E N M S R R T R M S M A R H C V V P V E K I G - A G G R G - A W R D T V W C L W R K . . . . . . 11606 TATGCAGCAGATGAATTTTGGATATGCTGTGACATTTGTGAAAAGTGGTTCCACGGCAAG Y A A D E F W I C C D I C E K W F H G K M Q Q M N F G Y A V T F V K S G S T A S L C S R - I L D M L - H L - K V V P R Q . . . . . . 11666 TGTGTGAAGATCACCCCTGCCAAGGCTGAGCATATCAAGCAATACAAGTGTCCGTCTTGT C V K I T P A K A E H I K Q Y K C P S C V - R S P L P R L S I S S N T S V R L V V C E D H P C Q G - A Y Q A I Q V S V L . . . . . . 11726 AGCCACAAGAGACCTCGAGCTGACATATAAAATTTGATAAAGTAGCATCTTCTGTTAGTC S H K R P R A D I - N L I K - H L L L V A T R D L E L T Y K I - - S S I F C - S - P Q E T S S - H I K F D K V A S S V S . . . . . . 11786 AGGTTATTCAGGCTGTTTGGCTCTCTACCTCGTGGAGGTTTAATAGCACTGTGGTTCATC R L F R L F G S L P R G G L I A L W F I G Y S G C L A L Y L V E V - - H C G S S Q V I Q A V W L S T S W R F N S T V V H . . . . . . 11846 TTTGTACACTGTTGGTTAGAACATGCAGCTGATGAATAGGGAGGGTTCTATTGATGGTGG F V H C W L E H A A D E - G G F Y - W W L Y T V G - N M Q L M N R E G S I D G G L C T L L V R T C S - - I G R V L L M V . . . . . . 11906 TTTAGTTATAAAAAAAGTAGTCTTAAAGTAGCAAAAACAGGTACATACTATGAGTTTAAA F S Y K K S S L K V A K T G T Y Y E F K L V I K K V V L K - Q K Q V H T M S L N V - L - K K - S - S S K N R Y I L - V - . . . . . . 11966 CAGTTGATGTCTCTTCTTTAGTTGTTTTAGTAGGAGGATTTGATGTGGTCTATTGAGCTT Q L M S L L - L F - - E D L M W S I E L S - C L F F S C F S R R I - C G L L S F T V D V S S L V V L V G G F D V V Y - A . . . . . . 12026 CCACCAACTTTGGTGTACATGTCTCTTTGGTTTTTAAACACTTAATGTTGCCTATAATTG P P T L V Y M S L W F L N T - C C L - L H Q L W C T C L F G F - T L N V A Y N C S T N F G V H V S L V F K H L M L P I I . . . 12086 TGAATTATGGTCATGTTTATTACATACACC - I M V M F I T Y T E L W S C L L H T V N Y G H V Y Y I H Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0120H21.1-9+_PGL-2_AGS-1_PPS_1 (6815 6923,7426 7458,9730 9958,11132 11261,11480 11755) (frame '1'; 774 bp, 258 residues) 1 LMDAGGGGEQ FDSRTVEDVF GDFKRRRTAL IKALTVDVEE FYQQCDPEKE NLCLYGLPNE 61 QWEVNLPAEE VPPELPEPAL GINFARDGME DKDWLSLVAV HSDSWLLSVA FYFGARFGFD 121 KASRKKLFNM INELPTIYEV VTGASKKQQK EKSSGHSGKK SKSNSKARAQ DYQEKLAKLQ 181 AKDEEEEGLD EQEDEDEHGE TLCGACGENY AADEFWICCD ICEKWFHGKC VKITPAKAEH 241 IKQYKCPSCS HKRPRADI- AGS-2 (6726 6923,7426 7458,9730 10325) SCR (e 1.000 d 0.922 a 0.997,e 1.000 d 0.748 a 0.842,e 0.978) Exon 1 6726 6923 ( 198 n); score: 1.000 Intron 1 6924 7425 ( 502 n); Pd: 0.922 Pa: 0.997 Exon 2 7426 7458 ( 33 n); score: 1.000 Intron 2 7459 9729 (2271 n); Pd: 0.748 Pa: 0.842 Exon 3 9730 10325 ( 596 n); score: 0.978 PGS (6726 6923,7426 7458,9730 10325) SGN-U335998+ 3-phase translation of AGS-2 (+strand): . . . . . . 6726 CCTCCTCCCTCTCTTTCCCCCCCTTTGAACTCTGCAGCCGTACGCCACTCTCATTTTCCT P P P S L S P P L N S A A V R H S H F P L L P L F P P L - T L Q P Y A T L I F L S S L S F P P F E L C S R T P L S F S . . . . . . 6786 GCGAATTTCCTTCGAGGTTGCTCTTCTGATTAATGGACGCCGGTGGAGGAGGAGAACAGT A N F L R G C S S D - W T P V E E E N S R I S F E V A L L I N G R R W R R R T V C E F P S R L L F - L M D A G G G G E Q . . . . . . 6846 TTGATTCCCGAACTGTGGAAGATGTGTTTGGGGATTTCAAGAGACGACGAACTGCTTTGA L I P E L W K M C L G I S R D D E L L - - F P N C G R C V W G F Q E T T N C F D F D S R T V E D V F G D F K R R R T A L . . : . . . . : 6906 TTAAGGCTCTTACTGTTG : ATGTGGAAGAATTTTATCAGCAGTGTGATCCTG : AGAAGGAAA L R L L L L : M W K N F I S S V I L : R R K - G S Y C - : C G R I L S A V - S - : E G K I K A L T V : D V E E F Y Q Q C D P : E K E . . . . . . 9739 ACTTGTGCTTGTATGGTCTCCCAAATGAACAATGGGAGGTCAATCTGCCTGCTGAAGAAG T C A C M V S Q M N N G R S I C L L K K L V L V W S P K - T M G G Q S A C - R S N L C L Y G L P N E Q W E V N L P A E E . . . . . . 9799 TACCACCTGAACTCCCTGAGCCTGCTCTTGGTATTAACTTTGCTAGAGATGGGATGGAAG Y H L N S L S L L L V L T L L E M G W K T T - T P - A C S W Y - L C - R W D G R V P P E L P E P A L G I N F A R D G M E . . . . . . 9859 ACAAGGATTGGCTATCCTTAGTTGCTGTCCATAGTGATTCCTGGCTGCTTTCTGTCGCCT T R I G Y P - L L S I V I P G C F L S P Q G L A I L S C C P - - F L A A F C R L D K D W L S L V A V H S D S W L L S V A . . . . . . 9919 TCTATTTTGGTGCTAGATTTGGGTTTGATAAAGCCAGCAGGTACTTTTTCTCATCTGAAC S I L V L D L G L I K P A G T F S H L N L F W C - I W V - - S Q Q V L F L I - T F Y F G A R F G F D K A S R Y F F S S E . . . . . . 9979 TTTCATTTGTTGTAAGCACACATATTTCCATGCTTAATGGACGTTAAGAGCATTACTTGA F H L L - A H I F P C L M D V K S I T - F I C C K H T Y F H A - W T L R A L L D L S F V V S T H I S M L N G R - E H Y L . . . . . . 10039 TTGAAATATGTGATAAGTTTTGGAGATATTTGCTTCTAGTATTTCCTTTGATGATTTTAT L K Y V I S F G D I C F - Y F L - - F Y - N M - - V L E I F A S S I S F D D F I I E I C D K F W R Y L L L V F P L M I L . . . . . . 10099 ATTTGTATTTGATCCCCAGTAAATCAAATTGTTTATCTGTTTTCCTGGTTGTTCTCTCAA I C I - S P V N Q I V Y L F S W L F S Q F V F D P Q - I K L F I C F P G C S L K Y L Y L I P S K S N C L S V F L V V L S . . . . . . 10159 GCAATTCCATGCATCTATTTGATTTGTGATGTGTTGCATTGAAGTAATCGTTTTTTGTTA A I P C I Y L I C D V L H - S N R F L L Q F H A S I - F V M C C I E V I V F C Y S N S M H L F D L - C V A L K - S F F V . . . . . . 10219 TTAGTAATATAGTTCAGACAGTGATTGGTATCAAGCTGCCACTACTAGCTGTGCTACTGT L V I - F R Q - L V S S C H Y - L C Y C - - Y S S D S D W Y Q A A T T S C A T V I S N I V Q T V I G I K L P L L A V L L . . . . . 10279 CCCAGAGGAGGTATAAATGTTAAGTCTGCTTTATTAAGTTGTACGCA P R G G I N V K S A L L S C T P E E V - M L S L L Y - V V R S Q R R Y K C - V C F I K L Y A Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0120H21.1-9+_PGL-2_AGS-2_PPS_1 (6815 6923,7426 7458,9730 10025) (frame '0'; 435 bp, 145 residues) 1 LMDAGGGGEQ FDSRTVEDVF GDFKRRRTAL IKALTVDVEE FYQQCDPEKE NLCLYGLPNE 61 QWEVNLPAEE VPPELPEPAL GINFARDGME DKDWLSLVAV HSDSWLLSVA FYFGARFGFD 121 KASRYFFSSE LSFVVSTHIS MLNGR- PGL 3 (+ strand): 12889 15900 AGS-1 (12889 13156,14221 15900) SCR (e 1.000 d 0.991 a 1.000,e 0.999) Exon 1 12889 13156 ( 268 n); score: 1.000 Intron 1 13157 14220 (1064 n); Pd: 0.991 Pa: 1.000 Exon 2 14221 15900 (1680 n); score: 0.999 PGS (12889 13156,14221 14522) SGN-U330577+ PGS (14221 15567) SGN-U316421+ PGS (14534 15900) SGN-U322169+ 3-phase translation of AGS-1 (+strand): . . . . . . 12889 CTACTGCCTCATAGCTCACTCTCTTTGCTTCAACAATGGCGGCTTCTTGCATGCTCAGAT L L P H S S L S L L Q Q W R L L A C S D Y C L I A H S L C F N N G G F L H A Q I T A S - L T L F A S T M A A S C M L R . . . . . . 12949 CCTCTTTCCTTTCTCCTAACCATAATCTTCATCAACAATCCTCTCCTAAATCTAACCGTG P L S F L L T I I F I N N P L L N L T V L F P F S - P - S S S T I L S - I - P C S S F L S P N H N L H Q Q S S P K S N R . . . . . . 13009 CTTCCTTCTTCACTCCTATCAAAGCCACATCTTCAACAGATGATGCAATCTCAAAATCTC L P S S L L S K P H L Q Q M M Q S Q N L F L L H S Y Q S H I F N R - C N L K I S A S F F T P I K A T S S T D D A I S K S . . . . . . 13069 CACAACTTCAGAAGCACCGCCGCCCTGCTGACGAGAATATCCGTGAGGAAGCCCGACGCG H N F R S T A A L L T R I S V R K P D A T T S E A P P P C - R E Y P - G S P T R P Q L Q K H R R P A D E N I R E E A R R . . . : . . . 13129 ACGTATCTTCCCACAATTTCTCTGCTAG : GTATGTACCTTTTAATGCCGATCCTAACTCCA T Y L P T I S L L : G M Y L L M P I L T P R I F P Q F L C - : V C T F - C R S - L Q D V S S H N F S A R : Y V P F N A D P N S . . . . . . 14253 GCGAGTGGTATCCTCTCGATGAGATTATTTATCGCAGCCGATCAGGTGGCCTACTTGATG A S G I L S M R L F I A A D Q V A Y L M R V V S S R - D Y L S Q P I R W P T - C S E W Y P L D E I I Y R S R S G G L L D . . . . . . 14313 TCCAACACGATATGGACGCCCTCAAGAAATTTGACGGCCAGTACTGGCGGTCCCTGTTTG S N T I W T P S R N L T A S T G G P C L P T R Y G R P Q E I - R P V L A V P V - V Q H D M D A L K K F D G Q Y W R S L F . . . . . . 14373 ATTCCAGGGTGGGCAAGACAACATGGCCTTATGGTTCTGGTGTTTGGTCCAAGAAGGAAT I P G W A R Q H G L M V L V F G P R R N F Q G G Q D N M A L W F W C L V Q E G M D S R V G K T T W P Y G S G V W S K K E . . . . . . 14433 GGGTCCTGCCTGAAATTGACAGTGATGATATTGTCAGTGCTTTTGAAGGAAATTCCAATC G S C L K L T V M I L S V L L K E I P I G P A - N - Q - - Y C Q C F - R K F Q S W V L P E I D S D D I V S A F E G N S N . . . . . . 14493 TTTTTTGGGCTGAGCGTTATGGGAAACAATTCCTAGGCATGAATGATTTGTGGGTCAAAC F F G L S V M G N N S - A - M I C G S N F L G - A L W E T I P R H E - F V G Q T L F W A E R Y G K Q F L G M N D L W V K . . . . . . 14553 ATTGTGGAATCAGCCACACCGGTAGCTTCAAGGATCTTGGCATGACCGTATTGGTGAGTC I V E S A T P V A S R I L A - P Y W - V L W N Q P H R - L Q G S W H D R I G E S H C G I S H T G S F K D L G M T V L V S . . . . . . 14613 AAGTAAATCGGTTGCGGAAAATGCATAAACCAGTTGTGGGTGTTGGCTGTGCTTCCACTG K - I G C G K C I N Q L W V L A V L P L S K S V A E N A - T S C G C W L C F H W Q V N R L R K M H K P V V G V G C A S T . . . . . . 14673 GAGACACGTCTGCTGCGCTGTCAGCTTACTGTGCATCTGCAGGCATTCCATCAATTGTGT E T R L L R C Q L T V H L Q A F H Q L C R H V C C A V S L L C I C R H S I N C V G D T S A A L S A Y C A S A G I P S I V . . . . . . 14733 TTTTACCTGCAAATAAGATATCTATGGCGCAACTGGTTCAACCAATAGCCAATGGGGCCT F Y L Q I R Y L W R N W F N Q - P M G P F T C K - D I Y G A T G S T N S Q W G L F L P A N K I S M A Q L V Q P I A N G A . . . . . . 14793 TTGTGTTGAGTATCGACACTGATTTTGATGGTTGTATGCAGTTGATTCGCGAAGTCACAG L C - V S T L I L M V V C S - F A K S Q C V E Y R H - F - W L Y A V D S R S H S F V L S I D T D F D G C M Q L I R E V T . . . . . . 14853 CTGAGTTGCCAATTTACTTGGCTAATTCCTTAAACAGTTTGAGGCTAGAAGGACAAAAGA L S C Q F T W L I P - T V - G - K D K R - V A N L L G - F L K Q F E A R R T K D A E L P I Y L A N S L N S L R L E G Q K . . . . . . 14913 CTGCAGCAATAGAGATACTGCAGCAGTTTGAATGGGAAGTTCCAGACTGGGTGATAGTTC L Q Q - R Y C S S L N G K F Q T G - - F C S N R D T A A V - M G S S R L G D S S T A A I E I L Q Q F E W E V P D W V I V . . . . . . 14973 CCGGTGGTAACCTGGGCAATATATATGCATTTTACAAAGGTTTCCACATGTGCAAGGAGC P V V T W A I Y M H F T K V S T C A R S R W - P G Q Y I C I L Q R F P H V Q G A P G G N L G N I Y A F Y K G F H M C K E . . . . . . 15033 TGGGGCTTGTTGATCGTATCCCAAGACTTGTTTGTGCTCAAGCAGCCAATGCAAATCCAC W G L L I V S Q D L F V L K Q P M Q I H G A C - S Y P K T C L C S S S Q C K S T L G L V D R I P R L V C A Q A A N A N P . . . . . . 15093 TTTACGTGCATTATAAGTCTGGTTGGAAAGATTTCAAACCTGTTAAGGCAAATACAACAT F T C I I S L V G K I S N L L R Q I Q H L R A L - V W L E R F Q T C - G K Y N I L Y V H Y K S G W K D F K P V K A N T T . . . . . . 15153 TTGCATCTGCTATACAGATTGGTGACCCAGTATCTATTGATAGGGCTGTCTTTGCTCTAA L H L L Y R L V T Q Y L L I G L S L L - C I C Y T D W - P S I Y - - G C L C S K F A S A I Q I G D P V S I D R A V F A L . . . . . . 15213 AGAAGTCCAATGGGATAGTGGAGGAGGCTACCGAGGAAGAGTTGATGGATGCGATGGCTC R S P M G - W R R L P R K S - W M R W L E V Q W D S G G G Y R G R V D G C D G S K K S N G I V E E A T E E E L M D A M A . . . . . . 15273 AAGCTGACTCAACTGGGATGTTCATATGCCCGCACACTGGTGTGGCATTGACTGCACTCT K L T Q L G C S Y A R T L V W H - L H S S - L N W D V H M P A H W C G I D C T L Q A D S T G M F I C P H T G V A L T A L . . . . . . 15333 CCAAGCTGAGAAAGGCTGGAGTTATTGCGCCTACTGATAGGACAGTGGTTGTGAGTACAG P S - E R L E L L R L L I G Q W L - V Q Q A E K G W S Y C A Y - - D S G C E Y S S K L R K A G V I A P T D R T V V V S T . . . . . . 15393 CTCATGGGTTGAAGTTTACTCAATCCAAGGTTGATTATCATTCTAAAGAAATAAAGAACA L M G - S L L N P R L I I I L K K - R T S W V E V Y S I Q G - L S F - R N K E H A H G L K F T Q S K V D Y H S K E I K N . . . . . . 15453 TGGAGTGTCGGTTTGCTAATCCCCCGGTACAGGTGAAAGCAGACTTTGGATCTGTCATGG W S V G L L I P R Y R - K Q T L D L S W G V S V C - S P G T G E S R L W I C H G M E C R F A N P P V Q V K A D F G S V M . . . . . . 15513 ATGTTCTGAAGAAGTATCTATTGAGCAAAAATTCCAAGTTCTAACTTTTTGGAAGAGTAA M F - R S I Y - A K I P S S N F L E E - C S E E V S I E Q K F Q V L T F W K S K D V L K K Y L L S K N S K F - L F G R V . . . . . . 15573 ATTCTGCTTACCAAAATCATCATCTTTTGGTTCATCTTCGTCCATGATGAAGAATTTGGT I L L T K I I I F W F I F V H D E E F G F C L P K S S S F G S S S S M M K N L V N S A Y Q N H H L L V H L R P - - R I W . . . . . . 15633 ATAATTAGGTGAAGTCATGAAAGGCTCTGAGTTTCCAATGAGAGAACTATGTTTTTAAGT I I R - S H E R L - V S N E R T M F L S - L G E V M K G S E F P M R E L C F - V Y N - V K S - K A L S F Q - E N Y V F K . . . . . . 15693 AGGAACTTTTAGTGGTTGCAACTTTCTACTATTTTTATGCTCCTTTGTCTAATGTTGGCG R N F - W L Q L S T I F M L L C L M L A G T F S G C N F L L F L C S F V - C W R - E L L V V A T F Y Y F Y A P L S N V G . . . . . . 15753 TTCTGAATATTTTGTAGCATTAGCATCTGCCTCAAATGAGTGGTGTTTTCTAGTTTGAGG F - I F C S I S I C L K - V V F S S L R S E Y F V A L A S A S N E W C F L V - G V L N I L - H - H L P Q M S G V F - F E . . . . . . 15813 GTGATAATGTCATGAAAACTGAATAAATAGCGCCTCATATGGGATTTTTATCTACCATCT V I M S - K L N K - R L I W D F Y L P S - - C H E N - I N S A S Y G I F I Y H L G D N V M K T E - I A P H M G F L S T I . . . 15873 GAATATATATCGTGAAAATCATACCTTT E Y I S - K S Y L N I Y R E N H T F - I Y I V K I I P Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0120H21.1-9+_PGL-3_AGS-1_PPS_1 (12903 13156,14221 15556) (frame '0'; 1587 bp, 529 residues) 1 LTLFASTMAA SCMLRSSFLS PNHNLHQQSS PKSNRASFFT PIKATSSTDD AISKSPQLQK 61 HRRPADENIR EEARRDVSSH NFSARYVPFN ADPNSSEWYP LDEIIYRSRS GGLLDVQHDM 121 DALKKFDGQY WRSLFDSRVG KTTWPYGSGV WSKKEWVLPE IDSDDIVSAF EGNSNLFWAE 181 RYGKQFLGMN DLWVKHCGIS HTGSFKDLGM TVLVSQVNRL RKMHKPVVGV GCASTGDTSA 241 ALSAYCASAG IPSIVFLPAN KISMAQLVQP IANGAFVLSI DTDFDGCMQL IREVTAELPI 301 YLANSLNSLR LEGQKTAAIE ILQQFEWEVP DWVIVPGGNL GNIYAFYKGF HMCKELGLVD 361 RIPRLVCAQA ANANPLYVHY KSGWKDFKPV KANTTFASAI QIGDPVSIDR AVFALKKSNG 421 IVEEATEEEL MDAMAQADST GMFICPHTGV ALTALSKLRK AGVIAPTDRT VVVSTAHGLK 481 FTQSKVDYHS KEIKNMECRF ANPPVQVKAD FGSVMDVLKK YLLSKNSKF- ... finished at: Mon Aug 28 21:59:55 2006 ________________________________________________________________________________ Sequence 10: C06HBa0120H21.1-10, from 1 to 2162, both strands analyzed. ... started at: Mon Aug 28 21:59:55 2006 EST library file: /tmp/cxgn-bacpublish-resources-ZuEZzO/lycopersicum_combined_unigene_seqs; matching gDNA +strand ... ... found all matches, elapsed seconds = 2 ... matches indexed, elapsed seconds = 2 HitsTableSize = 0 EST library file: /tmp/cxgn-bacpublish-resources-ZuEZzO/lycopersicum_combined_unigene_seqs; matching gDNA -strand ... ... found all matches, elapsed seconds = 3 ... matches indexed, elapsed seconds = 3 HitsTableSize = 2 ******************************************************************************** EST sequence 2 +strand 2470 n (File: SGN-U321764+) 1 GATTTTCGTA ATTATTGATA CAGACAAAGG TGTTTAAACG GACATCGTAA TGGAGTGTGT 61 GTGAGAGAGA ACCCATTTTG AGGAATTCGG GGGCAATTTC AGTTTCTCGG TGGCGGAAGA 121 CGCGCAGAGA CTTCACCCTT TCAATTCCGT TGACACATAT ACAGAGGTGT CGTTTCTGAT 181 AATCAATTTC ACTTTCTCTT TGCTACGATC GCTACATTCT CTCTCTCTCT TTTGTGTAGA 241 CTGAACCCGC GCTGAACTGC ATTTCGCCTA TAAATTAATA TATTTCTTAG GAAGATGCAG 301 CAACCCGTGC AGCCAAGGTC TTCTGCCAAT GGATATGGCC GTCGTAAAGT TGATAGAGAA 361 ATGGGTACTA AGTTGGAGAA TAAAGCGCAA TCTGGAAAAA CTACTTCTCG TCAATTTACA 421 GGTAAAGGGG GAGCATATCA AAGCCTGTCA CATGATCGAC TAGTTTATTT CACTACCTGT 481 CTTGTTGGAC ATCAAGTGGA AGTACAAGTG ATGGACGGAT CAGTGTTTTC AGGGATACTT 541 CATGCGACAA ACGCTGAAAA AGATTTTGGT ATCATTCTGA AAATGGCGCA GTTGATAAAA 601 GATAGCTCTG AGGGGATGAA GAGTAGTTCT GAAACTTTTA GCAAGCCTCC ATTAAAGACT 661 TTGATAATAC CGGGTAAAGA GTTTGCTCAA GTTACAGCAA AGGGTGTGCC TACAACTCTA 721 GACGGTTTCA GAACAGAATT CATGCTGGAA CAGCAGCAGG AACTTTTGAC TGATTCATGC 781 ATTTCACAAT CTCGGCATAT TGAGGTAGAG CGGCAATTGG AACGCTGGGT ACCTGATGAT 841 GATGCTCCTG AATGTCCTGA ACTGGACAAT ATATTTGATG GCCATTGGAA TAGGGGCTGG 901 GATCAGTTTC AAGCCAATGA AACACTGTTT GGAGTAAAAA GCACATTTGA TGAGGACCTT 961 TATACGACAA AGCTTGAGAG AGGTCCTCAG ATGAGTGAGT TGGAAAAAGA AGCTCTAAGA 1021 ATAGCTAGAG AAATTGAGGG TGAGGATACA CGTGATCTTC ATCTAGCAGA GGAGAGAGGG 1081 ATCCAACTTC ATGAGAACCT AGAAGTGGAC GAGGAAACCA GATTTTCCGC AGTTGTTAGA 1141 GAGATTGATG ATAGCGGCTA TGACAACTGT GAGGACATCC TGTTGGATTC ACGTAATGAT 1201 GAGACATTTC AAGGTATATC TAGTGCTATG GGGAAGTCAT TTACTGACAT GGGCAGAAGG 1261 AAAATGAATG ATGGTGCACA AGTTTCATTA AGATCTTCCT TCATGGATGA AGTGCAATCT 1321 TCCAAGCTAA GTACCAGTAG GGATGTCTAC CAGACTTGTT ACGATGATCA TGCGAAACAG 1381 TCATCAGCTG AAGTTGTCCT TAAAGGTGGC TCTATCTTAA ACAGGGGTCG CAAAACTCTG 1441 TTTAGTGAGC ATGCTGGAGC AAGTTGGAAT AAGGAGGATA CAAGAAATCA AATGACGGAT 1501 GAAGTTGCTC AAACGTCAGT ATTGGAAGAT TCAATGTCTT CTTCAAGAAT GAAAATGGAG 1561 ACCTCTGATG GGGGTAGATT GTCTCCAGAC ATCTCTGCAT TGCATGTTCA TCCAGCGGAC 1621 CAGGATATGA TCACAAGTTC TTCTAGAGAG AAGTTTGAGG GTGCGGTGTC TTCCAAGATT 1681 CAAGGGGCTC CACAATCTGC TAATTCTCGT GTACGACCTA GTAGTTCTGT TCTTTCCGGT 1741 TCTGATGGAA CAGGTGCTGC CTCAACGTCA GCTGACAATG GATTATCACG AACCTCTTCT 1801 GTAAATTCAT TTTCGTCAGA AAAATCCACA TTGAATCCAC ATGCTAAGGA ATTTAAATTA 1861 AATCCTAATG CAAAGAGTTT CATGCCATTT CAATCACCTT TGAGACCTGC TTCTCCGGTG 1921 TCTGATAGTT CCTTCTATTA TCCAGCTGGT GTGGCTACTG TTCCCAATGT GCATGGCATG 1981 CCTGTTGGGG TAGGTCCTTC ATTTTCTCCA CATCAGCCTG TTATGTTTAA TCCACAAGCT 2041 ACACCTGTAC CACAACAATT TTTTCATCCA AATGGACCAC AGTATGGGCA GCAGATGATG 2101 ATTGGTCCCC CTCGGCAAGT AGTCTATATG CCGAATTACC CCGCTGAAAT GCGACGAGAC 2161 TACTAATCAG TTGGCAAACC ATATTGCGTG GTGGGTTGAA CCGATGGATG CTGACATGAG 2221 ATTTCATGGA TTGGTGGAGG AGGTTTAGCT GGTTGATGAA GGGGGATTCC AATGATTTGA 2281 TTAGAGCTTT TCCTTATACT GGGGTATCAG TAATTGTTAC TTTGTCATAA TCATTAGATT 2341 TGTTAACTTT CAGATTTACA GTCTTTCTTG AAGTTAACTG TGGTGTTTCC TTGGTATGCT 2401 GCTGTTGATA TTTCTTCTCT TTGATCTGTA TTCCTAATAT TGTATGTTTC TCGACAAAAA 2461 AAAAAAAAAA Predicted gene structure (within gDNA segment 2162 to 1): Exon 1 1378 1201 ( 178 n); cDNA 894 1071 ( 178 n); score: 0.994 Intron 1 1200 1095 ( 106 n); Pd: 0.999 (s: 0.98), Pa: 0.997 (s: 1.00) Exon 2 1094 861 ( 234 n); cDNA 1072 1305 ( 234 n); score: 1.000 Intron 2 860 527 ( 334 n); Pd: 0.892 (s: 1.00), Pa: 0.829 (s: 1.00) Exon 3 526 408 ( 119 n); cDNA 1306 1424 ( 119 n); score: 1.000 Intron 3 407 252 ( 156 n); Pd: 0.949 (s: 1.00), Pa: 0.991 (s: 1.00) Exon 4 251 182 ( 70 n); cDNA 1425 1494 ( 70 n); score: 1.000 Intron 4 181 91 ( 91 n); Pd: 0.991 (s: 1.00), Pa: 0.998 (s: 0) Exon 5 90 57 ( 34 n); cDNA 1495 1528 ( 34 n); score: 1.000 PPA cDNA 2454 2470 MATCH C06HBa0120H21.1-10- SGN-U321764+ 0.998 635 0.294 G PGS_C06HBa0120H21.1-10-_SGN-U321764+ (1378 1201,1094 861,526 408,251 182,90 57) Alignment (genomic DNA sequence = upper lines): GGGCTGGGAT CAGTTTCAAG CCAATGAAAC ACTGTTTGGA GTAAAAAGCA CATTTGATGA 1319 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGGCTGGGAT CAGTTTCAAG CCAATGAAAC ACTGTTTGGA GTAAAAAGCA CATTTGATGA 953 GGACCTTTAT ACGACAAAGC TTGAGAGAGG TCCTCAGATG AGTGAGTTGG AAAAAGAAGC 1259 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGACCTTTAT ACGACAAAGC TTGAGAGAGG TCCTCAGATG AGTGAGTTGG AAAAAGAAGC 1013 TCTAAGAATA GCTAGAGAAA TTGAAGGTGA GGATACACGT GATCTTCATC TAGCAGAGGT 1199 |||||||||| |||||||||| |||| ||||| |||||||||| |||||||||| |||||||| TCTAAGAATA GCTAGAGAAA TTGAGGGTGA GGATACACGT GATCTTCATC TAGCAGAG.. 1071 GGGGTAATTG CATTCTTTAT ACAGAAATCT TTGTTTCTAT GTGATTTTTT TCTGCTTCTT 1139 .......... .......... .......... .......... .......... .......... 1071 TCTCTCTAAT ATTAGTTGTG CTGCTTGGAA TTCATTTTTC ATAGGAGAGA GGGATCCAAC 1079 |||||| |||||||||| .......... .......... .......... .......... ....GAGAGA GGGATCCAAC 1087 TTCATGAGAA CCTAGAAGTG GACGAGGAAA CCAGATTTTC CGCAGTTGTT AGAGAGATTG 1019 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCATGAGAA CCTAGAAGTG GACGAGGAAA CCAGATTTTC CGCAGTTGTT AGAGAGATTG 1147 ATGATAGCGG CTATGACAAC TGTGAGGACA TCCTGTTGGA TTCACGTAAT GATGAGACAT 959 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATGATAGCGG CTATGACAAC TGTGAGGACA TCCTGTTGGA TTCACGTAAT GATGAGACAT 1207 TTCAAGGTAT ATCTAGTGCT ATGGGGAAGT CATTTACTGA CATGGGCAGA AGGAAAATGA 899 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCAAGGTAT ATCTAGTGCT ATGGGGAAGT CATTTACTGA CATGGGCAGA AGGAAAATGA 1267 ATGATGGTGC ACAAGTTTCA TTAAGATCTT CCTTCATGGT ATAATTTTGT TTACCCACAT 839 |||||||||| |||||||||| |||||||||| |||||||| ATGATGGTGC ACAAGTTTCA TTAAGATCTT CCTTCATG.. .......... .......... 1305 TCATTAGTCT TTAAGAATTG TTTGTTGCGG TACTGAGGCT TTTTCCTTCT GTTAAGATAG 779 .......... .......... .......... .......... .......... .......... 1305 GGATGACAGA GGAAGATATT GTCATATTAG TCAAGTATCT CAAGCTAGTC GGAAAATATA 719 .......... .......... .......... .......... .......... .......... 1305 TCAATTGGTT GTTGTCTGCT ATTGCTGGTG AATTTAAGAG TTATCCACGT CTTAGATACC 659 .......... .......... .......... .......... .......... .......... 1305 AAGCACTTTC AGTGTGAATC ATGTGGTTTA AGAGCCTATA AATAATACTT TAGCTTCTGT 599 .......... .......... .......... .......... .......... .......... 1305 GTACTCGACT ATAGTAAATA TAAGCTAATT ATATGACGCA TTTTAGTATT TCTATTCCTA 539 .......... .......... .......... .......... .......... .......... 1305 TCTTGCCCCT AGGATGAAGT GCAATCTTCC AAGCTAAGTA CCAGTAGGGA TGTCTACCAG 479 |||||||| |||||||||| |||||||||| |||||||||| |||||||||| .......... ..GATGAAGT GCAATCTTCC AAGCTAAGTA CCAGTAGGGA TGTCTACCAG 1353 ACTTGTTACG ATGATCATGC GAAACAGTCA TCAGCTGAAG TTGTCCTTAA AGGTGGCTCT 419 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACTTGTTACG ATGATCATGC GAAACAGTCA TCAGCTGAAG TTGTCCTTAA AGGTGGCTCT 1413 ATCTTAAACA GGTTTGCTAT GATCATTCCT TTCTTCCACA GTGACACTTC TTATCTCCTT 359 |||||||||| | ATCTTAAACA G......... .......... .......... .......... .......... 1424 AGGCTGTAAA GTCTATTTTC TGGTTTCACG ACTAGTGGTG TCAAAAATAA CGTTAGCCTT 299 .......... .......... .......... .......... .......... .......... 1424 GTTAGTAATC TCATTTATTG TCTCTCTTAC TGAAATGATC CTTTTAGGGG TCGCAAAACT 239 ||| |||||||||| .......... .......... .......... .......... .......GGG TCGCAAAACT 1437 CTGTTTAGTG AGCATGCTGG AGCAAGTTGG AATAAGGAGG ATACAAGAAA TCAAATGGTT 179 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||| CTGTTTAGTG AGCATGCTGG AGCAAGTTGG AATAAGGAGG ATACAAGAAA TCAAATG... 1494 AGCTTATCTC TGACTTCCAT AATGCTTATG TGTTGGGTAC TTTTTCTTGG AGAACTTTCT 119 .......... .......... .......... .......... .......... .......... 1494 TATGGTTGGT GTTTCTTGTT TCATGTAGAC GGATGAAGTT GCTCAAACGT CAGTATTGGA 59 || |||||||||| |||||||||| |||||||||| .......... .......... ........AC GGATGAAGTT GCTCAAACGT CAGTATTGGA 1526 AG 57 || AG 1528 hqPGS_C06HBa0120H21.1-10-_SGN-U321764+ (1378 1201,1094 861,526 408,251 182,90 57) ******************************************************************************** EST sequence 1 +strand 895 n (File: SGN-U337451+) 1 TCCCCGCGGT GGCGGCCGCT CTAGAACTAG TGGATCCCCC GGGCTGCAGG AATTCGGCAC 61 GAGGGGTACC TGATGATGAT GCTCCTGAAT GTCCTGATCT GGACAATATA TTTGATGACC 121 ATTGGAATAG GGGCTGGGAT CAGTTTCAAG CCAATGAAAC ACTGTTTGGA GTAAAAAGCA 181 CATTTGATGA GGACCTTTAT ACGACAAAGC TTGAGAGAGG TCCTCAGATG AGTGAGTTGG 241 AAAAAGAAGC TCTAAGAATA GCTAGAGAAA TTGAAGGTGA GGATACACGT GATCTTCATC 301 TAGCAGAGGA GAGAGGGATC CAACTTCATG AGAACCTAGA AGTGGACGAG GAAACCAGAT 361 TTTCCGCAGT TGTTAGAGAG ATTGATGATA GCGGCTATGA CAACTGTGAG GACATCCTGT 421 TGGATTCACG TAATGATGAG ACATTTCAAG GTATATCTAG TGCTATGGGG AAGTCATTTA 481 CTGACATGGG CAGAAGGAAA ATGAATGATG GTGCACAAGT TTCATTAAGA TCTTCCTTCA 541 TGGTATAATT TTGTTTACCC ACATTCATTA GTCTTTAAGA ATTGTTTGTT GCGGTACTGA 601 GGCTTTTTCC TTCTGTTAAG ATAGGGATGA CAGAGGAAGA TATTGTCATA TTAGTCAAGT 661 ATCTCAAGCT AGTCGGAAAA TATATCAATT GGTTGTTGTC TGCTATTGCT GGTGAATTTA 721 AGAGTTATCC ACGTCTTAGA TACCAAGCAC TTTCAGTGTG AATCATGTGG TTTAAGAGCC 781 TATAAATAAT ACTTTAGCTT CTGTGTACTC GACTATAGTA AATATAAGCT AATTATATGA 841 CGCATTTTAG TATTTCTATT CCTATCTTGC CCCTAGGATG AAGTGCAATC TTCCA Predicted gene structure (within gDNA segment 2162 to 1): Exon 1 1378 1201 ( 178 n); cDNA 131 308 ( 178 n); score: 1.000 Intron 1 1200 1095 ( 106 n); Pd: 0.999 (s: 1.00), Pa: 0.997 (s: 1.00) Exon 2 1094 508 ( 587 n); cDNA 309 895 ( 587 n); score: 1.000 MATCH C06HBa0120H21.1-10- SGN-U337451+ 1.000 765 0.855 C PGS_C06HBa0120H21.1-10-_SGN-U337451+ (1378 1201,1094 508) Alignment (genomic DNA sequence = upper lines): GGGCTGGGAT CAGTTTCAAG CCAATGAAAC ACTGTTTGGA GTAAAAAGCA CATTTGATGA 1319 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGGCTGGGAT CAGTTTCAAG CCAATGAAAC ACTGTTTGGA GTAAAAAGCA CATTTGATGA 190 GGACCTTTAT ACGACAAAGC TTGAGAGAGG TCCTCAGATG AGTGAGTTGG AAAAAGAAGC 1259 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGACCTTTAT ACGACAAAGC TTGAGAGAGG TCCTCAGATG AGTGAGTTGG AAAAAGAAGC 250 TCTAAGAATA GCTAGAGAAA TTGAAGGTGA GGATACACGT GATCTTCATC TAGCAGAGGT 1199 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||| TCTAAGAATA GCTAGAGAAA TTGAAGGTGA GGATACACGT GATCTTCATC TAGCAGAG.. 308 GGGGTAATTG CATTCTTTAT ACAGAAATCT TTGTTTCTAT GTGATTTTTT TCTGCTTCTT 1139 .......... .......... .......... .......... .......... .......... 308 TCTCTCTAAT ATTAGTTGTG CTGCTTGGAA TTCATTTTTC ATAGGAGAGA GGGATCCAAC 1079 |||||| |||||||||| .......... .......... .......... .......... ....GAGAGA GGGATCCAAC 324 TTCATGAGAA CCTAGAAGTG GACGAGGAAA CCAGATTTTC CGCAGTTGTT AGAGAGATTG 1019 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCATGAGAA CCTAGAAGTG GACGAGGAAA CCAGATTTTC CGCAGTTGTT AGAGAGATTG 384 ATGATAGCGG CTATGACAAC TGTGAGGACA TCCTGTTGGA TTCACGTAAT GATGAGACAT 959 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATGATAGCGG CTATGACAAC TGTGAGGACA TCCTGTTGGA TTCACGTAAT GATGAGACAT 444 TTCAAGGTAT ATCTAGTGCT ATGGGGAAGT CATTTACTGA CATGGGCAGA AGGAAAATGA 899 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCAAGGTAT ATCTAGTGCT ATGGGGAAGT CATTTACTGA CATGGGCAGA AGGAAAATGA 504 ATGATGGTGC ACAAGTTTCA TTAAGATCTT CCTTCATGGT ATAATTTTGT TTACCCACAT 839 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATGATGGTGC ACAAGTTTCA TTAAGATCTT CCTTCATGGT ATAATTTTGT TTACCCACAT 564 TCATTAGTCT TTAAGAATTG TTTGTTGCGG TACTGAGGCT TTTTCCTTCT GTTAAGATAG 779 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCATTAGTCT TTAAGAATTG TTTGTTGCGG TACTGAGGCT TTTTCCTTCT GTTAAGATAG 624 GGATGACAGA GGAAGATATT GTCATATTAG TCAAGTATCT CAAGCTAGTC GGAAAATATA 719 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGATGACAGA GGAAGATATT GTCATATTAG TCAAGTATCT CAAGCTAGTC GGAAAATATA 684 TCAATTGGTT GTTGTCTGCT ATTGCTGGTG AATTTAAGAG TTATCCACGT CTTAGATACC 659 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCAATTGGTT GTTGTCTGCT ATTGCTGGTG AATTTAAGAG TTATCCACGT CTTAGATACC 744 AAGCACTTTC AGTGTGAATC ATGTGGTTTA AGAGCCTATA AATAATACTT TAGCTTCTGT 599 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAGCACTTTC AGTGTGAATC ATGTGGTTTA AGAGCCTATA AATAATACTT TAGCTTCTGT 804 GTACTCGACT ATAGTAAATA TAAGCTAATT ATATGACGCA TTTTAGTATT TCTATTCCTA 539 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTACTCGACT ATAGTAAATA TAAGCTAATT ATATGACGCA TTTTAGTATT TCTATTCCTA 864 TCTTGCCCCT AGGATGAAGT GCAATCTTCC A 508 |||||||||| |||||||||| |||||||||| | TCTTGCCCCT AGGATGAAGT GCAATCTTCC A 895 hqPGS_C06HBa0120H21.1-10-_SGN-U337451+ (1378 1201,1094 508) Total number of EST alignments reported: 2 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 2162: PGL 1 (- strand): 1378 57 AGS-1 (1378 1201,1094 861,526 408,251 182,90 57) SCR (e 0.994 d 0.999 a 0.997,e 1.000 d 0.892 a 0.829,e 1.000 d 0.949 a 0.991,e 1.000 d 0.991 a 0.998,e 1.000) Exon 1 1378 1201 ( 178 n); score: 0.994 Intron 1 1200 1095 ( 106 n); Pd: 0.999 Pa: 0.997 Exon 2 1094 861 ( 234 n); score: 1.000 Intron 2 860 527 ( 334 n); Pd: 0.892 Pa: 0.829 Exon 3 526 408 ( 119 n); score: 1.000 Intron 3 407 252 ( 156 n); Pd: 0.949 Pa: 0.991 Exon 4 251 182 ( 70 n); score: 1.000 Intron 4 181 91 ( 91 n); Pd: 0.991 Pa: 0.998 Exon 5 90 57 ( 34 n); score: 1.000 PGS (1378 1201,1094 861,526 408,251 182,90 57) SGN-U321764+ 3-phase translation of AGS-1 (-strand): . . . . . . 1378 GGGCTGGGATCAGTTTCAAGCCAATGAAACACTGTTTGGAGTAAAAAGCACATTTGATGA G L G S V S S Q - N T V W S K K H I - - G W D Q F Q A N E T L F G V K S T F D E A G I S F K P M K H C L E - K A H L M . . . . . . 1318 GGACCTTTATACGACAAAGCTTGAGAGAGGTCCTCAGATGAGTGAGTTGGAAAAAGAAGC G P L Y D K A - E R S S D E - V G K R S D L Y T T K L E R G P Q M S E L E K E A R T F I R Q S L R E V L R - V S W K K K . . . . . . : 1258 TCTAAGAATAGCTAGAGAAATTGAAGGTGAGGATACACGTGATCTTCATCTAGCAGAG : GA S K N S - R N - R - G Y T - S S S S R : G L R I A R E I E G E D T R D L H L A E : E L - E - L E K L K V R I H V I F I - Q R : . . . . . . 1092 GAGAGGGATCCAACTTCATGAGAACCTAGAAGTGGACGAGGAAACCAGATTTTCCGCAGT E R D P T S - E P R S G R G N Q I F R S R G I Q L H E N L E V D E E T R F S A V R E G S N F M R T - K W T R K P D F P Q . . . . . . 1032 TGTTAGAGAGATTGATGATAGCGGCTATGACAACTGTGAGGACATCCTGTTGGATTCACG C - R D - - - R L - Q L - G H P V G F T V R E I D D S G Y D N C E D I L L D S R L L E R L M I A A M T T V R T S C W I H . . . . . . 972 TAATGATGAGACATTTCAAGGTATATCTAGTGCTATGGGGAAGTCATTTACTGACATGGG - - - D I S R Y I - C Y G E V I Y - H G N D E T F Q G I S S A M G K S F T D M G V M M R H F K V Y L V L W G S H L L T W . . . . . . : 912 CAGAAGGAAAATGAATGATGGTGCACAAGTTTCATTAAGATCTTCCTTCATG : GATGAAGT Q K E N E - W C T S F I K I F L H : G - S R R K M N D G A Q V S L R S S F M : D E V A E G K - M M V H K F H - D L P S W : M K . . . . . . 518 GCAATCTTCCAAGCTAAGTACCAGTAGGGATGTCTACCAGACTTGTTACGATGATCATGC A I F Q A K Y Q - G C L P D L L R - S C Q S S K L S T S R D V Y Q T C Y D D H A C N L P S - V P V G M S T R L V T M I M . . . . . . : 458 GAAACAGTCATCAGCTGAAGTTGTCCTTAAAGGTGGCTCTATCTTAAACAG : GGGTCGCAA E T V I S - S C P - R W L Y L K Q : G S Q K Q S S A E V V L K G G S I L N R : G R K R N S H Q L K L S L K V A L S - T : G V A . . . . . . 242 AACTCTGTTTAGTGAGCATGCTGGAGCAAGTTGGAATAAGGAGGATACAAGAAATCAAAT N S V - - A C W S K L E - G G Y K K S N T L F S E H A G A S W N K E D T R N Q M K L C L V S M L E Q V G I R R I Q E I K . : . . . 182 G : ACGGATGAAGTTGCTCAAACGTCAGTATTGGAAG : D G - S C S N V S I G : T D E V A Q T S V L E - : R M K L L K R Q Y W K Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0120H21.1-10-_PGL-1_AGS-1_PPS_1 (1377 1201,1094 861,526 408,251 182,90 58) (frame '2'; 633 bp, 211 residues) 1 GWDQFQANET LFGVKSTFDE DLYTTKLERG PQMSELEKEA LRIAREIEGE DTRDLHLAEE 61 RGIQLHENLE VDEETRFSAV VREIDDSGYD NCEDILLDSR NDETFQGISS AMGKSFTDMG 121 RRKMNDGAQV SLRSSFMDEV QSSKLSTSRD VYQTCYDDHA KQSSAEVVLK GGSILNRGRK 181 TLFSEHAGAS WNKEDTRNQM TDEVAQTSVL E AGS-2 (1378 1201,1094 508) SCR (e 1.000 d 0.999 a 0.997,e 1.000) Exon 1 1378 1201 ( 178 n); score: 1.000 Intron 1 1200 1095 ( 106 n); Pd: 0.999 Pa: 0.997 Exon 2 1094 508 ( 587 n); score: 1.000 PGS (1378 1201,1094 508) SGN-U337451+ 3-phase translation of AGS-2 (-strand): . . . . . . 1378 GGGCTGGGATCAGTTTCAAGCCAATGAAACACTGTTTGGAGTAAAAAGCACATTTGATGA G L G S V S S Q - N T V W S K K H I - - G W D Q F Q A N E T L F G V K S T F D E A G I S F K P M K H C L E - K A H L M . . . . . . 1318 GGACCTTTATACGACAAAGCTTGAGAGAGGTCCTCAGATGAGTGAGTTGGAAAAAGAAGC G P L Y D K A - E R S S D E - V G K R S D L Y T T K L E R G P Q M S E L E K E A R T F I R Q S L R E V L R - V S W K K K . . . . . . : 1258 TCTAAGAATAGCTAGAGAAATTGAAGGTGAGGATACACGTGATCTTCATCTAGCAGAG : GA S K N S - R N - R - G Y T - S S S S R : G L R I A R E I E G E D T R D L H L A E : E L - E - L E K L K V R I H V I F I - Q R : . . . . . . 1092 GAGAGGGATCCAACTTCATGAGAACCTAGAAGTGGACGAGGAAACCAGATTTTCCGCAGT E R D P T S - E P R S G R G N Q I F R S R G I Q L H E N L E V D E E T R F S A V R E G S N F M R T - K W T R K P D F P Q . . . . . . 1032 TGTTAGAGAGATTGATGATAGCGGCTATGACAACTGTGAGGACATCCTGTTGGATTCACG C - R D - - - R L - Q L - G H P V G F T V R E I D D S G Y D N C E D I L L D S R L L E R L M I A A M T T V R T S C W I H . . . . . . 972 TAATGATGAGACATTTCAAGGTATATCTAGTGCTATGGGGAAGTCATTTACTGACATGGG - - - D I S R Y I - C Y G E V I Y - H G N D E T F Q G I S S A M G K S F T D M G V M M R H F K V Y L V L W G S H L L T W . . . . . . 912 CAGAAGGAAAATGAATGATGGTGCACAAGTTTCATTAAGATCTTCCTTCATGGTATAATT Q K E N E - W C T S F I K I F L H G I I R R K M N D G A Q V S L R S S F M V - F A E G K - M M V H K F H - D L P S W Y N . . . . . . 852 TTGTTTACCCACATTCATTAGTCTTTAAGAATTGTTTGTTGCGGTACTGAGGCTTTTTCC L F T H I H - S L R I V C C G T E A F S C L P T F I S L - E L F V A V L R L F P F V Y P H S L V F K N C L L R Y - G F F . . . . . . 792 TTCTGTTAAGATAGGGATGACAGAGGAAGATATTGTCATATTAGTCAAGTATCTCAAGCT F C - D R D D R G R Y C H I S Q V S Q A S V K I G M T E E D I V I L V K Y L K L L L L R - G - Q R K I L S Y - S S I S S . . . . . . 732 AGTCGGAAAATATATCAATTGGTTGTTGTCTGCTATTGCTGGTGAATTTAAGAGTTATCC S R K I Y Q L V V V C Y C W - I - E L S V G K Y I N W L L S A I A G E F K S Y P - S E N I S I G C C L L L L V N L R V I . . . . . . 672 ACGTCTTAGATACCAAGCACTTTCAGTGTGAATCATGTGGTTTAAGAGCCTATAAATAAT T S - I P S T F S V N H V V - E P I N N R L R Y Q A L S V - I M W F K S L - I I H V L D T K H F Q C E S C G L R A Y K - . . . . . . 612 ACTTTAGCTTCTGTGTACTCGACTATAGTAAATATAAGCTAATTATATGACGCATTTTAG T L A S V Y S T I V N I S - L Y D A F - L - L L C T R L - - I - A N Y M T H F S Y F S F C V L D Y S K Y K L I I - R I L . . . . . 552 TATTTCTATTCCTATCTTGCCCCTAGGATGAAGTGCAATCTTCCA Y F Y S Y L A P R M K C N L P I S I P I L P L G - S A I F V F L F L S C P - D E V Q S S Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0120H21.1-10-_PGL-1_AGS-2_PPS_1 (1377 1201,1094 855) (frame '2'; 414 bp, 138 residues) 1 GWDQFQANET LFGVKSTFDE DLYTTKLERG PQMSELEKEA LRIAREIEGE DTRDLHLAEE 61 RGIQLHENLE VDEETRFSAV VREIDDSGYD NCEDILLDSR NDETFQGISS AMGKSFTDMG 121 RRKMNDGAQV SLRSSFMV- ... finished at: Mon Aug 28 22:00:01 2006 ________________________________________________________________________________ Sequence 11: C06HBa0120H21.1-11, from 1 to 6686, both strands analyzed. ... started at: Mon Aug 28 22:00:01 2006 EST library file: /tmp/cxgn-bacpublish-resources-ZuEZzO/lycopersicum_combined_unigene_seqs; matching gDNA +strand ... ... found all matches, elapsed seconds = 3 ... matches indexed, elapsed seconds = 3 HitsTableSize = 1 EST library file: /tmp/cxgn-bacpublish-resources-ZuEZzO/lycopersicum_combined_unigene_seqs; matching gDNA -strand ... ... found all matches, elapsed seconds = 3 ... matches indexed, elapsed seconds = 3 HitsTableSize = 0 ******************************************************************************** EST sequence 1 +strand 596 n (File: SGN-U340338+) 1 ATTGAACCCC CGGGCGGGGG CCCCGCGTGG GTNCCAATGA TNACAATGCT GGAGCTCCAC 61 CGCGGTGGCG GCCGCTCTAG AACTAGTGGA TCCCCCGGGC TGCAGGAATT CGGCACGAGG 121 CTTTGGCGTA TGATATGAAG CATCATTTTT TGAATTTAGC AGTCAGCTGT GCATCTGTCA 181 TTTGCTGCCG TGTCTCTCCT AAGCAGAAGG CCTTGGTTAC TAGGTTAGTA AAGGAAGGAA 241 CTGGAAAAAC CACATTAGCA ATTGGTGATG GTGCAAATGA TGTTGGGATG ATCCAAGAAG 301 CGGACATTGG AGTTGGCATT AGCGGTGCAG AAGGAATGCA GCTTCTTGCA GGCTGTGATG 361 GCTAGTGACT TTGCTATTGC ACAGTTTCGG TTCTTGGAGA GACTTCTTGT TGTACACGGG 421 CATTGGTGCT ACAAAAGAAT TGCTCAAATG ATATGCTATT TCTTCTACAA AAATATAGCT 481 TTTGGTCTCA CACTCTTTTA CTTTGAGGCT TTCGCGGGCT TTTCGGGCCA GTCTGTCTAT 541 GATGATTCAT ACATGATTCT GTTCAATGTG ATTCTTACCT CACTGCCCGT GATTGC Predicted gene structure (within gDNA segment 3539 to 6686): Exon 1 5329 5424 ( 96 n); cDNA 120 215 ( 96 n); score: 1.000 Intron 1 5425 5524 ( 100 n); Pd: 0.998 (s: 1.00), Pa: 1.000 (s: 0.98) Exon 2 5525 5650 ( 126 n); cDNA 216 341 ( 126 n); score: 0.992 Intron 2 5651 6502 ( 852 n); Pd: 0.932 (s: 1.00), Pa: 0.000 (s: 1.00) Exon 3 6503 6611 ( 109 n); cDNA 342 450 ( 109 n); score: 1.000 MATCH C06HBa0120H21.1-11+ SGN-U340338+ 0.997 331 0.555 C PGS_C06HBa0120H21.1-11+_SGN-U340338+ (5329 5424,5525 5650,6503 6611) Alignment (genomic DNA sequence = upper lines): GCTTTGGCGT ATGATATGAA GCATCATTTT TTGAATTTAG CAGTCAGCTG TGCATCTGTC 5388 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCTTTGGCGT ATGATATGAA GCATCATTTT TTGAATTTAG CAGTCAGCTG TGCATCTGTC 179 ATTTGCTGCC GTGTCTCTCC TAAGCAGAAG GCCTTGGTAA GCTATGTGAA ATCTATCAAC 5448 |||||||||| |||||||||| |||||||||| |||||| ATTTGCTGCC GTGTCTCTCC TAAGCAGAAG GCCTTG.... .......... .......... 215 CTTTTCTTGT CTAACTCAAT TGTTCATTCA TGAAATTGGT CATATTTACA TTAGTTTGTA 5508 .......... .......... .......... .......... .......... .......... 215 TTTAATTTGG TCGCAGGTTA CTAGGTTAGT AAAGGAAGGA ACTGGGAAAA CCACATTAGC 5568 |||| |||||||||| |||||||||| ||||| |||| |||||||||| .......... ......GTTA CTAGGTTAGT AAAGGAAGGA ACTGGAAAAA CCACATTAGC 259 AATTGGTGAT GGTGCAAATG ATGTTGGGAT GATCCAAGAA GCGGACATTG GAGTTGGCAT 5628 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AATTGGTGAT GGTGCAAATG ATGTTGGGAT GATCCAAGAA GCGGACATTG GAGTTGGCAT 319 TAGCGGTGCA GAAGGAATGC AGGTTGGTGA TACATCTAGC TGGCGTTTTG GTGTTCTTAA 5688 |||||||||| |||||||||| || TAGCGGTGCA GAAGGAATGC AG........ .......... .......... .......... 341 TATACTGAGA GAATTTTAAC ATTTTCCTTC TGACAATAAC TTTTTATATA CTGCATTTTA 5748 .......... .......... .......... .......... .......... .......... 341 GAAATTTGTC AGGGAAATAC GGTCAAAGGG GTGCTGCACA TTTACATTAG AGTTCACAGA 5808 .......... .......... .......... .......... .......... .......... 341 CTGACTTGAA ACAATATGAA CTTTTAAATA CTGCAAACCC AAGATAACCA GATTCAACTT 5868 .......... .......... .......... .......... .......... .......... 341 CCTTGTGCTA TATTGTTCAA TTTTACAATT TCTTTTGGGA CTAAGACTAA TGCATTAGTA 5928 .......... .......... .......... .......... .......... .......... 341 TTTTTTTTTT TGAATCAGTT TATGTGGTTA GGGCTTATAG CTGACTGTTT GATAATGTCA 5988 .......... .......... .......... .......... .......... .......... 341 CAAGACCACT TAAAACCTAA GCAGCAACTA AACCTCAGGC CTAATTAGCA AAACTGATTT 6048 .......... .......... .......... .......... .......... .......... 341 TTCTAGAGAC TTCAGGAAAC CAATACTCTT CCCATTGATT TGTAGAAAAA TAGTTGTGCA 6108 .......... .......... .......... .......... .......... .......... 341 GAACCTTTTA CCCATTTAAA TCTTGCCTGA ATGATCTGAC ATATACCAAT TTGATATCAC 6168 .......... .......... .......... .......... .......... .......... 341 ATTTTAAATT CTTTCAAAAG TTTGAGGACA TTGAAATGTG ATTTGAAGGC TGTTTCTTGG 6228 .......... .......... .......... .......... .......... .......... 341 CCACAATCTC TCCACTGTCC GATCCATCTT ATCAAAGTTT TGATAAACTA TTTAGCGCGG 6288 .......... .......... .......... .......... .......... .......... 341 GCACGGGCGC CACCGATATT GACAAGAAAA TAAAGTTGAC CTGCAACTCA GCAAATTGCA 6348 .......... .......... .......... .......... .......... .......... 341 TCTGTGTTTT TGAGTATATA TGACCGTTTA AAGCACTACC AACTTAAAAT AATCTGGAAA 6408 .......... .......... .......... .......... .......... .......... 341 ATTTTCTCTA ATTTGGTAAG ATGGCCCTAA ATCCATTGGC TCAAGTAAAT GCTTTTGTTC 6468 .......... .......... .......... .......... .......... .......... 341 TTGGAGGCTG TAAGTCGGCT TACTCGATTA TTAGCTTCTT GCAGGCTGTG ATGGCTAGTG 6528 |||||| |||||||||| |||||||||| .......... .......... .......... ....CTTCTT GCAGGCTGTG ATGGCTAGTG 367 ACTTTGCTAT TGCACAGTTT CGGTTCTTGG AGAGACTTCT TGTTGTACAC GGGCATTGGT 6588 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACTTTGCTAT TGCACAGTTT CGGTTCTTGG AGAGACTTCT TGTTGTACAC GGGCATTGGT 427 GCTACAAAAG AATTGCTCAA ATG 6611 |||||||||| |||||||||| ||| GCTACAAAAG AATTGCTCAA ATG 450 hqPGS_C06HBa0120H21.1-11+_SGN-U340338+ (5329 5424,5525 5650,6503 6611) Total number of EST alignments reported: 1 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 6686: PGL 1 (+ strand): 5329 6611 AGS-1 (5329 5424,5525 5650,6503 6611) SCR (e 1.000 d 0.998 a 1.000,e 0.992 d 0.932 a 0.000,e 1.000) Exon 1 5329 5424 ( 96 n); score: 1.000 Intron 1 5425 5524 ( 100 n); Pd: 0.998 Pa: 1.000 Exon 2 5525 5650 ( 126 n); score: 0.992 Intron 2 5651 6502 ( 852 n); Pd: 0.932 Pa: 0.000 Exon 3 6503 6611 ( 109 n); score: 1.000 PGS (5329 5424,5525 5650,6503 6611) SGN-U340338+ 3-phase translation of AGS-1 (+strand): . . . . . . 5329 GCTTTGGCGTATGATATGAAGCATCATTTTTTGAATTTAGCAGTCAGCTGTGCATCTGTC A L A Y D M K H H F L N L A V S C A S V L W R M I - S I I F - I - Q S A V H L S F G V - Y E A S F F E F S S Q L C I C . . . . : . . 5389 ATTTGCTGCCGTGTCTCTCCTAAGCAGAAGGCCTTG : GTTACTAGGTTAGTAAAGGAAGGA I C C R V S P K Q K A L : V T R L V K E G F A A V S L L S R R P W : L L G - - R K E H L L P C L S - A E G L : G Y - V S K G R . . . . . . 5549 ACTGGGAAAACCACATTAGCAATTGGTGATGGTGCAAATGATGTTGGGATGATCCAAGAA T G K T T L A I G D G A N D V G M I Q E L G K P H - Q L V M V Q M M L G - S K K N W E N H I S N W - W C K - C W D D P R . . . . . : . 5609 GCGGACATTGGAGTTGGCATTAGCGGTGCAGAAGGAATGCAG : CTTCTTGCAGGCTGTGAT A D I G V G I S G A E G M Q : L L A G C D R T L E L A L A V Q K E C S : F L Q A V M S G H W S W H - R C R R N A : A S C R L - . . . . . . 6521 GGCTAGTGACTTTGCTATTGCACAGTTTCGGTTCTTGGAGAGACTTCTTGTTGTACACGG G - - L C Y C T V S V L G E T S C C T R A S D F A I A Q F R F L E R L L V V H G W L V T L L L H S F G S W R D F L L Y T . . . . 6581 GCATTGGTGCTACAAAAGAATTGCTCAAATG A L V L Q K N C S N H W C Y K R I A Q M G I G A T K E L L K Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0120H21.1-11+_PGL-1_AGS-1_PPS_1 (5329 5424,5525 5650,6503 6526) (frame '1'; 243 bp, 81 residues) 1 ALAYDMKHHF LNLAVSCASV ICCRVSPKQK ALVTRLVKEG TGKTTLAIGD GANDVGMIQE 61 ADIGVGISGA EGMQLLAGCD G- ... finished at: Mon Aug 28 22:00:07 2006