GeneSeqer. Version of March 12, 2006.
Date run: Mon Aug 28 21:49:48 2006

(Bayesian) Splice site model (species): Arabidopsis thaliana
Fast search parameters: MinMatchLen 16, MinQualityHSP 30, MinQualityCHAIN 50.

Total number of ESTs: 175665
Total sequence length: 93213537
Minimum sequence length: 89
Maximum sequence length: 1082
Length distribution (number of sequences of specified length):
< 100: 1
< 200: 2188
< 300: 8544
< 400: 20465
< 500: 39499
< 600: 49432
< 700: 32872
< 800: 19308
< 900: 3155
< 1000: 193
>=1000: 8

Input file : /tmp/bac-submission-temp-VVWab/C06HBa0112G05/C06HBa0112G05.seq.screen

________________________________________________________________________________
Sequence 1: C06HBa0112G05.1-1, from 1 to 1152, both strands analyzed.
... started at: Mon Aug 28 22:16:33 2006

EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA +strand ...
... found all matches, elapsed seconds = 5
... matches indexed, elapsed seconds = 5
HitsTableSize = 0
EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA -strand ...
... found all matches, elapsed seconds = 5
... matches indexed, elapsed seconds = 5
HitsTableSize = 0

No significant EST matches were found.

... finished at: Mon Aug 28 22:16:43 2006

________________________________________________________________________________
Sequence 2: C06HBa0112G05.1-2, from 1 to 1924, both strands analyzed.
... started at: Mon Aug 28 22:16:43 2006

EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA +strand ...
... found all matches, elapsed seconds = 5
... matches indexed, elapsed seconds = 5
HitsTableSize = 0
EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA -strand ...
... found all matches, elapsed seconds = 5
...
matches indexed, elapsed seconds = 5 HitsTableSize = 1 ******************************************************************************** EST sequence 1 +strand 789 n (File: SGN-E548970+) 1 TTTTTTTTTT TTTTAACATC ATAATTCTCC TAATAGTTTA TTATATACGT TTTCATTTAT 61 TTTTTTTATT TGGGATACAC GTGCAACGCA CGTGCCCGGA GACTAGTAAC TATATAGGTG 121 CAAATATTTT TGAAAAAACT CAAATAACAA CCAAAATAAA GAAAGAATAC TAAATCCTAC 181 ACTAACTAGT TACAACAACT AAAAAATATT TATAGCTATT GAACTCCTAT GCACAGTCTA 241 CTGTAAAACA TAAGTAGAAG AACTTCAAAT CTTGGCAATG AAATGACACC ATTTTCTCAC 301 ACTCAAACTG ATGTTAAATT TCATGAATGA AACAATATAT AATACTCCTA TTTAGGTTGA 361 AGACCCATTT TCTTCCTCGT TTTCCCAACG TATAAATCCT TAGACTTGAT CATCCCGGGG 421 ACTAATCCAA TGATCGTTGT GAACGTAAAT TGAGCCACTG CATCTTGCCT TCCTAACGGA 481 ACGATGATCT TAGGAGAAGA TCGAGGCTCG TATATTGCAA GTTTGCTTTC TTTTCCTCCA 541 CTCATCAAAA GTTTCAAGTT CTTTGCAGCC ACTAGAGCAT GTTTTTGAGC TGAATATCCT 601 TGTTTAAGTT CCTTAATATC AGTTATATCA CCAACAGCAA ATATGTTTCT GTGACCCTTC 661 ATTCTAAGAT TTTCATCTAC CTTCAATCTT CCAAAATTAT CGATTCGATC CTTCAAATAT 721 GTCTCCCTTA ACCACTCTGA ACCCGGTGGC TTTCCTGTGC AAAGAAAATG GCAATCTGCT 781 CTGATAGTT Predicted gene structure (within gDNA segment 1924 to 1): Exon 1 1463 1333 ( 131 n); cDNA 18 138 ( 121 n); score: 0.817 Intron 1 1332 547 ( 786 n); Pd: 0.000 (s: 0.70), Pa: 0.759 (s: 0) Exon 2 546 513 ( 34 n); cDNA 139 169 ( 31 n); score: 0.706 PPA cDNA 14 1 MATCH C06HBa0112G05.1-2- SGN-E548970+ 0.817 165 0.209 C PGS_C06HBa0112G05.1-2-_SGN-E548970+ (1463 1333,546 513) Alignment (genomic DNA sequence = upper lines): ATCATAATTC TCCTAATAGT TTATTATATA CGTTTTCATT TATTTTTTTA TTGATTTTTA 1404 |||||||||| |||||||||| |||||||||| |||||||||| || | || |||||| ATCATAATTC TCCTAATAGT TTATTATATA CGTTTTCATT TA------T- TT--TTTTTA 68 TTTGGGATAC ACGTGCAACG CACGTGCTCG GAGACTAGTT ATATTATAAG TGGGAAGTCC 1344 |||||||||| |||||||||| ||||||| || ||||||||| | |||| | | | || | TTTGGGATAC ACGTGCAACG CACGTGCCCG GAGACTAGTA ACTATATAGG T-GCAAATAT 127 TTTAGCTCAA AGTTGAATTA CGAATTTACC CTTCACTTGA ACATTATTTG AATATATATA 1284 
||| | || | TTTTGAAAAA A......... .......... .......... .......... .......... 138 ACATTATATG AAAATAAAAG CATAATATTT TCATTTTTAA TGACCTGAAA AAAATAAATT 1224 .......... .......... .......... .......... .......... .......... 138 CACACTTCAA TTACAATAAT TAGGATAAAT GAAAAGACAA CAGTTAAACT TACATTTAAA 1164 .......... .......... .......... .......... .......... .......... 138 TAACCATGAA TAGGATAAAC AAAAAGGCAA TTGAAAGTAA TAAATCTTTT TTCCAAGTGA 1104 .......... .......... .......... .......... .......... .......... 138 AATAATGACT TTCTTCAGGT AAATAAAGAA ACTATGTTTA TCTTTAAGTA AATTATTTCA 1044 .......... .......... .......... .......... .......... .......... 138 TTTTCATATA CATTTCAAAA GGTATAGAAG ATATAGGAAC GTTATTTTAG TTAAGAAATC 984 .......... .......... .......... .......... .......... .......... 138 GAAGACTTAA AGACAATCAT GAAGAACATA TGAAGTGTGG TGTCTGCAGC GATTTCTCGA 924 .......... .......... .......... .......... .......... .......... 138 CAACCACACG GTGAGAAACT GATTTTATGT TCCTAGAAAT CTCATAGTCC TCCAGCATCC 864 .......... .......... .......... .......... .......... .......... 138 CCTACCCTTC ATCATTTTTC TTTTTTGAAA TTGGAAGGTG TCAAATTTTT AACACTATTT 804 .......... .......... .......... .......... .......... .......... 138 CTTTGTTGTT GTTTTCTTCC AACCCCCCAT CCCCCCTCCT CTTATTGTAG AATATGTCTC 744 .......... .......... .......... .......... .......... .......... 138 TTCTTAGATT TATATGTGAT GCAATTTGAA GTGTTCTTGA AATTCAAGGT ATATTGATTT 684 .......... .......... .......... .......... .......... .......... 138 ATCCGAGTAT TTCTATAACG CTGTTTCAAT CAGGATATTC ACTGGATTTT TTCCTTTCTC 624 .......... .......... .......... .......... .......... .......... 138 TATTTTATTA GTCTTTATTT GTTCCTTTTT ACCCTTTTTA CTTTTTTTTT CTTAGCTTGT 564 .......... .......... .......... .......... .......... .......... 138 TTTTCATTTT TTAAAAGGTG AAATTTCAAC AATAAATATG ATGAAAGTAT A 513 | |||| |||| | ||||| | ||||| || | .......... 
.......CTC AAATAACAAC CA-AAATA-- AAGAAAGAAT A 169 hqPGS_C06HBa0112G05.1-2-_SGN-E548970+ (1463 1333,546 513) Total number of EST alignments reported: 1 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 1924: PGL 1 (- strand): 1463 513 AGS-1 (1463 1333,546 513) SCR (e 0.817 d 0.000 a 0.759,e 0.706) Exon 1 1463 1333 ( 131 n); score: 0.817 Intron 1 1332 547 ( 786 n); Pd: 0.000 Pa: 0.759 Exon 2 546 513 ( 34 n); score: 0.706 PGS (1463 1333,546 513) SGN-E548970+ 3-phase translation of AGS-1 (-strand): . . . . . . 1463 ATCATAATTCTCCTAATAGTTTATTATATACGTTTTCATTTATTTTTTTATTGATTTTTA I I I L L I V Y Y I R F H L F F Y - F L S - F S - - F I I Y V F I Y F F I D F Y H N S P N S L L Y T F S F I F L L I F . . . . . . 1403 TTTGGGATACACGTGCAACGCACGTGCTCGGAGACTAGTTATATTATAAGTGGGAAGTCC F G I H V Q R T C S E T S Y I I S G K S L G Y T C N A R A R R L V I L - V G S P I W D T R A T H V L G D - L Y Y K W E V . . : . . . 1343 TTTAGCTCAAA : GTGAAATTTCAACAATAAATATGATGAAAGTATA F S S K : - N F N N K Y D E S I L A Q : S E I S T I N M M K V L - L K : V K F Q Q - I - - K Y Maximal non-overlapping open reading frames (>= 64 codons): none ... finished at: Mon Aug 28 22:16:54 2006 ________________________________________________________________________________ Sequence 3: C06HBa0112G05.1-3, from 1 to 13409, both strands analyzed. ... started at: Mon Aug 28 22:16:54 2006 EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 6 HitsTableSize = 0 EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 5 ... 
matches indexed, elapsed seconds = 5 HitsTableSize = 2 ******************************************************************************** EST sequence 1 +strand 443 n (File: SGN-E353715+) 1 GGAAAAATTT CATGAATATG CTATCAGATG GAGGGAACAA GCTGCTAGGT TTAAACCACC 61 GATGAAGGAG TCAGAGATGA TTGACGTTTT TCTCCAGGCG CAAGAACCTG ATTACTTTCT 121 CTATCTGCTT TCTGCCGTAG GGAAGACATT CGCTGAAGTT ATTAAGGTAG GGGAAATGGT 181 GGAAAATGGT ATCAAGTCTG GAAAGATTGT AAGTCAGGCT GCCTTAAAAT CCACAACACA 241 AGTGCTTCAA AATGGTTCTG GAAATATTGA GGGAAGAAGA GAAGGGAGGA TGTGGCCACT 301 ATTGTATCAG CGCCTAGGAC TCATGTTCTA GGTAATTCCC CACAACACTA TTTTCCTTCC 361 CAAGCTCCAC AATATTCTAT GCCATACACT CCATATCATG GTTTTAATGC ACAACCAATT 421 GCACCCCCTT CTTATCCACC ATG Predicted gene structure (within gDNA segment 8816 to 7019): Exon 1 8108 7665 ( 444 n); cDNA 1 443 ( 443 n); score: 0.968 MATCH C06HBa0112G05.1-3- SGN-E353715+ 0.968 444 1.002 C PGS_C06HBa0112G05.1-3-_SGN-E353715+ (8108 7665) Alignment (genomic DNA sequence = upper lines): GGAAAATTTT CTTGAATATG CTATCAGATG GAGGGAACAA GCTGCTAGGG TTAAACCATC 8049 |||||| ||| | |||||||| |||||||||| |||||||||| ||||||||| |||||||| | GGAAAAATTT CATGAATATG CTATCAGATG GAGGGAACAA GCTGCTAGGT TTAAACCACC 60 GATGAAGGAG TCAGAGATGA TTGACGTTTT TCTCCAGGCG CAAGAACCTG ATTACTTTCA 7989 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| GATGAAGGAG TCAGAGATGA TTGACGTTTT TCTCCAGGCG CAAGAACCTG ATTACTTTCT 120 CTATCTGCTT TCTGCCGTAG GAAAGACATT CGCTGAAGTT ATTAAGGTAG GGGAAATGGT 7929 |||||||||| |||||||||| | |||||||| |||||||||| |||||||||| |||||||||| CTATCTGCTT TCTGCCGTAG GGAAGACATT CGCTGAAGTT ATTAAGGTAG GGGAAATGGT 180 GGAAAATGGC ATCAAGTCTG GAAAGATTGT AAGTCAGGCT GCCTTAAAAG CCACAACACA 7869 ||||||||| |||||||||| |||||||||| |||||||||| ||||||||| |||||||||| GGAAAATGGT ATCAAGTCTG GAAAGATTGT AAGTCAGGCT GCCTTAAAAT CCACAACACA 240 AGTACTTCAA AATGGTTCTG GAAATATTGG AGGGAAGAAG AGAAGGGAGA ATGTGTCCAC 7809 ||| |||||| |||||||||| |||||||| | |||||||||| ||||||||| ||||| |||| AGTGCTTCAA AATGGTTCTG GAAATATT-G AGGGAAGAAG AGAAGGGAGG 
ATGTGGCCAC 299 TATTGTATCA GCGCCTAGGA CTCATGTTCT AGGTAATTCC CCACAACACT ATTTTCCTTC 7749 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TATTGTATCA GCGCCTAGGA CTCATGTTCT AGGTAATTCC CCACAACACT ATTTTCCTTC 359 CCAAGCTCCA CAATATTCTA TGCCATACAC TCCATATCAT GTTTTTAATG CACAACCAAT 7689 |||||||||| |||||||||| |||||||||| |||||||||| | |||||||| |||||||||| CCAAGCTCCA CAATATTCTA TGCCATACAC TCCATATCAT GGTTTTAATG CACAACCAAT 419 TGCACCCCCT TCTTATCCAC AATG 7665 |||||||||| |||||||||| ||| TGCACCCCCT TCTTATCCAC CATG 443 hqPGS_C06HBa0112G05.1-3-_SGN-E353715+ (8108 7665) ******************************************************************************** EST sequence 2 +strand 416 n (File: SGN-E274122+) 1 TATGATAAGT AATCCACTCT TTGTGTCAAC TGCCCCGACT AACAACATCC TGTAGACAAT 61 GATGGTGCCC AAATCCAACA GCGATCCTCT GCCCAAAGTT CGGCTTGATC ACAGTTACAC 121 TCTTGAAGAG GCCATTAAAA TTCCAAGCTC TCATCCCAAC ATTCATCAAT ATGGTTACCC 181 TGTCAAAATT GAGAAGATGG TCAAGAATGA GGAACATGAA GAAATGACTA AGAAAATGAA 241 GAGTTCTGAA CAGAGTATAA GAGATATGAA AGGACCAAGA GGCCACAAAG GCATCTCGTT 301 CAGTGACTTG TGTATGTTTC CCTCACGTCC ATTTGCCTGC TGGTTTTAAA ATTCCAAAGT 361 TTGAAAAATA CGATGGTCAC GGAGACCTCA TAGCTCATCT AAAGATATAT TGCAAC Predicted gene structure (within gDNA segment 9353 to 7450): Exon 1 8733 8321 ( 413 n); cDNA 3 416 ( 414 n); score: 0.929 MATCH C06HBa0112G05.1-3- SGN-E274122+ 0.929 413 0.993 C PGS_C06HBa0112G05.1-3-_SGN-E274122+ (8733 8321) Alignment (genomic DNA sequence = upper lines): TGATAAGTAA TCCACTCTTT GTGTCAACTG CCCCGACTAA CAGCATCCCG CAGCCAACGA 8674 |||||||||| |||||||||| |||||||||| |||||||||| || ||||| | || ||| || TGATAAGTAA TCCACTCTTT GTGTCAACTG CCCCGACTAA CAACATCCTG TAGACAATGA 62 TGGTGCCTAA ATCCAACAGA GATCCTCCGT CCAAAGTTCG GCGTGATCAG AGTTACACTC 8614 ||||||| || ||||||||| ||||||| | |||||||||| || |||||| |||||||||| TGGTGCCCAA ATCCAACAGC GATCCTCTGC CCAAAGTTCG GCTTGATCAC AGTTACACTC 122 TTGAAGAGGC CATTAAAATT CCAAGCTCTC ATCCCCACAT TCATCAATAT AGTTCCCCTG 8554 |||||||||| |||||||||| |||||||||| ||||| |||| |||||||||| ||| 
||||| TTGAAGAGGC CATTAAAATT CCAAGCTCTC ATCCCAACAT TCATCAATAT GGTTACCCTG 182 CCGAAATTGA GAGAATGGTC AAGAATGAGG AACATGAAGA AATGACTAAG AAAATGAAGA 8494 | ||||||| || |||||| |||||||||| |||||||||| |||||||||| |||||||||| TCAAAATTGA GAAGATGGTC AAGAATGAGG AACATGAAGA AATGACTAAG AAAATGAAGA 242 GTTTGGAACA GAGTATAAGA GATATGCAAG GACTAGGAGG CCACAAAGGC ATCTCGTTCA 8434 ||| ||||| |||||||||| |||||| ||| ||| | |||| |||||||||| |||||||||| GTTCTGAACA GAGTATAAGA GATATGAAAG GACCAAGAGG CCACAAAGGC ATCTCGTTCA 302 GTGACTTGTG TATGTTT-CC TCACGTCCAT TTGCCTGCTG GTTTTAAAAC TCCAAAGTTT 8375 |||||||||| ||||||| || |||||||||| |||||||||| ||||||||| |||||||||| GTGACTTGTG TATGTTTCCC TCACGTCCAT TTGCCTGCTG GTTTTAAAAT TCCAAAGTTT 362 GAAAAATACG ATGGTCACGG AGACCCCATT GCTCATCTAA AGAGATATTG CAAC 8321 |||||||||| |||||||||| ||||| ||| |||||||||| ||| |||||| |||| GAAAAATACG ATGGTCACGG AGACCTCATA GCTCATCTAA AGATATATTG CAAC 416 hqPGS_C06HBa0112G05.1-3-_SGN-E274122+ (8733 8321) Total number of EST alignments reported: 2 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 13409: PGL 1 (- strand): 8733 7665 AGS-1 (8108 7665) SCR (e 0.968) Exon 1 8108 7665 ( 444 n); score: 0.968 PGS (8108 7665) SGN-E353715+ 3-phase translation of AGS-1 (-strand): . . . . . . 8108 GGAAAATTTTCTTGAATATGCTATCAGATGGAGGGAACAAGCTGCTAGGGTTAAACCATC G K F S - I C Y Q M E G T S C - G - T I E N F L E Y A I R W R E Q A A R V K P S K I F L N M L S D G G N K L L G L N H . . . . . . 8048 GATGAAGGAGTCAGAGATGATTGACGTTTTTCTCCAGGCGCAAGAACCTGATTACTTTCA D E G V R D D - R F S P G A R T - L L S M K E S E M I D V F L Q A Q E P D Y F H R - R S Q R - L T F F S R R K N L I T F . . . . . . 7988 CTATCTGCTTTCTGCCGTAGGAAAGACATTCGCTGAAGTTATTAAGGTAGGGGAAATGGT L S A F C R R K D I R - S Y - G R G N G Y L L S A V G K T F A E V I K V G E M V T I C F L P - E R H S L K L L R - G K W . . . . . . 
7928 GGAAAATGGCATCAAGTCTGGAAAGATTGTAAGTCAGGCTGCCTTAAAAGCCACAACACA G K W H Q V W K D C K S G C L K S H N T E N G I K S G K I V S Q A A L K A T T Q W K M A S S L E R L - V R L P - K P Q H . . . . . . 7868 AGTACTTCAAAATGGTTCTGGAAATATTGGAGGGAAGAAGAGAAGGGAGAATGTGTCCAC S T S K W F W K Y W R E E E K G E C V H V L Q N G S G N I G G K K R R E N V S T K Y F K M V L E I L E G R R E G R M C P . . . . . . 7808 TATTGTATCAGCGCCTAGGACTCATGTTCTAGGTAATTCCCCACAACACTATTTTCCTTC Y C I S A - D S C S R - F P T T L F S F I V S A P R T H V L G N S P Q H Y F P S L L Y Q R L G L M F - V I P H N T I F L . . . . . . 7748 CCAAGCTCCACAATATTCTATGCCATACACTCCATATCATGTTTTTAATGCACAACCAAT P S S T I F Y A I H S I S C F - C T T N Q A P Q Y S M P Y T P Y H V F N A Q P I P K L H N I L C H T L H I M F L M H N Q . . . 7688 TGCACCCCCTTCTTATCCACAATG C T P F L S T M A P P S Y P Q L H P L L I H N Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0112G05.1-3-_PGL-1_AGS-1_PPS_1 (8107 7667) (frame '2'; 441 bp, 147 residues) 1 ENFLEYAIRW REQAARVKPS MKESEMIDVF LQAQEPDYFH YLLSAVGKTF AEVIKVGEMV 61 ENGIKSGKIV SQAALKATTQ VLQNGSGNIG GKKRRENVST IVSAPRTHVL GNSPQHYFPS 121 QAPQYSMPYT PYHVFNAQPI APPSYPQ 3-phase translation of AGS-1 (+strand): . . . . . . 7665 CATTGTGGATAAGAAGGGGGTGCAATTGGTTGTGCATTAAAAACATGATATGGAGTGTAT H C G - E G G A I G C A L K T - Y G V Y I V D K K G V Q L V V H - K H D M E C M L W I R R G C N W L C I K N M I W S V . . . . . . 7725 GGCATAGAATATTGTGGAGCTTGGGAAGGAAAATAGTGTTGTGGGGAATTACCTAGAACA G I E Y C G A W E G K - C C G E L P R T A - N I V E L G K E N S V V G N Y L E H W H R I L W S L G R K I V L W G I T - N . . . . . . 7785 TGAGTCCTAGGCGCTGATACAATAGTGGACACATTCTCCCTTCTCTTCTTCCCTCCAATA - V L G A D T I V D T F S L L F F P P I E S - A L I Q - W T H S P F S S S L Q Y M S P R R - Y N S G H I L P S L L P S N . . . . . . 
7845 TTTCCAGAACCATTTTGAAGTACTTGTGTTGTGGCTTTTAAGGCAGCCTGACTTACAATC F P E P F - S T C V V A F K A A - L T I F Q N H F E V L V L W L L R Q P D L Q S I S R T I L K Y L C C G F - G S L T Y N . . . . . . 7905 TTTCCAGACTTGATGCCATTTTCCACCATTTCCCCTACCTTAATAACTTCAGCGAATGTC F P D L M P F S T I S P T L I T S A N V F Q T - C H F P P F P L P - - L Q R M S L S R L D A I F H H F P Y L N N F S E C . . . . . . 7965 TTTCCTACGGCAGAAAGCAGATAGTGAAAGTAATCAGGTTCTTGCGCCTGGAGAAAAACG F P T A E S R - - K - S G S C A W R K T F L R Q K A D S E S N Q V L A P G E K R L S Y G R K Q I V K V I R F L R L E K N . . . . . . 8025 TCAATCATCTCTGACTCCTTCATCGATGGTTTAACCCTAGCAGCTTGTTCCCTCCATCTG S I I S D S F I D G L T L A A C S L H L Q S S L T P S S M V - P - Q L V P S I - V N H L - L L H R W F N P S S L F P P S . . . 8085 ATAGCATATTCAAGAAAATTTTCC I A Y S R K F S - H I Q E N F D S I F K K I F Maximal non-overlapping open reading frames (>= 64 codons): none AGS-2 (8733 8321) SCR (e 0.929) Exon 1 8733 8321 ( 413 n); score: 0.929 PGS (8733 8321) SGN-E274122+ 3-phase translation of AGS-2 (-strand): . . . . . . 8733 TGATAAGTAATCCACTCTTTGTGTCAACTGCCCCGACTAACAGCATCCCGCAGCCAACGA - - V I H S L C Q L P R L T A S R S Q R D K - S T L C V N C P D - Q H P A A N D I S N P L F V S T A P T N S I P Q P T . . . . . . 8673 TGGTGCCTAAATCCAACAGAGATCCTCCGTCCAAAGTTCGGCGTGATCAGAGTTACACTC W C L N P T E I L R P K F G V I R V T L G A - I Q Q R S S V Q S S A - S E L H S M V P K S N R D P P S K V R R D Q S Y T . . . . . . 8613 TTGAAGAGGCCATTAAAATTCCAAGCTCTCATCCCCACATTCATCAATATAGTTCCCCTG L K R P L K F Q A L I P T F I N I V P L - R G H - N S K L S S P H S S I - F P C L E E A I K I P S S H P H I H Q Y S S P . . . . . . 8553 CCGAAATTGAGAGAATGGTCAAGAATGAGGAACATGAAGAAATGACTAAGAAAATGAAGA P K L R E W S R M R N M K K - L R K - R R N - E N G Q E - G T - R N D - E N E E A E I E R M V K N E E H E E M T K K M K . . . . . . 
8493 GTTTGGAACAGAGTATAAGAGATATGCAAGGACTAGGAGGCCACAAAGGCATCTCGTTCA V W N R V - E I C K D - E A T K A S R S F G T E Y K R Y A R T R R P Q R H L V Q S L E Q S I R D M Q G L G G H K G I S F . . . . . . 8433 GTGACTTGTGTATGTTTCCTCACGTCCATTTGCCTGCTGGTTTTAAAACTCCAAAGTTTG V T C V C F L T S I C L L V L K L Q S L - L V Y V S S R P F A C W F - N S K V - S D L C M F P H V H L P A G F K T P K F . . . . . . 8373 AAAAATACGATGGTCACGGAGACCCCATTGCTCATCTAAAGAGATATTGCAAC K N T M V T E T P L L I - R D I A K I R W S R R P H C S S K E I L Q E K Y D G H G D P I A H L K R Y C N Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0112G05.1-3-_PGL-1_AGS-2_PPS_1 (8731 8321) (frame '0'; 411 bp, 137 residues) 1 ISNPLFVSTA PTNSIPQPTM VPKSNRDPPS KVRRDQSYTL EEAIKIPSSH PHIHQYSSPA 61 EIERMVKNEE HEEMTKKMKS LEQSIRDMQG LGGHKGISFS DLCMFPHVHL PAGFKTPKFE 121 KYDGHGDPIA HLKRYCN 3-phase translation of AGS-2 (+strand): . . . . . . 8321 GTTGCAATATCTCTTTAGATGAGCAATGGGGTCTCCGTGACCATCGTATTTTTCAAACTT V A I S L - M S N G V S V T I V F F K L L Q Y L F R - A M G S P - P S Y F S N F C N I S L D E Q W G L R D H R I F Q T . . . . . . 8381 TGGAGTTTTAAAACCAGCAGGCAAATGGACGTGAGGAAACATACACAAGTCACTGAACGA W S F K T S R Q M D V R K H T Q V T E R G V L K P A G K W T - G N I H K S L N E L E F - N Q Q A N G R E E T Y T S H - T . . . . . . 8441 GATGCCTTTGTGGCCTCCTAGTCCTTGCATATCTCTTATACTCTGTTCCAAACTCTTCAT D A F V A S - S L H I S Y T L F Q T L H M P L W P P S P C I S L I L C S K L F I R C L C G L L V L A Y L L Y S V P N S S . . . . . . 8501 TTTCTTAGTCATTTCTTCATGTTCCTCATTCTTGACCATTCTCTCAATTTCGGCAGGGGA F L S H F F M F L I L D H S L N F G R G F L V I S S C S S F L T I L S I S A G E F S - S F L H V P H S - P F S Q F R Q G . . . . . . 8561 ACTATATTGATGAATGTGGGGATGAGAGCTTGGAATTTTAATGGCCTCTTCAAGAGTGTA T I L M N V G M R A W N F N G L F K S V L Y - - M W G - E L G I L M A S S R V - N Y I D E C G D E S L E F - W P L Q E C . . . . . . 
8621 ACTCTGATCACGCCGAACTTTGGACGGAGGATCTCTGTTGGATTTAGGCACCATCGTTGG T L I T P N F G R R I S V G F R H H R W L - S R R T L D G G S L L D L G T I V G N S D H A E L W T E D L C W I - A P S L . . . . . . 8681 CTGCGGGATGCTGTTAGTCGGGGCAGTTGACACAAAGAGTGGATTACTTATCA L R D A V S R G S - H K E W I T Y C G M L L V G A V D T K S G L L I A A G C C - S G Q L T Q R V D Y L S Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0112G05.1-3+_PGL-1_AGS-2_PPS_1 (8462 8710) (frame '1'; 246 bp, 82 residues) 1 SLHISYTLFQ TLHFLSHFFM FLILDHSLNF GRGTILMNVG MRAWNFNGLF KSVTLITPNF 61 GRRISVGFRH HRWLRDAVSR GS- ... finished at: Mon Aug 28 22:17:05 2006 ________________________________________________________________________________ Sequence 4: C06HBa0112G05.1-4, from 1 to 1643, both strands analyzed. ... started at: Mon Aug 28 22:17:05 2006 EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 0 EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 6 ... matches indexed, elapsed seconds = 6 HitsTableSize = 0 No significant EST matches were found. ... finished at: Mon Aug 28 22:17:16 2006 ________________________________________________________________________________ Sequence 5: C06HBa0112G05.1-5, from 1 to 937, both strands analyzed. ... started at: Mon Aug 28 22:17:16 2006 EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 6 ... matches indexed, elapsed seconds = 6 HitsTableSize = 2 EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 6 ... 
matches indexed, elapsed seconds = 7 HitsTableSize = 1 ******************************************************************************** EST sequence 2 +strand 624 n (File: SGN-E542827+) 1 TTTTCTTTTT TTTTGGTTAA TGACTAATTT ATAATTATTA TTTTGATAAT CAAATTTATT 61 TATATTTCAC TAATATTCTT GTAAAACTTA TTGTAGATGA CCAAAATTTT TCTTTGAATA 121 CCAAATTAAA TTACAATACA CACAAAAAAA ATAGTTTAGT TTTTTTTCTC TTTAAACTAA 181 GGAATGAAAG AAAAAAAATT AGAATAACAA ACTCAAATAA TTATAATAAA AGAAGTCAAA 241 CAATAATTTA TGTATAAAAA AAATTAAATA TAACCTCGAA CTTTGATAGA AGAATAATAT 301 ATACCTTTAA ATAATTTTTT TAAAAACAAT CAAAAGTAAT AAATATAAAT TTAAAATTAA 361 TTTTTTAATA TATATTAGCC ATTTTGTAAC GACAGTGCTG CAAATGACAA AATTGAGAAA 421 TATATCAAAC TTTTTCCGTA AAATAGATAA ACTTAAAAGA GGATATTTGT AAACAACACA 481 AAATCTTCAA TCAAATACAA AGTTCAAACA CTAGCAGTGA ACCAAATCAT CGAAAAGTTA 541 TAAAATTGTT AGAAATTTCT ACCATATATT GGATGAAAAC ATTGAAATTT CCTGGAATTT 601 TGAACTTTAG GTTGCTGGTT TTGG Predicted gene structure (within gDNA segment 1 to 937): Exon 1 478 839 ( 362 n); cDNA 1 365 ( 365 n); score: 0.840 Intron 1 840 887 ( 48 n); Pd: 0.000 (s: 0.92), Pa: 0.986 (s: 0.54) Exon 2 888 937 ( 50 n); cDNA 366 414 ( 49 n); score: 0.540 MATCH C06HBa0112G05.1-5+ SGN-E542827+ 0.803 412 0.660 C PGS_C06HBa0112G05.1-5+_SGN-E542827+ (478 839,888 937) Alignment (genomic DNA sequence = upper lines): TTTTTATTTT TTTTGTTTCA ATGACTAATT TATAATTATA ATTTTGATAA TCAAATTTGT 537 |||| |||| ||||| || | |||||||||| ||||||||| |||||||||| |||||||| | TTTTCTTTTT TTTTGGTT-A ATGACTAATT TATAATTATT ATTTTGATAA TCAAATTTAT 59 TTATGTTTCA CTAATATTCT TGTAAAACTT ATTGTAGA-- -CCAAATTTT TTCTTCGAAT 594 |||| ||||| |||||||||| |||||||||| |||||||| ||||| ||| ||||| |||| TTATATTTCA CTAATATTCT TGTAAAACTT ATTGTAGATG ACCAAAATTT TTCTTTGAAT 119 ACGAAATTAA ATTACAATAC ACA-AAAAAA --TTGTTTAA ATTTTTTTCT TTAAACTAAG 651 || ||||||| |||||||||| ||| |||||| | ||||| ||||||||| | | || ACCAAATTAA ATTACAATAC ACACAAAAAA AATAGTTTAG TTTTTTTTCT CTTTA--AAC 177 TAATGAAAGA AAAAAACAAA ATAAGAATAA GAAACTCAAA TAATTATAAT AAGCGAAGTC 711 ||| ||| || 
|| ||| ||| || ||||||| ||||||||| |||||||||| || |||||| TAAGGAATGA AAGAAAAAAA ATTAGAATAA CAAACTCAAA TAATTATAAT AAAAGAAGTC 237 AAAAAATAAT TTATGTATGA AAAAAAATTA AA-ATATACC TTGAACTTTG ATAGAAGAAT 770 ||| |||||| |||||||| | |||||||||| || ||| ||| | |||||||| |||||||||| AAACAATAAT TTATGTAT-A AAAAAAATTA AATATA-ACC TCGAACTTTG ATAGAAGAAT 295 CATATATACC CCTAAATAA- TTTTTTAAAA AAAATTACAA GTAATAAATA TAAATTTAAA 829 ||||||||| ||||||| |||||||||| | ||| | || |||||||||| |||||||||| AATATATACC TTTAAATAAT TTTTTTAAAA ACAATCAAAA GTAATAAATA TAAATTTAAA 355 ACTAATTTTT TAACTTTCGT TAAATGAAGG GTATATGTGA GCCATTTTGT AACGGCAGGG 889 | |||||||| ATTAATTTTT .......... .......... .......... .......... ........TA 367 GTATATGTGA GCCGTTTGAA TAACGGTAAG GGCATATATA AACCACTT 937 ||||| | | ||| ||| ||||| | | | || | | || ATATATATTA GCCATTT-TG TAACGACAGT GCTGCAAATG ACAAAATT 414 hqPGS_C06HBa0112G05.1-5+_SGN-E542827+ (478 839,888 937) ******************************************************************************** EST sequence 1 +strand 460 n (File: SGN-E243215+) 1 TTATTGTAGA TGACCAATTT TTTCTTCGAA TACGAAATTA AATTACAATA CACACAAAAA 61 AAATATTTGA ATTTTTTTTA TTTAAACTAA GGAATGAAAG AAAAAAACAA AATAAGAATA 121 AGAAACTCAA ATTATTATAA TAAAAGAAGT CAAAAAATAA TTTTTGTATG AAAAAATTAA 181 AATATACCTT GAACTTTGAT AGAAGAATCA TATATATCCC TAAATATTTT TTTTAAAAAA 241 AAATTAGAAG TAACAAATAT AAATTTAAAA CTAATTTTTT AACTTTCGTT AAATGAAGGG 301 TATATGTGAG CCATTTTCTA ACGGCAGGGG TATATGTGAG CCGTTTGTAT AACGATAAGG 361 GCATATATGA ACCACTTTTA TTACGAGGGA TATATCAGCT CTAAATGACA AAGTTGAGAG 421 GTATATCAGA CCCTTTTCCC TATTTTTTAA AATTTCATAC Predicted gene structure (within gDNA segment 1 to 937): Exon 1 579 937 ( 359 n); cDNA 16 377 ( 362 n); score: 0.912 MATCH C06HBa0112G05.1-5+ SGN-E243215+ 0.912 359 0.780 C PGS_C06HBa0112G05.1-5+_SGN-E243215+ (579 937) Alignment (genomic DNA sequence = upper lines): AATTTTTTCT TCGAATACGA AATTAAATTA CAAT--ACAC AAAAAAATTG TTT-AAATTT 635 |||||||||| |||||||||| |||||||||| |||| |||| ||||||| | ||| || ||| AATTTTTTCT TCGAATACGA AATTAAATTA 
CAATACACAC AAAAAAAATA TTTGAATTTT 75 TTTTCTTTAA ACTAAGTAAT GAAAGAAAAA AACAAAATAA GAATAAGAAA CTCAAATAAT 695 |||| ||||| |||||| ||| |||||||||| |||||||||| |||||||||| ||||||| || TTTTATTTAA ACTAAGGAAT GAAAGAAAAA AACAAAATAA GAATAAGAAA CTCAAATTAT 135 TATAATAAGC GAAGTCAAAA AATAATTTAT GTATGAAAAA AAATTAAAAT ATACCTTGAA 755 |||||||| |||||||||| |||||||| | ||||| ||| |||||||||| |||||||||| TATAATAAAA GAAGTCAAAA AATAATTTTT GTATG--AAA AAATTAAAAT ATACCTTGAA 193 CTTTGATAGA AGAATCATAT ATACCCCTAA ATA-ATTTTT T-AAAAAAAA TTACAAGTAA 813 |||||||||| |||||||||| ||| |||||| ||| ||||| | |||||||| ||| |||||| CTTTGATAGA AGAATCATAT ATATCCCTAA ATATTTTTTT TAAAAAAAAA TTAGAAGTAA 253 TAAATATAAA TTTAAAACTA ATTTTTTAAC TTTCGTTAAA TGAAGGGTAT ATGTGAGCCA 873 ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAAATATAAA TTTAAAACTA ATTTTTTAAC TTTCGTTAAA TGAAGGGTAT ATGTGAGCCA 313 TTTTGTAACG GCAGGGGTAT ATGTGAGCCG TTTGAATAAC GGTAAGGGCA TATATAAACC 933 |||| ||||| |||||||||| |||||||||| |||| ||||| | |||||||| ||||| |||| TTTTCTAACG GCAGGGGTAT ATGTGAGCCG TTTGTATAAC GATAAGGGCA TATATGAACC 373 ACTT 937 |||| ACTT 377 hqPGS_C06HBa0112G05.1-5+_SGN-E243215+ (579 937) ******************************************************************************** EST sequence 3 -strand 701 n (File: SGN-E578389-) 1 TCAAATAATT ATAATAAATA AGTCAAAAAA ATAATTTATG TATTAAAAAA ATTTGAAATA 61 TACCTTGAAC TTTGAAAAAA GAATCATATA TGCCCCTAAA TATATTTTTT TTTAAAATTA 121 AAGTAAAATT ATAAATTTAA AAGTAATTTT TTCACTTTCG TTAAATGAAG GGTATATATG 181 AGCTCATTTT GTAACGGCAG AGGTATATGT GAACCATTTG TATAACGGTA AGGGTATATA 241 TGAGCCACTT TCATAACGAG GGGTATATCA GTTTCAAATG ACAAAGTTGA GGGGTATATC 301 ATACCCTTTT CCCATAATAT TATTCATTTT TGGGTTGACG GGTCAAACCT TGGGCTGCTT 361 AGGACTTGAT TAGACCGCTA TTTTATTGAC TCTTTAATTA ATGGGCAACT TTCACATATA 421 ACAAACAAAA AATTCATATT TGTATGCTAT AACAAAGTTT GCATAATTGC GCTCCATAGC 481 AAACATAAAA TTGTATAATT CGCTGACCTA AATTGTATAA TTCGCTGGCC TATTTCGCTG 541 CAATTGTATA ATTCGCTATC CTATTTAACT ACAATTGTAT AATTCGCTGC CTATTTCGCT 601 GCAATATTAT TATAAAATTT GCTTTGCATA 
TAATTGAACC GAATTAAAAT GTATGTATAT 661 TGCATAATTA TAAGTGTATA GCAATAAGAT ATATGTTTTT C Predicted gene structure (within gDNA segment 77 to 937): Exon 1 687 937 ( 251 n); cDNA 1 250 ( 250 n); score: 0.869 MATCH C06HBa0112G05.1-5+ SGN-E578389- 0.869 251 0.358 C PGS_C06HBa0112G05.1-5+_SGN-E578389- (687 937) Alignment (genomic DNA sequence = upper lines): TCAAATAATT ATAATAAGCG AAGTCAAAA- AATAATTTAT GTATGAAAAA AAATTAAAAT 745 |||||||||| ||||||| ||||||||| |||||||||| |||| ||||| || || |||| TCAAATAATT ATAATAA-AT AAGTCAAAAA AATAATTTAT GTATTAAAAA AATTTGAAAT 59 ATACCTTGAA CTTTGATAGA AGAATCATAT ATACCCCTAA ATAATTTTTT AAAAAAAATT 805 |||||||||| |||||| | | |||||||||| || ||||||| ||| ||||| |||||| ATACCTTGAA CTTTGAAAAA AGAATCATAT ATGCCCCTAA ATATATTTTT TTTTAAAATT 119 ACAAGTAATA AATATAAATT TAAAACTAAT TTTTTAACTT TCGTTAAATG AAGGGTATAT 865 | |||||| | | |||||||| ||||| |||| ||||| |||| |||||||||| |||||||||| A-AAGTAA-A ATTATAAATT TAAAAGTAAT TTTTTCACTT TCGTTAAATG AAGGGTATAT 177 GTGAGC-CAT TTTGTAACGG CAGGGGTATA TGTGAGCCGT TTGAATAACG GTAAGGGCAT 924 ||||| ||| |||||||||| ||| |||||| ||||| || | ||| |||||| ||||||| || ATGAGCTCAT TTTGTAACGG CAGAGGTATA TGTGAACCAT TTGTATAACG GTAAGGGTAT 237 ATATAAACCA CTT 937 |||| | ||| ||| ATATGAGCCA CTT 250 hqPGS_C06HBa0112G05.1-5+_SGN-E578389- (687 937) Total number of EST alignments reported: 3 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 937: PGL 1 (+ strand): 478 937 AGS-1 (478 839,888 937) SCR (e 0.840 d 0.000 a 0.986,e 0.540) Exon 1 478 839 ( 362 n); score: 0.840 Intron 1 840 887 ( 48 n); Pd: 0.000 Pa: 0.986 Exon 2 888 937 ( 50 n); score: 0.540 PGS (478 839,888 937) SGN-E542827+ 3-phase translation of AGS-1 (+strand): . . . . . . 478 TTTTTATTTTTTTTGTTTCAATGACTAATTTATAATTATAATTTTGATAATCAAATTTGT F L F F L F Q - L I Y N Y N F D N Q I C F Y F F C F N D - F I I I I L I I K F V F I F F V S M T N L - L - F - - S N L . . . . . . 
538 TTATGTTTCACTAATATTCTTGTAAAACTTATTGTAGACCAAATTTTTTCTTCGAATACG L C F T N I L V K L I V D Q I F S S N T Y V S L I F L - N L L - T K F F L R I R F M F H - Y S C K T Y C R P N F F F E Y . . . . . . 598 AAATTAAATTACAATACACAAAAAAATTGTTTAAATTTTTTTCTTTAAACTAAGTAATGA K L N Y N T Q K N C L N F F L - T K - - N - I T I H K K I V - I F F F K L S N E E I K L Q Y T K K L F K F F S L N - V M . . . . . . 658 AAGAAAAAAACAAAATAAGAATAAGAAACTCAAATAATTATAATAAGCGAAGTCAAAAAA K K K T K - E - E T Q I I I I S E V K K R K K Q N K N K K L K - L - - A K S K N K E K N K I R I R N S N N Y N K R S Q K . . . . . . 718 TAATTTATGTATGAAAAAAAATTAAAATATACCTTGAACTTTGATAGAAGAATCATATAT - F M Y E K K L K Y T L N F D R R I I Y N L C M K K N - N I P - T L I E E S Y I I I Y V - K K I K I Y L E L - - K N H I . . . . . . 778 ACCCCTAAATAATTTTTTAAAAAAAATTACAAGTAATAAATATAAATTTAAAACTAATTT T P K - F F K K N Y K - - I - I - N - F P L N N F L K K I T S N K Y K F K T N F Y P - I I F - K K L Q V I N I N L K L I . : . . . . . 838 TT : GGGTATATGTGAGCCGTTTGAATAACGGTAAGGGCATATATAAACCACTT L : G I C E P F E - R - G H I - T T : W V Y V S R L N N G K G I Y K P L F : G Y M - A V - I T V R A Y I N H Maximal non-overlapping open reading frames (>= 64 codons): none AGS-2 (579 937) SCR (e 0.912) Exon 1 579 937 ( 359 n); score: 0.912 PGS (579 937) SGN-E243215+ PGS (687 937) SGN-E578389- 3-phase translation of AGS-2 (+strand): . . . . . . 579 AATTTTTTCTTCGAATACGAAATTAAATTACAATACACAAAAAAATTGTTTAAATTTTTT N F F F E Y E I K L Q Y T K K L F K F F I F S S N T K L N Y N T Q K N C L N F F F F L R I R N - I T I H K K I V - I F . . . . . . 639 TCTTTAAACTAAGTAATGAAAGAAAAAAACAAAATAAGAATAAGAAACTCAAATAATTAT S L N - V M K E K N K I R I R N S N N Y L - T K - - K K K T K - E - E T Q I I I F F K L S N E R K K Q N K N K K L K - L . . . . . . 699 AATAAGCGAAGTCAAAAAATAATTTATGTATGAAAAAAAATTAAAATATACCTTGAACTT N K R S Q K I I Y V - K K I K I Y L E L I S E V K K - F M Y E K K L K Y T L N F - - A K S K N N L C M K K N - N I P - T . . . . . . 
759 TGATAGAAGAATCATATATACCCCTAAATAATTTTTTAAAAAAAATTACAAGTAATAAAT - - K N H I Y P - I I F - K K L Q V I N D R R I I Y T P K - F F K K N Y K - - I L I E E S Y I P L N N F L K K I T S N K . . . . . . 819 ATAAATTTAAAACTAATTTTTTAACTTTCGTTAAATGAAGGGTATATGTGAGCCATTTTG I N L K L I F - L S L N E G Y M - A I L - I - N - F F N F R - M K G I C E P F C Y K F K T N F L T F V K - R V Y V S H F . . . . . . 879 TAACGGCAGGGGTATATGTGAGCCGTTTGAATAACGGTAAGGGCATATATAAACCACTT - R Q G Y M - A V - I T V R A Y I N H N G R G I C E P F E - R - G H I - T T V T A G V Y V S R L N N G K G I Y K P L Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-2 (-strand): . . . . . . 937 AAGTGGTTTATATATGCCCTTACCGTTATTCAAACGGCTCACATATACCCCTGCCGTTAC K W F I Y A L T V I Q T A H I Y P C R Y S G L Y M P L P L F K R L T Y T P A V T V V Y I C P Y R Y S N G S H I P L P L . . . . . . 877 AAAATGGCTCACATATACCCTTCATTTAACGAAAGTTAAAAAATTAGTTTTAAATTTATA K M A H I Y P S F N E S - K I S F K F I K W L T Y T L H L T K V K K L V L N L Y Q N G S H I P F I - R K L K N - F - I Y . . . . . . 817 TTTATTACTTGTAATTTTTTTTAAAAAATTATTTAGGGGTATATATGATTCTTCTATCAA F I T C N F F - K I I - G Y I - F F Y Q L L L V I F F K K L F R G I Y D S S I K I Y Y L - F F L K N Y L G V Y M I L L S . . . . . . 757 AGTTCAAGGTATATTTTAATTTTTTTTCATACATAAATTATTTTTTGACTTCGCTTATTA S S R Y I L I F F H T - I I F - L R L L V Q G I F - F F F I H K L F F D F A Y Y K F K V Y F N F F S Y I N Y F L T S L I . . . . . . 697 TAATTATTTGAGTTTCTTATTCTTATTTTGTTTTTTTCTTTCATTACTTAGTTTAAAGAA - L F E F L I L I L F F S F I T - F K E N Y L S F L F L F C F F L S L L S L K K I I I - V S Y S Y F V F F F H Y L V - R . . . . . . 
637 AAAAATTTAAACAATTTTTTTGTGTATTGTAATTTAATTTCGTATTCGAAGAAAAAATT K N L N N F F V Y C N L I S Y S K K K K I - T I F L C I V I - F R I R R K N K K F K Q F F C V L - F N F V F E E K I Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0112G05.1-5-_PGL-1_AGS-2_PPS_1 (936 739) (frame '2'; 195 bp, 65 residues) 1 SGLYMPLPLF KRLTYTPAVT KWLTYTLHLT KVKKLVLNLY LLLVIFFKKL FRGIYDSSIK 61 VQGIF- ... finished at: Mon Aug 28 22:17:29 2006 ________________________________________________________________________________ Sequence 6: C06HBa0112G05.1-6, from 1 to 9840, both strands analyzed. ... started at: Mon Aug 28 22:17:29 2006 EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 6 ... matches indexed, elapsed seconds = 6 HitsTableSize = 41 EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 29 ******************************************************************************** EST sequence 16 +strand 649 n (File: SGN-E539761+) 1 TACGTCAGAA ACTAGGAGTC TTGACAAACA GGTGGAGTTT CAAGTCATTC AGAACGAGAG 61 CGATTTAAAG GAACCTGAAG AGGAGGATCA AGAGCCACAG ACTGAAACTG ATATTCCAGA 121 ATCTATGCCA TCAGATATCC ATCAGAGTAT AGCTCAAGAT CGGCCAAGGA GGGTTGGAGT 181 TCGGCCACCT ACGAGGTATG GTTTTGAGGA CATGGTGGGT TATGCACTGC AGGTTGCTGA 241 AGAGGTAGAT ACATCTGAGC CGTCTACTTA CAAAGAAGCC ATTTTAAGTT CTGATTCTGA 301 AAAATGGTTT GCCGCTATGG GAGATGAGAT GGAGTCCCTA CACAAGAATC AGACATGGGA 361 TCTGGTCATA CAGCCTTCGG GGAGAAAGAT TATTACTTGC AAATGGGTTT TCAAGAAGAA 421 GGAAGGGATA TCACCAGCAG AAGGAGTCAA GTATAAAGTC AGGGTTGTTG CTAGAGGTTT 481 CAACCAAAGA GAGGGAGTGG ACTACAATGA GATCTTCTCA CCAGTGGTCA GACATACTTC 541 CATCCGAGTG TTACTAGCGA TAGTTGCACA TCAGAATCTG GAGCTTGAAA CACTTGATGT 601 GAAGACAGCG TTTCTACATG GAGAGTTGGA GGAAGAGATA TACATGACT Predicted gene structure (within gDNA segment 2793 to 4777): Exon 1 3519 4167 ( 649 n); 
cDNA 1 649 ( 649 n); score: 0.988 MATCH C06HBa0112G05.1-6+ SGN-E539761+ 0.988 649 1.000 C PGS_C06HBa0112G05.1-6+_SGN-E539761+ (3519 4167) Alignment (genomic DNA sequence = upper lines): TACGTCAGAA ACTGGGAGTC TTGACAAACA GGTGGAGTTT CAAGTCATTC AGAACGAGAG 3578 |||||||||| ||| |||||| |||||||||| |||||||||| |||||||||| |||||||||| TACGTCAGAA ACTAGGAGTC TTGACAAACA GGTGGAGTTT CAAGTCATTC AGAACGAGAG 60 CGATTTAAAG GAACCTGAAG AGGAGGATCA AGAGCCACAG ACAGAAACTG ATATTCCAGA 3638 |||||||||| |||||||||| |||||||||| |||||||||| || ||||||| |||||||||| CGATTTAAAG GAACCTGAAG AGGAGGATCA AGAGCCACAG ACTGAAACTG ATATTCCAGA 120 ATCTATGCCA TCAGATATCC ATCAGAGTAT AGATCAAGAT CGGCCAAGGA GGGTTGGAGT 3698 |||||||||| |||||||||| |||||||||| || ||||||| |||||||||| |||||||||| ATCTATGCCA TCAGATATCC ATCAGAGTAT AGCTCAAGAT CGGCCAAGGA GGGTTGGAGT 180 TCGGCCACCT ACGAGGTATG GTTTTGAGGA CATGGTGGGT TATGCACTGC AGGTTGCTGA 3758 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCGGCCACCT ACGAGGTATG GTTTTGAGGA CATGGTGGGT TATGCACTGC AGGTTGCTGA 240 AGAGGTAGAT ACATCTGAGC CGTCTACTTA CAAAGAAGCC ATTTTAAGTT CTGATTCTGA 3818 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGAGGTAGAT ACATCTGAGC CGTCTACTTA CAAAGAAGCC ATTTTAAGTT CTGATTCTGA 300 AAAATGGTTT GCCGCTATGG GAGATGAGAT GGAGTCCCTA CACAAGAATC AGACATGGGA 3878 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAATGGTTT GCCGCTATGG GAGATGAGAT GGAGTCCCTA CACAAGAATC AGACATGGGA 360 TCTGGTCATA CAGCCTTCGG GGAGAAAGAT TATTACTTGC AAATGGGTTT TCAAGAAGAA 3938 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCTGGTCATA CAGCCTTCGG GGAGAAAGAT TATTACTTGC AAATGGGTTT TCAAGAAGAA 420 GGAAGGGATA TCACCAGCAG AAGGAGTCAA GTATAAAGCC AGGGTTGTTG CCAGAGGTTT 3998 |||||||||| |||||||||| |||||||||| |||||||| | |||||||||| | |||||||| GGAAGGGATA TCACCAGCAG AAGGAGTCAA GTATAAAGTC AGGGTTGTTG CTAGAGGTTT 480 CAACCAAAGA GAGGGAGTGG ACTACAATGA GATCTTCTCA CCAGTGGTCA GACATACTTC 4058 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAACCAAAGA GAGGGAGTGG 
ACTACAATGA GATCTTCTCA CCAGTGGTCA GACATACTTC 540 CATCCGAGTG TTACTAGCGA TAGTTGCACA TCAGAATCTG GAGCTTGAAC AACTTGATGT 4118 |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| ||||||||| CATCCGAGTG TTACTAGCGA TAGTTGCACA TCAGAATCTG GAGCTTGAAA CACTTGATGT 600 GAAGACAGCG TTTTTACATG GAGAGTTGGA GGAAGAGATA TACATGACT 4167 |||||||||| ||| |||||| |||||||||| |||||||||| ||||||||| GAAGACAGCG TTTCTACATG GAGAGTTGGA GGAAGAGATA TACATGACT 649 hqPGS_C06HBa0112G05.1-6+_SGN-E539761+ (3519 4167) ******************************************************************************** EST sequence 30 +strand 385 n (File: SGN-E284788+) 1 ACGTTAGAAA CTAGGAGTCT TGACAAACAG GTGGAGTTTC AAGTCATTCA GAACGAGAGC 61 GATTTAAAGG AACCTGAAGA GGAGGATCAA GAGCCACAGA CTGAAACTGA TATTCCAGAA 121 TCTATGCCAT CAGATATCCA TCAGAGTATA GCTCAAGATC GGTCAAGGAG GGTTGGAGTT 181 CGGCCACCTA CGAGGTATGG TTTTGAGGAC ATGGTGGGTT ATGCACTGCA GGCTGCTGAA 241 GAGGTAGATA CATCTGAGCC GTCTACTTAC AAAGAAGCCA TTTTAAGTTC TGATTCTGAA 301 AAATGGTTTG CCGATATGGG AGATGAGATG GAGTCCCTAC ACAAGAATCA GACATGGGAT 361 CTGGTCATAC AGTCTTCGGG GAGAA Predicted gene structure (within gDNA segment 2803 to 4631): Exon 1 3520 3904 ( 385 n); cDNA 1 385 ( 385 n); score: 0.979 MATCH C06HBa0112G05.1-6+ SGN-E284788+ 0.979 385 1.000 C PGS_C06HBa0112G05.1-6+_SGN-E284788+ (3520 3904) Alignment (genomic DNA sequence = upper lines): ACGTCAGAAA CTGGGAGTCT TGACAAACAG GTGGAGTTTC AAGTCATTCA GAACGAGAGC 3579 |||| ||||| || ||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACGTTAGAAA CTAGGAGTCT TGACAAACAG GTGGAGTTTC AAGTCATTCA GAACGAGAGC 60 GATTTAAAGG AACCTGAAGA GGAGGATCAA GAGCCACAGA CAGAAACTGA TATTCCAGAA 3639 |||||||||| |||||||||| |||||||||| |||||||||| | |||||||| |||||||||| GATTTAAAGG AACCTGAAGA GGAGGATCAA GAGCCACAGA CTGAAACTGA TATTCCAGAA 120 TCTATGCCAT CAGATATCCA TCAGAGTATA GATCAAGATC GGCCAAGGAG GGTTGGAGTT 3699 |||||||||| |||||||||| |||||||||| | |||||||| || ||||||| |||||||||| TCTATGCCAT CAGATATCCA TCAGAGTATA GCTCAAGATC GGTCAAGGAG GGTTGGAGTT 180 CGGCCACCTA CGAGGTATGG TTTTGAGGAC ATGGTGGGTT 
ATGCACTGCA GGTTGCTGAA 3759 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| || ||||||| CGGCCACCTA CGAGGTATGG TTTTGAGGAC ATGGTGGGTT ATGCACTGCA GGCTGCTGAA 240 GAGGTAGATA CATCTGAGCC GTCTACTTAC AAAGAAGCCA TTTTAAGTTC TGATTCTGAA 3819 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAGGTAGATA CATCTGAGCC GTCTACTTAC AAAGAAGCCA TTTTAAGTTC TGATTCTGAA 300 AAATGGTTTG CCGCTATGGG AGATGAGATG GAGTCCCTAC ACAAGAATCA GACATGGGAT 3879 |||||||||| ||| |||||| |||||||||| |||||||||| |||||||||| |||||||||| AAATGGTTTG CCGATATGGG AGATGAGATG GAGTCCCTAC ACAAGAATCA GACATGGGAT 360 CTGGTCATAC AGCCTTCGGG GAGAA 3904 |||||||||| || ||||||| ||||| CTGGTCATAC AGTCTTCGGG GAGAA 385 hqPGS_C06HBa0112G05.1-6+_SGN-E284788+ (3520 3904) ******************************************************************************** EST sequence 32 +strand 808 n (File: SGN-E343338+) 1 GTTTCCAAGT TCCAGGGAAG GAAAATCACG TCTGCAAGTT GAAGAAGTCC TTATATGGAC 61 TTAAGCAGTC TCCAAGGCAG TGGTATAAAA GGTTTGACAG CTATATGGTG AAGTTGGGCT 121 ATACTCGGAG CTCATATGAT TGTTGTGTCT ACTACAATAG GCTCAATGAT GATTCATTCA 181 TCTATCTGGT GCTTTATGTA GATGATATGT TGATAGCTGC AAAGAAGAAG TATGACATTC 241 AGAAGCTGAA GGGTTTACTT AGTGCTGAGT TTGAGATGAA GGATCTGGGA GCCGCTCGGA 301 AGATTTTAGG GATGGAGATC ATTAGAGACA GAGAGAGAAG GAAACTTTTC TTGTCACAGA 361 GAAGCTACAT TCAGAAGGTC TTGGCGAGGT TTGGCATGTC TTCATCTAAG CCCATTGATA 421 CCCCCAGTGC TGCCAATATC CATCTCACTG CCATGTTCGC TCCACAGTCA GAAGAAGAGA 481 AGGAGTATAT GTCACGAGTC CCTTATGCCA GTGCCGTAAG AAGTTTGATG TATGCTATGG 541 TCTGTACAAG GCCAGATTTA GCACATGCAG TCAGTGTAGT GAGCAGATTC ATGGGACAAC 601 CAGGGAGAGA ACATTGGCAG GCTGTGAAGA GAATTTTCCG GTACCTTAGA GGTACATCTG 661 ACGTTGGTCT CATTTATGGA GGTGATACTC AGTGCTTGGT TACTGGCTAT TCTGATTCAG 721 ACTATGCTGG AGATGTTGAC ACAAGAAGAT CGATGACTGG CTATGTGTTT ACCCTGGGAG 781 GATCTGTCGT CAGTTGGAAG GCAACTTT Predicted gene structure (within gDNA segment 3578 to 5595): Exon 1 4178 4985 ( 808 n); cDNA 1 808 ( 808 n); score: 0.994 MATCH C06HBa0112G05.1-6+ SGN-E343338+ 0.994 808 1.000 C 
PGS_C06HBa0112G05.1-6+_SGN-E343338+ (4178 4985) Alignment (genomic DNA sequence = upper lines): GTTTCCAAGT TCCAGGGAAG GAAAATCACG TCTGCAAGTT GAAGAAGTCC TTATATGGAC 4237 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTTTCCAAGT TCCAGGGAAG GAAAATCACG TCTGCAAGTT GAAGAAGTCC TTATATGGAC 60 TTAAGCAGTC TCCAAGGCAG TGGTATAAAA GGTTTGACAG CTATATGGTG AAGTTGGGCT 4297 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTAAGCAGTC TCCAAGGCAG TGGTATAAAA GGTTTGACAG CTATATGGTG AAGTTGGGCT 120 ATACTCGGAG CTCATATGAT TGTTGTGTCT ACTACAATAG GCTCAATGAT GATTCATTCA 4357 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATACTCGGAG CTCATATGAT TGTTGTGTCT ACTACAATAG GCTCAATGAT GATTCATTCA 180 TCTATCTGGT GCTTTATGTA GATGATATGT TGATAGCTGC AAAGAAGAAG TATGACATTC 4417 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCTATCTGGT GCTTTATGTA GATGATATGT TGATAGCTGC AAAGAAGAAG TATGACATTC 240 AGAAGCTGAA GGGTTTACTT AGTGCTGAGT TTGAGATGAA GGATTTGGGA GCCGCTCGGA 4477 |||||||||| |||||||||| |||||||||| |||||||||| |||| ||||| |||||||||| AGAAGCTGAA GGGTTTACTT AGTGCTGAGT TTGAGATGAA GGATCTGGGA GCCGCTCGGA 300 AGATTTTAGG GATGGAGATC ATTAGAGACA GAGAGAGAAG GAAACTTTTC TTGTCACAGA 4537 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGATTTTAGG GATGGAGATC ATTAGAGACA GAGAGAGAAG GAAACTTTTC TTGTCACAGA 360 GAAGCTACAT TCAGAAGGTC TTGGCGAGGT TTGGCATGTC TTCATCTAAG CCCATTGATA 4597 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAAGCTACAT TCAGAAGGTC TTGGCGAGGT TTGGCATGTC TTCATCTAAG CCCATTGATA 420 CCCCCAGTGC TGCCAATATC CATCTCACTG CCATGTTCGC TCCACAGTCA GAAGAAGAGA 4657 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCCCCAGTGC TGCCAATATC CATCTCACTG CCATGTTCGC TCCACAGTCA GAAGAAGAGA 480 AGGAGTATAT GTCACGAGTC CCTTATGCCA GTGCCGTAGG AAGTTTAATG TATGCTATGG 4717 |||||||||| |||||||||| |||||||||| |||||||| | |||||| ||| |||||||||| AGGAGTATAT GTCACGAGTC CCTTATGCCA GTGCCGTAAG AAGTTTGATG TATGCTATGG 540 TCTGTACAAG GCCAGATTTA GCACATGCAG 
TCAGTGTAGT GAGCAGATTC ATGGGACAAC 4777 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCTGTACAAG GCCAGATTTA GCACATGCAG TCAGTGTAGT GAGCAGATTC ATGGGACAAC 600 CAGGGAGAGA ACATTGGCAG GCTGTGAAGA GAATTTTCCG GTACCTTAGA GGTACATCTG 4837 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAGGGAGAGA ACATTGGCAG GCTGTGAAGA GAATTTTCCG GTACCTTAGA GGTACATCTG 660 ACGTTGGTCT CATTTATGGA GGTGATACTC AATGCTTGGT TACTGGCTAT TCTGATTCAG 4897 |||||||||| |||||||||| |||||||||| | |||||||| |||||||||| |||||||||| ACGTTGGTCT CATTTATGGA GGTGATACTC AGTGCTTGGT TACTGGCTAT TCTGATTCAG 720 ACTATGCTGG AGATGTTGAC ACAAGAAGAT CGATGACTGG CTATGTGTTT ACCCTTGGAG 4957 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||| |||| ACTATGCTGG AGATGTTGAC ACAAGAAGAT CGATGACTGG CTATGTGTTT ACCCTGGGAG 780 GATCTGTCGT CAGTTGGAAG GCAACTTT 4985 |||||||||| |||||||||| |||||||| GATCTGTCGT CAGTTGGAAG GCAACTTT 808 hqPGS_C06HBa0112G05.1-6+_SGN-E343338+ (4178 4985) ******************************************************************************** EST sequence 1 +strand 567 n (File: SGN-E551070+) 1 TCTTTATATA ATACCAATTA CTTAAACTTA ATTGTTCTAA TTTTATTACT GCATTTCATT 61 GTAAAGCTAA TAATCCACTA TTTGGATCTT CTCCTGTTAT CAATGTATGT ATTTTATTTT 121 TAAAATTATA AGGATTTGGT CCTAATGATA CTAAATACTG AAATTCTGAT GGCTTCAAAA 181 CATAGATCAG CAGCCTTCAT GGTAAGAAAT CATAATGAAA TAGTTAGAGG AAAAACATTT 241 ATATCTGACC ATCTATTATC ATATATGGTA ATTAATGTCT TTGTTCCTAA ATTTTTTCTA 301 GTTAATCCTT TAATTCCTAT TACTATTAAT CCTAAATGCA TTAAATTTTT TCTAACATAT 361 TTTATTTCTT TTACAGATTC TTCATTTATG AGGGGTAAAG ATACAGAGTT TTTTCCTAAT 421 ACCGATAATT GTTCCTCTCT ATAATGTCTA TATAATTGAG ATTTTGAATT TGTATTATAT 481 AAAACTTTTG GATTTAATAA ATTTAATTGA TTCTGTTGGA AAATTTGTAT ATCTTTATCT 541 GTATCTATAA TTTCGTCTAA TTGTTGA Predicted gene structure (within gDNA segment 5114 to 9840): Exon 1 5714 5885 ( 172 n); cDNA 1 172 ( 172 n); score: 0.942 Intron 1 5886 6550 ( 665 n); Pd: 0.000 (s: 0.94), Pa: 0.000 (s: 0) Exon 2 6551 6586 ( 36 n); cDNA 173 207 ( 35 n); score: 0.528 Intron 2 
6587 6971 ( 385 n); Pd: 0.630 (s: 0), Pa: 0.353 (s: 0) Exon 3 6972 6984 ( 13 n); cDNA 208 220 ( 13 n); score: 0.769 Intron 3 6985 7034 ( 50 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.74) Exon 4 7035 7223 ( 189 n); cDNA 221 407 ( 187 n); score: 0.852 Intron 4 7224 9003 (1780 n); Pd: 0.000 (s: 0.90), Pa: 0.447 (s: 0.82) Exon 5 9004 9163 ( 160 n); cDNA 408 567 ( 160 n); score: 0.841 MATCH C06HBa0112G05.1-6+ SGN-E551070+ 0.878 570 1.005 C PGS_C06HBa0112G05.1-6+_SGN-E551070+ (5714 5885,6551 6586,6972 6984,7035 7223,9004 9163) Alignment (genomic DNA sequence = upper lines): TCTTTATATA ATACCAATTA CTTAAACTTA ATTGCTCTAA TTTTATTACT GCATTTCTTT 5773 |||||||||| |||||||||| |||||||||| |||| ||||| |||||||||| ||||||| || TCTTTATATA ATACCAATTA CTTAAACTTA ATTGTTCTAA TTTTATTACT GCATTTCATT 60 GTAAGGCTAC TAATCCACTA TTTGGGTCTT CTCCTGTTAT TAATGTATGT ATTTTATTTG 5833 |||| |||| |||||||||| ||||| |||| |||||||||| ||||||||| ||||||||| GTAAAGCTAA TAATCCACTA TTTGGATCTT CTCCTGTTAT CAATGTATGT ATTTTATTTT 120 TAAAATTATA TGGGTTAGGT CCTAATGATA CTAAATACTG AAATTCTGAT GGGAATTCTA 5893 |||||||||| || || ||| |||||||||| |||||||||| |||||||||| || TAAAATTATA AGGATTTGGT CCTAATGATA CTAAATACTG AAATTCTGAT GG........ 172 CTTTATAAGC TTCGCATAAA GATTTAGTCG ATTCTCTTAG GATTGTTTCT AAGTATTTAT 5953 .......... .......... .......... .......... .......... .......... 172 ACATGGTTTC AGCATCAGTT TCTGTATAAT TTTTTATAAC ATATGCTACT ACTATTCCTT 6013 .......... .......... .......... .......... .......... .......... 172 TCCATAAATC TATTACTGAA TTCCATCTTT GTGGATCATG TGCTGCTATA TTTAAAAATT 6073 .......... .......... .......... .......... .......... .......... 172 TTCCTTTACT TCCTCCTTCT TGGAGTATTA TTGGTTCTTC TATAGGCATT CTTTTTCCCA 6133 .......... .......... .......... .......... .......... .......... 172 TGGGTCTAAC TTCTGTTAGT GGAGTATCTC CTTGTCTATA TACCCTTCTT CGTGGTATAT 6193 .......... .......... .......... .......... .......... .......... 172 TTCCTGTGCT ATTATCTGCA GTATATCCTT GTTCATTTGT TCTTTGAAAT GGACTACTGG 6253 .......... .......... .......... .......... 
.......... .......... 172 AACTTGCTGT TTCCATTAAT TCTTTCTGAC TTAGTATAGA TTCTAACTCA GATATTTCAC 6313 .......... .......... .......... .......... .......... .......... 172 TATCATCACC TTGTATTTGT GTCATTATAT TATTACTTTT TTCTAATTTA TCTTGTAAAT 6373 .......... .......... .......... .......... .......... .......... 172 CTTTTTCTAA TCTATATCCT TTATTATCAG TTCTTTGTTC ACATATTTTA AAATGAGATA 6433 .......... .......... .......... .......... .......... .......... 172 TTGTTTGTTC TAAATTTAAG TCTTTCAATT TACATCTTTT ACATGTAAAT AATCTATTAC 6493 .......... .......... .......... .......... .......... .......... 172 TATTGTTTCT ACTTATTTCT TTAACTTTTT CTATAGCCAA ATCTACTGCA AATTCTTCTT 6553 ||| .......... .......... .......... .......... .......... .......CTT 175 CTAATATTAT TTCCATGTTC ATTCCTTCTA AAGGCATTTC TTCTACAACG CTTTCTTCTT 6613 | || || | || | ||| | || | CAAAACATAG AT-CAGCAGC CTTCATGGTA AGA....... .......... .......... 207 CAAAGTCTTT TTGATCTTCA TAATTGTAAT CTGTAAATCT AATTGAAGGT TTTCCTTTAG 6673 .......... .......... .......... .......... .......... .......... 207 AGTTTGTATA TATTAAATGA GCATCTGGTG TTAATGCTTC TTTTTCTTTA AAATCTCCTA 6733 .......... .......... .......... .......... .......... .......... 207 ATTTTCACTC TAATCCTACA TATTCTTCAG AGTCTATTTT TATCGGTTTT ATTAACTTTA 6793 .......... .......... .......... .......... .......... .......... 207 TCCCTTTATT TCCCATTACT TCTACTACGT CTTTTATTTG TAGTTTAAAT CTAGTATTAC 6853 .......... .......... .......... .......... .......... .......... 207 TATTATCTGT CATTTTTTCT AGAAATCCTA CACACATTAA TAAATTCTTA CCACTATGCA 6913 .......... .......... .......... .......... .......... .......... 207 TTTCTTCATA TCCTTTAGTT TGTATTCTTA TTCTTATTTG GGTTCCAAAT TCATTTAGAT 6973 | .......... .......... .......... .......... .......... ........AA 209 TCATCATAAA ATCTGGGCTT ATGTAGAAAA TTCCTCCATT ATTAGTCATA TCTACTTCTG 7033 |||| || || | TCATAATGAA A......... .......... .......... .......... .......... 
220 TTAATTATAT AATTGTCTTT TTTATGTCTG ACCATCTATT ATCATATATT GTAATTAATG 7093 || ||| | | | ||||| |||| |||||||||| ||||||||| |||||||||| .TAGTTAGAG GAAAAAC--A TTTATATCTG ACCATCTATT ATCATATATG GTAATTAATG 277 TCTTTGTTCC TAAGTTTTTC CTCGTTAATC CTTTAATTCC TATAACTATT AGTCCTATAT 7153 |||||||||| ||| ||||| || ||||||| |||||||||| ||| |||||| | ||||| || TCTTTGTTCC TAAATTTTTT CTAGTTAATC CTTTAATTCC TATTACTATT AATCCTAAAT 337 GCATTAGATC TTTTTTAACG TATTTTATTT CTTTTACAGA TTCAGCGTTT ATGAGTGGTA 7213 |||||| || |||| |||| |||||||||| |||||||||| ||| | ||| ||||| |||| GCATTAAATT TTTTCTAACA TATTTTATTT CTTTTACAGA TTCTTCATTT ATGAGGGGTA 397 ACGATACAGA GCCTTTTCCT AACTGTCACG ATCCAAATCG GGCCGCGACT AGCACCCACA 7273 | |||||||| AAGATACAGA .......... .......... .......... .......... .......... 407 CTTACCCTCC TATGTGAGCG AACCAACCAA TCCAAACCCC AACATTTTCA AACATAGTAA 7333 .......... .......... .......... .......... .......... .......... 407 CAGAATATAA TGCGGAAGAC TTAAACTCAT TAATGAAAAT CAATTAAATA ACTTCTAAAA 7393 .......... .......... .......... .......... .......... .......... 407 ACTCAACAAC TATTATTATC CCCAAAATCT GGAAGTCATC ATCACAAGAA CATCTACTTC 7453 .......... .......... .......... .......... .......... .......... 407 AAATTACTAA ATCTAAGATT ATCTAAGAAG CTAAAATACA TAAACAGCTA GTCCATGCCG 7513 .......... .......... .......... .......... .......... .......... 407 GAACTTCAAG GCATCAAGAC ATGAAGAGGA GGATCCAGTC CAAGCTAGAA GCATTAGCTC 7573 .......... .......... .......... .......... .......... .......... 407 ACCCTGAAAT CCGGAGTAAT GAAGACTGGC TAGATTTGCG GTTGAGTTGA AGACGACAGA 7633 .......... .......... .......... .......... .......... .......... 407 ACGTTTGCTG CACTCCACAA ATAATCAAAA AGAAAACATA CAAGTAGGGG TCAGTACAAA 7693 .......... .......... .......... .......... .......... .......... 407 ACACAGGTAC TGAGTAGATA TCATCGGCCA ACTCAAAATA GAAAACAGTA TATATCAGAT 7753 .......... .......... .......... .......... .......... .......... 407 AATATCATAA AATCAACTAC AGTACTCAAC ATGCGGCATT TACAATTACC ATAACCCTTG 7813 .......... .......... .......... 
.......... .......... .......... 407 GTCGCAACAC CAAGCTCATC AATGAGGACT CATGCCTCCC CATCATACTC ATTTGGGAAT 7873 .......... .......... .......... .......... .......... .......... 407 TAAGTTCCTT AAATTGAGTA TATTAACATA TTTCAAGATT CATTCTCTTT ACTAATCCTG 7933 .......... .......... .......... .......... .......... .......... 407 GTGTCAGAAC GTGACACCCG ATCCATATAT ACTATCCTGG TACCGGAACG TGGCACCCGA 7993 .......... .......... .......... .......... .......... .......... 407 TCCATATTCT ATCCTGGTGT CGGAACGTGA CACTCCGATC CTCATATACT ATCCTGGTAC 8053 .......... .......... .......... .......... .......... .......... 407 CGGAACGTGG CACCCGATCC ATATTCTATC CTGGTGTCGG AACGTGACAC TCCGATCCTC 8113 .......... .......... .......... .......... .......... .......... 407 ATATACTATC CTGGTACCGG AACGTGACAC CCGATCCCCT AATCTCACTA CTTTCGTTCA 8173 .......... .......... .......... .......... .......... .......... 407 TCAAGCCTTC TTGTATACTA AGGCATCATC ATTAACAAAG TAGATTAGGG TTTCTTTTTC 8233 .......... .......... .......... .......... .......... .......... 407 AAGATTTAGA ATTCAATAGC TTCATCATGC TTATCTCATC ACAATTATAT AATCACAATA 8293 .......... .......... .......... .......... .......... .......... 407 TGCAAACACA CAATTAAGCA TATAGAAGGG TTTACAACAC TACCCAATAC ATATCATTCG 8353 .......... .......... .......... .......... .......... .......... 407 ATATTAAGAG TTTACTACGA ATAGTGTAAA AACCATAACC TACCTCCATC GAAGATTAGT 8413 .......... .......... .......... .......... .......... .......... 407 GATCAAGCAA GCAAATTCCC CAAAGCTTTG TGTTTTCCTC TTCTCGTTCG ATCCTCTCTC 8473 .......... .......... .......... .......... .......... .......... 407 TCTTTTTGTT CTTTCTATTT TCTTTATTCA AACCCTCTTT CTTTTACCCT AATTAGCATA 8533 .......... .......... .......... .......... .......... .......... 407 TAATTAAGAA TAAAAGATGG CAATAATAAC CCACTAATTT ACTCAAGGTT ACCTTTTTTA 8593 .......... .......... .......... .......... .......... .......... 407 ACCCCCAAGT AATTAGACTT ATTAACATTA ACCCACTAAC TTTATAATTA AAGCAGGAAT 8653 .......... .......... .......... .......... .......... 
.......... 407 AGTAAAAAAC GTCCCTTAAA ACATTAAAGA AATCCGACTC AGCCTGGGAT TATGCAGCCT 8713 .......... .......... .......... .......... .......... .......... 407 GTGACGACTC GTCGTGCCTG CGACGGTCCG TCTTGCTGCT CCGTCACAGA GTTCAGAGAC 8773 .......... .......... .......... .......... .......... .......... 407 TCAATTTCCC TTAAAGAGTC TGTGACGGTC CGTCACGCCT GTGACGGTCC GTCCTGCCAT 8833 .......... .......... .......... .......... .......... .......... 407 TCCGTTACAA AGTTCAGAGA GTCGATTTCA GTACCCATTT TTCAGAATTT CTAAGTGTTT 8893 .......... .......... .......... .......... .......... .......... 407 TGAAACGAGA CCCCTCGACG GTCCGTCGTG CCCATGACGG TCCGTCGTGG GATCCGTCGT 8953 .......... .......... .......... .......... .......... .......... 407 CTCAACCATT TTTCCAGAAA TAACATTTGT TGCTCAAAAT GACTAAACAG GTCGTTACAC 9013 || || | | .......... .......... .......... .......... .......... GTTTTTTC-C 416 TAACACTGAT AAATGTTCTT CTCTATAATG TCTATATAGT TGAGATTTTG AATTTGTATT 9073 ||| || ||| || ||||| | |||||||||| |||||||| | |||||||||| |||||||||| TAATACCGAT AATTGTTCCT CTCTATAATG TCTATATAAT TGAGATTTTG AATTTGTATT 476 GTATAAAACT TTGATATTCA ATAAATTTTA TTGATTTTGT TGAAAGATTT G-ATATCCTT 9132 ||||||||| || ||| | |||||||| | |||||| ||| || || |||| | ||||| || ATATAAAACT TTTGGATTTA ATAAATTTAA TTGATTCTGT TGGAAAATTT GTATATCTTT 536 TTCTGTATCT ATTATTTCTC CTAATTGTTG A 9163 ||||||||| || ||||| |||||||||| | ATCTGTATCT ATAATTTCGT CTAATTGTTG A 567 hqPGS_C06HBa0112G05.1-6+_SGN-E551070+ (5714 5885,6551 6586,6972 6984,7035 7223,9004 9163) ******************************************************************************** EST sequence 34 +strand 552 n (File: SGN-E260038+) 1 CTTTATATAA TACCAATTAC TTAAACTTAA TTGTTCTAAT TTTATTACTG CATTTCATTG 61 TAAAGCTAAT AATCCACTAT TTGGATCTTC TCCTGTTATC AATGTATGTA TTTTATTTTT 121 AAAATTATAA GGATTTGGTC CTAATGATAC TAAATACTGA AATTCTGATG GCTTCAAAAC 181 ATAGATCAGC AGCCTTCATG GTAAGAAATC ATAATGAAAT AGTTAGAGGA AAAACATTTA 241 TATCTGACCA TCTATTATCA TATATGGTAA TTAATGTCTT TGTTCCTAAA TTTTTTCTAG 301 TTAATCCTTT AATTCCTATT ACTATTAATC 
CTAAATGCAT TAAATTTTTT CTAACATATT 361 TTATTTCTTT TACAGATTCT TCATTTATGA GGGGTAAAGA TACAGAGTTT TTTCCTAATA 421 CCGATAATTG TTCCTCTCTA TAATGTCTAT ATAATTGAGA TTTTGAATTT GTATTATATA 481 AAACTTTTGG ATTTAATAAA TTTAATTGAT TCTGTTGGAA AATTTGTATA TCTTTATCTG 541 TATCTATAAT TT Predicted gene structure (within gDNA segment 5115 to 9840): Exon 1 5715 5885 ( 171 n); cDNA 1 171 ( 171 n); score: 0.942 Intron 1 5886 6550 ( 665 n); Pd: 0.000 (s: 0.94), Pa: 0.000 (s: 0) Exon 2 6551 6586 ( 36 n); cDNA 172 206 ( 35 n); score: 0.528 Intron 2 6587 6971 ( 385 n); Pd: 0.630 (s: 0), Pa: 0.353 (s: 0) Exon 3 6972 6984 ( 13 n); cDNA 207 219 ( 13 n); score: 0.769 Intron 3 6985 7034 ( 50 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.74) Exon 4 7035 7223 ( 189 n); cDNA 220 406 ( 187 n); score: 0.852 Intron 4 7224 9003 (1780 n); Pd: 0.000 (s: 0.90), Pa: 0.447 (s: 0.82) Exon 5 9004 9149 ( 146 n); cDNA 407 552 ( 146 n); score: 0.839 MATCH C06HBa0112G05.1-6+ SGN-E260038+ 0.878 555 1.005 C PGS_C06HBa0112G05.1-6+_SGN-E260038+ (5715 5885,6551 6586,6972 6984,7035 7223,9004 9149) Alignment (genomic DNA sequence = upper lines): CTTTATATAA TACCAATTAC TTAAACTTAA TTGCTCTAAT TTTATTACTG CATTTCTTTG 5774 |||||||||| |||||||||| |||||||||| ||| |||||| |||||||||| |||||| ||| CTTTATATAA TACCAATTAC TTAAACTTAA TTGTTCTAAT TTTATTACTG CATTTCATTG 60 TAAGGCTACT AATCCACTAT TTGGGTCTTC TCCTGTTATT AATGTATGTA TTTTATTTGT 5834 ||| |||| | |||||||||| |||| ||||| ||||||||| |||||||||| |||||||| | TAAAGCTAAT AATCCACTAT TTGGATCTTC TCCTGTTATC AATGTATGTA TTTTATTTTT 120 AAAATTATAT GGGTTAGGTC CTAATGATAC TAAATACTGA AATTCTGATG GGAATTCTAC 5894 ||||||||| || || |||| |||||||||| |||||||||| |||||||||| | AAAATTATAA GGATTTGGTC CTAATGATAC TAAATACTGA AATTCTGATG G......... 171 TTTATAAGCT TCGCATAAAG ATTTAGTCGA TTCTCTTAGG ATTGTTTCTA AGTATTTATA 5954 .......... .......... .......... .......... .......... .......... 171 CATGGTTTCA GCATCAGTTT CTGTATAATT TTTTATAACA TATGCTACTA CTATTCCTTT 6014 .......... .......... .......... .......... .......... .......... 
171 CCATAAATCT ATTACTGAAT TCCATCTTTG TGGATCATGT GCTGCTATAT TTAAAAATTT 6074 .......... .......... .......... .......... .......... .......... 171 TCCTTTACTT CCTCCTTCTT GGAGTATTAT TGGTTCTTCT ATAGGCATTC TTTTTCCCAT 6134 .......... .......... .......... .......... .......... .......... 171 GGGTCTAACT TCTGTTAGTG GAGTATCTCC TTGTCTATAT ACCCTTCTTC GTGGTATATT 6194 .......... .......... .......... .......... .......... .......... 171 TCCTGTGCTA TTATCTGCAG TATATCCTTG TTCATTTGTT CTTTGAAATG GACTACTGGA 6254 .......... .......... .......... .......... .......... .......... 171 ACTTGCTGTT TCCATTAATT CTTTCTGACT TAGTATAGAT TCTAACTCAG ATATTTCACT 6314 .......... .......... .......... .......... .......... .......... 171 ATCATCACCT TGTATTTGTG TCATTATATT ATTACTTTTT TCTAATTTAT CTTGTAAATC 6374 .......... .......... .......... .......... .......... .......... 171 TTTTTCTAAT CTATATCCTT TATTATCAGT TCTTTGTTCA CATATTTTAA AATGAGATAT 6434 .......... .......... .......... .......... .......... .......... 171 TGTTTGTTCT AAATTTAAGT CTTTCAATTT ACATCTTTTA CATGTAAATA ATCTATTACT 6494 .......... .......... .......... .......... .......... .......... 171 ATTGTTTCTA CTTATTTCTT TAACTTTTTC TATAGCCAAA TCTACTGCAA ATTCTTCTTC 6554 |||| .......... .......... .......... .......... .......... ......CTTC 175 TAATATTATT TCCATGTTCA TTCCTTCTAA AGGCATTTCT TCTACAACGC TTTCTTCTTC 6614 || || | || | ||| | ||| AAAACATAGA T-CAGCAGCC TTCATGGTAA GA........ .......... .......... 206 AAAGTCTTTT TGATCTTCAT AATTGTAATC TGTAAATCTA ATTGAAGGTT TTCCTTTAGA 6674 .......... .......... .......... .......... .......... .......... 206 GTTTGTATAT ATTAAATGAG CATCTGGTGT TAATGCTTCT TTTTCTTTAA AATCTCCTAA 6734 .......... .......... .......... .......... .......... .......... 206 TTTTCACTCT AATCCTACAT ATTCTTCAGA GTCTATTTTT ATCGGTTTTA TTAACTTTAT 6794 .......... .......... .......... .......... .......... .......... 206 CCCTTTATTT CCCATTACTT CTACTACGTC TTTTATTTGT AGTTTAAATC TAGTATTACT 6854 .......... .......... .......... .......... .......... 
.......... 206 ATTATCTGTC ATTTTTTCTA GAAATCCTAC ACACATTAAT AAATTCTTAC CACTATGCAT 6914 .......... .......... .......... .......... .......... .......... 206 TTCTTCATAT CCTTTAGTTT GTATTCTTAT TCTTATTTGG GTTCCAAATT CATTTAGATT 6974 | | .......... .......... .......... .......... .......... .......AAT 209 CATCATAAAA TCTGGGCTTA TGTAGAAAAT TCCTCCATTA TTAGTCATAT CTACTTCTGT 7034 ||| || ||| CATAATGAAA .......... .......... .......... .......... .......... 219 TAATTATATA ATTGTCTTTT TTATGTCTGA CCATCTATTA TCATATATTG TAATTAATGT 7094 || ||| | | | | |||| ||||| |||||||||| |||||||| | |||||||||| TAGTTAGAGG AAAAAC--AT TTATATCTGA CCATCTATTA TCATATATGG TAATTAATGT 277 CTTTGTTCCT AAGTTTTTCC TCGTTAATCC TTTAATTCCT ATAACTATTA GTCCTATATG 7154 |||||||||| || ||||| | | |||||||| |||||||||| || ||||||| ||||| ||| CTTTGTTCCT AAATTTTTTC TAGTTAATCC TTTAATTCCT ATTACTATTA ATCCTAAATG 337 CATTAGATCT TTTTTAACGT ATTTTATTTC TTTTACAGAT TCAGCGTTTA TGAGTGGTAA 7214 ||||| || | ||| |||| | |||||||||| |||||||||| || | |||| |||| ||||| CATTAAATTT TTTCTAACAT ATTTTATTTC TTTTACAGAT TCTTCATTTA TGAGGGGTAA 397 CGATACAGAG CCTTTTCCTA ACTGTCACGA TCCAAATCGG GCCGCGACTA GCACCCACAC 7274 |||||||| AGATACAGA. .......... .......... .......... .......... .......... 406 TTACCCTCCT ATGTGAGCGA ACCAACCAAT CCAAACCCCA ACATTTTCAA ACATAGTAAC 7334 .......... .......... .......... .......... .......... .......... 406 AGAATATAAT GCGGAAGACT TAAACTCATT AATGAAAATC AATTAAATAA CTTCTAAAAA 7394 .......... .......... .......... .......... .......... .......... 406 CTCAACAACT ATTATTATCC CCAAAATCTG GAAGTCATCA TCACAAGAAC ATCTACTTCA 7454 .......... .......... .......... .......... .......... .......... 406 AATTACTAAA TCTAAGATTA TCTAAGAAGC TAAAATACAT AAACAGCTAG TCCATGCCGG 7514 .......... .......... .......... .......... .......... .......... 406 AACTTCAAGG CATCAAGACA TGAAGAGGAG GATCCAGTCC AAGCTAGAAG CATTAGCTCA 7574 .......... .......... .......... .......... .......... .......... 
406 CCCTGAAATC CGGAGTAATG AAGACTGGCT AGATTTGCGG TTGAGTTGAA GACGACAGAA 7634 .......... .......... .......... .......... .......... .......... 406 CGTTTGCTGC ACTCCACAAA TAATCAAAAA GAAAACATAC AAGTAGGGGT CAGTACAAAA 7694 .......... .......... .......... .......... .......... .......... 406 CACAGGTACT GAGTAGATAT CATCGGCCAA CTCAAAATAG AAAACAGTAT ATATCAGATA 7754 .......... .......... .......... .......... .......... .......... 406 ATATCATAAA ATCAACTACA GTACTCAACA TGCGGCATTT ACAATTACCA TAACCCTTGG 7814 .......... .......... .......... .......... .......... .......... 406 TCGCAACACC AAGCTCATCA ATGAGGACTC ATGCCTCCCC ATCATACTCA TTTGGGAATT 7874 .......... .......... .......... .......... .......... .......... 406 AAGTTCCTTA AATTGAGTAT ATTAACATAT TTCAAGATTC ATTCTCTTTA CTAATCCTGG 7934 .......... .......... .......... .......... .......... .......... 406 TGTCAGAACG TGACACCCGA TCCATATATA CTATCCTGGT ACCGGAACGT GGCACCCGAT 7994 .......... .......... .......... .......... .......... .......... 406 CCATATTCTA TCCTGGTGTC GGAACGTGAC ACTCCGATCC TCATATACTA TCCTGGTACC 8054 .......... .......... .......... .......... .......... .......... 406 GGAACGTGGC ACCCGATCCA TATTCTATCC TGGTGTCGGA ACGTGACACT CCGATCCTCA 8114 .......... .......... .......... .......... .......... .......... 406 TATACTATCC TGGTACCGGA ACGTGACACC CGATCCCCTA ATCTCACTAC TTTCGTTCAT 8174 .......... .......... .......... .......... .......... .......... 406 CAAGCCTTCT TGTATACTAA GGCATCATCA TTAACAAAGT AGATTAGGGT TTCTTTTTCA 8234 .......... .......... .......... .......... .......... .......... 406 AGATTTAGAA TTCAATAGCT TCATCATGCT TATCTCATCA CAATTATATA ATCACAATAT 8294 .......... .......... .......... .......... .......... .......... 406 GCAAACACAC AATTAAGCAT ATAGAAGGGT TTACAACACT ACCCAATACA TATCATTCGA 8354 .......... .......... .......... .......... .......... .......... 406 TATTAAGAGT TTACTACGAA TAGTGTAAAA ACCATAACCT ACCTCCATCG AAGATTAGTG 8414 .......... .......... .......... .......... .......... .......... 
406 ATCAAGCAAG CAAATTCCCC AAAGCTTTGT GTTTTCCTCT TCTCGTTCGA TCCTCTCTCT 8474 .......... .......... .......... .......... .......... .......... 406 CTTTTTGTTC TTTCTATTTT CTTTATTCAA ACCCTCTTTC TTTTACCCTA ATTAGCATAT 8534 .......... .......... .......... .......... .......... .......... 406 AATTAAGAAT AAAAGATGGC AATAATAACC CACTAATTTA CTCAAGGTTA CCTTTTTTAA 8594 .......... .......... .......... .......... .......... .......... 406 CCCCCAAGTA ATTAGACTTA TTAACATTAA CCCACTAACT TTATAATTAA AGCAGGAATA 8654 .......... .......... .......... .......... .......... .......... 406 GTAAAAAACG TCCCTTAAAA CATTAAAGAA ATCCGACTCA GCCTGGGATT ATGCAGCCTG 8714 .......... .......... .......... .......... .......... .......... 406 TGACGACTCG TCGTGCCTGC GACGGTCCGT CTTGCTGCTC CGTCACAGAG TTCAGAGACT 8774 .......... .......... .......... .......... .......... .......... 406 CAATTTCCCT TAAAGAGTCT GTGACGGTCC GTCACGCCTG TGACGGTCCG TCCTGCCATT 8834 .......... .......... .......... .......... .......... .......... 406 CCGTTACAAA GTTCAGAGAG TCGATTTCAG TACCCATTTT TCAGAATTTC TAAGTGTTTT 8894 .......... .......... .......... .......... .......... .......... 406 GAAACGAGAC CCCTCGACGG TCCGTCGTGC CCATGACGGT CCGTCGTGGG ATCCGTCGTC 8954 .......... .......... .......... .......... .......... .......... 406 TCAACCATTT TTCCAGAAAT AACATTTGTT GCTCAAAATG ACTAAACAGG TCGTTACACT 9014 | | || | || .......... .......... .......... .......... 
.........G TTTTTTC-CT 416 AACACTGATA AATGTTCTTC TCTATAATGT CTATATAGTT GAGATTTTGA ATTTGTATTG 9074 || || |||| | ||||| || |||||||||| ||||||| || |||||||||| ||||||||| AATACCGATA ATTGTTCCTC TCTATAATGT CTATATAATT GAGATTTTGA ATTTGTATTA 476 TATAAAACTT TGATATTCAA TAAATTTTAT TGATTTTGTT GAAAGATTTG -ATATCCTTT 9133 |||||||||| | ||| || ||||||| || ||||| |||| | || ||||| ||||| || TATAAAACTT TTGGATTTAA TAAATTTAAT TGATTCTGTT GGAAAATTTG TATATCTTTA 536 TCTGTATCTA TTATTT 9149 |||||||||| | |||| TCTGTATCTA TAATTT 552 hqPGS_C06HBa0112G05.1-6+_SGN-E260038+ (5715 5885,6551 6586,6972 6984,7035 7223,9004 9149) ******************************************************************************** EST sequence 38 +strand 575 n (File: SGN-E301194+) 1 CCTTTTAGGG CGTAGCTTAG CACTATATAT AGACGCTATG GCAAACCCTA TTCTGTAATT 61 CTGTTTTTGC CTCTCCATAA TAAAATTGCT CCCTCTCTTC CCGTGGACGT AGCCAATTTA 121 TTGGTGAACC ACGTAAATCT GTTGTCTTGT TTTTCGCGTT TATATATTTT CTCGTATTAT 181 CTCAAATTTC GCACAACACT CTTAATATTC ATAACTATCA TCTTTTCATA TTCATAACCT 241 CCAAATATTT AAATTAAACT TTAAGAAATC TTTTGGTATT CCTTCTATTC TATTTGTATA 301 AATTCAACTT CTTTATCTCA TGAAACCCCT ATCAAGATTA TTATTTTTAT TCTATAGTAA 361 AAATAGATGC TGAAAACTCT TGAATTTTGA TAGGATATGA AAGGAGTCGA TAAAAACTCA 421 GAGAGTTATG TACTAATTTT TACTTATTTT TTCATCTATA TATACATCAA TCTTATAAGA 481 ATAATGTCTA TATTGTATTT TTTTCTTAAA TATTCTGTTT CTTTTAGTCT TTTTTTTCAC 541 TCTGTTAGAC TTCTTAATTT AGTTTTCTAT GAATG Predicted gene structure (within gDNA segment 384 to 6429): Exon 1 5480 5666 ( 187 n); cDNA 11 198 ( 188 n); score: 0.936 MATCH C06HBa0112G05.1-6+ SGN-E301194+ 0.936 187 0.325 C PGS_C06HBa0112G05.1-6+_SGN-E301194+ (5480 5666) Alignment (genomic DNA sequence = upper lines): CGCTGCCTGG TCACTATATA TAGACGCTAT GGCAAACCCT ATTCTGTAAT TCTGTTTTTG 5539 || || | | ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CGTAGCTTAG -CACTATATA TAGACGCTAT GGCAAACCCT ATTCTGTAAT TCTGTTTTTG 69 CCTCTCCATA ATAAAATTGC TCCCTCTCTT CCCGTGGACG TAGCCAATTT ATTGGTGAAC 5599 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| 
|||||||||| CCTCTCCATA ATAAAATTGC TCCCTCTCTT CCCGTGGACG TAGCCAATTT ATTGGTGAAC 129 CACGTAAATC TGTTGTCTTG TTTTTCGC-A TT-TATATTT TCTCGTATTA TCTCAAATTC 5657 |||||||||| |||||||||| |||||||| || ||||||| |||||||||| ||||||||| CACGTAAATC TGTTGTCTTG TTTTTCGCGT TTATATATTT TCTCGTATTA TCTCAAATTT 189 CGCACAACA 5666 ||||||||| CGCACAACA 198 hqPGS_C06HBa0112G05.1-6+_SGN-E301194+ (5480 5666) ******************************************************************************** EST sequence 36 +strand 641 n (File: SGN-E301078+) 1 CCTTTTAGGG TGTAGCTTAG CACTATATAT AGACGCTATG GCAAACCCTA TTCTGTAATT 61 CTGTTTTTGC CTCTCCATAA TAAAATTGCT CCCTCTCTTC CCGTGGACGT AGCCAATTTA 121 TTGGTGAACC ACGTAAAACT GGTGTCTTGG TTTTCGCGTT TATATATTTT CTCGTATTAT 181 CTCAAATTTC GCACAACACT CTTAATATTC ATAACTATCA TCTTTTCATA TTCATAACCT 241 CCAAATATTT AAATTAAACT TTAAGATATC TTTTGGTATT CCTTCTATTC TATTTGTATA 301 AATTCAACTT CTTTATCTCA TGAAACCCCT ATCAAGATTA TTATTTTTAT TCTATAGTAA 361 AAATAGATGC TGAAAACTCT TGAATTTTGA TAGGATATGA AAGGAGTCGA TAAAAACTCA 421 GAGAGTTATG TACTAATTTT TACTTATTTT TTCATCTATA TATACATCAA TCTTATAAGA 481 ATAATGTCTA TATTGTATTT TTTTCTTAAA TATTCTGTTT CTTTTAGTCT TTTTTTTCAC 541 TCTGTTAGAC TTCTTAATTT AGTTTTCTAT GAATGATTTA TTGTCGTATG TCTTTGAATT 601 TTGTAATTGT TACATTTTAT TATTCATTAC AATTTACATA T Predicted gene structure (within gDNA segment 384 to 8150): Exon 1 5484 5667 ( 184 n); cDNA 15 199 ( 185 n); score: 0.924 Intron 1 5668 6476 ( 809 n); Pd: 0.990 (s: 0.81), Pa: 0.000 (s: 0.60) Exon 2 6477 6569 ( 93 n); cDNA 200 288 ( 89 n); score: 0.581 Intron 2 6570 6765 ( 196 n); Pd: 0.000 (s: 0.52), Pa: 0.000 (s: 0) Exon 3 6766 6778 ( 13 n); cDNA 289 301 ( 13 n); score: 0.769 Intron 3 6779 7192 ( 414 n); Pd: 0.916 (s: 0), Pa: 0.999 (s: 0) Exon 4 7193 7197 ( 5 n); cDNA 302 306 ( 5 n); score: 1.000 MATCH C06HBa0112G05.1-6+ SGN-E301078+ 0.809 295 0.460 C PGS_C06HBa0112G05.1-6+_SGN-E301078+ (5484 5667,6477 6569,6766 6778,7193 7197) Alignment (genomic DNA sequence = upper lines): GCCTGGTCAC TATATATAGA CGCTATGGCA AACCCTATTC TGTAATTCTG 
TTTTTGCCTC 5543 || | | ||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCTTAG-CAC TATATATAGA CGCTATGGCA AACCCTATTC TGTAATTCTG TTTTTGCCTC 73 TCCATAATAA AATTGCTCCC TCTCTTCCCG TGGACGTAGC CAATTTATTG GTGAACCACG 5603 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCCATAATAA AATTGCTCCC TCTCTTCCCG TGGACGTAGC CAATTTATTG GTGAACCACG 133 TAAATCTGTT GTCTTGTTTT TCGC-ATT-T ATATTTTCTC GTATTATCTC AAATTCCGCA 5661 |||| ||| | |||||| ||| |||| || | |||||||||| |||||||||| ||||| |||| TAAAACTGGT GTCTTGGTTT TCGCGTTTAT ATATTTTCTC GTATTATCTC AAATTTCGCA 193 CAACAGGCAT TTCCACTAAT TGTACAATAG TAGAAATAAT CATTTAAAAA TTTCTTTATA 5721 ||||| CAACAC.... .......... .......... .......... .......... .......... 199 TAATACCAAT TACTTAAACT TAATTGCTCT AATTTTATTA CTGCATTTCT TTGTAAGGCT 5781 .......... .......... .......... .......... .......... .......... 199 ACTAATCCAC TATTTGGGTC TTCTCCTGTT ATTAATGTAT GTATTTTATT TGTAAAATTA 5841 .......... .......... .......... .......... .......... .......... 199 TATGGGTTAG GTCCTAATGA TACTAAATAC TGAAATTCTG ATGGGAATTC TACTTTATAA 5901 .......... .......... .......... .......... .......... .......... 199 GCTTCGCATA AAGATTTAGT CGATTCTCTT AGGATTGTTT CTAAGTATTT ATACATGGTT 5961 .......... .......... .......... .......... .......... .......... 199 TCAGCATCAG TTTCTGTATA ATTTTTTATA ACATATGCTA CTACTATTCC TTTCCATAAA 6021 .......... .......... .......... .......... .......... .......... 199 TCTATTACTG AATTCCATCT TTGTGGATCA TGTGCTGCTA TATTTAAAAA TTTTCCTTTA 6081 .......... .......... .......... .......... .......... .......... 199 CTTCCTCCTT CTTGGAGTAT TATTGGTTCT TCTATAGGCA TTCTTTTTCC CATGGGTCTA 6141 .......... .......... .......... .......... .......... .......... 199 ACTTCTGTTA GTGGAGTATC TCCTTGTCTA TATACCCTTC TTCGTGGTAT ATTTCCTGTG 6201 .......... .......... .......... .......... .......... .......... 199 CTATTATCTG CAGTATATCC TTGTTCATTT GTTCTTTGAA ATGGACTACT GGAACTTGCT 6261 .......... .......... .......... .......... .......... .......... 
199 GTTTCCATTA ATTCTTTCTG ACTTAGTATA GATTCTAACT CAGATATTTC ACTATCATCA 6321 .......... .......... .......... .......... .......... .......... 199 CCTTGTATTT GTGTCATTAT ATTATTACTT TTTTCTAATT TATCTTGTAA ATCTTTTTCT 6381 .......... .......... .......... .......... .......... .......... 199 AATCTATATC CTTTATTATC AGTTCTTTGT TCACATATTT TAAAATGAGA TATTGTTTGT 6441 .......... .......... .......... .......... .......... .......... 199 TCTAAATTTA AGTCTTTCAA TTTACATCTT TTACATGTAA ATAATCTATT ACTATTGTTT 6501 | | | ||| || || ||||| | | .......... .......... .......... .....TCTTA ATATTC-ATA ACTATCATCT 223 CTACTTATTT CTTTAACTTT TTCTATAGCC AAATCTACTG CAAATTCTTC TTCTAATATT 6561 | | || || | | | || ||| || | ||| || | || || | |||| TTTCATA-TT CATAACCTCC AAATAT-TTA AATTAAACTT TAAGAT-ATC TTTTGGTATT 280 ATTTCCATGT TCATTCCTTC TAAAGGCATT TCTTCTACAA CGCTTTCTTC TTCAAAGTCT 6621 ||| || CCTTCTAT.. .......... .......... .......... .......... .......... 288 TTTTGATCTT CATAATTGTA ATCTGTAAAT CTAATTGAAG GTTTTCCTTT AGAGTTTGTA 6681 .......... .......... .......... .......... .......... .......... 288 TATATTAAAT GAGCATCTGG TGTTAATGCT TCTTTTTCTT TAAAATCTCC TAATTTTCAC 6741 .......... .......... .......... .......... .......... .......... 288 TCTAATCCTA CATATTCTTC AGAGTCTATT TTTATCGGTT TTATTAACTT TATCCCTTTA 6801 |||||| | ||| .......... .......... ....TCTATT TGTATAA... .......... .......... 301 TTTCCCATTA CTTCTACTAC GTCTTTTATT TGTAGTTTAA ATCTAGTATT ACTATTATCT 6861 .......... .......... .......... .......... .......... .......... 301 GTCATTTTTT CTAGAAATCC TACACACATT AATAAATTCT TACCACTATG CATTTCTTCA 6921 .......... .......... .......... .......... .......... .......... 301 TATCCTTTAG TTTGTATTCT TATTCTTATT TGGGTTCCAA ATTCATTTAG ATTCATCATA 6981 .......... .......... .......... .......... .......... .......... 301 AAATCTGGGC TTATGTAGAA AATTCCTCCA TTATTAGTCA TATCTACTTC TGTTAATTAT 7041 .......... .......... .......... .......... .......... .......... 
301 ATAATTGTCT TTTTTATGTC TGACCATCTA TTATCATATA TTGTAATTAA TGTCTTTGTT 7101 .......... .......... .......... .......... .......... .......... 301 CCTAAGTTTT TCCTCGTTAA TCCTTTAATT CCTATAACTA TTAGTCCTAT ATGCATTAGA 7161 .......... .......... .......... .......... .......... .......... 301 TCTTTTTTAA CGTATTTTAT TTCTTTTACA GATTCA 7197 ||||| .......... .......... .......... .ATTCA 306 hqPGS_C06HBa0112G05.1-6+_SGN-E301078+ (5484 5667) ******************************************************************************** EST sequence 37 +strand 575 n (File: SGN-E301194+) 1 CCTTTTAGGG CGTAGCTTAG CACTATATAT AGACGCTATG GCAAACCCTA TTCTGTAATT 61 CTGTTTTTGC CTCTCCATAA TAAAATTGCT CCCTCTCTTC CCGTGGACGT AGCCAATTTA 121 TTGGTGAACC ACGTAAATCT GTTGTCTTGT TTTTCGCGTT TATATATTTT CTCGTATTAT 181 CTCAAATTTC GCACAACACT CTTAATATTC ATAACTATCA TCTTTTCATA TTCATAACCT 241 CCAAATATTT AAATTAAACT TTAAGAAATC TTTTGGTATT CCTTCTATTC TATTTGTATA 301 AATTCAACTT CTTTATCTCA TGAAACCCCT ATCAAGATTA TTATTTTTAT TCTATAGTAA 361 AAATAGATGC TGAAAACTCT TGAATTTTGA TAGGATATGA AAGGAGTCGA TAAAAACTCA 421 GAGAGTTATG TACTAATTTT TACTTATTTT TTCATCTATA TATACATCAA TCTTATAAGA 481 ATAATGTCTA TATTGTATTT TTTTCTTAAA TATTCTGTTT CTTTTAGTCT TTTTTTTCAC 541 TCTGTTAGAC TTCTTAATTT AGTTTTCTAT GAATG Predicted gene structure (within gDNA segment 4691 to 9840): Exon 1 5494 5667 ( 174 n); cDNA 24 199 ( 176 n); score: 0.954 Intron 1 5668 5804 ( 137 n); Pd: 0.990 (s: 0.83), Pa: 0.000 (s: 0) Exon 2 5805 5817 ( 13 n); cDNA 200 212 ( 13 n); score: 0.692 Intron 2 5818 6459 ( 642 n); Pd: 0.413 (s: 0), Pa: 0.000 (s: 0) Exon 3 6460 6477 ( 18 n); cDNA 213 229 ( 17 n); score: 0.778 Intron 3 6478 7192 ( 715 n); Pd: 0.364 (s: 0), Pa: 0.999 (s: 0) Exon 4 7193 7197 ( 5 n); cDNA 230 234 ( 5 n); score: 1.000 Intron 4 7198 8591 (1394 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0) Exon 5 8592 8601 ( 10 n); cDNA 235 244 ( 10 n); score: 0.900 Intron 5 8602 9003 ( 402 n); Pd: 0.600 (s: 0), Pa: 0.447 (s: 0) Exon 6 9004 9027 ( 24 n); cDNA 245 266 ( 22 n); score: 0.583 Intron 6 9028 
9235 ( 208 n); Pd: 0.258 (s: 0), Pa: 0.366 (s: 0) Exon 7 9236 9274 ( 39 n); cDNA 267 301 ( 35 n); score: 0.667 MATCH C06HBa0112G05.1-6+ SGN-E301194+ 0.954 283 0.492 C PGS_C06HBa0112G05.1-6+_SGN-E301194+ (5494 5667,5805 5817,6460 6477,7193 7197,8592 8601,9004 9027,9236 9274) Alignment (genomic DNA sequence = upper lines): TATATATAGA CGCTATGGCA AACCCTATTC TGTAATTCTG TTTTTGCCTC TCCATAATAA 5553 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TATATATAGA CGCTATGGCA AACCCTATTC TGTAATTCTG TTTTTGCCTC TCCATAATAA 83 AATTGCTCCC TCTCTTCCCG TGGACGTAGC CAATTTATTG GTGAACCACG TAAATCTGTT 5613 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AATTGCTCCC TCTCTTCCCG TGGACGTAGC CAATTTATTG GTGAACCACG TAAATCTGTT 143 GTCTTGTTTT TCGC-ATT-T ATATTTTCTC GTATTATCTC AAATTCCGCA CAACAGGCAT 5671 |||||||||| |||| || | |||||||||| |||||||||| ||||| |||| ||||| GTCTTGTTTT TCGCGTTTAT ATATTTTCTC GTATTATCTC AAATTTCGCA CAACAC.... 199 TTCCACTAAT TGTACAATAG TAGAAATAAT CATTTAAAAA TTTCTTTATA TAATACCAAT 5731 .......... .......... .......... .......... .......... .......... 199 TACTTAAACT TAATTGCTCT AATTTTATTA CTGCATTTCT TTGTAAGGCT ACTAATCCAC 5791 .......... .......... .......... .......... .......... .......... 199 TATTTGGGTC TTCTCCTGTT ATTAATGTAT GTATTTTATT TGTAAAATTA TATGGGTTAG 5851 || | | ||| || .......... ...TCTTAAT ATTCAT.... .......... .......... .......... 212 GTCCTAATGA TACTAAATAC TGAAATTCTG ATGGGAATTC TACTTTATAA GCTTCGCATA 5911 .......... .......... .......... .......... .......... .......... 212 AAGATTTAGT CGATTCTCTT AGGATTGTTT CTAAGTATTT ATACATGGTT TCAGCATCAG 5971 .......... .......... .......... .......... .......... .......... 212 TTTCTGTATA ATTTTTTATA ACATATGCTA CTACTATTCC TTTCCATAAA TCTATTACTG 6031 .......... .......... .......... .......... .......... .......... 212 AATTCCATCT TTGTGGATCA TGTGCTGCTA TATTTAAAAA TTTTCCTTTA CTTCCTCCTT 6091 .......... .......... .......... .......... .......... .......... 
212 CTTGGAGTAT TATTGGTTCT TCTATAGGCA TTCTTTTTCC CATGGGTCTA ACTTCTGTTA 6151 .......... .......... .......... .......... .......... .......... 212 GTGGAGTATC TCCTTGTCTA TATACCCTTC TTCGTGGTAT ATTTCCTGTG CTATTATCTG 6211 .......... .......... .......... .......... .......... .......... 212 CAGTATATCC TTGTTCATTT GTTCTTTGAA ATGGACTACT GGAACTTGCT GTTTCCATTA 6271 .......... .......... .......... .......... .......... .......... 212 ATTCTTTCTG ACTTAGTATA GATTCTAACT CAGATATTTC ACTATCATCA CCTTGTATTT 6331 .......... .......... .......... .......... .......... .......... 212 GTGTCATTAT ATTATTACTT TTTTCTAATT TATCTTGTAA ATCTTTTTCT AATCTATATC 6391 .......... .......... .......... .......... .......... .......... 212 CTTTATTATC AGTTCTTTGT TCACATATTT TAAAATGAGA TATTGTTTGT TCTAAATTTA 6451 .......... .......... .......... .......... .......... .......... 212 AGTCTTTCAA TTTACATCTT TTACATGTAA ATAATCTATT ACTATTGTTT CTACTTATTT 6511 || | |||||| || ||| ........AA CTATCATCTT TT-CAT.... .......... .......... .......... 229 CTTTAACTTT TTCTATAGCC AAATCTACTG CAAATTCTTC TTCTAATATT ATTTCCATGT 6571 .......... .......... .......... .......... .......... .......... 229 TCATTCCTTC TAAAGGCATT TCTTCTACAA CGCTTTCTTC TTCAAAGTCT TTTTGATCTT 6631 .......... .......... .......... .......... .......... .......... 229 CATAATTGTA ATCTGTAAAT CTAATTGAAG GTTTTCCTTT AGAGTTTGTA TATATTAAAT 6691 .......... .......... .......... .......... .......... .......... 229 GAGCATCTGG TGTTAATGCT TCTTTTTCTT TAAAATCTCC TAATTTTCAC TCTAATCCTA 6751 .......... .......... .......... .......... .......... .......... 229 CATATTCTTC AGAGTCTATT TTTATCGGTT TTATTAACTT TATCCCTTTA TTTCCCATTA 6811 .......... .......... .......... .......... .......... .......... 229 CTTCTACTAC GTCTTTTATT TGTAGTTTAA ATCTAGTATT ACTATTATCT GTCATTTTTT 6871 .......... .......... .......... .......... .......... .......... 229 CTAGAAATCC TACACACATT AATAAATTCT TACCACTATG CATTTCTTCA TATCCTTTAG 6931 .......... .......... .......... .......... .......... .......... 
229 TTTGTATTCT TATTCTTATT TGGGTTCCAA ATTCATTTAG ATTCATCATA AAATCTGGGC 6991 .......... .......... .......... .......... .......... .......... 229 TTATGTAGAA AATTCCTCCA TTATTAGTCA TATCTACTTC TGTTAATTAT ATAATTGTCT 7051 .......... .......... .......... .......... .......... .......... 229 TTTTTATGTC TGACCATCTA TTATCATATA TTGTAATTAA TGTCTTTGTT CCTAAGTTTT 7111 .......... .......... .......... .......... .......... .......... 229 TCCTCGTTAA TCCTTTAATT CCTATAACTA TTAGTCCTAT ATGCATTAGA TCTTTTTTAA 7171 .......... .......... .......... .......... .......... .......... 229 CGTATTTTAT TTCTTTTACA GATTCAGCGT TTATGAGTGG TAACGATACA GAGCCTTTTC 7231 ||||| .......... .......... .ATTCA.... .......... .......... .......... 234 CTAACTGTCA CGATCCAAAT CGGGCCGCGA CTAGCACCCA CACTTACCCT CCTATGTGAG 7291 .......... .......... .......... .......... .......... .......... 234 CGAACCAACC AATCCAAACC CCAACATTTT CAAACATAGT AACAGAATAT AATGCGGAAG 7351 .......... .......... .......... .......... .......... .......... 234 ACTTAAACTC ATTAATGAAA ATCAATTAAA TAACTTCTAA AAACTCAACA ACTATTATTA 7411 .......... .......... .......... .......... .......... .......... 234 TCCCCAAAAT CTGGAAGTCA TCATCACAAG AACATCTACT TCAAATTACT AAATCTAAGA 7471 .......... .......... .......... .......... .......... .......... 234 TTATCTAAGA AGCTAAAATA CATAAACAGC TAGTCCATGC CGGAACTTCA AGGCATCAAG 7531 .......... .......... .......... .......... .......... .......... 234 ACATGAAGAG GAGGATCCAG TCCAAGCTAG AAGCATTAGC TCACCCTGAA ATCCGGAGTA 7591 .......... .......... .......... .......... .......... .......... 234 ATGAAGACTG GCTAGATTTG CGGTTGAGTT GAAGACGACA GAACGTTTGC TGCACTCCAC 7651 .......... .......... .......... .......... .......... .......... 234 AAATAATCAA AAAGAAAACA TACAAGTAGG GGTCAGTACA AAACACAGGT ACTGAGTAGA 7711 .......... .......... .......... .......... .......... .......... 234 TATCATCGGC CAACTCAAAA TAGAAAACAG TATATATCAG ATAATATCAT AAAATCAACT 7771 .......... .......... .......... .......... .......... .......... 
234 ACAGTACTCA ACATGCGGCA TTTACAATTA CCATAACCCT TGGTCGCAAC ACCAAGCTCA 7831 .......... .......... .......... .......... .......... .......... 234 TCAATGAGGA CTCATGCCTC CCCATCATAC TCATTTGGGA ATTAAGTTCC TTAAATTGAG 7891 .......... .......... .......... .......... .......... .......... 234 TATATTAACA TATTTCAAGA TTCATTCTCT TTACTAATCC TGGTGTCAGA ACGTGACACC 7951 .......... .......... .......... .......... .......... .......... 234 CGATCCATAT ATACTATCCT GGTACCGGAA CGTGGCACCC GATCCATATT CTATCCTGGT 8011 .......... .......... .......... .......... .......... .......... 234 GTCGGAACGT GACACTCCGA TCCTCATATA CTATCCTGGT ACCGGAACGT GGCACCCGAT 8071 .......... .......... .......... .......... .......... .......... 234 CCATATTCTA TCCTGGTGTC GGAACGTGAC ACTCCGATCC TCATATACTA TCCTGGTACC 8131 .......... .......... .......... .......... .......... .......... 234 GGAACGTGAC ACCCGATCCC CTAATCTCAC TACTTTCGTT CATCAAGCCT TCTTGTATAC 8191 .......... .......... .......... .......... .......... .......... 234 TAAGGCATCA TCATTAACAA AGTAGATTAG GGTTTCTTTT TCAAGATTTA GAATTCAATA 8251 .......... .......... .......... .......... .......... .......... 234 GCTTCATCAT GCTTATCTCA TCACAATTAT ATAATCACAA TATGCAAACA CACAATTAAG 8311 .......... .......... .......... .......... .......... .......... 234 CATATAGAAG GGTTTACAAC ACTACCCAAT ACATATCATT CGATATTAAG AGTTTACTAC 8371 .......... .......... .......... .......... .......... .......... 234 GAATAGTGTA AAAACCATAA CCTACCTCCA TCGAAGATTA GTGATCAAGC AAGCAAATTC 8431 .......... .......... .......... .......... .......... .......... 234 CCCAAAGCTT TGTGTTTTCC TCTTCTCGTT CGATCCTCTC TCTCTTTTTG TTCTTTCTAT 8491 .......... .......... .......... .......... .......... .......... 234 TTTCTTTATT CAAACCCTCT TTCTTTTACC CTAATTAGCA TATAATTAAG AATAAAAGAT 8551 .......... .......... .......... .......... .......... .......... 234 GGCAATAATA ACCCACTAAT TTACTCAAGG TTACCTTTTT TAACCCCCAA GTAATTAGAC 8611 ||||| |||| .......... .......... .......... .......... TAACCTCCAA .......... 
244 TTATTAACAT TAACCCACTA ACTTTATAAT TAAAGCAGGA ATAGTAAAAA ACGTCCCTTA 8671 .......... .......... .......... .......... .......... .......... 244 AAACATTAAA GAAATCCGAC TCAGCCTGGG ATTATGCAGC CTGTGACGAC TCGTCGTGCC 8731 .......... .......... .......... .......... .......... .......... 244 TGCGACGGTC CGTCTTGCTG CTCCGTCACA GAGTTCAGAG ACTCAATTTC CCTTAAAGAG 8791 .......... .......... .......... .......... .......... .......... 244 TCTGTGACGG TCCGTCACGC CTGTGACGGT CCGTCCTGCC ATTCCGTTAC AAAGTTCAGA 8851 .......... .......... .......... .......... .......... .......... 244 GAGTCGATTT CAGTACCCAT TTTTCAGAAT TTCTAAGTGT TTTGAAACGA GACCCCTCGA 8911 .......... .......... .......... .......... .......... .......... 244 CGGTCCGTCG TGCCCATGAC GGTCCGTCGT GGGATCCGTC GTCTCAACCA TTTTTCCAGA 8971 .......... .......... .......... .......... .......... .......... 244 AATAACATTT GTTGCTCAAA ATGACTAAAC AGGTCGTTAC ACTAACACTG ATAAATGTTC 9031 | ||| | ||| ||| ||| .......... .......... .......... ..ATATTTAA ATTAA-ACT- TTAAGA.... 266 TTCTCTATAA TGTCTATATA GTTGAGATTT TGAATTTGTA TTGTATAAAA CTTTGATATT 9091 .......... .......... .......... .......... .......... .......... 266 CAATAAATTT TATTGATTTT GTTGAAAGAT TTGATATCCT TTTCTGTATC TATTATTTCT 9151 .......... .......... .......... .......... .......... .......... 266 CCTAATTGTT GATTATTCTC TTCCTTTGTC CTTTTATTTA ATATTTTTGA TAGAAAGTTA 9211 .......... .......... .......... .......... .......... .......... 266 TTACTTATCA TATTTTTGGA ATAGCTTGGT CTTGGTATTT CTTCTCTTGA TCTAGTTGAA 9271 | | |||||||| ||||| | |||| ||| | .......... .......... 
....AAT-CT TTTGGTATTC CTTCT-AT-- TCTATTTGTA 298 AAA 9274 || TAA 301 hqPGS_C06HBa0112G05.1-6+_SGN-E301194+ (5494 5667) ******************************************************************************** EST sequence 39 +strand 730 n (File: SGN-E546506+) 1 TTTTTTTTTT TTTTTTTTAA TAAAAACAAT TCAATACTAT TATTATTATC CCCAAAATCT 61 GGAAGTCATC ATCACAAGAA CATCTATCTC AAATTACTTA ACTAGGAATG TCTAAGAACA 121 AAATAACTAA AAAGCTAGTC CATGCCGGAA ATTCAAGGCA TCAAGACTTG AAGAAGAAGA 181 CCCAGTCCAA GCTAGACGCA TTAGCTCACC CTGAATTTTC CGATGAAGTG AAGACTGGCT 241 AGATCTACTG TTGAGTTGAA GTTGACGGAA CGTTTGCTGC ATTACACAAA TAACAAAGAG 301 GAAAACATGA AAGTAGGGGT CAGTACAACC ACACGTACTG AGTAGATATC ATCGGCCAAC 361 TCAAAATAGG GAACAGTATA TATCAATAAT AATGTAAATC AACTACAATA CTCAACATGT 421 AGCAATAACA CCATGAATTC ATCAATAACT ACAACCGAGT TCACACATGA GGACTCAAGC 481 CTCAATACCA TACTCATTTG GGAATTAAGT TCATTAGATT GAGTATATTC ATTATCTTTC 541 AAGATTCATT ATCTTTCTTC CTCTTGTGTC GGTACGTGAC ACTCCGATCC TCTATTTCTA 601 TCCTGGTGCC GGAACGTGGC ACTCCGATCC TCATTCTATC CTGGTACCGG AACGTGGCAC 661 CCGATCCATT TTCTATCCTG GTGTCGGAAC GTGACACTCC GATCCTCATA TTCTATCCTG 721 GTACCGGAAC Predicted gene structure (within gDNA segment 1092 to 8917): Exon 1 6350 6386 ( 37 n); cDNA 2 38 ( 37 n); score: 0.595 Intron 1 6387 7401 (1015 n); Pd: 0.990 (s: 0), Pa: 0.000 (s: 0.94) Exon 2 7402 7957 ( 556 n); cDNA 39 590 ( 552 n); score: 0.812 Intron 2 7958 7993 ( 36 n); Pd: 0.000 (s: 0.77), Pa: 0.000 (s: 0.88) Exon 3 7994 8136 ( 143 n); cDNA 591 730 ( 140 n); score: 0.944 MATCH C06HBa0112G05.1-6+ SGN-E546506+ 0.839 736 1.008 C PGS_C06HBa0112G05.1-6+_SGN-E546506+ (6350 6386,7402 7957,7994 8136) Alignment (genomic DNA sequence = upper lines): TTTTTTCTAA TTTATCTTGT AAATCTTTTT CTAATCTATA TCCTTTATTA TCAGTTCTTT 6409 |||||| | ||| | | | ||| || | | || TTTTTTTTTT TTTTTTTAAT AAAAACAATT CAATACT... .......... .......... 38 GTTCACATAT TTTAAAATGA GATATTGTTT GTTCTAAATT TAAGTCTTTC AATTTACATC 6469 .......... .......... .......... .......... .......... .......... 
38 TTTTACATGT AAATAATCTA TTACTATTGT TTCTACTTAT TTCTTTAACT TTTTCTATAG 6529 .......... .......... .......... .......... .......... .......... 38 CCAAATCTAC TGCAAATTCT TCTTCTAATA TTATTTCCAT GTTCATTCCT TCTAAAGGCA 6589 .......... .......... .......... .......... .......... .......... 38 TTTCTTCTAC AACGCTTTCT TCTTCAAAGT CTTTTTGATC TTCATAATTG TAATCTGTAA 6649 .......... .......... .......... .......... .......... .......... 38 ATCTAATTGA AGGTTTTCCT TTAGAGTTTG TATATATTAA ATGAGCATCT GGTGTTAATG 6709 .......... .......... .......... .......... .......... .......... 38 CTTCTTTTTC TTTAAAATCT CCTAATTTTC ACTCTAATCC TACATATTCT TCAGAGTCTA 6769 .......... .......... .......... .......... .......... .......... 38 TTTTTATCGG TTTTATTAAC TTTATCCCTT TATTTCCCAT TACTTCTACT ACGTCTTTTA 6829 .......... .......... .......... .......... .......... .......... 38 TTTGTAGTTT AAATCTAGTA TTACTATTAT CTGTCATTTT TTCTAGAAAT CCTACACACA 6889 .......... .......... .......... .......... .......... .......... 38 TTAATAAATT CTTACCACTA TGCATTTCTT CATATCCTTT AGTTTGTATT CTTATTCTTA 6949 .......... .......... .......... .......... .......... .......... 38 TTTGGGTTCC AAATTCATTT AGATTCATCA TAAAATCTGG GCTTATGTAG AAAATTCCTC 7009 .......... .......... .......... .......... .......... .......... 38 CATTATTAGT CATATCTACT TCTGTTAATT ATATAATTGT CTTTTTTATG TCTGACCATC 7069 .......... .......... .......... .......... .......... .......... 38 TATTATCATA TATTGTAATT AATGTCTTTG TTCCTAAGTT TTTCCTCGTT AATCCTTTAA 7129 .......... .......... .......... .......... .......... .......... 38 TTCCTATAAC TATTAGTCCT ATATGCATTA GATCTTTTTT AACGTATTTT ATTTCTTTTA 7189 .......... .......... .......... .......... .......... .......... 38 CAGATTCAGC GTTTATGAGT GGTAACGATA CAGAGCCTTT TCCTAACTGT CACGATCCAA 7249 .......... .......... .......... .......... .......... .......... 38 ATCGGGCCGC GACTAGCACC CACACTTACC CTCCTATGTG AGCGAACCAA CCAATCCAAA 7309 .......... .......... .......... .......... .......... .......... 
38 CCCCAACATT TTCAAACATA GTAACAGAAT ATAATGCGGA AGACTTAAAC TCATTAATGA 7369 .......... .......... .......... .......... .......... .......... 38 AAATCAATTA AATAACTTCT AAAAACTCAA CAACTATTAT TATCCCCAAA ATCTGGAAGT 7429 | |||||| |||||||||| |||||||||| .......... .......... .......... ..ATTATTAT TATCCCCAAA ATCTGGAAGT 66 CATCATCACA AGAACATCTA CTTCAAATTA CTAAATCTAA GATTATCTAA GAAGCTAAAA 7489 |||||||||| |||||||||| |||||||| || || ||| || | ||||| ||| | |||| CATCATCACA AGAACATCTA TCTCAAATTA CTTAA-CTAG GAATGTCTAA GAA-C-AAAA 123 TACATAAACA GCTAGTCCAT GCCGGAACTT CAAGGCATCA AGACATGAAG AGGAGGATCC 7549 || |||| | |||||||||| ||||||| || |||||||||| |||| ||||| | || || || TAACTAAAAA GCTAGTCCAT GCCGGAAATT CAAGGCATCA AGACTTGAAG AAGAAGACCC 183 AGTCCAAGCT AGAAGCATTA GCTCACCCTG AA--ATCCGG AGTAATGAAG ACTGGCTAGA 7607 |||||||||| ||| |||||| |||||||||| || |||| | | ||||| |||||||||| AGTCCAAGCT AGACGCATTA GCTCACCCTG AATTTTCCGA TGAAGTGAAG ACTGGCTAGA 243 TTTGCGGTTG AGTTGAAGAC GACAGAACGT TTGCTGCACT CCACAAATAA TCAAAAAGAA 7667 | | | |||| |||||||| ||| |||||| |||||||| | ||||||||| || | ||| TCTACTGTTG AGTTGAAGTT GACGGAACGT TTGCTGCATT ACACAAATAA CAAAGAGGAA 303 AACATACAAG TAGGGGTCAG TACAAAACAC AGGTACTGAG TAGATATCAT CGGCCAACTC 7727 ||||| ||| |||||||||| ||| || ||| | |||||||| |||||||||| |||||||||| AACATGAAAG TAGGGGTCAG TAC-AACCAC ACGTACTGAG TAGATATCAT CGGCCAACTC 362 AAAATAGAAA ACAGTATATA TCAGATAATA TCATAAAATC AACTACAGTA CTCAACATGC 7787 ||||||| | |||||||||| ||| |||||| | ||||| ||||||| || ||||||||| AAAATAGGGA ACAGTATATA TCA-ATAATA ATGT-AAATC AACTACAATA CTCAACATGT 420 GGCATTTACA ATTACCATAA CCCTTGGTCG CAAC-ACCAA GCTCATCAAT GAGGACTCAT 7846 ||| | || | | | | | | | | || ||| | | ||| || ||||||||| AGCAATAAC- ACCATGA-AT TCATCAATAA CTACAACCGA GTTCACACAT GAGGACTCAA 478 GCCTCCCCAT CATACTCATT TGGGAATTAA GTTCCTTAAA TTGAGTATAT T-AACATATT 7905 ||||| | |||||||||| |||||||||| |||| ||| | |||||||||| | | || || GCCTCAATAC CATACTCATT TGGGAATTAA GTTCATTAGA TTGAGTATAT TCATTATCTT 538 TCAAGATTCA TTCTCTTTAC TAATCCTGGT GTCAGAACGT GACAC-CCGA TCCATATATA 
7964 |||||||||| || ||||| | | || || ||| | |||| ||||| |||| ||| TCAAGATTCA TTATCTTT-C TTCCTCTTGT GTCGGTACGT GACACTCCGA TCC....... 590 CTATCCTGGT ACCGGAACGT GGCACCCGAT CCATATTCTA TCCTGGTGTC GGAACGTGAC 8024 | | || ||||| |||||||| | |||||||| | .......... .......... .........T CTAT-TTCTA TCCTGGTGCC GGAACGTGGC 620 ACTCCGATCC TCATATACTA TCCTGGTACC GGAACGTGGC ACCCGATCCA TATTCTATCC 8084 |||||||||| |||| | ||| |||||||||| |||||||||| |||||||||| | |||||||| ACTCCGATCC TCAT-T-CTA TCCTGGTACC GGAACGTGGC ACCCGATCCA TTTTCTATCC 678 TGGTGTCGGA ACGTGACACT CCGATCCTCA TATACTATCC TGGTACCGGA AC 8136 |||||||||| |||||||||| |||||||||| ||| |||||| |||||||||| || TGGTGTCGGA ACGTGACACT CCGATCCTCA TATTCTATCC TGGTACCGGA AC 730 hqPGS_C06HBa0112G05.1-6+_SGN-E546506+ (7402 7957,7994 8136) ******************************************************************************** EST sequence 43 -strand 774 n (File: SGN-E349977-) 1 AGTAGATATC ATCGCTAACT CAAAATAGGG AACAATATAT ATCAATAATA ATGTAAATCA 61 ACTACAATAC TCATCATGTA GCAATAGCAA TTTCTTNATC ATTAACAATT ACCGTCAAGT 121 TCACACATGA GGACTCAAGC CTCAATACCA TACTCATTTG GGAATTAAGT TCATTAGATT 181 GAGTATATTC ATTATCTTTC AAGATTCATT ATCTTTCTTC CTCTTGTGTC GGTACGTGAC 241 ACTCCGCTCC TCTATTTCTA TCCTGGTGCC GGAACGTGGC ACTCCGATCC TCATATTCAT 301 TCTATCCTGG TACCGGAACG TGGCACCCGA TCCTCATATT CTATCCTGGT GTCGGAACGT 361 AACACTCCGA TCCTCATATT CATTCTATCC TGGTACCGGA ACGTGGCACC CGATCCTCAT 421 ATTCTATCCT GGTGTCGGAA CGTGACACTC CGATCCTCAT ATTCTATCCT GGTGTCGGAA 481 CGTGACACTC CGATCCTCAT ATTCATTCTA TCCTGGTACC GAAACGTGGC ACCCGATCCC 541 CTAATTCATC AAGCCTTCTT CTACACTAAG GCATCATCAT TCTCATTATA TAATTTATCA 601 AGCCTTCTCT CATACTAAGG CCTCATCAAT CTTATTATAT AATATATCAA GTGAATTAGG 661 GTTCTTTCAA GATTTGGGAT TCAATAGCTT CATCATGCTT TGTTAATTCA TAACAATTTC 721 ATAATCATAA TCATGCAAGC ATACCAATAA GCATATAGAC AGGTTTACAA CATC Predicted gene structure (within gDNA segment 3515 to 9840): Exon 1 7706 7957 ( 252 n); cDNA 1 250 ( 250 n); score: 0.750 Intron 1 7958 7993 ( 36 n); Pd: 0.000 (s: 0.74), Pa: 0.000 (s: 0.81) Exon 2 7994 8150 ( 
157 n); cDNA 251 416 ( 166 n); score: 0.803 MATCH C06HBa0112G05.1-6+ SGN-E349977- 0.770 409 0.528 C PGS_C06HBa0112G05.1-6+_SGN-E349977- (7706 7957,7994 8150) Alignment (genomic DNA sequence = upper lines): AGTAGATATC ATCGGCCAAC TCAAAATAGA AAACAGTATA TATCAGATAA TATCATAAAA 7765 |||||||||| ||| || ||| ||||||||| |||| |||| ||||| |||| || | ||| AGTAGATATC ATC-GCTAAC TCAAAATAGG GAACAATATA TATCA-ATAA TAATGT-AAA 57 TCAACTACAG TACTCAACAT GCGGCATTTA CAATTACCAT AACCCTTGGT CGCAACACCA 7825 ||||||||| |||||| ||| | ||| | ||||| | | | | | | || TCAACTACAA TACTCATCAT GTAGCAATAG CAATTTCTTN ATCATTAACA ATTACCGTCA 117 AGCTCATCAA TGAGGACTCA TGCCTCCCCA TCATACTCAT TTGGGAATTA AGTTCCTTAA 7885 || ||| | |||||||||| ||||| | ||||||||| |||||||||| ||||| ||| AGTTCACACA TGAGGACTCA AGCCTCAATA CCATACTCAT TTGGGAATTA AGTTCATTAG 177 ATTGAGTATA TT-AACATAT TTCAAGATTC ATTCTCTTTA CTAATCCTGG TGTCAGAACG 7944 |||||||||| || | || | |||||||||| ||| ||||| || || | |||| | ||| ATTGAGTATA TTCATTATCT TTCAAGATTC ATTATCTTT- CTTCCTCTTG TGTCGGTACG 236 TGACAC-CCG ATCCATATAT ACTATCCTGG TACCGGAACG TGGCACCCGA TCCATATTCT 8003 |||||| ||| ||| || || |||| TGACACTCCG CTCC...... .......... .......... .......... 
TCTAT-TTCT 259 ATCCTGGTGT CGGAACGTGA CACTCCGATC CTCATA-T-A --CTATCCTG GTACCGGAAC 8059 ||||||||| ||||||||| |||||||||| |||||| | | |||||||| |||||||||| ATCCTGGTGC CGGAACGTGG CACTCCGATC CTCATATTCA TTCTATCCTG GTACCGGAAC 319 GTGGCACCCG AT-C-CATAT TCTATCCTGG TGTCGGAACG TGACACTCCG ATCCTCATA- 8116 |||||||||| || | ||||| |||||||||| |||||||||| | |||||||| ||||||||| GTGGCACCCG ATCCTCATAT TCTATCCTGG TGTCGGAACG TAACACTCCG ATCCTCATAT 379 T-A--CTATC CTGGTACCGG AACGTGACAC CCGATCC 8150 | | ||||| |||||||||| |||||| ||| ||||||| TCATTCTATC CTGGTACCGG AACGTGGCAC CCGATCC 416 hqPGS_C06HBa0112G05.1-6+_SGN-E349977- (7706 7957,7994 8150) ******************************************************************************** EST sequence 63 -strand 580 n (File: SGN-E356206-) 1 GAAAAGTAAA AGCGTCCCCN TACCGTCCCT TAAGACTCTA CTAGACTTGT TCTTGTGTGA 61 TGAGACCAAC GACCCTAATG CTCTGATACC AAGTTTTGTC ACGACCCAAA TCCGGGCCGC 121 CACTGGCACC CACACTTACC CTCCTATGTG AGCGAACCAA CCAATCTAAA CCTTAACATT 181 TCAATGTAAT AGCAACAGAA AGTAATGCGG AAGACTTAAA CTCATTAATA AAATCAATAA 241 CTACTATTAT TAAACATCTA TTATTCCCAA AACCTGGAAG TCATCATCAC AAGAACATCT 301 ACTTTAAACT ACTAATTCTA AGAGTTTCTA AGAAGCTAAA AAATTACATA AGAAGCTAGT 361 CCATGCCGGA AGTTCAAGGC ATCAAGACAT GAAGGAGAAG ATCCAGTCCA AGCTAGAAGC 421 GTTAGCTCAC CCTGAAGATC CGGTGTGACG AAGACTGGCT TGAGTTACTG TTGAGTCGAA 481 GATGACGGCA CGTTTGCTGC ACTCCACAAA TAACAAGAAG AAAAACATAA AAGTAGGGGT 541 CAGTACAAAA CACGGCTACT GAGTAGATAT CATCGGCCAA Predicted gene structure (within gDNA segment 5405 to 8324): Exon 1 7193 7203 ( 11 n); cDNA 86 96 ( 11 n); score: 0.636 Intron 1 7204 7236 ( 33 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.89) Exon 2 7237 7724 ( 488 n); cDNA 97 580 ( 484 n); score: 0.842 MATCH C06HBa0112G05.1-6+ SGN-E356206- 0.842 499 0.860 C PGS_C06HBa0112G05.1-6+_SGN-E356206- (7193 7203,7237 7724) Alignment (genomic DNA sequence = upper lines): ATTCAGCGTT TATGAGTGGT AACGATACAG AGCCTTTTCC TAACTGTCAC GATCCAAATC 7252 || | ||| | |||||| || ||||||| ATACCAAGTT T......... .......... .......... 
....TGTCAC GACCCAAATC 112 -GGGCCGCGA CTAGCACCCA CACTTACCCT CCTATGTGAG CGAACCAACC AATCCAAACC 7311 ||||||| | || ||||||| |||||||||| |||||||||| |||||||||| |||| ||||| CGGGCCGCCA CTGGCACCCA CACTTACCCT CCTATGTGAG CGAACCAACC AATCTAAACC 172 CCAACATTTT CA-AACATAG TAACAGAATA TAATGCGGAA GACTTAAACT CATTAATGAA 7370 ||||||| | |||| ||||||| |||||||||| |||||||||| ||||||| || TTAACATTTC AATGTAATAG CAACAGAAAG TAATGCGGAA GACTTAAACT CATTAAT-AA 231 AATCAATTAA ATAACTTCTA AAAACTCAAC AACTATTATT ATCCCCAAAA TCTGGAAGTC 7430 |||| | |||||| || | | | ||| | |||||| | | ||||||| ||||||||| AATC-----A ATAACTACT- ATTATTAAAC ATCTATTA-T -T-CCCAAAA CCTGGAAGTC 282 ATCATCACAA GAACATCTAC TTCAAATTAC TAAATCTAAG ATTATCTAAG AAGCT--AAA 7488 |||||||||| |||||||||| || ||| ||| ||| |||||| | | |||||| ||||| ||| ATCATCACAA GAACATCTAC TTTAAACTAC TAATTCTAAG AGTTTCTAAG AAGCTAAAAA 342 A-TACATAAA CAGCTAGTCC ATGCCGGAAC TTCAAGGCAT CAAGACATGA AGAGGAGGAT 7547 | ||||||| ||||||||| ||||||||| |||||||||| |||||||||| || || ||| ATTACATAAG AAGCTAGTCC ATGCCGGAAG TTCAAGGCAT CAAGACATGA AGGAGAAGAT 402 CCAGTCCAAG CTAGAAGCAT TAGCTCACCC TGAA-ATCCG GAGTAATGAA GACTGGCTAG 7606 |||||||||| |||||||| | |||||||||| |||| ||||| | || | ||| |||||||| | CCAGTCCAAG CTAGAAGCGT TAGCTCACCC TGAAGATCCG GTGTGACGAA GACTGGCTTG 462 ATTTGCGGTT GAGTTGAAGA CGACAGAACG TTTGCTGCAC TCCACAAATA ATCAAAAAGA 7666 | || | ||| |||| ||||| ||| | ||| |||||||||| |||||||||| | | || | AGTTACTGTT GAGTCGAAGA TGACGGCACG TTTGCTGCAC TCCACAAATA ACAAGAAGAA 522 AAACATACAA GTAGGGGTCA GTACAAAACA CAGGTACTGA GTAGATATCA TCGGCCAA 7724 ||||||| || |||||||||| |||||||||| | | |||||| |||||||||| |||||||| AAACATAAAA GTAGGGGTCA GTACAAAACA CGGCTACTGA GTAGATATCA TCGGCCAA 580 hqPGS_C06HBa0112G05.1-6+_SGN-E356206- (7237 7724) ******************************************************************************** EST sequence 65 -strand 655 n (File: SGN-E356696-) 1 CAATTGGACT CAAGTAGTAG CACGAAAGAA AGAATGAAAG AGTGAAGTTT TCCTAAAGTC 61 TTATAGCCTC TCAAGGAAAA GTAAAAGCGT CCCCCTACCG TTCCTTAAGA CTCTACTAGA 121 CTTGTTCTTG TGTGATGAGA 
CCAACGAACC TAATGCTCTG ATACCAAGTT TTGTCACGAC 181 CCAAATCCGG GCCGCCACTG GCACCCACAC TTACCCTCNT ATGTGAGCGA ACCAACCAAT 241 CTAAACCTTA ACATTTCAAT GTAATAGCAA CAGAAAGTAA TGCGGAAGAC TTAAACTCAT 301 TAATAAAATC AATAACTACT ATTATTAAAC ATCTATTATT CCCAAAACCT GGAAGTCATC 361 ATCACAAGAA CATCTACTTT AAACTACTAA TTCTAAGAGT TTCTAAGAAG CTAAAAAATT 421 ACATAAGAAG CTAGTCCATG CCGGAAGTTC AAGGCATCAA GACATGAAGG AGAAGATCCA 481 GTCCAAGCTA GAAGCGTTAG CTCACCCTGA AGATCCGGTG TGACGAAGAC TGGCTTGAGT 541 TACTGTTGAG TCGAAGATGA CGGCACGTTT GCTGCACTCC ACAAATAACA AGAAGAAAAA 601 CATAAAAGTA GGGGTCAGTA CAAAACACGG CTACTGAGTA GATATCATCG GCCAA Predicted gene structure (within gDNA segment 4655 to 8324): Exon 1 7193 7203 ( 11 n); cDNA 161 171 ( 11 n); score: 0.636 Intron 1 7204 7236 ( 33 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.87) Exon 2 7237 7724 ( 488 n); cDNA 172 655 ( 484 n); score: 0.840 MATCH C06HBa0112G05.1-6+ SGN-E356696- 0.840 499 0.762 C PGS_C06HBa0112G05.1-6+_SGN-E356696- (7193 7203,7237 7724) Alignment (genomic DNA sequence = upper lines): ATTCAGCGTT TATGAGTGGT AACGATACAG AGCCTTTTCC TAACTGTCAC GATCCAAATC 7252 || | ||| | |||||| || ||||||| ATACCAAGTT T......... .......... .......... 
....TGTCAC GACCCAAATC 187 -GGGCCGCGA CTAGCACCCA CACTTACCCT CCTATGTGAG CGAACCAACC AATCCAAACC 7311 ||||||| | || ||||||| |||||||||| | |||||||| |||||||||| |||| ||||| CGGGCCGCCA CTGGCACCCA CACTTACCCT CNTATGTGAG CGAACCAACC AATCTAAACC 247 CCAACATTTT CA-AACATAG TAACAGAATA TAATGCGGAA GACTTAAACT CATTAATGAA 7370 ||||||| | |||| ||||||| |||||||||| |||||||||| ||||||| || TTAACATTTC AATGTAATAG CAACAGAAAG TAATGCGGAA GACTTAAACT CATTAAT-AA 306 AATCAATTAA ATAACTTCTA AAAACTCAAC AACTATTATT ATCCCCAAAA TCTGGAAGTC 7430 |||| | |||||| || | | | ||| | |||||| | | ||||||| ||||||||| AATC-----A ATAACTACT- ATTATTAAAC ATCTATTA-T -T-CCCAAAA CCTGGAAGTC 357 ATCATCACAA GAACATCTAC TTCAAATTAC TAAATCTAAG ATTATCTAAG AAGCT--AAA 7488 |||||||||| |||||||||| || ||| ||| ||| |||||| | | |||||| ||||| ||| ATCATCACAA GAACATCTAC TTTAAACTAC TAATTCTAAG AGTTTCTAAG AAGCTAAAAA 417 A-TACATAAA CAGCTAGTCC ATGCCGGAAC TTCAAGGCAT CAAGACATGA AGAGGAGGAT 7547 | ||||||| ||||||||| ||||||||| |||||||||| |||||||||| || || ||| ATTACATAAG AAGCTAGTCC ATGCCGGAAG TTCAAGGCAT CAAGACATGA AGGAGAAGAT 477 CCAGTCCAAG CTAGAAGCAT TAGCTCACCC TGAA-ATCCG GAGTAATGAA GACTGGCTAG 7606 |||||||||| |||||||| | |||||||||| |||| ||||| | || | ||| |||||||| | CCAGTCCAAG CTAGAAGCGT TAGCTCACCC TGAAGATCCG GTGTGACGAA GACTGGCTTG 537 ATTTGCGGTT GAGTTGAAGA CGACAGAACG TTTGCTGCAC TCCACAAATA ATCAAAAAGA 7666 | || | ||| |||| ||||| ||| | ||| |||||||||| |||||||||| | | || | AGTTACTGTT GAGTCGAAGA TGACGGCACG TTTGCTGCAC TCCACAAATA ACAAGAAGAA 597 AAACATACAA GTAGGGGTCA GTACAAAACA CAGGTACTGA GTAGATATCA TCGGCCAA 7724 ||||||| || |||||||||| |||||||||| | | |||||| |||||||||| |||||||| AAACATAAAA GTAGGGGTCA GTACAAAACA CGGCTACTGA GTAGATATCA TCGGCCAA 655 hqPGS_C06HBa0112G05.1-6+_SGN-E356696- (7237 7724) ******************************************************************************** EST sequence 50 -strand 729 n (File: SGN-E351546-) 1 AGTCGTTGCT CTAGTTCTAC CCATCTGGCA AGAGAGTGAG NATGGTCAGA TACCAATTCG 61 TATCGCTTAG ATACCAATTG ACTCGAAGTA GTAGCACGAA AGAAAGAATG AAAGAGTGAA 121 GTTTTCCTAA AGTCTTATAG 
CCTCTCAAGG AAAAGTAAAA GCGTCCCCCT ACCGTTCCTT 181 AAGACTCTAC TAGACTTGTT CTTGTGTGAT GAGACCAACG AACCTAATGC TCTGATACCA 241 AGTTTTGTCA CGACCCAAAT CCGGGCCGCC ACTGGCACCC ACACTTACCC TCCTATGTGA 301 GCGAACCAAC CAATCTAAAC CTTAACATTT CAATGTAATA GCAACAGAAA GTAATGCGGA 361 AGACTTAAAC TCATTAATAA AATCAATAAC TACTATTATT AAACATCTAT TATTCCCAAA 421 ACCTGGAAGT CATCATCACA AGAACATCTA CTTTAAACTA CTAATTCTAA GAGTTTCTAA 481 GAAGCTAAAA AATTACATAA GAAGCTAGTC CATGCCGGAA GTTCAAGGCA TCAAGACATG 541 AAGGAGAAGA TCCAGTCCAA GCTAGAAGCG TTAGCTCACC CTGAAGATCC GGTGTGACGA 601 AGACTGGCTT GAGTTACTGT TGAGTCGAAG ATGACGGCAC GTTTGCTGCA CTCCACAAAT 661 ACCAAGAAGA AAAACATAAA AGTAGGGGTC AGTACAAAAC ACGGCTACTG AGTAGATATC 721 ATCGGCCAA Predicted gene structure (within gDNA segment 3915 to 8324): Exon 1 7193 7203 ( 11 n); cDNA 235 245 ( 11 n); score: 0.636 Intron 1 7204 7236 ( 33 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.89) Exon 2 7237 7724 ( 488 n); cDNA 246 729 ( 484 n); score: 0.840 MATCH C06HBa0112G05.1-6+ SGN-E351546- 0.840 499 0.684 C PGS_C06HBa0112G05.1-6+_SGN-E351546- (7193 7203,7237 7724) Alignment (genomic DNA sequence = upper lines): ATTCAGCGTT TATGAGTGGT AACGATACAG AGCCTTTTCC TAACTGTCAC GATCCAAAT- 7251 || | ||| | |||||| || |||||| ATACCAAGTT T......... .......... .......... 
....TGTCAC GACCCAAATC 261 CGGGCCGCGA CTAGCACCCA CACTTACCCT CCTATGTGAG CGAACCAACC AATCCAAACC 7311 |||||||| | || ||||||| |||||||||| |||||||||| |||||||||| |||| ||||| CGGGCCGCCA CTGGCACCCA CACTTACCCT CCTATGTGAG CGAACCAACC AATCTAAACC 321 CCAACATTTT CA-AACATAG TAACAGAATA TAATGCGGAA GACTTAAACT CATTAATGAA 7370 ||||||| | |||| ||||||| |||||||||| |||||||||| ||||||| || TTAACATTTC AATGTAATAG CAACAGAAAG TAATGCGGAA GACTTAAACT CATTAAT-AA 380 AATCAATTAA ATAACTTCTA AAAACTCAAC AACTATTATT ATCCCCAAAA TCTGGAAGTC 7430 |||| | |||||| || | | | ||| | |||||| | | ||||||| ||||||||| AATC-----A ATAACTACT- ATTATTAAAC ATCTATTA-T -T-CCCAAAA CCTGGAAGTC 431 ATCATCACAA GAACATCTAC TTCAAATTAC TAAATCTAAG ATTATCTAAG AAGCT--AAA 7488 |||||||||| |||||||||| || ||| ||| ||| |||||| | | |||||| ||||| ||| ATCATCACAA GAACATCTAC TTTAAACTAC TAATTCTAAG AGTTTCTAAG AAGCTAAAAA 491 A-TACATAAA CAGCTAGTCC ATGCCGGAAC TTCAAGGCAT CAAGACATGA AGAGGAGGAT 7547 | ||||||| ||||||||| ||||||||| |||||||||| |||||||||| || || ||| ATTACATAAG AAGCTAGTCC ATGCCGGAAG TTCAAGGCAT CAAGACATGA AGGAGAAGAT 551 CCAGTCCAAG CTAGAAGCAT TAGCTCACCC TGAA-ATCCG GAGTAATGAA GACTGGCTAG 7606 |||||||||| |||||||| | |||||||||| |||| ||||| | || | ||| |||||||| | CCAGTCCAAG CTAGAAGCGT TAGCTCACCC TGAAGATCCG GTGTGACGAA GACTGGCTTG 611 ATTTGCGGTT GAGTTGAAGA CGACAGAACG TTTGCTGCAC TCCACAAATA ATCAAAAAGA 7666 | || | ||| |||| ||||| ||| | ||| |||||||||| |||||||||| | || | AGTTACTGTT GAGTCGAAGA TGACGGCACG TTTGCTGCAC TCCACAAATA CCAAGAAGAA 671 AAACATACAA GTAGGGGTCA GTACAAAACA CAGGTACTGA GTAGATATCA TCGGCCAA 7724 ||||||| || |||||||||| |||||||||| | | |||||| |||||||||| |||||||| AAACATAAAA GTAGGGGTCA GTACAAAACA CGGCTACTGA GTAGATATCA TCGGCCAA 729 hqPGS_C06HBa0112G05.1-6+_SGN-E351546- (7237 7724) ******************************************************************************** EST sequence 33 +strand 710 n (File: SGN-E392027+) 1 CCACAGCCCC AGTGGCTGGC TCAGTCGCAC CCTGTCCCGC CGGTGCTGGT GTTGATGCTG 61 GCGTAGTCGT TGCTCTAGTT CTAACCATCT GCGAAATAGA GTGAAGATGG TCAGATACCA 121 ATTTGTATCA CCTAGATACC 
AATTGGACCC AAGTAATAGC ACGAAAGAAG AAAGAATGGA 181 ATTTTCCAAA AGTCTTATAG CCTCTCAAGG AAAAGTAAAG GCATCCCCCT ACCGTTCCTT 241 AAGACTCTAC TAGACTCGTT CTTGTGTGAT GAGACCAACG AACCTAATGC TCTGATACCA 301 AGTTTGTCAC GACCAAAACC GGGTTGCGAC TGGCACCCAC ACTTACCCTC CTATGTGAGC 361 GAACCAACCA ATCTAACCTT AACATTTCAA TATAATATCA ACAGAAAGTA ATGTGGAAGA 421 CTTAAACTCA TTAAATACAG ACCAATTCAT TAACTTCTAA AATTCAACAT CTATTATTCC 481 CCAAAATCTG GAAGTCATCA CCACAAGAAC ATCTACGATC AAATGACTAA ACTAAGAGTA 541 GTCTAAAAGC TAAAAATACA TAAGAAGCTA GTCCATGCCG GAAGTTCAAG GCATCAAGAC 601 TTGAAGAAGA AGATCCAGTC CAAGCTAGAA GCATTAGCTC ACCCTGAATT TCCGATGTAG 661 TAAGACTGGC TTGAATTACT GTTGAGTTGA ACACGATGGC ACGTTTGCTG Predicted gene structure (within gDNA segment 3345 to 8811): Exon 1 7237 7643 ( 407 n); cDNA 305 710 ( 406 n); score: 0.827 MATCH C06HBa0112G05.1-6+ SGN-E392027+ 0.827 407 0.573 C PGS_C06HBa0112G05.1-6+_SGN-E392027+ (7237 7643) Alignment (genomic DNA sequence = upper lines): TGTCACGATC CAAATCGGGC CGCGACTAGC ACCCACACTT ACCCTCCTAT GTGAGCGAAC 7296 |||||||| | ||| |||| |||||| || |||||||||| |||||||||| |||||||||| TGTCACGACC AAAACCGGGT TGCGACTGGC ACCCACACTT ACCCTCCTAT GTGAGCGAAC 364 CAACCAATCC AAACCCCAAC ATTTTCA-AA CATAGTAACA GAATATAATG CGGAAGACTT 7355 |||||||| | |||| ||| |||| | | ||| |||| ||| ||||| ||||||||| CAACCAAT-C TAACCTTAAC ATTTCAATAT AATATCAACA GAAAGTAATG TGGAAGACTT 423 AAACTCATTA A-TGAAAATC AATTAAATAA CTTCTAAAAA CTCAACAACT ATTATTATCC 7414 |||||||||| | | | | | |||| | ||| ||||| |||| |||||| || |||| | ||| AAACTCATTA AATACAGACC AATTCATTAA CTTCT-AAAA TTCAACATCT ATTA-T-TCC 480 CCAAAATCTG GAAGTCATCA TCACAAGAAC ATCTAC-TTC AAATTACTAA ATCTAAGATT 7473 |||||||||| |||||||||| ||||||||| |||||| || |||| ||||| | |||||| | CCAAAATCTG GAAGTCATCA CCACAAGAAC ATCTACGATC AAATGACTAA A-CTAAGAGT 539 A-TCTAAGAA GCT-AAAATA CATAAACAGC TAGTCCATGC CGGAACTTCA AGGCATCAAG 7531 | ||||| || ||| |||||| ||||| ||| |||||||||| ||||| |||| |||||||||| AGTCTAA-AA GCTAAAAATA CATAAGAAGC TAGTCCATGC CGGAAGTTCA AGGCATCAAG 598 ACATGAAGAG GAGGATCCAG TCCAAGCTAG 
AAGCATTAGC TCACCCTGAA ATCCGGAGTA 7591 || |||||| || ||||||| |||||||||| |||||||||| |||||||||| | | || ACTTGAAGAA GAAGATCCAG TCCAAGCTAG AAGCATTAGC TCACCCTGAA TTTCCGATGT 658 ATGAAGACTG GCTAGATTTG CGGTTGAGTT GAAGACGACA GAACGTTTGC TG 7643 | ||||||| ||| || || | |||||||| ||| |||| | |||||||| || AGTAAGACTG GCTTGAATTA CTGTTGAGTT GAACACGATG GCACGTTTGC TG 710 hqPGS_C06HBa0112G05.1-6+_SGN-E392027+ (7237 7643) ******************************************************************************** EST sequence 21 +strand 840 n (File: SGN-E542084+) 1 TTTTTTTTTT TAGGGGAAAA TTTCTTACTT CTATAAATGT CACGACCCAA ATCGGATCGC 61 GACTGGCACC CACACTTACC CTGCTATGTG AGCGAACCAA CCAATCCAAA CCTTAACATT 121 TCAATGTAAT ATCAACATAA AGTAATGCGG AAGACTTAAA CTTATTAATG AAAACCAATT 181 CAATAACTAT TATTTCCCAA AATCTGGAAG TCATCATCAT AAGAACATCT ACTTCAAATT 241 ACTAAATCTA AGAGTTTCTA AGAAGCTAAA AAATACATAA AAGCTAGTCC ATGCCGGAAC 301 TTCAAGACAT CAAGACATGA AGAGGAAGAT CCAGTCCAAT CTAGAAAGCA TTAGCTCACC 361 CTGATATCCG AAGTAATGAA GACTGGCTAG AGTTACTGTT GAGTCGAAGA TGACGGCACG 421 TTTGCTAAAA TCAGTGGACG GAGGAGAAGG GAAAGCACAC CGGGAATGAG AAGAAGCTGA 481 AGGAGGAACC AAAGAGGAAT CCCATTGCAA AGTAAATGAG AGTGTAAGCT AGCAGACGCG 541 ATGGAAGAGC TTACGCAGAA ATAACACTCT CATTTGGTGA TTTAGTTTGG AGATCATCTG 601 AGACCTTCGT GTTGGACAAC ATCATCCATG AAGATGTCAT TAGAAAAGTT AGATGCTTTA 661 TATACATGTT GATAGTTCCT GACTACTCTA TTTCTTTTTC AGAAAGCCCC GAAATTTCTC 721 AGATGATAAA TGCTGTCTGT TTTGGAAAAC CATCTCTATG CAAAGATGAT GTTTGCTGCA 781 TTGAGGTGTC AATATTGGGA ATTTCAAGAA AATTATGCCT TGTAGAATAT GTACAGCAAC Predicted gene structure (within gDNA segment 6015 to 9840): Exon 1 7237 7642 ( 406 n); cDNA 38 426 ( 389 n); score: 0.840 PPA cDNA 11 1 MATCH C06HBa0112G05.1-6+ SGN-E542084+ 0.840 406 0.483 C PGS_C06HBa0112G05.1-6+_SGN-E542084+ (7237 7642) Alignment (genomic DNA sequence = upper lines): TGTCACGATC CAAATCGGGC CGCGACTAGC ACCCACACTT ACCCTCCTAT GTGAGCGAAC 7296 |||||||| | |||||||| ||||||| || |||||||||| ||||| |||| |||||||||| TGTCACGACC CAAATCGGAT CGCGACTGGC ACCCACACTT ACCCTGCTAT GTGAGCGAAC 97 
CAACCAATCC AAACCCCAAC ATTTTCAAAC -ATAGTAACA GAATATAATG CGGAAGACTT 7355 |||||||||| ||||| ||| |||| | ||| |||| || ||||| |||||||||| CAACCAATCC AAACCTTAAC ATTTCAATGT AATATCAACA TAAAGTAATG CGGAAGACTT 157 AAACTCATTA ATGAAAATCA ATTAAATAAC TTCTAAAAAC TCAACAACTA TTATTATCCC 7415 ||||| |||| ||||||| || | ||| || | ||| || ||||| ||| AAACTTATTA ATGAAAACCA A--------- TTC----AA- T-AAC---TA TTATT-TCC- 197 CAAAATCTGG AAGTCATCAT CACAAGAACA TCTACTTCAA ATTACTAAAT CTAAGATTAT 7475 |||||||||| |||||||||| || ||||||| |||||||||| |||||||||| |||||| | | CAAAATCTGG AAGTCATCAT CATAAGAACA TCTACTTCAA ATTACTAAAT CTAAGAGTTT 257 CTAAGAAGCT --AAAATACA TAAACAGCTA GTCCATGCCG GAACTTCAAG GCATCAAGAC 7533 |||||||||| |||||||| |||| ||||| |||||||||| |||||||||| ||||||||| CTAAGAAGCT AAAAAATACA TAAA-AGCTA GTCCATGCCG GAACTTCAAG ACATCAAGAC 316 ATGAAGAGGA GGATCCAGTC CAAGCTAG-A AGCATTAGCT CACCCTGAAA TCCGGAGTAA 7592 |||||||||| ||||||||| ||| |||| | |||||||||| |||||||| | |||| ||||| ATGAAGAGGA AGATCCAGTC CAATCTAGAA AGCATTAGCT CACCCTGATA TCCGAAGTAA 376 TGAAGACTGG CTAGATTTGC GGTTGAGTTG AAGACGACAG AACGTTTGCT 7642 |||||||||| ||||| || | ||||||| | |||| ||| | ||||||||| TGAAGACTGG CTAGAGTTAC TGTTGAGTCG AAGATGACGG CACGTTTGCT 426 hqPGS_C06HBa0112G05.1-6+_SGN-E542084+ (7237 7642) ******************************************************************************** EST sequence 2 +strand 679 n (File: SGN-E370357+) 1 TTTTTTTTTT CTTACAATTA TATTATGAAT TCGATAATCT TTAATGTCAC GACCCAAATC 61 GAGCCGCAAG TGGCACCCAC ACTTACCCTC CTATGTGAGC GAACCAACCA ATACAAAATC 121 CAACATTTCA ATATAATGAC GGAATATAAT GCGGAAGACT TAAACTCATT AATGAAAATC 181 AATTAAATAA CTTCTAAAAA CTCAACAACT ATTATTATCC CCAAAATCTG GAAGTCATCA 241 TCACAAGAAC ATCTATCCTC AAATTACTAA TTCTAAGAGT ATCTAGAAAG CTAGAATAAC 301 TAAAAAGCTA GTCCATGCCG GAACTTCAAG GCATCAAGAC ATGAAGAAGA AGATCCAGTC 361 CAAGCTAGAA GCGTTAGCTC ACACTGAAAT CCGGTATAAT GAAGACTGGC TAGAGTTGCG 421 GTTGAGTTGA AGACGACGGT ACGTTTGCTT TATTCGAGTG TCAATTAATC ATTCGGCTGT 481 CACCCAAATA TTATTGATTG ATTACACCTC TGCCATTTGT AAAATTTTTC AAATTTGCCT 541 
ACGGATGCAG AATTTTCCTC GAATTTCTGA TGTGTTTTCT TGTAAATAGT GGCCATTTGT 601 GTAAGTAAAT GCCCATTTCT CCTCCTACAA AGTCCAATTC CATTTTTCCC CCAATCCACC 661 ATGGCAACAC CACCTCCAA Predicted gene structure (within gDNA segment 5945 to 9840): Exon 1 7237 7642 ( 406 n); cDNA 45 449 ( 405 n); score: 0.915 PPA cDNA 13 1 MATCH C06HBa0112G05.1-6+ SGN-E370357+ 0.915 406 0.598 C PGS_C06HBa0112G05.1-6+_SGN-E370357+ (7237 7642) Alignment (genomic DNA sequence = upper lines): TGTCACGATC CAAATCGGGC CGCGACTAGC ACCCACACTT ACCCTCCTAT GTGAGCGAAC 7296 |||||||| | ||||||| || ||| | | || |||||||||| |||||||||| |||||||||| TGTCACGACC CAAATCGAGC CGCAAGTGGC ACCCACACTT ACCCTCCTAT GTGAGCGAAC 104 CAACCAATCC AAACCCCAAC ATTTTCAAAC ATAGTAACAG AATATAATGC GGAAGACTTA 7356 |||||||| | ||| ||||| | |||| || ||| | || | |||||||||| |||||||||| CAACCAATAC AAAATCCAAC A-TTTC-AAT ATAATGACGG AATATAATGC GGAAGACTTA 162 AACTCATTAA TGAAAATCAA TTAAATAACT TCTAAAAACT CAACAACTAT TATTATCCCC 7416 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AACTCATTAA TGAAAATCAA TTAAATAACT TCTAAAAACT CAACAACTAT TATTATCCCC 222 AAAATCTGGA AGTCATCATC ACAAGAACAT CTA-CTTCAA ATTACTAAAT CTAAGATTAT 7475 |||||||||| |||||||||| |||||||||| ||| | |||| |||||||| | |||||| ||| AAAATCTGGA AGTCATCATC ACAAGAACAT CTATCCTCAA ATTACTAATT CTAAGAGTAT 282 CTAAGAAGCT AAAATACATA AACAGCTAGT CCATGCCGGA ACTTCAAGGC ATCAAGACAT 7535 ||| ||||| | |||| || || ||||||| |||||||||| |||||||||| |||||||||| CTAGAAAGCT AGAATAACTA AAAAGCTAGT CCATGCCGGA ACTTCAAGGC ATCAAGACAT 342 GAAGAGGAGG ATCCAGTCCA AGCTAGAAGC ATTAGCTCAC CCTGAAATCC GGAGTAATGA 7595 ||||| || | |||||||||| |||||||||| ||||||||| ||||||||| || |||||| GAAGAAGAAG ATCCAGTCCA AGCTAGAAGC GTTAGCTCAC ACTGAAATCC GGTATAATGA 402 AGACTGGCTA GATTTGCGGT TGAGTTGAAG ACGACAGAAC GTTTGCT 7642 |||||||||| || ||||||| |||||||||| ||||| | || ||||||| AGACTGGCTA GAGTTGCGGT TGAGTTGAAG ACGACGGTAC GTTTGCT 449 hqPGS_C06HBa0112G05.1-6+_SGN-E370357+ (7237 7642) ******************************************************************************** EST sequence 47 
-strand 299 n (File: SGN-E373117-) 1 TTTTTTTTTT TCTAAAAACT CAACAACTAT TATTATCCCC AAAATCTGGA AGTCATCATC 61 ACAAGAACAT CTACTTCAAA TTACTAAATC TAAGAGTATC TAAGAAGCTA AAATACATAA 121 ACAGCTAGTC CATGCCGGAA CTTCAAGGCA TCAAGACATG AAGAAGAAGA TCCAGTCCAA 181 GCTAGAAGCG TTAGCTCACC CTGAAATCCG ATGTAATGAA GACTGGCTAG AGTTGCGGTT 241 GAGTTGAAGA CGACGGTACG TTTGCCAAAA TTACGACAGT ATTTGGACAA GCTAGAAGA Predicted gene structure (within gDNA segment 6686 to 8680): Exon 1 7377 7639 ( 263 n); cDNA 1 263 ( 263 n); score: 0.943 MATCH C06HBa0112G05.1-6+ SGN-E373117- 0.943 263 0.880 C PGS_C06HBa0112G05.1-6+_SGN-E373117- (7377 7639) Alignment (genomic DNA sequence = upper lines): TTAAATAACT TCTAAAAACT CAACAACTAT TATTATCCCC AAAATCTGGA AGTCATCATC 7436 || | | |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTTTTTTTT TCTAAAAACT CAACAACTAT TATTATCCCC AAAATCTGGA AGTCATCATC 60 ACAAGAACAT CTACTTCAAA TTACTAAATC TAAGATTATC TAAGAAGCTA AAATACATAA 7496 |||||||||| |||||||||| |||||||||| ||||| |||| |||||||||| |||||||||| ACAAGAACAT CTACTTCAAA TTACTAAATC TAAGAGTATC TAAGAAGCTA AAATACATAA 120 ACAGCTAGTC CATGCCGGAA CTTCAAGGCA TCAAGACATG AAGAGGAGGA TCCAGTCCAA 7556 |||||||||| |||||||||| |||||||||| |||||||||| |||| || || |||||||||| ACAGCTAGTC CATGCCGGAA CTTCAAGGCA TCAAGACATG AAGAAGAAGA TCCAGTCCAA 180 GCTAGAAGCA TTAGCTCACC CTGAAATCCG GAGTAATGAA GACTGGCTAG ATTTGCGGTT 7616 ||||||||| |||||||||| |||||||||| |||||||| |||||||||| | |||||||| GCTAGAAGCG TTAGCTCACC CTGAAATCCG ATGTAATGAA GACTGGCTAG AGTTGCGGTT 240 GAGTTGAAGA CGACAGAACG TTT 7639 |||||||||| |||| | ||| ||| GAGTTGAAGA CGACGGTACG TTT 263 hqPGS_C06HBa0112G05.1-6+_SGN-E373117- (7377 7639) ******************************************************************************** EST sequence 53 -strand 219 n (File: SGN-E298638-) 1 TTTTTTTTTT TCTAAAAACT CAACAACTAT TATTATCCCC AAAATCTGGC AGTCATCATC 61 ACAAGAACAT GTACTTCAAA TTACTAAATC TAAGAGTATC TAAGAAGCTA AAATACATAA 121 ACAGCTAGTC CATGCCGGAA CTTCAAGGCA TCAAGACATG AAGAAGAAGA TCCAGTCCAA 181 GCTAGAAGCG TTAGCTCACC CTGAAATCCG ATGTAATGA 
Predicted gene structure (within gDNA segment 6686 to 8276): Exon 1 7377 7595 ( 219 n); cDNA 1 219 ( 219 n); score: 0.936 MATCH C06HBa0112G05.1-6+ SGN-E298638- 0.936 219 1.000 C PGS_C06HBa0112G05.1-6+_SGN-E298638- (7377 7595) Alignment (genomic DNA sequence = upper lines): TTAAATAACT TCTAAAAACT CAACAACTAT TATTATCCCC AAAATCTGGA AGTCATCATC 7436 || | | |||||||||| |||||||||| |||||||||| ||||||||| |||||||||| TTTTTTTTTT TCTAAAAACT CAACAACTAT TATTATCCCC AAAATCTGGC AGTCATCATC 60 ACAAGAACAT CTACTTCAAA TTACTAAATC TAAGATTATC TAAGAAGCTA AAATACATAA 7496 |||||||||| ||||||||| |||||||||| ||||| |||| |||||||||| |||||||||| ACAAGAACAT GTACTTCAAA TTACTAAATC TAAGAGTATC TAAGAAGCTA AAATACATAA 120 ACAGCTAGTC CATGCCGGAA CTTCAAGGCA TCAAGACATG AAGAGGAGGA TCCAGTCCAA 7556 |||||||||| |||||||||| |||||||||| |||||||||| |||| || || |||||||||| ACAGCTAGTC CATGCCGGAA CTTCAAGGCA TCAAGACATG AAGAAGAAGA TCCAGTCCAA 180 GCTAGAAGCA TTAGCTCACC CTGAAATCCG GAGTAATGA 7595 ||||||||| |||||||||| |||||||||| ||||||| GCTAGAAGCG TTAGCTCACC CTGAAATCCG ATGTAATGA 219 hqPGS_C06HBa0112G05.1-6+_SGN-E298638- (7377 7595) ******************************************************************************** EST sequence 49 -strand 402 n (File: SGN-E352844-) 1 TTTTTTTTAT AAAAACCAAT TCAATAACTA TTATTTCCCA AAATCTGGAA GTTATCATCA 61 CAAGAACATC TACTTCGAAT TACTAAATCT AAGAGTATCT AAGAAGCTAA AATACATAAA 121 CAGCTAGTCC ATGCCGGAAC TTCAAGGCAT CAAGACATGA AGAAGAAGAT CCAGTCCAAG 181 CTAGAAGCTT TGTTTTATCG AAAAAAGGTG ATTTTTCGAA AAGAGTTTGT TTTATTTTAA 241 AGTATTTTTC GACTTTAGGA GTCGCCACTT AATTTTTAAG AAAAATCAAG AAAACTCATT 301 CTCAAAACAA TTTAAACAGA AAAGTCGTTT TGAAAATATT TTTTAGGATT CGGGATTCTT 361 ATTAGCGTCT TAGGAAGGTG TTTAAGGCAC CTAAGACACT CC Predicted gene structure (within gDNA segment 6444 to 9840): Exon 1 7377 7569 ( 193 n); cDNA 2 192 ( 191 n); score: 0.886 Intron 1 7570 9066 (1497 n); Pd: 0.000 (s: 0.92), Pa: 0.000 (s: 0.60) Exon 2 9067 9124 ( 58 n); cDNA 193 246 ( 54 n); score: 0.603 Intron 2 9125 9235 ( 111 n); Pd: 0.990 (s: 0.58), Pa: 0.366 (s: 0) Exon 3 
9236 9274 ( 39 n); cDNA 247 285 ( 39 n); score: 0.538 MATCH C06HBa0112G05.1-6+ SGN-E352844- 0.821 290 0.721 C PGS_C06HBa0112G05.1-6+_SGN-E352844- (7377 7569,9067 9124,9236 9274) Alignment (genomic DNA sequence = upper lines): TTAAATAACT TCTAAAAACT CAACAACTAT TATTATCCCC AAAATCTGGA AGTCATCATC 7436 || | | | || | ||| |||||| |||| | ||| |||||||||| ||| |||||| TTTTTTTATA AAAACCAATT CAATAACTAT TATT-T-CCC AAAATCTGGA AGTTATCATC 59 ACAAGAACAT CTACTTCAAA TTACTAAATC TAAGATTATC TAAGAAGCTA AAATACATAA 7496 |||||||||| ||||||| || |||||||||| ||||| |||| |||||||||| |||||||||| ACAAGAACAT CTACTTCGAA TTACTAAATC TAAGAGTATC TAAGAAGCTA AAATACATAA 119 ACAGCTAGTC CATGCCGGAA CTTCAAGGCA TCAAGACATG AAGAGGAGGA TCCAGTCCAA 7556 |||||||||| |||||||||| |||||||||| |||||||||| |||| || || |||||||||| ACAGCTAGTC CATGCCGGAA CTTCAAGGCA TCAAGACATG AAGAAGAAGA TCCAGTCCAA 179 GCTAGAAGCA TTAGCTCACC CTGAAATCCG GAGTAATGAA GACTGGCTAG ATTTGCGGTT 7616 ||||||||| || GCTAGAAGCT TTG....... .......... .......... .......... .......... 192 GAGTTGAAGA CGACAGAACG TTTGCTGCAC TCCACAAATA ATCAAAAAGA AAACATACAA 7676 .......... .......... .......... .......... .......... .......... 192 GTAGGGGTCA GTACAAAACA CAGGTACTGA GTAGATATCA TCGGCCAACT CAAAATAGAA 7736 .......... .......... .......... .......... .......... .......... 192 AACAGTATAT ATCAGATAAT ATCATAAAAT CAACTACAGT ACTCAACATG CGGCATTTAC 7796 .......... .......... .......... .......... .......... .......... 192 AATTACCATA ACCCTTGGTC GCAACACCAA GCTCATCAAT GAGGACTCAT GCCTCCCCAT 7856 .......... .......... .......... .......... .......... .......... 192 CATACTCATT TGGGAATTAA GTTCCTTAAA TTGAGTATAT TAACATATTT CAAGATTCAT 7916 .......... .......... .......... .......... .......... .......... 192 TCTCTTTACT AATCCTGGTG TCAGAACGTG ACACCCGATC CATATATACT ATCCTGGTAC 7976 .......... .......... .......... .......... .......... .......... 192 CGGAACGTGG CACCCGATCC ATATTCTATC CTGGTGTCGG AACGTGACAC TCCGATCCTC 8036 .......... .......... .......... .......... .......... .......... 
192 ATATACTATC CTGGTACCGG AACGTGGCAC CCGATCCATA TTCTATCCTG GTGTCGGAAC 8096 .......... .......... .......... .......... .......... .......... 192 GTGACACTCC GATCCTCATA TACTATCCTG GTACCGGAAC GTGACACCCG ATCCCCTAAT 8156 .......... .......... .......... .......... .......... .......... 192 CTCACTACTT TCGTTCATCA AGCCTTCTTG TATACTAAGG CATCATCATT AACAAAGTAG 8216 .......... .......... .......... .......... .......... .......... 192 ATTAGGGTTT CTTTTTCAAG ATTTAGAATT CAATAGCTTC ATCATGCTTA TCTCATCACA 8276 .......... .......... .......... .......... .......... .......... 192 ATTATATAAT CACAATATGC AAACACACAA TTAAGCATAT AGAAGGGTTT ACAACACTAC 8336 .......... .......... .......... .......... .......... .......... 192 CCAATACATA TCATTCGATA TTAAGAGTTT ACTACGAATA GTGTAAAAAC CATAACCTAC 8396 .......... .......... .......... .......... .......... .......... 192 CTCCATCGAA GATTAGTGAT CAAGCAAGCA AATTCCCCAA AGCTTTGTGT TTTCCTCTTC 8456 .......... .......... .......... .......... .......... .......... 192 TCGTTCGATC CTCTCTCTCT TTTTGTTCTT TCTATTTTCT TTATTCAAAC CCTCTTTCTT 8516 .......... .......... .......... .......... .......... .......... 192 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGCAA TAATAACCCA CTAATTTACT 8576 .......... .......... .......... .......... .......... .......... 192 CAAGGTTACC TTTTTTAACC CCCAAGTAAT TAGACTTATT AACATTAACC CACTAACTTT 8636 .......... .......... .......... .......... .......... .......... 192 ATAATTAAAG CAGGAATAGT AAAAAACGTC CCTTAAAACA TTAAAGAAAT CCGACTCAGC 8696 .......... .......... .......... .......... .......... .......... 192 CTGGGATTAT GCAGCCTGTG ACGACTCGTC GTGCCTGCGA CGGTCCGTCT TGCTGCTCCG 8756 .......... .......... .......... .......... .......... .......... 192 TCACAGAGTT CAGAGACTCA ATTTCCCTTA AAGAGTCTGT GACGGTCCGT CACGCCTGTG 8816 .......... .......... .......... .......... .......... .......... 192 ACGGTCCGTC CTGCCATTCC GTTACAAAGT TCAGAGAGTC GATTTCAGTA CCCATTTTTC 8876 .......... .......... .......... .......... .......... .......... 
192 AGAATTTCTA AGTGTTTTGA AACGAGACCC CTCGACGGTC CGTCGTGCCC ATGACGGTCC 8936 .......... .......... .......... .......... .......... .......... 192 GTCGTGGGAT CCGTCGTCTC AACCATTTTT CCAGAAATAA CATTTGTTGC TCAAAATGAC 8996 .......... .......... .......... .......... .......... .......... 192 TAAACAGGTC GTTACACTAA CACTGATAAA TGTTCTTCTC TATAATGTCT ATATAGTTGA 9056 .......... .......... .......... .......... .......... .......... 192 GATTTTGAAT TTGTATTGTA TAAAACTTTG ATATTCAATA AATTTTATTG ATTTTGTTGA 9116 || ||| | | |||| || || || | || | ||| ||| || | .......... TTTTATCG-A -AAAAAGGTG ATTTTTCGAA AAGAGT-TTG TTTTATTTTA 239 AAGATTTGAT ATCCTTTTCT GTATCTATTA TTTCTCCTAA TTGTTGATTA TTCTCTTCCT 9176 ||| | | AAG-TATT.. .......... .......... .......... .......... .......... 246 TTGTCCTTTT ATTTAATATT TTTGATAGAA AGTTATTACT TATCATATTT TTGGAATAGC 9236 .......... .......... .......... .......... .......... .........T 247 TTGGTCTTGG TATTTCTTCT CTTGATCTAG TTGAAAAA 9274 || | ||| || | ||| || | |||||| TTCGACTTTA GGAGTCGCCA CTTAATTTTT AAGAAAAA 285 hqPGS_C06HBa0112G05.1-6+_SGN-E352844- (7377 7569) ******************************************************************************** EST sequence 48 -strand 405 n (File: SGN-E336814-) 1 TTTTTTTTTT TAATAAAATC AATAACTATT ATTATCCCCA AAATCTGGAA GTCATCACCA 61 CAAGAACATC TATGATCAAA GTACTAAACT AAGAGTGTTC TAAAAAGCTA AAATACAAGA 121 AAGCTAGTCC ATGCCGGAAC TTCAAGACAT CAAGACATGA AGAGGAAGAT CCAGTCCAAG 181 CTAGAAACAT TAGCTCACCT TAATATCCGG AATAATGAAG ACTGGCTAGA GTTACTGTTG 241 AGTCGAAGAT GACGGCACGT TTGCTCTCTT GTGTGATTTG CAAGCTATTG CAAAGGAGTA 301 GTTGCGGGTT TCCTTGTCAT AAAAAATAAT GCAAGGGAAA GCAGAAGATA ATGAAAATTA 361 GCAATATGTC AATCCATGAT CATGAAGTGC TTTGTTTGCT TTGGG Predicted gene structure (within gDNA segment 6561 to 9840): Exon 1 7386 7642 ( 257 n); cDNA 9 265 ( 257 n); score: 0.864 MATCH C06HBa0112G05.1-6+ SGN-E336814- 0.864 257 0.635 C PGS_C06HBa0112G05.1-6+_SGN-E336814- (7386 7642) Alignment (genomic DNA sequence = upper lines): TTCTAAAAAC TCAACAACTA TTATTATCCC CAAAATCTGG 
AAGTCATCAT CACAAGAACA 7445 || | ||| |||| ||||| |||||||||| |||||||||| ||||||||| |||||||||| TTTAATAAAA TCAATAACTA TTATTATCCC CAAAATCTGG AAGTCATCAC CACAAGAACA 68 TCTA-CTTCA AATTACTAAA TCTAAGA-TT ATCTAAGAAG CTAAAATACA TAAACAGCTA 7503 |||| ||| || ||||||| |||||| | ||||| ||| |||||||||| || ||||| TCTATGATCA AAGTACTAAA -CTAAGAGTG TTCTAAAAAG CTAAAATACA AGAA-AGCTA 126 GTCCATGCCG GAACTTCAAG GCATCAAGAC ATGAAGAGGA GGATCCAGTC CAAGCTAGAA 7563 |||||||||| |||||||||| ||||||||| |||||||||| ||||||||| |||||||||| GTCCATGCCG GAACTTCAAG ACATCAAGAC ATGAAGAGGA AGATCCAGTC CAAGCTAGAA 186 GCATTAGCTC ACCCTGAAAT CCGGAGTAAT GAAGACTGGC TAGATTTGCG GTTGAGTTGA 7623 ||||||||| ||| | | || ||||| |||| |||||||||| |||| || | ||||||| || ACATTAGCTC ACCTTAATAT CCGGAATAAT GAAGACTGGC TAGAGTTACT GTTGAGTCGA 246 AGACGACAGA ACGTTTGCT 7642 ||| ||| | ||||||||| AGATGACGGC ACGTTTGCT 265 hqPGS_C06HBa0112G05.1-6+_SGN-E336814- (7386 7642) ******************************************************************************** EST sequence 19 +strand 299 n (File: SGN-E373116+) 1 TTTTTTTTTT CTAAAAACTC AACAACTATT ATTATCCCCA AAATCTGGAA GTCATCATCA 61 CAAGAACATC TACTTCAAAT TACTAAATCT AAGAGTATCT AAGAAGCTAA AATACATAAA 121 CAGCTAGTCC ATGCCGGAAC TTCAAGGCAT CAAGACATGA AGAAGAAGAT CCAGTCCAAG 181 CTAGAAGCGT TAGCTCACCC TGAAATCCGA TGTAATGAAG ACTGGCTAGA GTTGCGGTTG 241 AGTTGAAGAC GACGGTACGT TTGCCAAAAT TACGACAGTA TTTGGACAAG CTAGAAGAG Predicted gene structure (within gDNA segment 6706 to 8700): Exon 1 7386 7639 ( 254 n); cDNA 9 262 ( 254 n); score: 0.965 MATCH C06HBa0112G05.1-6+ SGN-E373116+ 0.965 254 0.849 C PGS_C06HBa0112G05.1-6+_SGN-E373116+ (7386 7639) Alignment (genomic DNA sequence = upper lines): TTCTAAAAAC TCAACAACTA TTATTATCCC CAAAATCTGG AAGTCATCAT CACAAGAACA 7445 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCTAAAAAC TCAACAACTA TTATTATCCC CAAAATCTGG AAGTCATCAT CACAAGAACA 68 TCTACTTCAA ATTACTAAAT CTAAGATTAT CTAAGAAGCT AAAATACATA AACAGCTAGT 7505 |||||||||| |||||||||| |||||| ||| |||||||||| |||||||||| |||||||||| TCTACTTCAA 
ATTACTAAAT CTAAGAGTAT CTAAGAAGCT AAAATACATA AACAGCTAGT 128 CCATGCCGGA ACTTCAAGGC ATCAAGACAT GAAGAGGAGG ATCCAGTCCA AGCTAGAAGC 7565 |||||||||| |||||||||| |||||||||| ||||| || | |||||||||| |||||||||| CCATGCCGGA ACTTCAAGGC ATCAAGACAT GAAGAAGAAG ATCCAGTCCA AGCTAGAAGC 188 ATTAGCTCAC CCTGAAATCC GGAGTAATGA AGACTGGCTA GATTTGCGGT TGAGTTGAAG 7625 ||||||||| |||||||||| | ||||||| |||||||||| || ||||||| |||||||||| GTTAGCTCAC CCTGAAATCC GATGTAATGA AGACTGGCTA GAGTTGCGGT TGAGTTGAAG 248 ACGACAGAAC GTTT 7639 ||||| | || |||| ACGACGGTAC GTTT 262 hqPGS_C06HBa0112G05.1-6+_SGN-E373116+ (7386 7639) ******************************************************************************** EST sequence 61 -strand 666 n (File: SGN-E368629-) 1 TTTTTTTTTT TTTTTTTTTT TTTTTTATAA AAACCAATTC AATAACTATT ATTTCCCAAA 61 ATCTGGAAGT TATCATCACA AGAACATCTA CTTCGAATTA CTAAATCTAG AAGTATCTAA 121 GAGCCTAAAA TACATAACAC AGTTAGTCCA TGCCGAAACT TCAAGGCATC AAGACATAAA 181 GAAGAAGATC CAGTCCAAGC TAGAAGCTTT GTTTTATCGA AAAAAGGTGA TTTTTCGAAA 241 AGAGTTTGTT TTATTTTAAA GTATTTTTCG ACTTTAGGAG TCGCCACTTA ATTTTTAAGA 301 AAAATCAAGA AAACTCATTC TCAAAACAAT TTAAACAGAA AAGTCGTTTT GAAAATATTT 361 TTTAGGATTC GGGATTCTTA TTAGCGTCTT AGGAAGGTGT TTAAGGCACC TAAGACACTC 421 CGTTAAATAC GGTTTTCCAA CGACTAACTT ATTTGATTAT TTTTATTTTT ACCCTTTGCA 481 AATTTATTTG AACTTTTATC ACGATTTACT TAGCCAAACT TTGCAAATTT GAGATATTAA 541 TCTTTTAAGA TTCCGTCTTA GTTAAACTTT CTAAGCCTTA ACTCTCTAAG CAGACTTTCA 601 AATTTTAAAC CTCTATCGTT TCAAAACTTC AATTTTTATT TTTTAGTTTC ATAAAGCAAA 661 AGGCGT Predicted gene structure (within gDNA segment 6264 to 9840): Exon 1 7396 7569 ( 174 n); cDNA 39 211 ( 173 n); score: 0.888 Intron 1 7570 9066 (1497 n); Pd: 0.000 (s: 0.90), Pa: 0.000 (s: 0.60) Exon 2 9067 9124 ( 58 n); cDNA 212 265 ( 54 n); score: 0.603 Intron 2 9125 9235 ( 111 n); Pd: 0.990 (s: 0.58), Pa: 0.366 (s: 0) Exon 3 9236 9274 ( 39 n); cDNA 266 304 ( 39 n); score: 0.538 PPA cDNA 28 1 MATCH C06HBa0112G05.1-6+ SGN-E368629- 0.817 271 0.407 C PGS_C06HBa0112G05.1-6+_SGN-E368629- (7396 7569,9067 
9124,9236 9274) Alignment (genomic DNA sequence = upper lines): TCAACAACTA TTATTATCCC CAAAATCTGG AAGTCATCAT CACAAGAACA TCTACTTCAA 7455 |||| ||||| ||||| | || |||||||||| |||| ||||| |||||||||| |||||||| | TCAATAACTA TTATT-T-CC CAAAATCTGG AAGTTATCAT CACAAGAACA TCTACTTCGA 96 ATTACTAAAT CTAAGATTAT CTAAGAAGCT AAAATACATA A-ACAGCTAG TCCATGCCGG 7514 |||||||||| ||| | ||| |||||| || |||||||||| | |||| ||| ||||||||| ATTACTAAAT CTAGAAGTAT CTAAGAGCCT AAAATACATA ACACAGTTAG TCCATGCCGA 156 AACTTCAAGG CATCAAGACA TGAAGAGGAG GATCCAGTCC AAGCTAGAAG CATTAGCTCA 7574 |||||||||| |||||||||| | |||| || |||||||||| |||||||||| | || AACTTCAAGG CATCAAGACA TAAAGAAGAA GATCCAGTCC AAGCTAGAAG CTTTG..... 211 CCCTGAAATC CGGAGTAATG AAGACTGGCT AGATTTGCGG TTGAGTTGAA GACGACAGAA 7634 .......... .......... .......... .......... .......... .......... 211 CGTTTGCTGC ACTCCACAAA TAATCAAAAA GAAAACATAC AAGTAGGGGT CAGTACAAAA 7694 .......... .......... .......... .......... .......... .......... 211 CACAGGTACT GAGTAGATAT CATCGGCCAA CTCAAAATAG AAAACAGTAT ATATCAGATA 7754 .......... .......... .......... .......... .......... .......... 211 ATATCATAAA ATCAACTACA GTACTCAACA TGCGGCATTT ACAATTACCA TAACCCTTGG 7814 .......... .......... .......... .......... .......... .......... 211 TCGCAACACC AAGCTCATCA ATGAGGACTC ATGCCTCCCC ATCATACTCA TTTGGGAATT 7874 .......... .......... .......... .......... .......... .......... 211 AAGTTCCTTA AATTGAGTAT ATTAACATAT TTCAAGATTC ATTCTCTTTA CTAATCCTGG 7934 .......... .......... .......... .......... .......... .......... 211 TGTCAGAACG TGACACCCGA TCCATATATA CTATCCTGGT ACCGGAACGT GGCACCCGAT 7994 .......... .......... .......... .......... .......... .......... 211 CCATATTCTA TCCTGGTGTC GGAACGTGAC ACTCCGATCC TCATATACTA TCCTGGTACC 8054 .......... .......... .......... .......... .......... .......... 211 GGAACGTGGC ACCCGATCCA TATTCTATCC TGGTGTCGGA ACGTGACACT CCGATCCTCA 8114 .......... .......... .......... .......... .......... .......... 
211 TATACTATCC TGGTACCGGA ACGTGACACC CGATCCCCTA ATCTCACTAC TTTCGTTCAT 8174 .......... .......... .......... .......... .......... .......... 211 CAAGCCTTCT TGTATACTAA GGCATCATCA TTAACAAAGT AGATTAGGGT TTCTTTTTCA 8234 .......... .......... .......... .......... .......... .......... 211 AGATTTAGAA TTCAATAGCT TCATCATGCT TATCTCATCA CAATTATATA ATCACAATAT 8294 .......... .......... .......... .......... .......... .......... 211 GCAAACACAC AATTAAGCAT ATAGAAGGGT TTACAACACT ACCCAATACA TATCATTCGA 8354 .......... .......... .......... .......... .......... .......... 211 TATTAAGAGT TTACTACGAA TAGTGTAAAA ACCATAACCT ACCTCCATCG AAGATTAGTG 8414 .......... .......... .......... .......... .......... .......... 211 ATCAAGCAAG CAAATTCCCC AAAGCTTTGT GTTTTCCTCT TCTCGTTCGA TCCTCTCTCT 8474 .......... .......... .......... .......... .......... .......... 211 CTTTTTGTTC TTTCTATTTT CTTTATTCAA ACCCTCTTTC TTTTACCCTA ATTAGCATAT 8534 .......... .......... .......... .......... .......... .......... 211 AATTAAGAAT AAAAGATGGC AATAATAACC CACTAATTTA CTCAAGGTTA CCTTTTTTAA 8594 .......... .......... .......... .......... .......... .......... 211 CCCCCAAGTA ATTAGACTTA TTAACATTAA CCCACTAACT TTATAATTAA AGCAGGAATA 8654 .......... .......... .......... .......... .......... .......... 211 GTAAAAAACG TCCCTTAAAA CATTAAAGAA ATCCGACTCA GCCTGGGATT ATGCAGCCTG 8714 .......... .......... .......... .......... .......... .......... 211 TGACGACTCG TCGTGCCTGC GACGGTCCGT CTTGCTGCTC CGTCACAGAG TTCAGAGACT 8774 .......... .......... .......... .......... .......... .......... 211 CAATTTCCCT TAAAGAGTCT GTGACGGTCC GTCACGCCTG TGACGGTCCG TCCTGCCATT 8834 .......... .......... .......... .......... .......... .......... 211 CCGTTACAAA GTTCAGAGAG TCGATTTCAG TACCCATTTT TCAGAATTTC TAAGTGTTTT 8894 .......... .......... .......... .......... .......... .......... 211 GAAACGAGAC CCCTCGACGG TCCGTCGTGC CCATGACGGT CCGTCGTGGG ATCCGTCGTC 8954 .......... .......... .......... .......... .......... .......... 
211 TCAACCATTT TTCCAGAAAT AACATTTGTT GCTCAAAATG ACTAAACAGG TCGTTACACT 9014 .......... .......... .......... .......... .......... .......... 211 AACACTGATA AATGTTCTTC TCTATAATGT CTATATAGTT GAGATTTTGA ATTTGTATTG 9074 || ||| | .......... .......... .......... .......... .......... ..TTTTATCG 219 TATAAAACTT TGATATTCAA TAAATTTTAT TGATTTTGTT GAAAGATTTG ATATCCTTTT 9134 | |||| |||| || ||| | | || ||| || |||| | | -A-AAAAAGG TGATTTTTCG AAAAGAGT-T TGTTTTATTT TAAAG-TATT .......... 265 CTGTATCTAT TATTTCTCCT AATTGTTGAT TATTCTCTTC CTTTGTCCTT TTATTTAATA 9194 .......... .......... .......... .......... .......... .......... 265 TTTTTGATAG AAAGTTATTA CTTATCATAT TTTTGGAATA GCTTGGTCTT GGTATTTCTT 9254 || | ||| || .......... .......... .......... .......... .TTTCGACTT TAGGAGTCGC 284 CTCTTGATCT AGTTGAAAAA 9274 | ||| || | |||||| CACTTAATTT TTAAGAAAAA 304 hqPGS_C06HBa0112G05.1-6+_SGN-E368629- (7396 7569) ******************************************************************************** EST sequence 59 -strand 620 n (File: SGN-E238551-) 1 CTATTATTTC CCAAAATCTG GAAGTTATCA TCACAAGAAC ATCTACTTCG AATTACTAAA 61 TCTAAGAGTA TCTAAGAAGC TAAAATACAT AAACAGCTAG TCCATGCCGG AACTTCAAGG 121 CATCAAGACA TGAAGAAGAA GATCCAGTCC AAGCTAGAAG CTTTGTTTTA TCGAAAAAAG 181 GTGATTTTTC GAAAAGAGTT TGTTTTATTT TAAAGTATTT TTCGACTTTA GGAGTCGCCA 241 CTTAATTTTT AAGAAAAATC AAGAAAACTC ATTCTCAAAA CAATTTAAAC AGAAAAGTCG 301 TTTTGAAAAT ATTTTTTAGG ATTCGGGATT CTTATTAGCG TCTTAGGAAG GTGTTTAAGG 361 CACCTAAGAC ACTCCGTTAA ATACGGTTTT CCAACGACTA ACTTATTTGA TTATTTTTAT 421 TTTTACCCTT TGCAAATTTA TTTGAACTTT TATCACGATT TACTTAGCCA AACTTTGCAA 481 ATTTGAGATA TTAATCTTTT AAGATTCCGT CTTAGTTAAA CTTTCTAAGC CTTAACTCTC 541 TAAGCAGACT TTCAAATTTT AAACCTCTAT CGTTTCAAAA CTTCAATTTT TATTTTTTAG 601 TTTCATAAAG CAAAAGGCGT Predicted gene structure (within gDNA segment 6714 to 9840): Exon 1 7403 7569 ( 167 n); cDNA 1 165 ( 165 n); score: 0.946 Intron 1 7570 9066 (1497 n); Pd: 0.000 (s: 0.92), Pa: 0.000 (s: 0.60) Exon 2 9067 9124 ( 58 n); cDNA 166 219 ( 54 n); score: 
0.603 Intron 2 9125 9235 ( 111 n); Pd: 0.990 (s: 0.58), Pa: 0.366 (s: 0) Exon 3 9236 9274 ( 39 n); cDNA 220 258 ( 39 n); score: 0.538 MATCH C06HBa0112G05.1-6+ SGN-E238551- 0.858 264 0.426 C PGS_C06HBa0112G05.1-6+_SGN-E238551- (7403 7569,9067 9124,9236 9274) Alignment (genomic DNA sequence = upper lines): CTATTATTAT CCCCAAAATC TGGAAGTCAT CATCACAAGA ACATCTACTT CAAATTACTA 7462 |||||||| | ||||||||| ||||||| || |||||||||| |||||||||| | |||||||| CTATTATT-T -CCCAAAATC TGGAAGTTAT CATCACAAGA ACATCTACTT CGAATTACTA 58 AATCTAAGAT TATCTAAGAA GCTAAAATAC ATAAACAGCT AGTCCATGCC GGAACTTCAA 7522 ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AATCTAAGAG TATCTAAGAA GCTAAAATAC ATAAACAGCT AGTCCATGCC GGAACTTCAA 118 GGCATCAAGA CATGAAGAGG AGGATCCAGT CCAAGCTAGA AGCATTAGCT CACCCTGAAA 7582 |||||||||| |||||||| | | |||||||| |||||||||| ||| || GGCATCAAGA CATGAAGAAG AAGATCCAGT CCAAGCTAGA AGCTTTG... .......... 165 TCCGGAGTAA TGAAGACTGG CTAGATTTGC GGTTGAGTTG AAGACGACAG AACGTTTGCT 7642 .......... .......... .......... .......... .......... .......... 165 GCACTCCACA AATAATCAAA AAGAAAACAT ACAAGTAGGG GTCAGTACAA AACACAGGTA 7702 .......... .......... .......... .......... .......... .......... 165 CTGAGTAGAT ATCATCGGCC AACTCAAAAT AGAAAACAGT ATATATCAGA TAATATCATA 7762 .......... .......... .......... .......... .......... .......... 165 AAATCAACTA CAGTACTCAA CATGCGGCAT TTACAATTAC CATAACCCTT GGTCGCAACA 7822 .......... .......... .......... .......... .......... .......... 165 CCAAGCTCAT CAATGAGGAC TCATGCCTCC CCATCATACT CATTTGGGAA TTAAGTTCCT 7882 .......... .......... .......... .......... .......... .......... 165 TAAATTGAGT ATATTAACAT ATTTCAAGAT TCATTCTCTT TACTAATCCT GGTGTCAGAA 7942 .......... .......... .......... .......... .......... .......... 165 CGTGACACCC GATCCATATA TACTATCCTG GTACCGGAAC GTGGCACCCG ATCCATATTC 8002 .......... .......... .......... .......... .......... .......... 165 TATCCTGGTG TCGGAACGTG ACACTCCGAT CCTCATATAC TATCCTGGTA CCGGAACGTG 8062 .......... .......... .......... 
.......... .......... .......... 165 GCACCCGATC CATATTCTAT CCTGGTGTCG GAACGTGACA CTCCGATCCT CATATACTAT 8122 .......... .......... .......... .......... .......... .......... 165 CCTGGTACCG GAACGTGACA CCCGATCCCC TAATCTCACT ACTTTCGTTC ATCAAGCCTT 8182 .......... .......... .......... .......... .......... .......... 165 CTTGTATACT AAGGCATCAT CATTAACAAA GTAGATTAGG GTTTCTTTTT CAAGATTTAG 8242 .......... .......... .......... .......... .......... .......... 165 AATTCAATAG CTTCATCATG CTTATCTCAT CACAATTATA TAATCACAAT ATGCAAACAC 8302 .......... .......... .......... .......... .......... .......... 165 ACAATTAAGC ATATAGAAGG GTTTACAACA CTACCCAATA CATATCATTC GATATTAAGA 8362 .......... .......... .......... .......... .......... .......... 165 GTTTACTACG AATAGTGTAA AAACCATAAC CTACCTCCAT CGAAGATTAG TGATCAAGCA 8422 .......... .......... .......... .......... .......... .......... 165 AGCAAATTCC CCAAAGCTTT GTGTTTTCCT CTTCTCGTTC GATCCTCTCT CTCTTTTTGT 8482 .......... .......... .......... .......... .......... .......... 165 TCTTTCTATT TTCTTTATTC AAACCCTCTT TCTTTTACCC TAATTAGCAT ATAATTAAGA 8542 .......... .......... .......... .......... .......... .......... 165 ATAAAAGATG GCAATAATAA CCCACTAATT TACTCAAGGT TACCTTTTTT AACCCCCAAG 8602 .......... .......... .......... .......... .......... .......... 165 TAATTAGACT TATTAACATT AACCCACTAA CTTTATAATT AAAGCAGGAA TAGTAAAAAA 8662 .......... .......... .......... .......... .......... .......... 165 CGTCCCTTAA AACATTAAAG AAATCCGACT CAGCCTGGGA TTATGCAGCC TGTGACGACT 8722 .......... .......... .......... .......... .......... .......... 165 CGTCGTGCCT GCGACGGTCC GTCTTGCTGC TCCGTCACAG AGTTCAGAGA CTCAATTTCC 8782 .......... .......... .......... .......... .......... .......... 165 CTTAAAGAGT CTGTGACGGT CCGTCACGCC TGTGACGGTC CGTCCTGCCA TTCCGTTACA 8842 .......... .......... .......... .......... .......... .......... 165 AAGTTCAGAG AGTCGATTTC AGTACCCATT TTTCAGAATT TCTAAGTGTT TTGAAACGAG 8902 .......... .......... .......... .......... .......... 
.......... 165 ACCCCTCGAC GGTCCGTCGT GCCCATGACG GTCCGTCGTG GGATCCGTCG TCTCAACCAT 8962 .......... .......... .......... .......... .......... .......... 165 TTTTCCAGAA ATAACATTTG TTGCTCAAAA TGACTAAACA GGTCGTTACA CTAACACTGA 9022 .......... .......... .......... .......... .......... .......... 165 TAAATGTTCT TCTCTATAAT GTCTATATAG TTGAGATTTT GAATTTGTAT TGTATAAAAC 9082 || ||| | | |||| .......... .......... .......... .......... ....TTTTAT CG-A-AAAAA 179 TTTGATATTC AATAAATTTT ATTGATTTTG TTGAAAGATT TGATATCCTT TTCTGTATCT 9142 |||| || ||| | ||| ||| || |||| | | GGTGATTTTT CGAAAAGAGT -TTGTTTTAT TTTAAAG-TA TT........ .......... 219 ATTATTTCTC CTAATTGTTG ATTATTCTCT TCCTTTGTCC TTTTATTTAA TATTTTTGAT 9202 .......... .......... .......... .......... .......... .......... 219 AGAAAGTTAT TACTTATCAT ATTTTTGGAA TAGCTTGGTC TTGGTATTTC TTCTCTTGAT 9262 || | | || || | ||| || .......... .......... .......... ...TTTCGAC TTTAGGAGTC GCCACTTAAT 246 CTAGTTGAAA AA 9274 | |||| || TTTTAAGAAA AA 258 hqPGS_C06HBa0112G05.1-6+_SGN-E238551- (7403 7569) ******************************************************************************** EST sequence 3 +strand 434 n (File: SGN-E222578+) 1 TTTTTTTTTT TTTTTTTTTA ATAAAAACCA ATTCAATAAC TATCAATATT CAACATCTAT 61 TATTCCCAAA ACCTGGAAGT CATCATCACA AGAACATCTA CTTTAAACTA CTAATTCTAA 121 GAGTTTCTAA AAGCTAAAAA TACATAAGAA GCTAGTCCAT GCCGGAGGTT CAAGGCATCA 181 AGACATGAAG GAGAAGATCC AGTCCAAGCT AGACGCGTTA GCTCACCCTG AAGATCCGGT 241 GTGACGAAGA CTGGCTTGAG TTACTGTTGA GTCGAAGATG ACGGCACGTT TGCTGCACTC 301 CACAACTTTC TAGATGGGGA CTTTCTTCAA GGCTTCGAGA TGGAAACTTG CTTGCAGAGC 361 TTCGAGTGTT ACCAGCTTCA AGATGGAGTT TCAGTGATGA GGCTTGCTAG TCTCGAGTTT 421 TTTTTTTTTT TTTT Predicted gene structure (within gDNA segment 6102 to 9840): Exon 1 7193 7199 ( 7 n); cDNA 48 54 ( 7 n); score: 0.857 Intron 1 7200 7403 ( 204 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.88) Exon 2 7404 7653 ( 250 n); cDNA 55 305 ( 251 n); score: 0.856 PPA cDNA 19 1 MATCH C06HBa0112G05.1-6+ SGN-E222578+ 0.856 257 0.592 C 
PGS_C06HBa0112G05.1-6+_SGN-E222578+ (7193 7199,7404 7653) Alignment (genomic DNA sequence = upper lines): ATTCAGCGTT TATGAGTGGT AACGATACAG AGCCTTTTCC TAACTGTCAC GATCCAAATC 7252 ||||| | ATTCAAC... .......... .......... .......... .......... .......... 54 GGGCCGCGAC TAGCACCCAC ACTTACCCTC CTATGTGAGC GAACCAACCA ATCCAAACCC 7312 .......... .......... .......... .......... .......... .......... 54 CAACATTTTC AAACATAGTA ACAGAATATA ATGCGGAAGA CTTAAACTCA TTAATGAAAA 7372 .......... .......... .......... .......... .......... .......... 54 TCAATTAAAT AACTTCTAAA AACTCAACAA CTATTATTAT CCCCAAAATC TGGAAGTCAT 7432 |||||| ||||||| | |||||||||| .......... .......... .......... .ATCTATTAT TCCCAAAACC TGGAAGTCAT 83 CATCACAAGA ACATCTACTT CAAATTACTA AATCTAAGAT TATCTAAGAA GCT-AAAATA 7491 |||||||||| |||||||||| ||| ||||| | ||||||| | ||||| || ||| |||||| CATCACAAGA ACATCTACTT TAAACTACTA ATTCTAAGAG TTTCTAA-AA GCTAAAAATA 142 CATAAACAGC TAGTCCATGC CGGAACTTCA AGGCATCAAG ACATGAAGAG GAGGATCCAG 7551 ||||| ||| |||||||||| |||| |||| |||||||||| |||||||| || ||||||| CATAAGAAGC TAGTCCATGC CGGAGGTTCA AGGCATCAAG ACATGAAGGA GAAGATCCAG 202 TCCAAGCTAG AAGCATTAGC TCACCCTGAA -ATCCGGAGT AATGAAGACT GGCTAGATTT 7610 |||||||||| | || ||||| |||||||||| |||||| || | ||||||| |||| || || TCCAAGCTAG ACGCGTTAGC TCACCCTGAA GATCCGGTGT GACGAAGACT GGCTTGAGTT 262 GCGGTTGAGT TGAAGACGAC AGAACGTTTG CTGCACTCCA CAA 7653 | ||||||| ||||| ||| | ||||||| |||||||||| ||| ACTGTTGAGT CGAAGATGAC GGCACGTTTG CTGCACTCCA CAA 305 hqPGS_C06HBa0112G05.1-6+_SGN-E222578+ (7404 7653) ******************************************************************************** EST sequence 62 -strand 481 n (File: SGN-E246710-) 1 AGTCCAAGCT AGAAGCATTA GCTCACCCTG AATTTCCGAT GTAGTAAGAC TGGCTTGAAT 61 TACTGTTGAG TTGAACACGA TGGCACGTTT GCTGCACTCC ACAAATAAAC AAGAAGAGAA 121 CATAAAAGTA GGGGTCAGTA CAAAACACGG GTACTGAGTA GATATCATCG GCCAACTCAA 181 AATAGAAATC AATATATATA CCAAGTAATA TCATAAAATC AACTATGATA CTCAACATGT 241 AGCAACAACA AATACTATAT CATTAACAAT TACCGTCAAG TTCACACATG AGGACTCAAG 301 
CCTCAATACC ATACTCATTT GGGAATCATG TTCATTAGAT TGAGTATATT AACATCTTTC 361 AAGATTCATT ATCTTTATTT CTCTTGTGTC GGTACGTGAC ACTCCGCTCC CTCAATATTC 421 ATTAATCCTC TTGTGTCGGT ACGTGACACT CCGATCCCCT AAATCTATAT GTCGGTTTGT 481 G Predicted gene structure (within gDNA segment 6940 to 9762): Exon 1 7550 7950 ( 401 n); cDNA 1 402 ( 402 n); score: 0.813 MATCH C06HBa0112G05.1-6+ SGN-E246710- 0.813 401 0.834 C PGS_C06HBa0112G05.1-6+_SGN-E246710- (7550 7950) Alignment (genomic DNA sequence = upper lines): AGTCCAAGCT AGAAGCATTA GCTCACCCTG AAATCCGGAG TAATGAAGAC TGGCTAGATT 7609 |||||||||| |||||||||| |||||||||| || | | || | ||||| ||||| || | AGTCCAAGCT AGAAGCATTA GCTCACCCTG AATTTCCGAT GTAGTAAGAC TGGCTTGAAT 60 TGCGGTTGAG TTGAAGACGA CAGAACGTTT GCTGCACTCC ACAAATAATC AAAAAGAAAA 7669 | | |||||| ||||| |||| | |||||| |||||||||| |||||||| | || |||| || TACTGTTGAG TTGAACACGA TGGCACGTTT GCTGCACTCC ACAAATAAAC AAGAAGAGAA 120 CATACAAGTA GGGGTCAGTA CAAAACACAG GTACTGAGTA GATATCATCG GCCAACTCAA 7729 |||| ||||| |||||||||| |||||||| | |||||||||| |||||||||| |||||||||| CATAAAAGTA GGGGTCAGTA CAAAACACGG GTACTGAGTA GATATCATCG GCCAACTCAA 180 AATAGAAAAC AGTATATAT- -CAGATAATA TCATAAAATC AACTACAGTA CTCAACATGC 7787 |||||||| | | ||||||| || ||||| |||||||||| ||||| || ||||||||| AATAGAAATC AATATATATA CCAAGTAATA TCATAAAATC AACTATGATA CTCAACATGT 240 GGCATTTACA ATTACCATAA CCCTTGGTCG CAACACCAAG CTCATCAATG AGGACTCATG 7847 ||| ||| | ||| ||| | | | | |||| ||| ||| |||||||| | AGCAACAACA AATACTATAT CATTAACAAT TACCGTCAAG TTCACACATG AGGACTCAAG 300 CCTCCCCATC ATACTCATTT GGGAATTAAG TTCCTTAAAT TGAGTATATT AACATATTTC 7907 |||| | | |||||||||| |||||| | | ||| ||| || |||||||||| ||||| |||| CCTCAATACC ATACTCATTT GGGAATCATG TTCATTAGAT TGAGTATATT AACATCTTTC 360 AAGATTCATT CTCTTTACTA ATCCTGGTGT CAGAACGTGA CAC 7950 |||||||||| |||||| | | || |||| | | |||||| ||| AAGATTCATT ATCTTTATTT CT-CTTGTGT CGGTACGTGA CAC 402 hqPGS_C06HBa0112G05.1-6+_SGN-E246710- (7550 7950) ******************************************************************************** EST sequence 68 
-strand 236 n (File: SGN-E209683-) 1 CACAAATAAC AAGAAGATAA ACATAAAAGT AGGGGTCAGT ACAAACCACG GGTACTGAGT 61 AGATATCATC GGCCAACTCA AAATAGGGAA CAGTATGTAT TAAGCAATAT CATAAAATCA 121 ACTAATATCC TTAACATGCA GCATTTATAG TTACCATAAC CCTTGGTTAC AACACCAAGC 181 ACATCAATGA GGACTCACAC CTCCTCATCA TACTCATTTG GGAATTTAGT TCATTA Predicted gene structure (within gDNA segment 6805 to 8574): Exon 1 7649 7884 ( 236 n); cDNA 1 236 ( 236 n); score: 0.871 MATCH C06HBa0112G05.1-6+ SGN-E209683- 0.871 236 1.000 C PGS_C06HBa0112G05.1-6+_SGN-E209683- (7649 7884) Alignment (genomic DNA sequence = upper lines): CACAAATAAT CAAAAAGA-A AACATACAAG TAGGGGTCAG TACAAAACAC AGGTACTGAG 7707 ||||||||| ||| |||| | |||||| ||| |||||||||| |||||| ||| ||||||||| CACAAATAA- CAAGAAGATA AACATAAAAG TAGGGGTCAG TACAAACCAC GGGTACTGAG 59 TAGATATCAT CGGCCAACTC AAAATAGAAA ACAGTATATA TCAGATAATA TCATAAAATC 7767 |||||||||| |||||||||| ||||||| | ||||||| || | | |||| |||||||||| TAGATATCAT CGGCCAACTC AAAATAGGGA ACAGTATGTA TTAAGCAATA TCATAAAATC 119 AACTACAGTA CTCAACATGC GGCATTTACA ATTACCATAA CCCTTGGTCG CAACACCAAG 7827 ||||| | || ||||||| ||||||| | ||||||||| |||||||| |||||||||| AACTAATATC CTTAACATGC AGCATTTATA GTTACCATAA CCCTTGGTTA CAACACCAAG 179 CTCATCAATG AGGACTCATG CCTCCCCATC ATACTCATTT GGGAATTAAG TTCCTTA 7884 | |||||||| |||||||| ||||| |||| |||||||||| ||||||| || ||| ||| CACATCAATG AGGACTCACA CCTCCTCATC ATACTCATTT GGGAATTTAG TTCATTA 236 hqPGS_C06HBa0112G05.1-6+_SGN-E209683- (7649 7884) ******************************************************************************** EST sequence 66 -strand 725 n (File: SGN-E546548-) 1 GGTACCGGAA CGTGGCACCC GATCCATATT CTATCCTGGT GTCGGAACGT GACACTCCGA 61 TCCTCATATT CATTCTATCC TGGTACCGGA ACGTGGCACC CGATCCCCTA ATCCATCAAG 121 CCTTCTTTTA CACTAAGGCA TCATCATTCT CATTATATAA TTTATCAAGC CTTCTTTCAT 181 ACTAAGGCAT CATCATTCTC ATTATATAAT ATATCAAGCG AATTAGGGTT CTTTCAAGAT 241 TTGGGATTCA ATTGCTTCAT CATGCTTTGT TAATTCATCG CAATTTCATA ATCATAATCA 301 TGCAAGCATA CAACTTAAGC ACATAGCAGG GTTTACAATA CTATCAACAC ATAATATTCA 361 CTATTAAGAG 
TTCACTACGA ATATCGTAAC ATAAACCATA ACCTACCTCC ACCGAAGAAT 421 TGAATCAACA AGCTATCTTC TCAAAATCCT TGCTATCCTC TTCGTTTCTC TCTCTCTACT 481 CGTTCGTTTC TCCTCTCTTT CTGTTCTTTT CTTTTGTTTT GTTTTATTCA AACCCTCCTT 541 CTTTTTACCC TAATTAAAAG TATAATTAAG TGTAAAGGAG GACAATAAAA CCCACTAATT 601 AACTTAAGGT TACCTCTTTT AACCCCCAAG TAATTAGACC TATTAATATT AACCCTCAAT 661 CTTTATAATT AAGGAAAGAA TAGTCCAAAA CGACCCCTAA AACGTGTAGA GGAATCCTAT 721 TTTGC Predicted gene structure (within gDNA segment 7362 to 9840): Exon 1 7972 8042 ( 71 n); cDNA 1 71 ( 71 n); score: 0.986 Intron 1 8043 8075 ( 33 n); Pd: 0.000 (s: 0.98), Pa: 0.000 (s: 0.78) Exon 2 8076 8696 ( 621 n); cDNA 72 725 ( 654 n); score: 0.639 MATCH C06HBa0112G05.1-6+ SGN-E546548- 0.675 692 0.954 C PGS_C06HBa0112G05.1-6+_SGN-E546548- (7972 8042,8076 8696) Alignment (genomic DNA sequence = upper lines): GGTACCGGAA CGTGGCACCC GATCCATATT CTATCCTGGT GTCGGAACGT GACACTCCGA 8031 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGTACCGGAA CGTGGCACCC GATCCATATT CTATCCTGGT GTCGGAACGT GACACTCCGA 60 TCCTCATATA CTATCCTGGT ACCGGAACGT GGCACCCGAT CCATATTCTA TCCTGGTGTC 8091 ||||||||| | |||||| ||||||| | TCCTCATATT C......... .......... .......... 
....ATTCTA TCCTGGTACC 87 GGAACGTGAC ACTCCGATCC TCATATACTA TCCTG-GTAC CGGAACGTGA CACCCGATCC 8150 |||||||| | || ||||||| | || | | || | | | || | | | ||| GGAACGTGGC AC-CCGATCC -CCTAATCCA TCAAGCCTTC TTTTACACTA -AGGC-ATCA 143 CCTAATCTCA CTACTTTCGT TCATCAAGCC TTCTTGTATA CTAAGGCATC ATCA-T-T-A 8207 | | ||||| || | | | | |||||||| ||||| ||| |||||||||| |||| | | | TC-ATTCTCA TTA-TATAAT TTATCAAGCC TTCTTTCATA CTAAGGCATC ATCATTCTCA 201 --ACAAAGTA GAT-TAGGGT -TTC---TT- TTTCAAGATT TAGAATTCAA TAGCTTCATC 8259 | | | || || || | || || |||||||||| | | |||||| | |||||||| TTATATAATA TATCAAGCGA ATTAGGGTTC TTTCAAGATT TGGGATTCAA TTGCTTCATC 261 ATGC-TT-AT -CT-CATCAC AATTATATAA TCACAAT-AT GCAAACACAC AA-TTAAGCA 8313 |||| || | | |||| | |||| |||| ||| ||| || |||| || || || ||||||| ATGCTTTGTT AATTCATCGC AATTTCATAA TCATAATCAT GCAAGCATAC AACTTAAGCA 321 TATAGAAGGG TTTACAACAC TACCCAATAC ATATCATTCG ATATTAAGAG TTTACTACGA 8373 |||| |||| ||||||| || || ||| || ||| |||| ||||||||| || ||||||| CATAGCAGGG TTTACAATAC TA-TCAACAC ATAATATTCA CTATTAAGAG TTCACTACGA 380 ATAGTGT-A- A-AAACCATA ACCTACCTCC ATCGAAG-AT T-AGTGATCA AGC-A----A 8423 ||| || | | |||||||| |||||||||| | ||||| || | | | | || ||| | ATATCGTAAC ATAAACCATA ACCTACCTCC ACCGAAGAAT TGAATCAACA AGCTATCTTC 440 GCAAATTCC- -CCAAAGCT- TT-GTGT-TT TC-CTCTTCT CGTTCGATCC TCTCTCTCTT 8477 |||| ||| | | || || || | | || |||| || |||||| | | || ||||||| TCAAAATCCT TGCTATCCTC TTCGTTTCTC TCTCTCTACT CGTTCGTTTC TC-CTCTCTT 499 TTTGTTC-TT TC---TATTT T-CTTTATTC AAACCCTCTT TC-TTTTACC CTAATTAGCA 8531 | ||||| || || | ||| | ||||||| |||||||| | || ||||||| ||||||| | TCTGTTCTTT TCTTTTGTTT TGTTTTATTC AAACCCTCCT TCTTTTTACC CTAATTAAAA 559 -TATAATTAA GAATAAAAGA TGGCAATAAT AACCCACTAA TTTACTCAAG GTTACCTTTT 8590 ||||||||| | |||| || | |||||| |||||||||| || ||| ||| ||||||| || GTATAATTAA GTGTAAAGGA GGACAATAA- AACCCACTAA TTAACTTAAG GTTACCTCTT 618 TTAACCCCCA AGTAATTAGA CTTATTAACA TTAACCCACT AACTTTATAA TTAAAGCAGG 8650 |||||||||| |||||||||| | |||||| | ||||||| | | |||||||| |||| | | | TTAACCCCCA AGTAATTAGA CCTATTAATA 
TTAACCCTCA ATCTTTATAA TTAAGGAAAG 678 AATAGTAAAA AACGTCCCTT AAAACAT-TA AAGAAATCCG ACTCAGC 8696 |||||| || |||| ||| | ||||| | || || ||||| | | || AATAGTCCAA AACGACCCCT AAAACGTGTA GAGGAATCCT ATTTTGC 725 hqPGS_C06HBa0112G05.1-6+_SGN-E546548- (7972 8042) ******************************************************************************** EST sequence 46 -strand 681 n (File: SGN-E389553-) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCGTTCC TACTTAAATA TTATTATTAT TTTACGATTT 601 ATAACACTAT TAGAAACAAA GATTTTCTCA ACCATGAATT AATGAAAAAA TTATGGAATA 661 AAATATAAAA AATTACTCAT T Predicted gene structure (within gDNA segment 7352 to 9840): Exon 1 8457 9027 ( 571 n); cDNA 2 571 ( 570 n); score: 0.875 Intron 1 9028 9806 ( 779 n); Pd: 0.258 (s: 0.72), Pa: 0.000 (s: 0) Exon 2 9807 9837 ( 31 n); cDNA 572 602 ( 31 n); score: 0.726 MATCH C06HBa0112G05.1-6+ SGN-E389553- 0.875 602 0.884 C PGS_C06HBa0112G05.1-6+_SGN-E389553- (8457 9027,9807 9837) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT TTTTGTTCTT TCTATTTTCT TTATTCAAAC CCTCTTTCTT 8516 ||| | | ||||||||| || ||||||| | | |||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGCAA TAATAACCCA CTAATTTACT 8576 |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| |||||||||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 
CAAGGTTACC TTTTTTAACC CCCAAGTAAT TAGACTTATT AACATTAACC CACTAACTTT 8636 |||||||||| | |||||||| |||| ||||| |||||||||| ||||| |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT AAAAAACGTC CCTTAAAACA T-TAAAGAAA TCCGACTCAG 8695 |||||||||| ||||||||| |||||||| ||||||||| | |||||||| |||||| ||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA TGCAGCCTGT GACGACTCGT CGTGCCTGCG ACGGTCCGTC TTGCTGCTCC 8755 ||||||||| ||| ||||| || | | ||| |||||||||| |||||||||| ||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCAGAGACTC AATTTCCCTT AAAGAGTCTG TGACGGTCCG TCACGCCTGT 8815 ||| || || |||||||||| ||||||| |||||||||| |||||||||| ||||||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 GACGGTCCGT CCTGCCATTC CGTTACAAAG TTCAGAGAGT CGA-TTTCAG TACCCATTTT 8874 |||||||||| | |||||||| |||||| ||| |||||||||| ||| ||| || |||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCA-ATT 417 TCAGAATTTC TAAGTGTTTT GAAACGAGAC CCCTCGACGG TCCGTCGTGC CCATGACGGT 8934 |||||||||| |||||||||| |||||||||| ||||||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTGGG ATCCGTCGTC TCAACC-ATT TTTCCAGAAA TAACATTTGT TGCTCAAAAT 8993 |||||||||| ||||||||| |||||| || |||||| ||| ||| || || | ||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTCGTTACAC TAACACTGAT AAATGTTCTT CTCTATAATG TCTATATAGT 9053 |||||||||| ||||||||| | | | | | GACTAAACAG GTCGTTACAT TTATGTTCGT TCCT...... .......... .......... 571 TGAGATTTTG AATTTGTATT GTATAAAACT TTGATATTCA ATAAATTTTA TTGATTTTGT 9113 .......... .......... .......... .......... .......... .......... 571 TGAAAGATTT GATATCCTTT TCTGTATCTA TTATTTCTCC TAATTGTTGA TTATTCTCTT 9173 .......... .......... .......... .......... .......... .......... 571 CCTTTGTCCT TTTATTTAAT ATTTTTGATA GAAAGTTATT ACTTATCATA TTTTTGGAAT 9233 .......... .......... .......... .......... 
.......... .......... 571 AGCTTGGTCT TGGTATTTCT TCTCTTGATC TAGTTGAAAA AATATCCATT TTTTCTGTTT 9293 .......... .......... .......... .......... .......... .......... 571 TTTAATTTTT TTTCTTTTTG GGGAGTAATT TCTATATTAT TCAGTTTTTC TATTAATTCT 9353 .......... .......... .......... .......... .......... .......... 571 TCAGTTTTTC TATTAATTCT TCTATTCCGG TTGTATCTAT ATTTTTACTT TCTAATTTTT 9413 .......... .......... .......... .......... .......... .......... 571 CTTTAAAGTT TTTATCTATC TTATTATTTA GTTGTAATAA AAGTGATATT ATATTATTTT 9473 .......... .......... .......... .......... .......... .......... 571 GTTGTATAAT AATTTGATCT GATTTTGAAG ATGGACTAAA TTCTTTGTAA TCTGATGGGA 9533 .......... .......... .......... .......... .......... .......... 571 GGGCTATTCT TTTTTCTATT AGTTTTGTTG CTTCTTTATG TGTTTCTGAT TCTACTAAGT 9593 .......... .......... .......... .......... .......... .......... 571 CTATCATGAA AGAACTATTT CTTTAATTGA TTCTAATTCT GTCTTTATTT CTTGTATTTT 9653 .......... .......... .......... .......... .......... .......... 571 TATTACTAAT GTATTATTAA TTTCTTCTTT AGATTTATTT TTTAACTCGT TTAATTTTTT 9713 .......... .......... .......... .......... .......... .......... 571 ATTTAAATCT ATTCCTTGTT CTTTTAGATA ATCTAAGTTA TTTTCTATCT TACTAAAAAA 9773 .......... .......... .......... .......... .......... .......... 571 CTGTTTTTGC TTATTTTGTG TTTCTTCTTG TTCACTT-AA TATTTTTATA ATTTTATTGT 9832 |||| || |||| |||| |||||| | .......... .......... .......... 
...ACTTAAA TATTATTATT ATTTTA-CGA 597 TTGAT 9837 || || TTTAT 602 hqPGS_C06HBa0112G05.1-6+_SGN-E389553- (8457 9027,9807 9837) ******************************************************************************** EST sequence 45 -strand 673 n (File: SGN-E550140-) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTACTCN 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATGTTATCAA CCATGAATTA ACAAAAAATT AGACCAAAAA 661 TATAAAAAAT TAC Predicted gene structure (within gDNA segment 7352 to 9840): Exon 1 8457 9013 ( 557 n); cDNA 2 557 ( 556 n); score: 0.882 Intron 1 9014 9750 ( 737 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0) Exon 2 9751 9781 ( 31 n); cDNA 558 587 ( 30 n); score: 0.742 MATCH C06HBa0112G05.1-6+ SGN-E550140- 0.882 588 0.874 C PGS_C06HBa0112G05.1-6+_SGN-E550140- (8457 9013,9751 9781) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT TTTTGTTCTT TCTATTTTCT TTATTCAAAC CCTCTTTCTT 8516 ||| | | ||||||||| || ||||||| | | |||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGCAA TAATAACCCA CTAATTTACT 8576 |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| |||| ||||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAA-TTACT 118 C-AAGGTTAC CTTTTTTAAC CCCCAAGTAA TTAGACTTAT TAACATTAAC CCACTAACTT 8635 | |||||||| || ||||||| ||||| |||| |||||||||| |||||| ||| |||||||||| 
CNAAGGTTAC CTCTTTTAAC CCCCAGGTAA TTAGACTTAT TAACATAAAC CCACTAACTT 178 TATAATTAAA GCAGGAATAG TAAAAAACGT CCCTTAAAAC AT-TAAAGAA ATCCGACTCA 8694 |||||||||| | |||||||| | ||||||| |||||||||| | ||||||| ||||||| || TATAATTAAA GTAGGAATAG TCCAAAACGT CCCTTAAAAC GTGTAAAGAA ATCCGACCCA 238 GCCTGGGATT ATGCAGCCTG TGACGACTCG TCGTGCCTGC GACGGTCCGT CTTGCTGCTC 8754 | |||||||| | ||| |||| ||| | | || |||||||||| |||||||||| | ||| | | GACTGGGATT ACGCAACCTG TGATGGCCCG TCGTGCCTGC GACGGTCCGT CCTGCAGGT- 297 CGTCACAGAG TTCAGAGACT CAATTTCCCT TAAAGAGTCT GTGACGGTCC GTCACGCCTG 8814 |||| || | |||||||||| |||||||| ||||||||| |||||||||| |||||||| | CGTCGCAAGG TTCAGAGACT CAATTTCCAC CAAAGAGTCT GTGACGGTCC GTCACGCCCG 357 TGACGGTCCG TCCTGCCATT CCGTTACAAA GTTCAGAGAG TCGA-TTTCA GTACCCATTT 8873 |||||||||| || ||||||| ||||||| || |||||||||| |||| ||| | ||||||| | TGACGGTCCG TCGTGCCATT CCGTTACGAA GTTCAGAGAG TCGATTTTTA GTACCCA-AT 416 TTCAGAATTT CTAAGTGTTT TGAAACGAGA CCCCTCGACG GTCCGTCGTG CCCATGACGG 8933 |||||||||| |||||||||| |||||||||| | |||||||| |||| ||||| | |||||||| TTCAGAATTT CTAAGTGTTT TGAAACGAGA CTCCTCGACG GTCCATCGTG CTCATGACGG 476 TCCGTCGTGG GATCCGTCGT CTCAACC-AT TTTTCCAGAA ATAACATTTG TTGCTCAAAA 8992 |||||||||| | |||||||| ||||||| | ||||||| || |||| || || | ||||||| TCCGTCGTGG GTTCCGTCGT CTCAACCTGT TTTTCCAAAA ATAAAATCTG CTACTCAAAA 536 TGACTAAACA GGTCGTTACA CTAACACTGA TAAATGTTCT TCTCTATAAT GTCTATATAG 9052 ||||||||| |||||||||| CGACTAAACA GGTCGTTACA T......... .......... .......... .......... 557 TTGAGATTTT GAATTTGTAT TGTATAAAAC TTTGATATTC AATAAATTTT ATTGATTTTG 9112 .......... .......... .......... .......... .......... .......... 557 TTGAAAGATT TGATATCCTT TTCTGTATCT ATTATTTCTC CTAATTGTTG ATTATTCTCT 9172 .......... .......... .......... .......... .......... .......... 557 TCCTTTGTCC TTTTATTTAA TATTTTTGAT AGAAAGTTAT TACTTATCAT ATTTTTGGAA 9232 .......... .......... .......... .......... .......... .......... 557 TAGCTTGGTC TTGGTATTTC TTCTCTTGAT CTAGTTGAAA AAATATCCAT TTTTTCTGTT 9292 .......... .......... .......... 
.......... .......... .......... 557 TTTTAATTTT TTTTCTTTTT GGGGAGTAAT TTCTATATTA TTCAGTTTTT CTATTAATTC 9352 .......... .......... .......... .......... .......... .......... 557 TTCAGTTTTT CTATTAATTC TTCTATTCCG GTTGTATCTA TATTTTTACT TTCTAATTTT 9412 .......... .......... .......... .......... .......... .......... 557 TCTTTAAAGT TTTTATCTAT CTTATTATTT AGTTGTAATA AAAGTGATAT TATATTATTT 9472 .......... .......... .......... .......... .......... .......... 557 TGTTGTATAA TAATTTGATC TGATTTTGAA GATGGACTAA ATTCTTTGTA ATCTGATGGG 9532 .......... .......... .......... .......... .......... .......... 557 AGGGCTATTC TTTTTTCTAT TAGTTTTGTT GCTTCTTTAT GTGTTTCTGA TTCTACTAAG 9592 .......... .......... .......... .......... .......... .......... 557 TCTATCATGA AAGAACTATT TCTTTAATTG ATTCTAATTC TGTCTTTATT TCTTGTATTT 9652 .......... .......... .......... .......... .......... .......... 557 TTATTACTAA TGTATTATTA ATTTCTTCTT TAGATTTATT TTTTAACTCG TTTAATTTTT 9712 .......... .......... .......... .......... .......... .......... 557 TATTTAAATC TATTCCTTGT TCTTTTAGAT AATCTAAGTT ATTTTCTATC TTACTAAAAA 9772 || || |||| || |||| ||| .......... .......... .......... 
........TT ATGTTCT-TC CTACTTAAAT 578 ACTGTTTTT 9781 | | || || ATTATTATT 587 hqPGS_C06HBa0112G05.1-6+_SGN-E550140- (8457 9013,9751 9781) ******************************************************************************** EST sequence 20 +strand 686 n (File: SGN-E241789+) 1 ATGTCTGATC TTCAAGGACA TCAAGACATG AATGAGAGAG AATCCAGTCC GAGCTAGGAA 61 CAATAGCTCA CCCTGAAATC TGACGTGATG AAGACTGGTT AGAGTTGCGG TTGAGTTGAA 121 GACGACGGTA CGTTTGCTGC ACTCCACAAT TAACAAAAAG AAAACATAAA AGTAGGGGTC 181 AGTACAAACA CGAGTACTGA GTAGATATCA TCGGCCAACT CAGAATAGAG AACAATATAT 241 ATCAAATAAT AAAATAAAAT CAACCATAAC ACTTAACAGG TGACAACAAC AAGTACCATA 301 ACCATTGGGC ACAACCCAAG AACATCTATG AGGACTCAAG CCTCCACACC ATACTCATTT 361 GGGAAACAGG TTCATTAAAT TGAGTACATT AACATAATTC AAGATTCATT CTTTTTACTA 421 TCGTGGTGTC GGAACGTGAT ACTCCGATCC CCTAATGCTA CGTGTCGGTT CGTGACACCC 481 GATCCCCTAA TACTACGTGT CGGTTCGTTA CACCCGATCT CCTAATACTA CGTGCCGATT 541 CGTGACACCC GATCCATTAA TACTATGTGT CGGTTCGTGA CACCCGATCC ATTAATACTA 601 CGTGTCGGTT CGTGACACCC GATCCCCTAA CCTCATTCTT TTAGTTCATC AAGCCTTCTT 661 TTATACCAAG ACATCATCAT TAACAA Predicted gene structure (within gDNA segment 5969 to 9840): Exon 1 7508 8211 ( 704 n); cDNA 1 686 ( 686 n); score: 0.803 MATCH C06HBa0112G05.1-6+ SGN-E241789+ 0.803 704 1.026 C PGS_C06HBa0112G05.1-6+_SGN-E241789+ (7508 8211) Alignment (genomic DNA sequence = upper lines): ATGCCGGAAC TTCAAGG-CA TCAAGACATG AA-GAG-GAG GATCCAGTCC AAGCTAGAAG 7564 ||| | || | ||||||| || |||||||||| || ||| ||| ||||||||| |||||| | ATGTCTGATC TTCAAGGACA TCAAGACATG AATGAGAGAG AATCCAGTCC GAGCTAGGAA 60 CATTAGCTCA CCCTGAAATC CGGAGTAATG AAGACTGGCT AGATTTGCGG TTGAGTTGAA 7624 || ||||||| |||||||||| | || ||| |||||||| | ||| |||||| |||||||||| CAATAGCTCA CCCTGAAATC TGACGTGATG AAGACTGGTT AGAGTTGCGG TTGAGTTGAA 120 GACGACAGAA CGTTTGCTGC ACTCCACAAA TAATCAAAAA GAAAACATAC AAGTAGGGGT 7684 |||||| | | |||||||||| ||||||||| ||| |||||| ||||||||| |||||||||| GACGACGGTA CGTTTGCTGC ACTCCACAAT TAA-CAAAAA GAAAACATAA AAGTAGGGGT 179 CAGTACAAAA CACAGGTACT GAGTAGATAT CATCGGCCAA 
CTCAAAATAG AAAACAGTAT 7744 |||||| ||| ||| ||||| |||||||||| |||||||||| |||| ||||| | |||| ||| CAGTAC-AAA CACGAGTACT GAGTAGATAT CATCGGCCAA CTCAGAATAG AGAACAATAT 238 ATATCAGATA ATATCATAAA ATCAACTACA GTACTCAACA TGCGGCATTT ACAATTACCA 7804 |||||| ||| ||| ||||| |||||| | | ||| |||| | | || |||| ||||| ATATCAAATA ATAAAATAAA ATCAACCATA ACACTTAACA GGTGACAACA ACAAGTACCA 298 TAACCCTTGG TCGCAACACC AAGCTCATCA ATGAGGACTC ATGCCTCCCC ATCATACTCA 7864 ||||| |||| | |||| || ||| |||| |||||||||| | |||||| | | |||||||| TAACCATTGG GCACAAC-CC AAGAACATCT ATGAGGACTC AAGCCTCCAC ACCATACTCA 357 TTTGGGAATT AAGTTCCTTA AATTGAGTAT ATTAACATAT TTCAAGATTC ATTCTCTTTA 7924 |||||||| | |||| ||| ||||||||| ||||||||| |||||||||| ||||| |||| TTTGGGAAAC AGGTTCATTA AATTGAGTAC ATTAACATAA TTCAAGATTC ATTCTTTTTA 417 CTAATCCTGG TGTCAGAACG TGACAC-CCG ATCCATATAT ACTATCCTGG TACCGGAACG 7983 || ||| ||| |||| ||||| ||| || ||| |||| || ||| | | | | ||| || CT-ATCGTGG TGTCGGAACG TGATACTCCG ATCCCCTAAT GCTA-CGT-G T--CGGTTCG 472 TGGCACCCGA TCCATATTCT ATCCTGGTGT CGGAACGTGA CACTCCGATC CTCATATACT 8043 || ||||||| ||| | | | | |||| ||| ||| | ||| ||||| ||| || | | TGACACCCGA TCC-CCTAAT A-CTACGTGT CGGTTCGTTA CAC-CCGAT- CTCCTA-A-T 526 ATCCTGGTAC CGGAACGTGG CACCCGATCC ATATTCTATC CTGGTGTCGG AACGTGACAC 8103 | | || | || |||| |||||||||| || | || | ||||||| |||||||| A-CTACGTGC CGATTCGTGA CACCCGATCC AT-TAATA-C TATGTGTCGG TTCGTGACAC 583 TCCGATCCTC ATATACTATC CTGGTACCGG AACGTGACAC CCGATCCCCT AATCTCACTA 8163 ||||||| |||||| | | || ||| |||||||| |||||||||| || |||| | -CCGATCCAT TAATACTA-C GT-GT--CGG TTCGTGACAC CCGATCCCCT AACCTCATTC 638 CTTTCGTTCA TCAAGCCTTC TTGTATACTA AGGCATCATC ATTAACAA 8211 ||| ||||| |||||||||| || ||||| | || ||||||| |||||||| TTTTAGTTCA TCAAGCCTTC TTTTATACCA AGACATCATC ATTAACAA 686 hqPGS_C06HBa0112G05.1-6+_SGN-E241789+ (7508 8211) ******************************************************************************** EST sequence 52 -strand 515 n (File: SGN-E242359-) 1 AGTATGTATT AAGCAATATC ATAAAATTAA CTAATATCCT TAGCATGCAG CATTTGCAAT 61 TACCATAACC 
CTTGGTTGCA TCACCAAGCA CATCAATGAG GACTCACACC TCCTCATCAT 121 ACTTATTTGG GAATTTAGTT CATTGGATTG CATATATTAA CATATTTCAA GATTCATCAT 181 ATTTATTCCC CTCGTGTCCT TACGTGACAC TCCACTCCTC AATATACTAT CCTGGCACCG 241 GAACGTGGCA CCCGATCCAT ATTCTATCCT GGTGTCAGAA CGTGACACCC GATCCATATT 301 CTATCCTGGT GTCGGAACGT GACACCCGAT CCATATTCTA TCCTGGTACC GGAACGTGGC 361 ACCCGATCCA TATTCTATCC TGGTGTCGGA ACGTGACACC CGATCCATAT TCTATCCTGG 421 TACCGGAACG TGGCACCCGA TCCCCTAATC TCACCACTTT CGTTCATCAA GCCTTCTTTT 481 ATACCAAGGC ATCATTATTA ACAAAGTAGA TTAGG Predicted gene structure (within gDNA segment 6626 to 8822): Exon 1 7740 8158 ( 419 n); cDNA 1 413 ( 413 n); score: 0.838 MATCH C06HBa0112G05.1-6+ SGN-E242359- 0.838 419 0.814 C PGS_C06HBa0112G05.1-6+_SGN-E242359- (7740 8158) Alignment (genomic DNA sequence = upper lines): AGTATATATC AGATAATATC ATAAAATCAA CTACAGTACT CAACATGCGG CATTTACAAT 7799 ||||| ||| | |||||| ||||||| || ||| | || | ||||| | ||||| |||| AGTATGTATT AAGCAATATC ATAAAATTAA CTAATATCCT TAGCATGCAG CATTTGCAAT 60 TACCATAACC CTTGGTCGCA ACACCAAGCT CATCAATGAG GACTCATGCC TCCCCATCAT 7859 |||||||||| |||||| ||| |||||||| |||||||||| |||||| || ||| |||||| TACCATAACC CTTGGTTGCA TCACCAAGCA CATCAATGAG GACTCACACC TCCTCATCAT 120 ACTCATTTGG GAATTAAGTT CCTTAAATTG AGTATATTAA CATATTTCAA GATTCATTCT 7919 ||| |||||| ||||| |||| | || |||| |||||||| |||||||||| ||||||| | ACTTATTTGG GAATTTAGTT CATTGGATTG CATATATTAA CATATTTCAA GATTCATCAT 180 CTTTACTAAT CCTGGTGTCA GAACGTGACA C-CCGATCC- ATATATACTA TCCTGGTACC 7977 |||| | ||| ||||| |||||||| | || ||| |||||||| |||||| ||| ATTTA-TTCC CCTCGTGTCC TTACGTGACA CTCCACTCCT CAATATACTA TCCTGGCACC 239 GGAACGTGGC ACCCGATCCA TATTCTATCC TGGTGTCGGA ACGTGACACT CCGATCCTCA 8037 |||||||||| |||||||||| |||||||||| ||||||| || ||||||||| ||||| | || GGAACGTGGC ACCCGATCCA TATTCTATCC TGGTGTCAGA ACGTGACAC- CCGAT-C-CA 296 TATACTATCC TGGTACCGGA ACGTGGCACC CGATCCATAT TCTATCCTGG TGTCGGAACG 8097 ||| |||||| |||| |||| ||||| |||| |||||||||| |||||||||| | ||||||| TATTCTATCC TGGTGTCGGA ACGTGACACC CGATCCATAT TCTATCCTGG 
TACCGGAACG 356 TGACACTCCG ATCCTCATAT ACTATCCTGG TACCGGAACG TGACACCCGA TCCCCTAATC 8157 || ||| ||| || | ||||| ||||||||| | ||||||| |||||||||| | || || || TGGCAC-CCG AT-C-CATAT TCTATCCTGG TGTCGGAACG TGACACCCGA T-CCATATTC 412 T 8158 | T 413 hqPGS_C06HBa0112G05.1-6+_SGN-E242359- (7740 8158) ******************************************************************************** EST sequence 42 -strand 605 n (File: SGN-E347579-) 1 ATCCCCTAAT TCTACGTGTC GGTTCGTGAC ACCCGATCCC CTAATTCTAC GTGTCGGTTC 61 GTGACACCTG ATCCCCTAAT CTACGTGCCG GTTCGTGACA CCCGATCCCC TAATTCTACG 121 TGCCAGTTCG TGACACCCGA TCCCCTAATT CTACGTGTCG GTTCGTGACA CCCGATCCCC 181 TGCATGTGTC GGTACGTGAC ACTCCGATCC ACTAATATCA TTCTGTAAAT CATCAGGCCT 241 TCTCTATACC AAGGCATCAT CAATCCCATT ACTTTTATTC ATCAAGCCTT CTTCTATACC 301 AAGGCATCAT CATTAATAAG AGATTAGATT TTTATCAAGA TTTGGGATTC AATAACTTCA 361 TCATGCTTAA TATAATCACA ATTATATAAT CACGTTCATG CATGCATACA ATTAAGCATA 421 TAGCAGGGTT TACAATACTA CCAATACATA TCATTCTCTA TTAAGAGTTT ACTATGAAAG 481 CATGAAAACC ATAACCTACC TCCACCGAAG ATTAGTGATC AAGCAAGCAA ATTTTTCTCC 541 AAGCTTTGTT TCTCCCTTCT CGTTCGATTC TTCCTCTCTC TCTTGTTCTT TCTATTTTCT 601 TTATT Predicted gene structure (within gDNA segment 4790 to 9750): Exon 1 7193 7199 ( 7 n); cDNA 113 119 ( 7 n); score: 0.714 Intron 1 7200 7933 ( 734 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.70) Exon 2 7934 8086 ( 153 n); cDNA 120 261 ( 142 n); score: 0.644 Intron 2 8087 8153 ( 67 n); Pd: 0.000 (s: 0.46), Pa: 0.000 (s: 0.88) Exon 3 8154 8501 ( 348 n); cDNA 262 605 ( 344 n); score: 0.835 MATCH C06HBa0112G05.1-6+ SGN-E347579- 0.776 508 0.840 C PGS_C06HBa0112G05.1-6+_SGN-E347579- (7193 7199,7934 8086,8154 8501) Alignment (genomic DNA sequence = upper lines): ATTCAGCGTT TATGAGTGGT AACGATACAG AGCCTTTTCC TAACTGTCAC GATCCAAATC 7252 |||| | ATTCTAC... .......... .......... .......... .......... .......... 119 GGGCCGCGAC TAGCACCCAC ACTTACCCTC CTATGTGAGC GAACCAACCA ATCCAAACCC 7312 .......... .......... .......... .......... .......... .......... 
119 CAACATTTTC AAACATAGTA ACAGAATATA ATGCGGAAGA CTTAAACTCA TTAATGAAAA 7372 .......... .......... .......... .......... .......... .......... 119 TCAATTAAAT AACTTCTAAA AACTCAACAA CTATTATTAT CCCCAAAATC TGGAAGTCAT 7432 .......... .......... .......... .......... .......... .......... 119 CATCACAAGA ACATCTACTT CAAATTACTA AATCTAAGAT TATCTAAGAA GCTAAAATAC 7492 .......... .......... .......... .......... .......... .......... 119 ATAAACAGCT AGTCCATGCC GGAACTTCAA GGCATCAAGA CATGAAGAGG AGGATCCAGT 7552 .......... .......... .......... .......... .......... .......... 119 CCAAGCTAGA AGCATTAGCT CACCCTGAAA TCCGGAGTAA TGAAGACTGG CTAGATTTGC 7612 .......... .......... .......... .......... .......... .......... 119 GGTTGAGTTG AAGACGACAG AACGTTTGCT GCACTCCACA AATAATCAAA AAGAAAACAT 7672 .......... .......... .......... .......... .......... .......... 119 ACAAGTAGGG GTCAGTACAA AACACAGGTA CTGAGTAGAT ATCATCGGCC AACTCAAAAT 7732 .......... .......... .......... .......... .......... .......... 119 AGAAAACAGT ATATATCAGA TAATATCATA AAATCAACTA CAGTACTCAA CATGCGGCAT 7792 .......... .......... .......... .......... .......... .......... 119 TTACAATTAC CATAACCCTT GGTCGCAACA CCAAGCTCAT CAATGAGGAC TCATGCCTCC 7852 .......... .......... .......... .......... .......... .......... 119 CCATCATACT CATTTGGGAA TTAAGTTCCT TAAATTGAGT ATATTAACAT ATTTCAAGAT 7912 .......... .......... .......... .......... .......... .......... 119 TCATTCTCTT TACTAATCCT GGTGTCAGAA CGTGACACCC GATCCATATA TACTATCCTG 7972 ||| ||| |||||||||| ||||| | | ||| | | .......... .......... .GTGCCAGTT CGTGACACCC GATCCCCTAA TTCTA-CGT- 156 GTACCGGAAC GTGGCACCCG ATCCATATTC TATCCTGGTG TCGGAACGTG ACACTCCGAT 8032 || ||| | ||| |||||| |||| | | | || || |||| ||||| |||||||||| GT--CGGTTC GTGACACCCG ATCC----CC T-GCATG-TG TCGGTACGTG ACACTCCGAT 208 CCTCATATAC TATCCTGGTA CCGGAACGTG GC-ACCCGAT CCATATTCTA TCCTGGTGTC 8091 || | ||| || || ||| | | | | | | || | | | | || | CCACTAATAT CATTCT-GTA AATCATCAGG CCTTCTCTAT ACCAAGGC-A TCATC..... 
261 GGAACGTGAC ACTCCGATCC TCATATACTA TCCTGGTACC GGAACGTGAC ACCCGATCCC 8151 .......... .......... .......... .......... .......... .......... 261 CTAATCTCAC TACTTTCGTT CATCAAGCCT TCTTGTATAC TAAGGCATCA TCATTAACAA 8211 |||| || |||||| || |||||||||| |||| ||||| ||||||||| ||||||| | ..AATCCCAT TACTTTTATT CATCAAGCCT TCTTCTATAC CAAGGCATCA TCATTAA-TA 318 AGTAGATTAG GGTTTCTTTT TCAAGATTTA GAATTCAATA GCTTCATCAT GCTT-ATCTC 8270 || |||||| | || ||| ||||||||| | |||||||| ||||||||| |||| || | AG-AGATTA- -GATT-TTTA TCAAGATTTG GGATTCAATA ACTTCATCAT GCTTAATATA 374 ATCACAATTA TATAATCACA AT-ATGCAAA CACACAATTA AGCATATAGA AGGGTTTACA 8329 |||||||||| ||||||||| | ||||| || ||||||| ||||||||| |||||||||| ATCACAATTA TATAATCACG TTCATGCATG CATACAATTA AGCATATAGC AGGGTTTACA 434 ACACTACCCA ATACATATCA TTCGATATTA AGAGTTTACT ACGAATAGTG TAAAAACCAT 8389 | |||| ||| |||||||||| ||| ||||| |||||||||| | ||| || | |||||||| ATACTA-CCA ATACATATCA TTCTCTATTA AGAGTTTACT ATGAA-AGCA TGAAAACCAT 492 AACCTACCTC CATCGAAGAT TAGTGATCAA GCAAGCAAA- -TTCCCCAAA GCTTTGTGTT 8447 |||||||||| || ||||||| |||||||||| ||||||||| || | | || ||||||| || AACCTACCTC CACCGAAGAT TAGTGATCAA GCAAGCAAAT TTTTCTCCAA GCTTTGT-TT 551 TTCCTCTTCT CGTTCGATCC TCTCTCTCT- TTTTGTTCTT TCTATTTTCT TTATT 8501 ||| ||||| |||||||| | | |||||| | |||||||| |||||||||| ||||| CTCC-CTTCT CGTTCGATTC TTCCTCTCTC TCTTGTTCTT TCTATTTTCT TTATT 605 hqPGS_C06HBa0112G05.1-6+_SGN-E347579- (8154 8501) ******************************************************************************** EST sequence 67 -strand 660 n (File: SGN-E349296-) 1 AATATTATCA ATACATATTA TTCGCTATTA AGAGCTTACT ACGAATATCG TAAGAGAAAC 61 CATAACCTAC CTCCACCGAA GATTCGTGAT CAAGCAAGTG ATTTCCCAAG CTTTGTGTTT 121 TTTCCTCTCG TTCGATCCTC TTTCTCGTTC GACTTTCTCT CTCTTTCTCT TGTTCTTTCT 181 ATTTTCTTTA TTCAAACCCT CTTTCTTTTA CCCTAATTAG TATATAATTA AGAATAAAAT 241 ATGGCAATAA TAACCCACTA ATTAACTTAA GGTTACCTCT TTTAACCCCC AAGTAATTAG 301 ACTTATTAAC ATTAACCCAC TAACTTTATA ATTAAAGCAG GAATAGTCAA AAACGTCCCT 361 TAAAACAATT GAGGAATTCC GACTCAGACT GGGATTTACG 
CAGCCTGTGA CAGCCCGTTG 421 TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC GCAAGGTTCA GAGACTGGAT TTTCACTGAA 481 GACTCTGTGA TGGTCCATCA CGCCTGTGAC GGTCCGTCTT GCCATTCCGT TACGAAGTTC 541 AGAGAGTCGA TTTTCAGTAC CCAATTTCAG ATTTCCTAAG TGTTTTGAAA TGAGACCCTG 601 CGACGGTCCG TCGTGCCCAT GATGGTCCGT CGTGGGGTCC GTCATTTCTG CCAGTTTTTC Predicted gene structure (within gDNA segment 4979 to 9840): Exon 1 6125 6147 ( 23 n); cDNA 119 141 ( 23 n); score: 0.696 Intron 1 6148 8453 (2306 n); Pd: 0.068 (s: 0), Pa: 0.000 (s: 0.78) Exon 2 8454 8966 ( 513 n); cDNA 142 658 ( 517 n); score: 0.873 MATCH C06HBa0112G05.1-6+ SGN-E349296- 0.873 536 0.812 C PGS_C06HBa0112G05.1-6+_SGN-E349296- (6125 6147,8454 8966) Alignment (genomic DNA sequence = upper lines): TTTTTCCCAT GGGTCTAACT TCTGTTAGTG GAGTATCTCC TTGTCTATAT ACCCTTCTTC 6184 ||||||| | | || | | ||| TTTTTCCTCT CGTTCGATCC TCT....... .......... .......... .......... 141 GTGGTATATT TCCTGTGCTA TTATCTGCAG TATATCCTTG TTCATTTGTT CTTTGAAATG 6244 .......... .......... .......... .......... .......... .......... 141 GACTACTGGA ACTTGCTGTT TCCATTAATT CTTTCTGACT TAGTATAGAT TCTAACTCAG 6304 .......... .......... .......... .......... .......... .......... 141 ATATTTCACT ATCATCACCT TGTATTTGTG TCATTATATT ATTACTTTTT TCTAATTTAT 6364 .......... .......... .......... .......... .......... .......... 141 CTTGTAAATC TTTTTCTAAT CTATATCCTT TATTATCAGT TCTTTGTTCA CATATTTTAA 6424 .......... .......... .......... .......... .......... .......... 141 AATGAGATAT TGTTTGTTCT AAATTTAAGT CTTTCAATTT ACATCTTTTA CATGTAAATA 6484 .......... .......... .......... .......... .......... .......... 141 ATCTATTACT ATTGTTTCTA CTTATTTCTT TAACTTTTTC TATAGCCAAA TCTACTGCAA 6544 .......... .......... .......... .......... .......... .......... 141 ATTCTTCTTC TAATATTATT TCCATGTTCA TTCCTTCTAA AGGCATTTCT TCTACAACGC 6604 .......... .......... .......... .......... .......... .......... 141 TTTCTTCTTC AAAGTCTTTT TGATCTTCAT AATTGTAATC TGTAAATCTA ATTGAAGGTT 6664 .......... .......... .......... .......... .......... 
.......... 141 TTCCTTTAGA GTTTGTATAT ATTAAATGAG CATCTGGTGT TAATGCTTCT TTTTCTTTAA 6724 .......... .......... .......... .......... .......... .......... 141 AATCTCCTAA TTTTCACTCT AATCCTACAT ATTCTTCAGA GTCTATTTTT ATCGGTTTTA 6784 .......... .......... .......... .......... .......... .......... 141 TTAACTTTAT CCCTTTATTT CCCATTACTT CTACTACGTC TTTTATTTGT AGTTTAAATC 6844 .......... .......... .......... .......... .......... .......... 141 TAGTATTACT ATTATCTGTC ATTTTTTCTA GAAATCCTAC ACACATTAAT AAATTCTTAC 6904 .......... .......... .......... .......... .......... .......... 141 CACTATGCAT TTCTTCATAT CCTTTAGTTT GTATTCTTAT TCTTATTTGG GTTCCAAATT 6964 .......... .......... .......... .......... .......... .......... 141 CATTTAGATT CATCATAAAA TCTGGGCTTA TGTAGAAAAT TCCTCCATTA TTAGTCATAT 7024 .......... .......... .......... .......... .......... .......... 141 CTACTTCTGT TAATTATATA ATTGTCTTTT TTATGTCTGA CCATCTATTA TCATATATTG 7084 .......... .......... .......... .......... .......... .......... 141 TAATTAATGT CTTTGTTCCT AAGTTTTTCC TCGTTAATCC TTTAATTCCT ATAACTATTA 7144 .......... .......... .......... .......... .......... .......... 141 GTCCTATATG CATTAGATCT TTTTTAACGT ATTTTATTTC TTTTACAGAT TCAGCGTTTA 7204 .......... .......... .......... .......... .......... .......... 141 TGAGTGGTAA CGATACAGAG CCTTTTCCTA ACTGTCACGA TCCAAATCGG GCCGCGACTA 7264 .......... .......... .......... .......... .......... .......... 141 GCACCCACAC TTACCCTCCT ATGTGAGCGA ACCAACCAAT CCAAACCCCA ACATTTTCAA 7324 .......... .......... .......... .......... .......... .......... 141 ACATAGTAAC AGAATATAAT GCGGAAGACT TAAACTCATT AATGAAAATC AATTAAATAA 7384 .......... .......... .......... .......... .......... .......... 141 CTTCTAAAAA CTCAACAACT ATTATTATCC CCAAAATCTG GAAGTCATCA TCACAAGAAC 7444 .......... .......... .......... .......... .......... .......... 141 ATCTACTTCA AATTACTAAA TCTAAGATTA TCTAAGAAGC TAAAATACAT AAACAGCTAG 7504 .......... .......... .......... .......... .......... .......... 
141 TCCATGCCGG AACTTCAAGG CATCAAGACA TGAAGAGGAG GATCCAGTCC AAGCTAGAAG 7564 .......... .......... .......... .......... .......... .......... 141 CATTAGCTCA CCCTGAAATC CGGAGTAATG AAGACTGGCT AGATTTGCGG TTGAGTTGAA 7624 .......... .......... .......... .......... .......... .......... 141 GACGACAGAA CGTTTGCTGC ACTCCACAAA TAATCAAAAA GAAAACATAC AAGTAGGGGT 7684 .......... .......... .......... .......... .......... .......... 141 CAGTACAAAA CACAGGTACT GAGTAGATAT CATCGGCCAA CTCAAAATAG AAAACAGTAT 7744 .......... .......... .......... .......... .......... .......... 141 ATATCAGATA ATATCATAAA ATCAACTACA GTACTCAACA TGCGGCATTT ACAATTACCA 7804 .......... .......... .......... .......... .......... .......... 141 TAACCCTTGG TCGCAACACC AAGCTCATCA ATGAGGACTC ATGCCTCCCC ATCATACTCA 7864 .......... .......... .......... .......... .......... .......... 141 TTTGGGAATT AAGTTCCTTA AATTGAGTAT ATTAACATAT TTCAAGATTC ATTCTCTTTA 7924 .......... .......... .......... .......... .......... .......... 141 CTAATCCTGG TGTCAGAACG TGACACCCGA TCCATATATA CTATCCTGGT ACCGGAACGT 7984 .......... .......... .......... .......... .......... .......... 141 GGCACCCGAT CCATATTCTA TCCTGGTGTC GGAACGTGAC ACTCCGATCC TCATATACTA 8044 .......... .......... .......... .......... .......... .......... 141 TCCTGGTACC GGAACGTGGC ACCCGATCCA TATTCTATCC TGGTGTCGGA ACGTGACACT 8104 .......... .......... .......... .......... .......... .......... 141 CCGATCCTCA TATACTATCC TGGTACCGGA ACGTGACACC CGATCCCCTA ATCTCACTAC 8164 .......... .......... .......... .......... .......... .......... 141 TTTCGTTCAT CAAGCCTTCT TGTATACTAA GGCATCATCA TTAACAAAGT AGATTAGGGT 8224 .......... .......... .......... .......... .......... .......... 141 TTCTTTTTCA AGATTTAGAA TTCAATAGCT TCATCATGCT TATCTCATCA CAATTATATA 8284 .......... .......... .......... .......... .......... .......... 141 ATCACAATAT GCAAACACAC AATTAAGCAT ATAGAAGGGT TTACAACACT ACCCAATACA 8344 .......... .......... .......... .......... .......... .......... 
141 TATCATTCGA TATTAAGAGT TTACTACGAA TAGTGTAAAA ACCATAACCT ACCTCCATCG 8404 .......... .......... .......... .......... .......... .......... 141 AAGATTAGTG ATCAAGCAAG CAAATTCCCC AAAGCTTTGT GTTTTCCTCT TCTCGTTCGA 8464 | |||||||||| .......... .......... .......... .......... .........T TCTCGTTCGA 152 --TCCTCTCT CTCT-TTTTG TTCTTTCTAT TTTCTTTATT CAAACCCTCT TTCTTTTACC 8521 | |||||| || | | ||| |||||||||| |||||||||| |||||||||| |||||||||| CTTTCTCTCT CTTTCTCTTG TTCTTTCTAT TTTCTTTATT CAAACCCTCT TTCTTTTACC 212 CTAATTAGCA TATAATTAAG AATAAAAGAT GGCAATAATA ACCCACTAAT TTACTCAAGG 8581 |||||||| | |||||||||| ||||||| || |||||||||| |||||||||| | ||| |||| CTAATTAGTA TATAATTAAG AATAAAATAT GGCAATAATA ACCCACTAAT TAACTTAAGG 272 TTACCTTTTT TAACCCCCAA GTAATTAGAC TTATTAACAT TAACCCACTA ACTTTATAAT 8641 |||||| ||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTACCTCTTT TAACCCCCAA GTAATTAGAC TTATTAACAT TAACCCACTA ACTTTATAAT 332 TAAAGCAGGA ATAGTAAAAA ACGTCCCTTA AAAC-ATTAA AGAAATCCGA CTCAGCCTGG 8700 |||||||||| ||||| |||| |||||||||| |||| ||| | ||| ||||| ||||| |||| TAAAGCAGGA ATAGTCAAAA ACGTCCCTTA AAACAATTGA GGAATTCCGA CTCAGACTGG 392 GA-TTATGCA GCCTGTGACG ACTCGTCGTG CCTGCGACGG TCCGTCTTGC TGCTCCGTCA 8759 || ||| ||| ||||||||| | ||| ||| |||||||||| |||||| ||| | | |||| GATTTACGCA GCCTGTGACA GCCCGTTGTG CCTGCGACGG TCCGTCCTGC AGGT-CGTCG 451 CAGAGTTCAG AGACTCAATT TCCCTTAAAG AGTCTGTGAC GGTCCGTCAC GCCTGTGACG 8819 || |||||| ||||| ||| | | | ||| | ||||||| ||||| |||| |||||||||| CAAGGTTCAG AGACTGGATT TTCACTGAAG ACTCTGTGAT GGTCCATCAC GCCTGTGACG 511 GTCCGTCCTG CCATTCCGTT ACAAAGTTCA GAGAGTCGA- TTTCAGTACC CATTTTTCAG 8878 ||||||| || |||||||||| || ||||||| ||||||||| |||||||||| || |||||| GTCCGTCTTG CCATTCCGTT ACGAAGTTCA GAGAGTCGAT TTTCAGTACC CA-ATTTCAG 570 AATTTCTAAG TGTTTTGAAA CGAGACCCCT CGACGGTCCG TCGTGCCCAT GACGGTCCGT 8938 | || ||||| |||||||||| ||||||| |||||||||| |||||||||| || ||||||| ATTTCCTAAG TGTTTTGAAA TGAGACCCTG CGACGGTCCG TCGTGCCCAT GATGGTCCGT 630 CGTGGGATCC GTCGTCTCAA CCATTTTT 8966 |||||| ||| ||| | || ||| 
|||| CGTGGGGTCC GTCATTTCTG CCAGTTTT 658 hqPGS_C06HBa0112G05.1-6+_SGN-E349296- (8454 8966) ******************************************************************************** EST sequence 4 +strand 729 n (File: SGN-E550212+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCAA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA AAAAAACTCG 721 GGGGGGGGC Predicted gene structure (within gDNA segment 7362 to 9840): Exon 1 8457 9014 ( 558 n); cDNA 2 558 ( 557 n); score: 0.888 Intron 1 9015 9793 ( 779 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0.69) Exon 2 9794 9837 ( 44 n); cDNA 559 601 ( 43 n); score: 0.693 PPA cDNA 699 716 MATCH C06HBa0112G05.1-6+ SGN-E550212+ 0.888 602 0.826 C PGS_C06HBa0112G05.1-6+_SGN-E550212+ (8457 9014,9794 9837) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT TTTTGTTCTT TCTATTTTCT TTATTCAAAC CCTCTTTCTT 8516 ||| | | ||||||||| || ||||||| | | |||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGCAA TAATAACCCA CTAATTTACT 8576 |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| |||||||||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 CAAGGTTACC TTTTTTAACC CCCAAGTAAT TAGACTTATT AACATTAACC CACTAACTTT 8636 |||||||||| | 
|||||||| |||| ||||| |||||||||| ||||| |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT AAAAAACGTC CCTTAAAACA T-TAAAGAAA TCCGACTCAG 8695 |||||||||| ||||||||| |||||||| ||||||||| | |||||||| |||||| ||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA TGCAGCCTGT GACGACTCGT CGTGCCTGCG ACGGTCCGTC TTGCTGCTCC 8755 ||||||||| ||| ||||| || | | ||| |||||||||| |||||||||| ||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCAGAGACTC AATTTCCCTT AAAGAGTCTG TGACGGTCCG TCACGCCTGT 8815 ||| || || |||||||||| ||||||| |||||||||| |||||||||| ||||||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 GACGGTCCGT CCTGCCATTC CGTTACAAAG TTCAGAGAGT CGA-TTTCAG TACCCATTTT 8874 |||||||||| | |||||||| |||||| ||| |||||||||| ||| ||| || |||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCA-ATT 417 TCAGAATTTC TAAGTGTTTT GAAACGAGAC CCCTCGACGG TCCGTCGTGC CCATGACGGT 8934 |||||||||| |||||||||| |||||||||| ||||||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTGGG ATCCGTCGTC TCAACC-ATT TTTCCAGAAA TAACATTTGT TGCTCAAAAT 8993 |||||||||| ||||||||| |||||| || |||||| ||| ||| || || | ||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTCGTTACAC TAACACTGAT AAATGTTCTT CTCTATAATG TCTATATAGT 9053 |||||||||| ||||||||| | GACTAAACAG GTCGTTACAT T......... .......... .......... .......... 558 TGAGATTTTG AATTTGTATT GTATAAAACT TTGATATTCA ATAAATTTTA TTGATTTTGT 9113 .......... .......... .......... .......... .......... .......... 558 TGAAAGATTT GATATCCTTT TCTGTATCTA TTATTTCTCC TAATTGTTGA TTATTCTCTT 9173 .......... .......... .......... .......... .......... .......... 558 CCTTTGTCCT TTTATTTAAT ATTTTTGATA GAAAGTTATT ACTTATCATA TTTTTGGAAT 9233 .......... .......... .......... .......... .......... .......... 
558 AGCTTGGTCT TGGTATTTCT TCTCTTGATC TAGTTGAAAA AATATCCATT TTTTCTGTTT 9293 .......... .......... .......... .......... .......... .......... 558 TTTAATTTTT TTTCTTTTTG GGGAGTAATT TCTATATTAT TCAGTTTTTC TATTAATTCT 9353 .......... .......... .......... .......... .......... .......... 558 TCAGTTTTTC TATTAATTCT TCTATTCCGG TTGTATCTAT ATTTTTACTT TCTAATTTTT 9413 .......... .......... .......... .......... .......... .......... 558 CTTTAAAGTT TTTATCTATC TTATTATTTA GTTGTAATAA AAGTGATATT ATATTATTTT 9473 .......... .......... .......... .......... .......... .......... 558 GTTGTATAAT AATTTGATCT GATTTTGAAG ATGGACTAAA TTCTTTGTAA TCTGATGGGA 9533 .......... .......... .......... .......... .......... .......... 558 GGGCTATTCT TTTTTCTATT AGTTTTGTTG CTTCTTTATG TGTTTCTGAT TCTACTAAGT 9593 .......... .......... .......... .......... .......... .......... 558 CTATCATGAA AGAACTATTT CTTTAATTGA TTCTAATTCT GTCTTTATTT CTTGTATTTT 9653 .......... .......... .......... .......... .......... .......... 558 TATTACTAAT GTATTATTAA TTTCTTCTTT AGATTTATTT TTTAACTCGT TTAATTTTTT 9713 .......... .......... .......... .......... .......... .......... 558 ATTTAAATCT ATTCCTTGTT CTTTTAGATA ATCTAAGTTA TTTTCTATCT TACTAAAAAA 9773 .......... .......... .......... .......... .......... .......... 558 CTGTTTTTGC TTATTTTGTG TTTCTTCTTG TTCACTT-AA TATTTTTATA ATTTTATTGT 9832 | | ||||| | |||| || |||| |||| |||||| | .......... .......... 
TATGTTCTTC CT-ACTTAAA TATTATTATT ATTTTA-CGA 596 TTGAT 9837 || || TTTAT 601 hqPGS_C06HBa0112G05.1-6+_SGN-E550212+ (8457 9014) ******************************************************************************** EST sequence 5 +strand 710 n (File: SGN-E550065+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCAA CCATGAATTA ATGAAAAAAT TATGCCATGA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTGATTCAT AAGAAAAAAA Predicted gene structure (within gDNA segment 7362 to 9840): Exon 1 8457 9014 ( 558 n); cDNA 2 558 ( 557 n); score: 0.888 Intron 1 9015 9793 ( 779 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0.69) Exon 2 9794 9837 ( 44 n); cDNA 559 601 ( 43 n); score: 0.693 MATCH C06HBa0112G05.1-6+ SGN-E550065+ 0.888 602 0.848 C PGS_C06HBa0112G05.1-6+_SGN-E550065+ (8457 9014,9794 9837) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT TTTTGTTCTT TCTATTTTCT TTATTCAAAC CCTCTTTCTT 8516 ||| | | ||||||||| || ||||||| | | |||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGCAA TAATAACCCA CTAATTTACT 8576 |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| |||||||||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 CAAGGTTACC TTTTTTAACC CCCAAGTAAT TAGACTTATT AACATTAACC CACTAACTTT 8636 |||||||||| | |||||||| 
|||| ||||| |||||||||| ||||| |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT AAAAAACGTC CCTTAAAACA T-TAAAGAAA TCCGACTCAG 8695 |||||||||| ||||||||| |||||||| ||||||||| | |||||||| |||||| ||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA TGCAGCCTGT GACGACTCGT CGTGCCTGCG ACGGTCCGTC TTGCTGCTCC 8755 ||||||||| ||| ||||| || | | ||| |||||||||| |||||||||| ||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCAGAGACTC AATTTCCCTT AAAGAGTCTG TGACGGTCCG TCACGCCTGT 8815 ||| || || |||||||||| ||||||| |||||||||| |||||||||| ||||||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 GACGGTCCGT CCTGCCATTC CGTTACAAAG TTCAGAGAGT CGA-TTTCAG TACCCATTTT 8874 |||||||||| | |||||||| |||||| ||| |||||||||| ||| ||| || |||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCA-ATT 417 TCAGAATTTC TAAGTGTTTT GAAACGAGAC CCCTCGACGG TCCGTCGTGC CCATGACGGT 8934 |||||||||| |||||||||| |||||||||| ||||||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTGGG ATCCGTCGTC TCAACC-ATT TTTCCAGAAA TAACATTTGT TGCTCAAAAT 8993 |||||||||| ||||||||| |||||| || |||||| ||| ||| || || | ||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTCGTTACAC TAACACTGAT AAATGTTCTT CTCTATAATG TCTATATAGT 9053 |||||||||| ||||||||| | GACTAAACAG GTCGTTACAT T......... .......... .......... .......... 558 TGAGATTTTG AATTTGTATT GTATAAAACT TTGATATTCA ATAAATTTTA TTGATTTTGT 9113 .......... .......... .......... .......... .......... .......... 558 TGAAAGATTT GATATCCTTT TCTGTATCTA TTATTTCTCC TAATTGTTGA TTATTCTCTT 9173 .......... .......... .......... .......... .......... .......... 558 CCTTTGTCCT TTTATTTAAT ATTTTTGATA GAAAGTTATT ACTTATCATA TTTTTGGAAT 9233 .......... .......... .......... .......... .......... .......... 
558 AGCTTGGTCT TGGTATTTCT TCTCTTGATC TAGTTGAAAA AATATCCATT TTTTCTGTTT 9293 .......... .......... .......... .......... .......... .......... 558 TTTAATTTTT TTTCTTTTTG GGGAGTAATT TCTATATTAT TCAGTTTTTC TATTAATTCT 9353 .......... .......... .......... .......... .......... .......... 558 TCAGTTTTTC TATTAATTCT TCTATTCCGG TTGTATCTAT ATTTTTACTT TCTAATTTTT 9413 .......... .......... .......... .......... .......... .......... 558 CTTTAAAGTT TTTATCTATC TTATTATTTA GTTGTAATAA AAGTGATATT ATATTATTTT 9473 .......... .......... .......... .......... .......... .......... 558 GTTGTATAAT AATTTGATCT GATTTTGAAG ATGGACTAAA TTCTTTGTAA TCTGATGGGA 9533 .......... .......... .......... .......... .......... .......... 558 GGGCTATTCT TTTTTCTATT AGTTTTGTTG CTTCTTTATG TGTTTCTGAT TCTACTAAGT 9593 .......... .......... .......... .......... .......... .......... 558 CTATCATGAA AGAACTATTT CTTTAATTGA TTCTAATTCT GTCTTTATTT CTTGTATTTT 9653 .......... .......... .......... .......... .......... .......... 558 TATTACTAAT GTATTATTAA TTTCTTCTTT AGATTTATTT TTTAACTCGT TTAATTTTTT 9713 .......... .......... .......... .......... .......... .......... 558 ATTTAAATCT ATTCCTTGTT CTTTTAGATA ATCTAAGTTA TTTTCTATCT TACTAAAAAA 9773 .......... .......... .......... .......... .......... .......... 558 CTGTTTTTGC TTATTTTGTG TTTCTTCTTG TTCACTT-AA TATTTTTATA ATTTTATTGT 9832 | | ||||| | |||| || |||| |||| |||||| | .......... .......... 
TATGTTCTTC CT-ACTTAAA TATTATTATT ATTTTA-CGA 596 TTGAT 9837 || || TTTAT 601 hqPGS_C06HBa0112G05.1-6+_SGN-E550065+ (8457 9014) ******************************************************************************** EST sequence 7 +strand 732 n (File: SGN-E550201+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCNA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA AAAAAAAACT 721 CGAGGGGGGG CC Predicted gene structure (within gDNA segment 7362 to 9840): Exon 1 8457 9014 ( 558 n); cDNA 2 558 ( 557 n); score: 0.888 Intron 1 9015 9793 ( 779 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0.69) Exon 2 9794 9837 ( 44 n); cDNA 559 601 ( 43 n); score: 0.693 PPA cDNA 699 718 MATCH C06HBa0112G05.1-6+ SGN-E550201+ 0.888 602 0.822 C PGS_C06HBa0112G05.1-6+_SGN-E550201+ (8457 9014,9794 9837) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT TTTTGTTCTT TCTATTTTCT TTATTCAAAC CCTCTTTCTT 8516 ||| | | ||||||||| || ||||||| | | |||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGCAA TAATAACCCA CTAATTTACT 8576 |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| |||||||||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 CAAGGTTACC TTTTTTAACC CCCAAGTAAT TAGACTTATT 
AACATTAACC CACTAACTTT 8636 |||||||||| | |||||||| |||| ||||| |||||||||| ||||| |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT AAAAAACGTC CCTTAAAACA T-TAAAGAAA TCCGACTCAG 8695 |||||||||| ||||||||| |||||||| ||||||||| | |||||||| |||||| ||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA TGCAGCCTGT GACGACTCGT CGTGCCTGCG ACGGTCCGTC TTGCTGCTCC 8755 ||||||||| ||| ||||| || | | ||| |||||||||| |||||||||| ||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCAGAGACTC AATTTCCCTT AAAGAGTCTG TGACGGTCCG TCACGCCTGT 8815 ||| || || |||||||||| ||||||| |||||||||| |||||||||| ||||||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 GACGGTCCGT CCTGCCATTC CGTTACAAAG TTCAGAGAGT CGA-TTTCAG TACCCATTTT 8874 |||||||||| | |||||||| |||||| ||| |||||||||| ||| ||| || |||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCA-ATT 417 TCAGAATTTC TAAGTGTTTT GAAACGAGAC CCCTCGACGG TCCGTCGTGC CCATGACGGT 8934 |||||||||| |||||||||| |||||||||| ||||||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTGGG ATCCGTCGTC TCAACC-ATT TTTCCAGAAA TAACATTTGT TGCTCAAAAT 8993 |||||||||| ||||||||| |||||| || |||||| ||| ||| || || | ||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTCGTTACAC TAACACTGAT AAATGTTCTT CTCTATAATG TCTATATAGT 9053 |||||||||| ||||||||| | GACTAAACAG GTCGTTACAT T......... .......... .......... .......... 558 TGAGATTTTG AATTTGTATT GTATAAAACT TTGATATTCA ATAAATTTTA TTGATTTTGT 9113 .......... .......... .......... .......... .......... .......... 558 TGAAAGATTT GATATCCTTT TCTGTATCTA TTATTTCTCC TAATTGTTGA TTATTCTCTT 9173 .......... .......... .......... .......... .......... .......... 558 CCTTTGTCCT TTTATTTAAT ATTTTTGATA GAAAGTTATT ACTTATCATA TTTTTGGAAT 9233 .......... .......... .......... .......... .......... .......... 
558 AGCTTGGTCT TGGTATTTCT TCTCTTGATC TAGTTGAAAA AATATCCATT TTTTCTGTTT 9293 .......... .......... .......... .......... .......... .......... 558 TTTAATTTTT TTTCTTTTTG GGGAGTAATT TCTATATTAT TCAGTTTTTC TATTAATTCT 9353 .......... .......... .......... .......... .......... .......... 558 TCAGTTTTTC TATTAATTCT TCTATTCCGG TTGTATCTAT ATTTTTACTT TCTAATTTTT 9413 .......... .......... .......... .......... .......... .......... 558 CTTTAAAGTT TTTATCTATC TTATTATTTA GTTGTAATAA AAGTGATATT ATATTATTTT 9473 .......... .......... .......... .......... .......... .......... 558 GTTGTATAAT AATTTGATCT GATTTTGAAG ATGGACTAAA TTCTTTGTAA TCTGATGGGA 9533 .......... .......... .......... .......... .......... .......... 558 GGGCTATTCT TTTTTCTATT AGTTTTGTTG CTTCTTTATG TGTTTCTGAT TCTACTAAGT 9593 .......... .......... .......... .......... .......... .......... 558 CTATCATGAA AGAACTATTT CTTTAATTGA TTCTAATTCT GTCTTTATTT CTTGTATTTT 9653 .......... .......... .......... .......... .......... .......... 558 TATTACTAAT GTATTATTAA TTTCTTCTTT AGATTTATTT TTTAACTCGT TTAATTTTTT 9713 .......... .......... .......... .......... .......... .......... 558 ATTTAAATCT ATTCCTTGTT CTTTTAGATA ATCTAAGTTA TTTTCTATCT TACTAAAAAA 9773 .......... .......... .......... .......... .......... .......... 558 CTGTTTTTGC TTATTTTGTG TTTCTTCTTG TTCACTT-AA TATTTTTATA ATTTTATTGT 9832 | | ||||| | |||| || |||| |||| |||||| | .......... .......... 
TATGTTCTTC CT-ACTTAAA TATTATTATT ATTTTA-CGA 596 TTGAT 9837 || || TTTAT 601 hqPGS_C06HBa0112G05.1-6+_SGN-E550201+ (8457 9014) ******************************************************************************** EST sequence 8 +strand 709 n (File: SGN-E550207+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTNCA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAA Predicted gene structure (within gDNA segment 7362 to 9840): Exon 1 8457 9014 ( 558 n); cDNA 2 558 ( 557 n); score: 0.888 Intron 1 9015 9793 ( 779 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0.69) Exon 2 9794 9837 ( 44 n); cDNA 559 601 ( 43 n); score: 0.693 PPA cDNA 699 709 MATCH C06HBa0112G05.1-6+ SGN-E550207+ 0.888 602 0.849 C PGS_C06HBa0112G05.1-6+_SGN-E550207+ (8457 9014,9794 9837) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT TTTTGTTCTT TCTATTTTCT TTATTCAAAC CCTCTTTCTT 8516 ||| | | ||||||||| || ||||||| | | |||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGCAA TAATAACCCA CTAATTTACT 8576 |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| |||||||||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 CAAGGTTACC TTTTTTAACC CCCAAGTAAT TAGACTTATT AACATTAACC CACTAACTTT 8636 
|||||||||| | |||||||| |||| ||||| |||||||||| ||||| |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT AAAAAACGTC CCTTAAAACA T-TAAAGAAA TCCGACTCAG 8695 |||||||||| ||||||||| |||||||| ||||||||| | |||||||| |||||| ||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA TGCAGCCTGT GACGACTCGT CGTGCCTGCG ACGGTCCGTC TTGCTGCTCC 8755 ||||||||| ||| ||||| || | | ||| |||||||||| |||||||||| ||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCAGAGACTC AATTTCCCTT AAAGAGTCTG TGACGGTCCG TCACGCCTGT 8815 ||| || || |||||||||| ||||||| |||||||||| |||||||||| ||||||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 GACGGTCCGT CCTGCCATTC CGTTACAAAG TTCAGAGAGT CGA-TTTCAG TACCCATTTT 8874 |||||||||| | |||||||| |||||| ||| |||||||||| ||| ||| || |||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCA-ATT 417 TCAGAATTTC TAAGTGTTTT GAAACGAGAC CCCTCGACGG TCCGTCGTGC CCATGACGGT 8934 |||||||||| |||||||||| |||||||||| ||||||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTGGG ATCCGTCGTC TCAACC-ATT TTTCCAGAAA TAACATTTGT TGCTCAAAAT 8993 |||||||||| ||||||||| |||||| || |||||| ||| ||| || || | ||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTCGTTACAC TAACACTGAT AAATGTTCTT CTCTATAATG TCTATATAGT 9053 |||||||||| ||||||||| | GACTAAACAG GTCGTTACAT T......... .......... .......... .......... 558 TGAGATTTTG AATTTGTATT GTATAAAACT TTGATATTCA ATAAATTTTA TTGATTTTGT 9113 .......... .......... .......... .......... .......... .......... 558 TGAAAGATTT GATATCCTTT TCTGTATCTA TTATTTCTCC TAATTGTTGA TTATTCTCTT 9173 .......... .......... .......... .......... .......... .......... 558 CCTTTGTCCT TTTATTTAAT ATTTTTGATA GAAAGTTATT ACTTATCATA TTTTTGGAAT 9233 .......... .......... .......... .......... .......... .......... 
558 AGCTTGGTCT TGGTATTTCT TCTCTTGATC TAGTTGAAAA AATATCCATT TTTTCTGTTT 9293 .......... .......... .......... .......... .......... .......... 558 TTTAATTTTT TTTCTTTTTG GGGAGTAATT TCTATATTAT TCAGTTTTTC TATTAATTCT 9353 .......... .......... .......... .......... .......... .......... 558 TCAGTTTTTC TATTAATTCT TCTATTCCGG TTGTATCTAT ATTTTTACTT TCTAATTTTT 9413 .......... .......... .......... .......... .......... .......... 558 CTTTAAAGTT TTTATCTATC TTATTATTTA GTTGTAATAA AAGTGATATT ATATTATTTT 9473 .......... .......... .......... .......... .......... .......... 558 GTTGTATAAT AATTTGATCT GATTTTGAAG ATGGACTAAA TTCTTTGTAA TCTGATGGGA 9533 .......... .......... .......... .......... .......... .......... 558 GGGCTATTCT TTTTTCTATT AGTTTTGTTG CTTCTTTATG TGTTTCTGAT TCTACTAAGT 9593 .......... .......... .......... .......... .......... .......... 558 CTATCATGAA AGAACTATTT CTTTAATTGA TTCTAATTCT GTCTTTATTT CTTGTATTTT 9653 .......... .......... .......... .......... .......... .......... 558 TATTACTAAT GTATTATTAA TTTCTTCTTT AGATTTATTT TTTAACTCGT TTAATTTTTT 9713 .......... .......... .......... .......... .......... .......... 558 ATTTAAATCT ATTCCTTGTT CTTTTAGATA ATCTAAGTTA TTTTCTATCT TACTAAAAAA 9773 .......... .......... .......... .......... .......... .......... 558 CTGTTTTTGC TTATTTTGTG TTTCTTCTTG TTCACTT-AA TATTTTTATA ATTTTATTGT 9832 | | ||||| | |||| || |||| |||| |||||| | .......... .......... 
TATGTTCTTC CT-ACTTAAA TATTATTATT ATTTTA-CGA 596 TTGAT 9837 || || TTTAT 601 hqPGS_C06HBa0112G05.1-6+_SGN-E550207+ (8457 9014) ******************************************************************************** EST sequence 9 +strand 715 n (File: SGN-E550335+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAATCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT 481 CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT CAAAACGACT 541 AAACAGGTCG TTACATTTAT GTTCTTCCTA CTTAAATATT ATTATTATTT TACGATTTAT 601 AACACTATTA GAAACAAAGA TTTTCTCNAC CATGAATTAA TGAAAAAATT ATGCCATAAA 661 ATATAAAAAA TTTACTCATT TTTCATTGAG CTAATTCATA AAAAAAAAAA AAAAA Predicted gene structure (within gDNA segment 7372 to 9840): Exon 1 8457 9014 ( 558 n); cDNA 2 557 ( 556 n); score: 0.884 Intron 1 9015 9793 ( 779 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0.69) Exon 2 9794 9837 ( 44 n); cDNA 558 600 ( 43 n); score: 0.693 PPA cDNA 698 715 MATCH C06HBa0112G05.1-6+ SGN-E550335+ 0.884 602 0.842 C PGS_C06HBa0112G05.1-6+_SGN-E550335+ (8457 9014,9794 9837) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT TTTTGTTCTT TCTATTTTCT TTATTCAAAC CCTCTTTCTT 8516 ||| | | ||||||||| || ||||||| | | |||| |||||| ||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTC-AAC CCTCCTTCTT 59 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGCAA TAATAACCCA CTAATTTACT 8576 |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| |||||||||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 118 CAAGGTTACC TTTTTTAACC CCCAAGTAAT TAGACTTATT AACATTAACC CACTAACTTT 8636 
|||||||||| | |||||||| |||| ||||| |||||||||| ||||| |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 178 ATAATTAAAG CAGGAATAGT AAAAAACGTC CCTTAAAACA T-TAAAGAAA TCCGACTCAG 8695 |||||||||| ||||||||| |||||||| ||||||||| | |||||||| |||||| ||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 238 CCTGGGATTA TGCAGCCTGT GACGACTCGT CGTGCCTGCG ACGGTCCGTC TTGCTGCTCC 8755 ||||||||| ||| ||||| || | | ||| |||||||||| |||||||||| ||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 297 GTCACAGAGT TCAGAGACTC AATTTCCCTT AAAGAGTCTG TGACGGTCCG TCACGCCTGT 8815 ||| || || |||||||||| ||||||| ||||| |||| |||||||||| ||||||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAATCTG TGACGGTCCG TCACGCCCGT 357 GACGGTCCGT CCTGCCATTC CGTTACAAAG TTCAGAGAGT CGA-TTTCAG TACCCATTTT 8874 |||||||||| | |||||||| |||||| ||| |||||||||| ||| ||| || |||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCA-ATT 416 TCAGAATTTC TAAGTGTTTT GAAACGAGAC CCCTCGACGG TCCGTCGTGC CCATGACGGT 8934 |||||||||| |||||||||| |||||||||| ||||||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 476 CCGTCGTGGG ATCCGTCGTC TCAACC-ATT TTTCCAGAAA TAACATTTGT TGCTCAAAAT 8993 |||||||||| ||||||||| |||||| || |||||| ||| ||| || || | ||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 536 GACTAAACAG GTCGTTACAC TAACACTGAT AAATGTTCTT CTCTATAATG TCTATATAGT 9053 |||||||||| ||||||||| | GACTAAACAG GTCGTTACAT T......... .......... .......... .......... 557 TGAGATTTTG AATTTGTATT GTATAAAACT TTGATATTCA ATAAATTTTA TTGATTTTGT 9113 .......... .......... .......... .......... .......... .......... 557 TGAAAGATTT GATATCCTTT TCTGTATCTA TTATTTCTCC TAATTGTTGA TTATTCTCTT 9173 .......... .......... .......... .......... .......... .......... 557 CCTTTGTCCT TTTATTTAAT ATTTTTGATA GAAAGTTATT ACTTATCATA TTTTTGGAAT 9233 .......... .......... .......... .......... .......... .......... 
557 AGCTTGGTCT TGGTATTTCT TCTCTTGATC TAGTTGAAAA AATATCCATT TTTTCTGTTT 9293 .......... .......... .......... .......... .......... .......... 557 TTTAATTTTT TTTCTTTTTG GGGAGTAATT TCTATATTAT TCAGTTTTTC TATTAATTCT 9353 .......... .......... .......... .......... .......... .......... 557 TCAGTTTTTC TATTAATTCT TCTATTCCGG TTGTATCTAT ATTTTTACTT TCTAATTTTT 9413 .......... .......... .......... .......... .......... .......... 557 CTTTAAAGTT TTTATCTATC TTATTATTTA GTTGTAATAA AAGTGATATT ATATTATTTT 9473 .......... .......... .......... .......... .......... .......... 557 GTTGTATAAT AATTTGATCT GATTTTGAAG ATGGACTAAA TTCTTTGTAA TCTGATGGGA 9533 .......... .......... .......... .......... .......... .......... 557 GGGCTATTCT TTTTTCTATT AGTTTTGTTG CTTCTTTATG TGTTTCTGAT TCTACTAAGT 9593 .......... .......... .......... .......... .......... .......... 557 CTATCATGAA AGAACTATTT CTTTAATTGA TTCTAATTCT GTCTTTATTT CTTGTATTTT 9653 .......... .......... .......... .......... .......... .......... 557 TATTACTAAT GTATTATTAA TTTCTTCTTT AGATTTATTT TTTAACTCGT TTAATTTTTT 9713 .......... .......... .......... .......... .......... .......... 557 ATTTAAATCT ATTCCTTGTT CTTTTAGATA ATCTAAGTTA TTTTCTATCT TACTAAAAAA 9773 .......... .......... .......... .......... .......... .......... 557 CTGTTTTTGC TTATTTTGTG TTTCTTCTTG TTCACTT-AA TATTTTTATA ATTTTATTGT 9832 | | ||||| | |||| || |||| |||| |||||| | .......... .......... 
TATGTTCTTC CT-ACTTAAA TATTATTATT ATTTTA-CGA 595 TTGAT 9837 || || TTTAT 600 hqPGS_C06HBa0112G05.1-6+_SGN-E550335+ (8457 9014) ******************************************************************************** EST sequence 10 +strand 714 n (File: SGN-E390013+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACNAAG ATTTTCTCAA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA AAAA Predicted gene structure (within gDNA segment 7362 to 9840): Exon 1 8457 9014 ( 558 n); cDNA 2 558 ( 557 n); score: 0.888 Intron 1 9015 9793 ( 779 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0.69) Exon 2 9794 9837 ( 44 n); cDNA 559 601 ( 43 n); score: 0.693 PPA cDNA 699 714 MATCH C06HBa0112G05.1-6+ SGN-E390013+ 0.888 602 0.843 C PGS_C06HBa0112G05.1-6+_SGN-E390013+ (8457 9014,9794 9837) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT TTTTGTTCTT TCTATTTTCT TTATTCAAAC CCTCTTTCTT 8516 ||| | | ||||||||| || ||||||| | | |||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGCAA TAATAACCCA CTAATTTACT 8576 |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| |||||||||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 CAAGGTTACC TTTTTTAACC CCCAAGTAAT TAGACTTATT AACATTAACC CACTAACTTT 8636 
|||||||||| | |||||||| |||| ||||| |||||||||| ||||| |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT AAAAAACGTC CCTTAAAACA T-TAAAGAAA TCCGACTCAG 8695 |||||||||| ||||||||| |||||||| ||||||||| | |||||||| |||||| ||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA TGCAGCCTGT GACGACTCGT CGTGCCTGCG ACGGTCCGTC TTGCTGCTCC 8755 ||||||||| ||| ||||| || | | ||| |||||||||| |||||||||| ||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCAGAGACTC AATTTCCCTT AAAGAGTCTG TGACGGTCCG TCACGCCTGT 8815 ||| || || |||||||||| ||||||| |||||||||| |||||||||| ||||||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 GACGGTCCGT CCTGCCATTC CGTTACAAAG TTCAGAGAGT CGA-TTTCAG TACCCATTTT 8874 |||||||||| | |||||||| |||||| ||| |||||||||| ||| ||| || |||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCA-ATT 417 TCAGAATTTC TAAGTGTTTT GAAACGAGAC CCCTCGACGG TCCGTCGTGC CCATGACGGT 8934 |||||||||| |||||||||| |||||||||| ||||||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTGGG ATCCGTCGTC TCAACC-ATT TTTCCAGAAA TAACATTTGT TGCTCAAAAT 8993 |||||||||| ||||||||| |||||| || |||||| ||| ||| || || | ||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTCGTTACAC TAACACTGAT AAATGTTCTT CTCTATAATG TCTATATAGT 9053 |||||||||| ||||||||| | GACTAAACAG GTCGTTACAT T......... .......... .......... .......... 558 TGAGATTTTG AATTTGTATT GTATAAAACT TTGATATTCA ATAAATTTTA TTGATTTTGT 9113 .......... .......... .......... .......... .......... .......... 558 TGAAAGATTT GATATCCTTT TCTGTATCTA TTATTTCTCC TAATTGTTGA TTATTCTCTT 9173 .......... .......... .......... .......... .......... .......... 558 CCTTTGTCCT TTTATTTAAT ATTTTTGATA GAAAGTTATT ACTTATCATA TTTTTGGAAT 9233 .......... .......... .......... .......... .......... .......... 
558 AGCTTGGTCT TGGTATTTCT TCTCTTGATC TAGTTGAAAA AATATCCATT TTTTCTGTTT 9293 .......... .......... .......... .......... .......... .......... 558 TTTAATTTTT TTTCTTTTTG GGGAGTAATT TCTATATTAT TCAGTTTTTC TATTAATTCT 9353 .......... .......... .......... .......... .......... .......... 558 TCAGTTTTTC TATTAATTCT TCTATTCCGG TTGTATCTAT ATTTTTACTT TCTAATTTTT 9413 .......... .......... .......... .......... .......... .......... 558 CTTTAAAGTT TTTATCTATC TTATTATTTA GTTGTAATAA AAGTGATATT ATATTATTTT 9473 .......... .......... .......... .......... .......... .......... 558 GTTGTATAAT AATTTGATCT GATTTTGAAG ATGGACTAAA TTCTTTGTAA TCTGATGGGA 9533 .......... .......... .......... .......... .......... .......... 558 GGGCTATTCT TTTTTCTATT AGTTTTGTTG CTTCTTTATG TGTTTCTGAT TCTACTAAGT 9593 .......... .......... .......... .......... .......... .......... 558 CTATCATGAA AGAACTATTT CTTTAATTGA TTCTAATTCT GTCTTTATTT CTTGTATTTT 9653 .......... .......... .......... .......... .......... .......... 558 TATTACTAAT GTATTATTAA TTTCTTCTTT AGATTTATTT TTTAACTCGT TTAATTTTTT 9713 .......... .......... .......... .......... .......... .......... 558 ATTTAAATCT ATTCCTTGTT CTTTTAGATA ATCTAAGTTA TTTTCTATCT TACTAAAAAA 9773 .......... .......... .......... .......... .......... .......... 558 CTGTTTTTGC TTATTTTGTG TTTCTTCTTG TTCACTT-AA TATTTTTATA ATTTTATTGT 9832 | | ||||| | |||| || |||| |||| |||||| | .......... .......... 
TATGTTCTTC CT-ACTTAAA TATTATTATT ATTTTA-CGA 596 TTGAT 9837 || || TTTAT 601 hqPGS_C06HBa0112G05.1-6+_SGN-E390013+ (8457 9014) ******************************************************************************** EST sequence 12 +strand 717 n (File: SGN-E550484+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCAA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA AAAAAAA Predicted gene structure (within gDNA segment 7362 to 9840): Exon 1 8457 9014 ( 558 n); cDNA 2 558 ( 557 n); score: 0.888 Intron 1 9015 9793 ( 779 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0.69) Exon 2 9794 9837 ( 44 n); cDNA 559 601 ( 43 n); score: 0.693 PPA cDNA 699 717 MATCH C06HBa0112G05.1-6+ SGN-E550484+ 0.888 602 0.840 C PGS_C06HBa0112G05.1-6+_SGN-E550484+ (8457 9014,9794 9837) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT TTTTGTTCTT TCTATTTTCT TTATTCAAAC CCTCTTTCTT 8516 ||| | | ||||||||| || ||||||| | | |||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGCAA TAATAACCCA CTAATTTACT 8576 |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| |||||||||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 CAAGGTTACC TTTTTTAACC CCCAAGTAAT TAGACTTATT AACATTAACC CACTAACTTT 
8636 |||||||||| | |||||||| |||| ||||| |||||||||| ||||| |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT AAAAAACGTC CCTTAAAACA T-TAAAGAAA TCCGACTCAG 8695 |||||||||| ||||||||| |||||||| ||||||||| | |||||||| |||||| ||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA TGCAGCCTGT GACGACTCGT CGTGCCTGCG ACGGTCCGTC TTGCTGCTCC 8755 ||||||||| ||| ||||| || | | ||| |||||||||| |||||||||| ||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCAGAGACTC AATTTCCCTT AAAGAGTCTG TGACGGTCCG TCACGCCTGT 8815 ||| || || |||||||||| ||||||| |||||||||| |||||||||| ||||||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 GACGGTCCGT CCTGCCATTC CGTTACAAAG TTCAGAGAGT CGA-TTTCAG TACCCATTTT 8874 |||||||||| | |||||||| |||||| ||| |||||||||| ||| ||| || |||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCA-ATT 417 TCAGAATTTC TAAGTGTTTT GAAACGAGAC CCCTCGACGG TCCGTCGTGC CCATGACGGT 8934 |||||||||| |||||||||| |||||||||| ||||||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTGGG ATCCGTCGTC TCAACC-ATT TTTCCAGAAA TAACATTTGT TGCTCAAAAT 8993 |||||||||| ||||||||| |||||| || |||||| ||| ||| || || | ||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTCGTTACAC TAACACTGAT AAATGTTCTT CTCTATAATG TCTATATAGT 9053 |||||||||| ||||||||| | GACTAAACAG GTCGTTACAT T......... .......... .......... .......... 558 TGAGATTTTG AATTTGTATT GTATAAAACT TTGATATTCA ATAAATTTTA TTGATTTTGT 9113 .......... .......... .......... .......... .......... .......... 558 TGAAAGATTT GATATCCTTT TCTGTATCTA TTATTTCTCC TAATTGTTGA TTATTCTCTT 9173 .......... .......... .......... .......... .......... .......... 558 CCTTTGTCCT TTTATTTAAT ATTTTTGATA GAAAGTTATT ACTTATCATA TTTTTGGAAT 9233 .......... .......... .......... .......... .......... .......... 
558 AGCTTGGTCT TGGTATTTCT TCTCTTGATC TAGTTGAAAA AATATCCATT TTTTCTGTTT 9293 .......... .......... .......... .......... .......... .......... 558 TTTAATTTTT TTTCTTTTTG GGGAGTAATT TCTATATTAT TCAGTTTTTC TATTAATTCT 9353 .......... .......... .......... .......... .......... .......... 558 TCAGTTTTTC TATTAATTCT TCTATTCCGG TTGTATCTAT ATTTTTACTT TCTAATTTTT 9413 .......... .......... .......... .......... .......... .......... 558 CTTTAAAGTT TTTATCTATC TTATTATTTA GTTGTAATAA AAGTGATATT ATATTATTTT 9473 .......... .......... .......... .......... .......... .......... 558 GTTGTATAAT AATTTGATCT GATTTTGAAG ATGGACTAAA TTCTTTGTAA TCTGATGGGA 9533 .......... .......... .......... .......... .......... .......... 558 GGGCTATTCT TTTTTCTATT AGTTTTGTTG CTTCTTTATG TGTTTCTGAT TCTACTAAGT 9593 .......... .......... .......... .......... .......... .......... 558 CTATCATGAA AGAACTATTT CTTTAATTGA TTCTAATTCT GTCTTTATTT CTTGTATTTT 9653 .......... .......... .......... .......... .......... .......... 558 TATTACTAAT GTATTATTAA TTTCTTCTTT AGATTTATTT TTTAACTCGT TTAATTTTTT 9713 .......... .......... .......... .......... .......... .......... 558 ATTTAAATCT ATTCCTTGTT CTTTTAGATA ATCTAAGTTA TTTTCTATCT TACTAAAAAA 9773 .......... .......... .......... .......... .......... .......... 558 CTGTTTTTGC TTATTTTGTG TTTCTTCTTG TTCACTT-AA TATTTTTATA ATTTTATTGT 9832 | | ||||| | |||| || |||| |||| |||||| | .......... .......... 
TATGTTCTTC CT-ACTTAAA TATTATTATT ATTTTA-CGA 596 TTGAT 9837 || || TTTAT 601 hqPGS_C06HBa0112G05.1-6+_SGN-E550484+ (8457 9014) ******************************************************************************** EST sequence 13 +strand 713 n (File: SGN-E550211+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT 481 CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT CAAAACGACT 541 AAACAGGTCG TTACATTTAT GTTCTTCCTA CTTAAATATT ATTATTATTT TACGATTTAT 601 AACACTATTA GAAACAAAGA TTTTCTCAAC CATGAATTAA TGAAAAAATT ATGCCATAAA 661 ATATAAAAAA TTTACTCATT TTTCATTGAG CTAATTCATA AAAAAAAAAA AAA Predicted gene structure (within gDNA segment 7372 to 9840): Exon 1 8457 9014 ( 558 n); cDNA 2 557 ( 556 n); score: 0.886 Intron 1 9015 9793 ( 779 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0.69) Exon 2 9794 9837 ( 44 n); cDNA 558 600 ( 43 n); score: 0.693 PPA cDNA 698 713 MATCH C06HBa0112G05.1-6+ SGN-E550211+ 0.886 602 0.844 C PGS_C06HBa0112G05.1-6+_SGN-E550211+ (8457 9014,9794 9837) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT TTTTGTTCTT TCTATTTTCT TTATTCAAAC CCTCTTTCTT 8516 ||| | | ||||||||| || ||||||| | | |||| |||||| ||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTC-AAC CCTCCTTCTT 59 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGCAA TAATAACCCA CTAATTTACT 8576 |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| |||||||||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 118 CAAGGTTACC TTTTTTAACC CCCAAGTAAT TAGACTTATT AACATTAACC CACTAACTTT 8636 
|||||||||| | |||||||| |||| ||||| |||||||||| ||||| |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 178 ATAATTAAAG CAGGAATAGT AAAAAACGTC CCTTAAAACA T-TAAAGAAA TCCGACTCAG 8695 |||||||||| ||||||||| |||||||| ||||||||| | |||||||| |||||| ||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 238 CCTGGGATTA TGCAGCCTGT GACGACTCGT CGTGCCTGCG ACGGTCCGTC TTGCTGCTCC 8755 ||||||||| ||| ||||| || | | ||| |||||||||| |||||||||| ||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 297 GTCACAGAGT TCAGAGACTC AATTTCCCTT AAAGAGTCTG TGACGGTCCG TCACGCCTGT 8815 ||| || || |||||||||| ||||||| |||||||||| |||||||||| ||||||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 357 GACGGTCCGT CCTGCCATTC CGTTACAAAG TTCAGAGAGT CGA-TTTCAG TACCCATTTT 8874 |||||||||| | |||||||| |||||| ||| |||||||||| ||| ||| || |||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCA-ATT 416 TCAGAATTTC TAAGTGTTTT GAAACGAGAC CCCTCGACGG TCCGTCGTGC CCATGACGGT 8934 |||||||||| |||||||||| |||||||||| ||||||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 476 CCGTCGTGGG ATCCGTCGTC TCAACC-ATT TTTCCAGAAA TAACATTTGT TGCTCAAAAT 8993 |||||||||| ||||||||| |||||| || |||||| ||| ||| || || | ||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 536 GACTAAACAG GTCGTTACAC TAACACTGAT AAATGTTCTT CTCTATAATG TCTATATAGT 9053 |||||||||| ||||||||| | GACTAAACAG GTCGTTACAT T......... .......... .......... .......... 557 TGAGATTTTG AATTTGTATT GTATAAAACT TTGATATTCA ATAAATTTTA TTGATTTTGT 9113 .......... .......... .......... .......... .......... .......... 557 TGAAAGATTT GATATCCTTT TCTGTATCTA TTATTTCTCC TAATTGTTGA TTATTCTCTT 9173 .......... .......... .......... .......... .......... .......... 557 CCTTTGTCCT TTTATTTAAT ATTTTTGATA GAAAGTTATT ACTTATCATA TTTTTGGAAT 9233 .......... .......... .......... .......... .......... .......... 
557 AGCTTGGTCT TGGTATTTCT TCTCTTGATC TAGTTGAAAA AATATCCATT TTTTCTGTTT 9293 .......... .......... .......... .......... .......... .......... 557 TTTAATTTTT TTTCTTTTTG GGGAGTAATT TCTATATTAT TCAGTTTTTC TATTAATTCT 9353 .......... .......... .......... .......... .......... .......... 557 TCAGTTTTTC TATTAATTCT TCTATTCCGG TTGTATCTAT ATTTTTACTT TCTAATTTTT 9413 .......... .......... .......... .......... .......... .......... 557 CTTTAAAGTT TTTATCTATC TTATTATTTA GTTGTAATAA AAGTGATATT ATATTATTTT 9473 .......... .......... .......... .......... .......... .......... 557 GTTGTATAAT AATTTGATCT GATTTTGAAG ATGGACTAAA TTCTTTGTAA TCTGATGGGA 9533 .......... .......... .......... .......... .......... .......... 557 GGGCTATTCT TTTTTCTATT AGTTTTGTTG CTTCTTTATG TGTTTCTGAT TCTACTAAGT 9593 .......... .......... .......... .......... .......... .......... 557 CTATCATGAA AGAACTATTT CTTTAATTGA TTCTAATTCT GTCTTTATTT CTTGTATTTT 9653 .......... .......... .......... .......... .......... .......... 557 TATTACTAAT GTATTATTAA TTTCTTCTTT AGATTTATTT TTTAACTCGT TTAATTTTTT 9713 .......... .......... .......... .......... .......... .......... 557 ATTTAAATCT ATTCCTTGTT CTTTTAGATA ATCTAAGTTA TTTTCTATCT TACTAAAAAA 9773 .......... .......... .......... .......... .......... .......... 557 CTGTTTTTGC TTATTTTGTG TTTCTTCTTG TTCACTT-AA TATTTTTATA ATTTTATTGT 9832 | | ||||| | |||| || |||| |||| |||||| | .......... .......... 
TATGTTCTTC CT-ACTTAAA TATTATTATT ATTTTA-CGA 595 TTGAT 9837 || || TTTAT 600 hqPGS_C06HBa0112G05.1-6+_SGN-E550211+ (8457 9014) ******************************************************************************** EST sequence 14 +strand 713 n (File: SGN-E550464+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GNTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCTA CCATGAATTA ATGAAAAATT ATGCCATAAG 661 ATATAAAAAA TTTACTCATT TTTCATTGAG CTAATTCATA AAAAAAAAAA AAA Predicted gene structure (within gDNA segment 7362 to 9840): Exon 1 8457 9014 ( 558 n); cDNA 2 558 ( 557 n); score: 0.886 Intron 1 9015 9793 ( 779 n); Pd: 0.000 (s: 0.84), Pa: 0.000 (s: 0.69) Exon 2 9794 9837 ( 44 n); cDNA 559 601 ( 43 n); score: 0.693 PPA cDNA 698 713 MATCH C06HBa0112G05.1-6+ SGN-E550464+ 0.886 602 0.844 C PGS_C06HBa0112G05.1-6+_SGN-E550464+ (8457 9014,9794 9837) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT TTTTGTTCTT TCTATTTTCT TTATTCAAAC CCTCTTTCTT 8516 ||| | | ||||||||| || ||||||| | | |||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGCAA TAATAACCCA CTAATTTACT 8576 |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| |||||||||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 CAAGGTTACC TTTTTTAACC CCCAAGTAAT TAGACTTATT AACATTAACC CACTAACTTT 8636 
|||||||||| | |||||||| |||| ||||| |||||||||| ||||| |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT AAAAAACGTC CCTTAAAACA T-TAAAGAAA TCCGACTCAG 8695 |||||||||| ||||||||| |||||||| ||||||||| | |||||||| |||||| ||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA TGCAGCCTGT GACGACTCGT CGTGCCTGCG ACGGTCCGTC TTGCTGCTCC 8755 ||||||||| ||| ||||| || | | ||| |||||||||| |||||||||| ||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCAGAGACTC AATTTCCCTT AAAGAGTCTG TGACGGTCCG TCACGCCTGT 8815 ||| || || |||||||||| ||||||| |||||||||| |||||||||| ||||||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 GACGGTCCGT CCTGCCATTC CGTTACAAAG TTCAGAGAGT CGA-TTTCAG TACCCATTTT 8874 |||||||||| | |||||||| |||||| ||| |||||||||| ||| ||| || |||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCA-ATT 417 TCAGAATTTC TAAGTGTTTT GAAACGAGAC CCCTCGACGG TCCGTCGTGC CCATGACGGT 8934 |||||||||| |||||||||| |||||||||| ||||||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTGGG ATCCGTCGTC TCAACC-ATT TTTCCAGAAA TAACATTTGT TGCTCAAAAT 8993 |||||||||| ||||||||| |||||| || |||||| ||| ||| || || | ||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTCGTTACAC TAACACTGAT AAATGTTCTT CTCTATAATG TCTATATAGT 9053 |||||||||| |||| |||| | GACTAAACAG GTCGNTACAT T......... .......... .......... .......... 558 TGAGATTTTG AATTTGTATT GTATAAAACT TTGATATTCA ATAAATTTTA TTGATTTTGT 9113 .......... .......... .......... .......... .......... .......... 558 TGAAAGATTT GATATCCTTT TCTGTATCTA TTATTTCTCC TAATTGTTGA TTATTCTCTT 9173 .......... .......... .......... .......... .......... .......... 558 CCTTTGTCCT TTTATTTAAT ATTTTTGATA GAAAGTTATT ACTTATCATA TTTTTGGAAT 9233 .......... .......... .......... .......... .......... .......... 
558 AGCTTGGTCT TGGTATTTCT TCTCTTGATC TAGTTGAAAA AATATCCATT TTTTCTGTTT 9293 .......... .......... .......... .......... .......... .......... 558 TTTAATTTTT TTTCTTTTTG GGGAGTAATT TCTATATTAT TCAGTTTTTC TATTAATTCT 9353 .......... .......... .......... .......... .......... .......... 558 TCAGTTTTTC TATTAATTCT TCTATTCCGG TTGTATCTAT ATTTTTACTT TCTAATTTTT 9413 .......... .......... .......... .......... .......... .......... 558 CTTTAAAGTT TTTATCTATC TTATTATTTA GTTGTAATAA AAGTGATATT ATATTATTTT 9473 .......... .......... .......... .......... .......... .......... 558 GTTGTATAAT AATTTGATCT GATTTTGAAG ATGGACTAAA TTCTTTGTAA TCTGATGGGA 9533 .......... .......... .......... .......... .......... .......... 558 GGGCTATTCT TTTTTCTATT AGTTTTGTTG CTTCTTTATG TGTTTCTGAT TCTACTAAGT 9593 .......... .......... .......... .......... .......... .......... 558 CTATCATGAA AGAACTATTT CTTTAATTGA TTCTAATTCT GTCTTTATTT CTTGTATTTT 9653 .......... .......... .......... .......... .......... .......... 558 TATTACTAAT GTATTATTAA TTTCTTCTTT AGATTTATTT TTTAACTCGT TTAATTTTTT 9713 .......... .......... .......... .......... .......... .......... 558 ATTTAAATCT ATTCCTTGTT CTTTTAGATA ATCTAAGTTA TTTTCTATCT TACTAAAAAA 9773 .......... .......... .......... .......... .......... .......... 558 CTGTTTTTGC TTATTTTGTG TTTCTTCTTG TTCACTT-AA TATTTTTATA ATTTTATTGT 9832 | | ||||| | |||| || |||| |||| |||||| | .......... .......... 
TATGTTCTTC CT-ACTTAAA TATTATTATT ATTTTA-CGA 596 TTGAT 9837 || || TTTAT 601 hqPGS_C06HBa0112G05.1-6+_SGN-E550464+ (8457 9014) ******************************************************************************** EST sequence 15 +strand 713 n (File: SGN-E549941+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA TATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCGA CCNATGATTA ATGAAAAATT ATGCCATCAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA AAA Predicted gene structure (within gDNA segment 7362 to 9840): Exon 1 8457 9014 ( 558 n); cDNA 2 558 ( 557 n); score: 0.888 Intron 1 9015 9793 ( 779 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0.69) Exon 2 9794 9837 ( 44 n); cDNA 559 601 ( 43 n); score: 0.693 PPA cDNA 699 713 MATCH C06HBa0112G05.1-6+ SGN-E549941+ 0.888 602 0.844 C PGS_C06HBa0112G05.1-6+_SGN-E549941+ (8457 9014,9794 9837) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT TTTTGTTCTT TCTATTTTCT TTATTCAAAC CCTCTTTCTT 8516 ||| | | ||||||||| || ||||||| | | |||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGCAA TAATAACCCA CTAATTTACT 8576 |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| |||||||||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 CAAGGTTACC TTTTTTAACC CCCAAGTAAT TAGACTTATT AACATTAACC CACTAACTTT 8636 
|||||||||| | |||||||| |||| ||||| |||||||||| ||||| |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT AAAAAACGTC CCTTAAAACA T-TAAAGAAA TCCGACTCAG 8695 |||||||||| ||||||||| |||||||| ||||||||| | |||||||| |||||| ||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA TGCAGCCTGT GACGACTCGT CGTGCCTGCG ACGGTCCGTC TTGCTGCTCC 8755 ||||||||| ||| ||||| || | | ||| |||||||||| |||||||||| ||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCAGAGACTC AATTTCCCTT AAAGAGTCTG TGACGGTCCG TCACGCCTGT 8815 ||| || || |||||||||| ||||||| |||||||||| |||||||||| ||||||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 GACGGTCCGT CCTGCCATTC CGTTACAAAG TTCAGAGAGT CGA-TTTCAG TACCCATTTT 8874 |||||||||| | |||||||| |||||| ||| |||||||||| ||| ||| || |||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCA-ATT 417 TCAGAATTTC TAAGTGTTTT GAAACGAGAC CCCTCGACGG TCCGTCGTGC CCATGACGGT 8934 |||||||||| |||||||||| |||||||||| ||||||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTGGG ATCCGTCGTC TCAACC-ATT TTTCCAGAAA TAACATTTGT TGCTCAAAAT 8993 |||||||||| ||||||||| |||||| || |||||| ||| ||| || || | ||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAATATCTGC TACTCAAAAC 537 GACTAAACAG GTCGTTACAC TAACACTGAT AAATGTTCTT CTCTATAATG TCTATATAGT 9053 |||||||||| ||||||||| | GACTAAACAG GTCGTTACAT T......... .......... .......... .......... 558 TGAGATTTTG AATTTGTATT GTATAAAACT TTGATATTCA ATAAATTTTA TTGATTTTGT 9113 .......... .......... .......... .......... .......... .......... 558 TGAAAGATTT GATATCCTTT TCTGTATCTA TTATTTCTCC TAATTGTTGA TTATTCTCTT 9173 .......... .......... .......... .......... .......... .......... 558 CCTTTGTCCT TTTATTTAAT ATTTTTGATA GAAAGTTATT ACTTATCATA TTTTTGGAAT 9233 .......... .......... .......... .......... .......... .......... 
558 AGCTTGGTCT TGGTATTTCT TCTCTTGATC TAGTTGAAAA AATATCCATT TTTTCTGTTT 9293 .......... .......... .......... .......... .......... .......... 558 TTTAATTTTT TTTCTTTTTG GGGAGTAATT TCTATATTAT TCAGTTTTTC TATTAATTCT 9353 .......... .......... .......... .......... .......... .......... 558 TCAGTTTTTC TATTAATTCT TCTATTCCGG TTGTATCTAT ATTTTTACTT TCTAATTTTT 9413 .......... .......... .......... .......... .......... .......... 558 CTTTAAAGTT TTTATCTATC TTATTATTTA GTTGTAATAA AAGTGATATT ATATTATTTT 9473 .......... .......... .......... .......... .......... .......... 558 GTTGTATAAT AATTTGATCT GATTTTGAAG ATGGACTAAA TTCTTTGTAA TCTGATGGGA 9533 .......... .......... .......... .......... .......... .......... 558 GGGCTATTCT TTTTTCTATT AGTTTTGTTG CTTCTTTATG TGTTTCTGAT TCTACTAAGT 9593 .......... .......... .......... .......... .......... .......... 558 CTATCATGAA AGAACTATTT CTTTAATTGA TTCTAATTCT GTCTTTATTT CTTGTATTTT 9653 .......... .......... .......... .......... .......... .......... 558 TATTACTAAT GTATTATTAA TTTCTTCTTT AGATTTATTT TTTAACTCGT TTAATTTTTT 9713 .......... .......... .......... .......... .......... .......... 558 ATTTAAATCT ATTCCTTGTT CTTTTAGATA ATCTAAGTTA TTTTCTATCT TACTAAAAAA 9773 .......... .......... .......... .......... .......... .......... 558 CTGTTTTTGC TTATTTTGTG TTTCTTCTTG TTCACTT-AA TATTTTTATA ATTTTATTGT 9832 | | ||||| | |||| || |||| |||| |||||| | .......... .......... 
TATGTTCTTC CT-ACTTAAA TATTATTATT ATTTTA-CGA 596 TTGAT 9837 || || TTTAT 601 hqPGS_C06HBa0112G05.1-6+_SGN-E549941+ (8457 9014) ******************************************************************************** EST sequence 17 +strand 714 n (File: SGN-E550025+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTTCTCAA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATATAAAAA ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA AAAA Predicted gene structure (within gDNA segment 7362 to 9840): Exon 1 8457 9014 ( 558 n); cDNA 2 558 ( 557 n); score: 0.888 Intron 1 9015 9793 ( 779 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0.69) Exon 2 9794 9837 ( 44 n); cDNA 559 601 ( 43 n); score: 0.693 PPA cDNA 699 714 MATCH C06HBa0112G05.1-6+ SGN-E550025+ 0.888 602 0.843 C PGS_C06HBa0112G05.1-6+_SGN-E550025+ (8457 9014,9794 9837) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT TTTTGTTCTT TCTATTTTCT TTATTCAAAC CCTCTTTCTT 8516 ||| | | ||||||||| || ||||||| | | |||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGCAA TAATAACCCA CTAATTTACT 8576 |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| |||||||||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 CAAGGTTACC TTTTTTAACC CCCAAGTAAT TAGACTTATT AACATTAACC CACTAACTTT 8636 
|||||||||| | |||||||| |||| ||||| |||||||||| ||||| |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT AAAAAACGTC CCTTAAAACA T-TAAAGAAA TCCGACTCAG 8695 |||||||||| ||||||||| |||||||| ||||||||| | |||||||| |||||| ||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA TGCAGCCTGT GACGACTCGT CGTGCCTGCG ACGGTCCGTC TTGCTGCTCC 8755 ||||||||| ||| ||||| || | | ||| |||||||||| |||||||||| ||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCAGAGACTC AATTTCCCTT AAAGAGTCTG TGACGGTCCG TCACGCCTGT 8815 ||| || || |||||||||| ||||||| |||||||||| |||||||||| ||||||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 GACGGTCCGT CCTGCCATTC CGTTACAAAG TTCAGAGAGT CGA-TTTCAG TACCCATTTT 8874 |||||||||| | |||||||| |||||| ||| |||||||||| ||| ||| || |||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCA-ATT 417 TCAGAATTTC TAAGTGTTTT GAAACGAGAC CCCTCGACGG TCCGTCGTGC CCATGACGGT 8934 |||||||||| |||||||||| |||||||||| ||||||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTGGG ATCCGTCGTC TCAACC-ATT TTTCCAGAAA TAACATTTGT TGCTCAAAAT 8993 |||||||||| ||||||||| |||||| || |||||| ||| ||| || || | ||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTCGTTACAC TAACACTGAT AAATGTTCTT CTCTATAATG TCTATATAGT 9053 |||||||||| ||||||||| | GACTAAACAG GTCGTTACAT T......... .......... .......... .......... 558 TGAGATTTTG AATTTGTATT GTATAAAACT TTGATATTCA ATAAATTTTA TTGATTTTGT 9113 .......... .......... .......... .......... .......... .......... 558 TGAAAGATTT GATATCCTTT TCTGTATCTA TTATTTCTCC TAATTGTTGA TTATTCTCTT 9173 .......... .......... .......... .......... .......... .......... 558 CCTTTGTCCT TTTATTTAAT ATTTTTGATA GAAAGTTATT ACTTATCATA TTTTTGGAAT 9233 .......... .......... .......... .......... .......... .......... 
558 AGCTTGGTCT TGGTATTTCT TCTCTTGATC TAGTTGAAAA AATATCCATT TTTTCTGTTT 9293 .......... .......... .......... .......... .......... .......... 558 TTTAATTTTT TTTCTTTTTG GGGAGTAATT TCTATATTAT TCAGTTTTTC TATTAATTCT 9353 .......... .......... .......... .......... .......... .......... 558 TCAGTTTTTC TATTAATTCT TCTATTCCGG TTGTATCTAT ATTTTTACTT TCTAATTTTT 9413 .......... .......... .......... .......... .......... .......... 558 CTTTAAAGTT TTTATCTATC TTATTATTTA GTTGTAATAA AAGTGATATT ATATTATTTT 9473 .......... .......... .......... .......... .......... .......... 558 GTTGTATAAT AATTTGATCT GATTTTGAAG ATGGACTAAA TTCTTTGTAA TCTGATGGGA 9533 .......... .......... .......... .......... .......... .......... 558 GGGCTATTCT TTTTTCTATT AGTTTTGTTG CTTCTTTATG TGTTTCTGAT TCTACTAAGT 9593 .......... .......... .......... .......... .......... .......... 558 CTATCATGAA AGAACTATTT CTTTAATTGA TTCTAATTCT GTCTTTATTT CTTGTATTTT 9653 .......... .......... .......... .......... .......... .......... 558 TATTACTAAT GTATTATTAA TTTCTTCTTT AGATTTATTT TTTAACTCGT TTAATTTTTT 9713 .......... .......... .......... .......... .......... .......... 558 ATTTAAATCT ATTCCTTGTT CTTTTAGATA ATCTAAGTTA TTTTCTATCT TACTAAAAAA 9773 .......... .......... .......... .......... .......... .......... 558 CTGTTTTTGC TTATTTTGTG TTTCTTCTTG TTCACTT-AA TATTTTTATA ATTTTATTGT 9832 | | ||||| | |||| || |||| |||| |||||| | .......... .......... 
TATGTTCTTC CT-ACTTAAA TATTATTATT ATTTTA-CGA 596 TTGAT 9837 || || TTTAT 601 hqPGS_C06HBa0112G05.1-6+_SGN-E550025+ (8457 9014) ******************************************************************************** EST sequence 24 +strand 711 n (File: SGN-E396039+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AAAAACAAAG ATTTTCTCCA CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AAATAAAAAA AATTTACTCA TTTTTTCTTG GAGCTAATTC AAAAAAAAAA A Predicted gene structure (within gDNA segment 7362 to 9840): Exon 1 8457 9014 ( 558 n); cDNA 2 558 ( 557 n); score: 0.888 Intron 1 9015 9793 ( 779 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0.69) Exon 2 9794 9837 ( 44 n); cDNA 559 601 ( 43 n); score: 0.693 PPA cDNA 661 672 MATCH C06HBa0112G05.1-6+ SGN-E396039+ 0.888 602 0.847 C PGS_C06HBa0112G05.1-6+_SGN-E396039+ (8457 9014,9794 9837) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT TTTTGTTCTT TCTATTTTCT TTATTCAAAC CCTCTTTCTT 8516 ||| | | ||||||||| || ||||||| | | |||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGCAA TAATAACCCA CTAATTTACT 8576 |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| |||||||||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 CAAGGTTACC TTTTTTAACC CCCAAGTAAT TAGACTTATT AACATTAACC CACTAACTTT 8636 
|||||||||| | |||||||| |||| ||||| |||||||||| ||||| |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT AAAAAACGTC CCTTAAAACA T-TAAAGAAA TCCGACTCAG 8695 |||||||||| ||||||||| |||||||| ||||||||| | |||||||| |||||| ||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA TGCAGCCTGT GACGACTCGT CGTGCCTGCG ACGGTCCGTC TTGCTGCTCC 8755 ||||||||| ||| ||||| || | | ||| |||||||||| |||||||||| ||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCAGAGACTC AATTTCCCTT AAAGAGTCTG TGACGGTCCG TCACGCCTGT 8815 ||| || || |||||||||| ||||||| |||||||||| |||||||||| ||||||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 GACGGTCCGT CCTGCCATTC CGTTACAAAG TTCAGAGAGT CGA-TTTCAG TACCCATTTT 8874 |||||||||| | |||||||| |||||| ||| |||||||||| ||| ||| || |||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCA-ATT 417 TCAGAATTTC TAAGTGTTTT GAAACGAGAC CCCTCGACGG TCCGTCGTGC CCATGACGGT 8934 |||||||||| |||||||||| |||||||||| ||||||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTGGG ATCCGTCGTC TCAACC-ATT TTTCCAGAAA TAACATTTGT TGCTCAAAAT 8993 |||||||||| ||||||||| |||||| || |||||| ||| ||| || || | ||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTCGTTACAC TAACACTGAT AAATGTTCTT CTCTATAATG TCTATATAGT 9053 |||||||||| ||||||||| | GACTAAACAG GTCGTTACAT T......... .......... .......... .......... 558 TGAGATTTTG AATTTGTATT GTATAAAACT TTGATATTCA ATAAATTTTA TTGATTTTGT 9113 .......... .......... .......... .......... .......... .......... 558 TGAAAGATTT GATATCCTTT TCTGTATCTA TTATTTCTCC TAATTGTTGA TTATTCTCTT 9173 .......... .......... .......... .......... .......... .......... 558 CCTTTGTCCT TTTATTTAAT ATTTTTGATA GAAAGTTATT ACTTATCATA TTTTTGGAAT 9233 .......... .......... .......... .......... .......... .......... 
558 AGCTTGGTCT TGGTATTTCT TCTCTTGATC TAGTTGAAAA AATATCCATT TTTTCTGTTT 9293 .......... .......... .......... .......... .......... .......... 558 TTTAATTTTT TTTCTTTTTG GGGAGTAATT TCTATATTAT TCAGTTTTTC TATTAATTCT 9353 .......... .......... .......... .......... .......... .......... 558 TCAGTTTTTC TATTAATTCT TCTATTCCGG TTGTATCTAT ATTTTTACTT TCTAATTTTT 9413 .......... .......... .......... .......... .......... .......... 558 CTTTAAAGTT TTTATCTATC TTATTATTTA GTTGTAATAA AAGTGATATT ATATTATTTT 9473 .......... .......... .......... .......... .......... .......... 558 GTTGTATAAT AATTTGATCT GATTTTGAAG ATGGACTAAA TTCTTTGTAA TCTGATGGGA 9533 .......... .......... .......... .......... .......... .......... 558 GGGCTATTCT TTTTTCTATT AGTTTTGTTG CTTCTTTATG TGTTTCTGAT TCTACTAAGT 9593 .......... .......... .......... .......... .......... .......... 558 CTATCATGAA AGAACTATTT CTTTAATTGA TTCTAATTCT GTCTTTATTT CTTGTATTTT 9653 .......... .......... .......... .......... .......... .......... 558 TATTACTAAT GTATTATTAA TTTCTTCTTT AGATTTATTT TTTAACTCGT TTAATTTTTT 9713 .......... .......... .......... .......... .......... .......... 558 ATTTAAATCT ATTCCTTGTT CTTTTAGATA ATCTAAGTTA TTTTCTATCT TACTAAAAAA 9773 .......... .......... .......... .......... .......... .......... 558 CTGTTTTTGC TTATTTTGTG TTTCTTCTTG TTCACTT-AA TATTTTTATA ATTTTATTGT 9832 | | ||||| | |||| || |||| |||| |||||| | .......... .......... 
TATGTTCTTC CT-ACTTAAA TATTATTATT ATTTTA-CGA 596 TTGAT 9837 || || TTTAT 601 hqPGS_C06HBa0112G05.1-6+_SGN-E396039+ (8457 9014) ******************************************************************************** EST sequence 26 +strand 711 n (File: SGN-E396056+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAA ATTTTCTCAC CCATGAATTA ATGAAAAAAT TATGCCATAA 661 AATAATAAAA ATTTACTCAT TTTTTCTTTG AGCTAATTCA TAAAAAAAAA A Predicted gene structure (within gDNA segment 7362 to 9840): Exon 1 8457 9014 ( 558 n); cDNA 2 558 ( 557 n); score: 0.888 Intron 1 9015 9793 ( 779 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0.69) Exon 2 9794 9837 ( 44 n); cDNA 559 601 ( 43 n); score: 0.693 PPA cDNA 700 711 MATCH C06HBa0112G05.1-6+ SGN-E396056+ 0.888 602 0.847 C PGS_C06HBa0112G05.1-6+_SGN-E396056+ (8457 9014,9794 9837) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT TTTTGTTCTT TCTATTTTCT TTATTCAAAC CCTCTTTCTT 8516 ||| | | ||||||||| || ||||||| | | |||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGCAA TAATAACCCA CTAATTTACT 8576 |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| |||||||||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 CAAGGTTACC TTTTTTAACC CCCAAGTAAT TAGACTTATT AACATTAACC CACTAACTTT 8636 
|||||||||| | |||||||| |||| ||||| |||||||||| ||||| |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT AAAAAACGTC CCTTAAAACA T-TAAAGAAA TCCGACTCAG 8695 |||||||||| ||||||||| |||||||| ||||||||| | |||||||| |||||| ||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA TGCAGCCTGT GACGACTCGT CGTGCCTGCG ACGGTCCGTC TTGCTGCTCC 8755 ||||||||| ||| ||||| || | | ||| |||||||||| |||||||||| ||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCAGAGACTC AATTTCCCTT AAAGAGTCTG TGACGGTCCG TCACGCCTGT 8815 ||| || || |||||||||| ||||||| |||||||||| |||||||||| ||||||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 GACGGTCCGT CCTGCCATTC CGTTACAAAG TTCAGAGAGT CGA-TTTCAG TACCCATTTT 8874 |||||||||| | |||||||| |||||| ||| |||||||||| ||| ||| || |||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCA-ATT 417 TCAGAATTTC TAAGTGTTTT GAAACGAGAC CCCTCGACGG TCCGTCGTGC CCATGACGGT 8934 |||||||||| |||||||||| |||||||||| ||||||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTGGG ATCCGTCGTC TCAACC-ATT TTTCCAGAAA TAACATTTGT TGCTCAAAAT 8993 |||||||||| ||||||||| |||||| || |||||| ||| ||| || || | ||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTCGTTACAC TAACACTGAT AAATGTTCTT CTCTATAATG TCTATATAGT 9053 |||||||||| ||||||||| | GACTAAACAG GTCGTTACAT T......... .......... .......... .......... 558 TGAGATTTTG AATTTGTATT GTATAAAACT TTGATATTCA ATAAATTTTA TTGATTTTGT 9113 .......... .......... .......... .......... .......... .......... 558 TGAAAGATTT GATATCCTTT TCTGTATCTA TTATTTCTCC TAATTGTTGA TTATTCTCTT 9173 .......... .......... .......... .......... .......... .......... 558 CCTTTGTCCT TTTATTTAAT ATTTTTGATA GAAAGTTATT ACTTATCATA TTTTTGGAAT 9233 .......... .......... .......... .......... .......... .......... 
558 AGCTTGGTCT TGGTATTTCT TCTCTTGATC TAGTTGAAAA AATATCCATT TTTTCTGTTT 9293 .......... .......... .......... .......... .......... .......... 558 TTTAATTTTT TTTCTTTTTG GGGAGTAATT TCTATATTAT TCAGTTTTTC TATTAATTCT 9353 .......... .......... .......... .......... .......... .......... 558 TCAGTTTTTC TATTAATTCT TCTATTCCGG TTGTATCTAT ATTTTTACTT TCTAATTTTT 9413 .......... .......... .......... .......... .......... .......... 558 CTTTAAAGTT TTTATCTATC TTATTATTTA GTTGTAATAA AAGTGATATT ATATTATTTT 9473 .......... .......... .......... .......... .......... .......... 558 GTTGTATAAT AATTTGATCT GATTTTGAAG ATGGACTAAA TTCTTTGTAA TCTGATGGGA 9533 .......... .......... .......... .......... .......... .......... 558 GGGCTATTCT TTTTTCTATT AGTTTTGTTG CTTCTTTATG TGTTTCTGAT TCTACTAAGT 9593 .......... .......... .......... .......... .......... .......... 558 CTATCATGAA AGAACTATTT CTTTAATTGA TTCTAATTCT GTCTTTATTT CTTGTATTTT 9653 .......... .......... .......... .......... .......... .......... 558 TATTACTAAT GTATTATTAA TTTCTTCTTT AGATTTATTT TTTAACTCGT TTAATTTTTT 9713 .......... .......... .......... .......... .......... .......... 558 ATTTAAATCT ATTCCTTGTT CTTTTAGATA ATCTAAGTTA TTTTCTATCT TACTAAAAAA 9773 .......... .......... .......... .......... .......... .......... 558 CTGTTTTTGC TTATTTTGTG TTTCTTCTTG TTCACTT-AA TATTTTTATA ATTTTATTGT 9832 | | ||||| | |||| || |||| |||| |||||| | .......... .......... 
TATGTTCTTC CT-ACTTAAA TATTATTATT ATTTTA-CGA 596 TTGAT 9837 || || TTTAT 601 hqPGS_C06HBa0112G05.1-6+_SGN-E396056+ (8457 9014) ******************************************************************************** EST sequence 31 +strand 690 n (File: SGN-E377133+) 1 TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT 481 CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT CAAAACGACT 541 AAACAGGTCG TTACATTTAT GTTCTTCCTA CTTAAATATT ATTATTATTT TACGATTTAT 601 AACACTATTA GAAACAAAGA TTTTCTCAAC CATGAATTAA TGAAAAAATT ATGCCATAAA 661 ATATAAAAAA TTTACTCATT TTTCATTGAG Predicted gene structure (within gDNA segment 7372 to 9840): Exon 1 8457 9014 ( 558 n); cDNA 1 557 ( 557 n); score: 0.888 Intron 1 9015 9793 ( 779 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0.69) Exon 2 9794 9837 ( 44 n); cDNA 558 600 ( 43 n); score: 0.693 MATCH C06HBa0112G05.1-6+ SGN-E377133+ 0.888 602 0.872 C PGS_C06HBa0112G05.1-6+_SGN-E377133+ (8457 9014,9794 9837) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT TTTTGTTCTT TCTATTTTCT TTATTCAAAC CCTCTTTCTT 8516 ||| | | ||||||||| || ||||||| | | |||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 59 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGCAA TAATAACCCA CTAATTTACT 8576 |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| |||||||||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 118 CAAGGTTACC TTTTTTAACC CCCAAGTAAT TAGACTTATT AACATTAACC CACTAACTTT 8636 |||||||||| | |||||||| |||| ||||| |||||||||| 
||||| |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 178 ATAATTAAAG CAGGAATAGT AAAAAACGTC CCTTAAAACA T-TAAAGAAA TCCGACTCAG 8695 |||||||||| ||||||||| |||||||| ||||||||| | |||||||| |||||| ||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 238 CCTGGGATTA TGCAGCCTGT GACGACTCGT CGTGCCTGCG ACGGTCCGTC TTGCTGCTCC 8755 ||||||||| ||| ||||| || | | ||| |||||||||| |||||||||| ||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 297 GTCACAGAGT TCAGAGACTC AATTTCCCTT AAAGAGTCTG TGACGGTCCG TCACGCCTGT 8815 ||| || || |||||||||| ||||||| |||||||||| |||||||||| ||||||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 357 GACGGTCCGT CCTGCCATTC CGTTACAAAG TTCAGAGAGT CGA-TTTCAG TACCCATTTT 8874 |||||||||| | |||||||| |||||| ||| |||||||||| ||| ||| || |||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCA-ATT 416 TCAGAATTTC TAAGTGTTTT GAAACGAGAC CCCTCGACGG TCCGTCGTGC CCATGACGGT 8934 |||||||||| |||||||||| |||||||||| ||||||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 476 CCGTCGTGGG ATCCGTCGTC TCAACC-ATT TTTCCAGAAA TAACATTTGT TGCTCAAAAT 8993 |||||||||| ||||||||| |||||| || |||||| ||| ||| || || | ||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 536 GACTAAACAG GTCGTTACAC TAACACTGAT AAATGTTCTT CTCTATAATG TCTATATAGT 9053 |||||||||| ||||||||| | GACTAAACAG GTCGTTACAT T......... .......... .......... .......... 557 TGAGATTTTG AATTTGTATT GTATAAAACT TTGATATTCA ATAAATTTTA TTGATTTTGT 9113 .......... .......... .......... .......... .......... .......... 557 TGAAAGATTT GATATCCTTT TCTGTATCTA TTATTTCTCC TAATTGTTGA TTATTCTCTT 9173 .......... .......... .......... .......... .......... .......... 557 CCTTTGTCCT TTTATTTAAT ATTTTTGATA GAAAGTTATT ACTTATCATA TTTTTGGAAT 9233 .......... .......... .......... .......... .......... .......... 557 AGCTTGGTCT TGGTATTTCT TCTCTTGATC TAGTTGAAAA AATATCCATT TTTTCTGTTT 9293 .......... .......... 
.......... .......... .......... .......... 557 TTTAATTTTT TTTCTTTTTG GGGAGTAATT TCTATATTAT TCAGTTTTTC TATTAATTCT 9353 .......... .......... .......... .......... .......... .......... 557 TCAGTTTTTC TATTAATTCT TCTATTCCGG TTGTATCTAT ATTTTTACTT TCTAATTTTT 9413 .......... .......... .......... .......... .......... .......... 557 CTTTAAAGTT TTTATCTATC TTATTATTTA GTTGTAATAA AAGTGATATT ATATTATTTT 9473 .......... .......... .......... .......... .......... .......... 557 GTTGTATAAT AATTTGATCT GATTTTGAAG ATGGACTAAA TTCTTTGTAA TCTGATGGGA 9533 .......... .......... .......... .......... .......... .......... 557 GGGCTATTCT TTTTTCTATT AGTTTTGTTG CTTCTTTATG TGTTTCTGAT TCTACTAAGT 9593 .......... .......... .......... .......... .......... .......... 557 CTATCATGAA AGAACTATTT CTTTAATTGA TTCTAATTCT GTCTTTATTT CTTGTATTTT 9653 .......... .......... .......... .......... .......... .......... 557 TATTACTAAT GTATTATTAA TTTCTTCTTT AGATTTATTT TTTAACTCGT TTAATTTTTT 9713 .......... .......... .......... .......... .......... .......... 557 ATTTAAATCT ATTCCTTGTT CTTTTAGATA ATCTAAGTTA TTTTCTATCT TACTAAAAAA 9773 .......... .......... .......... .......... .......... .......... 557 CTGTTTTTGC TTATTTTGTG TTTCTTCTTG TTCACTT-AA TATTTTTATA ATTTTATTGT 9832 | | ||||| | |||| || |||| |||| |||||| | .......... .......... 
TATGTTCTTC CT-ACTTAAA TATTATTATT ATTTTA-CGA 595 TTGAT 9837 || || TTTAT 600 hqPGS_C06HBa0112G05.1-6+_SGN-E377133+ (8457 9014) ******************************************************************************** EST sequence 44 -strand 679 n (File: SGN-E550127-) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTCCT TTTCTTTTTC TTATCAAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAAAG ATTTGCACAA CCATGAATTA ATGAAAAAAT TATGACATAA 661 AATATAAAAA ATTACTCAT Predicted gene structure (within gDNA segment 7352 to 9840): Exon 1 8457 9014 ( 558 n); cDNA 2 558 ( 557 n); score: 0.883 Intron 1 9015 9793 ( 779 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0.70) Exon 2 9794 9834 ( 41 n); cDNA 559 599 ( 41 n); score: 0.695 MATCH C06HBa0112G05.1-6+ SGN-E550127- 0.883 599 0.882 C PGS_C06HBa0112G05.1-6+_SGN-E550127- (8457 9014,9794 9834) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT TTTTGTTCTT TCTATTTTCT TTATTCAAAC CCTCTTTCTT 8516 ||| | | ||||||||| || ||| ||| | | |||| |||| |||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTCCTT T-TCTTTTTC TTATCAAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGCAA TAATAACCCA CTAATTTACT 8576 |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| |||||||||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 CAAGGTTACC TTTTTTAACC CCCAAGTAAT TAGACTTATT AACATTAACC CACTAACTTT 8636 |||||||||| | |||||||| |||| ||||| |||||||||| ||||| |||| 
|||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT AAAAAACGTC CCTTAAAACA T-TAAAGAAA TCCGACTCAG 8695 |||||||||| ||||||||| |||||||| ||||||||| | |||||||| |||||| ||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA TGCAGCCTGT GACGACTCGT CGTGCCTGCG ACGGTCCGTC TTGCTGCTCC 8755 ||||||||| ||| ||||| || | | ||| |||||||||| |||||||||| ||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCAGAGACTC AATTTCCCTT AAAGAGTCTG TGACGGTCCG TCACGCCTGT 8815 ||| || || |||||||||| ||||||| |||||||||| |||||||||| ||||||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 GACGGTCCGT CCTGCCATTC CGTTACAAAG TTCAGAGAGT CGA-TTTCAG TACCCATTTT 8874 |||||||||| | |||||||| |||||| ||| |||||||||| ||| ||| || |||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCA-ATT 417 TCAGAATTTC TAAGTGTTTT GAAACGAGAC CCCTCGACGG TCCGTCGTGC CCATGACGGT 8934 |||||||||| |||||||||| |||||||||| ||||||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTGGG ATCCGTCGTC TCAACC-ATT TTTCCAGAAA TAACATTTGT TGCTCAAAAT 8993 |||||||||| ||||||||| |||||| || |||||| ||| ||| || || | ||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG GTCGTTACAC TAACACTGAT AAATGTTCTT CTCTATAATG TCTATATAGT 9053 |||||||||| ||||||||| | GACTAAACAG GTCGTTACAT T......... .......... .......... .......... 558 TGAGATTTTG AATTTGTATT GTATAAAACT TTGATATTCA ATAAATTTTA TTGATTTTGT 9113 .......... .......... .......... .......... .......... .......... 558 TGAAAGATTT GATATCCTTT TCTGTATCTA TTATTTCTCC TAATTGTTGA TTATTCTCTT 9173 .......... .......... .......... .......... .......... .......... 558 CCTTTGTCCT TTTATTTAAT ATTTTTGATA GAAAGTTATT ACTTATCATA TTTTTGGAAT 9233 .......... .......... .......... .......... .......... .......... 558 AGCTTGGTCT TGGTATTTCT TCTCTTGATC TAGTTGAAAA AATATCCATT TTTTCTGTTT 9293 .......... .......... .......... 
.......... .......... .......... 558 TTTAATTTTT TTTCTTTTTG GGGAGTAATT TCTATATTAT TCAGTTTTTC TATTAATTCT 9353 .......... .......... .......... .......... .......... .......... 558 TCAGTTTTTC TATTAATTCT TCTATTCCGG TTGTATCTAT ATTTTTACTT TCTAATTTTT 9413 .......... .......... .......... .......... .......... .......... 558 CTTTAAAGTT TTTATCTATC TTATTATTTA GTTGTAATAA AAGTGATATT ATATTATTTT 9473 .......... .......... .......... .......... .......... .......... 558 GTTGTATAAT AATTTGATCT GATTTTGAAG ATGGACTAAA TTCTTTGTAA TCTGATGGGA 9533 .......... .......... .......... .......... .......... .......... 558 GGGCTATTCT TTTTTCTATT AGTTTTGTTG CTTCTTTATG TGTTTCTGAT TCTACTAAGT 9593 .......... .......... .......... .......... .......... .......... 558 CTATCATGAA AGAACTATTT CTTTAATTGA TTCTAATTCT GTCTTTATTT CTTGTATTTT 9653 .......... .......... .......... .......... .......... .......... 558 TATTACTAAT GTATTATTAA TTTCTTCTTT AGATTTATTT TTTAACTCGT TTAATTTTTT 9713 .......... .......... .......... .......... .......... .......... 558 ATTTAAATCT ATTCCTTGTT CTTTTAGATA ATCTAAGTTA TTTTCTATCT TACTAAAAAA 9773 .......... .......... .......... .......... .......... .......... 558 CTGTTTTTGC TTATTTTGTG TTTCTTCTTG TTCACTT-AA TATTTTTATA ATTTTATTGT 9832 | | ||||| | |||| || |||| |||| |||||| | .......... .......... 
TATGTTCTTC CT-ACTTAAA TATTATTATT ATTTTACGAT 597 TT 9834 || TT 599 hqPGS_C06HBa0112G05.1-6+_SGN-E550127- (8457 9014) ******************************************************************************** EST sequence 18 +strand 558 n (File: SGN-E231589+) 1 TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA TAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT 481 CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT CAAAACGACT 541 AAACAGGTCG TTACATTT Predicted gene structure (within gDNA segment 7372 to 9840): Exon 1 8457 9012 ( 556 n); cDNA 1 555 ( 555 n); score: 0.888 MATCH C06HBa0112G05.1-6+ SGN-E231589+ 0.888 556 0.996 C PGS_C06HBa0112G05.1-6+_SGN-E231589+ (8457 9012) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT TTTTGTTCTT TCTATTTTCT TTATTCAAAC CCTCTTTCTT 8516 ||| | | ||||||||| || ||||||| | | |||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 59 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGCAA TAATAACCCA CTAATTTACT 8576 |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| |||||||||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 118 CAAGGTTACC TTTTTTAACC CCCAAGTAAT TAGACTTATT AACATTAACC CACTAACTTT 8636 |||||||||| | |||||||| |||| ||||| |||||||||| ||||| |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 178 ATAATTAAAG CAGGAATAGT AAAAAACGTC CCTTAAAACA T-TAAAGAAA TCCGACTCAG 8695 |||||||||| ||||||||| |||||||| ||||||||| | |||||||| |||||| ||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 238 
CCTGGGATTA TGCAGCCTGT GACGACTCGT CGTGCCTGCG ACGGTCCGTC TTGCTGCTCC 8755 ||||||||| ||| ||||| || | | ||| |||||||||| |||||||||| ||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 297 GTCACAGAGT TCAGAGACTC AATTTCCCTT AAAGAGTCTG TGACGGTCCG TCACGCCTGT 8815 ||| || || ||| |||||| ||||||| |||||||||| |||||||||| ||||||| || GTCGCAAGGT TCATAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 357 GACGGTCCGT CCTGCCATTC CGTTACAAAG TTCAGAGAGT CGA-TTTCAG TACCCATTTT 8874 |||||||||| | |||||||| |||||| ||| |||||||||| ||| ||| || |||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCA-ATT 416 TCAGAATTTC TAAGTGTTTT GAAACGAGAC CCCTCGACGG TCCGTCGTGC CCATGACGGT 8934 |||||||||| |||||||||| |||||||||| ||||||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 476 CCGTCGTGGG ATCCGTCGTC TCAACC-ATT TTTCCAGAAA TAACATTTGT TGCTCAAAAT 8993 |||||||||| ||||||||| |||||| || |||||| ||| ||| || || | ||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 536 GACTAAACAG GTCGTTACA 9012 |||||||||| ||||||||| GACTAAACAG GTCGTTACA 555 hqPGS_C06HBa0112G05.1-6+_SGN-E231589+ (8457 9012) ******************************************************************************** EST sequence 22 +strand 649 n (File: SGN-E374999+) 1 TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT 481 CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT CCAAACGACT 541 AAACAGGTCG TTACATTTAT GTTCTTCCTA CTTAAATATT ATTATTATTT TACGATTTAT 601 AACACTATTA GAAACAAAGA 
TTTTCTTCAC CCTGAATTAA TGAAAAAAT Predicted gene structure (within gDNA segment 7372 to 9840): Exon 1 8457 9012 ( 556 n); cDNA 1 555 ( 555 n); score: 0.888 MATCH C06HBa0112G05.1-6+ SGN-E374999+ 0.888 556 0.857 C PGS_C06HBa0112G05.1-6+_SGN-E374999+ (8457 9012) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT TTTTGTTCTT TCTATTTTCT TTATTCAAAC CCTCTTTCTT 8516 ||| | | ||||||||| || ||||||| | | |||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 59 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGCAA TAATAACCCA CTAATTTACT 8576 |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| |||||||||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 118 CAAGGTTACC TTTTTTAACC CCCAAGTAAT TAGACTTATT AACATTAACC CACTAACTTT 8636 |||||||||| | |||||||| |||| ||||| |||||||||| ||||| |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 178 ATAATTAAAG CAGGAATAGT AAAAAACGTC CCTTAAAACA T-TAAAGAAA TCCGACTCAG 8695 |||||||||| ||||||||| |||||||| ||||||||| | |||||||| |||||| ||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 238 CCTGGGATTA TGCAGCCTGT GACGACTCGT CGTGCCTGCG ACGGTCCGTC TTGCTGCTCC 8755 ||||||||| ||| ||||| || | | ||| |||||||||| |||||||||| ||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 297 GTCACAGAGT TCAGAGACTC AATTTCCCTT AAAGAGTCTG TGACGGTCCG TCACGCCTGT 8815 ||| || || |||||||||| ||||||| |||||||||| |||||||||| ||||||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 357 GACGGTCCGT CCTGCCATTC CGTTACAAAG TTCAGAGAGT CGA-TTTCAG TACCCATTTT 8874 |||||||||| | |||||||| |||||| ||| |||||||||| ||| ||| || |||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCA-ATT 416 TCAGAATTTC TAAGTGTTTT GAAACGAGAC CCCTCGACGG TCCGTCGTGC CCATGACGGT 8934 |||||||||| |||||||||| |||||||||| ||||||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 476 CCGTCGTGGG ATCCGTCGTC TCAACC-ATT TTTCCAGAAA TAACATTTGT 
TGCTCAAAAT 8993 |||||||||| ||||||||| |||||| || |||||| ||| ||| || || | ||| ||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCCAAAC 536 GACTAAACAG GTCGTTACA 9012 |||||||||| ||||||||| GACTAAACAG GTCGTTACA 555 hqPGS_C06HBa0112G05.1-6+_SGN-E374999+ (8457 9012) ******************************************************************************** EST sequence 11 +strand 720 n (File: SGN-E389834+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTA CGTCGACTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAGAACGAC 541 TAAACAGGAC GTTACATTTA TGATCGTCCT ACTTAAATAT CATTATTATT TTACGATTTA 601 TAACACTATT AGAAACGAAG ATTTTCTCGA CCATGAATTA ATGAAAAAAT ATGCCATGAA 661 ATATAAAAAT TTACTCGTTC TTCATTGAGC TATTCGTGAA AAAAAAAAAA AAATCGAGGG Predicted gene structure (within gDNA segment 7362 to 9840): Exon 1 8457 9003 ( 547 n); cDNA 2 547 ( 546 n); score: 0.882 PPA cDNA 699 714 MATCH C06HBa0112G05.1-6+ SGN-E389834+ 0.882 547 0.760 C PGS_C06HBa0112G05.1-6+_SGN-E389834+ (8457 9003) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT TTTTGTTCTT TCTATTTTCT TTATTCAAAC CCTCTTTCTT 8516 ||| | | ||||||||| || ||||||| | | |||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGCAA TAATAACCCA CTAATTTACT 8576 |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| |||||||||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 CAAGGTTACC TTTTTTAACC CCCAAGTAAT TAGACTTATT AACATTAACC 
CACTAACTTT 8636 |||||||||| | |||||||| |||| ||||| |||||||||| ||||| |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT AAAAAACGTC CCTTAAAACA T-TAAAGAAA TCCGACTCAG 8695 |||||||||| ||||||||| |||||||| ||||||||| | |||||||| |||||| ||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA TGCAGCCTGT GACGACTCGT CGTGCCTGCG ACGGTCCGTC TTGCTGCTCC 8755 ||||||||| ||| ||||| || | | ||| |||||||||| |||||||||| ||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCAGAGACTC AATTTCCCTT AAAGAGTCTG TGACGGTCCG TCACGCCTGT 8815 ||| || || |||||||||| ||||||| |||||||||| |||||||||| ||||||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 GACGGTCCGT CCTGCCATTC CGTTACAAAG TTCAGAGAGT CGA-TTTCAG TACCCATTTT 8874 |||||||||| | |||||||| |||||| ||| |||||||||| ||| ||| || |||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCA-ATT 417 TCAGAATTTC TAAGTGTTTT GAAACGAGAC CCCTCGACGG TCCGTCGTGC CCATGACGGT 8934 |||||||||| |||||||||| |||||||||| ||||||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTGGG ATCCGTCGTC TCAACC-ATT TTTCCAGAAA TAACATTTGT TGCTCAAAAT 8993 |||||||||| | ||||| | |||||| || |||||| ||| ||| || || | |||| || CCGTCGTGGG TTACGTCGAC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAGAAC 537 GACTAAACAG 9003 |||||||||| GACTAAACAG 547 hqPGS_C06HBa0112G05.1-6+_SGN-E389834+ (8457 9003) ******************************************************************************** EST sequence 25 +strand 618 n (File: SGN-E396054+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG 
ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC 541 TAAACAGGTC GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA 601 TAACACTATT AGAAACAA Predicted gene structure (within gDNA segment 7362 to 9840): Exon 1 8457 9003 ( 547 n); cDNA 2 547 ( 546 n); score: 0.888 MATCH C06HBa0112G05.1-6+ SGN-E396054+ 0.888 547 0.885 C PGS_C06HBa0112G05.1-6+_SGN-E396054+ (8457 9003) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT TTTTGTTCTT TCTATTTTCT TTATTCAAAC CCTCTTTCTT 8516 ||| | | ||||||||| || ||||||| | | |||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGCAA TAATAACCCA CTAATTTACT 8576 |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| |||||||||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 CAAGGTTACC TTTTTTAACC CCCAAGTAAT TAGACTTATT AACATTAACC CACTAACTTT 8636 |||||||||| | |||||||| |||| ||||| |||||||||| ||||| |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT AAAAAACGTC CCTTAAAACA T-TAAAGAAA TCCGACTCAG 8695 |||||||||| ||||||||| |||||||| ||||||||| | |||||||| |||||| ||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA TGCAGCCTGT GACGACTCGT CGTGCCTGCG ACGGTCCGTC TTGCTGCTCC 8755 ||||||||| ||| ||||| || | | ||| |||||||||| |||||||||| ||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCAGAGACTC AATTTCCCTT AAAGAGTCTG TGACGGTCCG TCACGCCTGT 8815 ||| || || |||||||||| ||||||| |||||||||| |||||||||| ||||||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 GACGGTCCGT CCTGCCATTC CGTTACAAAG TTCAGAGAGT CGA-TTTCAG TACCCATTTT 8874 |||||||||| | |||||||| |||||| ||| |||||||||| ||| ||| || |||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG 
TTCAGAGAGT CGATTTTTAG TACCCA-ATT 417 TCAGAATTTC TAAGTGTTTT GAAACGAGAC CCCTCGACGG TCCGTCGTGC CCATGACGGT 8934 |||||||||| |||||||||| |||||||||| ||||||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTGGG ATCCGTCGTC TCAACC-ATT TTTCCAGAAA TAACATTTGT TGCTCAAAAT 8993 |||||||||| ||||||||| |||||| || |||||| ||| ||| || || | ||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 537 GACTAAACAG 9003 |||||||||| GACTAAACAG 547 hqPGS_C06HBa0112G05.1-6+_SGN-E396054+ (8457 9003) ******************************************************************************** EST sequence 27 +strand 610 n (File: SGN-E396058+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA 421 GAATTTCTAA GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG 481 TCGTGGGTTC CGTCGTCTCA ACCTGTGTTT CCAAAAATAA AATCTGCTAC TCACAACGAC 541 TAAACAGGTC GTTACATTTA GGTTCTTCAT AGTTAACTAT TATTATTATT TTACGATTTA 601 TAACACTATT Predicted gene structure (within gDNA segment 7362 to 9840): Exon 1 8457 9003 ( 547 n); cDNA 2 547 ( 546 n); score: 0.884 MATCH C06HBa0112G05.1-6+ SGN-E396058+ 0.884 547 0.897 C PGS_C06HBa0112G05.1-6+_SGN-E396058+ (8457 9003) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT TTTTGTTCTT TCTATTTTCT TTATTCAAAC CCTCTTTCTT 8516 ||| | | ||||||||| || ||||||| | | |||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGCAA TAATAACCCA CTAATTTACT 8576 |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| |||||||||| 
TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 CAAGGTTACC TTTTTTAACC CCCAAGTAAT TAGACTTATT AACATTAACC CACTAACTTT 8636 |||||||||| | |||||||| |||| ||||| |||||||||| ||||| |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT AAAAAACGTC CCTTAAAACA T-TAAAGAAA TCCGACTCAG 8695 |||||||||| ||||||||| |||||||| ||||||||| | |||||||| |||||| ||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA TGCAGCCTGT GACGACTCGT CGTGCCTGCG ACGGTCCGTC TTGCTGCTCC 8755 ||||||||| ||| ||||| || | | ||| |||||||||| |||||||||| ||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCAGAGACTC AATTTCCCTT AAAGAGTCTG TGACGGTCCG TCACGCCTGT 8815 ||| || || |||||||||| ||||||| |||||||||| |||||||||| ||||||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 GACGGTCCGT CCTGCCATTC CGTTACAAAG TTCAGAGAGT CGA-TTTCAG TACCCATTTT 8874 |||||||||| | |||||||| |||||| ||| |||||||||| ||| ||| || |||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCA-ATT 417 TCAGAATTTC TAAGTGTTTT GAAACGAGAC CCCTCGACGG TCCGTCGTGC CCATGACGGT 8934 |||||||||| |||||||||| |||||||||| ||||||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 477 CCGTCGTGGG ATCCGTCGTC TCAACC-ATT TTTCCAGAAA TAACATTTGT TGCTCAAAAT 8993 |||||||||| ||||||||| |||||| | |||||| ||| ||| || || | |||| || CCGTCGTGGG TTCCGTCGTC TCAACCTGTG TTTCCAAAAA TAAAATCTGC TACTCACAAC 537 GACTAAACAG 9003 |||||||||| GACTAAACAG 547 hqPGS_C06HBa0112G05.1-6+_SGN-E396058+ (8457 9003) ******************************************************************************** EST sequence 41 +strand 545 n (File: SGN-E241959+) 1 TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC 
CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT 481 CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT CAAAACGACT 541 AAACA Predicted gene structure (within gDNA segment 7372 to 9840): Exon 1 8457 9002 ( 546 n); cDNA 1 545 ( 545 n); score: 0.887 MATCH C06HBa0112G05.1-6+ SGN-E241959+ 0.887 546 1.002 C PGS_C06HBa0112G05.1-6+_SGN-E241959+ (8457 9002) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT TTTTGTTCTT TCTATTTTCT TTATTCAAAC CCTCTTTCTT 8516 ||| | | ||||||||| || ||||||| | | |||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 59 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGCAA TAATAACCCA CTAATTTACT 8576 |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| |||||||||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 118 CAAGGTTACC TTTTTTAACC CCCAAGTAAT TAGACTTATT AACATTAACC CACTAACTTT 8636 |||||||||| | |||||||| |||| ||||| |||||||||| ||||| |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 178 ATAATTAAAG CAGGAATAGT AAAAAACGTC CCTTAAAACA T-TAAAGAAA TCCGACTCAG 8695 |||||||||| ||||||||| |||||||| ||||||||| | |||||||| |||||| ||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 238 CCTGGGATTA TGCAGCCTGT GACGACTCGT CGTGCCTGCG ACGGTCCGTC TTGCTGCTCC 8755 ||||||||| ||| ||||| || | | ||| |||||||||| |||||||||| ||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 297 GTCACAGAGT TCAGAGACTC AATTTCCCTT AAAGAGTCTG TGACGGTCCG TCACGCCTGT 8815 ||| || || |||||||||| ||||||| |||||||||| |||||||||| ||||||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 357 GACGGTCCGT CCTGCCATTC CGTTACAAAG TTCAGAGAGT CGA-TTTCAG TACCCATTTT 8874 |||||||||| | |||||||| |||||| ||| |||||||||| ||| ||| || 
|||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCA-ATT 416 TCAGAATTTC TAAGTGTTTT GAAACGAGAC CCCTCGACGG TCCGTCGTGC CCATGACGGT 8934 |||||||||| |||||||||| |||||||||| ||||||||| ||| |||||| ||||||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT 476 CCGTCGTGGG ATCCGTCGTC TCAACC-ATT TTTCCAGAAA TAACATTTGT TGCTCAAAAT 8993 |||||||||| ||||||||| |||||| || |||||| ||| ||| || || | ||||||| CCGTCGTGGG TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC 536 GACTAAACA 9002 ||||||||| GACTAAACA 545 hqPGS_C06HBa0112G05.1-6+_SGN-E241959+ (8457 9002) ******************************************************************************** EST sequence 29 +strand 472 n (File: SGN-E236652+) 1 TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC CTCCTTCTTT 61 TACCCTAATT AGCATATAAT TAAGAATAAA AGATGGAATA ATAACCCACT AATTTACTCA 121 AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA CTAACTTTAT 181 AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC CGACCCAGAC 241 TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT GCAGGTCGTC 301 GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA CGCCCGTGAC 361 GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC CCAATTTCAG 421 AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GA Predicted gene structure (within gDNA segment 7372 to 9774): Exon 1 8457 8930 ( 474 n); cDNA 1 472 ( 472 n); score: 0.892 MATCH C06HBa0112G05.1-6+ SGN-E236652+ 0.892 474 1.004 C PGS_C06HBa0112G05.1-6+_SGN-E236652+ (8457 8930) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT TTTTGTTCTT TCTATTTTCT TTATTCAAAC CCTCTTTCTT 8516 ||| | | ||||||||| || ||||||| | | |||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 59 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGCAA TAATAACCCA CTAATTTACT 8576 |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| |||||||||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 118 CAAGGTTACC TTTTTTAACC CCCAAGTAAT TAGACTTATT AACATTAACC 
CACTAACTTT 8636 |||||||||| | |||||||| |||| ||||| |||||||||| ||||| |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 178 ATAATTAAAG CAGGAATAGT AAAAAACGTC CCTTAAAACA T-TAAAGAAA TCCGACTCAG 8695 |||||||||| ||||||||| |||||||| ||||||||| | |||||||| |||||| ||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 238 CCTGGGATTA TGCAGCCTGT GACGACTCGT CGTGCCTGCG ACGGTCCGTC TTGCTGCTCC 8755 ||||||||| ||| ||||| || | | ||| |||||||||| |||||||||| ||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 297 GTCACAGAGT TCAGAGACTC AATTTCCCTT AAAGAGTCTG TGACGGTCCG TCACGCCTGT 8815 ||| || || |||||||||| ||||||| |||||||||| |||||||||| ||||||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 357 GACGGTCCGT CCTGCCATTC CGTTACAAAG TTCAGAGAGT CGA-TTTCAG TACCCATTTT 8874 |||||||||| | |||||||| |||||| ||| |||||||||| ||| ||| || |||||| || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCA-ATT 416 TCAGAATTTC TAAGTGTTTT GAAACGAGAC CCCTCGACGG TCCGTCGTGC CCATGA 8930 |||||||||| |||||||||| |||||||||| ||||||||| ||| |||||| ||||| TCAGAATTTC TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGA 472 hqPGS_C06HBa0112G05.1-6+_SGN-E236652+ (8457 8930) ******************************************************************************** EST sequence 28 +strand 454 n (File: SGN-E396070+) 1 ATCGATTCTC CTTCTCTCTC TTTCTGTTCT TTTCTTTTTC TTATTCAAAC CCTCCTTCTT 61 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC 121 AAGGTTACCT CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA 181 TAATTAAAGT AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA 241 CTGGGATTAC GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT 301 CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA 361 CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCGGGGGGGG 421 GAGTTTCTAA TTGTTTTGAA ACTAGACTCC TCGA Predicted gene structure (within gDNA segment 7362 to 9840): Exon 1 8457 8911 ( 455 n); cDNA 2 454 ( 453 n); score: 0.870 MATCH 
C06HBa0112G05.1-6+ SGN-E396070+ 0.870 455 1.002 C PGS_C06HBa0112G05.1-6+_SGN-E396070+ (8457 8911) Alignment (genomic DNA sequence = upper lines): TCGTTCGATC CTCTCTCTCT TTTTGTTCTT TCTATTTTCT TTATTCAAAC CCTCTTTCTT 8516 ||| | | ||||||||| || ||||||| | | |||| |||||||||| |||| ||||| TCGATTCTCC TTCTCTCTCT TTCTGTTCTT T-TCTTTTTC TTATTCAAAC CCTCCTTCTT 60 TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGGCAA TAATAACCCA CTAATTTACT 8576 |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| |||||||||| TTACCCTAAT TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT 119 CAAGGTTACC TTTTTTAACC CCCAAGTAAT TAGACTTATT AACATTAACC CACTAACTTT 8636 |||||||||| | |||||||| |||| ||||| |||||||||| ||||| |||| |||||||||| CAAGGTTACC TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT 179 ATAATTAAAG CAGGAATAGT AAAAAACGTC CCTTAAAACA T-TAAAGAAA TCCGACTCAG 8695 |||||||||| ||||||||| |||||||| ||||||||| | |||||||| |||||| ||| ATAATTAAAG TAGGAATAGT CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 239 CCTGGGATTA TGCAGCCTGT GACGACTCGT CGTGCCTGCG ACGGTCCGTC TTGCTGCTCC 8755 ||||||||| ||| ||||| || | | ||| |||||||||| |||||||||| ||| | | | ACTGGGATTA CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C 298 GTCACAGAGT TCAGAGACTC AATTTCCCTT AAAGAGTCTG TGACGGTCCG TCACGCCTGT 8815 ||| || || |||||||||| ||||||| |||||||||| |||||||||| ||||||| || GTCGCAAGGT TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT 358 GACGGTCCGT CCTGCCATTC CGTTACAAAG TTCAGAGAGT CGA-TTTCAG TACCCATTTT 8874 |||||||||| | |||||||| |||||| ||| |||||||||| ||| ||| || || || GACGGTCCGT CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TA-CCGGGGG 417 TCAGAATTTC TAAGTGTTTT GAAACGAGAC CCCTCGA 8911 || |||| ||| |||||| ||||| |||| |||||| GGGGAGTTTC TAATTGTTTT GAAACTAGAC TCCTCGA 454 hqPGS_C06HBa0112G05.1-6+_SGN-E396070+ (8457 8911) ******************************************************************************** EST sequence 6 +strand 726 n (File: SGN-E550322+) 1 TCGCACCAGA TCGATTCTCC TTCTCTCTCT TTCTGTTCTT TTCTTTTTCT TATTCAAACC 61 CTCCTTCTTT TACCCTAATT AGCATATAAT TAAGAATAAA 
AGATGGAATA ATAACCCACT 121 AATTTACTCA AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA 181 CTAACTTTAT AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC 241 CGACCCAGAC TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT 301 GCAGGTCGTC GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA 361 CGCCCGTGAC GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC 421 CCAATTTCAG AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT 481 GACGGTCCGT CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT 541 CAAAACGACT AAACAGGTCG TTACATTTAT GTTCTTCCTA CTTAAATATT ATTATTATTT 601 TACGATTTAT AACACTATTA GAAACAAAGA TTTTCTCAAC CATGAATTAA TGAAAAAATT 661 ATGCCATAAA ATATAAAAAA TTTACTCATT TTTCATTGAG CTAATTCATA AAAAAAAAAA 721 AAAAAC Predicted gene structure (within gDNA segment 7272 to 9840): Exon 1 8468 9014 ( 547 n); cDNA 22 567 ( 546 n); score: 0.897 Intron 1 9015 9793 ( 779 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0.69) Exon 2 9794 9837 ( 44 n); cDNA 568 610 ( 43 n); score: 0.693 PPA cDNA 708 725 MATCH C06HBa0112G05.1-6+ SGN-E550322+ 0.897 591 0.814 C PGS_C06HBa0112G05.1-6+_SGN-E550322+ (8468 9014,9794 9837) Alignment (genomic DNA sequence = upper lines): TCTCTCTCTT TTTGTTCTTT CTATTTTCTT TATTCAAACC CTCTTTCTTT TACCCTAATT 8527 |||||||||| | |||||||| ||||||| ||||||||| ||| |||||| |||||||||| TCTCTCTCTT TCTGTTCTTT TCTTTTTCTT -ATTCAAACC CTCCTTCTTT TACCCTAATT 80 AGCATATAAT TAAGAATAAA AGATGGCAAT AATAACCCAC TAATTTACTC AAGGTTACCT 8587 |||||||||| |||||||||| |||||| ||| |||||||||| |||||||||| |||||||||| AGCATATAAT TAAGAATAAA AGATGG-AAT AATAACCCAC TAATTTACTC AAGGTTACCT 139 TTTTTAACCC CCAAGTAATT AGACTTATTA ACATTAACCC ACTAACTTTA TAATTAAAGC 8647 ||||||||| ||| |||||| |||||||||| |||| ||||| |||||||||| ||||||||| CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA TAATTAAAGT 199 AGGAATAGTA AAAAACGTCC CTTAAAACAT -TAAAGAAAT CCGACTCAGC CTGGGATTAT 8706 ||||||||| ||||||||| |||||||| | ||||||||| ||||| ||| ||||||||| AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA CTGGGATTAC 259 GCAGCCTGTG 
ACGACTCGTC GTGCCTGCGA CGGTCCGTCT TGCTGCTCCG TCACAGAGTT 8766 ||| |||||| | | | |||| |||||||||| ||||||||| ||| | | || || || ||| GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGT-CG TCGCAAGGTT 318 CAGAGACTCA ATTTCCCTTA AAGAGTCTGT GACGGTCCGT CACGCCTGTG ACGGTCCGTC 8826 |||||||||| |||||| | |||||||||| |||||||||| |||||| ||| |||||||||| CAGAGACTCA ATTTCCACCA AAGAGTCTGT GACGGTCCGT CACGCCCGTG ACGGTCCGTC 378 CTGCCATTCC GTTACAAAGT TCAGAGAGTC GA-TTTCAGT ACCCATTTTT CAGAATTTCT 8885 ||||||||| ||||| |||| |||||||||| || ||| ||| ||||| ||| |||||||||| GTGCCATTCC GTTACGAAGT TCAGAGAGTC GATTTTTAGT ACCCA-ATTT CAGAATTTCT 437 AAGTGTTTTG AAACGAGACC CCTCGACGGT CCGTCGTGCC CATGACGGTC CGTCGTGGGA 8945 |||||||||| ||||||||| |||||||||| || |||||| |||||||||| ||||||||| AAGTGTTTTG AAACGAGACT CCTCGACGGT CCATCGTGCT CATGACGGTC CGTCGTGGGT 497 TCCGTCGTCT CAACC-ATTT TTCCAGAAAT AACATTTGTT GCTCAAAATG ACTAAACAGG 9004 |||||||||| ||||| ||| ||||| |||| || || || | ||||||| | |||||||||| TCCGTCGTCT CAACCTGTTT TTCCAAAAAT AAAATCTGCT ACTCAAAACG ACTAAACAGG 557 TCGTTACACT AACACTGATA AATGTTCTTC TCTATAATGT CTATATAGTT GAGATTTTGA 9064 |||||||| | TCGTTACATT .......... .......... .......... .......... .......... 567 ATTTGTATTG TATAAAACTT TGATATTCAA TAAATTTTAT TGATTTTGTT GAAAGATTTG 9124 .......... .......... .......... .......... .......... .......... 567 ATATCCTTTT CTGTATCTAT TATTTCTCCT AATTGTTGAT TATTCTCTTC CTTTGTCCTT 9184 .......... .......... .......... .......... .......... .......... 567 TTATTTAATA TTTTTGATAG AAAGTTATTA CTTATCATAT TTTTGGAATA GCTTGGTCTT 9244 .......... .......... .......... .......... .......... .......... 567 GGTATTTCTT CTCTTGATCT AGTTGAAAAA ATATCCATTT TTTCTGTTTT TTAATTTTTT 9304 .......... .......... .......... .......... .......... .......... 567 TTCTTTTTGG GGAGTAATTT CTATATTATT CAGTTTTTCT ATTAATTCTT CAGTTTTTCT 9364 .......... .......... .......... .......... .......... .......... 567 ATTAATTCTT CTATTCCGGT TGTATCTATA TTTTTACTTT CTAATTTTTC TTTAAAGTTT 9424 .......... .......... .......... .......... .......... 
.......... 567 TTATCTATCT TATTATTTAG TTGTAATAAA AGTGATATTA TATTATTTTG TTGTATAATA 9484 .......... .......... .......... .......... .......... .......... 567 ATTTGATCTG ATTTTGAAGA TGGACTAAAT TCTTTGTAAT CTGATGGGAG GGCTATTCTT 9544 .......... .......... .......... .......... .......... .......... 567 TTTTCTATTA GTTTTGTTGC TTCTTTATGT GTTTCTGATT CTACTAAGTC TATCATGAAA 9604 .......... .......... .......... .......... .......... .......... 567 GAACTATTTC TTTAATTGAT TCTAATTCTG TCTTTATTTC TTGTATTTTT ATTACTAATG 9664 .......... .......... .......... .......... .......... .......... 567 TATTATTAAT TTCTTCTTTA GATTTATTTT TTAACTCGTT TAATTTTTTA TTTAAATCTA 9724 .......... .......... .......... .......... .......... .......... 567 TTCCTTGTTC TTTTAGATAA TCTAAGTTAT TTTCTATCTT ACTAAAAAAC TGTTTTTGCT 9784 .......... .......... .......... .......... .......... .......... 567 TATTTTGTGT TTCTTCTTGT TCACTT-AAT ATTTTTATAA TTTTATTGTT TGAT 9837 | | ||||| | |||| ||| ||| |||| | ||||| | | | || .........T ATGTTCTTCC T-ACTTAAAT ATTATTATTA TTTTA-CGAT TTAT 610 hqPGS_C06HBa0112G05.1-6+_SGN-E550322+ (8468 9014) ******************************************************************************** EST sequence 57 -strand 674 n (File: SGN-E396057-) 1 TTTTTTATTC AAACCTTCTT TTTTTTACCC TAATTAGCAT ATAATTAAGA ATAAAAGATG 61 GAATAATAAC CCACTAATTT ACTCAAGGTT ACCTCTTTTA ACCCCCAGGT AATTAGACTT 121 ATTAACATAA ACCCACTAAC TTTATAATTA AAGTAGGAAT AGTCCAAAAC GTCCCTTAAA 181 ACGTGTAAAG AAATCCGACC CAGACTGGGA TTACGCAACC TGTGATGGCC CGTCGTGCCT 241 GCGACGGTCC GTCCTGCAGG TCGTCGCAAG GTTCAGAGAC TCAATTTCCA CCAAAGAGTC 301 TGTGACGGTC CGTCACGCCC GTGACGGTCC GTCGTGCCAT TCCGTTACGA AGTTCAGAGA 361 GTCGATTTTT AGTACCCAAT TTCAGAATTT TTAAGTGTTT TGAAACGAGA CTCCTCGACG 421 GTCCATCGTG CTCATGACGG TCCGTCGTGG GTTCCGTCGT CTCAACCTGT TTTTCCAAAA 481 ATAAAATCTG CTACTCAAAA CGACTAAACA GGTCGTTACA TTTATGTTCT TCCTACTTAA 541 ATATTATTAT TATTTTACGA TTTATAACAC TATTAGAAAC AAAGATTTTC TCAACCATGA 601 ATTAATGAAA AAATTATGCC ATAAAATATA AAAAATTTAC TCATTTTTCA TTGAGCTAAT 661 
TCATAAAAAA AAAA Predicted gene structure (within gDNA segment 7685 to 9840): Exon 1 8493 9014 ( 522 n); cDNA 1 522 ( 522 n); score: 0.896 Intron 1 9015 9793 ( 779 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0.69) Exon 2 9794 9837 ( 44 n); cDNA 523 565 ( 43 n); score: 0.693 PPA cDNA 663 674 MATCH C06HBa0112G05.1-6+ SGN-E396057- 0.896 566 0.840 C PGS_C06HBa0112G05.1-6+_SGN-E396057- (8493 9014,9794 9837) Alignment (genomic DNA sequence = upper lines): TTCTTTATTC AAACCCTCTT TCTTTTACCC TAATTAGCAT ATAATTAAGA ATAAAAGATG 8552 || ||||||| ||||| |||| | |||||||| |||||||||| |||||||||| |||||||||| TTTTTTATTC AAACCTTCTT TTTTTTACCC TAATTAGCAT ATAATTAAGA ATAAAAGATG 60 GCAATAATAA CCCACTAATT TACTCAAGGT TACCTTTTTT AACCCCCAAG TAATTAGACT 8612 | |||||||| |||||||||| |||||||||| ||||| |||| |||||||| | |||||||||| G-AATAATAA CCCACTAATT TACTCAAGGT TACCTCTTTT AACCCCCAGG TAATTAGACT 119 TATTAACATT AACCCACTAA CTTTATAATT AAAGCAGGAA TAGTAAAAAA CGTCCCTTAA 8672 ||||||||| |||||||||| |||||||||| |||| ||||| |||| |||| |||||||||| TATTAACATA AACCCACTAA CTTTATAATT AAAGTAGGAA TAGTCCAAAA CGTCCCTTAA 179 AACAT-TAAA GAAATCCGAC TCAGCCTGGG ATTATGCAGC CTGTGACGAC TCGTCGTGCC 8731 ||| | |||| |||||||||| ||| ||||| |||| ||| | |||||| | | ||||||||| AACGTGTAAA GAAATCCGAC CCAGACTGGG ATTACGCAAC CTGTGATGGC CCGTCGTGCC 239 TGCGACGGTC CGTCTTGCTG CTCCGTCACA GAGTTCAGAG ACTCAATTTC CCTTAAAGAG 8791 |||||||||| |||| ||| | | |||| || |||||||| |||||||||| | |||||| TGCGACGGTC CGTCCTGCAG GT-CGTCGCA AGGTTCAGAG ACTCAATTTC CACCAAAGAG 298 TCTGTGACGG TCCGTCACGC CTGTGACGGT CCGTCCTGCC ATTCCGTTAC AAAGTTCAGA 8851 |||||||||| |||||||||| | |||||||| ||||| |||| |||||||||| ||||||||| TCTGTGACGG TCCGTCACGC CCGTGACGGT CCGTCGTGCC ATTCCGTTAC GAAGTTCAGA 358 GAGTCGA-TT TCAGTACCCA TTTTTCAGAA TTTCTAAGTG TTTTGAAACG AGACCCCTCG 8910 ||||||| || | |||||||| |||||||| ||| |||||| |||||||||| |||| ||||| GAGTCGATTT TTAGTACCCA -ATTTCAGAA TTTTTAAGTG TTTTGAAACG AGACTCCTCG 417 ACGGTCCGTC GTGCCCATGA CGGTCCGTCG TGGGATCCGT CGTCTCAACC -ATTTTTCCA 8969 ||||||| || |||| ||||| |||||||||| |||| ||||| 
|||||||||| |||||||| ACGGTCCATC GTGCTCATGA CGGTCCGTCG TGGGTTCCGT CGTCTCAACC TGTTTTTCCA 477 GAAATAACAT TTGTTGCTCA AAATGACTAA ACAGGTCGTT ACACTAACAC TGATAAATGT 9029 |||||| || || | |||| ||| |||||| |||||||||| ||| | AAAATAAAAT CTGCTACTCA AAACGACTAA ACAGGTCGTT ACATT..... .......... 522 TCTTCTCTAT AATGTCTATA TAGTTGAGAT TTTGAATTTG TATTGTATAA AACTTTGATA 9089 .......... .......... .......... .......... .......... .......... 522 TTCAATAAAT TTTATTGATT TTGTTGAAAG ATTTGATATC CTTTTCTGTA TCTATTATTT 9149 .......... .......... .......... .......... .......... .......... 522 CTCCTAATTG TTGATTATTC TCTTCCTTTG TCCTTTTATT TAATATTTTT GATAGAAAGT 9209 .......... .......... .......... .......... .......... .......... 522 TATTACTTAT CATATTTTTG GAATAGCTTG GTCTTGGTAT TTCTTCTCTT GATCTAGTTG 9269 .......... .......... .......... .......... .......... .......... 522 AAAAAATATC CATTTTTTCT GTTTTTTAAT TTTTTTTCTT TTTGGGGAGT AATTTCTATA 9329 .......... .......... .......... .......... .......... .......... 522 TTATTCAGTT TTTCTATTAA TTCTTCAGTT TTTCTATTAA TTCTTCTATT CCGGTTGTAT 9389 .......... .......... .......... .......... .......... .......... 522 CTATATTTTT ACTTTCTAAT TTTTCTTTAA AGTTTTTATC TATCTTATTA TTTAGTTGTA 9449 .......... .......... .......... .......... .......... .......... 522 ATAAAAGTGA TATTATATTA TTTTGTTGTA TAATAATTTG ATCTGATTTT GAAGATGGAC 9509 .......... .......... .......... .......... .......... .......... 522 TAAATTCTTT GTAATCTGAT GGGAGGGCTA TTCTTTTTTC TATTAGTTTT GTTGCTTCTT 9569 .......... .......... .......... .......... .......... .......... 522 TATGTGTTTC TGATTCTACT AAGTCTATCA TGAAAGAACT ATTTCTTTAA TTGATTCTAA 9629 .......... .......... .......... .......... .......... .......... 522 TTCTGTCTTT ATTTCTTGTA TTTTTATTAC TAATGTATTA TTAATTTCTT CTTTAGATTT 9689 .......... .......... .......... .......... .......... .......... 522 ATTTTTTAAC TCGTTTAATT TTTTATTTAA ATCTATTCCT TGTTCTTTTA GATAATCTAA 9749 .......... .......... .......... .......... .......... .......... 
522 GTTATTTTCT ATCTTACTAA AAAACTGTTT TTGCTTATTT TGTGTTTCTT CTTGTTCACT 9809 | | || ||| | ||| .......... .......... .......... .......... ....TATGTT CTTCCT-ACT 537 T-AATATTTT TATAATTTTA TTGTTTGAT 9837 | |||||| | ||| |||||| | || || TAAATATTAT TATTATTTTA -CGATTTAT 565 hqPGS_C06HBa0112G05.1-6+_SGN-E396057- (8493 9014) ******************************************************************************** EST sequence 64 -strand 548 n (File: SGN-E356257-) 1 GTTAACTAGA AAATTAAAGT GATAGAGTCA AATAATGTAA CGACCCGTTT AGTCGTTTTG 61 AGCAGCAGAC TTTATTTCTG GAAAAACTGG CAGAAGCGAC GGACCCCACG ACGGACCGTC 121 ATGGGCACGA CGGACCATCG CAGGGTCTCG TTTCAAAACC CTCTTTCTTT TACCCCAAAT 181 TAACATATAA TTAAGAATAA AAGATGGCAA TAATACCCCA CTAATTAACT TAGGGTTACC 241 TCTTTTAACC CCAAGAATTT GAGTTATTAA TATAAACCCA CGAAATCTAT AATTAAGGAA 301 AGAATAGTCC AAAAACGTCC CTTAAAACGT GTAAGGAAAT CCGATTCTGC CTGGGATTTG 361 CGCAACCTGT GACGGGCCGT CGTGACTGTG ACGGTCCGTC CTGCAGGTCG TCGCAAGGGT 421 CAGAGAGTCA ATTTCCACTG AACAATCTAT GACGGTCCGT CACGCCTGTG ATGGTCCGTC 481 CTGTCATTCC GTCACGAAGT TCAGAGAGTC GATTTTCAGT ACCCAATTTC AGATTTTCTA 541 AGTGTTTT Predicted gene structure (within gDNA segment 6343 to 9840): Exon 1 8503 8894 ( 392 n); cDNA 156 548 ( 393 n); score: 0.830 MATCH C06HBa0112G05.1-6+ SGN-E356257- 0.830 392 0.715 C PGS_C06HBa0112G05.1-6+_SGN-E356257- (8503 8894) Alignment (genomic DNA sequence = upper lines): AAACCCTCTT TCTTTTA-CC CTAATTAGCA TATAATTAAG AATAAAAGAT GGCAATAATA 8561 |||||||||| ||||||| || | ||||| || |||||||||| |||||||||| |||||||||| AAACCCTCTT TCTTTTACCC CAAATTAACA TATAATTAAG AATAAAAGAT GGCAATAATA 215 ACCCACTAAT TTACTCAAGG TTACCTTTTT TAACCCCCAA GTAATTAGAC TTATTAACAT 8621 ||||||||| | ||| | || |||||| ||| ||| |||||| | |||| || ||||||| || CCCCACTAAT TAACTTAGGG TTACCTCTTT TAA-CCCCAA G-AATTTGAG TTATTAATAT 273 TAACCCACTA ACTTTATAAT TAAAGCAGGA ATAGT-AAAA AACGTCCCTT AAAACAT-TA 8679 ||||||| | | | |||||| ||| | | || ||||| ||| |||||||||| ||||| | || AAACCCACGA AATCTATAAT TAAGGAAAGA ATAGTCCAAA AACGTCCCTT AAAACGTGTA 333 AAGAAATCCG 
ACTCAGCCTG GGA-TTATGC AGCCTGTGAC GACTCGTCGT GCCTGCGACG 8738 | |||||||| | || ||||| ||| || || | |||||||| | |||||| | ||| |||| AGGAAATCCG ATTCTGCCTG GGATTTGCGC AACCTGTGAC GGGCCGTCGT GACTGTGACG 393 GTCCGTCTTG CTGCTCCGTC ACAGAGTTCA GAGACTCAAT TTCCCTTAAA GAGTCTGTGA 8798 ||||||| || | | | |||| || | ||| |||| ||||| |||| | || | ||| ||| GTCCGTCCTG CAGGT-CGTC GCAAGGGTCA GAGAGTCAAT TTCCACTGAA CAATCTATGA 452 CGGTCCGTCA CGCCTGTGAC GGTCCGTCCT GCCATTCCGT TACAAAGTTC AGAGAGTCGA 8858 |||||||||| ||||||||| |||||||||| | |||||||| || |||||| |||||||||| CGGTCCGTCA CGCCTGTGAT GGTCCGTCCT GTCATTCCGT CACGAAGTTC AGAGAGTCGA 512 -TTTCAGTAC CCATTTTTCA GAATTTCTAA GTGTTTT 8894 ||||||||| ||| ||||| || ||||||| ||||||| TTTTCAGTAC CCA-ATTTCA GATTTTCTAA GTGTTTT 548 hqPGS_C06HBa0112G05.1-6+_SGN-E356257- (8503 8894) ******************************************************************************** EST sequence 60 -strand 658 n (File: SGN-E377132-) 1 TTCCTTCTTT TACCCTAATT AGCATATATT TAAGAATAAA AGATGGAATA ATAACCCACT 61 AATTTACTCA AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA 121 CTAACTTTAT AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC 181 CGACCCAGAC TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT 241 GCAGGTCGTC GCAAGGTTCA GAGACTCAAT TTCCACCAAA GAGTCTGTGA CGGTCCGTCA 301 CGCCCGTGAC GGTCCGTCGT GCCATTCCGT TACGAAGTTC AGAGAGTCGA TTTTTAGTAC 361 CCAATTTCAG AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT 421 GACGGTCCGT CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT 481 CAAAACGACT AAACAGGTCG TTACATTTAT GTTCTTCCTA CTTAAATATT ATTATTATTT 541 TACGATTTAT AACACTATTA GAAACAAAGA TTTTCTCAAC CATGAATTAA TGAAAAAATT 601 ATGCCATAAA ATATAAAAAA TTTACTCATT TTTCATTGAG CTAATTCATA AAAAAAAA Predicted gene structure (within gDNA segment 7862 to 9840): Exon 1 8509 9014 ( 506 n); cDNA 2 507 ( 506 n); score: 0.896 Intron 1 9015 9793 ( 779 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0.69) Exon 2 9794 9837 ( 44 n); cDNA 508 550 ( 43 n); score: 0.693 PPA cDNA 648 658 MATCH C06HBa0112G05.1-6+ SGN-E377132- 0.896 550 
0.836 C PGS_C06HBa0112G05.1-6+_SGN-E377132- (8509 9014,9794 9837) Alignment (genomic DNA sequence = upper lines): TCTTTCTTTT ACCCTAATTA GCATATAATT AAGAATAAAA GATGGCAATA ATAACCCACT 8568 || ||||||| |||||||||| ||||||| || |||||||||| ||||| |||| |||||||||| TCCTTCTTTT ACCCTAATTA GCATATATTT AAGAATAAAA GATGG-AATA ATAACCCACT 60 AATTTACTCA AGGTTACCTT TTTTAACCCC CAAGTAATTA GACTTATTAA CATTAACCCA 8628 |||||||||| ||||||||| |||||||||| || ||||||| |||||||||| ||| |||||| AATTTACTCA AGGTTACCTC TTTTAACCCC CAGGTAATTA GACTTATTAA CATAAACCCA 120 CTAACTTTAT AATTAAAGCA GGAATAGTAA AAAACGTCCC TTAAAACAT- TAAAGAAATC 8687 |||||||||| |||||||| | |||||||| |||||||||| ||||||| | |||||||||| CTAACTTTAT AATTAAAGTA GGAATAGTCC AAAACGTCCC TTAAAACGTG TAAAGAAATC 180 CGACTCAGCC TGGGATTATG CAGCCTGTGA CGACTCGTCG TGCCTGCGAC GGTCCGTCTT 8747 |||| ||| | |||||||| | || ||||||| | | ||||| |||||||||| |||||||| | CGACCCAGAC TGGGATTACG CAACCTGTGA TGGCCCGTCG TGCCTGCGAC GGTCCGTCCT 240 GCTGCTCCGT CACAGAGTTC AGAGACTCAA TTTCCCTTAA AGAGTCTGTG ACGGTCCGTC 8807 || | | ||| | || |||| |||||||||| ||||| || |||||||||| |||||||||| GCAGGT-CGT CGCAAGGTTC AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC 299 ACGCCTGTGA CGGTCCGTCC TGCCATTCCG TTACAAAGTT CAGAGAGTCG A-TTTCAGTA 8866 ||||| |||| ||||||||| |||||||||| |||| ||||| |||||||||| | ||| |||| ACGCCCGTGA CGGTCCGTCG TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA 359 CCCATTTTTC AGAATTTCTA AGTGTTTTGA AACGAGACCC CTCGACGGTC CGTCGTGCCC 8926 |||| |||| |||||||||| |||||||||| |||||||| | |||||||||| | |||||| | CCCA-ATTTC AGAATTTCTA AGTGTTTTGA AACGAGACTC CTCGACGGTC CATCGTGCTC 418 ATGACGGTCC GTCGTGGGAT CCGTCGTCTC AACC-ATTTT TCCAGAAATA ACATTTGTTG 8985 |||||||||| |||||||| | |||||||||| |||| |||| |||| ||||| | || || | ATGACGGTCC GTCGTGGGTT CCGTCGTCTC AACCTGTTTT TCCAAAAATA AAATCTGCTA 478 CTCAAAATGA CTAAACAGGT CGTTACACTA ACACTGATAA ATGTTCTTCT CTATAATGTC 9045 ||||||| || |||||||||| ||||||| | CTCAAAACGA CTAAACAGGT CGTTACATT. .......... .......... .......... 
507 TATATAGTTG AGATTTTGAA TTTGTATTGT ATAAAACTTT GATATTCAAT AAATTTTATT 9105 .......... .......... .......... .......... .......... .......... 507 GATTTTGTTG AAAGATTTGA TATCCTTTTC TGTATCTATT ATTTCTCCTA ATTGTTGATT 9165 .......... .......... .......... .......... .......... .......... 507 ATTCTCTTCC TTTGTCCTTT TATTTAATAT TTTTGATAGA AAGTTATTAC TTATCATATT 9225 .......... .......... .......... .......... .......... .......... 507 TTTGGAATAG CTTGGTCTTG GTATTTCTTC TCTTGATCTA GTTGAAAAAA TATCCATTTT 9285 .......... .......... .......... .......... .......... .......... 507 TTCTGTTTTT TAATTTTTTT TCTTTTTGGG GAGTAATTTC TATATTATTC AGTTTTTCTA 9345 .......... .......... .......... .......... .......... .......... 507 TTAATTCTTC AGTTTTTCTA TTAATTCTTC TATTCCGGTT GTATCTATAT TTTTACTTTC 9405 .......... .......... .......... .......... .......... .......... 507 TAATTTTTCT TTAAAGTTTT TATCTATCTT ATTATTTAGT TGTAATAAAA GTGATATTAT 9465 .......... .......... .......... .......... .......... .......... 507 ATTATTTTGT TGTATAATAA TTTGATCTGA TTTTGAAGAT GGACTAAATT CTTTGTAATC 9525 .......... .......... .......... .......... .......... .......... 507 TGATGGGAGG GCTATTCTTT TTTCTATTAG TTTTGTTGCT TCTTTATGTG TTTCTGATTC 9585 .......... .......... .......... .......... .......... .......... 507 TACTAAGTCT ATCATGAAAG AACTATTTCT TTAATTGATT CTAATTCTGT CTTTATTTCT 9645 .......... .......... .......... .......... .......... .......... 507 TGTATTTTTA TTACTAATGT ATTATTAATT TCTTCTTTAG ATTTATTTTT TAACTCGTTT 9705 .......... .......... .......... .......... .......... .......... 507 AATTTTTTAT TTAAATCTAT TCCTTGTTCT TTTAGATAAT CTAAGTTATT TTCTATCTTA 9765 .......... .......... .......... .......... .......... .......... 507 CTAAAAAACT GTTTTTGCTT ATTTTGTGTT TCTTCTTGTT CACTT-AATA TTTTTATAAT 9824 | | ||||| | |||| |||| || |||| || .......... .......... 
........TA TGTTCTTCCT -ACTTAAATA TTATTATTAT 538 TTTATTGTTT GAT 9837 |||| | || || TTTA-CGATT TAT 550 hqPGS_C06HBa0112G05.1-6+_SGN-E377132- (8509 9014) ******************************************************************************** EST sequence 56 -strand 647 n (File: SGN-E396055-) 1 ATTAGCATAT AATTAAGAAT AAAAAGATGG AAATAATAAC CCACTAATTT ACTCAAGGGT 61 TCCTTTTTTT AACCCCCAGG GTAATTAGAC TTATTAACAT AAACCCCACT AACTTTATAA 121 TTAAAGTAGG AATAGTCCAA AACGTCCCTT AAAACGTGTA AAGAAATCCG ACCCAGACTG 181 GGATTACGCA ACCTGTGATG GCCCGTCGTG CCTGCGACGG TCCGTCCTGC AGGTCGTCGC 241 AAGGTTCAGA GACTCAATTT CCACCAAAGA GTCTGTGACG GTCCGTCACG CCCGTGACGG 301 TCCGTCGTGC CATTCCGTTA CGAAGTTCAG AGAGTCGATT TTTAGTACCC AATTTCAGAA 361 TTTCTAAGTG TTTTGAAACG AGACTCCTCG ACGGTCCATC GTGCTCATGA CGGTCCGTCG 421 TGGGTTCCGT CGTCTCAACC TGTTTTTCCA AAAATAAAAT CTGCTACTCA AAACGACTAA 481 ACAGGTCGTT ACATTTATGT TCTTCCTACT TAAATATTAT TATTATTTTA CGATTTATAA 541 CACTATTAGA AACAAAGATT TTCTCAACCA TGAATTAATG AAAAAATTAT GCCATAAAAT 601 ATAAAAAATT TACTCATTTT TCATTGAGCT AATTCATAAA AAAAAAA Predicted gene structure (within gDNA segment 7915 to 9840): Exon 1 8525 9014 ( 490 n); cDNA 1 495 ( 495 n); score: 0.874 Intron 1 9015 9793 ( 779 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0.69) Exon 2 9794 9837 ( 44 n); cDNA 496 538 ( 43 n); score: 0.693 PPA cDNA 636 647 MATCH C06HBa0112G05.1-6+ SGN-E396055- 0.874 534 0.825 C PGS_C06HBa0112G05.1-6+_SGN-E396055- (8525 9014,9794 9837) Alignment (genomic DNA sequence = upper lines): ATTAGCATAT AATTAAGAAT -AAAAGATGG CAATAATAAC CCACTAATTT ACTCAA-GGT 8582 |||||||||| |||||||||| ||||||||| ||||||||| |||||||||| |||||| ||| ATTAGCATAT AATTAAGAAT AAAAAGATGG AAATAATAAC CCACTAATTT ACTCAAGGGT 60 TACCTTTTTT AACCCCCA-A GTAATTAGAC TTATTAACAT TAA-CCCACT AACTTTATAA 8640 | | |||||| |||||||| |||||||||| |||||||||| || |||||| |||||||||| TCCTTTTTTT AACCCCCAGG GTAATTAGAC TTATTAACAT AAACCCCACT AACTTTATAA 120 TTAAAGCAGG AATAGTAAAA AACGTCCCTT AAAACAT-TA AAGAAATCCG ACTCAGCCTG 8699 |||||| ||| |||||| || |||||||||| ||||| | || |||||||||| 
|| ||| ||| TTAAAGTAGG AATAGTCCAA AACGTCCCTT AAAACGTGTA AAGAAATCCG ACCCAGACTG 180 GGATTATGCA GCCTGTGACG ACTCGTCGTG CCTGCGACGG TCCGTCTTGC TGCTCCGTCA 8759 |||||| ||| ||||||| | | ||||||| |||||||||| |||||| ||| | | |||| GGATTACGCA ACCTGTGATG GCCCGTCGTG CCTGCGACGG TCCGTCCTGC AGGT-CGTCG 239 CAGAGTTCAG AGACTCAATT TCCCTTAAAG AGTCTGTGAC GGTCCGTCAC GCCTGTGACG 8819 || |||||| |||||||||| ||| |||| |||||||||| |||||||||| ||| |||||| CAAGGTTCAG AGACTCAATT TCCACCAAAG AGTCTGTGAC GGTCCGTCAC GCCCGTGACG 299 GTCCGTCCTG CCATTCCGTT ACAAAGTTCA GAGAGTCGA- TTTCAGTACC CATTTTTCAG 8878 ||||||| || |||||||||| || ||||||| ||||||||| ||| |||||| || |||||| GTCCGTCGTG CCATTCCGTT ACGAAGTTCA GAGAGTCGAT TTTTAGTACC CA-ATTTCAG 358 AATTTCTAAG TGTTTTGAAA CGAGACCCCT CGACGGTCCG TCGTGCCCAT GACGGTCCGT 8938 |||||||||| |||||||||| |||||| ||| ||||||||| |||||| ||| |||||||||| AATTTCTAAG TGTTTTGAAA CGAGACTCCT CGACGGTCCA TCGTGCTCAT GACGGTCCGT 418 CGTGGGATCC GTCGTCTCAA CC-ATTTTTC CAGAAATAAC ATTTGTTGCT CAAAATGACT 8997 |||||| ||| |||||||||| || |||||| || |||||| || || | || ||||| |||| CGTGGGTTCC GTCGTCTCAA CCTGTTTTTC CAAAAATAAA ATCTGCTACT CAAAACGACT 478 AAACAGGTCG TTACACTAAC ACTGATAAAT GTTCTTCTCT ATAATGTCTA TATAGTTGAG 9057 |||||||||| ||||| | AAACAGGTCG TTACATT... .......... .......... .......... .......... 495 ATTTTGAATT TGTATTGTAT AAAACTTTGA TATTCAATAA ATTTTATTGA TTTTGTTGAA 9117 .......... .......... .......... .......... .......... .......... 495 AGATTTGATA TCCTTTTCTG TATCTATTAT TTCTCCTAAT TGTTGATTAT TCTCTTCCTT 9177 .......... .......... .......... .......... .......... .......... 495 TGTCCTTTTA TTTAATATTT TTGATAGAAA GTTATTACTT ATCATATTTT TGGAATAGCT 9237 .......... .......... .......... .......... .......... .......... 495 TGGTCTTGGT ATTTCTTCTC TTGATCTAGT TGAAAAAATA TCCATTTTTT CTGTTTTTTA 9297 .......... .......... .......... .......... .......... .......... 495 ATTTTTTTTC TTTTTGGGGA GTAATTTCTA TATTATTCAG TTTTTCTATT AATTCTTCAG 9357 .......... .......... .......... .......... .......... .......... 
495 TTTTTCTATT AATTCTTCTA TTCCGGTTGT ATCTATATTT TTACTTTCTA ATTTTTCTTT 9417 .......... .......... .......... .......... .......... .......... 495 AAAGTTTTTA TCTATCTTAT TATTTAGTTG TAATAAAAGT GATATTATAT TATTTTGTTG 9477 .......... .......... .......... .......... .......... .......... 495 TATAATAATT TGATCTGATT TTGAAGATGG ACTAAATTCT TTGTAATCTG ATGGGAGGGC 9537 .......... .......... .......... .......... .......... .......... 495 TATTCTTTTT TCTATTAGTT TTGTTGCTTC TTTATGTGTT TCTGATTCTA CTAAGTCTAT 9597 .......... .......... .......... .......... .......... .......... 495 CATGAAAGAA CTATTTCTTT AATTGATTCT AATTCTGTCT TTATTTCTTG TATTTTTATT 9657 .......... .......... .......... .......... .......... .......... 495 ACTAATGTAT TATTAATTTC TTCTTTAGAT TTATTTTTTA ACTCGTTTAA TTTTTTATTT 9717 .......... .......... .......... .......... .......... .......... 495 AAATCTATTC CTTGTTCTTT TAGATAATCT AAGTTATTTT CTATCTTACT AAAAAACTGT 9777 .......... .......... .......... .......... .......... .......... 495 TTTTGCTTAT TTTGTGTTTC TTCTTGTTCA CTT-AATATT TTTATAATTT TATTGTTTGA 9836 | | ||||| | | ||| |||||| |||| |||| || | || | .......... 
......TATG TTCTTCCT-A CTTAAATATT ATTATTATTT TA-CGATTTA 537 T 9837 | T 538 hqPGS_C06HBa0112G05.1-6+_SGN-E396055- (8525 9014) ******************************************************************************** EST sequence 55 -strand 640 n (File: SGN-E398551-) 1 TAGCATATAA TTAAGAATAA AAGATGGAAT AATAACCCAA TAATTTACTC AAGGTTACCT 61 CTTTTAACCC CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA TAATTAAAGT 121 AGGAATAGTC CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA CTGGGATTAC 181 GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT CGCAAGGTTC 241 AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA CGGTCCGTCG 301 TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA GAATTTCTAA 361 GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG TCGTGGGTTC 421 CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC TAAACAGGTC 481 GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA TAACACTATT 541 AGAAACAAAG ATTTTCTCAA CCATGAATTA ATGAAAAAAT TATGCCATAA AATATAAAAA 601 ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAAA Predicted gene structure (within gDNA segment 7917 to 9840): Exon 1 8527 9014 ( 488 n); cDNA 1 488 ( 488 n); score: 0.894 Intron 1 9015 9793 ( 779 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0.69) Exon 2 9794 9837 ( 44 n); cDNA 489 531 ( 43 n); score: 0.693 PPA cDNA 629 640 MATCH C06HBa0112G05.1-6+ SGN-E398551- 0.894 532 0.831 C PGS_C06HBa0112G05.1-6+_SGN-E398551- (8527 9014,9794 9837) Alignment (genomic DNA sequence = upper lines): TAGCATATAA TTAAGAATAA AAGATGGCAA TAATAACCCA CTAATTTACT CAAGGTTACC 8586 |||||||||| |||||||||| ||||||| || |||||||||| ||||||||| |||||||||| TAGCATATAA TTAAGAATAA AAGATGG-AA TAATAACCCA ATAATTTACT CAAGGTTACC 59 TTTTTTAACC CCCAAGTAAT TAGACTTATT AACATTAACC CACTAACTTT ATAATTAAAG 8646 | |||||||| |||| ||||| |||||||||| ||||| |||| |||||||||| |||||||||| TCTTTTAACC CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT ATAATTAAAG 119 CAGGAATAGT AAAAAACGTC CCTTAAAACA T-TAAAGAAA TCCGACTCAG CCTGGGATTA 8705 ||||||||| |||||||| ||||||||| | |||||||| |||||| ||| ||||||||| TAGGAATAGT CCAAAACGTC 
CCTTAAAACG TGTAAAGAAA TCCGACCCAG ACTGGGATTA 179 TGCAGCCTGT GACGACTCGT CGTGCCTGCG ACGGTCCGTC TTGCTGCTCC GTCACAGAGT 8765 ||| ||||| || | | ||| |||||||||| |||||||||| ||| | | | ||| || || CGCAACCTGT GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C GTCGCAAGGT 238 TCAGAGACTC AATTTCCCTT AAAGAGTCTG TGACGGTCCG TCACGCCTGT GACGGTCCGT 8825 |||||||||| ||||||| |||||||||| |||||||||| ||||||| || |||||||||| TCAGAGACTC AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT GACGGTCCGT 298 CCTGCCATTC CGTTACAAAG TTCAGAGAGT CGA-TTTCAG TACCCATTTT TCAGAATTTC 8884 | |||||||| |||||| ||| |||||||||| ||| ||| || |||||| || |||||||||| CGTGCCATTC CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCA-ATT TCAGAATTTC 357 TAAGTGTTTT GAAACGAGAC CCCTCGACGG TCCGTCGTGC CCATGACGGT CCGTCGTGGG 8944 |||||||||| |||||||||| ||||||||| ||| |||||| ||||||||| |||||||||| TAAGTGTTTT GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT CCGTCGTGGG 417 ATCCGTCGTC TCAACC-ATT TTTCCAGAAA TAACATTTGT TGCTCAAAAT GACTAAACAG 9003 ||||||||| |||||| || |||||| ||| ||| || || | ||||||| |||||||||| TTCCGTCGTC TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC GACTAAACAG 477 GTCGTTACAC TAACACTGAT AAATGTTCTT CTCTATAATG TCTATATAGT TGAGATTTTG 9063 ||||||||| | GTCGTTACAT T......... .......... .......... .......... .......... 488 AATTTGTATT GTATAAAACT TTGATATTCA ATAAATTTTA TTGATTTTGT TGAAAGATTT 9123 .......... .......... .......... .......... .......... .......... 488 GATATCCTTT TCTGTATCTA TTATTTCTCC TAATTGTTGA TTATTCTCTT CCTTTGTCCT 9183 .......... .......... .......... .......... .......... .......... 488 TTTATTTAAT ATTTTTGATA GAAAGTTATT ACTTATCATA TTTTTGGAAT AGCTTGGTCT 9243 .......... .......... .......... .......... .......... .......... 488 TGGTATTTCT TCTCTTGATC TAGTTGAAAA AATATCCATT TTTTCTGTTT TTTAATTTTT 9303 .......... .......... .......... .......... .......... .......... 488 TTTCTTTTTG GGGAGTAATT TCTATATTAT TCAGTTTTTC TATTAATTCT TCAGTTTTTC 9363 .......... .......... .......... .......... .......... .......... 
488 TATTAATTCT TCTATTCCGG TTGTATCTAT ATTTTTACTT TCTAATTTTT CTTTAAAGTT 9423 .......... .......... .......... .......... .......... .......... 488 TTTATCTATC TTATTATTTA GTTGTAATAA AAGTGATATT ATATTATTTT GTTGTATAAT 9483 .......... .......... .......... .......... .......... .......... 488 AATTTGATCT GATTTTGAAG ATGGACTAAA TTCTTTGTAA TCTGATGGGA GGGCTATTCT 9543 .......... .......... .......... .......... .......... .......... 488 TTTTTCTATT AGTTTTGTTG CTTCTTTATG TGTTTCTGAT TCTACTAAGT CTATCATGAA 9603 .......... .......... .......... .......... .......... .......... 488 AGAACTATTT CTTTAATTGA TTCTAATTCT GTCTTTATTT CTTGTATTTT TATTACTAAT 9663 .......... .......... .......... .......... .......... .......... 488 GTATTATTAA TTTCTTCTTT AGATTTATTT TTTAACTCGT TTAATTTTTT ATTTAAATCT 9723 .......... .......... .......... .......... .......... .......... 488 ATTCCTTGTT CTTTTAGATA ATCTAAGTTA TTTTCTATCT TACTAAAAAA CTGTTTTTGC 9783 .......... .......... .......... .......... .......... .......... 488 TTATTTTGTG TTTCTTCTTG TTCACTT-AA TATTTTTATA ATTTTATTGT TTGAT 9837 | | ||||| | |||| || |||| |||| |||||| | || || .......... 
TATGTTCTTC CT-ACTTAAA TATTATTATT ATTTTA-CGA TTTAT 531 hqPGS_C06HBa0112G05.1-6+_SGN-E398551- (8527 9014) ******************************************************************************** EST sequence 54 -strand 630 n (File: SGN-E396038-) 1 TTAAGAATAA AAGATGGAAT AATAACCCAC TAATTTACTC AAGGTTACCT CTTTTAACCC 61 CCAGGTAATT AGACTTATTA ACATAAACCC ACTAACTTTA TAATTAAAGT AGGAATAGTC 121 CAAAACGTCC CTTAAAACGT GTAAAGAAAT CCGACCCAGA CTGGGATTAC GCAACCTGTG 181 ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT CGCAAGGTTC AGAGACTCAA 241 TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA CGGTCCGTCG TGCCATTCCG 301 TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA GAATTTCTAA GTGTTTTGAA 361 ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG TCGTGGGTTC CGTCGTCTCA 421 ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC TAAACAGGTC GTTACATTTA 481 TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA TAACACTATT AGAAACAAAG 541 ATTTTCTCAA CCATGAATTA ATGAAAAAAT TATGCCATAA AATATAAAAA ATTTACTCAT 601 TTTTCATAGA GTTAAATAAT AAAAAAAAAA Predicted gene structure (within gDNA segment 7927 to 9840): Exon 1 8537 9014 ( 478 n); cDNA 1 478 ( 478 n); score: 0.894 Intron 1 9015 9793 ( 779 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0.69) Exon 2 9794 9837 ( 44 n); cDNA 479 521 ( 43 n); score: 0.693 PPA cDNA 621 630 MATCH C06HBa0112G05.1-6+ SGN-E396038- 0.894 522 0.829 C PGS_C06HBa0112G05.1-6+_SGN-E396038- (8537 9014,9794 9837) Alignment (genomic DNA sequence = upper lines): TTAAGAATAA AAGATGGCAA TAATAACCCA CTAATTTACT CAAGGTTACC TTTTTTAACC 8596 |||||||||| ||||||| || |||||||||| |||||||||| |||||||||| | |||||||| TTAAGAATAA AAGATGG-AA TAATAACCCA CTAATTTACT CAAGGTTACC TCTTTTAACC 59 CCCAAGTAAT TAGACTTATT AACATTAACC CACTAACTTT ATAATTAAAG CAGGAATAGT 8656 |||| ||||| |||||||||| ||||| |||| |||||||||| |||||||||| ||||||||| CCCAGGTAAT TAGACTTATT AACATAAACC CACTAACTTT ATAATTAAAG TAGGAATAGT 119 AAAAAACGTC CCTTAAAACA T-TAAAGAAA TCCGACTCAG CCTGGGATTA TGCAGCCTGT 8715 |||||||| ||||||||| | |||||||| |||||| ||| ||||||||| ||| ||||| CCAAAACGTC CCTTAAAACG TGTAAAGAAA TCCGACCCAG 
ACTGGGATTA CGCAACCTGT 179 GACGACTCGT CGTGCCTGCG ACGGTCCGTC TTGCTGCTCC GTCACAGAGT TCAGAGACTC 8775 || | | ||| |||||||||| |||||||||| ||| | | | ||| || || |||||||||| GATGGCCCGT CGTGCCTGCG ACGGTCCGTC CTGCAGGT-C GTCGCAAGGT TCAGAGACTC 238 AATTTCCCTT AAAGAGTCTG TGACGGTCCG TCACGCCTGT GACGGTCCGT CCTGCCATTC 8835 ||||||| |||||||||| |||||||||| ||||||| || |||||||||| | |||||||| AATTTCCACC AAAGAGTCTG TGACGGTCCG TCACGCCCGT GACGGTCCGT CGTGCCATTC 298 CGTTACAAAG TTCAGAGAGT CGA-TTTCAG TACCCATTTT TCAGAATTTC TAAGTGTTTT 8894 |||||| ||| |||||||||| ||| ||| || |||||| || |||||||||| |||||||||| CGTTACGAAG TTCAGAGAGT CGATTTTTAG TACCCA-ATT TCAGAATTTC TAAGTGTTTT 357 GAAACGAGAC CCCTCGACGG TCCGTCGTGC CCATGACGGT CCGTCGTGGG ATCCGTCGTC 8954 |||||||||| ||||||||| ||| |||||| ||||||||| |||||||||| ||||||||| GAAACGAGAC TCCTCGACGG TCCATCGTGC TCATGACGGT CCGTCGTGGG TTCCGTCGTC 417 TCAACC-ATT TTTCCAGAAA TAACATTTGT TGCTCAAAAT GACTAAACAG GTCGTTACAC 9013 |||||| || |||||| ||| ||| || || | ||||||| |||||||||| ||||||||| TCAACCTGTT TTTCCAAAAA TAAAATCTGC TACTCAAAAC GACTAAACAG GTCGTTACAT 477 TAACACTGAT AAATGTTCTT CTCTATAATG TCTATATAGT TGAGATTTTG AATTTGTATT 9073 | T......... .......... .......... .......... .......... .......... 478 GTATAAAACT TTGATATTCA ATAAATTTTA TTGATTTTGT TGAAAGATTT GATATCCTTT 9133 .......... .......... .......... .......... .......... .......... 478 TCTGTATCTA TTATTTCTCC TAATTGTTGA TTATTCTCTT CCTTTGTCCT TTTATTTAAT 9193 .......... .......... .......... .......... .......... .......... 478 ATTTTTGATA GAAAGTTATT ACTTATCATA TTTTTGGAAT AGCTTGGTCT TGGTATTTCT 9253 .......... .......... .......... .......... .......... .......... 478 TCTCTTGATC TAGTTGAAAA AATATCCATT TTTTCTGTTT TTTAATTTTT TTTCTTTTTG 9313 .......... .......... .......... .......... .......... .......... 478 GGGAGTAATT TCTATATTAT TCAGTTTTTC TATTAATTCT TCAGTTTTTC TATTAATTCT 9373 .......... .......... .......... .......... .......... .......... 478 TCTATTCCGG TTGTATCTAT ATTTTTACTT TCTAATTTTT CTTTAAAGTT TTTATCTATC 9433 .......... .......... .......... 
.......... .......... .......... 478 TTATTATTTA GTTGTAATAA AAGTGATATT ATATTATTTT GTTGTATAAT AATTTGATCT 9493 .......... .......... .......... .......... .......... .......... 478 GATTTTGAAG ATGGACTAAA TTCTTTGTAA TCTGATGGGA GGGCTATTCT TTTTTCTATT 9553 .......... .......... .......... .......... .......... .......... 478 AGTTTTGTTG CTTCTTTATG TGTTTCTGAT TCTACTAAGT CTATCATGAA AGAACTATTT 9613 .......... .......... .......... .......... .......... .......... 478 CTTTAATTGA TTCTAATTCT GTCTTTATTT CTTGTATTTT TATTACTAAT GTATTATTAA 9673 .......... .......... .......... .......... .......... .......... 478 TTTCTTCTTT AGATTTATTT TTTAACTCGT TTAATTTTTT ATTTAAATCT ATTCCTTGTT 9733 .......... .......... .......... .......... .......... .......... 478 CTTTTAGATA ATCTAAGTTA TTTTCTATCT TACTAAAAAA CTGTTTTTGC TTATTTTGTG 9793 .......... .......... .......... .......... .......... .......... 478 TTTCTTCTTG TTCACTT-AA TATTTTTATA ATTTTATTGT TTGAT 9837 | | ||||| | |||| || |||| |||| |||||| | || || TATGTTCTTC CT-ACTTAAA TATTATTATT ATTTTA-CGA TTTAT 521 hqPGS_C06HBa0112G05.1-6+_SGN-E396038- (8537 9014) ******************************************************************************** EST sequence 23 +strand 356 n (File: SGN-E396037+) 1 GGGCAGCGGA GCCTCATGTT TTGTTTACCA CTATGCCGCA TCTATATGAT TAACATGATG 61 ATGATGATGA TGACTACCAC GATTCACGAG AAGAAGATGA GGATGAATGG GGTATTGAGA 121 TGGATGTTTT ACTCGAGGTT ACCTCTTTTA ACCCCCAGGT AATTAGACTT ATTAACATAA 181 ACCCACTAAC TTTATAATTA AAGTAGGAAT AGTCCAAAAC GTCCCTTAAA ACGTGTAAAG 241 AAATCCGACC CAGACTGGGA TTACGCAACC TGTGATGGCC CGTCGTGCCT GCGACGGTCC 301 GTCCTGCAGG TCTTCTCTAG GTTCAGAGAC TCTCTTTCCA CCAAAGAGTC TGTGAC Predicted gene structure (within gDNA segment 6422 to 9840): Exon 1 7193 7207 ( 15 n); cDNA 114 127 ( 14 n); score: 0.600 Intron 1 7208 8570 (1363 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.94) Exon 2 8571 8799 ( 229 n); cDNA 128 356 ( 229 n); score: 0.862 MATCH C06HBa0112G05.1-6+ SGN-E396037+ 0.862 244 0.685 C PGS_C06HBa0112G05.1-6+_SGN-E396037+ (7193 7207,8571 8799) 
Alignment (genomic DNA sequence = upper lines): ATTCAGCGTT TATGAGTGGT AACGATACAG AGCCTTTTCC TAACTGTCAC GATCCAAATC 7252 ||| || | ||| ATTGAG-ATG GATGT..... .......... .......... .......... .......... 127 GGGCCGCGAC TAGCACCCAC ACTTACCCTC CTATGTGAGC GAACCAACCA ATCCAAACCC 7312 .......... .......... .......... .......... .......... .......... 127 CAACATTTTC AAACATAGTA ACAGAATATA ATGCGGAAGA CTTAAACTCA TTAATGAAAA 7372 .......... .......... .......... .......... .......... .......... 127 TCAATTAAAT AACTTCTAAA AACTCAACAA CTATTATTAT CCCCAAAATC TGGAAGTCAT 7432 .......... .......... .......... .......... .......... .......... 127 CATCACAAGA ACATCTACTT CAAATTACTA AATCTAAGAT TATCTAAGAA GCTAAAATAC 7492 .......... .......... .......... .......... .......... .......... 127 ATAAACAGCT AGTCCATGCC GGAACTTCAA GGCATCAAGA CATGAAGAGG AGGATCCAGT 7552 .......... .......... .......... .......... .......... .......... 127 CCAAGCTAGA AGCATTAGCT CACCCTGAAA TCCGGAGTAA TGAAGACTGG CTAGATTTGC 7612 .......... .......... .......... .......... .......... .......... 127 GGTTGAGTTG AAGACGACAG AACGTTTGCT GCACTCCACA AATAATCAAA AAGAAAACAT 7672 .......... .......... .......... .......... .......... .......... 127 ACAAGTAGGG GTCAGTACAA AACACAGGTA CTGAGTAGAT ATCATCGGCC AACTCAAAAT 7732 .......... .......... .......... .......... .......... .......... 127 AGAAAACAGT ATATATCAGA TAATATCATA AAATCAACTA CAGTACTCAA CATGCGGCAT 7792 .......... .......... .......... .......... .......... .......... 127 TTACAATTAC CATAACCCTT GGTCGCAACA CCAAGCTCAT CAATGAGGAC TCATGCCTCC 7852 .......... .......... .......... .......... .......... .......... 127 CCATCATACT CATTTGGGAA TTAAGTTCCT TAAATTGAGT ATATTAACAT ATTTCAAGAT 7912 .......... .......... .......... .......... .......... .......... 127 TCATTCTCTT TACTAATCCT GGTGTCAGAA CGTGACACCC GATCCATATA TACTATCCTG 7972 .......... .......... .......... .......... .......... .......... 127 GTACCGGAAC GTGGCACCCG ATCCATATTC TATCCTGGTG TCGGAACGTG ACACTCCGAT 8032 .......... .......... .......... 
.......... .......... .......... 127 CCTCATATAC TATCCTGGTA CCGGAACGTG GCACCCGATC CATATTCTAT CCTGGTGTCG 8092 .......... .......... .......... .......... .......... .......... 127 GAACGTGACA CTCCGATCCT CATATACTAT CCTGGTACCG GAACGTGACA CCCGATCCCC 8152 .......... .......... .......... .......... .......... .......... 127 TAATCTCACT ACTTTCGTTC ATCAAGCCTT CTTGTATACT AAGGCATCAT CATTAACAAA 8212 .......... .......... .......... .......... .......... .......... 127 GTAGATTAGG GTTTCTTTTT CAAGATTTAG AATTCAATAG CTTCATCATG CTTATCTCAT 8272 .......... .......... .......... .......... .......... .......... 127 CACAATTATA TAATCACAAT ATGCAAACAC ACAATTAAGC ATATAGAAGG GTTTACAACA 8332 .......... .......... .......... .......... .......... .......... 127 CTACCCAATA CATATCATTC GATATTAAGA GTTTACTACG AATAGTGTAA AAACCATAAC 8392 .......... .......... .......... .......... .......... .......... 127 CTACCTCCAT CGAAGATTAG TGATCAAGCA AGCAAATTCC CCAAAGCTTT GTGTTTTCCT 8452 .......... .......... .......... .......... .......... .......... 127 CTTCTCGTTC GATCCTCTCT CTCTTTTTGT TCTTTCTATT TTCTTTATTC AAACCCTCTT 8512 .......... .......... .......... .......... .......... .......... 127 TCTTTTACCC TAATTAGCAT ATAATTAAGA ATAAAAGATG GCAATAATAA CCCACTAATT 8572 || .......... .......... .......... .......... .......... 
........TT 129 TACTCAAGGT TACCTTTTTT AACCCCCAAG TAATTAGACT TATTAACATT AACCCACTAA 8632 ||||| |||| ||||| |||| |||||||| | |||||||||| ||||||||| |||||||||| TACTCGAGGT TACCTCTTTT AACCCCCAGG TAATTAGACT TATTAACATA AACCCACTAA 189 CTTTATAATT AAAGCAGGAA TAGTAAAAAA CGTCCCTTAA AACAT-TAAA GAAATCCGAC 8691 |||||||||| |||| ||||| |||| |||| |||||||||| ||| | |||| |||||||||| CTTTATAATT AAAGTAGGAA TAGTCCAAAA CGTCCCTTAA AACGTGTAAA GAAATCCGAC 249 TCAGCCTGGG ATTATGCAGC CTGTGACGAC TCGTCGTGCC TGCGACGGTC CGTCTTGCTG 8751 ||| ||||| |||| ||| | |||||| | | ||||||||| |||||||||| |||| ||| | CCAGACTGGG ATTACGCAAC CTGTGATGGC CCGTCGTGCC TGCGACGGTC CGTCCTGCAG 309 CTCCGTCACA GAGTTCAGAG ACTCAATTTC CCTTAAAGAG TCTGTGAC 8799 | | || | |||||||| |||| |||| | |||||| |||||||| GT-CTTCTCT AGGTTCAGAG ACTCTCTTTC CACCAAAGAG TCTGTGAC 356 hqPGS_C06HBa0112G05.1-6+_SGN-E396037+ (8571 8799) ******************************************************************************** EST sequence 58 -strand 525 n (File: SGN-E396069-) 1 AAAGTAGGAA TAGTCCAAAA CGTCCCTTAA AACGTGTAAA GAAATCCGAC CCCCACCGGG 61 ATTACGCAAC CTGTGATGGC CCGTCGTGCC TGCGACGGTC CGTCCTGCAG GTCGTCGCAA 121 GGTTCAGAGA CTCAATTTCC ACCAAAGAGT CTGTGACGGT CCGTCACGCC CGTGACGGTC 181 CGTCGTGCCA TTCCGTTACG AAGTTCAGAG AGTCGATTTT TAGTACCCAA TTTCAGAATT 241 TCTAAGTGTT TTGAAACGAG ACTCCTCGAC GGTCCATCGT GCTCATGACG GTCCGTCGTG 301 GGTTCCGTCG TCTCAACCTG TTTTTCCAAA AATAAAATCT GCTACTCAAA ACGACTAAAC 361 AGGTCGTTAC ATTTATGTTC TTCCTACTTA AATATTATTA TTATTTTACG ATTTATAACA 421 CTATTAGAAA CAAAGATTTT CTCAACCATG AATTAATGAA AAAATTATGC CATAAAATAT 481 AAAAAATTTA CTCATTTTTC ATTGAGCTAA TTCATAAAAA AAAAA Predicted gene structure (within gDNA segment 7303 to 9840): Exon 1 8643 9014 ( 372 n); cDNA 1 373 ( 373 n); score: 0.867 Intron 1 9015 9793 ( 779 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0.69) Exon 2 9794 9837 ( 44 n); cDNA 374 416 ( 43 n); score: 0.693 PPA cDNA 514 525 MATCH C06HBa0112G05.1-6+ SGN-E396069- 0.867 416 0.792 C PGS_C06HBa0112G05.1-6+_SGN-E396069- (8643 9014,9794 9837) Alignment (genomic DNA 
sequence = upper lines): AAAGCAGGAA TAGTAAAAAA CGTCCCTTAA AACAT-TAAA GAAATCCGAC TCAGCCTGGG 8701 |||| ||||| |||| |||| |||||||||| ||| | |||| |||||||||| | | ||| AAAGTAGGAA TAGTCCAAAA CGTCCCTTAA AACGTGTAAA GAAATCCGAC CCCCACCGGG 60 ATTATGCAGC CTGTGACGAC TCGTCGTGCC TGCGACGGTC CGTCTTGCTG CTCCGTCACA 8761 |||| ||| | |||||| | | ||||||||| |||||||||| |||| ||| | | |||| || ATTACGCAAC CTGTGATGGC CCGTCGTGCC TGCGACGGTC CGTCCTGCAG GT-CGTCGCA 119 GAGTTCAGAG ACTCAATTTC CCTTAAAGAG TCTGTGACGG TCCGTCACGC CTGTGACGGT 8821 |||||||| |||||||||| | |||||| |||||||||| |||||||||| | |||||||| AGGTTCAGAG ACTCAATTTC CACCAAAGAG TCTGTGACGG TCCGTCACGC CCGTGACGGT 179 CCGTCCTGCC ATTCCGTTAC AAAGTTCAGA GAGTCGA-TT TCAGTACCCA TTTTTCAGAA 8880 ||||| |||| |||||||||| ||||||||| ||||||| || | |||||||| |||||||| CCGTCGTGCC ATTCCGTTAC GAAGTTCAGA GAGTCGATTT TTAGTACCCA -ATTTCAGAA 238 TTTCTAAGTG TTTTGAAACG AGACCCCTCG ACGGTCCGTC GTGCCCATGA CGGTCCGTCG 8940 |||||||||| |||||||||| |||| ||||| ||||||| || |||| ||||| |||||||||| TTTCTAAGTG TTTTGAAACG AGACTCCTCG ACGGTCCATC GTGCTCATGA CGGTCCGTCG 298 TGGGATCCGT CGTCTCAACC -ATTTTTCCA GAAATAACAT TTGTTGCTCA AAATGACTAA 8999 |||| ||||| |||||||||| |||||||| |||||| || || | |||| ||| |||||| TGGGTTCCGT CGTCTCAACC TGTTTTTCCA AAAATAAAAT CTGCTACTCA AAACGACTAA 358 ACAGGTCGTT ACACTAACAC TGATAAATGT TCTTCTCTAT AATGTCTATA TAGTTGAGAT 9059 |||||||||| ||| | ACAGGTCGTT ACATT..... .......... .......... .......... .......... 373 TTTGAATTTG TATTGTATAA AACTTTGATA TTCAATAAAT TTTATTGATT TTGTTGAAAG 9119 .......... .......... .......... .......... .......... .......... 373 ATTTGATATC CTTTTCTGTA TCTATTATTT CTCCTAATTG TTGATTATTC TCTTCCTTTG 9179 .......... .......... .......... .......... .......... .......... 373 TCCTTTTATT TAATATTTTT GATAGAAAGT TATTACTTAT CATATTTTTG GAATAGCTTG 9239 .......... .......... .......... .......... .......... .......... 373 GTCTTGGTAT TTCTTCTCTT GATCTAGTTG AAAAAATATC CATTTTTTCT GTTTTTTAAT 9299 .......... .......... .......... .......... .......... .......... 
373 TTTTTTTCTT TTTGGGGAGT AATTTCTATA TTATTCAGTT TTTCTATTAA TTCTTCAGTT 9359 .......... .......... .......... .......... .......... .......... 373 TTTCTATTAA TTCTTCTATT CCGGTTGTAT CTATATTTTT ACTTTCTAAT TTTTCTTTAA 9419 .......... .......... .......... .......... .......... .......... 373 AGTTTTTATC TATCTTATTA TTTAGTTGTA ATAAAAGTGA TATTATATTA TTTTGTTGTA 9479 .......... .......... .......... .......... .......... .......... 373 TAATAATTTG ATCTGATTTT GAAGATGGAC TAAATTCTTT GTAATCTGAT GGGAGGGCTA 9539 .......... .......... .......... .......... .......... .......... 373 TTCTTTTTTC TATTAGTTTT GTTGCTTCTT TATGTGTTTC TGATTCTACT AAGTCTATCA 9599 .......... .......... .......... .......... .......... .......... 373 TGAAAGAACT ATTTCTTTAA TTGATTCTAA TTCTGTCTTT ATTTCTTGTA TTTTTATTAC 9659 .......... .......... .......... .......... .......... .......... 373 TAATGTATTA TTAATTTCTT CTTTAGATTT ATTTTTTAAC TCGTTTAATT TTTTATTTAA 9719 .......... .......... .......... .......... .......... .......... 373 ATCTATTCCT TGTTCTTTTA GATAATCTAA GTTATTTTCT ATCTTACTAA AAAACTGTTT 9779 .......... .......... .......... .......... .......... .......... 373 TTGCTTATTT TGTGTTTCTT CTTGTTCACT T-AATATTTT TATAATTTTA TTGTTTGAT 9837 | | || ||| | ||| | |||||| | ||| |||||| | || || .......... 
....TATGTT CTTCCT-ACT TAAATATTAT TATTATTTTA -CGATTTAT 416 hqPGS_C06HBa0112G05.1-6+_SGN-E396069- (8643 9014) ******************************************************************************** EST sequence 51 -strand 519 n (File: SGN-E374998-) 1 GGAATAGTAC ATAACGTCCC TTAAAACGTG TAAAAGAAAT CCGACCCAGA CTGGGATTAC 61 GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGTCGT CGCAAGGTTC 121 AGAGACTCAA TTTCCACCAA AGAGTCTGTG ACGGTCCGTC ACGCCCGTGA CGGTCCGTCG 181 TGCCATTCCG TTACGAAGTT CAGAGAGTCG ATTTTTAGTA CCCAATTTCA GAATTTCTAA 241 GTGTTTTGAA ACGAGACTCC TCGACGGTCC ATCGTGCTCA TGACGGTCCG TCGTGGGTTC 301 CGTCGTCTCA ACCTGTTTTT CCAAAAATAA AATCTGCTAC TCAAAACGAC TAAACAGGTC 361 GTTACATTTA TGTTCTTCCT ACTTAAATAT TATTATTATT TTACGATTTA TAACACTATT 421 AGAAACAAAG ATTTTCTCAA CCATGAATTA ATGAAAAAAT TATGCCATAA AATATAAAAA 481 ATTTACTCAT TTTTCATTGA GCTAATTCAT AAAAAAAAA Predicted gene structure (within gDNA segment 7353 to 9840): Exon 1 8649 9014 ( 366 n); cDNA 1 368 ( 368 n); score: 0.869 Intron 1 9015 9793 ( 779 n); Pd: 0.000 (s: 0.86), Pa: 0.000 (s: 0.69) Exon 2 9794 9837 ( 44 n); cDNA 369 411 ( 43 n); score: 0.693 PPA cDNA 509 519 MATCH C06HBa0112G05.1-6+ SGN-E374998- 0.869 410 0.790 C PGS_C06HBa0112G05.1-6+_SGN-E374998- (8649 9014,9794 9837) Alignment (genomic DNA sequence = upper lines): GGAATAGTAA AAAACGTCCC TTAAAACAT- T-AAAGAAAT CCGACTCAGC CTGGGATTAT 8706 ||||||||| | |||||||| ||||||| | | |||||||| ||||| ||| ||||||||| GGAATAGTAC ATAACGTCCC TTAAAACGTG TAAAAGAAAT CCGACCCAGA CTGGGATTAC 60 GCAGCCTGTG ACGACTCGTC GTGCCTGCGA CGGTCCGTCT TGCTGCTCCG TCACAGAGTT 8766 ||| |||||| | | | |||| |||||||||| ||||||||| ||| | | || || || ||| GCAACCTGTG ATGGCCCGTC GTGCCTGCGA CGGTCCGTCC TGCAGGT-CG TCGCAAGGTT 119 CAGAGACTCA ATTTCCCTTA AAGAGTCTGT GACGGTCCGT CACGCCTGTG ACGGTCCGTC 8826 |||||||||| |||||| | |||||||||| |||||||||| |||||| ||| |||||||||| CAGAGACTCA ATTTCCACCA AAGAGTCTGT GACGGTCCGT CACGCCCGTG ACGGTCCGTC 179 CTGCCATTCC GTTACAAAGT TCAGAGAGTC GA-TTTCAGT ACCCATTTTT CAGAATTTCT 8885 ||||||||| ||||| |||| |||||||||| || ||| 
||| ||||| ||| |||||||||| GTGCCATTCC GTTACGAAGT TCAGAGAGTC GATTTTTAGT ACCCA-ATTT CAGAATTTCT 238 AAGTGTTTTG AAACGAGACC CCTCGACGGT CCGTCGTGCC CATGACGGTC CGTCGTGGGA 8945 |||||||||| ||||||||| |||||||||| || |||||| |||||||||| ||||||||| AAGTGTTTTG AAACGAGACT CCTCGACGGT CCATCGTGCT CATGACGGTC CGTCGTGGGT 298 TCCGTCGTCT CAACC-ATTT TTCCAGAAAT AACATTTGTT GCTCAAAATG ACTAAACAGG 9004 |||||||||| ||||| ||| ||||| |||| || || || | ||||||| | |||||||||| TCCGTCGTCT CAACCTGTTT TTCCAAAAAT AAAATCTGCT ACTCAAAACG ACTAAACAGG 358 TCGTTACACT AACACTGATA AATGTTCTTC TCTATAATGT CTATATAGTT GAGATTTTGA 9064 |||||||| | TCGTTACATT .......... .......... .......... .......... .......... 368 ATTTGTATTG TATAAAACTT TGATATTCAA TAAATTTTAT TGATTTTGTT GAAAGATTTG 9124 .......... .......... .......... .......... .......... .......... 368 ATATCCTTTT CTGTATCTAT TATTTCTCCT AATTGTTGAT TATTCTCTTC CTTTGTCCTT 9184 .......... .......... .......... .......... .......... .......... 368 TTATTTAATA TTTTTGATAG AAAGTTATTA CTTATCATAT TTTTGGAATA GCTTGGTCTT 9244 .......... .......... .......... .......... .......... .......... 368 GGTATTTCTT CTCTTGATCT AGTTGAAAAA ATATCCATTT TTTCTGTTTT TTAATTTTTT 9304 .......... .......... .......... .......... .......... .......... 368 TTCTTTTTGG GGAGTAATTT CTATATTATT CAGTTTTTCT ATTAATTCTT CAGTTTTTCT 9364 .......... .......... .......... .......... .......... .......... 368 ATTAATTCTT CTATTCCGGT TGTATCTATA TTTTTACTTT CTAATTTTTC TTTAAAGTTT 9424 .......... .......... .......... .......... .......... .......... 368 TTATCTATCT TATTATTTAG TTGTAATAAA AGTGATATTA TATTATTTTG TTGTATAATA 9484 .......... .......... .......... .......... .......... .......... 368 ATTTGATCTG ATTTTGAAGA TGGACTAAAT TCTTTGTAAT CTGATGGGAG GGCTATTCTT 9544 .......... .......... .......... .......... .......... .......... 368 TTTTCTATTA GTTTTGTTGC TTCTTTATGT GTTTCTGATT CTACTAAGTC TATCATGAAA 9604 .......... .......... .......... .......... .......... .......... 
368 GAACTATTTC TTTAATTGAT TCTAATTCTG TCTTTATTTC TTGTATTTTT ATTACTAATG 9664 .......... .......... .......... .......... .......... .......... 368 TATTATTAAT TTCTTCTTTA GATTTATTTT TTAACTCGTT TAATTTTTTA TTTAAATCTA 9724 .......... .......... .......... .......... .......... .......... 368 TTCCTTGTTC TTTTAGATAA TCTAAGTTAT TTTCTATCTT ACTAAAAAAC TGTTTTTGCT 9784 .......... .......... .......... .......... .......... .......... 368 TATTTTGTGT TTCTTCTTGT TCACTT-AAT ATTTTTATAA TTTTATTGTT TGAT 9837 | | ||||| | |||| ||| ||| |||| | ||||| | | | || .........T ATGTTCTTCC T-ACTTAAAT ATTATTATTA TTTTA-CGAT TTAT 411 hqPGS_C06HBa0112G05.1-6+_SGN-E374998- (8649 9014) Total number of EST alignments reported: 66 ________________________________________________________________________________ Predicted gene locations (2) in segment 1 to 9840: PGL 1 (+ strand): 3519 4985 AGS-1 (3519 4167) SCR (e 0.988) Exon 1 3519 4167 ( 649 n); score: 0.988 PGS (3519 4167) SGN-E539761+ PGS (3520 3904) SGN-E284788+ 3-phase translation of AGS-1 (+strand): . . . . . . 3519 TACGTCAGAAACTGGGAGTCTTGACAAACAGGTGGAGTTTCAAGTCATTCAGAACGAGAG Y V R N W E S - Q T G G V S S H S E R E T S E T G S L D K Q V E F Q V I Q N E S R Q K L G V L T N R W S F K S F R T R . . . . . . 3579 CGATTTAAAGGAACCTGAAGAGGAGGATCAAGAGCCACAGACAGAAACTGATATTCCAGA R F K G T - R G G S R A T D R N - Y S R D L K E P E E E D Q E P Q T E T D I P E A I - R N L K R R I K S H R Q K L I F Q . . . . . . 3639 ATCTATGCCATCAGATATCCATCAGAGTATAGATCAAGATCGGCCAAGGAGGGTTGGAGT I Y A I R Y P S E Y R S R S A K E G W S S M P S D I H Q S I D Q D R P R R V G V N L C H Q I S I R V - I K I G Q G G L E . . . . . . 3699 TCGGCCACCTACGAGGTATGGTTTTGAGGACATGGTGGGTTATGCACTGCAGGTTGCTGA S A T Y E V W F - G H G G L C T A G C - R P P T R Y G F E D M V G Y A L Q V A E F G H L R G M V L R T W W V M H C R L L . . . . . . 
3759 AGAGGTAGATACATCTGAGCCGTCTACTTACAAAGAAGCCATTTTAAGTTCTGATTCTGA R G R Y I - A V Y L Q R S H F K F - F - E V D T S E P S T Y K E A I L S S D S E K R - I H L S R L L T K K P F - V L I L . . . . . . 3819 AAAATGGTTTGCCGCTATGGGAGATGAGATGGAGTCCCTACACAAGAATCAGACATGGGA K M V C R Y G R - D G V P T Q E S D M G K W F A A M G D E M E S L H K N Q T W D K N G L P L W E M R W S P Y T R I R H G . . . . . . 3879 TCTGGTCATACAGCCTTCGGGGAGAAAGATTATTACTTGCAAATGGGTTTTCAAGAAGAA S G H T A F G E K D Y Y L Q M G F Q E E L V I Q P S G R K I I T C K W V F K K K I W S Y S L R G E R L L L A N G F S R R . . . . . . 3939 GGAAGGGATATCACCAGCAGAAGGAGTCAAGTATAAAGCCAGGGTTGTTGCCAGAGGTTT G R D I T S R R S Q V - S Q G C C Q R F E G I S P A E G V K Y K A R V V A R G F R K G Y H Q Q K E S S I K P G L L P E V . . . . . . 3999 CAACCAAAGAGAGGGAGTGGACTACAATGAGATCTTCTCACCAGTGGTCAGACATACTTC Q P K R G S G L Q - D L L T S G Q T Y F N Q R E G V D Y N E I F S P V V R H T S S T K E R E W T T M R S S H Q W S D I L . . . . . . 4059 CATCCGAGTGTTACTAGCGATAGTTGCACATCAGAATCTGGAGCTTGAACAACTTGATGT H P S V T S D S C T S E S G A - T T - C I R V L L A I V A H Q N L E L E Q L D V P S E C Y - R - L H I R I W S L N N L M . . . . . 4119 GAAGACAGCGTTTTTACATGGAGAGTTGGAGGAAGAGATATACATGACT E D S V F T W R V G G R D I H D K T A F L H G E L E E E I Y M T - R Q R F Y M E S W R K R Y T - Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0112G05.1-6+_PGL-1_AGS-1_PPS_1 (3520 4167) (frame '2'; 648 bp, 216 residues) 1 TSETGSLDKQ VEFQVIQNES DLKEPEEEDQ EPQTETDIPE SMPSDIHQSI DQDRPRRVGV 61 RPPTRYGFED MVGYALQVAE EVDTSEPSTY KEAILSSDSE KWFAAMGDEM ESLHKNQTWD 121 LVIQPSGRKI ITCKWVFKKK EGISPAEGVK YKARVVARGF NQREGVDYNE IFSPVVRHTS 181 IRVLLAIVAH QNLELEQLDV KTAFLHGELE EEIYMT 3-phase translation of AGS-1 (-strand): . . . . . . 4167 AGTCATGTATATCTCTTCCTCCAACTCTCCATGTAAAAACGCTGTCTTCACATCAAGTTG S H V Y L F L Q L S M - K R C L H I K L V M Y I S S S N S P C K N A V F T S S C S C I S L P P T L H V K T L S S H Q V . . . . . . 
4107 TTCAAGCTCCAGATTCTGATGTGCAACTATCGCTAGTAACACTCGGATGGAAGTATGTCT F K L Q I L M C N Y R - - H S D G S M S S S S R F - C A T I A S N T R M E V C L V Q A P D S D V Q L S L V T L G W K Y V . . . . . . 4047 GACCACTGGTGAGAAGATCTCATTGTAGTCCACTCCCTCTCTTTGGTTGAAACCTCTGGC D H W - E D L I V V H S L S L V E T S G T T G E K I S L - S T P S L W L K P L A - P L V R R S H C S P L P L F G - N L W . . . . . . 3987 AACAACCCTGGCTTTATACTTGACTCCTTCTGCTGGTGATATCCCTTCCTTCTTCTTGAA N N P G F I L D S F C W - Y P F L L L E T T L A L Y L T P S A G D I P S F F L K Q Q P W L Y T - L L L L V I S L P S S - . . . . . . 3927 AACCCATTTGCAAGTAATAATCTTTCTCCCCGAAGGCTGTATGACCAGATCCCATGTCTG N P F A S N N L S P R R L Y D Q I P C L T H L Q V I I F L P E G C M T R S H V - K P I C K - - S F S P K A V - P D P M S . . . . . . 3867 ATTCTTGTGTAGGGACTCCATCTCATCTCCCATAGCGGCAAACCATTTTTCAGAATCAGA I L V - G L H L I S H S G K P F F R I R F L C R D S I S S P I A A N H F S E S E D S C V G T P S H L P - R Q T I F Q N Q . . . . . . 3807 ACTTAAAATGGCTTCTTTGTAAGTAGACGGCTCAGATGTATCTACCTCTTCAGCAACCTG T - N G F F V S R R L R C I Y L F S N L L K M A S L - V D G S D V S T S S A T C N L K W L L C K - T A Q M Y L P L Q Q P . . . . . . 3747 CAGTGCATAACCCACCATGTCCTCAAAACCATACCTCGTAGGTGGCCGAACTCCAACCCT Q C I T H H V L K T I P R R W P N S N P S A - P T M S S K P Y L V G G R T P T L A V H N P P C P Q N H T S - V A E L Q P . . . . . . 3687 CCTTGGCCGATCTTGATCTATACTCTGATGGATATCTGATGGCATAGATTCTGGAATATC P W P I L I Y T L M D I - W H R F W N I L G R S - S I L - W I S D G I D S G I S S L A D L D L Y S D G Y L M A - I L E Y . . . . . . 3627 AGTTTCTGTCTGTGGCTCTTGATCCTCCTCTTCAGGTTCCTTTAAATCGCTCTCGTTCTG S F C L W L L I L L F R F L - I A L V L V S V C G S - S S S S G S F K S L S F - Q F L S V A L D P P L Q V P L N R S R S . . . . . 
3567 AATGACTTGAAACTCCACCTGTTTGTCAAGACTCCCAGTTTCTGACGTA N D L K L H L F V K T P S F - R M T - N S T C L S R L P V S D V E - L E T P P V C Q D S Q F L T Maximal non-overlapping open reading frames (>= 64 codons): none AGS-2 (4178 4985) SCR (e 0.994) Exon 1 4178 4985 ( 808 n); score: 0.994 PGS (4178 4985) SGN-E343338+ 3-phase translation of AGS-2 (+strand): . . . . . . 4178 GTTTCCAAGTTCCAGGGAAGGAAAATCACGTCTGCAAGTTGAAGAAGTCCTTATATGGAC V S K F Q G R K I T S A S - R S P Y M D F P S S R E G K S R L Q V E E V L I W T F Q V P G K E N H V C K L K K S L Y G . . . . . . 4238 TTAAGCAGTCTCCAAGGCAGTGGTATAAAAGGTTTGACAGCTATATGGTGAAGTTGGGCT L S S L Q G S G I K G L T A I W - S W A - A V S K A V V - K V - Q L Y G E V G L L K Q S P R Q W Y K R F D S Y M V K L G . . . . . . 4298 ATACTCGGAGCTCATATGATTGTTGTGTCTACTACAATAGGCTCAATGATGATTCATTCA I L G A H M I V V S T T I G S M M I H S Y S E L I - L L C L L Q - A Q - - F I H Y T R S S Y D C C V Y Y N R L N D D S F . . . . . . 4358 TCTATCTGGTGCTTTATGTAGATGATATGTTGATAGCTGCAAAGAAGAAGTATGACATTC S I W C F M - M I C - - L Q R R S M T F L S G A L C R - Y V D S C K E E V - H S I Y L V L Y V D D M L I A A K K K Y D I . . . . . . 4418 AGAAGCTGAAGGGTTTACTTAGTGCTGAGTTTGAGATGAAGGATTTGGGAGCCGCTCGGA R S - R V Y L V L S L R - R I W E P L G E A E G F T - C - V - D E G F G S R S E Q K L K G L L S A E F E M K D L G A A R . . . . . . 4478 AGATTTTAGGGATGGAGATCATTAGAGACAGAGAGAGAAGGAAACTTTTCTTGTCACAGA R F - G W R S L E T E R E G N F S C H R D F R D G D H - R Q R E K E T F L V T E K I L G M E I I R D R E R R K L F L S Q . . . . . . 4538 GAAGCTACATTCAGAAGGTCTTGGCGAGGTTTGGCATGTCTTCATCTAAGCCCATTGATA E A T F R R S W R G L A C L H L S P L I K L H S E G L G E V W H V F I - A H - Y R S Y I Q K V L A R F G M S S S K P I D . . . . . . 4598 CCCCCAGTGCTGCCAATATCCATCTCACTGCCATGTTCGCTCCACAGTCAGAAGAAGAGA P P V L P I S I S L P C S L H S Q K K R P Q C C Q Y P S H C H V R S T V R R R E T P S A A N I H L T A M F A P Q S E E E . . . . . . 
4658 AGGAGTATATGTCACGAGTCCCTTATGCCAGTGCCGTAGGAAGTTTAATGTATGCTATGG R S I C H E S L M P V P - E V - C M L W G V Y V T S P L C Q C R R K F N V C Y G K E Y M S R V P Y A S A V G S L M Y A M . . . . . . 4718 TCTGTACAAGGCCAGATTTAGCACATGCAGTCAGTGTAGTGAGCAGATTCATGGGACAAC S V Q G Q I - H M Q S V - - A D S W D N L Y K A R F S T C S Q C S E Q I H G T T V C T R P D L A H A V S V V S R F M G Q . . . . . . 4778 CAGGGAGAGAACATTGGCAGGCTGTGAAGAGAATTTTCCGGTACCTTAGAGGTACATCTG Q G E N I G R L - R E F S G T L E V H L R E R T L A G C E E N F P V P - R Y I - P G R E H W Q A V K R I F R Y L R G T S . . . . . . 4838 ACGTTGGTCTCATTTATGGAGGTGATACTCAATGCTTGGTTACTGGCTATTCTGATTCAG T L V S F M E V I L N A W L L A I L I Q R W S H L W R - Y S M L G Y W L F - F R D V G L I Y G G D T Q C L V T G Y S D S . . . . . . 4898 ACTATGCTGGAGATGTTGACACAAGAAGATCGATGACTGGCTATGTGTTTACCCTTGGAG T M L E M L T Q E D R - L A M C L P L E L C W R C - H K K I D D W L C V Y P W R D Y A G D V D T R R S M T G Y V F T L G . . . 4958 GATCTGTCGTCAGTTGGAAGGCAACTTT D L S S V G R Q L I C R Q L E G N F G S V V S W K A T Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0112G05.1-6+_PGL-1_AGS-2_PPS_1 (4180 4983) (frame '0'; 804 bp, 268 residues) 1 FQVPGKENHV CKLKKSLYGL KQSPRQWYKR FDSYMVKLGY TRSSYDCCVY YNRLNDDSFI 61 YLVLYVDDML IAAKKKYDIQ KLKGLLSAEF EMKDLGAARK ILGMEIIRDR ERRKLFLSQR 121 SYIQKVLARF GMSSSKPIDT PSAANIHLTA MFAPQSEEEK EYMSRVPYAS AVGSLMYAMV 181 CTRPDLAHAV SVVSRFMGQP GREHWQAVKR IFRYLRGTSD VGLIYGGDTQ CLVTGYSDSD 241 YAGDVDTRRS MTGYVFTLGG SVVSWKAT 3-phase translation of AGS-2 (-strand): . . . . . . 4985 AAAGTTGCCTTCCAACTGACGACAGATCCTCCAAGGGTAAACACATAGCCAGTCATCGAT K V A F Q L T T D P P R V N T - P V I D K L P S N - R Q I L Q G - T H S Q S S I S C L P T D D R S S K G K H I A S H R . . . . . . 4925 CTTCTTGTGTCAACATCTCCAGCATAGTCTGAATCAGAATAGCCAGTAACCAAGCATTGA L L V S T S P A - S E S E - P V T K H - F L C Q H L Q H S L N Q N S Q - P S I E S S C V N I S S I V - I R I A S N Q A L . . . . . . 
4865 GTATCACCTCCATAAATGAGACCAACGTCAGATGTACCTCTAAGGTACCGGAAAATTCTC V S P P - M R P T S D V P L R Y R K I L Y H L H K - D Q R Q M Y L - G T G K F S S I T S I N E T N V R C T S K V P E N S . . . . . . 4805 TTCACAGCCTGCCAATGTTCTCTCCCTGGTTGTCCCATGAATCTGCTCACTACACTGACT F T A C Q C S L P G C P M N L L T T L T S Q P A N V L S L V V P - I C S L H - L L H S L P M F S P W L S H E S A H Y T D . . . . . . 4745 GCATGTGCTAAATCTGGCCTTGTACAGACCATAGCATACATTAAACTTCCTACGGCACTG A C A K S G L V Q T I A Y I K L P T A L H V L N L A L Y R P - H T L N F L R H W C M C - I W P C T D H S I H - T S Y G T . . . . . . 4685 GCATAAGGGACTCGTGACATATACTCCTTCTCTTCTTCTGACTGTGGAGCGAACATGGCA A - G T R D I Y S F S S S D C G A N M A H K G L V T Y T P S L L L T V E R T W Q G I R D S - H I L L L F F - L W S E H G . . . . . . 4625 GTGAGATGGATATTGGCAGCACTGGGGGTATCAATGGGCTTAGATGAAGACATGCCAAAC V R W I L A A L G V S M G L D E D M P N - D G Y W Q H W G Y Q W A - M K T C Q T S E M D I G S T G G I N G L R - R H A K . . . . . . 4565 CTCGCCAAGACCTTCTGAATGTAGCTTCTCTGTGACAAGAAAAGTTTCCTTCTCTCTCTG L A K T F - M - L L C D K K S F L L S L S P R P S E C S F S V T R K V S F S L C P R Q D L L N V A S L - Q E K F P S L S . . . . . . 4505 TCTCTAATGATCTCCATCCCTAAAATCTTCCGAGCGGCTCCCAAATCCTTCATCTCAAAC S L M I S I P K I F R A A P K S F I S N L - - S P S L K S S E R L P N P S S Q T V S N D L H P - N L P S G S Q I L H L K . . . . . . 4445 TCAGCACTAAGTAAACCCTTCAGCTTCTGAATGTCATACTTCTTCTTTGCAGCTATCAAC S A L S K P F S F - M S Y F F F A A I N Q H - V N P S A S E C H T S S L Q L S T L S T K - T L Q L L N V I L L L C S Y Q . . . . . . 4385 ATATCATCTACATAAAGCACCAGATAGATGAATGAATCATCATTGAGCCTATTGTAGTAG I S S T - S T R - M N E S S L S L L - - Y H L H K A P D R - M N H H - A Y C S R H I I Y I K H Q I D E - I I I E P I V V . . . . . . 4325 ACACAACAATCATATGAGCTCCGAGTATAGCCCAACTTCACCATATAGCTGTCAAACCTT T Q Q S Y E L R V - P N F T I - L S N L H N N H M S S E Y S P T S P Y S C Q T F D T T I I - A P S I A Q L H H I A V K P . . . . . . 
4265 TTATACCACTGCCTTGGAGACTGCTTAAGTCCATATAAGGACTTCTTCAACTTGCAGACG L Y H C L G D C L S P Y K D F F N L Q T Y T T A L E T A - V H I R T S S T C R R F I P L P W R L L K S I - G L L Q L A D . . . 4205 TGATTTTCCTTCCCTGGAACTTGGAAAC - F S F P G T W K D F P S L E L G N V I F L P W N L E Maximal non-overlapping open reading frames (>= 64 codons): none PGL 2 (+ strand): 5480 9837 AGS-1 (5480 5667) SCR (e 0.954) Exon 1 5480 5667 ( 188 n); score: 0.954 PGS (5480 5666) SGN-E301194+ PGS (5484 5667) SGN-E301078+ PGS (5494 5667) SGN-E301194+ 3-phase translation of AGS-1 (+strand): . . . . . . 5480 CGCTGCCTGGTCACTATATATAGACGCTATGGCAAACCCTATTCTGTAATTCTGTTTTTG R C L V T I Y R R Y G K P Y S V I L F L A A W S L Y I D A M A N P I L - F C F C L P G H Y I - T L W Q T L F C N S V F . . . . . . 5540 CCTCTCCATAATAAAATTGCTCCCTCTCTTCCCGTGGACGTAGCCAATTTATTGGTGAAC P L H N K I A P S L P V D V A N L L V N L S I I K L L P L F P W T - P I Y W - T A S P - - N C S L S S R G R S Q F I G E . . . . . . 5600 CACGTAAATCTGTTGTCTTGTTTTTCGCATTTATATTTTCTCGTATTATCTCAAATTCCG H V N L L S C F S H L Y F L V L S Q I P T - I C C L V F R I Y I F S Y Y L K F R P R K S V V L F F A F I F S R I I S N S . 5660 CACAACAG H N T T A Q Q Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-1 (-strand): . . . . . . 5667 CTGTTGTGCGGAATTTGAGATAATACGAGAAAATATAAATGCGAAAAACAAGACAACAGA L L C G I - D N T R K Y K C E K Q D N R C C A E F E I I R E N I N A K N K T T D V V R N L R - Y E K I - M R K T R Q Q . . . . . . 5607 TTTACGTGGTTCACCAATAAATTGGCTACGTCCACGGGAAGAGAGGGAGCAATTTTATTA F T W F T N K L A T S T G R E G A I L L L R G S P I N W L R P R E E R E Q F Y Y I Y V V H Q - I G Y V H G K R G S N F I . . . . . . 5547 TGGAGAGGCAAAAACAGAATTACAGAATAGGGTTTGCCATAGCGTCTATATATAGTGACC W R G K N R I T E - G L P - R L Y I V T G E A K T E L Q N R V C H S V Y I - - P M E R Q K Q N Y R I G F A I A S I Y S D . 
5487 AGGCAGCG R Q G S Q A A Maximal non-overlapping open reading frames (>= 64 codons): none AGS-2 (5714 5885,6551 6586,6972 6984,7035 7223,9004 9163) SCR (e 0.942 d 0.000 a 0.000,e 0.528 d 0.630 a 0.353,e 0.769 d 0.000 a 0.000,e 0.852 d 0.000 a 0.447,e 0.841) Exon 1 5714 5885 ( 172 n); score: 0.942 Intron 1 5886 6550 ( 665 n); Pd: 0.000 Pa: 0.000 Exon 2 6551 6586 ( 36 n); score: 0.528 Intron 2 6587 6971 ( 385 n); Pd: 0.630 Pa: 0.353 Exon 3 6972 6984 ( 13 n); score: 0.769 Intron 3 6985 7034 ( 50 n); Pd: 0.000 Pa: 0.000 Exon 4 7035 7223 ( 189 n); score: 0.852 Intron 4 7224 9003 (1780 n); Pd: 0.000 Pa: 0.447 Exon 5 9004 9163 ( 160 n); score: 0.841 PGS (5714 5885,6551 6586,6972 6984,7035 7223,9004 9163) SGN-E551070+ PGS (5715 5885,6551 6586,6972 6984,7035 7223,9004 9149) SGN-E260038+ 3-phase translation of AGS-2 (+strand): . . . . . . 5714 TCTTTATATAATACCAATTACTTAAACTTAATTGCTCTAATTTTATTACTGCATTTCTTT S L Y N T N Y L N L I A L I L L L H F F L Y I I P I T - T - L L - F Y Y C I S L F I - Y Q L L K L N C S N F I T A F L . . . . . . 5774 GTAAGGCTACTAATCCACTATTTGGGTCTTCTCCTGTTATTAATGTATGTATTTTATTTG V R L L I H Y L G L L L L L M Y V F Y L - G Y - S T I W V F S C Y - C M Y F I C C K A T N P L F G S S P V I N V C I L F . . . . . . : 5834 TAAAATTATATGGGTTAGGTCCTAATGATACTAAATACTGAAATTCTGATGG : CTTCTAAT - N Y M G - V L M I L N T E I L M : A S N K I I W V R S - - Y - I L K F - W : L L I V K L Y G L G P N D T K Y - N S D G : F - . . . : . . : . 6559 ATTATTTCCATGTTCATTCCTTCTAAAG : ATTCATCATAAAA : TAATTATATAATTGTCTTT I I S M F I P S K : D S S - N : N Y I I V F L F P C S F L L K : I H H K : I I I - L S F Y Y F H V H S F - R : F I I K : - L Y N C L . . . . . . 7054 TTTATGTCTGACCATCTATTATCATATATTGTAATTAATGTCTTTGTTCCTAAGTTTTTC F M S D H L L S Y I V I N V F V P K F F L C L T I Y Y H I L - L M S L F L S F S F Y V - P S I I I Y C N - C L C S - V F . . . . . . 
7114 CTCGTTAATCCTTTAATTCCTATAACTATTAGTCCTATATGCATTAGATCTTTTTTAACG L V N P L I P I T I S P I C I R S F L T S L I L - F L - L L V L Y A L D L F - R P R - S F N S Y N Y - S Y M H - I F F N . . . . . : . 7174 TATTTTATTTCTTTTACAGATTCAGCGTTTATGAGTGGTAACGATACAGA : GTCGTTACAC Y F I S F T D S A F M S G N D T E : S L H I L F L L Q I Q R L - V V T I Q : S R Y T V F Y F F Y R F S V Y E W - R Y R : V V T . . . . . . 9014 TAACACTGATAAATGTTCTTCTCTATAATGTCTATATAGTTGAGATTTTGAATTTGTATT - H - - M F F S I M S I - L R F - I C I N T D K C S S L - C L Y S - D F E F V L L T L I N V L L Y N V Y I V E I L N L Y . . . . . . 9074 GTATAAAACTTTGATATTCAATAAATTTTATTGATTTTGTTGAAAGATTTGATATCCTTT V - N F D I Q - I L L I L L K D L I S F Y K T L I F N K F Y - F C - K I - Y P F C I K L - Y S I N F I D F V E R F D I L . . . 9134 TCTGTATCTATTATTTCTCCTAATTGTTGA S V S I I S P N C - L Y L L F L L I V F C I Y Y F S - L L Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0112G05.1-6+_PGL-2_AGS-2_PPS_1 (6983 6984,7035 7223,9004 9016) (frame '1'; 201 bp, 67 residues) 1 NNYIIVFFMS DHLLSYIVIN VFVPKFFLVN PLIPITISPI CIRSFLTYFI SFTDSAFMSG 61 NDTESLH- AGS-3 (7237 7957,7994 8150) SCR (e 0.842 d 0.000 a 0.000,e 0.944) Exon 1 7237 7957 ( 721 n); score: 0.842 Intron 1 7958 7993 ( 36 n); Pd: 0.000 Pa: 0.000 Exon 2 7994 8150 ( 157 n); score: 0.944 PGS (7237 7724) SGN-E356206- PGS (7237 7724) SGN-E356696- PGS (7237 7724) SGN-E351546- PGS (7237 7643) SGN-E392027+ PGS (7237 7642) SGN-E542084+ PGS (7237 7642) SGN-E370357+ PGS (7377 7639) SGN-E373117- PGS (7377 7595) SGN-E298638- PGS (7377 7569) SGN-E352844- PGS (7386 7642) SGN-E336814- PGS (7386 7639) SGN-E373116+ PGS (7396 7569) SGN-E368629- PGS (7402 7957,7994 8136) SGN-E546506+ PGS (7403 7569) SGN-E238551- PGS (7404 7653) SGN-E222578+ PGS (7550 7950) SGN-E246710- PGS (7649 7884) SGN-E209683- PGS (7706 7957,7994 8150) SGN-E349977- PGS (7972 8042) SGN-E546548- 3-phase translation of AGS-3 (+strand): . . . . . . 
7237 TGTCACGATCCAAATCGGGCCGCGACTAGCACCCACACTTACCCTCCTATGTGAGCGAAC C H D P N R A A T S T H T Y P P M - A N V T I Q I G P R L A P T L T L L C E R T S R S K S G R D - H P H L P S Y V S E . . . . . . 7297 CAACCAATCCAAACCCCAACATTTTCAAACATAGTAACAGAATATAATGCGGAAGACTTA Q P I Q T P T F S N I V T E Y N A E D L N Q S K P Q H F Q T - - Q N I M R K T - P T N P N P N I F K H S N R I - C G R L . . . . . . 7357 AACTCATTAATGAAAATCAATTAAATAACTTCTAAAAACTCAACAACTATTATTATCCCC N S L M K I N - I T S K N S T T I I I P T H - - K S I K - L L K T Q Q L L L S P K L I N E N Q L N N F - K L N N Y Y Y P . . . . . . 7417 AAAATCTGGAAGTCATCATCACAAGAACATCTACTTCAAATTACTAAATCTAAGATTATC K I W K S S S Q E H L L Q I T K S K I I K S G S H H H K N I Y F K L L N L R L S Q N L E V I I T R T S T S N Y - I - D Y . . . . . . 7477 TAAGAAGCTAAAATACATAAACAGCTAGTCCATGCCGGAACTTCAAGGCATCAAGACATG - E A K I H K Q L V H A G T S R H Q D M K K L K Y I N S - S M P E L Q G I K T - L R S - N T - T A S P C R N F K A S R H . . . . . . 7537 AAGAGGAGGATCCAGTCCAAGCTAGAAGCATTAGCTCACCCTGAAATCCGGAGTAATGAA K R R I Q S K L E A L A H P E I R S N E R G G S S P S - K H - L T L K S G V M K E E E D P V Q A R S I S S P - N P E - - . . . . . . 7597 GACTGGCTAGATTTGCGGTTGAGTTGAAGACGACAGAACGTTTGCTGCACTCCACAAATA D W L D L R L S - R R Q N V C C T P Q I T G - I C G - V E D D R T F A A L H K - R L A R F A V E L K T T E R L L H S T N . . . . . . 7657 ATCAAAAAGAAAACATACAAGTAGGGGTCAGTACAAAACACAGGTACTGAGTAGATATCA I K K K T Y K - G S V Q N T G T E - I S S K R K H T S R G Q Y K T Q V L S R Y H N Q K E N I Q V G V S T K H R Y - V D I . . . . . . 7717 TCGGCCAACTCAAAATAGAAAACAGTATATATCAGATAATATCATAAAATCAACTACAGT S A N S K - K T V Y I R - Y H K I N Y S R P T Q N R K Q Y I S D N I I K S T T V I G Q L K I E N S I Y Q I I S - N Q L Q . . . . . . 7777 ACTCAACATGCGGCATTTACAATTACCATAACCCTTGGTCGCAACACCAAGCTCATCAAT T Q H A A F T I T I T L G R N T K L I N L N M R H L Q L P - P L V A T P S S S M Y S T C G I Y N Y H N P W S Q H Q A H Q . . . . . . 
7837 GAGGACTCATGCCTCCCCATCATACTCATTTGGGAATTAAGTTCCTTAAATTGAGTATAT E D S C L P I I L I W E L S S L N - V Y R T H A S P S Y S F G N - V P - I E Y I - G L M P P H H T H L G I K F L K L S I . . . . . . 7897 TAACATATTTCAAGATTCATTCTCTTTACTAATCCTGGTGTCAGAACGTGACACCCGATC - H I S R F I L F T N P G V R T - H P I N I F Q D S F S L L I L V S E R D T R S L T Y F K I H S L Y - S W C Q N V T P D . : . . . . . 7957 C : TCCATATTCTATCCTGGTGTCGGAACGTGACACTCCGATCCTCATATACTATCCTGGTA : L H I L S W C R N V T L R S S Y T I L V : S I F Y P G V G T - H S D P H I L S W Y P : P Y S I L V S E R D T P I L I Y Y P G . . . . . . 8053 CCGGAACGTGGCACCCGATCCATATTCTATCCTGGTGTCGGAACGTGACACTCCGATCCT P E R G T R S I F Y P G V G T - H S D P R N V A P D P Y S I L V S E R D T P I L T G T W H P I H I L S W C R N V T L R S . . . . 8113 CATATACTATCCTGGTACCGGAACGTGACACCCGATCC H I L S W Y R N V T P D I Y Y P G T G T - H P I S Y T I L V P E R D T R S Maximal non-overlapping open reading frames (>= 64 codons): none AGS-4 (7508 9027,9807 9837) SCR (e 0.875 d 0.258 a 0.000,e 0.726) Exon 1 7508 9027 (1520 n); score: 0.875 Intron 1 9028 9806 ( 779 n); Pd: 0.258 Pa: 0.000 Exon 2 9807 9837 ( 31 n); score: 0.726 PGS (7508 8211) SGN-E241789+ PGS (7740 8158) SGN-E242359- PGS (8154 8501) SGN-E347579- PGS (8454 8966) SGN-E349296- PGS (8457 9027,9807 9837) SGN-E389553- PGS (8457 9014) SGN-E550212+ PGS (8457 9014) SGN-E550065+ PGS (8457 9014) SGN-E550201+ PGS (8457 9014) SGN-E550207+ PGS (8457 9014) SGN-E550335+ PGS (8457 9014) SGN-E390013+ PGS (8457 9014) SGN-E550484+ PGS (8457 9014) SGN-E550211+ PGS (8457 9014) SGN-E550464+ PGS (8457 9014) SGN-E549941+ PGS (8457 9014) SGN-E550025+ PGS (8457 9014) SGN-E396039+ PGS (8457 9014) SGN-E396056+ PGS (8457 9014) SGN-E377133+ PGS (8457 9014) SGN-E550127- PGS (8457 9012) SGN-E231589+ PGS (8457 9012) SGN-E374999+ PGS (8457 9003) SGN-E389834+ PGS (8457 9003) SGN-E396054+ PGS (8457 9003) SGN-E396058+ PGS (8457 9002) SGN-E241959+ PGS (8457 8930) SGN-E236652+ PGS (8457 8911) SGN-E396070+ 
PGS (8468 9014) SGN-E550322+ PGS (8493 9014) SGN-E396057- PGS (8503 8894) SGN-E356257- PGS (8509 9014) SGN-E377132- PGS (8525 9014) SGN-E396055- PGS (8527 9014) SGN-E398551- PGS (8537 9014) SGN-E396038- PGS (8571 8799) SGN-E396037+ PGS (8643 9014) SGN-E396069- PGS (8649 9014) SGN-E374998- 3-phase translation of AGS-4 (+strand): . . . . . . 7508 ATGCCGGAACTTCAAGGCATCAAGACATGAAGAGGAGGATCCAGTCCAAGCTAGAAGCAT M P E L Q G I K T - R G G S S P S - K H C R N F K A S R H E E E D P V Q A R S I A G T S R H Q D M K R R I Q S K L E A . . . . . . 7568 TAGCTCACCCTGAAATCCGGAGTAATGAAGACTGGCTAGATTTGCGGTTGAGTTGAAGAC - L T L K S G V M K T G - I C G - V E D S S P - N P E - - R L A R F A V E L K T L A H P E I R S N E D W L D L R L S - R . . . . . . 7628 GACAGAACGTTTGCTGCACTCCACAAATAATCAAAAAGAAAACATACAAGTAGGGGTCAG D R T F A A L H K - S K R K H T S R G Q T E R L L H S T N N Q K E N I Q V G V S R Q N V C C T P Q I I K K K T Y K - G S . . . . . . 7688 TACAAAACACAGGTACTGAGTAGATATCATCGGCCAACTCAAAATAGAAAACAGTATATA Y K T Q V L S R Y H R P T Q N R K Q Y I T K H R Y - V D I I G Q L K I E N S I Y V Q N T G T E - I S S A N S K - K T V Y . . . . . . 7748 TCAGATAATATCATAAAATCAACTACAGTACTCAACATGCGGCATTTACAATTACCATAA S D N I I K S T T V L N M R H L Q L P - Q I I S - N Q L Q Y S T C G I Y N Y H N I R - Y H K I N Y S T Q H A A F T I T I . . . . . . 7808 CCCTTGGTCGCAACACCAAGCTCATCAATGAGGACTCATGCCTCCCCATCATACTCATTT P L V A T P S S S M R T H A S P S Y S F P W S Q H Q A H Q - G L M P P H H T H L T L G R N T K L I N E D S C L P I I L I . . . . . . 7868 GGGAATTAAGTTCCTTAAATTGAGTATATTAACATATTTCAAGATTCATTCTCTTTACTA G N - V P - I E Y I N I F Q D S F S L L G I K F L K L S I L T Y F K I H S L Y - W E L S S L N - V Y - H I S R F I L F T . . . . . . 7928 ATCCTGGTGTCAGAACGTGACACCCGATCCATATATACTATCCTGGTACCGGAACGTGGC I L V S E R D T R S I Y T I L V P E R G S W C Q N V T P D P Y I L S W Y R N V A N P G V R T - H P I H I Y Y P G T G T W . . . . . . 
7988 ACCCGATCCATATTCTATCCTGGTGTCGGAACGTGACACTCCGATCCTCATATACTATCC T R S I F Y P G V G T - H S D P H I L S P D P Y S I L V S E R D T P I L I Y Y P H P I H I L S W C R N V T L R S S Y T I . . . . . . 8048 TGGTACCGGAACGTGGCACCCGATCCATATTCTATCCTGGTGTCGGAACGTGACACTCCG W Y R N V A P D P Y S I L V S E R D T P G T G T W H P I H I L S W C R N V T L R L V P E R G T R S I F Y P G V G T - H S . . . . . . 8108 ATCCTCATATACTATCCTGGTACCGGAACGTGACACCCGATCCCCTAATCTCACTACTTT I L I Y Y P G T G T - H P I P - S H Y F S S Y T I L V P E R D T R S P N L T T F D P H I L S W Y R N V T P D P L I S L L . . . . . . 8168 CGTTCATCAAGCCTTCTTGTATACTAAGGCATCATCATTAACAAAGTAGATTAGGGTTTC R S S S L L V Y - G I I I N K V D - G F V H Q A F L Y T K A S S L T K - I R V S S F I K P S C I L R H H H - Q S R L G F . . . . . . 8228 TTTTTCAAGATTTAGAATTCAATAGCTTCATCATGCTTATCTCATCACAATTATATAATC F F K I - N S I A S S C L S H H N Y I I F S R F R I Q - L H H A Y L I T I I - S L F Q D L E F N S F I M L I S S Q L Y N . . . . . . 8288 ACAATATGCAAACACACAATTAAGCATATAGAAGGGTTTACAACACTACCCAATACATAT T I C K H T I K H I E G F T T L P N T Y Q Y A N T Q L S I - K G L Q H Y P I H I H N M Q T H N - A Y R R V Y N T T Q Y I . . . . . . 8348 CATTCGATATTAAGAGTTTACTACGAATAGTGTAAAAACCATAACCTACCTCCATCGAAG H S I L R V Y Y E - C K N H N L P P S K I R Y - E F T T N S V K T I T Y L H R R S F D I K S L L R I V - K P - P T S I E . . . . . . 8408 ATTAGTGATCAAGCAAGCAAATTCCCCAAAGCTTTGTGTTTTCCTCTTCTCGTTCGATCC I S D Q A S K F P K A L C F P L L V R S L V I K Q A N S P K L C V F L F S F D P D - - S S K Q I P Q S F V F S S S R S I . . . . . . 8468 TCTCTCTCTTTTTGTTCTTTCTATTTTCTTTATTCAAACCCTCTTTCTTTTACCCTAATT S L S F C S F Y F L Y S N P L S F T L I L S L F V L S I F F I Q T L F L L P - L L S L F L F F L F S L F K P S F F Y P N . . . . . . 8528 AGCATATAATTAAGAATAAAAGATGGCAATAATAACCCACTAATTTACTCAAGGTTACCT S I - L R I K D G N N N P L I Y S R L P A Y N - E - K M A I I T H - F T Q G Y L - H I I K N K R W Q - - P T N L L K V T . . . . . . 
8588 TTTTTAACCCCCAAGTAATTAGACTTATTAACATTAACCCACTAACTTTATAATTAAAGC F L T P K - L D L L T L T H - L Y N - S F - P P S N - T Y - H - P T N F I I K A F F N P Q V I R L I N I N P L T L - L K . . . . . . 8648 AGGAATAGTAAAAAACGTCCCTTAAAACATTAAAGAAATCCGACTCAGCCTGGGATTATG R N S K K R P L K H - R N P T Q P G I M G I V K N V P - N I K E I R L S L G L C Q E - - K T S L K T L K K S D S A W D Y . . . . . . 8708 CAGCCTGTGACGACTCGTCGTGCCTGCGACGGTCCGTCTTGCTGCTCCGTCACAGAGTTC Q P V T T R R A C D G P S C C S V T E F S L - R L V V P A T V R L A A P S Q S S A A C D D S S C L R R S V L L L R H R V . . . . . . 8768 AGAGACTCAATTTCCCTTAAAGAGTCTGTGACGGTCCGTCACGCCTGTGACGGTCCGTCC R D S I S L K E S V T V R H A C D G P S E T Q F P L K S L - R S V T P V T V R P Q R L N F P - R V C D G P S R L - R S V . . . . . . 8828 TGCCATTCCGTTACAAAGTTCAGAGAGTCGATTTCAGTACCCATTTTTCAGAATTTCTAA C H S V T K F R E S I S V P I F Q N F - A I P L Q S S E S R F Q Y P F F R I S K L P F R Y K V Q R V D F S T H F S E F L . . . . . . 8888 GTGTTTTGAAACGAGACCCCTCGACGGTCCGTCGTGCCCATGACGGTCCGTCGTGGGATC V F - N E T P R R S V V P M T V R R G I C F E T R P L D G P S C P - R S V V G S S V L K R D P S T V R R A H D G P S W D . . . . . . 8948 CGTCGTCTCAACCATTTTTCCAGAAATAACATTTGTTGCTCAAAATGACTAAACAGGTCG R R L N H F S R N N I C C S K - L N R S V V S T I F P E I T F V A Q N D - T G R P S S Q P F F Q K - H L L L K M T K Q V . . : . . . . 
9008 TTACACTAACACTGATAAAT : ACTTAATATTTTTATAATTTTATTGTTTGAT L H - H - - I : L N I F I I L L F D Y T N T D K : Y L I F L - F Y C L V T L T L I N : T - Y F Y N F I V - Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0112G05.1-6+_PGL-2_AGS-4_PPS_1 (7929 8216) (frame '2'; 285 bp, 95 residues) 1 SWCQNVTPDP YILSWYRNVA PDPYSILVSE RDTPILIYYP GTGTWHPIHI LSWCRNVTLR 61 SSYTILVPER DTRSPNLTTF VHQAFLYTKA SSLTK- >C06HBa0112G05.1-6+_PGL-2_AGS-4_PPS_2 (8681 8887) (frame '1'; 204 bp, 68 residues) 1 RNPTQPGIMQ PVTTRRACDG PSCCSVTEFR DSISLKESVT VRHACDGPSC HSVTKFRESI 61 SVPIFQNF- AGS-5 (8457 9013,9751 9781) SCR (e 0.882 d 0.000 a 0.000,e 0.742) Exon 1 8457 9013 ( 557 n); score: 0.882 Intron 1 9014 9750 ( 737 n); Pd: 0.000 Pa: 0.000 Exon 2 9751 9781 ( 31 n); score: 0.742 PGS (8457 9013,9751 9781) SGN-E550140- 3-phase translation of AGS-5 (+strand): . . . . . . 8457 TCGTTCGATCCTCTCTCTCTTTTTGTTCTTTCTATTTTCTTTATTCAAACCCTCTTTCTT S F D P L S L F V L S I F F I Q T L F L R S I L S L F L F F L F S L F K P S F F V R S S L S F C S F Y F L Y S N P L S . . . . . . 8517 TTACCCTAATTAGCATATAATTAAGAATAAAAGATGGCAATAATAACCCACTAATTTACT L P - L A Y N - E - K M A I I T H - F T Y P N - H I I K N K R W Q - - P T N L L F T L I S I - L R I K D G N N N P L I Y . . . . . . 8577 CAAGGTTACCTTTTTTAACCCCCAAGTAATTAGACTTATTAACATTAACCCACTAACTTT Q G Y L F - P P S N - T Y - H - P T N F K V T F F N P Q V I R L I N I N P L T L S R L P F L T P K - L D L L T L T H - L . . . . . . 8637 ATAATTAAAGCAGGAATAGTAAAAAACGTCCCTTAAAACATTAAAGAAATCCGACTCAGC I I K A G I V K N V P - N I K E I R L S - L K Q E - - K T S L K T L K K S D S A Y N - S R N S K K R P L K H - R N P T Q . . . . . . 8697 CTGGGATTATGCAGCCTGTGACGACTCGTCGTGCCTGCGACGGTCCGTCTTGCTGCTCCG L G L C S L - R L V V P A T V R L A A P W D Y A A C D D S S C L R R S V L L L R P G I M Q P V T T R R A C D G P S C C S . . . . . . 
8757 TCACAGAGTTCAGAGACTCAATTTCCCTTAAAGAGTCTGTGACGGTCCGTCACGCCTGTG S Q S S E T Q F P L K S L - R S V T P V H R V Q R L N F P - R V C D G P S R L - V T E F R D S I S L K E S V T V R H A C . . . . . . 8817 ACGGTCCGTCCTGCCATTCCGTTACAAAGTTCAGAGAGTCGATTTCAGTACCCATTTTTC T V R P A I P L Q S S E S R F Q Y P F F R S V L P F R Y K V Q R V D F S T H F S D G P S C H S V T K F R E S I S V P I F . . . . . . 8877 AGAATTTCTAAGTGTTTTGAAACGAGACCCCTCGACGGTCCGTCGTGCCCATGACGGTCC R I S K C F E T R P L D G P S C P - R S E F L S V L K R D P S T V R R A H D G P Q N F - V F - N E T P R R S V V P M T V . . . . . . 8937 GTCGTGGGATCCGTCGTCTCAACCATTTTTCCAGAAATAACATTTGTTGCTCAAAATGAC V V G S V V S T I F P E I T F V A Q N D S W D P S S Q P F F Q K - H L L L K M T R R G I R R L N H F S R N N I C C S K - . . : . . . 8997 TAAACAGGTCGTTACAC : TTATTTTCTATCTTACTAAAAAACTGTTTTT - T G R Y T : Y F L S Y - K T V F K Q V V T : L I F Y L T K K L F L N R S L H : L F S I L L K N C F Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0112G05.1-6+_PGL-2_AGS-5_PPS_1 (8681 8887) (frame '0'; 204 bp, 68 residues) 1 RNPTQPGIMQ PVTTRRACDG PSCCSVTEFR DSISLKESVT VRHACDGPSC HSVTKFRESI 61 SVPIFQNF- ... finished at: Mon Aug 28 22:19:02 2006 ________________________________________________________________________________ Sequence 7: C06HBa0112G05.1-7, from 1 to 2199, both strands analyzed. ... started at: Mon Aug 28 22:19:02 2006 EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 4 EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 5 ... 
matches indexed, elapsed seconds = 5 HitsTableSize = 0 ******************************************************************************** EST sequence 1 -strand 628 n (File: SGN-E327106-) 1 GCTTTAATGT GTACAAATCC TACTCAGTAC TCCTTCACTC TCTGTTTCTA TANNGTAATT 61 ATCTTTCTAG TTTCTAGTTT AACTACTTTT TTTTTGGACT CTGTCCATTC ATGAATCTAT 121 TATTATTGTT CATCTCTTTC TCTTAACCTT TGTTCTTAAG AATAATCTTC TTTAACCACA 181 CCCCGGATTT GGTTATTCAT AGTTTTTTTT CTCAGAACAA TTAACCTGGC AATGTCCTAG 241 GAATTTACAG GTTAATCCAT CGATATTTTT AAGAAGCTAA GAGACTAACC ATATACTAAG 301 TGATATAGAT TCATGATGAT ATTTATAAAT TTACATGAAT ATTTAAATAA TCATAAAAGT 361 AAACACACAT GATTTACATA AAACACACAT AATTTACATA TATGAGAAGT TTACTAGAAA 421 CTTAAATATG AAAACTTACA TTATCTAGAA CAGAGTGAAG TTTTCCTTTC GGGAAACCCG 481 AACGACTTCC AAAGTAACTT TACAAAAACT TGAAACAAAA CCTAAGAACC TAAAGAAAAT 541 ATTTTTATAT TTTATTTTTA AAGAGGAAGG ATCTTTGAAG GGCTTAAGAT TTGCAGGTTG 601 TAGATTTTTT TGCTCTTTTT GCTTTCTT Predicted gene structure (within gDNA segment 2199 to 1): Exon 1 2199 1853 ( 347 n); cDNA 39 372 ( 334 n); score: 0.823 Intron 1 1852 1761 ( 92 n); Pd: 0.000 (s: 0.64), Pa: 0.000 (s: 0.70) Exon 2 1760 1699 ( 62 n); cDNA 373 433 ( 61 n); score: 0.694 Intron 2 1698 1358 ( 341 n); Pd: 0.000 (s: 0.64), Pa: 0.000 (s: 0) Exon 3 1357 1335 ( 23 n); cDNA 434 455 ( 22 n); score: 0.739 MATCH C06HBa0112G05.1-7- SGN-E327106- 0.803 432 0.688 C PGS_C06HBa0112G05.1-7-_SGN-E327106- (2199 1853,1760 1699,1357 1335) Alignment (genomic DNA sequence = upper lines): TCTCTTGTTC TATTATGTAA TTATCATATT AGTTTCTAGT TTAAACTTAC CTTTAAATTT 2140 ||||| ||| ||| |||| ||||| | | |||||||||| || ||| || | | ||| TCTCTGTTTC TATANNGTAA TTATCTTTCT AGTTTCTAGT TT-AAC-TA- C--T---TTT 90 TTTTTTATGG ACTCTGTCCA TTCATGAATC TATTATTACT GTTCATCTCT TTCCCTTAGC 2080 |||| | || |||||||||| |||||||||| |||||||| | |||||||||| ||| |||| | TTTT-T--GG ACTCTGTCCA TTCATGAATC TATTATTATT GTTCATCTCT TTCTCTTAAC 147 CTTTGTTCTT AGGAGTAATC TTCTAAAACC CCACTCCGGA TTCGGTTATT CGTAG-TTCT 2021 |||||||||| | || ||||| |||| |||| ||| ||||| || ||||||| | 
||| || | CTTTGTTCTT AAGAATAATC TTCTTTAACC ACACCCCGGA TTTGGTTATT CATAGTTTTT 207 TTTCTTAGAA GATTTAACCT GACATTGTCC TAGGAATTTA CAGGTTAATC CATCGATACT 1961 ||||| |||| | ||||||| | || ||||| |||||||||| |||||||||| |||||||| | TTTCTCAGAA CAATTAACCT GGCAATGTCC TAGGAATTTA CAGGTTAATC CATCGATATT 267 TTTAAGAGGC TAAGAGACTA ACCATATACT AAGTAAAATA GATTCATGAT GATATTCATA 1901 ||||||| || |||||||||| |||||||||| |||| | ||| |||||||||| |||||| ||| TTTAAGAAGC TAAGAGACTA ACCATATACT AAGTGATATA GATTCATGAT GATATTTATA 327 TATTTACATG AACTTATGAA AAAAATTAAA CACACATAAT TTACATATAT GGGAAGTTTA 1841 ||||||||| || ||| | || ||| | | | | || |||| AATTTACATG AA--TATTTA AATAATCATA -AAAGTAAAC ACACATGA.. .......... 372 ACAGAAACTT AAATATGAAA ACTTACATTA TTAGAACAGA GTGAAGTTTC CCTTTCGGGA 1781 .......... .......... .......... .......... .......... .......... 372 AACCCGAAAA ACTTCCAAAC TTTACTTAAA ACAAACTTTA AACTAAAGAA ATGAAAACTT 1721 ||||| |||| ||| || || | || | | |||| || || .......... .......... TTTACATAAA ACACAC-ATA ATTTACATAT ATGAGAAGTT 411 TTTTACAATC TCTAAAAAGA AAATTTTAGA GAGGAAAGAG GTTGTTTTTT TGAGGATCTT 1661 | || || | | || | || || TACTAGAAAC TTAAATATGA AA........ .......... .......... .......... 433 TTAGAGTTTT TTGTAGTGAT TGTCTTTGCT TTTCTTGCCT TCTATCTCTT CCTGTCTTGA 1601 .......... .......... .......... .......... .......... .......... 433 TAATGGCTGC TTTTATAGAG GTCTTCGGAC TTCAAACTTC GGATTTCAAA TGTAAAAAGG 1541 .......... .......... .......... .......... .......... .......... 433 GAATCTTATC ATCTCAGCCG GGTACACATC TGGGCCCCAT CGTCACGTGA AATAATGTCC 1481 .......... .......... .......... .......... .......... .......... 433 TTTTTCTTGC ATTATCGTCC TTGGAGTCTG ATATTGGGGT CTGATATCTG TCTATCTTAT 1421 .......... .......... .......... .......... .......... .......... 433 CTCTTTCCTG AATAAGACGT GTCCCAACAG TTTTTCCACA TGTGTGGTAG TGAGTAAAGT 1361 .......... .......... .......... .......... .......... .......... 
433 GGGACTCACA TTTTTTGACA AAAGAG 1335 ||| ||| || | | | | | |||| ...ACTTACA TTATCT-AGA ACAGAG 455 hqPGS_C06HBa0112G05.1-7-_SGN-E327106- (2199 1853,1760 1699,1357 1335) ******************************************************************************** EST sequence 3 -strand 439 n (File: SGN-E320805-) 1 TTCATAGTTT TTTTCTCAGA ACAATTAACC TGGCAATGTC NTAGGAATTT ACAGGTTAAT 61 CCATCGATAT TNTTAAGAAG CTAAGAGACT AACCATATAC TAAGTGATAT AGATTCATGA 121 TGATATTTAT AAATTTACAT GAATATTTAA ATAATCATAA AAGTAAACAC ACATGATTTA 181 CATAAAACAC ACATAATTTA CATATATGAG AAGTTTACTA GAAACTTAAA TATGAAAACT 241 TACATTATCT AGAACAGAGT GAAGTTTTCC TTTCGGGAAA CCCGAACGAC TTCCAAAGTA 301 ACTTTACAAA AACTTGAAAC AAAACCTAAG AACCTAAAGA AAATATTTTT ATATTTTATT 361 TTTAAAGAGG AAGGATCTTT GAAGGGCTTA AGATTTGCAG GTTGTAGATT TTTTTGCTCT 421 TTTTGCTTTC TTCGCTTTC Predicted gene structure (within gDNA segment 2199 to 1): Exon 1 1986 1978 ( 9 n); cDNA 132 140 ( 9 n); score: 0.889 Intron 1 1977 1921 ( 57 n); Pd: 0.720 (s: 0), Pa: 0.000 (s: 0.58) Exon 2 1920 1734 ( 187 n); cDNA 141 324 ( 184 n); score: 0.799 MATCH C06HBa0112G05.1-7- SGN-E320805- 0.799 196 0.446 C PGS_C06HBa0112G05.1-7-_SGN-E320805- (1986 1978,1920 1734) Alignment (genomic DNA sequence = upper lines): AATTTACAGG TTAATCCATC GATACTTTTA AGAGGCTAAG AGACTAACCA TATACTAAGT 1927 |||||||| AATTTACAT. .......... .......... .......... .......... .......... 
140 AAAATAGATT CATGATGATA TTCATATATT TACATGAACT TATGAAAAAA ATTAAACACA 1867 || | | | ||| ||||| | || | || |||| | || ||||||| ......GAAT ATTTA-AATA ATCATAAAAG TA-AA-CACA CATGATTTAC ATAAAACACA 191 CATAATTTAC ATATATGGGA AGTTTAACAG AAACTTAAAT ATGAAAACTT ACATTAT-TA 1808 |||||||||| ||||||| || |||||| || |||||||||| |||||||||| ||||||| || CATAATTTAC ATATATGAGA AGTTTACTAG AAACTTAAAT ATGAAAACTT ACATTATCTA 251 GAACAGAGTG AAGTTTCCCT TTCGGGAAAC CCGAAAAACT TCCAAACTTT ACTTAAAACA 1748 |||||||||| |||||| ||| |||||||||| ||||| ||| |||||| | |||| | | | GAACAGAGTG AAGTTTTCCT TTCGGGAAAC CCGAACGACT TCCAAA-GTA ACTTTACAAA 310 AACTTTAAAC TAAA 1734 ||||| |||| ||| AACTTGAAAC AAAA 324 hqPGS_C06HBa0112G05.1-7-_SGN-E320805- (1986 1978,1920 1734) ******************************************************************************** EST sequence 4 -strand 618 n (File: SGN-E368232-) 1 AATTCGGTTA AACCTTTAAT ATTGTACAAT CCTACTCAAG TACTCCTTCA CTATAAATTA 61 TCATATTAGT TTCTAGTTTC AACTTCCCTT TTTTTTTTAT GGACTCTGTC CATTCATGAA 121 TCTATCTATT ATTATTGTAC ATCTCTTTCT CTTAGCCTTT GTTCTTAAAA GTAATCTTCT 181 TTAACCACAC CCCGGATTTG GTTATTCATA GTTTTTTTCT CAGAACATTT AACCTGACAA 241 TGTCCTAGGA ATTTACAGGT TAATCCATCG ATACTTTTAA GAAGCTAAGA GACTAACCAT 301 ATACTAAGTA ATATAGATTC ATGATGATAT TCATAAATTT ACATGAATTT ATCAAATAAT 361 TAACATTTTA TAAACACACA TAATTTACAT ATATGGGAAG TTTACTAGAA ACATAAATAT 421 GAAAACTTAC ATTATTTAGA ACAGAGTGAA GTTTCCCTTT CGGAAAACCC GAAAAGCTTC 481 CAAACTTTTA CTTAAAACAA ACTTTTAGAA TAAAGAACTG AAATCTTTAT TACAATCTTT 541 TAAAAGATAA TTTTAAAGAG AAAAGGGGTT GTTTTGAGGA TTTTGAGAGA GTTTTTGTTG 601 TGGTTGTCTT AGAACTAG Predicted gene structure (within gDNA segment 2199 to 1): Exon 1 2196 1635 ( 562 n); cDNA 43 610 ( 568 n); score: 0.823 MATCH C06HBa0112G05.1-7- SGN-E368232- 0.823 562 0.909 C PGS_C06HBa0112G05.1-7-_SGN-E368232- (2196 1635) Alignment (genomic DNA sequence = upper lines): CTTGTTCTAT TATGTAATTA TCATATTAGT TTCTAGTTTA AACTTACCTT TAAATTTTTT 2137 || ||| | ||| ||||| |||||||||| ||||||||| ||||| || | |||||| CTCCTTC-AC TAT-AAATTA 
TCATATTAGT TTCTAGTTTC AACTTCCC-- T---TTTTTT 95 TTTATGGACT CTGTCCATTC ATGAATCTAT -TATTACTGT T---CATCTC TTTCCCTTAG 2081 |||||||||| |||||||||| |||||||||| ||||| | | | |||||| |||| ||||| TTTATGGACT CTGTCCATTC ATGAATCTAT CTATTATTAT TGTACATCTC TTTCTCTTAG 155 CCTTTGTTCT TAGGAGTAAT CTTCTAAAAC CCCACTCCGG ATTCGGTTAT TCGTAGTTCT 2021 |||||||||| || |||||| ||||| ||| | ||| |||| ||| |||||| || ||||| | CCTTTGTTCT TAAAAGTAAT CTTCTTTAAC CACACCCCGG ATTTGGTTAT TCATAGTTTT 215 TTTCTTAGAA GATTTAACCT GACATTGTCC TAGGAATTTA CAGGTTAATC CATCGATACT 1961 ||||| |||| ||||||||| |||| ||||| |||||||||| |||||||||| |||||||||| TTTCTCAGAA CATTTAACCT GACAATGTCC TAGGAATTTA CAGGTTAATC CATCGATACT 275 TTTAAGAGGC TAAGAGACTA ACCATATACT AAGTAAAATA GATTCATGAT GATATTCATA 1901 ||||||| || |||||||||| |||||||||| |||||| ||| |||||||||| |||||||||| TTTAAGAAGC TAAGAGACTA ACCATATACT AAGTAATATA GATTCATGAT GATATTCATA 335 TATTTACATG AACTTAT-GA A-AA--AA-A ---T-TAAAC ACACATAATT TACATATATG 1850 ||||||||| || |||| | | || || | | ||||| |||||||||| |||||||||| AATTTACATG AATTTATCAA ATAATTAACA TTTTATAAAC ACACATAATT TACATATATG 395 GGAAGTTTAA CAGAAACTTA AATATGAAAA CTTACATTA- TTAGAACAGA GTGAAGTTTC 1791 ||||||||| |||||| || |||||||||| ||||||||| |||||||||| |||||||||| GGAAGTTTAC TAGAAACATA AATATGAAAA CTTACATTAT TTAGAACAGA GTGAAGTTTC 455 CCTTTCGGGA AACCCGAAAA ACTTCCAAAC -TTTACTTAA AACAAAC-TT TAAACTAAAG 1733 |||||||| | |||||||||| ||||||||| ||||||||| ||||||| || || | ||||| CCTTTCGGAA AACCCGAAAA GCTTCCAAAC TTTTACTTAA AACAAACTTT TAGAATAAAG 515 AAATGAAAAC TTTTTTACAA TCTCTAAAAA GAAAATTTTA GAGAGGAAAG AGGTTGTTTT 1673 || ||||| | ||| |||||| ||| | |||| || ||||||| ||| |||| ||| ||| | AACTGAAATC TTTATTACAA TCTTTTAAAA GATAATTTTA AAGA-GAAA- AGG-GGTTGT 572 TTTGAGGATC TTTTAGAG-T TTTTTGTAGT GATTGTCTT 1635 ||||||||| ||| |||| ||||||| || | ||||||| TTTGAGGAT- TTTGAGAGAG TTTTTGTTGT GGTTGTCTT 610 hqPGS_C06HBa0112G05.1-7-_SGN-E368232- (2196 1635) ******************************************************************************** EST sequence 2 -strand 540 n (File: SGN-E257656-) 
1 TAATTATCAT ATTAGTTTCT AGTTTTTCTT CCCTTTTTTT TATGGACTTT GTCCACTCAT 61 GAATAGTATT ATTGTTCATC TCTTTCTCTT AGCTTTTGTT CTTAAAAGTA ATCTTCTTTA 121 ACCACACCCC TGATTTGGTT ATTCATAGTT CTTCTCTAAG AACATTTAGT TTAACCTAAC 181 AATGTCCTAG GAATTTACAG GTTAATCCAT CGATACTTTT AAGAGGCTAA GAGACTAACC 241 ATATACTAAG TAATGTAGAT TCATGAATTT ATCAAATAAT TAACATATCA CATGTAATTT 301 ACCAAACACA AATAATTTAC ATATATGGGA AGTTTACTAG AAACGTAAAT ATGAAAACTT 361 ACATTCTTTA GAACAGAGTG AAGTTTCCCT TTCGGGAAAC CCGAAAAACT TCCAAACTGA 421 CTTAACACAA ACTTTTAAAC AAAGAACTGA AAATATTTAT ATATATATAT ATTTTTATTA 481 TTTTTTATTT TTTAAAAGGT AATTCTGAAG AGGAAAGAGA TTTTGAAGTC TTTGGAATTT Predicted gene structure (within gDNA segment 2199 to 1): Exon 1 2182 1716 ( 467 n); cDNA 1 461 ( 461 n); score: 0.821 MATCH C06HBa0112G05.1-7- SGN-E257656- 0.821 467 0.865 C PGS_C06HBa0112G05.1-7-_SGN-E257656- (2182 1716) Alignment (genomic DNA sequence = upper lines): TAATTATCAT ATTAGTTTCT AGTTTAAACT TACCTTTAAA TTTTTTTTTA TGGACTCTGT 2123 |||||||||| |||||||||| ||||| | | ||| ||||||||| |||||| ||| TAATTATCAT ATTAGTTTCT AGTTT----T T--CTT--CC CTTTTTTTTA TGGACTTTGT 52 CCATTCATGA ATCTATTATT ACTGTTCATC TCTTTCCCTT AGCCTTTGTT CTTAGGAGTA 2063 ||| |||||| | || |||| | |||||||| |||||| ||| ||| |||||| |||| |||| CCACTCATGA A--TAGTATT ATTGTTCATC TCTTTCTCTT AGCTTTTGTT CTTAAAAGTA 110 ATCTTCTAAA ACCCCACTCC GGATTCGGTT ATTCGTAGTT CTTTTCTTAG -A-A--GA-T 2008 ||||||| | ||| ||| || |||| |||| |||| ||||| ||| ||| || | | | | ATCTTCTTTA ACCACACCCC TGATTTGGTT ATTCATAGTT CTTCTCTAAG AACATTTAGT 170 TTAACCTGAC ATTGTCCTAG GAATTTACAG GTTAATCCAT CGATACTTTT AAGAGGCTAA 1948 ||||||| || | |||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTAACCTAAC AATGTCCTAG GAATTTACAG GTTAATCCAT CGATACTTTT AAGAGGCTAA 230 GAGACTAACC ATATACTAAG TAAAATAGAT TCATGATGAT ATTCATAT-A TTTACATGAA 1889 |||||||||| |||||||||| ||| ||||| |||||| | | ||| || | || |||| | GAGACTAACC ATATACTAAG TAATGTAGAT TCATGAATTT A-TCAAATAA TTAACAT-AT 288 CTTATGAAAA AAATTAAACA CACATAATTT ACATATATGG GAAGTTTAAC AGAAACTTAA 1829 | ||| || | ||||| 
|| ||||||| |||||||||| |||||||| |||||| ||| CACATGTAAT TTACCAAACA CAAATAATTT ACATATATGG GAAGTTTACT AGAAACGTAA 348 ATATGAAAAC TTACATT-AT TAGAACAGAG TGAAGTTTCC CTTTCGGGAA ACCCGAAAAA 1770 |||||||||| ||||||| | |||||||||| |||||||||| |||||||||| |||||||||| ATATGAAAAC TTACATTCTT TAGAACAGAG TGAAGTTTCC CTTTCGGGAA ACCCGAAAAA 408 CTTCCAAACT TTACTTAAAA CAAACTTTAA ACTAAAGAAA TGAAAACTTT TTTA 1716 ||||||||| | |||||| | |||||||| | | |||||| |||||| || | || CTTCCAAAC- TGACTTAACA CAAACTTTTA AACAAAGAAC TGAAAATATT TATA 461 hqPGS_C06HBa0112G05.1-7-_SGN-E257656- (2182 1716) Total number of EST alignments reported: 4 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 2199: PGL 1 (- strand): 2199 1335 AGS-1 (2199 1853,1760 1699,1357 1335) SCR (e 0.823 d 0.000 a 0.000,e 0.694 d 0.000 a 0.000,e 0.739) Exon 1 2199 1853 ( 347 n); score: 0.823 Intron 1 1852 1761 ( 92 n); Pd: 0.000 Pa: 0.000 Exon 2 1760 1699 ( 62 n); score: 0.694 Intron 2 1698 1358 ( 341 n); Pd: 0.000 Pa: 0.000 Exon 3 1357 1335 ( 23 n); score: 0.739 PGS (2199 1853,1760 1699,1357 1335) SGN-E327106- 3-phase translation of AGS-1 (-strand): . . . . . . 2199 TCTCTTGTTCTATTATGTAATTATCATATTAGTTTCTAGTTTAAACTTACCTTTAAATTT S L V L L C N Y H I S F - F K L T F K F L L F Y Y V I I I L V S S L N L P L N F S C S I M - L S Y - F L V - T Y L - I . . . . . . 2139 TTTTTTATGGACTCTGTCCATTCATGAATCTATTATTACTGTTCATCTCTTTCCCTTAGC F F M D S V H S - I Y Y Y C S S L S L S F L W T L S I H E S I I T V H L F P L A F F Y G L C P F M N L L L L F I S F P - . . . . . . 2079 CTTTGTTCTTAGGAGTAATCTTCTAAAACCCCACTCCGGATTCGGTTATTCGTAGTTCTT L C S - E - S S K T P L R I R L F V V L F V L R S N L L K P H S G F G Y S - F F P L F L G V I F - N P T P D S V I R S S . . . . . . 2019 TTCTTAGAAGATTTAACCTGACATTGTCCTAGGAATTTACAGGTTAATCCATCGATACTT F L E D L T - H C P R N L Q V N P S I L S - K I - P D I V L G I Y R L I H R Y F F L R R F N L T L S - E F T G - S I D T . . . . . . 
1959 TTAAGAGGCTAAGAGACTAACCATATACTAAGTAAAATAGATTCATGATGATATTCATAT L R G - E T N H I L S K I D S - - Y S Y - E A K R L T I Y - V K - I H D D I H I F K R L R D - P Y T K - N R F M M I F I . . . . . : . 1899 ATTTACATGAACTTATGAAAAAAATTAAACACACATAATTTACATAT : TTTACTTAAAACA I Y M N L - K K L N T H N L H I : L L K T F T - T Y E K N - T H I I Y I : F Y L K Q Y L H E L M K K I K H T - F T Y : F T - N . . . . . : . 1747 AACTTTAAACTAAAGAAATGAAAACTTTTTTACAATCTCTAAAAAGAAA : ACTCACATTTT N F K L K K - K L F Y N L - K E : N S H F T L N - R N E N F F T I S K K K : T H I F K L - T K E M K T F L Q S L K R K : L T F . . 1346 TTGACAAAAGAG L T K E - Q K F D K R Maximal non-overlapping open reading frames (>= 64 codons): none AGS-2 (2196 1635) SCR (e 0.823) Exon 1 2196 1635 ( 562 n); score: 0.823 PGS (2196 1635) SGN-E368232- PGS (2182 1716) SGN-E257656- 3-phase translation of AGS-2 (-strand): . . . . . . 2196 CTTGTTCTATTATGTAATTATCATATTAGTTTCTAGTTTAAACTTACCTTTAAATTTTTT L V L L C N Y H I S F - F K L T F K F F L F Y Y V I I I L V S S L N L P L N F F C S I M - L S Y - F L V - T Y L - I F . . . . . . 2136 TTTATGGACTCTGTCCATTCATGAATCTATTATTACTGTTCATCTCTTTCCCTTAGCCTT F M D S V H S - I Y Y Y C S S L S L S L L W T L S I H E S I I T V H L F P L A F F Y G L C P F M N L L L L F I S F P - P . . . . . . 2076 TGTTCTTAGGAGTAATCTTCTAAAACCCCACTCCGGATTCGGTTATTCGTAGTTCTTTTC C S - E - S S K T P L R I R L F V V L F V L R S N L L K P H S G F G Y S - F F S L F L G V I F - N P T P D S V I R S S F . . . . . . 2016 TTAGAAGATTTAACCTGACATTGTCCTAGGAATTTACAGGTTAATCCATCGATACTTTTA L E D L T - H C P R N L Q V N P S I L L - K I - P D I V L G I Y R L I H R Y F - L R R F N L T L S - E F T G - S I D T F . . . . . . 1956 AGAGGCTAAGAGACTAACCATATACTAAGTAAAATAGATTCATGATGATATTCATATATT R G - E T N H I L S K I D S - - Y S Y I E A K R L T I Y - V K - I H D D I H I F K R L R D - P Y T K - N R F M M I F I Y . . . . . . 
1896 TACATGAACTTATGAAAAAAATTAAACACACATAATTTACATATATGGGAAGTTTAACAG Y M N L - K K L N T H N L H I W E V - Q T - T Y E K N - T H I I Y I Y G K F N R L H E L M K K I K H T - F T Y M G S L T . . . . . . 1836 AAACTTAAATATGAAAACTTACATTATTAGAACAGAGTGAAGTTTCCCTTTCGGGAAACC K L K Y E N L H Y - N R V K F P F R E T N L N M K T Y I I R T E - S F P F G K P E T - I - K L T L L E Q S E V S L S G N . . . . . . 1776 CGAAAAACTTCCAAACTTTACTTAAAACAAACTTTAAACTAAAGAAATGAAAACTTTTTT R K T S K L Y L K Q T L N - R N E N F F E K L P N F T - N K L - T K E M K T F L P K N F Q T L L K T N F K L K K - K L F . . . . . . 1716 ACAATCTCTAAAAAGAAAATTTTAGAGAGGAAAGAGGTTGTTTTTTTGAGGATCTTTTAG T I S K K K I L E R K E V V F L R I F - Q S L K R K F - R G K R L F F - G S F R Y N L - K E N F R E E R G C F F E D L L . . . 1656 AGTTTTTTGTAGTGATTGTCTT S F L - - L S V F C S D C L E F F V V I V Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-2 (+strand): . . . . . . 1635 AAGACAATCACTACAAAAAACTCTAAAAGATCCTCAAAAAAACAACCTCTTTCCTCTCTA K T I T T K N S K R S S K K Q P L S S L R Q S L Q K T L K D P Q K N N L F P L - D N H Y K K L - K I L K K T T S F L S . . . . . . 1695 AAATTTTCTTTTTAGAGATTGTAAAAAAGTTTTCATTTCTTTAGTTTAAAGTTTGTTTTA K F S F - R L - K S F H F F S L K F V L N F L F R D C K K V F I S L V - S L F - K I F F L E I V K K F S F L - F K V C F . . . . . . 1755 AGTAAAGTTTGGAAGTTTTTCGGGTTTCCCGAAAGGGAAACTTCACTCTGTTCTAATAAT S K V W K F F G F P E R E T S L C S N N V K F G S F S G F P K G K L H S V L I M K - S L E V F R V S R K G N F T L F - - . . . . . . 1815 GTAAGTTTTCATATTTAAGTTTCTGTTAAACTTCCCATATATGTAAATTATGTGTGTTTA V S F H I - V S V K L P I Y V N Y V C L - V F I F K F L L N F P Y M - I M C V - C K F S Y L S F C - T S H I C K L C V F . . . . . . 1875 ATTTTTTTCATAAGTTCATGTAAATATATGAATATCATCATGAATCTATTTTACTTAGTA I F F I S S C K Y M N I I M N L F Y L V F F S - V H V N I - I S S - I Y F T - Y N F F H K F M - I Y E Y H H E S I L L S . . . . . . 
1935 TATGGTTAGTCTCTTAGCCTCTTAAAAGTATCGATGGATTAACCTGTAAATTCCTAGGAC Y G - S L S L L K V S M D - P V N S - D M V S L L A S - K Y R W I N L - I P R T I W L V S - P L K S I D G L T C K F L G . . . . . . 1995 AATGTCAGGTTAAATCTTCTAAGAAAAGAACTACGAATAACCGAATCCGGAGTGGGGTTT N V R L N L L R K E L R I T E S G V G F M S G - I F - E K N Y E - P N P E W G F Q C Q V K S S K K R T T N N R I R S G V . . . . . . 2055 TAGAAGATTACTCCTAAGAACAAAGGCTAAGGGAAAGAGATGAACAGTAATAATAGATTC - K I T P K N K G - G K E M N S N N R F R R L L L R T K A K G K R - T V I I D S L E D Y S - E Q R L R E R D E Q - - - I . . . . . . 2115 ATGAATGGACAGAGTCCATAAAAAAAAATTTAAAGGTAAGTTTAAACTAGAAACTAATAT M N G Q S P - K K I - R - V - T R N - Y - M D R V H K K K F K G K F K L E T N M H E W T E S I K K N L K V S L N - K L I . . . 2175 GATAATTACATAATAGAACAAG D N Y I I E Q I I T - - N K - - L H N R T Maximal non-overlapping open reading frames (>= 64 codons): none AGS-3 (1986 1978,1920 1734) SCR (e 0.889 d 0.720 a 0.000,e 0.799) Exon 1 1986 1978 ( 9 n); score: 0.889 Intron 1 1977 1921 ( 57 n); Pd: 0.720 Pa: 0.000 Exon 2 1920 1734 ( 187 n); score: 0.799 PGS (1986 1978,1920 1734) SGN-E320805- 3-phase translation of AGS-3 (-strand): . : . . . . . 1986 AATTTACAG : GATTCATGATGATATTCATATATTTACATGAACTTATGAAAAAAATTAAAC N L Q : D S - - Y S Y I Y M N L - K K L N I Y R : I H D D I H I F T - T Y E K N - T F T : G F M M I F I Y L H E L M K K I K . . . . . . 1869 ACACATAATTTACATATATGGGAAGTTTAACAGAAACTTAAATATGAAAACTTACATTAT T H N L H I W E V - Q K L K Y E N L H Y H I I Y I Y G K F N R N L N M K T Y I I H T - F T Y M G S L T E T - I - K L T L . . . . . . 1809 TAGAACAGAGTGAAGTTTCCCTTTCGGGAAACCCGAAAAACTTCCAAACTTTACTTAAAA - N R V K F P F R E T R K T S K L Y L K R T E - S F P F G K P E K L P N F T - N L E Q S E V S L S G N P K N F Q T L L K . . 1749 CAAACTTTAAACTAAA Q T L N - K L - T K T N F K L Maximal non-overlapping open reading frames (>= 64 codons): none ... 
finished at: Mon Aug 28 22:19:15 2006
________________________________________________________________________________
Sequence 8: C06HBa0112G05.1-8, from 1 to 1361, both strands analyzed.
... started at: Mon Aug 28 22:19:15 2006
EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA +strand ...
... found all matches, elapsed seconds = 4
... matches indexed, elapsed seconds = 5
HitsTableSize = 0
EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA -strand ...
... found all matches, elapsed seconds = 5
... matches indexed, elapsed seconds = 5
HitsTableSize = 0
No significant EST matches were found.
... finished at: Mon Aug 28 22:19:25 2006
________________________________________________________________________________
Sequence 9: C06HBa0112G05.1-9, from 1 to 15823, both strands analyzed.
... started at: Mon Aug 28 22:19:25 2006
EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA +strand ...
... found all matches, elapsed seconds = 5
... matches indexed, elapsed seconds = 5
HitsTableSize = 8
EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA -strand ...
... found all matches, elapsed seconds = 5
...
matches indexed, elapsed seconds = 5 HitsTableSize = 8 ******************************************************************************** EST sequence 14 -strand 785 n (File: SGN-E544264-) 1 CTTGATTCAG AAGTTAGCCC CACCAGCGGC AAAATTGACG ATAATCGAGT CACTGGGAAA 61 ACAAAAACGC ATAACAAACA TTCCCAATGT ACCCAAGCCT CAATGCAACA CTTCTAAACT 121 TCGCCATCCA AACTTTTTCG GCGGGATTAT GCATATGAAG ATTACCATAG CTGATGTAGA 181 TGTAGTTGGC TAAAGACCAA ATAAGAAGAA CGGTAAACAT CGCAGCGAAA ATAAGTTCAA 241 CCGCGTTTAC AATTCCAAGA GATTCAATTA CAAGCCGTGG CCGTCTAAAA TATTCCCGCA 301 CTTTATTACT ACTCCTAACA ATATAATTGT ATGTTAGTTA AAAGAGATAT ATCAAAGATA 361 GATCATTAAT AATTAGTATG TAATGTTAGG ATCGAAAATA AGCAGGTGTA AACGCGGAAG 421 CTAGCAAAGC AAACCTCAAA AGACTACGAG TAAGAAGACA ACGAGAAATA TACCCAAAGA 481 CACAAAGATT TAACGTGGTT GCAAATAAAA CTCAACATGT AATTGATTAA CCTGTGATTA 541 GAAGTTGACT TGTTGTAAAG ATGAAGATAA ACACAGCTCA CAGCAGCTAT CAACATTATG 601 GGAAATGTAA ATAAAAGAAG ATTTATCCCT TGTTCCCTGA AATATGTAGA GTTGAGTTTG 661 ATTAGTAGAT GTGGAGTCCA TGAATTTTTG TAAGTAGGTG TAGGTAACAT TACCCATATG 721 AAAAGCCATC CAACAAACAC CAAAACCACA AAACCATTCA AAATTGTCTT GCTCCCCATA 781 TTTTT Predicted gene structure (within gDNA segment 1 to 8618): Exon 1 1 7 ( 7 n); cDNA 340 346 ( 7 n); score: 0.714 Intron 1 8 656 ( 649 n); Pd: 0.900 (s: 0), Pa: 0.000 (s: 0) Exon 2 657 677 ( 21 n); cDNA 347 367 ( 21 n); score: 0.714 Intron 2 678 3118 (2441 n); Pd: 0.900 (s: 0), Pa: 0.000 (s: 0) Exon 3 3119 3132 ( 14 n); cDNA 368 380 ( 13 n); score: 0.786 Intron 3 3133 5048 (1916 n); Pd: 0.065 (s: 0), Pa: 0.000 (s: 0.96) Exon 4 5049 5168 ( 120 n); cDNA 381 500 ( 120 n); score: 0.950 Intron 4 5169 5460 ( 292 n); Pd: 0.000 (s: 0.96), Pa: 0.000 (s: 0) Exon 5 5461 5490 ( 30 n); cDNA 501 531 ( 31 n); score: 0.717 MATCH C06HBa0112G05.1-9+ SGN-E544264- 0.950 192 0.245 C PGS_C06HBa0112G05.1-9+_SGN-E544264- (1 7,657 677,3119 3132,5049 5168,5461 5490) Alignment (genomic DNA sequence = upper lines): AAAAAAAGTA TCATTGTATT GGGAGAAGGT ACCATTAGTG TTGCATTTTG AACACAAACG 60 |||| | AAAAGAG... .......... 
.......... .......... .......... .......... 346 CAGCAGAGAT GAGAATAATG AATAGACAGA AAAATGAATG TTTGGAAGGG TTCCATAAAT 120 .......... .......... .......... .......... .......... .......... 346 TTTCATCACT AATGTATGGA TTAGAATAAA ATAAACATAT ACCCTCGTAG ACACCGATAT 180 .......... .......... .......... .......... .......... .......... 346 TTTTTTACAT TTTTTTAGTT TTTATTTTAA CGGAAGTAGA AAAAGGATAC ATAGCGCTGG 240 .......... .......... .......... .......... .......... .......... 346 AAGAAGTTTG AAAATGTTCC AAGAAGTCAA ATTTTAGCAT CAATTCAAGA AAATTGGCCG 300 .......... .......... .......... .......... .......... .......... 346 CTTTGTAAGT AATTTTTATT TTAATCCTTA AAATTTTCTT TTTTTTGTTT TCAAATTGGA 360 .......... .......... .......... .......... .......... .......... 346 TATGTGATCT ACGTGTAAAG TGTGTAATGA GAGATTATTG AGGATATGTT ATATCGACAT 420 .......... .......... .......... .......... .......... .......... 346 ATTCATATTA TTACATGTAA TATTATCTGA ATTATTGCAA ATTTAACATT CAAATGTTAT 480 .......... .......... .......... .......... .......... .......... 346 GGATTTTATC TAATGAATAT GAATAAAATA AATATTATTA TTTGAACATT TTCTATATAA 540 .......... .......... .......... .......... .......... .......... 346 TAATCCAATA ATAATAAACG TGCAACGCAC GTTCCCAATG ACTAGTATAT ATATATATAT 600 .......... .......... .......... .......... .......... .......... 346 ATATATATAG TAAAGTAATA ATGTGAGATG GTGATACTAT GATGATATGA TAGAAGATAT 660 |||| .......... .......... .......... .......... .......... ......ATAT 350 ATAGATGATA GTCAATTATA TGCTTACTCA TAAAGAAGGT AAACATATGA GAATCCACCT 720 || | |||| | ||| ATCAAAGATA GATCATT... .......... .......... .......... .......... 367 GGGGTTAAGA CTGAATATAT AGGATTCTAA ACACTCCAAA TCTTGACCCA ACGACTACCC 780 .......... .......... .......... .......... .......... .......... 367 AAATTCTTAA GCTAGGACTA ACACCTAGCA AACCACAACC TGCTAGGAAG ACCATTGCAT 840 .......... .......... .......... .......... .......... .......... 367 TCAATCACCA AACTCAAATT AAAGTCCGAA TTCTCCGAGA CTTTGAACAA GATGTTACCT 900 .......... .......... .......... 
.......... .......... .......... 367 TTTCACCGTT GTTTGAGAAA TAATGTTGAT CATGTAATTT GGTAAAAGAA AGAACGAAAT 960 .......... .......... .......... .......... .......... .......... 367 AGGTAGATCA GTTAGGACTT TGATCAAACA CATAATGTAG AATCTGTTGG TGCTGTCATT 1020 .......... .......... .......... .......... .......... .......... 367 TAGTCTACAG AACATGTTTT TAATAACCTT TTGCATAATG TAATTAGTCT TCAAAAAGAG 1080 .......... .......... .......... .......... .......... .......... 367 TTTCAACAAA TTAAATAAAA AGTAACCATT GCGCTTTTAA TCAGAAAGTT TGAGGCTTGA 1140 .......... .......... .......... .......... .......... .......... 367 GTGAAGACAA ACAAAATCGG ATCAAGTTAG GACGGTTCCT TGCAATAGCA ACCAAGGCGG 1200 .......... .......... .......... .......... .......... .......... 367 TATTTGTCAT TTGGCAGCAC AAGTATAGAA ACGAGTGGAG CAACAAGGCC TCTCTCTGTC 1260 .......... .......... .......... .......... .......... .......... 367 AAGGACACAT CCGGTTCACT GAAATAACGG GATATTCTTC CGGCCAACTA CTTGAAGCTC 1320 .......... .......... .......... .......... .......... .......... 367 GTCTCCTCCC CTCATATCTT CAGCATAAAT CTTAATCTTC TTAGCGGAAT CTTCAAGTTG 1380 .......... .......... .......... .......... .......... .......... 367 AGGGCTCTCT ACAAGTTTGA GTGAAATAAT ATCCCCAAAA CTAGGCGGAA TCTCCTCAAG 1440 .......... .......... .......... .......... .......... .......... 367 ATCACAACAT TCATCCAATT TTAATTTCTC AAGGATTCCT CTCCAACCTC CCACTTGGAC 1500 .......... .......... .......... .......... .......... .......... 367 ACAGCCACTT GATACAAATT CAATAATTTG AGATTCTCAA AATGAAATCC ATGGTAGAAG 1560 .......... .......... .......... .......... .......... .......... 367 AAAAGAAGAA GAGAGTTATG AAGCATGAAC TTAATTGCCT CTCAAGTTCA GCAACATTAG 1620 .......... .......... .......... .......... .......... .......... 367 AAGCACTAGT TTGCTAAATG AAGTTTGAAA GAAGAATAAA ATAGCGATTA CCTTAAAGGT 1680 .......... .......... .......... .......... .......... .......... 367 TGATTTTACT CATTTCTACT TAATAAACTT CTATCAAACA CATTCTTTTA ACATAAGCAA 1740 .......... .......... .......... .......... .......... 
.......... 367 TTATTTGCAG TTTGAGAATA GACACATATA CCTGCCATTA TTTTTTTATT CCAGTTTTGC 1800 .......... .......... .......... .......... .......... .......... 367 GAAATTATTT TTTACACATG ACCAATTATT ATTCGACTAC ATCATTAACT TTGAAACGTA 1860 .......... .......... .......... .......... .......... .......... 367 AATAATCAAC ATTTTATCTT TTTGCTAATT TATCCTCCCT AATTCCAAAT CATATTCAAC 1920 .......... .......... .......... .......... .......... .......... 367 AAGATTCAAT AATTATTATT GAATATGTGA TTGTCAATCT AATTATTATT CAACAAGATT 1980 .......... .......... .......... .......... .......... .......... 367 CAATAATTCC AATAAGAAGT AATGAAGTAT CAAGTGTTGT TGGATAAATT TGAAGATGAA 2040 .......... .......... .......... .......... .......... .......... 367 GACAGAAGCA AAATACTACA GAAGAAGATT CACAAGAGCG TTAACAATAA TTTGGTACAA 2100 .......... .......... .......... .......... .......... .......... 367 AGAATAGAAG AATGTTACAA CATCAAATAC TAAATTGTTC TCATAATTCA CAGAAACTCG 2160 .......... .......... .......... .......... .......... .......... 367 AAATGTATAG TTTCTCCGCC TCAATTTGTC TGATATTTTT CATTTCTTGA AATTTGAAAA 2220 .......... .......... .......... .......... .......... .......... 367 TCGTACCAAC TTTAATTAAT ATTTTCAAAT TACAACTTAT AATACTTCTT GTATAATTTT 2280 .......... .......... .......... .......... .......... .......... 367 TCGAATATTT GTTTTTTAAA TTTATAATAC GGAGTTAATT TGATTCAAAA AAAAAAAAAA 2340 .......... .......... .......... .......... .......... .......... 367 CAGTTGGAAA ATAGCCATGT TGAGTCTGTA CAAGACATTC TATTAGTGGC ACTTGCACAT 2400 .......... .......... .......... .......... .......... .......... 367 TTGGATTAAT TTCTTCAGTA GAGGTAGAAG TAAATACTCA AATTAAAATT CTTTTTCTTA 2460 .......... .......... .......... .......... .......... .......... 367 GAAACAGGAG GGAAATAGGA TGACATACAG CAAACTCGAA AGGCATAGCA GAGCGCGGCA 2520 .......... .......... .......... .......... .......... .......... 367 CGCCAAACGC CAACTAGCGA GGAGGTAAAT GGAGCAGGGA AGGGTTCTGC TTTTTGATGT 2580 .......... .......... .......... .......... .......... .......... 
367 TTGTGGGAAA GAAACAAACT TTTGTTGATT ACGTTATGCT TCAGTTTTTG AATGAAATTT 2640 .......... .......... .......... .......... .......... .......... 367 CATGAAACAC ATTCTGTATA TGAAGTTTTA ATATGAATGA CGTACCATTT TCTTCCTGCT 2700 .......... .......... .......... .......... .......... .......... 367 CGTCTCTCTA TCCTTCATTT TCTTTTCCTC TTACTGCAAA TACAGTCAGA CATATCAAAT 2760 .......... .......... .......... .......... .......... .......... 367 GGTGCACACA AAGAACAGAA AAAACGTTCC TTTTCTTCAC TTAAATAACG GGATATTATT 2820 .......... .......... .......... .......... .......... .......... 367 CCGGCCAACG ACTTGAAGCT CGTCTCCTCC CCTCATATCT TCAGCATATT CCTTAATTTT 2880 .......... .......... .......... .......... .......... .......... 367 CAAAGCAGAT CTTCTAGTTC AGTTAGGAAA TCCAATTTCG GGAACCAATA TTGCGCTGTT 2940 .......... .......... .......... .......... .......... .......... 367 GAACAATTCC ATGATTCCTT GAGATCAAAA CCAAGCACTT GAAGATTAAG AAGCCTTTTG 3000 .......... .......... .......... .......... .......... .......... 367 AAAATATCCT CTGTATCTTT CGAATAGGAA AGCACGAGTT TGTGTAAATG TCTCAAGTTC 3060 .......... .......... .......... .......... .......... .......... 367 TGTAACTTTG TGTCCTCTGC TATCAGTATT GATTCATCTG CATCCATATC AAAGAAAGAA 3120 || .......... .......... .......... .......... .......... ........AA 369 CAATTACTCA TGGCCAGGAG TCGCAACTTT ACAAGATCCC AAATTCTCGG TAATAGTATC 3180 ||||| | | || TAATTAGT-A TG........ .......... .......... .......... .......... 380 AAGGTTGATT CTTTGTTTTC AACCCACAAG ATTTCTAGAT TCCAGAGGTT TGAGAAAGAA 3240 .......... .......... .......... .......... .......... .......... 380 ATAGGCAGAG ATTTAACTTG TGTCCCAATT CTTAAGTACC TCAACATGCA TATTTCATTC 3300 .......... .......... .......... .......... .......... .......... 380 AACAAAAAAT CTTTCACCAT GATAAAAGAC GGTTCTAGGA ACAAATAATC ATGCACAAGA 3360 .......... .......... .......... .......... .......... .......... 380 TCATGAAGTT GGTAAGTCGG GTACTCACCT ATCTCATTGA AACAAATTAC CAAGCTACTG 3420 .......... .......... .......... .......... .......... .......... 
380 GAAATCAAAT CATCCACATA AATCTTCACC ACTTCTTCCG TCTTTTCCAC AAACCCTCCA 3480 .......... .......... .......... .......... .......... .......... 380 GCAACCCATA CAGCTTTCAA CTCATAGATT GTCAATGCTT TGTCCTTCGG CCTACTTGCA 3540 .......... .......... .......... .......... .......... .......... 380 AAGTACAGCA AGCATGGCTT CAGATGATGA GGTAAGTGGT CAAAACTTAT TTCAATAACT 3600 .......... .......... .......... .......... .......... .......... 380 TTCATCACTT CCACTTTATT CTTCAAAATA AAAGAAAGCA AATTATTTAC AACTTCAAGC 3660 .......... .......... .......... .......... .......... .......... 380 CACACACTCT TTTTCTTTTC CCTCCCAACA ATGACTCCAG CAATCAGATC CACAATCAAA 3720 .......... .......... .......... .......... .......... .......... 380 GGAAGCCCTT TACAATTTTC GGCTATTTCT TTACCAACAT CCAATAGTTC ATCAGGGCAA 3780 .......... .......... .......... .......... .......... .......... 380 CTCTCGTTTC CAAATGCCCT TTTCTCTATT AACTCCCAAC TTTCTTCTGA TCTTAGCAAT 3840 .......... .......... .......... .......... .......... .......... 380 CGAAGGTCAA GAGGAGCAGT GTAGAGCTTT CCATGCAAAG CCACTTCCTT TTCTCGAGTT 3900 .......... .......... .......... .......... .......... .......... 380 GTCAAAATAA TTCTACTTCC TTTCTGAGCT TCAGGAAAAG ATCTTGTTAA CTCATCCCAT 3960 .......... .......... .......... .......... .......... .......... 380 GTAGTCGTCT CCCACCCGTC ATCTAATACA ATAAGATACC TCTTTCCGTG TAGTTGTTTC 4020 .......... .......... .......... .......... .......... .......... 380 CGTAGCTTAT CAGCAACATC AATATTCTCA CTCAATTTTG AATTTGAGTC ACTAACTTGA 4080 .......... .......... .......... .......... .......... .......... 380 TTGAAAATTT TATCCAACAA CTTCTTCTCG TCATATCCTT GGTCGACCGT GCACCATGCA 4140 .......... .......... .......... .......... .......... .......... 380 CGAAGGTCGA AATGGCTAGA AACTGATTTA TCATTGTATA CTTTGTATGC CAAAGTAGTT 4200 .......... .......... .......... .......... .......... .......... 380 TTACCTGAAC CAGGCATACC AGTGATCCAA ATGACATCTA GATCTGCCGG TCCATTGGTG 4260 .......... .......... .......... .......... .......... .......... 
380 AGCTTTCTAA GTATCAAGTT TGTCTCCTCC TCAAAACCTA CAATTATTTT ATCAGTTGTC 4320 .......... .......... .......... .......... .......... .......... 380 AATGACTTTC TCTCAACCGG TTTCTTGAGA GAGTTCACAA CGATGAGACC TCTGTTCTTG 4380 .......... .......... .......... .......... .......... .......... 380 GGGATCTTCT CATGTAAATC AGAGACCTCG TCTTTGATAA GCTTCATCTT CTTTATGGTA 4440 .......... .......... .......... .......... .......... .......... 380 GTGGGAAGTG AGAAAATAAG ATGTAAGAGA CCATTATCTC GAAGTCGTGT ACAAGAGTTG 4500 .......... .......... .......... .......... .......... .......... 380 ATACATCTCT GGTAAGTGTT CCAACACGAT CCAAGAGATC AAAAAGTTTG TCATGATGAA 4560 .......... .......... .......... .......... .......... .......... 380 TAAAGTCCTT AAGCATATCA GAAAGGATAA TCAACAGGAA TTCCATCATG ACATGAATGT 4620 .......... .......... .......... .......... .......... .......... 380 TTCGAGCCCC TGAAGTGCTA GGGGTAATGA CAGTTATCAT ATGCTCTTGT AGATGAATAA 4680 .......... .......... .......... .......... .......... .......... 380 GATATTCCCG AAGAATGTCC GGAGAGGTTT CCAGGAGCTG CTTAATGAAG AGTCCATCTT 4740 .......... .......... .......... .......... .......... .......... 380 CTTCTGAAGT TGAAGCTTTC AAATTTGTAA AATATATGCG CATAAGCTCC AGTTCAGTTG 4800 .......... .......... .......... .......... .......... .......... 380 GAACAATCTT CAAGAGTAAA TGTGCTAACT TGAAGAGTCG AGAGTCTTTA ACATTCTGAT 4860 .......... .......... .......... .......... .......... .......... 380 CATTCTCATC TAGCTGGGCG AGTCGAGAGT CTTCATCAGT CTGATCTTCC CAAAGGAAGC 4920 .......... .......... .......... .......... .......... .......... 380 GTCCTACTCT CTCAGCCATC AGTTGAAACA GAGGTAAGAC ATTCTCAACC ATCTCATGCT 4980 .......... .......... .......... .......... .......... .......... 380 TAATGCGACC ATTCACTATC AATCCATGGA AATCTCTTAC GTTTCCACAT ACATTCTGAA 5040 .......... .......... .......... .......... .......... .......... 
380 GAACCTCATA TTGTTAGGAA CGAAAATAAG CAGGTGTAAA CGCGGAAGCT AGCAAAGCAA 5100 || |||||||| |||||||||| |||||||||| |||||||||| |||||||||| ........TA ATGTTAGGAT CGAAAATAAG CAGGTGTAAA CGCGGAAGCT AGCAAAGCAA 432 ACCTCGAAAG ACCACGAGTA AGAAGACAAC GAGAAATATA TCAAAAGACA CAAAGATTTA 5160 ||||| |||| || ||||||| |||||||||| |||||||||| | ||||||| |||||||||| ACCTCAAAAG ACTACGAGTA AGAAGACAAC GAGAAATATA CCCAAAGACA CAAAGATTTA 492 ACGTGGTTCG GTCAATCGAC CTACGTCCAC AAAGGAGATG AGCAATCCAC TATAAATGTG 5220 |||||||| ACGTGGTT.. .......... .......... .......... .......... .......... 500 AGAGTACAAA ATACAGAGGG AAACAACCTC AACCAATTCA CTCGGAATAC ATGAGAGGTT 5280 .......... .......... .......... .......... .......... .......... 500 CACAAATTCT CGCTCTAACC AAAACTCTCA AAGCCCTTAA AACTACATTG TGAATGCTAA 5340 .......... .......... .......... .......... .......... .......... 500 TTAAGTTAGA AGGAACATGC CTTTATTTAT AGAGTCCTAA ACCTTTTCCT ACCAAAAAAA 5400 .......... .......... .......... .......... .......... .......... 500 AGAATAGTCA ATTCAAAACC TTTTCCTAAA AGGAAAACCT ATTTATGGTA AGAAATCAGG 5460 .......... .......... .......... .......... .......... .......... 
500 GCAAATAAAA CCCAACACAT -ATTGTGTCA C 5490 |||||||||| | ||||| | |||| | | | GCAAATAAAA CTCAACATGT AATTGATTAA C 531 hqPGS_C06HBa0112G05.1-9+_SGN-E544264- (657 677,3119 3132,5049 5168,5461 5490) ******************************************************************************** EST sequence 16 -strand 550 n (File: SGN-E329886-) 1 AAGCTTGACG GTGTTGGAAA TATCTCTCCA TGGGCTTTCC AACCATAATA ATAACTAGCA 61 CTAATCTTGA ATCAGAAGTT ATGACCGTTT GAAAATGACC GAATCTCACT TTTTTAACTT 121 AAGAAATTTT CTTGATTTTT CCTTTTCTTT CAAAAAATAA TTGGTTTTAG TTTCTTTGCT 181 ATTTCAGGTT ACGAGATGTC ACAGTTTTAC TAATATTCAT GACCTCTATC ATGACCCAAT 241 GTGTTTGTTT TTTTACAGAG TAGGTTCTCT TACATTTGAA TCCATGAATC TTGTTTACTT 301 TGTTGCAGTT AGTCCAAACG ATAACTACTT TGAAAGTGCA TCTGATAAAT GGATGAATAT 361 ATGATCAGTG AAGAAGGCCA AGATTGTAAA AACTTATAAT GGCGATGACA GTACTTTTAA 421 AGTGTGTTGG AACAAAGAAG ATAACAAGGT TTCAACAGTT AGGATCGGAA ATAAGCAGGT 481 GTAACACGGA AGCTAGCAAA GCAAACCTTG AAAGACCACG AGTAAGAACA CAACGAGAAA 541 TATACCAAAA Predicted gene structure (within gDNA segment 1 to 5944): Exon 1 220 257 ( 38 n); cDNA 389 425 ( 37 n); score: 0.684 Intron 1 258 2090 (1833 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0) Exon 2 2091 2122 ( 32 n); cDNA 426 457 ( 32 n); score: 0.781 Intron 2 2123 5052 (2930 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.92) Exon 3 5053 5146 ( 94 n); cDNA 458 550 ( 93 n); score: 0.926 MATCH C06HBa0112G05.1-9+ SGN-E329886- 0.926 164 0.298 C PGS_C06HBa0112G05.1-9+_SGN-E329886- (220 257,2091 2122,5053 5146) Alignment (genomic DNA sequence = upper lines): AAAAAGGATA CATAGCGCTG GAAGAAGTTT GAAAATGTTC CAAGAAGTCA AATTTTAGCA 279 |||| ||| || ||| || || | ||| ||| ||| AAAACTTATA -ATGGCGATG ACAGTACTTT TAAAGTGT.. .......... .......... 425 TCAATTCAAG AAAATTGGCC GCTTTGTAAG TAATTTTTAT TTTAATCCTT AAAATTTTCT 339 .......... .......... .......... .......... .......... .......... 425 TTTTTTTGTT TTCAAATTGG ATATGTGATC TACGTGTAAA GTGTGTAATG AGAGATTATT 399 .......... .......... .......... .......... .......... .......... 
425 GAGGATATGT TATATCGACA TATTCATATT ATTACATGTA ATATTATCTG AATTATTGCA 459 .......... .......... .......... .......... .......... .......... 425 AATTTAACAT TCAAATGTTA TGGATTTTAT CTAATGAATA TGAATAAAAT AAATATTATT 519 .......... .......... .......... .......... .......... .......... 425 ATTTGAACAT TTTCTATATA ATAATCCAAT AATAATAAAC GTGCAACGCA CGTTCCCAAT 579 .......... .......... .......... .......... .......... .......... 425 GACTAGTATA TATATATATA TATATATATA GTAAAGTAAT AATGTGAGAT GGTGATACTA 639 .......... .......... .......... .......... .......... .......... 425 TGATGATATG ATAGAAGATA TATAGATGAT AGTCAATTAT ATGCTTACTC ATAAAGAAGG 699 .......... .......... .......... .......... .......... .......... 425 TAAACATATG AGAATCCACC TGGGGTTAAG ACTGAATATA TAGGATTCTA AACACTCCAA 759 .......... .......... .......... .......... .......... .......... 425 ATCTTGACCC AACGACTACC CAAATTCTTA AGCTAGGACT AACACCTAGC AAACCACAAC 819 .......... .......... .......... .......... .......... .......... 425 CTGCTAGGAA GACCATTGCA TTCAATCACC AAACTCAAAT TAAAGTCCGA ATTCTCCGAG 879 .......... .......... .......... .......... .......... .......... 425 ACTTTGAACA AGATGTTACC TTTTCACCGT TGTTTGAGAA ATAATGTTGA TCATGTAATT 939 .......... .......... .......... .......... .......... .......... 425 TGGTAAAAGA AAGAACGAAA TAGGTAGATC AGTTAGGACT TTGATCAAAC ACATAATGTA 999 .......... .......... .......... .......... .......... .......... 425 GAATCTGTTG GTGCTGTCAT TTAGTCTACA GAACATGTTT TTAATAACCT TTTGCATAAT 1059 .......... .......... .......... .......... .......... .......... 425 GTAATTAGTC TTCAAAAAGA GTTTCAACAA ATTAAATAAA AAGTAACCAT TGCGCTTTTA 1119 .......... .......... .......... .......... .......... .......... 425 ATCAGAAAGT TTGAGGCTTG AGTGAAGACA AACAAAATCG GATCAAGTTA GGACGGTTCC 1179 .......... .......... .......... .......... .......... .......... 425 TTGCAATAGC AACCAAGGCG GTATTTGTCA TTTGGCAGCA CAAGTATAGA AACGAGTGGA 1239 .......... .......... .......... .......... .......... .......... 
425 GCAACAAGGC CTCTCTCTGT CAAGGACACA TCCGGTTCAC TGAAATAACG GGATATTCTT 1299 .......... .......... .......... .......... .......... .......... 425 CCGGCCAACT ACTTGAAGCT CGTCTCCTCC CCTCATATCT TCAGCATAAA TCTTAATCTT 1359 .......... .......... .......... .......... .......... .......... 425 CTTAGCGGAA TCTTCAAGTT GAGGGCTCTC TACAAGTTTG AGTGAAATAA TATCCCCAAA 1419 .......... .......... .......... .......... .......... .......... 425 ACTAGGCGGA ATCTCCTCAA GATCACAACA TTCATCCAAT TTTAATTTCT CAAGGATTCC 1479 .......... .......... .......... .......... .......... .......... 425 TCTCCAACCT CCCACTTGGA CACAGCCACT TGATACAAAT TCAATAATTT GAGATTCTCA 1539 .......... .......... .......... .......... .......... .......... 425 AAATGAAATC CATGGTAGAA GAAAAGAAGA AGAGAGTTAT GAAGCATGAA CTTAATTGCC 1599 .......... .......... .......... .......... .......... .......... 425 TCTCAAGTTC AGCAACATTA GAAGCACTAG TTTGCTAAAT GAAGTTTGAA AGAAGAATAA 1659 .......... .......... .......... .......... .......... .......... 425 AATAGCGATT ACCTTAAAGG TTGATTTTAC TCATTTCTAC TTAATAAACT TCTATCAAAC 1719 .......... .......... .......... .......... .......... .......... 425 ACATTCTTTT AACATAAGCA ATTATTTGCA GTTTGAGAAT AGACACATAT ACCTGCCATT 1779 .......... .......... .......... .......... .......... .......... 425 ATTTTTTTAT TCCAGTTTTG CGAAATTATT TTTTACACAT GACCAATTAT TATTCGACTA 1839 .......... .......... .......... .......... .......... .......... 425 CATCATTAAC TTTGAAACGT AAATAATCAA CATTTTATCT TTTTGCTAAT TTATCCTCCC 1899 .......... .......... .......... .......... .......... .......... 425 TAATTCCAAA TCATATTCAA CAAGATTCAA TAATTATTAT TGAATATGTG ATTGTCAATC 1959 .......... .......... .......... .......... .......... .......... 425 TAATTATTAT TCAACAAGAT TCAATAATTC CAATAAGAAG TAATGAAGTA TCAAGTGTTG 2019 .......... .......... .......... .......... .......... .......... 425 TTGGATAAAT TTGAAGATGA AGACAGAAGC AAAATACTAC AGAAGAAGAT TCACAAGAGC 2079 .......... .......... .......... .......... .......... .......... 
425 GTTAACAATA ATTTGGTACA AAGAATAGAA GAATGTTACA ACATCAAATA CTAAATTGTT 2139 |||| ||| ||||| | || || ||| || ||| .......... .GTTGGAACA AAGAAGATAA CAAGGTTTCA ACA....... .......... 457 CTCATAATTC ACAGAAACTC GAAATGTATA GTTTCTCCGC CTCAATTTGT CTGATATTTT 2199 .......... .......... .......... .......... .......... .......... 457 TCATTTCTTG AAATTTGAAA ATCGTACCAA CTTTAATTAA TATTTTCAAA TTACAACTTA 2259 .......... .......... .......... .......... .......... .......... 457 TAATACTTCT TGTATAATTT TTCGAATATT TGTTTTTTAA ATTTATAATA CGGAGTTAAT 2319 .......... .......... .......... .......... .......... .......... 457 TTGATTCAAA AAAAAAAAAA ACAGTTGGAA AATAGCCATG TTGAGTCTGT ACAAGACATT 2379 .......... .......... .......... .......... .......... .......... 457 CTATTAGTGG CACTTGCACA TTTGGATTAA TTTCTTCAGT AGAGGTAGAA GTAAATACTC 2439 .......... .......... .......... .......... .......... .......... 457 AAATTAAAAT TCTTTTTCTT AGAAACAGGA GGGAAATAGG ATGACATACA GCAAACTCGA 2499 .......... .......... .......... .......... .......... .......... 457 AAGGCATAGC AGAGCGCGGC ACGCCAAACG CCAACTAGCG AGGAGGTAAA TGGAGCAGGG 2559 .......... .......... .......... .......... .......... .......... 457 AAGGGTTCTG CTTTTTGATG TTTGTGGGAA AGAAACAAAC TTTTGTTGAT TACGTTATGC 2619 .......... .......... .......... .......... .......... .......... 457 TTCAGTTTTT GAATGAAATT TCATGAAACA CATTCTGTAT ATGAAGTTTT AATATGAATG 2679 .......... .......... .......... .......... .......... .......... 457 ACGTACCATT TTCTTCCTGC TCGTCTCTCT ATCCTTCATT TTCTTTTCCT CTTACTGCAA 2739 .......... .......... .......... .......... .......... .......... 457 ATACAGTCAG ACATATCAAA TGGTGCACAC AAAGAACAGA AAAAACGTTC CTTTTCTTCA 2799 .......... .......... .......... .......... .......... .......... 457 CTTAAATAAC GGGATATTAT TCCGGCCAAC GACTTGAAGC TCGTCTCCTC CCCTCATATC 2859 .......... .......... .......... .......... .......... .......... 457 TTCAGCATAT TCCTTAATTT TCAAAGCAGA TCTTCTAGTT CAGTTAGGAA ATCCAATTTC 2919 .......... .......... .......... .......... .......... 
.......... 457 GGGAACCAAT ATTGCGCTGT TGAACAATTC CATGATTCCT TGAGATCAAA ACCAAGCACT 2979 .......... .......... .......... .......... .......... .......... 457 TGAAGATTAA GAAGCCTTTT GAAAATATCC TCTGTATCTT TCGAATAGGA AAGCACGAGT 3039 .......... .......... .......... .......... .......... .......... 457 TTGTGTAAAT GTCTCAAGTT CTGTAACTTT GTGTCCTCTG CTATCAGTAT TGATTCATCT 3099 .......... .......... .......... .......... .......... .......... 457 GCATCCATAT CAAAGAAAGA ACAATTACTC ATGGCCAGGA GTCGCAACTT TACAAGATCC 3159 .......... .......... .......... .......... .......... .......... 457 CAAATTCTCG GTAATAGTAT CAAGGTTGAT TCTTTGTTTT CAACCCACAA GATTTCTAGA 3219 .......... .......... .......... .......... .......... .......... 457 TTCCAGAGGT TTGAGAAAGA AATAGGCAGA GATTTAACTT GTGTCCCAAT TCTTAAGTAC 3279 .......... .......... .......... .......... .......... .......... 457 CTCAACATGC ATATTTCATT CAACAAAAAA TCTTTCACCA TGATAAAAGA CGGTTCTAGG 3339 .......... .......... .......... .......... .......... .......... 457 AACAAATAAT CATGCACAAG ATCATGAAGT TGGTAAGTCG GGTACTCACC TATCTCATTG 3399 .......... .......... .......... .......... .......... .......... 457 AAACAAATTA CCAAGCTACT GGAAATCAAA TCATCCACAT AAATCTTCAC CACTTCTTCC 3459 .......... .......... .......... .......... .......... .......... 457 GTCTTTTCCA CAAACCCTCC AGCAACCCAT ACAGCTTTCA ACTCATAGAT TGTCAATGCT 3519 .......... .......... .......... .......... .......... .......... 457 TTGTCCTTCG GCCTACTTGC AAAGTACAGC AAGCATGGCT TCAGATGATG AGGTAAGTGG 3579 .......... .......... .......... .......... .......... .......... 457 TCAAAACTTA TTTCAATAAC TTTCATCACT TCCACTTTAT TCTTCAAAAT AAAAGAAAGC 3639 .......... .......... .......... .......... .......... .......... 457 AAATTATTTA CAACTTCAAG CCACACACTC TTTTTCTTTT CCCTCCCAAC AATGACTCCA 3699 .......... .......... .......... .......... .......... .......... 457 GCAATCAGAT CCACAATCAA AGGAAGCCCT TTACAATTTT CGGCTATTTC TTTACCAACA 3759 .......... .......... .......... .......... .......... .......... 
457 TCCAATAGTT CATCAGGGCA ACTCTCGTTT CCAAATGCCC TTTTCTCTAT TAACTCCCAA 3819 .......... .......... .......... .......... .......... .......... 457 CTTTCTTCTG ATCTTAGCAA TCGAAGGTCA AGAGGAGCAG TGTAGAGCTT TCCATGCAAA 3879 .......... .......... .......... .......... .......... .......... 457 GCCACTTCCT TTTCTCGAGT TGTCAAAATA ATTCTACTTC CTTTCTGAGC TTCAGGAAAA 3939 .......... .......... .......... .......... .......... .......... 457 GATCTTGTTA ACTCATCCCA TGTAGTCGTC TCCCACCCGT CATCTAATAC AATAAGATAC 3999 .......... .......... .......... .......... .......... .......... 457 CTCTTTCCGT GTAGTTGTTT CCGTAGCTTA TCAGCAACAT CAATATTCTC ACTCAATTTT 4059 .......... .......... .......... .......... .......... .......... 457 GAATTTGAGT CACTAACTTG ATTGAAAATT TTATCCAACA ACTTCTTCTC GTCATATCCT 4119 .......... .......... .......... .......... .......... .......... 457 TGGTCGACCG TGCACCATGC ACGAAGGTCG AAATGGCTAG AAACTGATTT ATCATTGTAT 4179 .......... .......... .......... .......... .......... .......... 457 ACTTTGTATG CCAAAGTAGT TTTACCTGAA CCAGGCATAC CAGTGATCCA AATGACATCT 4239 .......... .......... .......... .......... .......... .......... 457 AGATCTGCCG GTCCATTGGT GAGCTTTCTA AGTATCAAGT TTGTCTCCTC CTCAAAACCT 4299 .......... .......... .......... .......... .......... .......... 457 ACAATTATTT TATCAGTTGT CAATGACTTT CTCTCAACCG GTTTCTTGAG AGAGTTCACA 4359 .......... .......... .......... .......... .......... .......... 457 ACGATGAGAC CTCTGTTCTT GGGGATCTTC TCATGTAAAT CAGAGACCTC GTCTTTGATA 4419 .......... .......... .......... .......... .......... .......... 457 AGCTTCATCT TCTTTATGGT AGTGGGAAGT GAGAAAATAA GATGTAAGAG ACCATTATCT 4479 .......... .......... .......... .......... .......... .......... 457 CGAAGTCGTG TACAAGAGTT GATACATCTC TGGTAAGTGT TCCAACACGA TCCAAGAGAT 4539 .......... .......... .......... .......... .......... .......... 457 CAAAAAGTTT GTCATGATGA ATAAAGTCCT TAAGCATATC AGAAAGGATA ATCAACAGGA 4599 .......... .......... .......... .......... .......... .......... 
457 ATTCCATCAT GACATGAATG TTTCGAGCCC CTGAAGTGCT AGGGGTAATG ACAGTTATCA 4659 .......... .......... .......... .......... .......... .......... 457 TATGCTCTTG TAGATGAATA AGATATTCCC GAAGAATGTC CGGAGAGGTT TCCAGGAGCT 4719 .......... .......... .......... .......... .......... .......... 457 GCTTAATGAA GAGTCCATCT TCTTCTGAAG TTGAAGCTTT CAAATTTGTA AAATATATGC 4779 .......... .......... .......... .......... .......... .......... 457 GCATAAGCTC CAGTTCAGTT GGAACAATCT TCAAGAGTAA ATGTGCTAAC TTGAAGAGTC 4839 .......... .......... .......... .......... .......... .......... 457 GAGAGTCTTT AACATTCTGA TCATTCTCAT CTAGCTGGGC GAGTCGAGAG TCTTCATCAG 4899 .......... .......... .......... .......... .......... .......... 457 TCTGATCTTC CCAAAGGAAG CGTCCTACTC TCTCAGCCAT CAGTTGAAAC AGAGGTAAGA 4959 .......... .......... .......... .......... .......... .......... 457 CATTCTCAAC CATCTCATGC TTAATGCGAC CATTCACTAT CAATCCATGG AAATCTCTTA 5019 .......... .......... .......... .......... .......... .......... 457 CGTTTCCACA TACATTCTGA AGAACCTCAT ATTGTTAGGA ACGAAAATAA GCAGGTGTAA 5079 ||||||| || |||||| ||||||||| .......... .......... .......... 
...GTTAGGA TCGGAAATAA GCAGGTGTA- 483 ACGCGGAAGC TAGCAAAGCA AACCTCGAAA GACCACGAGT AAGAAGACAA CGAGAAATAT 5139 || ||||||| |||||||||| ||||| |||| |||||||||| ||||| |||| |||||||||| ACACGGAAGC TAGCAAAGCA AACCTTGAAA GACCACGAGT AAGAACACAA CGAGAAATAT 543 ATCAAAA 5146 | ||||| ACCAAAA 550 hqPGS_C06HBa0112G05.1-9+_SGN-E329886- (2091 2122,5053 5146) ******************************************************************************** EST sequence 13 -strand 780 n (File: SGN-E543331-) 1 CAACTACAAT AACAATGTTT TCGCGTTTAA ATTGTTAGAA CCGGAAATAA GCAGGGTAAA 61 CGCGGAAAGC TAGCAAAGCA AACCTCAAAA GACCACGAGT ATGAAAGACA ACGAGAAATA 121 TACCAAAAGA CACAAAGATT TAACGTGGTT CGGTCAATCG ACCTACGTCC ACAATGGAGA 181 TGAGCAATCC ACTATTAATA TGAGAGTACA AAATACAGAG AGAAACAACC TCAACCAATT 241 CACTAAGAAT ACACGGGAAC ATTTTCCTAC CAGAAAAAGA ATTAGTCAAT TCAAAACTTT 301 TTTCCTAAAA GAAAAACCTA TTTATGATAA GAAATTTAGG ACAAAATAAA ATCCAACATA 361 AATGATGTAA TTATACAATT CATATCATCA GACAATGGTT CATGGATCCA AAATCTTGAA 421 TGATTCCATT TTGGTTTGAG CTCAAACATC ACCATCTATA TAACTAAAAC TCGTCGAAAT 481 TGACTCAACC TGCTCATCTG ACACTTGAAA ATTCGAGACT CAAATTTAAT CATTAAGATG 541 AATTTTGTGA ATTTAAATAC CTTTTGCACG AAATGTACGC TGATAAAGCA TTTTAAGTGT 601 TTTTGAGGAG TAGCAGACGA AGCAATAGCG GATGGTAATG GTGATGGACA ATCATTGATC 661 CTTAACGAAG TTGCTCGAAT TCCTTCAGAA ATTTCCTCCA TAGTAAATTG TGATTTGATT 721 AATTATTGGC TCAAATAAAC AAGAAAATGG ATGAACAAGA ACTAATTAAG CATGTTTATG Predicted gene structure (within gDNA segment 3816 to 11222): Exon 1 5061 5277 ( 217 n); cDNA 42 259 ( 218 n); score: 0.912 Intron 1 5278 5381 ( 104 n); Pd: 0.848 (s: 0.88), Pa: 0.000 (s: 0.83) Exon 2 5382 5495 ( 114 n); cDNA 260 376 ( 117 n); score: 0.768 Intron 2 5496 8037 (2542 n); Pd: 0.488 (s: 0.65), Pa: 0.000 (s: 0) Exon 3 8038 8062 ( 25 n); cDNA 377 400 ( 24 n); score: 0.720 MATCH C06HBa0112G05.1-9+ SGN-E543331- 0.863 356 0.456 C PGS_C06HBa0112G05.1-9+_SGN-E543331- (5061 5277,5382 5495,8038 8062) Alignment (genomic DNA sequence = upper lines): CGAAAATAAG CAGGTGTAAA CGCGG-AAGC TAGCAAAGCA AACCTCGAAA GACCACGAGT 
5119 || ||||||| |||| ||||| ||||| |||| |||||||||| |||||| ||| |||||||||| CGGAAATAAG CAGG-GTAAA CGCGGAAAGC TAGCAAAGCA AACCTCAAAA GACCACGAGT 100 AAG-AAGACA ACGAGAAATA TATCAAAAGA CACAAAGATT TAACGTGGTT CGGTCAATCG 5178 | | |||||| |||||||||| || ||||||| |||||||||| |||||||||| |||||||||| ATGAAAGACA ACGAGAAATA TACCAAAAGA CACAAAGATT TAACGTGGTT CGGTCAATCG 160 ACCTACGTCC ACAAAGGAGA TGAGCAATCC ACTATAAATG TGAGAGTACA AAATACAGAG 5238 |||||||||| |||| ||||| |||||||||| ||||| ||| |||||||||| |||||||||| ACCTACGTCC ACAATGGAGA TGAGCAATCC ACTATTAATA TGAGAGTACA AAATACAGAG 220 GGAAACAACC TCAACCAATT CACTCGGAAT ACATGAGAGG TTCACAAATT CTCGCTCTAA 5298 ||||||||| |||||||||| |||| |||| ||| | || AGAAACAACC TCAACCAATT CACTAAGAAT ACACGGGAA. .......... .......... 259 CCAAAACTCT CAAAGCCCTT AAAACTACAT TGTGAATGCT AATTAAGTTA GAAGGAACAT 5358 .......... .......... .......... .......... .......... .......... 259 GCCTTTATTT ATAGAGTCCT AAACCTTTTC CTACCAAAAA AAAGAATAGT CAATTCAAAA 5418 | ||||| |||||| ||| || | |||| |||||||||| .......... .......... ...CATTTTC CTACCAGAAA AAGAATTAGT CAATTCAAAA 296 C-CTTTTCCT AAAAGGAAAA CCTATTTATG GTAAGAAA-T CAGGGC-AAA TAAAACCCAA 5475 | ||||||| ||||| |||| |||||||||| ||||||| | ||| | ||| ||||| |||| CTTTTTTCCT AAAAGAAAAA CCTATTTATG ATAAGAAATT TAGGACAAAA TAAAATCCAA 356 CACATATTGT GTCACTCTAG GTAACATCTT TTCAGCATGA TGCCTCGCTA GATGATAGAG 5535 || | || | || | | || CATAAATGAT GTAATTATAC .......... .......... .......... .......... 376 ATTCAAGAGG AGGAAGTCCA ATTGCTCATC CATCATGATA GCATCTGATT TATAAGAACG 5595 .......... .......... .......... .......... .......... .......... 376 ATGATACAAG CTGATACAGT CATCCATATT ACTGGTGAGT CTAGTAAGGA CATCATCATC 5655 .......... .......... .......... .......... .......... .......... 376 CAAAATTGGT TGAAGCAGAT TCTCATCCTC TTGTCATTAT ATCTTCAAAC TGATCCAAAT 5715 .......... .......... .......... .......... .......... .......... 376 CGGAATAAGA AAGCTGGACA TATGTACAAA TAAATGCCAG CTCCAATTTT AGCTTTTCAA 5775 .......... .......... .......... .......... .......... .......... 
376 TTAGATCCAC ATCAACATCC TTTTGATCTT GTTCATTCTT TAATCTCTCC AGAACATTGG 5835 .......... .......... .......... .......... .......... .......... 376 CAACGTCCGT GCGAAGAGCA GAAAATGACA TCTGCCAAAA GACCAATCAA ATTTACAAAG 5895 .......... .......... .......... .......... .......... .......... 376 CCACATTTCT ACCCTGCAAT AACTGCTAAC ACATAGTTCA GTGACTTCAA TGATAGAACA 5955 .......... .......... .......... .......... .......... .......... 376 CATAAGGTTT AGTGAATTTA GTGGACAATA AATAGTTTTT AGAATATTAC ATATACACAA 6015 .......... .......... .......... .......... .......... .......... 376 GTAAAGTTCA TTGATCAAGG GTGAAATTAG CTCGAGAATC TACCTGCATT ATTTTATTTT 6075 .......... .......... .......... .......... .......... .......... 376 GTTTACCTTC AAATTTTGAT TATATACTAT AATAAAATAA TGATCCAAGT AAATATAAGA 6135 .......... .......... .......... .......... .......... .......... 376 TACCTGTAGG TTGCTTCTTT CCATGACAGT AATTCTCCGG ATCTGATGTA GATAACAAGT 6195 .......... .......... .......... .......... .......... .......... 376 TTAAACACAG AGTTGTAATT AGAGACGATA GGAACCTTTT CAGTTGATGA AATATCTTAT 6255 .......... .......... .......... .......... .......... .......... 376 ATGTATGTTC TTTGTTTGTG TGGAGGAGAA AATACCAAGA AAATGACATA ATAAATAGGT 6315 .......... .......... .......... .......... .......... .......... 376 TTGAACAAGT CAGTCAGTCA ACTGACATTG GGACTGTACA GGCATTCATT CTCAACTAAA 6375 .......... .......... .......... .......... .......... .......... 376 ATAATTAATA GCATACTATA TGGCTTGTTT ACCCAGAGTA ATCGAAATTT AATTTGTACA 6435 .......... .......... .......... .......... .......... .......... 376 CAAAGTATAA TGATACTAAC TCGCTATATA TTTAGTAGAT AATGTGCTTG ATTAAAATCT 6495 .......... .......... .......... .......... .......... .......... 376 TACTAGAAAA ATCCCCTGGG AAAAAAGTGA TCTCTCGTAA CTGCTTTGGT TGCTTGTTAT 6555 .......... .......... .......... .......... .......... .......... 376 TTTACTAACG TCGTCTACTT AATAGTTCCG TCGGGGCTTT GATGAAACAA ATTGTAGACA 6615 .......... .......... .......... .......... .......... .......... 
376 TCTTTTTAGA ACAGTAAAAG GTGTAAGAAC CTGCCAATTT GTTTGACACT TCTATGTATG 6675 .......... .......... .......... .......... .......... .......... 376 AAGACTCTGG AGATTTGGAC GTGAACTGTC ACATTTCTCC GCTCAAATTT TGCCAGAATA 6735 .......... .......... .......... .......... .......... .......... 376 TTCCAAAATT TTAGAAACTC CAAATACTTG TTTTCATAAT TTTAACTTTG TACACTTACA 6795 .......... .......... .......... .......... .......... .......... 376 AAAGAAATTC AACATTTTTC CAAGTTATAT TTATTCTTTC TAGGCATAGT ACATTAAAAT 6855 .......... .......... .......... .......... .......... .......... 376 GACATTTAGC TTGACTTCAG TGGATAACTG TGACCTTTAA CTTTGCTTAT CTGCCATAAA 6915 .......... .......... .......... .......... .......... .......... 376 GGAAAGAGAT AAGAAAGTTA ATCAAACTTT CAATACTCTC AGTTTGAAGT CAATGAAGTG 6975 .......... .......... .......... .......... .......... .......... 376 ATTGAGAGAA CTCAAGTTCA AATTTCAGCG GAGTTATCTG ATACCTGTTG CTGGTGCGTG 7035 .......... .......... .......... .......... .......... .......... 376 TTTTCAATAG TCATGGCTCG ATACATTAAC TGTTTTTTTT TCCTGAGAAG GGATACATGA 7095 .......... .......... .......... .......... .......... .......... 376 ACTGTGTTTG TGATAGAATA GATAAATTGA TTGTTAGTTG TAAAACCAGA AGTTTCCAAG 7155 .......... .......... .......... .......... .......... .......... 376 AGAAAGAACA ATAATCCTAA TATATTAATT GTGTTGCTGA ATCTGATAAA TGTCAGGGAT 7215 .......... .......... .......... .......... .......... .......... 376 AAATAAACAG TGTAATGAAC TTCAGTGTTT ACTGAGATAA CATAGTTAAA CAATTCATGT 7275 .......... .......... .......... .......... .......... .......... 376 CGAGTGAACT CTTTGAATAA TTTATTGGTC TGCCATATGA AGAATAACTT AATGTTGCAC 7335 .......... .......... .......... .......... .......... .......... 376 AAATCCTACA TTTTGACAAT TCATGTTTAT CTTTCCTAGT TAGACGCTTC ATTAGCTTTT 7395 .......... .......... .......... .......... .......... .......... 376 GAGCTCTTCT TGTCCAGTGG CTCTTTTGCA TAATTGGTTA TCCAGCTTAC GCTTTAATTA 7455 .......... .......... .......... .......... .......... .......... 
376 AGGCACTTGT GATGGAGGAG ATTGCCAACA CTAGTAAGGA ACTTCAAGAG CTTAGGGTGA 7515 .......... .......... .......... .......... .......... .......... 376 TTCTACAGAT TCTTCGAGAC ATTCTGTGTC TACAGATCTT TTGGGGCCAC TTTGTCTGAA 7575 .......... .......... .......... .......... .......... .......... 376 ATGGTGTCGC TAACAAGTGC CTTGTTTCGT CTTAAAATTT GGAGTTATCG TCTCCATATA 7635 .......... .......... .......... .......... .......... .......... 376 TCCTTTTGTT TATGCTGGAG TAAGGTCTAT ATAGTTTCCC CTTTGTTTGT TAATTAATTC 7695 .......... .......... .......... .......... .......... .......... 376 TTTTTTAGCT GTTAAATGTG CCCTCAATGT TCACTTATCT CTATATACTT GTATTAATAC 7755 .......... .......... .......... .......... .......... .......... 376 ATTATATTCT TATTTGGAAC TGAAATTAAC AGGAGAGCTT CTAGAGCTTC AGGGATTAAA 7815 .......... .......... .......... .......... .......... .......... 376 AAAAGTTAAG GTACTCTTTC ATTTGTTTAG ATCACTATAA TTTCTAAGTG GTTAGGCCCC 7875 .......... .......... .......... .......... .......... .......... 376 TAAGGAAATA GTTGAGGCCT CAAGATCATC CGATGCTTAT GCTTAAAATC AACTATAATA 7935 .......... .......... .......... .......... .......... .......... 376 GTACTAAATA GAGATAAATA GACATTTACT CTCAGTGGAT GTTCTAAGTG TTGGTGATGA 7995 .......... .......... .......... .......... .......... .......... 376 TATTTTCTTC AGAATAATAC TAGGATTGCA CAAGCATAGT TGCATACATT TCAAACAGAG 8055 || ||| || | |||| .......... .......... .......... .......... 
..AATTCATA TC-ATCAGAC 393 AATGTTT 8062 |||| || AATGGTT 400 hqPGS_C06HBa0112G05.1-9+_SGN-E543331- (5061 5277,5382 5495,8038 8062) ******************************************************************************** EST sequence 3 +strand 595 n (File: SGN-E271645+) 1 GATTAGTCAA TCCAAAATTT TTCCCTAAAA GGAAAACCTA TTTATGTTAA GAAATTATGG 61 TAAATAAAAT CCAACAAATC TCCTCCCCCT TGGCCTGTAT TTCTGACCAA AATAAATTTC 121 TCCACATTCT TCATTTAATC TTCAACAACT TGCTTCTCTT CTTCTTAATC TCCTTTGTAA 181 AATTTATGTC TCAACCATAG AAAACCTCTC TGAAACAATT TCTCCAACAA AATCTTCATT 241 ACTGTCAAAA AGATTGCGGC TAGAACCTAC CACCTGTCAA GATGAACCAC CACCCTCTTT 301 CTAACCTGGT CCAATAATCG ATTATCGAAC CACTGAACCT GACCTCTGTC ATTAAATGGC 361 TCTAATACCC ACTTGTTAGG ATCGAAATAA GCAGGTGTAA ATGCGGAAGC TAGCAAAGCA 421 AACTTCGAAA GACCACGAGT AAGAAGACAA CGAGAAATAT ACCAAAAGAC ACCAAAGATT 481 TAACGTGGTT CGGTCAATCG ACCTACGTTC ACAAAGGAGA TGAGCAATCC ACTATAAATA 541 TGAGAGTACA ATATACAGAG AGAAACAACA TCAACCAATT CATTCGGAAT ATATG Predicted gene structure (within gDNA segment 624 to 6387): Exon 1 3339 3371 ( 33 n); cDNA 327 359 ( 33 n); score: 0.576 Intron 1 3372 5035 (1664 n); Pd: 0.742 (s: 0), Pa: 0.000 (s: 0.80) Exon 2 5036 5273 ( 238 n); cDNA 360 595 ( 236 n); score: 0.910 MATCH C06HBa0112G05.1-9+ SGN-E271645+ 0.910 271 0.455 C PGS_C06HBa0112G05.1-9+_SGN-E271645+ (3339 3371,5036 5273) Alignment (genomic DNA sequence = upper lines): GAACAAATAA TCATGCACAA GATCATGAAG TTGGTAAGTC GGGTACTCAC CTATCTCATT 3398 |||| | | | | || | |||| || | | GAACCACTGA ACCTGACCTC TGTCATTAAA TGG....... .......... .......... 359 GAAACAAATT ACCAAGCTAC TGGAAATCAA ATCATCCACA TAAATCTTCA CCACTTCTTC 3458 .......... .......... .......... .......... .......... .......... 359 CGTCTTTTCC ACAAACCCTC CAGCAACCCA TACAGCTTTC AACTCATAGA TTGTCAATGC 3518 .......... .......... .......... .......... .......... .......... 359 TTTGTCCTTC GGCCTACTTG CAAAGTACAG CAAGCATGGC TTCAGATGAT GAGGTAAGTG 3578 .......... .......... .......... .......... .......... .......... 
359 GTCAAAACTT ATTTCAATAA CTTTCATCAC TTCCACTTTA TTCTTCAAAA TAAAAGAAAG 3638 .......... .......... .......... .......... .......... .......... 359 CAAATTATTT ACAACTTCAA GCCACACACT CTTTTTCTTT TCCCTCCCAA CAATGACTCC 3698 .......... .......... .......... .......... .......... .......... 359 AGCAATCAGA TCCACAATCA AAGGAAGCCC TTTACAATTT TCGGCTATTT CTTTACCAAC 3758 .......... .......... .......... .......... .......... .......... 359 ATCCAATAGT TCATCAGGGC AACTCTCGTT TCCAAATGCC CTTTTCTCTA TTAACTCCCA 3818 .......... .......... .......... .......... .......... .......... 359 ACTTTCTTCT GATCTTAGCA ATCGAAGGTC AAGAGGAGCA GTGTAGAGCT TTCCATGCAA 3878 .......... .......... .......... .......... .......... .......... 359 AGCCACTTCC TTTTCTCGAG TTGTCAAAAT AATTCTACTT CCTTTCTGAG CTTCAGGAAA 3938 .......... .......... .......... .......... .......... .......... 359 AGATCTTGTT AACTCATCCC ATGTAGTCGT CTCCCACCCG TCATCTAATA CAATAAGATA 3998 .......... .......... .......... .......... .......... .......... 359 CCTCTTTCCG TGTAGTTGTT TCCGTAGCTT ATCAGCAACA TCAATATTCT CACTCAATTT 4058 .......... .......... .......... .......... .......... .......... 359 TGAATTTGAG TCACTAACTT GATTGAAAAT TTTATCCAAC AACTTCTTCT CGTCATATCC 4118 .......... .......... .......... .......... .......... .......... 359 TTGGTCGACC GTGCACCATG CACGAAGGTC GAAATGGCTA GAAACTGATT TATCATTGTA 4178 .......... .......... .......... .......... .......... .......... 359 TACTTTGTAT GCCAAAGTAG TTTTACCTGA ACCAGGCATA CCAGTGATCC AAATGACATC 4238 .......... .......... .......... .......... .......... .......... 359 TAGATCTGCC GGTCCATTGG TGAGCTTTCT AAGTATCAAG TTTGTCTCCT CCTCAAAACC 4298 .......... .......... .......... .......... .......... .......... 359 TACAATTATT TTATCAGTTG TCAATGACTT TCTCTCAACC GGTTTCTTGA GAGAGTTCAC 4358 .......... .......... .......... .......... .......... .......... 359 AACGATGAGA CCTCTGTTCT TGGGGATCTT CTCATGTAAA TCAGAGACCT CGTCTTTGAT 4418 .......... .......... .......... .......... .......... .......... 
359 AAGCTTCATC TTCTTTATGG TAGTGGGAAG TGAGAAAATA AGATGTAAGA GACCATTATC 4478 .......... .......... .......... .......... .......... .......... 359 TCGAAGTCGT GTACAAGAGT TGATACATCT CTGGTAAGTG TTCCAACACG ATCCAAGAGA 4538 .......... .......... .......... .......... .......... .......... 359 TCAAAAAGTT TGTCATGATG AATAAAGTCC TTAAGCATAT CAGAAAGGAT AATCAACAGG 4598 .......... .......... .......... .......... .......... .......... 359 AATTCCATCA TGACATGAAT GTTTCGAGCC CCTGAAGTGC TAGGGGTAAT GACAGTTATC 4658 .......... .......... .......... .......... .......... .......... 359 ATATGCTCTT GTAGATGAAT AAGATATTCC CGAAGAATGT CCGGAGAGGT TTCCAGGAGC 4718 .......... .......... .......... .......... .......... .......... 359 TGCTTAATGA AGAGTCCATC TTCTTCTGAA GTTGAAGCTT TCAAATTTGT AAAATATATG 4778 .......... .......... .......... .......... .......... .......... 359 CGCATAAGCT CCAGTTCAGT TGGAACAATC TTCAAGAGTA AATGTGCTAA CTTGAAGAGT 4838 .......... .......... .......... .......... .......... .......... 359 CGAGAGTCTT TAACATTCTG ATCATTCTCA TCTAGCTGGG CGAGTCGAGA GTCTTCATCA 4898 .......... .......... .......... .......... .......... .......... 359 GTCTGATCTT CCCAAAGGAA GCGTCCTACT CTCTCAGCCA TCAGTTGAAA CAGAGGTAAG 4958 .......... .......... .......... .......... .......... .......... 359 ACATTCTCAA CCATCTCATG CTTAATGCGA CCATTCACTA TCAATCCATG GAAATCTCTT 5018 .......... .......... .......... .......... .......... .......... 359 ACGTTTCCAC ATACATTCTG AAGAACCTCA TATTGTTAGG AACGAAAATA AGCAGGTGTA 5078 || | ||| || |||||||| | || ||||| |||||||||| .......... 
.......CTC TAATACC-CA -CTTGTTAGG ATCG-AAATA AGCAGGTGTA 399 AACGCGGAAG CTAGCAAAGC AAACCTCGAA AGACCACGAG TAAGAAGACA ACGAGAAATA 5138 || ||||||| |||||||||| |||| ||||| |||||||||| |||||||||| |||||||||| AATGCGGAAG CTAGCAAAGC AAACTTCGAA AGACCACGAG TAAGAAGACA ACGAGAAATA 459 TATCAAAAGA CA-CAAAGAT TTAACGTGGT TCGGTCAATC GACCTACGTC CACAAAGGAG 5197 || ||||||| || ||||||| |||||||||| |||||||||| ||||||||| |||||||||| TACCAAAAGA CACCAAAGAT TTAACGTGGT TCGGTCAATC GACCTACGTT CACAAAGGAG 519 ATGAGCAATC CACTATAAAT GTGAGAGTAC AAAATACAGA GGGAAACAAC CTCAACCAAT 5257 |||||||||| |||||||||| ||||||||| || ||||||| | |||||||| ||||||||| ATGAGCAATC CACTATAAAT ATGAGAGTAC AATATACAGA GAGAAACAAC ATCAACCAAT 579 TCACTCGGAA TACATG 5273 ||| |||||| || ||| TCATTCGGAA TATATG 595 hqPGS_C06HBa0112G05.1-9+_SGN-E271645+ (5036 5273) ******************************************************************************** EST sequence 9 -strand 747 n (File: SGN-E211090-) 1 CCCATATCAA CGGCTGGATT CAAGCTCTGC CCACAGTCGT AAATAATATC AGATTGCAAA 61 ATGGTAGTCT CTCCGGTCAG GATATCAATC TCCACCTCAC TGACAGCAGC ACCAAAGTTC 121 AAATAACTCG TAAAATCAGA TTCTGGTACA TAATAAGAAT TTGCTGCTAA GTTTACTGAT 181 TCCATTTGTG CCTGCAAAAA GAAGGGGGTG GAGGGACAGA ACTGCAAACT CAGATATGTT 241 AGGACCGAAA ATAAGCAGGT GTAAACGCGG AAGCTAGCAA TACAAACCTC GAAAGACCAC 301 GAGTAAGAAG ACAACGAGAA ATATATCAAA AGACACAAAG ATTTAACGTG GTTCGGTCAA 361 TCAACCTACG TCCACAAAGG AGATGAGCAA TCCACTATAA ATATAAGAGT ACAAAATACA 421 GAGAGAAACA ACCTCAACCA ATTCACTCAG AATACATGGG AGGTTCACAC AAGTGATAAC 481 ATATCAAGCT TGTGACCCAC AGATTCTCCC TCTAACCAAA ACTCTCAAAG CCTGTAAGAC 541 TACATTGTGA ATGCTGATTA AGTTAAAAGG AATATTCATC TATTTATAGA GTCCTAAACC 601 TTTTCCTACA AGAAAAGGAT TAGTCAATTC AAAACCTTTT CCTAAAAGGA AAACCTATTT 661 ATGGTAAGAA ATCAGGGCAA ATAAAATCCA ACAAGATACT TAAAAGATGA GCAAGAGTGG 721 CTAATGGATT TAAATTAACA AATTGTT Predicted gene structure (within gDNA segment 2001 to 6680): Exon 1 4442 4447 ( 6 n); cDNA 223 228 ( 6 n); score: 0.667 Intron 1 4448 5044 ( 597 n); Pd: 0.230 (s: 0), Pa: 0.000 (s: 0.91) Exon 2 5045 5477 ( 
433 n); cDNA 229 693 ( 465 n); score: 0.754 MATCH C06HBa0112G05.1-9+ SGN-E211090- 0.754 439 0.588 C PGS_C06HBa0112G05.1-9+_SGN-E211090- (4442 4447,5045 5477) Alignment (genomic DNA sequence = upper lines): TGGGAAGTGA GAAAATAAGA TGTAAGAGAC CATTATCTCG AAGTCGTGTA CAAGAGTTGA 4501 || || TGCAAA.... .......... .......... .......... .......... .......... 228 TACATCTCTG GTAAGTGTTC CAACACGATC CAAGAGATCA AAAAGTTTGT CATGATGAAT 4561 .......... .......... .......... .......... .......... .......... 228 AAAGTCCTTA AGCATATCAG AAAGGATAAT CAACAGGAAT TCCATCATGA CATGAATGTT 4621 .......... .......... .......... .......... .......... .......... 228 TCGAGCCCCT GAAGTGCTAG GGGTAATGAC AGTTATCATA TGCTCTTGTA GATGAATAAG 4681 .......... .......... .......... .......... .......... .......... 228 ATATTCCCGA AGAATGTCCG GAGAGGTTTC CAGGAGCTGC TTAATGAAGA GTCCATCTTC 4741 .......... .......... .......... .......... .......... .......... 228 TTCTGAAGTT GAAGCTTTCA AATTTGTAAA ATATATGCGC ATAAGCTCCA GTTCAGTTGG 4801 .......... .......... .......... .......... .......... .......... 228 AACAATCTTC AAGAGTAAAT GTGCTAACTT GAAGAGTCGA GAGTCTTTAA CATTCTGATC 4861 .......... .......... .......... .......... .......... .......... 228 ATTCTCATCT AGCTGGGCGA GTCGAGAGTC TTCATCAGTC TGATCTTCCC AAAGGAAGCG 4921 .......... .......... .......... .......... .......... .......... 228 TCCTACTCTC TCAGCCATCA GTTGAAACAG AGGTAAGACA TTCTCAACCA TCTCATGCTT 4981 .......... .......... .......... .......... .......... .......... 228 AATGCGACCA TTCACTATCA ATCCATGGAA ATCTCTTACG TTTCCACATA CATTCTGAAG 5041 .......... .......... .......... .......... .......... .......... 
228 AACCTCATAT -TGTTAGGAA CGAAAATAAG CAGGTGTAAA CGCGGAAGCT AGCAAAGCAA 5100 |||| || |||||||| |||||||||| |||||||||| |||||||||| ||||| ||| ...CTCAGAT ATGTTAGGAC CGAAAATAAG CAGGTGTAAA CGCGGAAGCT AGCAATACAA 285 ACCTCGAAAG ACCACGAGTA AGAAGACAAC GAGAAATATA TCAAAAGACA CAAAGATTTA 5160 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACCTCGAAAG ACCACGAGTA AGAAGACAAC GAGAAATATA TCAAAAGACA CAAAGATTTA 345 ACGTGGTTCG GTCAATCGAC CTACGTCCAC AAAGGAGATG AGCAATCCAC TATAAATGTG 5220 |||||||||| ||||||| || |||||||||| |||||||||| |||||||||| ||||||| | ACGTGGTTCG GTCAATCAAC CTACGTCCAC AAAGGAGATG AGCAATCCAC TATAAATATA 405 AGAGTACAAA ATACAGAGGG AAACAACCTC AACCAATTCA CTCGGAATAC AT--GA-G-- 5275 |||||||||| |||||||| | |||||||||| |||||||||| ||| |||||| || || | AGAGTACAAA ATACAGAGAG AAACAACCTC AACCAATTCA CTCAGAATAC ATGGGAGGTT 465 ------AG-G ------T-T- ---C-----A --CA-A-ATT CTCGCTCTAA CCAAAACTCT 5308 || | | | | | || | ||| ||| |||||| |||||||||| CACACAAGTG ATAACATATC AAGCTTGTGA CCCACAGATT CTCCCTCTAA CCAAAACTCT 525 CAAAGCCCTT AAAACTACAT TGTGAATGCT AATTAAGTTA GAAGGAACAT GCCTTTATTT 5368 ||||||| | || ||||||| |||||||||| ||||||||| |||||| || | | ||||| CAAAGCCTGT AAGACTACAT TGTGAATGCT GATTAAGTTA AAAGGAATAT TCATCTATTT 585 ATAGAGTCCT AAACCTTTTC CTACCAAAAA AAAGAATAGT CAATTCAAAA CCTTTTCCTA 5428 |||||||||| |||||||||| ||| ||| || || || |||| |||||||||| |||||||||| ATAGAGTCCT AAACCTTTTC CTA-CAAGAA AAGGATTAGT CAATTCAAAA CCTTTTCCTA 644 AAAGGAAAAC CTATTTATGG TAAGAAATCA GGGCAAATAA AACCCAACA 5477 |||||||||| |||||||||| |||||||||| |||||||||| || |||||| AAAGGAAAAC CTATTTATGG TAAGAAATCA GGGCAAATAA AATCCAACA 693 hqPGS_C06HBa0112G05.1-9+_SGN-E211090- (5045 5477) ******************************************************************************** EST sequence 2 +strand 667 n (File: SGN-E389613+) 1 TACATTACTG CAATCTCCAA ATCCAGCCAT AGATCCTTAG CATTGGACCA TTGGTAAAAG 61 TATTTAGTTA CAGATCCTTT TTTGCCTTTG GCAGCACTAA CTTATATACC TAATTAATTT 121 TGCCCCACAA TATGAAATGA GATAGCTTGC ATGCAACGCG TACCTGCTAC CTTCCACCAT 181 CACAAATCCA 
TAACCTGATA GTAATGAGTC AATGGAACCT TGACGGTGTT AGGACCAAAA 241 ATAAGCAGGT GTAAACGCGG AAGCTAGCAA AGCAAACCTC AAAAGACCAC GAGTAAGAAG 301 ACAACGAGAA ATATACCAAA AGATACAGAG ATTTAACGTG GTTCGGTCAA TTGACCTACG 361 TCCACAAAGG AGATGAGCAA TCCATTATAA ATATGAGAAT ACAAAATACA GAGAGAAACA 421 ACCTCAACCA AATTCACTCA GAATACATGA GAGATTCACA CAAGTGATAA CGTATCAAGA 481 AGCTTGTGAC TCATACCTCA ACGCATTATG TATGTGCAAA TCACAGGAGG AATGTACATT 541 CATGATCAGT TAAAAGTTAT AATAGAACAG GCAGAATAAA AGCAAAATAG AACATGATAA 601 TAATTTGTTA GAAACATTCT TGTACTCATC CAAATTAGTT GCAATGATTA CCTATTAGTA 661 ATATGAG Predicted gene structure (within gDNA segment 2093 to 9249): Exon 1 5058 5286 ( 229 n); cDNA 233 462 ( 230 n); score: 0.928 Intron 1 5287 8007 (2721 n); Pd: 0.000 (s: 0.87), Pa: 0.930 (s: 0.52) Exon 2 8008 8067 ( 60 n); cDNA 463 519 ( 57 n); score: 0.542 Intron 2 8068 8238 ( 171 n); Pd: 0.437 (s: 0.54), Pa: 0.739 (s: 0) Exon 3 8239 8274 ( 36 n); cDNA 520 554 ( 35 n); score: 0.583 MATCH C06HBa0112G05.1-9+ SGN-E389613+ 0.848 325 0.487 C PGS_C06HBa0112G05.1-9+_SGN-E389613+ (5058 5286,8008 8067,8239 8274) Alignment (genomic DNA sequence = upper lines): GAACGAAAAT AAGCAGGTGT AAACGCGGAA GCTAGCAAAG CAAACCTCGA AAGACCACGA 5117 || | ||||| |||||||||| |||||||||| |||||||||| |||||||| | |||||||||| GACCAAAAAT AAGCAGGTGT AAACGCGGAA GCTAGCAAAG CAAACCTCAA AAGACCACGA 292 GTAAGAAGAC AACGAGAAAT ATATCAAAAG ACACAAAGAT TTAACGTGGT TCGGTCAATC 5177 |||||||||| |||||||||| ||| |||||| | ||| |||| |||||||||| ||||||||| GTAAGAAGAC AACGAGAAAT ATACCAAAAG ATACAGAGAT TTAACGTGGT TCGGTCAATT 352 GACCTACGTC CACAAAGGAG ATGAGCAATC CACTATAAAT GTGAGAGTAC AAAATACAGA 5237 |||||||||| |||||||||| |||||||||| || ||||||| ||||| ||| |||||||||| GACCTACGTC CACAAAGGAG ATGAGCAATC CATTATAAAT ATGAGAATAC AAAATACAGA 412 GGGAAACAAC CTCAACC-AA TTCACTCGGA ATACATGAGA GGTTCACAAA TTCTCGCTCT 5296 | |||||||| ||||||| || ||||||| || |||||||||| | |||||| | GAGAAACAAC CTCAACCAAA TTCACTCAGA ATACATGAGA GATTCACACA .......... 
462 AACCAAAACT CTCAAAGCCC TTAAAACTAC ATTGTGAATG CTAATTAAGT TAGAAGGAAC 5356 .......... .......... .......... .......... .......... .......... 462 ATGCCTTTAT TTATAGAGTC CTAAACCTTT TCCTACCAAA AAAAAGAATA GTCAATTCAA 5416 .......... .......... .......... .......... .......... .......... 462 AACCTTTTCC TAAAAGGAAA ACCTATTTAT GGTAAGAAAT CAGGGCAAAT AAAACCCAAC 5476 .......... .......... .......... .......... .......... .......... 462 ACATATTGTG TCACTCTAGG TAACATCTTT TCAGCATGAT GCCTCGCTAG ATGATAGAGA 5536 .......... .......... .......... .......... .......... .......... 462 TTCAAGAGGA GGAAGTCCAA TTGCTCATCC ATCATGATAG CATCTGATTT ATAAGAACGA 5596 .......... .......... .......... .......... .......... .......... 462 TGATACAAGC TGATACAGTC ATCCATATTA CTGGTGAGTC TAGTAAGGAC ATCATCATCC 5656 .......... .......... .......... .......... .......... .......... 462 AAAATTGGTT GAAGCAGATT CTCATCCTCT TGTCATTATA TCTTCAAACT GATCCAAATC 5716 .......... .......... .......... .......... .......... .......... 462 GGAATAAGAA AGCTGGACAT ATGTACAAAT AAATGCCAGC TCCAATTTTA GCTTTTCAAT 5776 .......... .......... .......... .......... .......... .......... 462 TAGATCCACA TCAACATCCT TTTGATCTTG TTCATTCTTT AATCTCTCCA GAACATTGGC 5836 .......... .......... .......... .......... .......... .......... 462 AACGTCCGTG CGAAGAGCAG AAAATGACAT CTGCCAAAAG ACCAATCAAA TTTACAAAGC 5896 .......... .......... .......... .......... .......... .......... 462 CACATTTCTA CCCTGCAATA ACTGCTAACA CATAGTTCAG TGACTTCAAT GATAGAACAC 5956 .......... .......... .......... .......... .......... .......... 462 ATAAGGTTTA GTGAATTTAG TGGACAATAA ATAGTTTTTA GAATATTACA TATACACAAG 6016 .......... .......... .......... .......... .......... .......... 462 TAAAGTTCAT TGATCAAGGG TGAAATTAGC TCGAGAATCT ACCTGCATTA TTTTATTTTG 6076 .......... .......... .......... .......... .......... .......... 462 TTTACCTTCA AATTTTGATT ATATACTATA ATAAAATAAT GATCCAAGTA AATATAAGAT 6136 .......... .......... .......... .......... .......... .......... 
462 ACCTGTAGGT TGCTTCTTTC CATGACAGTA ATTCTCCGGA TCTGATGTAG ATAACAAGTT 6196 .......... .......... .......... .......... .......... .......... 462 TAAACACAGA GTTGTAATTA GAGACGATAG GAACCTTTTC AGTTGATGAA ATATCTTATA 6256 .......... .......... .......... .......... .......... .......... 462 TGTATGTTCT TTGTTTGTGT GGAGGAGAAA ATACCAAGAA AATGACATAA TAAATAGGTT 6316 .......... .......... .......... .......... .......... .......... 462 TGAACAAGTC AGTCAGTCAA CTGACATTGG GACTGTACAG GCATTCATTC TCAACTAAAA 6376 .......... .......... .......... .......... .......... .......... 462 TAATTAATAG CATACTATAT GGCTTGTTTA CCCAGAGTAA TCGAAATTTA ATTTGTACAC 6436 .......... .......... .......... .......... .......... .......... 462 AAAGTATAAT GATACTAACT CGCTATATAT TTAGTAGATA ATGTGCTTGA TTAAAATCTT 6496 .......... .......... .......... .......... .......... .......... 462 ACTAGAAAAA TCCCCTGGGA AAAAAGTGAT CTCTCGTAAC TGCTTTGGTT GCTTGTTATT 6556 .......... .......... .......... .......... .......... .......... 462 TTACTAACGT CGTCTACTTA ATAGTTCCGT CGGGGCTTTG ATGAAACAAA TTGTAGACAT 6616 .......... .......... .......... .......... .......... .......... 462 CTTTTTAGAA CAGTAAAAGG TGTAAGAACC TGCCAATTTG TTTGACACTT CTATGTATGA 6676 .......... .......... .......... .......... .......... .......... 462 AGACTCTGGA GATTTGGACG TGAACTGTCA CATTTCTCCG CTCAAATTTT GCCAGAATAT 6736 .......... .......... .......... .......... .......... .......... 462 TCCAAAATTT TAGAAACTCC AAATACTTGT TTTCATAATT TTAACTTTGT ACACTTACAA 6796 .......... .......... .......... .......... .......... .......... 462 AAGAAATTCA ACATTTTTCC AAGTTATATT TATTCTTTCT AGGCATAGTA CATTAAAATG 6856 .......... .......... .......... .......... .......... .......... 462 ACATTTAGCT TGACTTCAGT GGATAACTGT GACCTTTAAC TTTGCTTATC TGCCATAAAG 6916 .......... .......... .......... .......... .......... .......... 462 GAAAGAGATA AGAAAGTTAA TCAAACTTTC AATACTCTCA GTTTGAAGTC AATGAAGTGA 6976 .......... .......... .......... .......... .......... .......... 
462 TTGAGAGAAC TCAAGTTCAA ATTTCAGCGG AGTTATCTGA TACCTGTTGC TGGTGCGTGT 7036 .......... .......... .......... .......... .......... .......... 462 TTTCAATAGT CATGGCTCGA TACATTAACT GTTTTTTTTT CCTGAGAAGG GATACATGAA 7096 .......... .......... .......... .......... .......... .......... 462 CTGTGTTTGT GATAGAATAG ATAAATTGAT TGTTAGTTGT AAAACCAGAA GTTTCCAAGA 7156 .......... .......... .......... .......... .......... .......... 462 GAAAGAACAA TAATCCTAAT ATATTAATTG TGTTGCTGAA TCTGATAAAT GTCAGGGATA 7216 .......... .......... .......... .......... .......... .......... 462 AATAAACAGT GTAATGAACT TCAGTGTTTA CTGAGATAAC ATAGTTAAAC AATTCATGTC 7276 .......... .......... .......... .......... .......... .......... 462 GAGTGAACTC TTTGAATAAT TTATTGGTCT GCCATATGAA GAATAACTTA ATGTTGCACA 7336 .......... .......... .......... .......... .......... .......... 462 AATCCTACAT TTTGACAATT CATGTTTATC TTTCCTAGTT AGACGCTTCA TTAGCTTTTG 7396 .......... .......... .......... .......... .......... .......... 462 AGCTCTTCTT GTCCAGTGGC TCTTTTGCAT AATTGGTTAT CCAGCTTACG CTTTAATTAA 7456 .......... .......... .......... .......... .......... .......... 462 GGCACTTGTG ATGGAGGAGA TTGCCAACAC TAGTAAGGAA CTTCAAGAGC TTAGGGTGAT 7516 .......... .......... .......... .......... .......... .......... 462 TCTACAGATT CTTCGAGACA TTCTGTGTCT ACAGATCTTT TGGGGCCACT TTGTCTGAAA 7576 .......... .......... .......... .......... .......... .......... 462 TGGTGTCGCT AACAAGTGCC TTGTTTCGTC TTAAAATTTG GAGTTATCGT CTCCATATAT 7636 .......... .......... .......... .......... .......... .......... 462 CCTTTTGTTT ATGCTGGAGT AAGGTCTATA TAGTTTCCCC TTTGTTTGTT AATTAATTCT 7696 .......... .......... .......... .......... .......... .......... 462 TTTTTAGCTG TTAAATGTGC CCTCAATGTT CACTTATCTC TATATACTTG TATTAATACA 7756 .......... .......... .......... .......... .......... .......... 462 TTATATTCTT ATTTGGAACT GAAATTAACA GGAGAGCTTC TAGAGCTTCA GGGATTAAAA 7816 .......... .......... .......... .......... .......... .......... 
462 AAAGTTAAGG TACTCTTTCA TTTGTTTAGA TCACTATAAT TTCTAAGTGG TTAGGCCCCT 7876 .......... .......... .......... .......... .......... .......... 462 AAGGAAATAG TTGAGGCCTC AAGATCATCC GATGCTTATG CTTAAAATCA ACTATAATAG 7936 .......... .......... .......... .......... .......... .......... 462 TACTAAATAG AGATAAATAG ACATTTACTC TCAGTGGATG TTCTAAGTGT TGGTGATGAT 7996 .......... .......... .......... .......... .......... .......... 462 ATTTTCTTCA GAATAATACT AGGATTGCAC AAGCATAGTT GCATACATTT CAAACAGAGA 8056 | | ||| | | | | |||| | || | | | | | ||| | .......... .AGTGATA-- ACGTATCAAG AAGC-TTGTG ACTCATACCT C-AACGCATT 507 ATGTTTG-GC AAGTGCTAGA ATGAACTTCT AATTTTAATT GAATTTATGC TTTTATAAAA 8115 |||| || || || ATGTATGTGC AA........ .......... .......... .......... .......... 519 AATCAAATTG ATGAATAAAA TCAATTCTTA ACTGAAAATT TCGTTGATGA TTCAATCCGG 8175 .......... .......... .......... .......... .......... .......... 519 GGGGAAAAGG CTTGGACATC TTTTGAACCT AGAAACACAA TTATGGTTGC CAAACTATGA 8235 .......... .......... .......... .......... .......... .......... 
519 CAGTTCCGAG GAGAGAATCA GGAGGCTTGG ATAGTTCAA 8274 || || ||| |||| | | || |||| || ...ATCACAG GAG-GAATGT ACATTCATGA TCAGTTAAA 554 hqPGS_C06HBa0112G05.1-9+_SGN-E389613+ (5058 5286) ******************************************************************************** EST sequence 4 +strand 574 n (File: SGN-E206285+) 1 AACTAAGTAG AGCTATGTAT ATTTTGTCTG CAGAAGCAAG GGCTTAATTT ACAACTATAG 61 GTGGCGTACG TATAATGCAT GGACCGAAAA ATAAGCAGGT GTAAACGCGG AAGCTAGCAA 121 TGCTAACATC GAAAGACCAC GAGTAAGAAG ACAACGAGAA ATATACCAAA AGACACAAAG 181 ATTTAACGTG GTTCGGTCAA TCGACCTATG TCCACAAAGG AGATGAGCAA TCCACTATAA 241 ATATGAGAGT ACAAAATACA GAGGGAAACA ATCTCAACCA ATTCACTCAG AATACATGGG 301 AGGTTCACAC AAGTAATAAC GTATCAAGCT TGTGACCCAC AAATTCTCCC TCTAACCAAA 361 ACTCTCAAAG CCCTTAAGAC TACATTGTGA ATGTTAACTA AGTTAAAAGG AACATGCCTC 421 TATTTATAGA ATCCTAAACC TTTCCCTACA AGAAAAGGAT TAGTCAATCC AAAACCTTTT 481 CCTACAAGGA AAATCTATTT ATGGTAAGAA ATTTAGAGCA AATAAAACCC AACAAACATC 541 TATGATACGT GAACTTGCTT GGCCCTATAT GTTT Predicted gene structure (within gDNA segment 3593 to 7087): Exon 1 5058 5277 ( 220 n); cDNA 82 302 ( 221 n); score: 0.943 Intron 1 5278 6313 (1036 n); Pd: 0.848 (s: 0.94), Pa: 0.349 (s: 0) Exon 2 6314 6349 ( 36 n); cDNA 303 336 ( 34 n); score: 0.583 MATCH C06HBa0112G05.1-9+ SGN-E206285+ 0.943 256 0.446 C PGS_C06HBa0112G05.1-9+_SGN-E206285+ (5058 5277,6314 6349) Alignment (genomic DNA sequence = upper lines): GAACGAAAA- TAAGCAGGTG TAAACGCGGA AGCTAGCAAA GCAAACCTCG AAAGACCACG 5116 || |||||| |||||||||| |||||||||| ||||||||| || ||| ||| |||||||||| GACCGAAAAA TAAGCAGGTG TAAACGCGGA AGCTAGCAAT GCTAACATCG AAAGACCACG 141 AGTAAGAAGA CAACGAGAAA TATATCAAAA GACACAAAGA TTTAACGTGG TTCGGTCAAT 5176 |||||||||| |||||||||| |||| ||||| |||||||||| |||||||||| |||||||||| AGTAAGAAGA CAACGAGAAA TATACCAAAA GACACAAAGA TTTAACGTGG TTCGGTCAAT 201 CGACCTACGT CCACAAAGGA GATGAGCAAT CCACTATAAA TGTGAGAGTA CAAAATACAG 5236 ||||||| || |||||||||| |||||||||| |||||||||| | |||||||| |||||||||| CGACCTATGT CCACAAAGGA GATGAGCAAT CCACTATAAA TATGAGAGTA 
CAAAATACAG 261 AGGGAAACAA CCTCAACCAA TTCACTCGGA ATACATGAGA GGTTCACAAA TTCTCGCTCT 5296 |||||||||| ||||||||| ||||||| || ||||||| || | AGGGAAACAA TCTCAACCAA TTCACTCAGA ATACATGGGA G......... .......... 302 AACCAAAACT CTCAAAGCCC TTAAAACTAC ATTGTGAATG CTAATTAAGT TAGAAGGAAC 5356 .......... .......... .......... .......... .......... .......... 302 ATGCCTTTAT TTATAGAGTC CTAAACCTTT TCCTACCAAA AAAAAGAATA GTCAATTCAA 5416 .......... .......... .......... .......... .......... .......... 302 AACCTTTTCC TAAAAGGAAA ACCTATTTAT GGTAAGAAAT CAGGGCAAAT AAAACCCAAC 5476 .......... .......... .......... .......... .......... .......... 302 ACATATTGTG TCACTCTAGG TAACATCTTT TCAGCATGAT GCCTCGCTAG ATGATAGAGA 5536 .......... .......... .......... .......... .......... .......... 302 TTCAAGAGGA GGAAGTCCAA TTGCTCATCC ATCATGATAG CATCTGATTT ATAAGAACGA 5596 .......... .......... .......... .......... .......... .......... 302 TGATACAAGC TGATACAGTC ATCCATATTA CTGGTGAGTC TAGTAAGGAC ATCATCATCC 5656 .......... .......... .......... .......... .......... .......... 302 AAAATTGGTT GAAGCAGATT CTCATCCTCT TGTCATTATA TCTTCAAACT GATCCAAATC 5716 .......... .......... .......... .......... .......... .......... 302 GGAATAAGAA AGCTGGACAT ATGTACAAAT AAATGCCAGC TCCAATTTTA GCTTTTCAAT 5776 .......... .......... .......... .......... .......... .......... 302 TAGATCCACA TCAACATCCT TTTGATCTTG TTCATTCTTT AATCTCTCCA GAACATTGGC 5836 .......... .......... .......... .......... .......... .......... 302 AACGTCCGTG CGAAGAGCAG AAAATGACAT CTGCCAAAAG ACCAATCAAA TTTACAAAGC 5896 .......... .......... .......... .......... .......... .......... 302 CACATTTCTA CCCTGCAATA ACTGCTAACA CATAGTTCAG TGACTTCAAT GATAGAACAC 5956 .......... .......... .......... .......... .......... .......... 302 ATAAGGTTTA GTGAATTTAG TGGACAATAA ATAGTTTTTA GAATATTACA TATACACAAG 6016 .......... .......... .......... .......... .......... .......... 302 TAAAGTTCAT TGATCAAGGG TGAAATTAGC TCGAGAATCT ACCTGCATTA TTTTATTTTG 6076 .......... .......... .......... 
.......... .......... .......... 302 TTTACCTTCA AATTTTGATT ATATACTATA ATAAAATAAT GATCCAAGTA AATATAAGAT 6136 .......... .......... .......... .......... .......... .......... 302 ACCTGTAGGT TGCTTCTTTC CATGACAGTA ATTCTCCGGA TCTGATGTAG ATAACAAGTT 6196 .......... .......... .......... .......... .......... .......... 302 TAAACACAGA GTTGTAATTA GAGACGATAG GAACCTTTTC AGTTGATGAA ATATCTTATA 6256 .......... .......... .......... .......... .......... .......... 302 TGTATGTTCT TTGTTTGTGT GGAGGAGAAA ATACCAAGAA AATGACATAA TAAATAGGTT 6316 ||| .......... .......... .......... .......... .......... .......GTT 305 TGAACAAGTC AGTCAGTCAA CTGACATTGG GAC 6349 |||||| | | | | | | ||| ||| CACACAAGT- AATAACGTAT C-AAGCTTGT GAC 336 hqPGS_C06HBa0112G05.1-9+_SGN-E206285+ (5058 5277) ******************************************************************************** EST sequence 5 +strand 344 n (File: SGN-E206496+) 1 AACTAAGTAG AGCTATGTAT ATTTTGTCTG CAGAAGCAAT GGCTTAATTT ACAACTATAG 61 GTGGCGTACG TATAATGCAT GGACCGAAAA ATAAGCAGGT GTAAACGCGG AAGCTAGCAA 121 TGCTAACATC GAAAGACCAC GAGTAAGAAG ACAACGAGAA ATATACCAAA AGACACAAAG 181 ATTTAACGTG GTTCGGTCAA TCGACCTATG TCCACAAAGG AGATGAGCAA TCCACTATAA 241 ATATGAGAGT ACAAAATACA GAGGGAAACA ATCTCAACCA ATTCACTCAG AATACATGGG 301 AGGTTCACAC AAGTAATAAC GTATCAAGCT TGTGACCCAC AAAT Predicted gene structure (within gDNA segment 3593 to 6433): Exon 1 5058 5277 ( 220 n); cDNA 82 302 ( 221 n); score: 0.943 MATCH C06HBa0112G05.1-9+ SGN-E206496+ 0.943 220 0.640 C PGS_C06HBa0112G05.1-9+_SGN-E206496+ (5058 5277) Alignment (genomic DNA sequence = upper lines): GAACGAAAA- TAAGCAGGTG TAAACGCGGA AGCTAGCAAA GCAAACCTCG AAAGACCACG 5116 || |||||| |||||||||| |||||||||| ||||||||| || ||| ||| |||||||||| GACCGAAAAA TAAGCAGGTG TAAACGCGGA AGCTAGCAAT GCTAACATCG AAAGACCACG 141 AGTAAGAAGA CAACGAGAAA TATATCAAAA GACACAAAGA TTTAACGTGG TTCGGTCAAT 5176 |||||||||| |||||||||| |||| ||||| |||||||||| |||||||||| |||||||||| AGTAAGAAGA CAACGAGAAA TATACCAAAA GACACAAAGA TTTAACGTGG TTCGGTCAAT 201 CGACCTACGT 
CCACAAAGGA GATGAGCAAT CCACTATAAA TGTGAGAGTA CAAAATACAG 5236 ||||||| || |||||||||| |||||||||| |||||||||| | |||||||| |||||||||| CGACCTATGT CCACAAAGGA GATGAGCAAT CCACTATAAA TATGAGAGTA CAAAATACAG 261 AGGGAAACAA CCTCAACCAA TTCACTCGGA ATACATGAGA G 5277 |||||||||| ||||||||| ||||||| || ||||||| || | AGGGAAACAA TCTCAACCAA TTCACTCAGA ATACATGGGA G 302 hqPGS_C06HBa0112G05.1-9+_SGN-E206496+ (5058 5277) ******************************************************************************** EST sequence 7 +strand 349 n (File: SGN-E207598+) 1 GAAAAGAAAC TAAGTAGAGC TATGTATATT TTGTCTGCAG AAGCAATGGC TTAATTTACA 61 ACTATAGGTG GCGTACGTAT AATGCATGGA CCGAAAAATA AGCAGGTGTA AACGCGGAAG 121 CTAGCAATGC TAACATCGAA AGACCACGAG TTAGAAGACA ACGAGAAATA TACCAAAAGA 181 CACAAAGATT TAACGTGGGT CGGTCAATCG ACCTATGTCC ACAAAGGAGA TGAGCAATCC 241 ACTATTAATA TGAGAGTACA AAATACAGAG GGAAACAATC TCAACCAATT CACTCAGAAT 301 ACATGGGAGG GTCACACAAG TAATAACGTA TCAAGCTTGT GACCCACAA Predicted gene structure (within gDNA segment 3523 to 6413): Exon 1 5058 5277 ( 220 n); cDNA 89 309 ( 221 n); score: 0.930 MATCH C06HBa0112G05.1-9+ SGN-E207598+ 0.930 220 0.630 C PGS_C06HBa0112G05.1-9+_SGN-E207598+ (5058 5277) Alignment (genomic DNA sequence = upper lines): GAACGAAAA- TAAGCAGGTG TAAACGCGGA AGCTAGCAAA GCAAACCTCG AAAGACCACG 5116 || |||||| |||||||||| |||||||||| ||||||||| || ||| ||| |||||||||| GACCGAAAAA TAAGCAGGTG TAAACGCGGA AGCTAGCAAT GCTAACATCG AAAGACCACG 148 AGTAAGAAGA CAACGAGAAA TATATCAAAA GACACAAAGA TTTAACGTGG TTCGGTCAAT 5176 ||| |||||| |||||||||| |||| ||||| |||||||||| |||||||||| ||||||||| AGTTAGAAGA CAACGAGAAA TATACCAAAA GACACAAAGA TTTAACGTGG GTCGGTCAAT 208 CGACCTACGT CCACAAAGGA GATGAGCAAT CCACTATAAA TGTGAGAGTA CAAAATACAG 5236 ||||||| || |||||||||| |||||||||| ||||||| || | |||||||| |||||||||| CGACCTATGT CCACAAAGGA GATGAGCAAT CCACTATTAA TATGAGAGTA CAAAATACAG 268 AGGGAAACAA CCTCAACCAA TTCACTCGGA ATACATGAGA G 5277 |||||||||| ||||||||| ||||||| || ||||||| || | AGGGAAACAA TCTCAACCAA TTCACTCAGA ATACATGGGA G 309 
hqPGS_C06HBa0112G05.1-9+_SGN-E207598+ (5058 5277) ******************************************************************************** EST sequence 1 +strand 302 n (File: SGN-E370622+) 1 TTTTTTTTTT TGATAACCTA GAAATATGCC TATGACCCGC CCCTTTGGAC TAATCACAAC 61 CTTCTAAACT CGGTGGATAA TGTTAGGACC GAAAATAAGC AGGTGTAAAC GCGGAAGCTA 121 GCAACGCAAA CCTCAAAATA CCACGAGTAA GACGACAACG AGAAATATAC CAAAAGACAC 181 AAAGATTTAA CGTGGTTCGG TCAATCGACC TACGTCCACA AAGGAGATGA GCAATCTACT 241 ATAAATATGA GAGTACAAAA TACAGAGAGA AACAACCTCA ACCAATTCAC TCGGAATACA 301 TG Predicted gene structure (within gDNA segment 3571 to 5883): Exon 1 5058 5273 ( 216 n); cDNA 87 302 ( 216 n); score: 0.958 PPA cDNA 11 1 MATCH C06HBa0112G05.1-9+ SGN-E370622+ 0.958 216 0.715 C PGS_C06HBa0112G05.1-9+_SGN-E370622+ (5058 5273) Alignment (genomic DNA sequence = upper lines): GAACGAAAAT AAGCAGGTGT AAACGCGGAA GCTAGCAAAG CAAACCTCGA AAGACCACGA 5117 || ||||||| |||||||||| |||||||||| |||||||| | |||||||| | || ||||||| GACCGAAAAT AAGCAGGTGT AAACGCGGAA GCTAGCAACG CAAACCTCAA AATACCACGA 146 GTAAGAAGAC AACGAGAAAT ATATCAAAAG ACACAAAGAT TTAACGTGGT TCGGTCAATC 5177 |||||| ||| |||||||||| ||| |||||| |||||||||| |||||||||| |||||||||| GTAAGACGAC AACGAGAAAT ATACCAAAAG ACACAAAGAT TTAACGTGGT TCGGTCAATC 206 GACCTACGTC CACAAAGGAG ATGAGCAATC CACTATAAAT GTGAGAGTAC AAAATACAGA 5237 |||||||||| |||||||||| |||||||||| ||||||||| ||||||||| |||||||||| GACCTACGTC CACAAAGGAG ATGAGCAATC TACTATAAAT ATGAGAGTAC AAAATACAGA 266 GGGAAACAAC CTCAACCAAT TCACTCGGAA TACATG 5273 | |||||||| |||||||||| |||||||||| |||||| GAGAAACAAC CTCAACCAAT TCACTCGGAA TACATG 302 hqPGS_C06HBa0112G05.1-9+_SGN-E370622+ (5058 5273) ******************************************************************************** EST sequence 10 -strand 490 n (File: SGN-E205772-) 1 GGTGTAAACG CGGAAGCTAG CAATACAAAC CTCGAAAGAC CACGAGTAAG AAGACAACGA 61 GAAATATATC AAAAGACACA AAGATTTAAC GTGGTTCGGT CAATCAACCT ACGTCCACAA 121 AGGAGATGAG CAATCCACTA TAAATATAAG AGTACAAAAT ACAGAGAGAA TCAACCTCAA 181 CCAATTCACT CAGAATACAT GGGAGGTTCA 
CACAAGTGAT AACATATCAA GCTTGTGACC 241 CACAGATTCT CCCTCTAACC AAAACTCTCA AAGCCTGTAA GAGTACATTG TGAATGCTGA 301 TTAAGTTAAA AGGAATATTC ATCTATTTAT AGAGTCCTAA ACCTTTTCCT ACAAGAAAAG 361 GATTAGTCAA TTCAAAACCT TTTCCTAAAA GGAAAACCTA TTTATGGTAA GAAATCAGGG 421 CAAATAAAAT CCAACAAGAT ACTTAAAAGA TGAGCAAGAG TGGCTAATGG ATTTAAATTA 481 ACAAATTGTT Predicted gene structure (within gDNA segment 4463 to 6680): Exon 1 5073 5477 ( 405 n); cDNA 1 436 ( 436 n); score: 0.743 MATCH C06HBa0112G05.1-9+ SGN-E205772- 0.743 405 0.827 C PGS_C06HBa0112G05.1-9+_SGN-E205772- (5073 5477) Alignment (genomic DNA sequence = upper lines): GGTGTAAACG CGGAAGCTAG CAAAGCAAAC CTCGAAAGAC CACGAGTAAG AAGACAACGA 5132 |||||||||| |||||||||| ||| ||||| |||||||||| |||||||||| |||||||||| GGTGTAAACG CGGAAGCTAG CAATACAAAC CTCGAAAGAC CACGAGTAAG AAGACAACGA 60 GAAATATATC AAAAGACACA AAGATTTAAC GTGGTTCGGT CAATCGACCT ACGTCCACAA 5192 |||||||||| |||||||||| |||||||||| |||||||||| ||||| |||| |||||||||| GAAATATATC AAAAGACACA AAGATTTAAC GTGGTTCGGT CAATCAACCT ACGTCCACAA 120 AGGAGATGAG CAATCCACTA TAAATGTGAG AGTACAAAAT ACAGAGGGAA ACAACCTCAA 5252 |||||||||| |||||||||| ||||| | || |||||||||| |||||| ||| ||||||||| AGGAGATGAG CAATCCACTA TAAATATAAG AGTACAAAAT ACAGAGAGAA TCAACCTCAA 180 CCAATTCACT CGGAATACAT GAGAGGTT-- CAC------- AA-AT-T--- -C---T---- 5290 |||||||||| | |||||||| | |||||| ||| || || | | | CCAATTCACT CAGAATACAT GGGAGGTTCA CACAAGTGAT AACATATCAA GCTTGTGACC 240 --C-G---CT --CT--AACC AAAACTCTCA AAGCCCTTAA AACTACATTG TGAATGCTAA 5340 | | || || |||| |||||||||| ||||| ||| | ||||||| |||||||| | CACAGATTCT CCCTCTAACC AAAACTCTCA AAGCCTGTAA GAGTACATTG TGAATGCTGA 300 TTAAGTTAGA AGGAACATGC CTTTATTTAT AGAGTCCTAA ACCTTTTCCT ACCAAAAAAA 5400 |||||||| | ||||| || | | ||||||| |||||||||| |||||||||| | ||| |||| TTAAGTTAAA AGGAATATTC ATCTATTTAT AGAGTCCTAA ACCTTTTCCT A-CAAGAAAA 359 AGAATAGTCA ATTCAAAACC TTTTCCTAAA AGGAAAACCT ATTTATGGTA AGAAATCAGG 5460 || |||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGATTAGTCA ATTCAAAACC TTTTCCTAAA AGGAAAACCT 
ATTTATGGTA AGAAATCAGG 419 GCAAATAAAA CCCAACA 5477 |||||||||| |||||| GCAAATAAAA TCCAACA 436 hqPGS_C06HBa0112G05.1-9+_SGN-E205772- (5073 5477) ******************************************************************************** EST sequence 11 -strand 456 n (File: SGN-E205773-) 1 AAAGACCACG AGTAAGAAGC CATCGAGAAA TATATCAAAA GACACAAAGA TTTAACGTGG 61 TTCGGTCAAT CAACCTACGT CCACAAAGTA GATGAGCAAT CCACTATAAA TATAAGAGTA 121 CAAAATACAG AGAGAAACAG CTTCAACCAA TTCACTCAGA ATACATGGTA GGTTCACACA 181 AGTGATAACA TATCAAGCTT GTGACCCACA GATTCTCCCT CTAACCAAAA CTCTCAAAGC 241 CTGTAAGACT ACATTGTGAA TGCTGATTAA GTTAAAAGGA ATATTCATCT ATTTATAGAG 301 TCCTAAACCT TTTCCTACAA GAAAAGGATT AGTCAATTCA AAACCTTTTC CTAAAAGGAA 361 AACCTATTTA TGGTAAGAAA TCAGGGCAAA TAAAATCCAA CAAGATACTT AAAAGATGAG 421 CAAGAGTGGC TAATGGATTT AAATTAACTA ATTGTT Predicted gene structure (within gDNA segment 3690 to 6680): Exon 1 4442 4447 ( 6 n); cDNA 200 205 ( 6 n); score: 0.667 Intron 1 4448 5279 ( 832 n); Pd: 0.230 (s: 0), Pa: 0.000 (s: 0.88) Exon 2 5280 5477 ( 198 n); cDNA 206 402 ( 197 n); score: 0.914 MATCH C06HBa0112G05.1-9+ SGN-E205773- 0.914 204 0.447 C PGS_C06HBa0112G05.1-9+_SGN-E205773- (4442 4447,5280 5477) Alignment (genomic DNA sequence = upper lines): TGGGAAGTGA GAAAATAAGA TGTAAGAGAC CATTATCTCG AAGTCGTGTA CAAGAGTTGA 4501 || || TGTGAC.... .......... .......... .......... .......... .......... 205 TACATCTCTG GTAAGTGTTC CAACACGATC CAAGAGATCA AAAAGTTTGT CATGATGAAT 4561 .......... .......... .......... .......... .......... .......... 205 AAAGTCCTTA AGCATATCAG AAAGGATAAT CAACAGGAAT TCCATCATGA CATGAATGTT 4621 .......... .......... .......... .......... .......... .......... 205 TCGAGCCCCT GAAGTGCTAG GGGTAATGAC AGTTATCATA TGCTCTTGTA GATGAATAAG 4681 .......... .......... .......... .......... .......... .......... 205 ATATTCCCGA AGAATGTCCG GAGAGGTTTC CAGGAGCTGC TTAATGAAGA GTCCATCTTC 4741 .......... .......... .......... .......... .......... .......... 
205 TTCTGAAGTT GAAGCTTTCA AATTTGTAAA ATATATGCGC ATAAGCTCCA GTTCAGTTGG 4801 .......... .......... .......... .......... .......... .......... 205 AACAATCTTC AAGAGTAAAT GTGCTAACTT GAAGAGTCGA GAGTCTTTAA CATTCTGATC 4861 .......... .......... .......... .......... .......... .......... 205 ATTCTCATCT AGCTGGGCGA GTCGAGAGTC TTCATCAGTC TGATCTTCCC AAAGGAAGCG 4921 .......... .......... .......... .......... .......... .......... 205 TCCTACTCTC TCAGCCATCA GTTGAAACAG AGGTAAGACA TTCTCAACCA TCTCATGCTT 4981 .......... .......... .......... .......... .......... .......... 205 AATGCGACCA TTCACTATCA ATCCATGGAA ATCTCTTACG TTTCCACATA CATTCTGAAG 5041 .......... .......... .......... .......... .......... .......... 205 AACCTCATAT TGTTAGGAAC GAAAATAAGC AGGTGTAAAC GCGGAAGCTA GCAAAGCAAA 5101 .......... .......... .......... .......... .......... .......... 205 CCTCGAAAGA CCACGAGTAA GAAGACAACG AGAAATATAT CAAAAGACAC AAAGATTTAA 5161 .......... .......... .......... .......... .......... .......... 205 CGTGGTTCGG TCAATCGACC TACGTCCACA AAGGAGATGA GCAATCCACT ATAAATGTGA 5221 .......... .......... .......... .......... .......... .......... 205 GAGTACAAAA TACAGAGGGA AACAACCTCA ACCAATTCAC TCGGAATACA TGAGAGGTTC 5281 | .......... .......... .......... .......... .......... 
........CC 207 ACAAATTCTC GCTCTAACCA AAACTCTCAA AGCCCTTAAA ACTACATTGT GAATGCTAAT 5341 ||| |||||| ||||||||| |||||||||| |||| ||| |||||||||| ||||||| || ACAGATTCTC CCTCTAACCA AAACTCTCAA AGCCTGTAAG ACTACATTGT GAATGCTGAT 267 TAAGTTAGAA GGAACATGCC TTTATTTATA GAGTCCTAAA CCTTTTCCTA CCAAAAAAAA 5401 ||||||| || |||| || | | |||||||| |||||||||| |||||||||| ||| |||| TAAGTTAAAA GGAATATTCA TCTATTTATA GAGTCCTAAA CCTTTTCCTA -CAAGAAAAG 326 GAATAGTCAA TTCAAAACCT TTTCCTAAAA GGAAAACCTA TTTATGGTAA GAAATCAGGG 5461 || ||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GATTAGTCAA TTCAAAACCT TTTCCTAAAA GGAAAACCTA TTTATGGTAA GAAATCAGGG 386 CAAATAAAAC CCAACA 5477 ||||||||| |||||| CAAATAAAAT CCAACA 402 hqPGS_C06HBa0112G05.1-9+_SGN-E205773- (5280 5477) ******************************************************************************** EST sequence 6 +strand 535 n (File: SGN-E243194+) 1 AAAAGACATA AAGATTTAAC GTGGTTCGGT CAATCGACCT ACGTCCACAA AGGAGATGAG 61 CAATCCACTA TAAATATGAG AGTACAAAAT ACAGAGGGAA ACAACCTCAA CCAATTCACT 121 CAGAATACAT GGGAGGTTCA CACAAGTGAT AACGTATCAA GCTTGTGACC CACAAATTCT 181 CCCTCTAACC AAAACTCTCA AAGCACTTAA GACTACATTT TGAATGCTGA TTAAGTTAGA 241 AGAAAAATGC CTCTATTTAT GGAGTCCTAA ACCTTTTCCT ACAAGAAAAG GATTAGTCAA 301 TCCAAAACCT TTTCCTACAA GAAAAGGATT AGTCAATCCA AAACCTTTTC CTACAAGGAA 361 AACCTATTTA TGGTAAGAAA TTTAGGGCAA ATAAAACCCA ACAAGTCTCC CCCTTGGCCT 421 GAATTTCTGA CAAATAAACT TGTCCACCTT CTTCACTTAA TCTTCAACAA CTTGCTTCTC 481 CTCTCCATAA TCTCCTTTGC AAAATTTATG TCTCAACACA AAGAATCTCT CTGAA Predicted gene structure (within gDNA segment 4462 to 7407): Exon 1 5281 5443 ( 163 n); cDNA 171 330 ( 160 n); score: 0.883 MATCH C06HBa0112G05.1-9+ SGN-E243194+ 0.883 163 0.305 C PGS_C06HBa0112G05.1-9+_SGN-E243194+ (5281 5443) Alignment (genomic DNA sequence = upper lines): CACAAATTCT CGCTCTAACC AAAACTCTCA AAGCCCTTAA AACTACATTG TGAATGCTAA 5340 |||||||||| | |||||||| |||||||||| |||| ||||| |||||||| |||||||| | CACAAATTCT CCCTCTAACC AAAACTCTCA AAGCACTTAA GACTACATTT TGAATGCTGA 230 TTAAGTTAGA AGGAACATGC 
CTTTATTTAT AGAGTCCTAA ACCTTTTCCT ACCAAAAAAA 5400 |||||||||| || || |||| || ||||||| ||||||||| |||||||||| | ||| |||| TTAAGTTAGA AGAAAAATGC CTCTATTTAT GGAGTCCTAA ACCTTTTCCT A-CAAGAAAA 289 AGAATAGTCA ATTCAAAACC TTTTCCTAAA AGGAAAACCT ATT 5443 || |||||| || ||||||| |||||||| | | ||||| ||| GGATTAGTCA ATCCAAAACC TTTTCCTACA A-GAAAA-GG ATT 330 hqPGS_C06HBa0112G05.1-9+_SGN-E243194+ (5281 5443) ******************************************************************************** EST sequence 8 -strand 457 n (File: SGN-E544869-) 1 ATCATCCAGG GAGAAGAATG GAACATGGGG GAGGAAGACA CCTTTGAGAA TCTCAAATTT 61 TTGAAGTGGG GTTTACGGAC TTTTTTCAAG TGGGAGGTTG GAGAGAAATC CTTCCCCAAT 121 CTTGAGAAAC TAAAACTGCA GGAATGTGGT AAGCTTGAGG AGATTCCCCC TAGTTTTGGA 181 GATATTTATT CATTGAAATT TATGGAAATT GTAAATAGTC CTCAACTTGA AGATTCTGCT 241 CTCAAGATTA AGGAATACGT TGAAGAGATG AGAGGAGGGA GCGAGCTTCA GATCCTTGGC 301 CAGAAGAATA TCCCCTTATT TAAGTAGCAT TAGGGTTGAA AAGTAGATTG TACTTTGCAG 361 GGTACATTGT ATATGATTAA GAAAACTTTG TTGCAGTTAT GAAATATTTT TGTGGATTTC 421 TCAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAGGAA Predicted gene structure (within gDNA segment 12688 to 9897): Exon 1 12078 11744 ( 335 n); cDNA 1 335 ( 335 n); score: 0.943 PPA cDNA 423 454 MATCH C06HBa0112G05.1-9- SGN-E544869- 0.943 335 0.733 C PGS_C06HBa0112G05.1-9-_SGN-E544869- (12078 11744) Alignment (genomic DNA sequence = upper lines): ATCATCCAGG GAGAAGAATG GAACATGGGG GAGGAAGACA CCTTTGAGAA TCTCAAATTT 12019 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATCATCCAGG GAGAAGAATG GAACATGGGG GAGGAAGACA CCTTTGAGAA TCTCAAATTT 60 TTGAACTTGC GTCTACCGAC TCTTTCCAAG TGGGAGGTTG GAGAGGAATC CTTCCCCAAT 11959 ||||| | | || ||| ||| | ||| |||| |||||||||| ||||| |||| |||||||||| TTGAAGTGGG GTTTACGGAC TTTTTTCAAG TGGGAGGTTG GAGAGAAATC CTTCCCCAAT 120 CTTGAGAAAT TAAAACTGCG GGGATGTGGT GAGCTTGAGG AGATTCCACC TAGTTTTGGA 11899 ||||||||| ||||||||| || ||||||| ||||||||| ||||||| || |||||||||| CTTGAGAAAC TAAAACTGCA GGAATGTGGT AAGCTTGAGG AGATTCCCCC TAGTTTTGGA 180 GATATTTATT CATTGAAATT TATCAAAATT 
GTAAAGAGTC CTCAACTTGA AGATTCTGCT 11839 |||||||||| |||||||||| ||| ||||| ||||| |||| |||||||||| |||||||||| GATATTTATT CATTGAAATT TATGGAAATT GTAAATAGTC CTCAACTTGA AGATTCTGCT 240 CTCAAGATTA AGGAATACGC TGAAGAGATG AGAGGAGGGG GCGAGCTTCA GATCCTTGGC 11779 |||||||||| ||||||||| |||||||||| ||||||||| |||||||||| |||||||||| CTCAAGATTA AGGAATACGT TGAAGAGATG AGAGGAGGGA GCGAGCTTCA GATCCTTGGC 300 CAGAAGAATA TCCCCTTATT TAAGTAGCAT TATGG 11744 |||||||||| |||||||||| |||||||||| || || CAGAAGAATA TCCCCTTATT TAAGTAGCAT TAGGG 335 hqPGS_C06HBa0112G05.1-9-_SGN-E544869- (12078 11744) ******************************************************************************** EST sequence 15 +strand 457 n (File: SGN-E544870+) 1 ATCATCCAGG GAGAAGAATG GAACATGGGG GAGGAAGACA CCTTTGAGAA TCTCAAATTT 61 TTGAACTTGC GTCTACCGAC TCTTTCCAAG TGGGAGGTTG GAGAGGAATC CTTCCCCAAT 121 CTTGAGAAAC TAAAACTGCA GGAATGTGGT AAGCTTGAGG AGATTCCACC TAGTTTTGGA 181 GATATTTATT CATTGAAATT TATCGAAATT GTAAATAGTC CTCAACTTGA AGATTCTGCT 241 CTCAAGATTA AGGAATACGC TGAAGAGATG AGAGGAGGGA GCGAGCTTCA GATCCTTGGC 301 CAGAAGAATA TCCCCTTATT TAAGTAGCAT TATGGTTGAA AAGTAGATTG TACTTTGCAG 361 GGTACATTGT ATATGATTAA GAAAACTTTG TTGCAGTTAT GAAATATTTT TGTGGATTTC 421 TCANNNNANA AAAAAAAAAA AAAAAAAAAA AAAAAAA Predicted gene structure (within gDNA segment 12678 to 9914): Exon 1 12078 11744 ( 335 n); cDNA 1 335 ( 335 n); score: 0.979 PPA cDNA 428 457 MATCH C06HBa0112G05.1-9- SGN-E544870+ 0.979 335 0.733 C PGS_C06HBa0112G05.1-9-_SGN-E544870+ (12078 11744) Alignment (genomic DNA sequence = upper lines): ATCATCCAGG GAGAAGAATG GAACATGGGG GAGGAAGACA CCTTTGAGAA TCTCAAATTT 12019 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATCATCCAGG GAGAAGAATG GAACATGGGG GAGGAAGACA CCTTTGAGAA TCTCAAATTT 60 TTGAACTTGC GTCTACCGAC TCTTTCCAAG TGGGAGGTTG GAGAGGAATC CTTCCCCAAT 11959 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGAACTTGC GTCTACCGAC TCTTTCCAAG TGGGAGGTTG GAGAGGAATC CTTCCCCAAT 120 CTTGAGAAAT TAAAACTGCG GGGATGTGGT GAGCTTGAGG AGATTCCACC TAGTTTTGGA 
11899 ||||||||| ||||||||| || ||||||| ||||||||| |||||||||| |||||||||| CTTGAGAAAC TAAAACTGCA GGAATGTGGT AAGCTTGAGG AGATTCCACC TAGTTTTGGA 180 GATATTTATT CATTGAAATT TATCAAAATT GTAAAGAGTC CTCAACTTGA AGATTCTGCT 11839 |||||||||| |||||||||| |||| ||||| ||||| |||| |||||||||| |||||||||| GATATTTATT CATTGAAATT TATCGAAATT GTAAATAGTC CTCAACTTGA AGATTCTGCT 240 CTCAAGATTA AGGAATACGC TGAAGAGATG AGAGGAGGGG GCGAGCTTCA GATCCTTGGC 11779 |||||||||| |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| CTCAAGATTA AGGAATACGC TGAAGAGATG AGAGGAGGGA GCGAGCTTCA GATCCTTGGC 300 CAGAAGAATA TCCCCTTATT TAAGTAGCAT TATGG 11744 |||||||||| |||||||||| |||||||||| ||||| CAGAAGAATA TCCCCTTATT TAAGTAGCAT TATGG 335 hqPGS_C06HBa0112G05.1-9-_SGN-E544870+ (12078 11744) ******************************************************************************** EST sequence 12 +strand 453 n (File: SGN-E308524+) 1 TCATCCAGGG AGAAGAATGG AACATGGGGG AGGAAGACAC CTTTGAGAAT CTCAAATTTT 61 TGAACTTGCG TCTACCGACT CTTTCCAAGT GGGAGGTTGG AGAGGAATCC TTCCCCAATC 121 TTGAGAAACT AAAACTGCAG GAATGTGGTA AGCTTGAGGA GATTCCACCT AGTTTTGGAG 181 ATATTTATTC ATTGAAATTT ATCGAAATTG TAAATAGTCC TCAACTTGAA GATTCTGCTC 241 TCAAGATTAA GGAATACGCT GAAGAGATGA GAGGAGGGAG CGAGCTTCAG ATCCTTGGCC 301 AGAAGAATAT CCCCTTATTT AAGTAGCATT ATGGTTGAAA AGTAGATTGT ACTTTGCAGG 361 GTACATTGTA TATGATTAAG AAAACTTTGT TGCAGTTATG AAATATTTTT GTGGATTTCT 421 CNNNNNAAAA AAAAAAAAAA AAAAAAAATA AAA Predicted gene structure (within gDNA segment 12677 to 9944): Exon 1 12077 11744 ( 334 n); cDNA 1 334 ( 334 n); score: 0.979 PPA cDNA 427 453 MATCH C06HBa0112G05.1-9- SGN-E308524+ 0.979 334 0.737 C PGS_C06HBa0112G05.1-9-_SGN-E308524+ (12077 11744) Alignment (genomic DNA sequence = upper lines): TCATCCAGGG AGAAGAATGG AACATGGGGG AGGAAGACAC CTTTGAGAAT CTCAAATTTT 12018 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCATCCAGGG AGAAGAATGG AACATGGGGG AGGAAGACAC CTTTGAGAAT CTCAAATTTT 60 TGAACTTGCG TCTACCGACT CTTTCCAAGT GGGAGGTTGG AGAGGAATCC TTCCCCAATC 11958 |||||||||| |||||||||| 
|||||||||| |||||||||| |||||||||| |||||||||| TGAACTTGCG TCTACCGACT CTTTCCAAGT GGGAGGTTGG AGAGGAATCC TTCCCCAATC 120 TTGAGAAATT AAAACTGCGG GGATGTGGTG AGCTTGAGGA GATTCCACCT AGTTTTGGAG 11898 |||||||| | |||||||| | | ||||||| |||||||||| |||||||||| |||||||||| TTGAGAAACT AAAACTGCAG GAATGTGGTA AGCTTGAGGA GATTCCACCT AGTTTTGGAG 180 ATATTTATTC ATTGAAATTT ATCAAAATTG TAAAGAGTCC TCAACTTGAA GATTCTGCTC 11838 |||||||||| |||||||||| ||| |||||| |||| ||||| |||||||||| |||||||||| ATATTTATTC ATTGAAATTT ATCGAAATTG TAAATAGTCC TCAACTTGAA GATTCTGCTC 240 TCAAGATTAA GGAATACGCT GAAGAGATGA GAGGAGGGGG CGAGCTTCAG ATCCTTGGCC 11778 |||||||||| |||||||||| |||||||||| |||||||| | |||||||||| |||||||||| TCAAGATTAA GGAATACGCT GAAGAGATGA GAGGAGGGAG CGAGCTTCAG ATCCTTGGCC 300 AGAAGAATAT CCCCTTATTT AAGTAGCATT ATGG 11744 |||||||||| |||||||||| |||||||||| |||| AGAAGAATAT CCCCTTATTT AAGTAGCATT ATGG 334 hqPGS_C06HBa0112G05.1-9-_SGN-E308524+ (12077 11744) Total number of EST alignments reported: 16 ________________________________________________________________________________ Predicted gene locations (2) in segment 1 to 15823: PGL 1 (+ strand): 657 8062 AGS-1 (657 677,3119 3132,5049 5168,5461 5490) SCR (e 0.714 d 0.900 a 0.000,e 0.786 d 0.065 a 0.000,e 0.950 d 0.000 a 0.000,e 0.717) Exon 1 657 677 ( 21 n); score: 0.714 Intron 1 678 3118 (2441 n); Pd: 0.900 Pa: 0.000 Exon 2 3119 3132 ( 14 n); score: 0.786 Intron 2 3133 5048 (1916 n); Pd: 0.065 Pa: 0.000 Exon 3 5049 5168 ( 120 n); score: 0.950 Intron 3 5169 5460 ( 292 n); Pd: 0.000 Pa: 0.000 Exon 4 5461 5490 ( 30 n); score: 0.717 PGS (657 677,3119 3132,5049 5168,5461 5490) SGN-E544264- 3-phase translation of AGS-1 (+strand): . . . : . : . . 657 ATATATAGATGATAGTCAATT : AACAATTACTCATG : TATTGTTAGGAACGAAAATAAGCAG I Y R - - S I : N N Y S C : I V R N E N K Q Y I D D S Q L : T I T H : V L L G T K I S R I - M I V N : - Q L L M : Y C - E R K - A . . . . . . 
5074 GTGTAAACGCGGAAGCTAGCAAAGCAAACCTCGAAAGACCACGAGTAAGAAGACAACGAG V - T R K L A K Q T S K D H E - E D N E C K R G S - Q S K P R K T T S K K T T R G V N A E A S K A N L E R P R V R R Q R . . . . : . . 5134 AAATATATCAAAAGACACAAAGATTTAACGTGGTT : GCAAATAAAACCCAACACATATTGT K Y I K R H K D L T W L : Q I K P N T Y C N I S K D T K I - R G : C K - N P T H I V E I Y Q K T Q R F N V V : A N K T Q H I L . 5486 GTCAC V S C H Maximal non-overlapping open reading frames (>= 64 codons): none AGS-2 (2091 2122,5053 5477) SCR (e 0.781 d 0.000 a 0.000,e 0.754) Exon 1 2091 2122 ( 32 n); score: 0.781 Intron 1 2123 5052 (2930 n); Pd: 0.000 Pa: 0.000 Exon 2 5053 5477 ( 425 n); score: 0.754 PGS (2091 2122,5053 5146) SGN-E329886- PGS (5036 5273) SGN-E271645+ PGS (5045 5477) SGN-E211090- PGS (5058 5286) SGN-E389613+ PGS (5058 5277) SGN-E206285+ PGS (5058 5277) SGN-E206496+ PGS (5058 5277) SGN-E207598+ PGS (5058 5273) SGN-E370622+ PGS (5073 5477) SGN-E205772- PGS (5280 5477) SGN-E205773- PGS (5281 5443) SGN-E243194+ 3-phase translation of AGS-2 (+strand): . . . . : . . 2091 TTTGGTACAAAGAATAGAAGAATGTTACAACA : GTTAGGAACGAAAATAAGCAGGTGTAAA F G T K N R R M L Q Q : L G T K I S R C K L V Q R I E E C Y N : S - E R K - A G V N W Y K E - K N V T T : V R N E N K Q V - . . . . . . 5081 CGCGGAAGCTAGCAAAGCAAACCTCGAAAGACCACGAGTAAGAAGACAACGAGAAATATA R G S - Q S K P R K T T S K K T T R N I A E A S K A N L E R P R V R R Q R E I Y T R K L A K Q T S K D H E - E D N E K Y . . . . . . 5141 TCAAAAGACACAAAGATTTAACGTGGTTCGGTCAATCGACCTACGTCCACAAAGGAGATG S K D T K I - R G S V N R P T S T K E M Q K T Q R F N V V R S I D L R P Q R R - I K R H K D L T W F G Q S T Y V H K G D . . . . . . 5201 AGCAATCCACTATAAATGTGAGAGTACAAAATACAGAGGGAAACAACCTCAACCAATTCA S N P L - M - E Y K I Q R E T T S T N S A I H Y K C E S T K Y R G K Q P Q P I H E Q S T I N V R V Q N T E G N N L N Q F . . . . . . 
5261 CTCGGAATACATGAGAGGTTCACAAATTCTCGCTCTAACCAAAACTCTCAAAGCCCTTAA L G I H E R F T N S R S N Q N S Q S P - S E Y M R G S Q I L A L T K T L K A L K T R N T - E V H K F S L - P K L S K P L . . . . . . 5321 AACTACATTGTGAATGCTAATTAAGTTAGAAGGAACATGCCTTTATTTATAGAGTCCTAA N Y I V N A N - V R R N M P L F I E S - T T L - M L I K L E G T C L Y L - S P K K L H C E C - L S - K E H A F I Y R V L . . . . . . 5381 ACCTTTTCCTACCAAAAAAAAGAATAGTCAATTCAAAACCTTTTCCTAAAAGGAAAACCT T F S Y Q K K E - S I Q N L F L K G K P P F P T K K K N S Q F K T F S - K E N L N L F L P K K R I V N S K P F P K R K T . . . . 5441 ATTTATGGTAAGAAATCAGGGCAAATAAAACCCAACA I Y G K K S G Q I K P N F M V R N Q G K - N P T Y L W - E I R A N K T Q Maximal non-overlapping open reading frames (>= 64 codons): none AGS-3 (5061 5277,5382 5495,8038 8062) SCR (e 0.912 d 0.848 a 0.000,e 0.768 d 0.488 a 0.000,e 0.720) Exon 1 5061 5277 ( 217 n); score: 0.912 Intron 1 5278 5381 ( 104 n); Pd: 0.848 Pa: 0.000 Exon 2 5382 5495 ( 114 n); score: 0.768 Intron 2 5496 8037 (2542 n); Pd: 0.488 Pa: 0.000 Exon 3 8038 8062 ( 25 n); score: 0.720 PGS (5061 5277,5382 5495,8038 8062) SGN-E543331- 3-phase translation of AGS-3 (+strand): . . . . . . 5061 CGAAAATAAGCAGGTGTAAACGCGGAAGCTAGCAAAGCAAACCTCGAAAGACCACGAGTA R K - A G V N A E A S K A N L E R P R V E N K Q V - T R K L A K Q T S K D H E - K I S R C K R G S - Q S K P R K T T S . . . . . . 5121 AGAAGACAACGAGAAATATATCAAAAGACACAAAGATTTAACGTGGTTCGGTCAATCGAC R R Q R E I Y Q K T Q R F N V V R S I D E D N E K Y I K R H K D L T W F G Q S T K K T T R N I S K D T K I - R G S V N R . . . . . . 5181 CTACGTCCACAAAGGAGATGAGCAATCCACTATAAATGTGAGAGTACAAAATACAGAGGG L R P Q R R - A I H Y K C E S T K Y R G Y V H K G D E Q S T I N V R V Q N T E G P T S T K E M S N P L - M - E Y K I Q R . . . . : . . 5241 AAACAACCTCAACCAATTCACTCGGAATACATGAGAG : CCTTTTCCTACCAAAAAAAAGAA K Q P Q P I H S E Y M R : A F S Y Q K K E N N L N Q F T R N T - E : P F P T K K K N E T T S T N S L G I H E S : L F L P K K R . . . . . . 
5405 TAGTCAATTCAAAACCTTTTCCTAAAAGGAAAACCTATTTATGGTAAGAAATCAGGGCAA - S I Q N L F L K G K P I Y G K K S G Q S Q F K T F S - K E N L F M V R N Q G K I V N S K P F P K R K T Y L W - E I R A . . . . : . . 5465 ATAAAACCCAACACATATTGTGTCACTCTAG : CATACATTTCAAACAGAGAATGTTT I K P N T Y C V T L : A Y I S N R E C - N P T H I V S L - : H T F Q T E N V N K T Q H I L C H S S : I H F K Q R M F Maximal non-overlapping open reading frames (>= 64 codons): none PGL 2 (- strand): 12078 11744 AGS-1 (12078 11744) SCR (e 0.979) Exon 1 12078 11744 ( 335 n); score: 0.979 PGS (12078 11744) SGN-E544869- PGS (12078 11744) SGN-E544870+ PGS (12077 11744) SGN-E308524+ 3-phase translation of AGS-1 (-strand): . . . . . . 12078 ATCATCCAGGGAGAAGAATGGAACATGGGGGAGGAAGACACCTTTGAGAATCTCAAATTT I I Q G E E W N M G E E D T F E N L K F S S R E K N G T W G R K T P L R I S N F H P G R R M E H G G G R H L - E S Q I . . . . . . 12018 TTGAACTTGCGTCTACCGACTCTTTCCAAGTGGGAGGTTGGAGAGGAATCCTTCCCCAAT L N L R L P T L S K W E V G E E S F P N - T C V Y R L F P S G R L E R N P S P I F E L A S T D S F Q V G G W R G I L P Q . . . . . . 11958 CTTGAGAAATTAAAACTGCGGGGATGTGGTGAGCTTGAGGAGATTCCACCTAGTTTTGGA L E K L K L R G C G E L E E I P P S F G L R N - N C G D V V S L R R F H L V L E S - E I K T A G M W - A - G D S T - F W . . . . . . 11898 GATATTTATTCATTGAAATTTATCAAAATTGTAAAGAGTCCTCAACTTGAAGATTCTGCT D I Y S L K F I K I V K S P Q L E D S A I F I H - N L S K L - R V L N L K I L L R Y L F I E I Y Q N C K E S S T - R F C . . . . . . 11838 CTCAAGATTAAGGAATACGCTGAAGAGATGAGAGGAGGGGGCGAGCTTCAGATCCTTGGC L K I K E Y A E E M R G G G E L Q I L G S R L R N T L K R - E E G A S F R S L A S Q D - G I R - R D E R R G R A S D P W . . . . 
11778 CAGAAGAATATCCCCTTATTTAAGTAGCATTATGG Q K N I P L F K - H Y R R I S P Y L S S I M P E E Y P L I - V A L W Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0112G05.1-9-_PGL-2_AGS-1_PPS_1 (12078 11752) (frame '1'; 324 bp, 108 residues) 1 IIQGEEWNMG EEDTFENLKF LNLRLPTLSK WEVGEESFPN LEKLKLRGCG ELEEIPPSFG 61 DIYSLKFIKI VKSPQLEDSA LKIKEYAEEM RGGGELQILG QKNIPLFK- 3-phase translation of AGS-1 (+strand): . . . . . . 11744 CCATAATGCTACTTAAATAAGGGGATATTCTTCTGGCCAAGGATCTGAAGCTCGCCCCCT P - C Y L N K G I F F W P R I - S S P P H N A T - I R G Y S S G Q G S E A R P L I M L L K - G D I L L A K D L K L A P . . . . . . 11804 CCTCTCATCTCTTCAGCGTATTCCTTAATCTTGAGAGCAGAATCTTCAAGTTGAGGACTC P L I S S A Y S L I L R A E S S S - G L L S S L Q R I P - S - E Q N L Q V E D S S S H L F S V F L N L E S R I F K L R T . . . . . . 11864 TTTACAATTTTGATAAATTTCAATGAATAAATATCTCCAAAACTAGGTGGAATCTCCTCA F T I L I N F N E - I S P K L G G I S S L Q F - - I S M N K Y L Q N - V E S P Q L Y N F D K F Q - I N I S K T R W N L L . . . . . . 11924 AGCTCACCACATCCCCGCAGTTTTAATTTCTCAAGATTGGGGAAGGATTCCTCTCCAACC S S P H P R S F N F S R L G K D S S P T A H H I P A V L I S Q D W G R I P L Q P K L T T S P Q F - F L K I G E G F L S N . . . . . . 11984 TCCCACTTGGAAAGAGTCGGTAGACGCAAGTTCAAAAATTTGAGATTCTCAAAGGTGTCT S H L E R V G R R K F K N L R F S K V S P T W K E S V D A S S K I - D S Q R C L L P L G K S R - T Q V Q K F E I L K G V . . . . 12044 TCCTCCCCCATGTTCCATTCTTCTCCCTGGATGAT S S P M F H S S P W M P P P C S I L L P G - F L P H V P F F S L D D Maximal non-overlapping open reading frames (>= 64 codons): none ... finished at: Mon Aug 28 22:20:02 2006 ________________________________________________________________________________ Sequence 10: C06HBa0112G05.1-10, from 1 to 2955, both strands analyzed. ... started at: Mon Aug 28 22:20:02 2006 EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 6 ... 
matches indexed, elapsed seconds = 6 HitsTableSize = 2 EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 6 ... matches indexed, elapsed seconds = 6 HitsTableSize = 15 ******************************************************************************** EST sequence 9 +strand 688 n (File: SGN-E395007+) 1 TGCCTGCACA AAAAGTAGTA CACACATATG CTGGTATCAA TTTTCACGAA TTGCATATAT 61 TCTCTACAAC CCACTAATTT TTCCTAAATT ATACGTCCCC ATTATTTCTA TCTATATTGT 121 TTCAAATTTC TTAGGAGTAG TTGTATCGTG AATATACCAA CTAAACTTGC TACTAAAATC 181 AGCATACTAA TGATAAACAT GACCTAAATA TTCTGGAATC TTCTTTGTTA TGTGTCACAT 241 ACAAAAATGA TGTTTATGTT GGGTTTTAAC AAGTGTGAAT GGAAAAATAA AAAAGAGAAT 301 ATCAAAAGTG AGGGAACTAC TTTGGAGGGA AAATGAAAAG TCATTTGCAA AGTGCAAATG 361 AAAAGTCATT TCTTTGGATT TGGATTTGGA TTTGGCAAAT GATGTGATTG ATTGATATAT 421 TTTTTGGACA AAATTTATTC AATCAATTTT TGTTAAATCA AATAAATCCT GTTAATATTA 481 TTTCTTATAA ATTTGCGGGT AACAGTAACA TTCCGAAAAG TCGTTACTTT TCCGAAAAGT 541 CGTTACTTTC CAAAAAGTAG TTATTTTCCT AACAGACACA ATTTTTCAAA AAAGTTGTTA 601 TTTTTTCCAA AAGACACAAC TTTCTGGATA AAATGGGTCT GAACAAATTT CACTGAACGG 661 ACATGTTCCT TGCTGAAAAT GCTATAAA Predicted gene structure (within gDNA segment 2955 to 1): Exon 1 1425 1421 ( 5 n); cDNA 263 267 ( 5 n); score: 1.000 Intron 1 1420 1071 ( 350 n); Pd: 0.900 (s: 0), Pa: 0.000 (s: 0.72) Exon 2 1070 965 ( 106 n); cDNA 268 373 ( 106 n); score: 0.821 Intron 2 964 846 ( 119 n); Pd: 0.000 (s: 0.92), Pa: 0.000 (s: 0.92) Exon 3 845 731 ( 115 n); cDNA 374 490 ( 117 n); score: 0.887 Intron 3 730 697 ( 34 n); Pd: 0.000 (s: 0.85), Pa: 0.000 (s: 0.82) Exon 4 696 499 ( 198 n); cDNA 491 688 ( 198 n); score: 0.843 MATCH C06HBa0112G05.1-10- SGN-E395007+ 0.850 424 0.616 C PGS_C06HBa0112G05.1-10-_SGN-E395007+ (1425 1421,1070 965,845 731,696 499) Alignment (genomic DNA sequence = upper lines): GTTTTGTATT CTTCTCAAAA AACAATGTTG ATAAAGCTCA GATTCCTTCT TTTAGAGTCA 1366 ||||| GTTTT..... .......... .......... .......... .......... 
.......... 267 CAACAACTTA TATCACAATT TAGCACACAA AATAATAGGG CATCAAAATT ATGCATTTTG 1306 .......... .......... .......... .......... .......... .......... 267 AAACAATCTT CTTTATTAAA TTTATTGACT TAAAAATAAA TAGTAATAGA TGAGATAATC 1246 .......... .......... .......... .......... .......... .......... 267 AAAAAAACTT ATATCAACCC CCTTAAGTAC AAAAAGACTT GTACAATTAC CAAATGAAGC 1186 .......... .......... .......... .......... .......... .......... 267 TTGAGTAGTA CAGTTGATTT AGCTCATAAT CCATAAGCTT TGCACCATCA TCAAGAGAAG 1126 .......... .......... .......... .......... .......... .......... 267 AGTAGATTTG AAGAAGATGA ATGAGAACTT TGTTGGGTTT AGCAAGCGTG AACGGGAAAA 1066 | || .......... .......... .......... .......... .......... .....AACAA 272 GAAAAGAAAG AATATGAAAA GAGAGAATAT GAAAAGTGAG GGAACTATTT TGGAGGAAAA 1006 | | | || | |||| ||||||||| ||||||||| ||||||| || |||||| ||| GTGTGAATGG AAAAATAAAA AAGAGAATAT CAAAAGTGAG GGAACTACTT TGGAGGGAAA 332 ATGAAAAGTT AGTTGCAAAG TGCAAAGGAA AAGTCATTTC TCCCATATCA GCAAAAGATA 946 ||||||||| | |||||||| |||||| ||| |||||||||| | ATGAAAAGTC ATTTGCAAAG TGCAAATGAA AAGTCATTTC T......... .......... 373 TGGAAATTGG TGTCCTTAGA TAAGGAAACA CTTCCTTTAC TTCTTAAAGA GCTAAGAAGA 886 .......... .......... .......... .......... .......... .......... 373 AGGTACCCCC TCGCGCCGTC ACCGTCGCTC GACCTTGACC TCGTTTTTGG ATTTGGATTT 826 | | ||||| |||||||||| .......... .......... .......... .......... TTGGATTTGG ATTTGGATTT 393 GGCAAATGAT GTGATTGATT GATAAATTTT TAGGATAAAA TTTATTCAAT CAATTTTTG- 767 |||||||||| |||||||||| |||| ||||| | ||| |||| |||||||||| ||||||||| GGCAAATGAT GTGATTGATT GATATATTTT TTGGACAAAA TTTATTCAAT CAATTTTTGT 453 T-AATCAAAT AAATCCTATT AATATTATCT CTTATAAATT AATGTAGAAC GTTAGTTAAT 708 | |||||||| ||||||| || |||||||| | ||||||| TAAATCAAAT AAATCCTGTT AATATTATTT CTTATAA... .......... .......... 490 TAACAGAATT AATTTGTGGG TAACGGTAAC ATTTCAAAAA GTTTTTAATC TTTTCGAAAA 648 ||||| ||| |||| ||||| ||| | |||| || ||| | ||| |||||| .......... 
.ATTTGCGGG TAACAGTAAC ATTCCGAAAA GTCGTTACT- TTTCCGAAAA 538 GTCGTTACTT TCGGAAAAAC CGTTATTTTT CTTACAGATA C-A-TTTTCC AAAAAGTTGT 590 |||||||||| || |||| |||||||| || ||||| | | | ||||| |||||||||| GTCGTTACTT TCCAAAAAGT AGTTATTTTC CTAACAGACA CAATTTTTCA AAAAAGTTGT 598 TATTTTTTCC AAAAGACACA ACTTTCTGGA TAAAACGGGT TTGAACAAAT TTTTCTGAAC 530 |||||||||| |||||||||| |||||||||| ||||| |||| ||||||||| || |||||| TATTTTTTCC AAAAGACACA ACTTTCTGGA TAAAATGGGT CTGAACAAAT TTCACTGAAC 658 AGACACGTTT CTTGCTGAAA ATGGCTATAA A 499 |||| ||| |||||||||| || ||||||| | GGACATGTTC CTTGCTGAAA AT-GCTATAA A 688 hqPGS_C06HBa0112G05.1-10-_SGN-E395007+ (1070 965,845 731,696 499) ******************************************************************************** EST sequence 6 +strand 586 n (File: SGN-E250408+) 1 GTGCCTGCAC AAAAAGTAGT ACACACATAT GCTGGTATCA ATTTTCACGA ATTGCATATA 61 TTCTCTACAA CCCACTAATT TTTCCTAAAT TATACGTCCC CATTATTTCT ATCTATATTG 121 TTTCAAATTT CTTAAGAGTA GTTGTATCGT GAATATACCA ACTAAACTTG CTACTAAAAT 181 CAGCATACTA ATGATAAACA TGACCTAAAT ATTCTGGAAT CTTCTTTGTT ATGTGTCACA 241 TACAAAAATG ATGTTTATGT TGGGTTTTAA CAAGTGTGAA TGGAAAAATA AAAAAGAGAA 301 TATCAAAAGT GAGGGAACTA CTTTGGAGGG AAAATGAAAA GTCATTTGCA AAGTGCAAAT 361 GAAAAGTCAT TTCTTTGGAT TTGGATTTGG ATTTGGCAAA TGATGTGATT GATTGATATA 421 TTTTTTGGAC AAAATTTATT CAATCAATTT TTGTTAAATC AAATAAATCC TGTTAATATT 481 ATTTCTTATA AATTTGCGGG TAACAGTAAC ATTCCGAAAA GTCGTTACTT TTCCGAAAAG 541 TCGTTACTTT CCAAAAAGTA GTTATTTTCC TAACAGACAC AATTTT Predicted gene structure (within gDNA segment 2955 to 1): Exon 1 1425 1421 ( 5 n); cDNA 264 268 ( 5 n); score: 1.000 Intron 1 1420 1071 ( 350 n); Pd: 0.900 (s: 0), Pa: 0.000 (s: 0.72) Exon 2 1070 965 ( 106 n); cDNA 269 374 ( 106 n); score: 0.821 Intron 2 964 846 ( 119 n); Pd: 0.000 (s: 0.92), Pa: 0.000 (s: 0.92) Exon 3 845 731 ( 115 n); cDNA 375 491 ( 117 n); score: 0.887 Intron 3 730 697 ( 34 n); Pd: 0.000 (s: 0.85), Pa: 0.000 (s: 0.82) Exon 4 696 602 ( 95 n); cDNA 492 585 ( 94 n); score: 0.811 MATCH C06HBa0112G05.1-10- SGN-E250408+ 0.842 
321 0.548 C PGS_C06HBa0112G05.1-10-_SGN-E250408+ (1425 1421,1070 965,845 731,696 602) Alignment (genomic DNA sequence = upper lines): GTTTTGTATT CTTCTCAAAA AACAATGTTG ATAAAGCTCA GATTCCTTCT TTTAGAGTCA 1366 ||||| GTTTT..... .......... .......... .......... .......... .......... 268 CAACAACTTA TATCACAATT TAGCACACAA AATAATAGGG CATCAAAATT ATGCATTTTG 1306 .......... .......... .......... .......... .......... .......... 268 AAACAATCTT CTTTATTAAA TTTATTGACT TAAAAATAAA TAGTAATAGA TGAGATAATC 1246 .......... .......... .......... .......... .......... .......... 268 AAAAAAACTT ATATCAACCC CCTTAAGTAC AAAAAGACTT GTACAATTAC CAAATGAAGC 1186 .......... .......... .......... .......... .......... .......... 268 TTGAGTAGTA CAGTTGATTT AGCTCATAAT CCATAAGCTT TGCACCATCA TCAAGAGAAG 1126 .......... .......... .......... .......... .......... .......... 268 AGTAGATTTG AAGAAGATGA ATGAGAACTT TGTTGGGTTT AGCAAGCGTG AACGGGAAAA 1066 | || .......... .......... .......... .......... .......... .....AACAA 273 GAAAAGAAAG AATATGAAAA GAGAGAATAT GAAAAGTGAG GGAACTATTT TGGAGGAAAA 1006 | | | || | |||| ||||||||| ||||||||| ||||||| || |||||| ||| GTGTGAATGG AAAAATAAAA AAGAGAATAT CAAAAGTGAG GGAACTACTT TGGAGGGAAA 333 ATGAAAAGTT AGTTGCAAAG TGCAAAGGAA AAGTCATTTC TCCCATATCA GCAAAAGATA 946 ||||||||| | |||||||| |||||| ||| |||||||||| | ATGAAAAGTC ATTTGCAAAG TGCAAATGAA AAGTCATTTC T......... .......... 374 TGGAAATTGG TGTCCTTAGA TAAGGAAACA CTTCCTTTAC TTCTTAAAGA GCTAAGAAGA 886 .......... .......... .......... .......... .......... .......... 374 AGGTACCCCC TCGCGCCGTC ACCGTCGCTC GACCTTGACC TCGTTTTTGG ATTTGGATTT 826 | | ||||| |||||||||| .......... .......... .......... .......... 
TTGGATTTGG ATTTGGATTT 394 GGCAAATGAT GTGATTGATT GATAAATTTT TAGGATAAAA TTTATTCAAT CAATTTTTG- 767 |||||||||| |||||||||| |||| ||||| | ||| |||| |||||||||| ||||||||| GGCAAATGAT GTGATTGATT GATATATTTT TTGGACAAAA TTTATTCAAT CAATTTTTGT 454 T-AATCAAAT AAATCCTATT AATATTATCT CTTATAAATT AATGTAGAAC GTTAGTTAAT 708 | |||||||| ||||||| || |||||||| | ||||||| TAAATCAAAT AAATCCTGTT AATATTATTT CTTATAA... .......... .......... 491 TAACAGAATT AATTTGTGGG TAACGGTAAC ATTTCAAAAA GTTTTTAATC TTTTCGAAAA 648 ||||| ||| |||| ||||| ||| | |||| || ||| | ||| |||||| .......... .ATTTGCGGG TAACAGTAAC ATTCCGAAAA GTCGTTACT- TTTCCGAAAA 539 GTCGTTACTT TCGGAAAAAC CGTTATTTTT CTTACAGATA CATTTT 602 |||||||||| || |||| |||||||| || ||||| | || ||| GTCGTTACTT TCCAAAAAGT AGTTATTTTC CTAACAGACA CAATTT 585 hqPGS_C06HBa0112G05.1-10-_SGN-E250408+ (1070 965,845 731,696 602) ******************************************************************************** EST sequence 12 +strand 658 n (File: SGN-E542859+) 1 GAGAGAACTG TCTCGAGTTT TTTTTTTTTT TTTTTTGGAG TAATTGTTAG TAAAAAAAAT 61 ATTTTTGAGC TAAAAACGTA ACGAGTTAAA AGATTGATCT ATATCGACTA GTTCAAACAT 121 ATTTTCGGCA TTGTTCTTTA TATATCAAAT ATCATTAATC ACTTGTTCAA GATAAACTCC 181 CAAATCTTAA CCACATTTTG GACCAAATTA TAATAATACC TTGTTCTTGT TGGGGTTTAG 241 CAAGTGTGAA TGGGAAAAAA AAAAGAGAAT ATGAAAGGTG AGGGAACTAC TCTGGAGGGA 301 AACTGAAAAG TCATTTGCAA AGTGCAAATG AAAATTCATT TCTCCCATAT CGGCAAAAGA 361 AAGGGAAATT GTTGTCGTTA TATAAGGAAA CACTTCCATT ACTTCTTAAA GAGCTAAGAA 421 GAAGATGCCC CCTCGCGCCG TCATCATCGC TCGGCTTTGG ATTTGGATTT GGCAAATGAT 481 GTGATTGGAT TGATAAATTT TTTGGACAAA ATTTATTTAA TCACTTTTTG TTAAATCAAA 541 TAAATCCTGT TAATATTTAT CTCTAATAAA TTTGCGGTTA ACGGTAACAT TTTGAAAAGT 601 TGTTACTCTT TCGGAAAAGT CGTTACTTTC CAAAAAGTCG TTATTTTCCT AACAGACA Predicted gene structure (within gDNA segment 2955 to 1): Exon 1 1060 736 ( 325 n); cDNA 248 564 ( 317 n); score: 0.834 Intron 1 735 702 ( 34 n); Pd: 0.000 (s: 0.78), Pa: 0.000 (s: 0.82) Exon 2 701 610 ( 92 n); cDNA 565 656 ( 92 n); score: 0.837 PPA cDNA 36 17 MATCH 
C06HBa0112G05.1-10- SGN-E542859+ 0.835 417 0.634 C PGS_C06HBa0112G05.1-10-_SGN-E542859+ (1060 736,701 610) Alignment (genomic DNA sequence = upper lines): GAAAGAATAT GAAAAGAGAG AATATGAAAA GTGAGGGAAC TATTTTGGAG GAAAAATGAA 1001 ||| | | |||| |||| ||||||||| |||||||||| || | ||||| | ||| |||| GAATGGGAAA AAAAAAAGAG AATATGAAAG GTGAGGGAAC TACTCTGGAG GGAAACTGAA 307 AAGTTAGTTG CAAAGTGCAA AGGAAAAGTC ATTTCTCCCA TATCAGCAAA AGATATGGAA 941 |||| | ||| |||||||||| | ||||| || |||||||||| |||| ||||| ||| | |||| AAGTCATTTG CAAAGTGCAA ATGAAAATTC ATTTCTCCCA TATCGGCAAA AGAAAGGGAA 367 ATTGGTGTCC TTAGATAAGG AAACACTTCC TTTACTTCTT AAAGAGCTAA GAAGAAGGTA 881 |||| |||| ||| |||||| |||||||||| ||||||||| |||||||||| ||||||| | ATTGTTGTCG TTATATAAGG AAACACTTCC ATTACTTCTT AAAGAGCTAA GAAGAAGATG 427 CCCCCTCGCG CCGTCACCGT CGCTCGACCT TGACCTCGTT TTTGGATTTG GATTTGGCAA 821 |||||||||| |||||| | | |||||| | | |||||||||| |||||||||| CCCCCTCGCG CCGTCATCAT CGCTCG---- -G----C--- TTTGGATTTG GATTTGGCAA 475 ATGATGTGAT T-GATTGATA AATTTTTAGG ATAAAATTTA TTCAATCAAT TTTTG-T-AA 764 |||||||||| | |||||||| ||||||| || | |||||||| || ||||| | ||||| | || ATGATGTGAT TGGATTGATA AATTTTTTGG ACAAAATTTA TTTAATCACT TTTTGTTAAA 535 TCAAATAAAT CCTATTAATA -TTATCTCTT ATAAATTAAT GTAGAACGTT AGTTAATTAA 705 |||||||||| ||| |||||| |||||||| TCAAATAAAT CCTGTTAATA TTTATCTCT. .......... .......... .......... 
564 CAGAATTAAT TTGTGGGTAA CGGTAACATT TCAAAAAGTT TTTAATCTTT TCGAAAAGTC 645 ||| ||| ||| || ||| |||||||||| | ||||||| ||| ||||| |||||||| ...AATAAAT TTGCGGTTAA CGGTAACATT TTGAAAAGTT GTTACTCTTT CGGAAAAGTC 621 GTTACTTTCG GAAAAACCGT TATTTTTCTT ACAGA 610 ||||||||| |||| ||| |||||| || ||||| GTTACTTTCC AAAAAGTCGT TATTTTCCTA ACAGA 656 hqPGS_C06HBa0112G05.1-10-_SGN-E542859+ (1060 736,701 610) ******************************************************************************** EST sequence 17 +strand 598 n (File: SGN-E301820+) 1 AGAGAACTGT CTCGAGTTTT TTTTTTTTTT TTTTTGGAGT AATGGTTAGG GGGAAAAAAT 61 ATTTTTGAGC TAAAAACGTC ACGAGTTAAA AGATTGATCT ATATCGACTA GTTCAAACAT 121 ATTTTCGGCA TTGTTCTTTA TATATCAAAT ATCATTAATC ACTGGGACAA GATAAACTCC 181 CAAATCTTAA CCACATTTTG GACCAAATTA TAATAATACC TTGTTCTTGT TGGGGTTTAG 241 CAAGTGTGAA TGGGAAAAAA AAAAGAGAAT ATGAAAGGTG AGGGAACTAC TCTGGAGGGA 301 AACTGAAAAG TCATTTGCAA AGGGCAAATG AAAATTCATT TCTCCCATAT CGGCAAAAGA 361 AAGGGAAATT GTTGTCGTTA TATAAGGAAA CACTTCCATT ACTTCTTAAA GAGCTAAGAA 421 GAAGATGCCC CCTCGCGCCG TCATCATCGC TCGGCTTTGG ATTTGGATTT GGCAAATGAT 481 GTGATTGGAT TGATAAATTT TTTGGACAAA ATTTATTTAA TCACTTTTTG TTAAATCAAA 541 TAAATCCTGT TAATATTTAT CTCTAATAAA TTTGCGGTTA ACGGTAACAT TTTGAAAA Predicted gene structure (within gDNA segment 2955 to 1): Exon 1 1060 736 ( 325 n); cDNA 248 564 ( 317 n); score: 0.831 Intron 1 735 702 ( 34 n); Pd: 0.000 (s: 0.78), Pa: 0.000 (s: 0) Exon 2 701 668 ( 34 n); cDNA 565 598 ( 34 n); score: 0.853 PPA cDNA 35 16 MATCH C06HBa0112G05.1-10- SGN-E301820+ 0.831 359 0.600 C PGS_C06HBa0112G05.1-10-_SGN-E301820+ (1060 736,701 668) Alignment (genomic DNA sequence = upper lines): GAAAGAATAT GAAAAGAGAG AATATGAAAA GTGAGGGAAC TATTTTGGAG GAAAAATGAA 1001 ||| | | |||| |||| ||||||||| |||||||||| || | ||||| | ||| |||| GAATGGGAAA AAAAAAAGAG AATATGAAAG GTGAGGGAAC TACTCTGGAG GGAAACTGAA 307 AAGTTAGTTG CAAAGTGCAA AGGAAAAGTC ATTTCTCCCA TATCAGCAAA AGATATGGAA 941 |||| | ||| ||||| |||| | ||||| || |||||||||| |||| ||||| ||| | |||| AAGTCATTTG CAAAGGGCAA ATGAAAATTC 
ATTTCTCCCA TATCGGCAAA AGAAAGGGAA 367 ATTGGTGTCC TTAGATAAGG AAACACTTCC TTTACTTCTT AAAGAGCTAA GAAGAAGGTA 881 |||| |||| ||| |||||| |||||||||| ||||||||| |||||||||| ||||||| | ATTGTTGTCG TTATATAAGG AAACACTTCC ATTACTTCTT AAAGAGCTAA GAAGAAGATG 427 CCCCCTCGCG CCGTCACCGT CGCTCGACCT TGACCTCGTT TTTGGATTTG GATTTGGCAA 821 |||||||||| |||||| | | |||||| | | |||||||||| |||||||||| CCCCCTCGCG CCGTCATCAT CGCTCG---- -G----C--- TTTGGATTTG GATTTGGCAA 475 ATGATGTGAT T-GATTGATA AATTTTTAGG ATAAAATTTA TTCAATCAAT TTTTG-T-AA 764 |||||||||| | |||||||| ||||||| || | |||||||| || ||||| | ||||| | || ATGATGTGAT TGGATTGATA AATTTTTTGG ACAAAATTTA TTTAATCACT TTTTGTTAAA 535 TCAAATAAAT CCTATTAATA -TTATCTCTT ATAAATTAAT GTAGAACGTT AGTTAATTAA 705 |||||||||| ||| |||||| |||||||| TCAAATAAAT CCTGTTAATA TTTATCTCT. .......... .......... .......... 564 CAGAATTAAT TTGTGGGTAA CGGTAACATT TCAAAAA 668 ||| ||| ||| || ||| |||||||||| | |||| ...AATAAAT TTGCGGTTAA CGGTAACATT TTGAAAA 598 hqPGS_C06HBa0112G05.1-10-_SGN-E301820+ (1060 736,701 668) ******************************************************************************** EST sequence 3 +strand 606 n (File: SGN-E262710+) 1 GGACATGTTC CTTGCTGAAA ATGGCTATAA AAGGAAGTCA AATTTTGATT TTTTAAACAC 61 TGAAAATTTT TCTTTCTCTG CATTTATTTT TCTCTCAAAT AAAATCAAAG TGTCGATCGA 121 CTGAGTCTGT GTGACTTGTT GTTGTTCTTA AGTTCGTTGA AGTTAAAGAA GTTTGAGGTA 181 CCGCTATTTC TTTAACAGGT TTAATCCGTT TTATCTTGAG AGAAATTAAT CCATAACCTT 241 GGGTACAGTG AGGGAATTAA ATTTCTTAAG GACACATAGT AGTTTCTGTG GACTCGGATT 301 AATTCTTGTA TTCTATATTA TTTTCTGCTT CATCTTATTT CTGTTTCTGT TTATTAACTT 361 TATAAATAAA AGTTATTATA AGAGTAACAA TCTTAAGAAA ATTTATACAG TTTCTGTGTT 421 TGAGGTTTCT GGAGATTAAA ACCTTTATGG TTTTCTACTC TACTTGAATT TTTAAAATCA 481 TTCGATTAAC GATTAAAAAA ACATAAAAAC TTTATCGTTA AATCAGAAAC AGGTTGTGTA 541 AAGATTTGCT CTTTACTGTT AGTATTTTAA ATACTTAATT TATCTGCCAA TTGTGACAGA 601 AAAAAA Predicted gene structure (within gDNA segment 1219 to 1): Exon 1 528 309 ( 220 n); cDNA 2 230 ( 229 n); score: 0.805 MATCH C06HBa0112G05.1-10- SGN-E262710+ 0.805 
220 0.363 C PGS_C06HBa0112G05.1-10-_SGN-E262710+ (528 309) Alignment (genomic DNA sequence = upper lines): GACACGTTTC TTGCTGAAAA TGGCTATAAA AGGAAGTCAT TTTTTATTTT TTCAAACACT 469 |||| ||| | |||||||||| |||||||||| ||||||||| |||| ||| || ||||||| GACATGTTCC TTGCTGAAAA TGGCTATAAA AGGAAGTCAA ATTTTGATTT TTTAAACACT 61 G-AAA-TTTT CCTTCTCTGC ATATATTTTT CTCTC----A AAATCAAAGT GTCGATCGAC 415 | ||| |||| | |||||||| || ||||||| ||||| | |||||||||| |||||||||| GAAAATTTTT CTTTCTCTGC ATTTATTTTT CTCTCAAATA AAATCAAAGT GTCGATCGAC 121 TAAGTCTGTG TGAC---TTG TTATTCATAA GTTTGCTGAA GTTAAAGAAG CTTGAGGTAC 358 | |||||||| |||| ||| || ||| ||| ||| | |||| |||||||||| ||||||||| TGAGTCTGTG TGACTTGTTG TTGTTCTTAA GTTCGTTGAA GTTAAAGAAG TTTGAGGTAC 181 CACTATTTCT TTAACAGGAT TAATCCGTTT TATCTTG-GA GAAAATTAAT 309 | |||||||| |||||||| | |||||||||| ||||||| || | |||||||| CGCTATTTCT TTAACAGGTT TAATCCGTTT TATCTTGAGA G-AAATTAAT 230 hqPGS_C06HBa0112G05.1-10-_SGN-E262710+ (528 309) ******************************************************************************** EST sequence 7 +strand 514 n (File: SGN-E255327+) 1 GGACATGTTC CTTGCTGAAA ATGGCTATAA AAGGAAGTCA AATTTTGATT TTTGAAACAC 61 TGAAAATTTT TCTTTCTCTG CATTTATTTT TCTCTCAAAT AAAATCAAAG TGTCGATCGA 121 CTGAGTCTGT GTGACTTGTT GTTGTTCTTA AGTTCGTTGA AGTTAAAGAA GTTTGAGGTA 181 CCGCTATTTC TTTAACAGGT TTAATCCGTT TTATCTTGAG AGAAATTAAT CCATAACCTT 241 GGGTACAGTG AGGGAATTAA ATTTCTTAAG GACACATAGT AGTTTCTGTG GACTCGGATT 301 AATTCTTGTA TTCTATATTA TTTTCTGCTT CATCTTATTT CTGTTTCTGT TTATTAACTT 361 TATAAATAAA AGTTATTATA AGAGTAACAA TCTTAAGAAA ATTTATACAG TTTCTGTGTT 421 TGAGGTTTCT GGAGATTAAA ACCTTTATGG TTTTCTACTC TACTTGAATT TTTAAAATCA 481 TTCGATTAAC GATTAAAAAA ACATAAAAAC TTTA Predicted gene structure (within gDNA segment 1219 to 1): Exon 1 528 309 ( 220 n); cDNA 2 230 ( 229 n); score: 0.805 MATCH C06HBa0112G05.1-10- SGN-E255327+ 0.805 220 0.428 C PGS_C06HBa0112G05.1-10-_SGN-E255327+ (528 309) Alignment (genomic DNA sequence = upper lines): GACACGTTTC TTGCTGAAAA TGGCTATAAA AGGAAGTCAT TTTTTATTTT 
TTCAAACACT 469 |||| ||| | |||||||||| |||||||||| ||||||||| |||| ||| || ||||||| GACATGTTCC TTGCTGAAAA TGGCTATAAA AGGAAGTCAA ATTTTGATTT TTGAAACACT 61 G-AAA-TTTT CCTTCTCTGC ATATATTTTT CTCTC----A AAATCAAAGT GTCGATCGAC 415 | ||| |||| | |||||||| || ||||||| ||||| | |||||||||| |||||||||| GAAAATTTTT CTTTCTCTGC ATTTATTTTT CTCTCAAATA AAATCAAAGT GTCGATCGAC 121 TAAGTCTGTG TGAC---TTG TTATTCATAA GTTTGCTGAA GTTAAAGAAG CTTGAGGTAC 358 | |||||||| |||| ||| || ||| ||| ||| | |||| |||||||||| ||||||||| TGAGTCTGTG TGACTTGTTG TTGTTCTTAA GTTCGTTGAA GTTAAAGAAG TTTGAGGTAC 181 CACTATTTCT TTAACAGGAT TAATCCGTTT TATCTTG-GA GAAAATTAAT 309 | |||||||| |||||||| | |||||||||| ||||||| || | |||||||| CGCTATTTCT TTAACAGGTT TAATCCGTTT TATCTTGAGA G-AAATTAAT 230 hqPGS_C06HBa0112G05.1-10-_SGN-E255327+ (528 309) ******************************************************************************** EST sequence 15 +strand 577 n (File: SGN-E369760+) 1 GGACATGTTC CTTGCTGAAA ATGGCTATAA AAGGAAGTCA AATTTTGATT TTTTAAACAC 61 TGAAAATTTT TCTTTCTCTG CATTTATTTT TCTCTCAAAT AAAATCAAAG TGTCGATCGA 121 CTGAGTCTGT GTGACTTGTT GTTGTTCTTA AGTTCGTTGA AGTTAAAGAA GTTTGAGGTA 181 CCGCTATTTC TTTAACAGGT TTAATCCGTT TTATCTTGAG AGAAATTAAT CCATAACCTT 241 GGGTACAGTG AGGGAATTAA ATTTCTTAAG GACACATAGT AGTTTCTGTG GACTCGGATT 301 AATTCTTGTA TTCTATATTA TTTTCTGCTT CATCTTATTT CTGTTTCTGT TTATTAACTT 361 TATAAATAAA AGTTATTATA AGAGTAACAA TCTTAAGAAA ATTTATACAG TTTCTGTGTT 421 TGAGGTTTCT GGAGATTAAA ACCTTTATGG TTTTCTACTC TACTTGAATT TTTAAAATCA 481 TTCGATTAAC GATTAAAAAA ACATAAAAAC TTTATCGTTA AATCAGAAAC AGGTTGTGTA 541 AAGATTTGCT CTTTACTGTT AGTATTTTAA ATACTTA Predicted gene structure (within gDNA segment 1219 to 1): Exon 1 528 309 ( 220 n); cDNA 2 230 ( 229 n); score: 0.805 MATCH C06HBa0112G05.1-10- SGN-E369760+ 0.805 220 0.381 C PGS_C06HBa0112G05.1-10-_SGN-E369760+ (528 309) Alignment (genomic DNA sequence = upper lines): GACACGTTTC TTGCTGAAAA TGGCTATAAA AGGAAGTCAT TTTTTATTTT TTCAAACACT 469 |||| ||| | |||||||||| |||||||||| ||||||||| |||| ||| || ||||||| GACATGTTCC 
TTGCTGAAAA TGGCTATAAA AGGAAGTCAA ATTTTGATTT TTTAAACACT 61 G-AAA-TTTT CCTTCTCTGC ATATATTTTT CTCTC----A AAATCAAAGT GTCGATCGAC 415 | ||| |||| | |||||||| || ||||||| ||||| | |||||||||| |||||||||| GAAAATTTTT CTTTCTCTGC ATTTATTTTT CTCTCAAATA AAATCAAAGT GTCGATCGAC 121 TAAGTCTGTG TGAC---TTG TTATTCATAA GTTTGCTGAA GTTAAAGAAG CTTGAGGTAC 358 | |||||||| |||| ||| || ||| ||| ||| | |||| |||||||||| ||||||||| TGAGTCTGTG TGACTTGTTG TTGTTCTTAA GTTCGTTGAA GTTAAAGAAG TTTGAGGTAC 181 CACTATTTCT TTAACAGGAT TAATCCGTTT TATCTTG-GA GAAAATTAAT 309 | |||||||| |||||||| | |||||||||| ||||||| || | |||||||| CGCTATTTCT TTAACAGGTT TAATCCGTTT TATCTTGAGA G-AAATTAAT 230 hqPGS_C06HBa0112G05.1-10-_SGN-E369760+ (528 309) ******************************************************************************** EST sequence 4 +strand 397 n (File: SGN-E262800+) 1 CTTGCTGAAA ATGGCTATAA AAGGAAGTCA AATTTTGATT TTTTAAACAC TGAAAATTTT 61 TCTTTCTCTG CATTTATTTT TCTCTCAAAT AAAATCAAAG TGTCGATCGA CTGAGTCTGT 121 GTGACTTGTT GTTGTTCTTA AGTTCGTTGA AGTTAAAGAA GTTTGAGGTA CCGCTATTTC 181 TTTAACAGGT TTAATCCGTT TTATCTTGAG AGAAATTAAT CCATAACCTT GGGTACAGTG 241 AGGGAATTAA ATTTCTTAAG GACACATAGT AGTTTCTGTG GACTCGGATT AATTCTTGTA 301 TTCTATATTA TTTTCTGCTT CATCTTATTT CTGTTTCTGT TTATTAACTT TATAAATAAA 361 AGTTATTATA AGAGTAACAA TCTTAAGAAA ATTTATA Predicted gene structure (within gDNA segment 1119 to 1): Exon 1 519 309 ( 211 n); cDNA 1 220 ( 220 n); score: 0.806 MATCH C06HBa0112G05.1-10- SGN-E262800+ 0.806 211 0.531 C PGS_C06HBa0112G05.1-10-_SGN-E262800+ (519 309) Alignment (genomic DNA sequence = upper lines): CTTGCTGAAA ATGGCTATAA AAGGAAGTCA TTTTTTATTT TTTCAAACAC TG-AAA-TTT 462 |||||||||| |||||||||| |||||||||| |||| || ||| |||||| || ||| ||| CTTGCTGAAA ATGGCTATAA AAGGAAGTCA AATTTTGATT TTTTAAACAC TGAAAATTTT 60 TCCTTCTCTG CATATATTTT TCTCTC---- AAAATCAAAG TGTCGATCGA CTAAGTCTGT 406 || ||||||| ||| |||||| |||||| |||||||||| |||||||||| || ||||||| TCTTTCTCTG CATTTATTTT TCTCTCAAAT AAAATCAAAG TGTCGATCGA CTGAGTCTGT 120 GTGACTTG-T -TA-TTCATA AGTTTGCTGA AGTTAAAGAA 
GCTTGAGGTA CCACTATTTC 349 |||||||| | | ||| || |||| | ||| |||||||||| | |||||||| || ||||||| GTGACTTGTT GTTGTTCTTA AGTTCGTTGA AGTTAAAGAA GTTTGAGGTA CCGCTATTTC 180 TTTAACAGGA TTAATCCGTT TTATCTTG-G AGAAAATTAA T 309 ||||||||| |||||||||| |||||||| | || ||||||| | TTTAACAGGT TTAATCCGTT TTATCTTGAG AG-AAATTAA T 220 hqPGS_C06HBa0112G05.1-10-_SGN-E262800+ (519 309) ******************************************************************************** EST sequence 13 +strand 653 n (File: SGN-E273518+) 1 GGAAGTCAAA TTTTGATTTT TTAAACACTG AAAATTTTTC TTTCTCTGCA TTTATTTTTC 61 TCTCAAATAA AATCAAAGTG TCGATCGACT GAGTCTGTGT GACTTGTTGT TGTTCTTAAG 121 TTCGTTGAAG TTAAAGAAGT TTGAGGTACC GCTATTTCTT TAACAGGTTT AATCCGTTTT 181 ATCTTGAGAG AAATTAATCC ATAACCTTGG GTACAGTGAG GGAATTAAAT TTCTTAAGGA 241 CACATAGTAG TTTCTGTGGA CTCGGATTAA TTCTTGTATT CTATATTATT TTCTGCTTCA 301 TCTTATTTCT GTTTCTGTTT ATTAACTTTA TAAATAAAAG TTATTATAAG AGTAACAATC 361 TTAAGAAAAT TTATACAGTT TCTGTGTTTG AGGTTTCTGG AGATTAAAAC CTTTATGGTT 421 TTCTACTCTA CTTGAATTTT TAAAATCATT CGATTAACGA TTAAAAAAAC ATAAAAACTT 481 TATCGTTAAA TCAGAAACAG GTTGTGTAAA GATTTGCTCT TTACTGTTAG TATTTTAAAT 541 ACTTAATTTA TCTGCCAATT GTGACAGAAA AAAAAGACTA ATTCAAGTCA AACAAATGCT 601 GGAACAGTAA GTGCTGCAAC AACAATGGTT GCACATAATC GTTCACATGC TGC Predicted gene structure (within gDNA segment 1715 to 1): Exon 1 497 309 ( 189 n); cDNA 1 198 ( 198 n); score: 0.783 MATCH C06HBa0112G05.1-10- SGN-E273518+ 0.783 189 0.289 C PGS_C06HBa0112G05.1-10-_SGN-E273518+ (497 309) Alignment (genomic DNA sequence = upper lines): GGAAGTCATT TTTTATTTTT TCAAACACTG AAA-T-TTTC CTTCTCTGCA TATATTTTTC 440 |||||||| |||| |||| | |||||||| ||| | |||| ||||||||| | |||||||| GGAAGTCAAA TTTTGATTTT TTAAACACTG AAAATTTTTC TTTCTCTGCA TTTATTTTTC 60 TCTC--A-A- AATCAAAGTG TCGATCGACT AAGTCTGTGT GAC---TTGT TATTCATAAG 387 |||| | | |||||||||| |||||||||| ||||||||| ||| |||| | ||| |||| TCTCAAATAA AATCAAAGTG TCGATCGACT GAGTCTGTGT GACTTGTTGT TGTTCTTAAG 120 TTTGCTGAAG TTAAAGAAGC TTGAGGTACC ACTATTTCTT TAACAGGATT AATCCGTTTT 327 || | ||||| 
||||||||| |||||||||| ||||||||| ||||||| || |||||||||| TTCGTTGAAG TTAAAGAAGT TTGAGGTACC GCTATTTCTT TAACAGGTTT AATCCGTTTT 180 ATCTTG-GAG AAAATTAAT 309 |||||| ||| |||||||| ATCTTGAGAG -AAATTAAT 198 hqPGS_C06HBa0112G05.1-10-_SGN-E273518+ (497 309) ******************************************************************************** EST sequence 14 +strand 329 n (File: SGN-E258205+) 1 AATTTTTCTT TCTCTGCATT TATTTTTCTC TCAAATAAAA TCAAAGTGTC GATCGACTGA 61 GTCTGTGTGA CTTGTTGTTG TTCTTAAGTT CGTTGAAGTT AAAGAAGTTT GAGGTACCGC 121 TATTTCTTTA ACAGGTTTAA TCCGTTTTAT CTTGAGAGAA ATTAATCCAT AACCTTGGGT 181 ACAGTGAGGG AATTAAATTT CTTAAGGACA CATAGTAGTT TCTGTGGACT CGGATTAATT 241 CTTGTATTCT ATATTATTTT CTGCTTCATC TTATTTCTGG TTCTGTTTAT TAACTTTATA 301 AATAAAAGGT ATTATAAGAG TAACAATCT Predicted gene structure (within gDNA segment 1395 to 1): Exon 1 467 309 ( 159 n); cDNA 1 166 ( 166 n); score: 0.799 MATCH C06HBa0112G05.1-10- SGN-E258205+ 0.799 159 0.483 C PGS_C06HBa0112G05.1-10-_SGN-E258205+ (467 309) Alignment (genomic DNA sequence = upper lines): AAATTTTCCT TCTCTGCATA TATTTTTCTC TC--A-A-AA TCAAAGTGTC GATCGACTAA 412 || ||||| | ||||||||| |||||||||| || | | || |||||||||| |||||||| | AATTTTTCTT TCTCTGCATT TATTTTTCTC TCAAATAAAA TCAAAGTGTC GATCGACTGA 60 GTCTGTGTGA C---TTGTTA TTCATAAGTT TGCTGAAGTT AAAGAAGCTT GAGGTACCAC 355 |||||||||| | ||||| ||| |||||| | ||||||| ||||||| || |||||||| | GTCTGTGTGA CTTGTTGTTG TTCTTAAGTT CGTTGAAGTT AAAGAAGTTT GAGGTACCGC 120 TATTTCTTTA ACAGGATTAA TCCGTTTTAT CTTG-GAGAA AATTAAT 309 |||||||||| ||||| |||| |||||||||| |||| ||| | ||||||| TATTTCTTTA ACAGGTTTAA TCCGTTTTAT CTTGAGAG-A AATTAAT 166 hqPGS_C06HBa0112G05.1-10-_SGN-E258205+ (467 309) ******************************************************************************** EST sequence 8 +strand 591 n (File: SGN-E254845+) 1 AGGAAGTCAA ATTTTGATTT TTTAAACACT GAAAATTTGG CTTTCGGTGC ATTTATTTTT 61 CTCTCAAATA AAATCAAAGT GTCGATCGAC TGAGTCTGTG TGACTTGTTG TTGTTCTTAA 121 GTTCGTTGAA GTTAAAGAAG TTTGAGGTAC CGCTATTTCT TTAACAGGTT TAATCCGTTT 181 TATCTTGAGA 
GAAATTAATC CATAACCTTG GGTACCAGTG AGGGAATTAA ATTTCTTAAG 241 GACACATAGT AGTTTCTGTG GACTCGGATT AATTCTTGTA TTCTATATTA TTTTCTGCTT 301 CATCTTATTT CTGTTTCTGT TTATTAACTT TATAAATAAA AGTTATTATA AGAGTAACAA 361 TCTTAAGAAA ATTTATACAG TTTCTGTGTT TGAGGTTTCT GGAGAATAAA ACCTTTATGG 421 TTTTCTACTC TACTTGAATT TTTAAAATCA TTCGATTAAC GATTAAAAAA ACATAAAAAC 481 TTTATCGTTA AATCAGAAAC AGGTTGTGTA AAGAATTGCT CTTTACTGTT AGTATTTTAA 541 ATACTTAATT TATCTGCCAA TTGTGACAGA AAAAAAAGAC TAATTCAAGT C Predicted gene structure (within gDNA segment 1725 to 1): Exon 1 435 309 ( 127 n); cDNA 70 199 ( 130 n); score: 0.850 MATCH C06HBa0112G05.1-10- SGN-E254845+ 0.850 127 0.215 C PGS_C06HBa0112G05.1-10-_SGN-E254845+ (435 309) Alignment (genomic DNA sequence = upper lines): AAAATCAAAG TGTCGATCGA CTAAGTCTGT GTGAC---TT GTTATTCATA AGTTTGCTGA 379 |||||||||| |||||||||| || ||||||| ||||| || ||| ||| || |||| | ||| AAAATCAAAG TGTCGATCGA CTGAGTCTGT GTGACTTGTT GTTGTTCTTA AGTTCGTTGA 129 AGTTAAAGAA GCTTGAGGTA CCACTATTTC TTTAACAGGA TTAATCCGTT TTATCTTG-G 320 |||||||||| | |||||||| || ||||||| ||||||||| |||||||||| |||||||| | AGTTAAAGAA GTTTGAGGTA CCGCTATTTC TTTAACAGGT TTAATCCGTT TTATCTTGAG 189 AGAAAATTAA T 309 || ||||||| | AG-AAATTAA T 199 hqPGS_C06HBa0112G05.1-10-_SGN-E254845+ (435 309) ******************************************************************************** EST sequence 5 +strand 227 n (File: SGN-E261310+) 1 GGACATGTTC CTTGCTGAAA ATGGCTATAA AAGGAAGTCA AATTTTGATT TTTTAAACAC 61 TGAAAATTTT TCTTTCTCTG CATTTATTTT TCTCTCAAAT AAAATCAAAG TGTCGATCGA 121 CTGAGTCTGT GTGACTTGTT GTTGTTCTTA AGTTCGTTGA AGTTAAAGAA GTTTGAGGTA 181 CCGCTATTTC TTTAACAGGT TTAATCCGTT TTATCTTGAG AGAAATT Predicted gene structure (within gDNA segment 1219 to 1): Exon 1 528 312 ( 217 n); cDNA 2 227 ( 226 n); score: 0.800 MATCH C06HBa0112G05.1-10- SGN-E261310+ 0.800 217 0.956 C PGS_C06HBa0112G05.1-10-_SGN-E261310+ (528 312) Alignment (genomic DNA sequence = upper lines): GACACGTTTC TTGCTGAAAA TGGCTATAAA AGGAAGTCAT TTTTTATTTT TTCAAACACT 469 |||| ||| | |||||||||| 
|||||||||| ||||||||| |||| ||| || ||||||| GACATGTTCC TTGCTGAAAA TGGCTATAAA AGGAAGTCAA ATTTTGATTT TTTAAACACT 61 G-AAA-TTTT CCTTCTCTGC ATATATTTTT CTCTC----A AAATCAAAGT GTCGATCGAC 415 | ||| |||| | |||||||| || ||||||| ||||| | |||||||||| |||||||||| GAAAATTTTT CTTTCTCTGC ATTTATTTTT CTCTCAAATA AAATCAAAGT GTCGATCGAC 121 TAAGTCTGTG TGAC---TTG TTATTCATAA GTTTGCTGAA GTTAAAGAAG CTTGAGGTAC 358 | |||||||| |||| ||| || ||| ||| ||| | |||| |||||||||| ||||||||| TGAGTCTGTG TGACTTGTTG TTGTTCTTAA GTTCGTTGAA GTTAAAGAAG TTTGAGGTAC 181 CACTATTTCT TTAACAGGAT TAATCCGTTT TATCTTGGAG AAAATT 312 | |||||||| |||||||| | |||||||||| ||||||| ||||| CGCTATTTCT TTAACAGGTT TAATCCGTTT TATCTTGAGA GAAATT 227 hqPGS_C06HBa0112G05.1-10-_SGN-E261310+ (528 312) ******************************************************************************** EST sequence 2 -strand 757 n (File: SGN-E542858-) 1 GAACAGATTT CACAGAACAG ACACGTTCCT TGCTGAAAAT TGCTATAAAG GAAGTCAATT 61 TTGATTTTCA AACACTGAAA ATTTTCCTTC TCNGTATTAT TTTTCTCTAA AAAAAATCAA 121 AGTGTCGATC GACTGAGTCT GTGTGACTTG TTGCTGTTCT GAAGTTTGCT GAAGTTAAAG 181 AAGTTTGAGA AAAAAAAAGA CTAACTCAAG TCAAACAAAT GCTGGAATAG TAAGTGCTGC 241 AACAACATCG GCTGCACATA ATCATTCAGA TGCTATCTTA GCGGCGGCTG AGAAACCTGC 301 AGAGTTTTCT GGAGTCGACT TTGAGAGATG GCAGCAAAAG ATGTTCTTCT ATCTCACTAC 361 GTTGAGTCTG CAGAAGTTCA TTAATGAGAA TGTTCCTGTT TATCAGATGA AACTCCGGCT 421 GATGAACGAT TCTTGGTAAC AGAAGCATGG ACACACTCAG ATTTTTTGTG TAAAAATTAT 481 ATTTTGAGTG GTCTGCAAGA TGAACAACAA TGCCAAAACC TCAAAGAACT CTTGGATGCT 541 TTAGAAAAGA AGTACAAAAC AGAAGATGCC GGAATGAAGA AATTCATTGT GGTAAAATTT 601 TTGGACTATA AGATGATAGA CAATAAGACT GTCGTCACCC AAGTTCAAGA ATTGCAGGTC 661 ATAATCCATG ATCTGTGCTG AATGTATAAA TTTATTTAAT GCCTATGTTA GAAATATTAA 721 GTTTTCCCTT AATAAATTTA TTTAATTAAA AAAAAAA Predicted gene structure (within gDNA segment 2165 to 1): Exon 1 547 362 ( 186 n); cDNA 1 189 ( 189 n); score: 0.798 PPA cDNA 748 757 MATCH C06HBa0112G05.1-10- SGN-E542858- 0.798 186 0.246 C PGS_C06HBa0112G05.1-10-_SGN-E542858- (547 362) Alignment (genomic DNA 
sequence = upper lines): GAACAAATTT TTCTGAACAG ACACGTTTCT TGCTGAAAAT GGCTATAAAA GGAAGTCATT 488 ||||| |||| | |||||| ||||||| || |||||||||| ||||| ||| |||||||| | GAACAGATTT CACAGAACAG ACACGTTCCT TGCTGAAAAT TGCTAT-AAA GGAAGTCAAT 59 TTTTATTTTT TCAAACACTG -AAATTTTCC TTCTCTGCAT ATATTTTTCT CTC-AAA--A 432 ||| | ||| |||||||||| ||||||||| ||||| | || ||||||||| || ||| | TTTGA--TTT TCAAACACTG AAAATTTTCC TTCTCNGTAT -TATTTTTCT CTAAAAAAAA 116 TCAAAGTGTC GATCGACTAA GTCTGTGTGA CTTG-T--TA TTCATAAGTT TGCTGAAGTT 375 |||||||||| |||||||| | |||||||||| |||| | | ||| ||||| |||||||||| TCAAAGTGTC GATCGACTGA GTCTGTGTGA CTTGTTGCTG TTCTGAAGTT TGCTGAAGTT 176 AAAGAAGCTT GAG 362 ||||||| || ||| AAAGAAGTTT GAG 189 hqPGS_C06HBa0112G05.1-10-_SGN-E542858- (547 362) ******************************************************************************** EST sequence 11 +strand 574 n (File: SGN-E548743+) 1 TGGAGAGAAC TGTCTCTCAC TTGGTATCTT GAACTTTGGT AGCAAGAAGT ATGTGGAGCA 61 AGTAATCCAA CCTATGCATC TACGGATGTT GTATATAATC TGGGTCGGCT AGTTCAAACA 121 TATGTTCGGG ATTGCTCTTA ATATATCAAA TATCATTAAT CACTTGATCA AGATAAACTC 181 CCAAAACTTA ACCACATTTT GGACCAAATT ATAATAATAC CTTGCTCATG TTGGGGTTTA 241 GCAAGTGTGA ATGGGAAAAA AAAAAGAGAA TCTGAAAGGT GAGGGAACTA CTCTCGAGGG 301 AAACTGAAAA GTCATTTGCC AAGTGCAAAT GAAAATTCAT TTCTCCCATA TCGACTAAAG 361 AAAGGGAAAT TGTTGTCGTT ATATAAGGAA ACACTTCCAT TACTTCTTAA AGAGCTAAGA 421 AGAATATGCC CCCGTCGCGC CGTCATCATC GCTCGGCTTT GGATTTGGAT TTGGCAAATG 481 ATGTGATTGG ATTGATAAAT TTTTTGGACA AAATTTATTT AATCACCTTG TGTTAAATCA 541 CATAAATCCT GTTCATATTT ATCTCTAATA AATT Predicted gene structure (within gDNA segment 2955 to 1): Exon 1 2950 2942 ( 9 n); cDNA 221 229 ( 9 n); score: 0.889 Intron 1 2941 1774 (1168 n); Pd: 0.000 (s: 0), Pa: 0.198 (s: 0) Exon 2 1773 1769 ( 5 n); cDNA 230 234 ( 5 n); score: 0.800 Intron 2 1768 1426 ( 343 n); Pd: 0.522 (s: 0), Pa: 0.116 (s: 0) Exon 3 1425 1421 ( 5 n); cDNA 235 239 ( 5 n); score: 0.800 Intron 3 1420 1070 ( 351 n); Pd: 0.900 (s: 0), Pa: 0.000 (s: 0.68) Exon 4 1069 728 ( 342 n); cDNA 240 
574 ( 335 n); score: 0.784 MATCH C06HBa0112G05.1-10- SGN-E548743+ 0.784 361 0.629 C PGS_C06HBa0112G05.1-10-_SGN-E548743+ (2950 2942,1773 1769,1425 1421,1069 728) Alignment (genomic DNA sequence = upper lines): CTTGCTCAAA TCTAGATTTA AATCGACTTT ATAGTATCTA GATAGCCACC AAACAAGATT 2891 |||||||| CTTGCTCAT. .......... .......... .......... .......... .......... 229 ACAAAAATTA ATTTTGAAGC ATCAAATAAT TTAAATATTA TATATTCGCA TCGGTTTTTG 2831 .......... .......... .......... .......... .......... .......... 229 ATGTTATCGG TTCGGTTTCG GTTAACCCAA TAAGAAAATG TCGATCAAAT AATATATAAT 2771 .......... .......... .......... .......... .......... .......... 229 AGTACGTATG TTATAATTAC ATGTTACCAA CTTACCGCCA AAACATAATA GAAAACTTTG 2711 .......... .......... .......... .......... .......... .......... 229 AATGTTGATT AAATTACTAA CATTCACAAA CTTGCAAAAG CTATCGCCTA CAATTACGAA 2651 .......... .......... .......... .......... .......... .......... 229 CTAGAATTAA AACTAAAATA AAATATTTCA ATGAGACAAT CTTGAACAGT TGCTTGATCC 2591 .......... .......... .......... .......... .......... .......... 229 ACCGGATAAT CAACAAAGTT CTCACTAATT CCCTAACAAA AAAATCATAT AGCCTAAAAA 2531 .......... .......... .......... .......... .......... .......... 229 GCTAATATTG AATCTATTTA TGTATTAAAT TATGTATATT TAATATTGAG GAAGGGTAAA 2471 .......... .......... .......... .......... .......... .......... 229 GTAGTAAATA ACTATCGTCT TATTGGGTTA TCGGTGTACC CAATAATCCA ATAGGAAAAA 2411 .......... .......... .......... .......... .......... .......... 229 CCGAAAACGA CCCAATAACC CAATAATTAT TTTTTCTAAA CCCATTAAAA ACCCAATAAC 2351 .......... .......... .......... .......... .......... .......... 229 CCAATAACAA TAACCCGATA ACAATTTATC GGTTCGATTT ACTGGTCGAT TTGGTTTTTG 2291 .......... .......... .......... .......... .......... .......... 229 CACACCCCTA CGCAGACGCG TTTTTCGACT TAACAATATC GCACTTACTA TCATGGTTGT 2231 .......... .......... .......... .......... .......... .......... 229 TGTTGAACTT AAAAGATATA TTTATTTGAC ATATCCAACA CCAAACCTTC AATGAAGAAG 2171 .......... 
.......... .......... .......... .......... .......... 229 ATATAGCTAC AAGGGAAGCA GTAAGTTCAA CGTCAAATTA CAATAATTTT TCATTATGTT 2111 .......... .......... .......... .......... .......... .......... 229 ACATCCTTTT GAACAATTTT TGCTTCCCAG GAGTCAATTT GTGGAATAGA TTGGAATGGT 2051 .......... .......... .......... .......... .......... .......... 229 AAAGAGTTAC TATCTAAAGG TTAAAAGAAA TTTGATGAAT GAGAAAAGAT TTTTTGCTCT 1991 .......... .......... .......... .......... .......... .......... 229 TGGGTTCATT ATGCAGCTCA CATTAACATA CAAATTGGTA GTAAAATTGA GAATATACCC 1931 .......... .......... .......... .......... .......... .......... 229 GGGGGCGGAC TCACATGGTG TCCGCCGGGT GCTCGAGCAC CCATTAACTT CGTTACGAAA 1871 .......... .......... .......... .......... .......... .......... 229 TATATATATA TATCTATGTA AAAATTGATA GGTATTTATA TAAAATTAAC ATAGAACACC 1811 .......... .......... .......... .......... .......... .......... 229 CAATGAATAA ATTATATAGT TGGCCCAATG GTTCTAGGAT GGGTACTTAA GACTCTTTTA 1751 | | || .......... .......... .......... .......GTT GG........ .......... 234 ATGTTGTTGT ACCAAGGTTT GAATCTCATT GTTAACACAT ATTTTTTATA GTTTCACTTC 1691 .......... .......... .......... .......... .......... .......... 234 AGAGCACCCA CAACCTTAAA ATCTTAGATC CGCCACTGAA TATACCCAGT TTGAGCGTAT 1631 .......... .......... .......... .......... .......... .......... 234 TTTGAATTGG AAGTTTTATT TAGAAAAATA ATGGTAAAGA GTTACTATCT ACATTGTTTG 1571 .......... .......... .......... .......... .......... .......... 234 GCAGACCTCT CCAATTCGTT ATATTTATTC GGCTACAATC AAGATAGATG AAGCTCAAAT 1511 .......... .......... .......... .......... .......... .......... 234 GACACTTCTA AGAACAGAAC AATCTCTGCT AGATCGAGAA TTTCTTTTTG GGAATCATTA 1451 .......... .......... .......... .......... .......... .......... 234 TATTATTAAT CAAGAATTAA TATAGGTTTT GTATTCTTCT CAAAAAACAA TGTTGATAAA 1391 | ||| .......... .......... .....GGTTT .......... .......... .......... 239 GCTCAGATTC CTTCTTTTAG AGTCACAACA ACTTATATCA CAATTTAGCA CACAAAATAA 1331 .......... .......... 
.......... .......... .......... .......... 239 TAGGGCATCA AAATTATGCA TTTTGAAACA ATCTTCTTTA TTAAATTTAT TGACTTAAAA 1271 .......... .......... .......... .......... .......... .......... 239 ATAAATAGTA ATAGATGAGA TAATCAAAAA AACTTATATC AACCCCCTTA AGTACAAAAA 1211 .......... .......... .......... .......... .......... .......... 239 GACTTGTACA ATTACCAAAT GAAGCTTGAG TAGTACAGTT GATTTAGCTC ATAATCCATA 1151 .......... .......... .......... .......... .......... .......... 239 AGCTTTGCAC CATCATCAAG AGAAGAGTAG ATTTGAAGAA GATGAATGAG AACTTTGTTG 1091 .......... .......... .......... .......... .......... .......... 239 GGTTTAGCAA GCGTGAACGG GAAAAGAAAA GAAAGAATAT GAAAAGAGAG AATATGAAAA 1031 | | ||| | | |||| |||| ||| ||||| .......... .......... .AGCAAGTGT GAATGGGAAA AAAAAAAGAG AATCTGAAAG 278 GTGAGGGAAC TATTTTGGAG GAAAAATGAA AAGTTAGTTG CAAAGTGCAA AGGAAAAGTC 971 |||||||||| || | | ||| | ||| |||| |||| | ||| | |||||||| | ||||| || GTGAGGGAAC TACTCTCGAG GGAAACTGAA AAGTCATTTG CCAAGTGCAA ATGAAAATTC 338 ATTTCTCCCA TATCAGCAAA AGATATGGAA ATTGGTGTCC TTAGATAAGG AAACACTTCC 911 |||||||||| |||| | || ||| | |||| |||| |||| ||| |||||| |||||||||| ATTTCTCCCA TATCGACTAA AGAAAGGGAA ATTGTTGTCG TTATATAAGG AAACACTTCC 398 TTTACTTCTT AAAGAGCTAA GAAGAAGGTA CCCCCTCGCG CCGTCACCGT CGCTCGACCT 851 ||||||||| |||||||||| |||||| || ||| | || || | |||| | || | | ATTACTTCTT AAAGAGCTAA GAAGAA--TA TGCCC-C-CG TCG-CGCCGT C-ATC-A--T 449 TGACCTCGTT TTTGGATTTG GATTTGGCAA ATGATGTGAT T-GATTGATA AATTTTTAGG 792 | |||| |||||||||| |||||||||| |||||||||| | |||||||| ||||||| || CG--CTCGGC TTTGGATTTG GATTTGGCAA ATGATGTGAT TGGATTGATA AATTTTTTGG 507 ATAAAATTTA TTCAATCAAT TTTTG-T-AA TCAAATAAAT CCTATTAATA -TTATCTCTT 735 | |||||||| || ||||| || || | || ||| |||||| ||| || ||| |||||||| ACAAAATTTA TTTAATCACC TTGTGTTAAA TCACATAAAT CCTGTTCATA TTTATCTCTA 567 ATAAATT 728 ||||||| ATAAATT 574 hqPGS_C06HBa0112G05.1-10-_SGN-E548743+ (1069 728) ******************************************************************************** EST sequence 16 +strand 568 n (File: 
SGN-E301922+) 1 AGAGAACTGT CTCGAGTTTT TTTTTTTTTT TTTTTGGAGT AAGGGGTAGT AAAAAAAATA 61 TTTTTGAGCT AAAAACGTAA CGAGTTAAAA GATTGATCTA TATCGACTAG TTCAAACATA 121 TTTTCGGCAT TGTTCTTTAT ATATCAAATA TCATTAATCA CTTGTTCAAG ATAAACTCCC 181 AAATCTTAAC CACATTTTGG ACCAAATTAT AATAATACCT TGTTCTTGTT GGGGTTTAGC 241 AAGTGTGAAT GGGAAAAAAA AAAGAGAATA TGAAAGGTGA GGGAACTACT CTGGAGGGAA 301 ACTGAAAAGT CATTTGCAAA GTGCAAATGA AAATTCATTT CTCCCATATC GGCAAAAGAA 361 AGGGAAATTG TTGTCTTTAT ATAAGGAAAC ACTTCCATTA CTTTTTAAAG AGCTAAGAAG 421 AAGATGCCCC CTCGCGCCGT CATCATCGCT CGGCTTTGGA TTTGGATTTG GCAAATGATG 481 TGATTGGATT GATAAATTTT TTGGACAAAA TTTATTTAAT CACTTTTTGT TAAATCAAAT 541 AAATCCTGTT AATATTTATC TCTAATAA Predicted gene structure (within gDNA segment 2955 to 1): Exon 1 1060 731 ( 330 n); cDNA 247 568 ( 322 n); score: 0.830 PPA cDNA 35 16 MATCH C06HBa0112G05.1-10- SGN-E301922+ 0.830 330 0.581 C PGS_C06HBa0112G05.1-10-_SGN-E301922+ (1060 731) Alignment (genomic DNA sequence = upper lines): GAAAGAATAT GAAAAGAGAG AATATGAAAA GTGAGGGAAC TATTTTGGAG GAAAAATGAA 1001 ||| | | |||| |||| ||||||||| |||||||||| || | ||||| | ||| |||| GAATGGGAAA AAAAAAAGAG AATATGAAAG GTGAGGGAAC TACTCTGGAG GGAAACTGAA 306 AAGTTAGTTG CAAAGTGCAA AGGAAAAGTC ATTTCTCCCA TATCAGCAAA AGATATGGAA 941 |||| | ||| |||||||||| | ||||| || |||||||||| |||| ||||| ||| | |||| AAGTCATTTG CAAAGTGCAA ATGAAAATTC ATTTCTCCCA TATCGGCAAA AGAAAGGGAA 366 ATTGGTGTCC TTAGATAAGG AAACACTTCC TTTACTTCTT AAAGAGCTAA GAAGAAGGTA 881 |||| |||| ||| |||||| |||||||||| |||||| || |||||||||| ||||||| | ATTGTTGTCT TTATATAAGG AAACACTTCC ATTACTTTTT AAAGAGCTAA GAAGAAGATG 426 CCCCCTCGCG CCGTCACCGT CGCTCGACCT TGACCTCGTT TTTGGATTTG GATTTGGCAA 821 |||||||||| |||||| | | |||||| | | |||||||||| |||||||||| CCCCCTCGCG CCGTCATCAT CGCTCG---- -G----C--- TTTGGATTTG GATTTGGCAA 474 ATGATGTGAT T-GATTGATA AATTTTTAGG ATAAAATTTA TTCAATCAAT TTTTG-T-AA 764 |||||||||| | |||||||| ||||||| || | |||||||| || ||||| | ||||| | || ATGATGTGAT TGGATTGATA AATTTTTTGG ACAAAATTTA TTTAATCACT TTTTGTTAAA 534 TCAAATAAAT CCTATTAATA 
-TTATCTCTT ATAA 731 |||||||||| ||| |||||| |||||||| |||| TCAAATAAAT CCTGTTAATA TTTATCTCTA ATAA 568 hqPGS_C06HBa0112G05.1-10-_SGN-E301922+ (1060 731) ******************************************************************************** EST sequence 1 +strand 676 n (File: SGN-E348470+) 1 TACAGTATAT ATGTGCGGTT GGTGTTTATG CCCTTCAAAA ACTAGAGCAT AGAGTATATA 61 TGTTCTTCAA TGTAAAGAAA GACTAAATCG GAACACATCT GTCGATTTTA TATTTAATAA 121 ATGTTGAATC ATGTGTTAAG AGTGAGATTC GAACCTTGGT ACAACAACAC TTAAAAAGTC 181 TTAAAGTACC CATCTAGGAG CTATTGGGCC AACTAAATCA TTTATTCATT GGGTGTTTTA 241 TGTTAATTTT ATACAAATAC CTACCGATTT CTACATAGAT ATATATATTC TGTAACGAAG 301 TTAATGGGTG CTCGATCACC CGGCGGACAC CATGTGAGTC CGCCTCTGGA CACGTGTATG 361 TAGTGGATAA TATTATGACA CATGTATATA GTGGATAATA CTATGACACA TGTATGTAGT 421 GGACAAGATT ATGACACGTG TATGTACCTT GAACAAAGGG GTTCATCCGA ATTCCCTTCG 481 TTTATTCATG TGTCTAATTT TATAGATTTT GAACCCCCTT ATTGAAAATT CAGACTCCGT 541 CTCTTCGTGT ATGTCCGTTA GTATTAAGGG TATATATGAT CTATTTTTGA ATTATAGGGG 601 CGCCAATGTC CCAAAAGTTA ACGAATGGTA TCTGCATACC ATGTATGAGA GTTTGAGGTA 661 TATTTGGCCT TTTTTT Predicted gene structure (within gDNA segment 1 to 2955): Exon 1 1541 1564 ( 24 n); cDNA 91 113 ( 23 n); score: 0.583 Intron 1 1565 1694 ( 130 n); Pd: 0.968 (s: 0), Pa: 0.000 (s: 0.72) Exon 2 1695 1932 ( 238 n); cDNA 114 349 ( 236 n); score: 0.836 MATCH C06HBa0112G05.1-10+ SGN-E348470+ 0.836 262 0.388 C PGS_C06HBa0112G05.1-10+_SGN-E348470+ (1541 1564,1695 1932) Alignment (genomic DNA sequence = upper lines): GAATAAATAT AACGAATTGG AGAGGTCTGC CAAACAATGT AGATAGTAAC TCTTTACCAT 1600 ||| | || | || ||| | | GAACACATCT GTCG-ATTTT ATAT...... .......... .......... .......... 113 TATTTTTCTA AATAAAACTT CCAATTCAAA ATACGCTCAA ACTGGGTATA TTCAGTGGCG 1660 .......... .......... .......... .......... .......... .......... 113 GATCTAAGAT TTTAAGGTTG TGGGTGCTCT GAAGTGAAAC TATAAAAAAT -ATGTGTTAA 1719 | || || ||| ||||||||| .......... .......... .......... 
....TTAATA AATGTTGAAT CATGTGTTAA 139 CAATGAGATT CAAACCTTGG TACAACAACA TTAAAAGAGT CTT-AAGTAC CCATCCTAGA 1778 | ||||||| | |||||||| |||||||||| | ||| ||| ||| |||||| ||||| || GAGTGAGATT CGAACCTTGG TACAACAACA CTTAAAAAGT CTTAAAGTAC CCATCTAGGA 199 ACCATTGGGC CAACTATATA ATTTATTCAT TGGGTGTTCT ATGTTAATTT TATATAAATA 1838 | ||||||| |||||| || |||||||||| |||||||| | |||||||||| |||| ||||| GCTATTGGGC CAACTAAATC ATTTATTCAT TGGGTGTTTT ATGTTAATTT TATACAAATA 259 CCTATCAATT TTTACATAGA TATATATATA TATTTCGTAA CGAAGTTAAT GGGTGCTCGA 1898 |||| | ||| | ||||||| ||||||||| | | | |||| |||||||||| |||||||||| CCTACCGATT TCTACATAG- -ATATATATA T-TCT-GTAA CGAAGTTAAT GGGTGCTCGA 315 GCACCCGGCG GACACCATGT GAGTCCGCCC CCGG 1932 ||||||||| |||||||||| ||||||||| | || TCACCCGGCG GACACCATGT GAGTCCGCCT CTGG 349 hqPGS_C06HBa0112G05.1-10+_SGN-E348470+ (1541 1564,1695 1932) ******************************************************************************** EST sequence 10 -strand 526 n (File: SGN-E244115-) 1 TAATTTTTTA ATACAATTGA TAAAACAAAA AAAAAAATCA ATTTTAACTC CCAACGAAAT 61 GCTACTATGG ATCGAAAAAG GATGGTTTGA ATAGGATCTA CTAATACTAT CAAAAGTATT 121 GTGTGAACTT TATTGCATAC ATCTAATGGG AAGTCAGATT TTTCCTTCCC TCCGTTAAAA 181 ACATTACGTC ATTCTTTAAA TTTTTATTTA ATTTAAGATA TGCATCATTA GTGTAAAAAA 241 ATTACATCTT AATGCAGTCT AAATGTTGAA CACATGTAAA TGAGAAACAA ATTGTGCTTA 301 ACTATGTATG ACGTATGATG TATCTATAAC TTAAAAGGCA GAACACATAA GCGGTCCCTT 361 GAATTTGTTA AGATTTTTTT TATAGACACT TTAACAAGAC TTGCTTCTTA TTGAACACTC 421 AAACCCATTG GCGGATCTAG GAATTTAAGG TTGTGGGTGT TCGTGAAAAT ATAAAAAATA 481 TGTGTTAAAA GTGAGATTCG AACCTTGGTA CAACAACACT AAAAAA Predicted gene structure (within gDNA segment 1 to 2429): Exon 1 1656 1755 ( 100 n); cDNA 429 524 ( 96 n); score: 0.880 MATCH C06HBa0112G05.1-10+ SGN-E244115- 0.880 100 0.190 C PGS_C06HBa0112G05.1-10+_SGN-E244115- (1656 1755) Alignment (genomic DNA sequence = upper lines): TGGCGGATCT AAGATTTTAA GGTTGTGGGT GCTCTGAAGT GAAACTATAA AAAATATGTG 1715 |||||||||| | || ||||| |||||||||| | | | || |||| ||||| |||||||||| TGGCGGATCT 
AGGAATTTAA GGTTGTGGGT G-T-T--CGT GAAAATATAA AAAATATGTG 484 TTAACAATGA GATTCAAACC TTGGTACAAC AACATTAAAA 1755 |||| | ||| ||||| |||| |||||||||| |||| ||||| TTAAAAGTGA GATTCGAACC TTGGTACAAC AACACTAAAA 524 hqPGS_C06HBa0112G05.1-10+_SGN-E244115- (1656 1755) Total number of EST alignments reported: 17 ________________________________________________________________________________ Predicted gene locations (2) in segment 1 to 2955: PGL 1 (- strand): 1070 309 AGS-1 (1070 965,845 731,696 309) SCR (e 0.821 d 0.000 a 0.000,e 0.887 d 0.000 a 0.000,e 0.843) Exon 1 1070 965 ( 106 n); score: 0.821 Intron 1 964 846 ( 119 n); Pd: 0.000 Pa: 0.000 Exon 2 845 731 ( 115 n); score: 0.887 Intron 2 730 697 ( 34 n); Pd: 0.000 Pa: 0.000 Exon 3 696 309 ( 388 n); score: 0.843 PGS (528 309) SGN-E262710+ PGS (528 309) SGN-E255327+ PGS (528 309) SGN-E369760+ PGS (519 309) SGN-E262800+ PGS (497 309) SGN-E273518+ PGS (467 309) SGN-E258205+ PGS (435 309) SGN-E254845+ PGS (528 312) SGN-E261310+ PGS (547 362) SGN-E542858- PGS (1070 965,845 731,696 499) SGN-E395007+ PGS (1070 965,845 731,696 602) SGN-E250408+ 3-phase translation of AGS-1 (-strand): . . . . . . 1070 GAAAAGAAAAGAAAGAATATGAAAAGAGAGAATATGAAAAGTGAGGGAACTATTTTGGAG E K K R K N M K R E N M K S E G T I L E K R K E R I - K E R I - K V R E L F W R K E K K E Y E K R E Y E K - G N Y F G . . . . . : . 1010 GAAAAATGAAAAGTTAGTTGCAAAGTGCAAAGGAAAAGTCATTTCT : TCGTTTTTGGATTT E K - K V S C K V Q R K S H F : F V F G F K N E K L V A K C K G K V I S : S F L D L G K M K S - L Q S A K E K S F L : R F W I . . . . . . 831 GGATTTGGCAAATGATGTGATTGATTGATAAATTTTTAGGATAAAATTTATTCAATCAAT G F G K - C D - L I N F - D K I Y S I N D L A N D V I D - - I F R I K F I Q S I W I W Q M M - L I D K F L G - N L F N Q . . . . . : . 771 TTTTGTAATCAAATAAATCCTATTAATATTATCTCTTATAA : ATTTGTGGGTAACGGTAAC F C N Q I N P I N I I S Y K : F V G N G N F V I K - I L L I L S L I : N L W V T V T F L - S N K S Y - Y Y L L - : I C G - R - . . . . . . 
677 ATTTCAAAAAGTTTTTAATCTTTTCGAAAAGTCGTTACTTTCGGAAAAACCGTTATTTTT I S K S F - S F R K V V T F G K T V I F F Q K V F N L F E K S L L S E K P L F F H F K K F L I F S K S R Y F R K N R Y F . . . . . . 617 CTTACAGATACATTTTCCAAAAAGTTGTTATTTTTTCCAAAAGACACAACTTTCTGGATA L T D T F S K K L L F F P K D T T F W I L Q I H F P K S C Y F F Q K T Q L S G - S Y R Y I F Q K V V I F S K R H N F L D . . . . . . 557 AAACGGGTTTGAACAAATTTTTCTGAACAGACACGTTTCTTGCTGAAAATGGCTATAAAA K R V - T N F S E Q T R F L L K M A I K N G F E Q I F L N R H V S C - K W L - K K T G L N K F F - T D T F L A E N G Y K . . . . . . 497 GGAAGTCATTTTTTATTTTTTCAAACACTGAAATTTTCCTTCTCTGCATATATTTTTCTC G S H F L F F Q T L K F S F S A Y I F L E V I F Y F F K H - N F P S L H I F F S R K S F F I F S N T E I F L L C I Y F S . . . . . . 437 TCAAAATCAAAGTGTCGATCGACTAAGTCTGTGTGACTTGTTATTCATAAGTTTGCTGAA S K S K C R S T K S V - L V I H K F A E Q N Q S V D R L S L C D L L F I S L L K L K I K V S I D - V C V T C Y S - V C - . . . . . . 377 GTTAAAGAAGCTTGAGGTACCACTATTTCTTTAACAGGATTAATCCGTTTTATCTTGGAG V K E A - G T T I S L T G L I R F I L E L K K L E V P L F L - Q D - S V L S W R S - R S L R Y H Y F F N R I N P F Y L G . 317 AAAATTAAT K I N K L E N - Maximal non-overlapping open reading frames (>= 64 codons): none AGS-2 (1069 736,701 610) SCR (e 0.834 d 0.000 a 0.000,e 0.837) Exon 1 1069 736 ( 334 n); score: 0.834 Intron 1 735 702 ( 34 n); Pd: 0.000 Pa: 0.000 Exon 2 701 610 ( 92 n); score: 0.837 PGS (1060 736,701 610) SGN-E542859+ PGS (1060 736,701 668) SGN-E301820+ PGS (1069 728) SGN-E548743+ PGS (1060 731) SGN-E301922+ 3-phase translation of AGS-2 (-strand): . . . . . . 1069 AAAAGAAAAGAAAGAATATGAAAAGAGAGAATATGAAAAGTGAGGGAACTATTTTGGAGG K R K E R I - K E R I - K V R E L F W R K E K K E Y E K R E Y E K - G N Y F G G K K R K N M K R E N M K S E G T I L E . . . . . . 
1009 AAAAATGAAAAGTTAGTTGCAAAGTGCAAAGGAAAAGTCATTTCTCCCATATCAGCAAAA K N E K L V A K C K G K V I S P I S A K K M K S - L Q S A K E K S F L P Y Q Q K E K - K V S C K V Q R K S H F S H I S K . . . . . . 949 GATATGGAAATTGGTGTCCTTAGATAAGGAAACACTTCCTTTACTTCTTAAAGAGCTAAG D M E I G V L R - G N T S F T S - R A K I W K L V S L D K E T L P L L L K E L R R Y G N W C P - I R K H F L Y F L K S - . . . . . . 889 AAGAAGGTACCCCCTCGCGCCGTCACCGTCGCTCGACCTTGACCTCGTTTTTGGATTTGG K K V P P R A V T V A R P - P R F W I W R R Y P L A P S P S L D L D L V F G F G E E G T P S R R H R R S T L T S F L D L . . . . . . 829 ATTTGGCAAATGATGTGATTGATTGATAAATTTTTAGGATAAAATTTATTCAATCAATTT I W Q M M - L I D K F L G - N L F N Q F F G K - C D - L I N F - D K I Y S I N F D L A N D V I D - - I F R I K F I Q S I . . . . : . . 769 TTGTAATCAAATAAATCCTATTAATATTATCTCT : AATTAATTTGTGGGTAACGGTAACAT L - S N K S Y - Y Y L : - L I C G - R - H C N Q I N P I N I I S : N - F V G N G N I F V I K - I L L I L S L : I N L W V T V T . . . . . . 675 TTCAAAAAGTTTTTAATCTTTTCGAAAAGTCGTTACTTTCGGAAAAACCGTTATTTTTCT F K K F L I F S K S R Y F R K N R Y F S S K S F - S F R K V V T F G K T V I F L F Q K V F N L F E K S L L S E K P L F F . 615 TACAGA Y R T L Q Maximal non-overlapping open reading frames (>= 64 codons): none PGL 2 (+ strand): 1541 1932 AGS-1 (1541 1564,1695 1932) SCR (e 0.583 d 0.968 a 0.000,e 0.836) Exon 1 1541 1564 ( 24 n); score: 0.583 Intron 1 1565 1694 ( 130 n); Pd: 0.968 Pa: 0.000 Exon 2 1695 1932 ( 238 n); score: 0.836 PGS (1541 1564,1695 1932) SGN-E348470+ 3-phase translation of AGS-1 (+strand): . . . : . . . 1541 GAATAAATATAACGAATTGGAGAG : TGAAACTATAAAAAATATGTGTTAACAATGAGATTC E - I - R I G E : - N Y K K Y V L T M R F N K Y N E L E S : E T I K N M C - Q - D S I N I T N W R : V K L - K I C V N N E I . . . . . . 1731 AAACCTTGGTACAACAACATTAAAAGAGTCTTAAGTACCCATCCTAGAACCATTGGGCCA K P W Y N N I K R V L S T H P R T I G P N L G T T T L K E S - V P I L E P L G Q Q T L V Q Q H - K S L K Y P S - N H W A . . . . . . 
1791 ACTATATAATTTATTCATTGGGTGTTCTATGTTAATTTTATATAAATACCTATCAATTTT T I - F I H W V F Y V N F I - I P I N F L Y N L F I G C S M L I L Y K Y L S I F N Y I I Y S L G V L C - F Y I N T Y Q F . . . . . . 1851 TACATAGATATATATATATATTTCGTAACGAAGTTAATGGGTGCTCGAGCACCCGGCGGA Y I D I Y I Y F V T K L M G A R A P G G T - I Y I Y I S - R S - W V L E H P A D L H R Y I Y I F R N E V N G C S S T R R . . . 1911 CACCATGTGAGTCCGCCCCCGG H H V S P P P T M - V R P R T P C E S A P Maximal non-overlapping open reading frames (>= 64 codons): none AGS-2 (1656 1755) SCR (e 0.880) Exon 1 1656 1755 ( 100 n); score: 0.880 PGS (1656 1755) SGN-E244115- 3-phase translation of AGS-2 (+strand): . . . . . . 1656 TGGCGGATCTAAGATTTTAAGGTTGTGGGTGCTCTGAAGTGAAACTATAAAAAATATGTG W R I - D F K V V G A L K - N Y K K Y V G G S K I L R L W V L - S E T I K N M C A D L R F - G C G C S E V K L - K I C . . . . 1716 TTAACAATGAGATTCAAACCTTGGTACAACAACATTAAAA L T M R F K P W Y N N I K - Q - D S N L G T T T L K V N N E I Q T L V Q Q H - Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-2 (-strand): . . . . . . 1755 TTTTAATGTTGTTGTACCAAGGTTTGAATCTCATTGTTAACACATATTTTTTATAGTTTC F - C C C T K V - I S L L T H I F Y S F F N V V V P R F E S H C - H I F F I V S L M L L Y Q G L N L I V N T Y F L - F . . . . 1695 ACTTCAGAGCACCCACAACCTTAAAATCTTAGATCCGCCA T S E H P Q P - N L R S A L Q S T H N L K I L D P P H F R A P T T L K S - I R Maximal non-overlapping open reading frames (>= 64 codons): none ... finished at: Mon Aug 28 22:20:24 2006 ________________________________________________________________________________ Sequence 11: C06HBa0112G05.1-11, from 1 to 2373, both strands analyzed. ... started at: Mon Aug 28 22:20:24 2006 EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 5 ... 
matches indexed, elapsed seconds = 5
HitsTableSize = 0
EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA -strand ...
... found all matches, elapsed seconds = 5
... matches indexed, elapsed seconds = 5
HitsTableSize = 0
No significant EST matches were found.
... finished at: Mon Aug 28 22:20:34 2006
________________________________________________________________________________
Sequence 12: C06HBa0112G05.1-12, from 1 to 3049, both strands analyzed.
... started at: Mon Aug 28 22:20:34 2006
EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA +strand ...
... found all matches, elapsed seconds = 5
... matches indexed, elapsed seconds = 5
HitsTableSize = 0
EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA -strand ...
... found all matches, elapsed seconds = 5
... matches indexed, elapsed seconds = 6
HitsTableSize = 0
No significant EST matches were found.
... finished at: Mon Aug 28 22:20:44 2006
________________________________________________________________________________
Sequence 13: C06HBa0112G05.1-13, from 1 to 605, both strands analyzed.
... started at: Mon Aug 28 22:20:44 2006
EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA +strand ...
... found all matches, elapsed seconds = 4
... matches indexed, elapsed seconds = 5
HitsTableSize = 0
EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA -strand ...
... found all matches, elapsed seconds = 5
... matches indexed, elapsed seconds = 5
HitsTableSize = 0
No significant EST matches were found.
... finished at: Mon Aug 28 22:20:54 2006
________________________________________________________________________________
Sequence 14: C06HBa0112G05.1-14, from 1 to 1395, both strands analyzed.
... started at: Mon Aug 28 22:20:54 2006
EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA +strand ...
... found all matches, elapsed seconds = 5 ...
matches indexed, elapsed seconds = 5
HitsTableSize = 0
EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA -strand ...
... found all matches, elapsed seconds = 5
... matches indexed, elapsed seconds = 5
HitsTableSize = 0
No significant EST matches were found.
... finished at: Mon Aug 28 22:21:05 2006
________________________________________________________________________________
Sequence 15: C06HBa0112G05.1-15, from 1 to 2296, both strands analyzed.
... started at: Mon Aug 28 22:21:05 2006
EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA +strand ...
... found all matches, elapsed seconds = 5
... matches indexed, elapsed seconds = 5
HitsTableSize = 0
EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA -strand ...
... found all matches, elapsed seconds = 5
... matches indexed, elapsed seconds = 5
HitsTableSize = 0
No significant EST matches were found.
... finished at: Mon Aug 28 22:21:15 2006
________________________________________________________________________________
Sequence 16: C06HBa0112G05.1-16, from 1 to 1407, both strands analyzed.
... started at: Mon Aug 28 22:21:15 2006
EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA +strand ...
... found all matches, elapsed seconds = 5
... matches indexed, elapsed seconds = 5
HitsTableSize = 0
EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA -strand ...
... found all matches, elapsed seconds = 5
... matches indexed, elapsed seconds = 5
HitsTableSize = 0
No significant EST matches were found.
... finished at: Mon Aug 28 22:21:25 2006
________________________________________________________________________________
Sequence 17: C06HBa0112G05.1-17, from 1 to 2015, both strands analyzed.
... started at: Mon Aug 28 22:21:25 2006
EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA +strand ...
... found all matches, elapsed seconds = 5 ...
matches indexed, elapsed seconds = 5 HitsTableSize = 4 EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 5 HitsTableSize = 1 ******************************************************************************** EST sequence 4 -strand 727 n (File: SGN-E357460-) 1 AAAAGAAGGA TAAAACTATT TGTACATGTA GATCTAGATC TATATATCCC GTGAAACTAA 61 TAGAAGATCG AGATAGTTTA CAAAACTGAA ACTTGTTCAT CCCTAGCAGA TTTGTATGTA 121 TGTTTATATA TATTTCCATG TTTTTATAGA CATGCTAGAC CTTTAGCTTC AGATCCTTTG 181 ATAATTCTAA GCCTTTTGCA AGAACTTATA AACATCTCCC ATGGTACATC ACCAACAAGC 241 ATCCAATCAC CATCCTTGTC TTCATATGTT GGTGCATAAT CAGATCCATT ATATCCTTCT 301 CTTTCTGAAT ACACTCCAAT AGTGCACTTG AACATGCTTT GTAAAGCCTT GAGTAGCTCT 361 TGATAACTTT TGTAAACCTG AAGATCAATT TTCCTCAAAT AAGGTGCTCC ATCCATGCTA 421 ACTTTTAAAT ACATTCCTGA GGAATTATTA TCAGATTCTG ATAATTTAGA CACATGATTT 481 TTCCTGTATG ATCGAACTGG TGGCCAACCA ACAACTTGTG CTTTTGGTGC AGGAGCTGAG 541 TCTTGTTGTT CATTTTCATT TACACTACTA GATGAAGGTC TTTTTCTGCT ATTTTTAGAA 601 GTACTAGTTG ATGTTGAAGA TTCATCGTTT ATTATCCCAG GCAAACCTAA TCTTAGCTCA 661 GTTGCCTCAA GATCATTGAT ATCCTTCTCG TAAATTCTCA TGATTTAGAA AGAATGATTG 721 ATTTGAA Predicted gene structure (within gDNA segment 2015 to 1): Exon 1 1523 1290 ( 234 n); cDNA 1 220 ( 220 n); score: 0.850 MATCH C06HBa0112G05.1-17- SGN-E357460- 0.850 234 0.322 C PGS_C06HBa0112G05.1-17-_SGN-E357460- (1523 1290) Alignment (genomic DNA sequence = upper lines): AAAAGAAAGA T-AAACTATT TGTACATG-A GATCTAGATC TATATATCCC GTGTAATTAA 1466 ||||||| || | |||||||| |||||||| | |||||||||| |||||||||| ||| || ||| AAAAGAAGGA TAAAACTATT TGTACATGTA GATCTAGATC TATATATCCC GTGAAACTAA 60 TCGAAAATCA AGATAGTTTA CAAAACTGAA ACTTGTTCAA ACTTGTTCAT TCCTATCAGA 1406 | ||| ||| |||||||||| |||||||||| |||||||| | || |||| |||| TAGAAGATCG AGATAGTTTA CAAAACTGAA ACTTGTTC-- A-----TC-- -CCTAGCAGA 110 TTGGTATATA TGTGTGTATA TATATATATG TCCATGTTTT TATAGACTTG CTAAACCTTT 1346 || |||| || ||| | |||| 
||||| | | ||||||||| ||||||| || ||| |||||| TTTGTATGTA TGT-T-TATA TATAT-T-T- -CCATGTTTT TATAGACATG CTAGACCTTT 164 AGCTTCAGAT CCTTTGATAA TTCTAAGCCT TTTGCAAGAA CTGATAAACC TCACCC 1290 |||||||||| |||||||||| |||||||||| |||||||||| || |||||| || ||| AGCTTCAGAT CCTTTGATAA TTCTAAGCCT TTTGCAAGAA CTTATAAACA TCTCCC 220 hqPGS_C06HBa0112G05.1-17-_SGN-E357460- (1523 1290) ******************************************************************************** EST sequence 1 -strand 633 n (File: SGN-E355238-) 1 AGATAGTTTA CAAAACTGAA ACTGGTTCAT CCTTAGCAGA TTTGTATGTA TGTTTATATA 61 TATTTCCATG TTTTTATAGA CATGCTAGAC CTTTAGCTTC AGATCCTTTG ATAATTCTAA 121 GCCTTTTGCA AGAACTTATA AACATCTCCC ATGGTACATC ACCAACAAGC ATCCAATCAC 181 CATCCTTGTC TTCATATGTT GGTGCATAAT CAGATCCATT ATATCCTTCT CTTTCTGAAT 241 ACACTCCAAT AGTGCACTTG AACATGCTTT GTAAAGCCTT GAGTAGCTCT TGATAACTTT 301 TGTAAACCTG AAGATCAATT TTCCTCAAAT AAGGTGCTCC ATCCATGCTA ACTTTTAAAT 361 ACATTCCTGA GGAATTATTA TCAGATTCTG ATAATTTAGA CACATGATTT TTCCTGTATG 421 ATCGAACTGG TGGCCAACCA ACAACTTGTG CTTTTGGTGC AGGAGCTGAG TCTTGTTGTT 481 CATTTTCATT TACACTACTA GATGAAGGTC TTTTTCTGCT ATTTTTAGAA GTACTAGTTG 541 ATGTTGAAGA TTCATCGTTT ATTATCCCAG GCAAACCTAA TCTTAGCTCA GTTGCCTCAA 601 GATCATTGAT ATCCTTCTCG TAAATTCTCA TGA Predicted gene structure (within gDNA segment 2015 to 1): Exon 1 1385 1290 ( 96 n); cDNA 55 150 ( 96 n); score: 0.938 MATCH C06HBa0112G05.1-17- SGN-E355238- 0.938 96 0.152 C PGS_C06HBa0112G05.1-17-_SGN-E355238- (1385 1290) Alignment (genomic DNA sequence = upper lines): TATATATATG TCCATGTTTT TATAGACTTG CTAAACCTTT AGCTTCAGAT CCTTTGATAA 1326 ||||||||| |||||||||| ||||||| || ||| |||||| |||||||||| |||||||||| TATATATATT TCCATGTTTT TATAGACATG CTAGACCTTT AGCTTCAGAT CCTTTGATAA 114 TTCTAAGCCT TTTGCAAGAA CTGATAAACC TCACCC 1290 |||||||||| |||||||||| || |||||| || ||| TTCTAAGCCT TTTGCAAGAA CTTATAAACA TCTCCC 150 hqPGS_C06HBa0112G05.1-17-_SGN-E355238- (1385 1290) ******************************************************************************** EST sequence 2 -strand 598 n (File: 
SGN-E354765-) 1 GCAGATTGGT AGGTATGTTT ATATATATTT CCATGTTTTT ATAGACATGC TAGACCTTTA 61 GCTTCAGATC CTTTGATAAT TCTAAGCCTT TCGCAAGAAC TTATAAACAT CTCCCATGGT 121 ACATCACCAA CAAGCATCCA ATCACCATCT TTGTCTTCAT ATGTTGGTGC ATAATCAGAT 181 CCATTATATC CTTCTCTTTC TGAATACACT CCAATAGTGC ACTTGAACAT GCTTTGTAAA 241 GCCTTGAGTA GCTCTTGATA ACTTTTGTAA ACCTGAAGAT CAATTTTCCT CAAATAAGGT 301 GCTCCATCCA TGCTAACTTT TAAATACATT CCTGAGGAAT TATTATCAGA TTCTGATAAT 361 TTAGACACAT GATTTTTCCT GTATGATCGA ACTGGTGGCC AACCAACAAC TTGTGCTTTT 421 GGTGCAGGAG CTGAGTCTTG TTGTTCATTT TCATTTACAC TACTAGATGA AGGTCTTTTT 481 CTGCTATTTT TAGAAGTACT AGTTGATGTT GAAGATTCAT CGTTTATTAT CCCAGGCAAA 541 CCTAATCTTA GCTCAGTTGC CTCAAGATCA TTGATATCCT TCTCGTAAAT TCTCATGA Predicted gene structure (within gDNA segment 2015 to 1): Exon 1 1385 1290 ( 96 n); cDNA 20 115 ( 96 n); score: 0.927 MATCH C06HBa0112G05.1-17- SGN-E354765- 0.927 96 0.161 C PGS_C06HBa0112G05.1-17-_SGN-E354765- (1385 1290) Alignment (genomic DNA sequence = upper lines): TATATATATG TCCATGTTTT TATAGACTTG CTAAACCTTT AGCTTCAGAT CCTTTGATAA 1326 ||||||||| |||||||||| ||||||| || ||| |||||| |||||||||| |||||||||| TATATATATT TCCATGTTTT TATAGACATG CTAGACCTTT AGCTTCAGAT CCTTTGATAA 79 TTCTAAGCCT TTTGCAAGAA CTGATAAACC TCACCC 1290 |||||||||| || ||||||| || |||||| || ||| TTCTAAGCCT TTCGCAAGAA CTTATAAACA TCTCCC 115 hqPGS_C06HBa0112G05.1-17-_SGN-E354765- (1385 1290) ******************************************************************************** EST sequence 3 -strand 634 n (File: SGN-E395178-) 1 AGATAGTTTA CAAAATGAAA ACTTGTTCAT CCCTAGCAGA TTTGTATGTA TGTTTATATA 61 TATTTCCATG TTTTTATAGA CATGCTAGAC CTTTAGTTTC AGATCCTTTG ATAATTCTAA 121 GCCTTTTGCA AGAACTTATA AACATCTCCC ATGGTACATC ACCAACAAGC ATCCAATCAC 181 CATCCTTGTC TTCATATGTT GGTGCATAAT CAGATCCATT ATATCCTTCT CTTTCTGAAT 241 ACACTCCAAT AGTGCACTTG AACATGCTTT GTAAAGCCTT GAGTAGCTCT TGATAACTTT 301 TGTAAACCTG AAGATCAATT TTCCTCAAAT AAGGTGCTCC ATCCATGCTA ACTTTTAAAT 361 ACATTCCTGA GGAATTATTA TCAGATTCTG ATAATTTAGA CACATGATTT TTCCTGTATG 421 ATCGAACTGG 
TGGCCAACCA ACAACTTGTG CTTTTGGTGC AGGAGCTGAG TCTTGTTGTT 481 CATTTTCATT TACACTACTA GATGAAGGTC TTTTTCTGCT ATTTTTAGAA GTACTAGTTG 541 ATGTTGAAGA TTCATCGTTT ATTATCCCAG GCAAACCTAA TCTTAGCTCA GTTGCCTCAA 601 GATCATTGAT ATCCTTCTCG TAAATTCTCA TGAT Predicted gene structure (within gDNA segment 2015 to 1): Exon 1 1385 1290 ( 96 n); cDNA 55 150 ( 96 n); score: 0.927 MATCH C06HBa0112G05.1-17- SGN-E395178- 0.927 96 0.151 C PGS_C06HBa0112G05.1-17-_SGN-E395178- (1385 1290) Alignment (genomic DNA sequence = upper lines): TATATATATG TCCATGTTTT TATAGACTTG CTAAACCTTT AGCTTCAGAT CCTTTGATAA 1326 ||||||||| |||||||||| ||||||| || ||| |||||| || ||||||| |||||||||| TATATATATT TCCATGTTTT TATAGACATG CTAGACCTTT AGTTTCAGAT CCTTTGATAA 114 TTCTAAGCCT TTTGCAAGAA CTGATAAACC TCACCC 1290 |||||||||| |||||||||| || |||||| || ||| TTCTAAGCCT TTTGCAAGAA CTTATAAACA TCTCCC 150 hqPGS_C06HBa0112G05.1-17-_SGN-E395178- (1385 1290) ******************************************************************************** EST sequence 5 +strand 205 n (File: SGN-E398137+) 1 TTTTTTTTTT GTGCTAAAGT TTTGTATTTC TTCCCGTATA TTTATACTCG TACATAATAC 61 ACAAGGAAAT TCTTCCAAAA AAAAAAAAAA GAAGGATAAA ACTATTTGTA CATGTAGATC 121 TAGATCTATA TATCCCGTGA AACTAATAAA AGATCGAGAT AGTTTACAAA ACTGAAACTT 181 GTTCATCCCT AGCAGATTTG TATGT Predicted gene structure (within gDNA segment 2015 to 617): Exon 1 1600 1410 ( 191 n); cDNA 3 203 ( 201 n); score: 0.759 MATCH C06HBa0112G05.1-17- SGN-E398137+ 0.759 191 0.932 C PGS_C06HBa0112G05.1-17-_SGN-E398137+ (1600 1410) Alignment (genomic DNA sequence = upper lines): TTCATATTGT GCTAAAATTT TGTATTTCTT CCCGTATATT TATACTCATA CATAATACAC 1541 || | |||| |||||| ||| |||||||||| |||||||||| ||||||| || |||||||||| TTTTTTTTGT GCTAAAGTTT TGTATTTCTT CCCGTATATT TATACTCGTA CATAATACAC 62 AAGAAAATTC TT-C----A- AAA-AAAAGA AAGAT-AAAC TATTTGTACA TG-AGATCTA 1490 ||| |||||| || | | ||| |||||| | ||| |||| |||||||||| || ||||||| AAGGAAATTC TTCCAAAAAA AAAAAAAAGA AGGATAAAAC TATTTGTACA TGTAGATCTA 122 GATCTATATA TCCCGTGTAA TTAATCGAAA ATCAAGATAG 
TTTACAAAAC TGAAACTTGT 1430 |||||||||| ||||||| || |||| || ||| |||||| |||||||||| |||||||||| GATCTATATA TCCCGTGAAA CTAATAAAAG ATCGAGATAG TTTACAAAAC TGAAACTTGT 182 TCA-AACTTG TTCATTCCTA T 1410 ||| || | ||| || | TCATCCCTAG CAGATTTGTA T 203 hqPGS_C06HBa0112G05.1-17-_SGN-E398137+ (1600 1410) Total number of EST alignments reported: 5 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 2015: PGL 1 (- strand): 1600 1290 AGS-1 (1600 1290) SCR (e 0.850) Exon 1 1600 1290 ( 311 n); score: 0.850 PGS (1523 1290) SGN-E357460- PGS (1385 1290) SGN-E355238- PGS (1385 1290) SGN-E354765- PGS (1385 1290) SGN-E395178- PGS (1600 1410) SGN-E398137+ 3-phase translation of AGS-1 (-strand): . . . . . . 1600 TTCATATTGTGCTAAAATTTTGTATTTCTTCCCGTATATTTATACTCATACATAATACAC F I L C - N F V F L P V Y L Y S Y I I H S Y C A K I L Y F F P Y I Y T H T - Y T H I V L K F C I S S R I F I L I H N T . . . . . . 1540 AAGAAAATTCTTCAAAAAAAAGAAAGATAAACTATTTGTACATGAGATCTAGATCTATAT K K I L Q K K E R - T I C T - D L D L Y R K F F K K K K D K L F V H E I - I Y I Q E N S S K K R K I N Y L Y M R S R S I . . . . . . 1480 ATCCCGTGTAATTAATCGAAAATCAAGATAGTTTACAAAACTGAAACTTGTTCAAACTTG I P C N - S K I K I V Y K T E T C S N L S R V I N R K S R - F T K L K L V Q T C Y P V - L I E N Q D S L Q N - N L F K L . . . . . . 1420 TTCATTCCTATCAGATTGGTATATATGTGTGTATATATATATATGTCCATGTTTTTATAG F I P I R L V Y M C V Y I Y M S M F L - S F L S D W Y I C V Y I Y I C P C F Y R V H S Y Q I G I Y V C I Y I Y V H V F I . . . . . . 1360 ACTTGCTAAACCTTTAGCTTCAGATCCTTTGATAATTCTAAGCCTTTTGCAAGAACTGAT T C - T F S F R S F D N S K P F A R T D L A K P L A S D P L I I L S L L Q E L I D L L N L - L Q I L - - F - A F C K N - . . 1300 AAACCTCACCC K P H N L T - T S P Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-1 (+strand): . . . . . . 
1290 GGGTGAGGTTTATCAGTTCTTGCAAAAGGCTTAGAATTATCAAAGGATCTGAAGCTAAAG G - G L S V L A K G L E L S K D L K L K G E V Y Q F L Q K A - N Y Q R I - S - R V R F I S S C K R L R I I K G S E A K . . . . . . 1350 GTTTAGCAAGTCTATAAAAACATGGACATATATATATATACACACATATATACCAATCTG V - Q V Y K N M D I Y I Y T H I Y T N L F S K S I K T W T Y I Y I H T Y I P I - G L A S L - K H G H I Y I Y T H I Y Q S . . . . . . 1410 ATAGGAATGAACAAGTTTGAACAAGTTTCAGTTTTGTAAACTATCTTGATTTTCGATTAA I G M N K F E Q V S V L - T I L I F D - - E - T S L N K F Q F C K L S - F S I N D R N E Q V - T S F S F V N Y L D F R L . . . . . . 1470 TTACACGGGATATATAGATCTAGATCTCATGTACAAATAGTTTATCTTTCTTTTTTTTGA L H G I Y R S R S H V Q I V Y L S F F - Y T G Y I D L D L M Y K - F I F L F F E I T R D I - I - I S C T N S L S F F F L . . . . . . 1530 AGAATTTTCTTGTGTATTATGTATGAGTATAAATATACGGGAAGAAATACAAAATTTTAG R I F L C I M Y E Y K Y T G R N T K F - E F S C V L C M S I N I R E E I Q N F S K N F L V Y Y V - V - I Y G K K Y K I L . . 1590 CACAATATGAA H N M T I - A Q Y E Maximal non-overlapping open reading frames (>= 64 codons): none ... finished at: Mon Aug 28 22:21:38 2006 ________________________________________________________________________________ Sequence 18: C06HBa0112G05.1-18, from 1 to 3327, both strands analyzed. ... started at: Mon Aug 28 22:21:38 2006 EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 2 EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 5 ... 
matches indexed, elapsed seconds = 5 HitsTableSize = 0 ******************************************************************************** EST sequence 2 +strand 575 n (File: SGN-E301194+) 1 CCTTTTAGGG CGTAGCTTAG CACTATATAT AGACGCTATG GCAAACCCTA TTCTGTAATT 61 CTGTTTTTGC CTCTCCATAA TAAAATTGCT CCCTCTCTTC CCGTGGACGT AGCCAATTTA 121 TTGGTGAACC ACGTAAATCT GTTGTCTTGT TTTTCGCGTT TATATATTTT CTCGTATTAT 181 CTCAAATTTC GCACAACACT CTTAATATTC ATAACTATCA TCTTTTCATA TTCATAACCT 241 CCAAATATTT AAATTAAACT TTAAGAAATC TTTTGGTATT CCTTCTATTC TATTTGTATA 301 AATTCAACTT CTTTATCTCA TGAAACCCCT ATCAAGATTA TTATTTTTAT TCTATAGTAA 361 AAATAGATGC TGAAAACTCT TGAATTTTGA TAGGATATGA AAGGAGTCGA TAAAAACTCA 421 GAGAGTTATG TACTAATTTT TACTTATTTT TTCATCTATA TATACATCAA TCTTATAAGA 481 ATAATGTCTA TATTGTATTT TTTTCTTAAA TATTCTGTTT CTTTTAGTCT TTTTTTTCAC 541 TCTGTTAGAC TTCTTAATTT AGTTTTCTAT GAATG Predicted gene structure (within gDNA segment 1423 to 3327): Exon 1 2060 2255 ( 196 n); cDNA 2 198 ( 197 n); score: 0.939 Intron 1 2256 2567 ( 312 n); Pd: 0.000 (s: 0.83), Pa: 0.239 (s: 0) Exon 2 2568 2594 ( 27 n); cDNA 199 222 ( 24 n); score: 0.593 Intron 2 2595 3155 ( 561 n); Pd: 0.995 (s: 0), Pa: 0.000 (s: 0.68) Exon 3 3156 3205 ( 50 n); cDNA 223 269 ( 47 n); score: 0.680 MATCH C06HBa0112G05.1-18+ SGN-E301194+ 0.886 273 0.475 C PGS_C06HBa0112G05.1-18+_SGN-E301194+ (2060 2255,2568 2594,3156 3205) Alignment (genomic DNA sequence = upper lines): CTGTTAGGGC GTAGCTTAGC ACTATATATA GACGCTATGG CAAACCCTAT TCTGTAATTC 2119 || ||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTTTAGGGC GTAGCTTAGC ACTATATATA GACGCTATGG CAAACCCTAT TCTGTAATTC 61 TGTTTTTGCC TCTCCATAAT AAAACTGCTC CCTCTCTTCC CCGTGGATGT AGCCAATTTA 2179 |||||||||| |||||||||| |||| ||||| |||||||| | ||||||| || |||||||||| TGTTTTTGCC TCTCCATAAT AAAATTGCTC CCTCTCTT-C CCGTGGACGT AGCCAATTTA 120 TTGGTGAACC ACGTAAATCT GTTGTCTTGT TTTTTGCG-T T-TATATTTT CACGTATTAT 2237 |||||||||| |||||||||| |||||||||| |||| ||| | | |||||||| | |||||||| TTGGTGAACC ACGTAAATCT GTTGTCTTGT 
TTTTCGCGTT TATATATTTT CTCGTATTAT 180 CTCGAATTTC GCACAACAAT AACATATGAC TGCTCAACTC AGCTGTGAAA TGAAAGAATA 2297 ||| |||||| |||||||| CTCAAATTTC GCACAACA.. .......... .......... .......... .......... 198 CATTTGCACT CCTAAATCCT CAATTGGCAA ACCCTGAAAT AAAAATCTCA AACTCCTTAC 2357 .......... .......... .......... .......... .......... .......... 198 AAATATAGAA ACTAAAAGGC CATAAGAAGA GCATCAATAA ACATAGTCAC AGCAGCACAA 2417 .......... .......... .......... .......... .......... .......... 198 TATCAATACA AAAACATTTT ACACTTTCAA AGAGTGAGGA ACCATCCATA TAATAAATTC 2477 .......... .......... .......... .......... .......... .......... 198 AAAGTGATAA AACGGAGAGA TGACAAACCT CACATAGCAA GTAGAGTCAG GGGCACGAAC 2537 .......... .......... .......... .......... .......... .......... 198 AATATAAACA CTCCATTGTT AATCCTTCAG AACATGACTC TTACAACACA TCTCATCGCA 2597 | | | | || || || | ||||| .......... .......... .......... CTC-TTAATA TT-CATAAC- TATCATC... 222 TTGTCCAAAC CTGCACATAC CTTCTCATTT TTTGTTCTAC CTTAATTAAA TAGAGAAACA 2657 .......... .......... .......... .......... .......... .......... 222 TTAGGATTTC AAGTTCAATA AAAGATGTTT ACAAACCATG AATTACTTTA ACAAATAAAT 2717 .......... .......... .......... .......... .......... .......... 222 GTCTCAGCGA TATATTTTTC CTTCAATATT CTAAAGGGAT GAAACTTTAT GGATGTCTTG 2777 .......... .......... .......... .......... .......... .......... 222 AAAATATGGA ACAAAAATTA ATTGTTTGTT CGCCAACTAA TAATATTCCT AAAAACAACT 2837 .......... .......... .......... .......... .......... .......... 222 CACAATAATA TGAATTCAGT ATTTTTGTGC CAAAAAGAGG AGCAACTATT ATGCTTTTAA 2897 .......... .......... .......... .......... .......... .......... 222 TTATTCAAGA GCATGGATAA TAACACTGAT GCAAGGAACA AAAAAATTTG GGCTAGATTA 2957 .......... .......... .......... .......... .......... .......... 222 TTTATGCAAG GAACAAAAAA ATTTGGGCTA GATTATTTAT GCAAGGAACA AATAATGATG 3017 .......... .......... .......... .......... .......... .......... 222 TTGCATTCTA CAAAGGGGTC TTCATTTTTC TATTTTAGCA TAGTAATCTC ATTCCTATTG 3077 .......... 
.......... .......... .......... .......... .......... 222 CAGAAGCCAA CAAAAGCCAT GACTTGCTTG GTTCATTTTT GAGTATTTCT AGCCTTTTCA 3137 .......... .......... .......... .......... .......... .......... 222 TTTCCCACAT GTTGTAACTT TCTCCTACTA AGCTAAGCTG ATAATATATA AATTTAACAG 3197 || | || || | | ||| || ||||| || |||| ||| .......... ........TT T-TCATATTC A--TAACCTC CAAATATTTA AATTAAACTT 261 TATAAAAT 3205 || |||| TAAGAAAT 269 hqPGS_C06HBa0112G05.1-18+_SGN-E301194+ (2060 2255) ******************************************************************************** EST sequence 1 +strand 641 n (File: SGN-E301078+) 1 CCTTTTAGGG TGTAGCTTAG CACTATATAT AGACGCTATG GCAAACCCTA TTCTGTAATT 61 CTGTTTTTGC CTCTCCATAA TAAAATTGCT CCCTCTCTTC CCGTGGACGT AGCCAATTTA 121 TTGGTGAACC ACGTAAAACT GGTGTCTTGG TTTTCGCGTT TATATATTTT CTCGTATTAT 181 CTCAAATTTC GCACAACACT CTTAATATTC ATAACTATCA TCTTTTCATA TTCATAACCT 241 CCAAATATTT AAATTAAACT TTAAGATATC TTTTGGTATT CCTTCTATTC TATTTGTATA 301 AATTCAACTT CTTTATCTCA TGAAACCCCT ATCAAGATTA TTATTTTTAT TCTATAGTAA 361 AAATAGATGC TGAAAACTCT TGAATTTTGA TAGGATATGA AAGGAGTCGA TAAAAACTCA 421 GAGAGTTATG TACTAATTTT TACTTATTTT TTCATCTATA TATACATCAA TCTTATAAGA 481 ATAATGTCTA TATTGTATTT TTTTCTTAAA TATTCTGTTT CTTTTAGTCT TTTTTTTCAC 541 TCTGTTAGAC TTCTTAATTT AGTTTTCTAT GAATGATTTA TTGTCGTATG TCTTTGAATT 601 TTGTAATTGT TACATTTTAT TATTCATTAC AATTTACATA T Predicted gene structure (within gDNA segment 1360 to 3327): Exon 1 2060 2255 ( 196 n); cDNA 2 198 ( 197 n); score: 0.918 MATCH C06HBa0112G05.1-18+ SGN-E301078+ 0.918 196 0.306 C PGS_C06HBa0112G05.1-18+_SGN-E301078+ (2060 2255) Alignment (genomic DNA sequence = upper lines): CTGTTAGGGC GTAGCTTAGC ACTATATATA GACGCTATGG CAAACCCTAT TCTGTAATTC 2119 || |||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTTTAGGGT GTAGCTTAGC ACTATATATA GACGCTATGG CAAACCCTAT TCTGTAATTC 61 TGTTTTTGCC TCTCCATAAT AAAACTGCTC CCTCTCTTCC CCGTGGATGT AGCCAATTTA 2179 |||||||||| |||||||||| |||| ||||| |||||||| | ||||||| || |||||||||| TGTTTTTGCC TCTCCATAAT AAAATTGCTC 
CCTCTCTT-C CCGTGGACGT AGCCAATTTA 120 TTGGTGAACC ACGTAAATCT GTTGTCTTGT TTTTTGCG-T T-TATATTTT CACGTATTAT 2237 |||||||||| ||||||| || | ||||||| |||| ||| | | |||||||| | |||||||| TTGGTGAACC ACGTAAAACT GGTGTCTTGG TTTTCGCGTT TATATATTTT CTCGTATTAT 180 CTCGAATTTC GCACAACA 2255 ||| |||||| |||||||| CTCAAATTTC GCACAACA 198 hqPGS_C06HBa0112G05.1-18+_SGN-E301078+ (2060 2255) Total number of EST alignments reported: 2 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 3327: PGL 1 (+ strand): 2060 2255 AGS-1 (2060 2255) SCR (e 0.939) Exon 1 2060 2255 ( 196 n); score: 0.939 PGS (2060 2255) SGN-E301194+ PGS (2060 2255) SGN-E301078+ 3-phase translation of AGS-1 (+strand): . . . . . . 2060 CTGTTAGGGCGTAGCTTAGCACTATATATAGACGCTATGGCAAACCCTATTCTGTAATTC L L G R S L A L Y I D A M A N P I L - F C - G V A - H Y I - T L W Q T L F C N S V R A - L S T I Y R R Y G K P Y S V I . . . . . . 2120 TGTTTTTGCCTCTCCATAATAAAACTGCTCCCTCTCTTCCCCGTGGATGTAGCCAATTTA C F C L S I I K L L P L F P V D V A N L V F A S P - - N C S L S S P W M - P I Y L F L P L H N K T A P S L P R G C S Q F . . . . . . 2180 TTGGTGAACCACGTAAATCTGTTGTCTTGTTTTTTGCGTTTATATTTTCACGTATTATCT L V N H V N L L S C F L R L Y F H V L S W - T T - I C C L V F C V Y I F T Y Y L I G E P R K S V V L F F A F I F S R I I . . 2240 CGAATTTCGCACAACA R I S H N E F R T T S N F A Q Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-1 (-strand): . . . . . . 2255 TGTTGTGCGAAATTCGAGATAATACGTGAAAATATAAACGCAAAAAACAAGACAACAGAT C C A K F E I I R E N I N A K N K T T D V V R N S R - Y V K I - T Q K T R Q Q I L C E I R D N T - K Y K R K K Q D N R . . . . . . 2195 TTACGTGGTTCACCAATAAATTGGCTACATCCACGGGGAAGAGAGGGAGCAGTTTTATTA L R G S P I N W L H P R G R E G A V L L Y V V H Q - I G Y I H G E E R E Q F Y Y F T W F T N K L A T S T G K R G S S F I . . . . . . 
2135 TGGAGAGGCAAAAACAGAATTACAGAATAGGGTTTGCCATAGCGTCTATATATAGTGCTA W R G K N R I T E - G L P - R L Y I V L G E A K T E L Q N R V C H S V Y I - C - M E R Q K Q N Y R I G F A I A S I Y S A . . 2075 AGCTACGCCCTAACAG S Y A L T A T P - Q K L R P N Maximal non-overlapping open reading frames (>= 64 codons): none ... finished at: Mon Aug 28 22:21:49 2006 ________________________________________________________________________________ Sequence 19: C06HBa0112G05.1-19, from 1 to 4860, both strands analyzed. ... started at: Mon Aug 28 22:21:49 2006 EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 5 EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 2 ******************************************************************************** EST sequence 1 -strand 684 n (File: SGN-E371571-) 1 GATCCATTCC ACTGCATGCC GTGGGGGTTG CAGGGGGTGG GTGTATAACA GCAGGTGGTG 61 GGTGTTGATC AGTTGCCGAA TGATCTGGTT TAGAGGAATG AGGTCGCATT GGAATTCCAA 121 CATACTCTGG TGGAATTGCT GGTGAGCTAG CATCATTATA ATCATCATCA TTATTATTAT 181 TATTATCCTG ATTATAATCA CCATTATCAT CATCAGCCAT AATAGCCCTA CGTGCTTGTA 241 ATAATCTTGT ACAGGATGTT GTTGAGGATA AACAAAAAAT TAAGAAAAAA ATGGCTATGA 301 ATAAAAAAGA GAATGTTTTG TGAATCATGG TTTTTAAAAT CTACCAGAAG TATGGAGTAA 361 TTTCAATGTG TATTATCCAA AGAAGACAAT GATTTAAATA CAAGTAAAGA AAATACTAAC 421 TTAGGTAAGT ACAAATCTAA ATCCCTAAGA ATCAGCTTAG AATCATATAT TACTAAAAGC 481 ATGAACAAAA AAGTTGAAAG TTGAATTACG ATTTTACTCC TTCTATTAAT TAAATTGTTA 541 TAAAATATTT AAATATTTAA TTATATAAAT TTAATTAAAT GTTAATGACT TTTAGAATTC 601 CTAAGATTAA TTCCTAATTA AATATCTAAT CTATTAATAT TTATCATCTA TAATCTATAT 661 GTATATAATA ATTACAAAAA AAAA Predicted gene structure (within gDNA segment 4860 to 1): Exon 1 4239 4228 ( 12 n); cDNA 401 412 ( 12 n); score: 0.833 Intron 1 4227 1351 
(2877 n); Pd: 0.000 (s: 0), Pa: 0.998 (s: 0) Exon 2 1350 1319 ( 32 n); cDNA 413 442 ( 30 n); score: 0.562 Intron 2 1318 304 (1015 n); Pd: 0.923 (s: 0), Pa: 0.000 (s: 0.77) Exon 3 303 72 ( 232 n); cDNA 443 673 ( 231 n); score: 0.881 PPA cDNA 674 684 MATCH C06HBa0112G05.1-19- SGN-E371571- 0.881 276 0.404 C PGS_C06HBa0112G05.1-19-_SGN-E371571- (4239 4228,1350 1319,303 72) Alignment (genomic DNA sequence = upper lines): CAAGAAAGGA AAATTTGTTT GATCAGATAA GATCAAGTGC TCCAACAGAT TTGTTGCCTC 4180 |||| || || || CAAGTAAAGA AA........ .......... .......... .......... .......... 412 GTCAAATTAC CATTGATTAT GATGATGATG AGGAGCACTT TGGGCTTAAT TTTGTCATTT 4120 .......... .......... .......... .......... .......... .......... 412 TTGATTCAAA TAAGAAAAGG CATTCTGGTA AACACATCTA TTCTTTGAGG ATTTTTGGAG 4060 .......... .......... .......... .......... .......... .......... 412 ACGAGCTGGA TGACAGTCTT TTTGATACAT TTCACCTAAG ACACTTGAGG CTTCTTAGAG 4000 .......... .......... .......... .......... .......... .......... 412 TGTTGGTCCT GGATACCTCT TTTATCATGG TGAACGATTC TTTGCTGAAT GAAATATGCA 3940 .......... .......... .......... .......... .......... .......... 412 TGTTGAATCA TTTGAGGTAC TTAAGAATTG GGACACAAGT TAAATATCTG CCTTTGTCTT 3880 .......... .......... .......... .......... .......... .......... 412 TCTCAAACCT CTGGAATCTA GAATTATTGT GGGTTGAAAA CAAAGAATCA ACCTTGATAC 3820 .......... .......... .......... .......... .......... .......... 412 TATTACCAAG AATTTGGGAT CTTGTAAAGC TGCGAGTGCT GTTCGCGGAT GCTTGTTCTT 3760 .......... .......... .......... .......... .......... .......... 412 TCTTTGATAT GGATGCAGAT GAATCAATAT TGATAGCAGA GGACACAAAG TTAGAGAAGT 3700 .......... .......... .......... .......... .......... .......... 412 TGAGAATATT AGGGGAACTG TTGATTTCCT ATTCGAAAGA TACAAAGAAT ATTTTCAAAA 3640 .......... .......... .......... .......... .......... .......... 412 GGTTTCCCAA TCTTCAGATG CTTCAGTTTG AACTCAAGGA GTCATGGGAT TATTCAACAG 3580 .......... .......... .......... .......... .......... .......... 
412 AGCAACATTG GTTCCCGAAA TTGGATTGCC TAACTGAACT AGAAATACTC AATGTAGGTT 3520 .......... .......... .......... .......... .......... .......... 412 TTAAAAGTTC AAACACAAAC CACAGTGGGT CCTCTGTAAA GACAAATCGG CCGTGGGATT 3460 .......... .......... .......... .......... .......... .......... 412 TTCACTTTCC TTCAAATTTG AAACAACTGT CATTGCATGA CTTTCCTCTG ACATCCGATT 3400 .......... .......... .......... .......... .......... .......... 412 CACTATCAAC AATAGCGAGA CTGCCCAACC TTGAAGAGTT GTCCCTTTAT GATGCAATCA 3340 .......... .......... .......... .......... .......... .......... 412 TCCAGGGAGA AGAATGGAAC ATGGGGGAGG AAGACACCTT TGAGAATCTC AAATTTTTGA 3280 .......... .......... .......... .......... .......... .......... 412 ACTTGCGTCT AGCGACTCTT TCCAAGTGGG AGGTTGGAGA GGAATCCTTC CCCAATCTTG 3220 .......... .......... .......... .......... .......... .......... 412 AGAAGTTAAA ACTGCAGGGA TGTCGTAAGC TTGAGGAGAT TCCACCTAGT TTTGGAGATA 3160 .......... .......... .......... .......... .......... .......... 412 TTTATTCATT GAAAGTTATC AAAATTGTAA AGAGTCCTCA ACTTGAAGAT TCTGCTCTCA 3100 .......... .......... .......... .......... .......... .......... 412 AGATTAAGGA ATACGCTGAA GATATGAGAG GAGGGAGCGA GCTTCAGATC CTTGGCCAGA 3040 .......... .......... .......... .......... .......... .......... 412 AGAATATCCC CTTATTTAAG TAGCATTATG GTTGAACTTT GCTGGGTGAT ATTGTATATG 2980 .......... .......... .......... .......... .......... .......... 412 ATTAAAATAT CCTGTTATGA GATTCCTCTT AGTTTCTTTT AACAAAAAAT ATAATTTTTA 2920 .......... .......... .......... .......... .......... .......... 412 TAAGTACACG TATCATTTGT TAATTTGTCC AGATTGAAGT AACTATCTGA AGTTCATATT 2860 .......... .......... .......... .......... .......... .......... 412 ATAAACATTA ATCTTGTATA CCAAACTACT ATTCCTATGC TATGTTGTTT GCCATTGTCG 2800 .......... .......... .......... .......... .......... .......... 412 TTCTCTCTTT ATTTTTTTTC TTTCCATTCA CATACACATT AATTTTCTAG TAGACTGCAT 2740 .......... .......... .......... .......... .......... .......... 
412 ATTACTACAT CTGTATTATC CGTATGCAAG AGGAATCCAG GATTTGATGT TTACAAGTAT 2680 .......... .......... .......... .......... .......... .......... 412 TTGTGGCGAC CTCATGTTAA TATCAATAAC AATTAGATTC ACATATGTAT AGGATTTTGA 2620 .......... .......... .......... .......... .......... .......... 412 CAGAAATTGA GGGATTCACA TGAATTCATA GATTACTCCG TGGATTTGCC TTTGGCTGTC 2560 .......... .......... .......... .......... .......... .......... 412 CAAACCTCCT TTATGTCTAA CTTCGTCTGA AGTCCCATTT ATATGCTCAA AGCTTAGTCA 2500 .......... .......... .......... .......... .......... .......... 412 AGGTACTGAT TTAAAACGAT ATTGATACTA CTCTATAACA AACCCAGCGA ACTTTCATCA 2440 .......... .......... .......... .......... .......... .......... 412 CAAAAGCTAG GCCGTGTAGT GAACTTTAAA ATGATATTGC TGCAAAGTCG CTCAACAAAG 2380 .......... .......... .......... .......... .......... .......... 412 GGTCATAACC AGCACTACAA CTACACAAGG CTCAAGCAAG TATACGCGGG TGAAAGATTA 2320 .......... .......... .......... .......... .......... .......... 412 ACATAGATCG CTATCCCCCG CAAAAGCTAA GGAAAGCATC TCTAACTTCT TAGCATGGAC 2260 .......... .......... .......... .......... .......... .......... 412 CCAGATGTAC TCAAACACAC GATCTGTAAG GATGCCAGAA AGAGAAAGTT GCTGCAATTC 2200 .......... .......... .......... .......... .......... .......... 412 CTTGCAGTGT TGCACAATGT CCCAAAACCA GCATCAAGTG GTTCAAGTCA GGAGTTCGAG 2140 .......... .......... .......... .......... .......... .......... 412 GCTCAATAAT AAATGAAATT GGATCATGTT AGGACGGTTC CTAGCAATAG TAACTAAGGC 2080 .......... .......... .......... .......... .......... .......... 412 GTCATTTATC ATTTGGAGGC AGAAGTATAA AACTGACTGA AGCTTAGGGC AGCCCATGGA 2020 .......... .......... .......... .......... .......... .......... 412 GACACTTACA AGGCCTTGCT CTGTCAAGGA TACATTAGGT CATGGATCAG AAGGAAACAC 1960 .......... .......... .......... .......... .......... .......... 412 CCTAAGATCT TGAAGTTCCT TACTAGTGTT GGTAATCTCC TCTATCACAT CCATTCCAAG 1900 .......... .......... .......... .......... .......... .......... 
412 TGCCTTAAAG CGTAAGCTGG ATAACCAATT ATGCAAAAGA GCCACTGAAC AAGAAGAGCT 1840 .......... .......... .......... .......... .......... .......... 412 CAAAAGCTAA TGAAGCGTCT AACTAGGAAA GATAAACATG AATTGTCAAA ATGTAGGATT 1780 .......... .......... .......... .......... .......... .......... 412 TGTGCAACAT CAAGTTATTC TTCATATGGC AGACCACTAA ATTATTCAAA GAGTTCACTC 1720 .......... .......... .......... .......... .......... .......... 412 GACATCAATT GTTTAACTAT GTTATCTCAC TAAACACTGA AGTTCATTAC ACTGTTTATT 1660 .......... .......... .......... .......... .......... .......... 412 TATCCCTGGC ATTTATCAAA AGCAGCAACA CAATTAATAA ATTAGGATTA TTGTTCTTTC 1600 .......... .......... .......... .......... .......... .......... 412 TCTTGGAAAC TTCTGGTTTT ACAACTAACA ATCAATTTAT CTCTTTTATC ACAAACACAG 1540 .......... .......... .......... .......... .......... .......... 412 TTCATGTATC CCTTCTCAGA AAAAAAACAG TTAATGTATC GAGCCATGAC TATTGAAAAC 1480 .......... .......... .......... .......... .......... .......... 412 ACGCACCAGC AACAGGTATC AGATAACTCC ACTGAAATTT GAACCTGAGT TCTCTCAATC 1420 .......... .......... .......... .......... .......... .......... 412 ACTTCATTGA CTTCAAACTG AGACTATTGA AAGTTTGATT AACTTTCTTA TCTCTTTCCT 1360 .......... .......... .......... .......... .......... .......... 412 TTGTGGCAGA TAAGCATACT TGTGCACCGG CAAAGTTAAA GGTCACAATT ATCCACCGAA 1300 | | | | ||| | | | |||| |||| .........A T-A-CTAACT TAGGTAAGTA CAAATCTAAA T......... .......... 442 GTCAAGCTAA ATGTCATTTT CTAGTAAGAT TTTAATCAAG CACATTATCT ACTAAATATA 1240 .......... .......... .......... .......... .......... .......... 442 TAGCGAGTTA GTATCATTAT ATTTTGTCTA CAAATTAAAT TTCGATTACT CTGGGTAAAC 1180 .......... .......... .......... .......... .......... .......... 442 AAGCCATATA GTAGGCTATT AATATTTATA GTTGAGAATG AATGGTTGTA CAGTCCCAAT 1120 .......... .......... .......... .......... .......... .......... 442 GTTCAAACGT TATTTATATC ATTGAAGTGA CTGGACTATG TGTTAGCTAT CGAGGAAGTA 1060 .......... .......... .......... .......... .......... 
.......... 442 TAAGTCAAAT ATTTGGGCAC TGAAATATAA CATGTCAGCA GTTATTGCAG GGTAGAAATG 1000 .......... .......... .......... .......... .......... .......... 442 TGGCTTTGTA AATTTGATTG GTCTTTTGGC AGATGTCATT TTCTGCTCTT CGCAAGGACG 940 .......... .......... .......... .......... .......... .......... 442 TTGCCAATGT TCTGGAGAGA TTAAAGAATG AACAAGATCA AAAGGATGTT GATGTGGATC 880 .......... .......... .......... .......... .......... .......... 442 TAATTGAAAA GCGAAAATTG GAGCTGGCAT TTATTTGTAC ATATGTTCAG CTTTCTTATT 820 .......... .......... .......... .......... .......... .......... 442 CCGATTTGGA TCAGTTTGAA GATATAATGA CGAGAGGTTG AGAATCTGCT TCAAGCAATT 760 .......... .......... .......... .......... .......... .......... 442 TTGGATGATG ATGTCCTTAC TAGCCTCGCC GGTAATATGG ATGATCATCG TCATTCTAAA 700 .......... .......... .......... .......... .......... .......... 442 TCAGATGCCA TCATGATGGA TGAGCAATTG TACTTCCTCC TCTTGAATCT CTATCATCTA 640 .......... .......... .......... .......... .......... .......... 442 GCGAGGCATC GTGCTGAAAA GATGTTTCCT GGAGTATGAG GTTCTTCAGA ATGTATGTAG 580 .......... .......... .......... .......... .......... .......... 442 AAACGTAAGA GATTTCCATG GATTGATAGT GAATGGTTGC ATTGATCACG AGATTGTTGA 520 .......... .......... .......... .......... .......... .......... 442 ATGTGTCTTA CCTCTGTTTC AACTGATGGC TGAGAGAGTA GGACACTTCC TTTGGGAGGA 460 .......... .......... .......... .......... .......... .......... 442 AGAAATTATG CTGATAATCT TGAAAATCAG CAATTTTGAA TTTTTGATTA TATAAATCTT 400 .......... .......... .......... .......... .......... .......... 442 TGTGCTTCAC ACGTGTGTTT CATGTGAATT TTTATATATA CGTTAGAATG ACTAAATTTC 340 .......... .......... .......... .......... .......... .......... 442 ATGGAAATAT ATATGCAAAA TGTAATGAAT AATAAAATGT AACAATCATA TCA-TATCAT 281 | || ||||| | | ||||| .......... .......... .......... 
......CCCT AAGAATCAGC TTAGAATCAT 466 ATATTACTAA AAGCATGAAC AAAAAAAGTT GAAAGTTGAA TTACAGTTTT ACCCCTTCTA 221 |||||||||| |||||||||| | |||||||| |||||||||| |||| |||| || ||||||| ATATTACTAA AAGCATGAAC A-AAAAAGTT GAAAGTTGAA TTACGATTTT ACTCCTTCTA 525 TAAATTAAAT TGTTATAAAA TATTTAAATA TTTAATTGTA TAAATTTAAT TAAAAGGTTA 161 | |||||||| |||||||||| |||||||||| ||||||| || |||||||||| | ||| |||| TTAATTAAAT TGTTATAAAA TATTTAAATA TTTAATTATA TAAATTTAAT T-AAATGTTA 584 ATGACTTTTA GACTTCTAAA GATTGATTTC TAATTAAATA TCCAATTAAT TAATATTTAT 101 |||||||||| || ||| || |||| ||| | |||||||||| || ||| || |||||||||| ATGACTTTTA GAATTCCTAA GATTAATTCC TAATTAAATA TCTAATCTAT TAATATTTAT 644 CATCTATAAT CTATATGTAT ATTATAATT 72 |||||||||| |||||||||| || |||||| CATCTATAAT CTATATGTAT ATAATAATT 673 hqPGS_C06HBa0112G05.1-19-_SGN-E371571- (303 72) ******************************************************************************** EST sequence 2 -strand 599 n (File: SGN-E370504-) 1 TGGTTTAGAG GAATGAGGTC GCATTGGAAT TCCAACATAC TCTGGTGGAA TTGCTGGTGA 61 GCTAGCATCA TTATAATCAT CATCATTATT ATTATTATTA TCCTGATTAT AATCACCATT 121 ATCATCATCA GCCATAATAG CCCTACGTGC TTGTAATAAT CTTGTACAGG ATGTTGTTGA 181 GGATAAACAA AAAATTAAGA AAAAAATGGC TATGAATAAA AAAGAGAATG TTTTGTGAAT 241 CATGGTTTTT AAAATCTACC AGAAGTATGG AGTAATTTCA ATGTGTATTA TCCAAAGAAG 301 ACAATGATTT AAATACAAGT AAAGAAAATA CTAACTTAGG TAAGTACAAA TCTAAATCCC 361 TAAGAATCAG CTTAGAATCA TATATTACTA AAAGCATGAA CAAAAAAGTT GAAAGTTGAA 421 TTACGATTTT ACTCCTTCTA TTAATTAAAT TGTTATAAAA TATTTAAATA TTTAATTATA 481 TAAATTTAAT TAAATGTTAA TGACTTTTAG AATTCCTAAG ATTAATTCCT AATTAAATAT 541 CTAATCTATT AATATTTATC ATCTATAATC TATATGTATA TAATAATTAC AAAAAAAAA Predicted gene structure (within gDNA segment 4860 to 1): Exon 1 4239 4228 ( 12 n); cDNA 316 327 ( 12 n); score: 0.833 Intron 1 4227 1351 (2877 n); Pd: 0.000 (s: 0), Pa: 0.998 (s: 0) Exon 2 1350 1319 ( 32 n); cDNA 328 357 ( 30 n); score: 0.562 Intron 2 1318 304 (1015 n); Pd: 0.923 (s: 0), Pa: 0.000 (s: 0.77) Exon 3 303 72 ( 232 n); cDNA 358 588 ( 231 n); score: 0.881 
PPA cDNA 589 599 MATCH C06HBa0112G05.1-19- SGN-E370504- 0.881 276 0.461 C PGS_C06HBa0112G05.1-19-_SGN-E370504- (4239 4228,1350 1319,303 72) Alignment (genomic DNA sequence = upper lines): CAAGAAAGGA AAATTTGTTT GATCAGATAA GATCAAGTGC TCCAACAGAT TTGTTGCCTC 4180 |||| || || || CAAGTAAAGA AA........ .......... .......... .......... .......... 327 GTCAAATTAC CATTGATTAT GATGATGATG AGGAGCACTT TGGGCTTAAT TTTGTCATTT 4120 .......... .......... .......... .......... .......... .......... 327 TTGATTCAAA TAAGAAAAGG CATTCTGGTA AACACATCTA TTCTTTGAGG ATTTTTGGAG 4060 .......... .......... .......... .......... .......... .......... 327 ACGAGCTGGA TGACAGTCTT TTTGATACAT TTCACCTAAG ACACTTGAGG CTTCTTAGAG 4000 .......... .......... .......... .......... .......... .......... 327 TGTTGGTCCT GGATACCTCT TTTATCATGG TGAACGATTC TTTGCTGAAT GAAATATGCA 3940 .......... .......... .......... .......... .......... .......... 327 TGTTGAATCA TTTGAGGTAC TTAAGAATTG GGACACAAGT TAAATATCTG CCTTTGTCTT 3880 .......... .......... .......... .......... .......... .......... 327 TCTCAAACCT CTGGAATCTA GAATTATTGT GGGTTGAAAA CAAAGAATCA ACCTTGATAC 3820 .......... .......... .......... .......... .......... .......... 327 TATTACCAAG AATTTGGGAT CTTGTAAAGC TGCGAGTGCT GTTCGCGGAT GCTTGTTCTT 3760 .......... .......... .......... .......... .......... .......... 327 TCTTTGATAT GGATGCAGAT GAATCAATAT TGATAGCAGA GGACACAAAG TTAGAGAAGT 3700 .......... .......... .......... .......... .......... .......... 327 TGAGAATATT AGGGGAACTG TTGATTTCCT ATTCGAAAGA TACAAAGAAT ATTTTCAAAA 3640 .......... .......... .......... .......... .......... .......... 327 GGTTTCCCAA TCTTCAGATG CTTCAGTTTG AACTCAAGGA GTCATGGGAT TATTCAACAG 3580 .......... .......... .......... .......... .......... .......... 327 AGCAACATTG GTTCCCGAAA TTGGATTGCC TAACTGAACT AGAAATACTC AATGTAGGTT 3520 .......... .......... .......... .......... .......... .......... 327 TTAAAAGTTC AAACACAAAC CACAGTGGGT CCTCTGTAAA GACAAATCGG CCGTGGGATT 3460 .......... .......... .......... 
.......... .......... .......... 327 TTCACTTTCC TTCAAATTTG AAACAACTGT CATTGCATGA CTTTCCTCTG ACATCCGATT 3400 .......... .......... .......... .......... .......... .......... 327 CACTATCAAC AATAGCGAGA CTGCCCAACC TTGAAGAGTT GTCCCTTTAT GATGCAATCA 3340 .......... .......... .......... .......... .......... .......... 327 TCCAGGGAGA AGAATGGAAC ATGGGGGAGG AAGACACCTT TGAGAATCTC AAATTTTTGA 3280 .......... .......... .......... .......... .......... .......... 327 ACTTGCGTCT AGCGACTCTT TCCAAGTGGG AGGTTGGAGA GGAATCCTTC CCCAATCTTG 3220 .......... .......... .......... .......... .......... .......... 327 AGAAGTTAAA ACTGCAGGGA TGTCGTAAGC TTGAGGAGAT TCCACCTAGT TTTGGAGATA 3160 .......... .......... .......... .......... .......... .......... 327 TTTATTCATT GAAAGTTATC AAAATTGTAA AGAGTCCTCA ACTTGAAGAT TCTGCTCTCA 3100 .......... .......... .......... .......... .......... .......... 327 AGATTAAGGA ATACGCTGAA GATATGAGAG GAGGGAGCGA GCTTCAGATC CTTGGCCAGA 3040 .......... .......... .......... .......... .......... .......... 327 AGAATATCCC CTTATTTAAG TAGCATTATG GTTGAACTTT GCTGGGTGAT ATTGTATATG 2980 .......... .......... .......... .......... .......... .......... 327 ATTAAAATAT CCTGTTATGA GATTCCTCTT AGTTTCTTTT AACAAAAAAT ATAATTTTTA 2920 .......... .......... .......... .......... .......... .......... 327 TAAGTACACG TATCATTTGT TAATTTGTCC AGATTGAAGT AACTATCTGA AGTTCATATT 2860 .......... .......... .......... .......... .......... .......... 327 ATAAACATTA ATCTTGTATA CCAAACTACT ATTCCTATGC TATGTTGTTT GCCATTGTCG 2800 .......... .......... .......... .......... .......... .......... 327 TTCTCTCTTT ATTTTTTTTC TTTCCATTCA CATACACATT AATTTTCTAG TAGACTGCAT 2740 .......... .......... .......... .......... .......... .......... 327 ATTACTACAT CTGTATTATC CGTATGCAAG AGGAATCCAG GATTTGATGT TTACAAGTAT 2680 .......... .......... .......... .......... .......... .......... 327 TTGTGGCGAC CTCATGTTAA TATCAATAAC AATTAGATTC ACATATGTAT AGGATTTTGA 2620 .......... .......... .......... .......... .......... 
.......... 327 CAGAAATTGA GGGATTCACA TGAATTCATA GATTACTCCG TGGATTTGCC TTTGGCTGTC 2560 .......... .......... .......... .......... .......... .......... 327 CAAACCTCCT TTATGTCTAA CTTCGTCTGA AGTCCCATTT ATATGCTCAA AGCTTAGTCA 2500 .......... .......... .......... .......... .......... .......... 327 AGGTACTGAT TTAAAACGAT ATTGATACTA CTCTATAACA AACCCAGCGA ACTTTCATCA 2440 .......... .......... .......... .......... .......... .......... 327 CAAAAGCTAG GCCGTGTAGT GAACTTTAAA ATGATATTGC TGCAAAGTCG CTCAACAAAG 2380 .......... .......... .......... .......... .......... .......... 327 GGTCATAACC AGCACTACAA CTACACAAGG CTCAAGCAAG TATACGCGGG TGAAAGATTA 2320 .......... .......... .......... .......... .......... .......... 327 ACATAGATCG CTATCCCCCG CAAAAGCTAA GGAAAGCATC TCTAACTTCT TAGCATGGAC 2260 .......... .......... .......... .......... .......... .......... 327 CCAGATGTAC TCAAACACAC GATCTGTAAG GATGCCAGAA AGAGAAAGTT GCTGCAATTC 2200 .......... .......... .......... .......... .......... .......... 327 CTTGCAGTGT TGCACAATGT CCCAAAACCA GCATCAAGTG GTTCAAGTCA GGAGTTCGAG 2140 .......... .......... .......... .......... .......... .......... 327 GCTCAATAAT AAATGAAATT GGATCATGTT AGGACGGTTC CTAGCAATAG TAACTAAGGC 2080 .......... .......... .......... .......... .......... .......... 327 GTCATTTATC ATTTGGAGGC AGAAGTATAA AACTGACTGA AGCTTAGGGC AGCCCATGGA 2020 .......... .......... .......... .......... .......... .......... 327 GACACTTACA AGGCCTTGCT CTGTCAAGGA TACATTAGGT CATGGATCAG AAGGAAACAC 1960 .......... .......... .......... .......... .......... .......... 327 CCTAAGATCT TGAAGTTCCT TACTAGTGTT GGTAATCTCC TCTATCACAT CCATTCCAAG 1900 .......... .......... .......... .......... .......... .......... 327 TGCCTTAAAG CGTAAGCTGG ATAACCAATT ATGCAAAAGA GCCACTGAAC AAGAAGAGCT 1840 .......... .......... .......... .......... .......... .......... 327 CAAAAGCTAA TGAAGCGTCT AACTAGGAAA GATAAACATG AATTGTCAAA ATGTAGGATT 1780 .......... .......... .......... .......... .......... .......... 
327 TGTGCAACAT CAAGTTATTC TTCATATGGC AGACCACTAA ATTATTCAAA GAGTTCACTC 1720 .......... .......... .......... .......... .......... .......... 327 GACATCAATT GTTTAACTAT GTTATCTCAC TAAACACTGA AGTTCATTAC ACTGTTTATT 1660 .......... .......... .......... .......... .......... .......... 327 TATCCCTGGC ATTTATCAAA AGCAGCAACA CAATTAATAA ATTAGGATTA TTGTTCTTTC 1600 .......... .......... .......... .......... .......... .......... 327 TCTTGGAAAC TTCTGGTTTT ACAACTAACA ATCAATTTAT CTCTTTTATC ACAAACACAG 1540 .......... .......... .......... .......... .......... .......... 327 TTCATGTATC CCTTCTCAGA AAAAAAACAG TTAATGTATC GAGCCATGAC TATTGAAAAC 1480 .......... .......... .......... .......... .......... .......... 327 ACGCACCAGC AACAGGTATC AGATAACTCC ACTGAAATTT GAACCTGAGT TCTCTCAATC 1420 .......... .......... .......... .......... .......... .......... 327 ACTTCATTGA CTTCAAACTG AGACTATTGA AAGTTTGATT AACTTTCTTA TCTCTTTCCT 1360 .......... .......... .......... .......... .......... .......... 327 TTGTGGCAGA TAAGCATACT TGTGCACCGG CAAAGTTAAA GGTCACAATT ATCCACCGAA 1300 | | | | ||| | | | |||| |||| .........A T-A-CTAACT TAGGTAAGTA CAAATCTAAA T......... .......... 357 GTCAAGCTAA ATGTCATTTT CTAGTAAGAT TTTAATCAAG CACATTATCT ACTAAATATA 1240 .......... .......... .......... .......... .......... .......... 357 TAGCGAGTTA GTATCATTAT ATTTTGTCTA CAAATTAAAT TTCGATTACT CTGGGTAAAC 1180 .......... .......... .......... .......... .......... .......... 357 AAGCCATATA GTAGGCTATT AATATTTATA GTTGAGAATG AATGGTTGTA CAGTCCCAAT 1120 .......... .......... .......... .......... .......... .......... 357 GTTCAAACGT TATTTATATC ATTGAAGTGA CTGGACTATG TGTTAGCTAT CGAGGAAGTA 1060 .......... .......... .......... .......... .......... .......... 357 TAAGTCAAAT ATTTGGGCAC TGAAATATAA CATGTCAGCA GTTATTGCAG GGTAGAAATG 1000 .......... .......... .......... .......... .......... .......... 357 TGGCTTTGTA AATTTGATTG GTCTTTTGGC AGATGTCATT TTCTGCTCTT CGCAAGGACG 940 .......... .......... .......... .......... .......... 
.......... 357 TTGCCAATGT TCTGGAGAGA TTAAAGAATG AACAAGATCA AAAGGATGTT GATGTGGATC 880 .......... .......... .......... .......... .......... .......... 357 TAATTGAAAA GCGAAAATTG GAGCTGGCAT TTATTTGTAC ATATGTTCAG CTTTCTTATT 820 .......... .......... .......... .......... .......... .......... 357 CCGATTTGGA TCAGTTTGAA GATATAATGA CGAGAGGTTG AGAATCTGCT TCAAGCAATT 760 .......... .......... .......... .......... .......... .......... 357 TTGGATGATG ATGTCCTTAC TAGCCTCGCC GGTAATATGG ATGATCATCG TCATTCTAAA 700 .......... .......... .......... .......... .......... .......... 357 TCAGATGCCA TCATGATGGA TGAGCAATTG TACTTCCTCC TCTTGAATCT CTATCATCTA 640 .......... .......... .......... .......... .......... .......... 357 GCGAGGCATC GTGCTGAAAA GATGTTTCCT GGAGTATGAG GTTCTTCAGA ATGTATGTAG 580 .......... .......... .......... .......... .......... .......... 357 AAACGTAAGA GATTTCCATG GATTGATAGT GAATGGTTGC ATTGATCACG AGATTGTTGA 520 .......... .......... .......... .......... .......... .......... 357 ATGTGTCTTA CCTCTGTTTC AACTGATGGC TGAGAGAGTA GGACACTTCC TTTGGGAGGA 460 .......... .......... .......... .......... .......... .......... 357 AGAAATTATG CTGATAATCT TGAAAATCAG CAATTTTGAA TTTTTGATTA TATAAATCTT 400 .......... .......... .......... .......... .......... .......... 357 TGTGCTTCAC ACGTGTGTTT CATGTGAATT TTTATATATA CGTTAGAATG ACTAAATTTC 340 .......... .......... .......... .......... .......... .......... 357 ATGGAAATAT ATATGCAAAA TGTAATGAAT AATAAAATGT AACAATCATA TCA-TATCAT 281 | || ||||| | | ||||| .......... .......... .......... 
......CCCT AAGAATCAGC TTAGAATCAT 381 ATATTACTAA AAGCATGAAC AAAAAAAGTT GAAAGTTGAA TTACAGTTTT ACCCCTTCTA 221 |||||||||| |||||||||| | |||||||| |||||||||| |||| |||| || ||||||| ATATTACTAA AAGCATGAAC A-AAAAAGTT GAAAGTTGAA TTACGATTTT ACTCCTTCTA 440 TAAATTAAAT TGTTATAAAA TATTTAAATA TTTAATTGTA TAAATTTAAT TAAAAGGTTA 161 | |||||||| |||||||||| |||||||||| ||||||| || |||||||||| | ||| |||| TTAATTAAAT TGTTATAAAA TATTTAAATA TTTAATTATA TAAATTTAAT T-AAATGTTA 499 ATGACTTTTA GACTTCTAAA GATTGATTTC TAATTAAATA TCCAATTAAT TAATATTTAT 101 |||||||||| || ||| || |||| ||| | |||||||||| || ||| || |||||||||| ATGACTTTTA GAATTCCTAA GATTAATTCC TAATTAAATA TCTAATCTAT TAATATTTAT 559 CATCTATAAT CTATATGTAT ATTATAATT 72 |||||||||| |||||||||| || |||||| CATCTATAAT CTATATGTAT ATAATAATT 588 hqPGS_C06HBa0112G05.1-19-_SGN-E370504- (303 72) ******************************************************************************** EST sequence 3 -strand 530 n (File: SGN-E280629-) 1 ATTATAATCA TCATCATTAT TATTATTATT ATCCTGATTA TAATCACCAT TATCATCATC 61 AGCCATAATA GCCCTACGTG CTTGTAATAA TCTTGTACAG GATGTTGTTG AGGATAAACA 121 AAAAATTAAG AAAAAAATGG CTATGAATAA AAAAGAGAAT GTTTTGTGAA TCATGGTTTT 181 TAAAATCTAC CAGAAGTATG GAGTAATTTC AATGTGTATT ATCCAAAGAA GACAATGATT 241 TAAATACAAG TAAAGAAAAT ACTAACTTAG GTAAGTACAA ATCTAAATCC CTAAGAATCA 301 GCTTAGAATC ATATATTACT AAAAGCATGA ACAAAAAAGT TGAAAGTTGA ATTACGATTT 361 TACTCCTTCT ATTAATTAAA TTGTTATAAA ATATTTAAAT ATTTAATTAT ATAAATTTAA 421 TTAAATGTTA ATGACTTTTA GAATTCCTAA GATTAATTCC TAATTAAATA TCTAATCTAT 481 TAATATTTAT CATCTATAAT CTATATGTAT ATAATAATTA CAAAAAAAAA Predicted gene structure (within gDNA segment 3965 to 1): Exon 1 2934 2911 ( 24 n); cDNA 256 280 ( 25 n); score: 0.688 Intron 1 2910 968 (1943 n); Pd: 0.900 (s: 0), Pa: 0.996 (s: 0) Exon 2 967 956 ( 12 n); cDNA 281 292 ( 12 n); score: 0.583 Intron 2 955 300 ( 656 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.83) Exon 3 299 72 ( 228 n); cDNA 293 519 ( 227 n); score: 0.893 PPA cDNA 520 530 MATCH C06HBa0112G05.1-19- SGN-E280629- 0.893 264 0.498 C 
PGS_C06HBa0112G05.1-19-_SGN-E280629- (2934 2911,967 956,299 72) Alignment (genomic DNA sequence = upper lines): AAAATA-TAA TTTTTATAAG TACACGTATC ATTTGTTAAT TTGTCCAGAT TGAAGTAACT 2876 |||||| ||| || |||| |||| AAAATACTAA CTTAGGTAAG TACAA..... .......... .......... .......... 280 ATCTGAAGTT CATATTATAA ACATTAATCT TGTATACCAA ACTACTATTC CTATGCTATG 2816 .......... .......... .......... .......... .......... .......... 280 TTGTTTGCCA TTGTCGTTCT CTCTTTATTT TTTTTCTTTC CATTCACATA CACATTAATT 2756 .......... .......... .......... .......... .......... .......... 280 TTCTAGTAGA CTGCATATTA CTACATCTGT ATTATCCGTA TGCAAGAGGA ATCCAGGATT 2696 .......... .......... .......... .......... .......... .......... 280 TGATGTTTAC AAGTATTTGT GGCGACCTCA TGTTAATATC AATAACAATT AGATTCACAT 2636 .......... .......... .......... .......... .......... .......... 280 ATGTATAGGA TTTTGACAGA AATTGAGGGA TTCACATGAA TTCATAGATT ACTCCGTGGA 2576 .......... .......... .......... .......... .......... .......... 280 TTTGCCTTTG GCTGTCCAAA CCTCCTTTAT GTCTAACTTC GTCTGAAGTC CCATTTATAT 2516 .......... .......... .......... .......... .......... .......... 280 GCTCAAAGCT TAGTCAAGGT ACTGATTTAA AACGATATTG ATACTACTCT ATAACAAACC 2456 .......... .......... .......... .......... .......... .......... 280 CAGCGAACTT TCATCACAAA AGCTAGGCCG TGTAGTGAAC TTTAAAATGA TATTGCTGCA 2396 .......... .......... .......... .......... .......... .......... 280 AAGTCGCTCA ACAAAGGGTC ATAACCAGCA CTACAACTAC ACAAGGCTCA AGCAAGTATA 2336 .......... .......... .......... .......... .......... .......... 280 CGCGGGTGAA AGATTAACAT AGATCGCTAT CCCCCGCAAA AGCTAAGGAA AGCATCTCTA 2276 .......... .......... .......... .......... .......... .......... 280 ACTTCTTAGC ATGGACCCAG ATGTACTCAA ACACACGATC TGTAAGGATG CCAGAAAGAG 2216 .......... .......... .......... .......... .......... .......... 280 AAAGTTGCTG CAATTCCTTG CAGTGTTGCA CAATGTCCCA AAACCAGCAT CAAGTGGTTC 2156 .......... .......... .......... .......... .......... .......... 
280 AAGTCAGGAG TTCGAGGCTC AATAATAAAT GAAATTGGAT CATGTTAGGA CGGTTCCTAG 2096 .......... .......... .......... .......... .......... .......... 280 CAATAGTAAC TAAGGCGTCA TTTATCATTT GGAGGCAGAA GTATAAAACT GACTGAAGCT 2036 .......... .......... .......... .......... .......... .......... 280 TAGGGCAGCC CATGGAGACA CTTACAAGGC CTTGCTCTGT CAAGGATACA TTAGGTCATG 1976 .......... .......... .......... .......... .......... .......... 280 GATCAGAAGG AAACACCCTA AGATCTTGAA GTTCCTTACT AGTGTTGGTA ATCTCCTCTA 1916 .......... .......... .......... .......... .......... .......... 280 TCACATCCAT TCCAAGTGCC TTAAAGCGTA AGCTGGATAA CCAATTATGC AAAAGAGCCA 1856 .......... .......... .......... .......... .......... .......... 280 CTGAACAAGA AGAGCTCAAA AGCTAATGAA GCGTCTAACT AGGAAAGATA AACATGAATT 1796 .......... .......... .......... .......... .......... .......... 280 GTCAAAATGT AGGATTTGTG CAACATCAAG TTATTCTTCA TATGGCAGAC CACTAAATTA 1736 .......... .......... .......... .......... .......... .......... 280 TTCAAAGAGT TCACTCGACA TCAATTGTTT AACTATGTTA TCTCACTAAA CACTGAAGTT 1676 .......... .......... .......... .......... .......... .......... 280 CATTACACTG TTTATTTATC CCTGGCATTT ATCAAAAGCA GCAACACAAT TAATAAATTA 1616 .......... .......... .......... .......... .......... .......... 280 GGATTATTGT TCTTTCTCTT GGAAACTTCT GGTTTTACAA CTAACAATCA ATTTATCTCT 1556 .......... .......... .......... .......... .......... .......... 280 TTTATCACAA ACACAGTTCA TGTATCCCTT CTCAGAAAAA AAACAGTTAA TGTATCGAGC 1496 .......... .......... .......... .......... .......... .......... 280 CATGACTATT GAAAACACGC ACCAGCAACA GGTATCAGAT AACTCCACTG AAATTTGAAC 1436 .......... .......... .......... .......... .......... .......... 280 CTGAGTTCTC TCAATCACTT CATTGACTTC AAACTGAGAC TATTGAAAGT TTGATTAACT 1376 .......... .......... .......... .......... .......... .......... 280 TTCTTATCTC TTTCCTTTGT GGCAGATAAG CATACTTGTG CACCGGCAAA GTTAAAGGTC 1316 .......... .......... .......... .......... .......... .......... 
280 ACAATTATCC ACCGAAGTCA AGCTAAATGT CATTTTCTAG TAAGATTTTA ATCAAGCACA 1256 .......... .......... .......... .......... .......... .......... 280 TTATCTACTA AATATATAGC GAGTTAGTAT CATTATATTT TGTCTACAAA TTAAATTTCG 1196 .......... .......... .......... .......... .......... .......... 280 ATTACTCTGG GTAAACAAGC CATATAGTAG GCTATTAATA TTTATAGTTG AGAATGAATG 1136 .......... .......... .......... .......... .......... .......... 280 GTTGTACAGT CCCAATGTTC AAACGTTATT TATATCATTG AAGTGACTGG ACTATGTGTT 1076 .......... .......... .......... .......... .......... .......... 280 AGCTATCGAG GAAGTATAAG TCAAATATTT GGGCACTGAA ATATAACATG TCAGCAGTTA 1016 .......... .......... .......... .......... .......... .......... 280 TTGCAGGGTA GAAATGTGGC TTTGTAAATT TGATTGGTCT TTTGGCAGAT GTCATTTTCT 956 || | | | || .......... .......... .......... .......... ........AT CTAAATCCCT 292 GCTCTTCGCA AGGACGTTGC CAATGTTCTG GAGAGATTAA AGAATGAACA AGATCAAAAG 896 .......... .......... .......... .......... .......... .......... 292 GATGTTGATG TGGATCTAAT TGAAAAGCGA AAATTGGAGC TGGCATTTAT TTGTACATAT 836 .......... .......... .......... .......... .......... .......... 292 GTTCAGCTTT CTTATTCCGA TTTGGATCAG TTTGAAGATA TAATGACGAG AGGTTGAGAA 776 .......... .......... .......... .......... .......... .......... 292 TCTGCTTCAA GCAATTTTGG ATGATGATGT CCTTACTAGC CTCGCCGGTA ATATGGATGA 716 .......... .......... .......... .......... .......... .......... 292 TCATCGTCAT TCTAAATCAG ATGCCATCAT GATGGATGAG CAATTGTACT TCCTCCTCTT 656 .......... .......... .......... .......... .......... .......... 292 GAATCTCTAT CATCTAGCGA GGCATCGTGC TGAAAAGATG TTTCCTGGAG TATGAGGTTC 596 .......... .......... .......... .......... .......... .......... 292 TTCAGAATGT ATGTAGAAAC GTAAGAGATT TCCATGGATT GATAGTGAAT GGTTGCATTG 536 .......... .......... .......... .......... .......... .......... 292 ATCACGAGAT TGTTGAATGT GTCTTACCTC TGTTTCAACT GATGGCTGAG AGAGTAGGAC 476 .......... .......... .......... .......... .......... .......... 
292 ACTTCCTTTG GGAGGAAGAA ATTATGCTGA TAATCTTGAA AATCAGCAAT TTTGAATTTT 416 .......... .......... .......... .......... .......... .......... 292 TGATTATATA AATCTTTGTG CTTCACACGT GTGTTTCATG TGAATTTTTA TATATACGTT 356 .......... .......... .......... .......... .......... .......... 292 AGAATGACTA AATTTCATGG AAATATATAT GCAAAATGTA ATGAATAATA AAATGTAACA 296 || | .......... .......... .......... .......... .......... ......AAGA 296 ATCATATCA- TATCATATAT TACTAAAAGC ATGAACAAAA AAAGTTGAAA GTTGAATTAC 237 |||| | | ||||||||| |||||||||| |||||| ||| |||||||||| |||||||||| ATCAGCTTAG AATCATATAT TACTAAAAGC ATGAAC-AAA AAAGTTGAAA GTTGAATTAC 355 AGTTTTACCC CTTCTATAAA TTAAATTGTT ATAAAATATT TAAATATTTA ATTGTATAAA 177 |||||| | ||||||| || |||||||||| |||||||||| |||||||||| ||| |||||| GATTTTACTC CTTCTATTAA TTAAATTGTT ATAAAATATT TAAATATTTA ATTATATAAA 415 TTTAATTAAA AGGTTAATGA CTTTTAGACT TCTAAAGATT GATTTCTAAT TAAATATCCA 117 ||||||| || | |||||||| |||||||| | || |||||| ||| ||||| |||||||| | TTTAATT-AA ATGTTAATGA CTTTTAGAAT TCCTAAGATT AATTCCTAAT TAAATATCTA 474 ATTAATTAAT ATTTATCATC TATAATCTAT ATGTATATTA TAATT 72 || |||||| |||||||||| |||||||||| |||||||| | ||||| ATCTATTAAT ATTTATCATC TATAATCTAT ATGTATATAA TAATT 519 hqPGS_C06HBa0112G05.1-19-_SGN-E280629- (299 72) ******************************************************************************** EST sequence 5 -strand 523 n (File: SGN-E391217-) 1 CATCATCATT ATTATTATTA TTATCCTGAT TATAATCACC ATTATCATCA TCAGCCATAA 61 TAGCCCTACG TGCTTGTAAT AATCTTGTAC AGGACGCCGT TGAGGATAAA CAAAAAATTA 121 AGAAAAAAAT GGCTATGAAT AAAAAAGAGA ATGTTTTGTG AATCATGGTT TTTAAAATCT 181 ACCAGAAGTA TGGAGTAATT TCAATGTGTA TTATCCAAAG AAGACAATGA TTTAAATACA 241 AGTAAAGAAA ATACTAACTT AGGTAAGTAC AAATCTAAAT CCCTAAGAAT CAGCTTAGAA 301 TCATATATTA CTAAAAGCAT GAACAAAAAA GTTGAAAGTT GAATTACGAT TTTACTCCTT 361 CTATTAATTA AATTGTTATA AAATATTTAA ATATTTAATT ATATAAATTT AATTAAATGT 421 TAATGACTTT TAGAATTCCT AAGATTAATT CCTAATTAAA TATCTAATCT ATTAATATTT 481 ATCATCTATA ATCTATATGT ATATAATAAT TACAAAAAAA AAA Predicted gene 
structure (within gDNA segment 3885 to 1): Exon 1 2934 2911 ( 24 n); cDNA 248 272 ( 25 n); score: 0.688 Intron 1 2910 968 (1943 n); Pd: 0.900 (s: 0), Pa: 0.996 (s: 0) Exon 2 967 956 ( 12 n); cDNA 273 284 ( 12 n); score: 0.583 Intron 2 955 300 ( 656 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.83) Exon 3 299 72 ( 228 n); cDNA 285 511 ( 227 n); score: 0.893 PPA cDNA 512 523 MATCH C06HBa0112G05.1-19- SGN-E391217- 0.893 264 0.505 C PGS_C06HBa0112G05.1-19-_SGN-E391217- (2934 2911,967 956,299 72) Alignment (genomic DNA sequence = upper lines): AAAATA-TAA TTTTTATAAG TACACGTATC ATTTGTTAAT TTGTCCAGAT TGAAGTAACT 2876 |||||| ||| || |||| |||| AAAATACTAA CTTAGGTAAG TACAA..... .......... .......... .......... 272 ATCTGAAGTT CATATTATAA ACATTAATCT TGTATACCAA ACTACTATTC CTATGCTATG 2816 .......... .......... .......... .......... .......... .......... 272 TTGTTTGCCA TTGTCGTTCT CTCTTTATTT TTTTTCTTTC CATTCACATA CACATTAATT 2756 .......... .......... .......... .......... .......... .......... 272 TTCTAGTAGA CTGCATATTA CTACATCTGT ATTATCCGTA TGCAAGAGGA ATCCAGGATT 2696 .......... .......... .......... .......... .......... .......... 272 TGATGTTTAC AAGTATTTGT GGCGACCTCA TGTTAATATC AATAACAATT AGATTCACAT 2636 .......... .......... .......... .......... .......... .......... 272 ATGTATAGGA TTTTGACAGA AATTGAGGGA TTCACATGAA TTCATAGATT ACTCCGTGGA 2576 .......... .......... .......... .......... .......... .......... 272 TTTGCCTTTG GCTGTCCAAA CCTCCTTTAT GTCTAACTTC GTCTGAAGTC CCATTTATAT 2516 .......... .......... .......... .......... .......... .......... 272 GCTCAAAGCT TAGTCAAGGT ACTGATTTAA AACGATATTG ATACTACTCT ATAACAAACC 2456 .......... .......... .......... .......... .......... .......... 272 CAGCGAACTT TCATCACAAA AGCTAGGCCG TGTAGTGAAC TTTAAAATGA TATTGCTGCA 2396 .......... .......... .......... .......... .......... .......... 272 AAGTCGCTCA ACAAAGGGTC ATAACCAGCA CTACAACTAC ACAAGGCTCA AGCAAGTATA 2336 .......... .......... .......... .......... .......... .......... 
272 CGCGGGTGAA AGATTAACAT AGATCGCTAT CCCCCGCAAA AGCTAAGGAA AGCATCTCTA 2276 .......... .......... .......... .......... .......... .......... 272 ACTTCTTAGC ATGGACCCAG ATGTACTCAA ACACACGATC TGTAAGGATG CCAGAAAGAG 2216 .......... .......... .......... .......... .......... .......... 272 AAAGTTGCTG CAATTCCTTG CAGTGTTGCA CAATGTCCCA AAACCAGCAT CAAGTGGTTC 2156 .......... .......... .......... .......... .......... .......... 272 AAGTCAGGAG TTCGAGGCTC AATAATAAAT GAAATTGGAT CATGTTAGGA CGGTTCCTAG 2096 .......... .......... .......... .......... .......... .......... 272 CAATAGTAAC TAAGGCGTCA TTTATCATTT GGAGGCAGAA GTATAAAACT GACTGAAGCT 2036 .......... .......... .......... .......... .......... .......... 272 TAGGGCAGCC CATGGAGACA CTTACAAGGC CTTGCTCTGT CAAGGATACA TTAGGTCATG 1976 .......... .......... .......... .......... .......... .......... 272 GATCAGAAGG AAACACCCTA AGATCTTGAA GTTCCTTACT AGTGTTGGTA ATCTCCTCTA 1916 .......... .......... .......... .......... .......... .......... 272 TCACATCCAT TCCAAGTGCC TTAAAGCGTA AGCTGGATAA CCAATTATGC AAAAGAGCCA 1856 .......... .......... .......... .......... .......... .......... 272 CTGAACAAGA AGAGCTCAAA AGCTAATGAA GCGTCTAACT AGGAAAGATA AACATGAATT 1796 .......... .......... .......... .......... .......... .......... 272 GTCAAAATGT AGGATTTGTG CAACATCAAG TTATTCTTCA TATGGCAGAC CACTAAATTA 1736 .......... .......... .......... .......... .......... .......... 272 TTCAAAGAGT TCACTCGACA TCAATTGTTT AACTATGTTA TCTCACTAAA CACTGAAGTT 1676 .......... .......... .......... .......... .......... .......... 272 CATTACACTG TTTATTTATC CCTGGCATTT ATCAAAAGCA GCAACACAAT TAATAAATTA 1616 .......... .......... .......... .......... .......... .......... 272 GGATTATTGT TCTTTCTCTT GGAAACTTCT GGTTTTACAA CTAACAATCA ATTTATCTCT 1556 .......... .......... .......... .......... .......... .......... 272 TTTATCACAA ACACAGTTCA TGTATCCCTT CTCAGAAAAA AAACAGTTAA TGTATCGAGC 1496 .......... .......... .......... .......... .......... .......... 
272 CATGACTATT GAAAACACGC ACCAGCAACA GGTATCAGAT AACTCCACTG AAATTTGAAC 1436 .......... .......... .......... .......... .......... .......... 272 CTGAGTTCTC TCAATCACTT CATTGACTTC AAACTGAGAC TATTGAAAGT TTGATTAACT 1376 .......... .......... .......... .......... .......... .......... 272 TTCTTATCTC TTTCCTTTGT GGCAGATAAG CATACTTGTG CACCGGCAAA GTTAAAGGTC 1316 .......... .......... .......... .......... .......... .......... 272 ACAATTATCC ACCGAAGTCA AGCTAAATGT CATTTTCTAG TAAGATTTTA ATCAAGCACA 1256 .......... .......... .......... .......... .......... .......... 272 TTATCTACTA AATATATAGC GAGTTAGTAT CATTATATTT TGTCTACAAA TTAAATTTCG 1196 .......... .......... .......... .......... .......... .......... 272 ATTACTCTGG GTAAACAAGC CATATAGTAG GCTATTAATA TTTATAGTTG AGAATGAATG 1136 .......... .......... .......... .......... .......... .......... 272 GTTGTACAGT CCCAATGTTC AAACGTTATT TATATCATTG AAGTGACTGG ACTATGTGTT 1076 .......... .......... .......... .......... .......... .......... 272 AGCTATCGAG GAAGTATAAG TCAAATATTT GGGCACTGAA ATATAACATG TCAGCAGTTA 1016 .......... .......... .......... .......... .......... .......... 272 TTGCAGGGTA GAAATGTGGC TTTGTAAATT TGATTGGTCT TTTGGCAGAT GTCATTTTCT 956 || | | | || .......... .......... .......... .......... ........AT CTAAATCCCT 284 GCTCTTCGCA AGGACGTTGC CAATGTTCTG GAGAGATTAA AGAATGAACA AGATCAAAAG 896 .......... .......... .......... .......... .......... .......... 284 GATGTTGATG TGGATCTAAT TGAAAAGCGA AAATTGGAGC TGGCATTTAT TTGTACATAT 836 .......... .......... .......... .......... .......... .......... 284 GTTCAGCTTT CTTATTCCGA TTTGGATCAG TTTGAAGATA TAATGACGAG AGGTTGAGAA 776 .......... .......... .......... .......... .......... .......... 284 TCTGCTTCAA GCAATTTTGG ATGATGATGT CCTTACTAGC CTCGCCGGTA ATATGGATGA 716 .......... .......... .......... .......... .......... .......... 284 TCATCGTCAT TCTAAATCAG ATGCCATCAT GATGGATGAG CAATTGTACT TCCTCCTCTT 656 .......... .......... .......... .......... .......... .......... 
284 GAATCTCTAT CATCTAGCGA GGCATCGTGC TGAAAAGATG TTTCCTGGAG TATGAGGTTC 596 .......... .......... .......... .......... .......... .......... 284 TTCAGAATGT ATGTAGAAAC GTAAGAGATT TCCATGGATT GATAGTGAAT GGTTGCATTG 536 .......... .......... .......... .......... .......... .......... 284 ATCACGAGAT TGTTGAATGT GTCTTACCTC TGTTTCAACT GATGGCTGAG AGAGTAGGAC 476 .......... .......... .......... .......... .......... .......... 284 ACTTCCTTTG GGAGGAAGAA ATTATGCTGA TAATCTTGAA AATCAGCAAT TTTGAATTTT 416 .......... .......... .......... .......... .......... .......... 284 TGATTATATA AATCTTTGTG CTTCACACGT GTGTTTCATG TGAATTTTTA TATATACGTT 356 .......... .......... .......... .......... .......... .......... 284 AGAATGACTA AATTTCATGG AAATATATAT GCAAAATGTA ATGAATAATA AAATGTAACA 296 || | .......... .......... .......... .......... .......... ......AAGA 288 ATCATATCA- TATCATATAT TACTAAAAGC ATGAACAAAA AAAGTTGAAA GTTGAATTAC 237 |||| | | ||||||||| |||||||||| |||||| ||| |||||||||| |||||||||| ATCAGCTTAG AATCATATAT TACTAAAAGC ATGAAC-AAA AAAGTTGAAA GTTGAATTAC 347 AGTTTTACCC CTTCTATAAA TTAAATTGTT ATAAAATATT TAAATATTTA ATTGTATAAA 177 |||||| | ||||||| || |||||||||| |||||||||| |||||||||| ||| |||||| GATTTTACTC CTTCTATTAA TTAAATTGTT ATAAAATATT TAAATATTTA ATTATATAAA 407 TTTAATTAAA AGGTTAATGA CTTTTAGACT TCTAAAGATT GATTTCTAAT TAAATATCCA 117 ||||||| || | |||||||| |||||||| | || |||||| ||| ||||| |||||||| | TTTAATT-AA ATGTTAATGA CTTTTAGAAT TCCTAAGATT AATTCCTAAT TAAATATCTA 466 ATTAATTAAT ATTTATCATC TATAATCTAT ATGTATATTA TAATT 72 || |||||| |||||||||| |||||||||| |||||||| | ||||| ATCTATTAAT ATTTATCATC TATAATCTAT ATGTATATAA TAATT 511 hqPGS_C06HBa0112G05.1-19-_SGN-E391217- (299 72) ******************************************************************************** EST sequence 4 -strand 457 n (File: SGN-E544869-) 1 ATCATCCAGG GAGAAGAATG GAACATGGGG GAGGAAGACA CCTTTGAGAA TCTCAAATTT 61 TTGAAGTGGG GTTTACGGAC TTTTTTCAAG TGGGAGGTTG GAGAGAAATC CTTCCCCAAT 121 CTTGAGAAAC TAAAACTGCA GGAATGTGGT AAGCTTGAGG AGATTCCCCC 
TAGTTTTGGA 181 GATATTTATT CATTGAAATT TATGGAAATT GTAAATAGTC CTCAACTTGA AGATTCTGCT 241 CTCAAGATTA AGGAATACGT TGAAGAGATG AGAGGAGGGA GCGAGCTTCA GATCCTTGGC 301 CAGAAGAATA TCCCCTTATT TAAGTAGCAT TAGGGTTGAA AAGTAGATTG TACTTTGCAG 361 GGTACATTGT ATATGATTAA GAAAACTTTG TTGCAGTTAT GAAATATTTT TGTGGATTTC 421 TCAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAGGAA Predicted gene structure (within gDNA segment 3953 to 562): Exon 1 3343 3006 ( 338 n); cDNA 1 338 ( 338 n); score: 0.938 Intron 1 3005 1351 (1655 n); Pd: 0.000 (s: 0.98), Pa: 0.998 (s: 0) Exon 2 1350 1322 ( 29 n); cDNA 339 364 ( 26 n); score: 0.586 PPA cDNA 423 454 MATCH C06HBa0112G05.1-19- SGN-E544869- 0.938 367 0.803 C PGS_C06HBa0112G05.1-19-_SGN-E544869- (3343 3006,1350 1322) Alignment (genomic DNA sequence = upper lines): ATCATCCAGG GAGAAGAATG GAACATGGGG GAGGAAGACA CCTTTGAGAA TCTCAAATTT 3284 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATCATCCAGG GAGAAGAATG GAACATGGGG GAGGAAGACA CCTTTGAGAA TCTCAAATTT 60 TTGAACTTGC GTCTAGCGAC TCTTTCCAAG TGGGAGGTTG GAGAGGAATC CTTCCCCAAT 3224 ||||| | | || || ||| | ||| |||| |||||||||| ||||| |||| |||||||||| TTGAAGTGGG GTTTACGGAC TTTTTTCAAG TGGGAGGTTG GAGAGAAATC CTTCCCCAAT 120 CTTGAGAAGT TAAAACTGCA GGGATGTCGT AAGCTTGAGG AGATTCCACC TAGTTTTGGA 3164 |||||||| |||||||||| || |||| || |||||||||| ||||||| || |||||||||| CTTGAGAAAC TAAAACTGCA GGAATGTGGT AAGCTTGAGG AGATTCCCCC TAGTTTTGGA 180 GATATTTATT CATTGAAAGT TATCAAAATT GTAAAGAGTC CTCAACTTGA AGATTCTGCT 3104 |||||||||| |||||||| | ||| ||||| ||||| |||| |||||||||| |||||||||| GATATTTATT CATTGAAATT TATGGAAATT GTAAATAGTC CTCAACTTGA AGATTCTGCT 240 CTCAAGATTA AGGAATACGC TGAAGATATG AGAGGAGGGA GCGAGCTTCA GATCCTTGGC 3044 |||||||||| ||||||||| |||||| ||| |||||||||| |||||||||| |||||||||| CTCAAGATTA AGGAATACGT TGAAGAGATG AGAGGAGGGA GCGAGCTTCA GATCCTTGGC 300 CAGAAGAATA TCCCCTTATT TAAGTAGCAT TATGGTTGAA CTTTGCTGGG TGATATTGTA 2984 |||||||||| |||||||||| |||||||||| || ||||| CAGAAGAATA TCCCCTTATT TAAGTAGCAT TAGGGTTG.. .......... .......... 
338 TATGATTAAA ATATCCTGTT ATGAGATTCC TCTTAGTTTC TTTTAACAAA AAATATAATT 2924 .......... .......... .......... .......... .......... .......... 338 TTTATAAGTA CACGTATCAT TTGTTAATTT GTCCAGATTG AAGTAACTAT CTGAAGTTCA 2864 .......... .......... .......... .......... .......... .......... 338 TATTATAAAC ATTAATCTTG TATACCAAAC TACTATTCCT ATGCTATGTT GTTTGCCATT 2804 .......... .......... .......... .......... .......... .......... 338 GTCGTTCTCT CTTTATTTTT TTTCTTTCCA TTCACATACA CATTAATTTT CTAGTAGACT 2744 .......... .......... .......... .......... .......... .......... 338 GCATATTACT ACATCTGTAT TATCCGTATG CAAGAGGAAT CCAGGATTTG ATGTTTACAA 2684 .......... .......... .......... .......... .......... .......... 338 GTATTTGTGG CGACCTCATG TTAATATCAA TAACAATTAG ATTCACATAT GTATAGGATT 2624 .......... .......... .......... .......... .......... .......... 338 TTGACAGAAA TTGAGGGATT CACATGAATT CATAGATTAC TCCGTGGATT TGCCTTTGGC 2564 .......... .......... .......... .......... .......... .......... 338 TGTCCAAACC TCCTTTATGT CTAACTTCGT CTGAAGTCCC ATTTATATGC TCAAAGCTTA 2504 .......... .......... .......... .......... .......... .......... 338 GTCAAGGTAC TGATTTAAAA CGATATTGAT ACTACTCTAT AACAAACCCA GCGAACTTTC 2444 .......... .......... .......... .......... .......... .......... 338 ATCACAAAAG CTAGGCCGTG TAGTGAACTT TAAAATGATA TTGCTGCAAA GTCGCTCAAC 2384 .......... .......... .......... .......... .......... .......... 338 AAAGGGTCAT AACCAGCACT ACAACTACAC AAGGCTCAAG CAAGTATACG CGGGTGAAAG 2324 .......... .......... .......... .......... .......... .......... 338 ATTAACATAG ATCGCTATCC CCCGCAAAAG CTAAGGAAAG CATCTCTAAC TTCTTAGCAT 2264 .......... .......... .......... .......... .......... .......... 338 GGACCCAGAT GTACTCAAAC ACACGATCTG TAAGGATGCC AGAAAGAGAA AGTTGCTGCA 2204 .......... .......... .......... .......... .......... .......... 338 ATTCCTTGCA GTGTTGCACA ATGTCCCAAA ACCAGCATCA AGTGGTTCAA GTCAGGAGTT 2144 .......... .......... .......... .......... .......... .......... 
338 CGAGGCTCAA TAATAAATGA AATTGGATCA TGTTAGGACG GTTCCTAGCA ATAGTAACTA 2084 .......... .......... .......... .......... .......... .......... 338 AGGCGTCATT TATCATTTGG AGGCAGAAGT ATAAAACTGA CTGAAGCTTA GGGCAGCCCA 2024 .......... .......... .......... .......... .......... .......... 338 TGGAGACACT TACAAGGCCT TGCTCTGTCA AGGATACATT AGGTCATGGA TCAGAAGGAA 1964 .......... .......... .......... .......... .......... .......... 338 ACACCCTAAG ATCTTGAAGT TCCTTACTAG TGTTGGTAAT CTCCTCTATC ACATCCATTC 1904 .......... .......... .......... .......... .......... .......... 338 CAAGTGCCTT AAAGCGTAAG CTGGATAACC AATTATGCAA AAGAGCCACT GAACAAGAAG 1844 .......... .......... .......... .......... .......... .......... 338 AGCTCAAAAG CTAATGAAGC GTCTAACTAG GAAAGATAAA CATGAATTGT CAAAATGTAG 1784 .......... .......... .......... .......... .......... .......... 338 GATTTGTGCA ACATCAAGTT ATTCTTCATA TGGCAGACCA CTAAATTATT CAAAGAGTTC 1724 .......... .......... .......... .......... .......... .......... 338 ACTCGACATC AATTGTTTAA CTATGTTATC TCACTAAACA CTGAAGTTCA TTACACTGTT 1664 .......... .......... .......... .......... .......... .......... 338 TATTTATCCC TGGCATTTAT CAAAAGCAGC AACACAATTA ATAAATTAGG ATTATTGTTC 1604 .......... .......... .......... .......... .......... .......... 338 TTTCTCTTGG AAACTTCTGG TTTTACAACT AACAATCAAT TTATCTCTTT TATCACAAAC 1544 .......... .......... .......... .......... .......... .......... 338 ACAGTTCATG TATCCCTTCT CAGAAAAAAA ACAGTTAATG TATCGAGCCA TGACTATTGA 1484 .......... .......... .......... .......... .......... .......... 338 AAACACGCAC CAGCAACAGG TATCAGATAA CTCCACTGAA ATTTGAACCT GAGTTCTCTC 1424 .......... .......... .......... .......... .......... .......... 338 AATCACTTCA TTGACTTCAA ACTGAGACTA TTGAAAGTTT GATTAACTTT CTTATCTCTT 1364 .......... .......... .......... .......... .......... .......... 338 TCCTTTGTGG CAGATAAGCA TACTTGTGCA CCGGCAAAGT TA 1322 | ||| | | |||| | || | | || .......... 
...AAAAGTA GA-TTGTAC- TTTGC-AGGG TA 364 hqPGS_C06HBa0112G05.1-19-_SGN-E544869- (3343 3006) ******************************************************************************** EST sequence 7 +strand 457 n (File: SGN-E544870+) 1 ATCATCCAGG GAGAAGAATG GAACATGGGG GAGGAAGACA CCTTTGAGAA TCTCAAATTT 61 TTGAACTTGC GTCTACCGAC TCTTTCCAAG TGGGAGGTTG GAGAGGAATC CTTCCCCAAT 121 CTTGAGAAAC TAAAACTGCA GGAATGTGGT AAGCTTGAGG AGATTCCACC TAGTTTTGGA 181 GATATTTATT CATTGAAATT TATCGAAATT GTAAATAGTC CTCAACTTGA AGATTCTGCT 241 CTCAAGATTA AGGAATACGC TGAAGAGATG AGAGGAGGGA GCGAGCTTCA GATCCTTGGC 301 CAGAAGAATA TCCCCTTATT TAAGTAGCAT TATGGTTGAA AAGTAGATTG TACTTTGCAG 361 GGTACATTGT ATATGATTAA GAAAACTTTG TTGCAGTTAT GAAATATTTT TGTGGATTTC 421 TCANNNNANA AAAAAAAAAA AAAAAAAAAA AAAAAAA Predicted gene structure (within gDNA segment 3943 to 624): Exon 1 3343 3006 ( 338 n); cDNA 1 338 ( 338 n); score: 0.973 Intron 1 3005 1351 (1655 n); Pd: 0.000 (s: 1.00), Pa: 0.998 (s: 0) Exon 2 1350 1322 ( 29 n); cDNA 339 364 ( 26 n); score: 0.586 PPA cDNA 428 457 MATCH C06HBa0112G05.1-19- SGN-E544870+ 0.973 367 0.803 C PGS_C06HBa0112G05.1-19-_SGN-E544870+ (3343 3006,1350 1322) Alignment (genomic DNA sequence = upper lines): ATCATCCAGG GAGAAGAATG GAACATGGGG GAGGAAGACA CCTTTGAGAA TCTCAAATTT 3284 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATCATCCAGG GAGAAGAATG GAACATGGGG GAGGAAGACA CCTTTGAGAA TCTCAAATTT 60 TTGAACTTGC GTCTAGCGAC TCTTTCCAAG TGGGAGGTTG GAGAGGAATC CTTCCCCAAT 3224 |||||||||| ||||| |||| |||||||||| |||||||||| |||||||||| |||||||||| TTGAACTTGC GTCTACCGAC TCTTTCCAAG TGGGAGGTTG GAGAGGAATC CTTCCCCAAT 120 CTTGAGAAGT TAAAACTGCA GGGATGTCGT AAGCTTGAGG AGATTCCACC TAGTTTTGGA 3164 |||||||| |||||||||| || |||| || |||||||||| |||||||||| |||||||||| CTTGAGAAAC TAAAACTGCA GGAATGTGGT AAGCTTGAGG AGATTCCACC TAGTTTTGGA 180 GATATTTATT CATTGAAAGT TATCAAAATT GTAAAGAGTC CTCAACTTGA AGATTCTGCT 3104 |||||||||| |||||||| | |||| ||||| ||||| |||| |||||||||| |||||||||| GATATTTATT CATTGAAATT TATCGAAATT GTAAATAGTC CTCAACTTGA 
AGATTCTGCT 240 CTCAAGATTA AGGAATACGC TGAAGATATG AGAGGAGGGA GCGAGCTTCA GATCCTTGGC 3044 |||||||||| |||||||||| |||||| ||| |||||||||| |||||||||| |||||||||| CTCAAGATTA AGGAATACGC TGAAGAGATG AGAGGAGGGA GCGAGCTTCA GATCCTTGGC 300 CAGAAGAATA TCCCCTTATT TAAGTAGCAT TATGGTTGAA CTTTGCTGGG TGATATTGTA 2984 |||||||||| |||||||||| |||||||||| |||||||| CAGAAGAATA TCCCCTTATT TAAGTAGCAT TATGGTTG.. .......... .......... 338 TATGATTAAA ATATCCTGTT ATGAGATTCC TCTTAGTTTC TTTTAACAAA AAATATAATT 2924 .......... .......... .......... .......... .......... .......... 338 TTTATAAGTA CACGTATCAT TTGTTAATTT GTCCAGATTG AAGTAACTAT CTGAAGTTCA 2864 .......... .......... .......... .......... .......... .......... 338 TATTATAAAC ATTAATCTTG TATACCAAAC TACTATTCCT ATGCTATGTT GTTTGCCATT 2804 .......... .......... .......... .......... .......... .......... 338 GTCGTTCTCT CTTTATTTTT TTTCTTTCCA TTCACATACA CATTAATTTT CTAGTAGACT 2744 .......... .......... .......... .......... .......... .......... 338 GCATATTACT ACATCTGTAT TATCCGTATG CAAGAGGAAT CCAGGATTTG ATGTTTACAA 2684 .......... .......... .......... .......... .......... .......... 338 GTATTTGTGG CGACCTCATG TTAATATCAA TAACAATTAG ATTCACATAT GTATAGGATT 2624 .......... .......... .......... .......... .......... .......... 338 TTGACAGAAA TTGAGGGATT CACATGAATT CATAGATTAC TCCGTGGATT TGCCTTTGGC 2564 .......... .......... .......... .......... .......... .......... 338 TGTCCAAACC TCCTTTATGT CTAACTTCGT CTGAAGTCCC ATTTATATGC TCAAAGCTTA 2504 .......... .......... .......... .......... .......... .......... 338 GTCAAGGTAC TGATTTAAAA CGATATTGAT ACTACTCTAT AACAAACCCA GCGAACTTTC 2444 .......... .......... .......... .......... .......... .......... 338 ATCACAAAAG CTAGGCCGTG TAGTGAACTT TAAAATGATA TTGCTGCAAA GTCGCTCAAC 2384 .......... .......... .......... .......... .......... .......... 338 AAAGGGTCAT AACCAGCACT ACAACTACAC AAGGCTCAAG CAAGTATACG CGGGTGAAAG 2324 .......... .......... .......... .......... .......... .......... 
338 ATTAACATAG ATCGCTATCC CCCGCAAAAG CTAAGGAAAG CATCTCTAAC TTCTTAGCAT 2264 .......... .......... .......... .......... .......... .......... 338 GGACCCAGAT GTACTCAAAC ACACGATCTG TAAGGATGCC AGAAAGAGAA AGTTGCTGCA 2204 .......... .......... .......... .......... .......... .......... 338 ATTCCTTGCA GTGTTGCACA ATGTCCCAAA ACCAGCATCA AGTGGTTCAA GTCAGGAGTT 2144 .......... .......... .......... .......... .......... .......... 338 CGAGGCTCAA TAATAAATGA AATTGGATCA TGTTAGGACG GTTCCTAGCA ATAGTAACTA 2084 .......... .......... .......... .......... .......... .......... 338 AGGCGTCATT TATCATTTGG AGGCAGAAGT ATAAAACTGA CTGAAGCTTA GGGCAGCCCA 2024 .......... .......... .......... .......... .......... .......... 338 TGGAGACACT TACAAGGCCT TGCTCTGTCA AGGATACATT AGGTCATGGA TCAGAAGGAA 1964 .......... .......... .......... .......... .......... .......... 338 ACACCCTAAG ATCTTGAAGT TCCTTACTAG TGTTGGTAAT CTCCTCTATC ACATCCATTC 1904 .......... .......... .......... .......... .......... .......... 338 CAAGTGCCTT AAAGCGTAAG CTGGATAACC AATTATGCAA AAGAGCCACT GAACAAGAAG 1844 .......... .......... .......... .......... .......... .......... 338 AGCTCAAAAG CTAATGAAGC GTCTAACTAG GAAAGATAAA CATGAATTGT CAAAATGTAG 1784 .......... .......... .......... .......... .......... .......... 338 GATTTGTGCA ACATCAAGTT ATTCTTCATA TGGCAGACCA CTAAATTATT CAAAGAGTTC 1724 .......... .......... .......... .......... .......... .......... 338 ACTCGACATC AATTGTTTAA CTATGTTATC TCACTAAACA CTGAAGTTCA TTACACTGTT 1664 .......... .......... .......... .......... .......... .......... 338 TATTTATCCC TGGCATTTAT CAAAAGCAGC AACACAATTA ATAAATTAGG ATTATTGTTC 1604 .......... .......... .......... .......... .......... .......... 338 TTTCTCTTGG AAACTTCTGG TTTTACAACT AACAATCAAT TTATCTCTTT TATCACAAAC 1544 .......... .......... .......... .......... .......... .......... 338 ACAGTTCATG TATCCCTTCT CAGAAAAAAA ACAGTTAATG TATCGAGCCA TGACTATTGA 1484 .......... .......... .......... .......... .......... .......... 
338 AAACACGCAC CAGCAACAGG TATCAGATAA CTCCACTGAA ATTTGAACCT GAGTTCTCTC 1424 .......... .......... .......... .......... .......... .......... 338 AATCACTTCA TTGACTTCAA ACTGAGACTA TTGAAAGTTT GATTAACTTT CTTATCTCTT 1364 .......... .......... .......... .......... .......... .......... 338 TCCTTTGTGG CAGATAAGCA TACTTGTGCA CCGGCAAAGT TA 1322 | ||| | | |||| | || | | || .......... ...AAAAGTA GA-TTGTAC- TTTGC-AGGG TA 364 hqPGS_C06HBa0112G05.1-19-_SGN-E544870+ (3343 3006) ******************************************************************************** EST sequence 6 +strand 453 n (File: SGN-E308524+) 1 TCATCCAGGG AGAAGAATGG AACATGGGGG AGGAAGACAC CTTTGAGAAT CTCAAATTTT 61 TGAACTTGCG TCTACCGACT CTTTCCAAGT GGGAGGTTGG AGAGGAATCC TTCCCCAATC 121 TTGAGAAACT AAAACTGCAG GAATGTGGTA AGCTTGAGGA GATTCCACCT AGTTTTGGAG 181 ATATTTATTC ATTGAAATTT ATCGAAATTG TAAATAGTCC TCAACTTGAA GATTCTGCTC 241 TCAAGATTAA GGAATACGCT GAAGAGATGA GAGGAGGGAG CGAGCTTCAG ATCCTTGGCC 301 AGAAGAATAT CCCCTTATTT AAGTAGCATT ATGGTTGAAA AGTAGATTGT ACTTTGCAGG 361 GTACATTGTA TATGATTAAG AAAACTTTGT TGCAGTTATG AAATATTTTT GTGGATTTCT 421 CNNNNNAAAA AAAAAAAAAA AAAAAAAATA AAA Predicted gene structure (within gDNA segment 3942 to 654): Exon 1 3342 3006 ( 337 n); cDNA 1 337 ( 337 n); score: 0.973 Intron 1 3005 1351 (1655 n); Pd: 0.000 (s: 1.00), Pa: 0.998 (s: 0) Exon 2 1350 1322 ( 29 n); cDNA 338 363 ( 26 n); score: 0.586 PPA cDNA 427 453 MATCH C06HBa0112G05.1-19- SGN-E308524+ 0.973 366 0.808 C PGS_C06HBa0112G05.1-19-_SGN-E308524+ (3342 3006,1350 1322) Alignment (genomic DNA sequence = upper lines): TCATCCAGGG AGAAGAATGG AACATGGGGG AGGAAGACAC CTTTGAGAAT CTCAAATTTT 3283 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCATCCAGGG AGAAGAATGG AACATGGGGG AGGAAGACAC CTTTGAGAAT CTCAAATTTT 60 TGAACTTGCG TCTAGCGACT CTTTCCAAGT GGGAGGTTGG AGAGGAATCC TTCCCCAATC 3223 |||||||||| |||| ||||| |||||||||| |||||||||| |||||||||| |||||||||| TGAACTTGCG TCTACCGACT CTTTCCAAGT GGGAGGTTGG AGAGGAATCC TTCCCCAATC 120 TTGAGAAGTT AAAACTGCAG 
GGATGTCGTA AGCTTGAGGA GATTCCACCT AGTTTTGGAG 3163 ||||||| | |||||||||| | |||| ||| |||||||||| |||||||||| |||||||||| TTGAGAAACT AAAACTGCAG GAATGTGGTA AGCTTGAGGA GATTCCACCT AGTTTTGGAG 180 ATATTTATTC ATTGAAAGTT ATCAAAATTG TAAAGAGTCC TCAACTTGAA GATTCTGCTC 3103 |||||||||| ||||||| || ||| |||||| |||| ||||| |||||||||| |||||||||| ATATTTATTC ATTGAAATTT ATCGAAATTG TAAATAGTCC TCAACTTGAA GATTCTGCTC 240 TCAAGATTAA GGAATACGCT GAAGATATGA GAGGAGGGAG CGAGCTTCAG ATCCTTGGCC 3043 |||||||||| |||||||||| ||||| |||| |||||||||| |||||||||| |||||||||| TCAAGATTAA GGAATACGCT GAAGAGATGA GAGGAGGGAG CGAGCTTCAG ATCCTTGGCC 300 AGAAGAATAT CCCCTTATTT AAGTAGCATT ATGGTTGAAC TTTGCTGGGT GATATTGTAT 2983 |||||||||| |||||||||| |||||||||| ||||||| AGAAGAATAT CCCCTTATTT AAGTAGCATT ATGGTTG... .......... .......... 337 ATGATTAAAA TATCCTGTTA TGAGATTCCT CTTAGTTTCT TTTAACAAAA AATATAATTT 2923 .......... .......... .......... .......... .......... .......... 337 TTATAAGTAC ACGTATCATT TGTTAATTTG TCCAGATTGA AGTAACTATC TGAAGTTCAT 2863 .......... .......... .......... .......... .......... .......... 337 ATTATAAACA TTAATCTTGT ATACCAAACT ACTATTCCTA TGCTATGTTG TTTGCCATTG 2803 .......... .......... .......... .......... .......... .......... 337 TCGTTCTCTC TTTATTTTTT TTCTTTCCAT TCACATACAC ATTAATTTTC TAGTAGACTG 2743 .......... .......... .......... .......... .......... .......... 337 CATATTACTA CATCTGTATT ATCCGTATGC AAGAGGAATC CAGGATTTGA TGTTTACAAG 2683 .......... .......... .......... .......... .......... .......... 337 TATTTGTGGC GACCTCATGT TAATATCAAT AACAATTAGA TTCACATATG TATAGGATTT 2623 .......... .......... .......... .......... .......... .......... 337 TGACAGAAAT TGAGGGATTC ACATGAATTC ATAGATTACT CCGTGGATTT GCCTTTGGCT 2563 .......... .......... .......... .......... .......... .......... 337 GTCCAAACCT CCTTTATGTC TAACTTCGTC TGAAGTCCCA TTTATATGCT CAAAGCTTAG 2503 .......... .......... .......... .......... .......... .......... 337 TCAAGGTACT GATTTAAAAC GATATTGATA CTACTCTATA ACAAACCCAG CGAACTTTCA 2443 .......... 
.......... .......... .......... .......... .......... 337 TCACAAAAGC TAGGCCGTGT AGTGAACTTT AAAATGATAT TGCTGCAAAG TCGCTCAACA 2383 .......... .......... .......... .......... .......... .......... 337 AAGGGTCATA ACCAGCACTA CAACTACACA AGGCTCAAGC AAGTATACGC GGGTGAAAGA 2323 .......... .......... .......... .......... .......... .......... 337 TTAACATAGA TCGCTATCCC CCGCAAAAGC TAAGGAAAGC ATCTCTAACT TCTTAGCATG 2263 .......... .......... .......... .......... .......... .......... 337 GACCCAGATG TACTCAAACA CACGATCTGT AAGGATGCCA GAAAGAGAAA GTTGCTGCAA 2203 .......... .......... .......... .......... .......... .......... 337 TTCCTTGCAG TGTTGCACAA TGTCCCAAAA CCAGCATCAA GTGGTTCAAG TCAGGAGTTC 2143 .......... .......... .......... .......... .......... .......... 337 GAGGCTCAAT AATAAATGAA ATTGGATCAT GTTAGGACGG TTCCTAGCAA TAGTAACTAA 2083 .......... .......... .......... .......... .......... .......... 337 GGCGTCATTT ATCATTTGGA GGCAGAAGTA TAAAACTGAC TGAAGCTTAG GGCAGCCCAT 2023 .......... .......... .......... .......... .......... .......... 337 GGAGACACTT ACAAGGCCTT GCTCTGTCAA GGATACATTA GGTCATGGAT CAGAAGGAAA 1963 .......... .......... .......... .......... .......... .......... 337 CACCCTAAGA TCTTGAAGTT CCTTACTAGT GTTGGTAATC TCCTCTATCA CATCCATTCC 1903 .......... .......... .......... .......... .......... .......... 337 AAGTGCCTTA AAGCGTAAGC TGGATAACCA ATTATGCAAA AGAGCCACTG AACAAGAAGA 1843 .......... .......... .......... .......... .......... .......... 337 GCTCAAAAGC TAATGAAGCG TCTAACTAGG AAAGATAAAC ATGAATTGTC AAAATGTAGG 1783 .......... .......... .......... .......... .......... .......... 337 ATTTGTGCAA CATCAAGTTA TTCTTCATAT GGCAGACCAC TAAATTATTC AAAGAGTTCA 1723 .......... .......... .......... .......... .......... .......... 337 CTCGACATCA ATTGTTTAAC TATGTTATCT CACTAAACAC TGAAGTTCAT TACACTGTTT 1663 .......... .......... .......... .......... .......... .......... 337 ATTTATCCCT GGCATTTATC AAAAGCAGCA ACACAATTAA TAAATTAGGA TTATTGTTCT 1603 .......... .......... .......... 
.......... .......... .......... 337 TTCTCTTGGA AACTTCTGGT TTTACAACTA ACAATCAATT TATCTCTTTT ATCACAAACA 1543 .......... .......... .......... .......... .......... .......... 337 CAGTTCATGT ATCCCTTCTC AGAAAAAAAA CAGTTAATGT ATCGAGCCAT GACTATTGAA 1483 .......... .......... .......... .......... .......... .......... 337 AACACGCACC AGCAACAGGT ATCAGATAAC TCCACTGAAA TTTGAACCTG AGTTCTCTCA 1423 .......... .......... .......... .......... .......... .......... 337 ATCACTTCAT TGACTTCAAA CTGAGACTAT TGAAAGTTTG ATTAACTTTC TTATCTCTTT 1363 .......... .......... .......... .......... .......... .......... 337 CCTTTGTGGC AGATAAGCAT ACTTGTGCAC CGGCAAAGTT A 1322 | ||| | | |||| | || | | | | .......... ..AAAAGTAG A-TTGTAC-T TTGC-AGGGT A 363 hqPGS_C06HBa0112G05.1-19-_SGN-E308524+ (3342 3006) Total number of EST alignments reported: 7 ________________________________________________________________________________ Predicted gene locations (2) in segment 1 to 4860: PGL 1 (- strand): 303 72 AGS-1 (303 72) SCR (e 0.893) Exon 1 303 72 ( 232 n); score: 0.893 PGS (303 72) SGN-E371571- PGS (303 72) SGN-E370504- PGS (299 72) SGN-E280629- PGS (299 72) SGN-E391217- 3-phase translation of AGS-1 (-strand): . . . . . . 303 ATGTAACAATCATATCATATCATATATTACTAAAAGCATGAACAAAAAAAGTTGAAAGTT M - Q S Y H I I Y Y - K H E Q K K L K V C N N H I I S Y I T K S M N K K S - K L V T I I S Y H I L L K A - T K K V E S . . . . . . 243 GAATTACAGTTTTACCCCTTCTATAAATTAAATTGTTATAAAATATTTAAATATTTAATT E L Q F Y P F Y K L N C Y K I F K Y L I N Y S F T P S I N - I V I K Y L N I - L - I T V L P L L - I K L L - N I - I F N . . . . . . 183 GTATAAATTTAATTAAAAGGTTAATGACTTTTAGACTTCTAAAGATTGATTTCTAATTAA V - I - L K G - - L L D F - R L I S N - Y K F N - K V N D F - T S K D - F L I K C I N L I K R L M T F R L L K I D F - L . . . . . . 
123 ATATCCAATTAATTAATATTTATCATCTATAATCTATATGTATATTATAATT I S N - L I F I I Y N L Y V Y Y N Y P I N - Y L S S I I Y M Y I I I N I Q L I N I Y H L - S I C I L - Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-1 (+strand): . . . . . . 72 AATTATAATATACATATAGATTATAGATGATAAATATTAATTAATTGGATATTTAATTAG N Y N I H I D Y R - - I L I N W I F N - I I I Y I - I I D D K Y - L I G Y L I R L - Y T Y R L - M I N I N - L D I - L . . . . . . 132 AAATCAATCTTTAGAAGTCTAAAAGTCATTAACCTTTTAATTAAATTTATACAATTAAAT K S I F R S L K V I N L L I K F I Q L N N Q S L E V - K S L T F - L N L Y N - I E I N L - K S K S H - P F N - I Y T I K . . . . . . 192 ATTTAAATATTTTATAACAATTTAATTTATAGAAGGGGTAAAACTGTAATTCAACTTTCA I - I F Y N N L I Y R R G K T V I Q L S F K Y F I T I - F I E G V K L - F N F Q Y L N I L - Q F N L - K G - N C N S T F . . . . . . 252 ACTTTTTTTGTTCATGCTTTTAGTAATATATGATATGATATGATTGTTACAT T F F V H A F S N I - Y D M I V T L F L F M L L V I Y D M I - L L H N F F C S C F - - Y M I - Y D C Y Maximal non-overlapping open reading frames (>= 64 codons): none PGL 2 (- strand): 3343 3006 AGS-1 (3343 3006) SCR (e 0.973) Exon 1 3343 3006 ( 338 n); score: 0.973 PGS (3343 3006) SGN-E544869- PGS (3343 3006) SGN-E544870+ PGS (3342 3006) SGN-E308524+ 3-phase translation of AGS-1 (-strand): . . . . . . 3343 ATCATCCAGGGAGAAGAATGGAACATGGGGGAGGAAGACACCTTTGAGAATCTCAAATTT I I Q G E E W N M G E E D T F E N L K F S S R E K N G T W G R K T P L R I S N F H P G R R M E H G G G R H L - E S Q I . . . . . . 3283 TTGAACTTGCGTCTAGCGACTCTTTCCAAGTGGGAGGTTGGAGAGGAATCCTTCCCCAAT L N L R L A T L S K W E V G E E S F P N - T C V - R L F P S G R L E R N P S P I F E L A S S D S F Q V G G W R G I L P Q . . . . . . 3223 CTTGAGAAGTTAAAACTGCAGGGATGTCGTAAGCTTGAGGAGATTCCACCTAGTTTTGGA L E K L K L Q G C R K L E E I P P S F G L R S - N C R D V V S L R R F H L V L E S - E V K T A G M S - A - G D S T - F W . . . . . . 
3163 GATATTTATTCATTGAAAGTTATCAAAATTGTAAAGAGTCCTCAACTTGAAGATTCTGCT D I Y S L K V I K I V K S P Q L E D S A I F I H - K L S K L - R V L N L K I L L R Y L F I E S Y Q N C K E S S T - R F C . . . . . . 3103 CTCAAGATTAAGGAATACGCTGAAGATATGAGAGGAGGGAGCGAGCTTCAGATCCTTGGC L K I K E Y A E D M R G G S E L Q I L G S R L R N T L K I - E E G A S F R S L A S Q D - G I R - R Y E R R E R A S D P W . . . . 3043 CAGAAGAATATCCCCTTATTTAAGTAGCATTATGGTTG Q K N I P L F K - H Y G R R I S P Y L S S I M V P E E Y P L I - V A L W L Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0112G05.1-19-_PGL-2_AGS-1_PPS_1 (3343 3017) (frame '1'; 324 bp, 108 residues) 1 IIQGEEWNMG EEDTFENLKF LNLRLATLSK WEVGEESFPN LEKLKLQGCR KLEEIPPSFG 61 DIYSLKVIKI VKSPQLEDSA LKIKEYAEDM RGGSELQILG QKNIPLFK- 3-phase translation of AGS-1 (+strand): . . . . . . 3006 CAACCATAATGCTACTTAAATAAGGGGATATTCTTCTGGCCAAGGATCTGAAGCTCGCTC Q P - C Y L N K G I F F W P R I - S S L N H N A T - I R G Y S S G Q G S E A R S T I M L L K - G D I L L A K D L K L A . . . . . . 3066 CCTCCTCTCATATCTTCAGCGTATTCCTTAATCTTGAGAGCAGAATCTTCAAGTTGAGGA P P L I S S A Y S L I L R A E S S S - G L L S Y L Q R I P - S - E Q N L Q V E D P S S H I F S V F L N L E S R I F K L R . . . . . . 3126 CTCTTTACAATTTTGATAACTTTCAATGAATAAATATCTCCAAAACTAGGTGGAATCTCC L F T I L I T F N E - I S P K L G G I S S L Q F - - L S M N K Y L Q N - V E S P T L Y N F D N F Q - I N I S K T R W N L . . . . . . 3186 TCAAGCTTACGACATCCCTGCAGTTTTAACTTCTCAAGATTGGGGAAGGATTCCTCTCCA S S L R H P C S F N F S R L G K D S S P Q A Y D I P A V L T S Q D W G R I P L Q L K L T T S L Q F - L L K I G E G F L S . . . . . . 3246 ACCTCCCACTTGGAAAGAGTCGCTAGACGCAAGTTCAAAAATTTGAGATTCTCAAAGGTG T S H L E R V A R R K F K N L R F S K V P P T W K E S L D A S S K I - D S Q R C N L P L G K S R - T Q V Q K F E I L K G . . . . 
3306 TCTTCCTCCCCCATGTTCCATTCTTCTCCCTGGATGAT S S S P M F H S S P W M L P P P C S I L L P G - V F L P H V P F F S L D D Maximal non-overlapping open reading frames (>= 64 codons): none ... finished at: Mon Aug 28 22:22:11 2006 ________________________________________________________________________________ Sequence 20: C06HBa0112G05.1-20, from 1 to 3383, both strands analyzed. ... started at: Mon Aug 28 22:22:11 2006 EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 1 EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 1 ******************************************************************************** EST sequence 1 +strand 727 n (File: SGN-E357460+) 1 TTCAAATCAA TCATTCTTTC TAAATCATGA GAATTTACGA GAAGGATATC AATGATCTTG 61 AGGCAACTGA GCTAAGATTA GGTTTGCCTG GGATAATAAA CGATGAATCT TCAACATCAA 121 CTAGTACTTC TAAAAATAGC AGAAAAAGAC CTTCATCTAG TAGTGTAAAT GAAAATGAAC 181 AACAAGACTC AGCTCCTGCA CCAAAAGCAC AAGTTGTTGG TTGGCCACCA GTTCGATCAT 241 ACAGGAAAAA TCATGTGTCT AAATTATCAG AATCTGATAA TAATTCCTCA GGAATGTATT 301 TAAAAGTTAG CATGGATGGA GCACCTTATT TGAGGAAAAT TGATCTTCAG GTTTACAAAA 361 GTTATCAAGA GCTACTCAAG GCTTTACAAA GCATGTTCAA GTGCACTATT GGAGTGTATT 421 CAGAAAGAGA AGGATATAAT GGATCTGATT ATGCACCAAC ATATGAAGAC AAGGATGGTG 481 ATTGGATGCT TGTTGGTGAT GTACCATGGG AGATGTTTAT AAGTTCTTGC AAAAGGCTTA 541 GAATTATCAA AGGATCTGAA GCTAAAGGTC TAGCATGTCT ATAAAAACAT GGAAATATAT 601 ATAAACATAC ATACAAATCT GCTAGGGATG AACAAGTTTC AGTTTTGTAA ACTATCTCGA 661 TCTTCTATTA GTTTCACGGG ATATATAGAT CTAGATCTAC ATGTACAAAT AGTTTTATCC 721 TTCTTTT Predicted gene structure (within gDNA segment 1 to 3383): Exon 1 1025 1035 ( 11 n); cDNA 581 591 ( 11 n); score: 0.909 Intron 1 1036 2548 (1513 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0.64) Exon 2 2549 2687 ( 139 n); 
cDNA 592 727 ( 136 n); score: 0.820 MATCH C06HBa0112G05.1-20+ SGN-E357460+ 0.820 150 0.206 C PGS_C06HBa0112G05.1-20+_SGN-E357460+ (1025 1035,2549 2687) Alignment (genomic DNA sequence = upper lines): ATAAAAACAA GTGGTTGTCA TTTCATCAAA CACATTGAAG ACATTGGCAA TTTCCACATT 1084 ||||||||| | ATAAAAACAT G......... .......... .......... .......... .......... 591 ATAGAATTTA ATACAAAATA TCAAAACATG AATTCTTGAA ATGGAGTATA ATAATTATTT 1144 .......... .......... .......... .......... .......... .......... 591 TCTTCACAAT GAATATACGT ACTATATGTG TATATGTTCA ATATAACCGG AGAAAACATT 1204 .......... .......... .......... .......... .......... .......... 591 GCCAGAAAAA ATAAATAACA GAGTTTAAAA TATGTACTCA AAAACAATAA TTAATATCAT 1264 .......... .......... .......... .......... .......... .......... 591 AAGTATAAAT TAATGAGGAA AAAAAATTAT ACCGCGATAC GAAAAAATAG AACGAAAAAA 1324 .......... .......... .......... .......... .......... .......... 591 ATTGAGTCCA CTGAATGCAC AATGTCCTCT TAAGGAAACT ACTATTTCCC TCCAATTCCG 1384 .......... .......... .......... .......... .......... .......... 591 AAAATTTAAA GAATTACATC CTCTTAGGAT GAACGATGTT ATTCACCCAT ATGGGTTCGG 1444 .......... .......... .......... .......... .......... .......... 591 CTCTGGTGGG CGCCCACCCA GTGACTCTAA TTCCTAAATT CGTCTATACT GTCAGTAACT 1504 .......... .......... .......... .......... .......... .......... 591 CATTCAACAG CATCAAAGTA AGTACATGAA TATTTAATTT TGCAGAAATA AAAGAAAATG 1564 .......... .......... .......... .......... .......... .......... 591 ATTATAATTG TTTTTTGTTT TAAAATAAGG GTGTGTAGTA TGAAGGAAAA CATTTCTCGG 1624 .......... .......... .......... .......... .......... .......... 591 AAAATATTTT TTCAATTTTC TCATATTTGG TTGGATGAAA TCATTTTTCT CAAATTTAAA 1684 .......... .......... .......... .......... .......... .......... 591 AAAAAAAAAG ACTTCCTTTC CAAACTTAAG GAAAACATTT TCCAAAATTC TTTTCTAACC 1744 .......... .......... .......... .......... .......... .......... 591 TTCCCCTACC CACCACCTGC TAGCCCCCCA CCCATACCCC TAAAAAGTTT AAGTTTAGTT 1804 .......... .......... 
.......... .......... .......... .......... 591 TTTTAAAATA TTTTAACTTC ACAATTTCTT TTTTTCACCC CTACCCTCGA CCCCCAACCC 1864 .......... .......... .......... .......... .......... .......... 591 ACCCCCTACC ACCCCCATCC CCAAAAAAAA TAATTTAAGT TTGTTTTTTA AAAAATATTT 1924 .......... .......... .......... .......... .......... .......... 591 TAAACTTAAA AAATTTATTT TTTCACCCCC TACCCTCGAC TTACCCACTC CCTACCAGCC 1984 .......... .......... .......... .......... .......... .......... 591 CCCCTCCCCC CACCCCCCCA CCCCCTCTTC AAAAAAAAAG TTAAGTTTGT TTTTAAAAAA 2044 .......... .......... .......... .......... .......... .......... 591 TATTTTAAAC TTCAAAATTT CATTTTTTCA CTCCTACCCT CGACTCCCCA CCCCACTCCA 2104 .......... .......... .......... .......... .......... .......... 591 CTAGCCCCCT ACCAGCGCAC CCCCACCACG AAAAAAAATT TTAAGTTTGT TTTTTTAAAA 2164 .......... .......... .......... .......... .......... .......... 591 AAATAAATTC AACTTCAAAA ATTATTTTCT ACTCTAGTAA AAAATAAAGA TATTTCTCAA 2224 .......... .......... .......... .......... .......... .......... 591 AAGTATTTTT CATTGAAAAA ACAAACACTA AAATATTTTT CTAGAAAATA TTATCTACTG 2284 .......... .......... .......... .......... .......... .......... 591 ACCAACCAAA CATCAGAATA TAAGTAAAAT ATCTACCTGT TTTTCAGAAA AACATTTTCC 2344 .......... .......... .......... .......... .......... .......... 591 AAGGAATTTC CATCATACCA AACACACCCA AGAGTATATT CCTCTATTTA TAGACAACAA 2404 .......... .......... .......... .......... .......... .......... 591 AAGGTAATGT GAACAAATAT TTATTGTGCC TTATTAGAAA GGATAGAACT ATTTGTAAAA 2464 .......... .......... .......... .......... .......... .......... 591 ATTGCAACCC TTCGGAAAGT TTATAACGTT TCATAAAAGT CGCAACTCTT CATAAAAGTC 2524 .......... .......... .......... .......... .......... .......... 591 ACAACTCTTC ATTAAAGTCG CAACTATATA CATATATACA AATCTTATAG GGATGAACAA 2584 | ||| ||||| ||| | | || | || | .......... .......... 
....GAAATA TATATAAACA TA-CATACAA ATCTG-CTAG 625 GTTTGAACAA GTTTCAGTTT TGTAAACTAT CTTGATCTTC GATTAATTTC ACGGGATATA 2644 | ||||||| |||||||||| |||||||||| || ||||||| |||| |||| |||||||||| GGATGAACAA GTTTCAGTTT TGTAAACTAT CTCGATCTTC TATTAGTTTC ACGGGATATA 685 TAGATCTAGA TCTACATGTA CAAATAGTTT ATCTTCTTTT TTT 2687 |||||||||| |||||||||| |||||||||| | | ||| | ||| TAGATCTAGA TCTACATGTA CAAATAGTTT -TATCCTTCT TTT 727 hqPGS_C06HBa0112G05.1-20+_SGN-E357460+ (2549 2687) ******************************************************************************** EST sequence 2 -strand 205 n (File: SGN-E398137-) 1 ACATACAAAT CTGCTAGGGA TGAACAAGTT TCAGTTTTGT AAACTATCTC GATCTTTTAT 61 TAGTTTCACG GGATATATAG ATCTAGATCT ACATGTACAA ATAGTTTTAT CCTTCTTTTT 121 TTTTTTTTTG GAAGAATTTC CTTGTGTATT ATGTACGAGT ATAAATATAC GGGAAGAAAT 181 ACAAAACTTT AGCACAAAAA AAAAA Predicted gene structure (within gDNA segment 1822 to 3383): Exon 1 2588 2762 ( 175 n); cDNA 21 203 ( 183 n); score: 0.817 MATCH C06HBa0112G05.1-20+ SGN-E398137- 0.817 175 0.854 C PGS_C06HBa0112G05.1-20+_SGN-E398137- (2588 2762) Alignment (genomic DNA sequence = upper lines): TGAACAAGTT TCAGTTTTGT AAACTATCTT GATCTTCGAT TAATTTCACG GGATATATAG 2647 |||||||||| |||||||||| ||||||||| |||||| || || ||||||| |||||||||| TGAACAAGTT TCAGTTTTGT AAACTATCTC GATCTTTTAT TAGTTTCACG GGATATATAG 80 ATCTAGATCT ACATGTACAA ATAG-TTTAT -CTTC-T-TT TTTTT----G AAAGAATTTT 2699 |||||||||| |||||||||| |||| ||||| |||| | || ||||| | |||||||| ATCTAGATCT ACATGTACAA ATAGTTTTAT CCTTCTTTTT TTTTTTTTTG GAAGAATTTC 140 CTTGTGTATT ATGTACGAGT ATAAATATAC AGGAAGAAAT GCAAAATTTT AGCACAATAT 2759 |||||||||| |||||||||| |||||||||| ||||||||| ||||| ||| ||||||| | CTTGTGTATT ATGTACGAGT ATAAATATAC GGGAAGAAAT ACAAAACTTT AGCACAAAAA 200 GAA 2762 || AAA 203 hqPGS_C06HBa0112G05.1-20+_SGN-E398137- (2588 2762) Total number of EST alignments reported: 2 ________________________________________________________________________________ Predicted gene locations (1) in segment 1 to 3383: PGL 1 (+ strand): 2549 2762 AGS-1 (2549 
2762) SCR (e 0.817) Exon 1 2549 2762 ( 214 n); score: 0.817 PGS (2549 2687) SGN-E357460+ PGS (2588 2762) SGN-E398137- 3-phase translation of AGS-1 (+strand): . . . . . . 2549 TATATACATATATACAAATCTTATAGGGATGAACAAGTTTGAACAAGTTTCAGTTTTGTA Y I H I Y K S Y R D E Q V - T S F S F V I Y I Y T N L I G M N K F E Q V S V L - Y T Y I Q I L - G - T S L N K F Q F C . . . . . . 2609 AACTATCTTGATCTTCGATTAATTTCACGGGATATATAGATCTAGATCTACATGTACAAA N Y L D L R L I S R D I - I - I Y M Y K T I L I F D - F H G I Y R S R S T C T N K L S - S S I N F T G Y I D L D L H V Q . . . . . . 2669 TAGTTTATCTTCTTTTTTTTGAAAGAATTTTCTTGTGTATTATGTACGAGTATAAATATA - F I F F F L K E F S C V L C T S I N I S L S S F F - K N F L V Y Y V R V - I Y I V Y L L F F E R I F L C I M Y E Y K Y . . . . 2729 CAGGAAGAAATGCAAAATTTTAGCACAATATGAA Q E E M Q N F S T I - R K K C K I L A Q Y E T G R N A K F - H N M Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-1 (-strand): . . . . . . 2762 TTCATATTGTGCTAAAATTTTGCATTTCTTCCTGTATATTTATACTCGTACATAATACAC F I L C - N F A F L P V Y L Y S Y I I H S Y C A K I L H F F L Y I Y T R T - Y T H I V L K F C I S S C I F I L V H N T . . . . . . 2702 AAGAAAATTCTTTCAAAAAAAAGAAGATAAACTATTTGTACATGTAGATCTAGATCTATA K K I L S K K R R - T I C T C R S R S I R K F F Q K K E D K L F V H V D L D L Y Q E N S F K K K K I N Y L Y M - I - I Y . . . . . . 2642 TATCCCGTGAAATTAATCGAAGATCAAGATAGTTTACAAAACTGAAACTTGTTCAAACTT Y P V K L I E D Q D S L Q N - N L F K L I P - N - S K I K I V Y K T E T C S N L I S R E I N R R S R - F T K L K L V Q T . . . . 2582 GTTCATCCCTATAAGATTTGTATATATGTATATA V H P Y K I C I Y V Y F I P I R F V Y M Y I C S S L - D L Y I C I Maximal non-overlapping open reading frames (>= 64 codons): none ... finished at: Mon Aug 28 22:22:23 2006 ________________________________________________________________________________ Sequence 21: C06HBa0112G05.1-21, from 1 to 5441, both strands analyzed. ... 
started at: Mon Aug 28 22:22:23 2006 EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 0 EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 5 HitsTableSize = 0 No significant EST matches were found. ... finished at: Mon Aug 28 22:22:33 2006 ________________________________________________________________________________ Sequence 22: C06HBa0112G05.1-22, from 1 to 9707, both strands analyzed. ... started at: Mon Aug 28 22:22:33 2006 EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 5 ... matches indexed, elapsed seconds = 6 HitsTableSize = 2 EST library file: /tmp/cxgn-bacpublish-resources-SZZk1T/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 5 ... 
matches indexed, elapsed seconds = 5 HitsTableSize = 4 ******************************************************************************** EST sequence 1 +strand 585 n (File: SGN-E320435+) 1 TTTTGTTGAG CACTGTTGTC TAAGTACTTT AGCCCTTGCC GGTGACTTCA TTAAACTGAT 61 TTGATAGAAC GGAAGGTTTT GTGGAAAAAT GGTGTCTGAG TACTTCAGCC CTTATTGGTT 121 AGGATAAGTT TTAATTTATG CATCTGAGAA ACTTTTCTTG CTTGTAATTG AAGCATGGGG 181 AAGCAGTGGG TTTTGTGGAG CTTTATTGTC TGAGTACTTT AGTCTTTCTG GTGAGAAAAG 241 TTCTAATCTT TGCATCTGGG GTTTGAAATG TTTCTTGTTA GTAATTGAGG GTTTAAGGAA 301 GTAGTAGGGT CTGTTTAGCA TTCTCGTCTG AGTACTTTAG CCCTTGAGAT ATCCCTCCCA 361 CATTGTGTAA ACAATCTCTT CTCGTCGAGC ATGCATTTCC CCATCATCTG ACTTAGAAGC 421 TGAAGCAACA TTGTTAATCT CTATCTCTGG TTGGTCGCTG ATATTAAAGT TGGTCGCTGA 481 TATTCTGGAA CCTGAGAGAA CTGGACCTGG GAGAAAGTGA AGCAGAACAC CCGAGTGGCC 541 ATTGGGCTGG TCACTTTCCT GATAGCTATA CATCGCGTGT GTCAT Predicted gene structure (within gDNA segment 154 to 3169): Exon 1 844 1272 ( 429 n); cDNA 10 473 ( 464 n); score: 0.752 Intron 1 1273 1870 ( 598 n); Pd: 0.000 (s: 0.76), Pa: 0.000 (s: 0) Exon 2 1871 1884 ( 14 n); cDNA 474 487 ( 14 n); score: 0.786 Intron 2 1885 1939 ( 55 n); Pd: 0.000 (s: 0), Pa: 0.999 (s: 0.94) Exon 3 1940 2036 ( 97 n); cDNA 488 584 ( 97 n); score: 0.887 MATCH C06HBa0112G05.1-22+ SGN-E320435+ 0.777 540 0.923 C PGS_C06HBa0112G05.1-22+_SGN-E320435+ (844 1272,1871 1884,1940 2036) Alignment (genomic DNA sequence = upper lines): GCACTGTTGT CTAAGTACTT TAGCCGTTGC CGGTGACTTC ATTAAACTGA TTTGATAGAA 903 |||||||||| |||||||||| ||||| |||| |||||||||| |||||||||| |||||||||| GCACTGTTGT CTAAGTACTT TAGCCCTTGC CGGTGACTTC ATTAAACTGA TTTGATAGAA 69 CAGAAGGTTT TGTGGAACAT TGGTGTCTGA GTACTTCAGC CCTTACTGGT TGGGATGAGT 963 | |||||||| ||||||| | |||||||||| |||||||||| ||||| |||| | |||| ||| CGGAAGGTTT TGTGGAAAAA TGGTGTCTGA GTACTTCAGC CCTTATTGGT TAGGATAAGT 129 TTTAATCTAT GCATCTGAGG AACTTTTCTT GCTTGTAATT GAAGC----- --AG---TGG 1013 |||||| ||| ||||||||| |||||||||| |||||||||| ||||| || ||| TTTAATTTAT GCATCTGAGA AACTTTTCTT GCTTGTAATT 
GAAGCATGGG GAAGCAGTGG 189 GTTTTGTGGA GCATTATTGT GTGAGTACTT TAGTCTTTCT GGTGAGAAAA GTTCTAATCT 1073 |||||||||| || ||||||| ||||||||| |||||||||| |||||||||| |||||||||| GTTTTGTGGA GCTTTATTGT CTGAGTACTT TAGTCTTTCT GGTGAGAAAA GTTCTAATCT 249 TTGCATCTGG GGTTTG-AAT GTTTCTTGTT AGTAATTGAG GGTTTGAGGA AGTAGTAGGT 1132 |||||||||| |||||| ||| |||||||||| |||||||||| ||||| |||| ||||||||| TTGCATCTGG GGTTTGAAAT GTTTCTTGTT AGTAATTGAG GGTTTAAGGA AGTAGTAGGG 309 TCTGTGAAGC ATTCTCGTCT GAGTAC--T- ----TT-AG- ---CCCTCCC ACATTGTGTG 1180 ||||| ||| |||||||||| |||||| | || || ||||||| ||||||||| TCTGTTTAGC ATTCTCGTCT GAGTACTTTA GCCCTTGAGA TATCCCTCCC ACATTGTGTA 369 AACAATCTCT TCTCATCGAG CATGCGTTTC CCCATCA--- G-C-T-GAAG C--AA-C-AC 1230 |||||||||| |||| ||||| ||||| |||| ||||||| | | | |||| | || | || AACAATCTCT TCTCGTCGAG CATGCATTTC CCCATCATCT GACTTAGAAG CTGAAGCAAC 429 --TGTTAATC TTTATCTCTG GTTGGTCGCT GATATTAAAG TTGGGGTTGT TATAACCAAA 1288 |||||||| | |||||||| |||||||||| |||||||||| |||| ATTGTTAATC TCTATCTCTG GTTGGTCGCT GATATTAAAG TTGG...... .......... 473 AAATCAGGGT ACATTTGAAA ATAAAATATA ATTCAACATT CGAAAGCGGG TTATAAGTTG 1348 .......... .......... .......... .......... .......... .......... 473 TTTTCCAAAT TTGATTTACA ACTTCGAACT TTTCGTACAC AAACACTGGT TTTCGAATAA 1408 .......... .......... .......... .......... .......... .......... 473 AGTGGACATT ATATCAGAAA AAAAGTAAAT AATATTTATG GGTCCTTAAT TAAGTCTTTC 1468 .......... .......... .......... .......... .......... .......... 473 AGTTCATTCC TCTGTTTTGC TTGATTGATG GTATTTCACA GTAGTTGGTG GCCATAGAGA 1528 .......... .......... .......... .......... .......... .......... 473 AGAAATGCCT TAGTACTTTG AATCGCTTAC TGGGTAAAGA AGCAAAAGCA TATAGTGTAT 1588 .......... .......... .......... .......... .......... .......... 473 GACTTGGCTA CAACAAACGA AAGGGATGTA AGAGGGCACA ATTTCGTCCA CCTTTAGAAA 1648 .......... .......... .......... .......... .......... .......... 473 CTAAGCATAA ACTGAATCTA ATTAAATTTT TGTTTTGATA TGTGCACTTT ACTAGCAATG 1708 .......... .......... .......... .......... .......... 
.......... 473 TTGCACAGAA AGTGTGAACT ACTGGTTCGG TGTTATAAGT AGTTCAATGA TTTGAATTAT 1768 .......... .......... .......... .......... .......... .......... 473 TGAAAGGGTT GAATATCTTG TAACATGGTT ATCAAATTAA TGTTGCATTT TTATCTCTTT 1828 .......... .......... .......... .......... .......... .......... 473 CCATTTGATT TATCACAACT CCTCTTATAT TTGGACCTTC TTTTGCTGCT ATTCTTGTTT 1888 | |||| | ||||| .......... .......... .......... .......... ..TCGCTGAT ATTCTG.... 487 AGCGCTGGGA ATAAAAACTA TAGAAGCAAT GTAACTCGTT TTTTGCTGCA GGAACCTGAG 1948 ||||||||| .......... .......... .......... .......... .......... .GAACCTGAG 496 AGAACTGGAA CTGGGAGAAA GTGAAGCAGA AGACCTGAGT GGCCATTGGC TTAGTCATTT 2008 ||||||||| |||||||||| |||||||||| | ||| |||| ||||||||| | |||| || AGAACTGGAC CTGGGAGAAA GTGAAGCAGA ACACCCGAGT GGCCATTGGG CTGGTCACTT 556 TTCTGATAGT TGTACATCGC TTGTGTCA 2036 | ||||||| | |||||||| ||||||| TCCTGATAGC TATACATCGC GTGTGTCA 584 hqPGS_C06HBa0112G05.1-22+_SGN-E320435+ (844 1272,1871 1884,1940 2036) ******************************************************************************** EST sequence 3 -strand 728 n (File: SGN-E541981-) 1 GATCAGAGAT GTCCCGTTTG GAGACAAGGC TCTCTTGGCT AATGCTGCAA AGCTCGAGAC 61 AATGCGATCC CTTTGGATGT CTTCTTGTTC AGTAAGTTTT GAAGCATGTA AGATGCTAGC 121 TCAGAAGATG CCTAGGCTTA ATGTGGAAGT CATAGATGAG AGGGGTCCTC CAGATACAAG 181 GCCAGAAAGC TGCCCTGTGG AGAAACTTTA CATATATCGA ACAGTAGCTG GACGAAGATT 241 TGATACACCT GGTTATGTTT GGACAATGGA TGAAGATGCA GCTGTAAGCT TGACCTGAGC 301 CATACATCAA ATGGGAAATT CCAGCAAGAC TTCACATGGA AAGATGGGCA GGTATTAGCG 361 CTATGTTTAT CCTCCTGCAT TTGTTTGTCA CGCCTTATGT AGTTGCAGTG CCAGTTGAGG 421 GTTGGAACTT CACTGCTATG AAGTACGAGG CTCCCGAGCT CATACGCCAT CAATTTTATT 481 CATGGCTGAG AGCTACTTCA CAGCAGGTAG TTTTTAATAA AGAGTTTGAA CTCTGAGTTT 541 CTGTGTATCA TCTAGAAGTA ACAATTATTC GTCTTCTTTG GGTGGGATTT TGCTTGGGGG 601 GTGGGTGGGT TTCGGGGCAT ATTTTTGCCA TGTATGTGCT ATTTCCAGAC CTTGCTTCAT 661 TATGTAGTTC CCTTTATTTC TTCTATTGAC TTATTTTCTA TGAAATGGAG TGCTTTCAAA 721 AAAAAAAA Predicted gene structure (within gDNA 
segment 2492 to 9707): Exon 1 3408 3687 ( 280 n); cDNA 1 275 ( 275 n); score: 0.779 Intron 1 3688 3935 ( 248 n); Pd: 1.000 (s: 0.64), Pa: 0.000 (s: 0) Exon 2 3936 3944 ( 9 n); cDNA 276 285 ( 10 n); score: 0.389 Intron 2 3945 4778 ( 834 n); Pd: 0.990 (s: 0), Pa: 0.861 (s: 0.46) Exon 3 4779 4911 ( 133 n); cDNA 286 416 ( 131 n); score: 0.613 Intron 3 4912 8180 (3269 n); Pd: 0.000 (s: 0.74), Pa: 0.000 (s: 0) Exon 4 8181 8199 ( 19 n); cDNA 417 435 ( 19 n); score: 0.684 PPA cDNA 718 728 MATCH C06HBa0112G05.1-22+ SGN-E541981- 0.725 441 0.606 C PGS_C06HBa0112G05.1-22+_SGN-E541981- (3408 3687,3936 3944,4779 4911,8181 8199) Alignment (genomic DNA sequence = upper lines): GATTAGAGAC TGCCCTTTTG GCGATGAGGC TCTGTTGGCT AATGCTGCAA AGCTGGAGAG 3467 ||| ||||| ||| |||| | || |||| ||| |||||| |||||||||| |||| | GATCAGAGAT GTCCCGTTTG GAGACAAGGC TCTCTTGGCT AATGCTGCAA AGCT----CG 56 TGGAGACCAT GCGATCCCTT TGGATGTCTA ATTGTTCAGT AAGTTTTAAA GCATGTAAGC 3527 |||| || |||||||||| ||||||||| ||||||||| ||||||| || ||||||||| ---AGACAAT GCGATCCCTT TGGATGTCTT CTTGTTCAGT AAGTTTTGAA GCATGTAAGA 113 TGCTAGCCCA GAAGTTGCC- AGGCTTAATG TTGAAGTTAT AAACGAGAGG GGTCATCCGG 3586 ||||||| || |||| |||| |||||||||| | ||||| || | | |||||| |||| ||| | TGCTAGCTCA GAAGATGCCT AGGCTTAATG TGGAAGTCAT AGATGAGAGG GGTCCTCCAG 173 ATACGAGACC AGAAAGTTGC TCTATTGAGA AACTTTATAT ACACAAGACA GTGTCAGGAA 3646 |||| || || |||||| ||| || | |||| ||||||| || | | ||| || | ||| ATACAAGGCC AGAAAGCTGC CCTGTGGAGA AACTTTACAT ATATCGAACA GTAGCTGGAC 233 GGAGGTTCGA CACTCCTGGT TTTGTTTGGA CTA-GGCTCC GGGCACGTGT ATTCCAAATA 3705 | || || || || |||||| | |||||||| | | || | | GAAGATTTGA TACACCTGGT TATGTTTGGA CAATGGATGA AG........ .......... 275 AAAATCAATA AAAAAAATAA ATGAAAACGT ATATAATAAA CTATTAGGAG AATTATGATG 3765 .......... .......... .......... .......... .......... .......... 275 TTTTTTTCGA GTGTTCTCTC GATATTTTCG ATCTACTGCA CATTAGATAA GTCTCCAAAG 3825 .......... .......... .......... .......... .......... .......... 
275 CTTACCCATT TATAAGCAAT TAAATGTACA GACAGTCATC AACAAGTAAA AATTCTAAGA 3885 .......... .......... .......... .......... .......... .......... 275 AAGACACATC AAGTTAAAGA ATTAAAAAGG GAAAATGCGA CGTTATTTAA AGTCATCT-T 3944 | || || | .......... .......... .......... .......... .......... ATGCAGCTGT 285 GTATCCTTAA TGTGTGTCGT TACAAATGTA ACTACTTCCT TAAAAACTTT CAGATTTACT 4004 .......... .......... .......... .......... .......... .......... 285 CAGACTTACA TTTACAGGCT TACAACTACT ATCCATCACT GTTAGAGAAG ACTTGGAGTC 4064 .......... .......... .......... .......... .......... .......... 285 GCTACATTTA TTGAAGAAAA TGTGATATAT TTTGCTAGCA AACAGCTGCA ATTTCAATTT 4124 .......... .......... .......... .......... .......... .......... 285 ATTATTAATA GTTTAAAAGT TTAAACATGG TTTCTTTTCT GTGGGAAGCT TGCATAGAGA 4184 .......... .......... .......... .......... .......... .......... 285 GTTTTAAAGA GAGTTGATCA TCATTCAGTA TCAAACCATA TTGTACAACA ACATTCAATT 4244 .......... .......... .......... .......... .......... .......... 285 TTTGATATGT GTGGTTATAC CTTTCAAGCT ATTTATTTTA CCTATTAAAA AAATTAGAAA 4304 .......... .......... .......... .......... .......... .......... 285 GTTAGCAAAT CAAACGGAAA AATCTAGACT TAATAAGATT GTACCTGAAA ATTAGAAGAC 4364 .......... .......... .......... .......... .......... .......... 285 ATTTTTGTAC CCACTTAACA ATGGAAGCTT GTGTTGAATG ACTAATATTG AGTATGTTTA 4424 .......... .......... .......... .......... .......... .......... 285 TGTAGTGTAG TCTTGGAGAT TTGTTGATAA AAACTCTTTG ATATTCCTCA ATTACTTAAT 4484 .......... .......... .......... .......... .......... .......... 285 TTGATCCTTT TCAACTTCTC ATAATTAGCA AAATACGTTT CAACTTCTAT ATTAATTGAA 4544 .......... .......... .......... .......... .......... .......... 285 ACTTTATTCG TGTTCATCCT ATTTGGTTGT TTCACACATA GTAATTGAAT AGACTGTTGA 4604 .......... .......... .......... .......... .......... .......... 285 ACTTCTTATT TTGTTCCTTT ATTATGTAAT TCAATATTAC TATTTAATTA GATATTTAAT 4664 .......... .......... .......... .......... .......... .......... 
285 TTATATTTGG GAAAAGTATT TTATTACTTT TTAATTTAGT ATAGGGGCAA AGTAATAATT 4724 .......... .......... .......... .......... .......... .......... 285 CAACTTTACA CTTTAGAGCT TCATGCTTAT AATAATATAT GATTATTGAT GAAGATGCAA 4784 | || .......... .......... .......... .......... .......... ....AAGCTT 291 CATCGACTCC ATATAGC-AA TGGG-GATTG CTCTTTGGCT TCTTCTTAGG AAGA-CTTCA 4841 | | || ||| | | || |||| ||| | | || || | | | |||| || GACCTGAGCC ATACATCAAA TGGGAAATTC CAGCAAGACT TC-ACATGGA AAGATGGGCA 350 GGTATTAGTT CTATGCTGAT CTTTCACCTG CATTTACTTG CTTGAGCCTT GTGTAGTTGC 4901 |||||||| ||||| | || | | |||| ||||| ||| | ||||| ||||||||| GGTATTAGCG CTATGTTTAT C---CTCCTG CATTTGTTTG -TCACGCCTT ATGTAGTTGC 406 AGTGCTGGTT ATGGTGGCAG TGAAGCGCAT GTGCCATTGA TTTGACCCTT TGTTGAGTGA 4961 ||||| ||| AGTGCCAGTT .......... .......... .......... .......... .......... 416 CTTTGCAACA ATTAATTACA AATAATGAGT TTAAACTCTT GTTGTTGTGC TATATCACTA 5021 .......... .......... .......... .......... .......... .......... 416 AGGATAACTA TTGTATTTTC AACTCTTATT GTTTTTTCTG TTGAAATGAT GCTGTCAATG 5081 .......... .......... .......... .......... .......... .......... 416 TACTTGTTGT CCTGTATTGT TGTAATTCTC CTTGAAGATA AATGGGAACC TTTCATTCAT 5141 .......... .......... .......... .......... .......... .......... 416 TTGAGTGCTG AAATCCGATT TAATTTGTTG AACGTTTAGA AGAAGGTATT TTATTGTTCT 5201 .......... .......... .......... .......... .......... .......... 416 AGTACGAGTC AGCCTGATAA ACAAACACTT ACATAGGTAT GATGACTGTT CATTTTCTCA 5261 .......... .......... .......... .......... .......... .......... 416 ATTTTATACC TACTCGTGCA AATCCAAAGT TAAAAGGTCA TGTTTATGTA TTATGCCAAA 5321 .......... .......... .......... .......... .......... .......... 416 TCTATACTCA CATGTGTGCT TGCGATAAAA GAAATAGACA AATCAACAAA TGATACGTGC 5381 .......... .......... .......... .......... .......... .......... 416 ATTTATAAAA ATTATAATTT TTGTTGAAGA AACTTTGAGA AATCCACAAA AATATTTCAT 5441 .......... .......... .......... .......... .......... .......... 
416 AACTGCAACA AAGTTTTCTT AATCATATAC AATGTACCCT GCAAAGTACA ATCTACTTTT 5501 .......... .......... .......... .......... .......... .......... 416 CAACCATAAT GCTACTTAAA TAAGGGGATA TTCTTCTGGC CAAGGATCTG AAGCTCGCTC 5561 .......... .......... .......... .......... .......... .......... 416 CCTCCTCTCA TTTCTTCAGC GTATTCCTTA ATCTTGAGAG CAGAATCTTC AAGTTGAGGA 5621 .......... .......... .......... .......... .......... .......... 416 CTCTTTACAA TTTTGATAAA TTTCAATGAA TAAATATCTC CAAAACTAGG TGGAATCTCC 5681 .......... .......... .......... .......... .......... .......... 416 TCAAACTTAC CACATTCCTG CAGTTTTAAT TTCTCAAGAT TGGGGAAGGA TTCCTCTCCA 5741 .......... .......... .......... .......... .......... .......... 416 ACCTCCCACT TGGAAAGAGT CGGTAGACGC AAGTTCAAAA ATTTGAGATT CTCAAAGGTG 5801 .......... .......... .......... .......... .......... .......... 416 TCTTCCTCCC CCATGTTCCA TTCTTCTCCA TGGATGATTG CATCACAAAG GGACAACCCT 5861 .......... .......... .......... .......... .......... .......... 416 TCAAGGTTGG GCAGTCTAGC TATTGTTGAC AGTGAATCGG ATGTCAGAGG AAAGTCACGC 5921 .......... .......... .......... .......... .......... .......... 416 CATGACAGTT CTTTCAAATT TGAAGGGAAG TGGAAATCCC ACGGCCGATT TGTCGCTACA 5981 .......... .......... .......... .......... .......... .......... 416 GAGGACCCAA TGTGGTTTGT GTTTGAACTT TTAAAACCTA CACTGAGTAT TTCTAGTTCA 6041 .......... .......... .......... .......... .......... .......... 416 GTTAGGCAAT CCAATTTCGG GAACCAATAT TGCTCTGTTG AATAATCCCA TGACTCCTTG 6101 .......... .......... .......... .......... .......... .......... 416 AGAACAAATT CAAGCACTTT AAGATTGGGA AACCTTTTGA AAATATTCTT TGTATCTTTC 6161 .......... .......... .......... .......... .......... .......... 416 GAATAGGAAA TCAACAGTTC CCTTAGTATT CTCAAGTTCT CTAACTTTGT GTCCTCTGCT 6221 .......... .......... .......... .......... .......... .......... 416 ATCAATATTG ATTCATCTGC ATCCATATCA AAGAAAGAAC AAGCATCCGC GGACAGCACT 6281 .......... .......... .......... .......... .......... .......... 
416 CGTAGCTTTA CAAGATCCAA AATTCTTGGT AACAGTATCA AGGTTGATCC TTTGTTAGAC 6341 .......... .......... .......... .......... .......... .......... 416 ACAAACAGAC TTTCTAGATT CCAGAGGTTT GAGAAAGACA AAGGCAGATA TTTAACTTGT 6401 .......... .......... .......... .......... .......... .......... 416 GTCCGAATTC TTAAGTACCT CAAATGATTC AACATGCATA TTTCATTCAG CAAAGAATCA 6461 .......... .......... .......... .......... .......... .......... 416 TTCACCATGA TTAAAGAGGA TTCCAGGTCC AACACTCTAA GAAGCCTCAA GTGTCTTAGG 6521 .......... .......... .......... .......... .......... .......... 416 TGAAATGTAT CAAAAAGACT GTCATCATAA TCAATGGTAA TTTTACGTGG CAATAAATCT 6581 .......... .......... .......... .......... .......... .......... 416 GATGGAGCAC TTGATCTTAT CCGATCAAAC AACTTTTCCT TTCTTGCTTT TATCAAACAA 6641 .......... .......... .......... .......... .......... .......... 416 AAGTCATGCA CAAGATCATG AACTTGGCAA CTCGGTTCAT CACCTATCTC ATTCAAAAGA 6701 .......... .......... .......... .......... .......... .......... 416 ATTACCAAGC TACTGGAAAT TAAATCATCC ATACAAATCT TCAGCACTTC TTCCATACTC 6761 .......... .......... .......... .......... .......... .......... 416 TTCATCTCCG TCTTCCCCAC AAATCCTTCA GCACCCAAAA AAAACATTCA ACAAATAGAT 6821 .......... .......... .......... .......... .......... .......... 416 TGTCAATGGA GTGTCCTTCG GCAAACTTAC AAAGTGAAGC AAGCATGGCT TGAGGTGATG 6881 .......... .......... .......... .......... .......... .......... 416 TGGTAAATGG TCATAACTTA ATTCTATAAC TTTCATCACT TCCACTTCAC TGTTCAAAAT 6941 .......... .......... .......... .......... .......... .......... 416 AAAAGAACTC AAACTATTTT GAACTTCAAG CCACACACTC TTTTTCTTTT CCCTCCCAGC 7001 .......... .......... .......... .......... .......... .......... 416 AATGACTCCA GCAATCAGAT CAGCCACCAA AGGAAGCCCT TTACAATTTT TGGCTATTTC 7061 .......... .......... .......... .......... .......... .......... 416 TTTACCGACA TCTAATAGTT CATCAGGGCA ACTCTCGTTC CCAAATGCCC TTTTCTCTAA 7121 .......... .......... .......... .......... .......... .......... 
416 TAGTTCCCAA CTTTCATCTG GTCTTAGCAA TCGAAGGTCA AGAGGATCAG TGTTCAGCTT 7181 .......... .......... .......... .......... .......... .......... 416 TCCATGCAAA GGCACTTCCT TTTCTCGAGT TGTCAAAATA ATCCTACTTC CTTTCTTAGC 7241 .......... .......... .......... .......... .......... .......... 416 TTCAAGAAAA GGTCTTGTTA ACTCATCCCA TGTAGTAGTA TCCCACACGT CATCTAAGAC 7301 .......... .......... .......... .......... .......... .......... 416 AATAAGATAC CTCTTTCCAT ACAGTTTTTT CCGTAGCTTA TCAGGAACAT CAATATTCTC 7361 .......... .......... .......... .......... .......... .......... 416 ACTCAATTTT GAATCTGAGT CACTAACTTG ATTGACAATT TTATTCAACA ACTTCTTCTC 7421 .......... .......... .......... .......... .......... .......... 416 ATCATATCCT TGGTCGACCG TGCACCATGC ACGAAGGTTG AAATGGCTAG AAACTGACTT 7481 .......... .......... .......... .......... .......... .......... 416 ATCACTGTAT ACTTTGTATG CCAAAGTAGT TTTACCTGAA CCCGGCATAC CAGTGATCGA 7541 .......... .......... .......... .......... .......... .......... 416 AATGACATCT AGATCTGCCG GTCCACTGGT GAGCTTTCTA AGTATCAAGT TTGTCTCCTC 7601 .......... .......... .......... .......... .......... .......... 416 CTCAAAACCT ACAATTATTT TATCAGTTGT CAATGACTTT CTCTCAACTG GTTTCTTGGG 7661 .......... .......... .......... .......... .......... .......... 416 AGAGTTCACA ACGATTAGAC CTCTGTCCTT GGGAATGTTC TCATCTAAAG CAGAGATCTC 7721 .......... .......... .......... .......... .......... .......... 416 TTCTTTGATA AGTTTGATCT TCTTTATGGT AATGGGAAGT GAGAAAATAA GATGTAAGAG 7781 .......... .......... .......... .......... .......... .......... 416 ACCATTATCT CGAACAATAA TTGAATCTAT GACATCTTTT GCCTCATAAG CCACATCTAG 7841 .......... .......... .......... .......... .......... .......... 416 AACACGTGCC CAGATATCTT TATACAATCC TTGCTCAGCA TCCACAAAGA ATGATCTTAT 7901 .......... .......... .......... .......... .......... .......... 416 GAATTCCAGG TCTTGTCTCA CCAACTCGAT TTCTTCCTTT ATCAAAGAAA TTGAATAAGC 7961 .......... .......... .......... .......... .......... .......... 
416 ATTAGATTCT AGCAAATCAT TTAAGTGCAT GTGTAGAAGA TGCATGAAGA GTGGTCCATC 8021 .......... .......... .......... .......... .......... .......... 416 ACTCATGGGG GAGCAACATT GAGATGAATC CGGGGCTTTC AGATAAACAT GTTTGAGATC 8081 .......... .......... .......... .......... .......... .......... 416 TTTCTTGAGG AGTTCAATAT TTTCCAGCAA GTCTAGGGTT ACACAATTTG TTTGGTTATT 8141 .......... .......... .......... .......... .......... .......... 416 ACCCTCTTTA TTCCTTAATT TCTCTTCCAA GTCACATACA AGAGTTGATA CCTCCCTG 8199 || |||| | | || ||| .......... .......... .......... .........G AGGGTTGGAA CTTCACTG 435 hqPGS_C06HBa0112G05.1-22+_SGN-E541981- (3408 3687) ******************************************************************************** EST sequence 5 -strand 789 n (File: SGN-E548970-) 1 AACTATCAGA GCAGATTGCC ATTTTCTTTG CACAGGAAAG CCACCGGGTT CAGAGTGGTT 61 AAGGGAGACA TATTTGAAGG ATCGAATCGA TAATTTTGGA AGATTGAAGG TAGATGAAAA 121 TCTTAGAATG AAGGGTCACA GAAACATATT TGCTGTTGGT GATATAACTG ATATTAAGGA 181 ACTTAAACAA GGATATTCAG CTCAAAAACA TGCTCTAGTG GCTGCAAAGA ACTTGAAACT 241 TTTGATGAGT GGAGGAAAAG AAAGCAAACT TGCAATATAC GAGCCTCGAT CTTCTCCTAA 301 GATCATCGTT CCGTTAGGAA GGCAAGATGC AGTGGCTCAA TTTACGTTCA CAACGATCAT 361 TGGATTAGTC CCCGGGATGA TCAAGTCTAA GGATTTATAC GTTGGGAAAA CGAGGAAGAA 421 AATGGGTCTT CAACCTAAAT AGGAGTATTA TATATTGTTT CATTCATGAA ATTTAACATC 481 AGTTTGAGTG TGAGAAAATG GTGTCATTTC ATTGCCAAGA TTTGAAGTTC TTCTACTTAT 541 GTTTTACAGT AGACTGTGCA TAGGAGTTCA ATAGCTATAA ATATTTTTTA GTTGTTGTAA 601 CTAGTTAGTG TAGGATTTAG TATTCTTTCT TTATTTTGGT TGTTATTTGA GTTTTTTCAA 661 AAATATTTGC ACCTATATAG TTACTAGTCT CCGGGCACGT GCGTTGCACG TGTATCCCAA 721 ATAAAAAAAA TAAATGAAAA CGTATATAAT AAACTATTAG GAGAATTATG ATGTTAAAAA 781 AAAAAAAAA Predicted gene structure (within gDNA segment 1 to 4507): Exon 1 1017 1054 ( 38 n); cDNA 595 631 ( 37 n); score: 0.579 Intron 1 1055 1509 ( 455 n); Pd: 0.101 (s: 0), Pa: 0.974 (s: 0) Exon 2 1510 1514 ( 5 n); cDNA 632 636 ( 5 n); score: 0.800 Intron 2 1515 2432 ( 918 n); Pd: 0.000 (s: 0), Pa: 0.743 (s: 0) Exon 3 
2433 2441 ( 9 n); cDNA 637 644 ( 8 n); score: 0.667 Intron 3 2442 3280 ( 839 n); Pd: 0.663 (s: 0), Pa: 0.887 (s: 0.52) Exon 4 3281 3322 ( 42 n); cDNA 645 684 ( 40 n); score: 0.524 Intron 4 3323 3677 ( 355 n); Pd: 0.900 (s: 0.52), Pa: 0.000 (s: 0.72) Exon 5 3678 3764 ( 87 n); cDNA 685 772 ( 88 n); score: 0.845 PPA cDNA 776 789 MATCH C06HBa0112G05.1-22+ SGN-E548970- 0.845 181 0.229 C PGS_C06HBa0112G05.1-22+_SGN-E548970- (1017 1054,1510 1514,2433 2441,3281 3322,3678 3764) Alignment (genomic DNA sequence = upper lines): TTGTGGAGCA TTATTGTGTG AGTACTTTAG TCTTTCTGGT GAGAAAAGTT CTAATCTTTG 1076 |||| ||| ||| | | | || ||||||| TTGTAACTAG TTAGTGTAGG ATTTAG-TAT TCTTTCTT.. .......... .......... 631 CATCTGGGGT TTGAATGTTT CTTGTTAGTA ATTGAGGGTT TGAGGAAGTA GTAGGTTCTG 1136 .......... .......... .......... .......... .......... .......... 631 TGAAGCATTC TCGTCTGAGT ACTTTAGCCC TCCCACATTG TGTGAACAAT CTCTTCTCAT 1196 .......... .......... .......... .......... .......... .......... 631 CGAGCATGCG TTTCCCCATC AGCTGAAGCA ACACTGTTAA TCTTTATCTC TGGTTGGTCG 1256 .......... .......... .......... .......... .......... .......... 631 CTGATATTAA AGTTGGGGTT GTTATAACCA AAAAATCAGG GTACATTTGA AAATAAAATA 1316 .......... .......... .......... .......... .......... .......... 631 TAATTCAACA TTCGAAAGCG GGTTATAAGT TGTTTTCCAA ATTTGATTTA CAACTTCGAA 1376 .......... .......... .......... .......... .......... .......... 631 CTTTTCGTAC ACAAACACTG GTTTTCGAAT AAAGTGGACA TTATATCAGA AAAAAAGTAA 1436 .......... .......... .......... .......... .......... .......... 631 ATAATATTTA TGGGTCCTTA ATTAAGTCTT TCAGTTCATT CCTCTGTTTT GCTTGATTGA 1496 .......... .......... .......... .......... .......... .......... 631 TGGTATTTCA CAGTAGTTGG TGGCCATAGA GAAGAAATGC CTTAGTACTT TGAATCGCTT 1556 || || .......... ...TATTT.. .......... .......... .......... .......... 636 ACTGGGTAAA GAAGCAAAAG CATATAGTGT ATGACTTGGC TACAACAAAC GAAAGGGATG 1616 .......... .......... .......... .......... .......... .......... 
636 TAAGAGGGCA CAATTTCGTC CACCTTTAGA AACTAAGCAT AAACTGAATC TAATTAAATT 1676 .......... .......... .......... .......... .......... .......... 636 TTTGTTTTGA TATGTGCACT TTACTAGCAA TGTTGCACAG AAAGTGTGAA CTACTGGTTC 1736 .......... .......... .......... .......... .......... .......... 636 GGTGTTATAA GTAGTTCAAT GATTTGAATT ATTGAAAGGG TTGAATATCT TGTAACATGG 1796 .......... .......... .......... .......... .......... .......... 636 TTATCAAATT AATGTTGCAT TTTTATCTCT TTCCATTTGA TTTATCACAA CTCCTCTTAT 1856 .......... .......... .......... .......... .......... .......... 636 ATTTGGACCT TCTTTTGCTG CTATTCTTGT TTAGCGCTGG GAATAAAAAC TATAGAAGCA 1916 .......... .......... .......... .......... .......... .......... 636 ATGTAACTCG TTTTTTGCTG CAGGAACCTG AGAGAACTGG AACTGGGAGA AAGTGAAGCA 1976 .......... .......... .......... .......... .......... .......... 636 GAAGACCTGA GTGGCCATTG GCTTAGTCAT TTTTCTGATA GTTGTACATC GCTTGTGTCA 2036 .......... .......... .......... .......... .......... .......... 636 CTTAACATTG CTTGTTTGGC TTCTGAGGTC AGCTTCTCAG CTTTGGAGCG TCTAGTTGCT 2096 .......... .......... .......... .......... .......... .......... 636 CGCTCTTCTC ATTTTAGGAC TCTTCGGCTC AATCGTGCTG TTCCCATTGA GAAACTTCCA 2156 .......... .......... .......... .......... .......... .......... 636 AAGCTACTTC GTCATGCTTC GAAGTTGGTT GAATTTGGTA CATGATCCTA CTCTGCTGAC 2216 .......... .......... .......... .......... .......... .......... 636 ATGCAGGCTG ATGTTTCTGA AGTTTTCGTA AATGTATCTC AAGCATTTTC AGGCTGTAAT 2276 .......... .......... .......... .......... .......... .......... 636 CAACTTAAAG GCTTGAGTGG GTTTTGGGAT GCTGTGCCAG CCTACTTTCC AACTATTTAT 2336 .......... .......... .......... .......... .......... .......... 636 CCAGTCTACT CCAAACTCAC CTCTTTGAAT TTAAGCTATG CTACCATTCA AATAGCTGAT 2396 .......... .......... .......... .......... .......... .......... 636 CTTTGCAAGC TCATTGGCAA TTGTTTCAAT TTGCAGCGGT TGTGGGTAGG TTCTAGCTTG 2456 ||| ||| .......... .......... .......... ......TGGT TGT-T..... .......... 
644 TGTTTTACTT TTGTATACTT ATCGAGTGTT TTCAATAGTC ATGGCTCAAT ACATTAACTG 2516 .......... .......... .......... .......... .......... .......... 644 TGTTTGTGAT AAAATAGATA AATTGATTGA TAGTTGTAAA CACACAAATT TCCAAGAGAA 2576 .......... .......... .......... .......... .......... .......... 644 GAACAATAAT CCTAACTTAG TAATTGTCTT GCTAATTTTG ATAATGCCAG GGATAAATAA 2636 .......... .......... .......... .......... .......... .......... 644 ACAACGTAAT GAACTTCAGT GTTTACTTAG ATAACATAGC TAAACAATTA ATTTCGAGTG 2696 .......... .......... .......... .......... .......... .......... 644 AACTCTTTGA ATAATTTGCT GGTCTGTCAT ACAAAGAATA AGTTAATGTT GAATCCTAGT 2756 .......... .......... .......... .......... .......... .......... 644 GTTGCACGAA TCCTACACTT TGACAATTCA TGTATATCTT TCCGGGTTAG ACGCTTCCAT 2816 .......... .......... .......... .......... .......... .......... 644 TAGTTTTTAG GCTTTTCTTG TCCAGTGGAT CTGCTGCATA ATTAGTTATC CAGCTTACGC 2876 .......... .......... .......... .......... .......... .......... 644 TTTAAGGCAC TTGGAATGGA TGTGATTGCA GGTTCTAGAC TACATTGAAG ATAGCGGTCT 2936 .......... .......... .......... .......... .......... .......... 644 TGAGGAGATT GCCAACACTT GTAAGGAACT TCAAGAGCTT AGGGTGTTTC CTTTTGATCC 2996 .......... .......... .......... .......... .......... .......... 644 ATTTGCTCCA GGACCTAATG TATCCTTGAC AGAGCAAGGC CTTGTAGCTG TCTCAATGGG 3056 .......... .......... .......... .......... .......... .......... 644 CTGCCCTAAG CTTTAGTCAG TTTTATACTT CTGCCGCCAA ATGACAAATG ACGCCTTAGT 3116 .......... .......... .......... .......... .......... .......... 644 TACTATTGCA AGGAACCGTC CTAACATGAT CCGATTTCGT TTGTGTATTA TCGAGCCTCA 3176 .......... .......... .......... .......... .......... .......... 644 AACTCCTGAC TACTTAATCC TTGAACCACT TGATGCTGGT TTTGGGGTCA TTGTGTAACA 3236 .......... .......... .......... .......... .......... .......... 644 CTGCAAAAAA TTGCAGCGAC TTTCTCTTTC TGGCCTCCTT ACAGATCGTG TGTTTGAGTA 3296 || || |||| | .......... .......... .......... .......... 
....AT-TTG AGTTTTTTCA 659 AATCGGGGTC CATGCTAAGA AGTTAGATAT GCTTTCCTTA GCTTTTGCAG GAGATAGTGA 3356 || | || ||| ||||| AAAATATTTG CA-CCTATAT AGTTAC.... .......... .......... .......... 684 TCTAGGCCTC CTATATGTTC TCTCTGGTTG TGAGAGCCTC CGTAAGTTGG AGATTAGAGA 3416 .......... .......... .......... .......... .......... .......... 684 CTGCCCTTTT GGCGATGAGG CTCTGTTGGC TAATGCTGCA AAGCTGGAGA GTGGAGACCA 3476 .......... .......... .......... .......... .......... .......... 684 TGCGATCCCT TTGGATGTCT AATTGTTCAG TAAGTTTTAA AGCATGTAAG CTGCTAGCCC 3536 .......... .......... .......... .......... .......... .......... 684 AGAAGTTGCC AGGCTTAATG TTGAAGTTAT AAACGAGAGG GGTCATCCGG ATACGAGACC 3596 .......... .......... .......... .......... .......... .......... 684 AGAAAGTTGC TCTATTGAGA AACTTTATAT ACACAAGACA GTGTCAGGAA GGAGGTTCGA 3656 .......... .......... .......... .......... .......... .......... 684 CACTCCTGGT TTTGTTTGGA CTAGGCTCCG GGCACGTGTA TTCCAAATAA AAATC-AATA 3715 ||| ||||| |||||||| || || | | | |||| .......... .......... 
.TAGTCTCCG GGCACGTGCG TTGCACGTGT ATCCCAAATA 723 AAAAAAATAA ATGAAAACGT ATATAATAAA CTATTAGGAG AATTATGAT 3764 |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| AAAAAAATAA ATGAAAACGT ATATAATAAA CTATTAGGAG AATTATGAT 772 hqPGS_C06HBa0112G05.1-22+_SGN-E548970- (3678 3764) ******************************************************************************** EST sequence 2 -strand 457 n (File: SGN-E544869-) 1 ATCATCCAGG GAGAAGAATG GAACATGGGG GAGGAAGACA CCTTTGAGAA TCTCAAATTT 61 TTGAAGTGGG GTTTACGGAC TTTTTTCAAG TGGGAGGTTG GAGAGAAATC CTTCCCCAAT 121 CTTGAGAAAC TAAAACTGCA GGAATGTGGT AAGCTTGAGG AGATTCCCCC TAGTTTTGGA 181 GATATTTATT CATTGAAATT TATGGAAATT GTAAATAGTC CTCAACTTGA AGATTCTGCT 241 CTCAAGATTA AGGAATACGT TGAAGAGATG AGAGGAGGGA GCGAGCTTCA GATCCTTGGC 301 CAGAAGAATA TCCCCTTATT TAAGTAGCAT TAGGGTTGAA AAGTAGATTG TACTTTGCAG 361 GGTACATTGT ATATGATTAA GAAAACTTTG TTGCAGTTAT GAAATATTTT TGTGGATTTC 421 TCAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAGGAA Predicted gene structure (within gDNA segment 6530 to 4495): Exon 1 5839 5415 ( 425 n); cDNA 1 425 ( 425 n); score: 0.958 PPA cDNA 426 454 MATCH C06HBa0112G05.1-22- SGN-E544869- 0.958 425 0.930 C PGS_C06HBa0112G05.1-22-_SGN-E544869- (5839 5415) Alignment (genomic DNA sequence = upper lines): ATCATCCATG GAGAAGAATG GAACATGGGG GAGGAAGACA CCTTTGAGAA TCTCAAATTT 5780 |||||||| | |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATCATCCAGG GAGAAGAATG GAACATGGGG GAGGAAGACA CCTTTGAGAA TCTCAAATTT 60 TTGAACTTGC GTCTACCGAC TCTTTCCAAG TGGGAGGTTG GAGAGGAATC CTTCCCCAAT 5720 ||||| | | || ||| ||| | ||| |||| |||||||||| ||||| |||| |||||||||| TTGAAGTGGG GTTTACGGAC TTTTTTCAAG TGGGAGGTTG GAGAGAAATC CTTCCCCAAT 120 CTTGAGAAAT TAAAACTGCA GGAATGTGGT AAGTTTGAGG AGATTCCACC TAGTTTTGGA 5660 ||||||||| |||||||||| |||||||||| ||| |||||| ||||||| || |||||||||| CTTGAGAAAC TAAAACTGCA GGAATGTGGT AAGCTTGAGG AGATTCCCCC TAGTTTTGGA 180 GATATTTATT CATTGAAATT TATCAAAATT GTAAAGAGTC CTCAACTTGA AGATTCTGCT 5600 |||||||||| |||||||||| ||| ||||| ||||| |||| |||||||||| |||||||||| GATATTTATT 
CATTGAAATT TATGGAAATT GTAAATAGTC CTCAACTTGA AGATTCTGCT 240 CTCAAGATTA AGGAATACGC TGAAGAAATG AGAGGAGGGA GCGAGCTTCA GATCCTTGGC 5540 |||||||||| ||||||||| |||||| ||| |||||||||| |||||||||| |||||||||| CTCAAGATTA AGGAATACGT TGAAGAGATG AGAGGAGGGA GCGAGCTTCA GATCCTTGGC 300 CAGAAGAATA TCCCCTTATT TAAGTAGCAT TATGGTTGAA AAGTAGATTG TACTTTGCAG 5480 |||||||||| |||||||||| |||||||||| || ||||||| |||||||||| |||||||||| CAGAAGAATA TCCCCTTATT TAAGTAGCAT TAGGGTTGAA AAGTAGATTG TACTTTGCAG 360 GGTACATTGT ATATGATTAA GAAAACTTTG TTGCAGTTAT GAAATATTTT TGTGGATTTC 5420 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGTACATTGT ATATGATTAA GAAAACTTTG TTGCAGTTAT GAAATATTTT TGTGGATTTC 420 TCAAA 5415 ||||| TCAAA 425 hqPGS_C06HBa0112G05.1-22-_SGN-E544869- (5839 5415) ******************************************************************************** EST sequence 6 +strand 457 n (File: SGN-E544870+) 1 ATCATCCAGG GAGAAGAATG GAACATGGGG GAGGAAGACA CCTTTGAGAA TCTCAAATTT 61 TTGAACTTGC GTCTACCGAC TCTTTCCAAG TGGGAGGTTG GAGAGGAATC CTTCCCCAAT 121 CTTGAGAAAC TAAAACTGCA GGAATGTGGT AAGCTTGAGG AGATTCCACC TAGTTTTGGA 181 GATATTTATT CATTGAAATT TATCGAAATT GTAAATAGTC CTCAACTTGA AGATTCTGCT 241 CTCAAGATTA AGGAATACGC TGAAGAGATG AGAGGAGGGA GCGAGCTTCA GATCCTTGGC 301 CAGAAGAATA TCCCCTTATT TAAGTAGCAT TATGGTTGAA AAGTAGATTG TACTTTGCAG 361 GGTACATTGT ATATGATTAA GAAAACTTTG TTGCAGTTAT GAAATATTTT TGTGGATTTC 421 TCANNNNANA AAAAAAAAAA AAAAAAAAAA AAAAAAA Predicted gene structure (within gDNA segment 6520 to 4467): Exon 1 5839 5417 ( 423 n); cDNA 1 423 ( 423 n); score: 0.986 PPA cDNA 428 457 MATCH C06HBa0112G05.1-22- SGN-E544870+ 0.986 423 0.926 C PGS_C06HBa0112G05.1-22-_SGN-E544870+ (5839 5417) Alignment (genomic DNA sequence = upper lines): ATCATCCATG GAGAAGAATG GAACATGGGG GAGGAAGACA CCTTTGAGAA TCTCAAATTT 5780 |||||||| | |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATCATCCAGG GAGAAGAATG GAACATGGGG GAGGAAGACA CCTTTGAGAA TCTCAAATTT 60 TTGAACTTGC GTCTACCGAC TCTTTCCAAG TGGGAGGTTG GAGAGGAATC CTTCCCCAAT 5720 
|||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGAACTTGC GTCTACCGAC TCTTTCCAAG TGGGAGGTTG GAGAGGAATC CTTCCCCAAT 120 CTTGAGAAAT TAAAACTGCA GGAATGTGGT AAGTTTGAGG AGATTCCACC TAGTTTTGGA 5660 ||||||||| |||||||||| |||||||||| ||| |||||| |||||||||| |||||||||| CTTGAGAAAC TAAAACTGCA GGAATGTGGT AAGCTTGAGG AGATTCCACC TAGTTTTGGA 180 GATATTTATT CATTGAAATT TATCAAAATT GTAAAGAGTC CTCAACTTGA AGATTCTGCT 5600 |||||||||| |||||||||| |||| ||||| ||||| |||| |||||||||| |||||||||| GATATTTATT CATTGAAATT TATCGAAATT GTAAATAGTC CTCAACTTGA AGATTCTGCT 240 CTCAAGATTA AGGAATACGC TGAAGAAATG AGAGGAGGGA GCGAGCTTCA GATCCTTGGC 5540 |||||||||| |||||||||| |||||| ||| |||||||||| |||||||||| |||||||||| CTCAAGATTA AGGAATACGC TGAAGAGATG AGAGGAGGGA GCGAGCTTCA GATCCTTGGC 300 CAGAAGAATA TCCCCTTATT TAAGTAGCAT TATGGTTGAA AAGTAGATTG TACTTTGCAG 5480 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAGAAGAATA TCCCCTTATT TAAGTAGCAT TATGGTTGAA AAGTAGATTG TACTTTGCAG 360 GGTACATTGT ATATGATTAA GAAAACTTTG TTGCAGTTAT GAAATATTTT TGTGGATTTC 5420 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGTACATTGT ATATGATTAA GAAAACTTTG TTGCAGTTAT GAAATATTTT TGTGGATTTC 420 TCA 5417 ||| TCA 423 hqPGS_C06HBa0112G05.1-22-_SGN-E544870+ (5839 5417) ******************************************************************************** EST sequence 4 +strand 453 n (File: SGN-E308524+) 1 TCATCCAGGG AGAAGAATGG AACATGGGGG AGGAAGACAC CTTTGAGAAT CTCAAATTTT 61 TGAACTTGCG TCTACCGACT CTTTCCAAGT GGGAGGTTGG AGAGGAATCC TTCCCCAATC 121 TTGAGAAACT AAAACTGCAG GAATGTGGTA AGCTTGAGGA GATTCCACCT AGTTTTGGAG 181 ATATTTATTC ATTGAAATTT ATCGAAATTG TAAATAGTCC TCAACTTGAA GATTCTGCTC 241 TCAAGATTAA GGAATACGCT GAAGAGATGA GAGGAGGGAG CGAGCTTCAG ATCCTTGGCC 301 AGAAGAATAT CCCCTTATTT AAGTAGCATT ATGGTTGAAA AGTAGATTGT ACTTTGCAGG 361 GTACATTGTA TATGATTAAG AAAACTTTGT TGCAGTTATG AAATATTTTT GTGGATTTCT 421 CNNNNNAAAA AAAAAAAAAA AAAAAAAATA AAA Predicted gene structure (within gDNA segment 6510 to 4488): Exon 1 5838 5418 ( 421 n); cDNA 1 421 
( 421 n); score: 0.986 PPA cDNA 427 453 MATCH C06HBa0112G05.1-22- SGN-E308524+ 0.986 421 0.929 C PGS_C06HBa0112G05.1-22-_SGN-E308524+ (5838 5418) Alignment (genomic DNA sequence = upper lines): TCATCCATGG AGAAGAATGG AACATGGGGG AGGAAGACAC CTTTGAGAAT CTCAAATTTT 5779 ||||||| || |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCATCCAGGG AGAAGAATGG AACATGGGGG AGGAAGACAC CTTTGAGAAT CTCAAATTTT 60 TGAACTTGCG TCTACCGACT CTTTCCAAGT GGGAGGTTGG AGAGGAATCC TTCCCCAATC 5719 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGAACTTGCG TCTACCGACT CTTTCCAAGT GGGAGGTTGG AGAGGAATCC TTCCCCAATC 120 TTGAGAAATT AAAACTGCAG GAATGTGGTA AGTTTGAGGA GATTCCACCT AGTTTTGGAG 5659 |||||||| | |||||||||| |||||||||| || ||||||| |||||||||| |||||||||| TTGAGAAACT AAAACTGCAG GAATGTGGTA AGCTTGAGGA GATTCCACCT AGTTTTGGAG 180 ATATTTATTC ATTGAAATTT ATCAAAATTG TAAAGAGTCC TCAACTTGAA GATTCTGCTC 5599 |||||||||| |||||||||| ||| |||||| |||| ||||| |||||||||| |||||||||| ATATTTATTC ATTGAAATTT ATCGAAATTG TAAATAGTCC TCAACTTGAA GATTCTGCTC 240 TCAAGATTAA GGAATACGCT GAAGAAATGA GAGGAGGGAG CGAGCTTCAG ATCCTTGGCC 5539 |||||||||| |||||||||| ||||| |||| |||||||||| |||||||||| |||||||||| TCAAGATTAA GGAATACGCT GAAGAGATGA GAGGAGGGAG CGAGCTTCAG ATCCTTGGCC 300 AGAAGAATAT CCCCTTATTT AAGTAGCATT ATGGTTGAAA AGTAGATTGT ACTTTGCAGG 5479 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGAAGAATAT CCCCTTATTT AAGTAGCATT ATGGTTGAAA AGTAGATTGT ACTTTGCAGG 360 GTACATTGTA TATGATTAAG AAAACTTTGT TGCAGTTATG AAATATTTTT GTGGATTTCT 5419 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTACATTGTA TATGATTAAG AAAACTTTGT TGCAGTTATG AAATATTTTT GTGGATTTCT 420 C 5418 | C 421 hqPGS_C06HBa0112G05.1-22-_SGN-E308524+ (5838 5418) Total number of EST alignments reported: 6 ________________________________________________________________________________ Predicted gene locations (3) in segment 1 to 9707: PGL 1 (+ strand): 844 2036 AGS-1 (844 1272,1871 1884,1940 2036) SCR (e 0.752 d 0.000 a 0.000,e 0.786 d 0.000 a 
0.999,e 0.887) Exon 1 844 1272 ( 429 n); score: 0.752 Intron 1 1273 1870 ( 598 n); Pd: 0.000 Pa: 0.000 Exon 2 1871 1884 ( 14 n); score: 0.786 Intron 2 1885 1939 ( 55 n); Pd: 0.000 Pa: 0.999 Exon 3 1940 2036 ( 97 n); score: 0.887 PGS (844 1272,1871 1884,1940 2036) SGN-E320435+ 3-phase translation of AGS-1 (+strand): . . . . . . 844 GCACTGTTGTCTAAGTACTTTAGCCGTTGCCGGTGACTTCATTAAACTGATTTGATAGAA A L L S K Y F S R C R - L H - T D L I E H C C L S T L A V A G D F I K L I - - N T V V - V L - P L P V T S L N - F D R . . . . . . 904 CAGAAGGTTTTGTGGAACATTGGTGTCTGAGTACTTCAGCCCTTACTGGTTGGGATGAGT Q K V L W N I G V - V L Q P L L V G M S R R F C G T L V S E Y F S P Y W L G - V T E G F V E H W C L S T S A L T G W D E . . . . . . 964 TTTAATCTATGCATCTGAGGAACTTTTCTTGCTTGTAATTGAAGCAGTGGGTTTTGTGGA F N L C I - G T F L A C N - S S G F C G L I Y A S E E L F L L V I E A V G F V E F - S M H L R N F S C L - L K Q W V L W . . . . . . 1024 GCATTATTGTGTGAGTACTTTAGTCTTTCTGGTGAGAAAAGTTCTAATCTTTGCATCTGG A L L C E Y F S L S G E K S S N L C I W H Y C V S T L V F L V R K V L I F A S G S I I V - V L - S F W - E K F - S L H L . . . . . . 1084 GGTTTGAATGTTTCTTGTTAGTAATTGAGGGTTTGAGGAAGTAGTAGGTTCTGTGAAGCA G L N V S C - - L R V - G S S R F C E A V - M F L V S N - G F E E V V G S V K H G F E C F L L V I E G L R K - - V L - S . . . . . . 1144 TTCTCGTCTGAGTACTTTAGCCCTCCCACATTGTGTGAACAATCTCTTCTCATCGAGCAT F S S E Y F S P P T L C E Q S L L I E H S R L S T L A L P H C V N N L F S S S M I L V - V L - P S H I V - T I S S H R A . . . . . . 1204 GCGTTTCCCCATCAGCTGAAGCAACACTGTTAATCTTTATCTCTGGTTGGTCGCTGATAT A F P H Q L K Q H C - S L S L V G R - Y R F P I S - S N T V N L Y L W L V A D I C V S P S A E A T L L I F I S G W S L I . : . . : . . . 1264 TAAAGTTGG : TTGCTGCTATTCTT : GAACCTGAGAGAACTGGAACTGGGAGAAAGTGAAGCA - S W : L L L F L : N L R E L E L G E S E A K V G : C C Y S : - T - E N W N W E K V K Q L K L : V A A I L : E P E R T G T G R K - S . . . . . . 
1977 GAAGACCTGAGTGGCCATTGGCTTAGTCATTTTTCTGATAGTTGTACATCGCTTGTGTCA E D L S G H W L S H F S D S C T S L V S K T - V A I G L V I F L I V V H R L C R R P E W P L A - S F F - - L Y I A C V Maximal non-overlapping open reading frames (>= 64 codons): none PGL 2 (+ strand): 3408 3764 AGS-1 (3408 3764) SCR (e 0.779) Exon 1 3408 3764 ( 357 n); score: 0.779 PGS (3408 3687) SGN-E541981- PGS (3678 3764) SGN-E548970- 3-phase translation of AGS-1 (+strand): . . . . . . 3408 GATTAGAGACTGCCCTTTTGGCGATGAGGCTCTGTTGGCTAATGCTGCAAAGCTGGAGAG D - R L P F W R - G S V G - C C K A G E I R D C P F G D E A L L A N A A K L E S L E T A L L A M R L C W L M L Q S W R . . . . . . 3468 TGGAGACCATGCGATCCCTTTGGATGTCTAATTGTTCAGTAAGTTTTAAAGCATGTAAGC W R P C D P F G C L I V Q - V L K H V S G D H A I P L D V - L F S K F - S M - A V E T M R S L W M S N C S V S F K A C K . . . . . . 3528 TGCTAGCCCAGAAGTTGCCAGGCTTAATGTTGAAGTTATAAACGAGAGGGGTCATCCGGA C - P R S C Q A - C - S Y K R E G S S G A S P E V A R L N V E V I N E R G H P D L L A Q K L P G L M L K L - T R G V I R . . . . . . 3588 TACGAGACCAGAAAGTTGCTCTATTGAGAAACTTTATATACACAAGACAGTGTCAGGAAG Y E T R K L L Y - E T L Y T Q D S V R K T R P E S C S I E K L Y I H K T V S G R I R D Q K V A L L R N F I Y T R Q C Q E . . . . . . 3648 GAGGTTCGACACTCCTGGTTTTGTTTGGACTAGGCTCCGGGCACGTGTATTCCAAATAAA E V R H S W F C L D - A P G T C I P N K R F D T P G F V W T R L R A R V F Q I K G G S T L L V L F G L G S G H V Y S K - . . . . . . 3708 AATCAATAAAAAAAATAAATGAAAACGTATATAATAAACTATTAGGAGAATTATGAT N Q - K K - M K T Y I I N Y - E N Y D I N K K N K - K R I - - T I R R I M K S I K K I N E N V Y N K L L G E L - Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0112G05.1-22+_PGL-2_AGS-1_PPS_1 (3526 3729) (frame '2'; 201 bp, 67 residues) 1 AASPEVARLN VEVINERGHP DTRPESCSIE KLYIHKTVSG RRFDTPGFVW TRLRARVFQI 61 KINKKNK- 3-phase translation of AGS-1 (-strand): . . . . . . 
3764 ATCATAATTCTCCTAATAGTTTATTATATACGTTTTCATTTATTTTTTTTATTGATTTTT I I I L L I V Y Y I R F H L F F L L I F S - F S - - F I I Y V F I Y F F Y - F L H N S P N S L L Y T F S F I F F I D F . . . . . . 3704 ATTTGGAATACACGTGCCCGGAGCCTAGTCCAAACAAAACCAGGAGTGTCGAACCTCCTT I W N T R A R S L V Q T K P G V S N L L F G I H V P G A - S K Q N Q E C R T S F Y L E Y T C P E P S P N K T R S V E P P . . . . . . 3644 CCTGACACTGTCTTGTGTATATAAAGTTTCTCAATAGAGCAACTTTCTGGTCTCGTATCC P D T V L C I - S F S I E Q L S G L V S L T L S C V Y K V S Q - S N F L V S Y P S - H C L V Y I K F L N R A T F W S R I . . . . . . 3584 GGATGACCCCTCTCGTTTATAACTTCAACATTAAGCCTGGCAACTTCTGGGCTAGCAGCT G - P L S F I T S T L S L A T S G L A A D D P S R L - L Q H - A W Q L L G - Q L R M T P L V Y N F N I K P G N F W A S S . . . . . . 3524 TACATGCTTTAAAACTTACTGAACAATTAGACATCCAAAGGGATCGCATGGTCTCCACTC Y M L - N L L N N - T S K G I A W S P L T C F K T Y - T I R H P K G S H G L H S L H A L K L T E Q L D I Q R D R M V S T . . . . . . 3464 TCCAGCTTTGCAGCATTAGCCAACAGAGCCTCATCGCCAAAAGGGCAGTCTCTAATC S S F A A L A N R A S S P K G Q S L I P A L Q H - P T E P H R Q K G S L - L Q L C S I S Q Q S L I A K R A V S N Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0112G05.1-22-_PGL-2_AGS-1_PPS_1 (3639 3409) (frame '0'; 231 bp, 77 residues) 1 HCLVYIKFLN RATFWSRIRM TPLVYNFNIK PGNFWASSLH ALKLTEQLDI QRDRMVSTLQ 61 LCSISQQSLI AKRAVSN PGL 3 (- strand): 5839 5415 AGS-1 (5839 5415) SCR (e 0.986) Exon 1 5839 5415 ( 425 n); score: 0.986 PGS (5839 5415) SGN-E544869- PGS (5839 5417) SGN-E544870+ PGS (5838 5418) SGN-E308524+ 3-phase translation of AGS-1 (-strand): . . . . . . 5839 ATCATCCATGGAGAAGAATGGAACATGGGGGAGGAAGACACCTTTGAGAATCTCAAATTT I I H G E E W N M G E E D T F E N L K F S S M E K N G T W G R K T P L R I S N F H P W R R M E H G G G R H L - E S Q I . . . . . . 
5779 TTGAACTTGCGTCTACCGACTCTTTCCAAGTGGGAGGTTGGAGAGGAATCCTTCCCCAAT L N L R L P T L S K W E V G E E S F P N - T C V Y R L F P S G R L E R N P S P I F E L A S T D S F Q V G G W R G I L P Q . . . . . . 5719 CTTGAGAAATTAAAACTGCAGGAATGTGGTAAGTTTGAGGAGATTCCACCTAGTTTTGGA L E K L K L Q E C G K F E E I P P S F G L R N - N C R N V V S L R R F H L V L E S - E I K T A G M W - V - G D S T - F W . . . . . . 5659 GATATTTATTCATTGAAATTTATCAAAATTGTAAAGAGTCCTCAACTTGAAGATTCTGCT D I Y S L K F I K I V K S P Q L E D S A I F I H - N L S K L - R V L N L K I L L R Y L F I E I Y Q N C K E S S T - R F C . . . . . . 5599 CTCAAGATTAAGGAATACGCTGAAGAAATGAGAGGAGGGAGCGAGCTTCAGATCCTTGGC L K I K E Y A E E M R G G S E L Q I L G S R L R N T L K K - E E G A S F R S L A S Q D - G I R - R N E R R E R A S D P W . . . . . . 5539 CAGAAGAATATCCCCTTATTTAAGTAGCATTATGGTTGAAAAGTAGATTGTACTTTGCAG Q K N I P L F K - H Y G - K V D C T L Q R R I S P Y L S S I M V E K - I V L C R P E E Y P L I - V A L W L K S R L Y F A . . . . . . 5479 GGTACATTGTATATGATTAAGAAAACTTTGTTGCAGTTATGAAATATTTTTGTGGATTTC G T L Y M I K K T L L Q L - N I F V D F V H C I - L R K L C C S Y E I F L W I S G Y I V Y D - E N F V A V M K Y F C G F . 5419 TCAAA S Q L K Maximal non-overlapping open reading frames (>= 64 codons): >C06HBa0112G05.1-22-_PGL-3_AGS-1_PPS_1 (5839 5513) (frame '1'; 324 bp, 108 residues) 1 IIHGEEWNMG EEDTFENLKF LNLRLPTLSK WEVGEESFPN LEKLKLQECG KFEEIPPSFG 61 DIYSLKFIKI VKSPQLEDSA LKIKEYAEEM RGGSELQILG QKNIPLFK- 3-phase translation of AGS-1 (+strand): . . . . . . 5415 TTTGAGAAATCCACAAAAATATTTCATAACTGCAACAAAGTTTTCTTAATCATATACAAT F E K S T K I F H N C N K V F L I I Y N L R N P Q K Y F I T A T K F S - S Y T M - E I H K N I S - L Q Q S F L N H I Q . . . . . . 5475 GTACCCTGCAAAGTACAATCTACTTTTCAACCATAATGCTACTTAAATAAGGGGATATTC V P C K V Q S T F Q P - C Y L N K G I F Y P A K Y N L L F N H N A T - I R G Y S C T L Q S T I Y F S T I M L L K - G D I . . . . . . 
5535 TTCTGGCCAAGGATCTGAAGCTCGCTCCCTCCTCTCATTTCTTCAGCGTATTCCTTAATC
F W P R I - S S L P P L I S S A Y S L I
S G Q G S E A R S L L S F L Q R I P - S
L L A K D L K L A P S S H F F S V F L N
. . . . . .
5595 TTGAGAGCAGAATCTTCAAGTTGAGGACTCTTTACAATTTTGATAAATTTCAATGAATAA
L R A E S S S - G L F T I L I N F N E -
- E Q N L Q V E D S L Q F - - I S M N K
L E S R I F K L R T L Y N F D K F Q - I
. . . . . .
5655 ATATCTCCAAAACTAGGTGGAATCTCCTCAAACTTACCACATTCCTGCAGTTTTAATTTC
I S P K L G G I S S N L P H S C S F N F
Y L Q N - V E S P Q T Y H I P A V L I S
N I S K T R W N L L K L T T F L Q F - F
. . . . . .
5715 TCAAGATTGGGGAAGGATTCCTCTCCAACCTCCCACTTGGAAAGAGTCGGTAGACGCAAG
S R L G K D S S P T S H L E R V G R R K
Q D W G R I P L Q P P T W K E S V D A S
L K I G E G F L S N L P L G K S R - T Q
. . . . . .
5775 TTCAAAAATTTGAGATTCTCAAAGGTGTCTTCCTCCCCCATGTTCCATTCTTCTCCATGG
F K N L R F S K V S S S P M F H S S P W
S K I - D S Q R C L P P P C S I L L H G
V Q K F E I L K G V F L P H V P F F S M
.
5835 ATGAT
M
-
D D

Maximal non-overlapping open reading frames (>= 64 codons): none

... finished at: Mon Aug 28 22:22:50 2006