FGENESH 2.4 Prediction of potential genes in Nicotiana_dicot genomic DNA Time : Tue Oct 16 16:05:44 2007 Seq name: C10HBa0041K23.1 AC171731.1 htgs_phase:3 submitted_to_sgn_as:082905.asm.C52 082905.asm.assem.ace.1 from 1 to 73387 Length of sequence: 73387 Number of predicted genes 30 in +chain 15 in -chain 15 Number of predicted exons 58 in +chain 29 in -chain 29 Positions of predicted genes and exons: Variant 1 from 1, Score:666.025732 G Str Feature Start End Score ORF Len 1 - PolA 7 -0.75 1 - 1 CDSl 22 - 191 6.18 22 - 189 168 1 - 2 CDSi 564 - 661 1.50 565 - 660 96 1 - 3 CDSf 858 - 916 9.94 860 - 916 57 2 + 1 CDSf 2200 - 2951 71.93 2200 - 2949 750 2 + 2 CDSl 3046 - 3403 36.03 3047 - 3403 357 2 + PolA 3621 2.25 3 + 1 CDSf 4157 - 4290 6.97 4157 - 4288 132 3 + 2 CDSl 5519 - 5777 19.89 5520 - 5777 258 3 + PolA 5861 0.75 4 + 1 CDSf 5902 - 5944 4.35 5902 - 5943 42 4 + 2 CDSl 6072 - 6193 11.27 6074 - 6193 120 4 + PolA 6342 0.75 5 + 1 CDSo 6763 - 6942 20.44 6763 - 6942 180 5 + PolA 7014 2.25 6 + 1 CDSf 7176 - 7222 -3.38 7176 - 7220 45 6 + 2 CDSi 7271 - 7314 -2.44 7272 - 7313 42 6 + 3 CDSl 7347 - 7501 11.03 7349 - 7501 153 6 + PolA 7704 2.25 7 - PolA 8599 2.25 7 - 1 CDSl 8825 - 8882 -2.59 8825 - 8881 57 7 - 2 CDSf 9639 - 9763 7.40 9641 - 9763 123 8 + 1 CDSf 9995 - 10070 15.60 9995 - 10069 75 8 + 2 CDSl 10223 - 10305 -4.82 10225 - 10305 81 8 + PolA 10477 0.75 9 - PolA 10503 -0.55 9 - 1 CDSl 10597 - 10665 1.57 10597 - 10665 69 9 - 2 CDSf 10994 - 11104 12.18 10994 - 11104 111 10 - PolA 11193 0.75 10 - 1 CDSl 11387 - 11483 4.55 11387 - 11482 96 10 - 2 CDSf 12598 - 12716 5.63 12600 - 12716 117 11 + 1 CDSf 12763 - 12849 2.56 12763 - 12849 87 11 + 2 CDSl 13485 - 13796 20.82 13485 - 13796 312 11 + PolA 14027 2.25 12 + 1 CDSf 14431 - 14565 3.45 14431 - 14565 135 12 + 2 CDSi 14636 - 14961 19.94 14636 - 14959 324 12 + 3 CDSl 15178 - 15394 24.36 15179 - 15394 216 12 + PolA 15603 0.75 13 - PolA 15623 -0.55 13 - 1 CDSo 15650 - 15853 13.05 15650 - 15853 204 14 - PolA 15943 -2.85 14 - 1 CDSl 15957 - 16142 10.18 15957 - 16142 186 14 - 2 CDSi 16624 - 16685 11.52 16624 - 16683 60 14 - 3 CDSf 16793 - 16907 0.53 16794 - 16907 114 15 - PolA 18075 0.75 15 - 1 CDSo 18411 - 19469 103.65 18411 - 19469 1059 16 - PolA 21534 0.75 16 - 1 CDSo 21614 - 21685 -0.04 21614 - 21685 72 17 + 1 CDSf 23426 - 23696 5.65 23426 - 23695 270 17 + 2 CDSl 23759 - 23889 -2.07 23761 - 23889 129 17 + PolA 23907 0.75 18 - PolA 24840 2.25 18 - 1 CDSo 24976 - 25218 12.82 24976 - 25218 243 19 + 1 CDSf 25599 - 25625 -2.64 25599 - 25625 27 19 + 2 CDSl 25694 - 25771 6.07 25694 - 25771 78 19 + PolA 25775 2.25 20 - PolA 25787 0.75 20 - 1 CDSl 25840 - 25968 11.10 25840 - 25968 129 20 - 2 CDSi 26046 - 26141 13.07 26046 - 26141 96 20 - 3 CDSi 26256 - 26337 11.87 26256 - 26336 81 20 - 4 CDSi 26426 - 26482 2.34 26428 - 26481 54 20 - 5 CDSi 26615 - 26657 4.69 26617 - 26655 39 20 - 6 CDSf 27314 - 27416 14.73 27315 - 27416 102 21 + 1 CDSo 27607 - 27681 -1.10 27607 - 27681 75 21 + PolA 27805 2.25 22 + 1 CDSf 44987 - 45121 15.54 44987 - 45121 135 22 + 2 CDSl 45518 - 45709 18.18 45518 - 45709 192 22 + PolA 45835 -2.65 23 + 1 CDSf 51354 - 51450 9.10 51354 - 51449 96 23 + 2 CDSl 51499 - 51866 30.34 51501 - 51866 366 23 + PolA 51921 -6.15 24 + 1 CDSf 51934 - 52031 1.96 51934 - 52029 96 24 + 2 CDSl 52062 - 52296 15.83 52063 - 52296 234 24 + PolA 52430 2.25 25 - PolA 54104 2.25 25 - 1 CDSo 54167 - 54331 15.18 54167 - 54331 165 26 + 1 CDSo 55189 - 55305 2.07 55189 - 55305 117 26 + PolA 55332 2.25 27 - PolA 55401 0.75 27 - 1 CDSo 55538 - 55876 34.08 55538 - 55876 339 28 - PolA 58373 2.25 28 - 1 CDSo 58403 - 58477 -0.57 58403 - 58477 75 29 - PolA 59518 2.25 29 - 1 CDSo 59559 - 59654 0.26 59559 - 59654 96 30 - PolA 61177 2.25 30 - 1 CDSl 61200 - 61250 8.28 61200 - 61250 51 30 - 2 CDSi 61804 - 61866 11.91 61804 - 61866 63 30 - 3 CDSf 62107 - 62355 10.06 62107 - 62355 249 Predicted protein(s): >FGENESH: 1 3 exon (s) 22 - 916 108 aa, chain - MYFAPTHANKNEKATSRYIILLVQDYIICNLHCVLFNHVKICHFTWHNRLYKESAINASS NCEAAKLTQNQNKPTAGKKQYKPRSVTESKQFGSFIERKVNSSDEDAI >FGENESH: 2 2 exon (s) 2200 - 3403 369 aa, chain + MAAIRLSTLKRTLNFSYKLANQIDVRPSYACINGNLHSREPTYSKNYADPKCFNTREIHS ASGTIHVTSYLSGRSDNKSLWGSKSIVVTSPIWNYRWYSSSFSSKGDSPKGSEVSTGASG SDMDTGGVSGSEWVGNIKEAWRTATDAVTSTGEKVKEASSEMTPYVEQVLNAHPYLRDVI VPVAGTLTGTLMAWVVLPRLLRRFHKYSMQGPAALLPGSSIWGQVSYERSIWGAMEDPVR YLITFMAFSQIAVMVAPSTIASQYLLQTWRGAAILSFVWFLQRWKTNVISRALAVKSLEV GDRDRLLTLDRFSSVGLFILGLMTLAEACGVAVQSILTVGGIGGEISQIGSHLSSYLLGG KFPTNKRSP >FGENESH: 3 2 exon (s) 4157 - 5777 130 aa, chain + MGINPCFTHAKNHSPNPLKYGRIGRVTKKWAHFAGTNSNHGILLICELYFDVSSSIVWRK VSPSFYIRTHVKHVVLHSTDDVSTWTILTSQSRNGNKQCDIQQWTDLLIGGHVKMGINFE IICSAKWWVL >FGENESH: 4 2 exon (s) 5902 - 6193 54 aa, chain + MSAMILIGSILVCAGVATAFAARDILGNVLSGLSVQLSQPFSVGDTIKVCLKSS >FGENESH: 5 1 exon (s) 6763 - 6942 59 aa, chain + MDGLRRKDTDELWCKDLHLETFIWMDCDTRHYIGVGEAGKEEVHLCPKLRTKSCLLSEI >FGENESH: 6 3 exon (s) 7176 - 7501 81 aa, chain + MSCLTETTFAGWISGRKIPCYCPKFTIFQSTTEISLRHPVDAKLPKEGNCQLSKYNKLIK AKELMSSSYSWSFFWGCYHFW >FGENESH: 7 2 exon (s) 8825 - 9763 60 aa, chain - MSFFLFFYARTKLTLGGHPSKVLWRSPTYAMGLNGHPLTMSTLCEPPRSKRGVYLDLSGI >FGENESH: 8 2 exon (s) 9995 - 10305 52 aa, chain + MSYLSSVCLLPVEPPDVVHRLLVLPDIPLNGHAYHQQITKKVYYYQLLRTAK >FGENESH: 9 2 exon (s) 10597 - 11104 59 aa, chain - MALDQYENKEQIDVLEENGPNMYLVYSVRREPVDILREEHDADVINYLAKYLDSPLNMA >FGENESH: 10 2 exon (s) 11387 - 12716 71 aa, chain - MLRMIIRPTMLYGMECWPNKNTHVLKINVVKMRMLRWMHGGSNKRFSVRFASVCLQKVLA LQTGHENLSGN >FGENESH: 11 2 exon (s) 12763 - 13796 132 aa, chain + MKERLHPCEFHAPSTNDSGAVVKDQYVPPVIVNKSRAQWRAMVTTVPFQIEDFDIIVQIS DDVKSMLKSNPNVFLEKEAPYCYLSKIEKSFAELTLGCNLRYAVCLLFFSLSYYHCEIHL IYVLELYVYCNN >FGENESH: 12 3 exon (s) 14431 - 15394 225 aa, chain + MVTWLTLLFELVSGNPPGNCDPCLAISVTKLETGRKYSSGRMTGLVVWKPQGWNVNLKRF LNDWEVNRTAELLKVLEQCQGITTNEDRLFWKQHTRRTYTIKLTYVALNGTGQHTNIWPW EHNCKVKIPYKVASFTGIVGKKACLTHDNLFWNWDSCAQDVIYMEGFRKYWNLCFCVKEI EWVMPKATQEHLNAGTTWVGRQKKWWKKILVDNKEGRKLEMFSKK >FGENESH: 13 1 exon (s) 15650 - 15853 67 aa, chain - MSDTQRAQLLTDSTLKITSAFVPNILFCINHKADLKIKEHAFFCVMGSTFEVPIIRPKFG KINAEAV >FGENESH: 14 3 exon (s) 15957 - 16907 120 aa, chain - MQENKWTSMTNSVQTALKAVLRQERECYNPTFTTLTVRAVTVTRNTPKSPTRSIGILAKM KKTNQGNLTTRLRHLCSNSLTPRISQCAPMLLDYSNCLQKNILLGRKQFVLALKYKKDGN >FGENESH: 15 1 exon (s) 18411 - 19469 352 aa, chain - MSRPQEPHRPFLPFGNPFKFILPKGSYLSPKLLALLNAFEESLTERVKSLKPGGKEDTLT LAWMTQAISTLCAIHTDVKTFITDLELPVCDWDEKWIDVYLDNSVKLLDICIAFSSDISR LNQGHLYLQCGLHNLDGTSNQFMKARSSFDGWKQHINSKNPRLENCFAILDSLTESLNLP KIKNSAKGKVLMRAMYGVRVVTVFILSMFAVTFSGSTKELKDLQIHETCLWTEAFVDVRD FISQEIRSIYSSGRITSLKELEVVDTSVKKLYPLIQDGVDPNEAEQLQLLTSNLAEKAEK LSGGLDLLAKEADRFFHILLTGRDSLLCNLRIDNTVSNPAEVNNNVERKEVR >FGENESH: 16 1 exon (s) 21614 - 21685 23 aa, chain - MTKIPLATCRHVDTSCQFVASST >FGENESH: 17 2 exon (s) 23426 - 23889 133 aa, chain + MNVKIDAQVASSILDSYFKETCRLMRILYIVLNIDQMEWTKKRLIFGIRTKNVSSKFRGK LNNVVVRPTMLYVTKCMQVEILSLEDEHRIVSYLNYYSLCIKIFLDVGVPICTRCRLRTN TRPTQHQYVVNTK >FGENESH: 18 1 exon (s) 24976 - 25218 80 aa, chain - MNKRDGKFGARPTTAPPIQHMSRLDQPLPPTIAYAPQSYPPPPPPQQPYGYPPPPQQPPI YPPPNYAPSGVGYPPPGYPR >FGENESH: 19 2 exon (s) 25599 - 25771 34 aa, chain + MFFETHDVKMFKLSHTCIQLQTLQQEKGKVNRDG >FGENESH: 20 6 exon (s) 25840 - 27416 169 aa, chain - MASLDAEMEKMKFRQNYQNHWHTDLLRAPQSDPLCSRRRLTKMICLCRCGPCASYILRKR ALYNDMSRYTCCGGYMPCSGRCGESHCPELCLCTEVFLCFANSVASTRFMLQDEFNLQTT KCDNCIIGFMFCLQQVACIFSCIACITGNDELQEASRVLNCCSDMVYCT >FGENESH: 21 1 exon (s) 27607 - 27681 24 aa, chain + MNRNMKGKDIMFLEVEVGIPRLNE >FGENESH: 22 2 exon (s) 44987 - 45709 108 aa, chain + MSSIIQGAGFSLSPFTATRPRRMAPVVRAEAINPDSNKDEPSLTYGMKHLTSCERIRPSN NGTCNLTSSQKTTFFVSIYLIAFGTGGIKPCVSFFGADQIDDNDQNSS >FGENESH: 23 2 exon (s) 51354 - 51866 154 aa, chain + MELRFFDKDAMESESDKVDGSVNFVKALHNDSKWNYLSPVKKANLHALQGVHHLRLEDVD VKDSKSMKSVPSDAKTIGEVMIRGNTVMNGYFKDVKATKSSYKGGWFRSGDLLVRHQGGC IEVKDRSIDTTISGLESISSIEVESVIFSHPSVF >FGENESH: 24 2 exon (s) 51934 - 52296 110 aa, chain + MDAMPILMRSSSIVAIVCLNTLLLGQLFLSCQLQRNGPHFRIPSACLSLFDTLSTNSDGE TRLSFLQLINLFLGLASEGENEENLGEWVDGNRRFERMGRVILNGVELLD >FGENESH: 25 1 exon (s) 54167 - 54331 54 aa, chain - MWVDVTELEETNGHMSCKKLMIMRVTRTNGHISYKTVMIMRVTRNNGYISNNNQ >FGENESH: 26 1 exon (s) 55189 - 55305 38 aa, chain + MDRHSFRRLHLFYIVYAFPSTSLNNPEITHPKIKGKSR >FGENESH: 27 1 exon (s) 55538 - 55876 112 aa, chain - MTKILLTLFSLALVLGQTYGNIQCGTDVIPKVMSCGGFILGDDAKPSQACCVGLQDLAKT AAASQTDRKDICLCFKAAMQGAKVKYDKAKQLPDLCHFTPFMPLEPNPDCSK >FGENESH: 28 1 exon (s) 58403 - 58477 24 aa, chain - MNNSNINLKIKTRFINMGGLENRR >FGENESH: 29 1 exon (s) 59559 - 59654 31 aa, chain - MTQSLRVPYPLSLQTSNNLESFKITRKLKET >FGENESH: 30 3 exon (s) 61200 - 62355 120 aa, chain - MTKILLILFSLALILSQTNGVIQCGTDVLPKVKPCGGFVLGQDPTPSNDCCVGLQDLAKI AAASQSDRKDICICFKALMKAGQGEELGQWLRGSKSSQSLLVGKVYSKKLKQNHPKVEEM