FGENESH 1.1 Prediction of potential genes in Homo_sapiens genomic DNA
Time : Fri Mar 11 10:22:57 2005
Seq name: hox
Length of sequence: 500000
Number of predicted genes 18 in +chain 5 in -chain 13
Number of predicted exons 64 in +chain 21 in -chain 43
Positions of predicted genes and exons:

   G Str   Feature   Start        End    Score           ORF           Len

   1 +      TSS       3097               -8.79
   1 +    1 CDSf      4260 -      4676   11.71      4260 -      4676    417
   1 +    2 CDSl      4829 -      5011   10.98      4829 -      5011    183
   1 +      PolA      5882                1.12

   2 -      PolA     63392                1.12
   2 -    1 CDSo     65008 -     65430   27.55     65008 -     65430    423
   2 -      TSS      66831               -9.49

   3 +      TSS     105720               -7.99
   3 +    1 CDSf    106574 -    107004   29.33    106574 -    107002    429
   3 +    2 CDSi    124795 -    124826    0.69    124796 -    124825     30
   3 +    3 CDSi    130250 -    130533   23.87    130252 -    130533    282
   3 +    4 CDSi    169967 -    170267    7.57    169967 -    170266    300
   3 +    5 CDSl    170463 -    170728    6.30    170465 -    170728    264
   3 +      PolA    171045               -1.08

   4 -      PolA    175425                1.12
   4 -    1 CDSl    176539 -    176894   37.05    176539 -    176892    354
   4 -    2 CDSi    177360 -    177705    5.12    177361 -    177705    345
   4 -    3 CDSf    177850 -    177996    6.29    177850 -    177996    147
   4 -      TSS     179280               -6.09

   5 +      TSS     179689               -9.89
   5 +    1 CDSf    180044 -    180249    3.94    180044 -    180247    204
   5 +    2 CDSl    181173 -    181284    6.68    181174 -    181284    111
   5 +      PolA    181506                1.12

   6 -      PolA    182472                1.12
   6 -    1 CDSl    182825 -    183564   51.70    182825 -    183562    738
   6 -    2 CDSi    186047 -    186193    5.98    186048 -    186191    144
   6 -    3 CDSi    188042 -    188275    0.40    188043 -    188273    231
   6 -    4 CDSf    188916 -    189000    5.30    188917 -    189000     84
   6 -      TSS     189252               -9.89

   7 -      PolA    189406               -1.08
   7 -    1 CDSl    190014 -    190819   86.26    190014 -    190817    804
   7 -    2 CDSf    192538 -    192739   27.65    192539 -    192739    201
   7 -      TSS     192980              -13.39

   8 +      TSS     203006               -7.49
   8 +    1 CDSf    204074 -    204125    4.76    204074 -    204124     51
   8 +    2 CDSi    205528 -    205737    9.61    205530 -    205736    207
   8 +    3 CDSi    206430 -    206597    6.22    206432 -    206596    165
   8 +    4 CDSl    217609 -    217751   -2.49    217611 -    217751    141
   8 +      PolA    217790                1.12

   9 -      PolA    223168                1.12
   9 -    1 CDSl    223934 -    224184   18.12    223934 -    224182    249
   9 -    2 CDSf    225145 -    225589   49.39    225146 -    225589    444
   9 -      TSS     226187              -10.69

  10 -      PolA    226235                1.12
  10 -    1 CDSl    227757 -    228016   41.82    227757 -    228014    258
  10 -    2 CDSf    229407 -    229848   39.94    229408 -    229848    441
  10 -      TSS     229862              -13.29

  11 -      PolA    235837                1.12
  11 -    1 CDSl    236698 -    236831    0.97    236698 -    236829    132
  11 -    2 CDSi    237124 -    237321   44.65    237125 -    237319    195
  11 -    3 CDSi    238266 -    238536   36.52    238267 -    238536    270
  11 -    4 CDSi    241442 -    241621    1.60    241442 -    241621    180
  11 -    5 CDSf    242800 -    243225   18.23    242800 -    243225    426
  11 -      TSS     243520              -10.19

  12 -      PolA    245022                1.12
  12 -    1 CDSl    245702 -    245940   31.50    245702 -    245938    237
  12 -    2 CDSf    246977 -    247556   70.75    246978 -    247556    579
  12 -      TSS     247755               -8.09

  13 -      PolA    253002                1.12
  13 -    1 CDSl    253998 -    254272   24.28    253998 -    254270    273
  13 -    2 CDSi    255448 -    255922   70.25    255449 -    255922    474
  13 -    3 CDSi    264985 -    265127   21.71    264985 -    265125    141
  13 -    4 CDSi    266223 -    266280   -0.33    266224 -    266280     57
  13 -    5 CDSi    266857 -    267135   14.80    266857 -    267135    279
  13 -    6 CDSf    267550 -    267579   11.55    267550 -    267579     30
  13 -      TSS     268637              -11.79

  14 -      PolA    269351                1.12
  14 -    1 CDSl    269414 -    269475    1.56    269414 -    269473     60
  14 -    2 CDSi    269949 -    270365   17.62    269950 -    270363    414
  14 -    3 CDSi    273988 -    274165    7.02    273989 -    274165    177
  14 -    4 CDSi    274454 -    274927    8.87    274454 -    274927    474
  14 -    5 CDSi    280369 -    280541    8.82    280369 -    280539    171
  14 -    6 CDSi    281255 -    281527    7.17    281256 -    281525    270
  14 -    7 CDSi    284544 -    284635   14.73    284545 -    284634     90
  14 -    8 CDSf    286945 -    286985    2.31    286947 -    286985     39
  14 -      TSS     287813               -7.99

  15 -      PolA    295029                1.12
  15 -    1 CDSl    295308 -    295751   14.33    295308 -    295751    444
  15 -    2 CDSf    323648 -    323701    5.54    323648 -    323701     54
  15 -      TSS     323758               -9.39

  16 +      TSS     324793              -10.79
  16 +    1 CDSf    325130 -    325336   17.17    325130 -    325336    207
  16 +    2 CDSi    325451 -    325556    8.86    325451 -    325555    105
  16 +    3 CDSi    326187 -    326248    0.04    326189 -    326248     60
  16 +    4 CDSi    327074 -    327403   39.23    327074 -    327403    330
  16 +    5 CDSi    327985 -    328506   64.07    327985 -    328506    522
  16 +    6 CDSi    330789 -    330862    0.88    330789 -    330860     72
  16 +    7 CDSi    331942 -    332172    9.79    331943 -    332170    228
  16 +    8 CDSl    332247 -    332385    0.89    332248 -    332385    138
  16 +      PolA    332573                1.12

  17 -      PolA    350365                1.12
  17 -    1 CDSl    351431 -    351631    5.75    351431 -    351631    201
  17 -    2 CDSf    351742 -    351810   13.90    351742 -    351810     69
  17 -      TSS     354419               -5.39

  18 -      PolA    462041                1.12
  18 -    1 CDSl    462161 -    462319    6.24    462161 -    462319    159
  18 -    2 CDSi    462775 -    462792    1.90    462775 -    462792     18
  18 -    3 CDSi    479454 -    479548    1.77    479454 -    479546     93
  18 -    4 CDSf    479861 -    479933    3.91    479862 -    479933     72
  18 -      TSS     480516               -7.09

Predicted protein(s):
>FGENESH: 1 2 exon (s) 4260 - 5011 199 aa, chain +
MKQEVKKVVNPLLEKRPKNFGIGQDIQPKRDLTCFKKWSHYIRLQWQRVILCKWLKVHPE
INQFTQALHHQTATQLLQLAHNYRPNKRKTKQEKKQRLLVRAEKKACSKGDISTRRPLVL
QAGVNTVTTLVENKKAQLVVNLEDKEALAKLVEAIRTNYNNRYDEICRHWGGNVLGPKSV
ARIAKLEKAKAKELATKLG
>FGENESH: 2 1 exon (s) 65008 - 65430 140 aa, chain -
MSAYAFYVQTCREEHRKKNPEVPVNFAEFSKKCSERWKTMSGKDKSKFDEITKADKMRYD
QEMKDYGPAKGAKKKKDPNASKRPLSGFFLFCSELGPKIKSTNPTISIRDMAKKLGEMWN
NLNDSEKQPYITKAAKLKEK
>FGENESH: 3 5 exon (s) 106574 - 170728 437 aa, chain +
MELQEIQLKEAKHIAEEADRKYEEVPRKLVIIEGDLECTEERAELAESRCQETDEQIRLM
DQNLKCLSDAEEKYSQKEDIYEEEIKILTDKLKEAETRAEFTERLVAKLEKTTDDLEYKL
KCNKEENLCTQRMLYQTLLDLNEISSVSIQISNYSFRGCCDDQNKGRFDGPEAQEEACSG
ERTYQELLVNQNPIVQPLASRRLTRNLYKCIKKAMKQKQLRRGVKEVQKFVNKGEKGIMV
LAEDTLPIEEVLGPEALFGRVRWRPGAAQTAPQEPSKLGDAGRKKREASLSPETKRSSSQ
PRGPVRTHFPFPPKPPTDAGRETLRPKMGSEREPGKSPDAAPFPPRTGAGSRRSCGISSR
SCRSLAGRGQALVPPARSPQPQQSRSSRGRRCPSDSIARADLFPSLGQLTPAHPKGKTPS
VPQNGKIPVHTLPEPCC
>FGENESH: 4 3 exon (s) 176539 - 177996 282 aa, chain -
MNSFLEYPILSSGDSGTCSARAYPSDHRITTFQSCAVSANSCGGDDRFLNFSAPYSPYAL
NQEADVSGGYPQCAPAVYSGNLSSPMVQHHHHHQGYAGGAVGSPQYIHHSYGQEHQSLAL
ATYNNSLSPLHASHQEACRSPASETSSPAQTFDWMKVKRNPPKTGKVGEYGYLGQPNAVR
TNFTTKQLTELEKEFHFNKYLTRARRVEIAASLQLNETQVKIWFQNRRMKQKKREKEGLL
PISPATPPGNDEKAEESSEKSSSSPCVPSPGSSTSDTLTTSH
>FGENESH: 5 2 exon (s) 180044 - 181284 105 aa, chain +
MVKREHGQERPTFWGWAATPAPVSAPGNPPTGEGERQGSPPGGGFLGSTSFQRRGEKELL
WERGQDVSRKRLIYERRTNRIQELRSPGRCRDALLRAELGARNQP
>FGENESH: 6 4 exon (s) 182825 - 189000 401 aa, chain -
MGPGQAIRLEARRILRMAAGAPGGCGSSGAKYYISTPGFKIGLLTDWLEEAKEGTVKGNI
FIFRFKDSSRKESNSENADLRPASPSSPPSELEARNCFRLGIRSIGLLFPTTEPGSKHPR
CPRIRKVWEAVQAAREHSLYLNQPAPNQSDKDKKKESLEIADGSGGGSRRLRTAYTNTQL
LELEKEFHFNKYLCRPRRVEIAALLDLTERQVKVWFQNRRMKHKRQTQCKENQNSEGKCK
SLEDSEKVEEDEEEKTLFEQALSVSGALLEREGYTFQQNALSQQQAPNGHNGDSQSFPVS
PLTSNEKNLKHFQHQSPTVPNCLSTMGQNCGAGLNNDSPEALEVPSLQDFSVFSTDSCLQ
LSDAVSPSLPGSLDSPVDISADSLDFFTDTLTTIDLQHLNY
>FGENESH: 7 2 exon (s) 190014 - 192739 335 aa, chain -
MQKATYYDSSAIYGGYPYQAANGFAYNANQQPYPASAALGADGEYHRPACSLQSPSSAGG
HPKAHELSESCAGDKSPPGQASSKRARTAYTSAQLVELEKEFHFNRYLCRPRRVEMANLL
NLTERQIKIWFQNRRMKYKKDQKGKGMLTSSGGQSPSRSPVPPGAGGYLNSMHSLVNSVP
YEPQSPPPFSKPPQGTYGLPPASYPASLPSCAPPPPPQKRYTAAGAGAGGTPDYDPHAHG
LQGNGSYGTPHIQGSPVFVGGSYVEPMSNSGPALFGLTHLPHAASGAMDYGGAGPLGSGH
HHGPGPGEPHPTYTDLTGHHPSQGRIQEAPKLTHL
>FGENESH: 8 4 exon (s) 204074 - 217751 190 aa, chain +
MSPDERRKKTRCLRRPKVFATLSCPLTSKWSPSRQSAFWEWEMMGKEERIVRSPDPGLEK
FCAPLGLCGPFASTDLSLPRLPLHSDPAFPSVNQNITNCLRIRPTSSPKPGCDRRKPKPE
LTFPRTRRAPWTRRNPAEKAPRSDLRTLCIQRTVVSEKAYSVFENCKDYILQEVSVFAFI
LTHFRGHHVY
>FGENESH: 9 2 exon (s) 223934 - 225589 231 aa, chain -
MHSGRYGYGYNGMDLSVGRSGSGHFGSGERARSYAASASAAPAEPRYSQPATSTHSPQPD
PLPCSAVAPSPGSDSHHGGKNSLSNSSGASADAGSTHISSREGVGTASGAEEDAPASSEQ
ASAQSEPSPAPPAQPQIYPWMRKLHISHDNIGGPEGKRARTAYTRYQTLELEKEFHFNRY
LTRRRRIEIAHALCLSERQIKIWFQNRRMKWKKDNKLKSMSMAAAGGAFRP
>FGENESH: 10 2 exon (s) 227757 - 229848 233 aa, chain -
MSSYFVNPTFPGSLPSGQDSFLGQLPLYQAGYDALRPFPASYGASSLPDKTYTSPCFYQQ
SNSVLACNRASYEYGASCFYSDKDLSGASPSGSGKQRGPGDYLHFSPEQQYKPDSSSGQG
KALHDEGADRKYTSPVYPWMQRMNSCAGAVYGSHGRRGRQTYTRYQTLELEKEFHFNRYL
TRRRRIEIANALCLTERQIKIWFQNRRMKWKKENKLINSTQPSGEDSEAKAGE
>FGENESH: 11 5 exon (s) 236698 - 243225 402 aa, chain -
MQEASAPPSPASATLVKFLRCRLDTSLSGMGAPDFSEQPPYIFWAAQAWLEASLGNKESA
GACPTLTARASLTPSRGPSAAGLQLSGRPRGCSSRRGRGSRVTVLSWDRPDTARRQRAPA
DWRTQEERLQPQNLMTAQRVCTAGGWESAFPLGSLGVASPLRTFTQARPRTNPISFVHDR
YEFVFAPRLGITHSLDGEQMQLRSGYGAGAGAFASTVPGLYNVNSPLYQSPFASGYGLGA
DAYGNLPCASYDQNIPGLCSDLAKGACDKTDEGALHGAAEANFRIYPWMRSSGPDRKRGR
QTYTRYQTLELEKEFHFNRYLTRRRRIEIAHALCLTERQIKIWFQNRRMKWKKEHKDEAP
NFCGLSGRRDWVALDSLCLRGPQKTPPTPGQPALLWRVQNTT
>FGENESH: 12 2 exon (s) 245702 - 247556 272 aa, chain -
MATTGALGNYYVDSFLLGADAADELSVGRYAPGTLGQPPRQAATLAEHPDFSPCSFQSKA
TVFGASWNPVHAAGANAVPAAVYHHHHHHPYVHPQAPVAAAAPDGRYMRSWLEPTPGALS
FAGLPSSRPYGIKPEPLSARRGDCPTLDTHTLSLTDYACGSPPVDREKQPSEGAFSENNA
ENESGGDKPPIDPNNPAANWLHARSTRKKRCPYTKHQTLELEKEFLFNMYLTRDRRYEVA
RLLNLTERQVKIWFQNRRMKMKKINKDRAKDE
>FGENESH: 13 6 exon (s) 253998 - 267579 419 aa, chain -
MSYTGDYVLSTPSSRPMTYSYSSNLPQVQPVREVTFREYAIEPATKWHPRGNLAHCYSAE
ELVHRDCLQAPSAAGVPGDVLAKSSANVYHHPTPAVSSNFYSTLPSPAARPRSGPAPGWG
AGGGQRTRKKRCPYTKYQIRELEREFFFSVYINKEKRLQLSRMLNLTDRQNIKEESSYCL
YDSADKCPKVSATAAELAPFPRGPPPDGCALGTSSGVPVPGYFRLSQAYGTAKGYGSGGG
GAQQLGAGPFPAQPPGRGFDLPPALASGSADAARKERALDSPPPPTLACGSGGGSQGDEE
AHASSSAAEELSPAPSESSKASPEKDSLGNSKGENAANWLTAKSGRKKRCPYTKHQTLEL
EKEFLFNMYLTRERRLEISRSVHLTDRQVKIWFQNRRMKLKKMNRENRIRELTANFNFS
>FGENESH: 14 8 exon (s) 269414 - 286985 569 aa, chain -
MLLMGVWNLVVLKWVRGYGEVGGGGKNVELALAAVCTGSCAMARAAEEFSSRAKEFAFYH
QGYAAGPYHHHQPMPGYLDMPVVPGLGGPGESRHEPLGLPMESYQPWALPNGWNGQMYCP
KEQAQPPHLWKSTLPDVVSHPSDASSYRRGRKKRVPYTKVQLKELEREYATNKFITKDKR
RRISATTNLSERQPGAERWELSEGTESEPGTRGYAVHPSGRAVRPGGRQPRAALRPAPLP
RSAPLGCRPRSRPQPEAGTRRTGPWHRQVPVGTQSGTQELAEGGWGQELAEGGWREKRAL
ERALRGLALALSLWSGGLATAACKKGGESRAGGFVSGRGSADTAQNGGGKGPCWKGVPAA
TGGPGRVPVPEERGLPQSSRLRSPLRPSGGERERLQAQGKGRGWIGVLGPAPPHTPFRAA
SPSGHSASVQPAELPCQPLPERLSQPQPFLRNSPLSPADTRGIQPCQHPLGSALEEDMGG
SGYAEQRTPPYRPFCRDRSPGHQGKTDLIKRGAACRWGRWPGIPRWRATTTPSAQAQAEV
DCQNCYKTAALLPSGQATVDKYVKLKMHK
>FGENESH: 15 2 exon (s) 295308 - 323701 165 aa, chain -
MAFEVEESEVAENALKQQHPGPEIASSPWVTLARTNPHPTPERAQSAPFENCSRAQGVGL
AVTFQPGAASEELSLGSRGERPDWKEGAEEPWTAGVPDPAQEGACPRPRGLRATSLSCHP
VPGQDAGPGLRGPRAPGTCVGLSAALLCGAPSPWRASEPGIEAAR
>FGENESH: 16 8 exon (s) 325130 - 332385 556 aa, chain +
MESRKDMVVFLDGGQLGTLVGKRVSNLSEAVGSPLPEPPEKMVPRGCLSPRAVPPATRER
GGGGPEEEPGQPSSSDTESDFYEEIEVSCTPDCATGNAEYQHSKVPLLHPKEKELQGAGA
PPFSEAYSPTSMAFVCLCLYTPVLWTRCAGSGSEALVGSPNGGSETPKSNGGSGGGGSQG
TLACSASDQMRRYRTAFTREQIARLEKEFYRENYVSRPRRCELAAALNLPETTIKVWFQN
RRMKDKRQRLAMTWPHPADPAFYTYMMSHAAAAGGLPYPFPSHLPLPYYSPVGLGAASAA
SAAASPFSGSLRPLDTFRVLSQPYPRPELLCAFRHPPLYPGPAHGLGASAGGPCSCLACH
SGPANGLAPRAAAASDFTCASTSRSDSFLTFAPSVLSKASSVALDQREEPKAPECQILVN
FAVAPYWAEIACDSLILSPNVACPTNGFFSDYVDSTFLVSNPAPALSRQEVPLPRPGIEG
EEIQNASPCPRPFGGDMALDSHPEIGLRSSQLLPLGCFHYSPTAETPATICPPENKRECR
GGQLPPKVSRRMAFSK
>FGENESH: 17 2 exon (s) 351431 - 351810 89 aa, chain -
MAKIKARDLHEKEELLKQLDDLKTQKEKLRKFYKGKKYKPLDLWPKKTRAMCCRFNKHKE
NLKTKKRQRKERLCLLRKYAVKARVARCQ
>FGENESH: 18 4 exon (s) 462161 - 479933 114 aa, chain -
MERGNRVPGNMMRRVYREIGSADGGLSSSFQQQNETWYGYGSYVKMSRDSDKAVALRALA
YKLWLFIGYLHPLKGASSDDIGDLTEGTAIPAPVPGVDTFQNVKGSKVDPMVLC