GENSCAN 1.0 Date run: 10-Mar-105 Time: 12:53:32 Sequence hox : 500000 bp : 44.36% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 4260 4676 417 2 0 79 7 257 0.493 13.13 1.02 Term + 4829 5011 183 1 0 50 36 239 0.892 12.44 1.03 PlyA + 5882 5887 6 1.05 2.02 PlyA - 6223 6218 6 1.05 2.01 Sngl - 65430 65008 423 0 0 79 49 395 0.724 31.00 2.00 Prom - 70950 70911 40 -3.46 3.00 Prom + 91950 91989 40 -5.26 3.01 Init + 105837 105931 95 2 2 116 44 149 0.657 13.15 3.02 Intr + 106008 106110 103 0 1 -7 25 184 0.948 2.78 3.03 Term + 106586 107008 423 1 0 -15 42 448 0.997 25.20 3.04 PlyA + 109265 109270 6 1.05 4.00 Prom + 111241 111280 40 -9.26 4.01 Init + 112818 112920 103 2 1 45 119 0 0.316 -0.80 4.02 Intr + 130250 130533 284 0 2 49 59 438 0.773 34.14 4.03 Intr + 130599 130700 102 2 0 49 46 163 0.275 8.57 4.04 Term + 170555 170728 174 1 0 92 40 108 0.045 4.16 4.05 PlyA + 171974 171979 6 1.05 5.05 PlyA - 172097 172092 6 1.05 5.04 Term - 176894 176539 356 0 2 70 54 525 0.980 41.96 5.03 Intr - 177513 177360 154 0 1 32 82 70 0.406 0.45 5.02 Intr - 177768 177658 111 0 0 11 105 83 0.435 2.88 5.01 Init - 177996 177850 147 0 0 72 -1 140 0.411 3.63 5.00 Prom - 181637 181598 40 -7.06 6.05 PlyA - 181855 181850 6 1.05 6.04 Term - 183564 182825 740 1 2 101 42 646 0.998 54.93 6.03 Intr - 186193 186047 147 2 0 119 37 104 0.955 8.61 6.02 Intr - 188275 188042 234 2 0 50 39 127 0.683 1.66 6.01 Init - 189000 188916 85 0 1 91 57 120 0.977 8.33 6.00 Prom - 189291 189252 40 -7.76 7.10 PlyA - 189580 189575 6 -0.45 7.09 Term - 190819 190014 806 2 2 148 44 950 0.999 89.99 7.08 Intr - 192352 192214 139 1 1 78 103 142 0.061 14.74 7.07 Intr - 192859 192672 188 2 2 103 9 127 0.010 5.71 7.06 Intr - 201815 201766 50 1 2 109 78 -8 0.042 -1.38 7.05 Intr - 204741 204658 84 2 0 108 87 67 0.201 7.54 7.04 Intr - 206573 206533 41 2 2 92 92 48 0.282 2.62 7.03 Intr - 211670 211513 158 0 2 19 53 261 0.980 15.43 7.02 Intr - 212286 212217 70 0 1 -11 104 155 0.724 5.95 7.01 Init - 215505 215497 9 0 0 84 57 34 0.432 -1.28 7.00 Prom - 217875 217836 40 -5.26 8.14 PlyA - 223173 223168 6 1.05 8.13 Term - 224184 223934 251 1 2 66 48 265 0.985 16.17 8.12 Intr - 225921 225145 777 1 0 58 58 679 0.509 53.35 8.11 Intr - 226708 226530 179 0 2 38 -21 145 0.145 -1.84 8.10 Intr - 228016 227761 256 2 1 92 -18 562 0.284 42.50 8.09 Intr - 229761 229407 355 0 1 49 94 456 0.350 37.16 8.08 Intr - 231722 231625 98 0 2 41 9 116 0.297 -1.27 8.07 Intr - 233151 232979 173 2 2 43 42 114 0.236 1.89 8.06 Intr - 234051 233906 146 0 2 101 81 30 0.279 2.68 8.05 Intr - 236294 236233 62 0 2 65 28 53 0.264 -4.55 8.04 Intr - 237321 237124 198 1 0 100 63 510 0.993 49.02 8.03 Intr - 238536 238266 271 0 1 69 96 506 0.984 46.61 8.02 Intr - 241621 241442 180 1 0 79 85 48 0.760 3.66 8.01 Init - 243225 242800 426 0 0 57 86 156 0.738 8.71 8.00 Prom - 244918 244879 40 -6.86 9.03 PlyA - 245027 245022 6 1.05 9.02 Term - 245940 245702 239 1 2 139 44 360 0.981 33.03 9.01 Init - 247556 246977 580 2 1 80 100 733 0.872 69.02 9.00 Prom - 249702 249663 40 -6.96 10.13 PlyA - 249923 249918 6 1.05 10.12 Term - 254272 253998 275 2 2 67 45 327 0.990 21.93 10.11 Intr - 255922 255448 475 1 1 67 94 690 0.164 60.14 10.10 Intr - 265127 264985 143 0 2 156 -13 295 0.069 26.27 10.09 Intr - 266280 266223 58 0 1 103 64 -18 0.685 -4.24 10.08 Intr - 267135 266857 279 0 0 103 8 301 0.884 21.27 10.07 Intr - 267372 267255 118 2 1 15 63 95 0.027 0.17 10.06 Intr - 268445 268252 194 2 2 92 105 54 0.014 5.79 10.05 Intr - 271534 271337 198 1 0 44 66 122 0.479 5.15 10.04 Intr - 274340 274237 104 0 2 70 28 86 0.512 0.69 10.03 Intr - 275411 275243 169 2 1 29 100 79 0.218 2.72 10.02 Intr - 280541 280369 173 0 2 126 42 137 0.379 12.56 10.01 Init - 281627 281255 373 2 1 74 56 370 0.875 27.53 10.00 Prom - 287852 287813 40 -4.66 11.04 PlyA - 288629 288624 6 1.05 11.03 Term - 296742 296642 101 1 2 115 36 79 0.368 3.69 11.02 Intr - 299529 299470 60 1 0 42 100 93 0.307 4.61 11.01 Init - 318103 318058 46 1 1 43 94 101 0.190 5.14 11.00 Prom - 323797 323758 40 -6.36 12.00 Prom + 324787 324826 40 -7.66 12.01 Init + 325130 325336 207 1 0 86 33 146 0.732 6.67 12.02 Intr + 325451 325556 106 1 1 63 63 201 0.760 14.89 12.03 Intr + 326187 326248 62 1 2 102 80 2 0.476 -0.75 12.04 Intr + 327074 327403 330 1 0 69 91 424 0.497 36.63 12.05 Intr + 327985 328506 522 0 0 116 34 788 0.839 69.35 12.06 Intr + 331400 331510 111 1 0 74 76 58 0.623 3.78 12.07 Intr + 331561 331690 130 0 1 113 -13 43 0.374 -3.03 12.08 Intr + 332247 332323 77 1 2 111 75 41 0.306 4.33 12.09 Term + 332621 332644 24 1 0 108 49 28 0.375 -0.88 12.10 PlyA + 338112 338117 6 1.05 13.05 PlyA - 350370 350365 6 1.05 13.04 Term - 351631 351431 201 1 0 22 42 321 0.971 18.29 13.03 Intr - 351839 351712 128 0 2 34 -5 288 0.120 14.50 13.02 Intr - 385279 385031 249 2 0 86 44 138 0.048 6.61 13.01 Init - 424114 423961 154 1 1 64 12 133 0.215 3.44 13.00 Prom - 449811 449772 40 -2.86 14.04 PlyA - 462046 462041 6 1.05 14.03 Term - 462319 462161 159 1 0 63 42 150 0.635 5.84 14.02 Intr - 466160 466118 43 1 1 46 100 38 0.105 -0.96 14.01 Intr - 496616 496538 79 0 1 100 76 76 0.173 6.21 Predicted peptide sequence(s): >hox|GENSCAN_predicted_peptide_1|199_aa MKQEVKKVVNPLLEKRPKNFGIGQDIQPKRDLTCFKKWSHYIRLQWQRVILCKWLKVHPE INQFTQALHHQTATQLLQLAHNYRPNKRKTKQEKKQRLLVRAEKKACSKGDISTRRPLVL QAGVNTVTTLVENKKAQLVVNLEDKEALAKLVEAIRTNYNNRYDEICRHWGGNVLGPKSV ARIAKLEKAKAKELATKLG >hox|GENSCAN_predicted_peptide_2|140_aa MSAYAFYVQTCREEHRKKNPEVPVNFAEFSKKCSERWKTMSGKDKSKFDEITKADKMRYD QEMKDYGPAKGAKKKKDPNASKRPLSGFFLFCSELGPKIKSTNPTISIRDMAKKLGEMWN NLNDSEKQPYITKAAKLKEK >hox|GENSCAN_predicted_peptide_3|206_aa MAGIATMEEVKCKIQVLQQQADDAEERAERLQFEEELDHAQERLTTALQKLEEEEKAADE SERDMKEIQLKEAKHIAEEADRKYEEVPRKLVIIEGDLECTEERAELAESRCQETDEQIR LMDQNLKCLSDAEEKYSQKEDIYEEEIKILTDKLKEAETRAEFTERLVAKLEKTTDDLEY KLKCNKEENLCTQRMLYQTLLDLNEM >hox|GENSCAN_predicted_peptide_4|220_aa MKQKDLCFAVRKKKNCGKQCKNKNKQASVWGISAGFRGCCDDQNKGRFDGPEAQEEACSG ERTYQELLVNQNPIVQPLASRRLTRNLYKCIKKAMKQKQLRRGVKEVQKFVNKGEKGIMV LAEDTLPIEMNLGAAAGSKCPTCVIMVKPHKEYQEAYDKCLEEPQQSRSSRGRRCPSDSI ARADLFPSLGQLTPAHPKGKTPSVPQNGKIPVHTLPEPCC >hox|GENSCAN_predicted_peptide_5|255_aa MNSFLEYPILSSGDSGTCSARAYPSDHRITTFQSCAVSANSCGGDDRFLTSGNLGVSYSH SSCGPSYGSQNFSAPYSPYALNQEADEHQSLALATYNNSLSPLHASHQEACRSPASETSS PAQTFDWMKVKRNPPKTGKVGEYGYLGQPNAVRTNFTTKQLTELEKEFHFNKYLTRARRV EIAASLQLNETQVKIWFQNRRMKQKKREKEGLLPISPATPPGNDEKAEESSEKSSSSPCV PSPGSSTSDTLTTSH >hox|GENSCAN_predicted_peptide_6|401_aa MGPGQAIRLEARRILRMAAGAPGGCGSSGAKYYISTPGFKIGLLTDWLEEAKEGTVKGNI FIFRFKDSSRKESNSENADLRPASPSSPPSELEARNCFRLGIRSIGLLFPTTEPGSKHPR CPRIRKVWEAVQAAREHSLYLNQPAPNQSDKDKKKESLEIADGSGGGSRRLRTAYTNTQL LELEKEFHFNKYLCRPRRVEIAALLDLTERQVKVWFQNRRMKHKRQTQCKENQNSEGKCK SLEDSEKVEEDEEEKTLFEQALSVSGALLEREGYTFQQNALSQQQAPNGHNGDSQSFPVS PLTSNEKNLKHFQHQSPTVPNCLSTMGQNCGAGLNNDSPEALEVPSLQDFSVFSTDSCLQ LSDAVSPSLPGSLDSPVDISADSLDFFTDTLTTIDLQHLNY >hox|GENSCAN_predicted_peptide_7|514_aa MLLSPLGLKGKEPVVYPWMKKIHVSAVNPSYNGGEPKRSRTAYTRQQVLELEKEFHFNRY LTRRRRIEIAHTLCLSERQGFCESRERAASGETAPGHRKRRFPERLELETIPGSEFKHLS RGGVKGRNHAQPPAAAISLRGAAIGGGVSRDRGGVPMCALTGVKPLSECAIKIVKQRDAK SDLLRQLGDLRWLPLPGSQRNASNNPTPANAAKSPLLNSPTVAKQIFPWMKESRQNTKQK TSSSSSGESCAGDKSPPGQASSKRARTAYTSAQLVELEKEFHFNRYLCRPRRVEMANLLN LTERQIKIWFQNRRMKYKKDQKGKGMLTSSGGQSPSRSPVPPGAGGYLNSMHSLVNSVPY EPQSPPPFSKPPQGTYGLPPASYPASLPSCAPPPPPQKRYTAAGAGAGGTPDYDPHAHGL QGNGSYGTPHIQGSPVFVGGSYVEPMSNSGPALFGLTHLPHAASGAMDYGGAGPLGSGHH HGPGPGEPHPTYTDLTGHHPSQGRIQEAPKLTHL >hox|GENSCAN_predicted_peptide_8|1123_aa MQEASAPPSPASATLVKFLRCRLDTSLSGMGAPDFSEQPPYIFWAAQAWLEASLGNKESA GACPTLTARASLTPSRGPSAAGLQLSGRPRGCSSRRGRGSRVTVLSWDRPDTARRQRAPA DWRTQEERLQPQNLMTAQRVCTAGGWESAFPLGSLGVASPLRTFTQARPRTNPISFVHDR YEFVFAPRLGITHSLDGEQMQLRSGYGAGAGAFASTVPGLYNVNSPLYQSPFASGYGLGA DAYGNLPCASYDQNIPGLCSDLAKGACDKTDEGALHGAAEANFRIYPWMRSSGPDRKRGR QTYTRYQTLELEKEFHFNRYLTRRRRIEIAHALCLTERQIKIWFQNRRMKWKKEHKDEAQ AHQPPLLEGNVPELAADLWRPREAQKEVARHTVFPASPLGLDFCVFIHGPIWFCWIPQQE ETILDLGRSAAFLTAAAAGRLFIATVLALAAATPWPSPGPAPAAARLLQPGATAVLAPQR FPSKQGRIDWGTGIRLPGVAPVAGLAAAFQECLLGSSEAGYDALRPFPASYGASSLPDKT YTSPCFYQQSNSVLACNRASYEYGASCFYSDKDLSGASPSGSGKQRGPGDYLHFSPEQQY KPDSSSGQGKALHDEGADRKYTSPVYPWMQRMNSCAGAVYGSHGRRGRQTYTRYQTLELE KEFHFNRYLTRRRRIEIANALCLTERQIKIWFQNRRMKWKKENKLINSTQPSGEDSEAKA GEEPGPTRWVQGEPSDLGPGWLEEPDPVMKIFQAGYVSKSNRKLMTRKPYSHERKIDYGY KMHEFTSRGHQAGFTTGQQKHVIRSRTPYLGAYVGGNQVHVPVISIIHHKLCKGAIDAQT TASHKSSTHIKKQMSSYFVNSFCGRYPNGPDYQLHNYGDHSSVSEQFRDSASMHSGRYGY GYNGMDLSVGRSGSGHFGSGERARSYAASASAAPAEPRYSQPATSTHSPQPDPLPCSAVA PSPGSDSHHGGKNSLSNSSGASADAGSTHISSREGVGTASGAEEDAPASSEQASAQSEPS PAPPAQPQIYPWMRKLHISHDNIGGPEGKRARTAYTRYQTLELEKEFHFNRYLTRRRRIE IAHALCLSERQIKIWFQNRRMKWKKDNKLKSMSMAAAGGAFRP >hox|GENSCAN_predicted_peptide_9|272_aa MATTGALGNYYVDSFLLGADAADELSVGRYAPGTLGQPPRQAATLAEHPDFSPCSFQSKA TVFGASWNPVHAAGANAVPAAVYHHHHHHPYVHPQAPVAAAAPDGRYMRSWLEPTPGALS FAGLPSSRPYGIKPEPLSARRGDCPTLDTHTLSLTDYACGSPPVDREKQPSEGAFSENNA ENESGGDKPPIDPNNPAANWLHARSTRKKRCPYTKHQTLELEKEFLFNMYLTRDRRYEVA RLLNLTERQVKIWFQNRRMKMKKINKDRAKDE >hox|GENSCAN_predicted_peptide_10|852_aa MGPHPNAIKSCAHXXXXXXXXXFADKYMDTAGPAAEEFSSRAKEFAFYHQGYAAGPYHHH QPMPGYLDMPVVPGLGGPGESRHEPLGLPMESYQPWALPNGWNGQMYCPKEQAQPPHLWK STLPDVVSHPSDASSYRRGRKKRVPYTKVQLKELEREYATNKFITKDKRRRISATTNLSE RQRRGPKPSTLLQEPPTQNGGSGRCKAGLPLTARGPVGAAAGRLAEVPARNSKGRVCAGQ VLRATAASMVRADRCRRLKKAQDAQLPCCKRVLHNNLGNKASHCPVLTDGLEATPPGHCP PAPSRCLPDHSGCKKKRRRQRRLDPLYSDIPPAKAVERSPVLSTSILPTPLELCKQPDGP PTKMQHTQEKSPRMSAPLKQPKACLNCMVSRVLSSRRPGRWVDRMGIDFHVSLRLQISTS RIRFKEAAAVENHVKLGYCGEPKTPSSRPMTYSYSSNLPQVQPVREVTFREYAIEPATKW HPRGNLAHCYSAEELVHRDCLQAPSAAGVPGDVLAKSSANVYHHPTPAVSSNFYSTLPSP AARPRSGPAPGWGAGGGQRTRKKRCPYTKYQIRELEREFFFSVYINKEKRLQLSRMLNLT DRQNIKEESSYCLYDSADKCPKVSATAAELAPFPRGPPPDGCALGTSSGVPVPGYFRLSQ AYGTAKGYGSGGGGAQQLGAGPFPAQPPGRGFDLPPALASGSADAARKERALDSPPPPTL ACGSGGGSQGDEEAHASSSAAEELSPAPSESSKASPEKDSLGNSKGENAANWLTAKSGRK KRCPYTKHQTLELEKEFLFNMYLTRERRLEISRSVHLTDRQVKIWFQNRRMKLKKMNREN RIRELTANFNFS >hox|GENSCAN_predicted_peptide_11|68_aa MRPLSPRAVSDAREPAVSHFKAKYAKERAFKAAGKATACTFTGEIGSANFRELPPPLVDL RAPDPKPQ >hox|GENSCAN_predicted_peptide_12|522_aa MESRKDMVVFLDGGQLGTLVGKRVSNLSEAVGSPLPEPPEKMVPRGCLSPRAVPPATRER GGGGPEEEPGQPSSSDTESDFYEEIEVSCTPDCATGNAEYQHSKVPLLHPKEKELQGAGA PPFSEAYSPTSMAFVCLCLYTPVLWTRCAGSGSEALVGSPNGGSETPKSNGGSGGGGSQG TLACSASDQMRRYRTAFTREQIARLEKEFYRENYVSRPRRCELAAALNLPETTIKVWFQN RRMKDKRQRLAMTWPHPADPAFYTYMMSHAAAAGGLPYPFPSHLPLPYYSPVGLGAASAA SAAASPFSGSLRPLDTFRVLSQPYPRPELLCAFRHPPLYPGPAHGLGASAGGPCSCLACH SGPANGLAPRAAAASDFTCASTSRSDSFLTFAPSVLSKASSVALDQREEEGKRRLQVYTA ISGRCSYELRAPTSSSLTLFWGTDSEGSSIPASRQGLGARSSAQPRFAQLGKAGHPAARV CTGPRQLLACFSRLVASIIPQLRKHQPPYVRRRISEAKLPAD >hox|GENSCAN_predicted_peptide_13|243_aa MELEVSKFHWRVTPVKEEKWEAGLERKNCQTIGQSGQSLCQLIDSSTVKTARVSLPMVPS LAQKNSSGTEDAGSKPKRALHRGARDPGIHPRAGSTPHLTEGYSLMGTEDPRYKEEFPSL SDPYQSQLSPGFKLKQAAAVAYAAMAKIKARDLHEKEELLKQLDDLKVELSQLHVAKTQK EKLRKFYKGKKYKPLDLWPKKTRAMCCRFNKHKENLKTKKRQRKERLCLLRKYAVKARVA RCQ >hox|GENSCAN_predicted_peptide_14|93_aa XAEEQRKSQQQETLGRKMNWMVQANGGVPDVVAITTEQVKELWLFIGYLHPLKGASSDDI GDLTEGTAIPAPVPGVDTFQNVKGSKVDPMVLC