TBLASTN 2.2.17 [Aug-26-2007] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= slap_pknowlesi_definitiva (424 letters) Database: cds.fa 5106 sequences; 11,140,562 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_130360|Annotation|Pla... 857 0.0 Plasmodium_knowlesi_strain_H|chr14|PKH_144500|Annotation|Plasmod... 33 0.091 Plasmodium_knowlesi_strain_H|chr04|PKH_041270|Annotation|Plasmod... 31 0.72 Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_131160|Annotation|Pla... 30 0.89 Plasmodium_knowlesi_strain_H|PK4.chr01|PKH_010570|Annotation|Pla... 29 1.8 Plasmodium_knowlesi_strain_H|chr14|PKH_141340|Annotation|Plasmod... 29 2.7 Plasmodium_knowlesi_strain_H|PK4.chr03|PKH_030340|Annotation|Pla... 28 5.3 Plasmodium_knowlesi_strain_H|chr14|PKH_143980|Annotation|Plasmod... 27 6.3 >Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_130360|Annotation|Plasmo dium_knowlesi_Sanger|(protein coding) PLP-dependent aminotransferase, putative Length = 1728 Score = 857 bits (2213), Expect = 0.0, Method: Composition-based stats. Identities = 424/424 (100%), Positives = 424/424 (100%) Frame = +1 Query: 1 MCLSTCLLYTKKEREKSEYVIVSRIDHKTCYKCIDFCALKYLVVDMVYKDEELFTNLNEI 60 MCLSTCLLYTKKEREKSEYVIVSRIDHKTCYKCIDFCALKYLVVDMVYKDEELFTNLNEI Sbjct: 454 MCLSTCLLYTKKEREKSEYVIVSRIDHKTCYKCIDFCALKYLVVDMVYKDEELFTNLNEI 633 Query: 61 ERLIQTYGEKICCVMSVTSSYAPRNSDDIVKIAHICKRYNVPHIINNAFGLQCNYLCKEI 120 ERLIQTYGEKICCVMSVTSSYAPRNSDDIVKIAHICKRYNVPHIINNAFGLQCNYLCKEI Sbjct: 634 ERLIQTYGEKICCVMSVTSSYAPRNSDDIVKIAHICKRYNVPHIINNAFGLQCNYLCKEI 813 Query: 121 QKCFESKGRVDFVVQSCDKNFLVPVNGGIVFSSDKKKMKELKKHYPGRTPVHAYLDLFIT 180 QKCFESKGRVDFVVQSCDKNFLVPVNGGIVFSSDKKKMKELKKHYPGRTPVHAYLDLFIT Sbjct: 814 QKCFESKGRVDFVVQSCDKNFLVPVNGGIVFSSDKKKMKELKKHYPGRTPVHAYLDLFIT 993 Query: 181 LLELGKKKILNLRKEREENFSWLQEKVSTLCSKYNLTLIKASKNKISMAINLNELYNICH 240 LLELGKKKILNLRKEREENFSWLQEKVSTLCSKYNLTLIKASKNKISMAINLNELYNICH Sbjct: 994 LLELGKKKILNLRKEREENFSWLQEKVSTLCSKYNLTLIKASKNKISMAINLNELYNICH 1173 Query: 241 ILNPKSITLLGSLLFYRNVTGHRVICSPLLIRNGGVAQRRDQQIQTNVEPAFPIHHGAVD 300 ILNPKSITLLGSLLFYRNVTGHRVICSPLLIRNGGVAQRRDQQIQTNVEPAFPIHHGAVD Sbjct: 1174ILNPKSITLLGSLLFYRNVTGHRVICSPLLIRNGGVAQRRDQQIQTNVEPAFPIHHGAVD 1353 Query: 301 PNEVPTIEQGAHTDGEFFPNRGKKSEDAHQIRSSDNCNEECVGKEGSQINLVKNENMNDA 360 PNEVPTIEQGAHTDGEFFPNRGKKSEDAHQIRSSDNCNEECVGKEGSQINLVKNENMNDA Sbjct: 1354PNEVPTIEQGAHTDGEFFPNRGKKSEDAHQIRSSDNCNEECVGKEGSQINLVKNENMNDA 1533 Query: 361 RIHGKGLTIGNHTFEHFGCSYDSYPFSYIAFSCVIGIEREELQSFVAKLDDAIGCFIRRF 420 RIHGKGLTIGNHTFEHFGCSYDSYPFSYIAFSCVIGIEREELQSFVAKLDDAIGCFIRRF Sbjct: 1534RIHGKGLTIGNHTFEHFGCSYDSYPFSYIAFSCVIGIEREELQSFVAKLDDAIGCFIRRF 1713 Query: 421 GRGA 424 GRGA Sbjct: 1714GRGA 1725 >Plasmodium_knowlesi_strain_H|chr14|PKH_144500|Annotation|Plasmodium_k nowlesi_Sanger|(protein coding) hypothetical protein, conserved in Plasmodium species Length = 2286 Score = 33.5 bits (75), Expect = 0.091, Method: Composition-based stats. Identities = 29/107 (27%), Positives = 51/107 (47%), Gaps = 4/107 (3%) Frame = +1 Query: 61 ERLIQTYGEKICCVMSVTSSYAPRNSDDIVKIAHICKRYNVPHIINNAFGLQCNYLCKEI 120 E + Q Y EK C ++ R + IV +A KR + I NNA+ NY+ +I Sbjct: 1744 EMVEQCYPEKTCAQKQPQKRWSEREKNKIVDLA---KRGD---IFNNAYVNLLNYIESDI 1905 Query: 121 QKCFESKG----RVDFVVQSCDKNFLVPVNGGIVFSSDKKKMKELKK 163 + C + +++S D +F++P+ + S+DK + + KK Sbjct: 1906 KNCSPDEALNTLLQLHLLRSVDNHFVLPLISKLCNSTDKLRNEHKKK 2046 >Plasmodium_knowlesi_strain_H|chr04|PKH_041270|Annotation|Plasmodium _knowlesi_Sanger|(protein coding) cysteine protease, putative Length = 2118 Score = 30.8 bits (68), Expect = 0.72, Method: Composition-based stats. Identities = 45/174 (25%), Positives = 71/174 (40%), Gaps = 22/174 (12%) Frame = +1 Query: 92 IAHICKRYNVPHIINNAFG-LQCNYLCKEIQKCFESKGRVDFVVQSCDKNF--LVP---- 144 +AH+ KR +II N FG L C I+ CF S C++ + + P Sbjct: 43 VAHLVKRVK-SNIIINCFGTLHCKICHIAIRNCFLSGTSNLTKCIECEEKYYNIQPCTHH 219 Query: 145 VNGGIVFSSDKKKMKELKKH-YPGRTPVHAYLDLFITLLELGKKKILNLRKEREENFSWL 203 + S DK E+K H Y V DL +++L ++ N E E+ Sbjct: 220 TENFLQISRDKGAFVEMKNHDYLTEAKVD---DLISEIVKLSLERQKNAVPETEQTKQDF 390 Query: 204 QEKVSTLCSKYNLT--------LIKASKNKISMAIN------LNELYNICHILN 243 Q+K+ LC N + +A+ ++ IN +NE N+ HI+N Sbjct: 391 QKKIMQLCLYSNFSDHYENAKKHTQANAEEVEKHINKIVIMYMNESNNMEHIIN 552 >Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_131160|Annotation|Plasmo dium_knowlesi_Sanger|(protein coding) acetyl CoA synthetase, putative Length = 2988 Score = 30.4 bits (67), Expect = 0.89, Method: Composition-based stats. Identities = 20/92 (21%), Positives = 38/92 (41%), Gaps = 2/92 (2%) Frame = +1 Query: 215 NLTLIKASKNKISMAI--NLNELYNICHILNPKSITLLGSLLFYRNVTGHRVICSPLLIR 272 N + AS K+ + + N EL+N+C ILN + N G V+ + R Sbjct: 883 NYEIFYASMKKLGVLVVDNFEELFNMCKILNLSKYPETNEVCVVTNAGGPGVLLVDNITR 1062 Query: 273 NGGVAQRRDQQIQTNVEPAFPIHHGAVDPNEV 304 N G + + ++ ++ P +P ++ Sbjct: 1063NDGNLSKLNDNLKKKLDAFLPPSWSKANPVDI 1158 >Plasmodium_knowlesi_strain_H|PK4.chr01|PKH_010570|Annotation|Plasmo dium_knowlesi_Sanger|(protein coding) recticulocyte binding protein, putative Length = 966 Score = 29.3 bits (64), Expect = 1.8, Method: Composition-based stats. Identities = 24/69 (34%), Positives = 35/69 (50%), Gaps = 1/69 (1%) Frame = +2 Query: 212 SKYNLTLIKASKNKISMAINLNELYNICHILNPKSITLLGSLLFYRNVTGHRVIC-SPLL 270 S + +T + +++NK AIN+ Y I P+SI L L + + G R +C PLL Sbjct: 686 SGFTITFLWSARNK---AINIVT*YRIL----PRSIAPLRIKL*MKRLMGSRCLCMPPLL 844 Query: 271 IRNGGVAQR 279 GV QR Sbjct: 845 PVRWGVFQR 871 >Plasmodium_knowlesi_strain_H|chr14|PKH_141340|Annotation|Plasmodium _knowlesi_Sanger|(protein coding) hypothetical protein, conserved in Apicomplexan species Length = 1431 Score = 28.9 bits (63), Expect = 2.7, Method: Composition-based stats. Identities = 19/46 (41%), Positives = 27/46 (58%) Frame = +1 Query: 164 HYPGRTPVHAYLDLFITLLELGKKKILNLRKEREENFSWLQEKVST 209 H GR V ++FIT+ L KK+I L + REE S Q+K+S+ Sbjct: 466 HRGGRELVK---EIFITIKTLIKKRIDVLNEHREEYLSGSQDKLSS 594 >Plasmodium_knowlesi_strain_H|PK4.chr03|PKH_030340|Annotation|Plasmodi um_knowlesi_Sanger|(protein coding) RNA helicase, putative Length = 3489 Score = 27.7 bits (60), Expect = 5.3, Method: Composition-based stats. Identities = 12/24 (50%), Positives = 18/24 (75%) Frame = +1 Query: 308 EQGAHTDGEFFPNRGKKSEDAHQI 331 E+ AHT GEF P++G+K DA+ + Sbjct: 2929 EKDAHT-GEFSPHQGRKDHDAYNL 2997 >Plasmodium_knowlesi_strain_H|chr14|PKH_143980|Annotation|Plasmodium_k nowlesi_Sanger|(protein coding) dna gyrase a-subunit, putative Length = 3522 Score = 27.3 bits (59), Expect = 6.3, Method: Composition-based stats. Identities = 12/46 (26%), Positives = 28/46 (60%) Frame = +1 Query: 19 YVIVSRIDHKTCYKCIDFCALKYLVVDMVYKDEELFTNLNEIERLI 64 +++ +I K +DFC K ++ + ++ +L +N+++I+RLI Sbjct: 1852 HILSMKIQKLVNIKNVDFCTHKEQILARMEQNRDLISNVDKIKRLI 1989 Database: cds.fa Posted date: Mar 10, 2008 1:57 PM Number of letters in database: 11,140,562 Number of sequences in database: 5106 Lambda K H 0.322 0.138 0.419 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 5106 Number of Hits to DB: 6,559,429 Number of extensions: 118833 Number of successful extensions: 809 Number of sequences better than 10.0: 28 Number of HSP's gapped: 805 Number of HSP's successfully gapped: 31 Length of query: 424 Length of database: 3,713,520 Length adjustment: 95 Effective length of query: 329 Effective length of database: 3,228,450 Effective search space: 1062160050 Effective search space used: 1062160050 Neighboring words threshold: 13 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.9 bits) S2: 40 (20.0 bits) # TBLASTN 2.2.17 [Aug-26-2007] # Query: slap_pknowlesi_definitiva # Database: cds.fa # Fields: Query id, Subject id, % identity, alignment length, mismatches, gap openings, q. start, q. end, s. start, s. end, e-value, bit score slap_pknowlesi_definitiva Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_130360|Annotation|Plasmodium_knowlesi_Sanger|(protein 100.00 424 0 0 1 424 454 1725 0.0 857 slap_pknowlesi_definitiva Plasmodium_knowlesi_strain_H|chr14|PKH_144500|Annotation|Plasmodium_knowlesi_Sanger|(protein 27.10 107 74 3 61 163 1744 2046 0.091 33.5 slap_pknowlesi_definitiva Plasmodium_knowlesi_strain_H|chr04|PKH_041270|Annotation|Plasmodium_knowlesi_Sanger|(protein 25.86 174 107 8 92 243 43 552 0.72 30.8 slap_pknowlesi_definitiva Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_131160|Annotation|Plasmodium_knowlesi_Sanger|(protein 21.74 92 70 1 215 304 883 1158 0.89 30.4 slap_pknowlesi_definitiva Plasmodium_knowlesi_strain_H|PK4.chr01|PKH_010570|Annotation|Plasmodium_knowlesi_Sanger|(protein 34.78 69 44 3 212 279 686 871 1.8 29.3 slap_pknowlesi_definitiva Plasmodium_knowlesi_strain_H|chr14|PKH_141340|Annotation|Plasmodium_knowlesi_Sanger|(protein 41.30 46 27 1 164 209 466 594 2.7 28.9 slap_pknowlesi_definitiva Plasmodium_knowlesi_strain_H|PK4.chr03|PKH_030340|Annotation|Plasmodium_knowlesi_Sanger|(protein 50.00 24 12 1 308 331 2929 2997 5.3 27.7 slap_pknowlesi_definitiva Plasmodium_knowlesi_strain_H|chr14|PKH_143980|Annotation|Plasmodium_knowlesi_Sanger|(protein 26.09 46 34 0 19 64 1852 1989 6.3 27.3