TBLASTN 2.2.17 [Aug-26-2007]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= sel3_Pknowlesi_prot
         (344 letters)

Database: cds.fa
           5106 sequences; 11,140,562 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

Plasmodium_knowlesi_strain_H|chr14|PKH_142340|Annotation|Plasmod...    566   e-162
Plasmodium_knowlesi_strain_H|PK4.chr09|PKH_093850|Annotation|Pla...     30   0.70
Plasmodium_knowlesi_strain_H|PK4.chr05|PKH_050680|Annotation|Pla...     29   2.0
Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_134460|Annotation|Pla...     28   4.7
Plasmodium_knowlesi_strain_H|PK4.chr12.pseudo|PKH_121780|Annotat...     27   7.9
Plasmodium_knowlesi_strain_H|PK4.chr12.pseudo|PKH_120090|Annotat...     27   8.0
Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_132690|Annotation|Pla...     27   8.3
Plasmodium_knowlesi_strain_H|PK4.chr07|PKH_072690|Annotation|Pla...     27   8.3
Plasmodium_knowlesi_strain_H|PK4.chr12.pseudo|PKH_124940|Annotat...     27   8.4

>Plasmodium_knowlesi_strain_H|chr14|PKH_142340|Annotation|Plasmodium
           _knowlesi_Sanger|(protein coding) hypothetical protein, conserved in Plasmodium species
          Length = 972

Score = 566 bits (1458), Expect = e-162, Method: Composition-based stats.
Identities = 323/344 (93%), Positives = 323/344 (93%)
Frame = +1

Query: 1   MVLNKVYLLTILVLFYVNTLCVEAG*SKKLHIKLPNEDDDYLGKLINISSKITKYAQNNK 60
           MVLNKVYLLTIL                     LPNEDDDYLGKLINISSKITKYAQNNK
Sbjct: 1   MVLNKVYLLTIL---------------------LPNEDDDYLGKLINISSKITKYAQNNK 117

Query: 61  FKIAKILSTSALSAYSLNWVYQTGVTLLKDPHYSLFVPSNNYMNNAIRRIRTNYPVKSYT 120
           FKIAKILSTSALSAYSLNWVYQTGVTLLKDPHYSLFVPSNNYMNNAIRRIRTNYPVKSYT
Sbjct: 118 FKIAKILSTSALSAYSLNWVYQTGVTLLKDPHYSLFVPSNNYMNNAIRRIRTNYPVKSYT 297

Query: 121 FKVNKILERNFHHEYANTEMGQIYVVNNFVNFLNFLPYKWRKKCTYNFCKYKEFENIGNL 180
           FKVNKILERNFHHEYANTEMGQIYVVNNFVNFLNFLPYKWRKKCTYNFCKYKEFENIGNL
Sbjct: 298 FKVNKILERNFHHEYANTEMGQIYVVNNFVNFLNFLPYKWRKKCTYNFCKYKEFENIGNL 477

Query: 181 FNCIRISKHEGPILLFQGKLKKQYWIHLPLKYEIVKNGEEDSLCTLTFTPLHKYYSDYTI 240
           FNCIRISKHEGPILLFQGKLKKQYWIHLPLKYEIVKNGEEDSLCTLTFTPLHKYYSDYTI
Sbjct: 478 FNCIRISKHEGPILLFQGKLKKQYWIHLPLKYEIVKNGEEDSLCTLTFTPLHKYYSDYTI 657

Query: 241 EIKLVKEKENNNVTFITSVKCANKNNGNGNSFYINVIQNIAMFLAYDIFEGINNNIHVVH 300
           EIKLVKEKENNNVTFITSVKCANKNNGNGNSFYINVIQNIAMFLAYDIFEGINNNIHVVH
Sbjct: 658 EIKLVKEKENNNVTFITSVKCANKNNGNGNSFYINVIQNIAMFLAYDIFEGINNNIHVVH 837

Query: 301 RRNANYRKTTFNTSNVTLKRKKKTNFQFVLSPVINPWSFKIRRS 344
           RRNANYRKTTFNTSNVTLKRKKKTNFQFVLSPVINPWSFKIRRS
Sbjct: 838 RRNANYRKTTFNTSNVTLKRKKKTNFQFVLSPVINPWSFKIRRS 969

>Plasmodium_knowlesi_strain_H|PK4.chr09|PKH_093850|Annotation|Plasmodi
           um_knowlesi_Sanger|(protein coding) hypothetical protein, conserved in Plasmodium species
          Length = 7470

Score = 30.4 bits (67), Expect = 0.70, Method: Composition-based stats.
Identities = 14/34 (41%), Positives = 21/34 (61%), Gaps = 1/34 (2%)
Frame = +1

Query: 159  KWRKKCTYNFCKYKEFENIGN-LFNCIRISKHEG 191
            +WRK+ + +FC + E +NI   L+N    SK EG
Sbjct: 3646 RWRKESSSDFCNFHEMKNIERVLYNSRVASKGEG 3747

>Plasmodium_knowlesi_strain_H|PK4.chr05|PKH_050680|Annotation|Plasmo
           dium_knowlesi_Sanger|(protein coding) hypothetical protein, conserved in Plasmodium species
          Length = 3138

Score = 28.9 bits (63), Expect = 2.0, Method: Composition-based stats.
Identities = 15/42 (35%), Positives = 25/42 (59%)
Frame = +1

Query: 212 YEIVKNGEEDSLCTLTFTPLHKYYSDYTIEIKLVKEKENNNV 253
           +E+ KN   +++C L +T    +YS Y  E+  VKE+  NN+
Sbjct: 415 HEMSKN---ENICMLKYTSYESFYSKYDNEMDKVKEQGENNI 531

>Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_134460|Annotation|Plasmodi
           um_knowlesi_Sanger|(protein coding) hypothetical protein, conserved in Plasmodium species
          Length = 1665

Score = 27.7 bits (60), Expect = 4.7, Method: Composition-based stats.
Identities = 15/35 (42%), Positives = 18/35 (51%)
Frame = +3

Query: 159  KWRKKCTYNFCKYKEFENIGNLFNCIRISKHEGPI 193
            KWRKKC  N    + +E+ G   N  R SK E  I
Sbjct: 1023 KWRKKCKRNEGSRR*YESRGQQCNGGRKSKGEASI 1127

>Plasmodium_knowlesi_strain_H|PK4.chr12.pseudo|PKH_121780|Annotation|P
           lasmodium_knowlesi_Sanger|(protein coding) hypothetical protein, conserved in Plasmodium species
          Length = 1926

Score = 26.9 bits (58), Expect = 7.9, Method: Composition-based stats.
Identities = 17/41 (41%), Positives = 24/41 (58%)
Frame = +1

Query: 133  HEYANTEMGQIYVVNNFVNFLNFLPYKWRKKCTYNFCKYKE 173
            H+   TE  +IYV+ N +NF N L  K+R    Y +C+ KE
Sbjct: 1495 HDGKETE-NEIYVLKNKINFNNDLKKKYR---FYFYCRMKE 1605

>Plasmodium_knowlesi_strain_H|PK4.chr12.pseudo|PKH_120090|Annotation
           |Plasmodium_knowlesi_Sanger|(protein coding) hypothetical protein, conserved in Plasmodium species
          Length = 1842

Score = 26.9 bits (58), Expect = 8.0, Method: Composition-based stats.
Identities = 11/35 (31%), Positives = 24/35 (68%)
Frame = +1

Query: 221 DSLCTLTFTPLHKYYSDYTIEIKLVKEKENNNVTF 255
           D + T+TF  L+K Y+D+  ++K ++ +EN+ + +
Sbjct: 124 DIILTITFFALYKLYNDFK-KMKFLQPRENHQIIY 225

>Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_132690|Annotation|Plasmo
           dium_knowlesi_Sanger|(protein coding) hypothetical protein, conserved in Plasmodium species
          Length = 918

Score = 26.6 bits (57), Expect = 8.3, Method: Composition-based stats.
Identities = 14/32 (43%), Positives = 17/32 (53%), Gaps = 1/32 (3%)
Frame = +1

Query: 151 NFLNFLPYKWRKK-CTYNFCKYKEFENIGNLF 181
           NF  FL Y ++KK C  NFCK      + N F
Sbjct: 517 NFYTFLSYDYKKKSCIQNFCKKYSQVEVSNQF 612

>Plasmodium_knowlesi_strain_H|PK4.chr07|PKH_072690|Annotation|Plasmo
           dium_knowlesi_Sanger|(protein coding) SICAvar antigen (fragment)
          Length = 970

Score = 26.6 bits (57), Expect = 8.3, Method: Composition-based stats.
Identities = 18/74 (24%), Positives = 35/74 (47%), Gaps = 1/74 (1%)
Frame = +1

Query: 251 NNVTFITSVKCANKNNGNGNSFYI-NVIQNIAMFLAYDIFEGINNNIHVVHRRNANYRKT 309
           N  + +  +K  N N+G  N+F + +V+ N  + L  +  +GI   + V    N     +
Sbjct: 742 NKTSVVEFMKLDNSNSGQSNTFSLADVLMNSEIQLPENTIQGILKEM-VADSANGTVEPS 918

Query: 310 TFNTSNVTLKRKKK 323
              T+  TL+++ K
Sbjct: 919 KMKTAVQTLEKESK 960

>Plasmodium_knowlesi_strain_H|PK4.chr12.pseudo|PKH_124940|Annotation
           |Plasmodium_knowlesi_Sanger|(protein coding) hypothetical protein, conserved in Plasmodium species
          Length = 2133

Score = 26.6 bits (57), Expect = 8.4, Method: Composition-based stats.
Identities = 13/31 (41%), Positives = 16/31 (51%)
Frame = -1

Query: 141 GQIYVVNNFVNFLNFLPYKWRKKCTYNFCKY 171
           G IY  N+F  FL+F  +K      YN C Y
Sbjct: 327 GSIYQANSFYFFLHFCAFKQITDILYNVCAY 235

  Database: cds.fa
    Posted date: Mar 10, 2008 1:57 PM
  Number of letters in database: 11,140,562
  Number of sequences in database: 5106

Lambda      K        H
   0.323    0.138    0.418

Gapped
Lambda      K        H
   0.267   0.0410    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 5106
Number of Hits to DB: 5,393,821
Number of extensions: 104004
Number of successful extensions: 805
Number of sequences better than 10.0: 55
Number of HSP's gapped: 803
Number of HSP's successfully gapped: 57
Length of query: 344
Length of database: 3,713,520
Length adjustment: 93
Effective length of query: 251
Effective length of database: 3,238,662
Effective search space: 812904162
Effective search space used: 812904162
Neighboring words threshold: 13
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 40 (20.0 bits)

# TBLASTN 2.2.17 [Aug-26-2007]
# Query: sel3_Pknowlesi_prot
# Database: cds.fa
# Fields: Query id, Subject id, % identity, alignment length, mismatches, gap openings, q. start, q. end, s. start, s. end, e-value, bit score
sel3_Pknowlesi_prot Plasmodium_knowlesi_strain_H|chr14|PKH_142340|Annotation|Plasmodium_knowlesi_Sanger|(protein 93.90 344 21 1 1 344 1 969 4e-162 566
sel3_Pknowlesi_prot Plasmodium_knowlesi_strain_H|PK4.chr09|PKH_093850|Annotation|Plasmodium_knowlesi_Sanger|(protein 41.18 34 19 1 159 191 3646 3747 0.70 30.4
sel3_Pknowlesi_prot Plasmodium_knowlesi_strain_H|PK4.chr05|PKH_050680|Annotation|Plasmodium_knowlesi_Sanger|(protein 35.71 42 27 1 212 253 415 531 2.0 28.9
sel3_Pknowlesi_prot Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_134460|Annotation|Plasmodium_knowlesi_Sanger|(protein 42.86 35 20 0 159 193 1023 1127 4.7 27.7
sel3_Pknowlesi_prot Plasmodium_knowlesi_strain_H|PK4.chr12.pseudo|PKH_121780|Annotation|Plasmodium_knowlesi_Sanger|(protein 41.46 41 24 2 133 173 1495 1605 7.9 26.9
sel3_Pknowlesi_prot Plasmodium_knowlesi_strain_H|PK4.chr12.pseudo|PKH_120090|Annotation|Plasmodium_knowlesi_Sanger|(protein 31.43 35 24 1 221 255 124 225 8.0 26.9
sel3_Pknowlesi_prot Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_132690|Annotation|Plasmodium_knowlesi_Sanger|(protein 43.75 32 17 1 151 181 517 612 8.3 26.6
sel3_Pknowlesi_prot Plasmodium_knowlesi_strain_H|PK4.chr07|PKH_072690|Annotation|Plasmodium_knowlesi_Sanger|(protein 24.32 74 55 2 251 323 742 960 8.3 26.6
sel3_Pknowlesi_prot Plasmodium_knowlesi_strain_H|PK4.chr12.pseudo|PKH_124940|Annotation|Plasmodium_knowlesi_Sanger|(protein 41.94 31 18 0 141 171 327 235 8.4 26.6
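
The twelve columns named in the "# Fields:" comment can be read back with a short script. The following is a minimal sketch, not part of the BLAST output itself: the names `Hit`, `parse_tabular`, and `SAMPLE` are hypothetical, and it assumes the columns are separated by whitespace and that no field contains a space, which holds for the rows in this report. Three rows are copied from the table for illustration.

```python
from typing import List, NamedTuple


class Hit(NamedTuple):
    """One row of the tabular report, in the order given by '# Fields:'."""
    query_id: str
    subject_id: str
    pct_identity: float
    aln_length: int
    mismatches: int
    gap_openings: int
    q_start: int
    q_end: int
    s_start: int
    s_end: int
    evalue: float
    bit_score: float

    @property
    def minus_strand(self) -> bool:
        # Subject coordinates run backwards when the hit is in a negative frame.
        return self.s_start > self.s_end


def parse_tabular(text: str) -> List[Hit]:
    """Parse tabular rows, skipping blank lines and '#' comment lines."""
    hits = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        f = line.split()
        if len(f) != 12:
            continue  # not a 12-column data row
        hits.append(Hit(f[0], f[1], float(f[2]),
                        *(int(x) for x in f[3:10]),
                        float(f[10]), float(f[11])))
    return hits


# Three rows copied from the report (first, second, and last hit).
SAMPLE = """\
# Fields: Query id, Subject id, % identity, alignment length, mismatches, gap openings, q. start, q. end, s. start, s. end, e-value, bit score
sel3_Pknowlesi_prot Plasmodium_knowlesi_strain_H|chr14|PKH_142340|Annotation|Plasmodium_knowlesi_Sanger|(protein 93.90 344 21 1 1 344 1 969 4e-162 566
sel3_Pknowlesi_prot Plasmodium_knowlesi_strain_H|PK4.chr09|PKH_093850|Annotation|Plasmodium_knowlesi_Sanger|(protein 41.18 34 19 1 159 191 3646 3747 0.70 30.4
sel3_Pknowlesi_prot Plasmodium_knowlesi_strain_H|PK4.chr12.pseudo|PKH_124940|Annotation|Plasmodium_knowlesi_Sanger|(protein 41.94 31 18 0 141 171 327 235 8.4 26.6
"""

if __name__ == "__main__":
    for hit in parse_tabular(SAMPLE):
        strand = "-" if hit.minus_strand else "+"
        print(hit.subject_id.split("|")[2], hit.evalue, hit.bit_score, strand)
```

Filtering on `evalue` (for example, keeping hits below an arbitrary cutoff such as 1e-5) would retain only the PKH_142340 row from this sample; the cutoff is an illustrative choice, not one taken from the report.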