TBLASTN 2.2.17 [Aug-26-2007]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= SPS1_nostra
         (419 letters)

Database: cds.fa
           5106 sequences; 11,140,562 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

Plasmodium_knowlesi_strain_H|PK4.chr07|PKH_070780|Annotation|Pla...   432   e-122
Plasmodium_knowlesi_strain_H|PK4.chr03|PKH_030600|Annotation|Pla...    32   0.40
Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_133400|Annotation|Pla...    29   2.4

>Plasmodium_knowlesi_strain_H|PK4.chr07|PKH_070780|Annotation|Plasmodi
            um_knowlesi_Sanger|(protein coding) selenophosphate synthase, putative
          Length = 3450

 Score = 432 bits (1110), Expect = e-122, Method: Composition-based stats.
 Identities = 212/212 (100%), Positives = 212/212 (100%)
 Frame = +1

Query: 13   LLVSKETAYRVYQEEGENGTTITIIVQDGSVHKGTPSHSDTKREKVNSYIKESIEKIINR 72
            LLVSKETAYRVYQEEGENGTTITIIVQDGSVHKGTPSHSDTKREKVNSYIKESIEKIINR
Sbjct: 1810 LLVSKETAYRVYQEEGENGTTITIIVQDGSVHKGTPSHSDTKREKVNSYIKESIEKIINR 1989

Query: 73   NTCGGCGSKVPSNVLSNSLKGLSVYNSPNVFLGIEACDDCCIFVHSKSKRGEESPALVQT 132
            NTCGGCGSKVPSNVLSNSLKGLSVYNSPNVFLGIEACDDCCIFVHSKSKRGEESPALVQT
Sbjct: 1990 NTCGGCGSKVPSNVLSNSLKGLSVYNSPNVFLGIEACDDCCIFVHSKSKRGEESPALVQT 2169

Query: 133  IDFFKSFIDDEYILGEIIAIHCLSDVYSMGGTGICALCVLIVKDNIEKKLQQRLQNILTG 192
            IDFFKSFIDDEYILGEIIAIHCLSDVYSMGGTGICALCVLIVKDNIEKKLQQRLQNILTG
Sbjct: 2170 IDFFKSFIDDEYILGEIIAIHCLSDVYSMGGTGICALCVLIVKDNIEKKLQQRLQNILTG 2349

Query: 193  CCQKLKEEKCVLSGGHTCAGNENYVGLAVTGK 224
            CCQKLKEEKCVLSGGHTCAGNENYVGLAVTGK
Sbjct: 2350 CCQKLKEEKCVLSGGHTCAGNENYVGLAVTGK 2445


 Score = 267 bits (683), Expect = 3e-72, Method: Composition-based stats.
 Identities = 130/138 (94%), Positives = 132/138 (95%)
 Frame = +1

Query: 224  KENYLFLPKGSGSVKAGDIIITTKMFGFGFIMAAHICKKAKARWIYICLDEMLLSNRKSG 283
            KENYLFLPKGSGSVKAGDIIITTKMFGFGFIMAAHICKKAKARWIYICLDEMLLSNRKSG
Sbjct: 2662 KENYLFLPKGSGSVKAGDIIITTKMFGFGFIMAAHICKKAKARWIYICLDEMLLSNRKSG 2841

Query: 284  LYLLQNNNAKACTDVTGFGILGHLNEMIKCSRREIYFASHMKKKSLHTTSQRKILEGRNK 343
            LYLLQNNNAKACTDVTGFGILGHLNEMIKCSRREIYFASHMKKKSLHTTSQRKILEGRNK
Sbjct: 2842 LYLLQNNNAKACTDVTGFGILGHLNEMIKCSRREIYFASHMKKKSLHTTSQRKILEGRNK 3021

Query: 344  RMDDGENDQTSGGLMAIV 361
            RMDDGENDQ     M ++
Sbjct: 3022 RMDDGENDQVKEPPMNLI 3075


 Score = 140 bits (353), Expect = 5e-34, Method: Composition-based stats.
 Identities = 68/68 (100%), Positives = 68/68 (100%)
 Frame = +1

Query: 352  QTSGGLMAIVERERAHQILADLKNMGYSNCSAVGEIINVQDYKFKGVPINQVSLDDYLDT 411
            QTSGGLMAIVERERAHQILADLKNMGYSNCSAVGEIINVQDYKFKGVPINQVSLDDYLDT
Sbjct: 3244 QTSGGLMAIVERERAHQILADLKNMGYSNCSAVGEIINVQDYKFKGVPINQVSLDDYLDT 3423

Query: 412  TNSVYIEC 419
            TNSVYIEC
Sbjct: 3424 TNSVYIEC 3447


>Plasmodium_knowlesi_strain_H|PK4.chr03|PKH_030600|Annotation|Plasmodi
            um_knowlesi_Sanger|(protein coding) Sporozoite protein with MAC/Perforin domain
          Length = 2535

 Score = 31.6 bits (70), Expect = 0.40, Method: Composition-based stats.
 Identities = 17/56 (30%), Positives = 29/56 (51%)
 Frame = +1

Query: 84   SNVLSNSLKGLSVYNSPNVFLGIEACDDCCIFVHSKSKRGEESPALVQTIDFFKSF 139
            +    N++ GL     P  F+G+EA  +C   V+ + K  EE  ++   I FFK++
Sbjct: 1159 TTAFKNAVDGL-----PPHFIGLEAESECASDVYEQKKTSEECESVHAWITFFKTY 1311


>Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_133400|Annotation|Plasmo
            dium_knowlesi_Sanger|(protein coding) hypothetical protein, conserved in Plasmodium species
          Length = 1908

 Score = 28.9 bits (63), Expect = 2.4, Method: Composition-based stats.
 Identities = 17/58 (29%), Positives = 27/58 (46%)
 Frame = +1

Query: 191 TGCCQKLKEEKCVLSGGHTCAGNENYVGLAVTGKENYLFLPKGSGSVKAGDIIITTKM 248
           T C +  +EEK  ++ G     NEN   +    K+  +     S   +A DI++TT M
Sbjct: 607 TSCVKFAEEEKRNMNNGRWFGNNENQTNIPRRCKDQVMLPYSSSKDNEAADILLTTPM 780


  Database: cds.fa
    Posted date:  Mar 7, 2008  5:38 PM
  Number of letters in database: 11,140,562
  Number of sequences in database:  5106

Lambda     K      H
   0.319    0.136    0.400

Gapped
Lambda     K      H
   0.267   0.0410    0.140


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 5106
Number of Hits to DB: 5,449,989
Number of extensions: 88475
Number of successful extensions: 516
Number of sequences better than 10.0: 6
Number of HSP's gapped: 516
Number of HSP's successfully gapped: 8
Length of query: 419
Length of database: 3,713,520
Length adjustment: 95
Effective length of query: 324
Effective length of database: 3,228,450
Effective search space: 1046017800
Effective search space used: 1046017800
Neighboring words threshold: 13
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 40 (20.0 bits)
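
Note (not part of the TBLASTN output): the trailer statistics above can be cross-checked with a little arithmetic. The Python sketch below is illustrative only. It assumes that the translated database length is the nucleotide letter count divided by three (integer division), that the length adjustment is subtracted once from the query and once per database sequence, and that bit scores follow the standard Karlin-Altschul conversion S' = (lambda * S - ln K) / ln 2 using the gapped Lambda and K printed in the report. The function names are hypothetical. Under these assumptions the sketch reproduces the reported effective lengths and search space exactly, and the computed bit scores agree with the printed ones to within about one bit.

```python
import math

# Values copied from the report trailer.
QUERY_LEN = 419            # Length of query (amino acids)
DB_LETTERS_NT = 11_140_562 # Number of letters in database (nucleotides)
NUM_SEQS = 5106            # Number of sequences in database
LENGTH_ADJUSTMENT = 95     # Length adjustment
GAPPED_LAMBDA = 0.267      # Gapped Lambda (as printed, rounded)
GAPPED_K = 0.0410          # Gapped K (as printed, rounded)


def effective_search_space(query_len, db_letters_nt, num_seqs, adjustment):
    """Return (effective query length, effective db length, search space).

    Assumption: the database length in amino acids is the nucleotide
    count divided by three, and the adjustment is applied per sequence.
    """
    db_len_aa = db_letters_nt // 3
    eff_query = query_len - adjustment
    eff_db = db_len_aa - num_seqs * adjustment
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space(
        QUERY_LEN, DB_LETTERS_NT, NUM_SEQS, LENGTH_ADJUSTMENT
    )
    print("Effective length of query:   ", eff_q)
    print("Effective length of database:", eff_db)
    print("Effective search space:      ", space)
    # Raw scores taken from the "Score = ... bits (raw)" lines in the report.
    for raw in (1110, 683, 353, 70, 63):
        print(f"raw score {raw:>4} -> {bit_score(raw):.1f} bits")
```

Because Lambda and K are printed to only three significant figures, small differences from the reported bit scores are expected.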