TBLASTN 2.2.17 [Aug-26-2007] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= Sel2_prot (231 letters) Database: cds.fasta 5106 sequences; 11,140,562 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value Plasmodium_knowlesi_strain_H|PK4.chr07|PKH_072950|Annotation|Pla... 287 2e-78 Plasmodium_knowlesi_strain_H|PK4.chr11.pseudo.embl|PKH_111780|An... 28 1.5 Plasmodium_knowlesi_strain_H|PK4.chr11.pseudo.embl|PKH_114380|An... 28 1.5 Plasmodium_knowlesi_strain_H|PK4.chr07|PKH_070150|Annotation|Pla... 27 3.5 Plasmodium_knowlesi_strain_H|PK4.chr11.pseudo.embl|PKH_111110|An... 26 7.3 >Plasmodium_knowlesi_strain_H|PK4.chr07|PKH_072950|Annotation|Plasmo dium_knowlesi_Sanger|(protein coding) hypothetical protein, conserved in Plasmodium species Length = 564 Score = 287 bits (734), Expect = 2e-78, Method: Composition-based stats. Identities = 187/231 (80%), Positives = 187/231 (80%) Frame = +1 Query: 1 MYYLKLFLKLLLLLFLSYDFIVLKNGTREVVHGDRNFLKESYEFLFVQKDTFPSYINRDN 60 MYYLKLFLKLLLLLFLSYDFIVLKNGTREVVHGDRNFLKESYEFLF Sbjct: 1 MYYLKLFLKLLLLLFLSYDFIVLKNGTREVVHGDRNFLKESYEFLF-------------- 138 Query: 61 QISIYFCKS*QSHYVLSRIKKYFDLLNHGEDTEIYFEEDSYQADNSSNGEEKYVIFLRNI 120 DTEIYFEEDSYQADNSSNGEEKYVIFLRNI Sbjct: 139 ------------------------------DTEIYFEEDSYQADNSSNGEEKYVIFLRNI 228 Query: 121 KAALIYRLICRFVLFIYVLIAASLIFPHYLKPYIPPILLNNRRFNEVFENMQQKKLMVLG 180 KAALIYRLICRFVLFIYVLIAASLIFPHYLKPYIPPILLNNRRFNEVFENMQQKKLMVLG Sbjct: 229 KAALIYRLICRFVLFIYVLIAASLIFPHYLKPYIPPILLNNRRFNEVFENMQQKKLMVLG 408 Query: 181 IMFFSYNVIYGMLCNTSVIHVYQNRNLIYDDYYSDHLFVQRLRENMTHVPG 231 IMFFSYNVIYGMLCNTSVIHVYQNRNLIYDDYYSDHLFVQRLRENMTHVPG Sbjct: 409 IMFFSYNVIYGMLCNTSVIHVYQNRNLIYDDYYSDHLFVQRLRENMTHVPG 561 >Plasmodium_knowlesi_strain_H|PK4.chr11.pseudo.embl|PKH_111780|Annotat ion|Plasmodium_knowlesi_Sanger|(protein coding) hypothetical protein, conserved in Plasmodium species Length = 5463 Score = 28.5 bits (62), Expect = 1.5, Method: Composition-based stats. Identities = 20/72 (27%), Positives = 38/72 (52%), Gaps = 3/72 (4%) Frame = +1 Query: 64 IYFCKS*QSHYVLSRIKKYFDLLNHGEDTEIYFEEDSYQADNSSNGEEKYVI---FLRNI 120 IY +S + + +K YF+ + +IYFEE+++Q N +EK++ F+ N+ Sbjct: 4798 IYGGRSPRDESFENAMKAYFNAKAIYDKWKIYFEENNFQG----NAQEKFLFNLSFIYNV 4965 Query: 121 KAALIYRLICRF 132 L++ IC + Sbjct: 4966 YNYLVWIYICTY 5001 >Plasmodium_knowlesi_strain_H|PK4.chr11.pseudo.embl|PKH_114380|Annot ation|Plasmodium_knowlesi_Sanger|(protein coding) SICAvar antigen (fragment) Length = 912 Score = 28.5 bits (62), Expect = 1.5, Method: Composition-based stats. Identities = 18/39 (46%), Positives = 22/39 (56%), Gaps = 2/39 (5%) Frame = -3 Query: 122 AALIYRLICRF--VLFIYVLIAASLIFPHYLKPYIPPIL 158 A IYR ICR VL+IY ++ S PY+PPIL Sbjct: 589 AEWIYRTICRCL*VLYIYPILHPS--------PYVPPIL 497 >Plasmodium_knowlesi_strain_H|PK4.chr07|PKH_070150|Annotation|Plasmodi um_knowlesi_Sanger|(protein coding) hypothetical protein, conserved in Plasmodium species Length = 6954 Score = 26.9 bits (58), Expect = 3.5, Method: Composition-based stats. Identities = 11/41 (26%), Positives = 21/41 (51%), Gaps = 2/41 (4%) Frame = -2 Query: 125 IYRLICRFVLFIYVLIAASLIFPHYLKP--YIPPILLNNRR 163 ++RL+C F++ + A +FP P +PP+ +R Sbjct: 5342 LFRLVCNSSPFLWYMWALGGLFPRLFPPPGKVPPVYFETKR 5220 >Plasmodium_knowlesi_strain_H|PK4.chr11.pseudo.embl|PKH_111110|Annotat ion|Plasmodium_knowlesi_Sanger|(protein coding) hypothetical protein, conserved in Plasmodium species Length = 9525 Score = 26.2 bits (56), Expect = 7.3, Method: Composition-based stats. Identities = 18/44 (40%), Positives = 25/44 (56%), Gaps = 4/44 (9%) Frame = +1 Query: 192 MLCNTSVIHVYQNRNLIYDDYYSDHL--FVQRL--RENMTHVPG 231 +LC S +QN + +YDD Y DHL ++ R RE + VPG Sbjct: 2863 LLCRNSKNTCFQNVSKVYDDIY-DHLEKYILRYKSREGLV-VPG 2988 Database: cds.fasta Posted date: Mar 6, 2008 6:01 PM Number of letters in database: 11,140,562 Number of sequences in database: 5106 Lambda K H 0.332 0.147 0.443 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 5106 Number of Hits to DB: 3,734,200 Number of extensions: 71623 Number of successful extensions: 811 Number of sequences better than 10.0: 37 Number of HSP's gapped: 809 Number of HSP's successfully gapped: 39 Length of query: 231 Length of database: 3,713,520 Length adjustment: 89 Effective length of query: 142 Effective length of database: 3,259,086 Effective search space: 462790212 Effective search space used: 462790212 Neighboring words threshold: 13 Window for multiple hits: 40 X1: 15 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.9 bits) S2: 38 (19.2 bits)