TBLASTN 2.2.17 [Aug-26-2007]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,  
Eugene V. Koonin, and Stephen F. Altschul (2001), 
"Improving the accuracy of PSI-BLAST protein database searches with 
composition-based statistics and other refinements",  Nucleic Acids Res. 29:2994-3005.

Query= sel3_Pknowlesi_prot
         (344 letters)

Database: cds.fa 
           5106 sequences; 11,140,562 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

Plasmodium_knowlesi_strain_H|chr14|PKH_142340|Annotation|Plasmod...   566   e-162
Plasmodium_knowlesi_strain_H|PK4.chr09|PKH_093850|Annotation|Pla...    30   0.70 
Plasmodium_knowlesi_strain_H|PK4.chr05|PKH_050680|Annotation|Pla...    29   2.0  
Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_134460|Annotation|Pla...    28   4.7  
Plasmodium_knowlesi_strain_H|PK4.chr12.pseudo|PKH_121780|Annotat...    27   7.9  
Plasmodium_knowlesi_strain_H|PK4.chr12.pseudo|PKH_120090|Annotat...    27   8.0  
Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_132690|Annotation|Pla...    27   8.3  
Plasmodium_knowlesi_strain_H|PK4.chr07|PKH_072690|Annotation|Pla...    27   8.3  
Plasmodium_knowlesi_strain_H|PK4.chr12.pseudo|PKH_124940|Annotat...    27   8.4  

>Plasmodium_knowlesi_strain_H|chr14|PKH_142340|Annotation|Plasmodium
           _knowlesi_Sanger|(protein coding) hypothetical protein,
           conserved in Plasmodium species
          Length = 972

 Score =  566 bits (1458), Expect = e-162,   Method: Composition-based stats.
 Identities = 323/344 (93%), Positives = 323/344 (93%)
 Frame = +1

Query: 1   MVLNKVYLLTILVLFYVNTLCVEAG*SKKLHIKLPNEDDDYLGKLINISSKITKYAQNNK 60
           MVLNKVYLLTIL                     LPNEDDDYLGKLINISSKITKYAQNNK
Sbjct: 1   MVLNKVYLLTIL---------------------LPNEDDDYLGKLINISSKITKYAQNNK 117

Query: 61  FKIAKILSTSALSAYSLNWVYQTGVTLLKDPHYSLFVPSNNYMNNAIRRIRTNYPVKSYT 120
           FKIAKILSTSALSAYSLNWVYQTGVTLLKDPHYSLFVPSNNYMNNAIRRIRTNYPVKSYT
Sbjct: 118 FKIAKILSTSALSAYSLNWVYQTGVTLLKDPHYSLFVPSNNYMNNAIRRIRTNYPVKSYT 297

Query: 121 FKVNKILERNFHHEYANTEMGQIYVVNNFVNFLNFLPYKWRKKCTYNFCKYKEFENIGNL 180
           FKVNKILERNFHHEYANTEMGQIYVVNNFVNFLNFLPYKWRKKCTYNFCKYKEFENIGNL
Sbjct: 298 FKVNKILERNFHHEYANTEMGQIYVVNNFVNFLNFLPYKWRKKCTYNFCKYKEFENIGNL 477

Query: 181 FNCIRISKHEGPILLFQGKLKKQYWIHLPLKYEIVKNGEEDSLCTLTFTPLHKYYSDYTI 240
           FNCIRISKHEGPILLFQGKLKKQYWIHLPLKYEIVKNGEEDSLCTLTFTPLHKYYSDYTI
Sbjct: 478 FNCIRISKHEGPILLFQGKLKKQYWIHLPLKYEIVKNGEEDSLCTLTFTPLHKYYSDYTI 657

Query: 241 EIKLVKEKENNNVTFITSVKCANKNNGNGNSFYINVIQNIAMFLAYDIFEGINNNIHVVH 300
           EIKLVKEKENNNVTFITSVKCANKNNGNGNSFYINVIQNIAMFLAYDIFEGINNNIHVVH
Sbjct: 658 EIKLVKEKENNNVTFITSVKCANKNNGNGNSFYINVIQNIAMFLAYDIFEGINNNIHVVH 837

Query: 301 RRNANYRKTTFNTSNVTLKRKKKTNFQFVLSPVINPWSFKIRRS 344
           RRNANYRKTTFNTSNVTLKRKKKTNFQFVLSPVINPWSFKIRRS
Sbjct: 838 RRNANYRKTTFNTSNVTLKRKKKTNFQFVLSPVINPWSFKIRRS 969


>Plasmodium_knowlesi_strain_H|PK4.chr09|PKH_093850|Annotation|Plasmodi
            um_knowlesi_Sanger|(protein coding) hypothetical protein,
            conserved in Plasmodium species
          Length = 7470

 Score = 30.4 bits (67), Expect = 0.70,   Method: Composition-based stats.
 Identities = 14/34 (41%), Positives = 21/34 (61%), Gaps = 1/34 (2%)
 Frame = +1

Query: 159  KWRKKCTYNFCKYKEFENIGN-LFNCIRISKHEG 191
            +WRK+ + +FC + E +NI   L+N    SK EG
Sbjct: 3646 RWRKESSSDFCNFHEMKNIERVLYNSRVASKGEG 3747


>Plasmodium_knowlesi_strain_H|PK4.chr05|PKH_050680|Annotation|Plasmo
           dium_knowlesi_Sanger|(protein coding) hypothetical
           protein, conserved in Plasmodium species
          Length = 3138

 Score = 28.9 bits (63), Expect = 2.0,   Method: Composition-based stats.
 Identities = 15/42 (35%), Positives = 25/42 (59%)
 Frame = +1

Query: 212 YEIVKNGEEDSLCTLTFTPLHKYYSDYTIEIKLVKEKENNNV 253
           +E+ KN   +++C L +T    +YS Y  E+  VKE+  NN+
Sbjct: 415 HEMSKN---ENICMLKYTSYESFYSKYDNEMDKVKEQGENNI 531


>Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_134460|Annotation|Plasmodi
            um_knowlesi_Sanger|(protein coding) hypothetical protein,
            conserved in Plasmodium species
          Length = 1665

 Score = 27.7 bits (60), Expect = 4.7,   Method: Composition-based stats.
 Identities = 15/35 (42%), Positives = 18/35 (51%)
 Frame = +3

Query: 159  KWRKKCTYNFCKYKEFENIGNLFNCIRISKHEGPI 193
            KWRKKC  N    + +E+ G   N  R SK E  I
Sbjct: 1023 KWRKKCKRNEGSRR*YESRGQQCNGGRKSKGEASI 1127


>Plasmodium_knowlesi_strain_H|PK4.chr12.pseudo|PKH_121780|Annotation|P
            lasmodium_knowlesi_Sanger|(protein coding) hypothetical
            protein, conserved in Plasmodium species
          Length = 1926

 Score = 26.9 bits (58), Expect = 7.9,   Method: Composition-based stats.
 Identities = 17/41 (41%), Positives = 24/41 (58%)
 Frame = +1

Query: 133  HEYANTEMGQIYVVNNFVNFLNFLPYKWRKKCTYNFCKYKE 173
            H+   TE  +IYV+ N +NF N L  K+R    Y +C+ KE
Sbjct: 1495 HDGKETE-NEIYVLKNKINFNNDLKKKYR---FYFYCRMKE 1605


>Plasmodium_knowlesi_strain_H|PK4.chr12.pseudo|PKH_120090|Annotation
           |Plasmodium_knowlesi_Sanger|(protein coding)
           hypothetical protein, conserved in Plasmodium species
          Length = 1842

 Score = 26.9 bits (58), Expect = 8.0,   Method: Composition-based stats.
 Identities = 11/35 (31%), Positives = 24/35 (68%)
 Frame = +1

Query: 221 DSLCTLTFTPLHKYYSDYTIEIKLVKEKENNNVTF 255
           D + T+TF  L+K Y+D+  ++K ++ +EN+ + +
Sbjct: 124 DIILTITFFALYKLYNDFK-KMKFLQPRENHQIIY 225


>Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_132690|Annotation|Plasmo
           dium_knowlesi_Sanger|(protein coding) hypothetical
           protein, conserved in Plasmodium species
          Length = 918

 Score = 26.6 bits (57), Expect = 8.3,   Method: Composition-based stats.
 Identities = 14/32 (43%), Positives = 17/32 (53%), Gaps = 1/32 (3%)
 Frame = +1

Query: 151 NFLNFLPYKWRKK-CTYNFCKYKEFENIGNLF 181
           NF  FL Y ++KK C  NFCK      + N F
Sbjct: 517 NFYTFLSYDYKKKSCIQNFCKKYSQVEVSNQF 612


>Plasmodium_knowlesi_strain_H|PK4.chr07|PKH_072690|Annotation|Plasmo
           dium_knowlesi_Sanger|(protein coding) SICAvar antigen
           (fragment)
          Length = 970

 Score = 26.6 bits (57), Expect = 8.3,   Method: Composition-based stats.
 Identities = 18/74 (24%), Positives = 35/74 (47%), Gaps = 1/74 (1%)
 Frame = +1

Query: 251 NNVTFITSVKCANKNNGNGNSFYI-NVIQNIAMFLAYDIFEGINNNIHVVHRRNANYRKT 309
           N  + +  +K  N N+G  N+F + +V+ N  + L  +  +GI   + V    N     +
Sbjct: 742 NKTSVVEFMKLDNSNSGQSNTFSLADVLMNSEIQLPENTIQGILKEM-VADSANGTVEPS 918

Query: 310 TFNTSNVTLKRKKK 323
              T+  TL+++ K
Sbjct: 919 KMKTAVQTLEKESK 960


>Plasmodium_knowlesi_strain_H|PK4.chr12.pseudo|PKH_124940|Annotation
           |Plasmodium_knowlesi_Sanger|(protein coding)
           hypothetical protein, conserved in Plasmodium species
          Length = 2133

 Score = 26.6 bits (57), Expect = 8.4,   Method: Composition-based stats.
 Identities = 13/31 (41%), Positives = 16/31 (51%)
 Frame = -1

Query: 141 GQIYVVNNFVNFLNFLPYKWRKKCTYNFCKY 171
           G IY  N+F  FL+F  +K      YN C Y
Sbjct: 327 GSIYQANSFYFFLHFCAFKQITDILYNVCAY 235


  Database: cds.fa
    Posted date:  Mar 10, 2008  1:57 PM
  Number of letters in database: 11,140,562
  Number of sequences in database:  5106
  
Lambda     K      H
   0.323    0.138    0.418 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 5106
Number of Hits to DB: 5,393,821
Number of extensions: 104004
Number of successful extensions: 805
Number of sequences better than 10.0: 55
Number of HSP's gapped: 803
Number of HSP's successfully gapped: 57
Length of query: 344
Length of database: 3,713,520
Length adjustment: 93
Effective length of query: 251
Effective length of database: 3,238,662
Effective search space: 812904162
Effective search space used: 812904162
Neighboring words threshold: 13
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 40 (20.0 bits)
# TBLASTN 2.2.17 [Aug-26-2007]
# Query: sel3_Pknowlesi_prot
# Database: cds.fa
# Fields: Query id, Subject id, % identity, alignment length, mismatches, gap openings, q. start, q. end, s. start, s. end, e-value, bit score
sel3_Pknowlesi_prot	Plasmodium_knowlesi_strain_H|chr14|PKH_142340|Annotation|Plasmodium_knowlesi_Sanger|(protein	93.90	344	21	1	1	344	1	969	4e-162	 566
sel3_Pknowlesi_prot	Plasmodium_knowlesi_strain_H|PK4.chr09|PKH_093850|Annotation|Plasmodium_knowlesi_Sanger|(protein	41.18	34	19	1	159	191	3646	3747	0.70	30.4
sel3_Pknowlesi_prot	Plasmodium_knowlesi_strain_H|PK4.chr05|PKH_050680|Annotation|Plasmodium_knowlesi_Sanger|(protein	35.71	42	27	1	212	253	415	531	2.0	28.9
sel3_Pknowlesi_prot	Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_134460|Annotation|Plasmodium_knowlesi_Sanger|(protein	42.86	35	20	0	159	193	1023	1127	4.7	27.7
sel3_Pknowlesi_prot	Plasmodium_knowlesi_strain_H|PK4.chr12.pseudo|PKH_121780|Annotation|Plasmodium_knowlesi_Sanger|(protein	41.46	41	24	2	133	173	1495	1605	7.9	26.9
sel3_Pknowlesi_prot	Plasmodium_knowlesi_strain_H|PK4.chr12.pseudo|PKH_120090|Annotation|Plasmodium_knowlesi_Sanger|(protein	31.43	35	24	1	221	255	124	225	8.0	26.9
sel3_Pknowlesi_prot	Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_132690|Annotation|Plasmodium_knowlesi_Sanger|(protein	43.75	32	17	1	151	181	517	612	8.3	26.6
sel3_Pknowlesi_prot	Plasmodium_knowlesi_strain_H|PK4.chr07|PKH_072690|Annotation|Plasmodium_knowlesi_Sanger|(protein	24.32	74	55	2	251	323	742	960	8.3	26.6
sel3_Pknowlesi_prot	Plasmodium_knowlesi_strain_H|PK4.chr12.pseudo|PKH_124940|Annotation|Plasmodium_knowlesi_Sanger|(protein	41.94	31	18	0	141	171	327	235	8.4	26.6