TBLASTN 2.2.17 [Aug-26-2007]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,  
Eugene V. Koonin, and Stephen F. Altschul (2001), 
"Improving the accuracy of PSI-BLAST protein database searches with 
composition-based statistics and other refinements",  Nucleic Acids Res. 29:2994-3005.

Query= Sel2_prot
         (231 letters)

Database: cds.fasta 
           5106 sequences; 11,140,562 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

Plasmodium_knowlesi_strain_H|PK4.chr07|PKH_072950|Annotation|Pla...   287   2e-78
Plasmodium_knowlesi_strain_H|PK4.chr11.pseudo.embl|PKH_111780|An...    28   1.5  
Plasmodium_knowlesi_strain_H|PK4.chr11.pseudo.embl|PKH_114380|An...    28   1.5  
Plasmodium_knowlesi_strain_H|PK4.chr07|PKH_070150|Annotation|Pla...    27   3.5  
Plasmodium_knowlesi_strain_H|PK4.chr11.pseudo.embl|PKH_111110|An...    26   7.3  

>Plasmodium_knowlesi_strain_H|PK4.chr07|PKH_072950|Annotation|Plasmo
           dium_knowlesi_Sanger|(protein coding) hypothetical
           protein, conserved in Plasmodium species
          Length = 564

 Score =  287 bits (734), Expect = 2e-78,   Method: Composition-based stats.
 Identities = 187/231 (80%), Positives = 187/231 (80%)
 Frame = +1

Query: 1   MYYLKLFLKLLLLLFLSYDFIVLKNGTREVVHGDRNFLKESYEFLFVQKDTFPSYINRDN 60
           MYYLKLFLKLLLLLFLSYDFIVLKNGTREVVHGDRNFLKESYEFLF              
Sbjct: 1   MYYLKLFLKLLLLLFLSYDFIVLKNGTREVVHGDRNFLKESYEFLF-------------- 138

Query: 61  QISIYFCKS*QSHYVLSRIKKYFDLLNHGEDTEIYFEEDSYQADNSSNGEEKYVIFLRNI 120
                                         DTEIYFEEDSYQADNSSNGEEKYVIFLRNI
Sbjct: 139 ------------------------------DTEIYFEEDSYQADNSSNGEEKYVIFLRNI 228

Query: 121 KAALIYRLICRFVLFIYVLIAASLIFPHYLKPYIPPILLNNRRFNEVFENMQQKKLMVLG 180
           KAALIYRLICRFVLFIYVLIAASLIFPHYLKPYIPPILLNNRRFNEVFENMQQKKLMVLG
Sbjct: 229 KAALIYRLICRFVLFIYVLIAASLIFPHYLKPYIPPILLNNRRFNEVFENMQQKKLMVLG 408

Query: 181 IMFFSYNVIYGMLCNTSVIHVYQNRNLIYDDYYSDHLFVQRLRENMTHVPG 231
           IMFFSYNVIYGMLCNTSVIHVYQNRNLIYDDYYSDHLFVQRLRENMTHVPG
Sbjct: 409 IMFFSYNVIYGMLCNTSVIHVYQNRNLIYDDYYSDHLFVQRLRENMTHVPG 561


>Plasmodium_knowlesi_strain_H|PK4.chr11.pseudo.embl|PKH_111780|Annotat
            ion|Plasmodium_knowlesi_Sanger|(protein coding)
            hypothetical protein, conserved in Plasmodium species
          Length = 5463

 Score = 28.5 bits (62), Expect = 1.5,   Method: Composition-based stats.
 Identities = 20/72 (27%), Positives = 38/72 (52%), Gaps = 3/72 (4%)
 Frame = +1

Query: 64   IYFCKS*QSHYVLSRIKKYFDLLNHGEDTEIYFEEDSYQADNSSNGEEKYVI---FLRNI 120
            IY  +S +     + +K YF+     +  +IYFEE+++Q     N +EK++    F+ N+
Sbjct: 4798 IYGGRSPRDESFENAMKAYFNAKAIYDKWKIYFEENNFQG----NAQEKFLFNLSFIYNV 4965

Query: 121  KAALIYRLICRF 132
               L++  IC +
Sbjct: 4966 YNYLVWIYICTY 5001


>Plasmodium_knowlesi_strain_H|PK4.chr11.pseudo.embl|PKH_114380|Annot
           ation|Plasmodium_knowlesi_Sanger|(protein coding)
           SICAvar antigen (fragment)
          Length = 912

 Score = 28.5 bits (62), Expect = 1.5,   Method: Composition-based stats.
 Identities = 18/39 (46%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
 Frame = -3

Query: 122 AALIYRLICRF--VLFIYVLIAASLIFPHYLKPYIPPIL 158
           A  IYR ICR   VL+IY ++  S        PY+PPIL
Sbjct: 589 AEWIYRTICRCL*VLYIYPILHPS--------PYVPPIL 497


>Plasmodium_knowlesi_strain_H|PK4.chr07|PKH_070150|Annotation|Plasmodi
            um_knowlesi_Sanger|(protein coding) hypothetical protein,
            conserved in Plasmodium species
          Length = 6954

 Score = 26.9 bits (58), Expect = 3.5,   Method: Composition-based stats.
 Identities = 11/41 (26%), Positives = 21/41 (51%), Gaps = 2/41 (4%)
 Frame = -2

Query: 125  IYRLICRFVLFIYVLIAASLIFPHYLKP--YIPPILLNNRR 163
            ++RL+C    F++ + A   +FP    P   +PP+    +R
Sbjct: 5342 LFRLVCNSSPFLWYMWALGGLFPRLFPPPGKVPPVYFETKR 5220


>Plasmodium_knowlesi_strain_H|PK4.chr11.pseudo.embl|PKH_111110|Annotat
            ion|Plasmodium_knowlesi_Sanger|(protein coding)
            hypothetical protein, conserved in Plasmodium species
          Length = 9525

 Score = 26.2 bits (56), Expect = 7.3,   Method: Composition-based stats.
 Identities = 18/44 (40%), Positives = 25/44 (56%), Gaps = 4/44 (9%)
 Frame = +1

Query: 192  MLCNTSVIHVYQNRNLIYDDYYSDHL--FVQRL--RENMTHVPG 231
            +LC  S    +QN + +YDD Y DHL  ++ R   RE +  VPG
Sbjct: 2863 LLCRNSKNTCFQNVSKVYDDIY-DHLEKYILRYKSREGLV-VPG 2988


  Database: cds.fasta
    Posted date:  Mar 6, 2008  6:01 PM
  Number of letters in database: 11,140,562
  Number of sequences in database:  5106
  
Lambda     K      H
   0.332    0.147    0.443 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 5106
Number of Hits to DB: 3,734,200
Number of extensions: 71623
Number of successful extensions: 811
Number of sequences better than 10.0: 37
Number of HSP's gapped: 809
Number of HSP's successfully gapped: 39
Length of query: 231
Length of database: 3,713,520
Length adjustment: 89
Effective length of query: 142
Effective length of database: 3,259,086
Effective search space: 462790212
Effective search space used: 462790212
Neighboring words threshold: 13
Window for multiple hits: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.9 bits)
S2: 38 (19.2 bits)