TBLASTN 2.2.17 [Aug-26-2007]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,  
Eugene V. Koonin, and Stephen F. Altschul (2001), 
"Improving the accuracy of PSI-BLAST protein database searches with 
composition-based statistics and other refinements",  Nucleic Acids Res. 29:2994-3005.

Query= EMBOSS_001_1
         (289 letters)

Database: cds.fa 
           5106 sequences; 11,140,562 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

Plasmodium_knowlesi_strain_H|chr02|PKH_020960|Annotation|Plasmod...   553   e-158
Plasmodium_knowlesi_strain_H|PK4.chr09|PKH_090990|Annotation|Pla...    29   1.3  
Plasmodium_knowlesi_strain_H|PK4.chr06|PKH_060360|Annotation|Pla...    28   2.9  
Plasmodium_knowlesi_strain_H|chr14|PKH_145930|Annotation|Plasmod...    28   3.2  
Plasmodium_knowlesi_strain_H|chr02|PKH_021190|Annotation|Plasmod...    27   5.8  
Plasmodium_knowlesi_strain_H|PK4.chr08|PKH_083220|Annotation|Pla...    26   8.6  
Plasmodium_knowlesi_strain_H|PK4.chr07|PKH_070230|Annotation|Pla...    26   8.9  
Plasmodium_knowlesi_strain_H|PK4.chr09|PKH_093260|Annotation|Pla...    26   10.0 

>Plasmodium_knowlesi_strain_H|chr02|PKH_020960|Annotation|Plasmodium
           _knowlesi_Sanger|(protein coding) hypothetical protein,
           conserved in Plasmodium species
          Length = 1551

 Score =  553 bits (1425), Expect = e-158,   Method: Composition-based stats.
 Identities = 288/288 (100%), Positives = 288/288 (100%)
 Frame = +1

Query: 1   MKCAFLFYGKPCSGKDTFINLLLTKRRQLELFLYLFNNLIHKNESNVRLREKIFFIQLIK 60
           MKCAFLFYGKPCSGKDTFINLLLTKRRQLELFLYLFNNLIHKNESNVRLREKIFFIQLIK
Sbjct: 1   MKCAFLFYGKPCSGKDTFINLLLTKRRQLELFLYLFNNLIHKNESNVRLREKIFFIQLIK 180

Query: 61  YYYRIEKRRRKTSIFCGHGGETDQKVSFRFLVQLTSHLLHIGRSSGWDRGAGSKGVKNGK 120
           YYYRIEKRRRKTSIFCGHGGETDQKVSFRFLVQLTSHLLHIGRSSGWDRGAGSKGVKNGK
Sbjct: 181 YYYRIEKRRRKTSIFCGHGGETDQKVSFRFLVQLTSHLLHIGRSSGWDRGAGSKGVKNGK 360

Query: 121 TKEQLRRGLPPRGITHQLYPTLRKVVQRKPLRRTTLTNQSDDFHLICYLYRHKDEWMYHY 180
           TKEQLRRGLPPRGITHQLYPTLRKVVQRKPLRRTTLTNQSDDFHLICYLYRHKDEWMYHY
Sbjct: 361 TKEQLRRGLPPRGITHQLYPTLRKVVQRKPLRRTTLTNQSDDFHLICYLYRHKDEWMYHY 540

Query: 181 HSYLRRNKIKTFNVSLDFIEKQLYRGHLPEVTSLNPSKIVQRNLHLLLNRTYTVYGREKH 240
           HSYLRRNKIKTFNVSLDFIEKQLYRGHLPEVTSLNPSKIVQRNLHLLLNRTYTVYGREKH
Sbjct: 541 HSYLRRNKIKTFNVSLDFIEKQLYRGHLPEVTSLNPSKIVQRNLHLLLNRTYTVYGREKH 720

Query: 241 YRGVYYPIVKKTTIKGGHVMLIRALNHKKEKGPNMTEAVNIAMKPTSA 288
           YRGVYYPIVKKTTIKGGHVMLIRALNHKKEKGPNMTEAVNIAMKPTSA
Sbjct: 721 YRGVYYPIVKKTTIKGGHVMLIRALNHKKEKGPNMTEAVNIAMKPTSA 864


>Plasmodium_knowlesi_strain_H|PK4.chr09|PKH_090990|Annotation|Plasmo
           dium_knowlesi_Sanger|(protein coding) hypothetical
           protein, conserved in Plasmodium species
          Length = 1968

 Score = 28.9 bits (63), Expect = 1.3,   Method: Composition-based stats.
 Identities = 23/68 (33%), Positives = 33/68 (48%)
 Frame = +1

Query: 9   GKPCSGKDTFINLLLTKRRQLELFLYLFNNLIHKNESNVRLREKIFFIQLIKYYYRIEKR 68
           G P SGK + +N++  K   L + L+  NNL      N  LR KI+   +I Y     K 
Sbjct: 691 GPPGSGKSSLVNVIKNKTNNLFISLFHLNNL------NNELR-KIYDQSVINY-----KL 834

Query: 69  RRKTSIFC 76
            +K +I C
Sbjct: 835 SKKRTILC 858


>Plasmodium_knowlesi_strain_H|PK4.chr06|PKH_060360|Annotation|Plasmo
           dium_knowlesi_Sanger|(protein coding) hypothetical
           protein, conserved in Plasmodium species
          Length = 2967

 Score = 27.7 bits (60), Expect = 2.9,   Method: Composition-based stats.
 Identities = 25/109 (22%), Positives = 44/109 (40%), Gaps = 4/109 (3%)
 Frame = +1

Query: 167 CYLYRHKDEWMYHYHSYLRRNKIK--TFNVSLDFIEKQLYRGHLPEVTSLNPSKIVQRNL 224
           C L+     ++YH+   + R +++    N  ++  ++  Y   L  +   N  K  ++N+
Sbjct: 775 CRLWWEDKPFIYHWERGMNRKEVQRCVHNFCINIAKEDFYLARLERLAEDNLFKEKEKNI 954

Query: 225 H--LLLNRTYTVYGREKHYRGVYYPIVKKTTIKGGHVMLIRALNHKKEK 271
               L   TY  Y   + Y            +KG    LI+  NH KEK
Sbjct: 955 QKIQLKGNTYAYYKICERYEKD-----SLGDVKGNEEYLIKRKNHVKEK 1086


>Plasmodium_knowlesi_strain_H|chr14|PKH_145930|Annotation|Plasmodium_k
            nowlesi_Sanger|(protein coding) queuine
            trna-ribosyltransferase, putative
          Length = 2118

 Score = 27.7 bits (60), Expect = 3.2,   Method: Composition-based stats.
 Identities = 17/57 (29%), Positives = 31/57 (54%)
 Frame = +1

Query: 146  VQRKPLRRTTLTNQSDDFHLICYLYRHKDEWMYHYHSYLRRNKIKTFNVSLDFIEKQ 202
            ++R+   R+   ++S  +++ C L   K   M +YHSYL     K +NV   +IE++
Sbjct: 1285 IEREYTERSM--HRSHRWYIRCLLEFKKAMSMPNYHSYLNELHNKKYNVKHKWIERE 1449


>Plasmodium_knowlesi_strain_H|chr02|PKH_021190|Annotation|Plasmodium_k
            nowlesi_Sanger|(protein coding) hypothetical protein,
            conserved in Apicomplexan species
          Length = 2106

 Score = 26.9 bits (58), Expect = 5.8,   Method: Composition-based stats.
 Identities = 34/138 (24%), Positives = 56/138 (40%), Gaps = 14/138 (10%)
 Frame = +1

Query: 131  PRGITHQLYPTLRKVV------QRKPLRRTTLTNQ--SDDFHL------ICYLYRHKDEW 176
            PR  T+ +Y T+R  +        K  R T    Q  S  F L      + Y+  + D +
Sbjct: 1705 PRIFTYNVYRTIRPKLLYLIRHMNKTFRDTLSFPQYFSYSFRLRIIPRHVAYMNIYYDNY 1884

Query: 177  MYHYHSYLRRNKIKTFNVSLDFIEKQLYRGHLPEVTSLNPSKIVQRNLHLLLNRTYTVYG 236
            + +Y   LR +    FN   +   + +Y+  +P +           NL +LL  +   + 
Sbjct: 1885 ISYYKELLRTHNYDDFNRKFN---ELVYKPDIPPI-----------NLKMLLQTSNKDF- 2019

Query: 237  REKHYRGVYYPIVKKTTI 254
              KHY+  YY  VK T +
Sbjct: 2020 -MKHYKISYYDFVKSTQV 2070


>Plasmodium_knowlesi_strain_H|PK4.chr08|PKH_083220|Annotation|Plasmo
           dium_knowlesi_Sanger|(protein coding) hypothetical
           protein, conserved in Plasmodium species
          Length = 1899

 Score = 26.2 bits (56), Expect = 8.6,   Method: Composition-based stats.
 Identities = 25/86 (29%), Positives = 36/86 (41%), Gaps = 5/86 (5%)
 Frame = +1

Query: 193 NVSLDFIEKQLYRGHLPEVTSLNPSKIVQ-----RNLHLLLNRTYTVYGREKHYRGVYYP 247
           N+ L   E Q  R   P+   ++P K  +     RN+    + T   YGR KH R  Y  
Sbjct: 85  NLVLRSAEGQAIRA--PQRYYIHPRKAKRTERRGRNVPPCYHNTLRNYGRNKHTRLFYRS 258

Query: 248 IVKKTTIKGGHVMLIRALNHKKEKGP 273
             +K   + G++     L  KK K P
Sbjct: 259 KKEKDIQESGYMYPFDHLEKKKTKFP 336


>Plasmodium_knowlesi_strain_H|PK4.chr07|PKH_070230|Annotation|Plasmodi
            um_knowlesi_Sanger|(protein coding) hypothetical protein,
            conserved in Plasmodium species
          Length = 6249

 Score = 26.2 bits (56), Expect = 8.9,   Method: Composition-based stats.
 Identities = 17/62 (27%), Positives = 30/62 (48%)
 Frame = +1

Query: 28   QLELFLYLFNNLIHKNESNVRLREKIFFIQLIKYYYRIEKRRRKTSIFCGHGGETDQKVS 87
            +L  FL+LF     KN+  +R   K +F+    YY   +K+    +  C H  E D++  
Sbjct: 5416 KLHDFLFLF-----KNKVKMRKDTKCYFLM---YYVLYKKKLFHNNKLCKHKNENDEEYH 5571

Query: 88   FR 89
            ++
Sbjct: 5572 YK 5577


>Plasmodium_knowlesi_strain_H|PK4.chr09|PKH_093260|Annotation|Plasmodi
            um_knowlesi_Sanger|(protein coding) dna-directed rna
            polymerase, beta subunit, putative
          Length = 4620

 Score = 26.2 bits (56), Expect = 10.0,   Method: Composition-based stats.
 Identities = 11/20 (55%), Positives = 13/20 (65%)
 Frame = +1

Query: 226  LLLNRTYTVYGREKHYRGVY 245
            LLLN+ Y  YG E  Y G+Y
Sbjct: 4066 LLLNKGYDYYGTELLYSGIY 4125


  Database: cds.fa
    Posted date:  Mar 10, 2008  1:57 PM
  Number of letters in database: 11,140,562
  Number of sequences in database:  5106
  
Lambda     K      H
   0.324    0.140    0.426 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 5106
Number of Hits to DB: 4,618,567
Number of extensions: 87015
Number of successful extensions: 789
Number of sequences better than 10.0: 26
Number of HSP's gapped: 786
Number of HSP's successfully gapped: 26
Length of query: 289
Length of database: 3,713,520
Length adjustment: 91
Effective length of query: 198
Effective length of database: 3,248,874
Effective search space: 643277052
Effective search space used: 643277052
Neighboring words threshold: 13
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 39 (19.6 bits)
# TBLASTN 2.2.17 [Aug-26-2007]
# Query: EMBOSS_001_1
# Database: cds.fa
# Fields: Query id, Subject id, % identity, alignment length, mismatches, gap openings, q. start, q. end, s. start, s. end, e-value, bit score
EMBOSS_001_1	Plasmodium_knowlesi_strain_H|chr02|PKH_020960|Annotation|Plasmodium_knowlesi_Sanger|(protein	100.00	288	0	0	1	288	1	864	2e-158	 553
EMBOSS_001_1	Plasmodium_knowlesi_strain_H|PK4.chr09|PKH_090990|Annotation|Plasmodium_knowlesi_Sanger|(protein	33.82	68	45	3	9	76	691	858	1.3	28.9
EMBOSS_001_1	Plasmodium_knowlesi_strain_H|PK4.chr06|PKH_060360|Annotation|Plasmodium_knowlesi_Sanger|(protein	22.94	109	80	3	167	271	775	1086	2.9	27.7
EMBOSS_001_1	Plasmodium_knowlesi_strain_H|chr14|PKH_145930|Annotation|Plasmodium_knowlesi_Sanger|(protein	29.82	57	40	1	146	202	1285	1449	3.2	27.7
EMBOSS_001_1	Plasmodium_knowlesi_strain_H|chr02|PKH_021190|Annotation|Plasmodium_knowlesi_Sanger|(protein	24.64	138	90	6	131	254	1705	2070	5.8	26.9
EMBOSS_001_1	Plasmodium_knowlesi_strain_H|PK4.chr08|PKH_083220|Annotation|Plasmodium_knowlesi_Sanger|(protein	29.07	86	56	2	193	273	85	336	8.6	26.2
EMBOSS_001_1	Plasmodium_knowlesi_strain_H|PK4.chr07|PKH_070230|Annotation|Plasmodium_knowlesi_Sanger|(protein	27.42	62	45	2	28	89	5416	5577	8.9	26.2
EMBOSS_001_1	Plasmodium_knowlesi_strain_H|PK4.chr09|PKH_093260|Annotation|Plasmodium_knowlesi_Sanger|(protein	55.00	20	9	0	226	245	4066	4125	10.0	26.2