TBLASTN 2.2.17 [Aug-26-2007] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= EMBOSS_001_1 (289 letters) Database: cds.fa 5106 sequences; 11,140,562 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value Plasmodium_knowlesi_strain_H|chr02|PKH_020960|Annotation|Plasmod... 553 e-158 Plasmodium_knowlesi_strain_H|PK4.chr09|PKH_090990|Annotation|Pla... 29 1.3 Plasmodium_knowlesi_strain_H|PK4.chr06|PKH_060360|Annotation|Pla... 28 2.9 Plasmodium_knowlesi_strain_H|chr14|PKH_145930|Annotation|Plasmod... 28 3.2 Plasmodium_knowlesi_strain_H|chr02|PKH_021190|Annotation|Plasmod... 27 5.8 Plasmodium_knowlesi_strain_H|PK4.chr08|PKH_083220|Annotation|Pla... 26 8.6 Plasmodium_knowlesi_strain_H|PK4.chr07|PKH_070230|Annotation|Pla... 26 8.9 Plasmodium_knowlesi_strain_H|PK4.chr09|PKH_093260|Annotation|Pla... 26 10.0 >Plasmodium_knowlesi_strain_H|chr02|PKH_020960|Annotation|Plasmodium _knowlesi_Sanger|(protein coding) hypothetical protein, conserved in Plasmodium species Length = 1551 Score = 553 bits (1425), Expect = e-158, Method: Composition-based stats. Identities = 288/288 (100%), Positives = 288/288 (100%) Frame = +1 Query: 1 MKCAFLFYGKPCSGKDTFINLLLTKRRQLELFLYLFNNLIHKNESNVRLREKIFFIQLIK 60 MKCAFLFYGKPCSGKDTFINLLLTKRRQLELFLYLFNNLIHKNESNVRLREKIFFIQLIK Sbjct: 1 MKCAFLFYGKPCSGKDTFINLLLTKRRQLELFLYLFNNLIHKNESNVRLREKIFFIQLIK 180 Query: 61 YYYRIEKRRRKTSIFCGHGGETDQKVSFRFLVQLTSHLLHIGRSSGWDRGAGSKGVKNGK 120 YYYRIEKRRRKTSIFCGHGGETDQKVSFRFLVQLTSHLLHIGRSSGWDRGAGSKGVKNGK Sbjct: 181 YYYRIEKRRRKTSIFCGHGGETDQKVSFRFLVQLTSHLLHIGRSSGWDRGAGSKGVKNGK 360 Query: 121 TKEQLRRGLPPRGITHQLYPTLRKVVQRKPLRRTTLTNQSDDFHLICYLYRHKDEWMYHY 180 TKEQLRRGLPPRGITHQLYPTLRKVVQRKPLRRTTLTNQSDDFHLICYLYRHKDEWMYHY Sbjct: 361 TKEQLRRGLPPRGITHQLYPTLRKVVQRKPLRRTTLTNQSDDFHLICYLYRHKDEWMYHY 540 Query: 181 HSYLRRNKIKTFNVSLDFIEKQLYRGHLPEVTSLNPSKIVQRNLHLLLNRTYTVYGREKH 240 HSYLRRNKIKTFNVSLDFIEKQLYRGHLPEVTSLNPSKIVQRNLHLLLNRTYTVYGREKH Sbjct: 541 HSYLRRNKIKTFNVSLDFIEKQLYRGHLPEVTSLNPSKIVQRNLHLLLNRTYTVYGREKH 720 Query: 241 YRGVYYPIVKKTTIKGGHVMLIRALNHKKEKGPNMTEAVNIAMKPTSA 288 YRGVYYPIVKKTTIKGGHVMLIRALNHKKEKGPNMTEAVNIAMKPTSA Sbjct: 721 YRGVYYPIVKKTTIKGGHVMLIRALNHKKEKGPNMTEAVNIAMKPTSA 864 >Plasmodium_knowlesi_strain_H|PK4.chr09|PKH_090990|Annotation|Plasmo dium_knowlesi_Sanger|(protein coding) hypothetical protein, conserved in Plasmodium species Length = 1968 Score = 28.9 bits (63), Expect = 1.3, Method: Composition-based stats. Identities = 23/68 (33%), Positives = 33/68 (48%) Frame = +1 Query: 9 GKPCSGKDTFINLLLTKRRQLELFLYLFNNLIHKNESNVRLREKIFFIQLIKYYYRIEKR 68 G P SGK + +N++ K L + L+ NNL N LR KI+ +I Y K Sbjct: 691 GPPGSGKSSLVNVIKNKTNNLFISLFHLNNL------NNELR-KIYDQSVINY-----KL 834 Query: 69 RRKTSIFC 76 +K +I C Sbjct: 835 SKKRTILC 858 >Plasmodium_knowlesi_strain_H|PK4.chr06|PKH_060360|Annotation|Plasmo dium_knowlesi_Sanger|(protein coding) hypothetical protein, conserved in Plasmodium species Length = 2967 Score = 27.7 bits (60), Expect = 2.9, Method: Composition-based stats. Identities = 25/109 (22%), Positives = 44/109 (40%), Gaps = 4/109 (3%) Frame = +1 Query: 167 CYLYRHKDEWMYHYHSYLRRNKIK--TFNVSLDFIEKQLYRGHLPEVTSLNPSKIVQRNL 224 C L+ ++YH+ + R +++ N ++ ++ Y L + N K ++N+ Sbjct: 775 CRLWWEDKPFIYHWERGMNRKEVQRCVHNFCINIAKEDFYLARLERLAEDNLFKEKEKNI 954 Query: 225 H--LLLNRTYTVYGREKHYRGVYYPIVKKTTIKGGHVMLIRALNHKKEK 271 L TY Y + Y +KG LI+ NH KEK Sbjct: 955 QKIQLKGNTYAYYKICERYEKD-----SLGDVKGNEEYLIKRKNHVKEK 1086 >Plasmodium_knowlesi_strain_H|chr14|PKH_145930|Annotation|Plasmodium_k nowlesi_Sanger|(protein coding) queuine trna-ribosyltransferase, putative Length = 2118 Score = 27.7 bits (60), Expect = 3.2, Method: Composition-based stats. Identities = 17/57 (29%), Positives = 31/57 (54%) Frame = +1 Query: 146 VQRKPLRRTTLTNQSDDFHLICYLYRHKDEWMYHYHSYLRRNKIKTFNVSLDFIEKQ 202 ++R+ R+ ++S +++ C L K M +YHSYL K +NV +IE++ Sbjct: 1285 IEREYTERSM--HRSHRWYIRCLLEFKKAMSMPNYHSYLNELHNKKYNVKHKWIERE 1449 >Plasmodium_knowlesi_strain_H|chr02|PKH_021190|Annotation|Plasmodium_k nowlesi_Sanger|(protein coding) hypothetical protein, conserved in Apicomplexan species Length = 2106 Score = 26.9 bits (58), Expect = 5.8, Method: Composition-based stats. Identities = 34/138 (24%), Positives = 56/138 (40%), Gaps = 14/138 (10%) Frame = +1 Query: 131 PRGITHQLYPTLRKVV------QRKPLRRTTLTNQ--SDDFHL------ICYLYRHKDEW 176 PR T+ +Y T+R + K R T Q S F L + Y+ + D + Sbjct: 1705 PRIFTYNVYRTIRPKLLYLIRHMNKTFRDTLSFPQYFSYSFRLRIIPRHVAYMNIYYDNY 1884 Query: 177 MYHYHSYLRRNKIKTFNVSLDFIEKQLYRGHLPEVTSLNPSKIVQRNLHLLLNRTYTVYG 236 + +Y LR + FN + + +Y+ +P + NL +LL + + Sbjct: 1885 ISYYKELLRTHNYDDFNRKFN---ELVYKPDIPPI-----------NLKMLLQTSNKDF- 2019 Query: 237 REKHYRGVYYPIVKKTTI 254 KHY+ YY VK T + Sbjct: 2020 -MKHYKISYYDFVKSTQV 2070 >Plasmodium_knowlesi_strain_H|PK4.chr08|PKH_083220|Annotation|Plasmo dium_knowlesi_Sanger|(protein coding) hypothetical protein, conserved in Plasmodium species Length = 1899 Score = 26.2 bits (56), Expect = 8.6, Method: Composition-based stats. Identities = 25/86 (29%), Positives = 36/86 (41%), Gaps = 5/86 (5%) Frame = +1 Query: 193 NVSLDFIEKQLYRGHLPEVTSLNPSKIVQ-----RNLHLLLNRTYTVYGREKHYRGVYYP 247 N+ L E Q R P+ ++P K + RN+ + T YGR KH R Y Sbjct: 85 NLVLRSAEGQAIRA--PQRYYIHPRKAKRTERRGRNVPPCYHNTLRNYGRNKHTRLFYRS 258 Query: 248 IVKKTTIKGGHVMLIRALNHKKEKGP 273 +K + G++ L KK K P Sbjct: 259 KKEKDIQESGYMYPFDHLEKKKTKFP 336 >Plasmodium_knowlesi_strain_H|PK4.chr07|PKH_070230|Annotation|Plasmodi um_knowlesi_Sanger|(protein coding) hypothetical protein, conserved in Plasmodium species Length = 6249 Score = 26.2 bits (56), Expect = 8.9, Method: Composition-based stats. Identities = 17/62 (27%), Positives = 30/62 (48%) Frame = +1 Query: 28 QLELFLYLFNNLIHKNESNVRLREKIFFIQLIKYYYRIEKRRRKTSIFCGHGGETDQKVS 87 +L FL+LF KN+ +R K +F+ YY +K+ + C H E D++ Sbjct: 5416 KLHDFLFLF-----KNKVKMRKDTKCYFLM---YYVLYKKKLFHNNKLCKHKNENDEEYH 5571 Query: 88 FR 89 ++ Sbjct: 5572 YK 5577 >Plasmodium_knowlesi_strain_H|PK4.chr09|PKH_093260|Annotation|Plasmodi um_knowlesi_Sanger|(protein coding) dna-directed rna polymerase, beta subunit, putative Length = 4620 Score = 26.2 bits (56), Expect = 10.0, Method: Composition-based stats. Identities = 11/20 (55%), Positives = 13/20 (65%) Frame = +1 Query: 226 LLLNRTYTVYGREKHYRGVY 245 LLLN+ Y YG E Y G+Y Sbjct: 4066 LLLNKGYDYYGTELLYSGIY 4125 Database: cds.fa Posted date: Mar 10, 2008 1:57 PM Number of letters in database: 11,140,562 Number of sequences in database: 5106 Lambda K H 0.324 0.140 0.426 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 5106 Number of Hits to DB: 4,618,567 Number of extensions: 87015 Number of successful extensions: 789 Number of sequences better than 10.0: 26 Number of HSP's gapped: 786 Number of HSP's successfully gapped: 26 Length of query: 289 Length of database: 3,713,520 Length adjustment: 91 Effective length of query: 198 Effective length of database: 3,248,874 Effective search space: 643277052 Effective search space used: 643277052 Neighboring words threshold: 13 Window for multiple hits: 40 X1: 15 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.6 bits) S2: 39 (19.6 bits) # TBLASTN 2.2.17 [Aug-26-2007] # Query: EMBOSS_001_1 # Database: cds.fa # Fields: Query id, Subject id, % identity, alignment length, mismatches, gap openings, q. start, q. end, s. start, s. end, e-value, bit score EMBOSS_001_1 Plasmodium_knowlesi_strain_H|chr02|PKH_020960|Annotation|Plasmodium_knowlesi_Sanger|(protein 100.00 288 0 0 1 288 1 864 2e-158 553 EMBOSS_001_1 Plasmodium_knowlesi_strain_H|PK4.chr09|PKH_090990|Annotation|Plasmodium_knowlesi_Sanger|(protein 33.82 68 45 3 9 76 691 858 1.3 28.9 EMBOSS_001_1 Plasmodium_knowlesi_strain_H|PK4.chr06|PKH_060360|Annotation|Plasmodium_knowlesi_Sanger|(protein 22.94 109 80 3 167 271 775 1086 2.9 27.7 EMBOSS_001_1 Plasmodium_knowlesi_strain_H|chr14|PKH_145930|Annotation|Plasmodium_knowlesi_Sanger|(protein 29.82 57 40 1 146 202 1285 1449 3.2 27.7 EMBOSS_001_1 Plasmodium_knowlesi_strain_H|chr02|PKH_021190|Annotation|Plasmodium_knowlesi_Sanger|(protein 24.64 138 90 6 131 254 1705 2070 5.8 26.9 EMBOSS_001_1 Plasmodium_knowlesi_strain_H|PK4.chr08|PKH_083220|Annotation|Plasmodium_knowlesi_Sanger|(protein 29.07 86 56 2 193 273 85 336 8.6 26.2 EMBOSS_001_1 Plasmodium_knowlesi_strain_H|PK4.chr07|PKH_070230|Annotation|Plasmodium_knowlesi_Sanger|(protein 27.42 62 45 2 28 89 5416 5577 8.9 26.2 EMBOSS_001_1 Plasmodium_knowlesi_strain_H|PK4.chr09|PKH_093260|Annotation|Plasmodium_knowlesi_Sanger|(protein 55.00 20 9 0 226 245 4066 4125 10.0 26.2