TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= slap_pknowlesi_definitiva
(424 letters)
Database: cds.fa
5106 sequences; 11,140,562 total letters
Searching..................................................done
                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value
Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_130360|Annotation|Pla...    857   0.0
Plasmodium_knowlesi_strain_H|chr14|PKH_144500|Annotation|Plasmod...     33   0.091
Plasmodium_knowlesi_strain_H|chr04|PKH_041270|Annotation|Plasmod...     31   0.72
Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_131160|Annotation|Pla...     30   0.89
Plasmodium_knowlesi_strain_H|PK4.chr01|PKH_010570|Annotation|Pla...     29   1.8
Plasmodium_knowlesi_strain_H|chr14|PKH_141340|Annotation|Plasmod...     29   2.7
Plasmodium_knowlesi_strain_H|PK4.chr03|PKH_030340|Annotation|Pla...     28   5.3
Plasmodium_knowlesi_strain_H|chr14|PKH_143980|Annotation|Plasmod...     27   6.3
>Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_130360|Annotation|Plasmo
dium_knowlesi_Sanger|(protein coding) PLP-dependent
aminotransferase, putative
Length = 1728
Score = 857 bits (2213), Expect = 0.0, Method: Composition-based stats.
Identities = 424/424 (100%), Positives = 424/424 (100%)
Frame = +1
Query: 1 MCLSTCLLYTKKEREKSEYVIVSRIDHKTCYKCIDFCALKYLVVDMVYKDEELFTNLNEI 60
MCLSTCLLYTKKEREKSEYVIVSRIDHKTCYKCIDFCALKYLVVDMVYKDEELFTNLNEI
Sbjct: 454 MCLSTCLLYTKKEREKSEYVIVSRIDHKTCYKCIDFCALKYLVVDMVYKDEELFTNLNEI 633
Query: 61 ERLIQTYGEKICCVMSVTSSYAPRNSDDIVKIAHICKRYNVPHIINNAFGLQCNYLCKEI 120
ERLIQTYGEKICCVMSVTSSYAPRNSDDIVKIAHICKRYNVPHIINNAFGLQCNYLCKEI
Sbjct: 634 ERLIQTYGEKICCVMSVTSSYAPRNSDDIVKIAHICKRYNVPHIINNAFGLQCNYLCKEI 813
Query: 121 QKCFESKGRVDFVVQSCDKNFLVPVNGGIVFSSDKKKMKELKKHYPGRTPVHAYLDLFIT 180
QKCFESKGRVDFVVQSCDKNFLVPVNGGIVFSSDKKKMKELKKHYPGRTPVHAYLDLFIT
Sbjct: 814 QKCFESKGRVDFVVQSCDKNFLVPVNGGIVFSSDKKKMKELKKHYPGRTPVHAYLDLFIT 993
Query: 181 LLELGKKKILNLRKEREENFSWLQEKVSTLCSKYNLTLIKASKNKISMAINLNELYNICH 240
LLELGKKKILNLRKEREENFSWLQEKVSTLCSKYNLTLIKASKNKISMAINLNELYNICH
Sbjct: 994 LLELGKKKILNLRKEREENFSWLQEKVSTLCSKYNLTLIKASKNKISMAINLNELYNICH 1173
Query: 241 ILNPKSITLLGSLLFYRNVTGHRVICSPLLIRNGGVAQRRDQQIQTNVEPAFPIHHGAVD 300
ILNPKSITLLGSLLFYRNVTGHRVICSPLLIRNGGVAQRRDQQIQTNVEPAFPIHHGAVD
Sbjct: 1174 ILNPKSITLLGSLLFYRNVTGHRVICSPLLIRNGGVAQRRDQQIQTNVEPAFPIHHGAVD 1353
Query: 301 PNEVPTIEQGAHTDGEFFPNRGKKSEDAHQIRSSDNCNEECVGKEGSQINLVKNENMNDA 360
PNEVPTIEQGAHTDGEFFPNRGKKSEDAHQIRSSDNCNEECVGKEGSQINLVKNENMNDA
Sbjct: 1354 PNEVPTIEQGAHTDGEFFPNRGKKSEDAHQIRSSDNCNEECVGKEGSQINLVKNENMNDA 1533
Query: 361 RIHGKGLTIGNHTFEHFGCSYDSYPFSYIAFSCVIGIEREELQSFVAKLDDAIGCFIRRF 420
RIHGKGLTIGNHTFEHFGCSYDSYPFSYIAFSCVIGIEREELQSFVAKLDDAIGCFIRRF
Sbjct: 1534 RIHGKGLTIGNHTFEHFGCSYDSYPFSYIAFSCVIGIEREELQSFVAKLDDAIGCFIRRF 1713
Query: 421 GRGA 424
GRGA
Sbjct: 1714 GRGA 1725
>Plasmodium_knowlesi_strain_H|chr14|PKH_144500|Annotation|Plasmodium_k
nowlesi_Sanger|(protein coding) hypothetical protein,
conserved in Plasmodium species
Length = 2286
Score = 33.5 bits (75), Expect = 0.091, Method: Composition-based stats.
Identities = 29/107 (27%), Positives = 51/107 (47%), Gaps = 4/107 (3%)
Frame = +1
Query: 61 ERLIQTYGEKICCVMSVTSSYAPRNSDDIVKIAHICKRYNVPHIINNAFGLQCNYLCKEI 120
E + Q Y EK C ++ R + IV +A KR + I NNA+ NY+ +I
Sbjct: 1744 EMVEQCYPEKTCAQKQPQKRWSEREKNKIVDLA---KRGD---IFNNAYVNLLNYIESDI 1905
Query: 121 QKCFESKG----RVDFVVQSCDKNFLVPVNGGIVFSSDKKKMKELKK 163
+ C + +++S D +F++P+ + S+DK + + KK
Sbjct: 1906 KNCSPDEALNTLLQLHLLRSVDNHFVLPLISKLCNSTDKLRNEHKKK 2046
>Plasmodium_knowlesi_strain_H|chr04|PKH_041270|Annotation|Plasmodium
_knowlesi_Sanger|(protein coding) cysteine protease,
putative
Length = 2118
Score = 30.8 bits (68), Expect = 0.72, Method: Composition-based stats.
Identities = 45/174 (25%), Positives = 71/174 (40%), Gaps = 22/174 (12%)
Frame = +1
Query: 92 IAHICKRYNVPHIINNAFG-LQCNYLCKEIQKCFESKGRVDFVVQSCDKNF--LVP---- 144
+AH+ KR +II N FG L C I+ CF S C++ + + P
Sbjct: 43 VAHLVKRVK-SNIIINCFGTLHCKICHIAIRNCFLSGTSNLTKCIECEEKYYNIQPCTHH 219
Query: 145 VNGGIVFSSDKKKMKELKKH-YPGRTPVHAYLDLFITLLELGKKKILNLRKEREENFSWL 203
+ S DK E+K H Y V DL +++L ++ N E E+
Sbjct: 220 TENFLQISRDKGAFVEMKNHDYLTEAKVD---DLISEIVKLSLERQKNAVPETEQTKQDF 390
Query: 204 QEKVSTLCSKYNLT--------LIKASKNKISMAIN------LNELYNICHILN 243
Q+K+ LC N + +A+ ++ IN +NE N+ HI+N
Sbjct: 391 QKKIMQLCLYSNFSDHYENAKKHTQANAEEVEKHINKIVIMYMNESNNMEHIIN 552
>Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_131160|Annotation|Plasmo
dium_knowlesi_Sanger|(protein coding) acetyl CoA
synthetase, putative
Length = 2988
Score = 30.4 bits (67), Expect = 0.89, Method: Composition-based stats.
Identities = 20/92 (21%), Positives = 38/92 (41%), Gaps = 2/92 (2%)
Frame = +1
Query: 215 NLTLIKASKNKISMAI--NLNELYNICHILNPKSITLLGSLLFYRNVTGHRVICSPLLIR 272
N + AS K+ + + N EL+N+C ILN + N G V+ + R
Sbjct: 883 NYEIFYASMKKLGVLVVDNFEELFNMCKILNLSKYPETNEVCVVTNAGGPGVLLVDNITR 1062
Query: 273 NGGVAQRRDQQIQTNVEPAFPIHHGAVDPNEV 304
N G + + ++ ++ P +P ++
Sbjct: 1063 NDGNLSKLNDNLKKKLDAFLPPSWSKANPVDI 1158
>Plasmodium_knowlesi_strain_H|PK4.chr01|PKH_010570|Annotation|Plasmo
dium_knowlesi_Sanger|(protein coding) recticulocyte
binding protein, putative
Length = 966
Score = 29.3 bits (64), Expect = 1.8, Method: Composition-based stats.
Identities = 24/69 (34%), Positives = 35/69 (50%), Gaps = 1/69 (1%)
Frame = +2
Query: 212 SKYNLTLIKASKNKISMAINLNELYNICHILNPKSITLLGSLLFYRNVTGHRVIC-SPLL 270
S + +T + +++NK AIN+ Y I P+SI L L + + G R +C PLL
Sbjct: 686 SGFTITFLWSARNK---AINIVT*YRIL----PRSIAPLRIKL*MKRLMGSRCLCMPPLL 844
Query: 271 IRNGGVAQR 279
GV QR
Sbjct: 845 PVRWGVFQR 871
>Plasmodium_knowlesi_strain_H|chr14|PKH_141340|Annotation|Plasmodium
_knowlesi_Sanger|(protein coding) hypothetical protein,
conserved in Apicomplexan species
Length = 1431
Score = 28.9 bits (63), Expect = 2.7, Method: Composition-based stats.
Identities = 19/46 (41%), Positives = 27/46 (58%)
Frame = +1
Query: 164 HYPGRTPVHAYLDLFITLLELGKKKILNLRKEREENFSWLQEKVST 209
H GR V ++FIT+ L KK+I L + REE S Q+K+S+
Sbjct: 466 HRGGRELVK---EIFITIKTLIKKRIDVLNEHREEYLSGSQDKLSS 594
>Plasmodium_knowlesi_strain_H|PK4.chr03|PKH_030340|Annotation|Plasmodi
um_knowlesi_Sanger|(protein coding) RNA helicase,
putative
Length = 3489
Score = 27.7 bits (60), Expect = 5.3, Method: Composition-based stats.
Identities = 12/24 (50%), Positives = 18/24 (75%)
Frame = +1
Query: 308 EQGAHTDGEFFPNRGKKSEDAHQI 331
E+ AHT GEF P++G+K DA+ +
Sbjct: 2929 EKDAHT-GEFSPHQGRKDHDAYNL 2997
>Plasmodium_knowlesi_strain_H|chr14|PKH_143980|Annotation|Plasmodium_k
nowlesi_Sanger|(protein coding) dna gyrase a-subunit,
putative
Length = 3522
Score = 27.3 bits (59), Expect = 6.3, Method: Composition-based stats.
Identities = 12/46 (26%), Positives = 28/46 (60%)
Frame = +1
Query: 19 YVIVSRIDHKTCYKCIDFCALKYLVVDMVYKDEELFTNLNEIERLI 64
+++ +I K +DFC K ++ + ++ +L +N+++I+RLI
Sbjct: 1852 HILSMKIQKLVNIKNVDFCTHKEQILARMEQNRDLISNVDKIKRLI 1989
Database: cds.fa
Posted date: Mar 10, 2008 1:57 PM
Number of letters in database: 11,140,562
Number of sequences in database: 5106
Lambda K H
0.322 0.138 0.419
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 5106
Number of Hits to DB: 6,559,429
Number of extensions: 118833
Number of successful extensions: 809
Number of sequences better than 10.0: 28
Number of HSP's gapped: 805
Number of HSP's successfully gapped: 31
Length of query: 424
Length of database: 3,713,520
Length adjustment: 95
Effective length of query: 329
Effective length of database: 3,228,450
Effective search space: 1062160050
Effective search space used: 1062160050
Neighboring words threshold: 13
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 40 (20.0 bits)
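The length and score figures in the summary above are related by simple arithmetic. The following is a minimal sketch that reproduces them, assuming the standard Karlin-Altschul conversion from raw score to bit score, and assuming the length adjustment is subtracted once from the query and once per database sequence. The variable and function names are illustrative and are not part of BLAST. The sketch does not attempt to reproduce the reported E-values, which are also affected by composition-based statistics.

```python
import math

# Values copied from the report summary.
QUERY_LEN = 424            # Length of query
DB_LEN = 3_713_520         # Length of database
NUM_SEQS = 5106            # Number of Sequences
LENGTH_ADJUSTMENT = 95     # Length adjustment
GAPPED_LAMBDA = 0.267      # Gapped Lambda
GAPPED_K = 0.0410          # Gapped K

# Assumption: the adjustment is removed once from the query
# and once from every database sequence.
eff_query_len = QUERY_LEN - LENGTH_ADJUSTMENT
eff_db_len = DB_LEN - NUM_SEQS * LENGTH_ADJUSTMENT
eff_search_space = eff_query_len * eff_db_len


def bit_score(raw_score: float,
              lam: float = GAPPED_LAMBDA,
              k: float = GAPPED_K) -> float:
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


if __name__ == "__main__":
    print("Effective length of query:   ", eff_query_len)
    print("Effective length of database:", eff_db_len)
    print("Effective search space:      ", eff_search_space)
    # Raw scores appear in parentheses on the "Score =" lines.
    for raw in (2213, 75, 68, 67):
        print(f"raw {raw:>4} -> {bit_score(raw):.1f} bits")
```

Under these assumptions the three effective-length figures agree with the report, and the raw scores 75, 68 and 67 convert to values close to the 33.5, 30.8 and 30.4 bits listed for the corresponding hits.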
# TBLASTN 2.2.17 [Aug-26-2007]
# Query: slap_pknowlesi_definitiva
# Database: cds.fa
# Fields: Query id, Subject id, % identity, alignment length, mismatches, gap openings, q. start, q. end, s. start, s. end, e-value, bit score
slap_pknowlesi_definitiva Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_130360|Annotation|Plasmodium_knowlesi_Sanger|(protein 100.00 424 0 0 1 424 454 1725 0.0 857
slap_pknowlesi_definitiva Plasmodium_knowlesi_strain_H|chr14|PKH_144500|Annotation|Plasmodium_knowlesi_Sanger|(protein 27.10 107 74 3 61 163 1744 2046 0.091 33.5
slap_pknowlesi_definitiva Plasmodium_knowlesi_strain_H|chr04|PKH_041270|Annotation|Plasmodium_knowlesi_Sanger|(protein 25.86 174 107 8 92 243 43 552 0.72 30.8
slap_pknowlesi_definitiva Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_131160|Annotation|Plasmodium_knowlesi_Sanger|(protein 21.74 92 70 1 215 304 883 1158 0.89 30.4
slap_pknowlesi_definitiva Plasmodium_knowlesi_strain_H|PK4.chr01|PKH_010570|Annotation|Plasmodium_knowlesi_Sanger|(protein 34.78 69 44 3 212 279 686 871 1.8 29.3
slap_pknowlesi_definitiva Plasmodium_knowlesi_strain_H|chr14|PKH_141340|Annotation|Plasmodium_knowlesi_Sanger|(protein 41.30 46 27 1 164 209 466 594 2.7 28.9
slap_pknowlesi_definitiva Plasmodium_knowlesi_strain_H|PK4.chr03|PKH_030340|Annotation|Plasmodium_knowlesi_Sanger|(protein 50.00 24 12 1 308 331 2929 2997 5.3 27.7
slap_pknowlesi_definitiva Plasmodium_knowlesi_strain_H|chr14|PKH_143980|Annotation|Plasmodium_knowlesi_Sanger|(protein 26.09 46 34 0 19 64 1852 1989 6.3 27.3
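The tabular lines above follow the twelve-column layout named in the `# Fields:` comment. Below is a minimal sketch of a reader for them, assuming whitespace-separated columns and a subject id with no embedded spaces (as in the lines shown). `TabularHit`, `parse_tabular` and `gene_id` are hypothetical names, not part of any BLAST toolkit, and reading the third `|`-separated piece of the subject id as the gene identifier is an assumption based on the ids in this file.

```python
from typing import Iterable, List, NamedTuple


class TabularHit(NamedTuple):
    query_id: str
    subject_id: str
    pct_identity: float
    alignment_length: int
    mismatches: int
    gap_openings: int
    q_start: int
    q_end: int
    s_start: int
    s_end: int
    e_value: float
    bit_score: float


def parse_tabular(lines: Iterable[str]) -> List[TabularHit]:
    """Parse tabular BLAST lines, skipping blank lines and '#' comments."""
    hits = []
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        parts = line.split()
        if len(parts) != 12:
            raise ValueError(f"expected 12 columns, got {len(parts)}: {line!r}")
        hits.append(TabularHit(
            parts[0], parts[1], float(parts[2]),
            *(int(p) for p in parts[3:10]),
            float(parts[10]), float(parts[11]),
        ))
    return hits


def gene_id(hit: TabularHit) -> str:
    """Assumption: the gene identifier is the third '|'-separated piece."""
    return hit.subject_id.split("|")[2]


# Two lines copied from the output above.
SAMPLE = [
    "# Fields: Query id, Subject id, % identity, alignment length, mismatches, gap openings, q. start, q. end, s. start, s. end, e-value, bit score",
    "slap_pknowlesi_definitiva Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_130360|Annotation|Plasmodium_knowlesi_Sanger|(protein 100.00 424 0 0 1 424 454 1725 0.0 857",
    "slap_pknowlesi_definitiva Plasmodium_knowlesi_strain_H|chr14|PKH_144500|Annotation|Plasmodium_knowlesi_Sanger|(protein 27.10 107 74 3 61 163 1744 2046 0.091 33.5",
]

if __name__ == "__main__":
    for hit in parse_tabular(SAMPLE):
        print(gene_id(hit), hit.pct_identity, hit.e_value, hit.bit_score)
```

Filtering on `e_value` or `bit_score` after parsing is one way to separate the single full-length match from the weak hits that follow it.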
