GPx7

TBLASTN 2.2.17 [Aug-26-2007]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,  
Eugene V. Koonin, and Stephen F. Altschul (2001), 
"Improving the accuracy of PSI-BLAST protein database searches with 
composition-based statistics and other refinements",  Nucleic Acids Res. 29:2994-3005.

Query= Gpx7_prot
         (193 letters)

Database: cds.fa 
           5106 sequences; 11,140,562 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_131090|Annotation|Pla...   337   7e-94
Plasmodium_knowlesi_strain_H|PK4.chr12.pseudo|PKH_125860|Annotat...   332   4e-92
Plasmodium_knowlesi_strain_H|chr04|PKH_041200|Annotation|Plasmod...    28   0.99 
Plasmodium_knowlesi_strain_H|PK4.chr11.pseudo.embl|PKH_112660|An...    28   1.5  
Plasmodium_knowlesi_strain_H|PK4.chr09|PKH_093690|Annotation|Pla...    27   3.1  
Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_130060|Annotation|Pla...    25   8.9  
Plasmodium_knowlesi_strain_H|PK4.chr09|PKH_090350|Annotation|Pla...    25   9.6  

>Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_131090|Annotation|Plasmo
           dium_knowlesi_Sanger|(protein coding) glutathione
           peroxidase, putative
          Length = 603

 Score =  337 bits (865), Expect = 7e-94,   Method: Composition-based stats.
 Identities = 176/184 (95%), Positives = 177/184 (96%), Gaps = 7/184 (3%)
 Frame = +1

Query: 17  QRNPANMFSFFKKITVSKGELRPSIYDYHVKKLDGTTVPMSTYKNKVLLIVNSASKCGLT 76
           QRNPANMFSFFKKITVSKGELRPSIYDYHVKKLDGTTVPMSTYKNKVLLIVNSASKCGLT
Sbjct: 49  QRNPANMFSFFKKITVSKGELRPSIYDYHVKKLDGTTVPMSTYKNKVLLIVNSASKCGLT 228

Query: 77  RKHVDQLNQLHDRLNEQGLEILAFPTSQFLNQEFPNTCDISSFNDKNKIKYNFFAPIEVN 136
           RKHVDQLNQLHDRLNEQGLEILAFPTSQFLNQEFPNTCDISSFNDKNKIKYNFFAPIEVN
Sbjct: 229 RKHVDQLNQLHDRLNEQGLEILAFPTSQFLNQEFPNTCDISSFNDKNKIKYNFFAPIEVN 408

Query: 137 GENTHELFKFLKANCDS-------VENIGWNFGKFLVNRSGNVVSYFSPRTYPLEIEKNI 189
           GENTHELFKFLKANCDS       +ENIGWNFGKFLVNRSGNVVSYFSPRTYPLEIEKNI
Sbjct: 409 GENTHELFKFLKANCDSMHDKNGQLENIGWNFGKFLVNRSGNVVSYFSPRTYPLEIEKNI 588

Query: 190 TDLL 193
           TDLL
Sbjct: 589 TDLL 600


>Plasmodium_knowlesi_strain_H|PK4.chr12.pseudo|PKH_125860|Annotation
           |Plasmodium_knowlesi_Sanger|(protein coding) glutathione
           peroxidase, putative
          Length = 681

 Score =  332 bits (850), Expect = 4e-92,   Method: Composition-based stats.
 Identities = 174/184 (94%), Positives = 175/184 (95%), Gaps = 7/184 (3%)
 Frame = +1

Query: 17  QRNPANMFSFFKKITVSKGELRPSIYDYHVKKLDGTTVPMSTYKNKVLLIVNSASKCGLT 76
           QRN ANMFSFFKKITVSK ELRPSIYDYHVKKLDGTTVPMSTYKNKVLLIVNSASKCGLT
Sbjct: 49  QRNHANMFSFFKKITVSKEELRPSIYDYHVKKLDGTTVPMSTYKNKVLLIVNSASKCGLT 228

Query: 77  RKHVDQLNQLHDRLNEQGLEILAFPTSQFLNQEFPNTCDISSFNDKNKIKYNFFAPIEVN 136
           RKHVDQLNQLHDRLNEQGLEILAFPTSQFLNQEFPNTCDISSFNDKNKIKYNFFAPIEVN
Sbjct: 229 RKHVDQLNQLHDRLNEQGLEILAFPTSQFLNQEFPNTCDISSFNDKNKIKYNFFAPIEVN 408

Query: 137 GENTHELFKFLKANCDS-------VENIGWNFGKFLVNRSGNVVSYFSPRTYPLEIEKNI 189
           GENTHELFKFLKANCDS       +ENIGWNFGKFLVNRSGNVVSYFSPRTYPLEIEKNI
Sbjct: 409 GENTHELFKFLKANCDSMHDKNGQLENIGWNFGKFLVNRSGNVVSYFSPRTYPLEIEKNI 588

Query: 190 TDLL 193
           TDLL
Sbjct: 589 TDLL 600


>Plasmodium_knowlesi_strain_H|chr04|PKH_041200|Annotation|Plasmodium
           _knowlesi_Sanger|(protein coding) cysteine protease,
           putative
          Length = 2853

 Score = 28.5 bits (62), Expect = 0.99,   Method: Composition-based stats.
 Identities = 15/55 (27%), Positives = 23/55 (41%)
 Frame = +1

Query: 99  AFPTSQFLNQEFPNTCDISSFNDKNKIKYNFFAPIEVNGENTHELFKFLKANCDS 153
           +FPTS + N    N   I   N +      +++ IEV+G   H        N D+
Sbjct: 97  SFPTSAYSNHTDENKLSIDQENGRGDASPEYYSGIEVDGTIAHTPLNVPPPNADN 261


>Plasmodium_knowlesi_strain_H|PK4.chr11.pseudo.embl|PKH_112660|Annotat
            ion|Plasmodium_knowlesi_Sanger|(protein coding)
            hypothetical protein, conserved in Plasmodium species
          Length = 8178

 Score = 27.7 bits (60), Expect = 1.5,   Method: Composition-based stats.
 Identities = 28/87 (32%), Positives = 42/87 (48%), Gaps = 6/87 (6%)
 Frame = +1

Query: 13   LLILQRNPANMFSFFKKITVSKGELRPSIYDYHVKKLDGTTVPMSTYK-----NKVLLIV 67
            LL  +RN  N +    +    + E     YD HV K++     MS YK     NK+ +I+
Sbjct: 4213 LLSKKRNKNNWYMTDDECKALEEERSKEFYD-HVHKINNFL--MSNYKDDSFVNKLKVII 4383

Query: 68   NSASKCGLTRKHVDQLNQ-LHDRLNEQ 93
            NSA K     + ++ LNQ L D+L  +
Sbjct: 4384 NSADKQFKKEEVLNILNQCLRDKLTSE 4464


>Plasmodium_knowlesi_strain_H|PK4.chr09|PKH_093690|Annotation|Plasmodi
            um_knowlesi_Sanger|(protein coding) Malaria antigen,
            putative
          Length = 8139

 Score = 26.9 bits (58), Expect = 3.1,   Method: Composition-based stats.
 Identities = 9/34 (26%), Positives = 21/34 (61%)
 Frame = +1

Query: 96   EILAFPTSQFLNQEFPNTCDISSFNDKNKIKYNF 129
            E+  FP+ + ++ +  N C + ++ D+ K +Y+F
Sbjct: 3409 ELKKFPSIEEISHDLENLCFVVNYKDEEKKRYSF 3510


>Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_130060|Annotation|Plasmo
           dium_knowlesi_Sanger|(protein coding) hypothetical
           protein, conserved in Apicomplexan species
          Length = 618

 Score = 25.4 bits (54), Expect = 8.9,   Method: Composition-based stats.
 Identities = 12/26 (46%), Positives = 16/26 (61%)
 Frame = -2

Query: 5   LFLALLFVLLILQRNPANMFSFFKKI 30
           LF A  F+LLIL  +P +  SFF  +
Sbjct: 110 LFFACFFILLILPCSPRSSLSFFSPL 33


>Plasmodium_knowlesi_strain_H|PK4.chr09|PKH_090350|Annotation|Plasmo
           dium_knowlesi_Sanger|(protein coding) 5'-3'
           exoribonuclease, putative
          Length = 1947

 Score = 25.4 bits (54), Expect = 9.6,   Method: Composition-based stats.
 Identities = 9/22 (40%), Positives = 15/22 (68%)
 Frame = +1

Query: 130 FAPIEVNGENTHELFKFLKANC 151
           F+  +V GE  H++ KF++ NC
Sbjct: 646 FSGSDVPGEGEHKILKFIRENC 711


  Database: cds.fa
    Posted date:  Mar 6, 2008  4:53 PM
  Number of letters in database: 11,140,562
  Number of sequences in database:  5106
  
Lambda     K      H
   0.323    0.139    0.411 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 5106
Number of Hits to DB: 2,862,728
Number of extensions: 47246
Number of successful extensions: 315
Number of sequences better than 10.0: 21
Number of HSP's gapped: 313
Number of HSP's successfully gapped: 22
Length of query: 193
Length of database: 3,713,520
Length adjustment: 87
Effective length of query: 106
Effective length of database: 3,269,298
Effective search space: 346545588
Effective search space used: 346545588
Neighboring words threshold: 13
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 37 (18.9 bits)



GPx8

TBLASTN 2.2.17 [Aug-26-2007]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,  
Eugene V. Koonin, and Stephen F. Altschul (2001), 
"Improving the accuracy of PSI-BLAST protein database searches with 
composition-based statistics and other refinements",  Nucleic Acids Res. 29:2994-3005.

Query= GPx8_prot
         (218 letters)

Database: cds.fasta 
           5106 sequences; 11,140,562 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

Plasmodium_knowlesi_strain_H|PK4.chr12.pseudo|PKH_125860|Annotat...   349   2e-97
Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_131090|Annotation|Pla...   318   7e-88
Plasmodium_knowlesi_strain_H|chr04|PKH_041200|Annotation|Plasmod...    28   2.1  

>Plasmodium_knowlesi_strain_H|PK4.chr12.pseudo|PKH_125860|Annotation
           |Plasmodium_knowlesi_Sanger|(protein coding) glutathione
           peroxidase, putative
          Length = 681

 Score =  349 bits (896), Expect = 2e-97,   Method: Composition-based stats.
 Identities = 192/201 (95%), Positives = 193/201 (96%), Gaps = 8/201 (3%)
 Frame = +1

Query: 17  QRNHANMFSFFKKITVSKEELRPSIYDYHVKKLDGTTVPMSTYKNKVLLIVNSASK-GLT 75
           QRNHANMFSFFKKITVSKEELRPSIYDYHVKKLDGTTVPMSTYKNKVLLIVNSASK GLT
Sbjct: 49  QRNHANMFSFFKKITVSKEELRPSIYDYHVKKLDGTTVPMSTYKNKVLLIVNSASKCGLT 228

Query: 76  RKHVDQLNQLHDRLNEQGLEILAFPTSQFLNQEFPNTCDISSFNDKNKIKYNFFAPIEVN 135
           RKHVDQLNQLHDRLNEQGLEILAFPTSQFLNQEFPNTCDISSFNDKNKIKYNFFAPIEVN
Sbjct: 229 RKHVDQLNQLHDRLNEQGLEILAFPTSQFLNQEFPNTCDISSFNDKNKIKYNFFAPIEVN 408

Query: 136 GENTHELFKFLKANCDS-------VENIGWNFGKFLVNRSGNVVSYFSPRTYPLEIEKNI 188
           GENTHELFKFLKANCDS       +ENIGWNFGKFLVNRSGNVVSYFSPRTYPLEIEKNI
Sbjct: 409 GENTHELFKFLKANCDSMHDKNGQLENIGWNFGKFLVNRSGNVVSYFSPRTYPLEIEKNI 588

Query: 189 TDLLEKKTKTELSVNQDGQIE 209
           TDLLEKKTKTELSVNQDGQIE
Sbjct: 589 TDLLEKKTKTELSVNQDGQIE 651


>Plasmodium_knowlesi_strain_H|PK4.chr13|PKH_131090|Annotation|Plasmo
           dium_knowlesi_Sanger|(protein coding) glutathione
           peroxidase, putative
          Length = 603

 Score =  318 bits (814), Expect = 7e-88,   Method: Composition-based stats.
 Identities = 173/184 (94%), Positives = 174/184 (94%), Gaps = 8/184 (4%)
 Frame = +1

Query: 17  QRNHANMFSFFKKITVSKEELRPSIYDYHVKKLDGTTVPMSTYKNKVLLIVNSASK-GLT 75
           QRN ANMFSFFKKITVSK ELRPSIYDYHVKKLDGTTVPMSTYKNKVLLIVNSASK GLT
Sbjct: 49  QRNPANMFSFFKKITVSKGELRPSIYDYHVKKLDGTTVPMSTYKNKVLLIVNSASKCGLT 228

Query: 76  RKHVDQLNQLHDRLNEQGLEILAFPTSQFLNQEFPNTCDISSFNDKNKIKYNFFAPIEVN 135
           RKHVDQLNQLHDRLNEQGLEILAFPTSQFLNQEFPNTCDISSFNDKNKIKYNFFAPIEVN
Sbjct: 229 RKHVDQLNQLHDRLNEQGLEILAFPTSQFLNQEFPNTCDISSFNDKNKIKYNFFAPIEVN 408

Query: 136 GENTHELFKFLKANCDS-------VENIGWNFGKFLVNRSGNVVSYFSPRTYPLEIEKNI 188
           GENTHELFKFLKANCDS       +ENIGWNFGKFLVNRSGNVVSYFSPRTYPLEIEKNI
Sbjct: 409 GENTHELFKFLKANCDSMHDKNGQLENIGWNFGKFLVNRSGNVVSYFSPRTYPLEIEKNI 588

Query: 189 TDLL 192
           TDLL
Sbjct: 589 TDLL 600


>Plasmodium_knowlesi_strain_H|chr04|PKH_041200|Annotation|Plasmodium
           _knowlesi_Sanger|(protein coding) cysteine protease,
           putative
          Length = 2853

 Score = 27.7 bits (60), Expect = 2.1,   Method: Composition-based stats.
 Identities = 15/55 (27%), Positives = 23/55 (41%)
 Frame = +1

Query: 98  AFPTSQFLNQEFPNTCDISSFNDKNKIKYNFFAPIEVNGENTHELFKFLKANCDS 152
           +FPTS + N    N   I   N +      +++ IEV+G   H        N D+
Sbjct: 97  SFPTSAYSNHTDENKLSIDQENGRGDASPEYYSGIEVDGTIAHTPLNVPPPNADN 261


  Database: cds.fasta
    Posted date:  Mar 6, 2008  6:01 PM
  Number of letters in database: 11,140,562
  Number of sequences in database:  5106
  
Lambda     K      H
   0.316    0.133    0.383 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 5106
Number of Hits to DB: 2,588,000
Number of extensions: 40384
Number of successful extensions: 246
Number of sequences better than 10.0: 13
Number of HSP's gapped: 242
Number of HSP's successfully gapped: 13
Length of query: 218
Length of database: 3,713,520
Length adjustment: 88
Effective length of query: 130
Effective length of database: 3,264,192
Effective search space: 424344960
Effective search space used: 424344960
Neighboring words threshold: 13
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 38 (19.2 bits)