Alineamiento de B.Bigemina contra las selenoproteínas de humano (e-value 10)
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000038_1.0 # Protein # Eukaryotic elongation factor
(eEFSec) # Homo sapiens # Complete
(596 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig3160 115 6e-26
Contig4149.1 111 7e-25
Contig4155 92 6e-19
Contig4117.1 69 8e-12
Contig4133 57 2e-08
Contig4055 52 6e-07
Contig3936.1 49 8e-06
Contig4180 45 6e-05
Contig4136 44 3e-04
Contig4166.0 41 0.001
Contig3606 41 0.002
Contig3954.0 39 0.007
Contig3973 30 0.22
Contig1952 33 0.32
Contig4098 32 1.1
Contig3632 32 1.2
Contig4154 31 1.8
Contig2705 30 4.5
Contig4163.0 30 4.5
Contig4184.3 29 5.2
Contig2251 28 8.3
>Contig3160
Length = 31744
Score = 115 bits (288), Expect = 6e-26, Method: Composition-based stats.
Identities = 90/326 (27%), Positives = 152/326 (46%), Gaps = 23/326 (7%)
Frame = -1
Query: 7 NVNVGVLGHIDSGKTALARALSTTA----------STAAFDKQPQSRERGITLDLGFSCF 56
++N+G +GHID GKT L AL++ S D P+ + RGIT++
Sbjct: 15466 HINIGTIGHIDHGKTTLTSALTSVLKAKGLAKKAHSYEEIDSAPEEKSRGITIN------ 15305
Query: 57 SVPLPARLRSSLPEFQAAPEAEPEPGEPLLQVTLVDCPGHASLIRTIIGGAQIIDLMMLV 116
+ E++++ L +DCPGHA I+ +I GA +D +LV
Sbjct: 15304 ---------TKHVEYESS----------LRHYAHIDCPGHADYIKNMITGAVQMDGAILV 15182
Query: 117 IDVTKGMQTQSAE-CLVIGQIACQKLVVVLNKIDLLPEGKRQAAIDKMTKKMQKTLENTK 175
I +T G Q+ E L+I QI + +++ LNK D + + + I+ + +++ L
Sbjct: 15181 ISLTDGPMPQTIEHLLLIKQIGIENVIIFLNKEDKIVD---KELINFVEEEVLDILNKYN 15011
Query: 176 FRGAPIIPVAA---------KPGGPEAPETEAPQGIPELIELLTSQISIPTRDPSGPFLM 226
F+ + ++ K E + I L++ + I P RD PFLM
Sbjct: 15010 FKEDCVHFISGSALEALNIVKNSNKVDFENIWIKKILNLVDCIDKYIPTPDRDLVKPFLM 14831
Query: 227 SVDHCFSIKGQGTVMTGTILSGSISLGDSVEIPALKVVK--KVKSMQMFHMPITSAMQGD 284
V+ FSI G+G V TG I G++ +GD +E+ K V +++MF+ + GD
Sbjct: 14830 PVEDSFSITGRGIVATGKIERGNVQIGDKLELLGHDSSKMTSVLNIEMFNKSLNKGESGD 14651
Query: 285 RLGICVTQFDPKLLERGLVCA-PESL 309
+GI + + ++RG + A PES+
Sbjct: 14650 NVGILLRGINKTDIKRGYILAVPESV 14573
>Contig4149.1
Length = 48505
Score = 111 bits (278), Expect = 7e-25, Method: Composition-based stats.
Identities = 90/321 (28%), Positives = 148/321 (46%), Gaps = 13/321 (4%)
Frame = -3
Query: 7 NVNVGVLGHIDSGKTALARALSTTAST---------AAFDKQPQSRERGITLDLGFSCFS 57
++N+G +GH+D GKT L AL+ + A D+ P+ R+RGIT++
Sbjct: 9458 HLNIGTIGHVDHGKTTLTAALTKVCAMDGHGEYTPYEAIDRAPEERKRGITIN------- 9300
Query: 58 VPLPARLRSSLPEFQAAPEAEPEPGEPLLQVTLVDCPGHASLIRTIIGGAQIIDLMMLVI 117
S+ E++ VDCPGHA ++ +I GA +D +LV+
Sbjct: 9299 --------STHVEYETQNR----------HYGHVDCPGHADYVKNMISGAAQMDGAILVV 9174
Query: 118 DVTKGMQTQSAE-CLVIGQIACQKLVVVLNKIDLLPEGKRQAAIDKMTKKMQKTLENTKF 176
G Q+ E L+ QI +LVV LNK+D+L + + ++ +++
Sbjct: 9173 SCVDGPMPQTKEHVLLAKQIGVPRLVVFLNKLDMLEDVELLELVELEIRELLSEY-GYDG 8997
Query: 177 RGAPIIPVAAKPGGPEAPETEAPQGIPELIELLTSQISIPTRDPSGPFLMSVDHCFSIKG 236
PII +A A E E + I EL+ + + P R P L++VD +I G
Sbjct: 8996 DATPIIKGSAMKALNSAAE-EDVKPIRELLHACDTYLLTPERKSDLPLLVAVDDVLAIPG 8820
Query: 237 QGTVMTGTILSGSISLGDSVEI-PALKVVKKVKSM--QMFHMPITSAMQGDRLGICVTQF 293
+GTV+TG + G I GD++E+ K KK M +MF ++ + GD++G+ +
Sbjct: 8819 KGTVVTGRVEQGKIRCGDAIEVCGGPKSGKKTVCMGLEMFRKSLSEGIAGDQIGVLLKGV 8640
Query: 294 DPKLLERGLVCAPESLHTVHA 314
+ERG V + H+
Sbjct: 8639 KRDEVERGFVLIQPGTYKCHS 8577
>Contig4155
Length = 58573
Score = 92.0 bits (227), Expect = 6e-19, Method: Composition-based stats.
Identities = 80/277 (28%), Positives = 134/277 (48%), Gaps = 23/277 (8%)
Frame = -3
Query: 4 RRVNVNVGVLGHIDSGKTALARALSTTASTAAFDKQPQSRERGITLDLGFSCFSV----- 58
R+ +N+G +GH+ GK+ + ALS T F + +ER IT+ LG++ +
Sbjct: 26675 RQATINIGTIGHVAHGKSTVVHALSGV-HTVRFKHE---KERNITIKLGYANAKIYKCTN 26508
Query: 59 ---PLPARLRSSLPEFQAAPEAEPEPG-----EPLLQVTLVDCPGHASLIRTIIGGAQII 110
P P +S + P P PG E V+ VDCPGH L+ T++ GA ++
Sbjct: 26507 PECPAPDCYKSYGSSKEDEPPC-PRPGCGHKMELKRHVSFVDCPGHDILMATMLNGAAVM 26331
Query: 111 DLMMLVIDVTKGM-QTQSAECLVIGQIACQKLVVVL-NKIDLLPEGKRQAAIDKMTKKMQ 168
D +L++ + Q Q++E L +I K +++L NK++L+ E + AI + ++++
Sbjct: 26330 DAALLLVAGNESCPQPQTSEHLAAVEIMRLKNIIILQNKVELIKESQ---AIQRQ-EEIK 26163
Query: 169 KTLENTKFRGAPIIPVAAKPGGPEAPETEAPQGIPELIELLTSQISIPTRDPSGPFLMSV 228
+ + T APIIP++A I + E L +Q+++P RD P M V
Sbjct: 26162 RFVSGTAADNAPIIPISAVLN----------YNIDVICEYLVTQVAVPKRDFVLPPQMIV 26013
Query: 229 DHCFSIKG--------QGTVMTGTILSGSISLGDSVE 257
F + QG V G+IL G + +GD +E
Sbjct: 26012 IRSFDVNKPGELVENLQGGVAGGSILHGVLKVGDQIE 25902
>Contig4117.1
Length = 14601
Score = 68.6 bits (166), Expect = 8e-12, Method: Composition-based stats.
Identities = 85/394 (21%), Positives = 148/394 (37%), Gaps = 46/394 (11%)
Frame = +2
Query: 1 MAGRRVNVNVGVLGHIDSGKTALARAL------------------STTASTAAF------ 36
M + ++N+ V+GH+DSGK+ L ST +F
Sbjct: 2198 MPKEKTHINLVVIGHVDSGKSTTTGHLIYKLGGIDKRTIEKFEKESTDMGKGSFKYAWVL 2377
Query: 37 DKQPQSRERGITLDLGFSCFSVPLPARLRSSLPEFQAAPEAEPEPGEPLLQVTLVDCPGH 96
DK RERGIT+D+ +L +F+ T++D PGH
Sbjct: 2378 DKLKSERERGITIDI---------------TLWKFETGK----------YYYTVIDAPGH 2482
Query: 97 ASLIRTIIGGAQIIDLMMLVIDVTKG-------MQTQSAE-CLVIGQIACQKLVVVLNKI 148
I+ +I G D+ MLV+ G + Q+ E L+ + ++++ +NK+
Sbjct: 2483 RDFIKNMITGTSQADVAMLVVPAEAGGFEAAFSKEGQTREHALLAFTLGVKQIICAINKM 2662
Query: 149 DLLPEGKRQAAIDKMTKKMQKTLENTKFRGAPIIPVAAKPGGPEAPETEAPQGIP----- 203
D ++ ++ K++Q L+ + +P A G E +P
Sbjct: 2663 DKC--DYKEDRYSEIQKEVQGYLKKVGY-NIEKVPFVAISGFMGDNMVERSTNMPWYKGK 2833
Query: 204 ELIELLTSQISIPTRDPSGPFLMSVDHCFSIKGQGTVMTGTILSGSISLGDSVEIPALKV 263
L+E L + P R P + + + I G GTV G + +G + G + +
Sbjct: 2834 TLVEAL-DMMEPPKRPVDKPLRLPLQGVYKIGGIGTVPVGRVETGQLKAGMVLTFAPNPI 3010
Query: 264 VKKVKSMQMFHMPITSAMQGDRLGICVTQFDPKLLERGLVCAPESLHTVHAALISVEKIP 323
+ KS++M H I A GD +G V + G V + AA+ ++
Sbjct: 3011 TTECKSVEMHHEVIDVASPGDNVGFNVKNVSTSDIRTGHVASDSKNDPAKAAVSFTAQVI 3190
Query: 324 YFRGPLQTKAKF---------HITVGHETVMGRL 348
P KA + HI+ + + R+
Sbjct: 3191 ILNHPGTIKAGYSPVVDCHTAHISCKFDEITSRM 3292
Score = 47.8 bits (112), Expect = 1e-05, Method: Composition-based stats.
Identities = 43/181 (23%), Positives = 72/181 (39%), Gaps = 32/181 (17%)
Frame = -3
Query: 1 MAGRRVNVNVGVLGHIDSGKTALARAL------------------STTASTAAF------ 36
M + ++N+ V+GH+DSGK+ L ST +F
Sbjct: 772 MPKEKTHINLVVIGHVDSGKSTTTGHLIYKLGGIDKRTIEKFEKESTDMGKGSFKYAWVL 593
Query: 37 DKQPQSRERGITLDLGFSCFSVPLPARLRSSLPEFQAAPEAEPEPGEPLLQVTLVDCPGH 96
DK RERGIT+D+ +L +F+ T++D PGH
Sbjct: 592 DKLKSERERGITIDI---------------TLWKFETGK----------YYYTVIDAPGH 488
Query: 97 ASLIRTIIGGAQIIDLMMLVIDVTKG-------MQTQSAE-CLVIGQIACQKLVVVLNKI 148
I+ +I G D+ MLV+ G + Q+ E L+ + ++++ +NK+
Sbjct: 487 RDFIKNMITGTSQADVAMLVVPAEAGGFEAAFSKEGQTREHALLAFTLGVKQIICAINKM 308
Query: 149 D 149
D
Sbjct: 307 D 305
>Contig4133
Length = 40823
Score = 57.4 bits (137), Expect = 2e-08, Method: Composition-based stats.
Identities = 79/296 (26%), Positives = 122/296 (41%), Gaps = 37/296 (12%)
Frame = -2
Query: 12 VLGHIDSGKTALARALSTTASTAAFDKQPQSRERGITLDLGFSCFSVPLPARLRSSL-PE 70
VLGH+D+GKT L + + A GIT +G + F + + + PE
Sbjct: 16177 VLGHVDTGKTKLLDKIRHSNVQNA-------EAGGITQQIGATFFPKEMLNKHCDMVNPE 16019
Query: 71 FQAAPEAEPEPGEPLLQVTLVDCPGHASLIRTIIGGAQIIDLMMLVIDVTKGMQTQSAEC 130
F+ PG + ++D PGH S G+ + D+ +LV+D+ G++ Q+ E
Sbjct: 16018 FKVK-----SPG-----LLIIDTPGHESFNNLRARGSSLCDIAVLVVDIMHGLEPQTIES 15869
Query: 131 LVIGQIACQK--LVVVLNKIDLL------PEGKRQAAIDKM-----------TKKMQKTL 171
IG + +K V+ LNKID L P DK KK+ L
Sbjct: 15868 --IGLLRARKCYFVIALNKIDRLYKWEPTPWATFHKTFDKQLDHTKAEFFERAKKIMLEL 15695
Query: 172 E----NTKF--------RGAPIIPVAAKPGGPEAPETEAPQGIPELIEL---LTSQISIP 216
N++F R I P +A G +GI +LI L LT +I +
Sbjct: 15694 SEQGLNSEFYWENDDVRRNISICPTSAITG----------EGISDLICLMLQLTQKIMVK 15545
Query: 217 TRDPSGPFLMSVDHCFSIKGQGTVMTGTILSGSISLGDSVEIPALK--VVKKVKSM 270
F SV I+G GT + +L G+I GD V + L +V ++++
Sbjct: 15544 NITHKEEFHCSVLEVKVIEGLGTTVDVILLGGTIREGDKVVLCGLSGPIVTTIRTL 15377
>Contig4055
Length = 39951
Score = 52.4 bits (124), Expect = 6e-07, Method: Composition-based stats.
Identities = 80/376 (21%), Positives = 151/376 (40%), Gaps = 35/376 (9%)
Frame = +3
Query: 5 RVNVNVGVLGHIDSGK-TALARALSTTA-----STAAFDKQPQSRERGITLDLGFSCFSV 58
R ++N+ +GHID+GK T L T + ++++ + R R + F F +
Sbjct: 27363 RQHLNIVFIGHIDAGKSTTCGNILYLTGYVDERTIEKYEREAKDRNR----ESWFLAFIM 27530
Query: 59 PLPARLRSSLPEFQAAPEAEPEPGEPLLQVTLVDCPGHASLIRTIIGGAQIIDLMMLVID 118
R + +A E + T++D PGH + + +I GA D +L++
Sbjct: 27531 DTNEEERQKGKTVEVG-QARIETANK--RFTILDAPGHKNYVPNMISGATQADCGVLILS 27701
Query: 119 VTKGM---------QTQSAECLVIGQIACQKLVVVLNKIDLLPEGKRQAAIDKMTKKMQK 169
KG QT+ L+ + L+V +NK+D +A +++ +K++
Sbjct: 27702 ARKGEFETGFDRGGQTRE-HTLLAKTLGVSHLIVAVNKMDDHTCDWSEARYNEVVQKLRP 27878
Query: 170 TLENTKF---RGAPIIPVAAKPG-------------GPEAPETEAPQGIPELIELLTSQI 213
L+ F R +P++ G G + PQ +P L +LL I
Sbjct: 27879 FLKTCGFVEGRNLNFLPISGLVGQNLMHHVSDPSYKGYCPKASWYPQTMPTLFQLLDG-I 28055
Query: 214 SIPTRDPSGPFLMSVDHCFSIKGQGTVMTGTILSGSISLGDSVEIPALKVVKKVKSMQMF 273
D + V + + G + G + SG+++ G + + KV KV+++
Sbjct: 28056 DTAVFDDKAALRIPVTGAY--RDNGVICMGKVESGTVTSGQTCIVMPHKVKVKVQNVLFD 28229
Query: 274 HMPITSAMQGDRLGICVTQFDPKLLERG-LVC---APESLHTVHAALISVEKIPYFRGPL 329
A G+ + + + D + +G +VC P + T A++ + + R PL
Sbjct: 28230 EEEFAYAKPGENIRLRIVGLDEDQISKGAIVCDVDRPCPVVTEFIAMVQIIDLLEHR-PL 28406
Query: 330 QTKAKFHITVGHETVM 345
T F + H V+
Sbjct: 28407 ITSGYFCVMHAHSVVV 28454
>Contig3936.1
Length = 10689
Score = 48.5 bits (114), Expect = 8e-06, Method: Composition-based stats.
Identities = 40/153 (26%), Positives = 65/153 (42%), Gaps = 10/153 (6%)
Frame = +2
Query: 9 NVGVLGHIDSGKTALARALSTTASTAA---------FDKQPQSRERGITL-DLGFSCFSV 58
NV L H+D GKT L+ +L ++ + D + + R IT+ S
Sbjct: 4418 NVCFLAHVDHGKTTLSDSLISSVGIISERLSGKLRYLDNRDDEQRRMITIKSSSISLIYS 4597
Query: 59 PLPARLRSSLPEFQAAPEAEPEPGEPLLQVTLVDCPGHASLIRTIIGGAQIIDLMMLVID 118
P +R+ A+ P + + LVDCPGH + A++ D +L++D
Sbjct: 4598 PNNPTVRTG--------SAKALNDSPCI-INLVDCPGHVDFSVEVSTAARLCDGALLIVD 4750
Query: 119 VTKGMQTQSAECLVIGQIACQKLVVVLNKIDLL 151
+G+ Q+ L + +VLNKID L
Sbjct: 4751 AVEGICPQTKAVLRQAWRESVRTALVLNKIDKL 4849
>Contig4180
Length = 166360
Score = 45.4 bits (106), Expect = 6e-05, Method: Composition-based stats.
Identities = 59/227 (25%), Positives = 93/227 (40%), Gaps = 29/227 (12%)
Frame = +2
Query: 9 NVGVLGHIDSGKTALA-RALSTTASTAA-------------FDKQPQSRERGITLDLGFS 54
N+G+ HIDSGKT L R L + AA D RERGIT
Sbjct: 95402 NIGISAHIDSGKTTLTERILFYSGRIAAIHEVRGNDGVGAKMDSMDLERERGIT------ 95563
Query: 55 CFSVPLPARLRSSLPEFQAAPEAEPEPGEPLLQVTLVDCPGHASLIRTIIGGAQIIDLMM 114
++S++ F+ + E V ++D PGH + +++D +
Sbjct: 95564 ---------IQSAVTNFRWSVEGAAGESSVDYMVNIIDTPGHVDFTIEVERALRVLDGAI 95716
Query: 115 LVIDVTKGMQTQSAECLVIGQIACQKL--VVVLNKID---LLPEG-----KRQAAIDKMT 164
L+ G+Q+Q+ V Q+ K+ +V LNK+D PE +++ ID +
Sbjct: 95717 LLCCAVAGVQSQTLT--VNMQMDRYKIPRIVFLNKMDRDGADPERVIGLIRQKLNIDLLE 95890
Query: 165 KKMQKTLENTKFRG-APIIPVAA----KPGGPEAPETEAPQGIPELI 206
++ L N +F G +I A P G + E P G EL+
Sbjct: 95891 LQVPIGLAN-RFEGVVDVIDECAYHFEGPNGQHVVKKEVPDGYAELL 96028
>Contig4136
Length = 67380
Score = 43.5 bits (101), Expect = 3e-04, Method: Composition-based stats.
Identities = 67/271 (24%), Positives = 102/271 (37%), Gaps = 21/271 (7%)
Frame = +2
Query: 9 NVGVLGHIDSGKTALARAL-----STTASTAAFDKQPQSRERGITLDLGFSCFSVPLPAR 63
NV V+ H+D GKT L L + A D +ERGIT+ C V R
Sbjct: 20948 NVAVVAHVDHGKTTLVDGLLRCSGEQLTHSRAMDSHELEKERGITI-----CSKV---TR 21103
Query: 64 LRSSLPEFQAAPEAEPEPGEPLLQVTLVDCPGHASLIRTIIGGAQIIDLMMLVIDVTKGM 123
+ S F +VD PGHA + I+D + L++D+ +G
Sbjct: 21104 VEWSGHTFN-----------------IVDTPGHADFGGEVERILNIVDCVCLLVDIVEGP 21232
Query: 124 QTQSAECLVIG-QIACQKLVVVLNKIDLLPEGKRQAAIDKMTKKMQKTLENTKFR----- 177
+ Q+ L + K VVV+NK D+ K + +EN F
Sbjct: 21233 KPQTTFVLRKALENPDLKAVVVVNK------------CDRDCNKTRSDIENELFELFVDC 21376
Query: 178 GA-------PIIPVAAKPGGPEAP---ETEAPQGIPELIELLTSQISIPTRDPSGPFLMS 227
GA PI+ +AK + + G +++E L+ P F +
Sbjct: 21377 GASDEQLEFPILFASAKENWVSSSYPVDRHQVSGTSKILECLSHVAPSPKVSKHTNFTLQ 21556
Query: 228 VDHCFSIKGQGTVMTGTILSGSISLGDSVEI 258
V + ++TG I SGSI G ++ I
Sbjct: 21557 VS-LLDFEEGCALITGKINSGSIQKGATLHI 21646
>Contig4166.0
Length = 90045
Score = 41.2 bits (95), Expect = 0.001, Method: Composition-based stats.
Identities = 37/141 (26%), Positives = 57/141 (40%)
Frame = +2
Query: 10 VGVLGHIDSGKTALARALSTTASTAAFDKQPQSRERGITLDLGFSCFSVPLPARLRSSLP 69
V ++GH+D GKT L L + A E G R+ L
Sbjct: 61361 VTIMGHVDHGKTTLLDTLQRSNIAAG--------ESG----------------RITQKLG 61468
Query: 70 EFQAAPEAEPEPGEPLLQVTLVDCPGHASLIRTIIGGAQIIDLMMLVIDVTKGMQTQSAE 129
F+ E P + VD PGHA+ R G + D+++LV+ G+ Q+ E
Sbjct: 61469 AFKIQLERGP--------LVFVDTPGHAAFGRMRDRGVRCADVVILVVAADDGVMPQTRE 61624
Query: 130 CLVIGQIACQKLVVVLNKIDL 150
+ + + VV +NKID+
Sbjct: 61625 AIELIRRNDLPYVVAVNKIDI 61687
>Contig3606
Length = 7804
Score = 40.8 bits (94), Expect = 0.002, Method: Composition-based stats.
Identities = 32/143 (22%), Positives = 61/143 (42%), Gaps = 2/143 (1%)
Frame = +3
Query: 9 NVGVLGHIDSGKTALARALSTTASTAAFDKQPQSRERGITLDLGFSCFSVPLPARLRSSL 68
N+ V+ H+D GK+ L +L + A A +R D C ++ +S+
Sbjct: 3705 NMSVIAHVDHGKSTLTDSLVSKAGIIAAKNAGDARFTDTRADEQERCITI------KSTG 3866
Query: 69 PEFQAAPEAEPEPGEPLLQVTLVDCPGHASLIRTIIGGAQIIDLMMLVIDVTKGMQTQSA 128
+ + G + L+D PGH + ++ D ++V+D +G+ Q+
Sbjct: 3867 ISMYFEHDLDDGKGMQPFLINLIDSPGHVDFSSEVTAALRVTDGALVVVDTIEGVCVQTE 4046
Query: 129 ECLVIGQIACQKLVVVL--NKID 149
V+ Q +++ VL NK+D
Sbjct: 4047 T--VLRQALGERIRPVLHVNKVD 4109
>Contig3954.0
Length = 18229
Score = 38.9 bits (89), Expect = 0.007, Method: Composition-based stats.
Identities = 41/212 (19%), Positives = 88/212 (41%), Gaps = 30/212 (14%)
Frame = +1
Query: 8 VNVGVLGHIDSGKTALAR-------ALSTTASTAAF----DKQPQSRERGITLD--LGFS 54
+NV V G +D GK+ L A+ + + F D+ R RGIT+D +
Sbjct: 7465 LNVVVCGRVDVGKSTLLGHLLTLLGAVESRSLRGEFAWILDQGEDERSRGITIDPTKASA 7644
Query: 55 CFSVPLPARLRSSLPEFQAAPEAEPEPGE----PL--------LQVTLVDCPGHASLIRT 102
V P S ++ P PG P+ +++ +D PGH L+ +
Sbjct: 7645 VVDVVPPPEADGSGSRDGTVRQSAPAPGTDAVAPVPAPSTNRRVKLNFIDTPGHHELVSS 7824
Query: 103 IIGGAQIIDLMMLVIDVTKGMQTQS-----AECLVIGQIACQKLVVVLNKIDLLPEGKRQ 157
++ GA + ++++DV ++ + ++ + + V+ +NK+D E +
Sbjct: 7825 LVRGAIFAEAAVVIVDVLDFLKEDTNGYFEQHFFLLWSLGVRHFVICVNKVDRCAE---R 7995
Query: 158 AAIDKMTKKMQKTLENTKFRGAPIIPVAAKPG 189
+++K + ++ + + ++P + G
Sbjct: 7996 ESVEKAMRMARERVSSYALSTLLVVPTSGMSG 8091
>Contig3973
Length = 13702
Score = 29.6 bits (65), Expect(2) = 0.22, Method: Composition-based stats.
Identities = 21/75 (28%), Positives = 31/75 (41%), Gaps = 1/75 (1%)
Frame = -2
Query: 23 LARALSTTASTAAFDKQPQSRERGITLD-LGFSCFSVPLPARLRSSLPEFQAAPEAEPEP 81
L A++ A A D PQ RERGIT+ +CF + +F
Sbjct: 1902 LLMAMTLHAGEAIMDYMPQERERGITITAAATTCF-------WKGGYRKF---------- 1774
Query: 82 GEPLLQVTLVDCPGH 96
P ++ ++D PGH
Sbjct: 1773 --PQHRINIIDTPGH 1735
Score = 22.7 bits (47), Expect(2) = 0.22, Method: Composition-based stats.
Identities = 7/11 (63%), Positives = 10/11 (90%)
Frame = -1
Query: 11 GVLGHIDSGKT 21
G++ HID+GKT
Sbjct: 2008 GIIAHIDAGKT 1976
>Contig1952
Length = 1492
Score = 33.1 bits (74), Expect = 0.32, Method: Composition-based stats.
Identities = 24/134 (17%), Positives = 60/134 (44%), Gaps = 5/134 (3%)
Frame = -1
Query: 61 PARLRSSLPEFQAAPEAEPEPGEPLLQVTLVDCPGHASLIRTIIGGAQIIDLMMLVIDVT 120
PAR R L ++Q P +++ +D PGH L+ + + GA + ++++DV
Sbjct: 1231 PARKRMRLLQYQL-PFXNRR-----VKLNFIDTPGHHELVSSWVRGAIFAEAAVVIVDVL 1070
Query: 121 KGMQTQS-----AECLVIGQIACQKLVVVLNKIDLLPEGKRQAAIDKMTKKMQKTLENTK 175
++ + ++ + + V+ +NK+D E + +++K + ++ + +
Sbjct: 1069 DFLKEDTNGYFEQHFFLLWSLGVRHFVICVNKVDRCAE---RESVEKAMRMARERVSSYA 899
Query: 176 FRGAPIIPVAAKPG 189
++P + G
Sbjct: 898 LSTLLVVPTSGMSG 857
>Contig4098
Length = 25291
Score = 31.6 bits (70), Expect = 1.1, Method: Composition-based stats.
Identities = 12/19 (63%), Positives = 16/19 (84%)
Frame = +2
Query: 9 NVGVLGHIDSGKTALARAL 27
N+G++ HID+GKT LA AL
Sbjct: 9575 NIGIVAHIDAGKTTLAEAL 9631
>Contig3632
Length = 5969
Score = 31.6 bits (70), Expect = 1.2, Method: Composition-based stats.
Identities = 22/70 (31%), Positives = 35/70 (50%), Gaps = 4/70 (5%)
Frame = -1
Query: 196 TEAPQGIPELIELLTSQISIPTRDPSGPFLMSVDHCFS----IKGQGTVMTGTILSGSIS 251
T AP P+ I+ + D + PF+ V H + + G +GT++S S +
Sbjct: 5522 TSAPFSAPDAIQ----SWMLSWFDNAAPFMFHVPHQITRLPRLAGCDGCGSGTVISVSTA 5355
Query: 252 LGDSVEIPAL 261
LG+S+E PAL
Sbjct: 5354 LGESLEPPAL 5325
>Contig4154
Length = 86118
Score = 30.8 bits (68), Expect = 1.8, Method: Composition-based stats.
Identities = 30/111 (27%), Positives = 50/111 (45%)
Frame = -3
Query: 114 MLVIDVTKGMQTQSAECLVIGQIACQKLVVVLNKIDLLPEGKRQAAIDKMTKKMQKTLEN 173
+LV+D TKG+Q Q+ +I KL+ ++NKID+ + T K+L N
Sbjct: 78898 LLVVDGTKGIQAQTVTTSMIAIEKGLKLIPIVNKIDV------EFCDYDATASDLKSLFN 78737
Query: 174 TKFRGAPIIPVAAKPGGPEAPETEAPQGIPELIELLTSQISIPTRDPSGPF 224
F I+ +AK G GI ++++ + ++ P D PF
Sbjct: 78736 --FTEDEILMASAKEGF----------GITDILDAVVERVPPPKIDTEKPF 78620
>Contig2705
Length = 2255
Score = 29.6 bits (65), Expect = 4.5, Method: Composition-based stats.
Identities = 13/29 (44%), Positives = 15/29 (51%)
Frame = +3
Query: 283 GDRLGICVTQFDPKLLERGLVCAPESLHT 311
GDR G C D + RG +CA SL T
Sbjct: 621 GDRCGSCAQNTDLSMYGRGTLCAKTSLRT 707
>Contig4163.0
Length = 28715
Score = 29.6 bits (65), Expect = 4.5, Method: Composition-based stats.
Identities = 13/29 (44%), Positives = 15/29 (51%)
Frame = +2
Query: 283 GDRLGICVTQFDPKLLERGLVCAPESLHT 311
GDR G C D + RG +CA SL T
Sbjct: 16718 GDRCGSCAQNTDLSMYGRGTLCAKTSLRT 16804
>Contig4184.3
Length = 137674
Score = 29.3 bits (64), Expect = 5.2, Method: Composition-based stats.
Identities = 28/97 (28%), Positives = 46/97 (47%), Gaps = 2/97 (2%)
Frame = -3
Query: 440 NTCRLAFHGILLHGLEDRNYADSFLPRLKVYKLKHKHGLVERAMDDYSVIGRSLFKKETN 499
NTC +H L++R+Y L KVY L + + + +++D SV +L + +
Sbjct: 136427 NTCSY------VHALQERHYVFINLHDCKVYCLPENYQVEDASLNDISV-SATLMRNSSI 136269
Query: 500 IQLFVGLKVHLSTGELGIIDS--AFGQSGKFKIHIPG 534
QLF LK + + IDS +G++ IPG
Sbjct: 136268 RQLF--LKPSFTKEYVDYIDSKIVYGKALDGTDFIPG 136164
>Contig2251
Length = 1343
Score = 28.5 bits (62), Expect = 8.3, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 21/40 (52%), Gaps = 2/40 (5%)
Frame = +1
Query: 58 VPLPARLRSSLPEFQAA--PEAEPEPGEPLLQVTLVDCPG 95
+PLP LRS P AA P A+P P P L T + C G
Sbjct: 1177 LPLPHSLRSLPPPSHAACPPNADPPPISPRLIATRLYCDG 1296
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000004_1.0 # Protein # Glutathione peroxidase 1 (GPx1) #
Homo sapiens # Complete
(203 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
***** No hits found ******
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000005_1.0 # Protein # Glutathione peroxidase 2 (GPx2) #
Homo sapiens # Complete
(190 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig2766 33 0.076
Contig4065 30 0.88
Contig4094 28 3.1
>Contig2766
Length = 9629
Score = 33.1 bits (74), Expect = 0.076, Method: Composition-based stats.
Identities = 22/71 (30%), Positives = 29/71 (40%)
Frame = +3
Query: 54 QCRFPRRLVVLGFPCNQFGHQENCQNEEILNSLKYVRPGGGYQPTFTLVQKCEVNGQNEH 113
QCR+ R + CN+ ++E P G QP KC+VN Q
Sbjct: 5316 QCRYGRDIPTTKSQCNEHSNEE---------------PNG--QP------KCQVNCQPTS 5426
Query: 114 PVFAYLKDKLP 124
P+ YL D LP
Sbjct: 5427 PLMLYLSDSLP 5459
>Contig4065
Length = 21455
Score = 29.6 bits (65), Expect = 0.88, Method: Composition-based stats.
Identities = 30/118 (25%), Positives = 51/118 (43%), Gaps = 4/118 (3%)
Frame = -2
Query: 17 DGEKVDFNTFRGRAVLIENVASLUGTTTRDFTQLNELQCRFPRR---LVVLGFPCNQFGH 73
DGE + + F G + ++ A++ G + LN + R + +VVL CN G
Sbjct: 5164 DGELRNCDEFTGSNIQVKINAAISGANGNE--HLNNCKLREQKDINIMVVLACLCNHLGQ 4991
Query: 74 QENCQNEEILNSLKYVRPGGGYQPTFTLVQKCE-VNGQNEHPVFAYLKDKLPYPYDDP 130
NC + I + +P P + Q E + G +PV++ LK + +D P
Sbjct: 4990 YYNCCHYVI-----HPKP----TPPCNIYQMLEWLCGLPHNPVYSKLKGECKTLFDKP 4844
>Contig4094
Length = 27189
Score = 27.7 bits (60), Expect = 3.1, Method: Composition-based stats.
Identities = 11/26 (42%), Positives = 14/26 (53%)
Frame = -1
Query: 108 NGQNEHPVFAYLKDKLPYPYDDPFSL 133
N NEH +Y + P PYDD +L
Sbjct: 10305 NFSNEHVTMSYNETMFPVPYDDVLAL 10228
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000006_1.0 # Protein # Glutathione peroxidase 3 (GPx3) #
Homo sapiens # Complete
(226 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig3112 28 2.8
Contig3785.0 28 4.4
>Contig3112
Length = 5813
Score = 28.1 bits (61), Expect = 2.8, Method: Composition-based stats.
Identities = 13/34 (38%), Positives = 20/34 (58%), Gaps = 5/34 (14%)
Frame = -1
Query: 24 GQEKSKMDCHG-----GISGTIYEYGALTIDGEE 52
G+EKS+ C G+S T+Y+YG + + EE
Sbjct: 419 GEEKSRCQCESIVTCNGVSPTLYQYGFIFNNAEE 318
>Contig3785.0
Length = 3968
Score = 27.7 bits (60), Expect = 4.4, Method: Composition-based stats.
Identities = 21/66 (31%), Positives = 28/66 (42%), Gaps = 1/66 (1%)
Frame = -1
Query: 106 KQEPGENSE-ILPTLKYVRPGGGFVPNFQLFEKGDVNGEKEQKFYTFLKNSCPPTSELLG 164
K++PGE E +P +PG G PNF L G K F +F + P S+
Sbjct: 461 KRDPGEPGEWTVPGGNCPKPGQGRRPNFHLPPAKKRPGAKLTSFKSFREAISPNPSKAWP 282
Query: 165 TSDRLF 170
R F
Sbjct: 281 VDPRNF 264
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000007_1.0 # Protein # Glutathione peroxidase 4 (GPx4) #
Homo sapiens # Complete
(197 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig3921 31 0.44
Contig4144.0 28 3.1
>Contig3921
Length = 37191
Score = 30.8 bits (68), Expect = 0.44, Method: Composition-based stats.
Identities = 11/29 (37%), Positives = 17/29 (58%)
Frame = +1
Query: 142 PLWKWMKIQPKGKGILGNAIKWNFTKFLI 170
P +WM+ P+G L N+ K+ F+ F I
Sbjct: 1582 PATRWMRRTPRGAASLSNSWKYKFSSFTI 1668
>Contig4144.0
Length = 55587
Score = 27.7 bits (60), Expect = 3.1, Method: Composition-based stats.
Identities = 12/22 (54%), Positives = 15/22 (68%)
Frame = +3
Query: 129 MFSKICVNGDDAHPLWKWMKIQ 150
M SKI VN D HPL K++ +Q
Sbjct: 8952 MASKIVVNIHDVHPLVKYINLQ 9017
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000008_1.0 # Protein # Glutathione peroxidase 5 (GPx5) #
Homo sapiens # Complete
(221 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig3949 28 4.5
Contig3785.0 27 4.6
Contig4180 27 8.2
>Contig3949
Length = 36457
Score = 27.7 bits (60), Expect = 4.5, Method: Composition-based stats.
Identities = 17/54 (31%), Positives = 28/54 (51%), Gaps = 2/54 (3%)
Frame = +3
Query: 107 QEPGDNKE--ILPGLKYVRPGGGFVPSFQLFEKGDVNGEKEQKVFSFLKHSCPH 158
Q+P D + ++P L + GGG+ PS F D++G ++ + LK S H
Sbjct: 24663 QDPPDVEVEIVMPSL--IDNGGGYQPSGNGFRGNDIDGFEDPNAINLLKDSNNH 24818
>Contig3785.0
Length = 3968
Score = 27.3 bits (59), Expect = 4.6, Method: Composition-based stats.
Identities = 19/57 (33%), Positives = 28/57 (49%), Gaps = 1/57 (1%)
Frame = -1
Query: 106 KQEPGDNKE-ILPGLKYVRPGGGFVPSFQLFEKGDVNGEKEQKVFSFLKHSCPHPSE 161
K++PG+ E +PG +PG G P+F L G K SF + P+PS+
Sbjct: 461 KRDPGEPGEWTVPGGNCPKPGQGRRPNFHLPPAKKRPGAKLTSFKSFREAISPNPSK 291
>Contig4180
Length = 166360
Score = 26.6 bits (57), Expect = 8.2, Method: Composition-based stats.
Identities = 11/34 (32%), Positives = 17/34 (50%)
Frame = -2
Query: 19 VQTSPKQEKMKMDCHKDEKGTIYDYEAIALNKNE 52
+ T DC+KD KGTI E +A+ + +
Sbjct: 122667 INTEADTRAHSADCNKDTKGTIPQAERVAVTQKD 122566
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000009_1.0 # Protein # Glutathione peroxidase 6 (GPx6) #
Homo sapiens # Complete
(221 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig738 28 2.3
>Contig738
Length = 1891
Score = 28.5 bits (62), Expect = 2.3, Method: Composition-based stats.
Identities = 28/72 (38%), Positives = 35/72 (48%), Gaps = 5/72 (6%)
Frame = -1
Query: 30 VDCNKGVTGTIYEYG-----ALTLNGEEYIQFKQFAGKHVLFVNVAAYUGLAAQYPELNA 84
VDC KGV T+Y+YG A TLNGE + F + NV L ++Y E
Sbjct: 1486 VDC-KGVAPTLYQYGFVFGEASTLNGESPKKCSDFCSQ---LKNV-----LKSKYFE--K 1340
Query: 85 LQEELKNFGVIV 96
L EE NF I+
Sbjct: 1339 LFEECDNFLWII 1304
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000010_1.0 # Protein # Glutathione peroxidase 7 (GPx7) #
Homo sapiens # Complete
(187 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig4140 30 0.69
Contig3822 27 4.0
Contig3699 27 4.0
Contig3782.1 27 4.9
Contig89 27 6.9
>Contig4140
Length = 46276
Score = 29.6 bits (65), Expect = 0.69, Method: Composition-based stats.
Identities = 18/51 (35%), Positives = 22/51 (43%), Gaps = 8/51 (15%)
Frame = -2
Query: 43 YRGSVSLVVNVASECGFTDQHYRA--------LQQLQRDLGPHHFNVLAFP 85
YR V + + S T HY A Q L DL PH F+V A+P
Sbjct: 31707 YRSKVKSIFDKFSRLLDTLTHYNADFSTNLAEFQNLLSDLNPHEFDVPAYP 31555
>Contig3822
Length = 12065
Score = 27.3 bits (59), Expect = 4.0, Method: Composition-based stats.
Identities = 20/50 (40%), Positives = 26/50 (52%), Gaps = 2/50 (4%)
Frame = -2
Query: 30 AVNIR--GKLVSLEKYRGSVSLVVNVASECGFTDQHYRALQQLQRDLGPH 77
AVN+R G ++ E +GS + A ECG HYRA +L GPH
Sbjct: 11443 AVNVRLLGNSMADEIIQGSTGMRRR-AGECGTYFSHYRA--ELLGKYGPH 11303
>Contig3699
Length = 13005
Score = 27.3 bits (59), Expect = 4.0, Method: Composition-based stats.
Identities = 20/50 (40%), Positives = 26/50 (52%), Gaps = 2/50 (4%)
Frame = +3
Query: 30 AVNIR--GKLVSLEKYRGSVSLVVNVASECGFTDQHYRALQQLQRDLGPH 77
AVN+R G ++ E +GS + A ECG HYRA +L GPH
Sbjct: 10755 AVNVRLLGNSMADEIIQGSTGMRRR-AGECGTYFSHYRA--ELLGKYGPH 10895
>Contig3782.1
Length = 1841
Score = 26.9 bits (58), Expect = 4.9, Method: Composition-based stats.
Identities = 14/41 (34%), Positives = 20/41 (48%)
Frame = +1
Query: 92 QEPDSNKEIESFARRTYSVSFPMFSKIAVTGTGAHPAFKYL 132
Q PD+++ + + RR YSV + IAV G A Y
Sbjct: 1087 QTPDNSQYVAIYRRRQYSVICKILCAIAVLNFGHFNAVLYF 1209
>Contig89
Length = 1098
Score = 26.6 bits (57), Expect = 6.9, Method: Composition-based stats.
Identities = 15/40 (37%), Positives = 22/40 (55%), Gaps = 1/40 (2%)
Frame = -1
Query: 90 GQQEPDSNKEIESFARRTYSV-SFPMFSKIAVTGTGAHPA 128
G Q S + + A T+++ SFP+F I + G G HPA
Sbjct: 180 GLQSRGSQRVRQG*ATNTFTLPSFPLFYFILLLGIGKHPA 61
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000011_1.0 # Protein # Glutathione peroxidase 8 (GPx8) #
Homo sapiens # Complete
(209 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
***** No hits found ******
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000001_1.0 # Protein # Iodothyronine deiodinase 1 (DI1) #
Homo sapiens # Complete
(249 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig1329 27 6.4
>Contig1329
Length = 1478
Score = 27.3 bits (59), Expect = 6.4, Method: Composition-based stats.
Identities = 12/40 (30%), Positives = 19/40 (47%)
Frame = -1
Query: 209 YAALPERLYIIQEGRILYKGKSGPWNYNPEEVRAVLEKLH 248
+ ALP L + Q L+ S P++Y P +R + H
Sbjct: 920 FLALPSVLEVFQSALSLFHSTSSPYHYTPAPLRPPAQSHH 801
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000002_1.0 # Protein # Iodothyronine deiodinase 2 (DI2) #
Homo sapiens # Complete
(265 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig2984 30 0.98
Contig4102 30 1.1
Contig4181.1 28 3.8
Contig4148 28 4.4
Contig1906 28 5.0
Contig4066 27 8.2
>Contig2984
Length = 4517
Score = 30.0 bits (66), Expect = 0.98, Method: Composition-based stats.
Identities = 15/61 (24%), Positives = 29/61 (47%), Gaps = 7/61 (11%)
Frame = -2
Query: 94 GGDNSGNGTQEKIAEGAT-------CHLLDFASPERPLVVNFGSATUPPFTSQLPAFRKL 146
G D+SG G+ + + T CH D ++ + ++++ S PPF + LP +
Sbjct: 520 GSDDSGYGSDDMVTAAMTWSRQR*HCHGSDDSNDQVVIIIHSTSLHNPPFIAPLPTLHRS 341
Query: 147 V 147
+
Sbjct: 340 I 338
>Contig4102
Length = 42529
Score = 30.0 bits (66), Expect = 1.1, Method: Composition-based stats.
Identities = 25/85 (29%), Positives = 38/85 (44%)
Frame = -2
Query: 84 SSVVHVSSTEGGDNSGNGTQEKIAEGATCHLLDFASPERPLVVNFGSATUPPFTSQLPAF 143
SS+V S++ GG+N N E T + RPL+ GSA P +S+
Sbjct: 22446 SSIVLDSNSVGGENEENNPSEADERRPTEE--SGIADGRPLLSEGGSAGRPSTSSEGTPS 22273
Query: 144 RKLVEEFSSVADFLLVYIDEAHPSD 168
R +F + +D + D HPS+
Sbjct: 22272 RTDQTDFYTDSDTPHRHCDSPHPSN 22198
>Contig4181.1
Length = 107582
Score = 28.1 bits (61), Expect = 3.8, Method: Composition-based stats.
Identities = 16/54 (29%), Positives = 30/54 (55%), Gaps = 3/54 (5%)
Frame = +3
Query: 5 SVDLLITLQILPVFFSNCLFLALYDSV---ILLKHVVLLLSRSKSTRGEWRRML 55
++DL+ TL +L + SNCL ++ + +L +V+L + K ++RR L
Sbjct: 95217 AIDLIFTLTMLSSYLSNCLQISSTHGIPVMLLPLSIVMLGNAIKDAYEDYRRYL 95378
>Contig4148
Length = 42781
Score = 28.1 bits (61), Expect = 4.4, Method: Composition-based stats.
Identities = 17/45 (37%), Positives = 18/45 (40%)
Frame = -1
Query: 80 DAPNSSVVHVSSTEGGDNSGNGTQEKIAEGATCHLLDFASPERPL 124
DA + S GG N G CH LDFA PER L
Sbjct: 15919 DARHQLDAEASPHLGGGGRANCRALAQPGGQGCHDLDFAHPERKL 15785
>Contig1906
Length = 1680
Score = 27.7 bits (60), Expect = 5.0, Method: Composition-based stats.
Identities = 16/56 (28%), Positives = 28/56 (50%)
Frame = -3
Query: 148 EEFSSVADFLLVYIDEAHPSDGWAIPGDSSLSFEVKKHQNQEDRCAAAQQLLERFS 203
E ++ + LL I EAHPS+ W + + + VK+ +Q + AA + F+
Sbjct: 1441 ESVTADMEVLLHDIKEAHPSNTWKVDDSNPVVKAVKEQLSQ--KVTAADSTIRSFA 1280
>Contig4066
Length = 25167
Score = 26.9 bits (58), Expect = 8.2, Method: Composition-based stats.
Identities = 13/29 (44%), Positives = 20/29 (68%)
Frame = -1
Query: 19 FSNCLFLALYDSVILLKHVVLLLSRSKST 47
+SNCLF A+ D+ +L V LL +S++T
Sbjct: 17964 YSNCLFGAMCDTGVLHSEPVWLLLQSEAT 17878
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000003_1.0 # Protein # Iodothyronine deiodinase 3 (DI3) #
Homo sapiens # Complete
(278 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig50 30 1.2
Contig1334 28 3.3
Contig3102 28 5.4
>Contig50
Length = 6757
Score = 30.0 bits (66), Expect = 1.2, Method: Composition-based stats.
Identities = 15/31 (48%), Positives = 19/31 (61%), Gaps = 2/31 (6%)
Frame = +3
Query: 242 TIMYQGGRGPDGYQVS--ELRTWLERYDEQL 270
T+ Q G G DG VS E++ WLE YD Q+
Sbjct: 609 TLNKQVGSGRDGLAVSVAEVKRWLEGYDNQI 701
>Contig1334
Length = 2713
Score = 28.5 bits (62), Expect = 3.3, Method: Composition-based stats.
Identities = 12/23 (52%), Positives = 15/23 (65%)
Frame = +1
Query: 248 GRGPDGYQVSELRTWLERYDEQL 270
GR VSE+ TWL R+DE+L
Sbjct: 1141 GRTEFNSAVSEVSTWLRRHDEEL 1209
>Contig3102
Length = 1266
Score = 27.7 bits (60), Expect = 5.4, Method: Composition-based stats.
Identities = 15/28 (53%), Positives = 18/28 (64%)
Frame = -1
Query: 54 PEPEVELNSEGEEVPPDDPPICVSDDNR 81
PE E++S GE PP PPI V DD+R
Sbjct: 900 PEVRDEVDSTGEYSPPS-PPILVPDDSR 820
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000012_1.0 # Protein # Methionine sulfoxide reductase A
(MsrA) # Homo sapiens # Complete
(235 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig4070 30 1.1
Contig4165 29 2.0
Contig971 27 8.2
>Contig4070
Length = 46082
Score = 29.6 bits (65), Expect = 1.1, Method: Composition-based stats.
Identities = 14/43 (32%), Positives = 25/43 (58%), Gaps = 1/43 (2%)
Frame = -3
Query: 105 YKEVCSEKTGHAEVVRVVYQPE-HMSFEELLKVFWENHDPTQG 146
+ ++ S+ GH V R+ P + S L+++ WE+HDP +G
Sbjct: 20280 WPKIMSKVLGHT-VRRMSLPPMLYYSVAHLMRIIWEHHDPQEG 20155
>Contig4165
Length = 92729
Score = 28.9 bits (63), Expect = 2.0, Method: Composition-based stats.
Identities = 15/38 (39%), Positives = 23/38 (60%)
Frame = +3
Query: 7 RACQLLLLHSLFPVPRMGNSASNIVSPQEALPGRKEQT 44
RACQ L + PVPR + S ++P+ LPGR++ +
Sbjct: 45402 RACQAL---APAPVPRTFETGSRRLAPRPPLPGRRQAS 45506
>Contig971
Length = 6473
Score = 26.9 bits (58), Expect = 8.2, Method: Composition-based stats.
Identities = 19/55 (34%), Positives = 31/55 (56%), Gaps = 5/55 (9%)
Frame = +2
Query: 130 FEELL---KVFWENHDPTQGMRQGNDHGTQYRSAIY--PTSAKQMEAALSSKENY 179
FEELL V ++++ + + N G ++A Y PTSA+ +A L +KEN+
Sbjct: 3674 FEELLTNTNVTGDHYNLSINLHNNNGLGDFLKTAGYDVPTSAEISDAELKNKENF 3838
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000037_1.0 # Protein # SECIS binding protein 2 (SBP2) #
Homo sapiens # Complete
(854 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig3736 32 1.2
Contig656 30 5.3
Contig4160 30 6.3
>Contig3736
Length = 11714
Score = 32.0 bits (71), Expect = 1.2, Method: Composition-based stats.
Identities = 15/38 (39%), Positives = 24/38 (63%), Gaps = 1/38 (2%)
Frame = -2
Query: 719 CE-QNIPFVFALNRKALGRSLNKAVPVSVVGIFSYDGA 755
CE +NIP++F ++ ALGR+ + PV I S +G+
Sbjct: 8641 CEDKNIPYIFVKSKIALGRACGVSRPVVSCAIISREGS 8528
>Contig656
Length = 1403
Score = 29.6 bits (65), Expect = 5.3, Method: Composition-based stats.
Identities = 14/53 (26%), Positives = 25/53 (47%)
Frame = -3
Query: 745 SVVGIFSYDGAQDQFHKMVELTVAARQAYKTMLENVQQELVGEPRPQAPPSLP 797
+VVG+ Y A +H+ L + Q ++ M+ E + P+P P+ P
Sbjct: 1380 TVVGLHVYHEAYSAYHRGEHLCASREQMHRVMVRRFXVEXMVSPQPPCVPATP 1222
>Contig4160
Length = 71491
Score = 29.6 bits (65), Expect = 6.3, Method: Composition-based stats.
Identities = 21/58 (36%), Positives = 29/58 (50%)
Frame = -3
Query: 565 ESGGDDQFPEQAELSGPEGMDELISTPSVEDKSEEPPGTELQRDTEASHLAPNHTTFP 622
E G DQ EL EG+D + + VE EEPP T+ TEA+ + P+ + P
Sbjct: 68723 EMYGTDQL---QELIKSEGLDAVDNENKVESLEEEPPTTKRAAPTEATEVLPSIISEP 68559
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000032_1.0 # Protein # Selenophosphate synthetase 1 (SPS1)
# Homo sapiens # Complete
(392 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig3973 33 0.32
Contig4180 30 2.6
Contig4139.0 29 4.2
Contig4077 29 4.4
>Contig3973
Length = 13702
Score = 32.7 bits (73), Expect = 0.32, Method: Composition-based stats.
Identities = 15/40 (37%), Positives = 25/40 (62%)
Frame = -3
Query: 202 PLGTQVAVAVHQWLDIPEKWNKIKLVVTQEDVELAYQEAM 241
P+G Q+ +HQ D+P K+K V+ Q+D+E+ Y E +
Sbjct: 10361 PMGQQIVNYIHQRNDVPFPHVKLKNVI-QQDIEIVYYENL 10245
>Contig4180
Length = 166360
Score = 29.6 bits (65), Expect = 2.6, Method: Composition-based stats.
Identities = 16/54 (29%), Positives = 26/54 (48%)
Frame = -3
Query: 162 GQTVLNPWIVLGGVATTVCQPNEFIMPDNAVPGDVLVLTKPLGTQVAVAVHQWL 215
G N +++ GG PN+F++P N GD+ + P VA ++WL
Sbjct: 51509 GPMTKNSFVIAGG------HPNQFLIPVNVPSGDLCKICSP----NEVAFYRWL 51378
>Contig4139.0
Length = 74301
Score = 28.9 bits (63), Expect = 4.2, Method: Composition-based stats.
Identities = 20/71 (28%), Positives = 30/71 (42%), Gaps = 1/71 (1%)
Frame = -3
Query: 110 DLYAMGVTECDNMLMLLGVSNKMTDRERDKVMPLIIQGFKDAAEEAGTSVTGGQTVL-NP 168
D+Y D+ + GVS + DR R P F G + T++ NP
Sbjct: 3274 DVYTFASCSLDHTIKFWGVSPDVMDR-RHSNQPSPKPSFTLKGHTRGVNCIEFSTIMSNP 3098
Query: 169 WIVLGGVATTV 179
+I+ GG TT+
Sbjct: 3097 YIISGGEDTTI 3065
>Contig4077
Length = 34604
Score = 28.9 bits (63), Expect = 4.4, Method: Composition-based stats.
Identities = 21/75 (28%), Positives = 31/75 (41%)
Frame = +1
Query: 7 FNPESYELDKSFRLTRFTELKGTGCKVPQDVLQKLLESLQENHFQEDEQFLGAVMPRLGI 66
+N ESY L S LTR G ++ + + SLQE +DE+ R I
Sbjct: 20221 YNTESYSLLYSLTLTRNRSFTGINRELNSRYMTEYGSSLQEWDLSDDEEVAEGAAERRRI 20400
Query: 67 GMDTCVIPLRHGGLS 81
+ ++ G LS
Sbjct: 20401 QSHYALPGVQQGELS 20445
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000033_1.0 # Protein # Selenophosphate synthetase 2 (SPS2)
# Homo sapiens # Complete
(448 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
***** No hits found ******
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000013_1.0 # Protein # Selenoprotein 15 (Sel15) # Homo
sapiens # Complete
(124 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig4131.0 28 0.94
Contig2791 28 1.00
Contig4133 25 7.8
>Contig4131.0
Length = 48526
Score = 28.1 bits (61), Expect = 0.94, Method: Composition-based stats.
Identities = 14/34 (41%), Positives = 18/34 (52%)
Frame = -2
Query: 46 FSSNLLCSSCDLLGQFNLLQLDPDCRGCCQEEAQ 79
F+SN C L + +L D DC G CQE A+
Sbjct: 38145 FNSNTSCEDAIELQRMAMLNEDRDCYGHCQEFAR 38044
>Contig2791
Length = 1817
Score = 28.1 bits (61), Expect = 1.00, Method: Composition-based stats.
Identities = 14/34 (41%), Positives = 18/34 (52%)
Frame = +3
Query: 46 FSSNLLCSSCDLLGQFNLLQLDPDCRGCCQEEAQ 79
F+SN C L + +L D DC G CQE A+
Sbjct: 996 FNSNTSCEDAIELQRMAMLNEDRDCYGHCQEFAR 1097
>Contig4133
Length = 40823
Score = 25.0 bits (53), Expect = 7.8, Method: Composition-based stats.
Identities = 21/55 (38%), Positives = 23/55 (41%), Gaps = 3/55 (5%)
Frame = -1
Query: 22 LATVLQAVSAFGAEFSSEACRELGFSSNLLCS---SCDLLGQFNLLQLDPDCRGC 73
LA VLQ V A AE E E + N LC + LGQ L D GC
Sbjct: 27815 LAEVLQKVGAENAELRVERLPEQAHARNHLCGVLVAH*TLGQHRHLVHDLGPHGC 27651
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000014_1.0 # Protein # Selenoprotein H (SelH) # Homo
sapiens # Complete
(122 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig4122 25 7.1
>Contig4122
Length = 47457
Score = 25.0 bits (53), Expect = 7.1, Method: Composition-based stats.
Identities = 11/26 (42%), Positives = 14/26 (53%)
Frame = +2
Query: 65 LPVKVNPTKPRRGSFEVTLLRPDGSS 90
+PV + P + RRG LL GSS
Sbjct: 18002 IPVAIQPHQLRRGDLYTVLLNAQGSS 18079
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000015_1.0 # Protein # Selenoprotein I (SelI) # Homo
sapiens # Complete
(397 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig4096 49 3e-06
>Contig4096
Length = 56293
Score = 49.3 bits (116), Expect = 3e-06, Method: Composition-based stats.
Identities = 30/88 (34%), Positives = 47/88 (53%), Gaps = 7/88 (7%)
Frame = +1
Query: 100 AYTLDGVDGKQARRTNSSTPLGELFDHGLDSWSCVYFVVTVYSIFGRGSTGVSVFVLYLL 159
A T DG+DGKQAR+ S+PLG+L DHGLD+ V++ +++ G T F+ LL
Sbjct: 46102 AQTFDGIDGKQARKLGMSSPLGQLLDHGLDAVVTVFYPYICCTLYPGGFT----FITLLL 46269
Query: 160 LWVVLFSFILSHWE-------KYNTGIL 180
+ + + + W +Y G+L
Sbjct: 46270 VAIAPIHVLCTVWRESEFETFEYTNGVL 46353
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000016_1.0 # Protein # Selenoprotein K (SelK) # Homo
sapiens # Complete
(94 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig4113 26 3.6
Contig4085 25 6.9
Contig3968 25 7.9
Contig4179.1 24 8.2
Contig4186.0 24 8.4
>Contig4113
Length = 37815
Score = 25.8 bits (55), Expect = 3.6, Method: Composition-based stats.
Identities = 11/18 (61%), Positives = 12/18 (66%)
Frame = +2
Query: 68 GNPPRRMGRINHLRGPSP 85
GNPPRR R+N R SP
Sbjct: 26024 GNPPRRSPRVNCSRLQSP 26077
>Contig4085
Length = 59578
Score = 24.6 bits (52), Expect = 6.9, Method: Composition-based stats.
Identities = 15/45 (33%), Positives = 23/45 (51%), Gaps = 4/45 (8%)
Frame = +1
Query: 47 KKRRSYGNSSDSRYDDGRGPPGNPPRRMGRINHLR----GPSPPP 87
++RR+ G S + +G PP + + G +HLR SPPP
Sbjct: 16942 RERRASGCSPERTSSEGSAPPPDTRTQ*GGGSHLR*RG*SGSPPP 17076
>Contig3968
Length = 26453
Score = 24.6 bits (52), Expect = 7.9, Method: Composition-based stats.
Identities = 13/37 (35%), Positives = 16/37 (43%)
Frame = -1
Query: 51 SYGNSSDSRYDDGRGPPGNPPRRMGRINHLRGPSPPP 87
S G + R R P G PP + G H + PPP
Sbjct: 203 SGGKTQRRRKSRKRNPQGRPPPKKGPKAHRQKREPPP 93
>Contig4179.1
Length = 126567
Score = 24.3 bits (51), Expect = 8.2, Method: Composition-based stats.
Identities = 13/36 (36%), Positives = 18/36 (50%)
Frame = -2
Query: 56 SDSRYDDGRGPPGNPPRRMGRINHLRGPSPPPMAGG 91
+D R +G+ PG+PP + L PP M GG
Sbjct: 118442 ADVRSAEGKTAPGSPPGAATTHSTLFQKPPPKMVGG 118335
>Contig4186.0
Length = 78499
Score = 24.3 bits (51), Expect = 8.4, Method: Composition-based stats.
Identities = 12/47 (25%), Positives = 21/47 (44%)
Frame = +2
Query: 37 FFKTLLQQDVKKRRSYGNSSDSRYDDGRGPPGNPPRRMGRINHLRGP 83
++ QQD + R+ +G+ + + G + R R HLR P
Sbjct: 54512 LLHSIFQQDTRHRKDFGSHREIQEHGGEHDDRSRRRHWPRAVHLRPP 54652
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000017_1.0 # Protein # Selenoprotein M (SelM) # Homo
sapiens # Complete
(145 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig4156.1 28 1.3
>Contig4156.1
Length = 51696
Score = 28.1 bits (61), Expect = 1.3, Method: Composition-based stats.
Identities = 14/28 (50%), Positives = 15/28 (53%), Gaps = 3/28 (10%)
Frame = +3
Query: 28 RPDWNRLSGLTRARVETC---GGUQLNR 52
RP W RL TR V+TC G QL R
Sbjct: 3537 RPCWTRLRSHTRISVKTC*SVSGLQLRR 3620
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000018_1.0 # Protein # Selenoprotein N (SelN) # Homo
sapiens # Complete
(590 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig3915 33 0.47
Contig1540 32 0.87
Contig2076 31 2.0
Contig3778 30 2.3
Contig4159 30 4.0
Contig3954.0 28 8.1
>Contig3915
Length = 10523
Score = 32.7 bits (73), Expect = 0.47, Method: Composition-based stats.
Identities = 23/58 (39%), Positives = 25/58 (43%), Gaps = 4/58 (6%)
Frame = -1
Query: 7 GQRGPPSPGPAAQ----PPAPPRRRARSLALLGALLAAAAAAAVRVCARHAEAQAAAR 60
GQRG P+P PAA AP R R LL A R ARH + AA R
Sbjct: 9161 GQRGVPAPPPAAHGYCAGGAPASRHGR-------LLQGVRRADQRHVARHGQQHAAVR 9009
>Contig1540
Length = 1417
Score = 32.0 bits (71), Expect = 0.87, Method: Composition-based stats.
Identities = 15/26 (57%), Positives = 16/26 (61%), Gaps = 1/26 (3%)
Frame = -1
Query: 2 GRARPGQRGPPSPGPA-AQPPAPPRR 26
G P +R PSPGPA PPAPP R
Sbjct: 1180 GGGHPRERREPSPGPALGFPPAPPHR 1103
>Contig2076
Length = 1201
Score = 30.8 bits (68), Expect = 2.0, Method: Composition-based stats.
Identities = 13/23 (56%), Positives = 14/23 (60%)
Frame = +2
Query: 7 GQRGPPSPGPAAQPPAPPRRRAR 29
G R P P P+AQPP P R AR
Sbjct: 203 GGRAPRPPHPSAQPPGPARTHAR 271
>Contig3778
Length = 7428
Score = 30.4 bits (67), Expect = 2.3, Method: Composition-based stats.
Identities = 14/25 (56%), Positives = 17/25 (68%)
Frame = -3
Query: 2 GRARPGQRGPPSPGPAAQPPAPPRR 26
GRA GQR PP+ G ++PP P RR
Sbjct: 3022 GRASWGQRRPPAAGGCSRPPWPHRR 2948
>Contig4159
Length = 55924
Score = 29.6 bits (65), Expect = 4.0, Method: Composition-based stats.
Identities = 15/44 (34%), Positives = 23/44 (52%), Gaps = 1/44 (2%)
Frame = +3
Query: 177 FLGVSRLALSGLRNWTAAASPSAVFATRHFQPFLPPP-GQELGE 219
FLG+ + + + AA ++ A RH F+PPP G E+ E
Sbjct: 48558 FLGLQKASRKMSLDLCAACDDASTIALRHSNSFVPPPAGHEIDE 48689
>Contig3954.0
Length = 18229
Score = 28.5 bits (62), Expect = 8.1, Method: Composition-based stats.
Identities = 13/25 (52%), Positives = 16/25 (64%)
Frame = +2
Query: 2 GRARPGQRGPPSPGPAAQPPAPPRR 26
GRA GQR PP+ G ++ P P RR
Sbjct: 1253 GRASSGQRRPPAAGGCSRRPWPRRR 1327
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000019_1.0 # Protein # Selenoprotein O (SelO) # Homo
sapiens # Complete
(669 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig4181.2 31 2.1
Contig4142.1 29 5.3
Contig4173.0 29 6.3
Contig91 29 6.6
>Contig4181.2
Length = 4971
Score = 30.8 bits (68), Expect = 2.1, Method: Composition-based stats.
Identities = 25/93 (26%), Positives = 41/93 (44%)
Frame = +1
Query: 312 REVTRRTARMVAEWQCVGFCHGVLNTDNMSILGLTIDYGPFGFLDRYDPDHVCNASDNTG 371
R + T + +++ C C G + +G + FG R H+CN+
Sbjct: 3070 RSASESTFQCISQSSCSIECRG*RGSTIRDYVGGCCEARAFG*-TRLGQSHLCNS----- 3231
Query: 372 RYAYSKQPEVCRWNLRKLAEALQPELPLELGEA 404
RYA + VC+ LR+LA ++PE P + A
Sbjct: 3232 RYAAQIRVRVCKRLLRRLA*GIRPERPYQTSVA 3330
>Contig4142.1
Length = 32721
Score = 29.3 bits (64), Expect = 5.3, Method: Composition-based stats.
Identities = 14/43 (32%), Positives = 19/43 (44%)
Frame = -2
Query: 29 PRSTLSGAAMEPAPRWLAGLRFDNRALRALPVEAPPPGPEGAP 71
P + +SG + P PR L D+ + APPP E P
Sbjct: 11429 PSAVVSGESSHPTPRSADSLASDSHSQSESEASAPPPPSESTP 11301
>Contig4173.0
Length = 70135
Score = 29.3 bits (64), Expect = 6.3, Method: Composition-based stats.
Identities = 21/69 (30%), Positives = 29/69 (42%)
Frame = -2
Query: 422 MRRKLGLVQVELEEDGALVSKLLETMHLTGADFTNTFYLLSSFPVELESPGLAEFLARLM 481
MR GL E DG + +HL +F+ L SSF +P L+ F M
Sbjct: 44448 MRSAEGLFPDEANVDGKMYEWSASQIHLLMTNFSRFPNLHSSFKRLCANPHLSSFRLNEM 44269
Query: 482 EQCASLEEL 490
+ C +EL
Sbjct: 44268 DVCDFYDEL 44242
>Contig91
Length = 1362
Score = 29.3 bits (64), Expect = 6.6, Method: Composition-based stats.
Identities = 15/34 (44%), Positives = 21/34 (61%)
Frame = +1
Query: 486 SLEELRLAFRPQMDPRQLSMMLMLAQSNPQLFAL 519
SL LRL PQ+ RQ S+ + AQS P+L ++
Sbjct: 595 SLSHLRLFATPQITARQASLSITNAQSPPKLISI 696
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000020_1.0 # Protein # Selenoprotein P (SelP) # Homo
sapiens # Complete
(381 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig4161 29 4.4
Contig2845.0 28 6.2
Contig4109 28 9.8
>Contig4161
Length = 76909
Score = 28.9 bits (63), Expect = 4.4, Method: Composition-based stats.
Identities = 13/42 (30%), Positives = 21/42 (50%)
Frame = +1
Query: 95 SSRLKYTHLKNKVSEHIPVYQQEENQTDVWTLLNGSKDDFLI 136
+SR + + K ++P Y++ + Q VWT N S D I
Sbjct: 30928 NSRSSFFGEQQKSGAYVPAYRRAQTQKAVWTPKNSSVQDIFI 31053
>Contig2845.0
Length = 21521
Score = 28.1 bits (61), Expect = 6.2, Method: Composition-based stats.
Identities = 18/55 (32%), Positives = 30/55 (54%), Gaps = 6/55 (10%)
Frame = -2
Query: 205 YHHEHHHNHGHQHLGSSELSENQQPGAPNAP-THPAPPGL-----HHHHKHKGQH 253
YH H+++ G+ HL S LS++ Q N+P TH + P + HH+ H ++
Sbjct: 7813 YHRFHNNHSGNVHLFSYMLSDSTQAYMANSPWTHVSSPNIVSKYCHHYIYHNTRY 7649
>Contig4109
Length = 27245
Score = 27.7 bits (60), Expect = 9.8, Method: Composition-based stats.
Identities = 11/26 (42%), Positives = 18/26 (69%)
Frame = -2
Query: 146 HLGLPFSFLTFPYVEEAIKIAYCEKK 171
HL PFSF FP++ +I++ C+K+
Sbjct: 25366 HLRDPFSFTHFPHIFRSIELISCKKR 25289
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000021_1.0 # Protein # Methionine-R-sufoxide reductase 1
(SelR1) # Homo sapiens # Complete
(116 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig4165 28 0.88
Contig3930 27 1.6
Contig2233 26 2.6
Contig4157 26 4.2
Contig2265 25 5.8
Contig4150 25 6.5
Contig4180 25 6.6
Contig2845.0 25 9.3
>Contig4165
Length = 92729
Score = 28.1 bits (61), Expect = 0.88, Method: Composition-based stats.
Identities = 14/42 (33%), Positives = 23/42 (54%)
Frame = -1
Query: 30 LFSSRSKYAHSSPWPAFTETIHADSVAKRPEHNRSEALKVSC 71
L+S R A+++PW T+ +P+H R+ A KV+C
Sbjct: 11045 LYSERCDGAYTAPWTTLTQCTPHGR*DSKPKHERA-ATKVTC 10923
>Contig3930
Length = 7326
Score = 26.9 bits (58), Expect = 1.6, Method: Composition-based stats.
Identities = 14/40 (35%), Positives = 20/40 (50%), Gaps = 3/40 (7%)
Frame = -2
Query: 29 ELFSSRSKYAHSSPW---PAFTETIHADSVAKRPEHNRSE 65
+L +R K H PW PA T+ + +RP HN S+
Sbjct: 2309 KLQGNRMKCIHVYPWAAWPARRNTLVSADAQRRPHHNTSQ 2190
>Contig2233
Length = 2376
Score = 26.2 bits (56), Expect = 2.6, Method: Composition-based stats.
Identities = 12/33 (36%), Positives = 16/33 (48%)
Frame = -3
Query: 78 LGHEFLNDGPKPGQSRFUIFSSSLKFVPKGKET 110
LG FL+ GP+ R F +F+P ET
Sbjct: 232 LGTNFLDQGPQNSWGRICTFLRDTRFLPPPPET 134
>Contig4157
Length = 97051
Score = 25.8 bits (55), Expect = 4.2, Method: Composition-based stats.
Identities = 11/28 (39%), Positives = 13/28 (46%)
Frame = +3
Query: 8 GGEVFQNHFEPGVYVCAKCGYELFSSRS 35
G E Q EPG + CGY F S +
Sbjct: 2571 GAEPRQRDLEPGFRTASGCGYRSFRSNA 2654
>Contig2265
Length = 2066
Score = 25.4 bits (54), Expect = 5.8, Method: Composition-based stats.
Identities = 8/17 (47%), Positives = 13/17 (76%)
Frame = -1
Query: 27 GYELFSSRSKYAHSSPW 43
G+EL++SR +AH + W
Sbjct: 1079 GWELWTSREAHAHKADW 1029
>Contig4150
Length = 53170
Score = 25.0 bits (53), Expect = 6.5, Method: Composition-based stats.
Identities = 11/30 (36%), Positives = 13/30 (43%)
Frame = -2
Query: 19 GVYVCAKCGYELFSSRSKYAHSSPWPAFTE 48
G++ CAKCG F P FTE
Sbjct: 11082 GMFTCAKCGCHFFKKFKDGLFEYPSRCFTE 10993
>Contig4180
Length = 166360
Score = 25.0 bits (53), Expect = 6.6, Method: Composition-based stats.
Identities = 9/14 (64%), Positives = 11/14 (78%)
Frame = +3
Query: 79 GHEFLNDGPKPGQS 92
GHE ++DGPKP S
Sbjct: 57516 GHEAISDGPKPRPS 57557
>Contig2845.0
Length = 21521
Score = 24.6 bits (52), Expect = 9.3, Method: Composition-based stats.
Identities = 12/28 (42%), Positives = 14/28 (50%)
Frame = +1
Query: 28 YELFSSRSKYAHSSPWPAFTETIHADSV 55
YELF+ R SS WP T+ I V
Sbjct: 7441 YELFNRRCAGWRSSGWPCITDGIRHSGV 7524
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000022_1.0 # Protein # Methionine-R-sufoxide reductase 2
(SelR2) # Homo sapiens # Complete
(199 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig3667 27 5.5
Contig3210 27 6.9
Contig4186.1 27 7.3
>Contig3667
Length = 6782
Score = 26.9 bits (58), Expect = 5.5, Method: Composition-based stats.
Identities = 12/27 (44%), Positives = 17/27 (62%), Gaps = 1/27 (3%)
Frame = +1
Query: 69 SEWQKKLT-PEQFYVTREKGTEPPFSG 94
+ W+ K+ P + TR KG +PPFSG
Sbjct: 6427 ASWRSKIRGPTYYLATRHKGDDPPFSG 6507
>Contig3210
Length = 7352
Score = 26.6 bits (57), Expect = 6.9, Method: Composition-based stats.
Identities = 22/77 (28%), Positives = 33/77 (42%), Gaps = 2/77 (2%)
Frame = -1
Query: 124 GTGWPSFSEAH--GTSGSDESHTGILRRLDTSLGSARTEVVCKQCEAHLGHVFPDGPGPN 181
G+GWP+ ++A G G +R + R C +H G V+ PGP
Sbjct: 6485 GSGWPTGADAFKVGNCGKQRKSETPIRVFRNHMPERRENRECT-ARSHRGAVWQ--PGPQ 6315
Query: 182 GQRFCINSVALKFKPRK 198
QR +N+ + PRK
Sbjct: 6314 EQRCRVNAPGCREVPRK 6264
>Contig4186.1
Length = 187835
Score = 26.6 bits (57), Expect = 7.3, Method: Composition-based stats.
Identities = 11/32 (34%), Positives = 16/32 (50%)
Frame = -2
Query: 20 RLLWLLRGLTLGTAPRRAVRGQAGGGGPGTGP 51
R+LW + PRRA ++GG P + P
Sbjct: 100621 RMLWPVFHYNAPAYPRRAAPSESGGDNPASDP 100526
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000023_1.0 # Protein # Methionine-R-sufoxide reductase 3
(SelR3) # Homo sapiens # Complete
(192 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig1507 30 0.91
Contig1814 29 1.1
Contig3777 29 1.1
Contig2029 28 2.1
Contig4174.0 27 3.6
Contig1030 27 3.8
Contig3897 27 3.9
Contig3063 27 4.3
Contig1390 27 4.7
Contig2433 27 4.8
Contig2044 27 5.3
Contig3597.0 27 5.3
Contig3612 27 5.5
Contig2656 27 5.6
Contig3963.0 27 6.0
Contig3167 27 6.0
Contig4172.0 27 6.3
Contig1967 27 6.8
Contig1894 27 6.8
Contig1801 27 6.8
Contig3202.0 27 7.3
Contig3854 27 7.9
Contig3802 27 8.0
Contig1873 26 8.5
Contig3185 26 8.5
Contig3493.0 26 8.7
Contig3948 26 9.1
Contig369 26 9.4
Contig3510 26 9.6
Contig3491 26 9.9
>Contig1507
Length = 1308
Score = 29.6 bits (65), Expect = 0.91, Method: Composition-based stats.
Identities = 16/26 (61%), Positives = 18/26 (69%), Gaps = 4/26 (15%)
Frame = -1
Query: 6 TLPRPLSLCLSLCLC----LCLAAAL 27
+LPR LS CL LCLC LCL A+L
Sbjct: 501 SLPRCLSACLPLCLCASLPLCLVASL 424
>Contig1814
Length = 9266
Score = 29.3 bits (64), Expect = 1.1, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 17/24 (70%)
Frame = -2
Query: 6 TLPRPLSLCLSLCLCLCLAAALGS 29
+LPR L CL+LCL CLA LG+
Sbjct: 6544 SLPRCLGACLALCLGACLALCLGA 6473
Score = 27.3 bits (59), Expect = 3.8, Method: Composition-based stats.
Identities = 14/26 (53%), Positives = 18/26 (69%)
Frame = -2
Query: 4 RRTLPRPLSLCLSLCLCLCLAAALGS 29
R +L R L CL+LCLC LA +LG+
Sbjct: 6070 RASLARCLVACLALCLCARLALSLGA 5993
>Contig3777
Length = 4392
Score = 29.3 bits (64), Expect = 1.1, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 17/24 (70%)
Frame = +2
Query: 5 RTLPRPLSLCLSLCLCLCLAAALG 28
R L L+LCLS+CL LCL A+L
Sbjct: 2840 RCLVASLALCLSVCLSLCLVASLA 2911
Score = 26.9 bits (58), Expect = 5.9, Method: Composition-based stats.
Identities = 12/18 (66%), Positives = 14/18 (77%)
Frame = +2
Query: 11 LSLCLSLCLCLCLAAALG 28
L+LCLS+CL LCL A L
Sbjct: 2906 LALCLSVCLSLCLVARLA 2959
>Contig2029
Length = 1057
Score = 28.5 bits (62), Expect = 2.1, Method: Composition-based stats.
Identities = 13/31 (41%), Positives = 18/31 (58%)
Frame = +1
Query: 2 SPRRTLPRPLSLCLSLCLCLCLAAALGSAQS 32
SP R P PL+L ++LC LA ++ S S
Sbjct: 865 SPLRLCPSPLTLSIALCKSAILAVSINSLTS 957
>Contig4174.0
Length = 88353
Score = 27.3 bits (59), Expect = 3.6, Method: Composition-based stats.
Identities = 14/52 (26%), Positives = 26/52 (50%)
Frame = -1
Query: 35 CRDKKNCKVVFSQQELRKRLTPLQYHVTQEKGTESAFEGEYTHHKDPGIYKC 86
C D++ + + QQ L+K + + HV+QEK + + H+ D + C
Sbjct: 21486 CHDEQEHQRQWEQQRLQKNIHRTRNHVSQEKEFDRVLADAF-HYGDVHVLTC 21334
>Contig1030
Length = 3307
Score = 27.3 bits (59), Expect = 3.8, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 16/24 (66%)
Frame = -2
Query: 6 TLPRPLSLCLSLCLCLCLAAALGS 29
T R LS CL+LCLC LA LG+
Sbjct: 2952 TSTRCLSACLALCLCASLALCLGA 2881
>Contig3897
Length = 12828
Score = 27.3 bits (59), Expect = 3.9, Method: Composition-based stats.
Identities = 13/22 (59%), Positives = 15/22 (68%)
Frame = -2
Query: 6 TLPRPLSLCLSLCLCLCLAAAL 27
+LPR L CL+L LC CLA L
Sbjct: 5933 SLPRCLRACLALSLCACLARCL 5868
Score = 26.6 bits (57), Expect = 6.8, Method: Composition-based stats.
Identities = 13/23 (56%), Positives = 15/23 (65%)
Frame = -2
Query: 5 RTLPRPLSLCLSLCLCLCLAAAL 27
R L L+LCL CL LCL A+L
Sbjct: 5780 RCLGASLALCLDACLTLCLGASL 5712
>Contig3063
Length = 3053
Score = 27.3 bits (59), Expect = 4.3, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 16/24 (66%)
Frame = +1
Query: 6 TLPRPLSLCLSLCLCLCLAAALGS 29
T R LS CL+LCLC LA LG+
Sbjct: 424 TSTRCLSACLALCLCASLALCLGA 495
>Contig1390
Length = 7617
Score = 26.9 bits (58), Expect = 4.7, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 14/24 (58%)
Frame = -2
Query: 5 RTLPRPLSLCLSLCLCLCLAAALG 28
R L LSLCL CL LCL A L
Sbjct: 6128 RCLRASLSLCLGACLALCLCAKLA 6057
>Contig2433
Length = 1185
Score = 26.9 bits (58), Expect = 4.8, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 16/24 (66%)
Frame = +3
Query: 6 TLPRPLSLCLSLCLCLCLAAALGS 29
T R LS CL+LCLC LA LG+
Sbjct: 717 TSTRCLSACLALCLCASLALCLGA 788
>Contig2044
Length = 12198
Score = 26.9 bits (58), Expect = 5.3, Method: Composition-based stats.
Identities = 15/28 (53%), Positives = 18/28 (64%), Gaps = 3/28 (10%)
Frame = +1
Query: 11 LSLCLSLCLCLCLAAALG---SAQSGSC 35
L LCL CL LCL+A+L SA+S C
Sbjct: 2386 LPLCLDACLALCLSASLALCLSARSTRC 2469
>Contig3597.0
Length = 2518
Score = 26.9 bits (58), Expect = 5.3, Method: Composition-based stats.
Identities = 16/34 (47%), Positives = 19/34 (55%)
Frame = +2
Query: 2 SPRRTLPRPLSLCLSLCLCLCLAAALGSAQSGSC 35
S RR L L LCL CL CL A+L + G+C
Sbjct: 1589 SLRRCLVASLPLCLVACLARCLGASL-ALSLGAC 1687
>Contig3612
Length = 4026
Score = 26.9 bits (58), Expect = 5.5, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 16/24 (66%)
Frame = +3
Query: 6 TLPRPLSLCLSLCLCLCLAAALGS 29
T R LS CL+LCLC LA LG+
Sbjct: 2988 TSTRCLSACLALCLCASLALCLGA 3059
Score = 26.9 bits (58), Expect = 5.5, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 16/24 (66%)
Frame = +3
Query: 6 TLPRPLSLCLSLCLCLCLAAALGS 29
T R LS CL+LCLC LA LG+
Sbjct: 3132 TSTRCLSACLALCLCASLALCLGA 3203
>Contig2656
Length = 3216
Score = 26.9 bits (58), Expect = 5.6, Method: Composition-based stats.
Identities = 16/34 (47%), Positives = 18/34 (52%), Gaps = 3/34 (8%)
Frame = +1
Query: 5 RTLPRPLSLCLSLCLCLCLAAALG---SAQSGSC 35
R L L+LCL CL LCL A L A+S C
Sbjct: 1936 RCLDARLTLCLGACLALCLVARLALCLGARSALC 2037
>Contig3963.0
Length = 10960
Score = 26.9 bits (58), Expect = 6.0, Method: Composition-based stats.
Identities = 12/23 (52%), Positives = 17/23 (73%), Gaps = 1/23 (4%)
Frame = +1
Query: 5 RTLPRPL-SLCLSLCLCLCLAAA 26
R LPRPL S CL +C+ +C++ A
Sbjct: 7795 RRLPRPLFSSCLGICIGICISTA 7863
>Contig3167
Length = 1836
Score = 26.9 bits (58), Expect = 6.0, Method: Composition-based stats.
Identities = 12/23 (52%), Positives = 17/23 (73%), Gaps = 1/23 (4%)
Frame = +1
Query: 5 RTLPRPL-SLCLSLCLCLCLAAA 26
R LPRPL S CL +C+ +C++ A
Sbjct: 598 RRLPRPLFSSCLGICIGICISTA 666
>Contig4172.0
Length = 68612
Score = 26.6 bits (57), Expect = 6.3, Method: Composition-based stats.
Identities = 18/55 (32%), Positives = 26/55 (47%), Gaps = 8/55 (14%)
Frame = +3
Query: 4 RRTLPR--------PLSLCLSLCLCLCLAAALGSAQSGSCRDKKNCKVVFSQQEL 50
+RTLPR PLS+ L +C C L + RD+K VF+ ++L
Sbjct: 47802 KRTLPRQGIRVFRTPLSMLLRVCYHACR*KILVLCYAAPRRDRKFGAAVFALRQL 47966
>Contig1967
Length = 11355
Score = 26.6 bits (57), Expect = 6.8, Method: Composition-based stats.
Identities = 13/25 (52%), Positives = 16/25 (64%)
Frame = -1
Query: 6 TLPRPLSLCLSLCLCLCLAAALGSA 30
TL L CL+LCL CLA LG++
Sbjct: 6789 TLALSLCACLALCLVACLALCLGAS 6715
>Contig1894
Length = 10489
Score = 26.6 bits (57), Expect = 6.8, Method: Composition-based stats.
Identities = 13/23 (56%), Positives = 15/23 (65%)
Frame = +1
Query: 6 TLPRPLSLCLSLCLCLCLAAALG 28
+L L LCLS CL LCL A+L
Sbjct: 3937 SLIASLPLCLSACLALCLVASLA 4005
>Contig1801
Length = 9161
Score = 26.6 bits (57), Expect = 6.8, Method: Composition-based stats.
Identities = 13/23 (56%), Positives = 15/23 (65%)
Frame = +1
Query: 6 TLPRPLSLCLSLCLCLCLAAALG 28
+L L LCLS CL LCL A+L
Sbjct: 2932 SLIASLPLCLSACLALCLVASLA 3000
>Contig3202.0
Length = 2035
Score = 26.6 bits (57), Expect = 7.3, Method: Composition-based stats.
Identities = 14/27 (51%), Positives = 18/27 (66%)
Frame = +3
Query: 4 RRTLPRPLSLCLSLCLCLCLAAALGSA 30
R +LPR L L+LCLC LA LG++
Sbjct: 1668 RASLPRCLVARLALCLCARLALCLGTS 1748
>Contig3854
Length = 11825
Score = 26.6 bits (57), Expect = 7.9, Method: Composition-based stats.
Identities = 13/23 (56%), Positives = 14/23 (60%)
Frame = +2
Query: 5 RTLPRPLSLCLSLCLCLCLAAAL 27
R L L+LCL CL LCL A L
Sbjct: 5999 RCLGASLALCLRACLALCLVATL 6067
>Contig3802
Length = 5425
Score = 26.6 bits (57), Expect = 8.0, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 15/24 (62%)
Frame = -2
Query: 4 RRTLPRPLSLCLSLCLCLCLAAAL 27
R TLPR L CL+LCL LA L
Sbjct: 5283 RATLPRCLGACLALCLGASLALCL 5212
>Contig1873
Length = 10371
Score = 26.2 bits (56), Expect = 8.5, Method: Composition-based stats.
Identities = 13/23 (56%), Positives = 15/23 (65%)
Frame = -2
Query: 6 TLPRPLSLCLSLCLCLCLAAALG 28
+L L LCLS CL LCL A+L
Sbjct: 7373 SLIASLPLCLSACLALCLVASLA 7305
>Contig3185
Length = 2461
Score = 26.2 bits (56), Expect = 8.5, Method: Composition-based stats.
Identities = 13/24 (54%), Positives = 15/24 (62%)
Frame = +2
Query: 5 RTLPRPLSLCLSLCLCLCLAAALG 28
R L L+LCL CL LCL A+L
Sbjct: 1214 RCLVASLALCLVACLALCLVASLA 1285
>Contig3493.0
Length = 8609
Score = 26.2 bits (56), Expect = 8.7, Method: Composition-based stats.
Identities = 13/24 (54%), Positives = 16/24 (66%)
Frame = -1
Query: 6 TLPRPLSLCLSLCLCLCLAAALGS 29
+L R L CL+LCL CLA LG+
Sbjct: 7004 SLARCLIACLALCLGACLALCLGA 6933
Score = 26.2 bits (56), Expect = 9.1, Method: Composition-based stats.
Identities = 13/23 (56%), Positives = 15/23 (65%)
Frame = -2
Query: 5 RTLPRPLSLCLSLCLCLCLAAAL 27
R L L+LCL CL LCL A+L
Sbjct: 190 RCLGASLALCLVACLALCLCASL 122
>Contig3948
Length = 6958
Score = 26.2 bits (56), Expect = 9.1, Method: Composition-based stats.
Identities = 13/23 (56%), Positives = 15/23 (65%)
Frame = -2
Query: 5 RTLPRPLSLCLSLCLCLCLAAAL 27
R L L+LCL CL LCL A+L
Sbjct: 5712 RCLGASLALCLVACLALCLCASL 5644
>Contig369
Length = 7881
Score = 26.2 bits (56), Expect = 9.4, Method: Composition-based stats.
Identities = 11/17 (64%), Positives = 13/17 (76%)
Frame = -3
Query: 11 LSLCLSLCLCLCLAAAL 27
L+LCL CL LCL A+L
Sbjct: 6307 LALCLGACLALCLGASL 6257
>Contig3510
Length = 3840
Score = 26.2 bits (56), Expect = 9.6, Method: Composition-based stats.
Identities = 13/24 (54%), Positives = 14/24 (58%)
Frame = -2
Query: 5 RTLPRPLSLCLSLCLCLCLAAALG 28
R L L+LCL CL LCL A L
Sbjct: 1325 RCLDARLTLCLGACLALCLVARLA 1254
>Contig3491
Length = 6215
Score = 26.2 bits (56), Expect = 9.9, Method: Composition-based stats.
Identities = 11/18 (61%), Positives = 13/18 (72%)
Frame = +1
Query: 11 LSLCLSLCLCLCLAAALG 28
L+LCL CL LCL A+L
Sbjct: 1828 LTLCLRACLALCLGASLA 1881
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000024_1.0 # Protein # Selenoprotein S (SelS) # Homo
sapiens # Complete
(189 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig4088 34 0.045
Contig1208 28 2.1
Contig1785 28 3.1
Contig1489 28 3.1
Contig93 28 3.1
Contig3900 28 3.1
Contig4183.0 27 3.6
Contig3936.0 27 5.1
Contig4087 27 6.0
Contig4185.1 27 6.4
Contig4156.1 27 7.4
Contig39 26 7.9
Contig3572 26 9.5
>Contig4088
Length = 28559
Score = 33.9 bits (76), Expect = 0.045, Method: Composition-based stats.
Identities = 27/92 (29%), Positives = 48/92 (52%), Gaps = 11/92 (11%)
Frame = -2
Query: 107 LEEEKRRQKIEMWDSMQEGKSY-KGN-AKKPQEEDSPGPSTSSVL--KRKSDRKPLRGGG 162
+E R++KI MW+++ + + +GN A+ ++ +P + S +L R++ + PL GGG
Sbjct: 304 FKELSRQKKIPMWEAVLGYQKWGEGNWAQTFLKKKNPKTAFSPLLGGSRRNPKPPLEGGG 125
Query: 163 YNPLS-------GEGGGACSWRPGRRGPSSGG 187
Y+P S G G PG G ++ G
Sbjct: 124 YSPESRPKFRVLGGSSGQYKTTPGVVGKNTKG 29
>Contig1208
Length = 3602
Score = 28.1 bits (61), Expect = 2.1, Method: Composition-based stats.
Identities = 25/114 (21%), Positives = 47/114 (41%), Gaps = 8/114 (7%)
Frame = -1
Query: 48 FQKLSARLRALRQRQLDRAAAAVEPDVVVKRQEALAAARLKMQEELNAQVEKHKEKLKQL 107
+ + R + + AA +EPD+ A K+Q+E+ VEK+ + + +L
Sbjct: 1682 WNTFAGRAGTTLKSSMQEKAAGIEPDIGTLCNAAAITGAAKLQKEVTVIVEKYLDVINKL 1503
Query: 108 --------EEEKRRQKIEMWDSMQEGKSYKGNAKKPQEEDSPGPSTSSVLKRKS 153
E EK + D M + KS + + ED P + + + + K+
Sbjct: 1502 DGTLEGNAEVEKVALQTHAQDVMNKIKSLCKDIRNVTSED-PDSAKAKLFELKT 1344
>Contig1785
Length = 1995
Score = 27.7 bits (60), Expect = 3.1, Method: Composition-based stats.
Identities = 9/18 (50%), Positives = 12/18 (66%)
Frame = -3
Query: 19 LRFLHTTVGSLLATYGWY 36
RFLH+ +L + YGWY
Sbjct: 328 FRFLHSRCSNLASEYGWY 275
>Contig1489
Length = 5542
Score = 27.7 bits (60), Expect = 3.1, Method: Composition-based stats.
Identities = 9/18 (50%), Positives = 12/18 (66%)
Frame = -3
Query: 19 LRFLHTTVGSLLATYGWY 36
RFLH+ +L + YGWY
Sbjct: 2372 FRFLHSRCSNLASEYGWY 2319
>Contig93
Length = 7045
Score = 27.7 bits (60), Expect = 3.1, Method: Composition-based stats.
Identities = 9/18 (50%), Positives = 12/18 (66%)
Frame = +1
Query: 19 LRFLHTTVGSLLATYGWY 36
RFLH+ +L + YGWY
Sbjct: 4945 FRFLHSRCSNLASEYGWY 4998
>Contig3900
Length = 3419
Score = 27.7 bits (60), Expect = 3.1, Method: Composition-based stats.
Identities = 9/18 (50%), Positives = 12/18 (66%)
Frame = +2
Query: 19 LRFLHTTVGSLLATYGWY 36
RFLH+ +L + YGWY
Sbjct: 2024 FRFLHSRCSNLASEYGWY 2077
>Contig4183.0
Length = 31250
Score = 27.3 bits (59), Expect = 3.6, Method: Composition-based stats.
Identities = 9/18 (50%), Positives = 12/18 (66%)
Frame = +2
Query: 19 LRFLHTTVGSLLATYGWY 36
RFLH+ +L + YGWY
Sbjct: 28343 FRFLHSRCSNLASEYGWY 28396
>Contig3936.0
Length = 13677
Score = 26.9 bits (58), Expect = 5.1, Method: Composition-based stats.
Identities = 19/62 (30%), Positives = 34/62 (54%), Gaps = 12/62 (19%)
Frame = +3
Query: 62 QLDRAAAAVEPDVVVKRQEALAAAR-----------LKMQEELNAQVEKHKEKL-KQLEE 109
++ +A ++V+P+ + + ++A A+ +M+EEL A EK EKL K LE+
Sbjct: 5349 EITKAMSSVDPEKLKQLEDAFEMAKNTFNNATTEDLERMEEELKAFFEKENEKLQKDLED 5528
Query: 110 EK 111
K
Sbjct: 5529 GK 5534
>Contig4087
Length = 38589
Score = 26.6 bits (57), Expect = 6.0, Method: Composition-based stats.
Identities = 14/70 (20%), Positives = 36/70 (51%), Gaps = 1/70 (1%)
Frame = -1
Query: 80 EALAAARLKMQEELNAQVEKHKEKLKQLEEEKRRQKIEMW-DSMQEGKSYKGNAKKPQEE 138
E + ++R + E+ ++ + + L + ++ + ++ W ++ E K GN ++ ++
Sbjct: 33360 EVVRSSRPSIAEKPDSPTHRSSRQTHDLAQTQQTKDVKRWLQNVNEVKQMVGNNERESKQ 33181
Query: 139 DSPGPSTSSV 148
PG + SSV
Sbjct: 33180 PGPGVNASSV 33151
>Contig4185.1
Length = 169422
Score = 26.6 bits (57), Expect = 6.4, Method: Composition-based stats.
Identities = 26/108 (24%), Positives = 51/108 (47%), Gaps = 6/108 (5%)
Frame = -3
Query: 18 GLRFLHTTVGSL-----LATYGWYIVFSCILLYVVFQKLSARLRALRQRQLDRAAAAVEP 72
+ FLH +GS+ + TY YI+ L V +KL +++ ++ E
Sbjct: 106873 AMAFLHGVLGSVKDDNAVTTYDNYILDESKRLNKVLEKLQSQIGKGSAGLVESVTQVTEW 106694
Query: 73 DVVVKRQEALAAARLK-MQEELNAQVEKHKEKLKQLEEEKRRQKIEMW 119
+R+ L + + K + L QVE+HK+ ++Q +++ ++E W
Sbjct: 106693 LEGYEREVKLKSDKEKDLIINLENQVEEHKKFIEQEKDKGLTTQMEYW 106550
>Contig4156.1
Length = 51696
Score = 26.6 bits (57), Expect = 7.4, Method: Composition-based stats.
Identities = 14/22 (63%), Positives = 15/22 (68%)
Frame = +3
Query: 60 QRQLDRAAAAVEPDVVVKRQEA 81
QRQL R A E DVV KRQE+
Sbjct: 20409 QRQLQREAMLPEIDVVHKRQES 20474
>Contig39
Length = 10315
Score = 26.2 bits (56), Expect = 7.9, Method: Composition-based stats.
Identities = 10/18 (55%), Positives = 12/18 (66%)
Frame = -3
Query: 19 LRFLHTTVGSLLATYGWY 36
L+FLHT G A +GWY
Sbjct: 5042 LKFLHTQCGLSAAYHGWY 4989
>Contig3572
Length = 3662
Score = 26.2 bits (56), Expect = 9.5, Method: Composition-based stats.
Identities = 13/55 (23%), Positives = 34/55 (61%)
Frame = +2
Query: 77 KRQEALAAARLKMQEELNAQVEKHKEKLKQLEEEKRRQKIEMWDSMQEGKSYKGN 131
K ++++A +++++ + + K + ++ L+EEKRRQ ++ ++++ K Y N
Sbjct: 1190 KVKDSIAKESDQLKQKQHMEFLKILDGIRDLKEEKRRQFKDVNTALKDAKRYVTN 1354
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000025_1.0 # Protein # Selenoprotein T (SelT) # Homo
sapiens # Complete
(195 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig4180 28 3.0
>Contig4180
Length = 166360
Score = 27.7 bits (60), Expect = 3.0, Method: Composition-based stats.
Identities = 14/46 (30%), Positives = 24/46 (52%), Gaps = 2/46 (4%)
Frame = +2
Query: 43 FQIC--VSUGYRRVFEEYMRVISQRYPDIRIEGENYLPQPIYRHIA 86
F IC V G RR E+ + + +P + ++ P+P++RH A
Sbjct: 36179 FSICWTVRFGQRRAKEQDLCQDKEEHPSVPSRVKSSFPRPLWRHYA 36316
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000026_1.0 # Protein # Selenoprotein U1 (SelU1) # Homo
sapiens # Complete
(229 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig3341.0 28 2.6
Contig4089 28 2.7
Contig3524 27 5.6
Contig2788 27 5.7
Contig4050 27 7.7
Contig4035 27 7.7
Contig3155 27 9.5
>Contig3341.0
Length = 1916
Score = 28.5 bits (62), Expect = 2.6, Method: Composition-based stats.
Identities = 14/28 (50%), Positives = 15/28 (53%)
Frame = -1
Query: 77 IMAVRRPGCFLCREEAADLSSLKSMLDQ 104
I AVRRP FLCR L S+ L Q
Sbjct: 827 IHAVRRPNAFLCRFSICYLESIYRTLAQ 744
>Contig4089
Length = 34055
Score = 28.5 bits (62), Expect = 2.7, Method: Composition-based stats.
Identities = 13/29 (44%), Positives = 16/29 (55%)
Frame = +1
Query: 81 RRPGCFLCREEAADLSSLKSMLDQLGVPL 109
+RP C CR++ L S L QLGV L
Sbjct: 2839 QRPCCMSCRKQLTMLRSWNGYLKQLGVVL 2925
>Contig3524
Length = 5857
Score = 27.3 bits (59), Expect = 5.6, Method: Composition-based stats.
Identities = 14/36 (38%), Positives = 22/36 (61%)
Frame = +1
Query: 89 REEAADLSSLKSMLDQLGVPLYAVVKEHIRTEVKDF 124
R + ADL LKS++D+ + ++ E+I T VK F
Sbjct: 4027 RRKRADLLGLKSLVDRQLDTVKGIIGENINTGVKGF 4134
>Contig2788
Length = 5157
Score = 27.3 bits (59), Expect = 5.7, Method: Composition-based stats.
Identities = 14/36 (38%), Positives = 22/36 (61%)
Frame = -3
Query: 89 REEAADLSSLKSMLDQLGVPLYAVVKEHIRTEVKDF 124
R + ADL LKS++D+ + ++ E+I T VK F
Sbjct: 4099 RRKRADLLGLKSLVDRQLDTVKGIIGENINTGVKGF 3992
>Contig4050
Length = 26651
Score = 26.9 bits (58), Expect = 7.7, Method: Composition-based stats.
Identities = 17/52 (32%), Positives = 28/52 (53%)
Frame = +1
Query: 38 LSKPQKAALEYLEDIDLKTLEKEPRTFKAKELWEKNGAVIMAVRRPGCFLCR 89
L+K Q + E D KTL++ PRT K ++ + +G ++ +R G L R
Sbjct: 310 LTKSQSLRVLSAEL*DHKTLQRTPRTGKGEKTVKGHGDILDGLRVTGKRLLR 465
>Contig4035
Length = 28677
Score = 26.9 bits (58), Expect = 7.7, Method: Composition-based stats.
Identities = 17/52 (32%), Positives = 28/52 (53%)
Frame = -1
Query: 38 LSKPQKAALEYLEDIDLKTLEKEPRTFKAKELWEKNGAVIMAVRRPGCFLCR 89
L+K Q + E D KTL++ PRT K ++ + +G ++ +R G L R
Sbjct: 4497 LTKSQSLRVLSAEL*DHKTLQRTPRTGKGEKTVKGHGDILDGLRVTGKRLLR 4342
>Contig3155
Length = 23659
Score = 26.6 bits (57), Expect = 9.5, Method: Composition-based stats.
Identities = 12/37 (32%), Positives = 23/37 (62%)
Frame = -3
Query: 93 ADLSSLKSMLDQLGVPLYAVVKEHIRTEVKDFQPYFK 129
ADL++LKS++ + ++ +V E ++T +K F K
Sbjct: 7571 ADLAALKSLVQKQRDNIFNIVNEDLKTGIKGFLQILK 7461
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000027_1.0 # Protein # Selenoprotein U2 (SelU2) # Homo
sapiens # Complete
(226 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig4084.1 27 7.3
>Contig4084.1
Length = 42282
Score = 26.9 bits (58), Expect = 7.3, Method: Composition-based stats.
Identities = 14/37 (37%), Positives = 20/37 (54%)
Frame = +3
Query: 172 DPAQQGGTLILGPGNNIHFIHRDRNRLDHKPINSVLQ 208
DPA +GG +L P N+ + R R PI S++Q
Sbjct: 38448 DPANRGGARLLDPSVNVEHLVRGSLR----PIASIIQ 38546
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000028_1.0 # Protein # Selenoprotein U3 (SelU3) # Homo
sapiens # Complete
(198 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig1916 31 0.29
Contig3884 27 3.9
Contig3175 27 3.9
>Contig1916
Length = 4586
Score = 31.2 bits (69), Expect = 0.29, Method: Composition-based stats.
Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 3/36 (8%)
Frame = -2
Query: 17 AVTGEAVELRSLWREHACVVAGLRRF---GCVVCRW 49
AV GE E+RS AC++AG R F C +C+W
Sbjct: 2731 AVAGER*EIRSA----ACIIAGTRYFHRQACQICKW 2636
>Contig3884
Length = 22305
Score = 27.3 bits (59), Expect = 3.9, Method: Composition-based stats.
Identities = 13/38 (34%), Positives = 19/38 (50%)
Frame = -1
Query: 20 GEAVELRSLWREHACVVAGLRRFGCVVCRWIAQDLSSL 57
GE+ LWRE+ + L+ G CRW+ SS+
Sbjct: 13860 GESKMRARLWRENEPLGPSLKVAGPETCRWLKSKYSSI 13747
>Contig3175
Length = 4112
Score = 27.3 bits (59), Expect = 3.9, Method: Composition-based stats.
Identities = 13/38 (34%), Positives = 19/38 (50%)
Frame = +3
Query: 20 GEAVELRSLWREHACVVAGLRRFGCVVCRWIAQDLSSL 57
GE+ LWRE+ + L+ G CRW+ SS+
Sbjct: 2910 GESKMRARLWRENEPLGPSLKVAGPETCRWLKSKYSSI 3023
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000029_1.0 # Protein # Selenoprotein V (SelV) # Homo
sapiens # Complete
(346 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig4158 41 0.001
Contig4092 32 0.58
Contig4160 31 0.70
Contig4091.1 31 0.91
Contig4149.1 30 1.1
Contig2658 30 1.3
Contig4175.0 30 1.3
Contig3854 30 1.5
Contig3373.1 29 2.7
Contig4181.1 28 4.6
Contig3858 28 4.7
Contig4132 28 8.2
Contig4151 28 8.7
Contig4142.0 27 8.9
>Contig4158
Length = 87318
Score = 40.8 bits (94), Expect = 0.001, Method: Composition-based stats.
Identities = 31/85 (36%), Positives = 38/85 (44%), Gaps = 7/85 (8%)
Frame = +2
Query: 66 PAQIPTLVPTPALARIPRLVPPPAPAWIPTPVPTPVPVRNPTPVPTPAR-----TLTPPV 120
P ++P VP A VP P PA +P P R P +PAR T T P
Sbjct: 22391 PDRVPVQVPAAAHVH----VPAPVPARVPVSATATSPARVHVPAASPARATVSATATSPA 22558
Query: 121 R--VPAPAPAQLLAGIRAALPVLDS 143
R VPAP PA+ +R +PV S
Sbjct: 22559 RVHVPAPVPAKSAGVVRQQVPVPSS 22633
>Contig4092
Length = 35044
Score = 31.6 bits (70), Expect = 0.58, Method: Composition-based stats.
Identities = 13/34 (38%), Positives = 17/34 (50%)
Frame = -3
Query: 21 ASTPTRTPTPLRTPTPVRTRTPIRTLTPVLTPSP 54
++ P TP PL T P T P+ + P TP P
Sbjct: 24383 STKPAETPKPLSTTKPAETPKPLGNVKPAETPKP 24282
Score = 27.7 bits (60), Expect = 7.9, Method: Composition-based stats.
Identities = 16/48 (33%), Positives = 18/48 (37%)
Frame = -3
Query: 30 PLRTPTPVRTRTPIRTLTPVLTPSPAGTSPLVLTPAPAQIPTLVPTPA 77
P + P T P+ T P TP P G TP P T T A
Sbjct: 24392 PAVSTKPAETPKPLSTTKPAETPKPLGNVKPAETPKPVAADTSSGTSA 24249
Score = 27.3 bits (59), Expect = 9.2, Method: Composition-based stats.
Identities = 15/50 (30%), Positives = 20/50 (40%)
Frame = -3
Query: 9 APSSARTSTSVRASTPTRTPTPLRTPTPVRTRTPIRTLTPVLTPSPAGTS 58
A A ++ P T P TP P+ P T PV + +GTS
Sbjct: 24401 ASEPAVSTKPAETPKPLSTTKPAETPKPLGNVKPAETPKPVAADTSSGTS 24252
>Contig4160
Length = 71491
Score = 31.2 bits (69), Expect = 0.70, Method: Composition-based stats.
Identities = 23/70 (32%), Positives = 34/70 (48%), Gaps = 1/70 (1%)
Frame = +3
Query: 7 TPAPS-SARTSTSVRASTPTRTPTPLRTPTPVRTRTPIRTLTPVLTPSPAGTSPLVLTPA 65
TP PS S ++S S + S TP P + P T T P + P+ ++ + +TP
Sbjct: 58215 TPKPSESTKSSVSAKVSV---TPKPSESAKPSETAKVSLTPKPSESAKPSESATVSVTPN 58385
Query: 66 PAQIPTLVPT 75
+ IP LV T
Sbjct: 58386 ESAIPQLVVT 58415
Score = 28.1 bits (61), Expect = 5.8, Method: Composition-based stats.
Identities = 28/113 (24%), Positives = 45/113 (39%), Gaps = 9/113 (7%)
Frame = +3
Query: 12 SARTSTSVRASTPTRTPTPLR---TPTPVRTRTPIRTLTPVLTPSPAGTSP------LVL 62
SA+ S +AS T++ + TP P + P T LTP P+ ++ + +
Sbjct: 58089 SAKVSAFSKASESTKSSVSAKVSVTPKPSESAKPSETAKVSLTPKPSESTKSSVSAKVSV 58268
Query: 63 TPAPAQIPTLVPTPALARIPRLVPPPAPAWIPTPVPTPVPVRNPTPVPTPART 115
TP P++ T ++ P+ P+ T TP P V T T
Sbjct: 58269 TPKPSESAKPSETAKVSLTPKPSESAKPSESATVSVTPNESAIPQLVVTSVET 58427
>Contig4091.1
Length = 11179
Score = 30.8 bits (68), Expect = 0.91, Method: Composition-based stats.
Identities = 22/74 (29%), Positives = 30/74 (40%)
Frame = +2
Query: 123 PAPAPAQLLAGIRAALPVLDSYLAPALPLDPPPEPAPELPLLPEEDPEPAPSLKLIPSVS 182
PA AP + V D+ APA ++ APE P+ E P P+ P +
Sbjct: 1418 PAEAPKETTGCCSTPEKVADTPQAPATTVEASSTGAPEKPVEMTESPAATPTSTAEPGIP 1597
Query: 183 SEAGPAPGPLPTRT 196
+ G A G T T
Sbjct: 1598 TAPGDASGLRGTST 1639
>Contig4149.1
Length = 48505
Score = 30.4 bits (67), Expect = 1.1, Method: Composition-based stats.
Identities = 22/74 (29%), Positives = 30/74 (40%)
Frame = +1
Query: 123 PAPAPAQLLAGIRAALPVLDSYLAPALPLDPPPEPAPELPLLPEEDPEPAPSLKLIPSVS 182
PA AP + V D+ APA ++ APE P+ E P P+ P +
Sbjct: 47932 PAEAPKETTGCCSTPEKVADTPQAPATMVEASSTGAPEKPVEMTESPAATPTSTAEPGIP 48111
Query: 183 SEAGPAPGPLPTRT 196
+ G A G T T
Sbjct: 48112 TAPGDASGLRGTST 48153
>Contig2658
Length = 1246
Score = 30.4 bits (67), Expect = 1.3, Method: Composition-based stats.
Identities = 17/49 (34%), Positives = 23/49 (46%)
Frame = +2
Query: 6 RTPAPSSARTSTSVRASTPTRTPTPLRTPTPVRTRTPIRTLTPVLTPSP 54
+T ++ S SV + P TP PL T P T P+ + P TP P
Sbjct: 290 QTGRETAEGASVSVVGAKPAETPKPLSTTKPAETPKPLGNVKPAETPKP 436
>Contig4175.0
Length = 80716
Score = 30.4 bits (67), Expect = 1.3, Method: Composition-based stats.
Identities = 25/83 (30%), Positives = 35/83 (42%)
Frame = -3
Query: 28 PTPLRTPTPVRTRTPIRTLTPVLTPSPAGTSPLVLTPAPAQIPTLVPTPALARIPRLVPP 87
P P + P P + P + P+P + PAP Q+P P PR +P
Sbjct: 8699 PVPRQFPRPAPRQLPRPAPRQLPRPAPRQLPRQLPRPAPRQLPRPAPRQLPRPAPRQLPR 8520
Query: 88 PAPAWIPTPVPTPVPVRNPTPVP 110
PAP +P P P +P P +P
Sbjct: 8519 PAPRQLPRPAPRQLPRPAPRQLP 8451
>Contig3854
Length = 11825
Score = 30.0 bits (66), Expect = 1.5, Method: Composition-based stats.
Identities = 40/158 (25%), Positives = 66/158 (41%), Gaps = 6/158 (3%)
Frame = +2
Query: 4 QARTPAPSSARTS----TSVRASTPTRTPTPLRTPTPVRTRTPIRTLT-PVLTPSPAGTS 58
+A +P P+ + T+ +T TP LRT +P TL P+ +P PA S
Sbjct: 3788 RAHSPQPADISVT*QALTNCNTTTAAPTPLNLRTYQSPSKHSPTATLPLPLHSPQPASKS 3967
Query: 59 PLVLTPAPAQIPTLVPTPALARIPRLVPPPAP-AWIPTPVPTPVPVRNPTPVPTPARTLT 117
Q TP R + + +P A +P P+P P ++ + T + T T
Sbjct: 3968 ITKQRTHQLQHYHCRSTPLNLRTCQSLKQHSPTATLPLPLPVPSTCKHINHLATHSPTAT 4147
Query: 118 PPVRVPAPAPAQLLAGIRAALPVLDSYLAPALPLDPPP 155
P+ P P+ + + + AL ++ A PL+ P
Sbjct: 4148 LPLPRPVPSTCEHINHLSNALTNCNTTTAAPTPLNLQP 4261
>Contig3373.1
Length = 10577
Score = 29.3 bits (64), Expect = 2.7, Method: Composition-based stats.
Identities = 41/153 (26%), Positives = 62/153 (40%), Gaps = 9/153 (5%)
Frame = +2
Query: 8 PAPSSART-STSVRASTPTRTPTPLRTPTPVRTRTPIRTLTPVLTPSPAGT-------SP 59
PAPS+ + T R+ P PL P+ + ++P LT P P S
Sbjct: 1550 PAPSTCKL*ITKQRSHQLQHVPLPLSVPSTCKHQSPSNGLTNYHVPLPLPVPSTCRHISH 1729
Query: 60 LVLTPAPAQIPTLVPTPALAR-IPRLVPPPAPAWIPTPVPTPVPVRNPTPVPTPARTLTP 118
L A +P +P P+ R I L A +P P+P P R+ + + T + T T
Sbjct: 1730 LATHSPTATLPLPLPVPSTCRHISHLATHSPTATLPLPLPVPSTCRHISHLATHSPTATL 1909
Query: 119 PVRVPAPAPAQLLAGIRAALPVLDSYLAPALPL 151
P+ +P P+ L+ + P A PL
Sbjct: 1910 PLPLPVPSTCNLINHLATHSPNATRTTAATSPL 2008
>Contig4181.1
Length = 107582
Score = 28.5 bits (62), Expect = 4.6, Method: Composition-based stats.
Identities = 14/26 (53%), Positives = 17/26 (65%)
Frame = -1
Query: 35 TPVRTRTPIRTLTPVLTPSPAGTSPL 60
TP R +TP R L+ PSP GT+PL
Sbjct: 15491 TPARPQTPTRALSGASEPSPHGTTPL 15414
>Contig3858
Length = 14292
Score = 28.5 bits (62), Expect = 4.7, Method: Composition-based stats.
Identities = 9/20 (45%), Positives = 12/20 (60%)
Frame = +1
Query: 96 PVPTPVPVRNPTPVPTPART 115
P+ P P PTP+P+P T
Sbjct: 3943 PIKDPTPAEPPTPIPSPVST 4002
>Contig4132
Length = 50512
Score = 27.7 bits (60), Expect = 8.2, Method: Composition-based stats.
Identities = 27/78 (34%), Positives = 31/78 (39%), Gaps = 7/78 (8%)
Frame = +1
Query: 45 TLTPVLTPSPAGTSPLVLTPAPAQIPTLVPTPALARIPRLVP-------PPAPAWIPTPV 97
T T V P SP LTP PTP++ PR+ P P APA +
Sbjct: 14017 TQTAVNRPQAPTLSPNQLTPQIHPTSQTPPTPSMQPTPRIPPTRATQPIPHAPAPSSPSL 14196
Query: 98 PTPVPVRNPTPVPTPART 115
T V R P VP RT
Sbjct: 14197 TTRVSNRRPYFVPGGGRT 14250
>Contig4151
Length = 44706
Score = 27.7 bits (60), Expect = 8.7, Method: Composition-based stats.
Identities = 14/38 (36%), Positives = 19/38 (50%)
Frame = -3
Query: 26 RTPTPLRTPTPVRTRTPIRTLTPVLTPSPAGTSPLVLT 63
R PTP+R P PVR P+ T + T+P V +
Sbjct: 9475 RAPTPVRVPKPVRRAPPVCRCTVCAALASLYTTPYVTS 9362
>Contig4142.0
Length = 25213
Score = 27.3 bits (59), Expect = 8.9, Method: Composition-based stats.
Identities = 23/99 (23%), Positives = 37/99 (37%), Gaps = 6/99 (6%)
Frame = +3
Query: 37 VRTRTPIRTLTPVLTPSPAGTSPLVLTPAPAQIPTLVPTPALARIPRLVPPPAPAWIPTP 96
+ T PI+ P + P + P P I +P P L RL PPP + P
Sbjct: 21255 IPTGYPIKLQKPPMKPPTYSKKSMPFPPVPDLIGEAIPDPTLN--TRLPPPPIEIKVRDP 21428
Query: 97 ------VPTPVPVRNPTPVPTPARTLTPPVRVPAPAPAQ 129
V + + R P+ + P ++ P P++
Sbjct: 21429 YSQYDTVNSDIARRPSLPIVESVKIPLPSIKYSRPPPSK 21545
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000030_1.0 # Protein # Selenoprotein W1 (SelW1) # Homo
sapiens # Complete
(87 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig4159 25 6.9
>Contig4159
Length = 55924
Score = 24.6 bits (52), Expect = 6.9, Method: Composition-based stats.
Identities = 10/26 (38%), Positives = 16/26 (61%)
Frame = -1
Query: 33 RLDICGEGTPQATGFFEVMVAGKLIH 58
R+ +CG+G P+ + V + G LIH
Sbjct: 25663 RVCVCGDGRPRPGSYPCVQLYGLLIH 25586
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000031_1.0 # Protein # Selenoprotein W2 (SelW2) # Homo
sapiens # Complete
(115 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig4102 30 0.27
Contig4186.0 26 3.6
Contig3872 25 5.2
Contig2834 25 7.8
>Contig4102
Length = 42529
Score = 29.6 bits (65), Expect = 0.27, Method: Composition-based stats.
Identities = 19/74 (25%), Positives = 32/74 (43%)
Frame = -2
Query: 38 TYLELASAVKEQYPGIEIESRLGGTGAFEIEINGQLVFSKLENGGFPYEKDLIEAIRRAS 97
+Y +L + +PG E+ + G F E G F ++ +G FP + AI+
Sbjct: 35772 SYYQLHMVHHQIHPGDELRTVSLHLGKFHQERRGVRRFHRVTHGSFPIQNPPCSAIQHLY 35593
Query: 98 NGETLEKITNSRPP 111
+ L +I R P
Sbjct: 35592 HYSNLHRIPIRRRP 35551
>Contig4186.0
Length = 78499
Score = 25.8 bits (55), Expect = 3.6, Method: Composition-based stats.
Identities = 10/24 (41%), Positives = 14/24 (58%)
Frame = -3
Query: 29 YCEPCGFEATYLELASAVKEQYPG 52
YC CG T++ A+ V Q+PG
Sbjct: 34688 YCRCCGAGVTWMCYAAVVAGQHPG 34617
>Contig3872
Length = 5716
Score = 25.4 bits (54), Expect = 5.2, Method: Composition-based stats.
Identities = 15/38 (39%), Positives = 16/38 (42%)
Frame = -2
Query: 6 GQTSVAPPPEEVEPGSGVRIVVEYCEPCGFEATYLELA 43
GQ SV PP PG Y P G+ AT E A
Sbjct: 3960 GQQSVDKPPLPGHPGQHFSAGQHYLAPSGYHATTAEDA 3847
>Contig2834
Length = 7710
Score = 24.6 bits (52), Expect = 7.8, Method: Composition-based stats.
Identities = 11/37 (29%), Positives = 22/37 (59%)
Frame = -3
Query: 36 EATYLELASAVKEQYPGIEIESRLGGTGAFEIEINGQ 72
+ T ++L+ ++ E+E + GTG +I+ING+
Sbjct: 5011 DGTSVDLSMYNADKDVAAEVEKHIPGTGKNDIQINGR 4901
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000034_1.0 # Protein # Thioredoxin reductase 1 (TR1) #
Homo sapiens # Complete
(499 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig4075 177 2e-95
Contig3884 64 4e-17
Contig4130 56 1e-11
Contig4146 29 5.2
Contig4119 28 8.9
>Contig4075
Length = 25623
Score = 177 bits (449), Expect(3) = 2e-95, Method: Composition-based stats.
Identities = 110/270 (40%), Positives = 157/270 (58%), Gaps = 5/270 (1%)
Frame = +2
Query: 103 EAVQNHIGSLNWGYRVALREKKVVYENAYGQFIGPHRIKATNNKGKEKIYSAERFLIATG 162
+ VQN+I LN+ YR L K V Y NA + + T+ KG++ A+ LIA G
Sbjct: 19277 QTVQNYIKMLNFSYRSGLMSKNVEYVNAMASLADKNTVTYTDKKGEKHQIKAKNVLIAIG 19456
Query: 163 ERPRYLGIPGDK----EYCISSDDLFSLPYCPGKTLVVGASYVALECAGFLAGIGLDVTV 218
RP IP D +Y I+SDDL S GKTL+VG S+VALECAGFL +G DVTV
Sbjct: 19457 ARPT---IPSDVKGAWDYSITSDDLMSRKEPVGKTLIVGGSFVALECAGFLTSMGYDVTV 19627
Query: 219 MVRSILLRGFDQDMANKIGEHMEEHGIKFIRQFVPIKVEQIEAGTPGRLRVVAQSTNSEE 278
VRS++LRGFD+ A+K+ E M G KF VP + ++E GRL + +S+
Sbjct: 19628 AVRSLILRGFDRQCADKVQELMLATGTKFKNGVVPQAITKLEN---GRLYIEFTDGSSD- 19795
Query: 279 IIEGEYNTVMLAIGRDACTR-KIGLETVGVKINEKTGKIPVTDEEQTNVPYIYAIGDILE 337
E++T+M A GR +R + L VG+K +E GKI + ++ +T+V +YA+GD++E
Sbjct: 19796 ----EFDTLMYATGRSVSSRMQKELSDVGIKFSE-YGKI-IAEDGKTSVEGVYAVGDVVE 19957
Query: 338 DKVELTPVAIQAGRLLAQRLYAGSTVKCDY 367
L PVA++ G LLA+R++ S K +
Sbjct: 19958 GNPALAPVAVKDGELLARRIFGNSDKKVGF 20047
Score = 110 bits (276), Expect(3) = 2e-95, Method: Composition-based stats.
Identities = 60/144 (41%), Positives = 84/144 (58%), Gaps = 14/144 (9%)
Frame = +3
Query: 362 TVKCDYENVPTTVFTPLEYGACGLSEEKAVEKFGEENIEVYHSYFWPLEWTIPSRDNNK- 420
T + D +P VFTP EY CG+SEE A + +GE+ +VY + LE+ R+ +
Sbjct: 20067 TSQLDLNYIPMCVFTPYEYARCGISEETASKLYGED-FDVYLKEYTTLEFAAVHREKVES 20243
Query: 421 -------------CYAKIICNTKDNERVVGFHVLGPNAGEVTQGFAAALKCGLTKKQLDS 467
C +K+IC K + VVG H +GPNAGE+ QG A++ G+TK+++D
Sbjct: 20244 LRADEFDVDMPPTCLSKMIC--KKDGTVVGLHFVGPNAGEIMQGLCMAVRKGITKEEIDD 20417
Query: 468 TIGIHPVCAEVFTTLSVTKRSGAS 491
TIGIHP AE F L+VTK+SG S
Sbjct: 20418 TIGIHPTDAESFVNLTVTKKSGES 20489
Score = 101 bits (252), Expect(3) = 2e-95, Method: Composition-based stats.
Identities = 54/90 (60%), Positives = 63/90 (70%)
Frame = +3
Query: 13 YDLIIIGGGSGGLAAAKEAAQYGKKVMVLDFVTPTPLGTRWGLGGTCVNVGCIPKKLMHQ 72
YD +IGGG GGLAAAKEAA G K ++ D+V P+P GT WGLGGTCVNVGCIPKKLMH
Sbjct: 18972 YDFAVIGGGCGGLAAAKEAASLGAKTILFDYVRPSPRGTTWGLGGTCVNVGCIPKKLMHY 19151
Query: 73 AALLGQALQDSRNYGWKVEETVKHDWDRMI 102
A +LG + D GW E HDW R++
Sbjct: 19152 AGILGHSSHDREALGWGNHEG-PHDWGRLV 19238
>Contig3884
Length = 22305
Score = 64.3 bits (155), Expect(2) = 4e-17, Method: Composition-based stats.
Identities = 64/254 (25%), Positives = 115/254 (45%), Gaps = 3/254 (1%)
Frame = +2
Query: 229 DQDMANKIGEHMEEHGIKFIRQFVPIKVEQIEAGTPGRLRVVAQSTNSEEIIEGEYNTVM 288
DQD++ I + +E+ GI+ V + + + VV N ++I E + + V+
Sbjct: 3725 DQDVSATIKKLIEKQGIE-----VSVSTKVLGGEAKEDCAVVTAEKNGQKI-EFKGDVVL 3886
Query: 289 LAIGRDACTRKIGLETVGVKINEKTGKIPVTDEEQT-NVPYIYAIGDILEDKVELTPVAI 347
LA+GR T +GLE +G+K + G I V D+ + N P I AIGD++ + L A
Sbjct: 3887 LAMGRRPFTSNLGLENLGIKT--ERGCIVVDDKLRIPNHPNISAIGDVIPGPM-LAHKAE 4057
Query: 348 QAGRLLAQRLYAGSTVKCDYENVPTTVFTPLEYGACGLSEEKAVEKFGEENIEVYHSYFW 407
+ G + + ++ +VP ++T E G S E++++ G Y +
Sbjct: 4058 EDGSVALGNILGKDLGHVNWNHVPMVIYTHPEVAGVG-STEQSLKAAGV----AYKKASF 4222
Query: 408 PLEWTIPSR--DNNKCYAKIICNTKDNERVVGFHVLGPNAGEVTQGFAAALKCGLTKKQL 465
P SR + KI+ + ++ +++G ++GP A E+ + CGLT +
Sbjct: 4223 PFAANSRSRVAGETDGFVKILAD--NDNKILGGWIIGPQASELIGQITLMMACGLTTVDV 4396
Query: 466 DSTIGIHPVCAEVF 479
HP +E
Sbjct: 4397 SKVCFAHPTISEAL 4438
Score = 41.6 bits (96), Expect(2) = 4e-17, Method: Composition-based stats.
Identities = 32/112 (28%), Positives = 59/112 (52%), Gaps = 3/112 (2%)
Frame = +1
Query: 111 SLNWGYRVALREKKVVYENAYGQFIGPHRIKATNNKGKEKIYSAERFLIATG-ERPRYLG 169
+L+ G + ++ V + + +G + P+ I+ +G EK+ +A+ +IATG E R+ G
Sbjct: 3334 TLDAGIKGLFKKNGVDFISGHGTLVAPNEIQI---EGGEKV-TAKNIIIATGSEVTRFPG 3501
Query: 170 --IPGDKEYCISSDDLFSLPYCPGKTLVVGASYVALECAGFLAGIGLDVTVM 219
+ D + ISSD+ L P +V+G + LE A + +G VT++
Sbjct: 3502 DSLKIDGKRIISSDEALMLDEVPKTMVVIGGGAIGLELASVWSRLGAKVTIV 3657
Score = 55.8 bits (133), Expect = 5e-08, Method: Composition-based stats.
Identities = 29/58 (50%), Positives = 35/58 (60%)
Frame = +2
Query: 13 YDLIIIGGGSGGLAAAKEAAQYGKKVMVLDFVTPTPLGTRWGLGGTCVNVGCIPKKLM 70
YDL +IGGG GG A + AQ+G KV +D R LGGTC+NVGCIP K +
Sbjct: 2819 YDLAVIGGGPGGYTTAIKGAQHGLKVACID--------RRKTLGGTCLNVGCIPSKCL 2968
>Contig4130
Length = 53027
Score = 45.4 bits (106), Expect(2) = 1e-11, Method: Composition-based stats.
Identities = 43/157 (27%), Positives = 75/157 (47%), Gaps = 1/157 (0%)
Frame = -1
Query: 229 DQDMANKIGEHMEEHGIKFIRQFVPIKVEQIEAGTPGRLRVVAQSTNSEEIIEGEYNTVM 288
DQD++ I + +E+ GI+ V + + + VV N ++I E + + V+
Sbjct: 452 DQDVSATIKKLIEKQGIE-----VSVSTKVLGGEAKEDCAVVTAEKNGQKI-EFKGDVVL 291
Query: 289 LAIGRDACTRKIGLETVGVKINEKTGKIPVTDEEQT-NVPYIYAIGDILEDKVELTPVAI 347
LA+GR T +GLE +G+K + G I V D+ + N P I AIGD++ + L A
Sbjct: 290 LAMGRRPFTSNLGLENLGIKT--ERGCIVVDDKLRIPNHPNISAIGDVIPGPM-LAHKAE 120
Query: 348 QAGRLLAQRLYAGSTVKCDYENVPTTVFTPLEYGACG 384
+ G + + ++ +VP ++T E G
Sbjct: 119 EDGSVALGNILGKDLGHVNWNHVPMVIYTHPEVAGVG 9
Score = 41.6 bits (96), Expect(2) = 1e-11, Method: Composition-based stats.
Identities = 32/112 (28%), Positives = 59/112 (52%), Gaps = 3/112 (2%)
Frame = -3
Query: 111 SLNWGYRVALREKKVVYENAYGQFIGPHRIKATNNKGKEKIYSAERFLIATG-ERPRYLG 169
+L+ G + ++ V + + +G + P+ I+ +G EK+ +A+ +IATG E R+ G
Sbjct: 843 TLDAGIKGLFKKNGVDFISGHGTLVAPNEIQI---EGGEKV-TAKNIIIATGSEVTRFPG 676
Query: 170 --IPGDKEYCISSDDLFSLPYCPGKTLVVGASYVALECAGFLAGIGLDVTVM 219
+ D + ISSD+ L P +V+G + LE A + +G VT++
Sbjct: 675 DSLKIDGKRIISSDEALMLDEVPKTMVVIGGGAIGLELASVWSRLGAKVTIV 520
Score = 55.8 bits (133), Expect = 5e-08, Method: Composition-based stats.
Identities = 29/58 (50%), Positives = 35/58 (60%)
Frame = -1
Query: 13 YDLIIIGGGSGGLAAAKEAAQYGKKVMVLDFVTPTPLGTRWGLGGTCVNVGCIPKKLM 70
YDL +IGGG GG A + AQ+G KV +D R LGGTC+NVGCIP K +
Sbjct: 1358 YDLAVIGGGPGGYTTAIKGAQHGLKVACID--------RRKTLGGTCLNVGCIPSKCL 1209
>Contig4146
Length = 53782
Score = 28.9 bits (63), Expect = 5.2, Method: Composition-based stats.
Identities = 18/62 (29%), Positives = 33/62 (53%), Gaps = 4/62 (6%)
Frame = +1
Query: 298 RKIGLETVG----VKINEKTGKIPVTDEEQTNVPYIYAIGDILEDKVELTPVAIQAGRLL 353
R++G E V + + + G++ V + N+ ++ + + D VEL PVA++AG
Sbjct: 15532 RQLGTEVVANRYALHLAQDEGRVVVNKTTEYNIFPVHPVACLAHD-VELAPVAVRAGVRH 15708
Query: 354 AQ 355
AQ
Sbjct: 15709 AQ 15714
>Contig4119
Length = 33642
Score = 28.1 bits (61), Expect = 8.9, Method: Composition-based stats.
Identities = 14/28 (50%), Positives = 16/28 (57%)
Frame = +1
Query: 146 KGKEKIYSAERFLIATGERPRYLGIPGD 173
K E+IYS ATG R Y+G PGD
Sbjct: 16297 KALEQIYSRRTMCNATGMRYVYVGSPGD 16380
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000035_1.0 # Protein # Thioredoxin reductase 2 (TR2) #
Homo sapiens # Complete
(524 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig4075 207 e-100
Contig3884 66 4e-11
Contig4130 55 1e-07
>Contig4075
Length = 25623
Score = 207 bits (528), Expect(3) = e-100, Method: Composition-based stats.
Identities = 109/260 (41%), Positives = 162/260 (62%), Gaps = 1/260 (0%)
Frame = +2
Query: 128 MAEAVQNHVKSLNWGHRVQLQDRKVKYFNIKASFVDEHTVCGVAKGGKEILLSADHIIIA 187
+ + VQN++K LN+ +R L + V+Y N AS D++TV K G++ + A +++IA
Sbjct: 19271 LRQTVQNYIKMLNFSYRSGLMSKNVEYVNAMASLADKNTVTYTDKKGEKHQIKAKNVLIA 19450
Query: 188 TGGRPRYPTHIEGALEYGITSDDIFWLKESPGKTLVVGASYVALECAGFLTGIGLDTTIM 247
G RP P+ ++GA +Y ITSDD+ KE GKTL+VG S+VALECAGFLT +G D T+
Sbjct: 19451 IGARPTIPSDVKGAWDYSITSDDLMSRKEPVGKTLIVGGSFVALECAGFLTSMGYDVTVA 19630
Query: 248 MRSIPLRGFDQQMSSMVIEHMASHGTRFLRGCAPSRVRRLPDGQLQVTWEDSTTGKEDTG 307
+RS+ LRGFD+Q + V E M + GT+F G P + +L +G+L + + D ++ +
Sbjct: 19631 VRSLILRGFDRQCADKVQELMLATGTKFKNGVVPQAITKLENGRLYIEFTDGSSDE---- 19798
Query: 308 TFDTVLWAIGRVPDTR-SLNLEKAGVDTSPDTQKILVDSREATSVPHIYAIGDVVEGRPE 366
FDT+++A GR +R L G+ S + I D + TSV +YA+GDVVEG P
Sbjct: 19799 -FDTLMYATGRSVSSRMQKELSDVGIKFSEYGKIIAEDGK--TSVEGVYAVGDVVEGNPA 19969
Query: 367 LTPIAIMAGRLLVQRLFGGS 386
L P+A+ G LL +R+FG S
Sbjct: 19970 LAPVAVKDGELLARRIFGNS 20029
Score = 99.8 bits (247), Expect(3) = e-100, Method: Composition-based stats.
Identities = 52/139 (37%), Positives = 77/139 (55%), Gaps = 14/139 (10%)
Frame = +3
Query: 390 MDYDNVPTTVFTPLEYGCVGLSEEEAVARHGQEHVEVYHAHYKPLEFTVAGRDASQ---- 445
+D + +P VFTP EY G+SEE A +G E +VY Y LEF R+ +
Sbjct: 20076 LDLNYIPMCVFTPYEYARCGISEETASKLYG-EDFDVYLKEYTTLEFAAVHREKVESLRA 20252
Query: 446 ----------CYVKMVCLREPPQLVLGLHFLGPNAGEVTQGFALGIKCGASYAQVMRTVG 495
C KM+C ++ V+GLHF+GPNAGE+ QG + ++ G + ++ T+G
Sbjct: 20253 DEFDVDMPPTCLSKMICKKD--GTVVGLHFVGPNAGEIMQGLCMAVRKGITKEEIDDTIG 20426
Query: 496 IHPTCSEEVVKLRISKRSG 514
IHPT +E V L ++K+SG
Sbjct: 20427 IHPTDAESFVNLTVTKKSG 20483
Score = 97.4 bits (241), Expect(3) = e-100, Method: Composition-based stats.
Identities = 55/89 (61%), Positives = 62/89 (69%)
Frame = +3
Query: 40 YDLLVVGGGSGGLACAKEAAQLGRKVAVVDYVEPSPQGTRWGLGGTCVNVGCIPKKLMHQ 99
YD V+GGG GGLA AKEAA LG K + DYV PSP+GT WGLGGTCVNVGCIPKKLMH
Sbjct: 18972 YDFAVIGGGCGGLAAAKEAASLGAKTILFDYVRPSPRGTTWGLGGTCVNVGCIPKKLMHY 19151
Query: 100 AALLGGLIQDAPNYGWEVAQPVPHDWRKM 128
A +LG D GW + PHDW ++
Sbjct: 19152 AGILGHSSHDREALGWGNHEG-PHDWGRL 19235
>Contig3884
Length = 22305
Score = 65.9 bits (159), Expect = 4e-11, Method: Composition-based stats.
Identities = 59/200 (29%), Positives = 91/200 (45%), Gaps = 7/200 (3%)
Frame = +2
Query: 310 DTVLWAIGRVPDTRSLNLEKAGVDTSPDTQKILVDSR-EATSVPHIYAIGDVVEGRPELT 368
D VL A+GR P T +L LE G+ T I+VD + + P+I AIGDV+ G P L
Sbjct: 3875 DVVLLAMGRRPFTSNLGLENLGIKTERGC--IVVDDKLRIPNHPNISAIGDVIPG-PMLA 4045
Query: 369 PIAIMAGRLLVQRLFGGSSDLMDYDNVPTTVFTPLEYGCVGLSEEEAVARHGQEHVEVYH 428
A G + + + G +++++VP ++T E VG +E+ ++
Sbjct: 4046 HKAEEDGSVALGNILGKDLGHVNWNHVPMVIYTHPEVAGVGSTEQS---------LKAAG 4198
Query: 429 AHYKPLEFTVAGRDASQC------YVKMVCLREPPQLVLGLHFLGPNAGEVTQGFALGIK 482
YK F A S+ +VK+ L + +LG +GP A E+ L +
Sbjct: 4199 VAYKKASFPFAANSRSRVAGETDGFVKI--LADNDNKILGGWIIGPQASELIGQITLMMA 4372
Query: 483 CGASYAQVMRTVGIHPTCSE 502
CG + V + HPT SE
Sbjct: 4373 CGLTTVDVSKVCFAHPTISE 4432
Score = 54.7 bits (130), Expect = 1e-07, Method: Composition-based stats.
Identities = 30/58 (51%), Positives = 35/58 (60%)
Frame = +2
Query: 40 YDLLVVGGGSGGLACAKEAAQLGRKVAVVDYVEPSPQGTRWGLGGTCVNVGCIPKKLM 97
YDL V+GGG GG A + AQ G KVA +D R LGGTC+NVGCIP K +
Sbjct: 2819 YDLAVIGGGPGGYTTAIKGAQHGLKVACID--------RRKTLGGTCLNVGCIPSKCL 2968
Score = 29.6 bits (65), Expect = 3.7, Method: Composition-based stats.
Identities = 13/29 (44%), Positives = 21/29 (72%)
Frame = +1
Query: 42 LLVVGGGSGGLACAKEAAQLGRKVAVVDY 70
++V+GGG+ GL A ++LG KV +V+Y
Sbjct: 3577 MVVIGGGAIGLELASVWSRLGAKVTIVEY 3663
>Contig4130
Length = 53027
Score = 54.7 bits (130), Expect = 1e-07, Method: Composition-based stats.
Identities = 30/58 (51%), Positives = 35/58 (60%)
Frame = -1
Query: 40 YDLLVVGGGSGGLACAKEAAQLGRKVAVVDYVEPSPQGTRWGLGGTCVNVGCIPKKLM 97
YDL V+GGG GG A + AQ G KVA +D R LGGTC+NVGCIP K +
Sbjct: 1358 YDLAVIGGGPGGYTTAIKGAQHGLKVACID--------RRKTLGGTCLNVGCIPSKCL 1209
Score = 46.2 bits (108), Expect = 3e-05, Method: Composition-based stats.
Identities = 35/101 (34%), Positives = 53/101 (52%), Gaps = 1/101 (0%)
Frame = -1
Query: 310 DTVLWAIGRVPDTRSLNLEKAGVDTSPDTQKILVDSR-EATSVPHIYAIGDVVEGRPELT 368
D VL A+GR P T +L LE G+ T I+VD + + P+I AIGDV+ G P L
Sbjct: 302 DVVLLAMGRRPFTSNLGLENLGIKTERGC--IVVDDKLRIPNHPNISAIGDVIPG-PMLA 132
Query: 369 PIAIMAGRLLVQRLFGGSSDLMDYDNVPTTVFTPLEYGCVG 409
A G + + + G +++++VP ++T E VG
Sbjct: 131 HKAEEDGSVALGNILGKDLGHVNWNHVPMVIYTHPEVAGVG 9
Score = 29.6 bits (65), Expect = 3.7, Method: Composition-based stats.
Identities = 13/29 (44%), Positives = 21/29 (72%)
Frame = -3
Query: 42 LLVVGGGSGGLACAKEAAQLGRKVAVVDY 70
++V+GGG+ GL A ++LG KV +V+Y
Sbjct: 600 MVVIGGGAIGLELASVWSRLGAKVTIVEY 514
TBLASTN 2.2.17 [Aug-26-2007]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SPP00000036_1.0 # Protein # Thioredoxin reductase 3 (TR3) #
Homo sapiens # Complete
(754 letters)
Database: B_bigemina.contigs.fa
4396 sequences; 19,733,690 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Contig4075 192 8e-98
Contig3884 70 5e-12
Contig4130 49 1e-05
Contig4072 35 0.094
Contig4100.1 35 0.17
Contig505 34 0.22
Contig1333 32 0.72
Contig3285 32 0.72
Contig3935 32 0.85
Contig4178 32 0.96
Contig4132 32 0.96
Contig4092 32 1.2
Contig4062 31 2.0
Contig3340 31 2.2
Contig4118 31 2.3
Contig3865 31 2.5
Contig3279 31 2.5
Contig4019 31 2.7
Contig4151 30 2.9
Contig1976 30 3.7
Contig305 30 3.8
Contig1950 30 3.9
Contig1568 30 3.9
Contig2390 30 4.4
Contig3503 30 4.8
Contig4135.1 30 5.0
Contig3599 29 8.3
Contig4170 29 9.0
Contig3597.0 29 9.1
>Contig4075
Length = 25623
Score = 192 bits (487), Expect(3) = 8e-98, Method: Composition-based stats.
Identities = 113/278 (40%), Positives = 165/278 (59%), Gaps = 2/278 (0%)
Frame = +2
Query: 355 TMTKAIQNHISSLNWGYRLSLREKAVAYVNSYGEFVEHHKIKATNKKGQETYYTAAQFVI 414
T+ + +QN+I LN+ YR L K V YVN+ + + + T+KKG++ A +I
Sbjct: 19268 TLRQTVQNYIKMLNFSYRSGLMSKNVEYVNAMASLADKNTVTYTDKKGEKHQIKAKNVLI 19447
Query: 415 ATGERPRYLG-IQGDKEYCITSDDLFSLPYCPGKTLVVGASYVALECAGFLAGFGLDVTV 473
A G RP ++G +Y ITSDDL S GKTL+VG S+VALECAGFL G DVTV
Sbjct: 19448 AIGARPTIPSDVKGAWDYSITSDDLMSRKEPVGKTLIVGGSFVALECAGFLTSMGYDVTV 19627
Query: 474 MVRSILLRGFDQEMAEKVGSYMEQHGVKFLRKFIPVMVQQLEKGSPGKLKVLAKSTEGTE 533
VRS++LRGFD++ A+KV M G KF +P + +LE G ++ + T+G+
Sbjct: 19628 AVRSLILRGFDRQCADKVQELMLATGTKFKNGVVPQAITKLENG-----RLYIEFTDGSS 19792
Query: 534 TIEGVYNTVLLAIGRDSCTR-KIGLEKIGVKINEKSGKIPVNDVEQTNVPYVYAVGDILE 592
++T++ A GR +R + L +G+K +E GKI D +T+V VYAVGD++E
Sbjct: 19793 D---EFDTLMYATGRSVSSRMQKELSDVGIKFSE-YGKIIAED-GKTSVEGVYAVGDVVE 19957
Query: 593 DKPELTPVAIQSGKLLAQRLFGASLEKCDYINVPTTVF 630
P L PVA++ G+LLA+R+FG S +K ++ T F
Sbjct: 19958 GNPALAPVAVKDGELLARRIFGNSDKKVGFVVSHATNF 20071
Score = 108 bits (269), Expect(3) = 8e-98, Method: Composition-based stats.
Identities = 57/134 (42%), Positives = 77/134 (57%), Gaps = 14/134 (10%)
Frame = +3
Query: 625 VPTTVFTPLEYGCCGLSEEKAIEVYKKENLEIYHTLFWPLEWTVAGREN----------- 673
+P VFTP EY CG+SEE A ++Y E+ ++Y + LE+ RE
Sbjct: 20091 IPMCVFTPYEYARCGISEETASKLYG-EDFDVYLKEYTTLEFAAVHREKVESLRADEFDV 20267
Query: 674 ---NTCYAKIICNKFDHDRVIGFHILGPNAGEVTQGFAAAMKCGLTKQLLDDTIGIHPTC 730
TC +K+IC K V+G H +GPNAGE+ QG A++ G+TK+ +DDTIGIHPT
Sbjct: 20268 DMPPTCLSKMICKK--DGTVVGLHFVGPNAGEIMQGLCMAVRKGITKEEIDDTIGIHPTD 20441
Query: 731 GEVFTTLEITKSSG 744
E F L +TK SG
Sbjct: 20442 AESFVNLTVTKKSG 20483
Score = 98.2 bits (243), Expect(3) = 8e-98, Method: Composition-based stats.
Identities = 52/86 (60%), Positives = 63/86 (73%)
Frame = +3
Query: 268 YDLIIIGGGSGGLSCAKEAAILGKKVMVLDFVVPSPQGTSWGLGGTCVNVGCIPKKLMHQ 327
YD +IGGG GGL+ AKEAA LG K ++ D+V PSP+GT+WGLGGTCVNVGCIPKKLMH
Sbjct: 18972 YDFAVIGGGCGGLAAAKEAASLGAKTILFDYVRPSPRGTTWGLGGTCVNVGCIPKKLMHY 19151
Query: 328 AALLGQALCDSRKFGWEYNQQVRHNW 353
A +LG + D GW N + H+W
Sbjct: 19152 AGILGHSSHDREALGWG-NHEGPHDW 19226
>Contig3884
Length = 22305
Score = 69.7 bits (169), Expect = 5e-12, Method: Composition-based stats.
Identities = 69/255 (27%), Positives = 117/255 (45%), Gaps = 4/255 (1%)
Frame = +2
Query: 484 DQEMAEKVGSYMEQHGVKFLRKFIPVMVQQLEKGSPGKLKVLAKSTEGTETIEGVYNTVL 543
DQ+++ + +E+ G++ + V + L + V+ G + IE + VL
Sbjct: 3725 DQDVSATIKKLIEKQGIE-----VSVSTKVLGGEAKEDCAVVTAEKNG-QKIEFKGDVVL 3886
Query: 544 LAIGRDSCTRKIGLEKIGVKINEKSGKIPVND-VEQTNVPYVYAVGDILEDKPELTPVAI 602
LA+GR T +GLE +G+K + G I V+D + N P + A+GD++ P L A
Sbjct: 3887 LAMGRRPFTSNLGLENLGIKT--ERGCIVVDDKLRIPNHPNISAIGDVIPG-PMLAHKAE 4057
Query: 603 QSGKLLAQRLFGASLEKCDYINVPTTVFTPLEYGCCGLSEE--KAIEV-YKKENLEIYHT 659
+ G + + G L ++ +VP ++T E G +E+ KA V YKK +
Sbjct: 4058 EDGSVALGNILGKDLGHVNWNHVPMVIYTHPEVAGVGSTEQSLKAAGVAYKKASFPFAAN 4237
Query: 660 LFWPLEWTVAGRENNTCYAKIICNKFDHDRVIGFHILGPNAGEVTQGFAAAMKCGLTKQL 719
VAG + + KI+ + + ++++G I+GP A E+ M CGLT
Sbjct: 4238 S----RSRVAGETDG--FVKILAD--NDNKILGGWIIGPQASELIGQITLMMACGLTTVD 4393
Query: 720 LDDTIGIHPTCGEVF 734
+ HPT E
Sbjct: 4394 VSKVCFAHPTISEAL 4438
Score = 48.1 bits (113), Expect = 2e-05, Method: Composition-based stats.
Identities = 27/58 (46%), Positives = 33/58 (56%)
Frame = +2
Query: 268 YDLIIIGGGSGGLSCAKEAAILGKKVMVLDFVVPSPQGTSWGLGGTCVNVGCIPKKLM 325
YDL +IGGG GG + A + A G KV +D LGGTC+NVGCIP K +
Sbjct: 2819 YDLAVIGGGPGGYTTAIKGAQHGLKVACID--------RRKTLGGTCLNVGCIPSKCL 2968
>Contig4130
Length = 53027
Score = 48.5 bits (114), Expect = 1e-05, Method: Composition-based stats.
Identities = 42/157 (26%), Positives = 75/157 (47%), Gaps = 1/157 (0%)
Frame = -1
Query: 484 DQEMAEKVGSYMEQHGVKFLRKFIPVMVQQLEKGSPGKLKVLAKSTEGTETIEGVYNTVL 543
DQ+++ + +E+ G++ + V + L + V+ G + IE + VL
Sbjct: 452 DQDVSATIKKLIEKQGIE-----VSVSTKVLGGEAKEDCAVVTAEKNG-QKIEFKGDVVL 291
Query: 544 LAIGRDSCTRKIGLEKIGVKINEKSGKIPVND-VEQTNVPYVYAVGDILEDKPELTPVAI 602
LA+GR T +GLE +G+K + G I V+D + N P + A+GD++ P L A
Sbjct: 290 LAMGRRPFTSNLGLENLGIKT--ERGCIVVDDKLRIPNHPNISAIGDVIPG-PMLAHKAE 120
Query: 603 QSGKLLAQRLFGASLEKCDYINVPTTVFTPLEYGCCG 639
+ G + + G L ++ +VP ++T E G
Sbjct: 119 EDGSVALGNILGKDLGHVNWNHVPMVIYTHPEVAGVG 9
Score = 48.1 bits (113), Expect = 2e-05, Method: Composition-based stats.
Identities = 27/58 (46%), Positives = 33/58 (56%)
Frame = -1
Query: 268 YDLIIIGGGSGGLSCAKEAAILGKKVMVLDFVVPSPQGTSWGLGGTCVNVGCIPKKLM 325
YDL +IGGG GG + A + A G KV +D LGGTC+NVGCIP K +
Sbjct: 1358 YDLAVIGGGPGGYTTAIKGAQHGLKVACID--------RRKTLGGTCLNVGCIPSKCL 1209
>Contig4072
Length = 50453
Score = 35.4 bits (80), Expect = 0.094, Method: Composition-based stats.
Identities = 30/85 (35%), Positives = 36/85 (42%), Gaps = 28/85 (32%)
Frame = +2
Query: 97 RFVSRPGRARSESETLERSPPQSPG--PGKAGDAPNRRSG-----------HVRGARV-- 141
R + RPG AR E + P Q PG PG+A A RRSG + RG V
Sbjct: 24458 RRIWRPGEARLAGE---KYPAQGPGRGPGEAAGADRRRSGRYRPQQKAAETYARGMTVTA 24628
Query: 142 -------------LSPPGRRARLSS 153
+SPP RR RLS+
Sbjct: 24629 VRRRRETRRTAGRVSPPARRPRLSA 24703
>Contig4100.1
Length = 40200
Score = 34.7 bits (78), Expect = 0.17, Method: Composition-based stats.
Identities = 43/193 (22%), Positives = 72/193 (37%), Gaps = 5/193 (2%)
Frame = +3
Query: 47 AGDPISHARGGVGGRGERLPRGFARVVRVASEGSVR---RPSGPVPAPQPPAFRFVSRPG 103
GD ++ + V RG RV +G V R +G V +P P+F PG
Sbjct: 14535 GGDGVTDRKQDVNTRG--------RVNTEEQQGGVANMLRSTGIVLSPSEPSFTL--NPG 14684
Query: 104 RARSESETLERSPPQSPGPGKAGDAPNRRSGHV--RGARVLSPPGRRARLSSPGPSRSSE 161
R RS + TL +P GD+ R+ + R ++P R S G ++
Sbjct: 14685 RQRSTNPTLSGKSASAPNSDAFGDSEGTRAATIAQRDPAAIAPSSTRDNSSGDGLRETAS 14864
Query: 162 AREELRRHLVGLIERSRVVIFSKSYCPHSTRVKELFSSLGVECNVLELDQVDDGARVQEV 221
R G S V+ S + + V + G ++ D + G +
Sbjct: 14865 QRG-------GDSSVSSVMPISSRFPDNENSVLNTQTPKGPSVQLITSDTIRQGGTGSKP 15023
Query: 222 LSEITNQKTVPNI 234
SE ++ +P++
Sbjct: 15024 ESEAEPKQGIPDV 15062
>Contig505
Length = 1333
Score = 34.3 bits (77), Expect = 0.22, Method: Composition-based stats.
Identities = 27/86 (31%), Positives = 37/86 (43%), Gaps = 11/86 (12%)
Frame = +2
Query: 86 GPVPAPQPPAFRFVSRPGR--ARSESETLERSPPQSPGPGKAGDAPN-------RRSGHV 136
G P PPA F RP + S E L +PP P K G PN +++ +
Sbjct: 848 GGKKTPHPPADFFGRRPPPRPSISREEIL*TTPPPRDHPLKKGAPPNNTHPRATKKNPYT 1027
Query: 137 RGARVLSPPGRRAR--LSSPGPSRSS 160
RG + SPP R + P P +S+
Sbjct: 1028RGVNLFSPPHNTQR*TKTKPRPEKST 1105
>Contig1333
Length = 1827
Score = 32.3 bits (72), Expect = 0.72, Method: Composition-based stats.
Identities = 15/44 (34%), Positives = 24/44 (54%), Gaps = 3/44 (6%)
Frame = +3
Query: 5 HRLVSGKRRMHLTSRP---VASTHSPLILDMCPHLNNGPMTGCG 45
H +V G R++H+ + +A + + L + PHL NG TGC
Sbjct: 234 HSIVQGVRQLHVDDQAAVYMAQRRTLVTLPLLPHLRNGWQTGCA 365
>Contig3285
Length = 7026
Score = 32.3 bits (72), Expect = 0.72, Method: Composition-based stats.
Identities = 15/44 (34%), Positives = 24/44 (54%), Gaps = 3/44 (6%)
Frame = -3
Query: 5 HRLVSGKRRMHLTSRP---VASTHSPLILDMCPHLNNGPMTGCG 45
H +V G R++H+ + +A + + L + PHL NG TGC
Sbjct: 1546 HSIVQGVRQLHVDDQAAVYMAQRRTLVTLPLLPHLRNGWQTGCA 1415
>Contig3935
Length = 14504
Score = 32.3 bits (72), Expect = 0.85, Method: Composition-based stats.
Identities = 35/114 (30%), Positives = 51/114 (44%), Gaps = 18/114 (15%)
Frame = -3
Query: 165 ELRRHLVGLIERSRVVIFSKSY-----CPHSTRVKEL----FSSLGVECNVLELDQVDDG 215
E+ L LI R RV++ K C S V + S+ V N+L+ D
Sbjct: 4989 EVETKLRDLIRRERVLLLLKGTPEDPRCGFSAAVSNFRRYCYKSM*V-VNILDKYTTVDY 4813
Query: 216 ARVQEVLS---------EITNQKTVPNIFVNKVHVGGCDQTFQAYQSGLLQKLL 260
A V +VLS +I + T P +FV+ VGGCD ++SG L ++L
Sbjct: 4812 AYV-DVLSHEYLRPCAKKIADWPTFPQLFVDGRLVGGCDVIQSLHESGSLAEIL 4654
>Contig4178
Length = 124204
Score = 32.0 bits (71), Expect = 0.96, Method: Composition-based stats.
Identities = 33/113 (29%), Positives = 40/113 (35%), Gaps = 12/113 (10%)
Frame = +3
Query: 48 GDPISHARGGVGGRGERLPRGFAR------------VVRVASEGSVRRPSGPVPAPQPPA 95
G + ARG G E LPRG R V R+A RRP P P
Sbjct: 122688 GAAVEGARGCGGSCSEHLPRGAQRGDDPSHAYQLVAVRRLARPQDYRRPRVEPAQPAGPC 122867
Query: 96 FRFVSRPGRARSESETLERSPPQSPGPGKAGDAPNRRSGHVRGARVLSPPGRR 148
R SR G SE + + +A + R GH RG + P R
Sbjct: 122868 GRVASREG---SEVRHVGGDA*AAAAVPQAPASRVSREGHARGTGFVRPESGR 123017
>Contig4132
Length = 50512
Score = 32.0 bits (71), Expect = 0.96, Method: Composition-based stats.
Identities = 36/105 (34%), Positives = 43/105 (40%), Gaps = 1/105 (0%)
Frame = +1
Query: 74 RVASEGSVRRPSGPVPAPQPPAFRFVSRPGRARSESETLERSPPQSPGPGKAGDAPNRRS 133
R GSVRR +G F S PG + S RSPP + P+RRS
Sbjct: 40111 RPCPRGSVRRTTGG--GAMSRRFGRCS*PGVSHPRSRRTTRSPPAT*------TLPSRRS 40266
Query: 134 GHVRGARVLSPP-GRRARLSSPGPSRSSEAREELRRHLVGLIERS 177
R +L PP G A SP SR A +R+H L RS
Sbjct: 40267 TPCRYRAILKPPHGPTASAGSPC-SRVPSAGLRIRKHTGPLCWRS 40398
>Contig4092
Length = 35044
Score = 32.0 bits (71), Expect = 1.2, Method: Composition-based stats.
Identities = 29/91 (31%), Positives = 39/91 (42%)
Frame = -2
Query: 74 RVASEGSVRRPSGPVPAPQPPAFRFVSRPGRARSESETLERSPPQSPGPGKAGDAPNRRS 133
R +++ R SGP +PP+ R RP R + SPP S A P R
Sbjct: 6696 RCSAQSPACRQSGPKSGCRPPSGRGSRRP-RPPAAGPRCLSSPPASY---HAPCPPPRWP 6529
Query: 134 GHVRGARVLSPPGRRARLSSPGPSRSSEARE 164
RG+R + P RR R P RS+ R+
Sbjct: 6528 PRHRGSRRPAAPWRRIRARPGTPGRSAHGRQ 6436
>Contig4062
Length = 31354
Score = 31.2 bits (69), Expect = 2.0, Method: Composition-based stats.
Identities = 33/110 (30%), Positives = 45/110 (40%), Gaps = 15/110 (13%)
Frame = +3
Query: 59 GGRGERLP-RGFARVVR-----VASEGSVRRPSGPVPAPQPPAFRFVSRPGRARSESETL 112
G R ER P R AR R + + RPS PAP P R +R G AR ++
Sbjct: 9789 GNRSERSPPRSSARCRRRQSCLLCTAKGGNRPSAASPAPPLPTPRGRARCGCAR*RAKAA 9968
Query: 113 ERSPPQSPGPGKAG---------DAPNRRSGHVRGARVLSPPGRRARLSS 153
R+PP++ GP + +P +R RG + P A S
Sbjct: 9969 PRTPPEA*GPRRRATRRSRPGIVPSPGKRCNAARGFHLAVPADSPAATQS 10118
>Contig3340
Length = 12482
Score = 30.8 bits (68), Expect = 2.2, Method: Composition-based stats.
Identities = 17/42 (40%), Positives = 24/42 (57%)
Frame = +2
Query: 6 RLVSGKRRMHLTSRPVASTHSPLILDMCPHLNNGPMTGCGRA 47
RL+S + + L+S P H+ +I D CP N P+T CG A
Sbjct: 3971 RLIS-RMKALLSSAP--KHHAMVITDSCPSTPNAPVTHCGNA 4087
>Contig4118
Length = 43326
Score = 30.8 bits (68), Expect = 2.3, Method: Composition-based stats.
Identities = 15/47 (31%), Positives = 27/47 (57%), Gaps = 5/47 (10%)
Frame = -3
Query: 147 RRARLSSPGPSRSSEAREELRRHLVGL-----IERSRVVIFSKSYCP 188
R ARLSSP ++S+ + + H + +++VV+FS++Y P
Sbjct: 36649 RCARLSSPSVAKSTAFLQNMDDHQIAAWVEQQAHQNKVVVFSRTYVP 36509
>Contig3865
Length = 14538
Score = 30.8 bits (68), Expect = 2.5, Method: Composition-based stats.
Identities = 16/38 (42%), Positives = 19/38 (50%)
Frame = -1
Query: 82 RRPSGPVPAPQPPAFRFVSRPGRARSESETLERSPPQS 119
RRP+G P + P R SR G +R RS PQS
Sbjct: 1587 RRPTGYCPGSKRPLARLGSRYGTSRCSRRFCTRSLPQS 1474
>Contig3279
Length = 8884
Score = 30.8 bits (68), Expect = 2.5, Method: Composition-based stats.
Identities = 16/38 (42%), Positives = 19/38 (50%)
Frame = +2
Query: 82 RRPSGPVPAPQPPAFRFVSRPGRARSESETLERSPPQS 119
RRP+G P + P R SR G +R RS PQS
Sbjct: 1607 RRPTGYCPGSKRPLARLGSRYGTSRCSRRFCTRSLPQS 1720
>Contig4019
Length = 14565
Score = 30.8 bits (68), Expect = 2.7, Method: Composition-based stats.
Identities = 19/50 (38%), Positives = 27/50 (54%)
Frame = -2
Query: 129 PNRRSGHVRGARVLSPPGRRARLSSPGPSRSSEAREELRRHLVGLIERSR 178
P RRSG + G VL RR R + R+ + ++ LRR++ G RSR
Sbjct: 7052 PYRRSGRIAGFHVL----RRVRDTH*DAERTKKCQQVLRRYMRGTCCRSR 6915
>Contig4151
Length = 44706
Score = 30.4 bits (67), Expect = 2.9, Method: Composition-based stats.
Identities = 21/75 (28%), Positives = 29/75 (38%)
Frame = +1
Query: 50 PISHARGGVGGRGERLPRGFARVVRVASEGSVRRPSGPVPAPQPPAFRFVSRPGRARSES 109
P+S GR R P + VR + P PA PP R +R R+ S
Sbjct: 26029 PVSSPAETERGRATRSPDSRIHT*AGRTSQRVRGKATPTPAGHPPVPRSAARRNPNRARS 26208
Query: 110 ETLERSPPQSPGPGK 124
+ RS + P G+
Sbjct: 26209 RVVYRSSSKGP*LGR 26253
>Contig1976
Length = 9639
Score = 30.0 bits (66), Expect = 3.7, Method: Composition-based stats.
Identities = 14/45 (31%), Positives = 26/45 (57%), Gaps = 3/45 (6%)
Frame = +2
Query: 4 VHRLVSGKRRMHL---TSRPVASTHSPLILDMCPHLNNGPMTGCG 45
+H+++ G ++HL T+ + + + + L + PHL NG TGC
Sbjct: 6020 LHQIICGM*QLHLDHQTTVHLLGSRTLVTLPLLPHLRNGWQTGCA 6154
>Contig305
Length = 1388
Score = 30.0 bits (66), Expect = 3.8, Method: Composition-based stats.
Identities = 31/116 (26%), Positives = 46/116 (39%), Gaps = 1/116 (0%)
Frame = -2
Query: 81 VRRPSGPVPAPQPPAFRFVSRPGRARSESETLERSPPQSPGPGKAGDAPNRRSGHVRGAR 140
+R P+ A PP FR S P A+ E P P P+R+
Sbjct: 1261 IRAPTTGPQATPPPLFR--STPALAQPEDYRYIILSHHPPAPIAT---PSRQR------- 1118
Query: 141 VLSPPGRRARLSSPGPSRSSEAREE-LRRHLVGLIERSRVVIFSKSYCPHSTRVKE 195
SPP + RL SP P+++ + E+ L L + I S + P+ +R E
Sbjct: 1117 --SPPSQHPRLPSPAPTQNGDPPEK*LPHRLPSAVPNKWFPIPSNQHQPYLSRPSE 956
>Contig1950
Length = 1112
Score = 30.0 bits (66), Expect = 3.9, Method: Composition-based stats.
Identities = 14/45 (31%), Positives = 24/45 (53%), Gaps = 3/45 (6%)
Frame = -3
Query: 4 VHRLVSGKRRMHL---TSRPVASTHSPLILDMCPHLNNGPMTGCG 45
+HR + R++H+ + PV + + + + PHL NG TGC
Sbjct: 1110 LHRPIQRVRQIHIHYPRTVPVDDRRTLVTITLLPHLRNGWQTGCA 976
>Contig1568
Length = 7496
Score = 30.0 bits (66), Expect = 3.9, Method: Composition-based stats.
Identities = 14/45 (31%), Positives = 24/45 (53%), Gaps = 3/45 (6%)
Frame = +3
Query: 4 VHRLVSGKRRMHL---TSRPVASTHSPLILDMCPHLNNGPMTGCG 45
+HR + R++H+ + PV + + + + PHL NG TGC
Sbjct: 6300 LHRPIQRVRQIHIHYPRTVPVDDRRTLVTITLLPHLRNGWQTGCA 6434
>Contig2390
Length = 6392
Score = 30.0 bits (66), Expect = 4.4, Method: Composition-based stats.
Identities = 15/44 (34%), Positives = 24/44 (54%), Gaps = 3/44 (6%)
Frame = +3
Query: 5 HRLVSGKRRMHL---TSRPVASTHSPLILDMCPHLNNGPMTGCG 45
H V G R+++L T+ +A + + + + PHL NG TGC
Sbjct: 2775 HSFVQGVRQIYLHHQTAVYMAQRRTLVTIPLLPHLRNGWQTGCA 2906
>Contig3503
Length = 1432
Score = 29.6 bits (65), Expect = 4.8, Method: Composition-based stats.
Identities = 22/55 (40%), Positives = 26/55 (47%), Gaps = 1/55 (1%)
Frame = -2
Query: 64 RLPRGFARVVRVASEGSVRR-PSGPVPAPQPPAFRFVSRPGRARSESETLERSPP 117
RLPR V R+ E VRR P P APQPP R V + R ++ R P
Sbjct: 573 RLPRSGCDV-RLCPEAEVRRLP*EPGAAPQPPLVRAVRQGSRGAAQPAWGPRECP 412
>Contig4135.1
Length = 12276
Score = 29.6 bits (65), Expect = 5.0, Method: Composition-based stats.
Identities = 33/97 (34%), Positives = 45/97 (46%), Gaps = 7/97 (7%)
Frame = +3
Query: 89 PAPQPPAFRFVSRP---GRARSESETLERSPPQSPGPG--KAGDAPNR--RSGHVRGARV 141
P P+PP R V R G+AR ++R+ P G DA R R+GH RG R
Sbjct: 6180 PVPEPPGHRVVPREAGDGQAR-----VQRAREAVPDAG*LHGEDAHGR*PRAGH-RGGR- 6338
Query: 142 LSPPGRRARLSSPGPSRSSEAREELRRHLVGLIERSR 178
++AR + P P + RE LR + G +R R
Sbjct: 6339 ----AQQARPAVPVPLQPP-VREPLRAQVHGPEQRGR 6434
>Contig3599
Length = 5138
Score = 28.9 bits (63), Expect = 8.3, Method: Composition-based stats.
Identities = 18/48 (37%), Positives = 26/48 (54%), Gaps = 3/48 (6%)
Frame = -1
Query: 1 MGHVHRLVSGKRRMHL--TSRPVASTHSPLI-LDMCPHLNNGPMTGCG 45
+G +LV G R +H+ T + S S L+ + + PHL NG TGC
Sbjct: 422 VGSHDQLVEGNRHIHVDHTYTILQSRISTLVTIPLLPHLRNGWQTGCA 279
>Contig4170
Length = 73209
Score = 28.9 bits (63), Expect = 9.0, Method: Composition-based stats.
Identities = 19/65 (29%), Positives = 27/65 (41%), Gaps = 10/65 (15%)
Frame = -1
Query: 109 SETLERSPPQSPGPGKAGDAPNRRSGHVRGARVLSPP----------GRRARLSSPGPSR 158
+ T RS + P RRS H RG+ +PP G R RL+ PG ++
Sbjct: 3585 ARTAHRSRGRGGR*AHLSSRPARRSLHGRGSAHAAPPRL*DPACARHGPRPRLAPPGHAQ 3406
Query: 159 SSEAR 163
+ R
Sbjct: 3405 KNRKR 3391
>Contig3597.0
Length = 2518
Score = 28.9 bits (63), Expect = 9.1, Method: Composition-based stats.
Identities = 16/43 (37%), Positives = 23/43 (53%), Gaps = 3/43 (6%)
Frame = -1
Query: 13 RMHLTSRPVASTHSP---LILDMCPHLNNGPMTGCGRAGDPIS 52
++HL ++ H P + L + PHL NG TGC A PI+
Sbjct: 2410 QLHLDNQRTVHLHGPRALVTLPLLPHLRNGWQTGC--APHPIA 2288
Database: B_bigemina.contigs.fa
Posted date: Feb 28, 2008 11:56 AM
Number of letters in database: 19,733,690
Number of sequences in database: 4396
Lambda K H
0.319 0.136 0.395
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 4396
Number of Hits to DB: 294,205,517
Number of extensions: 5537659
Number of successful extensions: 53936
Number of sequences better than 10.0: 1120
Number of HSP's gapped: 44852
Number of HSP's successfully gapped: 3596
Length of database: 6,577,896
Neighboring words threshold: 13
Window for multiple hits: 40
X1: 15 ( 6.9 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.3 bits)
S2: 42 (20.8 bits)