BLASTP 2.2.26+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Stephen F. Altschul, John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

RID: GY8VYUN7012

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           16,927,445 sequences; 5,811,956,865 total letters

Query= gi|340502159|gb|GL984205.1|:subseq(128994,14689) Ichthyophthirius multifiliis unplaced genomic scaffold scaff_1120509251244, whole genome shotgun sequence:[translate(1)]
Length=68

                                                                  Score     E
Sequences producing significant alignments:                      (Bits)  Value

ref|XP_001693901.1| selenoprotein W1 [Chlamydomonas reinhardt...  38.9    0.006
ref|XP_003222607.1| PREDICTED: protein C17orf37 homolog [Anol...  38.1    0.016
emb|CAG32466.1| hypothetical protein RCJMB04_26b22 [Gallus ga...  38.1    0.019
ref|NP_001092154.1| uncharacterized protein LOC100049740 [Xen...  37.0    0.042
ref|NP_001232209.1| putative C35 protein cDNA [Taeniopygia gu...  37.0    0.051
ref|XP_003466973.1| PREDICTED: protein C17orf37-like [Cavia p...  36.2    0.11
ref|NP_001015996.1| uncharacterized protein LOC548750 [Xenopu...  35.0    0.19
ref|XP_001091012.1| PREDICTED: uncharacterized protein C17orf...  35.4    0.20
gb|EHH24902.1| Protein C35 [Macaca mulatta]                       35.4    0.21
gb|EHH58074.1| Protein C35 [Macaca fascicularis]                  35.4    0.21
ref|XP_001366310.1| PREDICTED: protein C17orf37 homolog [Mono...  35.0    0.25
gb|ABS19961.1| selenoprotein W2 [Artemia franciscana]             34.7    0.26
ref|NP_919399.3| selenoprotein W, 2b [Danio rerio]                34.7    0.30
gb|AAO86697.1| selenoprotein W2b [Danio rerio]                    34.7    0.30
ref|NP_001101766.1| migration and invasion enhancer 1 [Rattus...  35.0    0.33
ref|XP_003414746.1| PREDICTED: protein C17orf37-like [Loxodon...  34.7    0.34
ref|NP_001230776.1| migration and invasion enhancer 1 [Pan tr...  34.7    0.34
ref|XP_537653.1| PREDICTED: protein C17orf37 homolog [Canis l...  34.7    0.35
ref|XP_001501126.3| PREDICTED: protein C17orf37 homolog [Equu...  34.7    0.36
ref|XP_002719179.1| PREDICTED: hypothetical protein [Oryctola...  35.0    0.38
ref|XP_003131565.2| PREDICTED: protein C17orf37 homolog [Sus ...  34.7    0.39
ref|NP_115715.3| migration and invasion enhancer 1 [Homo sapi...  34.7    0.39
ref|XP_002748573.1| PREDICTED: uncharacterized protein C17orf...  34.7    0.40
ref|NP_001068689.1| migration and invasion enhancer 1 [Bos ta...  34.7    0.43
ref|XP_002827689.1| PREDICTED: LOW QUALITY PROTEIN: uncharact...  35.0    0.44
gb|EAW60598.1| chromosome 17 open reading frame 37, isoform C...  35.0    0.46
ref|NP_079835.1| migration and invasion enhancer 1 [Mus muscu...  34.3    0.47
gb|EHB12030.1| hypothetical protein GW7_09601 [Heterocephalus...  34.7    0.47
gb|EAW60599.1| chromosome 17 open reading frame 37, isoform C...  34.7    0.61
ref|NP_919398.1| selenoprotein W, 2a [Danio rerio] >gb|AAO652...  33.5    0.84
ref|XP_002929400.1| PREDICTED: uncharacterized protein C17orf...  33.5    1.0
ref|XP_003057240.1| selenoprotein W [Micromonas pusilla CCMP1...  33.1    1.4
ref|XP_003499637.1| PREDICTED: protein C17orf37-like [Cricetu...  32.7    2.1
ref|ZP_07080860.1| conserved hypothetical protein [Sphingobac...  32.3    4.5
ref|ZP_03967640.1| conserved hypothetical protein [Sphingobac...  32.3    4.5
emb|CBN76554.1| conserved unknown protein [Ectocarpus silicul...  32.3    5.2
ref|NP_840072.3| selenoprotein W [Danio rerio] >ref|NP_001165...  31.2    5.3
sp|Q568W0.3|SELW_DANRE RecName: Full=Selenoprotein W; Short=S...  31.2    5.4
ref|XP_968954.1| PREDICTED: similar to selenoprotein W2 [Trib...  30.8    8.3
ref|ZP_05395448.1| conserved hypothetical protein [Clostridiu...  31.6    8.4
ref|YP_001976019.1| hypothetical protein WPa_1287 [Wolbachia ...  31.6    9.5

ALIGNMENTS

>ref|XP_001693901.1| selenoprotein W1 [Chlamydomonas reinhardtii]
 gb|AAN32901.1| selenoprotein SelW1 [Chlamydomonas reinhardtii]
 gb|EDP02837.1| selenoprotein W1 [Chlamydomonas reinhardtii]
Length=88

 Score = 38.9 bits (89), Expect = 0.006, Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 31/56 (55%), Gaps = 2/56 (4%)

Query  15  AQFDVLIEYCGSUGYHNTFEFTKNCITMAYPNAQIE--KKVIPGYTQCFEIYVNGK  68
           A V + YCG UGY + + +N I M +PNA I+ + P T FE+ VNG+
Sbjct  2   APVQVHVLYCGGUGYGSRYRSLENAIRMKFPNADIKFSFEATPQATGFFEVEVNGE  57

>ref|XP_003222607.1| PREDICTED: protein C17orf37 homolog [Anolis carolinensis]
Length=112

 Score = 38.1 bits (87), Expect = 0.016, Method: Compositional matrix adjust.
 Identities = 18/65 (28%), Positives = 31/65 (48%), Gaps = 2/65 (3%)

Query  4   EPVKNFQCNKHAQFDVLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEI  63
           EP +++EYC G+ + + N + YP+ +IE ++ G T FEI
Sbjct  7   EPAAEAPPATEGGVRIVVEYCKPCGFESAYLELANAVKEEYPDVEIESRL--GGTGAFEI  64

Query  64  YVNGK  68
           +NG+
Sbjct  65  EINGQ  69

>emb|CAG32466.1| hypothetical protein RCJMB04_26b22 [Gallus gallus]
Length=126

 Score = 38.1 bits (87), Expect = 0.019, Method: Compositional matrix adjust.
 Identities = 17/50 (34%), Positives = 30/50 (60%), Gaps = 2/50 (4%)

Query  19  VLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVNGK  68
           +++EYC G+ T+E + + YP+ +IE ++ G T FEI +NG+
Sbjct  36  IMVEYCEPCGFGATYEELASAVREEYPDIEIESRL--GGTGAFEIEINGQ  83

>ref|NP_001092154.1| uncharacterized protein LOC100049740 [Xenopus laevis]
 gb|AAI41722.1| LOC100049740 protein [Xenopus laevis]
Length=95

 Score = 37.0 bits (84), Expect = 0.042, Method: Compositional matrix adjust.
 Identities = 16/51 (31%), Positives = 29/51 (57%), Gaps = 2/51 (4%)

Query  18  DVLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVNGK  68
           +++EYC G+ + +E + + +P+ IE + PG T FEI +NG+
Sbjct  4   SIMVEYCEPCGFRSHYEELASAVKEEFPDITIESR--PGGTGAFEIEINGQ  52

>ref|NP_001232209.1| putative C35 protein cDNA [Taeniopygia guttata]
 gb|ACH43696.1| putative C35 protein cDNA [Taeniopygia guttata]
Length=121

 Score = 37.0 bits (84), Expect = 0.051, Method: Compositional matrix adjust.
 Identities = 16/53 (30%), Positives = 31/53 (58%), Gaps = 2/53 (4%)

Query  16  QFDVLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVNGK  68
           + +++EYC G+ T++ + + YP+ +IE ++ G T FEI +NG+
Sbjct  28  RVRIVVEYCEPCGFEATYQELASAVRDEYPDIEIESRL--GGTGAFEIEINGQ  78

>ref|XP_003466973.1| PREDICTED: protein C17orf37-like [Cavia porcellus]
Length=115

 Score = 36.2 bits (82), Expect = 0.11, Method: Compositional matrix adjust.
 Identities = 17/50 (34%), Positives = 28/50 (56%), Gaps = 2/50 (4%)

Query  19  VLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVNGK  68
           +++EYC G+ T+ + + YP QIE ++ G T FEI +NG+
Sbjct  25  IVVEYCEPCGFEATYLELASAVKEQYPGIQIESRL--GGTGAFEIEINGQ  72

>ref|NP_001015996.1| uncharacterized protein LOC548750 [Xenopus (Silurana) tropicalis]
 gb|AAI22006.1| hypothetical protein LOC548750 [Xenopus (Silurana) tropicalis]
Length=95

 Score = 35.0 bits (79), Expect = 0.19, Method: Compositional matrix adjust.
 Identities = 15/50 (30%), Positives = 29/50 (58%), Gaps = 2/50 (4%)

Query  19  VLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVNGK  68
           +++EYC G+ + +E + + +P+ I+ + PG T FEI +NG+
Sbjct  5   IVVEYCEPCGFKSHYEELASAVLEEFPDVTIDSR--PGGTGAFEIEINGQ  52

>ref|XP_001091012.1| PREDICTED: uncharacterized protein C17orf37 [Macaca mulatta]
 dbj|BAE87852.1| unnamed protein product [Macaca fascicularis]
Length=115

 Score = 35.4 bits (80), Expect = 0.20, Method: Compositional matrix adjust.
 Identities = 16/50 (32%), Positives = 28/50 (56%), Gaps = 2/50 (4%)

Query  19  VLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVNGK  68
           +++EYC G+ T+ + + YP +IE ++ G T FEI +NG+
Sbjct  25  IMVEYCEPCGFEATYLELASAVKEQYPGIEIESRL--GGTGAFEIEINGQ  72

>gb|EHH24902.1| Protein C35 [Macaca mulatta]
Length=115

 Score = 35.4 bits (80), Expect = 0.21, Method: Compositional matrix adjust.
 Identities = 16/50 (32%), Positives = 28/50 (56%), Gaps = 2/50 (4%)

Query  19  VLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVNGK  68
           +++EYC G+ T+ + + YP +IE ++ G T FEI +NG+
Sbjct  25  IMVEYCEPCGFEATYLELASAVKEQYPGIEIESRL--GGTGAFEIEINGQ  72

>gb|EHH58074.1| Protein C35 [Macaca fascicularis]
Length=115

 Score = 35.4 bits (80), Expect = 0.21, Method: Compositional matrix adjust.
 Identities = 16/50 (32%), Positives = 28/50 (56%), Gaps = 2/50 (4%)

Query  19  VLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVNGK  68
           +++EYC G+ T+ + + YP +IE ++ G T FEI +NG+
Sbjct  25  IMVEYCEPCGFEATYLELASAVKEQYPGIEIESRL--GGTGAFEIEINGQ  72

>ref|XP_001366310.1| PREDICTED: protein C17orf37 homolog [Monodelphis domestica]
Length=115

 Score = 35.0 bits (79), Expect = 0.25, Method: Compositional matrix adjust.
 Identities = 16/50 (32%), Positives = 29/50 (58%), Gaps = 2/50 (4%)

Query  19  VLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVNGK  68
           +++EYC G+ +T+ + + YP +IE ++ G T FEI +NG+
Sbjct  25  IVVEYCEPCGFESTYLELASAVKEEYPGIKIESRL--GGTGAFEIEINGQ  72

>gb|ABS19961.1| selenoprotein W2 [Artemia franciscana]
Length=94

 Score = 34.7 bits (78), Expect = 0.26, Method: Compositional matrix adjust.
 Identities = 18/48 (38%), Positives = 27/48 (56%), Gaps = 2/48 (4%)

Query  21  IEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVNGK  68
           +EYCG+UGY ++ I A P+A++ V G FE+ VNG+
Sbjct  6   VEYCGAUGYAPRYQELAAKIRKAAPDAEVSGNV--GRRSSFEVTVNGE  51

>ref|NP_919399.3| selenoprotein W, 2b [Danio rerio]
Length=94

 Score = 34.7 bits (78), Expect = 0.30, Method: Compositional matrix adjust.
 Identities = 21/48 (44%), Positives = 27/48 (56%), Gaps = 2/48 (4%)

Query  19  VLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVN  66
           V IEYCG+UGY F+ K I P+A++ V G CFEI +N
Sbjct  5   VKIEYCGAUGYEPRFQELKREICGNCPDAEVSGFV--GRRGCFEIQIN  50

>gb|AAO86697.1| selenoprotein W2b [Danio rerio]
Length=94

 Score = 34.7 bits (78), Expect = 0.30, Method: Compositional matrix adjust.
 Identities = 21/48 (44%), Positives = 27/48 (56%), Gaps = 2/48 (4%)

Query  19  VLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVN  66
           V IEYCG+UGY F+ K I P+A++ V G CFEI +N
Sbjct  5   VKIEYCGAUGYEPRFQELKREICGNCPDAEVSGFV--GRRGCFEIQIN  50

>ref|NP_001101766.1| migration and invasion enhancer 1 [Rattus norvegicus]
 gb|EDM05932.1| similar to RIKEN cDNA 1810046J19 (predicted) [Rattus norvegicus]
Length=115

 Score = 35.0 bits (79), Expect = 0.33, Method: Compositional matrix adjust.
 Identities = 16/50 (32%), Positives = 28/50 (56%), Gaps = 2/50 (4%)

Query  19  VLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVNGK  68
           +++EYC G+ T+ + + YP +IE ++ G T FEI +NG+
Sbjct  25  IVVEYCKPCGFEATYLELASAVKEEYPGIEIESRL--GGTGAFEIEINGQ  72

>ref|XP_003414746.1| PREDICTED: protein C17orf37-like [Loxodonta africana]
Length=115

 Score = 34.7 bits (78), Expect = 0.34, Method: Compositional matrix adjust.
 Identities = 16/59 (27%), Positives = 30/59 (51%), Gaps = 2/59 (3%)

Query  10  QCNKHAQFDVLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVNGK  68
           + + +++EYC G+ T+ + + YP +IE ++ G T FEI +NG+
Sbjct  16  EVEPGSGVRIVVEYCEPCGFEATYLELASAVKEQYPGIEIESRL--GGTGAFEIEINGQ  72

>ref|NP_001230776.1| migration and invasion enhancer 1 [Pan troglodytes]
Length=115

 Score = 34.7 bits (78), Expect = 0.34, Method: Compositional matrix adjust.
 Identities = 16/50 (32%), Positives = 28/50 (56%), Gaps = 2/50 (4%)

Query  19  VLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVNGK  68
           +++EYC G+ T+ + + YP +IE ++ G T FEI +NG+
Sbjct  25  IVVEYCEPCGFEATYLELASAVKEQYPGIEIESRL--GGTGAFEIEINGQ  72

>ref|XP_537653.1| PREDICTED: protein C17orf37 homolog [Canis lupus familiaris]
Length=115

 Score = 34.7 bits (78), Expect = 0.35, Method: Compositional matrix adjust.
 Identities = 16/50 (32%), Positives = 28/50 (56%), Gaps = 2/50 (4%)

Query  19  VLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVNGK  68
           +++EYC G+ T+ + + YP +IE ++ G T FEI +NG+
Sbjct  25  IVVEYCEPCGFEATYLELASAVKEQYPGIEIESRL--GGTGAFEIEINGQ  72

>ref|XP_001501126.3| PREDICTED: protein C17orf37 homolog [Equus caballus]
Length=115

 Score = 34.7 bits (78), Expect = 0.36, Method: Compositional matrix adjust.
 Identities = 16/50 (32%), Positives = 28/50 (56%), Gaps = 2/50 (4%)

Query  19  VLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVNGK  68
           +++EYC G+ T+ + + YP +IE ++ G T FEI +NG+
Sbjct  25  IVVEYCEPCGFEATYLELASAVKEQYPGIEIESRL--GGTGAFEIEINGQ  72

>ref|XP_002719179.1| PREDICTED: hypothetical protein [Oryctolagus cuniculus]
Length=166

 Score = 35.0 bits (79), Expect = 0.38, Method: Compositional matrix adjust.
 Identities = 16/50 (32%), Positives = 28/50 (56%), Gaps = 2/50 (4%)

Query  19  VLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVNGK  68
           +++EYC G+ T+ + + YP +IE ++ G T FEI +NG+
Sbjct  76  IVVEYCEPCGFEATYLELASAVKEQYPGIEIESRL--GGTGAFEIEINGQ  123

>ref|XP_003131565.2| PREDICTED: protein C17orf37 homolog [Sus scrofa]
Length=115

 Score = 34.7 bits (78), Expect = 0.39, Method: Compositional matrix adjust.
 Identities = 16/50 (32%), Positives = 28/50 (56%), Gaps = 2/50 (4%)

Query  19  VLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVNGK  68
           +++EYC G+ T+ + + YP +IE ++ G T FEI +NG+
Sbjct  25  IVVEYCEPCGFEATYLELASAVKEQYPGIEIESRL--GGTGAFEIEINGQ  72

>ref|NP_115715.3| migration and invasion enhancer 1 [Homo sapiens]
 sp|Q9BRT3.1|MIEN1_HUMAN RecName: Full=Migration and invasion enhancer 1; AltName: Full=HBV X-transactivated gene 4 protein; AltName: Full=HBV XAg-transactivated protein 4; AltName: Full=Protein C35; Flags: Precursor
 gb|AAH06006.1| Chromosome 17 open reading frame 37 [Homo sapiens]
 gb|AAO85461.1| XTP4 [Homo sapiens]
 gb|AAR92035.1| C35 protein [Homo sapiens]
 gb|ADQ33214.1| chromosome 17 open reading frame 37 [synthetic construct]
Length=115

 Score = 34.7 bits (78), Expect = 0.39, Method: Compositional matrix adjust.
 Identities = 16/50 (32%), Positives = 28/50 (56%), Gaps = 2/50 (4%)

Query  19  VLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVNGK  68
           +++EYC G+ T+ + + YP +IE ++ G T FEI +NG+
Sbjct  25  IVVEYCEPCGFEATYLELASAVKEQYPGIEIESRL--GGTGAFEIEINGQ  72

>ref|XP_002748573.1| PREDICTED: uncharacterized protein C17orf37-like [Callithrix jacchus]
Length=115

 Score = 34.7 bits (78), Expect = 0.40, Method: Compositional matrix adjust.
 Identities = 16/50 (32%), Positives = 28/50 (56%), Gaps = 2/50 (4%)

Query  19  VLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVNGK  68
           +++EYC G+ T+ + + YP +IE ++ G T FEI +NG+
Sbjct  25  IVVEYCEPCGFEATYLELASAVKEQYPGIEIESRL--GGTGAFEIEINGQ  72

>ref|NP_001068689.1| migration and invasion enhancer 1 [Bos taurus]
 sp|Q148C8.1|MIEN1_BOVIN RecName: Full=Migration and invasion enhancer 1; Flags: Precursor
 gb|AAI18464.1| Chromosome 17 open reading frame 37 ortholog [Bos taurus]
 gb|DAA18439.1| hypothetical protein LOC505710 [Bos taurus]
Length=115

 Score = 34.7 bits (78), Expect = 0.43, Method: Compositional matrix adjust.
 Identities = 16/50 (32%), Positives = 28/50 (56%), Gaps = 2/50 (4%)

Query  19  VLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVNGK  68
           +++EYC G+ T+ + + YP +IE ++ G T FEI +NG+
Sbjct  25  IVVEYCEPCGFEATYLELASAVKEQYPGIEIESRL--GGTGAFEIEINGQ  72

>ref|XP_002827689.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein C17orf37-like [Pongo abelii]
Length=209

 Score = 35.0 bits (79), Expect = 0.44, Method: Composition-based stats.
 Identities = 16/50 (32%), Positives = 28/50 (56%), Gaps = 2/50 (4%)

Query  19   VLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVNGK  68
            +++EYC G+ T+ + + YP +IE ++ G T FEI +NG+
Sbjct  119  IVVEYCEPCGFEATYLELASAVKEQYPGIEIESRL--GGTGAFEIEINGQ  166

>gb|EAW60598.1| chromosome 17 open reading frame 37, isoform CRA_a [Homo sapiens]
Length=206

 Score = 35.0 bits (79), Expect = 0.46, Method: Composition-based stats.
 Identities = 16/50 (32%), Positives = 28/50 (56%), Gaps = 2/50 (4%)

Query  19   VLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVNGK  68
            +++EYC G+ T+ + + YP +IE ++ G T FEI +NG+
Sbjct  116  IVVEYCEPCGFEATYLELASAVKEQYPGIEIESRL--GGTGAFEIEINGQ  163

>ref|NP_079835.1| migration and invasion enhancer 1 [Mus musculus]
 sp|Q9CQ86.1|MIEN1_MOUSE RecName: Full=Migration and invasion enhancer 1; Flags: Precursor
 dbj|BAB22480.1| unnamed protein product [Mus musculus]
 dbj|BAB25261.1| unnamed protein product [Mus musculus]
 dbj|BAB26586.1| unnamed protein product [Mus musculus]
 gb|AAH21589.1| RIKEN cDNA 1810046J19 gene [Mus musculus]
 dbj|BAC30901.1| unnamed protein product [Mus musculus]
 emb|CAM22009.1| novel protein similar to C17orf37 (Homo sapiens) [Mus musculus]
 gb|EDL16143.1| RIKEN cDNA 1810046J19, isoform CRA_b [Mus musculus]
Length=115

 Score = 34.3 bits (77), Expect = 0.47, Method: Compositional matrix adjust.
 Identities = 16/50 (32%), Positives = 28/50 (56%), Gaps = 2/50 (4%)

Query  19  VLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVNGK  68
           +++EYC G+ T+ + + YP +IE ++ G T FEI +NG+
Sbjct  25  IVVEYCKPCGFEATYLELASAVKEEYPGIEIESRL--GGTGAFEIEINGQ  72

>gb|EHB12030.1| hypothetical protein GW7_09601 [Heterocephalus glaber]
Length=152

 Score = 34.7 bits (78), Expect = 0.47, Method: Compositional matrix adjust.
 Identities = 16/50 (32%), Positives = 28/50 (56%), Gaps = 2/50 (4%)

Query  19  VLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVNGK  68
           +++EYC G+ T+ + + YP +IE ++ G T FEI +NG+
Sbjct  28  IVVEYCEPCGFEATYLELASAVKEQYPGIEIESRL--GGTGAFEIEINGQ  75

>gb|EAW60599.1| chromosome 17 open reading frame 37, isoform CRA_b [Homo sapiens]
Length=207

 Score = 34.7 bits (78), Expect = 0.61, Method: Composition-based stats.
 Identities = 16/50 (32%), Positives = 28/50 (56%), Gaps = 2/50 (4%)

Query  19   VLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVNGK  68
            +++EYC G+ T+ + + YP +IE ++ G T FEI +NG+
Sbjct  116  IVVEYCEPCGFEATYLELASAVKEQYPGIEIESRL--GGTGAFEIEINGQ  163

>ref|NP_919398.1| selenoprotein W, 2a [Danio rerio]
 gb|AAO65270.1| selenoprotein W2a [Danio rerio]
 gb|AAI62535.1| Selenoprotein W, 2a [Danio rerio]
 gb|AAI62538.1| Selenoprotein W, 2a [Danio rerio]
Length=95

 Score = 33.5 bits (75), Expect = 0.84, Method: Compositional matrix adjust.
 Identities = 17/50 (34%), Positives = 27/50 (54%), Gaps = 2/50 (4%)

Query  19  VLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVNGK  68
           + +EYCG UGY ++ K +T + +A + V G FEI +NG+
Sbjct  5   IKVEYCGGUGYEPRYQELKRVVTAEFTDADVSGFV--GRQGSFEIEINGQ  52

>ref|XP_002929400.1| PREDICTED: uncharacterized protein C17orf37 homolog [Ailuropoda melanoleuca]
 gb|EFB23389.1| hypothetical protein PANDA_019574 [Ailuropoda melanoleuca]
Length=115

 Score = 33.5 bits (75), Expect = 1.0, Method: Compositional matrix adjust.
 Identities = 16/50 (32%), Positives = 27/50 (54%), Gaps = 2/50 (4%)

Query  19  VLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVNGK  68
           +++EYC G+ T+ + + YP +IE + G T FEI +NG+
Sbjct  25  IVVEYCEPCGFEATYLELASAVKEQYPGIEIESRF--GGTGAFEIEINGQ  72

>ref|XP_003057240.1| selenoprotein W [Micromonas pusilla CCMP1545]
 gb|EEH58885.1| selenoprotein W [Micromonas pusilla CCMP1545]
Length=97

 Score = 33.1 bits (74), Expect = 1.4, Method: Compositional matrix adjust.
 Identities = 15/49 (31%), Positives = 26/49 (53%), Gaps = 0/49 (0%)

Query  19  VLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVNG  67
           V I YCG UGY ++ +N I +P+ + + P + FE+ ++G
Sbjct  4   VHITYCGGUGYAPKYKQVENAIKAKFPSVESSGEPTPTSSGAFEVVLDG  52

>ref|XP_003499637.1| PREDICTED: protein C17orf37-like [Cricetulus griseus]
 gb|EGW05299.1| Uncharacterized protein C17orf37 [Cricetulus griseus]
Length=115

 Score = 32.7 bits (73), Expect = 2.1, Method: Compositional matrix adjust.
 Identities = 16/50 (32%), Positives = 27/50 (54%), Gaps = 2/50 (4%)

Query  19  VLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVNGK  68
           +++EYC G+ T+ + + YP +IE + G T FEI +NG+
Sbjct  25  IVVEYCEPCGFEATYLELASAVKEEYPGIEIESRH--GGTGAFEIEINGQ  72

>ref|ZP_07080860.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC 33861]
 gb|EFK59075.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC 33861]
Length=1205

 Score = 32.3 bits (72), Expect = 4.5, Method: Composition-based stats.
 Identities = 15/45 (33%), Positives = 25/45 (56%), Gaps = 4/45 (9%)

Query  12   NKHAQFDVLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPG  56
            + + F +L EYC HNT++ KN ++YPN + E + + G
Sbjct  179  DPESSFYLLPEYC----EHNTYQINKNNFQVSYPNKKGELQYLAG  219

>ref|ZP_03967640.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC 33300]
 gb|EEI92608.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC 33300]
Length=1205

 Score = 32.3 bits (72), Expect = 4.5, Method: Composition-based stats.
 Identities = 15/45 (33%), Positives = 25/45 (56%), Gaps = 4/45 (9%)

Query  12   NKHAQFDVLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPG  56
            + + F +L EYC HNT++ KN ++YPN + E + + G
Sbjct  179  DPESSFYLLPEYC----EHNTYQINKNNFQVSYPNKKGELQYLAG  219

>emb|CBN76554.1| conserved unknown protein [Ectocarpus siliculosus]
Length=310

 Score = 32.3 bits (72), Expect = 5.2, Method: Composition-based stats.
 Identities = 17/70 (24%), Positives = 35/70 (50%), Gaps = 4/70 (6%)

Query  3    PEPVKNFQCNKHAQFDVLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYT----  58
            P V+ F+ + + V I+Y G G+ FE + +P+ I+++++ +
Sbjct  165  PMTVRKFKTMQDRRVPVSIKYSGGGGFKRYFEEIAIVLKRHFPDVLIDREIVEVSSTREE  224

Query  59   QCFEIYVNGK  68
            + FEI ++GK
Sbjct  225  EVFEIRIDGK  234

>ref|NP_840072.3| selenoprotein W [Danio rerio]
 ref|NP_001165152.1| selenoprotein W, 1 [Xenopus (Silurana) tropicalis]
 gb|AAO86696.1| selenoprotein W1 [Danio rerio]
 gb|AAI52098.1| Selenoprotein W, 1 [Danio rerio]
 gb|AAI60969.1| LOC100145402 protein [Xenopus (Silurana) tropicalis]
Length=86

 Score = 31.2 bits (69), Expect = 5.3, Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 25/51 (49%), Gaps = 1/51 (2%)

Query  19  VLIEYCGSUGYHNTFEFTKNCITMAYPNA-QIEKKVIPGYTQCFEIYVNGK  68
           V + YCG UGY F K + +PN +I + P T E+ VNGK
Sbjct  5   VHVVYCGGUGYRPKFIKLKTLLEDEFPNELEITGEGTPSTTGWLEVEVNGK  55

>sp|Q568W0.3|SELW_DANRE RecName: Full=Selenoprotein W; Short=SelW
 gb|AAH92686.2| Selenoprotein W, 1 [Danio rerio]
Length=86

 Score = 31.2 bits (69), Expect = 5.4, Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 25/51 (49%), Gaps = 1/51 (2%)

Query  19  VLIEYCGSUGYHNTFEFTKNCITMAYPNA-QIEKKVIPGYTQCFEIYVNGK  68
           V + YCG UGY F K + +PN +I + P T E+ VNGK
Sbjct  5   VHVVYCGGUGYRPKFIKLKTLLEDEFPNELEITGEGTPSTTGWLEVEVNGK  55

>ref|XP_968954.1| PREDICTED: similar to selenoprotein W2 [Tribolium castaneum]
 gb|EEZ98745.1| hypothetical protein TcasGA2_TC001303 [Tribolium castaneum]
Length=96

 Score = 30.8 bits (68), Expect = 8.3, Method: Compositional matrix adjust.
 Identities = 18/50 (36%), Positives = 25/50 (50%), Gaps = 2/50 (4%)

Query  18  DVLIEYCGSUGYHNTFEFTKNCITMAYPNAQIEKKVIPGYTQCFEIYVNG  67
           +V +E+CG+ GY FE I +P+ IE G FE+ VNG
Sbjct  4   EVDVEFCGTCGYFKKFEELAQHIKAKHPD--IELNGHEGRRATFEVKVNG  51

>ref|ZP_05395448.1| conserved hypothetical protein [Clostridium carboxidivorans P7]
 gb|EET84100.1| conserved hypothetical protein [Clostridium carboxidivorans P7]
Length=344

 Score = 31.6 bits (70), Expect = 8.4, Method: Composition-based stats.
 Identities = 21/65 (32%), Positives = 32/65 (49%), Gaps = 7/65 (11%)

Query  1    SIPEPVKNFQCNKHAQFDVLIEYCGSUGYHNTFEFTKNCITMAYPN---AQIEKKVIPG-  56
            S+ E V + CN Q +++EY HN + K I + +P ++IE+ V P
Sbjct  171  SLSETVFDLSCNTLMQMQLILEYNHRNEEHN---YAKLFIVIKFPEEYLSKIEEMVYPNW  227

Query  57   YTQCF  61
            YT CF
Sbjct  228  YTACF  232

>ref|YP_001976019.1| hypothetical protein WPa_1287 [Wolbachia endosymbiont of Culex quinquefasciatus Pel]
 ref|ZP_03334871.1| hypothetical protein C1A_836 [Wolbachia endosymbiont of Culex quinquefasciatus JHB]
 emb|CAQ55395.1| hypothetical protein WP1287 [Wolbachia endosymbiont of Culex quinquefasciatus Pel]
 gb|EEB55814.1| hypothetical protein C1A_836 [Wolbachia endosymbiont of Culex quinquefasciatus JHB]
Length=274

 Score = 31.6 bits (70), Expect = 9.5, Method: Compositional matrix adjust.
 Identities = 14/34 (41%), Positives = 21/34 (62%), Gaps = 4/34 (12%)

Query  1    SIPEPVKNFQCNKHAQFDVLIEY----CGSUGYH  30
            +IP+ +KN++ N FDV+IEY C G+H
Sbjct  207  TIPKELKNYKANDKNLFDVIIEYFQKFCEKFGFH  240

  Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
    Posted date:  Jan 11, 2012  4:12 PM
  Number of letters in database: 1,516,989,569
  Number of sequences in database:  16,927,445

Lambda      K        H
   0.322    0.138    0.444
Gapped
Lambda      K        H
   0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 16927445
Number of Hits to DB: 84489242
Number of extensions: 2630055
Number of successful extensions: 3523
Number of sequences better than 100: 11
Number of HSP's better than 100 without gapping: 0
Number of HSP's gapped: 3522
Number of HSP's successfully gapped: 11
Length of query: 68
Length of database: 5811956865
Length adjustment: 40
Effective length of query: 28
Effective length of database: 5134859065
Effective search space: 143776053820
Effective search space used: 143776053820
T: 11
A: 40
X1: 16 (7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 68 (30.8 bits)
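
The statistics footer above carries enough information to re-derive two of the reported numbers. The Python sketch below is illustrative only and is not part of the BLAST output. It assumes the standard Karlin-Altschul conversion from a raw score S to a bit score, S' = (Lambda * S - ln K) / ln 2, applied with the gapped Lambda and K, and it assumes the effective lengths are obtained by subtracting the length adjustment once from the query and once per sequence from the database. The function and constant names are hypothetical. It does not attempt to reproduce the Expect values, whose Method field indicates composition-based adjustments that this sketch does not model.

```python
import math

# Values copied from the statistics footer of this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
QUERY_LENGTH = 68
DATABASE_LENGTH = 5_811_956_865
NUM_SEQUENCES = 16_927_445
LENGTH_ADJUSTMENT = 40


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def effective_search_space(
    query_len=QUERY_LENGTH,
    db_len=DATABASE_LENGTH,
    num_seqs=NUM_SEQUENCES,
    adjustment=LENGTH_ADJUSTMENT,
):
    """Product of effective query length and effective database length.

    Assumption: the adjustment is removed once from the query and once
    from every database sequence.
    """
    effective_query = query_len - adjustment
    effective_db = db_len - num_seqs * adjustment
    return effective_query * effective_db


if __name__ == "__main__":
    # Raw scores appear in parentheses on the "Score =" lines, e.g. "(89)".
    for raw in (89, 87, 68):
        print(raw, round(bit_score(raw), 1))
    print(effective_search_space())
```

Under these assumptions, raw scores 89 and 68 map to the 38.9 and 30.8 bits shown in the report, and the effective search space comes out to the reported 143776053820.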